[Replication of Streptomyces plasmids: the DNA nucleotide sequence of plasmid pSB 24.2].
Bolotin, A P; Sorokin, A V; Aleksandrov, N N; Danilenko, V N; Kozlov, Iu I
1985-11-01
The nucleotide sequence of DNA in plasmid pSB 24.2, a natural deletion derivative of plasmid pSB 24.1 isolated from S. cyanogenus was studied. The plasmid amounted by its size to 3706 nucleotide pairs. The G-C composition was equal to 73 per cent. The analysis of the DNA structure in plasmid pSB 24.2 revealed the protein-encoding sequence of DNA, the continuity of which was significant for replication of the plasmid containing more than 1300 nucleotide pairs. The analysis also revealed two A-T-rich areas of DNA, the G-C composition of which was less than 55 per cent and a DNA area with a branched pin structure. The results may be of value in investigation of plasmid replication in actinomycetes and experimental cloning of DNA with this plasmid as a vector.
Nucleotide composition analysis of tRNA from leukemia patient cell samples and human cell lines.
Agris, P F
1975-01-01
A technique developed for analysis of less than microgram quantities of tRNA has been applied to the study of human leukemia. Leucocytes from peripheal blood and bone marrow samples of six, untreated leukemia patients and cells of five different established human cell lines were maintained for 18 hours in media containing (32P)-phosphate. Incorporation of radioactive phosphate into the cells from the patient samples was slightly less than that of the cell lines. Likewise, incorporation of (32P)-phosphate into the tRNA of the patient samples (approximately 5 x 106 DPM/mug tRNA) was also less then that incorporated into the tRNA of the cell lines. The major and minor nucleotide compositions of the unfractionated tRNA preparations from each patient sample and each cell line were determined and compared. Similarities and differences in the major and minor nucleotide compositions of the tRNA preparations are discussed with reference to types of leukemia and the importance of patient sample analysis versus analysis of cultured human cells. PMID:1057159
Dietary nitrogen alters codon bias and genome composition in parasitic microorganisms.
Seward, Emily A; Kelly, Steven
2016-11-15
Genomes are composed of long strings of nucleotide monomers (A, C, G and T) that are either scavenged from the organism's environment or built from metabolic precursors. The biosynthesis of each nucleotide differs in atomic requirements with different nucleotides requiring different quantities of nitrogen atoms. However, the impact of the relative availability of dietary nitrogen on genome composition and codon bias is poorly understood. Here we show that differential nitrogen availability, due to differences in environment and dietary inputs, is a major determinant of genome nucleotide composition and synonymous codon use in both bacterial and eukaryotic microorganisms. Specifically, low nitrogen availability species use nucleotides that require fewer nitrogen atoms to encode the same genes compared to high nitrogen availability species. Furthermore, we provide a novel selection-mutation framework for the evaluation of the impact of metabolism on gene sequence evolution and show that it is possible to predict the metabolic inputs of related organisms from an analysis of the raw nucleotide sequence of their genes. Taken together, these results reveal a previously hidden relationship between cellular metabolism and genome evolution and provide new insight into how genome sequence evolution can be influenced by adaptation to different diets and environments.
Freeman, Lindsay M; Pang, Lin; Fainman, Yeshaiahu
2018-05-09
The analysis of DNA has led to revolutionary advancements in the fields of medical diagnostics, genomics, prenatal screening, and forensic science, with the global DNA testing market expected to reach revenues of USD 10.04 billion per year by 2020. However, the current methods for DNA analysis remain dependent on the necessity for fluorophores or conjugated proteins, leading to high costs associated with consumable materials and manual labor. Here, we demonstrate a potential label-free DNA composition detection method using surface-enhanced Raman spectroscopy (SERS) in which we identify the composition of cytosine and adenine within single strands of DNA. This approach depends on the fact that there is one phosphate backbone per nucleotide, which we use as a reference to compensate for systematic measurement variations. We utilize plasmonic nanomaterials with random Raman sampling to perform label-free detection of the nucleotide composition within DNA strands, generating a calibration curve from standard samples of DNA and demonstrating the capability of resolving the nucleotide composition. The work represents an innovative way for detection of the DNA composition within DNA strands without the necessity of attached labels, offering a highly sensitive and reproducible method that factors in random sampling to minimize error.
Lee, Sheila; McMullen, D.; Brown, G. L.; Stokes, A. R.
1965-01-01
1. A theoretical analysis of the errors in multicomponent spectrophotometric analysis of nucleoside mixtures, by a least-squares procedure, has been made to obtain an expression for the error coefficient, relating the error in calculated concentration to the error in extinction measurements. 2. The error coefficients, which depend only on the `library' of spectra used to fit the experimental curves, have been computed for a number of `libraries' containing the following nucleosides found in s-RNA: adenosine, guanosine, cytidine, uridine, 5-ribosyluracil, 7-methylguanosine, 6-dimethylaminopurine riboside, 6-methylaminopurine riboside and thymine riboside. 3. The error coefficients have been used to determine the best conditions for maximum accuracy in the determination of the compositions of nucleoside mixtures. 4. Experimental determinations of the compositions of nucleoside mixtures have been made and the errors found to be consistent with those predicted by the theoretical analysis. 5. It has been demonstrated that, with certain precautions, the multicomponent spectrophotometric method described is suitable as a basis for automatic nucleotide-composition analysis of oligonucleotides containing nine nucleotides. Used in conjunction with continuous chromatography and flow chemical techniques, this method can be applied to the study of the sequence of s-RNA. PMID:14346087
Interactive computer programs for the graphic analysis of nucleotide sequence data.
Luckow, V A; Littlewood, R K; Rownd, R H
1984-01-01
A group of interactive computer programs have been developed which aid in the collection and graphical analysis of nucleotide and protein sequence data. The programs perform the following basic functions: a) enter, edit, list, and rearrange sequence data; b) permit automatic entry of nucleotide sequence data directly from an autoradiograph into the computer; c) search for restriction sites or other specified patterns and plot a linear or circular restriction map, or print their locations; d) plot base composition; e) analyze homology between sequences by plotting a two-dimensional graphic matrix; and f) aid in plotting predicted secondary structures of RNA molecules. PMID:6546437
Prokaryotic Nucleotide Composition Is Shaped by Both Phylogeny and the Environment
Reichenberger, Erin R.; Rosen, Gail; Hershberg, Uri; ...
2015-04-09
Here, the causes of the great variation in nucleotide composition of prokaryotic genomes have long been disputed. Here, we use extensive metagenomic and whole-genome data to demonstrate that both phylogeny and the environment shape prokaryotic nucleotide content. We show that across environments, various phyla are characterized by different mean guanine and cytosine (GC) values as well as by the extent of variation on that mean value. At the same time, we show that GC-content varies greatly as a function of environment, in a manner that cannot be entirely explained by disparities in phylogenetic composition. We find environmentally driven differences inmore » nucleotide content not only between highly diverged environments (e.g., soil, vs. aquatic vs. human gut) but also within a single type of environment. More specifically, we demonstrate that some human guts are associated with a microbiome that is consistently more GC-rich across phyla, whereas others are associated with a more AT-rich microbiome. These differences appear to be driven both by variations in phylogenetic composition and by environmental differences—which are independent of these phylogenetic composition differences. Combined, our results demonstrate that both phylogeny and the environment significantly affect nucleotide composition and that the environmental differences affecting nucleotide composition are far subtler than previously appreciated.« less
Prokaryotic nucleotide composition is shaped by both phylogeny and the environment.
Reichenberger, Erin R; Rosen, Gail; Hershberg, Uri; Hershberg, Ruth
2015-04-09
The causes of the great variation in nucleotide composition of prokaryotic genomes have long been disputed. Here, we use extensive metagenomic and whole-genome data to demonstrate that both phylogeny and the environment shape prokaryotic nucleotide content. We show that across environments, various phyla are characterized by different mean guanine and cytosine (GC) values as well as by the extent of variation on that mean value. At the same time, we show that GC-content varies greatly as a function of environment, in a manner that cannot be entirely explained by disparities in phylogenetic composition. We find environmentally driven differences in nucleotide content not only between highly diverged environments (e.g., soil, vs. aquatic vs. human gut) but also within a single type of environment. More specifically, we demonstrate that some human guts are associated with a microbiome that is consistently more GC-rich across phyla, whereas others are associated with a more AT-rich microbiome. These differences appear to be driven both by variations in phylogenetic composition and by environmental differences-which are independent of these phylogenetic composition differences. Combined, our results demonstrate that both phylogeny and the environment significantly affect nucleotide composition and that the environmental differences affecting nucleotide composition are far subtler than previously appreciated. © The Author(s) 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Ito, Jun; Herter, Thomas; Baidoo, Edward E K; Lao, Jeemeng; Vega-Sánchez, Miguel E; Michelle Smith-Moritz, A; Adams, Paul D; Keasling, Jay D; Usadel, Björn; Petzold, Christopher J; Heazlewood, Joshua L
2014-03-01
Understanding the intricate metabolic processes involved in plant cell wall biosynthesis is limited by difficulties in performing sensitive quantification of many involved compounds. Hydrophilic interaction liquid chromatography is a useful technique for the analysis of hydrophilic metabolites from complex biological extracts and forms the basis of this method to quantify plant cell wall precursors. A zwitterionic silica-based stationary phase has been used to separate hydrophilic nucleotide sugars involved in cell wall biosynthesis from milligram amounts of leaf tissue. A tandem mass spectrometry operating in selected reaction monitoring mode was used to quantify nucleotide sugars. This method was highly repeatable and quantified 12 nucleotide sugars at low femtomole quantities, with linear responses up to four orders of magnitude to several 100pmol. The method was also successfully applied to the analysis of purified leaf extracts from two model plant species with variations in their cell wall sugar compositions and indicated significant differences in the levels of 6 out of 12 nucleotide sugars. The plant nucleotide sugar extraction procedure was demonstrated to have good recovery rates with minimal matrix effects. The approach results in a significant improvement in sensitivity when applied to plant samples over currently employed techniques. Copyright © 2013 Elsevier Inc. All rights reserved.
de la Bastide, Paul Y; Leung, Wai Lam; Hintz, William E
2015-01-01
The ITS region of the rDNA gene was compared for Saprolegnia spp. in order to improve our understanding of nucleotide sequence variability within and between species of this genus, determine species composition in Canadian fin fish aquaculture facilities, and to assess the utility of ITS sequence variability in genetic marker development. From a collection of more than 400 field isolates, ITS region nucleotide sequences were studied and it was determined that there was sufficient consistent inter-specific variation to support the designation of species identity based on ITS sequence data. This non-subjective approach to species identification does not rely upon transient morphological features. Phylogenetic analyses comparing our ITS sequences and species designations with data from previous studies generally supported the clade scheme of Diéguez-Uribeondo et al. (2007) and found agreement with the molecular taxonomic cluster system of Sandoval-Sierra et al. (2014). Our Canadian ITS sequence collection will thus contribute to the public database and assist the clarification of Saprolegnia spp. taxonomy. The analysis of ITS region sequence variability facilitated genus- and species-level identification of unknown samples from aquaculture facilities and provided useful information on species composition. A unique ITS-RFLP for the identification of S. parasitica was also described. Copyright © 2014 The British Mycological Society. Published by Elsevier Ltd. All rights reserved.
A detailed analysis of codon usage patterns and influencing factors in Zika virus.
Singh, Niraj K; Tyagi, Anuj
2017-07-01
Recent outbreaks of Zika virus (ZIKV) in Africa, Latin America, Europe, and Southeast Asia have resulted in serious health concerns. To understand more about evolution and transmission of ZIKV, detailed codon usage analysis was performed for all available strains. A high effective number of codons (ENC) value indicated the presence of low codon usage bias in ZIKV. The effect of mutational pressure on codon usage bias was confirmed by significant correlations between nucleotide compositions at third codon positions and ENCs. Correlation analysis between Gravy values, Aroma values and nucleotide compositions at third codon positions also indicated some influence of natural selection. However, the low codon adaptation index (CAI) value of ZIKV with reference to human and mosquito indicated poor adaptation of ZIKV codon usage towards its hosts, signifying that natural selection has a weaker influence than mutational pressure. Additionally, relative dinucleotide frequencies, geographical distribution, and evolutionary processes also influenced the codon usage pattern to some extent.
Altier, Daniel J.; Dahlbacka, Glen; Ellanskaya, legal representative, Natalia; Herrmann, Rafael; Hunter-Cevera, Jennie; McCutchen, Billy F.; Presnail, James K.; Rice, Janet A.; Schepers, Eric; Simmons, Carl R.; Torok, Tamas; Yalpani, Nasser; Ellanskaya, deceased, Irina
2007-12-11
Compositions and methods for protecting a plant from a pathogen, particularly a fungal pathogen, are provided. Compositions include novel amino acid sequences, and variants and fragments thereof, for antipathogenic polypeptides that were isolated from microbial fermentation broths. Nucleic acid molecules comprising nucleotide sequences that encode the antipathogenic polypeptides of the invention are also provided. A method for inducing pathogen resistance in a plant using the nucleotide sequences disclosed herein is further provided. The method comprises introducing into a plant an expression cassette comprising a promoter operably linked to a nucleotide sequence that encodes an antipathogenic polypeptide of the invention. Compositions comprising an antipathogenic polypeptide or a transformed microorganism comprising a nucleic acid of the invention in combination with a carrier and methods of using these compositions to protect a plant from a pathogen are further provided. Transformed plants, plant cells, seeds, and microorganisms comprising a nucleotide sequence that encodes an antipathogenic polypeptide of the invention, or variant or fragment thereof, are also disclosed.
Altier, Daniel J.; Dahlbacka, Glen; Elleskaya, Irina; Ellanskaya, legal representative; Natalia; Herrmann, Rafael; Hunter-Cevera, Jennie; McCutchen, Billy F.; Presnail, James K.; Rice, Janet A.; Schepers, Eric; Simmons, Carl R.; Torok, Tamas; Yalpani, Nasser
2010-08-10
Compositions and methods for protecting a plant from a pathogen, particularly a fungal pathogen, are provided. Compositions include novel amino acid sequences, and variants and fragments thereof, for antipathogenic polypeptides that were isolated from microbial fermentation broths. Nucleic acid molecules comprising nucleotide sequences that encode the antipathogenic polypeptides of the invention are also provided. A method for inducing pathogen resistance in a plant using the nucleotide sequences disclosed herein is further provided. The method comprises introducing into a plant an expression cassette comprising a promoter operably linked to a nucleotide sequence that encodes an antipathogenic polypeptide of the invention. Compositions comprising an antipathogenic polypeptide or a transformed microorganism comprising a nucleic acid of the invention in combination with a carrier and methods of using these compositions to protect a plant from a pathogen are further provided. Transformed plants, plant cells, seeds, and microorganisms comprising a nucleotide sequence that encodes an antipathogenic polypeptide of the invention, or variant or fragment thereof, are also disclosed.
Altier, Daniel J [Waukee, IA; Dahlbacka, Glen [Oakland, CA; Elleskaya, Irina [Kyiv, UA; Ellanskaya, legal representative, Natalia; Herrmann, Rafael [Wilmington, DE; Hunter-Cevera, Jennie [Elliott City, MD; McCutchen, Billy F [College Station, IA; Presnail, James K [Avondale, PA; Rice, Janet A [Wilmington, DE; Schepers, Eric [Port Deposit, MD; Simmons, Carl R [Des Moines, IA; Torok, Tamas [Richmond, CA; Yalpani, Nasser [Johnston, IA
2011-04-12
Compositions and methods for protecting a plant from a pathogen, particularly a fungal pathogen, are provided. Compositions include novel amino acid sequences, and variants and fragments thereof, for antipathogenic polypeptides that were isolated from microbial fermentation broths. Nucleic acid molecules comprising nucleotide sequences that encode the antipathogenic polypeptides of the invention are also provided. A method for inducing pathogen resistance in a plant using the nucleotide sequences disclosed herein is further provided. The method comprises introducing into a plant an expression cassette comprising a promoter operably linked to a nucleotide sequence that encodes an antipathogenic polypeptide of the invention. Compositions comprising an antipathogenic polypeptide or a transformed microorganism comprising a nucleic acid of the invention in combination with a carrier and methods of using these compositions to protect a plant from a pathogen are further provided. Transformed plants, plant cells, seeds, and microorganisms comprising a nucleotide sequence that encodes an antipathogenic polypeptide of the invention, or variant or fragment thereof, are also disclosed.
Altier, Daniel J [Granger, IA; Dahlbacka, Glen [Oakland, CA; Ellanskaya, Irina [Kyiv, UA; Ellanskaya, legal representative, Natalia; Herrmann, Rafael [Wilmington, DE; Hunter-Cevera, Jennie [Elliott City, MD; McCutchen, Billy F [College Station, TX; Presnail, James K [Avondale, PA; Rice, Janet A [Wilmington, DE; Schepers, Eric [Port Deposit, MD; Simmons, Carl R [Des Moines, IA; Torok, Tamas [Richmond, CA; Yalpani, Nasser [Johnston, IA
2012-04-03
Compositions and methods for protecting a plant from a pathogen, particularly a fungal pathogen, are provided. Compositions include novel amino acid sequences, and variants and fragments thereof, for antipathogenic polypeptides that were isolated from microbial fermentation broths. Nucleic acid molecules comprising nucleotide sequences that encode the antipathogenic polypeptides of the invention are also provided. A method for inducing pathogen resistance in a plant using the nucleotide sequences disclosed herein is further provided. The method comprises introducing into a plant an expression cassette comprising a promoter operably linked to a nucleotide sequence that encodes an antipathogenic polypeptide of the invention. Compositions comprising an antipathogenic polypeptide or a transformed microorganism comprising a nucleic acid of the invention in combination with a carrier and methods of using these compositions to protect a plant from a pathogen are further provided. Transformed plants, plant cells, seeds, and microorganisms comprising a nucleotide sequence that encodes an antipathogenic polypeptide of the invention, or variant or fragment thereof, are also disclosed.
A comprehensive bioinformatic analysis of hepatitis D virus full-length genomes.
Delfino, C M; Cerrudo, C S; Biglione, M; Oubiña, J R; Ghiringhelli, P D; Mathet, V L
2018-02-06
In association with hepatitis B virus (HBV), hepatitis delta virus (HDV) is a subviral agent that may promote severe acute and chronic forms of liver disease. Based on the percentage of nucleotide identity of the genome, HDV was initially classified into three genotypes. However, since 2006, the original classification has been further expanded into eight clades/genotypes. The intergenotype divergence may be as high as 35%-40% over the entire RNA genome, whereas sequence heterogeneity among the isolates of a given genotype is <20%; furthermore, HDV recombinants have been clearly demonstrated. The genetic diversity of HDV is related to the geographic origin of the isolates. This study shows the first comprehensive bioinformatic analysis of the complete available set of HDV sequences, using both nucleotide and protein phylogenies (based on an evolutionary model selection, gamma distribution estimation, tree inference and phylogenetic distance estimation), protein composition analysis and comparison (based on the presence of invariant residues, molecular signatures, amino acid frequencies and mono- and di-amino acid compositional distances), as well as amino acid changes in sequence evolution. Taking into account the congruent and consistent results of both nucleotide and amino acid analyses of GenBank available sequences (recorded as of January, 2017), we propose that the eight hepatitis D virus genotypes may be grouped into three large genogroups fully supported by their shared characteristics. © 2018 John Wiley & Sons Ltd.
NASA Technical Reports Server (NTRS)
Gorbunova, A. V.
1980-01-01
An investigation into the effect of hypokinesia on the ribonucleic acid (RNA) content, the nucleotide composition, and dynamics of protein content in the motoneuron of the rat spinal cord anterior horns is described. Methodology and findings are presented. The study results showed that the nucleotide composition of the total cellular RNA at all the studied periods of hypokinesia remained unchanged and is characteristic for the cytoplasmic, high polymer ribosomal RNA. This means that with a change in the functional state of the neuron the newly formed RNA of the nerve cell has the same composition of bases as the original RNA that belongs to the ribosomal type.
Kwon, Andrew T.; Chou, Alice Yi; Arenillas, David J.; Wasserman, Wyeth W.
2011-01-01
We performed a genome-wide scan for muscle-specific cis-regulatory modules (CRMs) using three computational prediction programs. Based on the predictions, 339 candidate CRMs were tested in cell culture with NIH3T3 fibroblasts and C2C12 myoblasts for capacity to direct selective reporter gene expression to differentiated C2C12 myotubes. A subset of 19 CRMs validated as functional in the assay. The rate of predictive success reveals striking limitations of computational regulatory sequence analysis methods for CRM discovery. Motif-based methods performed no better than predictions based only on sequence conservation. Analysis of the properties of the functional sequences relative to inactive sequences identifies nucleotide sequence composition can be an important characteristic to incorporate in future methods for improved predictive specificity. Muscle-related TFBSs predicted within the functional sequences display greater sequence conservation than non-TFBS flanking regions. Comparison with recent MyoD and histone modification ChIP-Seq data supports the validity of the functional regions. PMID:22144875
Ishikawa, Sohta A; Inagaki, Yuji; Hashimoto, Tetsuo
2012-01-01
In phylogenetic analyses of nucleotide sequences, 'homogeneous' substitution models, which assume the stationarity of base composition across a tree, are widely used, albeit individual sequences may bear distinctive base frequencies. In the worst-case scenario, a homogeneous model-based analysis can yield an artifactual union of two distantly related sequences that achieved similar base frequencies in parallel. Such potential difficulty can be countered by two approaches, 'RY-coding' and 'non-homogeneous' models. The former approach converts four bases into purine and pyrimidine to normalize base frequencies across a tree, while the heterogeneity in base frequency is explicitly incorporated in the latter approach. The two approaches have been applied to real-world sequence data; however, their basic properties have not been fully examined by pioneering simulation studies. Here, we assessed the performances of the maximum-likelihood analyses incorporating RY-coding and a non-homogeneous model (RY-coding and non-homogeneous analyses) on simulated data with parallel convergence to similar base composition. Both RY-coding and non-homogeneous analyses showed superior performances compared with homogeneous model-based analyses. Curiously, the performance of RY-coding analysis appeared to be significantly affected by a setting of the substitution process for sequence simulation relative to that of non-homogeneous analysis. The performance of a non-homogeneous analysis was also validated by analyzing a real-world sequence data set with significant base heterogeneity.
NASA Astrophysics Data System (ADS)
Harms, J.
1992-03-01
Growth rate expressed as dry weight, elemetnal composition (C, N, H), protein content and nucleotide composition (ATP, ADP, AMP, CTP, GTP and UTP) as well as adenosine were measured in laboratory cultured Hyas araneus larvae fed two different diets. One group was fed freshly hatched Artemia sp. nauplii, the other the diatom Odontella (Biddulphia) sinensis. Growth rate was reduced in the O. sinensis-fed group, reaching 20 to 50% of the growth rate of Artemia-fed larvae. In all cases, some further development to the next instar occurred when larvae were fed O. sinensis, although at reduced levels compared to Artemia-fed larvae. The adenylic energy charge was quite similar for the two nutritional conditions tested and therefore does not reflect the reduced growth rate in O. sinensis-fed larvae. The individual nucleotide content was clearly reduced in O. sinensis-fed larvae, reflecting the nutritional conditions already during early developmental periods. These reduced amount of nucleotides in O. sinensis-fed larvae were most obvious when adenylic nucleotide contents were pooled. Pooled adenylic nucleotides were found to be correlated with the individual content of carbon and protein, showing significant differences at both nutritional conditions tested.
Chiusano, M L; D'Onofrio, G; Alvarez-Valin, F; Jabbari, K; Colonna, G; Bernardi, G
1999-09-30
We investigated the relationships between the nucleotide substitution rates and the predicted secondary structures in the three states representation (alpha-helix, beta-sheet, and coil). The analysis was carried out on 34 alignments, each of which comprised sequences belonging to at least four different mammalian orders. The rates of synonymous substitution were found to be significantly different in regions predicted to be alpha-helix, beta-sheet, or coil. Likewise, the nonsynonymous rates also differ, although expectedly at a lower extent, in the three types of secondary structure, suggesting that different selective constraints associated with the different structures are affecting in a similar way the synonymous and nonsynonymous rates. Moreover, the base composition of the third codon positions is different in coding sequence regions corresponding to different secondary structures of proteins.
The complete mitochondrial genome of the stomatopod crustacean Squilla mantis
Cook, Charles E
2005-01-01
Background Animal mitochondrial genomes are physically separate from the much larger nuclear genomes and have proven useful both for phylogenetic studies and for understanding genome evolution. Within the phylum Arthropoda the subphylum Crustacea includes over 50,000 named species with immense variation in body plans and habitats, yet only 23 complete mitochondrial genomes are available from this subphylum. Results I describe here the complete mitochondrial genome of the crustacean Squilla mantis (Crustacea: Malacostraca: Stomatopoda). This 15994-nucleotide genome, the first described from a hoplocarid, contains the standard complement of 13 protein-coding genes, 22 transfer RNA genes, two ribosomal RNA genes, and a non-coding AT-rich region that is found in most other metazoans. The gene order is identical to that considered ancestral for hexapods and crustaceans. The 70% AT base composition is within the range described for other arthropods. A single unusual feature of the genome is a 230 nucleotide non-coding region between a serine transfer RNA and the nad1 gene, which has no apparent function. I also compare gene order, nucleotide composition, and codon usage of the S. mantis genome and eight other malacostracan crustaceans. A translocation of the histidine transfer RNA gene is shared by three taxa in the order Decapoda, infraorder Brachyura; Callinectes sapidus, Portunus trituberculatus and Pseudocarcinus gigas. This translocation may be diagnostic for the Brachyura. For all nine taxa nucleotide composition is biased towards AT-richness, as expected for arthropods, and is within the range reported for other arthropods. Codon usage is biased, and much of this bias is probably due to the skew in nucleotide composition towards AT-richness. Conclusion The mitochondrial genome of Squilla mantis contains one unusual feature, a 230 base pair non-coding region has so far not been described in any other malacostracan. Comparisons with other Malacostraca show that all nine genomes, like most other mitochondrial genomes, share a bias toward AT-richness and a related bias in codon usage. The nine malacostracans included in this analysis are not representative of the diversity of the class Malacostraca, and additional malacostracan sequences would surely reveal other unusual genomic features that could be useful in understanding mitochondrial evolution in this taxon. PMID:16091132
Ratkiewicz, A; Galasinski, W
1976-01-01
The characteristics of the ribonucleic acids of Guerin tumor was the subject of this work. The effect of tumor development on the structure of the ribonucleic acids in the liver of tumor bearing rats was studied. Some differences of nucleotide compositions in RNAs isolated from subcellular fractions of liver of control and tumor bearing rats and of cancer tissue were observed. The nucleotide compositions of cancer nuclear RNA is distinctly different from liver RNA. The changes in primary structure of liver RNAs due by development of tumor in rats may be result of metabolic peculiarities of these RNAs.
USDA-ARS?s Scientific Manuscript database
Plant cell-wall polysaccharide biosynthesis requires nucleotide-activated sugars. The prominent grass cell wall sugars, glucose (Glc), xylose (Xyl), and arabinose (Ara), are biosynthetically related via the UDP-sugar interconversion pathway. RNA-seq analysis of Brachypodium distachyon UDP-sugar inte...
Combined hairpin-antisense compositions and methods for modulating expression
Shanklin, John; Nguyen, Tam
2014-08-05
A nucleotide construct comprising a nucleotide sequence that forms a stem and a loop, wherein the loop comprises a nucleotide sequence that modulates expression of a target, wherein the stem comprises a nucleotide sequence that modulates expression of a target, and wherein the target modulated by the nucleotide sequence in the loop and the target modulated by the nucleotide sequence in the stem may be the same or different. Vectors, methods of regulating target expression, methods of providing a cell, and methods of treating conditions comprising the nucleotide sequence are also disclosed.
Combined hairpin-antisense compositions and methods for modulating expression
Shanklin, John; Nguyen, Tam Huu
2015-11-24
A nucleotide construct comprising a nucleotide sequence that forms a stem and a loop, wherein the loop comprises a nucleotide sequence that modulates expression of a target, wherein the stem comprises a nucleotide sequence that modulates expression of a target, and wherein the target modulated by the nucleotide sequence in the loop and the target modulated by the nucleotide sequence in the stem may be the same or different. Vectors, methods of regulating target expression, methods of providing a cell, and methods of treating conditions comprising the nucleotide sequence are also disclosed.
Taoka, Masato; Yamauchi, Yoshio; Nobe, Yuko; Masaki, Shunpei; Nakayama, Hiroshi; Ishikawa, Hideaki; Takahashi, Nobuhiro; Isobe, Toshiaki
2009-11-01
We describe here a mass spectrometry (MS)-based analytical platform of RNA, which combines direct nano-flow reversed-phase liquid chromatography (RPLC) on a spray tip column and a high-resolution LTQ-Orbitrap mass spectrometer. Operating RPLC under a very low flow rate with volatile solvents and MS in the negative mode, we could estimate highly accurate mass values sufficient to predict the nucleotide composition of a approximately 21-nucleotide small interfering RNA, detect post-transcriptional modifications in yeast tRNA, and perform collision-induced dissociation/tandem MS-based structural analysis of nucleolytic fragments of RNA at a sub-femtomole level. Importantly, the method allowed the identification and chemical analysis of small RNAs in ribonucleoprotein (RNP) complex, such as the pre-spliceosomal RNP complex, which was pulled down from cultured cells with a tagged protein cofactor as bait. We have recently developed a unique genome-oriented database search engine, Ariadne, which allows tandem MS-based identification of RNAs in biological samples. Thus, the method presented here has broad potential for automated analysis of RNA; it complements conventional molecular biology-based techniques and is particularly suited for simultaneous analysis of the composition, structure, interaction, and dynamics of RNA and protein components in various cellular RNP complexes.
Nucleotide Selectivity in Abiotic RNA Polymerization Reactions.
Coari, Kristin M; Martin, Rebecca C; Jain, Kopal; McGown, Linda B
2017-09-01
In order to establish an RNA world on early Earth, the nucleotides must form polymers through chemical rather than biochemical reactions. The polymerization products must be long enough to perform catalytic functions, including self-replication, and to preserve genetic information. These functions depend not only on the length of the polymers, but also on their sequences. To date, studies of abiotic RNA polymerization generally have focused on routes to polymerization of a single nucleotide and lengths of the homopolymer products. Less work has been done the selectivity of the reaction toward incorporation of some nucleotides over others in nucleotide mixtures. Such information is an essential step toward understanding the chemical evolution of RNA. To address this question, in the present work RNA polymerization reactions were performed in the presence of montmorillonite clay catalyst. The nucleotides included the monophosphates of adenosine, cytosine, guanosine, uridine and inosine. Experiments included reactions of mixtures of an imidazole-activated nucleotide (ImpX) with one or more unactivated nucleotides (XMP), of two or more ImpX, and of XMP that were activated in situ in the polymerization reaction itself. The reaction products were analyzed using matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF MS) to identify the lengths and nucleotide compositions of the polymerization products. The results show that the extent of polymerization, the degree of heteropolymerization vs. homopolymerization, and the composition of the polymeric products all vary among the different nucleotides and depend upon which nucleotides and how many different nucleotides are present in the mixture.
Nucleotide Selectivity in Abiotic RNA Polymerization Reactions
NASA Astrophysics Data System (ADS)
Coari, Kristin M.; Martin, Rebecca C.; Jain, Kopal; McGown, Linda B.
2017-09-01
In order to establish an RNA world on early Earth, the nucleotides must form polymers through chemical rather than biochemical reactions. The polymerization products must be long enough to perform catalytic functions, including self-replication, and to preserve genetic information. These functions depend not only on the length of the polymers, but also on their sequences. To date, studies of abiotic RNA polymerization generally have focused on routes to polymerization of a single nucleotide and lengths of the homopolymer products. Less work has been done the selectivity of the reaction toward incorporation of some nucleotides over others in nucleotide mixtures. Such information is an essential step toward understanding the chemical evolution of RNA. To address this question, in the present work RNA polymerization reactions were performed in the presence of montmorillonite clay catalyst. The nucleotides included the monophosphates of adenosine, cytosine, guanosine, uridine and inosine. Experiments included reactions of mixtures of an imidazole-activated nucleotide (ImpX) with one or more unactivated nucleotides (XMP), of two or more ImpX, and of XMP that were activated in situ in the polymerization reaction itself. The reaction products were analyzed using matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF MS) to identify the lengths and nucleotide compositions of the polymerization products. The results show that the extent of polymerization, the degree of heteropolymerization vs. homopolymerization, and the composition of the polymeric products all vary among the different nucleotides and depend upon which nucleotides and how many different nucleotides are present in the mixture.
Quantitative Understanding of SHAPE Mechanism from RNA Structure and Dynamics Analysis.
Hurst, Travis; Xu, Xiaojun; Zhao, Peinan; Chen, Shi-Jie
2018-05-10
The selective 2'-hydroxyl acylation analyzed by primer extension (SHAPE) method probes RNA local structural and dynamic information at single nucleotide resolution. To gain quantitative insights into the relationship between nucleotide flexibility, RNA 3D structure, and SHAPE reactivity, we develop a 3D Structure-SHAPE Relationship model (3DSSR) to rebuild SHAPE profiles from 3D structures. The model starts from RNA structures and combines nucleotide interaction strength and conformational propensity, ligand (SHAPE reagent) accessibility, and base-pairing pattern through a composite function to quantify the correlation between SHAPE reactivity and nucleotide conformational stability. The 3DSSR model shows the relationship between SHAPE reactivity and RNA structure and energetics. Comparisons between the 3DSSR-predicted SHAPE profile and the experimental SHAPE data show correlation, suggesting that the extracted analytical function may have captured the key factors that determine the SHAPE reactivity profile. Furthermore, the theory offers an effective method to sieve RNA 3D models and exclude models that are incompatible with experimental SHAPE data.
Galtier, N; Boursot, P
2000-03-01
A new, model-based method was devised to locate nucleotide changes in a given phylogenetic tree. For each site, the posterior probability of any possible change in each branch of the tree is computed. This probabilistic method is a valuable alternative to the maximum parsimony method when base composition is skewed (i.e., different from 25% A, 25% C, 25% G, 25% T): computer simulations showed that parsimony misses more rare --> common than common --> rare changes, resulting in biased inferred change matrices, whereas the new method appeared unbiased. The probabilistic method was applied to the analysis of the mutation and substitution processes in the mitochondrial control region of mouse. Distinct change patterns were found at the polymorphism (within species) and divergence (between species) levels, rejecting the hypothesis of a neutral evolution of base composition in mitochondrial DNA.
Isolated nucleic acids encoding antipathogenic polypeptides and uses thereof
Altier, Daniel J.; Crane, Virginia C.; Ellanskaya, Irina; Ellanskaya, Natalia; Gilliam, Jacob T.; Hunter-Cevera, Jennie; Presnail, James K.; Schepers, Eric J.; Simmons, Carl R.; Torok, Tamas; Yalpani, Nasser
2010-04-20
Compositions and methods for protecting a plant from a pathogen, particularly a fungal pathogen, are provided. Compositions include amino acid sequences, and variants and fragments thereof, for antipathogenic polypeptides that were isolated from fungal fermentation broths. Nucleic acids that encode the antipathogenic polypeptides are also provided. A method for inducing pathogen resistance in a plant using the nucleotide sequences disclosed herein is further provided. The method comprises introducing into a plant an expression cassette comprising a promoter operably linked to a nucleotide sequence that encodes an antipathogenic polypeptide of the invention. Compositions comprising an antipathogenic polypeptide or a transformed microorganism comprising a nucleic acid of the invention in combination with a carrier and methods of using these compositions to protect a plant from a pathogen are further provided. Transformed plants, plant cells, seeds, and microorganisms comprising a nucleotide sequence that encodes an antipathogenic polypeptide of the invention are also disclosed.
Dass, J Febin Prabhu; Sudandiradoss, C
2012-07-15
5-HT (5-Hydroxy-tryptamine) or serotonin receptors are found both in central and peripheral nervous system as well as in non-neuronal tissues. In the animal and human nervous system, serotonin produces various functional effects through a variety of membrane bound receptors. In this study, we focus on 5-HT receptor family from different mammals and examined the factors that account for codon and nucleotide usage variation. A total of 110 homologous coding sequences from 11 different mammalian species were analyzed using relative synonymous codon usage (RSCU), correspondence analysis (COA) and hierarchical cluster analysis together with nucleotide base usage frequency of chemically similar amino acid codons. The mean effective number of codon (ENc) value of 37.06 for 5-HT(6) shows very high codon bias within the family and may be due to high selective translational efficiency. The COA and Spearman's rank correlation reveals that the nucleotide compositional mutation bias as the major factors influencing the codon usage in serotonin receptor genes. The hierarchical cluster analysis suggests that gene function is another dominant factor that affects the codon usage bias, while species is a minor factor. Nucleotide base usage was reported using Goldman, Engelman, Stietz (GES) scale reveals the presence of high uracil (>45%) content at functionally important hydrophobic regions. Our in silico approach will certainly help for further investigations on critical inference on evolution, structure, function and gene expression aspects of 5-HT receptors family which are potential antipsychotic drug targets. Copyright © 2012 Elsevier B.V. All rights reserved.
Meiler, Arno; Klinger, Claudia; Kaufmann, Michael
2012-09-08
The COG database is the most popular collection of orthologous proteins from many different completely sequenced microbial genomes. Per definition, a cluster of orthologous groups (COG) within this database exclusively contains proteins that most likely achieve the same cellular function. Recently, the COG database was extended by assigning to every protein both the corresponding amino acid and its encoding nucleotide sequence resulting in the NUCOCOG database. This extended version of the COG database is a valuable resource connecting sequence features with the functionality of the respective proteins. Here we present ANCAC, a web tool and MySQL database for the analysis of amino acid, nucleotide, and codon frequencies in COGs on the basis of freely definable phylogenetic patterns. We demonstrate the usefulness of ANCAC by analyzing amino acid frequencies, codon usage, and GC-content in a species- or function-specific context. With respect to amino acids we, at least in part, confirm the cognate bias hypothesis by using ANCAC's NUCOCOG dataset as the largest one available for that purpose thus far. Using the NUCOCOG datasets, ANCAC connects taxonomic, amino acid, and nucleotide sequence information with the functional classification via COGs and provides a GUI for flexible mining for sequence-bias. Thereby, to our knowledge, it is the only tool for the analysis of sequence composition in the light of physiological roles and phylogenetic context without requirement of substantial programming-skills.
2012-01-01
Background The COG database is the most popular collection of orthologous proteins from many different completely sequenced microbial genomes. Per definition, a cluster of orthologous groups (COG) within this database exclusively contains proteins that most likely achieve the same cellular function. Recently, the COG database was extended by assigning to every protein both the corresponding amino acid and its encoding nucleotide sequence resulting in the NUCOCOG database. This extended version of the COG database is a valuable resource connecting sequence features with the functionality of the respective proteins. Results Here we present ANCAC, a web tool and MySQL database for the analysis of amino acid, nucleotide, and codon frequencies in COGs on the basis of freely definable phylogenetic patterns. We demonstrate the usefulness of ANCAC by analyzing amino acid frequencies, codon usage, and GC-content in a species- or function-specific context. With respect to amino acids we, at least in part, confirm the cognate bias hypothesis by using ANCAC’s NUCOCOG dataset as the largest one available for that purpose thus far. Conclusions Using the NUCOCOG datasets, ANCAC connects taxonomic, amino acid, and nucleotide sequence information with the functional classification via COGs and provides a GUI for flexible mining for sequence-bias. Thereby, to our knowledge, it is the only tool for the analysis of sequence composition in the light of physiological roles and phylogenetic context without requirement of substantial programming-skills. PMID:22958836
The primary structure of the thymidine kinase gene of fish lymphocystis disease virus.
Schnitzler, P; Handermann, M; Szépe, O; Darai, G
1991-06-01
The DNA nucleotide sequence of the thymidine kinase (TK) gene of fish lymphocystis disease virus (FLDV) which has been localized between the coordinates 0.678 to 0.688 of the viral genome was determined. The analysis of the DNA nucleotide sequence located between the recognition sites of HindIII (0.669 map unit; nucleotide position 1) and AccI (nucleotide position 2032) revealed the presence of an open reading frame of 954 bp on the lower strand of this region between nucleotide positions 1868 (ATG) and 915 (TAA). It encodes for a protein of 318 amino acid residues. The evolutionary relationships of the TK gene of FLDV to the other known TK genes was investigated using the method of progressive sequence alignment. These analyses revealed a high degree of diversity between the protein sequence of FLDV TK gene and the amino acid composition of other TKs tested. However, significant conservations were detected at several regions of amino acid residues of the FLDV TK protein when compared to the amino acid sequence of TKs of African swine fever virus, fowlpox virus, shope fibroma virus, and vaccinia virus and to the amino acid sequences of the cellular cytoplasmic TK of chicken, mouse, and man.
Dinucleotide Composition in Animal RNA Viruses Is Shaped More by Virus Family than by Host Species
Di Giallonardo, Francesca; Schlub, Timothy E.; Shi, Mang
2017-01-01
ABSTRACT Viruses use the cellular machinery of their hosts for replication. It has therefore been proposed that the nucleotide and dinucleotide compositions of viruses should match those of their host species. If this is upheld, it may then be possible to use dinucleotide composition to predict the true host species of viruses sampled in metagenomic surveys. However, it is also clear that different taxonomic groups of viruses tend to have distinctive patterns of dinucleotide composition that may be independent of host species. To determine the relative strength of the effect of host versus virus family in shaping dinucleotide composition, we performed a comparative analysis of 20 RNA virus families from 15 host groupings, spanning two animal phyla and more than 900 virus species. In particular, we determined the odds ratios for the 16 possible dinucleotides and performed a discriminant analysis to evaluate the capability of virus dinucleotide composition to predict the correct virus family or host taxon from which it was isolated. Notably, while 81% of the data analyzed here were predicted to the correct virus family, only 62% of these data were predicted to their correct subphylum/class host and a mere 32% to their correct mammalian order. Similarly, dinucleotide composition has a weak predictive power for different hosts within individual virus families. We therefore conclude that dinucleotide composition is generally uniform within a virus family but less well reflects that of its host species. This has obvious implications for attempts to accurately predict host species from virus genome sequences alone. IMPORTANCE Determining the processes that shape virus genomes is central to understanding virus evolution and emergence. One question of particular importance is why nucleotide and dinucleotide frequencies differ so markedly between viruses. In particular, it is currently unclear whether host species or virus family has the biggest impact on dinucleotide frequencies and whether dinucleotide composition can be used to accurately predict host species. Using a comparative analysis, we show that dinucleotide composition has a strong phylogenetic association across different RNA virus families, such that dinucleotide composition can predict the family from which a virus sequence has been isolated. Conversely, dinucleotide composition has a poorer predictive power for the different host species within a virus family and across different virus families, indicating that the host has a relatively small impact on the dinucleotide composition of a virus genome. PMID:28148785
Nucleic acids encoding antifungal polypeptides and uses thereof
Altier, Daniel J.; Ellanskaya, I. A.; Gilliam, Jacob T.; Hunter-Cevera, Jennie; Presnail, James K; Schepers, Eric; Simmons, Carl R.; Torok, Tamas; Yalpani, Nasser
2010-11-02
Compositions and methods for protecting a plant from a pathogen, particularly a fungal pathogen, are provided. Compositions include an amino acid sequence, and variants and fragments thereof, for an antipathogenic polypeptide that was isolated from a fungal fermentation broth. Nucleic acid molecules that encode the antipathogenic polypeptides of the invention, and antipathogenic domains thereof, are also provided. A method for inducing pathogen resistance in a plant using the nucleotide sequences disclosed herein is further provided. The method comprises introducing into a plant an expression cassette comprising a promoter operably linked to a nucleotide sequence that encodes an antipathogenic polypeptide of the invention. Compositions comprising an antipathogenic polypeptide or a transformed microorganism comprising a nucleic acid of the invention in combination with a carrier and methods of using these compositions to protect a plant from a pathogen are further provided. Transformed plants, plant cells, seeds, and microorganisms comprising a nucleotide sequence that encodes an antipathogenic polypeptide of the invention are also disclosed.
Nucleotide sequence composition and method for detection of neisseria gonorrhoeae
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lo, A.; Yang, H.L.
1990-02-13
This patent describes a composition of matter that is specific for {ital Neisseria gonorrhoeae}. It comprises: at least one nucleotide sequence for which the ratio of the amount of the sequence which hybridizes to chromosomal DNA of {ital Neisseria gonorrhoeae} to the amount of the sequence which hybridizes to chromosomal DNA of {ital Neisseria meningitidis} is greater than about five. The ratio being obtained by a method described.
Takahashi, Mayumi; Wu, Xiwei; Ho, Michelle; Chomchan, Pritsana; Rossi, John J; Burnett, John C; Zhou, Jiehua
2016-09-22
The systemic evolution of ligands by exponential enrichment (SELEX) technique is a powerful and effective aptamer-selection procedure. However, modifications to the process can dramatically improve selection efficiency and aptamer performance. For example, droplet digital PCR (ddPCR) has been recently incorporated into SELEX selection protocols to putatively reduce the propagation of byproducts and avoid selection bias that result from differences in PCR efficiency of sequences within the random library. However, a detailed, parallel comparison of the efficacy of conventional solution PCR versus the ddPCR modification in the RNA aptamer-selection process is needed to understand effects on overall SELEX performance. In the present study, we took advantage of powerful high throughput sequencing technology and bioinformatics analysis coupled with SELEX (HT-SELEX) to thoroughly investigate the effects of initial library and PCR methods in the RNA aptamer identification. Our analysis revealed that distinct "biased sequences" and nucleotide composition existed in the initial, unselected libraries purchased from two different manufacturers and that the fate of the "biased sequences" was target-dependent during selection. Our comparison of solution PCR- and ddPCR-driven HT-SELEX demonstrated that PCR method affected not only the nucleotide composition of the enriched sequences, but also the overall SELEX efficiency and aptamer efficacy.
Takahashi, Mayumi; Wu, Xiwei; Ho, Michelle; Chomchan, Pritsana; Rossi, John J.; Burnett, John C.; Zhou, Jiehua
2016-01-01
The systemic evolution of ligands by exponential enrichment (SELEX) technique is a powerful and effective aptamer-selection procedure. However, modifications to the process can dramatically improve selection efficiency and aptamer performance. For example, droplet digital PCR (ddPCR) has been recently incorporated into SELEX selection protocols to putatively reduce the propagation of byproducts and avoid selection bias that result from differences in PCR efficiency of sequences within the random library. However, a detailed, parallel comparison of the efficacy of conventional solution PCR versus the ddPCR modification in the RNA aptamer-selection process is needed to understand effects on overall SELEX performance. In the present study, we took advantage of powerful high throughput sequencing technology and bioinformatics analysis coupled with SELEX (HT-SELEX) to thoroughly investigate the effects of initial library and PCR methods in the RNA aptamer identification. Our analysis revealed that distinct “biased sequences” and nucleotide composition existed in the initial, unselected libraries purchased from two different manufacturers and that the fate of the “biased sequences” was target-dependent during selection. Our comparison of solution PCR- and ddPCR-driven HT-SELEX demonstrated that PCR method affected not only the nucleotide composition of the enriched sequences, but also the overall SELEX efficiency and aptamer efficacy. PMID:27652575
PseKNC: a flexible web server for generating pseudo K-tuple nucleotide composition.
Chen, Wei; Lei, Tian-Yu; Jin, Dian-Chuan; Lin, Hao; Chou, Kuo-Chen
2014-07-01
The pseudo oligonucleotide composition, or pseudo K-tuple nucleotide composition (PseKNC), can be used to represent a DNA or RNA sequence with a discrete model or vector yet still keep considerable sequence order information, particularly the global or long-range sequence order information, via the physicochemical properties of its constituent oligonucleotides. Therefore, the PseKNC approach may hold very high potential for enhancing the power in dealing with many problems in computational genomics and genome sequence analysis. However, dealing with different DNA or RNA problems may need different kinds of PseKNC. Here, we present a flexible and user-friendly web server for PseKNC (at http://lin.uestc.edu.cn/pseknc/default.aspx) by which users can easily generate many different modes of PseKNC according to their need by selecting various parameters and physicochemical properties. Furthermore, for the convenience of the vast majority of experimental scientists, a step-by-step guide is provided on how to use the current web server to generate their desired PseKNC without the need to follow the complicated mathematical equations, which are presented in this article just for the integrity of PseKNC formulation and its development. It is anticipated that the PseKNC web server will become a very useful tool in computational genomics and genome sequence analysis. Copyright © 2014 Elsevier Inc. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Solovyev, V.V.; Salamov, A.A.; Lawrence, C.B.
1994-12-31
Discriminant analysis is applied to the problem of recognition 5`-, internal and 3`-exons in human DNA sequences. Specific recognition functions were developed for revealing exons of particular types. The method based on a splice site prediction algorithm that uses the linear Fisher discriminant to combine the information about significant triplet frequencies of various functional parts of splice site regions and preferences of oligonucleotide in protein coding and nation regions. The accuracy of our splice site recognition function is about 97%. A discriminant function for 5`-exon prediction includes hexanucleotide composition of upstream region, triplet composition around the ATG codon, ORF codingmore » potential, donor splice site potential and composition of downstream introit region. For internal exon prediction, we combine in a discriminant function the characteristics describing the 5`- intron region, donor splice site, coding region, acceptor splice site and Y-intron region for each open reading frame flanked by GT and AG base pairs. The accuracy of precise internal exon recognition on a test set of 451 exon and 246693 pseudoexon sequences is 77% with a specificity of 79% and a level of pseudoexon ORF prediction of 99.96%. The recognition quality computed at the level of individual nucleotides is 89%, for exon sequences and 98% for intron sequences. A discriminant function for 3`-exon prediction includes octanucleolide composition of upstream nation region, triplet composition around the stop codon, ORF coding potential, acceptor splice site potential and hexanucleotide composition of downstream region. We unite these three discriminant functions in exon predicting program FEX (find exons). FEX exactly predicts 70% of 1016 exons from the test of 181 complete genes with specificity 73%, and 89% exons are exactly or partially predicted. On the average, 85% of nucleotides were predicted accurately with specificity 91%.« less
Codon usage bias and phylogenetic analysis of mitochondrial ND1 gene in pisces, aves, and mammals.
Uddin, Arif; Choudhury, Monisha Nath; Chakraborty, Supriyo
2018-01-01
The mitochondrially encoded NADH:ubiquinone oxidoreductase core subunit 1 (MT-ND1) gene is a subunit of the respiratory chain complex I and involved in the first step of the electron transport chain of oxidative phosphorylation (OXPHOS). To understand the pattern of compositional properties, codon usage and expression level of mitochondrial ND1 genes in pisces, aves, and mammals, we used bioinformatic approaches as no work was reported earlier. In this study, a perl script was used for calculating nucleotide contents and different codon usage bias parameters. The codon usage bias of MT-ND1 was low but the expression level was high as revealed from high ENC and CAI value. Correspondence analysis (COA) suggests that the pattern of codon usage for MT-ND1 gene is not same across species and that compositional constraint played an important role in codon usage pattern of this gene among pisces, aves, and mammals. From the regression equation of GC12 on GC3, it can be inferred that the natural selection might have played a dominant role while mutation pressure played a minor role in influencing the codon usage patterns. Further, ND1 gene has a discrepancy with cytochrome B (CYB) gene in preference of codons as evident from COA. The codon usage bias was low. It is influenced by nucleotide composition, natural selection, mutation pressure, length (number) of amino acids, and relative dinucleotide composition. This study helps in understanding the molecular biology, genetics, evolution of MT-ND1 gene, and also for designing a synthetic gene.
High-Resolution Melt Analysis for Rapid Comparison of Bacterial Community Compositions
Hjelmsø, Mathis Hjort; Hansen, Lars Hestbjerg; Bælum, Jacob; Feld, Louise; Holben, William E.
2014-01-01
In the study of bacterial community composition, 16S rRNA gene amplicon sequencing is today among the preferred methods of analysis. The cost of nucleotide sequence analysis, including requisite computational and bioinformatic steps, however, takes up a large part of many research budgets. High-resolution melt (HRM) analysis is the study of the melt behavior of specific PCR products. Here we describe a novel high-throughput approach in which we used HRM analysis targeting the 16S rRNA gene to rapidly screen multiple complex samples for differences in bacterial community composition. We hypothesized that HRM analysis of amplified 16S rRNA genes from a soil ecosystem could be used as a screening tool to identify changes in bacterial community structure. This hypothesis was tested using a soil microcosm setup exposed to a total of six treatments representing different combinations of pesticide and fertilization treatments. The HRM analysis identified a shift in the bacterial community composition in two of the treatments, both including the soil fumigant Basamid GR. These results were confirmed with both denaturing gradient gel electrophoresis (DGGE) analysis and 454-based 16S rRNA gene amplicon sequencing. HRM analysis was shown to be a fast, high-throughput technique that can serve as an effective alternative to gel-based screening methods to monitor microbial community composition. PMID:24610853
Dinucleotide Composition in Animal RNA Viruses Is Shaped More by Virus Family than by Host Species.
Di Giallonardo, Francesca; Schlub, Timothy E; Shi, Mang; Holmes, Edward C
2017-04-15
Viruses use the cellular machinery of their hosts for replication. It has therefore been proposed that the nucleotide and dinucleotide compositions of viruses should match those of their host species. If this is upheld, it may then be possible to use dinucleotide composition to predict the true host species of viruses sampled in metagenomic surveys. However, it is also clear that different taxonomic groups of viruses tend to have distinctive patterns of dinucleotide composition that may be independent of host species. To determine the relative strength of the effect of host versus virus family in shaping dinucleotide composition, we performed a comparative analysis of 20 RNA virus families from 15 host groupings, spanning two animal phyla and more than 900 virus species. In particular, we determined the odds ratios for the 16 possible dinucleotides and performed a discriminant analysis to evaluate the capability of virus dinucleotide composition to predict the correct virus family or host taxon from which it was isolated. Notably, while 81% of the data analyzed here were predicted to the correct virus family, only 62% of these data were predicted to their correct subphylum/class host and a mere 32% to their correct mammalian order. Similarly, dinucleotide composition has a weak predictive power for different hosts within individual virus families. We therefore conclude that dinucleotide composition is generally uniform within a virus family but less well reflects that of its host species. This has obvious implications for attempts to accurately predict host species from virus genome sequences alone. IMPORTANCE Determining the processes that shape virus genomes is central to understanding virus evolution and emergence. One question of particular importance is why nucleotide and dinucleotide frequencies differ so markedly between viruses. In particular, it is currently unclear whether host species or virus family has the biggest impact on dinucleotide frequencies and whether dinucleotide composition can be used to accurately predict host species. Using a comparative analysis, we show that dinucleotide composition has a strong phylogenetic association across different RNA virus families, such that dinucleotide composition can predict the family from which a virus sequence has been isolated. Conversely, dinucleotide composition has a poorer predictive power for the different host species within a virus family and across different virus families, indicating that the host has a relatively small impact on the dinucleotide composition of a virus genome. Copyright © 2017 American Society for Microbiology.
NASA Astrophysics Data System (ADS)
Mackiewicz, P.; Gierlik, A.; Kowalczuk, M.; Szczepanik, D.; Dudek, M. R.; Cebrat, S.
1999-12-01
We have analysed protein coding and intergenic sequences in the Borrelia burgdorferi (the Lyme disease bacterium) genome using different kinds of DNA walks. Genes occupying the leading strand of DNA have significantly different nucleotide composition from genes occupying the lagging strand. Nucleotide compositional bias of the two DNA strands reflects the aminoacid composition of proteins. 96% of genes coding for ribosomal proteins lie on the leading DNA strand, which suggests that the positions of these as well as other genes are non-random. In the B. burgdorferi genome, the asymmetry in intergenic DNA sequences is lower than the asymmetry in the third positions in codons. All these characters of the B. burgdorferi genome suggest that both replication-associated mutational pressure and recombination mechanisms have established the specific structure of the genome and now any recombination leading to inversion of a gene in respect to the direction of replication is forbidden. This property of the genome allows us to assume that it is in a steady state, which enables us to fix some parameters for simulations of DNA evolution.
Genomic mid-range inhomogeneity correlates with an abundance of RNA secondary structures
Bechtel, Jason M; Wittenschlaeger, Thomas; Dwyer, Trisha; Song, Jun; Arunachalam, Sasi; Ramakrishnan, Sadeesh K; Shepard, Samuel; Fedorov, Alexei
2008-01-01
Background Genomes possess different levels of non-randomness, in particular, an inhomogeneity in their nucleotide composition. Inhomogeneity is manifest from the short-range where neighboring nucleotides influence the choice of base at a site, to the long-range, commonly known as isochores, where a particular base composition can span millions of nucleotides. A separate genomic issue that has yet to be thoroughly elucidated is the role that RNA secondary structure (SS) plays in gene expression. Results We present novel data and approaches that show that a mid-range inhomogeneity (~30 to 1000 nt) not only exists in mammalian genomes but is also significantly associated with strong RNA SS. A whole-genome bioinformatics investigation of local SS in a set of 11,315 non-redundant human pre-mRNA sequences has been carried out. Four distinct components of these molecules (5'-UTRs, exons, introns and 3'-UTRs) were considered separately, since they differ in overall nucleotide composition, sequence motifs and periodicities. For each pre-mRNA component, the abundance of strong local SS (< -25 kcal/mol) was a factor of two to ten greater than a random expectation model. The randomization process preserves the short-range inhomogeneity of the corresponding natural sequences, thus, eliminating short-range signals as possible contributors to any observed phenomena. Conclusion We demonstrate that the excess of strong local SS in pre-mRNAs is linked to the little explored phenomenon of genomic mid-range inhomogeneity (MRI). MRI is an interdependence between nucleotide choice and base composition over a distance of 20–1000 nt. Additionally, we have created a public computational resource to support further study of genomic MRI. PMID:18549495
Bamorovat, Mehdi; Sharifi, Iraj; Mohammadi, Mohammad Ali; Eybpoosh, Sana; Nasibi, Saeid; Aflatoonian, Mohammad Reza; Khosravi, Ahmad
2018-03-01
The precise identification of the parasite species causing leishmaniasis is essential for selecting proper treatment modality. The present study aims to compare the nucleotide variations of the ITS1, 7SL RNA, and Hsp70 sequences between non-healed and healed anthroponotic cutaneous leishmaniasis (ACL) patients in major foci in Iran. A case-control study was carried out from September 2015 to October 2016 in the cities of Kerman and Bam, in the southeast of Iran. Randomly selected skin-scraping lesions of 40 patients (20 non-healed and 20 healed) were examined and the organisms were grown in a culture medium. Promastigotes were collected by centrifugation and kept for further molecular examinations. The extracted DNA was amplified and sequenced. After global sequence alignment with BioEdit software, maximum likelihood phylogenetic analysis was performed in PhyML for typing of Leishmania isolates. Nucleotide composition of each genetic region was also compared between non-healed and healed patients. Our results showed that all isolates belonged to the Leishmania tropica complex, with their genetic composition in the ITS1 region being different among non-healed and healed patients. 7SL RNA and Hsp70 regions were genetically identical between both groups. Variability in nucleotide patterns observed between both groups in the ITS1 region may serve to encourage future research on the function of these polymorphisms and may improve our understanding of the role of parasite genome properties on patients' response to Leishmania treatment. Our results also do not support future use of 7SL RNA and Hsp70 regions of the parasite for comparative genomic analyses. Copyright © 2018 Elsevier Ltd. All rights reserved.
Switchgrass ubiquitin promoter (PVUBI2) and uses thereof
Stewart, C. Neal; Mann, David George James
2013-12-10
The subject application provides polynucleotides, compositions thereof and methods for regulating gene expression in a plant. Polynucleotides disclosed herein comprise novel sequences for a promoter isolated from Panicum virgatum (switchgrass) that initiates transcription of an operably linked nucleotide sequence. Thus, various embodiments of the invention comprise the nucleotide sequence of SEQ ID NO: 2 or fragments thereof comprising nucleotides 1 to 692 of SEQ ID NO: 2 that are capable of driving the expression of an operably linked nucleic acid sequence.
Hashimoto, Masayuki; Fukui, Mitsuru; Hayano, Kouichi; Hayatsu, Masahito
2002-01-01
Rhizobium sp. strain AC100, which is capable of degrading carbaryl (1-naphthyl-N-methylcarbamate), was isolated from soil treated with carbaryl. This bacterium hydrolyzed carbaryl to 1-naphthol and methylamine. Carbaryl hydrolase from the strain was purified to homogeneity, and its N-terminal sequence, molecular mass (82 kDa), and enzymatic properties were determined. The purified enzyme hydrolyzed 1-naphthyl acetate and 4-nitrophenyl acetate indicating that the enzyme is an esterase. We then cloned the carbaryl hydrolase gene (cehA) from the plasmid DNA of the strain and determined the nucleotide sequence of the 10-kb region containing cehA. No homologous sequences were found by a database homology search using the nucleotide and deduced amino acid sequences of the cehA gene. Six open reading frames including the cehA gene were found in the 10-kb region, and sequencing analysis shows that the cehA gene is flanked by two copies of insertion sequence-like sequence, suggesting that it makes part of a composite transposon. PMID:11872471
Szilágyi, András; Zachar, István; Szathmáry, Eörs
2013-01-01
Models of competitive template replication, although basic for replicator dynamics and primordial evolution, have not yet taken different sequences explicitly into account, neither have they analyzed the effect of resource partitioning (feeding on different resources) on coexistence. Here we show by analytical and numerical calculations that Gause's principle of competitive exclusion holds for template replicators if resources (nucleotides) affect growth linearly and coexistence is at fixed point attractors. Cases of complementary or homologous pairing between building blocks with parallel or antiparallel strands show no deviation from the rule that the nucleotide compositions of stably coexisting species must be different and there cannot be more coexisting replicator species than nucleotide types. Besides this overlooked mechanism of template coexistence we show also that interesting sequence effects prevail as parts of sequences that are copied earlier affect coexistence more strongly due to the higher concentration of the corresponding replication intermediates. Template and copy always count as one species due their constraint of strict stoichiometric coupling. Stability of fixed-point coexistence tends to decrease with the length of sequences, although this effect is unlikely to be detrimental for sequences below 100 nucleotides. In sum, resource partitioning (niche differentiation) is the default form of competitive coexistence for replicating templates feeding on a cocktail of different nucleotides, as it may have been the case in the RNA world. Our analysis of different pairing and strand orientation schemes is relevant for artificial and potentially astrobiological genetics. PMID:23990769
Manipulation of lignin composition in plants using a tissue-specific promoter
Chapple, Clinton C. S.
2003-08-26
The present invention relates to methods and materials in the field of molecular biology, the manipulation of the phenylpropanoid pathway and the regulation of proteins synthesis through plant genetic engineering. More particularly, the invention relates to the introduction of a foreign nucleotide sequence into a plant genome, wherein the introduction of the nucleotide sequence effects an increase in the syringyl content of the plant's lignin. In one specific aspect, the invention relates to methods for modifying the plant lignin composition in a plant cell by the introduction there into of a foreign nucleotide sequence comprising at issue specific plant promoter sequence and a sequence encoding an active ferulate-5-hydroxylase (F5H) enzyme. Plant transformants harboring an inventive promoter-F5H construct demonstrate increased levels of syringyl monomer residues in their lignin, rendering the polymer more readily delignified and, thereby, rendering the plant more readily pulped or digested.
Oligonucleotide fingerprinting of rRNA genes for analysis of fungal community composition.
Valinsky, Lea; Della Vedova, Gianluca; Jiang, Tao; Borneman, James
2002-12-01
Thorough assessments of fungal diversity are currently hindered by technological limitations. Here we describe a new method for identifying fungi, oligonucleotide fingerprinting of rRNA genes (OFRG). ORFG sorts arrayed rRNA gene (ribosomal DNA [rDNA]) clones into taxonomic clusters through a series of hybridization experiments, each using a single oligonucleotide probe. A simulated annealing algorithm was used to design an OFRG probe set for fungal rDNA. Analysis of 1,536 fungal rDNA clones derived from soil generated 455 clusters. A pairwise sequence analysis showed that clones with average sequence identities of 99.2% were grouped into the same cluster. To examine the accuracy of the taxonomic identities produced by this OFRG experiment, we determined the nucleotide sequences for 117 clones distributed throughout the tree. For all but two of these clones, the taxonomic identities generated by this OFRG experiment were consistent with those generated by a nucleotide sequence analysis. Eighty-eight percent of the clones were affiliated with Ascomycota, while 12% belonged to BASIDIOMYCOTA: A large fraction of the clones were affiliated with the genera Fusarium (404 clones) and Raciborskiomyces (176 clones). Smaller assemblages of clones had high sequence identities to the Alternaria, Ascobolus, Chaetomium, Cryptococcus, and Rhizoctonia clades.
Simple sequence repeats in Escherichia coli: abundance, distribution, composition, and polymorphism.
Gur-Arie, R; Cohen, C J; Eitan, Y; Shelef, L; Hallerman, E M; Kashi, Y
2000-01-01
Computer-based genome-wide screening of the DNA sequence of Escherichia coli strain K12 revealed tens of thousands of tandem simple sequence repeat (SSR) tracts, with motifs ranging from 1 to 6 nucleotides. SSRs were well distributed throughout the genome. Mononucleotide SSRs were over-represented in noncoding regions and under-represented in open reading frames (ORFs). Nucleotide composition of mono- and dinucleotide SSRs, both in ORFs and in noncoding regions, differed from that of the genomic region in which they occurred, with 93% of all mononucleotide SSRs proving to be of A or T. Computer-based analysis of the fine position of every SSR locus in the noncoding portion of the genome relative to downstream ORFs showed SSRs located in areas that could affect gene regulation. DNA sequences at 14 arbitrarily chosen SSR tracts were compared among E. coli strains. Polymorphisms of SSR copy number were observed at four of seven mononucleotide SSR tracts screened, with all polymorphisms occurring in noncoding regions. SSR polymorphism could prove important as a genome-wide source of variation, both for practical applications (including rapid detection, strain identification, and detection of loci affecting key phenotypes) and for evolutionary adaptation of microbes.
Predicting protein-binding regions in RNA using nucleotide profiles and compositions.
Choi, Daesik; Park, Byungkyu; Chae, Hanju; Lee, Wook; Han, Kyungsook
2017-03-14
Motivated by the increased amount of data on protein-RNA interactions and the availability of complete genome sequences of several organisms, many computational methods have been proposed to predict binding sites in protein-RNA interactions. However, most computational methods are limited to finding RNA-binding sites in proteins instead of protein-binding sites in RNAs. Predicting protein-binding sites in RNA is more challenging than predicting RNA-binding sites in proteins. Recent computational methods for finding protein-binding sites in RNAs have several drawbacks for practical use. We developed a new support vector machine (SVM) model for predicting protein-binding regions in mRNA sequences. The model uses sequence profiles constructed from log-odds scores of mono- and di-nucleotides and nucleotide compositions. The model was evaluated by standard 10-fold cross validation, leave-one-protein-out (LOPO) cross validation and independent testing. Since actual mRNA sequences have more non-binding regions than protein-binding regions, we tested the model on several datasets with different ratios of protein-binding regions to non-binding regions. The best performance of the model was obtained in a balanced dataset of positive and negative instances. 10-fold cross validation with a balanced dataset achieved a sensitivity of 91.6%, a specificity of 92.4%, an accuracy of 92.0%, a positive predictive value (PPV) of 91.7%, a negative predictive value (NPV) of 92.3% and a Matthews correlation coefficient (MCC) of 0.840. LOPO cross validation showed a lower performance than the 10-fold cross validation, but the performance remains high (87.6% accuracy and 0.752 MCC). In testing the model on independent datasets, it achieved an accuracy of 82.2% and an MCC of 0.656. Testing of our model and other state-of-the-art methods on a same dataset showed that our model is better than the others. Sequence profiles of log-odds scores of mono- and di-nucleotides were much more powerful features than nucleotide compositions in finding protein-binding regions in RNA sequences. But, a slight performance gain was obtained when using the sequence profiles along with nucleotide compositions. These are preliminary results of ongoing research, but demonstrate the potential of our approach as a powerful predictor of protein-binding regions in RNA. The program and supporting data are available at http://bclab.inha.ac.kr/RBPbinding .
Kiesler, Kevin M; Coble, Michael D; Hall, Thomas A; Vallone, Peter M
2014-01-01
A set of 711 samples from four U.S. population groups was analyzed using a novel mass spectrometry based method for mitochondrial DNA (mtDNA) base composition profiling. Comparison of the mass spectrometry results with Sanger sequencing derived data yielded a concordance rate of 99.97%. Length heteroplasmy was identified in 46% of samples and point heteroplasmy was observed in 6.6% of samples in the combined mass spectral and Sanger data set. Using discrimination capacity as a metric, Sanger sequencing of the full control region had the highest discriminatory power, followed by the mass spectrometry base composition method, which was more discriminating than Sanger sequencing of just the hypervariable regions. This trend is in agreement with the number of nucleotides covered by each of the three assays. Published by Elsevier Ireland Ltd.
Zhou, Shuai; Tang, Qing-Jiu; Zhang, Zhong; Li, Chuan-hua; Cao, Hui; Yang, Yan; Zhang, Jing-Song
2015-01-01
The nutritional composition of three recently domesticated culinary-medicinal mushroom species (Oudemansiella sudmusida, Lentinus squarrosulus, and Tremella aurantialba) was evaluated for contents of protein, fiber, fat, total sugar content, amino acid, carbohydrate, and nucleotide components. The data indicated that fruiting bodies of these three mushroom species contained abundant nutritional substances. The protein contents of L. squarrosulus and O. submucida were 26.32% and 14.70%, which could be comparable to other commercially cultivated species. T. aurantialba contained 74.11% of carbohydrate, of which soluble polysaccharide was 40.55%. Oudemansiella sudmusida contained 15.95% of arabitol as the highest sugar alcohol in three mushrooms. These mushrooms also possessed distinct taste by their flavor component composition. Among them, L. squarrosulus contained 10.68% and 9.25% of monosodium glutamate-like and sweet amino acids, which were higher than the other two mushrooms. However, the nucleotide amounts of the three mushrooms were all lower than those of other commercially cultivated mushrooms. Among them, L. squarrosulus contained the highest amount of flavor nucleotides, which was 1.01‰. Results revealed that these three mushroom species are potentially suitable resources for commercial cultivation and healthy food.
Vieira, Elsa; Brandão, Tiago; Ferreira, Isabel M P L V O
2013-09-18
The present work evaluates the influence of serial yeast repitching on nucleotide composition of brewer's spent yeast extracts produced without addition of exogenous enzymes. Two procedures for disrupting cell walls were compared, and the conditions for low-cost and efficient RNA hydrolysis were selected. A HILIC methodology was validated for the quantification of nucleotides and nucleosides in yeast extracts. Thirty-seven samples of brewer's spent yeast ( Saccharomyces pastorianus ) organized according to the number of serial repitchings were analyzed. Nucleotides accounted for 71.1-88.2% of the RNA products; 2'AMP was the most abundant (ranging between 0.08 and 2.89 g/100 g dry yeast). 5'GMP content ranged between 0.082 and 0.907 g/100 g dry yeast. The sum of 5'GMP, 5'IMP, and 5'AMP represented between 25 and 32% of total nucleotides. This works highlights for the first time that although serial repitching influences the content of monophosphate nucleotides and nucleosides, the profiles of these RNA hydrolysis products are not affected.
Compositions and methods for detecting single nucleotide polymorphisms
Yeh, Hsin-Chih; Werner, James; Martinez, Jennifer S.
2016-11-22
Described herein are nucleic acid based probes and methods for discriminating and detecting single nucleotide variants in nucleic acid molecules (e.g., DNA). The methods include use of a pair of probes can be used to detect and identify polymorphisms, for example single nucleotide polymorphism in DNA. The pair of probes emit a different fluorescent wavelength of light depending on the association and alignment of the probes when hybridized to a target nucleic acid molecule. Each pair of probes is capable of discriminating at least two different nucleic acid molecules that differ by at least a single nucleotide difference. The methods can probes can be used, for example, for detection of DNA polymorphisms that are indicative of a particular disease or condition.
Methods for making nucleotide probes for sequencing and synthesis
Church, George M; Zhang, Kun; Chou, Joseph
2014-07-08
Compositions and methods for making a plurality of probes for analyzing a plurality of nucleic acid samples are provided. Compositions and methods for analyzing a plurality of nucleic acid samples to obtain sequence information in each nucleic acid sample are also provided.
Sample, Paul J.; Gaston, Kirk W.; Alfonzo, Juan D.; Limbach, Patrick A.
2015-01-01
Ribosomal ribonucleic acid (RNA), transfer RNA and other biological or synthetic RNA polymers can contain nucleotides that have been modified by the addition of chemical groups. Traditional Sanger sequencing methods cannot establish the chemical nature and sequence of these modified-nucleotide containing oligomers. Mass spectrometry (MS) has become the conventional approach for determining the nucleotide composition, modification status and sequence of modified RNAs. Modified RNAs are analyzed by MS using collision-induced dissociation tandem mass spectrometry (CID MS/MS), which produces a complex dataset of oligomeric fragments that must be interpreted to identify and place modified nucleosides within the RNA sequence. Here we report the development of RoboOligo, an interactive software program for the robust analysis of data generated by CID MS/MS of RNA oligomers. There are three main functions of RoboOligo: (i) automated de novo sequencing via the local search paradigm. (ii) Manual sequencing with real-time spectrum labeling and cumulative intensity scoring. (iii) A hybrid approach, coined ‘variable sequencing’, which combines the user intuition of manual sequencing with the high-throughput sampling of automated de novo sequencing. PMID:25820423
[Analysis of horizontal transfer gene of Bombyx mori NPV].
Duan, Hai-Rong; Qiu, De-Bin; Gong, Cheng-Liang; Huang, Mo-Li
2011-06-01
For research on genetic characters and evolutionary origin of the genome of baculoviruses, a comprehensive homology search and phylogenetic analysis of the complete genomes of Bombyx mori NPV and Bombyx mori were used. Three horizontally transferred genes (inhibitor of apoptosis, chitinase, and UDP-glucosyltransferase) were identified, and there was evidence that all of these genes were derived from the insect host. The results of analysis showed lots of differences between the features of horizontal transferred genes and the ones of whole genomic genes, such as nucleotide composition, codon usagebias and selection pressure. These results reconfirmed that the horizontally transferred genes are exogenous. The analysis of gene function suggested that horizontally transferred genes acquired from an ancestral host insect can increase the efficiency of baculoviruses transmission.
Liu, Bin; Long, Ren; Chou, Kuo-Chen
2016-08-15
Regulatory DNA elements are associated with DNase I hypersensitive sites (DHSs). Accordingly, identification of DHSs will provide useful insights for in-depth investigation into the function of noncoding genomic regions. In this study, using the strategy of ensemble learning framework, we proposed a new predictor called iDHS-EL for identifying the location of DHS in human genome. It was formed by fusing three individual Random Forest (RF) classifiers into an ensemble predictor. The three RF operators were respectively based on the three special modes of the general pseudo nucleotide composition (PseKNC): (i) kmer, (ii) reverse complement kmer and (iii) pseudo dinucleotide composition. It has been demonstrated that the new predictor remarkably outperforms the relevant state-of-the-art methods in both accuracy and stability. For the convenience of most experimental scientists, a web server for iDHS-EL is established at http://bioinformatics.hitsz.edu.cn/iDHS-EL, which is the first web-server predictor ever established for identifying DHSs, and by which users can easily get their desired results without the need to go through the mathematical details. We anticipate that IDHS-EL: will become a very useful high throughput tool for genome analysis. bliu@gordonlifescience.org or bliu@insun.hit.edu.cn Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Updating Our View of Organelle Genome Nucleotide Landscape
Smith, David Roy
2012-01-01
Organelle genomes show remarkable variation in architecture and coding content, yet their nucleotide composition is relatively unvarying across the eukaryotic domain, with most having a high adenine and thymine (AT) content. Recent studies, however, have uncovered guanine and cytosine (GC)-rich mitochondrial and plastid genomes. These sequences come from a small but eclectic list of species, including certain green plants and animals. Here, I review GC-rich organelle DNAs and the insights they have provided into the evolution of nucleotide landscape. I emphasize that GC-biased mitochondrial and plastid DNAs are more widespread than once thought, sometimes occurring together in the same species, and suggest that the forces biasing their nucleotide content can differ both among and within lineages, and may be associated with specific genome architectural features and life history traits. PMID:22973299
Complex codon usage pattern and compositional features of retroviruses.
RoyChoudhury, Sourav; Mukherjee, Debaprasad
2013-01-01
Retroviruses infect a wide range of organisms including humans. Among them, HIV-1, which causes AIDS, has now become a major threat for world health. Some of these viruses are also potential gene transfer vectors. In this study, the patterns of synonymous codon usage in retroviruses have been studied through multivariate statistical methods on ORFs sequences from the available 56 retroviruses. The principal determinant for evolution of the codon usage pattern in retroviruses seemed to be the compositional constraints, while selection for translation of the viral genes plays a secondary role. This was further supported by multivariate analysis on relative synonymous codon usage. Thus, it seems that mutational bias might have dominated role over translational selection in shaping the codon usage of retroviruses. Codon adaptation index was used to identify translationally optimal codons among genes from retroviruses. The comparative analysis of the preferred and optimal codons among different retroviral groups revealed that four codons GAA, AAA, AGA, and GGA were significantly more frequent in most of the retroviral genes inspite of some differences. Cluster analysis also revealed that phylogenetically related groups of retroviruses have probably evolved their codon usage in a concerted manner under the influence of their nucleotide composition.
Saeed, Isaam; Tang, Sen-Lin; Halgamuge, Saman K.
2012-01-01
An approach to infer the unknown microbial population structure within a metagenome is to cluster nucleotide sequences based on common patterns in base composition, otherwise referred to as binning. When functional roles are assigned to the identified populations, a deeper understanding of microbial communities can be attained, more so than gene-centric approaches that explore overall functionality. In this study, we propose an unsupervised, model-based binning method with two clustering tiers, which uses a novel transformation of the oligonucleotide frequency-derived error gradient and GC content to generate coarse groups at the first tier of clustering; and tetranucleotide frequency to refine these groups at the secondary clustering tier. The proposed method has a demonstrated improvement over PhyloPythia, S-GSOM, TACOA and TaxSOM on all three benchmarks that were used for evaluation in this study. The proposed method is then applied to a pyrosequenced metagenomic library of mud volcano sediment sampled in southwestern Taiwan, with the inferred population structure validated against complementary sequencing of 16S ribosomal RNA marker genes. Finally, the proposed method was further validated against four publicly available metagenomes, including a highly complex Antarctic whale-fall bone sample, which was previously assumed to be too complex for binning prior to functional analysis. PMID:22180538
Hijaz, Faraj; Manthey, John A.; Van der Merwe, Deon; Killiny, Nabil
2016-01-01
ABSTRACT Currently, the global citrus production is declining due to the spread of Huanglongbing (HLB). HLB, otherwise known as citrus greening, is caused by Candidatus Liberibacter asiaticus (CLas) and is transmitted by the Asian citrus psyllids (ACP), Diaphorina citri Kuwayama. ACP transmits CLas bacterium while feeding on the citrus phloem sap. Multiplication of CLas in the phloem of citrus indicates that the sap contains all the essential nutrients needed for CLas. In this study, we investigated the micro- and macro-nutrients, nucleotides, and others secondary metabolites of phloem sap from pineapple sweet orange. The micro- and macro-nutrients were analyzed using inductively coupled plasma-mass spectroscopy (ICP-MS) and inductively coupled plasma-optical emission spectroscopy (ICP-OES). Nucleotides and other secondary metabolites analysis was accomplished by reversed phase HPLC coupled with UV, fluorescence detection, or negative mode electrospray ionization mass spectrometry (ESI-MS). Calcium (89 mM) was the highest element followed by potassium (38.8 mM) and phosphorous (24 mM). Magnesium and sulfur were also abundant and their concentrations were 15 and 9 mM, respectively. The rest of the elements were found in low amounts (< 2mM). The concentrations of ATP, ADP, and AMP were 16, 31, and 3 µ mole/Kg fwt, respectively. GTP, GMP. NAD, FMN, FAD, and riboflavin were found at concentrations below (3 µ mole/Kg fwt). The phloem was rich in nomilin 124 mM and limonin 176 µ mole/Kg fwt. Hesperidin, vicenin-2, sinensetin, and nobiletin were the most predominant flavonoids. In addition, several hydroxycinnamates were detected. The results of this study will increase our knowledge about the nature and the chemical composition of citrus phloem sap. PMID:27171979
Lin, Chung-Jian; Huang, Chi-Chung; Huang, Chao-Ching; Chiang, Yu-Chung; Chiang, Tzen-Yuh
2012-01-01
Background Pinus massoniana, an ecologically and economically important conifer, is widespread across central and southern mainland China and Taiwan. In this study, we tested the central–marginal paradigm that predicts that the marginal populations tend to be less polymorphic than the central ones in their genetic composition, and examined a founders' effect in the island population. Methodology/Principal Findings We examined the phylogeography and population structuring of the P. massoniana based on nucleotide sequences of cpDNA atpB-rbcL intergenic spacer, intron regions of the AdhC2 locus, and microsatellite fingerprints. SAMOVA analysis of nucleotide sequences indicated that most genetic variants resided among geographical regions. High levels of genetic diversity in the marginal populations in the south region, a pattern seemingly contradicting the central–marginal paradigm, and the fixation of private haplotypes in most populations indicate that multiple refugia may have existed over the glacial maxima. STRUCTURE analyses on microsatellites revealed that genetic structure of mainland populations was mediated with recent genetic exchanges mostly via pollen flow, and that the genetic composition in east region was intermixed between south and west regions, a pattern likely shaped by gene introgression and maintenance of ancestral polymorphisms. As expected, the small island population in Taiwan was genetically differentiated from mainland populations. Conclusions/Significance The marginal populations in south region possessed divergent gene pools, suggesting that the past glaciations might have low impacts on these populations at low latitudes. Estimates of ancestral population sizes interestingly reflect a recent expansion in mainland from a rather smaller population, a pattern that seemingly agrees with the pollen record. PMID:22952747
Hijaz, Faraj; Manthey, John A; Van der Merwe, Deon; Killiny, Nabil
2016-06-02
Currently, the global citrus production is declining due to the spread of Huanglongbing (HLB). HLB, otherwise known as citrus greening, is caused by Candidatus Liberibacter asiaticus (CLas) and is transmitted by the Asian citrus psyllids (ACP), Diaphorina citri Kuwayama. ACP transmits CLas bacterium while feeding on the citrus phloem sap. Multiplication of CLas in the phloem of citrus indicates that the sap contains all the essential nutrients needed for CLas. In this study, we investigated the micro- and macro-nutrients, nucleotides, and others secondary metabolites of phloem sap from pineapple sweet orange. The micro- and macro-nutrients were analyzed using inductively coupled plasma-mass spectroscopy (ICP-MS) and inductively coupled plasma-optical emission spectroscopy (ICP-OES). Nucleotides and other secondary metabolites analysis was accomplished by reversed phase HPLC coupled with UV, fluorescence detection, or negative mode electrospray ionization mass spectrometry (ESI-MS). Calcium (89 mM) was the highest element followed by potassium (38.8 mM) and phosphorous (24 mM). Magnesium and sulfur were also abundant and their concentrations were 15 and 9 mM, respectively. The rest of the elements were found in low amounts (< 2mM). The concentrations of ATP, ADP, and AMP were 16, 31, and 3 µ mole/Kg fwt, respectively. GTP, GMP. NAD, FMN, FAD, and riboflavin were found at concentrations below (3 µ mole/Kg fwt). The phloem was rich in nomilin 124 mM and limonin 176 µ mole/Kg fwt. Hesperidin, vicenin-2, sinensetin, and nobiletin were the most predominant flavonoids. In addition, several hydroxycinnamates were detected. The results of this study will increase our knowledge about the nature and the chemical composition of citrus phloem sap.
Tambong, J T; Xu, R; Sadiku, A; Chen, Q; Badiss, A; Yu, Q
2014-04-01
Serratia marcescens strains isolated from entomopathogenic nematodes (Rhabditis sp.) were examined for their pathogenicity and establishment in wax moth (Galleria mellonella) larvae. All the Serratia strains were potently pathogenic to G. mellonella larvae, leading to death within 48 h. The strains were shown to possess a metalloprotease gene encoding for a novel serralysin-like protein. Rapid establishment of the bacteria in infected larvae was confirmed by specific polymerase chain reaction (PCR) detection of a DNA fragment encoding for this protein. Detection of the viable Serratia strains in infected larvae was validated using the SYBR Green reverse transcriptase real-time PCR assay targeting the metalloprotease gene. Nucleotide sequences of the metalloprotease gene obtained in our study showed 72 single nucleotide polymorphisms (SNP) and 3 insertions compared with the metalloprotease gene of S. marcescens E-15. The metalloprotease gene had 60 synonymous and 8 nonsynonymous substitutions relative to the closest GenBank entry, S. marcescens E-15. A comparison of the amino acid composition of the new serralysin-like protein with that of the serralysin protein of S. marcescens E-15 revealed differences at 11 positions and a new aspartic acid residue. Analysis of the effect of protein variation suggests that a new aspartic acid residue resulting from nonsynonymous nucleotide mutations in the protein structure could have the most significant effect on its biological function. The new metalloprotease gene and (or) its product could have applications in plant agricultural biotechnology.
Composition bias and the origin of ORFan genes
Yomtovian, Inbal; Teerakulkittipong, Nuttinee; Lee, Byungkook; Moult, John; Unger, Ron
2010-01-01
Motivation: Intriguingly, sequence analysis of genomes reveals that a large number of genes are unique to each organism. The origin of these genes, termed ORFans, is not known. Here, we explore the origin of ORFan genes by defining a simple measure called ‘composition bias’, based on the deviation of the amino acid composition of a given sequence from the average composition of all proteins of a given genome. Results: For a set of 47 prokaryotic genomes, we show that the amino acid composition bias of real proteins, random ‘proteins’ (created by using the nucleotide frequencies of each genome) and ‘proteins’ translated from intergenic regions are distinct. For ORFans, we observed a correlation between their composition bias and their relative evolutionary age. Recent ORFan proteins have compositions more similar to those of random ‘proteins’, while the compositions of more ancient ORFan proteins are more similar to those of the set of all proteins of the organism. This observation is consistent with an evolutionary scenario wherein ORFan genes emerged and underwent a large number of random mutations and selection, eventually adapting to the composition preference of their organism over time. Contact: ron@biocoml.ls.biu.ac.il Supplementary information: Supplementary data are available at Bioinformatics online. PMID:20231229
Zhao, Ying-Tao; Wang, Meng; Fu, San-Xiong; Yang, Wei-Cai; Qi, Cun-Kou; Wang, Xiu-Jie
2012-02-01
MicroRNAs (miRNAs) and small interfering RNAs are important regulators of plant development and seed formation, yet their population and abundance in the oil crop Brassica napus are still not well understood, especially at different developmental stages and among cultivars with varied seed oil contents. Here, we systematically analyzed the small RNA expression profiles of Brassica napus seeds at early embryonic developmental stages in high-oil-content and low-oil-content B. napus cultivars, both cultured in two environments. A total of 50 conserved miRNAs and 9 new miRNAs were identified, together with some new miRNA targets. Expression analysis revealed some miRNAs with varied expression levels in different seed oil content cultivars or at different embryonic developmental stages. A large number of 23-nucleotide small RNAs with specific nucleotide composition preferences were also identified, which may present new classes of functional small RNAs.
Song, Fan; Shi, Aimin; Zhou, Xuguo; Cai, Wanzhi
2012-01-01
Background Nabidae, a family of predatory heteropterans, includes two subfamilies and five tribes. We previously reported the complete mitogenome of Alloeorhynchus bakeri, a representative of the tribe Prostemmatini in the subfamily Prostemmatinae. To gain a better understanding of architecture and evolution of mitogenome in Nabidae, mitogenomes of five species representing two tribes (Gorpini and Nabini) in the subfamily Nabinae were sequenced, and a comparative mitogenomic analysis of three nabid tribes in two subfamilies was carried out. Methodology/Principal Findings Nabid mitogenomes share a similar nucleotide composition and base bias, except for the control region, where differences are observed at the subfamily level. In addition, the pattern of codon usage is influenced by the GC content and consistent with the standard invertebrate mitochondrial genetic code and the preference for A+T-rich codons. The comparison among orthologous protein-coding genes shows that different genes have been subject to different rates of molecular evolution correlated with the GC content. The stems and anticodon loops of tRNAs are extremely conserved, and the nucleotide substitutions are largely restricted to TψC and DHU loops and extra arms, with insertion-deletion polymorphisms. Comparative analysis shows similar rates of substitution between the two rRNAs. Long non-coding regions are observed in most Gorpini and Nabini mtDNAs in-between trnI-trnQ and/or trnS2-nad1. The lone exception, Nabis apicalis, however, has lost three tRNAs. Overall, phylogenetic analysis using mitogenomic data is consistent with phylogenies constructed mainly form morphological traits. Conclusions/Significance This comparative mitogenomic analysis sheds light on the architecture and evolution of mitogenomes in the family Nabidae. Nucleotide diversity and mitogenomic traits are phylogenetically informative at subfamily level. Furthermore, inclusion of a broader range of samples representing various taxonomic levels is critical for the understanding of mitogenomic evolution in damsel bugs. PMID:23029320
Uncoupling protein 1 binds one nucleotide per monomer and is stabilized by tightly bound cardiolipin
Lee, Yang; Willers, Chrissie; Kunji, Edmund R. S.; Crichton, Paul G.
2015-01-01
Uncoupling protein 1 (UCP1) catalyzes fatty acid-activated, purine nucleotide-sensitive proton leak across the mitochondrial inner membrane of brown adipose tissue to produce heat, and could help combat obesity and metabolic disease in humans. Studies over the last 30 years conclude that the protein is a dimer, binding one nucleotide molecule per two proteins, and unlike the related mitochondrial ADP/ATP carrier, does not bind cardiolipin. Here, we have developed novel methods to purify milligram amounts of UCP1 from native sources by using covalent chromatography that, unlike past methods, allows the protein to be prepared in defined conditions, free of excess detergent and lipid. Assessment of purified preparations by TLC reveal that UCP1 retains tightly bound cardiolipin, with a lipid phosphorus content equating to three molecules per protein, like the ADP/ATP carrier. Cardiolipin stabilizes UCP1, as demonstrated by reconstitution experiments and thermostability assays, indicating that the lipid has an integral role in the functioning of the protein, similar to other mitochondrial carriers. Furthermore, we find that UCP1 is not dimeric but monomeric, as indicated by size exclusion analysis, and has a ligand titration profile in isothermal calorimetric measurements that clearly shows that one nucleotide binds per monomer. These findings reveal the fundamental composition of UCP1, which is essential for understanding the mechanism of the protein. Our assessment of the properties of UCP1 indicate that it is not unique among mitochondrial carriers and so is likely to use a common exchange mechanism in its primary function in brown adipose tissue mitochondria. PMID:26038550
Blanquart, Samuel; Lartillot, Nicolas
2006-11-01
Variations of nucleotidic composition affect phylogenetic inference conducted under stationary models of evolution. In particular, they may cause unrelated taxa sharing similar base composition to be grouped together in the resulting phylogeny. To address this problem, we developed a nonstationary and nonhomogeneous model accounting for compositional biases. Unlike previous nonstationary models, which are branchwise, that is, assume that base composition only changes at the nodes of the tree, in our model, the process of compositional drift is totally uncoupled from the speciation events. In addition, the total number of events of compositional drift distributed across the tree is directly inferred from the data. We implemented the method in a Bayesian framework, relying on Markov Chain Monte Carlo algorithms, and applied it to several nucleotidic data sets. In most cases, the stationarity assumption was rejected in favor of our nonstationary model. In addition, we show that our method is able to resolve a well-known artifact. By Bayes factor evaluation, we compared our model with 2 previously developed nonstationary models. We show that the coupling between speciations and compositional shifts inherent to branchwise models may lead to an overparameterization, resulting in a lesser fit. In some cases, this leads to incorrect conclusions, concerning the nature of the compositional biases. In contrast, our compound model more flexibly adapts its effective number of parameters to the data sets under investigation. Altogether, our results show that accounting for nonstationary sequence evolution may require more elaborate and more flexible models than those currently used.
Simple Sequence Repeats in Escherichia coli: Abundance, Distribution, Composition, and Polymorphism
Gur-Arie, Riva; Cohen, Cyril J.; Eitan, Yuval; Shelef, Leora; Hallerman, Eric M.; Kashi, Yechezkel
2000-01-01
Computer-based genome-wide screening of the DNA sequence of Escherichia coli strain K12 revealed tens of thousands of tandem simple sequence repeat (SSR) tracts, with motifs ranging from 1 to 6 nucleotides. SSRs were well distributed throughout the genome. Mononucleotide SSRs were over-represented in noncoding regions and under-represented in open reading frames (ORFs). Nucleotide composition of mono- and dinucleotide SSRs, both in ORFs and in noncoding regions, differed from that of the genomic region in which they occurred, with 93% of all mononucleotide SSRs proving to be of A or T. Computer-based analysis of the fine position of every SSR locus in the noncoding portion of the genome relative to downstream ORFs showed SSRs located in areas that could affect gene regulation. DNA sequences at 14 arbitrarily chosen SSR tracts were compared among E. coli strains. Polymorphisms of SSR copy number were observed at four of seven mononucleotide SSR tracts screened, with all polymorphisms occurring in noncoding regions. SSR polymorphism could prove important as a genome-wide source of variation, both for practical applications (including rapid detection, strain identification, and detection of loci affecting key phenotypes) and for evolutionary adaptation of microbes.[The sequence data described in this paper have been submitted to the GenBank data library under accession numbers AF209020–209030 and AF209508–209518.] PMID:10645951
The archaebacterial origin of eukaryotes.
Cox, Cymon J; Foster, Peter G; Hirt, Robert P; Harris, Simon R; Embley, T Martin
2008-12-23
The origin of the eukaryotic genetic apparatus is thought to be central to understanding the evolution of the eukaryotic cell. Disagreement about the source of the relevant genes has spawned competing hypotheses for the origins of the eukaryote nuclear lineage. The iconic rooted 3-domains tree of life shows eukaryotes and archaebacteria as separate groups that share a common ancestor to the exclusion of eubacteria. By contrast, the eocyte hypothesis has eukaryotes originating within the archaebacteria and sharing a common ancestor with a particular group called the Crenarchaeota or eocytes. Here, we have investigated the relative support for each hypothesis from analysis of 53 genes spanning the 3 domains, including essential components of the eukaryotic nucleic acid replication, transcription, and translation apparatus. As an important component of our analysis, we investigated the fit between model and data with respect to composition. Compositional heterogeneity is a pervasive problem for reconstruction of ancient relationships, which, if ignored, can produce an incorrect tree with strong support. To mitigate its effects, we used phylogenetic models that allow for changing nucleotide or amino acid compositions over the tree and data. Our analyses favor a topology that supports the eocyte hypothesis rather than archaebacterial monophyly and the 3-domains tree of life.
Organizational heterogeneity of vertebrate genomes.
Frenkel, Svetlana; Kirzhner, Valery; Korol, Abraham
2012-01-01
Genomes of higher eukaryotes are mosaics of segments with various structural, functional, and evolutionary properties. The availability of whole-genome sequences allows the investigation of their structure as "texts" using different statistical and computational methods. One such method, referred to as Compositional Spectra (CS) analysis, is based on scoring the occurrences of fixed-length oligonucleotides (k-mers) in the target DNA sequence. CS analysis allows generating species- or region-specific characteristics of the genome, regardless of their length and the presence of coding DNA. In this study, we consider the heterogeneity of vertebrate genomes as a joint effect of regional variation in sequence organization superimposed on the differences in nucleotide composition. We estimated compositional and organizational heterogeneity of genome and chromosome sequences separately and found that both heterogeneity types vary widely among genomes as well as among chromosomes in all investigated taxonomic groups. The high correspondence of heterogeneity scores obtained on three genome fractions, coding, repetitive, and the remaining part of the noncoding DNA (the genome dark matter--GDM) allows the assumption that CS-heterogeneity may have functional relevance to genome regulation. Of special interest for such interpretation is the fact that natural GDM sequences display the highest deviation from the corresponding reshuffled sequences.
Composition for nucleic acid sequencing
Korlach, Jonas [Ithaca, NY; Webb, Watt W [Ithaca, NY; Levene, Michael [Ithaca, NY; Turner, Stephen [Ithaca, NY; Craighead, Harold G [Ithaca, NY; Foquet, Mathieu [Ithaca, NY
2008-08-26
The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.
Alnazawi, Mohamed; Altaher, Abdallah; Kandeel, Mahmoud
2017-01-01
Middle East Respiratory Syndrome Coronavirus (MERS CoV) is a new emerging viral disease characterized by high fatality rate. Understanding MERS CoV genetic aspects and codon usage pattern is important to understand MERS CoV survival, adaptation, evolution, resistance to innate immunity, and help in finding the unique aspects of the virus for future drug discovery experiments. In this work, we provide comprehensive analysis of 238 MERS CoV full genomes comprised of human (hMERS) and camel (cMERS) isolates of the virus. MERS CoV genome shaping seems to be under compositional and mutational bias, as revealed by preference of A/T over G/C nucleotides, preferred codons, nucleotides at the third position of codons (NT3s), relative synonymous codon usage, hydropathicity (Gravy), and aromaticity (Aromo) indices. Effective number of codons (ENc) analysis reveals a general slight codon usage bias. Codon adaptation index reveals incomplete adaptation to host environment. MERS CoV showed high ability to resist the innate immune response by showing lower CpG frequencies. Neutrality evolution analysis revealed a more significant role of mutation pressure in cMERS over hMERS. Correspondence analysis revealed that MERS CoV genomes have three genetic clusters, which were distinct in their codon usage, host, and geographic distribution. Additionally, virtual screening and binding experiments were able to identify three new virus-encoded helicase binding compounds. These compounds can be used for further optimization of inhibitors.
MAGGIO, R; SIEKEVITZ, P; PALADE, G E
1963-08-01
This paper describes the subfractionation of nuclei isolated from guinea pig liver by the procedure presented in the first article of the series (8). Centrifugation in a density gradient system of nuclear fractions disrupted by sonication permits the isolation of the following subfractions: (a) a nucleolar subfraction which consists mainly of nucleoli surrounded by a variable amount of nucleolus-associated chromatin and contaminated by chromatin blocks derived primarily from von Kupffer cell nuclei; (b) and (c), two nucleoplasmic subfractions (I and II) which consist mainly of chromatin threads in a coarser (I) or finer (II) degree of fragmentation. The protein, RNA, and DNA content of these subfractions was determined, and their RNA's characterized in terms of NaCl-solubility, nucleotide composition, and in vivo nucleotide turnover, using inorganic (32)P as a marker. The results indicate that there are at least three types of RNA in the nucleus (one in the nucleolus and two in the nucleoplasm or chromatin), which differ from one another in NaCl-solubility, nucleotide composition, turnover, and possibly sequence. Possible relations among these RNA's and those of the cytoplasm are discussed.
Guilfoyle, Amy P; Deshpande, Chandrika N; Schenk, Gerhard; Maher, Megan J; Jormakka, Mika
2014-12-12
GDP release from GTPases is usually extremely slow and is in general assisted by external factors, such as association with guanine exchange factors or membrane-embedded GPCRs (G protein-coupled receptors), which accelerate the release of GDP by several orders of magnitude. Intrinsic factors can also play a significant role; a single amino acid substitution in one of the guanine nucleotide recognition motifs, G5, results in a drastically altered GDP release rate, indicating that the sequence composition of this motif plays an important role in spontaneous GDP release. In the present study, we used the GTPase domain from EcNFeoB (Escherichia coli FeoB) as a model and applied biochemical and structural approaches to evaluate the role of all the individual residues in the G5 loop. Our study confirms that several of the residues in the G5 motif have an important role in the intrinsic affinity and release of GDP. In particular, a T151A mutant (third residue of the G5 loop) leads to a reduced nucleotide affinity and provokes a drastically accelerated dissociation of GDP.
Comparative Mitogenomic Analysis of Species Representing Six Subfamilies in the Family Tenebrionidae
Zhang, Hong-Li; Liu, Bing-Bing; Wang, Xiao-Yang; Han, Zhi-Ping; Zhang, Dong-Xu; Su, Cai-Na
2016-01-01
To better understand the architecture and evolution of the mitochondrial genome (mitogenome), mitogenomes of ten specimens representing six subfamilies in Tenebrionidae were selected, and comparative analysis of these mitogenomes was carried out in this study. Ten mitogenomes in this family share a similar gene composition, gene order, nucleotide composition, and codon usage. In addition, our results show that nucleotide bias was strongly influenced by the preference of codon usage for A/T rich codons which significantly correlated with the G + C content of protein coding genes (PCGs). Evolutionary rate analyses reveal that all PCGs have been subjected to a purifying selection, whereas 13 PCGs displayed different evolution rates, among which ATPase subunit 8 (ATP8) showed the highest evolutionary rate. We inferred the secondary structure for all RNA genes of Tenebrio molitor (Te2) and used this as the basis for comparison with the same genes from other Tenebrionidae mitogenomes. Some conserved helices (stems) and loops of RNA structures were found in different domains of ribosomal RNAs (rRNAs) and the cloverleaf structure of transfer RNAs (tRNAs). With regard to the AT-rich region, we analyzed tandem repeat sequences located in this region and identified some essential elements including T stretches, the consensus motif at the flanking regions of T stretch, and the secondary structure formed by the motif at the 3′ end of T stretch in major strand, which are highly conserved in these species. Furthermore, phylogenetic analyses using mitogenomic data strongly support the relationships among six subfamilies: ((Tenebrionidae incertae sedis + (Diaperinae + Tenebrioninae)) + (Pimeliinae + Lagriinae)), which is consistent with phylogenetic results based on morphological traits. PMID:27258256
Kochanowski, N; Blanchard, F; Cacan, R; Chirat, F; Guedon, E; Marc, A; Goergen, J-L
2006-01-15
Analysis of intracellular nucleotide and nucleotide sugar contents is essential in studying protein glycosylation of mammalian cells. Nucleotides and nucleotide sugars are the donor substrates of glycosyltransferases, and nucleotides are involved in cellular energy metabolism and its regulation. A sensitive and reproducible ion-pair reverse-phase high-performance liquid chromatography (RP-HPLC) method has been developed, allowing the direct and simultaneous detection and quantification of some essential nucleotides and nucleotide sugars. After a perchloric acid extraction, 13 molecules (8 nucleotides and 5 nucleotide sugars) were separated, including activated sugars such as UDP-glucose, UDP-galactose, GDP-mannose, UDP-N-acetylglucosamine, and UDP-N-acetylgalactosamine. To validate the analytical parameters, the reproducibility, linearity of calibration curves, detection limits, and recovery were evaluated for standard mixtures and cell extracts. The developed method is capable of resolving picomolar quantities of nucleotides and nucleotide sugars in a single chromatographic run. The HPLC method was then applied to quantify intracellular levels of nucleotides and nucleotide sugars of Chinese hamster ovary (CHO) cells cultivated in a bioreactor batch process. Evolutions of the titers of nucleotides and nucleotide sugars during the batch process are discussed.
Palanisamy, Navaneethan; Akaberi, Dario; Lennerstrand, Johan; Lundkvist, Åke
2018-05-10
Alkhumra hemorrhagic fever virus (AHFV), a relatively new member of the Flaviviruses, was discovered in Saudi Arabia 23 years ago. AHFV is classified in the tick-borne encephalitis virus serocomplex, along with the Kyasanur forest disease virus (KFDV) and tick-borne encephalitis virus (TBEV). Currently, very little is known about the pathologies of AHFV. In this study, using the available genome information of AHFV, KFDV and TBEV, we have predicted and compared the following aspects of these viruses: evolution, nucleotide and protein compositions, recombination, codon frequency, substitution rate, N- and O-glycosylation sites, signal peptide and cleavage site, transmembrane region, secondary structure of 5' and 3' UTRs and RNA-RNA interactions. Additionally, we have modeled the 3D protease and RNA-dependent RNA polymerase structures for AHFV, KFDV and TBEV. Recombination analysis showed no evidence of recombination in the AHFV genome with that of either KFDV or TBEV, although single break point analysis showed that nucleotide position 7399 (in the NS4B) is a breakpoint location. AHFV, KFDV and TBEV are very similar in terms of codon frequency, the number of transmembrane regions, properties of the polyprotein, RNA-RNA interaction sequences, NS3 protease and NS5 polymerase structures and 5' UTR structure. Using genome sequences, we showed the similarities between these closely- related viruses on several different areas.
Database of amino acid-nucleotide contacts in contacts in DNA-homeodomain protein
NASA Astrophysics Data System (ADS)
Grokhlina, T. I.; Zrelov, P. V.; Ivanov, V. V.; Polozov, R. V.; Chirgadze, Yu. N.; Sivozhelezov, V. S.
2013-09-01
The analysis of amino acid-nucleotide contacts in interfaces of the protein-DNA complexes, intended to find consistencies in the protein-DNA recognition, is a complex problem that requires an analysis of the physicochemical characteristics of these contacts and the positions of the participating amino acids and nucleotides in the chains of the protein and the DNA, respectively, as well as conservatism of these contacts. Thus, those heterogeneous data should be systematized. For this purpose we have developed a database of amino acid-nucleotide contacts ANTPC (Amino acid Nucleotide Type Position Conservation) following the archetypal example of the proteins in the homeodomain family. We show that it can be used to compare and classify the interfaces of the protein-DNA complexes.
Gasowska, A
2005-08-01
The interactions between pyrimidine nucleotides: cytidine-5'-diphosphate (CDP) and cytidine-5'-triphosphate (CTP) and Cu(II) ions, spermine (Spm) and 1,11-diamino-4,8-diazaundecane (3,3,3-tet) have been studied. The composition and stability constants of the complexes formed have been determined by means of the potentiometric method, while the centres of interactions in the ligands have been identified by the spectral methods (UV-Vis, Ultraviolet and Visible spectroscopy; EPR, electron spin resonance; NMR). In the systems without metal, formation of the molecular complexes nucleotide-polyamine with the interaction centres at the endocyclic nitrogen atom of purine ring N3, the oxygen atoms of the phosphate group from the nucleotide and protonated nitrogen atoms of the polyamine have been detected. Significant differences have been found in the metallation between the systems with Spm and with 3,3,3-tet. In the systems with spermine, mainly protonated species are formed with the phosphate group of the nucleotide and deprotonated nitrogen atoms of the polyamine making the coordination centres, while the donor nitrogen atom of the nucleotide N3 is involved in the intramolecular interligand interactions, additionally stabilising the complex. In the systems with 3,3,3-tet, the MLL' type species are formed in which the oxygen atoms of the phosphate group and nitrogen atoms of the polyamine are involved in metallation, whereas the N3 atom from the pyrimidine ring of the nucleotide is located outside the inner coordination sphere of copper ion. The main centre of Cu(II) interaction in the nucleotide, both in the system with Spm and 3,3,3-tet is the phosphate group of the nucleotide.
Montes-Pérez, Rubén C; García, Adán W Echeverría; Castro, Jorge Zavala; Gamboa, Militza G Alfaro
2006-09-01
The objective of this work was to estimate the nucleotidic variation between two groups of tepezcuintles (Agouti paca) from the states of Campeche and Quintana Roo, Mexico and within members of each group. Blood samples were collected from eleven A. paca kept in captivity. DNA from leukocytic cells was used for Ramdom Amplification of DNA Polimorphism (RAPD). The primers three 5'-d(GTAGACCCGT)- 3' and six 5'-d(CCCGTCAGCA)- 3' were selected from de Amersham kit (Ready.To.Go. RAPD Analysis Beads, Amersham Pharmacia Biotech), because they produced an adequate number of bands. The electrophoretic pattern of bands obtained was analyzed using software for phylogenetic analysis based on the UPGMA method, to estimate the units of nucleotidic variation. The phylogenetic tree obtained with primer three reveals a dicotomic grouping between the animals from both states in the Yucatan Peninsula showing a divergent value of 1.983 nucleotides per hundred. Animals from Quintana Roo show a grouping with primer six; an additional grouping was observed with animals from Campeche. Nucleotidic variation between both groups was 2.118 nucleotides per hundred. The nucleotidic variation for the two primers within the groups from both states, showed fluctuating values from 0.46 to 1.68 nucleotides per hundred, which indicates that nucleotidic variation between the two groups of animals is around two nucleotides per hundred and, within the groups, less than 1.7 nucleotides per hundred.
Yu-Han, Qian; Hai-Yan, Wu; Xiao-Yu, Ji; Wei-Wei, Yu; Yu-Zhou, Du
2014-01-01
This study determined the mitochondrial genome sequence of the stonefly, Kamimuria wangi. In order to investigate the relatedness of stonefly to other members of Neoptera, a phylogenetic analysis was undertaken based on 13 protein-coding genes of mitochondrial genomes in 13 representative insects. The mitochondrial genome of the stonefly is a circular molecule consisting of 16,179 nucleotides and contains the 37 genes typically found in other insects. A 10-bp poly-T stretch was observed in the A+T-rich region of the K. wangi mitochondrial genome. Downstream of the poly-T stretch, two regions were located with potential ability to form stem-loop structures; these were designated stem-loop 1 (positions 15848–15651) and stem-loop 2 (15965–15998). The arrangement of genes and nucleotide composition of the K. wangi mitogenome are similar to those in Pteronarcys princeps, suggesting a conserved genome evolution within the Plecoptera. Phylogenetic analysis using maximum likelihood and Bayesian inference of 13 protein-coding genes supported a novel relationship between the Plecoptera and Ephemeroptera. The results contradict the existence of a monophyletic Plectoptera and Plecoptera as sister taxa to Embiidina, and thus requires further analyses with additional mitogenome sampling at the base of the Neoptera. PMID:24466028
Yu-Han, Qian; Hai-Yan, Wu; Xiao-Yu, Ji; Wei-Wei, Yu; Yu-Zhou, Du
2014-01-01
This study determined the mitochondrial genome sequence of the stonefly, Kamimuria wangi. In order to investigate the relatedness of stonefly to other members of Neoptera, a phylogenetic analysis was undertaken based on 13 protein-coding genes of mitochondrial genomes in 13 representative insects. The mitochondrial genome of the stonefly is a circular molecule consisting of 16,179 nucleotides and contains the 37 genes typically found in other insects. A 10-bp poly-T stretch was observed in the A+T-rich region of the K. wangi mitochondrial genome. Downstream of the poly-T stretch, two regions were located with potential ability to form stem-loop structures; these were designated stem-loop 1 (positions 15848-15651) and stem-loop 2 (15965-15998). The arrangement of genes and nucleotide composition of the K. wangi mitogenome are similar to those in Pteronarcys princeps, suggesting a conserved genome evolution within the Plecoptera. Phylogenetic analysis using maximum likelihood and Bayesian inference of 13 protein-coding genes supported a novel relationship between the Plecoptera and Ephemeroptera. The results contradict the existence of a monophyletic Plectoptera and Plecoptera as sister taxa to Embiidina, and thus requires further analyses with additional mitogenome sampling at the base of the Neoptera.
Bahnsen, U; Oosting, P; Swaab, D F; Nahke, P; Richter, D; Schmale, H
1992-01-01
Familial neurohypophyseal diabetes insipidus in humans is a rare disease transmitted as an autosomal dominant trait. Affected individuals have very low or undetectable levels of circulating vasopressin and suffer from polydipsia and polyuria. An obvious candidate gene for the disease is the vasopressin-neurophysin (AVP-NP) precursor gene on human chromosome 20. The 2 kb gene with three exons encodes a composite precursor protein consisting of the neuropeptide vasopressin and two associated proteins, neurophysin and a glycopeptide. Cloning and nucleotide sequence analysis of both alleles of the AVP-NP gene present in a Dutch ADNDI family reveals a point mutation in one allele of the affected family members. Comparison of the nucleotide sequences shows a G----T transversion within the neurophysin-encoding exon B. This missense mutation converts a highly conserved glycine (Gly17 of neurophysin) to a valine residue. RFLP analysis of six related family members indicates cosegregation of the mutant allele with the DI phenotype. The mutation is not present in 96 chromosomes of an unrelated control group. These data suggest that a single amino acid exchange within a highly conserved domain of the human vasopressin-associated neurophysin is the primary cause of one form of ADNDI. Images PMID:1740104
Panwar, Bharat; Raghava, Gajendra P S
2015-04-01
The RNA-protein interactions play a diverse role in the cells, thus identification of RNA-protein interface is essential for the biologist to understand their function. In the past, several methods have been developed for predicting RNA interacting residues in proteins, but limited efforts have been made for the identification of protein-interacting nucleotides in RNAs. In order to discriminate protein-interacting and non-interacting nucleotides, we used various classifiers (NaiveBayes, NaiveBayesMultinomial, BayesNet, ComplementNaiveBayes, MultilayerPerceptron, J48, SMO, RandomForest, SMO and SVM(light)) for prediction model development using various features and achieved highest 83.92% sensitivity, 84.82 specificity, 84.62% accuracy and 0.62 Matthew's correlation coefficient by SVM(light) based models. We observed that certain tri-nucleotides like ACA, ACC, AGA, CAC, CCA, GAG, UGA, and UUU preferred in protein-interaction. All the models have been developed using a non-redundant dataset and are evaluated using five-fold cross validation technique. A web-server called RNApin has been developed for the scientific community (http://crdd.osdd.net/raghava/rnapin/). Copyright © 2015 Elsevier Inc. All rights reserved.
Genetic Diversity and Phylogenetic Evolution of Tibetan Sheep Based on mtDNA D-Loop Sequences
Yue, Yaojing; Guo, Xian; Guo, Tingting; Chu, Min; Wang, Fan; Han, Jilong; Feng, Ruilin; Sun, Xiaoping; Niu, Chune; Yang, Bohui; Guo, Jian; Yuan, Chao
2016-01-01
The molecular and population genetic evidence of the phylogenetic status of the Tibetan sheep (Ovis aries) is not well understood, and little is known about this species’ genetic diversity. This knowledge gap is partly due to the difficulty of sample collection. This is the first work to address this question. Here, the genetic diversity and phylogenetic relationship of 636 individual Tibetan sheep from fifteen populations were assessed using 642 complete sequences of the mitochondrial DNA D-loop. Samples were collected from the Qinghai-Tibetan Plateau area in China, and reference data were obtained from the six reference breed sequences available in GenBank. The length of the sequences varied considerably, between 1031 and 1259 bp. The haplotype diversity and nucleotide diversity were 0.992±0.010 and 0.019±0.001, respectively. The average number of nucleotide differences was 19.635. The mean nucleotide composition of the 350 haplotypes was 32.961% A, 29.708% T, 22.892% C, 14.439% G, 62.669% A+T, and 37.331% G+C. Phylogenetic analysis showed that all four previously defined haplogroups (A, B, C, and D) were found in the 636 individuals of the fifteen Tibetan sheep populations but that only the D haplogroup was found in Linzhou sheep. Further, the clustering analysis divided the fifteen Tibetan sheep populations into at least two clusters. The estimation of the demographic parameters from the mismatch analyses showed that haplogroups A, B, and C had at least one demographic expansion in Tibetan sheep. These results contribute to the knowledge of Tibetan sheep populations and will help inform future conservation programs about the Tibetan sheep native to the Qinghai-Tibetan Plateau. PMID:27463976
Li, Fei; Hullar, Meredith A J; Schwarz, Yvonne; Lampe, Johanna W
2009-09-01
In the human gut, commensal bacteria metabolize food components that typically serve as energy sources. These components have the potential to influence gut bacterial community composition. Cruciferous vegetables, such as broccoli and cabbage, contain distinctive compounds that can be utilized by gut bacteria. For example, glucosinolates can be hydrolyzed by certain bacteria, and dietary fibers can be fermented by a range of species. We hypothesized that cruciferous vegetable consumption would alter growth of certain bacteria, thereby altering bacterial community composition. We tested this hypothesis in a randomized, crossover, controlled feeding study. Fecal samples were collected from 17 participants at the end of 2 14-d intake periods: a low-phytochemical, low-fiber basal diet (i.e. refined grains without fruits or vegetables) and a high ("double") cruciferous vegetable diet [basal diet + 14 g cruciferous vegetables/(kg body weightd)]. Fecal bacterial composition was analyzed by the terminal restriction fragment length polymorphism (tRFLP) method using the bacterial 16S ribosomal RNA gene and nucleotide sequencing. Using blocked multi-response permutation procedures analysis, we found that overall bacterial community composition differed between the 2 consumption periods (delta = 0.603; P = 0.011). The bacterial community response to cruciferous vegetables was individual-specific, as revealed by nonmetric multidimensional scaling ordination analysis. Specific tRFLP fragments that characterized each of the diets were identified using indicator species analysis. Putative species corresponding to these fragments were identified through gene sequencing as Eubacterium hallii, Phascolarctobacterium faecium, Burkholderiales spp., Alistipes putredinis, and Eggerthella spp. In conclusion, human gut bacterial community composition was altered by cruciferous vegetable consumption, which could ultimately influence gut metabolism of bioactive food components and host exposure to these compounds.
Chen, Yaowen; Li, Zongcheng; Hu, Shuofeng; Zhang, Jian; Wu, Jiaqi; Shao, Ningsheng; Bo, Xiaochen; Ni, Ming; Ying, Xiaomin
2017-02-01
Gut microbes play a critical role in human health and disease, and researchers have begun to characterize their genomes, the so-called gut metagenome. Thus far, metagenomics studies have focused on genus- or species-level composition and microbial gene sets, while strain-level composition and single-nucleotide polymorphism (SNP) have been overlooked. The gut metagenomes of type 2 diabetes (T2D) patients have been found to be enriched with butyrate-producing bacteria and sulfate reduction functions. However, it is not known whether the gut metagenomes of T2D patients have characteristic strain patterns or SNP distributions. We downloaded public gut metagenome datasets from 170 T2D patients and 174 healthy controls and performed a systematic comparative analysis of their metagenome SNPs. We found that Bacteroides coprocola, whose relative abundance did not differ between the groups, had a characteristic distribution of SNPs in the T2D patient group. We identified 65 genes, all in B. coprocola, that had remarkably different enrichment of SNPs. The first and sixth ranked genes encode glycosyl hydrolases (GenBank accession EDU99824.1 and EDV02301.1). Interestingly, alpha-glucosidase, which is also a glycosyl hydrolase located in the intestine, is an important drug target of T2D. These results suggest that different strains of B. coprocola may have different roles in human gut and a specific set of B. coprocola strains are correlated with T2D.
Krishnan, Neeraja M; Seligmann, Hervé; Stewart, Caro-Beth; De Koning, A P Jason; Pollock, David D
2004-10-01
Reconstruction of ancestral DNA and amino acid sequences is an important means of inferring information about past evolutionary events. Such reconstructions suggest changes in molecular function and evolutionary processes over the course of evolution and are used to infer adaptation and convergence. Maximum likelihood (ML) is generally thought to provide relatively accurate reconstructed sequences compared to parsimony, but both methods lead to the inference of multiple directional changes in nucleotide frequencies in primate mitochondrial DNA (mtDNA). To better understand this surprising result, as well as to better understand how parsimony and ML differ, we constructed a series of computationally simple "conditional pathway" methods that differed in the number of substitutions allowed per site along each branch, and we also evaluated the entire Bayesian posterior frequency distribution of reconstructed ancestral states. We analyzed primate mitochondrial cytochrome b (Cyt-b) and cytochrome oxidase subunit I (COI) genes and found that ML reconstructs ancestral frequencies that are often more different from tip sequences than are parsimony reconstructions. In contrast, frequency reconstructions based on the posterior ensemble more closely resemble extant nucleotide frequencies. Simulations indicate that these differences in ancestral sequence inference are probably due to deterministic bias caused by high uncertainty in the optimization-based ancestral reconstruction methods (parsimony, ML, Bayesian maximum a posteriori). In contrast, ancestral nucleotide frequencies based on an average of the Bayesian set of credible ancestral sequences are much less biased. The methods involving simpler conditional pathway calculations have slightly reduced likelihood values compared to full likelihood calculations, but they can provide fairly unbiased nucleotide reconstructions and may be useful in more complex phylogenetic analyses than considered here due to their speed and flexibility. To determine whether biased reconstructions using optimization methods might affect inferences of functional properties, ancestral primate mitochondrial tRNA sequences were inferred and helix-forming propensities for conserved pairs were evaluated in silico. For ambiguously reconstructed nucleotides at sites with high base composition variability, ancestral tRNA sequences from Bayesian analyses were more compatible with canonical base pairing than were those inferred by other methods. Thus, nucleotide bias in reconstructed sequences apparently can lead to serious bias and inaccuracies in functional predictions.
Goncearenco, Alexander; Ma, Bin-Guang; Berezovsky, Igor N
2014-03-01
DNA, RNA and proteins are major biological macromolecules that coevolve and adapt to environments as components of one highly interconnected system. We explore here sequence/structure determinants of mechanisms of adaptation of these molecules, links between them, and results of their mutual evolution. We complemented statistical analysis of genomic and proteomic sequences with folding simulations of RNA molecules, unraveling causal relations between compositional and sequence biases reflecting molecular adaptation on DNA, RNA and protein levels. We found many compositional peculiarities related to environmental adaptation and the life style. Specifically, thermal adaptation of protein-coding sequences in Archaea is characterized by a stronger codon bias than in Bacteria. Guanine and cytosine load in the third codon position is important for supporting the aerobic life style, and it is highly pronounced in Bacteria. The third codon position also provides a tradeoff between arginine and lysine, which are favorable for thermal adaptation and aerobicity, respectively. Dinucleotide composition provides stability of nucleic acids via strong base-stacking in ApG dinucleotides. In relation to coevolution of nucleic acids and proteins, thermostability-related demands on the amino acid composition affect the nucleotide content in the second codon position in Archaea.
Goncearenco, Alexander; Ma, Bin-Guang; Berezovsky, Igor N.
2014-01-01
DNA, RNA and proteins are major biological macromolecules that coevolve and adapt to environments as components of one highly interconnected system. We explore here sequence/structure determinants of mechanisms of adaptation of these molecules, links between them, and results of their mutual evolution. We complemented statistical analysis of genomic and proteomic sequences with folding simulations of RNA molecules, unraveling causal relations between compositional and sequence biases reflecting molecular adaptation on DNA, RNA and protein levels. We found many compositional peculiarities related to environmental adaptation and the life style. Specifically, thermal adaptation of protein-coding sequences in Archaea is characterized by a stronger codon bias than in Bacteria. Guanine and cytosine load in the third codon position is important for supporting the aerobic life style, and it is highly pronounced in Bacteria. The third codon position also provides a tradeoff between arginine and lysine, which are favorable for thermal adaptation and aerobicity, respectively. Dinucleotide composition provides stability of nucleic acids via strong base-stacking in ApG dinucleotides. In relation to coevolution of nucleic acids and proteins, thermostability-related demands on the amino acid composition affect the nucleotide content in the second codon position in Archaea. PMID:24371267
Functional analysis of regulatory single-nucleotide polymorphisms.
Pampín, Sandra; Rodríguez-Rey, José C
2007-04-01
The identification of regulatory polymorphisms has become a key problem in human genetics. In the past few years there has been a conceptual change in the way in which regulatory single-nucleotide polymorphisms are studied. We revise the new approaches and discuss how gene expression studies can contribute to a better knowledge of the genetics of common diseases. New techniques for the association of single-nucleotide polymorphisms with changes in gene expression have been recently developed. This, together with a more comprehensive use of the old in-vitro methods, has produced a great amount of genetic information. When added to current databases, it will help to design better tools for the detection of regulatory single-nucleotide polymorphisms. The identification of functional regulatory single-nucleotide polymorphisms cannot be done by the simple inspection of DNA sequence. In-vivo techniques, based on primer-extension, and the more recently developed 'haploChIP' allow the association of gene variants to changes in gene expression. Gene expression analysis by conventional in-vitro techniques is the only way to identify the functional consequences of regulatory single-nucleotide polymorphisms. The amount of information produced in the last few years will help to refine the tools for the future analysis of regulatory gene variants.
Nishizawa, M; Nishizawa, K
2000-10-01
The tendency for repetitiveness of nucleotides in DNA sequences has been reported for a variety of organisms. We show that the tendency for repetitive use of amino acids is widespread and is observed even for segments conserved between human and Drosophila melanogaster at the level of >50% amino acid identity. This indicates that repetitiveness influences not only the weakly constrained segments but also those sequence segments conserved among phyla. Not only glutamine (Q) but also many of the 20 amino acids show a comparable level of repetitiveness. Repetitiveness in bases at codon position 3 is stronger for human than for D.melanogaster, whereas local repetitiveness in intron sequences is similar between the two organisms. While genes for immune system-specific proteins, but not ancient human genes (i.e. human homologs of Escherichia coli genes), have repetitiveness at codon bases 1 and 2, repetitiveness at codon base 3 for these groups is similar, suggesting that the human genome has at least two mechanisms generating local repetitiveness. Neither amino acid nor nucleotide repetitiveness is observed beyond the exon boundary, denying the possibility that such repetitiveness could mainly stem from natural selection on mRNA or protein sequences. Analyses of mammalian sequence alignments show that while the 'between gene' GC content heterogeneity, which is linked to 'isochores', is a principal factor associated with the bias in substitution patterns in human, 'within gene' heterogeneity in nucleotide composition is also associated with such bias on a more local scale. The relationship amongst the various types of repetitiveness is discussed.
Nishizawa, Manami; Nishizawa, Kazuhisa
2000-01-01
The tendency for repetitiveness of nucleotides in DNA sequences has been reported for a variety of organisms. We show that the tendency for repetitive use of amino acids is widespread and is observed even for segments conserved between human and Drosophila melanogaster at the level of >50% amino acid identity. This indicates that repetitiveness influences not only the weakly constrained segments but also those sequence segments conserved among phyla. Not only glutamine (Q) but also many of the 20 amino acids show a comparable level of repetitiveness. Repetitiveness in bases at codon position 3 is stronger for human than for D.melanogaster, whereas local repetitiveness in intron sequences is similar between the two organisms. While genes for immune system-specific proteins, but not ancient human genes (i.e. human homologs of Escherichia coli genes), have repetitiveness at codon bases 1 and 2, repetitiveness at codon base 3 for these groups is similar, suggesting that the human genome has at least two mechanisms generating local repetitiveness. Neither amino acid nor nucleotide repetitiveness is observed beyond the exon boundary, denying the possibility that such repetitiveness could mainly stem from natural selection on mRNA or protein sequences. Analyses of mammalian sequence alignments show that while the ‘between gene’ GC content heterogeneity, which is linked to ‘isochores’, is a principal factor associated with the bias in substitution patterns in human, ‘within gene’ heterogeneity in nucleotide composition is also associated with such bias on a more local scale. The relationship amongst the various types of repetitiveness is discussed. PMID:11000273
Wang, Aishuai; Sun, Yuena; Wu, Changwen
2016-11-01
The complete mitochondrial genome of the Cheilodactylus quadricornis was firstly determined in the present study. The mitochondrial genome of C. quadricornis is 16 521 nucleotides, comprising 13 protein-coding genes and 2 ribosomal RNA genes, 22 tRNA genes and 2 main non-coding regions (the control region and the origin of the light-strand replication). The overall base composition was T, 26.3%; C, 29.6%; A, 27.8% and G, 16.3%. The gene arrangement, base composition, and tRNA structures of the complete mitochondrial genome of C. quadricornis is similar to other teleosts. Only two central conserved sequence blocks (CSB-2 and CSB-3) were identified in the control region. In addition, the conserved motif 5'-GCCGG-3' was identified in the origin of light-strand replication of C. quadricornis. The complete mitochondrial genome of C. quadricornis was used to construct phylogenetic tree, which shows that C. quadricornis and C. variegatus clustered in a clade and formed a sister relationship. This mitogenome sequence data would play an important role in population genetics and phylogenetic analysis of the Cheilodactylidae.
Nasrullah, Izza; Butt, Azeem M; Tahir, Shifa; Idrees, Muhammad; Tong, Yigang
2015-08-26
The Marburg virus (MARV) has a negative-sense single-stranded RNA genome, belongs to the family Filoviridae, and is responsible for several outbreaks of highly fatal hemorrhagic fever. Codon usage patterns of viruses reflect a series of evolutionary changes that enable viruses to shape their survival rates and fitness toward the external environment and, most importantly, their hosts. To understand the evolution of MARV at the codon level, we report a comprehensive analysis of synonymous codon usage patterns in MARV genomes. Multiple codon analysis approaches and statistical methods were performed to determine overall codon usage patterns, biases in codon usage, and influence of various factors, including mutation pressure, natural selection, and its two hosts, Homo sapiens and Rousettus aegyptiacus. Nucleotide composition and relative synonymous codon usage (RSCU) analysis revealed that MARV shows mutation bias and prefers U- and A-ended codons to code amino acids. Effective number of codons analysis indicated that overall codon usage among MARV genomes is slightly biased. The Parity Rule 2 plot analysis showed that GC and AU nucleotides were not used proportionally which accounts for the presence of natural selection. Codon usage patterns of MARV were also found to be influenced by its hosts. This indicates that MARV have evolved codon usage patterns that are specific to both of its hosts. Moreover, selection pressure from R. aegyptiacus on the MARV RSCU patterns was found to be dominant compared with that from H. sapiens. Overall, mutation pressure was found to be the most important and dominant force that shapes codon usage patterns in MARV. To our knowledge, this is the first detailed codon usage analysis of MARV and extends our understanding of the mechanisms that contribute to codon usage and evolution of MARV.
Complexity: an internet resource for analysis of DNA sequence complexity
Orlov, Y. L.; Potapov, V. N.
2004-01-01
The search for DNA regions with low complexity is one of the pivotal tasks of modern structural analysis of complete genomes. The low complexity may be preconditioned by strong inequality in nucleotide content (biased composition), by tandem or dispersed repeats or by palindrome-hairpin structures, as well as by a combination of all these factors. Several numerical measures of textual complexity, including combinatorial and linguistic ones, together with complexity estimation using a modified Lempel–Ziv algorithm, have been implemented in a software tool called ‘Complexity’ (http://wwwmgs.bionet.nsc.ru/mgs/programs/low_complexity/). The software enables a user to search for low-complexity regions in long sequences, e.g. complete bacterial genomes or eukaryotic chromosomes. In addition, it estimates the complexity of groups of aligned sequences. PMID:15215465
Novel Synthesis and Phenotypic Analysis of Mutant Clouds for Hepatitis E Virus Genotype 1.
Agarwal, Shubhra; Baccam, Prasith; Aggarwal, Rakesh; Veerapu, Naga Suresh
2018-02-15
Many RNA viruses exist as an ensemble of genetically diverse, replicating populations known as a mutant cloud. The genetic diversity (cloud size) and composition of this mutant cloud may influence several important phenotypic features of the virus, including its replication capacity. We applied a straightforward, bacterium-free approach using error-prone PCR coupled with reverse genetics to generate infectious mutant RNA clouds with various levels of genetic diversity from a genotype 1 strain of hepatitis E virus (HEV). Cloning and sequencing of a genomic fragment encompassing 70% of open reading frame 1 ( ORF1 ) or of the full genome from variants in the resultant clouds showed the occurrence of nucleotide mutations at a frequency on the order of 10 -3 per nucleotide copied and the existence of marked genetic diversity, with a high normalized Shannon entropy value. The mutant clouds showed transient replication in cell culture, while wild-type HEV did not. Cross-sectional data from these cell cultures supported the existence of differential effects of clouds of various sizes and compositions on phenotypic characteristics, such as the replication level of (+)-RNA progeny, the amounts of double-stranded RNA (a surrogate for the rate of viral replication) and ORF1 protein, and the expression of interferon-stimulated genes. Since mutant cloud size and composition influenced the viral phenotypic properties, a better understanding of this relationship may help to provide further insights into virus evolution and prediction of emerging viral diseases. IMPORTANCE Several biological or practical limitations currently prevent the study of phenotypic behavior of a mutant cloud in vitro We developed a simple and rapid method for synthesizing mutant clouds of hepatitis E virus (HEV), a single-stranded (+)-RNA [ss(+) RNA] virus, with various and controllable levels of genetic diversity, which could then be used in a cell culture system to study the effects of cloud size and composition on viral phenotype. In a cross-sectional analysis, we demonstrated that a particular mutant cloud which had an extremely high genetic diversity had a replication rate exceeding that of wild-type HEV. This method should thus provide a useful model for understanding the phenotypic behavior of ss(+) RNA viruses. Copyright © 2018 American Society for Microbiology.
Cyclic nucleotide binding proteins in the Arabidopsis thaliana and Oryza sativa genomes
Bridges, Dave; Fraser, Marie E; Moorhead, Greg BG
2005-01-01
Background Cyclic nucleotides are ubiquitous intracellular messengers. Until recently, the roles of cyclic nucleotides in plant cells have proven difficult to uncover. With an understanding of the protein domains which can bind cyclic nucleotides (CNB and GAF domains) we scanned the completed genomes of the higher plants Arabidopsis thaliana (mustard weed) and Oryza sativa (rice) for the effectors of these signalling molecules. Results Our analysis found that several ion channels and a class of thioesterases constitute the possible cyclic nucleotide binding proteins in plants. Contrary to some reports, we found no biochemical or bioinformatic evidence for a plant cyclic nucleotide regulated protein kinase, suggesting that cyclic nucleotide functions in plants have evolved differently than in mammals. Conclusion This paper provides a molecular framework for the discussion of cyclic nucleotide function in plants, and resolves a longstanding debate about the presence of a cyclic nucleotide dependent kinase in plants. PMID:15644130
Lakshmanan, Lakshmi Narayanan; Gruber, Jan; Halliwell, Barry; Gunawan, Rudiyanto
2015-01-01
Non D-loop direct repeats (DRs) in mitochondrial DNA (mtDNA) have been commonly implicated in the mutagenesis of mtDNA deletions associated with neuromuscular disease and ageing. Further, these DRs have been hypothesized to put a constraint on the lifespan of mammals and are under a negative selection pressure. Using a compendium of 294 mammalian mtDNA, we re-examined the relationship between species lifespan and the mutagenicity of such DRs. Contradicting the prevailing hypotheses, we found no significant evidence that long-lived mammals possess fewer mutagenic DRs than short-lived mammals. By comparing DR counts in human mtDNA with those in selectively randomized sequences, we also showed that the number of DRs in human mtDNA is primarily determined by global mtDNA properties, such as the bias in synonymous codon usage (SCU) and nucleotide composition. We found that SCU bias in mtDNA positively correlates with DR counts, where repeated usage of a subset of codons leads to more frequent DR occurrences. While bias in SCU and nucleotide composition has been attributed to nucleotide mutational bias, mammalian mtDNA still exhibit higher SCU bias and DR counts than expected from such mutational bias, suggesting a lack of negative selection against non D-loop DRs. PMID:25855815
USDA-ARS?s Scientific Manuscript database
Principal component analysis (PCA) with 36,621 polymorphic genome-anchored single nucleotide polymorphisms (SNPs) identified collectively for Capsicum annuum and Capsicum baccatum was used to show the distribution of these 2 important incompatible cultivated pepper species. Estimated mean nucleotide...
Jacobsson, Josefin A.; Almén, Markus Sällman; Benedict, Christian; Hedberg, Lilia A.; Michaëlsson, Karl; Brooks, Samantha; Kullberg, Joel; Axelsson, Tomas; Johansson, Lars; Ahlström, Håkan; Fredriksson, Robert; Lind, Lars; Schiöth, Helgi B.
2011-01-01
Background The rs9939609 single-nucleotide polymorphism (SNP) in the fat mass and obesity (FTO) gene has previously been associated with higher BMI levels in children and young adults. In contrast, this association was not found in elderly men. BMI is a measure of overweight in relation to the individuals' height, but offers no insight into the regional body fat composition or distribution. Objective To examine whether the FTO gene is associated with overweight and body composition-related phenotypes rather than BMI, we measured waist circumference, total fat mass, trunk fat mass, leg fat mass, visceral and subcutaneous adipose tissue, and daily energy intake in 985 humans (493 women) at the age of 70 years. In total, 733 SNPs located in the FTO gene were genotyped in order to examine whether rs9939609 alone or the other SNPs, or their combinations, are linked to obesity-related measures in elderly humans. Design Cross-sectional analysis of the Prospective Investigation of the Vasculature in Uppsala Seniors (PIVUS) cohort. Results Neither a single SNP, such as rs9939609, nor a SNP combination was significantly linked to overweight, body composition-related measures, or daily energy intake in elderly humans. Of note, these observations hold both among men and women. Conclusions Due to the diversity of measurements included in the study, our findings strengthen the view that the effect of FTO on body composition appears to be less profound in later life compared to younger ages and that this is seemingly independent of gender. PMID:21637715
Correa-Rodríguez, María; Schmidt-RioValle, Jacqueline; Rueda-Medina, Blanca
2017-11-01
The aim of the present study was to investigate the possible influence of low-density lipoprotein receptor-related protein 5 (LRP5) and sclerostin (SOST) genes as genetic factors contributing to calcaneal quantitative ultrasound (QUS) and body composition variables in a population of young Caucasian adults. The study population comprised a total of 575 individuals (mean age 20.41years; SD 2.36) whose bone mass was assessed through QUS to determine broadband ultrasound attenuation (BUA, dB/MHz). Body composition measurements were performed using a body composition analyser. Seven single-nucleotide polymorphisms (SNPs) of LRP5 (rs2306862, rs599083, rs556442 and rs3736228) and SOST (rs4792909, rs851054 and rs2023794) were selected as genetic markers and genotyped using TaqMan OpenArray ® technology. Linear regression analysis was used to test the possible association of the tested SNPs with QUS and body composition parameters. Linear regression analysis revealed that the rs3736228 SNP of LPR5 was significantly associated with BUA after adjustment for age, sex, weight, height, physical activity and calcium intake (P = 0.028, β (95% CI) = 0.089 (0.099-1.691). For the remaining SNPs, no significant association with the QUS measurement was observed. Regarding body composition, no significant association was found between LRP5 and SOST polymorphisms and body mass index, total fat mass and total lean mass after adjustment for age and sex as covariates. We concluded that the rs3736228 LRP5 genetic polymorphism influences calcaneal QUS parameter in a population of young Caucasian adults. This finding suggests that LRP5 might be an important genetic marker contributing to bone mass accrual early in life.
O'Toole, Amanda S.; Miller, Stacy; Haines, Nathan; Zink, M. Coleen; Serra, Martin J.
2006-01-01
Thermodynamic parameters are reported for duplex formation of 48 self-complementary RNA duplexes containing Watson–Crick terminal base pairs (GC, AU and UA) with all 16 possible 3′ double-nucleotide overhangs; mimicking the structures of short interfering RNAs (siRNA) and microRNAs (miRNA). Based on nearest-neighbor analysis, the addition of a second dangling nucleotide to a single 3′ dangling nucleotide increases stability of duplex formation up to 0.8 kcal/mol in a sequence dependent manner. Results from this study in conjunction with data from a previous study [A. S. O'Toole, S. Miller and M. J. Serra (2005) RNA, 11, 512.] allows for the development of a refined nearest-neighbor model to predict the influence of 3′ double-nucleotide overhangs on the stability of duplex formation. The model improves the prediction of free energy and melting temperature when tested against five oligomers with various core duplex sequences. Phylogenetic analysis of naturally occurring miRNAs was performed to support our results. Selection of the effector miR strand of the mature miRNA duplex appears to be dependent upon the identity of the 3′ double-nucleotide overhang. Thermodynamic parameters for 3′ single terminal overhangs adjacent to a UA pair are also presented. PMID:16820533
Wu, Lei; He, Yao; Zhang, Di
2015-11-01
To systematically evaluate the association between single nucleotide polymorphism of rs2231142 genetic susceptibility and gout in East Asian population. The literature retrieval was conducted by using English databases (Medline, EMbase), Chinese databases (CNKI, Vip, Wanfang, SinaMed) and others to collect the published papers on the association between single nucleotide polymorphism of rs2231142 genetic susceptibility and gout by the end of December 2014. Meta-analysis was performed with software Stata 12.0. Nine studies were included. There were significant associations between increased risk of gout and single nucleotide polymorphism of rs2231142, the combined OR was 2.04 (95%CI: 1.82-2.28) for A allele and C allele, 1.97 (95%CI: 1.57-2.48) for CA and CC, 3.71 (95%CI: 3.07-4.47) for AA and CC. Sex and region specific subgroup analysis showed less heterogeneity. There is significant association between gout and single nucleotide polymorphism of rs2231142 in East Asian population, and A allele is a high risk gene for gout.
Pyridine nucleotides in regulation of cell death and survival by redox and non-redox reactions.
Novak Kujundžić, Renata; Žarković, Neven; Gall Trošelj, Koraljka
2014-01-01
Changes of the level and ratios of pyridine nucleotides determine metabolism- dependent cellular redox status and the activity of poly(ADP-ribose) polymerases (PARPs) and sirtuins, thereby influencing several processes closely related to cell survival and death. Pyridine nucleotides participate in numerous metabolic reactions whereby their net cellular level remains constant, but the ratios of NAD+/NADP+ and NADH/NADPH oscillate according to metabolic changes in response to diverse stress signals. In non-redox reactions, NAD+ is degraded and quickly, afterward, resynthesized in the NAD+ salvage pathway, unless overwhelming activation of PARP-1 consumes NAD+ to the point of no return, when the cell can no longer generate enough ATP to accommodate NAD+ resynthesis. The activity of PARP-1 is mandatory for the onset of cytoprotective autophagy on sublethal stress signals. It has become increasingly clear that redox status, largely influenced by the metabolism-dependent composition of the pyridine nucleotides pool, plays an important role in the synthesis of pro-apoptotic and anti-apoptotic sphingolipids. Awareness of the involvement of the prosurvival sphingolipid, sphingosine-1-phosphate, in transition from inflammation to malignant transformation has recently emerged. Here, the participation of pyridine nucleotides in redox and non-redox reactions, sphingolipid metabolism, and their role in cell fate decisions is reviewed.
2010-01-01
We examined the analysis of nucleotides and nucleotide sugars by chromatography on porous graphitic carbon with mass spectrometric detection, a method that evades contamination of the MS instrument with ion pairing reagent. At first, adenosine triphosphate (ATP) and other triphosphate nucleotides exhibited very poor chromatographic behavior on new columns and could hardly be eluted from columns previously cleaned with trifluoroacetic acid. Satisfactory performance of both new and older columns could, however, be achieved by treatment with reducing agent and, unexpectedly, hydrochloric acid. Over 40 nucleotides could be detected in cell extracts including many isobaric compounds such as ATP, deoxyguanosine diphosphate (dGTP), and phospho-adenosine-5′-phosphosulfate or 3′,5′-cyclic adenosine 5'-monophosphate (AMP) and its much more abundant isomer 2′,3′-cylic AMP. A fast sample preparation procedure based on solid-phase extraction on carbon allowed detection of very short-lived analytes such as cytidine 5'-monophosphate (CMP)-2-keto-deoxy-octulosonic acid. In animal cells and plant tissues, about 35 nucleotide sugars were detected, among them rarely considered metabolites such as uridine 5'-diphosphate (UDP)-l-arabinopyranose, UDP-l-arabinofuranose, guanosine 5'-diphosphate (GDP)-l-galactofuranose, UDP-l-rhamnose, and adenosine diphosphate (ADP)-sugars. Surprisingly, UDP-arabinopyranose was also found in Chinese hamster ovary (CHO) cells. Due to the unique structural selectivity of graphitic carbon, the method described herein distinguishes more nucleotides and nucleotide sugars than previously reported approaches. PMID:21043458
DNA Asymmetric Strand Bias Affects the Amino Acid Composition of Mitochondrial Proteins
Min, Xiang Jia; Hickey, Donal A.
2007-01-01
Abstract Variations in GC content between genomes have been extensively documented. Genomes with comparable GC contents can, however, still differ in the apportionment of the G and C nucleotides between the two DNA strands. This asymmetric strand bias is known as GC skew. Here, we have investigated the impact of differences in nucleotide skew on the amino acid composition of the encoded proteins. We compared orthologous genes between animal mitochondrial genomes that show large differences in GC and AT skews. Specifically, we compared the mitochondrial genomes of mammals, which are characterized by a negative GC skew and a positive AT skew, to those of flatworms, which show the opposite skews for both GC and AT base pairs. We found that the mammalian proteins are highly enriched in amino acids encoded by CA-rich codons (as predicted by their negative GC and positive AT skews), whereas their flatworm orthologs were enriched in amino acids encoded by GT-rich codons (also as predicted from their skews). We found that these differences in mitochondrial strand asymmetry (measured as GC and AT skews) can have very large, predictable effects on the composition of the encoded proteins. PMID:17974594
Higher-level phylogeny of paraneopteran insects inferred from mitochondrial genome sequences
Li, Hu; Shao, Renfu; Song, Nan; Song, Fan; Jiang, Pei; Li, Zhihong; Cai, Wanzhi
2015-01-01
Mitochondrial (mt) genome data have been proven to be informative for animal phylogenetic studies but may also suffer from systematic errors, due to the effects of accelerated substitution rate and compositional heterogeneity. We analyzed the mt genomes of 25 insect species from the four paraneopteran orders, aiming to better understand how accelerated substitution rate and compositional heterogeneity affect the inferences of the higher-level phylogeny of this diverse group of hemimetabolous insects. We found substantial heterogeneity in base composition and contrasting rates in nucleotide substitution among these paraneopteran insects, which complicate the inference of higher-level phylogeny. The phylogenies inferred with concatenated sequences of mt genes using maximum likelihood and Bayesian methods and homogeneous models failed to recover Psocodea and Hemiptera as monophyletic groups but grouped, instead, the taxa that had accelerated substitution rates together, including Sternorrhyncha (a suborder of Hemiptera), Thysanoptera, Phthiraptera and Liposcelididae (a family of Psocoptera). Bayesian inference with nucleotide sequences and heterogeneous models (CAT and CAT + GTR), however, recovered Psocodea, Thysanoptera and Hemiptera each as a monophyletic group. Within Psocodea, Liposcelididae is more closely related to Phthiraptera than to other species of Psocoptera. Furthermore, Thysanoptera was recovered as the sister group to Hemiptera. PMID:25704094
Effects of preservation methods on amino acids and 5'-nucleotides of Agaricus bisporus mushrooms.
Liu, Ying; Huang, Fan; Yang, Hong; Ibrahim, S A; Wang, Yan-Feng; Huang, Wen
2014-04-15
In this study, the proximate composition, free amino acids content and 5'-nucleotides in frozen, canned and salted Agaricus bisporus (A. bisporus) were investigated. We found that the three kinds of A. bisporus products were good sources of protein, with amount varying in the ranges of 16.54-24.35g/100g (dry weight). Freezing, canning and salting process, followed by 6months of storage led to a significant reduction in free amino acids, especially tyrosine, alanine, glutamine and cysteine. There were medium levels of MSG-like amino acids in frozen A. bisporus and canned A. bisporus, and low levels of MSG-like amino acids in salted A. bisporus. The mount of flavor 5'-nucleotides in frozen A. bisporus was higher than that of canned and salted A. bisporus. The present study thus suggests that freezing is beneficial for the preservation of A. bisporus. Copyright © 2013 Elsevier Ltd. All rights reserved.
Complete genome sequence of a novel avian paramyxovirus isolated from wild birds in South Korea.
Jeong, Jipseol; Kim, Youngsik; An, Injung; Wang, Seung-Jun; Kim, Yongkwan; Lee, Hyun-Jeong; Choi, Kang-Seuk; Im, Se-Pyeong; Min, Wongi; Oem, Jae-Ku; Jheong, Weonhwa
2018-01-01
A novel avian paramyxovirus (APMV), Cheonsu1510, was isolated from wild bird feces in South Korea and serologically and genetically characterized. In hemagglutination inhibition tests, antiserum against Cheonsu1510 showed low reactivity with other APMVs and vice versa. The complete genome of Cheonsu1510 comprised 15,408 nucleotides, contained six open reading frames (3'-N-P-M-F-HN-L-5'), and showed low sequence identity to other APMVs (< 63%) and a unique genomic composition. Phylogenetic analysis revealed that Cheonsu1510 was related to but distinct from APMV-1, -9, and -15. These results suggest that Cheonsu1510 represents a new APMV serotype, APMV-17.
Hartmann, Luise; Stephenson, Christine F; Verkamp, Stephanie R; Johnson, Krystal R; Burnworth, Bettina; Hammock, Kelle; Brodersen, Lisa Eidenschink; de Baca, Monica E; Wells, Denise A; Loken, Michael R; Zehentner, Barbara K
2014-12-01
Array comparative genomic hybridization (aCGH) has become a powerful tool for analyzing hematopoietic neoplasms and identifying genome-wide copy number changes in a single assay. aCGH also has superior resolution compared with fluorescence in situ hybridization (FISH) or conventional cytogenetics. Integration of single nucleotide polymorphism (SNP) probes with microarray analysis allows additional identification of acquired uniparental disomy, a copy neutral aberration with known potential to contribute to tumor pathogenesis. However, a limitation of microarray analysis has been the inability to detect clonal heterogeneity in a sample. This study comprised 16 samples (acute myeloid leukemia, myelodysplastic syndrome, chronic lymphocytic leukemia, plasma cell neoplasm) with complex cytogenetic features and evidence of clonal evolution. We used an integrated manual peak reassignment approach combining analysis of aCGH and SNP microarray data for characterization of subclonal abnormalities. We compared array findings with results obtained from conventional cytogenetic and FISH studies. Clonal heterogeneity was detected in 13 of 16 samples by microarray on the basis of log2 values. Use of the manual peak reassignment analysis approach improved resolution of the sample's clonal composition and genetic heterogeneity in 10 of 13 (77%) patients. Moreover, in 3 patients, clonal disease progression was revealed by array analysis that was not evident by cytogenetic or FISH studies. Genetic abnormalities originating from separate clonal subpopulations can be identified and further characterized by combining aCGH and SNP hybridization results from 1 integrated microarray chip by use of the manual peak reassignment technique. Its clinical utility in comparison to conventional cytogenetic or FISH studies is demonstrated. © 2014 American Association for Clinical Chemistry.
Voltage-gated calcium channel and antisense oligonucleotides thereto
NASA Technical Reports Server (NTRS)
Friedman, Peter A. (Inventor); Duncan, Randall L. (Inventor); Hruska, Keith A. (Inventor); Barry, Elizabeth L. R. (Inventor)
1998-01-01
An antisense oligonucleotide of 10 to 35 nucleotides in length that can hybridize with a region of the .alpha..sub.1 subunit of the SA-Cat channel gene DNA or mRNA is provided, together with pharmaceutical compositions containing and methods utilizing such antisense oligonucleotide.
Discrete RNA libraries from pseudo-torsional space
Humphris-Narayanan, Elisabeth
2012-01-01
The discovery that RNA molecules can fold into complex structures and carry out diverse cellular roles has led to interest in developing tools for modeling RNA tertiary structure. While significant progress has been made in establishing that the RNA backbone is rotameric, few libraries of discrete conformations specifically for use in RNA modeling have been validated. Here, we present six libraries of discrete RNA conformations based on a simplified pseudo-torsional notation of the RNA backbone, comparable to phi and psi in the protein backbone. We evaluate the ability of each library to represent single nucleotide backbone conformations and we show how individual library fragments can be assembled into dinucleotides that are consistent with established RNA backbone descriptors spanning from sugar to sugar. We then use each library to build all-atom models of 20 test folds and we show how the composition of a fragment library can limit model quality. Despite the limitations inherent in using discretized libraries, we find that several hundred discrete fragments can rebuild RNA folds up to 174 nucleotides in length with atomic-level accuracy (<1.5Å RMSD). We anticipate the libraries presented here could easily be incorporated into RNA structural modeling, analysis, or refinement tools. PMID:22425640
DOE Office of Scientific and Technical Information (OSTI.GOV)
Akabayov, B.; Akabayov, S; Lee , S
Gene 5 of bacteriophage T7 encodes a DNA polymerase (gp5) responsible for the replication of the phage DNA. Gp5 polymerizes nucleotides with low processivity, dissociating after the incorporation of 1 to 50 nucleotides. Thioredoxin (trx) of Escherichia coli binds tightly (Kd = 5 nM) to a unique segment in the thumb subdomain of gp5 and increases processivity. We have probed the molecular basis for the increase in processivity. A single-molecule experiment reveals differences in rates of enzymatic activity and processivity between gp5 and gp5/trx. Small angle X-ray scattering studies combined with nuclease footprinting reveal two conformations of gp5, one inmore » the free state and one upon binding to trx. Comparative analysis of the DNA binding clefts of DNA polymerases and DNA binding proteins show that the binding surface contains more hydrophobic residues than other DNA binding proteins. The balanced composition between hydrophobic and charged residues of the binding site allows for efficient sliding of gp5/trx on the DNA. We propose a model for trx-induced conformational changes in gp5 that enhance the processivity by increasing the interaction of gp5 with DNA.« less
Oluwayelu, D O; Todd, D; Olaleye, O D
2008-12-01
This work reports the first molecular analysis study of chicken anaemia virus (CAV) in backyard chickens in Africa using molecular cloning and sequence analysis to characterize CAV strains obtained from commercial chickens and Nigerian backyard chickens. Partial VP1 gene sequences were determined for three CAVs from commercial chickens and for six CAV variants present in samples from a backyard chicken. Multiple alignment analysis revealed that the 6% and 4% nucleotide diversity obtained respectively for the commercial and backyard chicken strains translated to only 2% amino acid diversity for each breed. Overall, the amino acid composition of Nigerian CAVs was found to be highly conserved. Since the partial VP1 gene sequence of two backyard chicken cloned CAV strains (NGR/CI-8 and NGR/CI-9) were almost identical and evolutionarily closely related to the commercial chicken strains NGR-1, and NGR-4 and NGR-5, respectively, we concluded that CAV infections had crossed the farm boundary.
Error correction and diversity analysis of population mixtures determined by NGS
Burroughs, Nigel J.; Evans, David J.; Ryabov, Eugene V.
2014-01-01
The impetus for this work was the need to analyse nucleotide diversity in a viral mix taken from honeybees. The paper has two findings. First, a method for correction of next generation sequencing error in the distribution of nucleotides at a site is developed. Second, a package of methods for assessment of nucleotide diversity is assembled. The error correction method is statistically based and works at the level of the nucleotide distribution rather than the level of individual nucleotides. The method relies on an error model and a sample of known viral genotypes that is used for model calibration. A compendium of existing and new diversity analysis tools is also presented, allowing hypotheses about diversity and mean diversity to be tested and associated confidence intervals to be calculated. The methods are illustrated using honeybee viral samples. Software in both Excel and Matlab and a guide are available at http://www2.warwick.ac.uk/fac/sci/systemsbiology/research/software/, the Warwick University Systems Biology Centre software download site. PMID:25405074
Fatty acid composition and desaturase gene expression in flax (Linum usitatissimum L.).
Thambugala, Dinushika; Cloutier, Sylvie
2014-11-01
Little is known about the relationship between expression levels of fatty acid desaturase genes during seed development and fatty acid (FA) composition in flax. In the present study, we looked at promoter structural variations of six FA desaturase genes and their relative expression throughout seed development. Computational analysis of the nucleotide sequences of the sad1, sad2, fad2a, fad2b, fad3a and fad3b promoters showed several basic transcriptional elements including CAAT and TATA boxes, and several putative target-binding sites for transcription factors, which have been reported to be involved in the regulation of lipid metabolism. Using semi-quantitative reverse transcriptase PCR, the expression patterns throughout seed development of the six FA desaturase genes were measured in six flax genotypes that differed for FA composition but that carried the same desaturase isoforms. FA composition data were determined by phenotyping the field grown genotypes over four years in two environments. All six genes displayed a bell-shaped pattern of expression peaking at 20 or 24 days after anthesis. Sad2 was the most highly expressed. The expression of all six desaturase genes did not differ significantly between genotypes (P = 0.1400), hence there were no correlations between FA desaturase gene expression and variations in FA composition in relatively low, intermediate and high linolenic acid genotypes expressing identical isoforms for all six desaturases. These results provide further clues towards understanding the genetic factors responsible for FA composition in flax.
Sidell, Neil; Mathad, Raveendra I.; Shu, Feng-jue; Zhang, Zhenjiang; Kallen, Caleb B.; Yang, Danzhou
2011-01-01
DNA-intercalating molecules can impair DNA replication, DNA repair, and gene transcription. We previously demonstrated that XR5944, a DNA bis-intercalator, specifically blocks binding of estrogen receptor-α (ERα) to the consensus estrogen response element (ERE). The consensus ERE sequence is AGGTCAnnnTGACCT, where nnn is known as the tri-nucleotide spacer. Recent work has shown that the tri-nucleotide spacer can modulate ERα-ERE binding affinity and ligand-mediated transcriptional responses. To further understand the mechanism by which XR5944 inhibits ERα-ERE binding, we tested its ability to interact with consensus EREs with variable tri-nucleotide spacer sequences and with natural but non-consensus ERE sequences using one dimensional nuclear magnetic resonance (1D 1H NMR) titration studies. We found that the tri-nucleotide spacer sequence significantly modulates the binding of XR5944 to EREs. Of the sequences that were tested, EREs with CGG and AGG spacers showed the best binding specificity with XR5944, while those spaced with TTT demonstrated the least specific binding. The binding stoichiometry of XR5944 with EREs was 2:1, which can explain why the spacer influences the drug-DNA interaction; each XR5944 spans four nucleotides (including portions of the spacer) when intercalating with DNA. To validate our NMR results, we conducted functional studies using reporter constructs containing consensus EREs with tri-nucleotide spacers CGG, CTG, and TTT. Results of reporter assays in MCF-7 cells indicated that XR5944 was significantly more potent in inhibiting the activity of CGG- than TTT-spaced EREs, consistent with our NMR results. Taken together, these findings predict that the anti-estrogenic effects of XR5944 will depend not only on ERE half-site composition but also on the tri-nucleotide spacer sequence of EREs located in the promoters of estrogen-responsive genes. PMID:21333738
Jing, Ruiyong; Liu, Junjie; Yu, Zhenhua; Liu, Xiaobing; Wang, Guanghua
2014-01-01
Numerous studies have revealed the high diversity of cyanophages in marine and freshwater environments, but little is currently known about the diversity of cyanophages in paddy fields, particularly in Northeast (NE) China. To elucidate the genetic diversity of cyanophages in paddy floodwaters in NE China, viral capsid assembly protein gene (g20) sequences from five floodwater samples were amplified with the primers CPS1 and CPS8. Denaturing gradient gel electrophoresis (DGGE) was applied to distinguish different g20 clones. In total, 54 clones differing in g20 nucleotide sequences were obtained in this study. Phylogenetic analysis showed that the distribution of g20 sequences in this study was different from that in Japanese paddy fields, and all the sequences were grouped into Clusters α, β, γ and ε. Within Clusters α and β, three new small clusters (PFW-VII∼-IX) were identified. UniFrac analysis of g20 clone assemblages demonstrated that the community compositions of cyanophage varied among marine, lake and paddy field environments. In paddy floodwater, community compositions of cyanophage were also different between NE China and Japan. PMID:24533125
Analysis of the synonymous codon usage bias in recently emerged enterovirus D68 strains.
Karniychuk, Uladzimir U
2016-09-02
Understanding the codon usage pattern of a pathogen and relationship between pathogen and host's codon usage patterns has fundamental and applied interests. Enterovirus D68 (EV-D68) is an emerging pathogen with a potentially high public health significance. In the present study, the synonymous codon usage bias of 27 recently emerged, and historical EV-D68 strains was analyzed. In contrast to previously studied enteroviruses (enterovirus 71 and poliovirus), EV-D68 and human host have a high discrepancy between favored codons. Analysis of viral synonymous codon usage bias metrics, viral nucleotide/dinucleotide compositional parameters, and viral protein properties showed that mutational pressure is more involved in shaping the synonymous codon usage bias of EV-D68 than translation selection. Computation of codon adaptation indices allowed to estimate expression potential of the EV-D68 genome in several commonly used laboratory animals. This approach requires experimental validation and may provide an auxiliary tool for the rational selection of laboratory animals to model emerging viral diseases. Enterovirus D68 genome compositional and codon usage data can be useful for further pathogenesis, animal model, and vaccine design studies. Copyright © 2016 Elsevier B.V. All rights reserved.
Conservation/Mutation in the Splice Sites of Mitochondrial Solute Carrier Genes of Vertebrates.
Calvello, Rosa; Panaro, Maria A; Salvatore, Rosaria; Mitolo, Vincenzo; Cianciulli, Antonia
2016-10-01
The "canonical" introns begin by the dinucleotide GT and end by the dinucleotide AG. GT, together with a few downstream nucleotides, and AG, with a few of the immediately preceding nucleotides, are thought to be the strongest splicing signals (5'ss and 3'ss, respectively). We examined the composition of the intronic initial and terminal hexanucleotides of the mitochondrial solute carrier genes (SLC25A's) of zebrafish, chicken, mouse, and human. These genes are orthologous and we selected the transcripts in which the arrangement of exons and introns was superimposable in the species considered. Both 5'ss and 3'ss were highly polymorphic, with 104 and 126 different configurations, respectively, in our sample. In the line of evolution from zebrafish to chicken, as well as in that from zebrafish to mammals, the average nucleotide conservation in the four variable nucleotides was about 50 % at 5' and 40 % at 3'. In the divergent evolution of mouse and human, the conservation was about 80 % at 5' and 70 % at 3'. Despite these changes, the splicing signals remain strong enough to operate at the same site. At both 5' and 3', the frequency of a nucleotide at a given position in the zebrafish sequence is positively correlated with its conservation in chicken and mammals, suggesting that selection continued to operate in birds and mammals along similar lines.
Superstatistical model of bacterial DNA architecture
NASA Astrophysics Data System (ADS)
Bogachev, Mikhail I.; Markelov, Oleg A.; Kayumov, Airat R.; Bunde, Armin
2017-02-01
Understanding the physical principles that govern the complex DNA structural organization as well as its mechanical and thermodynamical properties is essential for the advancement in both life sciences and genetic engineering. Recently we have discovered that the complex DNA organization is explicitly reflected in the arrangement of nucleotides depicted by the universal power law tailed internucleotide interval distribution that is valid for complete genomes of various prokaryotic and eukaryotic organisms. Here we suggest a superstatistical model that represents a long DNA molecule by a series of consecutive ~150 bp DNA segments with the alternation of the local nucleotide composition between segments exhibiting long-range correlations. We show that the superstatistical model and the corresponding DNA generation algorithm explicitly reproduce the laws governing the empirical nucleotide arrangement properties of the DNA sequences for various global GC contents and optimal living temperatures. Finally, we discuss the relevance of our model in terms of the DNA mechanical properties. As an outlook, we focus on finding the DNA sequences that encode a given protein while simultaneously reproducing the nucleotide arrangement laws observed from empirical genomes, that may be of interest in the optimization of genetic engineering of long DNA molecules.
Lomozik, L; Gasowska, A; Krzysko, G
2006-11-01
The interactions of Cu(II) ions with adenosine-5'-monophosphate (AMP), cytidine-5'-monophosphate (CMP) and 1,12-diamino-4,9-dioxadodecane (OSpm) were studied. A potentiometric method was applied to determine the composition and stability constants of complexes formed, while the mode of interactions was analysed by spectral methods (ultraviolet and visible spectroscopy (UV-Vis), electron paramagnetic resonance (EPR), (13)C NMR, (31)P NMR). In metal-free systems, molecular complexes nucleotide-polyamine (NMP)H(x)(OSpm) were formed. The endocyclic nitrogen atoms of the purine ring N(1), N(7), the nitrogen atom of the pyrimidine ring N(3), the oxygen atoms of the phosphate group of the nucleotide and the protonated nitrogen atoms of the polyamine were the reaction centres. The mode of interaction of the metal ion with OSpm and the nucleotides (AMP or CMP) in the coordination compounds was established. In the system Cu(II)/OSpm the dinuclear complex Cu(2)(OSpm) forms, while in the ternary systems Cu(II)/nucleotide/OSpm the species type MH(x)LL' and MLL' appear. In the MH(x)LL' type species, the main centres of copper (II) ion binding in the nucleotide are the phosphate groups. The protonated amino groups of OSpm are involved in non-covalent interaction with the nitrogen atoms N(1), N(7) or N(3) of the purine or pyrimidine ring, whereas at higher pH, deprotonated nitrogen atoms of polyamine are engaged in metallation in MLL' species.
Schermerhorn, Kelly M.; Gardner, Andrew F.
2015-01-01
Family D DNA polymerases (polDs) have been implicated as the major replicative polymerase in archaea, excluding the Crenarchaeota branch, and bear little sequence homology to other DNA polymerase families. Here we report a detailed kinetic analysis of nucleotide incorporation and exonuclease activity for a Family D DNA polymerase from Thermococcus sp. 9°N. Pre-steady-state single-turnover nucleotide incorporation assays were performed to obtain the kinetic parameters, kpol and Kd, for correct nucleotide incorporation, incorrect nucleotide incorporation, and ribonucleotide incorporation by exonuclease-deficient polD. Correct nucleotide incorporation kinetics revealed a relatively slow maximal rate of polymerization (kpol ∼2.5 s−1) and especially tight nucleotide binding (Kd(dNTP) ∼1.7 μm), compared with DNA polymerases from Families A, B, C, X, and Y. Furthermore, pre-steady-state nucleotide incorporation assays revealed that polD prevents the incorporation of incorrect nucleotides and ribonucleotides primarily through reduced nucleotide binding affinity. Pre-steady-state single-turnover assays on wild-type 9°N polD were used to examine 3′-5′ exonuclease hydrolysis activity in the presence of Mg2+ and Mn2+. Interestingly, substituting Mn2+ for Mg2+ accelerated hydrolysis rates >40-fold (kexo ≥110 s−1 versus ≥2.5 s−1). Preference for Mn2+ over Mg2+ in exonuclease hydrolysis activity is a property unique to the polD family. The kinetic assays performed in this work provide critical insight into the mechanisms that polD employs to accurately and efficiently replicate the archaeal genome. Furthermore, despite the unique properties of polD, this work suggests that a conserved polymerase kinetic pathway is present in all known DNA polymerase families. PMID:26160179
Plastid: nucleotide-resolution analysis of next-generation sequencing and genomics data.
Dunn, Joshua G; Weissman, Jonathan S
2016-11-22
Next-generation sequencing (NGS) informs many biological questions with unprecedented depth and nucleotide resolution. These assays have created a need for analytical tools that enable users to manipulate data nucleotide-by-nucleotide robustly and easily. Furthermore, because many NGS assays encode information jointly within multiple properties of read alignments - for example, in ribosome profiling, the locations of ribosomes are jointly encoded in alignment coordinates and length - analytical tools are often required to extract the biological meaning from the alignments before analysis. Many assay-specific pipelines exist for this purpose, but there remains a need for user-friendly, generalized, nucleotide-resolution tools that are not limited to specific experimental regimes or analytical workflows. Plastid is a Python library designed specifically for nucleotide-resolution analysis of genomics and NGS data. As such, Plastid is designed to extract assay-specific information from read alignments while retaining generality and extensibility to novel NGS assays. Plastid represents NGS and other biological data as arrays of values associated with genomic or transcriptomic positions, and contains configurable tools to convert data from a variety of sources to such arrays. Plastid also includes numerous tools to manipulate even discontinuous genomic features, such as spliced transcripts, with nucleotide precision. Plastid automatically handles conversion between genomic and feature-centric coordinates, accounting for splicing and strand, freeing users of burdensome accounting. Finally, Plastid's data models use consistent and familiar biological idioms, enabling even beginners to develop sophisticated analytical workflows with minimal effort. Plastid is a versatile toolkit that has been used to analyze data from multiple NGS assays, including RNA-seq, ribosome profiling, and DMS-seq. It forms the genomic engine of our ORF annotation tool, ORF-RATER, and is readily adapted to novel NGS assays. Examples, tutorials, and extensive documentation can be found at https://plastid.readthedocs.io .
USDA-ARS?s Scientific Manuscript database
Nucleotide-activated sugars are essential substrates for plant cell wall carbohydrate-polymer biosynthetic glycosyltransferase enzymes. The most prevalent sugars in grass cell walls include glucose (Glc), xylose (Xyl), and arabinose (Ara). These sugars are biosynthetically related via the uridine di...
Statistical analysis of nucleotide sequences of the hemagglutinin gene of human influenza A viruses.
Ina, Y; Gojobori, T
1994-01-01
To examine whether positive selection operates on the hemagglutinin 1 (HA1) gene of human influenza A viruses (H1 subtype), 21 nucleotide sequences of the HA1 gene were statistically analyzed. The nucleotide sequences were divided into antigenic and nonantigenic sites. The nucleotide diversities for antigenic and nonantigenic sites of the HA1 gene were computed at synonymous and nonsynonymous sites separately. For nonantigenic sites, the nucleotide diversities were larger at synonymous sites than at nonsynonymous sites. This is consistent with the neutral theory of molecular evolution. For antigenic sites, however, the nucleotide diversities at nonsynonymous sites were larger than those at synonymous sites. These results suggest that positive selection operates on antigenic sites of the HA1 gene of human influenza A viruses (H1 subtype). PMID:8078892
Symbiotic Bacteria Associated with Stomach Discs of Human Lice▿ †
Sasaki-Fukatsu, Kayoko ; Koga, Ryuichi; Nikoh, Naruo; Yoshizawa, Kazunori; Kasai, Shinji; Mihara, Minoru; Kobayashi, Mutsuo; Tomita, Takashi; Fukatsu, Takema
2006-01-01
The symbiotic bacteria associated with the stomach disc, a large aggregate of bacteriocytes on the ventral side of the midgut, of human body and head lice were characterized. Molecular phylogenetic analysis of 16S rRNA gene sequences showed that the symbionts formed a distinct and well-defined clade in the Gammaproteobacteria. The sequences exhibited AT-biased nucleotide composition and accelerated molecular evolution. In situ hybridization revealed that in nymphs and adult males, the symbiont was localized in the stomach disc, while in adult females, the symbiont was not in the stomach disc but in the lateral oviducts and the posterior pole of the oocytes due to female-specific symbiont migration. We propose the designation “Candidatus Riesia pediculicola” for the louse symbionts. PMID:16950915
Bavykin, Sergei G.; Mirzabekova, legal representative, Natalia V.; Mirzabekov, deceased, Andrei D.
2007-12-04
The present invention relates to methods and compositions for using nucleotide sequence variations of 16S and 23S rRNA within the B. cereus group to discriminate a highly infectious bacterium B. anthracis from closely related microorganisms. Sequence variations in the 16S and 23S rRNA of the B. cereus subgroup including B. anthracis are utilized to construct an array that can detect these sequence variations through selective hybridizations and discriminate B. cereus group that includes B. anthracis. Discrimination of single base differences in rRNA was achieved with a microchip during analysis of B. cereus group isolates from both single and in mixed samples, as well as identification of polymorphic sites. Successful use of a microchip to determine the appropriate subgroup classification using eight reference microorganisms from the B. cereus group as a study set, was demonstrated.
ISRNA: an integrative online toolkit for short reads from high-throughput sequencing data.
Luo, Guan-Zheng; Yang, Wei; Ma, Ying-Ke; Wang, Xiu-Jie
2014-02-01
Integrative Short Reads NAvigator (ISRNA) is an online toolkit for analyzing high-throughput small RNA sequencing data. Besides the high-speed genome mapping function, ISRNA provides statistics for genomic location, length distribution and nucleotide composition bias analysis of sequence reads. Number of reads mapped to known microRNAs and other classes of short non-coding RNAs, coverage of short reads on genes, expression abundance of sequence reads as well as some other analysis functions are also supported. The versatile search functions enable users to select sequence reads according to their sub-sequences, expression abundance, genomic location, relationship to genes, etc. A specialized genome browser is integrated to visualize the genomic distribution of short reads. ISRNA also supports management and comparison among multiple datasets. ISRNA is implemented in Java/C++/Perl/MySQL and can be freely accessed at http://omicslab.genetics.ac.cn/ISRNA/.
Huiet, L; Feldstein, P A; Tsai, J H; Falk, B W
1993-12-01
Primer extension analyses and a PCR-based cloning strategy were used to identify and characterize 5' nucleotide sequences on the maize stripe virus (MStV) RNA4 mRNA transcripts encoding the major noncapsid protein (NCP). Direct RNA sequence analysis by primer extension showed that the NCP mRNA transcripts had 10-15 nucleotides beyond the 5' terminus of the MStV RNA4 nucleotide sequence. MStV genomic RNAs isolated from ribonucleoprotein particles (RNPs) lacked the additional 5' nucleotides. cDNA clones representing the 5' region of the mRNA transcripts were constructed, and the nucleotide sequences of the 5' regions were determined for 16 clones. Each was found to have a distinct 10-15 nucleotide sequence immediately 5' of the MStV RNA4 sequence. Eleven of 16 clones had the correct MStV RNA4 5' nucleotide sequence, while five showed minor variations at or near the 5' most MStV RNA4 nucleotide. These characteristics show strong similarities to other viral mRNA transcripts which are synthesized by cap snatching.
Rule, G S; Pratt, E A; Chin, C C; Wold, F; Ho, C
1985-01-01
Recombinant DNA plasmids containing the gene for the membrane-bound D-lactate dehydrogenase (D-LDH) of Escherichia coli linked to the promoter PL from lambda were constructed. After induction, the levels of D-LDH were elevated 300-fold over that of the wild type and amounted to 35% of the total cellular protein. The nucleotide sequence of the D-LDH gene was determined and shown to agree with the amino acid composition and the amino-terminal sequence of the purified enzyme. Removal of the amino-terminal formyl-Met from D-LDH was not inhibited in cells which contained these high levels of D-LDH. Images PMID:3882663
Eastwood, Heather; Xia, Fang; Lo, Mei-Chu; Zhou, Jing; Jordan, John B; McCarter, John; Barnhart, Wesley W; Gahm, Kyung-Hyun
2015-11-10
Analysis of nucleotide sugars, nucleoside di- and triphosphates and sugar-phosphates is an essential step in the process of understanding enzymatic pathways. A facile and rapid separation method was developed to analyze these compounds present in an enzymatic reaction mixture utilized to produce nucleotide sugars. The Primesep SB column explored in this study utilizes hydrophobic interactions as well as electrostatic interactions with the phosphoric portion of the nucleotide sugars. Ammonium formate buffer was selected due to its compatibility with mass spectrometry. Negative ion mode mass spectrometry was adopted for detection of the sugar phosphate (fucose-1-phophate), as the compound is not amenable to UV detection. Various mobile phase conditions such as pH, buffer concentration and organic modifier were explored. The semi-preparative separation method was developed to prepare 30mg of the nucleotide sugar. (19)F NMR was utilized to determine purity of the purified fluorinated nucleotide sugar. The collected nucleotide sugar was found to be 99% pure. Published by Elsevier B.V.
Clonal architecture of secondary acute myeloid leukemia defined by single-cell sequencing.
Hughes, Andrew E O; Magrini, Vincent; Demeter, Ryan; Miller, Christopher A; Fulton, Robert; Fulton, Lucinda L; Eades, William C; Elliott, Kevin; Heath, Sharon; Westervelt, Peter; Ding, Li; Conrad, Donald F; White, Brian S; Shao, Jin; Link, Daniel C; DiPersio, John F; Mardis, Elaine R; Wilson, Richard K; Ley, Timothy J; Walter, Matthew J; Graubert, Timothy A
2014-07-01
Next-generation sequencing has been used to infer the clonality of heterogeneous tumor samples. These analyses yield specific predictions-the population frequency of individual clones, their genetic composition, and their evolutionary relationships-which we set out to test by sequencing individual cells from three subjects diagnosed with secondary acute myeloid leukemia, each of whom had been previously characterized by whole genome sequencing of unfractionated tumor samples. Single-cell mutation profiling strongly supported the clonal architecture implied by the analysis of bulk material. In addition, it resolved the clonal assignment of single nucleotide variants that had been initially ambiguous and identified areas of previously unappreciated complexity. Accordingly, we find that many of the key assumptions underlying the analysis of tumor clonality by deep sequencing of unfractionated material are valid. Furthermore, we illustrate a single-cell sequencing strategy for interrogating the clonal relationships among known variants that is cost-effective, scalable, and adaptable to the analysis of both hematopoietic and solid tumors, or any heterogeneous population of cells.
Evolving nucleotide binding surfaces
NASA Technical Reports Server (NTRS)
Kieber-Emmons, T.; Rein, R.
1981-01-01
An analysis is presented of the stability and nature of binding of a nucleotide to several known dehydrogenases. The employed approach includes calculation of hydrophobic stabilization of the binding motif and its intermolecular interaction with the ligand. The evolutionary changes of the binding motif are studied by calculating the Euclidean deviation of the respective dehydrogenases. Attention is given to the possible structural elements involved in the origin of nucleotide recognition by non-coded primordial polypeptides.
Genotyping by Sequencing in Almond: SNP Discovery, Linkage Mapping, and Marker Design
Goonetilleke, Shashi N.; March, Timothy J.; Wirthensohn, Michelle G.; Arús, Pere; Walker, Amanda R.; Mather, Diane E.
2017-01-01
In crop plant genetics, linkage maps provide the basis for the mapping of loci that affect important traits and for the selection of markers to be applied in crop improvement. In outcrossing species such as almond (Prunus dulcis Mill. D. A. Webb), application of a double pseudotestcross mapping approach to the F1 progeny of a biparental cross leads to the construction of a linkage map for each parent. Here, we report on the application of genotyping by sequencing to discover and map single nucleotide polymorphisms in the almond cultivars “Nonpareil” and “Lauranne.” Allele-specific marker assays were developed for 309 tag pairs. Application of these assays to 231 Nonpareil × Lauranne F1 progeny provided robust linkage maps for each parent. Analysis of phenotypic data for shell hardness demonstrated the utility of these maps for quantitative trait locus mapping. Comparison of these maps to the peach genome assembly confirmed high synteny and collinearity between the peach and almond genomes. The marker assays were applied to progeny from several other Nonpareil crosses, providing the basis for a composite linkage map of Nonpareil. Applications of the assays to a panel of almond clones and a panel of rootstocks used for almond production demonstrated the broad applicability of the markers and provide subsets of markers that could be used to discriminate among accessions. The sequence-based linkage maps and single nucleotide polymorphism assays presented here could be useful resources for the genetic analysis and genetic improvement of almond. PMID:29141988
Neugebauer, Tomasz; Bordeleau, Eric; Burrus, Vincent; Brzezinski, Ryszard
2015-01-01
Data visualization methods are necessary during the exploration and analysis activities of an increasingly data-intensive scientific process. There are few existing visualization methods for raw nucleotide sequences of a whole genome or chromosome. Software for data visualization should allow the researchers to create accessible data visualization interfaces that can be exported and shared with others on the web. Herein, novel software developed for generating DNA data visualization interfaces is described. The software converts DNA data sets into images that are further processed as multi-scale images to be accessed through a web-based interface that supports zooming, panning and sequence fragment selection. Nucleotide composition frequencies and GC skew of a selected sequence segment can be obtained through the interface. The software was used to generate DNA data visualization of human and bacterial chromosomes. Examples of visually detectable features such as short and long direct repeats, long terminal repeats, mobile genetic elements, heterochromatic segments in microbial and human chromosomes, are presented. The software and its source code are available for download and further development. The visualization interfaces generated with the software allow for the immediate identification and observation of several types of sequence patterns in genomes of various sizes and origins. The visualization interfaces generated with the software are readily accessible through a web browser. This software is a useful research and teaching tool for genetics and structural genomics.
Analysis of single nucleotide polymorphisms in case-control studies.
Li, Yonghong; Shiffman, Dov; Oberbauer, Rainer
2011-01-01
Single nucleotide polymorphisms (SNPs) are the most common type of genetic variants in the human genome. SNPs are known to modify susceptibility to complex diseases. We describe and discuss methods used to identify SNPs associated with disease in case-control studies. An outline on study population selection, sample collection and genotyping platforms is presented, complemented by SNP selection, data preprocessing and analysis.
Matsuda, M; Tai, K; Moore, J E; Millar, B C; Murayama, O
2004-01-01
Nucleotide sequencing after TA cloning of the amplicon of the almost-full length recA gene from three strains of UPTC (A1, A2, and A3) isolated from seagulls in Northern Ireland, the phenotypical and genotypical characteristics of which have been demonstrated to be indistinguishable, clarified nucleotide differences at three nucleotide positions among the three strains. In conclusion, the nucleotide sequences of the recA gene were found to discriminate among the three strains of UPTC, A1, A2, and A3, which are indistinguishable phenotypically and genotypically. Thus, the present study strongly suggests that nucleotide sequence data of the amplicon of a suitable gene or region could aid in discriminating among isolates of the UPTC group, which are indistinguishable phenotypically and genotypically. Copyright 2004 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim
An, Jianyu; Yin, Mengqi; Zhang, Qin; Gong, Dongting; Jia, Xiaowen; Guan, Yajing; Hu, Jin
2017-09-11
Luffa cylindrica (L.) Roem. is an economically important vegetable crop in China. However, the genomic information on this species is currently unknown. In this study, for the first time, a genome survey of L. cylindrica was carried out using next-generation sequencing (NGS) technology. In total, 43.40 Gb sequence data of L. cylindrica , about 54.94× coverage of the estimated genome size of 789.97 Mb, were obtained from HiSeq 2500 sequencing, in which the guanine plus cytosine (GC) content was calculated to be 37.90%. The heterozygosity of genome sequences was only 0.24%. In total, 1,913,731 contigs (>200 bp) with 525 bp N 50 length and 1,410,117 scaffolds (>200 bp) with 885.01 Mb total length were obtained. From the initial assembled L. cylindrica genome, 431,234 microsatellites (SSRs) (≥5 repeats) were identified. The motif types of SSR repeats included 62.88% di-nucleotide, 31.03% tri-nucleotide, 4.59% tetra-nucleotide, 0.96% penta-nucleotide and 0.54% hexa-nucleotide. Eighty genomic SSR markers were developed, and 51/80 primers could be used in both "Zheda 23" and "Zheda 83". Nineteen SSRs were used to investigate the genetic diversity among 32 accessions through SSR-HRM analysis. The unweighted pair group method analysis (UPGMA) dendrogram tree was built by calculating the SSR-HRM raw data. SSR-HRM could be effectively used for genotype relationship analysis of Luffa species.
Morton, B R; Oberholzer, V M; Clegg, M T
1997-09-01
Substitutions occurring in noncoding sequences of the plant chloroplast genome violate the independence of sites that is assumed by substitution models in molecular evolution. The probability that a substitution at a site is a transversion, as opposed to a transition, increases significantly with increasing A + T content of the two adjacent nucleotides. In the present study, this dependency of substitutions on local context is examined further in a number of noncoding regions from the chloroplast genome of members of the grass family (Poaceae). Two features were examined; the influence of specific neighboring bases, as opposed to the general A + T content, on transversion proportion and an influence on substitutions by nucleotides other than the two immediately adjacent to the site of substitution. In both cases, a significant effect was found. In the case of specific nucleotides, transversion proportion is significantly higher at sites with a pyrimidine immediately 5' on either strand. Substitutions at sites of the type YNR, where N is the site of substitution, have the highest rate of transversion. This specific effect is secondary to the A + T content effect such that, in terms of proportion of substitutions that are transversions, the nucleotides are ranked T > A > C > G as to their effect when they are immediately 5' to the site of substitution. In the case of nucleotides other than the immediate neighbors, a significant influence on substitution dynamics is observed in the case where the two neighboring bases are both A and/or T. Thus, substitutions are primarily, but not exclusively, influenced by the composition of the two nucleotides that are immediately adjacent. These results indicate that the pattern of molecular evolution of the plant chloroplast genome is extremely complex as a result of a variety of inter-site dependencies.
Rancour, David M.; Hatfield, Ronald D.; Marita, Jane M.; Rohr, Nicholas A.; Schmitz, Robert J.
2015-01-01
Nucleotide-activated sugars are essential substrates for plant cell-wall carbohydrate-polymer biosynthesis. The most prevalent grass cell wall (CW) sugars are glucose (Glc), xylose (Xyl), and arabinose (Ara). These sugars are biosynthetically related via the UDP–sugar interconversion pathway. We sought to target and generate UDP–sugar interconversion pathway transgenic Brachypodium distachyon lines resulting in CW carbohydrate composition changes with improved digestibility and normal plant stature. Both RNAi-mediated gene-suppression and constitutive gene-expression approaches were performed. CWs from 336 T0 transgenic plants with normal appearance were screened for complete carbohydrate composition. RNAi mutants of BdRGP1, a UDP-arabinopyranose mutase, resulted in large alterations in CW carbohydrate composition with significant decreases in CW Ara content but with minimal change in plant stature. Five independent RNAi-RGP1 T1 plant lines were used for in-depth analysis of plant CWs. Real-time PCR analysis indicated that gene expression levels for BdRGP1, BdRGP2, and BdRGP3 were reduced in RNAi-RGP1 plants to 15–20% of controls. CW Ara content was reduced by 23–51% of control levels. No alterations in CW Xyl and Glc content were observed. Corresponding decreases in CW ferulic acid (FA) and ferulic acid-dimers (FA-dimers) were observed. Additionally, CW p-coumarates (pCA) were decreased. We demonstrate the CW pCA decrease corresponds to Ara-coupled pCA. Xylanase-mediated digestibility of RNAi-RGP1 Brachypodium CWs resulted in a near twofold increase of released total carbohydrate. However, cellulolytic hydrolysis of CW material was inhibited in leaves of RNAi-RGP1 mutants. Our results indicate that targeted manipulation of UDP–sugar biosynthesis can result in biomass with substantially altered compositions and highlights the complex effect CW composition has on digestibility. PMID:26136761
Arvier, Matthieu; Lagoutte, Laëtitia; Johnson, Gyasi; Dumas, Jean-François; Sion, Benoit; Grizard, Genevieve; Malthièry, Yves; Simard, Gilles; Ritz, Patrick
2007-11-01
The composition of the mitochondrial inner membrane and uncoupling protein [such as adenine nucleotide translocator (ANT)] contents are the main factors involved in the energy-wasting proton leak. This leak is increased by glucocorticoid treatment under nonphosphorylating conditions. The aim of this study was to investigate mechanisms involved in glucocorticoid-induced proton leak and to evaluate the consequences in more physiological conditions (between states 4 and 3). Isolated liver mitochondria, obtained from dexamethasone-treated rats (1.5 mg.kg(-1).day(-1)), were studied by polarography, Western blotting, and high-performance thin-layer chromatography. We confirmed that dexamethasone treatment in rats induces a proton leak in state 4 that is associated with an increased ANT content, although without any change in membrane surface or lipid composition. Between states 4 and 3, dexamethasone stimulates ATP synthesis by increasing both the mitochondrial ANT and F1-F0 ATP synthase content. In conclusion, dexamethasone increases mitochondrial capacity to generate ATP by modifying ANT and ATP synthase. The side effect is an increased leak in nonphosphorylating conditions.
Uncovering the polymerase-induced cytotoxicity of an oxidized nucleotide
Freudenthal, Bret D.; Beard, William A.; Perera, Lalith; ...
2014-11-17
Oxidative stress promotes genomic instability and human diseases. A common oxidized nucleoside is 8-oxo-7,8-dihydro-2’-deoxyguanosine found both in DNA (8-oxo-G) and as a free nucleotide (8-oxo-dGTP). Nucleotide pools are especially vulnerable to oxidative damage. Therefore cells encode an enzyme (MutT/MTH1) that removes free oxidized nucleotides. This cleansing function is required for cancer cell survival and to modulate E. coli antibiotic sensitivity in a DNA polymerase (pol)-dependent manner. How polymerase discriminates between damaged and non-damaged nucleotides is not well understood. This analysis is essential given the role of oxidized nucleotides in mutagenesis, cancer therapeutics, and bacterial antibiotics. Even with cellular sanitizing activities,more » nucleotide pools contain enough 8-oxo-dGTP to promote mutagenesis. This arises from the dual coding potential where 8-oxo-dGTP(anti) base pairs with cytosine (Cy) and 8-oxodGTP(syn) utilizes its Hoogsteen edge to base pair with adenine (Ad). Here in this paper we utilized time-lapse crystallography to follow 8-oxo-dGTP insertion opposite Ad or Cy with human DNA pol β, to reveal that insertion is accommodated in either the syn- or anti-conformation, respectively. For 8-oxo-dGTP(anti) insertion, a novel divalent metal relieves repulsive interactions between the adducted guanine base and the triphosphate of the oxidized nucleotide. With either templating base, hydrogen bonding interactions between the bases are lost as the enzyme reopens after catalysis, leading to a cytotoxic nicked DNA repair intermediate. Combining structural snapshots with kinetic and computational analysis reveals how 8-oxodGTP utilizes charge modulation during insertion that can lead to a blocked DNA repair intermediate.« less
Uncovering the polymerase-induced cytotoxicity of an oxidized nucleotide
NASA Astrophysics Data System (ADS)
Freudenthal, Bret D.; Beard, William A.; Perera, Lalith; Shock, David D.; Kim, Taejin; Schlick, Tamar; Wilson, Samuel H.
2015-01-01
Oxidative stress promotes genomic instability and human diseases. A common oxidized nucleoside is 8-oxo-7,8-dihydro-2'-deoxyguanosine, which is found both in DNA (8-oxo-G) and as a free nucleotide (8-oxo-dGTP). Nucleotide pools are especially vulnerable to oxidative damage. Therefore cells encode an enzyme (MutT/MTH1) that removes free oxidized nucleotides. This cleansing function is required for cancer cell survival and to modulate Escherichia coli antibiotic sensitivity in a DNA polymerase (pol)-dependent manner. How polymerases discriminate between damaged and non-damaged nucleotides is not well understood. This analysis is essential given the role of oxidized nucleotides in mutagenesis, cancer therapeutics, and bacterial antibiotics. Even with cellular sanitizing activities, nucleotide pools contain enough 8-oxo-dGTP to promote mutagenesis. This arises from the dual coding potential where 8-oxo-dGTP(anti) base pairs with cytosine and 8-oxo-dGTP(syn) uses its Hoogsteen edge to base pair with adenine. Here we use time-lapse crystallography to follow 8-oxo-dGTP insertion opposite adenine or cytosine with human pol β, to reveal that insertion is accommodated in either the syn- or anti-conformation, respectively. For 8-oxo-dGTP(anti) insertion, a novel divalent metal relieves repulsive interactions between the adducted guanine base and the triphosphate of the oxidized nucleotide. With either templating base, hydrogen-bonding interactions between the bases are lost as the enzyme reopens after catalysis, leading to a cytotoxic nicked DNA repair intermediate. Combining structural snapshots with kinetic and computational analysis reveals how 8-oxo-dGTP uses charge modulation during insertion that can lead to a blocked DNA repair intermediate.
Conservation of the structure and organization of lupin mitochondrial nad3 and rps12 genes.
Rurek, M; Oczkowski, M; Augustyniak, H
1998-01-01
A high level of the nucleotide sequence conservation of mitochondrial nad3 and rps12 genes was found in four lupin species. The only differences concern three nucleotides in the Lupinus albus rps12 gene and three nucleotides insertion in the L. mutabilis spacer. Northern blot analysis as well as RT-PCR confirmed cotranscription of the L. luteus genes because the transcripts detected were long enough.
A novel MALDI–TOF based methodology for genotyping single nucleotide polymorphisms
Blondal, Thorarinn; Waage, Benedikt G.; Smarason, Sigurdur V.; Jonsson, Frosti; Fjalldal, Sigridur B.; Stefansson, Kari; Gulcher, Jeffery; Smith, Albert V.
2003-01-01
A new MALDI–TOF based detection assay was developed for analysis of single nucleotide polymorphisms (SNPs). It is a significant modification on the classic three-step minisequencing method, which includes a polymerase chain reaction (PCR), removal of excess nucleotides and primers, followed by primer extension in the presence of dideoxynucleotides using modified thermostable DNA polymerase. The key feature of this novel assay is reliance upon deoxynucleotide mixes, lacking one of the nucleotides at the polymorphic position. During primer extension in the presence of depleted nucleotide mixes, standard thermostable DNA polymerases dissociate from the template at positions requiring a depleted nucleotide; this principal was harnessed to create a genotyping assay. The assay design requires a primer- extension primer having its 3′-end one nucleotide upstream from the interrogated site. The assay further utilizes the same DNA polymerase in both PCR and the primer extension step. This not only simplifies the assay but also greatly reduces the cost per genotype compared to minisequencing methodology. We demonstrate accurate genotyping using this methodology for two SNPs run in both singleplex and duplex reactions. We term this assay nucleotide depletion genotyping (NUDGE). Nucleotide depletion genotyping could be extended to other genotyping assays based on primer extension such as detection by gel or capillary electrophoresis. PMID:14654708
Cuomo, Francesca; Mosca, Monica; Murgia, Sergio; Avino, Pasquale; Ceglie, Andrea; Lopez, Francesco
2013-11-15
In this work, the interaction of nucleotide-monophosphates (NMPs) with unilamellar liposomes made of 1,2-Dioleoyl-3-Trimethylammonium-Propane (DOTAP) and 1,2-Dioleoyl-sn-Glycero-3-Phosphoethanolamine (DOPE) was investigated. Here, we demonstrate how adsorption is affected by the type of nucleotide-monophosphate. Dynamic light scattering (DLS) results revealed, for each NMP, that a distinguishable concentration exists at which a significant growth of the aggregates occurs. Adenosine 5'-monophosphate (AMP) and guanosine 5'-monophosphate (GMP) have shown a higher propensity to induce liposome aggregation process and in particular GMP appears to be the most effective. From ζ-potential experiments we found that liposomes loaded with purine based nucleotides (AMP and GMP) are able to decrease the ζ-potential values to a greater extent in comparison with the pyrimidine based nucleotides thimydine 5'-monophosphate (TMP) and uridine 5'-monophosphate (UMP). Moreover, a careful analysis of nucleotide-liposome interactions revealed that nucleotides have different capacity to induce the formation of nucleotide-liposome complexes, and purine based nucleotides have higher affinities with lipid membranes. On the whole, the data emphasize that the mechanisms driving the interactions between liposomes and NMPs are also influenced by the existence of hydrophobic forces. Copyright © 2013 Elsevier Inc. All rights reserved.
Kondo, Jiro; Westhof, Eric
2011-10-01
Nucleotide bases are recognized by amino acid residues in a variety of DNA/RNA binding and nucleotide binding proteins. In this study, a total of 446 crystal structures of nucleotide-protein complexes are analyzed manually and pseudo pairs together with single and bifurcated hydrogen bonds observed between bases and amino acids are classified and annotated. Only 5 of the 20 usual amino acid residues, Asn, Gln, Asp, Glu and Arg, are able to orient in a coplanar fashion in order to form pseudo pairs with nucleotide bases through two hydrogen bonds. The peptide backbone can also form pseudo pairs with nucleotide bases and presents a strong bias for binding to the adenine base. The Watson-Crick side of the nucleotide bases is the major interaction edge participating in such pseudo pairs. Pseudo pairs between the Watson-Crick edge of guanine and Asp are frequently observed. The Hoogsteen edge of the purine bases is a good discriminatory element in recognition of nucleotide bases by protein side chains through the pseudo pairing: the Hoogsteen edge of adenine is recognized by various amino acids while the Hoogsteen edge of guanine is only recognized by Arg. The sugar edge is rarely recognized by either the side-chain or peptide backbone of amino acid residues.
Replication-associated mutational asymmetry in the human genome.
Chen, Chun-Long; Duquenne, Lauranne; Audit, Benjamin; Guilbaud, Guillaume; Rappailles, Aurélien; Baker, Antoine; Huvet, Maxime; d'Aubenton-Carafa, Yves; Hyrien, Olivier; Arneodo, Alain; Thermes, Claude
2011-08-01
During evolution, mutations occur at rates that can differ between the two DNA strands. In the human genome, nucleotide substitutions occur at different rates on the transcribed and non-transcribed strands that may result from transcription-coupled repair. These mutational asymmetries generate transcription-associated compositional skews. To date, the existence of such asymmetries associated with replication has not yet been established. Here, we compute the nucleotide substitution matrices around replication initiation zones identified as sharp peaks in replication timing profiles and associated with abrupt jumps in the compositional skew profile. We show that the substitution matrices computed in these regions fully explain the jumps in the compositional skew profile when crossing initiation zones. In intergenic regions, we observe mutational asymmetries measured as differences between complementary substitution rates; their sign changes when crossing initiation zones. These mutational asymmetries are unlikely to result from cryptic transcription but can be explained by a model based on replication errors and strand-biased repair. In transcribed regions, mutational asymmetries associated with replication superimpose on the previously described mutational asymmetries associated with transcription. We separate the substitution asymmetries associated with both mechanisms, which allows us to determine for the first time in eukaryotes, the mutational asymmetries associated with replication and to reevaluate those associated with transcription. Replication-associated mutational asymmetry may result from unequal rates of complementary base misincorporation by the DNA polymerases coupled with DNA mismatch repair (MMR) acting with different efficiencies on the leading and lagging strands. Replication, acting in germ line cells during long evolutionary times, contributed equally with transcription to produce the present abrupt jumps in the compositional skew. These results demonstrate that DNA replication is one of the major processes that shape human genome composition.
Genomic diversity of the human intestinal parasite Entamoeba histolytica
2012-01-01
Background Entamoeba histolytica is a significant cause of disease worldwide. However, little is known about the genetic diversity of the parasite. We re-sequenced the genomes of ten laboratory cultured lines of the eukaryotic pathogen Entamoeba histolytica in order to develop a picture of genetic diversity across the genome. Results The extreme nucleotide composition bias and repetitiveness of the E. histolytica genome provide a challenge for short-read mapping, yet we were able to define putative single nucleotide polymorphisms in a large portion of the genome. The results suggest a rather low level of single nucleotide diversity, although genes and gene families with putative roles in virulence are among the more polymorphic genes. We did observe large differences in coverage depth among genes, indicating differences in gene copy number between genomes. We found evidence indicating that recombination has occurred in the history of the sequenced genomes, suggesting that E. histolytica may reproduce sexually. Conclusions E. histolytica displays a relatively low level of nucleotide diversity across its genome. However, large differences in gene family content and gene copy number are seen among the sequenced genomes. The pattern of polymorphism indicates that E. histolytica reproduces sexually, or has done so in the past, which has previously been suggested but not proven. PMID:22630046
Jiang, Shao-Tong; Hong, Gui-Yun; Yu, Miao; Li, Na; Yang, Ying; Liu, Yan-Qun; Wei, Zhao-Jun
2009-05-22
The complete mitochondrial genome (mitogenome) of Eriogyna pyretorum (Lepidoptera: Saturniidae) was determined as being composed of 15,327 base pairs (bp), including 13 protein-coding genes (PCGs), 2 rRNA genes, 22 tRNA genes, and a control region. The arrangement of the PCGs is the same as that found in the other sequenced lepidopteran. The AT skewness for the E. pyretorum mitogenome is slightly negative (-0.031), indicating the occurrence of more Ts than As. The nucleotide composition of the E. pyretorum mitogenome is also biased toward A + T nucleotides (80.82%). All PCGs are initiated by ATN codons, except for cytochrome c oxidase subunit 1 and 2 (cox1 and cox2). Two of the 13 PCGs harbor the incomplete termination codon by T. All tRNA genes have a typical clover-leaf structure of mitochondrial tRNA, with the exception of trnS1(AGN) and trnS2(UCN). Phylogenetic analysis among the available lepidopteran species supports the current morphology-based hypothesis that Bombycoidea, Geometroidea, Notodontidea, Papilionoidea and Pyraloidea are monophyletic. As has been previously suggested, Bombycidae (Bombyx mori and Bombyx mandarina), Sphingoidae (Manduca sexta) and Saturniidae (Antheraea pernyi, Antheraea yamamai, E. pyretorum and Caligula boisduvalii) formed a group.
Jiang, Shao-Tong; Hong, Gui-Yun; Yu, Miao; Li, Na; Yang, Ying; Liu, Yan-Qun; Wei, Zhao-Jun
2009-01-01
The complete mitochondrial genome (mitogenome) of Eriogyna pyretorum (Lepidoptera: Saturniidae) was determined as being composed of 15,327 base pairs (bp), including 13 protein-coding genes (PCGs), 2 rRNA genes, 22 tRNA genes, and a control region. The arrangement of the PCGs is the same as that found in the other sequenced lepidopteran. The AT skewness for the E. pyretorum mitogenome is slightly negative (-0.031), indicating the occurrence of more Ts than As. The nucleotide composition of the E. pyretorum mitogenome is also biased toward A + T nucleotides (80.82%). All PCGs are initiated by ATN codons, except for cytochrome c oxidase subunit 1 and 2 (cox1 and cox2). Two of the 13 PCGs harbor the incomplete termination codon by T. All tRNA genes have a typical clover-leaf structure of mitochondrial tRNA, with the exception of trnS1(AGN) and trnS2(UCN). Phylogenetic analysis among the available lepidopteran species supports the current morphology-based hypothesis that Bombycoidea, Geometroidea, Notodontidea, Papilionoidea and Pyraloidea are monophyletic. As has been previously suggested, Bombycidae (Bombyx mori and Bombyx mandarina), Sphingoidae (Manduca sexta) and Saturniidae (Antheraea pernyi, Antheraea yamamai, E. pyretorum and Caligula boisduvalii) formed a group. PMID:19471586
Yang, Seung Hak; Lim, Joung Soo; Khan, Modabber Ahmed; Kim, Bong Soo; Choi, Dong Yoon; Lee, Eun Young; Ahn, Hee Kwon
2015-01-01
The leachate generated by the decomposition of animal carcass has been implicated as an environmental contaminant surrounding the burial site. High-throughput nucleotide sequencing was conducted to investigate the bacterial communities in leachates from the decomposition of pig carcasses. We acquired 51,230 reads from six different samples (1, 2, 3, 4, 6 and 14 week-old carcasses) and found that sequences representing the phylum Firmicutes predominated. The diversity of bacterial 16S rRNA gene sequences in the leachate was the highest at 6 weeks, in contrast to those at 2 and 14 weeks. The relative abundance of Firmicutes was reduced, while the proportion of Bacteroidetes and Proteobacteria increased from 3–6 weeks. The representation of phyla was restored after 14 weeks. However, the community structures between the samples taken at 1–2 and 14 weeks differed at the bacterial classification level. The trend in pH was similar to the changes seen in bacterial communities, indicating that the pH of the leachate could be related to the shift in the microbial community. The results indicate that the composition of bacterial communities in leachates of decomposing pig carcasses shifted continuously during the study period and might be influenced by the burial site. PMID:26500442
Detecting and Removing Ascertainment Bias in Microsatellites from the HGDP-CEPH Panel
Eriksson, Anders; Manica, Andrea
2011-01-01
Although ascertainment bias in single nucleotide polymorphisms is a well-known problem, it is generally accepted that microsatellites have mutation rates too high for bias to be a concern. Here, we analyze in detail the large set of microsatellites typed for the Human Genetic Diversity Panel (HGDP)-CEPH panel. We develop a novel framework based on rarefaction to compare heterozygosity across markers with different mutation rates. We find that, whereas di- and tri-nucleotides show similar patterns of within- and between-population heterozygosity, tetra-nucleotides are inconsistent with the other two motifs. In addition, di- and tri-nucleotides are consistent with 16 unbiased tetra-nucleotide markers, whereas the HPGP-CEPH tetra-nucleotides are significantly different. This discrepancy is due to the HGDP-CEPH tetra-nucleotides being too homogeneous across Eurasia, even after their slower mutation rate is taken into account by rarefying the other markers. The most likely explanation for this pattern is ascertainment bias. We strongly advocate the exclusion of tetra-nucleotides from future population genetics analysis of this dataset, and we argue that other microsatellite datasets should be investigated for the presence of bias using the approach outlined in this article. PMID:22384358
De Laurentiis, Evelina Ines; Mercier, Evan; Wieden, Hans-Joachim
2016-10-28
Little is known about the conservation of critical kinetic parameters and the mechanistic strategies of elongation factor (EF) Ts-catalyzed nucleotide exchange in EF-Tu in bacteria and particularly in clinically relevant pathogens. EF-Tu from the clinically relevant pathogen Pseudomonas aeruginosa shares over 84% sequence identity with the corresponding elongation factor from Escherichia coli Interestingly, the functionally closely linked EF-Ts only shares 55% sequence identity. To identify any differences in the nucleotide binding properties, as well as in the EF-Ts-mediated nucleotide exchange reaction, we performed a comparative rapid kinetics and mutagenesis analysis of the nucleotide exchange mechanism for both the E. coli and P. aeruginosa systems, identifying helix 13 of EF-Ts as a previously unnoticed regulatory element in the nucleotide exchange mechanism with species-specific elements. Our findings support the base side-first entry of the nucleotide into the binding pocket of the EF-Tu·EF-Ts binary complex, followed by displacement of helix 13 and rapid binding of the phosphate side of the nucleotide, ultimately leading to the release of EF-Ts. © 2016 by The American Society for Biochemistry and Molecular Biology, Inc.
Array of nucleic acid probes on biological chips for diagnosis of HIV and methods of using the same
Chee, Mark; Gingeras, Thomas R.; Fodor, Stephen P. A.; Hubble, Earl A.; Morris, MacDonald S.
1999-01-19
The invention provides an array of oligonucleotide probes immobilized on a solid support for analysis of a target sequence from a human immunodeficiency virus. The array comprises at least four sets of oligonucleotide probes 9 to 21 nucleotides in length. A first probe set has a probe corresponding to each nucleotide in a reference sequence from a human immunodeficiency virus. A probe is related to its corresponding nucleotide by being exactly complementary to a subsequence of the reference sequence that includes the corresponding nucleotide. Thus, each probe has a position, designated an interrogation position, that is occupied by a complementary nucleotide to the corresponding nucleotide. The three additional probe sets each have a corresponding probe for each probe in the first probe set. Thus, for each nucleotide in the reference sequence, there are four corresponding probes, one from each of the probe sets. The three corresponding probes in the three additional probe sets are identical to the corresponding probe from the first probe or a subsequence thereof that includes the interrogation position, except that the interrogation position is occupied by a different nucleotide in each of the four corresponding probes.
Keppetipola, Niroshika; Shuman, Stewart
2007-01-01
Clostridium thermocellum polynucleotide kinase-phosphatase (CthPnkp) catalyzes 5′ and 3′ end-healing reactions that prepare broken RNA termini for sealing by RNA ligase. The central phosphatase domain of CthPnkp belongs to the dinuclear metallophosphoesterase superfamily exemplified by bacteriophage λ phosphatase (λ-Pase). CthPnkp is a Ni2+/Mn2+-dependent phosphodiesterase-monoesterase, active on nucleotide and non-nucleotide substrates, that can be transformed toward narrower metal and substrate specificities via mutations of the active site. Here we characterize the Mn2+-dependent 2′,3′ cyclic nucleotide phosphodiesterase activity of CthPnkp, the reaction most relevant to RNA repair pathways. We find that CthPnkp prefers a 2′,3′ cyclic phosphate to a 3′,5′ cyclic phosphate. A single H189D mutation imposes strict specificity for a 2′,3′ cyclic phosphate, which is cleaved to form a single 2′-NMP product. Analysis of the cyclic phosphodiesterase activities of mutated CthPnkp enzymes illuminates the active site and the structural features that affect substrate affinity and kcat. We also characterize a previously unrecognized phosphodiesterase activity of λ-Pase, which catalyzes hydrolysis of bis-p-nitrophenyl phosphate. λ-Pase also has cyclic phosphodiesterase activity with nucleoside 2′,3′ cyclic phosphates, which it hydrolyzes to yield a mixture of 2′-NMP and 3′-NMP products. We discuss our results in light of available structural and functional data for other phosphodiesterase members of the binuclear metallophosphoesterase family and draw inferences about how differences in active site composition influence catalytic repertoire. PMID:17986465
Vigne, Emmanuelle; Bergdoll, Marc; Guyader, Sébastien; Fuchs, Marc
2004-08-01
The nematode-borne Grapevine fanleaf virus, from the genus Nepovirus in the family Comoviridae, causes severe degeneration of grapevines in most vineyards worldwide. We characterized 347 isolates from transgenic and conventional grapevines from two vineyard sites in the Champagne region of France for their molecular variant composition. The population structure and genetic diversity were examined in the coat protein gene by IC-RT-PCR-RFLP analysis with EcoRI and StyI, and nucleotide sequencing, respectively. RFLP data suggested that 55 % (191 of 347) of the isolates had a population structure consisting of one predominant variant. Sequencing data of 51 isolates representing the different restrictotypes confirmed the existence of mixed infection with a frequency of 33 % (17 of 51) and showed two major predominant haplotypes representing 71 % (60 of 85) of the sequence variants. Comparative nucleotide diversity among population subsets implied a lack of genetic differentiation according to host (transgenic vs conventional) or field site for most restrictotypes (17 of 18 and 13 of 18) and for haplotypes in most phylogenetic groups (seven of eight and six of eight), respectively. Interestingly, five of the 85 haplotypes sequenced had an intermediate divergence (0.036-0.066) between the lower (0.005-0.028) and upper range (0.083-0.138) of nucleotide variability, suggesting the occurrence of homologous RNA recombination. Sequence alignments clearly indicated a mosaic structure for four of these five variants, for which recombination sites were identified and parental lineages proposed. This is the first in-depth characterization of the population structure and genetic diversity in a nepovirus.
Body mass index modulates aromatic DNA adduct levels and their persistence in smokers.
Godschalk, Roger W L; Feldker, Dorien E M; Borm, Paul J A; Wouters, Emiel F M; van Schooten, Frederik-Jan
2002-08-01
Smokers with a low body mass index (BMI; weight/height(2)) have a higher risk for developing lung malignancies as compared with smokers of average weight, but there is no mechanistic explanation for this observation. Carcinogens in cigarette smoke are thought to elicit cancer by the formation of DNA adducts, which give the opportunity to additionally investigate the biological link between BMI and lung cancer. DNA adduct levels in peripheral blood lymphocytes of 24 healthy smoking volunteers (0.76 +/- 0.41 adducts per 10(8) nucleotides) positively correlated with cigarette consumption (r = 0.51; P = 0.01) and were inversely related with BMI (r = -0.48; P = 0.02). A significant overall relationship was observed when both parameters were included in multiple regression analysis (r = 0.63; P = 0.007). Moreover, body composition may affect DNA adduct persistence, because lipophilic tobacco smoke-derived carcinogens accumulate in adipose tissue and can be mobilized once exposure ceases. Therefore, DNA adduct levels and BMI were reassessed in all of the subjects after a nonsmoking period of 22 weeks. Adduct levels declined to 0.44 +/- 0.23 per 10(8) nucleotides (P = 0.002), and the estimated half-life was 11 weeks on the basis of exponential decay to background levels in never-smoking controls (0.33 +/- 0.18 per 10(8) nucleotides). Overweight subjects (BMI >25) with little weight gain after smoking cessation (
Lara-Ramírez, Edgar E.; Salazar, Ma Isabel; López-López, María de Jesús; Salas-Benito, Juan Santiago; Sánchez-Varela, Alejandro
2014-01-01
The increasing number of dengue virus (DENV) genome sequences available allows identifying the contributing factors to DENV evolution. In the present study, the codon usage in serotypes 1–4 (DENV1–4) has been explored for 3047 sequenced genomes using different statistics methods. The correlation analysis of total GC content (GC) with GC content at the three nucleotide positions of codons (GC1, GC2, and GC3) as well as the effective number of codons (ENC, ENCp) versus GC3 plots revealed mutational bias and purifying selection pressures as the major forces influencing the codon usage, but with distinct pressure on specific nucleotide position in the codon. The correspondence analysis (CA) and clustering analysis on relative synonymous codon usage (RSCU) within each serotype showed similar clustering patterns to the phylogenetic analysis of nucleotide sequences for DENV1–4. These clustering patterns are strongly related to the virus geographic origin. The phylogenetic dependence analysis also suggests that stabilizing selection acts on the codon usage bias. Our analysis of a large scale reveals new feature on DENV genomic evolution. PMID:25136631
Characterization of the porcine epidemic diarrhea virus codon usage bias.
Chen, Ye; Shi, Yuzhen; Deng, Hongjuan; Gu, Ting; Xu, Jian; Ou, Jinxin; Jiang, Zhiguo; Jiao, Yiren; Zou, Tan; Wang, Chong
2014-12-01
Porcine epidemic diarrhea virus (PEDV) has been responsible for several recent outbreaks of porcine epidemic diarrhea (PED) and has caused great economic loss in the swine-raising industry. Considering the significance of PEDV, a systemic analysis was performed to study its codon usage patterns. The relative synonymous codon usage value of each codon revealed that codon usage bias exists and that PEDV tends to use codons that end in T. The mean ENC value of 47.91 indicates that the codon usage bias is low. However, we still wanted to identify the cause of this codon usage bias. A correlation analysis between the codon compositions (A3s, T3s, G3s, C3s, and GC3s), the ENC values, and the nucleotide contents (A%, T%, G%, C%, and GC%) indicated that mutational bias plays role in shaping the PEDV codon usage bias. This was further confirmed by a principal component analysis between the codon compositions and the axis values. Using the Gravy, Aroma, and CAI values, a role of natural selection in the PEDV codon usage pattern was also identified. Neutral analysis indicated that natural selection pressure plays a more important role than mutational bias in codon usage bias. Natural selection also plays an increasingly significant role during PEDV evolution. Additionally, gene function and geographic distribution also influence the codon usage bias to a degree. Copyright © 2014 Elsevier B.V. All rights reserved.
Pal, Shilpee; Sarkar, Indrani; Roy, Ayan; Mohapatra, Pradeep K Das; Mondal, Keshab C; Sen, Arnab
2018-02-01
The present study has been aimed to the comparative analysis of high GC composition containing Corynebacterium genomes and their evolutionary study by exploring codon and amino acid usage patterns. Phylogenetic study by MLSA approach, indel analysis and BLAST matrix differentiated Corynebacterium species in pathogenic and non-pathogenic clusters. Correspondence analysis on synonymous codon usage reveals that, gene length, optimal codon frequencies and tRNA abundance affect the gene expression of Corynebacterium. Most of the optimal codons as well as translationally optimal codons are C ending i.e. RNY (R-purine, N-any nucleotide base, and Y-pyrimidine) and reveal translational selection pressure on codon bias of Corynebacterium. Amino acid usage is affected by hydrophobicity, aromaticity, protein energy cost, etc. Highly expressed genes followed the cost minimization hypothesis and are less diverged at their synonymous positions of codons. Functional analysis of core genes shows significant difference in pathogenic and non-pathogenic Corynebacterium. The study reveals close relationship between non-pathogenic and opportunistic pathogenic Corynebaterium as well as between molecular evolution and survival niches of the organism.
Trucco, Verónica; de Breuil, Soledad; Bejerman, Nicolás; Lenardon, Sergio; Giolitti, Fabián
2014-06-01
The complete nucleotide sequence of an Alfalfa mosaic virus (AMV) isolate infecting alfalfa (Medicago sativa L.) in Argentina, AMV-Arg, was determined. The virus genome has the typical organization described for AMV, and comprises 3,643, 2,593, and 2,038 nucleotides for RNA1, 2 and 3, respectively. The whole genome sequence and each encoding region were compared with those of other four isolates that have been completely sequenced from China, Italy, Spain and USA. The nucleotide identity percentages ranged from 95.9 to 99.1 % for the three RNAs and from 93.7 to 99 % for the protein 1 (P1), protein 2 (P2), movement protein and coat protein (CP) encoding regions, whereas the amino acid identity percentages of these proteins ranged from 93.4 to 99.5 %, the lowest value corresponding to P2. CP sequences of AMV-Arg were compared with those of other 25 available isolates, and the phylogenetic analysis based on the CP gene was carried out. The highest percentage of nucleotide sequence identity of the CP gene was 98.3 % with a Chinese isolate and 98.6 % at the amino acid level with four isolates, two from Italy, one from Brazil and the remaining one from China. The phylogenetic analysis showed that AMV-Arg is closely related to subgroup I of AMV isolates. To our knowledge, this is the first report of a complete nucleotide sequence of AMV from South America and the first worldwide report of complete nucleotide sequence of AMV isolated from alfalfa as natural host.
Bhatt, Vaibhav D; Dande, Suchitra S; Patil, Nitin V; Joshi, Chaitanya G
2013-04-01
Rumen microorganisms play an important role in ruminant digestion and absorption of nutrients and have great potential applications in the field of rumen adjusting, food fermentation and biomass utilization etc. In order to investigate the composition of microorganisms in the rumen of camel (Camelus dromedarius), this study delves in the microbial diversity by culture-independent approach. It includes comparison of rumen samples investigated in the present study to other currently available metagenomes to reveal potential differences in rumen microbial systems. Pyrosequencing based metagenomics was applied to analyze phylogenetic and metabolic profiles by MG-RAST, a web based tool. Pyrosequencing of camel rumen sample yielded 8,979,755 nucleotides assembled to 41,905 sequence reads with an average read length of 214 nucleotides. Taxonomic analysis of metagenomic reads indicated Bacteroidetes (55.5 %), Firmicutes (22.7 %) and Proteobacteria (9.2 %) phyla as predominant camel rumen taxa. At a finer phylogenetic resolution, Bacteroides species dominated the camel rumen metagenome. Functional analysis revealed that clustering-based subsystem and carbohydrate metabolism were the most abundant SEED subsystem representing 17 and 13 % of camel metagenome, respectively. A high taxonomic and functional similarity of camel rumen was found with the cow metagenome which is not surprising given the fact that both are mammalian herbivores with similar digestive tract structures and functions. Combined pyrosequencing approach and subsystems-based annotations available in the SEED database allowed us access to understand the metabolic potential of these microbiomes. Altogether, these data suggest that agricultural and animal husbandry practices can impose significant selective pressures on the rumen microbiota regardless of rumen type. The present study provides a baseline for understanding the complexity of camel rumen microbial ecology while also highlighting striking similarities and differences when compared to other animal gastrointestinal environments.
Misra, S N; Anjaiah, K; Joseph, G; Abdi, S H
1992-02-01
The interactions of praseodymium(III) and neodymium(III) with nucleosides and nucleotides have been studied in different stoichiometry in water and water-DMF mixtures by employing absorption difference and comparative absorption spectrophotometry. The 4f-4f bands were analysed by linear curve analysis followed by gaussian curve analysis, and various spectral parameters were computed, using partial and multiple regression method. The magnitude of changes in both energy interaction and intensity were used to explore the degree of outer and inner sphere coordination, incidence of covalency and the extent of metal 4f-orbital involvement in chemical bonding. Crystalline complexes of the type [Ln(nucleotide)2(H2O)2]- (where nucleotide--GMP or IMP) were characterized by IR, 1H NMR, 31P NMR data. These studies indicated that the binding of the nucleotide is through phosphate oxygen in a bidentate manner and the complexes undergo substantial ionisation in aqueous medium, thereby supporting the observed weak 4f-4f bands and lower values for nephelauxetic effect (1-beta), bonding (b) and covalency (delta) parameters derived from coulombic and spin orbit interaction parameters.
Microbiome-Metabolome Responses in the Cecum and Colon of Pig to a High Resistant Starch Diet.
Sun, Yue; Su, Yong; Zhu, Weiyun
2016-01-01
Currently, knowledge about the impact of long-term intake of high resistant starch diet on pig hindgut microbiota and metabolite profile is limited. In this study, a combination of the pyrosequencing and the mass spectrometry (MS)-based metabolomics techniques were used to investigate the effects of a raw potato starch (RPS, high in resistant starch) diet on microbial composition and microbial metabolites in the hindgut of pig. The results showed that Coprococcus, Ruminococcus, and Turicibacter increased significantly, while Sarcina and Clostridium decreased in relative abundances in the hindgut of pigs fed RPS. The metabolimic analysis revealed that RPS significantly affected starch and sucrose metabolites, amino acid turnover or protein biosynthesis, lipid metabolites, glycolysis, the pentose phosphate pathway, inositol phosphate metabolism, and nucleotide metabolism. Furthermore, a Pearson's correlation analysis showed that Ruminococcus and Coprococcus were positively correlated with glucose-6-phosphate, maltose, arachidonic acid, 9, 12-octadecadienoic acid, oleic acid, phosphate, but negatively correlated with α-aminobutyric acid. However, the correlation of Clostridium and Sarcina with these compounds was in the opposite direction. The results suggest that RPS not only alters the composition of the gut microbial community but also modulates the metabolic pathway of microbial metabolism, which may further affect the hindgut health of the host.
Microbiome-Metabolome Responses in the Cecum and Colon of Pig to a High Resistant Starch Diet
Sun, Yue; Su, Yong; Zhu, Weiyun
2016-01-01
Currently, knowledge about the impact of long-term intake of high resistant starch diet on pig hindgut microbiota and metabolite profile is limited. In this study, a combination of the pyrosequencing and the mass spectrometry (MS)-based metabolomics techniques were used to investigate the effects of a raw potato starch (RPS, high in resistant starch) diet on microbial composition and microbial metabolites in the hindgut of pig. The results showed that Coprococcus, Ruminococcus, and Turicibacter increased significantly, while Sarcina and Clostridium decreased in relative abundances in the hindgut of pigs fed RPS. The metabolimic analysis revealed that RPS significantly affected starch and sucrose metabolites, amino acid turnover or protein biosynthesis, lipid metabolites, glycolysis, the pentose phosphate pathway, inositol phosphate metabolism, and nucleotide metabolism. Furthermore, a Pearson's correlation analysis showed that Ruminococcus and Coprococcus were positively correlated with glucose-6-phosphate, maltose, arachidonic acid, 9, 12-octadecadienoic acid, oleic acid, phosphate, but negatively correlated with α-aminobutyric acid. However, the correlation of Clostridium and Sarcina with these compounds was in the opposite direction. The results suggest that RPS not only alters the composition of the gut microbial community but also modulates the metabolic pathway of microbial metabolism, which may further affect the hindgut health of the host. PMID:27303373
An, Jianyu; Yin, Mengqi; Zhang, Qin; Gong, Dongting; Jia, Xiaowen; Guan, Yajing; Hu, Jin
2017-01-01
Luffa cylindrica (L.) Roem. is an economically important vegetable crop in China. However, the genomic information on this species is currently unknown. In this study, for the first time, a genome survey of L. cylindrica was carried out using next-generation sequencing (NGS) technology. In total, 43.40 Gb sequence data of L. cylindrica, about 54.94× coverage of the estimated genome size of 789.97 Mb, were obtained from HiSeq 2500 sequencing, in which the guanine plus cytosine (GC) content was calculated to be 37.90%. The heterozygosity of genome sequences was only 0.24%. In total, 1,913,731 contigs (>200 bp) with 525 bp N50 length and 1,410,117 scaffolds (>200 bp) with 885.01 Mb total length were obtained. From the initial assembled L. cylindrica genome, 431,234 microsatellites (SSRs) (≥5 repeats) were identified. The motif types of SSR repeats included 62.88% di-nucleotide, 31.03% tri-nucleotide, 4.59% tetra-nucleotide, 0.96% penta-nucleotide and 0.54% hexa-nucleotide. Eighty genomic SSR markers were developed, and 51/80 primers could be used in both “Zheda 23” and “Zheda 83”. Nineteen SSRs were used to investigate the genetic diversity among 32 accessions through SSR-HRM analysis. The unweighted pair group method analysis (UPGMA) dendrogram tree was built by calculating the SSR-HRM raw data. SSR-HRM could be effectively used for genotype relationship analysis of Luffa species. PMID:28891982
Molecular characterization of the vitamin D receptor (VDR) gene in Holstein cows.
Ali, Mayar O; El-Adl, Mohamed A; Ibrahim, Hussam M M; Elseedy, Youssef Y; Rizk, Mohamed A; El-Khodery, Sabry A
2018-06-01
Vitamin D plays a vital role in calcium homeostasis, growth, and immunoregulation. Because little is known about the vitamin D receptor (VDR) gene in cattle, the aim of the present investigation was to present the molecular characterization of exons 5 and 6 of the VDR gene in Holstein cows. DNA extraction, genomic sequencing, phylogenetic analysis, synteny mapping and single nucleotide gene polymorphism analysis of the VDR gene were performed to assess blood samples collected from 50 clinically healthy Holstein cows. The results revealed the presence of a 450-base pair (bp) nucleotide sequence that resembled exons 5 and 6 with intron 5 enclosed between these exons. Sequence alignment and phylogenetic analysis revealed a close relationship between the sequenced VDR region and that found in Hereford cattle. A close association between this region and the corresponding region in small ruminants was also documented. Moreover, a single nucleotide polymorphism (SNP) that caused the replacement of a glutamate with an arginine in the deduced amino acid sequence was detected at position 7 of exon 5. In conclusion, Holstein and Hereford cattle differ with respect to exon 5 of the VDR gene. Phylogenetic analysis of the VDR gene based on nucleotide sequence produced different results from prior analyses based on amino acid sequence. Copyright © 2018 Elsevier Ltd. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kirst, Matias
2015-04-15
Poplars trees are well suited for biofuel production due to their fast growing habit, favorable wood composition and adaptation to a broad range of environments. The availability of a reference genome sequence, ease of vegetative propagation and availability of transformation methods also make poplar an ideal model for the study of wood formation and biomass growth in woody, perennial plants. The objective of this project was to conduct a genome-wide association genetics study to identify genes that regulate bioenergy traits in Populus deltoides (eastern cottonwood). Populus deltoides is a genetically diverse keystone forest species in North America and an importantmore » short rotation woody crop for the bioenergy industry. We searched for associations between eight growth and wood composition traits and common and low-frequency single-nucleotide polymorphisms (SNPs) detected by targeted resequencing of 18,153 genes in a population of 391 unrelated individuals. To increase power to detect associations with low-frequency variants, multiple-marker association tests were used in combination with single-marker association tests. Significant associations were discovered for all phenotypes and are indicative that low-frequency polymorphisms contribute to phenotypic variance of several bioenergy traits. These polymorphism are critical tools for the development of specialized plant feedstocks for bioenergy.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kirst, Matias
2014-04-14
Poplars trees are well suited for biofuel production due to their fast growing habit, favorable wood composition and adaptation to a broad range of environments. The availability of a reference genome sequence, ease of vegetative propagation and availability of transformation methods also make poplar an ideal model for the study of wood formation and biomass growth in woody, perennial plants. The objective of this project was to conduct a genome-wide association genetics study to identify genes that regulate bioenergy traits in Populus deltoides (eastern cottonwood). Populus deltoides is a genetically diverse keystone forest species in North America and an importantmore » short rotation woody crop for the bioenergy industry. We searched for associations between eight growth and wood composition traits and common and low-frequency single-nucleotide polymorphisms (SNPs) detected by targeted resequencing of 18,153 genes in a population of 391 unrelated individuals. To increase power to detect associations with low-frequency variants, multiple-marker association tests were used in combination with single-marker association tests. Significant associations were discovered for all phenotypes and are indicative that low-frequency polymorphisms contribute to phenotypic variance of several bioenergy traits. These polymorphism are critical tools for the development of specialized plant feedstocks for bioenergy.« less
Ryu, Jihye; Lee, Chaeyoung
2014-12-01
Positive selection not only increases beneficial allele frequency but also causes augmentation of allele frequencies of sequence variants in close proximity. Signals for positive selection were detected by the statistical differences in subsequent allele frequencies. To identify selection signatures in Korean cattle, we applied a composite log-likelihood (CLL)-based method, which calculates a composite likelihood of the allelic frequencies observed across sliding windows of five adjacent loci and compares the value with the critical statistic estimated by 50,000 permutations. Data for a total of 11,799 nucleotide polymorphisms were used with 71 Korean cattle and 209 foreign beef cattle. As a result, 147 signals were identified for Korean cattle based on CLL estimates (P < 0.01). The signals might be candidate genetic factors for meat quality by which the Korean cattle have been selected. Further genetic association analysis with 41 intragenic variants in the selection signatures with the greatest CLL for each chromosome revealed that marbling score was associated with five variants. Intensive association studies with all the selection signatures identified in this study are required to exclude signals associated with other phenotypes or signals falsely detected and thus to identify genetic markers for meat quality. © 2014 Stichting International Foundation for Animal Genetics.
Liu, Bin; Liu, Fule; Fang, Longyun; Wang, Xiaolong; Chou, Kuo-Chen
2015-04-15
In order to develop powerful computational predictors for identifying the biological features or attributes of DNAs, one of the most challenging problems is to find a suitable approach to effectively represent the DNA sequences. To facilitate the studies of DNAs and nucleotides, we developed a Python package called representations of DNAs (repDNA) for generating the widely used features reflecting the physicochemical properties and sequence-order effects of DNAs and nucleotides. There are three feature groups composed of 15 features. The first group calculates three nucleic acid composition features describing the local sequence information by means of kmers; the second group calculates six autocorrelation features describing the level of correlation between two oligonucleotides along a DNA sequence in terms of their specific physicochemical properties; the third group calculates six pseudo nucleotide composition features, which can be used to represent a DNA sequence with a discrete model or vector yet still keep considerable sequence-order information via the physicochemical properties of its constituent oligonucleotides. In addition, these features can be easily calculated based on both the built-in and user-defined properties via using repDNA. The repDNA Python package is freely accessible to the public at http://bioinformatics.hitsz.edu.cn/repDNA/. bliu@insun.hit.edu.cn or kcchou@gordonlifescience.org Supplementary data are available at Bioinformatics online. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Phenomenological Partial Specific Volumes for G-Quadruplex DNAs
Hellman, Lance M.; Rodgers, David W.; Fried, Michael G.
2009-01-01
Accurate partial specific volume (ν̄) values are required for sedimentation velocity and sedimentation equilibrium analyses. For nucleic acids, the estimation of these values is complicated by the fact that ν̄ depends on base composition, secondary structure, solvation and the concentrations and identities of ions in the surrounding buffer. Here we describe sedimentation equilibrium measurements of the apparent isopotential partial specific volume φ′ for two G-quadruplex DNAs and a single-stranded DNA of similar molecular weight and base composition. The G-quadruplex DNAs are a 22 nucleotide fragment of the human telomere consensus sequence and a 27 nucleotide fragment from the human c-myc promoter. The single-stranded DNA is 26 nucleotides long and is designed to have low propensity to form secondary structures. Parallel measurements were made in buffers containing NaCl and in buffers containing KCl, spanning the range 0.09M ≤ [salt] ≤ 2.3M. Limiting values of φ′, extrapolated to [salt] = 0M, were: 22-mer (NaCl-form), 0.525 ± 0.004 mL/g; 22-mer (KCl-form), 0.531 ± 0.006 mL/g; 27-mer (NaCl-form), 0.548 ± 0.005 mL/g; 27-mer (KCl-form), 0.557 ± 0.006 mL/g; 26-mer (NaCl-form), 0.555 ± 0.004 mL/g; 26-mer (KCl-form), 0.564 ± 0.006 mL/g. Small changes in φ′ with [salt] suggest that large changes in counterion association or hydration are unlikely to take place over these concentration ranges. PMID:19238377
Cherepanov, A V; de Vries, S
2001-01-01
The interaction of nucleotides with T4 DNA and RNA ligases has been characterized using ultraviolet visible (UV-VIS) absorbance and fluorescence spectroscopy. Both enzymes bind nucleotides with the K(d) between 0.1 and 20 microM. Nucleotide binding results in a decrease of absorbance at 260 nm due to pi-stacking with an aromatic residue, possibly phenylalanine, and causes red-shifting of the absorbance maximum due to hydrogen bonding with the exocyclic amino group. T4 DNA ligase is shown to have, besides the catalytic ATP binding site, another noncovalent nucleotide binding site. ATP bound there alters the pi-stacking of the nucleotide in the catalytic site, increasing its optical extinction. The K(d) for the noncovalent site is approximately 1000-fold higher than for the catalytic site. Nucleotides quench the protein fluorescence showing that a tryptophan residue is located in the active site of the ligase. The decrease of absorbance around 298 nm suggests that the hydrogen bonding interactions of this tryptophan residue are weakened in the ligase-nucleotide complex. The excitation/emission properties of T4 RNA ligase indicate that its ATP binding pocket is in contact with solvent, which is excluded upon binding of the nucleotide. Overall, the spectroscopic analysis reveals important similarities between T4 ligases and related nucleotidyltransferases, despite the low sequence similarity. PMID:11721015
Manchester, Keith L
2004-01-30
An analysis is made of the rate constants for the reactions involving the interactions of EF-Tu, EF-Ts, GDP, and GTP recently derived by Gromadski et al. [Biochemistry 41 (2002) 162]. Though their measured values appear to allow a reasonable rate of nucleotide exchange sufficient to support rates of protein synthesis in vivo, their data underestimate the thermodynamic barrier involved in nucleotide exchange and therefore cannot be considered definitive. A kinetic scheme consistent with the thermodynamic barrier can be achieved by modification of various rate constants, particularly of those involving the release of EF-Ts from EF-Tu.GTP.EF-Ts, but such constants are markedly different from what are experimentally observed. It thus remains impossible at present satisfactorily to model guanine nucleotide exchange on EF-Tu, catalysed by EF-Ts by a double displacement mechanism, with experimentally derived rate constants. Metabolic control analysis has been applied to determine the degree of flux control of the different steps in the pathway.
MGAS: a powerful tool for multivariate gene-based genome-wide association analysis.
Van der Sluis, Sophie; Dolan, Conor V; Li, Jiang; Song, Youqiang; Sham, Pak; Posthuma, Danielle; Li, Miao-Xin
2015-04-01
Standard genome-wide association studies, testing the association between one phenotype and a large number of single nucleotide polymorphisms (SNPs), are limited in two ways: (i) traits are often multivariate, and analysis of composite scores entails loss in statistical power and (ii) gene-based analyses may be preferred, e.g. to decrease the multiple testing problem. Here we present a new method, multivariate gene-based association test by extended Simes procedure (MGAS), that allows gene-based testing of multivariate phenotypes in unrelated individuals. Through extensive simulation, we show that under most trait-generating genotype-phenotype models MGAS has superior statistical power to detect associated genes compared with gene-based analyses of univariate phenotypic composite scores (i.e. GATES, multiple regression), and multivariate analysis of variance (MANOVA). Re-analysis of metabolic data revealed 32 False Discovery Rate controlled genome-wide significant genes, and 12 regions harboring multiple genes; of these 44 regions, 30 were not reported in the original analysis. MGAS allows researchers to conduct their multivariate gene-based analyses efficiently, and without the loss of power that is often associated with an incorrectly specified genotype-phenotype models. MGAS is freely available in KGG v3.0 (http://statgenpro.psychiatry.hku.hk/limx/kgg/download.php). Access to the metabolic dataset can be requested at dbGaP (https://dbgap.ncbi.nlm.nih.gov/). The R-simulation code is available from http://ctglab.nl/people/sophie_van_der_sluis. Supplementary data are available at Bioinformatics online. © The Author 2014. Published by Oxford University Press.
NASA Astrophysics Data System (ADS)
Xie, Xian-Hua; Yu, Zu-Guo; Ma, Yuan-Lin; Han, Guo-Sheng; Anh, Vo
2017-09-01
There has been a growing interest in visualization of metagenomic data. The present study focuses on the visualization of metagenomic data using inter-nucleotide distances profile. We first convert the fragment sequences into inter-nucleotide distances profiles. Then we analyze these profiles by principal component analysis. Finally the principal components are used to obtain the 2-D scattered plot according to their source of species. We name our method as inter-nucleotide distances profiles (INP) method. Our method is evaluated on three benchmark data sets used in previous published papers. Our results demonstrate that the INP method is good, alternative and efficient for visualization of metagenomic data.
A movie of the RNA polymerase nucleotide addition cycle.
Brueckner, Florian; Ortiz, Julio; Cramer, Patrick
2009-06-01
During gene transcription, RNA polymerase (Pol) passes through repetitive cycles of adding a nucleotide to the growing mRNA chain. Here we obtained a movie of the nucleotide addition cycle by combining structural information on different functional states of the Pol II elongation complex (EC). The movie illustrates the two-step loading of the nucleoside triphosphate (NTP) substrate, closure of the active site for catalytic nucleotide incorporation, and the presumed two-step translocation of DNA and RNA, which is accompanied by coordinated conformational changes in the polymerase bridge helix and trigger loop. The movie facilitates teaching and a mechanistic analysis of transcription and can be downloaded from http://www.lmb.uni-muenchen.de/cramer/pr-materials.
Nicolaï, Adrien; Delarue, Patrice; Senet, Patrick
2013-01-01
ATP regulates the function of many proteins in the cell by transducing its binding and hydrolysis energies into protein conformational changes by mechanisms which are challenging to identify at the atomic scale. Based on molecular dynamics (MD) simulations, a method is proposed to analyze the structural changes induced by ATP binding to a protein by computing the effective free-energy landscape (FEL) of a subset of its coordinates along its amino-acid sequence. The method is applied to characterize the mechanism by which the binding of ATP to the nucleotide-binding domain (NBD) of Hsp70 propagates a signal to its substrate-binding domain (SBD). Unbiased MD simulations were performed for Hsp70-DnaK chaperone in nucleotide-free, ADP-bound and ATP-bound states. The simulations revealed that the SBD does not interact with the NBD for DnaK in its nucleotide-free and ADP-bound states whereas the docking of the SBD was found in the ATP-bound state. The docked state induced by ATP binding found in MD is an intermediate state between the initial nucleotide-free and final ATP-bound states of Hsp70. The analysis of the FEL projected along the amino-acid sequence permitted to identify a subset of 27 protein internal coordinates corresponding to a network of 91 key residues involved in the conformational change induced by ATP binding. Among the 91 residues, 26 are identified for the first time, whereas the others were shown relevant for the allosteric communication of Hsp70 s in several experiments and bioinformatics analysis. The FEL analysis revealed also the origin of the ATP-induced structural modifications of the SBD recently measured by Electron Paramagnetic Resonance. The pathway between the nucleotide-free and the intermediate state of DnaK was extracted by applying principal component analysis to the subset of internal coordinates describing the transition. The methodology proposed is general and could be applied to analyze allosteric communication in other proteins.
Nicolaï, Adrien; Delarue, Patrice; Senet, Patrick
2013-01-01
ATP regulates the function of many proteins in the cell by transducing its binding and hydrolysis energies into protein conformational changes by mechanisms which are challenging to identify at the atomic scale. Based on molecular dynamics (MD) simulations, a method is proposed to analyze the structural changes induced by ATP binding to a protein by computing the effective free-energy landscape (FEL) of a subset of its coordinates along its amino-acid sequence. The method is applied to characterize the mechanism by which the binding of ATP to the nucleotide-binding domain (NBD) of Hsp70 propagates a signal to its substrate-binding domain (SBD). Unbiased MD simulations were performed for Hsp70-DnaK chaperone in nucleotide-free, ADP-bound and ATP-bound states. The simulations revealed that the SBD does not interact with the NBD for DnaK in its nucleotide-free and ADP-bound states whereas the docking of the SBD was found in the ATP-bound state. The docked state induced by ATP binding found in MD is an intermediate state between the initial nucleotide-free and final ATP-bound states of Hsp70. The analysis of the FEL projected along the amino-acid sequence permitted to identify a subset of 27 protein internal coordinates corresponding to a network of 91 key residues involved in the conformational change induced by ATP binding. Among the 91 residues, 26 are identified for the first time, whereas the others were shown relevant for the allosteric communication of Hsp70 s in several experiments and bioinformatics analysis. The FEL analysis revealed also the origin of the ATP-induced structural modifications of the SBD recently measured by Electron Paramagnetic Resonance. The pathway between the nucleotide-free and the intermediate state of DnaK was extracted by applying principal component analysis to the subset of internal coordinates describing the transition. The methodology proposed is general and could be applied to analyze allosteric communication in other proteins. PMID:24348227
Das, G; Henning, D; Wright, D; Reddy, R
1988-01-01
Whereas the genes coding for trimethyl guanosine-capped snRNAs are transcribed by RNA polymerase II, the U6 RNA genes are transcribed by RNA polymerase III. In this study, we have analyzed the cis-regulatory elements involved in the transcription of a mouse U6 snRNA gene in vitro and in frog oocytes. Transcriptional analysis of mutant U6 gene constructs showed that, unlike most known cases of polymerase III transcription, intragenic sequences except the initiation nucleotide are dispensable for efficient and accurate transcription of U6 gene in vitro. Transcription of 5' deletion mutants in vitro and in frog oocytes showed that the upstream region, within 79 bp from the initiation nucleotide, contains elements necessary for U6 gene transcription. Transcription studies were carried out in frog oocytes with U6 genes containing 5' distal sequence; these studies revealed that the distal element acts as an orientation-dependent enhancer when present upstream to the gene, while it is orientation-independent but distance-dependent enhancer when placed down-stream to the U6 gene. Analysis of 3' deletion mutants showed that the transcription termination of U6 RNA is dependent on a T cluster present on the 3' end of the gene, thus providing further support to other lines of evidence that U6 genes are transcribed by RNA polymerase III. These observations suggest the involvement of a composite of components of RNA polymerase II and III transcription machineries in the transcription of U6 genes by RNA polymerase III. Images PMID:3366121
Genotyping by Sequencing in Almond: SNP Discovery, Linkage Mapping, and Marker Design.
Goonetilleke, Shashi N; March, Timothy J; Wirthensohn, Michelle G; Arús, Pere; Walker, Amanda R; Mather, Diane E
2018-01-04
In crop plant genetics, linkage maps provide the basis for the mapping of loci that affect important traits and for the selection of markers to be applied in crop improvement. In outcrossing species such as almond ( Prunus dulcis Mill. D. A. Webb), application of a double pseudotestcross mapping approach to the F 1 progeny of a biparental cross leads to the construction of a linkage map for each parent. Here, we report on the application of genotyping by sequencing to discover and map single nucleotide polymorphisms in the almond cultivars "Nonpareil" and "Lauranne." Allele-specific marker assays were developed for 309 tag pairs. Application of these assays to 231 Nonpareil × Lauranne F 1 progeny provided robust linkage maps for each parent. Analysis of phenotypic data for shell hardness demonstrated the utility of these maps for quantitative trait locus mapping. Comparison of these maps to the peach genome assembly confirmed high synteny and collinearity between the peach and almond genomes. The marker assays were applied to progeny from several other Nonpareil crosses, providing the basis for a composite linkage map of Nonpareil. Applications of the assays to a panel of almond clones and a panel of rootstocks used for almond production demonstrated the broad applicability of the markers and provide subsets of markers that could be used to discriminate among accessions. The sequence-based linkage maps and single nucleotide polymorphism assays presented here could be useful resources for the genetic analysis and genetic improvement of almond. Copyright © 2018 Goonetilleke et al.
An improved model for whole genome phylogenetic analysis by Fourier transform.
Yin, Changchuan; Yau, Stephen S-T
2015-10-07
DNA sequence similarity comparison is one of the major steps in computational phylogenetic studies. The sequence comparison of closely related DNA sequences and genomes is usually performed by multiple sequence alignments (MSA). While the MSA method is accurate for some types of sequences, it may produce incorrect results when DNA sequences undergone rearrangements as in many bacterial and viral genomes. It is also limited by its computational complexity for comparing large volumes of data. Previously, we proposed an alignment-free method that exploits the full information contents of DNA sequences by Discrete Fourier Transform (DFT), but still with some limitations. Here, we present a significantly improved method for the similarity comparison of DNA sequences by DFT. In this method, we map DNA sequences into 2-dimensional (2D) numerical sequences and then apply DFT to transform the 2D numerical sequences into frequency domain. In the 2D mapping, the nucleotide composition of a DNA sequence is a determinant factor and the 2D mapping reduces the nucleotide composition bias in distance measure, and thus improving the similarity measure of DNA sequences. To compare the DFT power spectra of DNA sequences with different lengths, we propose an improved even scaling algorithm to extend shorter DFT power spectra to the longest length of the underlying sequences. After the DFT power spectra are evenly scaled, the spectra are in the same dimensionality of the Fourier frequency space, then the Euclidean distances of full Fourier power spectra of the DNA sequences are used as the dissimilarity metrics. The improved DFT method, with increased computational performance by 2D numerical representation, can be applicable to any DNA sequences of different length ranges. We assess the accuracy of the improved DFT similarity measure in hierarchical clustering of different DNA sequences including simulated and real datasets. The method yields accurate and reliable phylogenetic trees and demonstrates that the improved DFT dissimilarity measure is an efficient and effective similarity measure of DNA sequences. Due to its high efficiency and accuracy, the proposed DFT similarity measure is successfully applied on phylogenetic analysis for individual genes and large whole bacterial genomes. Copyright © 2015 Elsevier Ltd. All rights reserved.
Ishida, Hisashi; Matsumoto, Atsushi
2016-09-01
In order to understand how MutS recognizes mismatched DNA and induces the reaction of DNA repair using ATP, the dynamics of the complexes of MutS (bound to the ADP and ATP nucleotides, or not) and DNA (with mismatched and matched base-pairs) were investigated using molecular dynamics simulations. As for DNA, the structure of the base-pairs of the homoduplex DNA which interacted with the DNA recognition site of MutS was intermittently disturbed, indicating that the homoduplex DNA was unstable. As for MutS, the disordered loops in the ATPase domains, which are considered to be necessary for the induction of DNA repair, were close to (away from) the nucleotide-binding sites in the ATPase domains when the nucleotides were (not) bound to MutS. This indicates that the ATPase domains changed their structural stability upon ATP binding using the disordered loop. Conformational analysis by principal component analysis showed that the nucleotide binding changed modes which have structurally solid ATPase domains and the large bending motion of the DNA from higher to lower frequencies. In the MutS-mismatched DNA complex bound to two nucleotides, the bending motion of the DNA at low frequency modes may play a role in triggering the formation of the sliding clamp for the following DNA-repair reaction step. Moreover, MM-PBSA/GBSA showed that the MutS-homoduplex DNA complex bound to two nucleotides was unstable because of the unfavorable interactions between MutS and DNA. This would trigger the ATP hydrolysis or separation of MutS and DNA to continue searching for mismatch base-pairs. Proteins 2016; 84:1287-1303. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
Cosenza, Gianfranco; Macciotta, Nicolò P P; Nudda, Anna; Coletta, Angelo; Ramunno, Luigi; Pauciullo, Alfredo
2017-05-01
The oxytocin receptor, also known as OXTR, is a protein which functions as receptor for the hormone and neurotransmitter oxytocin and the complex oxytocin-oxytocin receptor plays an important role in the uterus during calving. A characterisation of the river buffalo OXTR gene, amino acid sequences and phylogenetic analysis is presented. The DNA regions of the OXTR gene spanning exons 1, 2 and 3 of ten Mediterranean river buffalo DNA samples were analysed and 7 single nucleotide polymorphisms were found. We focused on the g.129C > T SNP detected in exon 3 and responsible for the amino acid replacement CGCArg > TGCCys in position 353. The relative frequency of T allele was of 0·257. An association study between this detected polymorphism and milk fatty acids composition in Italian Mediterranean river buffalo was carried out. The fatty acid composition traits, fatty acid classes and fat percentage of 306 individual milk samples were determined. Associations between OXTR g.129C > T genotype and milk fatty acids composition were tested using a mixed linear model. The OXTR CC genotype was found significantly associated with higher contents of odd branched-chain fatty acids (OBCFA) (P < 0·0006), polyunsaturated FA (PUFA n 3 and n 6) (P < 0·0032 and P < 0·0006, respectively), stearic acid (C18) (P < 0·02) and lower level of palmitic acid (C16) (P < 0·02). The results of this study suggest that the OXTR CC animals might be useful in selection toward the improvement of milk fatty acid composition.
Arias-Pulido, Hugo; Peyton, Cheri L; Torrez-Martínez, Norah; Anderson, D Nelson; Wheeler, Cosette M
2005-07-20
While HPV 16 variant lineages have been well characterized, the knowledge about HPV 18 variants is limited. In this study, HPV 18 nucleotide variations in the E2 hinge region were characterized by sequence analysis in 47 control and 51 tumor specimens. Fifty of these specimens were randomly selected for sequencing of an LCR-E6 segment and 20 samples representative of LCR-E6 and E2 sequence variants were examined across the L1 region. A total of 2770 nucleotides per HPV 18 variant genome were considered in this study. HPV 18 variant nucleotides were linked among all gene segments analyzed and grouped into three main branches: Asian-American (AA), European (E), and African (Af). These three branches were equally distributed among controls and cases and when stratified by Hispanic and non-Hispanic ethnicities. Among invasive cervical cancer cases, no significant differences in the three HPV variant branches were observed among ethnic groups or when stratified by histopathology (squamous vs. adenocarcinoma). The Af branch showed the greatest nucleotide variability when compared to the HPV 18 reference sequence and was more closely related to HPV 45 than either AA or E branches. Our data also characterize nucleotide and amino acid variations in the L1 capsid gene among HPV 18 variants, which may be relevant to vaccine strategies and subsequent studies of naturally occurring HPV 18 variants. Several novel HPV 18 nucleotide variations were identified in this study.
Yusoff, K; Millar, N S; Chambers, P; Emmerson, P T
1987-01-01
The nucleotide sequence of the L gene of the Beaudette C strain of Newcastle disease virus (NDV) has been determined. The L gene is 6704 nucleotides long and encodes a protein of 2204 amino acids with a calculated molecular weight of 248822. Mung bean nuclease mapping of the 5' terminus of the L gene mRNA indicates that the transcription of the L gene is initiated 11 nucleotides upstream of the translational start site. Comparison with the amino acid sequences of the L genes of Sendai virus and vesicular stomatitis virus (VSV) suggests that there are several regions of homology between the sequences. These data provide further evidence for an evolutionary relationship between the Paramyxoviridae and the Rhabdoviridae. A non-coding sequence of 46 nucleotides downstream of the presumed polyadenylation site of the L gene may be part of a negative strand leader RNA. Images PMID:3035486
Free amino acids and 5'-nucleotides in Finnish forest mushrooms.
Manninen, Hanna; Rotola-Pukkila, Minna; Aisala, Heikki; Hopia, Anu; Laaksonen, Timo
2018-05-01
Edible mushrooms are valued because of their umami taste and good nutritional values. Free amino acids, 5'-nucleotides and nucleosides were analyzed from four Nordic forest mushroom species (Lactarius camphoratus, Boletus edulis, Cantharellus cibarius, Craterellus tubaeformis) using high precision liquid chromatography analysis. To our knowledge, these taste components were studied for the first time from Craterellus tubaeformis and Lactarius camphoratus. The focus was on the umami amino acids and 5'-nucleotides. The free amino acid and 5'-nucleotide/nucleoside contents of studied species differed from each other. In all studied samples, umami amino acids were among five major free amino acids. The highest concentration of umami amino acids was on L. camphoratus whereas B. edulis had the highest content of sweet amino acids and C. cibarius had the highest content of bitter amino acids. The content of umami enhancing 5'-nucleotides were low in all studied species. Copyright © 2017 Elsevier Ltd. All rights reserved.
Li, Yongqiang; Deng, Congliang; Bian, Yong; Zhao, Xiaoli; Zhou, Qi
2017-04-01
Apple stem grooving virus (ASGV), apple chlorotic leaf spot virus (ACLSV), and prunus necrotic ringspot virus (PNRSV) were identified in a crab apple tree by small RNA deep sequencing. The complete genome sequence of ACLSV isolate BJ (ACLSV-BJ) was 7554 nucleotides and shared 67.0%-83.0% nucleotide sequence identity with other ACLSV isolates. A phylogenetic tree based on the complete genome sequence of all available ACLSV isolates showed that ACLSV-BJ clustered with the isolates SY01 from hawthorn, MO5 from apple, and JB, KMS and YH from pear. The complete nucleotide sequence of ASGV-BJ was 6509 nucleotides (nt) long and shared 78.2%-80.7% nucleotide sequence identity with other isolates. ASGV-BJ and the isolate ASGV_kfp clustered together in the phylogenetic tree as an independent clade. Recombination analysis showed that isolate ASGV-BJ was a naturally occurring recombinant.
USDA-ARS?s Scientific Manuscript database
: Hemoglobin-y gene of channel catfish , lctalurus punctatus, was cloned and sequenced . Total RNA from head kidneys was isolated, reverse transcribed and amplified . The sequence of the channel catfish hemoglobin-y gene consists of 600 nucleotides . Analysis of the nucleotide sequence reveals one o...
NASA Technical Reports Server (NTRS)
Sokolova, Z. A.
1980-01-01
The influence of sinusoidal modulated currents was studied and physical loads on the nucleic acid content and the nucleotide composition of the total RNA in muscles of rats of various ages under conditions of hypodynamia were measured. Methodology utilized is described and conclusions are presented.
de Keyzer, Jeanine; Steel, Gregor J.; Hale, Sarah J.; Humphries, Daniel; Stirling, Colin J.
2009-01-01
Protein translocation and folding in the endoplasmic reticulum of Saccharomyces cerevisiae involves two distinct Hsp70 chaperones, Lhs1p and Kar2p. Both proteins have the characteristic domain structure of the Hsp70 family consisting of a conserved N-terminal nucleotide binding domain and a C-terminal substrate binding domain. Kar2p is a canonical Hsp70 whose substrate binding activity is regulated by cochaperones that promote either ATP hydrolysis or nucleotide exchange. Lhs1p is a member of the Grp170/Lhs1p subfamily of Hsp70s and was previously shown to function as a nucleotide exchange factor (NEF) for Kar2p. Here we show that in addition to this NEF activity, Lhs1p can function as a holdase that prevents protein aggregation in vitro. Analysis of the nucleotide requirement of these functions demonstrates that nucleotide binding to Lhs1p stimulates the interaction with Kar2p and is essential for NEF activity. In contrast, Lhs1p holdase activity is nucleotide-independent and unaffected by mutations that interfere with ATP binding and NEF activity. In vivo, these mutants show severe protein translocation defects and are unable to support growth despite the presence of a second Kar2p-specific NEF, Sil1p. Thus, Lhs1p-dependent nucleotide exchange activity is vital for ER protein biogenesis in vivo. PMID:19759005
Vertebrate codon bias indicates a highly GC-rich ancestral genome.
Nabiyouni, Maryam; Prakash, Ashwin; Fedorov, Alexei
2013-04-25
Two factors are thought to have contributed to the origin of codon usage bias in eukaryotes: 1) genome-wide mutational forces that shape overall GC-content and create context-dependent nucleotide bias, and 2) positive selection for codons that maximize efficient and accurate translation. Particularly in vertebrates, these two explanations contradict each other and cloud the origin of codon bias in the taxon. On the one hand, mutational forces fail to explain GC-richness (~60%) of third codon positions, given the GC-poor overall genomic composition among vertebrates (~40%). On the other hand, positive selection cannot easily explain strict regularities in codon preferences. Large-scale bioinformatic assessment, of nucleotide composition of coding and non-coding sequences in vertebrates and other taxa, suggests a simple possible resolution for this contradiction. Specifically, we propose that the last common vertebrate ancestor had a GC-rich genome (~65% GC). The data suggest that whole-genome mutational bias is the major driving force for generating codon bias. As the bias becomes prominent, it begins to affect translation and can result in positive selection for optimal codons. The positive selection can, in turn, significantly modulate codon preferences. Copyright © 2013 Elsevier B.V. All rights reserved.
NASA Astrophysics Data System (ADS)
Arndt, Peter F.; Hwa, Terence; Petrov, Dmitri A.
2005-06-01
This study presents the first global, 1 Mbp level analysis of patterns of nucleotide substitutions along the human lineage. The study is based on the analysis of a large amount of repetitive elements deposited into the human genome since the mammalian radiation, yielding a number of results that would have been difficult to obtain using the more conventional comparative method of analysis. This analysis revealed substantial and consistent variability of rates of substitution, with the variability ranging up to 2-fold among different regions. The rates of substitutions of C or G nucleotides with A or T nucleotides vary much more sharply than the reverse rates suggesting that much of that variation is due to differences in mutation rates rather than in the probabilities of fixation of C/G vs. A/T nucleotides across the genome. For all types of substitution we observe substantially more hotspots than coldspots, with hotspots showing substantial clustering over tens of Mbp's. Our analysis revealed that GC-content of surrounding sequences is the best predictor of the rates of substitution. The pattern of substitution appears very different near telomeres compared to the rest of the genome and cannot be explained by the genome-wide correlations of the substitution rates with GC content or exon density. The telomere pattern of substitution is consistent with natural selection or biased gene conversion acting to increase the GC-content of the sequences that are within 10-15 Mbp away from the telomere.
Staniszewska-Slezak, Emilia; Malek, Kamilla; Baranska, Malgorzata
2015-08-05
Raman spectroscopy and four excitation lines in the visible (Vis: 488, 532, 633 nm) and near infrared (NIR: 785 nm) were used for biochemical analysis of rat tissue homogenates, i.e. myocardium, brain, liver, lung, intestine, and kidney. The Vis Raman spectra are very similar for some organs (brain/intestines and kidney/liver) and dominated by heme signals when tissues of lung and myocardium were investigated (especially with 532 nm excitation). On the other hand, the NIR Raman spectra are specific for each tissue and more informative than the corresponding ones collected with the Vis excitations. The spectra analyzed without any special pre-processing clearly illustrate different chemical composition of each tissue and give information about main components e.g. lipids or proteins, but also about the content of some specific compounds such as amino acid residues, nucleotides and nucleobases. However, in order to obtain the whole spectral information about tissues complex composition the spectra of Vis and NIR excitations should be collected and analyzed together. A good agreement of data gathered from Raman spectra of the homogenates and those obtained previously from Raman imaging of the tissue cross-sections indicates that the presented here approach can be a method of choice for an investigation of biochemical variation in animal tissues. Moreover, the Raman spectral profile of tissue homogenates is specific enough to be used for an investigation of potential pathological changes the organism undergoes, in particular when supported by the complementary FTIR spectroscopy. Copyright © 2015 Elsevier B.V. All rights reserved.
Bocianowski, Jan; Mikołajczyk, Katarzyna; Bartkowiak-Broda, Iwona
2012-02-01
One of the goals in oilseed rape programs is to develop genotypes producing oil with low linolenic acid content (C18:3, ≤3%). Low linolenic mutant lines of canola rapeseed were obtained via chemical mutagenesis at the Plant Breeding and Acclimatization Institute - NRI, in Poznan, Poland, and allele-specific SNP markers were designed for monitoring of two statistically important single nucleotide polymorphisms detected by SNaPshot analysis in two FAD3 desaturase genes, BnaA.FAD3 and BnaC.FAD3, respectively. Strong negative correlation between the presence of mutant alleles of the genes and linolenic acid content was revealed by analysis of variance. In this paper we present detailed characteristics of the markers by estimation of the additive and dominance effects of the FAD3 genes with respect to particular fatty acid content in seed oil, as well as by calculation of the phenotypic variation of seed oil fatty acid composition accounted by particular allele-specific marker. The obtained percentage of variation in fatty acid composition was considerable only for linolenic acid content and equaled 35.6% for BnaA.FAD3 and 39.3% for BnaC.FAD3, whereas the total percentage of variation in linolenic acid content was 53.2% when accounted for mutations in both genes simultaneously. Our results revealed high specificity of the markers for effective monitoring of the wild-type and mutated alleles of the Brassica napus FAD3 desaturase genes in the low linolenic mutant recombinants in breeding programs.
Kawaguchi, Fuki; Kigoshi, Hiroto; Nakajima, Ayaka; Matsumoto, Yuta; Uemoto, Yoshinobu; Fukushima, Moriyuki; Yoshida, Emi; Iwamoto, Eiji; Akiyama, Takayuki; Kohama, Namiko; Kobayashi, Eiji; Honda, Takeshi; Oyama, Kenji; Mannen, Hideyuki; Sasazaki, Shinji
2018-05-17
Fatty acid composition is an important indicator of beef quality. The objective of this study was to search the potential candidate region for fatty acid composition. We performed pool-based genome-wide association studies (GWAS) for oleic acid percentage (C18:1) in a Japanese Black cattle population from the Hyogo prefecture. GWAS analysis revealed two novel candidate regions on BTA9 and BTA14. The most significant single nucleotide polymorphisms (SNPs) in each region were genotyped in a population (n = 899) to verify their effect on C18:1. Statistical analysis revealed that both SNPs were significantly associated with C18:1 (p = .0080 and .0003), validating the quantitative trait loci (QTLs) detected in GWAS. We subsequently selected VNN1 and LYPLA1 genes as candidate genes from each region on BTA9 and BTA14, respectively. We sequenced full-length coding sequence (CDS) of these genes in eight individuals and identified a nonsynonymous SNP T66M on VNN1 gene as a putative candidate polymorphism. The polymorphism was also significantly associated with C18:1, but the p value (p = .0162) was higher than the most significant SNP on BTA9, suggesting that it would not be responsible for the QTL. Although further investigation will be needed to determine the responsible gene and polymorphism, our findings would contribute to development of selective markers for fatty acid composition in the Japanese Black cattle of Hyogo. © 2018 Japanese Society of Animal Science.
Bustamante, Luis; Sáez, Vania; Hinrichsen, Patricio; Castro, María H; Vergara, Carola; von Baer, Dietrich; Mardones, Claudia
2017-04-05
A novel 'Red Globe' (RG)-derived grape variety, 'Pink Globe' (PG), was described and registered as a new genotype, with earlier ripening and sweeter taste than those of RG. Microsatellite analysis revealed that PG and RG are undifferentiable; however, the PG VvmybA1c contains six single-nucleotide polymorphisms within the coding and noncoding region, possibly related to the reduced VvmybA1 expression levels. Conversely, HPLC-DAD-ESI-MS/MS analysis showed significantly lower anthocyanin content in PG skin than in RG skin, and PG had no detectable trihydroxylated anthocyanins. Total flavonols did not differ between the variants, although some quercetin derivate concentrations were lower in PG. HPLC-FLD analysis revealed slightly higher concentrations of epicatechin and a procyanidin dimer in PG seeds, although the antioxidant capacity of crude extracts from either variety did not differ significantly. These differences, particularly in monomeric anthocyanin content, can be attributed to altered activity of a MYB-type transcription factor, reducing Vvufgt expression.
Biological nanopore MspA for DNA sequencing
NASA Astrophysics Data System (ADS)
Manrao, Elizabeth A.
Unlocking the information hidden in the human genome provides insight into the inner workings of complex biological systems and can be used to greatly improve health-care. In order to allow for widespread sequencing, new technologies are required that provide fast and inexpensive readings of DNA. Nanopore sequencing is a third generation DNA sequencing technology that is currently being developed to fulfill this need. In nanopore sequencing, a voltage is applied across a small pore in an electrolyte solution and the resulting ionic current is recorded. When DNA passes through the channel, the ionic current is partially blocked. If the DNA bases uniquely modulate the ionic current flowing through the channel, the time trace of the current can be related to the sequence of DNA passing through the pore. There are two main challenges to realizing nanopore sequencing: identifying a pore with sensitivity to single nucleotides and controlling the translocation of DNA through the pore so that the small single nucleotide current signatures are distinguishable from background noise. In this dissertation, I explore the use of Mycobacterium smegmatis porin A (MspA) for nanopore sequencing. In order to determine MspA's sensitivity to single nucleotides, DNA strands of various compositions are held in the pore as the resulting ionic current is measured. DNA is immobilized in MspA by attaching it to a large molecule which acts as an anchor. This technique confirms the single nucleotide resolution of the pore and additionally shows that MspA is sensitive to epigenetic modifications and single nucleotide polymorphisms. The forces from the electric field within MspA, the effective charge of nucleotides, and elasticity of DNA are estimated using a Freely Jointed Chain model of single stranded DNA. These results offer insight into the interactions of DNA within the pore. With the nucleotide sensitivity of MspA confirmed, a method is introduced to controllably pass DNA through the pore. Using a DNA polymerase, DNA strands are stepped through MspA one nucleotide at a time. The steps are observable as distinct levels on the ionic-current time-trace and are related to the DNA sequence. These experiments overcome the two fundamental challenges to realizing MspA nanopore sequencing and pave the way to the development of a commercial technology.
Sequencing and phylogenetic analysis of tobacco virus 2, a polerovirus from Nicotiana tabacum.
Zhou, Benguo; Wang, Fang; Zhang, Xuesong; Zhang, Lina; Lin, Huafeng
2017-07-01
The complete genome sequence of a new virus, provisionally named tobacco virus 2 (TV2), was determined and identified from leaves of tobacco (Nicotiana tabacum) exhibiting leaf mosaic, yellowing, and deformity, in Anhui Province, China. The genome sequence of TV2 comprises 5,979 nucleotides, with 87% nucleotide sequence identity to potato leafroll virus (PLRV). Its genome organization is similar to that of PLRV, containing six open reading frames (ORFs) that potentially encode proteins with putative functions in cell-to-cell movement and suppression of RNA silencing. Phylogenetic analysis of the nucleotide sequence placed TV2 alongside members of the genus Polerovirus in the family Luteoviridae. To the best our knowledge, this study is the first report of a complete genome sequence of a new polerovirus identified in tobacco.
Yao, Chiou-Ju; Chen, Ching-Hung; Hsiao, Chung-Der
2016-07-01
In this study, we used the next-generation sequencing method to deduce the complete mitogenome of Ginkgo-toothed beaked whale (Mesoplodon ginkgodens) for the first time. The nucleotide composition was asymmetric (33.3% A, 25.3% C, 12.6% G, and 28.7% T) with an overall GC content of 37.9%. The length of the assembled mitogenome was 16,339 bp and follows the typical vertebrate arrangement, including 13 protein coding genes, 22 transfer RNAs, 2 ribosomal RNAs genes, and a non-coding control region of D-loop. The D-loop contains 870 bp and is located between tRNA-Pro and tRNA-Phe. The complete mitogenome of Ginkgo-toothed beaked whale deduced in this study provides essential and important DNA molecular data for further phylogenetic and evolutionary analysis for cetaceans.
Hemipteran Mitochondrial Genomes: Features, Structures and Implications for Phylogeny
Wang, Yuan; Chen, Jing; Jiang, Li-Yun; Qiao, Ge-Xia
2015-01-01
The study of Hemipteran mitochondrial genomes (mitogenomes) began with the Chagas disease vector, Triatoma dimidiata, in 2001. At present, 90 complete Hemipteran mitogenomes have been sequenced and annotated. This review examines the history of Hemipteran mitogenomes research and summarizes the main features of them including genome organization, nucleotide composition, protein-coding genes, tRNAs and rRNAs, and non-coding regions. Special attention is given to the comparative analysis of repeat regions. Gene rearrangements are an additional data type for a few families, and most mitogenomes are arranged in the same order to the proposed ancestral insect. We also discuss and provide insights on the phylogenetic analyses of a variety of taxonomic levels. This review is expected to further expand our understanding of research in this field and serve as a valuable reference resource. PMID:26039239
Fluorescent signatures for variable DNA sequences
Rice, John E.; Reis, Arthur H.; Rice, Lisa M.; Carver-Brown, Rachel K.; Wangh, Lawrence J.
2012-01-01
Life abounds with genetic variations writ in sequences that are often only a few hundred nucleotides long. Rapid detection of these variations for identification of genetic diseases, pathogens and organisms has become the mainstay of molecular science and medicine. This report describes a new, highly informative closed-tube polymerase chain reaction (PCR) strategy for analysis of both known and unknown sequence variations. It combines efficient quantitative amplification of single-stranded DNA targets through LATE-PCR with sets of Lights-On/Lights-Off probes that hybridize to their target sequences over a broad temperature range. Contiguous pairs of Lights-On/Lights-Off probes of the same fluorescent color are used to scan hundreds of nucleotides for the presence of mutations. Sets of probes in different colors can be combined in the same tube to analyze even longer single-stranded targets. Each set of hybridized Lights-On/Lights-Off probes generates a composite fluorescent contour, which is mathematically converted to a sequence-specific fluorescent signature. The versatility and broad utility of this new technology is illustrated in this report by characterization of variant sequences in three different DNA targets: the rpoB gene of Mycobacterium tuberculosis, a sequence in the mitochondrial cytochrome C oxidase subunit 1 gene of nematodes and the V3 hypervariable region of the bacterial 16 s ribosomal RNA gene. We anticipate widespread use of these technologies for diagnostics, species identification and basic research. PMID:22879378
Purifying Selection on Exonic Splice Enhancers in Intronless Genes
Savisaar, Rosina; Hurst, Laurence D.
2016-01-01
Exonic splice enhancers (ESEs) are short nucleotide motifs, enriched near exon ends, that enhance the recognition of the splice site and thus promote splicing. Are intronless genes under selection to avoid these motifs so as not to attract the splicing machinery to an mRNA that should not be spliced, thereby preventing the production of an aberrant transcript? Consistent with this possibility, we find that ESEs in putative recent retrocopies are at a higher density and evolving faster than those in other intronless genes, suggesting that they are being lost. Moreover, intronless genes are less dense in putative ESEs than intron-containing ones. However, this latter difference is likely due to the skewed base composition of intronless sequences, a skew that is in line with the general GC richness of few exon genes. Indeed, after controlling for such biases, we find that both intronless and intron-containing genes are denser in ESEs than expected by chance. Importantly, nucleotide-controlled analysis of evolutionary rates at synonymous sites in ESEs indicates that the ESEs in intronless genes are under purifying selection in both human and mouse. We conclude that on the loss of introns, some but not all, ESE motifs are lost, the remainder having functions beyond a role in splice promotion. These results have implications for the design of intronless transgenes and for understanding the causes of selection on synonymous sites. PMID:26802218
Liu, Qiu-Ning; Chai, Xin-Yue; Bian, Dan-Dan; Zhou, Chun-Lin; Tang, Bo-Ping
2016-01-01
The mitochondrial (mt) genome can provide important information for the understanding of phylogenetic relationships. The complete mt genome of Plodia interpunctella (Lepidoptera: Pyralidae) has been sequenced. The circular genome is 15 287 bp in size, encoding 13 protein-coding genes (PCGs), 2 rRNA genes, 22 tRNA genes, and a control region. The AT skew of this mt genome is slightly negative, and the nucleotide composition is biased toward A+T nucleotides (80.15%). All PCGs start with the typical ATN (ATA, ATC, ATG, and ATT) codons, except for the cox1 gene which may start with the CGA codon. Four of the 13 PCGs harbor the incomplete termination codon T or TA. All the tRNA genes are folded into the typical clover-leaf structure of mitochondrial tRNA, except for trnS1 (AGN) in which the DHU arm fails to form a stable stem-loop structure. The overlapping sequences are 35 bp in total and are found in seven different locations. A total of 240 bp of intergenic spacers are scattered in 16 regions. The control region of the mt genome is 327 bp in length and consisted of several features common to the sequenced lepidopteran insects. Phylogenetic analysis based on 13 PCGs using the Maximum Likelihood method shows that the placement of P. interpunctella was within the Pyralidae.
Wang, Pei; Song, Fan; Cai, Wanzhi
2014-01-01
Insect mitochondrial genomes are very important to understand the molecular evolution as well as for phylogenetic and phylogeographic studies of the insects. The Miridae are the largest family of Heteroptera encompassing more than 11,000 described species and of great economic importance. For better understanding the diversity and the evolution of plant bugs, we sequence five new mitochondrial genomes and present the first comparative analysis of nine mitochondrial genomes of mirids available to date. Our result showed that gene content, gene arrangement, base composition and sequences of mitochondrial transcription termination factor were conserved in plant bugs. Intra-genus species shared more conserved genomic characteristics, such as nucleotide and amino acid composition of protein-coding genes, secondary structure and anticodon mutations of tRNAs, and non-coding sequences. Control region possessed several distinct characteristics, including: variable size, abundant tandem repetitions, and intra-genus conservation; and was useful in evolutionary and population genetic studies. The AGG codon reassignments were investigated between serine and lysine in the genera Adelphocoris and other cimicomorphans. Our analysis revealed correlated evolution between reassignments of the AGG codon and specific point mutations at the antidocons of tRNALys and tRNASer(AGN). Phylogenetic analysis indicated that mitochondrial genome sequences were useful in resolving family level relationship of Cimicomorpha. Comparative evolutionary analysis of plant bug mitochondrial genomes allowed the identification of previously neglected coding genes or non-coding regions as potential molecular markers. The finding of the AGG codon reassignments between serine and lysine indicated the parallel evolution of the genetic code in Hemiptera mitochondrial genomes. PMID:24988409
Nucleic acid analysis using terminal-phosphate-labeled nucleotides
Korlach, Jonas [Ithaca, NY; Webb, Watt W [Ithaca, NY; Levene, Michael [Ithaca, NY; Turner, Stephen [Ithaca, NY; Craighead, Harold G [Ithaca, NY; Foquet, Mathieu [Ithaca, NY
2008-04-22
The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.
NASA Astrophysics Data System (ADS)
Schultz, Christian P.; Bârzu, Octavian; Mantsch, Henry H.
2000-03-01
The functional role of CMP kinases is to regenerate mono-phosphate nucleotides in cells by transferring phosphate residues from tri-phosphorylated nucleotides to monophosphorylated nucleotides. These enzymes possess two binding sites and maintain a highly conserved secondary structure. They are essential for cell survival. Herein we compare the infrared spectra of two similar, but not identical enzymes, the CMP kinases from Escherichia coli and Bacillus subtilis. A two-dimensional cross correlation analysis of the infrared spectra reveals differences in the denaturation behavior of the two proteins. Different secondary structure elements show different time-delayed or advanced unfolding events in the two enzymes. When bound to the active sites, the two nucleotide-substrates CMP and ATP exert a stabilizing effect on the structure of both proteins. The changes observed upon thermal denaturation are different for the two enzymes. Model 2D correlations are used to simulate the different denaturation of the two enzymes. Thermal denaturation and aggregation can be distinguished as two processes separated in time.
[Determination of genetic bases of auxotrophy in Yersinia pestis ssp. caucasica strains].
Odinokov, G N; Eroshenko, G A; Kukleva, L M; Shavina, N Iu; Krasnov, Ia M; Kutyrev, V V
2012-04-01
Based on the results of computer analysis of nucleotide sequences in strains Yersinia pestis and Y. pseudotuberculosis recorded in the files of NCBI GenBank database, differences between genes argA, aroG, aroF, thiH, and thiG of strain Pestoides F (subspecies caucasica) were found, compared to other strains of plaque agent and pseudotuberculosis microbe. Using PCR with calculated primers and the method of sequence analysis, the structure of variable regions of these genes was studied in 96 natural Y. pestis and Y. pseudotuberculosis strains. It was shown that all examined strains of subspecies caucasica, unlike strains of plague-causing agent of other subspecies and pseudotubercolosis microbe, had identical mutations in genes argA (integration of the insertion sequence IS100), aroG (insertion of ten nucleotides), aroF (inserion of IS100), thiH (insertion of nucleotide T), and thiG (deletion of 13 nucleotides). These mutations are the reason for the absence in strains belonging to this subspecies of the ability to synthesize arginine, phenylalanine, tyrosine, and vitamin B1 (thiamine), and cause their auxotrophy for these growth factors.
Quantitative Analysis of Guanine Nucleotide Exchange Factors (GEFs) as Enzymes
Randazzo, Paul A; Jian, Xiaoying; Chen, Pei-Wen; Zhai, Peng; Soubias, Olivier; Northup, John K
2014-01-01
The proteins that possess guanine nucleotide exchange factor (GEF) activity, which include about ~800 G protein coupled receptors (GPCRs),1 15 Arf GEFs,2 81 Rho GEFs,3 8 Ras GEFs,4 and others for other families of GTPases,5 catalyze the exchange of GTP for GDP on all regulatory guanine nucleotide binding proteins. Despite their importance as catalysts, relatively few exchange factors (we are aware of only eight for ras superfamily members) have been rigorously characterized kinetically.5–13 In some cases, kinetic analysis has been simplistic leading to erroneous conclusions about mechanism (as discussed in a recent review14). In this paper, we compare two approaches for determining the kinetic properties of exchange factors: (i) examining individual equilibria, and; (ii) analyzing the exchange factors as enzymes. Each approach, when thoughtfully used,14,15 provides important mechanistic information about the exchange factors. The analysis as enzymes is described in further detail. With the focus on the production of the biologically relevant guanine nucleotide binding protein complexed with GTP (G•GTP), we believe it is conceptually simpler to connect the kinetic properties to cellular effects. Further, the experiments are often more tractable than those used to analyze the equilibrium system and, therefore, more widely accessible to scientists interested in the function of exchange factors. PMID:25332840
Bergman, Juraj; Mitrikeski, Petar T.
2015-01-01
Summary Sporulation efficiency in the yeast Saccharomyces cerevisiae is a well-established model for studying quantitative traits. A variety of genes and nucleotides causing different sporulation efficiencies in laboratory, as well as in wild strains, has already been extensively characterised (mainly by reciprocal hemizygosity analysis and nucleotide exchange methods). We applied a different strategy in order to analyze the variation in sporulation efficiency of laboratory yeast strains. Coupling classical quantitative genetic analysis with simulations of phenotypic distributions (a method we call phenotype modelling) enabled us to obtain a detailed picture of the quantitative trait loci (QTLs) relationships underlying the phenotypic variation of this trait. Using this approach, we were able to uncover a dominant epistatic inheritance of loci governing the phenotype. Moreover, a molecular analysis of known causative quantitative trait genes and nucleotides allowed for the detection of novel alleles, potentially responsible for the observed phenotypic variation. Based on the molecular data, we hypothesise that the observed dominant epistatic relationship could be caused by the interaction of multiple quantitative trait nucleotides distributed across a 60--kb QTL region located on chromosome XIV and the RME1 locus on chromosome VII. Furthermore, we propose a model of molecular pathways which possibly underlie the phenotypic variation of this trait. PMID:27904371
The complete sequence of Cymbidium mosaic virus from Vanilla fragrans in Hainan, China.
He, Zhen; Jiang, Dongmei; Liu, Aiqin; Sang, Liwei; Li, Wenfeng; Li, Shifang
2011-06-01
The complete nucleotide sequence of Cymbidium mosaic virus (CymMV) isolated from vanilla in Hainan province, China was determined for the first time. It comprised 6,224 nucleotides; sequence analysis suggested that the isolate we obtained was a member of the genus Potexvirus, and its sequence shared 86.67-96.61% identities with previously reported sequences. Phylogenetic analysis suggested that CymMV from vanilla fragrans was clustered into subgroup A and the isolates in this subgroup displayed little regional difference.
Mangericao, Tatiana C; Peng, Zhanhao; Zhang, Xuegong
2016-01-11
CRISPR has been becoming a hot topic as a powerful technique for genome editing for human and other higher organisms. The original CRISPR-Cas (Clustered Regularly Interspaced Short Palindromic Repeats coupled with CRISPR-associated proteins) is an important adaptive defence system for prokaryotes that provides resistance against invading elements such as viruses and plasmids. A CRISPR cassette contains short nucleotide sequences called spacers. These unique regions retain a history of the interactions between prokaryotes and their invaders in individual strains and ecosystems. One important ecosystem in the human body is the human gut, a rich habitat populated by a great diversity of microorganisms. Gut microbiomes are important for human physiology and health. Metagenome sequencing has been widely applied for studying the gut microbiomes. Most efforts in metagenome study has been focused on profiling taxa compositions and gene catalogues and identifying their associations with human health. Less attention has been paid to the analysis of the ecosystems of microbiomes themselves especially their CRISPR composition. We conducted a preliminary analysis of CRISPR sequences in a human gut metagenomic data set of Chinese individuals of type-2 diabetes patients and healthy controls. Applying an available CRISPR-identification algorithm, PILER-CR, we identified 3169 CRISPR cassettes in the data, from which we constructed a set of 1302 unique repeat sequences and 36,709 spacers. A more extensive analysis was made for the CRISPR repeats: these repeats were submitted to a more comprehensive clustering and classification using the web server tool CRISPRmap. All repeats were compared with known CRISPRs in the database CRISPRdb. A total of 784 repeats had matches in the database, and the remaining 518 repeats from our set are potentially novel ones. The computational analysis of CRISPR composition based contigs of metagenome sequencing data is feasible. It provides an efficient approach for finding potential novel CRISPR arrays and for analysing the ecosystem and history of human microbiomes.
Kondo, Jiro; Westhof, Eric
2011-01-01
Nucleotide bases are recognized by amino acid residues in a variety of DNA/RNA binding and nucleotide binding proteins. In this study, a total of 446 crystal structures of nucleotide–protein complexes are analyzed manually and pseudo pairs together with single and bifurcated hydrogen bonds observed between bases and amino acids are classified and annotated. Only 5 of the 20 usual amino acid residues, Asn, Gln, Asp, Glu and Arg, are able to orient in a coplanar fashion in order to form pseudo pairs with nucleotide bases through two hydrogen bonds. The peptide backbone can also form pseudo pairs with nucleotide bases and presents a strong bias for binding to the adenine base. The Watson–Crick side of the nucleotide bases is the major interaction edge participating in such pseudo pairs. Pseudo pairs between the Watson–Crick edge of guanine and Asp are frequently observed. The Hoogsteen edge of the purine bases is a good discriminatory element in recognition of nucleotide bases by protein side chains through the pseudo pairing: the Hoogsteen edge of adenine is recognized by various amino acids while the Hoogsteen edge of guanine is only recognized by Arg. The sugar edge is rarely recognized by either the side-chain or peptide backbone of amino acid residues. PMID:21737431
Rasmussen, C.; Purcell, M.K.; Gregg, J.L.; LaPatra, S.E.; Winton, J.R.; Hershberger, P.K.
2010-01-01
The mesomycetozoean parasite Ichthyophonus hoferi is most commonly associated with marine fish hosts but also occurs in some components of the freshwater rainbow trout Oncorhynchus mykiss aquaculture industry in Idaho, USA. It is not certain how the parasite was introduced into rainbow trout culture, but it might have been associated with the historical practice of feeding raw, ground common carp Cyprinus carpio that were caught by commercial fisherman. Here, we report a major genetic division between west coast freshwater and marine isolates of Ichthyophonus hoferi. Sequence differences were not detected in 2 regions of the highly conserved small subunit (18S) rDNA gene; however, nucleotide variation was seen in internal transcribed spacer loci (ITS1 and ITS2), both within and among the isolates. Intra-isolate variation ranged from 2.4 to 7.6 nucleotides over a region consisting of ~740 bp. Majority consensus sequences from marine/anadromous hosts differed in only 0 to 3 nucleotides (99.6 to 100% nucleotide identity), while those derived from freshwater rainbow trout had no nucleotide substitutions relative to each other. However, the consensus sequences between isolates from freshwater rainbow trout and those from marine/anadromous hosts differed in 13 to 16 nucleotides (97.8 to 98.2% nucleotide identity).
Nawaz, Zarqa; Kakar, Kaleem Ullah; Saand, Mumtaz A; Shu, Qing-Yao
2014-10-04
Cyclic nucleotide-gated channels (CNGCs) are Ca2+-permeable cation transport channels, which are present in both animal and plant systems. They have been implicated in the uptake of both essential and toxic cations, Ca2+ signaling, pathogen defense, and thermotolerance in plants. To date there has not been a genome-wide overview of the CNGC gene family in any economically important crop, including rice (Oryza sativa L.). There is an urgent need for a thorough genome-wide analysis and experimental verification of this gene family in rice. In this study, a total of 16 full length rice CNGC genes distributed on chromosomes 1-6, 9 and 12, were identified by employing comprehensive bioinformatics analyses. Based on phylogeny, the family of OsCNGCs was classified into four major groups (I-IV) and two sub-groups (IV-A and IV- B). Likewise, the CNGCs from all plant lineages clustered into four groups (I-IV), where group II was conserved in all land plants. Gene duplication analysis revealed that both chromosomal segmentation (OsCNGC1 and 2, 10 and 11, 15 and 16) and tandem duplications (OsCNGC1 and 2) significantly contributed to the expansion of this gene family. Motif composition and protein sequence analysis revealed that the CNGC specific domain "cyclic nucleotide-binding domain (CNBD)" comprises a "phosphate binding cassette" (PBC) and a "hinge" region that is highly conserved among the OsCNGCs. In addition, OsCNGC proteins also contain various other functional motifs and post-translational modification sites. We successively built a stringent motif: (LI-X(2)-[GS]-X-[FV]-X-G-[1]-ELL-X-W-X(12,22)-SA-X(2)-T-X(7)-[EQ]-AF-X-L) that recognizes the rice CNGCs specifically. Prediction of cis-acting regulatory elements in 5' upstream sequences and expression analyses through quantitative qPCR demonstrated that OsCNGC genes were highly responsive to multiple stimuli including hormonal (abscisic acid, indoleacetic acid, kinetin and ethylene), biotic (Pseudomonas fuscovaginae and Xanthomonas oryzae pv. oryzae) and abiotic (cold) stress. There are 16 CNGC genes in rice, which were probably expanded through chromosomal segmentation and tandem duplications and comprise a PBC and a "hinge" region in the CNBD domain, featured by a stringent motif. The various cis-acting regulatory elements in the upstream sequences may be responsible for responding to multiple stimuli, including hormonal, biotic and abiotic stresses.
Promoter for Sindbis virus RNA-dependent subgenomic RNA transcription.
Levis, R; Schlesinger, S; Huang, H V
1990-04-01
Sindbis virus is a positive-strand RNA enveloped virus, a member of the Alphavirus genus of the Togaviridae family. Two species of mRNA are synthesized in cells infected with Sindbis virus; one, the 49S RNA, is the genomic RNA; the other, the 26S RNA, is a subgenomic RNA that is identical in sequence to the 3' one-third of the genomic RNA. Ou et al. (J.-H. Ou, C. M. Rice, L. Dalgarno, E. G. Strauss, and J. H. Strauss, Proc. Natl. Acad. Sci. USA 79:5235-5239, 1982) identified a highly conserved region 19 nucleotides upstream and 2 nucleotides downstream from the start of the 26S RNA and proposed that in the negative-strand template, these nucleotides compose the promoter for directing the synthesis of the subgenomic RNA. Defective interfering (DI) RNAs of Sindbis virus were used to test this proposal. A 227-nucleotide sequence encompassing 98 nucleotides upstream and 117 nucleotides downstream from the start site of the Sindbis virus subgenomic RNA was inserted into a DI genome. The DI RNA containing the insert was replicated and packaged in the presence of helper virus, and cells infected with these DI particles produced a subgenomic RNA of the size and sequence expected if the promoter was functional. The initiating nucleotide was identical to that used for Sindbis virus subgenomic mRNA synthesis. Deletion analysis showed that the minimal region required to detect transcription of a subgenomic RNA from the negative-strand template of a DI RNA was 18 or 19 nucleotides upstream and 5 nucleotides downstream from the start of the subgenomic RNA.
DNA Nucleotide Sequence Restricted by the RI Endonuclease
Hedgpeth, Joe; Goodman, Howard M.; Boyer, Herbert W.
1972-01-01
The sequence of DNA base pairs adjacent to the phosphodiester bonds cleaved by the RI restriction endonuclease in unmodified DNA from coliphage λ has been determined. The 5′-terminal nucleotide labeled with 32P and oligonucleotides up to the heptamer were analyzed from a pancreatic DNase digest. The following sequence of nucleotides adjacent to the RI break made in λ DNA was deduced from these data and from the 3′-dinucleotide sequence and nearest-neighbor analysis obtained from repair synthesis with the DNA polymerase of Rous sarcoma virus [Formula: see text] The RI endonuclease cleavage of the phosphodiester bonds (indicated by arrows) generates 5′-phosphoryls and short cohesive termini of four nucleotides, pApApTpT. The most striking feature of the sequence is its symmetry. PMID:4343974
Some parameters relevant to affinity chromatography on immobilized nucleotides
Lowe, C. R.; Harvey, M. J.; Craven, D. B.; Dean, P. D. G.
1973-01-01
1. The suitability of cellulose and Sepharose as supports for affinity chromatography of two groups of cofactor-linked enzymes, dehydrogenases and kinases, was examined. Sepharose was found to be superior. 2. The selective capacities of the columns were measured by frontal analysis and are discussed in relation to the nucleotide contents. 3. The effect of various concentrations of enzyme and of non-specific protein on the performance of the affinity columns, and the effects of equilibration time, flow rate, sample volume and dilution of the nucleotide were examined. 4. The effect of interposing polymethylene and polyglycine extension arms between the matrix backbone and the nucleotide was investigated for several cofactor-dependent enzymes. Maximum binding was observed with an extension arm 0.8–1nm long. PMID:4354739
Structure and Evolution of Chlorate Reduction Composite Transposons
Clark, Iain C.; Melnyk, Ryan A.; Engelbrektson, Anna; Coates, John D.
2013-01-01
ABSTRACT The genes for chlorate reduction in six bacterial strains were analyzed in order to gain insight into the metabolism. A newly isolated chlorate-reducing bacterium (Shewanella algae ACDC) and three previously isolated strains (Ideonella dechloratans, Pseudomonas sp. strain PK, and Dechloromarinus chlorophilus NSS) were genome sequenced and compared to published sequences (Alicycliphilus denitrificans BC plasmid pALIDE01 and Pseudomonas chloritidismutans AW-1). De novo assembly of genomes failed to join regions adjacent to genes involved in chlorate reduction, suggesting the presence of repeat regions. Using a bioinformatics approach and finishing PCRs to connect fragmented contigs, we discovered that chlorate reduction genes are flanked by insertion sequences, forming composite transposons in all four newly sequenced strains. These insertion sequences delineate regions with the potential to move horizontally and define a set of genes that may be important for chlorate reduction. In addition to core metabolic components, we have highlighted several such genes through comparative analysis and visualization. Phylogenetic analysis places chlorate reductase within a functionally diverse clade of type II dimethyl sulfoxide (DMSO) reductases, part of a larger family of enzymes with reactivity toward chlorate. Nucleotide-level forensics of regions surrounding chlorite dismutase (cld), as well as its phylogenetic clustering in a betaproteobacterial Cld clade, indicate that cld has been mobilized at least once from a perchlorate reducer to build chlorate respiration. PMID:23919996
Li, Wei; Zhang, Xin-Cheng; Zhao, Jian; Shi, Yan; Zhu, Xin-Ping
2015-01-25
Cuora trifasciata has become one of the most critically endangered species in the world. The complete mitochondrial genome of C. trifasciata (Chinese three-striped box turtle) was determined in this study. Its mitochondrial genome is a 16,575-bp-long circular molecule that consists of 37 genes that are typically found in other vertebrates. And the basic characteristics of the C. trifasciata mitochondrial genome were also determined. Moreover, a comparison of C. trifasciata with Cuora cyclornata, Cuora pani and Cuora aurocapitata indicated that the four mitogenomics differed in length, codons, overlaps, 13 protein-coding genes (PCGs), ND3, rRNA genes, control region, and other aspects. Phylogenetic analysis with Bayesian inference and maximum likelihood based on 12 protein-coding genes of the genus Cuora indicated the phylogenetic position of C. trifasciata within Cuora. The phylogenetic analysis also showed that C. trifasciata from Vietnam and China formed separate monophyletic clades with different Cuora species. The results of nucleotide base compositions, protein-coding genes and phylogenetic analysis showed that C. trifasciata from these two countries may represent different Cuora species. Copyright © 2014 Elsevier B.V. All rights reserved.
Blanco López, S L; Moal, J; San Juan Serrano, F
2000-09-01
Reversed-phase HPLC was applied to obtain a sensitive and efficient means for quantitating nucleotides in the mussel Mytilus galloprovincialis. We obtained a good separation of adenylic, guanylic, uridylic and cytidylic nucleotides. Adenine nucleotides play a critical role in the regulation and integration of cellular metabolism; particularly in the mantle tissue in the mussel, they are involved in the regulation of the enzyme glycogen phosphorylase, a key enzyme in the transfer of bioenergetic reserves (glycogen) to gametogenic development; it is of great importance to have a measure of the concentrations in vivo during the reproductive cycle of the organism. Different elution conditions were tested: isocratic versus step gradient elution, different mobile phase pH and the type and proportion of ion-pairing agent added to the mobile phase. The best method was selected and the separation and accurate determination of adenine, citidine, guanine and uridine nucleotides was accomplished within a 20-min run, with UV-Vis detection (254 nm).
Two nucleotide binding sites modulate ( sup 3 H) glyburide binding to rat cortex membranes
DOE Office of Scientific and Technical Information (OSTI.GOV)
Johnson, D.E.; Gopalakrishnan, M.; Triggle, D.J.
1991-03-11
The effects of nucleotides on the binding of the ATP-dependent K{sup +}-channel antagonist ({sup 3}H)glyburide (GLB) to rat cortex membranes were examined. Nucleotide triphosphates (NTPs) and nucleotide diphosphate (NDPs) inhibited the binding of GLB. This effect was dependent on the presence of dithiothreitol (DTT). Inhibition of binding by NTPs, with the exception of ATP{gamma}S, was dependent on the presence of Mg{sup 2+}. GLB binding showed a biphasic response to ADP: up to 3 mM, ADP inhibited binding, and above this concentration GLB binding increased rapidly, and was restored to normal levels by 10 mM ADP. In the presence of Mg{supmore » 2+}, ADP did not stimulate binding. Saturation analysis in the presence of Mg{sup 2+} and increasing concentrations of ADP showed that ADP results primarily in a change of the B{sub max} for GLB binding. The differential effects of NTPS and NDPs indicate that two nucleotide binding sites regulate GLB binding.« less
Non-uniqueness of factors constraint on the codon usage in Bombyx mori.
Jia, Xian; Liu, Shuyu; Zheng, Hao; Li, Bo; Qi, Qi; Wei, Lei; Zhao, Taiyi; He, Jian; Sun, Jingchen
2015-05-06
The analysis of codon usage is a good way to understand the genetic and evolutionary characteristics of an organism. However, there are only a few reports related with the codon usage of the domesticated silkworm, Bombyx mori (B. mori). Hence, the codon usage of B. mori was analyzed here to reveal the constraint factors and it could be helpful to improve the bioreactor based on B. mori. A total of 1,097 annotated mRNA sequences from B. mori were analyzed, revealing there is only a weak codon bias. It also shows that the gene expression level is related to the GC content, and the amino acids with higher general average hydropathicity (GRAVY) and aromaticity (Aromo). And the genes on the primary axis are strongly positively correlated with the GC content, and GC3s. Meanwhile, the effective number of codons (ENc) is strongly correlated with codon adaptation index (CAI), gene length, and Aromo values. However, the ENc values are correlated with the second axis, which indicates that the codon usage in B. mori is affected by not only mutation pressure and natural selection, but also nucleotide composition and the gene expression level. It is also associated with Aromo values, and gene length. Additionally, B. mori has a greater relative discrepancy in codon preferences with Drosophila melanogaster (D. melanogaster) or Saccharomyces cerevisiae (S. cerevisiae) than with Arabidopsis thaliana (A. thaliana), Escherichia coli (E. coli), or Caenorhabditis elegans (C. elegans). The codon usage bias in B. mori is relatively weak, and many influence factors are found here, such as nucleotide composition, mutation pressure, natural selection, and expression level. Additionally, it is also associated with Aromo values, and gene length. Among them, natural selection might play a major role. Moreover, the "optimal codons" of B. mori are all encoded by G and C, which provides useful information for enhancing the gene expression in B. mori through codon optimization.
Hsiao, Tun-Jen; Wu, Lawrence Shih-Hsin; Hwang, Yuchi; Huang, Shih-Yi; Lin, Eugene
2010-04-01
Sibutramine, a serotonin and norepinephrine reuptake inhibitor, is used as an anti-obesity drug. Several pharmacogenetic studies have shown correlations between sibutramine effects and genetic variants, such as the 825C/T (rs5443) single nucleotide polymorphism (SNP) in the guanine nucleotide binding protein beta polypeptide 3 (GNB3) gene. In this study, our goal was to investigate whether a common SNP, -866G/A (rs659366), in the uncoupling protein 2 (UCP2) gene could influence weight reduction and body composition under sibutramine therapy in an obese Taiwanese population. The study included 131 obese patients, 44 in the placebo group and 87 in the sibutramine group. We assessed the measures of weight loss and body fat reduction at the end of a 12-week treatment period by analysis of covariance (ANCOVA) models using gender, baseline weight, and body fat percentage at baseline as covariates. By comparing the placebo and sibutramine groups with ANCOVA, our data showed a strong effect of sibutramine on weight loss in the combined UCP2 -866 AA + GA genotype groups (p < 0.001). Similarly, a strong effect of sibutramine on body fat percentage loss was found for individuals with the AA or GA genotypes (p < 0.001). In contrast, sibutramine had no significant effect on weight loss (p = 0.063) or body fat percentage loss (p = 0.194) for individuals with the wild-type GG genotype, compared with the placebo group of the same genotype. Moreover, a potential gene-gene interaction between UCP2 and GNB3 was identified by multiple linear regression models for the weight loss (p < 0.001) and for the percent fat loss (p = 0.031) in response to sibutramine. The results suggest that the UCP2 gene may contribute to weight loss and fat change in response to sibutramine therapy in obese Taiwanese patients.
Sung, Nuri; Lee, Jungsoon; Kim, Ji-Hyun; Chang, Changsoo; Joachimiak, Andrzej; Lee, Sukyeong; Tsai, Francis T. F.
2016-01-01
Heat-shock protein of 90 kDa (Hsp90) is an essential molecular chaperone that adopts different 3D structures associated with distinct nucleotide states: a wide-open, V-shaped dimer in the apo state and a twisted, N-terminally closed dimer with ATP. Although the N domain is known to mediate ATP binding, how Hsp90 senses the bound nucleotide and facilitates dimer closure remains unclear. Here we present atomic structures of human mitochondrial Hsp90N (TRAP1N) and a composite model of intact TRAP1 revealing a previously unobserved coiled-coil dimer conformation that may precede dimer closure and is conserved in intact TRAP1 in solution. Our structure suggests that TRAP1 normally exists in an autoinhibited state with the ATP lid bound to the nucleotide-binding pocket. ATP binding displaces the ATP lid that signals the cis-bound ATP status to the neighboring subunit in a highly cooperative manner compatible with the coiled-coil intermediate state. We propose that TRAP1 is a ligand-activated molecular chaperone, which couples ATP binding to dramatic changes in local structure required for protein folding. PMID:26929380
Quantitative trait nucleotide analysis using Bayesian model selection.
Blangero, John; Goring, Harald H H; Kent, Jack W; Williams, Jeff T; Peterson, Charles P; Almasy, Laura; Dyer, Thomas D
2005-10-01
Although much attention has been given to statistical genetic methods for the initial localization and fine mapping of quantitative trait loci (QTLs), little methodological work has been done to date on the problem of statistically identifying the most likely functional polymorphisms using sequence data. In this paper we provide a general statistical genetic framework, called Bayesian quantitative trait nucleotide (BQTN) analysis, for assessing the likely functional status of genetic variants. The approach requires the initial enumeration of all genetic variants in a set of resequenced individuals. These polymorphisms are then typed in a large number of individuals (potentially in families), and marker variation is related to quantitative phenotypic variation using Bayesian model selection and averaging. For each sequence variant a posterior probability of effect is obtained and can be used to prioritize additional molecular functional experiments. An example of this quantitative nucleotide analysis is provided using the GAW12 simulated data. The results show that the BQTN method may be useful for choosing the most likely functional variants within a gene (or set of genes). We also include instructions on how to use our computer program, SOLAR, for association analysis and BQTN analysis.
Dubey, Bhawna; Meganathan, P R; Haque, Ikramul
2012-07-01
This paper reports the complete mitochondrial genome sequence of an endangered Indian snake, Python molurus molurus (Indian Rock Python). A typical snake mitochondrial (mt) genome of 17258 bp length comprising of 37 genes including the 13 protein coding genes, 22 tRNA genes, and 2 ribosomal RNA genes along with duplicate control regions is described herein. The P. molurus molurus mt. genome is relatively similar to other snake mt. genomes with respect to gene arrangement, composition, tRNA structures and skews of AT/GC bases. The nucleotide composition of the genome shows that there are more A-C % than T-G% on the positive strand as revealed by positive AT and CG skews. Comparison of individual protein coding genes, with other snake genomes suggests that ATP8 and NADH3 genes have high divergence rates. Codon usage analysis reveals a preference of NNC codons over NNG codons in the mt. genome of P. molurus. Also, the synonymous and non-synonymous substitution rates (ka/ks) suggest that most of the protein coding genes are under purifying selection pressure. The phylogenetic analyses involving the concatenated 13 protein coding genes of P. molurus molurus conformed to the previously established snake phylogeny.
Synonymous codon usage patterns in different parasitic platyhelminth mitochondrial genomes.
Chen, L; Yang, D Y; Liu, T F; Nong, X; Huang, X; Xie, Y; Fu, Y; Zheng, W P; Zhang, R H; Wu, X H; Gu, X B; Wang, S X; Peng, X R; Yang, G Y
2013-02-27
We analyzed synonymous codon usage patterns of the mitochondrial genomes of 43 parasitic platyhelminth species. The relative synonymous codon usage, the effective number of codons (NC) and the frequency of G+C at the third synonymously variable coding position were calculated. Correspondence analysis was used to determine the major variation trends shaping the codon usage patterns. Among the mitochondrial genomes of 19 trematode species, the GC content of third codon positions varied from 0.151 to 0.592, with a mean of 0.295 ± 0.116. In cestodes, the mean GC content of third codon positions was 0.254 ± 0.044. A comparison of the nucleotide composition at 4-fold synonymous sites revealed that, on average, there was a greater abundance of codons ending on U (51.9%) or A (22.7%) than on C (6.3%) or G (19.14%). Twenty-two codons, including UUU, UUA and UUG, were frequently used. In the NC-plot, most of points were distributed well below or around the expected NC curve. In addition to compositional constraints, the degree of hydrophobicity and the aromatic amino acids also influenced codon usage in the mitochondrial genomes of these 43 parasitic platyhelminth species.
Improved nucleic acid descriptors for siRNA efficacy prediction.
Sciabola, Simone; Cao, Qing; Orozco, Modesto; Faustino, Ignacio; Stanton, Robert V
2013-02-01
Although considerable progress has been made recently in understanding how gene silencing is mediated by the RNAi pathway, the rational design of effective sequences is still a challenging task. In this article, we demonstrate that including three-dimensional descriptors improved the discrimination between active and inactive small interfering RNAs (siRNAs) in a statistical model. Five descriptor types were used: (i) nucleotide position along the siRNA sequence, (ii) nucleotide composition in terms of presence/absence of specific combinations of di- and trinucleotides, (iii) nucleotide interactions by means of a modified auto- and cross-covariance function, (iv) nucleotide thermodynamic stability derived by the nearest neighbor model representation and (v) nucleic acid structure flexibility. The duplex flexibility descriptors are derived from extended molecular dynamics simulations, which are able to describe the sequence-dependent elastic properties of RNA duplexes, even for non-standard oligonucleotides. The matrix of descriptors was analysed using three statistical packages in R (partial least squares, random forest, and support vector machine), and the most predictive model was implemented in a modeling tool we have made publicly available through SourceForge. Our implementation of new RNA descriptors coupled with appropriate statistical algorithms resulted in improved model performance for the selection of siRNA candidates when compared with publicly available siRNA prediction tools and previously published test sets. Additional validation studies based on in-house RNA interference projects confirmed the robustness of the scoring procedure in prospective studies.
Pulmonary preservation studies: effects on endothelial function and pulmonary adenine nucleotides.
Paik, Hyo Chae; Hoffmann, Steven C; Egan, Thomas M
2003-02-27
Lung transplantation is an effective therapy plagued by a high incidence of early graft dysfunction, in part because of reperfusion injury. The optimal preservation solution for lung transplantation is unknown. We performed experiments using an isolated perfused rat lung model to test the effect of lung preservation with three solutions commonly used in clinical practice. Lungs were retrieved from Sprague-Dawley rats and flushed with one of three solutions: modified Euro-Collins (MEC), University of Wisconsin (UW), or low potassium dextran and glucose (LPDG), then stored cold for varying periods before reperfusion with Earle's balanced salt solution using the isolated perfused rat lung model. Outcome measures were capillary filtration coefficient (Kfc), wet-to-dry weight ratio, and lung tissue levels of adenine nucleotides and cyclic AMP. All lungs functioned well after 4 hr of storage. By 6 hr, UW-flushed lungs had a lower Kfc than LPDG-flushed lungs. After 8 hr of storage, only UW-flushed lungs had a measurable Kfc. Adenine nucleotide levels were higher in UW-flushed lungs after prolonged storage. Cyclic AMP levels correlated with Kfc in all groups. Early changes in endothelial permeability seemed to be better attenuated in lungs flushed with UW compared with LPDG or MEC; this was associated with higher amounts of adenine nucleotides. MEC-flushed lungs failed earlier than LPDG-flushed or UW-flushed lungs. The content of the solution may be more important for lung preservation than whether the ionic composition is intracellular or extracellular.
Shimamoto, I; Sonoda, S; Vazquez, P; Minaka, N; Nishiguchi, M
1998-01-01
The 3' terminal 2378 nucleotides of a wasabi strain of crucifer tobamovirus (CTMV-W) infectious to crucifer plants was determined. This includes the 3' non-coding region of 235 nucleotides, coat protein (CP) gene (468 nucleotides), movement protein (MP) gene (798 nucleotides) and C-terminal partial readthrough portion of 180 K protein gene (940 nucleotides). Comparison of the sequence with homologous regions of thirteen other tobamovirus genomes showed that it had much higher identity to those of four other crucifer tobamoviruses, 85.2% to cr-TMV and turnip vein-clearing virus (TVCV), 87.4% to oilseed rape mosaic virus (ORMV) and 87.1% to TMV-Cg, than to those of other tobamoviruses. Thus CTMV-W was most similar to ORMV and TMV-Cg in sequence, but only marginally so, whereas the location and size of its MP gene was the same as cr-TMV amd TVCV. These results, together with other analyses, show that CTMV-W is a new crucifer tobamovirus, that the five crucifer tobamoviruses can be classified into two subgroups based on MP gene organization, and that the rate of sequence change is not the same in all lineages.
Butte, Nancy F; Voruganti, V Saroja; Cole, Shelley A; Haack, Karin; Comuzzie, Anthony G; Muzny, Donna M; Wheeler, David A; Chang, Kyle; Hawes, Alicia; Gibbs, Richard A
2011-09-22
Our objective was to resequence insulin receptor substrate 2 (IRS2) to identify variants associated with obesity- and diabetes-related traits in Hispanic children. Exonic and intronic segments, 5' and 3' flanking regions of IRS2 (∼14.5 kb), were bidirectionally sequenced for single nucleotide polymorphism (SNP) discovery in 934 Hispanic children using 3730XL DNA Sequencers. Additionally, 15 SNPs derived from Illumina HumanOmni1-Quad BeadChips were analyzed. Measured genotype analysis tested associations between SNPs and obesity and diabetes-related traits. Bayesian quantitative trait nucleotide analysis was used to statistically infer the most likely functional polymorphisms. A total of 140 SNPs were identified with minor allele frequencies (MAF) ranging from 0.001 to 0.47. Forty-two of the 70 coding SNPs result in nonsynonymous amino acid substitutions relative to the consensus sequence; 28 SNPs were detected in the promoter, 12 in introns, 28 in the 3'-UTR, and 2 in the 5'-UTR. Two insertion/deletions (indels) were detected. Ten independent rare SNPs (MAF = 0.001-0.009) were associated with obesity-related traits (P = 0.01-0.00002). SNP 10510452_139 in the promoter region was shown to have a high posterior probability (P = 0.77-0.86) of influencing BMI, fat mass, and waist circumference in Hispanic children. SNP 10510452_139 contributed between 2 and 4% of the population variance in body weight and composition. None of the SNPs or indels were associated with diabetes-related traits or accounted for a previously identified quantitative trait locus on chromosome 13 for fasting serum glucose. Rare but not common IRS2 variants may play a role in the regulation of body weight but not an essential role in fasting glucose homeostasis in Hispanic children.
2013-01-01
Background Obesity, excess fat tissue in the body, can underlie a variety of medical complaints including heart disease, stroke and cancer. The pig is an excellent model organism for the study of various human disorders, including obesity, as well as being the foremost agricultural species. In order to identify genetic variants associated with fatness, we used a selective genomic approach sampling DNA from animals at the extreme ends of the fat and lean spectrum using estimated breeding values derived from a total population size of over 70,000 animals. DNA from 3 breeds (Sire Line Large White, Duroc and a white Pietrain composite line (Titan)) was used to interrogate the Illumina Porcine SNP60 Genotyping Beadchip in order to identify significant associations in terms of single nucleotide polymorphisms (SNPs) and copy number variants (CNVs). Results By sampling animals at each end of the fat/lean EBV (estimate breeding value) spectrum the whole population could be assessed using less than 300 animals, without losing statistical power. Indeed, several significant SNPs (at the 5% genome wide significance level) were discovered, 4 of these linked to genes with ontologies that had previously been correlated with fatness (NTS, FABP6, SST and NR3C2). Quantitative analysis of the data identified putative CNV regions containing genes whose ontology suggested fatness related functions (MCHR1, PPARα, SLC5A1 and SLC5A4). Conclusions Selective genotyping of EBVs at either end of the phenotypic spectrum proved to be a cost effective means of identifying SNPs and CNVs associated with fatness and with estimated major effects in a large population of animals. PMID:24225222
Novel methodologies for spectral classification of exon and intron sequences
NASA Astrophysics Data System (ADS)
Kwan, Hon Keung; Kwan, Benjamin Y. M.; Kwan, Jennifer Y. Y.
2012-12-01
Digital processing of a nucleotide sequence requires it to be mapped to a numerical sequence in which the choice of nucleotide to numeric mapping affects how well its biological properties can be preserved and reflected from nucleotide domain to numerical domain. Digital spectral analysis of nucleotide sequences unfolds a period-3 power spectral value which is more prominent in an exon sequence as compared to that of an intron sequence. The success of a period-3 based exon and intron classification depends on the choice of a threshold value. The main purposes of this article are to introduce novel codes for 1-sequence numerical representations for spectral analysis and compare them to existing codes to determine appropriate representation, and to introduce novel thresholding methods for more accurate period-3 based exon and intron classification of an unknown sequence. The main findings of this study are summarized as follows: Among sixteen 1-sequence numerical representations, the K-Quaternary Code I offers an attractive performance. A windowed 1-sequence numerical representation (with window length of 9, 15, and 24 bases) offers a possible speed gain over non-windowed 4-sequence Voss representation which increases as sequence length increases. A winner threshold value (chosen from the best among two defined threshold values and one other threshold value) offers a top precision for classifying an unknown sequence of specified fixed lengths. An interpolated winner threshold value applicable to an unknown and arbitrary length sequence can be estimated from the winner threshold values of fixed length sequences with a comparable performance. In general, precision increases as sequence length increases. The study contributes an effective spectral analysis of nucleotide sequences to better reveal embedded properties, and has potential applications in improved genome annotation.
Yamada, Yoshiji; Sakuma, Jun; Takeuchi, Ichiro; Yasukochi, Yoshiki; Kato, Kimihiko; Oguri, Mitsutoshi; Fujimaki, Tetsuo; Horibe, Hideki; Muramatsu, Masaaki; Sawabe, Motoji; Fujiwara, Yoshinori; Taniguchi, Yu; Obuchi, Shuichi; Kawai, Hisashi; Shinkai, Shoji; Mori, Seijiro; Arai, Tomio; Tanaka, Masashi
2017-06-13
We have performed exome-wide association studies to identify genetic variants that influence body mass index or confer susceptibility to obesity or metabolic syndrome in Japanese. The exome-wide association study for body mass index included 12,890 subjects, and those for obesity and metabolic syndrome included 12,968 subjects (3954 individuals with obesity, 9014 controls) and 6817 subjects (3998 individuals with MetS, 2819 controls), respectively. Exome-wide association studies were performed with Illumina HumanExome-12 DNA Analysis BeadChip or Infinium Exome-24 BeadChip arrays. The relation of genotypes of single nucleotide polymorphisms to body mass index was examined by linear regression analysis, and that of allele frequencies of single nucleotide polymorphisms to obesity or metabolic syndrome was evaluated with Fisher's exact test. The exome-wide association studies identified six, 11, and 40 single nucleotide polymorphisms as being significantly associated with body mass index, obesity (P <1.21 × 10-6), or metabolic syndrome (P <1.20 × 10-6), respectively. Subsequent multivariable logistic regression analysis with adjustment for age and sex revealed that three and five single nucleotide polymorphisms were related (P < 0.05) to obesity or metabolic syndrome, respectively, with one of these latter polymorphisms-rs7350481 (C/T) at chromosome 11q23.3-also being significantly (P < 3.13 × 10-4) associated with metabolic syndrome. The polymorphism rs7350481 may thus be a novel susceptibility locus for metabolic syndrome in Japanese. In addition, single nucleotide polymorphisms in three genes (CROT, TSC1, RIN3) and at four loci (ANKK1, ZNF804B, CSRNP3, 17p11.2) were implicated as candidate determinants of obesity and metabolic syndrome, respectively.
Dai, Weiran; Ye, Ziliang; Lu, Haili; Su, Qiang; Li, Hui; Li, Lang
2018-02-23
The results showed that there was a certain correlation between the single nucleotide polymorphism of IL-10-1082G/A and rheumatic heart disease, but there was no systematic study to verify this conclusion. Systematic review of the association between single nucleotide polymorphism of IL-10-1082G/A locus and rheumatic heart disease. Computer retrieval PubMed, EMbase, Cochrane Library, CBM, CNKI, VIP and Data WanFang, the retrieval time limit from inception to June 2017. A case control study of single nucleotide polymorphisms and rheumatic heart disease in patients with rheumatic heart disease in the IL-10-1082G/A was collected. Two researchers independently screened the literature, extracted data and evaluated the risk of bias in the study, and using RevMan5.3 software for data analysis. A total of 3 case control studies were included, including 318 patients with rheumatic heart disease and 502 controls. Meta-analysis showed that there was no correlation between IL-10-1082G/A gene polymorphism and rheumatic heart disease [AA+AG VS GG: OR = 0.62, 95% CI (0.28, 1.39), P = 0.25; AA VS AG+GG: OR = 0.73, 95% CI (0.54, 1.00), P = 0.05; AA VS GG: OR = 0.70, 95% CI(0.47, 1.05), P = 0.08; AG VS GG: OR = 0.65, 95% CI (0.22, 1.92), P = 0.43; A VS G: OR = 0.87, 95% CI (0.71, 1.06), P = 0.17]. When AA is a recessive gene, the single nucleotide polymorphism of IL-10-1082G/A is associated with the presence of rheumatic heart disease. Due to the limitations of the quantity and quality of the included literatures, the further research results were still needed.
Promoter for Sindbis virus RNA-dependent subgenomic RNA transcription.
Levis, R; Schlesinger, S; Huang, H V
1990-01-01
Sindbis virus is a positive-strand RNA enveloped virus, a member of the Alphavirus genus of the Togaviridae family. Two species of mRNA are synthesized in cells infected with Sindbis virus; one, the 49S RNA, is the genomic RNA; the other, the 26S RNA, is a subgenomic RNA that is identical in sequence to the 3' one-third of the genomic RNA. Ou et al. (J.-H. Ou, C. M. Rice, L. Dalgarno, E. G. Strauss, and J. H. Strauss, Proc. Natl. Acad. Sci. USA 79:5235-5239, 1982) identified a highly conserved region 19 nucleotides upstream and 2 nucleotides downstream from the start of the 26S RNA and proposed that in the negative-strand template, these nucleotides compose the promoter for directing the synthesis of the subgenomic RNA. Defective interfering (DI) RNAs of Sindbis virus were used to test this proposal. A 227-nucleotide sequence encompassing 98 nucleotides upstream and 117 nucleotides downstream from the start site of the Sindbis virus subgenomic RNA was inserted into a DI genome. The DI RNA containing the insert was replicated and packaged in the presence of helper virus, and cells infected with these DI particles produced a subgenomic RNA of the size and sequence expected if the promoter was functional. The initiating nucleotide was identical to that used for Sindbis virus subgenomic mRNA synthesis. Deletion analysis showed that the minimal region required to detect transcription of a subgenomic RNA from the negative-strand template of a DI RNA was 18 or 19 nucleotides upstream and 5 nucleotides downstream from the start of the subgenomic RNA. Images PMID:2319651
Burroughs, A. Maxwell; Zhang, Dapeng; Schäffer, Daniel E.; Iyer, Lakshminarayan M.; Aravind, L.
2015-01-01
Cyclic di- and linear oligo-nucleotide signals activate defenses against invasive nucleic acids in animal immunity; however, their evolutionary antecedents are poorly understood. Using comparative genomics, sequence and structure analysis, we uncovered a vast network of systems defined by conserved prokaryotic gene-neighborhoods, which encode enzymes generating such nucleotides or alternatively processing them to yield potential signaling molecules. The nucleotide-generating enzymes include several clades of the DNA-polymerase β-like superfamily (including Vibrio cholerae DncV), a minimal version of the CRISPR polymerase and DisA-like cyclic-di-AMP synthetases. Nucleotide-binding/processing domains include TIR domains and members of a superfamily prototyped by Smf/DprA proteins and base (cytokinin)-releasing LOG enzymes. They are combined in conserved gene-neighborhoods with genes for a plethora of protein superfamilies, which we predict to function as nucleotide-sensors and effectors targeting nucleic acids, proteins or membranes (pore-forming agents). These systems are sometimes combined with other biological conflict-systems such as restriction-modification and CRISPR/Cas. Interestingly, several are coupled in mutually exclusive neighborhoods with either a prokaryotic ubiquitin-system or a HORMA domain-PCH2-like AAA+ ATPase dyad. The latter are potential precursors of equivalent proteins in eukaryotic chromosome dynamics. Further, components from these nucleotide-centric systems have been utilized in several other systems including a novel diversity-generating system with a reverse transcriptase. We also found the Smf/DprA/LOG domain from these systems to be recruited as a predicted nucleotide-binding domain in eukaryotic TRPM channels. These findings point to evolutionary and mechanistic links, which bring together CRISPR/Cas, animal interferon-induced immunity, and several other systems that combine nucleic-acid-sensing and nucleotide-dependent signaling. PMID:26590262
Ma, Liang; Salas, Omar; Bowler, Kyle; Oren-Young, Liat; Bar-Peled, Maor; Sharon, Amir
2017-02-01
Botrytis cinerea is a model plant-pathogenic fungus that causes grey mould and rot diseases in a wide range of agriculturally important crops. A previous study has identified two enzymes and corresponding genes (bcdh, bcer) that are involved in the biochemical transformation of uridine diphosphate (UDP)-glucose, the major fungal wall nucleotide sugar precursor, to UDP-rhamnose. We report here that deletion of bcdh, the first biosynthetic gene in the metabolic pathway, or of bcer, the second gene in the pathway, abolishes the production of rhamnose-containing glycans in these mutant strains. Deletion of bcdh or double deletion of both bcdh and bcer has no apparent effect on fungal development or pathogenicity. Interestingly, deletion of the bcer gene alone adversely affects fungal development, giving rise to altered hyphal growth and morphology, as well as reduced sporulation, sclerotia production and virulence. Treatments with wall stressors suggest the alteration of cell wall integrity. Analysis of nucleotide sugars reveals the accumulation of the UDP-rhamnose pathway intermediate UDP-4-keto-6-deoxy-glucose (UDP-KDG) in hyphae of the Δbcer strain. UDP-KDG could not be detected in hyphae of the wild-type strain, indicating fast conversion to UDP-rhamnose by the BcEr enzyme. The correlation between high UDP-KDG and modified cell wall and developmental defects raises the possibility that high levels of UDP-KDG result in deleterious effects on cell wall composition, and hence on virulence. This is the first report demonstrating that the accumulation of a minor nucleotide sugar intermediate has such a profound and adverse effect on a fungus. The ability to identify molecules that inhibit Er (also known as NRS/ER) enzymes or mimic UDP-KDG may lead to the development of new antifungal drugs. © 2016 BSPP AND JOHN WILEY & SONS LTD.
StructAlign, a Program for Alignment of Structures of DNA-Protein Complexes.
Popov, Ya V; Galitsyna, A A; Alexeevski, A V; Karyagina, A S; Spirin, S A
2015-11-01
Comparative analysis of structures of complexes of homologous proteins with DNA is important in the analysis of DNA-protein recognition. Alignment is a necessary stage of the analysis. An alignment is a matching of amino acid residues and nucleotides of one complex to residues and nucleotides of the other. Currently, there are no programs available for aligning structures of DNA-protein complexes. We present the program StructAlign, which should fill this gap. The program inputs a pair of complexes of DNA double helix with proteins and outputs an alignment of DNA chains corresponding to the best spatial fit of the protein chains.
Masking as an effective quality control method for next-generation sequencing data analysis.
Yun, Sajung; Yun, Sijung
2014-12-13
Next generation sequencing produces base calls with low quality scores that can affect the accuracy of identifying simple nucleotide variation calls, including single nucleotide polymorphisms and small insertions and deletions. Here we compare the effectiveness of two data preprocessing methods, masking and trimming, and the accuracy of simple nucleotide variation calls on whole-genome sequence data from Caenorhabditis elegans. Masking substitutes low quality base calls with 'N's (undetermined bases), whereas trimming removes low quality bases that results in a shorter read lengths. We demonstrate that masking is more effective than trimming in reducing the false-positive rate in single nucleotide polymorphism (SNP) calling. However, both of the preprocessing methods did not affect the false-negative rate in SNP calling with statistical significance compared to the data analysis without preprocessing. False-positive rate and false-negative rate for small insertions and deletions did not show differences between masking and trimming. We recommend masking over trimming as a more effective preprocessing method for next generation sequencing data analysis since masking reduces the false-positive rate in SNP calling without sacrificing the false-negative rate although trimming is more commonly used currently in the field. The perl script for masking is available at http://code.google.com/p/subn/. The sequencing data used in the study were deposited in the Sequence Read Archive (SRX450968 and SRX451773).
NASA Astrophysics Data System (ADS)
Holden, Todd; Marchese, P.; Tremberger, G., Jr.; Cheung, E.; Subramaniam, R.; Sullivan, R.; Schneider, P.; Flamholz, A.; Lieberman, D.; Cheung, T.
2008-08-01
We have characterized function related DNA sequences of various organisms using informatics techniques, including fractal dimension calculation, nucleotide and multi-nucleotide statistics, and sequence fluctuation analysis. Our analysis shows trends which differentiate extremophile from non-extremophile organisms, which could be reproduced in extraterrestrial life. Among the systems studied are radiation repair genes, genes involved in thermal shocks, and genes involved in drug resistance. We also evaluate sequence level changes that have occurred during short term evolution (several thousand generations) under extreme conditions.
Winterhagen, Patrick; Wünsche, Jens-Norbert
2016-05-01
Within a polyembryonic mango seedling tree population, the genetic background of individuals should be identical because vigorous plants for cultivation are expected to develop from nucellar embryos representing maternal clones. Due to the fact that the mango cultivar 'Hôi' is assigned to the polyembryonic ecotype, an intra-cultivar variability of ethylene receptor genes was unexpected. Ethylene receptors in plants are conserved, but the number of receptors or receptor isoforms is variable regarding different plant species. However, it is shown here that the ethylene receptor MiETR1 is present in various isoforms within the mango cultivar 'Hôi'. The investigation of single nucleotide polymorphisms revealed that different MiETR1 isoforms can not be discriminated simply by individual single nucleotide exchanges but by the specific arrangement of single nucleotide polymorphisms at certain positions in the exons of MiETR1. Furthermore, an MiETR1 isoform devoid of introns in the genomic sequence was identified. The investigation demonstrates some limitations of high resolution melting and ScreenClust analysis and points out the necessity of sequencing to identify individual isoforms and to determine the variability within the tree population.
Small Cofactors May Assist Protein Emergence from RNA World: Clues from RNA-Protein Complexes
Shen, Liang; Ji, Hong-Fang
2011-01-01
It is now widely accepted that at an early stage in the evolution of life an RNA world arose, in which RNAs both served as the genetic material and catalyzed diverse biochemical reactions. Then, proteins have gradually replaced RNAs because of their superior catalytic properties in catalysis over time. Therefore, it is important to investigate how primitive functional proteins emerged from RNA world, which can shed light on the evolutionary pathway of life from RNA world to the modern world. In this work, we proposed that the emergence of most primitive functional proteins are assisted by the early primitive nucleotide cofactors, while only a minority are induced directly by RNAs based on the analysis of RNA-protein complexes. Furthermore, the present findings have significant implication for exploring the composition of primitive RNA, i.e., adenine base as principal building blocks. PMID:21789260
Matsuda, Fumio; Nakabayashi, Ryo; Yang, Zhigang; Okazaki, Yozo; Yonemaru, Jun-ichi; Ebana, Kaworu; Yano, Masahiro; Saito, Kazuki
2015-01-01
Plants produce structurally diverse secondary (specialized) metabolites to increase their fitness for survival under adverse environments. Several bioactive compounds for new drugs have been identified through screening of plant extracts. In this study, genome-wide association studies (GWAS) were conducted to investigate the genetic architecture behind the natural variation of rice secondary metabolites. GWAS using the metabolome data of 175 rice accessions successfully identified 323 associations among 143 single nucleotide polymorphisms (SNPs) and 89 metabolites. The data analysis highlighted that levels of many metabolites are tightly associated with a small number of strong quantitative trait loci (QTLs). The tight association may be a mechanism generating strains with distinct metabolic composition through the crossing of two different strains. The results indicate that one plant species produces more diverse phytochemicals than previously expected, and plants still contain many useful compounds for human applications. PMID:25267402
Temporal Stability of the Human Skin Microbiome.
Oh, Julia; Byrd, Allyson L; Park, Morgan; Kong, Heidi H; Segre, Julia A
2016-05-05
Biogeography and individuality shape the structural and functional composition of the human skin microbiome. To explore these factors' contribution to skin microbial community stability, we generated metagenomic sequence data from longitudinal samples collected over months and years. Analyzing these samples using a multi-kingdom, reference-based approach, we found that despite the skin's exposure to the external environment, its bacterial, fungal, and viral communities were largely stable over time. Site, individuality, and phylogeny were all determinants of stability. Foot sites exhibited the most variability; individuals differed in stability; and transience was a particular characteristic of eukaryotic viruses, which showed little site-specificity in colonization. Strain and single-nucleotide variant-level analysis showed that individuals maintain, rather than reacquire, prevalent microbes from the environment. Longitudinal stability of skin microbial communities generates hypotheses about colonization resistance and empowers clinical studies exploring alterations observed in disease states. Copyright © 2016 Elsevier Inc. All rights reserved.
Jiang, Jianping; Gao, Yahui; Hou, Yali; Li, Wenhui; Zhang, Shengli; Zhang, Qin; Sun, Dongxiao
2016-01-01
The use of whole-genome resequencing to obtain more information on genetic variation could produce a range of benefits for the dairy cattle industry, especially with regard to increasing milk production and improving milk composition. In this study, we sequenced the genomes of eight Holstein bulls from four half- or full-sib families, with high and low estimated breeding values (EBVs) of milk protein percentage and fat percentage at an average effective depth of 10×, using Illumina sequencing. Over 0.9 million nonredundant short insertions and deletions (indels) [1-49 base pairs (bp)] were obtained. Among them, 3,625 indels that were polymorphic between the high and low groups of bulls were revealed and subjected to further analysis. The vast majority (76.67%) of these indels were novel. Follow-up validation assays confirmed that most (70%) of the randomly selected indels represented true variations. The indels that were polymorphic between the two groups were annotated based on the cattle genome sequence assembly (UMD3.1.69); as a result, nearly 1,137 of them were found to be located within 767 annotated genes, only 5 (0.138%) of which were located in exons. Then, by integrated analysis of the 767 genes with known quantitative trait loci (QTL); significant single-nucleotide polymorphisms (SNPs) previously identified by genome-wide association studies (GWASs) to be associated with bovine milk protein and fat traits; and the well-known pathways involved in protein, fat synthesis, and metabolism, we identified a total of 11 promising candidate genes potentially affecting milk composition traits. These were FCGR2B, CENPE, RETSAT, ACSBG2, NFKB2, TBC1D1, NLK, MAP3K1, SLC30A2, ANGPT1 and UGDH. Our findings provide a basis for further study and reveal key genes for milk composition traits in dairy cattle.
Ge, Liya; Yong, Jean Wan Hong; Tan, Swee Ngin; Yang, Xin Hao; Ong, Eng Shi
2006-11-10
A method based on solid-phase extraction (SPE) and capillary zone electrophoresis-tandem mass spectrometry (CZE-MS/MS) is described for the separation and determination of six cytokinin nucleotides in coconut water. The best CZE separation for the six cytokinin nucleotide standards was achieved using a 25 mM ammonium formate/formic acid buffer (pH 3.8) and 2% (v/v) methanol with an applied gradient separation voltage (25 kV for 32 min, and then a linear gradient to 30 kV in 5 min, finally 30 kV to the end of separation) in less than 60 min. MS/MS with multiple reaction monitoring (MRM) detection was carried out to obtain sufficient selectivity and sensitivity for the cytokinin nucleotides. The combined use of on-line sample stacking and CZE-MS/MS achieved limits of detection (LODs) in the range of 0.06-0.19 microM for the six cytokinin nucleotides at a signal-to-noise ratio of 3. Furthermore, a novel dual-step SPE procedure was developed for the pre-concentration and purification of cytokinin nucleotides using Oasis HLB and Oasis MAX cartridges. The recoveries of the cytokinin nucleotides after the dual-step SPE were in the range of 44-71%. The combination of off-line SPE, on-line sample stacking and CZE-MS/MS approach was successfully applied to screen for endogenous cytokinin nucleotides present in coconut water sample. trans-Zeatin riboside-5'-monophosphate (ZMP) was detected and quantified in coconut water by CZE-MS/MS after SPE and on-line sample stacking.
Kuwahara, Masayasu; Obika, Satoshi; Nagashima, Jun-ichi; Ohta, Yuki; Suto, Yoshiyuki; Ozaki, Hiroaki; Sawai, Hiroaki; Imanishi, Takeshi
2008-08-01
In order to systematically analyze the effects of nucleoside modification of sugar moieties in DNA polymerase reactions, we synthesized 16 modified templates containing 2',4'-bridged nucleotides and three types of 2',4'-bridged nucleoside-5'-triphospates with different bridging structures. Among the five types of thermostable DNA polymerases used, Taq, Phusion HF, Vent(exo-), KOD Dash and KOD(exo-), the KOD Dash and KOD(exo-) DNA polymerases could smoothly read through the modified templates containing 2'-O,4'-C-methylene-linked nucleotides at intervals of a few nucleotides, even at standard enzyme concentrations for 5 min. Although the Vent(exo-) DNA polymerase also read through these modified templates, kinetic study indicates that the KOD(exo-) DNA polymerase was found to be far superior to the Vent(exo-) DNA polymerase in accurate incorporation of nucleotides. When either of the DNA polymerase was used, the presence of 2',4'-bridged nucleotides on a template strand substantially decreased the reaction rates of nucleotide incorporations. The modified templates containing sequences of seven successive 2',4'-bridged nucleotides could not be completely transcribed by any of the DNA polymerases used; yields of longer elongated products decreased in the order of steric bulkiness of the modified sugars. Successive incorporation of 2',4'-bridged nucleotides into extending strands using 2',4'-bridged nucleoside-5'-triphospates was much more difficult. These data indicate that the sugar modification would have a greater effect on the polymerase reaction when it is adjacent to the elongation terminus than when it is on the template as well, as in base modification.
Advances and prospects on biomolecules functionalized carbon nanotubes.
Cui, Daxiang
2007-01-01
In recent years, functionalization of carbon nanotubes (CNTs) with biomolecules such as nucleotide acids, proteins, and polymers as well as cells have emerged as a new exciting field. Theoretical and experimental studies of structure and function of bio-inspired CNT composites have made great advances. The importance of nucleic acids, proteins, and polymers to the fundamental developments in CNT-based bio-nano-composites or devices has been recognized. In particular, biomechanics, biochemistry, thermodynamics, electronic, optical, and magnetic properties of the bio-inspired CNT composites have become a new interdisciplinary frontier in life science and nanomaterial science. Here we review some of the main advances in this field over the past few years, explore the application prospects, and discuss the issues, approaches, and challenges, with the aim of stimulating a broader interest in developing CNT-based bio-nanotechnology.
Khamrin, Pattara; Okitsu, Shoko; Ushijima, Hiroshi; Maneekarn, Niwat
2013-07-01
Epidemiological surveillance of human bocavirus (HBoV) was conducted on fecal specimens collected from hospitalized children with diarrhea in Chiang Mai, Thailand in 2011. By partial sequence analysis of VP1 gene, an unusual strain of HBoV (CMH-S011-11), was initially identified as HBoV4. The complete genome sequence of CMH-S011-11 was performed and analyzed further to clarify whether it was a recombinant strain or a new HBoV variant. Analysis of complete genome sequence revealed that the coding sequence starting from NS1, NP1 to VP1/VP2 was 4795 nucleotides long. Interestingly, the nucleotide sequence of NS1 gene of CMH-S011-11 was most closely related to the HBoV2 reference strains detected in Pakistan, which contradicted to the initial genotyping result of the partial VP1 region in the previous study. In addition, comparison of NP1 nucleotide sequence of CMH-S011-11 with those of other HBoV1-4 reference strains also revealed a high level of sequence identity with HBoV2. On the other hand, nucleotide sequence of VP1/VP2 gene of CMH-S011-11 was most closely related to those of HBoV4 reference strains detected in Nigeria. The overall full-length sequence analysis revealed that this CMH-S011-11 was grouped within HBoV4 species, but located in a separate branch from other HBoV4 prototype strains. Recombination analysis revealed that CMH-S011-11 was the result of recombination between HBoV2 and HBoV4 strains with the break point located near the start codon of VP2. Copyright © 2013 Elsevier B.V. All rights reserved.
The augmentation algorithm and molecular phylogenetic trees
NASA Technical Reports Server (NTRS)
Holmquist, R.
1978-01-01
Moore's (1977) augmentation procedure is discussed, and it is concluded that the procedure is valid for obtaining estimates of the total number of fixed nucleotide substitutions both theoretically and in practice, for both simulated and real data, and in agreement, for experimentally dense data sets, with stochastic estimates of the divergence, provided the restrictions on codon mutability resulting from natural selection are explicitly allowed for. Tateno and Nei's (1978) critique that the augmentation procedure has a systematic bias toward overestimation of the total number of nucleotide replacements is disputed, and a data analysis suggests that ancestral sequences inferred by the method of parsimony contain a large number of incorrectly assigned nucleotides.
Li, Su-Xia
2004-12-01
Single nucleotide polymorphism (SNP) is the third genetic marker after restriction fragment length polymorphism (RFLP) and short tandem repeat. It represents the most density genetic variability in the human genome and has been widely used in gene location, cloning, and research of heredity variation, as well as parenthood identification in forensic medicine. As steady heredity polymorphism, single nucleotide polymorphism is becoming the focus of attention in monitoring chimerism and minimal residual disease in the patients after allogeneic hematopoietic stem cell transplantation. The article reviews SNP heredity characterization, analysis techniques and its applications in allogeneic stem cell transplantation and other fields.
Mapping RNA Structure In Vitro with SHAPE Chemistry and Next-Generation Sequencing (SHAPE-Seq).
Watters, Kyle E; Lucks, Julius B
2016-01-01
Mapping RNA structure with selective 2'-hydroxyl acylation analyzed by primer extension (SHAPE) chemistry has proven to be a versatile method for characterizing RNA structure in a variety of contexts. SHAPE reagents covalently modify RNAs in a structure-dependent manner to create adducts at the 2'-OH group of the ribose backbone at nucleotides that are structurally flexible. The positions of these adducts are detected using reverse transcriptase (RT) primer extension, which stops one nucleotide before the modification, to create a pool of cDNAs whose lengths reflect the location of SHAPE modification. Quantification of the cDNA pools is used to estimate the "reactivity" of each nucleotide in an RNA molecule to the SHAPE reagent. High reactivities indicate nucleotides that are structurally flexible, while low reactivities indicate nucleotides that are inflexible. These SHAPE reactivities can then be used to infer RNA structures by restraining RNA structure prediction algorithms. Here, we provide a state-of-the-art protocol describing how to perform in vitro RNA structure probing with SHAPE chemistry using next-generation sequencing to quantify cDNA pools and estimate reactivities (SHAPE-Seq). The use of next-generation sequencing allows for higher throughput, more consistent data analysis, and multiplexing capabilities. The technique described herein, SHAPE-Seq v2.0, uses a universal reverse transcription priming site that is ligated to the RNA after SHAPE modification. The introduced priming site allows for the structural analysis of an RNA independent of its sequence.
Trento, Alfonsina; Viegas, Mariana; Galiano, Mónica; Videla, Cristina; Carballal, Guadalupe; Mistchenko, Alicia S.; Melero, José A.
2006-01-01
A total of 47 clinical samples were identified during an active surveillance program of respiratory infections in Buenos Aires (BA) (1999 to 2004) that contained sequences of human respiratory syncytial virus (HRSV) with a 60-nucleotide duplication in the attachment (G) protein gene. This duplication was analogous to that previously described for other three viruses also isolated in Buenos Aires in 1999 (A. Trento et al., J. Gen. Virol. 84:3115-3120, 2003). Phylogenetic analysis indicated that BA sequences with that duplication shared a common ancestor (dated about 1998) with other HRSV G sequences reported worldwide after 1999. The duplicated nucleotide sequence was an exact copy of the preceding 60 nucleotides in early viruses, but both copies of the duplicated segment accumulated nucleotide substitutions in more recent viruses at a rate apparently higher than in other regions of the G protein gene. The evolution of the viruses with the duplicated G segment apparently followed the overall evolutionary pattern previously described for HRSV, and this genotype has replaced other prevailing antigenic group B genotypes in Buenos Aires and other places. Thus, the duplicated segment represents a natural tag that can be used to track the dissemination and evolution of HRSV in an unprecedented setting. We have taken advantage of this situation to reexamine the molecular epidemiology of HRSV and to explore the natural history of this important human pathogen. PMID:16378999
Bennett, Mark; Tu, Shin-Lin; Upton, Chris; McArtor, Cassie; Gillett, Amber; Laird, Tanya; O'Dea, Mark
2017-10-15
Poxviruses have previously been detected in macropods with cutaneous papillomatous lesions, however to date, no comprehensive analysis of a poxvirus from kangaroos has been performed. Here we report the genome sequences of a western grey kangaroo poxvirus (WKPV) and an eastern grey kangaroo poxvirus (EKPV), named for the host species from which they were isolated, western grey (Macropus fuliginosus) and eastern grey (Macropus giganteus) kangaroos. Poxvirus DNA from WKPV and EKPV was isolated and entire coding genome regions determined through Roche GS Junior and Illumina Miseq sequencing, respectively. Viral genomes were assembled using MIRA and SPAdes, and annotations performed using tools available from the Viral Bioinformatics Resource Centre. Histopathology and transmission electron microscopy analysis was also performed on WKPV and its associated lesions. The WKPV and EKPV genomes show 96% identity (nucleotide) to each other and phylogenetic analysis places them on a distinct branch between the established Molluscipoxvirus and Avipoxvirus genera. WKPV and EKPV are 170 kbp and 167 kbp long, containing 165 and 162 putative genes, respectively. Together, their genomes encode up to 47 novel unique hypothetical proteins, and possess virulence proteins including a major histocompatibility complex class II inhibitor, a semaphorin-like protein, a serpin, a 3-β-hydroxysteroid dehydrogenase/δ 5→4 isomerase, and a CD200-like protein. These viruses also encode a large putative protein (WKPV-WA-039 and EKPV-SC-038) with a C-terminal domain that is structurally similar to the C-terminal domain of a cullin, suggestive of a role in the control of host ubiquitination. The relationship of these viruses to members of the Molluscipoxvirus and Avipoxvirus genera is discussed in terms of sequence similarity, gene content and nucleotide composition. A novel genus within subfamily Chordopoxvirinae is proposed to accommodate these two poxvirus species from kangaroos; we suggest the name, Thylacopoxvirus (thylaco-: [Gr.] thylakos meaning sac or pouch). Copyright © 2017 Elsevier B.V. All rights reserved.
Loconsole, Giuliana; Onelge, Nuket; Yokomi, Raymond K; Kubaa, Raied Abou; Savino, Vito; Saponari, Maria
2013-01-01
The RNA genome of pathogenic and non-pathogenic variants of citrus Hop stunt viroid (HSVd) differ by five to six nucleotides located within the variable (V) domain referred to as the "cachexia expression motif". Sensitive hosts such as mandarin and its hybrids are seriously affected by cachexia disease. Current methods to differentiate HSVd variants rely on lengthy greenhouse biological indexing on Parson's Special mandarin and/or direct nucleotide sequence analysis of amplicons from RT-PCR of HSVd-infected plants. Two independent high throughput assays to segregate HSVd variants by real-time RT-PCR and High-Resolution Melting Temperature (HRM) analysis were developed: one based on EVAGreen dye; the other based on TaqMan probes. Primers for both assays targeted three differentiating nucleotides in the V domain which separated HSVd variants into three clusters by distinct melting temperatures with a confidence level higher than 98%. The accuracy of the HRM assays were validated by nucleotide sequencing of representative samples within each HRM cluster and by testing 45 HSVd-infected field trees from California, Italy, Spain, Syria and Turkey. To our knowledge, this is the first report of a rapid and sensitive approach to detect and differentiate HSVd variants associated with different biological behaviors. Although, HSVd is found in several crops including citrus, cachexia variants are restricted to some citrus-growing areas, particularly the Mediterranean Region. Rapid diagnosis for cachexia and non-cachexia variants is, thus, important for the management of HSVd in citrus and reduces the need for bioindexing and sequencing analysis. Copyright © 2013 Elsevier Ltd. All rights reserved.
Nucleic acid and nucleotide-mediated synthesis of inorganic nanoparticles
NASA Astrophysics Data System (ADS)
Berti, Lorenzo; Burley, Glenn A.
2008-02-01
Since the advent of practical methods for achieving DNA metallization, the use of nucleic acids as templates for the synthesis of inorganic nanoparticles (NPs) has become an active area of study. It is now widely recognized that nucleic acids have the ability to control the growth and morphology of inorganic NPs. These biopolymers are particularly appealing as templating agents as their ease of synthesis in conjunction with the possibility of screening nucleotide composition, sequence and length, provides the means to modulate the physico-chemical properties of the resulting NPs. Several synthetic procedures leading to NPs with interesting photophysical properties as well as studies aimed at rationalizing the mechanism of nucleic acid-templated NP synthesis are now being reported. This progress article will outline the current understanding of the nucleic acid-templated process and provides an up to date reference in this nascent field.
Cytokinin nucleotides contents in sexual buds of Douglas-fir
DOE Office of Scientific and Technical Information (OSTI.GOV)
Imbault, N.; Doumas, P.; Bonnet-Masimbert, N.
1989-04-01
Cytokinin nucleotides were extracted from male and female buds of Pseudotsuga menxiesii by 10 % perchloric acid. They were prepurified on cation exchanger columns (CBA, Amersham) and then separated by two HPLC systems. The first one (Partisil 10 SAX, 10{mu}m, Wathman) separates the mono-, di- and tri-phosphates groups which were collected. The second one (Ultraspher, 5 {mu}m, Beckman) separates the cytokinin nucleotides inside each group. After separation, cytokinin nucleotides were assayed by radioimmunoassay with anti ribosyl zeatin (RZ) and anti isopentenyladenosine (iPA) antibodies. The analysis showed in the monophosphate (mono-P) group one immunoreactant peak in RZ fraction which co-chromatographied withmore » RZ-5{prime}-mono-P and two peaks in the iPA fraction. One of them co-chromatographied with iPA-5{prime}-mono-P. In the diphosphate group, there were three peaks which reacted with anti RZ antibodies and one with anti iPA antibodies. The nucleotides obtained after the first HPLC system, were hydrolysed by a 5{prime}-nucleotidase showed compounds co-chromatographing with RZ and iPA. We did not observe any qualitative differences between the male and female buds. This is the first evidence of cytokinin nucleotides in tissue from woody plants.« less
García-Ruiz, Adriana; Ruiz-López, Felipe de J.; Van Tassell, Curtis P.; Montaldo, Hugo H.; Huson, Heather J.
2015-01-01
The Mexican Holstein (HO) industry has imported Canadian and US (CAN + USA) HO germplasm for use in two different production systems, the conventional (Conv) and the low income (Lowi) system. The objective of this work was to study the genetic composition and differentiation of the Mexican HO cattle, considering the production system in which they perform and their relationship with the Canadian and US HO populations. The analysis included information from 149, 303, and 173 unrelated or with unknown pedigree HO animals from the Conv, Lowi, and CAN + USA populations, respectively. Canadian and US Jersey (JE) and Brown Swiss (BS) genotypes (162 and 86, respectively) were used to determine if Mexican HOs were hybridized with either of these breeds. After quality control filtering, a total of 6,617 out of 6,836 single nucleotide polymorphism markers were used. To describe the genetic diversity across the populations, principal component (PC), admixture composition, and linkage disequilibrium (LD; r2) analyses were performed. Through the PC analysis, HO × JE and HO × BS crossbreeding was detected in the Lowi system. The Conv system appeared to be in between Lowi and CAN + USA populations. Admixture analysis differentiated between the genetic composition of the Conv and Lowi systems, and five ancestry groups associated to sire’s country of origin were identified. The minimum distance between markers to estimate a useful LD was found to be 54.5 kb for the Mexican HO populations. At this average distance, the persistence of phase across autosomes of Conv and Lowi systems was 0.94, for Conv and CAN + USA was 0.92 and for the Lowi and CAN + USA was 0.91. Results supported the flow of germplasm among populations being Conv a source for Lowi, and dependent on migration from CAN + USA. Mexican HO cattle in Conv and Lowi populations share common ancestry with CAN + USA but have different genetic signatures. PMID:25709615
García-Ruiz, Adriana; Ruiz-López, Felipe de J; Van Tassell, Curtis P; Montaldo, Hugo H; Huson, Heather J
2015-01-01
The Mexican Holstein (HO) industry has imported Canadian and US (CAN + USA) HO germplasm for use in two different production systems, the conventional (Conv) and the low income (Lowi) system. The objective of this work was to study the genetic composition and differentiation of the Mexican HO cattle, considering the production system in which they perform and their relationship with the Canadian and US HO populations. The analysis included information from 149, 303, and 173 unrelated or with unknown pedigree HO animals from the Conv, Lowi, and CAN + USA populations, respectively. Canadian and US Jersey (JE) and Brown Swiss (BS) genotypes (162 and 86, respectively) were used to determine if Mexican HOs were hybridized with either of these breeds. After quality control filtering, a total of 6,617 out of 6,836 single nucleotide polymorphism markers were used. To describe the genetic diversity across the populations, principal component (PC), admixture composition, and linkage disequilibrium (LD; r(2) ) analyses were performed. Through the PC analysis, HO × JE and HO × BS crossbreeding was detected in the Lowi system. The Conv system appeared to be in between Lowi and CAN + USA populations. Admixture analysis differentiated between the genetic composition of the Conv and Lowi systems, and five ancestry groups associated to sire's country of origin were identified. The minimum distance between markers to estimate a useful LD was found to be 54.5 kb for the Mexican HO populations. At this average distance, the persistence of phase across autosomes of Conv and Lowi systems was 0.94, for Conv and CAN + USA was 0.92 and for the Lowi and CAN + USA was 0.91. Results supported the flow of germplasm among populations being Conv a source for Lowi, and dependent on migration from CAN + USA. Mexican HO cattle in Conv and Lowi populations share common ancestry with CAN + USA but have different genetic signatures.
Genetic diversity and classification of Tibetan yak populations based on the mtDNA COIII gene.
Song, Q Q; Chai, Z X; Xin, J W; Zhao, S J; Ji, Q M; Zhang, C F; Ma, Z J; Zhong, J C
2015-03-13
To determine the level of genetic diversity and phylogenetic relationships among Tibetan yak populations, the mitochondrial DNA cytochrome c oxidase subunit 3 (COIII) genes of 378 yak individuals from 16 populations were analyzed in this study. The results showed that the length of cytochrome c oxidase subunit 3 gene sequences was 781 bp, with nucleotide frequencies of 29.2, 29.4, 26.1, and 15.2% for T, C, A, and G, respectively. A total of 26 haplotypes were identified, with 69 polymorphic sites, including 11 parsimony-informative sites and 58 single-nucleotide polymorphism sites. No deletions/insertions were found in sequence comparison, indicating that nucleotide mutation types were transitions and transversions. Haplotype and nucleotide diversities were 0.562 and 0.00138, respectively, indicating a high level of genetic diversity in Tibetan yak populations. Phylogenetic relationship analysis indicated that Tibetan yak populations are divided into 2 groups.
Hall, L; Laird, J E; Craig, R K
1984-01-01
Nucleotide sequence analysis of cloned guinea-pig casein B cDNA sequences has identified two casein B variants related to the bovine and rat alpha s1 caseins. Amino acid homology was largely confined to the known bovine or predicted rat phosphorylation sites and within the 'signal' precursor sequence. Comparison of the deduced nucleotide sequence of the guinea-pig and rat alpha s1 casein mRNA species showed greater sequence conservation in the non-coding than in the coding regions, suggesting a functional and possibly regulatory role for the non-coding regions of casein mRNA. The results provide insight into the evolution of the casein genes, and raise questions as to the role of conserved nucleotide sequences within the non-coding regions of mRNA species. Images Fig. 1. PMID:6548375
The possible role of human milk nucleotides as sleep inducers.
Sánchez, Cristina L; Cubero, Javier; Sánchez, Javier; Chanclón, Belén; Rivero, Montserrat; Rodríguez, Ana B; Barriga, Carmen
2009-02-01
Breast-milk contains a potent mixture of diverse components, such as the non-protein nitrogen fraction which includes nucleotides, whose variation in levels is evident throughout lactation. In addition, these substances play an important role in sleep homeostasis. In the present study, human milk samples were analyzed using a capillary electrophoresis system. The rhythmicity of each nucleotide was studied by cosinor analysis. It was found that the nucleotides 5'AMP, 5'GMP, 5'CMP, and 5'IMP have significant (P < 0.05) circadian rhythms, the acrophases of the first two being during the night, and of the latter two during the day. While 5'UMP did not show a clear circadian rhythm, there was an increase in its levels at night. In conclusion, the rise in nocturnal levels of 5'AMP, 5'GMP, and 5'UMP could be involved in inducing the 'hypnotic' action of breast-milk at night in the infant.
He, Shui-Lian; Yang, Yang; Morrell, Peter L; Yi, Ting-Shuang
2015-01-01
Foxtail millet (Setaria italica (L.) Beauv) is one of the earliest domesticated grains, which has been cultivated in northern China by 8,700 years before present (YBP) and across Eurasia by 4,000 YBP. Owing to a small genome and diploid nature, foxtail millet is a tractable model crop for studying functional genomics of millets and bioenergy grasses. In this study, we examined nucleotide sequence diversity, geographic structure, and levels of linkage disequilibrium at four nuclear loci (ADH1, G3PDH, IGS1 and TPI1) in representative samples of 311 landrace accessions across its cultivated range. Higher levels of nucleotide sequence and haplotype diversity were observed in samples from China relative to other sampled regions. Genetic assignment analysis classified the accessions into seven clusters based on nucleotide sequence polymorphisms. Intralocus LD decayed rapidly to half the initial value within ~1.2 kb or less.
Mao, H G; Dong, X Y; Cao, H Y; Xu, N Y; Yin, Z Z
2018-04-01
1. Diacylglycerol acyltransferase (DGAT) plays an important role in the synthesis of triacylglycerol, but its effects on meat quality and carcass composition in pigeons are unclear. In this study, single-nucleotide polymorphisms (SNPs) in the exons of the DGAT2 gene were identified and analysed by using DNA sequencing methods in 200 domestic pigeons (Columba livia). The associations between DGAT2 polymorphisms and carcass and meat quality traits were also analysed. 2. Sequencing results showed that 5 nucleotide mutations were detected in exons 3, 4, 5 and 6 of the DGAT2 gene. The analysis revealed three genotypes (AA, AB and BB) in G18398T and G22484C, in which the AA genotype and A allele had the highest frequency. 3. In the SNP of G18398T located in exon 5, individuals with genotype BB had significantly higher meat quality and lower abdominal fat content than those with AA or AB genotype. In the SNP of G22484C located in exon 6, the genotype AA showed highest carcass trait values, while the genotype BB represented better meat quality, compared to AA and AB genotypes. 4. The results imply that DGAT2 gene has a close relationship with carcass and meat quality traits in pigeons, and the SNPs of G18398T and G22484C can be used as genetic markers for marker-assisted breeding in pigeon.
Yang, Ming Ru; Zhou, Zhi Jun; Chang, Yan Lin; Zhao, Le Hong
2012-08-01
To help determine whether the typical arthropod arrangement was a synapomorphy for the whole Tettigoniidae, we sequenced the mitochondrial genome (mitogenome) of the quiet-calling katydids, Xizicus fascipes (Orthoptera: Tettigoniidae: Meconematinae). The 16,166-bp nucleotide sequences of X. fascipes mitogenome contains the typical gene content, gene order, base composition, and codon usage found in arthropod mitogenomes. As a whole, the X. fascipes mitogenome contains a lower A+T content (70.2%) found in the complete orthopteran mitogenomes determined to date. All protein-coding genes started with a typical ATN codon. Ten of the 13 protein-coding genes have a complete termination codon, but the remaining three genes (COIII, ND5 and ND4) terminate with incomplete T. All tRNAs have the typical clover-leaf structure of mitogenome tRNA, except for tRNA(Ser(AGN)), in which lengthened anticodon stem (9 bp) with a bulged nuleotide in the middle, an unusual T-stem (6 bp in constrast to the normal 5 bp), a mini DHU arm (2 bp) and no connector nucleotides. In the A+T-rich region, two (TA)n conserved blocks that were previously described in Ensifera and two 150-bp tandem repeats plus a partial copy of the composed at 61 bp of the beginning were present. Phylogenetic analysis found: i) the monophyly of Conocephalinae was interrupted by Elimaea cheni from Phaneropterinae; and ii) Meconematinae was the most basal group among these five subfamilies.
Lin, Hao; Deng, En-Ze; Ding, Hui; Chen, Wei; Chou, Kuo-Chen
2014-01-01
The σ54 promoters are unique in prokaryotic genome and responsible for transcripting carbon and nitrogen-related genes. With the avalanche of genome sequences generated in the postgenomic age, it is highly desired to develop automated methods for rapidly and effectively identifying the σ54 promoters. Here, a predictor called ‘iPro54-PseKNC’ was developed. In the predictor, the samples of DNA sequences were formulated by a novel feature vector called ‘pseudo k-tuple nucleotide composition’, which was further optimized by the incremental feature selection procedure. The performance of iPro54-PseKNC was examined by the rigorous jackknife cross-validation tests on a stringent benchmark data set. As a user-friendly web-server, iPro54-PseKNC is freely accessible at http://lin.uestc.edu.cn/server/iPro54-PseKNC. For the convenience of the vast majority of experimental scientists, a step-by-step protocol guide was provided on how to use the web-server to get the desired results without the need to follow the complicated mathematics that were presented in this paper just for its integrity. Meanwhile, we also discovered through an in-depth statistical analysis that the distribution of distances between the transcription start sites and the translation initiation sites were governed by the gamma distribution, which may provide a fundamental physical principle for studying the σ54 promoters. PMID:25361964
2011-01-01
Theileria parasites cause a benign infection of cattle in parts of Australia where they are endemic, but have, in recent years, been suspected of being responsible for a number of outbreaks of disease in cattle near the coast of New South Wales. The objective of this study was to identify and characterize the species of Theileria in cattle on six farms in New South Wales where disease outbreaks have occurred, and compare with Theileria from three disease-free farms in Queensland that is endemic for Theileria. Special reference was made to sub-typing of T. orientalis by type-specific PCR and sequencing of the small subunit (SSU) rRNA gene, and sequence analysis of the gene encoding a polymorphic merozoite/piroplasm surface protein (MPSP) that may be under immune selection. Nucleotide sequencing of SSU rRNA and MPSP genes revealed the presence of four Theileria genotypes: T. orientalis (buffeli), T. orientalis (ikeda), T. orientalis (chitose) and T. orientalis type 4 (MPSP) or type C (SSU rRNA). The majority of animals showed mixed infections while a few showed single infection. When MPSP nucleotide sequences were translated into amino acids, base transition did not change amino acid composition of the protein product, suggesting possible silent polymorphism. The occurrence of ikeda and type 4 (type C) previously not reported to occur and silent mutation is thought to have enhanced parasite evasion of the host immune response causing the outbreak. PMID:21338493
Genetic and phenotypic variability of iris color in Buenos Aires population
Hohl, Diana María; Bezus, Brenda; Ratowiecki, Julia; Catanesi, Cecilia Inés
2018-01-01
Abstract The aim of this work was to describe the phenotypic and genotypic variability related to iris color for the population of Buenos Aires province (Argentina), and to assess the usefulness of current methods of analysis for this country. We studied five Single Nucleotide Polymorphisms (SNPs) included in the IrisPlex kit, in 118 individuals, and we quantified eye color with Digital Iris Analysis Tool. The markers fit Hardy-Weinberg equilibrium for the whole sample, but not for rs12913832 within the group of brown eyes (LR=8.429; p=0.004). We found a remarkable association of HERC2 rs12913832 GG with blue color (p < 0.01) but the other markers did not show any association with iris color. The results for the Buenos Aires population differ from those of other populations of the world for these polymorphisms (p < 0,01). The differences we found might respond to the admixed ethnic composition of Argentina; therefore, methods of analysis used in European populations should be carefully applied when studying the population of Argentina. These findings reaffirm the importance of this investigation in the Argentinian population for people identification based on iris color. PMID:29658972
EBI metagenomics--a new resource for the analysis and archiving of metagenomic data.
Hunter, Sarah; Corbett, Matthew; Denise, Hubert; Fraser, Matthew; Gonzalez-Beltran, Alejandra; Hunter, Christopher; Jones, Philip; Leinonen, Rasko; McAnulla, Craig; Maguire, Eamonn; Maslen, John; Mitchell, Alex; Nuka, Gift; Oisel, Arnaud; Pesseat, Sebastien; Radhakrishnan, Rajesh; Rocca-Serra, Philippe; Scheremetjew, Maxim; Sterk, Peter; Vaughan, Daniel; Cochrane, Guy; Field, Dawn; Sansone, Susanna-Assunta
2014-01-01
Metagenomics is a relatively recently established but rapidly expanding field that uses high-throughput next-generation sequencing technologies to characterize the microbial communities inhabiting different ecosystems (including oceans, lakes, soil, tundra, plants and body sites). Metagenomics brings with it a number of challenges, including the management, analysis, storage and sharing of data. In response to these challenges, we have developed a new metagenomics resource (http://www.ebi.ac.uk/metagenomics/) that allows users to easily submit raw nucleotide reads for functional and taxonomic analysis by a state-of-the-art pipeline, and have them automatically stored (together with descriptive, standards-compliant metadata) in the European Nucleotide Archive.
EBI metagenomics—a new resource for the analysis and archiving of metagenomic data
Hunter, Sarah; Corbett, Matthew; Denise, Hubert; Fraser, Matthew; Gonzalez-Beltran, Alejandra; Hunter, Christopher; Jones, Philip; Leinonen, Rasko; McAnulla, Craig; Maguire, Eamonn; Maslen, John; Mitchell, Alex; Nuka, Gift; Oisel, Arnaud; Pesseat, Sebastien; Radhakrishnan, Rajesh; Rocca-Serra, Philippe; Scheremetjew, Maxim; Sterk, Peter; Vaughan, Daniel; Cochrane, Guy; Field, Dawn; Sansone, Susanna-Assunta
2014-01-01
Metagenomics is a relatively recently established but rapidly expanding field that uses high-throughput next-generation sequencing technologies to characterize the microbial communities inhabiting different ecosystems (including oceans, lakes, soil, tundra, plants and body sites). Metagenomics brings with it a number of challenges, including the management, analysis, storage and sharing of data. In response to these challenges, we have developed a new metagenomics resource (http://www.ebi.ac.uk/metagenomics/) that allows users to easily submit raw nucleotide reads for functional and taxonomic analysis by a state-of-the-art pipeline, and have them automatically stored (together with descriptive, standards-compliant metadata) in the European Nucleotide Archive. PMID:24165880
Scopesi, Fabio; Canini, Silvana; Arioni, Cesare; Mazzella, Massimo; Gazzolo, Diego; Lantieri, Pasquale B; Bonacci, Wanda; Serra, Giovanni
2006-06-01
Recently we demonstrated an increased 2,3-diphosphoglycerate (2,3-DPG) erythrocyte concentration in rat pups subjected to nucleotide-enriched artificial feeding. The present study was carried out to test the hypothesis that a possible increase in 2,3-DPG concentration can also be obtained in human neonates who are fed nucleotide-enriched formula. Preterm neonates born or referred to the neonatal intensive care unit of the G. Gaslini Hospital, Genoa University, with a gestational age >30 weeks and <37 weeks were enrolled in our randomized trial. Recruitment took place within 48-72 hours from birth. Only newborns of mothers deciding not to breast-feed were eligible to be randomized for the supplemented group (FN) or non-supplemented group (RF). Breast-fed newborns were considered the control group (C). The study window (for supplementation and blood samples) was restricted to the first two weeks following birth (from the 2nd (t1) to the 16th (t2) day of life). At the end of our study, only 21 neonates were eligible for statistical analysis. The stimulating action of dietary nucleotides on 2,3-DPG concentration failed to be demonstrated; increases in 2,3-DPG concentration that were observed in newborns fed with nucleotide supplemented formula (FN) were comparable to those observed in newborns fed with regular formula (RF) and breast-fed newborns. The EC recommendation for the amount of nucleotides allowed in formula milk does not seem to be high enough to have positive effects on 2,3-DPG synthesis. Whether this possible 'pharmacological' effect can be achieved by a higher intake of ingested nucleotides and/or a change in the proportions of single nucleotides contained in milk formulas remain interesting end points to be elucidated.
Molee, A.; Kongroi, K.; Kuadsantia, P.; Poompramun, C.; Likitdecharote, B.
2016-01-01
The aim of the present study was to investigate the effect of single nucleotide polymorphisms in the major histocompatibility complex (MHC) class II gene on resistance to Newcastle disease virus and body weight of the Thai indigenous chicken, Leung Hang Khao (Gallus gallus domesticus). Blood samples were collected for single nucleotide polymorphism analysis from 485 chickens. Polymerase chain reaction sequencing was used to classify single nucleotide polymorphisms of class II MHC. Body weights were measured at the ages of 3, 4, 5, and 7 months. Titres of Newcastle disease virus at 2 weeks to 7 months were determined and the correlation between body weight and titre was analysed. The association between single nucleotide polymorphisms and body weight and titre were analysed by a generalized linear model. Seven single nucleotide polymorphisms were identified: C125T, A126T, C209G, C242T, A243T, C244T, and A254T. Significant correlations between log titre and body weight were found at 2 and 4 weeks. Associations between single nucleotide polymorphisms and titre were found for C209G and A254T, and between all single nucleotide polymorphisms (except A243T) and body weight. The results showed that class II MHC is associated with both titre of Newcastle disease virus and body weight in Leung Hang Khao chickens. This is of concern because improved growth traits are the main goal of breeding selection. Moreover, the results suggested that MHC has a pleiotropic effect on the titre and growth performance. This mechanism should be investigated in a future study. PMID:26732325
Schnitzler, P; Delius, H; Scholz, J; Touray, M; Orth, E; Darai, G
1987-12-01
The genome of the fish lymphocystis disease virus (FLDV) was screened for the existence of repetitive DNA sequences using a defined and complete gene library of the viral genome (98 kbp) by DNA-DNA hybridization, heteroduplex analysis, and restriction fine mapping. A repetitive DNA sequence was detected at the coordinates 0.034 to 0.057 and 0.718 to 0.736 map units (m.u.) of the FLDV genome. The first region (0.034 to 0.057 m.u.) corresponds to the 5' terminus of the EcoRI FLDV DNA fragment B (0.034 to 0.165 m.u.) and the second region (0.718 to 0.736 m.u.) is identical to the EcoRI DNA fragment M of the viral genome. The DNA nucleotide sequence of the EcoRI FLDV DNA fragment M was determined. This analysis revealed the presence of many short direct and inverted repetitions, e.g., a 18-mer direct repetition (TTTAAAATTTAATTAA) that started at nucleotide positions 812 and 942 and a 14-mer inverted repeat (TTAAATTTAAATTT) at nucleotide positions 820 and 959. Only short open reading frames were detected within this region. The DNA repetitions are discussed as sequences that play a possible regulatory role for virus replication. Furthermore, hybridization experiments revealed that the repetitive DNA sequences are conserved in the genome of different strains of fish lymphocystis disease virus isolated from two species of Pleuronectidae (flounder and dab).
Wang, Zhaoxi; Claus Henn, Birgit; Wang, Chaolong; Wei, Yongyue; Su, Li; Sun, Ryan; Chen, Han; Wagner, Peter J; Lu, Quan; Lin, Xihong; Wright, Robert; Bellinger, David; Kile, Molly; Mazumdar, Maitreyi; Tellez-Rojo, Martha Maria; Schnaas, Lourdes; Christiani, David C
2017-07-28
Neurodevelopment is a complex process involving both genetic and environmental factors. Prenatal exposure to lead (Pb) has been associated with lower performance on neurodevelopmental tests. Adverse neurodevelopmental outcomes are more frequent and/or more severe when toxic exposures interact with genetic susceptibility. To explore possible loci associated with increased susceptibility to prenatal Pb exposure, we performed a genome-wide gene-environment interaction study (GWIS) in young children from Mexico (n = 390) and Bangladesh (n = 497). Prenatal Pb exposure was estimated by cord blood Pb concentration. Neurodevelopment was assessed using the Bayley Scales of Infant Development. We identified a locus on chromosome 8, containing UNC5D, and demonstrated evidence of its genome-wide significance with mental composite scores (rs9642758, p meta = 4.35 × 10 -6 ). Within this locus, the joint effects of two independent single nucleotide polymorphisms (SNPs, rs9642758 and rs10503970) had a p-value of 4.38 × 10 -9 for mental composite scores. Correlating GWIS results with in vitro transcriptomic profiles identified one common gene, SLC1A5, which is involved in synaptic function, neuronal development, and excitotoxicity. Further analysis revealed interconnected interactions that formed a large network of 52 genes enriched with oxidative stress genes and neurodevelopmental genes. Our findings suggest that certain genetic polymorphisms within/near genes relevant to neurodevelopment might modify the toxic effects of Pb exposure via oxidative stress.
Amorocho, Diego F; Abreu-Grobois, F Alberto; Dutton, Peter H; Reina, Richard D
2012-01-01
Mitochondrial DNA analyses have been useful for resolving maternal lineages and migratory behavior to foraging grounds (FG) in sea turtles. However, little is known about source rookeries and haplotype composition of foraging green turtle aggregations in the southeastern Pacific. We used mitochondrial DNA control region sequences to identify the haplotype composition of 55 green turtles, Chelonia mydas, captured in foraging grounds of Gorgona National Park in the Colombian Pacific. Amplified fragments of the control region (457 bp) revealed the presence of seven haplotypes, with haplotype (h) and nucleotide (π) diversities of h = 0.300±0.080 and π = 0.009±0.005 respectively. The most common haplotype was CMP4 observed in 83% of individuals, followed by CMP22 (5%). The genetic composition of the Gorgona foraging population primarily comprised haplotypes that have been found at eastern Pacific rookeries including Mexico and the Galapagos, as well as haplotypes of unknown stock origin that likely originated from more distant western Pacific rookeries. Mixed stock analysis suggests that the Gorgona FG population is comprised mostly of animals from the Galapagos rookery (80%). Lagrangian drifter data showed that movement of turtles along the eastern Pacific coast and eastward from distant western and central Pacific sites was possible through passive drift. Our results highlight the importance of this protected area for conservation management of green turtles recruited from distant sites along the eastern Pacific Ocean.
Amorocho, Diego F.; Abreu-Grobois, F. Alberto; Dutton, Peter H.; Reina, Richard D.
2012-01-01
Mitochondrial DNA analyses have been useful for resolving maternal lineages and migratory behavior to foraging grounds (FG) in sea turtles. However, little is known about source rookeries and haplotype composition of foraging green turtle aggregations in the southeastern Pacific. We used mitochondrial DNA control region sequences to identify the haplotype composition of 55 green turtles, Chelonia mydas, captured in foraging grounds of Gorgona National Park in the Colombian Pacific. Amplified fragments of the control region (457 bp) revealed the presence of seven haplotypes, with haplotype (h) and nucleotide (π) diversities of h = 0.300±0.080 and π = 0.009±0.005 respectively. The most common haplotype was CMP4 observed in 83% of individuals, followed by CMP22 (5%). The genetic composition of the Gorgona foraging population primarily comprised haplotypes that have been found at eastern Pacific rookeries including Mexico and the Galapagos, as well as haplotypes of unknown stock origin that likely originated from more distant western Pacific rookeries. Mixed stock analysis suggests that the Gorgona FG population is comprised mostly of animals from the Galapagos rookery (80%). Lagrangian drifter data showed that movement of turtles along the eastern Pacific coast and eastward from distant western and central Pacific sites was possible through passive drift. Our results highlight the importance of this protected area for conservation management of green turtles recruited from distant sites along the eastern Pacific Ocean. PMID:22319635
Ibarbalz, Federico M.; Orellana, Esteban; Figuerola, Eva L. M.
2016-01-01
ABSTRACT This study was conducted to investigate whether functions encoded in the metagenome could improve our ability to understand the link between microbial community structures and functions in activated sludge. By analyzing data sets from six industrial and six municipal wastewater treatment plants (WWTPs), covering different configurations, operational conditions, and geographic regions, we found that wastewater influent composition was an overriding factor shaping the metagenomic composition of the activated sludge samples. Community GC content profiles were conserved within treatment plants on a time scale of years and between treatment plants with similar influent wastewater types. Interestingly, GC contents of the represented phyla covaried with the average GC contents of the corresponding WWTP metagenome. This suggests that the factors influencing nucleotide composition act similarly across taxa and thus the variation in nucleotide contents is driven by environmental differences between WWTPs. While taxonomic richness and functional richness were correlated, shotgun metagenomics complemented taxon-based analyses in the task of classifying microbial communities involved in wastewater treatment systems. The observed taxonomic dissimilarity between full-scale WWTPs receiving influent types with varied compositions, as well as the inferred taxonomic and functional assignment of recovered genomes from each metagenome, were consistent with underlying differences in the abundance of distinctive sets of functional categories. These conclusions were robust with respect to plant configuration, operational and environmental conditions, and even differences in laboratory protocols. IMPORTANCE This work contributes to the elucidation of drivers of microbial community assembly in wastewater treatment systems. Our results are significant because they provide clear evidence that bacterial communities in WWTPs assemble mainly according to influent wastewater characteristics. Differences in bacterial community structures between WWTPs were consistent with differences in the abundance of distinctive sets of functional categories, which were related to the metabolic potential that would be expected according to the source of the wastewater. PMID:27316957
Du, Shuhui; Wang, Zhaoshan; Ingvarsson, Pär K; Wang, Dongsheng; Wang, Junhui; Wu, Zhiqiang; Tembrock, Luke R; Zhang, Jianguo
2015-10-01
Historical tectonism and climate oscillations can isolate and contract the geographical distributions of many plant species, and they are even known to trigger species divergence and ultimately speciation. Here, we estimated the nucleotide variation and speciation in three closely related Populus species, Populus tremuloides, P. tremula and P. davidiana, distributed in North America and Eurasia. We analysed the sequence variation in six single-copy nuclear loci and three chloroplast (cpDNA) fragments in 497 individuals sampled from 33 populations of these three species across their geographic distributions. These three Populus species harboured relatively high levels of nucleotide diversity and showed high levels of nucleotide differentiation. Phylogenetic analysis revealed that P. tremuloides diverged earlier than the other two species. The cpDNA haplotype network result clearly illustrated the dispersal route from North America to eastern Asia and then into Europe. Molecular dating results confirmed that the divergence of these three species coincided with the sundering of the Bering land bridge in the late Miocene and a rapid uplift of the Qinghai-Tibetan Plateau around the Miocene/Pliocene boundary. Vicariance-driven successful allopatric speciation resulting from historical tectonism and climate oscillations most likely played roles in the formation of the disjunct distributions and divergence of these three Populus species. © 2015 John Wiley & Sons Ltd.
Liu, Wei-long; Yang, Gui-lin; Wei, Qing; Zhang, Ming-xia; Chen, Xin-chun; Liu, Ying-xia; Gao, Yang; Zhou, Bo-ping
2011-02-01
To investigate the characteristics of molecular epidemiology and molecular evolution of 5 EV 71 (enterovirus 71, EV71) strains from 5 Shenzhen patients with hand-food-mouth disease associated with EV 71 infection. 5 EV 71 strains were isolated, and sequenced to analyzed the full length gene sequences in order to compare nucleotide and amino acid homology with other EV71 strains from other regions and countries as well as previous strains across the world through bioinformatics software. 5 strains of EV 71 belonged to sub-genotype C4 by analysis of nucleotide sequences of VP1 and VP4 of EV 71. The differences of nucleotide and amino acid sequences were much small with nucleotide homology of 93% and amino acid homology of 98% among these 5 strains. A phylogenetic tree analysis indicated that 2008 Shenzhen epidemic strains were the most close to 2004 Shenzhen circulating strains, and also much close to 1998 Shenzhen epidemic strains and 2008 Fuyang Anhui strains. The dead strain was very close to 2008 Fuyang Anhui epidemic strains. It can be speculated that this epidemic strains of EV 71 probably originate from the same ancient strain in the history, may from 1998 Shenzhen strain.
Pre-Steady-State Kinetic Analysis of Single-Nucleotide Incorporation by DNA Polymerases
Su, Yan; Guengerich, F. Peter
2016-01-01
Pre-steady-state kinetic analysis is a powerful and widely used method to obtain multiple kinetic parameters. This protocol provides a step-by-step procedure for pre-steady-state kinetic analysis of single-nucleotide incorporation by a DNA polymerase. It describes the experimental details of DNA substrate annealing, reaction mixture preparation, handling of the RQF-3 rapid quench-flow instrument, denaturing polyacrylamide DNA gel preparation, electrophoresis, quantitation, and data analysis. The core and unique part of this protocol is the rationale for preparation of the reaction mixture (the ratio of the polymerase to the DNA substrate) and methods for conducting pre-steady-state assays on an RQF-3 rapid quench-flow instrument, as well as data interpretation after analysis. In addition, the methods for the DNA substrate annealing and DNA polyacrylamide gel preparation, electrophoresis, quantitation and analysis are suitable for use in other studies. PMID:27248785
Schachner, Anna; Marek, Ana; Grafl, Beatrice; Hess, Michael
2016-04-15
Forty-eight fowl aviadenoviruses (FAdVs) isolated from recent IBH outbreaks across Europe were investigated, by utilizing for the first time the two major adenoviral antigenic domains, hexon loop-1 and fiber, for compound molecular characterization of IBH-associated FAdVs. Successful target gene amplification, following virus isolation in cell culture or from FTA-card samples, demonstrated presence of FAdVs in all cases indicative for IBH. Based on hexon loop-1 analysis, 31 European field isolates exhibited highest nucleotide identity (>97.2%) to reference strains FAdV-2 or -11 representing FAdV-D, while 16 and one European isolates shared >96.0% nucleotide identity with FAdV-8a and -8b, or FAdV-7, the prototype strains representing FAdV-E. These results extend recognition of specific FAdV-D and FAdV-E affiliate genotypes as causative agents of IBH to the European continent. In all isolates, species specificity determined by fiber gene analysis correlated with hexon-based typing. A threshold of 72.0% intraspecies nucleotide identity between fibers from investigated prototype and field strains corresponded with demarcation criteria proposed for hexon, suggesting fiber-based analysis as a complementary tool for molecular FAdV typing. A limited number of strains exhibited inconsistencies between hexon and fiber subclustering, indicating potential constraints for single-gene based typing of those FAdVs. Within FAdV-D, field isolate fibers shared a high degree of nucleotide (>96.7%) and aa (>95.8%) identity, while FAdV-E field isolate fibers displayed greater nucleotide divergence of up to 22.6%, resulting in lower aa identities of >81.7%. Furthermore, comparison with FAdVs from IBH outbreaks outside Europe revealed close genetic relationship in the fiber, independent of the strains' geographic origin. Copyright © 2016 Elsevier B.V. All rights reserved.
Schroeder, Rebekka Y; Zhu, Anting; Eubel, Holger; Dahncke, Kathleen; Witte, Claus-Peter
2018-01-01
Nucleotide catabolism in Arabidopsis thaliana and Saccharomyces cerevisiae leads to the release of ribose, which requires phosphorylation to ribose-5-phosphate mediated by ribokinase (RBSK). We aimed to characterize RBSK in plants and yeast, to quantify the contribution of plant nucleotide catabolism to the ribose pool, and to investigate whether ribose carbon contributes to dark stress survival of plants. We performed a phylogenetic analysis and determined the kinetic constants of plant-expressed Arabidopsis and yeast RBSKs. Using mass spectrometry, several metabolites were quantified in AtRBSK mutants and double mutants with genes of nucleoside catabolism. Additionally, the dark stress performance of several nucleotide metabolism mutants and rbsk was compared. The plant PfkB family of sugar kinases forms nine major clades likely representing distinct biochemical functions, one of them RBSK. Nucleotide catabolism is the dominant ribose source in plant metabolism and is highly induced by dark stress. However, rbsk cannot be discerned from the wild type in dark stress. Interestingly, the accumulation of guanosine in a guanosine deaminase mutant strongly enhances dark stress symptoms. Although nucleotide catabolism contributes to carbon mobilization upon darkness and is the dominant source of ribose, the contribution appears to be of minor importance for dark stress survival. © 2017 The Authors. New Phytologist © 2017 New Phytologist Trust.
UrQt: an efficient software for the Unsupervised Quality trimming of NGS data.
Modolo, Laurent; Lerat, Emmanuelle
2015-04-29
Quality control is a necessary step of any Next Generation Sequencing analysis. Although customary, this step still requires manual interventions to empirically choose tuning parameters according to various quality statistics. Moreover, current quality control procedures that provide a "good quality" data set, are not optimal and discard many informative nucleotides. To address these drawbacks, we present a new quality control method, implemented in UrQt software, for Unsupervised Quality trimming of Next Generation Sequencing reads. Our trimming procedure relies on a well-defined probabilistic framework to detect the best segmentation between two segments of unreliable nucleotides, framing a segment of informative nucleotides. Our software only requires one user-friendly parameter to define the minimal quality threshold (phred score) to consider a nucleotide to be informative, which is independent of both the experiment and the quality of the data. This procedure is implemented in C++ in an efficient and parallelized software with a low memory footprint. We tested the performances of UrQt compared to the best-known trimming programs, on seven RNA and DNA sequencing experiments and demonstrated its optimality in the resulting tradeoff between the number of trimmed nucleotides and the quality objective. By finding the best segmentation to delimit a segment of good quality nucleotides, UrQt greatly increases the number of reads and of nucleotides that can be retained for a given quality objective. UrQt source files, binary executables for different operating systems and documentation are freely available (under the GPLv3) at the following address: https://lbbe.univ-lyon1.fr/-UrQt-.html .
Identification of the initiation site of poliovirus polyprotein synthesis
DOE Office of Scientific and Technical Information (OSTI.GOV)
Dorner, A.J.; Dorner, L.F.; Larsen, G.R.
1982-06-01
The complete nucleotide sequence of poliovirus RNA has a long open reading frame capable of encoding the precursor polyprotein NCVPOO. The first AUG codon in this reading frame is located 743 nucleotides from the 5' end of the RNA and is preceded by eight AUG codons in all three reading frames. Because all proteins that map at the amino terminus of the polyprotein (P1-1a, VPO, and VP4) are blocked at their amino termini and previous studies of ribosome binding have been inconclusive, direct identification of the initiation site of protein synthesis was difficult. We separated and identified all of themore » tryptic peptides of capsid protein VP4 and correlated these peptides with the amino acid sequence predicted to follow the AUG codon at nucleotide 743. Our data indicate that VP4 begins with a blocked glycine that is encoded immediately after the AUG codon at nucleotide 743. An S1 nuclease analysis of poliovirus mRNA failed to reveal a splice in the 5' region. We concluded that synthesis of poliovirus polyprotein is initiated at nucleotide 743, the first AUG codon in the long open reading frame.« less
Nucleotide sequence of a resistance breaking mutant of southern bean mosaic virus.
Lee, L; Anderson, E J
1998-01-01
SBMV-S is a resistance-breaking mutant of an Arkansas isolate of the bean strain of southern bean mosaic virus (SBMV-BARK) that is able to move systemically in Phaseolus vulgaris cvs. Pinto and Great Northern, whereas the wild-type SBMV-BARK causes local necrotic lesions and is restricted to the inoculated leaves of these hosts. Sequence analysis of the 4136 nucleotide genomes of SBMV-BARK and SBMV-S revealed seven nucleotide differences, but only four deduced amino acid changes. A single amino acid change occurred in the C-terminal region of the putative RNA-dependent RNA polymerase and three differences were identified in the N-terminal portion of the virus coat protein. SBMV-BARK and SBMV-S were compared with other sobemoviruses and were found to contain a high level of nucleotide sequence identity (91.3%) to SBMV-B. Unlike SBMV-B however, SBMV-BARK and SBMV-S contained four putative overlapping open reading frames, making them more similar in genome organization to the cowpea strain, SBMV-C. The possibility exists that mutations or even errors, that resulted in mis-identification of open reading frames, occurred in previously published information on nucleotide sequence and genomic organization for SBMV-B.
Sorimachi, Kenji; Okayasu, Teiji; Ohhira, Shuji
2015-04-01
Normalized nucleotide and amino acid contents of complete genome sequences can be visualized as radar charts. The shapes of these charts depict the characteristics of an organism's genome. The normalized values calculated from the genome sequence theoretically exclude experimental errors. Further, because normalization is independent of both target size and kind, this procedure is applicable not only to single genes but also to whole genomes, which consist of a huge number of different genes. In this review, we discuss the applications of the normalization of the nucleotide and predicted amino acid contents of complete genomes to the investigation of genome structure and to evolutionary research from primitive organisms to Homo sapiens. Some of the results could never have been obtained from the analysis of individual nucleotide or amino acid sequences but were revealed only after the normalization of nucleotide and amino acid contents was applied to genome research. The discovery that genome structure was homogeneous was obtained only after normalization methods were applied to the nucleotide or predicted amino acid contents of genome sequences. Normalization procedures are also applicable to evolutionary research. Thus, normalization of the contents of whole genomes is a useful procedure that can help to characterize organisms.
Kovalchuk, Sergey I.; Ziganshin, Rustam H.; Starkov, Vladislav G.; Tsetlin, Victor I.; Utkin, Yuri N.
2016-01-01
Venoms of most Russian viper species are poorly characterized. Here, by quantitative chromato-mass-spectrometry, we analyzed protein and peptide compositions of venoms from four Vipera species (V. kaznakovi, V. renardi, V. orlovi and V. nikolskii) inhabiting different regions of Russia. In all these species, the main components were phospholipases A2, their content ranging from 24% in V. orlovi to 65% in V. nikolskii. Altogether, enzyme content in venom of V. nikolskii reached ~85%. Among the non-enzymatic proteins, the most abundant were disintegrins (14%) in the V. renardi venom, C-type lectin like (12.5%) in V. kaznakovi, cysteine-rich venom proteins (12%) in V. orlovi and venom endothelial growth factors (8%) in V. nikolskii. In total, 210 proteins and 512 endogenous peptides were identified in the four viper venoms. They represented 14 snake venom protein families, most of which were found in the venoms of Vipera snakes previously. However, phospholipase B and nucleotide degrading enzymes were reported here for the first time. Compositions of V. kaznakovi and V. orlovi venoms were described for the first time and showed the greatest similarity among the four venoms studied, which probably reflected close relationship between these species within the “kaznakovi” complex. PMID:27077884
Kovalchuk, Sergey I; Ziganshin, Rustam H; Starkov, Vladislav G; Tsetlin, Victor I; Utkin, Yuri N
2016-04-12
Venoms of most Russian viper species are poorly characterized. Here, by quantitative chromato-mass-spectrometry, we analyzed protein and peptide compositions of venoms from four Vipera species (V. kaznakovi, V. renardi, V. orlovi and V. nikolskii) inhabiting different regions of Russia. In all these species, the main components were phospholipases A₂, their content ranging from 24% in V. orlovi to 65% in V. nikolskii. Altogether, enzyme content in venom of V. nikolskii reached ~85%. Among the non-enzymatic proteins, the most abundant were disintegrins (14%) in the V. renardi venom, C-type lectin like (12.5%) in V. kaznakovi, cysteine-rich venom proteins (12%) in V. orlovi and venom endothelial growth factors (8%) in V. nikolskii. In total, 210 proteins and 512 endogenous peptides were identified in the four viper venoms. They represented 14 snake venom protein families, most of which were found in the venoms of Vipera snakes previously. However, phospholipase B and nucleotide degrading enzymes were reported here for the first time. Compositions of V. kaznakovi and V. orlovi venoms were described for the first time and showed the greatest similarity among the four venoms studied, which probably reflected close relationship between these species within the "kaznakovi" complex.
Karaushu, E. V.; Kravzova, T. R.; Vorobey, N. A.; Kiriziy, D. A.; Olkhovich, O. P.; Taran, N. Yu.; Kots, S. Ya.; Omarova, E.
2015-01-01
Seed inoculation with bacterial consortium was found to increase legume yield, providing a higher growth than the standard nitrogen treatment methods. Alfalfa plants were inoculated by mono- and binary compositions of nitrogen-fixing microorganisms. Their physiological and biochemical properties were estimated. Inoculation by microbial consortium of Sinorhizobium meliloti T17 together with a new cyanobacterial isolate Nostoc PTV was more efficient than the single-rhizobium strain inoculation. This treatment provides an intensification of the processes of biological nitrogen fixation by rhizobia bacteria in the root nodules and an intensification of plant photosynthesis. Inoculation by bacterial consortium stimulates growth of plant mass and rhizogenesis and leads to increased productivity of alfalfa and to improving the amino acid composition of plant leaves. The full nucleotide sequence of the rRNA gene cluster and partial sequence of the dinitrogenase reductase (nifH) gene of Nostoc PTV were deposited to GenBank (JQ259185.1, JQ259186.1). Comparison of these gene sequences of Nostoc PTV with all sequences present at the GenBank shows that this cyanobacterial strain does not have 100% identity with any organisms investigated previously. Phylogenetic analysis showed that this cyanobacterium clustered with high credibility values with Nostoc muscorum. PMID:26114100
Karaushu, E V; Lazebnaya, I V; Kravzova, T R; Vorobey, N A; Lazebny, O E; Kiriziy, D A; Olkhovich, O P; Taran, N Yu; Kots, S Ya; Popova, A A; Omarova, E; Koksharova, O A
2015-01-01
Seed inoculation with bacterial consortium was found to increase legume yield, providing a higher growth than the standard nitrogen treatment methods. Alfalfa plants were inoculated by mono- and binary compositions of nitrogen-fixing microorganisms. Their physiological and biochemical properties were estimated. Inoculation by microbial consortium of Sinorhizobium meliloti T17 together with a new cyanobacterial isolate Nostoc PTV was more efficient than the single-rhizobium strain inoculation. This treatment provides an intensification of the processes of biological nitrogen fixation by rhizobia bacteria in the root nodules and an intensification of plant photosynthesis. Inoculation by bacterial consortium stimulates growth of plant mass and rhizogenesis and leads to increased productivity of alfalfa and to improving the amino acid composition of plant leaves. The full nucleotide sequence of the rRNA gene cluster and partial sequence of the dinitrogenase reductase (nifH) gene of Nostoc PTV were deposited to GenBank (JQ259185.1, JQ259186.1). Comparison of these gene sequences of Nostoc PTV with all sequences present at the GenBank shows that this cyanobacterial strain does not have 100% identity with any organisms investigated previously. Phylogenetic analysis showed that this cyanobacterium clustered with high credibility values with Nostoc muscorum.
Wang, Xiaodan; Ma, Dehong; Huang, Xinwei; Li, Lihua; Li, Duo; Zhao, Yujiao; Qiu, Lijuan; Pan, Yue; Chen, Junying; Xi, Juemin; Shan, Xiyun; Sun, Qiangming
2017-06-15
In the past few decades, dengue has spread rapidly and is an emerging disease in China. An unexpected dengue outbreak occurred in Xishuangbanna, Yunnan, China, resulting in 1331 patients in 2013. In order to obtain the complete genome information and perform mutation and evolutionary analysis of causative agent related to this largest outbreak of dengue fever. The viruses were isolated by cell culture and evaluated by genome sequence analysis. Phylogenetic trees were then constructed by Neighbor-Joining methods (MEGA6.0), followed by analysis of nucleotide mutation and amino acid substitution. The analysis of the diversity of secondary structure for E and NS1 protein were also performed. Then selection pressures acting on the coding sequences were estimated by PAML software. The complete genome sequences of two isolated strains (YNSW1, YNSW2) were 10,710 and 10,702 nucleotides in length, respectively. Phylogenetic analysis revealed both strain were classified as genotype II of DENV-3. The results indicated that both isolated strains of Xishuangbanna in 2013 and Laos 2013 stains (KF816161.1, KF816158.1, LC147061.1, LC147059.1, KF816162.1) were most similar to Bangladesh (AY496873.2) in 2002. After comparing with the DENV-3SS (H87) 62 amino acid substitutions were identified in translated regions, and 38 amino acid substitutions were identified in translated regions compared with DENV-3 genotype II stains Bangladesh (AY496873.2). 27(YNSW1) or 28(YNSW2) single nucleotide changes were observed in structural protein sequences with 7(YNSW1) or 8(YNSW2) non-synonymous mutations compared with AY496873.2. Of them, 4 non-synonymous mutations were identified in E protein sequences with (2 in the β-sheet, 2 in the coil). Meanwhile, 117(YNSW1) or 115 (YNSW2) single nucleotide changes were observed in non-structural protein sequences with 31(YNSW1) or 30 (YNSW2) non-synonymous mutations. Particularly, 14 single nucleotide changes were observed in NS1 sequences with 4/14 non-synonymous substitutions (4 in the coil). Selection pressure analysis revealed no positive selection in the amino acid sites of the genes encoding for structural and non-structural proteins. This study may help understand the intrinsic geographical relatedness of dengue virus 3 and contributes further to research on their infectivity, pathogenicity and vaccine development. Copyright © 2017 Elsevier B.V. All rights reserved.
Lima, L S; Gramacho, K P; Carels, N; Novais, R; Gaiotto, F A; Lopes, U V; Gesteira, A S; Zaidan, H A; Cascardo, J C M; Pires, J L; Micheli, F
2009-07-14
In order to increase the efficiency of cacao tree resistance to witches' broom disease, which is caused by Moniliophthora perniciosa (Tricholomataceae), we looked for molecular markers that could help in the selection of resistant cacao genotypes. Among the different markers useful for developing marker-assisted selection, single nucleotide polymorphisms (SNPs) constitute the most common type of sequence difference between alleles and can be easily detected by in silico analysis from expressed sequence tag libraries. We report the first detection and analysis of SNPs from cacao-M. perniciosa interaction expressed sequence tags, using bioinformatics. Selection based on analysis of these SNPs should be useful for developing cacao varieties resistant to this devastating disease.
Optimization of protein buffer cocktails using Thermofluor.
Reinhard, Linda; Mayerhofer, Hubert; Geerlof, Arie; Mueller-Dieckmann, Jochen; Weiss, Manfred S
2013-02-01
The stability and homogeneity of a protein sample is strongly influenced by the composition of the buffer that the protein is in. A quick and easy approach to identify a buffer composition which increases the stability and possibly the conformational homogeneity of a protein sample is the fluorescence-based thermal-shift assay (Thermofluor). Here, a novel 96-condition screen for Thermofluor experiments is presented which consists of buffer and additive parts. The buffer screen comprises 23 different buffers and the additive screen includes small-molecule additives such as salts and nucleotide analogues. The utilization of small-molecule components which increase the thermal stability of a protein sample frequently results in a protein preparation of higher quality and quantity and ultimately also increases the chances of the protein crystallizing.
Mosaic organization of DNA nucleotides
NASA Technical Reports Server (NTRS)
Peng, C. K.; Buldyrev, S. V.; Havlin, S.; Simons, M.; Stanley, H. E.; Goldberger, A. L.
1994-01-01
Long-range power-law correlations have been reported recently for DNA sequences containing noncoding regions. We address the question of whether such correlations may be a trivial consequence of the known mosaic structure ("patchiness") of DNA. We analyze two classes of controls consisting of patchy nucleotide sequences generated by different algorithms--one without and one with long-range power-law correlations. Although both types of sequences are highly heterogenous, they are quantitatively distinguishable by an alternative fluctuation analysis method that differentiates local patchiness from long-range correlations. Application of this analysis to selected DNA sequences demonstrates that patchiness is not sufficient to account for long-range correlation properties.
Molecular evidence of father-to-child transmission of hepatitis B virus.
Tajiri, Hitoshi; Tanaka, Yasuhito; Kagimoto, Seiiti; Murakami, Jun; Tokuhara, Daisuke; Mizokami, Masashi
2007-07-01
At present in Japan, only high-risk infants born to chronic hepatitis B virus (HBV)-infected mothers are given HBV vaccine. However, children can contract the virus from other HBV-infected family members, including fathers. The aim of this study is to present substantial and unequivocal evidence of father-to-child transmission of HBV infection using techniques including homology analysis and phylogenetic analysis. Thirteen chronic HBV-infected members of five families that included eight children and their respective fathers were enrolled in this study. Homology analysis and phylogenetic analyses of 2 coding region, the S gene and X gene, from the HBV genome were performed comparing the 13 nucleotide sequences from the 13 subjects. The nucleotide homology among the five sets of fathers and children was quite high (99.3-100%). A phylogenetic tree constructed on the 13 nucleotide sequences showed that all 5 sets of fathers and children were grouped into the same cluster with high bootstrap values. These results strongly indicate that father-to-child transmission is an important route of HBV infection in Japan and it is recommend that universal vaccination against HBV infection be instituted immediately in Japan for all children, in accordance with the WHO recommendation of 1997.
Kutryb-Zajac, Barbara; Yuen, Ada H Y; Khalpey, Zain; Zukowska, Paulina; Slominska, Ewa M; Taylor, Patricia M; Goldstein, Steven; Heacox, Albert E; Lavitrano, Marialuisa; Chester, Adrian H; Yacoub, Magdi H; Smolenski, Ryszard T
2016-04-01
Extracellular nucleotide metabolism controls thrombosis and inflammation and may affect degeneration and calcification of aortic valve prostheses. We evaluated the effect of different decellularization strategies on enzyme activities involved in extracellular nucleotide metabolism. Porcine valves were tested intact or decellularized either by detergent treatment or hypotonic lysis and nuclease digestion. The rates of ATP hydrolysis, AMP hydrolysis, and adenosine deamination were estimated by incubation of aorta or valve leaflet sections with substrates followed by HPLC analysis. We demonstrated relatively high activities of ecto-enzymes on porcine valve as compared to the aortic wall. Hypotonic lysis/nuclease digestion preserved >80 % of ATP and AMP hydrolytic activity but reduced adenosine deamination to <10 %. Detergent decellularization completely removed (<5 %) all these activities. These results demonstrate high intensity of extracellular nucleotide metabolism on valve surface and indicate that various valve decellularization techniques differently affect ecto-enzyme activities that could be important in the development of improved valve prostheses.
Simultaneous determination of 5'-monophosphate nucleotides in infant formulas by HPLC-MS.
Ren, Yiping; Zhang, Jingshun; Song, Xiaodan; Chen, Xiaochun; Li, Duo
2011-04-01
A method was developed for simultaneous determination of 5'-monophosphate nucleotides, adenosine 5'-monophosphate, cytidine 5'-monophosphate, guanosine 5'-monophosphate, inosine 5'-monophosphate, and uridine 5'-monophosphate in infant formulas by high-performance liquid chromatography-mass spectrometry equipped with electrospray ionization source. The complete chromatographic separation of five nucleotides was achieved through a Symmetry C(18) column, after a binary gradient elution with water containing 0.1% formic acid and acetonitrile as mobile phase. The multi-reaction monitoring mode was applied for tandem mass spectrometry analysis. The established method was further validated by determining the linearity (R(2) > 0.999), recovery (92.0-105.0%), and precision (relative standard deviation ≤6.97%). To verify the applicability of the method, thirty commercially available infant formulas were randomly purchased from the supermarkets in Hangzhou, China, and then analyzed. The results showed that the developed method is validated, sensitive, and reliable for quantitation of nucleotides in infant formulas.
Mustafa, Saima; Fatima, Hira; Fatima, Sadia; Khosa, Tafheem; Akbar, Atif; Shaikh, Rehan Sadiq; Iqbal, Furhan
2018-01-01
To find out a correlation between the single nucleotide polymorphisms in cluster of differentiation 28 and cluster of differentiation 40 genes with Graves' disease, if any. This case-control study was conducted at the Multan Institute of Nuclear Medicine and Radiotherapy, Multan, Pakistan, and comprised blood samples of Graves' disease patients and controls. Various risk factors were also correlated either with the genotype at each single-nucleotide polymorphism or with various combinations of genotypes studied during present investigation. Of the 160 samples, there were 80(50%) each from patients and controls. Risk factor analysis revealed that gender (p=0.008), marital status (p<0.001), education (p<0.001), smoking (p<0.001), tri-iodothyronine (P <0.001), thyroxin (p<0.001) and thyroid-stimulating hormone (p<0.000) levels in blood were associated with Graves' disease. Both single-nucleotide polymorphisms in both genes were not associated with Graves' disease, either individually or in any combined form.
Correlation approach to identify coding regions in DNA sequences
NASA Technical Reports Server (NTRS)
Ossadnik, S. M.; Buldyrev, S. V.; Goldberger, A. L.; Havlin, S.; Mantegna, R. N.; Peng, C. K.; Simons, M.; Stanley, H. E.
1994-01-01
Recently, it was observed that noncoding regions of DNA sequences possess long-range power-law correlations, whereas coding regions typically display only short-range correlations. We develop an algorithm based on this finding that enables investigators to perform a statistical analysis on long DNA sequences to locate possible coding regions. The algorithm is particularly successful in predicting the location of lengthy coding regions. For example, for the complete genome of yeast chromosome III (315,344 nucleotides), at least 82% of the predictions correspond to putative coding regions; the algorithm correctly identified all coding regions larger than 3000 nucleotides, 92% of coding regions between 2000 and 3000 nucleotides long, and 79% of coding regions between 1000 and 2000 nucleotides. The predictive ability of this new algorithm supports the claim that there is a fundamental difference in the correlation property between coding and noncoding sequences. This algorithm, which is not species-dependent, can be implemented with other techniques for rapidly and accurately locating relatively long coding regions in genomic sequences.
Information Entropy Analysis of the H1N1 Genetic Code
NASA Astrophysics Data System (ADS)
Martwick, Andy
2010-03-01
During the current H1N1 pandemic, viral samples are being obtained from large numbers of infected people world-wide and are being sequenced on the NCBI Influenza Virus Resource Database. The information entropy of the sequences was computed from the probability of occurrence of each nucleotide base at every position of each set of sequences using Shannon's definition of information entropy, [ H=∑bpb,2( 1pb ) ] where H is the observed information entropy at each nucleotide position and pb is the probability of the base pair of the nucleotides A, C, G, U. Information entropy of the current H1N1 pandemic is compared to reference human and swine H1N1 entropy. As expected, the current H1N1 entropy is in a low entropy state and has a very large mutation potential. Using the entropy method in mature genes we can identify low entropy regions of nucleotides that generally correlate to critical protein function.
Zulfiqar, Awais; Zhang, Jie; Cui, Xiaofeng; Qian, Yajuan; Zhou, Xueping; Xie, Yan
2012-01-01
A begomovirus disease complex associated with Vernonia cinerea showing yellow vein symptoms was studied. The full-length genomic DNA was comprised of 2739 nucleotides (nt) and contained the typical genome structure of begomoviruses. Comparison analysis showed that it shared the highest (78.9%) nucleotide sequence identity with recently characterized Vernonia yellow vein virus (VeYVV) from India. For associated satellites, betasatellite showed the highest nucleotide sequence identity (52.1%) with Vernonia yellow vein virus betasatellite (VeYVVB) and alphasatellite shared the highest sequence identity (70.7%) with Gossypium mustelinium symptomless alphasatellite (GMusSLA). It is a member of a distinct species with cognate alpha- and betasatellites for which the name Vernonia yellow vein Fujian virus (VeYVFjV) is proposed.
Durand, Pierre M; Oelofse, Andries J; Coetzer, Theresa L
2006-11-04
The completed genome sequences of the malaria parasites P. falciparum, P. y. yoelii and P. vivax have revealed some unusual features. P. falciparum contains the most AT rich genome sequenced so far--over 90% in some regions. In comparison, P. y. yoelii is approximately 77% and P. vivax is approximately 55% AT rich. The evolutionary reasons for these findings are unknown. Mobile genetic elements have a considerable impact on genome evolution but a thorough investigation of these elements in Plasmodium has not been undertaken. We therefore performed a comprehensive genome analysis of these elements and their derivatives in the three Plasmodium species. Whole genome analysis was performed using bioinformatic methods. Forty potential protein encoding sequences with features of transposable elements were identified in P. vivax, eight in P. y. yoelii and only six in P. falciparum. Further investigation of the six open reading frames in P. falciparum revealed that only one is potentially an active mobile genetic element. Most of the open reading frames identified in all three species are hypothetical proteins. Some represent annotated host proteins such as the putative telomerase reverse transcriptase genes in P. y. yoelii and P. falciparum. One of the P. vivax open reading frames identified in this study demonstrates similarity to telomerase reverse transcriptase and we conclude it to be the orthologue of this gene. There is a divergence in the frequencies of mobile genetic elements in the three Plasmodium species investigated. Despite the limitations of whole genome analytical methods, it is tempting to speculate that mobile genetic elements might have been a driving force behind the compositional bias of the P. falciparum genome.
Unbiased Characterization of Anopheles Mosquito Blood Meals by Targeted High-Throughput Sequencing
Logue, Kyle; Keven, John Bosco; Cannon, Matthew V.; Reimer, Lisa; Siba, Peter; Walker, Edward D.; Zimmerman, Peter A.; Serre, David
2016-01-01
Understanding mosquito host choice is important for assessing vector competence or identifying disease reservoirs. Unfortunately, the availability of an unbiased method for comprehensively evaluating the composition of insect blood meals is very limited, as most current molecular assays only test for the presence of a few pre-selected species. These approaches also have limited ability to identify the presence of multiple mammalian hosts in a single blood meal. Here, we describe a novel high-throughput sequencing method that enables analysis of 96 mosquitoes simultaneously and provides a comprehensive and quantitative perspective on the composition of each blood meal. We validated in silico that universal primers targeting the mammalian mitochondrial 16S ribosomal RNA genes (16S rRNA) should amplify more than 95% of the mammalian 16S rRNA sequences present in the NCBI nucleotide database. We applied this method to 442 female Anopheles punctulatus s. l. mosquitoes collected in Papua New Guinea (PNG). While human (52.9%), dog (15.8%) and pig (29.2%) were the most common hosts identified in our study, we also detected DNA from mice, one marsupial species and two bat species. Our analyses also revealed that 16.3% of the mosquitoes fed on more than one host. Analysis of the human mitochondrial hypervariable region I in 102 human blood meals showed that 5 (4.9%) of the mosquitoes unambiguously fed on more than one person. Overall, analysis of PNG mosquitoes illustrates the potential of this approach to identify unsuspected hosts and characterize mixed blood meals, and shows how this approach can be adapted to evaluate inter-individual variations among human blood meals. Furthermore, this approach can be applied to any disease-transmitting arthropod and can be easily customized to investigate non-mammalian host sources. PMID:26963245
Singh, Vinod Kumar; Krishnamachari, Annangarachari
2016-09-01
Genome-wide experimental studies in Saccharomyces cerevisiae reveal that autonomous replicating sequence (ARS) requires an essential consensus sequence (ACS) for replication activity. Computational studies identified thousands of ACS like patterns in the genome. However, only a few hundreds of these sites act as replicating sites and the rest are considered as dormant or evolving sites. In a bid to understand the sequence makeup of replication sites, a content and context-based analysis was performed on a set of replicating ACS sequences that binds to origin-recognition complex (ORC) denoted as ORC-ACS and non-replicating ACS sequences (nrACS), that are not bound by ORC. In this study, DNA properties such as base composition, correlation, sequence dependent thermodynamic and DNA structural profiles, and their positions have been considered for characterizing ORC-ACS and nrACS. Analysis reveals that ORC-ACS depict marked differences in nucleotide composition and context features in its vicinity compared to nrACS. Interestingly, an A-rich motif was also discovered in ORC-ACS sequences within its nucleosome-free region. Profound changes in the conformational features, such as DNA helical twist, inclination angle and stacking energy between ORC-ACS and nrACS were observed. Distribution of ACS motifs in the non-coding segments points to the locations of ORC-ACS which are found far away from the adjacent gene start position compared to nrACS thereby enabling an accessible environment for ORC-proteins. Our attempt is novel in considering the contextual view of ACS and its flanking region along with nucleosome positioning in the S. cerevisiae genome and may be useful for any computational prediction scheme.
The nucleotide sequence and genome organization of Plasmopara halstedii virus.
Heller-Dohmen, Marion; Göpfert, Jens C; Pfannstiel, Jens; Spring, Otmar
2011-03-17
Only very few viruses of Oomycetes have been studied in detail. Isometric virions were found in different isolates of the oomycete Plasmopara halstedii, the downy mildew pathogen of sunflower. However, complete nucleotide sequences and data on the genome organization were lacking. Viral RNA of different P. halstedii isolates was subjected to nucleotide sequencing and analysis of the viral genome. The N-terminal sequence of the viral coat protein was determined using Top-Down MALDI-TOF analysis. The complete nucleotide sequences of both single-stranded RNA segments (RNA1 and RNA2) were established. RNA1 consisted of 2793 nucleotides (nt) exclusive its 3' poly(A) tract and a single open-reading frame (ORF1) of 2745 nt. ORF1 was framed by a 5' untranslated region (5' UTR) of 18 nt and a 3' untranslated region (3' UTR) of 30 nt. ORF1 contained motifs of RNA-dependent RNA polymerases (RdRp) and showed similarities to RdRp of Scleropthora macrospora virus A (SmV A) and viruses within the Nodaviridae family. RNA2 consisted of 1526 nt exclusive its 3' poly(A) tract and a second ORF (ORF2) of 1128 nt. ORF2 coded for the single viral coat protein (CP) and was framed by a 5' UTR of 164 nt and a 3' UTR of 234 nt. The deduced amino acid sequence of ORF2 was verified by nano-LC-ESI-MS/MS experiments. Top-Down MALDI-TOF analysis revealed the N-terminal sequence of the CP. The N-terminal sequence represented a region within ORF2 suggesting a proteolytic processing of the CP in vivo. The CP showed similarities to CP of SmV A and viruses within the Tombusviridae family. Fragments of RNA1 (ca. 1.9 kb) and RNA2 (ca. 1.4 kb) were used to analyze the nucleotide sequence variation of virions in different P. halstedii isolates. Viral sequence variation was 0.3% or less regardless of their host's pathotypes, the geographical origin and the sensitivity towards the fungicide metalaxyl. The results showed the presence of a single and new virus type in different P. halstedii isolates. Insignificant viral sequence variation indicated that the virus did not account for differences in pathogenicity of the oomycete P. halstedii.
Molecular identification of Trichuris vulpis and Trichuris suis isolated from different hosts.
Cutillas, Cristina; de Rojas, Manuel; Ariza, Concepción; Ubeda, José Manuel; Guevara, Diego
2007-01-01
Trichuris suis was isolated from the cecum of two different hosts (Sus scrofa domestica -- swine and Sus scrofa scrofa -- wild boar) and Trichuris vulpis from dogs in Sevilla, Spain. Genomic DNA was isolated and internal transcribed spacers (ITS)1-5.8S-ITS2 segment from the ribosomal DNA (rDNA) was amplified and sequenced using polymerase chain reaction techniques. The sequence of T. suis from both hosts was 1,396 bp in length while that of T. vulpis was 1,044 bp. ITS1 of both populations isolated of T. suis was 661 nucleotides in length, while the ITS2 was 534 nucleotides in length. Furthermore, the ITS1 of T. vulpis was 410 nucleotides in length, while the ITS2 was 433 nucleotides in length. One hundred fifty-four nucleotides were observed along the 5.8S gene of T. suis and T. vulpis. Intraindividual and intraspecific variations were detected in the rDNA of both species. The presence of microsatellites was observed in all the individuals assayed. Sequence analysis of the ITSs and the 5.8S gene has demonstrated no sequence differences between T. suis isolated from both hosts (S. scrofa domestica -- swine and S. scrofa scrofa -- wild boar). Nevertheless, clear differences were detected between the ITS1 and ITS2 of T. suis and T. vulpis. Furthermore, a comparative molecular analysis between both species and the previously published ITS1-5.8S-ITS2 sequence data of Trichuris ovis, Trichuris leporis, Trichuris muris, Trichuris arvicolae, and Trichuris skrjabini was carried out. A common homology zone was detected in the ITS1 sequence of all species of trichurids.
[SSR loci information analysis in transcriptome of Andrographis paniculata].
Li, Jun-Ren; Chen, Xiu-Zhen; Tang, Xiao-Ting; He, Rui; Zhan, Ruo-Ting
2018-06-01
To study the SSR loci information and develop molecular markers, a total of 43 683 Unigenes in transcriptome of Andrographis paniculata were used to explore SSR. The distribution frequency of SSR and the basic characteristics of repeat motifs were analyzed using MicroSAtellite software, SSR primers were designed by Primer 3.0 software and then validated by PCR. Moreover, the gene function analysis of SSR Unigene was obtained by Blast. The results showed that 14 135 SSR loci were found in the transcriptome of A. paniculata, which distributed in 9 973 Unigenes with a distribution frequency of 32.36%. Di-nucleotide and Tri-nucleotide repeat were the main types, accounted for 75.54% of all SSRs. The repeat motifs of AT/AT and CCG/CGG were the predominant repeat types of Di-nucleotide and Tri-nucleotide, respectively. A total of 4 740 pairs of SSR primers with the potential to produce polymorphism were designed for maker development. Ten pairs of primers in 20 pairs of randomly picked primers produced fragments with expected molecular size. The gene function of Unigenes containing SSR were mostly related to the basic metabolism function of A. paniculata. The SSR markers in transcriptome of A. paniculata show rich type, strong specificity and high potential of polymorphism, which will benefit the candidate gene mining and marker-assisted breeding. Copyright© by the Chinese Pharmaceutical Association.
Feldman, Sanford H; Ntenda, Abraham M
2011-01-01
We used high-fidelity PCR to amplify 2 overlapping regions of the ribosomal gene complex from the rodent fur mite Myobia musculi. The amplicons encompassed a large portion of the mite's ribosomal gene complex spanning 3128 nucleotides containing the entire 18S rRNA, internal transcribed spacer (ITS) 1, 5.8S rRNA, ITS2, and a portion of the 5′-end of the 28S rRNA. M. musculi’s 179-nucleotide 5.8S rRNA nucleotide sequence was not conserved, so this region was identified by conservation of rRNA secondary structure. Maximum likelihood and Bayesian inference phylogenetic analyses were performed by using multiple sequence alignment consisting of 1524 nucleotides of M. musculi 18S rRNA and homologous sequences from 42 prostigmatid mites and the tick Dermacentor andersoni. The phylograms produced by both methods were in agreement regarding terminal, secondary, and some tertiary phylogenetic relationships among mites. Bayesian inference discriminated most infraordinal relationships between Eleutherengona and Parasitengona mites in the suborder Anystina. Basal relationships between suborders Anystina and Eupodina historically determined by comparing differences in anatomic characteristics were less well-supported by our molecular analysis. Our results recapitulated similar 18S rRNA sequence analyses recently reported. Our study supports M. musculi as belonging to the suborder Anystina, infraorder Eleutherenona, and superfamily Cheyletoidea. PMID:22330574
Shirasawa, Kenta; Hirakawa, Hideki; Nunome, Tsukasa; Tabata, Satoshi; Isobe, Sachiko
2016-01-01
Genome-wide mutations induced by ethyl methanesulfonate (EMS) and gamma irradiation in the tomato Micro-Tom genome were identified by a whole-genome shotgun sequencing analysis to estimate the spectrum and distribution of whole-genome DNA mutations and the frequency of deleterious mutations. A total of ~370 Gb of paired-end reads for four EMS-induced mutants and three gamma-ray-irradiated lines as well as a wild-type line were obtained by next-generation sequencing technology. Using bioinformatics analyses, we identified 5920 induced single nucleotide variations and insertion/deletion (indel) mutations. The predominant mutations in the EMS mutants were C/G to T/A transitions, while in the gamma-ray mutants, C/G to T/A transitions, A/T to T/A transversions, A/T to G/C transitions and deletion mutations were equally common. Biases in the base composition flanking mutations differed between the mutagenesis types. Regarding the effects of the mutations on gene function, >90% of the mutations were located in intergenic regions, and only 0.2% were deleterious. In addition, we detected 1,140,687 spontaneous single nucleotide polymorphisms and indel polymorphisms in wild-type Micro-Tom lines. We also found copy number variation, deletions and insertions of chromosomal segments in both the mutant and wild-type lines. The results provide helpful information not only for mutation research, but also for mutant screening methodology with reverse-genetic approaches. © 2015 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.
OmpF, a nucleotide-sensing nanoprobe, computational evaluation of single channel activities
NASA Astrophysics Data System (ADS)
Abdolvahab, R. H.; Mobasheri, H.; Nikouee, A.; Ejtehadi, M. R.
2016-09-01
The results of highthroughput practical single channel experiments should be formulated and validated by signal analysis approaches to increase the recognition precision of translocating molecules. For this purpose, the activities of the single nano-pore forming protein, OmpF, in the presence of nucleotides were recorded in real time by the voltage clamp technique and used as a means for nucleotide recognition. The results were analyzed based on the permutation entropy of current Time Series (TS), fractality, autocorrelation, structure function, spectral density, and peak fraction to recognize each nucleotide, based on its signature effect on the conductance, gating frequency and voltage sensitivity of channel at different concentrations and membrane potentials. The amplitude and frequency of ion current fluctuation increased in the presence of Adenine more than Cytosine and Thymine in milli-molar (0.5 mM) concentrations. The variance of the current TS at various applied voltages showed a non-monotonic trend whose initial increasing slope in the presence of Thymine changed to a decreasing one in the second phase and was different from that of Adenine and Cytosine; e.g., by increasing the voltage from 40 to 140 mV in the 0.5 mM concentration of Adenine or Cytosine, the variance decreased by one third while for the case of Thymine it was doubled. Moreover, according to the structure function of TS, the fractality of current TS differed as a function of varying membrane potentials (pd) and nucleotide concentrations. Accordingly, the calculated permutation entropy of the TS, validated the biophysical approach defined for the recognition of different nucleotides at various concentrations, pd's and polarities. Thus, the promising outcomes of the combined experimental and theoretical methodologies presented here can be implemented as a complementary means in pore-based nucleotide recognition approaches.
MicroRNA Targeting Specificity in Mammals: Determinants Beyond Seed Pairing
Grimson, Andrew; Farh, Kyle Kai-How; Johnston, Wendy K.; Garrett-Engele, Philip; Lim, Lee P.; Bartel, David P.
2013-01-01
Summary Mammalian microRNAs (miRNAs) pair to 3'UTRs of mRNAs to direct their posttranscriptional repression. Important for target recognition are ~7-nt sites that match the seed region of the miRNA. However, these seed matches are not always sufficient for repression, indicating that other characteristics help specify targeting. By combining computational and experimental approaches, we uncovered five general features of site context that boost site efficacy: AU-rich nucleotide composition near the site, proximity to sites for co-expressed miRNAs (which leads to cooperative action), proximity to residues pairing to miRNA nucleotides 13–16, and positioning within the 3'UTR at least 15 nt from the stop codon and away from the center of long UTRs. A model combining these context determinants quantitatively predicts site performance both for exogenously added miRNAs and for endogenous miRNA-message interactions. Because it predicts site efficacy without recourse to evolutionary conservation, the model also identifies effective nonconserved sites and siRNA off-targets. PMID:17612493
Choudhry, Shweta; Baskin, Laurence S; Lammer, Edward J; Witte, John S; Dasgupta, Sudeshna; Ma, Chen; Surampalli, Abhilasha; Shen, Joel; Shaw, Gary M; Carmichael, Suzan L
2015-05-01
Estrogenic endocrine disruptors acting via estrogen receptors α (ESR1) and β (ESR2) have been implicated in the etiology of hypospadias, a common congenital malformation of the male external genitalia. We determined the association of single nucleotide polymorphisms in ESR1 and ESR2 genes with hypospadias in a racially/ethnically diverse study population of California births. We investigated the relationship between hypospadias and 108 ESR1 and 36 ESR2 single nucleotide polymorphisms in 647 cases and 877 population based nonmalformed controls among infants born in selected California counties from 1990 to 2003. Subgroup analyses were performed by race/ethnicity (nonHispanic white and Hispanic subjects) and by hypospadias severity (mild to moderate and severe). Odds ratios for 33 of the 108 ESR1 single nucleotide polymorphisms had p values less than 0.05 (p = 0.05 to 0.007) for risk of hypospadias. However, none of the 36 ESR2 single nucleotide polymorphisms was significantly associated. In stratified analyses the association results were consistent by disease severity but different sets of single nucleotide polymorphisms were significantly associated with hypospadias in nonHispanic white and Hispanic subjects. Due to high linkage disequilibrium across the single nucleotide polymorphisms, haplotype analyses were conducted and identified 6 haplotype blocks in ESR1 gene that had haplotypes significantly associated with an increased risk of hypospadias (OR 1.3 to 1.8, p = 0.04 to 0.00001). Similar to single nucleotide polymorphism analysis, different ESR1 haplotypes were associated with risk of hypospadias in nonHispanic white and Hispanic subjects. No significant haplotype association was observed for ESR2. The data provide evidence that ESR1 single nucleotide polymorphisms and haplotypes influence the risk of hypospadias in white and Hispanic subjects, and warrant further examination in other study populations. Copyright © 2015 American Urological Association Education and Research, Inc. Published by Elsevier Inc. All rights reserved.
Compositional searching of CpG islands in the human genome
NASA Astrophysics Data System (ADS)
Luque-Escamilla, Pedro Luis; Martínez-Aroza, José; Oliver, José L.; Gómez-Lopera, Juan Francisco; Román-Roldán, Ramón
2005-06-01
We report on an entropic edge detector based on the local calculation of the Jensen-Shannon divergence with application to the search for CpG islands. CpG islands are pieces of the genome related to gene expression and cell differentiation, and thus to cancer formation. Searching for these CpG islands is a major task in genetics and bioinformatics. Some algorithms have been proposed in the literature, based on moving statistics in a sliding window, but its size may greatly influence the results. The local use of Jensen-Shannon divergence is a completely different strategy: the nucleotide composition inside the islands is different from that in their environment, so a statistical distance—the Jensen-Shannon divergence—between the composition of two adjacent windows may be used as a measure of their dissimilarity. Sliding this double window over the entire sequence allows us to segment it compositionally. The fusion of those segments into greater ones that satisfy certain identification criteria must be achieved in order to obtain the definitive results. We find that the local use of Jensen-Shannon divergence is very suitable in processing DNA sequences for searching for compositionally different structures such as CpG islands, as compared to other algorithms in literature.
Li, Tang; Chamberlin, Stephen G; Caraco, M Daniel; Liberles, David A; Gaucher, Eric A; Benner, Steven A
2006-01-01
Background The exchange of nucleotides at synonymous sites in a gene encoding a protein is believed to have little impact on the fitness of a host organism. This should be especially true for synonymous transitions, where a pyrimidine nucleotide is replaced by another pyrimidine, or a purine is replaced by another purine. This suggests that transition redundant exchange (TREx) processes at the third position of conserved two-fold codon systems might offer the best approximation for a neutral molecular clock, serving to examine, within coding regions, theories that require neutrality, determine whether transition rate constants differ within genes in a single lineage, and correlate dates of events recorded in genomes with dates in the geological and paleontological records. To date, TREx analysis of the yeast genome has recognized correlated duplications that established a new metabolic strategies in fungi, and supported analyses of functional change in aromatases in pigs. TREx dating has limitations, however. Multiple transitions at synonymous sites may cause equilibration and loss of information. Further, to be useful to correlate events in the genomic record, different genes within a genome must suffer transitions at similar rates. Results A formalism to analyze divergence at two fold redundant codon systems is presented. This formalism exploits two-state approach-to-equilibrium kinetics from chemistry. This formalism captures, in a single equation, the possibility of multiple substitutions at individual sites, avoiding any need to "correct" for these. The formalism also connects specific rate constants for transitions to specific approximations in an underlying evolutionary model, including assumptions that transition rate constants are invariant at different sites, in different genes, in different lineages, and at different times. Therefore, the formalism supports analyses that evaluate these approximations. Transitions at synonymous sites within two-fold redundant coding systems were examined in the mouse, rat, and human genomes. The key metric (f2), the fraction of those sites that holds the same nucleotide, was measured for putative ortholog pairs. A transition redundant exchange (TREx) distance was calculated from f2 for these pairs. Pyrimidine-pyrimidine transitions at these sites occur approximately 14% faster than purine-purine transitions in various lineages. Transition rate constants were similar in different genes within the same lineages; within a set of orthologs, the f2 distribution is only modest overdispersed. No correlation between disparity and overdispersion is observed. In rodents, evidence was found for greater conservation of TREx sites in genes on the X chromosome, accounting for a small part of the overdispersion, however. Conclusion The TREx metric is useful to analyze the history of transition rate constants within these mammals over the past 100 million years. The TREx metric estimates the extent to which silent nucleotide substitutions accumulate in different genes, on different chromosomes, with different compositions, in different lineages, and at different times. PMID:16545144
Kristián, Tibor; Weatherby, Tina M; Bates, Timothy E; Fiskum, Gary
2002-12-01
Calcium overload of neural cell mitochondria plays a key role in excitotoxic and ischemic brain injury. This study tested the hypothesis that brain mitochondria consist of subpopulations with differential sensitivity to calcium-induced inner membrane permeability transition, and that this sensitivity is greatly reduced by physiological levels of adenine nucleotides. Isolated non-synaptosomal rat brain mitochondria were incubated in a potassium-based medium in the absence or presence of ATP or ADP. Measurements were made of medium and intramitochondrial free calcium, light scattering, mitochondrial ultrastructure, and the elemental composition of electron-opaque deposits within mitochondria treated with calcium. In the absence of adenine nucleotides, calcium induced a partial decrease in light scattering, accompanied by three distinct ultrastructural morphologies, including large-amplitude swelling, matrix vacuolization and a normal appearance. In the presence of ATP or ADP the mitochondrial calcium uptake capacity was greatly enhanced and calcium induced an increase rather than a decrease in mitochondrial light scattering. Approximately 10% of the mitochondria appeared damaged and the rest contained electron-dense precipitates that contained calcium, as determined by electron-energy loss spectroscopy. These results indicate that brain mitochondria are heterogeneous in their response to calcium. In the absence of adenine nucleotides, approximately 20% of the mitochondrial population exhibit morphological alterations consistent with activation of the permeability transition, but less than 10% exhibit evidence of osmotic swelling and membrane disruption in the presence of ATP or ADP.
Stoddard, Colby D.; Widmann, Jeremy; Trausch, Jeremiah J.; Marcano-Velázquez, Joan G.; Knight, Rob; Batey, Robert T.
2013-01-01
Direct sensing of intracellular metabolite concentrations by riboswitch RNAs provides an economical and rapid means to maintain metabolic homeostasis. Since many organisms employ the same class of riboswitch to control different genes or transcription units, it is likely that functional variation exists in riboswitches such that activity is tuned to meet cellular needs. Using a bioinformatic approach, we have identified a region of the purine riboswitch aptamer domain that displays conservation patterns linked to riboswitch activity. Aptamer domain compositions within this region can be divided into nine classes that display a spectrum of activities. Naturally occurring compositions in this region favor rapid association rate constants and slow dissociation rate constants for ligand binding. Using X-ray crystallography and chemical probing, we demonstrate that both the free and bound states are influenced by the composition of this region and that modest sequence alterations have a dramatic impact on activity. The introduction of non-natural compositions result in the inability to regulate gene expression in vivo, suggesting that aptamer domain activity is highly plastic and thus readily tunable to meet cellular needs. PMID:23485418
In-Depth Analysis of HA and NS1 Genes in A(H1N1)pdm09 Infected Patients.
Caglioti, Claudia; Selleri, Marina; Rozera, Gabriella; Giombini, Emanuela; Zaccaro, Paola; Valli, Maria Beatrice; Capobianchi, Maria Rosaria
2016-01-01
In March/April 2009, a new pandemic influenza A virus (A(H1N1)pdm09) emerged and spread rapidly via human-to-human transmission, giving rise to the first pandemic of the 21th century. Influenza virus may be present in the infected host as a mixture of variants, referred to as quasi-species, on which natural and immune-driven selection operates. Since hemagglutinin (HA) and non-structural 1 (NS1) proteins are relevant in respect of adaptive and innate immune responses, the present study was aimed at establishing the intra-host genetic heterogeneity of HA and NS1 genes, applying ultra-deep pyrosequencing (UDPS) to nasopharyngeal swabs (NPS) from patients with confirmed influenza A(H1N1)pdm09 infection. The intra-patient nucleotide diversity of HA was significantly higher than that of NS1 (median (IQR): 37.9 (32.8-42.3) X 10-4 vs 30.6 (27.4-33.6) X 10-4 substitutions/site, p = 0.024); no significant correlation for nucleotide diversity of NS1 and HA was observed (r = 0.319, p = 0.29). Furthermore, a strong inverse correlation between nucleotide diversity of NS1 and viral load was observed (r = - 0.74, p = 0.004). For both HA and NS1, the variants appeared scattered along the genes, thus indicating no privileged mutation site. Known polymorphisms, S203T (HA) and I123V (NS1), were observed as dominant variants (>98%) in almost all patients; three HA and two NS1 further variants were observed at frequency >40%; a number of additional variants were detected at frequency <6% (minority variants), of which three HA and four NS1 variants were novel. In few patients multiple variants were observed at HA residues 203 and 222. According to the FLUSURVER tool, some of these variants may affect immune recognition and host range; however, these inferences are based on H5N1, and their extension to A(H1N1)pdm09 requires caution. More studies are necessary to address the significance of the composite nature of influenza virus quasi-species within infected patients.
Structural basis for modulation and agonist specificity of HCN pacemaker channels.
Zagotta, William N; Olivier, Nelson B; Black, Kevin D; Young, Edgar C; Olson, Rich; Gouaux, Eric
2003-09-11
The family of hyperpolarization-activated, cyclic nucleotide-modulated (HCN) channels are crucial for a range of electrical signalling, including cardiac and neuronal pacemaker activity, setting resting membrane electrical properties and dendritic integration. These nonselective cation channels, underlying the I(f), I(h) and I(q) currents of heart and nerve cells, are activated by membrane hyperpolarization and modulated by the binding of cyclic nucleotides such as cAMP and cGMP. The cAMP-mediated enhancement of channel activity is largely responsible for the increase in heart rate caused by beta-adrenergic agonists. Here we have investigated the mechanism underlying this modulation by studying a carboxy-terminal fragment of HCN2 containing the cyclic nucleotide-binding domain (CNBD) and the C-linker region that connects the CNBD to the pore. X-ray crystallographic structures of this C-terminal fragment bound to cAMP or cGMP, together with equilibrium sedimentation analysis, identify a tetramerization domain and the mechanism for cyclic nucleotide specificity, and suggest a model for ligand-dependent channel modulation. On the basis of amino acid sequence similarity to HCN channels, the cyclic nucleotide-gated, and eag- and KAT1-related families of channels are probably related to HCN channels in structure and mechanism.
Glutamine 89 is a key residue in the allosteric modulation of human serine racemase activity by ATP.
Canosa, Andrea V; Faggiano, Serena; Marchetti, Marialaura; Armao, Stefano; Bettati, Stefano; Bruno, Stefano; Percudani, Riccardo; Campanini, Barbara; Mozzarelli, Andrea
2018-06-13
Serine racemase (SR) catalyses two reactions: the reversible racemisation of L-serine and the irreversible dehydration of L- and D-serine to pyruvate and ammonia. SRs are evolutionarily related to serine dehydratases (SDH) and degradative threonine deaminases (TdcB). Most SRs and TdcBs - but not SDHs - are regulated by nucleotides. SR binds ATP cooperatively and the nucleotide allosterically stimulates the serine dehydratase activity of the enzyme. A H-bond network comprising five residues (T52, N86, Q89, E283 and N316) and water molecules connects the active site with the ATP-binding site. Conservation analysis points to Q89 as a key residue for the allosteric communication, since its mutation to either Met or Ala is linked to the loss of control of activity by nucleotides. We verified this hypothesis by introducing the Q89M and Q89A point mutations in the human SR sequence. The allosteric communication between the active site and the allosteric site in both mutants is almost completely abolished. Indeed, the stimulation of the dehydratase activity by ATP is severely diminished and the binding of the nucleotide is no more cooperative. Ancestral state reconstruction suggests that the allosteric control by nucleotides established early in SR evolution and has been maintained in most eukaryotic lineages.
Barkan, A; Mertz, J E
1981-02-01
The nucleotide sequences of 10 viable yet partially defective deletion mutants of simian virus 40 were determined. The deletions mapped within, and, in many cases, 5' to, the predominant leader sequence of the late viral mRNA's. They ranged from 74 to 187 nucleotide pairs in length. Six of the mutants had lost the sequence that corresponds to the "cap" site (5' terminus) of the most abundant class of 16S mRNA's. One of these mutants had a deletion that extended 103 nucleotide pairs into the region preceding this primary cap site and, therefore, was missing many secondary cap sites as well. A seventh mutant lacked the entire major 16S leader sequence except for the first six nucleotides at its 5' end and the last nine at its 3' end. Although these mutants differed in the size and position of their deletions, we were unable to discover any simple correlations between their growth characteristics and their DNA sequences. This finding indicates that the secondary structures of the RNA transcripts may play a more important role than the exact nucleotide sequence of the RNAs in determining how they function within the cell.
Yi, Ping; Chen, Zhuqin; Zhao, Yan; Guo, Jianxin; Fu, Huabin; Zhou, Yuanguo; Yu, Lili; Li, Li
2009-03-01
The discovery of fetal DNA in maternal plasma has opened up an approach for noninvasive diagnosis. We have now assessed the possibility of detecting single-nucleotide differences between fetal and maternal DNA in maternal plasma by polymerase chain reaction (PCR)/ligase detection reaction((LDR)/capillary electrophoresis. PCR/LDR/capillary electrophoresis was applied to detect the genotype of c.454-397T>gene (ESR1) from experimental DNA models of maternal plasma at different sensitivity levels and 13 maternal plasma samples.alphaC in estrogen receptor. (1) Our results demonstrated that the technique could discriminate low abundance single-nucleotide mutation with a mutant/normal allele ratio up to 1:10 000. (2) Examination of ESR1 c.454-397T>C genotypes by using the method of restriction fragment length analysis was performed in 25 pregnant women, of whom 13 pregnant women had homozygous genotypes. The c.454-397T>C genotypes of paternally inherited fetal DNA in maternal plasma of these 13 women were detected by PCR/LDR/capillary electrophoresis, which were accordant with the results of umbilical cord blood. PCR/LDR/capillary electrophoresis has very high sensitivity to distinguish low abundance single nucleotide differences and can discriminate point mutations and single-nucleotide polymorphisms(SNPs) of paternally inherited fetal DNA in maternal plasma.
Correa-Rodríguez, María; Schmidt-RioValle, Jacqueline; González-Jiménez, Emilio; Rueda-Medina, Blanca
2017-06-01
Obesity is considered an increasingly serious health problem determined by multiple genetic and environmental factors. Estrogens have been found to play a major role in body weight and adiposity regulation through estrogen receptor 1 ( ESR1). The aim of this study was to determine whether genotype and haplotype frequencies of ESR1 polymorphisms are associated with body composition measures in a population of 572 young adults. A lack of significant association between genotypes of ESR1 gene polymorphisms and obesity phenotypes was seen after adjustment for confounding factors. Linkage disequilibrium (LD) analysis identified a single LD block for the ESR1 gene including PvuII and XbaI single-nucleotide polymorphisms (SNPs) (pairwise r 2 = .66). None of the haplotypes identified revealed statistically significant associations with any of the obesity phenotypes. Our results suggest that polymorphisms of the ESR1 gene do not contribute significantly to the genetic risk for obesity phenotypes in a population of young Caucasian adults.
Zhao, Xiu-Ju; Chen, Yu-Lian; Fu, Bing; Zhang, Wen; Liu, Zhiguo; Zhuo, Hexian
2017-03-01
Understanding the metabolic and transcription basis of pumpkin seed oil (PSO) intervention on metabolic disease (MD) is essential to daily nutrition and health. This study analyzed the liver metabolic variations of Wistar rats fed normal diet (CON), high-fat diet (HFD) and high-fat plus PSO diet (PSO) to establish the relationship between the liver metabolite composition/transcript profile and the effects of PSO on MD. By using proton nuclear magnetic resonance spectroscopy together with multivariate data analysis, it was found that, compared with CON rats, HFD rats showed clear dysfunctions of choline metabolism, glucose metabolism and nucleotide and amino acid metabolism. Using quantitative real-time polymerase chain reaction (qPCR), it was found that, compared with HFD rats, PSO rats showed alleviated endoplasmic reticulum stress accompanied by lowered unfolded protein response. These findings provide useful information to understand the metabolic alterations triggered by MD and to evaluate the effects of PSO intervention. © 2016 Society of Chemical Industry. © 2016 Society of Chemical Industry.
Newborn Urinary Metabolic Signatures of Prematurity and Other Disorders: A Case Control Study.
Diaz, Sílvia O; Pinto, Joana; Barros, António S; Morais, Elisabete; Duarte, Daniela; Negrão, Fátima; Pita, Cristina; Almeida, Maria do Céu; Carreira, Isabel M; Spraul, Manfred; Gil, Ana M
2016-01-04
This work assesses the urinary metabolite signature of prematurity in newborns by nuclear magnetic resonance (NMR) spectroscopy, while establishing the role of possible confounders and signature specificity, through comparison to other disorders. Gender and delivery mode are shown to impact importantly on newborn urine composition, their analysis pointing out at specific metabolite variations requiring consideration in unmatched subject groups. Premature newborns are, however, characterized by a stronger signature of varying metabolites, suggestive of disturbances in nucleotide metabolism, lung surfactants biosynthesis and renal function, along with enhancement of tricarboxylic acid (TCA) cycle activity, fatty acids oxidation, and oxidative stress. Comparison with other abnormal conditions (respiratory depression episode, large for gestational age, malformations, jaundice and premature rupture of membranes) reveals that such signature seems to be largely specific of preterm newborns, showing that NMR metabolomics can retrieve particular disorder effects, as well as general stress effects. These results provide valuable novel information on the metabolic impact of prematurity, contributing to the better understanding of its effects on the newborn's state of health.
Sequence Analysis of Mitochondrial Genome of Toxascaris leonina from a South China Tiger.
Li, Kangxin; Yang, Fang; Abdullahi, A Y; Song, Meiran; Shi, Xianli; Wang, Minwei; Fu, Yeqi; Pan, Weida; Shan, Fang; Chen, Wu; Li, Guoqing
2016-12-01
Toxascaris leonina is a common parasitic nematode of wild mammals and has significant impacts on the protection of rare wild animals. To analyze population genetic characteristics of T. leonina from South China tiger, its mitochondrial (mt) genome was sequenced. Its complete circular mt genome was 14,277 bp in length, including 12 protein-coding genes, 22 tRNA genes, 2 rRNA genes, and 2 non-coding regions. The nucleotide composition was biased toward A and T. The most common start codon and stop codon were TTG and TAG, and 4 genes ended with an incomplete stop codon. There were 13 intergenic regions ranging 1 to 10 bp in size. Phylogenetically, T. leonina from a South China tiger was close to canine T. leonina . This study reports for the first time a complete mt genome sequence of T. leonina from the South China tiger, and provides a scientific basis for studying the genetic diversity of nematodes between different hosts.
[Genetic Structure of Urban Population of the Common Hamster (Cricetus cricetus)].
Feoktistova, N Yu; Meschersky, I G; Surov, A V; Bogomolov, P L; Tovpinetz, N N; Poplavskaya, N S
2016-02-01
Over the past half-century, the common hamster (Cricetus cricetus), along with range-wide decline of natural populations, has actively populated the cities. The study of the genetic structure of urban populations of common hamster may shed light on features of the habitation of this species in urban landscapes. This article is focused on the genetic structure of common hamster populations in Simferopol (Crimea), one of the largest known urban populations of this species. On the basis of the analysis of nucleotide sequences of the cytochrome b gene and mtDNA control region, and the allelic composition of ten microsatellite loci of nDNA, we revealed that, despite the fact that some individuals can move throughout the city at considerable distances, the entire population of the city is represented by separate demes confined to different areas. These demes are characterized by a high degree of the genetic isolation and reduced genetic diversity compared to that found for the city as a whole.
Suzuki, Karen M; Arias, Maria C; Giangarelli, Douglas C; Freiria, Gabriele A; Sofia, Silvia H
2010-04-01
Euglossa fimbriata is a euglossine species widely distributed in Brazil and occurring primarily in Atlantic Forest remnants. In this study, the genetic mitochondrial structure of E. fimbriata from six Atlantic Forest fragments was studied by RFLP analysis of three PCR-amplified mtDNA gene segments (16S, COI-COII, and cyt b). Ten composite haplotypes were identified, six of which were exclusive and represented singleton mitotypes. Low haplotype diversity (0.085-0.289) and nucleotide diversity (0.000-0.002) were detected within samples. AMOVA partitioned 91.13% of the overall genetic variation within samples and 8.87% (phi(st) = 0.089; P < 0.05) among samples. Pairwise comparisons indicated high levels of differentiation among some pairs of samples (phi(st) = 0.161-0.218; P < 0.05). These high levels indicate that these populations of E. fimbriata, despite their highly fragmented landscape, apparently have not suffered loss of genetic variation, suggesting that this particular population is not currently endangered.
C/N Ratio Drives Soil Actinobacterial Cellobiohydrolase Gene Diversity
Prendergast-Miller, Miranda T.; Poonpatana, Pabhon; Farrell, Mark; Bissett, Andrew; Macdonald, Lynne M.; Toscas, Peter; Richardson, Alan E.; Thrall, Peter H.
2015-01-01
Cellulose accounts for approximately half of photosynthesis-fixed carbon; however, the ecology of its degradation in soil is still relatively poorly understood. The role of actinobacteria in cellulose degradation has not been extensively investigated despite their abundance in soil and known cellulose degradation capability. Here, the diversity and abundance of the actinobacterial glycoside hydrolase family 48 (cellobiohydrolase) gene in soils from three paired pasture-woodland sites were determined by using terminal restriction fragment length polymorphism (T-RFLP) analysis and clone libraries with gene-specific primers. For comparison, the diversity and abundance of general bacteria and fungi were also assessed. Phylogenetic analysis of the nucleotide sequences of 80 clones revealed significant new diversity of actinobacterial GH48 genes, and analysis of translated protein sequences showed that these enzymes are likely to represent functional cellobiohydrolases. The soil C/N ratio was the primary environmental driver of GH48 community compositions across sites and land uses, demonstrating the importance of substrate quality in their ecology. Furthermore, mid-infrared (MIR) spectrometry-predicted humic organic carbon was distinctly more important to GH48 diversity than to total bacterial and fungal diversity. This suggests a link between the actinobacterial GH48 community and soil organic carbon dynamics and highlights the potential importance of actinobacteria in the terrestrial carbon cycle. PMID:25710367
Tsuchida, Shuichi; Kagi, Akiko; Koyama, Hidekazu; Tagawa, Masahiro
2007-12-01
Xanthine urolithiasis was found in a 4-year-old spayed female Himalayan cat with a 10-month history of intermittent haematuria and dysuria. Ultrasonographs indicated the existence of several calculi in the bladder that were undetectable by survey radiographic examination. Four bladder stones were removed by cystotomy. The stones were spherical brownish-yellow and their surface was smooth and glossy. Quantitative mineral analysis showed a representative urolith to be composed of more than 95% xanthine. Ultrasonographic examination of the bladder 4.5 months postoperatively indicated the recurrence of urolithiasis. Analysis of purine concentration in urine and blood showed that the cat excreted excessive amounts of xanthine. In order to test the hypothesis that xanthinuria was caused by a homozygote of the inherited mutant allele of a gene responsible for deficiency of enzyme activity in purine degradation pathway, the allele composition of xanthine dehydrogenase (XDH) gene (one of the candidate genes for hereditary xanthinuria) was evaluated. The cat with xanthinuria was a heterozygote of the polymorphism. A single nucleotide polymorphism analysis of the cat XDH gene strongly indicated that the XDH gene of the patient cat was composed of two kinds of alleles and ruled out the hypothesis that the cat inherited the same recessive XDH allele suggesting no activity from a single ancestor.
Gui, Linsheng; Hong, Jieyun; Raza, Sayed Haidar Abbas; Zan, Linsen
2017-04-01
Sirtuin 3 (SIRT3) is a mitochondrial nicotinamide adenine dinucleotide (NAD)-dependent deacetylase. It has crucial roles in regulating the respiratory chain, in adenosine triphosphate (ATP) production, and in both the citric acid and urea cycles. The aim of this study was to investigate whether SIRT3 could be used as a candidate gene in the breeding of cattle. Expression analysis by quantitative real-time polymerase chain reactions (qPCR) indicated that expression levels of SIRT3 were highest in the kidney, rumen, liver, omasum and muscle. Using sequencing technology on a total of 913 cattle representing three indigenous Chinese beef cattle breeds, three single nucleotide polymorphisms (SNPs) were identified in the promoter region of SIRT3, and five haplotypes representing five potential transcription factor compositions of polymorphic potential cis-acting elements. Association analysis indicated that the Hap3/8 diplotype performed better than other combinations in intramuscular fat content. In addition, the promoter activity with Hap1 haplotype was higher than the Hap8 haplotype, consistent with the association analysis. The results indicate that the polymorphisms in transcription factor binding sites of SIRT3 promoter may affect the transcriptional activity of SIRT3, and thus alter intramuscular fat content in beef cattle. Copyright © 2016 Elsevier Ltd. All rights reserved.
Dai, Ronghua; Fang, Yu; Zhao, Wenjing; Liu, Shuyun; Ding, Jinmei; Xu, Ke; Yang, Lingyu; He, Chuan; Ding, Fangmei; Meng, He
2016-08-01
The study reported in this Regional Research Communication aimed to analyse the genetic polymorphisms of β-casein in Chinese Holstein cows. β-casein has received considerable research interest in the dairy industry and animal breeding in recent years as a source not only of high quality protein, but also of bioactive peptides that may be linked to health effects. Morever, the polymorphic nature of β-casein and its association with milk production traits, composition, and quality also attracted several efforts in evaluating the allelic distribution of β-casein locus as a potential dairy trait marker. However, few data on beta-casein variants are available for the Chinese Holstein cow. In the present paper, one hundred and thirty three Holstein cows were included in the analysis. Results revealed the presence of 5 variants (A1, A2, A3, B and I), preponderance of the genotype A1A2 (0·353) and superiorities of A1/A2 alleles (0·432 and 0·459, respectively) in the population. Sequence analysis of β-casein gene in the cows showed four nucleotide changes in exon 7. Our study can provide reference and guidance for selection for superior milk for industrial applications and crossbreeding and genetic improvement programmes.
Parrish, R Ryley; Day, Jeremy J; Lubin, Farah D
2012-07-01
DNA methylation is an epigenetic modification that is essential for the development and mature function of the central nervous system. Due to the relevance of this modification to the transcriptional control of gene expression, it is often necessary to examine changes in DNA methylation patterns with both gene and single-nucleotide resolution. Here, we describe an in-depth basic protocol for direct bisulfite sequencing of DNA isolated from brain tissue, which will permit direct assessment of methylation status at individual genes as well as individual cytosine molecules/nucleotides within a genomic region. This method yields analysis of DNA methylation patterns that is robust, accurate, and reproducible, thereby allowing insights into the role of alterations in DNA methylation in brain tissue.
Hostnik, Peter; Picard-Meyer, Evelyne; Rihtarič, Danijela; Toplak, Ivan; Cliquet, Florence
2014-04-01
Oral vaccination campaigns to eliminate fox rabies were initiated in Slovenia in 1995. In May 2012, a young fox (Vulpes vulpes) with typical rabies signs was captured. Its brain and salivary gland tissues were found to contain vaccine strain SAD B19. The Basic Logical Alignment Search Tool alignment of 589 nucleotides determined from the N gene of the virus isolated from the brain and salivary glands of the affected fox was 100% identical to the GenBank reference SAD B19 strain. Sequence analysis of the N and M genes (4,351 nucleotides) showed two nucleotide modifications at position 1335 (N gene) and 3114 (M gene) in the KC522613 isolate identified in the fox compared to SAD B19.
Single nucleotide polymorphism analysis using different colored dye dimer probes
NASA Astrophysics Data System (ADS)
Marmé, Nicole; Friedrich, Achim; Denapaite, Dalia; Hakenbeck, Regine; Knemeyer, Jens-Peter
2006-09-01
Fluorescence quenching by dye dimer formation has been utilized to develop hairpin-structured DNA probes for the detection of a single nucleotide polymorphism (SNP) in the penicillin target gene pbp2x, which is implicated in the penicillin resistance of Streptococcus pneumoniae. We designed two specific DNA probes for the identification of the pbp2x genes from a penicillin susceptible strain R6 and a resistant strain Streptococcus mitis 661 using green-fluorescent tetramethylrhodamine (TMR) and red-fluorescent DY-636, respectively. Hybridization of each of the probes to its respective target DNA sequence opened the DNA hairpin probes, consequently breaking the nonfluorescent dye dimers into fluorescent species. This hybridization of the target with the hairpin probe achieved single nucleotide specific detection at nanomolar concentrations via increased fluorescence.
Schürch, A C; Arredondo-Alonso, S; Willems, R J L; Goering, R V
2018-04-01
Whole genome sequence (WGS)-based strain typing finds increasing use in the epidemiologic analysis of bacterial pathogens in both public health as well as more localized infection control settings. This minireview describes methodologic approaches that have been explored for WGS-based epidemiologic analysis and considers the challenges and pitfalls of data interpretation. Personal collection of relevant publications. When applying WGS to study the molecular epidemiology of bacterial pathogens, genomic variability between strains is translated into measures of distance by determining single nucleotide polymorphisms in core genome alignments or by indexing allelic variation in hundreds to thousands of core genes, assigning types to unique allelic profiles. Interpreting isolate relatedness from these distances is highly organism specific, and attempts to establish species-specific cutoffs are unlikely to be generally applicable. In cases where single nucleotide polymorphism or core gene typing do not provide the resolution necessary for accurate assessment of the epidemiology of bacterial pathogens, inclusion of accessory gene or plasmid sequences may provide the additional required discrimination. As with all epidemiologic analysis, realizing the full potential of the revolutionary advances in WGS-based approaches requires understanding and dealing with issues related to the fundamental steps of data generation and interpretation. Copyright © 2018 The Authors. Published by Elsevier Ltd.. All rights reserved.
Phylogenetic analysis of the envelope protein (domain lll) of dengue 4 viruses
Mota, Javier; Ramos-Castañeda, José; Rico-Hesse, Rebeca; Ramos, Celso
2011-01-01
Objective To evaluate the genetic variability of domain III of envelope (E) protein and to estimate phylogenetic relationships of dengue 4 (Den-4) viruses isolated in Mexico and from other endemic areas of the world. Material and Methods A phylogenetic study of domain III of envelope (E) protein of Den-4 viruses was conducted in 1998 using virus strains from Mexico and other parts of the world, isolated in different years. Specific primers were used to amplify by RT-PCR the domain III and to obtain nucleotide sequence. Based on nucleotide and deduced aminoacid sequence, genetic variability was estimated and a phylogenetic tree was generated. To make an easy genetic analysis of domain III region, a Restriction Fragment Length Polymorphism (RFLP) assay was performed, using six restriction enzymes. Results Study results demonstrate that nucleotide and aminoacid sequence analysis of domain III are similar to those reported from the complete E protein gene. Based on the RFLP analysis of domain III using the restriction enzymes Nla III, Dde I and Cfo I, Den-4 viruses included in this study were clustered into genotypes 1 and 2 previously reported. Conclusions Study results suggest that domain III may be used as a genetic marker for phylogenetic and molecular epidemiology studies of dengue viruses. The English version of this paper is available too at: http://www.insp.mx/salud/index.html PMID:12132320
2012-01-01
The increasing size and complexity of exome/genome sequencing data requires new tools for clinical geneticists to discover disease-causing variants. Bottlenecks in identifying the causative variation include poor cross-sample querying, constantly changing functional annotation and not considering existing knowledge concerning the phenotype. We describe a methodology that facilitates exploration of patient sequencing data towards identification of causal variants under different genetic hypotheses. Annotate-it facilitates handling, analysis and interpretation of high-throughput single nucleotide variant data. We demonstrate our strategy using three case studies. Annotate-it is freely available and test data are accessible to all users at http://www.annotate-it.org. PMID:23013645
Kawaguchi, Fuki; Okura, Kazuki; Oyama, Kenji; Mannen, Hideyuki; Sasazaki, Shinji
2017-03-01
Previous studies have indicated that some leptin gene polymorphisms were associated with economically important traits in cattle breeds. However, polymorphisms in the leptin gene have not been reported thus far in Japanese Black cattle. Here, we aimed to identify the leptin gene polymorphisms which are associated with carcass traits and fatty acid composition in Japanese Black cattle. We sequenced the full-length coding sequence of leptin gene for eight Japanese Black cattle. Sequence comparison revealed eight single nucleotide polymorphisms (SNPs). Three of these were predicted to cause amino acid substitutions: Y7F, R25C and A80V. Then, we genotyped these SNPs in two populations (JB1 with 560 animals and JB2 with 450 animals) and investigated the effects on the traits. Y7F in JB1 and A80V in JB2 were excluded from statistical analysis because the minor allele frequencies were low (< 0.1). Association analysis revealed that Y7F had a significant effect on the dressed carcass weight in JB2; R25C had a significant effect on C18:0 and C14:1 in JB1 and JB2, respectively; and A80V had a significant effect on C16:0, C16:1, C18:1, monounsaturated fatty acid and saturated fatty acid in JB1. The results suggested that these SNPs could be used as an effective marker for the improvement of Japanese Black cattle. © 2016 Japanese Society of Animal Science.
Voruganti, V. Saroja; Cole, Shelley A.; Haack, Karin; Comuzzie, Anthony G.; Muzny, Donna M.; Wheeler, David A.; Chang, Kyle; Hawes, Alicia; Gibbs, Richard A.
2011-01-01
Our objective was to resequence insulin receptor substrate 2 (IRS2) to identify variants associated with obesity- and diabetes-related traits in Hispanic children. Exonic and intronic segments, 5′ and 3′ flanking regions of IRS2 (∼14.5 kb), were bidirectionally sequenced for single nucleotide polymorphism (SNP) discovery in 934 Hispanic children using 3730XL DNA Sequencers. Additionally, 15 SNPs derived from Illumina HumanOmni1-Quad BeadChips were analyzed. Measured genotype analysis tested associations between SNPs and obesity and diabetes-related traits. Bayesian quantitative trait nucleotide analysis was used to statistically infer the most likely functional polymorphisms. A total of 140 SNPs were identified with minor allele frequencies (MAF) ranging from 0.001 to 0.47. Forty-two of the 70 coding SNPs result in nonsynonymous amino acid substitutions relative to the consensus sequence; 28 SNPs were detected in the promoter, 12 in introns, 28 in the 3′-UTR, and 2 in the 5′-UTR. Two insertion/deletions (indels) were detected. Ten independent rare SNPs (MAF = 0.001–0.009) were associated with obesity-related traits (P = 0.01–0.00002). SNP 10510452_139 in the promoter region was shown to have a high posterior probability (P = 0.77–0.86) of influencing BMI, fat mass, and waist circumference in Hispanic children. SNP 10510452_139 contributed between 2 and 4% of the population variance in body weight and composition. None of the SNPs or indels were associated with diabetes-related traits or accounted for a previously identified quantitative trait locus on chromosome 13 for fasting serum glucose. Rare but not common IRS2 variants may play a role in the regulation of body weight but not an essential role in fasting glucose homeostasis in Hispanic children. PMID:21771880
Mullen, M P; Berry, D P; Howard, D J; Diskin, M G; Lynch, C O; Berkowicz, E W; Magee, D A; MacHugh, D E; Waters, S M
2010-12-01
Growth hormone, produced in the anterior pituitary gland, stimulates the release of insulin-like growth factor-I from the liver and is of critical importance in the control of nutrient utilization and partitioning for lactogenesis, fertility, growth, and development in cattle. The aim of this study was to discover novel polymorphisms in the bovine growth hormone gene (GH1) and to quantify their association with performance using estimates of genetic merit on 848 Holstein-Friesian AI (artificial insemination) dairy sires. Associations with previously reported polymorphisms in the bovine GH1 gene were also undertaken. A total of 38 novel single nucleotide polymorphisms (SNP) were identified across a panel of 22 beef and dairy cattle by sequence analysis of the 5' promoter, intronic, exonic, and 3' regulatory regions, encompassing approximately 7 kb of the GH1 gene. Following multiple regression analysis on all SNP, associations were identified between 11 SNP (2 novel and 9 previously identified) and milk fat and protein yield, milk composition, somatic cell score, survival, body condition score, and body size. The G allele of a previously identified SNP in exon 5 at position 2141 of the GH1 sequence, resulting in a nonsynonymous substitution, was associated with decreased milk protein yield. The C allele of a novel SNP, GH32, was associated with inferior carcass conformation. In addition, the T allele of a previously characterized SNP, GH35, was associated with decreased survival. Both GH24 (novel) and GH35 were independently associated with somatic cell count, and 3 SNP, GH21, 2291, and GH35, were independently associated with body depth. Furthermore, 2 SNP, GH24 and GH63, were independently associated with carcass fat. Results of this study further demonstrate the multifaceted influences of GH1 on milk production, fertility, and growth-related traits in cattle. Copyright © 2010 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
2011-01-01
The genomic DNA sequence of a novel enteric uncultured microphage, ΦCA82 from a turkey gastrointestinal system was determined utilizing metagenomics techniques. The entire circular, single-stranded nucleotide sequence of the genome was 5,514 nucleotides. The ΦCA82 genome is quite different from other microviruses as indicated by comparisons of nucleotide similarity, predicted protein similarity, and functional classifications. Only three genes showed significant similarity to microviral proteins as determined by local alignments using BLAST analysis. ORF1 encoded a predicted phage F capsid protein that was phylogenetically most similar to the Microviridae ΦMH2K member's major coat protein. The ΦCA82 genome also encoded a predicted minor capsid protein (ORF2) and putative replication initiation protein (ORF3) most similar to the microviral bacteriophage SpV4. The distant evolutionary relationship of ΦCA82 suggests that the divergence of this novel turkey microvirus from other microviruses may reflect unique evolutionary pressures encountered within the turkey gastrointestinal system. PMID:21714899
Hydration sites of unpaired RNA bases: a statistical analysis of the PDB structures.
Kirillova, Svetlana; Carugo, Oliviero
2011-10-19
Hydration is crucial for RNA structure and function. X-ray crystallography is the most commonly used method to determine RNA structures and hydration and, therefore, statistical surveys are based on crystallographic results, the number of which is quickly increasing. A statistical analysis of the water molecule distribution in high-resolution X-ray structures of unpaired RNA nucleotides showed that: different bases have the same penchant to be surrounded by water molecules; clusters of water molecules indicate possible hydration sites, which, in some cases, match those of the major and minor grooves of RNA and DNA double helices; complex hydrogen bond networks characterize the solvation of the nucleotides, resulting in a significant rigidity of the base and its surrounding water molecules. Interestingly, the hydration sites around unpaired RNA bases do not match, in general, the positions that are occupied by the second nucleotide when the base-pair is formed. The hydration sites around unpaired RNA bases were found. They do not replicate the atom positions of complementary bases in the Watson-Crick pairs.
Reference genotype and exome data from an Australian Aboriginal population for health-based research
Tang, Dave; Anderson, Denise; Francis, Richard W.; Syn, Genevieve; Jamieson, Sarra E.; Lassmann, Timo; Blackwell, Jenefer M.
2016-01-01
Genetic analyses, including genome-wide association studies and whole exome sequencing (WES), provide powerful tools for the analysis of complex and rare genetic diseases. To date there are no reference data for Aboriginal Australians to underpin the translation of health-based genomic research. Here we provide a catalogue of variants called after sequencing the exomes of 72 Aboriginal individuals to a depth of 20X coverage in ∼80% of the sequenced nucleotides. We determined 320,976 single nucleotide variants (SNVs) and 47,313 insertions/deletions using the Genome Analysis Toolkit. We had previously genotyped a subset of the Aboriginal individuals (70/72) using the Illumina Omni2.5 BeadChip platform and found ~99% concordance at overlapping sites, which suggests high quality genotyping. Finally, we compared our SNVs to six publicly available variant databases, such as dbSNP and the Exome Sequencing Project, and 70,115 of our SNVs did not overlap any of the single nucleotide polymorphic sites in all the databases. Our data set provides a useful reference point for genomic studies on Aboriginal Australians. PMID:27070114
Tang, Dave; Anderson, Denise; Francis, Richard W; Syn, Genevieve; Jamieson, Sarra E; Lassmann, Timo; Blackwell, Jenefer M
2016-04-12
Genetic analyses, including genome-wide association studies and whole exome sequencing (WES), provide powerful tools for the analysis of complex and rare genetic diseases. To date there are no reference data for Aboriginal Australians to underpin the translation of health-based genomic research. Here we provide a catalogue of variants called after sequencing the exomes of 72 Aboriginal individuals to a depth of 20X coverage in ∼80% of the sequenced nucleotides. We determined 320,976 single nucleotide variants (SNVs) and 47,313 insertions/deletions using the Genome Analysis Toolkit. We had previously genotyped a subset of the Aboriginal individuals (70/72) using the Illumina Omni2.5 BeadChip platform and found ~99% concordance at overlapping sites, which suggests high quality genotyping. Finally, we compared our SNVs to six publicly available variant databases, such as dbSNP and the Exome Sequencing Project, and 70,115 of our SNVs did not overlap any of the single nucleotide polymorphic sites in all the databases. Our data set provides a useful reference point for genomic studies on Aboriginal Australians.
Hydration sites of unpaired RNA bases: a statistical analysis of the PDB structures
2011-01-01
Background Hydration is crucial for RNA structure and function. X-ray crystallography is the most commonly used method to determine RNA structures and hydration and, therefore, statistical surveys are based on crystallographic results, the number of which is quickly increasing. Results A statistical analysis of the water molecule distribution in high-resolution X-ray structures of unpaired RNA nucleotides showed that: different bases have the same penchant to be surrounded by water molecules; clusters of water molecules indicate possible hydration sites, which, in some cases, match those of the major and minor grooves of RNA and DNA double helices; complex hydrogen bond networks characterize the solvation of the nucleotides, resulting in a significant rigidity of the base and its surrounding water molecules. Interestingly, the hydration sites around unpaired RNA bases do not match, in general, the positions that are occupied by the second nucleotide when the base-pair is formed. Conclusions The hydration sites around unpaired RNA bases were found. They do not replicate the atom positions of complementary bases in the Watson-Crick pairs. PMID:22011380
Glucose Limitation Alters Glutamine Metabolism in MUC1-Overexpressing Pancreatic Cancer Cells.
Gebregiworgis, Teklab; Purohit, Vinee; Shukla, Surendra K; Tadros, Saber; Chaika, Nina V; Abrego, Jaime; Mulder, Scott E; Gunda, Venugopal; Singh, Pankaj K; Powers, Robert
2017-10-06
Pancreatic cancer cells overexpressing Mucin 1 (MUC1) rely on aerobic glycolysis and, correspondingly, are dependent on glucose for survival. Our NMR metabolomics comparative analysis of control (S2-013.Neo) and MUC1-overexpressing (S2-013.MUC1) cells demonstrates that MUC1 reprograms glutamine metabolism upon glucose limitation. The observed alteration in glutamine metabolism under glucose limitation was accompanied by a relative decrease in the proliferation of MUC1-overexpressing cells compared with steady-state conditions. Moreover, glucose limitation induces G1 phase arrest where S2-013.MUC1 cells fail to enter S phase and synthesize DNA because of a significant disruption in pyrimidine nucleotide biosynthesis. Our metabolomics analysis indicates that glutamine is the major source of oxaloacetate in S2-013.Neo and S2-013.MUC1 cells, where oxaloacetate is converted to aspartate, an important metabolite for pyrimidine nucleotide biosynthesis. However, glucose limitation impedes the flow of glutamine carbons into the pyrimidine nucleotide rings and instead leads to a significant accumulation of glutamine-derived aspartate in S2-013.MUC1 cells.
Molecular detection of kobuviruses in European roe deer (Capreolus capreolus) in Italy.
Di Martino, Barbara; Di Profio, Federica; Melegari, Irene; Di Felice, Elisabetta; Robetto, Serena; Guidetti, Cristina; Orusa, Riccardo; Martella, Vito; Marsilio, Fulvio
2015-08-01
Kobuvirus RNA was found in 6.6 % (13/198) of stool specimens from roe deer (Capreolus capreolus) captured during the regular hunting season. Upon sequence analysis of a fragment of the 3D gene, nine strains displayed the highest nucleotide sequence identity (91.2-97.4 %) to bovine kobuviruses previously detected in either diarrhoeic or asymptomatic calves. Interestingly, four strains were genetically related to the newly discovered caprine kobuviruses (84.2-87.6 % nucleotide identity) identified in black goats in Korea.
González, Carolina; Tabernero, David; Cortese, Maria Francesca; Gregori, Josep; Casillas, Rosario; Riveiro-Barciela, Mar; Godoy, Cristina; Sopena, Sara; Rando, Ariadna; Yll, Marçal; Lopez-Martinez, Rosa; Quer, Josep; Esteban, Rafael; Buti, Maria; Rodríguez-Frías, Francisco
2018-05-21
To detect hyper-conserved regions in the hepatitis B virus (HBV) X gene ( HBX ) 5' region that could be candidates for gene therapy. The study included 27 chronic hepatitis B treatment-naive patients in various clinical stages (from chronic infection to cirrhosis and hepatocellular carcinoma, both HBeAg-negative and HBeAg-positive), and infected with HBV genotypes A-F and H. In a serum sample from each patient with viremia > 3.5 log IU/mL, the HBX 5' end region [nucleotide (nt) 1255-1611] was PCR-amplified and submitted to next-generation sequencing (NGS). We assessed genotype variants by phylogenetic analysis, and evaluated conservation of this region by calculating the information content of each nucleotide position in a multiple alignment of all unique sequences (haplotypes) obtained by NGS. Conservation at the HBx protein amino acid (aa) level was also analyzed. NGS yielded 1333069 sequences from the 27 samples, with a median of 4578 sequences/sample (2487-9279, IQR 2817). In 14/27 patients (51.8%), phylogenetic analysis of viral nucleotide haplotypes showed a complex mixture of genotypic variants. Analysis of the information content in the haplotype multiple alignments detected 2 hyper-conserved nucleotide regions, one in the HBX upstream non-coding region (nt 1255-1286) and the other in the 5' end coding region (nt 1519-1603). This last region coded for a conserved amino acid region (aa 63-76) that partially overlaps a Kunitz-like domain. Two hyper-conserved regions detected in the HBX 5' end may be of value for targeted gene therapy, regardless of the patients' clinical stage or HBV genotype.
Azab, Hassan A; Hussein, Belal H M; El-Falouji, Abdullah I
2012-03-01
Eu(III)-9-acridinecarboxylate (9-ACA) complex was synthesized and characterized by elemental analysis, conductivity measurement, IR spectroscopy, thermal analysis, mass spectroscopy, (1)H-NMR, fluorescence and ultraviolet spectra. The results indicated that the composition of this complex is [Eu(III)-(9-ACA)(2)(NCS)(C(2)H(5)OH)(2)] 2.5 H(2)O and the oxygen of the carbonyl group coordinated to Eu(III). The interaction between the complex with nucleotides guanosine 5'- monophosphate (5'-GMP), adenosine 5'-diphosphates (5'-ADP), inosine (5'-IMP) and CT-DNA was studied by fluorescence spectroscopy. The fluorescence intensity of Eu(III)-9-acridinecarboxylate complex was enhanced with the addition of CT-DNA. The effect of pH values on the fluorescence intensity of Eu(III) complex was investigated. Under experimental conditions, the linear range was 9-50 ng mL(-1) for calf thymus DNA (CT- DNA) and the corresponding detection limit was 5 ng mL(-1). The results showed that Eu(III)-(9-ACA)(2) complex binds to CT-DNA with stability constant of 2.41 × 10(4) M.
Genetic Diversity and Molecular Evolution of Chinese Waxy Maize Germplasm
Zheng, Hongjian; Wang, Hui; Yang, Hua; Wu, Jinhong; Shi, Biao; Cai, Run; Xu, Yunbi; Wu, Aizhong; Luo, Lijun
2013-01-01
Waxy maize (Zea mays L. var. certaina Kulesh), with many excellent characters in terms of starch composition and economic value, has grown in China for a long history and its production has increased dramatically in recent decades. However, the evolution and origin of waxy maize still remains unclear. We studied the genetic diversity of Chinese waxy maize including typical landraces and inbred lines by SSR analysis and the results showed a wide genetic diversity in the Chinese waxy maize germplasm. We analyzed the origin and evolution of waxy maize by sequencing 108 samples, and downloading 52 sequences from GenBank for the waxy locus in a number of accessions from genus Zea. A sharp reduction of nucleotide diversity and significant neutrality tests (Tajima’s D and Fu and Li’s F*) were observed at the waxy locus in Chinese waxy maize but not in nonglutinous maize. Phylogenetic analysis indicated that Chinese waxy maize originated from the cultivated flint maize and most of the modern waxy maize inbred lines showed a distinct independent origin and evolution process compared with the germplasm from Southwest China. The results indicated that an agronomic trait can be quickly improved to meet production demand by selection. PMID:23818949
Ito, Hiroya; Ogawa, Torata; Fukamizu, Dai; Morinaga, Yuiko; Kusumoto, Masahiro
2016-11-01
The aim of our study was to reveal the molecular basis of the serologic nontypeability of 2 Actinobacillus pleuropneumoniae field isolates. Nine field strains of A. pleuropneumoniae, the causative agent of porcine pleuropneumonia, were isolated from pigs raised on the same farm and sent to our diagnostic laboratory for serotyping. Seven of the 9 strains were identified as serovar 15 strains by immunodiffusion tests. However, 2 strains, designated FH24-2 and FH24-5, could not be serotyped with antiserum prepared against serovars 1-15. Strain FH24-5 showed positive results in 2 serovar 15-specific PCR tests, whereas strain FH24-2 was only positive in 1 of the 2 PCR tests. The nucleotide sequence analysis of gene clusters involved in capsular polysaccharide biosynthesis of the 2 nontypeable strains revealed that both had been rendered nontypeable by the action of ISApl1, a transposable element of A. pleuropneumoniae belonging to the IS30 family. The results showed that ISApl1 of A. pleuropneumoniae can interfere with both the serologic and molecular typing methods, and that nucleotide sequence analysis across the capsular gene clusters is the best means of determining the cause of serologic nontypeability in A. pleuropneumoniae. © 2016 The Author(s).
Li, Peipei; Piao, Yongjun; Shon, Ho Sun; Ryu, Keun Ho
2015-10-28
Recently, rapid improvements in technology and decrease in sequencing costs have made RNA-Seq a widely used technique to quantify gene expression levels. Various normalization approaches have been proposed, owing to the importance of normalization in the analysis of RNA-Seq data. A comparison of recently proposed normalization methods is required to generate suitable guidelines for the selection of the most appropriate approach for future experiments. In this paper, we compared eight non-abundance (RC, UQ, Med, TMM, DESeq, Q, RPKM, and ERPKM) and two abundance estimation normalization methods (RSEM and Sailfish). The experiments were based on real Illumina high-throughput RNA-Seq of 35- and 76-nucleotide sequences produced in the MAQC project and simulation reads. Reads were mapped with human genome obtained from UCSC Genome Browser Database. For precise evaluation, we investigated Spearman correlation between the normalization results from RNA-Seq and MAQC qRT-PCR values for 996 genes. Based on this work, we showed that out of the eight non-abundance estimation normalization methods, RC, UQ, Med, TMM, DESeq, and Q gave similar normalization results for all data sets. For RNA-Seq of a 35-nucleotide sequence, RPKM showed the highest correlation results, but for RNA-Seq of a 76-nucleotide sequence, least correlation was observed than the other methods. ERPKM did not improve results than RPKM. Between two abundance estimation normalization methods, for RNA-Seq of a 35-nucleotide sequence, higher correlation was obtained with Sailfish than that with RSEM, which was better than without using abundance estimation methods. However, for RNA-Seq of a 76-nucleotide sequence, the results achieved by RSEM were similar to without applying abundance estimation methods, and were much better than with Sailfish. Furthermore, we found that adding a poly-A tail increased alignment numbers, but did not improve normalization results. Spearman correlation analysis revealed that RC, UQ, Med, TMM, DESeq, and Q did not noticeably improve gene expression normalization, regardless of read length. Other normalization methods were more efficient when alignment accuracy was low; Sailfish with RPKM gave the best normalization results. When alignment accuracy was high, RC was sufficient for gene expression calculation. And we suggest ignoring poly-A tail during differential gene expression analysis.
Minnicelli, Carolina; Segges, Priscilla; Stefanoff, Gustavo; Kristcevic, Flavia; Ezpeleta, Joaquin; Tapia, Elizabeth; Niedobitek, Gerald; Barros, Mário Henrique M.
2018-01-01
ABSTRACT Interleukin-10 (IL10) is an immune regulatory cytokine. Single nucleotide polymorphisms (SNPs) in IL10 promoter have been associated with prognosis in adult classical Hodgkin lymphoma (cHL). We analyzed IL10 SNPs −1082 and −592 in respect of therapy response, gene expression and tumor microenvironment (TME) composition in 98 pediatric patients with cHL. As confirmatory results, we found that −1082AA/AG; −592CC genotypes and ATA haplotype were associated with unfavourable prognosis: Progression-free survival (PFS) was shorter in −1082AA+AG (72.2%) than in GG patients (100%) (P = 0.024), and in −592AA (50%) and AC (74.2%) vs. CC patients (87.0%) (P = 0.009). In multivariate analysis, the −592CC genotype and the ATA haplotype retained prognostic impact (HR: 0.41, 95% CI 0.2–0.86; P = 0.018, and HR: 3.06 95% CI 1.03–9.12; P = 0.044, respectively). Our analysis further led to some new observations, namely: (1) Low IL10 mRNA expression was associated with −1082GG genotype (P = 0.014); (2) IL10 promoter polymorphisms influence TME composition;−1082GG/−592CC carriers showed low numbers of infiltrating cells expressing MAF transcription factor (20 vs. 78 and 49 vs. 108 cells/mm2, respectively; P< 0.05); while ATA haplotype (high expression) associated with high numbers of MAF+ cells (P = 0.005). Specifically, −1082GG patients exhibited low percentages of CD68+MAF+ (M2-like) intratumoral macrophages (15.04% vs. 47.26%, P = 0.017). Considering ours as an independent validation cohort, our results give support to the clinical importance of IL10 polymorphisms in the full spectrum of cHL, and advance the concept of genetic control of microenvironment composition as a basis for susceptibility and therapeutic response. PMID:29721365
Vera-Lozada, Gabriela; Minnicelli, Carolina; Segges, Priscilla; Stefanoff, Gustavo; Kristcevic, Flavia; Ezpeleta, Joaquin; Tapia, Elizabeth; Niedobitek, Gerald; Barros, Mário Henrique M; Hassan, Rocio
2018-01-01
Interleukin-10 (IL10) is an immune regulatory cytokine. Single nucleotide polymorphisms (SNPs) in IL10 promoter have been associated with prognosis in adult classical Hodgkin lymphoma (cHL). We analyzed IL10 SNPs -1082 and -592 in respect of therapy response, gene expression and tumor microenvironment (TME) composition in 98 pediatric patients with cHL. As confirmatory results, we found that -1082AA/AG; -592CC genotypes and ATA haplotype were associated with unfavourable prognosis: Progression-free survival (PFS) was shorter in -1082AA+AG (72.2%) than in GG patients (100%) (P = 0.024), and in -592AA (50%) and AC (74.2%) vs. CC patients (87.0%) (P = 0.009). In multivariate analysis, the -592CC genotype and the ATA haplotype retained prognostic impact (HR: 0.41, 95% CI 0.2-0.86; P = 0.018, and HR: 3.06 95% CI 1.03-9.12; P = 0.044, respectively). Our analysis further led to some new observations, namely: (1) Low IL10 mRNA expression was associated with -1082GG genotype (P = 0.014); (2) IL10 promoter polymorphisms influence TME composition;-1082GG/-592CC carriers showed low numbers of infiltrating cells expressing MAF transcription factor (20 vs. 78 and 49 vs. 108 cells/mm 2 , respectively; P< 0.05); while ATA haplotype (high expression) associated with high numbers of MAF+ cells (P = 0.005). Specifically, -1082GG patients exhibited low percentages of CD68+MAF+ (M2-like) intratumoral macrophages (15.04% vs. 47.26%, P = 0.017). Considering ours as an independent validation cohort, our results give support to the clinical importance of IL10 polymorphisms in the full spectrum of cHL, and advance the concept of genetic control of microenvironment composition as a basis for susceptibility and therapeutic response.
Cecchinato, A; Ribeca, C; Chessa, S; Cipolat-Gotet, C; Maretto, F; Casellas, J; Bittante, G
2014-07-01
The aim of this study was to investigate 96 single-nucleotide polymorphisms (SNPs) from 54 candidate genes, and test the associations of the polymorphic SNPs with milk yield, composition, milk urea nitrogen (MUN) content and somatic cell score (SCS) in individual milk samples from Italian Brown Swiss cows. Milk and blood samples were collected from 1271 cows sampled once from 85 herds. Milk production, quality traits (i.e. protein, casein, fat and lactose percentages), MUN and SCS were measured for each milk sample. Genotyping was performed using a custom Illumina VeraCode GoldenGate approach. A Bayesian linear animal model that considered the effects of herd, days in milk, parity, SNP genotype and additive polygenic effect was used for the association analysis. Our results showed that 14 of the 51 polymorphic SNPs had relevant additive effects on at least one of the aforementioned traits. Polymorphisms in the glucocorticoid receptor DNA-binding factor 1 (GRLF1), prolactin receptor (PRLR) and chemokine ligand 2 (CCL2) were associated with milk yield; an SNP in the stearoyl-CoA desaturase (SCD-1) was related to fat content; SNPs in the caspase recruitment domain 15 protein (CARD15) and lipin 1 (LPIN1) affected the protein and casein contents; SNPs in growth hormone 1 (GH1), lactotransferrin (LTF) and SCD-1 were relevant for casein number; variants in beta casein (CSN2), GH1, GRLF1 and LTF affected lactose content; SNPs in beta-2 adrenergic receptor (ADRB2), serpin peptidase inhibitor (PI) and SCD-1 were associated with MUN; and SNPs in acetyl-CoA carboxylase alpha (ACACA) and signal transducer and activator of transcription 5A (STAT5A) were relevant in explaining the variation of SCS. Although further research is needed to validate these SNPs in other populations and breeds, the association between these markers and milk yield, composition, MUN and SCS could be exploited in gene-assisted selection programs for genetic improvement purposes.
4G/5G Plasminogen Activator Inhibitor-1 Polymorphisms and Haplotypes Are Associated with Pneumonia
Yende, Sachin; Angus, Derek C.; Ding, Jingzhong; Newman, Anne B.; Kellum, John A.; Li, Rongling; Ferrell, Robert E.; Zmuda, Joseph; Kritchevsky, Stephen B.; Harris, Tamara B.; Garcia, Melissa; Yaffe, Kristine; Wunderink, Richard G.
2007-01-01
Rationale: Plasminogen activator inhibitor (PAI)-1 inhibits urokinase and tissue plasminogen activator, required for host response to infection. Whether variation within the PAI-1 gene is associated with increased susceptibility to infection is unknown. Objectives: To ascertain the role of the 4G/5G polymorphism and other genetic variants within the PAI-1 gene. We hypothesized that variants associated with increased PAI-1 expression would be associated with an increased occurrence of community-acquired pneumonia (CAP). Methods: Longitudinal analysis (>12 yr) of the Health, Aging, and Body Composition cohort, aged 65–74 years at start of analysis. Measurements and Main Results: We genotyped the 4G/5G PAI-1 polymorphism and six additional single nucleotide polymorphisms. Of the 3,075 subjects, 272 (8.8%) had at least one hospitalization for CAP. Among whites, variants at the PAI4G,5G, PAI2846, and PAI7343 sites had higher risk of CAP (P = 0.018, 0.021, and 0.021, respectively). At these sites, variants associated with higher PAI-1 expression were associated with increased CAP susceptibility. Compared with the 5G/5G genotypes at PAI4G,5G site, the 4G/4G and 4G/5G genotypes were associated with a 1.98-fold increased risk of CAP (95% confidence interval, 1.2–3.2; P = 0.006). In whole blood stimulation assay, subjects with a 4G allele had 3.3- and 1.9-fold increased PAI-1 expression (P = 0.043 and 0.034, respectively). In haplotype analysis, the 4G/G/C/A haplotype at the PAI4G,5G, PAI2846, PAI4588, and PAI7343 single nucleotide polymorphisms was associated with higher CAP susceptibility, whereas the 5G/G/C/A haplotype was associated with lower CAP susceptibility. No associations were seen among blacks. Conclusions: Genotypes associated with increased expression of PAI-1 were associated with increased susceptibility to CAP in elderly whites. PMID:17761618
Le Rhun, Emilie; Bertrand, Nicolas; Dumont, Aurélie; Tresch, Emmanuelle; Le Deley, Marie-Cécile; Mailliez, Audrey; Preusser, Matthias; Weller, Michael; Revillion, Françoise; Bonneterre, Jacques
2017-12-01
The PI3K-AKT-mTOR pathway may be involved in the development of central nervous system (CNS) metastasis from breast cancer. Accordingly, herein we explored whether single nucleotide polymorphisms (SNPs) of this pathway are associated with altered risk of CNS metastasis formation in metastatic breast cancer patients. The GENEOM study (NCT00959556) included blood sample collection from breast cancer patients treated in the neoadjuvant, adjuvant or metastatic setting. We identified patients with CNS metastases for comparison with patients without CNS metastasis, defined as either absence of neurological symptoms or normal brain magnetic resonance imaging (MRI) before death or during 5-year follow-up. Eighty-eight SNPs of phosphoinositide 3-kinase (PI3K)/protein kinase B (AKT)/mammalian (or mechanistic) target of rapamycin (mTOR) pathway genes were selected for analysis: AKT1 (17 SNPs), AKT2 (4), FGFR1 (2), mTOR (7), PDK1 (4), PI3KR1 (11), PI3KCA (20), PTEN (17), RPS6KB1 (6). Of 342 patients with metastases, 207 fulfilled the inclusion criteria: One-hundred-and-seven patients remained free of CNS metastases at last follow-up or date of death whereas 100 patients developed CNS metastases. Among clinical parameters, hormonal and human epidermal growth factor receptor-2 (HER2) status as well as vascular tumour emboli was associated with risk of CNS metastasis. Only PI3KR1-rs706716 was associated with CNS metastasis in univariate analysis after Bonferroni correction (p < 0.00085). Multivariate analysis showed associations between AKT1-rs3803304, AKT2-rs3730050, PDK1-rs11686903 and PI3KR1-rs706716 and CNS metastasis . PI3KR1-rs706716 may be associated with CNS metastasis in metastatic breast cancer patients and could be included in a predictive composite score to detect early CNS metastasis irrespective of breast cancer subtype. Copyright © 2017 Elsevier Ltd. All rights reserved.
Fiallo-Olivé, Elvira; Navas-Castillo, Jesús; Moriones, Enrique; Martínez-Zubiaur, Yamila
2012-01-01
As a result of surveys conducted during the last few years to search for wild reservoirs of begomoviruses in Cuba, we detected a novel bipartite begomovirus, sida yellow mottle virus (SiYMoV), infecting Sida rhombifolia plants. The complete genome sequence was obtained, showing that DNA-A was 2622 nucleotides (nt) in length and that it was most closely related (87.6% nucleotide identity) to DNA-A of an isolate of sida golden mosaic virus (SiGMV) that infects snap beans (Phaseolus vulgaris) in Florida. The DNA-B sequence was 2600 nt in length and shared the highest nucleotide identity (75.1%) with corchorus yellow spot virus (CoYSV). Phylogenetic relationship analysis showed that both DNA components of SiYMoV were grouped in the Abutilon clade, along with begomoviruses from Florida and the Caribbean islands. We also present here the complete nucleotide sequence of a novel strain of sida yellow vein virus found infecting Malvastrum coromandelianum and an isolate of euphorbia mosaic virus that was found for the first time infecting Euphorbia heterophylla in Cuba.
Viral replication. Structural basis for RNA replication by the hepatitis C virus polymerase.
Appleby, Todd C; Perry, Jason K; Murakami, Eisuke; Barauskas, Ona; Feng, Joy; Cho, Aesop; Fox, David; Wetmore, Diana R; McGrath, Mary E; Ray, Adrian S; Sofia, Michael J; Swaminathan, S; Edwards, Thomas E
2015-02-13
Nucleotide analog inhibitors have shown clinical success in the treatment of hepatitis C virus (HCV) infection, despite an incomplete mechanistic understanding of NS5B, the viral RNA-dependent RNA polymerase. Here we study the details of HCV RNA replication by determining crystal structures of stalled polymerase ternary complexes with enzymes, RNA templates, RNA primers, incoming nucleotides, and catalytic metal ions during both primed initiation and elongation of RNA synthesis. Our analysis revealed that highly conserved active-site residues in NS5B position the primer for in-line attack on the incoming nucleotide. A β loop and a C-terminal membrane-anchoring linker occlude the active-site cavity in the apo state, retract in the primed initiation assembly to enforce replication of the HCV genome from the 3' terminus, and vacate the active-site cavity during elongation. We investigated the incorporation of nucleotide analog inhibitors, including the clinically active metabolite formed by sofosbuvir, to elucidate key molecular interactions in the active site. Copyright © 2015, American Association for the Advancement of Science.
Mechanism of nucleotide sensing in group II chaperonins.
Pereira, Jose H; Ralston, Corie Y; Douglas, Nicholai R; Kumar, Ramya; Lopez, Tom; McAndrew, Ryan P; Knee, Kelly M; King, Jonathan A; Frydman, Judith; Adams, Paul D
2012-02-01
Group II chaperonins mediate protein folding in an ATP-dependent manner in eukaryotes and archaea. The binding of ATP and subsequent hydrolysis promotes the closure of the multi-subunit rings where protein folding occurs. The mechanism by which local changes in the nucleotide-binding site are communicated between individual subunits is unknown. The crystal structure of the archaeal chaperonin from Methanococcus maripaludis in several nucleotides bound states reveals the local conformational changes associated with ATP hydrolysis. Residue Lys-161, which is extremely conserved among group II chaperonins, forms interactions with the γ-phosphate of ATP but shows a different orientation in the presence of ADP. The loss of the ATP γ-phosphate interaction with Lys-161 in the ADP state promotes a significant rearrangement of a loop consisting of residues 160-169. We propose that Lys-161 functions as an ATP sensor and that 160-169 constitutes a nucleotide-sensing loop (NSL) that monitors the presence of the γ-phosphate. Functional analysis using NSL mutants shows a significant decrease in ATPase activity, suggesting that the NSL is involved in timing of the protein folding cycle.
Major, Peter; Embley, T. Martin
2017-01-01
Plasma membrane-located nucleotide transport proteins (NTTs) underpin the lifestyle of important obligate intracellular bacterial and eukaryotic pathogens by importing energy and nucleotides from infected host cells that the pathogens can no longer make for themselves. As such their presence is often seen as a hallmark of an intracellular lifestyle associated with reductive genome evolution and loss of primary biosynthetic pathways. Here, we investigate the phylogenetic distribution of NTT sequences across the domains of cellular life. Our analysis reveals an unexpectedly broad distribution of NTT genes in both host-associated and free-living prokaryotes and eukaryotes. We also identify cases of within-bacteria and bacteria-to-eukaryote horizontal NTT transfer, including into the base of the oomycetes, a major clade of parasitic eukaryotes. In addition to identifying sequences that retain the canonical NTT structure, we detected NTT gene fusions with HEAT-repeat and cyclic nucleotide binding domains in Cyanobacteria, pathogenic Chlamydiae and Oomycetes. Our results suggest that NTTs are versatile functional modules with a much wider distribution and a broader range of potential roles than has previously been appreciated. PMID:28164241
Single-molecule comparison of DNA Pol I activity with native and analog nucleotides
NASA Astrophysics Data System (ADS)
Gul, Osman; Olsen, Tivoli; Choi, Yongki; Corso, Brad; Weiss, Gregory; Collins, Philip
2014-03-01
DNA polymerases are critical enzymes for DNA replication, and because of their complex catalytic cycle they are excellent targets for investigation by single-molecule experimental techniques. Recently, we studied the Klenow fragment (KF) of DNA polymerase I using a label-free, electronic technique involving single KF molecules attached to carbon nanotube transistors. The electronic technique allowed long-duration monitoring of a single KF molecule while processing thousands of template strands. Processivity of up to 42 nucleotide bases was directly observed, and statistical analysis of the recordings determined key kinetic parameters for the enzyme's open and closed conformations. Subsequently, we have used the same technique to compare the incorporation of canonical nucleotides like dATP to analogs like 1-thio-2'-dATP. The analog had almost no affect on duration of the closed conformation, during which the nucleotide is incorporated. On the other hand, the analog increased the rate-limiting duration of the open conformation by almost 40%. We propose that the thiolated analog interferes with KF's recognition and binding, two key steps that determine its ensemble turnover rate.
In vitro nonenzymatic glycation of guanosine 5'-triphosphate by dihydroxyacetone phosphate.
Li, Yuyuan; Cohenford, Menashi A; Dutta, Udayan; Dain, Joel A
2008-11-01
Dihydroxyacetone phosphate (DHAP) is a glycolytic intermediate that has been found to be significantly elevated in the erythrocytes of diabetic patients and patients with triosephosphate isomerase deficiency. DHAP spontaneously breaks down to methylglyoxal, a potent glycating agent that reacts with proteins and nucleic acids in vivo to form advanced glycation endproducts (AGEs). Like methylglyoxal, DHAP itself is also a glycating metabolite, capable of condensing with proteins and altering their structure or function. The objective of this investigation was to evaluate the susceptibility of nucleotides to nonenzymatic attack by DHAP, and to determine the factors influencing the rate and extent of nucleotide glycation by this sugar. Of the four nucleotide triphosphates (ATP, CTP, GTP and UTP) that were studied, only GTP was reactive, forming a wide range of UV and fluorescent products with DHAP. Increases in temperature and nucleotide concentration enhanced the rate and extent of GTP glycation by DHAP and promoted the heterogeneity of AGEs. Capillary electrophoresis, HPLC, and mass spectrometry allowed for a thorough analysis of the glycated products and demonstrated that the reaction of DHAP with GTP occurred via the classical Amadori pathway.
Schoeman, Elizna M; Lopez, Genghis H; McGowan, Eunike C; Millard, Glenda M; O'Brien, Helen; Roulis, Eileen V; Liew, Yew-Wah; Martin, Jacqueline R; McGrath, Kelli A; Powley, Tanya; Flower, Robert L; Hyland, Catherine A
2017-04-01
Blood group single nucleotide polymorphism genotyping probes for a limited range of polymorphisms. This study investigated whether massively parallel sequencing (also known as next-generation sequencing), with a targeted exome strategy, provides an extended blood group genotype and the extent to which massively parallel sequencing correctly genotypes in homologous gene systems, such as RH and MNS. Donor samples (n = 28) that were extensively phenotyped and genotyped using single nucleotide polymorphism typing, were analyzed using the TruSight One Sequencing Panel and MiSeq platform. Genes for 28 protein-based blood group systems, GATA1, and KLF1 were analyzed. Copy number variation analysis was used to characterize complex structural variants in the GYPC and RH systems. The average sequencing depth per target region was 66.2 ± 39.8. Each sample harbored on average 43 ± 9 variants, of which 10 ± 3 were used for genotyping. For the 28 samples, massively parallel sequencing variant sequences correctly matched expected sequences based on single nucleotide polymorphism genotyping data. Copy number variation analysis defined the Rh C/c alleles and complex RHD hybrids. Hybrid RHD*D-CE-D variants were correctly identified, but copy number variation analysis did not confidently distinguish between D and CE exon deletion versus rearrangement. The targeted exome sequencing strategy employed extended the range of blood group genotypes detected compared with single nucleotide polymorphism typing. This single-test format included detection of complex MNS hybrid cases and, with copy number variation analysis, defined RH hybrid genes along with the RHCE*C allele hitherto difficult to resolve by variant detection. The approach is economical compared with whole-genome sequencing and is suitable for a red blood cell reference laboratory setting. © 2017 AABB.
Molecular population genetics of inversion breakpoint regions in Drosophila pseudoobscura.
Wallace, Andre G; Detweiler, Don; Schaeffer, Stephen W
2013-07-08
Paracentric inversions in populations can have a profound effect on the pattern and organization of nucleotide variability along a chromosome. Regions near inversion breakpoints are expected to have greater levels of differentiation because of reduced genetic exchange between different gene arrangements whereas central regions in the inverted segments are predicted to have lower levels of nucleotide differentiation due to greater levels of genetic flux among different karyotypes. We used the inversion polymorphism on the third chromosome of Drosophila pseudoobscura to test these predictions with an analysis of nucleotide diversity of 18 genetic markers near and away from inversion breakpoints. We tested hypotheses about how the presence of different chromosomal arrangements affects the pattern and organization of nucleotide variation. Overall, markers in the distal segment of the chromosome had greater levels of nucleotide heterozygosity than markers within the proximal segment of the chromosome. In addition, our results rejected the hypothesis that the breakpoints of derived inversions will have lower levels of nucleotide variability than breakpoints of ancestral inversions, even when strains with gene conversion events were removed. High levels of linkage disequilibrium were observed within all 11 breakpoint regions as well as between the ends of most proximal and distal breakpoints. The central region of the chromosome had the greatest levels of linkage disequilibrium compared with the proximal and distal regions because this is the region that experiences the highest level of recombination suppression. These data do not fully support the idea that genetic exchange is the sole force that influences genetic variation on inverted chromosomes.
Duellman, Tyler; Warren, Christopher; Yang, Jay
2014-01-01
Microribonucleic acids (miRNAs) work with exquisite specificity and are able to distinguish a target from a non-target based on a single nucleotide mismatch in the core nucleotide domain. We questioned whether miRNA regulation of gene expression could occur in a single nucleotide polymorphism (SNP)-specific manner, manifesting as a post-transcriptional control of expression of genetic polymorphisms. In our recent study of the functional consequences of matrix metalloproteinase (MMP)-9 SNPs, we discovered that expression of a coding exon SNP in the pro-domain of the protein resulted in a profound decrease in the secreted protein. This missense SNP results in the N38S amino acid change and a loss of an N-glycosylation site. A systematic study demonstrated that the loss of secreted protein was due not to the loss of an N-glycosylation site, but rather an SNP-specific targeting by miR-671-3p and miR-657. Bioinformatics analysis identified 41 SNP-specific miRNA targeting MMP-9 SNPs, mostly in the coding exon and an extension of the analysis to chromosome 20, where the MMP-9 gene is located, suggesting that SNP-specific miRNAs targeting the coding exon are prevalent. This selective post-transcriptional regulation of a target messenger RNA harboring genetic polymorphisms by miRNAs offers an SNP-dependent post-transcriptional regulatory mechanism, allowing for polymorphic-specific differential gene regulation. PMID:24627221
Kachhap, Sangita; Singh, Balvinder
2015-01-01
In most of homeodomain-DNA complexes, glutamine or lysine is present at 50th position and interacts with 5th and 6th nucleotide of core recognition region. Molecular dynamics simulations of Msx-1-DNA complex (Q50-TG) and its variant complexes, that is specific (Q50K-CC), nonspecific (Q50-CC) having mutation in DNA and (Q50K-TG) in protein, have been carried out. Analysis of protein-DNA interactions and structure of DNA in specific and nonspecific complexes show that amino acid residues use sequence-dependent shape of DNA to interact. The binding free energies of all four complexes were analysed to define role of amino acid residue at 50th position in terms of binding strength considering the variation in DNA on stability of protein-DNA complexes. The order of stability of protein-DNA complexes shows that specific complexes are more stable than nonspecific ones. Decomposition analysis shows that N-terminal amino acid residues have been found to contribute maximally in binding free energy of protein-DNA complexes. Among specific protein-DNA complexes, K50 contributes more as compared to Q50 towards binding free energy in respective complexes. The sequence dependence of local conformation of DNA enables Q50/Q50K to make hydrogen bond with nucleotide(s) of DNA. The changes in amino acid sequence of protein are accommodated and stabilized around TAAT core region of DNA having variation in nucleotides.
Salem, Nida’ M.; Miller, W. Allen; Rowhani, Adib; Golino, Deborah A.; Moyne, Anne-Laure; Falk, Bryce W.
2015-01-01
We determined the complete nucleotide sequence of the Rose spring dwarf-associated virus (RSDaV) genomic RNA (GenBank accession no. EU024678) and compared its predicted RNA structural characteristics affecting gene expression. A cDNA library was derived from RSDaV double-stranded RNAs (dsRNAs) purified from infected tissue. Nucleotide sequence analysis of the cloned cDNAs, plus for clones generated by 5′- and 3′-RACE showed the RSDaV genomic RNA to be 5,808 nucleotides. The genomic RNA contains five major open reading frames (ORFs), and three small ORFs in the 3′-terminal 800 nucleotides, typical for viruses of genus Luteovirus in the family Luteoviridae. Northern blot hybridization analysis revealed the genomic RNA and two prominent subgenomic RNAs of approximately 3 kb and 1 kb. Putative 5′ ends of the sgRNAs were predicted by identification of conserved sequences and secondary structures which resembled the Barley yellow dwarf virus (BYDV) genomic RNA 5′ end and subgenomic RNA promoter sequences. Secondary structures of the BYDV-like ribosomal frameshift elements and cap-independent translation elements, including long-distance base pairing spanning four kb were identified. These contain similarities but also informative differences with the BYDV structures, including a strikingly different structure predicted for the 3′ cap-independent translation element. These analyses of the RSDaV genomic RNA show more complexity for the RNA structural elements for members of the Luteoviridae. PMID:18329064
Salem, Nida' M; Miller, W Allen; Rowhani, Adib; Golino, Deborah A; Moyne, Anne-Laure; Falk, Bryce W
2008-06-05
We determined the complete nucleotide sequence of the Rose spring dwarf-associated virus (RSDaV) genomic RNA (GenBank accession no. EU024678) and compared its predicted RNA structural characteristics affecting gene expression. A cDNA library was derived from RSDaV double-stranded RNAs (dsRNAs) purified from infected tissue. Nucleotide sequence analysis of the cloned cDNAs, plus for clones generated by 5'- and 3'-RACE showed the RSDaV genomic RNA to be 5808 nucleotides. The genomic RNA contains five major open reading frames (ORFs), and three small ORFs in the 3'-terminal 800 nucleotides, typical for viruses of genus Luteovirus in the family Luteoviridae. Northern blot hybridization analysis revealed the genomic RNA and two prominent subgenomic RNAs of approximately 3 kb and 1 kb. Putative 5' ends of the sgRNAs were predicted by identification of conserved sequences and secondary structures which resembled the Barley yellow dwarf virus (BYDV) genomic RNA 5' end and subgenomic RNA promoter sequences. Secondary structures of the BYDV-like ribosomal frameshift elements and cap-independent translation elements, including long-distance base pairing spanning four kb were identified. These contain similarities but also informative differences with the BYDV structures, including a strikingly different structure predicted for the 3' cap-independent translation element. These analyses of the RSDaV genomic RNA show more complexity for the RNA structural elements for members of the Luteoviridae.
Getacher Feleke, Daniel; Nateghpour, Mehdi; Motevalli Haghi, Afsaneh; Hajjaran, Homa; Farivar, Leila; Mohebali, Mehdi; Raoofian, Reza
2015-01-01
Parasite lactate dehydrogenase (pLDH) is extensively employed as malaria rapid diagnostic tests (RDTs). Moreover, it is a well-known drug target candidate. However, the genetic diversity of this gene might influence performance of RDT kits and its drug target candidacy. This study aimed to determine polymorphism of pLDH gene from Iranian isolates of P. vivax and P. falciparum. Genomic DNA was extracted from whole blood of microscopically confirmed P. vivax and P. falciparum infected patients. pLDH gene of P. falciparum and P. vivax was amplified using conventional PCR from 43 symptomatic malaria patients from Sistan and Baluchistan Province, Southeast Iran from 2012 to 2013. Sequence analysis of 15 P. vivax LDH showed fourteen had 100% identity with P. vivax Sal-1 and Belem strains. Two nucleotide substitutions were detected with only one resulted in amino acid change. Analysis of P. falciparum LDH sequences showed six of the seven sequences had 100% homology with P. falciparum 3D7 and Mzr-1. Moreover, PfLDH displayed three nucleotide changes that resulted in changing only one amino acid. PvLDH and PfLDH showed 75%-76% nucleotide and 90.4%-90.76% amino acid homology. pLDH gene from Iranian P. falciparum and P. vivax isolates displayed 98.8-100% homology with 1-3 nucleotide substitutions. This indicated this gene was relatively conserved. Additional studies can be done weather this genetic variation can influence the performance of pLDH based RDTs or not.
Detecting and Analyzing Genetic Recombination Using RDP4.
Martin, Darren P; Murrell, Ben; Khoosal, Arjun; Muhire, Brejnev
2017-01-01
Recombination between nucleotide sequences is a major process influencing the evolution of most species on Earth. The evolutionary value of recombination has been widely debated and so too has its influence on evolutionary analysis methods that assume nucleotide sequences replicate without recombining. When nucleic acids recombine, the evolution of the daughter or recombinant molecule cannot be accurately described by a single phylogeny. This simple fact can seriously undermine the accuracy of any phylogenetics-based analytical approach which assumes that the evolutionary history of a set of recombining sequences can be adequately described by a single phylogenetic tree. There are presently a large number of available methods and associated computer programs for analyzing and characterizing recombination in various classes of nucleotide sequence datasets. Here we examine the use of some of these methods to derive and test recombination hypotheses using multiple sequence alignments.
Yadav, Pragya D; Vincent, Martin J; Khristova, Marina; Kale, Charuta; Nichol, Stuart T; Mishra, Akhilesh C; Mourya, Devendra T
2011-07-01
Nairobi sheep disease (NSD) virus, the prototype tick-borne virus of the genus Nairovirus, family Bunyaviridae is associated with acute hemorrhagic gastroenteritis in sheep and goats in East and Central Africa. The closely related Ganjam virus found in India is associated with febrile illness in humans and disease in livestock. The complete S, M and L segment sequences of Ganjam and NSD virus and partial sequence analysis of Ganjam viral RNA genome S, M and L segments encoding regions (396 bp, 701 bp and 425 bp) of the viral nucleocapsid (N), glycoprotein precursor (GPC) and L polymerase (L) proteins, respectively, was carried out for multiple Ganjam virus isolates obtained from 1954 to 2002 and from various regions of India. M segments of NSD and Ganjam virus encode a large ORF for the glycoprotein precursor (GPC), (1627 and 1624 amino acids in length, respectively) and their L segments encode a very large L polymerase (3991 amino acids). The complete S, M and L segments of NSD and Ganjam viruses were more closely related to one another than to other characterized nairoviruses, and no evidence of reassortment was found. However, the NSD and Ganjam virus complete M segment differed by 22.90% and 14.70%, for nucleotide and amino acid respectively, and the complete L segment nucleotide and protein differing by 9.90% and 2.70%, respectively among themselves. Ganjam and NSD virus, complete S segment differed by 9.40-10.40% and 3.2-4.10 for nucleotide and proteins while among Ganjam viruses 0.0-6.20% and 0.0-1.4%, variation was found for nucleotide and amino acids. Ganjam virus isolates differed by up to 17% and 11% at the nucleotide level for the partial S and L gene fragments, respectively, with less variation observed at the deduced amino acid level (10.5 and 2%, S and L, respectively). However, the virus partial M gene fragment (which encodes the hypervariable mucin-like domain) of these viruses differed by as much as 56% at the nucleotide level. Phylogenetic analysis of partial sequence differences suggests considerable mixing and movement of Ganjam virus strains within India, with no clear relationship between genetic lineages and virus geographic origin or year of isolation. Surprisingly, NSD virus does not represent a distinct lineage, but appears as a variant with other Ganjam virus among NSD virus group. Copyright © 2011 Elsevier B.V. All rights reserved.
Rangannan, Vetriselvi; Bansal, Manju
2009-12-01
The rapid increase in genome sequence information has necessitated the annotation of their functional elements, particularly those occurring in the non-coding regions, in the genomic context. Promoter region is the key regulatory region, which enables the gene to be transcribed or repressed, but it is difficult to determine experimentally. Hence an in silico identification of promoters is crucial in order to guide experimental work and to pin point the key region that controls the transcription initiation of a gene. In this analysis, we demonstrate that while the promoter regions are in general less stable than the flanking regions, their average free energy varies depending on the GC composition of the flanking genomic sequence. We have therefore obtained a set of free energy threshold values, for genomic DNA with varying GC content and used them as generic criteria for predicting promoter regions in several microbial genomes, using an in-house developed tool PromPredict. On applying it to predict promoter regions corresponding to the 1144 and 612 experimentally validated TSSs in E. coli (50.8% GC) and B. subtilis (43.5% GC) sensitivity of 99% and 95% and precision values of 58% and 60%, respectively, were achieved. For the limited data set of 81 TSSs available for M. tuberculosis (65.6% GC) a sensitivity of 100% and precision of 49% was obtained.
Capillary electrophoretic analysis of synthetic short-chain oligoribonucleotides.
Cellai, L; Onori, A M; Desiderio, C; Fanali, S
1998-12-01
Thirty synthetic oligoribonucleotides, 3 to 18 nucleotides (nt) long, were analyzed by capillary electrophoresis, under nondenaturing conditions, using a commercial kit. The migration time t(m) was dependent on nt length and composition, capillary length, operating temperature, and type of sieving polymer. Under fixed experimental conditions, the t(m) proved predictable by the equation: t(m) = [0.22(n-1) + 6.14A/n + 6.86G/n + 3.61 (C+U)/n] min, for n>3, where A/n, G/n, C/n, U/n is the frequency of each type of nt within the oligonucleotide (ONT). The equation accounts for the influence of charge-to-mass ratio on t(m), but not for structural effects, if present. This approximation is acceptable for short ONTs. The possibility of detecting n+1, n-1, n-2 impurities, having predicted the t(m), is of crucial importance in assessing the purity of synthetic ONTs dedicated to structural studies. This appears to be feasible. High resolution was shown among homologous series of ONTs of increasing length, and in some cases, even within groups of ONTs of the same length but different composition. The addition of 7 M urea to the buffer, as denaturing agent, accelerates the t(m) and significantly lowers the resolution for the shortest ONTs. It was also possible to monitor the state of association of mixtures of RNA and DNA sequence-complementary strands.
Balintová, Jana; Plucnara, Medard; Vidláková, Pavlína; Pohl, Radek; Havran, Luděk; Fojta, Miroslav; Hocek, Michal
2013-09-16
Benzofurazane has been attached to nucleosides and dNTPs, either directly or through an acetylene linker, as a new redox label for electrochemical analysis of nucleotide sequences. Primer extension incorporation of the benzofurazane-modified dNTPs by polymerases has been developed for the construction of labeled oligonucleotide probes. In combination with nitrophenyl and aminophenyl labels, we have successfully developed a three-potential coding of DNA bases and have explored the relevant electrochemical potentials. The combination of benzofurazane and nitrophenyl reducible labels has proved to be excellent for ratiometric analysis of nucleotide sequences and is suitable for bioanalytical applications. Copyright © 2013 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Norder, Heléne; Bergström, Åsa; Uhnoo, Ingrid; Aldén, Jöran; Weiss, Lars; Czajkowski, Jan; Magnius, Lars
1998-01-01
Four hepatitis C virus transmission chains at three dialysis units were disclosed by limited sequencing; three of these were disclosed by analysis of the NS5-B region of the genome. Dialysis on the same shift as that during which infected patients were dialyzed was the common factor for seven patients in two chains. Two nurses exposed to needle sticks and their sources of infection constituted two other chains. The strains of three chains belonged to subtype 1a and formed clusters with an intrachain variability of 0 to 6 nucleotides compared to 8 to 37 nucleotides for unrelated strains within this subtype. The clusters were supported by bootstrap values ranging from 89 to 100%. PMID:9738071
Taira, Chiaki; Matsuda, Kazuyuki; Yamaguchi, Akemi; Sueki, Akane; Koeda, Hiroshi; Takagi, Fumio; Kobayashi, Yukihiro; Sugano, Mitsutoshi; Honda, Takayuki
2013-09-23
Single nucleotide alterations such as single nucleotide polymorphisms (SNP) and single nucleotide mutations are associated with responses to drugs and predisposition to several diseases, and they contribute to the pathogenesis of malignancies. We developed a rapid genotyping assay based on the allele-specific polymerase chain reaction (AS-PCR) with our droplet-PCR machine (droplet-AS-PCR). Using 8 SNP loci, we evaluated the specificity and sensitivity of droplet-AS-PCR. Buccal cells were pretreated with proteinase K and subjected directly to the droplet-AS-PCR without DNA extraction. The genotypes determined using the droplet-AS-PCR were then compared with those obtained by direct sequencing. Specific PCR amplifications for the 8 SNP loci were detected, and the detection limit of the droplet-AS-PCR was found to be 0.1-5.0% by dilution experiments. Droplet-AS-PCR provided specific amplification when using buccal cells, and all the genotypes determined within 9 min were consistent with those obtained by direct sequencing. Our novel droplet-AS-PCR assay enabled high-speed amplification retaining specificity and sensitivity and provided ultra-rapid genotyping. Crude samples such as buccal cells were available for the droplet-AS-PCR assay, resulting in the reduction of the total analysis time. Droplet-AS-PCR may therefore be useful for genotyping or the detection of single nucleotide alterations. Copyright © 2013 Elsevier B.V. All rights reserved.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Sakamoto, C.; Matozaki, T.; Nagao, M.
1987-09-01
Guanine nucleotides and pertussis toxin were used to investigate whether somatostatin receptors interact with the guanine nucleotide inhibitory protein (NI) on pancreatic acinar membranes in the rat. Guanine nucleotides reduced /sup 125/I-(Tyr/sup 1/)somatostatin binding to acinar membranes up to 80%, with rank order of potency being 5'-guanylyl imidodiphosphate (Gpp(NH)p)>GTP>TDP>GMP. Scatchard analysis revealed that the decrease in somatostatin binding caused by Gpp(NH)p was due to the decrease in the maximum binding capacity without a significant change in the binding affinity. The inhibitory effect of Gpp(NH)p was partially abolished in the absence of Mg/sup 2 +/. When pancreatic acini were treated withmore » 1 ..mu..g/ml pertussis toxin for 4 h, subsequent /sup 125/I-(Tyr/sup 1/)somatostatin binding to acinar membranes was reduced. Pertussis toxin treatment also abolished the inhibitory effect of somatostatin on vasoactive intestinal peptide-stimulated increase in cellular content of adenosine 3',5'-cyclic monophosphate (cAMP) in the acini. The present results suggest that 1) somatostatin probably functions in the pancreas to regulate adenylate cyclase enzyme system via Ni, 2) the extent of modification of Ni is correlated with the ability of somatostatin to inhibit cAMP accumulation in acini, and 3) guanine nucleotides also inhibit somatostatin binding to its receptor.« less
Mitochondrial control-region sequence variation in aboriginal Australians.
van Holst Pellekaan, S; Frommer, M; Sved, J; Boettcher, B
1998-01-01
The mitochondrial D-loop hypervariable segment 1 (mt HVS1) between nucleotides 15997 and 16377 has been examined in aboriginal Australian people from the Darling River region of New South Wales (riverine) and from Yuendumu in central Australia (desert). Forty-seven unique HVS1 types were identified, varying at 49 nucleotide positions. Pairwise analysis by calculation of BEPPI (between population proportion index) reveals statistically significant structure in the populations, although some identical HVS1 types are seen in the two contrasting regions. mt HVS1 types may reflect more-ancient distributions than do linguistic diversity and other culturally distinguishing attributes. Comparison with sequences from five published global studies reveals that these Australians demonstrate greatest divergence from some Africans, least from Papua New Guinea highlanders, and only slightly more from some Pacific groups (Indonesian, Asian, Samoan, and coastal Papua New Guinea), although the HVS1 types vary at different nucleotide sites. Construction of a median network, displaying three main groups, suggests that several hypervariable nucleotide sites within the HVS1 are likely to have undergone mutation independently, making phylogenetic comparison with global samples by conventional methods difficult. Specific nucleotide-site variants are major separators in median networks constructed from Australian HVS1 types alone and for one global selection. The distribution of these, requiring extended study, suggests that they may be signatures of different groups of prehistoric colonizers into Australia, for which the time of colonization remains elusive. PMID:9463317
Evidence of codon usage in the nearest neighbor spacing distribution of bases in bacterial genomes
NASA Astrophysics Data System (ADS)
Higareda, M. F.; Geiger, O.; Mendoza, L.; Méndez-Sánchez, R. A.
2012-02-01
Statistical analysis of whole genomic sequences usually assumes a homogeneous nucleotide density throughout the genome, an assumption that has been proved incorrect for several organisms since the nucleotide density is only locally homogeneous. To avoid giving a single numerical value to this variable property, we propose the use of spectral statistics, which characterizes the density of nucleotides as a function of its position in the genome. We show that the cumulative density of bases in bacterial genomes can be separated into an average (or secular) plus a fluctuating part. Bacterial genomes can be divided into two groups according to the qualitative description of their secular part: linear and piecewise linear. These two groups of genomes show different properties when their nucleotide spacing distribution is studied. In order to analyze genomes having a variable nucleotide density, statistically, the use of unfolding is necessary, i.e., to get a separation between the secular part and the fluctuations. The unfolding allows an adequate comparison with the statistical properties of other genomes. With this methodology, four genomes were analyzed Burkholderia, Bacillus, Clostridium and Corynebacterium. Interestingly, the nearest neighbor spacing distributions or detrended distance distributions are very similar for species within the same genus but they are very different for species from different genera. This difference can be attributed to the difference in the codon usage.
Naidu, Hariprasad; Subramanian, B Mohana; Chinchkar, Shankar Ramchandra; Sriraman, Rajan; Rana, Samir Kumar; Srinivasan, V A
2012-05-01
The antigenic types of canine parvovirus (CPV) are defined based on differences in the amino acids of the major capsid protein VP2. Type specificity is conferred by a limited number of amino acid changes and in particular by few nucleotide substitutions. PCR based methods are not particularly suitable for typing circulating variants which differ in a few specific nucleotide substitutions. Assays for determining SNPs can detect efficiently nucleotide substitutions and can thus be adapted to identify CPV types. In the present study, CPV typing was performed by single nucleotide extension using the mini-sequencing technique. A mini-sequencing signature was established for all the four CPV types (CPV2, 2a, 2b and 2c) and feline panleukopenia virus. The CPV typing using the mini-sequencing reaction was performed for 13 CPV field isolates and the two vaccine strains available in our repository. All the isolates had been typed earlier by full-length sequencing of the VP2 gene. The typing results obtained from mini-sequencing matched completely with that of sequencing. Typing could be achieved with less than 100 copies of standard plasmid DNA constructs or ≤10¹ FAID₅₀ of virus by mini-sequencing technique. The technique was also efficient for detecting multiple types in mixed infections. Copyright © 2012 Elsevier B.V. All rights reserved.
Mechanism of the Exchange Reaction in HRAS from Multiscale Modeling
Kapoor, Abhijeet; Travesset, Alex
2014-01-01
HRAS regulates cell growth promoting signaling processes by cycling between active (GTP-bound) and inactive (GDP-bound) states. Understanding the transition mechanism is central for the design of small molecules to inhibit the formation of RAS-driven tumors. Using a multiscale approach involving coarse-grained (CG) simulations, all-atom classical molecular dynamics (CMD; total of 3.02 µs), and steered molecular dynamics (SMD) in combination with Principal Component Analysis (PCA), we identified the structural features that determine the nucleotide (GDP) exchange reaction. We show that weakening the coupling between the SwitchI (residues 25–40) and SwitchII (residues 59–75) accelerates the opening of SwitchI; however, an open conformation of SwitchI is unstable in the absence of guanine nucleotide exchange factors (GEFs) and rises up towards the bound nucleotide to close the nucleotide pocket. Both I21 and Y32, play a crucial role in SwitchI transition. We show that an open SwitchI conformation is not necessary for GDP destabilization but is required for GDP/Mg escape from the HRAS. Further, we present the first simulation study showing displacement of GDP/Mg away from the nucleotide pocket. Both SwitchI and SwitchII, delays the escape of displaced GDP/Mg in the absence of GEF. Based on these results, a model for the mechanism of GEF in accelerating the exchange process is hypothesized. PMID:25272152
Khan, A S
1984-01-01
The sequence of 363 nucleotides near the 3' end of the pol gene and 564 nucleotides from the 5' terminus of the env gene in an endogenous murine leukemia viral (MuLV) DNA segment, cloned from AKR/J mouse DNA and designated as A-12, was obtained. For comparison, the nucleotide sequence in an analogous portion of AKR mink cell focus-forming (MCF) 247 MuLV provirus was also determined. Sequence features unique to MCF247 MuLV DNA in the 3' pol and 5' env regions were identified by comparison with nucleotide sequences in analogous regions of NFS -Th-1 xenotropic and AKR ecotropic MuLV proviruses. These included (i) an insertion of 12 base pairs encoding four amino acids located 60 base pairs from the 3' terminus of the pol gene and immediately preceding the env gene, (ii) the deletion of 12 base pairs (encoding four amino acids) and the insertion of 3 base pairs (encoding one amino acid) in the 5' portion of the env gene, and (iii) single base substitutions resulting in 2 MCF247 -specific amino acids in the 3' pol and 23 in the 5' env regions. Nucleotide sequence comparison involving the 3' pol and 5' env regions of AKR MCF247 , NFS xenotropic, and AKR ecotropic MuLV proviruses with the cloned endogenous MuLV DNA indicated that MCF247 proviral DNA sequences were conserved in the cloned endogenous MuLV proviral segment. In fact, total nucleotide sequence identity existed between the endogenous MuLV DNA and the MCF247 MuLV provirus in the 3' portion of the pol gene. In the 5' env region, only 4 of 564 nucleotides were different, resulting in three amino acid changes between AKR MCF247 MuLV DNA and the endogenous MuLV DNA present in clone A-12. In addition, nucleotide sequence comparison indicated that Moloney-and Friend-MCF MuLVs were also highly related in the 3' pol and 5' env regions to the cloned endogenous MuLV DNA. These results establish the role of endogenous MuLV DNA segments in generation of recombinant MCF viruses. PMID:6328017
2010-01-01
Background Infectious hematopoietic necrosis virus (IHNV) is the type species of the genus Novirhabdovirus, within the family Rhabdoviridae, infecting several species of wild and hatchery reared salmonids. Similar to other rhabdoviruses, IHNV has a linear single-stranded, negative-sense RNA genome of approximately 11,000 nucleotides. The IHNV genome encodes six genes; the nucleocapsid, phosphoprotein, matrix protein, glycoprotein, non-virion protein and polymerase protein genes, respectively. This study describes molecular characterization of the virulent IHNV strain 220-90, belonging to the M genogroup, and its phylogenetic relationships with available sequences of IHNV isolates worldwide. Results The complete genomic sequence of IHNV strain 220-90 was determined from the DNA of six overlapping clones obtained by RT-PCR amplification of genomic RNA. The complete genome sequence of 220-90 comprises 11,133 nucleotides (GenBank GQ413939) with the gene order of 3'-N-P-M-G-NV-L-5'. These genes are separated by conserved gene junctions, with di-nucleotide gene spacers. An additional uracil nucleotide was found at the end of the 5'-trailer region, which was not reported before in other IHNV strains. The first 15 of the 16 nucleotides at the 3'- and 5'-termini of the genome are complementary, and the first 4 nucleotides at 3'-ends of the IHNV are identical to other novirhadoviruses. Sequence homology and phylogenetic analysis of the glycoprotein genes show that 220-90 strain is 97% identical to most of the IHNV strains. Comparison of the virulent 220-90 genomic sequences with less virulent WRAC isolate shows more than 300 nucleotides changes in the genome, which doesn't allow one to speculate putative residues involved in the virulence of IHNV. Conclusion We have molecularly characterized one of the well studied IHNV isolates, 220-90 of genogroup M, which is virulent for rainbow trout, and compared phylogenetic relationship with North American and other strains. Determination of the complete nucleotide sequence is essential for future studies on pathogenesis of IHNV using a reverse genetics approach and developing efficient control strategies. PMID:20085652
Krawitz, Peter M; Schiska, Daniela; Krüger, Ulrike; Appelt, Sandra; Heinrich, Verena; Parkhomchuk, Dmitri; Timmermann, Bernd; Millan, Jose M; Robinson, Peter N; Mundlos, Stefan; Hecht, Jochen; Gross, Manfred
2014-01-01
Usher syndrome is an autosomal recessive disorder characterized both by deafness and blindness. For the three clinical subtypes of Usher syndrome causal mutations in altogether 12 genes and a modifier gene have been identified. Due to the genetic heterogeneity of Usher syndrome, the molecular analysis is predestined for a comprehensive and parallelized analysis of all known genes by next-generation sequencing (NGS) approaches. We describe here the targeted enrichment and deep sequencing for exons of Usher genes and compare the costs and workload of this approach compared to Sanger sequencing. We also present a bioinformatics analysis pipeline that allows us to detect single-nucleotide variants, short insertions and deletions, as well as copy number variations of one or more exons on the same sequence data. Additionally, we present a flexible in silico gene panel for the analysis of sequence variants, in which newly identified genes can easily be included. We applied this approach to a cohort of 44 Usher patients and detected biallelic pathogenic mutations in 35 individuals and monoallelic mutations in eight individuals of our cohort. Thirty-nine of the sequence variants, including two heterozygous deletions comprising several exons of USH2A, have not been reported so far. Our NGS-based approach allowed us to assess single-nucleotide variants, small indels, and whole exon deletions in a single test. The described diagnostic approach is fast and cost-effective with a high molecular diagnostic yield. PMID:25333064
Krawitz, Peter M; Schiska, Daniela; Krüger, Ulrike; Appelt, Sandra; Heinrich, Verena; Parkhomchuk, Dmitri; Timmermann, Bernd; Millan, Jose M; Robinson, Peter N; Mundlos, Stefan; Hecht, Jochen; Gross, Manfred
2014-09-01
Usher syndrome is an autosomal recessive disorder characterized both by deafness and blindness. For the three clinical subtypes of Usher syndrome causal mutations in altogether 12 genes and a modifier gene have been identified. Due to the genetic heterogeneity of Usher syndrome, the molecular analysis is predestined for a comprehensive and parallelized analysis of all known genes by next-generation sequencing (NGS) approaches. We describe here the targeted enrichment and deep sequencing for exons of Usher genes and compare the costs and workload of this approach compared to Sanger sequencing. We also present a bioinformatics analysis pipeline that allows us to detect single-nucleotide variants, short insertions and deletions, as well as copy number variations of one or more exons on the same sequence data. Additionally, we present a flexible in silico gene panel for the analysis of sequence variants, in which newly identified genes can easily be included. We applied this approach to a cohort of 44 Usher patients and detected biallelic pathogenic mutations in 35 individuals and monoallelic mutations in eight individuals of our cohort. Thirty-nine of the sequence variants, including two heterozygous deletions comprising several exons of USH2A, have not been reported so far. Our NGS-based approach allowed us to assess single-nucleotide variants, small indels, and whole exon deletions in a single test. The described diagnostic approach is fast and cost-effective with a high molecular diagnostic yield.
Figueiredo, Joana; Simões, Maria José; Gomes, Paula; Barroso, Cristina; Pinho, Diogo; Conceição, Luci; Fonseca, Luís; Abrantes, Isabel; Pinheiro, Miguel; Egas, Conceição
2013-01-01
The pinewood nematode, Bursaphelenchus xylophilus, is native to North America but it only causes damaging pine wilt disease in those regions of the world where it has been introduced. The accurate detection of the species and its dispersal routes are thus essential to define effective control measures. The main goals of this study were to analyse the genetic diversity among B. xylophilus isolates from different geographic locations and identify single nucleotide polymorphism (SNPs) markers for geographic origin, through a comparative transcriptomic approach. The transcriptomes of seven B. xylophilus isolates, from Continental Portugal (4), China (1), Japan (1) and USA (1), were sequenced in the next generation platform Roche 454. Analysis of effector gene transcripts revealed inter-isolate nucleotide diversity that was validated by Sanger sequencing in the genomic DNA of the seven isolates and eight additional isolates from different geographic locations: Madeira Island (2), China (1), USA (1), Japan (2) and South Korea (2). The analysis identified 136 polymorphic positions in 10 effector transcripts. Pairwise comparison of the 136 SNPs through Neighbor-Joining and the Maximum Likelihood methods and 5-mer frequency analysis with the alignment-independent bilinear multivariate modelling approach correlated the SNPs with the isolates geographic origin. Furthermore, the SNP analysis indicated a closer proximity of the Portuguese isolates to the Korean and Chinese isolates than to the Japanese or American isolates. Each geographic cluster carried exclusive alleles that can be used as SNP markers for B. xylophilus isolate identification. PMID:24391785
Wu, Shuang; Nakamoto, Shingo; Kanda, Tatsuo; Jiang, Xia; Nakamura, Masato; Miyamura, Tatsuo; Shirasawa, Hiroshi; Sugiura, Nobuyuki; Takahashi-Nakaguchi, Azusa; Gonoi, Tohru; Yokosuka, Osamu
2014-01-01
Hepatitis A virus (HAV) is a causative agent of acute viral hepatitis for which an effective vaccine has been developed. Here we describe ultra-deep pyrosequences (UDPSs) of HAV 5'-untranslated region (5'UTR) among cases of the same outbreak, which arose from a single source, associated with a revolving sushi bar. We determined the reference sequence from HAV-derived clone from an attendant by the Sanger method. Sixteen UDPSs from this outbreak and one from another sporadic case were compared with this reference. Nucleotide errors yielded a UDPS error rate of < 1%. This study confirmed that nucleotide substitutions of this region are transition mutations in outbreak cases, that insertion was observed only in non-severe cases, and that these nucleotide substitutions were different from those of the sporadic case. Analysis of UDPSs detected low-prevalence HAV variations in 5'UTR, but no specific mutations associated with severity in these outbreak cases. To our surprise, HAV strains in this outbreak conserved HAV IRES sequence even if we performed analysis of UDPSs. UDPS analysis of HAV 5'UTR gave us no association between the disease severity of hepatitis A and HAV 5'UTR substitutions. It might be more interesting to perform ultra-deep sequencing of full length HAV genome in order to reveal possible unknown genomic determinants associated with disease severity. Further studies will be needed. PMID:24396287
Chen, Hao; Dou, Yanguo; Tang, Yi; Zhang, Zhenjie; Zheng, Xiaoqiang; Niu, Xiaoyu; Yang, Jing; Yu, Xianglong; Diao, Youxiang
2015-01-01
A newly emerged duck parvovirus, which causes beak atrophy and dwarfism syndrome (BADS) in Cherry Valley ducks, has appeared in Northern China since March 2015. To explore the genetic diversity among waterfowl parvovirus isolates, the complete genome of an identified isolate designated SDLC01 was sequenced and analyzed in the present study. Genomic sequence analysis showed that SDLC01 shared 90.8%-94.6% of nucleotide identity with goose parvovirus (GPV) isolates and 78.6%-81.6% of nucleotide identity with classical Muscovy duck parvovirus (MDPV) isolates. Phylogenetic analysis of 443 nucleotides (nt) of the fragment A showed that SDLC01 was highly similar to a mule duck isolate (strain D146/02) and close to European GPV isolates but separate from Asian GPV isolates. Analysis of the left inverted terminal repeat regions revealed that SDLC01 had two major segments deleted between positions 160-176 and 306-322 nt compared with field GPV and MDPV isolates. Phylogenetic analysis of Rep and VP1 encoded by two major open reading frames of parvoviruses revealed that SDLC01 was distinct from all GPV and MDPV isolates. The viral pathogenicity and genome characterization of SDLC01 suggest that the novel GPV (N-GPV) is the causative agent of BADS and belongs to a distinct GPV-related subgroup. Furthermore, N-GPV sequences were detected in diseased ducks by polymerase chain reaction and viral proliferation was demonstrated in duck embryos and duck embryo fibroblast cells.
Nucleotide Sequence Analysis of RNA Synthesized from Rabbit Globin Complementary DNA
Poon, Raymond; Paddock, Gary V.; Heindell, Howard; Whitcome, Philip; Salser, Winston; Kacian, Dan; Bank, Arthur; Gambino, Roberto; Ramirez, Francesco
1974-01-01
Rabbit globin complementary DNA made with RNA-dependent DNA polymerase (reverse transcriptase) was used as template for in vitro synthesis of 32P-labeled RNA. The sequences of the nucleotides in most of the fragments resulting from combined ribonuclease T1 and alkaline phosphatase digestion have been determined. Several fragments were long enough to fit uniquely with the α or β globin amino-acid sequences. These data demonstrate that the cDNA was copied from globin mRNA and contained no detectable contaminants. Images PMID:4139714
Methods and kits for nucleic acid analysis using fluorescence resonance energy transfer
Kwok, Pui-Yan; Chen, Xiangning
1999-01-01
A method for detecting the presence of a target nucleotide or sequence of nucleotides in a nucleic acid is disclosed. The method is comprised of forming an oligonucleotide labeled with two fluorophores on the nucleic acid target site. The doubly labeled oligonucleotide is formed by addition of a singly labeled dideoxynucleoside triphosphate to a singly labeled polynucleotide or by ligation of two singly labeled polynucleotides. Detection of fluorescence resonance energy transfer upon denaturation indicates the presence of the target. Kits are also provided. The method is particularly applicable to genotyping.
Van Kreijl, C F; Bos, J L
1977-01-01
The repeating nucleotide sequence of 68 base pairs in the mtDNA from an ethidium-induced cytoplasmic petite mutant of yeast has been determined. For sequence analysis specifically primed and terminated RNA copies, obtained by in vitro transcription of the separated strands, were use. The sequence consists of 66 consecutive AT base pairs flanked by two GC pairs and comprises nearly all of the mutant mitochondrial genome. The sequence, moreover, also represents the first part of wild-type mtDNA sequence so far. Images PMID:198740
Fungal Taxa Target Different Carbon Substrates in Harvard Forest Soils
NASA Astrophysics Data System (ADS)
Hanson, C. A.; Allison, S. D.; Wallenstein, M. D.; Mellilo, J. M.; Treseder, K. K.
2006-12-01
The mineralization of soil organic carbon is a major component of the global carbon cycle and is largely controlled by soil microbial communities. However, little is known about the functional roles of soil microbes or whether different microbial taxa target different carbon substrates under natural conditions. To examine this possibility, we assessed the community composition of active fungi by using a novel nucleotide analog technique in soils from the Harvard Forest. We hypothesized that fungal community composition would shift in response to the addition of different substrates and that specific fungal taxa would respond differentially to particular carbon sources. To test this hypothesis, we added a nucleotide analog probe directly to soils in conjunction with one of five carbon compounds of increasing recalcitrance: glycine, sucrose, cellulose, tannin-protein complex, and lignin. During 48 hour incubations, the nucleotide analog was incorporated into newly replicated DNA of soil organisms that proliferated following the addition of the substrates. In this way, we labeled the DNA of microbes that respond to a particular carbon source. Labeled DNA was isolated and fungal Internal Transcribed Spacer (ITS) regions of ribosomal DNA (rDNA) were sequenced and analyzed to identify active fungi to near-species resolution. Diversity analyses at the ≥97% sequence similarity level indicated that taxonomic richness was greater under cellulose (Shannon Index: 3.23 ± 0.11 with ± 95% CI) and lignin (2.87 ± 0.15) additions than the other treatments (2.34 ± 0.16 to 2.64 ± 0.13). In addition, community composition of active fungi shifted under glycine, sucrose, and cellulose additions. Specifically, the community under glycine was significantly different from communities under control, cellulose, and tannin-protein (P<0.05). Additionally, the sucrose and cellulose communities were marginally different from the control community (P = 0.059 and 0.054, respectively) and each other (P = 0.058). Together these results support our hypothesis that fungal communities change in response to different carbon sources. We found 11 fungal operational taxonomic units (OTUs) whose relative abundances differed at least marginally significantly among substrates. One OTU related to Mortierella increased in abundance under cellulose, but was absent or rare under the other substrates. Another OTU related to an unidentified Basidiomycete was only present under lignin addition, while yet another OTU closely related to Mortierella macrocystis greatly increased in abundance under tannin-protein and slightly increased in response to lignin and sucrose. This confirms our hypothesis that particular taxa respond differently to specific carbon substrates and suggests that some fungal taxa may specialize in the break-down of particular carbon sources in soils. Overall, our results imply that microbes have varying roles in the mineralization of soil carbon, and thus microbial community composition may be an important control over ecosystem carbon dynamics and storage, especially in relation to global change.
El-Sabrout, Karim; Aggag, Sarah A.
2017-01-01
Aim: In this study, we examined parts of six growth genes (growth hormone [GH], melanocortin 4 receptor [MC4R], growth hormone receptor [GHR], phosphorglycerate mutase [PGAM], myostatin [MSTN], and fibroblast growth factor [FGF]) as specific primers for two rabbit lines (V-line, Alexandria) using nucleotide sequence analysis, to investigate association between detecting single nucleotide polymorphism (SNP) of these genes and body weight (BW) at market. Materials and Methods: Each line kits were grouped into high and low weight rabbits to identify DNA markers useful for association studies with high BW. DNA from blood samples of each group was extracted to amplify the six growth genes. SNP technique was used to study the associate polymorphism in the six growth genes and marketing BW (at 63 days) in the two rabbit lines. The purified polymerase chain reaction products were sequenced in those had the highest and lowest BW in each line. Results: Alignment of sequence data from each group revealed the following SNPs: At nucleotide 23 (A-C) and nucleotide 35 (T-G) in MC4R gene (sense mutation) of Alexandria and V-line high BW. Furthermore, we detected the following SNPs variation between the two lines: A SNP (T-C) at nucleotide 27 was identified by MC4R gene (sense mutation) and another one (A-C) at nucleotide 14 was identified by GHR gene (nonsense mutation) of Alexandria line. The results of individual BW at market (63 days) indicated that Alexandria rabbits had significantly higher BW compared with V-line rabbits. MC4R polymorphism showed significant association with high BW in rabbits. Conclusion: The results of polymorphism demonstrate the possibility to detect an association between BW in rabbits and the efficiency of the used primers to predict through the genetic specificity using the SNP of MC4R. PMID:28246458
Proks, Peter; de Wet, Heidi; Ashcroft, Frances M
2014-11-01
Sulfonylureas, which stimulate insulin secretion from pancreatic β-cells, are widely used to treat both type 2 diabetes and neonatal diabetes. These drugs mediate their effects by binding to the sulfonylurea receptor subunit (SUR) of the ATP-sensitive K(+) (KATP) channel and inducing channel closure. The mechanism of channel inhibition is unusually complex. First, sulfonylureas act as partial antagonists of channel activity, and second, their effect is modulated by MgADP. We analyzed the molecular basis of the interactions between the sulfonylurea gliclazide and Mg-nucleotides on β-cell and cardiac types of KATP channel (Kir6.2/SUR1 and Kir6.2/SUR2A, respectively) heterologously expressed in Xenopus laevis oocytes. The SUR2A-Y1206S mutation was used to confer gliclazide sensitivity on SUR2A. We found that both MgATP and MgADP increased gliclazide inhibition of Kir6.2/SUR1 channels and reduced inhibition of Kir6.2/SUR2A-Y1206S. The latter effect can be attributed to stabilization of the cardiac channel open state by Mg-nucleotides. Using a Kir6.2 mutation that renders the KATP channel insensitive to nucleotide inhibition (Kir6.2-G334D), we showed that gliclazide abolishes the stimulatory effects of MgADP and MgATP on β-cell KATP channels. Detailed analysis suggests that the drug both reduces nucleotide binding to SUR1 and impairs the efficacy with which nucleotide binding is translated into pore opening. Mutation of one (or both) of the Walker A lysines in the catalytic site of the nucleotide-binding domains of SUR1 may have a similar effect to gliclazide on MgADP binding and transduction, but it does not appear to impair MgATP binding. Our results have implications for the therapeutic use of sulfonylureas. © 2014 Proks et al.
An extended sequence specificity for UV-induced DNA damage.
Chung, Long H; Murray, Vincent
2018-01-01
The sequence specificity of UV-induced DNA damage was determined with a higher precision and accuracy than previously reported. UV light induces two major damage adducts: cyclobutane pyrimidine dimers (CPDs) and pyrimidine(6-4)pyrimidone photoproducts (6-4PPs). Employing capillary electrophoresis with laser-induced fluorescence and taking advantages of the distinct properties of the CPDs and 6-4PPs, we studied the sequence specificity of UV-induced DNA damage in a purified DNA sequence using two approaches: end-labelling and a polymerase stop/linear amplification assay. A mitochondrial DNA sequence that contained a random nucleotide composition was employed as the target DNA sequence. With previous methodology, the UV sequence specificity was determined at a dinucleotide or trinucleotide level; however, in this paper, we have extended the UV sequence specificity to a hexanucleotide level. With the end-labelling technique (for 6-4PPs), the consensus sequence was found to be 5'-GCTC*AC (where C* is the breakage site); while with the linear amplification procedure, it was 5'-TCTT*AC. With end-labelling, the dinucleotide frequency of occurrence was highest for 5'-TC*, 5'-TT* and 5'-CC*; whereas it was 5'-TT* for linear amplification. The influence of neighbouring nucleotides on the degree of UV-induced DNA damage was also examined. The core sequences consisted of pyrimidine nucleotides 5'-CTC* and 5'-CTT* while an A at position "1" and C at position "2" enhanced UV-induced DNA damage. Crown Copyright © 2017. Published by Elsevier B.V. All rights reserved.
NullSeq: A Tool for Generating Random Coding Sequences with Desired Amino Acid and GC Contents.
Liu, Sophia S; Hockenberry, Adam J; Lancichinetti, Andrea; Jewett, Michael C; Amaral, Luís A N
2016-11-01
The existence of over- and under-represented sequence motifs in genomes provides evidence of selective evolutionary pressures on biological mechanisms such as transcription, translation, ligand-substrate binding, and host immunity. In order to accurately identify motifs and other genome-scale patterns of interest, it is essential to be able to generate accurate null models that are appropriate for the sequences under study. While many tools have been developed to create random nucleotide sequences, protein coding sequences are subject to a unique set of constraints that complicates the process of generating appropriate null models. There are currently no tools available that allow users to create random coding sequences with specified amino acid composition and GC content for the purpose of hypothesis testing. Using the principle of maximum entropy, we developed a method that generates unbiased random sequences with pre-specified amino acid and GC content, which we have developed into a python package. Our method is the simplest way to obtain maximally unbiased random sequences that are subject to GC usage and primary amino acid sequence constraints. Furthermore, this approach can easily be expanded to create unbiased random sequences that incorporate more complicated constraints such as individual nucleotide usage or even di-nucleotide frequencies. The ability to generate correctly specified null models will allow researchers to accurately identify sequence motifs which will lead to a better understanding of biological processes as well as more effective engineering of biological systems.
Hu, Xiao-di; Gao, Li-zhi
2016-01-01
In this study, we determined the complete mitochondrial (mt) genome of eastern lowland gorilla, Gorilla beringei graueri for the first time. The total genome was 16,416 bp in length. It contained a total of 13 protein-coding genes, 22 transfer RNA genes, 2 ribosomal RNA genes and 1 control region (D-loop region). The base composition was A (30.88%), G (13.10%), C (30.89%) and T (25.13%), indicating that the percentage of A+T (56.01%) was higher than G+C (43.99%). Comparisons with the other publicly available Gorilla mitogenome showed the conservation of gene order and base compositions but a bunch of nucleotide diversity. This complete mitochondrial genome sequence will provide valuable genetic information for further studies on conservation genetics of eastern lowland gorilla.
NASA Astrophysics Data System (ADS)
Holden, Todd; Gadura, N.; Dehipawala, S.; Cheung, E.; Tuffour, M.; Schneider, P.; Tremberger, G., Jr.; Lieberman, D.; Cheung, T.
2011-10-01
Technologically important extremophiles including oil eating microbes, uranium and rocket fuel perchlorate reduction microbes, electron producing microbes and electrode electrons feeding microbes were compared in terms of their 16S rRNA sequences, a standard targeted sequence in comparative phylogeny studies. Microbes that were reported to have survived a prolonged dormant duration were also studied. Examples included the recently discovered microbe that survives after 34,000 years in a salty environment while feeding off organic compounds from other trapped dead microbes. Shannon entropy of the 16S rRNA nucleotide composition and fractal dimension of the nucleotide sequence in terms of its atomic number fluctuation analyses suggest a selected range for these extremophiles as compared to other microbes; consistent with the experience of relatively mild evolutionary pressure. However, most of the microbes that have been reported to survive in prolonged dormant duration carry sequences with fractal dimension between 1.995 and 2.005 (N = 10 out of 13). Similar results are observed for halophiles, red-shifted chlorophyll and radiation resistant microbes. The results suggest that prolonged dormant duration, in analogous to high salty or radiation environment, would select high fractal 16S rRNA sequences. Path analysis in structural equation modeling supports a causal relation between entropy and fractal dimension for the studied 16S rRNA sequences (N = 7). Candidate choices for high fractal 16S rRNA microbes could offer protection for prolonged spaceflights. BioBrick gene network manipulation could include extremophile 16S rRNA sequences in synthetic biology and shed more light on exobiology and future colonization in shielded spaceflights. Whether the high fractal 16S rRNA sequences contain an asteroidlike extra-terrestrial source could be speculative but interesting.
Hughes, Laura B; Reynolds, Richard J; Brown, Elizabeth E; Kelley, James M; Thomson, Brian; Conn, Doyt L; Jonas, Beth L; Westfall, Andrew O; Padilla, Miguel A; Callahan, Leigh F; Smith, Edwin A; Brasington, Richard D; Edberg, Jeffrey C; Kimberly, Robert P; Moreland, Larry W; Plenge, Robert M; Bridges, S Louis
2010-12-01
Large-scale genetic association studies have identified >20 rheumatoid arthritis (RA) risk alleles among individuals of European ancestry. The influence of these risk alleles has not been comprehensively studied in African Americans. We therefore sought to examine whether these validated RA risk alleles are associated with RA risk in an African American population. Twenty-seven candidate single-nucleotide polymorphisms (SNPs) were genotyped in 556 autoantibody-positive African Americans with RA and 791 healthy African American control subjects. Odds ratios (ORs) and 95% confidence intervals (95% CIs) for each SNP were compared with previously published ORs for RA patients of European ancestry. We then calculated a composite genetic risk score (GRS) for each individual based on the sum of all risk alleles. Overlap of the ORs and 95% CIs between the European and African American populations was observed for 24 of the 27 candidate SNPs. Conversely, 3 of the 27 SNPs (CCR6 rs3093023, TAGAP rs394581, and TNFAIP3 rs6920220) demonstrated ORs in the opposite direction from those reported for RA patients of European ancestry. The GRS analysis indicated a small but highly significant probability that African American patients relative to control subjects were enriched for the risk alleles validated in European RA patients (P = 0.00005). The majority of RA risk alleles previously validated for RA patients of European ancestry showed similar ORs in our population of African Americans with RA. Furthermore, the aggregate GRS supports the hypothesis that these SNPs are risk alleles for RA in the African American population. Future large-scale genetic studies are needed to validate these risk alleles and identify novel RA risk alleles in African Americans. Copyright © 2010 by the American College of Rheumatology.
Stukenbrock, Eva H.; Dutheil, Julien Y.
2018-01-01
Meiotic recombination is an important driver of evolution. Variability in the intensity of recombination across chromosomes can affect sequence composition, nucleotide variation, and rates of adaptation. In many organisms, recombination events are concentrated within short segments termed recombination hotspots. The variation in recombination rate and positions of recombination hotspot can be studied using population genomics data and statistical methods. In this study, we conducted population genomics analyses to address the evolution of recombination in two closely related fungal plant pathogens: the prominent wheat pathogen Zymoseptoria tritici and a sister species infecting wild grasses Z. ardabiliae. We specifically addressed whether recombination landscapes, including hotspot positions, are conserved in the two recently diverged species and if recombination contributes to rapid evolution of pathogenicity traits. We conducted a detailed simulation analysis to assess the performance of methods of recombination rate estimation based on patterns of linkage disequilibrium, in particular in the context of high nucleotide diversity. Our analyses reveal overall high recombination rates, a lack of suppressed recombination in centromeres, and significantly lower recombination rates on chromosomes that are known to be accessory. The comparison of the recombination landscapes of the two species reveals a strong correlation of recombination rate at the megabase scale, but little correlation at smaller scales. The recombination landscapes in both pathogen species are dominated by frequent recombination hotspots across the genome including coding regions, suggesting a strong impact of recombination on gene evolution. A significant but small fraction of these hotspots colocalize between the two species, suggesting that hotspot dynamics contribute to the overall pattern of fast evolving recombination in these species. PMID:29263029
Stukenbrock, Eva H; Dutheil, Julien Y
2018-03-01
Meiotic recombination is an important driver of evolution. Variability in the intensity of recombination across chromosomes can affect sequence composition, nucleotide variation, and rates of adaptation. In many organisms, recombination events are concentrated within short segments termed recombination hotspots. The variation in recombination rate and positions of recombination hotspot can be studied using population genomics data and statistical methods. In this study, we conducted population genomics analyses to address the evolution of recombination in two closely related fungal plant pathogens: the prominent wheat pathogen Zymoseptoria tritici and a sister species infecting wild grasses Z. ardabiliae We specifically addressed whether recombination landscapes, including hotspot positions, are conserved in the two recently diverged species and if recombination contributes to rapid evolution of pathogenicity traits. We conducted a detailed simulation analysis to assess the performance of methods of recombination rate estimation based on patterns of linkage disequilibrium, in particular in the context of high nucleotide diversity. Our analyses reveal overall high recombination rates, a lack of suppressed recombination in centromeres, and significantly lower recombination rates on chromosomes that are known to be accessory. The comparison of the recombination landscapes of the two species reveals a strong correlation of recombination rate at the megabase scale, but little correlation at smaller scales. The recombination landscapes in both pathogen species are dominated by frequent recombination hotspots across the genome including coding regions, suggesting a strong impact of recombination on gene evolution. A significant but small fraction of these hotspots colocalize between the two species, suggesting that hotspot dynamics contribute to the overall pattern of fast evolving recombination in these species. Copyright © 2018 Stukenbrock and Dutheil.
Su, Yingjuan; Wang, Ting; Zheng, Bo; Jiang, Yu; Chen, Guopei; Gu, Hongya
2004-11-01
Sequences of chloroplast DNA (cpDNA) atpB- rbcL intergenic spacers of individuals of a tree fern species, Alsophila spinulosa, collected from ten relict populations distributed in the Hainan and Guangdong provinces, and the Guangxi Zhuang region in southern China, were determined. Sequence length varied from 724 bp to 731 bp, showing length polymorphism, and base composition was with high A+T content between 63.17% and 63.95%. Sequences were neutral in terms of evolution (Tajima's criterion D=-1.01899, P>0.10 and Fu and Li's test D*=-1.39008, P>0.10; F*=-1.49775, P>0.10). A total of 19 haplotypes were identified based on nucleotide variation. High levels of haplotype diversity (h=0.744) and nucleotide diversity (Dij=0.01130) were detected in A. spinulosa, probably associated with its long evolutionary history, which has allowed the accumulation of genetic variation within lineages. Both the minimum spanning network and neighbor-joining trees generated for haplotypes demonstrated that current populations of A. spinulosa existing in Hainan, Guangdong, and Guangxi were subdivided into two geographical groups. An analysis of molecular variance indicated that most of the genetic variation (93.49%, P<0.001) was partitioned among regions. Wright's isolation by distance model was not supported across extant populations. Reduced gene flow by the Qiongzhou Strait and inbreeding may result in the geographical subdivision between the Hainan and Guangdong + Guangxi populations (FST=0.95, Nm=0.03). Within each region, the star-like pattern of phylogeography of haplotypes implied a population expansion process during evolutionary history. Gene genealogies together with coalescent theory provided significant information for uncovering phylogeography of A. spinulosa.
Mitsui, Jun; Fukuda, Yoko; Azuma, Kyo; Tozaki, Hirokazu; Ishiura, Hiroyuki; Takahashi, Yuji; Goto, Jun; Tsuji, Shoji
2010-07-01
We have recently found that multiple rare variants of the glucocerebrosidase gene (GBA) confer a robust risk for Parkinson disease, supporting the 'common disease-multiple rare variants' hypothesis. To develop an efficient method of identifying rare variants in a large number of samples, we applied multiplexed resequencing using a next-generation sequencer to identification of rare variants of GBA. Sixteen sets of pooled DNAs from six pooled DNA samples were prepared. Each set of pooled DNAs was subjected to polymerase chain reaction to amplify the target gene (GBA) covering 6.5 kb, pooled into one tube with barcode indexing, and then subjected to extensive sequence analysis using the SOLiD System. Individual samples were also subjected to direct nucleotide sequence analysis. With the optimization of data processing, we were able to extract all the variants from 96 samples with acceptable rates of false-positive single-nucleotide variants.
Rabie, M; Ratti, C; Abdel Aleem, E; Fattouh, F
Tomato yellow leaf curl virus (TYLCV) infections of tomato crops in Egypt were widely spread in 2014. Infected symptomatic tomato plants from different governorates were sampled. TYLCV strains Israel and Mild (TYLCV-IL, TYLCV-Mild) were identified by multiplex and real-time PCR. In addition, nucleotide sequence analysis of the V1 and V2 protein genes, revealed ten TYLCV Egyptian isolates (TYLCV from TY1 to 10). Phylogenetic analysis showed their high degree of relatedness with TYLCV-IL Jordan isolate (98%). Here we have showed the complete nucleotide sequence of the TYLCV Egyptian isolate TY10, sampled from El Beheira. A high degree of similarity to other previously reported Egyptian isolates and isolates from Jordan and Japan reflect the importance of phylogenetic analysis in monitoring virus genetic diversity and possibilities for divergence of more virulent strains or genotypes.
Prediction and phylogenetic analysis of mammalian short interspersed elements (SINEs).
Rogozin, I B; Mayorov, V I; Lavrentieva, M V; Milanesi, L; Adkison, L R
2000-09-01
The presence of repetitive elements can create serious problems for sequence analysis, especially in the case of homology searches in nucleotide sequence databases. Repetitive elements should be treated carefully by using special programs and databases. In this paper, various aspects of SINE (short interspersed repetitive element) identification, analysis and evolution are discussed.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Helfenbein, Kevin G.; Brown, Wesley M.; Boore, Jeffrey L.
We have sequenced the complete mitochondrial DNA (mtDNA) of the articulate brachiopod Terebratalia transversa. The circular genome is 14,291 bp in size, relatively small compared to other published metazoan mtDNAs. The 37 genes commonly found in animal mtDNA are present; the size decrease is due to the truncation of several tRNA, rRNA, and protein genes, to some nucleotide overlaps, and to a paucity of non-coding nucleotides. Although the gene arrangement differs radically from those reported for other metazoans, some gene junctions are shared with two other articulate brachiopods, Laqueus rubellus and Terebratulina retusa. All genes in the T. transversa mtDNA,more » unlike those in most metazoan mtDNAs reported, are encoded by the same strand. The A+T content (59.1 percent) is low for a metazoan mtDNA, and there is a high propensity for homopolymer runs and a strong base-compositional strand bias. The coding strand is quite G+T-rich, a skew that is shared by the confamilial (laqueid) specie s L. rubellus, but opposite to that found in T. retusa, a cancellothyridid. These compositional skews are strongly reflected in the codon usage patterns and the amino acid compositions of the mitochondrial proteins, with markedly different usage observed between T. retusa and the two laqueids. This observation, plus the similarity of the laqueid non-coding regions to the reverse complement of the non-coding region of the cancellothyridid, suggest that an inversion that resulted in a reversal in the direction of first-strand replication has occurred in one of the two lineages. In addition to the presence of one non-coding region in T. transversa that is comparable to those in the other brachiopod mtDNAs, there are two others with the potential to form secondary structures; one or both of these may be involved in the process of transcript cleavage.« less
Chloroplast DNA Structural Variation, Phylogeny, and Age of Divergence among Diploid Cotton Species.
Chen, Zhiwen; Feng, Kun; Grover, Corrinne E; Li, Pengbo; Liu, Fang; Wang, Yumei; Xu, Qin; Shang, Mingzhao; Zhou, Zhongli; Cai, Xiaoyan; Wang, Xingxing; Wendel, Jonathan F; Wang, Kunbo; Hua, Jinping
2016-01-01
The cotton genus (Gossypium spp.) contains 8 monophyletic diploid genome groups (A, B, C, D, E, F, G, K) and a single allotetraploid clade (AD). To gain insight into the phylogeny of Gossypium and molecular evolution of the chloroplast genome in this group, we performed a comparative analysis of 19 Gossypium chloroplast genomes, six reported here for the first time. Nucleotide distance in non-coding regions was about three times that of coding regions. As expected, distances were smaller within than among genome groups. Phylogenetic topologies based on nucleotide and indel data support for the resolution of the 8 genome groups into 6 clades. Phylogenetic analysis of indel distribution among the 19 genomes demonstrates contrasting evolutionary dynamics in different clades, with a parallel genome downsizing in two genome groups and a biased accumulation of insertions in the clade containing the cultivated cottons leading to large (for Gossypium) chloroplast genomes. Divergence time estimates derived from the cpDNA sequence suggest that the major diploid clades had diverged approximately 10 to 11 million years ago. The complete nucleotide sequences of 6 cpDNA genomes are provided, offering a resource for cytonuclear studies in Gossypium.
Chloroplast DNA Structural Variation, Phylogeny, and Age of Divergence among Diploid Cotton Species
Li, Pengbo; Liu, Fang; Wang, Yumei; Xu, Qin; Shang, Mingzhao; Zhou, Zhongli; Cai, Xiaoyan; Wang, Xingxing; Wendel, Jonathan F.; Wang, Kunbo
2016-01-01
The cotton genus (Gossypium spp.) contains 8 monophyletic diploid genome groups (A, B, C, D, E, F, G, K) and a single allotetraploid clade (AD). To gain insight into the phylogeny of Gossypium and molecular evolution of the chloroplast genome in this group, we performed a comparative analysis of 19 Gossypium chloroplast genomes, six reported here for the first time. Nucleotide distance in non-coding regions was about three times that of coding regions. As expected, distances were smaller within than among genome groups. Phylogenetic topologies based on nucleotide and indel data support for the resolution of the 8 genome groups into 6 clades. Phylogenetic analysis of indel distribution among the 19 genomes demonstrates contrasting evolutionary dynamics in different clades, with a parallel genome downsizing in two genome groups and a biased accumulation of insertions in the clade containing the cultivated cottons leading to large (for Gossypium) chloroplast genomes. Divergence time estimates derived from the cpDNA sequence suggest that the major diploid clades had diverged approximately 10 to 11 million years ago. The complete nucleotide sequences of 6 cpDNA genomes are provided, offering a resource for cytonuclear studies in Gossypium. PMID:27309527
Naumova O, Y u; Rychkov S, Y u
1998-03-01
On the basis of analysis of mtDNA from skeletal remains, dated by 14C 4020-3210 BC, from the Ust'-Ida I Neolithic burial ground in Cis-Baikal area of Siberia, we obtained genetic characteristics of the ancient Mongoloid population. Using the 7 restriction enzymes for the analysis of site's polymorphism in 16,106-16,545 region of mtDNA, we studied the structure of the most frequent DNA haplotypes, and estimated the intrapopulational nucleotide diversity of the Neolithic population. Comparison of the Neolithic and modern indigeneous populations from Siberia, Mongolia and Ural showed, that the ancient Siberian population is one of the ancestors of the modern population of Siberia. From genetic distance, in the assumption of constant nucleotide substitution rate, we estimated the divergence time between the Neolithic and the modern Siberian population. This divergence time (5572 years ago) is conformed to the age of skeletal remains (5542-5652 years). With use of the 14C dates of the skeletal remains, nucleotide substitution rate in mtDNA was estimated as 1% sequence divergence for 8938-9115 years.
Dong, Biqin; Almassalha, Luay M.; Stypula-Cyrus, Yolanda; Urban, Ben E.; Chandler, John E.; Nguyen, The-Quyen; Sun, Cheng; Zhang, Hao F.; Backman, Vadim
2016-01-01
Visualizing the nanoscale intracellular structures formed by nucleic acids, such as chromatin, in nonperturbed, structurally and dynamically complex cellular systems, will help expand our understanding of biological processes and open the next frontier for biological discovery. Traditional superresolution techniques to visualize subdiffractional macromolecular structures formed by nucleic acids require exogenous labels that may perturb cell function and change the very molecular processes they intend to study, especially at the extremely high label densities required for superresolution. However, despite tremendous interest and demonstrated need, label-free optical superresolution imaging of nucleotide topology under native nonperturbing conditions has never been possible. Here we investigate a photoswitching process of native nucleotides and present the demonstration of subdiffraction-resolution imaging of cellular structures using intrinsic contrast from unmodified DNA based on the principle of single-molecule photon localization microscopy (PLM). Using DNA-PLM, we achieved nanoscopic imaging of interphase nuclei and mitotic chromosomes, allowing a quantitative analysis of the DNA occupancy level and a subdiffractional analysis of the chromosomal organization. This study may pave a new way for label-free superresolution nanoscopic imaging of macromolecular structures with nucleotide topologies and could contribute to the development of new DNA-based contrast agents for superresolution imaging. PMID:27535934
Thermodynamics of RNA duplexes modified with unlocked nucleic acid nucleotides
Pasternak, Anna; Wengel, Jesper
2010-01-01
Thermodynamics provides insights into the influence of modified nucleotide residues on stability of nucleic acids and is crucial for designing duplexes with given properties. In this article, we introduce detailed thermodynamic analysis of RNA duplexes modified with unlocked nucleic acid (UNA) nucleotide residues. We investigate UNA single substitutions as well as model mismatch and dangling end effects. UNA residues placed in a central position makes RNA duplex structure less favourable by 4.0–6.6 kcal/mol. Slight destabilization, by ∼0.5–1.5 kcal/mol, is observed for 5′- or 3′-terminal UNA residues. Furthermore, thermodynamic effects caused by UNA residues are extremely additive with ΔG°37 conformity up to 98%. Direct mismatches involving UNA residues decrease the thermodynamic stability less than unmodified mismatches in RNA duplexes. Additionally, the presence of UNA residues adjacent to unpaired RNA residues reduces mismatch discrimination. Thermodynamic analysis of UNA 5′- and 3′-dangling ends revealed that stacking interactions of UNA residues are always less favourable than that of RNA residues. Finally, circular dichroism spectra imply no changes in overall A-form structure of UNA–RNA/RNA duplexes relative to the unmodified RNA duplexes. PMID:20562222
Thakur, Chandar S.; Brown, Margaret E.; Sama, Jacob N.; Jackson, Melantha E.
2010-01-01
Since RNAs lie at the center of most cellular processes, there is a need for synthesizing large amounts of RNAs made from stable isotope-labeled nucleotides to advance the study of their structure and dynamics by nuclear magnetic resonance (NMR) spectroscopy. A particularly effective means of obtaining labeled nucleotides is to harvest these nucleotides from bacteria grown in defined minimal media supplemented with 15NH4Cl and various carbon sources. Given the high cost of carbon precursors required for labeling nucleic acids for NMR studies, it becomes important to evaluate the optimal growth for commonly used strains under standard minimal media conditions. Such information is lacking. In this study, we characterize the growth for Escherichia coli strains K12, K10zwf, and DL323 in three minimal media with isotopic-labeled carbon sources of acetate, glycerol, and glycerol combined with formate. Of the three media, the LeMaster-Richards and the Studier media outperform the commonly used M9 media and both support optimal growth of E. coli for the production of nucleotides. However, the growth of all three E. coli strains in acetate is reduced almost twofold compared to growth in glycerol. Analysis of the metabolic pathway and previous gene array studies help to explain this differential growth in glycerol and acetate. These studies should benefit efforts to make selective 13C-15N isotopic-labeled nucleotides for synthesizing biologically important RNAs. Electronic supplementary material The online version of this article (doi:10.1007/s00253-010-2813-y) contains supplementary material, which is available to authorized users. PMID:20730533
Mallik, Saurav; Kundu, Sudip
2017-04-01
Understanding the molecular evolution of macromolecular complexes in the light of their structure, assembly, and stability is of central importance. Here, we address how the modular organization of native molecular contacts shapes the selection pressure on individual residue sites of ribosomal complexes. The bacterial ribosomal complex is represented as a residue contact network where nodes represent amino acid/nucleotide residues and edges represent their van der Waals interactions. We find statistically overrepresented native amino acid-nucleotide contacts (OaantC, one amino acid contacts one or multiple nucleotides, internucleotide contacts are disregarded). Contact number is defined as the number of nucleotides contacted. Involvement of individual amino acids in OaantCs with smaller contact numbers is more random, whereas only a few amino acids significantly contribute to OaantCs with higher contact numbers. An investigation of structure, stability, and assembly of bacterial ribosome depicts the involvement of these OaantCs in diverse biophysical interactions stabilizing the complex, including high-affinity protein-RNA contacts, interprotein cooperativity, intersubunit bridge, packing of multiple ribosomal RNA domains, etc. Amino acid-nucleotide constituents of OaantCs with higher contact numbers are generally associated with significantly slower substitution rates compared with that of OaantCs with smaller contact numbers. This evolutionary rate heterogeneity emerges from the strong purifying selection pressure that conserves the respective amino acid physicochemical properties relevant to the stabilizing interaction with OaantC nucleotides. An analysis of relative molecular orientations of OaantC residues and their interaction energetics provides the biophysical ground of purifying selection conserving OaantC amino acid physicochemical properties. © The Author(s) 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
García-Márquez, Adrián; Gijsbers, Abril; de la Mora, Eugenio; Sánchez-Puig, Nuria
2015-01-01
Ribosome biogenesis is orchestrated by the action of several accessory factors that provide time and directionality to the process. One such accessory factor is the GTPase EFL1 involved in the cytoplasmic maturation of the ribosomal 60S subunit. EFL1 and SBDS, the protein mutated in the Shwachman-Diamond syndrome (SBDS), release the anti-association factor eIF6 from the surface of the ribosomal subunit 60S. Here we report a kinetic analysis of fluorescent guanine nucleotides binding to EFL1 alone and in the presence of SBDS using fluorescence stopped-flow spectroscopy. Binding kinetics of EFL1 to both GDP and GTP suggests a two-step mechanism with an initial binding event followed by a conformational change of the complex. Furthermore, the same behavior was observed in the presence of the SBDS protein irrespective of the guanine nucleotide evaluated. The affinity of EFL1 for GTP is 10-fold lower than that calculated for GDP. Association of EFL1 to SBDS did not modify the affinity for GTP but dramatically decreased that for GDP by increasing the dissociation rate of the nucleotide. Thus, SBDS acts as a guanine nucleotide exchange factor (GEF) for EFL1 promoting its activation by the release of GDP. Finally, fluorescence anisotropy measurements showed that the S143L mutation present in the Shwachman-Diamond syndrome altered a surface epitope for EFL1 and largely decreased the affinity for it. These results suggest that loss of interaction between these proteins due to mutations in the disease consequently prevents the nucleotide exchange regulation the SBDS exerts on EFL1. PMID:25991726
Heinz, Eva; Hacker, Christian; Dean, Paul; Mifsud, John; Goldberg, Alina V.; Williams, Tom A.; Nakjang, Sirintra; Gregory, Alison; Hirt, Robert P.; Lucocq, John M.; Kunji, Edmund R. S.; Embley, T. Martin
2014-01-01
Microsporidia are obligate intracellular parasites of most animal groups including humans, but despite their significant economic and medical importance there are major gaps in our understanding of how they exploit infected host cells. We have investigated the evolution, cellular locations and substrate specificities of a family of nucleotide transport (NTT) proteins from Trachipleistophora hominis, a microsporidian isolated from an HIV/AIDS patient. Transport proteins are critical to microsporidian success because they compensate for the dramatic loss of metabolic pathways that is a hallmark of the group. Our data demonstrate that the use of plasma membrane-located nucleotide transport proteins (NTT) is a key strategy adopted by microsporidians to exploit host cells. Acquisition of an ancestral transporter gene at the base of the microsporidian radiation was followed by lineage-specific events of gene duplication, which in the case of T. hominis has generated four paralogous NTT transporters. All four T. hominis NTT proteins are located predominantly to the plasma membrane of replicating intracellular cells where they can mediate transport at the host-parasite interface. In contrast to published data for Encephalitozoon cuniculi, we found no evidence for the location for any of the T. hominis NTT transporters to its minimal mitochondria (mitosomes), consistent with lineage-specific differences in transporter and mitosome evolution. All of the T. hominis NTTs transported radiolabelled purine nucleotides (ATP, ADP, GTP and GDP) when expressed in Escherichia coli, but did not transport radiolabelled pyrimidine nucleotides. Genome analysis suggests that imported purine nucleotides could be used by T. hominis to make all of the critical purine-based building-blocks for DNA and RNA biosynthesis during parasite intracellular replication, as well as providing essential energy for parasite cellular metabolism and protein synthesis. PMID:25474405
The Utility of Chromosomal Microarray Analysis in Developmental and Behavioral Pediatrics
ERIC Educational Resources Information Center
Beaudet, Arthur L.
2013-01-01
Chromosomal microarray analysis (CMA) has emerged as a powerful new tool to identify genomic abnormalities associated with a wide range of developmental disabilities including congenital malformations, cognitive impairment, and behavioral abnormalities. CMA includes array comparative genomic hybridization (CGH) and single nucleotide polymorphism…
Xin, Min; Zhang, Peipei; Liu, Wenwen; Ren, Yingdang; Cao, Mengji; Wang, Xifeng
2017-10-01
The complete nucleotide sequence of a novel positive single-stranded (+ss) RNA virus, tentatively named watermelon virus A (WVA), was determined using a combination of three methods: RNA sequencing, small RNA sequencing, and Sanger sequencing. The full genome of WVA is comprised of 8,372 nucleotides (nt), excluding the poly (A) tail, and contains four open reading frames (ORFs). The largest ORF, ORF1 encodes a putative replication-associated polyprotein (RP) with three conserved domains. ORF2 and ORF4 encode a movement protein (MP) and coat protein (CP), respectively. The putative product encoded by ORF3, of an estimated molecular mass of 25 kDa, has no significant similarity with other proteins. Identity and phylogenetic analysis indicate that WVA is a new virus, closely related to members of the family Betaflexiviridae. However, the final taxonomic allocation of WVA within the family is yet to be determined.
Real-time observation of the conformational dynamics of mitochondrial Hsp70 by spFRET.
Sikor, Martin; Mapa, Koyeli; von Voithenberg, Lena Voith; Mokranjac, Dejana; Lamb, Don C
2013-05-29
The numerous functions of the important class of molecular chaperones, heat shock proteins 70 (Hsp70), rely on cycles of intricate conformational changes driven by ATP-hydrolysis and regulated by cochaperones and substrates. Here, we used Förster resonance energy transfer to study the conformational dynamics of individual molecules of Ssc1, a mitochondrial Hsp70, in real time. The intrinsic dynamics of the substrate-binding domain of Ssc1 was observed to be uncoupled from the dynamic interactions between substrate- and nucleotide-binding domains. Analysis of the fluctuations in the interdomain separation revealed frequent transitions to a nucleotide-free state. The nucleotide-exchange factor Mge1 did not induce ADP release, as expected, but rather facilitated binding of ATP. These results indicate that the conformational cycle of Ssc1 is more elaborate than previously thought and provide insight into how the Hsp70s can perform a wide variety of functions.
Mitochondrial DNA variants observed in Alzheimer disease and Parkinson disease patients
DOE Office of Scientific and Technical Information (OSTI.GOV)
Shoffner, J.M.; Brown, M.D.; Torroni, A.
1993-07-01
Mitochondrial DNA (mtDNA) variants associated with Alzheimer disease (AD) and Parkinson disease (PD) were sought by restriction endonuclease analysis in a cohort of 71 late-onset Caucasian patients. A tRNA[sup Gln] gene variant at nucleotide pair (np) 4336 that altered a moderately conserved nucleotide was present in 9/173 (5.2%) of the patients surveyed but in only 0.7% of the general Caucasian controls. One of these patients harbored an additional novel 12S rRNA 5-nucleotide insertion at np 956-965, while a second had a missense variant at np 3397 that converted a highly conserved methionine to a valine. This latter mutation was alsomore » found in an independent AD + PD patient, as was a heteroplasmic 16S rRNA variant at np 3196. Additional studies will be required to determine the significance, if any, of these mutations. 122 refs., 4 figs., 2 tabs.« less
Single-cell analysis of intercellular heteroplasmy of mtDNA in Leber hereditary optic neuropathy
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kobayashi, Y.; Sharpe, H.; Brown, N.
1994-07-01
The authors have investigated the distribution of mutant mtDNA molecules in single cells from a patient with Leber hereditary optic neuropathy (LHON). LHON is a maternally inherited disease that is characterized by a sudden-onset bilateral loss of central vision, which typically occurs in early adulthood. More than 50% of all LHON patients carry an mtDNA mutation at nucleotide position 11778. This nucleotide change converts a highly conserved arginine residue to histidine at codon 340 in the NADH-ubiquinone oxidoreductase subunit 4 (ND4) gene of mtDNA. In the present study, the authors used PCR amplification of mtDNA from lymphocytes to investigate mtDNAmore » heteroplasmy at the single-cell level in a LHON patient. They found that most cells were either homoplasmic normal or homoplasmic mutant at nucleotide position 11778. Some (16%) cells contained both mutant and normal mtDNA.« less
Musi, Elgilda; Islam, Naziba; Drosopoulos, Joan H F
2007-05-01
Human CD39/NTPDase1 is an endothelial cell membrane-associated nucleotidase. Its large extracellular domain rapidly metabolizes nucleotides, especially ADP released from activated platelets, inhibiting further platelet activation/recruitment. Previous studies using our recombinant soluble CD39 demonstrated the importance of residues S57, D54, and D213 for enzymatic/biological activity. We now report effects of S57A, D54A, and D213A mutations on full-length (FL)CD39 function. Enzymatic activity of alanine modified FLCD39s was less than wild-type, contrasting the enhanced activity of their soluble counterparts. Furthermore, conservative substitutions D54E and D213E led to enzymes with activities greater than the alanine modified FLCD39s, but less than wild-type. Reductions in mutant activities were primarily associated with reduced catalytic rates. Differences in enzymatic activity were not attributable to gross changes in the nucleotide binding pocket or the enzyme's ability to multimerize. Thus, composition of the active site of wild-type CD39 appears optimized for ADPase function in the context of the transmembrane domains.
Marasco, Michelle; Li, Weiyi; Lynch, Michael
2017-01-01
Abstract All eukaryotes have three essential nuclear multisubunit RNA polymerases, abbreviated as Pol I, Pol II and Pol III. Plants are remarkable in having two additional multisubunit RNA polymerases, Pol IV and Pol V, which synthesize noncoding RNAs that coordinate RNA-directed DNA methylation for silencing of transposons and a subset of genes. Based on their subunit compositions, Pols IV and V clearly evolved as specialized forms of Pol II, but their catalytic properties remain undefined. Here, we show that Pols IV and V differ from one another, and Pol II, in nucleotide incorporation rate, transcriptional accuracy and the ability to discriminate between ribonucleotides and deoxyribonucleotides. Pol IV transcription is considerably more error-prone than Pols II or V, which may be tolerable in its synthesis of short RNAs that serve as precursors for siRNAs targeting non-identical members of transposon families. By contrast, Pol V exhibits high fidelity transcription, similar to Pol II, suggesting a need for Pol V transcripts to faithfully reflect the DNA sequence of target loci to which siRNA–Argonaute silencing complexes are recruited. PMID:28977461
Gorzkiewicz, Michał; Buczkowski, Adam; Appelhans, Dietmar; Voit, Brigitte; Pułaski, Łukasz; Pałecz, Bartłomiej; Klajnert-Maculewicz, Barbara
2018-06-10
Adenosine analogue drugs (such as fludarabine or cladribine) require transporter-mediated uptake into cells and subsequent phosphorylation for anticancer activity. Therefore, application of nanocarrier systems for direct delivery of active triphosphate forms has been proposed. Here, we applied isothermal titration calorimetry and zeta potential titration to determine the stoichiometry and thermodynamic parameters of interactions between 4th generation poly(propyleneimine) dendrimers (unmodified or sugar-modified for increased biocompatibility) and ATP as a model adenosine nucleotide. We showed that glycodendrimers have the ability to efficiently interact with nucleoside triphosphates and to form stable complexes via electrostatic interactions between the ionized phosphate and amino groups on the nucleotide and the dendrimer, respectively. The complexation process is spontaneous, enthalpy-driven and depends on buffer composition (strongest interactions in organic buffer) and pH (more binding sites in acidic pH). These properties allow us to consider maltose-modified dendrimers as especially promising carriers for adenosine analogues. Copyright © 2018 Elsevier B.V. All rights reserved.
CNG site-specific and methyl-sensitive endonuclease WEN1 from wheat seedlings.
Fedoreyeva, L I; Vanyushin, B F
2011-06-01
Endonuclease WEN1 with apparent molecular mass about 27 kDa isolated from cytoplasmic vesicular fraction of aging coleoptiles of wheat seedlings has expressed site specificity action. This is a first detection and isolation of a site-specific endonuclease from higher eukaryotes, in general, and higher plants, in particular. The enzyme hydrolyzes deoxyribooligonucleotides of different composition on CNG (N is G, A, C, or T) sites by splitting the phosphodiester bond between C and N nucleotide residues in CNG sequence independent from neighbor nucleotide context except for CCCG. WEN1 prefers to hydrolyze methylated λ phage DNA and double-stranded deoxyribooligonucleotides containing 5-methylcytosine sites (m(5)CAG, m(5)CTG) compared with unmethylated substrates. The enzyme is also able to hydrolyze single-stranded substrates, but in this case it splits unmethylated substrates predominantly. Detection in wheat seedlings of WEN1 endonuclease that is site specific, sensitive to the substrate methylation status, and modulated with S-adenosyl-L-methionine indicates that in higher plants restriction--modification systems or some of their elements, at least, may exist.
Kaneko, Kentaro; Takamatsu, Takeshi; Inomata, Takuya; Oikawa, Kazusato; Itoh, Kimiko; Hirose, Kazuko; Amano, Maho; Nishimura, Shin-Ichiro; Toyooka, Kiminori; Matsuoka, Ken; Pozueta-Romero, Javier; Mitsui, Toshiaki
2016-01-01
Nucleotide pyrophosphatase/phosphodiesterases (NPPs) are widely distributed N-glycosylated enzymes that catalyze the hydrolytic breakdown of numerous nucleotides and nucleotide sugars. In many plant species, NPPs are encoded by a small multigene family, which in rice are referred to NPP1–NPP6. Although recent investigations showed that N-glycosylated NPP1 is transported from the endoplasmic reticulum (ER)–Golgi system to the chloroplast through the secretory pathway in rice cells, information on N-glycan composition and subcellular localization of other NPPs is still lacking. Computer-assisted analyses of the amino acid sequences deduced from different Oryza sativa NPP-encoding cDNAs predicted all NPPs to be secretory glycoproteins. Confocal fluorescence microscopy observation of cells expressing NPP2 and NPP6 fused with green fluorescent protein (GFP) revealed that NPP2 and NPP6 are plastidial proteins. Plastid targeting of NPP2–GFP and NPP6–GFP was prevented by brefeldin A and by the expression of ARF1(Q71L), a dominant negative mutant of ADP-ribosylation factor 1 that arrests the ER to Golgi traffic, indicating that NPP2 and NPP6 are transported from the ER–Golgi to the plastidial compartment. Confocal laser scanning microscopy and high-pressure frozen/freeze-substituted electron microscopy analyses of transgenic rice cells ectopically expressing the trans-Golgi marker sialyltransferase fused with GFP showed the occurrence of contact of Golgi-derived membrane vesicles with cargo and subsequent absorption into plastids. Sensitive and high-throughput glycoblotting/mass spectrometric analyses showed that complex-type and paucimannosidic-type glycans with fucose and xylose residues occupy approximately 80% of total glycans of NPP1, NPP2 and NPP6. The overall data strongly indicate that the trans-Golgi compartments participate in the Golgi to plastid trafficking and targeting mechanism of NPPs. PMID:27335351
Deniaud, Aurélien; Panwar, Pankaj; Frelet-Barrand, Annie; Bernaudat, Florent; Juillan-Binard, Céline; Ebel, Christine; Rolland, Norbert; Pebay-Peyroula, Eva
2012-01-01
Background Chloroplast ATP/ADP transporters are essential to energy homeostasis in plant cells. However, their molecular mechanism remains poorly understood, primarily due to the difficulty of producing and purifying functional recombinant forms of these transporters. Methodology/Principal Findings In this work, we describe an expression and purification protocol providing good yields and efficient solubilization of NTT1 protein from Arabidopsis thaliana. By biochemical and biophysical analyses, we identified the best detergent for solubilization and purification of functional proteins, LAPAO. Purified NTT1 was found to accumulate as two independent pools of well folded, stable monomers and dimers. ATP and ADP binding properties were determined, and Pi, a co-substrate of ADP, was confirmed to be essential for nucleotide steady-state transport. Nucleotide binding studies and analysis of NTT1 mutants lead us to suggest the existence of two distinct and probably inter-dependent binding sites. Finally, fusion and deletion experiments demonstrated that the C-terminus of NTT1 is not essential for multimerization, but probably plays a regulatory role, controlling the nucleotide exchange rate. Conclusions/Significance Taken together, these data provide a comprehensive molecular characterization of a chloroplast ATP/ADP transporter. PMID:22438876
Posttranscriptional modifications in the A-loop of 23S rRNAs from selected archaea and eubacteria.
Hansen, M A; Kirpekar, F; Ritterbusch, W; Vester, B
2002-02-01
Posttranscriptional modifications were mapped in helices 90-92 of 23S rRNA from the following phylogenetically diverse organisms: Haloarcula marismortui, Sulfolobus acidocaldarius, Bacillus subtilis, and Bacillus stearothermophilus. Helix 92 is a component of the ribosomal A-site, which contacts the aminoacyl-tRNA during protein synthesis, implying that posttranscriptional modifications in helices 90-92 may be important for ribosome function. RNA fragments were isolated from 23S rRNA by site-directed RNase H digestion. A novel method of mapping modifications by analysis of short, nucleotide-specific, RNase digestion fragments with Matrix Assisted Laser Desorption/Ionization Mass Spectrometry (MALDI-MS) was utilized. The MALDI-MS data were complemented by two primer extension techniques using reverse transcriptase. One technique utilizes decreasing concentrations of deoxynucleotide triphosphates to map 2'-O-ribose methylations. In the other, the rRNA is chemically modified, followed by mild alkaline hydrolysis to map pseudouridines (psis). A total of 10 posttranscriptionally methylated nucleotides and 6 psis were detected in the five organisms. Eight of the methylated nucleotides and one psi have not been reported previously. The distribution of modified nucleotides and their locations on the surface of the ribosomal peptidyl transferase cleft suggests functional importance.
The extent of linkage disequilibrium in beef cattle breeds using high-density SNP genotypes.
Porto-Neto, Laercio R; Kijas, James W; Reverter, Antonio
2014-03-24
The extent of linkage disequilibrium (LD) between molecular markers impacts genome-wide association studies and implementation of genomic selection. The availability of high-density single nucleotide polymorphism (SNP) genotyping platforms makes it possible to investigate LD at an unprecedented resolution. In this work, we characterised LD decay in breeds of beef cattle of taurine, indicine and composite origins and explored its variation across autosomes and the X chromosome. In each breed, LD decayed rapidly and r2 was less than 0.2 for marker pairs separated by 50 kb. The LD decay curves clustered into three groups of similar LD decay that distinguished the three main cattle types. At short distances between markers (<10 kb), taurine breeds showed higher LD (r2=0.45) than their indicine (r2=0.25) and composite (r2=0.32) counterparts. This higher LD in taurine breeds was attributed to a smaller effective population size and a stronger bottleneck during breed formation. Using all SNPs on only the X chromosome, the three cattle types could still be distinguished. However for taurine breeds, the LD decay on the X chromosome was much faster and the background level much lower than for indicine breeds and composite populations. When using only SNPs that were polymorphic in all breeds, the analysis of the X chromosome mimicked that of the autosomes. The pattern of LD mirrored some aspects of the history of breed populations and showed a sharp decay with increasing physical distance between markers. We conclude that the availability of the HD chip can be used to detect association signals that remained hidden when using lower density genotyping platforms, since LD dropped below 0.2 at distances of 50 kb.
Gritsun, T S; Venugopal, K; Zanotto, P M; Mikhailov, M V; Sall, A A; Holmes, E C; Polkinghorne, I; Frolova, T V; Pogodina, V V; Lashkevich, V A; Gould, E A
1997-05-01
The complete nucleotide sequence of two tick-transmitted flaviviruses, Vasilchenko (Vs) from Siberia and louping ill (LI) from the UK, have been determined. The genomes were respectively, 10928 and 10871 nucleotides (nt) in length. The coding strategy and functional protein sequence motifs of tick-borne flaviviruses are presented in both Vs and LI viruses. The phylogenies based on maximum likelihood, maximum parsimony and distance analysis of the polyproteins, identified Vs virus as a member of the tick-borne encephalitis virus subgroup within the tick-borne serocomplex, genus Flavivirus, family Flaviviridae. Comparative alignment of the 3'-untranslated regions revealed deletions of different lengths essentially at the same position downstream of the stop codon for all tick-borne viruses. Two direct 27 nucleotide repeats at the 3'-end were found only for Vs and LI virus. Immediately following the deletions a region of 332-334 nt with relatively conserved primary structure (67-94% identity) was observed at the 3'-non-coding end of the virus genome. Pairwise comparisons of the nucleotide sequence data revealed similar levels of variation between the coding region, and the 5' and 3'-termini of the genome, implying an equivalent strong selective control for translated and untranslated regions. Indeed the predicted folding of the 5' and 3'-untranslated regions revealed patterns of stem and loop structures conserved for all tick-borne flaviviruses suggesting a purifying selection for preservation of essential RNA secondary structures which could be involved in translational control and replication. The possible implications of these findings are discussed.
Hoh, Joseph F Y; Li, Zhao-Bo; Qin, Han; Hsu, Michael K H; Rossmanith, Gunther H
2007-01-01
Mechanical properties of the jaw-closing muscles of the cat are poorly understood. These muscles are known to differ in myosin and fibre type compositions from limb muscles. This work aims to correlate mechanical properties of single fibres in cat jaw and limb muscles with their myosin subunit compositions. The stiffness minimum frequency, f(min), which reflects isometric cross-bridge kinetics, was measured in Ca(2+)-activated glycerinated fast and slow fibres from cat jaw and limb muscles for temperatures ranging between 15 and 30 degrees C by mechanical perturbation analysis. At 15 degrees C, f(min) was 0.5 Hz for limb-slow fibres, 4-6 Hz for jaw-slow fibres, and 10-13 Hz for limb-fast and jaw-fast fibres. The activation energy for f(min) obtained from the slope of the Arrhenius plot for limb-slow fibres was 30-40% higher than values for the other three types of fibres. SDS-PAGE and western blotting using highly specific antibodies verified that limb-fast fibres contained IIA or IIX myosin heavy chain (MyHC). Jaw-fast fibres expressed masticatory MyHC while both jaw-fast and jaw-slow fibres expressed masticatory myosin light chains (MLCs). The nucleotide sequences of the 3' ends of the slow MyHC cDNAs isolated from cat masseter and soleus cDNA libraries showed identical coding and 3'-untranslated regions, suggesting that jaw-slow and limb-slow fibres express the same slow MyHC gene. We conclude that the isometric cross-bridge cycling kinetics of jaw-fast and limb-fast fibres detected by f(min) are indistinguishable in spite of differences in MyHC and light chain compositions. However, jaw-slow fibres, in which the same slow MyHCs are found in combination with MLCs of the jaw type, show enhanced cross-bridge cycling kinetics and reduced activation energy for cross-bridge detachment.
Genetic Variation of the Ghrelin Signalling System in Individuals with Amphetamine Dependence
Jayaram-Lindström, Nitya; Nilsson, Staffan; Toren, Kjell; Rosengren, Annika; Engel, Jörgen A.; Franck, Johan
2013-01-01
The development of amphetamine dependence largely depends on the effects of amphetamine in the brain reward systems. Ghrelin, an orexigenic peptide, activates the reward systems and is required for reward induced by alcohol, nicotine, cocaine and amphetamine in mice. Human genetic studies have shown that polymorphisms in the pre-proghrelin (GHRL) as well as GHS-R1A (GHSR) genes are associated with high alcohol consumption, increased weight and smoking in males. Since the heritability factor underlying drug dependence is shared between different drugs of abuse, we here examine the association between single nucleotide polymorphisms (SNPs) and haplotypes in the GHRL and GHSR, and amphetamine dependence. GHRL and GHSR SNPs were genotyped in Swedish amphetamine dependent individuals (n = 104) and controls from the general population (n = 310). A case-control analysis was performed and SNPs and haplotypes were additionally tested for association against Addiction Severity Interview (ASI) composite score of drug use. The minor G-allele of the GHSR SNP rs2948694, was more common among amphetamine dependent individuals when compared to controls (pc = 0.02). A significant association between the GHRL SNP rs4684677 and ASI composite score of drug use was also reported (pc = 0.03). The haplotype analysis did not add to the information given by the individual polymorphisms. Although genetic variability of the ghrelin signalling system is not a diagnostic marker for amphetamine dependence and problem severity of drug use, the present results strengthen the notion that ghrelin and its receptor may be involved in the development of addictive behaviours and may thus serve as suitable targets for new treatments of such disorders. PMID:23579732
Genetic variation of the ghrelin signalling system in individuals with amphetamine dependence.
Suchankova, Petra; Jerlhag, Elisabet; Jayaram-Lindström, Nitya; Nilsson, Staffan; Toren, Kjell; Rosengren, Annika; Engel, Jörgen A; Franck, Johan
2013-01-01
The development of amphetamine dependence largely depends on the effects of amphetamine in the brain reward systems. Ghrelin, an orexigenic peptide, activates the reward systems and is required for reward induced by alcohol, nicotine, cocaine and amphetamine in mice. Human genetic studies have shown that polymorphisms in the pre-proghrelin (GHRL) as well as GHS-R1A (GHSR) genes are associated with high alcohol consumption, increased weight and smoking in males. Since the heritability factor underlying drug dependence is shared between different drugs of abuse, we here examine the association between single nucleotide polymorphisms (SNPs) and haplotypes in the GHRL and GHSR, and amphetamine dependence. GHRL and GHSR SNPs were genotyped in Swedish amphetamine dependent individuals (n = 104) and controls from the general population (n = 310). A case-control analysis was performed and SNPs and haplotypes were additionally tested for association against Addiction Severity Interview (ASI) composite score of drug use. The minor G-allele of the GHSR SNP rs2948694, was more common among amphetamine dependent individuals when compared to controls (pc = 0.02). A significant association between the GHRL SNP rs4684677 and ASI composite score of drug use was also reported (pc = 0.03). The haplotype analysis did not add to the information given by the individual polymorphisms. Although genetic variability of the ghrelin signalling system is not a diagnostic marker for amphetamine dependence and problem severity of drug use, the present results strengthen the notion that ghrelin and its receptor may be involved in the development of addictive behaviours and may thus serve as suitable targets for new treatments of such disorders.
Yoo, Ran Hee; Lee, Seung-Won; Lim, Seungmo; Zhao, Fumei; Igori, Davaajargal; Baek, Dasom; Hong, Jin-Sung; Lee, Su-Heon; Moon, Jae Sun
2017-12-01
Two novel viruses, isolated in Bonghwa, Republic of Korea, from an Ixeridium dentatum plant with yellowing mottle symptoms, have been provisionally named Ixeridium yellow mottle-associated virus 1 (IxYMaV-1) and Ixeridium yellow mottle-associated virus 2 (IxYMaV-2). IxYMaV-1 has a genome of 6,017 nucleotides sharing a 56.4% sequence identity with that of cucurbit aphid-borne yellows virus (genus Polerovirus). The IxYMaV-2 genome of 4,196 nucleotides has a sequence identity of less than 48.3% with e other species classified within the genus Umbravirus. Genome properties and phylogenetic analysis suggested that IxYMaV-1 and -2 are representative isolates of new species classifiable within the genus Polerovirus and Umbravirus, respectively.
Bioinformatic Analysis of Strawberry GSTF12 Gene
NASA Astrophysics Data System (ADS)
Wang, Xiran; Jiang, Leiyu; Tang, Haoru
2018-01-01
GSTF12 has always been known as a key factor of proanthocyanins accumulate in plant testa. Through bioinformatics analysis of the nucleotide and encoded protein sequence of GSTF12, it is more advantageous to the study of genes related to anthocyanin biosynthesis accumulation pathway. Therefore, we chosen GSTF12 gene of 11 kinds species, downloaded their nucleotide and protein sequence from NCBI as the research object, found strawberry GSTF12 gene via bioinformation analyse, constructed phylogenetic tree. At the same time, we analysed the strawberry GSTF12 gene of physical and chemical properties and its protein structure and so on. The phylogenetic tree showed that Strawberry and petunia were closest relative. By the protein prediction, we found that the protein owed one proper signal peptide without obvious transmembrane regions.
Prediction of siRNA potency using sparse logistic regression.
Hu, Wei; Hu, John
2014-06-01
RNA interference (RNAi) can modulate gene expression at post-transcriptional as well as transcriptional levels. Short interfering RNA (siRNA) serves as a trigger for the RNAi gene inhibition mechanism, and therefore is a crucial intermediate step in RNAi. There have been extensive studies to identify the sequence characteristics of potent siRNAs. One such study built a linear model using LASSO (Least Absolute Shrinkage and Selection Operator) to measure the contribution of each siRNA sequence feature. This model is simple and interpretable, but it requires a large number of nonzero weights. We have introduced a novel technique, sparse logistic regression, to build a linear model using single-position specific nucleotide compositions which has the same prediction accuracy of the linear model based on LASSO. The weights in our new model share the same general trend as those in the previous model, but have only 25 nonzero weights out of a total 84 weights, a 54% reduction compared to the previous model. Contrary to the linear model based on LASSO, our model suggests that only a few positions are influential on the efficacy of the siRNA, which are the 5' and 3' ends and the seed region of siRNA sequences. We also employed sparse logistic regression to build a linear model using dual-position specific nucleotide compositions, a task LASSO is not able to accomplish well due to its high dimensional nature. Our results demonstrate the superiority of sparse logistic regression as a technique for both feature selection and regression over LASSO in the context of siRNA design.
Translational genomics for analysis of complex traits in peanut and sorghum
USDA-ARS?s Scientific Manuscript database
The integration of sequencing and genotype data from natural variation studies (by whole genome resequencing [wgs] or genotype by sequencing [gbs]), transcriptome (RNA-seq) and mutant analysis (also by wgs) facilitated the development of DNA markers in the form of single nucleotide polymorphic (SNP)...
Aromatic-degrading Sphingomonas isolates from the deep subsurface
DOE Office of Scientific and Technical Information (OSTI.GOV)
Fredrickson, J.K.; Romine, M.F.; Balkwill, D.L.
An obligately aerobic chemoheterotrophic bacterium (strain F199) previously isolated from Southeast Coastal Plain subsurface sediments and shown to degrade toluene, naphthalene, and other aromatic compounds was characterized by analysis of its 16S rRNA nucleotide base sequence and cellular lipid composition. Strain F199 contained 2-OH14:0 and 18:1{omega}7c as the predominant cellular fatty acids and sphingolipids that are characteristic of the genus Sphingomonas. Phylogenetic analysis of its 16SrRNA sequence indicated that F199 was most closely related to Sphingomonas capsulata among the bacteria currently in the Ribosomal Database. Five additional isolates from deep Southeast Coastal Plain sediments were determined by 16S rRNA sequencemore » analysis to be closely related to F199. These strains also contained characteristic sphingolipids. Four of these five strains could also grow on a broad range of aromatic compounds and could mineralize [{sup 14C}]toluene and [{sup 14C}]naphthalene. S. capsulata (ATCC 14666), Sphingomonas paucimobiolis (ATCC 29837), and one of the subsurface isolates were unable to grow on any of the aromatic compounds or mineralize toluene or naphthalene. These results indicate that bacteria within the genus Sphingomonas are present in Southeast Coastal Plain subsurface sediments and that the capacity for degrading a broad range of substituted aromatic compounds appears to be common among Sphingomonas species from this environment. 41 refs., 2 figs., 5 tabs.« less
Allen, Alexandra M; Barker, Gary L A; Berry, Simon T; Coghill, Jane A; Gwilliam, Rhian; Kirby, Susan; Robinson, Phil; Brenchley, Rachel C; D'Amore, Rosalinda; McKenzie, Neil; Waite, Darren; Hall, Anthony; Bevan, Michael; Hall, Neil; Edwards, Keith J
2011-12-01
Food security is a global concern and substantial yield increases in cereal crops are required to feed the growing world population. Wheat is one of the three most important crops for human and livestock feed. However, the complexity of the genome coupled with a decline in genetic diversity within modern elite cultivars has hindered the application of marker-assisted selection (MAS) in breeding programmes. A crucial step in the successful application of MAS in breeding programmes is the development of cheap and easy to use molecular markers, such as single-nucleotide polymorphisms. To mine selected elite wheat germplasm for intervarietal single-nucleotide polymorphisms, we have used expressed sequence tags derived from public sequencing programmes and next-generation sequencing of normalized wheat complementary DNA libraries, in combination with a novel sequence alignment and assembly approach. Here, we describe the development and validation of a panel of 1114 single-nucleotide polymorphisms in hexaploid bread wheat using competitive allele-specific polymerase chain reaction genotyping technology. We report the genotyping results of these markers on 23 wheat varieties, selected to represent a broad cross-section of wheat germplasm including a number of elite UK varieties. Finally, we show that, using relatively simple technology, it is possible to rapidly generate a linkage map containing several hundred single-nucleotide polymorphism markers in the doubled haploid mapping population of Avalon × Cadenza. © 2011 The Authors. Plant Biotechnology Journal © 2011 Society for Experimental Biology, Association of Applied Biologists and Blackwell Publishing Ltd.
Lühr, B; Scheller, J; Meyer, P; Kramer, W
1998-02-01
We have analysed the correction of defined mismatches in wild-type and msh2, msh3, msh6 and msh3 msh6 mutants of Saccharomyces cerevisiae in two different yeast strain backgrounds by transformation with plasmid heteroduplex DNA constructs. Ten different base/base mismatches, two single-nucleotide loops and a 38-nucleotide loop were tested. Repair of all types of mismatches was severely impaired in msh2 and msh3 msh6 mutants. In msh6 mutants, repair efficiency of most base/base mismatches was reduced to a similar extent as in msh3 msh6 double mutants. G/T and A/C mismatches, however, displayed residual repair in msh6 mutants in one strain background, implying a role for Msh3p in recognition of base/base mismatches. Furthermore, the efficiency of repair of base/base mismatches was considerably reduced in msh3 mutants in one strain background, indicating a requirement for MSH3 for fully efficient mismatch correction. Also the efficiency of repair of the 38-nucleotide loop was reduced in msh3 mutants, and to a lesser extent in msh6 mutants. The single-nucleotide loop with an unpaired A was less efficiently repaired in msh3 mutants and that with an unpaired T was less efficiently corrected in msh6 mutants, indicating non-redundant functions for the two proteins in the recognition of single-nucleotide loops.
Im, JongOne; Sen, Suman; Lindsay, Stuart; Zhang, Peiming
2018-06-28
In the present study, we demonstrate a tunneling nanogap technique to identify individual RNA nucleotides, which can be used as a mechanism to read the nucleobases for direct sequencing of RNA in a solid-state nanopore. The tunneling nanogap is composed of two electrodes separated by a distance of <3 nm and functionalized with a recognition molecule. When a chemical entity is captured in the gap, it generates electron tunneling currents, a process we call recognition tunneling (RT). Using RT nanogaps created in a scanning tunneling microscope (STM), we acquired the electron tunneling signals for the canonical and two modified RNA nucleotides. To call the individual RNA nucleotides from the RT data, we adopted a machine learning algorithm, support vector machine (SVM), for the data analysis. Through the SVM, we were able to identify the individual RNA nucleotides and distinguish them from their DNA counterparts with reasonably high accuracy. Since each RNA nucleoside contains a hydroxyl group at the 2'-position of its sugar ring in an RNA strand, it allows for the formation of a tunneling junction at a larger nanogap compared to the DNA nucleoside in a DNA strand, which lacks the 2' hydroxyl group. It also proves advantageous for the manufacture of RT devices. This study is a proof-of-principle demonstration for the development of an RT nanopore device for directly sequencing single RNA molecules, including those bearing modifications.
Global Shifts in Genome and Proteome Composition Are Very Tightly Coupled
Brbić, Maria; Warnecke, Tobias; Kriško, Anita; Supek, Fran
2015-01-01
The amino acid composition (AAC) of proteomes differs greatly between microorganisms and is associated with the environmental niche they inhabit, suggesting that these changes may be adaptive. Similarly, the oligonucleotide composition of genomes varies and may confer advantages at the DNA/RNA level. These influences overlap in protein-coding sequences, making it difficult to gauge their relative contributions. We disentangle these effects by systematically evaluating the correspondence between intergenic nucleotide composition, where protein-level selection is absent, the AAC, and ecological parameters of 909 prokaryotes. We find that G + C content, the most frequently used measure of genomic composition, cannot capture diversity in AAC and across ecological contexts. However, di-/trinucleotide composition in intergenic DNA predicts amino acid frequencies of proteomes to the point where very little cross-species variability remains unexplained (91% of variance accounted for). Qualitatively similar results were obtained for 49 fungal genomes, where 80% of the variability in AAC could be explained by the composition of introns and intergenic regions. Upon factoring out oligonucleotide composition and phylogenetic inertia, the residual AAC is poorly predictive of the microbes’ ecological preferences, in stark contrast with the original AAC. Moreover, highly expressed genes do not exhibit more prominent environment-related AAC signatures than lowly expressed genes, despite contributing more to the effective proteome. Thus, evolutionary shifts in overall AAC appear to occur almost exclusively through factors shaping the global oligonucleotide content of the genome. We discuss these results in light of contravening evidence from biophysical data and further reading frame-specific analyses that suggest that adaptation takes place at the protein level. PMID:25971281
Construction and characterization of poliovirus subgenomic replicons.
Kaplan, G; Racaniello, V R
1988-05-01
Poliovirus RNAs containing in-frame deletions within the capsid-coding region were produced by in vitro transcription of altered poliovirus type 1 cDNA by using bacteriophage T7 RNA polymerase. Three RNAs were transcribed that contained deletions of 2,317 nucleotides (bases 747 to 3064), 1,781 nucleotides (bases 1,175 to 2,956), and 1,295 nucleotides (bases 1,175 to 2,470). All three subgenomic RNAs replicated after transfection into HeLa cells, demonstrating that sequences encoding the capsid polypeptides are not essential for viral RNA replication in vivo. Viral RNA containing the largest deletion (R1) replicated approximately three times better than full-length RNA produced in vitro. Northern blot (RNA blot) hybridization analysis of total cellular RNA from HeLa cells at different times after transfection with R1 demonstrated the presence of increasing amounts of the expected 5.1-kilobase subgenomic RNA. Analysis by immunoprecipitation of viral proteins induced after transfection of R1 RNA into HeLa cells revealed the presence of proteins 2Apro, 2C, and 3Dpol and its precursors, suggesting that the polyprotein cleavages are similar to those occurring in virus-infected cells. Replication of P2/Lansing virion RNA was inhibited by cotransfection with the R1 replicon, as demonstrated by hybridization analysis with a serotype-specific oligonucleotide probe. A higher level of inhibition of RNA replication was observed when P2/Lansing RNA was cotransfected into HeLa cells with truncated R1 transcripts (R1-PvuII) that were missing 395 3' nucleotides and a poly(A) tail. These internally and terminally deleted RNAs inhibited the replication of subgenomic replicons R1, R2, and R3 and caused a reduction in plaque size when cotransfected with P1/Mahoney or P2/Lansing viral RNA, suggesting that individual cells had received both RNAs. No inhibition of plaque size was observed when replicon RNAs were used that were missing 1,384 or 1,839 3' nucleotides or contained plasmid-derived sequences downstream of the 3' poly(A). The trans-acting inhibitory effect of R1-PvuII on the replication of poliovirus P2/Lansing RNA did not involve entry of RNA into cells and appeared to reduce viral translation and RNA synthesis late in the infection cycle.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Choy, F.Y.M.; Wei, C.; Applegarth, D.A.
1994-06-01
Gaucher disease is the most frequent lysosomal lipid storage disease. It results from deficient glucocerebrosidase activity and is transmitted as an autosomal recessive trait. Three clinical forms of Gaucher disease have been described: type 1, non-neuronopathic; type 2, acute neuronopathic; and type 3, subacute neuronopathic. We have sequenced the full length cDNA of the glucocerebrosidase gene and identified an uncommon mutation in nucleotide position 1604 (genoma DNA nucleotide position 6683) from a Gaucher disease patient of Jewish-Polish-Russian descent with type 1 Gaucher disease. It is a G{yields}A transition in exon 11 that results in {sup 496}Arg{yields}{sup 496}His of glucocerebrosidase. Thismore » missense mutation is present in the heterozygous form and creates a new cleavage site for the endonuclease HphI. We have developed a simple method to detect the presence of this mutation by using HphI restriction fragment length polymorphism analysis of glucocerebrosidase genomic DNA or cDNA. The mutation in the other Gaucher allele of this patient is an A{yields}G transition at cDNA nucleotide position 1226 which creates an XhoI cleavage site after PCR mismatch amplification. The presence of this mutation was also confirmed by sequence analysis. Based on previous reports that mutation 1226 is present only in type 1 Gaucher disease and the observation that there is no neurological involvement in this patient, we conclude that our patient with the 1226/1604 genotype is diagnosed as having type 1 Gaucher disease. Since it was also postulated that mutation 1226 in the homozygous form will usually result in a good prognosis, we speculate that the orthopedic complications and the unusual presence of glomerulosclerosis in this patient may be attributable to the mutation at nucleotide 1604. This speculation will require a description of more patients with this mutation for confirmation. 32 refs., 5 figs.« less
Presence of a consensus DNA motif at nearby DNA sequence of the mutation susceptible CG nucleotides.
Chowdhury, Kaushik; Kumar, Suresh; Sharma, Tanu; Sharma, Ankit; Bhagat, Meenakshi; Kamai, Asangla; Ford, Bridget M; Asthana, Shailendra; Mandal, Chandi C
2018-01-10
Complexity in tissues affected by cancer arises from somatic mutations and epigenetic modifications in the genome. The mutation susceptible hotspots present within the genome indicate a non-random nature and/or a position specific selection of mutation. An association exists between the occurrence of mutations and epigenetic DNA methylation. This study is primarily aimed at determining mutation status, and identifying a signature for predicting mutation prone zones of tumor suppressor (TS) genes. Nearby sequences from the top five positions having a higher mutation frequency in each gene of 42 TS genes were selected from a cosmic database and were considered as mutation prone zones. The conserved motifs present in the mutation prone DNA fragments were identified. Molecular docking studies were done to determine putative interactions between the identified conserved motifs and enzyme methyltransferase DNMT1. Collective analysis of 42 TS genes found GC as the most commonly replaced and AT as the most commonly formed residues after mutation. Analysis of the top 5 mutated positions of each gene (210 DNA segments for 42 TS genes) identified that CG nucleotides of the amino acid codons (e.g., Arginine) are most susceptible to mutation, and found a consensus DNA "T/AGC/GAGGA/TG" sequence present in these mutation prone DNA segments. Similar to TS genes, analysis of 54 oncogenes not only found CG nucleotides of the amino acid Arg as the most susceptible to mutation, but also identified the presence of similar consensus DNA motifs in the mutation prone DNA fragments (270 DNA segments for 54 oncogenes) of oncogenes. Docking studies depicted that, upon binding of DNMT1 methylates to this consensus DNA motif (C residues of CpG islands), mutation was likely to occur. Thus, this study proposes that DNMT1 mediated methylation in chromosomal DNA may decrease if a foreign DNA segment containing this consensus sequence along with CG nucleotides is exogenously introduced to dividing cancer cells. Copyright © 2017 Elsevier B.V. All rights reserved.
Chakraborty, Srirupa; Zheng, Wenjun
2015-01-27
We have employed molecular dynamics (MD) simulation to investigate, with atomic details, the structural dynamics and energetics of three major ATPase states (ADP, APO, and ATP state) of a human kinesin-1 monomer in complex with a tubulin dimer. Starting from a recently solved crystal structure of ATP-like kinesin-tubulin complex by the Knossow lab, we have used flexible fitting of cryo-electron-microscopy maps to construct new structural models of the kinesin-tubulin complex in APO and ATP state, and then conducted extensive MD simulations (total 400 ns for each state), followed by flexibility analysis, principal component analysis, hydrogen bond analysis, and binding free energy analysis. Our modeling and simulation have revealed key nucleotide-dependent changes in the structure and flexibility of the nucleotide-binding pocket (featuring a highly flexible and open switch I in APO state) and the tubulin-binding site, and allosterically coupled motions driving the APO to ATP transition. In addition, our binding free energy analysis has identified a set of key residues involved in kinesin-tubulin binding. On the basis of our simulation, we have attempted to address several outstanding issues in kinesin study, including the possible roles of β-sheet twist and neck linker docking in regulating nucleotide release and binding, the structural mechanism of ADP release, and possible extension and shortening of α4 helix during the ATPase cycle. This study has provided a comprehensive structural and dynamic picture of kinesin's major ATPase states, and offered promising targets for future mutational and functional studies to investigate the molecular mechanism of kinesin motors.
Rybakowska, I M; Slominska, E M; Romaszko, P; Olkowicz, M; Kaletha, K; Smolenski, R T
2015-06-01
AMP-regulated protein kinase (AMPK) is involved in regulation of energy-generating pathways in response to the metabolic needs in different organs including the heart. The activity of AMPK is mainly controlled by AMP concentration that in turn could be affected by nucleotide metabolic pathways. This study aimed to develop a procedure for measurement of AMPK activity together with nucleotide metabolic enzymes and its application for studies of mice treated with high-fat diet. The method developed was based on analysis of conversion of AMARA peptide to pAMARA by partially purified heart homogenate by liquid chromatography/mass spectrometry (LC/MS). Activities of the enzymes of nucleotide metabolism were evaluated by analysis of conversion of substrates into products by HPLC. The method was applied for analysis of hearts of mice fed 12 weeks with low- (LFD) or high-fat diet (HFD). The optimized method for AMPK activity analysis (measured in presence of AMP) revealed change of activity from 0.089 ± 0.035 pmol/min/mg protein in LFD to 0.024 ± 0.002 in HFD. This coincided with increase of adenosine deaminase (ADA) activity from 0.11 ± 0.02 to 0.19 ± 0.06 nmol/mg tissue/min and decrease of AMP-deaminase (AMPD) activity from 1.26 ± 0.35 to 0.56 ± 0.15 nmol/mg tissue/min for LFD and HFD, respectively. We have proven quality of our LC/MS method for analysis of AMPK activity. We observed decrease in AMPK activity in the heart of mice treated with high-fat diet. However, physiological consequences of this change could be modulated by decrease in AMPD activity.
Wu, L-P; Yang, T; Liu, H-W; Postman, J; Li, R
2018-05-01
A large contig with sequence similarities to several nucleorhabdoviruses was identified by high-throughput sequencing analysis from a black currant (Ribes nigrum L.) cultivar. The complete genome sequence of this new nucleorhabdovirus is 14,432 nucleotides long. Its genomic organization is very similar to those of unsegmented plant rhabdoviruses, containing six open reading frames in the order 3'-N-P-P3-M-G-L-5. The virus, which is provisionally named "black currant-associated rhabdovirus", is 41-52% identical in its genome nucleotide sequence to other nucleorhabdoviruses and may represent a new species in the genus Nucleorhabdovirus.
NASA Astrophysics Data System (ADS)
Tsyganov, M. M.; Ibragimova, M. K.; Karabut, I. V.; Freydin, M. B.; Choinzonov, E. L.; Litvyakov, N. V.
2015-11-01
Our previous research establishes that changes of expression of the ATP-binding cassette genes family is connected with the neoadjuvant chemotherapy effect. However, the mechanism of regulation of resistance gene expression remains unclear. As many researchers believe, single nucleotide polymorphisms can be involved in this process. Thereupon, microarray analysis is used to study polymorphisms in ATP-binding cassette genes. It is thus found that MDR gene expression is connected with 5 polymorphisms, i.e. rs241432, rs241429, rs241430, rs3784867, rs59409230, which participate in the regulation of expression of own genes.
Wen, Chiu-Ming
2017-08-01
An aquabirnavirus was isolated from diseased marbled eels (Anguilla marmorata; MEIPNV1310) with gill haemorrhages and associated mortality. Its genome segment sequences were obtained through next-generation sequencing and compared with published aquabirnavirus sequences. The results indicated that the genome sequence of MEIPNV1310 contains segment A (3099 nucleotides) and segment B (2789 nucleotides). Phylogenetic analysis showed that MEIPNV1310 is closely related to the infectious pancreatic necrosis Ab strain within genogroup II. This genome sequence is beneficial for studying the geographic distribution and evolution of aquabirnaviruses.
Nakajima, Kazuki; Ito, Emi; Ohtsubo, Kazuaki; Shirato, Ken; Takamiya, Rina; Kitazume, Shinobu; Angata, Takashi; Taniguchi, Naoyuki
2013-01-01
Nucleotide sugars are the donor substrates of various glycosyltransferases, and an important building block in N- and O-glycan biosynthesis. Their intercellular concentrations are regulated by cellular metabolic states including diseases such as cancer and diabetes. To investigate the fate of UDP-GlcNAc, we developed a tracing method for UDP-GlcNAc synthesis and use, and GlcNAc utilization using 13C6-glucose and 13C2-glucosamine, respectively, followed by the analysis of mass isotopomers using LC-MS. Metabolic labeling of cultured cells with 13C6-glucose and the analysis of isotopomers of UDP-HexNAc (UDP-GlcNAc plus UDP-GalNAc) and CMP-NeuAc revealed the relative contributions of metabolic pathways leading to UDP-GlcNAc synthesis and use. In pancreatic insulinoma cells, the labeling efficiency of a 13C6-glucose motif in CMP-NeuAc was lower compared with that in hepatoma cells. Using 13C2-glucosamine, the diversity of the labeling efficiency was observed in each sugar residue of N- and O-glycans on the basis of isotopomer analysis. In the insulinoma cells, the low labeling efficiencies were found for sialic acids as well as tri- and tetra-sialo N-glycans, whereas asialo N-glycans were found to be abundant. Essentially no significant difference in secreted hyaluronic acids was found among hepatoma and insulinoma cell lines. This indicates that metabolic flows are responsible for the low sialylation in the insulinoma cells. Our strategy should be useful for systematically tracing each stage of cellular GlcNAc metabolism. PMID:23720760
Chen, Hao; Dou, Yanguo; Tang, Yi; Zhang, Zhenjie; Zheng, Xiaoqiang; Niu, Xiaoyu; Yang, Jing; Yu, Xianglong; Diao, Youxiang
2015-01-01
A newly emerged duck parvovirus, which causes beak atrophy and dwarfism syndrome (BADS) in Cherry Valley ducks, has appeared in Northern China since March 2015. To explore the genetic diversity among waterfowl parvovirus isolates, the complete genome of an identified isolate designated SDLC01 was sequenced and analyzed in the present study. Genomic sequence analysis showed that SDLC01 shared 90.8%–94.6% of nucleotide identity with goose parvovirus (GPV) isolates and 78.6%–81.6% of nucleotide identity with classical Muscovy duck parvovirus (MDPV) isolates. Phylogenetic analysis of 443 nucleotides (nt) of the fragment A showed that SDLC01 was highly similar to a mule duck isolate (strain D146/02) and close to European GPV isolates but separate from Asian GPV isolates. Analysis of the left inverted terminal repeat regions revealed that SDLC01 had two major segments deleted between positions 160–176 and 306–322 nt compared with field GPV and MDPV isolates. Phylogenetic analysis of Rep and VP1 encoded by two major open reading frames of parvoviruses revealed that SDLC01 was distinct from all GPV and MDPV isolates. The viral pathogenicity and genome characterization of SDLC01 suggest that the novel GPV (N-GPV) is the causative agent of BADS and belongs to a distinct GPV-related subgroup. Furthermore, N-GPV sequences were detected in diseased ducks by polymerase chain reaction and viral proliferation was demonstrated in duck embryos and duck embryo fibroblast cells. PMID:26465143
Major soybean maturity gene haplotypes revealed by SNPViz analysis of 72 sequenced soybean genomes
USDA-ARS?s Scientific Manuscript database
In this Genomics Era, vast amounts of next generation sequencing data have become publicly-available for multiple genomes across hundreds of species. Analysis of these large-scale datasets can become cumbersome, especially when comparing nucleotide polymorphisms across many samples within a dataset...
Liu, Xiaoming; Fu, Yun-Xin; Maxwell, Taylor J.; Boerwinkle, Eric
2010-01-01
It is known that sequencing error can bias estimation of evolutionary or population genetic parameters. This problem is more prominent in deep resequencing studies because of their large sample size n, and a higher probability of error at each nucleotide site. We propose a new method based on the composite likelihood of the observed SNP configurations to infer population mutation rate θ = 4Neμ, population exponential growth rate R, and error rate ɛ, simultaneously. Using simulation, we show the combined effects of the parameters, θ, n, ɛ, and R on the accuracy of parameter estimation. We compared our maximum composite likelihood estimator (MCLE) of θ with other θ estimators that take into account the error. The results show the MCLE performs well when the sample size is large or the error rate is high. Using parametric bootstrap, composite likelihood can also be used as a statistic for testing the model goodness-of-fit of the observed DNA sequences. The MCLE method is applied to sequence data on the ANGPTL4 gene in 1832 African American and 1045 European American individuals. PMID:19952140
Horiguchi, Sayaka; Nakayama, Kazuhiro; Iwamoto, Sadahiko; Ishijima, Akiko; Minezaki, Takayuki; Baba, Mamiko; Kontai, Yoshiko; Horikawa, Chika; Kawashima, Hiroshi; Shibata, Hiroshi; Kagawa, Yasuo; Kawabata, Terue
2016-02-01
We investigated whether the single nucleotide polymorphism rs174547 (T/C) of the fatty acid desaturase-1 gene, FADS1, is associated with changes in erythrocyte membrane and plasma phospholipid (PL) long-chain polyunsaturated fatty acid (LCPUFA) composition in elderly Japanese participants (n=124; 65 years or older; self-feeding and oral intake). The rs174547 C-allele carriers had significantly lower arachidonic acid (ARA; n-6 PUFA) and higher linoleic acid (LA, n-6 PUFA precursor) levels in erythrocyte membrane and plasma PL (15% and 6% ARA reduction, respectively, per C-allele), suggesting a low LA to ARA conversion rate in erythrocyte membrane and plasma PL of C-allele carriers. α-linolenic acid (n-3 PUFA precursor) levels were higher in the plasma PL of C-allele carriers, whereas levels of the n-3 LCPUFAs eicosapentaenoic acid (EPA) or docosahexaenoic acid (DHA) were unchanged in erythrocyte membrane and plasma PL. Thus, rs174547 genotypes were significantly associated with different ARA compositions of the blood of elderly Japanese. Copyright © 2015 Elsevier Ltd. All rights reserved.
Regier, Jerome C.; Mitter, Charles; Zwick, Andreas; Bazinet, Adam L.; Cummings, Michael P.; Kawahara, Akito Y.; Sohn, Jae-Cheon; Zwickl, Derrick J.; Cho, Soowon; Davis, Donald R.; Baixeras, Joaquin; Brown, John; Parr, Cynthia; Weller, Susan; Lees, David C.; Mitter, Kim T.
2013-01-01
Background Higher-level relationships within the Lepidoptera, and particularly within the species-rich subclade Ditrysia, are generally not well understood, although recent studies have yielded progress. We present the most comprehensive molecular analysis of lepidopteran phylogeny to date, focusing on relationships among superfamilies. Methodology / Principal Findings 483 taxa spanning 115 of 124 families were sampled for 19 protein-coding nuclear genes, from which maximum likelihood tree estimates and bootstrap percentages were obtained using GARLI. Assessment of heuristic search effectiveness showed that better trees and higher bootstrap percentages probably remain to be discovered even after 1000 or more search replicates, but further search proved impractical even with grid computing. Other analyses explored the effects of sampling nonsynonymous change only versus partitioned and unpartitioned total nucleotide change; deletion of rogue taxa; and compositional heterogeneity. Relationships among the non-ditrysian lineages previously inferred from morphology were largely confirmed, plus some new ones, with strong support. Robust support was also found for divergences among non-apoditrysian lineages of Ditrysia, but only rarely so within Apoditrysia. Paraphyly for Tineoidea is strongly supported by analysis of nonsynonymous-only signal; conflicting, strong support for tineoid monophyly when synonymous signal was added back is shown to result from compositional heterogeneity. Conclusions / Significance Support for among-superfamily relationships outside the Apoditrysia is now generally strong. Comparable support is mostly lacking within Apoditrysia, but dramatically increased bootstrap percentages for some nodes after rogue taxon removal, and concordance with other evidence, strongly suggest that our picture of apoditrysian phylogeny is approximately correct. This study highlights the challenge of finding optimal topologies when analyzing hundreds of taxa. It also shows that some nodes get strong support only when analysis is restricted to nonsynonymous change, while total change is necessary for strong support of others. Thus, multiple types of analyses will be necessary to fully resolve lepidopteran phylogeny. PMID:23554903
NASA Astrophysics Data System (ADS)
Nallala, Jayakrupakar; Gobinet, Cyril; Diebold, Marie-Danièle; Untereiner, Valérie; Bouché, Olivier; Manfait, Michel; Sockalingum, Ganesh Dhruvananda; Piot, Olivier
2012-11-01
Innovative diagnostic methods are the need of the hour that could complement conventional histopathology for cancer diagnosis. In this perspective, we propose a new concept based on spectral histopathology, using IR spectral micro-imaging, directly applied to paraffinized colon tissue array stabilized in an agarose matrix without any chemical pre-treatment. In order to correct spectral interferences from paraffin and agarose, a mathematical procedure is implemented. The corrected spectral images are then processed by a multivariate clustering method to automatically recover, on the basis of their intrinsic molecular composition, the main histological classes of the normal and the tumoral colon tissue. The spectral signatures from different histological classes of the colonic tissues are analyzed using statistical methods (Kruskal-Wallis test and principal component analysis) to identify the most discriminant IR features. These features allow characterizing some of the biomolecular alterations associated with malignancy. Thus, via a single analysis, in a label-free and nondestructive manner, main changes associated with nucleotide, carbohydrates, and collagen features can be identified simultaneously between the compared normal and the cancerous tissues. The present study demonstrates the potential of IR spectral imaging as a complementary modern tool, to conventional histopathology, for an objective cancer diagnosis directly from paraffin-embedded tissue arrays.
Genetic differentiation of Alaska Chinook salmon: the missing link for migratory studies.
Templin, William D; Seeb, James E; Jasper, James R; Barclay, Andrew W; Seeb, Lisa W
2011-03-01
Most information about Chinook salmon genetic diversity and life history originates from studies from the West Coast USA, western Canada and southeast Alaska; less is known about Chinook salmon from western and southcentral Alaska drainages. Populations in this large area are genetically distinct from populations to the south and represent an evolutionary legacy of unique genetic, phenotypic and life history diversity. More genetic information is necessary to advance mixed stock analysis applications for studies involving these populations. We assembled a comprehensive, open-access baseline of 45 single nucleotide polymorphisms (SNPs) from 172 populations ranging from Russia to California. We compare SNP data from representative populations throughout the range with particular emphasis on western and southcentral Alaska. We grouped populations into major lineages based upon genetic and geographic characteristics, evaluated the resolution for identifying the composition of admixtures and performed mixed stock analysis on Chinook salmon caught incidentally in the walleye pollock fishery in the Bering Sea. SNP data reveal complex genetic structure within Alaska and can be used in applications to address not only regional issues, but also migration pathways, bycatch studies on the high seas, and potential changes in the range of the species in response to climate change. © 2011 Blackwell Publishing Ltd.
El-Halawany, Nermin; Abd-El-Monsif, Shawky A; Al-Tohamy Ahmed, F M; Hegazy, Lamees; Abdel-Shafy, Hamdy; Abdel-Latif, Magdy A; Ghazi, Yasser A; Neuhoff, Christiane; Salilew-Wondim, Dessie; Schellander, Karl
2017-03-01
Mastitis is an infectious disease of the mammary gland that leads to reduced milk production and change in milk composition. Complement component C3 plays a major role as a central molecule of the complement cascade involving in killing of microorganisms, either directly or in cooperation with phagocytic cells. C3 cDNA were isolated, from Egyptian buffalo and cattle, sequenced and characterized. The C3 cDNA sequences of buffalo and cattle consist of 5025 and 5019 bp, respectively. Buffalo and cattle C3 cDNAs share 99% of sequence identity with each other. The 4986 bp open reading frame in buffalo encodes a putative protein of 1661 amino acids-as in cattle-and includes all the functional domains. Further, analysis of the C3 cDNA sequences detected six novel single-nucleotide polymorphisms (SNPs) in buffalo and three novel SNPs in cattle. The association analysis of the detected SNPs with milk somatic cell score as an indicator of mastitis revealed that the most significant association in buffalo was found in the C>A substitution (ss: 1752816097) in exon 27, whereas in cattle it was in the C>T substitution (ss: 1752816085) in exon 12. Our findings provide preliminary information about the contribution of C3 polymorphisms to mastitis resistance in buffalo and cattle.
Roetker, Nicholas S; Page, C David; Yonker, James A; Chang, Vicky; Roan, Carol L; Herd, Pamela; Hauser, Taissa S; Hauser, Robert M; Atwood, Craig S
2013-10-01
We examined depression within a multidimensional framework consisting of genetic, environmental, and sociobehavioral factors and, using machine learning algorithms, explored interactions among these factors that might better explain the etiology of depressive symptoms. We measured current depressive symptoms using the Center for Epidemiologic Studies Depression Scale (n = 6378 participants in the Wisconsin Longitudinal Study). Genetic factors were 78 single nucleotide polymorphisms (SNPs); environmental factors-13 stressful life events (SLEs), plus a composite proportion of SLEs index; and sociobehavioral factors-18 personality, intelligence, and other health or behavioral measures. We performed traditional SNP associations via logistic regression likelihood ratio testing and explored interactions with support vector machines and Bayesian networks. After correction for multiple testing, we found no significant single genotypic associations with depressive symptoms. Machine learning algorithms showed no evidence of interactions. Naïve Bayes produced the best models in both subsets and included only environmental and sociobehavioral factors. We found no single or interactive associations with genetic factors and depressive symptoms. Various environmental and sociobehavioral factors were more predictive of depressive symptoms, yet their impacts were independent of one another. A genome-wide analysis of genetic alterations using machine learning methodologies will provide a framework for identifying genetic-environmental-sociobehavioral interactions in depressive symptoms.
Large-scale, multi-genome analysis of alternate open reading frames in bacteria and archaea.
Veloso, Felipe; Riadi, Gonzalo; Aliaga, Daniela; Lieph, Ryan; Holmes, David S
2005-01-01
Analysis of over 300,000 annotated genes in 105 bacterial and archaeal genomes reveals an unexpectedly high frequency of large (>300 nucleotides) alternate open reading frames (ORFs). Especially notable is the very high frequency of alternate ORFs in frames +3 and -1 (where the annotated gene is defined as frame +1). The occurrence of alternate ORFs is correlated with genomic G+C content and is strongly influenced by synonymous codon usage bias. The frequency of alternate ORFs in frame -1 is also influenced by the occurrence of codons encoding leucine and serine in frame +1. Although some alternate ORFs have been shown to encode proteins, many others are probably not expressed because they lack appropriate signals for transcription and translation. These latter can be mis-annotated by automatic gene finding programs leading to errors in public databases. Especially prone to mis-annotation is frame -1, because it exhibits a potential codon usage and theoretical capacity to encode proteins with an amino acid composition most similar to real genes. Some alternate ORFs are conserved across bacterial or archaeal species, and can give rise to misannotated "conserved hypothetical" genes, while others are unique to a genome and are misidentified as "hypothetical orphan" genes, contributing significantly to the orphan gene paradox.
Pearce, John M.; Ramey, Andrew M.; Flint, Paul L.; Koehler, Anson V.; Fleskes, Joseph P.; Franson, J. Christian; Hall, Jeffrey S.; Derksen, Dirk V.; Ip, Hon S.
2009-01-01
Although continental populations of avian influenza viruses are genetically distinct, transcontinental reassortment in low pathogenic avian influenza (LPAI) viruses has been detected in migratory birds. Thus, genomic analyses of LPAI viruses could serve as an approach to prioritize species and regions targeted by North American surveillance activities for foreign origin highly pathogenic avian influenza (HPAI). To assess the applicability of this approach, we conducted a phylogenetic and population genetic analysis of 68 viral genomes isolated from the northern pintail (Anas acuta) at opposite ends of the Pacific migratory flyway in North America. We found limited evidence for Asian LPAI lineages on wintering areas used by northern pintails in California in contrast to a higher frequency on breeding locales of Alaska. Our results indicate that the number of Asian LPAI lineages observed in Alaskan northern pintails, and the nucleotide composition of LPAI lineages, is not maintained through fall migration. Accordingly, our data indicate that surveillance of Pacific Flyway northern pintails to detect foreign avian influenza viruses would be most effective in Alaska. North American surveillance plans could be optimized through an analysis of LPAI genomics from species that demonstrate evolutionary linkages with European or Asian lineages and in regions that have overlapping migratory flyways with areas of HPAI outbreaks.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Pandey, V.N.; Modak, M.J.
Terminal deoxynucleotidyltransferase is the only DNA polymerase that is strongly inhibited in the presence of ATP. We have labeled calf terminal deoxynucleotidyltransferase with (/sup 32/P)ATP in order to identify its binding site in terminal deoxynucleotidyltransferase. The specificity of ATP cross-linking to terminal deoxynucleotidyltransferase is shown by the competitive inhibition of the overall cross-linking reaction by deoxynucleoside triphosphates, as well as the ATP analogs Ap4A and Ap5A. Tryptic peptide mapping of (/sup 32/P)ATP-labeled enzyme revealed a peptide fraction that contained the majority of cross-linked ATP. The properties, chromatographic characteristics, amino acid composition, and sequence analysis of this peptide fraction were identicalmore » with those found associated with dTTP cross-linked terminal deoxynucleotidyl-transferase peptide. The involvement of the same 2 cysteine residues in the crosslinking of both nucleotides further confirmed the unity of the ATP and dTTP binding domain that contains residues 224-237 in the primary amino acid sequence of calf terminal deoxynucleotidyltransferase.« less
The complete mitochondrial genome of Cricetulus kamensis (Rodentia: Cricetidae).
Kang, Chunlan; Yue, Hao; Liu, Mengyao; Huang, Ting; Liu, Yang; Zhang, Xiuyue; Yue, Bisong; Zeng, Tao; Liu, Shaoying
2016-01-01
The Cricetulus kamensis is endemic to China and is popular as pet. In the present study, the complete mitogenome of C. kamensis was first determined. It was 16,270 bp in length and the composition and arrangement of its genes are analogous to most other mammals. The overall base composition of heavy strand is 33.2% A, 26.8% T, 27.2% C and 12.7% G. The sequence is highly G-C poor (∼40%) and A is the most numerous nucleotide followed by T >C >G, which is similar to other mammalian mitochondrial genomes. It is notable that three extra bases "CAT" were inserted in cytb at the 3' end position and no stop codon was found for this coding region. The mitogenome sequence of C. kamensis could contribute to a better solution of its phylogenetic position and phylogenetic relationship within Cricetinae in the future.
Steer, Penelope A.; Kirkpatrick, Naomi C.; O'Rourke, Denise; Noormohammadi, Amir H.
2009-01-01
Identification of fowl adenovirus (FAdV) serotypes is of importance in epidemiological studies of disease outbreaks and the adoption of vaccination strategies. In this study, real-time PCR and subsequent high-resolution melting (HRM)-curve analysis of three regions of the hexon gene were developed and assessed for their potential in differentiating 12 FAdV reference serotypes. The results were compared to previously described PCR and restriction enzyme analyses of the hexon gene. Both HRM-curve analysis of a 191-bp region of the hexon gene and restriction enzyme analysis failed to distinguish a number of serotypes used in this study. In addition, PCR of the region spanning nucleotides (nt) 144 to 1040 failed to amplify FAdV-5 in sufficient quantities for further analysis. However, HRM-curve analysis of the region spanning nt 301 to 890 proved a sensitive and specific method of differentiating all 12 serotypes. All melt curves were highly reproducible, and replicates of each serotype were correctly genotyped with a mean confidence value of more than 99% using normalized HRM curves. Sequencing analysis revealed that each profile was related to a unique sequence, with some sequences sharing greater than 94% identity. Melting-curve profiles were found to be related mainly to GC composition and distribution throughout the amplicons, regardless of sequence identity. The results presented in this study show that the closed-tube method of PCR and HRM-curve analysis provides an accurate, rapid, and robust genotyping technique for the identification of FAdV serotypes and can be used as a model for developing genotyping techniques for other pathogens. PMID:19036935
Lim, Chun Shen; Brown, Chris M
2016-09-01
Many viruses contain RNA elements that modulate splicing and/or promote nuclear export of their RNAs. The RNAs of the major human pathogen, hepatitis B virus (HBV) contain a large (~600 bases) composite cis-acting 'post-transcriptional regulatory element' (PRE). This element promotes expression from these naturally intronless transcripts. Indeed, the related woodchuck hepadnavirus PRE (WPRE) is used to enhance expression in gene therapy and other expression vectors. These PRE are likely to act through a combination of mechanisms, including promotion of RNA nuclear export. Functional components of both the HBV PRE and WPRE are 2 conserved RNA cis-acting stem-loop (SL) structures, SLα and SLβ. They are within the coding regions of polymerase (P) gene, and both P and X genes, respectively. Based on previous studies using mutagenesis and/or nuclear magnetic resonance (NMR), here we propose 2 covariance models for SLα and SLβ. The model for the 30-nucleotide SLα contains a G-bulge and a CNGG(U) apical loop of which the first and the fourth loop residues form a CG pair and the fifth loop residue is bulged out, as observed in the NMR structure. The model for the 23-nucleotide SLβ contains a 7-base-pair stem and a 9-nucleotide loop. Comparison of the models with other RNA structural elements, as well as similarity searches of human transcriptome and viral genomes demonstrate that SLα and SLβ are specific to HBV transcripts. However, they are well conserved among the hepadnaviruses of non-human primates, the woodchuck and ground squirrel.
Lim, Chun Shen; Brown, Chris M.
2016-01-01
ABSTRACT Many viruses contain RNA elements that modulate splicing and/or promote nuclear export of their RNAs. The RNAs of the major human pathogen, hepatitis B virus (HBV) contain a large (~600 bases) composite cis-acting 'post-transcriptional regulatory element' (PRE). This element promotes expression from these naturally intronless transcripts. Indeed, the related woodchuck hepadnavirus PRE (WPRE) is used to enhance expression in gene therapy and other expression vectors. These PRE are likely to act through a combination of mechanisms, including promotion of RNA nuclear export. Functional components of both the HBV PRE and WPRE are 2 conserved RNA cis-acting stem-loop (SL) structures, SLα and SLβ. They are within the coding regions of polymerase (P) gene, and both P and X genes, respectively. Based on previous studies using mutagenesis and/or nuclear magnetic resonance (NMR), here we propose 2 covariance models for SLα and SLβ. The model for the 30-nucleotide SLα contains a G-bulge and a CNGG(U) apical loop of which the first and the fourth loop residues form a CG pair and the fifth loop residue is bulged out, as observed in the NMR structure. The model for the 23-nucleotide SLβ contains a 7-base-pair stem and a 9-nucleotide loop. Comparison of the models with other RNA structural elements, as well as similarity searches of human transcriptome and viral genomes demonstrate that SLα and SLβ are specific to HBV transcripts. However, they are well conserved among the hepadnaviruses of non-human primates, the woodchuck and ground squirrel. PMID:27031749
Li, Guang-Qing; Liu, Zi; Shen, Hong-Bin; Yu, Dong-Jun
2016-10-01
As one of the most ubiquitous post-transcriptional modifications of RNA, N 6 -methyladenosine ( [Formula: see text]) plays an essential role in many vital biological processes. The identification of [Formula: see text] sites in RNAs is significantly important for both basic biomedical research and practical drug development. In this study, we designed a computational-based method, called TargetM6A, to rapidly and accurately target [Formula: see text] sites solely from the primary RNA sequences. Two new features, i.e., position-specific nucleotide/dinucleotide propensities (PSNP/PSDP), are introduced and combined with the traditional nucleotide composition (NC) feature to formulate RNA sequences. The extracted features are further optimized to obtain a much more compact and discriminative feature subset by applying an incremental feature selection (IFS) procedure. Based on the optimized feature subset, we trained TargetM6A on the training dataset with a support vector machine (SVM) as the prediction engine. We compared the proposed TargetM6A method with existing methods for predicting [Formula: see text] sites by performing stringent jackknife tests and independent validation tests on benchmark datasets. The experimental results show that the proposed TargetM6A method outperformed the existing methods for predicting [Formula: see text] sites and remarkably improved the prediction performances, with MCC = 0.526 and AUC = 0.818. We also provided a user-friendly web server for TargetM6A, which is publicly accessible for academic use at http://csbio.njust.edu.cn/bioinf/TargetM6A.
Yuan, Kejun; Wang, Changjun; Xin, Li; Zhang, Anning; Ai, Chengxiang
2013-07-25
A farnesyl diphosphate synthase gene (FPPS2), which contains 11 introns and 12 exons, was isolated from the apple cultivar "White Winter Pearmain". When it was compared to our previously reported FPPS1, its each intron size was different, its each exon size was the same as that of FPPS1 gene, 30 nucleotide differences were found in its coding sequence. Based on these nucleotide differences, specific primers were designed to perform expression analysis; the results showed that it expressed in both fruit and leaf, its expression level was obviously lower than that of FPPS1 gene in fruit which was stored at 4°C for 5 weeks. This is the first report concerning two FPPS genes and their expression comparison in apples. Copyright © 2013 Elsevier B.V. All rights reserved.
Sasaya, Takahide; Kusaba, Shinnosuke; Ishikawa, Koichi; Koganezawa, Hiroki
2004-09-01
Lettuce big-vein virus (LBVV) is the type species of the genus Varicosavirus and is a two-segmented negative-sense single-stranded RNA virus. The larger LBVV genome segment (RNA1) consists of 6797 nt and encodes an L polymerase that resembles that of rhabdoviruses. Here, the nucleotide sequence of the second LBVV genome segment (RNA2) is reported. LBVV RNA2 consisted of 6081 nt and contained antisense information for five major ORFs: ORF1 (nt 210-1403 on the viral RNA), ORF2 (nt 1493-2494), ORF3 (nt 2617-3489), ORF4 (nt 3843-4337) and ORF5 (nt 4530-5636), which had coding capacities of 44, 36, 32, 19 and 41 kDa, respectively. The gene at the 3' end of the viral RNA encoded a coat protein, while the other four genes encoded proteins of unknown functions. The 3'-terminal 11 nt of LBVV RNA2 were identical to those of LBVV RNA1, and the 5'-terminal regions of LBVV RNA1 and RNA2 contained a long common nucleotide stretch of about 100 nt. Northern blot analysis using probes specific to the individual ORFs revealed that LBVV transcribes monocistronic RNAs. Analysis of the terminal sequences, and primer extension and RNase H digestion analysis of LBVV mRNAs, suggested that LBVV utilizes a transcription termination/initiation strategy comparable with that of rhabdoviruses.
Large meta-analysis of genome-wide association studies identifies five loci for lean body mass.
Zillikens, M Carola; Demissie, Serkalem; Hsu, Yi-Hsiang; Yerges-Armstrong, Laura M; Chou, Wen-Chi; Stolk, Lisette; Livshits, Gregory; Broer, Linda; Johnson, Toby; Koller, Daniel L; Kutalik, Zoltán; Luan, Jian'an; Malkin, Ida; Ried, Janina S; Smith, Albert V; Thorleifsson, Gudmar; Vandenput, Liesbeth; Hua Zhao, Jing; Zhang, Weihua; Aghdassi, Ali; Åkesson, Kristina; Amin, Najaf; Baier, Leslie J; Barroso, Inês; Bennett, David A; Bertram, Lars; Biffar, Rainer; Bochud, Murielle; Boehnke, Michael; Borecki, Ingrid B; Buchman, Aron S; Byberg, Liisa; Campbell, Harry; Campos Obanda, Natalia; Cauley, Jane A; Cawthon, Peggy M; Cederberg, Henna; Chen, Zhao; Cho, Nam H; Jin Choi, Hyung; Claussnitzer, Melina; Collins, Francis; Cummings, Steven R; De Jager, Philip L; Demuth, Ilja; Dhonukshe-Rutten, Rosalie A M; Diatchenko, Luda; Eiriksdottir, Gudny; Enneman, Anke W; Erdos, Mike; Eriksson, Johan G; Eriksson, Joel; Estrada, Karol; Evans, Daniel S; Feitosa, Mary F; Fu, Mao; Garcia, Melissa; Gieger, Christian; Girke, Thomas; Glazer, Nicole L; Grallert, Harald; Grewal, Jagvir; Han, Bok-Ghee; Hanson, Robert L; Hayward, Caroline; Hofman, Albert; Hoffman, Eric P; Homuth, Georg; Hsueh, Wen-Chi; Hubal, Monica J; Hubbard, Alan; Huffman, Kim M; Husted, Lise B; Illig, Thomas; Ingelsson, Erik; Ittermann, Till; Jansson, John-Olov; Jordan, Joanne M; Jula, Antti; Karlsson, Magnus; Khaw, Kay-Tee; Kilpeläinen, Tuomas O; Klopp, Norman; Kloth, Jacqueline S L; Koistinen, Heikki A; Kraus, William E; Kritchevsky, Stephen; Kuulasmaa, Teemu; Kuusisto, Johanna; Laakso, Markku; Lahti, Jari; Lang, Thomas; Langdahl, Bente L; Launer, Lenore J; Lee, Jong-Young; Lerch, Markus M; Lewis, Joshua R; Lind, Lars; Lindgren, Cecilia; Liu, Yongmei; Liu, Tian; Liu, Youfang; Ljunggren, Östen; Lorentzon, Mattias; Luben, Robert N; Maixner, William; McGuigan, Fiona E; Medina-Gomez, Carolina; Meitinger, Thomas; Melhus, Håkan; Mellström, Dan; Melov, Simon; Michaëlsson, Karl; Mitchell, Braxton D; Morris, Andrew P; Mosekilde, Leif; Newman, Anne; Nielson, Carrie M; O'Connell, Jeffrey R; Oostra, Ben A; Orwoll, Eric S; Palotie, Aarno; Parker, Stephen C J; Peacock, Munro; Perola, Markus; Peters, Annette; Polasek, Ozren; Prince, Richard L; Räikkönen, Katri; Ralston, Stuart H; Ripatti, Samuli; Robbins, John A; Rotter, Jerome I; Rudan, Igor; Salomaa, Veikko; Satterfield, Suzanne; Schadt, Eric E; Schipf, Sabine; Scott, Laura; Sehmi, Joban; Shen, Jian; Soo Shin, Chan; Sigurdsson, Gunnar; Smith, Shad; Soranzo, Nicole; Stančáková, Alena; Steinhagen-Thiessen, Elisabeth; Streeten, Elizabeth A; Styrkarsdottir, Unnur; Swart, Karin M A; Tan, Sian-Tsung; Tarnopolsky, Mark A; Thompson, Patricia; Thomson, Cynthia A; Thorsteinsdottir, Unnur; Tikkanen, Emmi; Tranah, Gregory J; Tuomilehto, Jaakko; van Schoor, Natasja M; Verma, Arjun; Vollenweider, Peter; Völzke, Henry; Wactawski-Wende, Jean; Walker, Mark; Weedon, Michael N; Welch, Ryan; Wichmann, H-Erich; Widen, Elisabeth; Williams, Frances M K; Wilson, James F; Wright, Nicole C; Xie, Weijia; Yu, Lei; Zhou, Yanhua; Chambers, John C; Döring, Angela; van Duijn, Cornelia M; Econs, Michael J; Gudnason, Vilmundur; Kooner, Jaspal S; Psaty, Bruce M; Spector, Timothy D; Stefansson, Kari; Rivadeneira, Fernando; Uitterlinden, André G; Wareham, Nicholas J; Ossowski, Vicky; Waterworth, Dawn; Loos, Ruth J F; Karasik, David; Harris, Tamara B; Ohlsson, Claes; Kiel, Douglas P
2017-07-19
Lean body mass, consisting mostly of skeletal muscle, is important for healthy aging. We performed a genome-wide association study for whole body (20 cohorts of European ancestry with n = 38,292) and appendicular (arms and legs) lean body mass (n = 28,330) measured using dual energy X-ray absorptiometry or bioelectrical impedance analysis, adjusted for sex, age, height, and fat mass. Twenty-one single-nucleotide polymorphisms were significantly associated with lean body mass either genome wide (p < 5 × 10 -8 ) or suggestively genome wide (p < 2.3 × 10 -6 ). Replication in 63,475 (47,227 of European ancestry) individuals from 33 cohorts for whole body lean body mass and in 45,090 (42,360 of European ancestry) subjects from 25 cohorts for appendicular lean body mass was successful for five single-nucleotide polymorphisms in/near HSD17B11, VCAN, ADAMTSL3, IRS1, and FTO for total lean body mass and for three single-nucleotide polymorphisms in/near VCAN, ADAMTSL3, and IRS1 for appendicular lean body mass. Our findings provide new insight into the genetics of lean body mass.Lean body mass is a highly heritable trait and is associated with various health conditions. Here, Kiel and colleagues perform a meta-analysis of genome-wide association studies for whole body lean body mass and find five novel genetic loci to be significantly associated.
Mangrauthia, Satendra K; Malathi, P; Agarwal, Surekha; Ramkumar, G; Krishnaveni, D; Neeraja, C N; Madhav, M Sheshu; Ladhalakshmi, D; Balachandran, S M; Viraktamath, B C
2012-06-01
Rice tungro disease, one of the major constraints to rice production in South and Southeast Asia, is caused by a combination of two viruses: Rice tungro spherical virus (RTSV) and Rice tungro bacilliform virus (RTBV). The present study was undertaken to determine the genetic variation of RTSV population present in tungro endemic states of Indian subcontinent. Phylogenetic analysis based on coat protein sequences showed distinct divergence of Indian RTSV isolates into two groups; one consisted isolates from Hyderabad (Andhra Pradesh), Cuttack (Orissa), and Puducherry and another from West Bengal, Coimbatore (Tamil Nadu), and Kanyakumari (Tamil Nadu). The results obtained from phylogenetic study were further supported with the SNPs (single nucleotide polymorphism), INDELs (insertion and deletion) and evolutionary distance analysis. In addition, sequence difference count matrix revealed 2-68 nucleotides differences among all the Indian RTSV isolates taken in this study. However, at the protein level these differences were not significant as revealed by Ka/Ks ratio calculation. Sequence identity at nucleotide and amino acid level was 92-100% and 97-100%, respectively, among Indian isolates of RTSV. Understanding of the population structure of RTSV from tungro endemic regions of India would potentially provide insights into the molecular diversification of this virus.
Ewing's sarcoma: analysis of single nucleotide polymorphism in the EWS gene.
Silva, Deborah S B S; Sawitzki, Fernanda R; De Toni, Elisa C; Graebin, Pietra; Picanco, Juliane B; Abujamra, Ana Lucia; de Farias, Caroline B; Roesler, Rafael; Brunetto, Algemir L; Alho, Clarice S
2012-11-10
We aimed to investigate single nucleotide polymorphisms (SNPs) in the EWS gene breaking region in order to analyze Ewing's sarcoma susceptibility. The SNPs were investigated in a healthy subject population and in Ewing's sarcoma patients from Southern Brazil. Genotyping was performed by TaqMan® assay for allelic discrimination using Real-Time PCR. The analysis of incidence of SNPs or different SNP-arrangements revealed a higher presence of homozygote TT-rs4820804 in Ewing's sarcoma patients (p=0.02; Chi Square Test). About 300 bp from the rs4820804 SNP lies a palindromic hexamer (5'-GCTAGC-3') and three nucleotides (GTC), which were previously identified to be in close vicinity of the breakpoint junction in both EWS and FLI1 genes. This DNA segment surrounding the rs4820804 SNP is likely to indicate a breakpoint region. If the T-rs4820804 allele predisposes a DNA fragment to breakage, homozygotes (TT-rs4820804) would have double the chance of having a chromosome break, increasing the chances for a translocation to occur. In conclusion, the TT-rs4820804 EWS genotype can be associated with Ewing's sarcoma and the SNP rs4820804 can be a candidate marker to understand Ewing's sarcoma susceptibility. Copyright © 2012 Elsevier B.V. All rights reserved.
Identification of two allelic IgG1 C(H) coding regions (Cgamma1) of cat.
Kanai, T H; Ueda, S; Nakamura, T
2000-01-31
Two types of cDNA encoding IgG1 heavy chain (gamma1) were isolated from a single domestic short-hair cat. Sequence analysis indicated a higher level of similarity of these Cgamma1 sequences to human Cgamma1 sequence (76.9 and 77.0%) than to mouse sequence (70.0 and 69.7%) at the nucleotide level. Predicted primary structures of both the feline Cgamma1 genes, designated as Cgamma1a and Cgamma1b, were similar to that of human Cgamma1 gene, for instance, as to the size of constant domains, the presence of six conserved cysteine residues involved in formation of the domain structure, and the location of a conserved N-linked glycosylation site. Sequence comparison between the two alleles showed that 7 out of 10 nucleotide differences were within the C(H)3 domain coding region, all leading to nonsynonymous changes in amino acid residues. Partial sequence analysis of genomic clones showed three nucleotide substitutions between the two Cgamma1 alleles in the intron between the CH2 and C(H)3 domain coding regions. In 12 domestic short-hair cats used in this study, the frequency of Cgamma1a allele (62.5%) was higher than that of the Cgamma1b allele (37.5%).
Wang, Mengyun; Li, Qiaoxin; Gu, Chengyuan; Zhu, Yao; Yang, Yajun; Wang, Jiucun; Jin, Li; He, Jing; Ye, Dingwei; Wei, Qingyi
2017-04-11
Genetic variants of nucleotide excision repair (NER) genes have been extensively investigated for their roles in the development of prostate cancer (PCa); however, the published results have been inconsistent. In a hospital-based case-control study of 1,004 PCa cases and 1,055 cancer-free controls, we genotyped eight potentially functional single nucleotide polymorphisms (SNPs) of NER genes (i.e., XPC, rs2228001 T>G and rs1870134 G>C; XPD, rs13181 T>G and rs238406 G>T; XPG, rs1047768 T>C, rs751402 C>T, and rs17655 G>C; and XPF, rs2276464 G>C) and assessed their associations with risk of PCa by using logistic regression analysis. Among these eight SNPs investigated, only XPC rs1870134 CG/CC variant genotypes were associated with a decreased risk of prostate cancer under a dominant genetic model (adjusted odds ratio [OR] = 0.77, 95% confidence interval [CI] = 0.64-1.91, P = 0.003). Phenotype-genotype analysis also suggested that the XPC rs1870134 CG/CC variant genotypes were associated with significantly decreased expression levels of XPC mRNA in a mix population of different ethnicities. These findings suggested that XPC SNPs may contribute to risk of PCa in Eastern Chinese men.
[Molecular epidemiological analysis of rubella virus isolates from 2001 to 2011 in Shanghai, China].
Li, Chong-Shan; Yang, Yu-Ying; Wang, Jian-Guo; Zhu, Zhen; Tang, Wei; Li, Zhi; Sun, Xiao-Dong; Xu, Wen-Bo
2012-03-01
Throat swabs collected from patients whose serum was measles IgM negative and rubella IgM positive during 2001-2011 were used to conduct cell culture for rubella virus. After identification of cell culture with RT-PCR, nucleotide of gene E1 of rubella virus was amplified and sequenced, followed by molecular epidemiological analysis. A total of 31 rubella viruses were isolated from 60 throat swabs. Compared 27 isolates with the WHO reference strains of all genotypes, phylogenetic tree was constructed based on the amplified 739 nucleotide fragment. These isolates belonged to two different genotypes respectively. Isolates 11009, 11052 and 11106 in 2011 belonged to genotype 2B, and others belonged to genotype 1E. Most of mutations were nonsense mutation, and sequence of amino acid was highly conserved. Amino acid sequence of most isolates of genotype 1E was identical, which suggested rubella viruses from same transmission chain might be transmitted continually since 2001. Rubella virus genotype 2B was found to be popular for the first time in Shanghai in 2011. The nucleotide sequences of these genotype 2B isolates showed 99% identity compared with that of isolates recently from Vietnam, Japan and Argentina. The resources of these strains were not confirmed due to the absence of rubella virus surveillance before.
Fan, Jing; Chen, Chunxian; Yu, Qibin; Li, Zheng-Guo; Gmitter, Frederick G
2010-10-01
Three putative terpenoid UDP-glycosyltransferase (UGT) genes, designated CsUGT1, CsUGT2, and CsUGT3, were isolated and characterized in 'Valencia' sweet orange (Citrus sinensis L. Osbeck). CsUGT1 consisted of 1493 nucleotides with an open reading frame encoding 492 amino acids, CsUGT2 consisted of 1727 nucleotides encoding 504 amino acids, and CsUGT3 consisted of 1705 nucleotides encoding 468 amino acids. CsUGT3 had a 145 bp intron at 730-874, whereas CsUGT1 and CsUGT2 had none. The three deduced glycosyltransferase proteins had a highly conserved plant secondary product glycosyltransferase motif in the C terminus. Phylogenetic analysis showed that CsUGT1 and CsUGT3 were classified into group L of glycosyltransferase family 1, and CsUGT2 was classified into group D. Through Southern blotting analysis, CsUGT1 was found to have two copies in the sweet orange genome, whereas CsUGT2 and CsUGT3 had at least seven and nine copies, respectively. CsUGT1, CsUGT2, and CsUGT3 were constitutively expressed in leaf, flower, and fruit tissues. The results facilitate further investigation of the function of terpenoid glycosyltransferases in citrus and the biosynthesis of terpenoid glycosides in vitro.
Sun, Yan-Lin; Kang, Ho-Min; Kim, Young-Sik; Baek, Jun-Pill; Zheng, Shi-Lin; Xiang, Jin-Jun; Hong, Soon-Kwan
2014-05-04
The tomato ( Solanum lycopersicum ) is a major vegetable crop worldwide. To satisfy popular demand, more than 500 tomato varieties have been bred. However, a clear variety identification has not been found. Thorough understanding of the phylogenetic relationship and hybridization information of tomato varieties is very important for further variety breeding. Thus, in this study, we collected 26 tomato varieties and attempted to distinguish them based on the 5S rRNA region, which is widely used in the determination of phylogenetic relations. Sequence analysis of the 5S rRNA region suggested that a large number of nucleotide variations exist among tomato varieties. These variable nucleotide sites were also informative regarding hybridization. Chromas sequencing of Yellow Mountain View and Seuwiteuking varieties indicated three and one variable nucleotide sites in the non-transcribed spacer (NTS) of the 5S rRNA region showing hybridization, respectively. Based on a phylogenetic tree constructed using the 5S rRNA sequences, we observed that 16 tomato varieties were divided into three groups at 95% similarity. Rubiking and Sseommeoking, Lang Selection Procedure and Seuwiteuking, and Acorn Gold and Yellow Mountain View exhibited very high identity with their partners. This work will aid variety authentication and provides a basis for further tomato variety breeding.
Genetic Diversity and Molecular Evolution of a Violaxanthin De-epoxidase Gene in Maize.
Xu, Jing; Li, Zhigang; Yang, Haorui; Yang, Xiaohong; Chen, Cuixia; Li, Hui
2016-01-01
Violaxanthin de-epoxidase (VDE) has a critical role in the carotenoid biosynthesis pathway, which is involved in protecting the photosynthesis apparatus from damage caused by excessive light. Here, a VDE gene in maize, ZmVDE1, was cloned and shown to have functional domains in common with the gramineous VDE protein. Candidate gene association analysis indicated that no polymorphic sites in ZmVDE1 were significant association with any of the examined carotenoid-related traits at P = 0.05 in an association panel containing 155 maize inbred lines. Nucleotide diversity analysis of VDE1 in maize and teosinte indicated that its exon had less genetic variation, consistent with the conserved function of VDE1 in plants. In addition, dramatically reduced nucleotide diversity, fewer haplotypes and a significantly negative parameter deviation for Tajima's D test of ZmVDE1 in maize and teosinte suggested that a potential selective force had acted across the ZmVDE1 locus. We further identified a 4.2 Mb selective sweep with low recombination surrounding the ZmVDE1 locus that resulted in severely reduced nucleotide diversity on chromosome 2. Collectively, natural selection and the conserved domains of ZmVDE1 might show an important role in the xanthophyll cycle of the carotenoid biosynthesis pathway.
Genetic Diversity and Molecular Evolution of a Violaxanthin De-epoxidase Gene in Maize
Xu, Jing; Li, Zhigang; Yang, Haorui; Yang, Xiaohong; Chen, Cuixia; Li, Hui
2016-01-01
Violaxanthin de-epoxidase (VDE) has a critical role in the carotenoid biosynthesis pathway, which is involved in protecting the photosynthesis apparatus from damage caused by excessive light. Here, a VDE gene in maize, ZmVDE1, was cloned and shown to have functional domains in common with the gramineous VDE protein. Candidate gene association analysis indicated that no polymorphic sites in ZmVDE1 were significant association with any of the examined carotenoid-related traits at P = 0.05 in an association panel containing 155 maize inbred lines. Nucleotide diversity analysis of VDE1 in maize and teosinte indicated that its exon had less genetic variation, consistent with the conserved function of VDE1 in plants. In addition, dramatically reduced nucleotide diversity, fewer haplotypes and a significantly negative parameter deviation for Tajima’s D test of ZmVDE1 in maize and teosinte suggested that a potential selective force had acted across the ZmVDE1 locus. We further identified a 4.2 Mb selective sweep with low recombination surrounding the ZmVDE1 locus that resulted in severely reduced nucleotide diversity on chromosome 2. Collectively, natural selection and the conserved domains of ZmVDE1 might show an important role in the xanthophyll cycle of the carotenoid biosynthesis pathway. PMID:27507987
Single-Molecule Counting of Point Mutations by Transient DNA Binding
NASA Astrophysics Data System (ADS)
Su, Xin; Li, Lidan; Wang, Shanshan; Hao, Dandan; Wang, Lei; Yu, Changyuan
2017-03-01
High-confidence detection of point mutations is important for disease diagnosis and clinical practice. Hybridization probes are extensively used, but are hindered by their poor single-nucleotide selectivity. Shortening the length of DNA hybridization probes weakens the stability of the probe-target duplex, leading to transient binding between complementary sequences. The kinetics of probe-target binding events are highly dependent on the number of complementary base pairs. Here, we present a single-molecule assay for point mutation detection based on transient DNA binding and use of total internal reflection fluorescence microscopy. Statistical analysis of single-molecule kinetics enabled us to effectively discriminate between wild type DNA sequences and single-nucleotide variants at the single-molecule level. A higher single-nucleotide discrimination is achieved than in our previous work by optimizing the assay conditions, which is guided by statistical modeling of kinetics with a gamma distribution. The KRAS c.34 A mutation can be clearly differentiated from the wild type sequence (KRAS c.34 G) at a relative abundance as low as 0.01% mutant to WT. To demonstrate the feasibility of this method for analysis of clinically relevant biological samples, we used this technology to detect mutations in single-stranded DNA generated from asymmetric RT-PCR of mRNA from two cancer cell lines.
Molecular Characterization of Bombyx mori Cytoplasmic Polyhedrosis Virus Genome Segment 4
Ikeda, Keiko; Nagaoka, Sumiharu; Winkler, Stefan; Kotani, Kumiko; Yagi, Hiroaki; Nakanishi, Kae; Miyajima, Shigetoshi; Kobayashi, Jun; Mori, Hajime
2001-01-01
The complete nucleotide sequence of the genome segment 4 (S4) of Bombyx mori cytoplasmic polyhedrosis virus (BmCPV) was determined. The 3,259-nucleotide sequence contains a single long open reading frame which spans nucleotides 14 to 3187 and which is predicted to encode a protein with a molecular mass of about 130 kDa. Western blot analysis showed that S4 encodes BmCPV protein VP3, which is one of the outer components of the BmCPV virion. Sequence analysis of the deduced amino acid sequence of BmCPV VP3 revealed possible sequence homology with proteins from rice ragged stunt virus (RRSV) S2, Nilaparvata lugens reovirus S4, and Fiji disease fijivirus S4. This may suggest that plant reoviruses originated from insect viruses and that RRSV emerged more recently than other plant reoviruses. A chimeric protein consisting of BmCPV VP3 and green fluorescent protein (GFP) was constructed and expressed with BmCPV polyhedrin using a baculovirus expression vector. The VP3-GFP chimera was incorporated into BmCPV polyhedra and released under alkaline conditions. The results indicate that specific interactions occur between BmCPV polyhedrin and VP3 which might facilitate BmCPV virion occlusion into the polyhedra. PMID:11134312
Nucleotide sequence and phylogenetic analysis of Cucurbit yellow stunting disorder virus RNA 2.
Livieratos, Ioannis C; Coutts, Robert H A
2002-06-01
The complete nucleotide sequence of Cucurbit yellow stunting disorder virus (CYSDV) RNA 2, a whitefly (Bemisia tabaci)-transmitted closterovirus with a bi-partite genome, is reported. CYSDV RNA 2 is 7,281 nucleotides long and contains the closterovirus hallmark gene array with a similar arrangement to the prototype member of the genus Crinivirus, Lettuce infectious yellows virus (LIYV). CYSDV RNA 2 contains open reading frames (ORFs) potentially encoding in a 5' to 3' direction for proteins of 5 kDa (ORF 1; hydrophobic protein), 62 kDa (ORF 2; heat shock protein 70 homolog, HSP70h), 59 kDa (ORF 3; protein of unknown function), 9 kDa (ORF 4; protein of unknown function), 28.5 kDa (ORF 5; coat protein, CP), 53 kDa (ORF 6; coat protein minor, CPm), and 26.5 kDa (ORF 7; protein of unknown function). Pairwise comparisons of CYSDV RNA 2-encoded proteins (HSP70h, p59 and CPm) among the closteroviruses showed that CYSDV is closely related to LIYV. Phylogenetic analysis based on the amino acid sequence of the HSP70h, indicated that CYSDV clusters with other members of the genus Crinivirus, and it is related to Little cherry virus-1 (LChV-1), but is distinct from the aphid- or mealybug-transmitted closteroviruses.
Sun, Xiao-Dong; Li, Chong-Shan; Tang, Xian; Li, Zhi; Zhang, Yan; Tang, Wei; Wang, Jing; Wang, Hui-Ling; Yang, Yan-Ji; Li, Jia; Yuan, Zheng-An; Xu, Wen-Bo
2013-11-01
This study analyzed the genetic characterization on first imported measles virus of genotype D8 in Chinese mainland. Serums were collected from the suspicious MV patients to detect IgM antibody in ELISA. Throat swabs were cultured in Vero/SLAM cell line to get measles virus isolates. Part of the nucleotide sequence of the 3' terminus of nucleoprotein (N) gene of these isolates were amplified by RT-PCR, and the amplicons were directly sequenced. The phylogenetic analysis was based on the nucleotide sequence about 456 base pairs of the 3' terminus of nucleoprotein (N) gene. Results showed that it reported 1 105 suspicious measles cases in shanghai, 2012, including 590 confirmed cases and 2 clinical case. The reported morbidity was 2.52 per one hundred thousand. 247 measles viruses were isolated from 984 throat swabs specimen. Most of them belonged to sub-genotype H1a except Shanghai12-239 was genotype D8. The homology of nucleotide and amino acid sequences were 97.8% and 98.6% respectively between Shanghai12-239 and WHO reference strain (Manchester. UNK30.94(D8)AF280803). Those were 89.6%-94.5% and 88.7%-95.3% between Shanghai12-239 and WHO reference strains of other genotypes.
Oxidation of monoterpenes in Protium heptaphyllum oleoresins.
Albino, Rayane C; Oliveira, Prissila C; Prosdocimi, Francisco; da Silva, Osman F; Bizzo, Humberto R; Gama, Paola E; Sakuragui, Cássia M; Furtado, Carolina; de Oliveira, Danilo R
2017-04-01
Protium heptaphyllum (Burseraceae) oleoresins are rich in volatile monoterpenes, exhibiting a chemical composition that can be strongly altered with time. The present work aimed to discuss the temporal change of the volatile composition of these oleoresins, and search for related supporting evidence. Samples of P. heptaphyllum oleoresin were collected separately for fresh (n = 10) and aged (n = 8) oleoresins, with the essential oils obtained by hydrodistillation analyzed by GC-FID and GC-MS. Fresh oleoresins were characterized by a high content of terpinolene (28.2-69.7%), whereas aged ones contained large amounts of p-cymene (18.7-43.0%) and p-cymen-8-ol (8.2-31.8%). Multivariate analyses were performed based on the yield and major essential oil components to clearly demonstrate the existence of two subsets (fresh and aged oleoresins). In addition, an analysis of the partial genome sequencing of the species was carried out, producing the largest amount of data for the genus Protium. Subsequently, were searched for nucleotide sequences responsible for the enzymes involved in the biosynthesis of monoterpenes. Two hypotheses were formulated to understand the oxidation process during aging of the oleoresins: (i) a natural chemical oxidation of terpenes and (ii) an oxidation catalyzed by enzymes produced by microorganisms associated with the plant. The results suggested that terpinolene was most likely oxidized to p-cymene, which, in turn, was oxidized into p-cymen-8-ol during natural aging of the exudate due to abiotic factors. Copyright © 2017 Elsevier Ltd. All rights reserved.
Species Based Synonymous Codon Usage in Fusion Protein Gene of Newcastle Disease Virus
Kumar, Chandra Shekhar; Kumar, Sachin
2014-01-01
Newcastle disease is highly pathogenic to poultry and many other avian species. However, the Newcastle disease virus (NDV) has also been reported from many non-avian species. The NDV fusion protein (F) is a major determinant of its pathogenicity and virulence. The functionalities of F gene have been explored for the development of vaccine and diagnostics against NDV. Although the F protein is well studied but the codon usage and its nucleotide composition from NDV isolated from different species have not yet been explored. In present study, we have analyzed the factors responsible for the determination of codon usage in NDV isolated from four major avian host species. The F gene of NDV is analyzed for its base composition and its correlation with the bias in codon usage. Our result showed that random mutational pressure is responsible for codon usage bias in F protein of NDV isolates. Aromaticity, GC3s, and aliphatic index were not found responsible for species based synonymous codon usage bias in F gene of NDV. Moreover, the low amount of codon usage bias and expression level was further confirmed by a low CAI value. The phylogenetic analysis of isolates was found in corroboration with the relatedness of species based on codon usage bias. The relationship between the host species and the NDV isolates from the host does not represent a significant correlation in our study. The present study provides a basic understanding of the mechanism involved in codon usage among species. PMID:25479071