Sample records for full-length coding region

  1. Novel exon 1 protein-coding regions N-terminally extend human KCNE3 and KCNE4.

    PubMed

    Abbott, Geoffrey W

    2016-08-01

    The 5 human (h)KCNE β subunits each regulate various cation channels and are linked to inherited cardiac arrhythmias. Reported here are previously undiscovered protein-coding regions in exon 1 of hKCNE3 and hKCNE4 that extend their encoded extracellular domains by 44 and 51 residues, which yields full-length proteins of 147 and 221 residues, respectively. Full-length hKCNE3 and hKCNE4 transcript and protein are expressed in multiple human tissues; for hKCNE4, only the longer protein isoform is detectable. Two-electrode voltage-clamp electrophysiology revealed that, when coexpressed in Xenopus laevis oocytes with various potassium channels, the newly discovered segment preserved conversion of KCNQ1 by hKCNE3 to a constitutively open channel, but prevented its inhibition of Kv4.2 and KCNQ4. hKCNE4 slowing of Kv4.2 inactivation and positive-shifted steady-state inactivation were also preserved in the longer form. In contrast, full-length hKCNE4 inhibition of KCNQ1 was limited to 40% at +40 mV vs. 80% inhibition by the shorter form, and augmentation of KCNQ4 activity by hKCNE4 was entirely abolished by the additional segment. Among the genome databases analyzed, the longer KCNE3 is confined to primates; full-length KCNE4 is widespread in vertebrates but is notably absent from Mus musculus. Findings highlight unexpected KCNE gene diversity, raise the possibility of dynamic regulation of KCNE partner modulation via splice variation, and suggest that the longer hKCNE3 and hKCNE4 proteins should be adopted in future mechanistic and genetic screening studies.-Abbott, G. W. Novel exon 1 protein-coding regions N-terminally extend human KCNE3 and KCNE4. © FASEB.
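
    The residue counts in this abstract imply the sizes of the previously annotated, shorter proteins. The sketch below only does that subtraction. The resulting short-isoform lengths are derived here, not quoted from the abstract, and the function name is illustrative.

```python
# Residue counts quoted in the abstract above.
EXTENSION = {"hKCNE3": 44, "hKCNE4": 51}      # newly reported N-terminal residues
FULL_LENGTH = {"hKCNE3": 147, "hKCNE4": 221}  # full-length protein sizes


def short_isoform_length(gene: str) -> int:
    """Length implied for the shorter protein (full length minus extension)."""
    return FULL_LENGTH[gene] - EXTENSION[gene]


for gene in FULL_LENGTH:
    print(gene, short_isoform_length(gene))
```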

  2. Integrative Annotation of 21,037 Human Genes Validated by Full-Length cDNA Clones

    PubMed Central

    Imanishi, Tadashi; Itoh, Takeshi; Suzuki, Yutaka; O'Donovan, Claire; Fukuchi, Satoshi; Koyanagi, Kanako O; Barrero, Roberto A; Tamura, Takuro; Yamaguchi-Kabata, Yumi; Tanino, Motohiko; Yura, Kei; Miyazaki, Satoru; Ikeo, Kazuho; Homma, Keiichi; Kasprzyk, Arek; Nishikawa, Tetsuo; Hirakawa, Mika; Thierry-Mieg, Jean; Thierry-Mieg, Danielle; Ashurst, Jennifer; Jia, Libin; Nakao, Mitsuteru; Thomas, Michael A; Mulder, Nicola; Karavidopoulou, Youla; Jin, Lihua; Kim, Sangsoo; Yasuda, Tomohiro; Lenhard, Boris; Eveno, Eric; Suzuki, Yoshiyuki; Yamasaki, Chisato; Takeda, Jun-ichi; Gough, Craig; Hilton, Phillip; Fujii, Yasuyuki; Sakai, Hiroaki; Tanaka, Susumu; Amid, Clara; Bellgard, Matthew; Bonaldo, Maria de Fatima; Bono, Hidemasa; Bromberg, Susan K; Brookes, Anthony J; Bruford, Elspeth; Carninci, Piero; Chelala, Claude; Couillault, Christine; de Souza, Sandro J.; Debily, Marie-Anne; Devignes, Marie-Dominique; Dubchak, Inna; Endo, Toshinori; Estreicher, Anne; Eyras, Eduardo; Fukami-Kobayashi, Kaoru; R. Gopinath, Gopal; Graudens, Esther; Hahn, Yoonsoo; Han, Michael; Han, Ze-Guang; Hanada, Kousuke; Hanaoka, Hideki; Harada, Erimi; Hashimoto, Katsuyuki; Hinz, Ursula; Hirai, Momoki; Hishiki, Teruyoshi; Hopkinson, Ian; Imbeaud, Sandrine; Inoko, Hidetoshi; Kanapin, Alexander; Kaneko, Yayoi; Kasukawa, Takeya; Kelso, Janet; Kersey, Paul; Kikuno, Reiko; Kimura, Kouichi; Korn, Bernhard; Kuryshev, Vladimir; Makalowska, Izabela; Makino, Takashi; Mano, Shuhei; Mariage-Samson, Regine; Mashima, Jun; Matsuda, Hideo; Mewes, Hans-Werner; Minoshima, Shinsei; Nagai, Keiichi; Nagasaki, Hideki; Nagata, Naoki; Nigam, Rajni; Ogasawara, Osamu; Ohara, Osamu; Ohtsubo, Masafumi; Okada, Norihiro; Okido, Toshihisa; Oota, Satoshi; Ota, Motonori; Ota, Toshio; Otsuki, Tetsuji; Piatier-Tonneau, Dominique; Poustka, Annemarie; Ren, Shuang-Xi; Saitou, Naruya; Sakai, Katsunaga; Sakamoto, Shigetaka; Sakate, Ryuichi; Schupp, Ingo; Servant, Florence; Sherry, Stephen; Shiba, Rie; Shimizu, Nobuyoshi; Shimoyama, Mary; Simpson, Andrew J; Soares, Bento; Steward, Charles; Suwa, Makiko; Suzuki, Mami; Takahashi, Aiko; Tamiya, Gen; Tanaka, Hiroshi; Taylor, Todd; Terwilliger, Joseph D; Unneberg, Per; Veeramachaneni, Vamsi; Watanabe, Shinya; Wilming, Laurens; Yasuda, Norikazu; Yoo, Hyang-Sook; Stodolsky, Marvin; Makalowski, Wojciech; Go, Mitiko; Nakai, Kenta; Takagi, Toshihisa; Kanehisa, Minoru; Sakaki, Yoshiyuki; Quackenbush, John; Okazaki, Yasushi; Hayashizaki, Yoshihide; Hide, Winston; Chakraborty, Ranajit; Nishikawa, Ken; Sugawara, Hideaki; Tateno, Yoshio; Chen, Zhu; Oishi, Michio; Tonellato, Peter; Apweiler, Rolf; Okubo, Kousaku; Wagner, Lukas; Wiemann, Stefan; Strausberg, Robert L; Isogai, Takao; Auffray, Charles; Nomura, Nobuo; Sugano, Sumio

    2004-01-01

    The human genome sequence defines our inherent biological potential; the realization of the biology encoded therein requires knowledge of the function of each gene. Currently, our knowledge in this area is still limited. Several lines of investigation have been used to elucidate the structure and function of the genes in the human genome. Even so, gene prediction remains a difficult task, as the varieties of transcripts of a gene may vary to a great extent. We thus performed an exhaustive integrative characterization of 41,118 full-length cDNAs that capture the gene transcripts as complete functional cassettes, providing an unequivocal report of structural and functional diversity at the gene level. Our international collaboration has validated 21,037 human gene candidates by analysis of high-quality full-length cDNA clones through curation using unified criteria. This led to the identification of 5,155 new gene candidates. It also manifested the most reliable way to control the quality of the cDNA clones. We have developed a human gene database, called the H-Invitational Database (H-InvDB; http://www.h-invitational.jp/). It provides the following: integrative annotation of human genes, description of gene structures, details of novel alternative splicing isoforms, non-protein-coding RNAs, functional domains, subcellular localizations, metabolic pathways, predictions of protein three-dimensional structure, mapping of known single nucleotide polymorphisms (SNPs), identification of polymorphic microsatellite repeats within human genes, and comparative results with mouse full-length cDNAs. The H-InvDB analysis has shown that up to 4% of the human genome sequence (National Center for Biotechnology Information build 34 assembly) may contain misassembled or missing regions. We found that 6.5% of the human gene candidates (1,377 loci) did not have a good protein-coding open reading frame, of which 296 loci are strong candidates for non-protein-coding RNA genes. 
In addition, among 72,027 uniquely mapped SNPs and insertions/deletions localized within human genes, 13,215 nonsynonymous SNPs, 315 nonsense SNPs, and 452 indels occurred in coding regions. Together with 25 polymorphic microsatellite repeats present in coding regions, they may alter protein structure, causing phenotypic effects or resulting in disease. The H-InvDB platform represents a substantial contribution to resources needed for the exploration of human biology and pathology. PMID:15103394

  3. Pseudo-polyprotein translated from the full-length ORF1 of capillovirus is important for pathogenicity, but a truncated ORF1 protein without variable and CP regions is sufficient for replication.

    PubMed

    Hirata, Hisae; Yamaji, Yasuyuki; Komatsu, Ken; Kagiwada, Satoshi; Oshima, Kenro; Okano, Yukari; Takahashi, Shuichiro; Ugaki, Masashi; Namba, Shigetou

    2010-09-01

    The first open-reading frame (ORF) of the genus Capillovirus encodes an apparently chimeric polyprotein containing conserved regions for replicase (Rep) and coat protein (CP), while other viruses in the family Flexiviridae have separate ORFs encoding these proteins. To investigate the role of the full-length ORF1 polyprotein of capillovirus, we generated truncation mutants of ORF1 of apple stem grooving virus by inserting a termination codon into the variable region located between the putative Rep- and CP-coding regions. These mutants were capable of systemic infection, although their pathogenicity was attenuated. In vitro translation of ORF1 produced both the full-length polyprotein and the smaller Rep protein. The results of in vivo reporter assays suggested that the mechanism of this early termination is a ribosomal -1 frame-shift occurring downstream from the conserved Rep domains. The mechanism of capillovirus gene expression and the very close evolutionary relationship between the genera Capillovirus and Trichovirus are discussed. Copyright (c) 2010. Published by Elsevier B.V.

  4. Complete mitochondrial genome of Lutzomyia (Nyssomyia) umbratilis (Diptera: Psychodidae), the main vector of Leishmania guyanensis.

    PubMed

    Kocher, Arthur; Gantier, Jean-Charles; Holota, Hélène; Jeziorski, Céline; Coissac, Eric; Bañuls, Anne-Laure; Girod, Romain; Gaborit, Pascal; Murienne, Jérôme

    2016-11-01

    The nearly complete mitochondrial genome of Lutzomyia umbratilis Ward & Fraiha, 1977 (Psychodidae: Phlebotominae), considered the main vector of Leishmania guyanensis, is presented. Sequencing was performed on an Illumina HiSeq 2500 platform using a genome-skimming strategy. The full nuclear ribosomal RNA segment was also assembled. The mitogenome of L. umbratilis was determined to be at least 15,717 bp long and has an architecture found in many insect mitogenomes (13 protein-coding genes, 22 transfer RNAs, two ribosomal RNAs, and one non-coding region, also referred to as the control region). The control region contains a large repeated element of c. 370 bp and a poly-AT region of unknown length. This is the first mitogenome of Psychodidae to be described.

  5. Identification and expression analysis of duck interleukin-17D in Riemerella anatipestifer infection

    USDA-ARS's Scientific Manuscript database

    Interleukin (IL)-17D is a proinflammatory cytokine with limited information on its biological functions. Here we provide the description of the sequence, bioactivity, and mRNA expression profile of duck IL-17D homologue. A full-length duck IL-17D (duIL-17D) cDNA with a 624-bp coding region was ident...

  6. Molecular cloning, characterization and mRNA expression of duck interleukin-17F

    USDA-ARS's Scientific Manuscript database

    Interleukin-17F (IL-17F) is a proinflammatory cytokine that plays an important role in gut homeostasis. A full-length duck IL-17F (duIL-17F) cDNA with a 501-bp coding region was identified in ConA-activated splenic lymphocytes. duIL-17F is predicted to encode 166 amino acids, including a 26-amino ...
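
    The relation between the 501-bp coding region and the 166 predicted amino acids can be checked with a short calculation. This is a minimal sketch that assumes the quoted coding-region length includes one terminal stop codon; the function name and the `includes_stop` flag are illustrative.

```python
def encoded_residues(cds_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by a coding region of cds_bp nucleotides.

    Assumes the region is a whole number of codons and, by default,
    that the final codon is a stop codon that encodes no residue.
    """
    if cds_bp <= 0 or cds_bp % 3 != 0:
        raise ValueError("coding region length must be a positive multiple of 3")
    codons = cds_bp // 3
    return codons - 1 if includes_stop else codons


# 501 bp -> 167 codons -> 166 residues, matching the abstract above.
print(encoded_residues(501))
```

    Under the same assumption, any coding-region length reported in base pairs can be converted in the same way.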

  7. Chicken IL-17F: Identification and comparative expression analysis in Eimeria-Infected chickens

    USDA-ARS's Scientific Manuscript database

    Interleukin-17F (IL-17F), belonging to the IL-17 family, is a proinflammatory cytokine and plays an important role in gut homeostasis. A full-length chicken IL-17F (chIL-17F) cDNA with a 510-bp coding region was first identified from ConA-activated splenic lymphocytes of chickens. The chIL-17F share...

  8. Expression of simian virus 40 T antigen in Escherichia coli: localization of T-antigen origin DNA-binding domain to within 129 amino acids.

    PubMed Central

    Arthur, A K; Höss, A; Fanning, E

    1988-01-01

    The genomic coding sequence of the large T antigen of simian virus 40 (SV40) was cloned into an Escherichia coli expression vector by joining new restriction sites, BglII and BamHI, introduced at the intron boundaries of the gene. Full-length large T antigen, as well as deletion and amino acid substitution mutants, were inducibly expressed from the lac promoter of pUC9, albeit with different efficiencies and protein stabilities. Specific interaction with SV40 origin DNA was detected for full-length T antigen and certain mutants. Deletion mutants lacking T-antigen residues 1 to 130 and 260 to 708 retained specific origin-binding activity, demonstrating that the region between residues 131 and 259 must carry the essential binding domain for DNA-binding sites I and II. A sequence between residues 302 and 320 homologous to a metal-binding "finger" motif is therefore not required for origin-specific binding. However, substitution of serine for either of two cysteine residues in this motif caused a dramatic decrease in origin DNA-binding activity. This region, as well as other regions of the full-length protein, may thus be involved in stabilizing the DNA-binding domain and altering its preference for binding to site I or site II DNA. PMID:2835505
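
    The mapping argument in this abstract reduces to interval arithmetic: residues that survive every tolerated deletion must contain the binding domain. The sketch below assumes a 708-residue full-length T antigen (implied by the deletion range 260 to 708) and treats the two deletions as jointly defining one retained window. The function name is illustrative.

```python
def retained_residues(protein_length, deletions):
    """Residues (1-based, inclusive ranges) not covered by any deletion."""
    deleted = set()
    for start, end in deletions:
        deleted.update(range(start, end + 1))
    return [r for r in range(1, protein_length + 1) if r not in deleted]


# Deletions quoted in the abstract: residues 1-130 and 260-708.
kept = retained_residues(708, [(1, 130), (260, 708)])
print(kept[0], kept[-1], len(kept))  # 131 259 129
```

    The 129 residues left over agree with the "within 129 amino acids" stated in the record title.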

  9. Evaluation of vector-primed cDNA library production from microgram quantities of total RNA.

    PubMed

    Kuo, Jonathan; Inman, Jason; Brownstein, Michael; Usdin, Ted B

    2004-12-15

    cDNA sequences are important for defining the coding region of genes, and full-length cDNA clones have proven to be useful for investigation of the function of gene products. We produced cDNA libraries containing 3.5-5 × 10^5 primary transformants, starting with 5 μg of total RNA prepared from mouse pituitary, adrenal, thymus, and pineal tissue, using a vector-primed cDNA synthesis method. Of approximately 1000 clones sequenced, approximately 20% contained the full open reading frames (ORFs) of known transcripts, based on the presence of the initiating methionine residue codon. The libraries were complex, with 94, 91, 83 and 55% of the clones from the thymus, adrenal, pineal and pituitary libraries, respectively, represented only once. Twenty-five full-length clones, not yet represented in the Mammalian Gene Collection, were identified. Thus, we have produced useful cDNA libraries for the isolation of full-length cDNA clones that are not yet available in the public domain, and demonstrated the utility of a simple method for making high-quality libraries from small amounts of starting material.

  10. The complete mitochondrial genome of Chrysopa pallens (Insecta, Neuroptera, Chrysopidae).

    PubMed

    He, Kun; Chen, Zhe; Yu, Dan-Na; Zhang, Jia-Yong

    2012-10-01

    The complete mitochondrial genome of Chrysopa pallens (Neuroptera, Chrysopidae) was sequenced. It consists of 13 protein-coding genes, 22 transfer RNA genes, 2 ribosomal RNA (rRNA) genes, and a control region (AT-rich region). The total length of the C. pallens mitogenome is 16,723 bp with 79.5% AT content, and the control region is 1905 bp long with 89.1% AT content. The non-coding regions of C. pallens include the control region between the 12S rRNA and trnI genes and a 75-bp spacer region between the trnI and trnQ genes.
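
    AT content, as reported for the mitogenome and its control region, is the percentage of A and T bases among the called bases. A minimal sketch follows; ignoring ambiguous symbols such as N is an assumption made here, and the test sequence is a made-up toy string, not data from the paper.

```python
def at_content(seq: str) -> float:
    """Percentage of A/T among unambiguous bases (A, C, G, T) in seq."""
    bases = [b for b in seq.upper() if b in "ACGT"]  # skip N and other symbols
    if not bases:
        raise ValueError("sequence contains no unambiguous bases")
    at = sum(1 for b in bases if b in "AT")
    return 100.0 * at / len(bases)


# Toy example: 4 of 6 called bases are A or T.
print(round(at_content("AATTGCN"), 1))
```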

  11. [Sequencing and analysis of complete genome of rabies viruses isolated from Chinese Ferret-Badger and dog in Zhejiang province].

    PubMed

    Lei, Yong-Liang; Wang, Xiao-Guang; Tao, Xiao-Yan; Li, Hao; Meng, Sheng-Li; Chen, Xiu-Ying; Liu, Fu-Ming; Ye, Bi-Feng; Tang, Qing

    2010-01-01

    By sequencing the full-length genomes of four rabies viruses isolated from Chinese ferret-badger and dog, we analyzed the genetic variation of rabies viruses at the molecular level, obtained information on rabies virus prevalence and variation in Zhejiang, and enriched the genome database of rabies virus street strains isolated in China. Rabies viruses were isolated in suckling mice, overlapping fragments were amplified by RT-PCR, and full-length genomes were assembled. Nucleotide and deduced protein similarities were analyzed, and phylogenetic analyses were performed with strains from Chinese ferret-badger, dog, sika deer, and vole and with vaccine strains in use. The four full-length genomes were sequenced completely and had the same genetic structure, with a length of 11,923 nt or 11,925 nt, including a 58-nt leader, 1353-nt NP, 894-nt PP, 609-nt MP, 1575-nt GP, and 6386-nt LP, intergenic regions (IGRs) of 2, 5, and 5 nt, a 423-nt pseudogene-like sequence (psi), and a 70-nt trailer. BLAST and multiple sequence alignment showed that the four full-length genomes were consistent with the properties of the genus Lyssavirus, family Rhabdoviridae. Nucleotide and amino acid similarity was highest among Chinese strains, especially those from the same host species. Among the four full-length genomes, amino acid similarity was markedly higher than nucleotide similarity, indicating that most nucleotide mutations in these genomes were synonymous. Compared with the reference rabies viruses, the lengths of the five protein-coding regions were unchanged, and no recombination was found, only a few point mutations. The five proteins therefore appeared to be stable. The sites and types of variation in the four genomes were similar to those of the reference vaccine or street strains. Multiple sequence alignment and phylogenetic analyses placed the four strains in genotype 1, with distinct regional characteristics of China. Therefore, these four rabies viruses are likely to be street viruses already circulating in nature.

  12. Improvement of genome assembly completeness and identification of novel full-length protein-coding genes by RNA-seq in the giant panda genome.

    PubMed

    Chen, Meili; Hu, Yibo; Liu, Jingxing; Wu, Qi; Zhang, Chenglin; Yu, Jun; Xiao, Jingfa; Wei, Fuwen; Wu, Jiayan

    2015-12-11

    High-quality and complete gene models are the basis of whole genome analyses. The giant panda (Ailuropoda melanoleuca) genome was the first genome sequenced on the basis of solely short reads, but the genome annotation had lacked the support of transcriptomic evidence. In this study, we applied RNA-seq to globally improve the genome assembly completeness and to detect novel expressed transcripts in 12 tissues from giant pandas, by using a transcriptome reconstruction strategy that combined reference-based and de novo methods. Several aspects of genome assembly completeness in the transcribed regions were effectively improved by the de novo assembled transcripts, including genome scaffolding, the detection of small-size assembly errors, the extension of scaffold/contig boundaries, and gap closure. Through expression and homology validation, we detected three groups of novel full-length protein-coding genes. A total of 12.62% of the novel protein-coding genes were validated by proteomic data. GO annotation analysis showed that some of the novel protein-coding genes were involved in pigmentation, anatomical structure formation and reproduction, which might be related to the development and evolution of the black-white pelage, pseudo-thumb and delayed embryonic implantation of giant pandas. The updated genome annotation will help further giant panda studies from both structural and functional perspectives.

  13. A biological inspired fuzzy adaptive window median filter (FAWMF) for enhancing DNA signal processing.

    PubMed

    Ahmad, Muneer; Jung, Low Tan; Bhuiyan, Al-Amin

    2017-10-01

    Digital signal processing techniques commonly employ fixed-length window filters to process signal contents. DNA signals differ in characteristics from common digital signals because they carry nucleotides as contents. The nucleotides carry genetic code context and exhibit fuzzy behaviors owing to their special structure and order in the DNA strand. Employing conventional fixed-length window filters for DNA signal processing produces spectral leakage and hence signal noise. A biological context-aware adaptive window filter is required to process DNA signals. This paper introduces a biologically inspired fuzzy adaptive window median filter (FAWMF), which computes the fuzzy membership strength of nucleotides in each slide of the window and filters nucleotides based on median filtering with a combination of s-shaped and z-shaped filters. Since coding regions cause 3-base periodicity through an unbalanced nucleotide distribution that produces a relatively high bias in nucleotide usage, this fundamental characteristic of nucleotides is exploited in FAWMF to suppress signal noise. Along with the adaptive response of FAWMF, a strong correlation between median nucleotides and the Π-shaped filter was observed, which produced enhanced discrimination between coding and non-coding regions compared with conventional fixed-length window filters. The proposed FAWMF attains a significant enhancement in coding region identification, i.e., 40% to 125%, compared with other conventional window filters, tested over more than 250 benchmarked and randomly taken DNA datasets from different organisms. This study shows that conventional fixed-length window filters applied to DNA signals do not achieve significant results because the nucleotides carry genetic code context. The proposed FAWMF algorithm is adaptive and significantly outperforms them in processing DNA signal contents. Applied to a variety of DNA datasets, the algorithm produced noteworthy discrimination between coding and non-coding regions compared with conventional fixed-length window filters. Copyright © 2017 Elsevier B.V. All rights reserved.
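
    The 3-base periodicity mentioned in this abstract is usually measured as spectral power at frequency 1/3 inside a sliding window. The sketch below shows only that conventional fixed-length-window baseline, which the abstract argues against. It is not the FAWMF algorithm, whose details are not given here. The default window length and step are illustrative assumptions, as are the function names.

```python
import cmath


def period3_power(seq: str) -> float:
    """Summed DFT power at frequency 1/3 over the A, C, G, T indicator sequences."""
    n = len(seq)
    if n == 0:
        return 0.0
    total = 0.0
    for base in "ACGT":
        # DFT coefficient at k = n/3 of the 0/1 indicator sequence for this base.
        coeff = sum(cmath.exp(-2j * cmath.pi * pos / 3)
                    for pos, b in enumerate(seq.upper()) if b == base)
        total += abs(coeff) ** 2
    return total / n


def sliding_period3(seq: str, window: int = 351, step: int = 3):
    """Period-3 power for each fixed-length window; returns (start, power) pairs."""
    return [(start, period3_power(seq[start:start + window]))
            for start in range(0, len(seq) - window + 1, step)]


# A perfectly codon-periodic toy sequence scores high; a homopolymer scores ~0.
print(period3_power("ATG" * 50), period3_power("A" * 150))
```

    Peaks in the returned profile are conventionally read as candidate coding regions. The fixed window is exactly the design choice the paper replaces with an adaptive one.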

  14. Cost-effective sequencing of full-length cDNA clones powered by a de novo-reference hybrid assembly.

    PubMed

    Kuroshu, Reginaldo M; Watanabe, Junichi; Sugano, Sumio; Morishita, Shinichi; Suzuki, Yutaka; Kasahara, Masahiro

    2010-05-07

    Sequencing full-length cDNA clones is important to determine gene structures including alternative splice forms, and provides valuable resources for experimental analyses to reveal the biological functions of coded proteins. However, previous approaches for sequencing cDNA clones were expensive or time-consuming, and therefore, a fast and efficient sequencing approach was demanded. We developed a program, MuSICA 2, that assembles millions of short (36-nucleotide) reads collected from a single flow cell lane of Illumina Genome Analyzer to shotgun-sequence approximately 800 human full-length cDNA clones. MuSICA 2 performs a hybrid assembly in which an external de novo assembler is run first and the result is then improved by reference alignment of shotgun reads. We compared the MuSICA 2 assembly with 200 pooled full-length cDNA clones finished independently by the conventional primer-walking using Sanger sequencers. The exon-intron structure of the coding sequence was correct for more than 95% of the clones with coding sequence annotation when we excluded cDNA clones insufficiently represented in the shotgun library due to PCR failure (42 out of 200 clones excluded), and the nucleotide-level accuracy of coding sequences of those correct clones was over 99.99%. We also applied MuSICA 2 to full-length cDNA clones from Toxoplasma gondii, to confirm that its ability was competent even for non-human species. The entire sequencing and shotgun assembly takes less than 1 week and the consumables cost only approximately US$3 per clone, demonstrating a significant advantage over previous approaches.

  15. Plasma interface of the EC waves to the LHD peripheral region

    NASA Astrophysics Data System (ADS)

    Kubo, S.; Igami, H.; Tsujimura, T. I.; Shimozuma, T.; Takahashi, H.; Yoshimura, Y.; Nishiura, M.; Makino, R.; Mutoh, T.

    2015-12-01

    To realize efficient ECRH and to reduce stray radiation from non-absorbed power during ECRH, it is necessary to excite a wave that is absorbed well near the electron cyclotron resonance. In a typical magnetic confinement fusion device and in the electron cyclotron frequency range, the WKB approximation is valid almost all the way from the antenna to the absorption region because the scale lengths of the plasma density, λn, and of the magnetic shear, τs, are large compared with the local wavelength λ0. In this situation, it is well known that the O/X mode propagates as the O/X mode if τs ≫ λ0. Even so, if τs and λn are comparable and |1/λO-1/λX|τs ≪ 1, the question remains of where in the peripheral region the "X" or "O" mode becomes established as the "X" or "O" mode. To simulate this situation, a one-dimensional full-wave calculation code, which solves the electromagnetic wave equation under an arbitrary magnetic field configuration and an arbitrary density profile for a given polarization state, was developed and incorporated into the upgraded ray-tracing code LHDGauss. An attempt is made to find the range of density and shear scale lengths in which the mode-mixing effect is not negligible.

  16. Cost-Effective Sequencing of Full-Length cDNA Clones Powered by a De Novo-Reference Hybrid Assembly

    PubMed Central

    Sugano, Sumio; Morishita, Shinichi; Suzuki, Yutaka

    2010-01-01

    Background: Sequencing full-length cDNA clones is important to determine gene structures including alternative splice forms, and provides valuable resources for experimental analyses to reveal the biological functions of coded proteins. However, previous approaches for sequencing cDNA clones were expensive or time-consuming, and therefore, a fast and efficient sequencing approach was demanded. Methodology: We developed a program, MuSICA 2, that assembles millions of short (36-nucleotide) reads collected from a single flow cell lane of Illumina Genome Analyzer to shotgun-sequence ∼800 human full-length cDNA clones. MuSICA 2 performs a hybrid assembly in which an external de novo assembler is run first and the result is then improved by reference alignment of shotgun reads. We compared the MuSICA 2 assembly with 200 pooled full-length cDNA clones finished independently by the conventional primer-walking using Sanger sequencers. The exon-intron structure of the coding sequence was correct for more than 95% of the clones with coding sequence annotation when we excluded cDNA clones insufficiently represented in the shotgun library due to PCR failure (42 out of 200 clones excluded), and the nucleotide-level accuracy of coding sequences of those correct clones was over 99.99%. We also applied MuSICA 2 to full-length cDNA clones from Toxoplasma gondii, to confirm that its ability was competent even for non-human species. Conclusions: The entire sequencing and shotgun assembly takes less than 1 week and the consumables cost only ∼US$3 per clone, demonstrating a significant advantage over previous approaches. PMID:20479877

  17. Replication of poliovirus RNA and subgenomic RNA transcripts in transfected cells.

    PubMed Central

    Collis, P S; O'Donnell, B J; Barton, D J; Rogers, J A; Flanegan, J B

    1992-01-01

    Full-length and subgenomic poliovirus RNAs were transcribed in vitro and transfected into HeLa cells to study viral RNA replication in vivo. RNAs with deletion mutations were analyzed for the ability to replicate in either the absence or the presence of helper RNA by using a cotransfection procedure and Northern (RNA) blot analysis. An advantage of this approach was that viral RNA replication and genetic complementation could be characterized without first isolating conditional-lethal mutants. A subgenomic RNA with a large in-frame deletion in the capsid coding region (P1) replicated more efficiently than full-length viral RNA transcripts. In cotransfection experiments, both the full-length and subgenomic RNAs replicated at slightly reduced levels and appeared to interfere with each other's replication. In contrast, a subgenomic RNA with a similarly sized out-of-frame deletion in P1 did not replicate in transfected cells, either alone or in the presence of helper RNA. Similar results were observed with an RNA transcript containing a large in-frame deletion spanning the P1, P2, and P3 coding regions. A mutant RNA with an in-frame deletion in the P1-2A coding sequence was self-replicating but at a significantly reduced level. The replication of this RNA was fully complemented after cotransfection with a helper RNA that provided 2A in trans. A P1-2A-2B in-frame deletion, however, totally blocked RNA replication and was not complemented. Control experiments showed that all of the expected viral proteins were both synthesized and processed when the RNA transcripts were translated in vitro. Thus, our results indicated that 2A was a trans-acting protein and that 2B and perhaps other viral proteins were cis acting during poliovirus RNA replication in vivo. Our data support a model for poliovirus RNA replication which directly links the translation of a molecule of plus-strand RNA with the formation of a replication complex for minus-strand RNA synthesis. 
PMID:1328676
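
    The in-frame versus out-of-frame distinction that drives these results depends only on whether the number of deleted nucleotides is a multiple of three. A minimal sketch follows. The deletion sizes used are hypothetical, since the abstract does not give them, and the function name is illustrative.

```python
def deletion_frame(deleted_nt: int) -> str:
    """Classify a deletion by its effect on the downstream reading frame."""
    if deleted_nt <= 0:
        raise ValueError("deletion length must be positive")
    # Removing whole codons keeps downstream codons in register.
    return "in-frame" if deleted_nt % 3 == 0 else "out-of-frame"


# Hypothetical sizes: a 1500-nt deletion preserves the frame, a 1501-nt one shifts it.
for size in (1500, 1501):
    print(size, deletion_frame(size))
```

    An out-of-frame deletion changes every downstream codon of the single poliovirus open reading frame, which fits the observation that such an RNA did not replicate even with helper RNA.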

  18. Identification of a new genotype H wild-type mumps virus strain and its molecular relatedness to other virulent and attenuated strains.

    PubMed

    Amexis, Georgios; Rubin, Steven; Chatterjee, Nando; Carbone, Kathryn; Chumakov, Kostantin

    2003-06-01

    A single clinical isolate of mumps virus designated 88-1961 was obtained from a patient hospitalized with a clinical history of upper respiratory tract infection, parotitis, severe headache, fever and lymphadenopathy. We have sequenced the full-length genome of 88-1961 and compared it against all available full-length sequences of mumps virus. Based upon its nucleotide sequence of the SH gene 88-1961 was identified as a genotype H mumps strain. The overall extent of nucleotide and amino acid differences between each individual gene and protein of 88-1961 and the full-length mumps samples showed that the missense to silent ratios were unevenly distributed. Upon evaluation of the consensus sequence of 88-1961, four positions were found to be clearly heterogeneous at the nucleotide level (NP 315C/T, NP 318C/T, F 271A/C, and HN 855C/T). Sequence analysis revealed that the amino acid sequences for the NP, M, and the L protein were the most conserved, whereas the SH protein exhibited the highest variability among the compared mumps genotypes A, B, and G. No identifying molecular patterns in the non-coding (intergenic) or coding regions of 88-1961 were found when we compared it against relatively virulent (Urabe AM9 B, Glouc1/UK96, 87-1004 and 87-1005) and non-virulent mumps strains (Jeryl Lynn and all Urabe Am9 A substrains). Copyright 2003 Wiley-Liss, Inc.

  19. Both the stroma and thylakoid lumen of tobacco chloroplasts are competent for the formation of disulphide bonds in recombinant proteins.

    PubMed

    Bally, Julia; Paget, Eric; Droux, Michel; Job, Claudette; Job, Dominique; Dubald, Manuel

    2008-01-01

    Plant chloroplasts are promising vehicles for recombinant protein production, but the process of protein folding in these organelles is not well understood in comparison with that in prokaryotic systems, such as Escherichia coli. This is particularly true for disulphide bond formation which is crucial for the biological activity of many therapeutic proteins. We have investigated the capacity of tobacco (Nicotiana tabacum) chloroplasts to efficiently form disulphide bonds in proteins by expressing in this plant cell organelle a well-known bacterial enzyme, alkaline phosphatase, whose activity and stability strictly depend on the correct formation of two intramolecular disulphide bonds. Plastid transformants have been generated that express either the mature enzyme, localized in the stroma, or the full-length coding region, including its signal peptide. The latter has the potential to direct the recombinant alkaline phosphatase into the lumen of thylakoids, giving access to this even less well-characterized organellar compartment. We show that the chloroplast stroma supports the formation of an active enzyme, unlike a normal bacterial cytosol. Sorting of alkaline phosphatase to the thylakoid lumen occurs in the plastid transformants translating the full-length coding region, and leads to larger amounts and more active enzyme. These results are compared with those obtained in bacteria. The implications of these findings on protein folding properties and competency of chloroplasts for disulphide bond formation are discussed.

  20. Complete mitochondrial genome of endangered Yellow-shouldered Amazon (Amazona barbadensis): two control region copies in parrot species of the Amazona genus.

    PubMed

    Urantowka, Adam Dawid; Hajduk, Kacper; Kosowska, Barbara

    2013-08-01

    Amazona barbadensis is an endangered species of parrot living in northern coastal Venezuela and on several Caribbean islands. In this study, we sequenced the full mitochondrial genome of this species. The mitogenome was 18,983 bp long and contained 13 protein-coding genes, 22 transfer RNA genes, two ribosomal RNA genes, a duplicated control region, and degenerate copies of the ND6 and tRNA (Glu) genes. The high degree of identity between the two copies of the control region suggests their coincident evolution and functionality. Comparative analysis of both control region sequences from four Amazona species revealed 89.1% identity over a region of 1300 bp and indicates the presence of distinctive parts in the two control region copies.

  1. Identification and characterization of a novel serine-threonine kinase gene from the Xp22 region.

    PubMed

    Montini, E; Andolfi, G; Caruso, A; Buchner, G; Walpole, S M; Mariani, M; Consalez, G; Trump, D; Ballabio, A; Franco, B

    1998-08-01

    Eukaryotic protein kinases are part of a large and expanding family of proteins. Through our transcriptional mapping effort in the Xp22 region, we have isolated and sequenced the full-length transcript of STK9, a novel cDNA highly homologous to serine-threonine kinases. A number of human genetic disorders have been mapped to the region where STK9 has been localized including Nance-Horan (NH) syndrome, oral-facial-digital syndrome type 1 (OFD1), and a novel locus for nonsyndromic sensorineural deafness (DFN6). To evaluate the possible involvement of STK9 in any of the above-mentioned disorders, a 2416-bp full-length cDNA was assembled. The entire genomic structure of the gene, which is composed of 20 coding exons, was determined. Northern analysis revealed a transcript larger than 9.5 kb in several tissues including brain, lung, and kidney. The mouse homologue (Stk9) was identified and mapped in the mouse in the region syntenic to human Xp. This location is compatible with the location of the Xcat mutant, which shows congenital cataracts very similar to those observed in NH patients. Sequence homologies, expression pattern, and mapping information in both human and mouse make STK9 a candidate gene for the above-mentioned disorders. Copyright 1998 Academic Press.

  2. HIV1 V3 loop hypermutability is enhanced by the guanine usage bias in the part of env gene coding for it.

    PubMed

    Khrustalev, Vladislav Victorovich

    2009-01-01

    Guanine is the most mutable nucleotide in HIV genes because of frequently occurring G to A transitions, which are caused by cytosine deamination in viral DNA minus strands catalyzed by APOBEC enzymes. Distribution of guanine between three codon positions should influence the probability for G to A mutation to be nonsynonymous (to occur in first or second codon position). We discovered that nucleotide sequences of env genes coding for third variable regions (V3 loops) of gp120 from HIV1 and HIV2 have different kinds of guanine usage biases. In the HIV1 reference strain and 100 additionally analyzed HIV1 strains the guanine usage bias in V3 loop coding regions (2G>1G>3G) should lead to elevated nonsynonymous G to A transitions occurrence rates. In the HIV2 reference strain and 100 other HIV2 strains guanine usage bias in V3 loop coding regions (3G>2G>1G) should protect V3 loops from hypermutability. According to the HIV1 and HIV2 V3 alignment, insertion of the sequence enriched with 2G (21 codons in length) occurred during the evolution of HIV1 predecessor, while insertion of the different sequence enriched with 3G (19 codons in length) occurred during the evolution of HIV2 predecessor. The higher is the level of 3G in the V3 coding region, the lower should be the immune escaping mutation occurrence rates. This hypothesis was tested in this study by comparing the guanine usage in V3 loop coding regions from HIV1 fast and slow progressors. All calculations have been performed by our algorithms "VVK In length", "VVK Dinucleotides" and "VVK Consensus" (www.barkovsky.hotmail.ru).
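
    The codon-position bias discussed above (written as 2G>1G>3G or 3G>2G>1G) can be illustrated with a small sketch. It assumes that 1G, 2G and 3G denote the fraction of codons carrying guanine in the first, second and third position of an in-frame coding sequence. The function names and the toy sequence are hypothetical and are not taken from the author's "VVK" programs.

```python
def guanine_by_codon_position(cds: str) -> dict:
    """Fraction of codons with G at codon positions 1, 2 and 3.

    Assumes `cds` is in frame; a trailing partial codon is ignored.
    """
    seq = cds.upper()
    n_codons = len(seq) // 3
    if n_codons == 0:
        raise ValueError("need at least one complete codon")
    counts = [0, 0, 0]
    for i in range(n_codons * 3):
        if seq[i] == "G":
            counts[i % 3] += 1
    return {f"{pos + 1}G": counts[pos] / n_codons for pos in range(3)}


def bias_order(freqs: dict) -> str:
    """Rank the three positions from highest to lowest guanine usage."""
    return ">".join(sorted(freqs, key=freqs.get, reverse=True))


# Toy sequence (hypothetical, not a real V3 loop): codons AGA AGA GGA
toy = "AGAAGAGGA"
freqs = guanine_by_codon_position(toy)
print(bias_order(freqs))  # prints 2G>1G>3G
```

    A G-to-A change at the first or second codon position usually alters the amino acid, whereas many third-position changes are silent, which is why the ranking of the three values matters for the argument in the abstract.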

  3. Examples of Linking Codes Within GeoFramework

    NASA Astrophysics Data System (ADS)

    Tan, E.; Choi, E.; Thoutireddy, P.; Aivazis, M.; Lavier, L.; Quenette, S.; Gurnis, M.

    2003-12-01

    Geological processes usually encompass a broad spectrum of length and time scales. Traditionally, a modeling code (solver) is written to solve a problem with specific length and time scales in mind. The utility of the solver beyond the designated purpose is usually limited. Furthermore, two distinct solvers, even if each can solve complementary parts of a new problem, are difficult to link together to solve the problem as a whole. For example, a Lagrangian deformation model with a visco-elastoplastic crust is used to study deformation near a plate boundary. Ideally, the driving force of the deformation should be derived from underlying mantle convection, which requires linking the Lagrangian deformation model with an Eulerian mantle convection model. As our understanding of geological processes evolves, the need for integrated modeling codes, which should reuse existing codes as much as possible, begins to surface. The GeoFramework project addresses this need by developing a suite of reusable and re-combinable tools for the Earth science community. GeoFramework is based on and extends Pyre, a Python-based modeling framework, recently developed to link solid (Lagrangian) and fluid (Eulerian) models, as well as mesh generators, visualization packages, and databases, with one another for engineering applications. Under the framework, a solver is aware of the existence of other solvers and can interact with them by exchanging information across adjacent boundaries. A solver needs to conform to a standard interface and provide its own implementation for exchanging boundary information. The framework also provides facilities to control the coordination between interacting solvers. We will show an example of linking two solvers within GeoFramework. CitcomS is a finite element code which solves for thermal convection within a 3D spherical shell. CitcomS can solve problems either within a full spherical (global) domain or a restricted (regional) domain of a full sphere by using different meshers. We can embed a regional CitcomS solver within a global CitcomS solver. We note that linking instances of the same solver is conceptually equivalent to linking two different solvers. The global solver has a coarser grid and a longer stable time step than the regional solver. Therefore, a global-solver time step consists of several regional-solver time steps. The time-marching scheme is described below. First, the global solver is advanced one global-solver time step. Then, the regional solver is advanced for several regional-solver time steps until it catches up with the global solver. Within each regional-solver time step, the velocity field of the global solver is interpolated in time and then imposed on the regional solver as boundary conditions. Finally, the temperature field of the regional solver is extrapolated in space and fed back to the global solver. These two solvers are linked and synchronized by the time-marching scheme. An effort to embed a visco-elastoplastic representation of the crust within viscous mantle flow is underway.
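
    The time-marching scheme described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: `ToySolver`, `interpolate_in_time` and `coupled_step` are hypothetical names, not part of the Pyre or CitcomS interfaces; the fields are reduced to single numbers; the boundary velocity is assumed to be interpolated linearly at the end of each regional sub-step; and the spatial extrapolation of temperature is replaced by a plain copy.

```python
from dataclasses import dataclass, field


@dataclass
class ToySolver:
    """Stand-in for a solver; it only tracks time and a scalar 'temperature'."""
    dt: float                      # stable time step of this solver
    time: float = 0.0
    temperature: float = 0.0
    boundary_log: list = field(default_factory=list)

    def advance(self, dt: float) -> None:
        self.time += dt


def interpolate_in_time(v_old, v_new, t_old, t_new, t):
    """Linear interpolation of a boundary value between two global times."""
    w = (t - t_old) / (t_new - t_old)
    return (1.0 - w) * v_old + w * v_new


def coupled_step(global_solver, regional_solver, v_old, v_new):
    """One global step followed by regional sub-steps that catch up with it.

    v_old / v_new are the global boundary velocities before and after the
    global step (plain floats here, an assumption of this sketch).
    """
    t_old = global_solver.time
    global_solver.advance(global_solver.dt)        # 1. advance the global solver
    t_new = global_solver.time

    substeps = 0
    while regional_solver.time < t_new - 1e-12:    # 2. regional solver catches up
        dt = min(regional_solver.dt, t_new - regional_solver.time)
        t_target = regional_solver.time + dt
        # 3. global velocity interpolated in time, imposed as boundary condition
        bc = interpolate_in_time(v_old, v_new, t_old, t_new, t_target)
        regional_solver.boundary_log.append(bc)
        regional_solver.advance(dt)
        substeps += 1

    # 4. feed the regional temperature back (placeholder for spatial extrapolation)
    global_solver.temperature = regional_solver.temperature
    return substeps


g = ToySolver(dt=1.0)
r = ToySolver(dt=0.25, temperature=1300.0)
print(coupled_step(g, r, v_old=0.0, v_new=4.0))  # prints 4
print(r.boundary_log)                            # prints [1.0, 2.0, 3.0, 4.0]
```

    Clamping the last regional sub-step to the remaining interval keeps both solvers at the same time level after every global step, which is the synchronization the abstract describes.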

  4. Genome-wide comparisons of phylogenetic similarities between partial genomic regions and the full-length genome in Hepatitis E virus genotyping.

    PubMed

    Wang, Shuai; Wei, Wei; Luo, Xuenong; Cai, Xuepeng

    2014-01-01

    Besides the complete genome, different partial genomic sequences of Hepatitis E virus (HEV) have been used in genotyping studies, making it difficult to compare the results based on them. No commonly agreed partial region for HEV genotyping has been determined. In this study, we used a statistical method to evaluate the phylogenetic performance of each partial genomic sequence across the genome, comparing evolutionary distances between genomic regions and the full-length genomes of 101 HEV isolates to identify short genomic regions that can reproduce HEV genotype assignments based on full-length genomes. Several genomic regions, especially one at the 3' terminus of the papain-like cysteine protease domain, were found to have relatively high phylogenetic correlations with the full-length genome. Phylogenetic analyses confirmed that these regions performed identically to the full-length genome in genotyping, dividing the HEV isolates involved into reasonable genotypes. This analysis may be of value in developing a partial sequence-based consensus classification of HEV species.
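
    The idea of scoring a partial region by how well its evolutionary distances track those of the full-length genome can be sketched as follows. The abstract does not name the statistical method, so this sketch substitutes a simple stand-in: uncorrected p-distances between all sequence pairs, compared by Pearson correlation over sliding windows of an alignment. All function names and the toy alignment are hypothetical.

```python
from itertools import combinations
from math import sqrt


def p_distance(a: str, b: str) -> float:
    """Proportion of differing sites, ignoring columns with a gap in either sequence."""
    pairs = [(x, y) for x, y in zip(a, b) if x != "-" and y != "-"]
    if not pairs:
        raise ValueError("no comparable sites")
    return sum(x != y for x, y in pairs) / len(pairs)


def pairwise_distances(seqs, start=0, end=None):
    """p-distances for every pair of aligned sequences over columns [start, end)."""
    return [p_distance(s[start:end], t[start:end]) for s, t in combinations(seqs, 2)]


def pearson(xs, ys) -> float:
    """Pearson correlation coefficient; NaN if either vector is constant."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        return float("nan")
    return sxy / sqrt(sxx * syy)


def window_correlations(seqs, window: int, step: int):
    """Correlation of window distances with full-length distances, per window start."""
    full = pairwise_distances(seqs)
    length = len(seqs[0])
    return [
        (start, pearson(pairwise_distances(seqs, start, start + window), full))
        for start in range(0, length - window + 1, step)
    ]


# Toy alignment (hypothetical); real input would be aligned full-length genomes.
alignment = ["AAAAAAAA", "AAAATTTT", "TTTTTTTT"]
for start, r in window_correlations(alignment, window=4, step=4):
    print(start, round(r, 3))
```

    Windows whose correlation with the full-length distances is close to 1 would be candidates for partial-sequence genotyping; a real analysis would use model-corrected distances and many more isolates.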

  5. [Complete genome sequencing and analyses of rabies viruses isolated from wild animals (Chinese Ferret-Badger) in Zhejiang province].

    PubMed

    Lei, Yong-Liang; Wang, Xiao-Guang; Liu, Fu-Ming; Chen, Xiu-Ying; Ye, Bi-Feng; Mei, Jian-Hua; Lan, Jin-Quan; Tang, Qing

    2009-08-01

    Based on the full-length genome sequences of rabies viruses from two Chinese ferret-badgers, we analyzed the genetic variation of rabies viruses at the molecular level to obtain information on their prevalence and variation in Zhejiang, and to enrich the genome database of rabies virus street strains isolated from Chinese wildlife. Overlapping fragments were amplified by RT-PCR and assembled into full-length genomes, which were used to analyze nucleotide and deduced protein similarities and to perform phylogenetic analyses of the N genes from Chinese ferret-badger, sika deer, vole, and dog. Vaccine strains were then determined. Complete sequencing showed that the two full-length genomes had the same genetic structure of 11,923 nt, including the leader (58 nt), NP (1353 nt), PP (894 nt), MP (609 nt), GP (1575 nt), LP (6386 nt), the intergenic regions (IGRs; 2, 5, and 5 nt), a pseudogene-like sequence (Psi; 423 nt), and the trailer (70 nt). BLAST and multiple sequence alignment showed that the two full-length genomes were consistent with the properties of Rhabdoviridae Lyssavirus. The nucleotide and amino acid sequences among Chinese strains had the highest similarity, especially among animals of the same species. For the two full-length genomes, similarity at the amino acid level was markedly higher than at the nucleotide level, so the nucleotide mutations in these two genomes were most probably synonymous. Compared with the reference rabies viruses, the five protein-coding regions showed no length changes or recombination, only a few point mutations. The five proteins therefore appeared to be stable. The variation sites and types in the two ferret-badger genomes were similar to those of the reference vaccine or street strains. According to the multiple sequence and phylogenetic analyses, the two strains were genotype 1 and possessed the distinct geographic characteristics of China. All the evidence suggested that these two ferret-badger rabies viruses were likely street viruses already circulating in wildlife.

  6. An improved and validated RNA HLA class I SBT approach for obtaining full length coding sequences.

    PubMed

    Gerritsen, K E H; Olieslagers, T I; Groeneweg, M; Voorter, C E M; Tilanus, M G J

    2014-11-01

    The functional relevance of human leukocyte antigen (HLA) class I allele polymorphism beyond exons 2 and 3 is difficult to address because more than 70% of the HLA class I alleles are defined by exons 2 and 3 sequences only. For routine application on clinical samples we improved and validated the HLA sequence-based typing (SBT) approach based on RNA templates, using either a single locus-specific or two overlapping group-specific polymerase chain reaction (PCR) amplifications, with three forward and three reverse sequencing reactions for full length sequencing. Locus-specific HLA typing with RNA SBT of a reference panel, representing the major antigen groups, showed identical results compared to DNA SBT typing. Alleles encountered with unknown exons in the IMGT/HLA database and three samples, two with Null and one with a Low expressed allele, have been addressed by the group-specific RNA SBT approach to obtain full length coding sequences. This RNA SBT approach has proven its value in our routine full length definition of alleles. © 2014 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  7. Large-scale analysis of full-length cDNAs from the tomato (Solanum lycopersicum) cultivar Micro-Tom, a reference system for the Solanaceae genomics.

    PubMed

    Aoki, Koh; Yano, Kentaro; Suzuki, Ayako; Kawamura, Shingo; Sakurai, Nozomu; Suda, Kunihiro; Kurabayashi, Atsushi; Suzuki, Tatsuya; Tsugane, Taneaki; Watanabe, Manabu; Ooga, Kazuhide; Torii, Maiko; Narita, Takanori; Shin-I, Tadasu; Kohara, Yuji; Yamamoto, Naoki; Takahashi, Hideki; Watanabe, Yuichiro; Egusa, Mayumi; Kodama, Motoichiro; Ichinose, Yuki; Kikuchi, Mari; Fukushima, Sumire; Okabe, Akiko; Arie, Tsutomu; Sato, Yuko; Yazawa, Katsumi; Satoh, Shinobu; Omura, Toshikazu; Ezura, Hiroshi; Shibata, Daisuke

    2010-03-30

    The Solanaceae family includes several economically important vegetable crops. The tomato (Solanum lycopersicum) is regarded as a model plant of the Solanaceae family. Recently, a number of tomato resources have been developed in parallel with the ongoing tomato genome sequencing project. In particular, a miniature cultivar, Micro-Tom, is regarded as a model system in tomato genomics, and a number of genomics resources in the Micro-Tom-background, such as ESTs and mutagenized lines, have been established by an international alliance. To accelerate the progress in tomato genomics, we developed a collection of fully-sequenced 13,227 Micro-Tom full-length cDNAs. By checking redundant sequences, coding sequences, and chimeric sequences, a set of 11,502 non-redundant full-length cDNAs (nrFLcDNAs) was generated. Analysis of untranslated regions demonstrated that tomato has longer 5'- and 3'-untranslated regions than most other plants but rice. Classification of functions of proteins predicted from the coding sequences demonstrated that nrFLcDNAs covered a broad range of functions. A comparison of nrFLcDNAs with genes of sixteen plants facilitated the identification of tomato genes that are not found in other plants, most of which did not have known protein domains. Mapping of the nrFLcDNAs onto currently available tomato genome sequences facilitated prediction of exon-intron structure. Introns of tomato genes were longer than those of Arabidopsis and rice. According to a comparison of exon sequences between the nrFLcDNAs and the tomato genome sequences, the frequency of nucleotide mismatch in exons between Micro-Tom and the genome-sequencing cultivar (Heinz 1706) was estimated to be 0.061%. The collection of Micro-Tom nrFLcDNAs generated in this study will serve as a valuable genomic tool for plant biologists to bridge the gap between basic and applied studies. 
The nrFLcDNA sequences will help annotation of the tomato whole-genome sequence and aid in tomato functional genomics and molecular breeding. Full-length cDNA sequences and their annotations are provided in the database KaFTom http://www.pgb.kazusa.or.jp/kaftom/ via the website of the National Bioresource Project Tomato http://tomato.nbrp.jp.

  8. Modifying scoping codes to accurately calculate TMI-cores with lifetimes greater than 500 effective full-power days

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Bai, D.; Levine, S.L.; Luoma, J.

    1992-01-01

    The Three Mile Island unit 1 core reloads have been designed using fast but accurate scoping codes, PSUI-LEOPARD and ADMARC. PSUI-LEOPARD has been normalized to EPRI-CPM2 results and used to calculate the two-group constants, whereas ADMARC is a modern two-dimensional, two-group diffusion theory nodal code. Problems in accuracy were encountered for cycles 8 and higher as the core lifetime was increased beyond 500 effective full-power days. This is because the heavier loaded cores in both U-235 and B-10 have harder neutron spectra, which produces a change in the transport effect in the baffle reflector region, and the burnable poison (BP) simulations were not accurate enough for the cores containing the increased amount of B-10 required in the BP rods. In the authors' study, a technique has been developed to take into account the change in the transport effect in the baffle region by modifying the fast neutron diffusion coefficient as a function of cycle length and core exposure or burnup. A more accurate BP simulation method is also developed, using integral transport theory and CPM2 data, to calculate the BP contribution to the equivalent fuel assembly (supercell) two-group constants. The net result is that the accuracy of the scoping codes is as good as that produced by CASMO/SIMULATE or CPM2/SIMULATE when comparing with measured data.

  9. The alpaca melanocortin 1 receptor: gene mutations, transcripts, and relative levels of expression in ventral skin biopsies.

    PubMed

    Chandramohan, Bathrachalam; Renieri, Carlo; La Manna, Vincenzo; La Terza, Antonietta

    2015-01-01

    The objectives of the present study were to characterize the MC1R gene, its transcripts and the single nucleotide polymorphisms (SNPs) associated with coat color in alpaca. Full length cDNA amplification revealed the presence of two transcripts, named F1 and F2, differing only in the length of their 5'-terminal untranslated region (UTR) sequences and presenting a color specific expression. Whereas the F1 transcript was common to white and colored (black and brown) alpaca phenotypes, the shorter F2 transcript was specific to white alpaca. Further sequencing of the MC1R gene in white and colored alpaca identified a total of twelve SNPs; of these, nine (four silent mutations (c.126C>A, c.354T>C, c.618G>A, and c.933G>A) and five missense mutations (c.82A>G, c.92C>T, c.259A>G, c.376A>G, and c.901C>T)) were observed in the coding region and three in the 3'UTR. A 4 bp deletion (c.224_227del) was also identified in the coding region. Molecular segregation analysis uncovered that the combinatory mutations in the MC1R locus could cause eumelanin and pheomelanin synthesis in alpaca. Overall, our data refine what is known about the MC1R gene and provide additional information on its role in alpaca pigmentation.

  10. Characterization of 25 full-length S-RNase alleles, including flanking regions, from a pool of resequenced apple cultivars.

    PubMed

    De Franceschi, Paolo; Bianco, Luca; Cestaro, Alessandro; Dondini, Luca; Velasco, Riccardo

    2018-06-01

    Data obtained from Illumina resequencing of 63 apple cultivars were used to obtain full-length S-RNase sequences using a strategy based on both alignment and de novo assembly of reads. The reproductive biology of apple is regulated by the S-RNase-based gametophytic self-incompatibility system, that is genetically controlled by the single, multi-genic and multi-allelic S locus. Resequencing of apple cultivars provided a huge amount of genetic data, that can be aligned to the reference genome in order to characterize variation to a genome-wide level. However, this approach is not immediately adaptable to the S-locus, due to some peculiar features such as the high degree of polymorphism, lack of colinearity between haplotypes and extensive presence of repetitive elements. In this study we describe a dedicated procedure aimed at characterizing S-RNase alleles from resequenced cultivars. The S-genotype of 63 apple accessions is reported; the full length coding sequence was determined for the 25 S-RNase alleles present in the 63 resequenced cultivars; these included 10 previously incomplete sequences (S5, S6a, S6b, S8, S11, S23, S39, S46, S50 and S58). Moreover, sequence divergence clearly suggests that alleles S6a and S6b, proposed to be neutral variants of the same alleles, should be instead considered different specificities. The promoter sequences have also been analyzed, highlighting regions of homology conserved among all the alleles.

  11. Expressed gene sequence of the IFN-gamma-response chemokine CXCL9 of cattle, horses, and swine

    USDA-ARS's Scientific Manuscript database

    This report describes the cloning and characterization of expressed gene sequences of bovine, equine, and swine CXCL9 from RNA obtained from peripheral blood mononuclear cell (PBMC) or other tissues. The bovine coding region was 378 nucleotides in length, while the equine and swine coding regions w...

  12. [Construction and expression of recombinant lentiviral vectors of AKT2,PDK1 and BAD].

    PubMed

    Zhu, Jing; Chen, Bo-Jiang; Huang, Na; Li, Wei-Min

    2014-03-01

    To construct human protein kinase B (AKT2), phosphoinositide-dependent kinase 1 (PDK1) and bcl-2-associated death protein (BAD) lentiviral expression vectors, and to determine their expression in 293T cells. Total RNA was extracted from lung cancer tissues. The full-length coding regions of human AKT2, BAD and PDK1 cDNA were amplified via RT-PCR using specific primers, subcloned into pGEM-T Easy and then sequenced for confirmation. The full-length coding sequence was cut out with a specific restriction enzyme digest and subcloned into pCDF1-MCS2-EF1-copGFP. The plasmids were transfected into 293T cells using the calcium phosphate method. The overexpression of AKT2, BAD and PDK1 was detected by Western blot. AKT2, PDK1 and BAD were subcloned into pCDF1-MCS2-EF1-copGFP, with transfection efficiencies of 100%, 95%, and 90%, respectively. The virus titers were 6.7 x 10^6 PFU/mL in the supernatant. After infection, the AKT2, PDK1 and BAD proteins were detected by Western blot. The lentiviral vectors pCDF1-MCS2-EF1-copGFP containing AKT2, BAD and PDK1 were successfully constructed and expressed in 293T cells.

  13. Human somatostatin I: sequence of the cDNA.

    PubMed Central

    Shen, L P; Pictet, R L; Rutter, W J

    1982-01-01

    RNA has been isolated from a human pancreatic somatostatinoma and used to prepare a cDNA library. After prescreening, clones containing somatostatin I sequences were identified by hybridization with an anglerfish somatostatin I-cloned cDNA probe. From the nucleotide sequence of two of these clones, we have deduced an essentially full-length mRNA sequence, including the preprosomatostatin coding region, 105 nucleotides from the 5' untranslated region and the complete 150-nucleotide 3' untranslated region. The coding region predicts a 116-amino acid precursor protein (Mr, 12.727) that contains somatostatin-14 and -28 at its COOH terminus. The predicted amino acid sequence of human somatostatin-28 is identical to that of somatostatin-28 isolated from the porcine and ovine species. A comparison of the amino acid sequences of human and anglerfish preprosomatostatin I indicated that the COOH-terminal region encoding somatostatin-14 and the adjacent 6 amino acids are highly conserved, whereas the remainder of the molecule, including the signal peptide region, is more divergent. However, many of the amino acid differences found in the pro region of the human and anglerfish proteins are conservative changes. This suggests that the propeptides have a similar secondary structure, which in turn may imply a biological function for this region of the molecule. PMID:6126875

  14. Massive Collection of Full-Length Complementary DNA Clones and Microarray Analyses: Keys to Rice Transcriptome Analysis

    NASA Astrophysics Data System (ADS)

    Kikuchi, Shoshi

    2009-02-01

    Completion of the high-precision genome sequence analysis of rice led to the collection of about 35,000 full-length cDNA clones and the determination of their complete sequences. Mapping of these full-length cDNA sequences has given us information on (1) the number of genes expressed in the rice genome; (2) the start and end positions and exon-intron structures of rice genes; (3) alternative transcripts; (4) possible encoded proteins; (5) non-protein-coding (np) RNAs; (6) the density of gene localization on the chromosome; (7) setting the parameters of gene prediction programs; and (8) the construction of a microarray system that monitors global gene expression. Manual curation for rice gene annotation by using mapping information on full-length cDNA and EST assemblies has revealed about 32,000 expressed genes in the rice genome. Analysis of major gene families, such as those encoding membrane transport proteins (pumps, ion channels, and secondary transporters), along with the evolution from bacteria to higher animals and plants, reveals how gene numbers have increased through adaptation to circumstances. Family-based gene annotation also gives us a new way of comparing organisms. Massive amounts of data on gene expression under many kinds of physiological conditions are being accumulated in rice oligoarrays (22K and 44K) based on full-length cDNA sequences. Cluster analyses of genes that have the same promoter cis-elements, that have similar expression profiles, or that encode enzymes in the same metabolic pathways or signal transduction cascades give us clues to understanding the networks of gene expression in rice. As a tool for that purpose, we recently developed "RiCES", a tool for searching for cis-elements in the promoter regions of clustered genes.

  15. The cDNA-derived amino acid sequence of hemoglobin II from Lucina pectinata.

    PubMed

    Torres-Mercado, Elineth; Renta, Jessicca Y; Rodríguez, Yolanda; López-Garriga, Juan; Cadilla, Carmen L

    2003-11-01

    Hemoglobin II from the clam Lucina pectinata is an oxygen-reactive protein with a unique structural organization in the heme pocket involving residues Gln65 (E7), Tyr30 (B10), Phe44 (CD1), and Phe69 (E11). We employed the reverse transcriptase-polymerase chain reaction (RT-PCR) and methods to synthesize various cDNA(HbII). An initial 300-bp cDNA clone was amplified from total RNA by RT-PCR using degenerate oligonucleotides. Gene-specific primers derived from the HbII-partial cDNA sequence were used to obtain the 5' and 3' ends of the cDNA by RACE. The length of the HbII cDNA, estimated from overlapping clones, was approximately 2114 bases. Northern blot analysis revealed that the mRNA size of HbII agrees with the estimated size using cDNA data. The coding region of the full-length HbII cDNA codes for 151 amino acids. The calculated molecular weight of HbII, including the heme group and acetylated N-terminal residue, is 17,654.07 Da.

  16. Directional RNA-seq reveals highly complex condition-dependent transcriptomes in E. coli K12 through accurate full-length transcripts assembling.

    PubMed

    Li, Shan; Dong, Xia; Su, Zhengchang

    2013-07-30

    Although prokaryotic gene transcription has been studied over decades, many aspects of the process remain poorly understood. Particularly, recent studies have revealed that transcriptomes in many prokaryotes are far more complex than previously thought. Genes in an operon are often alternatively and dynamically transcribed under different conditions, and a large portion of genes and intergenic regions have antisense RNA (asRNA) and non-coding RNA (ncRNA) transcripts, respectively. Ironically, similar studies have not been conducted in the model bacterium E. coli K12, thus it is unknown whether or not the bacterium possesses similarly complex transcriptomes. Furthermore, although RNA-seq has become the major method for analyzing the complexity of prokaryotic transcriptomes, it is still a challenging task to accurately assemble full length transcripts using short RNA-seq reads. To fill these gaps, we have profiled the transcriptomes of E. coli K12 under different culture conditions and growth phases using a highly specific directional RNA-seq technique that can capture various types of transcripts in the bacterial cells, combined with a highly accurate and robust algorithm and tool TruHMM (http://bioinfolab.uncc.edu/TruHmm_package/) for assembling full length transcripts. We found that 46.9-63.4% of expressed operons were utilized in their putative alternative forms, 72.23-89.54% of genes had putative asRNA transcripts and 51.37-72.74% of intergenic regions had putative ncRNA transcripts under different culture conditions and growth phases. As has been demonstrated in many other prokaryotes, E. coli K12 also has highly complex and dynamic transcriptomes under different culture conditions and growth phases. Such complex and dynamic transcriptomes might play important roles in the physiology of the bacterium. TruHMM is a highly accurate and robust algorithm for assembling full-length transcripts in prokaryotes using directional RNA-seq short reads.

  17. Directional RNA-seq reveals highly complex condition-dependent transcriptomes in E. coli K12 through accurate full-length transcripts assembling

    PubMed Central

    2013-01-01

    Background: Although prokaryotic gene transcription has been studied over decades, many aspects of the process remain poorly understood. Particularly, recent studies have revealed that transcriptomes in many prokaryotes are far more complex than previously thought. Genes in an operon are often alternatively and dynamically transcribed under different conditions, and a large portion of genes and intergenic regions have antisense RNA (asRNA) and non-coding RNA (ncRNA) transcripts, respectively. Ironically, similar studies have not been conducted in the model bacterium E. coli K12, thus it is unknown whether or not the bacterium possesses similarly complex transcriptomes. Furthermore, although RNA-seq has become the major method for analyzing the complexity of prokaryotic transcriptomes, it is still a challenging task to accurately assemble full length transcripts using short RNA-seq reads. Results: To fill these gaps, we have profiled the transcriptomes of E. coli K12 under different culture conditions and growth phases using a highly specific directional RNA-seq technique that can capture various types of transcripts in the bacterial cells, combined with a highly accurate and robust algorithm and tool TruHMM (http://bioinfolab.uncc.edu/TruHmm_package/) for assembling full length transcripts. We found that 46.9-63.4% of expressed operons were utilized in their putative alternative forms, 72.23-89.54% of genes had putative asRNA transcripts and 51.37-72.74% of intergenic regions had putative ncRNA transcripts under different culture conditions and growth phases. Conclusions: As has been demonstrated in many other prokaryotes, E. coli K12 also has highly complex and dynamic transcriptomes under different culture conditions and growth phases. Such complex and dynamic transcriptomes might play important roles in the physiology of the bacterium. TruHMM is a highly accurate and robust algorithm for assembling full-length transcripts in prokaryotes using directional RNA-seq short reads. PMID:23899370

  18. The Alpaca Melanocortin 1 Receptor: Gene Mutations, Transcripts, and Relative Levels of Expression in Ventral Skin Biopsies

    PubMed Central

    Renieri, Carlo; La Terza, Antonietta

    2015-01-01

    The objectives of the present study were to characterize the MC1R gene, its transcripts and the single nucleotide polymorphisms (SNPs) associated with coat color in alpaca. Full length cDNA amplification revealed the presence of two transcripts, named F1 and F2, differing only in the length of their 5′-terminal untranslated region (UTR) sequences and presenting a color specific expression. Whereas the F1 transcript was common to white and colored (black and brown) alpaca phenotypes, the shorter F2 transcript was specific to white alpaca. Further sequencing of the MC1R gene in white and colored alpaca identified a total of twelve SNPs; of these, nine (four silent mutations (c.126C>A, c.354T>C, c.618G>A, and c.933G>A) and five missense mutations (c.82A>G, c.92C>T, c.259A>G, c.376A>G, and c.901C>T)) were observed in the coding region and three in the 3′UTR. A 4 bp deletion (c.224_227del) was also identified in the coding region. Molecular segregation analysis uncovered that the combinatory mutations in the MC1R locus could cause eumelanin and pheomelanin synthesis in alpaca. Overall, our data refine what is known about the MC1R gene and provide additional information on its role in alpaca pigmentation. PMID:25685836

  19. Palindromic repetitive DNA elements with coding potential in Methanocaldococcus jannaschii.

    PubMed

    Suyama, Mikita; Lathe, Warren C; Bork, Peer

    2005-10-10

    We have identified 141 novel palindromic repetitive elements in the genome of euryarchaeon Methanocaldococcus jannaschii. The total length of these elements is 14.3kb, which corresponds to 0.9% of the total genomic sequence and 6.3% of all extragenic regions. The elements can be divided into three groups (MJRE1-3) based on the sequence similarity. The low sequence identity within each of the groups suggests rather old origin of these elements in M. jannaschii. Three MJRE2 elements were located within the protein coding regions without disrupting the coding potential of the host genes, indicating that insertion of repeats might be a widespread mechanism to enhance sequence diversity in coding regions.

  20. cDNA cloning and characterization of Type I procollagen alpha1 chain in the skate Raja kenojei.

    PubMed

    Hwang, Jae-Ho; Yokoyama, Yoshihiro; Mizuta, Shoshi; Yoshinaka, Reiji

    2006-05-01

    A full-length cDNA of the Type I procollagen alpha1 [pro-alpha1(I)] chain (4388 bp), coding for 1463 amino acid residues in the total length, was determined by RACE PCR using a cDNA library constructed from a 4-week embryo of the skate Raja kenojei. The helical region of the skate pro-alpha1(I) chain consisted of 1014 amino acid residues, the same as other fibrillar collagen alpha chains from higher vertebrates. Comparison of the denaturation temperatures of Type I collagens from the skate, rainbow trout (Oncorhynchus mykiss) and rat (Rattus norvegicus) revealed that the number of Gly-Pro-Pro and Gly-Gly in the alpha1(I) chains could be directly related to the thermal stability of the helix. The expression property of the skate pro-alpha1(I) chain mRNA and phylogenetic analysis with other vertebrate pro-alpha1(I) chains suggested that skate pro-alpha1(I) chain could be a precursor form of the skate Type I collagen alpha1 chain. The present study is the first evidence for the primary structure of full-length pro-alpha1(I) chain in an elasmobranch.

  1. Complete sequence of Tvv1, a family of Ty 1 copia-like retrotransposons of Vitis vinifera L., reconstituted by chromosome walking.

    PubMed

    Pelsy, F.; Merdinoglu, D.

    2002-09-01

    A chromosome-walking strategy was used to sequence and characterize retrotransposons in the grapevine genome. The reconstitution of a family of retroelements, named Tvv1, was achieved by six successive steps. These elements share a single, highly conserved open reading frame 4,153 nucleotides-long, putatively encoding the gag, pro, int, rt and rh proteins. Comparison of the Tvv1 open reading frame coding potential with those of drosophila copia and tobacco Tnt1, revealed that Tvv1 is closely related to Ty 1 copia-like retrotransposons. A highly variable untranslated leader region, upstream of the open reading frame, allowed us to differentiate Tvv1 variants, which represent a family of at least 28 copies, in varying sizes. This internal region is flanked by two long terminal repeats in direct orientation, sized between 149 and 157 bp. Among elements theoretically sized from 4,970 to 5,550 bp, we describe the full-length sequence of a reference element Tvv1-1, 5,343 nucleotides-long. The full-length sequence of Tvv1-1 compared to pea PDR1 shows a 53.3% identity. In addition, both elements contain long terminal repeats of nearly the same size in which the U5 region could be entirely absent. Therefore, we assume that Tvv1 and PDR1 could constitute a particular class of short LTRs retroelements.

  2. The full mitochondrial genome sequence of Raillietina tetragona from chicken (Cestoda: Davaineidae).

    PubMed

    Liang, Jian-Ying; Lin, Rui-Qing

    2016-11-01

    In the present study, the complete mitochondrial DNA (mtDNA) sequence of Raillietina tetragona was sequenced and its gene content and genome organization were compared with those of other tapeworms. The complete mt genome sequence of R. tetragona is 14,444 bp in length. It contains 12 protein-coding genes, two ribosomal RNA genes, 22 transfer RNA genes, and two non-coding regions. All genes are transcribed in the same direction and have a nucleotide composition high in A and T. The A + T content of the complete mt genome is 71.4% for R. tetragona. The R. tetragona mt genome sequence provides a novel mtDNA marker for studying the molecular epidemiology and population genetics of Raillietina and has implications for the molecular diagnosis of chicken cestodosis caused by Raillietina.

  3. Complete mitochondrial genome from South American catfish Pseudoplatystoma reticulatum (Eigenmann & Eigenmann) and its impact in Siluriformes phylogenetic tree.

    PubMed

    Villela, Luciana Cristine Vasques; Alves, Anderson Luis; Varela, Eduardo Sousa; Yamagishi, Michel Eduardo Beleza; Giachetto, Poliana Fernanda; da Silva, Naiara Milagres Augusto; Ponzetto, Josi Margarete; Paiva, Samuel Rezende; Caetano, Alexandre Rodrigues

    2017-02-01

    The cachara (Pseudoplatystoma reticulatum) is a Neotropical freshwater catfish from family Pimelodidae (Siluriformes) native to Brazil. The species is of relative economic importance for local aquaculture production and basic biological information is under development to help boost efforts to domesticate and raise the species in commercial systems. The complete cachara mitochondrial genome was obtained by assembling Illumina RNA-seq data from pooled samples. The full mitogenome was found to be 16,576 bp in length, showing the same basic structure, order, and genetic organization observed in other Pimelodidae, with 13 protein-coding genes, 2 rRNA genes, 22 tRNAs, and a control region. Observed base composition was 24.63% T, 28.47% C, 31.45% A, and 15.44% G. With the exception of NAD6 and eight tRNAs, all of the observed mitochondrial genes were found to be coded on the H strand. A total of 107 SNPs were identified in P. reticulatum mtDNA, 67 of which were located in coding regions. Of these SNPs, 10 result in amino acid changes. Analysis of the obtained sequence with 94 publicly available full Siluriformes mitogenomes resulted in a phylogenetic tree that generally agreed with available phylogenetic proposals for the order. The first report of the complete Pseudoplatystoma reticulatum mitochondrial genome sequence revealed general gene organization, structure, content, and order similar to most vertebrates. Specific sequence and content features were observed and may have functional attributes which are now available for further investigation.

  4. HLA-E regulatory and coding region variability and haplotypes in a Brazilian population sample.

    PubMed

    Ramalho, Jaqueline; Veiga-Castelli, Luciana C; Donadi, Eduardo A; Mendes-Junior, Celso T; Castelli, Erick C

    2017-11-01

    The HLA-E gene is characterized by low but wide expression in different tissues. HLA-E is considered a conserved gene, being one of the least polymorphic class I HLA genes. The HLA-E molecule interacts with Natural Killer cell receptors and T lymphocyte receptors, and might activate or inhibit immune responses depending on the peptide associated with HLA-E and on the receptors with which HLA-E interacts. Variable sites within the HLA-E regulatory and coding segments may influence the gene function by modifying its expression pattern or encoded molecule, thus influencing its interaction with receptors and the peptide. Here we propose an approach to evaluate the gene structure, haplotype pattern and the complete HLA-E variability, including regulatory (promoter and 3'UTR) and coding segments (with introns), by using massively parallel sequencing. We investigated the variability of 420 samples from a very admixed population such as Brazilians by using this approach. Considering a segment of about 7kb, 63 variable sites were detected, arranged into 75 extended haplotypes. We detected 37 different promoter sequences (but few frequent ones), 27 different coding sequences (15 representing new HLA-E alleles) and 12 haplotypes at the 3'UTR segment, two of them presenting a summed frequency of 90%. Despite the number of coding alleles, they encode mainly two different full-length molecules, known as E*01:01 and E*01:03, which correspond to about 90% of all. In addition, differently from what has been previously observed for other non-classical HLA genes, the relationship among the HLA-E promoter, coding and 3'UTR haplotypes is not straightforward because the same promoter and 3'UTR haplotypes were many times associated with different HLA-E coding haplotypes. These data reinforce the presence of only two main full-length HLA-E molecules encoded by the many HLA-E alleles detected in our population sample. In addition, these data indicate that the distal HLA-E promoter is by far the most variable segment. Further analyses involving the binding of transcription factors and non-coding RNAs, as well as the HLA-E expression in different tissues, are necessary to evaluate whether these variable sites at regulatory segments (or even at the coding sequence) may influence the gene expression profile.

  5. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Yang, Zhipan; Lu, Qingtao; Wen, Xiaogang

    Highlights: ► Rice rubisco activase promoter was analyzed in transgenic Arabidopsis system. ► Region conferring tissue specific and light inducible expression of Rca was identified. ► -58 to +43 bp region mediates tissue-specific expression of rice Rca. ► Light inducible expression of rice Rca is mediated by -297 to -58 bp region. ► Rice nuclear proteins bind specifically with the light inducible region. Abstract: To gain a better understanding of the regulatory mechanism of the rice rubisco activase (Rca) gene, variants of the Rca gene promoter (one full-length and four deletion mutants) fused to the coding region of the bacterial reporter gene β-glucuronidase (GUS) were introduced into Arabidopsis via Agrobacterium-mediated transformation. Our results show that a 340 bp fragment spanning from -297 to +43 bp relative to the transcription initiation site is enough to promote tissue-specific and light-inducible expression of the rice Rca gene as done by the full-length promoter (-1428 to +43 bp). Further deletion analysis indicated that the region conferring tissue-specificity of Rca expression is localized within a 105 bp fragment from -58 to +43 bp, while light-inducible expression of Rca is mediated by the region from -297 to -58 bp. Gel shift assays and competition experiments demonstrated that rice nuclear proteins bind specifically with the fragment conferring light responsiveness at more than one binding site. This implies that multiple cis-elements may be involved in light-induced expression of the rice Rca gene. These works provide a useful reference for understanding transcriptional regulation mechanism of the rice Rca gene, and lay a strong foundation for further detection of related cis-elements and trans-factors.

  6. Comparisons between Arabidopsis thaliana and Drosophila melanogaster in relation to Coding and Noncoding Sequence Length and Gene Expression

    PubMed Central

    Caldwell, Rachel; Lin, Yan-Xia; Zhang, Ren

    2015-01-01

    There is a continuing interest in the analysis of gene architecture and gene expression to determine the relationship that may exist. Advances in high-quality sequencing technologies and large-scale resource datasets have increased the understanding of relationships and cross-referencing of expression data to the large genome data. Although a negative correlation between expression level and gene (especially transcript) length has been generally accepted, there have been some conflicting results arising from the literature concerning the impacts of different regions of genes, and the underlying reason is not well understood. The research aims to apply quantile regression techniques for statistical analysis of coding and noncoding sequence length and gene expression data in the plant, Arabidopsis thaliana, and fruit fly, Drosophila melanogaster, to determine if a relationship exists and if there is any variation or similarities between these species. The quantile regression analysis found that the coding sequence length and gene expression correlations varied, and similarities emerged for the noncoding sequence length (5′ and 3′ UTRs) between animal and plant species. In conclusion, the information described in this study provides the basis for further exploration into gene regulation with regard to coding and noncoding sequence length. PMID:26114098

  7. The complete mitochondrial genome of Rapana venosa (Gastropoda, Muricidae).

    PubMed

    Sun, Xiujun; Yang, Aiguo

    2016-01-01

    The complete mitochondrial (mt) genome of the veined rapa whelk, Rapana venosa, was determined using genome walking techniques in this study. The total length of the mt genome sequence of R. venosa was 15,271 bp, which is comparable to the reported Muricidae mitogenomes to date. It contained 13 protein-coding genes, 21 transfer RNA genes, and two ribosomal RNA genes. A bias towards a higher representation of nucleotides A and T (69%) was detected in the mt genome of R. venosa. A small number of non-coding nucleotides (302 bp) was detected, and the largest non-coding region was 74 bp in length.

  8. Genomic Analysis of Vaccine-Derived Poliovirus Strains in Stool Specimens by Combination of Full-Length PCR and Oligonucleotide Microarray Hybridization

    PubMed Central

    Laassri, Majid; Dragunsky, Eugenia; Enterline, Joan; Eremeeva, Tatiana; Ivanova, Olga; Lottenbach, Kathleen; Belshe, Robert; Chumakov, Konstantin

    2005-01-01

    Sabin strains of poliovirus used in the manufacture of oral poliovirus vaccine (OPV) are prone to genetic variations that occur during growth in cell cultures and the organisms of vaccine recipients. Such derivative viruses often have increased neurovirulence and transmissibility, and in some cases they can reestablish chains of transmission in human populations. Monitoring for vaccine-derived polioviruses is an important part of the worldwide campaign to eradicate poliomyelitis. Analysis of vaccine-derived polioviruses requires, as a first step, their isolation in cell cultures, which takes significant time and may yield viral stocks that are not fully representative of the strains present in the original sample. Here we demonstrate that full-length viral cDNA can be PCR amplified directly from stool samples and immediately subjected to genomic analysis by oligonucleotide microarray hybridization and nucleotide sequencing. Most fecal samples from healthy children who received OPV were found to contain variants of Sabin vaccine viruses. Sequence changes in the 5′ untranslated region were common, as were changes in the VP1-coding region, including changes in a major antigenic site. Analysis of stool samples taken from cases of acute flaccid paralysis revealed the presence of mixtures of recombinant polioviruses, in addition to the emergence of new sequence variants. Avoiding the need for cell culture isolation dramatically shortened the time needed for identification and analysis of vaccine-derived polioviruses and could be useful for preliminary screening of clinical samples. The amplified full-length viral cDNA can be archived and used to recover live virus for further virological studies. PMID:15956413

  9. Specific DNA binding of the two chicken Deformed family homeodomain proteins, Chox-1.4 and Chox-a.

    PubMed Central

    Sasaki, H; Yokoyama, E; Kuroiwa, A

    1990-01-01

    The cDNA clones encoding two chicken Deformed (Dfd) family homeobox containing genes Chox-1.4 and Chox-a were isolated. Comparison of their amino acid sequences with another chicken Dfd family homeodomain protein and with those of mouse homologues revealed that strong homologies are located in the amino terminal regions and around the homeodomains. Although homologies in other regions were relatively low, some short conserved sequences were also identified. E. coli-made full length proteins were purified and used for the production of specific antibodies and for DNA binding studies. The binding profiles of these proteins to the 5'-leader and 5'-upstream sequences of Chox-1.4 and Chox-a coding regions were analyzed by immunoprecipitation and DNase I footprint assays. These two Chox proteins bound to the same sites in the 5'-flanking sequences of their coding regions with various affinities and their binding affinities to each site were nearly the same. The consensus sequences of the high and low affinity binding sites were TAATGA(C/G) and CTAATTTT, respectively. A clustered binding site was identified in the 5'-upstream of the Chox-a gene, suggesting that this clustered binding site works as a cis-regulatory element for auto- and/or cross-regulation of Chox-a gene expression. PMID:1970866
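
    As a hedged illustration of how the two consensus sequences quoted above could be located in a DNA sequence, the Python sketch below scans both strands with regular expressions. The function names and the toy sequence are assumptions made for this example; the study itself mapped the sites by immunoprecipitation and DNase I footprinting, not by pattern matching.

```python
from __future__ import annotations

import re

# Consensus sites quoted in the abstract; "(C/G)" is written as the class [CG].
HIGH_AFFINITY = re.compile(r"TAATGA[CG]")
LOW_AFFINITY = re.compile(r"CTAATTTT")

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an upper- or lower-case DNA string."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def find_sites(seq: str, pattern: re.Pattern) -> list[tuple[int, str]]:
    """Return (0-based forward-strand start, strand) for each match."""
    seq = seq.upper()
    hits = [(m.start(), "+") for m in pattern.finditer(seq)]
    for m in pattern.finditer(reverse_complement(seq)):
        # Map the reverse-strand match back to forward-strand coordinates.
        hits.append((len(seq) - m.end(), "-"))
    return sorted(hits)


# Hypothetical toy sequence: one high-affinity site on the forward strand
# and one low-affinity site on the reverse strand.
toy = "GGTAATGACTTAAAATTAGCC"
high_hits = find_sites(toy, HIGH_AFFINITY)
low_hits = find_sites(toy, LOW_AFFINITY)
```

    Exact matching is the simplest possible choice here; a position weight matrix would be needed to rank sites by predicted affinity.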

  10. Efficiency of VIGS and gene expression in a novel bipartite potexvirus vector delivery system as a function of strength of TGB1 silencing suppression.

    PubMed

    Lim, Hyoun-Sub; Vaira, Anna Maria; Domier, Leslie L; Lee, Sung Chul; Kim, Hong Gi; Hammond, John

    2010-06-20

    We have developed plant virus-based vectors for virus-induced gene silencing (VIGS) and protein expression, based on Alternanthera mosaic virus (AltMV), for infection of a wide range of host plants including Nicotiana benthamiana and Arabidopsis thaliana by either mechanical inoculation of in vitro transcripts or via agroinfiltration. In vivo transcripts produced by co-agroinfiltration of bacteriophage T7 RNA polymerase resulted in T7-driven AltMV infection from a binary vector in the absence of the Cauliflower mosaic virus 35S promoter. An artificial bipartite viral vector delivery system was created by separating the AltMV RNA-dependent RNA polymerase and Triple Gene Block (TGB)123-Coat protein (CP) coding regions into two constructs each bearing the AltMV 5' and 3' non-coding regions, which recombined in planta to generate a full-length AltMV genome. Substitution of TGB1 L(88)P, and equivalent changes in other potexvirus TGB1 proteins, affected RNA silencing suppression efficacy and suitability of the vectors from protein expression to VIGS.

  11. An integrated PCR colony hybridization approach to screen cDNA libraries for full-length coding sequences.

    PubMed

    Pollier, Jacob; González-Guzmán, Miguel; Ardiles-Diaz, Wilson; Geelen, Danny; Goossens, Alain

    2011-01-01

    cDNA-Amplified Fragment Length Polymorphism (cDNA-AFLP) is a commonly used technique for genome-wide expression analysis that does not require prior sequence knowledge. Typically, quantitative expression data and sequence information are obtained for a large number of differentially expressed gene tags. However, most of the gene tags do not correspond to full-length (FL) coding sequences, which is a prerequisite for subsequent functional analysis. A medium-throughput screening strategy, based on integration of polymerase chain reaction (PCR) and colony hybridization, was developed that allows in parallel screening of a cDNA library for FL clones corresponding to incomplete cDNAs. The method was applied to screen for the FL open reading frames of a selection of 163 cDNA-AFLP tags from three different medicinal plants, leading to the identification of 109 (67%) FL clones. Furthermore, the protocol allows for the use of multiple probes in a single hybridization event, thus significantly increasing the throughput when screening for rare transcripts. The presented strategy offers an efficient method for the conversion of incomplete expressed sequence tags (ESTs), such as cDNA-AFLP tags, to FL-coding sequences.

  12. Desmoglein 4 diversity and correlation analysis with coat color in goat.

    PubMed

    E, G X; Zhao, Y J; Ma, Y H; Cao, G L; He, J N; Na, R S; Zhao, Z Q; Jiang, C D; Zhang, J H; Arlvd, S; Chen, L P; Qiu, X Y; Hu, W; Huang, Y F

    2016-03-04

    Desmoglein 4 (DSG4) has an important role in the development of wool traits in domestic animals. The full-length DSG4 gene, which contains 3918 bp and a complete open reading frame and encodes a 1040-amino acid protein, was amplified from Liaoning cashmere goat. The sequence was compared with that of DSG4 from other animals and the results show that the DSG4 coding region is consistent with interspecies conservation. Thirteen single-nucleotide polymorphisms (SNPs) were identified in a highly variable region of DSG4, and one SNP (M-1, G>T) was significantly correlated with white and black coat color in goat. Haplotype distribution of the highly variable region of DSG4 was assessed in 179 individuals from seven goat breeds to investigate its association with coat color and its differentiation among populations. However, no signature result was found indicating that DSG4 haplotypes are related to goat coat color.

  13. Cloning, characterization, and expression of Cytochrome b ( Cytb)—a key mitochondrial gene from Prorocentrum donghaiense

    NASA Astrophysics Data System (ADS)

    Zhao, Liyuan; Mi, Tiezhu; Zhen, Yu; Yu, Zhigang

    2012-05-01

    Mitochondrial cytochrome b (Cytb), one of the few proteins encoded by the mitochondrial DNA, plays an important role in transferring electrons. As a mitochondrial gene, it has been widely used for phylogenetic analysis. Previously, a 949-bp fragment of the coding gene and mRNA editing were characterized from Prorocentrum donghaiense, which might prove useful for resolving P. donghaiense from closely related species. However, the full-length coding region has not been characterized. In this study, we used rapid amplification of cDNA ends (RACE) to obtain full-length, 1 124 bp cDNA. Cytb transcript contained a standard initiation codon ATG, but did not have a recognizable stop codon. Homology comparison showed that the P. donghaiense Cytb had a high sequence identity to Cytb sequences from other dinoflagellate species. Phylogenetic analysis placed Cytb from P. donghaiense in the clade of dinoflagellates and it clustered together strongly with that from P. minimum. Based on the full-length sequence, we inferred 32 editing events at different positions, accounting for 2.93% of the Cytb gene. 34.4% (11) of the changes were A to G, 25% (8) were T to C, and 25% (8) were C to U, with smaller proportions of G to C and G to A edits (9.4% (3) and 6.2% (2), respectively). The expression level of the Cytb transcript was quantified by real-time PCR with a TaqMan probe at different times during the whole growth phase. The average Cytb transcript was present at 39.27±7.46 copies of cDNA per cell during the whole growth cycle, and the expression of Cytb was relatively stable over the different phases. These results deepen our understanding of the structure and characteristics of Cytb in P. donghaiense, and confirmed that Cytb in P. donghaiense is a candidate reference gene for studying the expression of other genes.
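
    The breakdown of editing types reported above can be re-derived from the raw counts. The following Python sketch is only an arithmetic check under the assumption that the five counts given in parentheses are the complete set; the dictionary and function names are made up for this example.

```python
# Edit counts as reported in the abstract (32 inferred editing events).
edit_counts = {"A>G": 11, "T>C": 8, "C>U": 8, "G>C": 3, "G>A": 2}


def edit_proportions(counts):
    """Return each edit type's share of all edits, in percent."""
    total = sum(counts.values())
    return {kind: 100.0 * n / total for kind, n in counts.items()}


total_edits = sum(edit_counts.values())
proportions = edit_proportions(edit_counts)

for kind, pct in proportions.items():
    # One decimal place, matching the precision used in the abstract.
    print(f"{kind}: {edit_counts[kind]} edits, {pct:.1f}%")
```

    The counts sum to 32, and the computed shares agree with the reported percentages to within rounding.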

  14. Termination and read-through proteins encoded by genome segment 9 of Colorado tick fever virus.

    PubMed

    Mohd Jaafar, Fauziah; Attoui, Houssam; De Micco, Philippe; De Lamballerie, Xavier

    2004-08-01

    Genome segment 9 (Seg-9) of Colorado tick fever virus (CTFV) is 1884 bp long and contains a large open reading frame (ORF; 1845 nt in length overall), although a single in-frame stop codon (at nt 1052-1054) reduces the ORF coding capacity by approximately 40 %. However, analyses of highly conserved RNA sequences in the vicinity of the stop codon indicate that it belongs to a class of 'leaky terminators'. The third nucleotide positions in codons situated both before and after the stop codon show the highest variability, suggesting that both regions are translated during virus replication. This also suggests that the stop signal is functionally leaky, allowing read-through translation to occur. Indeed, both the truncated 'termination' protein and the full-length 'read-through' protein (VP9 and VP9', respectively) were detected in CTFV-infected cells, in cells transfected with a plasmid expressing only Seg-9 protein products, and in the in vitro translation products from undenatured Seg-9 ssRNA. The ratios of full-length and truncated proteins generated suggest that read-through may be down-regulated by other viral proteins. Western blot analysis of infected cells and purified CTFV showed that VP9 is a structural component of the virion, while VP9' is a non-structural protein.
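
    To make concrete how an internal in-frame stop codon reduces coding capacity, here is a minimal Python sketch. It assumes the standard stop codons and uses a hypothetical toy ORF, not the Seg-9 sequence; the function names are invented for this example, and the read-through product is assumed to decode the leaky stop position as an amino acid.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def first_in_frame_stop(orf: str):
    """Return the 0-based index of the first in-frame stop codon, or None."""
    orf = orf.upper().replace("U", "T")
    for i in range(0, len(orf) - 2, 3):
        if orf[i:i + 3] in STOP_CODONS:
            return i
    return None


def coding_capacity_lost(orf: str) -> float:
    """Fraction of the read-through product lost at the first internal stop.

    The read-through length counts every codon except a terminal stop.
    """
    n_codons = len(orf) // 3
    last = orf[3 * (n_codons - 1):3 * n_codons].upper().replace("U", "T")
    full_len = n_codons - 1 if last in STOP_CODONS else n_codons
    stop = first_in_frame_stop(orf)
    if stop is None or stop // 3 >= full_len:
        return 0.0  # no internal stop: nothing is lost
    return (full_len - stop // 3) / full_len


# Toy ORF (assumption): 6 sense codons, a leaky TGA, 3 sense codons, final TAA.
toy_orf = "ATG" + "GCT" * 5 + "TGA" + "GCT" * 3 + "TAA"
lost = coding_capacity_lost(toy_orf)
```

    In this toy case the truncated product has 6 of the 10 read-through codons, so 40% of the coding capacity is lost; the toy was sized to echo the approximate figure in the abstract and says nothing about the real sequence.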

  15. The complete mitochondrial genome of the Giant Manta ray, Manta birostris.

    PubMed

    Hinojosa-Alvarez, Silvia; Díaz-Jaimes, Pindaro; Marcet-Houben, Marina; Gabaldón, Toni

    2015-01-01

    The complete mitochondrial genome of the giant manta ray (Manta birostris) consists of 18,075 bp with rich A + T and low G content. Gene organization and length are similar to other species of ray. It comprises 13 protein-coding genes, 2 rRNA genes, 23 tRNA genes and 1 non-coding sequence, and the control region. We identified an AT tandem repeat region, similar to that reported in Mobula japanica.

  16. A Comparison of Six MMPI Short Forms: Code Type Correspondence and Indices of Psychopathology.

    ERIC Educational Resources Information Center

    Willcockson, James C.; And Others

    1983-01-01

    Compared six Minnesota Multiphasic Personality Inventory (MMPI) short forms with the full-length MMPI for ability to identify code-types and indices of psychopathology in renal dialysis patients (N=53) and paranoid schizophrenics (N=58). Results suggested that the accuracy of the short forms fluctuates for different patient populations and…

  17. The complete chloroplast genome sequence of Hibiscus syriacus.

    PubMed

    Kwon, Hae-Yun; Kim, Joon-Hyeok; Kim, Sea-Hyun; Park, Ji-Min; Lee, Hyoshin

    2016-09-01

    The complete chloroplast genome sequence of Hibiscus syriacus L. is presented in this study. The genome is composed of 161 019 bp in length, with a typical circular structure containing a pair of inverted repeats of 25 745 bp of length separated by a large single-copy region and a small single-copy region of 89 698 bp and 19 831 bp of length, respectively. The overall GC content is 36.8%. One hundred and fourteen genes were annotated, including 81 protein-coding genes, 4 ribosomal RNA genes and 29 transfer RNA genes.
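
    The reported region sizes can be checked against the total genome length, since a quadripartite chloroplast genome consists of the large single-copy region, the small single-copy region and two copies of the inverted repeat. The short Python sketch below performs that check; the function name is an assumption made for this example.

```python
def quadripartite_length(lsc: int, ssc: int, ir: int) -> int:
    """Total length of a genome with LSC, SSC and two inverted-repeat copies."""
    return lsc + ssc + 2 * ir


# Region sizes in bp, as reported in the abstract.
total = quadripartite_length(lsc=89_698, ssc=19_831, ir=25_745)
print(total)
```

    The sum equals the stated genome size of 161 019 bp, so the four figures in the abstract are mutually consistent.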

  18. Feature-selective Attention in Frontoparietal Cortex: Multivoxel Codes Adjust to Prioritize Task-relevant Information.

    PubMed

    Jackson, Jade; Rich, Anina N; Williams, Mark A; Woolgar, Alexandra

    2017-02-01

    Human cognition is characterized by astounding flexibility, enabling us to select appropriate information according to the objectives of our current task. A circuit of frontal and parietal brain regions, often referred to as the frontoparietal attention network or multiple-demand (MD) regions, are believed to play a fundamental role in this flexibility. There is evidence that these regions dynamically adjust their responses to selectively process information that is currently relevant for behavior, as proposed by the "adaptive coding hypothesis" [Duncan, J. An adaptive coding model of neural function in prefrontal cortex. Nature Reviews Neuroscience, 2, 820-829, 2001]. Could this provide a neural mechanism for feature-selective attention, the process by which we preferentially process one feature of a stimulus over another? We used multivariate pattern analysis of fMRI data during a perceptually challenging categorization task to investigate whether the representation of visual object features in the MD regions flexibly adjusts according to task relevance. Participants were trained to categorize visually similar novel objects along two orthogonal stimulus dimensions (length/orientation) and performed short alternating blocks in which only one of these dimensions was relevant. We found that multivoxel patterns of activation in the MD regions encoded the task-relevant distinctions more strongly than the task-irrelevant distinctions: The MD regions discriminated between stimuli of different lengths when length was relevant and between the same objects according to orientation when orientation was relevant. The data suggest a flexible neural system that adjusts its representation of visual objects to preferentially encode stimulus features that are currently relevant for behavior, providing a neural mechanism for feature-selective attention.

  19. Wide Distribution of Mitochondrial Genome Rearrangements in Wild Strains of the Cultivated Basidiomycete Agrocybe aegerita

    PubMed Central

    Barroso, G.; Blesa, S.; Labarere, J.

    1995-01-01

    We used restriction fragment length polymorphisms to examine mitochondrial genome rearrangements in 36 wild strains of the cultivated basidiomycete Agrocybe aegerita, collected from widely distributed locations in Europe. We identified two polymorphic regions within the mitochondrial DNA which varied independently: one carrying the Cox II coding sequence and the other carrying the Cox I, ATP6, and ATP8 coding sequences. Two types of mutations were responsible for the restriction fragment length polymorphisms that we observed and, accordingly, were involved in the A. aegerita mitochondrial genome evolution: (i) point mutations, which resulted in strain-specific mitochondrial markers, and (ii) length mutations due to genome rearrangements, such as deletions, insertions, or duplications. Within each polymorphic region, the length differences defined only two mitochondrial types, suggesting that these length mutations were not randomly generated but resulted from a precise rearrangement mechanism. For each of the two polymorphic regions, the two molecular types were distributed among the 36 strains without obvious correlation with their geographic origin. On the basis of these two polymorphisms, it is possible to define four mitochondrial haplotypes. The four mitochondrial haplotypes could be the result of intermolecular recombination between allelic forms present in the population long enough to reach linkage equilibrium. All of the 36 dikaryotic strains contained only a single mitochondrial type, confirming the previously described mitochondrial sorting out after cytoplasmic mixing in basidiomycetes. PMID:16534984

  20. Characterization of full-length sequenced cDNA inserts (FLIcs) from Atlantic salmon (Salmo salar)

    PubMed Central

    Andreassen, Rune; Lunner, Sigbjørn; Høyheim, Bjørn

    2009-01-01

    Background: Sequencing of the Atlantic salmon genome is now being planned by an international research consortium. Full-length sequenced inserts from cDNAs (FLIcs) are an important tool for correct annotation and clustering of the genomic sequence in any species. The large amount of highly similar duplicate sequences caused by the relatively recent genome duplication in the salmonid ancestor represents a particular challenge for the genome project. FLIcs will therefore be an extremely useful resource for the Atlantic salmon sequencing project. In addition to helping distinguish between duplicate genome regions and determine correct gene structures, FLIcs are an important resource for functional genomic studies and for investigation of regulatory elements controlling gene expression. In contrast to the large number of ESTs available, including the ESTs from 23 developmental and tissue specific cDNA libraries contributed by the Salmon Genome Project (SGP), the number of sequences where the full length of the cDNA insert has been determined has been small. Results: High quality full-length insert sequences from 560 pre-smolt white muscle tissue specific cDNAs were generated, accession numbers [GenBank: BT043497 - BT044056]. Five hundred and ten (91%) of the transcripts were annotated using Gene Ontology (GO) terms and 440 of the FLIcs are likely to contain a complete coding sequence (cCDS). The sequence information was used to identify putative paralogs, characterize salmon Kozak motifs, polyadenylation signal variation and to identify motifs likely to be involved in the regulation of particular genes. Finally, conserved 7-mers in the 3'UTRs were identified, of which some were identical to miRNA target sequences. Conclusion: This paper describes the first Atlantic salmon FLIcs from a tissue and developmental stage specific cDNA library. We have demonstrated that many FLIcs contained a complete coding sequence (cCDS). This suggests that the remaining cDNA libraries generated by SGP represent a valuable cCDS FLIc source. The conservation of 7-mers in 3'UTRs indicates that these motifs are functionally important. Identity between some of these 7-mers and miRNA target sequences suggests that they are miRNA targets in Salmo salar transcripts as well. PMID:19878547

  1. Cloning and expression of a cDNA coding for catalase from zebrafish (Danio rerio).

    PubMed

    Ken, C F; Lin, C T; Wu, J L; Shaw, J F

    2000-06-01

    A full-length complementary DNA (cDNA) clone encoding a catalase was amplified by the rapid amplification of cDNA ends-polymerase chain reaction (RACE-PCR) technique from zebrafish (Danio rerio) mRNA. Nucleotide sequence analysis of this cDNA clone revealed that it comprised a complete open reading frame coding for 526 amino acid residues and that it had a molecular mass of 59 654 Da. The deduced amino acid sequence showed high similarity with the sequences of catalase from swine (86.9%), mouse (85.8%), rat (85%), human (83.7%), fruit fly (75.6%), nematode (71.1%), and yeast (58.6%). The amino acid residues for secondary structures are apparently conserved as they are present in other mammal species. Furthermore, the coding region of zebrafish catalase was introduced into an expression vector, pET-20b(+), and transformed into Escherichia coli expression host BL21(DE3)pLysS. A 60-kDa active catalase protein was expressed and detected by Coomassie blue staining as well as activity staining on polyacrylamide gel following electrophoresis.

  2. Novel Leptospira interrogans protein Lsa32 is expressed during infection and binds laminin and plasminogen.

    PubMed

    Domingos, Renan F; Fernandes, Luis G; Romero, Eliete C; de Morais, Zenaide M; Vasconcellos, Silvio A; Nascimento, Ana L T O

    2015-04-01

    Pathogenic Leptospira is the aetiological agent of leptospirosis, a life-threatening disease of human and veterinary concern. The quest for novel antigens that could mediate host-pathogen interactions is being pursued. Owing to their location, these antigens have the potential to elicit numerous activities, including immune response and adhesion. This study focuses on a hypothetical protein of Leptospira, encoded by the gene LIC11089, and its three derived fragments: the N-terminal, intermediate and C terminus regions. The gene coding for the full-length protein and fragments was cloned and expressed in Escherichia coli BL21(SI) strain by using the expression vector pAE. The recombinant protein and fragments tagged with hexahistidine at the N terminus were purified by metal affinity chromatography. The leptospiral full-length protein, named Lsa32 (leptospiral surface adhesin, 32 kDa), adheres to laminin, with the C terminus region being responsible for this interaction. Lsa32 binds to plasminogen in a dose-dependent fashion, generating plasmin when an activator is provided. Moreover, antibodies present in leptospirosis serum samples were able to recognize Lsa32. Lsa32 is most likely a new surface protein of Leptospira, as revealed by proteinase K susceptibility. Altogether, our data suggest that this multifaceted protein is expressed during infection and may play a role in host-L. interrogans interactions. © 2015 The Authors.

  3. Complete mitochondrial genome of a Asian lion (Panthera leo goojratensis).

    PubMed

    Li, Yu-Fei; Wang, Qiang; Zhao, Jian-ning

    2016-01-01

    The entire mitochondrial genome of this Asian lion (Panthera leo goojratensis) was 17,183 bp in length. Its gene composition and arrangement conformed to those of other lions, with the typical structure of 22 tRNAs, 2 rRNAs, 13 protein-coding genes and a non-coding region. The characteristics of the mitochondrial genome were analyzed in detail.

  4. Capacity, cutoff rate, and coding for a direct-detection optical channel

    NASA Technical Reports Server (NTRS)

    Massey, J. L.

    1980-01-01

    It is shown that Pierce's pulse position modulation scheme with 2 to the L pulse positions used on a self-noise-limited direct detection optical communication channel results in a 2 to the L-ary erasure channel that is equivalent to the parallel combination of L completely correlated binary erasure channels. The capacity of the full channel is the sum of the capacities of the component channels, but the cutoff rate of the full channel is shown to be much smaller than the sum of the cutoff rates. An interpretation of the cutoff rate is given that suggests a complexity advantage in coding separately on the component channels. It is shown that if short-constraint-length convolutional codes with Viterbi decoders are used on the component channels, then the performance and complexity compare favorably with the Reed-Solomon coding system proposed by McEliece for the full channel. The reasons for this unexpectedly fine performance by the convolutional code system are explored in detail, as are various facets of the channel structure.
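    The contrast between capacity and cutoff rate described above can be checked numerically. The sketch below is illustrative only: it assumes the standard textbook expressions for an M-ary erasure channel with uniform inputs and erasure probability eps (capacity (1 - eps) log2 M, cutoff rate log2 M - log2(1 + (M - 1) eps)), which are not quoted in the abstract, and the function names are hypothetical.

```python
import math


def erasure_capacity(num_inputs: int, eps: float) -> float:
    """Capacity in bits of an M-ary erasure channel (assumed textbook formula)."""
    return (1.0 - eps) * math.log2(num_inputs)


def erasure_cutoff_rate(num_inputs: int, eps: float) -> float:
    """Cutoff rate R0 in bits of an M-ary erasure channel with uniform inputs
    (assumed textbook formula)."""
    return math.log2(num_inputs) - math.log2(1.0 + (num_inputs - 1) * eps)


def compare_full_and_component(L: int, eps: float) -> dict:
    """Compare the 2**L-ary erasure channel with L binary erasure channels."""
    M = 2 ** L
    return {
        "capacity_full": erasure_capacity(M, eps),
        "capacity_sum": L * erasure_capacity(2, eps),
        "cutoff_full": erasure_cutoff_rate(M, eps),
        "cutoff_sum": L * erasure_cutoff_rate(2, eps),
    }


if __name__ == "__main__":
    # L = 8 and eps = 0.5 are arbitrary illustrative values.
    for name, value in compare_full_and_component(8, 0.5).items():
        print(f"{name}: {value:.4f}")
```

    Under these assumptions the two capacities coincide, while the cutoff rate of the full channel falls well below the sum of the component cutoff rates, which is the gap that motivates coding separately on the component channels.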

  5. Complete mitochondrial genome of the Yellow-spotted skate Okamejei hollandi (Rajiformes: Rajidae).

    PubMed

    Li, Weidong; Chen, Xiao; Liu, Wenai; Sun, Renjie; Zhou, Haolang

    2016-07-01

    The complete mitochondrial genome of the Yellow-spotted skate Okamejei hollandi was determined in this study. It is 16,974 bp in length and contains 13 protein-coding genes, two rRNA genes, 22 tRNA genes, and one putative control region. The overall base composition is 30.5% A, 27.8% C, 14.0% G, and 27.8% T. There are 28 bp short intergenic spaces located in 12 gene junctions and 31 bp overlaps located in nine gene junctions in the whole mitogenome. Two start codons (ATG and GTG) and two stop codons (TAG and TAA/T) were used in the protein-coding genes. The lengths of 22 tRNA genes range from 68 (tRNA-Ser2) to 75 (tRNA-Leu1) bp. The origin of L-strand replication (OL) sequence (37 bp) was identified between the tRNA-Asn and tRNA-Cys genes. The control region is 1311 bp in length with high A + T and poor G content.

  6. microRNA-122 target sites in the hepatitis C virus RNA NS5B coding region and 3' untranslated region: function in replication and influence of RNA secondary structure.

    PubMed

    Gerresheim, Gesche K; Dünnes, Nadia; Nieder-Röhrmann, Anika; Shalamova, Lyudmila A; Fricke, Markus; Hofacker, Ivo; Höner Zu Siederdissen, Christian; Marz, Manja; Niepmann, Michael

    2017-02-01

    We have analyzed the binding of the liver-specific microRNA-122 (miR-122) to three conserved target sites of hepatitis C virus (HCV) RNA, two in the non-structural protein 5B (NS5B) coding region and one in the 3' untranslated region (3'UTR). miR-122 binding efficiency strongly depends on target site accessibility under conditions when the range of flanking sequences available for the formation of local RNA secondary structures changes. Our results indicate that the particular sequence feature that contributes most to the correlation between target site accessibility and binding strength varies between different target sites. This suggests that the dynamics of miRNA/Ago2 binding not only depends on the target site itself but also on flanking sequence context to a considerable extent, in particular in a small viral genome in which strong selection constraints act on coding sequence and overlapping cis-signals and model the accessibility of cis-signals. In full-length genomes, single and combination mutations in the miR-122 target sites reveal that site 5B.2 is positively involved in regulating overall genome replication efficiency, whereas mutation of site 5B.3 showed a weaker effect. Mutation of the 3'UTR site and double or triple mutants showed no significant overall effect on genome replication, whereas in a translation reporter RNA, the 3'UTR target site inhibits translation directed by the HCV 5'UTR. Thus, the miR-122 target sites in the 3'-region of the HCV genome are involved in a complex interplay in regulating different steps of the HCV replication cycle.

  7. [Cloning and sequence analysis of full-length cDNA of secoisolariciresinol dehydrogenase of Dysosma versipellis].

    PubMed

    Xu, Li; Ding, Zhi-Shan; Zhou, Yun-Kai; Tao, Xue-Fen

    2009-06-01

    To obtain the full-length cDNA sequence of the Secoisolariciresinol Dehydrogenase gene from Dysosma versipellis by RACE PCR, and then to investigate the characteristics of the gene. The full-length cDNA sequence of the Secoisolariciresinol Dehydrogenase gene was obtained by 3'-RACE and 5'-RACE from Dysosma versipellis. This is the first report of the full cDNA sequence of Secoisolariciresinol Dehydrogenase in Dysosma versipellis. The acquired gene was 991 bp in full length, including a 5' untranslated region of 42 bp and a 3' untranslated region of 112 bp with poly(A). The open reading frame (ORF) encodes 278 amino acids, with a molecular weight of 29253.3 Daltons and an isoelectric point of 6.328. The GenBank nucleotide sequence accession number is EU573789. Semi-quantitative RT-PCR analysis revealed that the Secoisolariciresinol Dehydrogenase gene was highly expressed in stem. Alignment of the amino acid sequence of Secoisolariciresinol Dehydrogenase indicated there may be some significant amino acid sequence differences among different species. The full-length cDNA sequence of the Secoisolariciresinol Dehydrogenase gene was obtained from Dysosma versipellis.

  8. n-Nucleotide circular codes in graph theory.

    PubMed

    Fimmel, Elena; Michel, Christian J; Strüngmann, Lutz

    2016-03-13

    The circular code theory proposes that genes are constituted of two trinucleotide codes: the classical genetic code with 61 trinucleotides for coding the 20 amino acids (except the three stop codons {TAA,TAG,TGA}) and a circular code based on 20 trinucleotides for retrieving, maintaining and synchronizing the reading frame. It relies on two main results: the identification of a maximal C(3) self-complementary trinucleotide circular code X in genes of bacteria, eukaryotes, plasmids and viruses (Michel 2015 J. Theor. Biol. 380, 156-177. (doi:10.1016/j.jtbi.2015.04.009); Arquès & Michel 1996 J. Theor. Biol. 182, 45-58. (doi:10.1006/jtbi.1996.0142)) and the finding of X circular code motifs in tRNAs and rRNAs, in particular in the ribosome decoding centre (Michel 2012 Comput. Biol. Chem. 37, 24-37. (doi:10.1016/j.compbiolchem.2011.10.002); El Soufi & Michel 2014 Comput. Biol. Chem. 52, 9-17. (doi:10.1016/j.compbiolchem.2014.08.001)). The universally conserved nucleotides A1492 and A1493 and the conserved nucleotide G530 are included in X circular code motifs. Recently, dinucleotide circular codes were also investigated (Michel & Pirillo 2013 ISRN Biomath. 2013, 538631. (doi:10.1155/2013/538631); Fimmel et al. 2015 J. Theor. Biol. 386, 159-165. (doi:10.1016/j.jtbi.2015.08.034)). As the genetic motifs of different lengths are ubiquitous in genes and genomes, we introduce a new approach based on graph theory to study in full generality n-nucleotide circular codes X, i.e. of length 2 (dinucleotide), 3 (trinucleotide), 4 (tetranucleotide), etc. Indeed, we prove that an n-nucleotide code X is circular if and only if the corresponding graph [Formula: see text] is acyclic. Moreover, the maximal length of a path in [Formula: see text] corresponds to the window of nucleotides in a sequence for detecting the correct reading frame. Finally, the graph theory of tournaments is applied to the study of dinucleotide circular codes. It has full equivalence with the combinatorics theory (Michel & Pirillo 2013 ISRN Biomath. 2013, 538631. (doi:10.1155/2013/538631)) and the group theory (Fimmel et al. 2015 J. Theor. Biol. 386, 159-165. (doi:10.1016/j.jtbi.2015.08.034)) of dinucleotide circular codes, while its mathematical approach is simpler. © 2016 The Author(s).
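    The acyclicity criterion stated above lends itself to a short sketch. The abstract does not define the graph (the formula is elided), so the construction below is an assumption: every word of the code is split at each internal position, and a directed edge is drawn from the prefix to the suffix. All function names are hypothetical.

```python
def code_graph(code):
    """Build the prefix -> suffix graph of an n-nucleotide code.

    Assumed construction: for each word and each internal split position,
    add a directed edge from the prefix to the suffix.
    """
    edges = {}
    for word in code:
        for i in range(1, len(word)):
            prefix, suffix = word[:i], word[i:]
            edges.setdefault(prefix, set()).add(suffix)
            edges.setdefault(suffix, set())
    return edges


def is_acyclic(edges):
    """Depth-first search with three colours; a grey-to-grey edge is a cycle."""
    WHITE, GREY, BLACK = 0, 1, 2
    state = {vertex: WHITE for vertex in edges}

    def visit(vertex):
        state[vertex] = GREY
        for successor in edges[vertex]:
            if state[successor] == GREY:
                return False
            if state[successor] == WHITE and not visit(successor):
                return False
        state[vertex] = BLACK
        return True

    return all(state[v] != WHITE or visit(v) for v in edges)


def is_circular(code):
    """Apply the criterion: the code is circular iff its graph is acyclic."""
    return is_acyclic(code_graph(code))


if __name__ == "__main__":
    print(is_circular({"AC"}))          # single dinucleotide
    print(is_circular({"AC", "CA"}))    # a word together with its circular shift
    print(is_circular({"ACG", "CGA"}))  # trinucleotide analogue
```

    A code containing a word together with one of its circular shifts, such as {AC, CA}, produces a cycle in this graph and is therefore rejected, which matches the intuition that the reading frame of ACAC... cannot be recovered.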

  9. Complete nucleotide sequences of the coat protein messenger RNAs of brome mosaic virus and cowpea chlorotic mottle virus.

    PubMed Central

    Dasgupta, R; Kaesberg, P

    1982-01-01

    The nucleotide sequences of the subgenomic coat protein messengers (RNA4's) of two related bromoviruses, brome mosaic virus (BMV) and cowpea chlorotic mottle virus (CCMV), have been determined by direct RNA and cDNA sequencing without cloning. BMV RNA4 is 876 b long, including a 5' noncoding region of nine nucleotides and a 3' noncoding region of 300 nucleotides. CCMV RNA4 is 824 b long, including a 5' noncoding region of 10 nucleotides and a 3' noncoding region of 244 nucleotides. The encoded coat proteins are similar in length (188 amino acids for BMV and 189 amino acids for CCMV) and display about 70% homology in their amino acid sequences. The length difference between the two RNAs is due mostly to a single deletion, in CCMV with respect to BMV, of about 57 b immediately following the coding region. Allowing for this deletion the RNAs are indicate that mutations leading to divergence were constrained in the coding region primarily by the requirement of maintaining a favorable coat protein structure and in the 3' noncoding region primarily by the requirement of maintaining a favorable RNA spatial configuration. PMID:6895941
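    The lengths reported for the two RNA4 molecules can be cross-checked against the stated coat protein sizes. The sketch below assumes that the coding region consists of one codon per amino acid plus a single stop codon; that assumption and the helper names are illustrative and do not come from the abstract.

```python
def coding_region_length(total_length: int, utr5: int, utr3: int) -> int:
    """Bases left for the coding region after removing both noncoding regions."""
    return total_length - utr5 - utr3


def expected_coding_length(num_amino_acids: int) -> int:
    """Three bases per amino acid, plus one stop codon (assumption)."""
    return 3 * (num_amino_acids + 1)


# Values quoted in the abstract: (total, 5' noncoding, 3' noncoding, amino acids)
RNA4 = {
    "BMV": (876, 9, 300, 188),
    "CCMV": (824, 10, 244, 189),
}

if __name__ == "__main__":
    for virus, (total, utr5, utr3, residues) in RNA4.items():
        observed = coding_region_length(total, utr5, utr3)
        expected = expected_coding_length(residues)
        print(virus, observed, expected, observed == expected)
```

    For both viruses the two numbers agree, so the quoted lengths are internally consistent under the stop-codon assumption.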

  10. Flow adjustment inside homogeneous canopies after a leading edge – An analytical approach backed by LES

    DOE PAGES

    Kroniger, Konstantin; Banerjee, Tirtha; De Roo, Frederik; ...

    2017-10-06

    A two-dimensional analytical model for describing the mean flow behavior inside a vegetation canopy after a leading edge in neutral conditions was developed and tested by means of large eddy simulations (LES) employing the LES code PALM. The analytical model is developed for the region directly after the canopy edge, the adjustment region, where one-dimensional canopy models fail due to the sharp change in roughness. The derivation of this adjustment region model is based on an analytic solution of the two-dimensional Reynolds averaged Navier–Stokes equation in neutral conditions for a canopy with constant plant area density (PAD). The main assumptions for solving the governing equations are separability of the velocity components concerning the spatial variables and the neglect of the Reynolds stress gradients. These two assumptions are verified by means of LES. To determine the emerging model parameters, a simultaneous fitting scheme was applied to the velocity and pressure data of a reference LES simulation. Furthermore, a sensitivity analysis of the adjustment region model, equipped with the previously calculated parameters, was performed by varying the three relevant lengths, the canopy height (h), the canopy length and the adjustment length (Lc), in additional LES. Even if the model parameters are, in general, functions of h/Lc, it was found that the model is capable of predicting the flow quantities in various cases when using constant parameters. Subsequently the adjustment region model is combined with the one-dimensional model of Massman, which is applicable for the interior of the canopy, to attain an analytical model capable of describing the mean flow for the full canopy domain. As a result, the model is tested against an analytical model based on a linearization approach.

  11. Flow adjustment inside homogeneous canopies after a leading edge – An analytical approach backed by LES

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Kroniger, Konstantin; Banerjee, Tirtha; De Roo, Frederik

    A two-dimensional analytical model for describing the mean flow behavior inside a vegetation canopy after a leading edge in neutral conditions was developed and tested by means of large eddy simulations (LES) employing the LES code PALM. The analytical model is developed for the region directly after the canopy edge, the adjustment region, where one-dimensional canopy models fail due to the sharp change in roughness. The derivation of this adjustment region model is based on an analytic solution of the two-dimensional Reynolds averaged Navier–Stokes equation in neutral conditions for a canopy with constant plant area density (PAD). The main assumptions for solving the governing equations are separability of the velocity components concerning the spatial variables and the neglect of the Reynolds stress gradients. These two assumptions are verified by means of LES. To determine the emerging model parameters, a simultaneous fitting scheme was applied to the velocity and pressure data of a reference LES simulation. Furthermore, a sensitivity analysis of the adjustment region model, equipped with the previously calculated parameters, was performed by varying the three relevant lengths, the canopy height (h), the canopy length and the adjustment length (Lc), in additional LES. Even if the model parameters are, in general, functions of h/Lc, it was found that the model is capable of predicting the flow quantities in various cases when using constant parameters. Subsequently the adjustment region model is combined with the one-dimensional model of Massman, which is applicable for the interior of the canopy, to attain an analytical model capable of describing the mean flow for the full canopy domain. As a result, the model is tested against an analytical model based on a linearization approach.

  12. Hermes Transposon Distribution and Structure in Musca domestica

    PubMed Central

    Subramanian, Ramanand A.; Cathcart, Laura A.; Krafsur, Elliot S.; Atkinson, Peter W.

    2009-01-01

    Hermes are hAT transposons from Musca domestica that are very closely related to the hobo transposons from Drosophila melanogaster and are useful as gene vectors in a wide variety of organisms including insects, planaria, and yeast. hobo elements show distinct length variations in a rapidly evolving region of the transposase-coding region as a result of expansions and contractions of a simple repeat sequence encoding 3 amino acids threonine, proline, and glutamic acid (TPE). These variations in length may influence the function of the protein and the movement of hobo transposons in natural populations. Here, we determine the distribution of Hermes in populations of M. domestica as well as whether Hermes transposase has undergone similar sequence expansions and contractions during its evolution in this species. Hermes transposons were found in all M. domestica individuals sampled from 14 populations collected from 4 continents. All individuals with Hermes transposons had evidence for the presence of intact transposase open reading frames, and little sequence variation was observed among Hermes elements. A systematic analysis of the TPE-homologous region of the Hermes transposase-coding region revealed no evidence for length variation. The simple sequence repeat found in hobo elements is a feature of this transposon that evolved since the divergence of hobo and Hermes. PMID:19366812

  13. Genetic heterogeneity of the dnaK gene locus including transcription terminator region (TTR) in Campylobacter lari.

    PubMed

    Shitara, M; Tsuboi, Y; Sekizuka, T; Tazumi, A; Moorei, J E; Millar, B C; Taneike, I; Matsuda, M

    2008-01-01

    Nucleotide sequences of approximately 3.1 kbp consisting of the full-length open reading frame (ORF) for grpE, a non-coding (NC) region and a putative ORF for the full-length dnaK gene (1860 bp) were identified from a urease-positive thermophilic Campylobacter (UPTC) CF89-12 isolate. Then, following the construction of a new degenerate polymerase chain reaction (PCR) primer pair for amplification of the dnaK structural gene, including the transcription terminator region of C. lari isolates, the dnaK region was amplified successfully, TA-cloned and sequenced in nine C. lari isolates. The dnaK gene sequences commenced with an ATG and terminated with a TAA in all 10 isolates, including CF89-12. In addition, the putative ORFs for the dnaK gene locus from seven UPTC isolates consisted of 1860 bases, and those from the four urease-negative (UN) C. lari isolates, including the C. lari RM2100 reference strain, of 1866. Interestingly, different probable ribosome binding sites and hypothetically intrinsic ρ-independent terminator structures were identified between the seven UPTC and four UN C. lari isolates, respectively. Moreover, it is interesting to note that 20 out of a total of 28 polymorphic sites among the amino acid sequences of the dnaK ORF from the 11 C. lari isolates were identified as either UPTC-specific or UN C. lari-specific. In the neighbour-joining tree based on the nucleotide sequence information of the dnaK gene, C. lari forms two major distinct clusters consisting of UPTC and UN C. lari isolates, respectively, with UN C. lari being more closely related to other thermophilic campylobacters than to UPTC.

  14. Complete mitochondrial genome sequence of the heart failure model of cardiomyopathic Syrian hamster (Mesocricetus auratus).

    PubMed

    Hu, Bo; Liu, Dong-Xing; Zhang, Yu-Qing; Song, Jian-Tao; Ji, Xian-Fei; Hou, Zhi-Qiang; Zhang, Zhen-Hai

    2016-05-01

    In this study we sequenced the complete mitochondrial genome of a heart failure model of cardiomyopathic Syrian hamster (Mesocricetus auratus) for the first time. The total length of the mitogenome was 16,267 bp. It harbored 13 protein-coding genes, 2 ribosomal RNA genes, 22 transfer RNA genes and 1 non-coding control region.

  15. Complete mitochondrial genome of Xingguo red carp (Cyprinus carpio var. singuonensis) and purse red carp (Cyprinus carpio var. wuyuanensis).

    PubMed

    Hu, Guang-Fu; Liu, Xiang-Jiang; Li, Zhong; Liang, Hong-Wei; Hu, Shao-Na; Zou, Gui-Wei

    2016-01-01

    The complete mitochondrial genomes of Xingguo red carp (Cyprinus carpio var. singuonensis) and purse red carp (Cyprinus carpio var. wuyuanensis) were sequenced. Comparison of these two mitochondrial genomes revealed that the mtDNAs of these two common carp varieties were remarkably similar in genome length, gene order and content, and AT content. However, the two mitochondrial genomes presented here showed 39 site differences over their full length. Of these, 2 were located in rRNAs, 3 in tRNAs, 3 in the control region, and 31 in protein-coding genes. The thirty-one variable bases in the protein-coding regions of the two varieties' mitochondrial sequences led to three variable amino acids, which were mainly located in the proteins ND5 and ND4.

  16. Complete mitochondrial genome of the whiter-spotted flower chafer, Protaetia brevitarsis (Coleoptera: Scarabaeidae).

    PubMed

    Kim, Min Jee; Im, Hyun Hwak; Lee, Kwang Youll; Han, Yeon Soo; Kim, Iksoo

    2014-06-01

    The complete nucleotide sequence of the mitochondrial genome from the whiter-spotted flower chafer, Protaetia brevitarsis (Coleoptera: Scarabaeidae), was determined. The 20,319-bp long circular genome is the longest among completely sequenced Coleoptera. As is typical in animals, the P. brevitarsis genome consisted of two ribosomal RNAs, 22 transfer RNAs, 13 protein-coding genes and one A + T-rich region. Although the size of the coding genes was typical, the non-coding A + T-rich region was 5654 bp, which is the longest in insects. The extraordinary length of this region was composed of 28 117-bp tandem repeats and seven 82-bp tandem repeats. These repeat sequences were encompassed by three non-repeat sequences constituting 1804 bp.
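    Taking the repeat arrays to be 28 copies of a 117-bp unit and seven copies of an 82-bp unit, the component lengths can be summed and compared with the 5654-bp A + T-rich region. The short check below is only a sketch of that arithmetic; the variable and function names are illustrative.

```python
# (copy number, unit length in bp) for each tandem repeat array
TANDEM_REPEATS = [(28, 117), (7, 82)]
NON_REPEAT_BP = 1804       # three non-repeat sequences, combined
REPORTED_REGION_BP = 5654  # reported length of the A + T-rich region


def region_length(repeats, non_repeat_bp):
    """Sum of all repeat copies plus the non-repeat sequence."""
    return sum(copies * unit for copies, unit in repeats) + non_repeat_bp


if __name__ == "__main__":
    total = region_length(TANDEM_REPEATS, NON_REPEAT_BP)
    print(total, total == REPORTED_REGION_BP)
```

    With this reading the parts add up exactly to the reported region length.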

  17. Hypervariability of ribosomal DNA at multiple chromosomal sites in lake trout (Salvelinus namaycush).

    PubMed

    Zhuo, L; Reed, K M; Phillips, R B

    1995-06-01

    Variation in the intergenic spacer (IGS) of the ribosomal DNA (rDNA) of lake trout (Salvelinus namaycush) was examined. Digestion of genomic DNA with restriction enzymes showed that almost every individual had a unique combination of length variants with most of this variation occurring within rather than between populations. Sequence analysis of a 2.3 kilobase (kb) EcoRI-DraI fragment spanning the 3' end of the 28S coding region and approximately 1.8 kb of the IGS revealed two blocks of repetitive DNA. Putative transcriptional termination sites were found approximately 220 bases (b) downstream from the end of the 28S coding region. Comparison of the 2.3-kb fragments with two longer (3.1 kb) fragments showed that the major difference in length resulted from variation in the number of short (89 b) repeats located 3' to the putative terminator. Repeat units within a single nucleolus organizer region (NOR) appeared relatively homogeneous and genetic analysis found variants to be stably inherited. A comparison of the number of spacer-length variants with the number of NORs found that the number of length variants per individual was always less than the number of NORs. Examination of spacer variants in five populations showed that populations with more NORs had more spacer variants, indicating that variants are present at different rDNA sites on nonhomologous chromosomes.

  18. Chimeric NP Non Coding Regions between Type A and C Influenza Viruses Reveal Their Role in Translation Regulation

    PubMed Central

    Crescenzo-Chaigne, Bernadette; Barbezange, Cyril; Frigard, Vianney; Poulain, Damien; van der Werf, Sylvie

    2014-01-01

    Exchange of the non coding regions of the NP segment between type A and C influenza viruses was used to demonstrate the importance not only of the proximal panhandle, but also of the initial distal panhandle strength in type specificity. Both elements were found to be compulsory to rescue infectious virus by reverse genetics systems. Interestingly, in type A influenza virus infectious context, the length of the NP segment 5′ NC region once transcribed into mRNA was found to impact its translation, and the level of produced NP protein consequently affected the level of viral genome replication. PMID:25268971

  19. TCOF1 gene encodes a putative nucleolar phosphoprotein that exhibits mutations in Treacher Collins Syndrome throughout its coding region.

    PubMed

    Wise, C A; Chiang, L C; Paznekas, W A; Sharma, M; Musy, M M; Ashley, J A; Lovett, M; Jabs, E W

    1997-04-01

    Treacher Collins Syndrome (TCS) is the most common of the human mandibulofacial dysostosis disorders. Recently, a partial TCOF1 cDNA was identified and shown to contain mutations in TCS families. Here we present the entire exon/intron genomic structure and the complete coding sequence of TCOF1. TCOF1 encodes a low complexity protein of 1,411 amino acids, whose predicted protein structure reveals repeated motifs that mirror the organization of its exons. These motifs are shared with nucleolar trafficking proteins in other species and are predicted to be highly phosphorylated by casein kinase. Consistent with this, the full-length TCOF1 protein sequence also contains putative nuclear and nucleolar localization signals. Throughout the open reading frame, we detected an additional eight mutations in TCS families and several polymorphisms. We postulate that TCS results from defects in a nucleolar trafficking protein that is critically required during human craniofacial development.

  20. A role of carboxy-terminal region of Toxoplasma gondii-heat shock protein 70 in enhancement of T. gondii infection in mice

    PubMed Central

    Mun, Hye-Seong; Norose, Kazumi; Aosai, Fumie; Chen, Mei

    2000-01-01

    We investigated the role of recombinant Toxoplasma gondii heat shock protein (rT.g.HSP) 70-full length, rT.g.HSP70-NH2-terminal region, or rT.g.HSP70-carboxy-terminal region in prophylactic immunity in C57BL/6 mice perorally infected with Fukaya cysts of T. gondii. At 3, 4, 5, and 6 weeks after infection, the number of T. gondii in the brain tissue of each mouse was measured by quantitative competitive-polymerase chain reaction (QC-PCR) targeting the surface antigen (SAG) 1 gene. Immunization with rT.g.HSP70-full length or rT.g.HSP70-carboxy-terminal region increased the number of T. gondii in the brain tissue after T. gondii infection, whereas immunization with rT.g.HSP70-NH2-terminal region did not. These results suggest that T.g.HSP70-carboxy-terminal region as well as T.g.HSP70-full length may induce deleterious effects on the protective immunity of mice infected with a cyst-forming T. gondii strain, Fukaya. PMID:10905074

  1. Targeting a Complex Transcriptome: The Construction of the Mouse Full-Length cDNA Encyclopedia

    PubMed Central

    Carninci, Piero; Waki, Kazunori; Shiraki, Toshiyuki; Konno, Hideaki; Shibata, Kazuhiro; Itoh, Masayoshi; Aizawa, Katsunori; Arakawa, Takahiro; Ishii, Yoshiyuki; Sasaki, Daisuke; Bono, Hidemasa; Kondo, Shinji; Sugahara, Yuichi; Saito, Rintaro; Osato, Naoki; Fukuda, Shiro; Sato, Kenjiro; Watahiki, Akira; Hirozane-Kishikawa, Tomoko; Nakamura, Mari; Shibata, Yuko; Yasunishi, Ayako; Kikuchi, Noriko; Yoshiki, Atsushi; Kusakabe, Moriaki; Gustincich, Stefano; Beisel, Kirk; Pavan, William; Aidinis, Vassilis; Nakagawara, Akira; Held, William A.; Iwata, Hiroo; Kono, Tomohiro; Nakauchi, Hiromitsu; Lyons, Paul; Wells, Christine; Hume, David A.; Fagiolini, Michela; Hensch, Takao K.; Brinkmeier, Michelle; Camper, Sally; Hirota, Junji; Mombaerts, Peter; Muramatsu, Masami; Okazaki, Yasushi; Kawai, Jun; Hayashizaki, Yoshihide

    2003-01-01

    We report the construction of the mouse full-length cDNA encyclopedia, the most extensive view of a complex transcriptome, on the basis of preparing and sequencing 246 libraries. Before cloning, cDNAs were enriched in full-length by Cap-Trapper, and in most cases, aggressively subtracted/normalized. We have produced 1,442,236 successful 3′-end sequences clustered into 171,144 groups, from which 60,770 clones were fully sequenced cDNAs annotated in the FANTOM-2 annotation. We have also produced 547,149 5′ end reads, which clustered into 124,258 groups. Altogether, these cDNAs were further grouped in 70,000 transcriptional units (TU), which represent the best coverage of a transcriptome so far. By monitoring the extent of normalization/subtraction, we define the tentative equivalent coverage (TEC), which was estimated to be equivalent to >12,000,000 ESTs derived from standard libraries. High coverage explains discrepancies between the very large numbers of clusters (and TUs) of this project, which also include non-protein-coding RNAs, and the lower gene number estimation of genome annotations. Altogether, 5′-end clusters identify regions that are potential promoters for 8637 known genes and 5′-end clusters suggest the presence of almost 63,000 transcriptional starting points. An estimate of the frequency of polyadenylation signals suggests that at least half of the singletons in the EST set represent real mRNAs. Clones accounting for about half of the predicted TUs await further sequencing. The continued high-discovery rate suggests that the task of transcriptome discovery is not yet complete. PMID:12819125

  2. Internally deleted WNV genomes isolated from exotic birds in New Mexico: function in cells, mosquitoes, and mice.

    PubMed

    Pesko, Kendra N; Fitzpatrick, Kelly A; Ryan, Elizabeth M; Shi, Pei-Yong; Zhang, Bo; Lennon, Niall J; Newman, Ruchi M; Henn, Matthew R; Ebel, Gregory D

    2012-05-25

    Most RNA viruses exist in their hosts as a heterogeneous population of related variants. Due to error prone replication, mutants are constantly generated which may differ in individual fitness from the population as a whole. Here we characterize three WNV isolates that contain, along with full-length genomes, mutants with large internal deletions to structural and nonstructural protein-coding regions. The isolates were all obtained from lorikeets that died from WNV at the Rio Grande Zoo in Albuquerque, NM between 2005 and 2007. The deletions are approximately 2kb, in frame, and result in the elimination of the complete envelope, and portions of the prM and NS-1 proteins. In Vero cell culture, these internally deleted WNV genomes function as defective interfering particles, reducing the production of full-length virus when introduced at high multiplicities of infection. In mosquitoes, the shortened WNV genomes reduced infection and dissemination rates, and virus titers overall, and were not detected in legs or salivary secretions at 14 or 21 days post-infection. In mice, inoculation with internally deleted genomes did not attenuate pathogenesis relative to full-length or infectious clone derived virus, and shortened genomes were not detected in mice at the time of death. These observations provide evidence that large deletions may occur within flavivirus populations more frequently than has generally been appreciated and suggest that they impact population phenotype minimally. Additionally, our findings suggest that highly similar mutants may frequently occur in particular vertebrate hosts. Copyright © 2012 Elsevier Inc. All rights reserved.

  3. Arc Length Coding by Interference of Theta Frequency Oscillations May Underlie Context-Dependent Hippocampal Unit Data and Episodic Memory Function

    ERIC Educational Resources Information Center

    Hasselmo, Michael E.

    2007-01-01

    Many memory models focus on encoding of sequences by excitatory recurrent synapses in region CA3 of the hippocampus. However, data and modeling suggest an alternate mechanism for encoding of sequences in which interference between theta frequency oscillations encodes the position within a sequence based on spatial arc length or time. Arc length…

  4. Identification of BSAP (Pax-5) target genes in early B-cell development by loss- and gain-of-function experiments.

    PubMed Central

    Nutt, S L; Morrison, A M; Dörfler, P; Rolink, A; Busslinger, M

    1998-01-01

    The Pax-5 gene codes for the transcription factor BSAP which is essential for the progression of adult B lymphopoiesis beyond an early progenitor (pre-BI) cell stage. Although several genes have been proposed to be regulated by BSAP, CD19 is to date the only target gene which has been genetically confirmed to depend on this transcription factor for its expression. We have now taken advantage of cultured pre-BI cells of wild-type and Pax-5 mutant bone marrow to screen a large panel of B lymphoid genes for additional BSAP target genes. Four differentially expressed genes were shown to be under the direct control of BSAP, as their expression was rapidly regulated in Pax-5-deficient pre-BI cells by a hormone-inducible BSAP-estrogen receptor fusion protein. The genes coding for the B-cell receptor component Ig-alpha (mb-1) and the transcription factors N-myc and LEF-1 are positively regulated by BSAP, while the gene coding for the cell surface protein PD-1 is efficiently repressed. Distinct regulatory mechanisms of BSAP were revealed by reconstituting Pax-5-deficient pre-BI cells with full-length BSAP or a truncated form containing only the paired domain. IL-7 signalling was able to efficiently induce the N-myc gene only in the presence of full-length BSAP, while complete restoration of CD19 synthesis was critically dependent on the BSAP protein concentration. In contrast, the expression of the mb-1 and LEF-1 genes was already reconstituted by the paired domain polypeptide lacking any transactivation function, suggesting that the DNA-binding domain of BSAP is sufficient to recruit other transcription factors to the regulatory regions of these two genes. In conclusion, these loss- and gain-of-function experiments demonstrate that BSAP regulates four newly identified target genes as a transcriptional activator, repressor or docking protein depending on the specific regulatory sequence context. PMID:9545244

  5. Primary structure of prostaglandin G/H synthase from sheep vesicular gland determined from the complementary DNA sequence.

    PubMed Central

    DeWitt, D L; Smith, W L

    1988-01-01

    Prostaglandin G/H synthase (8,11,14-icosatrienoate, hydrogen-donor:oxygen oxidoreductase, EC 1.14.99.1) catalyzes the first step in the formation of prostaglandins and thromboxanes, the conversion of arachidonic acid to prostaglandin endoperoxides G and H. This enzyme is the site of action of nonsteroidal anti-inflammatory drugs. We have isolated a 2.7-kilobase complementary DNA (cDNA) encompassing the entire coding region of prostaglandin G/H synthase from sheep vesicular glands. This cDNA, cloned from a lambda gt 10 library prepared from poly(A)+ RNA of vesicular glands, hybridizes with a single 2.75-kilobase mRNA species. The cDNA clone was selected using oligonucleotide probes modeled from amino acid sequences of tryptic peptides prepared from the purified enzyme. The full-length cDNA encodes a protein of 600 amino acids, including a signal sequence of 24 amino acids. Identification of the cDNA as coding for prostaglandin G/H synthase is based on comparison of amino acid sequences of seven peptides comprising 103 amino acids with the amino acid sequence deduced from the nucleotide sequence of the cDNA. The molecular weight of the unglycosylated enzyme lacking the signal peptide is 65,621. The synthase is a glycoprotein, and there are three potential sites for N-glycosylation, two of them in the amino-terminal half of the molecule. The serine reported to be acetylated by aspirin is at position 530, near the carboxyl terminus. There is no significant similarity between the sequence of the synthase and that of any other protein in amino acid or nucleotide sequence libraries, and a heme binding site(s) is not apparent from the amino acid sequence. The availability of a full-length cDNA clone coding for prostaglandin G/H synthase should facilitate studies of the regulation of expression of this enzyme and the structural features important for catalysis and for interaction with anti-inflammatory drugs. PMID:3125548

  6. Mitochondrial genomes of parasitic flatworms.

    PubMed

    Le, Thanh H; Blair, David; McManus, Donald P

    2002-05-01

    Complete or near-complete mitochondrial genomes are now available for 11 species or strains of parasitic flatworms belonging to the Trematoda and the Cestoda. The organization of these genomes is not strikingly different from those of other eumetazoans, although one gene (atp8) commonly found in other phyla is absent from flatworms. The gene order in most flatworms has similarities to those seen in higher protostomes such as annelids. However, the gene order has been drastically altered in Schistosoma mansoni, which obscures this possible relationship. Among the sequenced taxa, base composition varies considerably, creating potential difficulties for phylogeny reconstruction. Long non-coding regions are present in all taxa, but these vary in length from only a few hundred to approximately 10000 nucleotides. Among Schistosoma spp., the long non-coding regions are rich in repeats and length variation among individuals is known. Data from mitochondrial genomes are valuable for studies on species identification, phylogenies and biogeography.

  7. Run-length encoding graphic rules, biochemically editable designs and steganographical numeric data embedment for DNA-based cryptographical coding system.

    PubMed

    Kawano, Tomonori

    2013-03-01

    There have been a wide variety of approaches for handling the pieces of DNA as the "unplugged" tools for digital information storage and processing, including a series of studies applied to the security-related area, such as DNA-based digital barcodes, water marks and cryptography. In the present article, novel designs of artificial genes as the media for storing the digitally compressed data for images are proposed for bio-computing purpose while natural genes principally encode for proteins. Furthermore, the proposed system allows cryptographical application of DNA through biochemically editable designs with capacity for steganographical numeric data embedment. As a model case of image-coding DNA technique application, numerically and biochemically combined protocols are employed for ciphering the given "passwords" and/or secret numbers using DNA sequences. The "passwords" of interest were decomposed into single letters and translated into the font image coded on the separate DNA chains with both the coding regions in which the images are encoded based on the novel run-length encoding rule, and the non-coding regions designed for biochemical editing and the remodeling processes revealing the hidden orientation of letters composing the original "passwords." The latter processes require the molecular biological tools for digestion and ligation of the fragmented DNA molecules targeting at the polymerase chain reaction-engineered termini of the chains. Lastly, additional protocols for steganographical overwriting of the numeric data of interests over the image-coding DNA are also discussed.

  8. Crystallization and preliminary X-ray crystallographic analysis of carboxyl-terminal region 4 of SigR from Streptomyces coelicolor A3(2)

    PubMed Central

    Kim, Keon Young; Kim, Sunmin; Park, Jeong Kuk; Song, HyoJin; Park, SangYoun

    2014-01-01

    Full-length SigR from Streptomyces coelicolor A3(2) was overexpressed in Escherichia coli, purified and submitted to crystallization trials using either polyethylene glycol 3350 or 4000 as a precipitant. X-ray diffraction data were collected to 2.60 Å resolution under cryoconditions using synchrotron X-rays. The crystal packs in space group P4₃2₁2, with unit-cell parameters a = b = 42.14, c = 102.02 Å. According to the Matthews coefficient, the crystal asymmetric unit cannot contain the full-length protein. Molecular replacement with the known structures of region 2 and region 4 as independent search models indicates that the crystal contains only the −35 element-binding carboxyl-terminal region 4 of full-length SigR. Mass-spectrometric analysis of the harvested crystal confirms this, suggesting a crystal volume per protein weight (V_M) of 2.24 Å³ Da⁻¹ and 45.1% solvent content. PMID:24915084

  9. Full-length genome sequences of porcine epidemic diarrhoea virus strain CV777; Use of NGS to analyse genomic and sub-genomic RNAs

    PubMed Central

    Rasmussen, Thomas Bruun; Boniotti, Maria Beatrice; Papetti, Alice; Grasland, Béatrice; Frossard, Jean-Pierre; Dastjerdi, Akbar; Hulst, Marcel; Hanke, Dennis; Pohlmann, Anne; Blome, Sandra; van der Poel, Wim H. M.; Steinbach, Falko; Blanchard, Yannick; Lavazza, Antonio; Bøtner, Anette

    2018-01-01

    Porcine epidemic diarrhoea virus, strain CV777, was initially characterized in 1978 as the causative agent of a disease first identified in the UK in 1971. This coronavirus has been widely distributed among laboratories and has been passaged both within pigs and in cell culture. To determine the variability between different stocks of the PEDV strain CV777, sequencing of the full-length genome (ca. 28 kb) has been performed in 6 different laboratories, using different protocols. Not surprisingly, each of the different full genome sequences was distinct from the others and from the reference sequence (Accession number AF353511), but they are >99% identical. Unique and shared differences between sequences were identified. The coding region for the surface-exposed spike protein showed the highest proportion of variability including both point mutations and small deletions. The predicted expression of the ORF3 gene product was more dramatically affected in three different variants of this virus through either loss of the initiation codon or gain of a premature termination codon. The genome of one isolate had a substantially rearranged 5′-terminal sequence. This rearrangement was validated through the analysis of sub-genomic mRNAs from infected cells. It is clearly important to know the features of the specific sample of CV777 being used for experimental studies. PMID:29494671

  10. A specific indel marker for the Philippines Schistosoma japonicum revealed by analysis of mitochondrial genome sequences.

    PubMed

    Li, Juan; Chen, Fen; Sugiyama, Hiromu; Blair, David; Lin, Rui-Qing; Zhu, Xing-Quan

    2015-07-01

    In the present study, near-complete mitochondrial (mt) genome sequences for Schistosoma japonicum from different regions in the Philippines and Japan were amplified and sequenced. Comparisons among S. japonicum from the Philippines, Japan, and China revealed a geographically based length difference in mt genomes, but the mt genomic organization and gene arrangement were the same. Sequence differences among samples from the Philippines and all samples from the three endemic areas were 0.57-2.12 and 0.76-3.85 %, respectively. The most variable part of the mt genome was the non-coding region. In the coding portion of the genome, protein-coding genes varied more than rRNA genes and tRNAs. The near-complete mt genome sequences for Philippine specimens were identical in length (14,091 bp) which was 4 bp longer than those of S. japonicum samples from Japan and China. This indel provides a unique genetic marker for S. japonicum samples from the Philippines. Phylogenetic analyses based on the concatenated amino acids of 12 protein-coding genes showed that samples of S. japonicum clustered according to their geographical origins. The identified mitochondrial indel marker will be useful for tracing the source of S. japonicum infection in humans and animals in Southeast Asia.

  11. The complete mitochondrial genome of Pholis nebulosus (Perciformes: Pholidae).

    PubMed

    Wang, Zhongquan; Qin, Kaili; Liu, Jingxi; Song, Na; Han, Zhiqiang; Gao, Tianxiang

    2016-11-01

    In this study, the complete mitochondrial genome (mitogenome) sequence of Pholis nebulosus has been determined by long polymerase chain reaction and primer-walking methods. The mitogenome is a circular molecule of 16 524 bp in length, including the typical structure of 13 protein-coding genes, 2 ribosomal RNA genes, 22 transfer RNA genes and 2 non-coding regions (L-strand replication origin and control region), the gene contents of which are identical to those observed in most bony fishes. Within the control region, we identified the termination-associated sequence domain (TAS), and the conserved sequence block domain (CSB-F, CSB-E, CSB-D, CSB-C, CSB-B, CSB-A, CSB-1, CSB-2, CSB-3).

  12. Complete mitochondrial genome of the Tyto longimembris (Strigiformes: Tytonidae).

    PubMed

    Xu, Peng; Li, Yankuo; Miao, Lujun; Xie, Guangyong; Huang, Yan

    2016-07-01

    The complete mitochondrial genome of Tyto longimembris has been determined in this study. It is 18,466 bp in length and consists of 13 protein-coding genes, 22 transfer RNA (tRNA) genes, 2 ribosomal RNA (rRNA) genes and a non-coding control region (D-loop). The overall base composition of the heavy strand of the T. longimembris mitochondrial genome is A: 30.1%, T: 23.5%, C: 31.8% and G: 14.6%. The control region is characterized by tandem repeats, as two clearly separated clusters of tandem repeats were found. This study provides an important data set for phylogenetic and taxonomic analyses of Tyto species.

  13. Systematic analysis of coding and noncoding DNA sequences using methods of statistical linguistics

    NASA Technical Reports Server (NTRS)

    Mantegna, R. N.; Buldyrev, S. V.; Goldberger, A. L.; Havlin, S.; Peng, C. K.; Simons, M.; Stanley, H. E.

    1995-01-01

    We compare the statistical properties of coding and noncoding regions in eukaryotic and viral DNA sequences by adapting two tests developed for the analysis of natural languages and symbolic sequences. The data set comprises all 30 sequences of length above 50 000 base pairs in GenBank Release No. 81.0, as well as the recently published sequences of C. elegans chromosome III (2.2 Mbp) and yeast chromosome XI (661 Kbp). We find that for the three chromosomes we studied the statistical properties of noncoding regions appear to be closer to those observed in natural languages than those of coding regions. In particular, (i) an n-tuple Zipf analysis of noncoding regions reveals a regime close to power-law behavior while the coding regions show logarithmic behavior over a wide interval, and (ii) an n-gram entropy measurement shows that the noncoding regions have a lower n-gram entropy (and hence a larger "n-gram redundancy") than the coding regions. In contrast to the three chromosomes, we find that for vertebrates such as primates and rodents and for viral DNA, the difference between the statistical properties of coding and noncoding regions is not pronounced and therefore the results of the analyses of the investigated sequences are less conclusive. After noting the intrinsic limitations of the n-gram redundancy analysis, we also briefly discuss the failure of the zeroth- and first-order Markovian models or simple nucleotide repeats to account fully for these "linguistic" features of DNA. Finally, we emphasize that our results by no means prove the existence of a "language" in noncoding DNA.
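
    To illustrate the n-gram entropy measurement described in this abstract, the following minimal Python sketch computes the Shannon entropy of the overlapping n-grams in a sequence. The function names, the use of overlapping windows, and the redundancy normalization (1 - H_n / H_max for a four-letter alphabet) are assumptions made for illustration; the abstract does not give the authors' exact definitions.

```python
from collections import Counter
from math import log2


def ngram_entropy(seq: str, n: int) -> float:
    """Shannon entropy (in bits) of the overlapping n-grams in seq."""
    if n < 1 or len(seq) < n:
        raise ValueError("need n >= 1 and len(seq) >= n")
    # Count every overlapping window of length n.
    counts = Counter(seq[i:i + n] for i in range(len(seq) - n + 1))
    total = sum(counts.values())
    return -sum((c / total) * log2(c / total) for c in counts.values())


def ngram_redundancy(seq: str, n: int, alphabet_size: int = 4) -> float:
    """Assumed definition: 1 - H_n / H_max, with H_max = n * log2(alphabet_size)."""
    return 1.0 - ngram_entropy(seq, n) / (n * log2(alphabet_size))
```

    Under this assumed definition, a lower n-gram entropy gives a higher redundancy, which matches the direction of the relationship stated in the abstract.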

  14. Convolutional encoding of self-dual codes

    NASA Technical Reports Server (NTRS)

    Solomon, G.

    1994-01-01

    There exist almost complete convolutional encodings of self-dual codes, i.e., block codes of rate 1/2 with weights w, w = 0 mod 4. The codes are of length 8m with the convolutional portion of length 8m-2 and the nonsystematic information of length 4m-1. The last two bits are parity checks on the two (4m-1) length parity sequences. The final information bit complements one of the extended parity sequences of length 4m. Solomon and van Tilborg have developed algorithms to generate these for the Quadratic Residue (QR) Codes of lengths 48 and beyond. For these codes and reasonable constraint lengths, there are sequential decodings for both hard and soft decisions. There are also possible Viterbi-type decodings that may be simple, as in a convolutional encoding/decoding of the extended Golay Code. In addition, the previously found constraint length K = 9 for the QR (48, 24;12) Code is lowered here to K = 8.

  15. Linear chirp phase perturbing approach for finding binary phased codes

    NASA Astrophysics Data System (ADS)

    Li, Bing C.

    2017-05-01

    Binary phased codes have many applications in communication and radar systems. These applications require binary phased codes to have low sidelobes in order to reduce interference and false detection. Barker codes satisfy these requirements and have the lowest maximum sidelobes. However, Barker codes have very limited code lengths (equal to or less than 13), while many applications, including low probability of intercept radar and spread spectrum communication, require much higher code lengths. The conventional techniques for finding binary phased codes in the literature include exhaustive search, neural network, and evolutionary methods, and they all require very expensive computation for large code lengths. Therefore these techniques are limited to finding binary phased codes with small code lengths (less than 100). In this paper, by analyzing Barker code, linear chirp, and P3 phases, we propose a new approach to find binary codes. Experiments show that the proposed method is able to find long low sidelobe binary phased codes (code length >500) with reasonable computational cost.
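
    The sidelobe criterion in this abstract can be made concrete with a short Python sketch that computes the aperiodic autocorrelation of a binary (+1/-1) code and its peak sidelobe. This is only an illustration of the figure of merit, not the search method proposed in the paper; the function names are hypothetical, and the length-13 sequence is the standard Barker code.

```python
def autocorrelation(code):
    """Aperiodic autocorrelation of a +1/-1 code for lags 0..len(code)-1."""
    n = len(code)
    return [sum(code[i] * code[i + k] for i in range(n - k)) for k in range(n)]


def peak_sidelobe(code):
    """Largest absolute autocorrelation value at any nonzero lag."""
    return max(abs(v) for v in autocorrelation(code)[1:])


# Standard length-13 Barker code.
BARKER_13 = [1, 1, 1, 1, 1, -1, -1, 1, 1, -1, 1, -1, 1]

if __name__ == "__main__":
    print("main lobe:", autocorrelation(BARKER_13)[0])
    print("peak sidelobe:", peak_sidelobe(BARKER_13))
```

    A search over longer codes would minimize peak_sidelobe, which is why exhaustive search becomes expensive: the number of candidate codes doubles with every added element.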

  16. Interleukin-1 homologues IL-1F7b and IL-18 contain functional mRNA instability elements within the coding region responsive to lipopolysaccharide

    PubMed Central

    2004-01-01

    IL-1F7b, a novel homologue of the IL-1 (interleukin 1) family, was discovered by computational cloning. We demonstrated that IL-1F7b shares critical amino acid residues with IL-18 and binds to the IL-18-binding protein enhancing its ability to inhibit IL-18-induced interferon-γ. We also showed that low levels of IL-1F7b are constitutively present intracellularly in human blood monocytes. In this study, we demonstrate that similar to IL-18, both mRNA and intracellular protein expression of IL-1F7b are up-regulated by LPS (lipopolysaccharide) in human monocytes. In stable transfectants of murine RAW264.7 macrophage cells, there was no IL-1F7b protein expression despite a highly active CMV promoter. We found that IL-1F7b-specific mRNA was rapidly degraded in transfected cells, via a 3′-UTR (untranslated region)-independent control of IL-1F7b transcript stability. After LPS stimulation, there was a rapid transient increase in IL-1F7b-specific mRNA and concomitant protein levels. Using sequence alignment, we found a conserved ten-nucleotide homology box within the open reading frame of IL-1F7b, which is flanking the coding region instability elements of some selective genes. In-frame deletion of downstream exon 5 from the full-length IL-1F7b cDNA markedly increased the levels of IL-1F7b mRNA. A similar coding region element is located in IL-18. When transfected into RAW264.7 macrophages, IL-18 mRNA was also unstable unless treated with LPS. These results indicate that both IL-1F7b and IL-18 mRNA contain functional instability determinants within their coding region, which influence mRNA decay as a novel mechanism to regulate the expression of IL-1 family members. PMID:15046617

  17. First complete mitochondrial genome of the South American annual fish Austrolebias charrua (Cyprinodontiformes: Rivulidae): peculiar features among cyprinodontiforms mitogenomes.

    PubMed

    Gutiérrez, Verónica; Rego, Natalia; Naya, Hugo; García, Graciela

    2015-10-28

    Among teleosts, the South American genus Austrolebias (Cyprinodontiformes: Rivulidae) includes 42 taxa of annual fishes divided into five different species groups. It is a monophyletic genus, but morphological and molecular data do not resolve the relationship among intrageneric clades and high rates of substitution have been previously described in some mitochondrial genes. In this work, the complete mitogenome of a species of the genus was determined for the first time. We determined its structure, gene order and peculiar evolutionary features, which will allow us to evaluate the performance of mitochondrial genes in the phylogenetic resolution at different taxonomic levels. Regarding gene content and order, the circular mitogenome of A. charrua (17,271 bp) presents the typical pattern of vertebrate mitogenomes. It contains the full complement of 13 protein-coding genes, 22 tRNA, 2 rRNA and one non-coding control region. Notably, the tRNA-Cys was only 57 bp in length and lacks the D-loop arm. In three full sibling individuals, heteroplasmatic condition was detected due to a total of 12 variable sites in seven protein-coding genes. Among cyprinodontiforms, the mitogenome of A. charrua exhibits the lowest G+C content (37 %) and GCskew, as well as the highest strand asymmetry with a net difference of T over A at 1st and 3rd codon positions. Considering the 12 coding-genes of the H strand, correspondence analyses of nucleotide composition and codon usage show that A and T at 1st and 3rd codon positions have the highest weight in the first axis, and segregate annual species from the other cyprinodontiforms analyzed. Given the annual life-style, their mitogenomes could be under different selective pressures. All 13 protein-coding genes are under strong purifying selection and we did not find any significant evidence of nucleotide sites showing episodic selection (dN > dS) in annual lineages. When fast evolving third codon positions were removed from alignments, the "supergene" tree recovers our reference species phylogeny as well as the Cytb, ND4L and ND6 genes. Therefore, third codon positions seem to be saturated in the aforementioned coding regions at intergeneric Cyprinodontiformes comparisons. The complete mitogenome obtained in the present work offers relevant data for further comparative studies on molecular phylogeny and systematics of this taxonomically controversial endemic genus of annual fishes.

  18. Cloning of the cDNA for U1 small nuclear ribonucleoprotein particle 70K protein from Arabidopsis thaliana

    NASA Technical Reports Server (NTRS)

    Reddy, A. S.; Czernik, A. J.; An, G.; Poovaiah, B. W.

    1992-01-01

    We cloned and sequenced a plant cDNA that encodes U1 small nuclear ribonucleoprotein (snRNP) 70K protein. The plant U1 snRNP 70K protein cDNA is not full length and lacks the coding region for 68 amino acids in the amino-terminal region as compared to human U1 snRNP 70K protein. Comparison of the deduced amino acid sequence of the plant U1 snRNP 70K protein with the amino acid sequence of animal and yeast U1 snRNP 70K protein showed a high degree of homology. The plant U1 snRNP 70K protein is more closely related to the human counterpart than to the yeast 70K protein. The carboxy-terminal half is less well conserved but, like the vertebrate 70K proteins, is rich in charged amino acids. Northern analysis with the RNA isolated from different parts of the plant indicates that the snRNP 70K gene is expressed in all of the parts tested. Southern blotting of genomic DNA using the cDNA indicates that the U1 snRNP 70K protein is coded by a single gene.

  19. Sequencing and Characterization of Novel PII Signaling Protein Gene in Microalga Haematococcus pluvialis.

    PubMed

    Ma, Ruijuan; Li, Yan; Lu, Yinghua

    2017-10-11

    The PII signaling protein is a key protein for controlling nitrogen assimilatory reactions in most organisms, but little information is reported on PII proteins of the green microalga Haematococcus pluvialis. Since H. pluvialis cells can produce a large amount of astaxanthin upon nitrogen starvation, its PII protein may represent an important factor in the elevated production of Haematococcus astaxanthin. This study identified and isolated the coding gene (HpGLB1) from this microalga. The full length of HpGLB1 was 1222 bp, including a 621 bp coding sequence (CDS), a 103 bp 5' untranslated region (5' UTR), and a 498 bp 3' untranslated region (3' UTR). The CDS could encode a protein with 206 amino acids (HpPII). Its calculated molecular weight (Mw) was 22.4 kDa and the theoretical isoelectric point was 9.53. When H. pluvialis cells were exposed to nitrogen starvation, HpGLB1 expression increased 2.46 times in 48 h, concomitant with the rise in astaxanthin content. This study also used phylogenetic analysis to prove that HpPII was homogeneous to the PII proteins of other green microalgae. The results formed a fundamental basis for the future study on HpPII, for its potential physiological function in Haematococcus astaxanthin biosynthesis.
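
    The lengths reported in this abstract are mutually consistent, as the following small Python sketch shows. It assumes that the 621 bp CDS includes the stop codon, which the abstract does not state explicitly; the function names are hypothetical.

```python
def transcript_length(utr5_bp: int, cds_bp: int, utr3_bp: int) -> int:
    """Full transcript length as the sum of 5' UTR, CDS and 3' UTR."""
    return utr5_bp + cds_bp + utr3_bp


def protein_length(cds_bp: int, includes_stop_codon: bool = True) -> int:
    """Number of encoded amino acids for a CDS of the given length."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    codons = cds_bp // 3
    # Assumption: the final codon is a stop codon and encodes no residue.
    return codons - 1 if includes_stop_codon else codons


if __name__ == "__main__":
    print(transcript_length(103, 621, 498))  # 103 + 621 + 498
    print(protein_length(621))               # 621 / 3 codons, minus the stop
```

    With the reported values, the three regions sum to the stated 1222 bp, and 207 codons minus one stop codon give the stated 206 amino acids.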

  20. Molecular architecture of silk fibroin of Indian golden silkmoth, Antheraea assama.

    PubMed

    Gupta, Adarsh K; Mita, Kazuei; Arunkumar, Kallare P; Nagaraju, Javaregowda

    2015-08-03

    The golden silk spun by Indian golden silkmoth Antheraea assama, is regarded for its shimmering golden luster, tenacity and value as biomaterial. This report describes the gene coding for golden silk H-fibroin (AaFhc), its expression, full-length sequence and structurally important motifs discerning the underlying genetic and biochemical factors responsible for its much sought-after properties. The coding region, with biased isocodons, encodes highly repetitious crystalline core, flanked by a pair of 5' and 3' non-repetitious ends. AaFhc mRNA expression is strictly territorial, confined to the posterior silk gland, encoding a protein of size 230 kDa, which makes homodimers making the elementary structural units of the fibrous core of the golden silk. Characteristic polyalanine repeats that make tight β-sheet crystals alternate with non-polyalanine repeats that make less orderly antiparallel β-sheets, β-turns and partial α-helices. Phylogenetic analysis of the conserved N-terminal amorphous motif and the comparative analysis of the crystalline region with other saturniid H-fibroins reveal that AaFhc has longer, numerous and relatively uniform repeat motifs with lower serine content that assume tighter β-crystals and denser packing, which are speculated to be responsible for its acclaimed properties of higher tensile strength and higher refractive index responsible for golden luster.

  1. Recombination in Avian Gamma-Coronavirus Infectious Bronchitis Virus

    PubMed Central

    Thor, Sharmi W.; Hilt, Deborah A.; Kissinger, Jessica C.; Paterson, Andrew H.; Jackwood, Mark W.

    2011-01-01

    Recombination in the family Coronaviridae has been well documented and is thought to be a contributing factor in the emergence and evolution of different coronaviral genotypes as well as different species of coronavirus. However, there are limited data available on the frequency and extent of recombination in coronaviruses in nature and particularly for the avian gamma-coronaviruses where only recently the emergence of a turkey coronavirus has been attributed solely to recombination. In this study, the full-length genomes of eight avian gamma-coronavirus infectious bronchitis virus (IBV) isolates were sequenced and along with other full-length IBV genomes available from GenBank were analyzed for recombination. Evidence of recombination was found in every sequence analyzed and was distributed throughout the entire genome. Areas that have the highest occurrence of recombination are located in regions of the genome that code for nonstructural proteins 2, 3 and 16, and the structural spike glycoprotein. The extent of the recombination observed suggests that this may be one of the principal mechanisms for generating genetic and antigenic diversity within IBV. These data indicate that reticulate evolutionary change due to recombination in IBV likely plays a major role in the origin and adaptation of the virus leading to new genetic types and strains of the virus. PMID:21994806

  2. The complete mitochondrial genome and phylogenetic analysis of the giant panda (Ailuropoda melanoleuca).

    PubMed

    Peng, Rui; Zeng, Bo; Meng, Xiuxiang; Yue, Bisong; Zhang, Zhihe; Zou, Fangdong

    2007-08-01

    The complete mitochondrial genome sequence of the giant panda, Ailuropoda melanoleuca, was determined by the long and accurate polymerase chain reaction (LA-PCR) with conserved primers and primer walking sequence methods. The complete mitochondrial DNA is 16,805 nucleotides in length and contains two ribosomal RNA genes, 13 protein-coding genes, 22 transfer RNA genes and one control region. The total length of the 13 protein-coding genes is longer than the American black bear, brown bear and polar bear by 3 amino acids at the end of ND5 gene. The codon usage also followed the typical vertebrate pattern except for an unusual ATT start codon, which initiates the NADH dehydrogenase subunit 5 (ND5) gene. The molecular phylogenetic analysis was performed on the sequences of 12 concatenated heavy-strand encoded protein-coding genes, and suggested that the giant panda is most closely related to bears.

  3. Run-length encoding graphic rules, biochemically editable designs and steganographical numeric data embedment for DNA-based cryptographical coding system

    PubMed Central

    Kawano, Tomonori

    2013-01-01

    There have been a wide variety of approaches for handling the pieces of DNA as the “unplugged” tools for digital information storage and processing, including a series of studies applied to the security-related area, such as DNA-based digital barcodes, water marks and cryptography. In the present article, novel designs of artificial genes as the media for storing the digitally compressed data for images are proposed for bio-computing purpose while natural genes principally encode for proteins. Furthermore, the proposed system allows cryptographical application of DNA through biochemically editable designs with capacity for steganographical numeric data embedment. As a model case of image-coding DNA technique application, numerically and biochemically combined protocols are employed for ciphering the given “passwords” and/or secret numbers using DNA sequences. The “passwords” of interest were decomposed into single letters and translated into the font image coded on the separate DNA chains with both the coding regions in which the images are encoded based on the novel run-length encoding rule, and the non-coding regions designed for biochemical editing and the remodeling processes revealing the hidden orientation of letters composing the original “passwords.” The latter processes require the molecular biological tools for digestion and ligation of the fragmented DNA molecules targeting at the polymerase chain reaction-engineered termini of the chains. Lastly, additional protocols for steganographical overwriting of the numeric data of interests over the image-coding DNA are also discussed. PMID:23750303

  4. piggyBac transposons expressing full-length human dystrophin enable genetic correction of dystrophic mesoangioblasts

    PubMed Central

    Loperfido, Mariana; Jarmin, Susan; Dastidar, Sumitava; Di Matteo, Mario; Perini, Ilaria; Moore, Marc; Nair, Nisha; Samara-Kuko, Ermira; Athanasopoulos, Takis; Tedesco, Francesco Saverio; Dickson, George; Sampaolesi, Maurilio; VandenDriessche, Thierry; Chuah, Marinee K.

    2016-01-01

    Duchenne muscular dystrophy (DMD) is a genetic neuromuscular disorder caused by the absence of dystrophin. We developed a novel gene therapy approach based on the use of the piggyBac (PB) transposon system to deliver the coding DNA sequence (CDS) of either full-length human dystrophin (DYS: 11.1 kb) or truncated microdystrophins (MD1: 3.6 kb; MD2: 4 kb). PB transposons encoding microdystrophins were transfected in C2C12 myoblasts, yielding 65±2% MD1 and 66±2% MD2 expression in differentiated multinucleated myotubes. A hyperactive PB (hyPB) transposase was then deployed to enable transposition of the large-size PB transposon (17 kb) encoding the full-length DYS and green fluorescence protein (GFP). Stable GFP expression attaining 78±3% could be achieved in the C2C12 myoblasts that had undergone transposition. Western blot analysis demonstrated expression of the full-length human DYS protein in myotubes. Subsequently, dystrophic mesoangioblasts from a Golden Retriever muscular dystrophy dog were transfected with the large-size PB transposon resulting in 50±5% GFP-expressing cells after stable transposition. This was consistent with correction of the differentiated dystrophic mesoangioblasts following expression of full-length human DYS. These results pave the way toward a novel non-viral gene therapy approach for DMD using PB transposons underscoring their potential to deliver large therapeutic genes. PMID:26682797

  5. Complete sequence of two tick-borne flaviviruses isolated from Siberia and the UK: analysis and significance of the 5' and 3'-UTRs.

    PubMed

    Gritsun, T S; Venugopal, K; Zanotto, P M; Mikhailov, M V; Sall, A A; Holmes, E C; Polkinghorne, I; Frolova, T V; Pogodina, V V; Lashkevich, V A; Gould, E A

    1997-05-01

    The complete nucleotide sequences of two tick-transmitted flaviviruses, Vasilchenko (Vs) from Siberia and louping ill (LI) from the UK, have been determined. The genomes were 10928 and 10871 nucleotides (nt) in length, respectively. The coding strategy and functional protein sequence motifs of tick-borne flaviviruses are presented in both Vs and LI viruses. The phylogenies based on maximum likelihood, maximum parsimony and distance analysis of the polyproteins, identified Vs virus as a member of the tick-borne encephalitis virus subgroup within the tick-borne serocomplex, genus Flavivirus, family Flaviviridae. Comparative alignment of the 3'-untranslated regions revealed deletions of different lengths essentially at the same position downstream of the stop codon for all tick-borne viruses. Two direct 27 nucleotide repeats at the 3'-end were found only for Vs and LI virus. Immediately following the deletions a region of 332-334 nt with relatively conserved primary structure (67-94% identity) was observed at the 3'-non-coding end of the virus genome. Pairwise comparisons of the nucleotide sequence data revealed similar levels of variation between the coding region, and the 5' and 3'-termini of the genome, implying an equivalent strong selective control for translated and untranslated regions. Indeed, the predicted folding of the 5' and 3'-untranslated regions revealed patterns of stem and loop structures conserved for all tick-borne flaviviruses suggesting a purifying selection for preservation of essential RNA secondary structures which could be involved in translational control and replication. The possible implications of these findings are discussed.

  6. Variability and transmission by Aphis glycines of North American and Asian Soybean mosaic virus isolates.

    PubMed

    Domier, L L; Latorre, I J; Steinlage, T A; McCoppin, N; Hartman, G L

    2003-10-01

    The variability of North American and Asian strains and isolates of Soybean mosaic virus was investigated. First, polymerase chain reaction (PCR) products representing the coat protein (CP)-coding regions of 38 SMVs were analyzed for restriction fragment length polymorphisms (RFLP). Second, the nucleotide and predicted amino acid sequence variability of the P1-coding region of 18 SMVs and the helper component/protease (HC/Pro) and CP-coding regions of 25 SMVs were assessed. The CP nucleotide and predicted amino acid sequences were the most similar and predicted phylogenetic relationships similar to those obtained from RFLP analysis. Neither RFLP nor sequence analyses of the CP-coding regions grouped the SMVs by geographical origin. The P1 and HC/Pro sequences were more variable and separated the North American and Asian SMV isolates into two groups similar to previously reported differences in pathogenic diversity of the two sets of SMV isolates. The P1 region was the most informative of the three regions analyzed. To assess the biological relevance of the sequence differences in the HC/Pro and CP coding regions, the transmissibility of 14 SMV isolates by Aphis glycines was tested. All field isolates of SMV were transmitted efficiently by A. glycines, but the laboratory isolates analyzed were transmitted poorly. The amino acid sequences from most, but not all, of the poorly transmitted isolates contained mutations in the aphid transmission-associated DAG and/or KLSC amino acid sequence motifs of CP and HC/Pro, respectively.

  7. Dystrophin Hot-Spot Mutants Leading to Becker Muscular Dystrophy Insert More Deeply into Membrane Models than the Native Protein.

    PubMed

    Ameziane-Le Hir, Sarah; Paboeuf, Gilles; Tascon, Christophe; Hubert, Jean-François; Le Rumeur, Elisabeth; Vié, Véronique; Raguénès-Nicol, Céline

    2016-07-26

    Dystrophin (DYS) is a membrane skeleton protein whose mutations lead to lethal Duchenne muscular dystrophy or to the milder Becker muscular dystrophy (BMD). One third of BMD "in-frame" exon deletions are located in the region that codes for spectrin-like repeats R16 to R21. We focused on four prevalent mutated proteins deleted in this area (called RΔ45-47, RΔ45-48, RΔ45-49, and RΔ45-51 according to the deleted exon numbers), analyzing protein/membrane interactions. Two of the mutants, RΔ45-48 and RΔ45-51, led to mild pathologies and displayed a similar triple coiled-coil structure as the full-length DYS R16-21, whereas the two others, RΔ45-47 and RΔ45-49, induced more severe pathologies and showed "fractional" structures unrelated to the normal one. To explore lipid packing, small unilamellar liposomes (SUVs) and planar monolayers were used at various initial surface pressures. The dissociation constants determined by microscale thermophoresis (MST) were much higher for the full-length DYS R16-21 than for the mutants; thus the wild type protein has weaker SUV binding. Comparing surface pressures after protein adsorption and analysis of atomic force microscopy images of mixed protein/lipid monolayers revealed that the mutants insert more into the lipid monolayer than the wild type does. In fact, in both models every deletion mutant showed more interactions with membranes than the full-length protein did. This means that mutations in the R16-21 part of dystrophin disturb the protein's molecular behavior as it relates to membranes, regardless of whether the accompanying pathology is mild or severe.

  8. Genetically encoded photocross-linkers determine the biological binding site of exendin-4 peptide in the N-terminal domain of the intact human glucagon-like peptide-1 receptor (GLP-1R)

    PubMed Central

    Koole, Cassandra; Reynolds, Christopher A.; Mobarec, Juan C.; Hick, Caroline; Sexton, Patrick M.; Sakmar, Thomas P.

    2017-01-01

    The glucagon-like peptide-1 receptor (GLP-1R) is a key therapeutic target in the management of type II diabetes mellitus, with actions including regulation of insulin biosynthesis and secretion, promotion of satiety, and preservation of β-cell mass. Like most class B G protein-coupled receptors (GPCRs), there is limited knowledge linking biological activity of the GLP-1R with the molecular structure of an intact, full-length, and functional receptor·ligand complex. In this study, we have utilized genetic code expansion to site-specifically incorporate the photoactive amino acid p-azido-l-phenylalanine (azF) into N-terminal residues of a full-length functional human GLP-1R in mammalian cells. UV-mediated photolysis of azF was then carried out to induce targeted photocross-linking to determine the proximity of the azido group in the mutant receptor with the peptide exendin-4. Cross-linking data were compared directly with the crystal structure of the isolated N-terminal extracellular domain of the GLP-1R in complex with exendin(9–39), revealing both similarities as well as distinct differences in the mode of interaction. Generation of a molecular model to accommodate the photocross-linking constraints highlights the potential influence of environmental conditions on the conformation of the receptor·peptide complex, including folding dynamics of the peptide and formation of dimeric and higher order oligomeric receptor multimers. These data demonstrate that crystal structures of isolated receptor regions may not give a complete reflection of peptide/receptor interactions and should be combined with additional experimental constraints to reveal peptide/receptor interactions occurring in the dynamic, native, and full-length receptor state. PMID:28283573

  9. A fresh look at the male-specific region of the human Y chromosome.

    PubMed

    Jangravi, Zohreh; Alikhani, Mehdi; Arefnezhad, Babak; Sharifi Tabar, Mehdi; Taleahmad, Sara; Karamzadeh, Razieh; Jadaliha, Mahdieh; Mousavi, Seyed Ahmad; Ahmadi Rastegar, Diba; Parsamatin, Pouria; Vakilian, Haghighat; Mirshahvaladi, Shahab; Sabbaghian, Marjan; Mohseni Meybodi, Anahita; Mirzaei, Mehdi; Shahhoseini, Maryam; Ebrahimi, Marzieh; Piryaei, Abbas; Moosavi-Movahedi, Ali Akbar; Haynes, Paul A; Goodchild, Ann K; Nasr-Esfahani, Mohammad Hossein; Jabbari, Esmaiel; Baharvand, Hossein; Sedighi Gilani, Mohammad Ali; Gourabi, Hamid; Salekdeh, Ghasem Hosseini

    2013-01-04

    The Chromosome-centric Human Proteome Project (C-HPP) aims to systematically map the entire human proteome with the intent to enhance our understanding of human biology at the cellular level. This project attempts simultaneously to establish a sound basis for the development of diagnostic, prognostic, therapeutic, and preventive medical applications. In Iran, current efforts focus on mapping the proteome of the human Y chromosome. The male-specific region of the Y chromosome (MSY) is unique in many aspects and comprises 95% of the chromosome's length. The MSY continually retains its haploid state and is full of repeated sequences. It is responsible for important biological roles such as sex determination and male fertility. Here, we present the most recent update of MSY protein-encoding genes and their association with various traits and diseases including sex determination and reversal, spermatogenesis and male infertility, cancers such as prostate cancers, sex-specific effects on the brain and behavior, and graft-versus-host disease. We also present information available from RNA sequencing, protein-protein interaction, post-translational modification of MSY protein-coding genes and their implications in biological systems. An overview of Human Y chromosome Proteome Project is presented and a systematic approach is suggested to ensure that at least one of each predicted protein-coding gene's major representative proteins will be characterized in the context of its major anatomical sites of expression, its abundance, and its functional relevance in a biological and/or medical context. There are many technical and biological issues that will need to be overcome in order to accomplish the full scale mapping.

  10. The complete mitochondrial genome of the Border Collie dog.

    PubMed

    Wu, An-Quan; Zhang, Yong-Liang; Li, Li-Li; Chen, Long; Yang, Tong-Wen

    2016-01-01

    The Border Collie is one of the famous dog breeds. In the present work we report the complete mitochondrial genome sequence of the Border Collie for the first time. The total length of the mitogenome was 16,730 bp, with a base composition of 31.6% A, 28.7% T, 25.5% C, and 14.2% G, making it A-T rich (60.3%). It harbored 13 protein-coding genes, two ribosomal RNA genes, 22 transfer RNA genes and one non-coding control region (D-loop region). The arrangement of all genes was identical to that of typical dog mitochondrial genomes.

  11. Complete mitochondrial genome of Eagle Owl (Bubo bubo, Strigiformes; Strigidae) from China.

    PubMed

    Hengjiu, Tian; Jianwei, Ji; Shi, Yang; Zhiming, Zhang; Laghari, Muhammad Younis; Narejo, Naeem Tariq; Lashari, Punhal

    2016-01-01

    In the present study, the complete mitochondrial genome sequence of Bubo bubo was obtained for the first time by PCR amplification, sequencing and assembly. The total length of the mitochondrial genome was 16,250 bp, with a base composition of 29.88% A, 34.16% C, 14.35% G, and 21.58% T. It contained 37 genes (2 ribosomal RNA genes, 13 protein-coding genes and 22 transfer RNA genes) and a major non-coding control region (D-loop region). The complete mitochondrial genome sequence of Bubo bubo provides an important data set for further investigation of the phylogenetic relationships within Strigiformes.

  12. Complete genome sequences of two highly divergent Japanese isolates of Plantago asiatica mosaic virus.

    PubMed

    Komatsu, Ken; Yamashita, Kazuo; Sugawara, Kota; Verbeek, Martin; Fujita, Naoko; Hanada, Kaoru; Uehara-Ichiki, Tamaki; Fuji, Shin-Ichi

    2017-02-01

    Plantago asiatica mosaic virus (PlAMV) is a member of the genus Potexvirus and has an exceptionally wide host range. It causes severe damage to lilies. Here we report on the complete nucleotide sequences of two new Japanese PlAMV isolates, one from the eudicot weed Viola grypoceras (PlAMV-Vi), and the other from the eudicot shrub Nandina domestica Thunb. (PlAMV-NJ). Their genomes contain five open reading frames (ORFs), which is characteristic of potexviruses. Surprisingly, the isolates showed only 76.0-78.0 % sequence identity with each other and with other PlAMV isolates, including isolates from Japanese lily and American nandina. Amino acid alignments of the replicase coding region encoded by ORF1 showed that the regions between the methyltransferase and helicase domains were less conserved than other regions, with several insertions and/or deletions. Phylogenetic analyses of the full-length nucleotide sequences revealed a moderate correlation between phylogenetic clustering and the original host plants of the PlAMV isolates. This study revealed the presence of two highly divergent PlAMV isolates in Japan.

  13. Substantial Fast-Wave Power Flux in the SOL of a Cylindrical Model; Comparison with Coaxial Modes

    NASA Astrophysics Data System (ADS)

    Perkins, R. J.; Bertelli, N.; Hosea, J. C.; Phillips, C. K.; Taylor, G.; Wilson, J. R.

    2015-11-01

    The NSTX high-harmonic fast-wave (HHFW) heating system can lose a significant amount of power along magnetic field lines in the SOL to the divertor regions under certain conditions. A cylindrical cold-plasma model, with parameters resembling those of NSTX, shows the existence of modes with relatively large RF field amplitudes in the low-density annulus, similar to recent results found with the full-wave simulation AORSA. Here, we compare and contrast these modes against "coaxial modes," modes that resemble TEM modes found in coaxial cables. We also compute the 3D Poynting flux as a function of length along the cylinder for comparison to NSTX. Such work is part of an effort to include the proper edge damping into full-wave codes so that they can reproduce the losses observed in NSTX and predict their importance for ITER. This work was supported by DOE Contract No. DE-AC02-09CH11466.

  14. The complete mitochondrial genome of the bagarius yarrelli from honghe river

    NASA Astrophysics Data System (ADS)

    Du, M.; Zhou, C. J.; Niu, B. Z.; Liu, Y. H.; Li, N.; Ai, J. L.; Xu, G. L.

    2016-08-01

    The complete mitochondrial DNA sequence of Bagarius yarrelli from the Honghe River of China is determined in this paper. The circular molecule is 16,524 base pairs long and shows a gene order similar to that of other bony fishes, comprising a non-coding control region, an origin of replication, two ribosomal RNA (rRNA) genes, 22 transfer RNA (tRNA) genes and 13 protein-coding genes. Its overall base composition is 31.4% A, 26.9% C, 15.7% G and 26.0% T, with an A+T bias of 57.4%. These mitochondrial data will contribute to further study of the molecular evolution and population genetics of this species.
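
    Base-composition figures such as those reported above are simple per-base percentages, and the A+T figure is the sum of the A and T shares. The sketch below shows one way to compute them and to check the reported numbers for consistency. The function names and the short test sequence are illustrative assumptions, not part of the cited work.

```python
from collections import Counter


def base_composition(seq: str) -> dict[str, float]:
    """Return the percentage of A, C, G and T among unambiguous bases."""
    counts = Counter(base for base in seq.upper() if base in "ACGT")
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sequence contains no A/C/G/T bases")
    return {base: 100.0 * counts[base] / total for base in "ACGT"}


def at_content(composition: dict[str, float]) -> float:
    """A+T percentage, given per-base percentages."""
    return composition["A"] + composition["T"]


# Percentages as reported in the abstract above.
reported = {"A": 31.4, "C": 26.9, "G": 15.7, "T": 26.0}
reported_at = round(at_content(reported), 1)

# Hypothetical toy sequence, used only to exercise base_composition().
toy = base_composition("AACGTT")
```

    With the reported values, the A and T shares add up to the stated 57.4%, and the four percentages sum to 100%.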

  15. Complete sequence and gene organization of the mitochondrial genome of Asio flammeus (Strigiformes, strigidae).

    PubMed

    Zhang, Yanan; Song, Tao; Pan, Tao; Sun, Xiaonan; Sun, Zhonglou; Qian, Lifu; Zhang, Baowei

    2016-07-01

    The complete sequence of the mitochondrial genome was determined for Asio flammeus, a geographically widespread species. The length of the complete mitochondrial genome was 18,966 bp, containing 2 rRNA genes, 22 tRNA genes, 13 protein-coding genes (PCGs), and 1 non-coding region (D-loop). All the genes were distributed on the H-strand, except for the ND6 subunit gene and eight tRNA genes, which were encoded on the L-strand. The D-loop of A. flammeus contained many tandem repeats of varying lengths and repeat numbers. The molecular phylogeny showed that A. flammeus is the sister group to A. capensis and supported the monophyly of Asio.

  16. Morphometric Analysis of Recognized Genes for Autism Spectrum Disorders and Obesity in Relationship to the Distribution of Protein-Coding Genes on Human Chromosomes.

    PubMed

    McGuire, Austen B; Rafi, Syed K; Manzardo, Ann M; Butler, Merlin G

    2016-05-05

    Mammalian chromosomes are comprised of complex chromatin architecture with the specific assembly and configuration of each chromosome influencing gene expression and function in yet undefined ways by varying degrees of heterochromatinization that result in Giemsa (G) negative euchromatic (light) bands and G-positive heterochromatic (dark) bands. We carried out morphometric measurements of high-resolution chromosome ideograms for the first time to characterize the total euchromatic and heterochromatic chromosome band length, distribution and localization of 20,145 known protein-coding genes, 790 recognized autism spectrum disorder (ASD) genes and 365 obesity genes. The individual lengths of G-negative euchromatin and G-positive heterochromatin chromosome bands were measured in millimeters and recorded from scaled and stacked digital images of 850-band high-resolution ideograms supplied by the International Society of Chromosome Nomenclature (ISCN) 2013. Our overall measurements followed established banding patterns based on chromosome size. G-negative euchromatic band regions contained 60% of protein-coding genes while the remaining 40% were distributed across the four heterochromatic dark band sub-types. ASD genes were disproportionately overrepresented in the darker heterochromatic sub-bands, while the obesity gene distribution pattern did not significantly differ from protein-coding genes. Our study supports recent trends implicating genes located in heterochromatin regions playing a role in biological processes including neurodevelopment and function, specifically genes associated with ASD.

  17. The Complete Sequence of a Human Parainfluenzavirus 4 Genome

    PubMed Central

    Yea, Carmen; Cheung, Rose; Collins, Carol; Adachi, Dena; Nishikawa, John; Tellier, Raymond

    2009-01-01

    Although the human parainfluenza virus 4 (HPIV4) has been known for a long time, its genome, alone among the human paramyxoviruses, has not been completely sequenced to date. In this study we obtained the first complete genomic sequence of HPIV4 from a clinical isolate named SKPIV4 obtained at the Hospital for Sick Children in Toronto (Ontario, Canada). The coding regions for the N, P/V, M, F and HN proteins show very high identities (95% to 97%) with previously available partial sequences for HPIV4B. The sequence for the L protein and the non-coding regions represent new information. A surprising feature of the genome is its length, more than 17 kb, making it the longest genome within the genus Rubulavirus, although the length is well within the known range of 15 kb to 19 kb for the subfamily Paramyxovirinae. The availability of a complete genomic sequence will facilitate investigations on a respiratory virus that is still not completely characterized. PMID:21994536

  18. Informatic and genomic analysis of melanocyte cDNA libraries as a resource for the study of melanocyte development and function.

    PubMed

    Baxter, Laura L; Hsu, Benjamin J; Umayam, Lowell; Wolfsberg, Tyra G; Larson, Denise M; Frith, Martin C; Kawai, Jun; Hayashizaki, Yoshihide; Carninci, Piero; Pavan, William J

    2007-06-01

    As part of the RIKEN mouse encyclopedia project, two cDNA libraries were prepared from melanocyte-derived cell lines, using techniques of full-length clone selection and subtraction/normalization to enrich for rare transcripts. End sequencing showed that these libraries display over 83% complete coding sequence at the 5' end and 96-97% complete coding sequence at the 3' end. Evaluation of the libraries, derived from B16F10Y tumor cells and melan-c cells, revealed that they contain clones for a majority of the genes previously demonstrated to function in melanocyte biology. Analysis of genomic locations for transcripts revealed that the distribution of melanocyte genes is non-random throughout the genome. Three genomic regions identified that showed significant clustering of melanocyte-expressed genes contain one or more genes previously shown to regulate melanocyte development or function. A catalog of genes expressed in these libraries is presented, providing a valuable resource of cDNA clones and sequence information that can be used for identification of new genes important for melanocyte development, function, and disease.

  19. Using hidden Markov models and observed evolution to annotate viral genomes.

    PubMed

    McCauley, Stephen; Hein, Jotun

    2006-06-01

    ssRNA (single stranded) viral genomes are generally constrained in length and utilize overlapping reading frames to maximally exploit the coding potential within the genome length restrictions. This overlapping coding phenomenon leads to complex evolutionary constraints operating on the genome. In regions which code for more than one protein, silent mutations in one reading frame generally have a protein coding effect in another. To maximize coding flexibility in all reading frames, overlapping regions are often compositionally biased towards amino acids which are 6-fold degenerate with respect to the 64 codon alphabet. Previous methodologies have used this fact in an ad hoc manner to look for overlapping genes by motif matching. In this paper differentiated nucleotide compositional patterns in overlapping regions are incorporated into a probabilistic hidden Markov model (HMM) framework which is used to annotate ssRNA viral genomes. This work focuses on single sequence annotation and applies an HMM framework to ssRNA viral annotation. A description of how the HMM is parameterized, whilst annotating within a missing data framework is given. A Phylogenetic HMM (Phylo-HMM) extension, as applied to 14 aligned HIV2 sequences is also presented. This evolutionary extension serves as an illustration of the potential of the Phylo-HMM framework for ssRNA viral genomic annotation. The single sequence annotation procedure (SSA) is applied to 14 different strains of the HIV2 virus. Further results on alternative ssRNA viral genomes are presented to illustrate more generally the performance of the method. The results of the SSA method are encouraging however there is still room for improvement, and since there is overwhelming evidence to indicate that comparative methods can improve coding sequence (CDS) annotation, the SSA method is extended to a Phylo-HMM to incorporate evolutionary information. 
    The Phylo-HMM extension is applied to the same set of 14 HIV2 sequences which are pre-aligned. The performance improvement that results from including the evolutionary information in the analysis is illustrated.
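
    The abstract above refers to amino acids that are 6-fold degenerate with respect to the 64-codon alphabet. As a hedged illustration of that one point, and not of the paper's HMM, the sketch below builds the standard genetic code and counts codons per amino acid. It assumes the standard code applies. The variable names are illustrative.

```python
from collections import Counter
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G and the
# third position varying fastest ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Degeneracy = number of distinct codons that encode each amino acid.
degeneracy = Counter(CODON_TABLE.values())
six_fold = sorted(aa for aa, n in degeneracy.items() if n == 6)
```

    Under the standard code the 6-fold degenerate amino acids are leucine (L), arginine (R) and serine (S).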

  20. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Strauss, H.R.

    This paper describes the code FEMHD, an adaptive finite element MHD code, which is applied in a number of different manners to model MHD behavior and edge plasma phenomena on a diverted tokamak. The code uses an unstructured triangular mesh in 2D and wedge shaped mesh elements in 3D. The code has been adapted to look at neutral and charged particle dynamics in the plasma scrape off region, and into a full MHD-particle code.

  1. Optimal Codes for the Burst Erasure Channel

    NASA Technical Reports Server (NTRS)

    Hamkins, Jon

    2010-01-01

    Deep space communications over noisy channels lead to certain packets that are not decodable. These packets leave gaps, or bursts of erasures, in the data stream. Burst erasure correcting codes overcome this problem. These are forward erasure correcting codes that allow one to recover the missing gaps of data. Much of the recent work on this topic concentrated on Low-Density Parity-Check (LDPC) codes. These are more complicated to encode and decode than Single Parity Check (SPC) codes or Reed-Solomon (RS) codes, and so far have not been able to achieve the theoretical limit for burst erasure protection. A block interleaved maximum distance separable (MDS) code (e.g., an SPC or RS code) offers near-optimal burst erasure protection, in the sense that no other scheme of equal total transmission length and code rate could improve the guaranteed correctible burst erasure length by more than one symbol. The optimality does not depend on the length of the code, i.e., a short MDS code block interleaved to a given length would perform as well as a longer MDS code interleaved to the same overall length. As a result, this approach offers lower decoding complexity with better burst erasure protection compared to other recent designs for the burst erasure channel (e.g., LDPC codes). A limitation of the design is its lack of robustness to channels that have impairments other than burst erasures (e.g., additive white Gaussian noise), making its application best suited for correcting data erasures in layers above the physical layer. The efficiency of a burst erasure code is the length of its burst erasure correction capability divided by the theoretical upper limit on this length. The inefficiency is one minus the efficiency. 
    The illustration compares the inefficiency of interleaved RS codes to Quasi-Cyclic (QC) LDPC codes, Euclidean Geometry (EG) LDPC codes, extended Irregular Repeat Accumulate (eIRA) codes, array codes, and random LDPC codes previously proposed for burst erasure protection. As can be seen, the simple interleaved RS codes have substantially lower inefficiency over a wide range of transmission lengths.
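
    The block-interleaving idea described above can be sketched with the simplest MDS code mentioned, a single parity check (SPC) code. This is a minimal illustration under assumed parameters (four codewords of three data symbols each, integer symbols, XOR parity), not the design evaluated in the report. All function names are hypothetical.

```python
from functools import reduce
from operator import xor


def spc_encode(data: list[int]) -> list[int]:
    """Append one XOR parity symbol so the whole codeword XORs to zero."""
    return data + [reduce(xor, data, 0)]


def interleave(codewords: list[list[int]]) -> list[int]:
    """Send symbol 0 of every codeword, then symbol 1 of every codeword, ..."""
    return [cw[j] for j in range(len(codewords[0])) for cw in codewords]


def deinterleave(stream: list, depth: int) -> list[list]:
    """Regroup the received stream into its `depth` original codewords."""
    return [stream[i::depth] for i in range(depth)]


def spc_repair(codeword: list) -> list[int]:
    """Fill in at most one erased symbol (None) and return the data part."""
    missing = [i for i, s in enumerate(codeword) if s is None]
    if len(missing) > 1:
        raise ValueError("an SPC code repairs only one erasure per codeword")
    fixed = list(codeword)
    if missing:
        fixed[missing[0]] = reduce(xor, (s for s in codeword if s is not None), 0)
    return fixed[:-1]


blocks = [[1, 2, 3], [4, 5, 6], [7, 8, 9], [10, 11, 12]]
depth = len(blocks)
sent = interleave([spc_encode(b) for b in blocks])

# Erase a burst of `depth` consecutive symbols starting at position 5.
received = [None if 5 <= i < 5 + depth else s for i, s in enumerate(sent)]
recovered = [spc_repair(cw) for cw in deinterleave(received, depth)]
```

    Because consecutive transmitted symbols belong to different codewords, a burst no longer than the interleaving depth erases at most one symbol per codeword, which the parity symbol can restore.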

  2. A survey of the sorghum transcriptome using single-molecule long reads

    DOE PAGES

    Abdel-Ghany, Salah E.; Hamilton, Michael; Jacobi, Jennifer L.; ...

    2016-06-24

    Alternative splicing and alternative polyadenylation (APA) of pre-mRNAs greatly contribute to transcriptome diversity, coding capacity of a genome and gene regulatory mechanisms in eukaryotes. Second-generation sequencing technologies have been extensively used to analyse transcriptomes. However, a major limitation of short-read data is that it is difficult to accurately predict full-length splice isoforms. Here we sequenced the sorghum transcriptome using Pacific Biosciences single-molecule real-time long-read isoform sequencing and developed a pipeline called TAPIS (Transcriptome Analysis Pipeline for Isoform Sequencing) to identify full-length splice isoforms and APA sites. Our analysis reveals transcriptome-wide full-length isoforms at an unprecedented scale with over 11,000 novel splice isoforms. Additionally, we uncover APA of ∼11,000 expressed genes and more than 2,100 novel genes. Lastly, these results greatly enhance sorghum gene annotations and aid in studying gene regulation in this important bioenergy crop. The TAPIS pipeline will serve as a useful tool to analyse Iso-Seq data from any organism.

  3. A survey of the sorghum transcriptome using single-molecule long reads

    PubMed Central

    Abdel-Ghany, Salah E.; Hamilton, Michael; Jacobi, Jennifer L.; Ngam, Peter; Devitt, Nicholas; Schilkey, Faye; Ben-Hur, Asa; Reddy, Anireddy S. N.

    2016-01-01

    Alternative splicing and alternative polyadenylation (APA) of pre-mRNAs greatly contribute to transcriptome diversity, coding capacity of a genome and gene regulatory mechanisms in eukaryotes. Second-generation sequencing technologies have been extensively used to analyse transcriptomes. However, a major limitation of short-read data is that it is difficult to accurately predict full-length splice isoforms. Here we sequenced the sorghum transcriptome using Pacific Biosciences single-molecule real-time long-read isoform sequencing and developed a pipeline called TAPIS (Transcriptome Analysis Pipeline for Isoform Sequencing) to identify full-length splice isoforms and APA sites. Our analysis reveals transcriptome-wide full-length isoforms at an unprecedented scale with over 11,000 novel splice isoforms. Additionally, we uncover APA of ∼11,000 expressed genes and more than 2,100 novel genes. These results greatly enhance sorghum gene annotations and aid in studying gene regulation in this important bioenergy crop. The TAPIS pipeline will serve as a useful tool to analyse Iso-Seq data from any organism. PMID:27339290

  4. High-throughput annotation of full-length long noncoding RNAs with capture long-read sequencing.

    PubMed

    Lagarde, Julien; Uszczynska-Ratajczak, Barbara; Carbonell, Silvia; Pérez-Lluch, Sílvia; Abad, Amaya; Davis, Carrie; Gingeras, Thomas R; Frankish, Adam; Harrow, Jennifer; Guigo, Roderic; Johnson, Rory

    2017-12-01

    Accurate annotation of genes and their transcripts is a foundation of genomics, but currently no annotation technique combines throughput and accuracy. As a result, reference gene collections remain incomplete: many gene models are fragmentary, and thousands more remain uncataloged, particularly for long noncoding RNAs (lncRNAs). To accelerate lncRNA annotation, the GENCODE consortium has developed RNA Capture Long Seq (CLS), which combines targeted RNA capture with third-generation long-read sequencing. Here we present an experimental reannotation of the GENCODE intergenic lncRNA populations in matched human and mouse tissues that resulted in novel transcript models for 3,574 and 561 gene loci, respectively. CLS approximately doubled the annotated complexity of targeted loci, outperforming existing short-read techniques. Full-length transcript models produced by CLS enabled us to definitively characterize the genomic features of lncRNAs, including promoter and gene structure, and protein-coding potential. Thus, CLS removes a long-standing bottleneck in transcriptome annotation and generates manual-quality full-length transcript models at high-throughput scales.

  5. Lossy to lossless object-based coding of 3-D MRI data.

    PubMed

    Menegaz, Gloria; Thiran, Jean-Philippe

    2002-01-01

    We propose a fully three-dimensional (3-D) object-based coding system exploiting the diagnostic relevance of the different regions of the volumetric data for rate allocation. The data are first decorrelated via a 3-D discrete wavelet transform. The implementation via the lifting steps scheme allows integer-to-integer mapping, enabling lossless coding, and facilitates the definition of the object-based inverse transform. The coding process assigns disjoint segments of the bitstream to the different objects, which can be independently accessed and reconstructed at any up-to-lossless quality. Two fully 3-D coding strategies are considered: embedded zerotree coding (EZW-3D) and multidimensional layered zero coding (MLZC), both generalized for region of interest (ROI)-based processing. In order to avoid artifacts along region boundaries, some extra coefficients must be encoded for each object. This adds overhead to the bitstream with respect to the case where the volume is encoded as a whole. The amount of such extra information depends on both the filter length and the decomposition depth. The system is characterized on a set of head magnetic resonance images. Results show that MLZC and EZW-3D perform competitively. In particular, the best MLZC mode outperforms the other state-of-the-art techniques on one of the datasets for which results are available in the literature.
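
    The integer-to-integer property of the lifting scheme mentioned in this abstract can be illustrated with a minimal one-dimensional sketch. The abstract does not state which wavelet filter the authors used, so the integer Haar (S) transform below is an assumption chosen for brevity, and the function names are hypothetical. The point is only that each lifting step is undone exactly by its mirror step, so reconstruction is lossless.

```python
def forward_lift(signal):
    """One level of the integer Haar (S) transform written as lifting steps.

    Assumption: this filter is illustrative only; the abstract does not
    name the wavelet used in the 3-D system.
    """
    if len(signal) % 2:
        raise ValueError("signal length must be even")
    even = signal[0::2]
    odd = signal[1::2]
    # Predict step: the detail is the difference between neighbours.
    detail = [o - e for e, o in zip(even, odd)]
    # Update step: add floor(detail / 2); >> 1 floors for negative ints too.
    approx = [e + (d >> 1) for e, d in zip(even, detail)]
    return approx, detail


def inverse_lift(approx, detail):
    """Undo the lifting steps in reverse order; exact for integer input."""
    even = [a - (d >> 1) for a, d in zip(approx, detail)]
    odd = [d + e for e, d in zip(even, detail)]
    out = []
    for e, o in zip(even, odd):
        out.extend((e, o))
    return out


samples = [12, 15, 7, 7, 200, 3, 0, 255]  # toy integer intensities
approx, detail = forward_lift(samples)
restored = inverse_lift(approx, detail)
```

    Because every intermediate value stays an integer, `restored` equals `samples` exactly. A 3-D transform in this style would apply such steps along each axis in turn.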

  6. piggyBac transposons expressing full-length human dystrophin enable genetic correction of dystrophic mesoangioblasts.

    PubMed

    Loperfido, Mariana; Jarmin, Susan; Dastidar, Sumitava; Di Matteo, Mario; Perini, Ilaria; Moore, Marc; Nair, Nisha; Samara-Kuko, Ermira; Athanasopoulos, Takis; Tedesco, Francesco Saverio; Dickson, George; Sampaolesi, Maurilio; VandenDriessche, Thierry; Chuah, Marinee K

    2016-01-29

    Duchenne muscular dystrophy (DMD) is a genetic neuromuscular disorder caused by the absence of dystrophin. We developed a novel gene therapy approach based on the use of the piggyBac (PB) transposon system to deliver the coding DNA sequence (CDS) of either full-length human dystrophin (DYS: 11.1 kb) or truncated microdystrophins (MD1: 3.6 kb; MD2: 4 kb). PB transposons encoding microdystrophins were transfected in C2C12 myoblasts, yielding 65±2% MD1 and 66±2% MD2 expression in differentiated multinucleated myotubes. A hyperactive PB (hyPB) transposase was then deployed to enable transposition of the large-size PB transposon (17 kb) encoding the full-length DYS and green fluorescent protein (GFP). Stable GFP expression attaining 78±3% could be achieved in the C2C12 myoblasts that had undergone transposition. Western blot analysis demonstrated expression of the full-length human DYS protein in myotubes. Subsequently, dystrophic mesoangioblasts from a Golden Retriever muscular dystrophy dog were transfected with the large-size PB transposon, resulting in 50±5% GFP-expressing cells after stable transposition. This was consistent with correction of the differentiated dystrophic mesoangioblasts following expression of full-length human DYS. These results pave the way toward a novel non-viral gene therapy approach for DMD using PB transposons, underscoring their potential to deliver large therapeutic genes. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

  7. Analysis of the Length of Braille Texts in English Braille American Edition, the Nemeth Code, and Computer Braille Code versus the Unified English Braille Code

    ERIC Educational Resources Information Center

    Knowlton, Marie; Wetzel, Robin

    2006-01-01

    This study compared the length of text in English Braille American Edition, the Nemeth code, and the computer braille code with the Unified English Braille Code (UEBC)--also known as Unified English Braille (UEB). The findings indicate that differences in the length of text are dependent on the type of material that is transcribed and the grade…

  8. Nihilism, relativism, and Engelhardt.

    PubMed

    Wreen, M

    1998-01-01

    This paper is a critical analysis of Tristram Engelhardt's attempts to avoid unrestricted nihilism and relativism. The focus of attention is his recent book, The Foundations of Bioethics (Oxford University Press, 1996). No substantive or "content-full" bioethics (e.g., that of Roman Catholicism or the Samurai) has an intersubjectively verifiable and universally binding foundation, Engelhardt thinks, for unaided secular reason cannot show that any particular substantive morality (or moral code) is correct. He thus seems to be committed to either nihilism or relativism. The first is the view that there is not even one true or valid moral code, and the second is the view that there is a plurality of true or valid moral codes. However, Engelhardt rejects both nihilism and relativism, at least in unrestricted form. Strictly speaking, he himself is a universalist, someone who believes that there is a single true moral code. Two argumentative strategies are employed by him to fend off unconstrained nihilism and relativism. The first argues that although all attempts to establish a content-full morality on the basis of secular reason fail, secular reason can still establish a content-less, purely procedural morality. Although not content-full and incapable of providing positive direction in life, much less a meaning of life, such a morality does limit the range of relativism and nihilism. The second argues that there is a single true, content-full morality. Grace and revelation, however, are needed to make it available to us; secular reason alone is not up to the task. This second line of argument is not pursued in The Foundations at any length, but it does crop up at times, and if it is sound, nihilism and relativism can be much more thoroughly routed than the first line of argument has it. Engelhardt's position and argumentative strategies are exposed at length and accorded a detailed critical examination. 
    In the end, it is concluded that neither strategy will do, and that Engelhardt is probably committed to some form of relativism.

  9. The complete mitochondrial genome of Pomacea canaliculata (Gastropoda: Ampullariidae).

    PubMed

    Zhou, Xuming; Chen, Yu; Zhu, Shanliang; Xu, Haigen; Liu, Yan; Chen, Lian

    2016-01-01

    The mitochondrial genome of Pomacea canaliculata (Gastropoda: Ampullariidae) is the first complete mtDNA sequence reported in the genus Pomacea. The total length of the mtDNA is 15,707 bp, which contains 13 protein-coding genes, 2 ribosomal RNAs, 22 transfer RNAs, and a 359 bp non-coding region. The A + T content of the overall base composition of the H-strand is 71.7% (T: 41%, C: 12.7%, A: 30.7%, G: 15.6%). The ATP6, ATP8, CO1, CO2, ND1-3, ND5, ND6, ND4L and Cyt b genes begin with ATG as the start codon, whereas CO3 and ND4 begin with ATA. The ATP8, CO2-3, ND4L, ND2-6 and Cyt b genes are terminated with TAA as the stop codon, whereas ATP6, ND1, and CO1 end with TAG. A long non-coding region is found, in which a 23 bp repeat unit is repeated 11 times.
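
    The base-composition figures in this abstract can be cross-checked with a few lines of arithmetic. The sketch below is illustrative only: the helper names are hypothetical, the toy sequence is invented, and only the four percentages in `reported` come from the abstract.

```python
from collections import Counter


def base_composition(seq):
    """Return the percentage of A, C, G and T in a nucleotide string."""
    counts = Counter(base for base in seq.upper() if base in "ACGT")
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sequence contains no A/C/G/T characters")
    return {base: 100.0 * counts[base] / total for base in "ACGT"}


def at_content(composition):
    """A + T percentage from a composition dictionary."""
    return composition["A"] + composition["T"]


# Percentages as reported in the abstract for the H-strand.
reported = {"T": 41.0, "C": 12.7, "A": 30.7, "G": 15.6}
reported_at = round(at_content(reported), 1)        # A + T
reported_sum = round(sum(reported.values()), 1)     # should be 100

# Invented toy sequence, only to show the function on raw sequence data.
toy = base_composition("ATTTAGCTAA")
```

    With the reported values, A + T comes to 71.7% and the four bases sum to 100%, which agrees with the figure quoted in the abstract.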

  10. Successful Recovery of Nuclear Protein-Coding Genes from Small Insects in Museums Using Illumina Sequencing.

    PubMed

    Kanda, Kojun; Pflug, James M; Sproul, John S; Dasenko, Mark A; Maddison, David R

    2015-01-01

    In this paper we explore high-throughput Illumina sequencing of nuclear protein-coding, ribosomal, and mitochondrial genes in small, dried insects stored in natural history collections. We sequenced one tenebrionid beetle and 12 carabid beetles ranging in size from 3.7 to 9.7 mm in length that have been stored in various museums for 4 to 84 years. Although we chose a number of old, small specimens for which we expected low sequence recovery, we successfully recovered at least some low-copy nuclear protein-coding genes from all specimens. For example, in one 56-year-old beetle, 4.4 mm in length, our de novo assembly recovered about 63% of approximately 41,900 nucleotides in a target suite of 67 nuclear protein-coding gene fragments, and 70% using a reference-based assembly. Even in the least successfully sequenced carabid specimen, reference-based assembly yielded fragments that were at least 50% of the target length for 34 of 67 nuclear protein-coding gene fragments. Exploration of alternative references for reference-based assembly revealed few signs of bias created by the reference. For all specimens we recovered almost complete copies of ribosomal and mitochondrial genes. We verified the general accuracy of the sequences through comparisons with sequences obtained from PCR and Sanger sequencing, including of conspecific, fresh specimens, and through phylogenetic analysis that tested the placement of sequences in predicted regions. A few possible inaccuracies in the sequences were detected, but these rarely affected the phylogenetic placement of the samples. 
    Although our sample sizes are low, an exploratory regression study suggests that the dominant factor in predicting success at recovering nuclear protein-coding genes is a high number of Illumina reads, with success at PCR of COI and killing by immersion in ethanol being secondary factors; in analyses of only high-read samples, the primary significant explanatory variable was body length, with small beetles being more successfully sequenced.

  11. Successful Recovery of Nuclear Protein-Coding Genes from Small Insects in Museums Using Illumina Sequencing

    PubMed Central

    Dasenko, Mark A.

    2015-01-01

    In this paper we explore high-throughput Illumina sequencing of nuclear protein-coding, ribosomal, and mitochondrial genes in small, dried insects stored in natural history collections. We sequenced one tenebrionid beetle and 12 carabid beetles ranging in size from 3.7 to 9.7 mm in length that have been stored in various museums for 4 to 84 years. Although we chose a number of old, small specimens for which we expected low sequence recovery, we successfully recovered at least some low-copy nuclear protein-coding genes from all specimens. For example, in one 56-year-old beetle, 4.4 mm in length, our de novo assembly recovered about 63% of approximately 41,900 nucleotides in a target suite of 67 nuclear protein-coding gene fragments, and 70% using a reference-based assembly. Even in the least successfully sequenced carabid specimen, reference-based assembly yielded fragments that were at least 50% of the target length for 34 of 67 nuclear protein-coding gene fragments. Exploration of alternative references for reference-based assembly revealed few signs of bias created by the reference. For all specimens we recovered almost complete copies of ribosomal and mitochondrial genes. We verified the general accuracy of the sequences through comparisons with sequences obtained from PCR and Sanger sequencing, including of conspecific, fresh specimens, and through phylogenetic analysis that tested the placement of sequences in predicted regions. A few possible inaccuracies in the sequences were detected, but these rarely affected the phylogenetic placement of the samples. 
    Although our sample sizes are low, an exploratory regression study suggests that the dominant factor in predicting success at recovering nuclear protein-coding genes is a high number of Illumina reads, with success at PCR of COI and killing by immersion in ethanol being secondary factors; in analyses of only high-read samples, the primary significant explanatory variable was body length, with small beetles being more successfully sequenced. PMID:26716693

  12. Common position of indels that cause deviations from canonical genome organization in different measles virus strains.

    PubMed

    Ivancic-Jelecki, Jelena; Slovic, Anamarija; Šantak, Maja; Tešović, Goran; Forcic, Dubravko

    2016-07-29

    The canonical genome organization of measles virus (MV) is characterized by a total size of 15 894 nucleotides (nts) and a defined length of every genomic region, both coding and non-coding. Only rarely have reports of strains possessing non-canonical genomic properties (possessing indels, with or without the change of total genome length) been published. The observed mutations are mutually compensatory in the sense that the total genome length remains polyhexameric. Although programmed and highly precise pseudo-templated nucleotide additions during transcription are inherent to polymerases of all viruses belonging to family Paramyxoviridae, a similar mechanism that would serve to non-randomly correct genome length, if an indel has occurred during replication, has so far not been described in the context of a complete virus genome. We compiled all complete MV genomic sequences (64 in total) available in open access sequence databases. Multiple sequence comparisons and phylogenetic analyses were performed with the aim of exploring whether non-recombinant measles strains that are not evolutionarily linked and that show deviations from canonical genome organization possess a common genetic characteristic. In 11 MV sequences we detected deviations from canonical genome organization due to short indels located within homopolymeric stretches or next to them. In nine out of 11 identified non-canonical MV sequences, a common feature was observed: one mutation, either an insertion or a deletion, was located in a 28-nt-long region in the F gene 5' untranslated region (positions 5051-5078 in genomic cDNA of canonical strains). This segment is composed of five tandemly linked homopolymeric stretches; its consensus sequence is G_{6-7}C_{7-8}A_{6-7}G_{1-3}C_{5-6}. Although none of the mononucleotide repeats within this segment has a fixed length, the total number of nts in canonical strains is always 28. These nine non-canonical strains, as well as the tenth (not mutated in 5051-5078 segment), can be grouped in three clusters, based on their passage histories/epidemiological data/genetic similarities. There are no indications that the 3 clusters are evolutionarily linked, other than the fact that they all belong to clade D. A common narrow genomic region was found to be mutated in different, non-related, wild type strains, suggesting that this region might have a function in non-random genome length corrections occurring during MV replication.
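
    Two checks implied by this abstract are easy to sketch: whether a genome length is polyhexameric (a multiple of six), and how a segment breaks down into homopolymeric runs. The function names below are hypothetical, and the example segment is constructed by hand to fit the stated consensus and the 28-nt total; it is not taken from any real strain.

```python
from itertools import groupby


def is_polyhexameric(genome_length):
    """True when the genome length is an exact multiple of six."""
    return genome_length % 6 == 0


def homopolymer_runs(seq):
    """Split a sequence into (base, run_length) pairs, in order."""
    return [(base, len(list(group))) for base, group in groupby(seq.upper())]


CANONICAL_LENGTH = 15894  # total genome size quoted in the abstract

# Hand-built segment matching the consensus ranges (G 6-7, C 7-8, A 6-7,
# G 1-3, C 5-6) with a total of 28 nt. Illustrative only.
segment = "G" * 6 + "C" * 8 + "A" * 6 + "G" * 3 + "C" * 5
runs = homopolymer_runs(segment)
segment_length = sum(length for _, length in runs)

# A single-nucleotide indel breaks the multiple-of-six property unless
# a second, compensating indel restores it.
after_insertion = is_polyhexameric(CANONICAL_LENGTH + 1)
after_compensation = is_polyhexameric(CANONICAL_LENGTH + 1 - 1)
```

    Under these assumptions the canonical length passes the check (15 894 = 6 × 2649), a lone one-nucleotide insertion fails it, and a compensating deletion restores it, which is the sense in which the abstract calls the observed mutations mutually compensatory.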

  13. The Mitochondrial Cytochrome Oxidase Subunit I Gene Occurs on a Minichromosome with Extensive Heteroplasmy in Two Species of Chewing Lice, Geomydoecus aurei and Thomomydoecus minor

    PubMed Central

    Pietan, Lucas L.; Spradling, Theresa A.

    2016-01-01

    In animals, mitochondrial DNA (mtDNA) typically occurs as a single circular chromosome with 13 protein-coding genes and 22 tRNA genes. The various species of lice examined previously, however, have shown mitochondrial genome rearrangements with a range of chromosome sizes and numbers. Our research demonstrates that the mitochondrial genomes of two species of chewing lice found on pocket gophers, Geomydoecus aurei and Thomomydoecus minor, are fragmented with the 1,536 base-pair (bp) cytochrome-oxidase subunit I (cox1) gene occurring as the only protein-coding gene on a 1,916–1,964 bp minicircular chromosome in the two species, respectively. The cox1 gene of T. minor begins with an atypical start codon, while that of G. aurei does not. Components of the non-protein coding sequence of G. aurei and T. minor include a tRNA (isoleucine) gene, inverted repeat sequences consistent with origins of replication, and an additional non-coding region that is smaller than the non-coding sequence of other lice with such fragmented mitochondrial genomes. Sequences of cox1 minichromosome clones for each species reveal extensive length and sequence heteroplasmy in both coding and noncoding regions. The highly variable non-gene regions of G. aurei and T. minor have little sequence similarity with one another except for a 19-bp region of phylogenetically conserved sequence with unknown function. PMID:27589589

  14. The phylogenomic position of the grey nurse shark Carcharias taurus Rafinesque, 1810 (Lamniformes, Odontaspididae) inferred from the mitochondrial genome.

    PubMed

    Bowden, Deborah L; Vargas-Caro, Carolina; Ovenden, Jennifer R; Bennett, Michael B; Bustamante, Carlos

    2016-11-01

    The complete mitochondrial genome of the grey nurse shark Carcharias taurus is described from 25 963 828 sequences obtained using Illumina NGS technology. Total length of the mitogenome is 16 715 bp, consisting of 2 rRNAs, 13 protein-coding regions, 22 tRNAs and 2 non-coding regions, thus updating the previously published mitogenome for this species. The phylogenomic reconstruction inferred from the mitogenome of 15 species of Lamniform and Carcharhiniform sharks supports the inclusion of C. taurus in a clade with the Lamnidae and Cetorhinidae. This complete mitogenome contributes to ongoing investigation into the monophyly of the Family Odontaspididae.

  15. Whole mitochondrial genome sequence for an osteoarthritis model of Guinea pig (Caviidae; Cavia).

    PubMed

    Cui, Xin-Gang; Liu, Cheng-Yao; Wei, Bo; Zhao, Wen-Jian; Zhang, Wen-Feng

    2016-11-01

    Animal models have played an important role in osteoarthritis studies. Here, the complete mitochondrial genome sequence of the Guinea pig is reported for the first time. The total length of the mitogenome was 16,797 bp. It contained the typical structure, including two ribosomal RNA genes, 13 protein-coding genes, 22 transfer RNA genes and one non-coding control region (D-loop region). The overall composition of the mitogenome was estimated to be 34.9% for A, 26.1% for T, 26.0% for C and 13.0% for G, showing an A-T-rich (61.0%) feature. This mitochondrial genome sequence will provide a new genetic resource for osteoarthritis research.

  16. Complete mitochondrial genome of Cynopterus sphinx (Pteropodidae: Cynopterus).

    PubMed

    Li, Linmiao; Li, Min; Wu, Zhengjun; Chen, Jinping

    2015-01-01

    We have characterized the complete mitochondrial genome of Cynopterus sphinx (Pteropodidae: Cynopterus) and described its organization in this study. The total length of the C. sphinx complete mitochondrial genome was 16,895 bp with a base composition of 32.54% A, 14.05% G, 25.82% T and 27.59% C. The complete mitochondrial genome included 13 protein-coding genes, 22 tRNA genes, 2 rRNA genes (12S rRNA and 16S rRNA) and 1 control region (D-loop). The control region was 1435 bp long, with the sequence CATACG repeated 64 times. Three protein-coding genes (ND1, COI and ND4) ended with an incomplete stop codon, TA or T.
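
    A repeat count such as the one reported for the control region can be obtained by looking for the longest uninterrupted tandem run of a motif. The sketch below is a hedged illustration: the function name is hypothetical, the flanking bases are invented, and the abstract does not say whether the 64 copies are contiguous, so contiguity is an assumption here.

```python
import re


def longest_tandem_run(seq, motif):
    """Number of copies in the longest uninterrupted tandem run of motif."""
    motif = motif.upper()
    pattern = re.compile(f"(?:{re.escape(motif)})+")
    best = 0
    for match in pattern.finditer(seq.upper()):
        best = max(best, len(match.group(0)) // len(motif))
    return best


# Invented toy region: 64 contiguous copies of the motif with made-up flanks.
toy_region = "GGA" + "CATACG" * 64 + "TTC"
copies = longest_tandem_run(toy_region, "CATACG")
repeat_span_bp = copies * len("CATACG")
```

    Sixty-four copies of a 6-bp motif span 384 bp, which fits within the 1435 bp reported for the control region.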

  17. Transcriptional mapping of the ribosomal RNA region of mouse L-cell mitochondrial DNA.

    PubMed Central

    Nagley, P; Clayton, D A

    1980-01-01

    The map positions in mouse mitochondrial DNA of the two ribosomal RNA genes and adjacent genes coding several small transcripts have been determined precisely by application of a procedure in which DNA-RNA hybrids have been subjected to digestion by S1 nuclease under conditions of varying severity. Digestion of the DNA-RNA hybrids with S1 nuclease yielded a series of species which were shown to contain ribosomal RNA molecules together with adjacent transcripts hybridized conjointly to a continuous segment of mitochondrial DNA. There is one small transcript about 60 bases long whose gene adjoins the sequences coding the 5'-end of the small ribosomal RNA (950 bases) and which lies approximately 200 nucleotides from the D-loop origin of heavy strand mitochondrial DNA synthesis. An 80-base transcript lies between the small and large ribosomal RNA genes, and genes for two further short transcripts (each about 80 bases in length) abut the sequences coding the 3'-end of the large ribosomal RNA (approximately 1500 bases). The ability to isolate a discrete DNA-RNA hybrid species approximately 2700 base pairs in length containing all these transcripts suggests that there can be few nucleotides in this region of mouse mitochondrial DNA which are not represented as stable RNA species. PMID:6253898

  18. Large-scale identification and characterization of alternative splicing variants of human gene transcripts using 56 419 completely sequenced and manually annotated full-length cDNAs

    PubMed Central

    Takeda, Jun-ichi; Suzuki, Yutaka; Nakao, Mitsuteru; Barrero, Roberto A.; Koyanagi, Kanako O.; Jin, Lihua; Motono, Chie; Hata, Hiroko; Isogai, Takao; Nagai, Keiichi; Otsuki, Tetsuji; Kuryshev, Vladimir; Shionyu, Masafumi; Yura, Kei; Go, Mitiko; Thierry-Mieg, Jean; Thierry-Mieg, Danielle; Wiemann, Stefan; Nomura, Nobuo; Sugano, Sumio; Gojobori, Takashi; Imanishi, Tadashi

    2006-01-01

    We report the first genome-wide identification and characterization of alternative splicing in human gene transcripts based on analysis of the full-length cDNAs. Applying both manual and computational analyses for 56 419 completely sequenced and precisely annotated full-length cDNAs selected for the H-Invitational human transcriptome annotation meetings, we identified 6877 alternative splicing genes with 18 297 different alternative splicing variants. A total of 37 670 exons were involved in these alternative splicing events. The encoded protein sequences were affected in 6005 of the 6877 genes. Notably, alternative splicing affected protein motifs in 3015 genes, subcellular localizations in 2982 genes and transmembrane domains in 1348 genes. We also identified interesting patterns of alternative splicing, in which two distinct genes seemed to be bridged, nested or having overlapping protein coding sequences (CDSs) of different reading frames (multiple CDS). In these cases, completely unrelated proteins are encoded by a single locus. Genome-wide annotations of alternative splicing, relying on full-length cDNAs, should lay firm groundwork for exploring in detail the diversification of protein function, which is mediated by the fast expanding universe of alternative splicing variants. PMID:16914452

  19. Near Full-Length Identification of a Novel HIV-1 CRF01_AE/B/C Recombinant in Northern Myanmar.

    PubMed

    Zhou, Yan-Heng; Chen, Xin; Liang, Yue-Bo; Pang, Wei; Qin, Wei-Hong; Zhang, Chiyu; Zheng, Yong-Tang

    2015-08-01

    The Myanmar-China border appears to be the "hot spot" region for the occurrence of HIV-1 recombination. The majority of the previous analyses of HIV-1 recombination were based on partial genomic sequences, which obviously cannot reflect the reality of the genetic diversity of HIV-1 in this area well. Here, we present a near full-length characterization of a novel HIV-1 CRF01_AE/B/C recombinant isolated from a long-distance truck driver in Northern Myanmar. It is the first description of a near full-length genomic sequence in Myanmar since 2003, and might be one of the most complicated HIV-1 chimeras ever detected in Myanmar, containing four CRF01_AE, six B segments, and five C segments separated by 14 breakpoints throughout its genome. The discovery and characterization of this new CRF01_AE/B/C recombinant indicate that intersubtype recombination is ongoing in Myanmar, continuously generating new forms of HIV-1. More work based on near full-length sequence analyses is urgently needed to better understand the genetic diversity of HIV-1 in these regions.

  20. Identification of full-length dentin matrix protein 1 in dentin and bone.

    PubMed

    Huang, Bingzhen; Maciejewska, Izabela; Sun, Yao; Peng, Tao; Qin, Disheng; Lu, Yongbo; Bonewald, Lynda; Butler, William T; Feng, Jian; Qin, Chunlin

    2008-05-01

    Dentin matrix protein 1 (DMP1) has been identified in the extracellular matrix (ECM) of dentin and bone as the processed NH(2)-terminal and COOH-terminal fragments. However, the full-length form of DMP1 has not been identified in these tissues. The focus of this investigation was to search for the intact full-length DMP1 in dentin and bone. We used two types of anti-DMP1 antibodies to identify DMP1: one type specifically recognizes the NH(2)-terminal region and the other type is only reactive to the COOH-terminal region of the DMP1 amino acid sequence. An approximately 105-kDa protein, extracted from the ECM of rat dentin and bone, was recognized by both types of antibodies, and the migration rate of this protein was identical to that of the recombinant mouse full-length DMP1 made in eukaryotic cells. We concluded that this approximately 105-kDa protein is the full-length form of DMP1, which is considerably less abundant than its processed fragments in the ECM of dentin and bone. We also detected the full-length form of DMP1 and its processed fragments in the extract of dental pulp/odontoblast complex dissected from rat teeth. In addition, immunofluorescence analysis showed that in MC3T3-E1 cells the NH(2)-terminal and COOH-terminal fragments of DMP1 are distributed differently. Our findings indicate that the majority of DMP1 must be cleaved within the cells that synthesize it and that minor amounts of uncleaved DMP1 molecules are secreted into the ECM of dentin and bone.

  1. Using RNA-Seq to assemble a rose transcriptome with more than 13,000 full-length expressed genes and to develop the WagRhSNP 68k Axiom SNP array for rose (Rosa L.).

    PubMed

    Koning-Boucoiran, Carole F S; Esselink, G Danny; Vukosavljev, Mirjana; van 't Westende, Wendy P C; Gitonga, Virginia W; Krens, Frans A; Voorrips, Roeland E; van de Weg, W Eric; Schulz, Dietmar; Debener, Thomas; Maliepaard, Chris; Arens, Paul; Smulders, Marinus J M

    2015-01-01

    In order to develop a versatile and large SNP array for rose, we set out to mine ESTs from diverse sets of rose germplasm. For this, RNA-Seq libraries containing about 700 million reads were generated from tetraploid cut and garden roses using Illumina paired-end sequencing, and from diploid Rosa multiflora using 454 sequencing. Separate de novo assemblies were performed in order to identify single nucleotide polymorphisms (SNPs) within and between rose varieties. SNPs among tetraploid roses were selected for constructing a genotyping array that can be employed for genetic mapping and marker-trait association discovery in breeding programs based on tetraploid germplasm, both from cut roses and from garden roses. In total, 68,893 SNPs were included on the WagRhSNP Axiom array. Next, an orthology-guided assembly was performed for the construction of a non-redundant rose transcriptome database. A total of 21,740 transcripts had significant hits with orthologous genes in the strawberry (Fragaria vesca L.) genome. Of these, 13,390 appeared to contain the full-length coding regions. This newly established transcriptome resource adds considerably to the currently available sequence resources for the Rosaceae family in general and the genus Rosa in particular.

  2. [Analysis of the molecular characteristics and cloning of full-length coding sequence of interleukin-2 in tree shrews].

    PubMed

    Huang, Xiao-Yan; Li, Ming-Li; Xu, Juan; Gao, Yue-Dong; Wang, Wen-Guang; Yin, An-Guo; Li, Xiao-Fei; Sun, Xiao-Mei; Xia, Xue-Shan; Dai, Jie-Jie

    2013-04-01

    Although the tree shrew (Tupaia belangeri chinensis) is an excellent animal model for studying the mechanisms of human diseases, few studies have examined interleukin-2 (IL-2), an important immune factor in disease model evaluation. In this study, the 465 bp full-length IL-2 cDNA coding sequence was cloned from the RNA of tree shrew spleen lymphocytes that were cultivated and stimulated with ConA (concanavalin A). Clustal W 2.0 was used to compare and analyze the sequence and molecular characteristics, and to establish the similarity of the overall structure of IL-2 between tree shrews and other mammals. The homology of the IL-2 nucleotide sequence between tree shrews and humans was 93%, and the amino acid homology was 80%. The phylogenetic tree results, derived through the Neighbour-Joining method using MEGA5.0, indicated a close genetic relationship between tree shrews, Homo sapiens, and Macaca mulatta. The three-dimensional structure analysis showed that the surface charges in most regions of IL-2 were similar between tree shrews and humans; however, the N-glycosylation sites and local structures were different, which may affect antibody binding. These results provide a fundamental basis for the future study of IL-2 monoclonal antibody in tree shrews, thereby improving their utility as a model.

  3. Petunia Floral Defensins with Unique Prodomains as Novel Candidates for Development of Fusarium Wilt Resistance in Transgenic Banana Plants

    PubMed Central

    Ghag, Siddhesh B.; Shekhawat, Upendra K. Singh; Ganapathi, Thumballi R.

    2012-01-01

    Antimicrobial peptides are a potent group of defense active molecules that have been utilized in developing resistance against a multitude of plant pathogens. Floral defensins constitute a group of cysteine-rich peptides showing potent growth inhibition of pathogenic filamentous fungi, especially Fusarium oxysporum, in vitro. Full-length genes coding for two Petunia floral defensins, PhDef1 and PhDef2, having unique C-terminal 31 and 27 amino acid long predicted prodomains, were overexpressed in transgenic banana plants using embryogenic cells as explants for Agrobacterium-mediated genetic transformation. High-level constitutive expression of these defensins in elite banana cv. Rasthali led to significant resistance against infection of Fusarium oxysporum f. sp. cubense as shown by in vitro and ex vivo bioassay studies. Transgenic banana lines expressing either of the two defensins were clearly less chlorotic and had significantly less infestation and discoloration in the vital corm region of the plant as compared to untransformed controls. Transgenic banana plants expressing high levels of full-length PhDef1 and PhDef2 were phenotypically normal and no stunting was observed. In conclusion, our results suggest that high-level constitutive expression of floral defensins having distinctive prodomains is an efficient strategy for development of fungal resistance in economically important fruit crops like banana. PMID:22745785

  4. Importance of Viral Sequence Length and Number of Variable and Informative Sites in Analysis of HIV Clustering.

    PubMed

    Novitsky, Vlad; Moyo, Sikhulile; Lei, Quanhong; DeGruttola, Victor; Essex, M

    2015-05-01

    To improve the methodology of HIV cluster analysis, we addressed how analysis of HIV clustering is associated with parameters that can affect the outcome of viral clustering. The extent of HIV clustering and tree certainty was compared between 401 HIV-1C near full-length genome sequences and subgenomic regions retrieved from the LANL HIV Database. Sliding window analysis was based on 99 windows of 1,000 bp and 45 windows of 2,000 bp. Potential associations between the extent of HIV clustering and sequence length and the number of variable and informative sites were evaluated. The near full-length genome HIV sequences showed the highest extent of HIV clustering and the highest tree certainty. At the bootstrap threshold of 0.80 in maximum likelihood (ML) analysis, 58.9% of near full-length HIV-1C sequences but only 15.5% of partial pol sequences (ViroSeq) were found in clusters. Among HIV-1 structural genes, pol showed the highest extent of clustering (38.9% at a bootstrap threshold of 0.80), although it was significantly lower than in the near full-length genome sequences. The extent of HIV clustering was significantly higher for sliding windows of 2,000 bp than 1,000 bp. We found a strong association between the sequence length and proportion of HIV sequences in clusters, and a moderate association between the number of variable and informative sites and the proportion of HIV sequences in clusters. In HIV cluster analysis, the extent of detectable HIV clustering is directly associated with the length of viral sequences used, as well as the number of variable and informative sites. Near full-length genome sequences could provide the most informative HIV cluster analysis. Selected subgenomic regions with a high extent of HIV clustering and high tree certainty could also be considered as a second choice.
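
    The sliding window analysis described in this abstract can be sketched as a function that enumerates fixed-width windows along an alignment. The abstract gives the window sizes (1,000 and 2,000 bp) and the window counts but not the step size or the alignment length, so both values in the example are placeholders, and the function name is hypothetical. The example does not attempt to reproduce the reported counts of 99 and 45 windows.

```python
def sliding_windows(total_length, window, step):
    """Return half-open (start, end) coordinates of full-width windows."""
    if window <= 0 or step <= 0:
        raise ValueError("window and step must be positive")
    last_start = total_length - window
    return [(start, start + window) for start in range(0, last_start + 1, step)]


# Placeholder values: a 9,000-column alignment scanned in 100-bp steps.
ALIGNMENT_LENGTH = 9000   # assumption, not from the abstract
STEP = 100                # assumption, not from the abstract

windows_1kb = sliding_windows(ALIGNMENT_LENGTH, 1000, STEP)
windows_2kb = sliding_windows(ALIGNMENT_LENGTH, 2000, STEP)
```

    Each window would then be cut out of the alignment and analysed separately, for example by building a tree per window and recording the proportion of sequences found in supported clusters.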

  5. Importance of Viral Sequence Length and Number of Variable and Informative Sites in Analysis of HIV Clustering

    PubMed Central

    Novitsky, Vlad; Moyo, Sikhulile; Lei, Quanhong; DeGruttola, Victor

    2015-01-01

    To improve the methodology of HIV cluster analysis, we addressed how analysis of HIV clustering is associated with parameters that can affect the outcome of viral clustering. The extent of HIV clustering and tree certainty was compared between 401 HIV-1C near full-length genome sequences and subgenomic regions retrieved from the LANL HIV Database. Sliding window analysis was based on 99 windows of 1,000 bp and 45 windows of 2,000 bp. Potential associations between the extent of HIV clustering and sequence length and the number of variable and informative sites were evaluated. The near full-length genome HIV sequences showed the highest extent of HIV clustering and the highest tree certainty. At the bootstrap threshold of 0.80 in maximum likelihood (ML) analysis, 58.9% of near full-length HIV-1C sequences but only 15.5% of partial pol sequences (ViroSeq) were found in clusters. Among HIV-1 structural genes, pol showed the highest extent of clustering (38.9% at a bootstrap threshold of 0.80), although it was significantly lower than in the near full-length genome sequences. The extent of HIV clustering was significantly higher for sliding windows of 2,000 bp than 1,000 bp. We found a strong association between the sequence length and proportion of HIV sequences in clusters, and a moderate association between the number of variable and informative sites and the proportion of HIV sequences in clusters. In HIV cluster analysis, the extent of detectable HIV clustering is directly associated with the length of viral sequences used, as well as the number of variable and informative sites. Near full-length genome sequences could provide the most informative HIV cluster analysis. Selected subgenomic regions with a high extent of HIV clustering and high tree certainty could also be considered as a second choice. PMID:25560745

  6. Computational Study of Primary Electrons in the Cusp Region of an Ion Engine's Discharge Chamber

    NASA Technical Reports Server (NTRS)

    Stueber, Thomas J. (Technical Monitor); Deshpande, Shirin S.; Mahalingam, Sudhakar; Menart, James A.

    2004-01-01

    In this work a computer code called PRIMA is used to study the motion of primary electrons in the magnetic cusp region of the discharge chamber of an ion engine. Even though the amount of wall area covered by the cusps is very small, the cusp regions are important because prior computational analyses have indicated that most primary electrons leave the discharge chamber through the cusps. The analysis presented here focuses on the cusp region only. The effects of the shape and size of the cusp region on primary electron travel are studied, as well as the angle and location at which the electron enters the cusp region. These effects are quantified using the confinement length and the number density distributions of the primary electrons. In addition to these results, comparisons of the results from PRIMA are made to experimental results for a cylindrical discharge chamber with two magnetic rings. These comparisons indicate the validity of the computer code called PRIMA.

  7. GBS: Global 3D simulation of tokamak edge region

    NASA Astrophysics Data System (ADS)

    Zhu, Ben; Fisher, Dustin; Rogers, Barrett; Ricci, Paolo

    2012-10-01

    A 3D two-fluid global code, namely Global Braginskii Solver (GBS), is being developed to explore the physics of turbulent transport, confinement, self-consistent profile formation, pedestal scaling and related phenomena in the edge region of tokamaks. Aimed at solving drift-reduced Braginskii equations [1] in complex magnetic geometry, the GBS is used for turbulence simulation in the SOL region. In the recent upgrade, the simulation domain is expanded into the closed flux region with twist-shift boundary conditions. Hence, the new GBS code is able to explore global transport physics in an annular full-torus domain from the top of the pedestal into the far SOL. We are in the process of identifying and analyzing the linear and nonlinear instabilities in the system using the new GBS code. Preliminary results will be presented and compared with other codes if possible. [1] A. Zeiler, J. F. Drake and B. Rogers, Phys. Plasmas 4, 2134 (1997)

  8. A Dual-Route Perspective on Brain Activation in Response to Visual Words: Evidence for a Length by Lexicality Interaction in the Visual Word Form Area (VWFA)

    PubMed Central

    Schurz, Matthias; Sturm, Denise; Richlan, Fabio; Kronbichler, Martin; Ladurner, Gunther; Wimmer, Heinz

    2010-01-01

    Based on our previous work, we expected the Visual Word Form Area (VWFA) in the left ventral visual pathway to be engaged by both whole-word recognition and by serial sublexical coding of letter strings. To examine this double function, a phonological lexical decision task (i.e., “Does xxx sound like an existing word?”) presented short and long letter strings of words, pseudohomophones, and pseudowords (e.g., Taxi, Taksi and Tazi). Main findings were that the length effect for words was limited to occipital regions and absent in the VWFA. In contrast, a marked length effect for pseudowords was found throughout the ventral visual pathway including the VWFA, as well as in regions presumably engaged by visual attention and silent-articulatory processes. The length by lexicality interaction on brain activation corresponds to well-established behavioral findings of a length by lexicality interaction on naming latencies and speaks for the engagement of the VWFA by both lexical and sublexical processes. PMID:19896538

  9. A dual-route perspective on brain activation in response to visual words: evidence for a length by lexicality interaction in the visual word form area (VWFA).

    PubMed

    Schurz, Matthias; Sturm, Denise; Richlan, Fabio; Kronbichler, Martin; Ladurner, Gunther; Wimmer, Heinz

    2010-02-01

    Based on our previous work, we expected the Visual Word Form Area (VWFA) in the left ventral visual pathway to be engaged by both whole-word recognition and by serial sublexical coding of letter strings. To examine this double function, a phonological lexical decision task (i.e., "Does xxx sound like an existing word?") presented short and long letter strings of words, pseudohomophones, and pseudowords (e.g., Taxi, Taksi and Tazi). Main findings were that the length effect for words was limited to occipital regions and absent in the VWFA. In contrast, a marked length effect for pseudowords was found throughout the ventral visual pathway including the VWFA, as well as in regions presumably engaged by visual attention and silent-articulatory processes. The length by lexicality interaction on brain activation corresponds to well-established behavioral findings of a length by lexicality interaction on naming latencies and speaks for the engagement of the VWFA by both lexical and sublexical processes. Copyright (c) 2009 Elsevier Inc. All rights reserved.

  10. Coded Cooperation for Multiway Relaying in Wireless Sensor Networks

    PubMed Central

    Si, Zhongwei; Ma, Junyang; Thobaben, Ragnar

    2015-01-01

    Wireless sensor networks have been considered as an enabling technology for constructing smart cities. One important feature of wireless sensor networks is that the sensor nodes collaborate in some manner for communications. In this manuscript, we focus on the model of multiway relaying with full data exchange where each user wants to transmit and receive data to and from all other users in the network. We derive the capacity region for this specific model and propose a coding strategy through coset encoding. To obtain good performance with practical codes, we choose spatially-coupled LDPC (SC-LDPC) codes for the coded cooperation. In particular, for the message broadcasting from the relay, we construct multi-edge-type (MET) SC-LDPC codes by repeatedly applying coset encoding. Due to the capacity-achieving property of the SC-LDPC codes, we prove that the capacity region can theoretically be achieved by the proposed MET SC-LDPC codes. Numerical results with finite node degrees are provided, which show that the achievable rates approach the boundary of the capacity region in both binary erasure channels and additive white Gaussian channels. PMID:26131675

  11. Coded Cooperation for Multiway Relaying in Wireless Sensor Networks.

    PubMed

    Si, Zhongwei; Ma, Junyang; Thobaben, Ragnar

    2015-06-29

    Wireless sensor networks have been considered as an enabling technology for constructing smart cities. One important feature of wireless sensor networks is that the sensor nodes collaborate in some manner for communications. In this manuscript, we focus on the model of multiway relaying with full data exchange where each user wants to transmit and receive data to and from all other users in the network. We derive the capacity region for this specific model and propose a coding strategy through coset encoding. To obtain good performance with practical codes, we choose spatially-coupled LDPC (SC-LDPC) codes for the coded cooperation. In particular, for the message broadcasting from the relay, we construct multi-edge-type (MET) SC-LDPC codes by repeatedly applying coset encoding. Due to the capacity-achieving property of the SC-LDPC codes, we prove that the capacity region can theoretically be achieved by the proposed MET SC-LDPC codes. Numerical results with finite node degrees are provided, which show that the achievable rates approach the boundary of the capacity region in both binary erasure channels and additive white Gaussian channels.

  12. Isolation and functional characterization of Lycopene β-cyclase (CYC-B) promoter from Solanum habrochaites

    PubMed Central

    2010-01-01

    Background: Carotenoids are a group of C40 isoprenoid molecules that play diverse biological and ecological roles in plants. Tomato is an important vegetable in human diet and provides the vitamin A precursor β-carotene. Genes encoding enzymes involved in carotenoid biosynthetic pathway have been cloned. However, regulation of genes involved in carotenoid biosynthetic pathway and accumulation of specific carotenoid in chromoplasts are not well understood. One of the approaches to understand regulation of carotenoid metabolism is to characterize the promoters of genes encoding proteins involved in carotenoid metabolism. Lycopene β-cyclase is one of the crucial enzymes in carotenoid biosynthesis pathway in plants. Its activity is required for synthesis of both α- and β-carotenes that are further converted into other carotenoids such as lutein, zeaxanthin, etc. This study describes the isolation and characterization of chromoplast-specific Lycopene β-cyclase (CYC-B) promoter from a green fruited S. habrochaites genotype EC520061. Results: A 908 bp region upstream to the initiation codon of the Lycopene β-cyclase gene was cloned and identified as full-length promoter. To identify promoter region necessary for regulating developmental expression of the ShCYC-B gene, the full-length promoter and its three different 5' truncated fragments were cloned upstream to the initiation codon of GUS reporter cDNA in binary vectors. These four plant transformation vectors were separately transformed into Agrobacterium. Agrobacterium-mediated transient and stable expression systems were used to study the GUS expression driven by the full-length promoter and its 5' deletion fragments in tomato. The full-length promoter showed a basal level activity in leaves, and its expression was upregulated > 5-fold in flowers and fruits in transgenic tomato plants. Deletion of -908 to -577 bp 5' to ATG decreased the ShCYC-B promoter strength, while deletion of -908 to -437 bp 5' to ATG led to significant increase in the activity of GUS in the transgenic plants. Promoter deletion analysis led to the identification of a short promoter region (-436 bp to ATG) that exhibited a higher promoter strength but similar developmental expression pattern as compared with the full-length ShCYC-B promoter. Conclusion: Functional characterization of the full-length ShCYC-B promoter and its deletion fragments in transient expression system in fruto as well as in stable transgenic tomato revealed that the promoter is developmentally regulated and its expression is upregulated in chromoplast-rich flowers and fruits. Our study identified a short promoter region with functional activity and developmental expression pattern similar to that of the full-length ShCYC-B promoter. This 436 bp promoter region can be used in promoter::reporter fusion molecular genetic screens to identify mutants impaired in CYC-B expression, and thus can be a valuable tool in understanding carotenoid metabolism in tomato. Moreover, this short promoter region of ShCYC-B may be useful in genetic engineering of carotenoid content and other agronomic traits in tomato fruits. PMID:20380705

  13. [Genetic Characteristics of Type 2 Vaccine-derived Poliovirus in Shanxi Province (China) in 2014].

    PubMed

    Yan, Dongrei; Li, Xiaolei; Zhang, Yong; Yang, Jianfang; Zhu, Shuangli; Wang, Dongyan; Zhang, Chuangye; Zhu, Hui; Xu, Wenbo

    2015-03-01

    The World Health Organization redefined the type 2 vaccine-derived poliovirus (VDPV) in 2010. To study the genetic characteristics and evolution of type 2 VDPV under this new definition, we conducted genome sequencing and analyses of type 2 VDPVs isolated from one patient with acute flaccid paralysis in Shanxi province (China) in 2014. Nucleotide sequencing revealed that the full-length genome of the type 2 VDPV is 7439 bases encoding 2207 amino acids, with no insertion or deletion of nucleotides compared with Sabin 2. One nucleotide substitution identified as a key determinant of the attenuated phenotype of the Sabin 2 strain (A-G reversion at nt 481 in the 5'-end of the untranslated region) had reverted in the Shanxi type 2 VDPV. The other known key determinant of the attenuated phenotype of the Sabin 2 strain (U-to-C reversion at nt 2909 in the VP1 coding region that caused an Ile143Thr substitution in VP1) had not reverted in the Shanxi VDPV. The Shanxi type 2 VDPV was an S2/S1 recombinant, the crossover site of which mapped to the 3'-end of the 3D region (between nt 6247 and nt 6281). A phylogenetic tree based on the VP1 coding region showed that evolution of the Shanxi type 2 VDPV was independent of other type 2 VDPVs detected worldwide. We estimated that the strain circulated for approximately 11 months in the population according to the known evolution rate. The present study confirmed that the Chinese Polio Laboratory Network could discover the VDPV promptly and that it played an important part in maintenance of a polio-free China.

  14. Rift Valley fever virus structural and non-structural proteins: Recombinant protein expression and immunoreactivity against antisera from sheep

    USDA-ARS's Scientific Manuscript database

    The Rift Valley fever virus (RVFV) encodes structural proteins, nucleoprotein (N), N-terminus glycoprotein (Gn), C-terminus glycoprotein (Gc) and L protein, 78-kDa and non-structural proteins NSm and NSs. Using the baculovirus system we expressed the full-length coding sequence of N, NSs, NSm, Gc an...

  15. Characterization of developmental and stress mediated expression of cinnamoyl-CoA reductase (CCR) in kenaf (Hibiscus cannabinus L.)

    USDA-ARS's Scientific Manuscript database

    Cinnamoyl-CoA reductase (CCR) is an important enzyme for lignin biosynthesis as it catalyzes the first specific committed step in monolignol biosynthesis. We have cloned a full length coding sequence of CCR from kenaf (Hibiscus cannabinus L.), which contains a 1,020-bp open reading frame (ORF), enco...

  16. Complete Mitochondrial Genome Sequence of Aethina tumida (Coleoptera: Nitidulidae), a Beekeeping Pest.

    PubMed

    Duquesne, Véronique; Delcont, Aurélie; Huleux, Anthéa; Beven, Véronique; Touzain, Fabrice; Ribière-Chabert, Magali

    2017-11-02

    We report here the full mitochondrial genome sequence of Aethina tumida, a Nitidulidae species beetle, that is a pest of bee hives. The obtained sequence is 16,576 bp in length and contains 13 protein-coding genes, 2 rRNA genes, and 22 tRNAs. Copyright © 2017 Duquesne et al.

  17. LH-independent testosterone secretion is mediated by the interaction between GNRH2 and its receptor within porcine testes

    USDA-ARS's Scientific Manuscript database

    Unlike the classical gonadotropin-releasing hormone (GNRH1), the second mammalian isoform (GNRH2) is an ineffective stimulant of gonadotropin release. Species that produce GNRH2 may not maintain a functional GNRH2 receptor (GNRHR2) due to coding errors. A full length GNRHR2 gene has been identified ...

  18. The complete mitochondrial genome of Octopus conispadiceus (Sasaki, 1917) (Cephalopoda: Octopodidae).

    PubMed

    Ma, Yuanyuan; Zheng, Xiaodong; Cheng, Rubin; Li, Qi

    2016-01-01

    In this paper, we determined the complete mitochondrial genome of Octopus conispadiceus (Cephalopoda: Octopodidae). The whole mitogenome of O. conispadiceus is 16,027 basepairs (bp) in length with a base composition of 41.4% A, 34.8% T, 16.1% C, 7.7% G and contains 13 protein-coding genes, 2 ribosomal RNA genes, 22 transfer RNA genes, and a major non-coding region (MNR). The gene arrangements of O. conispadiceus showed remarkable similarity to that of O. vulgaris, Amphioctopus fangsiao, Cistopus chinensis and C. taiwanicus.

  19. The nearly complete mitochondrial genome of a stonefly species, Styloperla sp. (Plecoptera: Styloperlidae).

    PubMed

    Chen, Zhi-Teng; Wu, Hai-Yan; Du, Yu-Zhou

    2016-07-01

    We report the nearly complete mitochondrial genome of a stonefly species, Styloperla sp. (Plecoptera: Styloperlidae), which is a circular molecule of 15,416 bp in length and consists of 13 protein-coding genes, 2 ribosomal RNAs, 20 transfer RNAs and a partial control region (645 bp). Using the 13 protein-coding genes of 8 stoneflies and 3 other related species, we constructed a phylogenetic tree to verify the accuracy of the new determined mitogenome sequences. Our results provide basic data for further study of phylogeny in Plecoptera.

  20. The complete validated mitochondrial genome of the silver gemfish Rexea solandri (Cuvier, 1832) (Perciformes, Gempylidae).

    PubMed

    Bustamante, Carlos; Ovenden, Jennifer R

    2016-01-01

    The silver gemfish Rexea solandri is an important economic resource but Vulnerable to overfishing in Australian waters. The complete mitochondrial genome sequence is described from 1.6 million reads obtained via next generation sequencing. The total length of the mitogenome is 16,350 bp comprising 2 rRNA, 13 protein-coding genes, 22 tRNA and 2 non-coding regions. The mitogenome sequence was validated against sequences of PCR fragments and BLAST queries of Genbank. Gene order was equivalent to that found in marine fishes.

  1. The phylogenetic position of the roughskin skate Dipturus trachyderma (Krefft & Stehmann, 1975) (Rajiformes, Rajidae) inferred from the mitochondrial genome.

    PubMed

    Vargas-Caro, Carolina; Bustamante, Carlos; Lamilla, Julio; Bennett, Michael B; Ovenden, Jennifer R

    2016-07-01

    The complete mitochondrial genome of the roughskin skate Dipturus trachyderma is described from 1 455 724 sequences obtained using Illumina NGS technology. Total length of the mitogenome was 16 909 base pairs, comprising 2 rRNAs, 13 protein-coding genes, 22 tRNAs and 2 non-coding regions. Phylogenetic analysis based on mtDNA revealed low genetic divergence among longnose skates, in particular, those dwelling the continental shelf and slope off the coasts of Chile and Argentina.

  2. The complete mitochondrial genome of Gryllotalpa unispina Saussure, 1874 (Orthoptera: Gryllotalpoidea: Gryllotalpidae).

    PubMed

    Zhang, Yulong; Shao, Dandan; Cai, Miao; Yin, Hong; Zhang, Daochuan

    2016-01-01

    The complete mitochondrial genome of Gryllotalpa unispina was 15,513 bp in length and contained 70.9% AT. All G. unispina protein-coding sequences except for the nad2 started with a typical ATN codon. The usual termination codons (TAA) and incomplete stop codons (T) were found from 13 protein-coding genes. All tRNA genes were folded into the typical cloverleaf secondary structure, except trnS(AGN) lacking the dihydrouridine arm. The sizes of the large and small ribosomal RNA genes were 1245 and 725 bp, respectively. The A + T-rich region was 917 bp in length with 76.8%. The orientation and gene order of the G. unispina mitogenome were identical to those of G. orientalis and G. pluvialis, and there was no phenomenon of "DK rearrangement", which has been widely reported in Caelifera.

  3. Dasheng: a recently amplified nonautonomous long terminal repeat element that is a major component of pericentromeric regions in rice.

    PubMed

    Jiang, Ning; Bao, Zhirong; Temnykh, Svetlana; Cheng, Zhukuan; Jiang, Jiming; Wing, Rod A; McCouch, Susan R; Wessler, Susan R

    2002-07-01

    A new and unusual family of LTR elements, Dasheng, has been discovered in the genome of Oryza sativa following database searches of approximately 100 Mb of rice genomic sequence and 78 Mb of BAC-end sequence information. With all of the cis-elements but none of the coding domains normally associated with retrotransposons (e.g., gag, pol), Dasheng is a novel nonautonomous LTR element with high copy number. Over half of the approximately 1000 Dasheng elements in the rice genome are full length (5.6-8.6 kb), and 60% are estimated to have amplified in the past 500,000 years. Using a modified AFLP technique called transposon display, 215 elements were mapped to all 12 rice chromosomes. Interestingly, more than half of the mapped elements are clustered in the heterochromatic regions around centromeres. The distribution pattern was further confirmed by FISH analysis. Despite clustering in heterochromatin, Dasheng elements are not nested, suggesting their potential value as molecular markers for these marker-poor regions. Taken together, Dasheng is one of the highest-copy-number LTR elements and one of the most recent elements to amplify in the rice genome.

  4. The complete mitochondrial genome of the Feral Rock Pigeon (Columba livia breed feral).

    PubMed

    Li, Chun-Hong; Liu, Fang; Wang, Li

    2014-10-01

    In the present work, we report the complete mitochondrial genome sequence of feral rock pigeon for the first time. The total length of the mitogenome was 17,239 bp with the base composition of 30.3% for A, 24.0% for T, 31.9% for C, and 13.8% for G, and an A-T-rich feature (54.3%) was detected. It harbored 13 protein-coding genes, 2 ribosomal RNA genes, 22 transfer RNA genes and 1 non-coding control region (D-loop region). The arrangement of all genes was identical to the typical mitochondrial genomes of pigeon. The complete mitochondrial genome sequence of feral rock pigeon would serve as an important data set of the germplasm resources for further study.
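
    The A-T figure quoted in this record follows directly from the stated base composition. A minimal sketch of that arithmetic is given here; the function names and the `pigeon` dictionary are illustrative assumptions, not part of the cited work, and only the four percentages come from the abstract.

```python
def at_content(composition):
    """Return the A+T percentage from a dict of per-base percentages."""
    return composition["A"] + composition["T"]


def composition_total(composition):
    """Return the sum of all base percentages (should be close to 100)."""
    return sum(composition.values())


# Percentages as reported in the abstract above (hypothetical variable name).
pigeon = {"A": 30.3, "T": 24.0, "C": 31.9, "G": 13.8}

# Round to one decimal place to match the precision of the reported values.
print(round(at_content(pigeon), 1))
print(round(composition_total(pigeon), 1))
```

    The same check can be applied to the other mitogenome records in this listing to confirm that a reported A + T content agrees with the individual base percentages.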

  5. The complete mitochondrial genome sequence of the Datong yak (Bos grunniens).

    PubMed

    Wu, Xiaoyun; Chu, Min; Liang, Chunnian; Ding, Xuezhi; Guo, Xian; Bao, Pengjia; Yan, Ping

    2016-01-01

    Datong yak is a famous artificially cultivated breed in China. In the present work, we report the complete mitochondrial genome sequence of Datong yak for the first time. The total length of the mitogenome is 16,323 bp, containing 13 protein-coding genes, 22 tRNA genes, two rRNA genes and one non-coding region (D-loop region). The gene order of Datong yak mitogenome is identical to that observed in most other vertebrates. The overall base composition is 33.71% A, 25.80% C, 13.21% G and 27.27% T, with an A + T content of 60.98%. The complete mitogenome sequence information of Datong yak can provide useful data for further studies on molecular breeding and taxonomic status.

  6. Characterization of the complete mitochondrial genome sequence of Gannan yak (Bos grunniens).

    PubMed

    Wu, Xiaoyun; Ding, Xuezhi; Chu, Min; Guo, Xian; Bao, Pengjia; Liang, Chunnian; Yan, Ping

    2016-01-01

    Gannan yak is the native breed of Gansu province in China. In this work, the complete mitochondrial genome sequence of Gannan yak was determined for the first time. The total length of the mitogenome is 16,322 bp, with the base composition of 33.74% A, 25.84% T, 13.18% C, and 27.24% G. It contained 13 protein-coding genes, 22 tRNA genes, two rRNA genes and one non-coding region (D-loop region). The gene order of Gannan yak mitogenome is identical to that observed in most other vertebrates. The complete mitogenome sequence information of Gannan yak can provide useful data for further studies on protection of genetic resources and phylogenetic relationships within Bos grunniens.

  7. Next generation sequencing yields the complete mitochondrial genome of the Endangered Chilean silverside Basilichthys microlepidotus (Jenyns, 1841) (Teleostei, Atherinopsidae), validated with RNA-seq.

    PubMed

    Véliz, David; Vega-Retter, Caren; Quezada-Romegialli, Claudio

    2016-01-01

    The complete sequence of the mitochondrial genome for the Chilean silverside Basilichthys microlepidotus is reported for the first time. The entire mitochondrial genome was 16,544 bp in length (GenBank accession no. KM245937); gene composition and arrangement conformed to that reported for most fishes, with the typical structure of 2 rRNAs, 13 protein-coding genes, 22 tRNAs and a non-coding region. The assembled mitogenome was validated against sequences of COI and Control Region previously sequenced in our lab, functional genes from RNA-Seq data for the same species and the mitogenome of two other atherinopsid species available in Genbank.

  8. Numerical study of nonlinear full wave acoustic propagation

    NASA Astrophysics Data System (ADS)

    Velasco-Segura, Roberto; Rendon, Pablo L.

    2013-11-01

    With the aim of describing nonlinear acoustic phenomena, a form of the conservation equations for fluid dynamics is presented, deduced using slightly less restrictive hypotheses than those necessary to obtain the well known Westervelt equation. This formulation accounts for full wave diffraction, nonlinearity, and thermoviscous dissipative effects. A CLAWPACK-based, 2D finite-volume method using Roe's linearization has been implemented to obtain numerically the solution of the proposed equations. In order to validate the code, two different tests have been performed: one against a special Taylor shock-like analytic solution, the other against published results on a HIFU system, both with satisfactory results. The code is written for parallel execution on a GPU and improves performance by a factor of over 50 when compared to the standard CLAWPACK Fortran code. This code can be used to describe moderate nonlinear phenomena, at low Mach numbers, in domains as large as 100 wavelengths. Applications range from modest models of diagnostic and therapeutic HIFU, parametric acoustic arrays, to acoustic wave guides. A couple of examples will be presented showing shock formation and oblique interaction. DGAPA PAPIIT IN110411, PAEP UNAM 2013.

  9. Neural correlates of word production stages delineated by parametric modulation of psycholinguistic variables.

    PubMed

    Wilson, Stephen M; Isenberg, Anna Lisette; Hickok, Gregory

    2009-11-01

    Word production is a complex multistage process linking conceptual representations, lexical entries, phonological forms and articulation. Previous studies have revealed a network of predominantly left-lateralized brain regions supporting this process, but many details regarding the precise functions of different nodes in this network remain unclear. To better delineate the functions of regions involved in word production, we used event-related functional magnetic resonance imaging (fMRI) to identify brain areas where blood oxygen level-dependent (BOLD) responses to overt picture naming were modulated by three psycholinguistic variables: concept familiarity, word frequency, and word length, and one behavioral variable: reaction time. Each of these variables has been suggested by prior studies to be associated with different aspects of word production. Processing of less familiar concepts was associated with greater BOLD responses in bilateral occipitotemporal regions, reflecting visual processing and conceptual preparation. Lower frequency words produced greater BOLD signal in left inferior temporal cortex and the left temporoparietal junction, suggesting involvement of these regions in lexical selection and retrieval and encoding of phonological codes. Word length was positively correlated with signal intensity in Heschl's gyrus bilaterally, extending into the mid-superior temporal gyrus (STG) and sulcus (STS) in the left hemisphere. The left mid-STS site was also modulated by reaction time, suggesting a role in the storage of lexical phonological codes.

  10. Development length of 0.6-inch prestressing strand in standard I-shaped pretensioned concrete beams

    NASA Astrophysics Data System (ADS)

    Barnes, Robert Wesley

    The use of 0.6 in prestressing strand at a center-to-center spacing of 2 in allows for the optimal implementation of High Strength Concrete (HSC) in precast, prestressed concrete bridge superstructures. For this strand configuration, partial debonding of strands is a desirable alternative to the more traditional method of draping strands to alleviate extreme concrete stresses after prestress release. Recent experimental evidence suggests that existing code provisions addressing the anchorage of pretensioned strands do not adequately describe the behavior of these strands. In addition, the anchorage behavior of partially debonded strands is not fully understood. These uncertainties have combined to hinder the full exploitation of HSC in pretensioned concrete construction. A research study was conducted to determine the anchorage behavior of 0.6 in strands at 2 in spacing in full-size bridge members. The experimental program consisted of assessing transfer and development lengths in plant-cast AASHTO Type I I-beams. The influence of concrete compressive strengths ranging from 5700 to 14,700 psi was examined. In order to consider the full range of strand surface conditions found in practice, the prestressing strand featured either a bright mill finish or a rusted surface condition. The anchorage behavior of partially debonded strands was investigated by using a variety of strand debonding configurations, including debonded strand percentages as high as 75 percent. A limited investigation of the effect of horizontal web reinforcement on anchorage behavior was performed. Pull-out tests were performed in an attempt to correlate results with the bond quality of the strands used in the study. The correlation between strand draw-in and the anchorage behavior of prestressing strands was also examined. A review of the evolution and shortcomings of existing code provisions for the anchorage of prestressing strands is presented. Results of the experimental program are reported, along with recommended design procedures based on these results and those from other studies. The use of 0.6 in strand at 2 in spacing is concluded to be safe, and partial debonding of prestressing strands is shown to be an effective means of reducing stresses in the end regions of pretensioned girders.

  11. Long non-coding RNA CRYBG3 blocks cytokinesis by directly binding G-actin.

    PubMed

    Pei, Hailong; Hu, Wentao; Guo, Ziyang; Chen, Huaiyuan; Ma, Ji; Mao, Weidong; Li, Bingyan; Wang, Aiqing; Wan, Jianmei; Zhang, Jian; Nie, Jing; Zhou, Guangming; Hei, Tom K

    2018-06-22

    The dynamic interchange between monomeric globular actin (G-actin) and polymeric filamentous actin filaments (F-actin) is fundamental and essential to many cellular processes including cytokinesis and maintenance of genomic stability. Here we report that the long non-coding RNA LNC CRYBG3 directly binds G-actin to inhibit its polymerization and formation of contractile rings, resulting in M-Phase cell arrest. Knockdown of LNC CRYBG3 in tumor cells enhanced their malignant phenotypes. Nucleotide sequence 228-237 of the full-length LNC CRYBG3 and the ser14 domain of beta-actin are essential for their interaction, and mutation of either of these sites abrogated binding of LNC CRYBG3 to G-actin. Binding of LNC CRYBG3 to G-actin blocked nuclear localization of MAL, which consequently kept serum response factor (SRF) away from the promoter region of several immediate early genes, including JUNB and Arp3, which are necessary for cellular proliferation, tumor growth, adhesion, movement, and metastasis. These findings reveal a novel lncRNA-actin-MAL-SRF pathway and highlight LNC CRYBG3 as a means to block cytokinesis and treat cancer by targeting the actin cytoskeleton. Copyright ©2018, American Association for Cancer Research.

  12. Bitis gabonica (Gaboon viper) snake venom gland: toward a catalog for the full-length transcripts (cDNA) and proteins

    PubMed Central

    Francischetti, Ivo M. B.; My-Pham, Van; Harrison, Jim; Garfield, Mark K.; Ribeiro, José M. C.

    2010-01-01

    The venom gland of the snake Bitis gabonica (Gaboon viper) was used for the first time to construct a unidirectional cDNA phage library followed by high-throughput sequencing and bioinformatic analysis. Hundreds of cDNAs were obtained and clustered into contigs. We found mostly novel full-length cDNA coding for metalloproteases (P-II and P-III classes), Lys49-phospholipase A2, serine proteases with essential mutations in the active site, Kunitz protease inhibitors, several C-type lectins, bradykinin-potentiating peptide, vascular endothelial growth factor, nucleotidases and nucleases, nerve growth factor, and L-amino acid oxidases. Two new members of the recently described short coding region family of disintegrin, displaying RGD and MLD motifs are reported. In addition, we have identified for the first time a cytokine-like molecule and a multi-Kunitz protease inhibitor in snake venoms. The CLUSTAL alignment and the unrooted cladograms for selected families of B. gabonica venom proteins are also presented. A significant number of sequences were devoid of database matches, suggesting that their biologic function remains to be identified. This paper also reports the N-terminus of the 15 most abundant venom proteins and the sequences matching their corresponding transcripts. The electronic version of this manuscript, available on request, contains spreadsheets with hyperlinks to FASTA-formatted files for each contig and the best match to the GenBank and Conserved Domain Databases, in addition to CLUSTAL alignments of each contig. We have thus generated a comprehensive catalog of the B. gabonica venom gland, containing for each secreted protein: i) the predicted molecular weight, ii) the predicted isoelectric point, iii) the accession number, and iv) the putative function. The role of these molecules is discussed in the context of the envenomation caused by the Gaboon viper. PMID:15276202

  13. Classification Techniques for Digital Map Compression

    DTIC Science & Technology

    1989-03-01

    classification improved the performance of the K-means classification algorithm resulting in a compression of 8.06:1 with Lempel-Ziv coding. Run-length coding... compression performance are run-length coding [2], [8] and Lempel-Ziv coding [10], [11]. These techniques are chosen because they are most efficient when...investigated. After the classification, some standard file compression methods, such as Lempel-Ziv and run-length encoding were applied to the
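
    The excerpt above names run-length coding as one of the compression steps applied after classification. The report's own encoder is not shown in this record, so the following is only a minimal sketch of the general technique, assuming a classified map row is a list of integer class labels; the function names and the sample row are hypothetical.

```python
from itertools import groupby


def run_length_encode(labels):
    """Collapse each run of identical labels into a (label, run_length) pair."""
    return [(label, sum(1 for _ in run)) for label, run in groupby(labels)]


def run_length_decode(pairs):
    """Expand (label, run_length) pairs back into the original label sequence."""
    decoded = []
    for label, count in pairs:
        decoded.extend([label] * count)
    return decoded


# Hypothetical row of class labels from a classified map.
row = [0, 0, 0, 0, 1, 1, 2, 2, 2, 2, 2, 0]
encoded = run_length_encode(row)

# The encoding is lossless: decoding restores the row exactly.
assert run_length_decode(encoded) == row
```

    Longer runs give fewer pairs, which is presumably why a classification step that produces large uniform regions helps this kind of coder.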

  14. Full-genome sequences of hepatitis B virus subgenotype D3 isolates from the Brazilian Amazon Region.

    PubMed

    Spitz, Natália; Mello, Francisco C A; Araujo, Natalia Motta

    2015-02-01

    The Brazilian Amazon Region is a highly endemic area for hepatitis B virus (HBV). However, little is known regarding the genetic variability of the strains circulating in this geographical region. Here, we describe the first full-length genomes of HBV isolated in the Brazilian Amazon Region; these genomes are also the first complete HBV subgenotype D3 genomes reported for Brazil. The genomes of the five Brazilian isolates were all 3,182 base pairs in length and the isolates were classified as belonging to subgenotype D3, subtypes ayw2 (n = 3) and ayw3 (n = 2). Phylogenetic analysis suggested that the Brazilian sequences are not likely to be closely related to European D3 sequences. Such results will contribute to further epidemiological and evolutionary studies of HBV.

  15. Similar but not the same: insights into the evolutionary history of paralogous sex-determining genes of the dwarf honey bee Apis florea.

    PubMed

    Biewer, M; Lechner, S; Hasselmann, M

    2016-01-01

    Studying the fate of duplicated genes provides informative insight into the evolutionary plasticity of biological pathways to which they belong. In the paralogous sex-determining genes complementary sex determiner (csd) and feminizer (fem) of honey bee species (genus Apis), only heterozygous csd initiates female development. Here, the full-length coding sequences of the genes csd and fem of the phylogenetically basal dwarf honey bee Apis florea are characterized. Compared with other Apis species, remarkable evolutionary changes in the formation and localization of a protein-interacting (coiled-coil) motif and in the amino acids coding for the csd characteristic hypervariable region (HVR) are observed. Furthermore, functionally different csd alleles were isolated as genomic fragments from a random population sample. In the predicted potential specifying domain (PSD), a high ratio of πN/πS=1.6 indicated positive selection, whereas signs of balancing selection, commonly found in other Apis species, are missing. Low nucleotide diversity on synonymous and genome-wide, non-coding sites as well as site frequency analyses indicated a strong impact of genetic drift in A. florea, likely linked to its biology. Along the evolutionary trajectory of ~30 million years of csd evolution, episodic diversifying selection seems to have acted differently among distinct Apis branches. Consistently low amino-acid differences within the PSD among pairs of functional heterozygous csd alleles indicate that the HVR is the most important region for determining allele specificity. We propose that in the early history of the lineage-specific fem duplication giving rise to csd in Apis, A. florea csd stands as a remarkable example for the plasticity of initial sex-determining signals.

  16. Resolving the Kinetic Reconnection Length Scale in Global Magnetospheric Simulations with MHD-EPIC

    NASA Astrophysics Data System (ADS)

    Toth, G.; Chen, Y.; Cassak, P.; Jordanova, V.; Peng, B.; Markidis, S.; Gombosi, T. I.

    2016-12-01

    We have recently developed a new modeling capability: the Magnetohydrodynamics with Embedded Particle-in-Cell (MHD-EPIC) algorithm with support from Los Alamos SHIELDS and NSF INSPIRE grants. We have implemented MHD-EPIC into the Space Weather Modeling Framework (SWMF) using the implicit Particle-in-Cell (iPIC3D) and the BATS-R-US extended magnetohydrodynamic codes. The MHD-EPIC model allows two-way coupled simulations in two and three dimensions with multiple embedded PIC regions. Both BATS-R-US and iPIC3D are massively parallel codes. The MHD-EPIC approach allows global magnetosphere simulations with embedded kinetic simulations. For small magnetospheres, like Ganymede or Mercury, we can easily resolve the ion scales around the reconnection sites. Modeling the Earth magnetosphere is very challenging even with our efficient MHD-EPIC model due to the large separation between the global and ion scales. On the other hand the large separation of scales may be exploited: the solution may not be sensitive to the ion inertial length as long as it is small relative to the global scales. The ion inertial length can be varied by changing the ion mass while keeping the MHD mass density, the velocity, and pressure the same for the initial and boundary conditions. Our two-dimensional MHD-EPIC simulations for the dayside reconnection region show in fact, that the overall solution is not sensitive to ion inertial length. The shape, size and frequency of flux transfer events are very similar for a wide range of ion masses. Our results mean that 3D MHD-EPIC simulations for the Earth and other large magnetospheres can be made computationally affordable by artificially increasing the ion mass: the required grid resolution and time step in the PIC model are proportional to the ion inertial length. Changing the ion mass by a factor of 4, for example, speeds up the PIC code by a factor of 256. 
In fact, this approach allowed us to perform an hour-long 3D MHD-EPIC simulation for the Earth magnetosphere.
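
    The factor of 256 quoted in the abstract can be reproduced with a short calculation. This is a sketch under two assumptions that the abstract does not spell out: at fixed mass density the ion inertial length scales linearly with the ion mass (number density n = ρ/m_i, so c/ω_pi ∝ m_i), and the PIC cost scales with the number of cells times the number of time steps. The function name is hypothetical.

```python
def pic_speedup(mass_factor, dims=3):
    """Estimated PIC speedup from scaling the ion mass at fixed mass density.

    Assumption: the ion inertial length grows by the same factor as the ion
    mass, and both the cell size and the time step grow with that length.
    """
    length_factor = mass_factor            # d_i proportional to m_i at fixed rho
    fewer_cells = length_factor ** dims    # coarser grid in each spatial dimension
    fewer_steps = length_factor            # proportionally longer time step
    return fewer_cells * fewer_steps


# Reproduces the abstract's example: ion mass increased by a factor of 4 in 3D.
assert pic_speedup(4) == 256
```

    Under the same assumptions a two-dimensional run would gain a factor of 4³ = 64.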

  17. A fully decompressed synthetic bacteriophage øX174 genome assembled and archived in yeast.

    PubMed

    Jaschke, Paul R; Lieberman, Erica K; Rodriguez, Jon; Sierra, Adrian; Endy, Drew

    2012-12-20

    The 5386 nucleotide bacteriophage øX174 genome has a complicated architecture that encodes 11 gene products via overlapping protein coding sequences spanning multiple reading frames. We designed a 6302 nucleotide synthetic surrogate, øX174.1, that fully separates all primary phage protein coding sequences along with cognate translation control elements. To specify øX174.1f, a decompressed genome the same length as wild type, we truncated the gene F coding sequence. We synthesized DNA encoding fragments of øX174.1f and used a combination of in vitro- and yeast-based assembly to produce yeast vectors encoding natural or designer bacteriophage genomes. We isolated clonal preparations of yeast plasmid DNA and transfected E. coli C strains. We recovered viable øX174 particles containing the øX174.1f genome from E. coli C strains that independently express full-length gene F. We expect that yeast can serve as a genomic 'drydock' within which to maintain and manipulate clonal lineages of other obligate lytic phage. Copyright © 2012 Elsevier Inc. All rights reserved.

  18. Efficient Transition State Optimization of Periodic Structures through Automated Relaxed Potential Energy Surface Scans.

    PubMed

    Plessow, Philipp N

    2018-02-13

    This work explores how constrained linear combinations of bond lengths can be used to optimize transition states in periodic structures. Scanning of constrained coordinates is a standard approach for molecular codes with localized basis functions, where a full set of internal coordinates is used for optimization. Common plane-wave codes for periodic boundary conditions almost exclusively rely on Cartesian coordinates. An implementation of constrained linear combinations of bond lengths with Cartesian coordinates is described. Along with an optimization of the value of the constrained coordinate toward the transition states, this allows transition optimization within a single calculation. The approach is suitable for transition states that can be well described in terms of broken and formed bonds. In particular, the implementation is shown to be effective and efficient in the optimization of transition states in zeolite-catalyzed reactions, which have high relevance in industrial processes.

  19. The complete nucleotide sequence of RNA beta from the type strain of barley stripe mosaic virus.

    PubMed Central

    Gustafson, G; Armour, S L

    1986-01-01

    The complete nucleotide sequence of RNA beta from the type strain of barley stripe mosaic virus (BSMV) has been determined. The sequence is 3289 nucleotides in length and contains four open reading frames (ORFs) which code for proteins of Mr 22,147 (ORF1), Mr 58,098 (ORF2), Mr 17,378 (ORF3), and Mr 14,119 (ORF4). The predicted N-terminal amino acid sequence of the polypeptide encoded by the ORF nearest the 5'-end of the RNA (ORF1) is identical (after the initiator methionine) to the published N-terminal amino acid sequence of BSMV coat protein for 29 of the first 30 amino acids. ORF2 occupies the central portion of the coding region of RNA beta and ORF3 is located at the 3'-end. The ORF4 sequence overlaps the 3'-region of ORF2 and the 5'-region of ORF3 and differs in codon usage from the other three RNA beta ORFs. The coding region of RNA beta is followed by a poly(A) tract and a 238 nucleotide tRNA-like structure which are common to all three BSMV genomic RNAs. PMID:3754962

  20. Forensic strategy to ensure the quality of sequencing data of mitochondrial DNA in highly degraded samples.

    PubMed

    Adachi, Noboru; Umetsu, Kazuo; Shojo, Hideki

    2014-01-01

    Mitochondrial DNA (mtDNA) is widely used for DNA analysis of highly degraded samples because of its polymorphic nature and high number of copies in a cell. However, as endogenous mtDNA in deteriorated samples is scarce and highly fragmented, it is not easy to obtain reliable data. In the current study, we report the risks of direct sequencing mtDNA in highly degraded material, and suggest a strategy to ensure the quality of sequencing data. It was observed that direct sequencing data of the hypervariable segment (HVS) 1 by using primer sets that generate an amplicon of 407 bp (long-primer sets) was different from results obtained by using newly designed primer sets that produce an amplicon of 120-139 bp (mini-primer sets). The data aligned with the results of mini-primer sets analysis in an amplicon length-dependent manner; the shorter the amplicon, the more evident the endogenous sequence became. Coding region analysis using multiplex amplified product-length polymorphisms revealed the incongruence of single nucleotide polymorphisms between the coding region and HVS 1 caused by contamination with exogenous mtDNA. Although the sequencing data obtained using long-primer sets turned out to be erroneous, it was unambiguous and reproducible. These findings suggest that PCR primers that produce amplicons shorter than those currently recognized should be used for mtDNA analysis in highly degraded samples. Haplogroup motif analysis of the coding region and HVS should also be performed to improve the reliability of forensic mtDNA data. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.

  1. Hybrid 3D model for the interaction of plasma thruster plumes with nearby objects

    NASA Astrophysics Data System (ADS)

    Cichocki, Filippo; Domínguez-Vázquez, Adrián; Merino, Mario; Ahedo, Eduardo

    2017-12-01

    This paper presents a hybrid particle-in-cell (PIC) fluid approach to model the interaction of a plasma plume with a spacecraft and/or any nearby object. Ions and neutrals are modeled with a PIC approach, while electrons are treated as a fluid. After a first iteration of the code, the domain is split into quasineutral and non-neutral regions, based on non-neutrality criteria, such as the relative charge density and the Debye length-to-cell size ratio. At the material boundaries of the former quasineutral region, a dedicated algorithm ensures that the Bohm condition is met. In the latter non-neutral regions, the electron density and electric potential are obtained by solving the coupled electron momentum balance and Poisson equations. Boundary conditions for both the electric current and potential are finally obtained with a plasma sheath sub-code and an equivalent circuit model. The hybrid code is validated by applying it to a typical plasma plume-spacecraft interaction scenario, and the physics and capabilities of the model are finally discussed.

  2. Acetylcholinesterase 1 in populations of organophosphate resistant North American strains of the cattle tick, Rhipicephalus microplus (Acari: Ixodidae)

    USDA-ARS's Scientific Manuscript database

    In a collaboration with Purdue University researchers, we sequenced a 143,606 base pair Rhipicephalus microplus BAC library clone that contained the coding region for acetylcholinesterase 1 (AChE1). Sequencing was by Sanger protocols and the final assembly resulted in 15 contigs of varying length, e...

  3. Length and nucleotide sequence polymorphism at the trnL and trnF non-coding regions of chloroplast genomes among Saccharum and Erianthus species

    USDA-ARS's Scientific Manuscript database

    The aneupolyploidy genome of sugarcane (Saccharum hybrids spp.) and lack of a classical genetic linkage map make genetics research most difficult for sugarcane. Whole genome sequencing and genetic characterization of sugarcane and related taxa are far behind other crops. In this study, universal PCR...

  4. Analysis of expressed sequence tags generated from full-length enriched cDNA libraries of melon

    PubMed Central

    2011-01-01

    Background: Melon (Cucumis melo), an economically important vegetable crop, belongs to the Cucurbitaceae family which includes several other important crops such as watermelon, cucumber, and pumpkin. It has served as a model system for sex determination and vascular biology studies. However, genomic resources currently available for melon are limited. Results: We constructed eleven full-length enriched and four standard cDNA libraries from fruits, flowers, leaves, roots, cotyledons, and calluses of four different melon genotypes, and generated 71,577 and 22,179 ESTs from full-length enriched and standard cDNA libraries, respectively. These ESTs, together with ~35,000 ESTs available in public domains, were assembled into 24,444 unigenes, which were extensively annotated by comparing their sequences to different protein and functional domain databases, assigning them Gene Ontology (GO) terms, and mapping them onto metabolic pathways. Comparative analysis of melon unigenes and other plant genomes revealed that 75% to 85% of melon unigenes had homologs in other dicot plants, while approximately 70% had homologs in monocot plants. The analysis also identified 6,972 gene families that were conserved across dicot and monocot plants, and 181, 1,192, and 220 gene families specific to fleshy fruit-bearing plants, the Cucurbitaceae family, and melon, respectively. Digital expression analysis identified a total of 175 tissue-specific genes, which provides a valuable gene sequence resource for future genomics and functional studies. Furthermore, we identified 4,068 simple sequence repeats (SSRs) and 3,073 single nucleotide polymorphisms (SNPs) in the melon EST collection. Finally, we obtained a total of 1,382 melon full-length transcripts through the analysis of full-length enriched cDNA clones that were sequenced from both ends.
Analysis of these full-length transcripts indicated that sizes of melon 5' and 3' UTRs were similar to those of tomato, but longer than many other dicot plants. Codon usages of melon full-length transcripts were largely similar to those of Arabidopsis coding sequences. Conclusion: The collection of melon ESTs generated from full-length enriched and standard cDNA libraries is expected to play significant roles in annotating the melon genome. The ESTs and associated analysis results will be useful resources for gene discovery, functional analysis, marker-assisted breeding of melon and closely related species, comparative genomic studies and for gaining insights into gene expression patterns. PMID:21599934

  5. EXPRESSION AND CHARACTERIZATION OF FULL-LENGTH HUMAN HEME OXYGENASE-1: PRESENCE OF INTACT MEMBRANE-BINDING REGION LEADS TO INCREASED BINDING AFFINITY FOR NADPH-CYTOCHROME P450 REDUCTASE

    PubMed Central

    Huber, Warren J.; Backes, Wayne L.

    2009-01-01

    Heme oxygenase (HO) is the chief regulatory enzyme in the oxidative degradation of heme to biliverdin. In the process of heme degradation, this NADPH and cytochrome P450 reductase (CPR)-dependent oxidation of heme also releases free iron and carbon monoxide. Much of the recent research involving heme oxygenase is done using a 30-kDa soluble form of the enzyme, which lacks the membrane binding region (C-terminal 23 amino acids). The goal of this study was to express and purify a full-length human HO-1 (hHO-1) protein; however, due to the lability of the full-length form, a rapid purification procedure was required. This was accomplished by use of a GST-tagged hHO-1 construct. Although the procedure permitted the generation of a full-length HO-1, this form was contaminated with a 30-kDa degradation product that could not be eliminated. Therefore, we attempted to remove a putative secondary thrombin cleavage site by a conservative mutation of amino acid 254, which replaces arginine with lysine. This mutation allowed the expression and purification of a full-length hHO-1 protein. Unlike wild-type HO-1, the R254K mutant could be purified to a single 32-kDa protein capable of degrading heme at the same rate as the wild-type enzyme. The R254K full-length form had a specific activity of ~200–225 nmol bilirubin hr⁻¹ nmol⁻¹ HO-1 as compared to ~140–150 nmol bilirubin hr⁻¹ nmol⁻¹ for the WT form, which contains the 30-kDa contaminant. This is a 2–3-fold increase from the previously reported soluble 30-kDa HO-1, suggesting that the C-terminal 23 amino acids are essential for maximal catalytic activity. Because the membrane spanning domain is present, the full-length hHO-1 has the potential to incorporate into phospholipid membranes, which can be reconstituted at known concentrations, in combination with other ER-resident enzymes. PMID:17915953

  6. Expression and characterization of full-length human heme oxygenase-1: the presence of intact membrane-binding region leads to increased binding affinity for NADPH cytochrome P450 reductase.

    PubMed

    Huber, Warren J; Backes, Wayne L

    2007-10-30

    Heme oxygenase-1 (HO-1) is the chief regulatory enzyme in the oxidative degradation of heme to biliverdin. In the process of heme degradation, HO-1 receives the electrons necessary for catalysis from the flavoprotein NADPH cytochrome P450 reductase (CPR), releasing free iron and carbon monoxide. Much of the recent research involving heme oxygenase has been done using a 30 kDa soluble form of the enzyme, which lacks the membrane binding region (C-terminal 23 amino acids). The goal of this study was to express and purify a full-length human HO-1 (hHO-1) protein; however, due to the lability of the full-length form, a rapid purification procedure was required. This was accomplished by use of a glutathione S-transferase (GST)-tagged hHO-1 construct. Although the procedure permitted the generation of a full-length HO-1, this form was contaminated with a 30 kDa degradation product that could not be eliminated. Therefore, attempts were made to remove a putative secondary thrombin cleavage site by a conservative mutation of amino acid 254, which replaces arginine with lysine. This mutation allowed the expression and purification of a full-length hHO-1 protein. Unlike wild type (WT) HO-1, the R254K mutant could be purified to a single 32 kDa protein capable of degrading heme at the same rate as the WT enzyme. The R254K full-length form had a specific activity of approximately 200-225 nmol of bilirubin h⁻¹ nmol⁻¹ HO-1 as compared to approximately 140-150 nmol of bilirubin h⁻¹ nmol⁻¹ for the WT form, which contains the 30 kDa contaminant. This is a 2-3-fold increase from the previously reported soluble 30 kDa HO-1, suggesting that the C-terminal 23 amino acids are essential for maximal catalytic activity. Because the membrane-spanning domain is present, the full-length hHO-1 has the potential to incorporate into phospholipid membranes, which can be reconstituted at known concentrations, in combination with other endoplasmic reticulum resident enzymes.

  7. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Mohr, C.L.; Rausch, W.N.; Hesson, G.M.

    The LOCA Simulation Program in the NRU reactor is the first set of experiments to provide data on the behavior of full-length, nuclear-heated PWR fuel bundles during the heatup, reflood, and quench phases of a loss-of-coolant accident (LOCA). This paper compares the temperature time histories of 4 experimental test cases with 4 computer codes: CE-THERM, FRAP-T5, GT3-FLECHT, and TRUMP-FLECHT. The preliminary comparisons between prediction and experiment show that the state-of-the-art fuel codes have large uncertainties and are not necessarily conservative in predicting peak temperatures, turnaround times, and bundle quench times.

  8. The complete chloroplast genome of Cinnamomum camphora and its comparison with related Lauraceae species.

    PubMed

    Chen, Caihui; Zheng, Yongjie; Liu, Sian; Zhong, Yongda; Wu, Yanfang; Li, Jiang; Xu, Li-An; Xu, Meng

    2017-01-01

    Cinnamomum camphora, a member of the Lauraceae family, is a valuable aromatic and timber tree that is indigenous to the south of China and Japan. All parts of Cinnamomum camphora have secretory cells containing different volatile chemical compounds that are utilized as herbal medicines and essential oils. Here, we reported the complete sequencing of the chloroplast genome of Cinnamomum camphora using Illumina technology. The chloroplast genome of Cinnamomum camphora is 152,570 bp in length and characterized by a relatively conserved quadripartite structure containing a large single copy region of 93,705 bp, a small single copy region of 19,093 bp and two inverted repeat (IR) regions of 19,886 bp each. Overall, the genome contained 123 coding regions, of which 15 were repeated in the IR regions. An analysis of chloroplast sequence divergence revealed that the small single copy region was highly variable among the different genera in the Lauraceae family. A total of 40 repeat structures and 83 simple sequence repeats were detected in both the coding and non-coding regions. A phylogenetic analysis indicated that Calycanthus is most closely related to Lauraceae, both being members of Laurales, which forms a sister group to Magnoliids. The complete sequence of the chloroplast of Cinnamomum camphora will aid in in-depth taxonomical studies of the Lauraceae family in the future. The genetic sequence information will also have valuable applications for chloroplast genetic engineering.
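
    The region sizes quoted in this abstract can be checked against the reported genome length. The sketch below assumes the usual quadripartite layout, in which the genome is the large single copy region, the small single copy region, and two copies of the inverted repeat; the function name is hypothetical and the numbers are taken from the abstract.

```python
def quadripartite_length(lsc_bp, ssc_bp, ir_bp):
    """Total chloroplast genome length: LSC + SSC + two inverted repeat copies."""
    return lsc_bp + ssc_bp + 2 * ir_bp


# Region sizes reported for Cinnamomum camphora in the abstract above.
total_bp = quadripartite_length(lsc_bp=93_705, ssc_bp=19_093, ir_bp=19_886)

# The sum matches the reported genome length of 152,570 bp.
assert total_bp == 152_570
```

    The match confirms that the 19,886 bp figure is the length of each inverted repeat, not of the two combined.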

  9. Bovine adipose triglyceride lipase is not altered and adipocyte fatty acid binding protein is increased by dietary flaxseed

    USDA-ARS's Scientific Manuscript database

    In this paper, we report the full-length coding sequence of bovine ATGL cDNA and analyze its expression in bovine tissues. Similar to human, mouse, and pig ATGL sequences, bovine ATGL has a highly conserved patatin domain that is necessary for lipolytic function in mice and humans. Thi...

  10. Effects of Homology Length in the Repeat Region on Minus-Strand DNA Transfer and Retroviral Replication

    PubMed Central

    Dang, Que; Hu, Wei-Shau

    2001-01-01

    Homology between the two repeat (R) regions in the retroviral genome mediates minus-strand DNA transfer during reverse transcription. We sought to define the effects of R homology lengths on minus-strand DNA transfer. We generated five murine leukemia virus (MLV)-based vectors that contained identical sequences but different lengths of the 3′ R (3, 6, 12, 24 and 69 nucleotides [nt]); 69 nt is the full-length MLV R. After one round of replication, viral titers from the vector with a full-length downstream R were compared with viral titers generated from the other four vectors with reduced R lengths. Viral titers generated from vectors with R lengths reduced to one-third (24 nt) or one-sixth (12 nt) that of the wild type were not significantly affected; however, viral titers generated from vectors with only 3- or 6-nt homology in the R region were significantly lower. Because expression and packaging of the RNA were similar among all the vectors, the differences in the viral titers most likely reflected the impact of the homology lengths on the efficiency of minus-strand DNA transfer. The molecular nature of minus-strand DNA transfer was characterized in 63 proviruses. Precise R-to-R transfer was observed in most proviruses generated from vectors with 12-, 24-, or 69-nt homology in R, whereas aberrant transfers were predominantly used to generate proviruses from vectors with 3- or 6-nt homology. Reverse transcription using RNA transcribed from an upstream promoter, termed read-in RNA transcripts, resulted in most of the aberrant transfers. These data demonstrate that minus-strand DNA transfer is homology driven and a minimum homology length is required for accurate and efficient minus-strand DNA transfer. PMID:11134294

  11. Construction of a Full-Length Enriched cDNA Library and Preliminary Analysis of Expressed Sequence Tags from Bengal Tiger Panthera tigris tigris

    PubMed Central

    Liu, Changqing; Liu, Dan; Guo, Yu; Lu, Taofeng; Li, Xiangchen; Zhang, Minghai; Ma, Jianzhang; Ma, Yuehui; Guan, Weijun

    2013-01-01

    In this study, a full-length enriched cDNA library was successfully constructed from Bengal tiger, Panthera tigris tigris, the most well-known wild animal. Total RNA was extracted from cultured Bengal tiger fibroblasts in vitro. The titers of primary and amplified libraries were 1.28 × 10⁶ pfu/mL and 1.56 × 10⁹ pfu/mL, respectively. The percentage of recombinants from the unamplified library was 90.2% and the average length of exogenous inserts was 0.98 kb. A total of 212 individual ESTs with sizes ranging from 356 to 1108 bp were then analyzed. The BLASTX score revealed that 48.1% of the sequences were classified as a strong match, 45.3% as nominal and 6.6% as a weak match. Among the ESTs with known putative function, 26.4% ESTs were found to be related to all kinds of metabolisms, 19.3% ESTs to information storage and processing, 11.3% ESTs to posttranslational modification, protein turnover, chaperones, 11.3% ESTs to transport, 9.9% ESTs to signal transducer/cell communication, 9.0% ESTs to structure protein, 3.8% ESTs to cell cycle, and only 6.6% ESTs classified as novel genes. By EST sequencing, a full-length gene coding ferritin was identified and characterized. The recombinant plasmid pET32a-TAT-Ferritin was constructed, coding for the TAT-Ferritin fusion protein with two 6× His-tags at the N- and C-termini. After BCA assay, the concentration of soluble Trx-TAT-Ferritin recombinant protein was 2.32 ± 0.12 mg/mL. These results demonstrated that the reliability and representativeness of the cDNA library met the requirements of a standard cDNA library. This library provided a useful platform for the functional genome and transcriptome research of Bengal tigers. PMID:23708105

  12. Construction of a full-length enriched cDNA library and preliminary analysis of expressed sequence tags from Bengal Tiger Panthera tigris tigris.

    PubMed

    Liu, Changqing; Liu, Dan; Guo, Yu; Lu, Taofeng; Li, Xiangchen; Zhang, Minghai; Ma, Jianzhang; Ma, Yuehui; Guan, Weijun

    2013-05-24

    In this study, a full-length enriched cDNA library was successfully constructed from Bengal tiger, Panthera tigris tigris, the most well-known wild animal. Total RNA was extracted from cultured Bengal tiger fibroblasts in vitro. The titers of primary and amplified libraries were 1.28 × 10⁶ pfu/mL and 1.56 × 10⁹ pfu/mL, respectively. The percentage of recombinants from the unamplified library was 90.2% and the average length of exogenous inserts was 0.98 kb. A total of 212 individual ESTs with sizes ranging from 356 to 1108 bp were then analyzed. The BLASTX score revealed that 48.1% of the sequences were classified as a strong match, 45.3% as nominal and 6.6% as a weak match. Among the ESTs with known putative function, 26.4% ESTs were found to be related to all kinds of metabolisms, 19.3% ESTs to information storage and processing, 11.3% ESTs to posttranslational modification, protein turnover, chaperones, 11.3% ESTs to transport, 9.9% ESTs to signal transducer/cell communication, 9.0% ESTs to structure protein, 3.8% ESTs to cell cycle, and only 6.6% ESTs classified as novel genes. By EST sequencing, a full-length gene coding ferritin was identified and characterized. The recombinant plasmid pET32a-TAT-Ferritin was constructed, coding for the TAT-Ferritin fusion protein with two 6× His-tags at the N- and C-termini. After BCA assay, the concentration of soluble Trx-TAT-Ferritin recombinant protein was 2.32 ± 0.12 mg/mL. These results demonstrated that the reliability and representativeness of the cDNA library met the requirements of a standard cDNA library. This library provided a useful platform for the functional genome and transcriptome research of Bengal tigers.

  13. Neural code alterations and abnormal time patterns in Parkinson’s disease

    NASA Astrophysics Data System (ADS)

    Andres, Daniela Sabrina; Cerquetti, Daniel; Merello, Marcelo

    2015-04-01

    Objective. The neural code used by the basal ganglia is a current question in neuroscience, relevant for the understanding of the pathophysiology of Parkinson’s disease. While a rate code is known to participate in the communication between the basal ganglia and the motor thalamus/cortex, different lines of evidence have also favored the presence of complex time patterns in the discharge of the basal ganglia. To gain insight into the way the basal ganglia code information, we studied the activity of the globus pallidus pars interna (GPi), an output node of the circuit. Approach. We implemented the 6-hydroxydopamine model of Parkinsonism in Sprague-Dawley rats, and recorded the spontaneous discharge of single GPi neurons, in head-restrained conditions at full alertness. Analyzing the temporal structure function, we looked for characteristic scales in the neuronal discharge of the GPi. Main results. At a low-scale, we observed the presence of dynamic processes, which allow the transmission of time patterns. Conversely, at a middle-scale, stochastic processes force the use of a rate code. Regarding the time patterns transmitted, we measured the word length and found that it is increased in Parkinson’s disease. Furthermore, it showed a positive correlation with the frequency of discharge, indicating that an exacerbation of this abnormal time pattern length can be expected, as the dopamine depletion progresses. Significance. We conclude that a rate code and a time pattern code can co-exist in the basal ganglia at different temporal scales. However, their normal balance is progressively altered and replaced by pathological time patterns in Parkinson’s disease.

  14. Distinct spatiotemporal accumulation of N-truncated and full-length amyloid-β42 in Alzheimer's disease.

    PubMed

    Shinohara, Mitsuru; Koga, Shunsuke; Konno, Takuya; Nix, Jeremy; Shinohara, Motoko; Aoki, Naoya; Das, Pritam; Parisi, Joseph E; Petersen, Ronald C; Rosenberry, Terrone L; Dickson, Dennis W; Bu, Guojun

    2017-12-01

    Accumulation of amyloid-β peptides is a dominant feature in the pathogenesis of Alzheimer's disease; however, it is not clear how individual amyloid-β species accumulate and affect other neuropathological and clinical features in the disease. Thus, we compared the accumulation of N-terminally truncated amyloid-β and full-length amyloid-β, depending on disease stage as well as brain area, and determined how these amyloid-β species respectively correlate with clinicopathological features of Alzheimer's disease. To this end, the amounts of amyloid-β species and other proteins related to amyloid-β metabolism or Alzheimer's disease were quantified by enzyme-linked immunosorbent assays (ELISA) or theoretically calculated in 12 brain regions, including neocortical, limbic and subcortical areas from Alzheimer's disease cases (n = 19), neurologically normal elderly without amyloid-β accumulation (normal ageing, n = 13), and neurologically normal elderly with cortical amyloid-β accumulation (pathological ageing, n = 15). We observed that N-terminally truncated amyloid-β42 and full-length amyloid-β42 accumulations distributed differently across disease stages and brain areas, while N-terminally truncated amyloid-β40 and full-length amyloid-β40 accumulation showed an almost identical distribution pattern. Cortical N-terminally truncated amyloid-β42 accumulation was increased in Alzheimer's disease compared to pathological ageing, whereas cortical full-length amyloid-β42 accumulation was comparable between Alzheimer's disease and pathological ageing. Moreover, N-terminally truncated amyloid-β42 were more likely to accumulate more in specific brain areas, especially some limbic areas, while full-length amyloid-β42 tended to accumulate more in several neocortical areas, including frontal cortices. 
    Immunoprecipitation followed by mass spectrometry analysis showed that several N-terminally truncated amyloid-β42 species, represented by pyroglutamylated amyloid-β11-42, were enriched in these areas, consistent with ELISA results. N-terminally truncated amyloid-β42 accumulation showed significant regional association with BACE1 and neprilysin, but not with PSD95, which was regionally associated with full-length amyloid-β42 accumulation. Interestingly, accumulations of tau and to a greater extent apolipoprotein E (apoE, encoded by APOE) were more strongly correlated with N-terminally truncated amyloid-β42 accumulation than those of other amyloid-β species across brain areas and disease stages. Consistently, immunohistochemical staining and in vitro binding assays showed that apoE co-localized and bound more strongly with pyroglutamylated amyloid-β11-x fibrils than full-length amyloid-β fibrils. Retrospective review of clinical records showed that accumulation of N-terminally truncated amyloid-β42 in cortical areas was associated with disease onset, duration and cognitive scores. Collectively, N-terminally truncated amyloid-β42 species have spatiotemporal accumulation patterns distinct from full-length amyloid-β42, likely due to different mechanisms governing their accumulations in the brain. These truncated amyloid-β species could play critical roles in the disease by linking other clinicopathological features of Alzheimer's disease. © The Author (2017). Published by Oxford University Press on behalf of the Guarantors of Brain. All rights reserved.

  15. Molecular cloning and characterization of a novel RING zinc-finger protein gene up-regulated under in vitro salt stress in cassava.

    PubMed

    dos Reis, Sávio Pinho; Tavares, Liliane de Souza Conceição; Costa, Carinne de Nazaré Monteiro; Brígida, Aílton Borges Santa; de Souza, Cláudia Regina Batista

    2012-06-01

    Cassava (Manihot esculenta Crantz) is one of the world's most important food crops. It is cultivated mainly in developing countries of the tropics, since its root is a major source of calories for low-income people due to its high productivity and resistance to many abiotic and biotic factors. A previous study identified a partial cDNA sequence coding for a putative RING zinc finger in cassava storage root. The RING zinc finger protein is a specialized type of zinc finger protein found in many organisms. Here, we isolated the full-length cDNA sequence coding for the M. esculenta RZF (MeRZF) protein by a combination of 5' and 3' RACE assays. BLAST analysis showed that its deduced amino acid sequence has a high level of similarity to plant proteins of the RZF family. The MeRZF protein contains a signature sequence motif for a RING zinc finger at its C-terminal region. In addition, this protein has a histidine residue at the fifth coordination site and therefore likely belongs to the RING-H2 subgroup, as confirmed by our phylogenetic analysis. There is also a transmembrane domain in its N-terminal region. Finally, semi-quantitative RT-PCR assays showed that MeRZF expression is increased in detached leaves treated with sodium chloride. Here, we report the first evidence of a RING zinc finger gene of cassava showing a potential role in response to salt stress.

  16. Characterization of a Novel Polerovirus Infecting Maize in China

    PubMed Central

    Chen, Sha; Jiang, Guangzhuang; Wu, Jianxiang; Liu, Yong; Qian, Yajuan; Zhou, Xueping

    2016-01-01

    A novel virus, tentatively named Maize Yellow Mosaic Virus (MaYMV), was identified from the field-grown maize plants showing yellow mosaic symptoms on the leaves collected from the Yunnan Province of China by the deep sequencing of small RNAs. The complete 5642 nucleotide (nt)-long genome of the MaYMV shared the highest nucleotide sequence identity (73%) to Maize Yellow Dwarf Virus-RMV. Sequence comparisons and phylogenetic analyses suggested that MaYMV represents a new member of the genus Polerovirus in the family Luteoviridae. Furthermore, the P0 protein encoded by MaYMV was demonstrated to inhibit both local and systemic RNA silencing by co-infiltration assays using transgenic Nicotiana benthamiana line 16c carrying the GFP reporter gene, which further supported the identification of a new polerovirus. The biologically-active cDNA clone of MaYMV was generated by inserting the full-length cDNA of MaYMV into the binary vector pCB301. RT-PCR and Northern blot analyses showed that this clone was systemically infectious upon agro-inoculation into N. benthamiana. Subsequently, 13 different isolates of MaYMV from field-grown maize plants in different geographical locations of Yunnan and Guizhou provinces of China were sequenced. Analyses of their molecular variation indicate that the 3′ half of P3–P5 read-through protein coding region was the most variable, whereas the coat protein- (CP-) and movement protein- (MP-)coding regions were the most conserved. PMID:27136578

  17. Characterization of a Novel Polerovirus Infecting Maize in China.

    PubMed

    Chen, Sha; Jiang, Guangzhuang; Wu, Jianxiang; Liu, Yong; Qian, Yajuan; Zhou, Xueping

    2016-04-28

    A novel virus, tentatively named Maize Yellow Mosaic Virus (MaYMV), was identified from the field-grown maize plants showing yellow mosaic symptoms on the leaves collected from the Yunnan Province of China by the deep sequencing of small RNAs. The complete 5642 nucleotide (nt)-long genome of the MaYMV shared the highest nucleotide sequence identity (73%) to Maize Yellow Dwarf Virus-RMV. Sequence comparisons and phylogenetic analyses suggested that MaYMV represents a new member of the genus Polerovirus in the family Luteoviridae. Furthermore, the P0 protein encoded by MaYMV was demonstrated to inhibit both local and systemic RNA silencing by co-infiltration assays using transgenic Nicotiana benthamiana line 16c carrying the GFP reporter gene, which further supported the identification of a new polerovirus. The biologically-active cDNA clone of MaYMV was generated by inserting the full-length cDNA of MaYMV into the binary vector pCB301. RT-PCR and Northern blot analyses showed that this clone was systemically infectious upon agro-inoculation into N. benthamiana. Subsequently, 13 different isolates of MaYMV from field-grown maize plants in different geographical locations of Yunnan and Guizhou provinces of China were sequenced. Analyses of their molecular variation indicate that the 3' half of P3-P5 read-through protein coding region was the most variable, whereas the coat protein- (CP-) and movement protein- (MP-)coding regions were the most conserved.

  18. A novel frameshift mutation in the lipoprotein lipase gene is rescued by alternative messenger RNA splicing.

    PubMed

    Laurie, Andrew D; Kyle, Campbell V

    Type I hyperlipoproteinemia, manifesting as chylomicronemia and severe hypertriglyceridemia, is a rare autosomal recessive disorder usually caused by mutations in the lipoprotein lipase gene (LPL). We sought to determine whether mutations in LPL could explain the clinical indications of a patient presenting with pancreatitis and hypertriglyceridemia. Coding regions of LPL were amplified by polymerase chain reaction and analyzed by nucleotide sequencing. The LPL messenger RNA transcript was also analyzed to investigate whether alternative splicing was occurring. The patient was homozygous for the mutation c.767_768insTAAATATT in exon 5 of the LPL gene. This mutation is predicted to result in either a truncated nonfunctional LPL, or alternatively a new 5' donor splice site may be used, resulting in a full-length LPL with an in-frame deletion of 3 amino acids. Analysis of messenger RNA from the patient showed that the new splice site is used in vivo. Homozygosity for a mutation in the LPL gene was consistent with the clinical findings. Use of the new splice site created by the insertion mutation rescues an otherwise damaging frameshift mutation, resulting in expression of an almost full-length LPL that is predicted to be partially functional. The patient therefore has a less severe form of type I hyperlipoproteinemia than would be expected if she lacked any functional LPL. Copyright © 2017 National Lipid Association. Published by Elsevier Inc. All rights reserved.

  19. Complete mitochondrial genome of Bactrocera arecae (Insecta: Tephritidae) by next-generation sequencing and molecular phylogeny of Dacini tribe

    PubMed Central

    Yong, Hoi-Sen; Song, Sze-Looi; Lim, Phaik-Eem; Chan, Kok-Gan; Chow, Wan-Loo; Eamsobhana, Praphathip

    2015-01-01

    The whole mitochondrial genome of the pest fruit fly Bactrocera arecae was obtained from next-generation sequencing of genomic DNA. It had a total length of 15,900 bp, consisting of 13 protein-coding genes, 2 rRNA genes, 22 tRNA genes and a non-coding region (A + T-rich control region). The control region (952 bp) was flanked by rrnS and trnI genes. The start codons included 6 ATG, 3 ATT and 1 each of ATA, ATC, GTG and TCG. Eight TAA, two TAG, one incomplete TA and two incomplete T stop codons were represented in the protein-coding genes. The cloverleaf structure for trnS1 lacked the D-loop, and that of trnN and trnF lacked the TΨC-loop. Molecular phylogeny based on 13 protein-coding genes was concordant with 37 mitochondrial genes, with B. arecae having closest genetic affinity to B. tryoni. The subgenus Bactrocera of Dacini tribe and the Dacinae subfamily (Dacini and Ceratitidini tribes) were monophyletic. The whole mitogenome of B. arecae will serve as a useful dataset for studying the genetics, systematics and phylogenetic relationships of the many species of Bactrocera genus in particular, and tephritid fruit flies in general. PMID:26472633

  20. RAMICS: trainable, high-speed and biologically relevant alignment of high-throughput sequencing reads to coding DNA

    PubMed Central

    Wright, Imogen A.; Travers, Simon A.

    2014-01-01

    The challenge presented by high-throughput sequencing necessitates the development of novel tools for accurate alignment of reads to reference sequences. Current approaches focus on using heuristics to map reads quickly to large genomes, rather than generating highly accurate alignments in coding regions. Such approaches are, thus, unsuited for applications such as amplicon-based analysis and the realignment phase of exome sequencing and RNA-seq, where accurate and biologically relevant alignment of coding regions is critical. To facilitate such analyses, we have developed a novel tool, RAMICS, that is tailored to mapping large numbers of sequence reads to short lengths (<10 000 bp) of coding DNA. RAMICS utilizes profile hidden Markov models to discover the open reading frame of each sequence and aligns to the reference sequence in a biologically relevant manner, distinguishing between genuine codon-sized indels and frameshift mutations. This approach facilitates the generation of highly accurate alignments, accounting for the error biases of the sequencing machine used to generate reads, particularly at homopolymer regions. Performance improvements are gained through the use of graphics processing units, which increase the speed of mapping through parallelization. RAMICS substantially outperforms all other mapping approaches tested in terms of alignment quality while maintaining highly competitive speed performance. PMID:24861618
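
    The distinction drawn above between codon-sized indels and frameshift mutations comes down to whether an indel's length is a multiple of three nucleotides. The short Python sketch below illustrates only that arithmetic. It is not RAMICS's profile hidden Markov model alignment, and the function name classify_indel is a hypothetical one chosen for this example.

```python
def classify_indel(length_nt: int) -> str:
    """Classify an insertion or deletion by its length in nucleotides.

    Illustrative only: a real aligner also has to decide where the
    indel sits relative to the reading frame.
    """
    if length_nt <= 0:
        raise ValueError("indel length must be a positive number of nucleotides")
    # A multiple of three adds or removes whole codons and keeps the frame.
    return "codon-sized" if length_nt % 3 == 0 else "frameshift"


if __name__ == "__main__":
    for n in (1, 3, 8, 9):
        print(n, classify_indel(n))
```

    Under this rule a 3 nt deletion is codon-sized, whereas a 1 nt or 8 nt insertion shifts the reading frame.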

  1. Characterization of the complete mitochondrial genome of the hybrid Epinephelus moara♀ × Epinephelus lanceolatus♂, and phylogenetic analysis in subfamily epinephelinae

    NASA Astrophysics Data System (ADS)

    Gao, Fengtao; Wei, Min; Zhu, Ying; Guo, Hua; Chen, Songlin; Yang, Guanpin

    2017-06-01

    This study presents the complete mitochondrial genome of the hybrid Epinephelus moara♀× Epinephelus lanceolatus♂. The genome is 16886 bp in length, and contains 13 protein-coding genes, 2 rRNA genes, 22 tRNA genes, a light-strand replication origin and a control region. Additionally, phylogenetic analysis based on the nucleotide sequences of 13 conserved protein-coding genes using the maximum likelihood method indicated that the mitochondrial genome is maternally inherited. This study presents genomic data for studying phylogenetic relationships and breeding of hybrid Epinephelinae.

  2. Complete mitochondrial genome of Chocolate Pansy, Junonia iphita (Lepidoptera: Nymphalidae: Nymphalinae).

    PubMed

    Vanlalruati, Catherine; Mandal, Surajit De; Gurusubramanian, Guruswami; Senthil Kumar, Nachimuthu

    2016-07-01

    The complete mitochondrial genome of Junonia iphita was determined to be 15,433 bp in length, including 37 typical mitochondrial genes and an AT-rich region. All the protein-coding genes (PCGs) are initiated by typical ATN codons, except the cox1 gene, which is initiated by a CGA codon. Eight genes use the complete termination codon TAA, whereas the cox1, cox2 and nad5 genes end with a single T, and nad4 and nad1 end with the incomplete stop codon TA. All the tRNAs show cloverleaf secondary structures except trnS1 (AGN). The A + T-rich region is 546 bp in length and contains an ATAGA motif followed by an 18 bp poly-T stretch, two microsatellite-like (TA)9 elements and an 8 bp poly-A stretch immediately upstream of the trnM gene.

  3. The complete mitochondrial genome of the longhorn beetle Xylotrechus grayii (Coleoptera: Cerambycidae).

    PubMed

    Guo, Kun; Chen, Jun; Xu, Chang-Qing; Qiao, Hai-Li; Xu, Rong; Zhao, Xiang-Jian

    2016-05-01

    We sequenced the complete mitochondrial genome of the longhorn beetle, Xylotrechus grayii. The total length of the X. grayii mitogenome was 15,540 bp with an A + T content of 75.29%, consisting of 13 protein-coding genes (PCGs), 22 tRNA genes, 2 rRNA genes and an A + T-rich region. All the genes were arranged in the same order as that of the ancestral insect. All PCGs started with a typical ATN codon except for cox1 and nad1, which used TTG as start codon. Ten out of 13 PCGs terminated with incomplete codons (TA or T). The A + T-rich region was 893 bp in length with an A + T content of 85.89 %.

  4. Large-Scale Collection and Analysis of Full-Length cDNAs from Brachypodium distachyon and Integration with Pooideae Sequence Resources

    PubMed Central

    Mochida, Keiichi; Uehara-Yamaguchi, Yukiko; Takahashi, Fuminori; Yoshida, Takuhiro; Sakurai, Tetsuya; Shinozaki, Kazuo

    2013-01-01

    A comprehensive collection of full-length cDNAs is essential for correct structural gene annotation and functional analyses of genes. We constructed a mixed full-length cDNA library from 21 different tissues of Brachypodium distachyon Bd21, and obtained 78,163 high quality expressed sequence tags (ESTs) from both ends of ca. 40,000 clones (including 16,079 contigs). We updated gene structure annotations of Brachypodium genes based on full-length cDNA sequences in comparison with the latest publicly available annotations. About 10,000 non-redundant gene models were supported by full-length cDNAs; ca. 6,000 showed some transcription unit modifications. We also found ca. 580 novel gene models, including 362 newly identified in Bd21. Using the updated transcription start sites, we searched a total of 580 plant cis-motifs in the −3 kb promoter regions and determined a genome-wide Brachypodium promoter architecture. Furthermore, we integrated the Brachypodium full-length cDNAs and updated gene structures with available sequence resources in wheat and barley in a web-accessible database, the RIKEN Brachypodium FL cDNA database. The database represents a “one-stop” information resource for all genomic information in the Pooideae, facilitating functional analysis of genes in this model grass plant and seamless knowledge transfer to the Triticeae crops. PMID:24130698

  5. Organizational heterogeneity of vertebrate genomes.

    PubMed

    Frenkel, Svetlana; Kirzhner, Valery; Korol, Abraham

    2012-01-01

    Genomes of higher eukaryotes are mosaics of segments with various structural, functional, and evolutionary properties. The availability of whole-genome sequences allows the investigation of their structure as "texts" using different statistical and computational methods. One such method, referred to as Compositional Spectra (CS) analysis, is based on scoring the occurrences of fixed-length oligonucleotides (k-mers) in the target DNA sequence. CS analysis allows generating species- or region-specific characteristics of the genome, regardless of their length and the presence of coding DNA. In this study, we consider the heterogeneity of vertebrate genomes as a joint effect of regional variation in sequence organization superimposed on the differences in nucleotide composition. We estimated compositional and organizational heterogeneity of genome and chromosome sequences separately and found that both heterogeneity types vary widely among genomes as well as among chromosomes in all investigated taxonomic groups. The high correspondence of heterogeneity scores obtained on three genome fractions, coding, repetitive, and the remaining part of the noncoding DNA (the genome dark matter--GDM) allows the assumption that CS-heterogeneity may have functional relevance to genome regulation. Of special interest for such interpretation is the fact that natural GDM sequences display the highest deviation from the corresponding reshuffled sequences.
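
    The idea of scoring the occurrences of fixed-length oligonucleotides can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the authors' Compositional Spectra implementation: it counts exact-match k-mers only, normalizes them to relative frequencies, and compares two spectra with a simple L1 distance. The function names and the choice of distance are hypothetical.

```python
from collections import Counter
from itertools import product

ALPHABET = "ACGT"


def compositional_spectrum(seq: str, k: int = 3) -> dict:
    """Relative frequency of every possible k-mer over ACGT in seq."""
    seq = seq.upper()
    counts = Counter()
    for i in range(len(seq) - k + 1):
        kmer = seq[i:i + k]
        # Skip windows containing N or any other non-ACGT symbol.
        if all(base in ALPHABET for base in kmer):
            counts[kmer] += 1
    total = sum(counts.values())
    spectrum = {}
    for letters in product(ALPHABET, repeat=k):
        kmer = "".join(letters)
        spectrum[kmer] = counts[kmer] / total if total else 0.0
    return spectrum


def spectrum_distance(a: dict, b: dict) -> float:
    """L1 distance between two spectra built with the same k (an assumed metric)."""
    return sum(abs(a[kmer] - b[kmer]) for kmer in a)


if __name__ == "__main__":
    region_1 = compositional_spectrum("ACGTACGTACGTTTGACA", k=2)
    region_2 = compositional_spectrum("GGGCGCGCCGGCGCGGCC", k=2)
    print(spectrum_distance(region_1, region_2))
```

    Because every spectrum has the same 4^k keys, segments of different lengths can be compared directly, which is what makes such a profile usable as a region-specific characteristic.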

  6. Sequence adaptations during growth of rescued classical swine fever viruses in cell culture and within infected pigs.

    PubMed

    Hadsbjerg, Johanne; Friis, Martin B; Fahnøe, Ulrik; Nielsen, Jens; Belsham, Graham J; Rasmussen, Thomas Bruun

    2016-08-30

    Classical swine fever virus (CSFV) causes an economically important disease of swine. Four different viruses were rescued from full-length cloned cDNAs derived from the Paderborn strain of CSFV. Three of these viruses had been modified by mutagenesis (with 7 or 8 nt changes) within stem 2 of the subdomain IIIf of the internal ribosome entry site (IRES) that directs the initiation of protein synthesis. Rescued viruses were inoculated into pigs. The rescued vPader10 virus, without modifications in the IRES, induced clinical disease in pigs that was very similar to that observed previously with the parental field strain and transmission to in-contact pigs occurred. Two sequence reversions, in the NS2 and NS5B coding regions, became dominant within the virus populations in these infected pigs. Rescued viruses, with mutant IRES elements, did not induce disease and only very limited circulation of viral RNA could be detected. However, the animals inoculated with these mutant viruses seroconverted against CSFV. Thus, these mutant viruses were highly attenuated in vivo. All 4 rescued viruses were also passaged up to 20 times in cell culture. Using full genome sequencing, the same two adaptations within each of four independent virus populations were observed that restored the coding sequence to that of the parental field strain. These adaptations occurred with different kinetics. The combination of reverse genetics and in depth, full genome sequencing provides a powerful approach to analyse virus adaptation and to identify key determinants of viral replication efficiency in cells and within host animals. Copyright © 2016 Elsevier B.V. All rights reserved.

  7. Microwave beam broadening due to turbulent plasma density fluctuations within the limit of the Born approximation and beyond

    NASA Astrophysics Data System (ADS)

    Köhn, A.; Guidi, L.; Holzhauer, E.; Maj, O.; Poli, E.; Snicker, A.; Weber, H.

    2018-07-01

    Plasma turbulence, and edge density fluctuations in particular, can under certain conditions broaden the cross-section of injected microwave beams significantly. This can be a severe problem for applications relying on well-localized deposition of the microwave power, like the control of MHD instabilities. Here we investigate this broadening mechanism as a function of fluctuation level, background density and propagation length in a fusion-relevant scenario using two numerical codes, the full-wave code IPF-FDMC and the novel wave kinetic equation solver WKBeam. The latter treats the effects of fluctuations using a statistical approach, based on an iterative solution of the scattering problem (Born approximation). The full-wave simulations are used to benchmark this approach. The Born approximation is shown to be valid over a large parameter range, including ITER-relevant scenarios.

  8. TypeLoader: A fast and efficient automated workflow for the annotation and submission of novel full-length HLA alleles.

    PubMed

    Surendranath, V; Albrecht, V; Hayhurst, J D; Schöne, B; Robinson, J; Marsh, S G E; Schmidt, A H; Lange, V

    2017-07-01

    Recent years have seen a rapid increase in the discovery of novel allelic variants of the human leukocyte antigen (HLA) genes. Commonly, only the exons encoding the peptide binding domains of novel HLA alleles are submitted. As a result, the IPD-IMGT/HLA Database lacks sequence information outside those regions for the majority of known alleles. This has implications for the application of the new sequencing technologies, which deliver sequence data often covering the complete gene. As these technologies simplify the characterization of the complete gene regions, it is desirable for novel alleles to be submitted as full-length sequences to the database. However, the manual annotation of full-length alleles and the generation of specific formats required by the sequence repositories are error-prone and time-consuming. We have developed TypeLoader to address both these facets. With only the full-length sequence as a starting point, TypeLoader performs automatic sequence annotation and subsequently handles all steps involved in preparing the specific formats for submission with very little manual intervention. TypeLoader is routinely used at the DKMS Life Science Lab and has aided in the successful submission of more than 900 novel HLA alleles as full-length sequences to the European Nucleotide Archive repository and the IPD-IMGT/HLA Database with a 95% reduction in the time spent on annotation and submission when compared with handling these processes manually. TypeLoader is implemented as a web application and can be easily installed and used on a standalone Linux desktop system or within a Linux client/server architecture. TypeLoader is downloadable from http://www.github.com/DKMS-LSL/typeloader. © 2017 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  9. Comparative Mitogenomics of the Assassin Bug Genus Peirates (Hemiptera: Reduviidae: Peiratinae) Reveal Conserved Mitochondrial Genome Organization of P. atromaculatus, P. fulvescens and P. turpis

    PubMed Central

    Zhao, Guangyu; Li, Hu; Zhao, Ping; Cai, Wanzhi

    2015-01-01

    In this study, we sequenced four new mitochondrial genomes and presented comparative mitogenomic analyses of five species in the genus Peirates (Hemiptera: Reduviidae). Mitochondrial genomes of these five assassin bugs had a typical set of 37 genes and retained the ancestral gene arrangement of insects. The A+T content, AT- and GC-skews were similar to the common base composition biases of insect mtDNA. Genome size ranged from 15,702 bp to 16,314 bp, and most of the size variation was due to the length and copy number of the repeat unit in the putative control region. All of the control region sequences included large tandem repeats present in two or more copies. Our results revealed similarity in the mitochondrial genomes of P. atromaculatus, P. fulvescens and P. turpis, as well as the highly conserved genomic-level characteristics of these three species, e.g., the same start and stop codons of protein-coding genes, conserved secondary structure of tRNAs, identical location and length of non-coding and overlapping regions, and conservation of structural elements and the tandem repeat unit in the control region. Phylogenetic analyses also supported a close relationship between P. atromaculatus, P. fulvescens and P. turpis, which might be recently diverged species. The present study indicates that the mitochondrial genome has important implications for phylogenetics, population genetics and speciation in the genus Peirates. PMID:25689825

  10. Throughput Optimization Via Adaptive MIMO Communications

    DTIC Science & Technology

    2006-05-30

    End-to-end MATLAB packet simulation platform. * Low density parity check code (LDPCC). * Field trials with Silvus DSP MIMO testbed. * High mobility...incorporate advanced LDPC (low density parity check) codes. Realizing that the power of LDPC codes comes at the price of decoder complexity, we also...Channel Coding Binary Convolution Code or LDPC Packet Length 0 - 2^16-1, bytes Coding Rate 1/2, 2/3, 3/4, 5/6 MIMO Channel Training Length 0 - 4, symbols

  11. The repetitive portion of the Xenopus IgH Mu switch region mediates orientation-dependent class switch recombination.

    PubMed

    Zhang, Zheng Z; Pannunzio, Nicholas R; Lu, Zhengfei; Hsu, Ellen; Yu, Kefei; Lieber, Michael R

    2015-10-01

    Vertebrates developed immunoglobulin heavy chain (IgH) class switch recombination (CSR) to express different IgH constant regions. Most double-strand breaks for Ig CSR occur within the repetitive portion of the switch regions located upstream of each set of constant domain exons for the Igγ, Igα or Igϵ heavy chain. Unlike mammalian switch regions, Xenopus switch regions do not have a high G-density on the non-template DNA strand. In previous studies, Xenopus Sμ DNA moved into the mouse genome was able to support substantial CSR when used to replace the murine Sγ1 region. Here, we tested the 2 kb repetitive portion and the 4.6 kb full-length portion of the Xenopus Sμ in both the natural (forward) orientation relative to the constant domain exons and the opposite (reverse) orientation. Consistent with previous work, we find that the 4.6 kb full-length Sμ mediates similar levels of CSR in both the forward and reverse orientations. Whereas the forward orientation of the 2 kb portion can restore the majority of the CSR level of the 4.6 kb full-length Sμ, the reverse orientation supports R-looping poorly and does not support CSR. The forward orientation of the 2 kb repetitive portion has more GG dinucleotides on the non-template strand than the reverse orientation. The correlation of R-loop formation with CSR efficiency, as demonstrated in the 2 kb repetitive fragment of the Xenopus switch region, confirms a role played by R-looping in CSR that appears to be conserved through evolution. Copyright © 2015 Elsevier Ltd. All rights reserved.

  12. Overexpression, purification, and characterization of SHPTP1, a Src homology 2-containing protein-tyrosine-phosphatase.

    PubMed Central

    Pei, D; Neel, B G; Walsh, C T

    1993-01-01

    A protein-tyrosine-phosphatase (PTPase; EC 3.1.3.48) containing two Src homology 2 (SH2) domains, SHPTP1, was previously identified in hematopoietic and epithelial cells. By placing the coding sequence of the PTPase behind a bacteriophage T7 promoter, we have overexpressed both the full-length enzyme and a truncated PTPase domain in Escherichia coli. In each case, the soluble enzyme was expressed at levels of 3-4% of total soluble E. coli protein. The recombinant proteins had molecular weights of 63,000 and 45,000 for the full-length protein and the truncated PTPase domain, respectively, as determined by SDS/PAGE. The recombinant enzymes dephosphorylated p-nitrophenyl phosphate, phosphotyrosine, and phosphotyrosyl peptides but not phosphoserine, phosphothreonine, or phosphoseryl peptides. The enzymes showed a strong dependence on pH and ionic strength for their activity, with pH optima of 5.5 and 6.3 for the full-length enzyme and the catalytic domain, respectively, and an optimal NaCl concentration of 250-300 mM. The recombinant PTPases had high Km values for p-nitrophenyl phosphate and exhibited non-Michaelis-Menten kinetics for phosphotyrosyl peptides. Images PMID:8430079

  13. Neoclassical orbit calculations with a full-f code for tokamak edge plasmas

    NASA Astrophysics Data System (ADS)

    Rognlien, T. D.; Cohen, R. H.; Dorr, M.; Hittinger, J.; Xu, X. Q.; Collela, P.; Martin, D.

    2008-11-01

    Ion distribution function modifications are considered for the case of neoclassical orbit widths comparable to plasma radial-gradient scale-lengths. Implementation of proper boundary conditions at divertor plates in the continuum TEMPEST code, including the effect of drifts in determining the direction of total flow, enables such calculations in single-null divertor geometry, with and without an electrostatic potential. The resultant poloidal asymmetries in densities, temperatures, and flows are discussed. For long-time simulations, a slow numerical instability develops, even in simplified (circular) geometry with no endloss, which aids identification of the mixed treatment of parallel and radial convection terms as the cause. The new Edge Simulation Laboratory code, expected to be operational, has algorithmic refinements that should address the instability. We will present any available results from the new code on this problem as well as geodesic acoustic mode tests.

  14. The complete chloroplast genome sequence of the medicinal plant Salvia miltiorrhiza.

    PubMed

    Qian, Jun; Song, Jingyuan; Gao, Huanhuan; Zhu, Yingjie; Xu, Jiang; Pang, Xiaohui; Yao, Hui; Sun, Chao; Li, Xian'en; Li, Chuyuan; Liu, Juyan; Xu, Haibin; Chen, Shilin

    2013-01-01

    Salvia miltiorrhiza is an important medicinal plant with great economic and medicinal value. The complete chloroplast (cp) genome sequence of Salvia miltiorrhiza, the first sequenced member of the Lamiaceae family, is reported here. The genome is 151,328 bp in length and exhibits a typical quadripartite structure of the large (LSC, 82,695 bp) and small (SSC, 17,555 bp) single-copy regions, separated by a pair of inverted repeats (IRs, 25,539 bp). It contains 114 unique genes, including 80 protein-coding genes, 30 tRNAs and four rRNAs. The genome structure, gene order, GC content and codon usage are similar to the typical angiosperm cp genomes. Four forward, three inverted and seven tandem repeats were detected in the Salvia miltiorrhiza cp genome. Simple sequence repeat (SSR) analysis among the 30 asterid cp genomes revealed that most SSRs are AT-rich, which contribute to the overall AT richness of these cp genomes. Additionally, fewer SSRs are distributed in the protein-coding sequences compared to the non-coding regions, indicating an uneven distribution of SSRs within the cp genomes. Entire cp genome comparison of Salvia miltiorrhiza and three other Lamiales cp genomes showed a high degree of sequence similarity and a relatively high divergence of intergenic spacers. Sequence divergence analysis discovered the ten most divergent and ten most conserved genes as well as their length variation, which will be helpful for phylogenetic studies in asterids. Our analysis also supports that both regional and functional constraints affect gene sequence evolution. Further, phylogenetic analysis demonstrated a sister relationship between Salvia miltiorrhiza and Sesamum indicum. The complete cp genome sequence of Salvia miltiorrhiza reported in this paper will facilitate population, phylogenetic and cp genetic engineering studies of this medicinal plant.
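
    The quadripartite structure described above implies a simple consistency check: the large and small single-copy regions plus two copies of the inverted repeat should add up to the reported genome length, and the gene categories should add up to the 114 unique genes. A minimal sketch of that arithmetic follows; the function and variable names are illustrative and not taken from the paper.

```python
# Region lengths (bp) as reported in the abstract above.
LSC = 82_695   # large single-copy region
SSC = 17_555   # small single-copy region
IR = 25_539    # one inverted repeat; the genome carries two copies


def quadripartite_length(lsc: int, ssc: int, ir: int) -> int:
    """Total length of a cp genome made of LSC + SSC + two IR copies."""
    return lsc + ssc + 2 * ir


total = quadripartite_length(LSC, SSC, IR)
print(total)  # 151328, matching the reported genome length

# Gene categories reported in the abstract.
gene_counts = {"protein_coding": 80, "tRNA": 30, "rRNA": 4}
unique_genes = sum(gene_counts.values())
print(unique_genes)  # 114
```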

  15. High-efficiency Gaussian key reconciliation in continuous variable quantum key distribution

    NASA Astrophysics Data System (ADS)

    Bai, ZengLiang; Wang, XuYang; Yang, ShenShen; Li, YongMin

    2016-01-01

    Efficient reconciliation is a crucial step in continuous variable quantum key distribution. The progressive-edge-growth (PEG) algorithm is an efficient method to construct relatively short block length low-density parity-check (LDPC) codes. The quasi-cyclic construction method can extend short block length codes and further eliminate the shortest cycle. In this paper, by combining the PEG algorithm and quasi-cyclic construction method, we design long block length irregular LDPC codes with high error-correcting capacity. Based on these LDPC codes, we achieve high-efficiency Gaussian key reconciliation with slice reconciliation based on multilevel coding/multistage decoding with an efficiency of 93.7%.

  16. Comparison of Ultra-Rapid Orbit Prediction Strategies for GPS, GLONASS, Galileo and BeiDou.

    PubMed

    Geng, Tao; Zhang, Peng; Wang, Wei; Xie, Xin

    2018-02-06

    Currently, ultra-rapid orbits play an important role in the high-speed development of global navigation satellite system (GNSS) real-time applications. This contribution focuses on the impact of the fitting arc length of observed orbits and solar radiation pressure (SRP) on the orbit prediction performance for GPS, GLONASS, Galileo and BeiDou. One full year's precise ephemerides during 2015 were used as fitted observed orbits and then as references to be compared with predicted orbits, together with known earth rotation parameters. The full nine-parameter Empirical Center for Orbit Determination in Europe (CODE) Orbit Model (ECOM) and its reduced version were chosen in our study. The arc lengths of observed fitted orbits that showed the smallest weighted root mean squares (WRMSs) and medians of the orbit differences after a Helmert transformation fell between 40 and 45 h for GPS and GLONASS and between 42 and 48 h for Galileo, while the WRMS values and medians become flat after a 42 h arc length for BeiDou. The stability of the Helmert transformation and SRP parameters also confirmed the similar optimal arc lengths. The range around 42-45 h is suggested to be the optimal arc length interval of the fitted observed orbits for the multi-GNSS joint solution of ultra-rapid orbits.

  17. Comparison of Ultra-Rapid Orbit Prediction Strategies for GPS, GLONASS, Galileo and BeiDou

    PubMed Central

    Zhang, Peng; Wang, Wei; Xie, Xin

    2018-01-01

    Currently, ultra-rapid orbits play an important role in the high-speed development of global navigation satellite system (GNSS) real-time applications. This contribution focuses on the impact of the fitting arc length of observed orbits and solar radiation pressure (SRP) on the orbit prediction performance for GPS, GLONASS, Galileo and BeiDou. One full year’s precise ephemerides during 2015 were used as fitted observed orbits and then as references to be compared with predicted orbits, together with known earth rotation parameters. The full nine-parameter Empirical Center for Orbit Determination in Europe (CODE) Orbit Model (ECOM) and its reduced version were chosen in our study. The arc lengths of observed fitted orbits that showed the smallest weighted root mean squares (WRMSs) and medians of the orbit differences after a Helmert transformation fell between 40 and 45 h for GPS and GLONASS and between 42 and 48 h for Galileo, while the WRMS values and medians become flat after a 42 h arc length for BeiDou. The stability of the Helmert transformation and SRP parameters also confirmed the similar optimal arc lengths. The range around 42–45 h is suggested to be the optimal arc length interval of the fitted observed orbits for the multi-GNSS joint solution of ultra-rapid orbits. PMID:29415467

  18. Compression of digital images over local area networks. Appendix 1: Item 3. M.S. Thesis

    NASA Technical Reports Server (NTRS)

    Gorjala, Bhargavi

    1991-01-01

    Differential Pulse Code Modulation (DPCM) has been used with speech for many years. It has not been as successful for images because of poor edge performance. The only corruption in DPCM is quantizer error, but this corruption becomes quite large in the region of an edge because of the abrupt changes in the statistics of the signal. We introduce two improved DPCM schemes: Edge Correcting DPCM and Edge Preservation Differential Coding. These two coding schemes detect the edges and take action to correct them. In the Edge Correcting scheme, the quantizer error for an edge is encoded using a recursive quantizer with entropy coding and sent to the receiver as side information. In the Edge Preserving scheme, when the quantizer input falls in the overload region, the quantizer error is encoded and sent to the receiver repeatedly until the quantizer input falls in the inner levels. These coding schemes therefore increase the bit rate in the region of an edge and require variable rate channels. We implement these two variable rate coding schemes on a token ring network. The timed token protocol supports two classes of messages: asynchronous and synchronous. The synchronous class provides a pre-allocated bandwidth and guaranteed response time. The remaining bandwidth is dynamically allocated to the asynchronous class. The Edge Correcting DPCM is simulated by considering the edge information under the asynchronous class. For the simulation of the Edge Preserving scheme, the amount of information sent each time is fixed, but the length of the packet or the bit rate for that packet is chosen depending on the available capacity. The performance of the network and the performance of the image coding algorithms are studied.
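
    The edge problem described above can be seen in a toy example. The sketch below is plain previous-sample DPCM with a clipped uniform quantizer, not the thesis's Edge Correcting or Edge Preserving scheme; the step size, number of levels, and sample row are assumptions chosen for illustration. Reconstruction error stays small on the flat part of the row and becomes large once the quantizer input enters the overload region at the edge.

```python
def quantize(diff, step=4, levels=4):
    """Uniform quantizer clipped to +/- levels*step (inputs beyond that overload)."""
    q = round(diff / step)
    q = max(-levels, min(levels, q))
    return q * step


def dpcm_encode_decode(samples, step=4, levels=4):
    """Return (reconstruction, per-sample error) for previous-sample DPCM."""
    recon, errors = [], []
    prediction = 0
    for x in samples:
        dq = quantize(x - prediction, step, levels)
        y = prediction + dq      # value the decoder reconstructs
        recon.append(y)
        errors.append(x - y)     # quantizer error for this sample
        prediction = y           # predict from the reconstruction, as the decoder must
    return recon, errors


# Hypothetical image row: a flat region followed by an abrupt edge.
row = [10, 12, 11, 13, 200, 202, 201, 199]
recon, errors = dpcm_encode_decode(row)
print(errors)
```

    The large errors after the edge are the quantity that the two schemes in the abstract encode and send as extra information, which is why they need a variable bit rate.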

  19. Interactive boundary-layer calculations of a transonic wing flow

    NASA Technical Reports Server (NTRS)

    Kaups, Kalle; Cebeci, Tuncer; Mehta, Unmeel

    1989-01-01

    Results obtained from iterative solutions of inviscid and boundary-layer equations are presented and compared with experimental values. The calculated results were obtained with an Euler code and a transonic potential code in order to furnish solutions for the inviscid flow; they were interacted with solutions of two-dimensional boundary-layer equations having a strip-theory approximation. Euler code results are found to be in better agreement with the experimental data than the full potential code results, especially in the presence of shock waves (with the sole exception of the near-tip region).

  20. Ab initio reconstruction of transcriptomes of pluripotent and lineage committed cells reveals gene structures of thousands of lincRNAs

    PubMed Central

    Guttman, Mitchell; Garber, Manuel; Levin, Joshua Z.; Donaghey, Julie; Robinson, James; Adiconis, Xian; Fan, Lin; Koziol, Magdalena J.; Gnirke, Andreas; Nusbaum, Chad; Rinn, John L.; Lander, Eric S.; Regev, Aviv

    2010-01-01

    RNA-Seq provides an unbiased way to study a transcriptome, including both coding and non-coding genes. To date, most RNA-Seq studies have critically depended on existing annotations, and thus focused on expression levels and variation in known transcripts. Here, we present Scripture, a method to reconstruct the transcriptome of a mammalian cell using only RNA-Seq reads and the genome sequence. We apply it to mouse embryonic stem cells, neuronal precursor cells, and lung fibroblasts to accurately reconstruct the full-length gene structures for the vast majority of known expressed genes. We identify substantial variation in protein-coding genes, including thousands of novel 5′-start sites, 3′-ends, and internal coding exons. We then determine the gene structures of over a thousand lincRNA and antisense loci. Our results open the way to direct experimental manipulation of thousands of non-coding RNAs, and demonstrate the power of ab initio reconstruction to render a comprehensive picture of mammalian transcriptomes. PMID:20436462

  1. Error Control Techniques for Satellite and Space Communications

    NASA Technical Reports Server (NTRS)

    Costello, Daniel J., Jr.

    1996-01-01

    In this report, we present the results of our recent work on turbo coding in two formats. Appendix A includes the overheads of a talk that has been given at four different locations over the last eight months. This presentation has received much favorable comment from the research community and has resulted in the full-length paper included as Appendix B, 'A Distance Spectrum Interpretation of Turbo Codes'. Turbo codes use a parallel concatenation of rate 1/2 convolutional encoders combined with iterative maximum a posteriori probability (MAP) decoding to achieve a bit error rate (BER) of 10(exp -5) at a signal-to-noise ratio (SNR) of only 0.7 dB. The channel capacity for a rate 1/2 code with binary phase shift-keyed modulation on the AWGN (additive white Gaussian noise) channel is 0 dB, and thus the Turbo coding scheme comes within 0.7 dB of capacity at a BER of 10(exp -5).
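
    The 0 dB figure quoted above can be reproduced from the Shannon capacity of the real AWGN channel. Setting the code rate R equal to the capacity 0.5*log2(1 + 2*R*Eb/N0) gives a minimum Eb/N0 of (2^(2R) - 1)/(2R). The sketch below evaluates that expression; it assumes an unconstrained (Gaussian) channel input, so the limit for a strictly binary input would be slightly higher, and it reads the abstract's SNR as Eb/N0. The function name is illustrative.

```python
import math


def shannon_limit_ebn0_db(rate: float) -> float:
    """Minimum Eb/N0 (dB) for reliable transmission at `rate` bits per real
    channel use on an AWGN channel with unconstrained input."""
    ebn0 = (2 ** (2 * rate) - 1) / (2 * rate)
    return 10 * math.log10(ebn0)


limit_db = shannon_limit_ebn0_db(0.5)   # rate 1/2 code
gap_db = 0.7 - limit_db                 # operating point reported in the abstract
print(limit_db, gap_db)
```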

  2. Processing, Assembly and Localization of a Bacillus anthracis Spore Protein

    DTIC Science & Technology

    2010-01-01

    phage transduction, using the CP51 phage as described by Thorne (1968). All mutations were confirmed by PCR analysis (Supplementary Table S1). Protein...with End-It (Epicentre) and self-ligated, creating pKH-KSM4. The region between the T7 terminator and T7 promoter of pET23A (EMD Table 1. Strains and...represent full-length BxpA, we analysed the electrophoretic behaviour of a full-length, histidine-tagged and T7 -tagged version of BxpA overproduced in E

  3. Direct Numerical Simulations of a Full Stationary Wind-Turbine Blade

    NASA Astrophysics Data System (ADS)

    Qamar, Adnan; Zhang, Wei; Gao, Wei; Samtaney, Ravi

    2014-11-01

    Direct numerical simulation of flow past a full stationary wind-turbine blade is carried out at a Reynolds number of Re = 10,000, with the blade placed at 0° and 5° angle of attack. The study is targeted to create a DNS database for verification of solvers and turbulence models that are utilized in wind-turbine modeling applications. The full blade comprises a circular cylinder base that is attached to a spanwise varying airfoil cross-section profile (without twist). An overlapping composite grid technique is utilized to perform these DNS computations, which permits block structure in the mapped computational space. Different flow shedding regimes are observed along the blade length. Von Kármán shedding is observed in the cylinder shaft region of the turbine blade. Along the airfoil cross-section of the blade, near-body shear layer breakdown is observed. A long tip vortex originates from the blade tip region, which exits the computational plane without being perturbed. Laminar to turbulent flow transition is observed along the blade length. The amplitude of the turbulent fluctuations decreases along the blade length and the flow remains in the laminar regime in the vicinity of the blade tip. The Strouhal number is found to decrease monotonically along the blade length. Average lift and drag coefficients are also reported for the cases investigated. Supported by funding under a KAUST OCRF-CRG grant.

  4. An Adaptive Source-Channel Coding with Feedback for Progressive Transmission of Medical Images

    PubMed Central

    Lo, Jen-Lung; Sanei, Saeid; Nazarpour, Kianoush

    2009-01-01

    A novel adaptive source-channel coding with feedback for progressive transmission of medical images is proposed here. In the source coding part, the transmission starts from the region of interest (RoI). The parity length in the channel code varies with respect to both the proximity of the image subblock to the RoI and the channel noise, which is iteratively estimated in the receiver. The overall transmitted data can be controlled by the user (clinician). In the case of medical data transmission, it is vital to keep the distortion level under control as in most of the cases certain clinically important regions have to be transmitted without any visible error. The proposed system significantly reduces the transmission time and error. Moreover, the system is very user friendly since the selection of the RoI, its size, overall code rate, and a number of test features such as noise level can be set by the users in both ends. A MATLAB-based TCP/IP connection has been established to demonstrate the proposed interactive and adaptive progressive transmission system. The proposed system is simulated for both binary symmetric channel (BSC) and Rayleigh channel. The experimental results verify the effectiveness of the design. PMID:19190770

  5. Reduced 3d modeling on injection schemes for laser wakefield acceleration at plasma scale lengths

    NASA Astrophysics Data System (ADS)

    Helm, Anton; Vieira, Jorge; Silva, Luis; Fonseca, Ricardo

    2017-10-01

    Current modelling techniques for laser wakefield acceleration (LWFA) are based on particle-in-cell (PIC) codes, which are computationally demanding. In PIC simulations the laser wavelength λ0, in the μm range, has to be resolved over acceleration lengths in the meter range. A promising approach is the ponderomotive guiding center (PGC) solver, which considers only the laser envelope for laser pulse propagation. Therefore only the plasma skin depth λp has to be resolved, leading to speedups of (λp/λ0)^2. This allows a wide range of parameter studies to be performed and makes the approach suitable for λ0 ≪ λp studies. We present the 3d version of a PGC solver in the massively parallel, fully relativistic PIC code OSIRIS. Further, a discussion and characterization of the validity of the PGC solver for injection schemes on the plasma scale lengths, such as down-ramp injection, magnetic injection and ionization injection, through parametric studies, full PIC simulations and theoretical scaling, is presented. This work was partially supported by Fundacao para a Ciencia e Tecnologia (FCT), Portugal, through Grant No. PTDC/FIS-PLA/2940/2014 and PD/BD/105882/2014.
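
    To give a feel for the quadratic scaling quoted above, the sketch below evaluates the ratio of the two resolved length scales, squared, for one pair of values. The numbers (a 0.8 μm laser wavelength and a 20 μm plasma scale length) are assumptions picked for illustration, not parameters from the abstract, and the function name is hypothetical.

```python
def pgc_speedup(lambda_p_um: float, lambda_0_um: float) -> float:
    """Speedup estimate from the abstract's scaling: (lambda_p / lambda_0) squared."""
    return (lambda_p_um / lambda_0_um) ** 2


# Assumed, illustrative values in micrometres.
speedup = pgc_speedup(lambda_p_um=20.0, lambda_0_um=0.8)
print(round(speedup))  # roughly 625 for these assumed values
```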

  6. The complete sequence of mitochondrial genome of polled yak (Bos grunniens).

    PubMed

    Chu, Min; Wu, Xiaoyun; Liang, Chunnian; Pei, Jie; Ding, Xuezhi; Guo, Xian; Bao, Pengjia; Yan, Ping

    2016-05-01

    Generally speaking, the hornless trait is also known as polled. Although the POLL locus could be assigned to a 1.36-Mb interval in the centromeric region of BTA1 (Georges et al., 1993; Drögemüller et al., 2005), and Liu et al. (2014) reported that a 147-kb segment including three protein-coding genes was the most likely location of the POLL mutation in domestic yaks, the underlying genetic basis for the polled trait is still unknown. In this work, the complete mitochondrial genome sequence of polled yak was determined for the first time. The total length of the mitogenome is 16,324 bp, with a base composition of 33.72% A, 27.25% T, 25.83% C, and 13.20% G. It contains 13 protein-coding genes, 22 tRNA genes, 2 rRNA genes and 1 non-coding region (D-loop region). The gene order of the polled yak mitogenome is identical to that observed in most other vertebrates. The complete mitogenome sequence information of polled yak will provide useful data for further studies on protection of genetic resources and phylogenetic relationships within Bos grunniens.

  7. The mitochondrial genome of Polistes jokahamae and a phylogenetic analysis of the Vespoidea (Insecta: Hymenoptera).

    PubMed

    Song, Sheng-Nan; Chen, Peng-Yan; Wei, Shu-Jun; Chen, Xue-Xin

    2016-07-01

    The mitochondrial genome of Polistes jokahamae (Radoszkowski, 1887) (Hymenoptera: Vespidae) (GenBank accession no. KR052468) was sequenced. The current length of this mitochondrial genome, with a partial A + T-rich region, is 16,616 bp. All the typical mitochondrial genes were sequenced except for three tRNAs (trnI, trnQ, and trnY) located between the A + T-rich region and nad2. At least three rearrangement events occurred in the sequenced region compared with the putative ancestral arrangement of insects, corresponding to the shuffling of trnK and trnD, translocation or remote inversion of trnY, and translocation of trnL1. All protein-coding genes start with ATN codons. Eleven protein-coding genes stop with the termination codon TAA, one with the incomplete codon TA, and one with T. Phylogenetic analysis using the Bayesian method based on all codon positions of the 13 protein-coding genes supports the monophyly of Vespidae and Formicidae. Within the Formicidae, the Myrmicinae and Formicinae form a sister lineage that is in turn sister to the Dolichoderinae, while within the Vespidae, the Eumeninae is sister to the lineage of Vespinae + Polistinae.

  8. Self-organizing approach for meta-genomes.

    PubMed

    Zhu, Jianfeng; Zheng, Wei-Mou

    2014-12-01

    We extend the self-organizing approach for annotation of a bacterial genome to analyze the raw sequencing data of the human gut metagenome without sequence assembling. The original approach divides the genomic sequence of a bacterium into non-overlapping segments of equal length and assigns to each segment one of seven 'phases', among which one is for the noncoding regions, three for the direct coding regions to indicate the three possible codon positions of the segment starting site, and three for the reverse coding regions. The noncoding phase and the six coding phases are described by two frequency tables of the 64 triplet types or 'codon usages'. A set of codon usages can be used to update the phase assignment and vice versa. An iteration after an initialization leads to a convergent phase assignment to give an annotation of the genome. In the extension of the approach to a metagenome, we consider a mixture model of a number of categories described by different codon usages. The Illumina Genome Analyzer sequencing data of the total DNA from faecal samples are then examined to understand the diversity of the human gut microbiome. Copyright © 2014 Elsevier Ltd. All rights reserved.
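
    The bookkeeping behind the phase assignment described above (cutting a sequence into equal non-overlapping segments and tabulating triplets for each possible codon position of the segment start) can be sketched as follows. This covers only the segmenting and counting step on the direct strand; the iterative update between codon usages and phase assignments, the reverse-strand phases, and the mixture model are not reproduced. The function names and the toy sequence are illustrative.

```python
from collections import Counter


def split_segments(seq, size):
    """Non-overlapping segments of equal length; a trailing remainder is dropped."""
    return [seq[i:i + size] for i in range(0, len(seq) - size + 1, size)]


def triplet_counts(segment, offset):
    """Count in-frame triplets when the first codon starts at `offset` (0, 1 or 2)."""
    return Counter(segment[i:i + 3] for i in range(offset, len(segment) - 2, 3))


toy = "ATGAAATTTGG"
segments = split_segments(toy, 9)          # one 9-base segment; "GG" is dropped
frame_tables = [triplet_counts(segments[0], k) for k in range(3)]
print(segments, frame_tables[0])
```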

  9. The complete mitochondrial genome of the ice pigeon (Columba livia breed ice).

    PubMed

    Zhang, Rui-Hua; He, Wen-Xiao

    2015-02-01

    The ice pigeon is a breed of fancy pigeon developed over many years of selective breeding. In the present work, we report the complete mitochondrial genome sequence of the ice pigeon for the first time. The total length of the mitogenome was 17,236 bp, with a base composition of 30.2% A, 24.0% T, 31.9% C, and 13.9% G, and an A-T-rich (54.2%) feature was detected. It harbored 13 protein-coding genes, 2 ribosomal RNA genes, 22 transfer RNA genes and 1 non-coding control region (D-loop region). The arrangement of all genes was identical to that of typical pigeon mitochondrial genomes. The complete mitochondrial genome sequence of the ice pigeon would serve as an important data set of the germplasm resources for further study.
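
    Base composition figures like those above are per-base percentages of the sequence, and the quoted A-T content is simply the sum of the A and T percentages. A minimal sketch follows; the toy sequence and function name are illustrative, and only the four reported percentages come from the abstract.

```python
from collections import Counter


def base_composition(seq: str) -> dict:
    """Percentage of each of A, T, C, G in `seq` (other symbols are ignored)."""
    counts = Counter(b for b in seq.upper() if b in "ATCG")
    total = sum(counts.values())
    return {b: 100.0 * counts[b] / total for b in "ATCG"}


# Toy sequence, not the mitogenome itself.
toy_composition = base_composition("AATTCCGGAT")
print(toy_composition)

# Percentages reported in the abstract above.
reported = {"A": 30.2, "T": 24.0, "C": 31.9, "G": 13.9}
at_content = round(reported["A"] + reported["T"], 1)
print(at_content)  # 54.2, matching the reported A-T content
```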

  10. Mitochondrial genome sequence of Egyptian swift Rock Pigeon (Columba livia breed Egyptian swift).

    PubMed

    Li, Chun-Hong; Shi, Wei; Shi, Wan-Yu

    2015-06-01

    The Egyptian swift Rock Pigeon is a breed of fancy pigeon developed over many years of selective breeding. In this work, we report the complete mitochondrial genome sequence of Egyptian swift Rock Pigeon. The total length of the mitogenome was 17,239 bp and its overall base composition was estimated to be 30.2% for A, 24.0% for T, 31.9% for C and 13.9% for G, indicating an A-T (54.2%)-rich feature in the mitogenome. It contained the typical structure of 13 protein-coding genes, 2 ribosomal RNA genes, 22 transfer RNA genes and a non-coding control region (D-loop region). The complete mitochondrial genome sequence of Egyptian swift Rock Pigeon would serve as an important data set of the germplasm resources for further study.

  11. The complete mitochondrial genome of the Fancy Pigeon, Columba livia (Columbiformes: Columbidae).

    PubMed

    Zhang, Rui-Hua; Xu, Ming-Ju; Wang, Cun-Lian; Xu, Tong; Wei, Dong; Liu, Bao-Jian; Wang, Guo-Hua

    2015-02-01

    Fancy pigeons are domesticated varieties of the rock pigeon developed over many years of selective breeding. In the present work, we report the complete mitochondrial genome sequence of the fancy pigeon for the first time. The total length of the mitogenome was 17,233 bp, with a base composition of 30.1% A, 24.0% T, 31.9% C, and 14.0% G, and an A-T-rich (54.2%) feature was detected. It harbored 13 protein-coding genes, 2 ribosomal RNA genes, 22 transfer RNA genes and 1 non-coding control region (D-loop region). The arrangement of all genes was identical to that of typical pigeon mitochondrial genomes. The complete mitochondrial genome sequence of the fancy pigeon would serve as an important data set of the germplasm resources for further study.

  12. The complete mitochondrial genome of Octopus bimaculatus Verrill, 1883 from the Gulf of California.

    PubMed

    Domínguez-Contreras, José Francisco; Munguia-Vega, Adrian; Ceballos-Vázquez, Bertha Patricia; García-Rodriguez, Francisco Javier; Arellano-Martinez, Marcial

    2016-11-01

    The complete mitochondrial genome of Octopus bimaculatus is 16,085 bp in length and includes 13 protein-coding genes, 2 ribosomal RNA genes, 22 transfer RNA genes, and a control region. The base composition of the genome is A (40.9%), T (34.7%), C (16.9%), and G (7.5%). The control region of O. bimaculatus contains a VNTR locus not present in the genomes of other octopus species. A phylogenetic analysis shows a closer relationship between the mitogenomes of O. bimaculatus and O. vulgaris.

  13. A CFD-based aerodynamic design procedure for hypersonic wind-tunnel nozzles

    NASA Technical Reports Server (NTRS)

    Korte, John J.

    1993-01-01

    A new procedure which unifies the best of current classical design practices, computational fluid dynamics (CFD), and optimization procedures is demonstrated for designing the aerodynamic lines of hypersonic wind-tunnel nozzles. The new procedure can be used to design hypersonic wind tunnel nozzles with thick boundary layers where the classical design procedure has been shown to break down. An efficient CFD code, which solves the parabolized Navier-Stokes (PNS) equations using an explicit upwind algorithm, is coupled to a least-squares (LS) optimization procedure. A LS problem is formulated to minimize the difference between the computed flow field and the objective function, consisting of the centerline Mach number distribution and the exit Mach number and flow angle profiles. The aerodynamic lines of the nozzle are defined using a cubic spline, the slopes of which are optimized with the design procedure. The advantages of the new procedure are that it allows full use of powerful CFD codes in the design process, solves an optimization problem to determine the new contour, can be used to design new nozzles or improve sections of existing nozzles, and automatically compensates the nozzle contour for viscous effects as part of the unified design procedure. The new procedure is demonstrated by designing two Mach 15, a Mach 12, and a Mach 18 helium nozzles. The flexibility of the procedure is demonstrated by designing the two Mach 15 nozzles using different constraints, the first nozzle for a fixed length and exit diameter and the second nozzle for a fixed length and throat diameter. The computed flow field for the Mach 15 least squares parabolized Navier-Stokes (LS/PNS) designed nozzle is compared with the classically designed nozzle and demonstrates a significant improvement in the flow expansion process and uniform core region.

  14. Does diabetes mellitus comorbidity affect in-hospital mortality and length of stay? Analysis of administrative data in an Italian Academic Hospital.

    PubMed

    Valent, Francesca; Tonutti, Laura; Grimaldi, Franco

    2017-12-01

    Hospitalized patients with comorbid diabetes mellitus may have worse outcomes than the others. We conducted a study to assess whether comorbid diabetes affects in-hospital mortality and length of stay. For this population-based study, we analyzed the administrative databases of the Regional Health Information System of the Region Friuli Venezia Giulia, where the Hospital of Udine is located. Hospital discharge data were linked at the individual patient level with the regional Diabetes Mellitus Registry to identify diabetic patients. For each 3-digit ICD-9-CM discharge diagnosis code, we assessed the difference in length of stay and in-hospital mortality between diabetic and non-diabetic patients. We conducted both univariate and multivariate analyses, adjusted for age, sex, Charlson's comorbidity score, and urgency of hospitalization, through linear and logistic regression models. After adjusting for potential confounders, diabetes significantly increased the risk of in-hospital death among patients hospitalized for bacterial pneumonia (OR = 1.94) and intestinal obstruction (OR = 4.23) and length of stay among those admitted for several diagnoses, including acute myocardial infarction and acute renal failure. Admission glucose blood level was associated with in-hospital death in patients with pneumonia and intestinal obstruction, and increased length of stay for several conditions. Patients with diabetes mellitus who are hospitalized for other health problems may have increased risk of in-hospital death and longer hospital stay. For this reason, diabetes should be promptly recognized upon admission and properly managed.

  15. Abrogation of Microsatellite-instable Tumors Using a Highly Selective Suicide Gene/Prodrug Combination

    PubMed Central

    Ferrás, Cristina; Oude Vrielink, Joachim AF; Verspuy, Johan WA; te Riele, Hein; Tsaalbi-Shtylik, Anastasia; de Wind, Niels

    2009-01-01

    A substantial fraction of sporadic and inherited colorectal and endometrial cancers in humans is deficient in DNA mismatch repair (MMR). These cancers are characterized by length alterations in ubiquitous simple sequence repeats, a phenotype called microsatellite instability. Here we have exploited this phenotype by developing a novel approach for the highly selective gene therapy of MMR-deficient tumors. To achieve this selectivity, we mutated the VP22FCU1 suicide gene by inserting an out-of-frame microsatellite within its coding region. We show that in a significant fraction of microsatellite-instable (MSI) cells carrying the mutated suicide gene, full-length protein becomes expressed within a few cell doublings, presumably resulting from a reverting frameshift within the inserted microsatellite. Treatment of these cells with the innocuous prodrug 5-fluorocytosine (5-FC) induces strong cytotoxicity and we demonstrate that this owes to multiple bystander effects conferred by the suicide gene/prodrug combination. In a mouse model, MMR-deficient tumors that contained the out-of-frame VP22FCU1 gene displayed strong remission after treatment with 5-FC, without any obvious adverse systemic effects to the mouse. By virtue of its high selectivity and potency, this conditional enzyme/prodrug combination may hold promise for the treatment or prevention of MMR-deficient cancer in humans. PMID:19471249

  16. Complete chloroplast DNA sequence from a Korean endemic genus, Megaleranthis saniculifolia, and its evolutionary implications.

    PubMed

    Kim, Young-Kyu; Park, Chong-wook; Kim, Ki-Joong

    2009-03-31

    The chloroplast DNA sequences of Megaleranthis saniculifolia, an endemic and monotypic endangered plant species, were completed in this study (GenBank FJ597983). The genome is 159,924 bp in length. It harbors a pair of IR regions consisting of 26,608 bp each. The lengths of the LSC and SSC regions are 88,326 bp and 18,382 bp, respectively. The structural organizations, gene and intron contents, gene orders, AT contents, codon usages, and transcription units of the Megaleranthis chloroplast genome are similar to those of typical land plant cp DNAs. However, the detailed features of the Megaleranthis chloroplast genome are substantially different from those of Ranunculus, which belongs to the same family, the Ranunculaceae. First, the Megaleranthis cp DNA was 4,797 bp longer than that of Ranunculus due to expansion of the IR region into the SSC region and duplicated sequence elements in several spacer regions of the Megaleranthis cp genome. Second, the chloroplast genomes of Megaleranthis and Ranunculus show 5.6% sequence divergence in the coding regions, 8.9% sequence divergence in the intron regions, and 18.7% sequence divergence in the intergenic spacer regions. In both the coding and noncoding regions, average nucleotide substitution rates differed markedly, depending on the genome position. Our data strongly implicate positional effects in the evolutionary modes of chloroplast genes. The genes showing higher levels of base substitutions also have higher incidences of indel mutations and low Ka/Ks ratios. A total of 54 simple sequence repeat loci were identified from the Megaleranthis cp genome. The existence of rich cp SSR loci in the Megaleranthis cp genome provides a rare opportunity to study the population genetic structures of this endangered species. Our phylogenetic trees based on two independent markers, the nuclear ITS and chloroplast matK sequences, strongly support the inclusion of Megaleranthis in Trollius. Therefore, our molecular trees support Ohwi's original treatment of Megaleranthis saniculifolia as Trollius chosenensis Ohwi.
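
    The region lengths reported in this record can be checked against the stated genome size. The following is a minimal sketch, assuming the usual quadripartite layout (one LSC, one SSC, two IR copies); the function name is illustrative and does not come from the source.

```python
# Region lengths in bp, as reported in the abstract above.
LSC = 88_326
SSC = 18_382
IR = 26_608  # length of each of the two inverted-repeat copies


def quadripartite_length(lsc: int, ssc: int, ir: int) -> int:
    """Total length of a genome with one LSC, one SSC and two IR copies."""
    return lsc + ssc + 2 * ir


total = quadripartite_length(LSC, SSC, IR)
print(total)  # 159924, matching the reported genome length
```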

  17. RAMICS: trainable, high-speed and biologically relevant alignment of high-throughput sequencing reads to coding DNA.

    PubMed

    Wright, Imogen A; Travers, Simon A

    2014-07-01

    The challenge presented by high-throughput sequencing necessitates the development of novel tools for accurate alignment of reads to reference sequences. Current approaches focus on using heuristics to map reads quickly to large genomes, rather than generating highly accurate alignments in coding regions. Such approaches are, thus, unsuited for applications such as amplicon-based analysis and the realignment phase of exome sequencing and RNA-seq, where accurate and biologically relevant alignment of coding regions is critical. To facilitate such analyses, we have developed a novel tool, RAMICS, that is tailored to mapping large numbers of sequence reads to short lengths (<10 000 bp) of coding DNA. RAMICS utilizes profile hidden Markov models to discover the open reading frame of each sequence and aligns to the reference sequence in a biologically relevant manner, distinguishing between genuine codon-sized indels and frameshift mutations. This approach facilitates the generation of highly accurate alignments, accounting for the error biases of the sequencing machine used to generate reads, particularly at homopolymer regions. Performance improvements are gained through the use of graphics processing units, which increase the speed of mapping through parallelization. RAMICS substantially outperforms all other mapping approaches tested in terms of alignment quality while maintaining highly competitive speed performance. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
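
    The distinction the abstract draws between codon-sized indels and frameshifts comes down to whether the indel length is a multiple of three. The sketch below shows only that criterion. It is not the RAMICS method, which the abstract describes as based on profile hidden Markov models, and the function name is hypothetical.

```python
def classify_indel(length_bp: int) -> str:
    """Label an indel by its effect on the reading frame.

    Assumption for illustration: only the indel length is considered,
    not its position relative to codon boundaries.
    """
    if length_bp <= 0:
        raise ValueError("indel length must be a positive number of bases")
    return "codon-sized indel" if length_bp % 3 == 0 else "frameshift"


for n in (1, 2, 3, 6, 7):
    print(n, classify_indel(n))
```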

  18. Kangaroo – A pattern-matching program for biological sequences

    PubMed Central

    2002-01-01

    Background: Biologists are often interested in performing a simple database search to identify proteins or genes that contain a well-defined sequence pattern. Many databases do not provide straightforward or readily available query tools to perform simple searches, such as identifying transcription binding sites, protein motifs, or repetitive DNA sequences. However, in many cases simple pattern-matching searches can reveal a wealth of information. We present in this paper a regular expression pattern-matching tool that was used to identify short repetitive DNA sequences in human coding regions for the purpose of identifying potential mutation sites in mismatch repair deficient cells. Results: Kangaroo is a web-based regular expression pattern-matching program that can search for patterns in DNA, protein, or coding region sequences in ten different organisms. The program is implemented to facilitate a wide range of queries with no restriction on the length or complexity of the query expression. The program is accessible on the web at http://bioinfo.mshri.on.ca/kangaroo/ and the source code is freely distributed at http://sourceforge.net/projects/slritools/. Conclusion: A low-level simple pattern-matching application can prove to be a useful tool in many research settings. For example, Kangaroo was used to identify potential genetic targets in a human colorectal cancer variant that is characterized by a high frequency of mutations in coding regions containing mononucleotide repeats. PMID:12150718
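
    The kind of search described here, finding mononucleotide repeats in a coding sequence, can be illustrated with an ordinary regular expression. This is a sketch using Python's standard `re` module, not Kangaroo's own code or query syntax. The function name and the default threshold of 8 bases are assumptions chosen for the example.

```python
import re


def mononucleotide_runs(seq: str, min_len: int = 8):
    """Return (start, base, length) for each run of one base of at least min_len."""
    if min_len < 1:
        raise ValueError("min_len must be at least 1")
    # One base captured, then the same base repeated min_len - 1 or more times.
    pattern = re.compile(r"([ACGT])\1{%d,}" % (min_len - 1))
    return [
        (m.start(), m.group(1), len(m.group(0)))
        for m in pattern.finditer(seq.upper())
    ]


example = "ACG" + "A" * 8 + "GT"
print(mononucleotide_runs(example))  # [(3, 'A', 8)]
```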

  19. Database-independent Protein Sequencing (DiPS) Enables Full-length de Novo Protein and Antibody Sequence Determination.

    PubMed

    Savidor, Alon; Barzilay, Rotem; Elinger, Dalia; Yarden, Yosef; Lindzen, Moshit; Gabashvili, Alexandra; Adiv Tal, Ophir; Levin, Yishai

    2017-06-01

    Traditional "bottom-up" proteomic approaches use proteolytic digestion, LC-MS/MS, and database searching to elucidate peptide identities and their parent proteins. Protein sequences absent from the database cannot be identified, and even if present in the database, complete sequence coverage is rarely achieved even for the most abundant proteins in the sample. Thus, sequencing of unknown proteins such as antibodies or constituents of metaproteomes remains a challenging problem. To date, there is no available method for full-length protein sequencing, independent of a reference database, in high throughput. Here, we present Database-independent Protein Sequencing, a method for unambiguous, rapid, database-independent, full-length protein sequencing. The method is a novel combination of non-enzymatic, semi-random cleavage of the protein, LC-MS/MS analysis, peptide de novo sequencing, extraction of peptide tags, and their assembly into a consensus sequence using an algorithm named "Peptide Tag Assembler." As proof-of-concept, the method was applied to samples of three known proteins representing three size classes and to a previously un-sequenced, clinically relevant monoclonal antibody. Excluding leucine/isoleucine and glutamic acid/deamidated glutamine ambiguities, end-to-end full-length de novo sequencing was achieved with 99-100% accuracy for all benchmarking proteins and the antibody light chain. Accuracy of the sequenced antibody heavy chain, including the entire variable region, was also 100%, but there was a 23-residue gap in the constant region sequence. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.

  20. Analysis of polyglutamine-coding repeats in the TATA-binding protein in different human populations and in patients with schizophrenia and bipolar affective disorder

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Rubinsztein, D.C.; Leggo, J.; Crow, T.J.

    A new class of disease (including Huntington disease, Kennedy disease, and spinocerebellar ataxias types 1 and 3) results from abnormal expansions of CAG trinucleotides in the coding regions of genes. In all of these diseases the CAG repeats are thought to be translated into polyglutamine tracts. There is accumulating evidence arguing for CAG trinucleotide expansions as one of the causative disease mutations in schizophrenia and bipolar affective disorder. We and others believe that the TATA-binding protein (TBP) is an important candidate to investigate in these diseases as it contains a highly polymorphic stretch of glutamine codons, which are close to the threshold length where the polyglutamine tracts start to be associated with disease. Thus, we examined the lengths of this polyglutamine repeat in normal unrelated East Anglians, South African Blacks, sub-Saharan Africans mainly from Nigeria, and Asian Indians. We also examined 43 bipolar affective disorder patients and 65 schizophrenic patients. The range of polyglutamine tract lengths that we found in humans was 26-42 codons. No patients with bipolar affective disorder or schizophrenia had abnormal expansions at this locus. 22 refs., 1 tab.

  1. Molecular Cloning, Expression Profile and 5′ Regulatory Region Analysis of Two Chemosensory Protein Genes from the Diamondback Moth, Plutella xylostella

    PubMed Central

    Gong, Liang; Zhong, Guo-Hua; Hu, Mei-Ying; Luo, Qian; Ren, Zhen-Zhen

    2010-01-01

    Chemosensory proteins play an important role in transporting chemical compounds to their receptors on dendrite membranes. In this study, two full-length cDNAs coding for chemosensory proteins of Plutella xylostella (Lepidoptera: Plutellidae) were obtained by RACE-PCR. Pxyl-CSP3 and Pxyl-CSP4, with GenBank accession numbers ABM92663 and ABM92664, respectively, were cloned and sequenced. The gene sequences both consisted of three exons and two introns. RT-PCR analysis showed that Pxyl-CSP3 and Pxyl-CSP4 had different expression patterns in the examined developmental stages, but were expressed in all larval stages. Phylogenetic analysis indicated that the CSPs of lepidopteran insects form three branches, and Pxyl-CSP3 and Pxyl-CSP4 belong to different branches. The 5′ regulatory regions of Pxyl-CSP3 and Pxyl-CSP4 were isolated and analyzed, and were found to contain not only the core promoter sequences (TATA-box), but also several transcriptional elements (BR-C Z4, Hb, Dfd, CF2-II, etc.). This study provides clues to better understanding the various physiological functions of CSPs in P. xylostella and other insects. PMID:21073345

  2. Trinucleotide repeat length and progression of illness in Huntington's disease.

    PubMed

    Kieburtz, K; MacDonald, M; Shih, C; Feigin, A; Steinberg, K; Bordwell, K; Zimmerman, C; Srinidhi, J; Sotack, J; Gusella, J

    1994-11-01

    The genetic defect causing Huntington's disease (HD) has been identified as an unstable expansion of a trinucleotide (CAG) repeat sequence within the coding region of the IT15 gene on chromosome 4. In 50 patients with manifest HD who were evaluated prospectively and uniformly, we examined the relationship between the extent of the DNA expansion and the rate of illness progression. Although the length of CAG repeats showed a strong inverse correlation with the age at onset of HD, there was no such relationship between the number of CAG repeats and the rate of clinical decline. These findings suggest that the CAG repeat length may influence or trigger the onset of HD, but other genetic, neurobiological, or environmental factors contribute to the progression of illness and the underlying pace of neuronal degeneration.

  3. Polypeptide having or assisting in carbohydrate material degrading activity and uses thereof

    DOEpatents

    Schooneveld-Bergmans, Margot Elisabeth Francoise; Heijne, Wilbert Herman Marie; Los, Alrik Pieter

    2016-02-16

    The invention relates to a polypeptide which comprises the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 76% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 76% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
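
    The claims in this and the following patent records depend on a percent sequence identity threshold. The abstract does not say how identity is computed, so the sketch below assumes one common convention: identical columns divided by the number of columns in a pairwise alignment, with gaps written as "-". The function name and the toy sequences are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of alignment columns holding the same residue in both rows.

    Assumption for illustration: both inputs are rows of one pairwise
    alignment (equal length, '-' for gaps); columns that are gaps in both
    rows are ignored.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [
        (a, b) for a, b in zip(aligned_a, aligned_b)
        if not (a == "-" and b == "-")
    ]
    if not columns:
        raise ValueError("alignment has no informative columns")
    matches = sum(1 for a, b in columns if a == b)
    return 100.0 * matches / len(columns)


identity = percent_identity("MKTAYIAK", "MKTA-IAR")
print(identity)        # 75.0
print(identity >= 76)  # False: below the 76% threshold quoted above
```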

  4. Polypeptide having beta-glucosidase activity and uses thereof

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Schooneveld-Bergmans, Margot Elisabeth Francoise; Heijne, Wilbert Herman Marie; De Jong, Rene Marcel

    The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 96% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 96% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.

  5. Polypeptide having swollenin activity and uses thereof

    DOEpatents

    Schooneveld-Bergmans, Margot Elisabeth Francoise; Heijne, Wilbert Herman Marie; Vlasie, Monica D; Damveld, Robbertus Antonius

    2015-11-04

    The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 73% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 73% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.

  6. Polypeptide having beta-glucosidase activity and uses thereof

    DOEpatents

    Schooneveld-Bergmans, Margot Elisabeth Francoise; Heijne, Wilbert Herman Marie; De Jong, Rene Marcel; Damveld, Robbertus Antonius

    2015-09-01

    The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 70% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 70% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.

  7. Polypeptide having cellobiohydrolase activity and uses thereof

    DOEpatents

    Sagt, Cornelis Maria Jacobus; Schooneveld-Bergmans, Margot Elisabeth Francoise; Roubos, Johannes Andries; Los, Alrik Pieter

    2015-09-15

    The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 93% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 93% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.

  8. Polypeptide having acetyl xylan esterase activity and uses thereof

    DOEpatents

    Schooneveld-Bergmans, Margot Elisabeth Francoise; Heijne, Wilbert Herman Marie; Los, Alrik Pieter

    2015-10-20

    The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 82% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 82% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.

  9. Polypeptide having carbohydrate degrading activity and uses thereof

    DOEpatents

    Schooneveld-Bergmans, Margot Elisabeth Francoise; Heijne, Wilbert Herman Marie; Vlasie, Monica Diana; Damveld, Robbertus Antonius

    2015-08-18

    The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 73% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 73% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.

  10. Structure, organization and expression of common carp (Cyprinus carpio L.) SLP-76 gene.

    PubMed

    Huang, Rong; Sun, Xiao-Feng; Hu, Wei; Wang, Ya-Ping; Guo, Qiong-Lin

    2008-05-01

    SLP-76 is an important member of the SLP-76 family of adapters, and it plays a key role in TCR signaling and T cell function. A partial cDNA sequence of SLP-76 of common carp (Cyprinus carpio L.) was isolated from a thymus cDNA library by suppression subtractive hybridization (SSH). Subsequently, the full-length cDNA of carp SLP-76 was obtained by means of 3' RACE and 5' RACE. The full-length cDNA of carp SLP-76 was 2007 bp, consisting of a 5'-terminal untranslated region (UTR) of 285 bp, a 3'-terminal UTR of 240 bp, and an open reading frame of 1482 bp. Sequence comparison showed that the deduced amino acid sequence of carp SLP-76 had an overall similarity of 34-73% to homologues from other species, and it was composed of an NH2-terminal domain, a central proline-rich domain, and a C-terminal SH2 domain. Amino acid sequence analysis indicated the existence of a Gads binding site R-X-X-K, a 10-aa-long sequence which binds to the SH3 domain of LCK in vitro, and three conserved tyrosine-containing sequences in the NH2-terminal domain. We then used PCR to obtain a genomic DNA fragment covering the entire coding region of carp SLP-76. In the 9.2-kb genomic sequence, twenty-one exons and twenty introns were identified. RT-PCR results showed that carp SLP-76 was expressed predominantly in hematopoietic tissues, and was upregulated in thymus tissue of four-month-old carp compared with one-year-old carp. RT-PCR and virtual northern hybridization results showed that carp SLP-76 was also upregulated in thymus tissue of GH transgenic carp at four months of age. These results suggest that the expression level of the SLP-76 gene may be related to thymocyte development in teleosts.
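
    The lengths given in this record are internally consistent, as a short check shows. This sketch uses only the numbers stated in the abstract; the variable names are illustrative.

```python
# Values in bp, as reported in the abstract above.
FIVE_PRIME_UTR = 285
ORF = 1482
THREE_PRIME_UTR = 240

cdna_length = FIVE_PRIME_UTR + ORF + THREE_PRIME_UTR
print(cdna_length)   # 2007, the reported full-length cDNA size
print(ORF % 3 == 0)  # True: the ORF is a whole number of codons

# A linear gene with n exons has n - 1 introns between them.
exons = 21
introns = exons - 1
print(introns)       # 20, as reported
```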

  11. Splice variants and promoter methylation status of the Bovine Vasa Homology (Bvh) gene may be involved in bull spermatogenesis

    PubMed Central

    2013-01-01

    Background: Vasa is a member of the DEAD-box protein family that plays an indispensable role in mammalian spermatogenesis, particularly during meiosis. Bovine vasa homology (Bvh) of Bos taurus has been reported; however, its function in bovine testicular tissue remains obscure. This study aimed to reveal the functions of Bvh, to determine whether Bvh is a candidate gene in the regulation of bovine spermatogenesis, and to illustrate whether its transcription is regulated by alternative splicing and DNA methylation. Results: Here we report the molecular characterization, alternative splicing pattern, expression and promoter methylation status of Bvh. The full-length coding region of Bvh was 2190 bp, which encodes a 729 amino acid (aa) protein containing nine consensus regions of the DEAD box protein family. Bvh is expressed only in the ovary and testis of adult cattle. Two splice variants were identified and termed Bvh-V4 (2112 bp and 703 aa) and Bvh-V45 (2040 bp and 679 aa). In male cattle, full-length Bvh (Bvh-FL), Bvh-V4 and Bvh-V45 are exclusively expressed in the testes in the ratio of 2.2:1.6:1, respectively. Real-time PCR revealed significantly reduced mRNA expression of Bvh-FL, Bvh-V4 and Bvh-V45 in testes of cattle-yak hybrids with meiotic arrest, compared with cattle and yaks with normal spermatogenesis (P < 0.01). The promoter methylation level of Bvh in the testes of cattle-yak hybrids was significantly greater than in cattle and yaks (P < 0.01). Conclusion: In the present study, Bvh was isolated and characterized. These data suggest that Bvh functions in bovine spermatogenesis, and that transcription of the gene in testes is regulated by alternative splicing and promoter methylation. PMID:23815438
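
    The coding-region lengths and protein lengths quoted for Bvh and its splice variants agree if each coding region is taken to include its stop codon. That reading is an assumption made for this sketch, since the abstract does not state it, and the function name is hypothetical.

```python
def protein_length(cds_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by a coding sequence of cds_bp bases."""
    if cds_bp <= 0 or cds_bp % 3 != 0:
        raise ValueError("coding sequence length must be a positive multiple of 3")
    codons = cds_bp // 3
    # The stop codon occupies one codon but adds no residue.
    return codons - 1 if includes_stop else codons


# Coding-region lengths (bp) reported in the abstract above.
for name, bp in (("Bvh-FL", 2190), ("Bvh-V4", 2112), ("Bvh-V45", 2040)):
    print(name, protein_length(bp))
# Bvh-FL 729
# Bvh-V4 703
# Bvh-V45 679
```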

  12. Using adaptive-mesh refinement in SCFT simulations of surfactant adsorption

    NASA Astrophysics Data System (ADS)

    Sides, Scott; Kumar, Rajeev; Jamroz, Ben; Crockett, Robert; Pletzer, Alex

    2013-03-01

    Adsorption of surfactants at interfaces is relevant to many applications such as detergents, adhesives, emulsions and ferrofluids. Atomistic simulations of interface adsorption are challenging due to the difficulty of modeling the wide range of length scales in these problems: the thin interface region in equilibrium with a large bulk region that serves as a reservoir for the adsorbed species. Self-consistent field theory (SCFT) has been extremely useful for studying the morphologies of dense block copolymer melts. Field-theoretic simulations such as these are able to access large length and time scales that are difficult or impossible for particle-based simulations such as molecular dynamics. However, even SCFT methods can be difficult to apply to systems in which small spatial regions might require finer resolution than most of the simulation grid (e.g., interface adsorption and confinement). We will present results on interface adsorption simulations using PolySwift++, an object-oriented, polymer SCFT simulation code aided by the Tech-X Chompst library, which enables block-structured AMR calculations with PETSc.

  13. Human homolog of the mouse sperm receptor

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Chamberlin, M.E.; Dean, J.

    1990-08-01

    The human zona pellucida, composed of three glycoproteins (ZP1, ZP2, and ZP3), forms an extracellular matrix that surrounds ovulated eggs and mediates species-specific fertilization. The genes that code for at least two of the zona proteins (ZP2 and ZP3) cross-hybridize with other mammalian DNA. The recently characterized mouse sperm receptor gene (Zp-3) was used to isolate its human homolog. The human homolog spans ~18.3 kilobase pairs (kbp) (compared to 8.6 kbp for the mouse gene) and contains eight exons, the sizes of which are strictly conserved between the two species. Four short (8-15 bp) sequences within the first 250 bp of the 5′ flanking region in the human Zp-3 homolog are also present upstream of mouse Zp-3. These elements may modulate oocyte-specific gene expression. By using the polymerase chain reaction, a full-length cDNA of human ZP3 was isolated from human ovarian poly(A)+ RNA and used to deduce the structure of human ZP3 mRNA. Certain features of the human and mouse ZP3 transcripts are conserved. Both have unusually short 5′ and 3′ untranslated regions, both contain a single open reading frame that is 74% identical, and both code for 424 amino acid polypeptides that are 67% identical. The similarity between the two proteins may define domains that are important in maintaining the structural integrity of the zona pellucida, while the differences may play a role in mediating the species-specific events of mammalian fertilization.

  14. 3D radiation belt diffusion model results using new empirical models of whistler chorus and hiss

    NASA Astrophysics Data System (ADS)

    Cunningham, G.; Chen, Y.; Henderson, M. G.; Reeves, G. D.; Tu, W.

    2012-12-01

    3D diffusion codes model the energization, radial transport, and pitch angle scattering due to wave-particle interactions. Diffusion codes are powerful but are limited by the lack of knowledge of the spatial and temporal distribution of waves that drive the interactions for a specific event. We present results from the 3D DREAM model using diffusion coefficients driven by new, activity-dependent, statistical models of chorus and hiss waves. Most 3D codes parameterize the diffusion coefficients or wave amplitudes as functions of magnetic activity indices like Kp, AE, or Dst. These functional representations produce the average value of the wave intensities for a given level of magnetic activity; however, the variability of the wave population at a given activity level is lost with such a representation. Our 3D code makes use of the full sample distributions contained in a set of empirical wave databases (one database for each wave type, including plasmaspheric hiss, lower- and upper-band chorus) that were recently produced by our team using CRRES and THEMIS observations. The wave databases store the full probability distribution of observed wave intensity binned by AE, MLT, MLAT and L*. In this presentation, we show results that make use of the wave intensity sample probability distributions for lower-band and upper-band chorus by sampling the distributions stochastically during a representative CRRES-era storm. The sampling of the wave intensity probability distributions produces a collection of possible evolutions of the phase space density, which quantifies the uncertainty in the model predictions caused by the uncertainty of the chorus wave amplitudes for a specific event. A significant issue is the determination of an appropriate model for the spatio-temporal correlations of the wave intensities, since the diffusion coefficients are computed as spatio-temporal averages of the waves over MLT, MLAT and L*. The spatio-temporal correlations cannot be inferred from the wave databases. In this study we use a temporal correlation of ~1 hour for the sampled wave intensities that is informed by the observed autocorrelation in the AE index, a spatial correlation length of ~100 km in the two directions perpendicular to the magnetic field, and a spatial correlation length of 5000 km in the direction parallel to the magnetic field, according to the work of Santolik et al. (2003), who used multi-spacecraft measurements from Cluster to quantify the correlation length scales for equatorial chorus. We find that, despite the small correlation length scale for chorus, there remains significant variability in the model outcomes driven by variability in the chorus wave intensities.

  15. Molecular cloning, mRNA expression and tissue distribution analysis of Slc7a11 gene in alpaca (Lama paco) skins associated with different coat colors.

    PubMed

    Tian, Xue; Meng, Xiaolin; Wang, Liangyan; Song, Yunfei; Zhang, Danli; Ji, Yuankai; Li, Xuejun; Dong, Changsheng

    2015-01-25

    Slc7a11, encoding solute carrier family 7 member 11 (anionic amino acid transporter light chain, xCT), has been identified as a critical genetic regulator of pheomelanin synthesis in hair and melanocytes. To better understand the molecular characterization of Slc7a11 and the expression patterns in skin of white versus brown alpaca (Lama paco), we cloned the full length coding sequence (CDS) of the alpaca Slc7a11 gene and analyzed the expression patterns using Real Time PCR, Western blotting and immunohistochemistry. The full length CDS of 1512 bp encodes a 503 amino acid polypeptide. Sequence analysis showed that alpaca xCT contains 12 transmembrane regions consistent with the highly conserved amino acid permease (AA_permease_2) domain similar to other vertebrates. Sequence alignment and phylogenetic analysis revealed that alpaca xCT had the highest identity and shared the same branch with Camelus ferus. Real Time PCR and Western blotting suggested that xCT was expressed at significantly high levels in brown alpaca skin, and transcripts and protein possessed the same expression pattern in white and brown alpaca skins. Additionally, immunohistochemical analysis further demonstrated that xCT staining was robustly increased in the matrix and root sheath of brown alpaca skin compared with that of white. These results suggest that Slc7a11 functions in alpaca coat color regulation and offer essential information for further exploration of the role of Slc7a11 in melanogenesis. Copyright © 2014 Elsevier B.V. All rights reserved.

  16. Computer program for calculating full potential transonic, quasi-three-dimensional flow through a rotating turbomachinery blade row

    NASA Technical Reports Server (NTRS)

    Farrell, C. A.

    1982-01-01

    A fast, reliable computer code is described for calculating the flow field about a cascade of arbitrary two-dimensional airfoils. The method approximates the three-dimensional flow in a turbomachinery blade row by correcting for stream tube convergence and radius change in the throughflow direction. A fully conservative solution of the full potential equation is combined with the finite volume technique on a body-fitted periodic mesh, with an artificial density imposed in the transonic region to ensure stability and the capture of shock waves. The instructions required to set up and use the code are included. The name of the code is QSONIC. A numerical example is also given to illustrate the output of the program.

  17. Optimizing the use of a sensor resource for opponent polarization coding

    PubMed Central

    Heras, Francisco J.H.

    2017-01-01

    Flies use specialized photoreceptors R7 and R8 in the dorsal rim area (DRA) to detect skylight polarization. R7 and R8 form a tiered waveguide (central rhabdomere pair, CRP) with R7 on top, filtering light delivered to R8. We examine how the division of a given resource, CRP length, between R7 and R8 affects their ability to code polarization angle. We model optical absorption to show how the length fractions allotted to R7 and R8 determine the rates at which they transduce photons, and correct these rates for transduction unit saturation. The rates give polarization signal and photon noise in R7, and in R8. Their signals are combined in an opponent unit, intrinsic noise added, and the unit’s output analysed to extract two measures of coding ability, number of discriminable polarization angles and mutual information. A very long R7 maximizes opponent signal amplitude, but codes inefficiently due to photon noise in the very short R8. Discriminability and mutual information are optimized by maximizing signal to noise ratio, SNR. At lower light levels approximately equal lengths of R7 and R8 are optimal because photon noise dominates. At higher light levels intrinsic noise comes to dominate and a shorter R8 is optimal. The optimum R8 length fraction falls to one third. This intensity-dependent range of optimal length fractions corresponds to the range observed in different fly species and is not affected by transduction unit saturation. We conclude that a limited resource, rhabdom length, can be divided between two polarization sensors, R7 and R8, to optimize opponent coding. We also find that coding ability increases sub-linearly with total rhabdom length, according to the law of diminishing returns. Consequently, the specialized shorter central rhabdom in the DRA codes polarization twice as efficiently with respect to rhabdom length as the longer rhabdom used in the rest of the eye. PMID:28316880

  18. TCOF1 gene encodes a putative nucleolar phosphoprotein that exhibits mutations in Treacher Collins Syndrome throughout its coding region

    PubMed Central

    Wise, Carol A.; Chiang, Lydia C.; Paznekas, William A.; Sharma, Mridula; Musy, Maurice M.; Ashley, Jennifer A.; Lovett, Michael; Jabs, Ethylin W.

    1997-01-01

    Treacher Collins Syndrome (TCS) is the most common of the human mandibulofacial dysostosis disorders. Recently, a partial TCOF1 cDNA was identified and shown to contain mutations in TCS families. Here we present the entire exon/intron genomic structure and the complete coding sequence of TCOF1. TCOF1 encodes a low complexity protein of 1,411 amino acids, whose predicted protein structure reveals repeated motifs that mirror the organization of its exons. These motifs are shared with nucleolar trafficking proteins in other species and are predicted to be highly phosphorylated by casein kinase. Consistent with this, the full-length TCOF1 protein sequence also contains putative nuclear and nucleolar localization signals. Throughout the open reading frame, we detected an additional eight mutations in TCS families and several polymorphisms. We postulate that TCS results from defects in a nucleolar trafficking protein that is critically required during human craniofacial development. PMID:9096354

  19. Complete mitochondrial DNA sequence of the Eastern keelback mullet Liza affinis.

    PubMed

    Gong, Xiaoling; Zhu, Wenjia; Bao, Baolong

    2016-05-01

    Eastern keelback mullet (Liza affinis) inhabits inlet waters and estuaries of rivers. In this paper, we initially determined the complete mitochondrial genome of Liza affinis. The entire mtDNA sequence is 16,831 bp in length, including 2 rRNA genes, 22 tRNA genes, 13 protein-coding genes and 1 putative control region. Its order and numbers of genes are similar to most bony fishes.

  20. Resource utilization in primary repair of cleft lip.

    PubMed

    Owusu, James A; Liu, Meixia; Sidman, James D; Scott, Andrew R

    2013-03-01

    To determine national variations in resource utilization for primary repair of cleft lip, identify patient and institutional factors associated with high resource use, and estimate the current incidence of cleft lip in the United States. Retrospective analysis of a national, pediatric database (2009 Kids' Inpatient Database [KID]). Patients aged 1 year and younger were selected using international classification of disease codes for cleft lip and procedure codes for cleft lip repair. A number of demographic variables were analyzed, and hospital charges were considered as a measure of resource utilization. There were 1318 patients identified. The national incidence was 0.09%, with a male to female ratio of 1.8:1. Regional incidence varied from 0.07% (Northeast) to 0.10% (West). The mean age at surgery was 4.2 months. The average length of stay was 1.4 days. The national average hospital charge was $20,147, ranging from $14,635 (South) to $23,663 (West). Teaching hospitals charge an average of $9764 higher than nonteaching hospitals. The strongest predictor of charge was length of stay, increasing charge by $8102 for every additional hospital day (P < .01). Regional variations exist in resource utilization for primary cleft lip repair. Resource use is higher in the West and among teaching hospitals.

  1. Comparison of FDNS liquid rocket engine plume computations with SPF/2

    NASA Technical Reports Server (NTRS)

    Kumar, G. N.; Griffith, D. O., II; Warsi, S. A.; Seaford, C. M.

    1993-01-01

    Prediction of a plume's shape and structure is essential to the evaluation of base region environments. The JANNAF standard plume flowfield analysis code SPF/2 predicts plumes well, but cannot analyze base regions. Full Navier-Stokes CFD codes can calculate both zones; however, before they can be used, they must be validated. The CFD code FDNS3D (Finite Difference Navier-Stokes Solver) was used to analyze the single plume of a Space Transportation Main Engine (STME) and comparisons were made with SPF/2 computations. Both frozen and finite rate chemistry models were employed as well as two turbulence models in SPF/2. The results indicate that FDNS3D plume computations agree well with SPF/2 predictions for liquid rocket engine plumes.

  2. The mitochondrial genome of the multicolored Asian lady beetle Harmonia axyridis (Pallas) and a phylogenetic analysis of the Polyphaga (Insecta: Coleoptera).

    PubMed

    Niu, Fang-Fang; Zhu, Liang; Wang, Su; Wei, Shu-Jun

    2016-07-01

    Here, we report the mitochondrial genome sequence of the multicolored Asian lady beetle Harmonia axyridis (Pallas, 1773) (Coleoptera: Coccinellidae) (GenBank accession No. KR108208). This is the first species with a sequenced mitochondrial genome from the genus Harmonia. The current length of this mitochondrial genome, with a partial A + T-rich region, is 16,387 bp. All the typical genes were sequenced except trnI and trnQ. As in most other sequenced mitochondrial genomes of Coleoptera, there is no re-arrangement in the sequenced region compared with the putative ancestral arrangement of insects. All protein-coding genes start with ATN codons. Five, five and three protein-coding genes stop with termination codon TAA, TA and T, respectively. Phylogenetic analysis using a Bayesian method based on the first and second codon positions of the protein-coding genes supported that the Scirtidae is a basal lineage of Polyphaga. The Harmonia and the Coccinella form a sister lineage. The monophyly of Staphyliniformia, Scarabaeiformia and Cucujiformia was supported. The Buprestidae was found to be a sister group to the Bostrichiformia.

  3. A long constraint length VLSI Viterbi decoder for the DSN

    NASA Technical Reports Server (NTRS)

    Statman, J. I.; Zimmerman, G.; Pollara, F.; Collins, O.

    1988-01-01

    A Viterbi decoder, capable of decoding convolutional codes with constraint lengths up to 15, is under development for the Deep Space Network (DSN). The objective is to complete a prototype of this decoder by late 1990, and demonstrate its performance using the (15, 1/4) encoder in Galileo. The decoder is expected to provide 1 to 2 dB improvement in bit SNR, compared to the present (7, 1/2) code and existing Maximum Likelihood Convolutional Decoder (MCD). The decoder will be fully programmable for any code up to constraint length 15, and code rate 1/2 to 1/6. The decoder architecture and top-level design are described.
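
    The number of trellis states a Viterbi decoder has to track grows as 2^(K-1) with the constraint length K, which is why K = 15 pushes the design toward dedicated hardware. The following is a minimal sketch of hard-decision Viterbi decoding on a toy K = 3, rate 1/2 code. The generators (7, 5) octal and all function names are assumptions chosen for illustration; the abstract does not give generator polynomials or any implementation detail of the DSN decoder.

```python
def num_states(K):
    """Trellis states for constraint length K (K-1 memory bits)."""
    return 1 << (K - 1)

def parity(x):
    return bin(x).count("1") & 1

def encode(bits, gens, K):
    """Rate 1/len(gens) convolutional encoder; appends K-1 zero tail bits."""
    state = 0  # the K-1 most recent input bits
    out = []
    for b in list(bits) + [0] * (K - 1):
        reg = (b << (K - 1)) | state      # newest bit in the top position
        out.extend(parity(reg & g) for g in gens)
        state = reg >> 1                  # drop the oldest bit
    return out

def viterbi(received, gens, K):
    """Hard-decision Viterbi decoder using Hamming distance as branch metric."""
    n = len(gens)
    n_states = num_states(K)
    INF = float("inf")
    metric = [0] + [INF] * (n_states - 1)   # encoder starts in state 0
    paths = [[] for _ in range(n_states)]
    for t in range(0, len(received), n):
        chunk = received[t:t + n]
        new_metric = [INF] * n_states
        new_paths = [None] * n_states
        for s in range(n_states):
            if metric[s] == INF:
                continue
            for b in (0, 1):
                reg = (b << (K - 1)) | s
                expected = [parity(reg & g) for g in gens]
                m = metric[s] + sum(e != r for e, r in zip(expected, chunk))
                ns = reg >> 1
                if m < new_metric[ns]:      # keep the survivor path
                    new_metric[ns] = m
                    new_paths[ns] = paths[s] + [b]
        metric, paths = new_metric, new_paths
    return paths[0][:-(K - 1)]              # tail forces state 0; strip tail bits

msg = [1, 0, 1, 1, 0, 0, 1]
gens = (0b111, 0b101)                       # illustrative (7, 5) octal generators
noisy = encode(msg, gens, 3)
noisy[3] ^= 1                               # inject a single channel bit error
decoded = viterbi(noisy, gens, 3)
```

    In this sketch `num_states(15)` evaluates to 16384, compared with 64 states for a K = 7 code, which gives a feel for the jump in complexity the abstract describes.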

  4. SecureQEMU: Emulation-Based Software Protection Providing Encrypted Code Execution and Page Granularity Code Signing

    DTIC Science & Technology

    2008-12-01

    SHA256_DIGEST_LENGTH)); peAddSection(&sFile, ".SigStub", dwStubSecSize, dwStubSecSize); 169 peSecure(&sFile, deqAddrSize...deqAuthPageAddrSize.size()/2) * (8 + SHA256_DIGEST_LENGTH)) + 16; bCode[34] = ((char*)&dwSize)[0]; bCode[35] = ((char*)&dwSize)[1...2) * (8 + SHA256_DIGEST_LENGTH...)); AES_KEY aesKey; unsigned char ivsalt[16], temp_iv[16]; 739 unsigned char *key

  5. Performance of convolutional codes on fading channels typical of planetary entry missions

    NASA Technical Reports Server (NTRS)

    Modestino, J. W.; Mui, S. Y.; Reale, T. J.

    1974-01-01

    The performance of convolutional codes in fading channels typical of the planetary entry channel is examined in detail. The signal fading is due primarily to turbulent atmospheric scattering of the RF signal transmitted from an entry probe through a planetary atmosphere. Short constraint length convolutional codes are considered in conjunction with binary phase-shift keyed modulation and Viterbi maximum likelihood decoding, and for longer constraint length codes sequential decoding utilizing both the Fano and Zigangirov-Jelinek (ZJ) algorithms is considered. Careful consideration is given to the modeling of the channel in terms of a few meaningful parameters which can be correlated closely with theoretical propagation studies. For short constraint length codes the bit error probability performance was investigated as a function of E_b/N_0 parameterized by the fading channel parameters. For longer constraint length codes the effect of the fading channel parameters on the computational requirements of both the Fano and ZJ algorithms was examined. The effect of simple block interleaving in combatting the memory of the channel is explored, using the analytic approach or digital computer simulation.
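
    Block interleaving counters channel memory by writing coded symbols into a matrix row by row and transmitting them column by column, so that a burst of consecutive channel errors lands on widely separated symbols after deinterleaving. A minimal sketch follows; the 3 x 4 dimensions and the function names are hypothetical and are not taken from the report.

```python
def interleave(symbols, rows, cols):
    """Write row-wise into a rows x cols block, read out column-wise."""
    assert len(symbols) == rows * cols
    return [symbols[r * cols + c] for c in range(cols) for r in range(rows)]

def deinterleave(symbols, rows, cols):
    """Inverse of interleave(): write column-wise, read out row-wise."""
    assert len(symbols) == rows * cols
    return [symbols[c * rows + r] for r in range(rows) for c in range(cols)]

data = list(range(12))            # symbol indices stand in for coded symbols
tx = interleave(data, 3, 4)
burst = tx[3:6]                   # three consecutive symbols hit by a fade
restored = deinterleave(tx, 3, 4)
```

    With these dimensions the three symbols in `burst` come from original positions spaced `cols` apart, so a decoder working on the deinterleaved stream sees isolated errors rather than one contiguous burst.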

  6. Characterization of the complete mitochondrial genome of the king pigeon (Columba livia breed king).

    PubMed

    Zhang, Rui-Hua; He, Wen-Xiao; Xu, Tong

    2015-06-01

    The king pigeon is a breed of pigeon developed over many years of selective breeding primarily as a utility breed. In the present work, we report the complete mitochondrial genome sequence of king pigeon for the first time. The total length of the mitogenome was 17,221 bp with the base composition of 30.14% for A, 24.05% for T, 31.82% for C, and 13.99% for G and an A-T (54.22 %)-rich feature was detected. It harbored 13 protein-coding genes, two ribosomal RNA genes, 22 transfer RNA genes, and one non-coding control region (D-loop region). The arrangement of all genes was identical to the typical mitochondrial genomes of pigeon. The complete mitochondrial genome sequence of king pigeon would serve as an important data set of the germplasm resources for further study.

  7. Complete mitogenome sequencing and phylogenetic analysis of PaLi yak (Bos grunniens).

    PubMed

    Bao, Pengjia; Guo, Xian; Pei, Jie; Liang, Chunnian; Ding, Xuezhi; Min, Chu; Wang, Hongbo; Wu, Xiaoyun; Yan, Ping

    2016-11-01

    PaLi yak is a very important local breed in China; as a year-round grazing animal, it plays a very important role for the economy and for native herdsmen. The complete mitochondrial DNA of PaLi yak was sequenced in this study; the total length is 16,324 bp, containing 13 protein-coding genes, 22 tRNA genes, 2 rRNA genes and a non-coding control region (D-loop region). The order and composition are similar to most of the other vertebrates. The base contents are: 33.72% A, 25.80% C, 13.21% G and 27.27% T; A + T (60.99%) was higher than G + C (39.01%). The phylogenetic relationships were analyzed using the complete mitogenome sequence; the results showed that the genetic relationship between yak and cattle is distinct. This information provides useful data for further study on protection of genetic resources and the taxonomy of Bovinae.

  8. Characterization of the complete mitochondrial genome sequence of wild yak (Bos mutus).

    PubMed

    Chunnian, Liang; Wu, Xiaoyun; Ding, Xuezhi; Wang, Hongbo; Guo, Xian; Chu, Min; Bao, Pengjia; Yan, Ping

    2016-11-01

    Wild yak is a special breed in China and it is regarded as an important genetic resource for sustainably developing animal husbandry in the Tibetan area and enriching the region's biodiversity. The complete mitochondrial genome of wild yak (16,322 bp in length) contained 37 typical animal mitochondrial genes and was A + T-rich (61.01%), with an overall G + C content of only 38.99%. It contained a non-coding control region (D-loop), 13 protein-coding genes, two rRNA genes, and 22 tRNA genes. Most of the genes have ATG initiation codons, whereas ND2, ND3, and ND5 genes start with ATA and were encoded on H-strand. The gene order of wild yak mitogenome is identical to that observed in most other vertebrates. The complete mitochondrial genome sequence of wild yak reported here could provide valuable information for developing genetic markers and phylogenetic analysis in yak.

  9. Discrete Ramanujan transform for distinguishing the protein coding regions from other regions.

    PubMed

    Hua, Wei; Wang, Jiasong; Zhao, Jian

    2014-01-01

    Based on the study of the Ramanujan sum and Ramanujan coefficient, this paper introduces the concepts of the discrete Ramanujan transform and spectrum. Using the Voss numerical representation, one maps a symbolic DNA strand to a numerical DNA sequence and deduces the discrete Ramanujan spectrum of that sequence. It is well known that the discrete Fourier power spectrum of a protein coding sequence has an important feature of 3-base periodicity, which is widely used for DNA sequence analysis by the technique of discrete Fourier transform. The analysis is performed by testing the signal-to-noise ratio at frequency N/3 as a criterion, where N is the length of the sequence. The results presented in this paper show that the property of 3-base periodicity appears as a prominent spike of the discrete Ramanujan spectrum at period 3 only for the protein coding regions. The signal-to-noise ratio for the discrete Ramanujan spectrum is defined for numerical measurement. Therefore, the discrete Ramanujan spectrum and the signal-to-noise ratio of a DNA sequence can be used for distinguishing the protein coding regions from the noncoding regions. All the exon and intron sequences in whole chromosomes 1, 2, 3 and 4 of Caenorhabditis elegans have been tested, and the histograms and tables from the computational results illustrate the reliability of our method. In addition, a theoretical analysis leads to the conclusion that the algorithm for calculating the discrete Ramanujan spectrum has lower computational complexity and higher computational accuracy. The computational experiments show that using the discrete Ramanujan spectrum to classify different DNA sequences is a fast and effective method. Copyright © 2014 Elsevier Ltd. All rights reserved.
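
    The Fourier baseline mentioned in this abstract can be sketched in a few lines: build the four Voss indicator sequences, sum their DFT power at frequency index N/3, and compare it with the average power. The signal-to-noise definition used here (peak power divided by mean power over all nonzero frequencies) is an assumption made for illustration, as are the function names; the paper's own Ramanujan transform and its SNR definition are not reproduced. The Ramanujan sum itself is included because its definition is standard.

```python
import cmath
from math import gcd, cos, pi

def voss_indicators(seq):
    """Voss representation: one binary indicator sequence per nucleotide."""
    seq = seq.upper()
    return {base: [1 if ch == base else 0 for ch in seq] for base in "ACGT"}

def power_at(x, k):
    """Squared magnitude of the k-th DFT coefficient of x."""
    n = len(x)
    s = sum(v * cmath.exp(-2j * cmath.pi * k * i / n) for i, v in enumerate(x))
    return abs(s) ** 2

def period3_snr(seq):
    """Power at frequency index N/3 over mean power (illustrative definition)."""
    n = len(seq)
    assert n % 3 == 0, "sketch assumes the length is a multiple of 3"
    indicators = voss_indicators(seq).values()

    def total_power(k):
        return sum(power_at(x, k) for x in indicators)

    mean = sum(total_power(k) for k in range(1, n)) / (n - 1)
    return total_power(n // 3) / mean if mean else 0.0

def ramanujan_sum(q, n):
    """c_q(n): sum of cos(2*pi*a*n/q) over 1 <= a <= q with gcd(a, q) = 1."""
    return round(sum(cos(2 * pi * a * n / q)
                     for a in range(1, q + 1) if gcd(a, q) == 1))

codon_like = "ATG" * 30      # synthetic sequence with exact period 3
period_four = "ACGT" * 15    # synthetic sequence with period 4, no period-3 signal
snr_periodic = period3_snr(codon_like)
snr_other = period3_snr(period_four)
```

    On these synthetic inputs the period-3 sequence gives a large ratio and the period-4 sequence a ratio near zero; real exons show a much weaker contrast. Note that c_3(n) equals 2 when 3 divides n and -1 otherwise, which is the property that makes a period-3 component stand out.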

  10. Development of a dual-protective live attenuated vaccine against H5N1 and H9N2 avian influenza viruses by modifying the NS1 gene.

    PubMed

    Choi, Eun-hye; Song, Min-Suk; Park, Su-Jin; Pascua, Philippe Noriel Q; Baek, Yun Hee; Kwon, Hyeok-il; Kim, Eun-Ha; Kim, Semi; Jang, Hyung-Kwan; Poo, Haryoung; Kim, Chul-Joong; Choi, Young Ki

    2015-07-01

    An increasing number of outbreaks of avian influenza H5N1 and H9N2 viruses in poultry have caused serious economic losses and raised concerns for human health due to the risk of zoonotic transmission. However, licensed H5N1 and H9N2 vaccines for animals and humans have not been developed. Thus, to develop a dual H5N1 and H9N2 live-attenuated influenza vaccine (LAIV), the HA and NA genes from a virulent mouse-adapted avian H5N2 (A/WB/Korea/ma81/06) virus and a recently isolated chicken H9N2 (A/CK/Korea/116/06) virus, respectively, were introduced into the A/Puerto Rico/8/34 backbone expressing truncated NS1 proteins (NS1-73, NS1-86, NS1-101, NS1-122) but still possessing a full-length NS gene. Two H5N2/NS1-LAIV viruses (H5N2/NS1-86 and H5N2/NS1-101) were highly attenuated compared with the full-length and remaining H5N2/NS-LAIV viruses in a mouse model. Furthermore, viruses containing NS1 modifications were found to induce more IFN-β activation than viruses with full-length NS1 proteins and were correspondingly attenuated in mice. Intranasal vaccination with a single dose (10^4.0 PFU/ml) of these viruses completely protected mice from a lethal challenge with the homologous A/WB/Korea/ma81/06 (H5N2), heterologous highly pathogenic A/EM/Korea/W149/06 (H5N1), and heterosubtypic highly virulent mouse-adapted H9N2 viruses. This study clearly demonstrates that the modified H5N2/NS1-LAIV viruses attenuated through the introduction of mutations in the NS1 coding region display characteristics that are desirable for live attenuated vaccines and hold potential as vaccine candidates for mammalian hosts.

  11. The Mitosis and Neurodevelopment Proteins NDE1 and NDEL1 Form Dimers, Tetramers, and Polymers with a Folded Back Structure in Solution*

    PubMed Central

    Soares, Dinesh C.; Bradshaw, Nicholas J.; Zou, Juan; Kennaway, Christopher K.; Hamilton, Russell S.; Chen, Zhuo A.; Wear, Martin A.; Blackburn, Elizabeth A.; Bramham, Janice; Böttcher, Bettina; Millar, J. Kirsty; Barlow, Paul N.; Walkinshaw, Malcolm D.; Rappsilber, Juri; Porteous, David J.

    2012-01-01

    Paralogs NDE1 (nuclear distribution element 1) and NDEL1 (NDE-like 1) are essential for mitosis and neurodevelopment. Both proteins are predicted to have similar structures, based upon high sequence similarity, and they co-complex in mammalian cells. X-ray diffraction studies and homology modeling suggest that their N-terminal regions (residues 8–167) adopt continuous, extended α-helical coiled-coil structures, but no experimentally derived information on the structure of their C-terminal regions or the architecture of the full-length proteins is available. In the case of NDE1, no biophysical data exists. Here we characterize the structural architecture of both full-length proteins utilizing negative stain electron microscopy along with our established paradigm of chemical cross-linking followed by tryptic digestion, mass spectrometry, and database searching, which we enhance using isotope labeling for mixed NDE1-NDEL1. We determined that full-length NDE1 forms needle-like dimers and tetramers in solution, similar to crystal structures of NDEL1, as well as chain-like end-to-end polymers. The C-terminal domain of each protein, required for interaction with key protein partners dynein and DISC1 (disrupted-in-schizophrenia 1), includes a predicted disordered region that allows a bent back structure. This facilitates interaction of the C-terminal region with the N-terminal coiled-coil domain and is in agreement with previous results showing N- and C-terminal regions of NDEL1 and NDE1 cooperating in dynein interaction. It sheds light on recently identified mutations in the NDE1 gene that cause truncation of the encoded protein. Additionally, analysis of mixed NDE1-NDEL1 complexes demonstrates that NDE1 and NDEL1 can interact directly. PMID:22843697

  12. Molecular characterization of a novel proto-type antimicrobial protein galectin-1 from striped murrel.

    PubMed

    Arasu, Abirami; Kumaresan, Venkatesh; Sathyamoorthi, Akila; Chaurasia, Mukesh Kumar; Bhatt, Prasanth; Gnanam, Annie J; Palanisamy, Rajesh; Marimuthu, Kasi; Pasupuleti, Mukesh; Arockiaraj, Jesu

    2014-11-01

    In this study, we reported a molecular characterization of a novel proto-type galectin-1 from the striped murrel Channa striatus (named CsGal-1). The full-length CsGal-1 was identified from an established striped murrel cDNA library and further we confirmed the sequence by cloning. The complete cDNA sequence of CsGal-1 is 590 base pairs (bp) in length and its coding region encoded a polypeptide of 135 amino acids. The polypeptide contains a galactoside binding lectin domain at 4-135. The domain carries a sugar binding site at 45-74 along with its signatures (H(45)-X-Asn(47)-X-Arg(49) and Trp(69)-X-X-Glu(72)-X-Arg(74)). CsGal-1 shares a highly conserved carbohydrate recognition domain (CRD) with galectin-1 from other proto-type galectin of teleosts. The mRNA expressions of CsGal-1 in healthy and various immune stimulants including Aphanomyces invadans, Aeromonas hydrophila, Escherichia coli lipopolysaccharide and poly I:C injected tissues of C. striatus were examined using qRT-PCR. CsGal-1 mRNA is highly expressed in kidney and is up-regulated with different immune stimulants at various time points. To understand its biological activity, the coding region of CsGal-1 gene was expressed in an E. coli BL21 (DE3) cloning system and its recombinant protein was purified. The recombinant CsGal-1 protein was agglutinated with mouse erythrocytes at a concentration of 4 μg/mL in a calcium independent manner. CsGal-1 activity was inhibited by d-galactose at 25mM(-1) and d-glucose and d-fructose at 100mM(-1). The results of microbial binding assay showed that the recombinant CsGal-1 protein agglutinated only with the Gram-negative bacteria. Interestingly, we observed no agglutination against Gram-positive bacteria. Overall, the study showed that CsGal-1 is an important immune gene involved in the recognition and elimination of pathogens in C. striatus. Copyright © 2014 Elsevier GmbH. All rights reserved.

  13. Reduction of PAPR in coded OFDM using fast Reed-Solomon codes over prime Galois fields

    NASA Astrophysics Data System (ADS)

    Motazedi, Mohammad Reza; Dianat, Reza

    2017-02-01

    In this work, two new techniques using Reed-Solomon (RS) codes over GF(257) and GF(65,537) are proposed for peak-to-average power ratio (PAPR) reduction in coded orthogonal frequency division multiplexing (OFDM) systems. The lengths of these codes are well-matched to the length of OFDM frames. Over these fields, the block lengths of codes are powers of two and we fully exploit the radix-2 fast Fourier transform algorithms. Multiplications and additions are simple modulus operations. These codes provide desirable randomness with a small perturbation in information symbols that is essential for generation of different statistically independent candidates. Our simulations show that the PAPR reduction ability of RS codes is the same as that of conventional selected mapping (SLM), but contrary to SLM, we can get error correction capability. Also for the second proposed technique, the transmission of side information is not needed. To the best of our knowledge, this is the first work using RS codes for PAPR reduction in single-input single-output systems.
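
    The reason prime fields such as GF(257) pair well with radix-2 FFTs is that 257 - 1 = 2^8, so the field contains primitive roots of unity of every power-of-two order up to 256 and all arithmetic reduces to integer operations modulo 257. A minimal sketch of such a transform is given below. It illustrates only the fast transform over GF(257); the function names are hypothetical, and nothing here reproduces the authors' PAPR-reduction scheme or their encoder.

```python
P = 257   # Fermat prime: P - 1 = 2**8
G = 3     # a primitive root modulo 257 (3**128 % 257 == 256, i.e. -1)

def _fft(a, w):
    """Recursive radix-2 transform over GF(P); w is a primitive len(a)-th root."""
    n = len(a)
    if n == 1:
        return list(a)
    w2 = w * w % P
    even = _fft(a[0::2], w2)
    odd = _fft(a[1::2], w2)
    out = [0] * n
    t = 1
    for k in range(n // 2):
        v = t * odd[k] % P
        out[k] = (even[k] + v) % P            # butterfly: E[k] + w^k * O[k]
        out[k + n // 2] = (even[k] - v) % P   # uses w^(n/2) = -1 in GF(P)
        t = t * w % P
    return out

def ntt(a, invert=False):
    """Number-theoretic transform of a power-of-two-length list over GF(257)."""
    n = len(a)
    assert n & (n - 1) == 0 and (P - 1) % n == 0
    w = pow(G, (P - 1) // n, P)               # primitive n-th root of unity
    if invert:
        w = pow(w, P - 2, P)                  # inverse root via Fermat's little theorem
    out = _fft(a, w)
    if invert:
        n_inv = pow(n, P - 2, P)
        out = [x * n_inv % P for x in out]
    return out

symbols = [(7 * i + 3) % P for i in range(16)]   # arbitrary field elements
spectrum = ntt(symbols)
recovered = ntt(spectrum, invert=True)
```

    In the evaluation view of Reed-Solomon codes, a codeword is the message polynomial evaluated at powers of such a root, so a transform of this kind applied to a zero-padded message yields those evaluations in O(n log n) modular operations.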

  14. On the error probability of general tree and trellis codes with applications to sequential decoding

    NASA Technical Reports Server (NTRS)

    Johannesson, R.

    1973-01-01

    An upper bound on the average error probability for maximum-likelihood decoding of the ensemble of random binary tree codes is derived and shown to be independent of the length of the tree. An upper bound on the average error probability for maximum-likelihood decoding of the ensemble of random L-branch binary trellis codes of rate R = 1/n is derived which separates the effects of the tail length T and the memory length M of the code. It is shown that the bound is independent of the length L of the information sequence. This implication is investigated by computer simulations of sequential decoding utilizing the stack algorithm. These simulations confirm the implication and further suggest an empirical formula for the true undetected decoding error probability with sequential decoding.

  15. A high-throughput platform for population reformatting and mammalian expression of phage display libraries to enable functional screening as full-length IgG.

    PubMed

    Xiao, Xiaodong; Douthwaite, Julie A; Chen, Yan; Kemp, Ben; Kidd, Sara; Percival-Alwyn, Jennifer; Smith, Alison; Goode, Kate; Swerdlow, Bonnie; Lowe, David; Wu, Herren; Dall'Acqua, William F; Chowdhury, Partha S

    Phage display antibody libraries are a rich resource for discovery of potential therapeutic antibodies. Single-chain variable fragment (scFv) libraries are the most common format due to the efficient display of scFv by phage particles and the ease by which soluble scFv antibodies can be expressed for high-throughput screening. Typically, a cascade of screening and triaging activities is performed, beginning with the assessment of large numbers of E. coli-expressed scFv, and progressing through additional assays with individual reformatting of the most promising scFv to full-length IgG. However, use of high-throughput screening of scFv for the discovery of full-length IgG is not ideal because of the differences between these molecules. Furthermore, the reformatting step represents a bottleneck in the process because each antibody has to be handled individually to preserve the unique VH and VL pairing. These problems could be resolved if populations of scFv could be reformatted to full-length IgG before screening without disrupting the variable region pairing. Here, we describe a novel strategy that allows the reformatting of diverse populations of scFv from phage selections to full-length IgG in a batch format. The reformatting process maintains the diversity and variable region pairing with high fidelity, and the resulting IgG pool enables high-throughput expression of IgG in mammalian cells and cell-based functional screening. The improved process led to the discovery of potent candidates that are comparable to or better than those obtained by traditional methods. This strategy should also be readily applicable to Fab-based phage libraries. Our approach, Screening in Product Format (SiPF), represents a substantial improvement in the field of antibody discovery using phage display.

  16. In-vitro and in-vivo phenotype of type Asia 1 foot-and-mouth disease viruses utilizing two non-RGD receptor recognition sites

    PubMed Central

    2011-01-01

    Background Foot-and-mouth disease virus (FMDV) uses a highly conserved Arg-Gly-Asp (RGD) triplet for attachment to host cells and this motif is believed to be essential for virus viability. Previous sequence analyses of the 1D-encoding region of an FMDV field isolate (Asia1/JS/CHA/05) and its two derivatives indicated that two viruses, which contained an Arg-Asp-Asp (RDD) or an Arg-Ser-Asp (RSD) triplet instead of the RGD integrin recognition motif, were generated serendipitously upon short-term evolution of field isolate in different biological environments. To examine the influence of single amino acid substitutions in the receptor binding site of the RDD-containing FMD viral genome on virus viability and the ability of non-RGD FMDVs to cause disease in susceptible animals, we constructed an RDD-containing FMDV full-length cDNA clone and derived mutant molecules with RGD or RSD receptor recognition motifs. Following transfection of BSR cells with the full-length genome plasmids, the genetically engineered viruses were examined for their infectious potential in cell culture and susceptible animals. Results Amino acid sequence analysis of the 1D-coding region of different derivatives derived from the Asia1/JS/CHA/05 field isolate revealed that the RDD mutants became dominant or achieved population equilibrium with coexistence of the RGD and RSD subpopulations at an early phase of type Asia1 FMDV quasispecies evolution. Furthermore, the RDD and RSD sequences remained genetically stable for at least 20 passages. Using reverse genetics, the RDD-, RSD-, and RGD-containing FMD viruses were rescued from full-length cDNA clones, and single amino acid substitution in RDD-containing FMD viral genome did not affect virus viability. The genetically engineered viruses replicated stably in BHK-21 cells and had similar growth properties to the parental virus. The RDD parental virus and two non-RGD recombinant viruses were virulent to pigs and bovines that developed typical clinical disease and viremia. Conclusions FMDV quasispecies evolving in a different biological environment gained the capability of selecting different receptor recognition site. The RDD-containing FMD viral genome can accommodate substitutions in the receptor binding site without additional changes in the capsid. The viruses expressing non-RGD receptor binding sites can replicate stably in vitro and produce typical FMD clinical disease in susceptible animals. PMID:21711567

  17. The complete chloroplast genome of a medicinal plant Epimedium koreanum Nakai (Berberidaceae).

    PubMed

    Lee, Jung-Hoon; Kim, Kyunghee; Kim, Na-Rae; Lee, Sang-Choon; Yang, Tae-Jin; Kim, Young-Dong

    2016-11-01

    Epimedium koreanum is a perennial medicinal plant distributed in Eastern Asia. The complete chloroplast genome sequence of E. koreanum was obtained by de novo assembly using whole genome next-generation sequences. The chloroplast genome of E. koreanum was 157 218 bp in length and comprised four distinct regions: a large single copy region (89 600 bp), a small single copy region (17 222 bp) and a pair of inverted repeat regions (25 198 bp each). The genome contained a total of 112 genes including 78 protein-coding genes, 30 tRNA genes, and 4 rRNA genes. Phylogenetic analysis with the reported chloroplast genomes revealed that E. koreanum is most closely related to Berberis bealei, a traditional medicinal plant in the Berberidaceae family.
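
    The region sizes quoted in this abstract can be checked against the reported total, assuming the 25 198 bp figure refers to one copy of the inverted repeat, so that it is counted twice. A short sketch of that arithmetic follows; the function name is hypothetical.

```python
def quadripartite_length(lsc, ssc, ir):
    """Total length of a chloroplast genome with one LSC, one SSC and two IR copies."""
    return lsc + ssc + 2 * ir

# Values as reported in the abstract (bp); the IR size is assumed to be per copy.
total = quadripartite_length(lsc=89_600, ssc=17_222, ir=25_198)
ir_share = 2 * 25_198 / total   # fraction of the genome covered by both IR copies
```

    Under that assumption the sum reproduces the stated 157 218 bp, and the two repeat copies together account for roughly a third of the genome.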

  18. Modulation and coding for satellite and space communications

    NASA Technical Reports Server (NTRS)

    Yuen, Joseph H.; Simon, Marvin K.; Pollara, Fabrizio; Divsalar, Dariush; Miller, Warner H.; Morakis, James C.; Ryan, Carl R.

    1990-01-01

    Several modulation and coding advances supported by NASA are summarized. To support long-constraint-length convolutional codes, a VLSI maximum-likelihood decoder utilizing parallel processing techniques is being developed to decode convolutional codes of constraint length 15 and code rates as low as 1/6. A VLSI high-speed 8-b Reed-Solomon decoder being developed for advanced tracking and data relay satellite (ATDRS) applications is discussed, as is a 300-Mb/s modem with continuous phase modulation (CPM) and coding being developed for ATDRS. Trellis-coded modulation (TCM) techniques are discussed for satellite-based mobile communication applications.

  19. A novel encoding scheme for effective biometric discretization: Linearly Separable Subcode.

    PubMed

    Lim, Meng-Hui; Teoh, Andrew Beng Jin

    2013-02-01

    Separability in a code is crucial in guaranteeing a decent Hamming-distance separation among the codewords. In multibit biometric discretization where a code is used for quantization-intervals labeling, separability is necessary for preserving distance dissimilarity when feature components are mapped from a discrete space to a Hamming space. In this paper, we examine separability of Binary Reflected Gray Code (BRGC) encoding and reveal its inadequacy in tackling interclass variation during the discrete-to-binary mapping, leading to a tradeoff between classification performance and entropy of binary output. To overcome this drawback, we put forward two encoding schemes exhibiting full-ideal and near-ideal separability capabilities, known as Linearly Separable Subcode (LSSC) and Partially Linearly Separable Subcode (PLSSC), respectively. These encoding schemes convert the conventional entropy-performance tradeoff into an entropy-redundancy tradeoff in the increase of code length. Extensive experimental results vindicate the superiority of our schemes over the existing encoding schemes in discretization performance. This opens up possibilities of achieving much greater classification performance with high output entropy.
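
    The separability problem described here can be seen on a tiny example: in a Binary Reflected Gray Code, adjacent interval indices differ in one bit, but distant indices can also end up one bit apart, so Hamming distance does not track index distance. The sketch below contrasts BRGC with a thermometer (unary) code, used here as an assumed stand-in for a fully separable code; the abstract does not spell out the LSSC construction, and the function names are hypothetical.

```python
def brgc(i, bits):
    """Binary Reflected Gray Code of index i as a list of bits (MSB first)."""
    g = i ^ (i >> 1)
    return [(g >> k) & 1 for k in reversed(range(bits))]

def thermometer(i, levels):
    """Unary-style code of length levels-1: index i is encoded with i ones."""
    return [0] * (levels - 1 - i) + [1] * i

def hamming(a, b):
    return sum(x != y for x, y in zip(a, b))

levels = 4
# Hamming distance between the codewords of the two extreme intervals (0 and 3).
gray_far = hamming(brgc(0, 2), brgc(3, 2))
thermo_far = hamming(thermometer(0, levels), thermometer(3, levels))
```

    For four intervals the Gray codewords of indices 0 and 3 are only one bit apart, while the thermometer codewords are three bits apart, matching the index distance. The price is length: 3 bits instead of 2, which is in line with the entropy-redundancy tradeoff the abstract mentions.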

  20. The structure of the regulatory region of the rat L1 (L1Rn, long interspersed repeated) DNA family of transposable elements.

    PubMed Central

    Furano, A V; Robb, S M; Robb, F T

    1988-01-01

    Here we report the DNA structure of the left 1.5 kb of two newly isolated full length members of the rat L1 DNA family (L1Rn, long interspersed repeated DNA). In contrast to earlier isolated rat L1 members, both of these contain promoter-like regions that are most likely full length. In addition, the promoter-like region of both members has undergone a partial tandem duplication. A second internal region of the left end of one of the reported members is also tandemly duplicated. The propensity of the left end of rat L1 elements to undergo this form of genetic rearrangement, as well as other structural features revealed by the present work, is discussed in light of the fact that during evolution the otherwise conserved mammalian L1 DNA families have each acquired completely different promoter-like regions. In an accompanying paper [Nur, I., Pascale, E., and Furano, A. V. (1988) Nucleic Acids Res. 16, submitted], we report that one of the rat promoter-like regions can function as a promoter in rat cells when fused to the Escherichia coli chloramphenicol acyltransferase gene. PMID:2845369

  1. Molecular cloning and identification of the transcriptional regulatory domain of the goat neurokinin B gene TAC3.

    PubMed

    Suetomi, Yuta; Matsuda, Fuko; Uenoyama, Yoshihisa; Maeda, Kei-ichiro; Tsukamura, Hiroko; Ohkura, Satoshi

    2013-10-01

    Neurokinin B (NKB), encoded by TAC3, is thought to be an important accelerator of pulsatile gonadotropin-releasing hormone release. This study aimed to clarify the transcriptional regulatory mechanism of goat TAC3. First, we determined the full-length mRNA sequence of goat TAC3 from the hypothalamus to be 820 b, including a 381 b coding region, with the putative transcription start site located 143 b upstream of the start codon. The deduced amino acid sequence of NKB, which is produced from preproNKB, was completely conserved among goat, cattle, and human. Next, we cloned the 5'-upstream region of goat TAC3 up to 3400 b from the translation initiation site, and this region was highly homologous with cattle TAC3 (89%). We used this goat TAC3 5'-upstream region to perform luciferase assays. We created luciferase reporter vectors containing DNA constructs from -2706, -1837, -834, -335, or -197 to +166 bp (the putative transcription start site was designated as +1) of goat TAC3, and these were transiently transfected into mouse hypothalamus-derived N7 cells and human neuroblastoma-derived SK-N-AS cells. The luciferase activity gradually increased with the deletion of the 5'-upstream region, suggesting that the transcriptional suppressive region is located between -2706 and -336 bp and that the core promoter exists downstream of -197 bp. Estradiol treatment did not lead to significant suppression of luciferase activity of any constructs, suggesting the existence of other factor(s) that regulate goat TAC3 transcription.

  2. Characterization of antigenic determinants in ApxIIA exotoxin capable of inducing protective immunity to Actinobacillus pleuropneumoniae challenge.

    PubMed

    Seo, Ki-Weon; Kim, Dong-Heon; Kim, Ah Hyun; Yoo, Han-Sang; Lee, Kyung-Yeol; Jang, Yong-Suk

    2011-01-01

    Actinobacillus pleuropneumoniae is the causative agent of porcine pleuropneumonia. Among the virulence factors of the pathogen, ApxIIA, a bacterial exotoxin, is expressed by many serotypes and presents a plausible target for vaccine development. We characterized the region within ApxIIA that induces a protective immune response against bacterial infection using a mouse challenge model. Recombinant proteins spanning the length of ApxIIA were produced and antiserum to the full-length ApxIIA was induced in mice. This antiserum recognized fragments #2, #3 and #5 with high binding specificity, but showed poor recognition of fragments #1 and #4. Of the antisera induced in mice by injection of each fragment, only the antiserum to fragment #4 failed to efficiently recognize the full-length antigen, although the individual antisera recognized their cognate antigens with almost equal efficiency. The protective potency of the immunogenic proteins against a challenge injection of bacteria in vivo correlated well with the antibody titer. Fragment #5 induced the highest level of protective activity, comparable to that of the full-length protein. These results support the use of fragment #5 to produce a vaccine against A. pleuropneumoniae challenge, since the small antigen peptide is easier to handle than the full-length protein and can be expressed efficiently in heterologous expression systems.

  3. Trinucleotide repeat length and progression of illness in Huntington's disease.

    PubMed Central

    Kieburtz, K; MacDonald, M; Shih, C; Feigin, A; Steinberg, K; Bordwell, K; Zimmerman, C; Srinidhi, J; Sotack, J; Gusella, J

    1994-01-01

    The genetic defect causing Huntington's disease (HD) has been identified as an unstable expansion of a trinucleotide (CAG) repeat sequence within the coding region of the IT15 gene on chromosome 4. In 50 patients with manifest HD who were evaluated prospectively and uniformly, we examined the relationship between the extent of the DNA expansion and the rate of illness progression. Although the length of CAG repeats showed a strong inverse correlation with the age at onset of HD, there was no such relationship between the number of CAG repeats and the rate of clinical decline. These findings suggest that the CAG repeat length may influence or trigger the onset of HD, but other genetic, neurobiological, or environmental factors contribute to the progression of illness and the underlying pace of neuronal degeneration. PMID:7853373

  4. Analysis of copy number variations in Holstein-Friesian cow genomes based on whole-genome sequence data.

    PubMed

    Mielczarek, M; Frąszczak, M; Giannico, R; Minozzi, G; Williams, John L; Wojdak-Maksymiec, K; Szyda, J

    2017-07-01

    Thirty-two whole genome DNA sequences of cows were analyzed to evaluate inter-individual variability in the distribution and length of copy number variations (CNV) and to functionally annotate CNV breakpoints. The total number of deletions per individual varied between 9,731 and 15,051, whereas the number of duplications was between 1,694 and 5,187. Most of the deletions (81%) and duplications (86%) were unique to a single cow. No relation between the pattern of variant sharing and a family relationship or disease status was found. The animal-averaged length of deletions was from 5,234 to 9,145 bp and the average length of duplications was between 7,254 and 8,843 bp. Highly significant inter-individual variation in length and number of CNV was detected for both deletions and duplications. The majority of deletion and duplication breakpoints were located in intergenic regions and introns, whereas fewer were identified in noncoding transcripts and splice regions. Only 1.35 and 0.79% of the deletion and duplication breakpoints were observed within coding regions. A gene with the highest number of deletion breakpoints codes for protein kinase cGMP-dependent type I, whereas the T-cell receptor α constant gene had the most duplication breakpoints. The functional annotation of genes with the largest incidence of deletion/duplication breakpoints identified 87/112 Kyoto Encyclopedia of Genes and Genomes pathways, but none of the pathways were significantly enriched or depleted with breakpoints. The analysis of Gene Ontology (GO) terms revealed that a cluster with the highest enrichment score among genes with many deletion breakpoints was represented by GO terms related to ion transport, whereas the GO term cluster mostly enriched among the genes with many duplication breakpoints was related to binding of macromolecules. 
    Furthermore, when considering the number of deletion breakpoints per gene functional category, no significant differences were observed between the "housekeeping" and "strong selection" categories, but genes representing the "low selection pressure" group showed a significantly higher number of breakpoints. Copyright © 2017 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.

  5. Structure of the full-length TRPV2 channel by cryo-EM

    NASA Astrophysics Data System (ADS)

    Huynh, Kevin W.; Cohen, Matthew R.; Jiang, Jiansen; Samanta, Amrita; Lodowski, David T.; Zhou, Z. Hong; Moiseenkova-Bell, Vera Y.

    2016-03-01

    Transient receptor potential (TRP) proteins form a superfamily of Ca2+-permeable cation channels regulated by a range of chemical and physical stimuli. Structural analysis of a 'minimal' TRP vanilloid subtype 1 (TRPV1) elucidated a mechanism of channel activation by agonists through changes in its outer pore region. Though homologous to TRPV1, other TRPV channels (TRPV2-6) are insensitive to TRPV1 activators including heat and vanilloids. To further understand the structural basis of TRPV channel function, we determined the structure of full-length TRPV2 at ~5 Å resolution by cryo-electron microscopy. Like TRPV1, TRPV2 contains two constrictions, one each in the pore-forming upper and lower gates. The agonist-free full-length TRPV2 has wider upper and lower gates compared with closed and agonist-activated TRPV1. We propose these newly revealed TRPV2 structural features contribute to diversity of TRPV channels.

  6. Structure of the full-length TRPV2 channel by cryo-EM

    PubMed Central

    Huynh, Kevin W.; Cohen, Matthew R.; Jiang, Jiansen; Samanta, Amrita; Lodowski, David T.; Zhou, Z. Hong; Moiseenkova-Bell, Vera Y.

    2016-01-01

    Transient receptor potential (TRP) proteins form a superfamily of Ca2+-permeable cation channels regulated by a range of chemical and physical stimuli. Structural analysis of a 'minimal' TRP vanilloid subtype 1 (TRPV1) elucidated a mechanism of channel activation by agonists through changes in its outer pore region. Though homologous to TRPV1, other TRPV channels (TRPV2–6) are insensitive to TRPV1 activators including heat and vanilloids. To further understand the structural basis of TRPV channel function, we determined the structure of full-length TRPV2 at ∼5 Å resolution by cryo-electron microscopy. Like TRPV1, TRPV2 contains two constrictions, one each in the pore-forming upper and lower gates. The agonist-free full-length TRPV2 has wider upper and lower gates compared with closed and agonist-activated TRPV1. We propose these newly revealed TRPV2 structural features contribute to diversity of TRPV channels. PMID:27021073

  7. Structure of the full-length TRPV2 channel by cryo-EM.

    PubMed

    Huynh, Kevin W; Cohen, Matthew R; Jiang, Jiansen; Samanta, Amrita; Lodowski, David T; Zhou, Z Hong; Moiseenkova-Bell, Vera Y

    2016-03-29

    Transient receptor potential (TRP) proteins form a superfamily of Ca(2+)-permeable cation channels regulated by a range of chemical and physical stimuli. Structural analysis of a 'minimal' TRP vanilloid subtype 1 (TRPV1) elucidated a mechanism of channel activation by agonists through changes in its outer pore region. Though homologous to TRPV1, other TRPV channels (TRPV2-6) are insensitive to TRPV1 activators including heat and vanilloids. To further understand the structural basis of TRPV channel function, we determined the structure of full-length TRPV2 at ∼5 Å resolution by cryo-electron microscopy. Like TRPV1, TRPV2 contains two constrictions, one each in the pore-forming upper and lower gates. The agonist-free full-length TRPV2 has wider upper and lower gates compared with closed and agonist-activated TRPV1. We propose these newly revealed TRPV2 structural features contribute to diversity of TRPV channels.

  8. Genomic Landscape of Long Terminal Repeat Retrotransposons (LTR-RTs) and Solo LTRs as Shaped by Ectopic Recombination in Chicken and Zebra Finch.

    PubMed

    Ji, Yanzhu; DeWoody, J Andrew

    2016-06-01

    Transposable elements (TEs) are nearly ubiquitous among eukaryotic genomes, but TE contents vary dramatically among phylogenetic lineages. Several mechanisms have been proposed as drivers of TE dynamics in genomes, including the fixation/loss of a particular TE insertion by selection or drift as well as structural changes in the genome due to mutation (e.g., recombination). In particular, recombination can have a significant and directional effect on the genomic TE landscape. For example, ectopic recombination removes internal regions of long terminal repeat retrotransposons (LTR-RTs) as well as one long terminal repeat (LTR), resulting in a solo LTR. In this study, we focus on the intra-species dynamics of LTR-RTs and solo LTRs in bird genomes. The distribution of LTR-RTs and solo LTRs in birds is intriguing because avian recombination rates vary widely within a given genome. We used published linkage maps and whole genome assemblies to study the relationship between recombination rates and LTR-removal events in the chicken and zebra finch. We hypothesized that regions with low recombination rates would harbor more full-length LTR-RTs (and fewer solo LTRs) than regions with high recombination rates. We tested this hypothesis by comparing the ratio of full-length LTR-RTs and solo LTRs across chromosomes, across non-overlapping megabase windows, and across physical features (i.e., centromeres and telomeres). The chicken data statistically supported the hypothesis that recombination rates are inversely correlated with the ratio of full-length to solo LTRs at both the chromosome level and in 1-Mb non-overlapping windows. We also found that the ratio of full-length to solo LTRs near chicken telomeres was significantly lower than those ratios near centromeres. Our results suggest a potential role of ectopic recombination in shaping the chicken LTR-RT genomic landscape.

  9. The complete mitochondrial genome of a stonefly species, Kamimuria chungnanshana Wu, 1948 (Plecoptera: Perlidae).

    PubMed

    Wang, Kai; Ding, Shuangmei; Yang, Ding

    2016-09-01

    This study determined the complete mitochondrial (mt) genome of the stonefly, Kamimuria chungnanshana Wu, 1948. The mt genome is 15,943 bp in size and contains 37 canonical genes, which include 22 transfer RNA genes, 13 protein-coding genes, and two ribosomal RNA genes; the control region is 1062 bp in length. The phylogenetic tree shows that Kamimuria chungnanshana is the sister group of Kamimuria wangi.

  10. Complete mitochondrial genome of a wild Siberian tiger.

    PubMed

    Sun, Yujiao; Lu, Taofeng; Sun, Zhaohui; Guan, Weijun; Liu, Zhensheng; Teng, Liwei; Wang, Shuo; Ma, Yuehui

    2015-01-01

    In this study, the complete mitochondrial genome of the Siberian tiger (Panthera tigris altaica) was sequenced, using muscle tissue obtained from a male wild tiger. The total length of the mitochondrial genome is 16,996 bp. The genome structure of this tiger is consistent with that of other Siberian tigers, and it contains a 12S rRNA gene, a 16S rRNA gene, 22 tRNA genes, 13 protein-coding genes, and 1 control region.

  11. Mutations in nsP1 and PE2 are critical determinants of Ross River virus-induced musculoskeletal inflammatory disease in a mouse model

    PubMed Central

    Jupille, Henri J.; Oko, Lauren; Stoermer, Kristina A.; Heise, Mark T.; Mahalingam, Suresh; Gunn, Bronwyn M.; Morrison, Thomas E.

    2010-01-01

    The viral determinants of Alphavirus-induced rheumatic disease have not been elucidated. We identified an RRV strain (DC5692) which, in contrast to the T48 strain, does not induce musculoskeletal inflammation in a mouse model of RRV disease. Substitution of the RRV T48 strain nonstructural protein 1 (nsP1) coding sequence with that from strain DC5692 generated a virus that was attenuated in vivo despite similar viral loads in tissues. In contrast, substitution of the T48 PE2 coding region with the PE2 coding region from DC5692 resulted in attenuation in vivo and reduced viral loads in tissues. In gain of virulence experiments, substitution of the DC5692 strain nsP1 and PE2 coding regions with those from the T48 strain was sufficient to restore full virulence to the DC5692 strain. These findings indicate that determinants in both nsP1 and PE2 have critical and distinct roles in the pathogenesis of RRV-induced musculoskeletal inflammatory disease in mice. PMID:21131014

  12. Modelling crystal plasticity by 3D dislocation dynamics and the finite element method: The Discrete-Continuous Model revisited

    NASA Astrophysics Data System (ADS)

    Vattré, A.; Devincre, B.; Feyel, F.; Gatti, R.; Groh, S.; Jamond, O.; Roos, A.

    2014-02-01

    A unified model coupling 3D dislocation dynamics (DD) simulations with the finite element (FE) method is revisited. The so-called Discrete-Continuous Model (DCM) aims to predict plastic flow at the (sub-)micron length scale of materials with complex boundary conditions. The evolution of the dislocation microstructure and the short-range dislocation-dislocation interactions are calculated with a DD code. The long-range mechanical fields due to the dislocations are calculated by a FE code, taking into account the boundary conditions. The coupling procedure is based on eigenstrain theory, and the precise manner in which the plastic slip, i.e. the dislocation glide as calculated by the DD code, is transferred to the integration points of the FE mesh is described in full detail. Several test cases are presented, and the DCM is applied to plastic flow in a single-crystal Nickel-based superalloy.

  13. Comparative Analysis of the Mitochondrial Genomes of Callitettixini Spittlebugs (Hemiptera: Cercopidae) Confirms the Overall High Evolutionary Speed of the AT-Rich Region but Reveals the Presence of Short Conservative Elements at the Tribal Level

    PubMed Central

    Liu, Jie; Bu, Cuiping; Wipfler, Benjamin; Liang, Aiping

    2014-01-01

    The present study compares the mitochondrial genomes of five species of the spittlebug tribe Callitettixini (Hemiptera: Cercopoidea: Cercopidae) from eastern Asia. All genomes of the five species sequenced are circular double-stranded DNA molecules and range from 15,222 to 15,637 bp in length. They contain 22 tRNA genes, 13 protein coding genes (PCGs) and 2 rRNA genes and share the putative ancestral gene arrangement of insects. The PCGs show an extreme bias of nucleotide and amino acid composition. Significant differences of the substitution rates among the different genes as well as the different codon position of each PCG are revealed by the comparative evolutionary analyses. The substitution speeds of the first and second codon position of different PCGs are negatively correlated with their GC content. Among the five species, the AT-rich region features great differences in length and pattern and generally shows a 2–5 times higher substitution rate than the fastest PCG in the mitochondrial genome, atp8. Despite the significant variability in length, short conservative segments were identified in the AT-rich region within Callitettixini, although absent from the other groups of the spittlebug superfamily Cercopoidea. PMID:25285442

  14. [Construction of dengue virus-specific full-length fully human antibody libraries by mammalian display technology].

    PubMed

    Wen, Yangming; Lan, Kaijian; Wang, Junjie; Yu, Jingyi; Qu, Yarong; Zhao, Wei; Zhang, Fuchun; Tan, Wanlong; Cao, Hong; Zhou, Chen

    2013-06-01

    To construct dengue virus-specific full-length fully human antibody libraries using mammalian cell surface display technique. Total RNA was extracted from peripheral blood mononuclear cells (PBMCs) from convalescent patients with dengue fever. The reservoirs of the light chain and heavy chain variable regions (LCκ and VH) of the antibody genes were amplified by RT-PCR and inserted into the vector pDGB-HC-TM separately to construct the light chain and heavy chain libraries. The library DNAs were transfected into CHO cells and the expression of full-length fully human antibodies on the surface of CHO cells was analyzed by flow cytometry. Using 1.2 µg of the total RNA isolated from the PBMCs as the template, the LCκ and VH were amplified and the full-length fully human antibody mammalian display libraries were constructed. The kappa light chain gene library had a size of 1.45×10(4) and the heavy chain gene library had a size of 1.8×10(5). Sequence analysis showed that 8 out of the 10 light chain clones and 7 out of the 10 heavy chain clones randomly picked up from the constructed libraries contained correct open reading frames. FACS analysis demonstrated that all the 15 clones with correct open reading frames expressed full-length antibodies, which could be detected on CHO cell surfaces. After co-transfection of the heavy chain and light chain gene libraries into CHO cells, the expression of full-length antibodies on CHO cell surfaces could be detected by FACS analysis with an expressible diversity of the antibody library reaching 1.46×10(9) [(1.45×10(4)×80%)×(1.8×10(5)×70%)]. Using 1.2 µg of total RNA as template, the LCκ and VH full-length fully human antibody libraries against dengue virus have been successfully constructed with an expressible diversity of 10(9).
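
The expressible-diversity figure quoted in this abstract follows from multiplying each library size by the fraction of sampled clones with correct open reading frames (8 of 10 light chain clones, 7 of 10 heavy chain clones). The short sketch below reproduces that arithmetic; the variable names are illustrative only, and the calculation assumes, as the abstract's bracketed formula does, that any expressible light chain can pair with any expressible heavy chain.

```python
# Library sizes reported in the abstract.
light_chain_library_size = 1.45e4
heavy_chain_library_size = 1.8e5

# Fraction of randomly picked clones with correct open reading frames.
light_chain_orf_fraction = 8 / 10
heavy_chain_orf_fraction = 7 / 10

# Expressible members of each library.
expressible_light = light_chain_library_size * light_chain_orf_fraction
expressible_heavy = heavy_chain_library_size * heavy_chain_orf_fraction

# Combinatorial pairing of light and heavy chains after co-transfection.
expressible_diversity = expressible_light * expressible_heavy
print(f"{expressible_diversity:.2e}")
```

The product is about 1.46×10^9, in agreement with the value stated in the abstract.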

  15. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Spencer, B. W.; Williamson, R. L.; Stafford, D. S.

    One of the important roles of cladding in light water reactor fuel rods is to prevent the release of fission products. To that end, it is essential that the cladding maintain its integrity under a variety of thermal and mechanical loading conditions. Local geometric irregularities in fuel pellets caused by manufacturing defects known as missing pellet surfaces (MPS) can in some circumstances lead to elevated cladding stresses that are sufficiently high to cause cladding failure. Accurate modeling of these defects can help prevent these types of failures. The BISON nuclear fuel performance code developed at Idaho National Laboratory can be used to simulate the global thermo-mechanical fuel rod behavior, as well as the local response of regions of interest, in either 2D or 3D. In either case, a full set of models to represent the thermal and mechanical properties of the fuel, cladding and plenum gas is employed. A procedure for coupling 2D full-length fuel rod models to detailed 3D models of the region of the rod containing a MPS defect is detailed in this paper. The global and local model each contain appropriate physics and behavior models for nuclear fuel. This procedure is demonstrated on a simulation of a boiling water reactor (BWR) fuel rod containing a pellet with an MPS defect, subjected to a variety of transient events, including a control blade withdrawal and a ramp to high power. The importance of modeling the local defect using a 3D model is highlighted by comparing 3D and 2D representations of the defective pellet region. Finally, parametric studies demonstrate the effects of the choice of gaseous swelling model and of the depth and geometry of the MPS defect on the response of the cladding adjacent to the defect.

  16. Box codes of lengths 48 and 72

    NASA Technical Reports Server (NTRS)

    Solomon, G.; Jin, Y.

    1993-01-01

    A self-dual code of length 48, dimension 24, with Hamming distance essentially equal to 12 is constructed here. There are only six code words of weight eight. All the other code words have weights that are multiples of four and have a minimum weight equal to 12. This code may be encoded systematically and arises from a strict binary representation of the (8,4;5) Reed-Solomon (RS) code over GF (64). The code may be considered as six interrelated (8,7;2) codes. The Mattson-Solomon representation of the cyclic decomposition of these codes and their parity sums are used to detect an odd number of errors in any of the six codes. These may then be used in a correction algorithm for hard or soft decision decoding. A (72,36;15) box code was constructed from a (63,35;8) cyclic code. The theoretical justification is presented herein. A second (72,36;15) code is constructed from an inner (63,27;16) Bose-Chaudhuri-Hocquenghem (BCH) code and expanded to length 72 using box code algorithms for extension. This code was simulated and verified to have a minimum distance of 15 with even weight words congruent to zero modulo four. The decoding for hard and soft decision is still more complex than the first code constructed above. Finally, an (8,4;5) RS code over GF (512) in the binary representation of the (72,36;15) box code gives rise to a (72,36;16*) code with nine words of weight eight, and all the rest have weights greater than or equal to 16.
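
The abstract's remark that parity sums detect an odd number of errors in each (8,7;2) component can be illustrated with a minimal sketch. It assumes that (8,7;2) denotes a binary code of length 8, dimension 7 and minimum distance 2, which is the even-weight (single parity check) code. The helper names are hypothetical, and the sketch does not reproduce the Mattson-Solomon representation or the box code construction itself.

```python
def spc_encode(data_bits):
    """Append one parity bit so that the 8-bit codeword has even weight."""
    assert len(data_bits) == 7
    return data_bits + [sum(data_bits) % 2]


def parity_fails(word):
    """True when the received word has odd weight, i.e. an error is detected."""
    return sum(word) % 2 == 1


def flip(word, positions):
    """Return a copy of word with the bits at the given positions inverted."""
    return [bit ^ 1 if k in positions else bit for k, bit in enumerate(word)]


codeword = spc_encode([1, 0, 1, 1, 0, 0, 1])

print(parity_fails(codeword))                   # no errors
print(parity_fails(flip(codeword, {2})))        # one error
print(parity_fails(flip(codeword, {2, 5})))     # two errors
print(parity_fails(flip(codeword, {0, 2, 5})))  # three errors
```

The parity check flags the words with one and three flipped bits and passes the words with zero and two, which is why a single parity sum can only signal an odd number of errors.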

  17. Characterization of a Novel Cutaneous Human Papillomavirus Genotype HPV-125

    PubMed Central

    Kovanda, Anja; Kocjan, Boštjan J.; Potočnik, Marko; Poljak, Mario

    2011-01-01

    The DNA genome of a novel HPV genotype, HPV-125, isolated from a hand wart of an immuno-competent 19-year old male was fully cloned, sequenced and characterized. The full genome of HPV-125 is 7,809-bp in length with a GC content of 46.4%. By comparing the nucleotide sequence of the complete L1 gene, HPV-125 is phylogenetically placed within cutaneotrophic species 2 of Alphapapillomaviruses, and is most closely related to HPV-3 and HPV-28. HPV-125 has a typical genomic organization of Alphapapillomaviruses and contains genes coding for five early proteins, E6, E7, E1, E2 and E4 and two late capsid proteins, L1 and L2. The genome contains two non-coding regions: the first located between the L1 and E6 genes (nucleotide positions 7,137–7,809, length 673-bp) and the second between genes E2 and L2 (nucleotide positions 3,757–4,216, length 460-bp). The E6 protein of HPV-125 contains two regular zinc-binding domains at amino acid positions 29 and 102, whereas the E7 protein exhibits one such domain at position 50. HPV-125 lacks the regular pRb-binding core sequence within its E7 protein. In order to assess the tissue predilection and clinical significance of HPV-125, a quantitative type-specific real-time PCR was developed. The 95% limit-of-detection of the assay was 2.5 copies per reaction (range 1.7–5.7) and the intra- and inter-assay coefficients of variation were 0.47 and 2.00 for 100 copies per reaction, and 1.15 and 2.15 for 10 copies per reaction, respectively. Testing of a representative collection of HPV-associated mucosal and cutaneous benign and malignant neoplasms and hair follicles (a total of 601 samples) showed that HPV-125 is a relatively rare HPV genotype, with cutaneous tropism etiologically linked with sporadic cases of common warts. PMID:21811601

  18. dbWFA: a web-based database for functional annotation of Triticum aestivum transcripts

    PubMed Central

    Vincent, Jonathan; Dai, Zhanwu; Ravel, Catherine; Choulet, Frédéric; Mouzeyar, Said; Bouzidi, M. Fouad; Agier, Marie; Martre, Pierre

    2013-01-01

    The functional annotation of genes based on sequence homology with genes from model species genomes is time-consuming because it is necessary to mine several unrelated databases. The aim of the present work was to develop a functional annotation database for common wheat Triticum aestivum (L.). The database, named dbWFA, is based on the reference NCBI UniGene set, an expressed gene catalogue built by expressed sequence tag clustering, and on full-length coding sequences retrieved from the TriFLDB database. Information from good-quality heterogeneous sources, including annotations for model plant species Arabidopsis thaliana (L.) Heynh. and Oryza sativa L., was gathered and linked to T. aestivum sequences through BLAST-based homology searches. Even though the complexity of the transcriptome cannot yet be fully appreciated, we developed a tool to easily and promptly obtain information from multiple functional annotation systems (Gene Ontology, MapMan bin codes, MIPS Functional Categories, PlantCyc pathway reactions and TAIR gene families). The use of dbWFA is illustrated here with several query examples. We were able to assign a putative function to 45% of the UniGenes and 81% of the full-length coding sequences from TriFLDB. Moreover, comparison of the annotation of the whole T. aestivum UniGene set along with curated annotations of the two model species assessed the accuracy of the annotation provided by dbWFA. To further illustrate the use of dbWFA, genes specifically expressed during the early cell division or late storage polymer accumulation phases of T. aestivum grain development were identified using a clustering analysis and then annotated using dbWFA. The annotation of these two sets of genes was consistent with previous analyses of T. aestivum grain transcriptomes and proteomes. Database URL: urgi.versailles.inra.fr/dbWFA/ PMID:23660284

  19. Probing Conformational Dynamics of Tau Protein by Hydrogen/Deuterium Exchange Mass Spectrometry

    NASA Astrophysics Data System (ADS)

    Huang, Richard Y.-C.; Iacob, Roxana E.; Sankaranarayanan, Sethu; Yang, Ling; Ahlijanian, Michael; Tao, Li; Tymiak, Adrienne A.; Chen, Guodong

    2018-01-01

    Fibrillization of the microtubule-associated protein tau has been recognized as one of the signature pathologies of the nervous system in Alzheimer's disease, progressive supranuclear palsy, and other tauopathies. The conformational transition of tau in the fibrillization process, tau monomer to soluble aggregates to fibrils in particular, remains unclear. Here we report on the use of hydrogen/deuterium exchange mass spectrometry (HDX-MS) in combination with other biochemical approaches, including Thioflavin S fluorescence measurements, enzyme-linked immunosorbent assay (ELISA), and Western blotting, to understand heparin-induced tau fibrillization. HDX-MS studies including anti-tau antibody epitope mapping experiments provided molecular level details of the full-length tau's conformational dynamics and its regional solvent accessibility upon soluble aggregate formation. The results demonstrate that the R3 region in the full-length tau's microtubule binding repeat region (MTBR) is stabilized in the aggregation process, leaving both the N- and C-terminal regions solvent exposed in the soluble aggregates and fibrils. The findings also illustrate the practical utility of orthogonal analytical methodologies for the characterization of protein higher order structure.

  20. Characterization of AFLAV, a Tf1/Sushi retrotransposon from Aspergillus flavus.

    PubMed

    Hua, Sui-Sheng T; Tarun, Alice S; Pandey, Sonal N; Chang, Leo; Chang, Perng-Kuang

    2007-02-01

    The plasmid pAF28, a genomic clone from Aspergillus flavus NRRL 6541, has been used as a hybridization probe to fingerprint A. flavus strains isolated in corn and peanut fields. The insert of pAF28 contains a 4.5 kb region which encodes a truncated retrotransposon (AfRTL-1). In search of a full-length and intact copy of the retrotransposon, we exploited a novel PCR cloning strategy by amplifying a 3.4 kb region from the genomic DNA of A. flavus NRRL 6541. The fragment was cloned into pCR 4-TOPO. Sequence analysis confirmed that this region encoded putative domains of partial reverse transcriptase, RNase H, and integrase of the predicted retrotransposon. The two flanking long terminal repeats (LTRs) and the sequence between them comprise a putative full-length LTR retrotransposon of 7799 bp in length. This intact retrotransposon sequence is named AFLAV (A. flavus Retrotransposon). The order of the predicted catalytic domains in the polyprotein (Pol) placed AFLAV in the Tf1/sushi subgroup of the Ty3/gypsy retrotransposon family. Primers derived from the AFLAV sequence were used to screen for this retrotransposon in other strains of A. flavus. More than fifty strains of A. flavus isolated from different geological origins were surveyed and the results show that many strains have extensive deletions in the regions encoding the capsid (Gag) and Pol.

  1. Using the NASA GRC Sectored-One-Dimensional Combustor Simulation

    NASA Technical Reports Server (NTRS)

    Paxson, Daniel E.; Mehta, Vishal R.

    2014-01-01

    The document is a user manual for the NASA GRC Sectored-One-Dimensional (S-1-D) Combustor Simulation. It consists of three sections. The first is a very brief outline of the mathematical and numerical background of the code along with a description of the non-dimensional variables on which it operates. The second section describes how to run the code and includes an explanation of the input file. The input file contains the parameters necessary to establish an operating point as well as the associated boundary conditions (i.e. how it is fed and terminated) of a geometrically configured combustor. It also describes the code output. The third section describes the configuration process and utilizes a specific example combustor to do so. Configuration consists of geometrically describing the combustor (section lengths, axial locations, and cross sectional areas) and locating the fuel injection point and flame region. Configuration requires modifying the source code and recompiling. As such, an executable utility is included with the code which will guide the requisite modifications and ensure that they are done correctly.

  2. Cloning and characterization of full-length mouse thymidine kinase 2: the N-terminal sequence directs import of the precursor protein into mitochondria.

    PubMed Central

    Wang, L; Eriksson, S

    2000-01-01

    The subcellular localization of mitochondrial thymidine kinase (TK2) has been questioned, since no mitochondrial targeting sequences have been found in cloned human TK2 cDNAs. Here we report the cloning of mouse TK2 cDNA from a mouse full-length enriched cDNA library. The mouse TK2 cDNA codes for a protein of 270 amino acids, with a 40-amino-acid presumed N-terminal mitochondrial targeting signal. In vitro translation and translocation experiments with purified rat mitochondria confirmed that the N-terminal sequence directed import of the precursor TK2 into the mitochondrial matrix. A single 2.4 kb mRNA transcript was detected in most tissues examined, except in liver, where an additional shorter (1.0 kb) transcript was also observed. There was no correlation between the tissue distribution of TK2 activity and the expression of TK2 mRNA. Full-length mouse TK2 protein and two N-terminally truncated forms, one of which corresponds to the mitochondrial form of TK2 and a shorter form corresponding to the previously characterized recombinant human TK2, were expressed in Escherichia coli and affinity purified. All three forms of TK2 phosphorylated thymidine, deoxycytidine and 2'-deoxyuridine, but with different kinetic efficiencies. A number of cytostatic pyrimidine nucleoside analogues were also tested and shown to be good substrates for the various forms of TK2. The active form of full-length mouse TK2 was a dimer, as judged by Superdex 200 chromatography. These results enhance our understanding of the structure and function of TK2, and may help to explain the mitochondrial disorder, mitochondrial neurogastrointestinal encephalomyopathy. PMID:11023833

  3. Intersubunit distances in full-length, dimeric, bacterial phytochrome Agp1, as measured by pulsed electron-electron double resonance (PELDOR) between different spin label positions, remain unchanged upon photoconversion.

    PubMed

    Kacprzak, Sylwia; Njimona, Ibrahim; Renz, Anja; Feng, Juan; Reijerse, Edward; Lubitz, Wolfgang; Krauss, Norbert; Scheerer, Patrick; Nagano, Soshichiro; Lamparter, Tilman; Weber, Stefan

    2017-05-05

    Bacterial phytochromes are dimeric light-regulated histidine kinases that convert red light into signaling events. Light absorption by the N-terminal photosensory core module (PCM) causes the proteins to switch between two spectrally distinct forms, Pr and Pfr, thus resulting in a conformational change that modulates the C-terminal histidine kinase region. To provide further insights into structural details of photoactivation, we investigated the full-length Agp1 bacteriophytochrome from the soil bacterium Agrobacterium fabrum using a combined spectroscopic and modeling approach. We generated seven mutants suitable for spin labeling to enable application of pulsed EPR techniques. The distances between attached spin labels were measured using pulsed electron-electron double resonance spectroscopy to probe the arrangement of the subunits within the dimer. We found very good agreement of experimental and calculated distances for the histidine-kinase region when both subunits are in a parallel orientation. However, experimental distance distributions surprisingly showed only limited agreement with either parallel- or antiparallel-arranged dimer structures when spin labels were placed into the PCM region. This observation indicates that the arrangements of the PCM subunits in the full-length protein dimer in solution differ significantly from that in the PCM crystals. The pulsed electron-electron double resonance data presented here revealed either no or only minor changes of distance distributions upon Pr-to-Pfr photoconversion. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.

  4. Conceptual Core Analysis of Long Life PWR Utilizing Thorium-Uranium Fuel Cycle

    NASA Astrophysics Data System (ADS)

    Rouf; Su'ud, Zaki

    2016-08-01

    A conceptual core analysis of a long-life PWR utilizing thorium-uranium based fuel has been conducted. The purpose of this study is to evaluate the neutronic behavior of a reactor core using combined thorium and enriched uranium fuel. With this fuel composition, the reactor core has a higher conversion ratio than with conventional fuel, which could give a longer operation length. The simulation was performed using the SRAC Code System based on the library SRACLIB-JDL32. The calculation was carried out for (Th-U)O2 and (Th-U)C fuel with a uranium composition of 30 - 40% and gadolinium (Gd2O3) as burnable poison at 0.0125%. The fuel composition was adjusted to obtain a burn-up length of 10 - 15 years under a thermal power of 600 - 1000 MWt. Key properties such as uranium enrichment, fuel volume fraction, and percentage of uranium are evaluated. The core calculation in this study adopted R-Z geometry divided into 3 regions, each with a different uranium enrichment. The results show the multiplication factor at every burn-up step for a 15-year operation length, the power distribution behavior, the power peaking factor, and the conversion ratio. The optimum core design is achieved at a thermal power of 600 MWt, a uranium percentage of 35%, and a U-235 enrichment of 11 - 13%, with a 14-year operation length and axial and radial power peaking factors of about 1.5 and 1.2, respectively.

  5. Single Amino Acid Repeats in the Proteome World: Structural, Functional, and Evolutionary Insights

    PubMed Central

    Kumar, Amitha Sampath; Sowpati, Divya Tej; Mishra, Rakesh K.

    2016-01-01

    Microsatellites or simple sequence repeats (SSR) are abundant, highly diverse stretches of short DNA repeats present in all genomes. Tandem mono/tri/hexanucleotide repeats in the coding regions contribute to single amino acids repeats (SAARs) in the proteome. While SSRs in the coding region always result in amino acid repeats, a majority of SAARs arise due to a combination of various codons representing the same amino acid and not as a consequence of SSR events. Certain amino acids are abundant in repeat regions indicating a positive selection pressure behind the accumulation of SAARs. By analysing 22 proteomes including the human proteome, we explored the functional and structural relationship of amino acid repeats in an evolutionary context. Only ~15% of repeats are present in any known functional domain, while ~74% of repeats are present in the disordered regions, suggesting that SAARs add to the functionality of proteins by providing flexibility, stability and act as linker elements between domains. Comparison of SAAR containing proteins across species reveals that while shorter repeats are conserved among orthologs, proteins with longer repeats, >15 amino acids, are unique to the respective organism. Lysine repeats are well conserved among orthologs with respect to their length and number of occurrences in a protein. Other amino acids such as glutamic acid, proline, serine and alanine repeats are generally conserved among the orthologs with varying repeat lengths. These findings suggest that SAARs have accumulated in the proteome under positive selection pressure and that they provide flexibility for optimal folding of functional/structural domains of proteins. The insights gained from our observations can help in effective designing and engineering of proteins with novel features. PMID:27893794
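    As a rough illustration of what locating a single amino acid repeat in a protein sequence can look like, the sketch below scans a one-letter amino acid string for runs of one residue. It is not the authors' pipeline: the function name, the example sequence, and the minimum run length of 5 are assumptions chosen for illustration, since the abstract does not state the threshold that was used.

```python
import re


def find_single_amino_acid_repeats(protein, min_len=5):
    """Return (residue, start, length) for each run of one residue.

    min_len is a hypothetical threshold; the abstract does not give one.
    start is a 0-based index into the protein string.
    """
    # ([A-Z]) captures one residue; \1{n,} requires at least n more copies.
    pattern = re.compile(r"([A-Z])\1{%d,}" % (min_len - 1))
    return [
        (m.group(1), m.start(), m.end() - m.start())
        for m in pattern.finditer(protein)
    ]


# Hypothetical sequence: a lysine run of 5 and an alanine run of 6.
example = "M" + "K" * 5 + "Q" + "A" * 6 + "P"
print(find_single_amino_acid_repeats(example))
```

    Raising min_len above 15 would restrict the scan to the longer repeats that the abstract describes as unique to the respective organism.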

  6. Regions of extreme synonymous codon selection in mammalian genes

    PubMed Central

    Schattner, Peter; Diekhans, Mark

    2006-01-01

    Recently there has been increasing evidence that purifying selection occurs among synonymous codons in mammalian genes. This selection appears to be a consequence of either cis-regulatory motifs, such as exonic splicing enhancers (ESEs), or mRNA secondary structures, being superimposed on the coding sequence of the gene. We have developed a program to identify regions likely to be enriched for such motifs by searching for extended regions of extreme codon conservation between homologous genes of related species. Here we present the results of applying this approach to five mammalian species (human, chimpanzee, mouse, rat and dog). Even with very conservative selection criteria, we find over 200 regions of extreme codon conservation, ranging in length from 60 to 178 codons. The regions are often found within genes involved in DNA-binding, RNA-binding or zinc-ion-binding. They are highly depleted for synonymous single nucleotide polymorphisms (SNPs) but not for non-synonymous SNPs, further indicating that the observed codon conservation is being driven by negative selection. Forty-three percent of the regions overlap conserved alternative transcript isoforms and are enriched for known ESEs. Other regions are enriched for TpA dinucleotides and may contain conserved motifs/structures relating to mRNA stability and/or degradation. We anticipate that this tool will be useful for detecting regions enriched in other classes of coding-sequence motifs and structures as well. PMID:16556911

  7. Signal sequence and keyword trap in silico for selection of full-length human cDNAs encoding secretion or membrane proteins from oligo-capped cDNA libraries.

    PubMed

    Otsuki, Tetsuji; Ota, Toshio; Nishikawa, Tetsuo; Hayashi, Koji; Suzuki, Yutaka; Yamamoto, Jun-ichi; Wakamatsu, Ai; Kimura, Kouichi; Sakamoto, Katsuhiko; Hatano, Naoto; Kawai, Yuri; Ishii, Shizuko; Saito, Kaoru; Kojima, Shin-ichi; Sugiyama, Tomoyasu; Ono, Tetsuyoshi; Okano, Kazunori; Yoshikawa, Yoko; Aotsuka, Satoshi; Sasaki, Naokazu; Hattori, Atsushi; Okumura, Koji; Nagai, Keiichi; Sugano, Sumio; Isogai, Takao

    2005-01-01

    We have developed an in silico method of selection of human full-length cDNAs encoding secretion or membrane proteins from oligo-capped cDNA libraries. Fullness rates were increased to about 80% by combination of the oligo-capping method and ATGpr, software for prediction of translation start point and the coding potential. Then, using 5'-end single-pass sequences, cDNAs having the signal sequence were selected by PSORT ('signal sequence trap'). We also applied 'secretion or membrane protein-related keyword trap' based on the result of BLAST search against the SWISS-PROT database for the cDNAs which could not be selected by PSORT. Using the above procedures, 789 cDNAs were primarily selected and subjected to full-length sequencing, and 334 of these cDNAs were finally selected as novel. Most of the cDNAs (295 cDNAs: 88.3%) were predicted to encode secretion or membrane proteins. In particular, 165 (80.5%) of the 205 cDNAs selected by PSORT were predicted to have signal sequences, while 70 (54.2%) of the 129 cDNAs selected by 'keyword trap' preserved the secretion or membrane protein-related keywords. Many important cDNAs were obtained, including transporters, receptors, and ligands, involved in significant cellular functions. Thus, an efficient method of selecting secretion or membrane protein-encoding cDNAs was developed by combining the above four procedures.

  8. The first mitochondrial genome for the butterfly family Riodinidae (Abisara fylloides) and its systematic implications.

    PubMed

    Zhao, Fang; Huang, Dun-Yuan; Sun, Xiao-Yan; Shi, Qing-Hui; Hao, Jia-Sheng; Zhang, Lan-Lan; Yang, Qun

    2013-10-01

    The Riodinidae is one of the lepidopteran butterfly families. This study describes the complete mitochondrial genome of the butterfly species Abisara fylloides, the first mitochondrial genome of the Riodinidae family. The results show that the entire mitochondrial genome of A. fylloides is 15 301 bp in length, and contains 13 protein-coding genes, 2 ribosomal RNA genes, 22 transfer RNA genes and a 423 bp A+T-rich region. The gene content, orientation and order are identical to the majority of other lepidopteran insects. Phylogenetic reconstruction was conducted using the concatenated 13 protein-coding gene (PCG) sequences of 19 available butterfly species covering all five butterfly families (Papilionidae, Nymphalidae, Pieridae, Lycaenidae and Riodinidae). Both maximum likelihood and Bayesian inference analyses strongly supported the monophyly of Lycaenidae+Riodinidae, which stood as the sister group of Nymphalidae. In addition, we propose that the riodinids be categorized into the family Lycaenidae as a subfamilial taxon.

  9. Efficient full wave code for the coupling of large multirow multijunction LH grills

    NASA Astrophysics Data System (ADS)

    Preinhaelter, Josef; Hillairet, Julien; Milanesio, Daniele; Maggiora, Riccardo; Urban, Jakub; Vahala, Linda; Vahala, George

    2017-11-01

    The full wave code OLGA, for determining the coupling of a single row lower hybrid launcher (waveguide grills) to the plasma, is extended to handle multirow multijunction active passive structures (like the C3 and C4 launchers on TORE SUPRA) by implementing the scattering matrix formalism. The extended code is still computationally fast because of the use of (i) 2D splines of the plasma surface admittance in the accessibility region of the k-space, (ii) high order Gaussian quadrature rules for the integration of the coupling elements and (iii) the symmetries of the coupling elements in the multiperiodic structures. The extended OLGA code is benchmarked against the ALOHA-1D, ALOHA-2D and TOPLHA codes for the coupling of the C3 and C4 TORE SUPRA launchers for several plasma configurations derived from reflectometry and interferometry. Unlike nearly all codes (except the ALOHA-1D code), OLGA does not require large computational resources and can be used routinely in planning experimental runs. In particular, it is shown that the OLGA code correctly handles the coupling of the C3 and C4 launchers over a very wide range of plasma densities in front of the grill.

  10. Comparison of Calculations and Measurements of the Off-Axis Radiation Dose (SI) in Liquid Nitrogen as a Function of Radiation Length.

    DTIC Science & Technology

    1984-12-01

    ...various path lengths out to 2 radiation lengths. The off-axis dose in Silicon was calculated using the electron/photon transport code CYLTRAN and measured using thermal luminescent dosimeters (TLD's). Calculations were performed on a CDC-7600 computer at Los Alamos National Laboratory and measurements...

  11. Mitochondrial genomes of the jungle crow Corvus macrorhynchos (Passeriformes: Corvidae) from shed feathers and a phylogenetic analysis of genus Corvus using mitochondrial protein-coding genes.

    PubMed

    Krzeminska, Urszula; Wilson, Robyn; Rahman, Sadequr; Song, Beng Kah; Seneviratne, Sampath; Gan, Han Ming; Austin, Christopher M

    2016-07-01

    The complete mitochondrial genomes of two jungle crows (Corvus macrorhynchos) were sequenced. DNA was extracted from tissue samples obtained from shed feathers collected in the field in Sri Lanka and sequenced using the Illumina MiSeq Personal Sequencer. Jungle crow mitogenomes have a structural organization typical of the genus Corvus and are 16,927 bp and 17,066 bp in length, both comprising 13 protein-coding genes, 22 transfer RNA genes, 2 ribosomal subunit genes, and a non-coding control region. In addition, we complement already available house crow (Corvus splendens) mitogenome resources by sequencing an individual from Singapore. A phylogenetic tree constructed from Corvidae family mitogenome sequences available on GenBank is presented. We confirm the monophyly of the genus Corvus and propose to use complete mitogenome resources for further intra- and interspecies genetic studies.

  12. Optimal periodic binary codes of lengths 28 to 64

    NASA Technical Reports Server (NTRS)

    Tyler, S.; Keston, R.

    1980-01-01

    Results from computer searches performed to find repeated binary phase coded waveforms with optimal periodic autocorrelation functions are discussed. The best results for lengths 28 to 64 are given. The code features of major concern are where (1) the peak sidelobe in the autocorrelation function is small and (2) the sum of the squares of the sidelobes in the autocorrelation function is small.
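    The two figures of merit named in this abstract can be computed directly from a candidate code. The sketch below is a minimal illustration, assuming a code is given as a list of +1/-1 chips; the function names are hypothetical and the length-7 example is a standard maximal-length sequence, not one of the codes reported in the paper.

```python
def periodic_autocorrelation(code):
    """Periodic (cyclic) autocorrelation of a +1/-1 code at every shift."""
    n = len(code)
    return [
        sum(code[i] * code[(i + shift) % n] for i in range(n))
        for shift in range(n)
    ]


def sidelobe_metrics(code):
    """Return (peak sidelobe magnitude, sum of squared sidelobes).

    Shift 0 is the main peak and is excluded from both measures.
    """
    sidelobes = periodic_autocorrelation(code)[1:]
    peak = max(abs(value) for value in sidelobes)
    energy = sum(value * value for value in sidelobes)
    return peak, energy


# Length-7 maximal-length sequence, used here only as a familiar example.
m_sequence = [1, 1, 1, -1, 1, -1, -1]
print(periodic_autocorrelation(m_sequence))  # [7, -1, -1, -1, -1, -1, -1]
print(sidelobe_metrics(m_sequence))          # (1, 6)
```

    A search such as the one described would evaluate these two quantities over many candidate codes of a given length and keep the codes that minimize them.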

  13. Identification of Putative Nuclear Receptors and Steroidogenic Enzymes in Murray-Darling Rainbowfish (Melanotaenia fluviatilis) Using RNA-Seq and De Novo Transcriptome Assembly.

    PubMed

    Bain, Peter A; Papanicolaou, Alexie; Kumar, Anupama

    2015-01-01

    Murray-Darling rainbowfish (Melanotaenia fluviatilis [Castelnau, 1878]; Atheriniformes: Melanotaeniidae) is a small-bodied teleost currently under development in Australasia as a test species for aquatic toxicological studies. To date, efforts towards the development of molecular biomarkers of contaminant exposure have been hindered by the lack of available sequence data. To address this, we sequenced messenger RNA from brain, liver and gonads of mature male and female fish and generated a high-quality draft transcriptome using a de novo assembly approach. 149,742 clusters of putative transcripts were obtained, encompassing 43,841 non-redundant protein-coding regions. Deduced amino acid sequences were annotated by functional inference based on similarity with sequences from manually curated protein sequence databases. The draft assembly contained protein-coding regions homologous to 95.7% of the complete cohort of predicted proteins from the taxonomically related species, Oryzias latipes (Japanese medaka). The mean length of rainbowfish protein-coding sequences relative to their medaka homologues was 92.1%, indicating that despite the limited number of tissues sampled a large proportion of the total expected number of protein-coding genes was captured in the study. Because of our interest in the effects of environmental contaminants on endocrine pathways, we manually curated subsets of coding regions for putative nuclear receptors and steroidogenic enzymes in the rainbowfish transcriptome, revealing 61 candidate nuclear receptors encompassing all known subfamilies, and 41 putative steroidogenic enzymes representing all major steroidogenic enzymes occurring in teleosts. The transcriptome presented here will be a valuable resource for researchers interested in biomarker development, protein structure and function, and contaminant-response genomics in Murray-Darling rainbowfish.

  14. Bidirectional motility of the fission yeast kinesin-5, Cut7

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Edamatsu, Masaki, E-mail: cedam@mail.ecc.u-tokyo.ac.jp

    Highlights: • Motile properties of Cut7 (fission yeast kinesin-5) were studied for the first time. • Half-length Cut7 moved toward plus-end direction of microtubule. • Full-length Cut7 moved toward minus-end direction of microtubule. • N- and C-terminal microtubule binding sites did not switch the motile direction. - Abstract: Kinesin-5 is a homotetrameric motor with its motor domain at the N-terminus. Kinesin-5 crosslinks microtubules and functions in separating spindle poles during mitosis. In this study, the motile properties of Cut7, fission yeast kinesin-5, were examined for the first time. In in vitro motility assays, full-length Cut7 moved toward the minus-end of microtubules, but the N-terminal half of Cut7 moved in the opposite direction. Furthermore, additional truncated constructs lacking the N-terminal or C-terminal regions, but still containing the motor domain, did not switch the motile direction. These results indicated that Cut7 was a bidirectional motor, and microtubule binding regions at the N-terminus and C-terminus were not involved in its directionality.

  15. Species-Specific TT Viruses and Cross-Species Infection in Nonhuman Primates

    PubMed Central

    Okamoto, Hiroaki; Fukuda, Masako; Tawara, Akio; Nishizawa, Tsutomu; Itoh, Yukio; Hayasaka, Ikuo; Tsuda, Fumio; Tanaka, Takeshi; Miyakawa, Yuzo; Mayumi, Makoto

    2000-01-01

    Viruses resembling human TT virus (TTV) were searched for in sera from nonhuman primates by PCR with primers deduced from well-conserved areas in the untranslated region. TTV DNA was detected in 102 (98%) of 104 chimpanzees, 9 (90%) of 10 Japanese macaques, 4 (100%) of 4 red-bellied tamarins, 5 (83%) of 6 cotton-top tamarins, and 5 (100%) of 5 douroucoulis tested. Analysis of the amplification products of 90 to 106 nucleotides revealed TTV DNA sequences specific for each species, with a decreasing similarity to human TTV in the order of chimpanzee, Japanese macaque, and tamarin/douroucouli TTVs. Full-length viral sequences were amplified by PCR with inverted nested primers deduced from the untranslated region of TTV DNA from each species. All animal TTVs were found to be circular with a genomic length at 3.5 to 3.8 kb, which was comparable to or slightly shorter than human TTV. Sequences closely similar to human TTV were determined by PCR with primers deduced from a coding region (N22 region) and were detected in 49 (47%) of the 104 chimpanzees; they were not found in any animals of the other species. Sequence analysis of the N22 region (222 to 225 nucleotides) of chimpanzee TTV DNAs disclosed four genetic groups that differed by 36.1 to 50.2% from one another; they were 35.0 to 52.8% divergent from any of the 16 genotypes of human TTV. Of the 104 chimpanzees, only 1 was viremic with human TTV of genotype 1a. It was among the 53 chimpanzees which had been used in transmission experiments with human hepatitis viruses. Antibody to TTV of genotype 1a was detected significantly more frequently in the chimpanzees that had been used in transmission experiments than in those that had not (8 of 28 [29%] and 3 of 35 [9%], respectively; P = 0.038). These results indicate that species-specific TTVs are prevalent in nonhuman primates and that human TTV can cross-infect chimpanzees. PMID:10627523

  16. Two Upper Bounds for the Weighted Path Length of Binary Trees. Report No. UIUCDCS-R-73-565.

    ERIC Educational Resources Information Center

    Pradels, Jean Louis

    Rooted binary trees with weighted nodes are structures encountered in many areas, such as coding theory, searching and sorting, information storage and retrieval. The path length is a meaningful quantity which gives indications about the expected time of a search or the length of a code, for example. In this paper, two sharp bounds for the total…
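    For readers unfamiliar with the quantity being bounded, the sketch below computes a weighted path length for a small binary tree with weighted nodes. It assumes the common convention that the root sits at depth 0 and that the total is the sum over all nodes of weight times depth; the report itself may use a different convention (for example, root at level 1), and the Node class and function name are illustrative only.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Node:
    weight: int
    left: Optional["Node"] = None
    right: Optional["Node"] = None


def weighted_path_length(node, depth=0):
    """Sum of weight * depth over all nodes (assumed convention: root depth 0)."""
    if node is None:
        return 0
    return (
        node.weight * depth
        + weighted_path_length(node.left, depth + 1)
        + weighted_path_length(node.right, depth + 1)
    )


# Root (5) with children 3 and 4; the node weighted 3 has one child weighted 2.
tree = Node(5, left=Node(3, left=Node(2)), right=Node(4))
print(weighted_path_length(tree))  # 5*0 + 3*1 + 4*1 + 2*2 = 11
```

    Dividing the result by the total weight gives an average depth, which is the sense in which the path length indicates the expected cost of a search.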

  17. GINGER simulations of short-pulse effects in the LEUTL FEL

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Huang, Z.; Fawley, W.M.

    While the long-pulse, coasting beam model is often used in analysis and simulation of self-amplified spontaneous emission (SASE) free-electron lasers (FELs), many current SASE demonstration experiments employ relatively short electron bunches whose pulse length is on the order of the radiation slippage length. In particular, the low-energy undulator test line (LEUTL) FEL at the Advanced Photon Source has recently lased and nominally saturated in both visible and near-ultraviolet wavelength regions with a sub-ps pulse length that is somewhat shorter than the total slippage length in the 22-m undulator system. In this paper we explore several characteristics of the short pulse regime for SASE FELs with the multidimensional, time-dependent simulation code GINGER, concentrating on making a direct comparison with the experimental results from LEUTL. Items of interest include the radiation gain length, pulse energy, saturation position, and spectral bandwidth. We address the importance of short-pulse effects when scaling the LEUTL results to proposed x-ray FELs and also briefly discuss the possible importance of coherent spontaneous emission at startup.

  18. Pool-based genome-wide association study identified novel candidate regions on BTA9 and 14 for oleic acid percentage in Japanese Black cattle.

    PubMed

    Kawaguchi, Fuki; Kigoshi, Hiroto; Nakajima, Ayaka; Matsumoto, Yuta; Uemoto, Yoshinobu; Fukushima, Moriyuki; Yoshida, Emi; Iwamoto, Eiji; Akiyama, Takayuki; Kohama, Namiko; Kobayashi, Eiji; Honda, Takeshi; Oyama, Kenji; Mannen, Hideyuki; Sasazaki, Shinji

    2018-05-17

    Fatty acid composition is an important indicator of beef quality. The objective of this study was to search the potential candidate region for fatty acid composition. We performed pool-based genome-wide association studies (GWAS) for oleic acid percentage (C18:1) in a Japanese Black cattle population from the Hyogo prefecture. GWAS analysis revealed two novel candidate regions on BTA9 and BTA14. The most significant single nucleotide polymorphisms (SNPs) in each region were genotyped in a population (n = 899) to verify their effect on C18:1. Statistical analysis revealed that both SNPs were significantly associated with C18:1 (p = .0080 and .0003), validating the quantitative trait loci (QTLs) detected in GWAS. We subsequently selected VNN1 and LYPLA1 genes as candidate genes from each region on BTA9 and BTA14, respectively. We sequenced full-length coding sequence (CDS) of these genes in eight individuals and identified a nonsynonymous SNP T66M on VNN1 gene as a putative candidate polymorphism. The polymorphism was also significantly associated with C18:1, but the p value (p = .0162) was higher than the most significant SNP on BTA9, suggesting that it would not be responsible for the QTL. Although further investigation will be needed to determine the responsible gene and polymorphism, our findings would contribute to development of selective markers for fatty acid composition in the Japanese Black cattle of Hyogo. © 2018 Japanese Society of Animal Science.

  19. Seabream ghrelin: cDNA cloning, genomic organization and promoter studies.

    PubMed

    Yeung, Chung-Man; Chan, Chi-Bun; Woo, Norman Y S; Cheng, Christopher H K

    2006-05-01

    Recent studies have indicated that ghrelin stimulates growth hormone release from the pituitary via the growth hormone secretagogue receptor (GHSR). We have previously isolated two GHSR subtypes from the pituitary of the black seabream Acanthopagrus schlegeli. In the present study, we have cloned and characterized ghrelin from the same fish species at both the cDNA and gene levels. The full-length seabream ghrelin cDNA, isolated from seabream stomach using a novel approach by exploiting a single conserved region in the coding region, was found to encode a prepropeptide of 107 amino acids, with the predicted mature ghrelin peptide consisting of 20 amino acids (GSSFLSPSQKPQNRGKSSRV). Embedded in this full-length cDNA is a putative fish orthologue of the recently reported mammalian obestatin peptide. The ghrelin gene in black seabream, obtained by genomic PCR, was found to encompass four exons and three introns, possessing the same structural organization as in tilapia and goldfish, but different from that in rainbow trout. In addition, a 2230-bp 5'-flanking region of the seabream ghrelin gene was obtained by genome walking. Sequence analysis revealed that, as in the case of the human ghrelin gene, there is neither a GC box nor a CAAT box present in the isolated 5'-flanking region. However, a number of putative transcription factor-binding sites different from the human counterpart were found in the 5'-flanking region of the seabream ghrelin gene, suggesting that different cis- and trans-acting elements are involved in controlling their gene expression. Functional activity of this 5'-flanking region was examined by cloning it into the pGL3-Basic vector upstream of the luciferase reporter gene and transfecting the construct into various cell lines. Positive promoter activity could only be recorded in the colon-derived Caco-2 cells, suggesting that the cloned 5'-flanking region represents the functional promoter of the seabream ghrelin gene, which exhibits tissue-specific promoter activity. Using reverse transcriptase PCR analysis, expression of ghrelin was detected only in the seabream stomach, but not in the other tissues examined, including the brain, gill, intestine, kidney, liver and spleen. This stomach-specific expression of ghrelin in seabream is subject to regulation, as administration of growth hormone or ipamorelin to the fish in vivo was demonstrated to enhance its expression. Reminiscent of the homologous upregulation found in the transcriptional control of the seabream GHSR gene, a similar homologous regulatory mechanism might also exist in controlling the expression of seabream ghrelin. The identification of both GHSR and ghrelin from a single fish species would facilitate our subsequent studies on the elucidation of the physiological functions of the ghrelin/GHSR system in teleost. The possible existence of obestatin in teleost opens up new research avenues on the somatotropic axis in fish.

  20. Identification of a natural intergenotypic recombinant hepatitis delta virus genotype 1 and 2 in Vietnamese HBsAg-positive patients.

    PubMed

    Sy, B T; Nguyen, H M; Toan, N L; Song, L H; Tong, H V; Wolboldt, C; Binh, V Q; Kremsner, P G; Velavan, T P; Bock, C-T

    2015-01-01

    Hepatitis D virus (HDV) infection is acquired as a co- /superinfection of Hepatitis B virus (HBV) and can modulate the pathophysiology of chronic hepatitis B and related liver diseases including hepatocellular carcinoma. Among the eight distinct HDV genotypes reported, relatively few studies have attempted to investigate the prevalence of HDV mixed genotypes and RNA recombination of HDV. With a recorded prevalence of 10-20% HBV infection in Vietnam, this study investigated the HDV variability, HDV genotypes and HDV recombination among twenty-one HDV isolates in Vietnamese HBsAg-positive patients. HDV subgenomic and full-length genome sequences were obtained using newly established HDV-specific RT-PCR techniques. The nucleotide homology was observed from 74.6% to 99.4% among the investigated full-length genome of the HDV isolates. We observed HDV genotype 1 and HDV genotype 2 in the investigated Vietnamese patients. Although no HDV genotype mixtures were observed, we report here a newly identified recombinant of HDV genotypes (HDV 1 and HDV 2). The identified recombinant HDV isolate C03 revealed sequence homology to both HDV genotype 1 (nt1 to nt907) and HDV genotype 2 (nt908 to nt1675; HDAg coding region) with a breakpoint at nt908. Our findings demonstrate the prevalence of intergenotypic recombination between HDV genotypes 1 and 2 in a Vietnamese HBsAg-positive patient. Extended investigation on the distribution and prevalence of HDV, HDV mixed genotypes and recombinant HDV genotypes in a larger Vietnamese population offers vital insights into understanding of the micro-epidemiology of HDV and subsequent pathophysiology in chronic HBV- /HDV-related liver diseases. © 2014 John Wiley & Sons Ltd.

  1. Analysis of the 5′ untranslated region (5′UTR) of the alcohol oxidase 1 (AOX1) gene in recombinant protein expression in Pichia pastoris

    PubMed Central

    Staley, Chris A.; Huang, Amy; Nattestad, Maria; Oshiro, Kristin T.; Ray, Laura E.; Mulye, Tejas; Li, Zhiguo Harry; Le, Thu; Stephens, Justin J.; Gomez, Seth R.; Moy, Allison D.; Nguyen, Jackson C.; Franz, Andreas H.; Lin-Cereghino, Joan; Lin-Cereghino, Geoff P.

    2012-01-01

    Pichia pastoris is a methylotrophic yeast that has been genetically engineered to express over one thousand heterologous proteins valued for industrial, pharmaceutical and basic research purposes. In most cases, the 5′ untranslated region (UTR) of the alcohol oxidase 1 (AOX1) gene is fused to the coding sequence of the recombinant gene for protein expression in this yeast. Because the effect of the AOX1 5′UTR on protein expression is not known, site-directed mutagenesis was performed in order to decrease or increase the length of this region. Both of these types of changes were shown to affect translational efficiency, not transcript stability. While increasing the length of the 5′UTR clearly decreased expression of a β-galactosidase reporter in a proportional manner, a deletion analysis demonstrated that the AOX1 5′UTR contains a complex mixture of both positive and negative cis-acting elements, suggesting that the construction of a synthetic 5′UTR optimized for a higher level of expression may be challenging. PMID:22285974

  2. Variational learning and bits-back coding: an information-theoretic view to Bayesian learning.

    PubMed

    Honkela, Antti; Valpola, Harri

    2004-07-01

    The bits-back coding first introduced by Wallace in 1990 and later by Hinton and van Camp in 1993 provides an interesting link between Bayesian learning and information-theoretic minimum-description-length (MDL) learning approaches. The bits-back coding allows interpreting the cost function used in the variational Bayesian method called ensemble learning as a code length in addition to the Bayesian view of misfit of the posterior approximation and a lower bound of model evidence. Combining these two viewpoints provides interesting insights to the learning process and the functions of different parts of the model. In this paper, the problem of variational Bayesian learning of hierarchical latent variable models is used to demonstrate the benefits of the two views. The code-length interpretation provides new views to many parts of the problem such as model comparison and pruning and helps explain many phenomena occurring in learning.

  3. Carbohydrate degrading polypeptide and uses thereof

    DOEpatents

    Sagt, Cornelis Maria Jacobus; Schooneveld-Bergmans, Margot Elisabeth Francoise; Roubos, Johannes Andries; Los, Alrik Pieter

    2015-10-20

    The invention relates to a polypeptide having carbohydrate material degrading activity which comprises the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1 or SEQ ID NO: 4, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 96% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 96% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional protein and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.

  4. Full-f version of GENE for turbulence in open-field-line systems

    NASA Astrophysics Data System (ADS)

    Pan, Q.; Told, D.; Shi, E. L.; Hammett, G. W.; Jenko, F.

    2018-06-01

    Unique properties of plasmas in the tokamak edge, such as large amplitude fluctuations and plasma-wall interactions in the open-field-line regions, require major modifications of existing gyrokinetic codes originally designed for simulating core turbulence. To this end, the global version of the 3D2V gyrokinetic code GENE, so far employing a δf-splitting technique, is extended to simulate electrostatic turbulence in straight open-field-line systems. The major extensions are the inclusion of the velocity-space nonlinearity, the development of a conducting-sheath boundary, and the implementation of the Lenard-Bernstein collision operator. With these developments, the code can be run as a full-f code and can handle particle loss to and reflection from the wall. The extended code is applied to modeling turbulence in the Large Plasma Device (LAPD), with a reduced mass ratio and a much lower collisionality. Similar to turbulence in a tokamak scrape-off layer, LAPD turbulence involves collisions, parallel streaming, cross-field turbulent transport with steep profiles, and particle loss at the parallel boundary.

  5. VCSEL End-Pumped Passively Q-Switched Nd:YAG Laser with Adjustable Pulse Energy

    DTIC Science & Technology

    2011-02-28

    entire VCSEL array. Neglecting lens aberrations, the focused spot diameter is given by focal length of the lens times the full divergence angle of the...pump intensity distribution generated by a pump-light-focusing lens . ©2011 Optical Society of America OCIS codes: (140.3530) Lasers Neodymium...Passive Q-Switch and Brewster Plate in a Pulsed Nd: YAG Laser,” IEEE J. Quantum Electron. 31(10), 1738–1741 (1995). 6. G. Xiao, and M. Bass, “A

  6. Huntingtin gene evolution in Chordata and its peculiar features in the ascidian Ciona genus

    PubMed Central

    Gissi, Carmela; Pesole, Graziano; Cattaneo, Elena; Tartari, Marzia

    2006-01-01

    Background: To gain insight into the evolutionary features of the huntingtin (htt) gene in Chordata, we have sequenced and characterized the full-length htt mRNA in the ascidian Ciona intestinalis, a basal chordate emerging as new invertebrate model organism. Moreover, taking advantage of the availability of genomic and EST sequences, the htt gene structure of a number of chordate species, including the congeneric ascidian Ciona savignyi, and the vertebrates Xenopus and Gallus was reconstructed. Results: The C. intestinalis htt transcript exhibits some peculiar features, such as spliced leader trans-splicing in the 98 nt-long 5' untranslated region (UTR), an alternative splicing in the coding region, eight alternative polyadenylation sites, and no similarities of both 5' and 3'UTRs compared to homologs of the congeneric C. savignyi. The predicted protein is 2946 amino acids long, shorter than its vertebrate homologs, and lacks the polyQ and the polyP stretches found in the N-terminal regions of mammalian homologs. The exon-intron organization of the htt gene is almost identical among vertebrates, and significantly conserved between Ciona and vertebrates, allowing us to hypothesize an ancestral chordate gene consisting of at least 40 coding exons. Conclusion: During chordate diversification, events of gain/loss, sliding, phase changes, and expansion of introns occurred in both vertebrate and ascidian lineages predominantly in the 5'-half of the htt gene, where there is also evidence of lineage-specific evolutionary dynamics in vertebrates. On the contrary, the 3'-half of the gene is highly conserved in all chordates at the level of both gene structure and protein sequence. Between the two Ciona species, a fast evolutionary rate and/or an early divergence time is suggested by the absence of significant similarity between UTRs, protein divergence comparable to that observed between mammals and fishes, and different distribution of repetitive elements. PMID:17092333

  7. Short-term memory coding in children with intellectual disabilities.

    PubMed

    Henry, Lucy

    2008-05-01

    To examine visual and verbal coding strategies, I asked children with intellectual disabilities and peers matched for MA and CA to perform picture memory span tasks with phonologically similar, visually similar, long, or nonsimilar named items. The CA group showed effects consistent with advanced verbal memory coding (phonological similarity and word length effects). Neither the intellectual disabilities nor MA groups showed evidence for memory coding strategies. However, children in these groups with MAs above 6 years showed significant visual similarity and word length effects, broadly consistent with an intermediate stage of dual visual and verbal coding. These results suggest that developmental progressions in memory coding strategies are independent of intellectual disabilities status and consistent with MA.

  8. Truncation Depth Rule-of-Thumb for Convolutional Codes

    NASA Technical Reports Server (NTRS)

    Moision, Bruce

    2009-01-01

    In this innovation, it is shown that a commonly used rule of thumb (that the truncation depth of a convolutional code should be five times the memory length, m, of the code) is accurate only for rate 1/2 codes. In fact, the truncation depth should be 2.5 m/(1 - r), where r is the code rate. The accuracy of this new rule is demonstrated by tabulating the distance properties of a large set of known codes. This new rule was derived by bounding the losses due to truncation as a function of the code rate. With regard to particular codes, a good indicator of the required truncation depth is the path length at which all paths that diverge from a particular path have accumulated the minimum distance of the code. It is shown that the new rule of thumb provides an accurate prediction of this depth for codes of varying rates.
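
    The rule stated in this record can be evaluated directly. The sketch below is illustrative only: the function name, the choice to pass the rate as a fraction k/n, and the rounding up to a whole number of trellis stages are assumptions, not part of the record.

```python
def truncation_depth(m: int, k: int, n: int) -> int:
    """Rule-of-thumb truncation depth 2.5*m/(1 - r) for a rate r = k/n code.

    m is the memory length of the convolutional code. The result is rounded
    up to a whole number of stages (rounding is an assumption of this sketch).
    """
    if m < 1:
        raise ValueError("memory length must be at least 1")
    if not 0 < k < n:
        raise ValueError("rate k/n must lie strictly between 0 and 1")
    # 2.5*m/(1 - k/n) == 5*m*n / (2*(n - k)); integer math avoids float error.
    numerator = 5 * m * n
    denominator = 2 * (n - k)
    return -(-numerator // denominator)  # ceiling division


if __name__ == "__main__":
    # For rate 1/2 the new rule reduces to the familiar "five times m".
    print(truncation_depth(6, 1, 2))  # 30
    # Higher-rate codes need a deeper truncation window.
    print(truncation_depth(6, 3, 4))  # 60
```

    For rate 1/2 the result equals 5m, which matches the record's statement that the older rule is accurate only at that rate.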

  9. The complete chloroplast genome sequence of the medicinal plant Andrographis paniculata.

    PubMed

    Ding, Ping; Shao, Yanhua; Li, Qian; Gao, Junli; Zhang, Runjing; Lai, Xiaoping; Wang, Deqin; Zhang, Huiye

    2016-07-01

    The complete chloroplast genome of Andrographis paniculata, an important medicinal plant with great economic value, has been studied in this article. The genome is 150,249 bp in length, with 38.3% GC content. A pair of inverted repeats (IRs, 25,300 bp) are separated by a large single-copy region (LSC, 82,459 bp) and a small single-copy region (SSC, 17,190 bp). The chloroplast genome contains 114 unique genes, including 80 protein-coding genes, 30 tRNA genes and 4 rRNA genes. Of these genes, 15 contain 1 intron and 3 contain 2 introns.
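
    The region sizes reported in this record can be checked against the stated genome length, since a quadripartite chloroplast genome consists of one LSC, one SSC and two IR copies. The helper below is a minimal sketch; its name and signature are hypothetical.

```python
def quadripartite_length(lsc_bp: int, ssc_bp: int, ir_bp: int) -> int:
    """Total length of a quadripartite genome: LSC + SSC + two IR copies."""
    if min(lsc_bp, ssc_bp, ir_bp) < 0:
        raise ValueError("region lengths must be non-negative")
    return lsc_bp + ssc_bp + 2 * ir_bp


if __name__ == "__main__":
    # Values taken from the Andrographis paniculata record above.
    total = quadripartite_length(lsc_bp=82_459, ssc_bp=17_190, ir_bp=25_300)
    print(total)             # 150249
    print(total == 150_249)  # True: the parts sum to the reported genome size
```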

  10. Complete mitochondrial genome sequence of northeastern sika deer (Cervus nippon hortulorum).

    PubMed

    Shao, Yuanchen; Zha, Daiming; Xing, Xiumei; Su, Weilin; Liu, Huamiao; Zhang, Ranran

    2016-01-01

    The complete mitochondrial genome of the northeastern sika deer, Cervus nippon hortulorum, was determined by accurate polymerase chain reaction. The entire genome is 16,434 bp in length and contains 13 protein-coding genes, 2 rRNA genes, 22 tRNA genes and 1 control region, all of which are arranged in a typical vertebrate manner. The overall base composition of the northeastern sika deer's mitochondrial genome is 33.3% of A, 24.5% of C, 28.7% of T and 13.5% of G. A termination associated sequence and several conserved central sequence block domains were discovered within the control region.

  11. The complete mitochondrial genome of Lota lota (Gadiformes: Gadidae) from the Burqin River in China.

    PubMed

    Lu, Zhichuang; Zhang, Nan; Song, Na; Gao, Tianxiang

    2016-05-01

    In this study, the complete mitochondrial genome (mitogenome) sequence of Lota lota has been determined by long polymerase chain reaction and primer walking methods. The mitogenome is a circular molecule of 16,519 bp in length and contains 37 mitochondrial genes, including 13 protein-coding genes, 2 ribosomal RNA (rRNA) genes and 22 transfer RNA (tRNA) genes, as well as a control region, as in other bony fishes. Within the control region, we identified the termination-associated sequence domain (TAS), the central conserved sequence block domains (CSB-F and CSB-D), and the conserved sequence block domains (CSB-1, CSB-2 and CSB-3).

  12. Complete mitochondrial genome of Chuanzhong black goat in southwest of China (Capra hircus).

    PubMed

    Huang, Yong-Fu; Chen, Li-Peng; Zhao, Yong-Ju; Zhang, Hao; Na, Ri-Su; Zhao, Zhong-Quan; Zhang, Jia-Hua; Jiang, Cao-De; Ma, Yue-Hui; Sun, Ya-Wang; E, Guang-Xin

    2016-09-01

    The Chuanzhong black goat (Capra hircus) is a breed native to southwest China. Its complete mitochondrial genome is 16,641 nt in length, consisting of 13 protein-coding genes, 22 transfer RNA (tRNA) genes, two ribosomal RNA (rRNA) genes, and a non-coding control region. As in other mammals, most mitochondrial genes are encoded on the heavy strand, except for ND6 and eight tRNA genes, which are encoded on the light strand. Its overall base composition is A: 33.5%, T: 27.3%, C: 26.1%, and G: 13.1%. The complete mitogenome of this Chinese indigenous goat breed could provide basic data for further phylogenetic analysis.

  13. Investigation on a coupled CFD/DSMC method for continuum-rarefied flows

    NASA Astrophysics Data System (ADS)

    Tang, Zhenyu; He, Bijiao; Cai, Guobiao

    2012-11-01

    The purpose of the present work is to investigate the coupled CFD/DSMC method using the existing CFD and DSMC codes developed by the authors. The interface between the continuum and particle regions is determined by the gradient-length local Knudsen number. A coupling scheme combining both state-based and flux-based coupling methods is proposed in the current study. Overlapping grids are established between the different grid systems of CFD and DSMC codes. A hypersonic flow over a 2D cylinder has been simulated using the present coupled method. Comparison has been made between the results obtained from both methods, which shows that the coupled CFD/DSMC method can achieve the same precision as the pure DSMC method and obtain higher computational efficiency.
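
    The interface criterion mentioned in this record can be illustrated with a small one-dimensional sketch. It assumes the commonly used definition Kn_GLL = (λ/Q)|dQ/dx| for a flow quantity Q such as density or temperature, and a breakdown threshold of 0.05; the record states neither, and the function names, the central-difference gradient and the sample profile are all hypothetical.

```python
from typing import List, Sequence


def kn_gll(q: Sequence[float], mean_free_path: Sequence[float], dx: float) -> List[float]:
    """Gradient-length local Knudsen number on a uniform 1D grid.

    Uses central differences in the interior and one-sided differences at
    the two ends. q must be non-zero everywhere.
    """
    n = len(q)
    if n < 2 or len(mean_free_path) != n or dx <= 0:
        raise ValueError("need matching arrays of length >= 2 and dx > 0")
    result = []
    for i in range(n):
        lo, hi = max(i - 1, 0), min(i + 1, n - 1)
        gradient = (q[hi] - q[lo]) / ((hi - lo) * dx)
        result.append(mean_free_path[i] * abs(gradient) / abs(q[i]))
    return result


def particle_region(kn: Sequence[float], threshold: float = 0.05) -> List[bool]:
    """Flag cells where the continuum description is assumed to break down."""
    return [value > threshold for value in kn]


if __name__ == "__main__":
    density = [1.0, 1.0, 1.0, 2.0, 4.0, 4.0, 4.0]  # made-up shock-like profile
    lam = [0.03] * len(density)                    # made-up mean free path
    flags = particle_region(kn_gll(density, lam, dx=0.1))
    print(flags)  # cells around the steep gradient are flagged for DSMC
```

    In a coupled solver the flagged cells would be handed to the particle method and the rest to the continuum solver; how the overlap region is built is beyond what this sketch covers.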

  14. Complete mitochondrial genome of the Asian pencil halfbeak Hyporhamphus intermedius (Beloniformes, Hemirhamphidae).

    PubMed

    Song, Chao; Hu, Gengdong; Qiu, Liping; Fan, Limin; Meng, Shunlong; Chen, Jiazhang

    2016-11-01

    The complete mitochondrial genome of Hyporhamphus intermedius was determined to be 16,720 bp in length with (A + T) content of 56.3%, and it consists of 13 protein-coding genes, 22 tRNAs, two ribosomal RNAs, and a control region. The gene composition and the structural arrangement of the H. intermedius complete mtDNA were identical to those of most other vertebrates. Interestingly, two tandem repeat units were identified across tRNA-Pro and the control region (2*41 bp), while in most fishes the tandem repeat units are located in the control region. The molecular data presented here could be useful for studying the evolutionary relationships and population genetics of Hemirhamphidae fish.

  15. The complete chloroplast genome of salt cress (Eutrema salsugineum).

    PubMed

    Guo, Xinyi; Hao, Guoqian; Ma, Tao

    2016-07-01

    The complete chloroplast (cp) sequence of the salt cress (Eutrema salsugineum), a plant well-adapted to salt stress, is presented in this study. The circular molecule is 153,407 bp in length and exhibits a typical quadripartite structure containing an 83,894 bp large single copy (LSC) region, a 17,607 bp small single copy (SSC) region, and two 25,953 bp inverted repeats (IRs). The salt cress cp genome contains 135 known genes, including 87 protein-coding genes, 8 ribosomal RNA genes, and 40 tRNA genes; 21 of these are located in the inverted repeat region. As expected, phylogenetic analysis supports the idea that E. salsugineum is sister to Brassiceae species within the Brassicaceae family.

  16. Real-time transmission of digital video using variable-length coding

    NASA Technical Reports Server (NTRS)

    Bizon, Thomas P.; Shalkhauser, Mary JO; Whyte, Wayne A., Jr.

    1993-01-01

    Huffman coding is a variable-length lossless compression technique where data with a high probability of occurrence is represented with short codewords, while 'not-so-likely' data is assigned longer codewords. Compression is achieved when the high-probability levels occur so frequently that their benefit outweighs any penalty paid when a less likely input occurs. One instance where Huffman coding is extremely effective occurs when data is highly predictable and differential coding can be applied (as with a digital video signal). For that reason, it is desirable to apply this compression technique to digital video transmission; however, special care must be taken in order to implement a communication protocol utilizing Huffman coding. This paper addresses several of the issues relating to the real-time transmission of Huffman-coded digital video over a constant-rate serial channel. Topics discussed include data rate conversion (from variable to a fixed rate), efficient data buffering, channel coding, recovery from communication errors, decoder synchronization, and decoder architectures. A description of the hardware developed to execute Huffman coding and serial transmission is also included. Although this paper focuses on matters relating to Huffman-coded digital video, the techniques discussed can easily be generalized for a variety of applications which require transmission of variable-length data.
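
    The codeword assignment described in this record can be sketched in a few lines. This is a generic textbook Huffman construction, not the hardware design the paper describes; the function names and the sample differential data are made up for illustration.

```python
import heapq
from collections import Counter
from typing import Dict, Hashable, Iterable


def huffman_code(symbols: Iterable[Hashable]) -> Dict[Hashable, str]:
    """Build a prefix-free code: frequent symbols get short codewords."""
    freq = Counter(symbols)
    if not freq:
        return {}
    if len(freq) == 1:
        return {next(iter(freq)): "0"}  # a lone symbol still needs one bit
    # Heap entries: (count, unique tie-breaker, partial code table).
    heap = [(count, i, {sym: ""}) for i, (sym, count) in enumerate(freq.items())]
    heapq.heapify(heap)
    tie = len(heap)
    while len(heap) > 1:
        c1, _, t1 = heapq.heappop(heap)  # two least likely subtrees
        c2, _, t2 = heapq.heappop(heap)
        merged = {s: "0" + code for s, code in t1.items()}
        merged.update({s: "1" + code for s, code in t2.items()})
        heapq.heappush(heap, (c1 + c2, tie, merged))
        tie += 1
    return heap[0][2]


def encode(symbols: Iterable[Hashable], code: Dict[Hashable, str]) -> str:
    """Concatenate codewords into one variable-length bit string."""
    return "".join(code[s] for s in symbols)


if __name__ == "__main__":
    # Differentially coded samples cluster around zero, as with video data.
    diffs = [0] * 12 + [1] * 3 + [-1] * 3 + [2, -2]
    table = huffman_code(diffs)
    bits = encode(diffs, table)
    print(len(table[0]))  # 1: the most likely value gets the shortest codeword
    print(len(bits))      # 35 bits, versus 60 for a fixed 3-bit code
```

    The output length varies with the data, which is why the record's topics include rate conversion and buffering before a constant-rate channel.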

  17. The next-generation ESL continuum gyrokinetic edge code

    NASA Astrophysics Data System (ADS)

    Cohen, R.; Dorr, M.; Hittinger, J.; Rognlien, T.; Collela, P.; Martin, D.

    2009-05-01

    The Edge Simulation Laboratory (ESL) project is developing continuum-based approaches to kinetic simulation of edge plasmas. A new code is being developed, based on a conservative formulation and fourth-order discretization of full-f gyrokinetic equations in parallel-velocity, magnetic-moment coordinates. The code exploits mapped multiblock grids to deal with the geometric complexities of the edge region, and utilizes a new flux limiter [P. Colella and M.D. Sekora, JCP 227, 7069 (2008)] to suppress unphysical oscillations about discontinuities while maintaining high-order accuracy elsewhere. The code is just becoming operational; we will report initial tests for neoclassical orbit calculations in closed-flux surface and limiter (closed plus open flux surfaces) geometry. It is anticipated that the algorithmic refinements in the new code will address the slow numerical instability that was observed in some long simulations with the existing TEMPEST code. We will also discuss the status and plans for physics enhancements to the new code.

  18. The complete mitochondrial genome of the American black flour beetle Tribolium audax (Coleoptera: Tenebrionidae).

    PubMed

    Ou, Jing; Liu, Jin-Bo; Yao, Fu-Jiao; Wang, Xin-Guo; Wei, Zhao-Ming

    2016-01-01

    Flour beetles of the genus Tribolium are all pests of stored products and cause severe economic losses every year. The American black flour beetle Tribolium audax is one of the important pest species of flour beetle, and it is also an important quarantine insect. Here we sequenced and characterized the complete mitochondrial genome of T. audax, which was intercepted by Huangpu Customs in maize from America. The complete circular mitochondrial genome (mitogenome) of T. audax was 15,924 bp in length, containing 37 typical coding genes and one non-coding AT-rich region. The mitogenome of T. audax exhibits a gene arrangement and content identical to the most common type in insects. All protein-coding genes (PCGs) start with a typical ATN initiation codon, except for cox1, which uses AAC as its start codon instead of ATN. Eleven genes use standard complete termination codons (nine TAA, two TAG), whereas the nad4 and nad5 genes end with a single T. Except for trnS1 (AGN), all tRNA genes display typical secondary cloverleaf structures like those of other insects. The sizes of the large and small ribosomal RNA genes are 1288 and 780 bp, respectively. The AT content of the AT-rich region is 81.36%. The 5 bp conserved motif TACTA was found in the intergenic region between trnS2 (UCN) and nad1.

  19. Dual CRISPR-Cas9 Cleavage Mediated Gene Excision and Targeted Integration in Yarrowia lipolytica.

    PubMed

    Gao, Difeng; Smith, Spencer; Spagnuolo, Michael; Rodriguez, Gabriel; Blenner, Mark

    2018-05-29

    CRISPR-Cas9 technology has been successfully applied in Yarrowia lipolytica for targeted genomic editing including gene disruption and integration; however, disruptions by existing methods typically result from small frameshift mutations caused by indels within the coding region, which usually result in unnatural proteins. In this study, a dual cleavage strategy directed by paired sgRNAs is developed for gene knockout. This method allows fast and robust gene excision, demonstrated on six genes of interest. The targeted regions for excision vary in length from 0.3 kb up to 3.5 kb and contain both non-coding and coding regions. The majority of the gene excisions are repaired by perfect nonhomologous end-joining without indels. Based on this dual cleavage system, two targeted markerless integration methods are developed by providing repair templates. While both strategies are effective, the homology-mediated end joining (HMEJ)-based method is twice as efficient as the homologous recombination (HR)-based method. In both cases, dual cleavage leads to similar or improved gene integration efficiencies compared to gene excision without integration. This dual cleavage strategy will be useful not only for generating more predictable and robust gene knockouts, but also for efficient targeted markerless integration, and simultaneous knockout and integration in Y. lipolytica. © 2018 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  20. A candidate gene for choanal atresia in alpaca.

    PubMed

    Reed, Kent M; Bauer, Miranda M; Mendoza, Kristelle M; Armién, Aníbal G

    2010-03-01

    Choanal atresia (CA) is a common nasal craniofacial malformation in New World domestic camelids (alpaca and llama). CA results from abnormal development of the nasal passages and is especially debilitating to newborn crias. CA in camelids shares many of the clinical manifestations of a similar condition in humans (CHARGE syndrome). Herein we report on the regulatory gene CHD7 of alpaca, whose homologue in humans is most frequently associated with CHARGE. Sequence of the CHD7 coding region was obtained from a non-affected cria. The complete coding region was 9003 bp, corresponding to a translated amino acid sequence of 3000 aa. Additional genomic sequences corresponding to a significant portion of the CHD7 gene were identified and assembled from the 2x alpaca whole genome sequence, providing confirmatory sequence for much of the CHD7 coding region. The alpaca CHD7 mRNA sequence was 97.9% similar to the human sequence, with the greatest sequence difference being an insertion in exon 38 that results in a polyalanine repeat (A12). Polymorphism in this repeat was tested for association with CA in alpaca by cloning and sequencing the repeat from both affected and non-affected individuals. Variation in length of the poly-A repeat was not associated with CA. Complete sequencing of the CHD7 gene will be necessary to determine whether other mutations in CHD7 are the cause of CA in camelids.
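
    The relation between the reported coding-region length and protein length in this record can be worked through as follows. The sketch assumes that the 9003 bp figure includes the terminal stop codon, which the record does not state explicitly; the function name and parameter are hypothetical.

```python
def protein_length_from_cds(cds_length_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by a coding region of the given length.

    Each codon is 3 nt; if the stop codon is counted in the coding-region
    length (assumed by default), it contributes no amino acid.
    """
    if cds_length_bp <= 0 or cds_length_bp % 3 != 0:
        raise ValueError("coding-region length must be a positive multiple of 3")
    codons = cds_length_bp // 3
    return codons - 1 if includes_stop else codons


if __name__ == "__main__":
    # 9003 bp -> 3001 codons -> 3000 aa once the stop codon is excluded.
    print(protein_length_from_cds(9003))  # 3000
```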

  1. Isolation and characterization of full-length cDNA clones coding for cholinesterase from fetal human tissues

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Prody, C.A.; Zevin-Sonkin, D.; Gnatt, A.

    1987-06-01

    To study the primary structure and regulation of human cholinesterases, oligodeoxynucleotide probes were prepared according to a consensus peptide sequence present in the active site of both human serum pseudocholinesterase and Torpedo electric organ true acetylcholinesterase. Using these probes, the authors isolated several cDNA clones from λgt10 libraries of fetal brain and liver origins. These include 2.4-kilobase cDNA clones that code for a polypeptide containing a putative signal peptide and the N-terminal, active site, and C-terminal peptides of human BtChoEase, suggesting that they code either for BtChoEase itself or for a very similar but distinct fetal form of cholinesterase. In RNA blots of poly(A)+ RNA from the cholinesterase-producing fetal brain and liver, these cDNAs hybridized with a single 2.5-kilobase band. Blot hybridization to human genomic DNA revealed that these fetal BtChoEase cDNA clones hybridize with DNA fragments of the total length of 17.5 kilobases, and signal intensities indicated that these sequences are not present in many copies. Both the cDNA-encoded protein and its nucleotide sequence display striking homology to parallel sequences published for Torpedo AcChoEase. These findings demonstrate extensive homologies between the fetal BtChoEase encoded by these clones and other cholinesterases of various forms and species.

  2. The complete mitochondrial genome of Gobiobotia filifer (Teleostei, Cypriniformes: Cyprinidae).

    PubMed

    Li, Qiang; Liu, Ya; Zhou, Jian; Gong, Quan; Li, Hua; Lai, Jiansheng; Li, Lianman

    2016-09-01

    Gobiobotia filifer is a small, economically important fish distributed in the upper reaches of the Yangtze River and its distributaries. Because of environmental pollution and overfishing, its population has declined drastically in recent decades, so it is essential to protect this resource. In this study, the complete mitochondrial genome sequence of G. filifer was determined by PCR; it is 16,613 bp in total length and contains 13 protein-coding genes, 22 tRNA genes, two rRNA genes, and a non-coding control region. The order and composition of genes were similar to those of most other teleost fish. Most of the genes were encoded on the heavy strand, except for the ND6 gene and eight tRNA genes. As in most other vertebrates, a bias in G and C content was found in different genes/regions. The complete mitochondrial genome sequence of G. filifer will contribute to a better understanding of the evolution and population genetics of this lineage, and will help administrative departments make rules and laws to protect it.

  3. The complete mitochondrial genome of Liobagrus marginatus (Teleostei, Siluriformes: Amblycipitidae).

    PubMed

    Li, Qiang; Du, Jun; Liu, Ya; Zhou, Jian; Ke, Hongyu; Liu, Chao; Liu, Guangxun

    2014-04-01

    Liobagrus marginatus is an economically important fish distributed in the upper reaches of the Yangtze River and its distributaries. Because of its fresh taste, environmental pollution and overfishing, its population has declined drastically and its body size has decreased in recent decades, so it is essential to protect this resource. In this study, the complete mitochondrial genome of Liobagrus marginatus was sequenced; it is 16,497 bp in total length and contains 22 tRNA genes, 13 protein-coding genes, 2 rRNA genes, and a non-coding control region. The gene arrangement and composition are similar to those of most other fish. Most of the genes are encoded on the heavy strand, except for eight tRNA genes and the ND6 gene. As in most other vertebrates, a bias in G and C content was found in the statistics for different genes/regions. The complete mitochondrial genome sequence of Liobagrus marginatus will contribute to a better understanding of the population genetics and evolution of this lineage, and will help administrative departments make rules and laws to protect it.

  4. Sequence Analysis of Mitochondrial Genome of Toxascaris leonina from a South China Tiger.

    PubMed

    Li, Kangxin; Yang, Fang; Abdullahi, A Y; Song, Meiran; Shi, Xianli; Wang, Minwei; Fu, Yeqi; Pan, Weida; Shan, Fang; Chen, Wu; Li, Guoqing

    2016-12-01

    Toxascaris leonina is a common parasitic nematode of wild mammals and has significant impacts on the protection of rare wild animals. To analyze population genetic characteristics of T. leonina from South China tiger, its mitochondrial (mt) genome was sequenced. Its complete circular mt genome was 14,277 bp in length, including 12 protein-coding genes, 22 tRNA genes, 2 rRNA genes, and 2 non-coding regions. The nucleotide composition was biased toward A and T. The most common start codon and stop codon were TTG and TAG, and 4 genes ended with an incomplete stop codon. There were 13 intergenic regions ranging 1 to 10 bp in size. Phylogenetically, T. leonina from a South China tiger was close to canine T. leonina. This study reports for the first time a complete mt genome sequence of T. leonina from the South China tiger, and provides a scientific basis for studying the genetic diversity of nematodes between different hosts.

  5. A Solution NMR Investigation into the Murine Amelogenin Splice-Variant LRAP (Leucine-Rich Amelogenin Protein).

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Buchko, Garry W.; Tarasevich, Barbara J.; Roberts, Jacky

    2010-09-01

    Amelogenins are the dominant proteins present in ameloblasts during the early stages of enamel biomineralization, making up >90% of the matrix protein. Along with the full-length protein there are several splice-variant isoforms of amelogenin present including LRAP (Leucine-Rich Amelogenin Protein), a protein that consists of the first 33 and the last 26 residues of full-length amelogenin. Using solution-state NMR spectroscopy we have assigned the 1H-15N HSQC spectrum of murine LRAP (rp(H)LRAP) in 2% acetic acid at pH 3.0 by making extensive use of previous chemical shift assignments for full-length murine amelogenin (rp(H)M180). This correlation was possible because LRAP, like the full-length protein, is intrinsically disordered under these solution conditions. The major difference between the 1H-15N HSQC spectra of rp(H)M180 and rp(H)LRAP was an additional set of amide resonances for each of the seven non-proline residues between S12* and Y12 at the N-terminus of rp(H)LRAP indicating that the N-terminal region of LRAP exists in two different conformations. Analysis of the proline carbon chemical shifts suggests that the molecular basis for the two states is not a cis-trans isomerization of one or more of the proline residues in the N-terminal region and is likely due to a slow exchange process. As observed with rp(H)M180, residue specific changes in molecular dynamics, manifested by the reduction in intensity and disappearance of 1H-15N HSQC cross peaks, were observed with the addition of NaCl to rp(H)LRAP. These perturbations may signal early events governing supramolecular self-assembly of rp(H)LRAP into nanospheres. However, the different pattern of 1H-15N HSQC cross peak perturbation between rp(H)LRAP and rp(H)M180 in high salt suggests that the termini may behave differently in their respective nanospheres, and perhaps, these differences account for the cell signaling properties attributable to LRAP but not the full-length protein.

  6. Evaluation of large girth LDPC codes for PMD compensation by turbo equalization.

    PubMed

    Minkov, Lyubomir L; Djordjevic, Ivan B; Xu, Lei; Wang, Ting; Kueppers, Franko

    2008-08-18

    Large-girth quasi-cyclic LDPC codes have been experimentally evaluated for use in PMD compensation by turbo equalization for a 10 Gb/s NRZ optical transmission system, observing one sample per bit. The net effective coding gain improvement for the girth-10, rate 0.906 code of length 11936 over the maximum a posteriori probability (MAP) detector, for a differential group delay of 125 ps, is 6.25 dB at a BER of 10^-6. The girth-10 LDPC code of rate 0.8 outperforms the girth-10 code of rate 0.906 by 2.75 dB, and provides a net effective coding gain improvement of 9 dB at the same BER. It is experimentally determined that girth-10 LDPC codes of length around 15000 approach the channel capacity limit within 1.25 dB.

  7. The random coding bound is tight for the average code.

    NASA Technical Reports Server (NTRS)

    Gallager, R. G.

    1973-01-01

    The random coding bound of information theory provides a well-known upper bound to the probability of decoding error for the best code of a given rate and block length. The bound is constructed by upperbounding the average error probability over an ensemble of codes. The bound is known to give the correct exponential dependence of error probability on block length for transmission rates above the critical rate, but it gives an incorrect exponential dependence at rates below a second lower critical rate. Here we derive an asymptotic expression for the average error probability over the ensemble of codes used in the random coding bound. The result shows that the weakness of the random coding bound at rates below the second critical rate is due not to upperbounding the ensemble average, but rather to the fact that the best codes are much better than the average at low rates.
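
    For orientation, the bound discussed in this abstract is usually stated in exponent form. The notation below is the standard textbook one and is an assumption here, since the abstract itself gives no formulas: a discrete memoryless channel with transition probabilities P(j|k), an input distribution Q, rate R in nats per channel use, and block length N.

```latex
\bar{P}_e \;\le\; \exp\{-N\,E_r(R)\}, \qquad
E_r(R) \;=\; \max_{0 \le \rho \le 1}\, \max_{Q}\, \bigl[\,E_0(\rho, Q) - \rho R\,\bigr],
\qquad
E_0(\rho, Q) \;=\; -\ln \sum_{j} \Bigl[\, \sum_{k} Q(k)\, P(j \mid k)^{1/(1+\rho)} \Bigr]^{1+\rho}.
```

    Here \bar{P}_e is the error probability averaged over the code ensemble; the abstract's claim concerns how tightly this exponent describes that ensemble average at low rates.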

  8. Error Control Coding Techniques for Space and Satellite Communications

    NASA Technical Reports Server (NTRS)

    Costello, Daniel J., Jr.; Takeshita, Oscar Y.; Cabral, Hermano A.

    1998-01-01

    It is well known that the BER performance of a parallel concatenated turbo-code improves roughly as 1/N, where N is the information block length. However, it has been observed by Benedetto and Montorsi that for most parallel concatenated turbo-codes, the FER performance does not improve monotonically with N. In this report, we study the FER of turbo-codes, and the effects of their concatenation with an outer code. Two methods of concatenation are investigated: across several frames and within each frame. Some asymmetric codes are shown to have excellent FER performance with an information block length of 16384. We also show that the proposed outer coding schemes can improve the BER performance as well by eliminating pathological frames generated by the iterative MAP decoding process.

  9. Computer search for binary cyclic UEP codes of odd length up to 65

    NASA Technical Reports Server (NTRS)

    Lin, Mao-Chao; Lin, Chi-Chang; Lin, Shu

    1990-01-01

    Using an exhaustive computation, the unequal error protection capabilities of all binary cyclic codes of odd length up to 65 that have minimum distances at least 3 are found. For those codes that can only have upper bounds on their unequal error protection capabilities computed, an analytic method developed by Dynkin and Togonidze (1976) is used to show that the upper bounds meet the exact unequal error protection capabilities.

  10. A Novel Domain Assembly Routine for Creating Full-Length Models of Membrane Proteins from Known Domain Structures.

    PubMed

    Koehler Leman, Julia; Bonneau, Richard

    2018-04-03

    Membrane proteins composed of soluble and membrane domains are often studied one domain at a time. However, to understand the biological function of entire protein systems and their interactions with each other and drugs, knowledge of full-length structures or models is required. Although few computational methods exist that could potentially be used to model full-length constructs of membrane proteins, none of these methods are perfectly suited for the problem at hand. Existing methods require an interface or knowledge of the relative orientations of the domains or are not designed for domain assembly, and none of them are developed for membrane proteins. Here we describe the first domain assembly protocol specifically designed for membrane proteins that assembles intra- and extracellular soluble domains and the transmembrane domain into models of the full-length membrane protein. Our protocol does not require an interface between the domains and samples possible domain orientations based on backbone dihedrals in the flexible linker regions, created via fragment insertion, while keeping the transmembrane domain fixed in the membrane. For five examples tested, our method mp_domain_assembly, implemented in RosettaMP, samples domain orientations close to the known structure and is best used in conjunction with experimental data to reduce the conformational search space.

  11. Structural Determinants Underlying Constitutive Dimerization of Unoccupied Human Follitropin Receptors

    PubMed Central

    Guan, Rongbin; Wu, Xueqing; Feng, Xiuyan; Zhang, Meilin; Hébert, Terence E.; Segaloff, Deborah L.

    2009-01-01

    The human follitropin receptor (hFSHR) is a G protein-coupled receptor (GPCR) central to reproductive physiology that is composed of an extracellular domain (ECD) fused to a serpentine region. Using bioluminescence resonance energy transfer (BRET) in living cells, we show that hFSHR dimers form constitutively during their biosynthesis. Mutations in TM1 and TM4 had no effect on hFSHR dimerization, alone or when combined with mutation of Tyr110 in the ECD, a residue predicted to mediate dimerization of the soluble hormone-binding portion of the ECD complexed with FSH (Q. Fan and W. Hendrickson, Nature 433:269–277, 2005). Expressed individually, the serpentine region and a membrane-anchored form of the hFSHR ECD each exhibited homodimerization, suggesting that both domains contribute to dimerization of the full-length receptor. However, even in the context of only the membrane-anchored ECD, mutation of Tyr110 to alanine did not inhibit dimerization. The full-length hFSHR and the membrane-anchored ECD were then each engineered to introduce a consensus site for N-linked glycosylation at residue 110. Despite experimental validation of the presence of carbohydrate on residue 110, we failed to observe disruption of dimerization of either the full-length hFSHR or membrane-anchored ECD containing the inserted glycan wedge. Taken altogether, our data suggest that both the serpentine region and the ECD contribute to hFSHR dimerization and that the dimerization interface of the unoccupied hFSHR does not involve Tyr110 of the ECD. PMID:19800402

  12. The complete mitochondrial genome of Endangered fish Huso dauricus (Acipenseriformes: Acipenseridae).

    PubMed

    Lu, Cuiyun; Gu, Ying; Li, Chao; Cheng, Lei; Sun, Xiaowen

    2016-01-01

    In this study, we sequenced and obtained the complete mitochondrial genome of the Kaluga (Huso dauricus) for the first time. The circular genome (16,691 bp in length) contained 13 protein-coding genes, 2 ribosomal RNA genes, 22 transfer RNA genes, and 1 control region. The overall base composition of the novel mitogenome is 30.39% for A, 24.18% for T, 29.27% for C, 16.15% for G. AT content (54.57%) is higher than the GC content.
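
    The AT and GC contents quoted above follow directly from the reported base composition. A minimal sketch of that arithmetic is given below; the function name `at_gc_content` is hypothetical, and only the four percentages come from the abstract.

```python
# Base composition of the Huso dauricus mitogenome as reported in the abstract (percent).
composition = {"A": 30.39, "T": 24.18, "C": 29.27, "G": 16.15}


def at_gc_content(comp):
    """Return (A+T, G+C) percentages from a base-composition mapping."""
    at = comp["A"] + comp["T"]
    gc = comp["G"] + comp["C"]
    # Round to two decimals to match the precision of the reported values.
    return round(at, 2), round(gc, 2)


at, gc = at_gc_content(composition)
print(f"A+T = {at:.2f}%, G+C = {gc:.2f}%")
```

    The four reported percentages sum to 99.99 rather than 100, presumably because each value was rounded independently.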

  13. The mitochondrial genome of the Arizona Snowfly Mesocapnia arizonensis (Plecoptera, Capniidae).

    PubMed

    Elbrecht, Vasco; Leese, Florian

    2016-09-01

    We assembled the mitochondrial genome of the capniid stonefly Mesocapnia arizonensis (Baumann & Gaufin, 1969) using Illumina HiSeq sequence data. The recovered mitogenome is 14,921 bp in length and includes 13 protein-coding genes, 2 ribosomal RNA genes and 22 transfer RNA genes. The control region could only be assembled partially. Gene order resembles that of basal arthropods. This is the first partial mitogenome sequence for the stonefly superfamily group Euholognatha and will be useful in future phylogenetic analyses.

  14. The complete mitochondrial genome of the diamondback moth, Plutella xylostella (Lepidoptera: Plutellidae).

    PubMed

    Dai, Li-Shang; Zhu, Bao-Jian; Qian, Cen; Zhang, Cong-Fen; Li, Jun; Wang, Lei; Wei, Guo-Qing; Liu, Chao-Liang

    2016-01-01

    The complete mitochondrial genome (mitogenome) of Plutella xylostella (Lepidoptera: Plutellidae) was determined (GenBank accession No. KM023645). The length of this mitogenome is 16,014 bp with 13 protein-coding genes (PCGs), 2 rRNA genes, 22 tRNA genes and an A + T-rich region. It presents the typical gene organization and order for completely sequenced lepidopteran mitogenomes. The nucleotide composition of the genome is highly A + T biased, accounting for 81.48%, with a slightly positive AT skewness (0.005). All PCGs are initiated by typical ATN codons, except for the gene cox1, which uses CGA as its start codon. Some PCGs harbor TA (nad5) or incomplete termination codon T (cox1, cox2, nad2 and nad4), while others use TAA as their termination codons. The A + T-rich region is located between rrnS and trnM with a length of 888 bp.
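
    The AT skewness mentioned above is conventionally computed as (A - T) / (A + T) on one strand. The sketch below applies that common definition to a short made-up fragment; the abstract does not state which formula or strand was used, so both the definition and the `toy` sequence are assumptions for illustration only.

```python
def at_skew(seq):
    """AT skew = (A - T) / (A + T), counted on the given strand."""
    seq = seq.upper()
    a, t = seq.count("A"), seq.count("T")
    if a + t == 0:
        raise ValueError("sequence contains no A or T")
    return (a - t) / (a + t)


# Hypothetical fragment, not taken from the Plutella xylostella mitogenome.
toy = "ATTAAATTTAGCATAA"
print(round(at_skew(toy), 3))
```

    A value slightly above zero, such as the 0.005 reported in the abstract, means A and T occur in almost equal numbers with a marginal excess of A.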

  15. Avian influenza virus and Newcastle disease virus (NDV) surveillance in commercial breeding farm in China and the characterization of Class I NDV isolates.

    PubMed

    Hu, Beixia; Huang, Yanyan; He, Yefeng; Xu, Chuantian; Lu, Xishan; Zhang, Wei; Meng, Bin; Yan, Shigan; Zhang, Xiumei

    2010-07-29

    In order to determine the actual prevalence of avian influenza virus (AIV) and Newcastle disease virus (NDV) in ducks in Shandong province of China, extensive surveillance studies were carried out in the breeding ducks of an intensive farm from July 2007 to September 2008. Each month cloacal and tracheal swabs were taken from 30 randomly selected birds that appeared healthy. All of the swabs were negative for influenza A virus recovery, whereas 87.5% of tracheal swabs and 100% of cloacal swabs collected in September 2007 were positive for Newcastle disease virus isolation. Several NDV isolates were recovered from tracheal and cloacal swabs of apparently healthy ducks. All of the isolates were apathogenic as determined by the MDT and ICPI. The HN gene and the variable region of F gene (nt 47-420) of four isolates selected at random were sequenced. A 374 bp region of F gene and the full length of HN gene were used for phylogenetic analysis. Four isolates were identified as the same isolate based on nucleotide sequence identities of 99.2-100%, displaying a closer phylogenetic relationship to lentogenic Class I viruses. There were 1.9-9.9% nucleotide differences between the isolates and other Class I viruses in the variable region of F gene (nt 47-420), whereas there were 38.5-41.2% nucleotide differences between the isolates and Class II viruses. The amino acid sequences of the F protein cleavage sites in these isolates were 112-ERQERL-117. The full length of HN gene of these isolates was 1851 bp, coding 585 amino acids. The homology analysis of the nucleotide sequence of HN gene indicated that there were 2.0-4.2% nucleotide differences between the isolates and other Class I viruses, whereas there were 29.5-40.9% differences between the isolates and Class II viruses. The results show that these isolates are not phylogenetically related to the vaccine strain (LaSota). 
This study adds to the understanding of the ecology of influenza viruses and Newcastle disease viruses in ducks and emphasizes the need for constant surveillance in times of an ongoing and expanding epidemic of AIV and NDV. Copyright (c) 2010 Elsevier B.V. All rights reserved.

  16. Analysis of the genome sequence of the pathogenic Muscovy duck parvovirus strain YY reveals a 14-nucleotide-pair deletion in the inverted terminal repeats.

    PubMed

    Wang, Jianye; Huang, Yu; Zhou, Mingxu; Zhu, Guoqiang

    2016-09-01

    Genomic information about Muscovy duck parvovirus (MDPV) is still limited. In this study, the genome of the pathogenic MDPV strain YY was sequenced. The full-length genome of YY is 5075 nucleotides (nt) long, 57 nt shorter than that of strain FM. Sequence alignment indicates that the 5' and 3' inverted terminal repeats (ITRs) of strain YY contain a 14-nucleotide-pair deletion in the stem of the palindromic hairpin structure in comparison to strains FM and FZ91-30. The deleted region contains one "E-box" site and one repeated motif with the sequence "TTCCGGT" or "ACCGGAA". Phylogenetic trees constructed based on the protein-coding genes concordantly showed that YY, together with nine other MDPV isolates from various places, clustered in a separate branch, distinct from the branch formed by goose parvovirus (GPV) strains. These results demonstrate that, despite the distinctive deletion, the YY strain still belongs to the classical MDPV group. Moreover, the deletion in the ITRs may contribute to the genome evolution of MDPV under immunization pressure.

  17. The complete mitochondrial genome of the Asian tapirs (Tapirus indicus): the only extant Tapiridae species in the old world.

    PubMed

    Muangkram, Yuttamol; Wajjwalku, Worawidh; Kaolim, Nongnid; Buddhakosai, Waradee; Kamolnorranath, Sumate; Siriaroonrat, Boripat; Tipkantha, Wanlaya; Dongsaard, Khwanruean; Maikaew, Umaporn; Sanannu, Saowaphang

    2016-01-01

    The Asian tapir (Tapirus indicus) is categorized as Endangered on the 2008 IUCN Red List. The first full-length mitochondrial DNA (mtDNA) sequence of the Asian tapir is 16,717 bp in length. Base composition is 34.6% A, 27.2% T, 25.8% C and 12.3% G. The most polymorphic site is in the control region, as is typical for many species.

  18. The accuracy of burn diagnosis codes in health administrative data: A validation study.

    PubMed

    Mason, Stephanie A; Nathens, Avery B; Byrne, James P; Fowler, Rob; Gonzalez, Alejandro; Karanicolas, Paul J; Moineddin, Rahim; Jeschke, Marc G

    2017-03-01

    Health administrative databases may provide rich sources of data for the study of outcomes following burn. We aimed to determine the accuracy of International Classification of Diseases diagnoses codes for burn in a population-based administrative database. Data from a regional burn center's clinical registry of patients admitted between 2006-2013 were linked to administrative databases. Burn total body surface area (TBSA), depth, mechanism, and inhalation injury were compared between the registry and administrative records. The sensitivity, specificity, and positive and negative predictive values were determined, and coding agreement was assessed with the kappa statistic. 1215 burn center patients were linked to administrative records. TBSA codes were highly sensitive and specific for ≥10 and ≥20% TBSA (89/93% sensitive and 95/97% specific), with excellent agreement (κ, 0.85/κ, 0.88). Codes were weakly sensitive (68%) in identifying ≥10% TBSA full-thickness burn, though highly specific (86%) with moderate agreement (κ, 0.46). Codes for inhalation injury had limited sensitivity (43%) but high specificity (99%) with moderate agreement (κ, 0.54). Burn mechanism had excellent coding agreement (κ, 0.84). Administrative data diagnosis codes accurately identify burn by burn size and mechanism, while identification of inhalation injury or full-thickness burns is less sensitive but highly specific. Copyright © 2016 Elsevier Ltd and ISBI. All rights reserved.
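
    The accuracy measures named in this abstract (sensitivity, specificity, predictive values, and the kappa statistic) can all be derived from a 2x2 table comparing administrative codes with the registry. The sketch below uses the standard definitions and Cohen's kappa; the counts are invented for illustration and are not the study's data, and the function name `diagnostic_metrics` is hypothetical.

```python
def diagnostic_metrics(tp, fp, fn, tn):
    """Standard 2x2 accuracy measures plus Cohen's kappa.

    tp/fp/fn/tn are counts of true positives, false positives,
    false negatives and true negatives (code vs. reference standard).
    """
    n = tp + fp + fn + tn
    observed = (tp + tn) / n  # observed agreement
    # Agreement expected by chance, from the row and column marginals.
    expected = ((tp + fp) * (tp + fn) + (fn + tn) * (fp + tn)) / n ** 2
    return {
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
        "ppv": tp / (tp + fp),
        "npv": tn / (tn + fn),
        "kappa": (observed - expected) / (1 - expected),
    }


# Hypothetical counts, chosen only to make the arithmetic easy to follow.
metrics = diagnostic_metrics(tp=40, fp=10, fn=10, tn=40)
for name, value in metrics.items():
    print(f"{name}: {value:.2f}")
```

    Kappa discounts the agreement expected by chance, which is why a code can be highly specific yet show only moderate agreement, as reported above for inhalation injury.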

  19. Performance Analysis of New Binary User Codes for DS-CDMA Communication

    NASA Astrophysics Data System (ADS)

    Usha, Kamle; Jaya Sankar, Kottareddygari

    2016-03-01

    This paper analyzes new binary spreading codes through their correlation properties and presents their performance over an additive white Gaussian noise (AWGN) channel. The proposed codes are constructed from Gray and inverse Gray codes: an n-bit Gray code is appended with its n-bit inverse Gray code to construct binary user codes of length 2n. Like Walsh codes, these binary user codes are available in sizes that are powers of two; additionally, code sets of length 6 and its even multiples are available. The simple construction technique and the generation of code sets of different sizes are the salient features of the proposed codes. Walsh codes and Gold codes are considered for comparison, as these are popularly used for synchronous and asynchronous multi-user communications, respectively. The auto- and cross-correlation properties of the proposed codes are compared with those of Walsh codes and Gold codes. The performance of the proposed binary user codes for both synchronous and asynchronous direct-sequence CDMA communication over an AWGN channel is also discussed. The proposed binary user codes are found to be suitable for both synchronous and asynchronous DS-CDMA communication.
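
    As background for the construction described above, the sketch below generates the standard n-bit binary-reflected Gray code. It is an assumption that this is the Gray code the authors start from; the abstract does not define the "inverse Gray code" or the exact appending step, so neither is reproduced here. The helper names `gray_code` and `hamming` are hypothetical.

```python
def gray_code(n):
    """Return the 2**n codewords of the n-bit binary-reflected Gray code."""
    # i ^ (i >> 1) maps the integer i to its Gray-coded value.
    return [format(i ^ (i >> 1), f"0{n}b") for i in range(2 ** n)]


def hamming(a, b):
    """Number of positions in which two equal-length codewords differ."""
    return sum(x != y for x, y in zip(a, b))


codes = gray_code(3)
print(codes)
# Successive Gray codewords differ in exactly one bit.
print(all(hamming(a, b) == 1 for a, b in zip(codes, codes[1:])))
```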

  20. Viterbi decoding for satellite and space communication.

    NASA Technical Reports Server (NTRS)

    Heller, J. A.; Jacobs, I. M.

    1971-01-01

    Convolutional coding and Viterbi decoding, along with binary phase-shift keyed modulation, is presented as an efficient system for reliable communication on power limited satellite and space channels. Performance results, obtained theoretically and through computer simulation, are given for optimum short constraint length codes for a range of code constraint lengths and code rates. System efficiency is compared for hard receiver quantization and 4 and 8 level soft quantization. The effects on performance of varying of certain parameters relevant to decoder complexity and cost are examined. Quantitative performance degradation due to imperfect carrier phase coherence is evaluated and compared to that of an uncoded system. As an example of decoder performance versus complexity, a recently implemented 2-Mbit/sec constraint length 7 Viterbi decoder is discussed. Finally a comparison is made between Viterbi and sequential decoding in terms of suitability to various system requirements.
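
    To make the decoding scheme in this abstract concrete, here is a minimal hard-decision Viterbi decoder. It is a sketch under stated assumptions, not the system evaluated in the report: it uses the small textbook rate-1/2, constraint-length-3 code with generators (7, 5) octal rather than the constraint-length-7 decoder mentioned above, hard decisions instead of soft quantization, and a zero-terminated trellis. All names are hypothetical.

```python
K = 3                    # constraint length (assumed; the report discusses K = 7)
G = (0b111, 0b101)       # generator polynomials (7, 5) in octal
N_STATES = 1 << (K - 1)


def parity(x):
    """Parity (XOR) of the bits of x."""
    return bin(x).count("1") & 1


def encode(bits):
    """Rate-1/2 convolutional encoder; appends K-1 zero tail bits."""
    state, out = 0, []
    for b in list(bits) + [0] * (K - 1):
        reg = (b << (K - 1)) | state          # newest bit in the top position
        out.extend(parity(reg & g) for g in G)
        state = reg >> 1                      # shift register advances
    return out


def viterbi_decode(received):
    """Hard-decision Viterbi decoding using Hamming-distance branch metrics."""
    inf = float("inf")
    metrics = [0] + [inf] * (N_STATES - 1)    # encoder starts in state 0
    paths = [[] for _ in range(N_STATES)]
    for i in range(0, len(received), 2):
        r = received[i:i + 2]
        new_metrics = [inf] * N_STATES
        new_paths = [None] * N_STATES
        for state in range(N_STATES):
            if metrics[state] == inf:
                continue                      # state not yet reachable
            for b in (0, 1):
                reg = (b << (K - 1)) | state
                expected = [parity(reg & g) for g in G]
                nxt = reg >> 1
                m = metrics[state] + sum(e != x for e, x in zip(expected, r))
                if m < new_metrics[nxt]:      # keep the survivor path
                    new_metrics[nxt] = m
                    new_paths[nxt] = paths[state] + [b]
        metrics, paths = new_metrics, new_paths
    return paths[0][:-(K - 1)]                # end in state 0, drop tail bits


message = [1, 0, 1, 1, 0, 0, 1]
coded = encode(message)
coded[3] ^= 1                                 # inject a single channel error
decoded = viterbi_decode(coded)
print(decoded == message)
```

    Storing whole survivor paths keeps the sketch short; practical decoders, such as the hardware implementation the abstract refers to, typically use a bounded traceback memory instead.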

  1. Ovine mitochondrial DNA sequence variation and its association with production and reproduction traits within an Afec-Assaf flock.

    PubMed

    Reicher, S; Seroussi, E; Weller, J I; Rosov, A; Gootwine, E

    2012-07-01

    Polymorphisms in mitochondrial DNA (mtDNA) protein- and tRNA-coding genes were shown to be associated with various diseases in humans as well as with production and reproduction traits in livestock. Alignment of full length mitochondria sequences from the 5 known ovine haplogroups: HA (n = 3), HB (n = 5), HC (n = 3), HD (n = 2), and HE (n = 2; GenBank accession nos. HE577847-50 and 11 published complete ovine mitochondria sequences) revealed sequence variation in 10 out of the 13 protein coding mtDNA sequences. Twenty-six of the 245 variable sites found in the protein coding sequences represent non-synonymous mutations. Sequence variation was observed also in 8 out of the 22 tRNA mtDNA sequences. On the basis of the mtDNA control region and cytochrome b partial sequences along with information on maternal lineages within an Afec-Assaf flock, 1,126 Afec-Assaf ewes were assigned to mitochondrial haplogroups HA, HB, and HC, with frequencies of 0.43, 0.43, and 0.14, respectively. Analysis of birth weight and growth rate records of lamb (n = 1286) and productivity from 4,993 lambing records revealed no association between mitochondrial haplogroup affiliation and female longevity, lambs perinatal survival rate, birth weight, and daily growth rate of lambs up to 150 d that averaged 1,664 d, 88.3%, 4.5 kg, and 320 g/d, respectively. However, significant (P < 0.0001) differences among the haplogroups were found for prolificacy of ewes, with prolificacies (mean ± SE) of 2.14 ± 0.04, 2.25 ± 0.04, and 2.30 ± 0.06 lamb born/ewe lambing for the HA, HB, and the HC haplogroups, respectively. Our results highlight the ovine mitogenome genetic variation in protein- and tRNA coding genes and suggest that sequence variation in ovine mtDNA is associated with variation in ewe prolificacy.

  2. Integrating De Novo Transcriptome Assembly and Cloning to Obtain Chicken Ovocleidin-17 Full-Length cDNA

    PubMed Central

    Ning, ZhongHua; Hincke, Maxwell T.; Yang, Ning; Hou, ZhuoCheng

    2014-01-01

    Efficiently obtaining full-length cDNA for a target gene is the key step for functional studies and probing genetic variations. However, almost all sequenced domestic animal genomes are not ‘finished’. Many functionally important genes are located in these gapped regions. It can be difficult to obtain full-length cDNA for which only partial amino acid/EST sequences exist. In this study we report a general pipeline to obtain full-length cDNA, and illustrate this approach for one important gene (Ovocleidin-17, OC-17) that is associated with chicken eggshell biomineralization. Chicken OC-17 is one of the best candidates to control and regulate the deposition of calcium carbonate in the calcified eggshell layer. OC-17 protein has been purified, sequenced, and has had its three-dimensional structure solved. However, researchers still cannot conduct OC-17 mRNA related studies because the mRNA sequence is unknown and the gene is absent from the current chicken genome. We used RNA-Seq to obtain the entire transcriptome of the adult hen uterus, and then conducted de novo transcriptome assembling with bioinformatics analysis to obtain candidate OC-17 transcripts. Based on this sequence, we used RACE and PCR cloning methods to successfully obtain the full-length OC-17 cDNA. Temporal and spatial OC-17 mRNA expression analyses were also performed to demonstrate that OC-17 is predominantly expressed in the adult hen uterus during the laying cycle and barely at immature developmental stages. Differential uterine expression of OC-17 was observed in hens laying eggs with weak versus strong eggshell, confirming its important role in the regulation of eggshell mineralization and providing a new tool for genetic selection for eggshell quality parameters. This study is the first one to report the full-length OC-17 cDNA sequence, and builds a foundation for OC-17 mRNA related studies. 
We provide a general method for biologists experiencing difficulty in obtaining candidate gene full-length cDNA sequences. PMID:24676480

  3. Integrating de novo transcriptome assembly and cloning to obtain chicken Ovocleidin-17 full-length cDNA.

    PubMed

    Zhang, Quan; Liu, Long; Zhu, Feng; Ning, ZhongHua; Hincke, Maxwell T; Yang, Ning; Hou, ZhuoCheng

    2014-01-01

    Efficiently obtaining full-length cDNA for a target gene is the key step for functional studies and probing genetic variations. However, almost all sequenced domestic animal genomes are not 'finished'. Many functionally important genes are located in these gapped regions. It can be difficult to obtain full-length cDNA for which only partial amino acid/EST sequences exist. In this study we report a general pipeline to obtain full-length cDNA, and illustrate this approach for one important gene (Ovocleidin-17, OC-17) that is associated with chicken eggshell biomineralization. Chicken OC-17 is one of the best candidates to control and regulate the deposition of calcium carbonate in the calcified eggshell layer. OC-17 protein has been purified, sequenced, and has had its three-dimensional structure solved. However, researchers still cannot conduct OC-17 mRNA related studies because the mRNA sequence is unknown and the gene is absent from the current chicken genome. We used RNA-Seq to obtain the entire transcriptome of the adult hen uterus, and then conducted de novo transcriptome assembling with bioinformatics analysis to obtain candidate OC-17 transcripts. Based on this sequence, we used RACE and PCR cloning methods to successfully obtain the full-length OC-17 cDNA. Temporal and spatial OC-17 mRNA expression analyses were also performed to demonstrate that OC-17 is predominantly expressed in the adult hen uterus during the laying cycle and barely at immature developmental stages. Differential uterine expression of OC-17 was observed in hens laying eggs with weak versus strong eggshell, confirming its important role in the regulation of eggshell mineralization and providing a new tool for genetic selection for eggshell quality parameters. This study is the first one to report the full-length OC-17 cDNA sequence, and builds a foundation for OC-17 mRNA related studies. 
We provide a general method for biologists experiencing difficulty in obtaining candidate gene full-length cDNA sequences.

  4. SAW correlator spread spectrum receiver

    DOEpatents

    Brocato, Robert W

    2014-04-01

    A surface acoustic wave (SAW) correlator spread-spectrum (SS) receiver is disclosed which utilizes a first demodulation stage with a chip length n and a second demodulation stage with a chip length m to decode a transmitted SS signal having a code length l = n × m which can be very long (e.g. up to 2000 chips or more). The first demodulation stage utilizes a pair of SAW correlators which demodulate the SS signal to generate an appropriate code sequence at an intermediate frequency which can then be fed into the second demodulation stage which can be formed from another SAW correlator, or by a digital correlator. A compound SAW correlator comprising two input transducers and a single output transducer is also disclosed which can be used to form the SAW correlator SS receiver, or for use in processing long code length signals.

  5. Construction and characterization of HIV type 1 CRF07_BC infectious molecular clone from men who have sex with men.

    PubMed

    Jiang, Yan-Ling; Bai, Wen-Wei; Qu, Fan-Wei; Ma, Hua; Jiang, Run-Sheng; Shen, Bao-Sheng

    2016-03-01

    This study aimed to investigate the biological characterization of HIV type 1 (HIV-1) CRF07_BC infection among men who have sex with men (MSM). From November 2011 to November 2013, a total of 66 blood samples were collected from MSM with acute HIV-1 infection with CRF07_BC subgroup strains. Deletion in the gag p6 region was detected by sequence alignment and comparative analysis. Peripheral blood mononuclear cells (PBMCs) of HNXX1301-1307 samples were separated by density gradient centrifugation. Nested polymerase chain reaction (nPCR) was used to amplify the viral DNA. The near full-length HIV-1 DNA products were ligated to the long terminal repeat (LTR) vector plasmid (07BCLTR) to construct a full-length HIV clone. The molecular clone was transfected into HEK-293T cells, TZM-b1 cells and patients' PBMCs. The pregenome of an infectious molecular clone of HIV-1 (pNL4-3) was amplified, and a subclone with CRF07_BC was developed to construct the full-length chimeric molecular clone pNL4-3/07BCLTR. Detection of p24 antigen and luciferase activity was used to measure the in vitro infectivity of pNL4-3/07BCLTR. Among the 66 MSM patients infected with CRF07_BC strains, deletion mutations of the Gag P6 proteins were found in 7 of 18CRF07_BC strains; deletion mutations of 2-13 amino acids in different regions were discovered in 6 strains; and the remaining 42 strains did not show deletions. Seven strains with amino acids deficiency in the P6 protein accounted for 27% of all strains and 75% of all deletion genotype strains. A total of 186 full-length molecular clones of CRF07_BC were constructed. There were 5, 9, 10 and 11 clones of HNXX1302, HNXX1304, HNXX1305 and HNXX1306 that resulted in p24-positive supernatant when transfected into HEK-293T cells. Full-length clones of HNXX1302, HNXX1304, HNXX1305 and HNXX1306 showed slight infection in the transfected TZM-b1 cells, as judged by the fluorescence values of TZM-b1 cells 48h post-transfection. 
However, we were unable to transfect the patients' PBMCs with the above four clones. The phylogenetic tree of the C2V3 segment of the Env gene showed that a significant gene cluster was formed by all of the chimeric full-length HNXX1306 clones, and the bootstrap value for this cluster was 97.5%. Patients' PBMCs could be infected by 1306N6, 1306N13 and 1306N22 chimeric full-length clones. The CRF07_BC subtype (6889-7407 nucleotide residues of HXB2) is one of the most prevalent epidemic HIV-1 virus strains among the MSM population. The full-length chimeric molecular clone pNL4-3/07BCLTR may significantly improve the in vitro infectivity of the CRF07_BC strain. Copyright © 2016 Elsevier B.V. All rights reserved.

  6. An accurate evaluation of the performance of asynchronous DS-CDMA systems with zero-correlation-zone coding in Rayleigh fading

    NASA Astrophysics Data System (ADS)

    Walker, Ernest; Chen, Xinjia; Cooper, Reginald L.

    2010-04-01

    An arbitrarily accurate approach is used to determine the bit-error rate (BER) performance for generalized asynchronous DS-CDMA systems in Gaussian noise with Rayleigh fading. In this paper, and the sequel, new theoretical work has been contributed which substantially enhances existing performance analysis formulations. Major contributions include: substantial computational complexity reduction, including a priori BER accuracy bounding; and an analytical approach that facilitates performance evaluation for systems with arbitrary spectral spreading distributions and non-uniform transmission delay distributions. Using prior results, augmented by these enhancements, a generalized DS-CDMA system model is constructed and used to evaluate the BER performance in a variety of scenarios. In this paper, the generalized system modeling was used to evaluate the performance of both Walsh-Hadamard (WH) and Walsh-Hadamard-seeded zero-correlation-zone (WH-ZCZ) coding. The selection of these codes was informed by the observation that WH codes contain N spectral spreading values (0 to N - 1), one for each code sequence, while WH-ZCZ codes contain only two spectral spreading values (N/2 - 1, N/2), where N is the sequence length in chips. Since these codes span the spectral spreading range for DS-CDMA coding, by invoking an induction argument, the generalization of the system model is sufficiently supported. The results in this paper, and the sequel, support the claim that an arbitrarily accurate performance analysis for DS-CDMA systems can be evaluated over the full range of binary coding, with minimal computational complexity.

  7. High-Affinity Rb Binding, p53 Inhibition, Subcellular Localization, and Transformation by Wild-Type or Tumor-Derived Shortened Merkel Cell Polyomavirus Large T Antigens

    PubMed Central

    Borchert, Sophie; Czech-Sioli, Manja; Neumann, Friederike; Schmidt, Claudia; Wimmer, Peter; Dobner, Thomas

    2014-01-01

    ABSTRACT Interference with tumor suppressor pathways by polyomavirus-encoded tumor antigens (T-Ags) can result in transformation. Consequently, it is thought that T-Ags encoded by Merkel cell polyomavirus (MCPyV), a virus integrated in ∼90% of all Merkel cell carcinoma (MCC) cases, are major contributors to tumorigenesis. The MCPyV large T-Ag (LT-Ag) has preserved the key functional domains present in all family members but has also acquired unique regions that flank the LxCxE motif. As these regions may mediate unique functions, or may modulate those shared with T-Ags of other polyomaviruses, functional studies of MCPyV T-Ags are required. Here, we have performed a comparative study of full-length or MCC-derived truncated LT-Ags with regard to their biochemical characteristics, their ability to bind to retinoblastoma (Rb) and p53 proteins, and their transforming potential. We provide evidence that full-length MCPyV LT-Ag may not directly bind to p53 but nevertheless can significantly reduce p53-dependent transcription in reporter assays. Although early region expression constructs harboring either full-length or MCC-derived truncated LT-Ag genes can transform primary baby rat kidney cells, truncated LT-Ags do not bind to p53 or reduce p53-dependent transcription. Interestingly, shortened LT-Ags exhibit a very high binding affinity for Rb, as shown by coimmunoprecipitation and in vitro binding studies. Additionally, we show that truncated MCPyV LT-Ag proteins are expressed at higher levels than those for the wild-type protein and are able to partially relocalize Rb to the cytoplasm, indicating that truncated LT proteins may have gained additional features that distinguish them from the full-length protein. IMPORTANCE MCPyV is one of the 12 known polyomaviruses that naturally infect humans. Among these, it is of particular interest since it is the only human polyomavirus known to be involved in tumorigenesis. 
MCPyV is thought to be causally linked to MCC, a rare skin tumor. In these tumors, viral DNA is monoclonally integrated into the genome of the tumor cells in up to 90% of all MCC cases, and the integrated MCV genomes, furthermore, harbor signature mutations in the so-called early region that selectively abrogate viral replication while preserving cell cycle deregulating functions of the virus. This study describes comparative studies of early region T-Ag protein characteristics, their ability to bind to Rb and p53, and their transforming potential. PMID:24371076

  8. Complete mitochondrial genomes of Trisidos kiyoni and Potiarca pilula: Varied mitochondrial genome size and highly rearranged gene order in Arcidae

    PubMed Central

    Sun, Shao’e; Li, Qi; Kong, Lingfeng; Yu, Hong

    2016-01-01

    We present the complete mitochondrial genomes (mitogenomes) of Trisidos kiyoni and Potiarca pilula, both important species from the family Arcidae (Arcoida: Arcacea). Typical bivalve mtDNA features were described, such as the relatively conserved gene number (36 and 37), a high A + T content (62.73% and 61.16%), the preference for A + T-rich codons, and the evidence of non-optimal codon usage. The mitogenomes of Arcidae species are exceptional for their extraordinarily large and variable sizes and substantial gene rearrangements. The mitogenomes of T. kiyoni (19,614 bp) and P. pilula (28,470 bp) are the two smallest Arcidae mitogenomes. The compact mitogenomes are weakly associated with gene number and primarily reflect shrinkage of the non-coding regions. The varied sizes of Arcidae mitogenomes reflect a dynamic history of expansion. A significant positive correlation is observed between mitogenome size and the combined length of cox1-3, the length of Cytb, and the combined length of rRNAs (rrnS and rrnL) (P < 0.001). Rearrangements of both protein-coding genes (PCGs) and tRNAs are observed in the P. pilula and T. kiyoni mitogenomes. This analysis implies that the complicated gene rearrangement in the mitochondrial genome could be considered one of the key characters in inferring higher-level phylogenetic relationships of Arcidae. PMID:27653979

  9. The complete chloroplast genome sequence of strawberry (Fragaria × ananassa Duch.) and comparison with related species of Rosaceae

    PubMed Central

    Cheng, Hui; Li, Jinfeng; Zhang, Hong; Cai, Binhua; Gao, Zhihong

    2017-01-01

    Compared with other members of the family Rosaceae, the chloroplast genomes of Fragaria species exhibit low variation, and this situation has limited phylogenetic analyses; thus, complete chloroplast genome sequencing of Fragaria species is needed. In this study, we sequenced the complete chloroplast genome of F. × ananassa ‘Benihoppe’ using the Illumina HiSeq 2500-PE150 platform and then performed a combination of de novo assembly and reference-guided mapping of contigs to generate complete chloroplast genome sequences. The chloroplast genome exhibits a typical quadripartite structure with a pair of inverted repeats (IRs, 25,936 bp) separated by large (LSC, 85,531 bp) and small (SSC, 18,146 bp) single-copy (SC) regions. The length of the F. × ananassa ‘Benihoppe’ chloroplast genome is 155,549 bp, representing the smallest Fragaria chloroplast genome observed to date. The genome encodes 112 unique genes, comprising 78 protein-coding genes, 30 tRNA genes and four rRNA genes. Comparative analysis of the overall nucleotide sequence identity among ten complete chloroplast genomes confirmed that for both coding and non-coding regions in Rosaceae, SC regions exhibit higher sequence variation than IRs. The Ka/Ks ratio of most genes was less than 1, suggesting that most genes are under purifying selection. Moreover, the mVISTA results also showed a high degree of conservation in genome structure, gene order and gene content in Fragaria, particularly among three octoploid strawberries which were F. × ananassa ‘Benihoppe’, F. chiloensis (GP33) and F. virginiana (O477). However, when the sequences of the coding and non-coding regions of F. × ananassa ‘Benihoppe’ were compared in detail with those of F. chiloensis (GP33) and F. virginiana (O477), a number of SNPs and InDels were revealed by MEGA 7. 
Six non-coding regions (trnK-matK, trnS-trnG, atpF-atpH, trnC-petN, trnT-psbD and trnP-psaJ) with a percentage of variable sites greater than 1% and no less than five parsimony-informative sites were identified and may be useful for phylogenetic analysis of the genus Fragaria. PMID:29038765
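
    The region sizes reported in the abstract above can be cross-checked with a little arithmetic: in a quadripartite chloroplast genome the total length is the LSC plus the SSC plus two copies of the IR. The sketch below is only an illustration; the function name is hypothetical and does not come from the cited paper.

```python
# Hypothetical helper (not from the cited paper): total length of a circular
# chloroplast genome with a quadripartite structure, i.e. LSC + SSC + 2 * IR.
def quadripartite_length(lsc: int, ssc: int, ir: int) -> int:
    """Return the genome length in bp given the LSC, SSC and single-IR sizes."""
    return lsc + ssc + 2 * ir


# Region sizes (bp) as reported for F. x ananassa 'Benihoppe' in the abstract.
total = quadripartite_length(lsc=85_531, ssc=18_146, ir=25_936)
print(total)  # 155549, matching the reported genome length
```

    The same check can be applied to any other record that reports LSC, SSC and IR sizes.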

  10. Full-coverage film cooling: 3-dimensional measurements of turbulence structure and prediction of recovery region hydrodynamics

    NASA Technical Reports Server (NTRS)

    Yavuzkurt, S.; Moffat, R. J.; Kays, W. M.

    1979-01-01

    Hydrodynamic measurements were made with a triaxial hot-wire in the full-coverage region and the recovery region following an array of injection holes inclined downstream, at 30 degrees to the surface. The data were taken under isothermal conditions at ambient temperature and pressure for two blowing ratios: M = 0.9 and M = 0.4. Profiles of the three main velocity components and the six Reynolds stresses were obtained at several spanwise positions at each of the five locations down the test plate. A one-equation model of turbulence (using turbulent kinetic energy with an algebraic mixing length) was used in a two-dimensional computer program to predict the mean velocity and turbulent kinetic energy profiles in the recovery region. A new real-time hotwire scheme was developed to make measurements in the three-dimensional turbulent boundary layer over the full-coverage surface.

  11. Synonymous Mutations in the Core Gene Are Linked to Unusual Serological Profile in Hepatitis C Virus Infection

    PubMed Central

    Budkowska, Agata; Kakkanas, Athanassios; Nerrienet, Eric; Kalinina, Olga; Maillard, Patrick; Horm, Srey Viseth; Dalagiorgou, Geena; Vassilaki, Niki; Georgopoulou, Urania; Martinot, Michelle; Sall, Amadou Alpha; Mavromara, Penelope

    2011-01-01

    The biological role of the protein encoded by the alternative open reading frame (core+1/ARF) of the Hepatitis C virus (HCV) genome remains elusive, as does the significance of the production of corresponding antibodies in HCV infection. We investigated the prevalence of anti-core and anti-core+1/ARFP antibodies in HCV-positive blood donors from Cambodia, using peptide and recombinant protein-based ELISAs. We detected unusual serological profiles in 3 out of 58 HCV-positive plasma samples of genotype 1a. These patients were negative for anti-core antibodies by commercial and peptide-based assays using C-terminal fragments of core but reacted by Western Blot with full-length core protein. All three patients had high levels of anti-core+1/ARFP antibodies. Cloning of the cDNA that corresponds to the core-coding region from these sera resulted in the expression of both core and core+1/ARFP in mammalian cells. The core protein exhibited high amino-acid homology with a consensus HCV1a sequence. However, 10 identical synonymous mutations were found, and 7 were located in the aa(99–124) region of core. All mutations concerned the third base of a codon, and 5/10 represented a T>C mutation. Prediction analyses of the RNA secondary structure revealed conformational changes within the stem-loop region that contains the core+1/ARFP internal AUG initiator at position 85/87. Using the luciferase tagging approach, we showed that core+1/ARFP expression is more efficient from such a sequence than from the prototype HCV1a RNA. We provide additional evidence of the existence of core+1/ARFP in vivo and new data concerning expression of HCV core protein. We show that HCV patients who do not produce normal anti-core antibodies have unusually high levels of anti-core+1/ARFP and harbour several identical synonymous mutations in the core and core+1/ARFP coding region that result in major changes in predicted RNA structure.
Such HCV variants may favour core+1/ARFP production during HCV infection. PMID:21283512

  12. Complete Mitochondrial Genome of Echinostoma hortense (Digenea: Echinostomatidae).

    PubMed

    Liu, Ze-Xuan; Zhang, Yan; Liu, Yu-Ting; Chang, Qiao-Cheng; Su, Xin; Fu, Xue; Yue, Dong-Mei; Gao, Yuan; Wang, Chun-Ren

    2016-04-01

    Echinostoma hortense (Digenea: Echinostomatidae) is one of the intestinal flukes with medical importance in humans. However, the mitochondrial (mt) genome of this fluke has not been known yet. The present study has determined the complete mt genome sequences of E. hortense and assessed the phylogenetic relationships with other digenean species for which the complete mt genome sequences are available in GenBank using concatenated amino acid sequences inferred from 12 protein-coding genes. The mt genome of E. hortense contained 12 protein-coding genes, 22 transfer RNA genes, 2 ribosomal RNA genes, and 1 non-coding region. The length of the mt genome of E. hortense was 14,994 bp, which was somewhat smaller than those of other trematode species. Phylogenetic analyses based on concatenated nucleotide sequence datasets for all 12 protein-coding genes using maximum parsimony (MP) method showed that E. hortense and Hypoderaeum conoideum gathered together, and they were closer to each other than to Fasciolidae and other echinostomatid trematodes. The availability of the complete mt genome sequences of E. hortense provides important genetic markers for diagnostics, population genetics, and evolutionary studies of digeneans.

  13. Complete Mitochondrial Genome of Echinostoma hortense (Digenea: Echinostomatidae)

    PubMed Central

    Liu, Ze-Xuan; Zhang, Yan; Liu, Yu-Ting; Chang, Qiao-Cheng; Su, Xin; Fu, Xue; Yue, Dong-Mei; Gao, Yuan; Wang, Chun-Ren

    2016-01-01

    Echinostoma hortense (Digenea: Echinostomatidae) is one of the intestinal flukes with medical importance in humans. However, the mitochondrial (mt) genome of this fluke has not been known yet. The present study has determined the complete mt genome sequences of E. hortense and assessed the phylogenetic relationships with other digenean species for which the complete mt genome sequences are available in GenBank using concatenated amino acid sequences inferred from 12 protein-coding genes. The mt genome of E. hortense contained 12 protein-coding genes, 22 transfer RNA genes, 2 ribosomal RNA genes, and 1 non-coding region. The length of the mt genome of E. hortense was 14,994 bp, which was somewhat smaller than those of other trematode species. Phylogenetic analyses based on concatenated nucleotide sequence datasets for all 12 protein-coding genes using maximum parsimony (MP) method showed that E. hortense and Hypoderaeum conoideum gathered together, and they were closer to each other than to Fasciolidae and other echinostomatid trematodes. The availability of the complete mt genome sequences of E. hortense provides important genetic markers for diagnostics, population genetics, and evolutionary studies of digeneans. PMID:27180575

  14. Genome sequence of foot-and-mouth disease virus outside the 3A region is also responsible for virus replication in bovine cells.

    PubMed

    Ma, Xueqing; Li, Pinghua; Sun, Pu; Lu, Zengjun; Bao, Huifang; Bai, Xingwen; Fu, Yuanfang; Cao, Yimei; Li, Dong; Chen, Yingli; Qiao, Zilin; Liu, Zaixin

    2016-07-15

    The deletion of residues 93-102 in nonstructural protein 3A of foot-and-mouth disease virus (FMDV) is associated with the inability of FMDV to grow in bovine cells and with attenuated virulence in cattle. However, a previously reported FMDV strain, O/HKN/21/70, harboring the 93-102 deletion in the 3A protein grew equally well in bovine and swine cells. This suggests that changes in the FMDV genome sequence, in addition to the 93-102 deletion in 3A, may also affect the viral growth phenotype in bovine cells during infection and replication. However, it is unclear which region (inside or outside the 3A region) carries the changes that influence the FMDV growth phenotype in bovine cells. In this study, to determine the region in the FMDV genome affecting the viral growth phenotype in bovine cells, we constructed chimeric FMDVs, rvGZSB-HKN3A and rvHN-HKN3A, by introducing the 3A coding region of O/HKN/21/70 into the context of O/SEA/Mya-98 strain O/GZSB/2011 and O Cathay topotype strain O/HN/CHA/93, respectively, since O/GZSB/2011 containing full-length 3A protein replicated well in bovine and swine cells, and O/HN/CHA/93 harboring the 93-102 deletion in 3A protein grew poorly in bovine cells. The chimeric viruses rvGZSB-HKN3A and rvHN-HKN3A displayed growth properties and plaque phenotypes similar to those of the parental viruses rvGZSB and rv-HN in BHK-21 and primary fetal porcine kidney (FPK) cells. However, rvHN-HKN3A and rv-HN replicated poorly in primary fetal bovine kidney (FBK) cells with no visible plaques, and rvGZSB-HKN3A exhibited a lower growth rate and smaller plaque size than the parental virus in FBK cells, but growth properties and plaque phenotypes similar to those of the recombinant viruses harboring the 93-102 deletion in 3A. These results demonstrate that differences in the FMDV genome sequence outside the 3A coding region also influence FMDV replication ability in bovine cells. Copyright © 2016 Elsevier B.V. All rights reserved.

  15. Characterization of the cod (Gadus morhua) steroidogenic acute regulatory protein (StAR) sheds light on StAR gene structure in fish.

    PubMed

    Goetz, Frederick W; Norberg, Birgitta; McCauley, Linda A R; Iliev, Dimitar B

    2004-03-01

    The full-length cDNA for the cod (Gadus morhua) StAR was cloned by RT-PCR and library screening using ovarian RNA. From the library screening, 2 size classes of cDNA were obtained: a 1577 bp cDNA (cStAR1) and a 2851 bp cDNA (cStAR2). The cStAR1 cDNA presumably encodes a protein of 286 amino acids. The cStAR2 cDNA was composed of 6 separated sequences that contained all of the coding regions of cStAR1 when added together, but also contained 5 noncoding regions not observed in cStAR1. Polymerase chain reactions of cod genomic DNA produced products slightly larger than cStAR2. The sequences of these products were the same as cStAR2 but revealed one additional noncoding region (intron). Thus, the fish StAR gene contains the same number of exons (7) and introns (6) as observed in mammals, but is approximately half the size of the mammalian gene. Using Northern analysis and RT-PCR, cStAR1 expression was observed only in testes, ovaries and head kidneys. Polymerase chain reaction products were also observed using cDNA from steroidogenic tissues and primers designed to regions specific for cStAR2, indicating that cStAR2 is expressed in tissues and may account for the presence of larger transcripts observed on Northern blots.

  16. MODTOHAFSD — A GUI based JAVA code for gravity analysis of strike limited sedimentary basins by means of growing bodies with exponential density contrast-depth variation: A space domain approach

    NASA Astrophysics Data System (ADS)

    Chakravarthi, V.; Sastry, S. Rajeswara; Ramamma, B.

    2013-07-01

    Based on the principles of modeling and inversion, two interpretation methods are developed in the space domain along with a GUI based JAVA code, MODTOHAFSD, to analyze the gravity anomalies of strike limited sedimentary basins using a prescribed exponential density contrast-depth function. A stack of vertical prisms of equal width, each with its own limited strike length and thickness, describes the structure of a sedimentary basin above the basement complex. The thicknesses of the prisms represent the depths to the basement and are the unknown parameters to be estimated from the observed gravity anomalies. Forward modeling is realized in the space domain using a combination of analytical and numerical approaches. The algorithm estimates the initial depths of a sedimentary basin and improves them, iteratively, based on the differences between the observed and modeled gravity anomalies within the specified convergence criteria. The present code, which follows the Model-View-Controller (MVC) pattern, reads the Bouguer gravity anomalies, constructs/modifies the regional gravity background interactively, estimates residual gravity anomalies, and performs automatic modeling or inversion based on user specification for basement topography. Besides generating output in both ASCII and graphical forms, the code displays (i) the changes in the depth structure, (ii) the nature of fit between the observed and modeled gravity anomalies, (iii) changes in misfit, and (iv) variation of density contrast with iteration in animated forms. The code is used to analyze both synthetic and real field gravity anomalies. The proposed technique yielded information that is consistent with the assumed parameters in the case of the synthetic structure and with available drilling depths in the case of the field example.
The advantage of the code is that it can be used to analyze the gravity anomalies of sedimentary basins even when the profile along which the interpretation is intended fails to bisect the strike length.
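
    The iterative depth refinement described in the abstract above can be illustrated with a much simpler forward model. The sketch below is not the MODTOHAFSD algorithm: it assumes an infinite horizontal slab in place of the finite-strike prisms, and the function names, parameter values and convergence tolerance are all hypothetical. It only shows the general idea of updating a depth estimate from the misfit between observed and modeled anomalies under an exponential density contrast drho0 * exp(-lam * z).

```python
import math

G = 6.674e-11  # gravitational constant, m^3 kg^-1 s^-2


def slab_anomaly(depth_m: float, drho0: float, lam: float) -> float:
    """Anomaly (m/s^2) of an infinite slab of thickness depth_m whose density
    contrast decays with depth as drho0 * exp(-lam * z). Assumed forward model."""
    return 2.0 * math.pi * G * drho0 * (1.0 - math.exp(-lam * depth_m)) / lam


def invert_depth(g_obs: float, drho0: float, lam: float,
                 tol: float = 1e-12, max_iter: int = 50) -> float:
    """Refine a depth estimate until the modeled anomaly matches g_obs."""
    # Initial estimate: slab with constant density contrast drho0.
    depth = g_obs / (2.0 * math.pi * G * drho0)
    for _ in range(max_iter):
        misfit = g_obs - slab_anomaly(depth, drho0, lam)
        if abs(misfit) < tol:
            break
        # Correct the depth using the local sensitivity dg/dz at this depth.
        depth += misfit / (2.0 * math.pi * G * drho0 * math.exp(-lam * depth))
    return depth


# Synthetic test: a 2000 m deep basin with illustrative (assumed) parameters.
true_depth = 2000.0
g_synthetic = slab_anomaly(true_depth, drho0=-500.0, lam=5e-4)
recovered = invert_depth(g_synthetic, drho0=-500.0, lam=5e-4)
print(recovered)
```

    The sketch assumes the observed anomaly is attainable for the chosen density function; a real implementation would need the prism forward model and handling of noisy data.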

  17. Evolutionary Dynamics of Microsatellite Distribution in Plants: Insight from the Comparison of Sequenced Brassica, Arabidopsis and Other Angiosperm Species

    PubMed Central

    Shi, Jiaqin; Huang, Shunmou; Fu, Donghui; Yu, Jinyin; Wang, Xinfa; Hua, Wei; Liu, Shengyi; Liu, Guihua; Wang, Hanzhong

    2013-01-01

    Despite their ubiquity and functional importance, microsatellites have been largely ignored in comparative genomics, mostly due to the lack of genomic information. In the current study, microsatellite distribution was characterized and compared in the whole genomes and both the coding and non-coding DNA sequences of the sequenced Brassica, Arabidopsis and other angiosperm species to investigate their evolutionary dynamics in plants. The variation in the microsatellite frequencies of these angiosperm species was much smaller than those for their microsatellite numbers and genome sizes, suggesting that microsatellite frequency may be relatively stable in plants. The microsatellite frequencies of these angiosperm species were significantly negatively correlated with both their genome sizes and transposable elements contents. The pattern of microsatellite distribution may differ according to the different genomic regions (such as coding and non-coding sequences). The observed differences in many important microsatellite characteristics (especially the distribution with respect to motif length, type and repeat number) of these angiosperm species were generally accordant with their phylogenetic distance, which suggested that the evolutionary dynamics of microsatellite distribution may be generally consistent with plant divergence/evolution. Importantly, by comparing these microsatellite characteristics (especially the distribution with respect to motif type) the angiosperm species (aside from a few species) all clustered into two obviously different groups that were largely represented by monocots and dicots, suggesting a complex and generally dichotomous evolutionary pattern of microsatellite distribution in angiosperms. 
Polyploidy may lead to a slight increase in microsatellite frequency in the coding sequences and a significant decrease in microsatellite frequency in the whole genome/non-coding sequences, but has little effect on the microsatellite distribution with respect to motif length, type and repeat number. Interestingly, several microsatellite characteristics seemed to be constant in plant evolution, which can be well explained by general biological rules. PMID:23555856

  18. The poly(A) tail length of casein mRNA in the lactating mammary gland changes depending upon the accumulation and removal of milk.

    PubMed Central

    Kuraishi, T; Sun, Y; Aoki, F; Imakawa, K; Sakai, S

    2000-01-01

    The length of casein mRNA from the lactating mouse mammary gland, as assessed on Northern blots, is shorter after weaning, but is elongated following the removal of milk. In order to investigate this phenomenon, the molecular structures of beta- and gamma-casein mRNAs were analysed. The coding and non-coding regions of the two forms were the same length, but the long form of casein mRNA had a longer poly(A) tail than the short form (P<0.05). In order to examine the stability of casein mRNA under identical conditions, casein mRNAs with the long and short poly(A) tails were incubated in the rabbit reticulocyte lysate (RRL) cell-free translation system. Casein mRNA with the long poly(A) tail had a longer half-life than that with the short tail (P<0.05). The beta- and gamma-casein mRNAs were first degraded into 0.92 and 0.81 kb fragments respectively. With undegraded mRNA, the poly(A) tail shortening by exoribonuclease was not observed until the end of the incubation. Northern blot analysis showed that casein mRNA with the long poly(A) tail was protected efficiently from endoribonucleases. We conclude that the length of the poly(A) tail of casein mRNA in the lactating mammary gland changes depending upon the accumulation and removal of the gland's milk, and we show that the longer poly(A) tail potentially protects the mRNA from degradation by endoribonucleases. PMID:10749689

  19. LOOPREF: A Fluid Code for the Simulation of Coronal Loops

    NASA Technical Reports Server (NTRS)

    deFainchtein, Rosalinda; Antiochos, Spiro; Spicer, Daniel

    1998-01-01

    This report documents the code LOOPREF. LOOPREF is a semi-one-dimensional finite element code that is especially well suited to simulate coronal-loop phenomena. It has a full implementation of adaptive mesh refinement (AMR), which is crucial for this type of simulation. The AMR routines are an improved version of AMR1D. LOOPREF's versatility makes it suitable to simulate a wide variety of problems. In addition to efficiently providing very high resolution in rapidly changing regions of the domain, it is equipped to treat loops of variable cross section, any non-linear form of heat conduction, shocks, gravitational effects, and radiative loss.

  20. Characterization and phylogenetic analysis of the swine leukocyte antigen 3 gene from Korean native pigs.

    PubMed

    Chung, H Y; Choi, Y C; Park, H N

    2015-05-18

    We investigated the phylogenetic relationships between pig breeds, compared the genetic similarity between humans and pigs, and provided basic genetic information on Korean native pigs (KNPs), using genetic variants of the swine leukocyte antigen 3 (SLA-3) gene. Primers were based on sequences from GenBank (accession Nos. AF464010 and AF464009). Polymerase chain reaction analysis amplified approximately 1727 bp of segments, which contained 1086 bp of coding regions and 641 bp of the 3'- and 5'-untranslated regions. Bacterial artificial chromosome clones of miniature pigs were used for sequencing the SLA-3 genomic region, which was 3114 bp in total length, including the coding (1086 bp) and non-coding (2028 bp) regions. Sequence analysis detected 53 single nucleotide polymorphisms (SNPs), based on a minor allele frequency greater than 0.01, which is low compared with other pig breeds, and the results suggest that there is low genetic variability in KNPs. Comparative analysis revealed that humans possess approximately three times more genetic variation than do pigs. Approximately 71% of SNPs in exons 2 and 3 were detected in KNPs, and exon 5 in humans is a highly polymorphic region. Newly identified sequences of SLA-3 using KNPs were submitted to GenBank (accession No. DQ992512-18). Cluster analysis revealed that KNPs were grouped according to three major alleles: SLA-3*0502 (DQ992518), SLA-3*0302 (DQ992513 and DQ992516), and SLA-3*0303 (DQ992512, DQ992514, DQ992515, and DQ992517). Alignments revealed that humans have a relatively close genetic relationship with pigs and chimpanzees. The information provided by this study may be useful in KNP management.

  1. Complete mitochondrial genome of the versicoloured emerald hummingbird Amazilia versicolor, a polymorphic species.

    PubMed

    Prosdocimi, Francisco; Souto, Helena Magarinos; Ruschi, Piero Angeli; Furtado, Carolina; Jennings, W Bryan

    2016-09-01

    The genome of the versicoloured emerald hummingbird (Amazilia versicolor) was partially sequenced in one-sixth of an Illumina HiSeq lane. The mitochondrial genome was assembled using MIRA and MITObim software, yielding a circular molecule of 16,861 bp in length, which was deposited in GenBank under the accession number KF624601. The mitogenome contained 13 protein-coding genes, 22 transfer RNAs, 2 ribosomal RNAs and 1 non-coding control region. The molecule was assembled using 21,927 sequencing reads of 100 bp each, resulting in ∼130× coverage of uniformly distributed reads along the genome. This is the fourth mitochondrial genome described for this highly diverse family of birds and may benefit further phylogenetic, phylogeographic, population genetic and species delimitation studies of hummingbirds.
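
    The coverage figure quoted in the abstract above follows from the usual estimate of mean depth as (number of reads × read length) / genome length. The snippet below is a minimal sketch of that arithmetic; the function name is hypothetical, and it assumes every read aligns over its full length.

```python
# Hypothetical helper: mean sequencing depth, assuming every read maps to the
# genome over its full length.
def mean_coverage(n_reads: int, read_len_bp: int, genome_len_bp: int) -> float:
    return n_reads * read_len_bp / genome_len_bp


# Values reported in the abstract: 21,927 reads of 100 bp, 16,861 bp mitogenome.
coverage = mean_coverage(21_927, 100, 16_861)
print(round(coverage))  # 130
```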

  2. Next generation sequencing yields the complete mitochondrial genome of the Hornlip mullet Plicomugil labiosus (Teleostei: Mugilidae).

    PubMed

    Shen, Kang-Ning; Chen, Ching-Hung; Hsiao, Chung-Der

    2016-05-01

    In this study, the complete mitogenome sequence of the hornlip mullet Plicomugil labiosus (Teleostei: Mugilidae) was determined by next-generation sequencing. The assembled mitogenome, consisting of 16,829 bp, had the typical vertebrate mitochondrial gene arrangement, including 13 protein-coding genes, 22 transfer RNAs, 2 ribosomal RNA genes and a non-coding control region (D-loop). The D-loop, 1057 bp in length, is located between tRNA-Pro and tRNA-Phe. The overall base composition of P. labiosus is 28.0% for A, 29.3% for C, 15.5% for G and 27.2% for T. The complete mitogenome may provide essential DNA molecular data for further population, phylogenetic and evolutionary analysis of Mugilidae.

  3. Next generation sequencing yields the complete mitochondrial genome of the largescale mullet, Liza macrolepis (Teleostei: Mugilidae).

    PubMed

    Shen, Kang-Ning; Tsai, Shiou-Yi; Chen, Ching-Hung; Hsiao, Chung-Der; Durand, Jean-Dominique

    2016-11-01

    In this study, the complete mitogenome sequence of the largescale mullet (Teleostei: Mugilidae) was determined by next-generation sequencing. The assembled mitogenome, consisting of 16,832 bp, had the typical vertebrate mitochondrial gene arrangement, including 13 protein-coding genes, 22 transfer RNAs, two ribosomal RNA genes, and a non-coding control region (D-loop). The D-loop, 1094 bp in length, is located between tRNA-Pro and tRNA-Phe. The overall base composition of the largescale mullet is 27.8% for A, 30.1% for C, 16.2% for G, and 25.9% for T. The complete mitogenome may provide essential DNA molecular data for further phylogenetic and evolutionary analysis of Mugilidae.

  4. Complete mitochondrial genome of Yangtze River wild common carp (Cyprinus carpio haematopterus) and Russian scattered scale mirror carp (Cyprinus carpio carpio).

    PubMed

    Hu, Guang Fu; Liu, Xiang Jiang; Zou, Gui Wei; Li, Zhong; Liang, Hong-Wei; Hu, Shao-Na

    2016-01-01

    We sequenced the complete mitogenomes of Yangtze River wild common carp (Cyprinus carpio haematopterus) and Russian scattered scale mirror carp (Cyprinus carpio carpio). Comparison revealed that the mitogenomes of these two common carp strains were remarkably similar in genome length, gene order and content, and AT content. There were only 55 variable sites among 16,581 nucleotides: 1 in rRNAs, 2 in tRNAs, 9 in the control region and 43 in protein-coding genes. Furthermore, the 43 variable nucleotides in the protein-coding genes of the two strains led to four variable amino acids, which were located in the ND2, ATPase 6, ND5 and ND6 genes, respectively.

  5. The complete chloroplast genome of Sinopodophyllum hexandrum Ying (Berberidaceae).

    PubMed

    Meng, Lihua; Liu, Ruijuan; Chen, Jianbing; Ding, Chenxu

    2017-05-01

    The complete nucleotide sequence of the Sinopodophyllum hexandrum Ying chloroplast genome (cpDNA) was determined using next-generation sequencing technologies. The genome was 157 203 bp in length, containing a pair of inverted repeat (IRa and IRb) regions of 25 960 bp each, which were separated by a large single-copy (LSC) region of 87 065 bp and a small single-copy (SSC) region of 18 218 bp. The cpDNA contained 148 genes, including 96 protein-coding genes, 8 ribosomal RNA genes, and 44 tRNA genes. Of these genes, eight harbored a single intron, and two (ycf3 and clpP) contained two introns. The AT content of the S. hexandrum cpDNA is 61.5%.

  6. The complete chloroplast genome sequence of Euonymus japonicus (Celastraceae).

    PubMed

    Choi, Kyoung Su; Park, SeonJoo

    2016-09-01

    The complete chloroplast (cp) genome sequence of Euonymus japonicus, the first sequenced in the genus Euonymus, is reported in this study. The total length was 157 637 bp, containing a pair of 26 678 bp inverted repeat (IR) regions, which were separated by a small single-copy (SSC) region and a large single-copy (LSC) region of 18 340 bp and 85 941 bp, respectively. This genome contains 107 unique genes, including 74 coding genes, four rRNA genes, and 29 tRNA genes. Seventeen E. japonicus genes contain introns, of which three (clpP, ycf3, and rps12) include two introns. The maximum likelihood (ML) phylogenetic analysis revealed that E. japonicus was closely related to Manihot and Populus.

  7. The complete chloroplast genome of the Dendrobium strongylanthum (Orchidaceae: Epidendroideae).

    PubMed

    Li, Jing; Chen, Chen; Wang, Zhe-Zhi

    2016-07-01

    Complete chloroplast genome sequences are very useful for studying the phylogeny and evolution of species. In this study, the complete chloroplast genome of Dendrobium strongylanthum was constructed from whole-genome Illumina sequencing data. The chloroplast genome is 153 058 bp in length with 37.6% GC content and consists of two inverted repeats (IRs) of 26 316 bp each. The IR regions are separated by a large single-copy (LSC, 85 836 bp) region and a small single-copy (SSC, 14 590 bp) region. A total of 130 chloroplast genes were successfully annotated, including 84 protein-coding genes, 38 tRNA genes, and eight rRNA genes. Phylogenetic analyses showed that the chloroplast genome of Dendrobium strongylanthum is related to that of Dendrobium officinale.

  8. Badh2, Encoding Betaine Aldehyde Dehydrogenase, Inhibits the Biosynthesis of 2-Acetyl-1-Pyrroline, a Major Component in Rice Fragrance

    PubMed Central

    Chen, Saihua; Yang, Yi; Shi, Weiwei; Ji, Qing; He, Fei; Zhang, Ziding; Cheng, Zhukuan; Liu, Xiangnong; Xu, Mingliang

    2008-01-01

    In rice (Oryza sativa), the presence of a dominant Badh2 allele encoding betaine aldehyde dehydrogenase (BADH2) inhibits the synthesis of 2-acetyl-1-pyrroline (2AP), a potent flavor component in rice fragrance. By contrast, its two recessive alleles, badh2-E2 and badh2-E7, induce 2AP formation. Badh2 was found to be transcribed in all tissues tested except for roots, and the transcript was detected at higher abundance in young, healthy leaves than in other tissues. Multiple Badh2 transcript lengths were detected, and the complete, full-length Badh2 transcript was much less abundant than partial Badh2 transcripts. 2AP levels were significantly reduced in cauliflower mosaic virus 35S-driven transgenic lines expressing the complete, but not the partial, Badh2 coding sequences. In accordance, the intact, full-length BADH2 protein (503 residues) appeared exclusively in nonfragrant transgenic lines and rice varieties. These results indicate that the full-length BADH2 protein encoded by Badh2 renders rice nonfragrant by inhibiting 2AP biosynthesis. The BADH2 enzyme was predicted to contain three domains: NAD binding, substrate binding, and oligomerization domains. BADH2 was distributed throughout the cytoplasm, where it is predicted to catalyze the oxidization of betaine aldehyde, 4-aminobutyraldehyde (AB-ald), and 3-aminopropionaldehyde. The presence of null badh2 alleles resulted in AB-ald accumulation and enhanced 2AP biosynthesis. In summary, these data support the hypothesis that BADH2 inhibits 2AP biosynthesis by exhausting AB-ald, a presumed 2AP precursor. PMID:18599581

  9. The complete sequence of the mitochondrial genome of Arctic fox (Alopex lagopus).

    PubMed

    Yan, Shou-Qing; Guo, Peng-Cheng; Yue, Yuan; Li, Wan-Hong; Bai, Chun-Yan; Li, Yu-Mei; Sun, Jin-Hai; Zhao, Zhi-Hui

    2016-11-01

    In the present study, the complete mitochondrial genome sequence of Arctic fox (Alopex lagopus) was determined for the first time. It has a total length of 16,656 bp, and contains 13 protein-coding genes, 22 tRNA genes, 2 ribosomal RNA genes and 1 control region. The nucleotide composition is 31.3% A, 26.2% C, 14.8% G and 27.7% T. The D-loop region located between tRNA-Pro and tRNA-Phe contains a (ACACGTACACGCAT)18 tandem repeat array. The data will be useful for the investigation of the genetic structure and diversity in the natural and farmed populations of Arctic foxes.

  10. Increased length of inpatient stay and poor clinical coding: audit of patients with diabetes.

    PubMed

    Daultrey, Harriet; Gooday, Catherine; Dhatariya, Ketan

    2011-11-01

    People with diabetes stay in hospital for longer than those without diabetes for similar conditions. Clinical coding is poor across all specialties. Inpatients with diabetes often have unrecognized foot problems. We wanted to look at the relationships between these factors. A single day audit, looking at the prevalence of diabetes in all adult inpatients. Also looking at their feet to find out how many were high-risk or had existing problems. A 998-bed university teaching hospital. All adult inpatients. (a) To see if patients with diabetes and foot problems were in hospital for longer than the national average length of stay compared with national data; (b) to see if there were people in hospital with acute foot problems who were not known to the specialist diabetic foot team; and (c) to assess the accuracy of clinical coding. We identified 110 people with diabetes. However, discharge coding data for inpatients on that day showed 119 people with diabetes. Length of stay (LOS) was substantially higher for those with diabetes compared to those without (± SD) at 22.39 (22.26) days, vs. 11.68 (6.46) (P < 0.001). Finally, clinical coding was poor with some people who had been identified as having diabetes on the audit, who were not coded as such on discharge. Clinical coding - which is dependent on discharge summaries - poorly reflects diagnoses. Additionally, length of stay is significantly longer than previous estimates. The discrepancy between coding and diagnosis needs addressing by increasing the levels of awareness and education of coders and physicians. We suggest that our data be used by healthcare planners when deciding on future tariffs.

  11. Increased length of inpatient stay and poor clinical coding: audit of patients with diabetes

    PubMed Central

    Daultrey, Harriet; Gooday, Catherine; Dhatariya, Ketan

    2011-01-01

    Objectives: People with diabetes stay in hospital for longer than those without diabetes for similar conditions. Clinical coding is poor across all specialties. Inpatients with diabetes often have unrecognized foot problems. We wanted to look at the relationships between these factors. Design: A single day audit, looking at the prevalence of diabetes in all adult inpatients. Also looking at their feet to find out how many were high-risk or had existing problems. Setting: A 998-bed university teaching hospital. Participants: All adult inpatients. Main outcome measures: (a) To see if patients with diabetes and foot problems were in hospital for longer than the national average length of stay compared with national data; (b) to see if there were people in hospital with acute foot problems who were not known to the specialist diabetic foot team; and (c) to assess the accuracy of clinical coding. Results: We identified 110 people with diabetes. However, discharge coding data for inpatients on that day showed 119 people with diabetes. Length of stay (LOS) was substantially higher for those with diabetes compared to those without (± SD) at 22.39 (22.26) days, vs. 11.68 (6.46) (P < 0.001). Finally, clinical coding was poor with some people who had been identified as having diabetes on the audit, who were not coded as such on discharge. Conclusion: Clinical coding, which is dependent on discharge summaries, poorly reflects diagnoses. Additionally, length of stay is significantly longer than previous estimates. The discrepancy between coding and diagnosis needs addressing by increasing the levels of awareness and education of coders and physicians. We suggest that our data be used by healthcare planners when deciding on future tariffs. PMID:22140609

  12. Short initial length quench on CICC of ITER TF coils

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Nicollet, S.; Ciazynski, D.; Duchateau, J.-L.

    Previous quench studies performed for the International Thermonuclear Experimental Reactor (ITER) Toroidal Field (TF) Coils have identified two extreme families of quench: first, 'severe' quenches over long initial lengths in high magnetic field, and second, smooth quenches over short initial lengths in the low-field region. Detailed analyses and results on smooth quench propagation and detectability on one TF Cable In Conduit Conductor (CICC) with a lower propagation velocity are presented here. The influence of the initial quench energy is shown and results of computations with either a Fast Discharge (FD) of the magnet or without (failure of the voltage quench detection system) are reported. The influence of the central spiral of the conductor on the propagation velocity is also detailed. In the cases of a regularly triggered FD, the hot spot temperature criterion of 150 K (with helium and jacket) is fulfilled for an initial quench length of 1 m, whereas this criterion is exceeded (Tmax ≈ 200 K) for an extremely short length of 5 cm. These analyses were carried out using both the Supermagnet™ and Venecia codes and the comparisons of the results are also discussed.

  13. Leptin and leptin receptor gene polymorphisms are correlated with production performance in the Arctic fox.

    PubMed

    Zhang, M; Bai, X J

    2015-05-25

    The polymerase chain reaction-single-strand conformation polymorphism technique was employed to measure mononucleotide diversity in the coding region of the leptin and leptin receptor genes in the Arctic fox. The relationships between specific genetic mutations and reproductive performance in Arctic foxes were determined to improve breeding strategies. We found that a leptin gene polymorphism was significantly associated with body weight (P < 0.01), abdominal circumference (P < 0.01), and fur length (P < 0.01). Furthermore, a polymorphism in the leptin receptor gene was associated with carcass weight and guard hair length (P < 0.01). Leptin and leptin receptor gene combinatorial genotypes were significantly associated with abdominal circumference, fur length (P < 0.01), and body weight (P < 0.05). The leptin gene is thus a key gene affecting body weight, abdominal circumference, and fur length in Arctic foxes, whereas variations in the leptin receptor mainly affect carcass weight and guard hair. The marker loci identified in this study can be used to assist in the selection of Arctic foxes for breeding to raise the production performance of this species.

  14. Characterization and antifungal properties of wheat nonspecific lipid transfer proteins.

    PubMed

    Sun, Jin-Yue; Gaudet, Denis A; Lu, Zhen-Xiang; Frick, Michele; Puchalski, Byron; Laroche, André

    2008-03-01

    This study simultaneously considered the phylogeny, fatty acid binding ability, and fungal toxicity of a large number of monocot nonspecific lipid transfer proteins (ns-LTP). Nine novel full-length wheat ns-LTP1 clones, all possessing coding sequences of 348 bp, isolated from abiotic- and biotic-stressed cDNA libraries from aerial tissues, exhibited highly conserved coding regions with 78 to 99 and 71 to 100% identity at the nucleotide and amino acid levels, respectively. Phylogenetic analyses revealed two major ns-LTP families in wheat. Eight wheat ns-LTP genes from different clades were cloned into the expression vector pPICZalpha and transformed into Pichia pastoris. Sodium dodecyl sulfate polyacrylamide gel electrophoresis, Western blotting, and in vitro lipid binding activity assay confirmed that the eight ns-LTP were all successfully expressed and capable of in vitro binding fatty acid molecules. A comparative in vitro study on the toxicity of eight wheat ns-LTP to mycelium growth or spore germination of eight wheat pathogens and three nonwheat pathogens revealed differential toxicities among different ns-LTP. Values indicating 50% inhibition of fungal growth or spore germination of three selected ns-LTP against six fungi ranged from 1 to 7 microM. In vitro lipid-binding activity of ns-LTP was not correlated with their antifungal activity. Using the fluorescent probe SYTOX Green as an indicator of fungal membrane integrity, the in vitro toxicity of wheat ns-LTP was associated with alteration in permeability of fungal membranes.

  15. Complete mitochondrial genome of the giant ramshorn snail Marisa cornuarietis (Gastropoda: Ampullariidae).

    PubMed

    Wang, Mingling; Qiu, Jian-Wen

    2016-05-01

    We report the complete mitochondrial genome (mitogenome) of the giant ramshorn snail Marisa cornuarietis, a biocontrol agent of freshwater weeds and snail vectors of schistosomes. The mitogenome is 15,923 bp in length, encoding 13 protein-coding genes, 22 transfer RNAs and 2 ribosomal RNAs. The mitogenome is A+T biased (70.0%), with 28.9% A, 41.1% T, 16.7% G, and 13.3% C. A comparison with Pomacea canaliculata, the other member in the same family (Ampullariidae) with a sequenced mitogenome, shows that the two species have an identical gene order, but their intergenic regions vary substantially in sequence length. The mitogenome data can be used to understand the population genetics of M. cornuarietis, and resolve the phylogenetic relationship of various genera in Ampullariidae.

  16. FindGDPs: fast identification of primers for labeling microbial transcriptomes for DNA microarray analysis

    PubMed Central

    Blick, Robert J.; Revel, Andrew T.; Hansen, Eric J.

    2008-01-01

    Summary: FindGDPs is a program that uses a greedy algorithm to quickly identify a set of genome-directed primers that specifically anneal to all of the open reading frames in a genome and that do not exhibit full-length complementarity to the members of another user-supplied set of nucleotide sequences. Availability: The program code is distributed under the GNU General Public License at http://www8.utsouthwestern.edu/utsw/cda/dept131456/files/159331.html Contact: eric.hansen@utsouthwestern.edu PMID:15593406
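
    The abstract does not describe FindGDPs' internals, so the following is only a minimal sketch of how a greedy primer selection of this general kind could look, not the program's actual method. It assumes fixed-length primers (the parameter k is hypothetical), models "annealing" as an exact reverse-complement match to a site in an ORF, and treats any site that also occurs in the excluded sequences as off limits. The function names greedy_primers and reverse_complement are invented for illustration.

```python
from collections import defaultdict

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA string over A, C, G, T."""
    return seq.translate(_COMPLEMENT)[::-1]


def greedy_primers(orfs, excluded, k=7):
    """Greedy set cover: pick primers until every ORF has a binding site.

    Assumption (not from the source): a primer binds an ORF when the ORF
    contains the primer's reverse complement as an exact k-mer.
    Returns (primers, indices_of_orfs_left_uncovered).
    """
    # Sites that would make a primer fully complementary to an excluded sequence.
    banned = {s[i:i + k] for s in excluded for i in range(len(s) - k + 1)}

    # Map every allowed site to the set of ORFs that contain it.
    site_to_orfs = defaultdict(set)
    for idx, orf in enumerate(orfs):
        for i in range(len(orf) - k + 1):
            site = orf[i:i + k]
            if site not in banned:
                site_to_orfs[site].add(idx)

    uncovered = set(range(len(orfs)))
    primers = []
    while uncovered:
        # Greedy step: the site covering the most still-uncovered ORFs
        # (ties broken alphabetically because the sites are sorted).
        best = max(sorted(site_to_orfs),
                   key=lambda s: len(site_to_orfs[s] & uncovered),
                   default=None)
        if best is None or not (site_to_orfs[best] & uncovered):
            break  # the remaining ORFs have no allowed site
        primers.append(reverse_complement(best))
        uncovered -= site_to_orfs[best]
    return primers, uncovered


# Toy data, invented for illustration.
toy_orfs = ["ATGCAAA", "CCATGCT", "GGGTTTT"]
toy_excluded = ["TTTT"]
toy_primers, toy_uncovered = greedy_primers(toy_orfs, toy_excluded, k=4)
print(toy_primers, toy_uncovered)  # prints: ['GCAT', 'ACCC'] set()
```

    In the toy run, the site ATGC is shared by the first two ORFs, so a single primer covers both, and the third ORF gets its own primer from a site that avoids the excluded TTTT. A greedy choice like this is fast but is not guaranteed to return the smallest possible primer set.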

  17. A Very Efficient Transfer Function Bounding Technique on Bit Error Rate for Viterbi Decoded, Rate 1/N Convolutional Codes

    NASA Technical Reports Server (NTRS)

    Lee, P. J.

    1984-01-01

    For rate 1/N convolutional codes, a recursive algorithm for finding the transfer function bound on bit error rate (BER) at the output of a Viterbi decoder is described. This technique is very fast and requires very little storage since all the unnecessary operations are eliminated. Using this technique, we find and plot bounds on the BER performance of known codes of rate 1/2 with K 18, rate 1/3 with K 14. When more than one reported code with the same parameter is known, we select the code that minimizes the required signal to noise ratio for a desired bit error rate of 0.000001. This criterion of determining goodness of a code had previously been found to be more useful than the maximum free distance criterion and was used in the code search procedures of very short constraint length codes. This very efficient technique can also be used for searches of longer constraint length codes.

  18. Recombinant lactoferrin (Lf) of Vechur cow, the critical breed of Bos indicus and the Lf gene variants.

    PubMed

    Anisha, Shashidharan; Bhasker, Salini; Mohankumar, Chinnamma

    2012-03-01

    Vechur cow, categorized as a critically maintained breed by the FAO, is a unique breed of Bos indicus due to its extremely small size, less fodder intake, adaptability, easy domestication and traditional medicinal property of the milk. Lactoferrin (Lf) is an iron-binding glycoprotein that is found predominantly in the milk of mammals. The full coding region of Lf gene of Vechur cow was cloned, sequenced and expressed in a prokaryotic system. Antibacterial activity of the recombinant Lf showed suppression of bacterial growth. To the best of our knowledge this is the first time that the full coding region of Lf gene of B. indicus Vechur breed is sequenced, successfully expressed in a prokaryotic system and characterized. Comparative analysis of Lf gene sequence of five Vechur cows with B. taurus revealed 15 SNPs in the exon region associated with 11 amino acid substitutions. The amino acid arginine was noticed as a pronounced substitution and the tertiary structure analysis of the BLfV protein confirmed the positions of arginine in the β sheet region, random coil and helix region 1. Based on the recent reports on the nutritional therapies of arginine supplementation for wound healing and for cardiovascular diseases, the higher level of arginine in the lactoferrin protein of Vechur cow milk provides enormous scope for further therapeutic studies. Copyright © 2011 Elsevier B.V. All rights reserved.

  19. Reynolds-Averaged Navier-Stokes Solutions to Flat Plate Film Cooling Scenarios

    NASA Technical Reports Server (NTRS)

    Johnson, Perry L.; Shyam, Vikram; Hah, Chunill

    2011-01-01

    The predictions of several Reynolds-Averaged Navier-Stokes solutions for a baseline film cooling geometry are analyzed and compared with experimental data. The Fluent finite volume code was used to perform the computations with the realizable k-epsilon turbulence model. The film hole was angled at 35° to the crossflow with a Reynolds number of 17,400. Multiple length-to-diameter ratios (1.75 and 3.5) as well as momentum flux ratios (0.125 and 0.5) were simulated with various domains, boundary conditions, and grid refinements. The coolant to mainstream density ratio was maintained at 2.0 for all scenarios. Computational domain and boundary condition variations show the ability to reduce the computational cost as compared to previous studies. A number of grid refinement and coarsening variations are compared for further insights into the reduction of computational cost. Liberal refinement in the near hole region is valuable, especially for higher momentum jets that tend to lift-off and create a recirculating flow. A lack of proper refinement in the near hole region can severely diminish the accuracy of the solution, even in the far region. The effects of momentum ratio and hole length-to-diameter ratio are also discussed.

  20. Steady-state sulfur critical loads and exceedances for protection of aquatic ecosystems in the U.S. Southern Appalachian Mountains.

    PubMed

    McDonnell, Todd C; Sullivan, Timothy J; Hessburg, Paul F; Reynolds, Keith M; Povak, Nicholas A; Cosby, Bernard J; Jackson, William; Salter, R Brion

    2014-12-15

    Atmospherically deposited sulfur (S) causes stream water acidification throughout the eastern U.S. Southern Appalachian Mountain (SAM) region. Acidification has been linked with reduced fitness and richness of aquatic species and changes to benthic communities. Maintaining acid-base chemistry that supports native biota depends largely on balancing acidic deposition with the natural resupply of base cations. Stream water acid neutralizing capacity (ANC) is maintained by base cations that mostly originate from weathering of surrounding lithologies. When ambient atmospheric S deposition exceeds the critical load (CL) an ecosystem can tolerate, stream water chemistry may become lethal to biota. This work links statistical predictions of ANC and base cation weathering for streams and watersheds of the SAM region with a steady-state model to estimate CLs and exceedances. Results showed that 20.1% of the total length of study region streams displayed ANC <100 μeq∙L(-1), a level at which effects to biota may be anticipated; most were 4th or lower order streams. Nearly one-third of the stream length within the study region exhibited CLs of S deposition <50 meq∙m(-2)∙yr(-1), which is less than the regional average S deposition of 60 meq∙m(-2)∙yr(-1). Owing to their geologic substrates, relatively high elevation, and cool and moist forested conditions, the percentage of stream length in exceedance was highest for mountain wilderness areas and in national parks, and lowest for privately owned valley bottom land. Exceedance results were summarized by 12-digit hydrologic unit code (subwatershed) for use in developing management goals and policy objectives, and for long-term monitoring. Copyright © 2014 Elsevier Ltd. All rights reserved.

  1. Complete mitochondrial genome sequences of the northern spotted owl (Strix occidentalis caurina) and the barred owl (Strix varia; Aves: Strigiformes: Strigidae) confirm the presence of a duplicated control region

    PubMed Central

    Henderson, James B.; Sellas, Anna B.; Fuchs, Jérôme; Bowie, Rauri C.K.; Dumbacher, John P.

    2017-01-01

    We report here the successful assembly of the complete mitochondrial genomes of the northern spotted owl (Strix occidentalis caurina) and the barred owl (S. varia). We utilized sequence data from two sequencing methodologies, Illumina paired-end sequence data with insert lengths ranging from approximately 250 nucleotides (nt) to 9,600 nt and read lengths from 100–375 nt and Sanger-derived sequences. We employed multiple assemblers and alignment methods to generate the final assemblies. The circular genomes of S. o. caurina and S. varia are comprised of 19,948 nt and 18,975 nt, respectively. Both code for two rRNAs, twenty-two tRNAs, and thirteen polypeptides. They both have duplicated control region sequences with complex repeat structures. We were not able to assemble the control regions solely using Illumina paired-end sequence data. By fully spanning the control regions, Sanger-derived sequences enabled accurate and complete assembly of these mitochondrial genomes. These are the first complete mitochondrial genome sequences of owls (Aves: Strigiformes) possessing duplicated control regions. We searched the nuclear genome of S. o. caurina for copies of mitochondrial genes and found at least nine separate stretches of nuclear copies of gene sequences originating in the mitochondrial genome (Numts). The Numts ranged from 226–19,522 nt in length and included copies of all mitochondrial genes except tRNAPro, ND6, and tRNAGlu. Strix occidentalis caurina and S. varia exhibited an average of 10.74% (8.68% uncorrected p-distance) divergence across the non-tRNA mitochondrial genes. PMID:29038757

  2. Neutron transport analysis for nuclear reactor design

    DOEpatents

    Vujic, Jasmina L.

    1993-01-01

    Replacing regular mesh-dependent ray tracing modules in a collision/transfer probability (CTP) code with a ray tracing module based upon combinatorial geometry of a modified geometrical module (GMC) provides a general geometry transfer theory code in two dimensions (2D) for analyzing nuclear reactor design and control. The primary modification of the GMC module involves generation of a fixed inner frame and a rotating outer frame, where the inner frame contains all reactor regions of interest, e.g., part of a reactor assembly, an assembly, or several assemblies, and the outer frame, with a set of parallel equidistant rays (lines) attached to it, rotates around the inner frame. The modified GMC module allows for determining for each parallel ray (line), the intersections with zone boundaries, the path length between the intersections, the total number of zones on a track, the zone and medium numbers, and the intersections with the outer surface, which parameters may be used in the CTP code to calculate collision/transfer probability and cross-section values.
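
    To illustrate the idea of a rotating frame of parallel equidistant rays, the sketch below traces such rays through circular zones and records each ray's intersections and path lengths. It is a simplified illustration under stated assumptions, not the patented GMC module: zones are modeled as non-overlapping circles, and the function name trace_parallel_rays and its parameters (phi, spacing, half_width) are hypothetical.

```python
import math


def trace_parallel_rays(circles, phi, spacing, half_width):
    """Trace parallel equidistant rays at angle phi through circular zones.

    circles: list of (cx, cy, r) tuples, one per zone (assumed non-overlapping).
    Each ray is the line t*n + s*u, where u is the ray direction,
    n is its normal, and t is the ray's signed offset from the origin.
    Returns a list of (t, segments); each segment is (zone, s_in, s_out).
    """
    ux, uy = math.cos(phi), math.sin(phi)   # ray direction
    nx, ny = -uy, ux                        # unit normal to the rays
    n_rays = int(round(2 * half_width / spacing))
    tracks = []
    for j in range(n_rays):
        t = -half_width + (j + 0.5) * spacing  # offset of ray j (midpoint rule)
        segments = []
        for zone, (cx, cy, r) in enumerate(circles):
            d = t - (cx * nx + cy * ny)        # signed distance: center to ray
            if abs(d) < r:                     # the ray crosses this zone
                half = math.sqrt(r * r - d * d)
                s_mid = cx * ux + cy * uy      # center projected onto the ray
                segments.append((zone, s_mid - half, s_mid + half))
        segments.sort(key=lambda seg: seg[1])  # order along the ray
        tracks.append((t, segments))
    return tracks


def zone_area_estimate(tracks, zone, spacing):
    """Estimate a zone's area as (sum of path lengths in the zone) * spacing."""
    return spacing * sum(s_out - s_in
                         for _, segs in tracks
                         for z, s_in, s_out in segs if z == zone)
```

    A common sanity check for this kind of tracking is that the summed path lengths times the ray spacing approximate each zone's area regardless of the frame angle, for example `zone_area_estimate(trace_parallel_rays([(0.5, 0.2, 1.0)], 1.0, 0.001, 2.0), 0, 0.001)` should be close to π.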

  3. Neutron transport analysis for nuclear reactor design

    DOEpatents

    Vujic, J.L.

    1993-11-30

    Replacing regular mesh-dependent ray tracing modules in a collision/transfer probability (CTP) code with a ray tracing module based upon combinatorial geometry of a modified geometrical module (GMC) provides a general geometry transfer theory code in two dimensions (2D) for analyzing nuclear reactor design and control. The primary modification of the GMC module involves generation of a fixed inner frame and a rotating outer frame, where the inner frame contains all reactor regions of interest, e.g., part of a reactor assembly, an assembly, or several assemblies, and the outer frame, with a set of parallel equidistant rays (lines) attached to it, rotates around the inner frame. The modified GMC module allows for determining for each parallel ray (line), the intersections with zone boundaries, the path length between the intersections, the total number of zones on a track, the zone and medium numbers, and the intersections with the outer surface, which parameters may be used in the CTP code to calculate collision/transfer probability and cross-section values. 28 figures.

  4. On decoding of multi-level MPSK modulation codes

    NASA Technical Reports Server (NTRS)

    Lin, Shu; Gupta, Alok Kumar

    1990-01-01

    The decoding problem of multi-level block modulation codes is investigated. The hardware design of soft-decision Viterbi decoder for some short length 8-PSK block modulation codes is presented. An effective way to reduce the hardware complexity of the decoder by reducing the branch metric and path metric, using a non-uniform floating-point to integer mapping scheme, is proposed and discussed. The simulation results of the design are presented. The multi-stage decoding (MSD) of multi-level modulation codes is also investigated. The cases of soft-decision and hard-decision MSD are considered and their performance is evaluated for several codes of different lengths and different minimum squared Euclidean distances. It is shown that the soft-decision MSD reduces the decoding complexity drastically and it is suboptimum. The hard-decision MSD further simplifies the decoding while still maintaining a reasonable coding gain over the uncoded system, if the component codes are chosen properly. Finally, some basic 3-level 8-PSK modulation codes using BCH codes as component codes are constructed and their coding gains are found for hard decision multistage decoding.

  5. Comparison of calculated and measured velocities near the tip of a model rotor blade at transonic speeds

    NASA Technical Reports Server (NTRS)

    Tauber, M. E.; Owen, F. K.; Langhi, R. G.; Palmer, G. E.

    1985-01-01

    The ability of the ROT22 code to predict accurately the transonic flow field in the crucial region around and beyond the tip of a high speed rotor blade was assessed. The computations were compared with extensive laser velocimetry measurements made at zero advance ratio and tip Mach numbers of 0.85, 0.88, 0.90, and 0.95. The comparison between theory and experiment was made using 300 scans for the three orthogonal velocity components covering a volume having a height of over one blade chord, a width of nearly two chords, and a length ranging from about 1 to 1.6 chords, depending on the tip speeds. The good agreement between the calculated and measured velocities established the ability of the code to predict the off blade flow field at high tip speeds. This supplements previous comparisons where surface pressures were shown to be well predicted on two different tips at advance ratios to 0.45, especially at the critical 90 deg azimuth blade position. These results demonstrate that the ROT22 code can be used with confidence to predict the important tip region flow field including the occurrence, strength, and location of shock waves causing high drag and noise.

  6. Analysis of variable sites between two complete South China tiger (Panthera tigris amoyensis) mitochondrial genomes.

    PubMed

    Zhang, Wenping; Yue, Bisong; Wang, Xiaofang; Zhang, Xiuyue; Xie, Zhong; Liu, Nonglin; Fu, Wenyuan; Yuan, Yaohua; Chen, Daqing; Fu, Danghua; Zhao, Bo; Yin, Yuzhong; Yan, Xiahui; Wang, Xinjing; Zhang, Rongying; Liu, Jie; Li, Maoping; Tang, Yao; Hou, Rong; Zhang, Zhihe

    2011-10-01

    In order to investigate the mitochondrial genome of Panthera tigris amoyensis, two South China tigers (P25 and P27) were analyzed following 15 cymt-specific primer sets. The entire mtDNA sequence was found to be 16,957 bp and 17,001 bp long for P25 and P27 respectively, and this difference in length between P25 and P27 occurred in the number of tandem repeats in the RS-3 segment of the control region. The structural characteristics of complete P. t. amoyensis mitochondrial genomes were also highly similar to those of P. uncia. Additionally, the rate of point mutation was only 0.3% and a total of 59 variable sites between P25 and P27 were found. Out of the 59 variable sites, 6 were located in 6 different tRNA genes, 6 in the 2 rRNA genes, 7 in non-coding regions (one located between tRNA-Asn and tRNA-Tyr and six in the D-loop), and 40 in 10 protein-coding genes. COI held the largest amount of variable sites (9 sites) and Cytb contained the highest variable rate (0.7%) in the complete sequences. Moreover, out of the 40 variable sites located in 10 protein-coding genes, 12 sites were nonsynonymous.

  7. Simulation of Different Truncated p16INK4a Forms and In Silico Study of Interaction with Cdk4

    PubMed Central

    Fahham, Najmeh; Ghahremani, Mohammad Hossein; Sardari, Soroush; Vaziri, Behrouz; Ostad, Seyed Nasser

    2008-01-01

    Protein-protein interaction studies can greatly increase the amount of structural and functional information pertaining to biologically active molecules and processes. The information obtained from such studies can lead to design and application of new modifications in order to obtain a desired bioactivity. Many application packages and servers performing docking, such as HEX, DOT, AUTODOCK, and ZDOCK are now available for predicting the lowest free energy state of a protein complex. In this study, we have focused on cyclin-dependent kinase 4 (Cdk4), a key molecule in the regulation of cell cycle progression at the G1-S phase restriction point and p16INK4a, a tumor suppressor which inhibits Cdk4 activity. Truncated structures were created to find the more critical regions of p16 for interaction. The tertiary structures were determined by ProSAL, GENO3D Web Server. We evaluated their interactions with Cdk4 using two docking systems, HEX 4.5 and DOT 1. Calculations were performed on a high-speed computer. Minimizations and visualizations were carried out by PdbViewer 3.7. Considering shape and shape/electrostatic total energy, structures containing ANK II, III and IV motifs that lack the N-terminal region of the full-length p16 molecule showed the best fit complexes among the p16 truncated forms. The free energies were compatible with that of the original full-length form of p16. It seems that the N-terminal of the molecule is not crucial for the interaction since the truncated structure containing only this region did not show a good total energy. PMID:19352455

  8. Liner Optimization Studies Using the Ducted Fan Noise Prediction Code TBIEM3D

    NASA Technical Reports Server (NTRS)

    Dunn, M. H.; Farassat, F.

    1998-01-01

    In this paper we demonstrate the usefulness of the ducted fan noise prediction code TBIEM3D as a liner optimization design tool. Boundary conditions on the interior duct wall allow for hard walls or a locally reacting liner with axially segmented, circumferentially uniform impedance. Two liner optimization studies are considered in which farfield noise attenuation due to the presence of a liner is maximized by adjusting the liner impedance. In the first example, the dependence of optimal liner impedance on frequency and liner length is examined. Results show that both the optimal impedance and attenuation levels are significantly influenced by liner length and frequency. In the second example, TBIEM3D is used to compare radiated sound pressure levels between optimal and non-optimal liner cases at conditions designed to simulate take-off. It is shown that significant noise reduction is achieved for most of the sound field by selecting the optimal or near optimal liner impedance. Our results also indicate that there is a relatively large region of the impedance plane over which optimal or near optimal liner behavior is attainable. This is an important conclusion for the designer since there are variations in liner characteristics due to manufacturing imprecisions.

  9. Theoretical analysis and simulation of the influence of self-bunching effects and longitudinal space charge effects on the propagation of keV electron bunch produced by a novel S-band Micro-Pulse electron Gun

    NASA Astrophysics Data System (ADS)

    Zhao, Jifei; Lu, Xiangyang; Zhou, Kui; Yang, Ziqin; Yang, Deyu; Luo, Xing; Tan, Weiwei; Yang, Yujia

    2016-06-01

    The Micro-Pulse electron Gun (MPG), which can steadily produce electron bunches with high average current, short pulses, and low emittance, holds promise as an electron source for Coherent Smith-Purcell Radiation (CSPR) and Free Electron Lasers (FEL). Stable output of the S-band MPG has been achieved in many labs. To establish a reliable foundation for its future application, the propagation of the picosecond electron bunch produced by the MPG should be studied in detail. In this article, an MPG working on the rising stage of the total effective Secondary Electron Yield (SEY) curve is introduced. The self-bunching mechanism is discussed in depth for both the multipacting amplifying state and the steady working state. The bunch length broadening induced by longitudinal space-charge (SC) effects is investigated with different theoretical models in different regions. Simulations of the propagation were also performed with the 2D PIC code MAGIC and the beam dynamics code TraceWin. The results show excellent agreement between the simulation and the theoretical analysis for bunch length evolution.

  10. Modelling of radio frequency sheath and fast wave coupling on the realistic ion cyclotron resonant antenna surroundings and the outer wall

    NASA Astrophysics Data System (ADS)

    Lu, L.; Colas, L.; Jacquot, J.; Després, B.; Heuraux, S.; Faudot, E.; Van Eester, D.; Crombé, K.; Křivská, A.; Noterdaeme, J.-M.; Helou, W.; Hillairet, J.

    2018-03-01

    In order to model the sheath rectification in a realistic geometry over the size of ion cyclotron resonant heating (ICRH) antennas, the self-consistent sheaths and waves for ICH (SSWICH) code couples self-consistently the RF wave propagation and the DC SOL biasing via nonlinear RF and DC sheath boundary conditions applied at plasma/wall interfaces. A first version of SSWICH had 2D (toroidal and radial) geometry, rectangular walls either normal or parallel to the confinement magnetic field B0 and only included the evanescent slow wave (SW) excited parasitically by the ICRH antenna. The main wave for plasma heating, the fast wave (FW), played no role in the sheath excitation in this version. A new version of the code, 2D SSWICH-full wave, was developed based on the COMSOL software, to accommodate full RF field polarization and shaped walls tilted with respect to B0. SSWICH-full wave simulations have shown the mode conversion of FW into SW occurring at the sharp corners where the boundary shape varies rapidly. It has also evidenced ‘far-field’ sheath oscillations appearing at the shaped walls with a relatively long magnetic connection length to the antenna, that are only accessible to the propagating FW. Joint simulation, conducted by SSWICH-full wave within a multi-2D approach excited using the 3D wave coupling code (RAPLICASOL), has recovered the double-hump poloidal structure measured in the experimental temperature and potential maps when only the SW is modelled. The FW contribution to the potential poloidal structure seems to be affected by the 3D effects, which were ignored in the current stage. Finally, SSWICH-full wave simulation revealed the left-right asymmetry that has been observed extensively in the unbalanced strap feeding experiments, suggesting that the spatial proximity effects in RF sheath excitation, studied for SW only previously, are still important in the vicinity of the wave launcher under full wave polarizations.

  11. Nucleoplasmin-like domain of FKBP39 from Drosophila melanogaster forms a tetramer with partly disordered tentacle-like C-terminal segments

    PubMed Central

    Kozłowska, Małgorzata; Tarczewska, Aneta; Jakób, Michał; Bystranowska, Dominika; Taube, Michał; Kozak, Maciej; Czarnocki-Cieciura, Mariusz; Dziembowski, Andrzej; Orłowski, Marek; Tkocz, Katarzyna; Ożyhar, Andrzej

    2017-01-01

    Nucleoplasmins are a nuclear chaperone family defined by the presence of a highly conserved N-terminal core domain. X-ray crystallographic studies of isolated nucleoplasmin core domains revealed a β-propeller structure consisting of a set of five monomers that together form a stable pentamer. Recent studies on isolated N-terminal domains from Drosophila 39-kDa FK506-binding protein (FKBP39) and from other chromatin-associated proteins showed analogous, nucleoplasmin-like (NPL) pentameric structures. Here, we report that the NPL domain of the full-length FKBP39 does not form pentameric complexes. Multi-angle light scattering (MALS) and sedimentation equilibrium ultracentrifugation (SE AUC) analyses of the molecular mass of the full-length protein indicated that FKBP39 forms homotetrameric complexes. Molecular models reconstructed from small-angle X-ray scattering (SAXS) revealed that the NPL domain forms a stable, tetrameric core and that FK506-binding domains are linked to it by intrinsically disordered, flexible chains that form tentacle-like segments. Analyses of full-length FKBP39 and its isolated NPL domain suggested that the distal regions of the polypeptide chain influence and determine the quaternary conformation of the nucleoplasmin-like protein. These results provide new insights regarding the conserved structure of nucleoplasmin core domains and provide a potential explanation for the importance of the tetrameric structural organization of full-length nucleoplasmins. PMID:28074868

  12. RT-PCR and sequence analysis of the full-length fusion protein of Canine Distemper Virus from domestic dogs.

    PubMed

    Romanutti, Carina; Gallo Calderón, Marina; Keller, Leticia; Mattion, Nora; La Torre, José

    2016-02-01

    During 2007-2014, 84 out of 236 (35.6%) samples from domestic dogs submitted to our laboratory for diagnostic purposes were positive for Canine Distemper Virus (CDV), as analyzed by RT-PCR amplification of a fragment of the nucleoprotein gene. Fifty-nine of them (70.2%) were from dogs that had been vaccinated against CDV. The full-length gene encoding the Fusion (F) protein of fifteen isolates was sequenced and compared with those of other CDVs, including wild-type and vaccine strains. Phylogenetic analysis using the F gene full-length sequences grouped all the Argentinean CDV strains in the SA2 clade. Sequence identity with the Onderstepoort vaccine strain was 89.0-90.6%, and the highest divergence was found in the 135 amino acids corresponding to the F protein signal-peptide, Fsp (64.4-66.7% identity). In contrast, this region was highly conserved among the local strains (94.1-100% identity). One extra putative N-glycosylation site was identified in the F gene of CDV Argentinean strains with respect to the vaccine strain. The present report is the first to analyze full-length F protein sequences of CDV strains circulating in Argentina, and contributes to the knowledge of molecular epidemiology of CDV, which may help in understanding future disease outbreaks. Copyright © 2015 Elsevier B.V. All rights reserved.

  13. Hibiscus latent Fort Pierce virus in Brazil and synthesis of its biologically active full-length cDNA clone.

    PubMed

    Gao, Ruimin; Niu, Shengniao; Dai, Weifang; Kitajima, Elliot; Wong, Sek-Man

    2016-10-01

    A Brazilian isolate of Hibiscus latent Fort Pierce virus (HLFPV-BR) was first found in a hibiscus plant in Limeira, SP, Brazil. RACE PCR was carried out to obtain the full-length sequence of HLFPV-BR, which is 6453 nucleotides long and shares more than 99.15% complete genomic RNA nucleotide sequence identity with that of the HLFPV Japanese isolate. The genomic structure of HLFPV-BR is similar to that of other tobamoviruses. It includes a 5' untranslated region (UTR), followed by open reading frames encoding a 128-kDa protein and a 188-kDa readthrough protein, a 38-kDa movement protein, an 18-kDa coat protein, and a 3' UTR. Interestingly, the unique feature of a poly(A) tract is also found within its 3' UTR. Furthermore, from the total RNA extracted from the local lesions of HLFPV-BR-infected Chenopodium quinoa leaves, a biologically active, full-length cDNA clone encompassing the genome of HLFPV-BR was amplified and placed adjacent to a T7 RNA polymerase promoter. The capped in vitro transcripts from the cloned cDNA were infectious when mechanically inoculated into C. quinoa and Nicotiana benthamiana plants. This is the first report of the presence of an isolate of HLFPV in Brazil and the successful synthesis of a biologically active HLFPV-BR full-length cDNA clone.

  14. ScaffoldSeq: Software for characterization of directed evolution populations.

    PubMed

    Woldring, Daniel R; Holec, Patrick V; Hackel, Benjamin J

    2016-07-01

    ScaffoldSeq is software designed for the numerous applications, including directed evolution analysis, in which a user generates a population of DNA sequences encoding partially diverse proteins with related functions and would like to characterize the single-site and pairwise amino acid frequencies across the population. A common scenario for enzyme maturation, antibody screening, and alternative scaffold engineering involves naïve and evolved populations that contain diversified regions, varying in both sequence and length, within a conserved framework. Analyzing the diversified regions of such populations is facilitated by high-throughput sequencing platforms; however, length variability within these regions (e.g., antibody CDRs) encumbers the alignment process. To overcome this challenge, the ScaffoldSeq algorithm takes advantage of conserved framework sequences to quickly identify diverse regions. Beyond this, unintended biases in sequence frequency are generated throughout the experimental workflow required to evolve and isolate clones of interest prior to DNA sequencing. ScaffoldSeq software uniquely handles this issue by providing tools to quantify and remove background sequences, cluster similar protein families, and dampen the impact of dominant clones. The software produces graphical and tabular summaries for each region of interest, allowing users to evaluate diversity in a site-specific manner as well as identify epistatic pairwise interactions. The code and detailed information are freely available at http://research.cems.umn.edu/hackel. Proteins 2016; 84:869-874. © 2016 Wiley Periodicals, Inc.
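
    The idea of using conserved framework sequences to locate a length-variable diversified region, and then tallying single-site amino acid frequencies, can be illustrated with a minimal Python sketch. This is not the ScaffoldSeq implementation: exact string matching of the flanks, grouping by region length, and all function names here are assumptions made for illustration only.

```python
from collections import Counter


def extract_region(seq, left_flank, right_flank):
    """Return the stretch between two conserved flanks, or None if a flank is missing.

    Exact matching is an illustrative simplification (assumption).
    """
    start = seq.find(left_flank)
    if start == -1:
        return None
    start += len(left_flank)
    end = seq.find(right_flank, start)
    if end == -1:
        return None
    return seq[start:end]


def site_frequencies(regions):
    """Per-position residue frequencies for regions that all share one length."""
    if not regions:
        return []
    freqs = []
    for i in range(len(regions[0])):
        counts = Counter(region[i] for region in regions)
        total = sum(counts.values())
        freqs.append({aa: n / total for aa, n in counts.items()})
    return freqs


def frequencies_by_length(seqs, left_flank, right_flank):
    """Group extracted regions by length so no alignment is needed, then tally each group."""
    by_length = {}
    for seq in seqs:
        region = extract_region(seq, left_flank, right_flank)
        if region is not None:
            by_length.setdefault(len(region), []).append(region)
    return {n: site_frequencies(regions) for n, regions in by_length.items()}
```

    Grouping by region length sidesteps alignment of length-variable regions in this toy version; sequences lacking either flank are simply skipped.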

  15. Detection and initial characterization of protein entities consisting of the HIV glycoprotein cytoplasmic C-terminal domain alone.

    PubMed

    Pfeiffer, Tanya; Ruppert, Thomas; Schaal, Heiner; Bosch, Valerie

    2013-06-20

    Employing antibodies against the cytoplasmic tail of the HIV-1 glycoprotein (Env-CT), in addition to gp160/gp41, we have identified several novel small Env proteins (<25kD) in HIV-1 transfected and infected cells. Mass spectrometric and mutational analyses show that two mechanisms contribute to their generation. Thus the protein, designated Tr-Env-CT (for truncated Env-CT), consists of the C-terminal 139 amino acids (aa) of Env (aa 718-856) with the N-terminal Q718 modified to pyroglutamic acid. It is likely derived from full-length Env protein by proteolytic processing. A further heterogeneous set of slightly larger proteins, termed Env-CT* species, are rather derived from spliced mRNAs containing only those Env C-terminal residues (aa 719-856) which overlap with the second tat and rev coding exons. They are N-terminally extended in the same reading frame. It is conceivable that essential Env-CT functions may be fulfilled by these novel species rather than by the full-length glycoprotein itself. Copyright © 2013 Elsevier Inc. All rights reserved.

  16. A Possible Role of the Full-Length Nascent Protein in Post-Translational Ribosome Recycling.

    PubMed

    Das, Debasis; Samanta, Dibyendu; Bhattacharya, Arpita; Basu, Arunima; Das, Anindita; Ghosh, Jaydip; Chakrabarti, Abhijit; Das Gupta, Chanchal

    2017-01-01

    Each cycle of translation initiation in bacterial cells requires free 50S and 30S ribosomal subunits originating from the post-translational dissociation of the 70S ribosome from the previous cycle. Literature shows stable dissociation of 70S from model post-termination complexes by the concerted action of Ribosome Recycling Factor (RRF) and Elongation Factor G (EF-G) that interact with the rRNA bridge B2a/B2b joining 50S to 30S. In such experimental models, the role of full-length nascent protein was never considered seriously. We observed relatively slow release of full-length nascent protein from the 50S subunit of the post-translation ribosome, and in that process, its toe prints on the rRNA in vivo and in in vitro translation with E. coli S30 extract. We reported earlier that a number of chemically unfolded proteins like bovine carbonic anhydrase (BCA), lactate dehydrogenase (LDH), malate dehydrogenase (MDH), lysozyme, ovalbumin etc., when added to free 70S in lieu of the full-length nascent proteins, also interact with identical RNA regions of the 23S rRNA. Interestingly, the rRNA nucleotides that slow down release of the C-terminus of full-length unfolded protein were found in close proximity to the B2a/B2b bridge. This indicated a potentially important chemical reaction conserved throughout evolution. Here we set out to probe that conserved role of unfolded protein conformation in splitting the free or post-termination 70S. How both the RRF-EFG dependent and the plausible nascent protein-EFG dependent ribosome recycling pathways might be relevant in bacteria is discussed here.

  17. A Possible Role of the Full-Length Nascent Protein in Post-Translational Ribosome Recycling

    PubMed Central

    Das, Debasis; Samanta, Dibyendu; Bhattacharya, Arpita; Basu, Arunima; Das, Anindita; Ghosh, Jaydip; Chakrabarti, Abhijit; Das Gupta, Chanchal

    2017-01-01

    Each cycle of translation initiation in bacterial cells requires free 50S and 30S ribosomal subunits originating from the post-translational dissociation of the 70S ribosome from the previous cycle. Literature shows stable dissociation of 70S from model post-termination complexes by the concerted action of Ribosome Recycling Factor (RRF) and Elongation Factor G (EF-G) that interact with the rRNA bridge B2a/B2b joining 50S to 30S. In such experimental models, the role of full-length nascent protein was never considered seriously. We observed relatively slow release of full-length nascent protein from the 50S subunit of the post-translation ribosome, and in that process, its toe prints on the rRNA in vivo and in in vitro translation with E. coli S30 extract. We reported earlier that a number of chemically unfolded proteins like bovine carbonic anhydrase (BCA), lactate dehydrogenase (LDH), malate dehydrogenase (MDH), lysozyme, ovalbumin etc., when added to free 70S in lieu of the full-length nascent proteins, also interact with identical RNA regions of the 23S rRNA. Interestingly, the rRNA nucleotides that slow down release of the C-terminus of full-length unfolded protein were found in close proximity to the B2a/B2b bridge. This indicated a potentially important chemical reaction conserved throughout evolution. Here we set out to probe that conserved role of unfolded protein conformation in splitting the free or post-termination 70S. How both the RRF-EFG dependent and the plausible nascent protein–EFG dependent ribosome recycling pathways might be relevant in bacteria is discussed here. PMID:28099529

  18. Recent horizontal transfer of mellifera subfamily mariner transposons into insect lineages representing four different orders shows that selection acts only during horizontal transfer.

    PubMed

    Lampe, David J; Witherspoon, David J; Soto-Adames, Felipe N; Robertson, Hugh M

    2003-04-01

    We report the isolation and sequencing of genomic copies of mariner transposons involved in recent horizontal transfers into the genomes of the European earwig, Forficula auricularia; the European honey bee, Apis mellifera; the Mediterranean fruit fly, Ceratitis capitata; and a blister beetle, Epicauta funebris, insects from four different orders. These elements are in the mellifera subfamily and are the second documented example of full-length mariner elements involved in this kind of phenomenon. We applied maximum likelihood methods to the coding sequences and determined that the copies in each genome were evolving neutrally, whereas reconstructed ancestral coding sequences appeared to be under selection, which strengthens our previous hypothesis that the primary selective constraint on mariner sequence evolution is the act of horizontal transfer between genomes.

  19. Isolation and characterization of full-length cDNA clones coding for cholinesterase from fetal human tissues.

    PubMed Central

    Prody, C A; Zevin-Sonkin, D; Gnatt, A; Goldberg, O; Soreq, H

    1987-01-01

    To study the primary structure and regulation of human cholinesterases, oligodeoxynucleotide probes were prepared according to a consensus peptide sequence present in the active site of both human serum pseudocholinesterase (BtChoEase; EC 3.1.1.8) and Torpedo electric organ "true" acetylcholinesterase (AcChoEase; EC 3.1.1.7). Using these probes, we isolated several cDNA clones from lambda gt10 libraries of fetal brain and liver origins. These include 2.4-kilobase cDNA clones that code for a polypeptide containing a putative signal peptide and the N-terminal, active site, and C-terminal peptides of human BtChoEase, suggesting that they code either for BtChoEase itself or for a very similar but distinct fetal form of cholinesterase. In RNA blots of poly(A)+ RNA from the cholinesterase-producing fetal brain and liver, these cDNAs hybridized with a single 2.5-kilobase band. Blot hybridization to human genomic DNA revealed that these fetal BtChoEase cDNA clones hybridize with DNA fragments of the total length of 17.5 kilobases, and signal intensities indicated that these sequences are not present in many copies. Both the cDNA-encoded protein and its nucleotide sequence display striking homology to parallel sequences published for Torpedo AcChoEase. These findings demonstrate extensive homologies between the fetal BtChoEase encoded by these clones and other cholinesterases of various forms and species. Images PMID:3035536

  20. 2D hydrodynamic simulations of a variable length gas target for density down-ramp injection of electrons into a laser wakefield accelerator

    NASA Astrophysics Data System (ADS)

    Kononenko, O.; Lopes, N. C.; Cole, J. M.; Kamperidis, C.; Mangles, S. P. D.; Najmudin, Z.; Osterhoff, J.; Poder, K.; Rusby, D.; Symes, D. R.; Warwick, J.; Wood, J. C.; Palmer, C. A. J.

    2016-09-01

    In this work, two-dimensional (2D) hydrodynamic simulations of a variable length gas cell were performed using the open source fluid code OpenFOAM. The gas cell was designed to study controlled injection of electrons into a laser-driven wakefield at the Astra Gemini laser facility. The target consists of two compartments: an accelerator and an injector section connected via an aperture. A sharp transition between the peak and plateau density regions in the injector and accelerator compartments, respectively, was observed in simulations with various inlet pressures. The fluid simulations indicate that the length of the down-ramp connecting the sections depends on the aperture diameter, as does the density drop outside the entrance and the exit cones. Further studies showed, that increasing the inlet pressure leads to turbulence and strong fluctuations in density along the axial profile during target filling, and consequently, is expected to negatively impact the accelerator stability.

  1. What is the spatial sampling of MISR?

    Atmospheric Science Data Center

    2014-12-08

    ... spatial resolution of the sensors without exceeding the data transfer quotas, MISR can be operated in two different data acquisition modes: ... data at the full resolution, but only for limited periods of time and therefore for limited regions, typically about 300 km in length (along ...

  2. Predicting the Where and the How Big of Solar Flares

    NASA Astrophysics Data System (ADS)

    Barnes, Graham; Leka, K. D.; Gilchrist, Stuart

    2017-08-01

    The approach to predicting solar flares generally characterizes global properties of a solar active region, for example the total magnetic flux or the total length of a sheared magnetic neutral line, and compares new data (from which to make a prediction) to similar observations of active regions and their associated propensity for flare production. We take here a different tack, examining solar active regions in the context of their energy storage capacity. Specifically, we characterize not the region as a whole, but summarize the energy-release prospects of different sub-regions within, using a sub-area analysis of the photospheric boundary, the CFIT non-linear force-free extrapolation code, and the Minimum Current Corona model. We present here early results from this approach whose objective is to understand the different pathways available for regions to release stored energy, thus eventually providing better estimates of the where (what sub-areas are storing how much energy) and the how big (how much energy is stored, and how much is available for release) of solar flares.

  3. Characterization of anti-liver-kidney microsome antibody (anti-LKM1) from hepatitis C virus-positive and -negative sera.

    PubMed

    Yamamoto, A M; Cresteil, D; Homberg, J C; Alvarez, F

    1993-06-01

    Hepatitis C virus-related antibodies were found in sera positive for antibodies to liver/kidney microsome antibody, usually considered a marker of autoimmune hepatitis. The aim of this study was to analyze the specificity of this autoantibody in sera from patients with and without hepatitis C virus infection. Fifteen anti-hepatitis C virus- and anti-liver kidney microsome-positive sera were compared with 11 sera from patients with autoimmune hepatitis, for reactivity against rat and human liver microsomal proteins, P450IID6 recombinant proteins, and various synthetic peptides spanning the 241-429 amino acids sequence of the P450IID6. Ten of 11 sera from patients with autoimmune hepatitis bound to recombinant proteins spanning the P450IID6 region between amino acids 72 and 458. These sera bound to the 254-271 peptide, and some also recognized the 321-351, 373-389 and 410-429 peptides. Four of 15 antihepatitis C virus recognized the fusion protein coded by the full-length P450IID6 complementary DNA; 3 of them also reacted with the P450IID6 region between amino acids 72-456. Only 1 sera recognized the 321-351 peptide. P450IID6 antigenic sites recognized by anti-hepatitis C virus-positive sera were different from those recognized by sera from patients with autoimmune hepatitis.

  4. cDNA cloning of the human peroxisomal enoyl-CoA hydratase: 3-Hydroxyacyl-CoA dehydrogenase bifunctional enzyme and localization to chromosome 3q26.3-3q28: A free left Alu arm is inserted in the 3′ noncoding region

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Hoefler, G.; Forstner, M.; Hulla, W.

    1994-01-01

    Enoyl-CoA hydratase:3-hydroxyacyl-CoA dehydrogenase bifunctional enzyme is one of the four enzymes of the peroxisomal β-oxidation pathway. Here, the authors report the full-length human cDNA sequence and the localization of the corresponding gene on chromosome 3q26.3-3q28. The cDNA sequence spans 3779 nucleotides with an open reading frame of 2169 nucleotides. The tripeptide SKL at the carboxy terminus, known to serve as a peroxisomal targeting signal, is present. DNA sequence comparison of the coding region showed an 80% homology between human and rat bifunctional enzyme cDNA. The 3′ noncoding sequence contains 117 nucleotides homologous to an Alu repeat. Based on sequence comparison, they propose that these nucleotides are a free left Alu arm with 86% homology to the Alu-J family. RNA analysis shows one band with highest intensity in liver and kidney. This cDNA will allow in-depth studies of molecular defects in patients with defective peroxisomal bifunctional enzyme. Moreover, it will also provide a means for studying the regulation of peroxisomal β-oxidation in humans. 33 refs., 5 figs.

  5. Single mutation in Shine-Dalgarno-like sequence present in the amino terminal of lactate dehydrogenase of Plasmodium effects the production of an eukaryotic protein expressed in a prokaryotic system.

    PubMed

    Cicek, Mustafa; Mutlu, Ozal; Erdemir, Aysegul; Ozkan, Ebru; Saricay, Yunus; Turgut-Balik, Dilek

    2013-06-01

    One of the most important steps in structure-based drug design studies is obtaining the protein in active form after cloning the target gene. In one of our previous studies, it was determined that an internal Shine-Dalgarno-like sequence present just before the third methionine at the N-terminus of the wild-type lactate dehydrogenase enzyme of Plasmodium falciparum prevents translation of the full-length protein. Inspection of the same region in P. vivax LDH, which was overproduced as an active enzyme, indicated that the codon preference in the same region was slightly different from that of wild-type PfLDH. In this study, the 5'-GGAGGC-3' sequence of P. vivax that codes for two glycine residues just before the third methionine was changed to 5'-GGAGGA-3', mimicking P. falciparum LDH, to test the possible effects of having an internal SD-like sequence when expressing a eukaryotic protein in a prokaryotic system. The exchange was made by site-directed mutagenesis. Results indicated that having two glycine residues with an internal SD-like sequence (GGAGGA) just before the third methionine abolishes the enzyme activity due to the preference of the prokaryotic system used for the expression. This study emphasizes the need for awareness when using a prokaryotic system to overproduce a eukaryotic protein.
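
    The substitution described in this abstract is synonymous at the protein level: both hexamers encode Gly-Gly and differ at a single nucleotide. A short Python sketch makes this concrete. The helper names are hypothetical, and the only biological fact relied on is that GGA, GGC, GGG and GGT are the glycine codons of the standard genetic code.

```python
# Glycine codons in the standard genetic code.
GLY_CODONS = {"GGA", "GGC", "GGG", "GGT"}


def split_codons(seq):
    """Split an in-frame DNA string into consecutive triplets."""
    return [seq[i:i + 3] for i in range(0, len(seq), 3)]


def encodes_gly_gly(seq):
    """True if the sequence is exactly two glycine codons."""
    codons = split_codons(seq)
    return len(codons) == 2 and all(c in GLY_CODONS for c in codons)


def nucleotide_differences(a, b):
    """List (1-based position, base in a, base in b) wherever two equal-length sequences differ."""
    return [(i + 1, x, y) for i, (x, y) in enumerate(zip(a, b)) if x != y]


original = "GGAGGC"  # P. vivax sequence quoted in the abstract
mutant = "GGAGGA"    # SD-like sequence mimicking P. falciparum LDH

same_protein = encodes_gly_gly(original) and encodes_gly_gly(mutant)
changes = nucleotide_differences(original, mutant)
```

    Because the encoded residues are unchanged, any effect on expression in this comparison would come from the nucleotide sequence itself rather than from the amino acid sequence.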

  6. Cloning and characterization of a cell cycle-regulated gene encoding topoisomerase I from Nicotiana tabacum that is inducible by light, low temperature and abscisic acid.

    PubMed

    Mudgil, Y; Singh, B N; Upadhyaya, K C; Sopory, S K; Reddy, M K

    2002-05-01

    We have cloned a full-length 2874-bp cDNA coding for tobacco topoisomerase I, with an ORF of 2559 bp encoding a protein of 852 amino acids with a calculated molecular mass of 95 kDa and an estimated pI of 9.51. The deduced amino acid sequence shows homology to other eukaryotic topoisomerases I. Tobacco topoisomerase I was over-expressed in Escherichia coli, and the purified recombinant protein was found to relax both positively and negatively super-coiled DNA in the absence of the divalent cation Mg2+ and ATP. These characteristic features indicate that the tobacco enzyme is a type I topoisomerase. The recombinant protein could be phosphorylated at (a) threonine residue(s) by protein kinase C. However, phosphorylation did not cause any change in its enzymatic activity. The genomic organization of the topoisomerase I gene revealed the presence of 8 exons and 7 introns in the region corresponding to the ORF and one intron in the 3' UTR region. Transcript analysis using RT-PCR showed basal constitutive expression in all organs examined, and the gene was expressed at all stages of the cell cycle, but the level of expression increased during the G1-S phase. The transcript level also increased following exposure to light, low-temperature stress and abscisic acid, a stress hormone.
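
    The lengths quoted in this abstract are mutually consistent if the 2559-bp ORF is counted together with its stop codon. That reading is an assumption, since the abstract does not say how the ORF was measured. A quick Python check, with hypothetical names:

```python
def protein_length_from_orf(orf_bp, includes_stop=True):
    """Number of residues encoded by an ORF of the given length in base pairs.

    Assumes the ORF length is a whole number of codons and, by default,
    that the final codon is a stop codon (assumption, not stated in the abstract).
    """
    if orf_bp % 3 != 0:
        raise ValueError("ORF length must be a multiple of 3")
    codons = orf_bp // 3
    return codons - 1 if includes_stop else codons


cdna_bp = 2874  # full-length cDNA, from the abstract
orf_bp = 2559   # ORF, from the abstract

residues = protein_length_from_orf(orf_bp)  # codons minus the stop codon
untranslated_bp = cdna_bp - orf_bp          # combined 5' and 3' non-ORF sequence
```

    Under that assumption the ORF corresponds to the 852 residues reported, leaving 315 bp of the cDNA outside the ORF.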

  7. Secretory production of tetrameric native full-length streptavidin with thermostability using Streptomyces lividans as a host.

    PubMed

    Noda, Shuhei; Matsumoto, Takuya; Tanaka, Tsutomu; Kondo, Akihiko

    2015-01-13

    Streptavidin is a tetrameric protein derived from Streptomyces avidinii, and has tight and specific biotin binding affinity. Applications of the streptavidin-biotin system have been widely studied. Streptavidin is generally produced using protein expression in Escherichia coli. In the present study, the secretory production of streptavidin was carried out using Streptomyces lividans as a host. In this study, we used the gene encoding native full-length streptavidin, whereas the core region is generally used for streptavidin production in E. coli. Tetrameric streptavidin composed of native full-length streptavidin monomers was successfully secreted in the culture supernatant of S. lividans transformants, and had specific biotin binding affinity as strong as streptavidin produced by E. coli. The amount of Sav using S. lividans was about 9 times higher than using E. coli. Surprisingly, streptavidin produced by S. lividans exhibited affinity to biotin after boiling, despite the fact that tetrameric streptavidin is known to lose its biotin binding ability after brief boiling. We successfully produced a large amount of tetrameric streptavidin as a secretory-form protein with unique thermotolerance.

  8. Myc-nick: a cytoplasmic cleavage product of Myc that promotes alpha-tubulin acetylation and cell differentiation.

    PubMed

    Conacci-Sorrell, Maralice; Ngouenet, Celine; Eisenman, Robert N

    2010-08-06

    The Myc oncoprotein family comprises transcription factors that control multiple cellular functions and are widely involved in oncogenesis. Here we report the identification of Myc-nick, a cytoplasmic form of Myc generated by calpain-dependent proteolysis at lysine 298 of full-length Myc. Myc-nick retains conserved Myc box regions but lacks nuclear localization signals and the bHLHZ domain essential for heterodimerization with Max and DNA binding. Myc-nick induces alpha-tubulin acetylation and altered cell morphology by recruiting histone acetyltransferase GCN5 to microtubules. During muscle differentiation, while the levels of full-length Myc diminish, Myc-nick and acetylated alpha-tubulin levels are increased. Ectopic expression of Myc-nick accelerates myoblast fusion, triggers the expression of myogenic markers, and permits Myc-deficient fibroblasts to transdifferentiate in response to MyoD. We propose that the cleavage of Myc by calpain abrogates the transcriptional inhibition of differentiation by full-length Myc and generates Myc-nick, a driver of cytoplasmic reorganization and differentiation. Copyright 2010 Elsevier Inc. All rights reserved.

  9. The complete mitochondrial genome of the North Chinese Leopard (Panthera pardus japonensis).

    PubMed

    Dou, Hailong; Feng, Limin; Xiao, Wenhong; Wang, Tianming

    2016-01-01

    The North Chinese Leopard (Panthera pardus japonensis) is a subspecies of Panthera pardus endemic to China, living in small and isolated populations with a severely fragmented distribution. Here we sequenced and annotated its complete mitochondrial genome for the first time. The mitochondrial genome of the North Chinese Leopard has a total length of 16,966 base pairs and consists of 2 rRNA genes, 22 tRNA genes, 13 protein-coding genes, 1 OLR and 1 control region (CR). The structure of the genome was highly similar to that of other Felidae.

  10. The complete mitochondrial genome of the masked palm civet (Paguma larvata, Mammalia, Carnivora).

    PubMed

    Zhang, Dan; Xu, Liwen; Bu, Hongliang; Wang, Di; Xu, Chongren; Wang, Rongjiang

    2016-09-01

    The complete mitochondrial genome of the masked palm civet (Paguma larvata, Mammalia, Carnivora) is a circular molecule of 16 710 bp in length, containing 22 transfer RNA genes, 13 protein-coding genes, two ribosomal RNA genes, and a control region. The features of the mitochondrial genome of the masked palm civet are similar to the other mammals. The phylogenetic analysis shows that all species from the family Viverridae cluster together, in which P. larvata exhibits the closest relationship with Genetta servalina.

  11. An adaptable binary entropy coder

    NASA Technical Reports Server (NTRS)

    Kiely, A.; Klimesh, M.

    2001-01-01

    We present a novel entropy coding technique which is based on recursive interleaving of variable-to-variable length binary source codes. We discuss code design and performance estimation methods, as well as practical encoding and decoding algorithms.

  12. Sequencing and analysis of 10,967 full-length cDNA clones from Xenopus laevis and Xenopus tropicalis reveals post-tetraploidization transcriptome remodeling

    PubMed Central

    Morin, Ryan D.; Chang, Elbert; Petrescu, Anca; Liao, Nancy; Griffith, Malachi; Kirkpatrick, Robert; Butterfield, Yaron S.; Young, Alice C.; Stott, Jeffrey; Barber, Sarah; Babakaiff, Ryan; Dickson, Mark C.; Matsuo, Corey; Wong, David; Yang, George S.; Smailus, Duane E.; Wetherby, Keith D.; Kwong, Peggy N.; Grimwood, Jane; Brinkley, Charles P.; Brown-John, Mabel; Reddix-Dugue, Natalie D.; Mayo, Michael; Schmutz, Jeremy; Beland, Jaclyn; Park, Morgan; Gibson, Susan; Olson, Teika; Bouffard, Gerard G.; Tsai, Miranda; Featherstone, Ruth; Chand, Steve; Siddiqui, Asim S.; Jang, Wonhee; Lee, Ed; Klein, Steven L.; Blakesley, Robert W.; Zeeberg, Barry R.; Narasimhan, Sudarshan; Weinstein, John N.; Pennacchio, Christa Prange; Myers, Richard M.; Green, Eric D.; Wagner, Lukas; Gerhard, Daniela S.; Marra, Marco A.; Jones, Steven J.M.; Holt, Robert A.

    2006-01-01

    Sequencing of full-insert clones from full-length cDNA libraries from both Xenopus laevis and Xenopus tropicalis has been ongoing as part of the Xenopus Gene Collection Initiative. Here we present 10,967 full ORF verified cDNA clones (8049 from X. laevis and 2918 from X. tropicalis) as a community resource. Because the genome of X. laevis, but not X. tropicalis, has undergone allotetraploidization, comparison of coding sequences from these two clawed (pipid) frogs provides a unique angle for exploring the molecular evolution of duplicate genes. Within our clone set, we have identified 445 gene trios, each comprised of an allotetraploidization-derived X. laevis gene pair and their shared X. tropicalis ortholog. Pairwise dN/dS comparisons within trios show strong evidence for purifying selection acting on all three members. However, dN/dS ratios between X. laevis gene pairs are elevated relative to their X. tropicalis ortholog. This difference is highly significant and indicates an overall relaxation of selective pressures on duplicated gene pairs. We have found that the paralogs that have been lost since the tetraploidization event are enriched for several molecular functions, but have found no such enrichment in the extant paralogs. Approximately 14% of the paralogous pairs analyzed here also show differential expression indicative of subfunctionalization. PMID:16672307

  13. Complete chloroplast genome sequence of MD-2 pineapple and its comparative analysis among nine other plants from the subclass Commelinidae.

    PubMed

    Redwan, R M; Saidin, A; Kumar, S V

    2015-08-12

    Pineapple (Ananas comosus var. comosus) is known as the king of fruits for its crown and is the third most important tropical fruit after banana and citrus. The plant, which is indigenous to South America, is the most important species in the Bromeliaceae family and is largely traded for fresh fruit consumption. Here, we report the complete chloroplast sequence of the MD-2 pineapple that was sequenced using the PacBio sequencing technology. In this study, the high error rate of PacBio long sequence reads of A. comosus total genomic DNA was reduced by leveraging the highly accurate but short Illumina reads for error correction via the latest error correction module from Novocraft. Error-corrected long PacBio reads were assembled using a single tool to produce a contig representing the pineapple chloroplast genome. The genome, 159,636 bp in length, features the conserved quadripartite structure of chloroplasts, containing a large single copy region (LSC) of 87,482 bp, a small single copy region (SSC) of 18,622 bp and two inverted repeat regions (IRA and IRB) of 26,766 bp each. Overall, the genome contained 117 unique coding regions, and 30 were repeated in the IR region, with its gene content, structure and arrangement similar to those of its sister taxon, Typha latifolia. A total of 35 repeat structures were detected in both the coding and non-coding regions, with a majority being tandem repeats. In addition, 205 SSRs were detected in the genome, with six protein-coding genes containing more than two SSRs. Comparison of chloroplast genomes from the subclass Commelinidae revealed a conserved protein-coding gene, albeit located in a highly divergent region. Analysis of selection pressure on protein-coding genes using the Ka/Ks ratio showed significant positive selection exerted on the rps7 gene of the pineapple chloroplast with P < 0.05. Phylogenetic analysis confirmed the recent taxonomical relation among the members of the commelinids, which supports the monophyletic relationship between Arecales and Dasypogonaceae and between Zingiberales and the Poales, which includes A. comosus. The complete sequence of the chloroplast of pineapple provides insights into the divergence of genic chloroplast sequences among the members of the subclass Commelinidae. The complete pineapple chloroplast will serve as a reference for in-depth taxonomical studies in the Bromeliaceae family when more species in the family are sequenced in the future. The genetic sequence information will also make feasible other molecular applications of the pineapple chloroplast for plant genetic improvement.

  14. 1,2,3,4,6-penta-O-galloyl-β-D-glucopyranose Binds to the N-terminal Metal Binding Region to Inhibit Amyloid β-protein Oligomer and Fibril Formation.

    PubMed

    de Almeida, Natália E C; Do, Thanh D; LaPointe, Nichole E; Tro, Michael; Feinstein, Stuart C; Shea, Joan-Emma; Bowers, Michael T

    2017-09-01

    The early oligomerization of amyloid β-protein (Aβ) is a crucial step in the etiology of Alzheimer's disease (AD), in which soluble and highly neurotoxic oligomers are produced and accumulated inside neurons. In search of therapeutic solutions for AD treatment and prevention, potent inhibitors that remodel Aβ assembly and prevent neurotoxic oligomer formation offer a promising approach. In particular, several polyphenolic compounds have shown anti-aggregation properties and good efficacy on inhibiting oligomeric amyloid formation. 1,2,3,4,6-penta-O-galloyl-β-D-glucopyranose is a large polyphenol that has been shown to be effective at inhibiting aggregation of full-length Aβ1-40 and Aβ1-42, but has the opposite effect on the C-terminal fragment Aβ25-35. Here, we use a combination of ion mobility coupled to mass spectrometry (IMS-MS), transmission electron microscopy (TEM) and molecular dynamics (MD) simulations to elucidate the inhibitory effect of PGG on aggregation of full-length Aβ1-40 and Aβ1-42. We show that PGG interacts strongly with these two peptides, especially in their N-terminal metal binding regions, and suppresses the formation of Aβ1-40 tetramer and Aβ1-42 dodecamer. By exploring multiple facets of polyphenol-amyloid interactions, we provide a molecular basis for the opposing effects of PGG on full-length Aβ and its C-terminal fragments.

  15. Isolation and characterization of full-length putative alcohol dehydrogenase genes from Polygonum minus

    NASA Astrophysics Data System (ADS)

    Hamid, Nur Athirah Abd; Ismail, Ismanizan

    2013-11-01

    Polygonum minus, locally known as Kesum, is an aromatic herb that is high in secondary metabolite content. Alcohol dehydrogenase (ADH) is an important enzyme that catalyzes the reversible oxidation of alcohols to aldehydes in the presence of NAD(P)(H) as cofactor. The main focus of this research is to identify the ADH gene. Total RNA was extracted from leaves of P. minus treated with 150 μM jasmonic acid. The full-length cDNA sequence of ADH was isolated via rapid amplification of cDNA ends (RACE). Subsequently, in silico analysis was conducted on the full-length cDNA sequence, and PCR was performed on genomic DNA to determine the exon and intron organization. Two ADH sequences, designated PmADH1 and PmADH2, were successfully isolated. Both sequences have an ORF of 801 bp, which encodes 266 aa residues. Nucleotide sequence comparison of PmADH1 and PmADH2 indicated that the two sequences are highly similar in the ORF region but divergent in the 3' untranslated regions (UTR). The amino acid sequences differ at residue 107; PmADH1 contains a Gly (G) residue while PmADH2 contains a Cys (C) residue. The intron-exon organization of both sequences is also the same, with 3 introns and 4 exons. Based on in silico analysis, both sequences contain the "classical" short chain alcohol dehydrogenases/reductases ((c) SDRs) conserved domain. The results suggest that both sequences are members of the short chain alcohol dehydrogenase family.

  16. The Helioseismic and Magnetic Imager (HMI) Vector Magnetic Field Pipeline: Overview and Performance

    NASA Astrophysics Data System (ADS)

    Hoeksema, J. Todd; Liu, Yang; Hayashi, Keiji; Sun, Xudong; Schou, Jesper; Couvidat, Sebastien; Norton, Aimee; Bobra, Monica; Centeno, Rebecca; Leka, K. D.; Barnes, Graham; Turmon, Michael

    2014-09-01

    The Helioseismic and Magnetic Imager (HMI) began near-continuous full-disk solar measurements on 1 May 2010 from the Solar Dynamics Observatory (SDO). An automated processing pipeline keeps pace with observations to produce observable quantities, including the photospheric vector magnetic field, from sequences of filtergrams. The basic vector-field frame list cadence is 135 seconds, but to reduce noise the filtergrams are combined to derive data products every 720 seconds. The primary 720 s observables were released in mid-2010, including Stokes polarization parameters measured at six wavelengths, as well as intensity, Doppler velocity, and the line-of-sight magnetic field. More advanced products, including the full vector magnetic field, are now available. Automatically identified HMI Active Region Patches (HARPs) track the location and shape of magnetic regions throughout their lifetime. The vector field is computed using the Very Fast Inversion of the Stokes Vector (VFISV) code optimized for the HMI pipeline; the remaining 180° azimuth ambiguity is resolved with the Minimum Energy (ME0) code. The Milne-Eddington inversion is performed on all full-disk HMI observations. The disambiguation, until recently run only on HARP regions, is now implemented for the full disk. Vector and scalar quantities in the patches are used to derive active region indices potentially useful for forecasting; the data maps and indices are collected in the SHARP data series, hmi.sharp_720s. Definitive SHARP processing is completed only after the region rotates off the visible disk; quick-look products are produced in near real time. Patches are provided in both CCD and heliographic coordinates. HMI provides continuous coverage of the vector field, but has modest spatial, spectral, and temporal resolution. Coupled with limitations of the analysis and interpretation techniques, effects of the orbital velocity, and instrument performance, the resulting measurements have a certain dynamic range and sensitivity and are subject to systematic errors and uncertainties that are characterized in this report.

  17. Factors associated with delay in trauma team activation and impact on patient outcomes.

    PubMed

    Connolly, Rory; Woo, Michael Y; Lampron, Jacinthe; Perry, Jeffrey J

    2017-09-05

    Trauma code activation is initiated by emergency physicians using physiological and anatomical criteria, mechanism of injury, and patient demographic factors. Our objective was to identify factors associated with delayed trauma team activation. We assessed consecutive cases from a regional trauma database from January 2008 to March 2014. We defined a delay in trauma code activation as a time greater than 30 minutes from the time of arrival. We conducted univariate analysis for factors potentially influencing trauma team activation, and we subsequently used multiple logistic regression analysis models for delayed activation in relation to mortality, length of stay, and time to operative management. Patients totalling 846 were included for our analysis; 4.1% (35/846) of trauma codes were activated after 30 minutes. Mean age was 40.8 years in the early group versus 49.2 in the delayed group (p=0.01). Patients were over age 70 years in 7.6% in the early activation group versus 17.1% in the delayed group (p=0.04). There was no significant difference in sex, type of injury, injury severity, or time from injury between the two groups. There was no significant difference in mortality, median length of stay, or median time to operative management. Delayed activation is linked with increasing age with no clear link to increased mortality. Given the severe injuries in the delayed cohort that required activation of the trauma team, further emphasis on the older trauma patient and interventions to recognize this vulnerable population should be made.

  18. The complete nucleotide sequence of the domestic dog (Canis familiaris) mitochondrial genome.

    PubMed

    Kim, K S; Lee, S E; Jeong, H W; Ha, J H

    1998-10-01

    The complete nucleotide sequence of the mitochondrial genome of the domestic dog, Canis familiaris, was determined. The length of the sequence was 16,728 bp; however, the length was not absolute due to the variation (heteroplasmy) caused by differing numbers of the repetitive motif, 5'-GTACACGT(A/G)C-3', in the control region. The genome organization, gene contents, and codon usage conformed to those of other mammalian mitochondrial genomes. Although its features were unknown, the "CTAGA" duplication event which followed the translational stop codon of the COII gene was not observed in other mammalian mitochondrial genomes. In order to determine the possible differences between mtDNAs in carnivores, two rRNA and 13 protein-coding genes from the cat, dog, and seal were compared. The combined molecular differences, in two rRNA genes as well as in the inferred amino acid sequences of the mitochondrial 13 protein-coding genes, suggested that there is a closer relationship between the dog and the seal than there is between either of these species and the cat. Based on the molecular differences of the mtDNA, the evolutionary divergence between the cat, the dog, and the seal was dated to approximately 50 +/- 4 million years ago. The degree of difference between carnivore mtDNAs varied according to the individual protein-coding gene applied, showing that the evolutionary relationships of distantly related species should be presented in an extended study based on ample sequence data like complete mtDNA molecules. Copyright 1998 Academic Press.

  19. An Efficient Variable Length Coding Scheme for an IID Source

    NASA Technical Reports Server (NTRS)

    Cheung, K. -M.

    1995-01-01

    A scheme is examined for using two alternating Huffman codes to encode a discrete independent and identically distributed source with a dominant symbol. This combined strategy, or alternating runlength Huffman (ARH) coding, was found to be more efficient than ordinary coding in certain circumstances.
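
    To illustrate why a dominant symbol motivates combining run-length and Huffman coding, the following minimal sketch builds an ordinary Huffman code for a hypothetical four-symbol source and compares its average codeword length with the source entropy. It is not the ARH scheme from the report; the probabilities and function names are assumptions chosen for illustration. Because every Huffman codeword costs at least one bit, a source dominated by one symbol is coded well above its entropy, which is the gap that run-length techniques aim to close.

```python
import heapq
import math


def huffman_code_lengths(probs):
    """Return the Huffman codeword length (in bits) for each symbol index."""
    # Heap entries: (probability, unique tie-breaker, symbols in this subtree).
    heap = [(p, i, [i]) for i, p in enumerate(probs)]
    heapq.heapify(heap)
    lengths = [0] * len(probs)
    counter = len(probs)
    while len(heap) > 1:
        p1, _, s1 = heapq.heappop(heap)
        p2, _, s2 = heapq.heappop(heap)
        # Merging two subtrees adds one bit to every symbol beneath them.
        for s in s1 + s2:
            lengths[s] += 1
        heapq.heappush(heap, (p1 + p2, counter, s1 + s2))
        counter += 1
    return lengths


def entropy(probs):
    """Shannon entropy of the source in bits per symbol."""
    return -sum(p * math.log2(p) for p in probs if p > 0)


def average_length(probs, lengths):
    """Expected codeword length in bits per symbol."""
    return sum(p * n for p, n in zip(probs, lengths))


# Hypothetical IID source with one dominant symbol (illustrative values only).
probs = [0.9, 0.05, 0.03, 0.02]
lengths = huffman_code_lengths(probs)
avg = average_length(probs, lengths)
h = entropy(probs)
print(lengths, round(avg, 3), round(h, 3))
```

    For this source the plain Huffman code spends more than one bit per symbol although the entropy is well below one bit per symbol.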

  20. Xuhuai goat H-FABP gene clone, subcellular localization of expression products and the preparation of transgenic mice.

    PubMed

    Yin, Yan-hui; Li, Bi-chun; Wei, Guang-hui; Zhu, Cai-ye; Li, Wei; Zhang, Ya-ni; Du, Li-xin; Cao, Wen-guang

    2012-05-01

    The aim of this study was to clone the heart-type fatty acid binding protein (H-FABP) gene of Xuhuai goat, to explore it bioinformatically, and to analyze its subcellular localization using enhanced green fluorescent protein (EGFP). The results showed that the coding sequence (CDS) length of the Xuhuai goat H-FABP gene was 402 bp, encoding 133 amino acids (GenBank accession number AY466498.1). The H-FABP cDNA coding sequence was compared with the corresponding region of human, chicken, brown rat, cow, wild boar, donkey, and zebrafish. The similarities were 89%, 76%, 85%, 84%, 93%, 91%, and 70%, respectively. For the corresponding amino acid sequences, the similarities were 90%, 79%, 88%, 97%, 95%, 94%, and 72%, respectively. This study did not find a signal peptide region in the H-FABP protein, suggesting that H-FABP might be a nonsecreted protein. H-FABP expression was detected in vitro by reverse transcription-polymerase chain reaction (RT-PCR), and the EGFP-H-FABP fusion protein was localized to the cytoplasm. The gene could also be transiently and permanently expressed in mice.

  1. Correlation approach to identify coding regions in DNA sequences

    NASA Technical Reports Server (NTRS)

    Ossadnik, S. M.; Buldyrev, S. V.; Goldberger, A. L.; Havlin, S.; Mantegna, R. N.; Peng, C. K.; Simons, M.; Stanley, H. E.

    1994-01-01

    Recently, it was observed that noncoding regions of DNA sequences possess long-range power-law correlations, whereas coding regions typically display only short-range correlations. We develop an algorithm based on this finding that enables investigators to perform a statistical analysis on long DNA sequences to locate possible coding regions. The algorithm is particularly successful in predicting the location of lengthy coding regions. For example, for the complete genome of yeast chromosome III (315,344 nucleotides), at least 82% of the predictions correspond to putative coding regions; the algorithm correctly identified all coding regions larger than 3000 nucleotides, 92% of coding regions between 2000 and 3000 nucleotides long, and 79% of coding regions between 1000 and 2000 nucleotides. The predictive ability of this new algorithm supports the claim that there is a fundamental difference in the correlation property between coding and noncoding sequences. This algorithm, which is not species-dependent, can be implemented with other techniques for rapidly and accurately locating relatively long coding regions in genomic sequences.

  2. The complete mitogenome of Ginkgo-toothed beaked whale (Mesoplodon ginkgodens) (Chordata: Ziphiidae).

    PubMed

    Yao, Chiou-Ju; Chen, Ching-Hung; Hsiao, Chung-Der

    2016-07-01

    In this study, we used next-generation sequencing to deduce the complete mitogenome of the Ginkgo-toothed beaked whale (Mesoplodon ginkgodens) for the first time. The nucleotide composition was asymmetric (33.3% A, 25.3% C, 12.6% G, and 28.7% T) with an overall GC content of 37.9%. The length of the assembled mitogenome was 16,339 bp, and it follows the typical vertebrate arrangement, including 13 protein-coding genes, 22 transfer RNA genes, 2 ribosomal RNA genes, and a non-coding control region (D-loop). The D-loop contains 870 bp and is located between tRNA-Pro and tRNA-Phe. The complete mitogenome of the Ginkgo-toothed beaked whale deduced in this study provides essential DNA molecular data for further phylogenetic and evolutionary analysis of cetaceans.
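
    The reported GC content follows directly from the base composition (25.3% C plus 12.6% G). As a minimal sketch, the snippet below checks that arithmetic and shows how the same quantity could be computed from a raw sequence; the function name `gc_content` and the toy sequence are assumptions for illustration, not part of the study.

```python
def gc_content(seq):
    """Percentage of G and C among the unambiguous bases (A, C, G, T) in seq."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        return 0.0
    return 100.0 * sum(b in "GC" for b in bases) / len(bases)


# Base composition as reported in the abstract (percent).
reported = {"A": 33.3, "C": 25.3, "G": 12.6, "T": 28.7}
gc_from_abstract = round(reported["C"] + reported["G"], 1)
print(gc_from_abstract)

# Toy sequence (hypothetical, not from the mitogenome): 2 of 4 bases are G or C.
toy = "AAGC"
print(gc_content(toy))
```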

  3. The complete mitochondrial genome of the Jacobin pigeon (Columba livia breed Jacobin).

    PubMed

    He, Wen-Xiao; Jia, Jin-Feng

    2015-06-01

    The Jacobin is a breed of fancy pigeon developed over many years of selective breeding that originated in Asia. In the present work, we report the complete mitochondrial genome sequence of Jacobin pigeon for the first time. The total length of the mitogenome was 17,245 bp with the base composition of 30.18% for A, 23.98% for T, 31.88% for C, and 13.96% for G and an A-T (54.17 %)-rich feature was detected. It harbored 13 protein-coding genes, 2 ribosomal RNA genes, 22 transfer RNA genes and 1 non-coding control region. The arrangement of all genes was identical to the typical mitochondrial genomes of pigeon. The complete mitochondrial genome sequence of Jacobin pigeon would serve as an important data set of the germplasm resources for further study.

  4. Resource utilization in primary repair of cleft palate.

    PubMed

    Owusu, James A; Liu, Meixia; Sidman, James D; Scott, Andrew R

    2013-03-01

    To estimate the current incidence of cleft palate in the United States and to determine national variations in resource utilization for primary repair of cleft palate. Retrospective analysis of a national, pediatric database (2009 Kids Inpatient Database). Patients aged 3 and below admitted for cleft palate repair were selected, using ICD-9 codes for cleft palate and procedure code for primary (initial) repair of cleft palate. A number of demographic variables were analyzed, and hospital charges were considered as a measure of resource utilization. Primary repair of cleft palate was performed on 1,943 patients. The estimated incidence was 0.11% with male to female ratio of 1.2:1. Regional incidence ranged from 0.09% (Northeast) to 0.12% (Midwest). The mean age at surgery was 13.4 months. The average length of stay was 1.9 days. The average total charge nationwide was $22,982, ranging from $17,972 (South) to $25,671 (Northeast). Average charge in a teaching institution was $4,925 higher than for nonteaching institutions. The strongest predictor of charge was length of stay, increasing charge by $7,663 for every additional hospital day (P < 0.01). National variations exist in resource utilization for primary repair of cleft palate, with higher charges in Northeastern states and teaching hospitals. The strongest predictor of increased resource use was length of stay, which was significantly higher at teaching institutions. Copyright © 2012 The American Laryngological, Rhinological, and Otological Society, Inc.

  5. 3D modeling of missing pellet surface defects in BWR fuel

    DOE PAGES

    Spencer, B. W.; Williamson, R. L.; Stafford, D. S.; ...

    2016-07-26

    One of the important roles of cladding in light water reactor fuel rods is to prevent the release of fission products. To that end, it is essential that the cladding maintain its integrity under a variety of thermal and mechanical loading conditions. Local geometric irregularities in fuel pellets caused by manufacturing defects known as missing pellet surfaces (MPS) can in some circumstances lead to elevated cladding stresses that are sufficiently high to cause cladding failure. Accurate modeling of these defects can help prevent these types of failures. The BISON nuclear fuel performance code developed at Idaho National Laboratory can be used to simulate the global thermo-mechanical fuel rod behavior, as well as the local response of regions of interest, in either 2D or 3D. In either case, a full set of models to represent the thermal and mechanical properties of the fuel, cladding and plenum gas is employed. A procedure for coupling 2D full-length fuel rod models to detailed 3D models of the region of the rod containing an MPS defect is detailed in this paper. The global and local model each contain appropriate physics and behavior models for nuclear fuel. This procedure is demonstrated on a simulation of a boiling water reactor (BWR) fuel rod containing a pellet with an MPS defect, subjected to a variety of transient events, including a control blade withdrawal and a ramp to high power. The importance of modeling the local defect using a 3D model is highlighted by comparing 3D and 2D representations of the defective pellet region. Finally, parametric studies demonstrate the effects of the choice of gaseous swelling model and of the depth and geometry of the MPS defect on the response of the cladding adjacent to the defect.

  6. Liquid Engine Design: Effect of Chamber Dimensions on Specific Impulse

    NASA Technical Reports Server (NTRS)

    Hoggard, Lindsay; Leahy, Joe

    2009-01-01

    Which assumption of combustion chemistry - frozen or equilibrium - should be used in the prediction of liquid rocket engine performance? Can a correlation be developed for this? A literature search using the LaSSe tool, an online repository of old rocket data and reports, was completed. Test results for NTO/Aerozine-50 and LOX/LH2 subscale and full-scale injectors and combustion chambers were found and studied for this task. The NASA code Chemical Equilibrium with Applications (CEA) was used to predict engine performance using both chemistry assumptions, defined here. Frozen: composition remains frozen during expansion through the nozzle. Equilibrium: instantaneous chemical equilibrium during nozzle expansion. Chamber parameters were varied to understand what dimensions drive chamber C* and Isp. Contraction Ratio is the ratio of the nozzle throat area to the area of the chamber. L is the length of the chamber. Characteristic chamber length, L*, is the length that the chamber would be if it were a straight tube and had no converging nozzle. Goal: Develop a qualitative and quantitative correlation for performance parameters - Specific Impulse (Isp) and Characteristic Velocity (C*) - as a function of one or more chamber dimensions - Contraction Ratio (CR), Chamber Length (L) and/or Characteristic Chamber Length (L*). Determine if chamber dimensions can be correlated to frozen or equilibrium chemistry.

  7. Analysis for complete genomic sequence of HLA-B and HLA-C alleles in the Chinese Han population.

    PubMed

    Zhu, F; He, Y; Zhang, W; He, J; He, J; Xu, X; Lv, H; Yan, L

    2011-08-01

    In the present study, we determined the complete genomic sequence and analysed the intron polymorphism of some HLA-B and HLA-C alleles in the Chinese Han population. DNA fragments of over 3.0 kb from the HLA-B and HLA-C loci, spanning from part of the 5' untranslated region to the 3' noncoding region, were amplified by polymerase chain reaction, and the amplified products were then sequenced. Full-length nucleotide sequences of 14 HLA-B alleles and 10 HLA-C alleles were obtained and have been submitted to GenBank and the IMGT/HLA database. Two novel alleles, HLA-B*52:01:01:02 and HLA-B*59:01:01:02, were identified, and the complete genomic sequence of HLA-B*52:01:01:01 was reported for the first time. In total, 157 and 167 polymorphic positions were found in the full-length genomic sequences of the HLA-B and HLA-C loci, respectively. Our results suggest that many single nucleotide polymorphisms exist in the exon and intron regions, and the data can provide useful information for understanding the evolution of HLA-B and HLA-C alleles. © 2011 Blackwell Publishing Ltd.

  8. Maximum-likelihood soft-decision decoding of block codes using the A* algorithm

    NASA Technical Reports Server (NTRS)

    Ekroot, L.; Dolinar, S.

    1994-01-01

    The A* algorithm finds the path in a finite depth binary tree that optimizes a function. Here, it is applied to maximum-likelihood soft-decision decoding of block codes where the function optimized over the codewords is the likelihood function of the received sequence given each codeword. The algorithm considers codewords one bit at a time, making use of the most reliable received symbols first and pursuing only the partially expanded codewords that might be maximally likely. A version of the A* algorithm for maximum-likelihood decoding of block codes has been implemented for block codes up to 64 bits in length. The efficiency of this algorithm makes simulations of codes up to length 64 feasible. This article details the implementation currently in use, compares the decoding complexity with that of exhaustive search and Viterbi decoding algorithms, and presents performance curves obtained with this implementation of the A* algorithm for several codes.

  9. [A quality controllable algorithm for ECG compression based on wavelet transform and ROI coding].

    PubMed

    Zhao, An; Wu, Baoming

    2006-12-01

    This paper presents an ECG compression algorithm based on wavelet transform and region of interest (ROI) coding. The algorithm realizes near-lossless coding in the ROI and quality-controllable lossy coding outside the ROI. After mean removal of the original signal, a multi-layer orthogonal discrete wavelet transform is performed. Simultaneously, feature extraction is performed on the original signal to find the position of the ROI. The coefficients related to the ROI are treated as important coefficients and kept. For the remaining coefficients, the energy loss in the transform domain is calculated according to the target PRDBE (Percentage Root-mean-square Difference with Baseline Eliminated), and the threshold for the coefficients outside the ROI is then determined according to this energy loss. The important coefficients, which include the coefficients of the ROI and the coefficients outside the ROI that are larger than the threshold, are put into a linear quantizer. The map, which records the positions of the important coefficients in the original wavelet coefficient vector, is compressed with a run-length encoder. Huffman coding is applied to improve the compression ratio. ECG signals taken from the MIT/BIH arrhythmia database were tested, and satisfactory results in terms of clinical information preservation, quality and compression ratio were obtained.
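
    The position map described above lends itself to a short illustration. The sketch below marks coefficients as important if they belong to the ROI or exceed a threshold, then run-length encodes the resulting binary map. It is a minimal sketch under stated assumptions: the coefficient values, the threshold, the ROI index and all function names are hypothetical, and the PRDBE-based threshold selection, the quantizer and the Huffman stage from the paper are not reproduced.

```python
def significance_map(coeffs, threshold, roi_indices=()):
    """Binary map: 1 where a coefficient is in the ROI or exceeds the threshold."""
    roi = set(roi_indices)
    return [1 if (i in roi or abs(c) > threshold) else 0
            for i, c in enumerate(coeffs)]


def run_length_encode(bits):
    """Encode a sequence as (value, run_length) pairs."""
    runs = []
    for b in bits:
        if runs and runs[-1][0] == b:
            runs[-1][1] += 1
        else:
            runs.append([b, 1])
    return [(value, count) for value, count in runs]


def run_length_decode(runs):
    """Invert run_length_encode."""
    out = []
    for value, count in runs:
        out.extend([value] * count)
    return out


# Hypothetical wavelet coefficients; index 6 is assumed to lie inside the ROI.
coeffs = [0.1, -0.05, 2.3, 0.0, 0.02, -1.7, 0.3, 0.01]
bitmap = significance_map(coeffs, threshold=0.5, roi_indices=(6,))
runs = run_length_encode(bitmap)
print(bitmap)
print(runs)
```

    Decoding the runs recovers the map exactly, so this stage is lossless; only the thresholding and quantization steps discard information.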

  10. A new Monte Carlo code for light transport in biological tissue.

    PubMed

    Torres-García, Eugenio; Oros-Pantoja, Rigoberto; Aranda-Lara, Liliana; Vieyra-Reyes, Patricia

    2018-04-01

    The aim of this work was to develop an event-by-event Monte Carlo code for light transport (called MCLTmx) to identify and quantify ballistic, diffuse, and absorbed photons, as well as their interaction coordinates inside the biological tissue. The mean free path length was computed between two interactions for scattering or absorption processes, and if necessary scatter angles were calculated, until the photon disappeared or left the region of interest. A three-layer array (air-tissue-air) was used, forming a semi-infinite sandwich. The light source was placed at (0,0,0), emitting towards (0,0,1). The input data were: refractive indices, target thickness (0.02, 0.05, 0.1, 0.5, and 1 cm), number of particle histories, and λ, from which the code calculated the anisotropy, scattering, and absorption coefficients. Validation shows differences of less than 0.1% compared with values reported in the literature. The MCLTmx code discriminates between ballistic and diffuse photons, and inside biological tissue it calculates specular reflection, diffuse reflection, ballistic transmission, diffuse transmission and absorption, with all parameters dependent on wavelength and thickness. The MCLTmx code can be useful for light transport inside any medium by changing the parameters that describe the new medium: anisotropy, dispersion and attenuation coefficients, and refractive indices for a specific wavelength.
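
    The free-path step at the heart of such event-by-event codes can be sketched briefly. The example below is not MCLTmx; it is a minimal sketch that samples exponential free paths for a total attenuation coefficient `mu_t` and estimates the ballistic (uninteracted) fraction through a slab, which should approach the Beer-Lambert value exp(-mu_t * d). The coefficient value, function names and seed are assumptions, and refractive-index mismatch (specular reflection) and scattering angles are deliberately ignored.

```python
import math
import random


def sample_free_path(mu_t, rng):
    """Sample a distance to the next interaction from an exponential law."""
    # 1 - rng.random() lies in (0, 1], so the logarithm is always defined.
    return -math.log(1.0 - rng.random()) / mu_t


def ballistic_fraction(mu_t, thickness, n_photons, seed=0):
    """Fraction of photons whose first free path exceeds the slab thickness."""
    rng = random.Random(seed)
    passed = sum(sample_free_path(mu_t, rng) > thickness
                 for _ in range(n_photons))
    return passed / n_photons


mu_t = 10.0      # total attenuation coefficient in 1/cm (hypothetical value)
thickness = 0.1  # slab thickness in cm (one of the thicknesses in the abstract)
estimate = ballistic_fraction(mu_t, thickness, n_photons=200_000)
expected = math.exp(-mu_t * thickness)  # Beer-Lambert prediction
print(round(estimate, 3), round(expected, 3))
```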

  11. Performance of an Axisymmetric Rocket Based Combined Cycle Engine During Rocket Only Operation Using Linear Regression Analysis

    NASA Technical Reports Server (NTRS)

    Smith, Timothy D.; Steffen, Christopher J., Jr.; Yungster, Shaye; Keller, Dennis J.

    1998-01-01

    The all rocket mode of operation is shown to be a critical factor in the overall performance of a rocket based combined cycle (RBCC) vehicle. An axisymmetric RBCC engine was used to determine specific impulse efficiency values based upon both full flow and gas generator configurations. Design of experiments methodology was used to construct a test matrix and multiple linear regression analysis was used to build parametric models. The main parameters investigated in this study were: rocket chamber pressure, rocket exit area ratio, injected secondary flow, mixer-ejector inlet area, mixer-ejector area ratio, and mixer-ejector length-to-inlet diameter ratio. A perfect gas computational fluid dynamics analysis, using both the Spalart-Allmaras and k-omega turbulence models, was performed with the NPARC code to obtain values of vacuum specific impulse. Results from the multiple linear regression analysis showed that for both the full flow and gas generator configurations increasing mixer-ejector area ratio and rocket area ratio increase performance, while increasing mixer-ejector inlet area ratio and mixer-ejector length-to-diameter ratio decrease performance. Increasing injected secondary flow increased performance for the gas generator analysis, but was not statistically significant for the full flow analysis. Chamber pressure was found to be not statistically significant.

  12. Recuperator construction for a gas turbine engine

    DOEpatents

    Kang, Yungmo; McKeirnan, Jr., Robert D.

    2006-12-12

    A counter-flow recuperator formed from annular arrays of recuperator core segments. The recuperator core segments are formed from two opposing sheets of fin fold material coined to form a primary surface zone disposed between two flattened manifold zones. Each primary surface zone has undulating corrugations including a uniform, full height central portion and a transition zone disposed between the central portion and one of the manifold zones. Corrugations of the transition zone rise from zero adjacent to the manifold zone and increase along a transition length to full crest height at the central portion. The transition lengths increase in a direction away from an inner edge containing the air inlet so as to equalize air flow to the distal regions of the primary surface zone.

  13. Woot, an Active Gypsy-Class Retrotransposon in the Flour Beetle, Tribolium Castaneum, Is Associated with a Recent Mutation

    PubMed Central

    Beeman, R. W.; Thomson, M. S.; Clark, J. M.; DeCamillis, M. A.; Brown, S. J.; Denell, R. E.

    1996-01-01

    A recently isolated, lethal mutation of the homeotic Abdominal gene of the red flour beetle Tribolium castaneum is associated with an insertion of a novel retrotransposon into an intron. Sequence analysis indicates that this retrotransposon, named Woot, is a member of the gypsy family of mobile elements. Most strains of T. castaneum appear to harbor ~25-35 copies of Woot per genome. Woot is composed of long terminal repeats of unprecedented length (3.6 kb each), flanking an internal coding region 5.0 kb in length. For most copies of Woot, the internal region includes two open reading frames (ORFs) that correspond to the gag and pol genes of previously described retrotransposons and retroviruses. The copy of Woot inserted into Abdominal bears an apparent single frameshift mutation that separates the normal second ORF into two. Woot does not appear to generate infectious virions by the criterion that no envelope gene is discernible. The association of Woot with a recent mutation suggests that this retroelement is currently transpositionally active in at least some strains. PMID:8722793

  14. Application of the RNS3D Code to a Circular-Rectangular Transition Duct With and Without Inlet Swirl and Comparison with Experiments

    NASA Technical Reports Server (NTRS)

    Cavicchi, Richard H.

    1999-01-01

    Circular-rectangular transition ducts are used between engine exhausts and nozzles with rectangular cross sections that are designed for high performance aircraft. NASA Glenn Research Center has made experimental investigations of a series of circular-rectangular transition ducts to provide benchmark flow data for comparison with numerical calculations. These ducts are all designed with superellipse cross sections to facilitate grid generation. In response to this challenge, the three-dimensional RNS3D code has been applied to one of these transition ducts. This particular duct has a length-to-inlet diameter ratio of 1.5 and an exit-plane aspect ratio of 3.0. The inlet Mach number is 0.35. Two GRC experiments and the code were run for this duct without inlet swirl. One GRC experiment and the code were also run with inlet swirl. With no inlet swirl the code was successful in predicting pressures and secondary flow conditions, including a pair of counter-rotating vortices at both sidewalls of the exit plane. All these phenomena have been reported from the two GRC experiments. However, these vortices were suppressed in the one experiment when inlet swirl was used; whereas the RNS3D code still predicted them. The experiment was unable to provide data near the sidewalls, the very region where the vortices were predicted.

  15. The 5΄ UTR of the type I toxin ZorO can both inhibit and enhance translation

    PubMed Central

    Wen, Jia; Harp, John R.

    2017-01-01

    Many bacterial type I toxin mRNAs possess a long 5΄ untranslated region (UTR) that serves as the target site of the corresponding antitoxin sRNA. This is the case for the zorO-orzO type I system where the OrzO antitoxin base pairs to the 174-nucleotide zorO 5΄ UTR. Here, we demonstrate that the full-length 5΄ UTR of the zorO type I toxin hinders its own translation independent of the sRNA whereas a processed 5΄ UTR (zorO Δ28) promotes translation. The full-length zorO 5΄ UTR folds into an extensive secondary structure sequestering the ribosome binding site (RBS). Processing of the 5΄ UTR does not alter the RBS structure, but opens a large region (EAP region) located upstream of the RBS. Truncation of this EAP region impairs zorO translation, but this defect can be rescued upon exposing the RBS. Additionally, the region spanning +35 to +50 of the zorO mRNA is needed for optimal translation of zorO. Importantly, the positive and negative effects on translation imparted by the 5΄ UTR can be transferred onto a reporter gene, indicating that the 5΄ UTR alone can drive regulation. Moreover, we show that the OrzO sRNA can inhibit zorO translation via base pairing to the EAP region. PMID:27903909

  16. RNA structural constraints in the evolution of the influenza A virus genome NP segment

    PubMed Central

    Gultyaev, Alexander P; Tsyganov-Bodounov, Anton; Spronken, Monique IJ; van der Kooij, Sander; Fouchier, Ron AM; Olsthoorn, René CL

    2014-01-01

    Conserved RNA secondary structures were predicted in the nucleoprotein (NP) segment of the influenza A virus genome using comparative sequence and structure analysis. A number of structural elements exhibiting nucleotide covariations were identified over the whole segment length, including protein-coding regions. Calculations of mutual information values at the paired nucleotide positions demonstrate that these structures impose considerable constraints on the virus genome evolution. Functional importance of a pseudoknot structure, predicted in the NP packaging signal region, was confirmed by plaque assays of the mutant viruses with disrupted structure and those with restored folding using compensatory substitutions. Possible functions of the conserved RNA folding patterns in the influenza A virus genome are discussed. PMID:25180940

  17. DNA octaplex formation with an I-motif of water-mediated A-quartets: reinterpretation of the crystal structure of d(GCGAAAGC).

    PubMed

    Sato, Yoshiteru; Mitomi, Kenta; Sunami, Tomoko; Kondo, Jiro; Takénaka, Akio

    2006-12-01

    The crystal structure of the tetragonal form of d(gcGAAAgc) has been revised and reasonably refined including the disordered residues. The two DNA strands form a base-intercalated duplex, and the four duplexes are assembled according to the crystallographic 222 symmetry to form an octaplex. In the central region, the eight strands are associated by I-motif of double A-quartets. Furthermore, eight hydrated-magnesium cations link the four duplexes to support the octaplex formation. Based on these structural features, a proposal that folding of d(GAAA)n, found in the non-coding region of genomes, into an octaplex can induce slippage during replication to facilitate length polymorphism is presented.

  18. The complete mitochondrial genome sequence of Malus hupehensis var. pinyiensis.

    PubMed

    Duan, Naibin; Sun, Honghe; Wang, Nan; Fei, Zhangjun; Chen, Xuesen

    2016-07-01

    The complete mitochondrial genome sequence of Malus hupehensis var. pinyiensis, a widely used apple rootstock, was determined using the Illumina high-throughput sequencing approach. The genome is 422,555 bp in length and has a GC content of 45.21%. It is separated by a pair of inverted repeats of 32,504 bp, to form a large single copy region of 213,055 bp and a small single copy region of 144,492 bp. The genome contains 38 protein-coding genes, four pseudogenes, 25 tRNA genes, and three rRNA genes. The genome is 25,608 bp longer than that of M. domestica, and several structural variations between these two mitogenomes were detected.
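
    The region sizes quoted in the abstract above can be cross-checked with simple arithmetic. The sketch below is only an illustration: the constant and function names are hypothetical, and it assumes the genome is made up of exactly one large single copy region, one small single copy region, and two copies of the inverted repeat, as the abstract describes.

```python
# Figures quoted in the abstract, in base pairs.
TOTAL_LENGTH = 422_555
INVERTED_REPEAT = 32_504      # length of one copy; the abstract reports a pair
LARGE_SINGLE_COPY = 213_055
SMALL_SINGLE_COPY = 144_492
EXCESS_OVER_M_DOMESTICA = 25_608


def region_sum(lsc, ssc, ir, ir_copies=2):
    """Add up the single copy regions and all inverted repeat copies."""
    return lsc + ssc + ir_copies * ir


reconstructed = region_sum(LARGE_SINGLE_COPY, SMALL_SINGLE_COPY, INVERTED_REPEAT)

# Length of the M. domestica mitogenome implied by the abstract's difference
# (derived from the quoted numbers, not stated in the abstract itself).
implied_m_domestica = TOTAL_LENGTH - EXCESS_OVER_M_DOMESTICA

print(reconstructed == TOTAL_LENGTH)  # the region sizes add up to the total
print(implied_m_domestica)
```

    Under these assumptions the four regions account for the full reported length, so the quoted figures are internally consistent.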

  19. ISOLATION OF THE REGULATORY REGIONS AND GENOMIC ORGANIZATION OF THE PORCINE α1,3-GALACTOSYLTRANSFERASE GENE

    PubMed Central

    Koike, Chihiro; Friday, Robert P.; Nakashima, Izumi; Luppi, Patrizia; Fung, John J.; Rao, Abdul S.; Starzl, Thomas E.; Trucco, Massimo

    2010-01-01

    Background: α1,3-galactosyltransferase (α1,3GT) is an enzyme that produces carbohydrate chains termed αGal epitopes found in most mammals, although some species of higher primates, including human, are notable exceptions. The evolutionary origin of the lost α1,3GT enzyme activity is not yet known, although it has been suggested that the promoter activity of this gene in the ancestors of higher primates was inactivated. Methods: We used 5′- or 3′-RACE, GenomeWalking, reverse transcriptase polymerase chain reaction (RT-PCR) and dual Luciferase reporter assay for identification of the full-length cDNA, which includes the transcription initiation site and the promoter region of porcine α1,3GT gene. Results: The region around exon 1 is guanine and cytosine (GC)-rich (about 70%), comprising a CpG island spanning more than 1.5 kbp. The 5′-flanking region of exon 1 contains multiple transcription factor consensus motifs, including GC-box, SP1, AP2, and GATA-box sites, in the absence of TATA or CAAT-box sequences. The entire gene consists of three 5′ noncoding and six coding region exons spanning more than 52 kbp. Detailed analysis of α1,3GT transcripts revealed two major alternative splicing patterns in the 5′-untranslated region (5′-UTR) and evidence for minor splicing activity that occurs in a tissue-specific manner. Interspecies comparison of 5′-UTR shows minimal homology between porcine and murine sequences except for exon 2, which suggests that the regulatory regions differ among species. Conclusions: These observations have important implications for experiments involving genetic manipulation of the α1,3GT gene in transgenic animals in terms of promoter utilization, and particularly in genetically engineering cells for the animal cloning technology by nuclear transfer. PMID:11087141

  20. The Monte Carlo photoionization and moving-mesh radiation hydrodynamics code CMACIONIZE

    NASA Astrophysics Data System (ADS)

    Vandenbroucke, B.; Wood, K.

    2018-04-01

    We present the public Monte Carlo photoionization and moving-mesh radiation hydrodynamics code CMACIONIZE, which can be used to simulate the self-consistent evolution of HII regions surrounding young O and B stars, or other sources of ionizing radiation. The code combines a Monte Carlo photoionization algorithm that uses a complex mix of hydrogen, helium and several coolants in order to self-consistently solve for the ionization and temperature balance at any given time, with a standard first order hydrodynamics scheme. The code can be run as a post-processing tool to get the line emission from an existing simulation snapshot, but can also be used to run full radiation hydrodynamical simulations. Both the radiation transfer and the hydrodynamics are implemented in a general way that is independent of the grid structure that is used to discretize the system, allowing it to be run both as a standard fixed grid code and as a moving-mesh code.

  1. A generalized weight-based particle-in-cell simulation scheme

    NASA Astrophysics Data System (ADS)

    Lee, W. W.; Jenkins, T. G.; Ethier, S.

    2011-03-01

    A generalized weight-based particle simulation scheme suitable for simulating magnetized plasmas, where the zeroth-order inhomogeneity is important, is presented. The scheme is an extension of the perturbative simulation schemes developed earlier for particle-in-cell (PIC) simulations. The new scheme is designed to simulate both the perturbed distribution ( δf) and the full distribution (full- F) within the same code. The development is based on the concept of multiscale expansion, which separates the scale lengths of the background inhomogeneity from those associated with the perturbed distributions. The potential advantage for such an arrangement is to minimize the particle noise by using δf in the linear stage of the simulation, while retaining the flexibility of a full- F capability in the fully nonlinear stage of the development when signals associated with plasma turbulence are at a much higher level than those from the intrinsic particle noise.

  2. Memory-efficient table look-up optimized algorithm for context-based adaptive variable length decoding in H.264/advanced video coding

    NASA Astrophysics Data System (ADS)

    Wang, Jianhua; Cheng, Lianglun; Wang, Tao; Peng, Xiaodong

    2016-03-01

    Table look-up plays a very important role in the decoding process of context-based adaptive variable length decoding (CAVLD) in H.264/advanced video coding (AVC). However, frequent table look-up results in heavy table memory access, which in turn leads to high table power consumption. To reduce the heavy table memory access of current methods, and thereby the high power consumption, a memory-efficient table look-up optimized algorithm is presented for CAVLD. The contribution of this paper is that index search technology is introduced to reduce memory access for table look-up, and thereby table power consumption. Specifically, in our schemes, we use index search technology to reduce memory access by reducing the searching and matching operations for code_word, taking advantage of the internal relationship among the length of zero in code_prefix, the value of code_suffix and code_lengh, thus saving the power consumption of table look-up. The experimental results show that our proposed table look-up algorithm based on index search can lower memory access consumption by about 60% compared with table look-up by a sequential search scheme, and thus save considerable power for CAVLD in H.264/AVC.

  3. Mobile and embedded fast high resolution image stitching for long length rectangular monochromatic objects with periodic structure

    NASA Astrophysics Data System (ADS)

    Limonova, Elena; Tropin, Daniil; Savelyev, Boris; Mamay, Igor; Nikolaev, Dmitry

    2018-04-01

    In this paper we describe a stitching protocol that makes it possible to obtain high resolution images of long monochromatic objects with periodic structure. This protocol can be used for long documents or human-made objects in satellite images of uninhabitable regions such as the Arctic. The length of such objects can reach notable values, while modern camera sensors have limited resolution and cannot provide a good enough image of the whole object for further processing, e.g. for use in an OCR system. The idea of the proposed method is to acquire a video stream containing the full object in high resolution and use image stitching. We expect the scanned object to have straight boundaries and periodic structure, which allows us to introduce regularization into the stitching problem and adapt the algorithm to the limited computational power of mobile and embedded CPUs. With the help of the detected boundaries and structure we estimate the homography between frames and use this information to reduce the complexity of stitching. We demonstrate our algorithm on a mobile device and show an image processing speed of 2 fps on a Samsung Exynos 5422 processor.

  4. Complete mitochondrial genome sequence from an endangered Indian snake, Python molurus molurus (Serpentes, Pythonidae).

    PubMed

    Dubey, Bhawna; Meganathan, P R; Haque, Ikramul

    2012-07-01

    This paper reports the complete mitochondrial genome sequence of an endangered Indian snake, Python molurus molurus (Indian Rock Python). A typical snake mitochondrial (mt) genome of 17258 bp length comprising 37 genes, including the 13 protein coding genes, 22 tRNA genes, and 2 ribosomal RNA genes, along with duplicate control regions is described herein. The P. molurus molurus mt. genome is relatively similar to other snake mt. genomes with respect to gene arrangement, composition, tRNA structures and skews of AT/GC bases. The nucleotide composition of the genome shows that there are more A-C % than T-G % on the positive strand as revealed by positive AT and CG skews. Comparison of individual protein coding genes with other snake genomes suggests that ATP8 and NADH3 genes have high divergence rates. Codon usage analysis reveals a preference of NNC codons over NNG codons in the mt. genome of P. molurus. Also, the synonymous and non-synonymous substitution rates (ka/ks) suggest that most of the protein coding genes are under purifying selection pressure. The phylogenetic analyses involving the concatenated 13 protein coding genes of P. molurus molurus conformed to the previously established snake phylogeny.

  5. [Malama project in the Region of Murcia (Spain): environment and breastfeeding].

    PubMed

    Ortega García, J A; Pastor Torres, E; Martínez Lorente, I; Bosch Giménez, V; Quesada López, J J; Hernández Ramón, F; Alcaráz Quiñonero, M; Llamas del Castillo, M M; Torres Cantero, A M; García de León González, R; Sánchez Solís de Querol, M

    2008-05-01

    To identify protective factors and risk factors for the initiation and length of breastfeeding and full breastfeeding, in the Region of Murcia (Spain). The Malama study (Medio Ambiente y Lactancia Materna) is a follow up study from birth up to years of 1,000 mother-child pairs. A description of breastfeeding practices are presented here, the survival curve of breastfeeding and a Cox regression model of the pilot study that includes 101 mother-child pairs and 6 months of follow-up. After six months the prevalence of breastfeeding was 35 %. The mean duration of full breastfeeding was 63 days (median 45 days) with six months prevalence of 8 %. Hazard ratios (HR) for full breastfeeding were, to be a smoker (1.89; 95 % CI: 1.18-3.02), older than 35 years of age (2.04; 95 % CI: 1.22-3.42), caesarean birth (1.63; 95 % CI: 1.00-2.66). As well as those previously mentioned risks for breastfeeding, there were also hazard ratios for primary school education or less (1.63; 95 % CI: 0.98-2.82); to have breastfed an earlier child for at least 16 weeks (0.33; 95 % CI: 0.13-0.79), and to be the first birth (0.50; 95 % CI: 0.27-0.95). The length of both breastfeeding and full breastfeeding increased with the length of the maternal leave (0.96; 95 % CI: 0.94-0.99). Pregestational occupational exposure to endocrine disruptors did not seem to interfere with the duration of breastfeeding. In order to improve quality and duration of breastfeeding programmes, paediatric research and training on breastfeeding practice should be encouraged, to reduce unnecessary caesarean sections, promote tobacco cessation, focus human and economic resources to women with less education, and include legal mechanisms to ensure longer maternal leave.

  6. Adaptive variable-length coding for efficient compression of spacecraft television data.

    NASA Technical Reports Server (NTRS)

    Rice, R. F.; Plaunt, J. R.

    1971-01-01

    An adaptive variable length coding system is presented. Although developed primarily for the proposed Grand Tour missions, many features of this system clearly indicate a much wider applicability. Using sample to sample prediction, the coding system produces output rates within 0.25 bit/picture element (pixel) of the one-dimensional difference entropy for entropy values ranging from 0 to 8 bit/pixel. This is accomplished without the necessity of storing any code words. Performance improvements of 0.5 bit/pixel can be simply achieved by utilizing previous line correlation. A Basic Compressor, using concatenated codes, adapts to rapid changes in source statistics by automatically selecting one of three codes to use for each block of 21 pixels. The system adapts to less frequent, but more dramatic, changes in source statistics by adjusting the mode in which the Basic Compressor operates on a line-to-line basis. Furthermore, the compression system is independent of the quantization requirements of the pulse-code modulation system.
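
    The benchmark used in the abstract above, the one-dimensional difference entropy under sample-to-sample prediction, can be illustrated with a short calculation. This is a minimal sketch under stated assumptions, not the coding system itself: the function name is hypothetical, and the predictor is assumed to be simply the previous sample on the same line.

```python
import math
from collections import Counter


def difference_entropy(samples):
    """First-order entropy, in bits per sample, of sample-to-sample differences.

    Each sample is predicted by its predecessor, so only the differences
    between neighbouring samples are considered.
    """
    diffs = [b - a for a, b in zip(samples, samples[1:])]
    if not diffs:
        raise ValueError("need at least two samples")
    total = len(diffs)
    counts = Counter(diffs)
    # Shannon entropy of the empirical distribution of the differences.
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


# A smooth ramp has a single difference value, so its difference entropy is 0.
ramp_entropy = difference_entropy([10, 11, 12, 13, 14])

# Alternating samples give two equally likely differences (+1 and -1): 1 bit.
alternating_entropy = difference_entropy([0, 1, 0, 1, 0])
```

    A coder that meets the figure quoted in the abstract would produce an output rate within 0.25 bit/pixel of this quantity for the data being compressed.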

  7. Long-read sequencing of the coffee bean transcriptome reveals the diversity of full-length transcripts

    PubMed Central

    Cheng, Bing; Furtado, Agnelo

    2017-01-01

    Polyploidization contributes to the complexity of gene expression, resulting in numerous related but different transcripts. This study explored the transcriptome diversity and complexity of the tetraploid Arabica coffee (Coffea arabica) bean. Long-read sequencing (LRS) by Pacbio Isoform sequencing (Iso-seq) was used to obtain full-length transcripts without the difficulty and uncertainty of assembly required for reads from short-read technologies. The tetraploid transcriptome was annotated and compared with data from the sub-genome progenitors. Caffeine and sucrose genes were targeted for case analysis. An isoform-level tetraploid coffee bean reference transcriptome with 95 995 distinct transcripts (average 3236 bp) was obtained. A total of 88 715 sequences (92.42%) were annotated with BLASTx against NCBI non-redundant plant proteins, including 34 719 high-quality annotations. Further BLASTn analysis against NCBI non-redundant nucleotide sequences, Coffea canephora coding sequences with UTR, C. arabica ESTs, and Rfam resulted in 1213 sequences without hits, which are potential novel genes in coffee. Longer UTRs were captured, especially in the 5΄UTRs, facilitating the identification of upstream open reading frames. The LRS also revealed more and longer transcript variants in key caffeine and sucrose metabolism genes from this polyploid genome. Long sequences (>10 kilo base) were poorly annotated. LRS technology shows the limitation of previous studies. It provides an important tool to produce a reference transcriptome including more of the diversity of full-length transcripts to help understand the biology and support the genetic improvement of polyploid species such as coffee. PMID:29048540

  8. The Laborers-AGC Construction Skills Training Program. Final Performance Report.

    ERIC Educational Resources Information Center

    Tippie, John L.; Rice, Eric

    Patterned after a previously successful Laborers-Associated General Contractors model named the Construction Skills Training Program, a demonstration project was implemented at five regional training centers. At least eight courses were created, combined, or revised. Four full-length audiovisual support pieces were completed. Three courses were…

  9. Performance Analysis of Hybrid ARQ Protocols in a Slotted Code Division Multiple-Access Network

    DTIC Science & Technology

    1989-08-01

    Convolutional Codes . in Proc Int. Conf. Commun., 21.4.1-21.4.5, 1987. [27] J. Hagenauer. Rate Compatible Punctured Convolutional Codes . in Proc Int. Conf...achieved by using a low rate (r = 0.5), high constraint length (e.g., 32) punctured convolutional code . Code puncturing provides for a variable rate code ...investigated the use of convolutional codes in Type II Hybrid ARQ protocols. The error

  10. The contribution of 700,000 ORF sequence tags to the definition of the human transcriptome

    PubMed Central

    Camargo, Anamaria A.; Samaia, Helena P. B.; Dias-Neto, Emmanuel; Simão, Daniel F.; Migotto, Italo A.; Briones, Marcelo R. S.; Costa, Fernando F.; Aparecida Nagai, Maria; Verjovski-Almeida, Sergio; Zago, Marco A.; Andrade, Luis Eduardo C.; Carrer, Helaine; El-Dorry, Hamza F. A.; Espreafico, Enilza M.; Habr-Gama, Angelita; Giannella-Neto, Daniel; Goldman, Gustavo H.; Gruber, Arthur; Hackel, Christine; Kimura, Edna T.; Maciel, Rui M. B.; Marie, Suely K. N.; Martins, Elizabeth A. L.; Nóbrega, Marina P.; Paçó-Larson, Maria Luisa; Pardini, Maria Inês M. C.; Pereira, Gonçalo G.; Pesquero, João Bosco; Rodrigues, Vanderlei; Rogatto, Silvia R.; da Silva, Ismael D. C. G.; Sogayar, Mari C.; Sonati, Maria de Fátima; Tajara, Eloiza H.; Valentini, Sandro R.; Alberto, Fernando L.; Amaral, Maria Elisabete J.; Aneas, Ivy; Arnaldi, Liliane A. T.; de Assis, Angela M.; Bengtson, Mário Henrique; Bergamo, Nadia Aparecida; Bombonato, Vanessa; de Camargo, Maria E. R.; Canevari, Renata A.; Carraro, Dirce M.; Cerutti, Janete M.; Corrêa, Maria Lucia C.; Corrêa, Rosana F. R.; Costa, Maria Cristina R.; Curcio, Cyntia; Hokama, Paula O. M.; Ferreira, Ari J. S.; Furuzawa, Gilberto K.; Gushiken, Tsieko; Ho, Paulo L.; Kimura, Elza; Krieger, José E.; Leite, Luciana C. C.; Majumder, Paromita; Marins, Mozart; Marques, Everaldo R.; Melo, Analy S. A.; Melo, Monica; Mestriner, Carlos Alberto; Miracca, Elisabete C.; Miranda, Daniela C.; Nascimento, Ana Lucia T. O.; Nóbrega, Francisco G.; Ojopi, Élida P. B.; Pandolfi, José Rodrigo C.; Pessoa, Luciana G.; Prevedel, Aline C.; Rahal, Paula; Rainho, Claudia A.; Reis, Eduardo M. R.; Ribeiro, Marcelo L.; da Rós, Nancy; de Sá, Renata G.; Sales, Magaly M.; Sant'anna, Simone Cristina; dos Santos, Mariana L.; da Silva, Aline M.; da Silva, Neusa P.; Silva, Wilson A.; da Silveira, Rosana A.; Sousa, Josane F.; Stecconi, Daniella; Tsukumo, Fernando; Valente, Valéria; Soares, Fernando; Moreira, Eloisa S.; Nunes, Diana N.; Correa, Ricardo G.; Zalcberg, Heloisa; Carvalho, Alex F.; Reis, Luis F. L.; Brentani, Ricardo R.; Simpson, Andrew J. G.; de Souza, Sandro J.

    2001-01-01

    Open reading frame expressed sequences tags (ORESTES) differ from conventional ESTs by providing sequence data from the central protein coding portion of transcripts. We generated a total of 696,745 ORESTES sequences from 24 human tissues and used a subset of the data that correspond to a set of 15,095 full-length mRNAs as a means of assessing the efficiency of the strategy and its potential contribution to the definition of the human transcriptome. We estimate that ORESTES sampled over 80% of all highly and moderately expressed, and between 40% and 50% of rarely expressed, human genes. In our most thoroughly sequenced tissue, the breast, the 130,000 ORESTES generated are derived from transcripts from an estimated 70% of all genes expressed in that tissue, with an equally efficient representation of both highly and poorly expressed genes. In this respect, we find that the capacity of the ORESTES strategy both for gene discovery and shotgun transcript sequence generation significantly exceeds that of conventional ESTs. The distribution of ORESTES is such that many human transcripts are now represented by a scaffold of partial sequences distributed along the length of each gene product. The experimental joining of the scaffold components, by reverse transcription–PCR, represents a direct route to transcript finishing that may represent a useful alternative to full-length cDNA cloning. PMID:11593022

  11. The contribution of 700,000 ORF sequence tags to the definition of the human transcriptome.

    PubMed

    Camargo, A A; Samaia, H P; Dias-Neto, E; Simão, D F; Migotto, I A; Briones, M R; Costa, F F; Nagai, M A; Verjovski-Almeida, S; Zago, M A; Andrade, L E; Carrer, H; El-Dorry, H F; Espreafico, E M; Habr-Gama, A; Giannella-Neto, D; Goldman, G H; Gruber, A; Hackel, C; Kimura, E T; Maciel, R M; Marie, S K; Martins, E A; Nobrega, M P; Paco-Larson, M L; Pardini, M I; Pereira, G G; Pesquero, J B; Rodrigues, V; Rogatto, S R; da Silva, I D; Sogayar, M C; Sonati, M F; Tajara, E H; Valentini, S R; Alberto, F L; Amaral, M E; Aneas, I; Arnaldi, L A; de Assis, A M; Bengtson, M H; Bergamo, N A; Bombonato, V; de Camargo, M E; Canevari, R A; Carraro, D M; Cerutti, J M; Correa, M L; Correa, R F; Costa, M C; Curcio, C; Hokama, P O; Ferreira, A J; Furuzawa, G K; Gushiken, T; Ho, P L; Kimura, E; Krieger, J E; Leite, L C; Majumder, P; Marins, M; Marques, E R; Melo, A S; Melo, M B; Mestriner, C A; Miracca, E C; Miranda, D C; Nascimento, A L; Nobrega, F G; Ojopi, E P; Pandolfi, J R; Pessoa, L G; Prevedel, A C; Rahal, P; Rainho, C A; Reis, E M; Ribeiro, M L; da Ros, N; de Sa, R G; Sales, M M; Sant'anna, S C; dos Santos, M L; da Silva, A M; da Silva, N P; Silva, W A; da Silveira, R A; Sousa, J F; Stecconi, D; Tsukumo, F; Valente, V; Soares, F; Moreira, E S; Nunes, D N; Correa, R G; Zalcberg, H; Carvalho, A F; Reis, L F; Brentani, R R; Simpson, A J; de Souza, S J; Melo, M

    2001-10-09

    Open reading frame expressed sequences tags (ORESTES) differ from conventional ESTs by providing sequence data from the central protein coding portion of transcripts. We generated a total of 696,745 ORESTES sequences from 24 human tissues and used a subset of the data that correspond to a set of 15,095 full-length mRNAs as a means of assessing the efficiency of the strategy and its potential contribution to the definition of the human transcriptome. We estimate that ORESTES sampled over 80% of all highly and moderately expressed, and between 40% and 50% of rarely expressed, human genes. In our most thoroughly sequenced tissue, the breast, the 130,000 ORESTES generated are derived from transcripts from an estimated 70% of all genes expressed in that tissue, with an equally efficient representation of both highly and poorly expressed genes. In this respect, we find that the capacity of the ORESTES strategy both for gene discovery and shotgun transcript sequence generation significantly exceeds that of conventional ESTs. The distribution of ORESTES is such that many human transcripts are now represented by a scaffold of partial sequences distributed along the length of each gene product. The experimental joining of the scaffold components, by reverse transcription-PCR, represents a direct route to transcript finishing that may represent a useful alternative to full-length cDNA cloning.

  12. CMacIonize: Monte Carlo photoionisation and moving-mesh radiation hydrodynamics

    NASA Astrophysics Data System (ADS)

    Vandenbroucke, Bert; Wood, Kenneth

    2018-02-01

    CMacIonize simulates the self-consistent evolution of HII regions surrounding young O and B stars, or other sources of ionizing radiation. The code combines a Monte Carlo photoionization algorithm that uses a complex mix of hydrogen, helium and several coolants in order to self-consistently solve for the ionization and temperature balance at any given time, with a standard first order hydrodynamics scheme. The code can be run as a post-processing tool to get the line emission from an existing simulation snapshot, but can also be used to run full radiation hydrodynamical simulations. Both the radiation transfer and the hydrodynamics are implemented in a general way that is independent of the grid structure that is used to discretize the system, allowing it to be run both as a standard fixed grid code and also as a moving-mesh code.

  13. Recent advances in coding theory for near error-free communications

    NASA Technical Reports Server (NTRS)

    Cheung, K.-M.; Deutsch, L. J.; Dolinar, S. J.; Mceliece, R. J.; Pollara, F.; Shahshahani, M.; Swanson, L.

    1991-01-01

    Channel and source coding theories are discussed. The following subject areas are covered: large constraint length convolutional codes (the Galileo code); decoder design (the big Viterbi decoder); Voyager's and Galileo's data compression scheme; current research in data compression for images; neural networks for soft decoding; neural networks for source decoding; finite-state codes; and fractals for data compression.

  14. Augmented burst-error correction for UNICON laser memory. [digital memory

    NASA Technical Reports Server (NTRS)

    Lim, R. S.

    1974-01-01

    A single-burst-error correction system is described for data stored in the UNICON laser memory. In the proposed system, a long fire code with code length n greater than 16,768 bits was used as an outer code to augment an existing inner shorter fire code for burst error corrections. The inner fire code is a (80,64) code shortened from the (630,614) code, and it is used to correct a single-burst-error on a per-word basis with burst length b less than or equal to 6. The outer code, with b less than or equal to 12, would be used to correct a single-burst-error on a per-page basis, where a page consists of 512 32-bit words. In the proposed system, the encoding and error detection processes are implemented by hardware. A minicomputer, currently used as a UNICON memory management processor, is used on a time-demanding basis for error correction. Based upon existing error statistics, this combination of an inner code and an outer code would enable the UNICON system to obtain a very low error rate in spite of flaws affecting the recorded data.

  15. Multicast Routing of Hierarchical Data

    NASA Technical Reports Server (NTRS)

    Shacham, Nachum

    1992-01-01

    The issue of multicast of broadband, real-time data in a heterogeneous environment, in which the data recipients differ in their reception abilities, is considered. Traditional multicast schemes, which are designed to deliver all the source data to all recipients, offer limited performance in such an environment, since they must either force the source to overcompress its signal or restrict the destination population to those who can receive the full signal. We present an approach for resolving this issue by combining hierarchical source coding techniques, which allow recipients to trade off reception bandwidth for signal quality, and sophisticated routing algorithms that deliver to each destination the maximum possible signal quality. The field of hierarchical coding is briefly surveyed and new multicast routing algorithms are presented. The algorithms are compared in terms of network utilization efficiency, lengths of paths, and the required mechanisms for forwarding packets on the resulting paths.

  16. Modified signed-digit arithmetic based on redundant bit representation.

    PubMed

    Huang, H; Itoh, M; Yatagai, T

    1994-09-10

    Fully parallel modified signed-digit arithmetic operations are realized based on redundant bit representation of the digits proposed. A new truth-table minimizing technique is presented based on redundant-bit-representation coding. It is shown that only 34 minterms are enough for implementing one-step modified signed-digit addition and subtraction with this new representation. Two optical implementation schemes, correlation and matrix multiplication, are described. Experimental demonstrations of the correlation architecture are presented. Both architectures use fixed minterm masks for arbitrary-length operands, taking full advantage of the parallelism of the modified signed-digit number system and optics.

  17. Implementation of a tree algorithm in MCNP code for nuclear well logging applications.

    PubMed

    Li, Fusheng; Han, Xiaogang

    2012-07-01

    The goal of this paper is to develop some modeling capabilities that are missing in the current MCNP code. These capabilities can greatly help with certain nuclear tool designs, such as a nuclear lithology/mineralogy spectroscopy tool. The new capabilities developed in this paper include the following: zone tally, neutron interaction tally, gamma ray index tally and enhanced pulse-height tally. The patched MCNP code can also be used to compute neutron slowing-down length and thermal neutron diffusion length. Copyright © 2011 Elsevier Ltd. All rights reserved.

  18. The influence of viral coding sequences on pestivirus IRES activity reveals further parallels with translation initiation in prokaryotes.

    PubMed Central

    Fletcher, Simon P; Ali, Iraj K; Kaminski, Ann; Digard, Paul; Jackson, Richard J

    2002-01-01

    Classical swine fever virus (CSFV) is a member of the pestivirus family, which shares many features in common with hepatitis C virus (HCV). It is shown here that CSFV has an exceptionally efficient cis-acting internal ribosome entry segment (IRES), which, like that of HCV, is strongly influenced by the sequences immediately downstream of the initiation codon, and is optimal with viral coding sequences in this position. Constructs that retained 17 or more codons of viral coding sequence exhibited full IRES activity, but with only 12 codons, activity was approximately 66% of maximum in vitro (though close to maximum in transfected BHK cells), whereas with just 3 codons or fewer, the activity was only approximately 15% of maximum. The minimal coding region elements required for high activity were exchanged between HCV and CSFV. Although maximum activity was observed in each case with the homologous combination of coding region and 5' UTR, the heterologous combinations were sufficiently active to rule out a highly specific functional interplay between the 5' UTR and coding sequences. On the other hand, inversion of the coding sequences resulted in low IRES activity, particularly with the HCV coding sequences. RNA structure probing showed that the efficiency of internal initiation of these chimeric constructs correlated most closely with the degree of single-strandedness of the region around and immediately downstream of the initiation codon. The low activity IRESs could not be rescued by addition of supplementary eIF4A (the initiation factor with ATP-dependent RNA helicase activity). The extreme sensitivity to secondary structure around the initiation codon is likely to be due to the fact that the eIF4F complex (which has eIF4A as one of its subunits) is not required for and does not participate in initiation on these IRESs. PMID:12515388

  19. Numerical investigation of over expanded flow behavior in a single expansion ramp nozzle

    NASA Astrophysics Data System (ADS)

    Mousavi, Seyed Mahmood; Pourabidi, Reza; Goshtasbi-Rad, Ebrahim

    2018-05-01

    The single expansion ramp nozzle is severely over-expanded when the vehicle is at low speed, which hinders its ability to provide optimal configurations for combined cycle engines. The over-expansion leads to flow separation as a result of shock wave/boundary-layer interaction. Flow separation, and the presence of shocks themselves, result in a performance loss in the single expansion ramp nozzle, leading to reduced thrust and increased pressure losses. In the present work, the unsteady two-dimensional compressible flow in an over-expanded single expansion ramp nozzle has been investigated using a finite volume code. To this end, the Reynolds stress turbulence model and full multigrid initialization, together with Smirnov's method for examining error accumulation, have been employed, and the results are compared with available experimental data. The results show that the numerical code is capable of predicting the experimental data with high accuracy. Afterward, the effects of a discontinuous jump in wall temperature and of the length of the straight ramp on flow behavior have been studied. It is concluded that variations in wall temperature and in the length of the straight ramp change the shock wave/boundary-layer interaction, shock structure, and shock strength, as well as the distance between Lambda shocks.

  20. GFinisher: a new strategy to refine and finish bacterial genome assemblies

    NASA Astrophysics Data System (ADS)

    Guizelini, Dieval; Raittz, Roberto T.; Cruz, Leonardo M.; Souza, Emanuel M.; Steffens, Maria B. R.; Pedrosa, Fabio O.

    2016-10-01

    Despite developments in DNA sequencing technology that have improved the number and length of reads, the reconstruction of complete genome sequences, the so-called genome assembly, is still complex. Only 13% of the prokaryotic genome sequencing projects have been completed. Draft genome sequences deposited in public databases are fragmented into contigs and may lack the full gene complement. The aim of the present work is to identify assembly errors and improve the assembly process of bacterial genomes. The biological patterns observed in genomic sequences and the application of a priori information can allow the identification of misassembled regions, and the reorganization and improvement of the overall de novo genome assembly. GFinisher starts by generating a fuzzy GC skew graph for each contig in an assembly and then breaks the contigs at critical points in order to reassemble and close them using jFGap. This approach has been successfully applied to a dataset of 96 genome assemblies, decreasing the number of contigs by up to 86%. GFinisher can easily optimize assemblies of prokaryotic draft genomes and can be used to improve the assembly programs based on nucleotide sequence patterns in the genome. The software and source code are available at http://gfinisher.sourceforge.net/.
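
    As a rough illustration of the signal involved, the sketch below computes an ordinary windowed GC skew, (G - C) / (G + C), and flags window boundaries where its sign flips. It is not GFinisher's fuzzy GC skew method; the function names, the window size, and the sign-flip heuristic are assumptions made for illustration only.

```python
from itertools import accumulate


def gc_skew(seq, window=1000):
    """Return the GC skew (G - C) / (G + C) of each non-overlapping window."""
    seq = seq.upper()
    skews = []
    for start in range(0, len(seq), window):
        chunk = seq[start:start + window]
        g, c = chunk.count("G"), chunk.count("C")
        # Windows without any G or C get a neutral skew of 0.0.
        skews.append((g - c) / (g + c) if g + c else 0.0)
    return skews


def cumulative_skew(skews):
    """Running sum of the per-window skews."""
    return list(accumulate(skews))


def sign_changes(skews):
    """Indices of windows whose skew sign differs from the previous window."""
    return [i for i in range(1, len(skews)) if skews[i - 1] * skews[i] < 0]


if __name__ == "__main__":
    toy_contig = "GGGGCCCC"  # hypothetical toy sequence, not real data
    per_window = gc_skew(toy_contig, window=4)
    print(per_window, cumulative_skew(per_window), sign_changes(per_window))
```

    In a real contig, an abrupt sign flip away from the expected replication origin or terminus is only a hint that a region deserves inspection, not proof of a misassembly.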

  1. Cloning, tissue expression and polymorphisms of chicken Krüppel-like factor 7 gene.

    PubMed

    Zhang, Zhi-Wei; Wang, Zhi-Peng; Zhang, Kun; Wang, Ning; Li, Hui

    2013-07-01

    Krüppel-like factor 7 (KLF7) has been extensively studied in mammalian species, but its role in birds is still unclear. In the current study, cloning and sequencing showed that the full-length coding region of chicken KLF7 (Gallus gallus KLF7, gKLF7) was 891 bp long, encoding 296 amino acids. In addition, real-time RT-PCR analysis showed that gKLF7 was broadly expressed in all 15 chicken tissues selected, and its expression was significantly different in spleen, proventriculus, abdominal fat, brain, leg muscle, gizzard and heart between fat and lean broilers at 7 weeks of age. Additionally, one novel single nucleotide polymorphism (SNP), XM_426569.3: c. A141G, was identified in the second exon of gKLF7. Association analysis showed that this locus was significantly associated with fatness traits in Arbor Acres broiler random population and the eighth generation of Northeast Agricultural University broiler lines divergently selected for abdominal fat content (NEAUHLF) population (P < 0.05). These results suggest that gKLF7 might be a candidate gene for chicken fatness traits. © 2013 Japanese Society of Animal Science.
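
    The reported figures can be checked with simple codon arithmetic. The sketch below assumes that the 891 bp coding region includes the terminal stop codon, which the abstract does not state explicitly; the function name is hypothetical.

```python
def residues_from_cds(cds_length_bp, includes_stop=True):
    """Number of amino acids encoded by a coding region of the given length."""
    if cds_length_bp % 3 != 0:
        raise ValueError("a complete coding region must be a multiple of 3 bp")
    codons = cds_length_bp // 3
    # The stop codon occupies one codon but encodes no residue.
    return codons - 1 if includes_stop else codons


if __name__ == "__main__":
    # 891 bp / 3 = 297 codons; 297 - 1 stop codon = 296 residues.
    print(residues_from_cds(891))
```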

  2. A global assembly of cotton ESTs

    PubMed Central

    Udall, Joshua A.; Swanson, Jordan M.; Haller, Karl; Rapp, Ryan A.; Sparks, Michael E.; Hatfield, Jamie; Yu, Yeisoo; Wu, Yingru; Dowd, Caitriona; Arpat, Aladdin B.; Sickler, Brad A.; Wilkins, Thea A.; Guo, Jin Ying; Chen, Xiao Ya; Scheffler, Jodi; Taliercio, Earl; Turley, Ricky; McFadden, Helen; Payton, Paxton; Klueva, Natalya; Allen, Randell; Zhang, Deshui; Haigler, Candace; Wilkerson, Curtis; Suo, Jinfeng; Schulze, Stefan R.; Pierce, Margaret L.; Essenberg, Margaret; Kim, HyeRan; Llewellyn, Danny J.; Dennis, Elizabeth S.; Kudrna, David; Wing, Rod; Paterson, Andrew H.; Soderlund, Cari; Wendel, Jonathan F.

    2006-01-01

    Approximately 185,000 Gossypium EST sequences comprising >94,800,000 nucleotides were amassed from 30 cDNA libraries constructed from a variety of tissues and organs under a range of conditions, including drought stress and pathogen challenges. These libraries were derived from allopolyploid cotton (Gossypium hirsutum; AT and DT genomes) as well as its two diploid progenitors, Gossypium arboreum (A genome) and Gossypium raimondii (D genome). ESTs were assembled using the Program for Assembling and Viewing ESTs (PAVE), resulting in 22,030 contigs and 29,077 singletons (51,107 unigenes). Further comparisons among the singletons and contigs led to recognition of 33,665 exemplar sequences that represent a nonredundant set of putative Gossypium genes containing partial or full-length coding regions and usually one or two UTRs. The assembly, along with their UniProt BLASTX hits, GO annotation, and Pfam analysis results, are freely accessible as a public resource for cotton genomics. Because ESTs from diploid and allotetraploid Gossypium were combined in a single assembly, we were in many cases able to bioinformatically distinguish duplicated genes in allotetraploid cotton and assign them to either the A or D genome. The assembly and associated information provide a framework for future investigation of cotton functional and evolutionary genomics. PMID:16478941

  3. Molecular Characterization, Tissue Distribution and Expression, and Potential Antiviral Effects of TRIM32 in the Common Carp (Cyprinus carpio).

    PubMed

    Wang, Yeda; Li, Zeming; Lu, Yuanan; Hu, Guangfu; Lin, Li; Zeng, Lingbing; Zhou, Yong; Liu, Xueqin

    2016-10-09

    Tripartite motif-containing protein 32 (TRIM32) belongs to the tripartite motif (TRIM) family, which consists of a large number of proteins containing a RING (Really Interesting New Gene) domain, one or two B-box domains, and coiled coil motif followed by different C-terminal domains. The TRIM family is known to be implicated in multiple cellular functions, including antiviral activity. However, it is presently unknown whether TRIM32 of common carp (Cyprinus carpio) has the antiviral effect. In this study, the sequence, expression, and antiviral function of TRIM32 homolog from common carp were analyzed. The full-length coding sequence region of trim32 was cloned from common carp. The results showed that the expression of TRIM32 (mRNA) was highest in the brain, remained stably expressed during embryonic development, and significantly increased following spring viraemia of carp virus (SVCV) infection. Transient overexpression of TRIM32 in affected Epithelioma papulosum cyprinid cells led to significant decrease of SVCV production as compared to the control group. These results suggested a potentially important role of common carp TRIM32 in enhancing host immune response during SVCV infection both in vivo and in vitro.

  4. Micromechanical slit positioning system as a transmissive spatial light modulator

    NASA Astrophysics Data System (ADS)

    Riesenberg, Rainer

    2001-11-01

    Micro-slits have been prepared with a slit-width and a slit-length of 2 to 1000 micrometers. Linear and two-dimensional arrays up to 10 x 110 slits have been developed and completed with a piezo-actuator for shifting. This system is a so-called mechanical slit positioning system. The light is switched by simple one- or two-dimensional displacement of coded slit masks in a one- or two-layer architecture. The slit positioning system belongs to the transmissive class of MEMS-based spatial light modulators (SLM). It has fundamental advantages for optical contrast and also can be used in the full spectral region. Therefore transmissive versions of SLM should be a future solution. Instrument architectures based on the slit positioning system can increase the resolution by subpixel generation, the throughput by Hadamard transform mode, or select objects for multi-object spectroscopy. The linear slit positioning system was space qualified within an advanced micro-spectrometer. A NIR multi-object spectrometer for the Next Generation Space Telescope (NGST) is based on a field selector for selecting objects. The field selector is a SLM, which could be implemented by a slit positioning system.

  5. GFinisher: a new strategy to refine and finish bacterial genome assemblies.

    PubMed

    Guizelini, Dieval; Raittz, Roberto T; Cruz, Leonardo M; Souza, Emanuel M; Steffens, Maria B R; Pedrosa, Fabio O

    2016-10-10

    Despite developments in DNA sequencing technology that have improved the number and length of reads, the reconstruction of complete genome sequences, the so-called genome assembly, is still complex. Only 13% of the prokaryotic genome sequencing projects have been completed. Draft genome sequences deposited in public databases are fragmented into contigs and may lack the full gene complement. The aim of the present work is to identify assembly errors and improve the assembly process of bacterial genomes. The biological patterns observed in genomic sequences and the application of a priori information can allow the identification of misassembled regions, and the reorganization and improvement of the overall de novo genome assembly. GFinisher starts by generating a fuzzy GC skew graph for each contig in an assembly and then breaks the contigs at critical points in order to reassemble and close them using jFGap. This approach has been successfully applied to a dataset of 96 genome assemblies, decreasing the number of contigs by up to 86%. GFinisher can easily optimize assemblies of prokaryotic draft genomes and can be used to improve the assembly programs based on nucleotide sequence patterns in the genome. The software and source code are available at http://gfinisher.sourceforge.net/.

  6. Molecular Characterization, Tissue Distribution and Expression, and Potential Antiviral Effects of TRIM32 in the Common Carp (Cyprinus carpio)

    PubMed Central

    Wang, Yeda; Li, Zeming; Lu, Yuanan; Hu, Guangfu; Lin, Li; Zeng, Lingbing; Zhou, Yong; Liu, Xueqin

    2016-01-01

    Tripartite motif-containing protein 32 (TRIM32) belongs to the tripartite motif (TRIM) family, which consists of a large number of proteins containing a RING (Really Interesting New Gene) domain, one or two B-box domains, and coiled coil motif followed by different C-terminal domains. The TRIM family is known to be implicated in multiple cellular functions, including antiviral activity. However, it is presently unknown whether TRIM32 of common carp (Cyprinus carpio) has the antiviral effect. In this study, the sequence, expression, and antiviral function of TRIM32 homolog from common carp were analyzed. The full-length coding sequence region of trim32 was cloned from common carp. The results showed that the expression of TRIM32 (mRNA) was highest in the brain, remained stably expressed during embryonic development, and significantly increased following spring viraemia of carp virus (SVCV) infection. Transient overexpression of TRIM32 in affected Epithelioma papulosum cyprinid cells led to significant decrease of SVCV production as compared to the control group. These results suggested a potentially important role of common carp TRIM32 in enhancing host immune response during SVCV infection both in vivo and in vitro. PMID:27735853

  7. The comparative chloroplast genomic analysis of photosynthetic orchids and developing DNA markers to distinguish Phalaenopsis orchids.

    PubMed

    Jheng, Cheng-Fong; Chen, Tien-Chih; Lin, Jhong-Yi; Chen, Ting-Chieh; Wu, Wen-Luan; Chang, Ching-Chun

    2012-07-01

    The chloroplast genome of Phalaenopsis equestris was determined and compared to those of Phalaenopsis aphrodite and Oncidium Gower Ramsey in Orchidaceae. The chloroplast genome of P. equestris is 148,959 bp, and a pair of inverted repeats (25,846 bp) separates the genome into large single-copy (85,967 bp) and small single-copy (11,300 bp) regions. The genome encodes 109 genes, including 4 rRNA, 30 tRNA and 75 protein-coding genes, but loses four ndh genes (ndhA, E, F and H) and seven other ndh genes are pseudogenes. The rate of inter-species variation between the two moth orchids was 0.74% (1107 sites) for single nucleotide substitution and 0.24% for insertions (161 sites; 1388 bp) and deletions (189 sites; 1393 bp). The IR regions have a lower rate of nucleotide substitution (3.5-5.8-fold) and indels (4.3-7.1-fold) than single-copy regions. The intergenic spacers are the most divergent, and based on the length variation of the three intergenic spacers, 11 native Phalaenopsis orchids could be successfully distinguished. The coding genes, IR junction and RNA editing sites are relatively more conserved between the two moth orchids than between those of Phalaenopsis and Oncidium spp. Copyright © 2012 Elsevier Ireland Ltd. All rights reserved.
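
    The region sizes quoted above are consistent with the usual quadripartite layout of a chloroplast genome, in which the total length is the large single-copy region plus the small single-copy region plus two copies of the inverted repeat. A minimal check, with a hypothetical function name:

```python
def quadripartite_length(lsc_bp, ssc_bp, ir_bp):
    """Total length of a circular genome made of LSC + SSC + two IR copies."""
    return lsc_bp + ssc_bp + 2 * ir_bp


# Values taken from the abstract for P. equestris.
total = quadripartite_length(lsc_bp=85_967, ssc_bp=11_300, ir_bp=25_846)

if __name__ == "__main__":
    print(total)  # 85,967 + 11,300 + 2 * 25,846 = 148,959
```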

  8. Identification of Rubisco rbcL and rbcS in Camellia oleifera and their potential as molecular markers for selection of high tea oil cultivars.

    PubMed

    Chen, Yongzhong; Wang, Baoming; Chen, Jianjun; Wang, Xiangnan; Wang, Rui; Peng, Shaofeng; Chen, Longsheng; Ma, Li; Luo, Jian

    2015-01-01

    Tea oil derived from seeds of Camellia oleifera Abel. is high-quality edible oil in China. This study isolated full-length cDNAs of Rubisco subunits rbcL and rbcS from C. oleifera. The rbcL has 1,522 bp with a 1,425 bp coding region, encoding 475 amino acids; and the rbcS has 615 bp containing a 528 bp coding region, encoding 176 amino acids. The expression level of the two genes, designated as Co-rbcL and Co-rbcS, was determined in three C. oleifera cultivars: Hengchong 89, Xianglin 1, and Xianglin 14 whose annual oil yields were 546.9, 591.4, and 657.7 kg ha(-1), respectively. The Co-rbcL expression in 'Xianglin 14' was significantly higher than 'Xianglin 1', and 'Xianglin 1' was greater than 'Hengchong 89'. The expression levels of Co-rbcS in 'Xianglin 1' and 'Xianglin 14' were similar but were significantly greater than in 'Hengchong 89'. The net photosynthetic rate of 'Xianglin 14' was significantly higher than 'Xianglin 1', and 'Xianglin 1' was higher than 'Hengchong 89'. Pearson's correlation analysis showed that seed yields and oil yields were highly correlated with the expression level of Co-rbcL at P < 0.001 level; and the expression of Co-rbcS was correlated with oil yield at P < 0.01 level. Net photosynthetic rate was also correlated with oil yields and seed yields at P < 0.001 and P < 0.01 levels, respectively. Our results suggest that Co-rbcS and Co-rbcL in particular could potentially be molecular markers for early selection of high oil yield cultivars. In combination with the measurement of net photosynthetic rates, the early identification of potential high oil production cultivars would significantly shorten plant breeding time and increase breeding efficiency.

  9. Sequence characterization of cDNA sequence of encoding of an antimicrobial Peptide with no disulfide bridge from the Iranian mesobuthus eupeus venomous glands.

    PubMed

    Farajzadeh-Sheikh, Ahmad; Jolodar, Abbas; Ghaemmaghami, Shamsedin

    2013-01-01

    Scorpion venom glands produce antimicrobial peptides (AMPs) that can rapidly kill a broad range of microbes and have additional activities that affect the quality and effectiveness of innate responses and inflammation. In this study, we report the identification of a cDNA sequence encoding a cysteine-free antimicrobial peptide isolated from the venom glands of the Iranian scorpion Mesobuthus eupeus. Total RNA was extracted from the Iranian M. eupeus venom glands, and cDNA was synthesized using a modified oligo(dT). The cDNA was used as the template for a semi-nested RT-PCR technique. PCR products were used for direct nucleotide sequencing, and the results were compared with the GenBank database. A 213 bp cDNA fragment encoding the entire coding region of an antimicrobial toxin from the Iranian scorpion M. eupeus venom glands was isolated. The full-length coding region was 210 bp and contained an open reading frame of 70 amino acids with a predicted molecular mass of 7970.48 Da and a theoretical pI of 9.10. The open reading frame encodes a precursor of 70 amino acid residues, including a signal peptide of 23 residues, a propeptide of 7 residues, and a mature peptide of 34 residues with no disulfide bridge. The peptide has detectable sequence identity to the Lesser Asian Mesobuthus eupeus MeVAMP-2 (98%), MeVAMP-9 (60%) and several previously described AMPs from other scorpion venoms, including Mesobuthus martensii (94%) and Buthus occitanus israelis (82%). The secondary structure of the peptide consisted mainly of α-helical structure, which was generally conserved among previously reported scorpion counterparts. The phylogenetic analysis showed that the Iranian MeAMP-like toxin was similar but not identical to the venom antimicrobial peptides from the Lesser Asian scorpion Mesobuthus eupeus.

  10. Four different sublineages of highly pathogenic avian influenza H5N1 introduced in Hungary in 2006-2007.

    PubMed

    Szeleczky, Zsófia; Dán, Adám; Ursu, Krisztina; Ivanics, Eva; Kiss, István; Erdélyi, Károly; Belák, Sándor; Muller, Claude P; Brown, Ian H; Bálint, Adám

    2009-10-20

    Highly pathogenic avian influenza (HPAI) H5N1 viruses were introduced to Hungary during 2006-2007 in three separate waves. This study aimed at determining the full-length genomic coding regions of the index strains from these epizootics in order to: (i) understand the phylogenetic relationship to other European H5N1 isolates, (ii) elucidate the possible connection between the different outbreaks and (iii) determine the putative origin and way of introduction of the different virus variants. Molecular analysis of the HA gene of Hungarian HPAI isolates obtained from wild birds during the first introduction revealed two groups designated Hungarian1 (HUN1) and Hungarian2 (HUN2) within sublineage 2.2B and clade 2.2.1, respectively. Sequencing the whole coding region of the two index viruses A/mute swan/Hungary/3472/2006 and A/mute swan/4571/Hungary/2006 suggests the role of wild birds in the introduction of HUN1 and HUN2 viruses: the most similar isolates to HUN1 and HUN2 group were found in wild avian species in Croatia and Slovakia, respectively. The second introduction of HPAI H5N1 led to the largest epizootic in domestic waterfowl in Europe. The index strain of the epizootic A/goose/Hungary/14756/2006 clustered to sublineage 2.2.A1 forming the Hungarian3 (HUN3) group. A common ancestry of HUN3 isolates with Bavarian strains is suggested as the most likely scenario of origin. Hungarian4 (HUN4) viruses isolated from the third introduction clustered with isolate A/turkey/United Kingdom/750/2007 forming a sublineage 2.2.A2. The origin and way of introduction of HUN4 viruses is still obscure, thus further genetic, phylogenetic, ecological and epidemiological data are required in order to elucidate it.

  11. Phonological, visual, and semantic coding strategies and children's short-term picture memory span.

    PubMed

    Henry, Lucy A; Messer, David; Luger-Klein, Scarlett; Crane, Laura

    2012-01-01

    Three experiments addressed controversies in the previous literature on the development of phonological and other forms of short-term memory coding in children, using assessments of picture memory span that ruled out potentially confounding effects of verbal input and output. Picture materials were varied in terms of phonological similarity, visual similarity, semantic similarity, and word length. Older children (6/8-year-olds), but not younger children (4/5-year-olds), demonstrated robust and consistent phonological similarity and word length effects, indicating that they were using phonological coding strategies. This confirmed findings initially reported by Conrad (1971), but subsequently questioned by other authors. However, in contrast to some previous research, little evidence was found for a distinct visual coding stage at 4 years, casting doubt on assumptions that this is a developmental stage that consistently precedes phonological coding. There was some evidence for a dual visual and phonological coding stage prior to exclusive use of phonological coding at around 5-6 years. Evidence for semantic similarity effects was limited, suggesting that semantic coding is not a key method by which young children recall lists of pictures.

  12. Wild-Type Measles Viruses with Non-Standard Genome Lengths

    PubMed Central

    Bankamp, Bettina; Liu, Chunyu; Rivailler, Pierre; Bera, Jayati; Shrivastava, Susmita; Kirkness, Ewen F.; Bellini, William J.; Rota, Paul A.

    2014-01-01

    The length of the single stranded, negative sense RNA genome of measles virus (MeV) is highly conserved at 15,894 nucleotides (nt). MeVs can be grouped into 24 genotypes based on the highly variable 450 nucleotides coding for the carboxyl-terminus of the nucleocapsid protein (N-450). Here, we report the genomic sequences of 2 wild-type viral isolates of genotype D4 with genome lengths of 15,900 nt. Both genomes had a 7 nt insertion in the 3′ untranslated region (UTR) of the matrix (M) gene and a 1 nt deletion in the 5′ UTR of the fusion (F) gene. The net gain of 6 nt complies with the rule-of-six required for replication competency of the genomes of morbilliviruses. The insertions and deletion (indels) were confirmed in a patient sample that was the source of one of the viral isolates. The positions of the indels were identical in both viral isolates, even though epidemiological data and the 3 nt differences in N-450 between the two genomes suggested that the viruses represented separate chains of transmission. Identical indels were found in the M-F intergenic regions of 14 additional genotype D4 viral isolates that were imported into the US during 2007–2010. Viral isolates with and without indels produced plaques of similar size and replicated efficiently in A549/hSLAM and Vero/hSLAM cells. This is the first report of wild-type MeVs with genome lengths other than 15,894 nt and demonstrates that the length of the M-F UTR of wild-type MeVs is flexible. PMID:24748123
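
    The rule-of-six arithmetic in this abstract can be verified directly: a genome length is compliant when it is an exact multiple of six nucleotides. The helper name below is hypothetical; the numbers come from the abstract.

```python
def obeys_rule_of_six(genome_length_nt):
    """True when the genome length is an exact multiple of 6 nucleotides."""
    return genome_length_nt % 6 == 0


standard_length = 15_894                 # conserved MeV genome length
variant_length = standard_length + 7 - 1  # 7 nt insertion, 1 nt deletion

if __name__ == "__main__":
    print(variant_length, obeys_rule_of_six(variant_length))
```

    A 7 nt insertion alone would give 15,901 nt, which is not a multiple of six, so the accompanying 1 nt deletion is what keeps the variant genomes compliant.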

  13. Genome-wide identification of aquaporin encoding genes in Brassica oleracea and their phylogenetic sequence comparison to Brassica crops and Arabidopsis

    PubMed Central

    Diehn, Till A.; Pommerrenig, Benjamin; Bernhardt, Nadine; Hartmann, Anja; Bienert, Gerd P.

    2015-01-01

    Aquaporins (AQPs) are essential channel proteins that regulate plant water homeostasis and the uptake and distribution of uncharged solutes such as metalloids, urea, ammonia, and carbon dioxide. Despite their importance as crop plants, little is known about AQP gene and protein function in cabbage (Brassica oleracea) and other Brassica species. The recent releases of the genome sequences of B. oleracea and Brassica rapa allow comparative genomic studies in these species to investigate the evolution and features of Brassica genes and proteins. In this study, we identified all AQP genes in B. oleracea by a genome-wide survey. In total, 67 genes of four plant AQP subfamilies were identified. Their full-length gene sequences and locations on chromosomes and scaffolds were manually curated. The identification of six additional full-length AQP sequences in the B. rapa genome added to the recently published AQP protein family of this species. A phylogenetic analysis of AQPs of Arabidopsis thaliana, B. oleracea, B. rapa allowed us to follow AQP evolution in closely related species and to systematically classify and (re-) name these isoforms. Thirty-three groups of AQP-orthologous genes were identified between B. oleracea and Arabidopsis and their expression was analyzed in different organs. The two selectivity filters, gene structure and coding sequences were highly conserved within each AQP subfamily while sequence variations in some introns and untranslated regions were frequent. These data suggest a similar substrate selectivity and function of Brassica AQPs compared to Arabidopsis orthologs. The comparative analyses of all AQP subfamilies in three Brassicaceae species give initial insights into AQP evolution in these taxa. Based on the genome-wide AQP identification in B. oleracea and the sequence analysis and reprocessing of Brassica AQP information, our dataset provides a sequence resource for further investigations of the physiological and molecular functions of Brassica crop AQPs. PMID:25904922

  14. Ideal form of optical plasma lenses

    NASA Astrophysics Data System (ADS)

    Gordon, D. F.; Stamm, A. B.; Hafizi, B.; Johnson, L. A.; Kaganovich, D.; Hubbard, R. F.; Richardson, A. S.; Zhigunov, D.

    2018-06-01

    The canonical form of an optical plasma lens is a parabolic density channel. This form suffers from spherical aberrations, among others. Spherical aberration is partially corrected by adding a quartic term to the radial density profile. Ideal forms which lead to perfect focusing or imaging are obtained. The fields at the focus of a strong lens are computed with high accuracy and efficiency using a combination of eikonal and full Maxwell descriptions of the radiation propagation. The calculations are performed using a new computer propagation code, SeaRay, which is designed to transition between various solution methods as the beam propagates through different spatial regions. The calculations produce the full Maxwell vector fields in the focal region.

  15. A review and analysis of boundary layer transition data for turbine application

    NASA Technical Reports Server (NTRS)

    Gaugler, R. E.

    1985-01-01

    A number of data sets from the open literature that include heat transfer data in apparently transitional boundary layers, with particular application to the turbine environment, were reviewed and analyzed to extract transition information. The data were analyzed by using a version of the STAN5 two-dimensional boundary layer code. The transition starting and ending points were determined by adjusting parameters in STAN5 until the calculations matched the data. The results are presented as a table of the deduced transition location and length as functions of the test parameters. The data sets reviewed cover a wide range of flow conditions, from low-speed, flat-plate tests to full-scale turbine airfoils operating at simulated turbine engine conditions. The results indicate that free-stream turbulence and pressure gradient have strong, and opposite, effects on the location of the start of transition and on the length of the transition zone.

  16. Convolutional coding combined with continuous phase modulation

    NASA Technical Reports Server (NTRS)

    Pizzi, S. V.; Wilson, S. G.

    1985-01-01

    Background theory and specific coding designs for combined coding/modulation schemes utilizing convolutional codes and continuous-phase modulation (CPM) are presented. In this paper the case of r = 1/2 coding onto a 4-ary CPM is emphasized, with short-constraint length codes presented for continuous-phase FSK, double-raised-cosine, and triple-raised-cosine modulation. Coding buys several decibels of coding gain over the Gaussian channel, with an attendant increase of bandwidth. Performance comparisons in the power-bandwidth tradeoff with other approaches are made.
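
    To make the r = 1/2 onto 4-ary mapping concrete, the sketch below encodes each input bit into two output bits and maps every output pair to one of four symbol indices. The generator polynomials (7 and 5 in octal, constraint length 3) are a common textbook choice and an assumption here, not the specific codes designed in the paper; the CPM waveform itself is not modeled.

```python
def conv_encode(bits, generators=(0b111, 0b101), constraint_length=3):
    """Rate-1/2 feedforward convolutional encoder; returns one bit pair per input bit."""
    state = 0  # shift register holding the most recent input bits
    mask = (1 << constraint_length) - 1
    pairs = []
    for bit in bits:
        state = ((state << 1) | bit) & mask
        # Each output bit is the parity of the register taps chosen by a generator.
        pairs.append(tuple(bin(state & g).count("1") % 2 for g in generators))
    return pairs


def to_quaternary(pairs):
    """Map each output bit pair (a, b) to a 4-ary symbol index 2a + b."""
    return [2 * a + b for a, b in pairs]


if __name__ == "__main__":
    coded = conv_encode([1, 0, 1, 1])
    print(coded, to_quaternary(coded))
```

    Because one 4-ary symbol carries the two coded bits produced per information bit, the symbol rate equals the information bit rate in this arrangement.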

  17. ChloroSSRdb: a repository of perfect and imperfect chloroplastic simple sequence repeats (cpSSRs) of green plants

    PubMed Central

    Kapil, Aditi; Rai, Piyush Kant; Shanker, Asheesh

    2014-01-01

    Simple sequence repeats (SSRs) are regions in DNA sequence that contain repeating motifs of length 1–6 nucleotides. These repeats are ubiquitously present and are found in both coding and non-coding regions of the genome. A total of 534 complete chloroplast genome sequences (as of 18 September 2014) of Viridiplantae are available at the NCBI organelle genome resource. This provides an opportunity to mine these genomes for the detection of SSRs and store them in the form of a database. In an attempt to properly manage and retrieve chloroplastic SSRs, we designed ChloroSSRdb, which is a relational database developed using SQL server 2008 and accessed through ASP.NET. It provides information on all three types (perfect, imperfect and compound) of SSRs. At present, ChloroSSRdb contains 124 430 mined SSRs, with the majority lying in non-coding regions. Out of these, PCR primers were designed for 118 249 SSRs. Tetranucleotide repeats (47 079) were found to be the most frequent repeat type, whereas hexanucleotide repeats (6414) were the least abundant. Additionally, in each species statistical analyses were performed to calculate relative frequency, correlation coefficient and chi-square statistics of perfect and imperfect SSRs. In accordance with the growing interest in SSR studies, ChloroSSRdb will prove to be a useful resource in developing genetic markers, phylogenetic analysis, genetic mapping, etc. Moreover, it will serve as a ready reference for mined SSRs in available chloroplast genomes of green plants. Database URL: www.compubio.in/chlorossrdb/ PMID:25380781
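
    A perfect SSR, as defined above, can be located with a backreference regular expression. The sketch below is not the mining pipeline behind ChloroSSRdb; the function name and the minimum repeat count are assumptions, and imperfect or compound SSRs are not handled.

```python
import re


def find_perfect_ssrs(seq, min_repeats=3, max_motif=6):
    """Return (start, motif, repeat_count) for each perfect tandem repeat found."""
    seq = seq.upper()
    # Lazy motif capture prefers the shortest repeating unit (e.g. AT, not ATAT).
    pattern = re.compile(r"([ACGT]{1,%d}?)\1{%d,}" % (max_motif, min_repeats - 1))
    hits = []
    for match in pattern.finditer(seq):
        motif = match.group(1)
        hits.append((match.start(), motif, len(match.group(0)) // len(motif)))
    return hits


if __name__ == "__main__":
    # Hypothetical toy sequence containing a dinucleotide repeat.
    print(find_perfect_ssrs("GGATATATATCC", min_repeats=3))
```

    Real SSR miners typically apply a different minimum repeat count per motif length, so the single threshold used here is a simplification.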

  18. ChloroSSRdb: a repository of perfect and imperfect chloroplastic simple sequence repeats (cpSSRs) of green plants.

    PubMed

    Kapil, Aditi; Rai, Piyush Kant; Shanker, Asheesh

    2014-01-01

    Simple sequence repeats (SSRs) are regions in DNA sequence that contain repeating motifs of length 1-6 nucleotides. These repeats are ubiquitously present and are found in both coding and non-coding regions of the genome. A total of 534 complete chloroplast genome sequences (as of 18 September 2014) of Viridiplantae are available at the NCBI organelle genome resource. This provides an opportunity to mine these genomes for the detection of SSRs and store them in the form of a database. In an attempt to properly manage and retrieve chloroplastic SSRs, we designed ChloroSSRdb, which is a relational database developed using SQL server 2008 and accessed through ASP.NET. It provides information on all three types (perfect, imperfect and compound) of SSRs. At present, ChloroSSRdb contains 124 430 mined SSRs, with the majority lying in non-coding regions. Out of these, PCR primers were designed for 118 249 SSRs. Tetranucleotide repeats (47 079) were found to be the most frequent repeat type, whereas hexanucleotide repeats (6414) were the least abundant. Additionally, in each species statistical analyses were performed to calculate relative frequency, correlation coefficient and chi-square statistics of perfect and imperfect SSRs. In accordance with the growing interest in SSR studies, ChloroSSRdb will prove to be a useful resource in developing genetic markers, phylogenetic analysis, genetic mapping, etc. Moreover, it will serve as a ready reference for mined SSRs in available chloroplast genomes of green plants. Database URL: www.compubio.in/chlorossrdb/ © The Author(s) 2014. Published by Oxford University Press.

  19. Review of particle-in-cell modeling for the extraction region of large negative hydrogen ion sources for fusion

    NASA Astrophysics Data System (ADS)

    Wünderlich, D.; Mochalskyy, S.; Montellano, I. M.; Revel, A.

    2018-05-01

    Particle-in-cell (PIC) codes have been used since the early 1960s to calculate self-consistently the motion of charged particles in plasmas, taking into account external electric and magnetic fields as well as the fields created by the particles themselves. Because of the very small time steps (on the order of the inverse plasma frequency) and mesh sizes used, the computational requirements can be very high, and they increase drastically with increasing plasma density and size of the calculation domain. Thus, small computational domains and/or reduced dimensionality are usually used. In recent years, the available central processing unit (CPU) power has increased strongly. Together with massive parallelization of the codes, this makes it possible to describe in 3D the extraction of charged particles from a plasma, using calculation domains with an edge length of several centimeters, consisting of one extraction aperture, the plasma in the direct vicinity of the aperture, and a part of the extraction system. Large negative hydrogen or deuterium ion sources are essential parts of the neutral beam injection (NBI) system in future fusion devices such as the international fusion experiment ITER and the demonstration reactor (DEMO). For ITER NBI, RF-driven sources with a source area of 0.9 × 1.9 m² and 1280 extraction apertures will be used. The extraction of negative ions is accompanied by the co-extraction of electrons, which are deflected onto an electron dump. Typically, the maximum extracted negative ion current is limited by the amount and the temporal instability of the co-extracted electrons, especially for operation in deuterium. Different PIC codes are available for the extraction region of large driven negative ion sources for fusion. Additionally, some effort is ongoing to develop codes that describe the plasma of the whole ion source in a simplified manner (coarser mesh or reduced dimensionality).
    The presentation first gives a brief overview of the current status of ion source development for ITER NBI and of the PIC method. Different PIC codes for the extraction region are introduced, as well as the coupling to codes describing the whole source (PIC codes or fluid codes). Different physical and numerical aspects of applying PIC codes to negative hydrogen ion sources for fusion are presented and discussed, together with selected code results. The main focus of future calculations will be meniscus formation and identifying measures for reducing the co-extracted electrons, in particular for deuterium operation. Recent results of the 3D PIC code ONIX (calculation domain: one extraction aperture and its vicinity) for the ITER prototype source (1/8 the size of the ITER NBI source) are presented.
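
    The scaling argument in this abstract (time steps on the order of the inverse plasma frequency, cost rising with density) can be made concrete with a short Python sketch using the standard textbook expressions for the electron plasma frequency and the Debye length. The density and temperature in the example are illustrative assumptions, not values from the paper, and the function names are hypothetical. The Debye length is included because it is a common yardstick for PIC mesh size, which the abstract itself does not state.

```python
import math

# Physical constants in SI units (CODATA values).
E_CHARGE = 1.602176634e-19   # elementary charge, C
EPS0 = 8.8541878128e-12      # vacuum permittivity, F/m
M_E = 9.1093837015e-31       # electron mass, kg


def plasma_frequency(n_e):
    """Electron plasma frequency in rad/s for a density n_e in m^-3."""
    return math.sqrt(n_e * E_CHARGE**2 / (EPS0 * M_E))


def debye_length(n_e, t_e_ev):
    """Electron Debye length in m for density n_e (m^-3) and temperature in eV."""
    return math.sqrt(EPS0 * t_e_ev * E_CHARGE / (n_e * E_CHARGE**2))


if __name__ == "__main__":
    n_e = 1e17    # assumed electron density, m^-3 (illustrative only)
    t_e = 2.0     # assumed electron temperature, eV (illustrative only)
    omega = plasma_frequency(n_e)
    print("inverse plasma frequency [s]:", 1.0 / omega)
    print("Debye length [m]:", debye_length(n_e, t_e))
    # Quadrupling the density doubles the plasma frequency, which halves
    # both the inverse plasma frequency and the Debye length.
    print("frequency ratio at 4x density:", plasma_frequency(4 * n_e) / omega)
```

    Under these assumptions, a higher density shrinks both the admissible time step and the length scale to be resolved, which is consistent with the abstract's remark that requirements grow drastically with plasma density and domain size.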

  20. Bacterial Polysaccharide Co-Polymerases Share a Common Framework for Control of Polymer Length

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Tocilj, A.; Munger, C.; Proteau, A.

    2008-01-01

    The chain length distribution of complex polysaccharides present on the bacterial surface is determined by polysaccharide co-polymerases (PCPs) anchored in the inner membrane. We report crystal structures of the periplasmic domains of three PCPs that impart substantially different chain length distributions to surface polysaccharides. Despite very low sequence similarities, they have a common protomer structure with a long central alpha-helix extending 100 Angstroms into the periplasm. The protomers self-assemble into bell-shaped oligomers of variable sizes, with a large internal cavity. Electron microscopy shows that one of the full-length PCPs has an organization similar to that observed in the crystal for its periplasmic domain alone. Functional studies suggest that the top of the PCP oligomers is an important region for determining polysaccharide modal length. These structures provide a detailed view of components of the bacterial polysaccharide assembly machinery.
