Sample records for complete nucleotide sequences

  1. Complete nucleotide sequence of a monopartite Begomovirus and associated satellites infecting Carica papaya in Nepal.

    PubMed

    Shahid, M S; Yoshida, S; Khatri-Chhetri, G B; Briddon, R W; Natsuaki, K T

    2013-06-01

    Carica papaya (papaya) is a fruit crop that is cultivated mostly in kitchen gardens throughout Nepal. Leaf samples of C. papaya plants with leaf curling, vein darkening, vein thickening, and a reduction in leaf size were collected from a garden in Darai village, Rampur, Nepal in 2010. Full-length clones of a monopartite Begomovirus, a betasatellite and an alphasatellite were isolated. The complete nucleotide sequence of the Begomovirus showed the arrangement of genes typical of Old World begomoviruses with the highest nucleotide sequence identity (>99 %) to an isolate of Ageratum yellow vein virus (AYVV), confirming it as an isolate of AYVV. The complete nucleotide sequence of the betasatellite showed greater than 89 % nucleotide sequence identity to an isolate of Tomato leaf curl Java betasatellite originating from Indonesia. The sequence of the alphasatellite displayed 92 % nucleotide sequence identity to Sida yellow vein China alphasatellite. This is the first identification of these components in Nepal and the first time they have been identified in papaya.

  2. Characterization of apple stem grooving virus and apple chlorotic leaf spot virus identified in a crab apple tree.

    PubMed

    Li, Yongqiang; Deng, Congliang; Bian, Yong; Zhao, Xiaoli; Zhou, Qi

    2017-04-01

    Apple stem grooving virus (ASGV), apple chlorotic leaf spot virus (ACLSV), and prunus necrotic ringspot virus (PNRSV) were identified in a crab apple tree by small RNA deep sequencing. The complete genome sequence of ACLSV isolate BJ (ACLSV-BJ) was 7554 nucleotides and shared 67.0%-83.0% nucleotide sequence identity with other ACLSV isolates. A phylogenetic tree based on the complete genome sequence of all available ACLSV isolates showed that ACLSV-BJ clustered with the isolates SY01 from hawthorn, MO5 from apple, and JB, KMS and YH from pear. The complete nucleotide sequence of ASGV-BJ was 6509 nucleotides (nt) long and shared 78.2%-80.7% nucleotide sequence identity with other isolates. ASGV-BJ and the isolate ASGV_kfp clustered together in the phylogenetic tree as an independent clade. Recombination analysis showed that isolate ASGV-BJ was a naturally occurring recombinant.

  3. Complete nucleotide sequence of Alfalfa mosaic virus isolated from alfalfa (Medicago sativa L.) in Argentina.

    PubMed

    Trucco, Verónica; de Breuil, Soledad; Bejerman, Nicolás; Lenardon, Sergio; Giolitti, Fabián

    2014-06-01

    The complete nucleotide sequence of an Alfalfa mosaic virus (AMV) isolate infecting alfalfa (Medicago sativa L.) in Argentina, AMV-Arg, was determined. The virus genome has the typical organization described for AMV, and comprises 3,643, 2,593, and 2,038 nucleotides for RNA1, 2 and 3, respectively. The whole genome sequence and each encoding region were compared with those of four other isolates that have been completely sequenced from China, Italy, Spain and USA. The nucleotide identity percentages ranged from 95.9 to 99.1 % for the three RNAs and from 93.7 to 99 % for the protein 1 (P1), protein 2 (P2), movement protein and coat protein (CP) encoding regions, whereas the amino acid identity percentages of these proteins ranged from 93.4 to 99.5 %, the lowest value corresponding to P2. CP sequences of AMV-Arg were compared with those of 25 other available isolates, and a phylogenetic analysis based on the CP gene was carried out. The highest percentage of nucleotide sequence identity of the CP gene was 98.3 % with a Chinese isolate and 98.6 % at the amino acid level with four isolates, two from Italy, one from Brazil and the remaining one from China. The phylogenetic analysis showed that AMV-Arg is closely related to subgroup I of AMV isolates. To our knowledge, this is the first report of a complete nucleotide sequence of AMV from South America and the first worldwide report of a complete nucleotide sequence of AMV isolated from alfalfa as a natural host.

  4. Complete nucleotide sequence of a novel Hibiscus-infecting Cilevirus from Florida and its relationship with closely associated Cileviruses

    USDA-ARS's Scientific Manuscript database

    The complete nucleotide sequence of a recently discovered Florida (FL) isolate of Hibiscus-infecting Cilevirus (HiCV) was determined by Sanger sequencing. The movement- and coat-protein gene sequences of the HiCV-FL isolate are more divergent than other genes of the previously sequenced HiCV-HA (Ha...

  5. Complete nucleotide sequence and genome organization of a novel allexivirus from alfalfa (Medicago sativa)

    USDA-ARS's Scientific Manuscript database

    A new species of the family Alphaflexiviridae provisionally named Alfalfa virus S (AVS) was diagnosed in alfalfa samples originating from Sudan. A complete nucleotide sequence of the viral genome consisting of 8,349 nucleotides excluding the 3’ poly(A) tail was determined by Illumina NGS technology ...

  6. Detection of a divergent variant of grapevine virus F by next-generation sequencing.

    PubMed

    Molenaar, Nicholas; Burger, Johan T; Maree, Hans J

    2015-08-01

    The complete genome sequence of a South African isolate of grapevine virus F (GVF) is presented. It was first detected by metagenomic next-generation sequencing of field samples and validated through direct Sanger sequencing. The genome sequence of GVF isolate V5 consists of 7539 nucleotides and contains a poly(A) tail. It has a typical vitivirus genome arrangement that comprises five open reading frames (ORFs), which share only 88.96 % nucleotide sequence identity with the existing complete GVF genome sequence (JX105428).

  7. Hop stunt viroid: molecular cloning and nucleotide sequence of the complete cDNA copy.

    PubMed Central

    Ohno, T; Takamatsu, N; Meshi, T; Okada, Y

    1983-01-01

    The complete cDNA of hop stunt viroid (HSV) has been cloned by the method of Okayama and Berg (Mol. Cell. Biol. 2, 161-170 (1982)) and the complete nucleotide sequence has been established. The covalently closed circular single-stranded HSV RNA consists of 297 nucleotides. The secondary structure predicted for HSV contains 67% of its residues base-paired. The native HSV can possess an extended rod-like structure characteristic of viroids previously established. The central region of the native HSV has a similar structure to the conserved region found in all viroids sequenced so far except for avocado sunblotch viroid. The sequence homologous to the 5'-end of U1a RNA is also found in the sequence of HSV but not in the central conserved region. PMID:6312412

  8. The complete sequence of Cymbidium mosaic virus from Vanilla fragrans in Hainan, China.

    PubMed

    He, Zhen; Jiang, Dongmei; Liu, Aiqin; Sang, Liwei; Li, Wenfeng; Li, Shifang

    2011-06-01

    The complete nucleotide sequence of Cymbidium mosaic virus (CymMV) isolated from vanilla in Hainan province, China was determined for the first time. It comprised 6,224 nucleotides; sequence analysis suggested that the isolate we obtained was a member of the genus Potexvirus, and its sequence shared 86.67-96.61% identity with previously reported sequences. Phylogenetic analysis suggested that CymMV from Vanilla fragrans clustered into subgroup A and that the isolates in this subgroup displayed little regional difference.

  9. Extension of the COG and arCOG databases by amino acid and nucleotide sequences

    PubMed Central

    Meereis, Florian; Kaufmann, Michael

    2008-01-01

    Background: The current versions of the COG and arCOG databases, both excellent frameworks for studies in comparative and functional genomics, do not contain the nucleotide sequences corresponding to their protein or protein domain entries. Results: Using sequence information obtained from GenBank flat files covering the completely sequenced genomes of the COG and arCOG databases, we constructed NUCOCOG (nucleotide sequences containing COG databases) as an extended version including all nucleotide sequences and in addition the amino acid sequences originally utilized to construct the current COG and arCOG databases. We make available three comprehensive single XML files containing the complete databases including all sequence information. In addition, we provide a web interface as a utility suitable to browse the NUCOCOG database for sequence retrieval. The database is accessible at . Conclusion: NUCOCOG offers the possibility to analyze any sequence related property in the context of the COG and arCOG framework simply by using script languages such as PERL applied to a large but single XML document. PMID:19014535

  10. Sequencing and phylogenetic analysis of tobacco virus 2, a polerovirus from Nicotiana tabacum.

    PubMed

    Zhou, Benguo; Wang, Fang; Zhang, Xuesong; Zhang, Lina; Lin, Huafeng

    2017-07-01

    The complete genome sequence of a new virus, provisionally named tobacco virus 2 (TV2), was determined and identified from leaves of tobacco (Nicotiana tabacum) exhibiting leaf mosaic, yellowing, and deformity, in Anhui Province, China. The genome sequence of TV2 comprises 5,979 nucleotides, with 87% nucleotide sequence identity to potato leafroll virus (PLRV). Its genome organization is similar to that of PLRV, containing six open reading frames (ORFs) that potentially encode proteins with putative functions in cell-to-cell movement and suppression of RNA silencing. Phylogenetic analysis of the nucleotide sequence placed TV2 alongside members of the genus Polerovirus in the family Luteoviridae. To the best of our knowledge, this study is the first report of a complete genome sequence of a new polerovirus identified in tobacco.

  11. First Complete Genome Sequence of an Isolate of Tomato Mottle Mosaic Virus Infecting Plants of Solanum lycopersicum in South America.

    PubMed

    Nagai, Alice; Duarte, Lígia M L; Chaves, Alexandre L R; Alexandre, Maria A V; Ramos-González, Pedro L; Chabi-Jesus, Camila; Harakava, Ricardo; Dos Santos, Déborah Y A C

    2018-05-10

    The complete nucleotide sequence of an isolate of tomato mottle mosaic virus (ToMMV) was determined. The virus, originally isolated from symptomatic tomato plants found in a county near the city of São Paulo, Brazil, has a genome with 99% nucleotide sequence identity with ToMMV from Mexico, China, Spain, and the United States. Copyright © 2018 Nagai et al.

  12. Normalization of Complete Genome Characteristics: Application to Evolution from Primitive Organisms to Homo sapiens.

    PubMed

    Sorimachi, Kenji; Okayasu, Teiji; Ohhira, Shuji

    2015-04-01

    Normalized nucleotide and amino acid contents of complete genome sequences can be visualized as radar charts. The shapes of these charts depict the characteristics of an organism's genome. The normalized values calculated from the genome sequence theoretically exclude experimental errors. Further, because normalization is independent of both target size and kind, this procedure is applicable not only to single genes but also to whole genomes, which consist of a huge number of different genes. In this review, we discuss the applications of the normalization of the nucleotide and predicted amino acid contents of complete genomes to the investigation of genome structure and to evolutionary research from primitive organisms to Homo sapiens. Some of the results could never have been obtained from the analysis of individual nucleotide or amino acid sequences but were revealed only after the normalization of nucleotide and amino acid contents was applied to genome research. The discovery that genome structure was homogeneous was obtained only after normalization methods were applied to the nucleotide or predicted amino acid contents of genome sequences. Normalization procedures are also applicable to evolutionary research. Thus, normalization of the contents of whole genomes is a useful procedure that can help to characterize organisms.

  13. First report of Beet western yellows virus infecting Epiphyllum spp

    USDA-ARS's Scientific Manuscript database

    Beet western yellows virus (BWYV) was identified from an orchid cactus (Epiphyllum spp.) hybrid without obvious symptoms by high-throughput sequencing. The nearly complete genomic sequence of 5,458 nucleotides of the virus was determined. The isolate has the highest nucleotide sequence identity (93%)...

  14. Complete genomic sequence of Powassan virus: evaluation of genetic elements in tick-borne versus mosquito-borne flaviviruses.

    PubMed

    Mandl, C W; Holzmann, H; Kunz, C; Heinz, F X

    1993-05-01

    The complete nucleotide sequence of the positive-stranded RNA genome of the tick-borne flavivirus Powassan (10,839 nucleotides) was elucidated and the amino acid sequence of all viral proteins was derived. Based on this sequence as well as serological data, Powassan virus represents the most divergent member of the tick-borne serocomplex within the genus flaviviruses, family Flaviviridae. The primary nucleotide sequence and potential RNA secondary structures of the Powassan virus genome as well as the protein sequences and the reactivities of the virion with a panel of monoclonal antibodies were compared to other tick-borne and mosquito-borne flaviviruses. These analyses corroborated significant differences between tick-borne and mosquito-borne flaviviruses, but also emphasized structural elements that are conserved among both vector groups. The comparisons among tick-borne flaviviruses revealed conserved sequence elements that might represent important determinants of the tick-borne flavivirus phenotype.

  15. The complete nucleotide sequence of the glnALG operon of Escherichia coli K12.

    PubMed Central

    Miranda-Ríos, J; Sánchez-Pescador, R; Urdea, M; Covarrubias, A A

    1987-01-01

    The nucleotide sequence of the E. coli glnALG operon has been determined. The glnL (ntrB) and glnG (ntrC) genes present a high homology, at the nucleotide and amino acid levels, with the corresponding genes of Klebsiella pneumoniae. The predicted amino acid sequence for glutamine synthetase allowed us to locate some of the enzyme domains. The structure of this operon is discussed. PMID:2882477

  16. Porcine insulin receptor substrate 4 (IRS4) gene: cloning, polymorphism and association study

    USDA-ARS's Scientific Manuscript database

    Using PCR and IPCR techniques we obtained a 4498 bp nucleotide sequence FN424076 encompassing the complete coding sequence of the porcine IRS4 gene and its proximal promoter. The 1269-amino acid porcine protein deduced from the nucleotide sequence shares 92% identity with the human IRS4 and possesse...

  17. The complete nucleotide sequence and genome organization of a novel betaflexivirus infecting Citrullus lanatus.

    PubMed

    Xin, Min; Zhang, Peipei; Liu, Wenwen; Ren, Yingdang; Cao, Mengji; Wang, Xifeng

    2017-10-01

    The complete nucleotide sequence of a novel positive single-stranded (+ss) RNA virus, tentatively named watermelon virus A (WVA), was determined using a combination of three methods: RNA sequencing, small RNA sequencing, and Sanger sequencing. The full genome of WVA is comprised of 8,372 nucleotides (nt), excluding the poly (A) tail, and contains four open reading frames (ORFs). The largest ORF, ORF1 encodes a putative replication-associated polyprotein (RP) with three conserved domains. ORF2 and ORF4 encode a movement protein (MP) and coat protein (CP), respectively. The putative product encoded by ORF3, of an estimated molecular mass of 25 kDa, has no significant similarity with other proteins. Identity and phylogenetic analysis indicate that WVA is a new virus, closely related to members of the family Betaflexiviridae. However, the final taxonomic allocation of WVA within the family is yet to be determined.

  18. The complete nucleotide sequence of the barley yellow dwarf GPV isolate from China shows that it is a new member of the genus Polerovirus.

    PubMed

    Zhang, Wenwei; Cheng, Zhuomin; Xu, Lei; Wu, Maosen; Waterhouse, Peter; Zhou, Guanghe; Li, Shifang

    2009-01-01

    The complete nucleotide sequence of the ssRNA genome of a Chinese GPV isolate of barley yellow dwarf virus (BYDV) was determined. It comprised 5673 nucleotides, and the deduced genome organization resembled that of members of the genus Polerovirus. It was most closely related to cereal yellow dwarf virus-RPV (77% nt identity over the entire genome; coat protein amino acid identity 79%). The GPV isolate also differs in vector specificity from other BYDV strains. Biological properties, phylogenetic analyses and detailed sequence comparisons suggest that GPV should be considered a member of a new species within the genus, and the name Wheat yellow dwarf virus-GPV is proposed.

  19. The complete nucleotide sequence of RNA beta from the type strain of barley stripe mosaic virus.

    PubMed Central

    Gustafson, G; Armour, S L

    1986-01-01

    The complete nucleotide sequence of RNA beta from the type strain of barley stripe mosaic virus (BSMV) has been determined. The sequence is 3289 nucleotides in length and contains four open reading frames (ORFs) which code for proteins of Mr 22,147 (ORF1), Mr 58,098 (ORF2), Mr 17,378 (ORF3), and Mr 14,119 (ORF4). The predicted N-terminal amino acid sequence of the polypeptide encoded by the ORF nearest the 5'-end of the RNA (ORF1) is identical (after the initiator methionine) to the published N-terminal amino acid sequence of BSMV coat protein for 29 of the first 30 amino acids. ORF2 occupies the central portion of the coding region of RNA beta and ORF3 is located at the 3'-end. The ORF4 sequence overlaps the 3'-region of ORF2 and the 5'-region of ORF3 and differs in codon usage from the other three RNA beta ORFs. The coding region of RNA beta is followed by a poly(A) tract and a 238 nucleotide tRNA-like structure which are common to all three BSMV genomic RNAs. PMID:3754962

  20. Alfalfa virus S, a new species in the family Alphaflexiviridae

    USDA-ARS's Scientific Manuscript database

    A new species of the family Alphaflexiviridae provisionally named alfalfa virus S (AVS) was discovered in alfalfa samples originating from Sudan. A complete nucleotide sequence of the viral genome consisting of 8,349 nucleotides excluding the 3’ poly(A) tail was determined by high throughput sequenc...

  1. Complete Genome Sequence of Porcine Parvovirus 2 Recovered from Swine Sera

    PubMed Central

    Kluge, M.; Franco, A. C.; Giongo, A.; Valdez, F. P.; Saddi, T. M.; Brito, W. M. E. D.; Roehe, P. M.

    2016-01-01

    A complete genomic sequence of porcine parvovirus 2 (PPV-2) was detected by viral metagenome analysis on swine sera. A phylogenetic analysis of this genome reveals that it is highly similar to previously reported North American PPV-2 genomes. The complete PPV-2 sequence is 5,426 nucleotides long. PMID:26823583

  2. Complete genome sequence of ‘Candidatus Liberibacter africanus’

    USDA-ARS's Scientific Manuscript database

    The complete genome sequence of ‘Candidatus Liberibacter africanus’ (Laf), strain ptsapsy, was obtained by an Illumina HiSeq 2000. The Laf genome comprises 1,192,232 nucleotides, 34.5% GC content, 1,141 predicted coding sequences, 44 tRNAs, 3 complete copies of ribosomal RNA genes (16S, 23S and 5S) ...

  3. Detection and characterization of hepatitis A virus circulating in Egypt.

    PubMed

    Hamza, Hazem; Abd-Elshafy, Dina Nadeem; Fayed, Sayed A; Bahgat, Mahmoud Mohamed; El-Esnawy, Nagwa Abass; Abdel-Mobdy, Emam

    2017-07-01

    Hepatitis A virus (HAV) still poses a considerable problem worldwide. In the current study, hepatitis A virus was recovered from wastewater samples collected from three wastewater treatment plants over one year. Using RT-PCR, HAV was detected in 43 out of 68 samples (63.2%) representing both inlet and outlet. Eleven positive samples were subjected to sequencing targeting the VP1-2A junction region. Phylogenetic analysis revealed that all samples belonged to subgenotype IB with few substitutions at the amino acid level. The complete sequence of one isolate (HAV/Egy/BI-11/2015) showed that the similarity at the amino acid level was not reflected at the nucleotide level. However, the deduced amino acid sequence derived from the complete nucleotide sequence showed distinct substitutions in the 2B, 2C, and 3A regions. Recombination analysis revealed a recombination event between X75215 (subgenotype IA) and AF268396 (subgenotype IB) involving a portion of the 2B nonstructural protein coding region (nucleotides 3757-3868), suggesting that the sequence characterized here is an actual recombinant. Despite the role of recombination in picornavirus evolution, its involvement in HAV evolution has rarely been reported, and this may be due to the limited number of available complete HAV sequences. To our knowledge, this represents the first characterized complete sequence of an Egyptian isolate and the described recombination event provides an important update on the circulating HAV strains in Egypt.

  4. Molecular characterization of the virulent infectious hematopoietic necrosis virus (IHNV) strain 220-90

    PubMed Central

    2010-01-01

    Background: Infectious hematopoietic necrosis virus (IHNV) is the type species of the genus Novirhabdovirus, within the family Rhabdoviridae, infecting several species of wild and hatchery reared salmonids. Similar to other rhabdoviruses, IHNV has a linear single-stranded, negative-sense RNA genome of approximately 11,000 nucleotides. The IHNV genome encodes six genes: the nucleocapsid, phosphoprotein, matrix protein, glycoprotein, non-virion protein and polymerase protein genes, respectively. This study describes molecular characterization of the virulent IHNV strain 220-90, belonging to the M genogroup, and its phylogenetic relationships with available sequences of IHNV isolates worldwide. Results: The complete genomic sequence of IHNV strain 220-90 was determined from the DNA of six overlapping clones obtained by RT-PCR amplification of genomic RNA. The complete genome sequence of 220-90 comprises 11,133 nucleotides (GenBank GQ413939) with the gene order of 3'-N-P-M-G-NV-L-5'. These genes are separated by conserved gene junctions, with di-nucleotide gene spacers. An additional uracil nucleotide was found at the end of the 5'-trailer region, which was not reported before in other IHNV strains. The first 15 of the 16 nucleotides at the 3'- and 5'-termini of the genome are complementary, and the first 4 nucleotides at the 3'-ends of the IHNV are identical to those of other novirhabdoviruses. Sequence homology and phylogenetic analysis of the glycoprotein genes show that the 220-90 strain is 97% identical to most of the IHNV strains. Comparison of the virulent 220-90 genomic sequence with that of the less virulent WRAC isolate shows more than 300 nucleotide changes in the genome, which does not allow speculation about putative residues involved in the virulence of IHNV. Conclusion: We have molecularly characterized one of the well-studied IHNV isolates, 220-90 of genogroup M, which is virulent for rainbow trout, and compared its phylogenetic relationship with North American and other strains. Determination of the complete nucleotide sequence is essential for future studies on pathogenesis of IHNV using a reverse genetics approach and developing efficient control strategies. PMID:20085652

  5. Complete genome sequence analysis of novel human bocavirus reveals genetic recombination between human bocavirus 2 and human bocavirus 4.

    PubMed

    Khamrin, Pattara; Okitsu, Shoko; Ushijima, Hiroshi; Maneekarn, Niwat

    2013-07-01

    Epidemiological surveillance of human bocavirus (HBoV) was conducted on fecal specimens collected from hospitalized children with diarrhea in Chiang Mai, Thailand in 2011. By partial sequence analysis of the VP1 gene, an unusual strain of HBoV (CMH-S011-11) was initially identified as HBoV4. Complete genome sequencing of CMH-S011-11 was performed and the sequence analyzed further to clarify whether it was a recombinant strain or a new HBoV variant. Analysis of the complete genome sequence revealed that the coding sequence starting from NS1, NP1 to VP1/VP2 was 4795 nucleotides long. Interestingly, the nucleotide sequence of the NS1 gene of CMH-S011-11 was most closely related to the HBoV2 reference strains detected in Pakistan, which contradicted the initial genotyping result of the partial VP1 region in the previous study. In addition, comparison of the NP1 nucleotide sequence of CMH-S011-11 with those of other HBoV1-4 reference strains also revealed a high level of sequence identity with HBoV2. On the other hand, the nucleotide sequence of the VP1/VP2 gene of CMH-S011-11 was most closely related to those of HBoV4 reference strains detected in Nigeria. The overall full-length sequence analysis revealed that CMH-S011-11 was grouped within the HBoV4 species, but located in a separate branch from other HBoV4 prototype strains. Recombination analysis revealed that CMH-S011-11 was the result of recombination between HBoV2 and HBoV4 strains with the break point located near the start codon of VP2. Copyright © 2013 Elsevier B.V. All rights reserved.

  6. Complete genome sequence of Chinese strain of ‘Candidatus Liberibacter asiaticus’

    USDA-ARS's Scientific Manuscript database

    The complete genome sequence of ‘Candidatus Liberibacter asiaticus’ strain (Las) Guangxi-1(GX-1) was obtained by an Illumina HiSeq 2000. The GX-1 genome comprises 1,268,237 nucleotides, 36.5 % GC content, 1,141 predicted coding sequences, 44 tRNAs, 3 complete copies of ribosomal RNA genes (16S, 23S ...

  7. Complete Genome Sequence of Porcine Parvovirus 2 Recovered from Swine Sera.

    PubMed

    Campos, F S; Kluge, M; Franco, A C; Giongo, A; Valdez, F P; Saddi, T M; Brito, W M E D; Roehe, P M

    2016-01-28

    A complete genomic sequence of porcine parvovirus 2 (PPV-2) was detected by viral metagenome analysis on swine sera. A phylogenetic analysis of this genome reveals that it is highly similar to previously reported North American PPV-2 genomes. The complete PPV-2 sequence is 5,426 nucleotides long. Copyright © 2016 Campos et al.

  8. Complete nucleotide sequence and genome structure of a Japanese isolate of hibiscus latent Fort Pierce virus, a unique tobamovirus that contains an internal poly(A) region in its 3' end.

    PubMed

    Yoshida, Tetsuya; Kitazawa, Yugo; Komatsu, Ken; Neriya, Yutaro; Ishikawa, Kazuya; Fujita, Naoko; Hashimoto, Masayoshi; Maejima, Kensaku; Yamaji, Yasuyuki; Namba, Shigetou

    2014-11-01

    In this study, we detected a Japanese isolate of hibiscus latent Fort Pierce virus (HLFPV-J), a member of the genus Tobamovirus, in a hibiscus plant in Japan and determined the complete sequence and organization of its genome. HLFPV-J has four open reading frames (ORFs), each of which shares more than 98 % nucleotide sequence identity with those of other HLFPV isolates. Moreover, HLFPV-J contains a unique internal poly(A) region of variable length, ranging from 44 to 78 nucleotides, in its 3'-untranslated region (UTR), as is the case with hibiscus latent Singapore virus (HLSV), another hibiscus-infecting tobamovirus. The length of the HLFPV-J genome was 6431 nucleotides, including the shortest internal poly(A) region. The sequence identities of ORFs 1, 2, 3 and 4 of HLFPV-J to other tobamoviruses were 46.6-68.7, 49.9-70.8, 31.0-70.8 and 39.4-70.1 %, respectively, at the nucleotide level and 39.8-75.0, 43.6-77.8, 19.2-70.4 and 31.2-74.2 %, respectively, at the amino acid level. The 5'- and 3'-UTRs of HLFPV-J showed 24.3-58.6 and 13.0-79.8 % identity, respectively, to other tobamoviruses. In particular, when compared to other tobamoviruses, each ORF and UTR of HLFPV-J showed the highest sequence identity to those of HLSV. Phylogenetic analysis showed that HLFPV-J, other HLFPV isolates and HLSV constitute a malvaceous-plant-infecting tobamovirus cluster. These results indicate that the genomic structure of HLFPV-J has unique features similar to those of HLSV. To our knowledge, this is the first report of the complete genome sequence of HLFPV.

  9. Complete genome sequence of maize yellow striate virus, a new cytorhabdovirus infecting maize and wheat crops in Argentina.

    PubMed

    Maurino, Fernanda; Dumón, Analía D; Llauger, Gabriela; Alemandri, Vanina; de Haro, Luis A; Mattio, M Fernanda; Del Vas, Mariana; Laguna, Irma Graciela; Giménez Pecci, María de la Paz

    2018-01-01

    A rhabdovirus infecting maize and wheat crops in Argentina was molecularly characterized. Through next-generation sequencing (NGS) of symptomatic leaf samples, the complete genome was obtained of two isolates of maize yellow striate virus (MYSV), a putative new rhabdovirus, differing by only 0.4% at the nucleotide level. The MYSV genome consists of 12,654 nucleotides for maize and wheat virus isolates, and shares 71% nucleotide sequence identity with the complete genome of barley yellow striate mosaic virus (BYSMV, NC028244). Ten open reading frames (ORFs) were predicted in the MYSV genome from the antigenomic strand and were compared with their BYSMV counterparts. The highest amino acid sequence identity of the MYSV and BYSMV proteins was 80% between the L proteins, and the lowest was 37% between the proteins 4. Phylogenetic analysis suggested that the MYSV isolates are new members of the genus Cytorhabdovirus, family Rhabdoviridae. Yellow striate, affecting maize and wheat crops in Argentina, is an emergent disease that presents a potential economic risk for these widely distributed crops.

  10. Complete sequence analysis reveals two distinct poleroviruses infecting cucurbits in China.

    PubMed

    Xiang, Hai-ying; Shang, Qiao-xia; Han, Cheng-gui; Li, Da-wei; Yu, Jia-lin

    2008-01-01

    The complete RNA genomes of a Chinese isolate of cucurbit aphid-borne yellows virus (CABYV-CHN) and a new polerovirus tentatively referred to as melon aphid-borne yellows virus (MABYV) were determined. The entire genome of CABYV-CHN shared 89.0% nucleotide sequence identity with the French CABYV isolate. In contrast, nucleotide sequence identities between MABYV and CABYV and other poleroviruses were in the range of 50.7-74.2%, with amino acid sequence identities ranging from 24.8 to 82.9% for individual gene products. We propose that CABYV-CHN is a strain of CABYV and that MABYV is a member of a tentative distinct species within the genus Polerovirus.

  11. The complete genomic sequence of a tentative new polerovirus identified in barley in South Korea.

    PubMed

    Zhao, Fumei; Lim, Seungmo; Yoo, Ran Hee; Igori, Davaajargal; Kim, Sang-Min; Kwak, Do Yeon; Kim, Sun Lim; Lee, Bong Choon; Moon, Jae Sun

    2016-07-01

    The complete nucleotide sequence of a new barley polerovirus, tentatively named barley virus G (BVG), which was isolated in Gimje, South Korea, has been determined using an RNA sequencing technique combined with polymerase chain reaction methods. The viral genomic RNA of BVG is 5,620 nucleotides long and contains six typical open reading frames commonly observed in other poleroviruses. Sequence comparisons revealed that BVG is most closely related to maize yellow dwarf virus-RMV, with the highest amino acid identities being less than 90 % for all of the corresponding proteins. These results suggested that BVG is a member of a new species in the genus Polerovirus.

  12. Complete nucleotide sequence of spring beauty latent virus, a bromovirus infectious to Arabidopsis thaliana.

    PubMed

    Fujisaki, K; Hagihara, F; Kaido, M; Mise, K; Okuno, T

    2003-01-01

    Spring beauty latent virus (SBLV), a bromovirus, systemically and efficiently infected Arabidopsis thaliana, whereas the well-studied bromoviruses brome mosaic virus (BMV) and cowpea chlorotic mottle virus (CCMV) did not infect and poorly infected A. thaliana, respectively. We constructed biologically active cDNA clones of SBLV genomic RNAs and determined their complete nucleotide sequences. Interestingly, SBLV RNA3 contains both the box B motif in the intercistronic region, as does BMV, and the subgenomic promoter-like sequence in the 5' noncoding region, as does CCMV. Sequence comparisons of SBLV, BMV, CCMV, and broad bean mottle virus demonstrated that SBLV is closely related to BMV and CCMV.

  13. Molecular variability analysis of five new complete cacao swollen shoot virus genomic sequences.

    PubMed

    Muller, E; Sackey, S

    2005-01-01

    Cacao swollen shoot virus (CSSV), a member of the family Caulimoviridae, genus Badnavirus, occurs in all the main cacao-growing areas of West Africa. We amplified, cloned and sequenced complete genomes of five new isolates, two originating from Togo and three originating from Ghana. The genomes of these five newly sequenced isolates all contain the five putative open reading frames I, II, III, X and Y described for the first sequenced CSSV isolate, Agou1, originating from Togo. Their genomes have been aligned with the genome of Agou1. The nucleotide and amino acid sequence identities between isolates have been calculated and a phylogenetic analysis has been made including other pararetroviruses. Maximum nucleotide sequence variability between complete genomes of CSSV isolates was 29.4%. Geographical differentiation between isolates appears more important than differentiation between mild and severe isolates. ORF X differs greatly in size and sequence between the Togolese isolates Nyongbo2 and Agou1, and the four other isolates; its functional role is therefore clearly questionable.

  14. Complete genome sequence of a novel genotype of squash mosaic virus

    USDA-ARS's Scientific Manuscript database

    Complete genome sequence of a novel genotype of Squash mosaic virus (SqMV) infecting squash plants in Spain was obtained using deep sequencing of small ribonucleic acids and assembly. The low nucleotide sequence identities, with 87-88% on RNA1 and 84-86% on RNA2 to known SqMV isolates, suggest a new...

  15. First complete genome sequence of an emerging cucumber green mottle mosaic virus isolate in North America

    USDA-ARS's Scientific Manuscript database

    The complete genome sequence (6,423 nt) of an emerging Cucumber green mottle mosaic virus (CGMMV) isolate on cucumber in North America was determined through deep sequencing of sRNA and rapid amplification of cDNA ends. It shares 99% nucleotide sequence identity to the Asian genotype, but only 90% t...

  16. Complete Genomic Sequence and Comparative Analysis of the Genome Segments of Sweet Potato Chlorotic Stunt Virus in China

    PubMed Central

    Qin, Yanhong; Wang, Li; Zhang, Zhenchen; Qiao, Qi; Zhang, Desheng; Tian, Yuting; Wang, Shuang; Wang, Yongjiang; Yan, Zhaoling

    2014-01-01

    Background: Sweet potato chlorotic stunt virus (SPCSV; family Closteroviridae, genus Crinivirus) features a large bipartite, single-stranded, positive-sense RNA genome. To date, only three complete genomic sequences of SPCSV can be accessed through GenBank. SPCSV was first detected in China in 2011, and only partial genomic sequences have been determined in the country. No report on the complete genomic sequence and genome structure of Chinese SPCSV isolates or the genetic relation between isolates from China and other countries is available. Methodology/Principal Findings: The complete genomic sequences of five isolates from different areas in China were characterized. This study is the first to report the complete genome sequences of SPCSV from whitefly vectors. Genome structure analysis showed that isolates of WA and EA strains from China have the same coding protein as isolates Can181-9 and m2-47, respectively. Twenty cp genes and four RNA1 partial segments were sequenced and analyzed, and the nucleotide identities of complete genomic, cp, and RNA1 partial sequences were determined. Results indicated high conservation among strains and significant differences between WA and EA strains. Genetic analysis demonstrated that, except for isolates from Guangdong Province, SPCSVs from other areas belong to the WA strain. Genome organization analysis showed that the isolates in this study lack the p22 gene. Conclusions/Significance: We presented the complete genome sequences of SPCSV in China. Comparison of nucleotide identities and genome structures between these isolates and previously reported isolates showed slight differences. The nucleotide identities of different SPCSV isolates showed high conservation among strains and significant differences between strains. All nine isolates in this study lacked the p22 gene. WA strains were more extensively distributed than EA strains in China. These data provide important insights into the molecular variation and genomic structure of SPCSV in China as well as genetic relationships among isolates from China and other countries. PMID:25170926

  17. The complete genome sequence of a virus associated with cotton blue disease, cotton leafroll dwarf virus, confirms that it is a new member of the genus Polerovirus.

    PubMed

    Distéfano, Ana J; Bonacic Kresic, Ivan; Hopp, H Esteban

    2010-11-01

    Cotton blue disease is the most important virus disease of cotton in the southern part of America. The complete nucleotide sequence of the ssRNA genome of the cotton blue disease-associated virus was determined for the first time. It comprised 5,866 nucleotides, and the deduced genomic organization resembled that of members of the genus Polerovirus. Sequence homology comparison and phylogenetic analysis confirm that this virus (previously proposed name: cotton leafroll dwarf virus) is a member of a new species within the genus Polerovirus.

  18. Complete genome sequence of a divergent strain of Japanese yam mosaic virus from China

    USDA-ARS's Scientific Manuscript database

    A novel strain of Japanese yam mosaic virus (JYMV-CN) was identified in a yam plant with foliar mottle symptoms in China. The complete genomic sequence of JYMV-CN was determined. Its genomic sequence of 9701 nucleotides encodes a polyprotein of 3247 amino acids. Its organization was virtually identi...

  19. Complete genome analysis of jasmine virus T from Jasminum sambac in China.

    PubMed

    Tang, Yajun; Gao, Fangluan; Yang, Zhen; Wu, Zujian; Yang, Liang

    2016-07-01

    The genome of a potyvirus (isolate JaVT_FZ) recovered from jasmine (Jasminum sambac L.) showing yellow ringspot symptoms in Fuzhou, China, was sequenced. JaVT_FZ is closely related to seven other potyviruses with completely sequenced genomes, with which it shares 66-70 % nucleotide and 52-56 % amino acid sequence identity. However, the coat protein (CP) gene shares 82-92 % nucleotide and 90-97 % amino acid sequence identity with those of two partially sequenced potyviruses, named jasmine potyvirus T (JaVT-jasmine) and jasmine yellow mosaic potyvirus (JaYMV-India), respectively. This suggests that JaVT_FZ, JaVT-jasmine and JaYMV-India should be regarded as members of a single potyvirus species, for which the name "Jasmine virus T" has priority.

  20. Complete Genome Sequence of a Putative Densovirus of the Asian Citrus Psyllid, Diaphorina citri.

    PubMed

    Nigg, Jared C; Nouri, Shahideh; Falk, Bryce W

    2016-07-28

    Here, we report the complete genome sequence of a putative densovirus of the Asian citrus psyllid, Diaphorina citri. Diaphorina citri densovirus (DcDNV) was originally identified through metagenomics, and here, we obtained the complete nucleotide sequence using PCR-based approaches. Phylogenetic analysis places DcDNV between viruses of the Ambidensovirus and Iteradensovirus genera. Copyright © 2016 Nigg et al.

  1. Complete genome sequence and phylogenetic analyses of an aquabirnavirus isolated from a diseased marbled eel culture in Taiwan.

    PubMed

    Wen, Chiu-Ming

    2017-08-01

    An aquabirnavirus was isolated from diseased marbled eels (Anguilla marmorata; MEIPNV1310) with gill haemorrhages and associated mortality. Its genome segment sequences were obtained through next-generation sequencing and compared with published aquabirnavirus sequences. The results indicated that the genome sequence of MEIPNV1310 contains segment A (3099 nucleotides) and segment B (2789 nucleotides). Phylogenetic analysis showed that MEIPNV1310 is closely related to the infectious pancreatic necrosis Ab strain within genogroup II. This genome sequence is beneficial for studying the geographic distribution and evolution of aquabirnaviruses.

  2. Phylogenetic Network for European mtDNA

    PubMed Central

    Finnilä, Saara; Lehtonen, Mervi S.; Majamaa, Kari

    2001-01-01

    The sequence in the first hypervariable segment (HVS-I) of the control region has been used as a source of evolutionary information in most phylogenetic analyses of mtDNA. Population genetic inference would benefit from a better understanding of the variation in the mtDNA coding region, but, thus far, complete mtDNA sequences have been rare. We determined the nucleotide sequence in the coding region of mtDNA from 121 Finns, by conformation-sensitive gel electrophoresis and subsequent sequencing and by direct sequencing of the D loop. Furthermore, 71 sequences from our previous reports were included, so that the samples represented all the mtDNA haplogroups present in the Finnish population. We found a total of 297 variable sites in the coding region, which allowed the compilation of unambiguous phylogenetic networks. The D loop harbored 104 variable sites, and, in most cases, these could be localized within the coding-region networks, without discrepancies. Interestingly, many homoplasies were detected in the coding region. Nucleotide variation in the rRNA and tRNA genes was 6%, and that in the third nucleotide positions of structural genes amounted to 22% of that in the HVS-I. The complete networks enabled the relationships between the mtDNA haplogroups to be analyzed. Phylogenetic networks based on the entire coding-region sequence in mtDNA provide a rich source for further population genetic studies, and complete sequences make it easier to differentiate between disease-causing mutations and rare polymorphisms. PMID:11349229

  3. Complete Genome Sequence of Komagataeibacter hansenii Strain SC-3B

    PubMed Central

    Santos, Richard; Ebels, Marcus; Bordbar, Darius

    2017-01-01

    ABSTRACT This study reports the release of the complete nucleotide sequence of Komagataeibacter hansenii SC-3B, a new efficient producer of cellulose. Elucidation of the genome may provide more information to aid in understanding the genes necessary for cellulose biosynthesis. PMID:28408681

  4. Complete Genome Sequence of Komagataeibacter hansenii LMG 23726T

    PubMed Central

    Santos, Richard; Ebels, Marcus; Bordbar, Darius

    2017-01-01

    ABSTRACT This study reports the release of the complete nucleotide sequence of Komagataeibacter hansenii LMG 23726T. This organism is a cellulose producer, and its genome may provide more information to aid in the understanding of the genes necessary for cellulose biosynthesis. PMID:28408680

  5. Begomoviruses infecting weeds in Cuba: increased host range and a novel virus infecting Sida rhombifolia.

    PubMed

    Fiallo-Olivé, Elvira; Navas-Castillo, Jesús; Moriones, Enrique; Martínez-Zubiaur, Yamila

    2012-01-01

    As a result of surveys conducted during the last few years to search for wild reservoirs of begomoviruses in Cuba, we detected a novel bipartite begomovirus, sida yellow mottle virus (SiYMoV), infecting Sida rhombifolia plants. The complete genome sequence was obtained, showing that DNA-A was 2622 nucleotides (nt) in length and that it was most closely related (87.6% nucleotide identity) to DNA-A of an isolate of sida golden mosaic virus (SiGMV) that infects snap beans (Phaseolus vulgaris) in Florida. The DNA-B sequence was 2600 nt in length and shared the highest nucleotide identity (75.1%) with corchorus yellow spot virus (CoYSV). Phylogenetic relationship analysis showed that both DNA components of SiYMoV were grouped in the Abutilon clade, along with begomoviruses from Florida and the Caribbean islands. We also present here the complete nucleotide sequence of a novel strain of sida yellow vein virus found infecting Malvastrum coromandelianum and an isolate of euphorbia mosaic virus that was found for the first time infecting Euphorbia heterophylla in Cuba.

  6. Complete genome sequence of keunjorong mosaic virus, a potyvirus from Cynanchum wilfordii.

    PubMed

    Nam, Moon; Lee, Joo-Hee; Choi, Hong Soo; Lim, Hyoun-Sub; Moon, Jae Sun; Lee, Su-Heon

    2013-08-01

    We have determined the complete genome sequence of keunjorong mosaic virus (KjMV). The KjMV genome is composed of 9,611 nucleotides, excluding the 3'-terminal poly(A) tail. It contains two open reading frames (ORFs), with the large one encoding a polyprotein of 3,070 amino acids and the small overlapping ORF encoding a PIPO protein of 81 amino acids. The KjMV genome shared the highest nucleotide sequence identity (57.5 %) with pepper mottle virus and freesia mosaic virus, two members of the genus Potyvirus. Based on the phylogenetic relatedness to known potyviruses, KjMV appears to be a member of a new species in the genus Potyvirus.

  7. Molecular characterization of a novel rhabdovirus infecting blackcurrant identified by high-throughput sequencing.

    PubMed

    Wu, L-P; Yang, T; Liu, H-W; Postman, J; Li, R

    2018-05-01

    A large contig with sequence similarities to several nucleorhabdoviruses was identified by high-throughput sequencing analysis from a black currant (Ribes nigrum L.) cultivar. The complete genome sequence of this new nucleorhabdovirus is 14,432 nucleotides long. Its genomic organization is very similar to those of unsegmented plant rhabdoviruses, containing six open reading frames in the order 3'-N-P-P3-M-G-L-5'. The virus, which is provisionally named "black currant-associated rhabdovirus", is 41-52% identical in its genome nucleotide sequence to other nucleorhabdoviruses and may represent a new species in the genus Nucleorhabdovirus.

  8. Correlation approach to identify coding regions in DNA sequences

    NASA Technical Reports Server (NTRS)

    Ossadnik, S. M.; Buldyrev, S. V.; Goldberger, A. L.; Havlin, S.; Mantegna, R. N.; Peng, C. K.; Simons, M.; Stanley, H. E.

    1994-01-01

    Recently, it was observed that noncoding regions of DNA sequences possess long-range power-law correlations, whereas coding regions typically display only short-range correlations. We develop an algorithm based on this finding that enables investigators to perform a statistical analysis on long DNA sequences to locate possible coding regions. The algorithm is particularly successful in predicting the location of lengthy coding regions. For example, for the complete genome of yeast chromosome III (315,344 nucleotides), at least 82% of the predictions correspond to putative coding regions; the algorithm correctly identified all coding regions larger than 3000 nucleotides, 92% of coding regions between 2000 and 3000 nucleotides long, and 79% of coding regions between 1000 and 2000 nucleotides. The predictive ability of this new algorithm supports the claim that there is a fundamental difference in the correlation property between coding and noncoding sequences. This algorithm, which is not species-dependent, can be implemented with other techniques for rapidly and accurately locating relatively long coding regions in genomic sequences.
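
    The abstract above does not spell out the algorithm, so the following is only a minimal sketch of the general idea it builds on: map a sequence to a "DNA walk" and measure how the fluctuation of that walk grows with window length. The purine/pyrimidine step rule, the function names and the window lengths are illustrative assumptions, not the authors' published procedure. A fitted exponent near 0.5 would be consistent with short-range correlations, and a larger exponent with long-range correlations.

```python
import math

PYRIMIDINES = {"C", "T"}  # assumption: step +1 for pyrimidines, -1 for everything else


def dna_walk(seq):
    """Cumulative walk: +1 for each pyrimidine, -1 for each other base."""
    position, walk = 0, [0]
    for base in seq.upper():
        position += 1 if base in PYRIMIDINES else -1
        walk.append(position)
    return walk


def fluctuation(walk, length):
    """Root-mean-square fluctuation of walk displacements over `length` steps."""
    diffs = [walk[i + length] - walk[i] for i in range(len(walk) - length)]
    mean = sum(diffs) / len(diffs)
    variance = sum(d * d for d in diffs) / len(diffs) - mean * mean
    return math.sqrt(max(variance, 0.0))


def scaling_exponent(seq, lengths=(2, 4, 8, 16)):
    """Least-squares slope of log F(l) against log l (hypothetical window lengths)."""
    walk = dna_walk(seq)
    points = []
    for length in lengths:
        f = fluctuation(walk, length)
        if f > 0:  # log is undefined for a zero fluctuation
            points.append((math.log(length), math.log(f)))
    if len(points) < 2:
        raise ValueError("need at least two non-zero fluctuation values")
    mean_x = sum(x for x, _ in points) / len(points)
    mean_y = sum(y for _, y in points) / len(points)
    numerator = sum((x - mean_x) * (y - mean_y) for x, y in points)
    denominator = sum((x - mean_x) ** 2 for x, _ in points)
    return numerator / denominator
```

    In practice such an exponent would be computed in a window slid along a long sequence, with candidate coding regions flagged where it stays close to 0.5; the window size and cut-off would need to be chosen and validated separately.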

  9. Nucleotide sequence and genetic organization of barley stripe mosaic virus RNA gamma.

    PubMed

    Gustafson, G; Hunter, B; Hanau, R; Armour, S L; Jackson, A O

    1987-06-01

    The complete nucleotide sequences of RNA gamma from the Type and ND18 strains of barley stripe mosaic virus (BSMV) have been determined. The sequences are 3164 (Type) and 2791 (ND18) nucleotides in length. Both sequences contain a 5'-noncoding region (87 or 88 nucleotides) which is followed by a long open reading frame (ORF1). A 42-nucleotide intercistronic region separates ORF1 from a second, shorter open reading frame (ORF2) located near the 3'-end of the RNA. There is a high degree of homology between the Type and ND18 strains in the nucleotide sequence of ORF1. However, the Type strain contains a 366 nucleotide direct tandem repeat within ORF1 which is absent in the ND18 strain. Consequently, the predicted translation product of Type RNA gamma ORF1 (mol wt 87,312) is significantly larger than that of ND18 RNA gamma ORF1 (mol wt 74,011). The amino acid sequence of the ORF1 polypeptide contains homologies with putative RNA polymerases from other RNA viruses, suggesting that this protein may function in replication of the BSMV genome. The nucleotide sequence of RNA gamma ORF2 is nearly identical in the Type and ND18 strains. ORF2 codes for a polypeptide with a predicted molecular weight of 17,209 (Type) or 17,074 (ND18) which is known to be translated from a subgenomic (sg) RNA. The initiation point of this sgRNA has been mapped to a location 27 nucleotides upstream of the ORF2 initiation codon in the intercistronic region between ORF1 and ORF2. The sgRNA is not coterminal with the 3'-end of the genomic RNA, but instead contains heterogeneous poly(A) termini up to 150 nucleotides long (J. Stanley, R. Hanau, and A. O. Jackson, 1984, Virology 139, 375-383). In the genomic RNA gamma, ORF2 is followed by a short poly(A) tract and a 238-nucleotide tRNA-like structure.

  10. First Complete Genome Sequence of Suakwa aphid-borne yellows virus from East Timor

    PubMed Central

    Maina, Solomon; Edwards, Owain R.; de Almeida, Luis; Ximenes, Abel

    2016-01-01

    We present here the first complete genomic RNA sequence of the polerovirus Suakwa aphid-borne yellows virus (SABYV), from East Timor. The isolate sequenced came from a virus-infected pumpkin plant. The East Timorese genome had a nucleotide identity of 86.5% with the only other SABYV genome available, which is from Taiwan. PMID:27469955

  11. High Degree of Interlaboratory Reproducibility of Human Immunodeficiency Virus Type 1 Protease and Reverse Transcriptase Sequencing of Plasma Samples from Heavily Treated Patients

    PubMed Central

    Shafer, Robert W.; Hertogs, Kurt; Zolopa, Andrew R.; Warford, Ann; Bloor, Stuart; Betts, Bradley J.; Merigan, Thomas C.; Harrigan, Richard; Larder, Brendon A.

    2001-01-01

    We assessed the reproducibility of human immunodeficiency virus type 1 (HIV-1) reverse transcriptase (RT) and protease sequencing using cryopreserved plasma aliquots obtained from 46 heavily treated HIV-1-infected individuals in two laboratories using dideoxynucleotide sequencing. The rates of complete sequence concordance between the two laboratories were 99.1% for the protease sequence and 99.0% for the RT sequence. Approximately 90% of the discordances were partial, defined as one laboratory detecting a mixture and the second laboratory detecting only one of the mixture's components. Only 0.1% of the nucleotides were completely discordant between the two laboratories, and these were significantly more likely to occur in plasma samples with lower plasma HIV-1 RNA levels. Nucleotide mixtures were detected at approximately 1% of the nucleotide positions, and in every case in which one laboratory detected a mixture, the second laboratory either detected the same mixture or detected one of the mixture's components. The high rate of concordance in detecting mixtures and the fact that most discordances between the two laboratories were partial suggest that most discordances were caused by variation in sampling of the HIV-1 quasispecies by PCR rather than by technical errors in the sequencing process itself. PMID:11283081
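
    The distinction between partial and complete discordance described above can be sketched in a few lines. This is an illustration only, assuming that each laboratory's base call is written as a standard IUPAC nucleotide code (so that a mixture is an ambiguity code such as R for A/G); the function names are hypothetical, and treating two different mixtures as a complete discordance is an assumption that the abstract does not address.

```python
from collections import Counter

# Standard IUPAC nucleotide codes and the bases each one represents.
IUPAC = {
    "A": {"A"}, "C": {"C"}, "G": {"G"}, "T": {"T"},
    "R": {"A", "G"}, "Y": {"C", "T"}, "S": {"C", "G"}, "W": {"A", "T"},
    "K": {"G", "T"}, "M": {"A", "C"},
    "B": {"C", "G", "T"}, "D": {"A", "G", "T"},
    "H": {"A", "C", "T"}, "V": {"A", "C", "G"},
    "N": {"A", "C", "G", "T"},
}


def classify(call_1, call_2):
    """Classify one position as 'concordant', 'partial' or 'complete'."""
    bases_1, bases_2 = IUPAC[call_1.upper()], IUPAC[call_2.upper()]
    if bases_1 == bases_2:
        return "concordant"
    # Partial: one call is a mixture and the other is a single component of it.
    if len(bases_1) > 1 and len(bases_2) == 1 and bases_2 <= bases_1:
        return "partial"
    if len(bases_2) > 1 and len(bases_1) == 1 and bases_1 <= bases_2:
        return "partial"
    # Assumption: everything else, including two different mixtures, is complete.
    return "complete"


def tally(seq_1, seq_2):
    """Count concordance classes over two equal-length, pre-aligned sequences."""
    if len(seq_1) != len(seq_2):
        raise ValueError("sequences must be aligned to the same length")
    return Counter(classify(a, b) for a, b in zip(seq_1, seq_2))
```

    For example, `tally("ACRT", "ACAC")` counts two concordant positions, one partial discordance (R against A) and one complete discordance (T against C).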

  12. Random Amplification and Pyrosequencing for Identification of Novel Viral Genome Sequences

    PubMed Central

    Hang, Jun; Forshey, Brett M.; Kochel, Tadeusz J.; Li, Tao; Solórzano, Víctor Fiestas; Halsey, Eric S.; Kuschner, Robert A.

    2012-01-01

    ssRNA viruses have high levels of genomic divergence, which can lead to difficulty in genomic characterization of new viruses using traditional PCR amplification and sequencing methods. In this study, random reverse transcription, anchored random PCR amplification, and high-throughput pyrosequencing were used to identify orthobunyavirus sequences from total RNA extracted from viral cultures of acute febrile illness specimens. Draft genome sequence for the orthobunyavirus L segment was assembled and sequentially extended using de novo assembly contigs from pyrosequencing reads and orthobunyavirus sequences in GenBank as guidance. Accuracy and continuous coverage were achieved by mapping all reads to the L segment draft sequence. Subsequently, RT-PCR and Sanger sequencing were used to complete the genome sequence. The complete L segment was found to be 6936 bases in length, encoding a 2248-aa putative RNA polymerase. The identified L segment was distinct from previously published South American orthobunyaviruses, sharing 63% and 54% identity at the nucleotide and amino acid level, respectively, with the complete Oropouche virus L segment and 73% and 81% identity at the nucleotide and amino acid level, respectively, with a partial Caraparu virus L segment. The result demonstrated the effectiveness of a sequence-independent amplification and next-generation sequencing approach for obtaining complete viral genomes from total nucleic acid extracts and its use in pathogen discovery. PMID:22468136

  13. Molecular characterization of a novel Nucleorhabdovirus from black currant identified by high-throughput sequencing

    USDA-ARS's Scientific Manuscript database


    Contigs with sequence similarities to several nucleorhabdoviruses were identified by high-throughput sequencing analysis from a black currant (Ribes nigrum L.) cultivar. The complete genomic sequence of this new nucleorhabdovirus is 14,432 nucleotides. Its genomic organization is typical of nucleorh...

  14. Complete nucleotide sequences of the coat protein messenger RNAs of brome mosaic virus and cowpea chlorotic mottle virus.

    PubMed Central

    Dasgupta, R; Kaesberg, P

    1982-01-01

    The nucleotide sequences of the subgenomic coat protein messengers (RNA4's) of two related bromoviruses, brome mosaic virus (BMV) and cowpea chlorotic mottle virus (CCMV), have been determined by direct RNA and cDNA sequencing without cloning. BMV RNA4 is 876 b long including a 5' noncoding region of nine nucleotides and a 3' noncoding region of 300 nucleotides. CCMV RNA4 is 824 b long, including a 5' noncoding region of 10 nucleotides and a 3' noncoding region of 244 nucleotides. The encoded coat proteins are similar in length (188 amino acids for BMV and 189 amino acids for CCMV) and display about 70% homology in their amino acid sequences. Length difference between the two RNAs is due mostly to a single deletion, in CCMV with respect to BMV, of about 57 b immediately following the coding region. Allowing for this deletion the RNAs are indicate that mutations leading to divergence were constrained in the coding region primarily by the requirement of maintaining a favorable coat protein structure and in the 3' noncoding region primarily by the requirement of maintaining a favorable RNA spatial configuration. PMID:6895941

  15. Complete genome sequence of a new begomovirus associated with yellow mosaic disease of Hemidesmus indicus in India.

    PubMed

    Reddy, M Sreekanth; Kanakala, S; Srinivas, K P; Hema, M; Malathi, V G; Sreenivasulu, P

    2014-05-01

    The complete DNA A genome of a virus isolate associated with yellow mosaic disease of a medicinal plant, Hemidesmus indicus, from India was cloned and sequenced. The length of DNA A was 2825 nucleotides, 35 nucleotides longer than the unit genome of monopartite begomoviruses. Comparison of the nucleotide sequence of DNA A of the virus isolate with those of other begomoviruses showed maximum sequence identity of 69 % to DNA A of ageratum yellow vein China virus (AYVCNV; AJ558120) and 68 % with tomato yellow leaf curl virus-LBa4 (TYLCV; EF185318), and it formed a distinct clade in phylogenetic analysis. The genome organization of the present virus isolate was found to be similar to that of Old World monopartite begomoviruses. The genome was considered to be monopartite, because association of DNA B and β satellite DNA components was not detected. Based on its sequence identity (<70 %) to all other begomoviruses known to date and ICTV (International Committee on Taxonomy of Viruses) species demarcating criteria (<89 % identity), it is considered a member of a novel begomovirus species, and the tentative name "Hemidesmus yellow mosaic virus" (HeYMV) is proposed.
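
    Species assignments such as the one above rest on pairwise nucleotide identity compared against a demarcation threshold. The sketch below shows one simple way such a percentage might be computed from two sequences that have already been aligned; skipping gapped columns is an assumption (tools differ in how they treat gaps), the function names are hypothetical, and the 89 % default is simply the criterion quoted in the abstract.

```python
def percent_identity(aligned_a, aligned_b):
    """Percent identity over alignment columns where neither sequence has a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = matches = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == "-" or b == "-":
            continue  # assumption: gapped columns are excluded from the count
        compared += 1
        matches += a == b
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


def below_demarcation(identity, threshold=89.0):
    """True when identity falls below the species demarcation threshold."""
    return identity < threshold
```

    With the figures reported above, `below_demarcation(69.0)` is true, which matches the conclusion that the isolate represents a distinct species.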

  16. Typing of canine parvovirus isolates using mini-sequencing based single nucleotide polymorphism analysis.

    PubMed

    Naidu, Hariprasad; Subramanian, B Mohana; Chinchkar, Shankar Ramchandra; Sriraman, Rajan; Rana, Samir Kumar; Srinivasan, V A

    2012-05-01

    The antigenic types of canine parvovirus (CPV) are defined based on differences in the amino acids of the major capsid protein VP2. Type specificity is conferred by a limited number of amino acid changes and in particular by few nucleotide substitutions. PCR based methods are not particularly suitable for typing circulating variants which differ in a few specific nucleotide substitutions. Assays for determining SNPs can detect efficiently nucleotide substitutions and can thus be adapted to identify CPV types. In the present study, CPV typing was performed by single nucleotide extension using the mini-sequencing technique. A mini-sequencing signature was established for all the four CPV types (CPV2, 2a, 2b and 2c) and feline panleukopenia virus. The CPV typing using the mini-sequencing reaction was performed for 13 CPV field isolates and the two vaccine strains available in our repository. All the isolates had been typed earlier by full-length sequencing of the VP2 gene. The typing results obtained from mini-sequencing matched completely with that of sequencing. Typing could be achieved with less than 100 copies of standard plasmid DNA constructs or ≤10¹ FAID₅₀ of virus by mini-sequencing technique. The technique was also efficient for detecting multiple types in mixed infections. Copyright © 2012 Elsevier B.V. All rights reserved.

  17. Molecular characterization of a novel Luteovirus from peach identified by high-throughput sequencing

    USDA-ARS's Scientific Manuscript database

    Contigs with sequence homologies to Cherry-associated luteovirus were identified by high-throughput sequencing analysis of two peach accessions undergoing quarantine testing. The complete genomic sequences of the two isolates of this virus are 5,819 and 5,814 nucleotides. Their genome organization i...

  18. Complete genome sequence of a recent panzootic virulent Newcastle disease virus from Pakistan

    USDA-ARS's Scientific Manuscript database

    Complete genome sequence of a new strain of Newcastle disease virus (NDV) (chicken/Pak/Lahore-611/2013) is reported. The strain was isolated from a vaccinated chicken flock in Pakistan in 2013 and has panzootic features. The genome is 15192 nucleotides in length and is classified as sub-genotype V...

  19. Complete genome sequence of yam chlorotic necrosis virus, a novel macluravirus infecting yam

    USDA-ARS's Scientific Manuscript database

    Complete genomic sequence of a novel member of the genus Macluravirus was determined from yam plants with chlorotic and necrotic symptoms in China. The genomic RNA consists of 8,261 nucleotides (nt) excluding the 3’-terminal poly (A) tail, containing one long open reading frame (ORF) encoding a larg...

  20. Complete genome sequence of a novel potyvirus, Callistephus mottle virus identified in Callistephus chinensis

    USDA-ARS's Scientific Manuscript database

    The complete genomic sequence of a novel putative member of the genus Potyvirus was detected from Callistephus chinensis (china aster) in South Korea. The genomic RNA consists of 9,859 nucleotides excluding the 3’ poly(A) tail. The Callistephus virus genome, which contains the typical open reading f...

  1. The nucleotide sequence and genome organization of Plasmopara halstedii virus.

    PubMed

    Heller-Dohmen, Marion; Göpfert, Jens C; Pfannstiel, Jens; Spring, Otmar

    2011-03-17

    Only very few viruses of Oomycetes have been studied in detail. Isometric virions were found in different isolates of the oomycete Plasmopara halstedii, the downy mildew pathogen of sunflower. However, complete nucleotide sequences and data on the genome organization were lacking. Viral RNA of different P. halstedii isolates was subjected to nucleotide sequencing and analysis of the viral genome. The N-terminal sequence of the viral coat protein was determined using Top-Down MALDI-TOF analysis. The complete nucleotide sequences of both single-stranded RNA segments (RNA1 and RNA2) were established. RNA1 consisted of 2793 nucleotides (nt) exclusive of its 3' poly(A) tract and a single open-reading frame (ORF1) of 2745 nt. ORF1 was framed by a 5' untranslated region (5' UTR) of 18 nt and a 3' untranslated region (3' UTR) of 30 nt. ORF1 contained motifs of RNA-dependent RNA polymerases (RdRp) and showed similarities to RdRp of Sclerophthora macrospora virus A (SmV A) and viruses within the Nodaviridae family. RNA2 consisted of 1526 nt exclusive of its 3' poly(A) tract and a second ORF (ORF2) of 1128 nt. ORF2 coded for the single viral coat protein (CP) and was framed by a 5' UTR of 164 nt and a 3' UTR of 234 nt. The deduced amino acid sequence of ORF2 was verified by nano-LC-ESI-MS/MS experiments. Top-Down MALDI-TOF analysis revealed the N-terminal sequence of the CP. The N-terminal sequence represented a region within ORF2 suggesting a proteolytic processing of the CP in vivo. The CP showed similarities to CP of SmV A and viruses within the Tombusviridae family. Fragments of RNA1 (ca. 1.9 kb) and RNA2 (ca. 1.4 kb) were used to analyze the nucleotide sequence variation of virions in different P. halstedii isolates. Viral sequence variation was 0.3% or less regardless of their host's pathotypes, the geographical origin and the sensitivity towards the fungicide metalaxyl. The results showed the presence of a single and new virus type in different P. halstedii isolates. Insignificant viral sequence variation indicated that the virus did not account for differences in pathogenicity of the oomycete P. halstedii.
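
    The segment lengths reported above can be cross-checked with simple arithmetic: each segment should equal its 5' UTR plus its ORF plus its 3' UTR, and each ORF length should be a multiple of three. The short sketch below runs that check on the published figures; the dictionary layout and function name are illustrative only.

```python
# Lengths in nucleotides, as reported in the abstract (poly(A) tract excluded).
SEGMENTS = {
    "RNA1": {"utr5": 18, "orf": 2745, "utr3": 30, "total": 2793},
    "RNA2": {"utr5": 164, "orf": 1128, "utr3": 234, "total": 1526},
}


def check_segment(parts):
    """Return (lengths_add_up, orf_is_whole_codons) for one segment."""
    lengths_add_up = parts["utr5"] + parts["orf"] + parts["utr3"] == parts["total"]
    orf_is_whole_codons = parts["orf"] % 3 == 0
    return lengths_add_up, orf_is_whole_codons


for name, parts in SEGMENTS.items():
    print(name, check_segment(parts))
```

    Both segments pass: 18 + 2745 + 30 = 2793 and 164 + 1128 + 234 = 1526, and both ORF lengths are divisible by three.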

  2. Complete genome sequence of a new maize-associated cytorhabdovirus

    USDA-ARS's Scientific Manuscript database

    A new 11,877 nt cytorhabdovirus sequence with 6 open reading frames has been identified in a maize sample. It shares 50 and 51% genome-wide nucleotide sequence identity with northern cereal mosaic cytorhabdovirus (NCMV) and barley yellow striate mosaic cytorhabdovirus (BYSMV), respectively....

  3. Complete nucleotide sequence of pig (Sus scrofa) mitochondrial genome and dating evolutionary divergence within Artiodactyla.

    PubMed

    Lin, C S; Sun, Y L; Liu, C Y; Yang, P C; Chang, L C; Cheng, I C; Mao, S J; Huang, M C

    1999-08-05

    The complete nucleotide sequence of the pig (Sus scrofa) mitochondrial genome, containing 16613 bp, is presented in this report. The genome is not a specific length because of the presence of the variable numbers of tandem repeats, 5'-CGTGCGTACA in the displacement loop (D-loop). Genes responsible for 12S and 16S rRNAs, 22 tRNAs, and 13 protein-coding regions are found. The genome carries very few intergenic nucleotides with several instances of overlap between protein-coding or tRNA genes, except in the D-loop region. For evaluating the possible evolutionary relationships between Artiodactyla and Cetacea, the nucleotide substitutions and amino acid sequences of 13 protein-coding genes were aligned by pairwise comparisons of the pig, cow, and fin whale. By comparing these sequences, we suggest that there is a closer relationship between the pig and cow than that between either of these species and fin whale. In addition, the accumulation of transversions and gaps in pig 12S and 16S rRNA genes was compared with that in other eutherian species, including cow, fin whale, human, horse, and harbor seal. The results also reveal a close phylogenetic relationship between pig and cow, as compared to fin whale and others. Thus, according to the sequence differences of mitochondrial rRNA genes in eutherian species, the evolutionary separation of pig and cow occurred about 53-60 million years ago.
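
    Because the genome length varies with the number of copies of the 10-nt D-loop repeat, one quick way to compare individual sequences is to count the longest uninterrupted run of that motif. The following is a minimal sketch using only the motif quoted in the abstract; the function name is hypothetical, and it requires exact, back-to-back copies, which real data may not always provide.

```python
import re


def max_tandem_copies(seq, motif="CGTGCGTACA"):
    """Largest number of back-to-back exact copies of `motif` found in `seq`."""
    pattern = re.compile(f"(?:{re.escape(motif)})+")
    run_lengths = [m.end() - m.start() for m in pattern.finditer(seq.upper())]
    return max(run_lengths, default=0) // len(motif)
```

    Each additional copy lengthens the genome by 10 nt, so two individuals differing by three copies would differ in total length by 30 nt.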

  4. Complete Genome Sequence of Petrimonas sp. Strain IBARAKI, Assembled from the Metagenome Data of a Culture Containing Dehalococcoides spp.

    PubMed

    Ikegami, Kentaro; Aita, Yuto; Shiroma, Akino; Shimoji, Makiko; Tamotsu, Hinako; Ashimine, Noriko; Shinzato, Misuzu; Ohki, Shun; Nakano, Kazuma; Teruya, Kuniko; Satou, Kazuhito; Hirano, Takashi; Yohda, Masafumi

    2018-05-03

    The complete genome sequence of Petrimonas sp. strain IBARAKI in a Dehalococcoides-containing culture was determined using the PacBio RS II platform. The genome is a single circular chromosome of 3,693,233 nucleotides (nt), with a GC content of 44%. This is the first genome sequence of a Petrimonas species. Copyright © 2018 Ikegami et al.

  5. Complete genome sequence of the biofilm-forming Curtobacterium sp. strain BH-2-1-1, isolated from lettuce (Lactuca sativa) originating from a conventional field in Norway.

    PubMed

    Dees, Merete Wiken; Brurberg, May Bente; Lysøe, Erik

    2016-12-01

    Here, we present the 3,795,952 bp complete genome sequence of the biofilm-forming Curtobacterium sp. strain BH-2-1-1, isolated from conventionally grown lettuce (Lactuca sativa) from a field in Vestfold, Norway. The nucleotide sequence of this genome was deposited into NCBI GenBank under the accession CP017580.

  6. Complete genome sequence of Paris mosaic necrosis virus, a distinct member of the genus Potyvirus

    USDA-ARS's Scientific Manuscript database

    The complete genomic sequence of a novel potyvirus was determined from Paris polyphylla var. yunnanensis. Its genomic RNA consists of 9,660 nucleotides (nt) excluding the 3’-terminal poly (A) tail, containing a single open reading frame (ORF) encoding a large polyprotein. The virus shares 52.1-69.7%...

  7. Complete genome sequences of four avian paramyxoviruses of serotype 10 isolated from Rockhopper Penguins on the Falkland Islands

    USDA-ARS's Scientific Manuscript database

    The first complete genome sequences of four Avian paramyxovirus serotype 10 (APMV-10) isolates are described here. The viruses were isolated from Rockhopper Penguins sampled in 2007 on the Falkland Islands. All four genomes are 15,456 nucleotides in length and phylogenetic analyses show them to be c...

  8. Complete genome sequence of a divergent strain of lettuce chlorosis virus from Periwinkle in China

    USDA-ARS's Scientific Manuscript database

    A novel strain of Lettuce chlorosis virus (LCV) was identified from periwinkle in China (PW) with foliar interveinal chlorosis and plant dwarfing. Complete nucleotide (nt) sequences of genomic RNA1 and RNA2 of the virus are 8,602 nt and 8,456 nt, respectively. The genomic organization of LCV-PW rese...

  9. Complete genome sequence of switchgrass mosaic virus, a member of a proposed new species in the genus Marafivirus

    USDA-ARS's Scientific Manuscript database

    The complete genome sequence of a virus recently detected in switchgrass (Panicum virgatum) was determined and was found to be closely related to Maize rayado fino virus (MRFV), genus Marafivirus, family Tymoviridae. The genomic RNA is 6408 nucleotides long, excluding the poly (A) tail, and encodes...

  10. Asystasia mosaic Madagascar virus: a novel bipartite begomovirus infecting the weed Asystasia gangetica in Madagascar.

    PubMed

    De Bruyn, Alexandre; Harimalala, Mireille; Hoareau, Murielle; Ranomenjanahary, Sahondramalala; Reynaud, Bernard; Lefeuvre, Pierre; Lett, Jean-Michel

    2015-06-01

    Here, we describe for the first time the complete genome sequence of a new bipartite begomovirus in Madagascar isolated from the weed Asystasia gangetica (Acanthaceae), for which we propose the tentative name asystasia mosaic Madagascar virus (AMMGV). DNA-A and -B nucleotide sequences of AMMGV were only distantly related to known begomovirus sequences and shared the highest nucleotide sequence identity of 72.9 % (DNA-A) and 66.9 % (DNA-B) with a recently described bipartite begomovirus infecting Asystasia sp. in West Africa. Phylogenetic analysis demonstrated that this novel virus from Madagascar belongs to a new lineage of Old World bipartite begomoviruses.
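
    Several records in this list report percent nucleotide sequence identity. As a rough illustration of what such a figure measures, the Python sketch below computes identity over two sequences that are assumed to be already aligned. The function name and the gap convention (columns where both sequences have a gap are skipped; a gap against a base counts as a mismatch) are assumptions made here, and published tools may use other conventions.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent of compared alignment columns with identical bases."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == gap and b == gap:
            continue  # column carries no information for this pair
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable columns")
    return 100.0 * matches / compared

# Toy aligned pair (made up): 3 of 4 columns match.
print(percent_identity("ACGT", "ACGA"))
```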

  11. Analysis for complete genomic sequence of HLA-B and HLA-C alleles in the Chinese Han population.

    PubMed

    Zhu, F; He, Y; Zhang, W; He, J; He, J; Xu, X; Lv, H; Yan, L

    2011-08-01

    In the present study, we have determined the complete genomic sequence and analysed the intron polymorphism of a subset of HLA-B and HLA-C alleles in the Chinese Han population. DNA fragments of more than 3.0 kb from the HLA-B and HLA-C loci were amplified by polymerase chain reaction, spanning from the partial 5' untranslated region to the 3' noncoding region, and the amplified products were then sequenced. Full-length nucleotide sequences of 14 HLA-B alleles and 10 HLA-C alleles were obtained and have been submitted to GenBank and the IMGT/HLA database. Two novel alleles, HLA-B*52:01:01:02 and HLA-B*59:01:01:02, were identified, and the complete genomic sequence of HLA-B*52:01:01:01 is reported for the first time. In total, 157 and 167 polymorphic positions were found in the full-length genomic sequences of the HLA-B and HLA-C loci, respectively. Our results suggest that many single nucleotide polymorphisms exist in the exon and intron regions, and the data can provide useful information for understanding the evolution of HLA-B and HLA-C alleles. © 2011 Blackwell Publishing Ltd.

  12. The complete nucleotide sequence of RNA 3 of a peach isolate of Prunus necrotic ringspot virus.

    PubMed

    Hammond, R W; Crosslin, J M

    1995-04-01

    The complete nucleotide sequence of RNA 3 of the PE-5 peach isolate of Prunus necrotic ringspot ilarvirus (PNRSV) was obtained from cloned cDNA. The RNA sequence is 1941 nucleotides and contains two open reading frames (ORFs). ORF 1 consisted of 284 amino acids with a calculated molecular weight of 31,729 Da and ORF 2 contained 224 amino acids with a calculated molecular weight of 25,018 Da. ORF 2 corresponds to the coat protein gene. Expression of ORF 2 engineered into a pTrcHis vector in Escherichia coli results in a fusion polypeptide of approximately 28 kDa which cross-reacts with PNRSV polyclonal antiserum. Analysis of the coat protein amino acid sequence reveals a putative "zinc-finger" domain at the amino-terminal portion of the protein. Two tetranucleotide AUGC motifs occur in the 3'-UTR of the RNA and may function in coat protein binding and genome activation. ORF 1 homologies to other ilarviruses and alfalfa mosaic virus are confined to limited regions of conserved amino acids. The translated amino acid sequence of the coat protein gene shows 92% similarity to one isolate of apple mosaic virus, a closely related member of the ilarvirus group of plant viruses, but only 66% similarity to the amino acid sequence of the coat protein gene of a second isolate. These relationships are also reflected at the nucleotide sequence level. These results in one instance confirm the close similarities observed at the biophysical and serological levels between these two viruses, but on the other hand call into question the nomenclature used to describe these viruses.

  13. Completion of full length genome sequence of novel avian paramyxovirus strain APMV/Shimane67 isolated from migratory wild geese in Japan.

    PubMed

    Yamamoto, Eiji; Ito, Toshihiro; Ito, Hiroshi

    2016-11-01

    The nucleotide sequences of nucleocapsid protein (N); phosphoprotein (P); matrix protein (M); hemagglutinin-neuraminidase (HN); and large polymerase protein (L) genes, 3'-end leader, 5'-end trailer and intergenic regions of the avian paramyxovirus (APMV) strain goose/Shimane/67/2000 (APMV/Shimane67) were determined. Together with previously reported data on fusion protein (F) gene sequence [46], the determination of the genome sequence of APMV/Shimane67 has been completed in this study. The genome of APMV/Shimane67 is 16,146 nucleotides in length and contains six genes in the order of 3'-N-P-M-F-HN-L-5'. The features of the APMV/Shimane67 genome (e.g., nucleotide length of whole genome and each of the six genes, and predicted amino acid length of each of the six genes) were distinct from those of other APMV serotypes. Phylogenetic analysis indicated that although APMV/Shimane67 was grouped with APMV-1, -9 and -12, the evolutionary distance between APMV/Shimane67 and these viruses was longer than that observed between intra-serotype viruses. These results show that the genome sequence of APMV/Shimane67 contains specific characteristics and is distinguishable from other types of APMV.

  14. A first report and complete genome sequence of alfalfa enamovirus from Sudan

    USDA-ARS's Scientific Manuscript database

    A full genome sequence of a viral pathogen, provisionally named alfalfa enamovirus 2 (AEV-2), was reconstructed from short reads obtained by Illumina RNA sequencing of an alfalfa sample originating from Sudan. Ambiguous nucleotides in the resultant consensus assembly and identity of the predicted virus...

  15. Genetic characterization of strains of Saccharomyces uvarum from New Zealand wineries.

    PubMed

    Zhang, Hanyao; Richards, Keith D; Wilson, Sandra; Lee, Soon A; Sheehan, Hester; Roncoroni, Miguel; Gardner, Richard C

    2015-04-01

    We present a genetic characterization of 65 isolates of Saccharomyces uvarum isolated from wineries in New Zealand, along with the complete nucleotide sequence of a single sulfite-tolerant isolate. The genome of the New Zealand isolate averaged 99.85% nucleotide identity to CBS7001, the previously sequenced strain of S. uvarum. However, three genomic segments (37-87 kb) showed 10% nucleotide divergence from CBS7001 but 99% identity to Saccharomyces eubayanus. We conclude that these three segments appear to have been introgressed from that species. The nucleotide sequences of the internal transcribed spacer (ITS) region from other New Zealand isolates were also very similar to that of CBS7001, and hybrids showed complete genetic compatibility for some strains, with tetrads giving four viable progeny that showed 2:2 segregations of marker genes. Some strains showed high tolerance to sulfite, with genetic analysis indicating linkage of this trait to the transcription factor FZF1, but not to SSU1, the sulfite efflux pump that it regulates in order to confer sulfite tolerance in Saccharomyces cerevisiae. Selected strains of S. uvarum showed exceptionally good cold fermentation characteristics, superior to the best commercially available strains of S. cerevisiae. Copyright © 2014 Elsevier Ltd. All rights reserved.

  16. First Complete Genome Sequence of Papaya ringspot virus-W Isolated from a Gourd in the United States.

    PubMed

    Ali, Akhtar

    2017-01-12

    In the United States, the Papaya ringspot virus was first reported from papaya in Florida in 1949. Here, we determined the first complete genome sequence (10,302 nucleotides) of a Papaya ringspot virus-W isolate, which was collected from a commercial field of gourd in Tulsa, OK. Copyright © 2017 Ali.

  17. Molecular Characterization of the Meyer Lemon Isolate of Citrus Tatter Leaf Virus: Complete Genome Sequence and Development of Biologically Active In Vitro Transcripts

    USDA-ARS's Scientific Manuscript database

    Citrus tatter leaf virus isolated from Meyer lemon trees (CTLV-ML) from California and Florida induces bud union incompatibility of citrus trees grafted on the widely used trifoliate and trifoliate hybrid rootstocks. The complete genome sequence of CTLV-ML was determined to be 6,495 nucleotides (nts...

  18. Complete Genome Sequences of Four Avian Paramyxoviruses of Serotype 10 Isolated from Rockhopper Penguins on the Falkland Islands

    PubMed Central

    Goraichuk, Iryna V.; Dimitrov, Kiril M.; Sharma, Poonam; Miller, Patti J.; Swayne, David E.; Suarez, David L.

    2017-01-01

    The first complete genome sequences of four avian paramyxovirus serotype 10 (APMV-10) isolates are described here. The viruses were isolated from rockhopper penguins on the Falkland Islands, sampled in 2007. All four genomes are 15,456 nucleotides in length, and phylogenetic analyses show them to be closely related. PMID:28572332

  19. Differential sequence diversity at merozoite surface protein-1 locus of Plasmodium knowlesi from humans and macaques in Thailand.

    PubMed

    Putaporntip, Chaturong; Thongaree, Siriporn; Jongwutiwes, Somchai

    2013-08-01

    To determine the genetic diversity and potential transmission routes of Plasmodium knowlesi, we analyzed the complete nucleotide sequence of the gene encoding the merozoite surface protein-1 of this simian malaria (Pkmsp-1), an asexual blood-stage vaccine candidate, from naturally infected humans and macaques in Thailand. Analysis of Pkmsp-1 sequences from humans (n=12) and monkeys (n=12) reveals five conserved and four variable domains. Most nucleotide substitutions in conserved domains were dimorphic whereas three of four variable domains contained complex repeats with extensive sequence and size variation. Besides purifying selection in conserved domains, evidence of intragenic recombination scattered across Pkmsp-1 was detected. The number of haplotypes, haplotype diversity, nucleotide diversity and recombination sites of human-derived sequences exceeded those of monkey-derived sequences. Phylogenetic networks based on concatenated conserved sequences of Pkmsp-1 displayed a character pattern that could have arisen from the sampling process or the presence of two independent routes of P. knowlesi transmission, i.e. from macaques to humans and from human to human in Thailand. Copyright © 2013 Elsevier B.V. All rights reserved.

  20. Complete genome sequence of a novel Plum pox virus strain W isolate determined by 454 pyrosequencing.

    PubMed

    Sheveleva, Anna; Kudryavtseva, Anna; Speranskaya, Anna; Belenikin, Maxim; Melnikova, Natalia; Chirkov, Sergei

    2013-10-01

    The near-complete (99.7 %) genome sequence of a novel Russian Plum pox virus (PPV) isolate Pk, belonging to the strain Winona (W), has been determined by 454 pyrosequencing with the exception of the thirty-one 5'-terminal nucleotides. This region was amplified using a 5' RACE kit and sequenced by the Sanger method. Genomic RNA released from immunocaptured PPV particles was employed for generation of a cDNA library using the TransPlex Whole transcriptome amplification kit (WTA2, Sigma-Aldrich). The entire Pk genome has an identity level of 92.8-94.5 % when compared to the complete nucleotide sequences of other PPV-W isolates (W3174, LV-141pl, LV-145bt, and UKR 44189), confirming a high degree of variability within the PPV-W strain. The isolates Pk and LV-141pl are most closely related. Pk was found in a wild plum (Prunus domestica) in a new region of Russia, indicating widespread dissemination of the PPV-W strain in the European part of the former USSR.

  1. Whole-genome random sequencing and assembly of Haemophilus influenzae Rd

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Fleischmann, R.D.; Adams, M.D.; White, O.

    1995-07-28

    An approach for genome analysis based on sequencing and assembly of unselected pieces of DNA from the whole chromosome has been applied to obtain the complete nucleotide sequence (1,830,137 base pairs) of the genome from the bacterium Haemophilus influenzae Rd. This approach eliminates the need for initial mapping efforts and is therefore applicable to the vast array of microbial species for which genome maps are unavailable. The H. influenzae Rd genome sequence (Genome Sequence DataBase accession number L42023) represents the only complete genome sequence from a free-living organism. 46 refs., 4 figs., 4 tabs.

  2. Sequence of rat alpha- and gamma-casein mRNAs: evolutionary comparison of the calcium-dependent rat casein multigene family.

    PubMed Central

    Hobbs, A A; Rosen, J M

    1982-01-01

    The complete sequences of rat alpha- and gamma-casein mRNAs have been determined. The 1402-nucleotide alpha- and 864-nucleotide gamma-casein mRNAs both encode 15 amino acid signal peptides and mature proteins of 269 and 164 residues, respectively. Considerable homology between the 5' non-coding regions, and the regions encoding the signal peptides and the phosphorylation sites, in these mRNAs as compared to several other rodent casein mRNAs, was observed. Significant homology was also detected between rat alpha- and bovine alpha s1-casein. Comparison of the rodent and bovine sequences suggests that the caseins evolved at about the time of the appearance of the primitive mammals. This may have occurred by intragenic duplication of a nucleotide sequence encoding a primitive phosphorylation site, -(Ser)n-Glu-Glu-, and intergenic duplication resulting in the small casein multigene family. A unique feature of the rat alpha-casein sequence is an insertion in the coding region containing 10 repeated elements of 18 nucleotides each. This insertion appears to have occurred 7-12 million years ago, just prior to the divergence of rat and mouse. PMID:6298707
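
    The mRNA and protein sizes quoted above can be related by simple codon arithmetic. The Python sketch below is only an illustration: it assumes three nucleotides per residue plus a single stop codon, and it treats whatever remains of the quoted mRNA length as non-coding sequence. Neither assumption is stated in the abstract, and the names are hypothetical.

```python
def coding_length_nt(signal_aa, mature_aa, stop_codons=1):
    """Nucleotides needed to encode signal peptide + mature protein (+ stop)."""
    return 3 * (signal_aa + mature_aa + stop_codons)

# Values quoted in the abstract:
# (mRNA length in nt, signal peptide residues, mature protein residues)
CASEINS = {"alpha": (1402, 15, 269), "gamma": (864, 15, 164)}

for name, (mrna_nt, signal_aa, mature_aa) in CASEINS.items():
    coding = coding_length_nt(signal_aa, mature_aa)
    print(f"{name}-casein: coding {coding} nt, remaining {mrna_nt - coding} nt")

# The alpha-casein insertion: 10 repeated elements of 18 nt each.
INSERTION_NT = 10 * 18
```

    Under these assumptions the insertion accounts for 180 nt, or 60 codons, of the alpha-casein coding region.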

  3. Genetic characterization of avian metapneumovirus subtype C isolated from pheasants in a live bird market.

    PubMed

    Lee, Eun ho; Song, Min-Suk; Shin, Jin-Young; Lee, Young-Min; Kim, Chul-Joong; Lee, Young Sik; Kim, Hyunggee; Choi, Young Ki

    2007-09-01

    Complete nucleotide sequences of two avian metapneumoviruses (aMPV), designated PL-1 and PL-2, were isolated from pheasants, revealing novel sequences of the first aMPV to be fully sequenced in Korea. The complete genome of both PL-1 and PL-2 was composed of 13,170 nucleotides. Phylogenetic analysis revealed that PL-1 belonged to aMPV subtype C, sharing higher homology in deduced amino acid sequence identities with hMPV, rather than with aMPV subtypes A and B. Replication of PL-1 in experimentally re-infected pheasants was confirmed by reverse transcription (RT)-polymerase chain reaction (PCR). Chickens and mice were experimentally inoculated with PL-1 to test the replication potential of PL-1 in other species. Although one specimen from the nasal turbinates of an inoculated chicken showed a slight trace of viral replication at 3 days post-infection (dpi), all of the infected mice were negative for aMPV by RT-PCR throughout the experiment, suggesting that PL-1 does not readily infect mammals. This is the first report of the isolation and complete genomic sequence of aMPV subtype C originating from pheasants.

  4. Complete Genome Sequence of the Largest Known Flavi-Like Virus, Diaphorina citri flavi-like virus, a Novel Virus of the Asian Citrus Psyllid, Diaphorina citri.

    PubMed

    Matsumura, Emilyn E; Nerva, Luca; Nigg, Jared C; Falk, Bryce W; Nouri, Shahideh

    2016-09-08

    A novel flavi-like virus tentatively named Diaphorina citri flavi-like virus (DcFLV) was identified in field populations of Diaphorina citri through small RNA and transcriptome sequencing followed by reverse transcription (RT)-PCR. We report here the complete nucleotide sequence and genome organization of DcFLV, the largest flavi-like virus identified to date. Copyright © 2016 Matsumura et al.

  5. Complete Genome Sequence of Diaphorina citri-associated C virus, a Novel Putative RNA Virus of the Asian Citrus Psyllid, Diaphorina citri.

    PubMed

    Nouri, Shahideh; Salem, Nidà; Falk, Bryce W

    2016-07-21

    We present here the complete nucleotide sequence and genome organization of a novel putative RNA virus identified in field populations of the Asian citrus psyllid, Diaphorina citri, through sequencing of the transcriptome followed by reverse transcription-PCR (RT-PCR). We tentatively named this virus Diaphorina citri-associated C virus (DcACV). DcACV is an unclassified positive-sense RNA virus. Copyright © 2016 Nouri et al.

  6. The complete nucleotide sequence and genomic characterization of tropical soda apple mosaic virus

    USDA-ARS's Scientific Manuscript database

    Tropical soda apple mosaic virus (TSAMV) was first identified in tropical soda apple (Solanum viarum), a noxious weed, in Florida in 2002. This report provides the first full genome sequence of TSAMV. The full genome sequence of this virus will enable research scientists to develop additional spec...

  7. Complete genomic sequence of a Tobacco rattle virus isolate from Michigan-grown potatoes.

    PubMed

    Crosslin, James M; Hamm, Philip B; Kirk, William W; Hammond, Rosemarie W

    2010-04-01

    Tobacco rattle virus (TRV) causes stem mottle on potato leaves and necrotic arcs and rings in potato tubers, known as corky ringspot disease. Recently, TRV was reported in Michigan potato tubers cv. FL1879 exhibiting corky ringspot disease. Sequence analysis of the RNA-1-encoded 16-kDa gene of the Michigan isolate, designated MI-1, revealed homology to TRV isolates from Florida and Washington. Here, we report the complete genomic sequence of RNA-1 (6,791 nt) and RNA-2 (3,685 nt) of TRV MI-1. RNA-1 is predicted to contain four open reading frames, and the genome structure and phylogenetic analyses of the RNA-1 nucleotide sequence revealed significant homologies to the known sequences of other TRV-1 isolates. The relationships based on the full-length nucleotide sequence were different from those based on the 16-kDa gene encoded on genomic RNA-1 and reflect sequence variation within a 20-25-aa residue region of the 16-kDa protein. MI-1 RNA-2 is predicted to contain three ORFs, encoding the coat protein (CP), a 37.6-kDa protein (ORF 2b), and a 33.6-kDa protein (ORF 2c). In addition, it contains a region of similarity to the 3' terminus of RNA-1, including a truncated portion of the 16-kDa cistron. Phylogenetic analysis of RNA-2, based on a comparison of nucleotide sequences with other members of the genus Tobravirus, indicates that TRV MI-1 and other North American isolates cluster as a distinct group. TRV MI-1 is only the second North American isolate for which there is a complete sequence of the genome, and it is distinct from the North American isolate TRV ORY. The relationship of the TRV MI-1 isolate to other tobravirus isolates is discussed.

  8. Complete genome sequence of duck Tembusu virus, isolated from Muscovy ducks in southern China.

    PubMed

    Zhu, Wanjun; Chen, Jidang; Wei, Chunya; Wang, Heng; Huang, Zhen; Zhang, Minze; Tang, Fengfeng; Xie, Jiexiong; Liang, Huanbin; Zhang, Guihong; Su, Shuo

    2012-12-01

    We report here the complete genomic sequence of the duck Tembusu virus (DTMUV) WJ-1 strain, isolated from Muscovy ducks. This is the first complete genome sequence of DTMUV reported in southern China. Compared with the other strains (TA, GH-2, YY5, and ZJ-407) that were previously found in eastern China, WJ-1 bears a few differences in the nucleotide and amino acid sequences. We found that there are 47 mutations of amino acids encoded by the whole open reading frame (ORF) among these five strains. The whole-genome sequence of DTMUV will help in understanding the epidemiology and molecular characteristics of duck Tembusu virus in southern China.

  9. Nucleotide sequence of an exceptionally long 5.8S ribosomal RNA from Crithidia fasciculata.

    PubMed Central

    Schnare, M N; Gray, M W

    1982-01-01

    In Crithidia fasciculata, a trypanosomatid protozoan, the large ribosomal subunit contains five small RNA species (e, f, g, i, j) in addition to 5S rRNA [Gray, M.W. (1981) Mol. Cell. Biol. 1, 347-357]. The complete primary sequence of species i is shown here to be pAACGUGUmCGCGAUGGAUGACUUGGCUUCCUAUCUCGUUGA ... AGAmACGCAGUAAAGUGCGAUAAGUGGUApsiCAAUUGmCAGAAUCAUUCAAUUACCGAAUCUUUGAACGAAACGG ... CGCAUGGGAGAAGCUCUUUUGAGUCAUCCCCGUGCAUGCCAUAUUCUCCAmGUGUCGAA(C)OH. This sequence establishes that species i is a 5.8S rRNA, despite its exceptional length (171-172 nucleotides). The extra nucleotides in C. fasciculata 5.8S rRNA are located in a region whose primary sequence and length are highly variable among 5.8S rRNAs, but which is capable of forming a stable hairpin loop structure (the "G+C-rich hairpin"). The sequence of C. fasciculata 5.8S rRNA is no more closely related to that of another protozoan, Acanthamoeba castellanii, than it is to representative 5.8S rRNA sequences from the other eukaryotic kingdoms, emphasizing the deep phylogenetic divisions that seem to exist within the Kingdom Protista. PMID:7079176

  10. Complete genome sequence analysis of a duck circovirus from Guangxi pockmark ducks.

    PubMed

    Xie, Liji; Xie, Zhixun; Zhao, Guangyuan; Liu, Jiabo; Pang, Yaoshan; Deng, Xianwen; Xie, Zhiqin; Fan, Qing

    2012-12-01

    We report here the complete genomic sequence of a novel duck circovirus (DuCV) strain, GX1104, isolated from Guangxi pockmark ducks in Guangxi, China. The whole nucleotide sequence had the highest homology (97.2%) with the sequence of strain TC/2002 (GenBank accession number AY394721.1) and had a low homology (76.8% to 78.6%) with the sequences of other strains isolated from China, Germany, and the United States. This report will help to understand the epidemiology and molecular characteristics of Guangxi pockmark duck circovirus in southern China.

  11. The complete nucleotide sequences of the five genetically distinct plastid genomes of Oenothera, subsection Oenothera: I. sequence evaluation and plastome evolution.

    PubMed

    Greiner, Stephan; Wang, Xi; Rauwolf, Uwe; Silber, Martina V; Mayer, Klaus; Meurer, Jörg; Haberer, Georg; Herrmann, Reinhold G

    2008-04-01

    The flowering plant genus Oenothera is uniquely suited for studying molecular mechanisms of speciation. It assembles an intriguing combination of genetic features, including permanent translocation heterozygosity, biparental transmission of plastids, and a general interfertility of well-defined species. This allows an exchange of plastids and nuclei between species often resulting in plastome-genome incompatibility. For evaluation of its molecular determinants we present the complete nucleotide sequences of the five basic, genetically distinguishable plastid chromosomes of subsection Oenothera (=Euoenothera) of the genus, which are associated in distinct combinations with six basic genomes. Sizes of the chromosomes range from 163 365 bp (plastome IV) to 165 728 bp (plastome I), display between 96.3% and 98.6% sequence similarity and encode a total of 113 unique genes. Plastome diversification is caused by an abundance of nucleotide substitutions, small insertions, deletions and repetitions. The five plastomes deviate from the general ancestral design of plastid chromosomes of vascular plants by a subsection-specific 56 kb inversion within the large single-copy segment. This inversion disrupted operon structures and predates the divergence of the subsection presumably 1 My ago. Phylogenetic relationships suggest plastomes I-III in one clade, while plastome IV appears to be closest to the common ancestor.

  12. The complete nucleotide sequences of the five genetically distinct plastid genomes of Oenothera, subsection Oenothera: I. Sequence evaluation and plastome evolution†

    PubMed Central

    Greiner, Stephan; Wang, Xi; Rauwolf, Uwe; Silber, Martina V.; Mayer, Klaus; Meurer, Jörg; Haberer, Georg; Herrmann, Reinhold G.

    2008-01-01

    The flowering plant genus Oenothera is uniquely suited for studying molecular mechanisms of speciation. It assembles an intriguing combination of genetic features, including permanent translocation heterozygosity, biparental transmission of plastids, and a general interfertility of well-defined species. This allows an exchange of plastids and nuclei between species often resulting in plastome–genome incompatibility. For evaluation of its molecular determinants we present the complete nucleotide sequences of the five basic, genetically distinguishable plastid chromosomes of subsection Oenothera (=Euoenothera) of the genus, which are associated in distinct combinations with six basic genomes. Sizes of the chromosomes range from 163 365 bp (plastome IV) to 165 728 bp (plastome I), display between 96.3% and 98.6% sequence similarity and encode a total of 113 unique genes. Plastome diversification is caused by an abundance of nucleotide substitutions, small insertions, deletions and repetitions. The five plastomes deviate from the general ancestral design of plastid chromosomes of vascular plants by a subsection-specific 56 kb inversion within the large single-copy segment. This inversion disrupted operon structures and predates the divergence of the subsection presumably 1 My ago. Phylogenetic relationships suggest plastomes I–III in one clade, while plastome IV appears to be closest to the common ancestor. PMID:18299283

  13. Second generation DNA sequencing of the mitogenome of the Chinstrap penguin and comparative genomics of Antarctic penguins.

    PubMed

    Subramanian, Sankar; Lingala, Syamala Gowri; Swaminathan, Siva; Huynen, Leon; Lambert, David

    2014-08-01

    The complete mitochondrial genome of the Chinstrap penguin (Pygoscelis antarcticus) was sequenced and compared with other penguin mitogenomes. The genome is 15,972 bp in length with the number and order of protein coding genes and RNAs being very similar to that of other known penguin mitogenomes. Comparative nucleotide analysis showed the Chinstrap mitogenome shares 94% homology with the mitogenome of its sister species, Pygoscelis adelie (Adélie penguin). Divergence at nonsynonymous nucleotide positions was found to be up to 23 times less than that observed in synonymous positions of protein coding genes, suggesting high selection constraints. The complete mitogenome data will be useful for genetic and evolutionary studies of penguins.

  14. Complete nucleotide sequences of okra isolates of Cotton leaf curl Gezira virus and their associated DNA-beta from Niger.

    PubMed

    Shih, S L; Kumar, S; Tsai, W S; Lee, L M; Green, S K

    2009-01-01

    Okra (Abelmoschus esculentus) is a major crop in Niger. In the fall of 2007, okra leaf curl disease was observed in Niger, and a begomovirus and a DNA-beta satellite were found to be associated with the disease. The complete nucleotide sequences of DNA-A (FJ469626 and FJ469627) and associated DNA-beta satellites (FJ469628 and FJ469629) were determined from two samples. This is the first report of molecular characterization of an okra-infecting begomovirus and its associated DNA-beta from Niger. The begomovirus and DNA-beta have been identified as Cotton leaf curl Gezira virus and Cotton leaf curl Gezira betasatellite, respectively, which are reported to also infect okra in Egypt, Mali and Sudan.

  15. INFO-RNA--a server for fast inverse RNA folding satisfying sequence constraints.

    PubMed

    Busch, Anke; Backofen, Rolf

    2007-07-01

    INFO-RNA is a new web server for designing RNA sequences that fold into a user-given secondary structure. Furthermore, constraints on the sequence can be specified, e.g. one can restrict sequence positions to a fixed nucleotide or to a set of nucleotides. Moreover, the user can allow violations of the constraints at some positions, which can be advantageous in complicated cases. The INFO-RNA web server allows biologists to design RNA sequences in an automatic manner. It is clearly and intuitively arranged and easy to use. The procedure is fast, as most applications are completed within seconds and it proceeds better and faster than other existing tools. The INFO-RNA web server is freely available at http://www.bioinf.uni-freiburg.de/Software/INFO-RNA/
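
    The idea of restricting positions to a fixed nucleotide or to a set of nucleotides, with a tolerated number of violations, can be pictured with IUPAC ambiguity codes. The Python sketch below is a hypothetical illustration of that kind of constraint check only; it is not INFO-RNA's interface or algorithm, and the function names and example strings are made up.

```python
# IUPAC nucleotide ambiguity codes for RNA: each code names a set of bases.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "U": "U",
    "R": "AG", "Y": "CU", "S": "CG", "W": "AU", "K": "GU", "M": "AC",
    "B": "CGU", "D": "AGU", "H": "ACU", "V": "ACG", "N": "ACGU",
}

def count_violations(candidate, constraint):
    """Count positions where the candidate base is outside the allowed set."""
    if len(candidate) != len(constraint):
        raise ValueError("candidate and constraint must have equal length")
    return sum(
        base not in IUPAC[code]
        for base, code in zip(candidate.upper(), constraint.upper())
    )

def satisfies(candidate, constraint, max_violations=0):
    """True if the candidate breaks at most max_violations positions."""
    return count_violations(candidate, constraint) <= max_violations

# Position 4 must be a purine (R), positions 5-6 must be A, position 9 a pyrimidine (Y).
print(satisfies("GGGAAACCC", "NNNRAANNY"))
print(satisfies("GGGCAACCC", "NNNRAANNY", max_violations=1))
```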

  16. INFO-RNA—a server for fast inverse RNA folding satisfying sequence constraints

    PubMed Central

    Busch, Anke; Backofen, Rolf

    2007-01-01

    INFO-RNA is a new web server for designing RNA sequences that fold into a user-given secondary structure. Furthermore, constraints on the sequence can be specified, e.g. one can restrict sequence positions to a fixed nucleotide or to a set of nucleotides. Moreover, the user can allow violations of the constraints at some positions, which can be advantageous in complicated cases. The INFO-RNA web server allows biologists to design RNA sequences in an automatic manner. It is clearly and intuitively arranged and easy to use. The procedure is fast, as most applications are completed within seconds and it proceeds better and faster than other existing tools. The INFO-RNA web server is freely available at http://www.bioinf.uni-freiburg.de/Software/INFO-RNA/ PMID:17452349

  17. Nucleotide sequence of the phosphoglycerate kinase gene from the extreme thermophile Thermus thermophilus. Comparison of the deduced amino acid sequence with that of the mesophilic yeast phosphoglycerate kinase.

    PubMed Central

    Bowen, D; Littlechild, J A; Fothergill, J E; Watson, H C; Hall, L

    1988-01-01

    Using oligonucleotide probes derived from amino acid sequencing information, the structural gene for phosphoglycerate kinase from the extreme thermophile, Thermus thermophilus, was cloned in Escherichia coli and its complete nucleotide sequence determined. The gene consists of an open reading frame corresponding to a protein of 390 amino acid residues (calculated Mr 41,791) with an extreme bias for G or C (93.1%) in the codon third base position. Comparison of the deduced amino acid sequence with that of the corresponding mesophilic yeast enzyme indicated a number of significant differences. These are discussed in terms of the unusual codon bias and their possible role in enhanced protein thermal stability. PMID:3052437
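
    The third-position G or C bias quoted above is a simple count over codons. As a minimal Python sketch (the function name is hypothetical and the toy sequence is not the Thermus gene), it can be computed from an in-frame coding sequence as follows:

```python
def gc3_fraction(cds):
    """Fraction of codons whose third base is G or C.

    Assumption: `cds` is an in-frame coding sequence (DNA or RNA letters)
    whose length is a multiple of three.
    """
    cds = cds.upper()
    if not cds or len(cds) % 3 != 0:
        raise ValueError("expected a non-empty, in-frame coding sequence")
    third_bases = cds[2::3]  # every third base, starting at codon position 3
    return sum(base in "GC" for base in third_bases) / len(third_bases)


# Toy example: codons ATG GCC AAG TAA have third bases G, C, G, A.
print(gc3_fraction("ATGGCCAAGTAA"))  # 0.75
```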

  18. Genome sequences of a mouse-avirulent and a mouse-virulent strain of Ross River virus.

    PubMed

    Faragher, S G; Meek, A D; Rice, C M; Dalgarno, L

    1988-04-01

    The nucleotide sequence of the genomic RNA of a mouse-avirulent strain of Ross River virus, RRV NB5092 (isolated in 1969), has been determined and the corresponding sequence for the prototype mouse-virulent strain, RRV T48 (isolated in 1959), has been completed. The RRV NB5092 genome is approximately 11,674 nucleotides in length, compared with 11,853 nucleotides for RRV T48. RRV NB5092 and RRV T48 have the same genome organization. For both viruses an untranslated region of 80 nucleotides at the 5' end of the genome is followed by a 7440-nucleotide open reading frame which is interrupted after 5586 nucleotides by a single opal termination codon. By homology with other alphaviruses, the 5586-nucleotide open reading frame encodes the nonstructural proteins nsP1, nsP2, and nsP3; a fourth nonstructural protein, nsP4, is produced by read-through of the opal codon. The RRV nonstructural proteins show strong homology with the corresponding proteins of Sindbis virus and Semliki Forest virus in terms of size, net charge, and hydropathy characteristics. However, homology is not uniform between or within the proteins; nsP1, nsP2, and nsP4 contain extended domains which are highly conserved between alphaviruses, while the C-terminal region of nsP3 shows little conservation in sequence or length between alphaviruses. An untranslated "junction" region of 44 nucleotides (for RRV NB5092) or 47 nucleotides (for RRV T48) separates the nonstructural and structural protein coding regions. The structural proteins (capsid-E3-E2-6K-E1) are translated from an open reading frame of 3762 nucleotides which is followed by a 3'-untranslated region of approximately 348 nucleotides (for RRV NB5092) or 524 nucleotides (for RRV T48). Excluding deletions and insertions, the genomes of RRV NB5092 and RRV T48 differ at 284 nucleotides, representing a sequence divergence of 2.38%. 
Sequence deletions or insertions were found only in the noncoding regions and include a 173-nucleotide deletion in the 3'-untranslated region of RRV NB5092, compared with RRV T48. In the coding regions, most of the nucleotide differences are silent; there are 36 amino acid differences in the nonstructural proteins and 12 in the structural proteins. The distribution of amino acid differences between the two RRV strains correlates with the location of domains which are poorly conserved in sequence between alphaviruses. The possible role of amino acid differences in envelope glycoproteins E1 and E2 in determining the different antigenic and biological properties of RRV NB5092 and RRV T48 is discussed.
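
    A divergence figure that excludes deletions and insertions, as in the record above, can be obtained from a pairwise alignment by skipping gapped columns. The following Python sketch shows that calculation under stated assumptions: the two sequences are already aligned to equal length, gaps are written as "-", and the function name and toy sequences are hypothetical. It is not the authors' procedure and is not intended to reproduce their reported value.

```python
def percent_divergence(aligned_a, aligned_b, gap="-"):
    """Percent of differing positions between two aligned sequences.

    Columns where either sequence has a gap are excluded, so insertions
    and deletions do not count as differences.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    differences = 0
    for x, y in zip(aligned_a, aligned_b):
        if x == gap or y == gap:
            continue  # skip indel columns
        compared += 1
        if x != y:
            differences += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * differences / compared


# Toy alignment: 7 ungapped columns, 1 mismatch.
print(round(percent_divergence("ACGU-ACGU", "ACGAUAC-U"), 2))  # 14.29
```

    Percent identity, the measure used in most of the other records in this listing, is 100 minus this value when computed over the same columns.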

  19. Complete mitochondrial genome sequences of Brassica rapa (Chinese cabbage and mizuna), and intraspecific differentiation of cytoplasm in B. rapa and Brassica juncea.

    PubMed

    Hatono, Saki; Nishimura, Kaori; Murakami, Yoko; Tsujimura, Mai; Yamagishi, Hiroshi

    2017-09-01

    The complete sequence of the mitochondrial genome was determined for two cultivars of Brassica rapa. After determining the sequence of a Chinese cabbage variety, 'Oushou hakusai', the sequence of a mizuna variety, 'Chusei shiroguki sensuji kyomizuna', was mapped against the sequence of Chinese cabbage. The precise sequences where the two varieties demonstrated variation were ascertained by direct sequencing. It was found that the mitochondrial genomes of the two varieties are identical over 219,775 bp, with a single nucleotide polymorphism (SNP) between the genomes. Because B. rapa is the maternal species of an amphidiploid crop species, Brassica juncea, the distribution of the SNP was observed both in B. rapa and B. juncea. While the mizuna type SNP was restricted mainly to cultivars of mizuna (japonica group) in B. rapa, the mizuna type was widely distributed in B. juncea. The finding that the two Brassica species have these SNP types in common suggests that the nucleotide substitution occurred in wild B. rapa before both mitotypes were domesticated. It was further inferred that the interspecific hybridization between B. rapa and B. nigra took place twice and resulted in the two mitotypes of cultivated B. juncea.

  20. Sequence diversity of wheat mosaic virus isolates.

    PubMed

    Stewart, Lucy R

    2016-02-02

    Wheat mosaic virus (WMoV), transmitted by eriophyid wheat curl mites (Aceria tosichella) is the causal agent of High Plains disease in wheat and maize. WMoV and other members of the genus Emaravirus evaded thorough molecular characterization for many years due to the experimental challenges of mite transmission and manipulating multisegmented negative sense RNA genomes. Recently, the complete genome sequence of a Nebraska isolate of WMoV revealed eight segments, plus a variant sequence of the nucleocapsid protein-encoding segment. Here, near-complete and partial consensus sequences of five more WMoV isolates are reported and compared to the Nebraska isolate: an Ohio maize isolate (GG1), a Kansas barley isolate (KS7), and three Ohio wheat isolates (H1, K1, W1). Results show two distinct groups of WMoV isolates: Ohio wheat isolate RNA segments had 84% or lower nucleotide sequence identity to the NE isolate, whereas GG1 and KS7 had 98% or higher nucleotide sequence identity to the NE isolate. Knowledge of the sequence variability of WMoV isolates is a step toward understanding virus biology, and potentially explaining observed biological variation.

  1. De novo assembly of mitochondrial genomes provides insights into genetic diversity and molecular evolution in wild boars and domestic pigs.

    PubMed

    Ni, Pan; Bhuiyan, Ali Akbar; Chen, Jian-Hai; Li, Jingjin; Zhang, Cheng; Zhao, Shuhong; Du, Xiaoyong; Li, Hua; Yu, Hui; Liu, Xiangdong; Li, Kui

    2018-06-01

    To date, the scarcity of publicly available complete mitochondrial sequences for European wild pigs has hampered a deeper understanding of the genetic changes following domestication. Here, we have assembled 26 de novo mtDNA sequences of European wild boars from next generation sequencing (NGS) data and downloaded 174 complete mtDNA sequences to assess the genetic relationship, nucleotide diversity, and selection. The Bayesian consensus tree reveals a clear divergence between the European and Asian clades and a very small portion (10 out of 200 samples) of maternal introgression. The overall nucleotide diversity of the mtDNA sequences has been reduced following domestication. Interestingly, the selection efficiencies in both European and Asian domestic pigs are reduced, probably because of changes in both selection constraints and maternal population size following domestication. This study suggests that de novo assembled mitogenomes can be a great boon for uncovering the genetic turnover following domestication. Further investigation is warranted to include more samples from the ever-increasing amounts of NGS data to help us better understand the process of domestication.

  2. [Sequencing and analysis of complete genome of rabies viruses isolated from Chinese Ferret-Badger and dog in Zhejiang province].

    PubMed

    Lei, Yong-Liang; Wang, Xiao-Guang; Tao, Xiao-Yan; Li, Hao; Meng, Sheng-Li; Chen, Xiu-Ying; Liu, Fu-Ming; Ye, Bi-Feng; Tang, Qing

    2010-01-01

    The full-length genomes of four rabies viruses isolated from Chinese ferret-badger and dog were sequenced in order to analyze the genetic variation of rabies viruses at the molecular level, to obtain information on rabies virus prevalence and variation in Zhejiang, and to enrich the genome database of rabies virus street strains isolated in China. Rabies viruses were isolated in suckling mice, overlapping fragments were amplified by RT-PCR, and full-length genomes were assembled. Nucleotide and deduced protein similarities were analyzed, and phylogenetic relationships with strains from Chinese ferret-badger, dog, sika deer, and vole and with vaccine strains in use were determined. The four full-length genomes were sequenced completely and had the same genetic structure, with a length of 11,923 nt or 11,925 nt, comprising a leader (58 nt), NP (1353 nt), PP (894 nt), MP (609 nt), GP (1575 nt), LP (6386 nt), intergenic regions (IGRs; 2, 5, and 5 nt), a pseudogene-like sequence (psi; 423 nt), and a trailer (70 nt). BLAST and multiple sequence alignment showed that the four genomes were consistent with the properties of the genus Lyssavirus in the family Rhabdoviridae. Nucleotide and amino acid sequences were most similar among Chinese strains, especially among strains from the same animal species. Among the four full-length genomes, similarity at the amino acid level was markedly higher than at the nucleotide level, so most of the nucleotide mutations in these genomes were synonymous. Compared with the reference rabies viruses, the lengths of the five protein-coding regions were unchanged and there was no recombination, only a few point mutations; the five proteins therefore appeared to be stable. The variation sites and types in the four genomes were similar to those of the reference vaccine or street strains. According to multiple sequence alignment and phylogenetic analyses, the four strains belonged to genotype 1 and showed regional characteristics distinctive of China. Therefore, these four rabies viruses are likely to be street viruses already existing in nature.

  3. Inhibition of Escherichia coli viability by external guide sequences complementary to two essential genes

    PubMed Central

    McKinney, Jeffrey; Guerrier-Takada, Cecilia; Wesolowski, Donna; Altman, Sidney

    2001-01-01

    Narrow spectrum antimicrobial activity has been designed to reduce the expression of two essential genes, one coding for the protein subunit of RNase P (C5 protein) and one for gyrase (gyrase A). In both cases, external guide sequences (EGS) have been designed to complex with either mRNA. Using the EGS technology, the level of microbial viability is reduced to less than 10% of the wild-type strain. The EGSs are additive when used together and depend on the number of nucleotides paired when attacking gyrase A mRNA. In the case of gyrase A, three nucleotides unpaired out of a 15-mer EGS still favor complete inhibition by the EGS but five unpaired nucleotides do not. PMID:11381134

  4. Complete nucleotide sequences of a new bipartite begomovirus from Malvastrum sp. plants with bright yellow mosaic symptoms in South Texas.

    PubMed

    Alabi, Olufemi J; Villegas, Cecilia; Gregg, Lori; Murray, K Daniel

    2016-06-01

    Two isolates of a novel bipartite begomovirus, tentatively named malvastrum bright yellow mosaic virus (MaBYMV), were molecularly characterized from naturally infected plants of the genus Malvastrum showing bright yellow mosaic disease symptoms in South Texas. Six complete DNA-A and five DNA-B genome sequences of MaBYMV obtained from the isolates ranged in length from 2,608 to 2,609 nucleotides (nt) and 2,578 to 2,605 nt, respectively. Both genome segments shared a 178- to 180-nt common region. In pairwise comparisons, the complete DNA-A and DNA-B sequences of MaBYMV were most similar (87-88 % and 79-81 % identity, respectively) and phylogenetically related to the corresponding sequences of sida mosaic Sinaloa virus-[MX-Gua-06]. Further analysis revealed that MaBYMV is a putative recombinant virus, thus supporting the notion that malvaceous hosts may be influencing the evolution of several begomoviruses. The design of new diagnostic primers enabled the detection of MaBYMV in cohorts of Bemisia tabaci collected from symptomatic Malvastrum sp. plants, thus implicating whiteflies as potential vectors of the virus.

  5. Complete genome analysis of dengue virus type 3 isolated from the 2013 dengue outbreak in Yunnan, China.

    PubMed

    Wang, Xiaodan; Ma, Dehong; Huang, Xinwei; Li, Lihua; Li, Duo; Zhao, Yujiao; Qiu, Lijuan; Pan, Yue; Chen, Junying; Xi, Juemin; Shan, Xiyun; Sun, Qiangming

    2017-06-15

    In the past few decades, dengue has spread rapidly and is an emerging disease in China. An unexpected dengue outbreak occurred in Xishuangbanna, Yunnan, China, resulting in 1331 patients in 2013. This study aimed to obtain complete genome information and to perform mutation and evolutionary analysis of the causative agent of this largest outbreak of dengue fever. The viruses were isolated by cell culture and evaluated by genome sequence analysis. Phylogenetic trees were then constructed by the Neighbor-Joining method (MEGA6.0), followed by analysis of nucleotide mutations and amino acid substitutions. The diversity of secondary structure of the E and NS1 proteins was also analyzed. Selection pressures acting on the coding sequences were then estimated with PAML software. The complete genome sequences of two isolated strains (YNSW1, YNSW2) were 10,710 and 10,702 nucleotides in length, respectively. Phylogenetic analysis revealed that both strains were classified as genotype II of DENV-3. The results indicated that both isolated strains from Xishuangbanna in 2013 and the Laos 2013 strains (KF816161.1, KF816158.1, LC147061.1, LC147059.1, KF816162.1) were most similar to the Bangladesh strain (AY496873.2) from 2002. Compared with DENV-3SS (H87), 62 amino acid substitutions were identified in translated regions, and 38 amino acid substitutions were identified in translated regions compared with the DENV-3 genotype II strain Bangladesh (AY496873.2). Compared with AY496873.2, 27 (YNSW1) or 28 (YNSW2) single nucleotide changes were observed in structural protein sequences, with 7 (YNSW1) or 8 (YNSW2) non-synonymous mutations. Of these, 4 non-synonymous mutations were identified in E protein sequences (2 in the β-sheet, 2 in the coil). Meanwhile, 117 (YNSW1) or 115 (YNSW2) single nucleotide changes were observed in non-structural protein sequences, with 31 (YNSW1) or 30 (YNSW2) non-synonymous mutations. In particular, 14 single nucleotide changes were observed in NS1 sequences, of which 4 were non-synonymous substitutions (all 4 in the coil). Selection pressure analysis revealed no positive selection at the amino acid sites of the genes encoding structural and non-structural proteins. This study may help in understanding the intrinsic geographical relatedness of dengue virus 3 and contributes further to research on its infectivity, pathogenicity and vaccine development.

  6. Complete sequence of two tick-borne flaviviruses isolated from Siberia and the UK: analysis and significance of the 5' and 3'-UTRs.

    PubMed

    Gritsun, T S; Venugopal, K; Zanotto, P M; Mikhailov, M V; Sall, A A; Holmes, E C; Polkinghorne, I; Frolova, T V; Pogodina, V V; Lashkevich, V A; Gould, E A

    1997-05-01

    The complete nucleotide sequences of two tick-transmitted flaviviruses, Vasilchenko (Vs) from Siberia and louping ill (LI) from the UK, have been determined. The genomes were 10928 and 10871 nucleotides (nt) in length, respectively. The coding strategy and functional protein sequence motifs of tick-borne flaviviruses are presented in both Vs and LI viruses. The phylogenies based on maximum likelihood, maximum parsimony and distance analysis of the polyproteins identified Vs virus as a member of the tick-borne encephalitis virus subgroup within the tick-borne serocomplex, genus Flavivirus, family Flaviviridae. Comparative alignment of the 3'-untranslated regions revealed deletions of different lengths essentially at the same position downstream of the stop codon for all tick-borne viruses. Two direct 27 nucleotide repeats at the 3'-end were found only for Vs and LI virus. Immediately following the deletions a region of 332-334 nt with relatively conserved primary structure (67-94% identity) was observed at the 3'-non-coding end of the virus genome. Pairwise comparisons of the nucleotide sequence data revealed similar levels of variation between the coding region, and the 5' and 3'-termini of the genome, implying an equivalent strong selective control for translated and untranslated regions. Indeed, the predicted folding of the 5' and 3'-untranslated regions revealed patterns of stem and loop structures conserved for all tick-borne flaviviruses, suggesting a purifying selection for preservation of essential RNA secondary structures which could be involved in translational control and replication. The possible implications of these findings are discussed.

  7. Characterization of sams genes of Amoeba proteus and the endosymbiotic X-bacteria.

    PubMed

    Jeon, Taeck J; Jeon, Kwang W

    2003-01-01

    As a result of harboring obligatory bacterial endosymbionts, the xD strain of Amoeba proteus no longer produces its own S-adenosylmethionine synthetase (SAMS). When symbiont-free D amoebae are infected with symbionts (X-bacteria), the amount of amoeba SAMS decreases to a negligible level within four weeks, but about 47% of the SAMS activity, which apparently comes from another source, is still detected. Complete nucleotide sequences of sams genes of D and xD amoebae are presented and show that there are no differences between the two. Long-established xD amoebae contain an intact sams gene and thus the loss of xD amoeba's SAMS is not due to the loss of the gene itself. The open reading frame of the amoeba's sams gene has 1,281 nucleotides, encoding SAMS of 426 amino acids with a mass of 48 kDa and pI of 6.5. The amino acid sequence of amoeba SAMS is longer than the SAMS of other organisms by having an extra internal stretch of 28 amino acids. The 5'-flanking region of amoeba sams contains consensus-binding sites for several transcription factors that are related to the regulation of sams genes in E. coli and yeast. The complete nucleotide sequence of the symbiont's sams gene is also presented. The open reading frame of X-bacteria sams is 1,146 nucleotides long, encoding SAMS of 381 amino acids with a mass of 41 kDa and pI of 6.0. The X-bacteria SAMS has 45% sequence identity with that of A. proteus.
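
    The ORF lengths and protein lengths quoted above are consistent with each other if the nucleotide count includes the stop codon, which is an assumption here rather than something the abstract states. A short Python sketch of that arithmetic (the function name is hypothetical):

```python
def protein_length(orf_nt, includes_stop=True):
    """Number of amino acids encoded by an ORF of `orf_nt` nucleotides.

    Assumption: when `includes_stop` is True, the final codon is a stop
    codon and contributes no amino acid.
    """
    if orf_nt <= 0 or orf_nt % 3 != 0:
        raise ValueError("ORF length must be a positive multiple of three")
    codons = orf_nt // 3
    return codons - 1 if includes_stop else codons


print(protein_length(1281))  # 426, the amoeba SAMS length given above
print(protein_length(1146))  # 381, the X-bacteria SAMS length given above
```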

  8. Characterization of the repetitive DNA elements in the genome of fish lymphocystis disease viruses.

    PubMed

    Schnitzler, P; Darai, G

    1989-09-01

    The complete DNA nucleotide sequence of the repetitive DNA elements in the genome of fish lymphocystis disease virus (FLDV) isolated from two different species (flounder and dab) was determined. The size of these repetitive DNA elements was found to be 1413 bp which corresponds to the DNA sequences of the 5' terminus of the EcoRI DNA fragment B (0.034 to 0.052 m.u.) and to the EcoRI DNA fragment M (0.718 to 0.736 m.u.) of the FLDV genome causing lymphocystis disease in flounder and plaice. The degree of DNA nucleotide homology between both regions was found to be 99%. The repetitive DNA element in the genome of FLDV isolated from other fish species (dab) was identified and is located within the EcoRI DNA fragment B and J of the viral genome. The DNA nucleotide sequence of one duplicate of this repetition (EcoRI DNA fragment J) was determined (1410 bp) and compared to the DNA nucleotide sequences of the repetitive DNA elements of the genome of FLDV isolated from flounder. It was found that the repetitive DNA elements of the genome of FLDV derived from two different fish species are highly conserved and possess a degree of DNA sequence homology of 94%. The DNA sequences of each strand of the individual repetitive element possess one open reading frame.

  9. Genetic diversity among isolates of Autographa californica multiple nucleopolyhedrovirus

    USDA-ARS's Scientific Manuscript database

    Our knowledge of genetic variation at the nucleotide sequence level of Autographa californica multiple nucleopolyhedrovirus (AcMNPV; Baculoviridae: Alphabaculovirus) derives from complete genome sequences of the C6 clonal isolate of AcMNPV and the R1 and CL3 clonal isolates of AcMNPV variants Rachip...

  10. The complete genome sequence of a second distinct betabaculovirus from the true armyworm, Mythimna unipuncta

    USDA-ARS's Scientific Manuscript database

    The betabaculovirus Pseudaletia (Mythimna) sp. granulovirus #8 (MyspGV#8) was examined by electron microscopy, host barcoding PCR, and determination of the nucleotide sequence of its genome. Scanning and transmission electron microscopy revealed that the occlusion bodies of MyspGV#8 possessed the c...

  11. Complete Genome Sequences of 38 Gordonia sp. Bacteriophages

    PubMed Central

    Montgomery, Matthew T.; Bonilla, J. Alfred; Dejong, Randall; Garlena, Rebecca A.; Guerrero Bustamante, Carlos; Klyczek, Karen K.; Russell, Daniel A.; Wertz, John T.; Jacobs-Sera, Deborah; Hatfull, Graham F.

    2017-01-01

    We report here the genome sequences of 38 newly isolated bacteriophages using Gordonia terrae 3612 (ATCC 25594) and Gordonia neofelifaecis NRRL59395 as bacterial hosts. All of the phages are double-stranded DNA (dsDNA) tail phages with siphoviral morphologies, with genome sizes ranging from 17,118 bp to 93,843 bp and spanning considerable nucleotide sequence diversity. PMID:28057748

  12. Molecular characterization and phylogenetic relationships of Desmodium leaf distortion virus (DeLDV): a new begomovirus infecting Desmodium glabrum in Yucatan, Mexico.

    PubMed

    Hernández-Zepeda, Cecilia; Argüello-Astorga, Gerardo; Idris, Ali M; Carnevali, Germán; Brown, Judith K; Moreno-Valenzuela, Oscar A

    2009-12-01

    The complete DNA-A component sequence of Desmodium leaf distortion virus (DeLDV, Begomovirus) isolated in Yucatan was determined to be 2569 nucleotides (nt) in length, and it was most closely related to Cotton leaf crumple virus-California (CLCrV-[Cal]), at 76%. The complete DNA-B component sequence was 2514 nt in length, and shared its highest nucleotide identity (60%) with Potato yellow mosaic Trinidad virus (PYMTV). Phylogenetic analyses group the DeLDV DNA-A component in the SLCV clade, whereas, the DeLDV DNA-B was grouped with the Abutilon mosaic virus clade, which also contains PYMV, suggesting that the DeLDV components have distinct evolutionary histories, possibly as the result of recombination and reassortment.

  13. Complete Genome Sequence of Frog virus 3, Isolated from a Strawberry Poison Frog (Oophaga pumilio) Imported from Nicaragua into the Netherlands

    PubMed Central

    Hughes, Joseph; van Beurden, Steven J.; Suárez, Nicolás M.; Haenen, Olga L. M.; Voorbergen-Laarman, Michal; Gröne, Andrea; Kik, Marja J. L.

    2017-01-01

    Frog virus 3 was isolated from a strawberry poison frog (Oophaga pumilio) imported from Nicaragua via Germany to the Netherlands, and its complete genome sequence was determined. Frog virus 3 isolate Op/2015/Netherlands/UU3150324001 is 107,183 bp long and has a nucleotide similarity of 98.26% to the reference Frog virus 3 isolate. PMID:28860243

  14. Complete sequence of the first chimera genome constructed by cloning the whole genome of Synechocystis strain PCC6803 into the Bacillus subtilis 168 genome.

    PubMed

    Watanabe, Satoru; Shiwa, Yuh; Itaya, Mitsuhiro; Yoshikawa, Hirofumi

    2012-12-01

    Genome synthesis of existing or designed genomes is made feasible by the first successful cloning of a cyanobacterium, Synechocystis PCC6803, in Gram-positive, endospore-forming Bacillus subtilis. Whole-genome sequence analysis of the isolate and parental B. subtilis strains provides clues for identifying single nucleotide polymorphisms (SNPs) in the 2 complete bacterial genomes in one cell.

  15. Complete Genome Sequence of a Virulent Newcastle Disease Virus Strain Isolated from a Clinically Healthy Duck (Anas platyrhynchos domesticus) in Pakistan

    PubMed Central

    Wajid, Abdul; Rehmani, Shafqat F.; Wasim, Muhammad; Basharat, Asma; Bibi, Tasra; Arif, Saima; Dimitrov, Kiril M.

    2016-01-01

    Here, we report the complete genome sequence of a virulent Newcastle disease virus (vNDV) strain, duck/Pakistan/Lahore/AW-123/2015, isolated from apparently healthy laying ducks (Anas platyrhynchos domesticus) from the province of Punjab, Pakistan. The virus has a genome length of 15,192 nucleotides and is classified as member of subgenotype VIIi, class II. PMID:27469959

  16. Complete Genome Sequence of the Alfalfa Symbiont Sinorhizobium/Ensifer meliloti Strain GR4.

    PubMed

    Martínez-Abarca, Francisco; Martínez-Rodríguez, Laura; López-Contreras, José Antonio; Jiménez-Zurdo, José Ignacio; Toro, Nicolás

    2013-01-01

    We present the complete nucleotide sequence of the multipartite genome of Sinorhizobium/Ensifer meliloti GR4, a predominant rhizobial strain in an agricultural field site. The genome (total size, 7.14 Mb) consists of five replicons: one chromosome, two expected symbiotic megaplasmids (pRmeGR4c and pRmeGR4d), and two accessory plasmids (pRmeGR4a and pRmeGR4b).

  17. ANCAC: amino acid, nucleotide, and codon analysis of COGs--a tool for sequence bias analysis in microbial orthologs.

    PubMed

    Meiler, Arno; Klinger, Claudia; Kaufmann, Michael

    2012-09-08

    The COG database is the most popular collection of orthologous proteins from many different completely sequenced microbial genomes. Per definition, a cluster of orthologous groups (COG) within this database exclusively contains proteins that most likely achieve the same cellular function. Recently, the COG database was extended by assigning to every protein both the corresponding amino acid and its encoding nucleotide sequence resulting in the NUCOCOG database. This extended version of the COG database is a valuable resource connecting sequence features with the functionality of the respective proteins. Here we present ANCAC, a web tool and MySQL database for the analysis of amino acid, nucleotide, and codon frequencies in COGs on the basis of freely definable phylogenetic patterns. We demonstrate the usefulness of ANCAC by analyzing amino acid frequencies, codon usage, and GC-content in a species- or function-specific context. With respect to amino acids we, at least in part, confirm the cognate bias hypothesis by using ANCAC's NUCOCOG dataset as the largest one available for that purpose thus far. Using the NUCOCOG datasets, ANCAC connects taxonomic, amino acid, and nucleotide sequence information with the functional classification via COGs and provides a GUI for flexible mining for sequence-bias. Thereby, to our knowledge, it is the only tool for the analysis of sequence composition in the light of physiological roles and phylogenetic context without requirement of substantial programming-skills.

  18. ANCAC: amino acid, nucleotide, and codon analysis of COGs – a tool for sequence bias analysis in microbial orthologs

    PubMed Central

    2012-01-01

    Background: The COG database is the most popular collection of orthologous proteins from many different completely sequenced microbial genomes. Per definition, a cluster of orthologous groups (COG) within this database exclusively contains proteins that most likely achieve the same cellular function. Recently, the COG database was extended by assigning to every protein both the corresponding amino acid and its encoding nucleotide sequence resulting in the NUCOCOG database. This extended version of the COG database is a valuable resource connecting sequence features with the functionality of the respective proteins. Results: Here we present ANCAC, a web tool and MySQL database for the analysis of amino acid, nucleotide, and codon frequencies in COGs on the basis of freely definable phylogenetic patterns. We demonstrate the usefulness of ANCAC by analyzing amino acid frequencies, codon usage, and GC-content in a species- or function-specific context. With respect to amino acids we, at least in part, confirm the cognate bias hypothesis by using ANCAC’s NUCOCOG dataset as the largest one available for that purpose thus far. Conclusions: Using the NUCOCOG datasets, ANCAC connects taxonomic, amino acid, and nucleotide sequence information with the functional classification via COGs and provides a GUI for flexible mining for sequence-bias. Thereby, to our knowledge, it is the only tool for the analysis of sequence composition in the light of physiological roles and phylogenetic context without requirement of substantial programming-skills. PMID:22958836

  19. Nucleotide sequences of immunoglobulin eta genes of chimpanzee and orangutan: DNA molecular clock and hominoid evolution

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Sakoyama, Y.; Hong, K.J.; Byun, S.M.

    To determine the phylogenetic relationships among hominoids and the dates of their divergence, the complete nucleotide sequences of the constant region of the immunoglobulin eta-chain (C_eta1) genes from chimpanzee and orangutan have been determined. These sequences were compared with the human eta-chain constant-region sequence. A molecular clock (silent molecular clock), measured by the degree of sequence divergence at the synonymous (silent) positions of protein-encoding regions, was introduced for the present study. From the comparison of nucleotide sequences of α1-antitrypsin and β- and delta-globulin genes between humans and Old World monkeys, the silent molecular clock was calibrated: the mean evolutionary rate of silent substitution was determined to be 1.56 × 10^-9 substitutions per site per year. Using the silent molecular clock, the mean divergence dates of chimpanzee and orangutan from the human lineage were estimated as 6.4 ± 2.6 million years and 17.3 ± 4.5 million years, respectively. It was also shown that the evolutionary rate of primate genes is considerably slower than those of other mammalian genes.
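
    A molecular clock of this kind converts a measured divergence into a date. The Python sketch below uses the common relation T = K / (2r), where K is the number of silent substitutions per site between two species and r is the rate per site per year. It assumes the quoted rate applies to each lineage separately, which the abstract does not state. The K value in the example is purely illustrative and is not taken from the study.

```python
def divergence_time_years(k_silent, rate_per_site_per_year):
    """Estimate divergence time from silent-site divergence.

    Assumption: the rate is per lineage, so two lineages together
    accumulate 2 * rate substitutions per site per year (T = K / (2r)).
    """
    if k_silent < 0 or rate_per_site_per_year <= 0:
        raise ValueError("divergence must be >= 0 and rate must be > 0")
    return k_silent / (2.0 * rate_per_site_per_year)


RATE = 1.56e-9          # silent substitutions per site per year, from the abstract
EXAMPLE_K = 0.02        # hypothetical divergence, for illustration only

print(f"{divergence_time_years(EXAMPLE_K, RATE) / 1e6:.2f} million years")
```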

  20. Complete genome sequence of the biofilm-forming Microbacterium sp. strain BH-3-3-3, isolated from conventional field-grown lettuce (Lactuca sativa) in Norway.

    PubMed

    Dees, Merete Wiken; Brurberg, May Bente; Lysøe, Erik

    2017-03-01

    The genus Microbacterium contains bacteria that are ubiquitously distributed in various environments and includes plant-associated bacteria that are able to colonize tissue of agricultural crop plants. Here, we report the 3,508,491 bp complete genome sequence of Microbacterium sp. strain BH-3-3-3, isolated from conventionally grown lettuce (Lactuca sativa) from a field in Vestfold, Norway. The nucleotide sequence of this genome was deposited into NCBI GenBank under the accession CP017674.

  1. Identification of Genes Encoding Conjugated Bile Salt Hydrolase and Transport in Lactobacillus johnsonii 100-100

    PubMed Central

    Elkins, Christopher A.; Savage, Dwayne C.

    1998-01-01

    Cytosolic extracts of Lactobacillus johnsonii 100-100 (previously reported as Lactobacillus sp. strain 100-100) contain four heterotrimeric isozymes composed of two peptides, α and β, with conjugated bile salt hydrolase (BSH) activity. We now report cloning, from the genome of strain 100-100, a 2,977-bp DNA segment that expresses BSH activity in Escherichia coli. The sequencing of this segment showed that it contained one complete and two partial open reading frames (ORFs). The 3′ partial ORF (927 nucleotides) was predicted by BLAST and confirmed with 5′ and 3′ deletions to be a BSH gene. Thermal asymmetric interlaced PCR was used to extend and complete the 948-nucleotide sequence of the BSH gene 3′ of the cloned segment. The predicted amino acid sequence of the 5′ partial ORF (651 nucleotides) was about 80% similar to the C-terminal half of the largest, complete ORF (1,353 nucleotides), and these two putative proteins were similar to several amine, multidrug resistance, and sugar transport proteins of the major facilitator superfamily. E. coli DH5α cells transformed with a construct containing these ORFs, in concert with an extracellular factor produced by strain 100-100, demonstrated levels of uptake of [14C]taurocholic acid that were increased as much as threefold over control levels. [14C]Cholic acid was taken up in similar amounts by strain DH5α pSportI (control) and DH5α p2000 (transport clones). These findings support a hypothesis that the ORFs are conjugated bile salt transport genes which may be arranged in an operon with BSH genes. PMID:9721268

  2. Genome characterization and genetic diversity of sweet potato symptomless virus 1: a mastrevirus with an unusual nonanucleotide

    USDA-ARS's Scientific Manuscript database

    Complete genomic sequences of nine isolates of sweet potato symptomless virus 1 (SPSMV-1), a virus of genus Mastrevirus in the family Geminiviridae, were determined to be 2,559-2,602 nucleotides from sweet potato accessions from different countries. These isolates shared genomic sequence identities o...

  3. Complete genome sequences of two highly divergent Japanese isolates of Plantago asiatica mosaic virus.

    PubMed

    Komatsu, Ken; Yamashita, Kazuo; Sugawara, Kota; Verbeek, Martin; Fujita, Naoko; Hanada, Kaoru; Uehara-Ichiki, Tamaki; Fuji, Shin-Ichi

    2017-02-01

    Plantago asiatica mosaic virus (PlAMV) is a member of the genus Potexvirus and has an exceptionally wide host range. It causes severe damage to lilies. Here we report on the complete nucleotide sequences of two new Japanese PlAMV isolates, one from the eudicot weed Viola grypoceras (PlAMV-Vi), and the other from the eudicot shrub Nandina domestica Thunb. (PlAMV-NJ). Their genomes contain five open reading frames (ORFs), which is characteristic of potexviruses. Surprisingly, the isolates showed only 76.0-78.0 % sequence identity with each other and with other PlAMV isolates, including isolates from Japanese lily and American nandina. Amino acid alignments of the replicase coding region encoded by ORF1 showed that the regions between the methyltransferase and helicase domains were less conserved than other regions, with several insertions and/or deletions. Phylogenetic analyses of the full-length nucleotide sequences revealed a moderate correlation between phylogenetic clustering and the original host plants of the PlAMV isolates. This study revealed the presence of two highly divergent PlAMV isolates in Japan.

  4. Genomic analysis reveals Nairobi sheep disease virus to be highly diverse and present in both Africa, and in India in the form of the Ganjam virus variant.

    PubMed

    Yadav, Pragya D; Vincent, Martin J; Khristova, Marina; Kale, Charuta; Nichol, Stuart T; Mishra, Akhilesh C; Mourya, Devendra T

    2011-07-01

    Nairobi sheep disease (NSD) virus, the prototype tick-borne virus of the genus Nairovirus, family Bunyaviridae, is associated with acute hemorrhagic gastroenteritis in sheep and goats in East and Central Africa. The closely related Ganjam virus found in India is associated with febrile illness in humans and disease in livestock. Sequencing of the complete S, M and L segments of Ganjam and NSD viruses was carried out, together with partial sequence analysis of the S, M and L segments (396 bp, 701 bp and 425 bp), which encode regions of the viral nucleocapsid (N), glycoprotein precursor (GPC) and L polymerase (L) proteins, respectively, for multiple Ganjam virus isolates obtained from 1954 to 2002 from various regions of India. The M segments of NSD and Ganjam viruses encode a large ORF for the GPC (1627 and 1624 amino acids in length, respectively), and their L segments encode a very large L polymerase (3991 amino acids). The complete S, M and L segments of NSD and Ganjam viruses were more closely related to one another than to other characterized nairoviruses, and no evidence of reassortment was found. However, the complete M segments of NSD and Ganjam viruses differed by 22.90% and 14.70% at the nucleotide and amino acid levels, respectively, and the complete L segments differed by 9.90% and 2.70% at the nucleotide and protein levels, respectively. The complete S segments of Ganjam and NSD viruses differed by 9.40-10.40% and 3.2-4.10% at the nucleotide and protein levels, whereas among Ganjam viruses the variation was 0.0-6.20% and 0.0-1.4% at the nucleotide and amino acid levels. Ganjam virus isolates differed by up to 17% and 11% at the nucleotide level for the partial S and L gene fragments, respectively, with less variation observed at the deduced amino acid level (10.5 and 2%, S and L, respectively). However, the partial M gene fragment (which encodes the hypervariable mucin-like domain) of these viruses differed by as much as 56% at the nucleotide level. Phylogenetic analysis of partial sequence differences suggests considerable mixing and movement of Ganjam virus strains within India, with no clear relationship between genetic lineages and virus geographic origin or year of isolation. Surprisingly, NSD virus does not represent a distinct lineage, but appears as a variant among the Ganjam viruses within the NSD virus group. Copyright © 2011 Elsevier B.V. All rights reserved.

  5. Complete Genome Sequence of Frog virus 3, Isolated from a Strawberry Poison Frog (Oophaga pumilio) Imported from Nicaragua into the Netherlands.

    PubMed

    Saucedo, Bernardo; Hughes, Joseph; van Beurden, Steven J; Suárez, Nicolás M; Haenen, Olga L M; Voorbergen-Laarman, Michal; Gröne, Andrea; Kik, Marja J L

    2017-08-31

    Frog virus 3 was isolated from a strawberry poison frog (Oophaga pumilio) imported from Nicaragua via Germany to the Netherlands, and its complete genome sequence was determined. Frog virus 3 isolate Op/2015/Netherlands/UU3150324001 is 107,183 bp long and has a nucleotide similarity of 98.26% to the reference Frog virus 3 isolate. Copyright © 2017 Saucedo et al.

  6. Complete Genome Sequence of the Alfalfa Symbiont Sinorhizobium/Ensifer meliloti Strain GR4

    PubMed Central

    Martínez-Abarca, Francisco; Martínez-Rodríguez, Laura; López-Contreras, José Antonio; Jiménez-Zurdo, José Ignacio

    2013-01-01

    We present the complete nucleotide sequence of the multipartite genome of Sinorhizobium/Ensifer meliloti GR4, a predominant rhizobial strain in an agricultural field site. The genome (total size, 7.14 Mb) consists of five replicons: one chromosome, two expected symbiotic megaplasmids (pRmeGR4c and pRmeGR4d), and two accessory plasmids (pRmeGR4a and pRmeGR4b). PMID:23409262

  7. The human myelin oligodendrocyte glycoprotein (MOG) gene: Complete nucleotide sequence and structural characterization

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Paule Roth, M.; Malfroy, L.; Offer, C.

    1995-07-20

    Human myelin oligodendrocyte glycoprotein (MOG), a myelin component of the central nervous system, is a candidate target antigen for autoimmune-mediated demyelination. We have isolated and sequenced part of a cosmid clone that contains the entire human MOG gene. The primary nuclear transcript, extending from the putative start of transcription to the site of poly(A) addition, is 15,561 nucleotides in length. The human MOG gene contains 8 exons, separated by 7 introns; canonical intron/exon boundary sites are observed at each junction. The introns vary in size from 242 to 6484 bp and contain numerous repetitive DNA elements, including 14 Alu sequences within 3 introns. Another Alu element is located in the 3′-untranslated region of the gene. Alu sequences were classified with respect to subfamily assignment. Seven hundred sixty-three nucleotides 5′ of the transcription start and 1214 nucleotides 3′ of the poly(A) addition sites were also sequenced. The 5′-flanking region revealed the presence of several consensus sequences that could be relevant in the transcription of the MOG gene, in particular binding sites in common with other myelin gene promoters. Two polymorphic intragenic dinucleotide (CA)n and tetranucleotide (TAAA)n repeats were identified and may provide genetic marker tools for association and linkage studies. 50 refs., 3 figs., 3 tabs.

  8. Complete Sequence of the mitochondrial genome of the tapeworm Hymenolepis diminuta: Gene arrangements indicate that platyhelminths are eutrochozoans

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    von Nickisch-Rosenegk, Markus; Brown, Wesley M.; Boore, Jeffrey L.

    2001-01-01

    Using "long-PCR" we have amplified in overlapping fragments the complete mitochondrial genome of the tapeworm Hymenolepis diminuta (Platyhelminthes: Cestoda) and determined its 13,900 nucleotide sequence. The gene content is the same as that typically found for animal mitochondrial DNA (mtDNA) except that atp8 appears to be lacking, a condition found previously for several other animals. Despite the small size of this mtDNA, there are two large non-coding regions, one of which contains 13 repeats of a 31 nucleotide sequence and a potential stem-loop structure of 25 base pairs with an 11-member loop. Large potential secondary structures are identified also for the non-coding regions of two other cestode mtDNAs. Comparison of the mitochondrial gene arrangement of H. diminuta with those previously published supports a phylogenetic position of flatworms as members of the Eutrochozoa, rather than being basal to either a clade of protostomes or a clade of coelomates.

  9. Combined hairpin-antisense compositions and methods for modulating expression

    DOEpatents

    Shanklin, John; Nguyen, Tam

    2014-08-05

    A nucleotide construct comprising a nucleotide sequence that forms a stem and a loop, wherein the loop comprises a nucleotide sequence that modulates expression of a target, wherein the stem comprises a nucleotide sequence that modulates expression of a target, and wherein the target modulated by the nucleotide sequence in the loop and the target modulated by the nucleotide sequence in the stem may be the same or different. Vectors, methods of regulating target expression, methods of providing a cell, and methods of treating conditions comprising the nucleotide sequence are also disclosed.

  10. Combined hairpin-antisense compositions and methods for modulating expression

    DOEpatents

    Shanklin, John; Nguyen, Tam Huu

    2015-11-24

    A nucleotide construct comprising a nucleotide sequence that forms a stem and a loop, wherein the loop comprises a nucleotide sequence that modulates expression of a target, wherein the stem comprises a nucleotide sequence that modulates expression of a target, and wherein the target modulated by the nucleotide sequence in the loop and the target modulated by the nucleotide sequence in the stem may be the same or different. Vectors, methods of regulating target expression, methods of providing a cell, and methods of treating conditions comprising the nucleotide sequence are also disclosed.

  11. Isolation and characterization of a cDNA clone for the complete protein coding region of the delta subunit of the mouse acetylcholine receptor.

    PubMed Central

    LaPolla, R J; Mayne, K M; Davidson, N

    1984-01-01

    A mouse cDNA clone has been isolated that contains the complete coding region of a protein highly homologous to the delta subunit of the Torpedo acetylcholine receptor (AcChoR). The cDNA library was constructed in the vector lambda 10 from membrane-associated poly(A)+ RNA from BC3H-1 mouse cells. Surprisingly, the delta clone was selected by hybridization with cDNA encoding the gamma subunit of the Torpedo AcChoR. The nucleotide sequence of the mouse cDNA clone contains an open reading frame of 520 amino acids. This amino acid sequence exhibits 59% and 50% sequence homology to the Torpedo AcChoR delta and gamma subunits, respectively. However, the mouse nucleotide sequence has several stretches of high homology with the Torpedo gamma subunit cDNA, but not with delta. The mouse protein has the same general structural features as do the Torpedo subunits. It is encoded by a 3.3-kilobase mRNA. There is probably only one, but at most two, chromosomal genes coding for this or closely related sequences. PMID:6096870

  12. Bean common mosaic virus isolates causing different symptoms in asparagus bean in China differ greatly in the 5'-parts of their genomes.

    PubMed

    Zheng, Hongying; Chen, Jiong; Chen, Jianping; Adams, Michael J; Hou, Mingsheng

    2002-06-01

    Potyvirus isolates from asparagus bean (Vigna sesquipedalis) plants in Zhejiang province, China, caused either rugose and vein banding mosaic symptoms (isolate R) or severe yellowing (isolate Y) in this host, but were otherwise similar in host range. Both isolates were completely sequenced and shown to be isolates of Bean common mosaic virus (BCMV). The complete sequences were 9992 (R) or 10062 (Y) nucleotides long and shared 91.7% identical nucleotides (93.2% identical amino acids) in their genomes and were more distantly related to the BCMV-Peanut stripe virus sequence (PStV). The isolates were much less similar to one another in the 5'-UTR and the N-terminal region of the P1 protein. In the P1, isolate Y was closer to PStV (76.1% identical amino acids) than to isolate R (64.8%). Phylogenetic analyses of the coat protein region showed that the new isolates grouped with other isolates from Vigna spp., forming the blackeye cowpea mosaic strain subgroup of BCMV with 94-98% nucleotides (96-99% amino acids) identical to one another and about 90% identity to other BCMV isolates. Other significant subgroupings amongst published BCMV isolates were detected.

  13. Nonneutral mitochondrial DNA variation in humans and chimpanzees

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Nachman, M.W.; Aquadro, C.F.; Brown, W.M.

    1996-03-01

    We sequenced the NADH dehydrogenase subunit 3 (ND3) gene from a sample of 61 humans, five common chimpanzees, and one gorilla to test whether patterns of mitochondrial DNA (mtDNA) variation are consistent with a neutral model of molecular evolution. Within humans and within chimpanzees, the ratio of replacement to silent nucleotide substitutions was higher than observed in comparisons between species, contrary to neutral expectations. To test the generality of this result, we reanalyzed published human RFLP data from the entire mitochondrial genome. Gains of restriction sites relative to a known human mtDNA sequence were used to infer unambiguous nucleotide substitutions. We also compared the complete mtDNA sequences of three humans. Both the RFLP data and the sequence data reveal a higher ratio of replacement to silent nucleotide substitutions within humans than is seen between species. This pattern is observed at most or all human mitochondrial genes and is inconsistent with a strictly neutral model. These data suggest that many mitochondrial protein polymorphisms are slightly deleterious, consistent with studies of human mitochondrial diseases. 59 refs., 2 figs., 8 tabs.
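
    The comparison described above (replacement-to-silent ratio within species versus between species) can be illustrated with a small sketch. The abstract does not say which statistic the authors used; the neutrality index below is one common way to express the comparison, and the function name and the counts are hypothetical, chosen only for illustration.

```python
def neutrality_index(pn: int, ps: int, dn: int, ds: int) -> float:
    """Ratio of (replacement/silent) within species to (replacement/silent) between species.

    pn, ps: replacement and silent polymorphisms within a species.
    dn, ds: replacement and silent fixed differences between species.
    A value above 1 means an excess of replacement polymorphism within
    species, the direction reported in the abstract.
    """
    if ps == 0 or dn == 0 or ds == 0:
        raise ValueError("ps, dn and ds must be non-zero")
    return (pn / ps) / (dn / ds)


# Hypothetical counts, NOT taken from the study.
ni = neutrality_index(pn=4, ps=2, dn=3, ds=6)
print(ni)  # within-species ratio 2.0 divided by between-species ratio 0.5
```

    Under strict neutrality the two ratios are expected to be similar, so the index would be close to 1.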

  14. Identification and nucleotide sequence analysis of the repetitive DNA element in the genome of fish lymphocystis disease virus.

    PubMed

    Schnitzler, P; Delius, H; Scholz, J; Touray, M; Orth, E; Darai, G

    1987-12-01

    The genome of the fish lymphocystis disease virus (FLDV) was screened for the existence of repetitive DNA sequences using a defined and complete gene library of the viral genome (98 kbp) by DNA-DNA hybridization, heteroduplex analysis, and restriction fine mapping. A repetitive DNA sequence was detected at the coordinates 0.034 to 0.057 and 0.718 to 0.736 map units (m.u.) of the FLDV genome. The first region (0.034 to 0.057 m.u.) corresponds to the 5' terminus of the EcoRI FLDV DNA fragment B (0.034 to 0.165 m.u.) and the second region (0.718 to 0.736 m.u.) is identical to the EcoRI DNA fragment M of the viral genome. The DNA nucleotide sequence of the EcoRI FLDV DNA fragment M was determined. This analysis revealed the presence of many short direct and inverted repetitions, e.g., an 18-mer direct repetition (TTTAAAATTTAATTAA) that started at nucleotide positions 812 and 942 and a 14-mer inverted repeat (TTAAATTTAAATTT) at nucleotide positions 820 and 959. Only short open reading frames were detected within this region. The DNA repetitions are discussed as sequences that play a possible regulatory role for virus replication. Furthermore, hybridization experiments revealed that the repetitive DNA sequences are conserved in the genome of different strains of fish lymphocystis disease virus isolated from two species of Pleuronectidae (flounder and dab).
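
    As an illustration of how short direct and inverted repeats of the kind described above can be located in a sequence, here is a minimal sketch. It is not the authors' method (the study predates such tooling); the function names and the toy sequences are assumptions made for this example, and positions are reported 1-based as in the abstract.

```python
from collections import defaultdict


def kmer_positions(seq: str, k: int) -> dict[str, list[int]]:
    """Index every k-mer of seq by its 1-based start positions."""
    index: dict[str, list[int]] = defaultdict(list)
    for i in range(len(seq) - k + 1):
        index[seq[i:i + k]].append(i + 1)
    return index


def direct_repeats(seq: str, k: int) -> dict[str, list[int]]:
    """Return k-mers that occur more than once, with their start positions."""
    return {kmer: pos for kmer, pos in kmer_positions(seq, k).items() if len(pos) > 1}


def reverse_complement(seq: str) -> str:
    """Reverse complement of an unambiguous DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def inverted_repeats(seq: str, k: int) -> list[tuple[str, int, int]]:
    """Return (k-mer, start, start of its reverse complement) with start < partner."""
    index = kmer_positions(seq, k)
    hits = []
    for kmer, starts in index.items():
        partner_starts = index.get(reverse_complement(kmer), [])
        for s in starts:
            for t in partner_starts:
                if s < t:
                    hits.append((kmer, s, t))
    return sorted(hits, key=lambda h: (h[1], h[2]))


# Toy sequences, not FLDV data.
print(direct_repeats("ACGTTAACGTT", 5))
print(inverted_repeats("AAACCGAGAGGTTT", 5))
```

    Indexing k-mers in a dictionary keeps the search linear in sequence length for a fixed k, which is adequate for a fragment of a few kilobases.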

  15. The Complete Nucleotide Sequence of the Mitochondrial Genome of Bactrocera minax (Diptera: Tephritidae)

    PubMed Central

    Zhang, Bin; Nardi, Francesco; Hull-Sanders, Helen; Wan, Xuanwu; Liu, Yinghong

    2014-01-01

    The complete 16,043 bp mitochondrial genome (mitogenome) of Bactrocera minax (Diptera: Tephritidae) has been sequenced. The genome encodes the 37 genes usually found in insect mitogenomes. The mitogenome information for B. minax was compared to the homologous sequences of Bactrocera oleae, Bactrocera tryoni, Bactrocera philippinensis, Bactrocera carambolae, Bactrocera papayae, Bactrocera dorsalis, Bactrocera correcta, Bactrocera cucurbitae and Ceratitis capitata. The analysis indicated that the structure and organization are typical of, and similar to, those of the nine closely related species mentioned above, although the B. minax mitogenome has the lowest genome-wide A+T content (67.3%). Four short intergenic spacers with a high degree of conservation among the nine tephritid species mentioned above and B. minax were observed, which also have clear counterparts in the control regions (CRs). Correlation analysis among these ten tephritid species revealed a close positive correlation among the A+T content of zero-fold degenerate sites (P0FD), the ratio of nucleotide substitution frequency at P0FD sites to all degenerate sites (zero-fold, two-fold and four-fold degenerate sites) and amino acid sequence distance (ASD). Further, a significant positive correlation was observed between the A+T content of four-fold degenerate sites (P4FD) and the ratio of nucleotide substitution frequency at P4FD sites to all degenerate sites; however, we found a significant negative correlation between ASD and both the A+T content of P4FD and the ratio of nucleotide substitution frequency at P4FD sites to all degenerate sites. A higher nucleotide substitution frequency at non-synonymous sites compared to synonymous sites was observed in nad4, the first time that this has been observed in an insect mitogenome. A poly(T) stretch at the 5′ end of the CR followed by a [TA(A)]n-like stretch was also found. In addition, a highly conserved G+A-rich sequence block was observed in front of the poly(T) stretch among the ten tephritid species and two tandem repeats were present in the CR. PMID:24964138

  16. Complete genome sequence of a Chinese isolate of pepper vein yellows virus and evolutionary analysis based on the CP, MP and RdRp coding regions.

    PubMed

    Liu, Maoyan; Liu, Xiangning; Li, Xun; Zhang, Deyong; Dai, Liangyin; Tang, Qianjun

    2016-03-01

    The genome sequence of pepper vein yellows virus (PeVYV) (PeVYV-HN, accession number KP326573), isolated from pepper plants (Capsicum annuum L.) grown at the Hunan Vegetables Institute (Changsha, Hunan, China), was determined by deep sequencing of small RNAs. The PeVYV-HN genome consists of 6244 nucleotides, contains six open reading frames (ORFs), and is similar to that of an isolate (AB594828) from Japan. Its genomic organization is similar to that of members of the genus Polerovirus. Sequence analysis revealed that PeVYV-HN shared 92% sequence identity with the Japanese PeVYV genome at both the nucleotide and amino acid levels. Evolutionary analysis based on the coat protein (CP), movement protein (MP), and RNA-dependent RNA polymerase (RdRP) showed that PeVYV could be divided into two major lineages corresponding to their geographical origins. The Asian isolates have a higher population expansion frequency than the African isolates. Negative selection and genetic drift (founder effect) were found to be the potential drivers of the molecular evolution of PeVYV. Moreover, recombination was not the distinct cause of PeVYV evolution. This is the first report of a complete genomic sequence of PeVYV in China.

  17. Complete mitochondrial genome of Helicoverpa zea (Boddie) and expression profiles of mitochondrial-encoded genes in early and late embryos

    USDA-ARS's Scientific Manuscript database

    The mitochondrial genome of the bollworm, Helicoverpa zea, was assembled using paired-end nucleotide sequence reads generated with a next-generation sequencing platform. Assembly resulted in a mitogenome of 15,348 bp with greater than 17,000-fold average coverage. Organization of the H. zea mitogen...

  18. Complete nucleotide sequence of Sida golden mosaic Florida virus and phylogenetic relationships with other begomoviruses infecting malvaceous weeds in the Caribbean.

    PubMed

    Fiallo-Olivé, Elvira; Martínez-Zubiaur, Yamila; Moriones, Enrique; Navas-Castillo, Jesús

    2010-09-01

    The complete genome sequence of two isolates of the bipartite begomovirus (genus Begomovirus, family Geminiviridae) Sida golden mosaic Florida virus (SiGMFV) is presented. We propose that both isolates, found infecting Malvastrum coromandelianum (family Malvaceae) in Cuba, belong to a new strain of SiGMFV. Phylogenetic analysis showed that SiGMFV DNA-A is located in a monophyletic cluster that includes begomoviruses infecting malvaceous weeds from the Caribbean.

  19. Molecular characterization of the complete genome of falconid herpesvirus strain S-18

    USDA-ARS's Scientific Manuscript database

    Falconid herpesvirus type 1 (FHV-1) is the causative agent of falcon inclusion body disease, an acute, highly contagious disease of raptors. The complete nucleotide sequence of the genome of FHV-1 has been determined. The genome is arranged as a D-type genome with large inverted repeats flanking a ...

  20. Complete nucleotide sequences of three pigeon paramyxovirus serotype-1 (PPMV-1) isolates

    USDA-ARS's Scientific Manuscript database

    Pigeon paramyxovirus serotype-1 (PPMV-1) is an antigenic variant of avian paramyxovirus serotype-1 (APMV-1), the agent responsible for Newcastle disease. Given that PPMV-1 can be transmitted to the poultry population it is important to characterize PPMV-1 in native birds. Here we report the complet...

  1. [Complete genome sequencing and analyses of rabies viruses isolated from wild animals (Chinese Ferret-Badger) in Zhejiang province].

    PubMed

    Lei, Yong-Liang; Wang, Xiao-Guang; Liu, Fu-Ming; Chen, Xiu-Ying; Ye, Bi-Feng; Mei, Jian-Hua; Lan, Jin-Quan; Tang, Qing

    2009-08-01

    By sequencing the full-length genomes of rabies viruses isolated from two Chinese ferret-badgers, we analyzed the genetic variation of rabies viruses at the molecular level, to obtain information on the prevalence and variation of rabies viruses in Zhejiang and to enrich the genome database of rabies virus street strains isolated from Chinese wildlife. Overlapping fragments were amplified by RT-PCR and the full-length genomes were assembled. Nucleotide and deduced protein similarities were analyzed, and phylogenetic analyses of the N genes from Chinese ferret-badger, sika deer, vole, dog and vaccine strains were then performed. The two full-length genomes had the same genetic structure, with 11 923 nts including a 58-nt leader, 1353-nt NP, 894-nt PP, 609-nt MP, 1575-nt GP and 6386-nt LP regions, 2-, 5- and 5-nt intergenic regions (IGRs), a 423-nt pseudogene-like sequence (Psi) and a 70-nt trailer. BLAST and multiple sequence alignment showed that the two full-length genomes were consistent with the properties of the genus Lyssavirus, family Rhabdoviridae. The nucleotide and amino acid sequences were most similar among Chinese strains, especially among strains from animals of the same species. Between the two full-length genomes, similarity at the amino acid level was markedly higher than at the nucleotide level, so the nucleotide mutations in these two genomes were most probably synonymous. Compared with the reference rabies viruses, the five protein-coding regions showed no changes in length and no recombination, only a few point mutations, indicating that the five proteins are stable. The variation sites and types of the two ferret-badger genomes were similar to those of the reference vaccine or street strains. According to the multiple sequence alignment and phylogenetic analyses, the two strains belonged to genotype 1 and possessed the distinct geographic characteristics of China. All the evidence suggests that these two ferret-badger rabies viruses are likely to be street viruses already circulating in wildlife.

  2. Complete genome sequence of Fer-de-Lance Virus reveals a novel gene in reptilian Paramyxoviruses

    USGS Publications Warehouse

    Kurath, G.; Batts, W.N.; Ahne, W.; Winton, J.R.

    2004-01-01

    The complete RNA genome sequence of the archetype reptilian paramyxovirus, Fer-de-Lance virus (FDLV), has been determined. The genome is 15,378 nucleotides in length and consists of seven nonoverlapping genes in the order 3′ N-U-P-M-F-HN-L 5′, coding for the nucleocapsid, unknown, phospho-, matrix, fusion, hemagglutinin-neuraminidase, and large polymerase proteins, respectively. The gene junctions contain highly conserved transcription start and stop signal sequences and tri-nucleotide intergenic regions similar to those of other Paramyxoviridae. The FDLV P gene expression strategy is like that of rubulaviruses, which express the accessory V protein from the primary transcript and edit a portion of the mRNA to encode P and I proteins. There is also an overlapping open reading frame potentially encoding a small basic protein in the P gene. The gene designated U (unknown) encodes a deduced protein of 19.4 kDa that has no counterpart in other paramyxoviruses and has no similarity with sequences in the National Center for Biotechnology Information database. Active transcription of the U gene in infected cells was demonstrated by Northern blot analysis, and bicistronic N-U mRNA was also evident. The genomes of two other snake paramyxovirus genotypes were also found to have U genes, with 11 to 16% nucleotide divergence from the FDLV U gene. Pairwise comparisons of amino acid identities and phylogenetic analyses of all deduced FDLV protein sequences with homologous sequences from other Paramyxoviridae indicate that FDLV represents a new genus within the subfamily Paramyxovirinae. We suggest the name Ferlavirus for the new genus, with FDLV as the type species.

  3. The full mitochondrial genome sequence of Raillietina tetragona from chicken (Cestoda: Davaineidae).

    PubMed

    Liang, Jian-Ying; Lin, Rui-Qing

    2016-11-01

    In the present study, the complete mitochondrial DNA (mtDNA) of Raillietina tetragona was sequenced and its gene content and genome organization were compared with those of other tapeworms. The complete mt genome sequence of R. tetragona is 14,444 bp in length. It contains 12 protein-coding genes, two ribosomal RNA genes, 22 transfer RNA genes, and two non-coding regions. All genes are transcribed in the same direction and have a nucleotide composition high in A and T. The A + T content of the complete mt genome of R. tetragona is 71.4%. The R. tetragona mt genome sequence provides novel mtDNA markers for studying the molecular epidemiology and population genetics of Raillietina and has implications for the molecular diagnosis of chicken cestodosis caused by Raillietina.
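
    The A + T content quoted above is a simple base-composition statistic. A minimal sketch of how such a percentage is computed follows; the function name and the toy fragment are assumptions for illustration and are not taken from the R. tetragona sequence.

```python
def at_content(seq: str) -> float:
    """Percentage of A and T among the unambiguous bases (A, C, G, T) of seq."""
    bases = [b for b in seq.upper() if b in "ACGT"]  # ignore N and other ambiguity codes
    if not bases:
        raise ValueError("sequence contains no unambiguous bases")
    at = sum(1 for b in bases if b in "AT")
    return 100.0 * at / len(bases)


# Hypothetical 14-nt fragment: 12 of its 14 bases are A or T.
print(round(at_content("ATTATAGCATTAAT"), 1))
```

    Ambiguous bases are excluded from the denominator here; whether to count them is a convention that should be stated when reporting the value.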

  4. Human somatostatin I: sequence of the cDNA.

    PubMed Central

    Shen, L P; Pictet, R L; Rutter, W J

    1982-01-01

    RNA has been isolated from a human pancreatic somatostatinoma and used to prepare a cDNA library. After prescreening, clones containing somatostatin I sequences were identified by hybridization with an anglerfish somatostatin I-cloned cDNA probe. From the nucleotide sequence of two of these clones, we have deduced an essentially full-length mRNA sequence, including the preprosomatostatin coding region, 105 nucleotides from the 5' untranslated region and the complete 150-nucleotide 3' untranslated region. The coding region predicts a 116-amino acid precursor protein (Mr, 12.727) that contains somatostatin-14 and -28 at its COOH terminus. The predicted amino acid sequence of human somatostatin-28 is identical to that of somatostatin-28 isolated from the porcine and ovine species. A comparison of the amino acid sequences of human and anglerfish preprosomatostatin I indicated that the COOH-terminal region encoding somatostatin-14 and the adjacent 6 amino acids are highly conserved, whereas the remainder of the molecule, including the signal peptide region, is more divergent. However, many of the amino acid differences found in the pro region of the human and anglerfish proteins are conservative changes. This suggests that the propeptides have a similar secondary structure, which in turn may imply a biological function for this region of the molecule. PMID:6126875

  5. Curated eutherian third party data gene data sets.

    PubMed

    Premzl, Marko

    2016-03-01

    The freely available eutherian genomic sequence data sets have advanced the scientific field of genomics. Of note, future revisions of gene data sets were expected, due to the incompleteness of public eutherian genomic sequence assemblies and potential genomic sequence errors. The eutherian comparative genomic analysis protocol was proposed as guidance in protection against potential genomic sequence errors in public eutherian genomic sequences. The protocol was applicable in updates of 7 major eutherian gene data sets, including 812 complete coding sequences deposited in the European Nucleotide Archive as curated third party data gene data sets.

  6. Complete mitochondrial genome of the whiter-spotted flower chafer, Protaetia brevitarsis (Coleoptera: Scarabaeidae).

    PubMed

    Kim, Min Jee; Im, Hyun Hwak; Lee, Kwang Youll; Han, Yeon Soo; Kim, Iksoo

    2014-06-01

    The complete nucleotide sequence of the mitochondrial genome from the whiter-spotted flower chafer, Protaetia brevitarsis (Coleoptera: Scarabaeidae), was determined. The 20,319-bp long circular genome is the longest among completely sequenced Coleoptera. As is typical in animals, the P. brevitarsis genome consisted of two ribosomal RNAs, 22 transfer RNAs, 13 protein-coding genes and one A + T-rich region. Although the size of the coding genes was typical, the non-coding A + T-rich region was 5654 bp, which is the longest in insects. The extraordinary length of this region was due to 28 117-bp tandem repeats and seven 82-bp tandem repeats. These repeat sequences were encompassed by three non-repeat sequences constituting 1804 bp.
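
    The repeat figures in this abstract can be cross-checked against the stated length of the A + T-rich region. The sketch below assumes the repeats are read as 28 copies of a 117-bp unit and seven copies of an 82-bp unit (an interpretation of the abstract's figures, labeled here as an assumption); with that reading, the repeats plus the 1804 bp of non-repeat sequence account for the full 5654 bp.

```python
# Assumed reading of the repeat description: (copy number, unit length in bp).
repeat_blocks = [(28, 117), (7, 82)]
non_repeat_bp = 1804          # three non-repeat sequences, total length from the abstract
reported_region_bp = 5654     # A + T-rich region length from the abstract

repeat_bp = sum(copies * unit for copies, unit in repeat_blocks)
total_bp = repeat_bp + non_repeat_bp

print(repeat_bp, total_bp, total_bp == reported_region_bp)
```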

  7. GenBank.

    PubMed

    Benson, Dennis A; Karsch-Mizrachi, Ilene; Lipman, David J; Ostell, James; Wheeler, David L

    2008-01-01

    GenBank (R) is a comprehensive database that contains publicly available nucleotide sequences for more than 260 000 named organisms, obtained primarily through submissions from individual laboratories and batch submissions from large-scale sequencing projects. Most submissions are made using the web-based BankIt or standalone Sequin programs and accession numbers are assigned by GenBank staff upon receipt. Daily data exchange with the European Molecular Biology Laboratory Nucleotide Sequence Database in Europe and the DNA Data Bank of Japan ensures worldwide coverage. GenBank is accessible through NCBI's retrieval system, Entrez, which integrates data from the major DNA and protein sequence databases along with taxonomy, genome, mapping, protein structure and domain information, and the biomedical journal literature via PubMed. BLAST provides sequence similarity searches of GenBank and other sequence databases. Complete bimonthly releases and daily updates of the GenBank database are available by FTP. To access GenBank and its related retrieval and analysis services, begin at the NCBI Homepage: www.ncbi.nlm.nih.gov.

  8. GenBank

    PubMed Central

    Benson, Dennis A.; Karsch-Mizrachi, Ilene; Lipman, David J.; Ostell, James; Wheeler, David L.

    2008-01-01

    GenBank® is a comprehensive database that contains publicly available nucleotide sequences for more than 260 000 named organisms, obtained primarily through submissions from individual laboratories and batch submissions from large-scale sequencing projects. Most submissions are made using the web-based BankIt or standalone Sequin programs and accession numbers are assigned by GenBank staff upon receipt. Daily data exchange with the European Molecular Biology Laboratory Nucleotide Sequence Database in Europe and the DNA Data Bank of Japan ensures worldwide coverage. GenBank is accessible through NCBI's retrieval system, Entrez, which integrates data from the major DNA and protein sequence databases along with taxonomy, genome, mapping, protein structure and domain information, and the biomedical journal literature via PubMed. BLAST provides sequence similarity searches of GenBank and other sequence databases. Complete bimonthly releases and daily updates of the GenBank database are available by FTP. To access GenBank and its related retrieval and analysis services, begin at the NCBI Homepage: www.ncbi.nlm.nih.gov. PMID:18073190

  9. Nucleotide sequence of the Saccharomyces cerevisiae PUT4 proline-permease-encoding gene: similarities between CAN1, HIP1 and PUT4 permeases.

    PubMed

    Vandenbol, M; Jauniaux, J C; Grenson, M

    1989-11-15

    The complete nucleotide (nt) sequence of the PUT4 gene, whose product is required for high-affinity proline active transport in the yeast Saccharomyces cerevisiae, is presented. The sequence contains a single long open reading frame of 1881 nt, encoding a polypeptide with a calculated Mr of 68,795. The predicted protein is strongly hydrophobic and exhibits six potential glycosylation sites. Its hydropathy profile suggests the presence of twelve membrane-spanning regions flanked by hydrophilic N- and C-terminal domains. The N terminus does not resemble signal sequences found in secreted proteins. These features are characteristic of integral membrane proteins catalyzing translocation of ligands across cellular membranes. Protein sequence comparisons indicate strong resemblance to the arginine and histidine permeases of S. cerevisiae, but no marked sequence similarity to the proline permease of Escherichia coli or to other known prokaryotic or eukaryotic transport proteins. The strong similarity between the three yeast amino acid permeases suggests a common ancestor for the three proteins.
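
    The reported ORF length and calculated Mr can be related with a short calculation. This is a minimal sketch, assuming the 1881-nt figure is an exact multiple of three and treating every codon as one residue (whether the stop codon is counted in the 1881 nt is not stated in the abstract); the helper name is hypothetical.

```python
# Hypothetical helper: converts an ORF length in nucleotides to a codon count.
def codon_count(orf_nt):
    """Return the number of codons in an ORF, rejecting lengths not divisible by 3."""
    if orf_nt % 3 != 0:
        raise ValueError("ORF length is not a multiple of three")
    return orf_nt // 3


codons = codon_count(1881)            # 627 codons
mean_residue_mass = 68795 / codons    # roughly 110 Da per residue
print(codons, round(mean_residue_mass, 1))
```

    A mean of about 110 Da per residue is in the range usually quoted for proteins, so the two figures in the abstract are mutually consistent under these assumptions.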

  10. Molecular analysis of the split cox1 gene from the Basidiomycota Agrocybe aegerita: relationship of its introns with homologous Ascomycota introns and divergence levels from common ancestral copies.

    PubMed

    Gonzalez, P; Barroso, G; Labarère, J

    1998-10-05

    The Basidiomycota Agrocybe aegerita (Aa) mitochondrial cox1 gene (6790 nucleotides), encoding a protein of 527 aa (58,377 Da), is split by four large subgroup IB introns possessing site-specific endonucleases assumed to be involved in intron mobility. When compared to other fungal COX1 proteins, the Aa protein is closely related to the COX1 protein of the Basidiomycota Schizophyllum commune (Sc). This clade reveals a relationship with the studied Ascomycota ones, with the exception of Schizosaccharomyces pombe (Sp), which occupies an out-group position relative to both higher fungi divisions. When comparison is extended to other kingdoms, fungal COX1 sequences are found to be more related to algae and plant ones (more than 57.5% aa similarity) than to animal sequences (53.6% aa similarity), contrasting with the previously established close relationship between fungi and animals, based on comparisons of nuclear genes. The four Aa cox1 introns are homologous to Ascomycota or algae cox1 introns sharing the same location within the exonic sequences. The percentages of identity of the intronic nucleotide sequences suggest a possible acquisition by lateral transfers of ancestral copies or of their derived sequences. These identities extend over the whole intronic sequences, arguing in favor of a transfer of the complete intron rather than a transfer limited to the encoded ORF. The intron i4 shares 74% identity, at the nucleotide level, with the Podospora anserina (Pa) intron i14, and up to 90.5% aa similarity between the encoded proteins, i.e. the highest values reported to date between introns of two phylogenetically distant species. This low divergence argues for a recent lateral transfer between the two species. In contrast, the low sequence identities (below 36%) observed between Aa i1 and the homologous Sp i1 or Prototheca wickerhamii (Pw) i1 suggest a long evolution time after the separation of these sequences. The introns i2 and i3 possessed intermediate percentages of identity with their homologous Ascomycota introns. This is the first report of the complete nucleotide sequence and molecular organization of a mitochondrial cox1 gene of any member of the Basidiomycota division.

  11. Viral Genome DataBase: storing and analyzing genes and proteins from complete viral genomes.

    PubMed

    Hiscock, D; Upton, C

    2000-05-01

    The Viral Genome DataBase (VGDB) contains detailed information on the genes and predicted protein sequences from 15 completely sequenced genomes of large (>100 kb) viruses (2847 genes). The stored data include DNA sequence, protein sequence, GenBank and user-entered notes, molecular weight (MW), isoelectric point (pI), amino acid content, A + T%, nucleotide frequency, dinucleotide frequency and codon use. The VGDB is a MySQL database with a user-friendly Java GUI. Results of queries can be easily sorted by any of the individual parameters. The software and additional figures and information are available at http://athena.bioc.uvic.ca/genomes/index.html .

  12. A weighted sampling algorithm for the design of RNA sequences with targeted secondary structure and nucleotide distribution.

    PubMed

    Reinharz, Vladimir; Ponty, Yann; Waldispühl, Jérôme

    2013-07-01

    The design of RNA sequences folding into predefined secondary structures is a milestone for many synthetic biology and gene therapy studies. Most of the current software uses similar local search strategies (i.e. a random seed is progressively adapted to acquire the desired folding properties) and, more importantly, does not allow the user to control explicitly the nucleotide distribution such as the GC-content in their sequences. However, the latter is an important criterion for large-scale applications as it could presumably be used to design sequences with better transcription rates and/or structural plasticity. In this article, we introduce IncaRNAtion, a novel algorithm to design RNA sequences folding into target secondary structures with a predefined nucleotide distribution. IncaRNAtion uses a global sampling approach and weighted sampling techniques. We show that our approach is fast (i.e. running time comparable or better than local search methods), seedless (we remove the bias of the seed in local search heuristics) and successfully generates high-quality sequences (i.e. thermodynamically stable) for any GC-content. To complete this study, we develop a hybrid method combining our global sampling approach with local search strategies. Remarkably, our glocal methodology overcomes both local and global approaches for sampling sequences with a specific GC-content and target structure. IncaRNAtion is available at csb.cs.mcgill.ca/incarnation/. Supplementary data are available at Bioinformatics online.
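
    The GC-content that the abstract treats as a design target is simply the fraction of G and C bases in a sequence. The following is a minimal illustrative sketch of that measure only; it is not IncaRNAtion's sampling algorithm, and the function name and example sequences are made up for illustration.

```python
# Illustrative only: computes GC-content, not the weighted sampling described above.
def gc_content(seq):
    """Return the fraction of G and C bases in a DNA or RNA sequence (case-insensitive)."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return gc / len(seq)


print(gc_content("GGCCAAUU"))  # 0.5
print(gc_content("gcgc"))      # 1.0
```

    A design tool that targets a given GC-content could use a function like this to check candidate sequences against the requested value.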

  13. Complete genomic sequence of an infectious pancreatic necrosis virus isolated from rainbow trout (Oncorhynchus mykiss) in China.

    PubMed

    Ji, Feng; Zhao, Jing-Zhuang; Liu, Miao; Lu, Tong-Yan; Liu, Hong-Bai; Yin, Jiasheng; Xu, Li-Ming

    2017-04-01

    Infectious pancreatic necrosis (IPN) is a significant disease of farmed salmonids resulting in direct economic losses due to high mortality in China. However, no gene sequence of any Chinese infectious pancreatic necrosis virus (IPNV) isolate was available. In this study, moribund rainbow trout fry samples were collected during an outbreak of IPN in Yunnan province of southwest China in 2013. An IPNV was isolated and tentatively named ChRtm213. We determined the full genome sequence of the IPNV ChRtm213 and compared it with previously identified IPNV sequences worldwide. The sequences of different structural and non-structural protein genes were compared to those of other aquatic birnaviruses sequenced to date. The results indicated that the complete genome sequence of the ChRtm213 strain contains a segment A (3099 nucleotides) encoding a polyprotein VP2-VP4-VP3, and a segment B (2789 nucleotides) encoding an RNA-dependent RNA polymerase VP1. The phylogenetic analyses showed that the ChRtm213 strain fell within genogroup 1, serotype A9 (Jasper), having similarities of 96.3% (segment A) and 97.3% (segment B) with the IPNV strain AM98 from Japan. The results suggest that the Chinese IPNV isolate has a relatively close relationship with Japanese IPNV strains. The sequence of ChRtm213 was the first gene sequence of an IPNV isolate from China. This study provided a robust reference for diagnosis and/or control of IPNV prevalent in China.

  14. Leek yellow stripe virus isolates from Brazil form a distant clade based on the P1 gene

    USDA-ARS's Scientific Manuscript database

    The complete genomic sequence of a garlic isolate of Leek yellow stripe virus from Brazil (LYSV-MG) has been determined, and phylogenetic comparisons made to LYSV isolates from other parts of the world. In addition, the nucleotide sequence of the 5'UTR and part of the P1 gene of multiple LYSV isolat...

  15. Nucleotide sequence of the gag gene and gag-pol junction of feline leukemia virus.

    PubMed Central

    Laprevotte, I; Hampe, A; Sherr, C J; Galibert, F

    1984-01-01

    The nucleotide sequence of the gag gene of feline leukemia virus and its flanking sequences were determined and compared with the corresponding sequences of two strains of feline sarcoma virus and with that of the Moloney strain of murine leukemia virus. A high degree of nucleotide sequence homology between the feline leukemia virus and murine leukemia virus gag genes was observed, suggesting that retroviruses of domestic cats and laboratory mice have a common, proximal evolutionary progenitor. The predicted structure of the complete feline leukemia virus gag gene precursor suggests that the translation of nonglycosylated and glycosylated gag gene polypeptides is initiated at two different AUG codons. These initiator codons fall in the same reading frame and are separated by a 222-base-pair segment which encodes an amino terminal signal peptide. The nucleotide sequence predicts the order of amino acids in each of the individual gag-coded proteins (p15, p12, p30, p10), all of which derive from the gag gene precursor. Stable stem-and-loop secondary structures are proposed for two regions of viral RNA. The first falls within sequences at the 5' end of the viral genome, together with adjacent palindromic sequences which may play a role in dimer linkage of RNA subunits. The second includes coding sequences at the gag-pol junction and is proposed to be involved in translation of the pol gene product. Sequence analysis of the latter region shows that the gag and pol genes are translated in different reading frames. Classical consensus splice donor and acceptor sequences could not be localized to regions which would permit synthesis of the expected gag-pol precursor protein. Alternatively, we suggest that the pol gene product (RNA-dependent DNA polymerase) could be translated by a frameshift suppressing mechanism which could involve cleavage modification of stems and loops in a manner similar to that observed in tRNA processing. PMID:6328019

  16. Molecular cloning and nucleotide sequence of the alpha and beta subunits of allophycocyanin from the cyanelle genome of Cyanophora paradoxa.

    PubMed Central

    Bryant, D A; de Lorimier, R; Lambert, D H; Dubbs, J M; Stirewalt, V L; Stevens, S E; Porter, R D; Tam, J; Jay, E

    1985-01-01

    The genes for the alpha- and beta-subunit apoproteins of allophycocyanin (AP) were isolated from the cyanelle genome of Cyanophora paradoxa and subjected to nucleotide sequence analysis. The AP beta-subunit apoprotein gene was localized to a 7.8-kilobase-pair Pst I restriction fragment from cyanelle DNA by hybridization with a tetradecameric oligonucleotide probe. Sequence analysis using that oligonucleotide and its complement as primers for the dideoxy chain-termination sequencing method confirmed the presence of both AP alpha- and beta-subunit genes on this restriction fragment. Additional oligonucleotide primers were synthesized as sequencing progressed and were used to determine rapidly the nucleotide sequence of a 1336-base-pair region of this cloned fragment. This strategy allowed the sequencing to be completed without a detailed restriction map and without extensive and time-consuming subcloning. The sequenced region contains two open reading frames whose deduced amino acid sequences are 81-85% homologous to cyanobacterial and red algal AP subunits whose amino acid sequences have been determined. The two open reading frames are in the same orientation and are separated by 39 base pairs. AP alpha is 5' to AP beta and both coding sequences are preceded by a polypurine, Shine-Dalgarno-type sequence. Sequences upstream from AP alpha closely resemble the Escherichia coli consensus promoter sequences and also show considerable homology to promoter sequences for several chloroplast-encoded psbA genes. A 56-base-pair palindromic sequence downstream from the AP beta gene could play a role in the termination of transcription or translation. The allophycocyanin apoprotein subunit genes are located on the large single-copy region of the cyanelle genome. PMID:2987916

  17. The complete mitochondrial genome of Rapana venosa (Gastropoda, Muricidae).

    PubMed

    Sun, Xiujun; Yang, Aiguo

    2016-01-01

    The complete mitochondrial (mt) genome of the veined rapa whelk, Rapana venosa, was determined using genome walking techniques in this study. The total length of the mt genome sequence of R. venosa was 15,271 bp, which is comparable to the reported Muricidae mitogenomes to date. It contained 13 protein-coding genes, 21 transfer RNA genes, and two ribosomal RNA genes. A bias towards a higher representation of nucleotides A and T (69%) was detected in the mt genome of R. venosa. A small number of non-coding nucleotides (302 bp) was detected, and the largest non-coding region was 74 bp in length.

  18. The complete nucleotide sequence of the domestic dog (Canis familiaris) mitochondrial genome.

    PubMed

    Kim, K S; Lee, S E; Jeong, H W; Ha, J H

    1998-10-01

    The complete nucleotide sequence of the mitochondrial genome of the domestic dog, Canis familiaris, was determined. The length of the sequence was 16,728 bp; however, the length was not absolute due to the variation (heteroplasmy) caused by differing numbers of the repetitive motif, 5'-GTACACGT(A/G)C-3', in the control region. The genome organization, gene contents, and codon usage conformed to those of other mammalian mitochondrial genomes. Although its features were unknown, the "CTAGA" duplication event which followed the translational stop codon of the COII gene was not observed in other mammalian mitochondrial genomes. In order to determine the possible differences between mtDNAs in carnivores, two rRNA and 13 protein-coding genes from the cat, dog, and seal were compared. The combined molecular differences, in two rRNA genes as well as in the inferred amino acid sequences of the mitochondrial 13 protein-coding genes, suggested that there is a closer relationship between the dog and the seal than there is between either of these species and the cat. Based on the molecular differences of the mtDNA, the evolutionary divergence between the cat, the dog, and the seal was dated to approximately 50 +/- 4 million years ago. The degree of difference between carnivore mtDNAs varied according to the individual protein-coding gene applied, showing that the evolutionary relationships of distantly related species should be presented in an extended study based on ample sequence data like complete mtDNA molecules. Copyright 1998 Academic Press.

  19. Complete genomic characterization of milk vetch dwarf virus isolates from cowpea and broad bean in Anhui province, China.

    PubMed

    Zhang, Chenhua; Zheng, Hongying; Yan, Dankan; Han, Kelei; Song, Xijiao; Liu, Yong; Zhang, Dongfang; Chen, Jianping; Yan, Fei

    2017-08-01

    Cowpea and broad bean plants showing severe stunting and leaf rolling symptoms were observed in Hefei city, Anhui province, China, in 2014. Symptomatic plants from both species were shown to be infected with milk vetch dwarf virus (MDV) by PCR. The complete genomes of MDV isolates from cowpea and broad bean were sequenced. Each of them had eight genomic DNAs that differed between the two isolates by 10.7% in their overall nucleotide sequences. In addition, the MDV genomes from cowpea and broad bean were associated with two and three alphasatellite DNAs, respectively. This is the first report of MDV on cowpea in China and the first complete genome sequences of Chinese MDV isolates.

  20. [Complete nucleotide sequences and genome structure of two Chinese tobacco mosaic virus isolates deduced from full-length infectious cDNA clones].

    PubMed

    Yang, G; Liu, X G; Qiu, B S

    2000-07-01

    The complete nucleotide sequences of two Chinese tobacco mosaic virus (TMV) isolates, TMV-Cv (vulgare strain) and TMV-N14 (an attenuated virus originating from a tomato strain), were determined from their respective full-length infectious cDNA clones and compared with published TMV sequences. The genome of TMV-Cv contained 6395 nucleotides, in which four functional open reading frames (ORFs), coding for replicase (126 kD/183 kD), movement protein (MP, 30 kD) and coat protein (CP, 17.6 kD) respectively, could be recognized. TMV-N14 contained 6384 nucleotides in its genome. In contrast to TMV-Cv, five functional ORFs encoding the replicase 98.5 kD/126 kD/183 kD, MP (27 kD) and CP (17.6 kD), respectively, were detected in the TMV-N14 genome. TMV-Cv is 99% homologous to a Korean TMV isolate belonging to the vulgare strain at the nucleotide level. TMV-N14 is 99% homologous to a highly virulent Japanese isolate TMV-L (tomato strain) at the nucleotide level. In TMV-N14, one opal mutation (UGA) occurred in the replicase gene and one ochre mutation (UAA) in the MP gene. The former mutation created a potential, additional ORF within the replicase gene; the latter reduced the size of the MP to 27 kD. In addition, there were also 13 amino acid substitutions in the replicase gene of TMV-N14 when compared to that of TMV-L. Collectively, these changes may have significant implications in the attenuation of the virulence of TMV-N14.

  1. Molecular Characterization of Bombyx mori Cytoplasmic Polyhedrosis Virus Genome Segment 4

    PubMed Central

    Ikeda, Keiko; Nagaoka, Sumiharu; Winkler, Stefan; Kotani, Kumiko; Yagi, Hiroaki; Nakanishi, Kae; Miyajima, Shigetoshi; Kobayashi, Jun; Mori, Hajime

    2001-01-01

    The complete nucleotide sequence of the genome segment 4 (S4) of Bombyx mori cytoplasmic polyhedrosis virus (BmCPV) was determined. The 3,259-nucleotide sequence contains a single long open reading frame which spans nucleotides 14 to 3187 and which is predicted to encode a protein with a molecular mass of about 130 kDa. Western blot analysis showed that S4 encodes BmCPV protein VP3, which is one of the outer components of the BmCPV virion. Sequence analysis of the deduced amino acid sequence of BmCPV VP3 revealed possible sequence homology with proteins from rice ragged stunt virus (RRSV) S2, Nilaparvata lugens reovirus S4, and Fiji disease fijivirus S4. This may suggest that plant reoviruses originated from insect viruses and that RRSV emerged more recently than other plant reoviruses. A chimeric protein consisting of BmCPV VP3 and green fluorescent protein (GFP) was constructed and expressed with BmCPV polyhedrin using a baculovirus expression vector. The VP3-GFP chimera was incorporated into BmCPV polyhedra and released under alkaline conditions. The results indicate that specific interactions occur between BmCPV polyhedrin and VP3 which might facilitate BmCPV virion occlusion into the polyhedra. PMID:11134312
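
    The ORF coordinates quoted above can be checked for frame consistency. This is a small sketch, assuming the coordinates are 1-based and inclusive (the usual convention for sequence annotations, but not stated in the abstract); the function name is hypothetical.

```python
# Hypothetical helper: assumes 1-based, inclusive nucleotide coordinates.
def orf_span_length(start, end):
    """Return the number of nucleotides from start to end, both inclusive."""
    return end - start + 1


length = orf_span_length(14, 3187)     # 3174 nt
codons, remainder = divmod(length, 3)  # 1058 codons, remainder 0
print(length, codons, remainder)
```

    Under that assumption the span is a whole number of codons, as expected for a single open reading frame; if the final codon is a stop codon, the encoded protein would be one residue shorter than the codon count.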

  2. Skipping of exon 27 in C3 gene compromises TED domain and results in complete human C3 deficiency.

    PubMed

    da Silva, Karina Ribeiro; Fraga, Tatiana Rodrigues; Lucatelli, Juliana Faggion; Grumach, Anete Sevciovic; Isaac, Lourdes

    2016-05-01

    Primary deficiency of complement C3 is rare and usually associated with increased susceptibility to bacterial infections. In this work, we investigated the molecular basis of complete C3 deficiency in a Brazilian 9-year old female patient with a family history of consanguinity. Hemolytic assays revealed complete lack of complement-mediated hemolytic activity in the patient's serum. While levels of the complement regulatory proteins Factor I, Factor H and Factor B were normal in the patient's and family members' sera, complement C3 levels were undetectable in the patient's serum and were reduced by at least 50% in the sera of the patient's parents and brother. Additionally, no C3 could be observed in the patient's plasma and cell culture supernatants by Western blot. We also observed that patient's skin fibroblasts stimulated with Escherichia coli LPS were unable to secrete C3, which might be accumulated within the cells before being intracellularly degraded. Sequencing analysis of the patient's C3 cDNA revealed a genetic mutation responsible for the complete skipping of exon 27, resulting in the loss of 99 nucleotides (3450-3549) located in the TED domain. Sequencing of the intronic region between the exons 26 and 27 of the C3 gene (nucleotides 6690313-6690961) showed a nucleotide exchange (T→C) at position 6690626 located in a splicing donor site, resulting in the complete skipping of exon 27 in the C3 mRNA. Copyright © 2016. Published by Elsevier GmbH.

  3. Molecular typing and characterization of a new serotype of human enterovirus (EV-B111) identified in China.

    PubMed

    Zhang, Yong; Hong, Mei; Sun, Qiang; Zhu, Shuangli; Tsewang; Li, Xiaolei; Yan, Dongmei; Wang, Dongyan; Xu, Wenbo

    2014-04-01

    Molecular methods, based on sequencing the region encoding the complete VP1 or P1 protein, have enabled the rapid identification of new enterovirus serotypes. In the present study, the complete genome of a newly discovered enterovirus serotype, strain Q0011/XZ/CHN/2000 (hereafter referred to as Q0011), was sequenced and analyzed. The virus, isolated from a stool sample from a patient with acute flaccid paralysis in the Tibet region of China in 2000, was characterized by amplicon sequencing and comparison to a GenBank database of enterovirus nucleotide sequences. The nucleotide sequence encoding the complete VP1 capsid protein is most closely related to the sequences of viruses within the species enterovirus B (EV-B), but is less than 72.1% identical to the homologous sequences of the recognized human enterovirus serotypes, with the greatest homology to EV-B101 and echovirus 32. Moreover, the deduced amino acid sequence of the complete VP1 region is less than 84.7% identical to those of the recognized serotypes, suggesting that the strain is a new serotype of enterovirus within EV-B. The virus was characterized as a new enterovirus type, named EV-B111, by the Picornaviridae Study Group of the International Committee on Taxonomy of Viruses. Low positive rate and titer of neutralizing antibody against EV-B111 were found in the Tibet region of China. Nearly 50% of children ≤5 years had no neutralizing antibody against EV-B111. So the extent of transmission and the exposure of the population to this new EV are very limited. This is the first identification of a new serotype of human enterovirus in China, and strain Q0011 was designated the prototype strain of EV-B111. Copyright © 2014 Elsevier B.V. All rights reserved.

  4. [Sequencing and analysis of the complete genome of a rabies virus isolate from Sika deer].

    PubMed

    Zhao, Yun-Jiao; Guo, Li; Huang, Ying; Zhang, Li-Shi; Qian, Ai-Dong

    2008-05-01

    One DRV strain was isolated from Sika deer brain and sequenced. Nine overlapping gene fragments were amplified by RT-PCR using the 3'-RACE and 5'-RACE methods, and the complete DRV genome sequence was assembled. The length of the complete genome is 11,863 bp. The DRV genome organization was similar to that of other rabies viruses, comprising five genes, and the initiation and termination sites were highly conserved. There were mutated amino acids in important antigenic sites of the nucleoprotein and glycoprotein. The nucleotide and amino acid homologies of the N, P, M, G and L genes were compared among strains with completed genomic sequencing. A phylogenetic tree was constructed by comparison with the N gene sequences of other typical rabies viruses. These results indicated that DRV belonged to genotype 1. The highest homology, 94%, was with the Chinese vaccine strain 3aG, and the lowest, 71%, was with WCBV. These findings provide a theoretical reference for further research on rabies virus.

  5. Identification of the initiation site of poliovirus polyprotein synthesis

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Dorner, A.J.; Dorner, L.F.; Larsen, G.R.

    1982-06-01

    The complete nucleotide sequence of poliovirus RNA has a long open reading frame capable of encoding the precursor polyprotein NCVP00. The first AUG codon in this reading frame is located 743 nucleotides from the 5' end of the RNA and is preceded by eight AUG codons in all three reading frames. Because all proteins that map at the amino terminus of the polyprotein (P1-1a, VP0, and VP4) are blocked at their amino termini and previous studies of ribosome binding have been inconclusive, direct identification of the initiation site of protein synthesis was difficult. We separated and identified all of the tryptic peptides of capsid protein VP4 and correlated these peptides with the amino acid sequence predicted to follow the AUG codon at nucleotide 743. Our data indicate that VP4 begins with a blocked glycine that is encoded immediately after the AUG codon at nucleotide 743. An S1 nuclease analysis of poliovirus mRNA failed to reveal a splice in the 5' region. We concluded that synthesis of poliovirus polyprotein is initiated at nucleotide 743, the first AUG codon in the long open reading frame.

  6. Nucleotide sequence analysis of the gene encoding the Deinococcus radiodurans surface protein, derived amino acid sequence, and complementary protein chemical studies

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Peters, J.; Peters, M.; Lottspeich, F.

    1987-11-01

    The complete nucleotide sequence of the gene encoding the surface (hexagonally packed intermediate (HPI))-layer polypeptide of Deinococcus radiodurans Sark was determined and found to encode a polypeptide of 1036 amino acids. Amino acid sequence analysis of about 30% of the residues revealed that the mature polypeptide consists of at least 978 amino acids. The N terminus was blocked to Edman degradation. The results of proteolytic modification of the HPI layer in situ and Mr estimations of the HPI polypeptide expressed in Escherichia coli indicated that there is a leader sequence. The N-terminal region contained a very high percentage (29%) of threonine and serine, including a cluster of nine consecutive serine or threonine residues, whereas a stretch near the C terminus was extremely rich in aromatic amino acids (29%). The protein contained at least two disulfide bridges, as well as tightly bound reducing sugars and fatty acids.

  7. Genomic organization and sequence of the Gus-s^a allele of the murine β-glucuronidase gene

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Funkenstein, B.; Leary, S.L.; Stein, J.C.

    1988-03-01

    The Gus-s^a allele of the mouse β-glucuronidase gene exhibits a high degree of inducibility by androgens due to its linkage with the Gus-r^a regulatory locus. The authors isolated Gus-s^a on a 28-kilobase pair fragment of mouse chromosome 5 and found that it contains 12 exons and 11 intervening sequences spanning 14 kilobase pairs of this genomic segment. The mRNA cap site was identified by ribonuclease protection and primer extension analyses which revealed an unusually short 5' noncoding sequence of 12 nucleotides. Proximal regulatory sequences in the 5'-flanking DNA and the complete sequence of the Gus-s^a mRNA transcript were also determined. Comparison of the amino acid sequence determined from the Gus-s^a nucleotide sequence with that of human β-glucuronidase indicated that the two human mRNA species differ due to alternate splicing of an exon homologous to exon 6 of the mouse gene.

  8. 37 CFR 1.822 - Symbols and format to be used for nucleotide and/or amino acid sequence data.

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ... for nucleotide and/or amino acid sequence data. 1.822 Section 1.822 Patents, Trademarks, and... Amino Acid Sequences § 1.822 Symbols and format to be used for nucleotide and/or amino acid sequence data. (a) The symbols and format to be used for nucleotide and/or amino acid sequence data shall...

  9. Study of mitochondria D-loop gene to detect the heterogeneity of gemak in Turnicidae family

    NASA Astrophysics Data System (ADS)

    Setiati, N.; Partaya

    2018-03-01

    As part of biodiversity, birds of the family Turnicidae should be protected from extinction and from a decline in their heterogeneity. One way to provide a strategic basis for germplasm (plasma nutfah) conservation is to study genetic heterogeneity. The aim of this research is to analyze, at the molecular level, the D-loop gene of mitochondrial DNA from gemak birds of the family Turnicidae. The results of the analysis may reveal the genetic heterogeneity of gemak birds based on the sequence of the D-loop gene. Collecting both types of gemak is still easy, since they can be found in rice fields after harvest, particularly gemak loreng (Turnix sylvatica), while gemak tegalan (Turnix suscitator) is becoming difficult to find. Based on the DNA quantification standard, most of the gemak blood samples in this research were classed as pure (values ranging from 1.63 to 1.90) and were suitable for PCR analysis. The sequencing analysis did not recover the complete nucleotide sequence. However, it indicates polymorphism in the bases that make up the D-loop gene. The D-loop gene may identify genetic heterogeneity of gemak birds of the family Turnicidae, but further sequencing analysis with the PCR-RFLP technique is necessary. The complete nucleotide sequence can be obtained and is easy to detect after digestion with a restriction enzyme.

  10. The complete genome sequence and genetic analysis of ΦCA82 a novel uncultured microphage from the turkey gastrointestinal system

    PubMed Central

    2011-01-01

    The genomic DNA sequence of a novel enteric uncultured microphage, ΦCA82 from a turkey gastrointestinal system was determined utilizing metagenomics techniques. The entire circular, single-stranded nucleotide sequence of the genome was 5,514 nucleotides. The ΦCA82 genome is quite different from other microviruses as indicated by comparisons of nucleotide similarity, predicted protein similarity, and functional classifications. Only three genes showed significant similarity to microviral proteins as determined by local alignments using BLAST analysis. ORF1 encoded a predicted phage F capsid protein that was phylogenetically most similar to the Microviridae ΦMH2K member's major coat protein. The ΦCA82 genome also encoded a predicted minor capsid protein (ORF2) and putative replication initiation protein (ORF3) most similar to the microviral bacteriophage SpV4. The distant evolutionary relationship of ΦCA82 suggests that the divergence of this novel turkey microvirus from other microviruses may reflect unique evolutionary pressures encountered within the turkey gastrointestinal system. PMID:21714899

  11. The nucleotide sequence of RNA1 of Lettuce big-vein virus, genus Varicosavirus, reveals its relation to nonsegmented negative-strand RNA viruses.

    PubMed

    Sasaya, Takahide; Ishikawa, Koichi; Koganezawa, Hiroki

    2002-06-05

    The complete nucleotide sequence of RNA1 from Lettuce big-vein virus (LBVV), the type member of the genus Varicosavirus, was determined. LBVV RNA1 consists of 6797 nucleotides and contains one large ORF that encodes a large (L) protein of 2040 amino acids with a predicted M(r) of 232,092. Northern blot hybridization analysis indicated that the LBVV RNA1 is a negative-sense RNA. Database searches showed that the amino acid sequence of L protein is homologous to those of L polymerases of nonsegmented negative-strand RNA viruses. A cluster dendrogram derived from alignments of the LBVV L protein and the L polymerases indicated that the L protein is most closely related to the L polymerases of plant rhabdoviruses. Transcription termination/polyadenylation signal-like poly(U) tracts that resemble those in rhabdovirus and paramyxovirus RNAs were present upstream and downstream of the coding region. Although LBVV is related to rhabdoviruses, a key distinguishing feature is that the genome of LBVV is segmented. The results reemphasize the need to reconsider the taxonomic position of varicosaviruses.

  12. Protein structure and the sequential structure of mRNA: alpha-helix and beta-sheet signals at the nucleotide level.

    PubMed

    Brunak, S; Engelbrecht, J

    1996-06-01

    A direct comparison of experimentally determined protein structures and their corresponding protein coding mRNA sequences has been performed. We examine whether real world data support the hypothesis that clusters of rare codons correlate with the location of structural units in the resulting protein. The degeneracy of the genetic code allows for a biased selection of codons which may control the translational rate of the ribosome, and may thus in vivo have a catalyzing effect on the folding of the polypeptide chain. A complete search for GenBank nucleotide sequences coding for structural entries in the Brookhaven Protein Data Bank produced 719 protein chains with matching mRNA sequence, amino acid sequence, and secondary structure assignment. By neural network analysis, we found strong signals in mRNA sequence regions surrounding helices and sheets. These signals do not originate from the clustering of rare codons, but from the similarity of codons coding for very abundant amino acid residues at the N- and C-termini of helices and sheets. No correlation between the positioning of rare codons and the location of structural units was found. The mRNA signals were also compared with conserved nucleotide features of 16S-like ribosomal RNA sequences and related to mechanisms for maintaining the correct reading frame by the ribosome.
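
    The rare-codon hypothesis tested in this abstract depends on knowing how often each codon is used. As a minimal sketch only (this is not the authors' neural network analysis), the Python below counts codons in an in-frame coding sequence and lists the positions of codons at or below a count threshold. The function names, the toy sequence, and the threshold are hypothetical and chosen purely for illustration.

```python
from collections import Counter


def codon_counts(cds: str) -> Counter:
    """Count codons in an in-frame coding sequence (DNA or mRNA letters)."""
    seq = cds.upper().replace("U", "T")  # treat mRNA and DNA alike
    if len(seq) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    return Counter(seq[i:i + 3] for i in range(0, len(seq), 3))


def rare_codon_positions(cds: str, usage: Counter, max_count: int) -> list[int]:
    """Return 0-based codon indices whose codon occurs at most max_count times in usage."""
    seq = cds.upper().replace("U", "T")
    return [
        i // 3
        for i in range(0, len(seq) - 2, 3)
        if usage[seq[i:i + 3]] <= max_count
    ]


# Toy sequence (hypothetical, not taken from GenBank): ATG GCT GCT GCA GCT TAA
toy_cds = "ATGGCTGCTGCAGCTTAA"
usage = codon_counts(toy_cds)
print(usage.most_common())
print(rare_codon_positions(toy_cds, usage, max_count=1))
```

    In a real analysis the usage table would come from a large reference set of genes for the organism rather than from the single sequence being scanned.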

  13. Novel avian paramyxovirus (APMV-15) isolated from a migratory bird in South America.

    PubMed

    Thomazelli, Luciano Matsumiya; de Araújo, Jansen; Fabrizio, Thomas; Walker, David; Reischak, Dilmara; Ometto, Tatiana; Barbosa, Carla Meneguin; Petry, Maria Virginia; Webby, Richard J; Durigon, Edison Luiz

    2017-01-01

    A novel avian paramyxovirus (APMV) isolated from a migratory bird cloacal swab obtained during active surveillance in April 2012 in the Lagoa do Peixe National Park, Rio Grande do Sul state, southern Brazil, was biologically and genetically characterized. The nucleotide sequence of the full viral genome was completed using a next-generation sequencing approach. The genome was 14,952 nucleotides (nt) long, with six genes (3'-NP-P-M-F-HN-L-5') encoding 7 different proteins, typical of APMV. The fusion (F) protein gene of isolate RS-1177 contained 1,707 nucleotides in a single open reading frame encoding a protein of 569 amino acids. The F protein cleavage site contained two basic amino acids (VPKER↓L), typical of avirulent strains. Phylogenetic analysis of the whole genome indicated that the virus is related to APMV-10, -2 and -8, with 60.1% nucleotide sequence identity to the closest APMV-10 virus, 58.7% and 58.5% identity to the closest APMV-8 and APMV-2 genome, respectively, and less than 52% identity to representatives of the other APMV groups. Such distances are comparable to the distances observed among other previously identified APMV serotypes. These results suggest that unclassified/calidris_fuscicollis/Brazil/RS-1177/2012 is the prototype strain of a new APMV serotype, APMV-15.

  14. Genetic Diversity of Crimean Congo Hemorrhagic Fever Virus Strains from Iran

    PubMed Central

    Chinikar, Sadegh; Bouzari, Saeid; Shokrgozar, Mohammad Ali; Mostafavi, Ehsan; Jalali, Tahmineh; Khakifirouz, Sahar; Nowotny, Norbert; Fooks, Anthony R.; Shah-Hosseini, Nariman

    2016-01-01

    Background: Crimean Congo hemorrhagic fever virus (CCHFV) is a member of the Bunyaviridae family and Nairovirus genus. It has a negative-sense, single stranded RNA genome approximately 19.2 kb, containing the Small, Medium, and Large segments. CCHFVs are relatively divergent in their genome sequence and grouped in seven distinct clades based on S-segment sequence analysis and six clades based on M-segment sequences. Our aim was to obtain new insights into the molecular epidemiology of CCHFV in Iran. Methods: We analyzed partial and complete nucleotide sequences of the S and M segments derived from 50 Iranian patients. The extracted RNA was amplified using one-step RT-PCR and then sequenced. The sequences were analyzed using Mega5 software. Results: Phylogenetic analysis of partial S segment sequences demonstrated that clade IV-(Asia 1), clade IV-(Asia 2) and clade V-(Europe) accounted for 80 %, 4 % and 14 % of the circulating genomic variants of CCHFV in Iran respectively. However, one of the Iranian strains (Iran-Kerman/22) was associated with none of other sequences and formed a new clade (VII). The phylogenetic analysis of complete S-segment nucleotide sequences from selected Iranian CCHFV strains complemented with representative strains from GenBank revealed similar topology as partial sequences with eight major clusters. A partial M segment phylogeny positioned the Iranian strains in either association with clade III (Asia-Africa) or clade V (Europe). Conclusion: The phylogenetic analysis revealed subtle links between distant geographic locations, which we propose might originate either from international livestock trade or from long-distance carriage of CCHFV by infected ticks via bird migration. PMID:27308271

  15. Characterization of a tandemly repeated DNA sequence family originally derived by retroposition of tRNA(Glu) in the newt.

    PubMed

    Nagahashi, S; Endoh, H; Suzuki, Y; Okada, N

    1991-11-20

    A previous report from this laboratory showed that in vitro transcription of total genomic DNA of the newt Cynops pyrrhogaster resulted in a discrete sized 8 S RNA, which represented highly repetitive and transcribable sequences with a glutamic acid tRNA-like structure in the newt genome. We isolated four independent clones from a newt genomic library and determined the complete sequences of three 2000 to 2400 base-pair PstI fragments spanning the 8 S RNA gene. The glutamic acid tRNA-related segment in the 8 S RNA gene contains the CCA sequence expected as the 3' terminus of a tRNA molecule. Further, the 11 nucleotides located 13 nucleotides upstream from one of the two transcription initiation sites of the 8 S RNA were found to be repeated in the region upstream from the termination site, suggesting that the original unit, which is shorter than the 8 S RNA, was retrotransposed via cDNA intermediates from the PolIII transcript. In the upstream region of the 8 S RNA gene, a 360 nucleotide unit containing the glutamic acid tRNA-related segment was found to be duplicated (clones NE1 and NE10) or triplicated (clone NE3). Except for the difference in the number of the 360 nucleotide unit, the three sequences of the 2000 to 2400 base-pair PstI fragment were essentially the same with only a few mutations and minor deletions. Inverse polymerase chain reaction and sequence determination of the products, together with a Southern hybridization experiment, demonstrated that the family consists of a tandemly repeated unit of 3300, 3700 or 4100 base-pairs. Thus during evolution, this family in the newt was created by retroposition via cDNA intermediates, followed by duplication or triplication of the 360 nucleotide unit and multiplication of the 3300 to 4100 base-pair region at the DNA level.

  16. The maize stripe virus major noncapsid protein messenger RNA transcripts contain heterogeneous leader sequences at their 5' termini.

    PubMed

    Huiet, L; Feldstein, P A; Tsai, J H; Falk, B W

    1993-12-01

    Primer extension analyses and a PCR-based cloning strategy were used to identify and characterize 5' nucleotide sequences on the maize stripe virus (MStV) RNA4 mRNA transcripts encoding the major noncapsid protein (NCP). Direct RNA sequence analysis by primer extension showed that the NCP mRNA transcripts had 10-15 nucleotides beyond the 5' terminus of the MStV RNA4 nucleotide sequence. MStV genomic RNAs isolated from ribonucleoprotein particles (RNPs) lacked the additional 5' nucleotides. cDNA clones representing the 5' region of the mRNA transcripts were constructed, and the nucleotide sequences of the 5' regions were determined for 16 clones. Each was found to have a distinct 10-15 nucleotide sequence immediately 5' of the MStV RNA4 sequence. Eleven of 16 clones had the correct MStV RNA4 5' nucleotide sequence, while five showed minor variations at or near the 5' most MStV RNA4 nucleotide. These characteristics show strong similarities to other viral mRNA transcripts which are synthesized by cap snatching.

  17. 37 CFR 1.821 - Nucleotide and/or amino acid sequence disclosures in patent applications.

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ... 37 Patents, Trademarks, and Copyrights 1 2010-07-01 2010-07-01 false Nucleotide and/or amino acid... Biotechnology Invention Disclosures Application Disclosures Containing Nucleotide And/or Amino Acid Sequences § 1.821 Nucleotide and/or amino acid sequence disclosures in patent applications. (a) Nucleotide and...

  18. 37 CFR 1.821 - Nucleotide and/or amino acid sequence disclosures in patent applications.

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ... 37 Patents, Trademarks, and Copyrights 1 2011-07-01 2011-07-01 false Nucleotide and/or amino acid... Biotechnology Invention Disclosures Application Disclosures Containing Nucleotide And/or Amino Acid Sequences § 1.821 Nucleotide and/or amino acid sequence disclosures in patent applications. (a) Nucleotide and...

  19. The complete genome sequence, occurrence and host range of Tomato mottle mosaic virus Chinese isolate.

    PubMed

    Li, Yueyue; Wang, Yang; Hu, John; Xiao, Long; Tan, Guanlin; Lan, Pingxiu; Liu, Yong; Li, Fan

    2017-01-31

    Tomato mottle mosaic virus (ToMMV) is a recently identified species in the genus Tobamovirus and was first reported from a greenhouse tomato sample collected in Mexico in 2013. In August 2013, ToMMV was detected on peppers (Capsicum spp.) in China. However, little is known about the molecular and biological characteristics of ToMMV. Reverse transcription-polymerase chain reaction (RT-PCR) and rapid amplification of cDNA ends (RACE) were carried out to obtain the complete genomic sequences of ToMMV. Sap transmission was used to test the host range and pathogenicity of ToMMV. The full-length genomes of two ToMMV isolates infecting peppers in Yunnan Province and Tibet Autonomous Region of China were determined and analyzed. The complete genomic sequences of both ToMMV isolates consisted of 6399 nucleotides and contained four open reading frames (ORFs) encoding 126, 183, 30 and 18 kDa proteins from the 5' to 3' end, respectively. Overall similarities of the ToMMV genome sequence to those of the other tobamoviruses available in GenBank ranged from 49.6% to 84.3%. Phylogenetic analyses of the full-genome nucleotide sequence and the amino acid sequences of its four proteins confirmed that ToMMV was most closely related to Tomato mosaic virus (ToMV). According to the genetic structure, host of origin and phylogenetic relationships, the available 32 tobamoviruses could be divided into at least eight subgroups based on the host plant family they infect: Solanaceae-, Brassicaceae-, Cactaceae-, Apocynaceae-, Cucurbitaceae-, Malvaceae-, Leguminosae-, and Passifloraceae-infecting subgroups. Testing of some solanaceous, cucurbitaceous, brassicaceous and leguminous plants in Yunnan Province and a few other parts of China revealed that ToMMV has so far occurred only on peppers. However, the host range test results showed ToMMV could infect most of the tested solanaceous and cruciferous plants, and had a high affinity for the solanaceous plants.
    The complete nucleotide sequences of two Chinese ToMMV isolates from naturally infected peppers were verified. The tobamoviruses were divided into at least eight subgroups, with ToMMV belonging to the subgroup that infects plants in the Solanaceae. In China, ToMMV has so far occurred only on peppers in the field. ToMMV could infect plants in the families Solanaceae and Cucurbitaceae by sap transmission.

  20. Complete genome analysis of highly pathogenic bovine ephemeral fever virus isolated in Turkey in 2012.

    PubMed

    Abayli, Hasan; Tonbak, Sukru; Azkur, Ahmet Kursat; Bulut, Hakan

    2017-10-01

    Relatively high prevalence and mortality rates of bovine ephemeral fever (BEF) have been reported in recent epidemics in some countries, including Turkey, when compared with previous outbreaks. A limited number of complete genome sequences of BEF virus (BEFV) are available in the GenBank Database. In this study, the complete genome of highly pathogenic BEFV isolated during an outbreak in Turkey in 2012 was analyzed for genetic characterization. The complete genome of the Turkish BEFV isolate was amplified by reverse transcription-polymerase chain reaction (RT-PCR) and sequenced. It was found that the complete genome of the Turkish BEFV isolate was 14,901 nt in length. The complete genome sequence obtained from the study showed 91-92% identity at nucleotide level to Australian (BB7721) and Chinese (Bovine/China/Henan1/2012) BEFV isolates. Phylogenetic analysis of the glycoprotein gene of the Turkish BEFV isolate also showed that Turkish isolates were closely related to Israeli isolates. Because of the limited number of complete BEFV genome sequences, the results from this study will be useful for understanding the global molecular epidemiology and geodynamics of BEF.

  1. Mitochondrial genome nucleotide substitution pattern between domesticated silkmoth, Bombyx mori, and its wild ancestors, Chinese Bombyx mandarina and Japanese Bombyx mandarina

    PubMed Central

    2010-01-01

    Bombyx mori and Bombyx mandarina are morphologically and physiologically similar. In this study, we compared the nucleotide variations in the complete mitochondrial (mt) genomes between the domesticated silkmoth, B. mori, and its wild ancestors, Chinese B. mandarina (ChBm) and Japanese B. mandarina (JaBm). The sequence divergence and transition mutation ratio between B. mori and ChBm are significantly smaller than those observed between B. mori and JaBm. The preference of transition by DNA strands between B. mori and ChBm is consistent with that between B. mori and JaBm; however, the regional variation in nucleotide substitution rate shows a different feature. These results suggest that the ChBm mt genome is not undergoing the same evolutionary process as JaBm, providing evidence for selection on mtDNA. Moreover, investigation of the nucleotide sequence divergence in the A+T-rich region of Bombyx mt genomes also provides evidence for the assumption that the A+T-rich region might not be the fastest evolving region of the mtDNA of insects. PMID:21637625

  2. RECOVIR Software for Identifying Viruses

    NASA Technical Reports Server (NTRS)

    Chakravarty, Sugoto; Fox, George E.; Zhu, Dianhui

    2013-01-01

    Most single-stranded RNA (ssRNA) viruses mutate rapidly to generate a large number of strains with highly divergent capsid sequences. Determining the capsid residues or nucleotides that uniquely characterize these strains is critical in understanding the strain diversity of these viruses. RECOVIR (an acronym for "recognize viruses") software predicts the strains of some ssRNA viruses from their limited sequence data. Novel phylogenetic-tree-based databases of protein or nucleic acid residues that uniquely characterize these virus strains are created. Strains of input virus sequences (partial or complete) are predicted through residue-wise comparisons with the databases. RECOVIR uses unique characterizing residues to automatically identify strains from partial or complete capsid sequences of picornaviruses and caliciviruses, two of the most highly diverse ssRNA virus families. Partition-wise comparisons of the database residues with the corresponding residues of more than 300 complete and partial sequences of these viruses resulted in correct strain identification for all of these sequences. This study shows the feasibility of creating databases of hitherto unknown residues uniquely characterizing the capsid sequences of two of the most highly divergent ssRNA virus families. These databases enable automated strain identification from partial or complete capsid sequences of these human and animal pathogens.

  3. GenBank.

    PubMed

    Benson, Dennis A; Karsch-Mizrachi, Ilene; Lipman, David J; Ostell, James; Sayers, Eric W

    2010-01-01

    GenBank is a comprehensive database that contains publicly available nucleotide sequences for more than 300,000 organisms named at the genus level or lower, obtained primarily through submissions from individual laboratories and batch submissions from large-scale sequencing projects, including whole genome shotgun (WGS) and environmental sampling projects. Most submissions are made using the web-based BankIt or standalone Sequin programs, and accession numbers are assigned by GenBank staff upon receipt. Daily data exchange with the European Molecular Biology Laboratory Nucleotide Sequence Database in Europe and the DNA Data Bank of Japan ensures worldwide coverage. GenBank is accessible through the NCBI Entrez retrieval system, which integrates data from the major DNA and protein sequence databases along with taxonomy, genome, mapping, protein structure and domain information, and the biomedical journal literature via PubMed. BLAST provides sequence similarity searches of GenBank and other sequence databases. Complete bi-monthly releases and daily updates of the GenBank database are available by FTP. To access GenBank and its related retrieval and analysis services, begin at the NCBI homepage: www.ncbi.nlm.nih.gov.

  4. GenBank.

    PubMed

    Benson, Dennis A; Karsch-Mizrachi, Ilene; Lipman, David J; Ostell, James; Sayers, Eric W

    2009-01-01

    GenBank is a comprehensive database that contains publicly available nucleotide sequences for more than 300,000 organisms named at the genus level or lower, obtained primarily through submissions from individual laboratories and batch submissions from large-scale sequencing projects. Most submissions are made using the web-based BankIt or standalone Sequin programs, and accession numbers are assigned by GenBank(R) staff upon receipt. Daily data exchange with the European Molecular Biology Laboratory Nucleotide Sequence Database in Europe and the DNA Data Bank of Japan ensures worldwide coverage. GenBank is accessible through the National Center for Biotechnology Information (NCBI) Entrez retrieval system, which integrates data from the major DNA and protein sequence databases along with taxonomy, genome, mapping, protein structure and domain information, and the biomedical journal literature via PubMed. BLAST provides sequence similarity searches of GenBank and other sequence databases. Complete bimonthly releases and daily updates of the GenBank database are available by FTP. To access GenBank and its related retrieval and analysis services, begin at the NCBI Homepage: www.ncbi.nlm.nih.gov.

  5. A comprehensive analysis of three Asiatic black bear mitochondrial genomes (subspecies ussuricus, formosanus and mupinensis), with emphasis on the complete mtDNA sequence of Ursus thibetanus ussuricus (Ursidae).

    PubMed

    Hwang, Dae-Sik; Ki, Jang-Seu; Jeong, Dong-Hyuk; Kim, Bo-Hyun; Lee, Bae-Keun; Han, Sang-Hoon; Lee, Jae-Seong

    2008-08-01

    In the present paper, we describe the mitochondrial genome sequence of the Asiatic black bear (Ursus thibetanus ussuricus), with particular emphasis on the control region (CR), and compare it with other mitochondrial genomes to examine molecular relationships among the bears. The mitochondrial genome sequence of U. thibetanus ussuricus was 16,700 bp in size with mostly conserved structures (e.g. 13 protein-coding genes, two rRNA genes, 22 tRNA genes). The CR consisted of several typical conserved domains such as F, E, D, and C boxes, and a conserved sequence block. Nucleotide sequences and the repeated motifs in the CR were different among the bear species, and their copy numbers were also variable according to populations, even within F1 generations of U. thibetanus ussuricus. Comparative analyses showed that the CR D1 region was highly informative for the discrimination of the bear family. These findings suggest that nucleotide sequences of both repeated motifs and CR D1 in the bear family are good markers for species discrimination.

  6. Molecular epidemiology of Plum pox virus in Japan.

    PubMed

    Maejima, Kensaku; Himeno, Misako; Komatsu, Ken; Takinami, Yusuke; Hashimoto, Masayoshi; Takahashi, Shuichiro; Yamaji, Yasuyuki; Oshima, Kenro; Namba, Shigetou

    2011-05-01

    For a molecular epidemiological study based on complete genome sequences, 37 Plum pox virus (PPV) isolates were collected from the Kanto region in Japan. Pair-wise analyses revealed that all 37 Japanese isolates belong to the PPV-D strain, with low genetic diversity (less than 0.8%). In phylogenetic analysis of the PPV-D strain based on complete nucleotide sequences, the relationships of the PPV-D strain were reconstructed with high resolution: at the global level, the American, Canadian, and Japanese isolates formed their own distinct monophyletic clusters, suggesting that the routes of viral entry into these countries were independent; at the local level, the actual transmission histories of PPV were precisely reconstructed with high bootstrap support. This is the first description of the molecular epidemiology of PPV based on complete genome sequences.

  7. Rose spring dwarf-associated virus has RNA structural and gene-expression features like those of Barley yellow dwarf virus

    PubMed Central

    Salem, Nida’ M.; Miller, W. Allen; Rowhani, Adib; Golino, Deborah A.; Moyne, Anne-Laure; Falk, Bryce W.

    2015-01-01

    We determined the complete nucleotide sequence of the Rose spring dwarf-associated virus (RSDaV) genomic RNA (GenBank accession no. EU024678) and compared its predicted RNA structural characteristics affecting gene expression. A cDNA library was derived from RSDaV double-stranded RNAs (dsRNAs) purified from infected tissue. Nucleotide sequence analysis of the cloned cDNAs, plus clones generated by 5′- and 3′-RACE, showed the RSDaV genomic RNA to be 5,808 nucleotides. The genomic RNA contains five major open reading frames (ORFs), and three small ORFs in the 3′-terminal 800 nucleotides, typical for viruses of genus Luteovirus in the family Luteoviridae. Northern blot hybridization analysis revealed the genomic RNA and two prominent subgenomic RNAs of approximately 3 kb and 1 kb. Putative 5′ ends of the sgRNAs were predicted by identification of conserved sequences and secondary structures which resembled the Barley yellow dwarf virus (BYDV) genomic RNA 5′ end and subgenomic RNA promoter sequences. Secondary structures of the BYDV-like ribosomal frameshift elements and cap-independent translation elements, including long-distance base pairing spanning four kb, were identified. These contain similarities but also informative differences with the BYDV structures, including a strikingly different structure predicted for the 3′ cap-independent translation element. These analyses of the RSDaV genomic RNA show more complexity for the RNA structural elements for members of the Luteoviridae. PMID:18329064

  8. Rose spring dwarf-associated virus has RNA structural and gene-expression features like those of Barley yellow dwarf virus.

    PubMed

    Salem, Nida' M; Miller, W Allen; Rowhani, Adib; Golino, Deborah A; Moyne, Anne-Laure; Falk, Bryce W

    2008-06-05

    We determined the complete nucleotide sequence of the Rose spring dwarf-associated virus (RSDaV) genomic RNA (GenBank accession no. EU024678) and compared its predicted RNA structural characteristics affecting gene expression. A cDNA library was derived from RSDaV double-stranded RNAs (dsRNAs) purified from infected tissue. Nucleotide sequence analysis of the cloned cDNAs, plus clones generated by 5'- and 3'-RACE, showed the RSDaV genomic RNA to be 5808 nucleotides. The genomic RNA contains five major open reading frames (ORFs), and three small ORFs in the 3'-terminal 800 nucleotides, typical for viruses of genus Luteovirus in the family Luteoviridae. Northern blot hybridization analysis revealed the genomic RNA and two prominent subgenomic RNAs of approximately 3 kb and 1 kb. Putative 5' ends of the sgRNAs were predicted by identification of conserved sequences and secondary structures which resembled the Barley yellow dwarf virus (BYDV) genomic RNA 5' end and subgenomic RNA promoter sequences. Secondary structures of the BYDV-like ribosomal frameshift elements and cap-independent translation elements, including long-distance base pairing spanning four kb, were identified. These contain similarities but also informative differences with the BYDV structures, including a strikingly different structure predicted for the 3' cap-independent translation element. These analyses of the RSDaV genomic RNA show more complexity for the RNA structural elements for members of the Luteoviridae.

  9. Genome characterization of sugarcane yellow leaf virus from China reveals a novel recombinant genotype.

    PubMed

    Lin, Yi-Hua; Gao, San-Ji; Damaj, Mona B; Fu, Hua-Ying; Chen, Ru-Kai; Mirkov, T Erik

    2014-06-01

    Sugarcane yellow leaf virus (SCYLV; genus Polerovirus, family Luteoviridae) is a recombinant virus associated with yellow leaf disease, a serious threat to sugarcane in China and worldwide. Among the nine known SCYLV genotypes existing worldwide, COL, HAW, REU, IND, CHN1, CHN2, BRA, CUB and PER, the last five have been reported in China. In this study, the complete genome sequences (5,880 nt) of GZ-GZ18 and HN-CP502 isolates from the Chinese provinces of Guizhou and Hainan, respectively, were cloned, sequenced and characterized. Phylogenetic analysis showed that, among 29 SCYLV isolates described worldwide, the two Chinese isolates clustered together into an independent clade based on the near-complete genome nucleotide (ORF0-ORF5) or amino acid sequences of individual genes, except for the MP protein (ORF4). We propose that the two isolates represent a novel genotype, CHN3, diverging from other genotypes by 1.7-13.6 % nucleotide differences in ORF0-ORF5, and 2.7-28.1 %, 1.8-20.4 %, 0.5-5.1 % and 2.7-15.9 % amino acid differences in P0 (ORF0), RdRp (RNA-dependent RNA polymerase) (ORF1+2), CP (coat protein) (ORF3) and RT (readthrough protein) (ORF3+5), respectively. CHN3 was closely related to the BRA, HAW and PER genotypes, differing by 1.7-3.8 % in the near-complete genome nucleotide sequence. Recombination analysis further identified CHN3 as a new recombinant strain, arising from the major parent CHN-HN1 and the minor parent CHN-GD-WY19. Recombination breakpoints were distributed mostly within the RdRp region in CHN3 and the four significant recombinant genotypes, IND, REU, CUB and BRA. Recombination is considered to contribute significantly to the evolution and emergence of such new SCYLV variants.

  10. Conserved features of eukaryotic hsp70 genes revealed by comparison with the nucleotide sequence of human hsp70.

    PubMed Central

    Hunt, C; Morimoto, R I

    1985-01-01

    We have determined the nucleotide sequence of the human hsp70 gene and 5' flanking region. The hsp70 gene is transcribed as an uninterrupted primary transcript of 2440 nucleotides composed of a 5' noncoding leader sequence of 212 nucleotides, a 3' noncoding region of 242 nucleotides, and a continuous open reading frame of 1986 nucleotides that encodes a protein with predicted molecular mass of 69,800 daltons. Upstream of the 5' terminus are the canonical TATAAA box, the sequence ATTGG that corresponds in the inverted orientation to the CCAAT motif, and the dyad sequence CTGGAAT/ATTCCCG that shares homology in 12 of 14 positions with the consensus transcription regulatory sequence common to Drosophila heat shock genes. Comparison of the predicted amino acid sequences of human hsp70 with the published sequences of Drosophila hsp70 and Escherichia coli dnaK reveals that human hsp70 is 73% identical to Drosophila hsp70 and 47% identical to E. coli dnaK. Surprisingly, the nucleotide sequences of the human and Drosophila genes are 72% identical and human and E. coli genes are 50% identical, which is more highly conserved than necessary given the degeneracy of the genetic code. The lack of accumulated silent nucleotide substitutions leads us to propose that there may be additional information in the nucleotide sequence of the hsp70 gene or the corresponding mRNA that precludes the maximum divergence allowed in the silent codon positions. PMID:3931075
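
    The transcript composition quoted in this abstract can be checked arithmetically. The sketch below is a small Python check using only the segment lengths stated above; the function and constant names are hypothetical labels introduced for illustration.

```python
# Segment lengths in nucleotides, as quoted in the abstract above.
LEADER_5P = 212      # 5' noncoding leader
ORF = 1986           # continuous open reading frame
NONCODING_3P = 242   # 3' noncoding region
TRANSCRIPT = 2440    # reported primary transcript length


def transcript_length(leader: int, orf: int, trailer: int) -> int:
    """Sum the three contiguous segments of an uninterrupted transcript."""
    return leader + orf + trailer


total = transcript_length(LEADER_5P, ORF, NONCODING_3P)
print(total, total == TRANSCRIPT)   # segments should add up to the reported length
print(ORF % 3 == 0, ORF // 3)       # an in-frame ORF is a whole number of codons
```

    The three segments sum to the reported 2440 nucleotides, and the open reading frame length is a multiple of three, as expected for an uninterrupted coding region.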

  11. Dynamics of actin evolution in dinoflagellates.

    PubMed

    Kim, Sunju; Bachvaroff, Tsvetan R; Handy, Sara M; Delwiche, Charles F

    2011-04-01

    Dinoflagellates have unique nuclei and intriguing genome characteristics with very high DNA content making complete genome sequencing difficult. In dinoflagellates, many genes are found in multicopy gene families, but the processes involved in the establishment and maintenance of these gene families are poorly understood. Understanding the dynamics of gene family evolution in dinoflagellates requires comparisons at different evolutionary scales. Studies of closely related species provide fine-scale information relative to species divergence, whereas comparisons of more distantly related species provide broad context. We selected the actin gene family as a highly expressed conserved gene previously studied in dinoflagellates. Of the 142 sequences determined in this study, 103 were from the two closely related species, Dinophysis acuminata and D. caudata, including full length and partial cDNA sequences as well as partial genomic amplicons. For these two Dinophysis species, at least three types of sequences could be identified. Most copies (79%) were relatively similar and in nucleotide trees, the sequences formed two bushy clades corresponding to the two species. In comparisons within species, only eight to ten nucleotide differences were found between these copies. The two remaining types formed clades containing sequences from both species. One type included the most similar sequences in between-species comparisons with as few as 12 nucleotide differences between species. The second type included the most divergent sequences in comparisons between and within species with up to 93 nucleotide differences between sequences. In all the sequences, most variation occurred in synonymous sites or the 5' UnTranslated Region (UTR), although there was still limited amino acid variation between most sequences.
    Several potential pseudogenes were found (approximately 10% of all sequences depending on species) with incomplete open reading frames due to frameshifts or early stop codons. Overall, variation in the actin gene family fits best with the "birth and death" model of evolution based on recent duplications, pseudogenes, and incomplete lineage sorting. Divergence between species was similar to variation within species, so that actin may be too conserved to be useful for phylogenetic estimation of closely related species.

  12. 77 FR 65537 - Requirements for Patent Applications Containing Nucleotide Sequence and/or Amino Acid Sequence...

    Federal Register 2010, 2011, 2012, 2013, 2014

    2012-10-29

    ... DEPARTMENT OF COMMERCE Patent and Trademark Office Requirements for Patent Applications Containing Nucleotide Sequence and/or Amino Acid Sequence Disclosures ACTION: Proposed collection; comment request... Patent applications that contain nucleotide and/or amino acid sequence disclosures must include a copy of...

  13. Variation in the Nucleotide Sequence of Cottontail Rabbit Papillomavirus a and b Subtypes Affects Wart Regression and Malignant Transformation and Level of Viral Replication in Domestic Rabbits

    PubMed Central

    Salmon, Jérôme; Nonnenmacher, Mathieu; Cazé, Sandrine; Flamant, Patricia; Croissant, Odile; Orth, Gérard; Breitburd, Françoise

    2000-01-01

    We previously reported the partial characterization of two cottontail rabbit papillomavirus (CRPV) subtypes with strikingly divergent E6 and E7 oncoproteins. We report now the complete nucleotide sequences of these subtypes, referred to as CRPVa4 (7,868 nucleotides) and CRPVb (7,867 nucleotides). The CRPVa4 and CRPVb genomes differed at 238 (3%) nucleotide positions, whereas CRPVa4 and the prototype CRPV differed by only 5 nucleotides. The most variable region (7% nucleotide divergence) included the long regulatory region (LRR) and the E6 and E7 genes. A mutation in the stop codon resulted in an 8-amino-acid-longer CRPVb E4 protein, and a nucleotide deletion reduced the coding capacity of the E5 gene from 101 to 25 amino acids. In domestic rabbits homozygous for a specific haplotype of the DRA and DQA genes of the major histocompatibility complex, warts induced by CRPVb DNA or a chimeric genome containing the CRPVb LRR/E6/E7 region showed an early regression, whereas warts induced by CRPVa4 or a chimeric genome containing the CRPVa4 LRR/E6/E7 region persisted and evolved into carcinomas. In contrast, most CRPVa, CRPVb, and chimeric CRPV DNA-induced warts showed no early regression in rabbits homozygous for another DRA-DQA haplotype. Little, if any, viral replication is usually observed in domestic rabbit warts. When warts induced by CRPVa and CRPVb virions and DNA were compared, the number of cells positive for viral DNA or capsid antigens was found to be greater by 1 order of magnitude for specimens induced by CRPVb. Thus, both sequence variation in the LRR/E6/E7 region and the genetic constitution of the host influence the expression of the oncogenic potential of CRPV. Furthermore, intratype variation may overcome to some extent the host restriction of CRPV replication in domestic rabbits. PMID:11044121
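    The divergence percentages in the CRPV record above are ratios of differing positions to genome length. The following sketch reproduces that arithmetic; the function names are illustrative, and treating gap columns as differences is an assumption made here, not something the abstract specifies.

```python
def percent_divergence(differing_positions: int, genome_length: int) -> float:
    """Percentage of positions that differ between two genomes."""
    if genome_length <= 0:
        raise ValueError("genome_length must be positive")
    return 100.0 * differing_positions / genome_length


def count_differences(aligned_a: str, aligned_b: str) -> int:
    """Count differing columns in two pre-aligned sequences of equal length.

    Gap characters ('-') are compared like any other symbol, so a gap
    opposite a nucleotide counts as one difference (an assumption).
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    return sum(1 for x, y in zip(aligned_a, aligned_b) if x != y)


# Figures quoted in the abstract: 238 differing positions, 7,868-nt genome.
crpv_divergence = percent_divergence(238, 7868)
print(f"CRPVa4 vs CRPVb: {crpv_divergence:.1f}% divergence")
```

    Applied to real genomes, `count_differences` would need the output of an alignment program as input, since CRPVa4 and CRPVb differ in length by one nucleotide.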

  14. Breaking the 1000-gene barrier for Mimivirus using ultra-deep genome and transcriptome sequencing.

    PubMed

    Legendre, Matthieu; Santini, Sébastien; Rico, Alain; Abergel, Chantal; Claverie, Jean-Michel

    2011-03-04

    Mimivirus, a giant dsDNA virus infecting Acanthamoeba, is the prototype of the mimiviridae family, the latest addition to the family of the nucleocytoplasmic large DNA viruses (NCLDVs). Its 1.2 Mb-genome was initially predicted to encode 917 genes. A subsequent RNA-Seq analysis precisely mapped many transcript boundaries and identified 75 new genes. We now report a much deeper analysis using the SOLiD™ technology combining RNA-Seq of the Mimivirus transcriptome during the infectious cycle (202.4 Million reads), and a complete genome re-sequencing (45.3 Million reads). This study corrected the genome sequence and identified several single nucleotide polymorphisms. Our results also provided clear evidence of previously overlooked transcription units, including an important RNA polymerase subunit distantly related to Euryarchea homologues. The total Mimivirus gene count is now 1018, 11% greater than the original annotation. This study highlights the huge progress brought about by ultra-deep sequencing for the comprehensive annotation of virus genomes, opening the door to a complete one-nucleotide resolution level description of their transcriptional activity, and to the realistic modeling of the viral genome expression at the ultimate molecular level. This work also illustrates the need to go beyond bioinformatics-only approaches for the annotation of short protein and non-coding genes in viral genomes.

  15. The use of coded PCR primers enables high-throughput sequencing of multiple homolog amplification products by 454 parallel sequencing.

    PubMed

    Binladen, Jonas; Gilbert, M Thomas P; Bollback, Jonathan P; Panitz, Frank; Bendixen, Christian; Nielsen, Rasmus; Willerslev, Eske

    2007-02-14

    The invention of the Genome Sequence 20 DNA Sequencing System (454 parallel sequencing platform) has enabled the rapid and high-volume production of sequence data. Until now, however, individual emulsion PCR (emPCR) reactions and subsequent sequencing runs have been unable to combine template DNA from multiple individuals, as homologous sequences cannot be subsequently assigned to their original sources. We use conventional PCR with 5'-nucleotide tagged primers to generate homologous DNA amplification products from multiple specimens, followed by sequencing through the high-throughput Genome Sequence 20 DNA Sequencing System (GS20, Roche/454 Life Sciences). Each DNA sequence is subsequently traced back to its individual source through 5' tag analysis. We demonstrate that this new approach enables the assignment of virtually all the generated DNA sequences to the correct source once sequencing anomalies are accounted for (misassignment rate <0.4%). Therefore, the method enables accurate sequencing and assignment of homologous DNA sequences from multiple sources in a single high-throughput GS20 run. We observe a bias in the distribution of the differently tagged primers that is dependent on the 5' nucleotide of the tag. In particular, primers 5' labelled with a cytosine are heavily overrepresented among the final sequences, while those 5' labelled with a thymine are strongly underrepresented. A weaker bias also exists with regard to the distribution of the sequences as sorted by the second nucleotide of the dinucleotide tags. As the results are based on a single GS20 run, the general applicability of the approach requires confirmation. However, our experiments demonstrate that 5' primer tagging is a useful method in which the sequencing power of the GS20 can be applied to PCR-based assays of multiple homologous PCR products. The new approach will be of value to a broad range of research areas, such as those of comparative genomics, complete mitochondrial analyses, population genetics, and phylogenetics.
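    The tag-based source assignment described in the record above can be illustrated with a minimal sketch. This is not the authors' pipeline: the function name, the tag-to-specimen table, and the example reads are hypothetical, and real data would also need handling of sequencing errors within the tag.

```python
from collections import defaultdict


def demultiplex(reads, tag_to_sample, tag_len=2):
    """Assign each read to a specimen by its 5' tag.

    reads         -- iterable of read sequences (strings)
    tag_to_sample -- dict mapping a tag (e.g. a dinucleotide) to a specimen name
    tag_len       -- number of 5' nucleotides that form the tag
    Returns (assigned, unassigned): a dict of specimen -> list of reads with
    the tag removed, and a list of reads whose tag was not recognised.
    """
    assigned = defaultdict(list)
    unassigned = []
    for read in reads:
        tag, insert = read[:tag_len], read[tag_len:]
        sample = tag_to_sample.get(tag)
        if sample is None:
            unassigned.append(read)          # unknown tag: keep for inspection
        else:
            assigned[sample].append(insert)  # store the read without its tag
    return dict(assigned), unassigned


# Hypothetical dinucleotide tags and reads, for illustration only.
tags = {"AC": "specimen_1", "CT": "specimen_2"}
example_reads = ["ACGGTTA", "CTGGTTA", "ACTTTT", "GGAAAA"]
by_specimen, unknown = demultiplex(example_reads, tags)
print(by_specimen)
print(unknown)
```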

  16. Predicting protein-binding regions in RNA using nucleotide profiles and compositions.

    PubMed

    Choi, Daesik; Park, Byungkyu; Chae, Hanju; Lee, Wook; Han, Kyungsook

    2017-03-14

    Motivated by the increased amount of data on protein-RNA interactions and the availability of complete genome sequences of several organisms, many computational methods have been proposed to predict binding sites in protein-RNA interactions. However, most computational methods are limited to finding RNA-binding sites in proteins instead of protein-binding sites in RNAs. Predicting protein-binding sites in RNA is more challenging than predicting RNA-binding sites in proteins. Recent computational methods for finding protein-binding sites in RNAs have several drawbacks for practical use. We developed a new support vector machine (SVM) model for predicting protein-binding regions in mRNA sequences. The model uses sequence profiles constructed from log-odds scores of mono- and di-nucleotides and nucleotide compositions. The model was evaluated by standard 10-fold cross validation, leave-one-protein-out (LOPO) cross validation and independent testing. Since actual mRNA sequences have more non-binding regions than protein-binding regions, we tested the model on several datasets with different ratios of protein-binding regions to non-binding regions. The best performance of the model was obtained in a balanced dataset of positive and negative instances. 10-fold cross validation with a balanced dataset achieved a sensitivity of 91.6%, a specificity of 92.4%, an accuracy of 92.0%, a positive predictive value (PPV) of 91.7%, a negative predictive value (NPV) of 92.3% and a Matthews correlation coefficient (MCC) of 0.840. LOPO cross validation showed a lower performance than the 10-fold cross validation, but the performance remains high (87.6% accuracy and 0.752 MCC). In testing the model on independent datasets, it achieved an accuracy of 82.2% and an MCC of 0.656. Testing of our model and other state-of-the-art methods on the same dataset showed that our model is better than the others. Sequence profiles of log-odds scores of mono- and di-nucleotides were much more powerful features than nucleotide compositions in finding protein-binding regions in RNA sequences. However, a slight performance gain was obtained when using the sequence profiles along with nucleotide compositions. These are preliminary results of ongoing research, but demonstrate the potential of our approach as a powerful predictor of protein-binding regions in RNA. The program and supporting data are available at http://bclab.inha.ac.kr/RBPbinding .

  17. Characterization of the complete genome segments from BmCPV-SZ, a novel Bombyx mori cypovirus 1 isolate.

    PubMed

    Cao, Guangli; Meng, Xiangkun; Xue, Renyu; Zhu, Yuexiong; Zhang, Xiaorong; Pan, Zhonghua; Zheng, Xiaojian; Gong, Chengliang

    2012-07-01

    A novel Bombyx mori cypovirus 1 was isolated from infected silkworm larvae and tentatively designated Bombyx mori cypovirus 1 isolate Suzhou (BmCPV-SZ). The complete nucleotide sequences of genomic segments S1-S10 from BmCPV-SZ were determined. All segments possessed a single open reading frame; however, bioinformatic evidence suggested a short overlapping coding sequence in S1. Each BmCPV-SZ segment possessed the conserved terminal sequences AGUAA and GUUAGCC at the 5' and 3' ends, respectively. The conserved A/G at the -3 position in relation to the AUG codon could be found in the BmCPV-SZ genome, and it was postulated that this conserved A/G may be the most important nucleotide for efficient translation initiation in cypoviruses (CPVs). Examination of the putative amino acid sequences encoded by BmCPV-SZ revealed some characteristic motifs. Homology searches showed that viral structural proteins VP1, VP3, and VP4 had localized homologies with proteins of Rice ragged stunt virus, a member of the genus Oryzavirus within the family Reoviridae. A phylogenetic tree based on RNA-dependent RNA polymerase sequences demonstrated that CPV is more closely related to Rice ragged stunt virus and Aedes pseudoscutellaris reovirus than to other members of Reoviridae, suggesting that they may have originated from common ancestors.

  18. Functional Genomics Analysis of Singapore Grouper Iridovirus: Complete Sequence Determination and Proteomic Analysis

    PubMed Central

    Song, Wen Jun; Qin, Qi Wei; Qiu, Jin; Huang, Can Hua; Wang, Fan; Hew, Choy Leong

    2004-01-01

    Here we report the complete genome sequence of Singapore grouper iridovirus (SGIV). Sequencing of the random shotgun and restriction endonuclease genomic libraries showed that the entire SGIV genome consists of 140,131 nucleotide bp. One hundred sixty-two open reading frames (ORFs) from the sense and antisense DNA strands, coding for lengths varying from 41 to 1,268 amino acids, were identified. Computer-assisted analyses of the deduced amino acid sequences revealed that 77 of the ORFs exhibited homologies to known virus genes, 23 of which matched functional iridovirus proteins. Forty-two putative conserved domains or signatures were detected in the National Center for Biotechnology Information CD-Search database and PROSITE database. An assortment of enzyme activities involved in DNA replication, transcription, nucleotide metabolism, cell signaling, etc., were identified. Viruses were cultured on a cell line derived from the embryonated egg of the grouper Epinephelus tauvina, isolated, and purified by sucrose gradient ultracentrifugation. The protein extract from the purified virions was analyzed by polyacrylamide gel electrophoresis followed by in-gel digestion of protein bands. Matrix-assisted laser desorption ionization-time of flight mass spectrometry and database searching led to identification of 26 proteins. Twenty of these represented novel or previously unidentified genes, which were further confirmed by reverse transcription-PCR (RT-PCR) and DNA sequencing of their respective RT-PCR products. PMID:15507645

  19. Isolation and sequence analysis of a canine distemper virus from a raccoon dog in Jilin Province, China.

    PubMed

    Cheng, Yuening; Wang, Jianke; Zhang, Miao; Zhao, Jianjun; Shao, Xiqun; Ma, Zengjun; Zhao, Hang; Lin, Peng; Wu, Hua

    2015-10-01

    Canine distemper virus (CDV) is a major pathogen not only in raccoon dogs but also in a variety of carnivorous animals, including domesticated animals, particularly if they have not been vaccinated. In this study, a wild-type strain of CDV was isolated from lung tissue from a raccoon dog kept at a fur farm in Jilin Province, China. Cytopathic effects typical of CDV infection were observed after three blind passages in Vero cells, yielding a virus titer of 10^4.6 TCID50/mL. Virus identification was carried out by RT-PCR, immunofluorescence, electron microscopy, and genome sequencing. The results showed that the isolated virus, termed the SY strain, corresponded to the Asia-1 genotype of CDV and has a genome of 15,690 nucleotides. This represents the first complete nucleotide sequence of a CDV strain circulating in raccoon dogs in China.

  20. Complete Nucleotide Sequence of Watermelon Chlorotic Stunt Virus Originating from Oman

    PubMed Central

    Khan, Akhtar J.; Akhtar, Sohail; Briddon, Rob W.; Ammara, Um; Al-Matrooshi, Abdulrahman M.; Mansoor, Shahid

    2012-01-01

    Watermelon chlorotic stunt virus (WmCSV) is a bipartite begomovirus (genus Begomovirus, family Geminiviridae) that causes economic losses to cucurbits, particularly watermelon, across the Middle East and North Africa. Recently squash (Cucurbita moschata) grown in an experimental field in Oman was found to display symptoms such as leaf curling, yellowing and stunting, typical of a begomovirus infection. Sequence analysis of the virus isolated from squash showed 97.6–99.9% nucleotide sequence identity to previously described WmCSV isolates for the DNA A component and 93–98% identity for the DNA B component. Agrobacterium-mediated inoculation to Nicotiana benthamiana resulted in the development of symptoms fifteen days post inoculation. This is the first bipartite begomovirus identified in Oman. Overall the Oman isolate showed the highest levels of sequence identity to a WmCSV isolate originating from Iran, which was confirmed by phylogenetic analysis. This suggests that WmCSV present in Oman has been introduced from Iran. The significance of this finding is discussed. PMID:22852046

  1. Complete nucleotide sequence of watermelon chlorotic stunt virus originating from Oman.

    PubMed

    Khan, Akhtar J; Akhtar, Sohail; Briddon, Rob W; Ammara, Um; Al-Matrooshi, Abdulrahman M; Mansoor, Shahid

    2012-07-01

    Watermelon chlorotic stunt virus (WmCSV) is a bipartite begomovirus (genus Begomovirus, family Geminiviridae) that causes economic losses to cucurbits, particularly watermelon, across the Middle East and North Africa. Recently squash (Cucurbita moschata) grown in an experimental field in Oman was found to display symptoms such as leaf curling, yellowing and stunting, typical of a begomovirus infection. Sequence analysis of the virus isolated from squash showed 97.6-99.9% nucleotide sequence identity to previously described WmCSV isolates for the DNA A component and 93-98% identity for the DNA B component. Agrobacterium-mediated inoculation to Nicotiana benthamiana resulted in the development of symptoms fifteen days post inoculation. This is the first bipartite begomovirus identified in Oman. Overall the Oman isolate showed the highest levels of sequence identity to a WmCSV isolate originating from Iran, which was confirmed by phylogenetic analysis. This suggests that WmCSV present in Oman has been introduced from Iran. The significance of this finding is discussed.

  2. The complete sequence and promoter activity of the human A-raf-1 gene (ARAF1)

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Lee, J.E.; Beck, T.W.; Brennscheidt, U.

    1994-03-01

    The raf proto-oncogenes encode cytoplasmic protein serine/threonine kinases, which play a critical role in cell growth and development. One of these, A-raf-1 (human gene symbol, ARAF1), which is predominantly expressed in mouse urogenital tissues, has been mapped to an evolutionarily conserved linkage group composed of ARAF1, SYN1, TIMP, and properdin located at human chromosome Xp11.2. The authors have isolated human genomic DNA clones containing the expressed gene (ARAF1) on the X chromosome and a pseudogene (ARAF2) on chromosome 7p12-q11.21. Analysis of the nucleotide sequence from the ARAF1 genomic clones demonstrated that it consists of 16 exons encoded by minimally 10,776 nucleotides. The major transcriptional start site (+1) was determined by RNase protection and primer extension assays. Promoter activity was confirmed by functional assays using DNA fragments fused to a CAT reporter gene. The ARAF1 minimal promoter, located between nucleotides -59 and +93, has a low G + C content and lacks consensus TATA and Inr sequences but shows sequence similarity at position -1 to the E box that is known to interact with USF and TFII-I transcription factors. 65 refs., 7 figs., 1 tab.

  3. Outbreak of poliomyelitis in Finland in 1984-85 - Re-analysis of viral sequences using the current standard approach.

    PubMed

    Simonen, Marja-Leena; Roivainen, Merja; Iber, Jane; Burns, Cara; Hovi, Tapani

    2010-01-01

    In 1984, a wild type 3 poliovirus (PV3/FIN84) spread all over Finland causing nine cases of paralytic poliomyelitis and one case of aseptic meningitis. The outbreak was ended in 1985 with an intensive vaccination campaign. By limited sequence comparison with previously isolated PV3 strains, closest relatives of PV3/FIN84 were found among strains circulating in the Mediterranean region. Now we wanted to reanalyse the relationships using approaches currently exploited in poliovirus surveillance. Cell lysates of 22 strains isolated during the outbreak and stored frozen were subjected to RT-PCR amplification in three genomic regions without prior subculture. Sequences of the entire VP1 coding region, 150 nucleotides in the VP1-2A junction, most of the 5' non-coding region, partial sequences of the 3D RNA polymerase coding region and partial 3' non-coding region were compared within the outbreak and with sequences available in data banks. In addition, complete nucleotide sequences were obtained for 2 strains isolated from two different cases of disease during the outbreak. The results confirmed the previously described wide intraepidemic variation of the strains, including amino acid substitutions in antigenic sites, as well as the likely Mediterranean region origin of the strains. Simplot and bootscanning analyses of the complete genomes indicated a complicated evolutionary history of the non-capsid coding regions of the genome, suggesting several recombinations with different HEV-C viruses in the past.

  4. Nucleotide sequence of the L1 ribosomal protein gene of Xenopus laevis: remarkable sequence homology among introns.

    PubMed Central

    Loreni, F; Ruberti, I; Bozzoni, I; Pierandrei-Amaldi, P; Amaldi, F

    1985-01-01

    Ribosomal protein L1 is encoded by two genes in Xenopus laevis. The comparison of two cDNA sequences shows that the two L1 gene copies (L1a and L1b) have diverged in many silent sites and very few substitution sites; moreover a small duplication occurred at the very end of the coding region of the L1b gene which thus codes for a product five amino acids longer than that coded by L1a. Quantitatively the divergence between the two L1 genes confirms that a whole genome duplication took place in Xenopus laevis approximately 30 million years ago. A genomic fragment containing one of the two L1 gene copies (L1a), with its nine introns and flanking regions, has been completely sequenced. The 5' end of this gene has been mapped within a 20-pyrimidine stretch as already found for other vertebrate ribosomal protein genes. Four of the nine introns have a 60-nucleotide sequence with 80% homology; within this region some boxes, one of which is 16 nucleotides long, are 100% homologous among the four introns. This feature of L1a gene introns is interesting since we have previously shown that the activity of this gene is regulated at a post-transcriptional level and it involves the block of the normal splicing of some intron sequences. PMID:3841512

  5. Complete Mitochondrial Genome of Echinostoma hortense (Digenea: Echinostomatidae).

    PubMed

    Liu, Ze-Xuan; Zhang, Yan; Liu, Yu-Ting; Chang, Qiao-Cheng; Su, Xin; Fu, Xue; Yue, Dong-Mei; Gao, Yuan; Wang, Chun-Ren

    2016-04-01

    Echinostoma hortense (Digenea: Echinostomatidae) is one of the intestinal flukes with medical importance in humans. However, the mitochondrial (mt) genome of this fluke has not been known yet. The present study has determined the complete mt genome sequences of E. hortense and assessed the phylogenetic relationships with other digenean species for which the complete mt genome sequences are available in GenBank using concatenated amino acid sequences inferred from 12 protein-coding genes. The mt genome of E. hortense contained 12 protein-coding genes, 22 transfer RNA genes, 2 ribosomal RNA genes, and 1 non-coding region. The length of the mt genome of E. hortense was 14,994 bp, which was somewhat smaller than those of other trematode species. Phylogenetic analyses based on concatenated nucleotide sequence datasets for all 12 protein-coding genes using maximum parsimony (MP) method showed that E. hortense and Hypoderaeum conoideum gathered together, and they were closer to each other than to Fasciolidae and other echinostomatid trematodes. The availability of the complete mt genome sequences of E. hortense provides important genetic markers for diagnostics, population genetics, and evolutionary studies of digeneans.

  6. Complete Mitochondrial Genome of Echinostoma hortense (Digenea: Echinostomatidae)

    PubMed Central

    Liu, Ze-Xuan; Zhang, Yan; Liu, Yu-Ting; Chang, Qiao-Cheng; Su, Xin; Fu, Xue; Yue, Dong-Mei; Gao, Yuan; Wang, Chun-Ren

    2016-01-01

    Echinostoma hortense (Digenea: Echinostomatidae) is one of the intestinal flukes with medical importance in humans. However, the mitochondrial (mt) genome of this fluke has not been known yet. The present study has determined the complete mt genome sequences of E. hortense and assessed the phylogenetic relationships with other digenean species for which the complete mt genome sequences are available in GenBank using concatenated amino acid sequences inferred from 12 protein-coding genes. The mt genome of E. hortense contained 12 protein-coding genes, 22 transfer RNA genes, 2 ribosomal RNA genes, and 1 non-coding region. The length of the mt genome of E. hortense was 14,994 bp, which was somewhat smaller than those of other trematode species. Phylogenetic analyses based on concatenated nucleotide sequence datasets for all 12 protein-coding genes using maximum parsimony (MP) method showed that E. hortense and Hypoderaeum conoideum gathered together, and they were closer to each other than to Fasciolidae and other echinostomatid trematodes. The availability of the complete mt genome sequences of E. hortense provides important genetic markers for diagnostics, population genetics, and evolutionary studies of digeneans. PMID:27180575

  7. Characterization of complete genome sequence of the spring viremia of carp virus isolated from common carp (Cyprinus carpio) in China.

    PubMed

    Teng, Y; Liu, H; Lv, J Q; Fan, W H; Zhang, Q Y; Qin, Q W

    2007-01-01

    The complete genome of spring viraemia of carp virus (SVCV) strain A-1 isolated from cultured common carp (Cyprinus carpio) in China was sequenced and characterized. Reverse transcription-polymerase chain reaction (RT-PCR) derived clones were constructed and the DNA was sequenced. It showed that the entire genome of SVCV A-1 consists of 11,100 nucleotide base pairs, the predicted size of the viral RNA of rhabdoviruses. However, the additional insertions in bp 4633-4676 and bp 4684-4724 of SVCV A-1 were different from the other two published SVCV complete genomes. Five open reading frames (ORFs) of SVCV A-1 were identified and further confirmed by RT-PCR and DNA sequencing of their respective RT-PCR products. The 5 structural proteins encoded by the viral RNA were ordered 3'-N-P-M-G-L-5'. This is the first report of a complete genome sequence of SVCV isolated from cultured carp in China. Phylogenetic analysis indicates that SVCV A-1 is closely related to the members of the genus Vesiculovirus, family Rhabdoviridae.

  8. Nucleotide sequences specific to Yersinia pestis and methods for the detection of Yersinia pestis

    DOEpatents

    McCready, Paula M [Tracy, CA]; Radnedge, Lyndsay [San Mateo, CA]; Andersen, Gary L [Berkeley, CA]; Ott, Linda L [Livermore, CA]; Slezak, Thomas R [Livermore, CA]; Kuczmarski, Thomas A [Livermore, CA]; Motin, Vladinir L [League City, TX]

    2009-02-24

    Nucleotide sequences specific to Yersinia pestis that serve as markers or signatures for identification of this bacterium were identified. In addition, forward and reverse primers and hybridization probes derived from these nucleotide sequences that are used in nucleotide detection methods to detect the presence of the bacterium are disclosed.

  9. Nucleotide sequences specific to Brucella and methods for the detection of Brucella

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    McCready, Paula M; Radnedge, Lyndsay; Andersen, Gary L

    Nucleotide sequences specific to Brucella that serve as markers or signatures for identification of this bacterium were identified. In addition, forward and reverse primers and hybridization probes derived from these nucleotide sequences that are used in nucleotide detection methods to detect the presence of the bacterium are disclosed.

  10. GenBank.

    PubMed

    Benson, Dennis A; Karsch-Mizrachi, Ilene; Lipman, David J; Ostell, James; Sayers, Eric W

    2011-01-01

    GenBank® is a comprehensive database that contains publicly available nucleotide sequences for more than 380,000 organisms named at the genus level or lower, obtained primarily through submissions from individual laboratories and batch submissions from large-scale sequencing projects, including whole genome shotgun (WGS) and environmental sampling projects. Most submissions are made using the web-based BankIt or standalone Sequin programs, and accession numbers are assigned by GenBank staff upon receipt. Daily data exchange with the European Nucleotide Archive (ENA) and the DNA Data Bank of Japan (DDBJ) ensures worldwide coverage. GenBank is accessible through the NCBI Entrez retrieval system that integrates data from the major DNA and protein sequence databases along with taxonomy, genome, mapping, protein structure and domain information, and the biomedical journal literature via PubMed. BLAST provides sequence similarity searches of GenBank and other sequence databases. Complete bimonthly releases and daily updates of the GenBank database are available by FTP. To access GenBank and its related retrieval and analysis services, begin at the NCBI Homepage: www.ncbi.nlm.nih.gov.

  11. Characterization of rat calcitonin mRNA.

    PubMed Central

    Amara, S G; David, D N; Rosenfeld, M G; Roos, B A; Evans, R M

    1980-01-01

    A chimeric plasmid containing cDNA complementary to rat calcitonin mRNA has been constructed. Partial sequence analysis shows that the insert contains a nucleotide sequence encoding the complete amino acid sequence of calcitonin. Two basic amino acids precede and three basic amino acids follow the hormone sequence, suggesting that calcitonin is generated by the proteolytic cleavage of a larger precursor in a manner analogous to that of other small polypeptide hormones. The COOH-terminal proline, known to be amidated in the secreted hormone, is followed by a glycine in the precursor. The cloned calcitonin DNA was used to characterize the expression of calcitonin mRNA. Cytoplasmic mRNAs from calcitonin-producing rat medullary thyroid carcinoma lines and from normal rat thyroid glands contain a single species, 1050 nucleotides long, which hybridizes to the cloned calcitonin cDNA. The concentration of calcitonin mRNA sequences is greater in those tumors that produce larger amounts of immunoreactive calcitonin. RNAs from other endocrine tissues, including anterior and neurointermediate lobes of rat pituitary, contain no detectable calcitonin mRNA. PMID:6933496

  12. Nucleotide sequence of the gene for the Mr 32,000 thylakoid membrane protein from Spinacia oleracea and Nicotiana debneyi predicts a totally conserved primary translation product of Mr 38,950

    PubMed Central

    Zurawski, Gerard; Bohnert, Hans J.; Whitfeld, Paul R.; Bottomley, Warwick

    1982-01-01

    The gene for the so-called Mr 32,000 rapidly labeled photosystem II thylakoid membrane protein (here designated psbA) of spinach (Spinacia oleracea) chloroplasts is located on the chloroplast DNA in the large single-copy region immediately adjacent to one of the inverted repeat sequences. In this paper we show that the size of the mRNA for this protein is ≈ 1.25 kilobases and that the direction of transcription is towards the inverted repeat unit. The nucleotide sequence of the gene and its flanking regions is presented. The only large open reading frame in the sequence codes for a protein of Mr 38,950. The nucleotide sequence of psbA from Nicotiana debneyi also has been determined, and comparison of the sequences from the two species shows them to be highly conserved (>95% homology) throughout the entire reading frame. Conservation of the amino acid sequence is absolute, there being no changes in a total of 353 residues. This leads us to conclude that the primary translation product of psbA must be a protein of Mr 38,950. The protein is characterized by the complete absence of lysine residues and is relatively rich in hydrophobic amino acids, which tend to be clustered. Transcription of spinach psbA starts about 86 base pairs before the first ATG codon. Immediately upstream from this point there is a sequence typical of that found in E. coli promoters. An almost identical sequence occurs in the equivalent region of N. debneyi DNA. PMID:16593262

  13. Complete coding regions of the prototypes enterovirus B93 and C95: phylogenetic analyses of the P1 and P3 regions of EV-B and EV-C strains.

    PubMed

    Junttila, N; Lévêque, N; Magnius, L O; Kabue, J P; Muyembe-Tamfum, J J; Maslin, J; Lina, B; Norder, H

    2015-03-01

    Complete coding regions were sequenced for two new enterovirus genomes: EV-B93 previously identified by VP1 sequencing, derived from a child with acute flaccid paralysis in the Democratic Republic of Congo; and EV-C95 from a French soldier with acute gastroenteritis in Djibouti. The EV-B93 P1 had more than 30% nucleotide divergence from other EV-B types, with highest similarity to E-15 and EV-B80. The P1 nucleotide sequence of EV-C95 was most similar, 71%, to CV-A21. Complete coding regions for the new enteroviruses were compared with those of 135 EV-B and 176 EV-C strains representing all types available in GenBank. When strains from the same outbreak or strains isolated during the same year in the same geographical region were excluded, 27 of the 58 EV-B, and 16 of the 23 EV-C types were represented by more than one sequence. However, for EV-B the P3 sequences formed three clades mainly according to origin or time of isolation, irrespective of type, while for EV-C the P3 sequences segregated mainly according to disease manifestation, with most strains causing paralysis, including polioviruses, forming one clade, and strains causing respiratory illness forming another. There was no intermixing of types between these two clades, apart from two EV-C96 strains. The EV-B P3 sequences had lower inter-clade and higher intra-clade variability as compared to the EV-C sequences, which may explain why inter-clade recombinations are more frequent in EV-B. Further analysis of more isolates may shed light on the role of recombinations in the evolution of EV-B in geographical context. © 2014 Wiley Periodicals, Inc.

  14. Ebola Virus Epidemiology and Evolution in Nigeria

    DTIC Science & Technology

    2016-10-04

    the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3. Fly 2012...cases, and full-length Ebola virus (EBOV) genome sequences for 12 of the 20. The detailed contact data permits nearly complete reconstruction of...two methods highlights the strengths of each, and the importance of both contact tracing and genomic sequencing during an outbreak.

  15. Mitochondrial genome sequence of the Tibetan wild ass (Equus kiang).

    PubMed

    Luo, Yongjun; Chen, Yu; Liu, Fuyu; Jiang, Chunhua; Gao, Yuqi

    2011-02-01

    The Tibetan wild ass, or kiang (Equus kiang) is endemic to the cold and hypoxic (4000-7000 m above sea level) climates of the montane and alpine grasslands of the Tibetan Plateau. We report here the complete nucleotide sequence of the E. kiang mitochondrial genome. Our results show that E. kiang mitochondrial DNA is 16,634 bp long, and predicted to encode all the 37 genes that are typical for vertebrates.

  16. A comparative genomics strategy for targeted discovery of single-nucleotide polymorphisms and conserved-noncoding sequences in orphan crops.

    PubMed

    Feltus, F A; Singh, H P; Lohithaswa, H C; Schulze, S R; Silva, T D; Paterson, A H

    2006-04-01

    Completed genome sequences provide templates for the design of genome analysis tools in orphan species lacking sequence information. To demonstrate this principle, we designed 384 PCR primer pairs to conserved exonic regions flanking introns, using Sorghum/Pennisetum expressed sequence tag alignments to the Oryza genome. Conserved-intron scanning primers (CISPs) amplified single-copy loci at 37% to 80% success rates in taxa that sample much of the approximately 50-million years of Poaceae divergence. While the conserved nature of exons fostered cross-taxon amplification, the lesser evolutionary constraints on introns enhanced single-nucleotide polymorphism detection. For example, in eight rice (Oryza sativa) genotypes, polymorphism averaged 12.1 per kb in introns but only 3.6 per kb in exons. Curiously, among 124 CISPs evaluated across Oryza, Sorghum, Pennisetum, Cynodon, Eragrostis, Zea, Triticum, and Hordeum, 23 (18.5%) seemed to be subject to rigid intron size constraints that were independent of per-nucleotide DNA sequence variation. Furthermore, we identified 487 conserved-noncoding sequence motifs in 129 CISP loci. A large CISP set (6,062 primer pairs, amplifying introns from 1,676 genes) designed using an automated pipeline showed generally higher abundance in recombinogenic than in nonrecombinogenic regions of the rice genome, thus providing relatively even distribution along genetic maps. CISPs are an effective means to explore poorly characterized genomes for both DNA polymorphism and noncoding sequence conservation on a genome-wide or candidate gene basis, and also provide anchor points for comparative genomics across a diverse range of species.

  17. A Comparative Genomics Strategy for Targeted Discovery of Single-Nucleotide Polymorphisms and Conserved-Noncoding Sequences in Orphan Crops

    PubMed Central

    Feltus, F.A.; Singh, H.P.; Lohithaswa, H.C.; Schulze, S.R.; Silva, T.D.; Paterson, A.H.

    2006-01-01

    Completed genome sequences provide templates for the design of genome analysis tools in orphan species lacking sequence information. To demonstrate this principle, we designed 384 PCR primer pairs to conserved exonic regions flanking introns, using Sorghum/Pennisetum expressed sequence tag alignments to the Oryza genome. Conserved-intron scanning primers (CISPs) amplified single-copy loci at 37% to 80% success rates in taxa that sample much of the approximately 50-million years of Poaceae divergence. While the conserved nature of exons fostered cross-taxon amplification, the lesser evolutionary constraints on introns enhanced single-nucleotide polymorphism detection. For example, in eight rice (Oryza sativa) genotypes, polymorphism averaged 12.1 per kb in introns but only 3.6 per kb in exons. Curiously, among 124 CISPs evaluated across Oryza, Sorghum, Pennisetum, Cynodon, Eragrostis, Zea, Triticum, and Hordeum, 23 (18.5%) seemed to be subject to rigid intron size constraints that were independent of per-nucleotide DNA sequence variation. Furthermore, we identified 487 conserved-noncoding sequence motifs in 129 CISP loci. A large CISP set (6,062 primer pairs, amplifying introns from 1,676 genes) designed using an automated pipeline showed generally higher abundance in recombinogenic than in nonrecombinogenic regions of the rice genome, thus providing relatively even distribution along genetic maps. CISPs are an effective means to explore poorly characterized genomes for both DNA polymorphism and noncoding sequence conservation on a genome-wide or candidate gene basis, and also provide anchor points for comparative genomics across a diverse range of species. PMID:16607031

  18. Superstatistical model of bacterial DNA architecture

    NASA Astrophysics Data System (ADS)

    Bogachev, Mikhail I.; Markelov, Oleg A.; Kayumov, Airat R.; Bunde, Armin

    2017-02-01

    Understanding the physical principles that govern the complex DNA structural organization as well as its mechanical and thermodynamical properties is essential for the advancement in both life sciences and genetic engineering. Recently we have discovered that the complex DNA organization is explicitly reflected in the arrangement of nucleotides depicted by the universal power law tailed internucleotide interval distribution that is valid for complete genomes of various prokaryotic and eukaryotic organisms. Here we suggest a superstatistical model that represents a long DNA molecule by a series of consecutive ~150 bp DNA segments with the alternation of the local nucleotide composition between segments exhibiting long-range correlations. We show that the superstatistical model and the corresponding DNA generation algorithm explicitly reproduce the laws governing the empirical nucleotide arrangement properties of the DNA sequences for various global GC contents and optimal living temperatures. Finally, we discuss the relevance of our model in terms of the DNA mechanical properties. As an outlook, we focus on finding the DNA sequences that encode a given protein while simultaneously reproducing the nucleotide arrangement laws observed from empirical genomes, that may be of interest in the optimization of genetic engineering of long DNA molecules.

  19. Low-coverage MiSeq next generation sequencing reveals the mitochondrial genome of the Eastern Rock Lobster, Sagmariasus verreauxi.

    PubMed

    Doyle, Stephen R; Griffith, Ian S; Murphy, Nick P; Strugnell, Jan M

    2015-01-01

    The complete mitochondrial genome of the Eastern Rock lobster, Sagmariasus verreauxi, is reported for the first time. Using low-coverage, long read MiSeq next generation sequencing, we constructed and determined the mtDNA genome organization of the 15,470 bp sequence from two isolates from Eastern Tasmania, Australia and Northern New Zealand, and identified 46 polymorphic nucleotides between the two sequences. This genome sequence and its genetic polymorphisms will likely be useful in understanding the distribution and population connectivity of the Eastern Rock Lobster, and in the fisheries management of this commercially important species.

  20. Identification of a novel circular DNA virus in pig feces

    USDA-ARS's Scientific Manuscript database

    Metagenomic analysis of fecal samples collected from a swine with diarrhea detected sequences encoding a replicase (Rep) protein typically found in small circular Rep-encoding ssDNA (CRESS-DNA) viruses. The complete 3,062 nucleotide genome was generated and found to encode two bi-directionally trans...

  1. The Apis mellifera filamentous virus genome

    USDA-ARS's Scientific Manuscript database

    A complete reference genome of the Apis mellifera filamentous virus (AmFV) was determined using Illumina HiSeq sequencing. The AmFV genome is a double-stranded DNA molecule of approximately 498,500 nucleotides with a GC content of 50.8%. It encompasses 251 non-overlapping open reading frames (ORFs), e...

  2. Molecular epidemiology of infectious laryngotracheitis: a review

    USDA-ARS's Scientific Manuscript database

    Falconid herpesvirus type 1 (FHV-1) is the causative agent of falcon inclusion body disease, an acute, highly contagious disease of raptors. The complete nucleotide sequence of the genome of FHV-1 has been determined. The genome is arranged as a D-type genome with large inverted repeats flanking a ...

  3. Bioinformatics: A History of Evolution "In Silico"

    ERIC Educational Resources Information Center

    Ondrej, Vladan; Dvorak, Petr

    2012-01-01

    Bioinformatics, biological databases, and the worldwide use of computers have accelerated biological research in many fields, such as evolutionary biology. Here, we describe a primer of nucleotide sequence management and the construction of a phylogenetic tree with two examples; the two selected are from completely different groups of organisms:…

  4. Complete genome sequence of a novel flavivirus, duck tembusu virus, isolated from ducks and geese in China.

    PubMed

    Yun, Tao; Zhang, Dabing; Ma, Xuejun; Cao, Zhenzhen; Chen, Liu; Ni, Zheng; Ye, Weicheng; Yu, Bin; Hua, Jionggang; Zhang, Yan; Zhang, Cun

    2012-03-01

    Duck tembusu virus (DTMUV) is an emerging agent that causes a severe disease in ducks. We report herein the first complete genome sequences of duck tembusu virus strains YY5, ZJ-407, and GH-2, isolated from Shaoxing ducks, breeder ducks, and geese, respectively, in China. The genomes of YY5, ZJ-407, and GH-2 are all 10,990 nucleotides (nt) in length and encode a putative polyprotein of 3,426 amino acids. It is flanked by a 5' and a 3' noncoding region (NCR) of 94 and 618 nt, respectively. Knowledge of the whole sequence of DTMUV will be useful for further studies of the mechanisms of virus replication and pathogenesis.
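
    The lengths reported in this abstract can be cross-checked with simple arithmetic. The sketch below is illustrative only: the variable names are ours, and it assumes that the two noncoding regions and a single open reading frame account for the whole genome, which the abstract implies but does not state outright.

```python
# Figures taken from the abstract; names are illustrative, not from the paper.
genome_nt = 10_990       # total genome length (nt)
ncr5_nt = 94             # 5' noncoding region (nt)
ncr3_nt = 618            # 3' noncoding region (nt)
polyprotein_aa = 3_426   # reported polyprotein length (amino acids)

# Assumption: everything between the two NCRs is one open reading frame.
coding_nt = genome_nt - ncr5_nt - ncr3_nt
codons, remainder = divmod(coding_nt, 3)

print(f"coding region: {coding_nt} nt = {codons} codons, remainder {remainder}")
print("matches reported polyprotein length:", codons == polyprotein_aa)
```

    Under that assumption the region between the NCRs is a whole number of codons equal to the reported polyprotein length. Whether a stop codon is counted in that figure cannot be determined from the abstract alone.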

  5. Complete genome analysis of a novel umbravirus-polerovirus combination isolated from Ixeridium dentatum.

    PubMed

    Yoo, Ran Hee; Lee, Seung-Won; Lim, Seungmo; Zhao, Fumei; Igori, Davaajargal; Baek, Dasom; Hong, Jin-Sung; Lee, Su-Heon; Moon, Jae Sun

    2017-12-01

    Two novel viruses, isolated in Bonghwa, Republic of Korea, from an Ixeridium dentatum plant with yellowing mottle symptoms, have been provisionally named Ixeridium yellow mottle-associated virus 1 (IxYMaV-1) and Ixeridium yellow mottle-associated virus 2 (IxYMaV-2). IxYMaV-1 has a genome of 6,017 nucleotides sharing a 56.4% sequence identity with that of cucurbit aphid-borne yellows virus (genus Polerovirus). The IxYMaV-2 genome of 4,196 nucleotides has a sequence identity of less than 48.3% with other species classified within the genus Umbravirus. Genome properties and phylogenetic analysis suggested that IxYMaV-1 and -2 are representative isolates of new species classifiable within the genera Polerovirus and Umbravirus, respectively.

  6. First complete genome sequence of infectious laryngotracheitis virus

    PubMed Central

    2011-01-01

    Background: Infectious laryngotracheitis virus (ILTV) is an alphaherpesvirus that causes acute respiratory disease in chickens worldwide. To date, only one complete genomic sequence of ILTV has been reported. This sequence was generated by concatenating partial sequences from six different ILTV strains. Thus, the full genomic sequence of a single (individual) strain of ILTV has not been determined previously. This study aimed to use high throughput sequencing technology to determine the complete genomic sequence of a live attenuated vaccine strain of ILTV. Results: The complete genomic sequence of the Serva vaccine strain of ILTV was determined, annotated and compared to the concatenated ILTV reference sequence. The genome size of the Serva strain was 152,628 bp, with a G + C content of 48%. A total of 80 predicted open reading frames were identified. The Serva strain had 96.5% DNA sequence identity with the concatenated ILTV sequence. Notably, the concatenated ILTV sequence was found to lack four large regions of sequence, including 528 bp and 594 bp of sequence in the UL29 and UL36 genes, respectively, and two copies of a 1,563 bp sequence in the repeat regions. Considerable differences in the size of the predicted translation products of 4 other genes (UL54, UL30, UL37 and UL38) were also identified. More than 530 single-nucleotide polymorphisms (SNPs) were identified. Most SNPs were located within three genomic regions, corresponding to sequence from the SA-2 ILTV vaccine strain in the concatenated ILTV sequence. Conclusions: This is the first complete genomic sequence of an individual ILTV strain. This sequence will facilitate future comparative genomic studies of ILTV by providing an appropriate reference sequence for the sequence analysis of other ILTV strains. PMID:21501528

  7. Nucleotide sequences encoding a thermostable alkaline protease

    DOEpatents

    Wilson, David B.; Lao, Guifang

    1998-01-01

    Nucleotide sequences, derived from a thermophilic actinomycete microorganism, which encode a thermostable alkaline protease are disclosed. Also disclosed are variants of the nucleotide sequences which encode a polypeptide having thermostable alkaline proteolytic activity. Recombinant thermostable alkaline protease or recombinant polypeptide may be obtained by culturing in a medium a host cell genetically engineered to contain and express a nucleotide sequence according to the present invention, and recovering the recombinant thermostable alkaline protease or recombinant polypeptide from the culture medium.

  8. Identification of a Novel Human Papillomavirus, Type HPV199, Isolated from a Nasopharynx and Anal Canal, and Complete Genomic Characterization of Papillomavirus Species Gamma-12

    PubMed Central

    Oštrbenk, Anja; Kocjan, Boštjan J.; Hošnjak, Lea; Li, Jingjing; Deng, Qiuju; Šterbenc, Anja; Poljak, Mario

    2015-01-01

    The novel human papillomavirus type 199 (HPV199) was initially identified in a nasopharyngeal swab sample obtained from a 25-year-old immunocompetent male. The complete genome of HPV199 is 7,184 bp in length with a GC content of 36.5%. Comparative genomic characterization of HPV199 and its closest relatives showed the classical genomic organization of Gammapapillomaviruses (Gamma-PVs). HPV199 has seven major open reading frames (ORFs), encoding five early (E1, E2, E4, E6, and E7) and two late (L1 and L2) proteins, while lacking the E5 ORF. The long control region (LCR) of 513 bp is located between the L1 and E6 ORFs. Phylogenetic analysis additionally confirmed that HPV199 clusters into the Gamma-PV genus, species Gamma-12, which also contains HPV127, HPV132, HPV148, HPV165, and three putative HPV types: KC5, CG2 and CG3. HPV199 is most closely related to HPV127 (nucleotide identity 77%). The complete viral genome sequence of an additional HPV199 isolate was determined from an anal canal swab sample. The two complete HPV199 viral sequences exhibit 99.4% nucleotide identity. To the best of our knowledge, this is the first member of Gamma-PV with complete nucleotide sequences determined from two independent clinical samples. To evaluate the tissue tropism of the novel HPV type, 916 clinical samples were tested using HPV199 type-specific real-time PCR: HPV199 was detected in 2/76 tissue samples of histologically confirmed common warts, 2/108 samples of eyebrow hair follicles, 2/137 anal canal swabs obtained from individuals with clinically evident anal pathology, 4/184 nasopharyngeal swabs and 3/411 cervical swabs obtained from women with normal cervical cytology. Although HPV199 was found in only 1.4% of cutaneous and mucosal samples, it exhibits dual tissue tropism. According to the results of our study and literature data, dual tropism of all Gamma-12 members is highly possible. PMID:26375679
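
    The overall 1.4% detection rate quoted in this abstract follows from the per-sample-type counts. The sketch below simply re-adds them; the dictionary keys are our own shorthand labels, not terms defined by the authors.

```python
# (positives, samples tested) per sample type, as listed in the abstract.
# Keys are shorthand labels chosen for this illustration.
detections = {
    "common warts (tissue)": (2, 76),
    "eyebrow hair follicles": (2, 108),
    "anal canal swabs": (2, 137),
    "nasopharyngeal swabs": (4, 184),
    "cervical swabs": (3, 411),
}

positives = sum(pos for pos, _ in detections.values())
tested = sum(n for _, n in detections.values())
overall_pct = 100 * positives / tested

print(f"{positives}/{tested} = {overall_pct:.1f}%")  # 13/916 = 1.4%
```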

  9. Nucleotide sequences specific to Francisella tularensis and methods for the detection of Francisella tularensis

    DOEpatents

    McCready, Paula M [Tracy, CA; Radnedge, Lyndsay [San Mateo, CA; Andersen, Gary L [Berkeley, CA; Ott, Linda L [Livermore, CA; Slezak, Thomas R [Livermore, CA; Kuczmarski, Thomas A [Livermore, CA; Vitalis, Elizabeth A [Livermore, CA

    2007-02-06

    Described herein is the identification of nucleotide sequences specific to Francisella tularensis that serves as a marker or signature for identification of this bacterium. In addition, forward and reverse primers and hybridization probes derived from these nucleotide sequences that are used in nucleotide detection methods to detect the presence of the bacterium are disclosed.

  10. Nucleotide sequences specific to Francisella tularensis and methods for the detection of Francisella tularensis

    DOEpatents

    McCready, Paula M [Tracy, CA; Radnedge, Lyndsay [San Mateo, CA; Andersen, Gary L [Berkeley, CA; Ott, Linda L [Livermore, CA; Slezak, Thomas R [Livermore, CA; Kuczmarski, Thomas A [Livermore, CA; Vitalis, Elizabeth A [Livermore, CA

    2009-02-24

    Described herein is the identification of nucleotide sequences specific to Francisella tularensis that serves as a marker or signature for identification of this bacterium. In addition, forward and reverse primers and hybridization probes derived from these nucleotide sequences that are used in nucleotide detection methods to detect the presence of the bacterium are disclosed.

  11. Phylogenetic analysis of the complete genome of 11 BKV isolates obtained from allogenic stem cell transplant recipients in Ireland.

    PubMed

    Drew, Richard John; Walsh, Anne; Laoi, Bairbre Ni; Crowley, Brendan

    2012-07-01

    BK polyomavirus (family Polyomaviridae) may cause hemorrhagic cystitis (BKV-HC) in hematopoietic stem cell transplant recipients. Eleven complete BKV genomes (GenBank accession numbers: JN192431-JN192441) were sequenced from urine samples of allogenic hematopoietic stem cell transplant recipients and compared to complete BKV genomes in the published literature. Of the 11 isolates, seven (64%) were subgroup Ib-1, three (27%) isolates belonged to subgroup Ib-2 and a single isolate belonged to subtype III. The analysis of single-nucleotide polymorphisms in this study showed that isolates could be subclassified into subtypes I-IV and subgroups Ib-1 and Ib-2 on the basis of VP1 of the first part of the Large T-antigen (LTag). The non-coding control region (NCCR) of the 11 isolates was also sequenced. These sequences showed that there was consistent sequence homology within subgroups Ib-1 and Ib-2. Two new mutations were described in the isolates, G→C at O(84) in isolate SJH-LG-310, and a deletion at R(2-7) in isolate SJH-LG-309. No known transcription factor is thought to be present at the site of either of these mutations. There were no rearrangements seen in isolates and this may be because the patients were not followed up over time. There were five nucleotide positions at which subgroup Ib-1 isolates differed from subgroup Ib-2 isolates in the NCCR sequence: O(41), P(18), P(31), R(4), and S(18). The mutation O(41) is present in the promoter granulocyte/macrophage stimulating factor gene and the P(31) mutation is present in the NF-1 gene. Copyright © 2012 Wiley Periodicals, Inc.

  12. Sequence analysis and expression of the M1 and M2 matrix protein genes of hirame rhabdovirus (HIRRV)

    USGS Publications Warehouse

    Nishizawa, T.; Kurath, G.; Winton, J.R.

    1997-01-01

    We have cloned and sequenced a 2318 nucleotide region of the genomic RNA of hirame rhabdovirus (HIRRV), an important viral pathogen of Japanese flounder Paralichthys olivaceus. This region comprises approximately two-thirds of the 3' end of the nucleocapsid protein (N) gene and the complete matrix protein (M1 and M2) genes with the associated intergenic regions. The partial N gene sequence was 812 nucleotides in length with an open reading frame (ORF) that encoded the carboxyl-terminal 250 amino acids of the N protein. The M1 and M2 genes were 771 and 700 nucleotides in length, respectively, with ORFs encoding proteins of 227 and 193 amino acids. The M1 gene sequence contained an additional small ORF that could encode a highly basic, arginine-rich protein of 25 amino acids. Comparisons of the N, M1, and M2 gene sequences of HIRRV with the corresponding sequences of the fish rhabdoviruses, infectious hematopoietic necrosis virus (IHNV) or viral hemorrhagic septicemia virus (VHSV) indicated that HIRRV was more closely related to IHNV than to VHSV, but was clearly distinct from either. The putative consensus gene termination sequence for IHNV and VHSV, AGAYAG(A)(7), was present in the N-M1, M1-M2, and M2-G intergenic regions of HIRRV as were the putative transcription initiation sequences YGGCAC and AACA. An Escherichia coli expression system was used to produce recombinant proteins from the M1 and M2 genes of HIRRV. These were the same size as the authentic M1 and M2 proteins and reacted with anti-HIRRV rabbit serum in western blots. These reagents can be used for further study of the fish immune response and to test novel control methods.

  13. Complete genome sequence of Menghai rhabdovirus, a novel mosquito-borne rhabdovirus from China.

    PubMed

    Sun, Qiang; Zhao, Qiumin; An, Xiaoping; Guo, Xiaofang; Zuo, Shuqing; Zhang, Xianglilan; Pei, Guangqian; Liu, Wenli; Cheng, Shi; Wang, Yunfei; Shu, Peng; Mi, Zhiqiang; Huang, Yong; Zhang, Zhiyi; Tong, Yigang; Zhou, Hongning; Zhang, Jiusong

    2017-04-01

    Menghai rhabdovirus (MRV) was isolated from Aedes albopictus in Menghai county of Yunnan Province, China, in August 2010. Whole-genome sequencing of MRV was performed using an Ion PGM™ Sequencer. We found that MRV is a single-stranded, negative-sense RNA virus. The complete genome of MRV has 10,744 nt, with short inverted repeat termini, encoding five typical rhabdovirus proteins (N, P, M, G, and L) and an additional small hypothetical protein. Nucleotide BLAST analysis using the BLASTn method showed that the genome sequence most similar to that of MRV is that of Arboretum virus (NC_025393.1), with a Max score of 322, query coverage of 14%, and 66% identity. Genomic and phylogenetic analyses both demonstrated that MRV should be considered a member of a novel species of the family Rhabdoviridae.

  14. Complete genome sequence analysis identifies a new genotype of brassica yellows virus that infects cabbage and radish in China.

    PubMed

    Zhang, Xiao-Yan; Xiang, Hai-Ying; Zhou, Cui-Ji; Li, Da-Wei; Yu, Jia-Lin; Han, Cheng-Gui

    2014-08-01

    For brassica yellows virus (BrYV), proposed to be a member of a new polerovirus species, two clearly distinct genotypes (BrYV-A and BrYV-B) have been described. In this study, the complete nucleotide sequences of two BrYV isolates from radish and Chinese cabbage were determined. Sequence analysis suggested that these isolates represent a new genotype, referred to here as BrYV-C. The full-length sequences of the two BrYV-C isolates shared 93.4-94.8 % identity with BrYV-A and BrYV-B. Further phylogenetic analysis showed that the BrYV-C isolates formed a subgroup that was distinct from the BrYV-A and BrYV-B isolates based on all of the proteins except P5.

  15. Nucleotide sequences encoding a thermostable alkaline protease

    DOEpatents

    Wilson, D.B.; Lao, G.

    1998-01-06

    Nucleotide sequences, derived from a thermophilic actinomycete microorganism, which encode a thermostable alkaline protease are disclosed. Also disclosed are variants of the nucleotide sequences which encode a polypeptide having thermostable alkaline proteolytic activity. Recombinant thermostable alkaline protease or recombinant polypeptide may be obtained by culturing in a medium a host cell genetically engineered to contain and express a nucleotide sequence according to the present invention, and recovering the recombinant thermostable alkaline protease or recombinant polypeptide from the culture medium.

  16. Denoising DNA deep sequencing data—high-throughput sequencing errors and their correction

    PubMed Central

    Laehnemann, David; Borkhardt, Arndt

    2016-01-01

    Characterizing the errors generated by common high-throughput sequencing platforms and telling true genetic variation from technical artefacts are two interdependent steps, essential to many analyses such as single nucleotide variant calling, haplotype inference, sequence assembly and evolutionary studies. Both random and systematic errors can show a specific occurrence profile for each of the six prominent sequencing platforms surveyed here: 454 pyrosequencing, Complete Genomics DNA nanoball sequencing, Illumina sequencing by synthesis, Ion Torrent semiconductor sequencing, Pacific Biosciences single-molecule real-time sequencing and Oxford Nanopore sequencing. There is a large variety of programs available for error removal in sequencing read data, which differ in the error models and statistical techniques they use, the features of the data they analyse, the parameters they determine from them and the data structures and algorithms they use. We highlight the assumptions they make and for which data types these hold, providing guidance which tools to consider for benchmarking with regard to the data properties. While no benchmarking results are included here, such specific benchmarks would greatly inform tool choices and future software development. The development of stand-alone error correctors, as well as single nucleotide variant and haplotype callers, could also benefit from using more of the knowledge about error profiles and from (re)combining ideas from the existing approaches presented here. PMID:26026159

  17. Complete mitochondrial genomes of the yellow-bellied slider turtle Trachemys scripta scripta and anoxia tolerant red-eared slider Trachemys scripta elegans.

    PubMed

    Yu, Danna; Fang, Xindong; Storey, Kenneth B; Zhang, Yongpu; Zhang, Jiayong

    2016-05-01

    The complete mitochondrial genomes of the yellow-bellied slider (Trachemys scripta scripta) and anoxia tolerant red-eared slider (Trachemys scripta elegans) turtles were sequenced to analyze gene arrangement. The complete mt genomes of T. s. scripta and elegans were circular molecules of 16,791 bp and 16,810 bp in length, respectively, and included an A + 1 frameshift insertion in ND3 and ND4L genes. The AT content of the overall base composition of scripta and elegans was 61.2%. Nucleotide sequence divergence of the mt-genome (p distance) between scripta and elegans was 0.4%. A detailed comparison between the mitochondrial genomes of the two subspecies is shown.
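
    The p distance quoted in this abstract is the proportion of aligned sites at which two sequences differ. A minimal sketch of that calculation follows. The function name, the toy sequences, and the choice to skip gapped columns are our assumptions for illustration; the abstract does not say how gaps were handled or which software was used.

```python
def p_distance(seq_a: str, seq_b: str) -> float:
    """Proportion of compared aligned sites at which two sequences differ."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    differences = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a == "-" or b == "-":
            continue  # assumption: ignore alignment columns containing a gap
        compared += 1
        if a != b:
            differences += 1
    if compared == 0:
        raise ValueError("no ungapped sites to compare")
    return differences / compared


# Toy aligned sequences (invented for illustration, not real turtle mtDNA):
# one mismatch in ten compared sites.
toy_a = "ACGTACGTAC"
toy_b = "ACGTACGTAT"
print(p_distance(toy_a, toy_b))  # 0.1
```

    On this definition, a p distance of 0.4% between two mitogenomes of roughly 16,800 bp would correspond to differences at about 0.004 of the compared sites.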

  18. Mitochondrial genome of the tomato clownfish Amphiprion frenatus (Pomacentridae, Amphiprioninae).

    PubMed

    Ye, Le; Hu, Jing; Wu, Kaichang; Wang, Yu; Li, Jianlong

    2016-01-01

    The complete mitochondrial (mt) genome of the tomato clownfish Amphiprion frenatus was obtained in this study. The circular mtDNA molecule was 16,774 bp in size and the overall nucleotide composition of the H-strand was 29.72% A, 25.81% T, 15.38% G and 29.09% C, with an A + T bias. The complete mitogenome encoded 13 protein-coding genes, 2 rRNAs, 22 tRNAs and a control region (D-loop), with the gene arrangement and translation direction basically identical to other typical vertebrate mitogenomes. The D-loop included termination associated sequence (TAS), central conserved domain (CCD) and conserved sequence block (CSB), and was composed of 6 complete continuity tandem repeat units and an imperfect tandem repeat unit.
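
    The A + T bias mentioned in this abstract can be read directly off the reported H-strand base percentages. The short sketch below only sums the published figures; the variable names are ours.

```python
# H-strand base composition (%) as reported in the abstract.
composition_pct = {"A": 29.72, "T": 25.81, "G": 15.38, "C": 29.09}

at_pct = composition_pct["A"] + composition_pct["T"]
gc_pct = composition_pct["G"] + composition_pct["C"]
total_pct = sum(composition_pct.values())

print(f"A+T = {at_pct:.2f}%, G+C = {gc_pct:.2f}%, total = {total_pct:.2f}%")
# A+T = 55.53%, G+C = 44.47%, total = 100.00%
```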

  19. The complete nucleotide sequences of the 5 genetically distinct plastid genomes of Oenothera, subsection Oenothera: II. A microevolutionary view using bioinformatics and formal genetic data.

    PubMed

    Greiner, Stephan; Wang, Xi; Herrmann, Reinhold G; Rauwolf, Uwe; Mayer, Klaus; Haberer, Georg; Meurer, Jörg

    2008-09-01

    A unique combination of genetic features and a rich stock of information make the flowering plant genus Oenothera an appealing model to explore the molecular basis of speciation processes including nucleus-organelle coevolution. From representative species, we have recently reported complete nucleotide sequences of the 5 basic and genetically distinguishable plastid chromosomes of subsection Oenothera (I-V). In nature, Oenothera plastid genomes are associated with 6 distinct, either homozygous or heterozygous, diploid nuclear genotypes of the 3 basic genomes A, B, or C. Artificially produced plastome-genome combinations that do not occur naturally often display interspecific plastome-genome incompatibility (PGI). In this study, we compare formal genetic data available from all 30 plastome-genome combinations with sequence differences between the plastomes to uncover potential determinants for interspecific PGI. Consistent with an active role in speciation, a remarkable number of genes have high Ka/Ks ratios. Different from the Solanacean cybrid model Atropa/tobacco, RNA editing seems not to be relevant for PGIs in Oenothera. However, predominantly sequence polymorphisms in intergenic segments are proposed as possible sources for PGI. A single locus, the bidirectional promoter region between psbB and clpP, is suggested to contribute to compartmental PGI in the interspecific AB hybrid containing plastome I (AB-I), consistent with its perturbed photosystem II activity.

  20. The complete nucleotide sequence of the genome of Barley yellow dwarf virus-RMV reveals it to be a new Polerovirus distantly related to other yellow dwarf viruses

    PubMed Central

    Krueger, Elizabeth N.; Beckett, Randy J.; Gray, Stewart M.; Miller, W. Allen

    2013-01-01

    The yellow dwarf viruses (YDVs) of the Luteoviridae family represent the most widespread group of cereal viruses worldwide. They include the Barley yellow dwarf viruses (BYDVs) of genus Luteovirus, the Cereal yellow dwarf viruses (CYDVs) and Wheat yellow dwarf virus (WYDV) of genus Polerovirus. All of these viruses are obligately aphid transmitted and phloem-limited. The first described YDVs (initially all called BYDV) were classified by their most efficient vector. One of these viruses, BYDV-RMV, is transmitted most efficiently by the corn leaf aphid, Rhopalosiphum maidis. Here we report the complete 5612 nucleotide sequence of the genomic RNA of a Montana isolate of BYDV-RMV (isolate RMV MTFE87, Genbank accession no. KC921392). The sequence revealed that BYDV-RMV is a polerovirus, but it is quite distantly related to the CYDVs or WYDV, which are very closely related to each other. Nor is BYDV-RMV closely related to any other particular polerovirus. Depending on the gene that is compared, different poleroviruses (none of them a YDV) share the most sequence similarity to BYDV-RMV. Because of its distant relationship to other YDVs, and because it commonly infects maize via its vector, R. maidis, we propose that BYDV-RMV be renamed Maize yellow dwarf virus-RMV (MYDV-RMV). PMID:23888156

  2. A comprehensive bioinformatic analysis of hepatitis D virus full-length genomes.

    PubMed

    Delfino, C M; Cerrudo, C S; Biglione, M; Oubiña, J R; Ghiringhelli, P D; Mathet, V L

    2018-02-06

    In association with hepatitis B virus (HBV), hepatitis delta virus (HDV) is a subviral agent that may promote severe acute and chronic forms of liver disease. Based on the percentage of nucleotide identity of the genome, HDV was initially classified into three genotypes. However, since 2006, the original classification has been further expanded into eight clades/genotypes. The intergenotype divergence may be as high as 35%-40% over the entire RNA genome, whereas sequence heterogeneity among the isolates of a given genotype is <20%; furthermore, HDV recombinants have been clearly demonstrated. The genetic diversity of HDV is related to the geographic origin of the isolates. This study shows the first comprehensive bioinformatic analysis of the complete available set of HDV sequences, using both nucleotide and protein phylogenies (based on an evolutionary model selection, gamma distribution estimation, tree inference and phylogenetic distance estimation), protein composition analysis and comparison (based on the presence of invariant residues, molecular signatures, amino acid frequencies and mono- and di-amino acid compositional distances), as well as amino acid changes in sequence evolution. Taking into account the congruent and consistent results of both nucleotide and amino acid analyses of GenBank available sequences (recorded as of January, 2017), we propose that the eight hepatitis D virus genotypes may be grouped into three large genogroups fully supported by their shared characteristics. © 2018 John Wiley & Sons Ltd.

  3. Landscape of Insertion Polymorphisms in the Human Genome

    PubMed Central

    Onozawa, Masahiro; Goldberg, Liat; Aplan, Peter D.

    2015-01-01

    Nucleotide substitutions, small (<50 bp) insertions or deletions (indels), and large (>50 bp) deletions are well-known causes of genetic variation within the human genome. We recently reported a previously unrecognized form of polymorphic insertions, termed templated sequence insertion polymorphism (TSIP), in which the inserted sequence was templated from a distant genomic region, and was inserted in the genome through reverse transcription of an RNA intermediate. TSIPs can be grouped into two classes based on nucleotide sequence features at the insertion junctions; class 1 TSIPs show target site duplication, polyadenylation, and preference for insertion at a 5′-TTTT/A-3′ sequence, suggesting a LINE-1 based insertion mechanism, whereas class 2 TSIPs show features consistent with repair of a DNA double strand break by nonhomologous end joining. To gain a more complete picture of TSIPs throughout the human population, we evaluated whole-genome sequence from 52 individuals, and identified 171 TSIPs. Most individuals had 25–30 TSIPs, and common (present in >20% of individuals) TSIPs were found in individuals throughout the world, whereas rare TSIPs tended to cluster in specific geographic regions. The number of rare TSIPs was greater than the number of common TSIPs, suggesting that TSIP generation is an ongoing process. Intriguingly, mitochondrial sequences were a frequent template for class 2 insertions, used more commonly than any nuclear chromosome. Similar to single nucleotide polymorphisms and indels, we suspect that these TSIPs may be important for the generation of human diversity and genetic diseases, and can be useful in tracking historical migration of populations. PMID:25745018
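
    The common/rare split described in this abstract can be sketched as a simple frequency cut-off. The sketch below is illustrative only: the function name, the insertion labels and the carrier counts are hypothetical, and treating every insertion at or below the 20% cut-off as "rare" is an assumption, since the abstract defines only "common" explicitly.

```python
def classify_by_frequency(carrier_counts, n_individuals, threshold=0.20):
    """Label each insertion 'common' if it is carried by more than
    `threshold` of the individuals, otherwise 'rare' (assumed complement)."""
    labels = {}
    for name, carriers in carrier_counts.items():
        frequency = carriers / n_individuals
        labels[name] = "common" if frequency > threshold else "rare"
    return labels


if __name__ == "__main__":
    # Hypothetical carrier counts in a cohort of 52 genomes (cohort size from the abstract).
    toy_counts = {"tsip_a": 11, "tsip_b": 10, "tsip_c": 1}
    print(classify_by_frequency(toy_counts, n_individuals=52))
```

    With 52 individuals, 20% corresponds to 10.4 carriers, so under this rule an insertion needs at least 11 carriers to count as common.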

  4. The complete mitochondrial genome and phylogenetic analysis of the giant panda (Ailuropoda melanoleuca).

    PubMed

    Peng, Rui; Zeng, Bo; Meng, Xiuxiang; Yue, Bisong; Zhang, Zhihe; Zou, Fangdong

    2007-08-01

    The complete mitochondrial genome sequence of the giant panda, Ailuropoda melanoleuca, was determined by the long and accurate polymerase chain reaction (LA-PCR) with conserved primers and primer walking sequence methods. The complete mitochondrial DNA is 16,805 nucleotides in length and contains two ribosomal RNA genes, 13 protein-coding genes, 22 transfer RNA genes and one control region. The total length of the 13 protein-coding genes is longer than that of the American black bear, brown bear and polar bear by 3 amino acids at the end of the ND5 gene. The codon usage also followed the typical vertebrate pattern except for an unusual ATT start codon, which initiates the NADH dehydrogenase subunit 5 (ND5) gene. The molecular phylogenetic analysis was performed on the sequences of 12 concatenated heavy-strand encoded protein-coding genes, and suggested that the giant panda is most closely related to bears.

  5. Sequence Analysis of IncA/C and IncI1 Plasmids Isolated from Multidrug-Resistant Salmonella Newport Using Single-Molecule Real-Time Sequencing.

    PubMed

    Cao, Guojie; Allard, Marc; Hoffmann, Maria; Muruvanda, Tim; Luo, Yan; Payne, Justin; Meng, Kevin; Zhao, Shaohua; McDermott, Patrick; Brown, Eric; Meng, Jianghong

    2018-06-01

    Multidrug-resistant (MDR) plasmids play an important role in disseminating antimicrobial resistance genes. To elucidate the antimicrobial resistance gene compositions in A/C incompatibility complex (IncA/C) plasmids carried by animal-derived MDR Salmonella Newport, and to investigate the spread mechanism of IncA/C plasmids, this study characterizes the complete nucleotide sequences of IncA/C plasmids by comparative analysis. Complete nucleotide sequencing of plasmids and chromosomes of six MDR Salmonella Newport strains was performed using PacBio RSII. Open reading frames were assigned using the prokaryotic genome annotation pipeline (PGAP). To understand genomic diversity and evolutionary relationships among Salmonella Newport IncA/C plasmids, we included three complete IncA/C plasmid sequences with similar backbones from Salmonella Newport and Escherichia coli (pSN254, pAM04528, and peH4H) and an additional 200 draft chromosomes. With the exception of canine isolate CVM22462, which contained an additional IncI1 plasmid, each of the six MDR Salmonella Newport strains contained only the IncA/C plasmid. These IncA/C plasmids (including references) ranged in size from 80.1 (pCVM21538) to 176.5 kb (pSN254) and carried various resistance genes. Resistance genes floR, tetA, tetR, strA, strB, sul, and mer were identified in all IncA/C plasmids. Additionally, bla CMY-2 and sugE were present in all IncA/C plasmids except pCVM21538. Plasmid pCVM22462 was capable of being transferred by conjugation. The IncI1 plasmid pCVM22462b in CVM22462 carried bla CMY-2 and sugE. Our data showed that MDR Salmonella Newport strains carrying similar IncA/C plasmids clustered together in the phylogenetic tree built from chromosome sequences, and that the IncA/C plasmids from animal-derived Salmonella Newport contained diverse resistance genes. In the current study, we analyzed genomic diversity and phylogenetic relationships among MDR Salmonella Newport using complete plasmid and chromosome sequences and provided a possible spread mechanism of IncA/C plasmids in Salmonella Newport Lineage II.

  6. [Investigation of a Patient with Pre-vaccine-derived Poliovirus in Shandong Province, China].

    PubMed

    Lin, Xiaojuan; Liu, Yao; Wang, Suting; Zhang, Xiao; Song, Lizhi; Tao, Zexin; Ji, Feng; Xiong, Ping; Xu, Aiqiang

    2015-09-01

    To analyze the genetic characteristics of a polio-I highly variant vaccine recombinant virus in Shandong Province (China) in 2011 and to identify isolates from healthy contacts, two stool specimens from one patient with acute flaccid paralysis (AFP) and 40 stool specimens from his contacts were collected for virus isolation. The complete genome of poliovirus and VP1 coding region of the non-polio enterovirus were sequenced. Homologous comparison and phylogenetic analyses based on VP1 sequences were undertaken among coxsackievirus (CV) B1, CV-B3 isolates, and those in GenBank. One poliovirus (P1/11186), CV-A4 and CV-A8 were isolated from the AFP patient; one CV-A2, Echovirus 3 (E-3), E-12 and E-14, ten CV-B1, and five CV-B3 strains were isolated from his contacts. These results led us to believe that there may be a human enterovirus epidemic in this area, and that surveillance must be enhanced. P1/11186 was a type-1 vaccine-related poliovirus; it combined with type-2 and type-3 polioviruses in 2A and 3A regions, respectively. There were 25 nucleotide mutations with 9 amino-acid alterations in the entire genome. There were 8 nucleotide mutations with 5 amino-acid alterations in the VP1 region compared with the corresponding Sabin strains. Homology analyses suggested that the ten CV-B1 isolates had 97.0%-100% nucleotide and 98.9%-100% amino-acid identities with each other, as well as 92.6%-100% nucleotide and 99.2%-100% amino-acid identities among the five CV-B3 isolates. Phylogenetic analyses on the complete sequences of VP1 among CV-B1 and CV-B3 isolates showed that Shandong strains, together with strains from other provinces in China, had a close relationship and belonged to the same group.

  7. Molecular cloning and sequence analysis of the Anticarsia gemmatalis multicapsid nuclear polyhedrosis virus GP64 glycoprotein.

    PubMed

    Pilloff, Marcela Gabriela; Bilen, Marcos Fabián; Belaich, Mariano Nicolás; Lozano, Mario Enrique; Ghiringhelli, Pablo Daniel

    2003-01-01

    The gp64 locus of Anticarsia gemmatalis multicapsid nucleopolyhedrovirus isolate Santa Fe (AgMNPV-SF) was characterised molecularly in our laboratory. To this end, we have located and cloned an AgMNPV-SF genomic DNA fragment containing the gp64 gene and sequenced the complete gp64 locus. Nucleotide sequence analysis indicated that the AgMNPV gp64 gene consists of a 1500 nucleotide open reading frame (ORF), encoding a protein of 499 amino acids. Of the seven gp64 homologues identified to date, the AgMNPV gp64 ORF shared most sequence similarity with the gp64 gene of Orgyia pseudotsugata MNPV. The GP64 from AgMNPV is the smallest baculoviral envelope glycoprotein found to date, differing in 10 or more residues from the other group I nucleopolyhedroviruses. The biological activity of AgMNPV GP64 protein was assessed by cell fusion assays in UFL-AG-286 cells using the obtained recombinant plasmids. In the upstream and downstream regions, relative to the gp64 ORF, we found different conserved transcriptional and post-transcriptional regulatory elements, respectively.
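
    The ORF and protein lengths quoted in this abstract are consistent with each other if the 1500-nucleotide figure includes the stop codon. A minimal arithmetic sketch follows; the function name is hypothetical, and the stop-codon convention is an assumption inferred from the two numbers rather than stated in the record.

```python
def protein_length_from_orf(orf_nucleotides, includes_stop=True):
    """Return the number of amino acids encoded by an ORF of the given length.

    Assumes the ORF is in frame (a multiple of 3) and, by default, that its
    length counts the terminal stop codon, which encodes no amino acid.
    """
    if orf_nucleotides % 3 != 0:
        raise ValueError("ORF length must be a multiple of 3")
    codons = orf_nucleotides // 3
    return codons - 1 if includes_stop else codons


if __name__ == "__main__":
    # 1500 nt -> 500 codons -> 499 amino acids plus one stop codon.
    print(protein_length_from_orf(1500))
```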

  8. Complete nucleotide sequence of Clematis chlorotic mottle virus, a new member of the family Tombusviridae

    USDA-ARS's Scientific Manuscript database

    Clematis chlorotic mottle virus (ClCMV) is a previously undescribed virus associated with yellow mottling and veining, chlorotic ring spots, line pattern mosaics, and flower distortion and discoloration on ornamental Clematis. The ClCMV genome is 3,880 nt in length with 5 putative open reading frames...

  9. New Hepatitis E Virus Genotype in Camels, the Middle East

    PubMed Central

    Lau, Susanna K.P.; Teng, Jade L.L.; Tsang, Alan K. L.; Joseph, Marina; Wong, Emily Y.M.; Tang, Ying; Sivakumar, Saritha; Xie, Jun; Bai, Ru; Wernery, Renate; Wernery, Ulrich; Yuen, Kwok-Yung

    2014-01-01

    In a molecular epidemiology study of hepatitis E virus (HEV) in dromedaries in Dubai, United Arab Emirates, HEV was detected in fecal samples from 3 camels. Complete genome sequencing of 2 strains showed >20% overall nucleotide difference to known HEVs. Comparative genomic and phylogenetic analyses revealed a previously unrecognized HEV genotype. PMID:24856611

  10. A novel Caulimovirus associated with a complete fruit drop symptom in ‘Bluecrop’ blueberry

    USDA-ARS's Scientific Manuscript database

    Here we describe the nucleotide sequence and genome organization of a novel virus in the family Caulimoviridae from ‘Bluecrop’ blueberry plants that exhibited fruit drop symptoms. The virus is tentatively named Blueberry fruit drop associated virus (BFDaV). Blueberry fruit drop disease (BFDD) was fi...

  11. Complete mitochondrial genome of the stonefly Cryptoperla stilifera Sivec (Plecoptera: Peltoperlidae) and the phylogeny of Polyneopteran insects.

    PubMed

    Wu, Hai-Yan; Ji, Xiao-Yu; Yu, Wei-Wei; Du, Yu-Zhou

    2014-03-10

    We present the complete mitogenome of a stonefly, Cryptoperla stilifera Sivec (Plecoptera; Peltoperlidae). The mitogenome was a circular molecule consisting of 15,633 nucleotides, 37 genes and an A+T-rich region. The C. stilifera mitogenome was similar to the Pteronarcys princeps mitogenome (Plecoptera; Pteronarcyidae). All transfer RNA genes (tRNAs) had typical cloverleaf secondary structures except for trnSer (AGN), where the stem-loop structure of the dihydrouridine (DHU) arm was missing. The A+T-rich region of C. stilifera had two stem-loops and each had two interlink. Three conserved sequence blocks (CSBs) were present in the A+T-rich regions of C. stilifera, Peltoperla tarteri and Peltoperla arcuata. Moreover, many polynucleotide stretches (Poly N, N=A, T and C) were found in the A+T-rich region of C. stilifera. Phylogenetic relationships of Polyneopteran species were constructed based on the nucleotide sequences of 13 protein coding genes (PCGs). Both maximum likelihood (ML) and Bayesian inference (BI) analyses supported Grylloblattodea as the sister group to Plecoptera+Dermaptera and Embiidina and Phasmatodea as sister groups. Copyright © 2014 Elsevier B.V. All rights reserved.

  12. Complete nucleotide sequence and genome organization of a Chinese isolate of Tobacco vein distorting virus.

    PubMed

    Mo, Xiao-han; Chen, Zheng-bin; Chen, Jian-ping

    2010-12-01

    Tobacco bushy top disease is caused by tobacco bushy top virus (TBTV, a member of the genus Umbravirus) which is dependent on tobacco vein-distorting virus (TVDV) to act as a helper virus encapsidating TBTV and enabling its transmission by aphids. Isometric virions from diseased tobacco plants were purified and disease symptoms were reproduced after experimental aphid transmission. The complete genome of TVDV was determined from cloned RT-PCR products derived from viral RNA. It was 5,920 nucleotides (nts) long and had the six major open reading frames (ORFs) typical of a member of the genus Polerovirus. Sequence comparisons showed that it differed significantly from any of the other species in the genus and this was confirmed by phylogenetic analyses of the RdRp and coat protein. SDS-PAGE analysis of purified virions gave two protein bands of about 26 and 59 kDa both of which reacted strongly in Western blots with antiserum produced to prokaryotically expressed TVDV CP showing that the two forms of the TVDV CP were the only protein components of the capsid.

  13. Complete mitochondrial genome of Yangtze River wild common carp (Cyprinus carpio haematopterus) and Russian scattered scale mirror carp (Cyprinus carpio carpio).

    PubMed

    Hu, Guang Fu; Liu, Xiang Jiang; Zou, Gui Wei; Li, Zhong; Liang, Hong-Wei; Hu, Shao-Na

    2016-01-01

    We sequenced the complete mitogenomes of Yangtze River wild common carp (Cyprinus carpio haematopterus) and Russian scattered scale mirror carp (Cyprinus carpio carpio). Comparison of these two mitogenomes revealed that they were remarkably similar in genome length, gene order and content, and AT content. Only 55 nucleotide positions varied across 16,581 nucleotides: 1 in rRNAs, 2 in tRNAs, 9 in the control region and 43 in protein-coding genes. Furthermore, the 43 variable nucleotides in the protein-coding genes of the two strains led to four variable amino acids, which were located in the ND2, ATPase 6, ND5 and ND6 genes, respectively.
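
    A per-region tally like the one reported in this abstract (rRNAs, tRNAs, control region, protein-coding genes) can be sketched as a position-by-position comparison of two aligned sequences. Everything in the sketch is hypothetical: the toy sequences, the region coordinates and the function name are invented for illustration, and the two inputs are assumed to be already aligned to equal length without gaps.

```python
def count_differences_by_region(seq_a, seq_b, regions):
    """Count mismatched positions between two aligned sequences per region.

    `regions` is a list of (name, start, end) tuples using 0-based,
    end-exclusive coordinates on the alignment.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    counts = {}
    for name, start, end in regions:
        pairs = zip(seq_a[start:end], seq_b[start:end])
        counts[name] = sum(1 for a, b in pairs if a != b)
    return counts


if __name__ == "__main__":
    # Toy 12-nt alignment with mismatches at positions 3 and 10.
    strain_1 = "ACGTACGTACGT"
    strain_2 = "ACGAACGTACCT"
    toy_regions = [("rRNA", 0, 4), ("tRNA", 4, 8), ("protein-coding", 8, 12)]
    print(count_differences_by_region(strain_1, strain_2, toy_regions))

    # Totals reported in the abstract: the per-region counts add up to 55.
    reported = {"rRNA": 1, "tRNA": 2, "control region": 9, "protein-coding": 43}
    print(sum(reported.values()), round(100 * 55 / 16581, 2))
```

    The last line also expresses the reported 55 variable positions as a percentage of the 16,581-nucleotide genome.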

  14. Complete nucleotide and derived amino acid sequence of cDNA encoding the mitochondrial uncoupling protein of rat brown adipose tissue: lack of a mitochondrial targeting presequence.

    PubMed Central

    Ridley, R G; Patel, H V; Gerber, G E; Morton, R C; Freeman, K B

    1986-01-01

    A cDNA clone spanning the entire amino acid sequence of the nuclear-encoded uncoupling protein of rat brown adipose tissue mitochondria has been isolated and sequenced. With the exception of the N-terminal methionine the deduced N-terminus of the newly synthesized uncoupling protein is identical to the N-terminal 30 amino acids of the native uncoupling protein as determined by protein sequencing. This proves that the protein contains no N-terminal mitochondrial targeting prepiece and that a targeting region must reside within the amino acid sequence of the mature protein. PMID:3012461

  15. The complete genome sequence of freesia mosaic virus and its relationship to other potyviruses.

    PubMed

    Choi, H I; Lim, H R; Song, Y S; Kim, M J; Choi, S H; Song, Y S; Bae, S C; Ryu, K H

    2010-07-01

    We have completed the genomic sequence of a potyvirus, freesia mosaic virus (FreMV), and compared it to those of other known potyviruses. The full-length genome sequence of FreMV consists of 9,489 nucleotides. It contains one open reading frame, with an AUG start codon and a UAA stop codon, encoding a large polyprotein of 3,077 amino acids, as is typical of potyviruses. The polyprotein of FreMV-Kr gives rise to eleven proteins (P1, HC-pro, P3, PIPO, 6K1, CI, 6K2, VPg, NIa, NIb and CP), and putative cleavage sites of each protein were identified by sequence comparison to those of other known potyviruses. Phylogenetic analysis of the polyprotein revealed that FreMV-Kr was most closely related to PeMoV and was related to BtMV, BaRMV and PeLMV, which belong to the BCMV subgroup. This is the first information on the complete genome structure of FreMV, and the sequence information clearly supports the status of FreMV as a member of a distinct species in the genus Potyvirus.

  16. Molecular systematics of higher primates: genealogical relations and classification.

    PubMed Central

    Miyamoto, M M; Koop, B F; Slightom, J L; Goodman, M; Tennant, M R

    1988-01-01

    We obtained 5' and 3' flanking sequences (5.4 kilobase pairs) from the psi eta-globin gene region of the rhesus macaque (Macaca mulatta) and combined them with available nucleotide data. The completed sequence, representing 10.8 kilobase pairs of contiguous noncoding DNA, was compared to the same orthologous regions available for human (Homo sapiens, as represented by five different alleles), common chimpanzee (Pan troglodytes), gorilla (Gorilla gorilla), and orangutan (Pongo pygmaeus). The nucleotide sequence for Macaca mulatta provided the outgroup perspective needed to evaluate better the relationships of humans and great apes. Pairwise comparisons and parsimony analysis of these orthologues clearly demonstrated (i) that humans and great apes share a high degree of genetic similarity and (ii) that humans, chimpanzees, and gorillas form a natural monophyletic group. These conclusions strongly favor a genealogical classification for higher primates consisting of a single family (Hominidae) with two subfamilies (Homininae for Homo, Pan, and Gorilla and Ponginae for Pongo). PMID:3174657

  17. The Complete Nucleotide Sequence of the Human Immunoglobulin Heavy Chain Variable Region Locus

    PubMed Central

    Matsuda, Fumihiko; Ishii, Kazuo; Bourvagnet, Patrice; Kuma, Kei-ichi; Hayashida, Hidenori; Miyata, Takashi; Honjo, Tasuku

    1998-01-01

    The complete nucleotide sequence of the 957-kb DNA of the human immunoglobulin heavy chain variable (VH) region locus was determined and 43 novel VH segments were identified. The region contains 123 VH segments classifiable into seven different families, of which 79 are pseudogenes. Of the 44 VH segments with an open reading frame, 39 are expressed as heavy chain proteins and 1 as mRNA, while the remaining 4 are not found in immunoglobulin cDNAs. Combinatorial diversity of VH region was calculated to be ∼6,000. Conservation of the promoter and recombination signal sequences was observed to be higher in functional VH segments than in pseudogenes. Phylogenetic analysis of 114 VH segments clearly showed clustering of the VH segments of each family. However, an independent branch in the tree contained a single VH, V4-44.1P, sharing similar levels of homology to human VH families and to those of other vertebrates. Comparison between different copies of homologous units that appear repeatedly across the locus clearly demonstrates that dynamic DNA reorganization of the locus took place at least eight times between 133 and 10 million years ago. One nonimmunoglobulin gene of unknown function was identified in the intergenic region. PMID:9841928

  19. In silico Comparison of 19 Porphyromonas gingivalis Strains in Genomics, Phylogenetics, Phylogenomics and Functional Genomics

    PubMed Central

    Chen, Tsute; Siddiqui, Huma; Olsen, Ingar

    2017-01-01

    Currently, genome sequences of a total of 19 Porphyromonas gingivalis strains are available, including eight completed genomes (strains W83, ATCC 33277, TDC60, HG66, A7436, AJW4, 381, and A7A1-28) and 11 high-coverage draft sequences (JCVI SC001, F0185, F0566, F0568, F0569, F0570, SJD2, W4087, W50, Ando, and MP4-504) that are assembled into fewer than 300 contigs. The objective was to compare these genomes at both nucleotide and protein sequence levels in order to understand their phylogenetic and functional relatedness. Four copies of 16S rRNA gene sequences were identified in each of the eight complete genomes and one in the other 11 unfinished genomes. These 43 16S rRNA sequences represent only 24 unique sequences and the derived phylogenetic tree suggests a possible evolutionary history for these strains. Phylogenomic comparison based on shared proteins and whole genome nucleotide sequences consistently showed two groups with closely related members: one consisted of ATCC 33277, 381, and HG66, another of W83, W50, and A7436. At least 1,037 core/shared proteins were identified in the 19 P. gingivalis genomes based on the most stringent detecting parameters. Comparative functional genomics based on genome-wide comparisons between NCBI and RAST annotations, as well as additional approaches, revealed functions that are unique or missing in individual P. gingivalis strains, or species-specific in all P. gingivalis strains, when compared to a neighboring species P. asaccharolytica. All the comparative results of this study are available online for download at ftp://www.homd.org/publication_data/20160425/. PMID:28261563

  20. Complexity: an internet resource for analysis of DNA sequence complexity

    PubMed Central

    Orlov, Y. L.; Potapov, V. N.

    2004-01-01

    The search for DNA regions with low complexity is one of the pivotal tasks of modern structural analysis of complete genomes. The low complexity may be preconditioned by strong inequality in nucleotide content (biased composition), by tandem or dispersed repeats or by palindrome-hairpin structures, as well as by a combination of all these factors. Several numerical measures of textual complexity, including combinatorial and linguistic ones, together with complexity estimation using a modified Lempel–Ziv algorithm, have been implemented in a software tool called ‘Complexity’ (http://wwwmgs.bionet.nsc.ru/mgs/programs/low_complexity/). The software enables a user to search for low-complexity regions in long sequences, e.g. complete bacterial genomes or eukaryotic chromosomes. In addition, it estimates the complexity of groups of aligned sequences. PMID:15215465
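
    The idea of scanning a long sequence for low-complexity windows can be sketched with a plain LZ78-style phrase count. This is not the modified Lempel–Ziv measure implemented in the 'Complexity' tool, whose details the abstract does not give; it is a simpler stand-in chosen for illustration, and the function names, window size and toy sequence are all hypothetical.

```python
def lz78_phrase_count(seq):
    """Count phrases in an LZ78-style parse of `seq`.

    Each phrase is the shortest prefix of the remaining text that has not
    already been recorded as a phrase. Repetitive text yields fewer phrases.
    """
    phrases = set()
    current = ""
    count = 0
    for ch in seq:
        current += ch
        if current not in phrases:
            phrases.add(current)
            count += 1
            current = ""
    if current:
        count += 1  # trailing phrase that repeats an earlier one
    return count


def window_complexity(seq, window, step):
    """Return (start, phrase_count) for each window along the sequence."""
    return [
        (start, lz78_phrase_count(seq[start:start + window]))
        for start in range(0, len(seq) - window + 1, step)
    ]


if __name__ == "__main__":
    # Toy sequence: a homopolymer run followed by a mixed-composition stretch.
    toy = "A" * 8 + "ACGTACGT"
    for start, phrases in window_complexity(toy, window=8, step=8):
        print(start, phrases)
```

    In this toy case the homopolymer window parses into 4 phrases and the mixed window into 6, so the lower count flags the lower-complexity region. Real use would need longer windows and a normalisation for window length.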

  1. Molecular characterization and expression of the M6 gene of grass carp hemorrhage virus (GCHV), an aquareovirus.

    PubMed

    Qiu, T; Lu, R H; Zhang, J; Zhu, Z Y

    2001-07-01

    The complete nucleotide sequence of M6 gene of grass carp hemorrhage virus (GCHV) was determined. It is 2039 nucleotides in length and contains a single large open reading frame that could encode a protein of 648 amino acids with predicted molecular mass of 68.7 kDa. Amino acid sequence comparison revealed that the protein encoded by GCHV M6 is closely related to the protein mu1 of mammalian reovirus. The M6 gene, encoding the major outer-capsid protein, was expressed using the pET fusion protein vector in Escherichia coli and detected by Western blotting using chicken anti-GCHV immunoglobulin (IgY). The result indicates that the protein encoded by M6 may share a putative Asn-42-Pro-43 proteolytic cleavage site with mu1.

  2. Composition for nucleic acid sequencing

    DOEpatents

    Korlach, Jonas [Ithaca, NY]; Webb, Watt W. [Ithaca, NY]; Levene, Michael [Ithaca, NY]; Turner, Stephen [Ithaca, NY]; Craighead, Harold G. [Ithaca, NY]; Foquet, Mathieu [Ithaca, NY]

    2008-08-26

    The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.
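
    The repeated cycle described in this abstract (offer labelled nucleotide analogs, let the polymerase add the complementary one, identify its label, repeat) can be mimicked in a toy simulation. The sketch is a loose illustration and not the patented method: the label names are invented, the incorporation step is error-free, and strand orientation is ignored, so the template is simply read position by position.

```python
# Watson-Crick pairing for DNA bases.
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}

# Hypothetical label assigned to each distinguishable nucleotide analog.
LABEL_OF = {"A": "red", "C": "green", "G": "blue", "T": "yellow"}
BASE_OF = {label: base for base, label in LABEL_OF.items()}


def sequence_by_synthesis(template):
    """Simulate real-time sequencing of `template`.

    Returns (synthesized_strand, deduced_template). The strand is built one
    base per cycle; the template is deduced from the observed labels only.
    """
    observed_labels = []
    for template_base in template:
        # The polymerase incorporates the analog complementary to the template base.
        incorporated = COMPLEMENT[template_base]
        # The detector sees only the label of the incorporated analog.
        observed_labels.append(LABEL_OF[incorporated])
    synthesized = "".join(BASE_OF[label] for label in observed_labels)
    deduced_template = "".join(COMPLEMENT[base] for base in synthesized)
    return synthesized, deduced_template


if __name__ == "__main__":
    print(sequence_by_synthesis("ATGC"))
```

    The point of the toy is that the temporal order of observed labels is enough to reconstruct the template, which is the principle the abstract states.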

  3. Method for sequencing nucleic acid molecules

    DOEpatents

    Korlach, Jonas; Webb, Watt W.; Levene, Michael; Turner, Stephen; Craighead, Harold G.; Foquet, Mathieu

    2006-06-06

    The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.

  4. Method for sequencing nucleic acid molecules

    DOEpatents

    Korlach, Jonas; Webb, Watt W.; Levene, Michael; Turner, Stephen; Craighead, Harold G.; Foquet, Mathieu

    2006-05-30

    The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.

  5. Characterization and mapping of cDNA encoding aspartate aminotransferase in rice, Oryza sativa L.

    PubMed

    Song, J; Yamamoto, K; Shomura, A; Yano, M; Minobe, Y; Sasaki, T

    1996-10-31

    Fifteen cDNA clones, putatively identified as encoding aspartate aminotransferase (AST, EC 2.6.1.1), were isolated and partially sequenced. Together with six previously isolated clones putatively identified to encode ASTs (Sasaki et al. 1994, Plant Journal 6, 615-624), their sequences were characterized and classified into 4 cDNA species. Two of the isolated clones, C60213 and C2079, were full-length cDNAs, and their complete nucleotide sequences were determined. C60213 was 1612 bp long and its deduced amino acid sequence showed 88% homology with that of Panicum miliaceum L. mitochondrial AST. The C60213-encoded protein had an N-terminal amino acid sequence that was characteristic of a mitochondrial transit peptide. On the other hand, C2079 was 1546 bp long and had 91% amino acid sequence homology with P. miliaceum L. cytosolic AST but lacked the transit peptide sequence. The homologies of nucleotide sequences and deduced amino acid sequences of C2079 and C60213 were 54% and 52%, respectively. C2079 and C60213 were mapped on chromosomes 1 and 6, respectively, by restriction fragment length polymorphism linkage analysis. Northern blot analysis using C2079 as a probe revealed much higher transcript levels in callus and root than in green and etiolated shoots, suggesting tissue-specific variations of AST gene expression.

  6. Nucleotide sequence and phylogenetic analysis of Cucurbit yellow stunting disorder virus RNA 2.

    PubMed

    Livieratos, Ioannis C; Coutts, Robert H A

    2002-06-01

    The complete nucleotide sequence of Cucurbit yellow stunting disorder virus (CYSDV) RNA 2, a whitefly (Bemisia tabaci)-transmitted closterovirus with a bi-partite genome, is reported. CYSDV RNA 2 is 7,281 nucleotides long and contains the closterovirus hallmark gene array with a similar arrangement to the prototype member of the genus Crinivirus, Lettuce infectious yellows virus (LIYV). CYSDV RNA 2 contains open reading frames (ORFs) potentially encoding, in the 5' to 3' direction, proteins of 5 kDa (ORF 1; hydrophobic protein), 62 kDa (ORF 2; heat shock protein 70 homolog, HSP70h), 59 kDa (ORF 3; protein of unknown function), 9 kDa (ORF 4; protein of unknown function), 28.5 kDa (ORF 5; coat protein, CP), 53 kDa (ORF 6; coat protein minor, CPm), and 26.5 kDa (ORF 7; protein of unknown function). Pairwise comparisons of CYSDV RNA 2-encoded proteins (HSP70h, p59 and CPm) among the closteroviruses showed that CYSDV is closely related to LIYV. Phylogenetic analysis based on the amino acid sequence of HSP70h indicated that CYSDV clusters with other members of the genus Crinivirus and is related to Little cherry virus-1 (LChV-1), but is distinct from the aphid- or mealybug-transmitted closteroviruses.

  7. Genetic diversity of ORF3 and spike genes of porcine epidemic diarrhea virus in Thailand.

    PubMed

    Temeeyasen, Gun; Srijangwad, Anchalee; Tripipat, Thitima; Tipsombatboon, Pavita; Piriyapongsa, Jittima; Phoolcharoen, Waranyoo; Chuanasa, Taksina; Tantituvanont, Angkana; Nilubol, Dachrit

    2014-01-01

    Porcine epidemic diarrhea virus (PEDV) has become endemic in the Thai swine industry, causing economic losses and repeated outbreaks since its first emergence in 2007. In the present study, 69 Thai PEDV isolates were obtained from 50 swine herds across Thailand during the period 2008-2012. Both partial and complete nucleotide sequences of the spike (S) glycoprotein and the nucleotide sequences of ORF3 genes were determined to investigate the genetic diversity and molecular epidemiology of Thai PEDV. Based on the analysis of the partial S glycoprotein genes, the Thai PEDV isolates were clustered into 2 groups related to Korean and Chinese field isolates. The results for the complete spike genes, however, demonstrated that both groups were grouped in the same cluster. Interestingly, both groups of Thai PEDV isolates had a 4-aa (GENQ) insertion between positions 55 and 56, a 1-aa insertion between positions 135 and 136, and a 2-aa deletion between positions 155 and 156, making them identical to the Korean KNU series and isolates responsible for outbreaks in China in recent years. In addition to the complete S sequences, the ORF3 gene analyses suggested that the isolates responsible for outbreaks in Thailand are not vaccine related. The results of this study suggest that the PEDV isolates responsible for outbreaks in Thailand since its emergence represent a variant of PEDV that was previously reported in China and Korea. Copyright © 2013 Elsevier B.V. All rights reserved.

  8. Labeled nucleotide phosphate (NP) probes

    DOEpatents

    Korlach, Jonas [Ithaca, NY]; Webb, Watt W [Ithaca, NY]; Levene, Michael [Ithaca, NY]; Turner, Stephen [Ithaca, NY]; Craighead, Harold G [Ithaca, NY]; Foquet, Mathieu [Ithaca, NY]

    2009-02-03

    The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.

  9. The primary structure of the Saccharomyces cerevisiae gene for 3-phosphoglycerate kinase.

    PubMed Central

    Hitzeman, R A; Hagie, F E; Hayflick, J S; Chen, C Y; Seeburg, P H; Derynck, R

    1982-01-01

    The DNA sequence of the gene for the yeast glycolytic enzyme, 3-phosphoglycerate kinase (PGK), has been obtained by sequencing part of a 3.1 kbp HindIII fragment obtained from the yeast genome. The structural gene sequence corresponds to a reading frame of 1251 bp coding for 416 amino acids with no intervening DNA sequences. The amino acid sequence is approximately 65 percent homologous with human and horse PGK protein sequences and is in general agreement with the published protein sequence for yeast PGK. As for other highly expressed structural genes in yeast, the coding sequence is highly codon biased with 95 percent of the amino acids coded for by a select 25 codons (out of 61 possible). Besides structural DNA sequence, 291 bp of 5'-flanking sequence and 286 bp of 3'-flanking sequence were determined. Transcription starts 36 nucleotides upstream from the translational start and stops 86-93 nucleotides downstream from the translational stop. These results suggest a non-polyadenylated mRNA length of 1373 to 1380 nucleotides, which is consistent with the observed length of 1500 nucleotides for polyadenylated PGK mRNA. A sequence TATATATAAA is found at 145 nucleotides upstream from the translational start. This sequence resembles the TATAAA box that is possibly associated with RNA polymerase II binding. PMID:6296791
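
The length figures in this abstract can be cross-checked with simple arithmetic. The sketch below uses only the numbers quoted above; the function names are hypothetical, and it assumes (the abstract does not say so explicitly) that the 1251 bp reading frame includes one stop codon.

```python
# Figures quoted in the abstract.
ORF_BP = 1251             # reading frame length in base pairs
AMINO_ACIDS = 416         # reported protein length
UPSTREAM_NT = 36          # transcription start, upstream of translational start
DOWNSTREAM_NT = (86, 93)  # transcription stop, downstream of translational stop


def codon_count(orf_bp):
    """Number of codons in a reading frame whose length is a multiple of 3."""
    if orf_bp % 3 != 0:
        raise ValueError("reading frame length is not a multiple of 3")
    return orf_bp // 3


def transcript_length_range(orf_bp, upstream, downstream_range):
    """Shortest and longest non-polyadenylated transcript lengths."""
    low, high = downstream_range
    return (upstream + orf_bp + low, upstream + orf_bp + high)


codons = codon_count(ORF_BP)
# Assumption: one of the codons is the stop codon, leaving 416 sense codons.
sense_codons = codons - 1
print(codons, sense_codons)
print(transcript_length_range(ORF_BP, UPSTREAM_NT, DOWNSTREAM_NT))
```

Under that assumption the codon count matches the 416 reported amino acids, and the transcript range reproduces the 1373 to 1380 nucleotides given in the abstract.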

  10. First complete genome sequences of genogroup V, genotype 3 porcine sapoviruses: common 5'-terminal genomic feature of sapoviruses.

    PubMed

    Oka, Tomoichiro; Doan, Yen Hai; Shimoike, Takashi; Haga, Kei; Takizawa, Takenori

    2017-12-01

    Sapoviruses (SaVs) are enteric viruses and have been detected in various mammals. They are divided into multiple genogroups and genotypes based on the entire major capsid protein (VP1) encoding region sequences. In this study, we determined the first complete genome sequences of two genogroup V, genotype 3 (GV.3) SaV strains detected from swine fecal samples, in combination with Illumina MiSeq sequencing of the libraries prepared from viral RNA and PCR products. The lengths of the viral genome (7494 nucleotides [nt] excluding polyA tail) and short 5'-untranslated region (14 nt) as well as two predicted open reading frames are similar to those of other SaVs. The amino acid differences between the two porcine SaVs are most frequent in the central region of the VP1-encoding region. A stem-loop structure which was predicted in the first 41 nt of the 5'-terminal region of GV.3 SaVs and the other available complete genome sequences of SaVs may have a critical role in viral genome replication. Our study provides complete genome sequences of rarely reported GV.3 SaV strains and highlights the common 5'-terminal genomic feature of SaVs detected from different mammalian species.

  11. Characterization of the first complete genome sequence of an Impatiens necrotic spot orthotospovirus isolate from the United States and worldwide phylogenetic analyses of INSV isolates.

    PubMed

    Zhao, Kaixi; Margaria, Paolo; Rosa, Cristina

    2018-05-10

    Impatiens necrotic spot orthotospovirus (INSV) can impact economically important ornamental plants and vegetables worldwide. Characterization studies on INSV are limited. For most INSV isolates, there are no complete genome sequences available. This lack of genomic information has a negative impact on the understanding of the INSV genetic diversity and evolution. Here we report the first complete nucleotide sequence of a US INSV isolate. INSV-UP01 was isolated from an impatiens in Pennsylvania, US. RT-PCR was used to clone its full-length genome and Vector NTI to assemble overlapping sequences. Phylogenetic trees were constructed by using MEGA7 software to show the phylogenetic relationships with other available INSV sequences worldwide. This US isolate has genome and biological features typical of INSV species and clusters in the Western Hemisphere clade, but its origin appears to be recent. Furthermore, INSV-UP01 might have been involved in a recombination event with an Italian isolate belonging to the Asian clade. Our analyses support that INSV isolates infect a broad plant-host range, group by geographic origin and not by host, and are subject to frequent recombination events. These results justify the need to generate and analyze complete genome sequences of orthotospoviruses in general and INSV in particular.

  12. The complete genome sequence of a south Indian isolate of Rice tungro spherical virus reveals evidence of genetic recombination between distinct isolates.

    PubMed

    Sailaja, B; Anjum, Najreen; Patil, Yogesh K; Agarwal, Surekha; Malathi, P; Krishnaveni, D; Balachandran, S M; Viraktamath, B C; Mangrauthia, Satendra K

    2013-12-01

    In this study, the complete genome of a south Indian isolate of Rice tungro spherical virus (RTSV) from Andhra Pradesh (AP) was sequenced, and the predicted amino acid sequence was analysed. The RTSV RNA genome consists of 12,171 nt without the poly(A) tail, encoding a putative typical polyprotein of 3,470 amino acids. Furthermore, cleavage sites and sequence motifs of the polyprotein were predicted. Multiple alignment with other RTSV isolates showed a nucleotide sequence identity of 95% to east Indian isolates and 90% to Philippines isolates. A phylogenetic tree based on the complete genome sequence showed that Indian isolates clustered together, while the Vt6 and PhilA isolates from the Philippines formed two separate clusters. Twelve recombination events were detected in the RNA genome of RTSV using the Recombination Detection Program version 3. Recombination analysis suggested a significant role for the 5' end and central region of the genome in virus evolution. Further, the AP and Odisha isolates appeared to be important RTSV isolates involved in the diversification of this virus in India through recombination. The addition of the complete genome of the first south Indian isolate provided an opportunity to establish the molecular evolution of RTSV through recombination analysis and phylogenetic relationships.

  13. 37 CFR 5.31-5.33 - [Reserved]

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ... from abandonment 1.135 Amino Acid Sequences. (See Nucleotide and/or Amino Acid Sequences) Appeal to... Appeals and Interference 41.47 Of rejection of an application 1.104(a) Nucleotide and/or Amino Acid...) Symbols for nucleotide and/or amino acid sequence data 1.822 T Tables in patent applications 1.58 Terminal...

  14. Complete genome sequences of two strains of Treponema pallidum subsp. pertenue from Ghana, Africa: Identical genome sequences in samples isolated more than 7 years apart.

    PubMed

    Strouhal, Michal; Mikalová, Lenka; Havlíčková, Pavla; Tenti, Paolo; Čejková, Darina; Rychlík, Ivan; Bruisten, Sylvia; Šmajs, David

    2017-09-01

    Treponema pallidum subsp. pertenue (TPE) is the causative agent of yaws, a multi-stage disease, endemic in tropical regions of Africa, Asia, Oceania, and South America. To date, four TPE strains have been completely sequenced including three TPE strains of human origin (Samoa D, CDC-2, and Gauthier) and one TPE strain (Fribourg-Blanc) isolated from a baboon. All TPE strains are highly similar to T. pallidum subsp. pallidum (TPA) strains. The mutation rate in syphilis and related treponemes has not been experimentally determined yet. Complete genomes of two TPE strains, CDC 2575 and Ghana-051, that infected patients in Ghana and were isolated in 1980 and 1988, respectively, were sequenced and analyzed. Both strains had identical consensus genome nucleotide sequences raising the question whether TPE CDC 2575 and Ghana-051 represent two different strains. Several lines of evidence support the fact that both strains represent independent samples including regions showing intrastrain heterogeneity (13 and 5 intrastrain heterogeneous sites in TPE Ghana-051 and TPE CDC 2575, respectively). Four of these heterogeneous sites were found in both genomes but the frequency of alternative alleles differed. The identical consensus genome sequences were used to estimate the upper limit of the yaws treponeme evolution rate, which was 4.1 x 10^-10 nucleotide changes per site per generation. The estimated upper limit for the mutation rate of TPE was slightly lower than the mutation rate of E. coli, which was determined during a long-term experiment. Given the known diversity between TPA and TPE genomes and the assumption that both TPA and TPE have a similar mutation rate, the most recent common ancestor of syphilis and yaws treponemes appears to be more than ten thousand years old and likely even older.

  15. Geographically Distinct and Domain-Specific Sequence Variations in the Alleles of Rice Blast Resistance Gene Pib

    PubMed Central

    Vasudevan, Kumar; Vera Cruz, Casiana M.; Gruissem, Wilhelm; Bhullar, Navreet K.

    2016-01-01

    Rice blast is caused by Magnaporthe oryzae, which is the most destructive fungal pathogen affecting rice growing regions worldwide. The rice blast resistance gene Pib confers broad-spectrum resistance against Southeast Asian M. oryzae races. We investigated the allelic diversity of Pib in rice germplasm originating from 12 major rice growing countries. Twenty-five new Pib alleles were identified that have unique single nucleotide polymorphisms (SNPs), insertions and/or deletions, in addition to the polymorphic nucleotides that are shared between the different alleles. These partially or completely shared polymorphic nucleotides indicate frequent sequence exchange events between the Pib alleles. In some of the new Pib alleles, nucleotide diversity is high in the LRR domain, whereas, in others it is distributed among the NB-ARC and LRR domains. Most of the polymorphic amino acids in LRR and NB-ARC2 domains are predicted as solvent-exposed. Several of the alleles and the unique SNPs are country specific, suggesting a diversifying selection of alleles in various geographical locations in response to the locally prevalent M. oryzae population. Together, the new Pib alleles are an important genetic resource for rice blast resistance breeding programs and provide new information on rice-M. oryzae interactions at the molecular level. PMID:27446145

  16. WEB-server for search of a periodicity in amino acid and nucleotide sequences

    NASA Astrophysics Data System (ADS)

    Frenkel, F. E.; Skryabin, K. G.; Korotkov, E. V.

    2017-12-01

    A new web server (http://victoria.biengi.ac.ru/splinter/login.php) was designed and developed to search for periodicity in nucleotide and amino acid sequences. The web server operation is based upon a new mathematical method of searching for multiple alignments, which relies on the optimization of position weight matrices and on two-dimensional dynamic programming. This approach allows the construction of multiple alignments of weakly similar amino acid and nucleotide sequences that have accumulated more than 1.5 substitutions per amino acid or nucleotide, without performing pairwise comparisons of the sequences. The article examines the principles of the web server operation and two examples of studying amino acid and nucleotide sequences, as well as information that could be obtained using the web server.

  17. The complete mitochondrial genome of domestic sheep, Ovis aries.

    PubMed

    Hu, Xiao-di; Gao, Li-zhi

    2016-01-01

    In this study, we report a complete mitochondrial (mt) genome sequence of the Texel ewe, Ovis aries. The total genome is 16,615 bp in length and its overall base composition was estimated to be 33.68% for A, 27.36% for T, 25.86% for C, and 13.10% for G, indicating an AT-rich (61.04%) feature in the O. aries mitogenome. It contains a total of 13 protein-coding genes, 22 transfer RNA genes, 2 ribosomal RNA genes and a control region (D-loop region). Comparisons with other publicly available sheep mitogenomes revealed considerable nucleotide diversity. This complete mitogenome sequence adds useful genomic information for further studies on sheep evolution and domestication that will enhance germplasm conservation and breeding programs of O. aries.
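
The base composition and AT content reported in this abstract are simple proportions, and the calculation can be sketched as follows. The function name and the ten-base toy sequence are hypothetical; the final line only re-adds the A and T percentages quoted above.

```python
from collections import Counter


def base_composition(sequence):
    """Return the percentage of A, T, C and G in a nucleotide sequence."""
    sequence = sequence.upper()
    if not sequence:
        raise ValueError("empty sequence")
    counts = Counter(sequence)
    return {base: 100.0 * counts[base] / len(sequence) for base in "ATCG"}


# Hypothetical toy sequence, not taken from the sheep mitogenome.
toy = "AAAATTTCCG"
composition = base_composition(toy)
at_content = composition["A"] + composition["T"]
print(composition, at_content)

# Consistency check on the published figures: A% + T% should equal the AT%.
reported_at = round(33.68 + 27.36, 2)
print(reported_at)
```

The reported A and T percentages sum to the 61.04% AT content stated in the abstract.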

  18. Complete genome sequence of a novel avian paramyxovirus isolated from wild birds in South Korea.

    PubMed

    Jeong, Jipseol; Kim, Youngsik; An, Injung; Wang, Seung-Jun; Kim, Yongkwan; Lee, Hyun-Jeong; Choi, Kang-Seuk; Im, Se-Pyeong; Min, Wongi; Oem, Jae-Ku; Jheong, Weonhwa

    2018-01-01

    A novel avian paramyxovirus (APMV), Cheonsu1510, was isolated from wild bird feces in South Korea and serologically and genetically characterized. In hemagglutination inhibition tests, antiserum against Cheonsu1510 showed low reactivity with other APMVs and vice versa. The complete genome of Cheonsu1510 comprised 15,408 nucleotides, contained six open reading frames (3'-N-P-M-F-HN-L-5'), and showed low sequence identity to other APMVs (< 63%) and a unique genomic composition. Phylogenetic analysis revealed that Cheonsu1510 was related to but distinct from APMV-1, -9, and -15. These results suggest that Cheonsu1510 represents a new APMV serotype, APMV-17.

  19. The complete sequence of the mitochondrial genome of Arctic fox (Alopex lagopus).

    PubMed

    Yan, Shou-Qing; Guo, Peng-Cheng; Yue, Yuan; Li, Wan-Hong; Bai, Chun-Yan; Li, Yu-Mei; Sun, Jin-Hai; Zhao, Zhi-Hui

    2016-11-01

    In the present study, the complete mitochondrial genome sequence of Arctic fox (Alopex lagopus) was determined for the first time. It has a total length of 16,656 bp, and contains 13 protein-coding genes, 22 tRNA genes, 2 ribosomal RNA genes and 1 control region. The nucleotide composition is 31.3% for A, 26.2% for C, 14.8% for G and 27.7% for T. The D-loop region located between tRNA Pro and tRNA Phe contains a (ACACGTACACGCAT)18 tandem repeat array. The data will be useful for the investigation of the genetic structure and diversity in the natural and farmed population of Arctic foxes.
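
As an illustration of the tandem repeat array mentioned in this abstract, the sketch below counts consecutive copies of a repeat unit. It reads the "18" after the motif as a copy number, which is an assumption, and the flanking bases and function name are hypothetical rather than taken from the Arctic fox sequence.

```python
def count_tandem_copies(sequence, unit, start=0):
    """Count back-to-back copies of `unit` in `sequence` beginning at `start`."""
    if not unit:
        raise ValueError("repeat unit must not be empty")
    copies = 0
    while sequence.startswith(unit, start + copies * len(unit)):
        copies += 1
    return copies


UNIT = "ACACGTACACGCAT"  # motif quoted in the abstract (14 nt)
COPIES = 18              # assumed copy number

array = UNIT * COPIES
# Hypothetical flanking bases on either side of the repeat array.
region = "GG" + array + "TT"

print(len(UNIT), len(array))
print(count_tandem_copies(region, UNIT, start=2))
```

Under this reading the array spans 14 x 18 = 252 bp of the D-loop region.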

  20. [Completed sequences analysis on the Chinese attenuated yellow fever 17D vaccine strain and the WHO standard yellow fever 17D vaccine strain].

    PubMed

    Li, Jing; Yu, Yong-Xin; Dong, Guan-Mu

    2009-04-01

    The aim was to compare the molecular characteristics of the Chinese attenuated yellow fever 17D vaccine strain and the WHO reference yellow fever 17D vaccine strain. Primers were designed according to the published nucleotide sequences of YFV 17D strains in GenBank. Total RNA was extracted with Trizol and reverse transcribed. Each fragment of the YFV genome was amplified by PCR and subsequently sequenced. The fragments from the 5' and 3' ends of the two strains were cloned into the pGEM T-easy vector and then sequenced. The nucleotide and amino acid sequences of the two strains were 99% homologous to each other. No obvious nucleotide changes were found across the entire genome of either 17D strain. Moreover, there were no obvious changes in the E protein genes. However, E173 of YF17D Tiantan, which is associated with virulence, carried mutations. Phylogenetic analysis placed the two live attenuated yellow fever 17D vaccine strains in the same lineage. The results indicated that the two attenuated yellow fever 17D vaccine viruses accumulate mutations at a very low frequency and that their genomes are relatively stable.

  1. Molecular characterisation of Atlantic salmon paramyxovirus (ASPV): A novel paramyxovirus associated with proliferative gill inflammation

    USGS Publications Warehouse

    Falk, K.; Batts, W.N.; Kvellestad, A.; Kurath, G.; Wiik-Nielsen, J.; Winton, J.R.

    2008-01-01

    Atlantic salmon paramyxovirus (ASPV) was isolated in 1995 from gills of farmed Atlantic salmon suffering from proliferative gill inflammation. The complete genome sequence of ASPV was determined, revealing a genome 16,968 nucleotides in length consisting of six non-overlapping genes coding for the nucleo- (N), phospho- (P), matrix- (M), fusion- (F), haemagglutinin-neuraminidase- (HN) and large polymerase (L) proteins in the order 3'-N-P-M-F-HN-L-5'. The various conserved features related to virus replication found in most paramyxoviruses were also found in ASPV. These include: conserved and complementary leader and trailer sequences, tri-nucleotide intergenic regions and highly conserved transcription start and stop signal sequences. The P gene expression strategy of ASPV was like that of the respiro-, morbilli- and henipaviruses, which express the P and C proteins from the primary transcript and edit a portion of the mRNA to encode V and W proteins. Sequence similarities among various features related to virus replication, pairwise comparisons of all deduced ASPV protein sequences with homologous regions from other members of the family Paramyxoviridae, and phylogenetic analyses of these amino acid sequences suggested that ASPV was a novel member of the sub-family Paramyxovirinae, most closely related to the respiroviruses. © 2008 Elsevier B.V. All rights reserved.

  2. Differential recognition of the ORF2 region in a complete genome sequence of porcine circovirus type 2 (PCV2) isolated from boar bone marrow in Korea.

    PubMed

    Kweon, Chang-Hee; Nguyen, Lien Thi Kim; Yoo, Mi-Sun; Kang, Seung-Won

    2015-09-15

    Porcine circovirus type 2 (PCV2) is the causative agent of post-weaning multisystemic wasting syndrome (PMWS) in swine. Here, a phylogenetic tree was constructed using PCV2 nucleotide sequences derived from the bone marrow of Korean boar and previously reported PCV2 sequences isolated from various countries. PCV2 from Korean boar bone marrow (KC188796) was classified into the group containing PCV2a-Canada and other PCV2 strains from Korea. While the ORF1 region of the PCV2 genome was highly conserved, ORF2 (the capsid protein coding region) was relatively variable. The nucleotide sequences for bone marrow-derived PCV2 were 93.4-99.0% homologous to the other reference sequences. The deduced amino acid sequences for the ORF1 and ORF2 coding regions were 97.4-99.3% and 84.5-97.4% homologous with the other reference strains, respectively, indicating that KC188796 did not differ markedly from the other PCV2 strains. Phylogenetic analysis demonstrated that bone marrow-derived PCV2 was highly similar to PCV2a from Canada and may be related to persistent PCV2 infections in swine. Copyright © 2015 Elsevier B.V. All rights reserved.

  3. An Outbreak of Acute Hepatitis Caused by Genotype IB Hepatitis A Viruses Contaminating the Water Supply in Thailand.

    PubMed

    Ruchusatsawat, Kriangsak; Wongpiyabovorn, Jongkonnee; Kawidam, Chonthicha; Thiemsing, Laddawan; Sangkitporn, Somchai; Yoshizaki, Sayaka; Tatsumi, Masashi; Takeda, Naokazu; Ishii, Koji

    2016-01-01

    In 2000, an outbreak of acute hepatitis A was reported in a province adjacent to Bangkok, Thailand. The aim of this study was to investigate the cause of the 2000 hepatitis A outbreak in Thailand using molecular epidemiological analysis. Serum and stool specimens were collected from patients who were clinically diagnosed with acute viral hepatitis. Water samples from drinking water and deep-drilled wells were also collected. These specimens were subjected to polymerase chain reaction (PCR) amplification and sequencing of the VP1/2A region of the hepatitis A virus (HAV) genome. The entire genome sequence of one of the fecal specimens was determined and phylogenetically analyzed with those of known HAV sequences. Eleven of 24 fecal specimens collected from acute viral hepatitis patients were positive as determined by semi-nested reverse transcription PCR targeting the VP1/2A region of HAV. The nucleotide sequence of these samples had an identical genotype IB sequence, suggesting that the same causative agent was present. The complete nucleotide sequence derived from one of the samples indicated that the Thai genotype IB strain should be classified in a unique phylogenetic cluster. The analysis using an adjusted odds ratio showed that the consumption of groundwater was the most likely risk factor associated with the disease. © 2017 S. Karger AG, Basel.

  4. Nucleotide cleaving agents and method

    DOEpatents

    Que, Jr., Lawrence; Hanson, Richard S.; Schnaith, Leah M. T.

    2000-01-01

    The present invention provides a unique series of nucleotide cleaving agents and a method for cleaving a nucleotide sequence, whether single-stranded or double-stranded DNA or RNA, using a cationic metal complex having at least one polydentate ligand to cleave the nucleotide sequence phosphate backbone to yield a hydroxyl end and a phosphate end.

  5. Circulation of Endemic Type 2 Vaccine-Derived Poliovirus in Egypt from 1983 to 1993

    PubMed Central

    Yang, Chen-Fu; Naguib, Tary; Yang, Su-Ju; Nasr, Eman; Jorba, Jaume; Ahmed, Nahed; Campagnoli, Ray; van der Avoort, Harrie; Shimizu, Hiroyuki; Yoneyama, Tetsuo; Miyamura, Tatsuo; Pallansch, Mark; Kew, Olen

    2003-01-01

    From 1988 to 1993, 30 cases of poliomyelitis associated with poliovirus type 2 were found in seven governorates of Egypt. Because many of the cases were geographically and temporally clustered and because the case isolates differed antigenically from the vaccine strain, it was initially assumed that the cases signaled the continued circulation of wild type 2 poliovirus. However, comparison of sequences encoding the major capsid protein, VP1 (903 nucleotides), revealed that the isolates were related (93 to 97% nucleotide sequence identity) to the Sabin type 2 oral poliovirus vaccine (OPV) strain and unrelated (<82% nucleotide sequence identity) to the wild type 2 polioviruses previously indigenous to Egypt (last known isolate: 1979) or to any contemporary wild type 2 polioviruses found elsewhere. The rate and pattern of VP1 divergence among the circulating vaccine-derived poliovirus (cVDPV) isolates suggested that all lineages were derived from a single OPV infection that occurred around 1983 and that progeny from the initiating infection circulated for approximately a decade within Egypt along several independent chains of transmission. Complete genomic sequences of an early (1988) and a late (1993) cVDPV isolate revealed that their 5′ untranslated region (5′ UTR) and noncapsid- 3′ UTR sequences were derived from other species C enteroviruses. Circulation of type 2 cVDPVs occurred at a time of low OPV coverage in the affected communities and ceased when OPV coverage rates increased. The potential for cVDPVs to circulate in populations with low immunity to poliovirus has important implications for current and future strategies to eradicate polio worldwide. PMID:12857906
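
The identity percentages in this abstract translate directly into numbers of differing positions across the 903-nucleotide VP1 region. The sketch below shows that arithmetic together with a basic identity calculation; the function names and the two short example strings are hypothetical, and the sequences are assumed to be pre-aligned, of equal length, and free of gaps.

```python
def percent_identity(seq_a, seq_b):
    """Percent of identical positions between two aligned, equal-length sequences."""
    if len(seq_a) != len(seq_b) or not seq_a:
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(a == b for a, b in zip(seq_a.upper(), seq_b.upper()))
    return 100.0 * matches / len(seq_a)


def implied_differences(length_nt, identity_percent):
    """Approximate number of differing positions implied by an identity value."""
    return round(length_nt * (100 - identity_percent) / 100)


VP1_LENGTH = 903  # nucleotides, as quoted in the abstract

# Hypothetical aligned fragments differing at one of ten positions.
print(percent_identity("ACGTACGTAC", "ACGTTCGTAC"))

# Differences implied by the quoted identity values (97%, 93% and 82%).
for identity in (97, 93, 82):
    print(identity, implied_differences(VP1_LENGTH, identity))
```

On these figures, the vaccine-derived isolates differ from the Sabin type 2 strain at roughly 27 to 63 VP1 positions, whereas an identity below 82% corresponds to more than about 160 differences.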

  6. Circulation of endemic type 2 vaccine-derived poliovirus in Egypt from 1983 to 1993.

    PubMed

    Yang, Chen-Fu; Naguib, Tary; Yang, Su-Ju; Nasr, Eman; Jorba, Jaume; Ahmed, Nahed; Campagnoli, Ray; van der Avoort, Harrie; Shimizu, Hiroyuki; Yoneyama, Tetsuo; Miyamura, Tatsuo; Pallansch, Mark; Kew, Olen

    2003-08-01

    From 1988 to 1993, 30 cases of poliomyelitis associated with poliovirus type 2 were found in seven governorates of Egypt. Because many of the cases were geographically and temporally clustered and because the case isolates differed antigenically from the vaccine strain, it was initially assumed that the cases signaled the continued circulation of wild type 2 poliovirus. However, comparison of sequences encoding the major capsid protein, VP1 (903 nucleotides), revealed that the isolates were related (93 to 97% nucleotide sequence identity) to the Sabin type 2 oral poliovirus vaccine (OPV) strain and unrelated (<82% nucleotide sequence identity) to the wild type 2 polioviruses previously indigenous to Egypt (last known isolate: 1979) or to any contemporary wild type 2 polioviruses found elsewhere. The rate and pattern of VP1 divergence among the circulating vaccine-derived poliovirus (cVDPV) isolates suggested that all lineages were derived from a single OPV infection that occurred around 1983 and that progeny from the initiating infection circulated for approximately a decade within Egypt along several independent chains of transmission. Complete genomic sequences of an early (1988) and a late (1993) cVDPV isolate revealed that their 5' untranslated region (5' UTR) and noncapsid- 3' UTR sequences were derived from other species C enteroviruses. Circulation of type 2 cVDPVs occurred at a time of low OPV coverage in the affected communities and ceased when OPV coverage rates increased. The potential for cVDPVs to circulate in populations with low immunity to poliovirus has important implications for current and future strategies to eradicate polio worldwide.

  7. Molecular Characterization of two Potato Virus S Isolates from Late Blight Resistant Genotypes of Potato (Solanum tuberosum)

    USDA-ARS's Scientific Manuscript database

    Potato virus S (PVS) has a widespread distribution in the U.S. However, only two complete nucleotide sequences have been published. A recent survey of potato fields in the state of Washington confirms that PVS is widely prevalent. Late blight resistant (LBR) potato cultivars and genotypes were sho...

  8. Characterization and Complete Nucleotide Sequence of an Unusual Reptilian Retrovirus Recovered from the Order Crocodylia

    PubMed Central

    Martin, Joanne; Kabat, Peter; Herniou, Elisabeth; Tristem, Michael

    2002-01-01

    A novel group of retroviruses found within the order Crocodylia is described. Phylogenetic analyses demonstrate that they are probably the most divergent members of the Retroviridae described to date; even the most conserved regions of Pol show an average of only 23% amino acid identity when compared to other retroviruses. PMID:11932432

  9. Complete Genome Sequence of a Genomovirus Associated with Common Bean Plant Leaves in Brazil.

    PubMed

    Lamas, Natalia Silva; Fontenele, Rafaela Salgado; Melo, Fernando Lucas; Costa, Antonio Felix; Varsani, Arvind; Ribeiro, Simone Graça

    2016-11-10

    A new genomovirus has been identified in three common bean plants in Brazil. This virus has a circular genome of 2,220 nucleotides and 3 major open reading frames. It shares 80.7% genome-wide pairwise identity with a genomovirus recovered from Tongan fruit bat guano. Copyright © 2016 Lamas et al.

  10. Identification of sequence changes in live attenuated goose parvovirus vaccine strains developed in Asia and Europe.

    PubMed

    Shien, J-H; Wang, Y-S; Chen, C-H; Shieh, H K; Hu, C-C; Chang, P-C

    2008-10-01

    Live attenuated vaccines have been used for control of the disease caused by goose parvovirus (GPV), but the mechanism involved in attenuation of GPV remains elusive. This report presents the complete nucleotide sequences of two live attenuated strains of GPV (82-0321V and VG32/1) that were independently developed in Taiwan and Europe, together with the parental strain of 82-0321V and a field strain isolated in Taiwan in 2006. Sequence comparisons showed that 82-0321V and VG32/1 had multiple deletions and substitutions in the inverted terminal repeats region when compared with their parental strain or the field virus, but these changes did not affect the formation of the hairpin structure essential for viral replication. Moreover, 82-0321V and VG32/1 had five amino acid changes in the non-structural protein, but these changes were located at positions distant from known functional motifs in the non-structural protein. In contrast, 82-0321V had nine changes and VG32/1 had 11 changes in their capsid proteins (VP1), and the majority of these changes occurred at positions close to the putative receptor binding sites of VP1, as predicted using the structure of adeno-associated virus 2 as the model system. Taken together, the results suggest that changes in sequence near the receptor binding sites of VP1 might be responsible for attenuation of GPV. This is the first report of complete nucleotide sequences of GPV other than the virulent B strain, and suggests a possible mechanism for attenuation of GPV.

  11. Analysis of Complete Nucleotide Sequences of 12 Gossypium Chloroplast Genomes: Origin and Evolution of Allotetraploids

    PubMed Central

    Xu, Qin; Xiong, Guanjun; Li, Pengbo; He, Fei; Huang, Yi; Wang, Kunbo; Li, Zhaohu; Hua, Jinping

    2012-01-01

    Background: Cotton (Gossypium spp.) is a model system for the analysis of polyploidization. Although ascertaining the donor species of allotetraploid cotton has been intensively studied, sequence comparison of Gossypium chloroplast genomes is still of interest to understand the mechanisms underlying the evolution of Gossypium allotetraploids, while it is generally accepted that the parents were A- and D-genome containing species. Here we performed a comparative analysis of 13 Gossypium chloroplast genomes, twelve of which are presented here for the first time. Methodology/Principal Findings: The size of the 12 chloroplast genomes under study varied from 159,959 bp to 160,433 bp. The chromosomes were highly similar, having >98% sequence identity. They encoded the same set of 112 unique genes, which occurred in a uniform order with only slightly different boundary junctions. Divergence due to indels as well as substitutions was examined separately for genome, coding and noncoding sequences. The genome divergence was estimated as 0.374% to 0.583% between allotetraploid species and A-genome, and 0.159% to 0.454% within allotetraploids. Forty protein-coding genes were completely identical at the protein level, and 20 intergenic sequences were completely conserved. The 9 allotetraploids shared 5 insertions and 9 deletions in whole genome, and 7-bp substitutions in protein-coding genes. The phylogenetic tree confirmed a close relationship between allotetraploids and the ancestor of A-genome, and the allotetraploids were divided into four separate groups. Progenitor allotetraploid cotton originated 0.43–0.68 million years ago (MYA). Conclusion: Despite the high degree of conservation between the Gossypium chloroplast genomes, sequence variations among species could still be detected. Gossypium chloroplast genomes showed a preference for 5-bp indels, and 1–3-bp indels are mainly attributed to SSR polymorphisms. This study supports the view that the common ancestor of diploid A-genome species in Gossypium is the maternal source of extant allotetraploid species and that allotetraploids have a monophyletic origin. G. hirsutum AD1 lineages have experienced more sequence variations than other allotetraploids in intergenic regions. The available complete nucleotide sequences of 12 Gossypium chloroplast genomes should facilitate studies to uncover the molecular mechanisms of compartmental co-evolution and speciation of Gossypium allotetraploids. PMID:22876273

  12. Nucleotide sequence analysis of the recA gene and discrimination of the three isolates of urease-positive thermophilic Campylobacter (UPTC) isolated from seagulls (Larus spp.) in Northern Ireland.

    PubMed

    Matsuda, M; Tai, K; Moore, J E; Millar, B C; Murayama, O

    2004-01-01

    Nucleotide sequencing after TA cloning of the amplicon of the almost-full length recA gene from three strains of UPTC (A1, A2, and A3) isolated from seagulls in Northern Ireland, the phenotypical and genotypical characteristics of which have been demonstrated to be indistinguishable, clarified nucleotide differences at three nucleotide positions among the three strains. In conclusion, the nucleotide sequences of the recA gene were found to discriminate among the three strains of UPTC, A1, A2, and A3, which are indistinguishable phenotypically and genotypically. Thus, the present study strongly suggests that nucleotide sequence data of the amplicon of a suitable gene or region could aid in discriminating among isolates of the UPTC group, which are indistinguishable phenotypically and genotypically. Copyright 2004 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim

  13. Nucleic acid analysis using terminal-phosphate-labeled nucleotides

    DOEpatents

    Korlach, Jonas [Ithaca, NY; Webb, Watt W [Ithaca, NY; Levene, Michael [Ithaca, NY; Turner, Stephen [Ithaca, NY; Craighead, Harold G [Ithaca, NY; Foquet, Mathieu [Ithaca, NY]

    2008-04-22

    The present invention is directed to a method of sequencing a target nucleic acid molecule having a plurality of bases. In its principle, the temporal order of base additions during the polymerization reaction is measured on a molecule of nucleic acid, i.e. the activity of a nucleic acid polymerizing enzyme on the template nucleic acid molecule to be sequenced is followed in real time. The sequence is deduced by identifying which base is being incorporated into the growing complementary strand of the target nucleic acid by the catalytic activity of the nucleic acid polymerizing enzyme at each step in the sequence of base additions. A polymerase on the target nucleic acid molecule complex is provided in a position suitable to move along the target nucleic acid molecule and extend the oligonucleotide primer at an active site. A plurality of labelled types of nucleotide analogs are provided proximate to the active site, with each distinguishable type of nucleotide analog being complementary to a different nucleotide in the target nucleic acid sequence. The growing nucleic acid strand is extended by using the polymerase to add a nucleotide analog to the nucleic acid strand at the active site, where the nucleotide analog being added is complementary to the nucleotide of the target nucleic acid at the active site. The nucleotide analog added to the oligonucleotide primer as a result of the polymerizing step is identified. The steps of providing labelled nucleotide analogs, polymerizing the growing nucleic acid strand, and identifying the added nucleotide analog are repeated so that the nucleic acid strand is further extended and the sequence of the target nucleic acid is determined.
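
    The deduction step in this record (reading the target sequence from the order in which complementary bases are incorporated) can be illustrated with a minimal sketch. It assumes error-free base calls and plain DNA Watson-Crick pairing; the function name and the example input are hypothetical and are not taken from the patent.

```python
# Watson-Crick complements for DNA (assumption: no modified bases, no errors).
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def template_from_incorporations(incorporated):
    """Deduce the template strand (written 5'->3') from the bases added to
    the growing strand, given in temporal order (i.e. 5'->3' of the new strand).

    The polymerase reads the template 3'->5', so the template written 5'->3'
    is the reverse complement of the incorporation order.
    """
    return "".join(COMPLEMENT[base] for base in reversed(incorporated))


# Hypothetical observation: T, then A, then C, then G were incorporated.
print(template_from_incorporations(["T", "A", "C", "G"]))  # CGTA
```

    The sketch covers only the bookkeeping; detection of the labelled analogs at the active site is outside its scope.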

  14. Nucleotide sequence analysis of the L gene of Newcastle disease virus: homologies with Sendai and vesicular stomatitis viruses.

    PubMed Central

    Yusoff, K; Millar, N S; Chambers, P; Emmerson, P T

    1987-01-01

    The nucleotide sequence of the L gene of the Beaudette C strain of Newcastle disease virus (NDV) has been determined. The L gene is 6704 nucleotides long and encodes a protein of 2204 amino acids with a calculated molecular weight of 248822. Mung bean nuclease mapping of the 5' terminus of the L gene mRNA indicates that the transcription of the L gene is initiated 11 nucleotides upstream of the translational start site. Comparison with the amino acid sequences of the L genes of Sendai virus and vesicular stomatitis virus (VSV) suggests that there are several regions of homology between the sequences. These data provide further evidence for an evolutionary relationship between the Paramyxoviridae and the Rhabdoviridae. A non-coding sequence of 46 nucleotides downstream of the presumed polyadenylation site of the L gene may be part of a negative strand leader RNA. PMID:3035486

  15. Enriching public descriptions of marine phages using the Genomic Standards Consortium MIGS standard

    PubMed Central

    Duhaime, Melissa Beth; Kottmann, Renzo; Field, Dawn; Glöckner, Frank Oliver

    2011-01-01

    In any sequencing project, the possible depth of comparative analysis is determined largely by the amount and quality of the accompanying contextual data. The structure, content, and storage of this contextual data should be standardized to ensure consistent coverage of all sequenced entities and facilitate comparisons. The Genomic Standards Consortium (GSC) has developed the “Minimum Information about Genome/Metagenome Sequences (MIGS/MIMS)” checklist for the description of genomes, and here we annotate all 30 publicly available marine bacteriophage sequences to the MIGS standard. These annotations build on existing International Nucleotide Sequence Database Collaboration (INSDC) records and confirm, as expected, that current submissions lack most MIGS fields. MIGS fields were manually curated from the literature and placed in XML format as specified by the Genomic Contextual Data Markup Language (GCDML). These “machine-readable” reports were then analyzed to highlight patterns describing this collection of genomes. Completed reports are provided in GCDML. This work represents one step towards the annotation of our complete collection of genome sequences and shows the utility of capturing richer metadata along with raw sequences. PMID:21677864

  16. The genome sequence of pepper vein yellows virus (family Luteoviridae, genus Polerovirus).

    PubMed

    Murakami, Ritsuko; Nakashima, Nobuhiko; Hinomoto, Norihide; Kawano, Shinji; Toyosato, Tetsuya

    2011-05-01

    The complete genome of pepper vein yellows virus (PeVYV) was sequenced using random amplification of RNA samples isolated from vector insects (Aphis gossypii) that had been given access to PeVYV-infected plants. The PeVYV genome consisted of 6244 nucleotides and had a genomic organization characteristic of members of the genus Polerovirus. PeVYV had highest amino acid sequence identities in ORF0 to ORF3 (75.9 - 91.9%) with tobacco vein distorting polerovirus, with which it was only 25.1% identical in ORF5. These sequence comparisons and previously studied biological properties indicate that PeVYV is a distinctly different virus and belongs to a new species of the genus Polerovirus.

  17. 37 CFR 1.821 - Nucleotide and/or amino acid sequence disclosures in patent applications.

    Code of Federal Regulations, 2014 CFR

    2014-07-01

    ...” means those amino acids other than “Xaa” and those nucleotide bases other than “n” defined in accordance... 37 Patents, Trademarks, and Copyrights 1 2014-07-01 2014-07-01 false Nucleotide and/or amino acid... Biotechnology Invention Disclosures Application Disclosures Containing Nucleotide And/or Amino Acid Sequences...

  18. 37 CFR 1.821 - Nucleotide and/or amino acid sequence disclosures in patent applications.

    Code of Federal Regulations, 2013 CFR

    2013-07-01

    ...” means those amino acids other than “Xaa” and those nucleotide bases other than “n” defined in accordance... 37 Patents, Trademarks, and Copyrights 1 2013-07-01 2013-07-01 false Nucleotide and/or amino acid... Biotechnology Invention Disclosures Application Disclosures Containing Nucleotide And/or Amino Acid Sequences...

  19. 37 CFR 1.821 - Nucleotide and/or amino acid sequence disclosures in patent applications.

    Code of Federal Regulations, 2012 CFR

    2012-07-01

    ...” means those amino acids other than “Xaa” and those nucleotide bases other than “n” defined in accordance... 37 Patents, Trademarks, and Copyrights 1 2012-07-01 2012-07-01 false Nucleotide and/or amino acid... Biotechnology Invention Disclosures Application Disclosures Containing Nucleotide And/or Amino Acid Sequences...

  20. Detection and molecular characterization of tomato yellow leaf curl virus naturally infecting Lycopersicon esculentum in Egypt.

    PubMed

    Rabie, M; Ratti, C; Abdel Aleem, E; Fattouh, F

    Tomato yellow leaf curl virus (TYLCV) infections of tomato crops in Egypt were widely spread in 2014. Infected symptomatic tomato plants from different governorates were sampled. TYLCV strains Israel and Mild (TYLCV-IL, TYLCV-Mild) were identified by multiplex and real-time PCR. In addition, nucleotide sequence analysis of the V1 and V2 protein genes revealed ten TYLCV Egyptian isolates (TYLCV from TY1 to 10). Phylogenetic analysis showed their high degree of relatedness with the TYLCV-IL Jordan isolate (98%). Here we report the complete nucleotide sequence of the TYLCV Egyptian isolate TY10, sampled from El Beheira. A high degree of similarity to other previously reported Egyptian isolates and isolates from Jordan and Japan reflects the importance of phylogenetic analysis in monitoring virus genetic diversity and possibilities for divergence of more virulent strains or genotypes.

  1. The complete mitochondrial genome of the medicinal fungus Ganoderma applanatum (Polyporales, Basidiomycota).

    PubMed

    Wang, Xin-Cun; Shao, Junjie; Liu, Chang

    2016-07-01

    We have determined the complete nucleotide sequence of the mitochondrial genome of the medicinal fungus Ganoderma applanatum (Pers.) Pat. using next-generation sequencing technology. The circular molecule is 119,803 bp long with a GC content of 26.66%. Gene prediction revealed genes encoding 15 conserved proteins, 25 tRNAs, and the large and small ribosomal RNAs; all genes are located on the same strand except trnW-CCA. Compared with previously sequenced genomes of G. lucidum, G. meredithiae and G. sinense, the order of the protein and rRNA genes is highly conserved; however, the types of tRNA genes are slightly different. The mitochondrial genome of G. applanatum will contribute to the understanding of the phylogeny and evolution of Ganoderma and Ganodermataceae, the group containing many species with high medicinal values.

  2. Complete nucleotide sequence of the Cryptomeria japonica D. Don. chloroplast genome and comparative chloroplast genomics: diversified genomic structure of coniferous species.

    PubMed

    Hirao, Tomonori; Watanabe, Atsushi; Kurita, Manabu; Kondo, Teiji; Takata, Katsuhiko

    2008-06-23

    The recent determination of complete chloroplast (cp) genomic sequences of various plant species has enabled numerous comparative analyses as well as advances in plant and genome evolutionary studies. In angiosperms, the complete cp genome sequences of about 70 species have been determined, whereas those of only three gymnosperm species, Cycas taitungensis, Pinus thunbergii, and Pinus koraiensis have been established. The lack of information regarding the gene content and genomic structure of gymnosperm cp genomes may severely hamper further progress of plant and cp genome evolutionary studies. To address this need, we report here the complete nucleotide sequence of the cp genome of Cryptomeria japonica, the first in the Cupressaceae sensu lato of gymnosperms, and provide a comparative analysis of their gene content and genomic structure that illustrates the unique genomic features of gymnosperms. The C. japonica cp genome is 131,810 bp in length, with 112 single copy genes and two duplicated (trnI-CAU, trnQ-UUG) genes that give a total of 116 genes. Compared to other land plant cp genomes, the C. japonica cp has lost one of the relevant large inverted repeats (IRs) found in angiosperms, fern, liverwort, and gymnosperms, such as Cycas and Ginkgo, and additionally has completely lost its trnR-CCG, partially lost its trnT-GGU, and shows diversification of accD. The genomic structure of the C. japonica cp genome also differs significantly from those of other plant species. For example, we estimate that a minimum of 15 inversions would be required to transform the gene organization of the Pinus thunbergii cp genome into that of C. japonica. In the C. japonica cp genome, direct repeat and inverted repeat sequences are observed at the inversion and translocation endpoints, and these sequences may be associated with the genomic rearrangements. The observed differences in genomic structure between C. japonica and other land plants, including pines, strongly support the theory that the large IRs stabilize the cp genome. Furthermore, the deleted large IR and the numerous genomic rearrangements that have occurred in the C. japonica cp genome provide new insights into both the evolutionary lineage of coniferous species in gymnosperm and the evolution of the cp genome.

  3. Nucleotide sequence determination of guinea-pig casein B mRNA reveals homology with bovine and rat alpha s1 caseins and conservation of the non-coding regions of the mRNA.

    PubMed Central

    Hall, L; Laird, J E; Craig, R K

    1984-01-01

    Nucleotide sequence analysis of cloned guinea-pig casein B cDNA sequences has identified two casein B variants related to the bovine and rat alpha s1 caseins. Amino acid homology was largely confined to the known bovine or predicted rat phosphorylation sites and within the 'signal' precursor sequence. Comparison of the deduced nucleotide sequence of the guinea-pig and rat alpha s1 casein mRNA species showed greater sequence conservation in the non-coding than in the coding regions, suggesting a functional and possibly regulatory role for the non-coding regions of casein mRNA. The results provide insight into the evolution of the casein genes, and raise questions as to the role of conserved nucleotide sequences within the non-coding regions of mRNA species. PMID:6548375

  4. Homogeneity of the 16S rDNA sequence among geographically disparate isolates of Taylorella equigenitalis

    PubMed Central

    Matsuda, M; Tazumi, A; Kagawa, S; Sekizuka, T; Murayama, O; Moore, JE; Millar, BC

    2006-01-01

    Background: At present, six accessible sequences of 16S rDNA from Taylorella equigenitalis (T. equigenitalis) are available, whose sequence differences occur at a few nucleotide positions. Thus it is important to determine these sequences from additional strains in other countries, if possible, in order to clarify any anomalies regarding 16S rDNA sequence heterogeneity. Here, we clone and sequence the approximate full-length 16S rDNA from additional strains of T. equigenitalis isolated in Japan, Australia and France and compare these sequences to the existing published sequences. Results: Clarification of any anomalies regarding 16S rDNA sequence heterogeneity of T. equigenitalis was carried out. When the approximate full-length 16S rDNA from 17 strains of T. equigenitalis isolated in Japan, Australia and France was cloned, sequenced and compared, nucleotide sequence differences were demonstrated at six loci in the 1,469-nucleotide sequence. Moreover, 12 polymorphic sites occurred among 23 sequences of the 16S rDNA, including the six reference sequences. Conclusion: High sequence similarity (99.5% or more) was observed throughout, except from nucleotide positions 138 to 501 where substitutions and deletions were noted. PMID:16398935

  5. Human ribosomal RNA gene: nucleotide sequence of the transcription initiation region and comparison of three mammalian genes.

    PubMed Central

    Financsek, I; Mizumoto, K; Mishima, Y; Muramatsu, M

    1982-01-01

    The transcription initiation site of the human ribosomal RNA gene (rDNA) was located by using the single-strand specific nuclease protection method and by determining the first nucleotide of the in vitro capped 45S preribosomal RNA. The sequence of 1,211 nucleotides surrounding the initiation site was determined. The sequenced region was found to consist of 75% G and C and to contain a number of short direct and inverted repeats and palindromes. By comparison of the corresponding initiation regions of three mammalian species, several conserved sequences were found upstream and downstream from the transcription starting point. Two short A + T-rich sequences are present on human, mouse, and rat ribosomal RNA genes between the initiation site and 40 nucleotides upstream, and a C + T cluster is located at a position around -60. At and downstream from the initiation site, a common sequence, T-AG-C-T-G-A-C-A-C-G-C-T-G-T-C-C-T-CT-T, was found in the three genes from position -1 through +18. The strong conservation of these sequences suggests their functional significance in rDNA. The S1 nuclease protection experiments with cloned rDNA fragments indicated the presence in human 45S RNA of molecules several hundred nucleotides shorter than the supposed primary transcript. The first 19 nucleotides of these molecules appear identical--except for one mismatch--to the nucleotide sequence of the 5' end of a supposed early processing product of the mouse 45S RNA. PMID:6954460

  6. Molecular identification of a new begomovirus infecting yellow passion fruit (Passiflora edulis) in Colombia.

    PubMed

    Vaca-Vaca, Juan Carlos; Carrasco-Lozano, Emerson Clovis; López-López, Karina

    2017-02-01

    The complete genome sequence of a bipartite begomovirus (genus Begomovirus, family Geminiviridae) infecting yellow passion fruit (Passiflora edulis) in the state of Valle del Cauca (Colombia) has been determined. The complete DNA-A and DNA-B components were determined to be 2600 and 2572 nt in length, respectively. The DNA-A showed the highest nucleotide sequence identity (87.2 %) to bean dwarf mosaic virus (M88179), a begomovirus found in common bean crops in Colombia, and only 77.4 % identity to passion fruit severe leaf distortion virus (FJ972767), a begomovirus identified infecting passion fruit in Brazil. Based on its sequence identity to all other begomoviruses known to date and in accordance with the ICTV species demarcation criterion for the genus Begomovirus (≥91 % sequence identity for the complete DNA-A), the name passion fruit leaf distortion virus is proposed for this new begomovirus. To our knowledge, this is the first report of a bipartite begomovirus affecting passion fruit in Colombia and the second report of a geminivirus affecting this crop worldwide.
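
    The species call in this record rests on comparing a pairwise DNA-A identity with the ≥91 % demarcation criterion quoted in the abstract. A minimal sketch of that comparison follows, assuming the two sequences are already aligned and that gapped columns are simply skipped (published identity tools may treat gaps differently); the function names and the toy sequences are hypothetical.

```python
SPECIES_THRESHOLD = 91.0  # percent, as quoted in the abstract for complete DNA-A


def percent_identity(aligned_a, aligned_b):
    """Percent identity between two pre-aligned sequences of equal length.

    Columns containing a gap ('-') in either sequence are ignored; this
    gap handling is an assumption of the sketch.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same length")
    pairs = [(x, y) for x, y in zip(aligned_a.upper(), aligned_b.upper())
             if x != "-" and y != "-"]
    if not pairs:
        raise ValueError("no ungapped columns to compare")
    matches = sum(x == y for x, y in pairs)
    return 100.0 * matches / len(pairs)


def below_species_threshold(best_identity):
    """True when the best match falls under the demarcation threshold."""
    return best_identity < SPECIES_THRESHOLD


# Toy alignment (not real viral sequence): 9 of 10 columns match.
print(percent_identity("ACGTACGTAC", "ACGTTCGTAC"))  # 90.0
# Highest DNA-A identity reported in the abstract.
print(below_species_threshold(87.2))  # True
```

    With the reported best match of 87.2 %, the isolate falls below the threshold, which is consistent with the proposal of a new species name.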

  7. CNTNAP2 Is Significantly Associated With Speech Sound Disorder in the Chinese Han Population.

    PubMed

    Zhao, Yun-Jing; Wang, Yue-Ping; Yang, Wen-Zhu; Sun, Hong-Wei; Ma, Hong-Wei; Zhao, Ya-Ru

    2015-11-01

    Speech sound disorder is the most common communication disorder. Some investigations support the possibility that the CNTNAP2 gene might be involved in the pathogenesis of speech-related diseases. To investigate single-nucleotide polymorphisms in the CNTNAP2 gene, 300 unrelated speech sound disorder patients and 200 normal controls were included in the study. Five single-nucleotide polymorphisms were amplified and directly sequenced. Significant differences were found in the genotype (P = .0003) and allele (P = .0056) frequencies of rs2538976 between patients and controls. The excess frequency of the A allele in the patient group remained significant after Bonferroni correction (P = .0280). A significant haplotype association with rs2710102T/+rs17236239A/+2538976A/+2710117A (P = 4.10e-006) was identified. A neighboring single-nucleotide polymorphism, rs10608123, was found in complete linkage disequilibrium with rs2538976, and the genotypes exactly corresponded to each other. The authors propose that these CNTNAP2 variants increase the susceptibility to speech sound disorder. The single-nucleotide polymorphisms rs10608123 and rs2538976 may merge into one single-nucleotide polymorphism. © The Author(s) 2015.
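
    The Bonferroni-corrected value in this record is consistent with multiplying the allele-level P value by the number of polymorphisms tested. A minimal sketch of that arithmetic follows; the multiplier of five (one test per sequenced polymorphism) is an assumption, since the abstract does not state it explicitly.

```python
def bonferroni(p_value, n_tests):
    """Bonferroni-adjusted P value: multiply by the number of tests, cap at 1."""
    return min(1.0, p_value * n_tests)


# Allele-frequency P value for rs2538976 from the abstract; n_tests = 5 is an
# assumption (one test per sequenced single-nucleotide polymorphism).
adjusted = bonferroni(0.0056, 5)
print(f"{adjusted:.4f}")  # 0.0280
```

    Under that assumption the adjusted value matches the reported P = .0280.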

  8. Quantum Point Contact Single-Nucleotide Conductance for DNA and RNA Sequence Identification.

    PubMed

    Afsari, Sepideh; Korshoj, Lee E; Abel, Gary R; Khan, Sajida; Chatterjee, Anushree; Nagpal, Prashant

    2017-11-28

    Several nanoscale electronic methods have been proposed for high-throughput single-molecule nucleic acid sequence identification. While many studies display a large ensemble of measurements as "electronic fingerprints" with some promise for distinguishing the DNA and RNA nucleobases (adenine, guanine, cytosine, thymine, and uracil), important metrics such as accuracy and confidence of base calling fall well below the current genomic methods. Issues such as unreliable metal-molecule junction formation, variation of nucleotide conformations, insufficient differences between the molecular orbitals responsible for single-nucleotide conduction, and lack of rigorous base calling algorithms lead to overlapping nanoelectronic measurements and poor nucleotide discrimination, especially at low coverage on single molecules. Here, we demonstrate a technique for reproducible conductance measurements on conformation-constrained single nucleotides and an advanced algorithmic approach for distinguishing the nucleobases. Our quantum point contact single-nucleotide conductance sequencing (QPICS) method uses combed and electrostatically bound single DNA and RNA nucleotides on a self-assembled monolayer of cysteamine molecules. We demonstrate that by varying the applied bias and pH conditions, molecular conductance can be switched ON and OFF, leading to reversible nucleotide perturbation for electronic recognition (NPER). We utilize NPER as a method to achieve >99.7% accuracy for DNA and RNA base calling at low molecular coverage (∼12×) using unbiased single measurements on DNA/RNA nucleotides, which represents a significant advance compared to existing sequencing methods. These results demonstrate the potential for utilizing simple surface modifications and existing biochemical moieties in individual nucleobases for a reliable, direct, single-molecule, nanoelectronic DNA and RNA nucleotide identification method for sequencing.

  9. The primary structures of two yeast enolase genes. Homology between the 5' noncoding flanking regions of yeast enolase and glyceraldehyde-3-phosphate dehydrogenase genes.

    PubMed

    Holland, M J; Holland, J P; Thill, G P; Jackson, K A

    1981-02-10

    Segments of yeast genomic DNA containing two enolase structural genes have been isolated by subculture cloning procedures using a cDNA hybridization probe synthesized from purified yeast enolase mRNA. Based on restriction endonuclease and transcriptional maps of these two segments of yeast DNA, each hybrid plasmid contains a region of extensive nucleotide sequence homology which forms hybrids with the cDNA probe. The DNA sequences which flank this homologous region in the two hybrid plasmids are nonhomologous, indicating that these sequences are nontandemly repeated in the yeast genome. The complete nucleotide sequence of the coding as well as the flanking noncoding regions of these genes has been determined. The amino acid sequence predicted from one reading frame of both structural genes is extremely similar to that determined for yeast enolase (Chin, C. C. Q., Brewer, J. M., Eckard, E., and Wold, F. (1981) J. Biol. Chem. 256, 1370-1376), confirming that these isolated structural genes encode yeast enolase. The nucleotide sequences of the coding regions of the genes are approximately 95% homologous, and neither gene contains an intervening sequence. Codon utilization in the enolase genes follows the same biased pattern previously described for two yeast glyceraldehyde-3-phosphate dehydrogenase structural genes (Holland, J. P., and Holland, M. J. (1980) J. Biol. Chem. 255, 2596-2605). DNA blotting analysis confirmed that the isolated segments of yeast DNA are colinear with yeast genomic DNA and that there are two nontandemly repeated enolase genes per haploid yeast genome. The noncoding portions of the two enolase genes adjacent to the initiation and termination codons are approximately 70% homologous and contain sequences thought to be involved in the synthesis and processing of messenger RNA. Finally, there are regions of extensive homology between the two enolase structural genes and two yeast glyceraldehyde-3-phosphate dehydrogenase structural genes within the 5' noncoding portions of these glycolytic genes.

  10. Molecular characterization of the Great Lakes viral hemorrhagic septicemia virus (VHSV) isolate from USA

    PubMed Central

    Ammayappan, Arun; Vakharia, Vikram N

    2009-01-01

    Background: Viral hemorrhagic septicemia virus (VHSV) causes a highly contagious viral disease of fresh and saltwater fish worldwide. VHSV caused several large scale fish kills in the Great Lakes area and has been found in 28 different host species. The emergence of VHS in the Great Lakes began with the isolation of VHSV from a diseased muskellunge (Esox masquinongy) caught from Lake St. Clair in 2003. VHSV is a member of the genus Novirhabdovirus, within the family Rhabdoviridae. It has a linear single-stranded, negative-sense RNA genome of approximately 11 kbp, with six genes. VHSV replicates in the cytoplasm and produces six monocistronic mRNAs. The gene order of VHSV is 3'-N-P-M-G-NV-L-5'. This study describes molecular characterization of the Great Lakes VHSV strain (MI03GL), and its phylogenetic relationships with selected European and North American isolates. Results: The complete genomic sequence of the VHSV-MI03GL strain was determined from cloned cDNA of six overlapping fragments, obtained by RT-PCR amplification of genomic RNA. The complete genome sequence of MI03GL comprises 11,184 nucleotides (GenBank GQ385941) with the gene order of 3'-N-P-M-G-NV-L-5'. These genes are separated by conserved gene junctions, with di-nucleotide gene spacers. The first 4 nucleotides at the termini of the VHSV genome are complementary and identical to the genomic termini of other novirhabdoviruses. Sequence homology and phylogenetic analysis show that the Great Lakes virus is closely related to the Japanese strains JF00Ehi1 (96%) and KRRV9822 (95%). Among other novirhabdoviruses, VHSV shares highest sequence homology (62%) with snakehead rhabdovirus. Conclusion: The phylogenetic tree obtained by comparing 48 glycoprotein gene sequences of different VHSV strains demonstrates that the Great Lakes VHSV is closely related to the North American and Japanese genotype IVa, but forms a distinct genotype IVb, which is clearly different from the three European genotypes. Molecular characterization of the Great Lakes isolate will be helpful in studying the pathogenesis of VHSV using a reverse genetics approach and developing efficient control strategies. PMID:19852863

  11. Sequence-based prediction of protein-binding sites in DNA: comparative study of two SVM models.

    PubMed

    Park, Byungkyu; Im, Jinyong; Tuvshinjargal, Narankhuu; Lee, Wook; Han, Kyungsook

    2014-11-01

    As many structures of protein-DNA complexes have become known in recent years, several computational methods have been developed to predict DNA-binding sites in proteins. However, the inverse problem (i.e., predicting protein-binding sites in DNA) has received much less attention. One of the reasons is that the differences between the interaction propensities of nucleotides are much smaller than those between amino acids. Another reason is that DNA exhibits less diverse sequence patterns than protein. Therefore, predicting protein-binding DNA nucleotides is much harder than predicting DNA-binding amino acids. We computed the interaction propensity (IP) of nucleotide triplets with amino acids using an extensive dataset of protein-DNA complexes, and developed two support vector machine (SVM) models that predict protein-binding nucleotides from sequence data alone. One SVM model predicts protein-binding nucleotides using DNA sequence data alone, and the other SVM model predicts protein-binding nucleotides using both DNA and protein sequences. In a 10-fold cross-validation with 1519 DNA sequences, the SVM model that uses DNA sequence data only predicted protein-binding nucleotides with an accuracy of 67.0%, an F-measure of 67.1%, and a Matthews correlation coefficient (MCC) of 0.340. With an independent dataset of 181 DNAs that were not used in training, it achieved an accuracy of 66.2%, an F-measure of 66.3%, and an MCC of 0.324. The other SVM model, which uses both DNA and protein sequences, achieved an accuracy of 69.6%, an F-measure of 69.6%, and an MCC of 0.383 in a 10-fold cross-validation with 1519 DNA sequences and 859 protein sequences. With an independent dataset of 181 DNAs and 143 proteins, it showed an accuracy of 67.3%, an F-measure of 66.5%, and an MCC of 0.329. Both in cross-validation and in independent testing, the second SVM model, which used both DNA and protein sequence data, performed better than the first model, which used DNA sequence data only. To the best of our knowledge, this is the first attempt to predict protein-binding nucleotides in a given DNA sequence from the sequence data alone. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
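
    The three figures of merit quoted in this record (accuracy, F-measure, MCC) all follow from a 2x2 confusion matrix. The sketch below shows the standard definitions only; the function name and the example counts are hypothetical and are not taken from the paper, whose per-class counts are not given in the abstract.

```python
import math

def binary_metrics(tp, fp, tn, fn):
    """Return (accuracy, F-measure, MCC) for a 2x2 confusion matrix.

    tp/fp/tn/fn are counts of true/false positives/negatives, e.g. for
    "binding" vs "non-binding" nucleotides. Zero denominators yield 0.0.
    """
    total = tp + fp + tn + fn
    accuracy = (tp + tn) / total if total else 0.0
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    pr_sum = precision + recall
    f_measure = 2 * precision * recall / pr_sum if pr_sum else 0.0
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    return accuracy, f_measure, mcc

# Hypothetical counts, chosen only to exercise the formulas.
acc, f1, mcc = binary_metrics(tp=5, fp=5, tn=5, fn=5)
print(acc, f1, mcc)
```

    An MCC near 0 indicates chance-level prediction and 1 a perfect one, which is why the abstract reports it alongside accuracy.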

  12. Array of nucleic acid probes on biological chips for diagnosis of HIV and methods of using the same

    DOEpatents

    Chee, Mark; Gingeras, Thomas R.; Fodor, Stephen P. A.; Hubble, Earl A.; Morris, MacDonald S.

    1999-01-19

    The invention provides an array of oligonucleotide probes immobilized on a solid support for analysis of a target sequence from a human immunodeficiency virus. The array comprises at least four sets of oligonucleotide probes 9 to 21 nucleotides in length. A first probe set has a probe corresponding to each nucleotide in a reference sequence from a human immunodeficiency virus. A probe is related to its corresponding nucleotide by being exactly complementary to a subsequence of the reference sequence that includes the corresponding nucleotide. Thus, each probe has a position, designated an interrogation position, that is occupied by a complementary nucleotide to the corresponding nucleotide. The three additional probe sets each have a corresponding probe for each probe in the first probe set. Thus, for each nucleotide in the reference sequence, there are four corresponding probes, one from each of the probe sets. The three corresponding probes in the three additional probe sets are identical to the corresponding probe from the first probe set or a subsequence thereof that includes the interrogation position, except that the interrogation position is occupied by a different nucleotide in each of the four corresponding probes.
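
    The four-probe-set layout described in this record can be sketched as follows. This is an illustration under stated assumptions, not the patented design: the function names, the default probe length of 15 (within the stated 9 to 21 range), and the choice of a central interrogation position are all hypothetical.

```python
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}

def reverse_complement(seq):
    """Return the reverse complement of a DNA string over A/C/G/T."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))

def tiling_probes(reference, probe_len=15):
    """Build four probes per reference position (one per probe set).

    For each position that has enough flanking sequence, the probe that is
    exactly complementary to the surrounding window belongs to the first
    set; the other three differ only at the interrogation position.
    Assumption: the interrogation position is the probe's central base.
    """
    if probe_len % 2 == 0:
        raise ValueError("probe_len must be odd so the interrogation position is central")
    half = probe_len // 2
    probes = {}
    for pos in range(half, len(reference) - half):
        window = reference[pos - half : pos + half + 1]
        perfect = reverse_complement(window)  # exactly complementary probe
        probes[pos] = {
            base: perfect[:half] + base + perfect[half + 1 :]
            for base in "ACGT"
        }
    return probes

# Toy reference, not an HIV sequence.
for pos, four in tiling_probes("ACGTACGTA", probe_len=5).items():
    print(pos, four)
```

    In such a layout, the probe among the four that hybridizes most strongly indicates which nucleotide the target carries at that position.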

  13. The use of sequence-based SSR mining for the development of a vast collection of microsatellites in Aquilegia formosa

    Treesearch

    Brandon Schlautman; Vera Pfeiffer; Juan Zalapa; Johanne Brunet

    2014-01-01

    Numerous microsatellite markers were developed for Aquilegia formosa from sequences deposited within the Expressed Sequence Tag (EST), Genomic Survey Sequence (GSS), and Nucleotide databases in NCBI. Microsatellites (SSRs) were identified and primers were designed for 9 SSR containing sequences in the Nucleotide database, 3803 sequences in the EST...

  14. Cloning and expression of UDP-glucose: flavonoid 7-O-glucosyltransferase from hairy root cultures of Scutellaria baicalensis.

    PubMed

    Hirotani, M; Kuroda, R; Suzuki, H; Yoshikawa, T

    2000-05-01

    A cDNA encoding UDP-glucose: baicalein 7-O-glucosyltransferase (UBGT) was isolated from a cDNA library from hairy root cultures of Scutellaria baicalensis Georgi probed with a partial-length cDNA clone of a UDP-glucose: flavonoid 3-O-glucosyltransferase (UFGT) from grape (Vitis vinifera L.). The heterologous probe contained a glucosyltransferase consensus amino acid sequence which was also present in the Scutellaria cDNA clones. The complete nucleotide sequence of the 1688-bp cDNA insert was determined and the deduced amino acid sequences are presented. The nucleotide sequence analysis of UBGT revealed an open reading frame encoding a polypeptide of 476 amino acids with a calculated molecular mass of 53,094 Da. The reaction product for baicalein and UDP-glucose catalyzed by recombinant UBGT in Escherichia coli was identified as authentic baicalein 7-O-glucoside using high-performance liquid chromatography and proton nuclear magnetic resonance spectroscopy. The enzyme activities of recombinant UBGT expressed in E. coli were also detected towards flavonoids such as baicalein, wogonin, apigenin, scutellarein, 7,4'-dihydroxyflavone and kaempferol, and phenolic compounds. The accumulation of UBGT mRNA in hairy roots was in response to wounding or salicylic acid treatments.

  15. [Molecular cloning and characterization of an acetylcholinesterase gene Dd-ace-2 from sweet potato stem nematode Ditylenchus destructor].

    PubMed

    Ding, Zhong; Peng, Deliang; Huang, Wenkun; He, Wenting; Gao, Bida

    2008-02-01

    A cDNA, named Dd-ace-2, encoding an acetylcholinesterase (AChE, EC 3.1.1.7) was isolated from the sweet potato stem nematode, Ditylenchus destructor. The nucleotide and amino acid sequences of different nematode species were compared and analyzed with the DNAMAN 5.0 and MEGA 3.0 software packages. The results showed that the complete nucleotide sequence of the Dd-ace-2 gene of Ditylenchus destructor contains 2425 base pairs, from which 734 amino acids were deduced (GenBank accession No. EF583058). The amino acid sequence homologies of Dd-ace-2 between Ditylenchus destructor and Meloidogyne incognita, Caenorhabditis elegans, and Dictyocaulus viviparus were 48.0%, 42.7%, and 42.1%, respectively. The mature acetylcholinesterase of Ditylenchus destructor may be encoded by the first 701 residues of the deduced 734 amino acids. The conserved motifs involved in the catalytic triad, the choline binding site and 10 aromatic residues lining the catalytic gorge were present in the Dd-ace-2 deduced protein. Phylogenetic analysis based on AChEs of other nematodes and species showed that the deduced AChE formed the same cluster with ACE-2s.

  16. Switchgrass ubiquitin promoter (PVUBI2) and uses thereof

    DOEpatents

    Stewart, C. Neal; Mann, David George James

    2013-12-10

    The subject application provides polynucleotides, compositions thereof and methods for regulating gene expression in a plant. Polynucleotides disclosed herein comprise novel sequences for a promoter isolated from Panicum virgatum (switchgrass) that initiates transcription of an operably linked nucleotide sequence. Thus, various embodiments of the invention comprise the nucleotide sequence of SEQ ID NO: 2 or fragments thereof comprising nucleotides 1 to 692 of SEQ ID NO: 2 that are capable of driving the expression of an operably linked nucleic acid sequence.

  17. The complete mitochondrial genome of Plodia interpunctella (Lepidoptera: Pyralidae) and comparison with other Pyraloidea insects.

    PubMed

    Liu, Qiu-Ning; Chai, Xin-Yue; Bian, Dan-Dan; Zhou, Chun-Lin; Tang, Bo-Ping

    2016-01-01

    The mitochondrial (mt) genome can provide important information for the understanding of phylogenetic relationships. The complete mt genome of Plodia interpunctella (Lepidoptera: Pyralidae) has been sequenced. The circular genome is 15 287 bp in size, encoding 13 protein-coding genes (PCGs), 2 rRNA genes, 22 tRNA genes, and a control region. The AT skew of this mt genome is slightly negative, and the nucleotide composition is biased toward A+T nucleotides (80.15%). All PCGs start with the typical ATN (ATA, ATC, ATG, and ATT) codons, except for the cox1 gene, which may start with the CGA codon. Four of the 13 PCGs harbor the incomplete termination codon T or TA. All the tRNA genes are folded into the typical clover-leaf structure of mitochondrial tRNA, except for trnS1 (AGN), in which the DHU arm fails to form a stable stem-loop structure. The overlapping sequences are 35 bp in total and are found in seven different locations. A total of 240 bp of intergenic spacers are scattered in 16 regions. The control region of the mt genome is 327 bp in length and consists of several features common to the sequenced lepidopteran insects. Phylogenetic analysis based on 13 PCGs using the Maximum Likelihood method shows that P. interpunctella is placed within the Pyralidae.
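
    The composition statistics mentioned in this record can be computed from a sequence as sketched below, using the conventional definitions A+T content = (A+T)/(A+T+G+C) and AT skew = (A-T)/(A+T). The function name and the toy sequence are hypothetical; the real genome sequence is not reproduced here.

```python
def composition_stats(seq):
    """Return (A+T content in percent, AT skew) for a DNA string.

    Only A, C, G and T are counted; ambiguous bases such as N are ignored.
    AT skew is (A - T) / (A + T), so a negative value means T outnumbers A.
    """
    seq = seq.upper()
    a, t, g, c = (seq.count(base) for base in "ATGC")
    total = a + t + g + c
    if total == 0 or (a + t) == 0:
        raise ValueError("sequence contains no countable A/T bases")
    at_content = 100.0 * (a + t) / total
    at_skew = (a - t) / (a + t)
    return at_content, at_skew

# Toy sequence: 2 A, 3 T, 1 G, 1 C.
print(composition_stats("AATTTGC"))
```

    A "slightly negative" AT skew, as reported for this genome, therefore corresponds to a small excess of T over A on the sequenced strand.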

  18. The complete chloroplast genome of North American ginseng, Panax quinquefolius.

    PubMed

    Han, Zeng-Jie; Li, Wei; Liu, Yuan; Gao, Li-Zhi

    2016-09-01

    We report the complete nucleotide sequence of the Panax quinquefolius chloroplast genome using next-generation sequencing technology. The genome size is 156 359 bp, including two inverted repeats (IRs) of 52 153 bp, separated by the large single-copy (LSC 86 184 bp) and small single-copy (SSC 18 081 bp) regions. This cp genome encodes 114 unigenes (80 protein-coding genes, four rRNA genes, and 30 tRNA genes), of which 18 are duplicated in the IR regions. Overall GC content of the genome is 38.08%. A phylogenomic analysis of the 10 complete chloroplast genomes from Araliaceae using Daucus carota from Apiaceae as outgroup showed that P. quinquefolius is closely related to the other two members of the genus Panax, P. ginseng and P. notoginseng.

  19. Complete genome sequence of a new isolate of Solenopsis invicta virus 3 from Solenopsis invicta x richteri hybrid ants

    USDA-ARS's Scientific Manuscript database

    Solenopsis invicta virus 3 (SINV-3) is a positive-sense, single-stranded RNA virus that infects the red imported fire ant, Solenopsis invicta Buren. We report here the full genome (10,383 nucleotides [nt]) of an isolate infecting Solenopsis invicta x richteri hybrids, which we have identified as SI...

  20. The complete nucleotide sequence of the Barley yellow dwarf virus-RMV genome reveals it to be a new Polerovirus distantly related to other yellow dwarf viruses

    USDA-ARS's Scientific Manuscript database

    The yellow dwarf viruses (YDVs) of the Luteoviridae family represent the most widespread group of cereal viruses worldwide. They include the Barley yellow dwarf viruses (BYDVs) of genus Luteovirus, the Cereal yellow dwarf viruses (CYDVs) and Wheat yellow dwarf virus (WYDV) of genus Polerovirus. All ...

  1. Molecular anatomy of lymphocystis disease virus.

    PubMed

    Tidona, C A; Darai, G

    1997-01-01

    Lymphocystis disease (LD) has been reported to occur in over one hundred different species of fish worldwide. The disease is caused by lymphocystis disease virus (LCDV), a member of the iridovirus family. Numerous fish species that play an important role in fishery and fish farming are highly susceptible to LCDV infection. The infected animals develop disseminated clusters of aberrant hypertrophied cells within their connective tissue, the so-called lymphocystis cells. In the cytoplasm of these cells a massive accumulation of virions can be observed. As a first step towards understanding the mechanisms of viral infection and pathogenesis the complete genomic nucleotide sequence of lymphocystis disease virus type 1 (LCDV-1; flounder isolate) was determined. LCDV-1 is the type species of the genus Lymphocystivirus within the family Iridoviridae. The virions contain a single linear double-stranded DNA molecule that is circularly permuted, terminally redundant and heavily methylated. Since there is no convenient cell system for virus replication we determined the complete nucleotide sequence of the viral genome (102,653 base pairs). Computer assisted analyses of 195 potential open reading frames resulted in the identification of a number of putative gene products with significant homology to functionally characterized proteins of other species.

  2. The Complete Genomic Sequence of Pepper Yellow Leaf Curl Virus (PYLCV) and Its Implications for Our Understanding of Evolution Dynamics in the Genus Polerovirus

    PubMed Central

    Dombrovsky, Aviv; Glanz, Eyal; Lachman, Oded; Sela, Noa; Doron-Faigenboim, Adi; Antignus, Yehezkel

    2013-01-01

    We determined the complete sequence and organization of the genome of a putative member of the genus Polerovirus tentatively named Pepper yellow leaf curl virus (PYLCV). PYLCV has a wider host range than Tobacco vein-distorting virus (TVDV) and has a close serological relationship with Cucurbit aphid-borne yellows virus (CABYV) (both poleroviruses). The extracted viral RNA was subjected to SOLiD next-generation sequence analysis and used as a template for reverse transcription synthesis, which was followed by PCR amplification. The ssRNA genome of PYLCV includes 6,028 nucleotides encoding six open reading frames (ORFs), which is typical of the genus Polerovirus. Comparisons of the deduced amino acid sequences of the PYLCV ORFs 2-4 and ORF5, indicate that there are high levels of similarity between these sequences to ORFs 2-4 of TVDV (84-93%) and to ORF5 of CABYV (87%). Both PYLCV and Pepper vein yellowing virus (PeVYV) contain sequences that point to a common ancestral polerovirus. The recombination breakpoint which is located at CABYV ORF3, which encodes the viral coat protein (CP), may explain the CABYV-like sequences found in the genomes of the pepper infecting viruses PYLCV and PeVYV. Two additional regions unique to PYLCV (PY1 and PY2) were identified between nucleotides 4,962 and 5,061 (ORF 5) and between positions 5,866 and 6,028 in the 3' NCR. Sequence analysis of the pepper-infecting PeVYV revealed three unique regions (Pe1-Pe3) with no similarity to other members of the genus Polerovirus. Genomic analyses of PYLCV and PeVYV suggest that the speciation of these viruses occurred through putative recombination event(s) between poleroviruses co-infecting a common host(s), resulting in the emergence of PYLCV, a novel pathogen with a wider host range. PMID:23936244

  3. The complete genomic sequence of pepper yellow leaf curl virus (PYLCV) and its implications for our understanding of evolution dynamics in the genus polerovirus.

    PubMed

    Dombrovsky, Aviv; Glanz, Eyal; Lachman, Oded; Sela, Noa; Doron-Faigenboim, Adi; Antignus, Yehezkel

    2013-01-01

    We determined the complete sequence and organization of the genome of a putative member of the genus Polerovirus tentatively named Pepper yellow leaf curl virus (PYLCV). PYLCV has a wider host range than Tobacco vein-distorting virus (TVDV) and has a close serological relationship with Cucurbit aphid-borne yellows virus (CABYV) (both poleroviruses). The extracted viral RNA was subjected to SOLiD next-generation sequence analysis and used as a template for reverse transcription synthesis, which was followed by PCR amplification. The ssRNA genome of PYLCV includes 6,028 nucleotides encoding six open reading frames (ORFs), which is typical of the genus Polerovirus. Comparisons of the deduced amino acid sequences of the PYLCV ORFs 2-4 and ORF5, indicate that there are high levels of similarity between these sequences to ORFs 2-4 of TVDV (84-93%) and to ORF5 of CABYV (87%). Both PYLCV and Pepper vein yellowing virus (PeVYV) contain sequences that point to a common ancestral polerovirus. The recombination breakpoint which is located at CABYV ORF3, which encodes the viral coat protein (CP), may explain the CABYV-like sequences found in the genomes of the pepper infecting viruses PYLCV and PeVYV. Two additional regions unique to PYLCV (PY1 and PY2) were identified between nucleotides 4,962 and 5,061 (ORF 5) and between positions 5,866 and 6,028 in the 3' NCR. Sequence analysis of the pepper-infecting PeVYV revealed three unique regions (Pe1-Pe3) with no similarity to other members of the genus Polerovirus. Genomic analyses of PYLCV and PeVYV suggest that the speciation of these viruses occurred through putative recombination event(s) between poleroviruses co-infecting a common host(s), resulting in the emergence of PYLCV, a novel pathogen with a wider host range.

  4. Evaluation of anonymous and expressed sequence tag derived polymorphic microsatellite markers in the tobacco budworm Heliothis virescens (Lepidoptera: noctuidae)

    USDA-ARS's Scientific Manuscript database

    Polymorphic genetic markers were identified and characterized using a partial genomic library of Heliothis virescens enriched for simple sequence repeats (SSR) and nucleotide sequences of expressed sequence tags (EST). Nucleotide sequences of 192 clones from the partial genomic library yielded 147 u...

  5. Phylogenetic analysis of the envelope protein (domain III) of dengue 4 viruses

    PubMed Central

    Mota, Javier; Ramos-Castañeda, José; Rico-Hesse, Rebeca; Ramos, Celso

    2011-01-01

    Objective: To evaluate the genetic variability of domain III of envelope (E) protein and to estimate phylogenetic relationships of dengue 4 (Den-4) viruses isolated in Mexico and from other endemic areas of the world. Material and Methods: A phylogenetic study of domain III of envelope (E) protein of Den-4 viruses was conducted in 1998 using virus strains from Mexico and other parts of the world, isolated in different years. Specific primers were used to amplify domain III by RT-PCR and to obtain its nucleotide sequence. Based on the nucleotide and deduced amino acid sequences, genetic variability was estimated and a phylogenetic tree was generated. To simplify genetic analysis of the domain III region, a Restriction Fragment Length Polymorphism (RFLP) assay was performed, using six restriction enzymes. Results: Study results demonstrate that nucleotide and amino acid sequence analyses of domain III are similar to those reported from the complete E protein gene. Based on the RFLP analysis of domain III using the restriction enzymes Nla III, Dde I and Cfo I, Den-4 viruses included in this study were clustered into the previously reported genotypes 1 and 2. Conclusions: Study results suggest that domain III may be used as a genetic marker for phylogenetic and molecular epidemiology studies of dengue viruses. The English version of this paper is also available at: http://www.insp.mx/salud/index.html PMID:12132320

  6. Construction and sequencing of an infectious clone of the goose embryo-adapted Muscovy duck parvovirus vaccine strain FZ91-30.

    PubMed

    Wang, Jianye; Huang, Yu; Zhou, Mingxu; Hardwidge, Philip R; Zhu, Guoqiang

    2016-06-21

    Muscovy duck parvovirus (MDPV) is the etiological agent of Muscovy duckling parvoviral disease, which is characterized by diarrhea, locomotive dysfunction, stunting, and death in young ducklings, and causes substantial economic losses in the Muscovy duck industry worldwide. FZ91-30 is an attenuated vaccine strain that is safe and immunogenic to ducklings, but the genomic information and molecular mechanism underlying the attenuation are not understood. The FZ91-30 strain was propagated in 11-day-old embryonated goose eggs, and viral particles were purified from the pooled allantoic fluid by differential centrifugation and ultracentrifugation. Single-stranded genomic DNA was extracted and annealed to form double-stranded DNA. Digestion of the dsDNA with NcoI yielded two sub-genomic fragments, which were each cloned into the modified plasmid pBluescript II SK, generating plasmids pBSKNL and pBSKNR. The sub-genomic plasmid clones were sequenced and further combined to construct the plasmid pFZ that contained the entire genome of strain FZ91-30. The complete genome sequences of strains FM and YY and partial genome sequences of other strains were retrieved from GenBank for sequence comparison. The plasmid pFZ containing the entire genome of FZ91-30 was transfected into 11-day-old embryonated goose eggs via the chorioallantoic membrane route to rescue infectious virus. A genetic marker was introduced into the rescued virus to discriminate it from its parental virus. The genome of FZ91-30 consists of 5,131 nucleotides and has 98.9 % similarity to the FM strain. The inverted terminal repeats (ITR) are 456 nucleotides in length, 14 nucleotides longer than those of Goose parvovirus (GPV). The exterior 415 nucleotides of the ITR form a hairpin structure, and the interior 41 nucleotides constitute the D sequence, a reverse complement of the D' sequence at the 3' ITR. Amino acid sequence alignment of the VP1 proteins between FZ91-30 and five pathogenic MDPV strains revealed that FZ91-30 had five mutations: two in the unique region of the VP1 protein (VP1u) and three in VP3. Sequence alignment of the Rep1 proteins revealed two amino acid alterations for FZ91-30, both of which were conserved for two pathogenic strains YY and P. Transfection of the plasmid pFZ into 11-day-old embryonated goose eggs resulted in generation of infectious virus with biological properties similar to those of the parental strain. The amino acid mutations identified in the VP1 and Rep1 proteins may contribute to the attenuation of FZ91-30 in Muscovy ducklings. Plasmid transfection in embryonated goose eggs was suitable for rescue of infectious MDPV.

  7. DNA Nucleotide Sequence Restricted by the RI Endonuclease

    PubMed Central

    Hedgpeth, Joe; Goodman, Howard M.; Boyer, Herbert W.

    1972-01-01

    The sequence of DNA base pairs adjacent to the phosphodiester bonds cleaved by the RI restriction endonuclease in unmodified DNA from coliphage λ has been determined. The 5′-terminal nucleotide labeled with 32P and oligonucleotides up to the heptamer were analyzed from a pancreatic DNase digest. The following sequence of nucleotides adjacent to the RI break made in λ DNA was deduced from these data and from the 3′-dinucleotide sequence and nearest-neighbor analysis obtained from repair synthesis with the DNA polymerase of Rous sarcoma virus [Formula: see text] The RI endonuclease cleavage of the phosphodiester bonds (indicated by arrows) generates 5′-phosphoryls and short cohesive termini of four nucleotides, pApApTpT. The most striking feature of the sequence is its symmetry. PMID:4343974

  8. Comparison of simple sequence repeats in 19 Archaea.

    PubMed

    Trivedi, S

    2006-12-05

    All organisms that have been studied until now have been found to have differential distribution of simple sequence repeats (SSRs), with more SSRs in intergenic than in coding sequences. SSR distribution was investigated in Archaea genomes where complete chromosome sequences of 19 Archaea were analyzed with the program SPUTNIK to find di- to penta-nucleotide repeats. The number of repeats was determined for the complete chromosome sequences and for the coding and non-coding sequences. Different from what has been found for other groups of organisms, there is an abundance of SSRs in coding regions of the genome of some Archaea. Dinucleotide repeats were rare and CG repeats were found in only two Archaea. In general, trinucleotide repeats are the most abundant SSR motifs; however, pentanucleotide repeats are abundant in some Archaea. Some of the tetranucleotide and pentanucleotide repeat motifs are organism specific. In general, repeats are short and CG-rich repeats are present in Archaea having a CG-rich genome. Among the 19 Archaea, SSR density was not correlated with genome size or with optimum growth temperature. Pentanucleotide density had an inverse correlation with the CG content of the genome.
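
    The kind of search described in this record (di- to penta-nucleotide repeats) can be approximated with a back-referencing regular expression. This is a simplified stand-in that finds perfect repeats only; it is not the SPUTNIK algorithm used in the study, and the function name, thresholds, and toy sequence are hypothetical.

```python
import re

def find_ssrs(seq, min_unit=2, max_unit=5, min_repeats=3):
    """Find perfect tandem repeats with unit lengths min_unit..max_unit.

    Returns (start, unit, repeat_count) tuples. Units that are themselves
    repeats of a shorter motif (e.g. "ATAT" or "AAA") are skipped so that
    each repeat is reported once, under its shortest unit.
    """
    hits = []
    for unit_len in range(min_unit, max_unit + 1):
        # ([ACGT]{n}) captures one unit; \1{m,} requires m further copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (unit_len, min_repeats - 1))
        for match in pattern.finditer(seq):
            unit = match.group(1)
            if any(
                unit == unit[:k] * (unit_len // k)
                for k in range(1, unit_len)
                if unit_len % k == 0
            ):
                continue
            hits.append((match.start(), unit, len(match.group(0)) // unit_len))
    return hits

# Toy sequence with one dinucleotide and one trinucleotide repeat.
print(find_ssrs("TTCGCGCGCGAAGATGATGATGCC"))
```

    Running the same function separately on coding and non-coding regions and dividing the hit counts by region length would give SSR densities comparable in spirit to those discussed in the abstract.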

  9. Complete sequence of the genome of avian paramyxovirus type 2 (strain Yucaipa) and comparison with other paramyxoviruses

    PubMed Central

    Subbiah, Madhuri; Xiao, Sa; Collins, Peter L.; Samal, Siba K

    2009-01-01

    The complete RNA genome sequence of avian paramyxovirus (APMV) serotype 2, strain Yucaipa isolated from chicken has been determined. With genome size of 14,904 nucleotides (nt), strain Yucaipa is consistent with the “rule of six” and is the smallest virus reported to date among the members of subfamily Paramyxovirinae. The genome contains six non-overlapping genes in the order 3′-N-P/V-M-F-HN-L-5′. The genes are flanked on either side by highly-conserved transcription start and stop signals and have intergenic sequences varying in length from 3 to 23 nt. The genome contains a 55 nt leader sequence at 3′ end and a 154 nt trailer sequence at 5′ end. Alignment and phylogenetic analysis of the predicted amino acid sequences of strain Yucaipa proteins with the cognate proteins of viruses of all of the five genera of family Paramyxoviridae showed that APMV-2 strain Yucaipa is more closely related to APMV-6 than APMV-1. PMID:18603323
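
    The "rule of six" cited in this record states that paramyxovirus genomes replicate efficiently only when their length is a multiple of six nucleotides. A minimal check is sketched below; the function name is hypothetical, and only the genome length comes from the abstract.

```python
def obeys_rule_of_six(genome_length_nt):
    """Return True if the genome length is an exact multiple of six."""
    return genome_length_nt % 6 == 0

yucaipa_length = 14904  # nt, as reported for APMV-2 strain Yucaipa
print(obeys_rule_of_six(yucaipa_length), yucaipa_length // 6)
```

    For this genome the check passes, and the length corresponds to 2484 hexamers.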

  10. Isolation and molecular identification of Sunshine virus, a novel paramyxovirus found in Australian snakes.

    PubMed

    Hyndman, Timothy H; Marschang, Rachel E; Wellehan, James F X; Nicholls, Philip K

    2012-10-01

    This paper describes the isolation and molecular identification of a novel paramyxovirus found during an investigation of an outbreak of neurorespiratory disease in a collection of Australian pythons. Using Illumina® high-throughput sequencing, a 17,187 nucleotide sequence was assembled from RNA extracts from infected viper heart cells (VH2) displaying widespread cytopathic effects in the form of multinucleate giant cells. The sequence appears to contain all the coding regions of the genome, including the following predicted paramyxoviral open reading frames (ORFs): 3'--Nucleocapsid (N)--putative Phosphoprotein (P)--Matrix (M)--Fusion (F)--putative attachment protein--Polymerase (L)--5'. There is also a 540 nucleotide ORF between the N and putative P genes that may be an additional coding region. Phylogenetic analyses of the complete N, M, F and L genes support the clustering of this virus within the family Paramyxoviridae but outside both of the current subfamilies: Paramyxovirinae and Pneumovirinae. We propose to name this new virus, Sunshine virus, after the geographic origin of the first isolate--the Sunshine Coast of Queensland, Australia. Copyright © 2012 Elsevier B.V. All rights reserved.

  11. A novel flavivirus detected in two Aedes spp. collected near the demilitarized zone of the Republic of Korea.

    PubMed

    Korkusol, Achareeya; Takhampunya, Ratree; Hang, Jun; Jarman, Richard G; Tippayachai, Bousaraporn; Kim, Heung-Chul; Chong, Sung-Tae; Davidson, Silas A; Klein, Terry A

    2017-05-01

    Flaviviruses comprise a large and diverse group of positive-stranded RNA viruses, including tick-, mosquito- and unknown-vector-borne flaviviruses. A novel flavivirus was detected in pools of Aedes vexans nipponii (n=1) and Aedes esoensis (n=3) collected in 2012 and 2013 near the demilitarized zone (DMZ), Republic of Korea (ROK). Phylogenetic analyses of the NS5, E gene and complete polyprotein coding sequence (CDS) showed that the novel virus fell within the Aedes-borne flaviviruses (ABFVs), with nucleotide identity ranging from 57.8-75.1 %, 46.1-74.2 % and 51.1-76.2 %, respectively. While the novel ABFV was distant from other flaviviruses within the group, it formed a clade with Ilomantsi virus (ILOV). Sequence alignments of the partial NS5 gene, full-length E gene and polyprotein CDS between the novel virus and ILOV showed approximately 76.2 % nucleotide identity and 90 % amino acid identity, respectively. The ABFV identified in Aedes mosquitoes from the ROK is a novel ABFV based on the sequence analyses and is designated as Panmunjeom flavivirus (PANFV).

  12. Persistence of Orientia tsutsugamushi in humans.

    PubMed

    Chung, Moon-Hyun; Lee, Jin-Soo; Baek, Ji-hyeon; Kim, Mijeong; Kang, Jae-Seung

    2012-03-01

    We investigated the persistence of viable Orientia tsutsugamushi in patients who had recovered from scrub typhus. Blood specimens were available from six patients with scrub typhus who were at 1 to 18 months after the onset of the illness. The EDTA-treated blood specimens were inoculated into ECV304 cells, and cultures were maintained for 7 months. Sequencing of the 56-kDa type-specific antigen gene of O. tsutsugamushi was performed to ascertain the homology of isolates. O. tsutsugamushi was isolated from all six patients, and nucleotide sequences of isolates serially collected from each patient were identical in all five patients in whom nucleotide sequences were compared. One patient relapsed 2 days after completion of antibiotic therapy; two patients complained of weakness for 1 to 2.5 months after the illness; one patient underwent coronary angioplasty 6 months later; and one patient suffered from a transient ischemic attack 8 months later. This finding suggests that O. tsutsugamushi causes chronic latent infection, which may be associated with certain clinical illnesses, preceded by scrub typhus. Antibiotic therapy abates the symptoms of scrub typhus, but does not eradicate O. tsutsugamushi from the human body.

  13. Complete nucleotide sequence, genome organization, and biological properties of human immunodeficiency virus type 1 in vivo: evidence for limited defectiveness and complementation.

    PubMed Central

    Li, Y; Hui, H; Burgess, C J; Price, R W; Sharp, P M; Hahn, B H; Shaw, G M

    1992-01-01

    Previous studies of the genetic and biologic characteristics of human immunodeficiency virus type 1 (HIV-1) have by necessity used tissue culture-derived virus. We recently reported the molecular cloning of four full-length HIV-1 genomes directly from uncultured human brain tissue (Y. Li, J. C. Kappes, J. A. Conway, R. W. Price, G. M. Shaw, and B. H. Hahn, J. Virol. 65:3973-3985, 1991). In this report, we describe the biologic properties of these four clones and the complete nucleotide sequences and genome organization of two of them. Clones HIV-1YU-2 and HIV-1YU-10 were 9,174 and 9,176 nucleotides in length, differed by 0.26% in nucleotide sequence, and except for a frameshift mutation in the pol gene in HIV-1YU-10, contained open reading frames corresponding to 5'-gag-pol-vif-vpr-tat-rev-vpu-env-nef-3' flanked by long terminal repeats. HIV-1YU-2 was fully replication competent, while HIV-1YU-10 and two other clones, HIV-1YU-21 and HIV-1YU-32, were defective. All three defective clones, however, when transfected into Cos-1 cells in any pairwise combination, yielded virions that were replication competent and transmissible by cell-free passage. The cellular host range of HIV-1YU-2 was strictly limited to primary T lymphocytes and monocyte-macrophages, a property conferred by its external envelope glycoprotein. Phylogenetic analyses of HIV-1YU-2 gene sequences revealed this virus to be a member of the North American/European HIV-1 subgroup, with specific similarity to other monocyte-tropic viruses in its V3 envelope amino acid sequence. These results indicate that HIV-1 infection of brain is characterized by the persistence of mixtures of fully competent, minimally defective, and more substantially altered viral forms and that complementation among them is readily attainable. 
    In addition, the limited degree of genotypic heterogeneity observed among HIV-1YU and other brain-derived viruses and their preferential tropism for monocyte-macrophages suggest that viral replication within the central nervous system may differ from that within the peripheral lymphoid compartment in significant and clinically important ways. The availability of genetically and biologically well characterized HIV-1 clones from uncultured human tissue should facilitate future studies of virus-cell interactions relevant to viral pathogenesis and drug and vaccine development. PMID:1404605

  14. Sequence of a cDNA encoding pancreatic preprosomatostatin-22.

    PubMed Central

    Magazin, M; Minth, C D; Funckes, C L; Deschenes, R; Tavianini, M A; Dixon, J E

    1982-01-01

    We report the nucleotide sequence of a precursor to somatostatin that upon proteolytic processing may give rise to a hormone of 22 amino acids. The nucleotide sequence of a cDNA from the channel catfish (Ictalurus punctatus) encodes a precursor to somatostatin that is 105 amino acids (Mr, 11,500). The cDNA coding for somatostatin-22 consists of 36 nucleotides in the 5' untranslated region, 315 nucleotides that code for the precursor to somatostatin-22, 269 nucleotides at the 3' untranslated region, and a variable length of poly(A). The putative preprohormone contains a sequence of hydrophobic amino acids at the amino terminus that has the properties of a "signal" peptide. A connecting sequence of approximately 57 amino acids is followed by a single Arg-Arg sequence, which immediately precedes the hormone. Somatostatin-22 is homologous to somatostatin-14 in 7 of the 14 amino acids, including the Phe-Trp-Lys sequence. Hybridization selection of mRNA, followed by its translation in a wheat germ cell-free system, resulted in the synthesis of a single polypeptide having a molecular weight of approximately 10,000 as estimated on Na-DodSO4/polyacrylamide gels. PMID:6127673

  15. Chloroplast DNA Structural Variation, Phylogeny, and Age of Divergence among Diploid Cotton Species.

    PubMed

    Chen, Zhiwen; Feng, Kun; Grover, Corrinne E; Li, Pengbo; Liu, Fang; Wang, Yumei; Xu, Qin; Shang, Mingzhao; Zhou, Zhongli; Cai, Xiaoyan; Wang, Xingxing; Wendel, Jonathan F; Wang, Kunbo; Hua, Jinping

    2016-01-01

    The cotton genus (Gossypium spp.) contains 8 monophyletic diploid genome groups (A, B, C, D, E, F, G, K) and a single allotetraploid clade (AD). To gain insight into the phylogeny of Gossypium and molecular evolution of the chloroplast genome in this group, we performed a comparative analysis of 19 Gossypium chloroplast genomes, six reported here for the first time. Nucleotide distance in non-coding regions was about three times that of coding regions. As expected, distances were smaller within than among genome groups. Phylogenetic topologies based on nucleotide and indel data support the resolution of the 8 genome groups into 6 clades. Phylogenetic analysis of indel distribution among the 19 genomes demonstrates contrasting evolutionary dynamics in different clades, with a parallel genome downsizing in two genome groups and a biased accumulation of insertions in the clade containing the cultivated cottons leading to large (for Gossypium) chloroplast genomes. Divergence time estimates derived from the cpDNA sequence suggest that the major diploid clades had diverged approximately 10 to 11 million years ago. The complete nucleotide sequences of 6 cpDNA genomes are provided, offering a resource for cytonuclear studies in Gossypium.

  16. Chloroplast DNA Structural Variation, Phylogeny, and Age of Divergence among Diploid Cotton Species

    PubMed Central

    Li, Pengbo; Liu, Fang; Wang, Yumei; Xu, Qin; Shang, Mingzhao; Zhou, Zhongli; Cai, Xiaoyan; Wang, Xingxing; Wendel, Jonathan F.; Wang, Kunbo

    2016-01-01

    The cotton genus (Gossypium spp.) contains 8 monophyletic diploid genome groups (A, B, C, D, E, F, G, K) and a single allotetraploid clade (AD). To gain insight into the phylogeny of Gossypium and molecular evolution of the chloroplast genome in this group, we performed a comparative analysis of 19 Gossypium chloroplast genomes, six reported here for the first time. Nucleotide distance in non-coding regions was about three times that of coding regions. As expected, distances were smaller within than among genome groups. Phylogenetic topologies based on nucleotide and indel data support the resolution of the 8 genome groups into 6 clades. Phylogenetic analysis of indel distribution among the 19 genomes demonstrates contrasting evolutionary dynamics in different clades, with a parallel genome downsizing in two genome groups and a biased accumulation of insertions in the clade containing the cultivated cottons leading to large (for Gossypium) chloroplast genomes. Divergence time estimates derived from the cpDNA sequence suggest that the major diploid clades had diverged approximately 10 to 11 million years ago. The complete nucleotide sequences of 6 cpDNA genomes are provided, offering a resource for cytonuclear studies in Gossypium. PMID:27309527

  17. Plant nitrogen regulatory P-PII genes

    DOEpatents

    Coruzzi, Gloria M.; Lam, Hon-Ming; Hsieh, Ming-Hsiun

    2001-01-01

    The present invention generally relates to plant nitrogen regulatory PII gene (hereinafter P-PII gene), a gene involved in regulating plant nitrogen metabolism. The invention provides P-PII nucleotide sequences, expression constructs comprising said nucleotide sequences, and host cells and plants having said constructs and, optionally expressing the P-PII gene from said constructs. The invention also provides substantially pure P-PII proteins. The P-PII nucleotide sequences and constructs of the

  18. Synthesis and evaluations of an acid-cleavable, fluorescently labeled nucleotide as a reversible terminator for DNA sequencing.

    PubMed

    Tan, Lianjiang; Liu, Yazhi; Li, Xiaowei; Wu, Xin-Yan; Gong, Bing; Shen, Yu-Mei; Shao, Zhifeng

    2016-02-11

    An acid-cleavable linker based on a dimethylketal moiety was synthesized and used to connect a nucleotide with a fluorophore to produce a 3'-OH unblocked nucleotide analogue as an excellent reversible terminator for DNA sequencing by synthesis.

  19. Nucleotide sequence of the Varkud mitochondrial plasmid of Neurospora and synthesis of a hybrid transcript with a 5' leader derived from mitochondrial RNA.

    PubMed

    Akins, R A; Grant, D M; Stohl, L L; Bottorff, D A; Nargang, F E; Lambowitz, A M

    1988-11-05

    The Mauriceville and Varkud mitochondrial plasmids of Neurospora are closely related, closed circular DNAs (3.6 and 3.7 kb, respectively; 1 kb = 10(3) bases or base-pairs), whose characteristics suggest relationships to mitochondrial DNA introns and retrotransposons. Here, we characterized the structure of the Varkud plasmid, determined its complete nucleotide sequence and mapped its major transcripts. The Mauriceville and Varkud plasmids have more than 97% positional identity. Both plasmids contain a 710 amino acid open reading frame that encodes a reverse transcriptase-like protein. The amino acid sequence of this open reading frame is strongly conserved between the two plasmids (701/710 amino acids) as expected for a functionally important protein. Both plasmids have a 0.4 kb region that contains five PstI palindromes and a direct repeat of approximately 160 base-pairs. Comparison of sequences in this region suggests that the Varkud plasmid has diverged less from a common ancestor than has the Mauriceville plasmid. Two major transcripts of the Varkud plasmid were detected by Northern hybridization experiments: a full-length linear RNA of 3.7 kb and an additional prominent transcript of 4.9 kb, 1.2 kb longer than monomer plasmid. Remarkably, we find that the 4.9 kb transcript is a hybrid RNA consisting of the full-length 3.7 kb Varkud plasmid transcript plus a 5' leader of 1.2 kb that is derived from the 5' end of the mitochondrial small rRNA. This and other findings suggest that the Varkud plasmid, like certain RNA viruses, has a mechanism for joining heterologous RNAs to the 5' end of its major transcript, and that, under some circumstances, nucleotide sequences in mitochondria may be recombined at the RNA level.

  20. Identification of novel microRNAs in Hevea brasiliensis and computational prediction of their targets

    PubMed Central

    2012-01-01

    Background: Plants respond to external stimuli through fine regulation of gene expression partially ensured by small RNAs. Of these, microRNAs (miRNAs) play a crucial role. They negatively regulate gene expression by targeting the cleavage or translational inhibition of target messenger RNAs (mRNAs). In Hevea brasiliensis, environmental and harvesting stresses are known to affect natural rubber production. This study set out to identify abiotic stress-related miRNAs in Hevea using next-generation sequencing and bioinformatic analysis. Results: Deep sequencing of small RNAs was carried out on plantlets subjected to severe abiotic stress using the Solexa technique. By combining the LeARN pipeline, data from the Plant microRNA database (PMRD) and Hevea EST sequences, we identified 48 conserved miRNA families already characterized in other plant species, and 10 putatively novel miRNA families. The results showed the most abundant size for miRNAs to be 24 nucleotides, except for seven families. Several MIR genes produced both 20-22 nucleotides and 23-27 nucleotides. The two miRNA class sizes were detected for both conserved and putative novel miRNA families, suggesting their functional duality. The EST databases were scanned with conserved and novel miRNA sequences. MiRNA targets were computationally predicted and analysed. The predicted targets involved in "responses to stimuli" and to "antioxidant" and "transcription activities" are presented. Conclusions: Deep sequencing of small RNAs combined with transcriptomic data is a powerful tool for identifying conserved and novel miRNAs when the complete genome is not yet available. Our study provided additional information for evolutionary studies and revealed potentially specific regulation of the control of redox status in Hevea. PMID:22330773

  1. Real-time single-molecule electronic DNA sequencing by synthesis using polymer-tagged nucleotides on a nanopore array

    PubMed Central

    Fuller, Carl W.; Kumar, Shiv; Porel, Mintu; Chien, Minchen; Bibillo, Arek; Stranges, P. Benjamin; Dorwart, Michael; Tao, Chuanjuan; Li, Zengmin; Guo, Wenjing; Shi, Shundi; Korenblum, Daniel; Trans, Andrew; Aguirre, Anne; Liu, Edward; Harada, Eric T.; Pollard, James; Bhat, Ashwini; Cech, Cynthia; Yang, Alexander; Arnold, Cleoma; Palla, Mirkó; Hovis, Jennifer; Chen, Roger; Morozova, Irina; Kalachikov, Sergey; Russo, James J.; Kasianowicz, John J.; Davis, Randy; Roever, Stefan; Church, George M.; Ju, Jingyue

    2016-01-01

    DNA sequencing by synthesis (SBS) offers a robust platform to decipher nucleic acid sequences. Recently, we reported a single-molecule nanopore-based SBS strategy that accurately distinguishes four bases by electronically detecting and differentiating four different polymer tags attached to the 5′-phosphate of the nucleotides during their incorporation into a growing DNA strand catalyzed by DNA polymerase. Further developing this approach, we report here the use of nucleotides tagged at the terminal phosphate with oligonucleotide-based polymers to perform nanopore SBS on an α-hemolysin nanopore array platform. We designed and synthesized several polymer-tagged nucleotides using tags that produce different electrical current blockade levels and verified they are active substrates for DNA polymerase. A highly processive DNA polymerase was conjugated to the nanopore, and the conjugates were complexed with primer/template DNA and inserted into lipid bilayers over individually addressable electrodes of the nanopore chip. When an incoming complementary-tagged nucleotide forms a tight ternary complex with the primer/template and polymerase, the tag enters the pore, and the current blockade level is measured. The levels displayed by the four nucleotides tagged with four different polymers captured in the nanopore in such ternary complexes were clearly distinguishable and sequence-specific, enabling continuous sequence determination during the polymerase reaction. Thus, real-time single-molecule electronic DNA sequencing data with single-base resolution were obtained. The use of these polymer-tagged nucleotides, combined with polymerase tethering to nanopores and multiplexed nanopore sensors, should lead to new high-throughput sequencing methods. PMID:27091962

  2. Molecular Cloning and Sequencing of Hemoglobin-Beta Gene of Channel Catfish, Ictalurus Punctatus Rafinesque

    USDA-ARS's Scientific Manuscript database

    The hemoglobin-beta gene of channel catfish, Ictalurus punctatus, was cloned and sequenced. Total RNA from head kidneys was isolated, reverse transcribed and amplified. The sequence of the channel catfish hemoglobin-beta gene consists of 600 nucleotides. Analysis of the nucleotide sequence reveals one o...

  3. Complete nucleotide sequence and organization of the mitogenome of the silk moth Caligula boisduvalii (Lepidoptera: Saturniidae) and comparison with other lepidopteran insects.

    PubMed

    Hong, Mee Yeon; Lee, Eun Mee; Jo, Yong Hun; Park, Hae Chul; Kim, Seong Ryul; Hwang, Jae Sam; Jin, Byung Rae; Kang, Pil Don; Kim, Ki-Gyoung; Han, Yeon Soo; Kim, Iksoo

    2008-04-30

    The 15,360-bp long complete mitogenome of Caligula boisduvalii possesses a gene arrangement and content identical to other completely sequenced lepidopteran mitogenomes, but different from the common arrangement found in most insect orders, as the result of the movement of tRNA(Met) to a position 5'-upstream of tRNA(Ile). The 330-bp A+T-rich region is apparently capable of forming a stem-and-loop structure, which harbors the conserved flanking sequences at both ends. Dissimilar to what has been seen in other sequenced lepidopteran insects, the initiation codon for C. boisduvalii COI appears to be TTG, which is a rare, but apparently possible initiation codon. The ATP8, ATP6, ND4L, and ND6 genes, which neighbor another PCG at their 3' end, all harbored potential sequences for the formation of a hairpin structure. This is suggestive of the importance of such structures for the precise cleavage of the mRNA of mature PCGs. Phylogenetic analyses of available sequenced species of Bombycoidea, Pyraloidea, and Tortricoidea supported the morphology-based current hypothesis that Bombycoidea and Pyraloidea are monophyletic (Obtectomera). As previously suggested, Bombycidae (Bombyx mori and B. mandarina) and Saturniidae (Antheraea pernyi and C. boisduvalii) formed a reciprocal monophyletic group.

  4. Novel methodologies for spectral classification of exon and intron sequences

    NASA Astrophysics Data System (ADS)

    Kwan, Hon Keung; Kwan, Benjamin Y. M.; Kwan, Jennifer Y. Y.

    2012-12-01

    Digital processing of a nucleotide sequence requires it to be mapped to a numerical sequence in which the choice of nucleotide to numeric mapping affects how well its biological properties can be preserved and reflected from nucleotide domain to numerical domain. Digital spectral analysis of nucleotide sequences unfolds a period-3 power spectral value which is more prominent in an exon sequence as compared to that of an intron sequence. The success of a period-3 based exon and intron classification depends on the choice of a threshold value. The main purposes of this article are to introduce novel codes for 1-sequence numerical representations for spectral analysis and compare them to existing codes to determine appropriate representation, and to introduce novel thresholding methods for more accurate period-3 based exon and intron classification of an unknown sequence. The main findings of this study are summarized as follows: Among sixteen 1-sequence numerical representations, the K-Quaternary Code I offers an attractive performance. A windowed 1-sequence numerical representation (with window length of 9, 15, and 24 bases) offers a possible speed gain over non-windowed 4-sequence Voss representation which increases as sequence length increases. A winner threshold value (chosen from the best among two defined threshold values and one other threshold value) offers a top precision for classifying an unknown sequence of specified fixed lengths. An interpolated winner threshold value applicable to an unknown and arbitrary length sequence can be estimated from the winner threshold values of fixed length sequences with a comparable performance. In general, precision increases as sequence length increases. The study contributes an effective spectral analysis of nucleotide sequences to better reveal embedded properties, and has potential applications in improved genome annotation.
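
    The period-3 measure described in this record can be sketched as follows. This is a minimal illustration using the 4-sequence Voss (indicator) representation named in the abstract, not the authors' 1-sequence codes or their thresholding methods. The function names and the `threshold` parameter are hypothetical, and any threshold value would have to be chosen from data.

```python
import cmath


def period3_power(seq: str) -> float:
    """Total spectral power at period 3 using Voss indicator sequences.

    For each base, an indicator sequence marks the positions where that base
    occurs. The transform of each indicator is evaluated directly at the
    frequency 1/3 and the squared magnitudes are summed over A, C, G and T.
    """
    seq = seq.upper()
    total = 0.0
    for base in "ACGT":
        coeff = sum(
            cmath.exp(-2j * cmath.pi * i / 3)
            for i, c in enumerate(seq)
            if c == base
        )
        total += abs(coeff) ** 2
    return total


def looks_like_exon(seq: str, threshold: float) -> bool:
    """Hypothetical classifier: length-normalised period-3 power vs. a threshold."""
    return period3_power(seq) / len(seq) >= threshold


# A perfectly codon-periodic toy sequence has strong period-3 power,
# while a homopolymer of the same length has essentially none.
periodic = "ATG" * 30
flat = "A" * 90
```

    Evaluating the transform only at frequency 1/3 avoids computing a full spectrum, which is enough when the period-3 peak is the only quantity being compared against a threshold.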

  5. A novel rhabdovirus, related to Merida virus, in field-collected mosquitoes from Anatolia and Thrace.

    PubMed

    Ergünay, Koray; Brinkmann, Annika; Litzba, Nadine; Günay, Filiz; Kar, Sırrı; Öter, Kerem; Örsten, Serra; Sarıkaya, Yasemen; Alten, Bülent; Nitsche, Andreas; Linton, Yvonne-Marie

    2017-07-01

    Next-generation sequencing technologies have significantly facilitated the discovery of novel viruses, and metagenomic surveillance of arthropods has enabled exploration of the diversity of novel or known viral agents. We have identified a novel rhabdovirus that is genetically related to the recently described Merida virus via next-generation sequencing in a mosquito pool from Thrace. The complete viral genome contains 11,798 nucleotides with 83% genome-wide nucleotide sequence similarity to Merida virus. Five major putative open reading frames that follow the canonical rhabdovirus genome organization were identified. A total of 1380 mosquitoes comprising 13 species, collected from Thrace and the Mediterranean and Aegean regions of Anatolia were screened for the novel virus using primers based on the N and L genes of the prototype genome. Eight positive pools (6.2%) exclusively comprised Culex pipiens sensu lato specimens originating from all study regions. Infections were observed in pools with female as well as male or mixed-sex individuals. The overall and Cx. pipiens-specific minimal infection rates were calculated to be 5.7 and 14.8, respectively. Sequencing of the PCR products revealed marked diversity within a portion of the N gene, with up to 4% divergence and distinct amino acid substitutions that were unrelated to the collection site. Phylogenetic analysis of the complete and partial viral polymerase (L gene) amino acid sequences placed the novel virus and Merida virus in a distinct group, indicating that these strains are closely related. The strain is tentatively named "Merida-like virus Turkey". Studies are underway to isolate and further explore the host range and distribution of this new strain.

  6. Complete genome sequences of avian paramyxovirus type 8 strains goose/Delaware/1053/76 and pintail/Wakuya/20/78

    PubMed Central

    Paldurai, Anandan; Subbiah, Madhuri; Kumar, Sachin; Collins, Peter L.; Samal, Siba K.

    2009-01-01

    Complete consensus genome sequences were determined for avian paramyxovirus type 8 (APMV-8) strains goose/Delaware/1053/76 (prototype strain) and pintail/Wakuya/20/78. The genome of each strain is 15,342 nucleotides (nt) long, which follows the “rule of six”. The genome consists of six genes in the order of 3′-N-P/V/W-M-F-HN-L-5′. The genes are flanked on either side by conserved transcription start and stop signals, and have intergenic regions ranging from 1 to 30 nt. The genome contains a 55 nt leader region at the 3′-end and a 171 nt trailer region at the 5′-end. Comparison of sequences of strains Delaware and Wakuya showed nucleotide identity of 96.8% at the genome level and amino acid identities of 99.3%, 96.5%, 98.6%, 99.4%, 98.6% and 99.1% for the predicted N, P, M, F, HN and L proteins, respectively. Both strains grew in embryonated chicken eggs and in primary chicken embryo kidney cells, and 293T cells. Both strains contained only a single basic residue at the cleavage activation site of the F protein and their efficiency of replication in vitro depended on and was augmented by, the presence of exogenous protease in most cell lines. Sequence alignment and phylogenic analysis of the predicted amino acid sequence of APMV-8 strain Delaware proteins with the cognate proteins of other available APMV serotypes showed that APMV-8 is more closely related to APMV-2 and -6 than to APMV-1, -3 and -4. PMID:19341613
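
    The "rule of six" mentioned in this record is the requirement that the genome length be an exact multiple of six nucleotides. A minimal check, with a hypothetical function name, might look like this:

```python
def follows_rule_of_six(genome_length_nt: int) -> bool:
    """Return True when the genome length is an exact multiple of six."""
    return genome_length_nt % 6 == 0


# The APMV-8 genome length reported above: 15,342 nt = 6 * 2,557.
apmv8_length = 15342
apmv8_ok = follows_rule_of_six(apmv8_length)
```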

  7. Molecular cloning, sequence characterization and recombinant expression of Nanog gene in goat fibroblast cells using lentiviral based expression system.

    PubMed

    Singhal, Dinesh K; Singhal, Raxita; Malik, Hruda N; Kumar, Surender; Kumar, Sudarshan; Mohanty, Ashok K; Kaushik, Jai K; Malakar, Dhruba

    2014-01-01

    Nanog is a homeodomain-containing protein which plays important roles in regulation of signaling pathways for maintenance and induction of pluripotency in stem cells. Because of its unique expression in stem cells it is also regarded as a pluripotency marker. In this study the goat Nanog (gNanog) gene has been amplified, cloned and characterized at sequence level with successful over-expression in the CHO-K1 cell line using a lentiviral based system. The gNanog ORF is 903 bp long and codes for a Nanog protein of 300 amino acids (aas). The complete nucleotide sequence shows some evolutionary mutation in goat in comparison to other species. The protein sequence of goat is highly similar to other species. Overall, gNanog nucleotide sequence and predicted protein sequence showed high similarity and minimum divergence with cattle (96 % identity/4 % divergence) and buffalo (94/5 %) while low similarity and high divergence with pig (84/15 %), human (81/23 %) and mouse (69/40 %), indicating evolutionary closeness of gNanog to cattle and buffalo. A gNanog lentiviral expression construct was prepared for over-expression of the Nanog gene in adult goat fibroblast cells. The lentiviral expression construct of Nanog enabled continuous protein expression for induction and maintenance of pluripotency. Western blotting revealed the expression of the Nanog gene at protein level, which supported that the lentiviral expression system is highly promising for Nanog protein expression in differentiated goat cells.
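
    The ORF and protein lengths in this record are consistent if the 903 bp figure includes the stop codon, which is an assumption rather than something the abstract states. A small sketch of that arithmetic, with a hypothetical helper name:

```python
def protein_length_from_orf(orf_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an ORF of the given length in bp.

    Assumes the ORF length is a whole number of codons. When the stop codon
    is counted in the ORF length, it is subtracted because it encodes no
    amino acid.
    """
    if orf_bp % 3 != 0:
        raise ValueError("ORF length must be a multiple of 3")
    codons = orf_bp // 3
    return codons - 1 if includes_stop else codons


# 903 bp -> 301 codons -> 300 amino acids plus one stop codon.
gnanog_aa = protein_length_from_orf(903)
```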

  8. Molecular identification and partial sequence analysis of an aryl hydrocarbon receptor from beluga (Delphinapterus leucas)

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Jensen, B.A.; Hahn, M.E.

    1995-12-31

    The aryl hydrocarbon receptor (AhR) mediates the effects of many common and potentially toxic organic hydrocarbons, including some polychlorinated biphenyls and dioxins. Since small cetaceans often inhabit industrially polluted coastal waters, comparison of the molecular structure and function of this protein in cetaceans with other marine and mammalian species is important for evaluating the sensitivity of cetaceans to these pollutants. An AhR protein has been identified in beluga liver by photoaffinity labeling. In the present study, the authors sought to clone and sequence an AhR cDNA from beluga as a prelude to studying its structure and function. Using reverse-transcription polymerase chain reaction (RT-PCR) and degenerate primers, a 515 base pair fragment was amplified, cloned and sequenced, revealing homology to the PAS domain (ligand binding and dimerization region) of AhRs from terrestrial mammals. This portion of the putative beluga AhR has 82% amino acid and 81% nucleotide sequence identity to the mouse AhR, and 63% amino acid and 64% nucleotide sequence identity to an AhR from the marine fish Fundulus heteroclitus. A beluga cDNA library was synthesized and is currently being screened with the PCR-generated fragment to obtain the complete coding sequence. This is the first molecular evidence of AhR presence in cetaceans.

  9. Method for high-volume sequencing of nucleic acids: random and directed priming with libraries of oligonucleotides

    DOEpatents

    Studier, F. William

    1995-04-18

    Random and directed priming methods for determining nucleotide sequences by enzymatic sequencing techniques, using libraries of primers of lengths 8, 9 or 10 bases, are disclosed. These methods permit direct sequencing of nucleic acids as large as 45,000 base pairs or larger without the necessity for subcloning. Individual primers are used repeatedly to prime sequence reactions in many different nucleic acid molecules. Libraries containing as few as 10,000 octamers, 14,200 nonamers, or 44,000 decamers would have the capacity to determine the sequence of almost any cosmid DNA. Random priming with a fixed set of primers from a smaller library can also be used to initiate the sequencing of individual nucleic acid molecules, with the sequence being completed by directed priming with primers from the library. In contrast to random cloning techniques, a combined random and directed priming strategy is far more efficient.
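
    To put the library sizes quoted above in context, the sketch below compares each one with the total number of possible primers of that length (4 to the power k for k bases). The function names are hypothetical, and the calculation says nothing about how the patent selects which primers to include.

```python
def kmer_space(k: int) -> int:
    """Number of distinct DNA oligonucleotides of length k."""
    return 4 ** k


def library_fraction(library_size: int, k: int) -> float:
    """Fraction of all possible k-mers represented by a library of the given size."""
    return library_size / kmer_space(k)


# Library sizes quoted in the abstract, keyed by primer length.
quoted_libraries = {8: 10_000, 9: 14_200, 10: 44_000}
fractions = {k: library_fraction(n, k) for k, n in quoted_libraries.items()}
```

    Under this count there are 65,536 possible octamers, 262,144 nonamers and 1,048,576 decamers, so the quoted libraries cover roughly 15%, 5% and 4% of the respective primer spaces.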

  10. Method for high-volume sequencing of nucleic acids: random and directed priming with libraries of oligonucleotides

    DOEpatents

    Studier, F.W.

    1995-04-18

    Random and directed priming methods for determining nucleotide sequences by enzymatic sequencing techniques, using libraries of primers of lengths 8, 9 or 10 bases, are disclosed. These methods permit direct sequencing of nucleic acids as large as 45,000 base pairs or larger without the necessity for subcloning. Individual primers are used repeatedly to prime sequence reactions in many different nucleic acid molecules. Libraries containing as few as 10,000 octamers, 14,200 nonamers, or 44,000 decamers would have the capacity to determine the sequence of almost any cosmid DNA. Random priming with a fixed set of primers from a smaller library can also be used to initiate the sequencing of individual nucleic acid molecules, with the sequence being completed by directed priming with primers from the library. In contrast to random cloning techniques, a combined random and directed priming strategy is far more efficient. 2 figs.

  11. Statistical analysis of nucleotide sequences of the hemagglutinin gene of human influenza A viruses.

    PubMed Central

    Ina, Y; Gojobori, T

    1994-01-01

    To examine whether positive selection operates on the hemagglutinin 1 (HA1) gene of human influenza A viruses (H1 subtype), 21 nucleotide sequences of the HA1 gene were statistically analyzed. The nucleotide sequences were divided into antigenic and nonantigenic sites. The nucleotide diversities for antigenic and nonantigenic sites of the HA1 gene were computed at synonymous and nonsynonymous sites separately. For nonantigenic sites, the nucleotide diversities were larger at synonymous sites than at nonsynonymous sites. This is consistent with the neutral theory of molecular evolution. For antigenic sites, however, the nucleotide diversities at nonsynonymous sites were larger than those at synonymous sites. These results suggest that positive selection operates on antigenic sites of the HA1 gene of human influenza A viruses (H1 subtype). PMID:8078892
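
    Nucleotide diversity, as used in this record, is commonly computed as the average number of differences per site over all pairs of aligned sequences. The sketch below implements only that basic definition on a chosen subset of alignment columns (for example, antigenic versus nonantigenic positions). It does not separate synonymous from nonsynonymous sites, which needs a codon-based method, and the toy alignment and function name are illustrative assumptions.

```python
from itertools import combinations
from typing import Iterable, Optional, Sequence


def nucleotide_diversity(
    seqs: Sequence[str], sites: Optional[Iterable[int]] = None
) -> float:
    """Average pairwise differences per site across aligned sequences.

    `sites` optionally restricts the calculation to specific 0-based
    alignment columns; by default all columns are used.
    """
    length = len(seqs[0])
    if any(len(s) != length for s in seqs):
        raise ValueError("sequences must be aligned to the same length")
    columns = list(range(length)) if sites is None else list(sites)
    pairs = list(combinations(seqs, 2))
    if not pairs or not columns:
        raise ValueError("need at least two sequences and one site")
    differences = sum(
        sum(1 for i in columns if a[i] != b[i]) for a, b in pairs
    )
    return differences / (len(pairs) * len(columns))


# Toy alignment (hypothetical data): three sequences, four sites.
toy = ["ACGT", "ACGA", "ACTT"]
overall = nucleotide_diversity(toy)             # all four columns
conserved = nucleotide_diversity(toy, [0, 1])   # first two columns only
variable = nucleotide_diversity(toy, [2, 3])    # last two columns only
```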

  12. 40 CFR 174.3 - Definitions.

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ..., flowers, and pollen. Noncoding, nonexpressed nucleotide sequences means the nucleotide sequences are not... surgical alteration of the plant pistil, bud pollination, mentor pollen, immunosuppressants, in vitro...

  13. 40 CFR 174.3 - Definitions.

    Code of Federal Regulations, 2012 CFR

    2012-07-01

    ..., flowers, and pollen. Noncoding, nonexpressed nucleotide sequences means the nucleotide sequences are not... surgical alteration of the plant pistil, bud pollination, mentor pollen, immunosuppressants, in vitro...

  14. 40 CFR 174.3 - Definitions.

    Code of Federal Regulations, 2013 CFR

    2013-07-01

    ..., flowers, and pollen. Noncoding, nonexpressed nucleotide sequences means the nucleotide sequences are not... surgical alteration of the plant pistil, bud pollination, mentor pollen, immunosuppressants, in vitro...

  15. 40 CFR 174.3 - Definitions.

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ..., flowers, and pollen. Noncoding, nonexpressed nucleotide sequences means the nucleotide sequences are not... surgical alteration of the plant pistil, bud pollination, mentor pollen, immunosuppressants, in vitro...

  16. 40 CFR 174.3 - Definitions.

    Code of Federal Regulations, 2014 CFR

    2014-07-01

    ..., flowers, and pollen. Noncoding, nonexpressed nucleotide sequences means the nucleotide sequences are not... surgical alteration of the plant pistil, bud pollination, mentor pollen, immunosuppressants, in vitro...

  17. Complete Genome Sequence of a Novel Newcastle Disease Virus Strain Isolated from a Chicken in West Africa

    PubMed Central

    Kim, Shin-Hee; Nayak, Subhashree; Paldurai, Anandan; Nayak, Baibaswata; Samuel, Arthur; Aplogan, Gilbert L.; Awoume, Kodzo A.; Webby, Richard J.; Ducatez, Mariette F.; Collins, Peter L.

    2012-01-01

    The complete genome sequence of an African Newcastle disease virus (NDV) strain isolated from a chicken in Togo in 2009 was determined. The genome is 15,198 nucleotides (nt) in length and is classified in genotype VII in the class II cluster. Compared to common vaccine strains, the African strain contains a previously described 6-nt insert in the downstream untranslated region of the N gene and a novel 6-nt insert in the HN-L intergenic region. Genome length differences are a marker of the natural history of NDV. This is the first description of a class II NDV strain with a genome of 15,198 nt and a 6-nt insert in the HN-L intergenic region. Sequence divergence relative to vaccine strains was substantial, likely contributes to outbreaks, and illustrates the continued evolution of new NDV strains in West Africa. PMID:22997417

  18. Genomic characterization of two new enterovirus types, EV-A114 and EV-A121.

    PubMed

    Deshpande, Jagadish M; Sharma, Deepa K; Saxena, Vinay K; Shetty, Sushmitha A; Qureshi, Tarique Husain I H; Nalavade, Uma P

    2016-12-01

    Enteroviruses cause a variety of illnesses of the gastrointestinal tract, central nervous system and cardiovascular system. Phylogenetic analysis of VP1 sequences has identified 106 different human enteroviruses classified into four enterovirus species within the genus Enterovirus of the family Picornaviridae. It is likely that not all enterovirus types have been discovered. Between September 2013 and October 2014, stool samples of 6274 apparently healthy children of up to 5 years of age residing in Gorakhpur district, Uttar Pradesh, India were screened for enteroviruses. Virus isolates obtained in RD and Hep-2c cells were identified by complete VP1 sequencing. Enteroviruses were isolated from 3042 samples. A total of 87 different enterovirus types were identified. Two isolates with 71 and 74 % nucleotide sequence similarity to all other known enteroviruses were recognized as novel types. In this paper we report identification and complete genome sequence analysis of these two isolates classified as EV-A114 and EV-A121.

  19. Complete genome sequence of the first human parechovirus type 3 isolated in Taiwan.

    PubMed

    Chang, Jenn-Tzong; Yang, Chih-Shiang; Chen, Bao-Chen; Chen, Yao-Shen; Chang, Tsung-Hsien

    2017-11-01

    The first human parechovirus 3 (HPeV3 VGHKS-2007) in Taiwan was identified from a clinical specimen from a male infant. The entire genome of the HPeV3 isolate was sequenced and compared to known HPeV3 sequences. Genome alignment data showed that HPeV3 VGHKS-2007 shares the highest nucleotide identity, 99%, with the Japanese strain of HPeV3 1361K-162589-Yamagata-2008. All HPeV3 isolates possess at least 97% amino acid identity. The analysis of the genome sequence of HPeV3 VGHKS-2007 will facilitate future investigations of the epidemiology and pathogenicity of HPeV3 infection. Copyright © 2017. Published by Elsevier Taiwan LLC.

  20. Sequence characterization of 5S ribosomal RNA from eight gram positive procaryotes

    NASA Technical Reports Server (NTRS)

    Woese, C. R.; Luehrsen, K. R.; Pribula, C. D.; Fox, G. E.

    1976-01-01

    Complete nucleotide sequences are presented for 5S rRNA from Bacillus subtilis, B. firmus, B. pasteurii, B. brevis, Lactobacillus brevis, and Streptococcus faecalis, and 5S rRNA oligonucleotide catalogs and partial sequence data are given for B. cereus and Sporosarcina ureae. These data demonstrate a striking consistency of 5S rRNA primary and secondary structure within a given bacterial grouping. An exception is B. brevis, in which the 5S rRNA sequence varies significantly from that of other bacilli in the tuned helix and the procaryotic loop. The localization of these variations suggests that B. brevis occupies an ecological niche that selects such changes. It is noted that this organism produces antibiotics which affect ribosome function.

  1. Numerical classification of coding sequences

    NASA Technical Reports Server (NTRS)

    Collins, D. W.; Liu, C. C.; Jukes, T. H.

    1992-01-01

    DNA sequences coding for protein may be represented by counts of nucleotides or codons. A complete reading frame may be abbreviated by its base count, e.g. A76C158G121T74, or with the corresponding codon table, e.g. (AAA)0(AAC)1(AAG)9 ... (TTT)0. We propose that these numerical designations be used to augment current methods of sequence annotation. Because base counts and codon tables do not require revision as knowledge of function evolves, they are well-suited to act as cross-references, for example to identify redundant GenBank entries. These descriptors may be compared, in place of DNA sequences, to extract homologous genes from large databases. This approach permits rapid searching with good selectivity.
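
    The two descriptors named in this abstract can be sketched in a few lines of Python. This is a minimal illustration, not the authors' program: the function names and the sample sequence are hypothetical, and the codon ordering (alphabetical over A, C, G, T) is an assumption inferred from the "(AAA)0(AAC)1(AAG)9 ... (TTT)0" example.

```python
from collections import Counter
from itertools import product


def base_count_label(seq: str) -> str:
    """Return a base-count label such as 'A76C158G121T74'."""
    counts = Counter(seq.upper())
    return "".join(f"{base}{counts[base]}" for base in "ACGT")


def codon_table_label(seq: str) -> str:
    """Return a codon-count label covering all 64 codons, AAA through TTT."""
    seq = seq.upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    counts = Counter(seq[i:i + 3] for i in range(0, usable, 3))
    codons = ("".join(c) for c in product("ACGT", repeat=3))
    return "".join(f"({codon}){counts[codon]}" for codon in codons)


if __name__ == "__main__":
    frame = "ATGAAACCCGGGTAA"  # hypothetical reading frame
    print(base_count_label(frame))
    print(codon_table_label(frame)[:24])
```

    Because both labels depend only on the sequence itself, two database entries carrying the same reading frame produce identical labels, which is the property the abstract proposes to use for finding redundant entries.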

  2. The nucleotide sequences of 5S rRNAs from a rotifer, Brachionus plicatilis, and two nematodes, Rhabditis tokai and Caenorhabditis elegans.

    PubMed

    Kumazaki, T; Hori, H; Osawa, S; Ishii, N; Suzuki, K

    1982-11-11

    The nucleotide sequences of 5S rRNAs from a rotifer, Brachionus plicatilis, and two nematodes, Rhabditis tokai and Caenorhabditis elegans, have been determined. The rotifer has two 5S rRNA species that are composed of 120 and 121 nucleotides, respectively. The sequences of these two 5S rRNAs are the same except that the latter has an additional base at its 3'-terminus. The 5S rRNAs from the two nematode species are both 119 nucleotides long. The sequence similarities among these three species are 79% (Brachionus/Rhabditis), 80% (Brachionus/Caenorhabditis), and 95% (Rhabditis/Caenorhabditis). Brachionus revealed the highest similarity to Lingula (89%), but not to the nematodes (79%).
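
    Pairwise similarity figures of this kind are commonly computed as percent identity over an alignment. The sketch below shows one such calculation; the abstract does not state how the authors scored similarity, so the gap handling here (columns where both sequences have a gap are skipped, other gap columns count as mismatches) is an assumption, and the function name and inputs are hypothetical.

```python
def percent_identity(a: str, b: str, gap: str = "-") -> float:
    """Percent of aligned columns in which two pre-aligned sequences agree."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    # Drop columns where both sequences carry a gap.
    columns = [(x, y) for x, y in zip(a.upper(), b.upper())
               if not (x == gap and y == gap)]
    if not columns:
        return 0.0
    matches = sum(1 for x, y in columns if x == y)
    return 100.0 * matches / len(columns)


if __name__ == "__main__":
    # Hypothetical aligned fragments, not taken from the paper.
    print(round(percent_identity("ACGU-A", "ACGUUA"), 1))
```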

  3. From genomics to functional markers in the era of next-generation sequencing.

    PubMed

    Salgotra, R K; Gupta, B B; Stewart, C N

    2014-03-01

    The availability of complete genome sequences, along with other genomic resources for Arabidopsis, rice, pigeon pea, soybean and other crops, has revolutionized our understanding of the genetic make-up of plants. Next-generation DNA sequencing (NGS) has facilitated single nucleotide polymorphism discovery in plants. Functionally characterized sequences can be identified and functional markers (FMs) for important traits can be developed with ever-increasing ease. FMs are derived from sequence polymorphisms found in allelic variants of a functional gene. Linkage disequilibrium-based association mapping and homologous recombinants have been developed for identification of "perfect" markers for their use in crop improvement practices. Compared with many other molecular markers, FMs derived from functionally characterized genes using NGS techniques provide opportunities to develop high-yielding plant genotypes resistant to various stresses at a fast pace.

  4. Comparative Genetic Analyses of Human Rhinovirus C (HRV-C) Complete Genome from Malaysia.

    PubMed

    Khaw, Yam Sim; Chan, Yoke Fun; Jafar, Faizatul Lela; Othman, Norlijah; Chee, Hui Yee

    2016-01-01

    Human rhinovirus-C (HRV-C) has been implicated in more severe illnesses than HRV-A and HRV-B; however, the limited number of HRV-C complete genomes (complete 5' and 3' non-coding region and open reading frame sequences) has hindered the in-depth genetic study of this virus. This study aimed to sequence seven complete HRV-C genomes from Malaysia and compare their genetic characteristics with the 18 published HRV-Cs. Seven Malaysian HRV-C complete genomes were obtained with newly redesigned primers. The seven genomes were classified as HRV-C6, C12, C22, C23, C26, C42, and pat16 based on the VP4/VP2 and VP1 pairwise distance threshold classification. Five of the seven Malaysian isolates, namely, 3430-MY-10/C22, 8713-MY-10/C23, 8097-MY-11/C26, 1570-MY-10/C42, and 7383-MY-10/pat16 are the first newly sequenced complete HRV-C genomes. The genomes of all seven Malaysian isolates displayed nucleotide similarity of 63-81% among themselves and 63-96% with other HRV-Cs. Malaysian HRV-Cs had similar putative immunogenic sites, putative receptor utilization and potential antiviral sites as other HRV-Cs. The genomic features of Malaysian isolates were similar to those of other HRV-Cs. Negative selections were frequently detected in HRV-C complete coding sequences, indicating that these sequences were under functional constraint. The present study showed that HRV-Cs from Malaysia have diverse genetic sequences but share conserved genomic features with other HRV-Cs. This genetic information could provide further aid in the understanding of HRV-C infection.

  5. Comparative Genetic Analyses of Human Rhinovirus C (HRV-C) Complete Genome from Malaysia

    PubMed Central

    Khaw, Yam Sim; Chan, Yoke Fun; Jafar, Faizatul Lela; Othman, Norlijah; Chee, Hui Yee

    2016-01-01

    Human rhinovirus-C (HRV-C) has been implicated in more severe illnesses than HRV-A and HRV-B; however, the limited number of HRV-C complete genomes (complete 5′ and 3′ non-coding region and open reading frame sequences) has hindered the in-depth genetic study of this virus. This study aimed to sequence seven complete HRV-C genomes from Malaysia and compare their genetic characteristics with the 18 published HRV-Cs. Seven Malaysian HRV-C complete genomes were obtained with newly redesigned primers. The seven genomes were classified as HRV-C6, C12, C22, C23, C26, C42, and pat16 based on the VP4/VP2 and VP1 pairwise distance threshold classification. Five of the seven Malaysian isolates, namely, 3430-MY-10/C22, 8713-MY-10/C23, 8097-MY-11/C26, 1570-MY-10/C42, and 7383-MY-10/pat16 are the first newly sequenced complete HRV-C genomes. The genomes of all seven Malaysian isolates displayed nucleotide similarity of 63–81% among themselves and 63–96% with other HRV-Cs. Malaysian HRV-Cs had similar putative immunogenic sites, putative receptor utilization and potential antiviral sites as other HRV-Cs. The genomic features of Malaysian isolates were similar to those of other HRV-Cs. Negative selections were frequently detected in HRV-C complete coding sequences, indicating that these sequences were under functional constraint. The present study showed that HRV-Cs from Malaysia have diverse genetic sequences but share conserved genomic features with other HRV-Cs. This genetic information could provide further aid in the understanding of HRV-C infection. PMID:27199901

  6. A Raspberry bushy dwarf virus isolate from Ecuadorean Rubus glaucus contains an additional RNA that is a rearrangement of RNA 2

    USDA-ARS's Scientific Manuscript database

    A new Raspberry bushy dwarf virus isolate was found in commercial blackberry (Rubus glaucus) in Azuay, province of Ecuador and named RBDV-Ec-Az. The complete bipartite genome was sequenced using dsRNA as initial template. RNA 1 was 5449 nucleotides (nt) long and the normal RBDV RNA 2 was 2231 nt lon...

  7. Characterization of the Triticum Mosaic Virus Genome and Interactions between Triticum Mosaic Virus and Wheat Streak Mosaic Virus

    USDA-ARS's Scientific Manuscript database

    The complete genome sequence of Triticum mosaic virus (TriMV) has been determined to be 10,266 nucleotides encoding a large polyprotein of 3,112 amino acids. The proteins of TriMV possess only 33-44% (with NIb protein) and 15-29% (with P1 protein) amino acid identity with the reported members of Pot...

  8. Complete Genome Sequence of an Avian Paramyxovirus Type 4 from North America Reveals a Shorter Genome and New Genotype

    PubMed Central

    Parthiban, Manoharan; Kaliyaperumal, Manimaran; Xiao, Sa; Nayak, Baibaswata; Paldurai, Anandan; Kim, Shin-Hee; Ladman, Brian S.; Preskenis, Lauren A.; Gelb, Jack; Collins, Peter L.

    2013-01-01

    An avian paramyxovirus type 4 (APMV-4) was isolated from a duck in Delaware in 2010. Its genome is 15,048 nucleotides (nt) long, which is shorter by 6 nt than those for all previously reported strains. Phylogenetic analysis revealed that this strain formed a separate cluster within APMV-4 strains. PMID:23405329

  9. The EMBL nucleotide sequence database

    PubMed Central

    Stoesser, Guenter; Baker, Wendy; van den Broek, Alexandra; Camon, Evelyn; Garcia-Pastor, Maria; Kanz, Carola; Kulikova, Tamara; Lombard, Vincent; Lopez, Rodrigo; Parkinson, Helen; Redaschi, Nicole; Sterk, Peter; Stoehr, Peter; Tuli, Mary Ann

    2001-01-01

    The EMBL Nucleotide Sequence Database (http://www.ebi.ac.uk/embl/) is maintained at the European Bioinformatics Institute (EBI) in an international collaboration with the DNA Data Bank of Japan (DDBJ) and GenBank at the NCBI (USA). Data is exchanged amongst the collaborating databases on a daily basis. The major contributors to the EMBL database are individual authors and genome project groups. Webin is the preferred web-based submission system for individual submitters, whilst automatic procedures allow incorporation of sequence data from large-scale genome sequencing centres and from the European Patent Office (EPO). Database releases are produced quarterly. Network services allow free access to the most up-to-date data collection via ftp, email and World Wide Web interfaces. EBI’s Sequence Retrieval System (SRS), a network browser for databanks in molecular biology, integrates and links the main nucleotide and protein databases plus many specialized databases. For sequence similarity searching a variety of tools (e.g. Blitz, Fasta, BLAST) are available which allow external users to compare their own sequences against the latest data in the EMBL Nucleotide Sequence Database and SWISS-PROT. PMID:11125039

  10. Interactive computer programs for the graphic analysis of nucleotide sequence data.

    PubMed Central

    Luckow, V A; Littlewood, R K; Rownd, R H

    1984-01-01

    A group of interactive computer programs has been developed to aid in the collection and graphical analysis of nucleotide and protein sequence data. The programs perform the following basic functions: a) enter, edit, list, and rearrange sequence data; b) permit automatic entry of nucleotide sequence data directly from an autoradiograph into the computer; c) search for restriction sites or other specified patterns and plot a linear or circular restriction map, or print their locations; d) plot base composition; e) analyze homology between sequences by plotting a two-dimensional graphic matrix; and f) aid in plotting predicted secondary structures of RNA molecules. PMID:6546437
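
    Functions (c) and (d) in this list are simple to illustrate. The Python sketch below is not the 1984 software; it is a hypothetical, minimal version of a restriction-site search (for linear or circular molecules) and a base-composition summary, with function names and test sequences chosen only for illustration.

```python
from collections import Counter


def find_sites(seq: str, site: str, circular: bool = False) -> list[int]:
    """Return 1-based start positions of every occurrence of `site` in `seq`.

    For a circular molecule, matches that span the origin are also reported.
    """
    seq, site = seq.upper(), site.upper()
    # On a circle, append the first len(site)-1 bases so wrap-around hits match.
    text = seq + seq[:len(site) - 1] if circular else seq
    positions = []
    start = text.find(site)
    while start != -1:
        positions.append(start + 1)
        start = text.find(site, start + 1)
    return positions


def base_composition(seq: str) -> dict[str, float]:
    """Return the fraction of each of A, C, G, T in the sequence."""
    counts = Counter(seq.upper())
    total = sum(counts[b] for b in "ACGT")
    return {b: (counts[b] / total if total else 0.0) for b in "ACGT"}


if __name__ == "__main__":
    # GAATTC is the EcoRI recognition sequence; the DNA is made up.
    print(find_sites("GGAATTCCGAATTC", "GAATTC"))
    print(find_sites("TTCGGGGAA", "GAATTC", circular=True))
    print(base_composition("AACG"))
```

    The list of positions is what a restriction map would be plotted from; drawing the linear or circular map itself is left out of this sketch.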

  11. Nucleotide sequence analysis establishes the role of endogenous murine leukemia virus DNA segments in formation of recombinant mink cell focus-forming murine leukemia viruses.

    PubMed Central

    Khan, A S

    1984-01-01

    The sequence of 363 nucleotides near the 3' end of the pol gene and 564 nucleotides from the 5' terminus of the env gene in an endogenous murine leukemia viral (MuLV) DNA segment, cloned from AKR/J mouse DNA and designated as A-12, was obtained. For comparison, the nucleotide sequence in an analogous portion of AKR mink cell focus-forming (MCF) 247 MuLV provirus was also determined. Sequence features unique to MCF247 MuLV DNA in the 3' pol and 5' env regions were identified by comparison with nucleotide sequences in analogous regions of NFS-Th-1 xenotropic and AKR ecotropic MuLV proviruses. These included (i) an insertion of 12 base pairs encoding four amino acids located 60 base pairs from the 3' terminus of the pol gene and immediately preceding the env gene, (ii) the deletion of 12 base pairs (encoding four amino acids) and the insertion of 3 base pairs (encoding one amino acid) in the 5' portion of the env gene, and (iii) single base substitutions resulting in 2 MCF247-specific amino acids in the 3' pol and 23 in the 5' env regions. Nucleotide sequence comparison involving the 3' pol and 5' env regions of AKR MCF247, NFS xenotropic, and AKR ecotropic MuLV proviruses with the cloned endogenous MuLV DNA indicated that MCF247 proviral DNA sequences were conserved in the cloned endogenous MuLV proviral segment. In fact, total nucleotide sequence identity existed between the endogenous MuLV DNA and the MCF247 MuLV provirus in the 3' portion of the pol gene. In the 5' env region, only 4 of 564 nucleotides were different, resulting in three amino acid changes between AKR MCF247 MuLV DNA and the endogenous MuLV DNA present in clone A-12. In addition, nucleotide sequence comparison indicated that Moloney- and Friend-MCF MuLVs were also highly related in the 3' pol and 5' env regions to the cloned endogenous MuLV DNA. These results establish the role of endogenous MuLV DNA segments in generation of recombinant MCF viruses. PMID:6328017

  12. Detection and Identification of the First Viruses in Chia (Salvia hispanica)

    PubMed Central

    Celli, Marcos G.; Perotto, Maria C.; Martino, Julia A.; Flores, Ceferino R.; Conci, Vilma C.; Pardina, Patricia Rodriguez

    2014-01-01

    Chia (Salvia hispanica), an herbaceous plant native to Latin America, has become important in the last 20 years due to its beneficial effects on health. Here, we present the first record and identification of two viruses in chia plants. The comparison of the complete nucleotide sequences showed the presence of two viral species with the typical genome organization of bipartite New World begomovirus, identified as Sida mosaic Bolivia virus 2 and Tomato yellow spot virus, according to the ICTV taxonomic criteria for begomovirus classification. DNA-A from Sida mosaic Bolivia virus 2 exhibited 96.1% nucleotide identity with a Bolivian isolate of Sida micrantha, and Tomato yellow spot virus showed 95.3% nucleotide identity with an Argentine bean isolate. This is the first report of begomoviruses infecting chia as well as of the occurrence of Sida mosaic Bolivia virus 2 in Argentina. PMID:25243369

  13. The primary structure of the thymidine kinase gene of fish lymphocystis disease virus.

    PubMed

    Schnitzler, P; Handermann, M; Szépe, O; Darai, G

    1991-06-01

    The DNA nucleotide sequence of the thymidine kinase (TK) gene of fish lymphocystis disease virus (FLDV), which has been localized between the coordinates 0.678 to 0.688 of the viral genome, was determined. The analysis of the DNA nucleotide sequence located between the recognition sites of HindIII (0.669 map unit; nucleotide position 1) and AccI (nucleotide position 2032) revealed the presence of an open reading frame of 954 bp on the lower strand of this region between nucleotide positions 1868 (ATG) and 915 (TAA). It encodes a protein of 318 amino acid residues. The evolutionary relationships of the TK gene of FLDV to the other known TK genes were investigated using the method of progressive sequence alignment. These analyses revealed a high degree of diversity between the protein sequence of FLDV TK gene and the amino acid composition of other TKs tested. However, significant conservations were detected at several regions of amino acid residues of the FLDV TK protein when compared to the amino acid sequence of TKs of African swine fever virus, fowlpox virus, shope fibroma virus, and vaccinia virus and to the amino acid sequences of the cellular cytoplasmic TK of chicken, mouse, and man.
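
    The coordinate arithmetic behind the reported ORF size can be checked with a short Python sketch. The helper names are hypothetical, and the sketch assumes that positions 1868 and 915 are the inclusive outer boundaries of the reading frame; the abstract does not say whether the stop codon is counted among the 318 positions.

```python
def orf_span(start: int, end: int) -> int:
    """Length in bp of an ORF given inclusive boundary coordinates.

    Works for either strand: on the lower strand the start coordinate
    is numerically larger than the end coordinate.
    """
    return abs(start - end) + 1


def codon_count(length_bp: int) -> int:
    """Number of complete codons in a reading frame of the given length."""
    if length_bp % 3 != 0:
        raise ValueError("reading frame length is not a multiple of 3")
    return length_bp // 3


if __name__ == "__main__":
    length = orf_span(1868, 915)  # coordinates quoted in the abstract
    print(length, codon_count(length))
```

    With the quoted coordinates this gives 954 bp and 318 codon positions, matching the figures in the abstract.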

  14. The Mouse Genomes Project: a repository of inbred laboratory mouse strain genomes.

    PubMed

    Adams, David J; Doran, Anthony G; Lilue, Jingtao; Keane, Thomas M

    2015-10-01

    The Mouse Genomes Project was initiated in 2009 with the goal of using next-generation sequencing technologies to catalogue molecular variation in the common laboratory mouse strains, and a selected set of wild-derived inbred strains. The initial sequencing and survey of sequence variation in 17 inbred strains was completed in 2011 and included a comprehensive catalogue of single nucleotide polymorphisms, short insertion/deletions, larger structural variants including their fine scale architecture and landscape of transposable element variation, and genomic sites subject to post-transcriptional alteration of RNA. From this beginning, the resource has expanded significantly to include 36 fully sequenced inbred laboratory mouse strains, a refined and updated data processing pipeline, and new variation querying and data visualisation tools which are available on the project's website (http://www.sanger.ac.uk/resources/mouse/genomes/). The focus of the project is now the completion of de novo assembled chromosome sequences and strain-specific gene structures for the core strains. We discuss how the assembled chromosomes will power comparative analysis, data access tools and future directions of mouse genetics.

  15. The complete DNA sequence of lymphocystis disease virus.

    PubMed

    Tidona, C A; Darai, G

    1997-04-14

    Lymphocystis disease virus (LCDV) is the causative agent of lymphocystis disease, which has been reported to occur in over 100 different fish species worldwide. LCDV is a member of the family Iridoviridae and the type species of the genus Lymphocystivirus. The virions contain a single linear double-stranded DNA molecule, which is circularly permuted, terminally redundant, and heavily methylated at cytosines in CpG sequences. The complete nucleotide sequence of LCDV-1 (flounder isolate) was determined by automated cycle sequencing and primer walking. The genome of LCDV-1 is 102,653 bp in length and contains 195 open reading frames with coding capacities ranging from 40 to 1199 amino acids. Computer-assisted analyses of the deduced amino acid sequences led to the identification of several putative gene products with significant homologies to entries in protein data banks, such as the two major subunits of the viral DNA-dependent RNA polymerase, DNA polymerase, several protein kinases, two subunits of the ribonucleoside diphosphate reductase, DNA methyltransferase, the viral major capsid protein, insulin-like growth factor, and tumor necrosis factor receptor homolog.

  16. Sequence diversity within the reovirus S2 gene: reovirus genes reassort in nature, and their termini are predicted to form a panhandle motif.

    PubMed Central

    Chapell, J D; Goral, M I; Rodgers, S E; dePamphilis, C W; Dermody, T S

    1994-01-01

    To better understand genetic diversity within mammalian reoviruses, we determined S2 nucleotide and deduced sigma 2 amino acid sequences of nine reovirus strains and compared these sequences with those of prototype strains of the three reovirus serotypes. The S2 gene and sigma 2 protein are highly conserved among the four type 1, one type 2, and seven type 3 strains studied. Phylogenetic analyses based on S2 nucleotide sequences of the 12 reovirus strains indicate that diversity within the S2 gene is independent of viral serotype. Additionally, we found marked topological differences between phylogenetic trees generated from S1 and S2 gene nucleotide sequences of the seven type 3 strains. These results demonstrate that reovirus S1 and S2 genes have distinct evolutionary histories, thus providing phylogenetic evidence for lateral transfer of reovirus genes in nature. When variability among the 12 sigma 2-encoding S2 nucleotide sequences was analyzed at synonymous positions, we found that approximately 60 nucleotides at the 5' terminus and 30 nucleotides at the 3' terminus were markedly conserved in comparison with other sigma 2-encoding regions of S2. Predictions of RNA secondary structures indicate that the more conserved S2 sequences participate in the formation of an extended region of duplex RNA interrupted by a pair of stem-loops. Among the 12 deduced sigma 2 amino acid sequences examined, substitutions were observed at only 11% of amino acid positions. This finding suggests that constraints on the structure or function of sigma 2, perhaps in part because of its location in the virion core, have limited sequence diversity within this protein. PMID:8289378

  17. DNA sequence analysis of simian virus 40 mutants with deletions mapping in the leader region of the late viral mRNA's: mutants with deletions similar in size and position exhibit varied phenotypes.

    PubMed

    Barkan, A; Mertz, J E

    1981-02-01

    The nucleotide sequences of 10 viable yet partially defective deletion mutants of simian virus 40 were determined. The deletions mapped within, and, in many cases, 5' to, the predominant leader sequence of the late viral mRNA's. They ranged from 74 to 187 nucleotide pairs in length. Six of the mutants had lost the sequence that corresponds to the "cap" site (5' terminus) of the most abundant class of 16S mRNA's. One of these mutants had a deletion that extended 103 nucleotide pairs into the region preceding this primary cap site and, therefore, was missing many secondary cap sites as well. A seventh mutant lacked the entire major 16S leader sequence except for the first six nucleotides at its 5' end and the last nine at its 3' end. Although these mutants differed in the size and position of their deletions, we were unable to discover any simple correlations between their growth characteristics and their DNA sequences. This finding indicates that the secondary structures of the RNA transcripts may play a more important role than the exact nucleotide sequence of the RNAs in determining how they function within the cell.

  18. Antifungal polypeptides

    DOEpatents

    Altier, Daniel J.; Dahlbacka, Glen; Ellanskaya, legal representative, Natalia; Herrmann, Rafael; Hunter-Cevera, Jennie; McCutchen, Billy F.; Presnail, James K.; Rice, Janet A.; Schepers, Eric; Simmons, Carl R.; Torok, Tamas; Yalpani, Nasser; Ellanskaya, deceased, Irina

    2007-12-11

    Compositions and methods for protecting a plant from a pathogen, particularly a fungal pathogen, are provided. Compositions include novel amino acid sequences, and variants and fragments thereof, for antipathogenic polypeptides that were isolated from microbial fermentation broths. Nucleic acid molecules comprising nucleotide sequences that encode the antipathogenic polypeptides of the invention are also provided. A method for inducing pathogen resistance in a plant using the nucleotide sequences disclosed herein is further provided. The method comprises introducing into a plant an expression cassette comprising a promoter operably linked to a nucleotide sequence that encodes an antipathogenic polypeptide of the invention. Compositions comprising an antipathogenic polypeptide or a transformed microorganism comprising a nucleic acid of the invention in combination with a carrier and methods of using these compositions to protect a plant from a pathogen are further provided. Transformed plants, plant cells, seeds, and microorganisms comprising a nucleotide sequence that encodes an antipathogenic polypeptide of the invention, or variant or fragment thereof, are also disclosed.

  19. Antifungal polypeptides

    DOEpatents

    Altier, Daniel J.; Dahlbacka, Glen; Elleskaya, Irina; Ellanskaya, legal representative, Natalia; Herrmann, Rafael; Hunter-Cevera, Jennie; McCutchen, Billy F.; Presnail, James K.; Rice, Janet A.; Schepers, Eric; Simmons, Carl R.; Torok, Tamas; Yalpani, Nasser

    2010-08-10

    Compositions and methods for protecting a plant from a pathogen, particularly a fungal pathogen, are provided. Compositions include novel amino acid sequences, and variants and fragments thereof, for antipathogenic polypeptides that were isolated from microbial fermentation broths. Nucleic acid molecules comprising nucleotide sequences that encode the antipathogenic polypeptides of the invention are also provided. A method for inducing pathogen resistance in a plant using the nucleotide sequences disclosed herein is further provided. The method comprises introducing into a plant an expression cassette comprising a promoter operably linked to a nucleotide sequence that encodes an antipathogenic polypeptide of the invention. Compositions comprising an antipathogenic polypeptide or a transformed microorganism comprising a nucleic acid of the invention in combination with a carrier and methods of using these compositions to protect a plant from a pathogen are further provided. Transformed plants, plant cells, seeds, and microorganisms comprising a nucleotide sequence that encodes an antipathogenic polypeptide of the invention, or variant or fragment thereof, are also disclosed.

  20. Antifungal polypeptides

    DOEpatents

    Altier, Daniel J [Waukee, IA]; Dahlbacka, Glen [Oakland, CA]; Elleskaya, Irina [Kyiv, UA]; Ellanskaya, legal representative, Natalia; Herrmann, Rafael [Wilmington, DE]; Hunter-Cevera, Jennie [Elliott City, MD]; McCutchen, Billy F [College Station, IA]; Presnail, James K [Avondale, PA]; Rice, Janet A [Wilmington, DE]; Schepers, Eric [Port Deposit, MD]; Simmons, Carl R [Des Moines, IA]; Torok, Tamas [Richmond, CA]; Yalpani, Nasser [Johnston, IA]

    2011-04-12

    Compositions and methods for protecting a plant from a pathogen, particularly a fungal pathogen, are provided. Compositions include novel amino acid sequences, and variants and fragments thereof, for antipathogenic polypeptides that were isolated from microbial fermentation broths. Nucleic acid molecules comprising nucleotide sequences that encode the antipathogenic polypeptides of the invention are also provided. A method for inducing pathogen resistance in a plant using the nucleotide sequences disclosed herein is further provided. The method comprises introducing into a plant an expression cassette comprising a promoter operably linked to a nucleotide sequence that encodes an antipathogenic polypeptide of the invention. Compositions comprising an antipathogenic polypeptide or a transformed microorganism comprising a nucleic acid of the invention in combination with a carrier and methods of using these compositions to protect a plant from a pathogen are further provided. Transformed plants, plant cells, seeds, and microorganisms comprising a nucleotide sequence that encodes an antipathogenic polypeptide of the invention, or variant or fragment thereof, are also disclosed.

  1. Antifungal polypeptides

    DOEpatents

    Altier, Daniel J [Granger, IA]; Dahlbacka, Glen [Oakland, CA]; Ellanskaya, Irina [Kyiv, UA]; Ellanskaya, legal representative, Natalia; Herrmann, Rafael [Wilmington, DE]; Hunter-Cevera, Jennie [Elliott City, MD]; McCutchen, Billy F [College Station, TX]; Presnail, James K [Avondale, PA]; Rice, Janet A [Wilmington, DE]; Schepers, Eric [Port Deposit, MD]; Simmons, Carl R [Des Moines, IA]; Torok, Tamas [Richmond, CA]; Yalpani, Nasser [Johnston, IA]

    2012-04-03

    Compositions and methods for protecting a plant from a pathogen, particularly a fungal pathogen, are provided. Compositions include novel amino acid sequences, and variants and fragments thereof, for antipathogenic polypeptides that were isolated from microbial fermentation broths. Nucleic acid molecules comprising nucleotide sequences that encode the antipathogenic polypeptides of the invention are also provided. A method for inducing pathogen resistance in a plant using the nucleotide sequences disclosed herein is further provided. The method comprises introducing into a plant an expression cassette comprising a promoter operably linked to a nucleotide sequence that encodes an antipathogenic polypeptide of the invention. Compositions comprising an antipathogenic polypeptide or a transformed microorganism comprising a nucleic acid of the invention in combination with a carrier and methods of using these compositions to protect a plant from a pathogen are further provided. Transformed plants, plant cells, seeds, and microorganisms comprising a nucleotide sequence that encodes an antipathogenic polypeptide of the invention, or variant or fragment thereof, are also disclosed.

  2. Complete Genome Sequence of Zucchini Yellow Mosaic Virus Strain Kurdistan, Iran.

    PubMed

    Maghamnia, Hamid Reza; Hajizadeh, Mohammad; Azizi, Abdolbaset

    2018-03-01

    The complete genome sequence of Zucchini yellow mosaic virus strain Kurdistan (ZYMV-Kurdistan) infecting squash from Iran was determined from 13 overlapping fragments. Excluding the poly(A) tail, the ZYMV-Kurdistan genome consisted of 9593 nucleotides (nt), with 138 and 211 nt at the 5' and 3' non-translated regions, respectively. It contained two open reading frames (ORFs), the large ORF encoding a polyprotein of 3080 amino acids (aa) and the small overlapping ORF encoding a P3N-PIPO protein of 74 aa. This isolate had six unique aa differences compared to other ZYMV isolates and shared 79.6-98.8% identities with other ZYMV genome sequences at the nt level and 90.1-99% identities at the aa level. A phylogenetic tree of ZYMV complete genomic sequences showed that Iranian and Central European isolates are closely related and form a phylogenetically homogeneous group. All values of the ratio of substitution rates at non-synonymous and synonymous sites (dN/dS) were below 1, suggestive of strong negative selection forces during ZYMV protein history. This is the first report of complete genome sequence information of the most prevalent virus in the west of Iran. This study helps our understanding of the genetic diversity of ZYMV isolates infecting cucurbit plants in Iran, virus evolution and epidemiology and can assist in designing better diagnostic tools.

  3. Intercalation of XR5944 with the estrogen response element is modulated by the tri-nucleotide spacer sequence between half-sites

    PubMed Central

    Sidell, Neil; Mathad, Raveendra I.; Shu, Feng-jue; Zhang, Zhenjiang; Kallen, Caleb B.; Yang, Danzhou

    2011-01-01

    DNA-intercalating molecules can impair DNA replication, DNA repair, and gene transcription. We previously demonstrated that XR5944, a DNA bis-intercalator, specifically blocks binding of estrogen receptor-α (ERα) to the consensus estrogen response element (ERE). The consensus ERE sequence is AGGTCAnnnTGACCT, where nnn is known as the tri-nucleotide spacer. Recent work has shown that the tri-nucleotide spacer can modulate ERα-ERE binding affinity and ligand-mediated transcriptional responses. To further understand the mechanism by which XR5944 inhibits ERα-ERE binding, we tested its ability to interact with consensus EREs with variable tri-nucleotide spacer sequences and with natural but non-consensus ERE sequences using one dimensional nuclear magnetic resonance (1D 1H NMR) titration studies. We found that the tri-nucleotide spacer sequence significantly modulates the binding of XR5944 to EREs. Of the sequences that were tested, EREs with CGG and AGG spacers showed the best binding specificity with XR5944, while those spaced with TTT demonstrated the least specific binding. The binding stoichiometry of XR5944 with EREs was 2:1, which can explain why the spacer influences the drug-DNA interaction; each XR5944 spans four nucleotides (including portions of the spacer) when intercalating with DNA. To validate our NMR results, we conducted functional studies using reporter constructs containing consensus EREs with tri-nucleotide spacers CGG, CTG, and TTT. Results of reporter assays in MCF-7 cells indicated that XR5944 was significantly more potent in inhibiting the activity of CGG- than TTT-spaced EREs, consistent with our NMR results. Taken together, these findings predict that the anti-estrogenic effects of XR5944 will depend not only on ERE half-site composition but also on the tri-nucleotide spacer sequence of EREs located in the promoters of estrogen-responsive genes. PMID:21333738
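
    The consensus motif quoted above, AGGTCAnnnTGACCT, lends itself to a simple pattern search. The Python sketch below locates perfect-consensus EREs in a DNA string and reports each tri-nucleotide spacer; the function name and the test sequences are hypothetical, and natural non-consensus EREs of the kind also studied in the paper would not be matched by this strict pattern.

```python
import re

# Consensus ERE: two half-sites separated by any three bases (the spacer).
# The lookahead lets overlapping occurrences be reported as well.
ERE_PATTERN = re.compile(r"(?=AGGTCA([ACGT]{3})TGACCT)")


def find_consensus_eres(seq: str) -> list[tuple[int, str]]:
    """Return (1-based start position, spacer) for each consensus ERE."""
    return [(m.start() + 1, m.group(1))
            for m in ERE_PATTERN.finditer(seq.upper())]


if __name__ == "__main__":
    # Made-up promoter fragment with CGG- and TTT-spaced elements.
    fragment = "ttAGGTCAcggTGACCTaaAGGTCATTTTGACCTgg"
    for position, spacer in find_consensus_eres(fragment):
        print(position, spacer)
```

    Grouping the hits by spacer is one way to sort candidate elements into the CGG-, CTG-, and TTT-spaced classes compared in the reporter assays.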

  4. Complete mitochondrial genome sequences of three bats species and whole genome mitochondrial analyses reveal patterns of codon bias and lend support to a basal split in Chiroptera.

    PubMed

    Meganathan, P R; Pagan, Heidi J T; McCulloch, Eve S; Stevens, Richard D; Ray, David A

    2012-01-15

    Order Chiroptera is a unique group of mammals whose members have attained self-powered flight as their main mode of locomotion. Much speculation persists regarding bat evolution; however, lack of sufficient molecular data hampers evolutionary and conservation studies. Of ~1200 species, complete mitochondrial genome sequences are available for only eleven. Additional sequences should be generated if we are to resolve many questions concerning these fascinating mammals. Herein, we describe the complete mitochondrial genomes of three bats: Corynorhinus rafinesquii, Lasiurus borealis and Artibeus lituratus. We also compare the currently available mitochondrial genomes and analyze codon usage in Chiroptera. C. rafinesquii, L. borealis and A. lituratus mitochondrial genomes are 16438 bp, 17048 bp and 16709 bp, respectively. Genome organization and gene arrangements are similar to other bats. Phylogenetic analyses using complete mitochondrial genome sequences support previously established phylogenetic relationships and suggest utility in future studies focusing on the evolutionary aspects of these species. Comprehensive analyses of available bat mitochondrial genomes reveal distinct nucleotide patterns and synonymous codon preferences corresponding to different chiropteran families. These patterns suggest that mutational and selection forces are acting to different extents within Chiroptera and shape their mitochondrial genomes. Copyright © 2011 Elsevier B.V. All rights reserved.

  5. Complete mitochondrial genome sequences of the northern spotted owl (Strix occidentalis caurina) and the barred owl (Strix varia; Aves: Strigiformes: Strigidae) confirm the presence of a duplicated control region

    PubMed Central

    Henderson, James B.; Sellas, Anna B.; Fuchs, Jérôme; Bowie, Rauri C.K.; Dumbacher, John P.

    2017-01-01

    We report here the successful assembly of the complete mitochondrial genomes of the northern spotted owl (Strix occidentalis caurina) and the barred owl (S. varia). We utilized sequence data from two sequencing methodologies, Illumina paired-end sequence data with insert lengths ranging from approximately 250 nucleotides (nt) to 9,600 nt and read lengths from 100–375 nt and Sanger-derived sequences. We employed multiple assemblers and alignment methods to generate the final assemblies. The circular genomes of S. o. caurina and S. varia are comprised of 19,948 nt and 18,975 nt, respectively. Both code for two rRNAs, twenty-two tRNAs, and thirteen polypeptides. They both have duplicated control region sequences with complex repeat structures. We were not able to assemble the control regions solely using Illumina paired-end sequence data. By fully spanning the control regions, Sanger-derived sequences enabled accurate and complete assembly of these mitochondrial genomes. These are the first complete mitochondrial genome sequences of owls (Aves: Strigiformes) possessing duplicated control regions. We searched the nuclear genome of S. o. caurina for copies of mitochondrial genes and found at least nine separate stretches of nuclear copies of gene sequences originating in the mitochondrial genome (Numts). The Numts ranged from 226–19,522 nt in length and included copies of all mitochondrial genes except tRNAPro, ND6, and tRNAGlu. Strix occidentalis caurina and S. varia exhibited an average of 10.74% (8.68% uncorrected p-distance) divergence across the non-tRNA mitochondrial genes. PMID:29038757

  6. The nucleotide sequences of 5S rRNAs from a rotifer, Brachionus plicatilis, and two nematodes, Rhabditis tokai and Caenorhabditis elegans.

    PubMed Central

    Kumazaki, T; Hori, H; Osawa, S; Ishii, N; Suzuki, K

    1982-01-01

    The nucleotide sequences of 5S rRNAs from a rotifer, Brachionus plicatilis, and two nematodes, Rhabditis tokai and Caenorhabditis elegans have been determined. The rotifer has two 5S rRNA species that are composed of 120 and 121 nucleotides, respectively. The sequences of these two 5S rRNAs are the same except that the latter has an additional base at its 3'-terminus. The 5S rRNAs from the two nematode species are both 119 nucleotides long. The sequence similarity percents are 79% (Brachionus/Rhabditis), 80% (Brachionus/Caenorhabditis), and 95% (Rhabditis/Caenorhabditis) among these three species. Brachionus revealed the highest similarity to Lingula (89%), but not to the nematodes (79%). PMID:6891053

  7. Detection and quantitation of single nucleotide polymorphisms, DNA sequence variations, DNA mutations, DNA damage and DNA mismatches

    DOEpatents

    McCutchen-Maloney, Sandra L.

    2002-01-01

    DNA mutation binding proteins alone and as chimeric proteins with nucleases are used with solid supports to detect DNA sequence variations, DNA mutations and single nucleotide polymorphisms. The solid supports may be flow cytometry beads, DNA chips, glass slides or DNA dipsticks. DNA molecules are coupled to solid supports to form DNA-support complexes. Labeled DNA is used with unlabeled DNA mutation binding proteins such as TthMutS to detect DNA sequence variations, DNA mutations and single nucleotide length polymorphisms by binding, which gives an increase in signal. Unlabeled DNA is utilized with labeled chimeras to detect DNA sequence variations, DNA mutations and single nucleotide length polymorphisms by nuclease activity of the chimera, which gives a decrease in signal.

  8. 37 CFR 1.823 - Requirements for nucleotide and/or amino acid sequences as part of the application.

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ... and/or amino acid sequences as part of the application. 1.823 Section 1.823 Patents, Trademarks, and... Amino Acid Sequences § 1.823 Requirements for nucleotide and/or amino acid sequences as part of the... incorporation-by-reference of the Sequence Listing as required by § 1.52(e)(5). The presentation of the...

  9. 37 CFR 1.823 - Requirements for nucleotide and/or amino acid sequences as part of the application.

    Code of Federal Regulations, 2013 CFR

    2013-07-01

    ... and/or amino acid sequences as part of the application. 1.823 Section 1.823 Patents, Trademarks, and... Amino Acid Sequences § 1.823 Requirements for nucleotide and/or amino acid sequences as part of the... incorporation-by-reference of the Sequence Listing as required by § 1.52(e)(5). The presentation of the...

  10. 37 CFR 1.823 - Requirements for nucleotide and/or amino acid sequences as part of the application.

    Code of Federal Regulations, 2012 CFR

    2012-07-01

    ... and/or amino acid sequences as part of the application. 1.823 Section 1.823 Patents, Trademarks, and... Amino Acid Sequences § 1.823 Requirements for nucleotide and/or amino acid sequences as part of the... incorporation-by-reference of the Sequence Listing as required by § 1.52(e)(5). The presentation of the...

  11. 37 CFR 1.823 - Requirements for nucleotide and/or amino acid sequences as part of the application.

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ... and/or amino acid sequences as part of the application. 1.823 Section 1.823 Patents, Trademarks, and... Amino Acid Sequences § 1.823 Requirements for nucleotide and/or amino acid sequences as part of the... incorporation-by-reference of the Sequence Listing as required by § 1.52(e)(5). The presentation of the...

  12. 37 CFR 1.823 - Requirements for nucleotide and/or amino acid sequences as part of the application.

    Code of Federal Regulations, 2014 CFR

    2014-07-01

    ... and/or amino acid sequences as part of the application. 1.823 Section 1.823 Patents, Trademarks, and... Amino Acid Sequences § 1.823 Requirements for nucleotide and/or amino acid sequences as part of the... incorporation-by-reference of the Sequence Listing as required by § 1.52(e)(5). The presentation of the...

  13. Beet western yellows virus infects the carnivorous plant Nepenthes mirabilis.

    PubMed

    Miguel, Sissi; Biteau, Flore; Mignard, Benoit; Marais, Armelle; Candresse, Thierry; Theil, Sébastien; Bourgaud, Frédéric; Hehn, Alain

    2016-08-01

    Although poleroviruses are known to infect a broad range of higher plants, carnivorous plants have not yet been reported as hosts. Here, we describe the first polerovirus naturally infecting the pitcher plant Nepenthes mirabilis. The virus was identified through bioinformatic analysis of NGS transcriptome data. The complete viral genome sequence was assembled from overlapping PCR fragments and shown to share 91.1 % nucleotide sequence identity with the US isolate of beet western yellows virus (BWYV). Further analysis of other N. mirabilis plants revealed the presence of additional BWYV isolates differing by several insertion/deletion mutations in ORF5.

  14. The mitogenome of Onchocerca volvulus from the Brazilian Amazonia focus.

    PubMed

    Crainey, James L; Silva, Túllio R R da; Encinas, Fernando; Marín, Michel A; Vicente, Ana Carolina P; Luz, Sérgio L B

    2016-01-01

    We report here the first complete mitochondrial genome of Onchocerca volvulus from a focus outside of Africa. An O. volvulus mitogenome from the Brazilian Amazonia focus was obtained using a combination of high-throughput and Sanger sequencing technologies. Comparisons made between this mitochondrial genome and publicly available mitochondrial sequences identified 46 variant nucleotide positions and suggested that our Brazilian mitogenome is more closely related to Cameroon-origin mitochondria than West African-origin mitochondria. As well as providing insights into the origins of Latin American onchocerciasis, the Brazilian Amazonia focus mitogenome may also have value as an epidemiological resource.

  15. The complete chloroplast genome of Sinopodophyllum hexandrum Ying (Berberidaceae).

    PubMed

    Meng, Lihua; Liu, Ruijuan; Chen, Jianbing; Ding, Chenxu

    2017-05-01

    The complete nucleotide sequence of the Sinopodophyllum hexandrum Ying chloroplast genome (cpDNA) was determined based on next-generation sequencing technologies in this study. The genome was 157 203 bp in length, containing a pair of inverted repeat (IRa and IRb) regions of 25 960 bp, which were separated by a large single-copy (LSC) region of 87 065 bp and a small single-copy (SSC) region of 18 218 bp, respectively. The cpDNA contained 148 genes, including 96 protein-coding genes, 8 ribosomal RNA genes, and 44 tRNA genes. Among these genes, eight harbored a single intron, and two (ycf3 and clpP) contained a couple of introns. The AT content of S. hexandrum cpDNA is 61.5%.
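
    The region lengths in this abstract can be checked against the reported total, since a quadripartite chloroplast genome is the sum of the LSC, the SSC and two copies of the inverted repeat. The sketch below only re-adds the figures quoted above; the function and variable names are illustrative, and it assumes the 25 960 bp figure applies to each of IRa and IRb.

```python
# Region lengths (bp) as reported in the abstract.
LSC = 87_065
SSC = 18_218
IR = 25_960  # assumed to be the length of each of IRa and IRb
REPORTED_TOTAL = 157_203


def quadripartite_length(lsc, ssc, ir):
    """Total genome length: LSC + SSC + two inverted-repeat copies."""
    return lsc + ssc + 2 * ir


total = quadripartite_length(LSC, SSC, IR)
print(total, total == REPORTED_TOTAL)

# Gene categories as reported; they should sum to the stated 148 genes.
genes = {"protein_coding": 96, "rRNA": 8, "tRNA": 44}
print(sum(genes.values()))
```

    Both sums agree with the totals stated in the abstract (157 203 bp and 148 genes).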

  16. Description of the mitochondrial genome of the tree coral Dendrophyllia arbuscula (Anthozoa, Scleractinia).

    PubMed

    Luz, Bruna Louise Pereira; Capel, Kátia Cristina Cruz; Stampar, Sérgio Nascimento; Kitahara, Marcelo Visentini

    2016-07-01

    Dendrophylliidae is one of the few monophyletic families within the Scleractinia that embraces zooxanthellate and azooxanthellate species represented by both solitary and colonial forms. Among the exclusively azooxanthellate genera, Dendrophyllia is reported worldwide from 1 to 1200 m deep. To date, although three complete mitochondrial (mt) genomes from representatives of the family are available, only that from Turbinaria peltata has been formally published. Here we describe the complete nucleotide sequence of the mt genome from Dendrophyllia arbuscula that is 19 069 bp in length and comprises two rDNAs, two tRNAs, and 13 protein-coding genes arranged in the canonical scleractinian mt gene order. No genes overlap, resulting in the presence of 18 intergenic spacers and one of the longest scleractinian mt genomes sequenced to date.

  17. Biological nanopore MspA for DNA sequencing

    NASA Astrophysics Data System (ADS)

    Manrao, Elizabeth A.

    Unlocking the information hidden in the human genome provides insight into the inner workings of complex biological systems and can be used to greatly improve health-care. In order to allow for widespread sequencing, new technologies are required that provide fast and inexpensive readings of DNA. Nanopore sequencing is a third generation DNA sequencing technology that is currently being developed to fulfill this need. In nanopore sequencing, a voltage is applied across a small pore in an electrolyte solution and the resulting ionic current is recorded. When DNA passes through the channel, the ionic current is partially blocked. If the DNA bases uniquely modulate the ionic current flowing through the channel, the time trace of the current can be related to the sequence of DNA passing through the pore. There are two main challenges to realizing nanopore sequencing: identifying a pore with sensitivity to single nucleotides and controlling the translocation of DNA through the pore so that the small single nucleotide current signatures are distinguishable from background noise. In this dissertation, I explore the use of Mycobacterium smegmatis porin A (MspA) for nanopore sequencing. In order to determine MspA's sensitivity to single nucleotides, DNA strands of various compositions are held in the pore as the resulting ionic current is measured. DNA is immobilized in MspA by attaching it to a large molecule which acts as an anchor. This technique confirms the single nucleotide resolution of the pore and additionally shows that MspA is sensitive to epigenetic modifications and single nucleotide polymorphisms. The forces from the electric field within MspA, the effective charge of nucleotides, and elasticity of DNA are estimated using a Freely Jointed Chain model of single stranded DNA. These results offer insight into the interactions of DNA within the pore. 
    With the nucleotide sensitivity of MspA confirmed, a method is introduced to controllably pass DNA through the pore. Using a DNA polymerase, DNA strands are stepped through MspA one nucleotide at a time. The steps are observable as distinct levels on the ionic-current time-trace and are related to the DNA sequence. These experiments overcome the two fundamental challenges to realizing MspA nanopore sequencing and pave the way to the development of a commercial technology.

  18. A new single-nucleotide polymorphism database for rainbow trout generated through whole genome re-sequencing

    USDA-ARS's Scientific Manuscript database

    Single-nucleotide polymorphisms (SNPs) are highly abundant markers, which are broadly distributed in animal genomes. For rainbow trout, SNP discovery has been done through sequencing of restriction-site associated DNA (RAD) libraries, reduced representation libraries (RRL), RNA sequencing, and whole...

  19. Co-circulation of a novel phlebovirus and Massilia virus in sandflies, Portugal.

    PubMed

    Amaro, Fátima; Zé-Zé, Líbia; Alves, Maria J; Börstler, Jessica; Clos, Joachim; Lorenzen, Stephan; Becker, Stefanie Christine; Schmidt-Chanasit, Jonas; Cadar, Daniel

    2015-10-24

    In Portugal, entomological surveys to detect phleboviruses in their natural vectors have not been performed so far. Thus, the aims of the present study were to detect, isolate and characterize phleboviruses in sandfly populations of Portugal. From May to October 2007-2008, 896 female sandflies were trapped in Arrábida region, located on the southwest coast of Portugal. Phlebovirus RNA was detected by using a pan-phlebovirus RT-PCR in 4 out of 34 Phlebotomus perniciosus pools. Direct sequencing of the amplicons showed that two samples exhibited 72 % nucleotide identity with Arbia virus, and two showed 96 % nucleotide identity with Massilia virus. The Arbia-like virus (named Alcube virus) was isolated in cell culture and complete genomic sequences of one Alcube and two Massilia viruses were determined using next-generation sequencing technology. Phylogenetic analysis demonstrated that Alcube virus clustered with members of the Salehabad virus species complex. Within this clade, Alcube virus forms a monophyletic lineage with the Arbia, Salehabad and Adana viruses sharing a common ancestor. Arbia virus has been identified as the most closely related virus with 20-28 % nucleotide and 10-27 % amino acid divergences depending on the analysed segment. We have provided genetic evidence for the circulation of a novel phlebovirus species named Alcube virus in Ph. perniciosus and co-circulation of Massilia virus, in Arrábida region, southwest of Portugal. Further epidemiological investigations and surveillance for sandfly-borne phleboviruses in Portugal are needed to elucidate their medical importance.

  20. Mutations That Improve the pRE Promoter of Coliphage Lambda

    PubMed Central

    Mahoney, Michael E.; Wulff, Daniel L.

    1987-01-01

    The dya5 mutation, a C→T change at position -43 of the λ pRE promoter, results in a twofold increase in pRE activity in vivo. Smaller increases in pRE activity are found for the dya2 mutation, a T→C change at position -1 of pRE, and the dya3 mutation, an A→G change at +5 of pRE. The mutant pRE promoters retain complete dependence on cII protein for activity. These observations argue, at least for pRE-like promoters, that promoter activities are influenced by nucleotide sequences at least eight nucleotides to the 5'-side of the conventional -35 region consensus sequence, and by nucleotide sequences near the start-site of transcription. Although Hawley and McClure (1983) found A·T pairs more frequently than G·C pairs in the region of -40 to -45 of prokaryotic promoters, other mutations that change a G·C pair to an A·T pair at positions -41, -44 and -45 of pRE do not result in increased promoter activity. We also found that a T→C change at position -42 results in a mild decrease in promoter activity. These observations argue that Ts at positions -42 and -43 of pRE are required for maximum promoter activity, but do not support the hypothesis that As and Ts in the -40 to -45 region generally lead to higher promoter activities. PMID:2953648

  1. Isolation and Genomic Characterization of a Duck-Origin GPV-Related Parvovirus from Cherry Valley Ducklings in China.

    PubMed

    Chen, Hao; Dou, Yanguo; Tang, Yi; Zhang, Zhenjie; Zheng, Xiaoqiang; Niu, Xiaoyu; Yang, Jing; Yu, Xianglong; Diao, Youxiang

    2015-01-01

    A newly emerged duck parvovirus, which causes beak atrophy and dwarfism syndrome (BADS) in Cherry Valley ducks, has appeared in Northern China since March 2015. To explore the genetic diversity among waterfowl parvovirus isolates, the complete genome of an identified isolate designated SDLC01 was sequenced and analyzed in the present study. Genomic sequence analysis showed that SDLC01 shared 90.8%-94.6% of nucleotide identity with goose parvovirus (GPV) isolates and 78.6%-81.6% of nucleotide identity with classical Muscovy duck parvovirus (MDPV) isolates. Phylogenetic analysis of 443 nucleotides (nt) of the fragment A showed that SDLC01 was highly similar to a mule duck isolate (strain D146/02) and close to European GPV isolates but separate from Asian GPV isolates. Analysis of the left inverted terminal repeat regions revealed that SDLC01 had two major segments deleted between positions 160-176 and 306-322 nt compared with field GPV and MDPV isolates. Phylogenetic analysis of Rep and VP1 encoded by two major open reading frames of parvoviruses revealed that SDLC01 was distinct from all GPV and MDPV isolates. The viral pathogenicity and genome characterization of SDLC01 suggest that the novel GPV (N-GPV) is the causative agent of BADS and belongs to a distinct GPV-related subgroup. Furthermore, N-GPV sequences were detected in diseased ducks by polymerase chain reaction and viral proliferation was demonstrated in duck embryos and duck embryo fibroblast cells.

  2. Structure of yeast Argonaute with guide RNA

    PubMed Central

    Nakanishi, Kotaro; Weinberg, David E.; Bartel, David P.; Patel, Dinshaw J.

    2012-01-01

    The RNA-induced silencing complex, comprising Argonaute and guide RNA, mediates RNA interference. Here we report the 3.2 Å crystal structure of Kluyveromyces Argonaute (KpAGO) fortuitously complexed with guide RNA originating from small-RNA duplexes autonomously loaded and processed by recombinant KpAGO. Despite their diverse sequences, guide-RNA nucleotides 1–8 are positioned similarly, with sequence-independent contacts to bases, phosphates and 2′-hydroxyl groups pre-organizing the backbone of nucleotides 2–8 in a near–A-form conformation. Compared with prokaryotic Argonautes, KpAGO has numerous surface-exposed insertion segments, with a cluster of conserved insertions repositioning the N domain to enable full propagation of guide–target pairing. Compared with Argonautes in inactive conformations, KpAGO has a hydrogen-bond network that stabilizes an expanded and repositioned loop, which inserts an invariant glutamate into the catalytic pocket. Mutation analyses and analogies to Ribonuclease H indicate that insertion of this glutamate finger completes a universally conserved catalytic tetrad, thereby activating Argonaute for RNA cleavage. PMID:22722195

  3. Cleavage sites in the polypeptide precursors of poliovirus protein P2-X

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Selmer, B.L.; Hanecak, R.; Anderson, C.W.

    1981-01-01

    Partial amino-terminal sequence analysis has been performed on the three major polypeptide products (P2-3b, P2-5b, and P2-X) from the central region (P2) of the poliovirus polyprotein, and this analysis precisely locates the amino termini of these products with respect to the nucleotide sequence of the poliovirus RNA genome. Like most of the products of the replicase region (P3), the amino termini of P2-5b and P2-X are generated by cleavage between glutamine and glycine residues. Thus, P2-5b and P2-X are probably both produced by the action of a single (virus-encoded) proteinase. The amino terminus of P2-3b, on the other hand, is produced by a cleavage between the carboxy-terminal tyrosine of VP1 and the glycine encoded by nucleotides 3381-3383. This result may suggest that more than one proteolytic activity is required for the complete processing of the poliovirus polyprotein.

  4. Characterization of a potyvirus associated with yellow mosaic disease of jasmine (Jasminum sambac L.) in Andhra Pradesh, India.

    PubMed

    Sudheera, Y; Vishnu Vardhan, G P; Hema, M; Krishna Reddy, M; Sreenivasulu, P

    2014-01-01

    A virus isolate associated with yellow mosaic disease was purified from commercially cultivated jasmine (Jasminum sambac) from Andhra Pradesh, India and it contained flexuous filamentous particles of ~720 × 13 nm. The denatured purified virus had a single major polypeptide of molecular weight 32 kDa. Complementary DNA representing 1678 nucleotides (nt) of the 3' terminus of viral RNA was cloned and sequenced. Comparisons of complete coat protein (CP) gene nucleotide and amino acid sequences of the present virus isolate with certain reported potyviruses revealed 86.1 and 92.7 % identity, respectively, with jasmine potyvirus T (JaVT) reported from Taiwan and less than 70 % with other potyviruses. Based on the phylogenetic analysis of 3' UTR and CP gene, the present virus isolate was identified as an isolate of JaVT that belongs to the genus Potyvirus and the name Jasmine yellow mosaic virus-Andhra Pradesh (JaYMV-AP) is proposed.

  5. The Nucleotide Sequence and Spliced pol mRNA Levels of the Nonprimate Spumavirus Bovine Foamy Virus

    PubMed Central

    Holzschu, Donald L.; Delaney, Mari A.; Renshaw, Randall W.; Casey, James W.

    1998-01-01

    We have determined the complete nucleotide sequence of a replication-competent clone of bovine foamy virus (BFV) and have quantitated the amount of spliced pol mRNA processed early in infection. The 544-amino-acid Gag protein precursor has little sequence similarity with its primate foamy virus homologs, but the putative nucleocapsid (NC) protein, like the primate NCs, contains the three glycine-arginine-rich regions that are postulated to bind genomic RNA during virion assembly. The BFV gag and pol open reading frames overlap, with pro and pol in the same translational frame. As with the human foamy virus (HFV) and feline foamy virus, we have detected a spliced pol mRNA by PCR. Quantitatively, this mRNA approximates the level of full-length genomic RNA early in infection. The integrase (IN) domain of reverse transcriptase does not contain the canonical HH-CC zinc finger motif present in all characterized retroviral INs, but it does contain a nearby histidine residue that could conceivably participate as a member of the zinc finger. The env gene encodes a protein that is over 40% identical in sequence to the HFV Env. By comparison, the Gag precursor of BFV is predicted to be only 28% identical to the HFV protein. PMID:9499074

  6. Analysis of the beak and feather disease viral genome indicates the existence of several genotypes which have a complex psittacine host specificity.

    PubMed

    de Kloet, E; de Kloet, S R

    2004-12-01

    A study was made of the phylogenetic relationships between fifteen complete nucleotide sequences as well as 43 nucleotide sequences of the putative coat protein gene of different strains belonging to the virus species Beak and feather disease virus obtained from 39 individuals of 16 psittacine species. The species included, among others, cockatoos (Cacatuini), African grey parrots (Psittacus erithacus) and peach-faced lovebirds (Agapornis roseicollis), which were infected at different geographical locations, within and outside Australia, the native origin of the virus. The derived amino acid sequences of the putative coat protein were highly diverse, with differences between some strains amounting to 50 of the 250 amino acids. Phylogenetic analysis demonstrated that the putative coat gene sequences form six clusters which show a varying degree of psittacine species specificity. Most, but not all, strains infecting African grey parrots formed a single cluster, as did the strains infecting the cockatoos. Strains infecting the lovebirds clustered with those infecting such Australasian species as Eclectus roratus, Psittacula kramerii and Psephotus haematogaster. Although individual birds included in this study were, where studied, often infected by closely related strains, infection by highly diverged strains was also detected. The possible relationship between BFD viral strains and clinical disease signs is discussed.

  7. The History of Bordetella pertussis Genome Evolution Includes Structural Rearrangement

    PubMed Central

    Peng, Yanhui; Loparev, Vladimir; Batra, Dhwani; Bowden, Katherine E.; Burroughs, Mark; Cassiday, Pamela K.; Davis, Jamie K.; Johnson, Taccara; Juieng, Phalasy; Knipe, Kristen; Mathis, Marsenia H.; Pruitt, Andrea M.; Rowe, Lori; Sheth, Mili; Tondella, M. Lucia; Williams, Margaret M.

    2017-01-01

    ABSTRACT Despite high pertussis vaccine coverage, reported cases of whooping cough (pertussis) have increased over the last decade in the United States and other developed countries. Although Bordetella pertussis is well known for its limited gene sequence variation, recent advances in long-read sequencing technology have begun to reveal genomic structural heterogeneity among otherwise indistinguishable isolates, even within geographically or temporally defined epidemics. We have compared rearrangements among complete genome assemblies from 257 B. pertussis isolates to examine the potential evolution of the chromosomal structure in a pathogen with minimal gene nucleotide sequence diversity. Discrete changes in gene order were identified that differentiated genomes from vaccine reference strains and clinical isolates of various genotypes, frequently along phylogenetic boundaries defined by single nucleotide polymorphisms. The observed rearrangements were primarily large inversions centered on the replication origin or terminus and flanked by IS481, a mobile genetic element with >240 copies per genome and previously suspected to mediate rearrangements and deletions by homologous recombination. These data illustrate that structural genome evolution in B. pertussis is not limited to reduction but also includes rearrangement. Therefore, although genomes of clinical isolates are structurally diverse, specific changes in gene order are conserved, perhaps due to positive selection, providing novel information for investigating disease resurgence and molecular epidemiology. IMPORTANCE Whooping cough, primarily caused by Bordetella pertussis, has resurged in the United States even though the coverage with pertussis-containing vaccines remains high. The rise in reported cases has included increased disease rates among all vaccinated age groups, provoking questions about the pathogen's evolution. The chromosome of B. pertussis includes a large number of repetitive mobile genetic elements that obstruct genome analysis. However, these mobile elements facilitate large rearrangements that alter the order and orientation of essential protein-encoding genes, which otherwise exhibit little nucleotide sequence diversity. By comparing the complete genome assemblies from 257 isolates, we show that specific rearrangements have been conserved throughout recent evolutionary history, perhaps by eliciting changes in gene expression, which may also provide useful information for molecular epidemiology. PMID:28167525

  8. Genome analysis of canine astroviruses reveals genetic heterogeneity and suggests possible inter-species transmission.

    PubMed

    Mihalov-Kovács, Eszter; Martella, Vito; Lanave, Gianvito; Bodnar, Livia; Fehér, Enikő; Marton, Szilvia; Kemenesi, Gábor; Jakab, Ferenc; Bányai, Krisztián

    2017-03-15

    Canine astrovirus RNA was detected in the stools of 17/63 (26.9%) samples, using either a broadly reactive consensus RT-PCR for astroviruses or random RT-PCR coupled with massive deep sequencing. The complete or nearly complete genome sequences of five canine astroviruses were reconstructed, which allowed mapping of the genome organization and investigation of the genetic diversity of these viruses. The genome was about 6.6kb in length and contained three open reading frames (ORFs) flanked by a 5' UTR, and a 3' UTR plus a poly-A tail. ORF1a and ORF1b overlapped by 43 nucleotides while the ORF2 overlapped by 8 nucleotides with the 3' end of ORF1b. Upon genome comparison, four strains (HUN/2012/2, HUN/2012/6, HUN/2012/115, and HUN/2012/135) were more related genetically to each other and to UK canine astroviruses (88-96% nt identity), whilst strain HUN/2012/126 was more divergent (75-76% nt identity). In the ORF1b and ORF2, strains HUN/2012/2, HUN/2012/6, and HUN/2012/135 were related genetically to other canine astroviruses identified formerly in Europe and China, whereas strain HUN/2012/126 was related genetically to a divergent canine astrovirus strain, ITA/2010/Zoid. For one canine astrovirus, HUN/2012/8, only a 3.2kb portion of the genome, at the 3' end, could be determined. Interestingly, this strain possessed unique genetic signatures (including a longer ORF1b/ORF2 overlap and a longer 3'UTR) and it was divergent in both ORF1b and ORF2 from all other canine astroviruses, with the highest nucleotide sequence identity (68% and 63%, respectively) to a mink astrovirus, thus suggesting a possible event of interspecies transmission. The genetic heterogeneity of canine astroviruses may pose a challenge for the diagnostics and for future prophylaxis strategies. Copyright © 2016 Elsevier B.V. All rights reserved.

  9. Molecular characterization of two prunus necrotic ringspot virus isolates from Canada.

    PubMed

    Cui, Hongguang; Hong, Ni; Wang, Guoping; Wang, Aiming

    2012-05-01

    We determined the entire RNA1, 2 and 3 sequences of two prunus necrotic ringspot virus (PNRSV) isolates, Chr3 from cherry and Pch12 from peach, obtained from an orchard in the Niagara Fruit Belt, Canada. The RNA1, 2 and 3 of the two isolates share nucleotide sequence identities of 98.6%, 98.4% and 94.5%, respectively. Their RNA1- and 2-encoded amino acid sequences are about 98% identical to the corresponding sequences of a cherry isolate, CH57, the only other PNRSV isolate with complete RNA1 and 2 sequences available. Phylogenetic analysis of the coat protein and movement protein encoded by RNA3 of Pch12 and Chr3 and published PNRSV isolates indicated that Chr3 belongs to the PV96 group and Pch12 belongs to the PV32 group.

  10. Sequence analysis of the internal transcribed spacer (ITS) region reveals a novel clade of Ichthyophonus sp. from rainbow trout

    USGS Publications Warehouse

    Rasmussen, C.; Purcell, M.K.; Gregg, J.L.; LaPatra, S.E.; Winton, J.R.; Hershberger, P.K.

    2010-01-01

    The mesomycetozoean parasite Ichthyophonus hoferi is most commonly associated with marine fish hosts but also occurs in some components of the freshwater rainbow trout Oncorhynchus mykiss aquaculture industry in Idaho, USA. It is not certain how the parasite was introduced into rainbow trout culture, but it might have been associated with the historical practice of feeding raw, ground common carp Cyprinus carpio that were caught by commercial fishermen. Here, we report a major genetic division between west coast freshwater and marine isolates of Ichthyophonus hoferi. Sequence differences were not detected in 2 regions of the highly conserved small subunit (18S) rDNA gene; however, nucleotide variation was seen in internal transcribed spacer loci (ITS1 and ITS2), both within and among the isolates. Intra-isolate variation ranged from 2.4 to 7.6 nucleotides over a region consisting of ~740 bp. Majority consensus sequences from marine/anadromous hosts differed in only 0 to 3 nucleotides (99.6 to 100% nucleotide identity), while those derived from freshwater rainbow trout had no nucleotide substitutions relative to each other. However, the consensus sequences between isolates from freshwater rainbow trout and those from marine/anadromous hosts differed in 13 to 16 nucleotides (97.8 to 98.2% nucleotide identity).
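
    The identity figures quoted in this record follow from simple arithmetic on the number of differing positions. The sketch below is an illustration only: it assumes an aligned length of exactly 740 bp (the record says ~740 bp), and the function name `percent_identity` is hypothetical rather than taken from the study.

```python
def percent_identity(differences: int, aligned_length: int) -> float:
    """Percent of aligned positions that are identical between two sequences."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    if not 0 <= differences <= aligned_length:
        raise ValueError("differences must lie between 0 and aligned_length")
    return 100.0 * (aligned_length - differences) / aligned_length


# Assumed aligned length; the abstract gives only an approximate value (~740 bp).
ALIGNED_LENGTH = 740

for diffs in (0, 3, 13, 16):
    print(diffs, round(percent_identity(diffs, ALIGNED_LENGTH), 1))
```

    Under that assumption, 0 to 3 differences give 100.0 to 99.6% identity and 13 to 16 differences give 98.2 to 97.8%, consistent with the ranges reported above.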

  11. Complete mitochondrial genome sequence of black mustard (Brassica nigra; BB) and comparison with Brassica oleracea (CC) and Brassica carinata (BBCC).

    PubMed

    Yamagishi, Hiroshi; Tanaka, Yoshiyuki; Terachi, Toru

    2014-11-01

    Crop species of Brassica (Brassicaceae) consist of three monogenomic species and three amphidiploid species resulting from interspecific hybridizations among them. Until now, mitochondrial genome sequences were available for only five of these species. We sequenced the mitochondrial genome of the sixth species, Brassica nigra (nuclear genome constitution BB), and compared it with those of Brassica oleracea (CC) and Brassica carinata (BBCC). The genome was assembled into a 232 145 bp circular sequence that is slightly larger than that of B. oleracea (219 952 bp). The genome of B. nigra contained 33 protein-coding genes, 3 rRNA genes, and 17 tRNA genes. The cox2-2 gene present in B. oleracea was absent in B. nigra. Although the nucleotide sequences of 52 genes were identical between B. nigra and B. carinata, the second exon of rps3 showed differences including an insertion/deletion (indel) and nucleotide substitutions. A PCR test to detect the indel revealed intraspecific variation in rps3, and in one line of B. nigra it amplified a DNA fragment of the size expected for B. carinata. In addition, the B. carinata lines tested here produced DNA fragments of the size expected for B. nigra. The results indicate that at least two mitotypes of B. nigra were present in the maternal parents of B. carinata.

  12. Infectious hematopoietic necrosis virus: monophyletic origin of European isolates from North American genogroup M.

    PubMed

    Enzmann, P J; Kurath, G; Fichtner, D; Bergmann, S M

    2005-09-23

    Infectious hematopoietic necrosis virus (IHNV) was first detected in Europe in 1987 in France and Italy, and later, in 1992, in Germany. The source of the virus and the route of introduction are unknown. The present study investigates the molecular epidemiology of IHNV outbreaks in Germany since its first introduction. The complete nucleotide sequences of the glycoprotein (G) and non-virion (NV) genes from 9 IHNV isolates from Germany have been determined, and this has allowed the identification of characteristic differences between these isolates. Phylogenetic analysis of partial G gene sequences (mid-G, 303 nucleotides) from North American IHNV isolates (Kurath et al. 2003) has revealed 3 major genogroups, designated U, M and L. Using this gene region with 2 different North American IHNV data sets, it was possible to group the European IHNV strains within the M genogroup, but not in any previously defined subgroup. Analysis of the full length G gene sequences indicated that an independent evolution of IHN viruses had occurred in Europe. IHN viruses in Europe seem to be of a monophyletic origin, again most closely related to North American isolates in the M genogroup. Analysis of the NV gene sequences also showed the European isolates to be monophyletic, but resolution of the 3 genogroups was poor with this gene region. As a result of comparative sequence analyses, several different genotypes have been identified circulating in Europe.

  13. Changes in mumps virus neurovirulence phenotype associated with quasispecies heterogeneity

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Sauder, Christian J.; Vandenburgh, Kari M.; Iskow, Rebecca C.

    2006-06-20

    Mumps virus is a highly neurotropic virus with evidence of central nervous system (CNS) invasion in approximately half of all cases of infection. In countries where live attenuated mumps virus vaccines were introduced, the number of mumps cases declined dramatically; however, recently, the safety of some vaccine strains has been questioned. For example, one of the most widely used vaccines, the Urabe AM9 strain, was causally associated with meningitis, leading to the withdrawal of this product from the market in several countries. This highlights the need for a better understanding of the attenuation process and the identification of markers of attenuation. To this end, we further attenuated the Urabe AM9 strain by serial passage in cell culture and compared the complete nucleotide sequences of the parental and passaged viruses. Interestingly, despite a dramatic decrease in virus virulence (as assayed in rats), the only genomic changes were in the form of changes in the level of genetic heterogeneity at specific genome sites, i.e., either selection of one nucleotide variant at positions where the starting material exhibited nucleotide heterogeneity or the evolution of an additional nucleotide to create a heterogenic site. This finding suggests that changes in the level of genetic heterogeneity at specific genome sites can have profound neurovirulence phenotypic consequences and, therefore, caution should be exercised when evaluating genetic markers of virulence or attenuation based only on a consensus sequence.

  14. Primary and secondary structural analyses of glutathione S-transferase pi from human placenta.

    PubMed

    Ahmad, H; Wilson, D E; Fritz, R R; Singh, S V; Medh, R D; Nagle, G T; Awasthi, Y C; Kurosky, A

    1990-05-01

    The primary structure of glutathione S-transferase (GST) pi from a single human placenta was determined. The structure was established by chemical characterization of tryptic and cyanogen bromide peptides as well as automated sequence analysis of the intact enzyme. The structural analysis indicated that the protein is comprised of 209 amino acid residues and gave no evidence of post-translational modifications. The amino acid sequence differed from that of the deduced amino acid sequence determined by nucleotide sequence analysis of a cDNA clone (Kano, T., Sakai, M., and Muramatsu, M., 1987, Cancer Res. 47, 5626-5630) at position 104 which contained both valine and isoleucine whereas the deduced sequence from nucleotide sequence analysis identified only isoleucine at this position. These results demonstrated that in the one individual placenta studied at least two GST pi genes are coexpressed, probably as a result of allelomorphism. Computer assisted consensus sequence evaluation identified a hydrophobic region in GST pi (residues 155-181) that was predicted to be either a buried transmembrane helical region or a signal sequence region. The significance of this hydrophobic region was interpreted in relation to the mode of action of the enzyme especially in regard to the potential involvement of a histidine in the active site mechanism. A comparison of the chemical similarity of five known human GST complete enzyme structures, one of pi, one of mu, two of alpha, and one microsomal, gave evidence that all five enzymes have evolved by a divergent evolutionary process after gene duplication, with the microsomal enzyme representing the most divergent form.

  15. [Complete genomic sequence of a watermelon isolate of cucumber green mottle mosaic virus in northern China].

    PubMed

    Chen, Hong-yun; Lin, Shi-ming; Chen, Qing; Zhao, Wen-jun; Liao, Fu-rong; Chen, Hong-jun; Zhu, Shui-fang

    2009-01-01

    The complete genomic sequence of a watermelon isolate of Cucumber green mottle mosaic virus (CGMMV-LN) in Liaoning province was determined and compared with other cucurbit-infecting tobamoviruses. The genomic RNA of CGMMV-LN comprised 6422 nt, and the 5'- and 3'-noncoding regions consisted of 59 nt and 175 nt, respectively. The four encoded proteins were two replicase proteins of 186 kD and 129 kD, a movement protein of 29 kD and a coat protein of 17.4 kD. Alignment of the complete nucleotide sequences showed that CGMMV-LN shared identities of 97.6%-99.3% with four other CGMMV isolates, but only 61.7%-62.8% with three other tobamoviruses. Homology trees generated from the 186 kD replicase proteins and the coat proteins suggested that cucurbit-infecting tobamoviruses could be separated into two subgroups: subgroup I comprising all the isolates of CGMMV and subgroup II comprising Cucumber fruit mottle mosaic virus, Kyuri green mottle mosaic virus and Zucchini green mottle mosaic virus.

  16. Comparative chloroplast genomics: Analyses including new sequences from the angiosperms Nuphar advena and Ranunculus macranthus

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Raubeso, Linda A.; Peery, Rhiannon; Chumley, Timothy W.

    2007-03-01

    The number of completely sequenced plastid genomes available is growing rapidly. This new array of sequences presents new opportunities to perform comparative analyses. In comparative studies, it is most useful to compare across wide phylogenetic spans and, within angiosperms, to include representatives from basally diverging lineages such as the new genomes reported here: Nuphar advena (from a basal-most lineage) and Ranunculus macranthus (from the basal group of eudicots). We report these two new plastid genome sequences and make comparisons (within angiosperms, seed plants, or all photosynthetic lineages) to evaluate features such as the status of ycf15 and ycf68 as protein coding genes, the distribution of simple sequence repeats (SSRs) and longer dispersed repeats (SDR), and patterns of nucleotide composition.

  17. Simultaneous Differentiation and Typing of Entamoeba histolytica and Entamoeba dispar

    PubMed Central

    Zaki, Mehreen; Meelu, Parool; Sun, Wei; Clark, C. Graham

    2002-01-01

    Sequences corresponding to some of the polymorphic loci previously reported from Entamoeba histolytica have been detected in Entamoeba dispar. Comparison of nucleotide sequences of two loci between E. dispar strain SAW760 and E. histolytica strain HM-1:IMSS revealed significant differences in both repeat and flanking regions. The tandem repeat units varied not only in sequence but also in number and arrangement between the two species at both the loci. Using the sequences obtained, primer pairs aimed at amplifying species-specific products were designed and tested on a variety of E. histolytica and E. dispar samples. Amplification results were in complete agreement with the original species classification in all cases, and the PCR products displayed discernible size and pattern variations among the isolates. PMID:11923344

  18. [Study on the genetic difference of SEO type Hantaviruses].

    PubMed

    Zhang, X; Zhou, S; Wang, H; Hu, J; Guan, Z; Liu, H

    2000-10-01

    To understand the genetic types of hantaviruses carried by rodents in Beijing and the differences between them, and to further explore the source of infection. Hantavirus RNA, isolated from the lungs of rodents captured in Beijing that were positive for hantavirus antigens by frozen sectioning and immunofluorescence assay, was reverse-transcribed and amplified by PCR with hantavirus-specific primers. Five of the PCR products were sequenced, yielding 300 bp of M segment sequence data (nt 2003-2302 according to the cDNA of the Seoul 8039 strain). Nucleotide sequence homology showed that they were sequences of SEO-type hantavirus. Compared with SEO-type hantavirus, the nucleotide sequence homology of these samples was more than 94%, while the amino acid sequence homology was more than 98%. When compared with HNT-type hantavirus, the nucleotide sequence homology was less than 72% and the amino acid sequence homology less than 81%. As with other SEO-type hantaviruses, their nucleotide sequences and deduced amino acid sequences were highly conserved. Phylogenetic tree analysis showed that the five viruses could be divided into at least 4 branches. It was quite likely that at least two subtypes of SEO virus, with 4 branches, were circulating in Beijing.

  19. GFinisher: a new strategy to refine and finish bacterial genome assemblies

    NASA Astrophysics Data System (ADS)

    Guizelini, Dieval; Raittz, Roberto T.; Cruz, Leonardo M.; Souza, Emanuel M.; Steffens, Maria B. R.; Pedrosa, Fabio O.

    2016-10-01

    Despite developments in DNA sequencing technology that have improved the number and the length of reads, the process of reconstructing complete genome sequences, the so-called genome assembly, is still complex. Only 13% of the prokaryotic genome sequencing projects have been completed. Draft genome sequences deposited in public databases are fragmented into contigs and may lack the full gene complement. The aim of the present work is to identify assembly errors and improve the assembly process of bacterial genomes. The biological patterns observed in genomic sequences and the application of a priori information can allow the identification of misassembled regions, and the reorganization and improvement of the overall de novo genome assembly. GFinisher starts by generating fuzzy GC skew graphs for each contig in an assembly and then breaks the contigs at critical points in order to reassemble and close them using jFGap. This has been successfully applied to datasets from 96 genome assemblies, decreasing the number of contigs by up to 86%. GFinisher can easily optimize assemblies of prokaryotic draft genomes and can be used to improve the assembly programs based on nucleotide sequence patterns in the genome. The software and source code are available at http://gfinisher.sourceforge.net/.

  20. GFinisher: a new strategy to refine and finish bacterial genome assemblies.

    PubMed

    Guizelini, Dieval; Raittz, Roberto T; Cruz, Leonardo M; Souza, Emanuel M; Steffens, Maria B R; Pedrosa, Fabio O

    2016-10-10

    Despite developments in DNA sequencing technology that have improved the number and the length of reads, the process of reconstructing complete genome sequences, the so-called genome assembly, is still complex. Only 13% of the prokaryotic genome sequencing projects have been completed. Draft genome sequences deposited in public databases are fragmented into contigs and may lack the full gene complement. The aim of the present work is to identify assembly errors and improve the assembly process of bacterial genomes. The biological patterns observed in genomic sequences and the application of a priori information can allow the identification of misassembled regions, and the reorganization and improvement of the overall de novo genome assembly. GFinisher starts by generating fuzzy GC skew graphs for each contig in an assembly and then breaks the contigs at critical points in order to reassemble and close them using jFGap. This has been successfully applied to datasets from 96 genome assemblies, decreasing the number of contigs by up to 86%. GFinisher can easily optimize assemblies of prokaryotic draft genomes and can be used to improve the assembly programs based on nucleotide sequence patterns in the genome. The software and source code are available at http://gfinisher.sourceforge.net/.
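
    For readers unfamiliar with the GC skew signal mentioned in the GFinisher records, the sketch below computes the conventional GC skew, (G - C) / (G + C), over sliding windows of a contig. It is a minimal illustration under stated assumptions: it is not GFinisher's fuzzy GC skew method or its source code, and the function names, window size and step are hypothetical choices.

```python
from typing import List


def gc_skew(seq: str) -> float:
    """Conventional GC skew, (G - C) / (G + C); 0.0 when the sequence has no G or C."""
    s = seq.upper()
    g = s.count("G")
    c = s.count("C")
    if g + c == 0:
        return 0.0
    return (g - c) / (g + c)


def windowed_gc_skew(seq: str, window: int, step: int) -> List[float]:
    """GC skew for each full window of length `window`, advancing by `step` bases."""
    if window <= 0 or step <= 0:
        raise ValueError("window and step must be positive")
    return [
        gc_skew(seq[start:start + window])
        for start in range(0, len(seq) - window + 1, step)
    ]


# Toy contig (made up for illustration): G-rich first half, C-rich second half.
toy_contig = "GGGGATGG" + "CCCCATCC"
print(windowed_gc_skew(toy_contig, window=8, step=8))
```

    A sign change in the windowed skew along a bacterial chromosome is commonly associated with the replication origin or terminus, which is why abrupt, unexpected changes within a contig can hint at a misassembly.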

  1. Complete nucleotide sequences and construction of full-length infectious cDNA clones of Cucumber green mottle mosaic virus (CGMMV) in a versatile newly developed binary vector including both 35S and T7 promoters

    USDA-ARS's Scientific Manuscript database

    Seed-transmitted viruses have caused significant damage to watermelon crops in Korea in recent years, with Cucumber green mottle mosaic virus (CGMMV) infection widespread as a result of infected seed lots. To determine the likely origin of CGMMV infection, we collected CGMMV isolates from watermelon...

  2. Resolving the tips of the tree of life: How much mitochondrial data do we need?

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Bonett, Ronald M.; Macey, J. Robert; Boore, Jeffrey L.

    2005-04-29

    Mitochondrial (mt) DNA sequences are used extensively to reconstruct evolutionary relationships among recently diverged animals, and have constituted the most widely used markers for species- and generic-level relationships for the last decade or more. However, most studies to date have employed relatively small portions of the mt-genome. In contrast, complete mt-genomes primarily have been used to investigate deep divergences, including several studies of the amount of mt sequence necessary to recover ancient relationships. We sequenced and analyzed 24 complete mt-genomes from a group of salamander species exhibiting divergences typical of those in many species-level studies. We present the first comprehensive investigation of the amount of mt sequence data necessary to consistently recover the mt-genome tree at this level, using parsimony and Bayesian methods. Both methods of phylogenetic analysis revealed extremely similar results. A surprising number of well supported, yet conflicting, relationships were found in trees based on fragments less than ~2000 nucleotides (nt), typical of the vast majority of the thousands of mt-based studies published to date. Large amounts of data (11,500+ nt) were necessary to consistently recover the whole mt-genome tree. Some relationships consistently were recovered with fragments of all sizes, but many nodes required the majority of the mt-genome to stabilize, particularly those associated with short internal branches. Although moderate amounts of data (2000-3000 nt) were adequate to recover mt-based relationships for which most nodes were congruent with the whole mt-genome tree, many thousands of nucleotides were necessary to resolve rapid bursts of evolution. Recent advances in genomics are making collection of large amounts of sequence data highly feasible, and our results provide the basis for comparative studies of other closely related groups to optimize mt sequence sampling and phylogenetic resolution at the "tips" of the Tree of Life.

  3. Complete genome characterization of a novel enterovirus type EV-B106 isolated in China, 2012.

    PubMed

    Tang, Jingjing; Tao, Zexin; Ding, Zhengrong; Zhang, Yong; Zhang, Jie; Tian, Bingjun; Zhao, Zhixian; Zhang, Lifen; Xu, Wenbo

    2014-03-03

    Human enterovirus B106 (EV-B106) is a recently identified member of enterovirus species B. In this study, we report the complete genomic characterization of an EV-B106 strain (148/YN/CHN/12) isolated from an acute flaccid paralysis patient in Yunnan Province, China. The new strain had 79.2-81.3% nucleotide and 89.1-94.8% amino acid similarity in the VP1 region with the other two EV-B106 strains from Bolivia and Pakistan. When compared with other EV serotypes, it had the highest (73.3%) VP1 nucleotide similarity with the EV-B77 prototype strain CF496-99. However, when aligned with all EV-B106 and EV-B77 sequences available from the GenBank database, two major frame shifts were observed in the VP1 coding region, which resulted in substantial (20.5%) VP1 amino acid divergence between the two serotypes. Phylogenetic analysis and similarity plot analysis revealed multiple recombination events in the genome of this strain. This is the first report of the complete genome of EV-B106.

  4. Equid herpesvirus 8: Complete genome sequence and association with abortion in mares

    PubMed Central

    Garvey, Marie; Suárez, Nicolás M.; Kerr, Karen; Hector, Ralph; Moloney-Quinn, Laura; Arkins, Sean; Davison, Andrew J.

    2018-01-01

    Equid herpesvirus 8 (EHV-8), formerly known as asinine herpesvirus 3, is an alphaherpesvirus that is closely related to equid herpesviruses 1 and 9 (EHV-1 and EHV-9). The pathogenesis of EHV-8 is relatively little studied and to date has only been associated with respiratory disease in donkeys in Australia and horses in China. A single EHV-8 genome sequence has been generated for strain Wh in China, but is apparently incomplete and contains frameshifts in two genes. In this study, the complete genome sequences of four EHV-8 strains isolated in Ireland between 2003 and 2015 were determined by Illumina sequencing. Two of these strains were isolated from cases of abortion in horses, and were misdiagnosed initially as EHV-1, and two were isolated from donkeys, one with neurological disease. The four genome sequences are very similar to each other, exhibiting greater than 98.4% nucleotide identity, and their phylogenetic clustering together demonstrated that genomic diversity is not dependent on the host. Comparative genomic analysis revealed 24 of the 76 predicted protein sequences are completely conserved among the Irish EHV-8 strains. Evolutionary comparisons indicate that EHV-8 is phylogenetically closer to EHV-9 than it is to EHV-1. In summary, the first complete genome sequences of EHV-8 isolates from two host species over a twelve year period are reported. The current study suggests that EHV-8 can cause abortion in horses. The potential threat of EHV-8 to the horse industry and the possibility that donkeys may act as reservoirs of infection warrant further investigation. PMID:29414990

  5. NCBI Reference Sequence (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins

    PubMed Central

    Pruitt, Kim D.; Tatusova, Tatiana; Maglott, Donna R.

    2005-01-01

    The National Center for Biotechnology Information (NCBI) Reference Sequence (RefSeq) database (http://www.ncbi.nlm.nih.gov/RefSeq/) provides a non-redundant collection of sequences representing genomic data, transcripts and proteins. Although the goal is to provide a comprehensive dataset representing the complete sequence information for any given species, the database pragmatically includes sequence data that are currently publicly available in the archival databases. The database incorporates data from over 2400 organisms and includes over one million proteins representing significant taxonomic diversity spanning prokaryotes, eukaryotes and viruses. Nucleotide and protein sequences are explicitly linked, and the sequences are linked to other resources including the NCBI Map Viewer and Gene. Sequences are annotated to include coding regions, conserved domains, variation, references, names, database cross-references, and other features using a combined approach of collaboration and other input from the scientific community, automated annotation, propagation from GenBank and curation by NCBI staff. PMID:15608248

  6. Complete nucleotide sequence of the freshwater unicellular cyanobacterium Synechococcus elongatus PCC 6301 chromosome: gene content and organization.

    PubMed

    Sugita, Chieko; Ogata, Koretsugu; Shikata, Masamitsu; Jikuya, Hiroyuki; Takano, Jun; Furumichi, Miho; Kanehisa, Minoru; Omata, Tatsuo; Sugiura, Masahiro; Sugita, Mamoru

    2007-01-01

    The entire genome of the unicellular cyanobacterium Synechococcus elongatus PCC 6301 (formerly Anacystis nidulans Berkeley strain 6301) was sequenced. The genome consisted of a circular chromosome 2,696,255 bp long. A total of 2,525 potential protein-coding genes, two sets of rRNA genes, 45 tRNA genes representing 42 tRNA species, and several genes for small stable RNAs were assigned to the chromosome by similarity searches and computer predictions. The translated products of 56% of the potential protein-coding genes showed sequence similarities to experimentally identified and predicted proteins of known function, and the products of 35% of the genes showed sequence similarities to the translated products of hypothetical genes. The remaining 9% of genes lacked significant similarities to genes for predicted proteins in the public DNA databases. Some 139 genes coding for photosynthesis-related components were identified. Thirty-seven genes for two-component signal transduction systems were also identified. This is the smallest number of such genes identified in cyanobacteria, except for marine cyanobacteria, suggesting that only simple signal transduction systems are found in this strain. The gene arrangement and nucleotide sequence of Synechococcus elongatus PCC 6301 were nearly identical to those of a closely related strain Synechococcus elongatus PCC 7942, except for the presence of a 188.6 kb inversion. The sequences as well as the gene information shown in this paper are available in the Web database, CYORF (http://www.cyano.genome.jp/).

  7. Nucleotide sequence analysis of the 3' terminal region of a wasabi strain of crucifer tobamovirus genomic RNA: subgrouping of crucifer tobamoviruses.

    PubMed

    Shimamoto, I; Sonoda, S; Vazquez, P; Minaka, N; Nishiguchi, M

    1998-01-01

    The sequence of the 3' terminal 2378 nucleotides of a wasabi strain of crucifer tobamovirus (CTMV-W) infectious to crucifer plants was determined. This includes the 3' non-coding region of 235 nucleotides, coat protein (CP) gene (468 nucleotides), movement protein (MP) gene (798 nucleotides) and C-terminal partial readthrough portion of 180 K protein gene (940 nucleotides). Comparison of the sequence with homologous regions of thirteen other tobamovirus genomes showed that it had much higher identity to those of four other crucifer tobamoviruses, 85.2% to cr-TMV and turnip vein-clearing virus (TVCV), 87.4% to oilseed rape mosaic virus (ORMV) and 87.1% to TMV-Cg, than to those of other tobamoviruses. Thus CTMV-W was most similar to ORMV and TMV-Cg in sequence, but only marginally so, whereas the location and size of its MP gene were the same as those of cr-TMV and TVCV. These results, together with other analyses, show that CTMV-W is a new crucifer tobamovirus, that the five crucifer tobamoviruses can be classified into two subgroups based on MP gene organization, and that the rate of sequence change is not the same in all lineages.

  8. Genetic Diversity and Phylogenetic Evolution of Tibetan Sheep Based on mtDNA D-Loop Sequences

    PubMed Central

    Yue, Yaojing; Guo, Xian; Guo, Tingting; Chu, Min; Wang, Fan; Han, Jilong; Feng, Ruilin; Sun, Xiaoping; Niu, Chune; Yang, Bohui; Guo, Jian; Yuan, Chao

    2016-01-01

    The molecular and population genetic evidence of the phylogenetic status of the Tibetan sheep (Ovis aries) is not well understood, and little is known about this species’ genetic diversity. This knowledge gap is partly due to the difficulty of sample collection. This is the first work to address this question. Here, the genetic diversity and phylogenetic relationship of 636 individual Tibetan sheep from fifteen populations were assessed using 642 complete sequences of the mitochondrial DNA D-loop. Samples were collected from the Qinghai-Tibetan Plateau area in China, and reference data were obtained from the six reference breed sequences available in GenBank. The length of the sequences varied considerably, between 1031 and 1259 bp. The haplotype diversity and nucleotide diversity were 0.992±0.010 and 0.019±0.001, respectively. The average number of nucleotide differences was 19.635. The mean nucleotide composition of the 350 haplotypes was 32.961% A, 29.708% T, 22.892% C, 14.439% G, 62.669% A+T, and 37.331% G+C. Phylogenetic analysis showed that all four previously defined haplogroups (A, B, C, and D) were found in the 636 individuals of the fifteen Tibetan sheep populations but that only the D haplogroup was found in Linzhou sheep. Further, the clustering analysis divided the fifteen Tibetan sheep populations into at least two clusters. The estimation of the demographic parameters from the mismatch analyses showed that haplogroups A, B, and C had at least one demographic expansion in Tibetan sheep. These results contribute to the knowledge of Tibetan sheep populations and will help inform future conservation programs about the Tibetan sheep native to the Qinghai-Tibetan Plateau. PMID:27463976

  9. Complementary DNA cloning and molecular evolution of opine dehydrogenases in some marine invertebrates.

    PubMed

    Kimura, Tomohiro; Nakano, Toshiki; Yamaguchi, Toshiyasu; Sato, Minoru; Ogawa, Tomohisa; Muramoto, Koji; Yokoyama, Takehiko; Kan-No, Nobuhiro; Nagahisa, Eizou; Janssen, Frank; Grieshaber, Manfred K

    2004-01-01

    The complete complementary DNA sequences of genes presumably coding for opine dehydrogenases from Arabella iricolor (sandworm), Haliotis discus hannai (abalone), and Patinopecten yessoensis (scallop) were determined, and partial cDNA sequences were derived for Meretrix lusoria (Japanese hard clam) and Spisula sachalinensis (Sakhalin surf clam). The primers ODH-9F and ODH-11R proved useful for amplifying the sequences for opine dehydrogenases from the 4 mollusk species investigated in this study. The sequence of the sandworm was obtained using primers constructed from the amino acid sequence of tauropine dehydrogenase, the main opine dehydrogenase in A. iricolor. The complete cDNA sequence of A. iricolor, H. discus hannai, and P. yessoensis encode 397, 400, and 405 amino acids, respectively. All sequences were aligned and compared with published databank sequences of Loligo opalescens, Loligo vulgaris (squid), Sepia officinalis (cuttlefish), and Pecten maximus (scallop). As expected, a high level of homology was observed for the cDNA from closely related species, such as for cephalopods or scallops, whereas cDNA from the other species showed lower-level homologies. A similar trend was observed when the deduced amino acid sequences were compared. Furthermore, alignment of these sequences revealed some structural motifs that are possibly related to the binding sites of the substrates. The phylogenetic trees derived from the nucleotide and amino acid sequences were consistent with the classification of species resulting from classical taxonomic analyses.

  10. The mitochondrial genome of Paraspadella gotoi is highly reduced and reveals that chaetognaths are a sister-group to protostomes

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Helfenbein, Kevin G.; Fourcade, H. Matthew; Vanjani, Rohit G.

    2004-05-01

    We report the first complete mitochondrial (mt) DNA sequence from a member of the phylum Chaetognatha (arrow worms). The Paraspadella gotoi mtDNA is highly unusual, missing 23 of the genes commonly found in animal mtDNAs, including atp6, which has otherwise been found universally to be present. Its 14 genes are unusually arranged into two groups, one on each strand. One group is punctuated by numerous non-coding intergenic nucleotides, while the other group is tightly packed, having no non-coding nucleotides, leading to speculation that there are two transcription units with differing modes of expression. The phylogenetic position of the Chaetognatha within the Metazoa has long been uncertain, with conflicting or equivocal results from various morphological analyses and rRNA sequence comparisons. Comparisons here of amino acid sequences from mitochondrially encoded proteins give a single most parsimonious tree that supports a position of Chaetognatha as sister to the protostomes studied here. From this, one can more clearly interpret the patterns of evolution of various developmental features, especially regarding the embryological fate of the blastopore.

  11. Survey of bat populations from Mexico and Paraguay for rabies.

    PubMed

    Sheeler-Gordon, L L; Smith, J S

    2001-07-01

    A mammalian survey was conducted in Mexico (October 1994-January 1996) and in Paraguay (August 1996-March 1997); a complete specimen was collected for each bat in the survey, including primary voucher specimen, ectoparasites, karyotype, and various frozen tissues. The surveys combined provided 937 brain samples (65 bat species) for rabies diagnosis. One male Lasiurus ega, collected in Paraguay, tested positive for the rabies virus (overall prevalence rate of 0.1%). Nucleotide sequence from a 300 bp region of the rabies nucleoprotein gene was compared with sequence obtained from representative rabies virus samples in the repository at the Centers for Disease Control and Prevention (Atlanta, Georgia, USA). Rabies virus extracted from the brain material of L. ega differed by only one nucleotide from a 300 bp consensus sequence (>99% homology) derived from samples for the variant of rabies virus transmitted by Lasiurus cinereus. The L. ega sequence differed by approximately 15% from the variant transmitted by Desmodus rotundus. Phylogenetic analysis found no evidence to suggest L. ega is a reservoir for rabies antigenic variant 6. The most likely explanation for rabies in L. ega was infection following contact with a rabid L. cinereus.
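
    The ">99%" figure above follows from one mismatch in a 300 bp comparison. A minimal sketch of that calculation, assuming the two sequences are already aligned to equal length; the sequences below are hypothetical placeholders, not the survey data, and `percent_identity` is an illustrative helper name:

```python
def percent_identity(seq_a: str, seq_b: str) -> float:
    """Percent of matching positions between two equal-length aligned sequences."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    if not seq_a:
        raise ValueError("sequences must be non-empty")
    matches = sum(a == b for a, b in zip(seq_a.upper(), seq_b.upper()))
    return 100.0 * matches / len(seq_a)


# Hypothetical 300-nt consensus and a sample that differs at exactly one position.
consensus = "ACGT" * 75
substitute = "A" if consensus[149] != "A" else "C"
sample = consensus[:149] + substitute + consensus[150:]

identity = percent_identity(consensus, sample)
print(f"{identity:.2f}%")  # 299 of 300 positions match
```

    Gapped alignments would need a stated convention for how gap columns are counted; this sketch deliberately ignores that case.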

  12. Comparative genomic sequence analysis of novel Helicoverpa armigera nucleopolyhedrovirus (NPV) isolated from Kenya and three other previously sequenced Helicoverpa spp. NPVs.

    PubMed

    Ogembo, Javier Gordon; Caoili, Barbara L; Shikata, Masamitsu; Chaeychomsri, Sudawan; Kobayashi, Michihiro; Ikeda, Motoko

    2009-10-01

    A newly cloned Helicoverpa armigera nucleopolyhedrovirus (HearNPV) from Kenya, HearNPV-NNg1, has a higher insecticidal activity than HearNPV-G4, which also exhibits lower insecticidal activity than HearNPV-C1. In the search for genes and/or nucleotide sequences that might be involved in the observed virulence differences among Helicoverpa spp. NPVs, the entire genome of NNg1 was sequenced and compared with previously sequenced genomes of G4, C1 and Helicoverpa zea single-nucleocapsid NPV (Hz). The NNg1 genome was 132,425 bp in length, with a total of 143 putative open reading frames (ORFs), and shared high levels of overall amino acid and nucleotide sequence identities with G4, C1 and Hz. Three NNg1 ORFs, ORF5, ORF100 and ORF124, which were shared with C1, were absent in G4 and Hz, while NNg1 and C1 were missing a homologue of G4/Hz ORF5. Another three ORFs, ORF60 (bro-b), ORF119 and ORF120, and one direct repeat sequence (dr) were unique to NNg1. Relative to the overall nucleotide sequence identity, lower sequence identities were observed between NNg1 hrs and the homologous hrs in the other three Helicoverpa spp. NPVs, despite containing the same number of hrs located at essentially the same positions on the genomes. Differences were also observed between NNg1 and each of the other three Helicoverpa spp. NPVs in the diversity of bro genes encoded on the genomes. These results indicate several putative genes and nucleotide sequences that may be responsible for the virulence differences observed among Helicoverpa spp., yet the specific genes and/or nucleotide sequences responsible have not been identified.

  13. Molecular characterization of KGH, the first human isolate of rabies virus in Korea.

    PubMed

    Park, Jun-Sun; Kim, Chi-Kyeong; Kim, Su Yeon; Ju, Young Ran

    2013-04-01

    The complete genome sequence was determined for the KGH strain, the first human rabies virus isolate in Korea, which was isolated from a skin biopsy of a patient with rabies whose symptoms developed after bites from a raccoon dog in 2001. The size of the KGH strain genome was determined to be 11,928 nucleotides (nt) with a leader sequence of 58 nt, nucleoprotein gene of 1,353 nt, phosphoprotein gene of 894 nt, matrix protein gene of 609 nt, glycoprotein gene of 1,575 nt, RNA-dependent RNA polymerase gene of 6,384 nt, and trailer region of 69 nt. Sequence similarity was compared with 39 fully sequenced rabies virus genomes currently available, and the result showed 70.6-91.6 % at the nucleotide level, and 82.8-97.9 % at the amino acid level. The deduced amino acids in the viral protein were compared with those of other rabies viruses, and various functional regions were investigated. As a result, we found that the KGH strain only had a unique amino acid substitution that was identified to be associated either with host immune response and pathogenicity in the N protein, or with a related region regulating STAT1 in the P protein, and related to pathogenicity in G protein. Based on phylogenetic analyses using the complete genome of 39 rabies viruses, the KGH strain was determined to be closely related to the NNV-RAB-H strain and transplant rabies virus serotype 1, which are Indian isolates, and was confirmed to belong to the Arctic-like 2 clade. The KGH strain was most closely related to the SKRRD0204HC and SKRRD0205HC strains when compared with Korean animal isolates, which were isolated around the same time and place, and belonged to the Gangwon III subgroup.
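
    As a quick arithmetic check on the region sizes quoted above, the sketch below sums the listed components and compares the total with the reported genome length. The dictionary keys are illustrative labels; the abstract does not say what makes up the remainder, so treating it as untranslated and intergenic sequence is only an assumption.

```python
# Region lengths in nucleotides, copied from the abstract above.
kgh_regions_nt = {
    "leader": 58,
    "N gene": 1353,
    "P gene": 894,
    "M gene": 609,
    "G gene": 1575,
    "L gene": 6384,
    "trailer": 69,
}
genome_length_nt = 11928

listed_total = sum(kgh_regions_nt.values())
# Remainder not itemized in the abstract (assumed: untranslated/intergenic regions).
unlisted = genome_length_nt - listed_total

print(listed_total, unlisted)  # 10942 986
```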

  14. Predicted stem-loop structures and variation in nucleotide sequence of 3' noncoding regions among animal calicivirus genomes.

    PubMed

    Seal, B S; Neill, J D; Ridpath, J F

    1994-07-01

    Caliciviruses are nonenveloped with a polyadenylated genome of approximately 7.6 kb and a single capsid protein. The "RNA Fold" computer program was used to analyze 3'-terminal noncoding sequences of five feline calicivirus (FCV), rabbit hemorrhagic disease virus (RHDV), and two San Miguel sea lion virus (SMSV) isolates. The FCV 3'-terminal sequences are 40-46 nucleotides in length and 72-91% similar. The FCV sequences were predicted to contain two possible duplex structures and one stem-loop structure with free energies of -2.1 to -18.2 kcal/mole. The RHDV genomic 3'-terminal RNA sequences are 54 nucleotides in length and share 49% sequence similarity to homologous regions of the FCV genome. The RHDV sequence was predicted to form two duplex structures in the 3'-terminal noncoding region with a single stem-loop structure, resembling that of FCV. In contrast, the SMSV 1 and 4 genomic 3'-terminal noncoding sequences were 185 and 182 nucleotides in length, respectively. Ten possible duplex structures were predicted with an average structural free energy of -35 kcal/mole. Sequence similarity between the two SMSV isolates was 75%. Furthermore, extensive cloverleaflike structures are predicted in the 3' noncoding region of the SMSV genome, in contrast to the predicted single stem-loop structures of FCV or RHDV.

  15. Terminal Duplex Stability and Nucleotide Identity Differentially Control siRNA Loading and Activity in RNA Interference

    PubMed Central

    Angart, Phillip A.; Carlson, Rebecca J.; Adu-Berchie, Kwasi

    2016-01-01

    Efficient short interfering RNA (siRNA)-mediated gene silencing requires selection of a sequence that is complementary to the intended target and possesses sequence and structural features that encourage favorable functional interactions with the RNA interference (RNAi) pathway proteins. In this study, we investigated how terminal sequence and structural characteristics of siRNAs contribute to siRNA strand loading and silencing activity and how these characteristics ultimately result in a functionally asymmetric duplex in cultured HeLa cells. Our results reiterate that the most important characteristic in determining siRNA activity is the 5′ terminal nucleotide identity. Our findings further suggest that siRNA loading is controlled principally by the hybridization stability of the 5′ terminus (Nucleotides: 1–2) of each siRNA strand, independent of the opposing terminus. Postloading, RNA-induced silencing complex (RISC)–specific activity was found to be improved by lower hybridization stability in the 5′ terminus (Nucleotides: 3–4) of the loaded siRNA strand and greater hybridization stability toward the 3′ terminus (Nucleotides: 17–18). Concomitantly, specific recognition of the 5′ terminal nucleotide sequence by human Argonaute 2 (Ago2) improves RISC half-life. These findings indicate that careful selection of siRNA sequences can maximize both the loading and the specific activity of the intended guide strand. PMID:27399870

  16. The nucleotide sequence of 5S ribosomal RNA from Micrococcus lysodeikticus.

    PubMed Central

    Hori, H; Osawa, S; Murao, K; Ishikura, H

    1980-01-01

    The nucleotide sequence of ribosomal 5S RNA from Micrococcus lysodeikticus is pGUUACGGCGGCUAUAGCGUGGGGGAAACGCCCGGCCGUAUAUCGAACCCGGAAGCUAAGCCCCAUAGCGCCGAUGGUUACUGUAACCGGGAGGUUGUGGGAGAGUAGGUCGCCGCCGUGAOH. When compared to other 5S RNAs, the sequence homology is greatest with Thermus aquaticus, and these two 5S RNAs reveal several features intermediate between those of typical gram-positive bacteria and gram-negative bacteria. PMID:6780979
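
    For readers who want to work with the sequence quoted above, the following sketch tallies its length and base composition. It assumes the leading "p" and trailing "OH" in the abstract denote the 5' phosphate and 3' hydroxyl ends rather than residues, so they are omitted from the string.

```python
from collections import Counter

# 5S rRNA sequence as printed in the abstract, without the terminal "p" and "OH" markers.
five_s_rrna = (
    "GUUACGGCGGCUAUAGCGUGGGGGAAACGCCCGGCCGUAUAUCGAACCCGGAAGCUAAGCCCC"
    "AUAGCGCCGAUGGUUACUGUAACCGGGAGGUUGUGGGAGAGUAGGUCGCCGCCGUGA"
)

counts = Counter(five_s_rrna)
length = len(five_s_rrna)
gc_percent = 100.0 * (counts["G"] + counts["C"]) / length

print(f"length: {length} nt")
for base in "ACGU":
    print(f"{base}: {counts[base]} ({100.0 * counts[base] / length:.1f}%)")
print(f"G+C: {gc_percent:.1f}%")
```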

  17. Unprecedented high-resolution view of bacterial operon architecture revealed by RNA sequencing.

    PubMed

    Conway, Tyrrell; Creecy, James P; Maddox, Scott M; Grissom, Joe E; Conkle, Trevor L; Shadid, Tyler M; Teramoto, Jun; San Miguel, Phillip; Shimada, Tomohiro; Ishihama, Akira; Mori, Hirotada; Wanner, Barry L

    2014-07-08

    We analyzed the transcriptome of Escherichia coli K-12 by strand-specific RNA sequencing at single-nucleotide resolution during steady-state (logarithmic-phase) growth and upon entry into stationary phase in glucose minimal medium. To generate high-resolution transcriptome maps, we developed an organizational schema which showed that in practice only three features are required to define operon architecture: the promoter, terminator, and deep RNA sequence read coverage. We precisely annotated 2,122 promoters and 1,774 terminators, defining 1,510 operons with an average of 1.98 genes per operon. Our analyses revealed an unprecedented view of E. coli operon architecture. A large proportion (36%) of operons are complex with internal promoters or terminators that generate multiple transcription units. For 43% of operons, we observed differential expression of polycistronic genes, despite being in the same operons, indicating that E. coli operon architecture allows fine-tuning of gene expression. We found that 276 of 370 convergent operons terminate inefficiently, generating complementary 3' transcript ends which overlap on average by 286 nucleotides, and 136 of 388 divergent operons have promoters arranged such that their 5' ends overlap on average by 168 nucleotides. We found 89 antisense transcripts of 397-nucleotide average length, 7 unannotated transcripts within intergenic regions, and 18 sense transcripts that completely overlap operons on the opposite strand. Of 519 overlapping transcripts, 75% correspond to sequences that are highly conserved in E. coli (>50 genomes). Our data extend recent studies showing unexpected transcriptome complexity in several bacteria and suggest that antisense RNA regulation is widespread. Importance: We precisely mapped the 5' and 3' ends of RNA transcripts across the E. coli K-12 genome by using a single-nucleotide analytical approach. Our resulting high-resolution transcriptome maps show that ca. one-third of E. coli operons are complex, with internal promoters and terminators generating multiple transcription units and allowing differential gene expression within these operons. We discovered extensive antisense transcription that results from more than 500 operons, which fully overlap or extensively overlap adjacent divergent or convergent operons. The genomic regions corresponding to these antisense transcripts are highly conserved in E. coli (including Shigella species), although it remains to be proven whether or not they are functional. Our observations of features unearthed by single-nucleotide transcriptome mapping suggest that deeper layers of transcriptional regulation in bacteria are likely to be revealed in the future. Copyright © 2014 Conway et al.

  18. Regions of conservation and divergence in the 3' untranslated sequences of genomic RNA from Ross River virus isolates.

    PubMed

    Faragher, S G; Dalgarno, L

    1986-07-20

    The 3' untranslated (UT) sequences of the genomic RNAs of five geographic variants of the alphavirus Ross River virus (RRV) were determined and compared with the 3' UT sequence of RRV T48, the prototype strain. Part of the 3' UT region of Getah virus, a close serological relative of RRV, was also sequenced. The RRV 3' UT region varies markedly in length between variants. Large deletions or insertions, sequence rearrangements and single nucleotide substitutions are observed. A sequence tract of 49 to 58 nucleotides, which is repeated as four blocks in the RRV T48 3' UT region, occurs only once in the 3' UT region of one RRV strain (NB5092), indicating that the existence of repeat sequence blocks is not essential for RRV replication. However, the precise sequence of the 3' proximal copy of the repeat block and its position relative to the poly(A) tail were identical in all RRV isolates examined, suggesting that it has an important role in RRV replication. Nucleotide substitutions between RRV variants are distributed non-randomly along the length of the 3' UT region. The sequence of 120 to 130 nucleotides adjacent to the poly(A) tail is strongly conserved. Getah virus RNA contains three repeat sequence blocks in the 3' UT region. These are similar in sequence to those in RRV RNA but differ in their arrangement. Homology between the RRV and Getah 3' UT sequences is greatest in the 3' proximal repeat sequence block that shows three differences in 49 nucleotides. The 3' proximal repeat in Getah RNA occurs at the same position, relative to the poly(A) tail, as in all RRV variants. The RRV and Getah virus 3' UT sequences show extensive homology in the region between the 3' proximal repeat and the poly(A) tail but, apart from the repeat blocks themselves, they show no significant homology elsewhere.

  19. Filovirus RefSeq Entries: Evaluation and Selection of Filovirus Type Variants, Type Sequences, and Names

    PubMed Central

    Kuhn, Jens H.; Andersen, Kristian G.; Bào, Yīmíng; Bavari, Sina; Becker, Stephan; Bennett, Richard S.; Bergman, Nicholas H.; Blinkova, Olga; Bradfute, Steven; Brister, J. Rodney; Bukreyev, Alexander; Chandran, Kartik; Chepurnov, Alexander A.; Davey, Robert A.; Dietzgen, Ralf G.; Doggett, Norman A.; Dolnik, Olga; Dye, John M.; Enterlein, Sven; Fenimore, Paul W.; Formenty, Pierre; Freiberg, Alexander N.; Garry, Robert F.; Garza, Nicole L.; Gire, Stephen K.; Gonzalez, Jean-Paul; Griffiths, Anthony; Happi, Christian T.; Hensley, Lisa E.; Herbert, Andrew S.; Hevey, Michael C.; Hoenen, Thomas; Honko, Anna N.; Ignatyev, Georgy M.; Jahrling, Peter B.; Johnson, Joshua C.; Johnson, Karl M.; Kindrachuk, Jason; Klenk, Hans-Dieter; Kobinger, Gary; Kochel, Tadeusz J.; Lackemeyer, Matthew G.; Lackner, Daniel F.; Leroy, Eric M.; Lever, Mark S.; Mühlberger, Elke; Netesov, Sergey V.; Olinger, Gene G.; Omilabu, Sunday A.; Palacios, Gustavo; Panchal, Rekha G.; Park, Daniel J.; Patterson, Jean L.; Paweska, Janusz T.; Peters, Clarence J.; Pettitt, James; Pitt, Louise; Radoshitzky, Sheli R.; Ryabchikova, Elena I.; Saphire, Erica Ollmann; Sabeti, Pardis C.; Sealfon, Rachel; Shestopalov, Aleksandr M.; Smither, Sophie J.; Sullivan, Nancy J.; Swanepoel, Robert; Takada, Ayato; Towner, Jonathan S.; van der Groen, Guido; Volchkov, Viktor E.; Volchkova, Valentina A.; Wahl-Jensen, Victoria; Warren, Travis K.; Warfield, Kelly L.; Weidmann, Manfred; Nichol, Stuart T.

    2014-01-01

    Sequence determination of complete or coding-complete genomes of viruses is becoming common practice for supporting the work of epidemiologists, ecologists, virologists, and taxonomists. Sequencing duration and costs are rapidly decreasing, sequencing hardware is under modification for use by non-experts, and software is constantly being improved to simplify sequence data management and analysis. Thus, analysis of virus disease outbreaks on the molecular level is now feasible, including characterization of the evolution of individual virus populations in single patients over time. The increasing accumulation of sequencing data creates a management problem for the curators of commonly used sequence databases and an entry retrieval problem for end users. Therefore, utilizing the data to their fullest potential will require setting nomenclature and annotation standards for virus isolates and associated genomic sequences. The National Center for Biotechnology Information’s (NCBI’s) RefSeq is a non-redundant, curated database for reference (or type) nucleotide sequence records that supplies source data to numerous other databases. Building on recently proposed templates for filovirus variant naming, we report consensus decisions from a majority of past and currently active filovirus experts on the eight filovirus type variants and isolates to be represented in RefSeq, their final designations, and their associated sequences. PMID:25256396

  20. Characterization of the complete mitochondrial genome of the hybrid Epinephelus moara♀ × Epinephelus lanceolatus♂, and phylogenetic analysis in subfamily Epinephelinae

    NASA Astrophysics Data System (ADS)

    Gao, Fengtao; Wei, Min; Zhu, Ying; Guo, Hua; Chen, Songlin; Yang, Guanpin

    2017-06-01

    This study presents the complete mitochondrial genome of the hybrid Epinephelus moara♀ × Epinephelus lanceolatus♂. The genome is 16,886 bp in length, and contains 13 protein-coding genes, 2 rRNA genes, 22 tRNA genes, a light-strand replication origin and a control region. Additionally, phylogenetic analysis based on the nucleotide sequences of 13 conserved protein-coding genes using the maximum likelihood method indicated that the mitochondrial genome is maternally inherited. This study presents genomic data for studying phylogenetic relationships and breeding of hybrid Epinephelinae.

  1. Update on Pneumocystis carinii f. sp. hominis Typing Based on Nucleotide Sequence Variations in Internal Transcribed Spacer Regions of rRNA Genes

    PubMed Central

    Lee, Chao-Hung; Helweg-Larsen, Jannik; Tang, Xing; Jin, Shaoling; Li, Baozheng; Bartlett, Marilyn S.; Lu, Jang-Jih; Lundgren, Bettina; Lundgren, Jens D.; Olsson, Mats; Lucas, Sebastian B.; Roux, Patricia; Cargnel, Antonietta; Atzori, Chiara; Matos, Olga; Smith, James W.

    1998-01-01

    Pneumocystis carinii f. sp. hominis isolates from 207 clinical specimens from nine countries were typed based on nucleotide sequence variations in the internal transcribed spacer regions I and II (ITS1 and ITS2, respectively) of rRNA genes. The number of ITS1 nucleotides has been revised from the previously reported 157 bp to 161 bp. Likewise, the number of ITS2 nucleotides has been changed from 177 to 192 bp. The number of ITS1 sequence types has increased from 2 to 15, and that of ITS2 has increased from 3 to 14. The 15 ITS1 sequence types are designated types A through O, and the 14 ITS2 types are named types a through n. A total of 59 types of P. carinii f. sp. hominis were found in this study. PMID:9508304

  2. Nucleotide Sequence Diversity and Linkage Disequilibrium of Four Nuclear Loci in Foxtail Millet (Setaria italica).

    PubMed

    He, Shui-Lian; Yang, Yang; Morrell, Peter L; Yi, Ting-Shuang

    2015-01-01

    Foxtail millet (Setaria italica (L.) Beauv) is one of the earliest domesticated grains; it was cultivated in northern China by 8,700 years before present (YBP) and across Eurasia by 4,000 YBP. Owing to a small genome and diploid nature, foxtail millet is a tractable model crop for studying functional genomics of millets and bioenergy grasses. In this study, we examined nucleotide sequence diversity, geographic structure, and levels of linkage disequilibrium (LD) at four nuclear loci (ADH1, G3PDH, IGS1 and TPI1) in representative samples of 311 landrace accessions across its cultivated range. Higher levels of nucleotide sequence and haplotype diversity were observed in samples from China relative to other sampled regions. Genetic assignment analysis classified the accessions into seven clusters based on nucleotide sequence polymorphisms. Intralocus LD decayed rapidly to half the initial value within ~1.2 kb or less.

  3. The complete genome sequence of the Atlantic salmon paramyxovirus (ASPV)

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Nylund, Stian; Karlsen, Marius; Nylund, Are

    2008-03-30

    The complete RNA genome of the Atlantic salmon paramyxovirus (ASPV), isolated from Atlantic salmon suffering from proliferative gill inflammation (PGI), has been determined. The genome is 16,965 nucleotides in length and consists of six nonoverlapping genes in the order 3'- N - P/C/V - M - F - HN - L -5', coding for the nucleocapsid, phospho-, matrix, fusion, hemagglutinin-neuraminidase and large polymerase proteins, respectively. The gene junctions contain highly conserved transcription start and stop signal sequences and trinucleotide intergenic regions similar to those of other Paramyxoviridae. The ASPV P-gene expression strategy is like that of the respiro- and morbilliviruses, which express the phosphoprotein from the primary transcript, and edit a portion of the mRNA to encode the accessory proteins V and W. It also encodes the C-protein by ribosomal choice of translation initiation. Pairwise comparisons of amino acid identities, and phylogenetic analysis of deduced ASPV protein sequences with homologous sequences from other Paramyxoviridae, show that ASPV has an affinity for the genus Respirovirus, but may represent a new genus within the subfamily Paramyxovirinae.

  4. Sequence Analysis of Mitochondrial Genome of Toxascaris leonina from a South China Tiger.

    PubMed

    Li, Kangxin; Yang, Fang; Abdullahi, A Y; Song, Meiran; Shi, Xianli; Wang, Minwei; Fu, Yeqi; Pan, Weida; Shan, Fang; Chen, Wu; Li, Guoqing

    2016-12-01

    Toxascaris leonina is a common parasitic nematode of wild mammals and has significant impacts on the protection of rare wild animals. To analyze population genetic characteristics of T. leonina from the South China tiger, its mitochondrial (mt) genome was sequenced. Its complete circular mt genome was 14,277 bp in length, including 12 protein-coding genes, 22 tRNA genes, 2 rRNA genes, and 2 non-coding regions. The nucleotide composition was biased toward A and T. The most common start codon and stop codon were TTG and TAG, and 4 genes ended with an incomplete stop codon. There were 13 intergenic regions ranging from 1 to 10 bp in size. Phylogenetically, T. leonina from a South China tiger was close to canine T. leonina. This study reports for the first time a complete mt genome sequence of T. leonina from the South China tiger, and provides a scientific basis for studying the genetic diversity of nematodes between different hosts.

  5. Complete mitochondrial genome of the Yellownose skate: Zearaja chilensis (Rajiformes, Rajidae).

    PubMed

    Jeong, Dageum; Lee, Youn-Ho

    2016-01-01

    The complete sequence of mitochondrial DNA of a Yellownose skate, Zearaja chilensis, was determined for the first time. It is 16,909 bp in length, covering 2 rRNA, 22 tRNA and 13 protein-coding genes, with gene order and structure identical to those of other Rajidae species. The nucleotide composition of the L-strand is low in G (14.3%) and slightly high in A + T (58.9%). A strong codon usage bias against G (6.0%) is found at the third codon positions. Twelve of the 13 protein-coding genes use ATG as the start codon, while COX1 starts with GTG. As for the stop codon, only ND4 shows an incomplete stop codon, TA. This is the first report of the mitogenome for a species in the genus Zearaja, providing a valuable source of genetic information on the evolution of the family Rajidae and the genus Zearaja as well as for establishment of a sustainable fishery management plan for the species.
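
    The third-codon-position bias mentioned above can be measured by sampling every third base of an in-frame coding sequence. A minimal sketch under that assumption; `third_position_percent` is an illustrative helper and the toy sequence is hypothetical, not taken from the Z. chilensis mitogenome:

```python
from collections import Counter


def third_position_percent(cds: str) -> dict:
    """Percent of A, C, G and T at third codon positions of an in-frame coding sequence."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing incomplete codon
    thirds = cds[2:usable:3]          # bases at positions 3, 6, 9, ...
    if not thirds:
        raise ValueError("sequence must contain at least one complete codon")
    counts = Counter(thirds)
    return {base: 100.0 * counts[base] / len(thirds) for base in "ACGT"}


# Hypothetical toy sequence: codons ATG GCC TTA ACA, plus an incomplete trailing "T".
freq = third_position_percent("ATGGCCTTAACAT")
print(freq)
```

    For a whole mitogenome the same function would be applied to each protein-coding gene in its own reading frame and the counts pooled, rather than to the raw genome string.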

  6. Interaction of influenza virus polymerase with viral RNA in the 'corkscrew' conformation.

    PubMed

    Flick, R; Hobom, G

    1999-10-01

    The influenza virus RNA (vRNA) promoter structure is known to consist of the 5'- and 3'-terminal sequences of the RNA, within very narrow boundaries of 16 and 15 nucleotides, respectively. A complete set of single nucleotide substitutions led to the previously proposed model of a binary hooked or 'corkscrew' conformation for the vRNA promoter when it interacts with the viral polymerase. This functional structure is confirmed here with a complete set of complementary double substitutions, of both the regular A:U and G:C type and also the G:U type of base-pair exchanges. The proposed structure consists of a six base-pair RNA rod in the distal element in conjunction with two stem-loop structures of two short-range base-pairs (positions 2-9; 3-8). These support an exposed tetranucleotide loop within each branch of the proximal element, in an overall oblique organization due to a central unpaired A residue at position 10 in the 5' sequence. Long-range base-pairing between the entire 5' and 3' branches, as required for an unmodified 'panhandle' model, has been excluded for the proximal element, while it is known to represent the mode of interaction within the distal element. A large number of short-range base-pair exchanges in the proximal element constitute promoter-up mutations, which show activities several times above that of the wild-type in reporter gene assays. The unique overall conformation and rather few invariant nucleotides appear to be the core elements in vRNA recognition by polymerase and also in viral ribonucleoprotein packaging, to allow discrimination against the background of other RNA molecules in the cell.

  7. Determination and analysis of the complete genome sequence of Paralichthys olivaceus rhabdovirus (PORV).

    PubMed

    Zhu, Ruo-Lin; Zhang, Qi-Ya

    2014-04-01

    Paralichthys olivaceus rhabdovirus (PORV), which is associated with high mortality rates in flounder, was isolated in China in 2005. Here, we provide an annotated sequence record of PORV, the genome of which comprises 11,182 nucleotides and contains six genes in the order 3'-N-P-M-G-NV-L-5'. Phylogenetic analysis based on glycoprotein sequences of PORV and other rhabdoviruses showed that PORV clusters with viral haemorrhagic septicemia virus (VHSV), genus Novirhabdovirus, family Rhabdoviridae. Further phylogenetic analysis of the combined amino acid sequences of six proteins of PORV and VHSV strains showed that PORV clusters with Korean strains and is closely related to Asian strains, all of which were isolated from flounder. In a comparison in which the sequences of the six proteins were combined, PORV shared the highest identity (98.3 %) with VHSV strain KJ2008 from Korea.

  8. GenBank

    PubMed Central

    Benson, Dennis A.; Karsch-Mizrachi, Ilene; Lipman, David J.; Ostell, James; Wheeler, David L.

    2007-01-01

    GenBank® is a comprehensive database that contains publicly available nucleotide sequences for more than 240 000 named organisms, obtained primarily through submissions from individual laboratories and batch submissions from large-scale sequencing projects. Most submissions are made using the web-based BankIt or standalone Sequin programs and accession numbers are assigned by GenBank staff upon receipt. Daily data exchange with the EMBL Data Library in Europe and the DNA Data Bank of Japan ensures worldwide coverage. GenBank is accessible through NCBI's retrieval system, Entrez, which integrates data from the major DNA and protein sequence databases along with taxonomy, genome, mapping, protein structure and domain information, and the biomedical journal literature via PubMed. BLAST provides sequence similarity searches of GenBank and other sequence databases. Complete bimonthly releases and daily updates of the GenBank database are available by FTP. To access GenBank and its related retrieval and analysis services, begin at the NCBI Homepage. PMID:17202161

  9. Identification of a new Apscaviroid from Japanese persimmon.

    PubMed

    Nakaune, Ryoji; Nakano, Masaaki

    2008-01-01

    Three viroid-like sequences were detected in Japanese persimmon (Diospyros kaki Thunb.) by RT-PCR using primers specific for members of the genus Apscaviroid. Based on the sequences, we determined the complete genomic sequences. Two had 92.1-94.3% sequence identity with citrus viroid OS (CVd-OS) and 91.4-96.3% identity with apple fruit crinkle viroid (AFCVd), respectively. Another one, tentatively named persimmon viroid (PVd), had 396 nucleotides and less than 70% sequence identity with known viroids. The secondary structure of PVd is proposed to be rod-like with extensive base pairing and contains the terminal conserved region and the central conserved region characteristic of the genus Apscaviroid. Moreover, we confirmed that the viroids, including PVd, are graft transmissible from persimmon to persimmon and that persimmon is a natural host of these viroids. According to its molecular and biological properties, PVd should be considered a member of a new species in the genus Apscaviroid.

  10. Characterization, genetic diversity, and evolutionary link of Cucumber mosaic virus strain New Delhi from India.

    PubMed

    Koundal, Vikas; Haq, Qazi Mohd Rizwanul; Praveen, Shelly

    2011-02-01

    The genome of Cucumber mosaic virus New Delhi strain (CMV-ND) from India, obtained from tomato, was completely sequenced and compared with full genome sequences of 14 known CMV strains from subgroups I and II, for their genetic diversity. Sequence analysis suggests CMV-ND shares maximum sequence identity at the nucleotide level with a CMV strain from Taiwan. Among all 15 strains of CMV, the encoded protein 2b is least conserved, whereas the coat protein (CP) is most conserved. Sequence identity values and phylogram results indicate that CMV-ND belongs to subgroup I. Based on the recombination detection program result, it appears that CMV is prone to recombination, and different RNA components of CMV-ND have evolved differently. Recombinational analysis of all 15 CMV strains detected maximum recombination breakpoints in RNA2; CP showed the least recombination sites.

  11. Nucleotide sequence composition and method for detection of Neisseria gonorrhoeae

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Lo, A.; Yang, H.L.

    1990-02-13

    This patent describes a composition of matter that is specific for Neisseria gonorrhoeae. It comprises at least one nucleotide sequence for which the ratio of the amount of the sequence that hybridizes to chromosomal DNA of Neisseria gonorrhoeae to the amount that hybridizes to chromosomal DNA of Neisseria meningitidis is greater than about five. The ratio is obtained by a method described in the patent.

  12. Whole exome sequencing to estimate alloreactivity potential between donors and recipients in stem cell transplantation

    PubMed Central

    Sampson, Juliana K.; Sheth, Nihar U.; Koparde, Vishal N.; Scalora, Allison F.; Serrano, Myrna G.; Lee, Vladimir; Roberts, Catherine H.; Jameson-Lee, Max; Ferreira-Gonzalez, Andrea; Manjili, Masoud H.; Buck, Gregory A.; Neale, Michael C.; Toor, Amir A.

    2016-01-01

    Whole exome sequencing (WES) was performed on stem cell transplant donor-recipient (D-R) pairs to determine the extent of potential antigenic variation at a molecular level. In a small cohort of D-R pairs, a high frequency of sequence variation was observed between the donor and recipient exomes independent of human leucocyte antigen (HLA) matching. Nonsynonymous, nonconservative single nucleotide polymorphisms were approximately twice as frequent in HLA-matched unrelated, compared with related D-R pairs. When mapped to individual chromosomes, these polymorphic nucleotides were uniformly distributed across the entire exome. In conclusion, WES reveals extensive nucleotide sequence variation in the exomes of HLA-matched donors and recipients. PMID:24749631

  13. Complete chloroplast genome sequence and comparative analysis of loblolly pine (Pinus taeda L.) with related species

    PubMed Central

    Khan, Abdul Latif; Khan, Muhammad Aaqil; Shahzad, Raheem; Lubna; Kang, Sang Mo; Al-Harrasi, Ahmed; Al-Rawahi, Ahmed; Lee, In-Jung

    2018-01-01

    Pinaceae, the largest family of conifers, has a diversified organization of chloroplast (cp) genomes with two typical highly reduced inverted repeats (IRs). In the current study, we determined the complete sequence of the cp genome of an economically and ecologically important conifer tree, the loblolly pine (Pinus taeda L.), using Illumina paired-end sequencing and compared the sequence with those of other pine species. The results revealed a genome size of 121,531 base pairs (bp) containing a pair of 830-bp IR regions, distinguished by a small single copy (42,258 bp) and large single copy (77,614 bp) region. The chloroplast genome of P. taeda encodes 120 genes, comprising 81 protein-coding genes, four ribosomal RNA genes, and 35 tRNA genes, with 151 randomly distributed microsatellites. Approximately 6 palindromic, 34 forward, and 22 tandem repeats were found in the P. taeda cp genome. Whole cp genome comparison with those of other Pinus species exhibited an overall high degree of sequence similarity, with some divergence in intergenic spacers. Higher and lower numbers of indels and single-nucleotide polymorphism substitutions were observed relative to P. contorta and P. monophylla, respectively. Phylogenomic analyses based on the complete genome sequence revealed that 60 shared genes generated trees with the same topologies, and P. taeda was closely related to P. contorta in the subgenus Pinus. Thus, the complete P. taeda genome provided valuable resources for population and evolutionary studies of gymnosperms and can be used to identify related species. PMID:29596414

  14. The nucleotide sequences of 5S rRNAs from a fern Dryopteris acuminata and a horsetail Equisetum arvense.

    PubMed Central

    Hori, H; Osawa, S; Takaiwa, F; Sugiura, M

    1984-01-01

    The nucleotide sequences of 5S rRNAs from two Pteridophyta species, a fern Dryopteris acuminata and a horsetail Equisetum arvense, have been determined. These two sequences are more related to those of the Bryophyta species (88% identity on average) than to those of seed plants (84% identity on average). PMID:6538332

  15. Energy efficiency trade-offs drive nucleotide usage in transcribed regions

    PubMed Central

    Chen, Wei-Hua; Lu, Guanting; Bork, Peer; Hu, Songnian; Lercher, Martin J.

    2016-01-01

    Efficient nutrient usage is a trait under universal selection. A substantial part of cellular resources is spent on making nucleotides. We thus expect preferential use of cheaper nucleotides especially in transcribed sequences, which are often amplified thousand-fold compared with genomic sequences. To test this hypothesis, we derive a mutation-selection-drift equilibrium model for nucleotide skews (strand-specific usage of 'A' versus 'T' and 'G' versus 'C'), which explains nucleotide skews across 1,550 prokaryotic genomes as a consequence of selection on efficient resource usage. Transcription-related selection generally favours the cheaper nucleotides 'U' and 'C' at synonymous sites. However, the information encoded in mRNA is further amplified through translation. Due to unexpected trade-offs in the codon table, cheaper nucleotides encode on average energetically more expensive amino acids. These trade-offs apply to both strand-specific nucleotide usage and GC content, causing a universal bias towards the more expensive nucleotides 'A' and 'G' at non-synonymous coding sites. PMID:27098217
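
The strand-specific skews referred to in this abstract are conventionally computed as (A - T)/(A + T) and (G - C)/(G + C) on a single strand. The following is a minimal sketch under that assumption; the abstract does not state the exact formula the authors used, and the function name `nucleotide_skews` is hypothetical.

```python
from collections import Counter


def nucleotide_skews(seq):
    """Return (AT skew, GC skew) for one strand of a DNA sequence.

    Assumes the conventional definitions (A - T)/(A + T) and
    (G - C)/(G + C); a skew is reported as 0.0 when its
    denominator is zero.
    """
    counts = Counter(seq.upper())
    a, t, g, c = (counts[base] for base in "ATGC")
    at_skew = (a - t) / (a + t) if (a + t) else 0.0
    gc_skew = (g - c) / (g + c) if (g + c) else 0.0
    return at_skew, gc_skew
```

A positive AT skew means 'A' is used more often than 'T' on the strand examined; the sign flips on the complementary strand.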

  16. Detection of possible restriction sites for type II restriction enzymes in DNA sequences.

    PubMed

    Gagniuc, P; Cimponeriu, D; Ionescu-Tîrgovişte, C; Mihai, Andrada; Stavarachi, Monica; Mihai, T; Gavrilă, L

    2011-01-01

    In order to make a step forward in the knowledge of the mechanism operating in complex polygenic disorders such as diabetes and obesity, this paper proposes a new algorithm (PRSD, possible restriction site detection) and its implementation in Applied Genetics software. This software can be used for in silico detection of potential (hidden) recognition sites for endonucleases and for identification of nucleotide repeats. The recognition sites for endonucleases may result from hidden sequences through deletion or insertion of a specific number of nucleotides. Tests were conducted on DNA sequences downloaded from NCBI servers using specific recognition sites for common type II restriction enzymes introduced in the software database (n = 126). Each possible recognition site indicated by the PRSD algorithm implemented in Applied Genetics was checked and confirmed by NEBcutter V2.0 and Webcutter 2.0 software. In the sequence NG_008724.1 (which includes 63632 nucleotides) we found a high number of potential restriction sites for EcoRI that may be produced by deletion (n = 43 sites) or insertion (n = 591 sites) of one nucleotide. The second module of Applied Genetics has been designed to find simple repeats, which may help in understanding the role of SNPs (single nucleotide polymorphisms) in the pathogenesis of complex metabolic disorders. We have tested the presence of simple repetitive sequences in five DNA sequences. The software indicated the exact position of each repeat detected in the tested sequences. Future development of Applied Genetics can provide an alternative to powerful tools used to search for restriction sites or repetitive sequences or to improve genotyping methods.
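
The idea of a "hidden" recognition site that appears after deletion or insertion of one nucleotide can be illustrated as follows. This is not the PRSD algorithm itself, whose details the abstract does not give; it is a brute-force sketch in which the function names are hypothetical and the default site is the well-known EcoRI recognition sequence GAATTC. Only the given strand is scanned.

```python
def one_deletion_sites(seq, site="GAATTC"):
    """Start indices of windows that become `site` after deleting one base."""
    seq = seq.upper()
    w = len(site) + 1
    hits = []
    for start in range(len(seq) - w + 1):
        window = seq[start:start + w]
        if site in window:
            continue  # an intact site is already present in this window
        if any(window[:i] + window[i + 1:] == site for i in range(w)):
            hits.append(start)
    return hits


def one_insertion_sites(seq, site="GAATTC"):
    """Start indices of windows that become `site` after inserting one base."""
    seq = seq.upper()
    w = len(site) - 1
    # Every string obtained by removing exactly one base from the site.
    targets = {site[:i] + site[i + 1:] for i in range(len(site))}
    hits = []
    for start in range(len(seq) - w + 1):
        window = seq[start:start + w]
        if window not in targets:
            continue
        # Skip windows that are simply part of an intact site.
        if seq[max(start - 1, 0):start + w] == site or seq[start:start + w + 1] == site:
            continue
        hits.append(start)
    return hits
```

For example, the window GAAATTC is reported by `one_deletion_sites` because removing one A yields GAATTC, and GATTC is reported by `one_insertion_sites` because adding one A yields the same site.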

  17. Information Entropy of Influenza A Segment 7

    NASA Astrophysics Data System (ADS)

    Thompson, William A.; Fan, Shaohua; Weltman, Joel K.

    2008-12-01

    Information entropy (H) is a measure of uncertainty at each position within a sequence of nucleotides. H was used to characterize a set of influenza A segment 7 nucleotide sequences. Nucleotide locations of high entropy were identified near the 5′ start of all of the sequences and the sequences were assigned to subsets according to synonymous nucleotide variants at those positions: either uracil at position six (U6), cytosine at position six (C6), adenine at position 12 (A12), guanine at position 12 (G12), adenine at position 15 (A15) or cytosine at position 15 (C15). H values were found to be correlated (Kendall tau) along the lengths of the nucleotide segments of the subset pairs at each position. However, the H values of each subset of sequences were statistically distinguishable from those of the other member of the pair (Kolmogorov-Smirnov test). The joint probability of uncorrelated distributions of U6 and C6 sequences to viral subtypes and to viral host species was 34 times greater than for the A12:G12 subset pair and 214 times greater than for the A15:C15 pair. This result indicates that the high entropy position six of segment 7 is either a reporter or a sentinel location. The fact that not one of the H5N1 sequences in the dataset was a member of the C6 subset, but all 125 H5N1 sequences are members of the U6 subset, suggests a non-random sentinel function.
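
Per-position information entropy of the kind described here is usually the Shannon entropy of the nucleotide frequencies in each column of a set of aligned sequences. The sketch below assumes that definition with base-2 logarithms (bits); the abstract does not specify the logarithm base or how gaps were handled, and the function name `positional_entropy` is hypothetical.

```python
import math
from collections import Counter


def positional_entropy(sequences):
    """Shannon entropy (in bits) at each position of equal-length sequences."""
    if not sequences:
        return []
    length = len(sequences[0])
    if any(len(s) != length for s in sequences):
        raise ValueError("sequences must be aligned to the same length")
    entropies = []
    for column in zip(*sequences):
        total = len(column)
        # Sum -p * log2(p) over the symbols observed in this column.
        h = sum(-(n / total) * math.log2(n / total)
                for n in Counter(column).values())
        entropies.append(h)
    return entropies
```

A fully conserved position gives H = 0, and a position split evenly between two nucleotides gives H = 1 bit, so high-entropy positions such as those near the 5′ start stand out directly in the returned list.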

  18. A new variant of antimetabolic protein, arcelin from an Indian bean, Lablab purpureus (Linn.) and its effect on the stored product pest, Callosobruchus maculatus.

    PubMed

    Janarthanan, Sundaram; Sakthivelkumar, Shanmugavel; Veeramani, Velayutham; Radhika, Dixit; Muthukrishanan, Subbaratnam

    2012-12-15

    The anti-metabolic or insecticidal gene, arcelin (Arl), was isolated, cloned and sequenced using sequence-specific degenerate primers from the seeds of Lablab purpureus collected from the Western Ghats, Tamil Nadu, India. The L. purpureus arcelin nucleotide sequence was homologous to Arl-3 and Arl-4 alleles from Phaseolus spp. The protein it encodes has 70% amino acid identity with the amino acid sequences of Arl-3I, Arl-3III, Arl-4 precursor, Arl-4 and Arl-4I. Feeding assays with arcelin partially purified from the seeds of L. purpureus confirmed complete retardation of development of the stored product pest Callosobruchus maculatus on artificial seeds incorporating 0.2% w/w arcelin. Copyright © 2012 Elsevier Ltd. All rights reserved.

  19. Mapping DNA polymerase errors by single-molecule sequencing

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Lee, David F.; Lu, Jenny; Chang, Seungwoo

    Genomic integrity is compromised by DNA polymerase replication errors, which occur in a sequence-dependent manner across the genome. Accurate and complete quantification of a DNA polymerase's error spectrum is challenging because errors are rare and difficult to detect. We report a high-throughput sequencing assay to map in vitro DNA replication errors at the single-molecule level. Unlike previous methods, our assay is able to rapidly detect a large number of polymerase errors at base resolution over any template substrate without quantification bias. To overcome the high error rate of high-throughput sequencing, our assay uses a barcoding strategy in which each replication product is tagged with a unique nucleotide sequence before amplification. Here, this allows multiple sequencing reads of the same product to be compared so that sequencing errors can be found and removed. We demonstrate the ability of our assay to characterize the average error rate, error hotspots and lesion bypass fidelity of several DNA polymerases.

  20. Mapping DNA polymerase errors by single-molecule sequencing

    DOE PAGES

    Lee, David F.; Lu, Jenny; Chang, Seungwoo; ...

    2016-05-16

    Genomic integrity is compromised by DNA polymerase replication errors, which occur in a sequence-dependent manner across the genome. Accurate and complete quantification of a DNA polymerase's error spectrum is challenging because errors are rare and difficult to detect. We report a high-throughput sequencing assay to map in vitro DNA replication errors at the single-molecule level. Unlike previous methods, our assay is able to rapidly detect a large number of polymerase errors at base resolution over any template substrate without quantification bias. To overcome the high error rate of high-throughput sequencing, our assay uses a barcoding strategy in which each replication product is tagged with a unique nucleotide sequence before amplification. Here, this allows multiple sequencing reads of the same product to be compared so that sequencing errors can be found and removed. We demonstrate the ability of our assay to characterize the average error rate, error hotspots and lesion bypass fidelity of several DNA polymerases.
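
The barcoding strategy described in this abstract, comparing multiple reads of the same tagged product so that sequencing errors can be removed, can be sketched as a per-barcode majority vote. This is an illustration only, not the authors' pipeline: the function name `barcode_consensus`, the `min_reads` threshold and the assumption of equal-length reads without indels are all hypothetical simplifications.

```python
from collections import Counter, defaultdict


def barcode_consensus(reads, min_reads=2):
    """Build a consensus sequence for each barcode.

    reads: iterable of (barcode, sequence) pairs.
    Returns {barcode: consensus} for barcodes with enough reads.
    """
    groups = defaultdict(list)
    for barcode, seq in reads:
        groups[barcode].append(seq)

    consensus = {}
    for barcode, seqs in groups.items():
        if len(seqs) < min_reads:
            continue  # too few reads to tell sequencing errors apart
        if len({len(s) for s in seqs}) != 1:
            continue  # this sketch assumes equal-length reads (no indels)
        # Majority vote at each position across reads sharing the barcode.
        consensus[barcode] = "".join(
            Counter(column).most_common(1)[0][0] for column in zip(*seqs)
        )
    return consensus
```

A mismatch seen in only one of several reads carrying the same barcode is voted out as a sequencing error, whereas a change present in most reads of a barcode survives into the consensus and can then be compared with the template.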

  1. Systematic analysis of enzymatic DNA polymerization using oligo-DNA templates and triphosphate analogs involving 2',4'-bridged nucleosides.

    PubMed

    Kuwahara, Masayasu; Obika, Satoshi; Nagashima, Jun-ichi; Ohta, Yuki; Suto, Yoshiyuki; Ozaki, Hiroaki; Sawai, Hiroaki; Imanishi, Takeshi

    2008-08-01

    In order to systematically analyze the effects of nucleoside modification of sugar moieties in DNA polymerase reactions, we synthesized 16 modified templates containing 2',4'-bridged nucleotides and three types of 2',4'-bridged nucleoside-5'-triphosphates with different bridging structures. Among the five types of thermostable DNA polymerases used, Taq, Phusion HF, Vent(exo-), KOD Dash and KOD(exo-), the KOD Dash and KOD(exo-) DNA polymerases could smoothly read through the modified templates containing 2'-O,4'-C-methylene-linked nucleotides at intervals of a few nucleotides, even at standard enzyme concentrations for 5 min. Although the Vent(exo-) DNA polymerase also read through these modified templates, a kinetic study indicated that the KOD(exo-) DNA polymerase was far superior to the Vent(exo-) DNA polymerase in accurate incorporation of nucleotides. With either DNA polymerase, the presence of 2',4'-bridged nucleotides on a template strand substantially decreased the reaction rates of nucleotide incorporation. The modified templates containing sequences of seven successive 2',4'-bridged nucleotides could not be completely transcribed by any of the DNA polymerases used; yields of longer elongated products decreased in the order of steric bulkiness of the modified sugars. Successive incorporation of 2',4'-bridged nucleotides into extending strands using 2',4'-bridged nucleoside-5'-triphosphates was much more difficult. These data indicate that the sugar modification has a greater effect on the polymerase reaction when it is adjacent to the elongation terminus than when it is on the template, as is also the case for base modification.

  2. Glutamine 89 is a key residue in the allosteric modulation of human serine racemase activity by ATP.

    PubMed

    Canosa, Andrea V; Faggiano, Serena; Marchetti, Marialaura; Armao, Stefano; Bettati, Stefano; Bruno, Stefano; Percudani, Riccardo; Campanini, Barbara; Mozzarelli, Andrea

    2018-06-13

    Serine racemase (SR) catalyses two reactions: the reversible racemisation of L-serine and the irreversible dehydration of L- and D-serine to pyruvate and ammonia. SRs are evolutionarily related to serine dehydratases (SDH) and degradative threonine deaminases (TdcB). Most SRs and TdcBs - but not SDHs - are regulated by nucleotides. SR binds ATP cooperatively and the nucleotide allosterically stimulates the serine dehydratase activity of the enzyme. A H-bond network comprising five residues (T52, N86, Q89, E283 and N316) and water molecules connects the active site with the ATP-binding site. Conservation analysis points to Q89 as a key residue for the allosteric communication, since its mutation to either Met or Ala is linked to the loss of control of activity by nucleotides. We verified this hypothesis by introducing the Q89M and Q89A point mutations in the human SR sequence. The allosteric communication between the active site and the allosteric site in both mutants is almost completely abolished. Indeed, the stimulation of the dehydratase activity by ATP is severely diminished and the binding of the nucleotide is no longer cooperative. Ancestral state reconstruction suggests that the allosteric control by nucleotides was established early in SR evolution and has been maintained in most eukaryotic lineages.

  3. Substrate-specifying determinants of the nucleotide pyrophosphatases/phosphodiesterases NPP1 and NPP2

    PubMed Central

    2004-01-01

    The nucleotide pyrophosphatases/phosphodiesterases NPP1 and NPP2/autotaxin are structurally related eukaryotic ecto-enzymes, but display a very different substrate specificity. NPP1 releases nucleoside 5′-monophosphates from various nucleotides, whereas NPP2 mainly functions as a lysophospholipase D. We have used a domain-swapping approach to map substrate-specifying determinants of NPP1 and NPP2. The catalytic domain of NPP1 fused to the N- and C-terminal domains of NPP2 was hyperactive as a nucleotide phosphodiesterase, but did not show any lysophospholipase D activity. In contrast, chimaeras of the catalytic domain of NPP2 and the N- and/or C-terminal domains of NPP1 were completely inactive. These data indicate that the catalytic domain as well as both extremities of NPP2 contain lysophospholipid-specifying sequences. Within the catalytic domain of NPP1 and NPP2, we have mapped residues close to the catalytic site that determine the activities towards nucleotides and lysophospholipids. We also show that the conserved Gly/Phe-Xaa-Gly-Xaa-Xaa-Gly (G/FXGXXG) motif near the catalytic site is required for metal binding, but is not involved in substrate-specification. Our data suggest that the distinct activities of NPP1 and NPP2 stem from multiple differences throughout the polypeptide chain. PMID:15096095

  4. PUTATIVE GENE PROMOTER SEQUENCES IN THE CHLORELLA VIRUSES

    PubMed Central

    Fitzgerald, Lisa A.; Boucher, Philip T.; Yanai-Balser, Giane; Suhre, Karsten; Graves, Michael V.; Van Etten, James L.

    2008-01-01

    Three short (7 to 9 nucleotides) highly conserved nucleotide sequences were identified in the putative promoter regions (150 bp upstream and 50 bp downstream of the ATG translation start site) of three members of the genus Chlorovirus, family Phycodnaviridae. Most of these sequences occurred in similar locations within the defined promoter regions. The sequence and location of the motifs were often conserved among homologous ORFs within the Chlorovirus family. One of these conserved sequences (AATGACA) is predominately associated with genes expressed early in virus replication. PMID:18768195
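
Locating a conserved motif such as AATGACA within the promoter window defined in this abstract (150 bp upstream and 50 bp downstream of the ATG start site) can be sketched as a simple substring scan. The function name `motif_offsets` is hypothetical, the window is measured from the A of the ATG as an assumption, and only the given strand is scanned; the abstract does not describe how the authors performed the search.

```python
def motif_offsets(genome, atg_index, motif="AATGACA", upstream=150, downstream=50):
    """Offsets of motif matches relative to the A of the ATG start codon.

    Negative offsets lie upstream of the ATG. Only the given strand
    is scanned, and the window is clipped to the sequence ends.
    """
    genome = genome.upper()
    start = max(atg_index - upstream, 0)
    end = min(atg_index + downstream, len(genome))
    region = genome[start:end]

    offsets = []
    pos = region.find(motif)
    while pos != -1:
        offsets.append(start + pos - atg_index)
        pos = region.find(motif, pos + 1)  # allow overlapping matches
    return offsets
```

Collecting these offsets across many ORFs would show whether a motif tends to occur at a similar distance from the start codon, which is the kind of positional conservation the abstract reports.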

  5. A study of lactose metabolism in Lactococcus garvieae reveals a genetic marker for distinguishing between dairy and fish biotypes.

    PubMed

    Fortina, Maria Grazia; Ricci, Giovanni; Borgo, Francesca

    2009-06-01

    Dairy and fish isolates of Lactococcus garvieae were tested for their ability to utilize lactose and to grow in milk. Fish isolates were unable to assimilate lactose, but unexpectedly, they possessed the ability to grow in milk. Genetic studies, carried out constructing different vectorette libraries, provided evidence that in fish isolates, no genes involved in lactose utilization were present. For L. garvieae dairy isolates, a single system for the catabolism of lactose was found. It consists of a lactose transport and hydrolysis depending on a phosphoenolpyruvate-dependent phosphotransferase system combined with a phospho-beta-galactosidase. The genes involved were highly similar at the nucleotide sequence level to their counterparts in Lactococcus lactis; however, while in many L. lactis strains these genes are plasmid encoded, in L. garvieae they are chromosomally located. Thus, in the species L. garvieae, the phospho-beta-galactosidase gene, detectable in all strains of dairy origin but lacking in fish isolates, can be considered a reliable genetic marker for distinguishing biotypes in the two diverse ecological niches. Moreover, we obtained information regarding the complete nucleotide sequence of the gal operon in L. garvieae, consisting of a galactose permease and the Leloir pathway enzymes. This is one of the first reports concerning the determination of the nucleotide sequences of genes (other than the 16S rDNA gene) in L. garvieae and should be considered a step in a continuous effort to explore the genome of this species, with the aim of determining the real relationship between the presence of L. garvieae in dairy products and food safety.

  6. Multiple introductions of serotype O foot-and-mouth disease viruses into East Asia in 2010–2011

    PubMed Central

    2013-01-01

    Foot-and-mouth disease virus (FMDV) is a highly contagious and genetically variable virus. Sporadic introductions of this virus into FMD-free countries may cause outbreaks with devastating consequences. In 2010 and 2011, incursions of the FMDV O/SEA/Mya-98 strain, normally restricted to countries in mainland Southeast Asia, caused extensive outbreaks across East Asia. In this study, 12 full genome FMDV sequences for representative samples collected from the People’s Republic of China (PR China) including the Hong Kong Special Administrative Region (SAR), the Republic of Korea, the Democratic People’s Republic of Korea, Japan, Mongolia and The Russian Federation were generated and compared with additional contemporary sequences from viruses within this lineage. These complete genomes were 8119 to 8193 nucleotides in length and differed at 1181 sites, sharing a nucleotide identity ≥ 91.0% and an amino acid identity ≥ 96.6%. An unexpected deletion of 70 nucleotides within the 5′-untranslated region which resulted in a shorter predicted RNA stem-loop for the S-fragment was revealed in two sequences from PR China and Hong Kong SAR and five additional related samples from the region. Statistical parsimony and Bayesian phylogenetic analysis provide evidence that these outbreaks in East Asia were generated by two independent introductions of the O/SEA/Mya-98 lineage sometime between August 2008 and March 2010. The rapid emergence of these viruses from Southeast Asia highlights the importance of adopting approaches to closely monitor the spread of this lineage that now poses a threat to livestock industries in other regions. PMID:24007643

  7. Multiple introductions of serotype O foot-and-mouth disease viruses into East Asia in 2010-2011.

    PubMed

    Valdazo-González, Begoña; Timina, Anna; Scherbakov, Alexey; Abdul-Hamid, Nor Faizah; Knowles, Nick J; King, Donald P

    2013-09-05

    Foot-and-mouth disease virus (FMDV) is a highly contagious and genetically variable virus. Sporadic introductions of this virus into FMD-free countries may cause outbreaks with devastating consequences. In 2010 and 2011, incursions of the FMDV O/SEA/Mya-98 strain, normally restricted to countries in mainland Southeast Asia, caused extensive outbreaks across East Asia. In this study, 12 full genome FMDV sequences for representative samples collected from the People's Republic of China (PR China) including the Hong Kong Special Administrative Region (SAR), the Republic of Korea, the Democratic People's Republic of Korea, Japan, Mongolia and The Russian Federation were generated and compared with additional contemporary sequences from viruses within this lineage. These complete genomes were 8119 to 8193 nucleotides in length and differed at 1181 sites, sharing a nucleotide identity ≥ 91.0% and an amino acid identity ≥ 96.6%. An unexpected deletion of 70 nucleotides within the 5'-untranslated region which resulted in a shorter predicted RNA stem-loop for the S-fragment was revealed in two sequences from PR China and Hong Kong SAR and five additional related samples from the region. Statistical parsimony and Bayesian phylogenetic analysis provide evidence that these outbreaks in East Asia were generated by two independent introductions of the O/SEA/Mya-98 lineage sometime between August 2008 and March 2010. The rapid emergence of these viruses from Southeast Asia highlights the importance of adopting approaches to closely monitor the spread of this lineage that now poses a threat to livestock industries in other regions.

  8. Isolation and Genomic Characterization of a Duck-Origin GPV-Related Parvovirus from Cherry Valley Ducklings in China

    PubMed Central

    Chen, Hao; Dou, Yanguo; Tang, Yi; Zhang, Zhenjie; Zheng, Xiaoqiang; Niu, Xiaoyu; Yang, Jing; Yu, Xianglong; Diao, Youxiang

    2015-01-01

    A newly emerged duck parvovirus, which causes beak atrophy and dwarfism syndrome (BADS) in Cherry Valley ducks, has appeared in Northern China since March 2015. To explore the genetic diversity among waterfowl parvovirus isolates, the complete genome of an identified isolate designated SDLC01 was sequenced and analyzed in the present study. Genomic sequence analysis showed that SDLC01 shared 90.8%–94.6% of nucleotide identity with goose parvovirus (GPV) isolates and 78.6%–81.6% of nucleotide identity with classical Muscovy duck parvovirus (MDPV) isolates. Phylogenetic analysis of 443 nucleotides (nt) of the fragment A showed that SDLC01 was highly similar to a mule duck isolate (strain D146/02) and close to European GPV isolates but separate from Asian GPV isolates. Analysis of the left inverted terminal repeat regions revealed that SDLC01 had two major segments deleted between positions 160–176 and 306–322 nt compared with field GPV and MDPV isolates. Phylogenetic analysis of Rep and VP1 encoded by two major open reading frames of parvoviruses revealed that SDLC01 was distinct from all GPV and MDPV isolates. The viral pathogenicity and genome characterization of SDLC01 suggest that the novel GPV (N-GPV) is the causative agent of BADS and belongs to a distinct GPV-related subgroup. Furthermore, N-GPV sequences were detected in diseased ducks by polymerase chain reaction and viral proliferation was demonstrated in duck embryos and duck embryo fibroblast cells. PMID:26465143

  9. Molecular gene organisation and secondary structure of the mitochondrial large subunit ribosomal RNA from the cultivated Basidiomycota Agrocybe aegerita: a 13 kb gene possessing six unusual nucleotide extensions and eight introns.

    PubMed

    Gonzalez, P; Barroso, G; Labarère, J

    1999-04-01

    The complete gene sequence and secondary structure of the mitochondrial LSU rRNA from the cultivated Basidiomycota Agrocybe aegerita was derived by chromosome walking. The A. aegerita LSU rRNA gene (13 526 nt) represents, to date, the longest described, due to the highest number of introns (eight) and the occurrence of six long nucleotidic extensions. Seven introns belong to group I, while the intronic sequence i5 constitutes the first typical group II intron reported in a fungal mitochondrial LSU rDNA. As with most fungal LSU rDNA introns reported to date, four introns (i5-i8) are distributed in domain V associated with the peptidyl-transferase activity. One intron (i1) is located in domain I, and three (i2-i4) in domain II. The introns i2-i8 possess homologies with other fungal, algal or protozoan introns located at the same position in LSU rDNAs. One of them (i6) is located at the same insertion site as most Ascomycota or algae LSU introns, suggesting a possible inheritance from a common ancestor. On the contrary, intron i1 is located at a so-far unreported insertion site. Among the six unusual nucleotide extensions, five are located in domain I and one in domain V. This is the first report of a mitochondrial LSU rRNA gene sequence and secondary structure for the whole Basidiomycota division.

  10. Integrating multiple genomic data to predict disease-causing nonsynonymous single nucleotide variants in exome sequencing studies.

    PubMed

    Wu, Jiaxin; Li, Yanda; Jiang, Rui

    2014-03-01

    Exome sequencing has been widely used in detecting pathogenic nonsynonymous single nucleotide variants (SNVs) for human inherited diseases. However, traditional statistical genetics methods are ineffective in analyzing exome sequencing data, due to such facts as the large number of sequenced variants, the presence of non-negligible fraction of pathogenic rare variants or de novo mutations, and the limited size of affected and normal populations. Indeed, prevalent applications of exome sequencing have been appealing for an effective computational method for identifying causative nonsynonymous SNVs from a large number of sequenced variants. Here, we propose a bioinformatics approach called SPRING (Snv PRioritization via the INtegration of Genomic data) for identifying pathogenic nonsynonymous SNVs for a given query disease. Based on six functional effect scores calculated by existing methods (SIFT, PolyPhen2, LRT, MutationTaster, GERP and PhyloP) and five association scores derived from a variety of genomic data sources (gene ontology, protein-protein interactions, protein sequences, protein domain annotations and gene pathway annotations), SPRING calculates the statistical significance that an SNV is causative for a query disease and hence provides a means of prioritizing candidate SNVs. With a series of comprehensive validation experiments, we demonstrate that SPRING is valid for diseases whose genetic bases are either partly known or completely unknown and effective for diseases with a variety of inheritance styles. In applications of our method to real exome sequencing data sets, we show the capability of SPRING in detecting causative de novo mutations for autism, epileptic encephalopathies and intellectual disability. We further provide an online service, the standalone software and genome-wide predictions of causative SNVs for 5,080 diseases at http://bioinfo.au.tsinghua.edu.cn/spring.

  11. Genetic analysis of Fasciola isolates from cattle in Korea based on second internal transcribed spacer (ITS-2) sequence of nuclear ribosomal DNA.

    PubMed

    Choe, Se-Eun; Nguyen, Thuy Thi-Dieu; Kang, Tae-Gyu; Kweon, Chang-Hee; Kang, Seung-Won

    2011-09-01

    Nuclear ribosomal DNA sequence of the second internal transcribed spacer (ITS-2) has been used efficiently to identify the liver fluke species collected from different hosts and various geographic regions. ITS-2 sequences of 19 Fasciola samples collected from Korean native cattle were determined and compared. Sequence comparison including ITS-2 sequences of isolates from this study and reference sequences from Fasciola hepatica and Fasciola gigantica and intermediate Fasciola in Genbank revealed seven identical variable sites of investigated isolates. Among 19 samples, 12 individuals had ITS-2 sequences completely identical to that of pure F. hepatica, five possessed the sequences identical to F. gigantica type, whereas two shared the sequence of both F. hepatica and F. gigantica. No variations in length and nucleotide composition of ITS-2 sequence were observed within isolates that belonged to F. hepatica or F. gigantica. At the position of 218, five Fasciola containing a single-base substitution (C>T) formed a distinct branch inside the F. gigantica-type group which was similar to those of Asian-origin isolates. The phylogenetic tree of the Fasciola spp. based on complete ITS-2 sequences from this study and other representative isolates in different locations clearly showed that pure F. hepatica, F. gigantica type and intermediate Fasciola were observed. The result also provided additional genetic evidence for the existence of three forms of Fasciola isolated from native cattle in Korea by genetic approach using ITS-2 sequence.

  12. Human papillomavirus type 18 variant lineages in United States populations characterized by sequence analysis of LCR-E6, E2, and L1 regions.

    PubMed

    Arias-Pulido, Hugo; Peyton, Cheri L; Torrez-Martínez, Norah; Anderson, D Nelson; Wheeler, Cosette M

    2005-07-20

    While HPV 16 variant lineages have been well characterized, the knowledge about HPV 18 variants is limited. In this study, HPV 18 nucleotide variations in the E2 hinge region were characterized by sequence analysis in 47 control and 51 tumor specimens. Fifty of these specimens were randomly selected for sequencing of an LCR-E6 segment and 20 samples representative of LCR-E6 and E2 sequence variants were examined across the L1 region. A total of 2770 nucleotides per HPV 18 variant genome were considered in this study. HPV 18 variant nucleotides were linked among all gene segments analyzed and grouped into three main branches: Asian-American (AA), European (E), and African (Af). These three branches were equally distributed among controls and cases and when stratified by Hispanic and non-Hispanic ethnicities. Among invasive cervical cancer cases, no significant differences in the three HPV variant branches were observed among ethnic groups or when stratified by histopathology (squamous vs. adenocarcinoma). The Af branch showed the greatest nucleotide variability when compared to the HPV 18 reference sequence and was more closely related to HPV 45 than either AA or E branches. Our data also characterize nucleotide and amino acid variations in the L1 capsid gene among HPV 18 variants, which may be relevant to vaccine strategies and subsequent studies of naturally occurring HPV 18 variants. Several novel HPV 18 nucleotide variations were identified in this study.

  13. The repeating nucleotide sequence in the repetitive mitochondrial DNA from a "low-density" petite mutant of yeast.

    PubMed Central

    Van Kreijl, C F; Bos, J L

    1977-01-01

    The repeating nucleotide sequence of 68 base pairs in the mtDNA from an ethidium-induced cytoplasmic petite mutant of yeast has been determined. For sequence analysis, specifically primed and terminated RNA copies, obtained by in vitro transcription of the separated strands, were used. The sequence consists of 66 consecutive AT base pairs flanked by two GC pairs and comprises nearly all of the mutant mitochondrial genome. The sequence, moreover, also represents the first part of wild-type mtDNA sequenced so far. PMID:198740

  14. The complete sequence of the mitochondrial genome of the African Penguin (Spheniscus demersus).

    PubMed

    Labuschagne, Christiaan; Kotzé, Antoinette; Grobler, J Paul; Dalton, Desiré L

    2014-01-15

    The complete mitochondrial genome of the African Penguin (Spheniscus demersus) was sequenced. The molecule was sequenced via next generation sequencing and primer walking. The genome is 17,346 bp long. It was compared with the mitochondrial DNA of the two other penguin species whose genomes have been reported so far, namely the Little blue penguin (Eudyptula minor) and the Rockhopper penguin (Eudyptes chrysocome). This analysis made it possible to identify common penguin mitochondrial DNA characteristics. The S. demersus mtDNA genome is very similar, in both composition and length, to the E. chrysocome and E. minor genomes. The gene content of the African penguin mitochondrial genome is typical of vertebrates and all three penguin species have the standard gene order originally identified in the chicken. The control region for S. demersus is located between tRNA-Glu and tRNA-Phe and all three species of penguins contain two sets of similar repeats with varying copy numbers towards the 3' end of the control region, accounting for the size variance. This is the first report of the complete nucleotide sequence for the mitochondrial genome of the African penguin, S. demersus. These results can be subsequently used to provide information for penguin phylogenetic studies and insights into the evolution of genomes. © 2013 Elsevier B.V. All rights reserved.

  15. Reduced-median-network analysis of complete mitochondrial DNA coding-region sequences for the major African, Asian, and European haplogroups.

    PubMed

    Herrnstadt, Corinna; Elson, Joanna L; Fahy, Eoin; Preston, Gwen; Turnbull, Douglass M; Anderson, Christen; Ghosh, Soumitra S; Olefsky, Jerrold M; Beal, M Flint; Davis, Robert E; Howell, Neil

    2002-05-01

    The evolution of the human mitochondrial genome is characterized by the emergence of ethnically distinct lineages or haplogroups. Nine European, seven Asian (including Native American), and three African mitochondrial DNA (mtDNA) haplogroups have been identified previously on the basis of the presence or absence of a relatively small number of restriction-enzyme recognition sites or on the basis of nucleotide sequences of the D-loop region. We have used reduced-median-network approaches to analyze 560 complete European, Asian, and African mtDNA coding-region sequences from unrelated individuals to develop a more complete understanding of sequence diversity both within and between haplogroups. A total of 497 haplogroup-associated polymorphisms were identified, 323 (65%) of which were associated with one haplogroup and 174 (35%) of which were associated with two or more haplogroups. Approximately one-half of these polymorphisms are reported for the first time here. Our results confirm and substantially extend the phylogenetic relationships among mitochondrial genomes described elsewhere from the major human ethnic groups. Another important result is that there were numerous instances both of parallel mutations at the same site and of reversion (i.e., homoplasy). It is likely that homoplasy in the coding region will confound evolutionary analysis of small sequence sets. By a linkage-disequilibrium approach, additional evidence for the absence of human mtDNA recombination is presented here.

  16. Molecular Characterization of Geographically Different Banana bunchy top virus Isolates in India.

    PubMed

    Selvarajan, R; Mary Sheeba, M; Balasubramanian, V; Rajmohan, R; Dhevi, N Lakshmi; Sasireka, T

    2010-10-01

    Banana bunchy top disease (BBTD) caused by Banana bunchy top virus (BBTV) is one of the most devastating diseases of banana and poses a serious threat for cultivars like Hill Banana (Syn: Virupakshi) and Grand Naine in India. In this study, we have cloned and sequenced the complete genome, comprising six DNA components, of BBTV infecting Hill Banana grown in lower Pulney hills, Tamil Nadu State, India. The complete genome sequence of this hill banana isolate showed a high degree of similarity with the corresponding sequences of BBTV isolates originating from Lucknow, Uttar Pradesh State, India, and from Fiji, Egypt, Pakistan, and Australia. In addition, sixteen coat protein (CP) and thirteen replicase (Rep) gene sequences of BBTV isolates collected from different banana growing states of India were cloned and sequenced. The replicase sequences of 13 isolates showed a high degree of similarity with that of the South Pacific group of BBTV isolates. However, the CP gene of BBTV isolates from Shervroy and Kodaikanal hills of Tamil Nadu showed higher amino acid sequence variability compared to other isolates. Another hill banana isolate from Meghalaya state had 23 nucleotide substitutions in the CP gene but the amino acid sequence was conserved. This is the first report of the characterization of a complete genome of BBTV occurring in the high altitudes of India. Our study revealed that the Indian BBTV isolates with distinct geographical origins belong to the South Pacific group, except the Shervroy and Kodaikanal hill isolates, which belong to neither the South Pacific nor the Asian group.

  17. Complete sequences of IncHI1 plasmids carrying blaCTX-M-1 and qnrS1 in equine Escherichia coli provide new insights into plasmid evolution.

    PubMed

    Dolejska, Monika; Villa, Laura; Minoia, Marco; Guardabassi, Luca; Carattoli, Alessandra

    2014-09-01

    To determine the structure of two multidrug-resistant IncHI1 plasmids carrying blaCTX-M-1 in Escherichia coli isolates disseminated in an equine clinic in the Czech Republic. A complete nucleotide sequencing of 239 kb IncHI1 (pEQ1) and 287 kb IncHI1/X1 (pEQ2) plasmids was performed using the 454-Genome Sequencer FLX system. The sequences were compared using bioinformatic tools with other sequenced IncHI1 plasmids. A comparative analysis of pEQ1 and pEQ2 identified high nucleotide identity with the IncHI1 type 2 plasmids. A novel 24 kb module containing an operon involved in short-chain fructooligosaccharide uptake and metabolism was found in the pEQ backbones. The role of the pEQ plasmids in the metabolism of short-chain fructooligosaccharides was demonstrated by studying the growth of E. coli cells in the presence of these sugars. The module containing the blaCTX-M-1 gene was formed by a truncated macrolide resistance cluster and flanked by IS26 as previously observed in IncI1 and IncN plasmids. The IncHI1 plasmid changed size and gained the quinolone resistance gene qnrS1 as a result of IS26-mediated fusion with an IncX1 plasmid. Our data highlight the structure and evolution of IncHI1 from equine E. coli. A plasmid-mediated sugar metabolic element could play a key role in strain fitness, contributing to the successful dissemination and maintenance of these plasmids in the intestinal microflora of horses. © The Author 2014. Published by Oxford University Press on behalf of the British Society for Antimicrobial Chemotherapy. All rights reserved.

  18. Strain variation in Mycobacterium marinum fish isolates.

    PubMed

    Ucko, M; Colorni, A; Kvitt, H; Diamant, A; Zlotkin, A; Knibb, W R

    2002-11-01

    A molecular characterization of two Mycobacterium marinum genes, 16S rRNA and hsp65, was carried out with a total of 21 isolates from various species of fish from both marine and freshwater environments of Israel, Europe, and the Far East. The nucleotide sequences of both genes revealed that all M. marinum isolates from fish in Israel belonged to two different strains, one infecting marine (cultured and wild) fish and the other infecting freshwater (cultured) fish. A restriction enzyme map based on the nucleotide sequences of both genes confirmed the divergence of the Israeli marine isolates from the freshwater isolates and differentiated the Israeli isolates from the foreign isolates, with the exception of one of three Greek isolates from marine fish which was identical to the Israeli marine isolates. The second isolate from Greece exhibited a single base alteration in the 16S rRNA sequence, whereas the third isolate was most likely a new Mycobacterium species. Isolates from Denmark and Thailand shared high sequence homology to complete identity with reference strain ATCC 927. Combined analysis of the two gene sequences increased the detection of intraspecific variations and was thus of importance in studying the taxonomy and epidemiology of this aquatic pathogen. Whether the Israeli M. marinum strain infecting marine fish is endemic to the Red Sea and found extremely susceptible hosts in the exotic species imported for aquaculture or rather was accidentally introduced with occasional imports of fingerlings from the Mediterranean Sea could not be determined.

  19. Population genetic implications from sequence variation in four Y chromosome genes.

    PubMed

    Shen, P; Wang, F; Underhill, P A; Franco, C; Yang, W H; Roxas, A; Sung, R; Lin, A A; Hyman, R W; Vollrath, D; Davis, R W; Cavalli-Sforza, L L; Oefner, P J

    2000-06-20

    Some insight into human evolution has been gained from the sequencing of four Y chromosome genes. Primary genomic sequencing determined gene SMCY to be composed of 27 exons that comprise 4,620 bp of coding sequence. The unfinished sequencing of the 5' portion of gene UTY1 was completed by primer walking, and a total of 20 exons were found. By using denaturing HPLC, these two genes, as well as DBY and DFFRY, were screened for polymorphic sites in 53-72 representatives of the five continents. A total of 98 variants were found, yielding nucleotide diversity estimates of 2.45 × 10^-5, 5.07 × 10^-5, and 8.54 × 10^-5 for the coding regions of SMCY, DFFRY, and UTY1, respectively, with no variant having been observed in DBY. In agreement with most autosomal genes, diversity estimates for the noncoding regions were about 2- to 3-fold higher and ranged from 9.16 × 10^-5 to 14.2 × 10^-5 for the four genes. Analysis of the frequencies of derived alleles for all four genes showed that they more closely fit the expectation of a Luria-Delbrück distribution than a distribution expected under a constant population size model, providing evidence for exponential population growth. Pairwise nucleotide mismatch distributions date the occurrence of population expansion to approximately 28,000 years ago. This estimate is in accord with the spread of Aurignacian technology and the disappearance of the Neanderthals.

  20. The complete genome structure and phylogenetic relationship of infectious hematopoietic necrosis virus

    USGS Publications Warehouse

    Morzunov, Sergey P.; Winton, James R.; Nichol, Stuart T.

    1995-01-01

    Infectious hematopoietic necrosis virus (IHNV), a member of the family Rhabdoviridae, causes a severe disease with high mortality in salmonid fish. The nucleotide sequence (11,131 bases) of the entire genome was determined for the pathogenic WRAC strain of IHNV from southern Idaho. This allowed detailed analysis of all 6 genes, the deduced amino acid sequences of their encoded proteins, and important control motifs including leader, trailer and gene junction regions. Sequence analysis revealed that the 6 virus genes are located along the genome in the 3′ to 5′ order: nucleocapsid (N), polymerase-associated phosphoprotein (P or M1), matrix protein (M or M2), surface glycoprotein (G), a unique non-virion protein (NV) and virus polymerase (L). The IHNV genome RNA was found to have highly complementary termini (15 of 16 nucleotides). The gene junction regions display the highly conserved sequence UCURUC(U)7RCCGUG(N)4CACR (in the vRNA sense), which includes the typical rhabdovirus transcription termination/polyadenylation signal and a novel putative transcription initiation signal. Phylogenetic analysis of M, G and L protein sequences allowed insights into the evolutionary and taxonomic relationship of rhabdoviruses of fish relative to those of insects or mammals, and a broader sense of the relationship of non-segmented negative-strand RNA viruses. Based on these data, a new genus, piscivirus, is proposed which will initially contain IHNV, viral hemorrhagic septicemia virus and Hirame rhabdovirus.
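
    As an illustration only (not part of the original study), the gene junction consensus quoted above can be turned into a regular expression for scanning a vRNA-sense sequence. The sketch assumes the standard IUPAC codes (R = A or G, N = any base) and reads (U)7 and (N)4 as exact repeat counts; the function names motif_to_regex and find_junctions are hypothetical.

```python
import re

# IUPAC RNA codes that occur in the motif (R = purine, N = any base).
IUPAC_RNA = {"A": "A", "C": "C", "G": "G", "U": "U", "R": "[AG]", "N": "[ACGU]"}


def motif_to_regex(motif):
    """Translate a motif such as 'UCURUC(U)7RCCGUG(N)4CACR' into a regex string."""
    parts = []
    # Each token is either '(X)n' (code X repeated n times) or a single code.
    for repeated, count, single in re.findall(r"\(([A-Z])\)(\d+)|([A-Z])", motif):
        if repeated:
            parts.append(f"{IUPAC_RNA[repeated]}{{{count}}}")
        else:
            parts.append(IUPAC_RNA[single])
    return "".join(parts)


JUNCTION_MOTIF = "UCURUC(U)7RCCGUG(N)4CACR"
JUNCTION_REGEX = re.compile(motif_to_regex(JUNCTION_MOTIF))


def find_junctions(vrna):
    """Return 0-based start positions of motif matches in a vRNA-sense sequence."""
    return [m.start() for m in JUNCTION_REGEX.finditer(vrna.upper())]


if __name__ == "__main__":
    # Synthetic test sequence, invented for this example (not IHNV data).
    toy = "GG" + "UCUAUC" + "U" * 7 + "G" + "CCGUG" + "ACGU" + "CACA" + "GG"
    print(motif_to_regex(JUNCTION_MOTIF))
    print(find_junctions(toy))
```

    Expanding the motif programmatically rather than hand-writing the pattern keeps the regular expression traceable to the consensus string as printed in the abstract.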

  1. Infectious hematopoietic necrosis virus: Monophyletic origin of European isolates from North American Genogroup M

    USGS Publications Warehouse

    Enzmann, P.-J.; Kurath, G.; Fichtner, D.; Bergmann, S.M.

    2005-01-01

    Infectious hematopoietic necrosis virus (IHNV) was first detected in Europe in 1987 in France and Italy, and later, in 1992, in Germany. The source of the virus and the route of introduction are unknown. The present study investigates the molecular epidemiology of IHNV outbreaks in Germany since its first introduction. The complete nucleotide sequences of the glycoprotein (G) and non-virion (NV) genes from 9 IHNV isolates from Germany have been determined, and this has allowed the identification of characteristic differences between these isolates. Phylogenetic analysis of partial G gene sequences (mid-G, 303 nucleotides) from North American IHNV isolates (Kurath et al. 2003) has revealed 3 major genogroups, designated U, M and L. Using this gene region with 2 different North American IHNV data sets, it was possible to group the European IHNV strains within the M genogroup, but not in any previously defined subgroup. Analysis of the full length G gene sequences indicated that an independent evolution of IHN viruses had occurred in Europe. IHN viruses in Europe seem to be of a monophyletic origin, again most closely related to North American isolates in the M genogroup. Analysis of the NV gene sequences also showed the European isolates to be monophyletic, but resolution of the 3 genogroups was poor with this gene region. As a result of comparative sequence analyses, several different genotypes have been identified circulating in Europe. © Inter-Research 2005.

  2. Two missense mutations in melanocortin 1 receptor (MC1R) are strongly associated with dark ventral coat color in reindeer (Rangifer tarandus).

    PubMed

    Våge, D I; Nieminen, M; Anderson, D G; Røed, K H

    2014-10-01

    The protein-coding region of melanocortin 1 receptor (MC1R) was sequenced to identify potential variation affecting coat color in reindeer (Rangifer tarandus). A T→C sequence variation at nucleotide position 218 (c.218T>C) causing an amino acid (aa) change from methionine to threonine at aa position 73 (p.Met73Thr) was identified. In addition, a T→G sequence variation was found at nucleotide position 839 (c.839T>G), causing phenylalanine to be replaced by cysteine at aa position 280 (p.Phe280Cys). The two sequence variants (c.218C and c.839G) were found to be closely associated with a darker belly coat compared with animals not having any of these two variants. The amino acid change p.Met73Thr affects the same position as p.Met73Lys previously reported to give constitutive activation of MC1R in black sheep (Ovis aries), whereas p.Phe280Cys is identical to one of two variants previously reported to be associated with dark coat color in Arctic fox (Alopex lagopus), supporting that the two variants found in reindeer are functional. The complete absence of Thr73 and Cys280 among the 51 wild reindeer analyzed provides some evidence that these variants are more common in the domestic herds. © 2014 Stichting International Foundation for Animal Genetics.
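
    The codon arithmetic behind the two variants can be checked with a short sketch. It assumes 1-based coding-sequence positions (as in the c. notation), uses the fact that methionine is encoded only by ATG, and tries both TTT and TTC as the reference phenylalanine codon because the abstract does not state which one occurs in reindeer. The helper names are hypothetical.

```python
# Only the codons needed for this illustration (standard genetic code).
CODON_TO_AA = {
    "ATG": "Met", "ACG": "Thr",
    "TTT": "Phe", "TTC": "Phe",
    "TGT": "Cys", "TGC": "Cys",
}


def locate_in_codon(cds_pos):
    """Map a 1-based coding position to (codon number, position within codon), both 1-based."""
    return (cds_pos - 1) // 3 + 1, (cds_pos - 1) % 3 + 1


def describe(cds_pos, ref_codon, ref, alt):
    """Return the protein-level change caused by substituting ref with alt at cds_pos."""
    codon_no, pos = locate_in_codon(cds_pos)
    if ref_codon[pos - 1] != ref:
        raise ValueError("reference base does not match the assumed codon")
    new_codon = ref_codon[:pos - 1] + alt + ref_codon[pos:]
    return f"p.{CODON_TO_AA[ref_codon]}{codon_no}{CODON_TO_AA[new_codon]}"


if __name__ == "__main__":
    print(locate_in_codon(218), describe(218, "ATG", "T", "C"))  # c.218T>C
    for phe_codon in ("TTT", "TTC"):  # assumed reference codons for Phe280
        print(locate_in_codon(839), describe(839, phe_codon, "T", "G"))  # c.839T>G
```

    Both substitutions fall on the second base of their codon, which is why each changes the encoded amino acid regardless of the third-position base.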

  3. Complete sequence of Tvv1, a family of Ty 1 copia-like retrotransposons of Vitis vinifera L., reconstituted by chromosome walking.

    PubMed

    Pelsy, F.; Merdinoglu, D.

    2002-09-01

    A chromosome-walking strategy was used to sequence and characterize retrotransposons in the grapevine genome. The reconstitution of a family of retroelements, named Tvv1, was achieved by six successive steps. These elements share a single, highly conserved open reading frame 4,153 nucleotides-long, putatively encoding the gag, pro, int, rt and rh proteins. Comparison of the Tvv1 open reading frame coding potential with those of drosophila copia and tobacco Tnt1, revealed that Tvv1 is closely related to Ty 1 copia-like retrotransposons. A highly variable untranslated leader region, upstream of the open reading frame, allowed us to differentiate Tvv1 variants, which represent a family of at least 28 copies, in varying sizes. This internal region is flanked by two long terminal repeats in direct orientation, sized between 149 and 157 bp. Among elements theoretically sized from 4,970 to 5,550 bp, we describe the full-length sequence of a reference element Tvv1-1, 5,343 nucleotides-long. The full-length sequence of Tvv1-1 compared to pea PDR1 shows a 53.3% identity. In addition, both elements contain long terminal repeats of nearly the same size in which the U5 region could be entirely absent. Therefore, we assume that Tvv1 and PDR1 could constitute a particular class of short LTRs retroelements.

  4. Isolation and characterization of two cryptic plasmids in the ammonia-oxidizing bacterium Nitrosomonas sp. strain ENI-11.

    PubMed

    Yamagata, A; Kato, J; Hirota, R; Kuroda, A; Ikeda, T; Takiguchi, N; Ohtake, H

    1999-06-01

    Two plasmids were discovered in the ammonia-oxidizing bacterium Nitrosomonas sp. strain ENI-11, which was isolated from activated sludge. The plasmids, designated pAYS and pAYL, were relatively small, being approximately 1.9 kb long. They were cryptic plasmids, having no detectable plasmid-linked antibiotic resistance or heavy metal resistance markers. The complete nucleotide sequences of pAYS and pAYL were determined, and their physical maps were constructed. There existed two major open reading frames, ORF1 in pAYS and ORF2 in pAYL, each of which was more than 500 bp long. The predicted product of ORF2 was 28% identical to part of the replication protein of a Bacillus plasmid, pBAA1. However, no significant similarity to any known protein sequences was detected with the predicted product of ORF1. pAYS and pAYL had a highly homologous region, designated HHR, of 262 bp. The overall identity was 98% between the two nucleotide sequences. Interestingly, HHR-homologous sequences were also detected in the genomes of ENI-11 and the plasmidless strain Nitrosomonas europaea IFO14298. Deletion analysis of pAYS and pAYL indicated that HHR, together with either ORF1 or ORF2, was essential for plasmid maintenance in ENI-11. To our knowledge, pAYS and pAYL are the first plasmids found in the ammonia-oxidizing autotrophic bacteria.

  5. The complete sequences and gene organisation of the mitochondrial genomes of the heterodont bivalves Acanthocardia tuberculata and Hiatella arctica – and the first record for a putative Atpase subunit 8 gene in marine bivalves

    PubMed Central

    Dreyer, Hermann; Steiner, Gerhard

    2006-01-01

    Background Mitochondrial (mt) gene arrangement is highly variable among molluscs and especially among bivalves. Of the 30 complete molluscan mt-genomes published to date, only one is of a heterodont bivalve, although this is the most diverse taxon in terms of species numbers. We determined the complete sequence of the mitochondrial genomes of Acanthocardia tuberculata and Hiatella arctica (Mollusca, Bivalvia, Heterodonta) and describe their gene contents and genome organisations to assess the variability of these features among the Bivalvia and their value for phylogenetic inference. Results The size of the mt-genome in Acanthocardia tuberculata is 16,104 base pairs (bp), and in Hiatella arctica 18,244 bp. The Acanthocardia mt-genome contains 12 of the typical protein coding genes, lacking the Atpase subunit 8 (atp8) gene, as in all published marine bivalves. In contrast, a complete atp8 gene is present in Hiatella arctica. In addition, we found a putative truncated atp8 gene when re-annotating the mt-genome of Venerupis philippinarum. Both mt-genomes reported here encode all genes on the same strand and have an additional trnM. In Acanthocardia several large non-coding regions are present. One of these contains 3.5 nearly identical copies of a 167 bp motif. In Hiatella, the 3' end of the NADH dehydrogenase subunit (nad)6 gene is duplicated together with the adjacent non-coding region. The gene arrangement of Hiatella is markedly different from all other known molluscan mt-genomes, whereas that of Acanthocardia shows few identities with that of Venerupis philippinarum. Phylogenetic analyses on amino acid and nucleotide levels robustly support the Heterodonta and the sister group relationship of Acanthocardia and Venerupis. Monophyletic Bivalvia are resolved only by a Bayesian inference of the nucleotide data set. In all other analyses the two unionid species, being the only ones with genes located on both strands, do not group with the remaining bivalves.
Conclusion The two mt-genomes reported here add to and underline the high variability of gene order and presence of duplications in bivalve and molluscan taxa. Some genomic traits like the loss of the atp8 gene or the encoding of all genes on the same strand are homoplastic among the Bivalvia. These characters, gene order, and the nucleotide sequence data show considerable potential of resolving phylogenetic patterns at lower taxonomic levels. PMID:16948842

  6. 37 CFR 1.824 - Form and format for nucleotide and/or amino acid sequence submissions in computer readable form.

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ... 37 Patents, Trademarks, and Copyrights 1 2010-07-01 2010-07-01 false Form and format for... And/or Amino Acid Sequences § 1.824 Form and format for nucleotide and/or amino acid sequence... Code for Information Interchange (ASCII) text. No other formats shall be allowed. (3) The computer...

  7. The nucleotide sequence of 5S rRNA from a cellular slime mold Dictyostelium discoideum.

    PubMed Central

    Hori, H; Osawa, S; Iwabuchi, M

    1980-01-01

    The nucleotide sequence of ribosomal 5S rRNA from a cellular slime mold Dictyostelium discoideum is GUAUACGGCCAUACUAGGUUGGAAACACAUCAUCCCGUUCGAUCUGAUAAGUAAAUCGACCUCAGGCCUUCCAAGUACUCUGGUUGGAGACAACAGGGGAACAUAGGGUGCUGUAUACU. A model for the secondary structure of this 5S rRNA is proposed. The sequence is more similar to those of animals (62% similarity on average) than to those of yeasts (56%). PMID:7465421

  8. A nucleotide sequence comparison of coxsackievirus B4 isolates from aquatic samples and clinical specimens.

    PubMed Central

    Hughes, M. S.; Hoey, E. M.; Coyle, P. V.

    1993-01-01

    Ten coxsackievirus B4 (CVB4) strains isolated from clinical and environmental sources in Northern Ireland in 1985-7 were compared at the nucleotide sequence level. Dideoxynucleotide sequencing of a polymerase chain reaction (PCR) amplified fragment, spanning the VP1/P2A genomic region, classified the isolates into two distinct groups or genotypes as defined by Rico-Hesse and colleagues for poliovirus type 1. Isolates within each group shared approximately 99% sequence identity at the nucleotide level whereas ≤ 86% sequence identity was shared between groups. One isolate derived from a clinical specimen in 1987 was grouped with six CVB4 isolates recovered from the aquatic environment in 1986-7. The second group comprised CVB4 isolates from clinical specimens in 1985-6. Both groups were different at the nucleotide level from the prototype strain isolated in 1950. It was concluded that the method could be used to sub-type CVB4 isolates and would be of value in epidemiological studies of CVB4. Predicted amino acid sequences revealed non-conservation of the tyrosine residue at the VP1/P2A cleavage site but were of little value in distinguishing CVB4 variants. PMID:8386098
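
    The grouping described here rests on pairwise nucleotide identity. As a minimal sketch, assuming the sequences have already been aligned to equal length and that gap columns are skipped (a common convention, not necessarily the one the authors used), percent identity could be computed as follows. The function name and the toy sequences are hypothetical.

```python
def percent_identity(a, b):
    """Percent of ungapped alignment columns in which two aligned sequences agree."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    matches = 0
    for x, y in zip(a.upper(), b.upper()):
        if x == "-" or y == "-":
            continue  # skip columns where either sequence has a gap
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


if __name__ == "__main__":
    # Toy aligned fragments invented for this example (not CVB4 data).
    print(percent_identity("ACGTACGTAC", "ACGTACGTAA"))
    print(percent_identity("AC-TACGT", "ACGTACGA"))
```

    Computing this value for every pair of isolates gives the identity matrix from which within-group and between-group ranges, such as those quoted in the abstract, can be read off.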

  9. Probing genomic diversity and evolution of Escherichia coli O157 by single nucleotide polymorphisms.

    PubMed

    Zhang, Wei; Qi, Weihong; Albert, Thomas J; Motiwala, Alifiya S; Alland, David; Hyytia-Trees, Eija K; Ribot, Efrain M; Fields, Patricia I; Whittam, Thomas S; Swaminathan, Bala

    2006-06-01

    Infections by Shiga toxin-producing Escherichia coli O157:H7 (STEC O157) are the predominant cause of bloody diarrhea and hemolytic uremic syndrome in the United States. In silico comparison of the two complete STEC O157 genomes (Sakai and EDL933) revealed a strikingly high level of sequence identity in orthologous protein-coding genes, limiting the use of nucleotide sequences to study the evolution and epidemiology of this bacterial pathogen. To systematically examine single nucleotide polymorphisms (SNPs) at a genome scale, we designed comparative genome sequencing microarrays and analyzed 1199 chromosomal genes (a total of 1,167,948 bp) and 92,721 bp of the large virulence plasmid (pO157) of eleven outbreak-associated STEC O157 strains. We discovered 906 SNPs in 523 chromosomal genes and observed a high level of DNA polymorphisms among the pO157 plasmids. Based on a uniform rate of synonymous substitution for Escherichia coli and Salmonella enterica (4.7 × 10^-9 per site per year), we estimate that the most recent common ancestor of the contemporary beta-glucuronidase-negative, non-sorbitol-fermenting STEC O157 strains existed ca. 40 thousand years ago. The phylogeny of the STEC O157 strains based on the informative synonymous SNPs was compared to the maximum parsimony trees inferred from pulsed-field gel electrophoresis and multilocus variable numbers of tandem repeats analysis. The topological discrepancies indicate that, in contrast to the synonymous mutations, parts of STEC O157 genomes have evolved through different mechanisms with highly variable divergence rates. The SNP loci reported here will provide useful genetic markers for developing high-throughput methods for fine-resolution genotyping of STEC O157. Functional characterization of nucleotide polymorphisms should shed new insights on the evolution, epidemiology, and pathogenesis of STEC O157 and related pathogens.

  10. Probing genomic diversity and evolution of Escherichia coli O157 by single nucleotide polymorphisms

    PubMed Central

    Zhang, Wei; Qi, Weihong; Albert, Thomas J.; Motiwala, Alifiya S.; Alland, David; Hyytia-Trees, Eija K.; Ribot, Efrain M.; Fields, Patricia I.; Whittam, Thomas S.; Swaminathan, Bala

    2006-01-01

    Infections by Shiga toxin-producing Escherichia coli O157:H7 (STEC O157) are the predominant cause of bloody diarrhea and hemolytic uremic syndrome in the United States. In silico comparison of the two complete STEC O157 genomes (Sakai and EDL933) revealed a strikingly high level of sequence identity in orthologous protein-coding genes, limiting the use of nucleotide sequences to study the evolution and epidemiology of this bacterial pathogen. To systematically examine single nucleotide polymorphisms (SNPs) at a genome scale, we designed comparative genome sequencing microarrays and analyzed 1199 chromosomal genes (a total of 1,167,948 bp) and 92,721 bp of the large virulence plasmid (pO157) of eleven outbreak-associated STEC O157 strains. We discovered 906 SNPs in 523 chromosomal genes and observed a high level of DNA polymorphisms among the pO157 plasmids. Based on a uniform rate of synonymous substitution for Escherichia coli and Salmonella enterica (4.7 × 10^-9 per site per year), we estimate that the most recent common ancestor of the contemporary β-glucuronidase-negative, non-sorbitol-fermenting STEC O157 strains existed ca. 40 thousand years ago. The phylogeny of the STEC O157 strains based on the informative synonymous SNPs was compared to the maximum parsimony trees inferred from pulsed-field gel electrophoresis and multilocus variable numbers of tandem repeats analysis. The topological discrepancies indicate that, in contrast to the synonymous mutations, parts of STEC O157 genomes have evolved through different mechanisms with highly variable divergence rates. The SNP loci reported here will provide useful genetic markers for developing high-throughput methods for fine-resolution genotyping of STEC O157. Functional characterization of nucleotide polymorphisms should shed new insights on the evolution, epidemiology, and pathogenesis of STEC O157 and related pathogens. PMID:16606700

  11. Control of total GFP expression by alterations to the 3′ region nucleotide sequence

    PubMed Central

    2013-01-01

    Background Previously, we distinguished the Escherichia coli type II cytoplasmic membrane translocation pathways of Tat, Yid, and Sec for unfolded and folded soluble target proteins. The translocation of folded protein to the periplasm for soluble expression via the Tat pathway was controlled by an N-terminal hydrophilic leader sequence. In this study, we investigated the effect of the hydrophilic C-terminal end and its nucleotide sequence on total and soluble protein expression. Results The native hydrophilic C-terminal end of GFP was obtained by deleting the C-terminal peptide LeuGlu-6×His, derived from pET22b(+). The corresponding clones induced total and soluble GFP expression that was either slightly increased or dramatically reduced, apparently through reconstruction of the nucleotide sequence around the stop codon in the 3′ region. In the expression-induced clones, the hydrophilic C-terminus showed increased Tat pathway specificity for soluble expression. However, in the expression-reduced clone, after analyzing the role of the 5′ poly(A) coding sequence with a substituted synonymous codon, we proved that the longer 5′ poly(A) coding sequence interacted with the reconstructed 3′ region nucleotide sequence to create a new mRNA tertiary structure between the 5′ and 3′ regions, which resulted in reduced total GFP expression. Further, to recover the reduced expression by changing the 3′ nucleotide sequence, after replacing selected C-terminal 5′ codons and the stop codon in the ORF with synonymous codons, total GFP expression in most of the clones was recovered to the undeleted control level. The insertion of trinucleotides after the stop codon in the 3′-UTR recovered or reduced total GFP expression. RT-PCR revealed that the level of total protein expression was controlled by changes in translational or transcriptional regulation, which were induced or reduced by the substitution or insertion of 3′ region nucleotides. 
Conclusions We found that the hydrophilic C-terminal end of GFP increased Tat pathway specificity and that the 3′ nucleotide sequence played an important role in total protein expression through translational and transcriptional regulation. These findings may be useful for efficiently producing recombinant proteins as well as for potentially controlling the expression level of specific genes in the body for therapeutic purposes. PMID:23834827

  12. Stability of Tandem Repeats in the Drosophila Melanogaster HSR-Omega Nuclear RNA

    PubMed Central

    Hogan, N. C.; Slot, F.; Traverse, K. L.; Garbe, J. C.; Bendena, W. G.; Pardue, M. L.

    1995-01-01

    The Drosophila melanogaster Hsr-omega locus produces a nuclear RNA containing >5 kb of tandem repeat sequences. These repeats are unique to Hsr-omega and show concerted evolution similar to that seen with classical satellite DNAs. In D. melanogaster the monomer is ~280 bp. Sequences of 19 1/2 monomers differ by 8 ± 5% (mean ± SD), when all pairwise comparisons are considered. Differences are single nucleotide substitutions and 1-3 nucleotide deletions/insertions. Changes appear to be randomly distributed over the repeat unit. Outer repeats do not show the decrease in monomer homogeneity that might be expected if homogeneity is maintained by recombination. However, just outside the last complete repeat at each end, there are a few fragments of sequence similar to the monomer. The sequences in these flanking regions are not those predicted for sequences decaying in the absence of recombination. Instead, the fragmentation of the sequence homology suggests that flanking regions have undergone more severe disruptions, possibly during an insertion or amplification event. Hsr-omega alleles differing in the number of repeats are detected and appear to be stable over a few thousand generations; however, both increases and decreases in repeat numbers have been observed. The new alleles appear to be as stable as their predecessors. No alleles of less than ~5 kb nor more than ~16 kb of repeats were seen in any stocks examined. The evidence that there is a limit on the minimum number of repeats is consistent with the suggestion that these repeats are important in the function of the unusual Hsr-omega nuclear RNA. PMID:7540581

  13. Characterization of the Campylobacter jejuni cryptic plasmid pTIW94 recovered from wild birds in the southeastern United States.

    PubMed

    Hiett, Kelli L; Rothrock, Michael J; Seal, Bruce S

    2013-09-01

    The complete nucleotide sequence was determined for a cryptic plasmid, pTIW94, recovered from several Campylobacter jejuni isolates from wild birds in the southeastern United States. pTIW94 is a circular molecule of 3860 nucleotides, with a G+C content (31.0%) similar to that of many Campylobacter spp. genomes. A typical origin of replication, with iteron sequences, was identified upstream of DNA sequences that demonstrated similarity to replication initiation proteins. A total of five open reading frames (ORFs) were identified; two of the five ORFs demonstrated significant similarity to plasmid pCC2228-2 found within Campylobacter coli. These two ORFs were similar to essential replication proteins RepA (100%; 26/26 aa identity) and RepB (95%; 327/346 aa identity). A third identified ORF demonstrated significant similarity (99%; 421/424 aa identity) to the MOB protein from C. coli 67-8, originally recovered from swine. The other two identified ORFs were either similar to hypothetical proteins from other Campylobacter spp., or exhibited no significant similarity to any DNA or protein sequence in the GenBank database. Promoter regions (-35 and -10 signal sites), ribosomal binding sites upstream of ORFs, and stem-loop structures were also identified within the plasmid. These results demonstrate that pTIW94 represents a previously unreported small cryptic plasmid with unique sequences as well as sequences highly similar to those of other small plasmids found within Campylobacter spp., and that this cryptic plasmid is present among Campylobacter spp. recovered from different genera of wild birds. Copyright © 2013. Published by Elsevier Inc.
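    The identity percentages and G+C content reported in this abstract follow from simple ratios. The short Python sketch below recomputes the identities from the matched/aligned residue counts quoted above. The helper names are hypothetical, and the G+C helper is shown on a toy string because the plasmid sequence itself is not reproduced here.

```python
def percent_identity(matches: int, aligned_length: int) -> float:
    """Percent identity = identical positions / aligned positions * 100."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return 100.0 * matches / aligned_length


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence (case-insensitive)."""
    seq = seq.upper()
    if not seq:
        raise ValueError("sequence must be non-empty")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


# Residue counts as quoted in the abstract (matches, aligned length)
reported = [("RepA", 26, 26), ("RepB", 327, 346), ("MOB", 421, 424)]
for name, matches, length in reported:
    print(f"{name}: {percent_identity(matches, length):.1f}% identity")

# Toy sequence only; not the pTIW94 sequence
print(f"toy G+C content: {gc_content('ATGCATAT'):.1f}%")
```

    Rounded to whole percentages, these ratios agree with the 100%, 95% and 99% figures given in the abstract.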

  14. Molecular characterization of the vitamin D receptor (VDR) gene in Holstein cows.

    PubMed

    Ali, Mayar O; El-Adl, Mohamed A; Ibrahim, Hussam M M; Elseedy, Youssef Y; Rizk, Mohamed A; El-Khodery, Sabry A

    2018-06-01

    Vitamin D plays a vital role in calcium homeostasis, growth, and immunoregulation. Because little is known about the vitamin D receptor (VDR) gene in cattle, the aim of the present investigation was to present the molecular characterization of exons 5 and 6 of the VDR gene in Holstein cows. DNA extraction, genomic sequencing, phylogenetic analysis, synteny mapping and single nucleotide gene polymorphism analysis of the VDR gene were performed to assess blood samples collected from 50 clinically healthy Holstein cows. The results revealed the presence of a 450-base pair (bp) nucleotide sequence that resembled exons 5 and 6 with intron 5 enclosed between these exons. Sequence alignment and phylogenetic analysis revealed a close relationship between the sequenced VDR region and that found in Hereford cattle. A close association between this region and the corresponding region in small ruminants was also documented. Moreover, a single nucleotide polymorphism (SNP) that caused the replacement of a glutamate with an arginine in the deduced amino acid sequence was detected at position 7 of exon 5. In conclusion, Holstein and Hereford cattle differ with respect to exon 5 of the VDR gene. Phylogenetic analysis of the VDR gene based on nucleotide sequence produced different results from prior analyses based on amino acid sequence. Copyright © 2018 Elsevier Ltd. All rights reserved.

  15. HIV Sequence Compendium 2015

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Foley, Brian Thomas; Leitner, Thomas Kenneth; Apetrei, Cristian

    This compendium is an annual printed summary of the data contained in the HIV sequence database. We try to present a judicious selection of the data in such a way that it is of maximum utility to HIV researchers. Each of the alignments attempts to display the genetic variability within the different species, groups and subtypes of the virus. This compendium contains sequences published before January 1, 2015. Hence, though it is published in 2015 and called the 2015 Compendium, its contents correspond to the 2014 curated alignments on our website. The number of sequences in the HIV database is still increasing. In total, at the end of 2014, there were 624,121 sequences in the HIV Sequence Database, an increase of 7% since the previous year. This is the first year that the number of new sequences added to the database has decreased compared to the previous year. The number of near complete genomes (>7000 nucleotides) increased to 5834 by end of 2014. However, as in previous years, the compendium alignments contain only a fraction of these. A more complete version of all alignments is available on our website, http://www.hiv.lanl.gov/content/sequence/NEWALIGN/align.html. As always, we are open to complaints and suggestions for improvement. Inquiries and comments regarding the compendium should be addressed to seq-info@lanl.gov.

  16. Characteristics and phylogenetic analysis of the complete mitochondrial genome of Cheilodactylus quadricornis (Perciformes, Cheilodactylidae).

    PubMed

    Wang, Aishuai; Sun, Yuena; Wu, Changwen

    2016-11-01

    The complete mitochondrial genome of Cheilodactylus quadricornis was determined for the first time in the present study. The mitochondrial genome of C. quadricornis is 16 521 nucleotides long and comprises 13 protein-coding genes, 2 ribosomal RNA genes, 22 tRNA genes and 2 main non-coding regions (the control region and the origin of light-strand replication). The overall base composition was T, 26.3%; C, 29.6%; A, 27.8% and G, 16.3%. The gene arrangement, base composition, and tRNA structures of the complete mitochondrial genome of C. quadricornis are similar to those of other teleosts. Only two central conserved sequence blocks (CSB-2 and CSB-3) were identified in the control region. In addition, the conserved motif 5'-GCCGG-3' was identified in the origin of light-strand replication of C. quadricornis. The complete mitochondrial genome of C. quadricornis was used to construct a phylogenetic tree, which shows that C. quadricornis and C. variegatus clustered in a clade and formed a sister relationship. These mitogenome sequence data should be useful for population genetics and phylogenetic analysis of the Cheilodactylidae.
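    As a rough illustration of how an overall base composition like the one reported above is tabulated, the Python sketch below counts A, C, G and T in a sequence and converts the counts to percentages. The function name is hypothetical and the example input is a toy string, not the C. quadricornis mitogenome. The sketch also checks that the four percentages quoted in the abstract sum to 100% and derives the implied G+C fraction.

```python
from collections import Counter


def base_composition(seq: str) -> dict:
    """Percentage of each of A, C, G, T among the unambiguous bases of a sequence."""
    counts = Counter(b for b in seq.upper() if b in "ACGT")
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sequence contains no A, C, G or T bases")
    return {base: 100.0 * counts[base] / total for base in "ACGT"}


# Percentages as quoted in the abstract
reported = {"T": 26.3, "C": 29.6, "A": 27.8, "G": 16.3}
total_percent = sum(reported.values())
gc_percent = reported["G"] + reported["C"]
print(f"sum of reported percentages: {total_percent:.1f}%")
print(f"implied G+C content: {gc_percent:.1f}%")

# Toy sequence only (illustration of the counting step)
print(base_composition("AACGTTTA"))
```

    Ambiguity codes such as N are ignored in this sketch; a real analysis would need to decide how to treat them.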

  17. The Complete Chloroplast and Mitochondrial Genome Sequences of Boea hygrometrica: Insights into the Evolution of Plant Organellar Genomes

    PubMed Central

    Wang, Xumin; Deng, Xin; Zhang, Xiaowei; Hu, Songnian; Yu, Jun

    2012-01-01

    The complete nucleotide sequences of the chloroplast (cp) and mitochondrial (mt) genomes of the resurrection plant Boea hygrometrica (Bh, Gesneriaceae) have been determined, with lengths of 153,493 bp and 510,519 bp, respectively. The smaller chloroplast genome contains more genes (147) with a 72% coding sequence, and the larger mitochondrial genome has fewer genes (65) with a coding fraction of 12%. Similar to other seed plants, the Bh cp genome has a typical quadripartite organization with a conserved gene in each region. The Bh mt genome has three recombinant sequence repeats of 222 bp, 843 bp, and 1474 bp in length, which divide the genome into a single master circle (MC) and four isomeric molecules. Compared to other angiosperms, one remarkable feature of the Bh mt genome is the frequent transfer of genetic material from the cp genome during recent Bh evolution. We also analyzed organellar genome evolution in general regarding genome features as well as compositional dynamics of sequence and gene structure/organization, providing clues for the understanding of the evolution of organellar genomes in plants. The cp-derived sequences including tRNAs found in angiosperm mt genomes support the conclusion that frequent gene transfer events may have begun early in the land plant lineage. PMID:22291979

  18. The CD8α gene in duck (Anatidae): cloning, characterization, and expression during viral infection.

    PubMed

    Xu, Qi; Chen, Yang; Zhao, Wen Ming; Huang, Zheng Yang; Duan, Xiu Jun; Tong, Yi Yu; Zhang, Yang; Li, Xiu; Chang, Guo Bin; Chen, Guo Hong

    2015-02-01

    Cluster of differentiation 8 alpha (CD8α) is critical for cell-mediated immune defense and T-cell development. Although CD8α sequences have been reported for several species, very little is known about CD8α in ducks. To elucidate the mechanisms involved in the innate and adaptive immune responses of ducks, we cloned CD8α coding sequences from domestic, Muscovy, Mallard, and Spotbill ducks using reverse transcription polymerase chain reaction (RT-PCR). Each sequence consisted of 714 nucleotides and encoded a signal peptide, an IgV-like domain, a stalk region, a transmembrane region, and a cytoplasmic tail. We identified 58 nucleotide differences and 37 amino acid differences among the four types of duck; of these, 53 nucleotide and 33 amino acid differences were between Muscovy ducks and the other duck species. The CD8α cDNA sequence from domestic duck consisted of a 61-nucleotide 5' untranslated region (UTR), a 714-nucleotide open reading frame, and an 849-nucleotide 3' UTR. Multiple sequence alignments showed that the amino acid sequence of CD8α is conserved in vertebrates. RT-PCR revealed that expression of CD8α mRNA of domestic ducks was highest in the thymus and very low in the kidney, cerebrum, cerebellum, and muscle. Immunohistochemical analyses detected CD8α on the splenic corpuscle and periarterial lymphatic sheath of the spleen. CD8α mRNA in domestic ducklings was initially up-regulated, and then down-regulated, in the thymus, spleen, and liver after treatment with duck hepatitis virus type I (DHV-1) or the immunostimulant polyriboinosinic polyribocytidylic acid (poly I:C).

  19. Whole exome sequencing to estimate alloreactivity potential between donors and recipients in stem cell transplantation.

    PubMed

    Sampson, Juliana K; Sheth, Nihar U; Koparde, Vishal N; Scalora, Allison F; Serrano, Myrna G; Lee, Vladimir; Roberts, Catherine H; Jameson-Lee, Max; Ferreira-Gonzalez, Andrea; Manjili, Masoud H; Buck, Gregory A; Neale, Michael C; Toor, Amir A

    2014-08-01

    Whole exome sequencing (WES) was performed on stem cell transplant donor-recipient (D-R) pairs to determine the extent of potential antigenic variation at a molecular level. In a small cohort of D-R pairs, a high frequency of sequence variation was observed between the donor and recipient exomes independent of human leucocyte antigen (HLA) matching. Nonsynonymous, nonconservative single nucleotide polymorphisms were approximately twice as frequent in HLA-matched unrelated, compared with related D-R pairs. When mapped to individual chromosomes, these polymorphic nucleotides were uniformly distributed across the entire exome. In conclusion, WES reveals extensive nucleotide sequence variation in the exomes of HLA-matched donors and recipients. © 2014 John Wiley & Sons Ltd.

  20. Nucleotide sequences of Japanese isolates of citrus vein enation virus.

    PubMed

    Nakazono-Nagaoka, Eiko; Fujikawa, Takashi; Iwanami, Toru

    2017-03-01

    The genomic sequences of five Japanese isolates of citrus vein enation virus (CVEV) that induce vein enation were determined and compared with that of the Spanish isolate VE-1. The nucleotide sequences of all Japanese isolates were 5,983 nt in length. The genomic RNA of Japanese isolates had five potential open reading frames (ORF 0, ORF 1, ORF 2, ORF 3, and ORF 5) in the positive-sense strand. The nucleotide sequence identity among the Japanese isolates and Spanish isolate VE-1 ranged from 98.0% to 99.8%. Comparison of the partial amino acid sequences of ten Japanese isolates and three Spanish isolates suggested that four amino acid residues, at positions 83, 104, and 113 in ORF 2 and position 41 in ORF 5, might be unique to some Japanese isolates.
