sequence comparison revealed: Topics by Science.gov

Sample records for sequence comparison revealed

Host Cell Virus Entry Mediated by Australian Bat Lyssavirus Envelope G glycoprotein

DTIC Science & Technology

2013-10-24

39 Figure 7. Comparison of the amino acid sequences of Saccolaimus and Pteropus ABLV G mature protein... sequence analysis revealed that the PCR products were identical. Sequence comparisons of the ABLV N and other lyssavirus N proteins showed that ABLV...Saccolaimus flaviventris) (129). Nucleoprotein sequence comparisons revealed that the Saccolaimus N protein shared 96% amino acid homology with the Pteropus
Genome Comparisons Reveal a Dominant Mechanism of Chromosome Number Reduction in Grasses and Accelerated Genome Evolution in Triticeae

USDA-ARS?s Scientific Manuscript database

Single nucleotide polymorphism was employed in the construction of a high-resolution, expressed sequence tag (EST) map of Aegilops tauschii, the diploid source of the wheat D genome. Comparison of the map with the rice and sorghum genome sequences revealed 50 inversions and translocations; 2, 8, and...
Deep Sequencing Reveals a Divergent Ugandan cassava brown streak virus Isolate from Malawi

PubMed Central

Winter, Stephan; Mukasa, Settumba; Tairo, Fred; Sseruwagi, Peter; Ndunguru, Joseph; Duffy, Siobain

2017-01-01

ABSTRACT Illumina sequencing of RNA from a cassava cutting from northern Malawi produced a genome of Ugandan cassava brown streak virus (UCBSV-MW-NB7_2013). Sequence comparisons revealed stronger similarity to an isolate from nearby Tanzania (93.4% pairwise nucleotide identity) than to those previously reported from Malawi (86.9 to 87.0%). PMID:28818908
Re-sequencing regions of the ovine Y chromosome in domestic and wild sheep reveals novel paternal haplotypes.

PubMed

Meadows, J R S; Kijas, J W

2009-02-01

The male-specific region of the ovine Y chromosome (MSY) remains poorly characterized, yet sequence variants from this region have the potential to reveal the wild progenitor of domestic sheep or examples of domestic and wild paternal introgression. The 5' promoter region of the sex-determining gene SRY was re-sequenced using a subset of wild sheep including bighorn (Ovis canadensis), thinhorn (Ovis dalli spp.), urial (Ovis vignei), argali (Ovis ammon), mouflon (Ovis musimon) and domestic sheep (Ovis aries). Seven novel SNPs (oY2-oY8) were revealed; these were polymorphic between but not within species. Re-sequencing and fragment analysis was applied to the MSY microsatellite SRYM18. It contains a complex compound repeat structure and sequencing of three novel size fragments revealed that a pentanucleotide element remained fixed, whilst a dinucleotide element displayed variability within species. Comparison of the sequence between species revealed that urial and argali sheep grouped more closely to the mouflon and domestic breeds than the pachyceriforms (bighorn and thinhorn). SNP and microsatellite data were combined to define six previously undetected haplotypes. Analysis revealed the mouflon as the only species to share a haplotype with domestic sheep, consistent with its status as a feral domesticate that has undergone male-mediated exchange with domestic animals. A comparison of the remaining wild species and domestic sheep revealed that O. aries is free from signatures of wild sheep introgression.
Indigenous and introduced potyviruses of legumes and Passiflora spp. from Australia: biological properties and comparison of coat protein sequences

USDA-ARS?s Scientific Manuscript database

Coat protein sequences of 33 Potyvirus isolates from legume and Passiflora spp. were sequenced to determine the identity of infecting viruses. Phylogenetic analysis of the sequences revealed the presence of seven distinct virus species....
Sequence of the tomato chloroplast DNA and evolutionary comparison of solanaceous plastid genomes.

PubMed

Kahlau, Sabine; Aspinall, Sue; Gray, John C; Bock, Ralph

2006-08-01

Tomato, Solanum lycopersicum (formerly Lycopersicon esculentum), has long been one of the classical model species of plant genetics. More recently, solanaceous species have become a model of evolutionary genomics, with several EST projects and a tomato genome project having been initiated. As a first contribution toward deciphering the genetic information of tomato, we present here the complete sequence of the tomato chloroplast genome (plastome). The size of this circular genome is 155,461 base pairs (bp), with an average AT content of 62.14%. It contains 114 genes and conserved open reading frames (ycfs). Comparison with the previously sequenced plastid DNAs of Nicotiana tabacum and Atropa belladonna reveals patterns of plastid genome evolution in the Solanaceae family and identifies varying degrees of conservation of individual plastid genes. In addition, we discovered several new sites of RNA editing by cytidine-to-uridine conversion. A detailed comparison of editing patterns in the three solanaceous species highlights the dynamics of RNA editing site evolution in chloroplasts. To assess the level of intraspecific plastome variation in tomato, the plastome of a second tomato cultivar was sequenced. Comparison of the two genotypes (IPA-6, bred in South America, and Ailsa Craig, bred in Europe) revealed no nucleotide differences, suggesting that the plastomes of modern tomato cultivars display very little, if any, sequence variation.
Sequenced sorghum mutant library- an efficient platform for discovery of causal gene mutations

USDA-ARS?s Scientific Manuscript database

Ethyl methanesulfonate (EMS) efficiently generates high-density mutations in genomes. We applied whole-genome sequencing to 256 phenotyped mutant lines of sorghum (Sorghum bicolor L. Moench) to 16x coverage. Comparisons with the reference sequence revealed >1.8 million canonical EMS-induced G/C to A...
Identification of gyrB and rpoB gene mutations and differentially expressed proteins between a novobiocin-resistant Aeromonas hydrophila catfish vaccine strain and its virulent parent strain

USDA-ARS?s Scientific Manuscript database

Sequence comparison between the full-length 2412 bp DNA gyrase subunit B (gyrB) gene of a novobiocin resistant Aeromonas hydrophila AH11NOVO vaccine strain and that of its virulent parent strain AH11P revealed 10 missense mutations. Similarly, sequence comparison between the full-length 4092 bp RNA ...
Complete Genome Sequence of ER2796, a DNA Methyltransferase-Deficient Strain of Escherichia coli K-12.

PubMed

Anton, Brian P; Mongodin, Emmanuel F; Agrawal, Sonia; Fomenkov, Alexey; Byrd, Devon R; Roberts, Richard J; Raleigh, Elisabeth A

2015-01-01

We report the complete sequence of ER2796, a laboratory strain of Escherichia coli K-12 that is completely defective in DNA methylation. Because of its lack of any native methylation, it is extremely useful as a host into which heterologous DNA methyltransferase genes can be cloned and the recognition sequences of their products deduced by Pacific Biosciences Single-Molecule Real Time (SMRT) sequencing. The genome was itself sequenced from a long-insert library using the SMRT platform, resulting in a single closed contig devoid of methylated bases. Comparison with K-12 MG1655, the first E. coli K-12 strain to be sequenced, shows an essentially co-linear relationship with no major rearrangements despite many generations of laboratory manipulation. The comparison revealed a total of 41 insertions and deletions, and 228 single base pair substitutions. In addition, the long-read approach facilitated the surprising discovery of four gene conversion events, three involving rRNA operons and one between two cryptic prophages. Such events thus contribute both to genomic homogenization and to bacteriophage diversification. As one of relatively few laboratory strains of E. coli to be sequenced, the genome also reveals the sequence changes underlying a number of classical mutant alleles including those affecting the various native DNA methylation systems.
Complete Genome Sequence of ER2796, a DNA Methyltransferase-Deficient Strain of Escherichia coli K-12

PubMed Central

Anton, Brian P.; Mongodin, Emmanuel F.; Agrawal, Sonia; Fomenkov, Alexey; Byrd, Devon R.; Roberts, Richard J.; Raleigh, Elisabeth A.

2015-01-01

We report the complete sequence of ER2796, a laboratory strain of Escherichia coli K-12 that is completely defective in DNA methylation. Because of its lack of any native methylation, it is extremely useful as a host into which heterologous DNA methyltransferase genes can be cloned and the recognition sequences of their products deduced by Pacific Biosciences Single-Molecule Real Time (SMRT) sequencing. The genome was itself sequenced from a long-insert library using the SMRT platform, resulting in a single closed contig devoid of methylated bases. Comparison with K-12 MG1655, the first E. coli K-12 strain to be sequenced, shows an essentially co-linear relationship with no major rearrangements despite many generations of laboratory manipulation. The comparison revealed a total of 41 insertions and deletions, and 228 single base pair substitutions. In addition, the long-read approach facilitated the surprising discovery of four gene conversion events, three involving rRNA operons and one between two cryptic prophages. Such events thus contribute both to genomic homogenization and to bacteriophage diversification. As one of relatively few laboratory strains of E. coli to be sequenced, the genome also reveals the sequence changes underlying a number of classical mutant alleles including those affecting the various native DNA methylation systems. PMID:26010885
A clone-free, single molecule map of the domestic cow (Bos taurus) genome.

PubMed

Zhou, Shiguo; Goldstein, Steve; Place, Michael; Bechner, Michael; Patino, Diego; Potamousis, Konstantinos; Ravindran, Prabu; Pape, Louise; Rincon, Gonzalo; Hernandez-Ortiz, Juan; Medrano, Juan F; Schwartz, David C

2015-08-28

The cattle (Bos taurus) genome was originally selected for sequencing due to its economic importance and unique biology as a model organism for understanding other ruminants, or mammals. Currently, there are two cattle genome sequence assemblies (UMD3.1 and Btau4.6) from groups using dissimilar assembly algorithms, which were complemented by genetic and physical map resources. However, past comparisons between these assemblies revealed substantial differences. Consequently, such discordances have engendered ambiguities when using reference sequence data, impacting genomic studies in cattle and motivating construction of a new optical map resource--BtOM1.0--to guide comparisons and improvements to the current sequence builds. Accordingly, our comprehensive comparisons of BtOM1.0 against the UMD3.1 and Btau4.6 sequence builds tabulate large-to-immediate scale discordances requiring mediation. The optical map, BtOM1.0, spanning the B. taurus genome (Hereford breed, L1 Dominette 01449) was assembled from an optical map dataset consisting of 2,973,315 (439 X; raw dataset size before assembly) single molecule optical maps (Rmaps; 1 Rmap = 1 restriction mapped DNA molecule) generated by the Optical Mapping System. The BamHI map spans 2,575.30 Mb and comprises 78 optical contigs assembled by a combination of iterative (using the reference sequence: UMD3.1) and de novo assembly techniques. BtOM1.0 is a high-resolution physical map featuring an average restriction fragment size of 8.91 Kb. Comparisons of BtOM1.0 vs. UMD3.1, or Btau4.6, revealed that Btau4.6 presented far more discordances (7,463) vs. UMD3.1 (4,754). Overall, we found that Btau4.6 presented almost double the number of discordances than UMD3.1 across most of the 6 categories of sequence vs. map discrepancies, which are: COMPLEX (misassembly), DELs (extraneous sequences), INSs (missing sequences), ITs (Inverted/Translocated sequences), ECs (extra restriction cuts) and MCs (missing restriction cuts). Alignments of UMD3.1 and Btau4.6 to BtOM1.0 reveal discordances commensurate with previous reports, and affirm the NCBI's current designation of UMD3.1 sequence assembly as the "reference assembly" and the Btau4.6 as the "alternate assembly." The cattle genome optical map, BtOM1.0, when used as a comprehensive and largely independent guide, will greatly assist improvements to existing sequence builds, and later serve as an accurate physical scaffold for studies concerning the comparative genomics of cattle breeds.
The phosphotransferase system-dependent sucrose utilization regulon in enteropathogenic Escherichia coli strains is located in a variable chromosomal region containing iap sequences.

PubMed

Treviño-Quintanilla, Luis Gerardo; Escalante, Adelfo; Caro, Alma Delia; Martínez, Alfredo; González, Ricardo; Puente, José Luis; Bolívar, Francisco; Gosset, Guillermo

2007-01-01

The capacity to utilize sucrose as a carbon and energy source (Scr(+) phenotype) is a highly variable trait among Escherichia coli strains. In this study, seven enteropathogenic E. coli (EPEC) strains from different sources were studied for their capacity to grow using sucrose. Liquid media cultures showed that all analyzed strains have the Scr(+) phenotype and two distinct groups were defined: one of five and another of two strains displaying doubling times of 67 and 125 min, respectively. The genes conferring the Scr(+) phenotype in one of the fast-growing strains (T19) were cloned and sequenced. Comparative sequence analysis revealed that this strain possesses the scr regulon genes scrKYABR, encoding phosphoenolpyruvate:phosphotransferase system-dependent sucrose transport and utilization activities. Transcript level quantification revealed sucrose-dependent induction of scrK and scrR genes in fast-growing strains, whereas no transcripts were detected in slow-growing strains. Sequence comparison analysis revealed that the scr genes in strain T19 are almost identical to those present in the scr regulon of prototype EPEC E2348/69 and in both strains, the scr genes are inserted in the chromosomal intergenic region of hypothetical genes ygcE and ygcF. Comparison of the ygcE-ygcF intergenic region sequence of strains MG1655, enterohemorrhagic EDL933, uropathogenic ECFT073 and EPEC T19-E2348/69 revealed that the number of extragenic highly repeated iap sequences corresponded to nine, four, two and none, respectively. These results show that the iap sequence-containing chromosomal ygcE-ygcF intergenic region is highly variable in E. coli. Copyright (c) 2007 S. Karger AG, Basel.
Intervening sequences in a plant gene-comparison of the partial sequence of cDNA and genomic DNA of French bean phaseolin

NASA Astrophysics Data System (ADS)

Sun, S. M.; Slightom, J. L.; Hall, T. C.

1981-01-01

A plant gene coding for the major storage protein (phaseolin, G1-globulin) of the French bean was isolated from a genomic library constructed in the phage vector Charon 24A. Comparison of the nucleotide sequence of part of the gene with that of the cloned messenger RNA (cDNA) revealed the presence of three intervening sequences, all beginning with GTand ending with AG. The 5' and 3' boundaries of intervening sequences TVS-A (88 base pairs) and IVS-B (124 base pairs) are similar to those described for animal and viral genes, but the 3' boundary of IVS-C (129 base pairs) shows some differences. A sequence of 185 amino acids deduced from the cloned DMAs represents about 40% of a phaseolin polypeptide.
Isolation of an insertion sequence (IS1051) from Xanthomonas campestris pv. dieffenbachiae with potential use for strain identification and characterization.

PubMed Central

Berthier, Y; Thierry, D; Lemattre, M; Guesdon, J L

1994-01-01

A new insertion sequence was isolated from Xanthomonas campestris pv. dieffenbachiae. Sequence analysis showed that this element is 1,158 bp long and has 15-bp inverted repeat ends containing two mismatches. Comparison of this sequence with sequences in data bases revealed significant homology with Escherichia coli IS5. IS1051, which detected multiple restriction fragment length polymorphisms, was used as a probe to characterize strains from the pathovar dieffenbachiae. Images PMID:7906933
Conserved features of eukaryotic hsp70 genes revealed by comparison with the nucleotide sequence of human hsp70.

PubMed Central

Hunt, C; Morimoto, R I

1985-01-01

We have determined the nucleotide sequence of the human hsp70 gene and 5' flanking region. The hsp70 gene is transcribed as an uninterrupted primary transcript of 2440 nucleotides composed of a 5' noncoding leader sequence of 212 nucleotides, a 3' noncoding region of 242 nucleotides, and a continuous open reading frame of 1986 nucleotides that encodes a protein with predicted molecular mass of 69,800 daltons. Upstream of the 5' terminus are the canonical TATAAA box, the sequence ATTGG that corresponds in the inverted orientation to the CCAAT motif, and the dyad sequence CTGGAAT/ATTCCCG that shares homology in 12 of 14 positions with the consensus transcription regulatory sequence common to Drosophila heat shock genes. Comparison of the predicted amino acid sequences of human hsp70 with the published sequences of Drosophila hsp70 and Escherichia coli dnaK reveals that human hsp70 is 73% identical to Drosophila hsp70 and 47% identical to E. coli dnaK. Surprisingly, the nucleotide sequences of the human and Drosophila genes are 72% identical and human and E. coli genes are 50% identical, which is more highly conserved than necessary given the degeneracy of the genetic code. The lack of accumulated silent nucleotide substitutions leads us to propose that there may be additional information in the nucleotide sequence of the hsp70 gene or the corresponding mRNA that precludes the maximum divergence allowed in the silent codon positions. PMID:3931075
Amino acid sequences of ribosomal proteins S11 from Bacillus stearothermophilus and S19 from Halobacterium marismortui. Comparison of the ribosomal protein S11 family.

PubMed

Kimura, M; Kimura, J; Hatakeyama, T

1988-11-21

The complete amino acid sequences of ribosomal proteins S11 from the Gram-positive eubacterium Bacillus stearothermophilus and of S19 from the archaebacterium Halobacterium marismortui have been determined. A search for homologous sequences of these proteins revealed that they belong to the ribosomal protein S11 family. Homologous proteins have previously been sequenced from Escherichia coli as well as from chloroplast, yeast and mammalian ribosomes. A pairwise comparison of the amino acid sequences showed that Bacillus protein S11 shares 68% identical residues with S11 from Escherichia coli and a slightly lower homology (52%) with the homologous chloroplast protein. The halophilic protein S19 is more related to the eukaryotic (45-49%) than to the eubacterial counterparts (35%).
High throughput sequencing analysis of RNA libraries reveals the influences of initial library and PCR methods on SELEX efficiency.

PubMed

Takahashi, Mayumi; Wu, Xiwei; Ho, Michelle; Chomchan, Pritsana; Rossi, John J; Burnett, John C; Zhou, Jiehua

2016-09-22

The systemic evolution of ligands by exponential enrichment (SELEX) technique is a powerful and effective aptamer-selection procedure. However, modifications to the process can dramatically improve selection efficiency and aptamer performance. For example, droplet digital PCR (ddPCR) has been recently incorporated into SELEX selection protocols to putatively reduce the propagation of byproducts and avoid selection bias that result from differences in PCR efficiency of sequences within the random library. However, a detailed, parallel comparison of the efficacy of conventional solution PCR versus the ddPCR modification in the RNA aptamer-selection process is needed to understand effects on overall SELEX performance. In the present study, we took advantage of powerful high throughput sequencing technology and bioinformatics analysis coupled with SELEX (HT-SELEX) to thoroughly investigate the effects of initial library and PCR methods in the RNA aptamer identification. Our analysis revealed that distinct "biased sequences" and nucleotide composition existed in the initial, unselected libraries purchased from two different manufacturers and that the fate of the "biased sequences" was target-dependent during selection. Our comparison of solution PCR- and ddPCR-driven HT-SELEX demonstrated that PCR method affected not only the nucleotide composition of the enriched sequences, but also the overall SELEX efficiency and aptamer efficacy.
A confidence interval analysis of sampling effort, sequencing depth, and taxonomic resolution of fungal community ecology in the era of high-throughput sequencing.

PubMed

Oono, Ryoko

2017-01-01

High-throughput sequencing technology has helped microbial community ecologists explore ecological and evolutionary patterns at unprecedented scales. The benefits of a large sample size still typically outweigh that of greater sequencing depths per sample for accurate estimations of ecological inferences. However, excluding or not sequencing rare taxa may mislead the answers to the questions 'how and why are communities different?' This study evaluates the confidence intervals of ecological inferences from high-throughput sequencing data of foliar fungal endophytes as case studies through a range of sampling efforts, sequencing depths, and taxonomic resolutions to understand how technical and analytical practices may affect our interpretations. Increasing sampling size reliably decreased confidence intervals across multiple community comparisons. However, the effects of sequencing depths on confidence intervals depended on how rare taxa influenced the dissimilarity estimates among communities and did not significantly decrease confidence intervals for all community comparisons. A comparison of simulated communities under random drift suggests that sequencing depths are important in estimating dissimilarities between microbial communities under neutral selective processes. Confidence interval analyses reveal important biases as well as biological trends in microbial community studies that otherwise may be ignored when communities are only compared for statistically significant differences.
A confidence interval analysis of sampling effort, sequencing depth, and taxonomic resolution of fungal community ecology in the era of high-throughput sequencing

PubMed Central

2017-01-01

High-throughput sequencing technology has helped microbial community ecologists explore ecological and evolutionary patterns at unprecedented scales. The benefits of a large sample size still typically outweigh that of greater sequencing depths per sample for accurate estimations of ecological inferences. However, excluding or not sequencing rare taxa may mislead the answers to the questions ‘how and why are communities different?’ This study evaluates the confidence intervals of ecological inferences from high-throughput sequencing data of foliar fungal endophytes as case studies through a range of sampling efforts, sequencing depths, and taxonomic resolutions to understand how technical and analytical practices may affect our interpretations. Increasing sampling size reliably decreased confidence intervals across multiple community comparisons. However, the effects of sequencing depths on confidence intervals depended on how rare taxa influenced the dissimilarity estimates among communities and did not significantly decrease confidence intervals for all community comparisons. A comparison of simulated communities under random drift suggests that sequencing depths are important in estimating dissimilarities between microbial communities under neutral selective processes. Confidence interval analyses reveal important biases as well as biological trends in microbial community studies that otherwise may be ignored when communities are only compared for statistically significant differences. PMID:29253889
Whole genome comparison of a large collection of mycobacteriophages reveals a continuum of phage genetic diversity

PubMed Central

Pope, Welkin H; Bowman, Charles A; Russell, Daniel A; Jacobs-Sera, Deborah; Asai, David J; Cresawn, Steven G; Jacobs, William R; Hendrix, Roger W; Lawrence, Jeffrey G; Hatfull, Graham F; Abbazia, Patrick; Ababio, Amma; Adam, Naazneen

2015-01-01

The bacteriophage population is large, dynamic, ancient, and genetically diverse. Limited genomic information shows that phage genomes are mosaic, and the genetic architecture of phage populations remains ill-defined. To understand the population structure of phages infecting a single host strain, we isolated, sequenced, and compared 627 phages of Mycobacterium smegmatis. Their genetic diversity is considerable, and there are 28 distinct genomic types (clusters) with related nucleotide sequences. However, amino acid sequence comparisons show pervasive genomic mosaicism, and quantification of inter-cluster and intra-cluster relatedness reveals a continuum of genetic diversity, albeit with uneven representation of different phages. Furthermore, rarefaction analysis shows that the mycobacteriophage population is not closed, and there is a constant influx of genes from other sources. Phage isolation and analysis was performed by a large consortium of academic institutions, illustrating the substantial benefits of a disseminated, structured program involving large numbers of freshman undergraduates in scientific discovery. DOI: http://dx.doi.org/10.7554/eLife.06416.001 PMID:25919952

Whole genome comparison of a large collection of mycobacteriophages reveals a continuum of phage genetic diversity.

PubMed

Pope, Welkin H; Bowman, Charles A; Russell, Daniel A; Jacobs-Sera, Deborah; Asai, David J; Cresawn, Steven G; Jacobs, William R; Hendrix, Roger W; Lawrence, Jeffrey G; Hatfull, Graham F

2015-04-28

The bacteriophage population is large, dynamic, ancient, and genetically diverse. Limited genomic information shows that phage genomes are mosaic, and the genetic architecture of phage populations remains ill-defined. To understand the population structure of phages infecting a single host strain, we isolated, sequenced, and compared 627 phages of Mycobacterium smegmatis. Their genetic diversity is considerable, and there are 28 distinct genomic types (clusters) with related nucleotide sequences. However, amino acid sequence comparisons show pervasive genomic mosaicism, and quantification of inter-cluster and intra-cluster relatedness reveals a continuum of genetic diversity, albeit with uneven representation of different phages. Furthermore, rarefaction analysis shows that the mycobacteriophage population is not closed, and there is a constant influx of genes from other sources. Phage isolation and analysis was performed by a large consortium of academic institutions, illustrating the substantial benefits of a disseminated, structured program involving large numbers of freshman undergraduates in scientific discovery.
Gene conversion as a secondary mechanism of short interspersed element (SINE) evolution

DOE Office of Scientific and Technical Information (OSTI.GOV)

Kass, D.H.; Batzer, M.A.; Deininger, P.L.

The Alu repetitive family of short interspersed elements (SINEs) in primates can be subdivided into distinct subfamilies by specific diagnostic nucleotide changes. The older subfamilies are generally very abundant, while the younger subfamilies have fewer copies. Some of the youngest Alu elements are absent in the orthologous loci of nonhuman primates, indicative of recent retroposition events, the primary mode of SINE evolutions. PCR analysis of one young Alu subfamily (Sb2) member found in the low-density lipoprotein receptor gene apparently revealed the presence of this element in the green monkey, orangutan, gorilla, and chimpanzee genomes, as well as the human genome.more » However, sequence analysis of these genomes revealed a highly mutated, older, primate-specific Alu element was present at this position in the nonhuman primates. Comparison of the flanking DNA sequences upstream of this Alu insertion corresponded to evolution expected for standard primate phylogeny, but comparison of the Alu repeat sequences revealed that the human element departed from this phylogeny. The change in the human sequence apparently occurred by a gene conversion event only within the Alu element itself, converting it from one of the oldest to one of the youngest Alu subfamilies. Although gene conversions of Alu elements are clearly very rare, this finding shows that such events can occur and contribute to specific cases of SINE subfamily evolution.« less
Evolution of the herpes thymidine kinase: identification and comparison of the equine herpesvirus 1 thymidine kinase gene reveals similarity to a cell-encoded thymidylate kinase.

PubMed Central

Robertson, G R; Whalley, J M

1988-01-01

We have identified the equine herpesvirus 1 (EHV-1) thymidine kinase gene (TK) by DNA-mediated transformation and by DNA sequencing. Alignment of the amino acid sequence of the EHV-1 TK with the TKs from 3 other herpesviruses revealed regions of homology, some of which correspond to the previously identified substrate binding sites, while others have as yet, no assigned function. In particular, the strict conservation of an aspartate within the proposed nucleoside binding site suggests a role in ATP binding for this residue. Comparison of 5 herpes TKs with the thymidylate kinase of yeast revealed significant similarity which was strongest in those regions important to catalytic activity of the herpes TKs, and, therefore we propose that the herpes TK may be derived from a cellular thymidylate kinase. The implications for the evolution of enzyme activities within a pathway of nucleotide metabolism are discussed. PMID:2849761
Conservation in the face of diversity: multistrain analysis of an intracellular bacterium

USDA-ARS?s Scientific Manuscript database

Comparisons of multiple strains revealed that A. marginale has a closed-core genome with few highly plastic regions, which include the msp2 and msp3 genes, as well as the aaap locus. Comparison of the Florida and St. Maries genome sequences found that SNPs comprise 0.8% of the longer Florida genome,...
Low-coverage single-cell mRNA sequencing reveals cellular heterogeneity and activated signaling pathways in developing cerebral cortex.

PubMed

Pollen, Alex A; Nowakowski, Tomasz J; Shuga, Joe; Wang, Xiaohui; Leyrat, Anne A; Lui, Jan H; Li, Nianzhen; Szpankowski, Lukasz; Fowler, Brian; Chen, Peilin; Ramalingam, Naveen; Sun, Gang; Thu, Myo; Norris, Michael; Lebofsky, Ronald; Toppani, Dominique; Kemp, Darnell W; Wong, Michael; Clerkson, Barry; Jones, Brittnee N; Wu, Shiquan; Knutsson, Lawrence; Alvarado, Beatriz; Wang, Jing; Weaver, Lesley S; May, Andrew P; Jones, Robert C; Unger, Marc A; Kriegstein, Arnold R; West, Jay A A

2014-10-01

Large-scale surveys of single-cell gene expression have the potential to reveal rare cell populations and lineage relationships but require efficient methods for cell capture and mRNA sequencing. Although cellular barcoding strategies allow parallel sequencing of single cells at ultra-low depths, the limitations of shallow sequencing have not been investigated directly. By capturing 301 single cells from 11 populations using microfluidics and analyzing single-cell transcriptomes across downsampled sequencing depths, we demonstrate that shallow single-cell mRNA sequencing (~50,000 reads per cell) is sufficient for unbiased cell-type classification and biomarker identification. In the developing cortex, we identify diverse cell types, including multiple progenitor and neuronal subtypes, and we identify EGR1 and FOS as previously unreported candidate targets of Notch signaling in human but not mouse radial glia. Our strategy establishes an efficient method for unbiased analysis and comparison of cell populations from heterogeneous tissue by microfluidic single-cell capture and low-coverage sequencing of many cells.
Validation of Skeletal Muscle cis-Regulatory Module Predictions Reveals Nucleotide Composition Bias in Functional Enhancers

PubMed Central

Kwon, Andrew T.; Chou, Alice Yi; Arenillas, David J.; Wasserman, Wyeth W.

2011-01-01

We performed a genome-wide scan for muscle-specific cis-regulatory modules (CRMs) using three computational prediction programs. Based on the predictions, 339 candidate CRMs were tested in cell culture with NIH3T3 fibroblasts and C2C12 myoblasts for capacity to direct selective reporter gene expression to differentiated C2C12 myotubes. A subset of 19 CRMs validated as functional in the assay. The rate of predictive success reveals striking limitations of computational regulatory sequence analysis methods for CRM discovery. Motif-based methods performed no better than predictions based only on sequence conservation. Analysis of the properties of the functional sequences relative to inactive sequences identifies nucleotide sequence composition can be an important characteristic to incorporate in future methods for improved predictive specificity. Muscle-related TFBSs predicted within the functional sequences display greater sequence conservation than non-TFBS flanking regions. Comparison with recent MyoD and histone modification ChIP-Seq data supports the validity of the functional regions. PMID:22144875
Complete genome sequence analysis of novel human bocavirus reveals genetic recombination between human bocavirus 2 and human bocavirus 4.

PubMed

Khamrin, Pattara; Okitsu, Shoko; Ushijima, Hiroshi; Maneekarn, Niwat

2013-07-01

Epidemiological surveillance of human bocavirus (HBoV) was conducted on fecal specimens collected from hospitalized children with diarrhea in Chiang Mai, Thailand in 2011. By partial sequence analysis of VP1 gene, an unusual strain of HBoV (CMH-S011-11), was initially identified as HBoV4. The complete genome sequence of CMH-S011-11 was performed and analyzed further to clarify whether it was a recombinant strain or a new HBoV variant. Analysis of complete genome sequence revealed that the coding sequence starting from NS1, NP1 to VP1/VP2 was 4795 nucleotides long. Interestingly, the nucleotide sequence of NS1 gene of CMH-S011-11 was most closely related to the HBoV2 reference strains detected in Pakistan, which contradicted to the initial genotyping result of the partial VP1 region in the previous study. In addition, comparison of NP1 nucleotide sequence of CMH-S011-11 with those of other HBoV1-4 reference strains also revealed a high level of sequence identity with HBoV2. On the other hand, nucleotide sequence of VP1/VP2 gene of CMH-S011-11 was most closely related to those of HBoV4 reference strains detected in Nigeria. The overall full-length sequence analysis revealed that this CMH-S011-11 was grouped within HBoV4 species, but located in a separate branch from other HBoV4 prototype strains. Recombination analysis revealed that CMH-S011-11 was the result of recombination between HBoV2 and HBoV4 strains with the break point located near the start codon of VP2. Copyright © 2013 Elsevier B.V. All rights reserved.
Mitochondrial genes in the colourless alga Prototheca wickerhamii resemble plant genes in their exons but fungal genes in their introns.

PubMed Central

Wolff, G; Burger, G; Lang, B F; Kück, U

1993-01-01

The mitochondrial DNA from the colourless alga Prototheca wickerhamii contains two mosaic genes as was revealed from complete sequencing of the circular extranuclear genome. The genes for the large subunit of the ribosomal RNA (LSUrRNA) as well as for subunit I of the cytochrome oxidase (coxI) carry two and three intronic sequences respectively. On the basis of their canonical nucleotide sequences they can be classified as group I introns. Phylogenetic comparisons of the coxI protein sequences allow us to conclude that the P.wickerhamii mtDNA is much closer related to higher plant mtDNAs than to those of the chlorophyte alga C.reinhardtii. The comparison of the intron sequences revealed several unusual features: (1) The P.wickerhamii introns are structurally related to mitochondrial introns from various ascomycetous fungi. (2) Phylogenetic analyses indicate a close relationship between fungal and algal intronic sequences. (3) The P. wickerhamii introns are located at positions within the structural genes which can be considered as preferred intron insertion sites in homologous mitochondrial genes from fungi or liverwort. In all cases, the sequences adjacent to the insertion sites are very well conserved over large evolutionary distances. Our finding of highly similar introns in fungi and algae is consistent with the idea that introns have already been present in the bacterial ancestors of present day mitochondria and evolved concomitantly with the organelles. PMID:7680126
Analysis of expressed sequence tags from Maize mosaic rhabdovirus-infected gut tissues of Peregrinus maidis reveals the presence of key components of insect innate immunity.

PubMed

Whitfield, A E; Rotenberg, D; Aritua, V; Hogenhout, S A

2011-04-01

The corn planthopper, Peregrinus maidis, causes direct feeding damage to plants and transmits Maize mosaic rhabdovirus (MMV) in a persistent-propagative manner. MMV must cross several insect tissue layers for successful transmission to occur, and the gut serves as an important barrier for rhabdovirus transmission. In order to facilitate the identification of proteins that may interact with MMV either by facilitating acquisition or responding to virus infection, we generated and analysed the gut transcriptome of P. maidis. From two normalized cDNA libraries, we generated a P. maidis gut transcriptome composed of 20,771 expressed sequence tags (ESTs). Assembly of the sequences yielded 1860 contigs and 14,032 singletons, and biological roles were assigned to 5793 (36%). Comparison of P. maidis ESTs with other insect amino acid sequences revealed that P. maidis shares greatest sequence similarity with another hemipteran, the brown planthopper Nilaparvata lugens. We identified 202 P. maidis transcripts with putative homology to proteins associated with insect innate immunity, including those implicated in the Toll, Imd, JAK/STAT, Jnk and the small-interfering RNA-mediated pathways. Sequence comparisons between our P. maidis gut EST collection and the currently available National Center for Biotechnology Information EST database collection for Ni. lugens revealed that a pathogen recognition receptor in the Imd pathway, peptidoglycan recognition protein-long class (PGRP-LC), is present in these two members of the family Delphacidae; however, these recognition receptors are lacking in the model hemipteran Acyrthosiphon pisum. In addition, we identified sequences in the P. maidis gut transcriptome that share significant amino acid sequence similarities with the rhabdovirus receptor molecule, acetylcholine receptor (AChR), found in other hosts. This EST analysis sheds new light on immune response pathways in hemipteran guts that will be useful for further dissecting innate defence response pathways to rhabdovirus infection. © 2011 The Authors. Insect Molecular Biology © 2011 The Royal Entomological Society.
Insights into the phylogenetic positions of photosynthetic bacteria obtained from 5S rRNA and 16S rRNA sequence data

NASA Technical Reports Server (NTRS)

Fox, G. E.

1985-01-01

Comparisons of complete 16S ribosomal ribonucleic acid (rRNA) sequences established that the secondary structure of these molecules is highly conserved. Earlier work with 5S rRNA secondary structure revealed that when structural conservation exists the alignment of sequences is straightforward. The constancy of structure implies minimal functional change. Under these conditions a uniform evolutionary rate can be expected so that conditions are favorable for phylogenetic tree construction.
Isolation and molecular characterization of partial FSH and LH receptor genes in Arabian camels (Camelus dromedarius)

PubMed Central

Jelokhani-Niaraki, Saber; Tahmoorespur, Mojtaba; Bitaraf-Sani, Morteza

2015-01-01

Very little is known about LHR and FSHR genes of domestic dromedary camels. The main objective of this study was to determine and analyze partial genomic regions of FSHR and LHR genes in dromedary camels for the first time. To this end, a total of50 DNA samples belonging to dromedary camels raised in Iran were sent for sequencing (25 samples of each gene). We compared the nucleotide sequences of Camelus dromedarius with corresponding sequences of previously published FSHR and LHR genes in bactrian camels and other species. According to the data, the same nucleotide variation was identified in both regions of the two camel species. The alignment of deduced protein sequences of the two different species revealed an amino acid variation at the FSHR region. No evidence of amino acid variation was observed, however, in LHR sequences. Phylogenetic analysis indicated that both camel species had a close relationship and clustered together in a separate branch. This was further confirmed by genetic distance values illustrating significant sequence identity between Camelus dromedarius and Camelus bactrianus. Interestingly, sequence comparisons revealed heterozygote patterns in FSHR sequences isolated from dromedary camels of Iran. In comparison to other species, this camel contains three amino acid substitutions at 5, 67, and 105 positions in the FSHR coding region. These positions are found exclusively in camels and can be considered as species specific. The results of our study can be used for hormone functionality research (FSHR and LHR) as well as reproduction-linked polymorphisms and breeding programs. PMID:27844002
Isolation and molecular characterization of partial FSH and LH receptor genes in Arabian camels (Camelus dromedarius).

PubMed

Jelokhani-Niaraki, Saber; Tahmoorespur, Mojtaba; Bitaraf-Sani, Morteza

2015-06-01

Very little is known about LHR and FSHR genes of domestic dromedary camels. The main objective of this study was to determine and analyze partial genomic regions of FSHR and LHR genes in dromedary camels for the first time. To this end, a total of50 DNA samples belonging to dromedary camels raised in Iran were sent for sequencing (25 samples of each gene). We compared the nucleotide sequences of Camelus dromedarius with corresponding sequences of previously published FSHR and LHR genes in bactrian camels and other species. According to the data, the same nucleotide variation was identified in both regions of the two camel species. The alignment of deduced protein sequences of the two different species revealed an amino acid variation at the FSHR region. No evidence of amino acid variation was observed, however, in LHR sequences. Phylogenetic analysis indicated that both camel species had a close relationship and clustered together in a separate branch. This was further confirmed by genetic distance values illustrating significant sequence identity between Camelus dromedarius and Camelus bactrianus . Interestingly, sequence comparisons revealed heterozygote patterns in FSHR sequences isolated from dromedary camels of Iran. In comparison to other species, this camel contains three amino acid substitutions at 5, 67, and 105 positions in the FSHR coding region. These positions are found exclusively in camels and can be considered as species specific. The results of our study can be used for hormone functionality research ( FSHR and LHR ) as well as reproduction-linked polymorphisms and breeding programs.
Cloning and sequence analysis of Hemonchus contortus HC58cDNA.

PubMed

Muleke, Charles I; Ruofeng, Yan; Lixin, Xu; Xinwen, Bo; Xiangrui, Li

2007-06-01

The complete coding sequence of Hemonchus contortus HC58cDNA was generated by rapid amplification of cDNA ends and polymerase chain reaction using primers based on the 5' and 3' ends of the parasite mRNA, accession no. AF305964. The HC58cDNA gene was 851 bp long, with open reading frame of 717 bp, precursors to 239 amino acids coding for approximately 27 kDa protein. Analysis of amino acid sequence revealed conserved residues of cysteine, histidine, asparagine, occluding loop pattern, hemoglobinase motif and glutamine of the oxyanion hole characteristic of cathepsin B like proteases (CBL). Comparison of the predicted amino acid sequences showed the protein shared 33.5-58.7% identity to cathepsin B homologues in the papain clan CA family (family C1). Phylogenetic analysis revealed close evolutionary proximity of the protein sequence to counterpart sequences in the CBL, suggesting that HC58cDNA was a member of the papain family.
Positive Streptobacillus moniliformis PCR in guinea pigs likely due to Leptotrichia spp.

PubMed

Boot, Ron; Van de Berg, Lia; Reubsaet, Frans A G; Vlemminx, Maurice J

2008-04-30

Streptobacillus moniliformis is a zoonotic bacterium. We obtained positive S. moniliformis PCR results in oral swab samples from guinea pigs from an experimental colony and the breeding colony of origin. Comparison of the DNA sequence of an amplicon with deposited 16S rDNA sequences revealed that Leptotrichia sp. can be the source of a false positive S. moniliformis PCR outcome.
Analysis of DNA methylation in Arabidopsis thaliana based on methylation-sensitive AFLP markers.

PubMed

Cervera, M T; Ruiz-García, L; Martínez-Zapater, J M

2002-12-01

AFLP analysis using restriction enzyme isoschizomers that differ in their sensitivity to methylation of their recognition sites has been used to analyse the methylation state of anonymous CCGG sequences in Arabidopsis thaliana. The technique was modified to improve the quality of fingerprints and to visualise larger numbers of scorable fragments. Sequencing of amplified fragments indicated that detection was generally associated with non-methylation of the cytosine to which the isoschizomer is sensitive. Comparison of EcoRI/ HpaII and EcoRI/ MspI patterns in different ecotypes revealed that 35-43% of CCGG sites were differentially digested by the isoschizomers. Interestingly, the pattern of digestion among different plants belonging to the same ecotype is highly conserved, with the rate of intra-ecotype methylation-sensitive polymorphisms being less than 1%. However, pairwise comparisons of methylation patterns between samples belonging to different ecotypes revealed differences in up to 34% of the methylation-sensitive polymorphisms. The lack of correlation between inter-ecotype similarity matrices based on methylation-insensitive or methylation-sensitive polymorphisms suggests that whatever the mechanisms regulating methylation may be, they are not related to nucleotide sequence variation.
Characterization of a cDNA encoding a protein involved in formation of the skeleton during development of the sea urchin Lytechinus pictus.

PubMed

Livingston, B T; Shaw, R; Bailey, A; Wilt, F

1991-12-01

In order to investigate the role of proteins in the formation of mineralized tissues during development, we have isolated a cDNA that encodes a protein that is a component of the organic matrix of the skeletal spicule of the sea urchin, Lytechinus pictus. The expression of the RNA encoding this protein is regulated over development and is localized to the descendents of the micromere lineage. Comparison of the sequence of this cDNA to homologous cDNAs from other species of urchin reveal that the protein is basic and contains three conserved structural motifs: a signal peptide, a proline-rich region, and an unusual region composed of a series of direct repeats. Studies on the protein encoded by this cDNA confirm the predicted reading frame deduced from the nucleotide sequence and show that the protein is secreted and not glycosylated. Comparison of the amino acid sequence to databases reveal that the repeat domain is similar to proteins that form a unique beta-spiral supersecondary structure.
Dali server update.

PubMed

Holm, Liisa; Laakso, Laura M

2016-07-08

The Dali server (http://ekhidna2.biocenter.helsinki.fi/dali) is a network service for comparing protein structures in 3D. In favourable cases, comparing 3D structures may reveal biologically interesting similarities that are not detectable by comparing sequences. The Dali server has been running in various places for over 20 years and is used routinely by crystallographers on newly solved structures. The latest update of the server provides enhanced analytics for the study of sequence and structure conservation. The server performs three types of structure comparisons: (i) Protein Data Bank (PDB) search compares one query structure against those in the PDB and returns a list of similar structures; (ii) pairwise comparison compares one query structure against a list of structures specified by the user; and (iii) all against all structure comparison returns a structural similarity matrix, a dendrogram and a multidimensional scaling projection of a set of structures specified by the user. Structural superimpositions are visualized using the Java-free WebGL viewer PV. The structural alignment view is enhanced by sequence similarity searches against Uniprot. The combined structure-sequence alignment information is compressed to a stack of aligned sequence logos. In the stack, each structure is structurally aligned to the query protein and represented by a sequence logo. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Sequencing of Seven Haloarchaeal Genomes Reveals Patterns of Genomic Flux

PubMed Central

Lynch, Erin A.; Langille, Morgan G. I.; Darling, Aaron; Wilbanks, Elizabeth G.; Haltiner, Caitlin; Shao, Katie S. Y.; Starr, Michael O.; Teiling, Clotilde; Harkins, Timothy T.; Edwards, Robert A.; Eisen, Jonathan A.; Facciotti, Marc T.

2012-01-01

We report the sequencing of seven genomes from two haloarchaeal genera, Haloferax and Haloarcula. Ease of cultivation and the existence of well-developed genetic and biochemical tools for several diverse haloarchaeal species make haloarchaea a model group for the study of archaeal biology. The unique physiological properties of these organisms also make them good candidates for novel enzyme discovery for biotechnological applications. Seven genomes were sequenced to ∼20×coverage and assembled to an average of 50 contigs (range 5 scaffolds - 168 contigs). Comparisons of protein-coding gene compliments revealed large-scale differences in COG functional group enrichment between these genera. Analysis of genes encoding machinery for DNA metabolism reveals genera-specific expansions of the general transcription factor TATA binding protein as well as a history of extensive duplication and horizontal transfer of the proliferating cell nuclear antigen. Insights gained from this study emphasize the importance of haloarchaea for investigation of archaeal biology. PMID:22848480
A ruby-colored Pseudobaeospora species is described as new from material collected on the island of Hawaii.

PubMed

Desjardin, Dennis E; Hemmes, Don E; Perry, Brian A

2014-01-01

Pseudobaeospora wipapatiae is described as new based on material collected in alien wet habitats on the island of Hawaii. Unique features of this beautiful species include deep ruby-colored basidiomes with two-spored basidia, amyloid cheilocystidia and a hymeniderm pileipellis with abundant pileocystidia that is initially deep ruby in KOH then changes to lilac gray. Phylogenetic analysis of nuclear large ribosomal subunit sequence data suggest a close relationship between Pseudobaeospora and Tricholoma. BLAST comparisons of internal transcribed spacer and 5.8S nuclear ribosomal subunit regions sequence data reveal greatest similarity with existing sequences of Pseudobaeospora species. A comprehensive description, color photograph, illustrations of salient micromorphological features and comparisons with phenetically similar taxa are provided. © 2014 by The Mycological Society of America.
Genomic sequencing of Pleistocene cave bears

DOE Office of Scientific and Technical Information (OSTI.GOV)

Noonan, James P.; Hofreiter, Michael; Smith, Doug

2005-04-01

Despite the information content of genomic DNA, ancient DNA studies to date have largely been limited to amplification of mitochondrial DNA due to technical hurdles such as contamination and degradation of ancient DNAs. In this study, we describe two metagenomic libraries constructed using unamplified DNA extracted from the bones of two 40,000-year-old extinct cave bears. Analysis of {approx}1 Mb of sequence from each library showed that, despite significant microbial contamination, 5.8 percent and 1.1 percent of clones in the libraries contain cave bear inserts, yielding 26,861 bp of cave bear genome sequence. Alignment of this sequence to the dog genome,more » the closest sequenced genome to cave bear in terms of evolutionary distance, revealed roughly the expected ratio of cave bear exons, repeats and conserved noncoding sequences. Only 0.04 percent of all clones sequenced were derived from contamination with modern human DNA. Comparison of cave bear with orthologous sequences from several modern bear species revealed the evolutionary relationship of these lineages. Using the metagenomic approach described here, we have recovered substantial quantities of mammalian genomic sequence more than twice as old as any previously reported, establishing the feasibility of ancient DNA genomic sequencing programs.« less

Identification of divergent protein domains by combining HMM-HMM comparisons and co-occurrence detection.

PubMed

Ghouila, Amel; Florent, Isabelle; Guerfali, Fatma Zahra; Terrapon, Nicolas; Laouini, Dhafer; Yahia, Sadok Ben; Gascuel, Olivier; Bréhélin, Laurent

2014-01-01

Identification of protein domains is a key step for understanding protein function. Hidden Markov Models (HMMs) have proved to be a powerful tool for this task. The Pfam database notably provides a large collection of HMMs which are widely used for the annotation of proteins in sequenced organisms. This is done via sequence/HMM comparisons. However, this approach may lack sensitivity when searching for domains in divergent species. Recently, methods for HMM/HMM comparisons have been proposed and proved to be more sensitive than sequence/HMM approaches in certain cases. However, these approaches are usually not used for protein domain discovery at a genome scale, and the benefit that could be expected from their utilization for this problem has not been investigated. Using proteins of P. falciparum and L. major as examples, we investigate the extent to which HMM/HMM comparisons can identify new domain occurrences not already identified by sequence/HMM approaches. We show that although HMM/HMM comparisons are much more sensitive than sequence/HMM comparisons, they are not sufficiently accurate to be used as a standalone complement of sequence/HMM approaches at the genome scale. Hence, we propose to use domain co-occurrence--the general domain tendency to preferentially appear along with some favorite domains in the proteins--to improve the accuracy of the approach. We show that the combination of HMM/HMM comparisons and co-occurrence domain detection boosts protein annotations. At an estimated False Discovery Rate of 5%, it revealed 901 and 1098 new domains in Plasmodium and Leishmania proteins, respectively. Manual inspection of part of these predictions shows that it contains several domain families that were missing in the two organisms. All new domain occurrences have been integrated in the EuPathDomains database, along with the GO annotations that can be deduced.
Identification of Divergent Protein Domains by Combining HMM-HMM Comparisons and Co-Occurrence Detection

PubMed Central

Ghouila, Amel; Florent, Isabelle; Guerfali, Fatma Zahra; Terrapon, Nicolas; Laouini, Dhafer; Yahia, Sadok Ben; Gascuel, Olivier; Bréhélin, Laurent

2014-01-01

Identification of protein domains is a key step for understanding protein function. Hidden Markov Models (HMMs) have proved to be a powerful tool for this task. The Pfam database notably provides a large collection of HMMs which are widely used for the annotation of proteins in sequenced organisms. This is done via sequence/HMM comparisons. However, this approach may lack sensitivity when searching for domains in divergent species. Recently, methods for HMM/HMM comparisons have been proposed and proved to be more sensitive than sequence/HMM approaches in certain cases. However, these approaches are usually not used for protein domain discovery at a genome scale, and the benefit that could be expected from their utilization for this problem has not been investigated. Using proteins of P. falciparum and L. major as examples, we investigate the extent to which HMM/HMM comparisons can identify new domain occurrences not already identified by sequence/HMM approaches. We show that although HMM/HMM comparisons are much more sensitive than sequence/HMM comparisons, they are not sufficiently accurate to be used as a standalone complement of sequence/HMM approaches at the genome scale. Hence, we propose to use domain co-occurrence — the general domain tendency to preferentially appear along with some favorite domains in the proteins — to improve the accuracy of the approach. We show that the combination of HMM/HMM comparisons and co-occurrence domain detection boosts protein annotations. At an estimated False Discovery Rate of 5%, it revealed 901 and 1098 new domains in Plasmodium and Leishmania proteins, respectively. Manual inspection of part of these predictions shows that it contains several domain families that were missing in the two organisms. All new domain occurrences have been integrated in the EuPathDomains database, along with the GO annotations that can be deduced. PMID:24901648
Construction of a high-density genetic map for grape using next generation restriction-site associated DNA sequencing

PubMed Central

2012-01-01

Background Genetic mapping and QTL detection are powerful methodologies in plant improvement and breeding. Construction of a high-density and high-quality genetic map would be of great benefit in the production of superior grapes to meet human demand. High throughput and low cost of the recently developed next generation sequencing (NGS) technology have resulted in its wide application in genome research. Sequencing restriction-site associated DNA (RAD) might be an efficient strategy to simplify genotyping. Combining NGS with RAD has proven to be powerful for single nucleotide polymorphism (SNP) marker development. Results An F1 population of 100 individual plants was developed. In-silico digestion-site prediction was used to select an appropriate restriction enzyme for construction of a RAD sequencing library. Next generation RAD sequencing was applied to genotype the F1 population and its parents. Applying a cluster strategy for SNP modulation, a total of 1,814 high-quality SNP markers were developed: 1,121 of these were mapped to the female genetic map, 759 to the male map, and 1,646 to the integrated map. A comparison of the genetic maps to the published Vitis vinifera genome revealed both conservation and variations. Conclusions The applicability of next generation RAD sequencing for genotyping a grape F1 population was demonstrated, leading to the successful development of a genetic map with high density and quality using our designed SNP markers. Detailed analysis revealed that this newly developed genetic map can be used for a variety of genome investigations, such as QTL detection, sequence assembly and genome comparison. PMID:22908993
Mitochondrial DNA Evidence Supports the Hypothesis that Triodontophorus Species Belong to Cyathostominae

PubMed Central

Gao, Yuan; Zhang, Yan; Yang, Xin; Qiu, Jian-Hua; Duan, Hong; Xu, Wen-Wen; Chang, Qiao-Cheng; Wang, Chun-Ren

2017-01-01

Equine strongyles, the significant nematode pathogens of horses, are characterized by high quantities and species abundance, but classification of this group of parasitic nematodes is debated. Mitochondrial (mt) genome DNA data are often used to address classification controversies. Thus, the objectives of this study were to determine the complete mt genomes of three Cyathostominae nematode species (Cyathostomum catinatum, Cylicostephanus minutus, and Poteriostomum imparidentatum) of horses and reconstruct the phylogenetic relationship of Strongylidae with other nematodes in Strongyloidea to test the hypothesis that Triodontophorus spp. belong to Cyathostominae using the mt genomes. The mt genomes of Cy. catinatum, Cs. minutus, and P. imparidentatum were 13,838, 13,826, and 13,817 bp in length, respectively. Complete mt nucleotide sequence comparison of all Strongylidae nematodes revealed that sequence identity ranged from 77.8 to 91.6%. The mt genome sequences of Triodontophorus species had relatively high identity with Cyathostominae nematodes, rather than Strongylus species of the same subfamily (Strongylinae). Comparative analyses of mt genome organization for Strongyloidea nematodes sequenced to date revealed that members of this superfamily possess identical gene arrangements. Phylogenetic analyses using mtDNA data indicated that the Triodontophorus species clustered with Cyathostominae species instead of Strongylus species. The present study first determined the complete mt genome sequences of Cy. catinatum, Cs. minutus, and P. imparidentatum, which will provide novel genetic markers for further studies of Strongylidae taxonomy, population genetics, and systematics. Importantly, sequence comparison and phylogenetic analyses based on mtDNA sequences supported the hypothesis that Triodontophorus belongs to Cyathostominae. PMID:28824575
High throughput sequencing analysis of RNA libraries reveals the influences of initial library and PCR methods on SELEX efficiency

PubMed Central

Takahashi, Mayumi; Wu, Xiwei; Ho, Michelle; Chomchan, Pritsana; Rossi, John J.; Burnett, John C.; Zhou, Jiehua

2016-01-01

The systemic evolution of ligands by exponential enrichment (SELEX) technique is a powerful and effective aptamer-selection procedure. However, modifications to the process can dramatically improve selection efficiency and aptamer performance. For example, droplet digital PCR (ddPCR) has been recently incorporated into SELEX selection protocols to putatively reduce the propagation of byproducts and avoid selection bias that result from differences in PCR efficiency of sequences within the random library. However, a detailed, parallel comparison of the efficacy of conventional solution PCR versus the ddPCR modification in the RNA aptamer-selection process is needed to understand effects on overall SELEX performance. In the present study, we took advantage of powerful high throughput sequencing technology and bioinformatics analysis coupled with SELEX (HT-SELEX) to thoroughly investigate the effects of initial library and PCR methods in the RNA aptamer identification. Our analysis revealed that distinct “biased sequences” and nucleotide composition existed in the initial, unselected libraries purchased from two different manufacturers and that the fate of the “biased sequences” was target-dependent during selection. Our comparison of solution PCR- and ddPCR-driven HT-SELEX demonstrated that PCR method affected not only the nucleotide composition of the enriched sequences, but also the overall SELEX efficiency and aptamer efficacy. PMID:27652575
Application of the major capsid protein as a marker of the phylogenetic diversity of Emiliania huxleyi viruses.

PubMed

Rowe, Janet M; Fabre, Marie-Françoise; Gobena, Daniel; Wilson, William H; Wilhelm, Steven W

2011-05-01

Studies of the Phycodnaviridae have traditionally relied on the DNA polymerase (pol) gene as a biomarker. However, recent investigations have suggested that the major capsid protein (MCP) gene may be a reliable phylogenetic biomarker. We used MCP gene amplicons gathered across the North Atlantic to assess the diversity of Emiliania huxleyi-infecting Phycodnaviridae. Nucleotide sequences were examined across >6000 km of open ocean, with comparisons between concentrates of the virus-size fraction of seawater and of lysates generated by exposing host strains to these same virus concentrates. Analyses revealed that many sequences were only sampled once, while several were over-represented. Analyses also revealed nucleotide sequences distinct from previous coastal isolates. Examination of lysed cultures revealed a new richness in phylogeny, as MCP sequences previously unrepresented within the existing collection of E. huxleyi viruses (EhV) were associated with viruses lysing cultures. Sequences were compared with previously described EhV MCP sequences from the North Sea and a Norwegian Fjord, as well as from the Gulf of Maine. Principal component analysis indicates that location-specific distinctions exist despite the presence of sequences common across these environments. Overall, this investigation provides new sequence data and an assessment on the use of the MCP gene. © 2011 Federation of European Microbiological Societies Published by Blackwell Publishing Ltd. All rights reserved.
Novel RAD sequence data reveal a lack of genomic divergence between dietary ecotypes in a landlocked salmonid population

USGS Publications Warehouse

Limborg, Morten T.; Larson, Wesley; Shedd, Kyle; Seeb, Lisa W.; Seeb, James E.

2017-01-01

Preservation of heritable ecological diversity within species and populations is a key challenge for managing natural resources and wild populations. Salmonid fish are iconic and socio-economically important species for commercial, aquaculture, and recreational fisheries across the globe. Many salmonids are known to exhibit ecological divergence within species, including distinct feeding ecotypes within the same lakes. Here we used 5559 SNPs, derived from RAD sequencing, to perform population genetic comparisons between two dietary ecotypes of sockeye salmon (Oncorhynchus nerka) in Jo-Jo Lake, Alaska (USA). We tested the standing hypothesis that these two ecotypes are currently diverging as a result of adaptation to distinct dietary niches; results support earlier conclusions of a single panmictic population. The RAD sequence data revealed 40 new SNPs not previously detected in the species, and our sequence data can be used in future studies of ecotypic diversity in salmonid species.
Construction of Pseudomolecule Sequences of the aus Rice Cultivar Kasalath for Comparative Genomics of Asian Cultivated Rice

PubMed Central

Sakai, Hiroaki; Kanamori, Hiroyuki; Arai-Kichise, Yuko; Shibata-Hatta, Mari; Ebana, Kaworu; Oono, Youko; Kurita, Kanako; Fujisawa, Hiroko; Katagiri, Satoshi; Mukai, Yoshiyuki; Hamada, Masao; Itoh, Takeshi; Matsumoto, Takashi; Katayose, Yuichi; Wakasa, Kyo; Yano, Masahiro; Wu, Jianzhong

2014-01-01

Having a deep genetic structure evolved during its domestication and adaptation, the Asian cultivated rice (Oryza sativa) displays considerable physiological and morphological variations. Here, we describe deep whole-genome sequencing of the aus rice cultivar Kasalath by using the advanced next-generation sequencing (NGS) technologies to gain a better understanding of the sequence and structural changes among highly differentiated cultivars. The de novo assembled Kasalath sequences represented 91.1% (330.55 Mb) of the genome and contained 35 139 expressed loci annotated by RNA-Seq analysis. We detected 2 787 250 single-nucleotide polymorphisms (SNPs) and 7393 large insertion/deletion (indel) sites (>100 bp) between Kasalath and Nipponbare, and 2 216 251 SNPs and 3780 large indels between Kasalath and 93-11. Extensive comparison of the gene contents among these cultivars revealed similar rates of gene gain and loss. We detected at least 7.39 Mb of inserted sequences and 40.75 Mb of unmapped sequences in the Kasalath genome in comparison with the Nipponbare reference genome. Mapping of the publicly available NGS short reads from 50 rice accessions proved the necessity and the value of using the Kasalath whole-genome sequence as an additional reference to capture the sequence polymorphisms that cannot be discovered by using the Nipponbare sequence alone. PMID:24578372
Conservation and diversification of Msx protein in metazoan evolution.

PubMed

Takahashi, Hirokazu; Kamiya, Akiko; Ishiguro, Akira; Suzuki, Atsushi C; Saitou, Naruya; Toyoda, Atsushi; Aruga, Jun

2008-01-01

Msx (/msh) family genes encode homeodomain (HD) proteins that control ontogeny in many animal species. We compared the structures of Msx genes from a wide range of Metazoa (Porifera, Cnidaria, Nematoda, Arthropoda, Tardigrada, Platyhelminthes, Mollusca, Brachiopoda, Annelida, Echiura, Echinodermata, Hemichordata, and Chordata) to gain an understanding of the role of these genes in phylogeny. Exon-intron boundary analysis suggested that the position of the intron located N-terminally to the HDs was widely conserved in all the genes examined, including those of cnidarians. Amino acid (aa) sequence comparison revealed 3 new evolutionarily conserved domains, as well as very strong conservation of the HDs. Two of the three domains were associated with Groucho-like protein binding in both a vertebrate and a cnidarian Msx homolog, suggesting that the interaction between Groucho-like proteins and Msx proteins was established in eumetazoan ancestors. Pairwise comparison among the collected HDs and their C-flanking aa sequences revealed that the degree of sequence conservation varied depending on the animal taxa from which the sequences were derived. Highly conserved Msx genes were identified in the Vertebrata, Cephalochordata, Hemichordata, Echinodermata, Mollusca, Brachiopoda, and Anthozoa. The wide distribution of the conserved sequences in the animal phylogenetic tree suggested that metazoan ancestors had already acquired a set of conserved domains of the current Msx family genes. Interestingly, although strongly conserved sequences were recovered from the Vertebrata, Cephalochordata, and Anthozoa, the sequences from the Urochordata and Hydrozoa showed weak conservation. Because the Vertebrata-Cephalochordata-Urochordata and Anthozoa-Hydrozoa represent sister groups in the Chordata and Cnidaria, respectively, Msx sequence diversification may have occurred differentially in the course of evolution. We speculate that selective loss of the conserved domains in Msx family proteins contributed to the diversification of animal body organization.
Transcriptome survey of the anhydrobiotic tardigrade Milnesium tardigradum in comparison with Hypsibius dujardini and Richtersius coronifer

PubMed Central

2010-01-01

Background The phenomenon of desiccation tolerance, also called anhydrobiosis, involves the ability of an organism to survive the loss of almost all cellular water without sustaining irreversible damage. Although there are several physiological, morphological and ecological studies on tardigrades, only limited DNA sequence information is available. Therefore, we explored the transcriptome in the active and anhydrobiotic state of the tardigrade Milnesium tardigradum which has extraordinary tolerance to desiccation and freezing. In this study, we present the first overview of the transcriptome of M. tardigradum and its response to desiccation and discuss potential parallels to stress responses in other organisms. Results We sequenced a total of 9984 expressed sequence tags (ESTs) from two cDNA libraries from the eutardigrade M. tardigradum in its active and inactive, anhydrobiotic (tun) stage. Assembly of these ESTs resulted in 3283 putative unique transcripts, whereof ~50% showed significant sequence similarity to known genes. The resulting unigenes were functionally annotated using the Gene Ontology (GO) vocabulary. A GO term enrichment analysis revealed several GOs that were significantly underrepresented in the inactive stage. Furthermore we compared the putative unigenes of M. tardigradum with ESTs from two other eutardigrade species that are available from public sequence databases, namely Richtersius coronifer and Hypsibius dujardini. The processed sequences of the three tardigrade species revealed similar functional content and the M. tardigradum dataset contained additional sequences from tardigrades not present in the other two. Conclusions This study describes novel sequence data from the tardigrade M. tardigradum, which significantly contributes to the available tardigrade sequence data and will help to establish this extraordinary tardigrade as a model for studying anhydrobiosis. Functional comparison of active and anhydrobiotic tardigrades revealed a differential distribution of Gene Ontology terms associated with chromatin structure and the translation machinery, which are underrepresented in the inactive animals. These findings imply a widespread metabolic response of the animals on dehydration. The collective tardigrade transcriptome data will serve as a reference for further studies and support the identification and characterization of genes involved in the anhydrobiotic response. PMID:20226016
Transcriptome survey of the anhydrobiotic tardigrade Milnesium tardigradum in comparison with Hypsibius dujardini and Richtersius coronifer.

PubMed

Mali, Brahim; Grohme, Markus A; Förster, Frank; Dandekar, Thomas; Schnölzer, Martina; Reuter, Dirk; Wełnicz, Weronika; Schill, Ralph O; Frohme, Marcus

2010-03-12

The phenomenon of desiccation tolerance, also called anhydrobiosis, involves the ability of an organism to survive the loss of almost all cellular water without sustaining irreversible damage. Although there are several physiological, morphological and ecological studies on tardigrades, only limited DNA sequence information is available. Therefore, we explored the transcriptome in the active and anhydrobiotic state of the tardigrade Milnesium tardigradum which has extraordinary tolerance to desiccation and freezing. In this study, we present the first overview of the transcriptome of M. tardigradum and its response to desiccation and discuss potential parallels to stress responses in other organisms. We sequenced a total of 9984 expressed sequence tags (ESTs) from two cDNA libraries from the eutardigrade M. tardigradum in its active and inactive, anhydrobiotic (tun) stage. Assembly of these ESTs resulted in 3283 putative unique transcripts, whereof approximately 50% showed significant sequence similarity to known genes. The resulting unigenes were functionally annotated using the Gene Ontology (GO) vocabulary. A GO term enrichment analysis revealed several GOs that were significantly underrepresented in the inactive stage. Furthermore we compared the putative unigenes of M. tardigradum with ESTs from two other eutardigrade species that are available from public sequence databases, namely Richtersius coronifer and Hypsibius dujardini. The processed sequences of the three tardigrade species revealed similar functional content and the M. tardigradum dataset contained additional sequences from tardigrades not present in the other two. This study describes novel sequence data from the tardigrade M. tardigradum, which significantly contributes to the available tardigrade sequence data and will help to establish this extraordinary tardigrade as a model for studying anhydrobiosis. Functional comparison of active and anhydrobiotic tardigrades revealed a differential distribution of Gene Ontology terms associated with chromatin structure and the translation machinery, which are underrepresented in the inactive animals. These findings imply a widespread metabolic response of the animals on dehydration. The collective tardigrade transcriptome data will serve as a reference for further studies and support the identification and characterization of genes involved in the anhydrobiotic response.
Genome-wide comparison of medieval and modern Mycobacterium leprae.

PubMed

Schuenemann, Verena J; Singh, Pushpendra; Mendum, Thomas A; Krause-Kyora, Ben; Jäger, Günter; Bos, Kirsten I; Herbig, Alexander; Economou, Christos; Benjak, Andrej; Busso, Philippe; Nebel, Almut; Boldsen, Jesper L; Kjellström, Anna; Wu, Huihai; Stewart, Graham R; Taylor, G Michael; Bauer, Peter; Lee, Oona Y-C; Wu, Houdini H T; Minnikin, David E; Besra, Gurdyal S; Tucker, Katie; Roffey, Simon; Sow, Samba O; Cole, Stewart T; Nieselt, Kay; Krause, Johannes

2013-07-12

Leprosy was endemic in Europe until the Middle Ages. Using DNA array capture, we have obtained genome sequences of Mycobacterium leprae from skeletons of five medieval leprosy cases from the United Kingdom, Sweden, and Denmark. In one case, the DNA was so well preserved that full de novo assembly of the ancient bacterial genome could be achieved through shotgun sequencing alone. The ancient M. leprae sequences were compared with those of 11 modern strains, representing diverse genotypes and geographic origins. The comparisons revealed remarkable genomic conservation during the past 1000 years, a European origin for leprosy in the Americas, and the presence of an M. leprae genotype in medieval Europe now commonly associated with the Middle East. The exceptional preservation of M. leprae biomarkers, both DNA and mycolic acids, in ancient skeletons has major implications for palaeomicrobiology and human pathogen evolution.
Implementation of Objective PASC-Derived Taxon Demarcation Criteria for Official Classification of Filoviruses.

PubMed

Bào, Yīmíng; Amarasinghe, Gaya K; Basler, Christopher F; Bavari, Sina; Bukreyev, Alexander; Chandran, Kartik; Dolnik, Olga; Dye, John M; Ebihara, Hideki; Formenty, Pierre; Hewson, Roger; Kobinger, Gary P; Leroy, Eric M; Mühlberger, Elke; Netesov, Sergey V; Patterson, Jean L; Paweska, Janusz T; Smither, Sophie J; Takada, Ayato; Towner, Jonathan S; Volchkov, Viktor E; Wahl-Jensen, Victoria; Kuhn, Jens H

2017-05-11

The mononegaviral family Filoviridae has eight members assigned to three genera and seven species. Until now, genus and species demarcation were based on arbitrarily chosen filovirus genome sequence divergence values (≈50% for genera, ≈30% for species) and arbitrarily chosen phenotypic virus or virion characteristics. Here we report filovirus genome sequence-based taxon demarcation criteria using the publicly accessible PAirwise Sequencing Comparison (PASC) tool of the US National Center for Biotechnology Information (Bethesda, MD, USA). Comparison of all available filovirus genomes in GenBank using PASC revealed optimal genus demarcation at the 55-58% sequence diversity threshold range for genera and at the 23-36% sequence diversity threshold range for species. Because these thresholds do not change the current official filovirus classification, these values are now implemented as filovirus taxon demarcation criteria that may solely be used for filovirus classification in case additional data are absent. A near-complete, coding-complete, or complete filovirus genome sequence will now be required to allow official classification of any novel "filovirus." Classification of filoviruses into existing taxa or determining the need for novel taxa is now straightforward and could even become automated using a presented algorithm/flowchart rooted in RefSeq (type) sequences.
Sequence Composition and Gene Content of the Short Arm of Rye (Secale cereale) Chromosome 1

PubMed Central

Fluch, Silvia; Kopecky, Dieter; Burg, Kornel; Šimková, Hana; Taudien, Stefan; Petzold, Andreas; Kubaláková, Marie; Platzer, Matthias; Berenyi, Maria; Krainer, Siegfried; Doležel, Jaroslav; Lelley, Tamas

2012-01-01

Background The purpose of the study is to elucidate the sequence composition of the short arm of rye chromosome 1 (Secale cereale) with special focus on its gene content, because this portion of the rye genome is an integrated part of several hundreds of bread wheat varieties worldwide. Methodology/Principal Findings Multiple Displacement Amplification of 1RS DNA, obtained from flow sorted 1RS chromosomes, using 1RS ditelosomic wheat-rye addition line, and subsequent Roche 454FLX sequencing of this DNA yielded 195,313,589 bp sequence information. This quantity of sequence information resulted in 0.43× sequence coverage of the 1RS chromosome arm, permitting the identification of genes with estimated probability of 95%. A detailed analysis revealed that more than 5% of the 1RS sequence consisted of gene space, identifying at least 3,121 gene loci representing 1,882 different gene functions. Repetitive elements comprised about 72% of the 1RS sequence, Gypsy/Sabrina (13.3%) being the most abundant. More than four thousand simple sequence repeat (SSR) sites mostly located in gene related sequence reads were identified for possible marker development. The existence of chloroplast insertions in 1RS has been verified by identifying chimeric chloroplast-genomic sequence reads. Synteny analysis of 1RS to the full genomes of Oryza sativa and Brachypodium distachyon revealed that about half of the genes of 1RS correspond to the distal end of the short arm of rice chromosome 5 and the proximal region of the long arm of Brachypodium distachyon chromosome 2. Comparison of the gene content of 1RS to 1HS barley chromosome arm revealed high conservation of genes related to chromosome 5 of rice. Conclusions The present study revealed the gene content and potential gene functions on this chromosome arm and demonstrated numerous sequence elements like SSRs and gene-related sequences, which can be utilised for future research as well as in breeding of wheat and rye. PMID:22328922
Basis of altered RNA-binding specificity by PUF proteins revealed by crystal structures of yeast Puf4p

DOE Office of Scientific and Technical Information (OSTI.GOV)

Miller, Matthew T.; Higgin, Joshua J.; Hall, Traci M.Tanaka

2008-06-06

Pumilio/FBF (PUF) family proteins are found in eukaryotic organisms and regulate gene expression post-transcriptionally by binding to sequences in the 3' untranslated region of target transcripts. PUF proteins contain an RNA binding domain that typically comprises eight {alpha}-helical repeats, each of which recognizes one RNA base. Some PUF proteins, including yeast Puf4p, have altered RNA binding specificity and use their eight repeats to bind to RNA sequences with nine or ten bases. Here we report the crystal structures of Puf4p alone and in complex with a 9-nucleotide (nt) target RNA sequence, revealing that Puf4p accommodates an 'extra' nucleotide by modestmore » adaptations allowing one base to be turned away from the RNA binding surface. Using structural information and sequence comparisons, we created a mutant Puf4p protein that preferentially binds to an 8-nt target RNA sequence over a 9-nt sequence and restores binding of each protein repeat to one RNA base.« less
Evolutionary growth process of highly conserved sequences in vertebrate genomes.

PubMed

Ishibashi, Minaka; Noda, Akiko Ogura; Sakate, Ryuichi; Imanishi, Tadashi

2012-08-01

Genome sequence comparison between evolutionarily distant species revealed ultraconserved elements (UCEs) among mammals under strong purifying selection. Most of them were also conserved among vertebrates. Because they tend to be located in the flanking regions of developmental genes, they would have fundamental roles in creating vertebrate body plans. However, the evolutionary origin and selection mechanism of these UCEs remain unclear. Here we report that UCEs arose in primitive vertebrates, and gradually grew in vertebrate evolution. We searched for UCEs in two teleost fishes, Tetraodon nigroviridis and Oryzias latipes, and found 554 UCEs with 100% identity over 100 bps. Comparison of teleost and mammalian UCEs revealed 43 pairs of common, jawed-vertebrate UCEs (jUCE) with high sequence identities, ranging from 83.1% to 99.2%. Ten of them retain lower similarities to the Petromyzon marinus genome, and the substitution rates of four non-exonic jUCEs were reduced after the teleost-mammal divergence, suggesting that robust conservation had been acquired in the jawed vertebrate lineage. Our results indicate that prototypical UCEs originated before the divergence of jawed and jawless vertebrates and have been frozen as perfect conserved sequences in the jawed vertebrate lineage. In addition, our comparative sequence analyses of UCEs and neighboring regions resulted in a discovery of lineage-specific conserved sequences. They were added progressively to prototypical UCEs, suggesting step-wise acquisition of novel regulatory roles. Our results indicate that conserved non-coding elements (CNEs) consist of blocks with distinct evolutionary history, each having been frozen since different evolutionary era along the vertebrate lineage. Copyright © 2012 Elsevier B.V. All rights reserved.
Mhc class II B gene evolution in East African cichlid fishes.

PubMed

Figueroa, F; Mayer, W E; Sültmann, H; O'hUigin, C; Tichy, H; Satta, Y; Takezaki, N; Takahata, N; Klein, J

2000-06-01

A distinctive feature of essential major histocompatibility complex (Mhc) loci is their polymorphism characterized by large genetic distances between alleles and long persistence times of allelic lineages. Since the lineages often span several successive speciations, we investigated the behavior of the Mhc alleles during or close to the speciation phase. We sequenced exon 2 of the class II B locus 4 from 232 East African cichlid fishes representing 32 related species. The divergence times of the (sub)species ranged from 6,000 to 8.4 million years. Two types of evolutionary analysis were used to elucidate the pattern of exon 2 sequence divergence. First, phylogenetic methods were applied to reconstruct the most likely evolutionary pathways leading from the last common ancestor of the set to the extant sequences, and to assess the probable mechanisms involved in allelic diversification. Second, pairwise comparisons of sequences were carried out to detect differences seemingly incompatible with origin by nonparallel point mutations. The analysis revealed point mutations to be the most important mechanism behind allelic divergences, with recombination playing only an auxiliary part. Comparison of sequences from related species revealed evidence of random allelic (lineage) losses apparently associated with speciation. Sharing of identical alleles could be demonstrated between species that diverged 2 million years ago. The phylogeny of the exon was incongruent with that of the flanking introns, indicating either a high degree of convergent evolution at the peptide-binding region-encoding sites, or intron homogenization.
DMRT gene cluster analysis in the platypus: new insights into genomic organization and regulatory regions.

PubMed

El-Mogharbel, Nisrine; Wakefield, Matthew; Deakin, Janine E; Tsend-Ayush, Enkhjargal; Grützner, Frank; Alsop, Amber; Ezaz, Tariq; Marshall Graves, Jennifer A

2007-01-01

We isolated and characterized a cluster of platypus DMRT genes and compared their arrangement, location, and sequence across vertebrates. The DMRT gene cluster on human 9p24.3 harbors, in order, DMRT1, DMRT3, and DMRT2, which share a DM domain. DMRT1 is highly conserved and involved in sexual development in vertebrates, and deletions in this region cause sex reversal in humans. Sequence comparisons of DMRT genes between species have been valuable in identifying exons, control regions, and conserved nongenic regions (CNGs). The addition of platypus sequences is expected to be particularly valuable, since monotremes fill a gap in the vertebrate genome coverage. We therefore isolated and fully sequenced platypus BAC clones containing DMRT3 and DMRT2 as well as DMRT1 and then generated multispecies alignments and ran prediction programs followed by experimental verification to annotate this gene cluster. We found that the three genes have 58-66% identity to their human orthologues, lie in the same order as in other vertebrates, and colocate on 1 of the 10 platypus sex chromosomes, X5. We also predict that optimal annotation of the newly sequenced platypus genome will be challenging. The analysis of platypus sequence revealed differences in structure and sequence of the DMRT gene cluster. Multispecies comparison was particularly effective for detecting CNGs, revealing several novel potential regulatory regions within DMRT3 and DMRT2 as well as DMRT1. RT-PCR indicated that platypus DMRT1 and DMRT3 are expressed specifically in the adult testis (and not ovary), but DMRT2 has a wider expression profile, as it does for other mammals. The platypus DMRT1 expression pattern, and its location on an X chromosome, suggests an involvement in monotreme sexual development.
Primary structures of ribosomal proteins from the archaebacterium Halobacterium marismortui and the eubacterium Bacillus stearothermophilus.

PubMed

Arndt, E; Scholzen, T; Krömer, W; Hatakeyama, T; Kimura, M

1991-06-01

Approximately 40 ribosomal proteins from each Halobacterium marismortui and Bacillus stearothermophilus have been sequenced either by direct protein sequence analysis or by DNA sequence analysis of the appropriate genes. The comparison of the amino acid sequences from the archaebacterium H marismortui with the available ribosomal proteins from the eubacterial and eukaryotic kingdoms revealed four different groups of proteins: 24 proteins are related to both eubacterial as well as eukaryotic proteins. Eleven proteins are exclusively related to eukaryotic counterparts. For three proteins only eubacterial relatives-and for another three proteins no counterpart-could be found. The similarities of the halobacterial ribosomal proteins are in general somewhat higher to their eukaryotic than to their eubacterial counterparts. The comparison of B stearothermophilus proteins with their E coli homologues showed that the proteins evolved at different rates. Some proteins are highly conserved with 64-76% identity, others are poorly conserved with only 25-34% identical amino acid residues.
An Ambystoma mexicanum EST sequencing project: analysis of 17,352 expressed sequence tags from embryonic and regenerating blastema cDNA libraries

PubMed Central

Habermann, Bianca; Bebin, Anne-Gaelle; Herklotz, Stephan; Volkmer, Michael; Eckelt, Kay; Pehlke, Kerstin; Epperlein, Hans Henning; Schackert, Hans Konrad; Wiebe, Glenis; Tanaka, Elly M

2004-01-01

Background The ambystomatid salamander, Ambystoma mexicanum (axolotl), is an important model organism in evolutionary and regeneration research but relatively little sequence information has so far been available. This is a major limitation for molecular studies on caudate development, regeneration and evolution. To address this lack of sequence information we have generated an expressed sequence tag (EST) database for A. mexicanum. Results Two cDNA libraries, one made from stage 18-22 embryos and the other from day-6 regenerating tail blastemas, generated 17,352 sequences. From the sequenced ESTs, 6,377 contigs were assembled that probably represent 25% of the expressed genes in this organism. Sequence comparison revealed significant homology to entries in the NCBI non-redundant database. Further examination of this gene set revealed the presence of genes involved in important cell and developmental processes, including cell proliferation, cell differentiation and cell-cell communication. On the basis of these data, we have performed phylogenetic analysis of key cell-cycle regulators. Interestingly, while cell-cycle proteins such as the cyclin B family display expected evolutionary relationships, the cyclin-dependent kinase inhibitor 1 gene family shows an unusual evolutionary behavior among the amphibians. Conclusions Our analysis reveals the importance of a comprehensive sequence set from a representative of the Caudata and illustrates that the EST sequence database is a rich source of molecular, developmental and regeneration studies. To aid in data mining, the ESTs have been organized into an easily searchable database that is freely available online. PMID:15345051

Comparative analysis of the complete genome of an epidemic hospital sequence type 203 clone of vancomycin-resistant Enterococcus faecium

PubMed Central

2013-01-01

Background In this report we have explored the genomic and microbiological basis for a sustained increase in bloodstream infections at a major Australian hospital caused by Enterococcus faecium multi-locus sequence type (ST) 203, an outbreak strain that has largely replaced a predecessor ST17 sequence type. Results To establish a ST203 reference sequence we fully assembled and annotated the genome of Aus0085, a 2009 vancomycin-resistant Enterococcus faecium (VREfm) bloodstream isolate, and the first example of a completed ST203 genome. Aus0085 has a 3.2 Mb genome, comprising a 2.9 Mb circular chromosome and six circular plasmids (2 kb–130 kb). Twelve percent of the 3222 coding sequences (CDS) in Aus0085 are not present in ST17 E. faecium Aus0004 and ST18 E. faecium TX16. Extending this comparison to an additional 12 ST17 and 14 ST203 E. faecium hospital isolate genomes revealed only six genomic regions spanning 41 kb that were present in all ST203 and absent from all ST17 genomes. The 40 CDS have predicted functions that include ion transport, riboflavin metabolism and two phosphotransferase systems. Comparison of the vancomycin resistance-conferring Tn1549 transposon between Aus0004 and Aus0085 revealed differences in transposon length and insertion site, and van locus sequence variation that correlated with a higher vancomycin MIC in Aus0085. Additional phenotype comparisons between ST17 and ST203 isolates showed that while there were no differences in biofilm-formation and killing of Galleria mellonella, ST203 isolates grew significantly faster and out-competed ST17 isolates in growth assays. Conclusions Here we have fully assembled and annotated the first ST203 genome, and then characterized the genomic differences between ST17 and ST203 E. faecium. We also show that ST203 E. faecium are faster growing and can out-compete ST17 E. faecium. While a causal genetic basis for these phenotype differences is not provided here, this study revealed conserved genetic differences between the two clones, differences that can now be tested to explain the molecular basis for the success and emergence of ST203 E. faecium. PMID:24004955
Microbial Communities on Seafloor Basalts at Dorado Outcrop Reflect Level of Alteration and Highlight Global Lithic Clades

PubMed Central

Lee, Michael D.; Walworth, Nathan G.; Sylvan, Jason B.; Edwards, Katrina J.; Orcutt, Beth N.

2015-01-01

Areas of exposed basalt along mid-ocean ridges and at seafloor outcrops serve as conduits of fluid flux into and out of a subsurface ocean, and microbe–mineral interactions can influence alteration reactions at the rock–water interface. Located on the eastern flank of the East Pacific Rise, Dorado Outcrop is a site of low-temperature (<20°C) hydrothermal venting and represents a new end-member in the current survey of seafloor basalt biomes. Consistent with prior studies, a survey of 16S rRNA gene sequence diversity using universal primers targeting the V4 hypervariable region revealed much greater richness and diversity on the seafloor rocks than in surrounding seawater. Overall, Gamma-, Alpha-, and Deltaproteobacteria, and Thaumarchaeota dominated the sequenced communities, together making up over half of the observed diversity, though bacterial sequences were more abundant than archaeal in all samples. The most abundant bacterial reads were closely related to the obligate chemolithoautotrophic, sulfur-oxidizing Thioprofundum lithotrophicum, suggesting carbon and sulfur cycling as dominant metabolic pathways in this system. Representatives of Thaumarchaeota were detected in relatively high abundance on the basalts in comparison to bottom water, possibly indicating ammonia oxidation. In comparison to other sequence datasets from globally distributed seafloor basalts, this study reveals many overlapping and cosmopolitan phylogenetic groups and also suggests that substrate age correlates with community structure. PMID:26779122
Whole-Genome Sequencing of Sake Yeast Saccharomyces cerevisiae Kyokai no. 7

PubMed Central

Akao, Takeshi; Yashiro, Isao; Hosoyama, Akira; Kitagaki, Hiroshi; Horikawa, Hiroshi; Watanabe, Daisuke; Akada, Rinji; Ando, Yoshinori; Harashima, Satoshi; Inoue, Toyohisa; Inoue, Yoshiharu; Kajiwara, Susumu; Kitamoto, Katsuhiko; Kitamoto, Noriyuki; Kobayashi, Osamu; Kuhara, Satoru; Masubuchi, Takashi; Mizoguchi, Haruhiko; Nakao, Yoshihiro; Nakazato, Atsumi; Namise, Masahiro; Oba, Takahiro; Ogata, Tomoo; Ohta, Akinori; Sato, Masahide; Shibasaki, Seiji; Takatsume, Yoshifumi; Tanimoto, Shota; Tsuboi, Hirokazu; Nishimura, Akira; Yoda, Koji; Ishikawa, Takeaki; Iwashita, Kazuhiro; Fujita, Nobuyuki; Shimoi, Hitoshi

2011-01-01

The term ‘sake yeast’ is generally used to indicate the Saccharomyces cerevisiae strains that possess characteristics distinct from others including the laboratory strain S288C and are well suited for sake brewery. Here, we report the draft whole-genome shotgun sequence of a commonly used diploid sake yeast strain, Kyokai no. 7 (K7). The assembled sequence of K7 was nearly identical to that of the S288C, except for several subtelomeric polymorphisms and two large inversions in K7. A survey of heterozygous bases between the homologous chromosomes revealed the presence of mosaic-like uneven distribution of heterozygosity in K7. The distribution patterns appeared to have resulted from repeated losses of heterozygosity in the ancestral lineage of K7. Analysis of genes revealed the presence of both K7-acquired and K7-lost genes, in addition to numerous others with segmentations and terminal discrepancies in comparison with those of S288C. The distribution of Ty element also largely differed in the two strains. Interestingly, two regions in chromosomes I and VII of S288C have apparently been replaced by Ty elements in K7. Sequence comparisons suggest that these gene conversions were caused by cDNA-mediated recombination of Ty elements. The present study advances our understanding of the functional and evolutionary genomics of the sake yeast. PMID:21900213
Rosetta stone method for detecting protein function and protein-protein interactions from genome sequences

DOEpatents

Eisenberg, David; Marcotte, Edward M.; Pellegrini, Matteo; Thompson, Michael J.; Yeates, Todd O.

2002-10-15

A computational method system, and computer program are provided for inferring functional links from genome sequences. One method is based on the observation that some pairs of proteins A' and B' have homologs in another organism fused into a single protein chain AB. A trans-genome comparison of sequences can reveal these AB sequences, which are Rosetta Stone sequences because they decipher an interaction between A' and B. Another method compares the genomic sequence of two or more organisms to create a phylogenetic profile for each protein indicating its presence or absence across all the genomes. The profile provides information regarding functional links between different families of proteins. In yet another method a combination of the above two methods is used to predict functional links.
Determining protein function and interaction from genome analysis

DOEpatents

Eisenberg, David; Marcotte, Edward M.; Thompson, Michael J.; Pellegrini, Matteo; Yeates, Todd O.

2004-08-03

A computational method system, and computer program are provided for inferring functional links from genome sequences. One method is based on the observation that some pairs of proteins A' and B' have homologs in another organism fused into a single protein chain AB. A trans-genome comparison of sequences can reveal these AB sequences, which are Rosetta Stone sequences because they decipher an interaction between A' and B. Another method compares the genomic sequence of two or more organisms to create a phylogenetic profile for each protein indicating its presence or absence across all the genomes. The profile provides information regarding functional links between different families of proteins. In yet another method a combination of the above two methods is used to predict functional links.
Assigning protein functions by comparative genome analysis protein phylogenetic profiles

DOEpatents

Pellegrini, Matteo; Marcotte, Edward M.; Thompson, Michael J.; Eisenberg, David; Grothe, Robert; Yeates, Todd O.

2003-05-13

A computational method system, and computer program are provided for inferring functional links from genome sequences. One method is based on the observation that some pairs of proteins A' and B' have homologs in another organism fused into a single protein chain AB. A trans-genome comparison of sequences can reveal these AB sequences, which are Rosetta Stone sequences because they decipher an interaction between A' and B. Another method compares the genomic sequence of two or more organisms to create a phylogenetic profile for each protein indicating its presence or absence across all the genomes. The profile provides information regarding functional links between different families of proteins. In yet another method a combination of the above two methods is used to predict functional links.
The complete genomic sequence of a tentative new polerovirus identified in barley in South Korea.

PubMed

Zhao, Fumei; Lim, Seungmo; Yoo, Ran Hee; Igori, Davaajargal; Kim, Sang-Min; Kwak, Do Yeon; Kim, Sun Lim; Lee, Bong Choon; Moon, Jae Sun

2016-07-01

The complete nucleotide sequence of a new barley polerovirus, tentatively named barley virus G (BVG), which was isolated in Gimje, South Korea, has been determined using an RNA sequencing technique combined with polymerase chain reaction methods. The viral genomic RNA of BVG is 5,620 nucleotides long and contains six typical open reading frames commonly observed in other poleroviruses. Sequence comparisons revealed that BVG is most closely related to maize yellow dwarf virus-RMV, with the highest amino acid identities being less than 90 % for all of the corresponding proteins. These results suggested that BVG is a member of a new species in the genus Polerovirus.
Evidence for Elizabethkingia anophelis transmission from mother to infant, Hong Kong.

PubMed

Lau, Susanna K P; Wu, Alan K L; Teng, Jade L L; Tse, Herman; Curreem, Shirly O T; Tsui, Stephen K W; Huang, Yi; Chen, Jonathan H K; Lee, Rodney A; Yuen, Kwok-Yung; Woo, Patrick C Y

2015-02-01

Elizabethkingia anophelis, recently discovered from mosquito gut, is an emerging bacterium associated with neonatal meningitis and nosocomial outbreaks. However, its transmission route remains unknown. We use rapid genome sequencing to investigate 3 cases of E. anophelis sepsis involving 2 neonates who had meningitis and 1 neonate's mother who had chorioamnionitis. Comparative genomics revealed evidence for perinatal vertical transmission from a mother to her neonate; the 2 isolates from these patients, HKU37 and HKU38, shared essentially identical genome sequences. In contrast, the strain from another neonate (HKU36) was genetically divergent, showing only 78.6% genome sequence identity to HKU37 and HKU38, thus excluding a clonal outbreak. Comparison to genomes from mosquito strains revealed potential metabolic adaptations in E. anophelis under different environments. Maternal infection, not mosquitoes, is most likely the source of neonatal E. anophelis infections. Our findings highlight the power of genome sequencing in gaining rapid insights on transmission and pathogenesis of emerging pathogens.
Plastome Sequence Determination and Comparative Analysis for Members of the Lolium-Festuca Grass Species Complex

PubMed Central

Hand, Melanie L.; Spangenberg, German C.; Forster, John W.; Cogan, Noel O. I.

2013-01-01

Chloroplast genome sequences are of broad significance in plant biology, due to frequent use in molecular phylogenetics, comparative genomics, population genetics, and genetic modification studies. The present study used a second-generation sequencing approach to determine and assemble the plastid genomes (plastomes) of four representatives from the agriculturally important Lolium-Festuca species complex of pasture grasses (Lolium multiflorum, Festuca pratensis, Festuca altissima, and Festuca ovina). Total cellular DNA was extracted from either roots or leaves, was sequenced, and the output was filtered for plastome-related reads. A comparison between sources revealed fewer plastome-related reads from root-derived template but an increase in incidental bacterium-derived sequences. Plastome assembly and annotation indicated high levels of sequence identity and a conserved organization and gene content between species. However, frequent deletions within the F. ovina plastome appeared to contribute to a smaller plastid genome size. Comparative analysis with complete plastome sequences from other members of the Poaceae confirmed conservation of most grass-specific features. Detailed analysis of the rbcL–psaI intergenic region, however, revealed a “hot-spot” of variation characterized by independent deletion events. The evolutionary implications of this observation are discussed. The complete plastome sequences are anticipated to provide the basis for potential organelle-specific genetic modification of pasture grasses. PMID:23550121
Molecular phenotype of zebrafish ovarian follicle by serial analysis of gene expression and proteomic profiling, and comparison with the transcriptomes of other animals

PubMed Central

Knoll-Gellida, Anja; André, Michèle; Gattegno, Tamar; Forgue, Jean; Admon, Arie; Babin, Patrick J

2006-01-01

Background The ability of an oocyte to develop into a viable embryo depends on the accumulation of specific maternal information and molecules, such as RNAs and proteins. A serial analysis of gene expression (SAGE) was carried out in parallel with proteomic analysis on fully-grown ovarian follicles from zebrafish (Danio rerio). The data obtained were compared with ovary/follicle/egg molecular phenotypes of other animals, published or available in public sequence databases. Results Sequencing of 27,486 SAGE tags identified 11,399 different ones, including 3,329 tags with an occurrence superior to one. Fifty-eight genes were expressed at over 0.15% of the total population and represented 17.34% of the mRNA population identified. The three most expressed transcripts were a rhamnose-binding lectin, beta-actin 2, and a transcribed locus similar to the H2B histone family. Comparison with the large-scale expressed sequence tags sequencing approach revealed highly expressed transcripts that were not previously known to be expressed at high levels in fish ovaries, like the short-sized polarized metallothionein 2 transcript. A higher sensitivity for the detection of transcripts with a characterized maternal genetic contribution was also demonstrated compared to large-scale sequencing of cDNA libraries. Ferritin heavy polypeptide 1, heat shock protein 90-beta, lactate dehydrogenase B4, beta-actin isoforms, tubulin beta 2, ATP synthase subunit 9, together with 40 S ribosomal protein S27a, were common highly-expressed transcripts of vertebrate ovary/unfertilized egg. Comparison of transcriptome and proteome data revealed that transcript levels provide little predictive value with respect to the extent of protein abundance. All the proteins identified by proteomic analysis of fully-grown zebrafish follicles had at least one transcript counterpart, with two exceptions: eosinophil chemotactic cytokine and nothepsin. Conclusion This study provides a complete sequence data set of maternal mRNA stored in zebrafish germ cells at the end of oogenesis. This catalogue contains highly-expressed transcripts that are part of a vertebrate ovarian expressed gene signature. Comparison of transcriptome and proteome data identified downregulated transcripts or proteins potentially incorporated in the oocyte by endocytosis. The molecular phenotype described provides groundwork for future experimental approaches aimed at identifying functionally important stored maternal transcripts and proteins involved in oogenesis and early stages of embryo development. PMID:16526958
Chromosome-level assembly of Arabidopsis thaliana Ler reveals the extent of translocation and inversion polymorphisms.

PubMed

Zapata, Luis; Ding, Jia; Willing, Eva-Maria; Hartwig, Benjamin; Bezdan, Daniela; Jiao, Wen-Biao; Patel, Vipul; Velikkakam James, Geo; Koornneef, Maarten; Ossowski, Stephan; Schneeberger, Korbinian

2016-07-12

Resequencing or reference-based assemblies reveal large parts of the small-scale sequence variation. However, they typically fail to separate such local variation into colinear and rearranged variation, because they usually do not recover the complement of large-scale rearrangements, including transpositions and inversions. Besides the availability of hundreds of genomes of diverse Arabidopsis thaliana accessions, there is so far only one full-length assembled genome: the reference sequence. We have assembled 117 Mb of the A. thaliana Landsberg erecta (Ler) genome into five chromosome-equivalent sequences using a combination of short Illumina reads, long PacBio reads, and linkage information. Whole-genome comparison against the reference sequence revealed 564 transpositions and 47 inversions comprising ∼3.6 Mb, in addition to 4.1 Mb of nonreference sequence, mostly originating from duplications. Although rearranged regions are not different in local divergence from colinear regions, they are drastically depleted for meiotic recombination in heterozygotes. Using a 1.2-Mb inversion as an example, we show that such rearrangement-mediated reduction of meiotic recombination can lead to genetically isolated haplotypes in the worldwide population of A. thaliana Moreover, we found 105 single-copy genes, which were only present in the reference sequence or the Ler assembly, and 334 single-copy orthologs, which showed an additional copy in only one of the genomes. To our knowledge, this work gives first insights into the degree and type of variation, which will be revealed once complete assemblies will replace resequencing or other reference-dependent methods.
Comparison of corrosion scales in full and partially replaced lead service lines after changes in water quality

EPA Science Inventory

Preliminary results from scales formed 38 weeks following the LSL replacement simulations revealed differences in scale formations amongst varying water qualities and pipe sequence. Rigs fed with dechlorinated tap water show distinct pH gradients between the galvanic and the back...
Characterization and analysis of a transcriptome from the boreal spider crab Hyas araneus.

PubMed

Harms, Lars; Frickenhaus, Stephan; Schiffer, Melanie; Mark, Felix C; Storch, Daniela; Pörtner, Hans-Otto; Held, Christoph; Lucassen, Magnus

2013-12-01

Research investigating the genetic basis of physiological responses has significantly broadened our understanding of the mechanisms underlying organismic response to environmental change. However, genomic data are currently available for few taxa only, thus excluding physiological model species from this approach. In this study we report the transcriptome of the model organism Hyas araneus from Spitsbergen (Arctic). We generated 20,479 transcripts, using the 454 GS FLX sequencing technology in combination with an Illumina HiSeq sequencing approach. Annotation by Blastx revealed 7159 blast hits in the NCBI non-redundant protein database. The comparison between the spider crab H. araneus transcriptome and EST libraries of the European lobster Homarus americanus and the porcelain crab Petrolisthes cinctipes yielded 3229/2581 sequences with a significant hit, respectively. The clustering by the Markov Clustering Algorithm (MCL) revealed a common core of 1710 clusters present in all three species and 5903 unique clusters for H. araneus. The combined sequencing approaches generated transcripts that will greatly expand the limited genomic data available for crustaceans. We introduce the MCL clustering for transcriptome comparisons as a simple approach to estimate similarities between transcriptomic libraries of different size and quality and to analyze homologies within the selected group of species. In particular, we identified a large variety of reverse transcriptase (RT) sequences not only in the H. araneus transcriptome and other decapod crustaceans, but also sea urchin, supporting the hypothesis of a heritable, anti-viral immunity and the proposed viral fragment integration by host-derived RTs in marine invertebrates. © 2013.
The primary structures of ribosomal proteins S14 and S16 from the archaebacterium Halobacterium marismortui. Comparison with eubacterial and eukaryotic ribosomal proteins.

PubMed

Kimura, J; Kimura, M

1987-09-05

The amino acid sequences of two ribosomal proteins, S14 and S16, from the archaebacterium Halobacterium marismortui have been determined. Sequence data were obtained by the manual and solid-phase sequencing of peptides derived from enzymatic digestions with trypsin, chymotrypsin, pepsin, and Staphylococcus aureus protease as well as by chemical cleavage with cyanogen bromide. Proteins S14 and S16 contain 109 and 126 amino acid residues and have Mr values of 11,964 and 13,515, respectively. Comparison of the sequences with those of ribosomal proteins from other organisms demonstrates that S14 has a significant homology with the rat liver ribosomal protein S11 (36% identity) as well as with the Escherichia coli ribosomal protein S17 (37%), and that S16 is related to the yeast ribosomal protein YS22 (40%) and proteins S8 from E. coli (28%) and Bacillus stearothermophilus (30%). A comparison of the amino acid residues in the homologous regions of halophilic and nonhalophilic ribosomal proteins reveals that halophilic proteins have more glutamic acids, asparatic acids, prolines, and alanines, and less lysines, arginines, and isoleucines than their nonhalophilic counterparts. These amino acid substitutions probably contribute to the structural stability of halophilic ribosomal proteins.
Antigenic and genetic comparison of foot-and-mouth disease virus serotype O Indian vaccine strain, O/IND/R2/75 against currently circulating viruses

PubMed Central

Mahapatra, Mana; Yuvaraj, S.; Madhanmohan, M.; Subramaniam, S.; Pattnaik, B.; Paton, D.J.; Srinivasan, V.A.; Parida, Satya

2015-01-01

Foot-and-mouth disease (FMD) virus serotype O is the most common cause of FMD outbreaks in India and three of the six lineages that have been described are most frequently detected, namely Ind2001, PanAsia and PanAsia 2. We report the full capsid sequence of 21 serotype O viruses isolated from India between 2002 and 2012. All these viruses belong to the Middle East–South Asia (ME–SA) topotype. The serological cross-reactivity of a bovine post-vaccination serum pool raised against the current Indian vaccine strain, O/IND/R2/75,was tested by virus neutralisation test with the 23 Indian field isolates, revealing a good match between the vaccine and the field isolates. The cross reactivity of the O/IND/R2/75 vaccine with 19 field isolates from other countries (mainly from Asia and Africa) revealed a good match to 79% of the viruses indicating that the vaccine strain is broadly cross-reactive and could be used to control FMD in other countries. Comparison of the capsid sequences of the serologically non-matching isolates with the vaccine strain sequence identified substitutions in neutralising antigenic sites 1 and 2, which could explain the observed serological differences. PMID:25500306
Complete genomic sequence of Powassan virus: evaluation of genetic elements in tick-borne versus mosquito-borne flaviviruses.

PubMed

Mandl, C W; Holzmann, H; Kunz, C; Heinz, F X

1993-05-01

The complete nucleotide sequence of the positive-stranded RNA genome of the tick-borne flavivirus Powassan (10,839 nucleotides) was elucidated and the amino acid sequence of all viral proteins was derived. Based on this sequence as well as serological data, Powassan virus represents the most divergent member of the tick-borne serocomplex within the genus flaviviruses, family Flaviviridae. The primary nucleotide sequence and potential RNA secondary structures of the Powassan virus genome as well as the protein sequences and the reactivities of the virion with a panel of monoclonal antibodies were compared to other tick-borne and mosquito-borne flaviviruses. These analyses corroborated significant differences between tick-borne and mosquito-borne flaviviruses, but also emphasized structural elements that are conserved among both vector groups. The comparisons among tick-borne flaviviruses revealed conserved sequence elements that might represent important determinants of the tick-borne flavivirus phenotype.
Nucleotide sequence determination of guinea-pig casein B mRNA reveals homology with bovine and rat alpha s1 caseins and conservation of the non-coding regions of the mRNA.

PubMed Central

Hall, L; Laird, J E; Craig, R K

1984-01-01

Nucleotide sequence analysis of cloned guinea-pig casein B cDNA sequences has identified two casein B variants related to the bovine and rat alpha s1 caseins. Amino acid homology was largely confined to the known bovine or predicted rat phosphorylation sites and within the 'signal' precursor sequence. Comparison of the deduced nucleotide sequence of the guinea-pig and rat alpha s1 casein mRNA species showed greater sequence conservation in the non-coding than in the coding regions, suggesting a functional and possibly regulatory role for the non-coding regions of casein mRNA. The results provide insight into the evolution of the casein genes, and raise questions as to the role of conserved nucleotide sequences within the non-coding regions of mRNA species. Images Fig. 1. PMID:6548375
A putative carbohydrate-binding domain of the lactose-binding Cytisus sessilifolius anti-H(O) lectin has a similar amino acid sequence to that of the L-fucose-binding Ulex europaeus anti-H(O) lectin.

PubMed

Konami, Y; Yamamoto, K; Osawa, T; Irimura, T

1995-04-01

The complete amino acid sequence of a lactose-binding Cytisus sessilifolius anti-H(O) lectin II (CSA-II) was determined using a protein sequencer. After digestion of CSA-II with endoproteinase Lys-C or Asp-N, the resulting peptides were purified by reversed-phase high performance liquid chromatography (HPLC) and then subjected to sequence analysis. Comparison of the complete amino acid sequence of CSA-II with the sequences of other leguminous seed lectins revealed regions of extensive homology. The amino acid sequence of a putative carbohydrate-binding domain of CSA-II was found to be similar to those of several anti-H(O) leguminous lectins, especially to that of the L-fucose-binding Ulex europaeus lectin I (UEA-I).
Structural analysis of the 5{prime} region of mouse and human Huntington disease genes reveals conservation of putative promoter region and Di- and trinucleotide polymorphisms

DOE Office of Scientific and Technical Information (OSTI.GOV)

Lin, Biaoyang; Nasir, J.; Kalchman, M.A.

1995-02-10

We have previously cloned and characterized the murine homologue of the Huntington disease (HD) gene and shown that it maps to mouse chromosome 5 within a region of conserved synteny with human chromosome 4p16.3. Here we present a detailed comparison of the sequence of the putative promoter and the organization of the 5{prime} genomic region of the murine (Hdh) and human HD genes encompassing the first five exons. We show that in this region these two genes share identical exon boundaries, but have different-size introns. Two dinucleotide (CT) and one trinucleotide intronic polymorphism in Hdh and an intronic CA polymorphismmore » in the HD gene were identified. Comparison of 940-bp sequence 5{prime} to the putative translation start site reveals a highly conserved region (78.8% nucleotide identity) between Hdh and the HD gene from nucleotide -56 to -206 (of Hdh). Neither Hdh nor the HD gene have typical TATA or CCAAT elements, but both show one putative AP2 binding site and numerous potential Sp1 binding sites. The high sequence identity between Hdh and the HD gene for approximately 200 bp 5{prime} to the putative translation start site indicates that these sequences may play a role in regulating expression of the Huntington disease gene. 30 refs., 4 figs., 2 tabs.« less
Annotation of the Clostridium Acetobutylicum Genome

DOE Office of Scientific and Technical Information (OSTI.GOV)

Daly, M. J.

The genome sequence of the solvent producing bacterium Clostridium acetobutylicum ATCC824, has been determined by the shotgun approach. The genome consists of a 3.94 Mb chromosome and a 192 kb megaplasmid that contains the majority of genes responsible for solvent production. Comparison of C. acetobutylicum to Bacillus subtilis reveals significant local conservation of gene order, which has not been seen in comparisons of other genomes with similar, or, in some cases, closer, phylogenetic proximity. This conservation allows the prediction of many previously undetected operons in both bacteria.

Blood-Borne Candidatus Borrelia algerica in a Patient with Prolonged Fever in Oran, Algeria

PubMed Central

Fotso Fotso, Aurélien; Angelakis, Emmanouil; Mouffok, Nadjet; Drancourt, Michel; Raoult, Didier

2015-01-01

To improve the knowledge base of Borrelia in north Africa, we tested 257 blood samples collected from febrile patients in Oran, Algeria, between January and December 2012 for Borrelia species using flagellin gene polymerase chain reaction sequencing. A sequence indicative of a new Borrelia sp. named Candidatus Borrelia algerica was detected in one blood sample. Further multispacer sequence typing indicated this Borrelia sp. had 97% similarity with Borrelia crocidurae, Borrelia duttonii, and Borrelia recurrentis. In silico comparison of Candidatus B. algerica spacer sequences with those of Borrelia hispanica and Borrelia garinii revealed 94% and 89% similarity, respectively. Candidatus B. algerica is a new relapsing fever Borrelia sp. detected in Oran. Further studies may help predict its epidemiological importance. PMID:26416117
HIV Type 1 Transmission Networks Among Men Having Sex with Men and Heterosexuals in Kenya

PubMed Central

Faria, Nuno Rodrigues; Hassan, Amin; Hamers, Raph L.; Mutua, Gaudensia; Anzala, Omu; Mandaliya, Kishor; Cane, Patricia; Berkley, James A.; Rinke de Wit, Tobias F.; Wallis, Carole; Graham, Susan M.; Price, Matthew A.; Coutinho, Roel A.; Sanders, Eduard J.

2014-01-01

Abstract We performed a molecular phylogenetic study on HIV-1 polymerase sequences of men who have sex with men (MSM) and heterosexual patient samples in Kenya to characterize any observed HIV-1 transmission networks. HIV-1 polymerase sequences were obtained from samples in Nairobi and coastal Kenya from 84 MSM, 226 other men, and 364 women from 2005 to 2010. Using Bayesian phylogenetics, we tested whether sequences clustered by sexual orientation and geographic location. In addition, we used trait diffusion analyses to identify significant epidemiological links and to quantify the number of transmissions between risk groups. Finally, we compared 84 MSM sequences with all HIV-1 sequences available online at GenBank. Significant clustering of sequences from MSM at both coastal Kenya and Nairobi was found, with evidence of HIV-1 transmission between both locations. Although a transmission pair between a coastal MSM and woman was confirmed, no significant HIV-1 transmission was evident between MSM and the comparison population for the predominant subtype A (60%). However, a weak but significant link was evident when studying all subtypes together. GenBank comparison did not reveal other important transmission links. Our data suggest infrequent intermingling of MSM and heterosexual HIV-1 epidemics in Kenya. PMID:23947948
Comparative whole genome DNA methylation profiling of cattle sperm and somatic tissues reveals striking hypomethylated patterns in sperm

USDA-ARS?s Scientific Manuscript database

Using whole-genome bisulfite sequencing (WGBS), we profiled the DNA methylome of cattle sperms through comparison with three bovine somatic tissues (mammary grand, brain and blood). Large differences between them were observed in the methylation patterns of global CpGs, pericentromeric satellites, p...
Newly identified essential amino acid residues affecting ^8-sphingolipid desaturase activity revealed by site-directed mutagenesis

USDA-ARS?s Scientific Manuscript database

In order to identify amino acid residues crucial for the enzymatic activity of ^8-sphingolipid desaturases, a sequence comparison was performed among ^8-sphingolipid desaturases and ^6-fatty acid desaturase from various plants. In addition to the known conserved cytb5 (cytochrome b5) HPGG motif and...
Comparison of G protein sequences of South African street rabies viruses showing distinct progression of the disease in a mouse model of experimental rabies.

PubMed

Seo, Wonhyo; Servat, Alexandre; Cliquet, Florence; Akinbowale, Jenkins; Prehaud, Christophe; Lafon, Monique; Sabeta, Claude

Rabies is a fatal zoonotic disease and infections generally lead to a fatal encephalomyelitis in both humans and animals. In South Africa, domestic (dogs) and the wildlife (yellow mongoose) host species maintain the canid and mongoose rabies variants respectively. In this study, pathogenicity differences of South African canid and mongoose rabies viruses were investigated in a murine model, by assessing the progression of clinical signs and survivorship. Comparison of glycoprotein gene sequences revealed amino acid differences that may underpin the observed pathogenicity differences. Cumulatively, our results suggest that the canid rabies virus may be more neurovirulent in mice than the mongoose rabies variant. Copyright © 2017 Institut Pasteur. Published by Elsevier Masson SAS. All rights reserved.
In silico Comparison of 19 Porphyromonas gingivalis Strains in Genomics, Phylogenetics, Phylogenomics and Functional Genomics.

PubMed

Chen, Tsute; Siddiqui, Huma; Olsen, Ingar

2017-01-01

Currently, genome sequences of a total of 19 Porphyromonas gingivalis strains are available, including eight completed genomes (strains W83, ATCC 33277, TDC60, HG66, A7436, AJW4, 381, and A7A1-28) and 11 high-coverage draft sequences (JCVI SC001, F0185, F0566, F0568, F0569, F0570, SJD2, W4087, W50, Ando, and MP4-504) that are assembled into fewer than 300 contigs. The objective was to compare these genomes at both nucleotide and protein sequence levels in order to understand their phylogenetic and functional relatedness. Four copies of 16S rRNA gene sequences were identified in each of the eight complete genomes and one in the other 11 unfinished genomes. These 43 16S rRNA sequences represent only 24 unique sequences and the derived phylogenetic tree suggests a possible evolutionary history for these strains. Phylogenomic comparison based on shared proteins and whole genome nucleotide sequences consistently showed two groups with closely related members: one consisted of ATCC 33277, 381, and HG66, another of W83, W50, and A7436. At least 1,037 core/shared proteins were identified in the 19 P. gingivalis genomes based on the most stringent detecting parameters. Comparative functional genomics based on genome-wide comparisons between NCBI and RAST annotations, as well as additional approaches, revealed functions that are unique or missing in individual P. gingivalis strains, or species-specific in all P. gingivalis strains, when compared to a neighboring species P. asaccharolytica . All the comparative results of this study are available online for download at ftp://www.homd.org/publication_data/20160425/.
In silico Comparison of 19 Porphyromonas gingivalis Strains in Genomics, Phylogenetics, Phylogenomics and Functional Genomics

PubMed Central

Chen, Tsute; Siddiqui, Huma; Olsen, Ingar

2017-01-01

Currently, genome sequences of a total of 19 Porphyromonas gingivalis strains are available, including eight completed genomes (strains W83, ATCC 33277, TDC60, HG66, A7436, AJW4, 381, and A7A1-28) and 11 high-coverage draft sequences (JCVI SC001, F0185, F0566, F0568, F0569, F0570, SJD2, W4087, W50, Ando, and MP4-504) that are assembled into fewer than 300 contigs. The objective was to compare these genomes at both nucleotide and protein sequence levels in order to understand their phylogenetic and functional relatedness. Four copies of 16S rRNA gene sequences were identified in each of the eight complete genomes and one in the other 11 unfinished genomes. These 43 16S rRNA sequences represent only 24 unique sequences and the derived phylogenetic tree suggests a possible evolutionary history for these strains. Phylogenomic comparison based on shared proteins and whole genome nucleotide sequences consistently showed two groups with closely related members: one consisted of ATCC 33277, 381, and HG66, another of W83, W50, and A7436. At least 1,037 core/shared proteins were identified in the 19 P. gingivalis genomes based on the most stringent detecting parameters. Comparative functional genomics based on genome-wide comparisons between NCBI and RAST annotations, as well as additional approaches, revealed functions that are unique or missing in individual P. gingivalis strains, or species-specific in all P. gingivalis strains, when compared to a neighboring species P. asaccharolytica. All the comparative results of this study are available online for download at ftp://www.homd.org/publication_data/20160425/. PMID:28261563
Estimation of pairwise sequence similarity of mammalian enhancers with word neighbourhood counts.

PubMed

Göke, Jonathan; Schulz, Marcel H; Lasserre, Julia; Vingron, Martin

2012-03-01

The identity of cells and tissues is to a large degree governed by transcriptional regulation. A major part is accomplished by the combinatorial binding of transcription factors at regulatory sequences, such as enhancers. Even though binding of transcription factors is sequence-specific, estimating the sequence similarity of two functionally similar enhancers is very difficult. However, a similarity measure for regulatory sequences is crucial to detect and understand functional similarities between two enhancers and will facilitate large-scale analyses like clustering, prediction and classification of genome-wide datasets. We present the standardized alignment-free sequence similarity measure N2, a flexible framework that is defined for word neighbourhoods. We explore the usefulness of adding reverse complement words as well as words including mismatches into the neighbourhood. On simulated enhancer sequences as well as functional enhancers in mouse development, N2 is shown to outperform previous alignment-free measures. N2 is flexible, faster than competing methods and less susceptible to single sequence noise and the occurrence of repetitive sequences. Experiments on the mouse enhancers reveal that enhancers active in different tissues can be separated by pairwise comparison using N2. N2 represents an improvement over previous alignment-free similarity measures without compromising speed, which makes it a good candidate for large-scale sequence comparison of regulatory sequences. The software is part of the open-source C++ library SeqAn (www.seqan.de) and a compiled version can be downloaded at http://www.seqan.de/projects/alf.html. Supplementary data are available at Bioinformatics online.
Phylogenetic analysis of a spontaneous cocoa bean fermentation metagenome reveals new insights into its bacterial and fungal community diversity.

PubMed

Illeghems, Koen; De Vuyst, Luc; Papalexandratou, Zoi; Weckx, Stefan

2012-01-01

This is the first report on the phylogenetic analysis of the community diversity of a single spontaneous cocoa bean box fermentation sample through a metagenomic approach involving 454 pyrosequencing. Several sequence-based and composition-based taxonomic profiling tools were used and evaluated to avoid software-dependent results and their outcome was validated by comparison with previously obtained culture-dependent and culture-independent data. Overall, this approach revealed a wider bacterial (mainly γ-Proteobacteria) and fungal diversity than previously found. Further, the use of a combination of different classification methods, in a software-independent way, helped to understand the actual composition of the microbial ecosystem under study. In addition, bacteriophage-related sequences were found. The bacterial diversity depended partially on the methods used, as composition-based methods predicted a wider diversity than sequence-based methods, and as classification methods based solely on phylogenetic marker genes predicted a more restricted diversity compared with methods that took all reads into account. The metagenomic sequencing analysis identified Hanseniaspora uvarum, Hanseniaspora opuntiae, Saccharomyces cerevisiae, Lactobacillus fermentum, and Acetobacter pasteurianus as the prevailing species. Also, the presence of occasional members of the cocoa bean fermentation process was revealed (such as Erwinia tasmaniensis, Lactobacillus brevis, Lactobacillus casei, Lactobacillus rhamnosus, Lactococcus lactis, Leuconostoc mesenteroides, and Oenococcus oeni). Furthermore, the sequence reads associated with viral communities were of a restricted diversity, dominated by Myoviridae and Siphoviridae, and reflecting Lactobacillus as the dominant host. To conclude, an accurate overview of all members of a cocoa bean fermentation process sample was revealed, indicating the superiority of metagenomic sequencing over previously used techniques.
Complete mitochondrial genomes of eleven extinct or possibly extinct bird species.

PubMed

Anmarkrud, Jarl A; Lifjeld, Jan T

2017-03-01

Natural history museum collections represent a vast source of ancient and historical DNA samples from extinct taxa that can be utilized by high-throughput sequencing tools to reveal novel genetic and phylogenetic information about them. Here, we report on the successful sequencing of complete mitochondrial genome sequences (mitogenomes) from eleven extinct bird species, using de novo assembly of short sequences derived from toepad samples of degraded DNA from museum specimens. For two species (the Passenger Pigeon Ectopistes migratorius and the South Island Piopio Turnagra capensis), whole mitogenomes were already available from recent studies, whereas for five others (the Great Auk Pinguinis impennis, the Imperial Woodpecker Campehilus imperialis, the Huia Heteralocha acutirostris, the Kauai Oo Moho braccathus and the South Island Kokako Callaeas cinereus), there were partial mitochondrial sequences available for comparison. For all seven species, we found sequence similarities of >98%. For the remaining four species (the Kamao Myadestes myadestinus, the Paradise Parrot Psephotellus pulcherrimus, the Ou Psittirostra psittacea and the Lesser Akialoa Akialoa obscura), there was no sequence information available for comparison, so we conducted blast searches and phylogenetic analyses to determine their phylogenetic positions and identify their closest extant relatives. These mitogenomes will be valuable for future analyses of avian phylogenetics and illustrate the importance of museum collections as repositories for genomics resources. © 2016 John Wiley & Sons Ltd.
Sequence analysis of chloroplast chlB gene of medicinal Ephedra species and its application to authentication of Ephedra Herb.

PubMed

Guo, Yahong; Tsuruga, Ayako; Yamaguchi, Shigeharu; Oba, Koji; Iwai, Kasumi; Sekita, Setsuko; Mizukami, Hajime

2006-06-01

Chloroplast chlB gene encoding subunit B of light-independent protochlorophyllide reductase was amplified from herbarium and crude drug specimens of Ephedra sinica, E. intermedia, E. equisetina, and E. przewalskii. Sequence comparison of the chlB gene indicated that all the E. sinica specimens have the same sequence type (Type S) distinctive from other species, while there are two sequence types (Type E1 and Type E2) in E. equisetina. E. intermedia and E. prezewalskii revealed an identical sequence type (Type IP). E. sinica was also identified by digesting the chlB fragment with Bcl I. A novel method for DNA authentication of Ephedra Herb based on the sequences of the chloroplast chlB gene and internal transcribed spacer of nuclear rRNA genes was developed and successfully applied for identification of the crude drugs obtained in the Chinese market.
The spotted gar genome illuminates vertebrate evolution and facilitates human-to-teleost comparisons

PubMed Central

Braasch, Ingo; Gehrke, Andrew R.; Smith, Jeramiah J.; Kawasaki, Kazuhiko; Manousaki, Tereza; Pasquier, Jeremy; Amores, Angel; Desvignes, Thomas; Batzel, Peter; Catchen, Julian; Berlin, Aaron M.; Campbell, Michael S.; Barrell, Daniel; Martin, Kyle J.; Mulley, John F.; Ravi, Vydianathan; Lee, Alison P.; Nakamura, Tetsuya; Chalopin, Domitille; Fan, Shaohua; Wcisel, Dustin; Cañestro, Cristian; Sydes, Jason; Beaudry, Felix E. G.; Sun, Yi; Hertel, Jana; Beam, Michael J.; Fasold, Mario; Ishiyama, Mikio; Johnson, Jeremy; Kehr, Steffi; Lara, Marcia; Letaw, John H.; Litman, Gary W.; Litman, Ronda T.; Mikami, Masato; Ota, Tatsuya; Saha, Nil Ratan; Williams, Louise; Stadler, Peter F.; Wang, Han; Taylor, John S.; Fontenot, Quenton; Ferrara, Allyse; Searle, Stephen M. J.; Aken, Bronwen; Yandell, Mark; Schneider, Igor; Yoder, Jeffrey A.; Volff, Jean-Nicolas; Meyer, Axel; Amemiya, Chris T.; Venkatesh, Byrappa; Holland, Peter W. H.; Guiguen, Yann; Bobe, Julien; Shubin, Neil H.; Di Palma, Federica; Alföldi, Jessica; Lindblad-Toh, Kerstin; Postlethwait, John H.

2016-01-01

To connect human biology to fish biomedical models, we sequenced the genome of spotted gar (Lepisosteus oculatus), whose lineage diverged from teleosts before the teleost genome duplication (TGD). The slowly evolving gar genome conserved in content and size many entire chromosomes from bony vertebrate ancestors. Gar bridges teleosts to tetrapods by illuminating the evolution of immunity, mineralization, and development (e.g., Hox, ParaHox, and miRNA genes). Numerous conserved non-coding elements (CNEs, often cis-regulatory) undetectable in direct human-teleost comparisons become apparent using gar: functional studies uncovered conserved roles of such cryptic CNEs, facilitating annotation of sequences identified in human genome-wide association studies. Transcriptomic analyses revealed that the sum of expression domains and levels from duplicated teleost genes often approximate patterns and levels of gar genes, consistent with subfunctionalization. The gar genome provides a resource for understanding evolution after genome duplication, the origin of vertebrate genomes, and the function of human regulatory sequences. PMID:26950095
The full genome sequences of 8 equine herpesvirus type 4 isolates from horses in Japan.

PubMed

Izume, Satoko; Kirisawa, Rikio; Ohya, Kenji; Ohnuma, Aiko; Kimura, Takashi; Omatsu, Tsutomu; Katayama, Yukie; Mizutani, Tetsuya; Fukushi, Hideto

2017-01-24

Equine herpesvirus type 4 (EHV-4) is one of the most important pathogens in horses. To clarify the key genes of the EHV-4 genome that cause abortion in female horses, we determined the whole genome sequences of a laboratory strain and 7 Japanese EHV-4 isolates that were isolated from 2 aborted fetuses and nasal swabs of 5 horses with respiratory disease. The full genome sequences and predicted amino acid sequences of each gene of these isolates were compared with of the reference EHV-4 strain NS80567 and Australian isolates that were reported in 2015. The EHV-4 isolates clustered in 2 groups which did not reflect their pathogenicity. A comparison of the predicted amino acid sequences of the genes did not reveal any genes that were associated with EHV-4-induced abortion.
The complete chloroplast genome of Aconitum chiisanense Nakai (Ranunculaceae).

PubMed

Lim, Chae Eun; Kim, Goon-Bo; Baek, Seunghoon; Han, Su-Min; Yu, Hee-Ju; Mun, Jeong-Hwan

2017-01-01

We determined the complete chloroplast DNA sequence of Aconitum chiisanense Nakai, a rare Aconitum species endemic to Korea. The chloroplast genome is 155 934 bp in length and contains 4 rRNA, 30 tRNA, and 78 protein-coding genes. Phylogenetic analysis revealed that the chloroplast genome of A. chiisanense is closely related to that of A. barbatum var. puberulum. Sequence comparison with other Ranunculaceae chloroplasts identified a unique deletion in the rps16 gene of A. chiisanense chloroplast DNA that can serve as a molecular marker for species identification.
Functional analysis of Pacific oyster (Crassostrea gigas) β-thymosin: Focus on antimicrobial activity.

PubMed

Nam, Bo-Hye; Seo, Jung-Kil; Lee, Min Jeong; Kim, Young-Ok; Kim, Dong-Gyun; An, Cheul Min; Park, Nam Gyu

2015-07-01

An antimicrobial peptide, ∼5 kDa in size, was isolated and purified in its active form from the mantle of the Pacific oyster Crassostrea gigas by C18 reversed-phase high-performance liquid chromatography. Matrix-assisted laser desorption ionisation time-of-flight analysis revealed 4656.4 Da of the purified and unreduced peptide. A comparison of the N-terminal amino acid sequence of oyster antimicrobial peptide with deduced amino acid sequences in our local expressed sequence tag (EST) database of C. gigas (unpublished data) revealed that the oyster antimicrobial peptide sequence entirely matched the deduced amino acid sequence of an EST clone (HM-8_A04), which was highly homologous with the β-thymosin of other species. The cDNA possessed a 126-bp open reading frame that encoded a protein of 41 amino acids. To confirm the antimicrobial activity of C. gigas β-thymosin, we overexpressed a recombinant β-thymosin (rcgTβ) using a pET22 expression plasmid in an Escherichia coli system. The antimicrobial activity of rcgTβ was evaluated and demonstrated using a bacterial growth inhibition test in both liquid and solid cultures. Copyright © 2015 Elsevier Ltd. All rights reserved.
Chromobacterium sphagni sp. nov., an insecticidal bacterium isolated from Sphagnum bogs.

PubMed

Blackburn, Michael B; Farrar, Robert R; Sparks, Michael E; Kuhar, Daniel; Mitchell, Ashaki; Gundersen-Rindal, Dawn E

2017-09-01

Sixteen isolates of Gram-reaction-negative, motile, violet-pigmented bacteria were isolated from Sphagnum bogs in West Virginia and Maine, USA. 16S rRNA gene sequences and fatty acid analysis revealed a high degree of relatedness among the isolates, and genome sequencing of two isolates, IIBBL 14B-1T and IIBBL 37-2 (from West Virginia and Maine, respectively), revealed highly similar genomic sequences. The average nucleotide identity (gANI) calculated for these two isolates was found to be in excess of 99 %, but did not exceed 88 % when comparing either isolate with genomic sequences of Chromobacterium violaceum ATCC 12472T, C. haemolyticum DSM 19808T, C. piscinae ND17, C. subtsugae PRAA4-1T, C. vaccinii MWU205T or C. amazonense CBMAI 310T. Collectively, gANI and 16S rRNA gene sequence comparisons suggested that isolates IIBBL 14B-1T and IIBBL 37-2 were most closely related to C. subtsugae, but represented a distinct species. We propose the name Chromobacterium sphagni sp. nov. for this taxon; the type strain is IIBBL 14B-1T (=NRRL B-67130T=JCM 31882T).
Assembly of the Lactuca sativa, L. cv. Tizian draft genome sequence reveals differences within major resistance complex 1 as compared to the cv. Salinas reference genome.

PubMed

Verwaaijen, Bart; Wibberg, Daniel; Nelkner, Johanna; Gordin, Miriam; Rupp, Oliver; Winkler, Anika; Bremges, Andreas; Blom, Jochen; Grosch, Rita; Pühler, Alfred; Schlüter, Andreas

2018-02-10

Lettuce (Lactuca sativa, L.) is an important annual plant of the family Asteraceae (Compositae). The commercial lettuce cultivar Tizian has been used in various scientific studies investigating the interaction of the plant with phytopathogens or biological control agents. Here, we present the de novo draft genome sequencing and gene prediction for this specific cultivar derived from transcriptome sequence data. The assembled scaffolds amount to a size of 2.22 Gb. Based on RNAseq data, 31,112 transcript isoforms were identified. Functional predictions for these transcripts were determined within the GenDBE annotation platform. Comparison with the cv. Salinas reference genome revealed a high degree of sequence similarity on genome and transcriptome levels, with an average amino acid identity of 99%. Furthermore, it was observed that two large regions are either missing or are highly divergent within the cv. Tizian genome compared to cv. Salinas. One of these regions covers the major resistance complex 1 region of cv. Salinas. The cv. Tizian draft genome sequence provides a valuable resource for future functional and transcriptome analyses focused on this lettuce cultivar. Copyright © 2017 Elsevier B.V. All rights reserved.
High-resolution phylogeography of zoonotic tapeworm Echinococcus granulosus sensu stricto genotype G1 with an emphasis on its distribution in Turkey, Italy and Spain.

PubMed

Kinkar, Liina; Laurimäe, Teivi; Simsek, Sami; Balkaya, Ibrahim; Casulli, Adriano; Manfredi, Maria Teresa; Ponce-Gordo, Francisco; Varcasia, Antonio; Lavikainen, Antti; González, Luis Miguel; Rehbein, Steffen; VAN DER Giessen, Joke; Sprong, Hein; Saarma, Urmas

2016-11-01

Echinococcus granulosus is the causative agent of cystic echinococcosis. The disease is a significant global public health concern and human infections are most commonly associated with E. granulosus sensu stricto (s. s.) genotype G1. The objectives of this study were to: (i) analyse the genetic variation and phylogeography of E. granulosus s. s. G1 in part of its main distribution range in Europe using 8274 bp of mtDNA; (ii) compare the results with those derived from previously used shorter mtDNA sequences and highlight the major differences. We sequenced a total of 91 E. granulosus s. s. G1 isolates from six different intermediate host species, including humans. The isolates originated from seven countries representing primarily Turkey, Italy and Spain. Few samples were also from Albania, Greece, Romania and from a patient originating from Algeria, but diagnosed in Finland. The analysed 91 sequences were divided into 83 haplotypes, revealing complex phylogeography and high genetic variation of E. granulosus s. s. G1 in Europe, particularly in the high-diversity domestication centre of western Asia. Comparisons with shorter mtDNA datasets revealed that 8274 bp sequences provided significantly higher phylogenetic resolution and thus more power to reveal the genetic relations between different haplotypes.
Cloning and Sequence Analysis of Vibrio halioticoli Genes Encoding Three Types of Polyguluronate Lyase.

PubMed

Sugimura; Sawabe; Ezura

2000-01-01

The alginate lyase-coding genes of Vibrio halioticoli IAM 14596(T), which was isolated from the gut of the abalone Haliotis discus hannai, were cloned using plasmid vector pUC 18, and expressed in Escherichia coli. Three alginate lyase-positive clones, pVHB, pVHC, and pVHE, were obtained, and all clones expressed the enzyme activity specific for polyguluronate. Three genes, alyVG1, alyVG2, and alyVG3, encoding polyguluronate lyase were sequenced: alyVG1 from pVHB was composed of a 1056-bp open reading frame (ORF) encoding 352 amino acid residues; alyVG2 gene from pVHC was composed of a 993-bp ORF encoding 331 amino acid residues; and alyVG3 gene from pVHE was composed of a 705-bp ORF encoding 235 amino acid residues. Comparison of nucleotide and deduced amino acid sequences among AlyVG1, AlyVG2, and AlyVG3 revealed low homologies. The identity value between AlyVG1 and AlyVG2 was 18.7%, and that between AlyVG2 and AlyVG3 was 17.0%. A higher identity value (26.0%) was observed between AlyVG1 and AlyVG3. Sequence comparison among known polyguluronate lyases including AlyVG1, AlyVG2, and AlyVG3 also did not reveal an identical region in these sequences. However, AlyVG1 showed the highest identity value (36.2%) and the highest similarity (73.3%) to AlyA from Klebsiella pneumoniae. A consensus region comprising nine amino acid (YFKAGXYXQ) in the carboxy-terminal region previously reported by Mallisard and colleagues was observed only in AlyVG1 and AlyVG2.
Evolution of long centromeres in fire ants.

PubMed

Huang, Yu-Ching; Lee, Chih-Chi; Kao, Chia-Yi; Chang, Ni-Chen; Lin, Chung-Chi; Shoemaker, DeWayne; Wang, John

2016-09-15

Centromeres are essential for accurate chromosome segregation, yet sequence conservation is low even among closely related species. Centromere drive predicts rapid turnover because some centromeric sequences may compete better than others during female meiosis. In addition to sequence composition, longer centromeres may have a transmission advantage. We report the first observations of extremely long centromeres, covering on average 34 % of the chromosomes, in the red imported fire ant Solenopsis invicta. By comparison, cytological examination of Solenopsis geminata revealed typical small centromeric constrictions. Bioinformatics and molecular analyses identified CenSol, the major centromeric satellite DNA repeat. We found that CenSol sequences are very similar between the two species but the CenSol copy number in S. invicta is much greater than that in S. geminata. In addition, centromere expansion in S. invicta is not correlated with the duplication of CenH3. Comparative analyses revealed that several closely related fire ant species also possess long centromeres. Our results are consistent with a model of simple runaway centromere expansion due to centromere drive. We suggest expanded centromeres may be more prevalent in hymenopteran insects, which use haplodiploid sex determination, than previously considered.

[Hepatitis C virus: sequence homology of a European isolate and divergence from the prototype].

PubMed

Seelig, R; Seelig, H P; Renz, M

1991-08-01

The polymerase chain reaction (PCR) detected specific hepatitis C viral (HCV) RNA sequences in liver biopsies from two patients with chronic hepatitis, in the tissue of a liver implantate, in plasma from four chronic non-A, non-B hepatitis (NANBH) patients and, for the first time, in an infectious anti-D-immunoglobulin preparation. A comparison of the viral sequences coding for a region for the nonstructural NS3 protein from the liver tissues revealed only a very small degree of sequence divergence on the cDNA as well as on the amino acid level (between 0 and 5%). The sequence similarities of the RNA isolated from plasma of the four chronic NANBH patients and the anti-D-immunoglobulin preparation were partly somewhat lower but altogether also high (between 90 and 100%). In contrast, all eight cDNA and amino acid sequences exhibited a significantly higher degree of divergence in comparison with the HCV prototype sequence (between 29 and 32%) than among themselves (between 0 and 10%). This unexpected high sequence similarity of the eight European isolates and their low homology to the Northamerican prototype sequence is indicative for the existence of different types of HCV. This will be important not only for epidemiological studies but also for the development of effective diagnostic procedures and vaccines. Concerning the pathogenesis of NANBH, a double infection or a helper mechanism has to be considered: in addition to the C virus, sequences of an other virus particle were found in the infectious IgG preparation as well as in the liver biopsies.
Genomic characterization and taxonomic position of a rhabdovirus from a hybrid snakehead.

PubMed

Zeng, Weiwei; Wang, Qing; Wang, Yingying; Liu, Cun; Liang, Hongru; Fang, Xiang; Wu, Shuqin

2014-09-01

A new rhabdovirus, tentatively designated as hybrid snakehead rhabdovirus C1207 (HSHRV-C1207), was first isolated from a moribund hybrid snakehead (Channa maculata×Channa argus) in China. We present the complete genome sequence of HSHRV-C1207 and a comprehensive sequence comparison between HSHRV-C1207 and other rhabdoviruses. Sequence alignment and phylogenetic analysis revealed that HSHRV-C1207 shared the highest degree of homology with Monopterus albus rhabdovirus and Siniperca chuatsi rhabdovirus. All three viruses clustered into a single group that was distinct from the recognized genera in the family Rhabdoviridae. Our analysis suggests that HSHRV-C1207, as well as MARV and SCRV, should be assigned to a new rhabdovirus genus.
Megalocytivirus infection in cultured Nile tilapia Oreochromis niloticus.

PubMed

Subramaniam, Kuttichantran; Gotesman, Michael; Smith, Charlie E; Steckler, Natalie K; Kelley, Karen L; Groff, Joseph M; Waltzek, Thomas B

2016-05-26

Megalocytiviruses, such as infectious spleen and kidney necrosis virus (ISKNV), induce lethal systemic diseases in both ornamental and food fish species. In this study, we investigated an epizootic affecting Nile tilapia Oreochromis niloticus cultured in the US Midwest. Diseased fish displayed lethargy, gill pallor, and distension of the coelomic cavity due to ascites. Histopathological examination revealed a severe systemic abundance of intravascular megalocytes that were especially prominent in the gills, kidney, spleen, liver, and intestinal submucosa. Transmission electron microscopic examination revealed abundant intracytoplasmic polygonal virions consistent with iridovirus infection. Comparison of the full-length major capsid protein nucleotide sequences from a recent outbreak with a remarkably similar case that occurred at the same facility many years earlier revealed that both epizootics were caused by ISKNV. A comparison of this case with previous reports suggests that ISKNV may represent a greater threat to tilapia aquaculture than previously realized.
Simultaneous Differentiation and Typing of Entamoeba histolytica and Entamoeba dispar

PubMed Central

Zaki, Mehreen; Meelu, Parool; Sun, Wei; Clark, C. Graham

2002-01-01

Sequences corresponding to some of the polymorphic loci previously reported from Entamoeba histolytica have been detected in Entamoeba dispar. Comparison of nucleotide sequences of two loci between E. dispar strain SAW760 and E. histolytica strain HM-1:IMSS revealed significant differences in both repeat and flanking regions. The tandem repeat units varied not only in sequence but also in number and arrangement between the two species at both the loci. Using the sequences obtained, primer pairs aimed at amplifying species-specific products were designed and tested on a variety of E. histolytica and E. dispar samples. Amplification results were in complete agreement with the original species classification in all cases, and the PCR products displayed discernible size and pattern variations among the isolates. PMID:11923344
Re-Assembly and Analysis of an Ancient Variola Virus Genome.

PubMed

Smithson, Chad; Imbery, Jacob; Upton, Chris

2017-09-08

We report a major improvement to the assembly of published short read sequencing data from an ancient variola virus (VARV) genome by the removal of contig-capping sequencing tags and manual searches for gap-spanning reads. The new assembly, together with camelpox and taterapox genomes, permitted new dates to be calculated for the last common ancestor of all VARV genomes. The analysis of recently sequenced VARV-like cowpox virus genomes showed that single nucleotide polymorphisms (SNPs) and amino acid changes in the vaccinia virus (VACV)-Cop-O1L ortholog, predicted to be associated with VARV host specificity and virulence, were introduced into the lineage before the divergence of these viruses. A comparison of the ancient and modern VARV genome sequences also revealed a measurable drift towards adenine + thymine (A + T) richness.
Three copies of a single protein II-encoding sequence in the genome of Neisseria gonorrhoeae JS3: evidence for gene conversion and gene duplication.

PubMed

van der Ley, P

1988-11-01

Gonococci express a family of related outer membrane proteins designated protein II (P.II). These surface proteins are subject to both phase variation and antigenic variation. The P.II gene repertoire of Neisseria gonorrhoeae strain JS3 was found to consist of at least ten genes, eight of which were cloned. Sequence analysis and DNA hybridization studies revealed that one particular P.II-encoding sequence is present in three distinct, but almost identical, copies in the JS3 genome. These genes encode the P.II protein that was previously identified as P.IIc. Comparison of their sequences shows that the multiple copies of this P.IIc-encoding gene might have been generated by both gene conversion and gene duplication.
Arpeggio: harmonic compression of ChIP-seq data reveals protein-chromatin interaction signatures

PubMed Central

Stanton, Kelly Patrick; Parisi, Fabio; Strino, Francesco; Rabin, Neta; Asp, Patrik; Kluger, Yuval

2013-01-01

Researchers generating new genome-wide data in an exploratory sequencing study can gain biological insights by comparing their data with well-annotated data sets possessing similar genomic patterns. Data compression techniques are needed for efficient comparisons of a new genomic experiment with large repositories of publicly available profiles. Furthermore, data representations that allow comparisons of genomic signals from different platforms and across species enhance our ability to leverage these large repositories. Here, we present a signal processing approach that characterizes protein–chromatin interaction patterns at length scales of several kilobases. This allows us to efficiently compare numerous chromatin-immunoprecipitation sequencing (ChIP-seq) data sets consisting of many types of DNA-binding proteins collected from a variety of cells, conditions and organisms. Importantly, these interaction patterns broadly reflect the biological properties of the binding events. To generate these profiles, termed Arpeggio profiles, we applied harmonic deconvolution techniques to the autocorrelation profiles of the ChIP-seq signals. We used 806 publicly available ChIP-seq experiments and showed that Arpeggio profiles with similar spectral densities shared biological properties. Arpeggio profiles of ChIP-seq data sets revealed characteristics that are not easily detected by standard peak finders. They also allowed us to relate sequencing data sets from different genomes, experimental platforms and protocols. Arpeggio is freely available at http://sourceforge.net/p/arpeggio/wiki/Home/. PMID:23873955
Arpeggio: harmonic compression of ChIP-seq data reveals protein-chromatin interaction signatures.

PubMed

Stanton, Kelly Patrick; Parisi, Fabio; Strino, Francesco; Rabin, Neta; Asp, Patrik; Kluger, Yuval

2013-09-01

Researchers generating new genome-wide data in an exploratory sequencing study can gain biological insights by comparing their data with well-annotated data sets possessing similar genomic patterns. Data compression techniques are needed for efficient comparisons of a new genomic experiment with large repositories of publicly available profiles. Furthermore, data representations that allow comparisons of genomic signals from different platforms and across species enhance our ability to leverage these large repositories. Here, we present a signal processing approach that characterizes protein-chromatin interaction patterns at length scales of several kilobases. This allows us to efficiently compare numerous chromatin-immunoprecipitation sequencing (ChIP-seq) data sets consisting of many types of DNA-binding proteins collected from a variety of cells, conditions and organisms. Importantly, these interaction patterns broadly reflect the biological properties of the binding events. To generate these profiles, termed Arpeggio profiles, we applied harmonic deconvolution techniques to the autocorrelation profiles of the ChIP-seq signals. We used 806 publicly available ChIP-seq experiments and showed that Arpeggio profiles with similar spectral densities shared biological properties. Arpeggio profiles of ChIP-seq data sets revealed characteristics that are not easily detected by standard peak finders. They also allowed us to relate sequencing data sets from different genomes, experimental platforms and protocols. Arpeggio is freely available at http://sourceforge.net/p/arpeggio/wiki/Home/.
Antigenic and genetic comparison of foot-and-mouth disease virus serotype O Indian vaccine strain, O/IND/R2/75 against currently circulating viruses.

PubMed

Mahapatra, Mana; Yuvaraj, S; Madhanmohan, M; Subramaniam, S; Pattnaik, B; Paton, D J; Srinivasan, V A; Parida, Satya

2015-01-29

Foot-and-mouth disease (FMD) virus serotype O is the most common cause of FMD outbreaks in India and three of the six lineages that have been described are most frequently detected, namely Ind2001, PanAsia and PanAsia 2. We report the full capsid sequence of 21 serotype O viruses isolated from India between 2002 and 2012. All these viruses belong to the Middle East-South Asia (ME-SA) topotype. The serological cross-reactivity of a bovine post-vaccination serum pool raised against the current Indian vaccine strain, O/IND/R2/75,was tested by virus neutralisation test with the 23 Indian field isolates, revealing a good match between the vaccine and the field isolates. The cross reactivity of the O/IND/R2/75 vaccine with 19 field isolates from other countries (mainly from Asia and Africa) revealed a good match to 79% of the viruses indicating that the vaccine strain is broadly cross-reactive and could be used to control FMD in other countries. Comparison of the capsid sequences of the serologically non-matching isolates with the vaccine strain sequence identified substitutions in neutralising antigenic sites 1 and 2, which could explain the observed serological differences. Copyright © 2014 The Authors. Published by Elsevier Ltd.. All rights reserved.
Center for Regenerative Biology and Medicine at Mount Desert Island Biological Laboratory

DTIC Science & Technology

2012-06-01

Code Axolotl microRNAs Zebrafish Polypterus 16. SECURITY CLASSIFICATION OF: 17. LIMITATION OF...controlled in both Polypterus and axolotl samples. These comparisons revealed a total of 2779 shared genes that are significantly upregulated during...UPREGULATED DOWNREGULATED Figure 1: Venn diagram of UniProt protein sequence IDs among Axolotl and Polypterus contigs that were up-regulated
Analysis of septins across kingdoms reveals orthology and new motifs.

PubMed

Pan, Fangfang; Malmberg, Russell L; Momany, Michelle

2007-07-01

Septins are cytoskeletal GTPase proteins first discovered in the fungus Saccharomyces cerevisiae where they organize the septum and link nuclear division with cell division. More recently septins have been found in animals where they are important in processes ranging from actin and microtubule organization to embryonic patterning and where defects in septins have been implicated in human disease. Previous studies suggested that many animal septins fell into independent evolutionary groups, confounding cross-kingdom comparison. In the current work, we identified 162 septins from fungi, microsporidia and animals and analyzed their phylogenetic relationships. There was support for five groups of septins with orthology between kingdoms. Group 1 (which includes S. cerevisiae Cdc10p and human Sept9) and Group 2 (which includes S. cerevisiae Cdc3p and human Sept7) contain sequences from fungi and animals. Group 3 (which includes S. cerevisiae Cdc11p) and Group 4 (which includes S. cerevisiae Cdc12p) contain sequences from fungi and microsporidia. Group 5 (which includes Aspergillus nidulans AspE) contains sequences from filamentous fungi. We suggest a modified nomenclature based on these phylogenetic relationships. Comparative sequence alignments revealed septin derivatives of already known G1, G3 and G4 GTPase motifs, four new motifs from two to twelve amino acids long and six conserved single amino acid positions. One of these new motifs is septin-specific and several are group specific. Our studies provide an evolutionary history for this important family of proteins and a framework and consistent nomenclature for comparison of septin orthologs across kingdoms.
Typing and comparative genome analysis of Brucella melitensis isolated from Lebanon.

PubMed

Abou Zaki, Natalia; Salloum, Tamara; Osman, Marwan; Rafei, Rayane; Hamze, Monzer; Tokajian, Sima

2017-10-16

Brucella melitensis is the main causative agent of the zoonotic disease brucellosis. This study aimed at typing and characterizing genetic variation in 33 Brucella isolates recovered from patients in Lebanon. Bruce-ladder multiplex PCR and PCR-RFLP of omp31, omp2a and omp2b were performed. Sixteen representative isolates were chosen for draft-genome sequencing and analyzed to determine variations in virulence, resistance, genomic islands, prophages and insertion sequences. Comparative whole-genome single nucleotide polymorphism analysis was also performed. The isolates were confirmed to be B. melitensis. Genome analysis revealed multiple virulence determinants and efflux pumps. Genome comparisons and single nucleotide polymorphisms divided the isolates based on geographical distribution but revealed high levels of similarity between the strains. Sequence divergence in B. melitensis was mainly due to lateral gene transfer of mobile elements. This is the first report of an in-depth genomic characterization of B. melitensis in Lebanon. © FEMS 2017. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Characterization of the genetic elements required for site-specific integration of plasmid pSE211 in Saccharopolyspora erythraea.

PubMed Central

Brown, D P; Idler, K B; Katz, L

1990-01-01

The 18.1-kilobase plasmid pSE211 integrates into the chromosome of Saccharopolyspora erythraea at a specific attB site. Restriction analysis of the integrated plasmid, pSE211int, and adjacent chromosomal sequences allowed identification of attP, the plasmid attachment site. Nucleotide sequencing of attP, attB, attL, and attR revealed a 57-base-pair sequence common to all sites with no duplications of adjacent plasmid or chromosomal sequences in the integrated state, indicating that integration takes place through conservative, reciprocal strand exchange. An analysis of the sequences indicated the presence of a putative gene for Phe-tRNA at attB which is preserved at attL after integration has occurred. A comparison of the attB site for a number of actinomycete plasmids is presented. Integration at attB was also observed when a 2.4-kilobase segment of pSE211 containing attP and the adjacent plasmid sequence was used to transform a pSE211- host. Nucleotide sequencing of this segment revealed the presence of two complete open reading frames (ORFs) and a segment of a third ORF. The ORF adjacent to attP encodes a putative polypeptide 437 amino acids in length that shows similarity, at its C-terminal domain, to sequences of site-specific recombinases of the integrase family. The adjacent ORF encodes a putative 98-amino-acid basic polypeptide that contains a helix-turn-helix motif at its N terminus which corresponds to domains in the Xis proteins of a number of bacteriophages. A proposal for the function of this polypeptide is presented. The deduced amino acid sequence of the third ORF did not reveal similarities to polypeptide sequences in the current data banks. Images FIG. 2 FIG. 3 PMID:2180909
Diversity and evolutionary patterns of immune genes in free-ranging Namibian leopards (Panthera pardus pardus).

PubMed

Castro-Prieto, Aines; Wachter, Bettina; Melzheimer, Joerg; Thalwitzer, Susanne; Sommer, Simone

2011-01-01

The genes of the major histocompatibility complex (MHC) are a key component of the mammalian immune system and have become important molecular markers for fitness-related genetic variation in wildlife populations. Currently, no information about the MHC sequence variation and constitution in African leopards exists. In this study, we isolated and characterized genetic variation at the adaptively most important region of MHC class I and MHC class II-DRB genes in 25 free-ranging African leopards from Namibia and investigated the mechanisms that generate and maintain MHC polymorphism in the species. Using single-stranded conformation polymorphism analysis and direct sequencing, we detected 6 MHC class I and 6 MHC class II-DRB sequences, which likely correspond to at least 3 MHC class I and 3 MHC class II-DRB loci. Amino acid sequence variation in both MHC classes was higher or similar in comparison to other reported felids. We found signatures of positive selection shaping the diversity of MHC class I and MHC class II-DRB loci during the evolutionary history of the species. A comparison of MHC class I and MHC class II-DRB sequences of the leopard to those of other felids revealed a trans-species mode of evolution. In addition, the evolutionary relationships of MHC class II-DRB sequences between African and Asian leopard subspecies are discussed.
Identification and Resolution of Microdiversity through Metagenomic Sequencing of Parallel Consortia

PubMed Central

Maezato, Yukari; Wu, Yu-Wei; Romine, Margaret F.; Lindemann, Stephen R.

2015-01-01

To gain a predictive understanding of the interspecies interactions within microbial communities that govern community function, the genomic complement of every member population must be determined. Although metagenomic sequencing has enabled the de novo reconstruction of some microbial genomes from environmental communities, microdiversity confounds current genome reconstruction techniques. To overcome this issue, we performed short-read metagenomic sequencing on parallel consortia, defined as consortia cultivated under the same conditions from the same natural community with overlapping species composition. The differences in species abundance between the two consortia allowed reconstruction of near-complete (at an estimated >85% of gene complement) genome sequences for 17 of the 20 detected member species. Two Halomonas spp. indistinguishable by amplicon analysis were found to be present within the community. In addition, comparison of metagenomic reads against the consensus scaffolds revealed within-species variation for one of the Halomonas populations, one of the Rhodobacteraceae populations, and the Rhizobiales population. Genomic comparison of these representative instances of inter- and intraspecies microdiversity suggests differences in functional potential that may result in the expression of distinct roles in the community. In addition, isolation and complete genome sequence determination of six member species allowed an investigation into the sensitivity and specificity of genome reconstruction processes, demonstrating robustness across a wide range of sequence coverage (9× to 2,700×) within the metagenomic data set. PMID:26497460
Identification and Resolution of Microdiversity through Metagenomic Sequencing of Parallel Consortia

DOE Office of Scientific and Technical Information (OSTI.GOV)

Nelson, William C.; Maezato, Yukari; Wu, Yu-Wei

2015-10-23

To gain a predictive understanding of the interspecies interactions within microbial communities that govern community function, the genomic complement of every member population must be determined. Although metagenomic sequencing has enabled thede novoreconstruction of some microbial genomes from environmental communities, microdiversity confounds current genome reconstruction techniques. To overcome this issue, we performed short-read metagenomic sequencing on parallel consortia, defined as consortia cultivated under the same conditions from the same natural community with overlapping species composition. The differences in species abundance between the two consortia allowed reconstruction of near-complete (at an estimated >85% of gene complement) genome sequences for 17 ofmore » the 20 detected member species. TwoHalomonasspp. indistinguishable by amplicon analysis were found to be present within the community. In addition, comparison of metagenomic reads against the consensus scaffolds revealed within-species variation for one of theHalomonaspopulations, one of theRhodobacteraceaepopulations, and theRhizobialespopulation. Genomic comparison of these representative instances of inter- and intraspecies microdiversity suggests differences in functional potential that may result in the expression of distinct roles in the community. In addition, isolation and complete genome sequence determination of six member species allowed an investigation into the sensitivity and specificity of genome reconstruction processes, demonstrating robustness across a wide range of sequence coverage (9× to 2,700×) within the metagenomic data set.« less
Comparative Genome Sequence Analysis of the Bpa/Str Region in Mouse and Man

PubMed Central

Mallon, A.-M.; Platzer, M.; Bate, R.; Gloeckner, G.; Botcherby, M.R.M.; Nordsiek, G.; Strivens, M.A.; Kioschis, P.; Dangel, A.; Cunningham, D.; Straw, R.N.A.; Weston, P.; Gilbert, M.; Fernando, S.; Goodall, K.; Hunter, G.; Greystrong, J.S.; Clarke, D.; Kimberley, C.; Goerdes, M.; Blechschmidt, K.; Rump, A.; Hinzmann, B.; Mundy, C.R.; Miller, W.; Poustka, A.; Herman, G.E.; Rhodes, M.; Denny, P.; Rosenthal, A.; Brown, S.D.M.

2000-01-01

The progress of human and mouse genome sequencing programs presages the possibility of systematic cross-species comparison of the two genomes as a powerful tool for gene and regulatory element identification. As the opportunities to perform comparative sequence analysis emerge, it is important to develop parameters for such analyses and to examine the outcomes of cross-species comparison. Our analysis used gene prediction and a database search of 430 kb of genomic sequence covering the Bpa/Str region of the mouse X chromosome, and 745 kb of genomic sequence from the homologous human X chromosome region. We identified 11 genes in mouse and 13 genes and two pseudogenes in human. In addition, we compared the mouse and human sequences using pairwise alignment and searches for evolutionary conserved regions (ECRs) exceeding a defined threshold of sequence identity. This approach aided the identification of at least four further putative conserved genes in the region. Comparative sequencing revealed that this region is a mosaic in evolutionary terms, with considerably more rearrangement between the two species than realized previously from comparative mapping studies. Surprisingly, this region showed an extremely high LINE and low SINE content, low G+C content, and yet a relatively high gene density, in contrast to the low gene density usually associated with such regions. [The sequence data described in this paper have been submitted to EMBL under the following accession nos.: Mouse Genomic Sequence: Mouse contig A (AL021127), Mouse contig B (AL049866), BAC41M10 (AL136328), PAC303O11(AL136329). Human Genomic Sequence: Human contig 1 (U82671, U82670), Human contig 2 (U82695).] PMID:10854409
Partial Shotgun Sequencing of the Boechera stricta Genome Reveals Extensive Microsynteny and Promoter Conservation with Arabidopsis1[W

PubMed Central

Windsor, Aaron J.; Schranz, M. Eric; Formanová, Nataša; Gebauer-Jung, Steffi; Bishop, John G.; Schnabelrauch, Domenica; Kroymann, Juergen; Mitchell-Olds, Thomas

2006-01-01

Comparative genomics provides insight into the evolutionary dynamics that shape discrete sequences as well as whole genomes. To advance comparative genomics within the Brassicaceae, we have end sequenced 23,136 medium-sized insert clones from Boechera stricta, a wild relative of Arabidopsis (Arabidopsis thaliana). A significant proportion of these sequences, 18,797, are nonredundant and display highly significant similarity (BLASTn e-value ≤ 10−30) to low copy number Arabidopsis genomic regions, including more than 9,000 annotated coding sequences. We have used this dataset to identify orthologous gene pairs in the two species and to perform a global comparison of DNA regions 5′ to annotated coding regions. On average, the 500 nucleotides upstream to coding sequences display 71.4% identity between the two species. In a similar analysis, 61.4% identity was observed between 5′ noncoding sequences of Brassica oleracea and Arabidopsis, indicating that regulatory regions are not as diverged among these lineages as previously anticipated. By mapping the B. stricta end sequences onto the Arabidopsis genome, we have identified nearly 2,000 conserved blocks of microsynteny (bracketing 26% of the Arabidopsis genome). A comparison of fully sequenced B. stricta inserts to their homologous Arabidopsis genomic regions indicates that indel polymorphisms >5 kb contribute substantially to the genome size difference observed between the two species. Further, we demonstrate that microsynteny inferred from end-sequence data can be applied to the rapid identification and cloning of genomic regions of interest from nonmodel species. These results suggest that among diploid relatives of Arabidopsis, small- to medium-scale shotgun sequencing approaches can provide rapid and cost-effective benefits to evolutionary and/or functional comparative genomic frameworks. PMID:16607030
GTF2IRD2 is located in the Williams–Beuren syndrome critical region 7q11.23 and encodes a protein with two TFII-I-like helix–loop–helix repeats

PubMed Central

Makeyev, Aleksandr V.; Erdenechimeg, Lkhamsuren; Mungunsukh, Ognoon; Roth, Jutta J.; Enkhmandakh, Badam; Ruddle, Frank H.; Bayarsaihan, Dashzeveg

2004-01-01

Williams–Beuren syndrome (also known as Williams syndrome) is caused by a deletion of a 1.55- to 1.84-megabase region from chromosome band 7q11.23. GTF2IRD1 and GTF2I, located within this critical region, encode proteins of the TFII-I family with multiple helix–loop–helix domains known as I repeats. In the present work, we characterize a third member, GTF2IRD2, which has sequence and structural similarity to the GTF2I and GTF2IRD1 paralogs. The ORF encodes a protein with several features characteristic of regulatory factors, including two I repeats, two leucine zippers, and a single Cys-2/His-2 zinc finger. The genomic organization of human, baboon, rat, and mouse genes is well conserved. Our exon-by-exon comparison has revealed that GTF2IRD2 is more closely related to GTF2I than to GTF2IRD1 and apparently is derived from the GTF2I sequence. The comparison of GTF2I and GTF2IRD2 genes revealed two distinct regions of homology, indicating that the helix–loop–helix domain structure of the GTF2IRD2 gene has been generated by two independent genomic duplications. We speculate that GTF2I is derived from GTF2IRD1 as a result of local duplication and the further evolution of its structure was associated with its functional specialization. Comparison of genomic sequences surrounding GTF2IRD2 genes in mice and humans allows refinement of the centromeric breakpoint position of the primate-specific inversion within the Williams–Beuren syndrome critical region. PMID:15243160
GTF2IRD2 is located in the Williams-Beuren syndrome critical region 7q11.23 and encodes a protein with two TFII-I-like helix-loop-helix repeats.

PubMed

Makeyev, Aleksandr V; Erdenechimeg, Lkhamsuren; Mungunsukh, Ognoon; Roth, Jutta J; Enkhmandakh, Badam; Ruddle, Frank H; Bayarsaihan, Dashzeveg

2004-07-27

Williams-Beuren syndrome (also known as Williams syndrome) is caused by a deletion of a 1.55- to 1.84-megabase region from chromosome band 7q11.23. GTF2IRD1 and GTF2I, located within this critical region, encode proteins of the TFII-I family with multiple helix-loop-helix domains known as I repeats. In the present work, we characterize a third member, GTF2IRD2, which has sequence and structural similarity to the GTF2I and GTF2IRD1 paralogs. The ORF encodes a protein with several features characteristic of regulatory factors, including two I repeats, two leucine zippers, and a single Cys-2/His-2 zinc finger. The genomic organization of human, baboon, rat, and mouse genes is well conserved. Our exon-by-exon comparison has revealed that GTF2IRD2 is more closely related to GTF2I than to GTF2IRD1 and apparently is derived from the GTF2I sequence. The comparison of GTF2I and GTF2IRD2 genes revealed two distinct regions of homology, indicating that the helix-loop-helix domain structure of the GTF2IRD2 gene has been generated by two independent genomic duplications. We speculate that GTF2I is derived from GTF2IRD1 as a result of local duplication and the further evolution of its structure was associated with its functional specialization. Comparison of genomic sequences surrounding GTF2IRD2 genes in mice and humans allows refinement of the centromeric breakpoint position of the primate-specific inversion within the Williams-Beuren syndrome critical region.

Sequence stratigraphy of the Triassic in the Barentsz Sea

DOE Office of Scientific and Technical Information (OSTI.GOV)

Skjold, L.JU.; Van Veen, P.M.; Gjelberg, J.

1990-05-01

A regional study of the Triassic in the Barentsz Sea (20-32{degree}E, 71-74{degree}N) revealed sequences that correlate seismically for hundreds of kilometers. Recent offshore drilling results enabled them to establish a biostratigraphic time framework. Comparisons with information from onshore outcrops (such as the Svalbard Archipelago) aided the piecing together of these superregional sequences. Seismic character analysis identified three units with composite progradational patterns (Induan, Olenekian, and Anisian). Fluvial, deltaic, and marine deposits can be distinguished and located relative to the paleocoastlines. Corresponding downlap surfaces suggest the development of condensed intervals, predicted to consist of organic-rich source rocks, as was later confirmedmore » by drilling. Regional predictions based on this sequence-stratigraphic approach have proved valuable when correlating and evaluating well information. The sequences identified also help define third-order sea level curves for the area; these improve published curves thought to have global significance.« less
Wide distribution of O157-antigen biosynthesis gene clusters in Escherichia coli.

PubMed

Iguchi, Atsushi; Shirai, Hiroki; Seto, Kazuko; Ooka, Tadasuke; Ogura, Yoshitoshi; Hayashi, Tetsuya; Osawa, Kayo; Osawa, Ro

2011-01-01

Most Escherichia coli O157-serogroup strains are classified as enterohemorrhagic E. coli (EHEC), which is known as an important food-borne pathogen for humans. They usually produce Shiga toxin (Stx) 1 and/or Stx2, and express H7-flagella antigen (or nonmotile). However, O157 strains that do not produce Stxs and express H antigens different from H7 are sometimes isolated from clinical and other sources. Multilocus sequence analysis revealed that these 21 O157:non-H7 strains tested in this study belong to multiple evolutionary lineages different from that of EHEC O157:H7 strains, suggesting a wide distribution of the gene set encoding the O157-antigen biosynthesis in multiple lineages. To gain insight into the gene organization and the sequence similarity of the O157-antigen biosynthesis gene clusters, we conducted genomic comparisons of the chromosomal regions (about 59 kb in each strain) covering the O-antigen gene cluster and its flanking regions between six O157:H7/non-H7 strains. Gene organization of the O157-antigen gene cluster was identical among O157:H7/non-H7 strains, but was divided into two distinct types at the nucleotide sequence level. Interestingly, distribution of the two types did not clearly follow the evolutionary lineages of the strains, suggesting that horizontal gene transfer of both types of O157-antigen gene clusters has occurred independently among E. coli strains. Additionally, detailed sequence comparison revealed that some positions of the repetitive extragenic palindromic (REP) sequences in the regions flanking the O-antigen gene clusters were coincident with possible recombination points. From these results, we conclude that the horizontal transfer of the O157-antigen gene clusters induced the emergence of multiple O157 lineages within E. coli and speculate that REP sequences may involve one of the driving forces for exchange and evolution of O-antigen loci.
Analysis of a native whitefly transcriptome and its sequence divergence with two invasive whitefly species.

PubMed

Wang, Xiao-Wei; Zhao, Qiong-Yi; Luan, Jun-Bo; Wang, Yu-Jun; Yan, Gen-Hong; Liu, Shu-Sheng

2012-10-04

Genomic divergence between invasive and native species may provide insight into the molecular basis underlying specific characteristics that drive the invasion and displacement of closely related species. In this study, we sequenced the transcriptome of an indigenous species, Asia II 3, of the Bemisia tabaci complex and compared its genetic divergence with the transcriptomes of two invasive whiteflies species, Middle East Asia Minor 1 (MEAM1) and Mediterranean (MED), respectively. More than 16 million reads of 74 base pairs in length were obtained for the Asia II 3 species using the Illumina sequencing platform. These reads were assembled into 52,535 distinct sequences (mean size: 466 bp) and 16,596 sequences were annotated with an E-value above 10-5. Protein family comparisons revealed obvious diversification among the transcriptomes of these species suggesting species-specific adaptations during whitefly evolution. On the contrary, substantial conservation of the whitefly transcriptomes was also evident, despite their differences. The overall divergence of coding sequences between the orthologous gene pairs of Asia II 3 and MEAM1 is 1.73%, which is comparable to the average divergence of Asia II 3 and MED transcriptomes (1.84%) and much higher than that of MEAM1 and MED (0.83%). This is consistent with the previous phylogenetic analyses and crossing experiments suggesting these are distinct species. We also identified hundreds of highly diverged genes and compiled sequence identify data into gene functional groups and found the most divergent gene classes are Cytochrome P450, Glutathione metabolism and Oxidative phosphorylation. These results strongly suggest that the divergence of genes related to metabolism might be the driving force of the MEAM1 and Asia II 3 differentiation. We also analyzed single nucleotide polymorphisms within the orthologous gene pairs of indigenous and invasive whiteflies which are helpful for the investigation of association between allelic and phenotypes. Our data present the most comprehensive sequences for the indigenous whitefly species Asia II 3. The extensive comparisons of Asia II 3, MEAM1 and MED transcriptomes will serve as an invaluable resource for revealing the genetic basis of whitefly invasion and the molecular mechanisms underlying their biological differences.
Analysis of a native whitefly transcriptome and its sequence divergence with two invasive whitefly species

PubMed Central

2012-01-01

Background Genomic divergence between invasive and native species may provide insight into the molecular basis underlying specific characteristics that drive the invasion and displacement of closely related species. In this study, we sequenced the transcriptome of an indigenous species, Asia II 3, of the Bemisia tabaci complex and compared its genetic divergence with the transcriptomes of two invasive whiteflies species, Middle East Asia Minor 1 (MEAM1) and Mediterranean (MED), respectively. Results More than 16 million reads of 74 base pairs in length were obtained for the Asia II 3 species using the Illumina sequencing platform. These reads were assembled into 52,535 distinct sequences (mean size: 466 bp) and 16,596 sequences were annotated with an E-value above 10-5. Protein family comparisons revealed obvious diversification among the transcriptomes of these species suggesting species-specific adaptations during whitefly evolution. On the contrary, substantial conservation of the whitefly transcriptomes was also evident, despite their differences. The overall divergence of coding sequences between the orthologous gene pairs of Asia II 3 and MEAM1 is 1.73%, which is comparable to the average divergence of Asia II 3 and MED transcriptomes (1.84%) and much higher than that of MEAM1 and MED (0.83%). This is consistent with the previous phylogenetic analyses and crossing experiments suggesting these are distinct species. We also identified hundreds of highly diverged genes and compiled sequence identify data into gene functional groups and found the most divergent gene classes are Cytochrome P450, Glutathione metabolism and Oxidative phosphorylation. These results strongly suggest that the divergence of genes related to metabolism might be the driving force of the MEAM1 and Asia II 3 differentiation. We also analyzed single nucleotide polymorphisms within the orthologous gene pairs of indigenous and invasive whiteflies which are helpful for the investigation of association between allelic and phenotypes. Conclusions Our data present the most comprehensive sequences for the indigenous whitefly species Asia II 3. The extensive comparisons of Asia II 3, MEAM1 and MED transcriptomes will serve as an invaluable resource for revealing the genetic basis of whitefly invasion and the molecular mechanisms underlying their biological differences. PMID:23036081
Genetic diversity of Clostridium perfringens type A isolates from animals, food poisoning outbreaks and sludge

PubMed Central

Johansson, Anders; Aspan, Anna; Bagge, Elisabeth; Båverud, Viveca; Engström, Björn E; Johansson, Karl-Erik

2006-01-01

Background Clostridium perfringens, a serious pathogen, causes enteric diseases in domestic animals and food poisoning in humans. The epidemiological relationship between C. perfringens isolates from the same source has previously been investigated chiefly by pulsed-field gel electrophoresis (PFGE). In this study the genetic diversity of C. perfringens isolated from various animals, from food poisoning outbreaks and from sludge was investigated. Results We used PFGE to examine the genetic diversity of 95 C. perfringens type A isolates from eight different sources. The isolates were also examined for the presence of the beta2 toxin gene (cpb2) and the enterotoxin gene (cpe). The cpb2 gene from the 28 cpb2-positive isolates was also partially sequenced (519 bp, corresponding to positions 188 to 706 in the consensus cpb2 sequence). The results of PFGE revealed a wide genetic diversity among the C. perfringens type A isolates. The genetic relatedness of the isolates ranged from 58 to 100% and 56 distinct PFGE types were identified. Almost all clusters with similar patterns comprised isolates with a known epidemiological correlation. Most of the isolates from pig, horse and sheep carried the cpb2 gene. All isolates originating from food poisoning outbreaks carried the cpe gene and three of these also carried cpb2. Two evolutionary different populations were identified by sequence analysis of the partially sequenced cpb2 genes from our study and cpb2 sequences previously deposited in GenBank. Conclusion As revealed by PFGE, there was a wide genetic diversity among C. perfringens isolates from different sources. Epidemiologically related isolates showed a high genetic similarity, as expected, while isolates with no obvious epidemiological relationship expressed a lesser degree of genetic similarity. The wide diversity revealed by PFGE was not reflected in the 16S rRNA sequences, which had a considerable degree of sequence similarity. Sequence comparison of the partially sequenced cpb2 gene revealed two genetically different populations. This is to our knowledge the first study in which the genetic diversity of C. perfringens isolates both from different animals species, from food poisoning outbreaks and from sludge has been investigated. PMID:16737528
Archaebacterial rhodopsin sequences: Implications for evolution

NASA Technical Reports Server (NTRS)

Lanyi, J. K.

1991-01-01

It was proposed over 10 years ago that the archaebacteria represent a separate kingdom which diverged very early from the eubacteria and eukaryotes. It follows that investigations of archaebacterial characteristics might reveal features of early evolution. So far, two genes, one for bacteriorhodopsin and another for halorhodopsin, both from Halobacterium halobium, have been sequenced. We cloned and sequenced the gene coding for the polypeptide of another one of these rhodopsins, a halorhodopsin in Natronobacterium pharaonis. Peptide sequencing of cyanogen bromide fragments, and immuno-reactions of the protein and synthetic peptides derived from the C-terminal gene sequence, confirmed that the open reading frame was the structural gene for the pharaonis halorhodopsin polypeptide. The flanking DNA sequences of this gene, as well as those of other bacterial rhodopsins, were compared to previously proposed archaebacterial consensus sequences. In pairwise comparisons of the open reading frame with DNA sequences for bacterio-opsin and halo-opsin from Halobacterium halobium, silent divergences were calculated. These indicate very considerable evolutionary distance between each pair of genes, even in the dame organism. In spite of this, three protein sequences show extensive similarities, indicating strong selective pressures.
Sequence Diversity Diagram for comparative analysis of multiple sequence alignments.

PubMed

Sakai, Ryo; Aerts, Jan

2014-01-01

The sequence logo is a graphical representation of a set of aligned sequences, commonly used to depict conservation of amino acid or nucleotide sequences. Although it effectively communicates the amount of information present at every position, this visual representation falls short when the domain task is to compare between two or more sets of aligned sequences. We present a new visual presentation called a Sequence Diversity Diagram and validate our design choices with a case study. Our software was developed using the open-source program called Processing. It loads multiple sequence alignment FASTA files and a configuration file, which can be modified as needed to change the visualization. The redesigned figure improves on the visual comparison of two or more sets, and it additionally encodes information on sequential position conservation. In our case study of the adenylate kinase lid domain, the Sequence Diversity Diagram reveals unexpected patterns and new insights, for example the identification of subgroups within the protein subfamily. Our future work will integrate this visual encoding into interactive visualization tools to support higher level data exploration tasks.
Sequence analysis of the phs operon in Salmonella typhimurium and the contribution of thiosulfate reduction to anaerobic energy metabolism.

PubMed Central

Heinzinger, N K; Fujimoto, S Y; Clark, M A; Moreno, M S; Barrett, E L

1995-01-01

The phs chromosomal locus of Salmonella typhimurium is essential for the dissimilatory anaerobic reduction of thiosulfate to hydrogen sulfide. Sequence analysis of the phs region revealed a functional operon with three open reading frames, designated phsA, phsB, and phsC, which encode peptides of 82.7, 21.3, and 28.5 kDa, respectively. The predicted products of phsA and phsB exhibited significant homology with the catalytic and electron transfer subunits of several other anaerobic molybdoprotein oxidoreductases, including Escherichia coli dimethyl sulfoxide reductase, nitrate reductase, and formate dehydrogenase. Simultaneous comparison of PhsA to seven homologous molybdoproteins revealed numerous similarities among all eight throughout the entire frame, hence, significant amino acid conservation among molybdoprotein oxidoreductases. Comparison of PhsB to six other homologous sequences revealed four highly conserved iron-sulfur clusters. The predicted phsC product was highly hydrophobic and similar in size to the hydrophobic subunits of the molybdoprotein oxidoreductases containing subunits homologous to phsA and phsB. Thus, phsABC appears to encode thiosulfate reductase. Single-copy phs-lac translational fusions required both anaerobiosis and thiosulfate for full expression, whereas multicopy phs-lac translational fusions responded to either thiosulfate or anaerobiosis, suggesting that oxygen and thiosulfate control of phs involves negative regulation. A possible role for thiosulfate reduction in anaerobic respiration was examined. Thiosulfate did not significantly augment the final densities of anaerobic cultures grown on any of the 18 carbon sources tested. on the other hand, washed stationary-phase cells depleted of ATP were shown to synthesize small amounts of ATP on the addition of the formate and thiosulfate, suggesting that the thiosulfate reduction plays a unique role in anaerobic energy conservation by S typhimurium. PMID:7751291
alpha-Tubulin of Histriculus cavicola (Ciliophora; Hypotrichea).

PubMed

Pérez-Romero, P; Villalobo, E; Díaz-Ramos, C; Calvo, P; Santos-Rosa, F; Torres, A

1997-03-01

An alpha-tubulin gene fragment amplified by PCR from the hypotrichous ciliate Histriculus cavicola has been sequenced. This fragment, 1,182 bp long, contains an in-frame "stop" codon (UAA), which in other hypotrichous species codes for a glutamine residue. The comparison of the alpha-tubulin genes from several ciliates classes have revealed amino acid positions which could serve to distinguish these taxonomic groups.
Genetic differences between two strains of Xylella fastidiosa revealed by suppression subtractive hybridization.

PubMed

Harakava, Ricardo; Gabriel, Dean W

2003-02-01

Suppression subtractive hybridization was used to rapidly identify 18 gene differences between a citrus variegated chlorosis (CVC) strain and a Pierce's disease of grape (PD) strain of Xylella fastidiosa. The results were validated as being highly representative of actual differences by comparison of the completely sequenced genome of a CVC strain with that of a PD strain.
The primary structures of ribosomal proteins L16, L23 and L33 from the archaebacterium Halobacterium marismortui.

PubMed

Hatakeyama, T; Hatakeyama, T; Kimura, M

1988-11-21

The complete amino acid sequences of ribosomal proteins L16, L23 and L33 from the archaebacterium Halobacterium marismortui were determined. The sequences were established by manual sequencing of peptides produced with several proteases as well as by cleavage with dilute HCl. Proteins L16, L23 and L33 consist of 119, 154 and 69 amino acid residues, and their molecular masses are 13,538, 16,812 and 7620 Da, respectively. The comparison of their sequences with those of ribosomal proteins from other organisms revealed that L23 and L33 are related to eubacterial ribosomal proteins from Escherichia coli and Bacillus stearothermophilus, while protein L16 was found to be homologous to a eukaryotic ribosomal protein from yeast. These results provide information about the special phylogenetic position of archaebacteria.
The heterogeneity of human CD127(+) innate lymphoid cells revealed by single-cell RNA sequencing.

PubMed

Björklund, Åsa K; Forkel, Marianne; Picelli, Simone; Konya, Viktoria; Theorell, Jakob; Friberg, Danielle; Sandberg, Rickard; Mjösberg, Jenny

2016-04-01

Innate lymphoid cells (ILCs) are increasingly appreciated as important participants in homeostasis and inflammation. Substantial plasticity and heterogeneity among ILC populations have been reported. Here we have delineated the heterogeneity of human ILCs through single-cell RNA sequencing of several hundreds of individual tonsil CD127(+) ILCs and natural killer (NK) cells. Unbiased transcriptional clustering revealed four distinct populations, corresponding to ILC1 cells, ILC2 cells, ILC3 cells and NK cells, with their respective transcriptomes recapitulating known as well as unknown transcriptional profiles. The single-cell resolution additionally divulged three transcriptionally and functionally diverse subpopulations of ILC3 cells. Our systematic comparison of single-cell transcriptional variation within and between ILC populations provides new insight into ILC biology during homeostasis, with additional implications for dysregulation of the immune system.
A specific indel marker for the Philippines Schistosoma japonicum revealed by analysis of mitochondrial genome sequences.

PubMed

Li, Juan; Chen, Fen; Sugiyama, Hiromu; Blair, David; Lin, Rui-Qing; Zhu, Xing-Quan

2015-07-01

In the present study, near-complete mitochondrial (mt) genome sequences for Schistosoma japonicum from different regions in the Philippines and Japan were amplified and sequenced. Comparisons among S. japonicum from the Philippines, Japan, and China revealed a geographically based length difference in mt genomes, but the mt genomic organization and gene arrangement were the same. Sequence differences among samples from the Philippines and all samples from the three endemic areas were 0.57-2.12 and 0.76-3.85 %, respectively. The most variable part of the mt genome was the non-coding region. In the coding portion of the genome, protein-coding genes varied more than rRNA genes and tRNAs. The near-complete mt genome sequences for Philippine specimens were identical in length (14,091 bp) which was 4 bp longer than those of S. japonicum samples from Japan and China. This indel provides a unique genetic marker for S. japonicum samples from the Philippines. Phylogenetic analyses based on the concatenated amino acids of 12 protein-coding genes showed that samples of S. japonicum clustered according to their geographical origins. The identified mitochondrial indel marker will be useful for tracing the source of S. japonicum infection in humans and animals in Southeast Asia.
Structure of the human protein kinase MPSK1 reveals an atypical activation loop architecture.

PubMed

Eswaran, Jeyanthy; Bernad, Antonio; Ligos, Jose M; Guinea, Barbara; Debreczeni, Judit E; Sobott, Frank; Parker, Sirlester A; Najmanovich, Rafael; Turk, Benjamin E; Knapp, Stefan

2008-01-01

The activation segment of protein kinases is structurally highly conserved and central to regulation of kinase activation. Here we report an atypical activation segment architecture in human MPSK1 comprising a beta sheet and a large alpha-helical insertion. Sequence comparisons suggested that similar activation segments exist in all members of the MPSK1 family and in MAST kinases. The consequence of this nonclassical activation segment on substrate recognition was studied using peptide library screens that revealed a preferred substrate sequence of X-X-P/V/I-phi-H/Y-T*-N/G-X-X-X (phi is an aliphatic residue). In addition, we identified the GTPase DRG1 as an MPSK1 interaction partner and specific substrate. The interaction domain in DRG1 was mapped to the N terminus, leading to recruitment and phosphorylation at Thr100 within the GTPase domain. The presented data reveal an atypical kinase structural motif and suggest a role of MPSK1 regulating DRG1, a GTPase involved in regulation of cellular growth.
Phylogenetic analysis of several Thermus strains from Rehai of Tengchong, Yunnan, China.

PubMed

Lin, Lianbing; Zhang, Jie; Wei, Yunlin; Chen, Chaoyin; Peng, Qian

2005-10-01

Several Thermus strains were isolated from 10 hot springs of the Rehai geothermal area in Tengchong, Yunnan province. The diversity of Thermus strains was examined by sequencing the 16S rRNA genes and comparing their sequences. Phylogenetic analysis showed that the 16S rDNA sequences from the Rehai geothermal isolates form four branches in the phylogenetic tree and had greater than 95.9% similarity in the phylogroup. Secondary structure comparison also indicated that the 16S rRNA from the Rehai geothermal isolates have unique secondary structure characteristics in helix 6, helix 9, and helix 10 (reference to Escherichia coli). This research is the first attempt to reveal the diversity of Thermus strains that are distributed in the Rehai geothermal area.
The genome sequence and transcriptome of Potentilla micrantha and their comparison to Fragaria vesca (the woodland strawberry)

PubMed Central

Moretto, Marco; Barghini, Elena; Mascagni, Flavia; Natali, Lucia; Brilli, Matteo; Lomsadze, Alexandre; Sonego, Paolo; Giongo, Lara; Alonge, Michael; Velasco, Riccardo; Varotto, Claudio; Šurbanovski, Nada; Borodovsky, Mark; Ward, Judson A; Engelen, Kristof; Cavallini, Andrea; Cestaro, Alessandro

2018-01-01

Abstract Background The genus Potentilla is closely related to that of Fragaria, the economically important strawberry genus. Potentilla micrantha is a species that does not develop berries but shares numerous morphological and ecological characteristics with Fragaria vesca. These similarities make P. micrantha an attractive choice for comparative genomics studies with F. vesca. Findings In this study, the P. micrantha genome was sequenced and annotated, and RNA-Seq data from the different developmental stages of flowering and fruiting were used to develop a set of gene predictions. A 327 Mbp sequence and annotation of the genome of P. micrantha, spanning 2674 sequence contigs, with an N50 size of 335,712, estimated to cover 80% of the total genome size of the species was developed. The genus Potentilla has a characteristically larger genome size than Fragaria, but the recovered sequence scaffolds were remarkably collinear at the micro-syntenic level with the genome of F. vesca, its closest sequenced relative. A total of 33,602 genes were predicted, and 95.1% of bench-marking universal single-copy orthologous genes were complete within the presented sequence. Thus, we argue that the majority of the gene-rich regions of the genome have been sequenced. Conclusions Comparisons of RNA-Seq data from the stages of floral and fruit development revealed genes differentially expressed between P. micrantha and F. vesca.The data presented are a valuable resource for future studies of berry development in Fragaria and the Rosaceae and they also shed light on the evolution of genome size and organization in this family. PMID:29659812
The genome sequence and transcriptome of Potentilla micrantha and their comparison to Fragaria vesca (the woodland strawberry).

PubMed

Buti, Matteo; Moretto, Marco; Barghini, Elena; Mascagni, Flavia; Natali, Lucia; Brilli, Matteo; Lomsadze, Alexandre; Sonego, Paolo; Giongo, Lara; Alonge, Michael; Velasco, Riccardo; Varotto, Claudio; Šurbanovski, Nada; Borodovsky, Mark; Ward, Judson A; Engelen, Kristof; Cavallini, Andrea; Cestaro, Alessandro; Sargent, Daniel James

2018-04-01

The genus Potentilla is closely related to that of Fragaria, the economically important strawberry genus. Potentilla micrantha is a species that does not develop berries but shares numerous morphological and ecological characteristics with Fragaria vesca. These similarities make P. micrantha an attractive choice for comparative genomics studies with F. vesca. In this study, the P. micrantha genome was sequenced and annotated, and RNA-Seq data from the different developmental stages of flowering and fruiting were used to develop a set of gene predictions. A 327 Mbp sequence and annotation of the genome of P. micrantha, spanning 2674 sequence contigs, with an N50 size of 335,712, estimated to cover 80% of the total genome size of the species was developed. The genus Potentilla has a characteristically larger genome size than Fragaria, but the recovered sequence scaffolds were remarkably collinear at the micro-syntenic level with the genome of F. vesca, its closest sequenced relative. A total of 33,602 genes were predicted, and 95.1% of bench-marking universal single-copy orthologous genes were complete within the presented sequence. Thus, we argue that the majority of the gene-rich regions of the genome have been sequenced. Comparisons of RNA-Seq data from the stages of floral and fruit development revealed genes differentially expressed between P. micrantha and F. vesca.The data presented are a valuable resource for future studies of berry development in Fragaria and the Rosaceae and they also shed light on the evolution of genome size and organization in this family.
Dynamical differences of hemoglobin and the ionotropic glutamate receptor in different states revealed by a new dynamics alignment method.

PubMed

Tobi, Dror

2017-08-01

A new algorithm for comparison of protein dynamics is presented. Compared protein structures are superposed and their modes of motions are calculated using the anisotropic network model. The obtained modes are aligned using the dynamic programming algorithm of Needleman and Wunsch, commonly used for sequence alignment. Dynamical comparison of hemoglobin in the T and R2 states reveals that the dynamics of the allosteric effector 2,3-bisphosphoglycerate binding site is different in the two states. These differences can contribute to the selectivity of the effector to the T state. Similar comparison of the ionotropic glutamate receptor in the kainate+(R,R)-2b and ZK bound states reveals that the kainate+(R,R)-2b bound states slow modes describe upward motions of ligand binding domain and the transmembrane domain regions. Such motions may lead to the opening of the receptor. The upper lobes of the LBDs of the ZK bound state have a smaller interface with the amino terminal domains above them and have a better ability to move together. The present study exemplifies the use of dynamics comparison as a tool to study protein function. Proteins 2017; 85:1507-1517. © 2014 Wiley Periodicals, Inc. © 2017 Wiley Periodicals, Inc.
Enhanced arbovirus surveillance with deep sequencing: Identification of novel rhabdoviruses and bunyaviruses in Australian mosquitoes.

PubMed

Coffey, Lark L; Page, Brady L; Greninger, Alexander L; Herring, Belinda L; Russell, Richard C; Doggett, Stephen L; Haniotis, John; Wang, Chunlin; Deng, Xutao; Delwart, Eric L

2014-01-05

Viral metagenomics characterizes known and identifies unknown viruses based on sequence similarities to any previously sequenced viral genomes. A metagenomics approach was used to identify virus sequences in Australian mosquitoes causing cytopathic effects in inoculated mammalian cell cultures. Sequence comparisons revealed strains of Liao Ning virus (Reovirus, Seadornavirus), previously detected only in China, livestock-infecting Stretch Lagoon virus (Reovirus, Orbivirus), two novel dimarhabdoviruses, named Beaumont and North Creek viruses, and two novel orthobunyaviruses, named Murrumbidgee and Salt Ash viruses. The novel virus proteomes diverged by ≥ 50% relative to their closest previously genetically characterized viral relatives. Deep sequencing also generated genomes of Warrego and Wallal viruses, orbiviruses linked to kangaroo blindness, whose genomes had not been fully characterized. This study highlights viral metagenomics in concert with traditional arbovirus surveillance to characterize known and new arboviruses in field-collected mosquitoes. Follow-up epidemiological studies are required to determine whether the novel viruses infect humans. © 2013 Elsevier Inc. All rights reserved.
Comparison of the nucleotide and amino acid sequences of the RsrI and EcoRI restriction endonucleases.

PubMed

Stephenson, F H; Ballard, B T; Boyer, H W; Rosenberg, J M; Greene, P J

1989-12-21

The RsrI endonuclease, a type-II restriction endonuclease (ENase) found in Rhodobacter sphaeroides, is an isoschizomer of the EcoRI ENase. A clone containing an 11-kb BamHI fragment was isolated from an R. sphaeroides genomic DNA library by hybridization with synthetic oligodeoxyribonucleotide probes based on the N-terminal amino acid (aa) sequence of RsrI. Extracts of E. coli containing a subclone of the 11-kb fragment display RsrI activity. Nucleotide sequence analysis reveals an 831-bp open reading frame encoding a polypeptide of 277 aa. A 50% identity exists within a 266-aa overlap between the deduced aa sequences of RsrI and EcoRI. Regions of 75-100% aa sequence identity correspond to key structural and functional regions of EcoRI. The type-II ENases have many common properties, and a common origin might have been expected. Nevertheless, this is the first demonstration of aa sequence similarity between ENases produced by different organisms.

Harnessing Whole Genome Sequencing in Medical Mycology.

PubMed

Cuomo, Christina A

2017-01-01

Comparative genome sequencing studies of human fungal pathogens enable identification of genes and variants associated with virulence and drug resistance. This review describes current approaches, resources, and advances in applying whole genome sequencing to study clinically important fungal pathogens. Genomes for some important fungal pathogens were only recently assembled, revealing gene family expansions in many species and extreme gene loss in one obligate species. The scale and scope of species sequenced is rapidly expanding, leveraging technological advances to assemble and annotate genomes with higher precision. By using iteratively improved reference assemblies or those generated de novo for new species, recent studies have compared the sequence of isolates representing populations or clinical cohorts. Whole genome approaches provide the resolution necessary for comparison of closely related isolates, for example, in the analysis of outbreaks or sampled across time within a single host. Genomic analysis of fungal pathogens has enabled both basic research and diagnostic studies. The increased scale of sequencing can be applied across populations, and new metagenomic methods allow direct analysis of complex samples.
Polypeptide p41 of a Norwalk-Like Virus Is a Nucleic Acid-Independent Nucleoside Triphosphatase

PubMed Central

Pfister, Thomas; Wimmer, Eckard

2001-01-01

Southampton virus (SHV) is a member of the Norwalk-like viruses (NLVs), one of four genera of the family Caliciviridae. The genome of SHV contains three open reading frames (ORFs). ORF 1 encodes a polyprotein that is autocatalytically processed into six proteins, one of which is p41. p41 shares sequence motifs with protein 2C of picornaviruses and superfamily 3 helicases. We have expressed p41 of SHV in bacteria. Purified p41 exhibited nucleoside triphosphate (NTP)-binding and NTP hydrolysis activities. The NTPase activity was not stimulated by single-stranded nucleic acids. SHV p41 had no detectable helicase activity. Protein sequence comparison between the consensus sequences of NLV p41 and enterovirus protein 2C revealed regions of high similarity. According to secondary structure prediction, the conserved regions were located within a putative central domain of alpha helices and beta strands. This study reveals for the first time an NTPase activity associated with a calicivirus-encoded protein. Based on enzymatic properties and sequence information, a functional relationship between NLV p41 and enterovirus 2C is discussed in regard to the role of 2C-like proteins in virus replication. PMID:11160659
Allelic diversity of the MHC class II DRB genes in brown bears (Ursus arctos) and a comparison of DRB sequences within the family Ursidae.

PubMed

Goda, N; Mano, T; Kosintsev, P; Vorobiev, A; Masuda, R

2010-11-01

The allelic diversity of the DRB locus in major histocompatibility complex (MHC) genes was analyzed in the brown bear (Ursus arctos) from the Hokkaido Island of Japan, Siberia, and Kodiak of Alaska. Nineteen alleles of the DRB exon 2 were identified from a total of 38 individuals of U. arctos and were highly polymorphic. Comparisons of non-synonymous and synonymous substitutions in the antigen-binding sites of deduced amino acid sequences indicated evidence for balancing selection on the bear DRB locus. The phylogenetic analysis of the DRB alleles among three genera (Ursus, Tremarctos, and Ailuropoda) in the family Ursidae revealed that DRB allelic lineages were not separated according to species. This strongly shows trans-species persistence of DRB alleles within the Ursidae. © 2010 John Wiley & Sons A/S.
Complete Genome Sequence of Pig-Tailed Macaque Rhadinovirus 2 and Its Evolutionary Relationship with Rhesus Macaque Rhadinovirus and Human Herpesvirus 8/Kaposi's Sarcoma-Associated Herpesvirus

PubMed Central

Bruce, A. Gregory; Thouless, Margaret E.; Haines, Anthony S.; Pallen, Mark J.; Grundhoff, Adam

2015-01-01

ABSTRACT Two rhadinovirus lineages have been identified in Old World primates. The rhadinovirus 1 (RV1) lineage consists of human herpesvirus 8, Kaposi's sarcoma-associated herpesvirus (KSHV), and closely related rhadinoviruses of chimpanzees, gorillas, macaques and other Old World primates. The RV2 rhadinovirus lineage is distinct and consists of closely related viruses from the same Old World primate species. Rhesus macaque rhadinovirus (RRV) is the RV2 prototype, and two RRV isolates, 26-95 and 17577, were sequenced. We determined that the pig-tailed macaque RV2 rhadinovirus, MneRV2, is highly associated with lymphomas in macaques with simian AIDS. To further study the role of rhadinoviruses in the development of lymphoma, we sequenced the complete genome of MneRV2 and identified 87 protein coding genes and 17 candidate microRNAs (miRNAs). A strong genome colinearity and sequence homology were observed between MneRV2 and RRV26-95, although the open reading frame (ORF) encoding the KSHV ORFK15 homolog was disrupted in RRV26-95. Comparison with MneRV2 revealed several genomic anomalies in RRV17577 that were not present in other rhadinovirus genomes, including an N-terminal duplication in ORF4 and a recombinative exchange of more distantly related homologs of the ORF22/ORF47 interacting glycoprotein genes. The comparison with MneRV2 has revealed novel genes and important conservation of protein coding domains and transcription initiation, termination, and splicing signals, which have added to our knowledge of RV2 rhadinovirus genetics. Further comparisons with KSHV and other RV1 rhadinoviruses will provide important avenues for dissecting the biology, evolution, and pathology of these closely related tumor-inducing viruses in humans and other Old World primates. IMPORTANCE This work provides the sequence characterization of MneRV2, the pig-tailed macaque homolog of rhesus rhadinovirus (RRV). MneRV2 and RRV belong to the rhadinovirus 2 (RV2) rhadinovirus lineage of Old World primates and are distinct but related to Kaposi's sarcoma-associated herpesvirus (KSHV), the etiologic agent of Kaposi's sarcoma. Pig-tailed macaques provide important models of human disease, and our previous studies have indicated that MneRV2 plays a causal role in AIDS-related lymphomas in macaques. Delineation of the MneRV2 sequence has allowed a detailed characterization of the genome structure, and evolutionary comparisons with RRV and KSHV have identified conserved promoters, splice junctions, and novel genes. This comparison provides insight into RV2 rhadinovirus biology and sets the groundwork for more intensive next-generation (Next-Gen) transcript and genetic analysis of this class of tumor-inducing herpesvirus. This study supports the use of MneRV2 in pig-tailed macaques as an important model for studying rhadinovirus biology, transmission and pathology. PMID:25609822
Deep sequencing of the LRRK2 gene in 14,002 individuals reveals evidence of purifying selection and independent origin of the p.Arg1628Pro mutation in Europe

PubMed Central

Rubio, Justin P.; Topp, Simon; Warren, Liling; St Jean, Pamela L.; Wegmann, Daniel; Kessner, Darren; Novembre, John; Shen, Judong; Fraser, Dana; Aponte, Jennifer; Nangle, Keith; Cardon, Lon R.; Ehm, Margaret G.; Chissoe, Stephanie L.; Whittaker, John C.; Nelson, Matthew R.; Mooser, Vincent E.

2012-01-01

Genetic variation in LRRK2 predisposes to Parkinson disease (PD), which underpins its development as a therapeutic target. Here, we aimed to identify novel genotype-phenotype associations that might support developing LRRK2 therapies for other conditions. We sequenced the 51 exons of LRRK2 in cases comprising 12 common diseases (n = 9,582), and in 4,420 population controls. We identified 739 single nucleotide variants (SNVs), 62% of which were observed in only one person, including 316 novel exonic variants. We found evidence of purifying selection for the LRRK2 gene and a trend suggesting that this is more pronounced in the central (ROC-COR-kinase) core protein domains of LRRK2 than the flanking domains. Population genetic analyses revealed that LRRK2 is not especially polymorphic or differentiated in comparison to 201 other drug target genes. Amongst Europeans, we identified 17 carriers (0.13%) of pathogenic LRRK2 mutations that were not significantly enriched within any disease or in those reporting a family history of PD. Analysis of pathogenic mutations within Europe reveals that the p.Arg1628Pro (c4883G>C) mutation arose independently in Europe and Asia. Taken together, these findings demonstrate how targeted deep sequencing can help to reveal fundamental characteristics of clinically important loci. PMID:22415848
Deep sequencing of the LRRK2 gene in 14,002 individuals reveals evidence of purifying selection and independent origin of the p.Arg1628Pro mutation in Europe.

PubMed

Rubio, Justin P; Topp, Simon; Warren, Liling; St Jean, Pamela L; Wegmann, Daniel; Kessner, Darren; Novembre, John; Shen, Judong; Fraser, Dana; Aponte, Jennifer; Nangle, Keith; Cardon, Lon R; Ehm, Margaret G; Chissoe, Stephanie L; Whittaker, John C; Nelson, Matthew R; Mooser, Vincent E

2012-07-01

Genetic variation in LRRK2 predisposes to Parkinson disease (PD), which underpins its development as a therapeutic target. Here, we aimed to identify novel genotype-phenotype associations that might support developing LRRK2 therapies for other conditions. We sequenced the 51 exons of LRRK2 in cases comprising 12 common diseases (n = 9,582), and in 4,420 population controls. We identified 739 single-nucleotide variants, 62% of which were observed in only one person, including 316 novel exonic variants. We found evidence of purifying selection for the LRRK2 gene and a trend suggesting that this is more pronounced in the central (ROC-COR-kinase) core protein domains of LRRK2 than the flanking domains. Population genetic analyses revealed that LRRK2 is not especially polymorphic or differentiated in comparison to 201 other drug target genes. Among Europeans, we identified 17 carriers (0.13%) of pathogenic LRRK2 mutations that were not significantly enriched within any disease or in those reporting a family history of PD. Analysis of pathogenic mutations within Europe reveals that the p.Arg1628Pro (c4883G>C) mutation arose independently in Europe and Asia. Taken together, these findings demonstrate how targeted deep sequencing can help to reveal fundamental characteristics of clinically important loci. © 2012 Wiley Periodicals, Inc.
Evolutionary relationships in the ilarviruses: nucleotide sequence of prunus necrotic ringspot virus RNA 3.

PubMed

Sánchez-Navarro, J A; Pallás, V

1997-01-01

The complete nucleotide sequence of an isolate of prunus necrotic ringspot virus (PNRSV) RNA 3 has been determined. Elucidation of the amino acid sequence of the proteins encoded by the two large open reading frames (ORFs) allowed us to carry out comparative and phylogenetic studies on the movement (MP) and coat (CP) proteins in the ilarvirus group. Amino acid sequence comparison of the MP revealed a highly conserved basic sequence motif with an amphipathic alpha-helical structure preceding the conserved motif of the '30K superfamily' proposed by Mushegian and Koonin [26] for MP's. Within this '30K' motif a strictly conserved transmembrane domain is present in all ilarviruses sequenced so far. At the amino-terminal end, prune dwarf virus (PDV) has an extension not present in other ilarviruses but which is observed in all bromo- and cucumoviruses, suggesting a common ancestor or a recombinational event in the Bromoviridae family. Examination of the N-terminus of the CP's of all ilarviruses revealed a highly basic region, part of which resembles the Arg-rich motif that has been characterized in the RNA-binding protein family. This motif has also been found in the other members of the Bromoviridae family, suggesting its involvement in a structural function. Furthermore this region is required for infectivity in ilarviruses. The similarities found in this Arg-rich motif are discussed in terms of this process known as genome activation. Finally, phylogenetic analysis of both the MP and CP proteins revealed a higher relationship of A1MV to PNRSV, apple mosaic virus (ApMV) and PDV than any other member of the ilarvirus group. In that sense, A1MV should be considered as a true ilarvirus instead of forming a distinct group of viruses.
Use of mutation spectra analysis software.

PubMed

Rogozin, I; Kondrashov, F; Glazko, G

2001-02-01

The study and comparison of mutation(al) spectra is an important problem in molecular biology, because these spectra often reflect on important features of mutations and their fixation. Such features include the interaction of DNA with various mutagens, the function of repair/replication enzymes, and properties of target proteins. It is known that mutability varies significantly along nucleotide sequences, such that mutations often concentrate at certain positions, called "hotspots," in a sequence. In this paper, we discuss in detail two approaches for mutation spectra analysis: the comparison of mutation spectra with a HG-PUBL program, (FTP: sunsite.unc.edu/pub/academic/biology/dna-mutations/hyperg) and hotspot prediction with the CLUSTERM program (www.itba.mi.cnr.it/webmutation; ftp.bionet.nsc.ru/pub/biology/dbms/clusterm.zip). Several other approaches for mutational spectra analysis, such as the analysis of a target protein structure, hotspot context revealing, multiple spectra comparisons, as well as a number of mutation databases are briefly described. Mutation spectra in the lacI gene of E. coli and the human p53 gene are used for illustration of various difficulties of such analysis. Copyright 2001 Wiley-Liss, Inc.
Sequence of the structural gene for granule-bound starch synthase of potato (Solanum tuberosum L.) and evidence for a single point deletion in the amf allele.

PubMed

van der Leij, F R; Visser, R G; Ponstein, A S; Jacobsen, E; Feenstra, W J

1991-08-01

The genomic sequence of the potato gene for starch granule-bound starch synthase (GBSS; "waxy protein") has been determined for the wild-type allele of a monoploid genotype from which an amylose-free (amf) mutant was derived, and for the mutant part of the amf allele. Comparison of the wild-type sequence with a cDNA sequence from the literature and a newly isolated cDNA revealed the presence of 13 introns, the first of which is located in the untranslated leader. The promoter contains a G-box-like sequence. The deduced amino acid sequence of the precursor of GBSS shows a high degree of identity with monocot waxy protein sequences in the region corresponding to the mature form of the enzyme. The transit peptide of 77 amino acids, required for routing of the precursor to the plastids, shows much less identity with the transit peptides of the other waxy preproteins, but resembles the hydropathic distributions of these peptides. Alignment of the amino acid sequences of the four mature starch synthases with the Escherichia coli glgA gene product revealed the presence of at least three conserved boxes; there is no homology with previously proposed starch-binding domains of other enzymes involved in starch metabolism. We report the use of chimeric constructs with wild-type and amf sequences to localize, via complementation experiments, the region of the amf allele in which the mutation resides. Direct sequencing of polymerase chain reaction products confirmed that the amf mutation is a deletion of a single AT basepair in the region coding for the transit peptide.(ABSTRACT TRUNCATED AT 250 WORDS)
Vibration-Rotation Bands of HF and DF

DTIC Science & Technology

1977-09-23

98 IZa. Comparison of Observed and Calculated Line Positions of HF, Av = I Sequence ........................... 99 f2b. Comparison of Observed and...Calculated Line Positions of HF, Av = 2 Sequence ........................... 102 12c. Comparison of Observed and Calculated Line Positions of HF, Av = 3...Sequence ........................... 107 i2d. Comparison of Observed and Calculated Line Positions ofHF, Av = 4 Sequence ........................... fi
Global Genomic Diversity of Oryza sativa Varieties Revealed by Comparative Physical Mapping

PubMed Central

Wang, Xiaoming; Kudrna, David A.; Pan, Yonglong; Wang, Hao; Liu, Lin; Lin, Haiyan; Zhang, Jianwei; Song, Xiang; Goicoechea, Jose Luis; Wing, Rod A.; Zhang, Qifa; Luo, Meizhong

2014-01-01

Bacterial artificial chromosome (BAC) physical maps embedding a large number of BAC end sequences (BESs) were generated for Oryza sativa ssp. indica varieties Minghui 63 (MH63) and Zhenshan 97 (ZS97) and were compared with the genome sequences of O. sativa spp. japonica cv. Nipponbare and O. sativa ssp. indica cv. 93-11. The comparisons exhibited substantial diversities in terms of large structural variations and small substitutions and indels. Genome-wide BAC-sized and contig-sized structural variations were detected, and the shared variations were analyzed. In the expansion regions of the Nipponbare reference sequence, in comparison to the MH63 and ZS97 physical maps, as well as to the previously constructed 93-11 physical map, the amounts and types of the repeat contents, and the outputs of gene ontology analysis, were significantly different from those of the whole genome. Using the physical maps of four wild Oryza species from OMAP (http://www.omap.org) as a control, we detected many conserved and divergent regions related to the evolution process of O. sativa. Between the BESs of MH63 and ZS97 and the two reference sequences, a total of 1532 polymorphic simple sequence repeats (SSRs), 71,383 SNPs, 1767 multiple nucleotide polymorphisms, 6340 insertions, and 9137 deletions were identified. This study provides independent whole-genome resources for intra- and intersubspecies comparisons and functional genomics studies in O. sativa. Both the comparative physical maps and the GBrowse, which integrated the QTL and molecular markers from GRAMENE (http://www.gramene.org) with our physical maps and analysis results, are open to the public through our Web site (http://gresource.hzau.edu.cn/resource/resource.html). PMID:24424778
Genomic organization and sequence of the Gus-s/sup a/ allele of the murine. beta. -glucuronidase gene

DOE Office of Scientific and Technical Information (OSTI.GOV)

Funkenstein, B.; Leary, S.L.; Stein, J.C.

1988-03-01

The Gus-s/sup ..cap alpha../ allele of the mouse ..beta..-glucuronidase gene exhibits a high degree of inducibility by androgens due to its linkage with the Gus-r/sup ..cap alpha../ regulatory locus. The authors isolated Gus-s/sup ..cap alpha../ on a 28-kilobase pair fragment of mouse chromosome 5 and found that it contains 12 exons and 11 intervening sequences spanning 14 kilobase pairs of this genomic segment. The mRNA cap site was identified by ribonuclease protection and primer extension analyses which revealed an unusually short 5' noncoding sequence of 12 nucleotides. Proximal regulatory sequences in the 5'-flanking DNA and the complete sequence of themore » Gus-s/sup ..cap alpha../ mRNA transcript were also determined. Comparison of the amino acid sequence determined from the Gus-s/sup ..cap alpha../ nucleotide sequence with that of human ..beta..-glucuronidase indicated that the two human mRNA species differ due to alternate splicing of an exon homologous to exon 6 of the mouse gene.« less
Genome sequence, comparative analysis and haplotype structure of the domestic dog.

PubMed

Lindblad-Toh, Kerstin; Wade, Claire M; Mikkelsen, Tarjei S; Karlsson, Elinor K; Jaffe, David B; Kamal, Michael; Clamp, Michele; Chang, Jean L; Kulbokas, Edward J; Zody, Michael C; Mauceli, Evan; Xie, Xiaohui; Breen, Matthew; Wayne, Robert K; Ostrander, Elaine A; Ponting, Chris P; Galibert, Francis; Smith, Douglas R; DeJong, Pieter J; Kirkness, Ewen; Alvarez, Pablo; Biagi, Tara; Brockman, William; Butler, Jonathan; Chin, Chee-Wye; Cook, April; Cuff, James; Daly, Mark J; DeCaprio, David; Gnerre, Sante; Grabherr, Manfred; Kellis, Manolis; Kleber, Michael; Bardeleben, Carolyne; Goodstadt, Leo; Heger, Andreas; Hitte, Christophe; Kim, Lisa; Koepfli, Klaus-Peter; Parker, Heidi G; Pollinger, John P; Searle, Stephen M J; Sutter, Nathan B; Thomas, Rachael; Webber, Caleb; Baldwin, Jennifer; Abebe, Adal; Abouelleil, Amr; Aftuck, Lynne; Ait-Zahra, Mostafa; Aldredge, Tyler; Allen, Nicole; An, Peter; Anderson, Scott; Antoine, Claudel; Arachchi, Harindra; Aslam, Ali; Ayotte, Laura; Bachantsang, Pasang; Barry, Andrew; Bayul, Tashi; Benamara, Mostafa; Berlin, Aaron; Bessette, Daniel; Blitshteyn, Berta; Bloom, Toby; Blye, Jason; Boguslavskiy, Leonid; Bonnet, Claude; Boukhgalter, Boris; Brown, Adam; Cahill, Patrick; Calixte, Nadia; Camarata, Jody; Cheshatsang, Yama; Chu, Jeffrey; Citroen, Mieke; Collymore, Alville; Cooke, Patrick; Dawoe, Tenzin; Daza, Riza; Decktor, Karin; DeGray, Stuart; Dhargay, Norbu; Dooley, Kimberly; Dooley, Kathleen; Dorje, Passang; Dorjee, Kunsang; Dorris, Lester; Duffey, Noah; Dupes, Alan; Egbiremolen, Osebhajajeme; Elong, Richard; Falk, Jill; Farina, Abderrahim; Faro, Susan; Ferguson, Diallo; Ferreira, Patricia; Fisher, Sheila; FitzGerald, Mike; Foley, Karen; Foley, Chelsea; Franke, Alicia; Friedrich, Dennis; Gage, Diane; Garber, Manuel; Gearin, Gary; Giannoukos, Georgia; Goode, Tina; Goyette, Audra; Graham, Joseph; Grandbois, Edward; Gyaltsen, Kunsang; Hafez, Nabil; Hagopian, Daniel; Hagos, Birhane; Hall, Jennifer; Healy, Claire; Hegarty, Ryan; Honan, Tracey; Horn, Andrea; Houde, Nathan; Hughes, Leanne; Hunnicutt, Leigh; Husby, M; Jester, Benjamin; Jones, Charlien; Kamat, Asha; Kanga, Ben; Kells, Cristyn; Khazanovich, Dmitry; Kieu, Alix Chinh; Kisner, Peter; Kumar, Mayank; Lance, Krista; Landers, Thomas; Lara, Marcia; Lee, William; Leger, Jean-Pierre; Lennon, Niall; Leuper, Lisa; LeVine, Sarah; Liu, Jinlei; Liu, Xiaohong; Lokyitsang, Yeshi; Lokyitsang, Tashi; Lui, Annie; Macdonald, Jan; Major, John; Marabella, Richard; Maru, Kebede; Matthews, Charles; McDonough, Susan; Mehta, Teena; Meldrim, James; Melnikov, Alexandre; Meneus, Louis; Mihalev, Atanas; Mihova, Tanya; Miller, Karen; Mittelman, Rachel; Mlenga, Valentine; Mulrain, Leonidas; Munson, Glen; Navidi, Adam; Naylor, Jerome; Nguyen, Tuyen; Nguyen, Nga; Nguyen, Cindy; Nguyen, Thu; Nicol, Robert; Norbu, Nyima; Norbu, Choe; Novod, Nathaniel; Nyima, Tenchoe; Olandt, Peter; O'Neill, Barry; O'Neill, Keith; Osman, Sahal; Oyono, Lucien; Patti, Christopher; Perrin, Danielle; Phunkhang, Pema; Pierre, Fritz; Priest, Margaret; Rachupka, Anthony; Raghuraman, Sujaa; Rameau, Rayale; Ray, Verneda; Raymond, Christina; Rege, Filip; Rise, Cecil; Rogers, Julie; Rogov, Peter; Sahalie, Julie; Settipalli, Sampath; Sharpe, Theodore; Shea, Terrance; Sheehan, Mechele; Sherpa, Ngawang; Shi, Jianying; Shih, Diana; Sloan, Jessie; Smith, Cherylyn; Sparrow, Todd; Stalker, John; Stange-Thomann, Nicole; Stavropoulos, Sharon; Stone, Catherine; Stone, Sabrina; Sykes, Sean; Tchuinga, Pierre; Tenzing, Pema; Tesfaye, Senait; Thoulutsang, Dawa; Thoulutsang, Yama; Topham, Kerri; Topping, Ira; Tsamla, Tsamla; Vassiliev, Helen; Venkataraman, Vijay; Vo, Andy; Wangchuk, Tsering; Wangdi, Tsering; Weiand, Michael; Wilkinson, Jane; Wilson, Adam; Yadav, Shailendra; Yang, Shuli; Yang, Xiaoping; Young, Geneva; Yu, Qing; Zainoun, Joanne; Zembek, Lisa; Zimmer, Andrew; Lander, Eric S

2005-12-08

Here we report a high-quality draft genome sequence of the domestic dog (Canis familiaris), together with a dense map of single nucleotide polymorphisms (SNPs) across breeds. The dog is of particular interest because it provides important evolutionary information and because existing breeds show great phenotypic diversity for morphological, physiological and behavioural traits. We use sequence comparison with the primate and rodent lineages to shed light on the structure and evolution of genomes and genes. Notably, the majority of the most highly conserved non-coding sequences in mammalian genomes are clustered near a small subset of genes with important roles in development. Analysis of SNPs reveals long-range haplotypes across the entire dog genome, and defines the nature of genetic diversity within and across breeds. The current SNP map now makes it possible for genome-wide association studies to identify genes responsible for diseases and traits, with important consequences for human and companion animal health.
Construction of a large collection of small genome variations in French dairy and beef breeds using whole-genome sequences.

PubMed

Boussaha, Mekki; Michot, Pauline; Letaief, Rabia; Hozé, Chris; Fritz, Sébastien; Grohs, Cécile; Esquerré, Diane; Duchesne, Amandine; Philippe, Romain; Blanquet, Véronique; Phocas, Florence; Floriot, Sandrine; Rocha, Dominique; Klopp, Christophe; Capitan, Aurélien; Boichard, Didier

2016-11-15

In recent years, several bovine genome sequencing projects were carried out with the aim of developing genomic tools to improve dairy and beef production efficiency and sustainability. In this study, we describe the first French cattle genome variation dataset obtained by sequencing 274 whole genomes representing several major dairy and beef breeds. This dataset contains over 28 million single nucleotide polymorphisms (SNPs) and small insertions and deletions. Comparisons between sequencing results and SNP array genotypes revealed a very high genotype concordance rate, which indicates the good quality of our data. To our knowledge, this is the first large-scale catalog of small genomic variations in French dairy and beef cattle. This resource will contribute to the study of gene functions and population structure and also help to improve traits through genotype-guided selection.
A communal catalogue reveals Earth's multiscale microbial diversity.

PubMed

Thompson, Luke R; Sanders, Jon G; McDonald, Daniel; Amir, Amnon; Ladau, Joshua; Locey, Kenneth J; Prill, Robert J; Tripathi, Anupriya; Gibbons, Sean M; Ackermann, Gail; Navas-Molina, Jose A; Janssen, Stefan; Kopylova, Evguenia; Vázquez-Baeza, Yoshiki; González, Antonio; Morton, James T; Mirarab, Siavash; Zech Xu, Zhenjiang; Jiang, Lingjing; Haroon, Mohamed F; Kanbar, Jad; Zhu, Qiyun; Jin Song, Se; Kosciolek, Tomasz; Bokulich, Nicholas A; Lefler, Joshua; Brislawn, Colin J; Humphrey, Gregory; Owens, Sarah M; Hampton-Marcell, Jarrad; Berg-Lyons, Donna; McKenzie, Valerie; Fierer, Noah; Fuhrman, Jed A; Clauset, Aaron; Stevens, Rick L; Shade, Ashley; Pollard, Katherine S; Goodwin, Kelly D; Jansson, Janet K; Gilbert, Jack A; Knight, Rob

2017-11-23

Our growing awareness of the microbial world's importance and diversity contrasts starkly with our limited understanding of its fundamental structure. Despite recent advances in DNA sequencing, a lack of standardized protocols and common analytical frameworks impedes comparisons among studies, hindering the development of global inferences about microbial life on Earth. Here we present a meta-analysis of microbial community samples collected by hundreds of researchers for the Earth Microbiome Project. Coordinated protocols and new analytical methods, particularly the use of exact sequences instead of clustered operational taxonomic units, enable bacterial and archaeal ribosomal RNA gene sequences to be followed across multiple studies and allow us to explore patterns of diversity at an unprecedented scale. The result is both a reference database giving global context to DNA sequence data and a framework for incorporating data from future studies, fostering increasingly complete characterization of Earth's microbial diversity.
The structure of cell adhesion molecule uvomorulin. Insights into the molecular mechanism of Ca2+-dependent cell adhesion.

PubMed Central

Ringwald, M; Schuh, R; Vestweber, D; Eistetter, H; Lottspeich, F; Engel, J; Dölz, R; Jähnig, F; Epplen, J; Mayer, S

1987-01-01

We have determined the amino acid sequence of the Ca2+-dependent cell adhesion molecule uvomorulin as it appears on the cell surface. The extracellular part of the molecule exhibits three internally repeated domains of 112 residues which are most likely generated by gene duplication. Each of the repeated domains contains two highly conserved units which could represent putative Ca2+-binding sites. Secondary structure predictions suggest that the putative Ca2+-binding units are located in external loops at the surface of the protein. The protein sequence exhibits a single membrane-spanning region and a cytoplasmic domain. Sequence comparison reveals extensive homology to the chicken L-CAM. Both uvomorulin and L-CAM are identical in 65% of their entire amino acid sequence suggesting a common origin for both CAMs. Images Fig. 1. Fig. 4. Fig. 7. PMID:3501370
High-resolution MRI of cranial nerves in posterior fossa at 3.0 T.

PubMed

Guo, Zi-Yi; Chen, Jing; Liang, Qi-Zhou; Liao, Hai-Yan; Cheng, Qiong-Yue; Fu, Shui-Xi; Chen, Cai-Xiang; Yu, Dan

2013-02-01

To evaluate the influence of high-resolution imaging obtainable with the higher field strength of 3.0 T on the visualization of the brain nerves in the posterior fossa. In total, 20 nerves were investigated on MRI of 12 volunteers each and selected for comparison, respectively, with the FSE sequences with 5 mm and 2 mm section thicknesses and gradient recalled echo (GRE) sequences acquired with a 3.0-T scanner. The MR images were evaluated by three independent readers who rated image quality according to depiction of anatomic detail and contrast with use of a rating scale. In general, decrease of the slice thickness showed a significant increase in the detection of nerves as well as in the image quality characteristics. Comparing FSE and GRE imaging, the course of brain nerves and brainstem vessels was visualized best with use of the three-dimensional (3D) pulse sequence. The comparison revealed the clear advantage of a thin section. The increased resolution enabled immediate identification of all brainstem nerves. GRE sequence most distinctly and confidently depicted pertinent structures and enables 3D reconstruction to illustrate complex relations of the brainstem. Copyright © 2013 Hainan Medical College. Published by Elsevier B.V. All rights reserved.
New Ceratocystis species from Eucalyptus and Cunninghamia in South China.

PubMed

Liu, FeiFei; Mbenoun, Michael; Barnes, Irene; Roux, Jolanda; Wingfield, Michael J; Li, GuoQing; Li, JieQiong; Chen, ShuaiFei

2015-06-01

During routine surveys for possible fungal pathogens in the rapidly expanding plantations of Eucalyptus and Cunninghamia lanceolata in China, numerous isolates of unknown species in the genus Ceratocystis (Microascales) were obtained from tree wounds. In this study we identified the Ceratocystis isolates from Eucalyptus and Cunninghamia in the GuangDong, GuangXi, FuJian and HaiNan Provinces of South China based on morphology and through comparisons of DNA sequence data for the ITS, partial β-tubulin and TEF-1α gene regions. Morphological and DNA sequence comparisons revealed two previously unknown species residing in the Indo-Pacific Clade. These are described here as Ceratocystis cercfabiensis sp. nov. and Ceratocystis collisensis sp. nov. Isolates of Ceratocystis cercfabiensis showed intragenomic variation in their ITS sequences and four strains were selected for cloning of the ITS gene region. Twelve ITS haplotypes were obtained from 17 clones selected for sequencing, differing in up to seven base positions and representing two separate phylogenetic groups. This is the first evidence of multiple ITS types in isolates of Ceratocystis residing in the Indo-Pacific Clade. Caution should thus be exercised when using the ITS gene region as a barcoding marker for Ceratocystis species in this clade. This study also represents the first record of a species of Ceratocystis from Cunninghamia.
Comparison of Molecular Typing Methods Useful for Detecting Clusters of Campylobacter jejuni and C. coli Isolates through Routine Surveillance

PubMed Central

Taboada, Eduardo; Grant, Christopher C. R.; Blakeston, Connie; Pollari, Frank; Marshall, Barbara; Rahn, Kris; MacKinnon, Joanne; Daignault, Danielle; Pillai, Dylan; Ng, Lai-King

2012-01-01

Campylobacter spp. may be responsible for unreported outbreaks of food-borne disease. The detection of these outbreaks is made more difficult by the fact that appropriate methods for detecting clusters of Campylobacter have not been well defined. We have compared the characteristics of five molecular typing methods on Campylobacter jejuni and C. coli isolates obtained from human and nonhuman sources during sentinel site surveillance during a 3-year period. Comparative genomic fingerprinting (CGF) appears to be one of the optimal methods for the detection of clusters of cases, and it could be supplemented by the sequencing of the flaA gene short variable region (flaA SVR sequence typing), with or without subsequent multilocus sequence typing (MLST). Different methods may be optimal for uncovering different aspects of source attribution. Finally, the use of several different molecular typing or analysis methods for comparing individuals within a population reveals much more about that population than a single method. Similarly, comparing several different typing methods reveals a great deal about differences in how the methods group individuals within the population. PMID:22162562
A communal catalogue reveals Earth’s multiscale microbial diversity

DOE PAGES

Thompson, Luke R.; Sanders, Jon G.; McDonald, Daniel; ...

2017-11-01

Our growing awareness of the importance and diversity of the microbial world contrasts starkly with our limited understanding of its fundamental structure. Despite remarkable advances in DNA sequence generation, a lack of standardized protocols and common analytical framework impede useful comparison between studies, hindering development of global inferences about microbial life on Earth. Here, we show that with coordinated protocols, exact microbial 16S rRNA gene sequences can be followed across scores of individual studies, revealing patterns of diversity, community structure, and life history strategy at a planetary scale. Using 27,751 crowdsourced environmental samples comprising more than 2.2 billion reads, wemore » find sharp divides between host-associated and free-living communities. We show that the distribution of taxonomic and sequence diversity follows consistent trends across samples types and along gradients of environmental parameters, highlighting some of the global evolutionary patterns and ecological principles that underpin Earth’s microbiome. Here, this dataset provides the most complete environmental survey of our microbial world to date, and serves as a growing reference to provide immediate global context to future microbial surveys.« less

The Pseudomonas putida peptidoglycan-associated outer membrane lipoprotein is involved in maintenance of the integrity of the cell cell envelope.

PubMed Central

Rodríguez-Herva, J J; Ramos-Gonzalez, M I; Ramos, J L

1996-01-01

Pseudomonas putida 14G-3, a derivative of the natural soil inhabitant P. putida KT2440, exhibited a chromosomal insertion of a mini-Tn5/'phoA transposon that resulted in reduced ability to colonize soil. In vitro characterization of P. putida 14G-3 revealed that it exhibited an altered cell morphology and envelope, as revealed by electron microscopy. The derived strain was sensitive to sodium dodecyl sulfate, deoxycholate, and EDTA, produced clumps when it reached high cell densities in the late logarithmic growth phase, and did not grow on low-osmolarity medium. The P. putida DNA surrounding the mini-Tn5/'phoA insertion was cloned and used as a probe to rescue the wild-type gene, which was sequenced. Comparison of the deduced peptide sequence with sequences in the Swiss-Prot database allowed the knocked-out gene to be identified as that encoding the peptidoglycan-associated lipoprotein (Pal or OprL) of P. putida. The protein was identified in coupled transcription and translation assays in vitro. PMID:8626299
A communal catalogue reveals Earth’s multiscale microbial diversity

DOE Office of Scientific and Technical Information (OSTI.GOV)

Thompson, Luke R.; Sanders, Jon G.; McDonald, Daniel

Our growing awareness of the importance and diversity of the microbial world contrasts starkly with our limited understanding of its fundamental structure. Despite remarkable advances in DNA sequence generation, a lack of standardized protocols and common analytical framework impede useful comparison between studies, hindering development of global inferences about microbial life on Earth. Here, we show that with coordinated protocols, exact microbial 16S rRNA gene sequences can be followed across scores of individual studies, revealing patterns of diversity, community structure, and life history strategy at a planetary scale. Using 27,751 crowdsourced environmental samples comprising more than 2.2 billion reads, wemore » find sharp divides between host-associated and free-living communities. We show that the distribution of taxonomic and sequence diversity follows consistent trends across samples types and along gradients of environmental parameters, highlighting some of the global evolutionary patterns and ecological principles that underpin Earth’s microbiome. Here, this dataset provides the most complete environmental survey of our microbial world to date, and serves as a growing reference to provide immediate global context to future microbial surveys.« less
The Genome Sequence of Methanohalophilus mahii SLP T Reveals Differences in the Energy Metabolism among Members of the Methanosarcinaceae Inhabiting Freshwater and Saline Environments

DOE Office of Scientific and Technical Information (OSTI.GOV)

Spring, Stefan; Scheuner, Carmen; Lapidus, Alla

Methanohalophilus mahii is the type species of the genus Methanohalophilus , which currently comprises three distinct species with validly published names. Mhp. mahii represents moderately halophilic methanogenic archaea with a strictly methylotrophic metabolism. The type strain SLP T was isolated from hypersaline sediments collected from the southern arm of Great Salt Lake, Utah. Here we describe the features of this organism, together with the complete genome sequence and annotation. The 2,012,424 bp genome is a single replicon with 2032 protein-coding and 63 RNA genes and part of the Genomic Encyclopedia of Bacteria and Archaea project. A comparison of the reconstructedmore » energy metabolism in the halophilic species Mhp. mahii with other representatives of the Methanosarcinaceae reveals some interesting differences to freshwater species.« less
The Genome Sequence of Methanohalophilus mahii SLP T Reveals Differences in the Energy Metabolism among Members of the Methanosarcinaceae Inhabiting Freshwater and Saline Environments

DOE PAGES

Spring, Stefan; Scheuner, Carmen; Lapidus, Alla; ...

2010-01-01

Methanohalophilus mahii is the type species of the genus Methanohalophilus , which currently comprises three distinct species with validly published names. Mhp. mahii represents moderately halophilic methanogenic archaea with a strictly methylotrophic metabolism. The type strain SLP T was isolated from hypersaline sediments collected from the southern arm of Great Salt Lake, Utah. Here we describe the features of this organism, together with the complete genome sequence and annotation. The 2,012,424 bp genome is a single replicon with 2032 protein-coding and 63 RNA genes and part of the Genomic Encyclopedia of Bacteria and Archaea project. A comparison of the reconstructedmore » energy metabolism in the halophilic species Mhp. mahii with other representatives of the Methanosarcinaceae reveals some interesting differences to freshwater species.« less
Exome sequencing supports a de novo mutational paradigm for schizophrenia

PubMed Central

Xu, Bin; Roos, J. Louw; Dexheimer, Phillip; Boone, Braden; Plummer, Brooks; Levy, Shawn; Gogos, Joseph A.; Karayiorgou, Maria

2011-01-01

Despite high heritability, a large fraction of cases with schizophrenia do not have a family history of the disease (sporadic cases). Here, we examine the possibility that rare de novo protein-altering mutations contribute to the genetic component of schizophrenia by sequencing the exome of 53 sporadic cases, 22 unaffected controls and their parents. We identified 40 de novo mutations in 27 patients affecting 40 genes including a potentially disruptive mutation in DGCR2, a gene removed by the recurrent schizophrenia-predisposing 22q11.2 microdeletion. Comparison to rare inherited variants revealed that the identified de novo mutations show a large excess of nonsynonymous changes in cases, as well as a greater potential to affect protein structure and function. Our analysis reveals a major role of de novo mutations in schizophrenia and also a large mutational target, which together provide a plausible explanation for the high global incidence and persistence of the disease. PMID:21822266
IgV peptide mapping of native Ro60 autoantibody proteomes in primary Sjögren's syndrome reveals molecular markers of Ro/La diversification.

PubMed

Wang, Jing J; Al Kindi, Mahmood A; Colella, Alex D; Dykes, Lukah; Jackson, Michael W; Chataway, Tim K; Reed, Joanne H; Gordon, Tom P

2016-12-01

We have used high-resolution mass spectrometry to sequence precipitating anti-Ro60 proteomes from sera of patients with primary Sjögren's syndrome and compare immunoglobulin variable-region (IgV) peptide signatures in Ro/La autoantibody subsets. Anti-Ro60 were purified by elution from native Ro60-coated ELISA plates and subjected to combined de novo amino acid sequencing and database matching. Monospecific anti-Ro60 Igs comprised dominant public and minor private sets of IgG1 kappa and lambda restricted heavy and light chains. Specific IgV amino acid substitutions stratified anti-Ro60 from anti-Ro60/La responses, providing a molecular fingerprint of Ro60/La determinant spreading and suggesting that different forms of Ro60 antigen drive these responses. Sequencing of linked anti-Ro52 proteomes from individual patients and comparison with their anti-Ro60 partners revealed sharing of a dominant IGHV3-23/IGKV3-20 paired clonotype but with divergent IgV mutational signatures. In summary, anti-Ro60 IgV peptide mapping provides insights into Ro/La autoantibody diversification and reveals serum-based molecular markers of humoral Ro60 autoimmunity. Copyright Â© 2016 Elsevier Inc. All rights reserved.
Rhizobium etli asparaginase II

PubMed Central

Huerta-Saquero, Alejandro; Evangelista-Martínez, Zahaed; Moreno-Enriquez, Angélica; Perez-Rueda, Ernesto

2013-01-01

Bacterial l-asparaginase has been a universal component of therapies for childhood acute lymphoblastic leukemia since the 1970s. Two principal enzymes derived from Escherichia coli and Erwinia chrysanthemi are the only options clinically approved to date. We recently reported a study of recombinant l-asparaginase (AnsA) from Rhizobium etli and described an increasing type of AnsA family members. Sequence analysis revealed four conserved motifs with notable differences with respect to the conserved regions of amino acid sequences of type I and type II l-asparaginases, particularly in comparison with therapeutic enzymes from E. coli and E. chrysanthemi. These differences suggested a distinct immunological specificity. Here, we report an in silico analysis that revealed immunogenic determinants of AnsA. Also, we used an extensive approach to compare the crystal structures of E. coli and E. chrysantemi asparaginases with a computational model of AnsA and identified immunogenic epitopes. A three-dimensional model of AsnA revealed, as expected based on sequence dissimilarities, completely different folding and different immunogenic epitopes. This approach could be very useful in transcending the problem of immunogenicity in two major ways: by chemical modifications of epitopes to reduce drug immunogenicity, and by site-directed mutagenesis of amino acid residues to diminish immunogenicity without reduction of enzymatic activity. PMID:22895060
Rhizobium etli asparaginase II: an alternative for acute lymphoblastic leukemia (ALL) treatment.

PubMed

Huerta-Saquero, Alejandro; Evangelista-Martínez, Zahaed; Moreno-Enriquez, Angélica; Perez-Rueda, Ernesto

2013-01-01

Bacterial L-asparaginase has been a universal component of therapies for childhood acute lymphoblastic leukemia since the 1970s. Two principal enzymes derived from Escherichia coli and Erwinia chrysanthemi are the only options clinically approved to date. We recently reported a study of recombinant L-asparaginase (AnsA) from Rhizobium etli and described an increasing type of AnsA family members. Sequence analysis revealed four conserved motifs with notable differences with respect to the conserved regions of amino acid sequences of type I and type II L-asparaginases, particularly in comparison with therapeutic enzymes from E. coli and E. chrysanthemi. These differences suggested a distinct immunological specificity. Here, we report an in silico analysis that revealed immunogenic determinants of AnsA. Also, we used an extensive approach to compare the crystal structures of E. coli and E. chrysantemi asparaginases with a computational model of AnsA and identified immunogenic epitopes. A three-dimensional model of AsnA revealed, as expected based on sequence dissimilarities, completely different folding and different immunogenic epitopes. This approach could be very useful in transcending the problem of immunogenicity in two major ways: by chemical modifications of epitopes to reduce drug immunogenicity, and by site-directed mutagenesis of amino acid residues to diminish immunogenicity without reduction of enzymatic activity.
Characterization of a prototype strain of hepatitis E virus.

PubMed

Tsarev, S A; Emerson, S U; Reyes, G R; Tsareva, T S; Legters, L J; Malik, I A; Iqbal, M; Purcell, R H

1992-01-15

A strain of hepatitis E virus (SAR-55) implicated in an epidemic of enterically transmitted non-A, non-B hepatitis, now called hepatitis E, was characterized extensively. Six cynomolgus monkeys (Macaca fascicularis) were infected with a strain of hepatitis E virus from Pakistan. Reverse transcription-polymerase chain reaction was used to determine the pattern of virus shedding in feces, bile, and serum relative to hepatitis and induction of specific antibodies. Virtually the entire genome of SAR-55 (7195 nucleotides) was sequenced. Comparison of the sequence of SAR-55 with that of a Burmese strain revealed a high level of homology except for one region encoding 100 amino acids of a putative nonstructural polyprotein. Identification of this region as hypervariable was obtained by partial sequencing of a third isolate of hepatitis E virus from Kirgizia.
The mechanical design of spider silks: from fibroin sequence to mechanical function.

PubMed

Gosline, J M; Guerette, P A; Ortlepp, C S; Savage, K N

1999-12-01

Spiders produce a variety of silks, and the cloning of genes for silk fibroins reveals a clear link between protein sequence and structure-property relationships. The fibroins produced in the spider's major ampullate (MA) gland, which forms the dragline and web frame, contain multiple repeats of motifs that include an 8-10 residue long poly-alanine block and a 24-35 residue long glycine-rich block. When fibroins are spun into fibres, the poly-alanine blocks form (&bgr;)-sheet crystals that crosslink the fibroins into a polymer network with great stiffness, strength and toughness. As illustrated by a comparison of MA silks from Araneus diadematus and Nephila clavipes, variation in fibroin sequence and properties between spider species provides the opportunity to investigate the design of these remarkable biomaterials.
Contryphan-Bt: A pyroglutamic acid containing conopeptide isolated from the venom of Conus betulinus.

PubMed

Han, Penggang; Cao, Ying; Liu, Shangyi; Dai, Xiandong; Yao, Ge; Fan, Chongxu; Wu, Wenjian; Chen, Jisheng

2017-09-01

A new member of the contryphans family was isolated from the venom of Conus betilinus, a vermivorous species distributed in the South China Sea. Its sequence, ZSGCO(D-W)KPWC-NH 2 (Z, pyroglutamic acid), was established by a combination of de novo MS/MS sequencing and venom-duct transcriptome sequencing. The occurrence of D-Trp 6 was confirmed by chemical synthesis and HPLC behavior comparison. Like known contryphans, contryphan-Bt produces the "stiff-tail" syndrome in mice and contains one disulfide bond, a hydroxyproline, a D-tryptophan, and an amidated C-terminus. However, contryphan-Bt differs from previously identified contryphans by a pyroglutamic acid at the N terminus. CD spectrum reveals that contryphan-Bt possess β-turn in solution. Copyright © 2017 Elsevier Ltd. All rights reserved.
Molecular comparison of the structural proteins encoding gene clusters of two related Lactobacillus delbrueckii bacteriophages.

PubMed Central

Vasala, A; Dupont, L; Baumann, M; Ritzenthaler, P; Alatossava, T

1993-01-01

Virulent phage LL-H and temperate phage mv4 are two related bacteriophages of Lactobacillus delbrueckii. The gene clusters encoding structural proteins of these two phages have been sequenced and further analyzed. Six open reading frames (ORF-1 to ORF-6) were detected. Protein sequencing and Western immunoblotting experiments confirmed that ORF-3 (g34) encoded the main capsid protein Gp34. The presence of a putative late promoter in front of the phage LL-H g34 gene was suggested by primer extension experiments. Comparative sequence analysis between phage LL-H and phage mv4 revealed striking similarities in the structure and organization of this gene cluster, suggesting that the genes encoding phage structural proteins belong to a highly conservative module. Images PMID:8497043
Sequence conservation from human to prokaryotes of Surf1, a protein involved in cytochrome c oxidase assembly, deficient in Leigh syndrome.

PubMed

Poyau, A; Buchet, K; Godinot, C

1999-12-03

The human SURF1 gene encoding a protein involved in cytochrome c oxidase (COX) assembly, is mutated in most patients presenting Leigh syndrome associated with COX deficiency. Proteins homologous to the human Surf1 have been identified in nine eukaryotes and six prokaryotes using database alignment tools, structure prediction and/or cDNA sequencing. Their sequence comparison revealed a remarkable Surf1 conservation during evolution and put forward at least four highly conserved domains that should be essential for Surf1 function. In Paracoccus denitrificans, the Surf1 homologue is found in the quinol oxidase operon, suggesting that Surf1 is associated with a primitive quinol oxidase which belongs to the same superfamily as cytochrome oxidase.
Virulence and molecular polymorphism of Prunus necrotic ringspot virus isolates.

PubMed

Hammond, R W; Crosslin, J M

1998-07-01

Prunus necrotic ringspot virus (PNRSV) occurs as numerous strains or isolates that vary widely in their pathogenic, biophysical and serological properties. Prior attempts to distinguish pathotypes based upon physical properties have not been successful; our approach was to examine the molecular properties that may distinguish these isolates. The nucleic acid sequence was determined from 1.65 kbp RT-PCR products derived from RNA 3 of seven distinct isolates of PNRSV that differ serologically and in pathology on sweet cherry. Sequence comparisons of ORF 3a (putative movement protein) and ORF 3b (coat protein) revealed single nucleotide and amino acid differences with strong correlations to serology and symptom types (pathotypes). Sequence differences between serotypes and pathotypes were also reflected in the overall phylogenetic relationships between the isolates.
DOE Office of Scientific and Technical Information (OSTI.GOV)

Boore, Jeffrey L.; Staton, Joseph

We have determined the sequence of about half (7470 nts) of the mitochondrial genome of the sipunculid Phascolopsis gouldii, the first representative of this phylum to be so studied. All of the 19 identified genes are transcribed from the same DNA strand. The arrangement of these genes is remarkably similar to that of the oligochaete annelid Lumbricus terrestris. Comparison of both the inferred amino acid sequences and the gene arrangements of a variety of diverse metazoan taxa reveals that the phylum Sipuncula is more closely related to Annelida than to Mollusca. This requires reinterpretation of the homology of several embryologicalmore » features and of patterns of animal body plan evolution.« less
When Genomics Is Not Enough: Experimental Evidence for a Decrease in LINE-1 Activity During the Evolution of Australian Marsupials

PubMed Central

Gallus, Susanne; Lammers, Fritjof

2016-01-01

The autonomous transposable element LINE-1 is a highly abundant element that makes up between 15% and 20% of therian mammal genomes. Since their origin before the divergence of marsupials and placental mammals, LINE-1 elements have contributed actively to the genome landscape. A previous in silico screen of the Tasmanian devil genome revealed a lack of functional coding LINE-1 sequences. In this study we present the results of an in vitro analysis from a partial LINE-1 reverse transcriptase coding sequence in five marsupial species. Our experimental screen supports the in silico findings of the genome-wide degradation of LINE-1 sequences in the Tasmanian devil, and identifies a high frequency of degraded LINE-1 sequences in other Australian marsupials. The comparison between the experimentally obtained LINE-1 sequences and reference genome assemblies suggests that conclusions from in silico analyses of retrotransposition activity can be influenced by incomplete genome assemblies from short reads. PMID:27389686
Evaluating, Comparing, and Interpreting Protein Domain Hierarchies

PubMed Central

2014-01-01

Abstract Arranging protein domain sequences hierarchically into evolutionarily divergent subgroups is important for investigating evolutionary history, for speeding up web-based similarity searches, for identifying sequence determinants of protein function, and for genome annotation. However, whether or not a particular hierarchy is optimal is often unclear, and independently constructed hierarchies for the same domain can often differ significantly. This article describes methods for statistically evaluating specific aspects of a hierarchy, for probing the criteria underlying its construction and for direct comparisons between hierarchies. Information theoretical notions are used to quantify the contributions of specific hierarchical features to the underlying statistical model. Such features include subhierarchies, sequence subgroups, individual sequences, and subgroup-associated signature patterns. Underlying properties are graphically displayed in plots of each specific feature's contributions, in heat maps of pattern residue conservation, in “contrast alignments,” and through cross-mapping of subgroups between hierarchies. Together, these approaches provide a deeper understanding of protein domain functional divergence, reveal uncertainties caused by inconsistent patterns of sequence conservation, and help resolve conflicts between competing hierarchies. PMID:24559108
Genome sequence analysis of the model grass Brachypodium distachyon: insights into grass genome evolution

DOE Office of Scientific and Technical Information (OSTI.GOV)

Schulman, Al

2009-08-09

Three subfamilies of grasses, the Erhardtoideae (rice), the Panicoideae (maize, sorghum, sugar cane and millet), and the Pooideae (wheat, barley and cool season forage grasses) provide the basis of human nutrition and are poised to become major sources of renewable energy. Here we describe the complete genome sequence of the wild grass Brachypodium distachyon (Brachypodium), the first member of the Pooideae subfamily to be completely sequenced. Comparison of the Brachypodium, rice and sorghum genomes reveals a precise sequence- based history of genome evolution across a broad diversity of the grass family and identifies nested insertions of whole chromosomes into centromericmore » regions as a predominant mechanism driving chromosome evolution in the grasses. The relatively compact genome of Brachypodium is maintained by a balance of retroelement replication and loss. The complete genome sequence of Brachypodium, coupled to its exceptional promise as a model system for grass research, will support the development of new energy and food crops« less
Conserved intergenic sequences revealed by CTAG-profiling in Salmonella: thermodynamic modeling for function prediction

NASA Astrophysics Data System (ADS)

Tang, Le; Zhu, Songling; Mastriani, Emilio; Fang, Xin; Zhou, Yu-Jie; Li, Yong-Guo; Johnston, Randal N.; Guo, Zheng; Liu, Gui-Rong; Liu, Shu-Lin

2017-03-01

Highly conserved short sequences help identify functional genomic regions and facilitate genomic annotation. We used Salmonella as the model to search the genome for evolutionarily conserved regions and focused on the tetranucleotide sequence CTAG for its potentially important functions. In Salmonella, CTAG is highly conserved across the lineages and large numbers of CTAG-containing short sequences fall in intergenic regions, strongly indicating their biological importance. Computer modeling demonstrated stable stem-loop structures in some of the CTAG-containing intergenic regions, and substitution of a nucleotide of the CTAG sequence would radically rearrange the free energy and disrupt the structure. The postulated degeneration of CTAG takes distinct patterns among Salmonella lineages and provides novel information about genomic divergence and evolution of these bacterial pathogens. Comparison of the vertically and horizontally transmitted genomic segments showed different CTAG distribution landscapes, with the genome amelioration process to remove CTAG taking place inward from both terminals of the horizontally acquired segment.
Genetic diversity of the captive Asian tapir population in Thailand, based on mitochondrial control region sequence data and the comparison of its nucleotide structure with Brazilian tapir.

PubMed

Muangkram, Yuttamol; Amano, Akira; Wajjwalku, Worawidh; Pinyopummintr, Tanu; Thongtip, Nikorn; Kaolim, Nongnid; Sukmak, Manakorn; Kamolnorranath, Sumate; Siriaroonrat, Boripat; Tipkantha, Wanlaya; Maikaew, Umaporn; Thomas, Warisara; Polsrila, Kanda; Dongsaard, Kwanreaun; Sanannu, Saowaphang; Wattananorrasate, Anuwat

2017-07-01

The Asian tapir (Tapirus indicus) has been classified as Endangered on the IUCN Red List of Threatened Species (2008). Genetic diversity data provide important information for the management of captive breeding and conservation of this species. We analyzed mitochondrial control region (CR) sequences from 37 captive Asian tapirs in Thailand. Multiple alignments of the full-length CR sequences sized 1268 bp comprised three domains as described in other mammal species. Analysis of 16 parsimony-informative variable sites revealed 11 haplotypes. Furthermore, the phylogenetic analysis using median-joining network clearly showed three clades correlated with our earlier cytochrome b gene study in this endangered species. The repetitive motif is located between first and second conserved sequence blocks, similar to the Brazilian tapir. The highest polymorphic site was located in the extended termination associated sequences domain. The results could be applied for future genetic management based in captivity and wild that shows stable populations.

Cloning and characterization of an autonomous replication sequence from Coxiella burnetii.

PubMed Central

Suhan, M; Chen, S Y; Thompson, H A; Hoover, T A; Hill, A; Williams, J C

1994-01-01

A Coxiella burnetii chromosomal fragment capable of functioning as an origin for the replication of a kanamycin resistance (Kanr) plasmid was isolated by use of origin search methods utilizing an Escherichia coli host. The 5.8-kb fragment was subcloned into phagemid vectors and was deleted progressively by an exonuclease III-S1 technique. Plasmids containing progressively shorter DNA fragments were then tested for their capability to support replication by transformation of an E. coli polA strain. A minimal autonomous replication sequence (ARS) was delimited to 403 bp. Sequencing of the entire 5.8-kb region revealed that the minimal ARS contained two consensus DnaA boxes, three A + T-rich 21-mers, a transcriptional promoter leading rightwards, and potential integration host factor and factor of inversion stimulation binding sites. Database comparisons of deduced amino acid sequences revealed that open reading frames located around the ARS were homologous to genes often, but not always, found near bacterial chromosomal origins; these included identities with rpmH and rnpA in E. coli and identities with the 9K protein and 60K membrane protein in E. coli and Pseudomonas species. These and direct hybridization data suggested that the ARS was chromosomal and not associated with the resident plasmid QpH1. Two-dimensional agarose gel electrophoresis did not reveal the presence of initiating intermediates, indicating that the ARS did not initiate chromosome replication during laboratory growth of C. burnetii. Images PMID:8071197
A global comparability approach for biosimilar monoclonal antibodies using LC-tandem MS based proteomics.

PubMed

Chen, Shun-Li; Wu, Shiaw-Lin; Huang, Li-Juan; Huang, Jia-Bao; Chen, Shu-Hui

2013-06-01

Liquid chromatography-tandem mass spectrometry-based proteomics for peptide mapping and sequencing was used to characterize the marketed monoclonal antibody trastuzumab and compare it with two biosimilar products, mAb A containing D359E and L361M variations at the Fc site and mAb B without variants. Complete sequence coverage (100%) including disulfide linkages, glycosylations and other commonly occurring modifications (i.e., deamidation, oxidation, dehydration and K-clipping) were identified using maps generated from multi-enzyme digestions. In addition to the targeted comparison for the relative populations of targeted modification forms, a non-targeted approach was used to globally compare ion intensities in tryptic maps. The non-targeted comparison provided an extra-dimensional view to examine any possible differences related to variants or modifications. A peptide containing the two variants in mAb A, D359E and L361M, was revealed using the non-targeted comparison of the tryptic maps. In contrast, no significant differences were observed when trastuzumab was self-compared or compared with mAb B. These results were consistent with the data derived from peptide sequencing via collision induced dissociation/electron transfer dissociation. Thus, combined targeted and non-targeted approaches using powerful mass spectrometry-based proteomic tools hold great promise for the structural characterization of biosimilar products. Copyright © 2013 Elsevier B.V. All rights reserved.
Comparative structural analysis of Bru1 region homeologs in Saccharum spontaneum and S. officinarum

DOE Office of Scientific and Technical Information (OSTI.GOV)

Zhang, Jisen; Sharma, Anupma; Yu, Qingyi

Here, sugarcane is a major sugar and biofuel crop, but genomic research and molecular breeding have lagged behind other major crops due to the complexity of auto-allopolyploid genomes. Sugarcane cultivars are frequently aneuploid with chromosome number ranging from 100 to 130, consisting of 70-80 % S. officinarum, 10-20 % S. spontaneum, and 10 % recombinants between these two species. Analysis of a genomic region in the progenitor autoploid genomes of sugarcane hybrid cultivars will reveal the nature and divergence of homologous chromosomes. As a result, to investigate the origin and evolution of haplotypes in the Bru1 genomic regions in sugarcanemore » cultivars, we identified two BAC clones from S. spontaneum and four from S. officinarum and compared to seven haplotype sequences from sugarcane hybrid R570. The results clarified the origin of seven homologous haplotypes in R570, four haplotypes originated from S. officinarum, two from S. spontaneum and one recombinant.. Retrotransposon insertions and sequences variations among the homologous haplotypes sequence divergence ranged from 18.2 % to 60.5 % with an average of 33. 7 %. Gene content and gene structure were relatively well conserved among the homologous haplotypes. Exon splitting occurred in haplotypes of the hybrid genome but not in its progenitor genomes. Tajima's D analysis revealed that S. spontaneum hapotypes in the Bru1 genomic regions were under strong directional selection. Numerous inversions, deletions, insertions and translocations were found between haplotypes within each genome. In conclusion, this is the first comparison among haplotypes of a modern sugarcane hybrid and its two progenitors. Tajima's D results emphasized the crucial role of this fungal disease resistance gene for enhancing the fitness of this species and indicating that the brown rust resistance gene in R570 is from S. spontaneum. Species-specific InDel, sequences similarity and phylogenetic analysis of homologous genes can be used for identifying the origin of S. spontaneum and S. officinarum haplotype in Saccharum hybrids. Comparison of exon splitting among the homologous haplotypes suggested that the genome rearrangements in Saccharum hybrids S. officinarum would be sufficient for proper genome assembly of this autopolyploid genome. Retrotransposon insertions and sequences variations among the homologous haplotypes sequence divergence may allow sequencing and assembling the autopolyploid Saccharum genomes and the auto-allopolyploid hybrid genomes using whole genome shotgun sequencing.« less
Comparative structural analysis of Bru1 region homeologs in Saccharum spontaneum and S. officinarum

DOE PAGES

Zhang, Jisen; Sharma, Anupma; Yu, Qingyi; ...

2016-06-10

Here, sugarcane is a major sugar and biofuel crop, but genomic research and molecular breeding have lagged behind other major crops due to the complexity of auto-allopolyploid genomes. Sugarcane cultivars are frequently aneuploid with chromosome number ranging from 100 to 130, consisting of 70-80 % S. officinarum, 10-20 % S. spontaneum, and 10 % recombinants between these two species. Analysis of a genomic region in the progenitor autoploid genomes of sugarcane hybrid cultivars will reveal the nature and divergence of homologous chromosomes. As a result, to investigate the origin and evolution of haplotypes in the Bru1 genomic regions in sugarcanemore » cultivars, we identified two BAC clones from S. spontaneum and four from S. officinarum and compared to seven haplotype sequences from sugarcane hybrid R570. The results clarified the origin of seven homologous haplotypes in R570, four haplotypes originated from S. officinarum, two from S. spontaneum and one recombinant.. Retrotransposon insertions and sequences variations among the homologous haplotypes sequence divergence ranged from 18.2 % to 60.5 % with an average of 33. 7 %. Gene content and gene structure were relatively well conserved among the homologous haplotypes. Exon splitting occurred in haplotypes of the hybrid genome but not in its progenitor genomes. Tajima's D analysis revealed that S. spontaneum hapotypes in the Bru1 genomic regions were under strong directional selection. Numerous inversions, deletions, insertions and translocations were found between haplotypes within each genome. In conclusion, this is the first comparison among haplotypes of a modern sugarcane hybrid and its two progenitors. Tajima's D results emphasized the crucial role of this fungal disease resistance gene for enhancing the fitness of this species and indicating that the brown rust resistance gene in R570 is from S. spontaneum. Species-specific InDel, sequences similarity and phylogenetic analysis of homologous genes can be used for identifying the origin of S. spontaneum and S. officinarum haplotype in Saccharum hybrids. Comparison of exon splitting among the homologous haplotypes suggested that the genome rearrangements in Saccharum hybrids S. officinarum would be sufficient for proper genome assembly of this autopolyploid genome. Retrotransposon insertions and sequences variations among the homologous haplotypes sequence divergence may allow sequencing and assembling the autopolyploid Saccharum genomes and the auto-allopolyploid hybrid genomes using whole genome shotgun sequencing.« less
The tapeworm Atractolytocestus tenuicollis (Cestoda: Caryophyllidea)--a sister species or ancestor of an invasive A. huronensis?

PubMed

Králová-Hromadová, Ivica; Štefka, Jan; Bazsalovicsová, Eva; Bokorová, Silvia; Oros, Mikuláš

2013-10-01

Atractolytocestus tenuicollis (Li, 1964) Xi, Wang, Wu, Gao et Nie, 2009 is a monozoic, non-segmented tapeworm of the order Caryophyllidea, parasitizing exclusively common carp (Cyprinus carpio L.). In the current work, the first molecular data, in particular complete ribosomal internal transcribed spacer 2 (ITS2) and partial mitochondrial cytochrome c oxidase subunit I (cox1) on A. tenuicollis from Niushan Lake, Wuhan, China, are provided. In order to evaluate molecular interrelationships within Atractolytocestus, the data on A. tenuicollis were compared with relevant data on two other congeners, Atractolytocestus huronensis and Atractolytocestus sagittatus. Divergent intragenomic copies (ITS2 paralogues) were detected in the ITS2 ribosomal spacer of A. tenuicollis; the same phenomenon has previously been observed also in two other congeners. ITS2 structure of A. tenuicollis was very similar to that of A. huronensis from Slovakia, USA and UK; overall pairwise sequence identity was 91.7-95.2%. On the other hand, values of sequence identity between A. tenuicollis and A. sagittatus were lower, 69.7-70.9%. Cox1 sequence, analysed in five A. tenuicollis individuals, were 100 % identical and no intraspecific variation was observed. Comparison of A. tenuicollis cox1 with respective sequences of two other Atractolytocestus species showed that the mitochondrial haplotype found in Chinese A. tenuicollis is structurally specific (haplotype 4; Ha4) and differs from all so far determined Atractolytocestus haplotypes (Ha1 and Ha2 for A. huronensis; Ha3 for A. sagittatus). Pairwise sequence identity between A. tenuicollis cox1 haplotype and remaining three haplotypes followed the same pattern as in ITS2. The nucleotide and amino acide (aa) sequence comparison with A. huronensis Ha1 and Ha2 revealed higher sequence identity, 90.3-90.8% (96.9% in aa), while lower values were achieved between A. tenuicollis haplotype and Ha3 of Japanese A. sagittatus-75.2 % (81.9 % in aa). The phylogenetic analyses using cox1, ITS2 and combined cox1 + ITS2 sequences revealed close genetic interrelationship between A. tenuicollis and A. huronensis. Independently of a type of analysis and DNA region used, the topology of obtained trees was always identical; A. tenuicollis formed separate clade with A. huronensis forming a closely related sister group.
Phylogenetic Characterizations of Highly Mutated EV-B106 Recombinants Showing Extensive Genetic Exchanges with Other EV-B in Xinjiang, China.

PubMed

Song, Yang; Zhang, Yong; Fan, Qin; Cui, Hui; Yan, Dongmei; Zhu, Shuangli; Tang, Haishu; Sun, Qiang; Wang, Dongyan; Xu, Wenbo

2017-02-23

Human enterovirus B106 (EV-B106) is a new member of the enterovirus B species. To date, only three nucleotide sequences of EV-B106 have been published, and only one full-length genome sequence (the Yunnan strain 148/YN/CHN/12) is available in the GenBank database. In this study, we conducted phylogenetic characterisation of four EV-B106 strains isolated in Xinjiang, China. Pairwise comparisons of the nucleotide sequences and the deduced amino acid sequences revealed that the four Xinjiang EV-B106 strains had only 80.5-80.8% nucleotide identity and 95.4-97.3% amino acid identity with the Yunnan EV-B106 strain, indicating high mutagenicity. Similarity plots and bootscanning analyses revealed that frequent intertypic recombination occurred in all four Xinjiang EV-B106 strains in the non-structural region. These four strains may share a donor sequence with the EV-B85 strain, which circulated in Xinjiang in 2011, indicating extensive genetic exchanges between these strains. All Xinjiang EV-B106 strains were temperature-sensitive. An antibody seroprevalence study against EV-B106 in two Xinjiang prefectures also showed low titres of neutralizing antibodies, suggesting limited exposure and transmission in the population. This study contributes the whole genome sequences of EV-B106 to the GenBank database and provides valuable information regarding the molecular epidemiology of EV-B106 in China.
Phylogenetic Characterizations of Highly Mutated EV-B106 Recombinants Showing Extensive Genetic Exchanges with Other EV-B in Xinjiang, China

PubMed Central

Song, Yang; Zhang, Yong; Fan, Qin; Cui, Hui; Yan, Dongmei; Zhu, Shuangli; Tang, Haishu; Sun, Qiang; Wang, Dongyan; Xu, Wenbo

2017-01-01

Human enterovirus B106 (EV-B106) is a new member of the enterovirus B species. To date, only three nucleotide sequences of EV-B106 have been published, and only one full-length genome sequence (the Yunnan strain 148/YN/CHN/12) is available in the GenBank database. In this study, we conducted phylogenetic characterisation of four EV-B106 strains isolated in Xinjiang, China. Pairwise comparisons of the nucleotide sequences and the deduced amino acid sequences revealed that the four Xinjiang EV-B106 strains had only 80.5–80.8% nucleotide identity and 95.4–97.3% amino acid identity with the Yunnan EV-B106 strain, indicating high mutagenicity. Similarity plots and bootscanning analyses revealed that frequent intertypic recombination occurred in all four Xinjiang EV-B106 strains in the non-structural region. These four strains may share a donor sequence with the EV-B85 strain, which circulated in Xinjiang in 2011, indicating extensive genetic exchanges between these strains. All Xinjiang EV-B106 strains were temperature-sensitive. An antibody seroprevalence study against EV-B106 in two Xinjiang prefectures also showed low titres of neutralizing antibodies, suggesting limited exposure and transmission in the population. This study contributes the whole genome sequences of EV-B106 to the GenBank database and provides valuable information regarding the molecular epidemiology of EV-B106 in China. PMID:28230168
Augmenting Chinese hamster genome assembly by identifying regions of high confidence.

PubMed

Vishwanathan, Nandita; Bandyopadhyay, Arpan A; Fu, Hsu-Yuan; Sharma, Mohit; Johnson, Kathryn C; Mudge, Joann; Ramaraj, Thiruvarangan; Onsongo, Getiria; Silverstein, Kevin A T; Jacob, Nitya M; Le, Huong; Karypis, George; Hu, Wei-Shou

2016-09-01

Chinese hamster Ovary (CHO) cell lines are the dominant industrial workhorses for therapeutic recombinant protein production. The availability of genome sequence of Chinese hamster and CHO cells will spur further genome and RNA sequencing of producing cell lines. However, the mammalian genomes assembled using shot-gun sequencing data still contain regions of uncertain quality due to assembly errors. Identifying high confidence regions in the assembled genome will facilitate its use for cell engineering and genome engineering. We assembled two independent drafts of Chinese hamster genome by de novo assembly from shotgun sequencing reads and by re-scaffolding and gap-filling the draft genome from NCBI for improved scaffold lengths and gap fractions. We then used the two independent assemblies to identify high confidence regions using two different approaches. First, the two independent assemblies were compared at the sequence level to identify their consensus regions as "high confidence regions" which accounts for at least 78 % of the assembled genome. Further, a genome wide comparison of the Chinese hamster scaffolds with mouse chromosomes revealed scaffolds with large blocks of collinearity, which were also compiled as high-quality scaffolds. Genome scale collinearity was complemented with EST based synteny which also revealed conserved gene order compared to mouse. As cell line sequencing becomes more commonly practiced, the approaches reported here are useful for assessing the quality of assembly and potentially facilitate the engineering of cell lines. Copyright © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Genomic structure of the human D-site binding protein (DBP) gene

DOE Office of Scientific and Technical Information (OSTI.GOV)

Shutler, G.; Glassco, T.; Kang, Xiaolin

1996-06-15

The human gene for the D-Site Binding Protein (DBP) has been sequenced and characterized. This gene is a member of the b/ZIP family of transcription factors and is one of three genes forming the PAR sub-family. DBP has been implicated in the diurnal regulation of a variety of liver-specific genes. Examination of the genomic structure of DBP reveals that the gene is divided into four exons and is contained within a relatively compact region of approximately 6 kb. These exons appear to correspond to functional divisions the DBP protein. Exon 1 contains a long 5{prime} UTR, and conservation between themore » rat and the human genes of the presence of small open reading frames within this region suggests that is may play a role in translational control. Exon 2 contains a limited region of similarity to the other PAR domain genes, which may be part of a potential activation domain. Exon 3 contains the PAR domain and differs by only 1 of 71 amino acids between rat and human. Exon 4, containing both the basic and the leucine zipper domains, is likewise highly conserved. The overall degree of homology between the rat and the human cDNA sequences is 82% for the nucleic acid sequence and 92% for the protein sequence. comparison of the rat and human proximal promoters reveals extensive sequence conservation, with two previously characterized DNA binding sites being conserved at the functional and sequence levels. 31 refs., 4 figs.« less
The Center for Regenerative Biology and Medicine at Mount Desert Island Biological Laboratory

DTIC Science & Technology

2013-06-01

system through in vivo disruption of gene function. 15. SUBJECT TERMS limb regeneration Positional Memory Code Axolotl ...another selection factor to identify those genes that are similarly controlled in both Polypterus and axolotl samples. These comparisons revealed a...sequence IDs among Axolotl and Polypterus contigs that were up-regulated and down regulated greater than 2-fold between 0 and 7 dpa. (Left) The
Genome-Wide Comparison of Magnaporthe Species Reveals a Host-Specific Pattern of Secretory Proteins and Transposable Elements

PubMed Central

Gowda, Malali

2016-01-01

Blast disease caused by the Magnaporthe species is a major factor affecting the productivity of rice, wheat and millets. This study was aimed at generating genomic information for rice and non-rice Magnaporthe isolates to understand the extent of genetic variation. We have sequenced the whole genome of the Magnaporthe isolates, infecting rice (leaf and neck), finger millet (leaf and neck), foxtail millet (leaf) and buffel grass (leaf). Rice and finger millet isolates infecting both leaf and neck tissues were sequenced, since the damage and yield loss caused due to neck blast is much higher as compared to leaf blast. The genome-wide comparison was carried out to study the variability in gene content, candidate effectors, repeat element distribution, genes involved in carbohydrate metabolism and SNPs. The analysis of repeat element footprints revealed some genes such as naringenin, 2-oxoglutarate 3-dioxygenase being targeted by Pot2 and Occan, in isolates from different host species. Some repeat insertions were host-specific while other insertions were randomly shared between isolates. The distributions of repeat elements, secretory proteins, CAZymes and SNPs showed significant variation across host-specific lineages of Magnaporthe indicating an independent genome evolution orchestrated by multiple genomic factors. PMID:27658241
Comparison of the Nodule vs. Root Transcriptome of the Actinorhizal Plant Datisca glomerata: Actinorhizal Nodules Contain a Specific Class of Defensins

PubMed Central

Santos, Patricia; Plaszczyca, Marian; Pawlowski, Katharina

2013-01-01

Actinorhizal root nodule symbioses are very diverse, and the symbiosis of Datisca glomerata has previously been shown to have many unusual aspects. In order to gain molecular information on the infection mechanism, nodule development and nodule metabolism, we compared the transcriptomes of D. glomerata roots and nodules. Root and nodule libraries representing the 3′-ends of cDNAs were subjected to high-throughput parallel 454 sequencing. To identify the corresponding genes and to improve the assembly, Illumina sequencing of the nodule transcriptome was performed as well. The evaluation revealed 406 differentially regulated genes, 295 of which (72.7%) could be assigned a function based on homology. Analysis of the nodule transcriptome showed that genes encoding components of the common symbiosis signaling pathway were present in nodules of D. glomerata, which in combination with the previously established function of SymRK in D. glomerata nodulation suggests that this pathway is also active in actinorhizal Cucurbitales. Furthermore, comparison of the D. glomerata nodule transcriptome with nodule transcriptomes from actinorhizal Fagales revealed a new subgroup of nodule-specific defensins that might play a role specific to actinorhizal symbioses. The D. glomerata members of this defensin subgroup contain an acidic C-terminal domain that was never found in plant defensins before. PMID:24009681
Gene silencing reveals multiple functions of Na+/K+-ATPase in the salmon louse (Lepeophtheirus salmonis).

PubMed

Komisarczuk, Anna Z; Kongshaug, Heidi; Nilsen, Frank

2018-02-01

Na + /K + -ATPase has a key function in a variety of physiological processes including membrane excitability, osmoregulation, regulation of cell volume, and transport of nutrients. While knowledge about Na + /K + -ATPase function in osmoregulation in crustaceans is extensive, the role of this enzyme in other physiological and developmental processes is scarce. Here, we report characterization, transcriptional distribution and likely functions of the newly identified L. salmonis Na + /K + -ATPase (LsalNa + /K + -ATPase) α subunit in various developmental stages. The complete mRNA sequence was identified, with 3003 bp open reading frame encoding a putative protein of 1001 amino acids. Putative protein sequence of LsalNa + /K + -ATPase revealed all typical features of Na + /K + -ATPase and demonstrated high sequence identity to other invertebrate and vertebrate species. Quantitative RT-PCR analysis revealed higher LsalNa + /K + -ATPase transcript level in free-living stages in comparison to parasitic stages. In situ hybridization analysis of copepodids and adult lice revealed LsalNa + /K + -ATPase transcript localization in a wide variety of tissues such as nervous system, intestine, reproductive system, and subcuticular and glandular tissue. RNAi mediated knock-down of LsalNa + /K + -ATPase caused locomotion impairment, and affected reproduction and feeding. Morphological analysis of dsRNA treated animals revealed muscle degeneration in larval stages, severe changes in the oocyte formation and maturation in females and abnormalities in tegmental glands. Thus, the study represents an important foundation for further functional investigation and identification of physiological pathways in which Na + /K + -ATPase is directly or indirectly involved. Copyright © 2018 Elsevier Inc. All rights reserved.
The genome sequence of ectromelia virus Naval and Cornell isolates from outbreaks in North America.

PubMed

Mavian, Carla; López-Bueno, Alberto; Bryant, Neil A; Seeger, Kathy; Quail, Michael A; Harris, David; Barrell, Bart; Alcami, Antonio

2014-08-01

Ectromelia virus (ECTV) is the causative agent of mousepox, a disease of laboratory mouse colonies and an excellent model for human smallpox. We report the genome sequence of two isolates from outbreaks in laboratory mouse colonies in the USA in 1995 and 1999: ECTV-Naval and ECTV-Cornell, respectively. The genome of ECTV-Naval and ECTV-Cornell was sequenced by the 454-Roche technology. The ECTV-Naval genome was also sequenced by the Sanger and Illumina technologies in order to evaluate these technologies for poxvirus genome sequencing. Genomic comparisons revealed that ECTV-Naval and ECTV-Cornell correspond to the same virus isolated from independent outbreaks. Both ECTV-Naval and ECTV-Cornell are extremely virulent in susceptible BALB/c mice, similar to ECTV-Moscow. This is consistent with the ECTV-Naval genome sharing 98.2% DNA sequence identity with that of ECTV-Moscow, and indicates that the genetic differences with ECTV-Moscow do not affect the virulence of ECTV-Naval in the mousepox model of footpad infection. Copyright © 2014 The Authors. Published by Elsevier Inc. All rights reserved.
A novel cry2Ab gene from the indigenous isolate Bacillus thuringiensis subsp. kurstaki.

PubMed

Sevim, Ali; Eryüzlü, Emine; Demirbağ, Zihni; Demir, Ismail

2012-01-01

A novel cry2Ab gene was cloned and sequenced from the indigenous isolate of Bacillus thuringiensis subsp. kurstaki. This gene was designated as cry2Ab25 and its sequence revealed an open reading frame of 1,902 bp encoding a 633 aa protein with calculated molecular mass of 70 kDa and pI value of 8.98. The amino acid sequence of the Cry2Ab25 protein was compared with previously known Cry2Ab toxins, and the phylogenetic relationships among them were determined. The deduced amino acid sequence of the Cry2Ab25 protein showed 99% homology to the known Cry2Ab proteins, except for Cry2Ab10 and Cry2Ab12 with 97% homology, and a variation in one amino acid residue in comparison with all known Cry2Ab proteins. The cry2Ab25 gene was expressed in Escherichia coli BL21(DE3) cells. Sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE) revealed that the Cry2Ab25 protein is about 70 kDa. The toxin expressed in BL21(DE3) exhibited high toxicity against Malacosoma neustria and Rhagoletis cerasi with 73% and 75% mortality after 5 days of treatment, respectively.
Strategies for high-altitude adaptation revealed from high-quality draft genome of non-violacein producing Janthinobacterium lividum ERGS5:01.

PubMed

Kumar, Rakshak; Acharya, Vishal; Singh, Dharam; Kumar, Sanjay

2018-01-01

A light pink coloured bacterial strain ERGS5:01 isolated from glacial stream water of Sikkim Himalaya was affiliated to Janthinobacterium lividum based on 16S rRNA gene sequence identity and phylogenetic clustering. Whole genome sequencing was performed for the strain to confirm its taxonomy as it lacked the typical violet pigmentation of the genus and also to decipher its survival strategy at the aquatic ecosystem of high elevation. The PacBio RSII sequencing generated genome of 5,168,928 bp with 4575 protein-coding genes and 118 RNA genes. Whole genome-based multilocus sequence analysis clustering, in silico DDH similarity value of 95.1% and, the ANI value of 99.25% established the identity of the strain ERGS5:01 (MCC 2953) as a non-violacein producing J. lividum . The genome comparisons across genus Janthinobacterium revealed an open pan-genome with the scope of the addition of new orthologous cluster to complete the genomic inventory. The genomic insight provided the genetic basis of freezing and frequent freeze-thaw cycle tolerance and, for industrially important enzymes. Extended insight into the genome provided clues of crucial genes associated with adaptation in the harsh aquatic ecosystem of high altitude.
Functional sequencing read annotation for high precision microbiome analysis

PubMed Central

Zhu, Chengsheng; Miller, Maximilian; Marpaka, Srinayani; Vaysberg, Pavel; Rühlemann, Malte C; Wu, Guojun; Heinsen, Femke-Anouska; Tempel, Marie; Zhao, Liping; Lieb, Wolfgang; Franke, Andre; Bromberg, Yana

2018-01-01

Abstract The vast majority of microorganisms on Earth reside in often-inseparable environment-specific communities—microbiomes. Meta-genomic/-transcriptomic sequencing could reveal the otherwise inaccessible functionality of microbiomes. However, existing analytical approaches focus on attributing sequencing reads to known genes/genomes, often failing to make maximal use of available data. We created faser (functional annotation of sequencing reads), an algorithm that is optimized to map reads to molecular functions encoded by the read-correspondent genes. The mi-faser microbiome analysis pipeline, combining faser with our manually curated reference database of protein functions, accurately annotates microbiome molecular functionality. mi-faser’s minutes-per-microbiome processing speed is significantly faster than that of other methods, allowing for large scale comparisons. Microbiome function vectors can be compared between different conditions to highlight environment-specific and/or time-dependent changes in functionality. Here, we identified previously unseen oil degradation-specific functions in BP oil-spill data, as well as functional signatures of individual-specific gut microbiome responses to a dietary intervention in children with Prader–Willi syndrome. Our method also revealed variability in Crohn's Disease patient microbiomes and clearly distinguished them from those of related healthy individuals. Our analysis highlighted the microbiome role in CD pathogenicity, demonstrating enrichment of patient microbiomes in functions that promote inflammation and that help bacteria survive it. PMID:29194524
Molecular cloning of a cDNA encoding the glycoprotein of hen oviduct microsomal signal peptidase.

PubMed Central

Newsome, A L; McLean, J W; Lively, M O

1992-01-01

Detergent-solubilized hen oviduct signal peptidase has been characterized previously as an apparent complex of a 19 kDa protein and a 23 kDa glycoprotein (GP23) [Baker & Lively (1987) Biochemistry 26, 8561-8567]. A cDNA clone encoding GP23 from a chicken oviduct lambda gt11 cDNA library has now been characterized. The cDNA encodes a protein of 180 amino acid residues with a single site for asparagine-linked glycosylation that has been directly identified by amino acid sequence analysis of a tryptic-digest peptide containing the glycosylated site. Immunoblot analysis reveals cross-reactivity with a dog pancreas protein. Comparison of the deduced amino acid sequence of GP23 with the 22/23 kDa glycoprotein of dog microsomal signal peptidase [Shelness, Kanwar & Blobel (1988) J. Biol. Chem. 263, 17063-17070], one of five proteins associated with this enzyme, reveals that the amino acid sequences are 90% identical. Thus the signal peptidase glycoprotein is as highly conserved as the sequences of cytochromes c and b from these same species and is likely to be found in a similar form in many, if not all, vertebrate species. The data also show conclusively that the dog and avian signal peptidases have at least one protein subunit in common. Images Fig. 1. PMID:1546959
The mitochondrial genome of Paraspadella gotoi is highly reduced and reveals that chaetognaths are a sister-group to protostomes

DOE Office of Scientific and Technical Information (OSTI.GOV)

Helfenbein, Kevin G.; Fourcade, H. Matthew; Vanjani, Rohit G.

2004-05-01

We report the first complete mitochondrial (mt) DNA sequence from a member of the phylum Chaetognatha (arrow worms). The Paraspadella gotoi mtDNA is highly unusual, missing 23 of the genes commonly found in animal mtDNAs, including atp6, which has otherwise been found universally to be present. Its 14 genes are unusually arranged into two groups, one on each strand. One group is punctuated by numerous non-coding intergenic nucleotides, while the other group is tightly packed, having no non-coding nucleotides, leading to speculation that there are two transcription units with differing modes of expression. The phylogenetic position of the Chaetognatha withinmore » the Metazoa has long been uncertain, with conflicting or equivocal results from various morphological analyses and rRNA sequence comparisons. Comparisons here of amino acid sequences from mitochondrially encoded proteins gives a single most parsimonious tree that supports a position of Chaetognatha as sister to the protostomes studied here. From this, one can more clearly interpret the patterns of evolution of various developmental features, especially regarding the embryological fate of the blastopore.« less
The adipokinetic hormone of Mantodea in comparison to other Dictyoptera.

PubMed

Gäde, Gerd; Marco, Heather G

2017-03-01

Six species of the order Mantodea (praying mantises) are investigated for the presence and sequence of putative adipokinetic hormones (AKHs). The selected species span a wide evolutionary range of various families and subfamilies of the clade Mantodea. The corpora cardiaca of the different species are dissected, methanolic extracts prepared, peptides separated by liquid chromatography, and AKHs detected and sequenced by ion trap mass spectrometry. All six species investigated contain an octapeptide with the primary structure pGlu-Val-Asn-Phe-Thr-Pro-Asn-Trp amide, which is code-named Emppe-AKH and had been found earlier in three other species of Mantodea. Conspecific bioassays with the species Creoboter sp. (family Hymenopodidae) reveal an adipokinetic but not a hypertrehalosemic function of Emppe-AKH. Comparison with other members of the Dictyoptera (cockroaches, termites) show that Emppe-AKH is only found in certain termites, which have been recently placed into the Blattaria (cockroaches) as sister group to the family Cryptocercidae. Termites and cockroaches both show biodiversity in the sequence of AKHs, and some cockroach species even contain two AKHs. In contrast, all praying mantises-irrespective of their phylogenetic position-synthesize uniformly only one and the same octapeptide Emppe-AKH. © 2017 Wiley Periodicals, Inc.

Whole-genome comparative analysis of three phytopathogenic Xylella fastidiosa strains.

PubMed

Bhattacharyya, Anamitra; Stilwagen, Stephanie; Ivanova, Natalia; D'Souza, Mark; Bernal, Axel; Lykidis, Athanasios; Kapatral, Vinayak; Anderson, Iain; Larsen, Niels; Los, Tamara; Reznik, Gary; Selkov, Eugene; Walunas, Theresa L; Feil, Helene; Feil, William S; Purcell, Alexander; Lassez, Jean-Louis; Hawkins, Trevor L; Haselkorn, Robert; Overbeek, Ross; Predki, Paul F; Kyrpides, Nikos C

2002-09-17

Xylella fastidiosa (Xf) causes wilt disease in plants and is responsible for major economic and crop losses globally. Owing to the public importance of this phytopathogen we embarked on a comparative analysis of the complete genome of Xf pv citrus and the partial genomes of two recently sequenced strains of this species: Xf pv almond and Xf pv oleander, which cause leaf scorch in almond and oleander plants, respectively. We report a reanalysis of the previously sequenced Xf 9a5c (CVC, citrus) strain and the two "gapped" Xf genomes revealing ORFs encoding critical functions in pathogenicity and conjugative transfer. Second, a detailed whole-genome functional comparison was based on the three sequenced Xf strains, identifying the unique genes present in each strain, in addition to those shared between strains. Third, an "in silico" cellular reconstruction of these organisms was made, based on a comparison of their core functional subsystems that led to a characterization of their conjugative transfer machinery, identification of potential differences in their adhesion mechanisms, and highlighting of the absence of a classical quorum-sensing mechanism. This study demonstrates the effectiveness of comparative analysis strategies in the interpretation of genomes that are closely related.
Molecular characterization and distribution of a 145-bp tandem repeat family in the genus Populus.

PubMed

Rajagopal, J; Das, S; Khurana, D K; Srivastava, P S; Lakshmikumaran, M

1999-10-01

This report aims to describe the identification and molecular characterization of a 145-bp tandem repeat family that accounts for nearly 1.5% of the Populus genome. Three members of this repeat family were cloned and sequenced from Populus deltoides and P. ciliata. The dimers of the repeat were sequenced in order to confirm the head-to-tail organization of the repeat. Hybridization-based analysis using the 145-bp tandem repeat as a probe on genomic DNA gave rise to ladder patterns which were identified to be a result of methylation and (or) sequence heterogeneity. Analysis of the methylation pattern of the repeat family using methylation-sensitive isoschizomers revealed variable methylation of the C residues and lack of methylation of the A residues. Sequence comparisons between the monomers revealed a high degree of sequence divergence that ranged between 6% and 11% in P. deltoides and between 4.2% and 8.3% in P. ciliata. This indicated the presence of sub-families within the 145-bp tandem family of repeats. Divergence was mainly due to the accumulation of point mutations and was concentrated in the central region of the repeat. The 145-bp tandem repeat family did not show significant homology to known tandem repeats from plants. A short stretch of 36 bp was found to show homology of 66.7% to a centromeric repeat from Chironomus plumosus. Dot-blot analysis and Southern hybridization data revealed the presence of the repeat family in 13 of the 14 Populus species examined. The absence of the 145-bp repeat from P. euphratica suggested that this species is relatively distant from other members of the genus, which correlates with taxonomic classifications. The widespread occurrence of the tandem family in the genus indicated that this family may be of ancient origin.
Coverage Bias and Sensitivity of Variant Calling for Four Whole-genome Sequencing Technologies

PubMed Central

Lasitschka, Bärbel; Jones, David; Northcott, Paul; Hutter, Barbara; Jäger, Natalie; Kool, Marcel; Taylor, Michael; Lichter, Peter; Pfister, Stefan; Wolf, Stephan; Brors, Benedikt; Eils, Roland

2013-01-01

The emergence of high-throughput, next-generation sequencing technologies has dramatically altered the way we assess genomes in population genetics and in cancer genomics. Currently, there are four commonly used whole-genome sequencing platforms on the market: Illumina’s HiSeq2000, Life Technologies’ SOLiD 4 and its completely redesigned 5500xl SOLiD, and Complete Genomics’ technology. A number of earlier studies have compared a subset of those sequencing platforms or compared those platforms with Sanger sequencing, which is prohibitively expensive for whole genome studies. Here we present a detailed comparison of the performance of all currently available whole genome sequencing platforms, especially regarding their ability to call SNVs and to evenly cover the genome and specific genomic regions. Unlike earlier studies, we base our comparison on four different samples, allowing us to assess the between-sample variation of the platforms. We find a pronounced GC bias in GC-rich regions for Life Technologies’ platforms, with Complete Genomics performing best here, while we see the least bias in GC-poor regions for HiSeq2000 and 5500xl. HiSeq2000 gives the most uniform coverage and displays the least sample-to-sample variation. In contrast, Complete Genomics exhibits by far the smallest fraction of bases not covered, while the SOLiD platforms reveal remarkable shortcomings, especially in covering CpG islands. When comparing the performance of the four platforms for calling SNPs, HiSeq2000 and Complete Genomics achieve the highest sensitivity, while the SOLiD platforms show the lowest false positive rate. Finally, we find that integrating sequencing data from different platforms offers the potential to combine the strengths of different technologies. In summary, our results detail the strengths and weaknesses of all four whole-genome sequencing platforms. It indicates application areas that call for a specific sequencing platform and disallow other platforms. This helps to identify the proper sequencing platform for whole genome studies with different application scopes. PMID:23776689
Comparative analysis of the complete sequence of the plastid genome of Parthenium argentatum and identification of DNA barcodes to differentiate Parthenium species and lines

PubMed Central

2009-01-01

Background Parthenium argentatum (guayule) is an industrial crop that produces latex, which was recently commercialized as a source of latex rubber safe for people with Type I latex allergy. The complete plastid genome of P. argentatum was sequenced. The sequence provides important information useful for genetic engineering strategies. Comparison to the sequences of plastid genomes from three other members of the Asteraceae, Lactuca sativa, Guitozia abyssinica and Helianthus annuus revealed details of the evolution of the four genomes. Chloroplast-specific DNA barcodes were developed for identification of Parthenium species and lines. Results The complete plastid genome of P. argentatum is 152,803 bp. Based on the overall comparison of individual protein coding genes with those in L. sativa, G. abyssinica and H. annuus, we demonstrate that the P. argentatum chloroplast genome sequence is most closely related to that of H. annuus. Similar to chloroplast genomes in G. abyssinica, L. sativa and H. annuus, the plastid genome of P. argentatum has a large 23 kb inversion with a smaller 3.4 kb inversion, within the large inversion. Using the matK and psbA-trnH spacer chloroplast DNA barcodes, three of the four Parthenium species tested, P. tomentosum, P. hysterophorus and P. schottii, can be differentiated from P. argentatum. In addition, we identified lines within P. argentatum. Conclusion The genome sequence of the P. argentatum chloroplast will enrich the sequence resources of plastid genomes in commercial crops. The availability of the complete plastid genome sequence may facilitate transformation efficiency by using the precise sequence of endogenous flanking sequences and regulatory elements in chloroplast transformation vectors. The DNA barcoding study forms the foundation for genetic identification of commercially significant lines of P. argentatum that are important for producing latex. PMID:19917140
Identification of a novel plant amalgavirus (Amalgavirus, Amalgaviridae) genome sequence in Cistus incanus.

PubMed

Goh, C J; Park, D; Lee, J S; Sebastiani, F; Hahn, Y

2018-01-01

Amalgaviridae is a family of double-stranded, monosegmented RNA viruses that are associated with plants, fungi, microsporidians, and animals. A sequence contig derived from the transcriptome of a eudicot, Cistus incanus (the family Cistaceae; commonly known as hoary rockrose), was identified as the genome sequence of a novel plant RNA virus and named Cistus incanus RNA virus 1 (CiRV1). Sequence comparison and phylogenetic analysis indicated that CiRV1 is a novel species of the genus Amalgavirus in the family Amalgaviridae. The CiRV1 genome contig has two overlapping open reading frames (ORFs). ORF1 encodes a putative replication factory matrix-like protein, while ORF2 encodes a RNA-dependent RNA polymerase (RdRp) domain. An ORF1+2 fusion protein, which functions in viral RNA replication, is produced by a +1 programmed ribosomal frameshifting (PRF) mechanism. A +1 PRF motif UUU_CGU, which matches the conserved amalgavirus +1 PRF consensus sequence UUU_CGN, was found at the boundary of CiRV1 ORF1 and ORF2. Comparison of 25 amalgavirus ORF1+2 fusion proteins revealed that only three different positions within a 13-amino acid segment were recurrently used at the boundary, possibly being selected so as not to interfere with correct folding and function of the fusion protein. CiRV1 is the first virus found to be associated with the Cistus species and may be useful for studying amalgaviruses.
Saccharomyces cerevisiae: gene annotation and genome variability, state of the art through comparative genomics.

PubMed

Louis, Ed

2011-01-01

In the early days of the yeast genome sequencing project, gene annotation was in its infancy and suffered the problem of many false positive annotations as well as missed genes. The lack of other sequences for comparison also prevented the annotation of conserved, functional sequences that were not coding. We are now in an era of comparative genomics where many closely related as well as more distantly related genomes are available for direct sequence and synteny comparisons allowing for more probable predictions of genes and other functional sequences due to conservation. We also have a plethora of functional genomics data which helps inform gene annotation for previously uncharacterised open reading frames (ORFs)/genes. For Saccharomyces cerevisiae this has resulted in a continuous updating of the gene and functional sequence annotations in the reference genome helping it retain its position as the best characterized eukaryotic organism's genome. A single reference genome for a species does not accurately describe the species and this is quite clear in the case of S. cerevisiae where the reference strain is not ideal for brewing or baking due to missing genes. Recent surveys of numerous isolates, from a variety of sources, using a variety of technologies have revealed a great deal of variation amongst isolates with genome sequence surveys providing information on novel genes, undetectable by other means. We now have a better understanding of the extant variation in S. cerevisiae as a species as well as some idea of how much we are missing from this understanding. As with gene annotation, comparative genomics enhances the discovery and description of genome variation and is providing us with the tools for understanding genome evolution, adaptation and selection, and underlying genetics of complex traits.
Extensive structural variations between mitochondrial genomes of CMS and normal peppers (Capsicum annuum L.) revealed by complete nucleotide sequencing.

PubMed

Jo, Yeong Deuk; Choi, Yoomi; Kim, Dong-Hwan; Kim, Byung-Dong; Kang, Byoung-Cheorl

2014-07-04

Cytoplasmic male sterility (CMS) is an inability to produce functional pollen that is caused by mutation of the mitochondrial genome. Comparative analyses of mitochondrial genomes of lines with and without CMS in several species have revealed structural differences between genomes, including extensive rearrangements caused by recombination. However, the mitochondrial genome structure and the DNA rearrangements that may be related to CMS have not been characterized in Capsicum spp. We obtained the complete mitochondrial genome sequences of the pepper CMS line FS4401 (507,452 bp) and the fertile line Jeju (511,530 bp). Comparative analysis between mitochondrial genomes of peppers and tobacco that are included in Solanaceae revealed extensive DNA rearrangements and poor conservation in non-coding DNA. In comparison between pepper lines, FS4401 and Jeju mitochondrial DNAs contained the same complement of protein coding genes except for one additional copy of an atp6 gene (ψatp6-2) in FS4401. In terms of genome structure, we found eighteen syntenic blocks in the two mitochondrial genomes, which have been rearranged in each genome. By contrast, sequences between syntenic blocks, which were specific to each line, accounted for 30,380 and 17,847 bp in FS4401 and Jeju, respectively. The previously-reported CMS candidate genes, orf507 and ψatp6-2, were located on the edges of the largest sequence segments that were specific to FS4401. In this region, large number of small sequence segments which were absent or found on different locations in Jeju mitochondrial genome were combined together. The incorporation of repeats and overlapping of connected sequence segments by a few nucleotides implied that extensive rearrangements by homologous recombination might be involved in evolution of this region. Further analysis using mtDNA pairs from other plant species revealed common features of DNA regions around CMS-associated genes. Although large portion of sequence context was shared by mitochondrial genomes of CMS and male-fertile pepper lines, extensive genome rearrangements were detected. CMS candidate genes located on the edges of highly-rearranged CMS-specific DNA regions and near to repeat sequences. These characteristics were detected among CMS-associated genes in other species, implying a common mechanism might be involved in the evolution of CMS-associated genes.
Characterization of a prototype strain of hepatitis E virus.

PubMed Central

Tsarev, S A; Emerson, S U; Reyes, G R; Tsareva, T S; Legters, L J; Malik, I A; Iqbal, M; Purcell, R H

1992-01-01

A strain of hepatitis E virus (SAR-55) implicated in an epidemic of enterically transmitted non-A, non-B hepatitis, now called hepatitis E, was characterized extensively. Six cynomolgus monkeys (Macaca fascicularis) were infected with a strain of hepatitis E virus from Pakistan. Reverse transcription-polymerase chain reaction was used to determine the pattern of virus shedding in feces, bile, and serum relative to hepatitis and induction of specific antibodies. Virtually the entire genome of SAR-55 (7195 nucleotides) was sequenced. Comparison of the sequence of SAR-55 with that of a Burmese strain revealed a high level of homology except for one region encoding 100 amino acids of a putative nonstructural polyprotein. Identification of this region as hypervariable was obtained by partial sequencing of a third isolate of hepatitis E virus from Kirgizia. Images PMID:1731327
Evolution of the arginase fold and functional diversity

PubMed Central

Dowling, Daniel P.; Costanzo, Luigi Di; Gennadios, Heather A.; Christianson, David W.

2009-01-01

The large number of protein structures deposited in the Protein Data Bank allows for the identification of novel structural superfamilies based on conservation of fold in addition to conservation of amino acid sequence. Since sequence diverges more rapidly than fold in protein evolution, proteins with little or no significant sequence identity are occasionally observed to adopt similar folds, thereby reflecting unanticipated evolutionary relationships. Here, we review the unique α/β fold first observed in the manganese metalloenzyme rat liver arginase, consisting of a parallel 8 stranded β-sheet surrounded by several helices, and its evolutionary relationship with the zinc-requiring and/or iron-requiring histone deacetylases and acetylpolyamine amidohydrolases. Structural comparisons reveal key features of the core α/β fold that contribute to the divergent metal ion specificity and stoichiometry required for the chemical and biological functions of these enzymes. PMID:18360740
The diversity of eukaryotic microbiota in the traditional Slovak sheep cheese--bryndza.

PubMed

Laurencík, M; Sulo, P; Sláviková, E; Piecková, E; Seman, M; Ebringer, L

2008-09-30

We investigated the occurrence and diversity of yeasts and filamentous fungi in bryndza an artisanal Slovak soft spreadable cheese prepared from raw sheep milk or from a mixture of sheep and cow milk. Samples collected during four months of the summer production period from two locations (northern and southern parts of central Slovakia) contained 10(5)-10(7) (cfu) yeasts and about 10(2) (cfu) of mold per gram of wet weight. Further characterization by conventional taxonomy and sequence comparison of D1/D2 region from 26S rRNA gene revealed Mucor circinelloides v. Tieghem as the predominant filamentous fungus. A novel Geotrichum sp. together with Kluyveromyces (K. lactis/K. marxianus) was identified as the most abundant yeast species. Occasionally other yeasts, such as Candida inconspicua, Candida silvae, Pichia fermentans and Trichosporon domesticum were found. Conventional taxonomy readily identified isolates to the genus level, but DNA sequence comparison was capable of discriminating them at the species level.
Response to comment on "Nuclear genomic sequences reveal that polar bears are an old and distinct bear lineage".

PubMed

Hailer, Frank; Kutschera, Verena E; Hallström, Björn M; Fain, Steven R; Leonard, Jennifer A; Arnason, Ulfur; Janke, Axel

2013-03-29

Nakagome et al. reanalyzed some of our data and assert that we cannot refute the mitochondrial DNA-based scenario for polar bear evolution. Their single-locus test statistic is strongly affected by introgression and incomplete lineage sorting, whereas our multilocus approaches are better suited to recover the true species relationships. Indeed, our sister-lineage model receives high support in a Bayesian model comparison.
First genome sequences of Achromobacter phages reveal new members of the N4 family.

PubMed

Wittmann, Johannes; Dreiseikelmann, Brigitte; Rohde, Manfred; Meier-Kolthoff, Jan P; Bunk, Boyke; Rohde, Christine

2014-01-27

Multi-resistant Achromobacter xylosoxidans has been recognized as an emerging pathogen causing nosocomially acquired infections during the last years. Phages as natural opponents could be an alternative to fight such infections. Bacteriophages against this opportunistic pathogen were isolated in a recent study. This study shows a molecular analysis of two podoviruses and reveals first insights into the genomic structure of Achromobacter phages so far. Growth curve experiments and adsorption kinetics were performed for both phages. Adsorption and propagation in cells were visualized by electron microscopy. Both phage genomes were sequenced with the PacBio RS II system based on single molecule, real-time (SMRT) technology and annotated with several bioinformatic tools. To further elucidate the evolutionary relationships between the phage genomes, a phylogenomic analysis was conducted using the genome Blast Distance Phylogeny approach (GBDP). In this study, we present the first detailed analysis of genome sequences of two Achromobacter phages so far. Phages JWAlpha and JWDelta were isolated from two different waste water treatment plants in Germany. Both phages belong to the Podoviridae and contain linear, double-stranded DNA with a length of 72329 bp and 73659 bp, respectively. 92 and 89 putative open reading frames were identified for JWAlpha and JWDelta, respectively, by bioinformatic analysis with several tools. The genomes have nearly the same organization and could be divided into different clusters for transcription, replication, host interaction, head and tail structure and lysis. Detailed annotation via protein comparisons with BLASTP revealed strong similarities to N4-like phages. Analysis of the genomes of Achromobacter phages JWAlpha and JWDelta and comparisons of different gene clusters with other phages revealed that they might be strongly related to other N4-like phages, especially of the Escherichia group. Although all these phages show a highly conserved genomic structure and partially strong similarities at the amino acid level, some differences could be identified. Those differences, e.g. the existence of specific genes for replication or host interaction in some N4-like phages, seem to be interesting targets for further examination of function and specific mechanisms, which might enlighten the mechanism of phage establishment in the host cell after infection.
Draft Genome Sequencing and Comparative Analysis of Aspergillus sojae NBRC4239

PubMed Central

Sato, Atsushi; Oshima, Kenshiro; Noguchi, Hideki; Ogawa, Masahiro; Takahashi, Tadashi; Oguma, Tetsuya; Koyama, Yasuji; Itoh, Takehiko; Hattori, Masahira; Hanya, Yoshiki

2011-01-01

We conducted genome sequencing of the filamentous fungus Aspergillus sojae NBRC4239 isolated from the koji used to prepare Japanese soy sauce. We used the 454 pyrosequencing technology and investigated the genome with respect to enzymes and secondary metabolites in comparison with other Aspergilli sequenced. Assembly of 454 reads generated a non-redundant sequence of 39.5-Mb possessing 13 033 putative genes and 65 scaffolds composed of 557 contigs. Of the 2847 open reading frames with Pfam domain scores of >150 found in A. sojae NBRC4239, 81.7% had a high degree of similarity with the genes of A. oryzae. Comparative analysis identified serine carboxypeptidase and aspartic protease genes unique to A. sojae NBRC4239. While A. oryzae possessed three copies of α-amyalse gene, A. sojae NBRC4239 possessed only a single copy. Comparison of 56 gene clusters for secondary metabolites between A. sojae NBRC4239 and A. oryzae revealed that 24 clusters were conserved, whereas 32 clusters differed between them that included a deletion of 18 508 bp containing mfs1, mao1, dmaT, and pks-nrps for the cyclopiazonic acid (CPA) biosynthesis, explaining the no productivity of CPA in A. sojae. The A. sojae NBRC4239 genome data will be useful to characterize functional features of the koji moulds used in Japanese industries. PMID:21659486
Next generation sequencing gives an insight into the characteristics of highly selected breeds versus non-breed horses in the course of domestication.

PubMed

Metzger, Julia; Tonda, Raul; Beltran, Sergi; Agueda, Lídia; Gut, Marta; Distl, Ottmar

2014-07-04

Domestication has shaped the horse and lead to a group of many different types. Some have been under strong human selection while others developed in close relationship with nature. The aim of our study was to perform next generation sequencing of breed and non-breed horses to provide an insight into genetic influences on selective forces. Whole genome sequencing of five horses of four different populations revealed 10,193,421 single nucleotide polymorphisms (SNPs) and 1,361,948 insertion/deletion polymorphisms (indels). In comparison to horse variant databases and previous reports, we were able to identify 3,394,883 novel SNPs and 868,525 novel indels. We analyzed the distribution of individual variants and found significant enrichment of private mutations in coding regions of genes involved in primary metabolic processes, anatomical structures, morphogenesis and cellular components in non-breed horses and in contrast to that private mutations in genes affecting cell communication, lipid metabolic process, neurological system process, muscle contraction, ion transport, developmental processes of the nervous system and ectoderm in breed horses. Our next generation sequencing data constitute an important first step for the characterization of non-breed in comparison to breed horses and provide a large number of novel variants for future analyses. Functional annotations suggest specific variants that could play a role for the characterization of breed or non-breed horses.
Genome of the Actinomycete Plant Pathogen Clavibacter michiganensis subsp. sepedonicus Suggests Recent Niche Adaptation▿ †

PubMed Central

Bentley, Stephen D.; Corton, Craig; Brown, Susan E.; Barron, Andrew; Clark, Louise; Doggett, Jon; Harris, Barbara; Ormond, Doug; Quail, Michael A.; May, Georgiana; Francis, David; Knudson, Dennis; Parkhill, Julian; Ishimaru, Carol A.

2008-01-01

Clavibacter michiganensis subsp. sepedonicus is a plant-pathogenic bacterium and the causative agent of bacterial ring rot, a devastating agricultural disease under strict quarantine control and zero tolerance in the seed potato industry. This organism appears to be largely restricted to an endophytic lifestyle, proliferating within plant tissues and unable to persist in the absence of plant material. Analysis of the genome sequence of C. michiganensis subsp. sepedonicus and comparison with the genome sequences of related plant pathogens revealed a dramatic recent evolutionary history. The genome contains 106 insertion sequence elements, which appear to have been active in extensive rearrangement of the chromosome compared to that of Clavibacter michiganensis subsp. michiganensis. There are 110 pseudogenes with overrepresentation in functions associated with carbohydrate metabolism, transcriptional regulation, and pathogenicity. Genome comparisons also indicated that there is substantial gene content diversity within the species, probably due to differential gene acquisition and loss. These genomic features and evolutionary dating suggest that there was recent adaptation for life in a restricted niche where nutrient diversity and perhaps competition are low, correlated with a reduced ability to exploit previously occupied complex niches outside the plant. Toleration of factors such as multiplication and integration of insertion sequence elements, genome rearrangements, and functional disruption of many genes and operons seems to indicate that there has been general relaxation of selective pressure on a large proportion of the genome. PMID:18192393
Molecular characterization, tissue distribution and expression analysis of TRIM25 in Gallus gallus domesticus.

PubMed

Feng, Ze-Qing; Cheng, Yang; Yang, Hui-Ling; Zhu, Qing; Yu, Dandan; Liu, Yi-Ping

2015-04-25

TRIM25, a member of the tripartite motif-containing (TRIM) family of proteins, plays an important role in cell proliferation, protein modification, and the RIG-I-mediated antiviral signaling pathway. However, relatively few studies have investigated the molecular characterization, tissue distribution, and potential function of TRIM25 in chickens. In this study, we cloned the full-length cDNA of chicken TRIM25 that is composed of 2706 bp. Sequence analyses revealed that TRIM25 contains a 1902-bp open-reading frame that probably encodes a 633-amino acid protein. Multiple comparisons with deduced amino acid sequences revealed that the RING finger and B30.2 domains of chicken TRIM25 share a high sequence similarity with human and murine TRIM25, indicating that these domains are critical for the function of chicken TRIM25. qPCR assays revealed that TRIM25 is highly expressed in the spleen, thymus and lungs in chickens. Furthermore, we observed that TRIM25 expression was significantly upregulated both in vitro and in vivo following infection with Newcastle disease virus. TRIM25 expression was also significantly upregulated in chicken embryo fibroblasts upon stimulation with poly(I:C) or poly(dA:dT). Taken together, these findings suggest that TRIM25 plays an important role in antiviral signaling pathways in chickens. Copyright © 2015 Elsevier B.V. All rights reserved.
Characterization of the breakpoints of a polymorphic inversion complex detects strict and broad breakpoint reuse at the molecular level.

PubMed

Puerma, Eva; Orengo, Dorcas J; Salguero, David; Papaceit, Montserrat; Segarra, Carmen; Aguadé, Montserrat

2014-09-01

Inversions are an integral part of structural variation within species, and they play a leading role in genome reorganization across species. Work at both the cytological and genome sequence levels has revealed heterogeneity in the distribution of inversion breakpoints, with some regions being recurrently used. Breakpoint reuse at the molecular level has mostly been assessed for fixed inversions through genome sequence comparison, and therefore rather broadly. Here, we have identified and sequenced the breakpoints of two polymorphic inversions-E1 and E2 that share a breakpoint-in the extant Est and E1 + 2 chromosomal arrangements of Drosophila subobscura. The breakpoints are two medium-sized repeated motifs that mediated the inversions by two different mechanisms: E1 via staggered breaks and subsequent repair and E2 via repeat-mediated ectopic recombination. The fine delimitation of the shared breakpoint revealed its strict reuse at the molecular level regardless of which was the intermediate arrangement. The occurrence of other rearrangements in the most proximal and distal extended breakpoint regions reveals the broad reuse of these regions. This differential degree of fragility might be related to their sharing the presence outside the inverted region of snoRNA-encoding genes. © The Author 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Molecular Characterization of Tomato 3-Dehydroquinate Dehydratase-Shikimate:NADP Oxidoreductase1

PubMed Central

Bischoff, Markus; Schaller, Andreas; Bieri, Fabian; Kessler, Felix; Amrhein, Nikolaus; Schmid, Jürg

2001-01-01

Analysis of cDNAs encoding the bifunctional 3-dehydroquinate dehydratase-shikimate:NADP oxidoreductase (DHQase-SORase) from tomato (Lycopersicon esculentum) revealed two classes of cDNAs that differed by 57 bp within the coding regions, but were otherwise identical. Comparison of these cDNA sequences with the sequence of the corresponding single gene unequivocally proved that the primary transcript is differentially spliced, potentially giving rise to two polypeptides that differ by 19 amino acids. Quantitative real-time polymerase chain reaction revealed that the longer transcript constitutes at most 1% to 2% of DHQase-SORase transcripts. Expression of the respective polypeptides in Escherichia coli mutants lacking the DHQase or the SORase activity gave functional complementation only in case of the shorter polypeptide, indicating that skipping of a potential exon is a prerequisite for the production of an enzymatically active protein. The deduced amino acid sequence revealed that the DHQase-SORase is most likely synthesized as a precursor with a very short (13-amino acid) plastid-specific transit peptide. Like other genes encoding enzymes of the prechorismate pathway in tomato, this gene is elicitor-inducible. Tissue-specific expression resembles the patterns obtained for 3-deoxy-d-arabino-heptulosonate 7-phosphate synthase 2 and dehydroquinate synthase genes. This work completes our studies of the prechorismate pathway in that cDNAs for all seven enzymes (including isozymes) of the prechorismate pathway from tomato have now been characterized. PMID:11299368
Bph32, a novel gene encoding an unknown SCR domain-containing protein, confers resistance against the brown planthopper in rice.

PubMed

Ren, Juansheng; Gao, Fangyuan; Wu, Xianting; Lu, Xianjun; Zeng, Lihua; Lv, Jianqun; Su, Xiangwen; Luo, Hong; Ren, Guangjun

2016-11-23

An urgent need exists to identify more brown planthopper (Nilaparvata lugens Stål, BPH) resistance genes, which will allow the development of rice varieties with resistance to BPH to counteract the increased incidence of this pest species. Here, using bioinformatics and DNA sequencing approaches, we identified a novel BPH resistance gene, LOC_Os06g03240 (MSU LOCUS ID), from the rice variety Ptb33 in the interval between the markers RM19291 and RM8072 on the short arm of chromosome 6, where a gene for resistance to BPH was mapped by Jirapong Jairin et al. and renamed as "Bph32". This gene encodes a unique short consensus repeat (SCR) domain protein. Sequence comparison revealed that the Bph32 gene shares 100% sequence identity with its allele in Oryza latifolia. The transgenic introgression of Bph32 into a susceptible rice variety significantly improved resistance to BPH. Expression analysis revealed that Bph32 was highly expressed in the leaf sheaths, where BPH primarily settles and feeds, at 2 and 24 h after BPH infestation, suggesting that Bph32 may inhibit feeding in BPH. Western blotting revealed the presence of Pph (Ptb33) and Tph (TN1) proteins using a Penta-His antibody, and both proteins were insoluble. This study provides information regarding a valuable gene for rice defence against insect pests.
Bph32, a novel gene encoding an unknown SCR domain-containing protein, confers resistance against the brown planthopper in rice

PubMed Central

Ren, Juansheng; Gao, Fangyuan; Wu, Xianting; Lu, Xianjun; Zeng, Lihua; Lv, Jianqun; Su, Xiangwen; Luo, Hong; Ren, Guangjun

2016-01-01

An urgent need exists to identify more brown planthopper (Nilaparvata lugens Stål, BPH) resistance genes, which will allow the development of rice varieties with resistance to BPH to counteract the increased incidence of this pest species. Here, using bioinformatics and DNA sequencing approaches, we identified a novel BPH resistance gene, LOC_Os06g03240 (MSU LOCUS ID), from the rice variety Ptb33 in the interval between the markers RM19291 and RM8072 on the short arm of chromosome 6, where a gene for resistance to BPH was mapped by Jirapong Jairin et al. and renamed as “Bph32”. This gene encodes a unique short consensus repeat (SCR) domain protein. Sequence comparison revealed that the Bph32 gene shares 100% sequence identity with its allele in Oryza latifolia. The transgenic introgression of Bph32 into a susceptible rice variety significantly improved resistance to BPH. Expression analysis revealed that Bph32 was highly expressed in the leaf sheaths, where BPH primarily settles and feeds, at 2 and 24 h after BPH infestation, suggesting that Bph32 may inhibit feeding in BPH. Western blotting revealed the presence of Pph (Ptb33) and Tph (TN1) proteins using a Penta-His antibody, and both proteins were insoluble. This study provides information regarding a valuable gene for rice defence against insect pests. PMID:27876888

The microbiome of Brazilian mangrove sediments as revealed by metagenomics.

PubMed

Andreote, Fernando Dini; Jiménez, Diego Javier; Chaves, Diego; Dias, Armando Cavalcante Franco; Luvizotto, Danice Mazzer; Dini-Andreote, Francisco; Fasanella, Cristiane Cipola; Lopez, Maryeimy Varon; Baena, Sandra; Taketani, Rodrigo Gouvêa; de Melo, Itamar Soares

2012-01-01

Here we embark in a deep metagenomic survey that revealed the taxonomic and potential metabolic pathways aspects of mangrove sediment microbiology. The extraction of DNA from sediment samples and the direct application of pyrosequencing resulted in approximately 215 Mb of data from four distinct mangrove areas (BrMgv01 to 04) in Brazil. The taxonomic approaches applied revealed the dominance of Deltaproteobacteria and Gammaproteobacteria in the samples. Paired statistical analysis showed higher proportions of specific taxonomic groups in each dataset. The metabolic reconstruction indicated the possible occurrence of processes modulated by the prevailing conditions found in mangrove sediments. In terms of carbon cycling, the sequences indicated the prevalence of genes involved in the metabolism of methane, formaldehyde, and carbon dioxide. With respect to the nitrogen cycle, evidence for sequences associated with dissimilatory reduction of nitrate, nitrogen immobilization, and denitrification was detected. Sequences related to the production of adenylsulfate, sulfite, and H(2)S were relevant to the sulphur cycle. These data indicate that the microbial core involved in methane, nitrogen, and sulphur metabolism consists mainly of Burkholderiaceae, Planctomycetaceae, Rhodobacteraceae, and Desulfobacteraceae. Comparison of our data to datasets from soil and sea samples resulted in the allotment of the mangrove sediments between those samples. The results of this study add valuable data about the composition of microbial communities in mangroves and also shed light on possible transformations promoted by microbial organisms in mangrove sediments.
The Microbiome of Brazilian Mangrove Sediments as Revealed by Metagenomics

PubMed Central

Andreote, Fernando Dini; Jiménez, Diego Javier; Chaves, Diego; Dias, Armando Cavalcante Franco; Luvizotto, Danice Mazzer; Dini-Andreote, Francisco; Fasanella, Cristiane Cipola; Lopez, Maryeimy Varon; Baena, Sandra; Taketani, Rodrigo Gouvêa; de Melo, Itamar Soares

2012-01-01

Here we embark in a deep metagenomic survey that revealed the taxonomic and potential metabolic pathways aspects of mangrove sediment microbiology. The extraction of DNA from sediment samples and the direct application of pyrosequencing resulted in approximately 215 Mb of data from four distinct mangrove areas (BrMgv01 to 04) in Brazil. The taxonomic approaches applied revealed the dominance of Deltaproteobacteria and Gammaproteobacteria in the samples. Paired statistical analysis showed higher proportions of specific taxonomic groups in each dataset. The metabolic reconstruction indicated the possible occurrence of processes modulated by the prevailing conditions found in mangrove sediments. In terms of carbon cycling, the sequences indicated the prevalence of genes involved in the metabolism of methane, formaldehyde, and carbon dioxide. With respect to the nitrogen cycle, evidence for sequences associated with dissimilatory reduction of nitrate, nitrogen immobilization, and denitrification was detected. Sequences related to the production of adenylsulfate, sulfite, and H2S were relevant to the sulphur cycle. These data indicate that the microbial core involved in methane, nitrogen, and sulphur metabolism consists mainly of Burkholderiaceae, Planctomycetaceae, Rhodobacteraceae, and Desulfobacteraceae. Comparison of our data to datasets from soil and sea samples resulted in the allotment of the mangrove sediments between those samples. The results of this study add valuable data about the composition of microbial communities in mangroves and also shed light on possible transformations promoted by microbial organisms in mangrove sediments. PMID:22737213
Transcriptome Analysis and Comparison of Marmota monax and Marmota himalayana.

PubMed

Liu, Yanan; Wang, Baoju; Wang, Lu; Vikash, Vikash; Wang, Qin; Roggendorf, Michael; Lu, Mengji; Yang, Dongliang; Liu, Jia

2016-01-01

The Eastern woodchuck (Marmota monax) is a classical animal model for studying hepatitis B virus (HBV) infection and hepatocellular carcinoma (HCC) in humans. Recently, we found that Marmota himalayana, an Asian animal species closely related to Marmota monax, is susceptible to woodchuck hepatitis virus (WHV) infection and can be used as a new mammalian model for HBV infection. However, the lack of genomic sequence information of both Marmota models strongly limited their application breadth and depth. To address this major obstacle of the Marmota models, we utilized Illumina RNA-Seq technology to sequence the cDNA libraries of liver and spleen samples of two Marmota monax and four Marmota himalayana. In total, over 13 billion nucleotide bases were sequenced and approximately 1.5 billion clean reads were obtained. Following assembly, 106,496 consensus sequences of Marmota monax and 78,483 consensus sequences of Marmota himalayana were detected. For functional annotation, in total 73,603 Unigenes of Marmota monax and 78,483 Unigenes of Marmota himalayana were identified using different databases (NR, NT, Swiss-Prot, KEGG, COG, GO). The Unigenes were aligned by blastx to protein databases to decide the coding DNA sequences (CDS) and in total 41,247 CDS of Marmota monax and 34,033 CDS of Marmota himalayana were predicted. The single nucleotide polymorphisms (SNPs) and the simple sequence repeats (SSRs) were also analyzed for all Unigenes obtained. Moreover, a large-scale transcriptome comparison was performed and revealed a high similarity in transcriptome sequences between the two marmota species. Our study provides an extensive amount of novel sequence information for Marmota monax and Marmota himalayana. This information may serve as a valuable genomics resource for further molecular, developmental and comparative evolutionary studies, as well as for the identification and characterization of functional genes that are involved in WHV infection and HCC development in the woodchuck model.
Transcriptome Analysis and Comparison of Marmota monax and Marmota himalayana

PubMed Central

Wang, Lu; Vikash, Vikash; Wang, Qin; Roggendorf, Michael; Lu, Mengji; Yang, Dongliang; Liu, Jia

2016-01-01

The Eastern woodchuck (Marmota monax) is a classical animal model for studying hepatitis B virus (HBV) infection and hepatocellular carcinoma (HCC) in humans. Recently, we found that Marmota himalayana, an Asian animal species closely related to Marmota monax, is susceptible to woodchuck hepatitis virus (WHV) infection and can be used as a new mammalian model for HBV infection. However, the lack of genomic sequence information of both Marmota models strongly limited their application breadth and depth. To address this major obstacle of the Marmota models, we utilized Illumina RNA-Seq technology to sequence the cDNA libraries of liver and spleen samples of two Marmota monax and four Marmota himalayana. In total, over 13 billion nucleotide bases were sequenced and approximately 1.5 billion clean reads were obtained. Following assembly, 106,496 consensus sequences of Marmota monax and 78,483 consensus sequences of Marmota himalayana were detected. For functional annotation, in total 73,603 Unigenes of Marmota monax and 78,483 Unigenes of Marmota himalayana were identified using different databases (NR, NT, Swiss-Prot, KEGG, COG, GO). The Unigenes were aligned by blastx to protein databases to decide the coding DNA sequences (CDS) and in total 41,247 CDS of Marmota monax and 34,033 CDS of Marmota himalayana were predicted. The single nucleotide polymorphisms (SNPs) and the simple sequence repeats (SSRs) were also analyzed for all Unigenes obtained. Moreover, a large-scale transcriptome comparison was performed and revealed a high similarity in transcriptome sequences between the two marmota species. Our study provides an extensive amount of novel sequence information for Marmota monax and Marmota himalayana. This information may serve as a valuable genomics resource for further molecular, developmental and comparative evolutionary studies, as well as for the identification and characterization of functional genes that are involved in WHV infection and HCC development in the woodchuck model. PMID:27806133
Complete sequence of two tick-borne flaviviruses isolated from Siberia and the UK: analysis and significance of the 5' and 3'-UTRs.

PubMed

Gritsun, T S; Venugopal, K; Zanotto, P M; Mikhailov, M V; Sall, A A; Holmes, E C; Polkinghorne, I; Frolova, T V; Pogodina, V V; Lashkevich, V A; Gould, E A

1997-05-01

The complete nucleotide sequence of two tick-transmitted flaviviruses, Vasilchenko (Vs) from Siberia and louping ill (LI) from the UK, have been determined. The genomes were respectively, 10928 and 10871 nucleotides (nt) in length. The coding strategy and functional protein sequence motifs of tick-borne flaviviruses are presented in both Vs and LI viruses. The phylogenies based on maximum likelihood, maximum parsimony and distance analysis of the polyproteins, identified Vs virus as a member of the tick-borne encephalitis virus subgroup within the tick-borne serocomplex, genus Flavivirus, family Flaviviridae. Comparative alignment of the 3'-untranslated regions revealed deletions of different lengths essentially at the same position downstream of the stop codon for all tick-borne viruses. Two direct 27 nucleotide repeats at the 3'-end were found only for Vs and LI virus. Immediately following the deletions a region of 332-334 nt with relatively conserved primary structure (67-94% identity) was observed at the 3'-non-coding end of the virus genome. Pairwise comparisons of the nucleotide sequence data revealed similar levels of variation between the coding region, and the 5' and 3'-termini of the genome, implying an equivalent strong selective control for translated and untranslated regions. Indeed the predicted folding of the 5' and 3'-untranslated regions revealed patterns of stem and loop structures conserved for all tick-borne flaviviruses suggesting a purifying selection for preservation of essential RNA secondary structures which could be involved in translational control and replication. The possible implications of these findings are discussed.
DNA microarray-based genome comparison of a pathogenic and a nonpathogenic strain of Xylella fastidiosa delineates genes important for bacterial virulence.

PubMed

Koide, Tie; Zaini, Paulo A; Moreira, Leandro M; Vêncio, Ricardo Z N; Matsukuma, Adriana Y; Durham, Alan M; Teixeira, Diva C; El-Dorry, Hamza; Monteiro, Patrícia B; da Silva, Ana Claudia R; Verjovski-Almeida, Sergio; da Silva, Aline M; Gomes, Suely L

2004-08-01

Xylella fastidiosa is a phytopathogenic bacterium that causes serious diseases in a wide range of economically important crops. Despite extensive comparative analyses of genome sequences of Xylella pathogenic strains from different plant hosts, nonpathogenic strains have not been studied. In this report, we show that X. fastidiosa strain J1a12, associated with citrus variegated chlorosis (CVC), is nonpathogenic when injected into citrus and tobacco plants. Furthermore, a DNA microarray-based comparison of J1a12 with 9a5c, a CVC strain that is highly pathogenic and had its genome completely sequenced, revealed that 14 coding sequences of strain 9a5c are absent or highly divergent in strain J1a12. Among them, we found an arginase and a fimbrial adhesin precursor of type III pilus, which were confirmed to be absent in the nonpathogenic strain by PCR and DNA sequencing. The absence of arginase can be correlated to the inability of J1a12 to multiply in host plants. This enzyme has been recently shown to act as a bacterial survival mechanism by down-regulating host nitric oxide production. The lack of the adhesin precursor gene is in accordance with the less aggregated phenotype observed for J1a12 cells growing in vitro. Thus, the absence of both genes can be associated with the failure of the J1a12 strain to establish and spread in citrus and tobacco plants. These results provide the first detailed comparison between a nonpathogenic strain and a pathogenic strain of X. fastidiosa, constituting an important step towards understanding the molecular basis of the disease.
DNA Microarray-Based Genome Comparison of a Pathogenic and a Nonpathogenic Strain of Xylella fastidiosa Delineates Genes Important for Bacterial Virulence†

PubMed Central

Koide, Tie; Zaini, Paulo A.; Moreira, Leandro M.; Vêncio, Ricardo Z. N.; Matsukuma, Adriana Y.; Durham, Alan M.; Teixeira, Diva C.; El-Dorry, Hamza; Monteiro, Patrícia B.; da Silva, Ana Claudia R.; Verjovski-Almeida, Sergio; da Silva, Aline M.; Gomes, Suely L.

2004-01-01

Xylella fastidiosa is a phytopathogenic bacterium that causes serious diseases in a wide range of economically important crops. Despite extensive comparative analyses of genome sequences of Xylella pathogenic strains from different plant hosts, nonpathogenic strains have not been studied. In this report, we show that X. fastidiosa strain J1a12, associated with citrus variegated chlorosis (CVC), is nonpathogenic when injected into citrus and tobacco plants. Furthermore, a DNA microarray-based comparison of J1a12 with 9a5c, a CVC strain that is highly pathogenic and had its genome completely sequenced, revealed that 14 coding sequences of strain 9a5c are absent or highly divergent in strain J1a12. Among them, we found an arginase and a fimbrial adhesin precursor of type III pilus, which were confirmed to be absent in the nonpathogenic strain by PCR and DNA sequencing. The absence of arginase can be correlated to the inability of J1a12 to multiply in host plants. This enzyme has been recently shown to act as a bacterial survival mechanism by down-regulating host nitric oxide production. The lack of the adhesin precursor gene is in accordance with the less aggregated phenotype observed for J1a12 cells growing in vitro. Thus, the absence of both genes can be associated with the failure of the J1a12 strain to establish and spread in citrus and tobacco plants. These results provide the first detailed comparison between a nonpathogenic strain and a pathogenic strain of X. fastidiosa, constituting an important step towards understanding the molecular basis of the disease. PMID:15292146
A genetic linkage map for the apicomplexan protozoan parasite Eimeria maxima and comparison with Eimeria tenella.

PubMed

Blake, Damer P; Oakes, Richard; Smith, Adrian L

2011-02-01

Eimeria maxima is one of the seven Eimeria spp. that infect the chicken and cause the disease coccidiosis. The well characterised immunogenicity and genetic diversity associated with E. maxima promote its use in genetics-led studies on avian coccidiosis. The development of a genetic map for E. maxima, presented here based upon 647 amplified fragment length polymorphism markers typed from 22 clonal hybrid lines and assembled into 13 major linkage groups, is a major new resource for work with this parasite. Comparison with genetic maps produced for other coccidial parasites indicates relatively high levels of genetic recombination. Conversion of ∼14% of the markers representing the major linkage groups to sequence characterised amplified region markers can provide a scaffold for the assembly of future genomic sequences as well as providing a foundation for more detailed genetic maps. Comparison with the Eimeria tenella genetic map produced 10years ago has revealed a less biased marker distribution, with no more than nine markers mapped within any unresolved heritable unit. Nonetheless, preliminary bioinformatic characterisation of the three largest publicly available genomic E. maxima sequences suggest that the feature-poor/feature-rich structure which has previously been found to define the first sequenced E. tenella chromosome also defines the E. maxima genome. The significance of such a segmented genome and the apparent potential for variation in genetic recombination will be relevant to haplotype stability and the longevity of future anticoccidial strategies based upon multiple loci targeted by novel chemotherapeutic drugs or recombinant subunit vaccines. Copyright © 2010 Australian Society for Parasitology Inc. Published by Elsevier Ltd. All rights reserved.
Complete genome sequence of Pseudomonas stutzeri strain RCH2 isolated from a Hexavalent Chromium [Cr(VI)] contaminated site

DOE Office of Scientific and Technical Information (OSTI.GOV)

Chakraborty, Romy; Woo, Hannah; Dehal, Paramvir

Hexavalent Chromium [Cr(VI)] is a widespread contaminant found in soil, sediment, and ground water in several DOE sites, including Hanford 100 H area. In order to stimulate microbially mediated reduction of Cr(VI) at this site, a poly-lactate hydrogen release compound was injected into the chromium contaminated aquifer. The targeted enrichment of dominant nitrate-reducing bacteria post injection resulted in the isolation of Pseudomonas stutzeri strain RCH2. P. stutzeri strain RCH2 was isolated using acetate as the electron donor and is a complete denitrifier. Experiments with anaerobic washed cell suspension of strain RCH2 revealed it could reduce Cr(VI) and Fe(III). We sequencedmore » the genome of strain RCH2 using a combination of Illumina and 454 sequencing technologies and contained a circular chromosome of 4.6 Mb and three plasmids. Furthermore, global genome comparisons of strain RCH2 with six other fully sequenced P. stutzeri strains revealed most genomic regions are conserved, however strain RCH2 has an additional 244 genes, some of which are involved in chemotaxis, Flp pilus biogenesis and pyruvate/2-oxogluturate complex formation.« less
In silico analysis of L-asparaginase from different source organisms.

PubMed

Dwivedi, Vivek Dhar; Mishra, Sarad Kumar

2014-06-01

L-asparaginases are widely distributed enzymes among plants, fungi and bacteria. This enzyme catalyzes the conversion of l-asparagine to l-aspartate and ammonia and to a lesser extent the formation of l-glutamate from l-glutamine. In the present study, forty-five full-length amino acid sequences of L-asparaginases from bacteria, fungi and plants were collected and subjected to multiple sequence alignment (MSA), domain identification, discovering individual amino acid composition, and phylogenetic tree construction. MSA revealed that two glycine residues were identically found in all analyzed species, two glycine residues were also identically found in all the fungal and bacterial sources and three glycine residues were identically found in all plant and bacterial sources while no residue was identically found in plant and fungal L-asparaginases. Two major sequence clusters were constructed by phylogenetic analysis. One cluster contains eleven species of fungi, twelve species of bacteria, and one species of plant, whereas the other one contains fourteen species of plant, four species of fungi and three species bacteria. The amino acid composition result revealed that the average frequency of amino acid alanine is 10.77 percent that is very high in comparison to other amino acids in all analyzed species.
Complete genome sequence of Pseudomonas stutzeri strain RCH2 isolated from a Hexavalent Chromium [Cr(VI)] contaminated site

DOE PAGES

Chakraborty, Romy; Woo, Hannah; Dehal, Paramvir; ...

2017-02-08

Hexavalent Chromium [Cr(VI)] is a widespread contaminant found in soil, sediment, and ground water in several DOE sites, including Hanford 100 H area. In order to stimulate microbially mediated reduction of Cr(VI) at this site, a poly-lactate hydrogen release compound was injected into the chromium contaminated aquifer. The targeted enrichment of dominant nitrate-reducing bacteria post injection resulted in the isolation of Pseudomonas stutzeri strain RCH2. P. stutzeri strain RCH2 was isolated using acetate as the electron donor and is a complete denitrifier. Experiments with anaerobic washed cell suspension of strain RCH2 revealed it could reduce Cr(VI) and Fe(III). We sequencedmore » the genome of strain RCH2 using a combination of Illumina and 454 sequencing technologies and contained a circular chromosome of 4.6 Mb and three plasmids. Furthermore, global genome comparisons of strain RCH2 with six other fully sequenced P. stutzeri strains revealed most genomic regions are conserved, however strain RCH2 has an additional 244 genes, some of which are involved in chemotaxis, Flp pilus biogenesis and pyruvate/2-oxogluturate complex formation.« less
Comparative Analysis of the Peanut Witches'-Broom Phytoplasma Genome Reveals Horizontal Transfer of Potential Mobile Units and Effectors

PubMed Central

Lo, Wen-Sui; Lin, Chan-Pin; Kuo, Chih-Horng

2013-01-01

Phytoplasmas are a group of bacteria that are associated with hundreds of plant diseases. Due to their economical importance and the difficulties involved in the experimental study of these obligate pathogens, genome sequencing and comparative analysis have been utilized as powerful tools to understand phytoplasma biology. To date four complete phytoplasma genome sequences have been published. However, these four strains represent limited phylogenetic diversity. In this study, we report the shotgun sequencing and evolutionary analysis of a peanut witches'-broom (PnWB) phytoplasma genome. The availability of this genome provides the first representative of the 16SrII group and substantially improves the taxon sampling to investigate genome evolution. The draft genome assembly contains 13 chromosomal contigs with a total size of 562,473 bp, covering ∼90% of the chromosome. Additionally, a complete plasmid sequence is included. Comparisons among the five available phytoplasma genomes reveal the differentiations in gene content and metabolic capacity. Notably, phylogenetic inferences of the potential mobile units (PMUs) in these genomes indicate that horizontal transfer may have occurred between divergent phytoplasma lineages. Because many effectors are associated with PMUs, the horizontal transfer of these transposon-like elements can contribute to the adaptation and diversification of these pathogens. In summary, the findings from this study highlight the importance of improving taxon sampling when investigating genome evolution. Moreover, the currently available sequences are inadequate to fully characterize the pan-genome of phytoplasmas. Future genome sequencing efforts to expand phylogenetic diversity are essential in improving our understanding of phytoplasma evolution. PMID:23626855
Analysis of eight out genes in a cluster required for pectic enzyme secretion by Erwinia chrysanthemi: sequence comparison with secretion genes from other gram-negative bacteria.

PubMed Central

Lindeberg, M; Collmer, A

1992-01-01

Many extracellular proteins produced by Erwinia chrysanthemi require the out gene products for transport across the outer membrane. In a previous report (S. Y. He, M. Lindeberg, A. K. Chatterjee, and A. Collmer, Proc. Natl. Acad. Sci. USA 88:1079-1083, 1991) cosmid pCPP2006, sufficient for secretion of Erwinia chrysanthemi extracellular proteins by Escherichia coli, was partially sequenced, revealing four out genes sharing high homology with pulH through pulK from Klebsiella oxytoca. The nucleotide sequence of eight additional out genes reveals homology with pulC through pulG, pulL, pulM, pulO, and other genes involved in secretion by various gram-negative bacteria. Although signal sequences and hydrophobic regions are generally conserved between Pul and Out proteins, four out genes contain unique inserts, a pulN homolog is not present, and outO appears to be transcribed separately from outC through outM. The sequenced region was subcloned, and an additional 7.6-kb region upstream was identified as being required for secretion in E. coli. out gene homologs were found on Erwinia carotovora cosmid clone pAKC651 but were not detected in E. coli. The outC-through-outM operon is weakly induced by polygalacturonic acid and strongly expressed in the early stationary phase. The out and pul genes are highly similar in sequence, hydropathic properties, and overall arrangement but differ in both transcriptional organization and the nature of their induction. Images PMID:1429461
Complete chloroplast genome sequence and comparative analysis of loblolly pine (Pinus taeda L.) with related species

PubMed Central

Khan, Abdul Latif; Khan, Muhammad Aaqil; Shahzad, Raheem; Lubna; Kang, Sang Mo; Al-Harrasi, Ahmed; Al-Rawahi, Ahmed; Lee, In-Jung

2018-01-01

Pinaceae, the largest family of conifers, has a diversified organization of chloroplast (cp) genomes with two typical highly reduced inverted repeats (IRs). In the current study, we determined the complete sequence of the cp genome of an economically and ecologically important conifer tree, the loblolly pine (Pinus taeda L.), using Illumina paired-end sequencing and compared the sequence with those of other pine species. The results revealed a genome size of 121,531 base pairs (bp) containing a pair of 830-bp IR regions, distinguished by a small single copy (42,258 bp) and large single copy (77,614 bp) region. The chloroplast genome of P. taeda encodes 120 genes, comprising 81 protein-coding genes, four ribosomal RNA genes, and 35 tRNA genes, with 151 randomly distributed microsatellites. Approximately 6 palindromic, 34 forward, and 22 tandem repeats were found in the P. taeda cp genome. Whole cp genome comparison with those of other Pinus species exhibited an overall high degree of sequence similarity, with some divergence in intergenic spacers. Higher and lower numbers of indels and single-nucleotide polymorphism substitutions were observed relative to P. contorta and P. monophylla, respectively. Phylogenomic analyses based on the complete genome sequence revealed that 60 shared genes generated trees with the same topologies, and P. taeda was closely related to P. contorta in the subgenus Pinus. Thus, the complete P. taeda genome provided valuable resources for population and evolutionary studies of gymnosperms and can be used to identify related species. PMID:29596414
Comparative Genomics Analyses Reveal Extensive Chromosome Colinearity and Novel Quantitative Trait Loci in Eucalyptus.

PubMed

Li, Fagen; Zhou, Changpin; Weng, Qijie; Li, Mei; Yu, Xiaoli; Guo, Yong; Wang, Yu; Zhang, Xiaohong; Gan, Siming

2015-01-01

Dense genetic maps, along with quantitative trait loci (QTLs) detected on such maps, are powerful tools for genomics and molecular breeding studies. In the important woody genus Eucalyptus, the recent release of E. grandis genome sequence allows for sequence-based genomic comparison and searching for positional candidate genes within QTL regions. Here, dense genetic maps were constructed for E. urophylla and E. tereticornis using genomic simple sequence repeats (SSR), expressed sequence tag (EST) derived SSR, EST-derived cleaved amplified polymorphic sequence (EST-CAPS), and diversity arrays technology (DArT) markers. The E. urophylla and E. tereticornis maps comprised 700 and 585 markers across 11 linkage groups, totaling at 1,208.2 and 1,241.4 cM in length, respectively. Extensive synteny and colinearity were observed as compared to three earlier DArT-based eucalypt maps (two maps with E. grandis × E. urophylla and one map of E. globulus) and with the E. grandis genome sequence. Fifty-three QTLs for growth (10-56 months of age) and wood density (56 months) were identified in 22 discrete regions on both maps, in which only one colocalizaiton was found between growth and wood density. Novel QTLs were revealed as compared with those previously detected on DArT-based maps for similar ages in Eucalyptus. Eleven to 585 positional candidate genes were obained for a 56-month-old QTL through aligning QTL confidence interval with the E. grandis genome. These results will assist in comparative genomics studies, targeted gene characterization, and marker-assisted selection in Eucalyptus and the related taxa.
Comparative Genomics Analyses Reveal Extensive Chromosome Colinearity and Novel Quantitative Trait Loci in Eucalyptus

PubMed Central

Weng, Qijie; Li, Mei; Yu, Xiaoli; Guo, Yong; Wang, Yu; Zhang, Xiaohong; Gan, Siming

2015-01-01

Dense genetic maps, along with quantitative trait loci (QTLs) detected on such maps, are powerful tools for genomics and molecular breeding studies. In the important woody genus Eucalyptus, the recent release of E. grandis genome sequence allows for sequence-based genomic comparison and searching for positional candidate genes within QTL regions. Here, dense genetic maps were constructed for E. urophylla and E. tereticornis using genomic simple sequence repeats (SSR), expressed sequence tag (EST) derived SSR, EST-derived cleaved amplified polymorphic sequence (EST-CAPS), and diversity arrays technology (DArT) markers. The E. urophylla and E. tereticornis maps comprised 700 and 585 markers across 11 linkage groups, totaling at 1,208.2 and 1,241.4 cM in length, respectively. Extensive synteny and colinearity were observed as compared to three earlier DArT-based eucalypt maps (two maps with E. grandis × E. urophylla and one map of E. globulus) and with the E. grandis genome sequence. Fifty-three QTLs for growth (10–56 months of age) and wood density (56 months) were identified in 22 discrete regions on both maps, in which only one colocalizaiton was found between growth and wood density. Novel QTLs were revealed as compared with those previously detected on DArT-based maps for similar ages in Eucalyptus. Eleven to 585 positional candidate genes were obained for a 56-month-old QTL through aligning QTL confidence interval with the E. grandis genome. These results will assist in comparative genomics studies, targeted gene characterization, and marker-assisted selection in Eucalyptus and the related taxa. PMID:26695430
Genetic diversity of the merozoite surface protein-3 gene in Plasmodium falciparum populations in Thailand.

PubMed

Pattaradilokrat, Sittiporn; Sawaswong, Vorthon; Simpalipan, Phumin; Kaewthamasorn, Morakot; Siripoon, Napaporn; Harnyuttanakorn, Pongchai

2016-10-21

An effective malaria vaccine is an urgently needed tool to fight against human malaria, the most deadly parasitic disease of humans. One promising candidate is the merozoite surface protein-3 (MSP-3) of Plasmodium falciparum. This antigenic protein, encoded by the merozoite surface protein (msp-3) gene, is polymorphic and classified according to size into the two allelic types of K1 and 3D7. A recent study revealed that both the K1 and 3D7 alleles co-circulated within P. falciparum populations in Thailand, but the extent of the sequence diversity and variation within each allelic type remains largely unknown. The msp-3 gene was sequenced from 59 P. falciparum samples collected from five endemic areas (Mae Hong Son, Kanchanaburi, Ranong, Trat and Ubon Ratchathani) in Thailand and analysed for nucleotide sequence diversity, haplotype diversity and deduced amino acid sequence diversity. The gene was also subject to population genetic analysis (F st ) and neutrality tests (Tajima's D, Fu and Li D* and Fu and Li' F* tests) to determine any signature of selection. The sequence analyses revealed eight unique DNA haplotypes and seven amino acid sequence variants, with a haplotype and nucleotide diversity of 0.828 and 0.049, respectively. Neutrality tests indicated that the polymorphism detected in the alanine heptad repeat region of MSP-3 was maintained by positive diversifying selection, suggesting its role as a potential target of protective immune responses and supporting its role as a vaccine candidate. Comparison of MSP-3 variants among parasite populations in Thailand, India and Nigeria also inferred a close genetic relationship between P. falciparum populations in Asia. This study revealed the extent of the msp-3 gene diversity in P. falciparum in Thailand, providing the fundamental basis for the better design of future blood stage malaria vaccines against P. falciparum.
Nullomers and High Order Nullomers in Genomic Sequences

PubMed Central

Vergni, Davide; Santoni, Daniele

2016-01-01

A nullomer is an oligomer that does not occur as a subsequence in a given DNA sequence, i.e. it is an absent word of that sequence. The importance of nullomers in several applications, from drug discovery to forensic practice, is now debated in the literature. Here, we investigated the nature of nullomers, whether their absence in genomes has just a statistical explanation or it is a peculiar feature of genomic sequences. We introduced an extension of the notion of nullomer, namely high order nullomers, which are nullomers whose mutated sequences are still nullomers. We studied different aspects of them: comparison with nullomers of random sequences, CpG distribution and mean helical rise. In agreement with previous results we found that the number of nullomers in the human genome is much larger than expected by chance. Nevertheless antithetical results were found when considering a random DNA sequence preserving dinucleotide frequencies. The analysis of CpG frequencies in nullomers and high order nullomers revealed, as expected, a high CpG content but it also highlighted a strong dependence of CpG frequencies on the dinucleotide position, suggesting that nullomers have their own peculiar structure and are not simply sequences whose CpG frequency is biased. Furthermore, phylogenetic trees were built on eleven species based on both the similarities between the dinucleotide frequencies and the number of nullomers two species share, showing that nullomers are fairly conserved among close species. Finally the study of mean helical rise of nullomers sequences revealed significantly high mean rise values, reinforcing the hypothesis that those sequences have some peculiar structural features. The obtained results show that nullomers are the consequence of the peculiar structure of DNA (also including biased CpG frequency and CpGs islands), so that the hypermutability model, also taking into account CpG islands, seems to be not sufficient to explain nullomer phenomenon. Finally, high order nullomers could emphasize those features that already make simple nullomers useful in several applications. PMID:27906971
Ultra Deep Sequencing of Listeria monocytogenes sRNA Transcriptome Revealed New Antisense RNAs

PubMed Central

Behrens, Sebastian; Widder, Stefanie; Mannala, Gopala Krishna; Qing, Xiaoxing; Madhugiri, Ramakanth; Kefer, Nathalie; Mraheil, Mobarak Abu; Rattei, Thomas; Hain, Torsten

2014-01-01

Listeria monocytogenes, a gram-positive pathogen, and causative agent of listeriosis, has become a widely used model organism for intracellular infections. Recent studies have identified small non-coding RNAs (sRNAs) as important factors for regulating gene expression and pathogenicity of L. monocytogenes. Increased speed and reduced costs of high throughput sequencing (HTS) techniques have made RNA sequencing (RNA-Seq) the state-of-the-art method to study bacterial transcriptomes. We created a large transcriptome dataset of L. monocytogenes containing a total of 21 million reads, using the SOLiD sequencing technology. The dataset contained cDNA sequences generated from L. monocytogenes RNA collected under intracellular and extracellular condition and additionally was size fractioned into three different size ranges from <40 nt, 40–150 nt and >150 nt. We report here, the identification of nine new sRNAs candidates of L. monocytogenes and a reevaluation of known sRNAs of L. monocytogenes EGD-e. Automatic comparison to known sRNAs revealed a high recovery rate of 55%, which was increased to 90% by manual revision of the data. Moreover, thorough classification of known sRNAs shed further light on their possible biological functions. Interestingly among the newly identified sRNA candidates are antisense RNAs (asRNAs) associated to the housekeeping genes purA, fumC and pgi and potentially their regulation, emphasizing the significance of sRNAs for metabolic adaptation in L. monocytogenes. PMID:24498259
Identification of Two Novel Amalgaviruses in the Common Eelgrass (Zostera marina) and in Silico Analysis of the Amalgavirus +1 Programmed Ribosomal Frameshifting Sites.

PubMed

Park, Dongbin; Goh, Chul Jun; Kim, Hyein; Hahn, Yoonsoo

2018-04-01

The genome sequences of two novel monopartite RNA viruses were identified in a common eelgrass ( Zostera marina ) transcriptome dataset. Sequence comparison and phylogenetic analyses revealed that these two novel viruses belong to the genus Amalgavirus in the family Amalgaviridae . They were named Zostera marina amalgavirus 1 (ZmAV1) and Zostera marina amalgavirus 2 (ZmAV2). Genomes of both ZmAV1 and ZmAV2 contain two overlapping open reading frames (ORFs). ORF1 encodes a putative replication factory matrix-like protein, while ORF2 encodes a RNA-dependent RNA polymerase (RdRp) domain. The fusion protein (ORF1+2) of ORF1 and ORF2, which mediates RNA replication, was produced using the +1 programmed ribosomal frameshifting (PRF) mechanism. The +1 PRF motif sequence, UUU_CGN, which is highly conserved among known amalgaviruses, was also found in ZmAV1 and ZmAV2. Multiple sequence alignment of the ORF1+2 fusion proteins from 24 amalgaviruses revealed that +1 PRF occurred only at three different positions within the 13-amino acid-long segment, which was surrounded by highly conserved regions on both sides. This suggested that the +1 PRF may be constrained by the structure of fusion proteins. Genome sequences of ZmAV1 and ZmAV2, which are the first viruses to be identified in common eelgrass, will serve as useful resources for studying evolution and diversity of amalgaviruses.

Identification of Two Novel Amalgaviruses in the Common Eelgrass (Zostera marina) and in Silico Analysis of the Amalgavirus +1 Programmed Ribosomal Frameshifting Sites

PubMed Central

Park, Dongbin; Goh, Chul Jun; Kim, Hyein; Hahn, Yoonsoo

2018-01-01

The genome sequences of two novel monopartite RNA viruses were identified in a common eelgrass (Zostera marina) transcriptome dataset. Sequence comparison and phylogenetic analyses revealed that these two novel viruses belong to the genus Amalgavirus in the family Amalgaviridae. They were named Zostera marina amalgavirus 1 (ZmAV1) and Zostera marina amalgavirus 2 (ZmAV2). Genomes of both ZmAV1 and ZmAV2 contain two overlapping open reading frames (ORFs). ORF1 encodes a putative replication factory matrix-like protein, while ORF2 encodes a RNA-dependent RNA polymerase (RdRp) domain. The fusion protein (ORF1+2) of ORF1 and ORF2, which mediates RNA replication, was produced using the +1 programmed ribosomal frameshifting (PRF) mechanism. The +1 PRF motif sequence, UUU_CGN, which is highly conserved among known amalgaviruses, was also found in ZmAV1 and ZmAV2. Multiple sequence alignment of the ORF1+2 fusion proteins from 24 amalgaviruses revealed that +1 PRF occurred only at three different positions within the 13-amino acid-long segment, which was surrounded by highly conserved regions on both sides. This suggested that the +1 PRF may be constrained by the structure of fusion proteins. Genome sequences of ZmAV1 and ZmAV2, which are the first viruses to be identified in common eelgrass, will serve as useful resources for studying evolution and diversity of amalgaviruses. PMID:29628822
Microbial Diversity of Acidic Hot Spring (Kawah Hujan B) in Geothermal Field of Kamojang Area, West Java-Indonesia

PubMed Central

Aditiawati, Pingkan; Yohandini, Heni; Madayanti, Fida; Akhmaloka

2009-01-01

Microbial communities in an acidic hot spring, namely Kawah Hujan B, at Kamojang geothermal field, West Java-Indonesia was examined using culture dependent and culture independent strategies. Chemical analysis of the hot spring water showed a characteristic of acidic-sulfate geothermal activity that contained high sulfate concentrations and low pH values (pH 1.8 to 1.9). Microbial community present in the spring was characterized by 16S rRNA gene combined with denaturing gradient gel electrophoresis (DGGE) analysis. The majority of the sequences recovered from culture-independent method were closely related to Crenarchaeota and Proteobacteria phyla. However, detail comparison among the member of Crenarchaeota showing some sequences variation compared to that the published data especially on the hypervariable and variable regions. In addition, the sequences did not belong to certain genus. Meanwhile, the 16S Rdna sequences from culture-dependent samples revealed mostly close to Firmicute and gamma Proteobacteria. PMID:19440252
The genome sequence of the colonial chordate, Botryllus schlosseri

PubMed Central

Voskoboynik, Ayelet; Neff, Norma F; Sahoo, Debashis; Newman, Aaron M; Pushkarev, Dmitry; Koh, Winston; Passarelli, Benedetto; Fan, H Christina; Mantalas, Gary L; Palmeri, Karla J; Ishizuka, Katherine J; Gissi, Carmela; Griggio, Francesca; Ben-Shlomo, Rachel; Corey, Daniel M; Penland, Lolita; White, Richard A; Weissman, Irving L; Quake, Stephen R

2013-01-01

Botryllus schlosseri is a colonial urochordate that follows the chordate plan of development following sexual reproduction, but invokes a stem cell-mediated budding program during subsequent rounds of asexual reproduction. As urochordates are considered to be the closest living invertebrate relatives of vertebrates, they are ideal subjects for whole genome sequence analyses. Using a novel method for high-throughput sequencing of eukaryotic genomes, we sequenced and assembled 580 Mbp of the B. schlosseri genome. The genome assembly is comprised of nearly 14,000 intron-containing predicted genes, and 13,500 intron-less predicted genes, 40% of which could be confidently parceled into 13 (of 16 haploid) chromosomes. A comparison of homologous genes between B. schlosseri and other diverse taxonomic groups revealed genomic events underlying the evolution of vertebrates and lymphoid-mediated immunity. The B. schlosseri genome is a community resource for studying alternative modes of reproduction, natural transplantation reactions, and stem cell-mediated regeneration. DOI: http://dx.doi.org/10.7554/eLife.00569.001 PMID:23840927
Simian immunodeficiency viruses from African green monkeys display unusual genetic diversity.

PubMed Central

Johnson, P R; Fomsgaard, A; Allan, J; Gravell, M; London, W T; Olmsted, R A; Hirsch, V M

1990-01-01

African green monkeys are asymptomatic carriers of simian immunodeficiency viruses (SIV), commonly called SIVagm. As many as 50% of African green monkeys in the wild may be SIV seropositive. This high seroprevalence rate and the potential for genetic variation of lentiviruses suggested to us that African green monkeys may harbor widely differing genotypes of SIVagm. To investigate this hypothesis, we determined the entire nucleotide sequence of an infectious proviral molecular clone of SIVagm (155-4) and partial sequences (long terminal repeat and Gag) of three other distinct SIVagm isolates (90, gri-1, and ver-1). Comparisons among the SIVagm isolates revealed extreme diversity at the nucleotide and amino acid levels. Long terminal repeat nucleotide sequences varied up to 35% and Gag protein sequences varied up to 30%. The variability among SIVagm isolates exceeded the variability among any other group of primate lentiviruses. Our data suggest that SIVagm has been in the African green monkey population for a long time and may be the oldest primate lentivirus group in existence. PMID:2304139
High sequence variability among hemocyte-specific Kazal-type proteinase inhibitors in decapod crustaceans.

PubMed

Cerenius, Lage; Liu, Haipeng; Zhang, Yanjiao; Rimphanitchayakit, Vichien; Tassanakajon, Anchalee; Gunnar Andersson, M; Söderhäll, Kenneth; Söderhäll, Irene

2010-01-01

Crustacean hemocytes were found to produce a large number of transcripts coding for Kazal-type proteinase inhibitors (KPIs). A detailed study performed with the crayfish Pacifastacus leniusculus and the shrimp Penaeus monodon revealed the presence of at least 26 and 20 different Kazal domains from the hemocyte KPIs, respectively. Comparisons with KPIs from other taxa indicate that the sequences of these domains evolve rapidly. A few conserved positions, e.g. six invariant cysteines were present in all domain sequences whereas the position of P1 amino acid, a determinant for substrate specificity, varied highly. A study with a single crayfish animal suggested that even at the individual level considerable sequence variability among hemocyte KPIs produced exist. Expression analysis of four crayfish KPI transcripts in hematopoietic tissue cells and different hemocyte types suggest that some of these KPIs are likely to be involved in hematopoiesis or hemocyte release as they were produced in particular hemocyte types or maturation stages only.
Swine and Poultry Pathogens: the Complete Genome Sequences of Two Strains of Mycoplasma hyopneumoniae and a Strain of Mycoplasma synoviae†

PubMed Central

Vasconcelos, Ana Tereza R.; Ferreira, Henrique B.; Bizarro, Cristiano V.; Bonatto, Sandro L.; Carvalho, Marcos O.; Pinto, Paulo M.; Almeida, Darcy F.; Almeida, Luiz G. P.; Almeida, Rosana; Alves-Filho, Leonardo; Assunção, Enedina N.; Azevedo, Vasco A. C.; Bogo, Maurício R.; Brigido, Marcelo M.; Brocchi, Marcelo; Burity, Helio A.; Camargo, Anamaria A.; Camargo, Sandro S.; Carepo, Marta S.; Carraro, Dirce M.; de Mattos Cascardo, Júlio C.; Castro, Luiza A.; Cavalcanti, Gisele; Chemale, Gustavo; Collevatti, Rosane G.; Cunha, Cristina W.; Dallagiovanna, Bruno; Dambrós, Bibiana P.; Dellagostin, Odir A.; Falcão, Clarissa; Fantinatti-Garboggini, Fabiana; Felipe, Maria S. S.; Fiorentin, Laurimar; Franco, Gloria R.; Freitas, Nara S. A.; Frías, Diego; Grangeiro, Thalles B.; Grisard, Edmundo C.; Guimarães, Claudia T.; Hungria, Mariangela; Jardim, Sílvia N.; Krieger, Marco A.; Laurino, Jomar P.; Lima, Lucymara F. A.; Lopes, Maryellen I.; Loreto, Élgion L. S.; Madeira, Humberto M. F.; Manfio, Gilson P.; Maranhão, Andrea Q.; Martinkovics, Christyanne T.; Medeiros, Sílvia R. B.; Moreira, Miguel A. M.; Neiva, Márcia; Ramalho-Neto, Cicero E.; Nicolás, Marisa F.; Oliveira, Sergio C.; Paixão, Roger F. C.; Pedrosa, Fábio O.; Pena, Sérgio D. J.; Pereira, Maristela; Pereira-Ferrari, Lilian; Piffer, Itamar; Pinto, Luciano S.; Potrich, Deise P.; Salim, Anna C. M.; Santos, Fabrício R.; Schmitt, Renata; Schneider, Maria P. C.; Schrank, Augusto; Schrank, Irene S.; Schuck, Adriana F.; Seuanez, Hector N.; Silva, Denise W.; Silva, Rosane; Silva, Sérgio C.; Soares, Célia M. A.; Souza, Kelly R. L.; Souza, Rangel C.; Staats, Charley C.; Steffens, Maria B. R.; Teixeira, Santuza M. R.; Urmenyi, Turan P.; Vainstein, Marilene H.; Zuccherato, Luciana W.; Simpson, Andrew J. G.; Zaha, Arnaldo

2005-01-01

This work reports the results of analyses of three complete mycoplasma genomes, a pathogenic (7448) and a nonpathogenic (J) strain of the swine pathogen Mycoplasma hyopneumoniae and a strain of the avian pathogen Mycoplasma synoviae; the genome sizes of the three strains were 920,079 bp, 897,405 bp, and 799,476 bp, respectively. These genomes were compared with other sequenced mycoplasma genomes reported in the literature to examine several aspects of mycoplasma evolution. Strain-specific regions, including integrative and conjugal elements, and genome rearrangements and alterations in adhesin sequences were observed in the M. hyopneumoniae strains, and all of these were potentially related to pathogenicity. Genomic comparisons revealed that reduction in genome size implied loss of redundant metabolic pathways, with maintenance of alternative routes in different species. Horizontal gene transfer was consistently observed between M. synoviae and Mycoplasma gallisepticum. Our analyses indicated a likely transfer event of hemagglutinin-coding DNA sequences from M. gallisepticum to M. synoviae. PMID:16077101
Microbial diversity of acidic hot spring (kawah hujan B) in geothermal field of kamojang area, west java-indonesia.

PubMed

Aditiawati, Pingkan; Yohandini, Heni; Madayanti, Fida; Akhmaloka

2009-01-01

Microbial communities in an acidic hot spring, namely Kawah Hujan B, at Kamojang geothermal field, West Java-Indonesia was examined using culture dependent and culture independent strategies. Chemical analysis of the hot spring water showed a characteristic of acidic-sulfate geothermal activity that contained high sulfate concentrations and low pH values (pH 1.8 to 1.9). Microbial community present in the spring was characterized by 16S rRNA gene combined with denaturing gradient gel electrophoresis (DGGE) analysis. The majority of the sequences recovered from culture-independent method were closely related to Crenarchaeota and Proteobacteria phyla. However, detail comparison among the member of Crenarchaeota showing some sequences variation compared to that the published data especially on the hypervariable and variable regions. In addition, the sequences did not belong to certain genus. Meanwhile, the 16S Rdna sequences from culture-dependent samples revealed mostly close to Firmicute and gamma Proteobacteria.
Precise assignment of the heavy-strand promoter of mouse mitochondrial DNA: cognate start sites are not required for transcriptional initiation.

PubMed Central

Chang, D D; Clayton, D A

1986-01-01

Transcription of the heavy strand of mouse mitochondrial DNA starts from two closely spaced, distinct sites located in the displacement loop region of the genome. We report here an analysis of regulatory sequences required for faithful transcription from these two sites. Data obtained from in vitro assays demonstrated that a 51-base-pair region, encompassing nucleotides -40 to +11 of the downstream start site, contains sufficient information for accurate transcription from both start sites. Deletion of the 3' flanking sequences, including one or both start sites to -17, resulted in the initiation of transcription by the mitochondrial RNA polymerase from alternative sites within vector DNA sequences. This feature places the mouse heavy-strand promoter uniquely among other known mitochondrial promoters, all of which absolutely require cognate start sites for transcription. Comparison of the heavy-strand promoter with those of other vertebrate mitochondrial DNAs revealed a remarkably high rate of sequence divergence among species. Images PMID:3785226
Whole-Genome-Sequencing characterization of bloodstream infection-causing hypervirulent Klebsiella pneumoniae of capsular serotype K2 and ST374.

PubMed

Wang, Xiaoli; Xie, Yingzhou; Li, Gang; Liu, Jialin; Li, Xiaobin; Tian, Lijun; Sun, Jingyong; Ou, Hong-Yu; Qu, Hongping

2018-01-01

Hypervirulent K. pneumoniae variants (hvKP) have been increasingly reported worldwide, causing metastasis of severe infections such as liver abscesses and bacteremia. The capsular serotype K2 hvKP strains show diverse multi-locus sequence types (MLSTs), but with limited genetics and virulence information. In this study, we report a hypermucoviscous K. pneumoniae strain, RJF293, isolated from a human bloodstream sample in a Chinese hospital. It caused a metastatic infection and fatal septic shock in a critical patient. The microbiological features and genetic background were investigated with multiple approaches. The Strain RJF293 was determined to be multilocis sequence type (ST) 374 and serotype K2, displayed a median lethal dose (LD50) of 1.5 × 10 2 CFU in BALB/c mice and was as virulent as the ST23 K1 serotype hvKP strain NTUH-K2044 in a mouse lethality assay. Whole genome sequencing revealed that the RJF293 genome codes for 32 putative virulence factors and exhibits a unique presence/absence pattern in comparison to the other 105 completely sequenced K. pneumoniae genomes. Whole genome SNP-based phylogenetic analysis revealed that strain RJF293 formed a single clade, distant from those containing either ST66 or ST86 hvKP. Compared to the other sequenced hvKP chromosomes, RJF293 contains several strain-variable regions, including one prophage, one ICEKp1 family integrative and conjugative element and six large genomic islands. The sequencing of the first complete genome of an ST374 K2 hvKP clinical strain should reinforce our understanding of the epidemiology and virulence mechanisms of this bloodstream infection-causing hvKP with clinical significance.
Whole-Genome-Sequencing characterization of bloodstream infection-causing hypervirulent Klebsiella pneumoniae of capsular serotype K2 and ST374

PubMed Central

Wang, Xiaoli; Xie, Yingzhou; Li, Gang; Liu, Jialin; Li, Xiaobin; Tian, Lijun; Sun, Jingyong; Qu, Hongping

2018-01-01

ABSTRACT Hypervirulent K. pneumoniae variants (hvKP) have been increasingly reported worldwide, causing metastasis of severe infections such as liver abscesses and bacteremia. The capsular serotype K2 hvKP strains show diverse multi-locus sequence types (MLSTs), but with limited genetics and virulence information. In this study, we report a hypermucoviscous K. pneumoniae strain, RJF293, isolated from a human bloodstream sample in a Chinese hospital. It caused a metastatic infection and fatal septic shock in a critical patient. The microbiological features and genetic background were investigated with multiple approaches. The Strain RJF293 was determined to be multilocis sequence type (ST) 374 and serotype K2, displayed a median lethal dose (LD50) of 1.5 × 102 CFU in BALB/c mice and was as virulent as the ST23 K1 serotype hvKP strain NTUH-K2044 in a mouse lethality assay. Whole genome sequencing revealed that the RJF293 genome codes for 32 putative virulence factors and exhibits a unique presence/absence pattern in comparison to the other 105 completely sequenced K. pneumoniae genomes. Whole genome SNP-based phylogenetic analysis revealed that strain RJF293 formed a single clade, distant from those containing either ST66 or ST86 hvKP. Compared to the other sequenced hvKP chromosomes, RJF293 contains several strain-variable regions, including one prophage, one ICEKp1 family integrative and conjugative element and six large genomic islands. The sequencing of the first complete genome of an ST374 K2 hvKP clinical strain should reinforce our understanding of the epidemiology and virulence mechanisms of this bloodstream infection-causing hvKP with clinical significance. PMID:29338592
The protective effects of acute cardiovascular exercise on the interference of procedural memory.

PubMed

Jo, J S; Chen, J; Riechman, S; Roig, M; Wright, D L

2018-04-10

Numerous studies have reported a positive impact of acute exercise for procedural skill memory. Previous work has revealed this effect, but these findings are confounded by a potential contribution of a night of sleep to the reported exercise-mediated reduction in interference. Thus, it remains unclear if exposure to a brief bout of exercise can provide protection to a newly acquired motor memory. The primary objective of the present study was to examine if a single bout of moderate-intensity cardiovascular exercise after practice of a novel motor sequence reduces the susceptibility to retroactive interference. To address this shortcoming, 17 individuals in a control condition practiced a novel motor sequence that was followed by test after a 6-h wake-filled interval. A separate group of 17 individuals experienced practice with an interfering motor sequence 45 min after practice with the original sequence and were then administered test trials 6 h later. One additional group of 12 participants was exposed to an acute bout of exercise immediately after practice with the original motor sequence but prior to practice with the interfering motor sequence and the subsequent test. In comparison with the control condition, increased response times were revealed during the 6-h test for the individuals that were exposed to interference. The introduction of an acute bout of exercise between the practice of the two motor sequences produced a reduction in interference from practice with the second task at the time of test, however, this effect was not statistically significant. These data reinforce the hypothesis that while there may be a contribution from exercise to post-practice consolidation of procedural skills which is independent of sleep, sleep may interact with exercise to strengthen the effects of the latter on procedural memory.
Sequence analyses and evolutionary relationships among the energy-coupling proteins Enzyme I and HPr of the bacterial phosphoenolpyruvate: sugar phosphotransferase system.

PubMed Central

Reizer, J.; Hoischen, C.; Reizer, A.; Pham, T. N.; Saier, M. H.

1993-01-01

We have previously reported the overexpression, purification, and biochemical properties of the Bacillus subtilis Enzyme I of the phosphoenolpyruvate: sugar phosphotransferase system (PTS) (Reizer, J., et al., 1992, J. Biol. Chem. 267, 9158-9169). We now report the sequencing of the ptsI gene of B. subtilis encoding Enzyme I (570 amino acids and 63,076 Da). Putative transcriptional regulatory signals are identified, and the pts operon is shown to be subject to carbon source-dependent regulation. Multiple alignments of the B. subtilis Enzyme I with (1) six other sequenced Enzymes I of the PTS from various bacterial species, (2) phosphoenolpyruvate synthase of Escherichia coli, and (3) bacterial and plant pyruvate: phosphate dikinases (PPDKs) revealed regions of sequence similarity as well as divergence. Statistical analyses revealed that these three types of proteins comprise a homologous family, and the phylogenetic tree of the 11 sequenced protein members of this family was constructed. This tree was compared with that of the 12 sequence HPr proteins or protein domains. Antibodies raised against the B. subtilis and E. coli Enzymes I exhibited immunological cross-reactivity with each other as well as with PPDK of Bacteroides symbiosus, providing support for the evolutionary relationships of these proteins suggested from the sequence comparisons. Putative flexible linkers tethering the N-terminal and the C-terminal domains of protein members of the Enzyme I family were identified, and their potential significance with regard to Enzyme I function is discussed. The codon choice pattern of the B. subtilis and E. coli ptsI and ptsH genes was found to exhibit a bias toward optimal codons in these organisms.(ABSTRACT TRUNCATED AT 250 WORDS) PMID:7686067
The complete genome sequence of freesia mosaic virus and its relationship to other potyviruses.

PubMed

Choi, H I; Lim, H R; Song, Y S; Kim, M J; Choi, S H; Song, Y S; Bae, S C; Ryu, K H

2010-07-01

We have completed the genomic sequence of a potyvirus, freesia mosaic virus (FreMV), and compared it to those of other known potyviruses. The full-length genome sequence of FreMV consists of 9,489 nucleotides. The large protein contains 3,077 amino acids, with an AUG start codon and UAA stop codon, containing one open reading frame typical of a potyvirus polyprotein. The polyprotein of FreMV-Kr gives rise to eleven proteins (P1, HC-pro, P3, PIPO, 6K1, CI, 6K2, VPg, NIa, NIb and CP), and putative cleavage sites of each protein were identified by sequence comparison to those of other known potyviruses. Phylogenetic analysis of the polyprotein revealed that FreMV-Kr was most closely related to PeMoV and was related to BtMV, BaRMV and PeLMV, which belong to the BCMV subgroup. This is the first information on the complete genome structure of FreMV, and the sequence information clearly supports the status of FreMV as a member of a distinct species in the genus Potyvirus.
Lactobacillus cypricasei Lawson et al. 2001 is a later heterotypic synonym of Lactobacillus acidipiscis Tanasupawat et al. 2000.

PubMed

Naser, Sabri M; Vancanneyt, Marc; Hoste, Bart; Snauwaert, Cindy; Swings, Jean

2006-07-01

The applicability of a multilocus sequence analysis (MLSA)-based identification system for lactobacilli was evaluated. Two housekeeping genes that code for the phenylalanyl-tRNA synthase alpha-subunit (pheS) and RNA polymerase alpha-subunit (rpoA) were sequenced and analysed for members of the Lactobacillus salivarius species group. The type strains of Lactobacillus acidipiscis and Lactobacillus cypricasei were investigated further using a third gene that encodes the alpha-subunit of ATP synthase (atpA). The MLSA data revealed close relatedness between L. acidipiscis and L. cypricasei, with 99.8-100 % pheS, rpoA and atpA gene sequence similarities. Comparison of the 16S rRNA gene sequences of the type strains of the two species confirmed the close relatedness (99.8 % gene sequence similarity) between the two taxa. Similar phenotypes and high DNA-DNA binding values in the range of 84 to 97.5 % confirmed that L. acidipiscis and L. cypricasei are synonymous species. On the basis of the present study, it is proposed that Lactobacillus cypricasei is a later heterotypic synonym of Lactobacillus acidipiscis.
Fabrication of a New Lineage of Artificial Luciferases from Natural Luciferase Pools.

PubMed

Kim, Sung Bae; Nishihara, Ryo; Citterio, Daniel; Suzuki, Koji

2017-09-11

The fabrication of artificial luciferases (ALucs) with unique optical properties has a fundamental impact on bioassays and molecular imaging. In this study, we developed a new lineage of ALucs with unique substrate preferences by extracting consensus amino acids from the alignment of 25 copepod luciferase sequences available in natural luciferase pools. The primary sequence was first created with a sequence logo generator resulting in a total of 11 sibling sequences. Phylogenetic analysis shows that the newly fabricated ALucs form an independent branch, genetically isolated from the natural luciferases, and from a prior series of ALucs produced by our laboratory using a smaller basis set. The new lineage of ALucs were strongly luminescent in living mammalian cells with specific substrate selectivity to native coelenterazine. A single-residue-level comparison of the C-terminal sequences of new ALucs reveals that some amino acids in the C-terminal ends are greatly influential on the optical intensities but limited in the color variance. The success of this approach guides on how to engineer and functionalize marine luciferases for bioluminescence imaging and assays.
Phylogeography of the dark kangaroo mouse, Microdipodops megacephalus: cryptic lineages and dispersal routes in North America's Great Basin.

PubMed

Hafner, John C; Upham, Nathan S

2011-06-01

AIM: The rodent genus Microdipodops (kangaroo mice) includes two sand-obligate endemics of the Great Basin Desert: M. megacephalus and M. pallidus. The dark kangaroo mouse, M. megacephalus, is distributed throughout the Great Basin and our principal aims were to formulate phylogenetic hypotheses for this taxon and make phylogeographical comparisons with its congener. LOCATION: The Great Basin Desert of western North America. METHODS: DNA sequence data from three mitochondrial genes were examined from 186 individuals of M. megacephalus, representing 47 general localities. Phylogenetic inference was used to analyse the sequence data. Directional analysis of phylogeographical patterns was used to examine haplotype sharing patterns and recover routes of gene exchange. Haplotype-area curves were constructed to evaluate the relationship between genetic variation and distributional island size for M. megacephalus and M. pallidus. RESULTS: Microdipodops megacephalus is a rare desert rodent (trapping success was 2.67%). Temporal comparison of trapping data shows that kangaroo mice are becoming less abundant in the study area. The distribution has changed slightly since the 1930s but many northern populations now appear to be small, fragmented, or locally extinct. Four principal phylogroups (the Idaho isolate and the western, central and eastern clades) are evident; mean sequence divergence between phylogroups for cytochrome b is c. 8%. Data from haplotype sharing show two trends: a north-south trend and a web-shaped trend. Analyses of haplotype-area curves reveal significant positive relationships. MAIN CONCLUSIONS: The four phylogroups of M. megacephalus appear to represent morphologically cryptic species; in comparison, a companion study revealed two cryptic lineages in M. pallidus. Estimated divergence times of the principal clades of M. megacephalus (c. 2-4 Ma) indicate that these kangaroo mice were Pleistocene invaders into the Great Basin coincident with the formation of sandy habitats. The north-south and web patterns from directional analyses reveal past routes of gene flow and provide evidence for source-sink population regulation. The web pattern was not seen in the companion study of M. pallidus. Significant haplotype-area curves indicate that the distributional islands are now in approximate genetic equilibrium. The patterns described here are potentially useful to conservation biologists and wildlife managers and may serve as a model for other sand-obligate organisms of the Great Basin.
Phylogenetic analysis of nitrite, nitric oxide, and nitrous oxide respiratory enzymes reveal a complex evolutionary history for denitrification.

PubMed

Jones, Christopher M; Stres, Blaz; Rosenquist, Magnus; Hallin, Sara

2008-09-01

Denitrification is a facultative respiratory pathway in which nitrite (NO2(-)), nitric oxide (NO), and nitrous oxide (N2O) are successively reduced to nitrogen gas (N(2)), effectively closing the nitrogen cycle. The ability to denitrify is widely dispersed among prokaryotes, and this polyphyletic distribution has raised the possibility of horizontal gene transfer (HGT) having a substantial role in the evolution of denitrification. Comparisons of 16S rRNA and denitrification gene phylogenies in recent studies support this possibility; however, these results remain speculative as they are based on visual comparisons of phylogenies from partial sequences. We reanalyzed publicly available nirS, nirK, norB, and nosZ partial sequences using Bayesian and maximum likelihood phylogenetic inference. Concomitant analysis of denitrification genes with 16S rRNA sequences from the same organisms showed substantial differences between the trees, which were supported by examining the posterior probability of monophyletic constraints at different taxonomic levels. Although these differences suggest HGT of denitrification genes, the presence of structural variants for nirK, norB, and nosZ makes it difficult to determine HGT from other evolutionary events. Additional analysis using phylogenetic networks and likelihood ratio tests of phylogenies based on full-length sequences retrieved from genomes also revealed significant differences in tree topologies among denitrification and 16S rRNA gene phylogenies, with the exception of the nosZ gene phylogeny within the data set of the nirK-harboring genomes. However, inspection of codon usage and G + C content plots from complete genomes gave no evidence for recent HGT. Instead, the close proximity of denitrification gene copies in the genomes of several denitrifying bacteria suggests duplication. Although HGT cannot be ruled out as a factor in the evolution of denitrification genes, our analysis suggests that other phenomena, such gene duplication/divergence and lineage sorting, may have differently influenced the evolution of each denitrification gene.
Production of individualized V gene databases reveals high levels of immunoglobulin genetic diversity

NASA Astrophysics Data System (ADS)

Corcoran, Martin M.; Phad, Ganesh E.; Bernat, Néstor Vázquez; Stahl-Hennig, Christiane; Sumida, Noriyuki; Persson, Mats A. A.; Martin, Marcel; Hedestam, Gunilla B. Karlsson

2016-12-01

Comprehensive knowledge of immunoglobulin genetics is required to advance our understanding of B cell biology. Validated immunoglobulin variable (V) gene databases are close to completion only for human and mouse. We present a novel computational approach, IgDiscover, that identifies germline V genes from expressed repertoires to a specificity of 100%. IgDiscover uses a cluster identification process to produce candidate sequences that, once filtered, results in individualized germline V gene databases. IgDiscover was tested in multiple species, validated by genomic cloning and cross library comparisons and produces comprehensive gene databases even where limited genomic sequence is available. IgDiscover analysis of the allelic content of the Indian and Chinese-origin rhesus macaques reveals high levels of immunoglobulin gene diversity in this species. Further, we describe a novel human IGHV3-21 allele and confirm significant gene differences between Balb/c and C57BL6 mouse strains, demonstrating the power of IgDiscover as a germline V gene discovery tool.
A rural worker infected with a bovine-prevalent genotype of Campylobacter fetus subsp. fetus supports zoonotic transmission and inconsistency of MLST and whole-genome typing.

PubMed

Iraola, G; Betancor, L; Calleros, L; Gadea, P; Algorta, G; Galeano, S; Muxi, P; Greif, G; Pérez, R

2015-08-01

Whole-genome characterisation in clinical microbiology enables to detect trends in infection dynamics and disease transmission. Here, we report a case of bacteraemia due to Campylobacter fetus subsp. fetus in a rural worker under cancer treatment that was diagnosed with cellulitis; the patient was treated with antibiotics and recovered. The routine typing methods were not able to identify the microorganism causing the infection, so it was further analysed by molecular methods and whole-genome sequencing. The multi-locus sequence typing (MLST) revealed the presence of the bovine-associated ST-4 genotype. Whole-genome comparisons with other C. fetus strains revealed an inconsistent phylogenetic position based on the core genome, discordant with previous ST-4 strains. To the best of our knowledge, this is the first C. fetus subsp. fetus carrying the ST-4 isolated from humans and represents a probable case of zoonotic transmission from cattle.
Production of individualized V gene databases reveals high levels of immunoglobulin genetic diversity

PubMed Central

Corcoran, Martin M.; Phad, Ganesh E.; Bernat, Néstor Vázquez; Stahl-Hennig, Christiane; Sumida, Noriyuki; Persson, Mats A.A.; Martin, Marcel; Hedestam, Gunilla B. Karlsson

2016-01-01

Comprehensive knowledge of immunoglobulin genetics is required to advance our understanding of B cell biology. Validated immunoglobulin variable (V) gene databases are close to completion only for human and mouse. We present a novel computational approach, IgDiscover, that identifies germline V genes from expressed repertoires to a specificity of 100%. IgDiscover uses a cluster identification process to produce candidate sequences that, once filtered, results in individualized germline V gene databases. IgDiscover was tested in multiple species, validated by genomic cloning and cross library comparisons and produces comprehensive gene databases even where limited genomic sequence is available. IgDiscover analysis of the allelic content of the Indian and Chinese-origin rhesus macaques reveals high levels of immunoglobulin gene diversity in this species. Further, we describe a novel human IGHV3-21 allele and confirm significant gene differences between Balb/c and C57BL6 mouse strains, demonstrating the power of IgDiscover as a germline V gene discovery tool. PMID:27995928

Transcriptome Sequencing of Lima Bean (Phaseolus lunatus) to Identify Putative Positive Selection in Phaseolus and Legumes

PubMed Central

Li, Fengqi; Cao, Depan; Liu, Yang; Yang, Ting; Wang, Guirong

2015-01-01

The identification of genes under positive selection is a central goal of evolutionary biology. Many legume species, including Phaseolus vulgaris (common bean) and Phaseolus lunatus (lima bean), have important ecological and economic value. In this study, we sequenced and assembled the transcriptome of one Phaseolus species, lima bean. A comparison with the genomes of six other legume species, including the common bean, Medicago, lotus, soybean, chickpea, and pigeonpea, revealed 15 and 4 orthologous groups with signatures of positive selection among the two Phaseolus species and among the seven legume species, respectively. Characterization of these positively selected genes using Non redundant (nr) annotation, gene ontology (GO) classification, GO term enrichment and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analyses revealed that these genes are mostly involved in thylakoids, photosynthesis and metabolism. This study identified genes that may be related to the divergence of the Phaseolus and legume species. These detected genes are particularly good candidates for subsequent functional studies. PMID:26151849
Cloning of a new member of the insulin gene superfamily (INSL4) expressed in human placenta

DOE Office of Scientific and Technical Information (OSTI.GOV)

Chassin, D.; Laurent, A.; Janneau, J.L.

1995-09-20

A new member of the insulin gene superfamily was identified by screening a subtracted cDNA library of first-trimester human placenta and, hence, was tentatively named early placenta insulin-like peptide (EPIL). In this paper, we report the cloning and sequencing of the EPIL cDNA and the EPIL gene (INSL4). Comparison of the deduced amino acid sequence of the early placenta insulin-like peptide revealed significant overall and structural homologies with members of the insulin-like hormone superfamily. Moreover, the organization of the early placenta insulin-like gene, which is composed of two exons and one intron, is similiar to that of insulin and relaxin.more » By in situ hybridization, the INSL4 gene was assigned to band p24 of the short arm of chromosome 9. RT-PCR analysis of EPIL tissue distribution revealed that its transcripts are expressed in the placenta and uterus. 22 refs., 3 figs.« less
High-resolution motion compensated MRA in patients with congenital heart disease using extracellular contrast agent at 3 Tesla.

PubMed

Dabir, Darius; Naehle, Claas Philip; Clauberg, Ralf; Gieseke, Juergen; Schild, Hans H; Thomas, Daniel

2012-10-29

Using first-pass MRA (FP-MRA) spatial resolution is limited by breath-hold duration. In addition, image quality may be hampered by respiratory and cardiac motion artefacts. In order to overcome these limitations an ECG- and navigator-gated high-resolution-MRA sequence (HR-MRA) with slow infusion of extracellular contrast agent was implemented at 3 Tesla for the assessment of congenital heart disease and compared to standard first-pass-MRA (FP-MRA). 34 patients (median age: 13 years) with congenital heart disease (CHD) were prospectively examined on a 3 Tesla system. The CMR-protocol comprised functional imaging, FP- and HR-MRA, and viability imaging. After the acquisition of the FP-MRA sequence using a single dose of extracellular contrast agent the motion compensated HR-MRA sequence with isotropic resolution was acquired while injecting the second single dose, utilizing the timeframe before viability imaging. Qualitative scores for image quality (two independent reviewers) as well as quantitative measurements of vessel sharpness and relative contrast were compared using the Wilcoxon signed-rank test. Quantitative measurements of vessel diameters were compared using the Bland-Altman test. The mean image quality score revealed significantly better image quality of the HR-MRA sequence compared to the FP-MRA sequence in all vessels of interest (ascending aorta (AA), left pulmonary artery (LPA), left superior pulmonary vein (LSPV), coronary sinus (CS), and coronary ostia (CO); all p < 0.0001). In comparison to FP-MRA, HR-MRA revealed significantly better vessel sharpness for all considered vessels (AA, LSPV and LPA; all p < 0.0001). The relative contrast of the HR-MRA sequence was less compared to the FP-MRA sequence (AA: p <0.028, main pulmonary artery: p <0.004, LSPV: p <0.005). Both, the results of the intra- and interobserver measurements of the vessel diameters revealed closer correlation and closer 95 % limits of agreement for the HR-MRA. HR-MRA revealed one additional clinical finding, missed by FP-MRA. An ECG- and navigator-gated HR-MRA-protocol with infusion of extracellular contrast agent at 3 Tesla is feasible. HR-MRA delivers significantly better image quality and vessel sharpness compared to FP-MRA. It may be integrated into a standard CMR-protocol for patients with CHD without the need for additional contrast agent injection and without any additional examination time.
Molecular characterization of DnaJ 5 homologs in silkworm Bombyx mori and its expression during egg diapause.

PubMed

Sirigineedi, Sasibhushan; Vijayagowri, Esvaran; Murthy, Geetha N; Rao, Guruprasada; Ponnuvel, Kangayam M

2014-12-01

A comparison of the cDNA sequences (1 056 bp) of Bombyx mori DnaJ 5 homolog with B. mori genome revealed that unlike in other Hsps, it has an intron of 234 bp. The DnaJ 5 homolog contains 351 amino acids, of which 70 contain the conserved DnaJ domain at the N-terminal end. This homolog of B. mori has all desirable functional domains similar to other insects, and the 13 different DnaJ homologs identified in B. mori genome were distributed on different chromosomes. The expressed sequence tag database analysis of Hsp40 gene expression revealed higher expression in wing disc followed by diapause-induced eggs. Microarray analysis revealed higher expression of DnaJ 5 homolog at 18th h after oviposition in diapause-induced eggs. Further validation of DnaJ 5 expression through qPCR in diapause-induced and nondiapause eggs at different time intervals revealed higher expression in diapause eggs at 18 and 24 h after oviposition, which coincided with the expression of Hsp70 as the Hsp 40 is its co-chaperone. This study thus provides an outline of the genome organization of Hsp40 gene, and its role in egg diapause induction in B. mori. © 2013 Institute of Zoology, Chinese Academy of Sciences.
Comparison of predicted binders in Rhipicephalus (Boophilus) microplus intestine protein variants Bm86 Campo Grande strain, Bm86 and Bm95.

PubMed

Andreotti, Renato; Pedroso, Marisela S; Caetano, Alexandre R; Martins, Natália F

2008-01-01

This paper reports the sequence analysis of Bm86 Campo Grande strain comparing it with Bm86 and Bm95 antigens from the preparations TickGardPLUS and Gavac, respectively. The PCR product was cloned into pMOSBlue and sequenced. The secondary structure prediction tool PSIPRED was used to calculate alpha helices and beta strand contents of the predicted polypeptide. The hydrophobicity profile was calculated using the algorithms from the Hopp and Woods method, in addition to identification of potential MHC class-I binding regions in the antigens. Pair-wise alignment revealed that the similarity between Bm86 Campo Grande strain and Bm86 is 0.2% higher than that between Bm86 Campo Grande strain and Bm95 antigens. The identities were 96.5% and 96.3% respectively. Major suggestive differences in hydrophobicity were predicted among the sequences in two specific regions.
Diversity among Tacaribe serocomplex viruses (family Arenaviridae) naturally associated with the Mexican woodrat (Neotoma mexicana)

PubMed Central

Cajimat, Maria N. B.; Milazzo, Mary Louise; Borchert, Jeff N.; Abbott, Ken D.; Bradley, Robert D.; Fulhorst, Charles F.

2008-01-01

The results of analyses of glycoprotein precursor and nucleocapsid protein gene sequences indicated that an arenavirus isolated from a Mexican woodrat (Neotoma mexicana) captured in Arizona is a strain of a novel species (proposed name Skinner Tank virus) and that arenaviruses isolated from Mexican woodrats captured in Colorado, New Mexico, and Utah are strains of Whitewater Arroyo virus or species phylogenetically closely related to Whitewater Arroyo virus. Pairwise comparisons of glycoprotein precursor sequences and nucleocapsid protein sequences revealed a high level of divergence among the viruses isolated from the Mexican woodrats captured in Colorado, New Mexico, and Utah and the Whitewater Arroyo virus prototype strain AV 9310135, which originally was isolated from a white-throated woodrat (Neotoma albigula) captured in New Mexico. Conceptually, the viruses from Colorado, New Mexico, and Utah and strain AV 9310135 could be grouped together in a species complex in the family Arenaviridae, genus Arenavirus. PMID:18304671
Recombination events and variability among full-length genomes of co-circulating molluscum contagiosum virus subtypes 1 and 2.

PubMed

López-Bueno, Alberto; Parras-Moltó, Marcos; López-Barrantes, Olivia; Belda, Sylvia; Alejo, Alí

2017-05-01

Molluscum contagiosum virus (MCV) is the sole member of the Molluscipoxvirus genus and causes a highly prevalent human disease of the skin characterized by the formation of a variable number of lesions that can persist for prolonged periods of time. Two major genotypes, subtype 1 and subtype 2, are recognized, although currently only a single complete genomic sequence corresponding to MCV subtype 1 is available. Using next-generation sequencing techniques, we report the complete genomic sequence of four new MCV isolates, including the first one derived from a subtype 2. Comparisons suggest a relatively distant evolutionary split between both MCV subtypes. Further, our data illustrate concurrent circulation of distinct viruses within a population and reveal the existence of recombination events among them. These results help identify a set of MCV genes with potentially relevant roles in molluscum contagiosum epidemiology and pathogenesis.
A nucleotide substitution in one of the beta-tubulin genes of Trichoderma viride confers resistance to the antimitotic drug methyl benzimidazole-2-yl-carbamate.

PubMed

Goldman, G H; Temmerman, W; Jacobs, D; Contreras, R; Van Montagu, M; Herrera-Estrella, A

1993-07-01

We characterized a Trichoderma viride strain that is resistant to the antimitotic drug methyl benzimidazole-2-yl-carbamate (MBC). This species has two beta-tubulin genes (tub1 and tub2) and by reverse genetics we showed that a mutation in the tub2 gene confers MBC resistance in this strain. Comparison of the tub2 sequence of the mutant strain with that of the wild type revealed that a single amino acid substitution of tyrosine for histidine at a position 6 is responsible for the MBC tolerance. Furthermore, we showed that this gene can be used as a homologous dominant selectable marker in T. viride transformation. Both tubulin genes were completely sequenced. They differ by 48 residues and the degree of identity between their deduced amino acid sequences is 86.3%.
Identification of the skeletal remains of Martin Bormann by mtDNA analysis.

PubMed

Anslinger, K; Weichhold, G; Keil, W; Bayer, B; Eisenmenger, W

2001-01-01

Contrary to statements of an eye-witness who reported that Martin Bormann, the second most powerful man in the Third Reich, died on 2 May 1945 in Berlin, rumours persisted over the years that he had escaped from Germany after World War II. In 1972, skeletal remains were found during construction work, and by investigating the teeth and the bones experts concluded that they were from Bormann. Nevertheless, new rumours arose and in order to end this speculation we were commissioned to identify the skeletal remains by mitochondrial DNA analysis. The comparison of the sequence of HV1 and HV2 from the skeletal remains and a living maternal relative of Martin Bormann revealed no differences and this sequence was not found in 1,500 Caucasoid reference sequences. Based on this investigation, we support the hypothesis that the skeletal remains are those of Martin Bormann.
Deep sequencing reveals persistence of cell-associated mumps vaccine virus in chronic encephalitis.

PubMed

Morfopoulou, Sofia; Mee, Edward T; Connaughton, Sarah M; Brown, Julianne R; Gilmour, Kimberly; Chong, W K 'Kling'; Duprex, W Paul; Ferguson, Deborah; Hubank, Mike; Hutchinson, Ciaran; Kaliakatsos, Marios; McQuaid, Stephen; Paine, Simon; Plagnol, Vincent; Ruis, Christopher; Virasami, Alex; Zhan, Hong; Jacques, Thomas S; Schepelmann, Silke; Qasim, Waseem; Breuer, Judith

2017-01-01

Routine childhood vaccination against measles, mumps and rubella has virtually abolished virus-related morbidity and mortality. Notwithstanding this, we describe here devastating neurological complications associated with the detection of live-attenuated mumps virus Jeryl Lynn (MuV JL5 ) in the brain of a child who had undergone successful allogeneic transplantation for severe combined immunodeficiency (SCID). This is the first confirmed report of MuV JL5 associated with chronic encephalitis and highlights the need to exclude immunodeficient individuals from immunisation with live-attenuated vaccines. The diagnosis was only possible by deep sequencing of the brain biopsy. Sequence comparison of the vaccine batch to the MuV JL5 isolated from brain identified biased hypermutation, particularly in the matrix gene, similar to those found in measles from cases of SSPE. The findings provide unique insights into the pathogenesis of paramyxovirus brain infections.
Adhesion activity of glyceraldehyde-3-phosphate dehydrogenase in a Chinese Streptococcus suis type 2 strain.

PubMed

Wang, Kaicheng; Lu, Chengping

2007-01-01

A total of 36 streptococcal strains, including seven S. equi ssp.zooepidemicus, two S. suis type 1 (SS1), 24 SS2, two SS9, and one SS7, were tested for glyceraldehyde-3-phosphate dehydrogenase gene (gapdh). Except from non-virulent SS2 strain T1 5, all strains harboured gapdh. The gapdh of Chinese Sichuan SS2 isolate ZY05719 and Jiangsu SS2 isolate HA9801 were sequenced and then compared with published sequences in the GenBank. The comparison revealed a 99.9 % and 99.8 % similarity of ZY05719 and HA9801, respectively, with the published sequence. Adherence assay data demonstrated a significant ((p<0.05)) reduction in adhesion of SS2 in HEp-2 cells pre-incubated with purified GAPDH compared to non pre-incubated controls, suggesting the GAPDH mediates SS2 bacterial adhesion to host cells.
Interpreting Mammalian Evolution using Fugu Genome Comparisons

DOE Office of Scientific and Technical Information (OSTI.GOV)

Stubbs, L; Ovcharenko, I; Loots, G G

2004-04-02

Comparative sequence analysis of the human and the pufferfish Fugu rubripes (fugu) genomes has revealed several novel functional coding and noncoding regions in the human genome. In particular, the fugu genome has been extremely valuable for identifying transcriptional regulatory elements in human loci harboring unusually high levels of evolutionary conservation to rodent genomes. In such regions, the large evolutionary distance between human and fishes provides an additional filter through which functional noncoding elements can be detected with high efficiency.
DNA primers for amplification of mitochondrial cytochrome c oxidase subunit I from diverse metazoan invertebrates.

PubMed

Folmer, O; Black, M; Hoeh, W; Lutz, R; Vrijenhoek, R

1994-10-01

We describe "universal" DNA primers for polymerase chain reaction (PCR) amplification of a 710-bp fragment of the mitochondrial cytochrome c oxidase subunit I gene (COI) from 11 invertebrate phyla: Echinodermata, Mollusca, Annelida, Pogonophora, Arthropoda, Nemertinea, Echiura, Sipuncula, Platyhelminthes, Tardigrada, and Coelenterata, as well as the putative phylum Vestimentifera. Preliminary comparisons revealed that these COI primers generate informative sequences for phylogenetic analyses at the species and higher taxonomic levels.
Mitochondrial control-region sequence variation in aboriginal Australians.

PubMed Central

van Holst Pellekaan, S; Frommer, M; Sved, J; Boettcher, B

1998-01-01

The mitochondrial D-loop hypervariable segment 1 (mt HVS1) between nucleotides 15997 and 16377 has been examined in aboriginal Australian people from the Darling River region of New South Wales (riverine) and from Yuendumu in central Australia (desert). Forty-seven unique HVS1 types were identified, varying at 49 nucleotide positions. Pairwise analysis by calculation of BEPPI (between population proportion index) reveals statistically significant structure in the populations, although some identical HVS1 types are seen in the two contrasting regions. mt HVS1 types may reflect more-ancient distributions than do linguistic diversity and other culturally distinguishing attributes. Comparison with sequences from five published global studies reveals that these Australians demonstrate greatest divergence from some Africans, least from Papua New Guinea highlanders, and only slightly more from some Pacific groups (Indonesian, Asian, Samoan, and coastal Papua New Guinea), although the HVS1 types vary at different nucleotide sites. Construction of a median network, displaying three main groups, suggests that several hypervariable nucleotide sites within the HVS1 are likely to have undergone mutation independently, making phylogenetic comparison with global samples by conventional methods difficult. Specific nucleotide-site variants are major separators in median networks constructed from Australian HVS1 types alone and for one global selection. The distribution of these, requiring extended study, suggests that they may be signatures of different groups of prehistoric colonizers into Australia, for which the time of colonization remains elusive. PMID:9463317
Proteomics and Deep Sequencing Comparison of Seasonally Active Venom Glands in the Platypus Reveals Novel Venom Peptides and Distinct Expression Profiles*

PubMed Central

Wong, Emily S. W.; Morgenstern, David; Mofiz, Ehtesham; Gombert, Sara; Morris, Katrina M.; Temple-Smith, Peter; Renfree, Marilyn B.; Whittington, Camilla M.; King, Glenn F.; Warren, Wesley C.; Papenfuss, Anthony T.; Belov, Katherine

2012-01-01

The platypus is a venomous monotreme. Male platypuses possess a spur on their hind legs that is connected to glands in the pelvic region. They produce venom only during the breeding season, presumably to fight off conspecifics. We have taken advantage of this unique seasonal production of venom to compare the transcriptomes of in- and out-of-season venom glands, in conjunction with proteomic analysis, to identify previously undiscovered venom genes. Comparison of the venom glands revealed distinct gene expression profiles that are consistent with changes in venom gland morphology and venom volumes in and out of the breeding season. Venom proteins were identified through shot-gun sequenced venom proteomes of three animals using RNA-seq-derived transcripts for peptide-spectral matching. 5,157 genes were expressed in the venom glands, 1,821 genes were up-regulated in the in-season gland, and 10 proteins were identified in the venom. New classes of platypus-venom proteins identified included antimicrobials, amide oxidase, serpin protease inhibitor, proteins associated with the mammalian stress response pathway, cytokines, and other immune molecules. Five putative toxins have only been identified in platypus venom: growth differentiation factor 15, nucleobindin-2, CD55, a CXC-chemokine, and corticotropin-releasing factor-binding protein. These novel venom proteins have potential biomedical and therapeutic applications and provide insights into venom evolution. PMID:22899769
Proteomics and deep sequencing comparison of seasonally active venom glands in the platypus reveals novel venom peptides and distinct expression profiles.

PubMed

Wong, Emily S W; Morgenstern, David; Mofiz, Ehtesham; Gombert, Sara; Morris, Katrina M; Temple-Smith, Peter; Renfree, Marilyn B; Whittington, Camilla M; King, Glenn F; Warren, Wesley C; Papenfuss, Anthony T; Belov, Katherine

2012-11-01

The platypus is a venomous monotreme. Male platypuses possess a spur on their hind legs that is connected to glands in the pelvic region. They produce venom only during the breeding season, presumably to fight off conspecifics. We have taken advantage of this unique seasonal production of venom to compare the transcriptomes of in- and out-of-season venom glands, in conjunction with proteomic analysis, to identify previously undiscovered venom genes. Comparison of the venom glands revealed distinct gene expression profiles that are consistent with changes in venom gland morphology and venom volumes in and out of the breeding season. Venom proteins were identified through shot-gun sequenced venom proteomes of three animals using RNA-seq-derived transcripts for peptide-spectral matching. 5,157 genes were expressed in the venom glands, 1,821 genes were up-regulated in the in-season gland, and 10 proteins were identified in the venom. New classes of platypus-venom proteins identified included antimicrobials, amide oxidase, serpin protease inhibitor, proteins associated with the mammalian stress response pathway, cytokines, and other immune molecules. Five putative toxins have only been identified in platypus venom: growth differentiation factor 15, nucleobindin-2, CD55, a CXC-chemokine, and corticotropin-releasing factor-binding protein. These novel venom proteins have potential biomedical and therapeutic applications and provide insights into venom evolution.
Molecular Comparison and Evolutionary Analyses of VP1 Nucleotide Sequences of New African Human Enterovirus 71 Isolates Reveal a Wide Genetic Diversity

PubMed Central

Nougairède, Antoine; Joffret, Marie-Line; Deshpande, Jagadish M.; Dubot-Pérès, Audrey; Héraud, Jean-Michel

2014-01-01

Most circulating strains of Human enterovirus 71 (EV-A71) have been classified primarily into three genogroups (A to C) on the basis of genetic divergence between the 1D gene, which encodes the VP1 capsid protein. The aim of the present study was to provide further insights into the diversity of the EV-A71 genogroups following the recent description of highly divergent isolates, in particular those from African countries, including Madagascar. We classified recent EV-A71 isolates by a large comparison of 3,346 VP1 nucleotidic sequences collected from GenBank. Analysis of genetic distances and phylogenetic investigations indicated that some recently-reported isolates did not fall into the genogroups A-C and clustered into three additional genogroups, including one Indian genogroup (genogroup D) and 2 African ones (E and F). Our Bayesian phylogenetic analysis provided consistent data showing that the genogroup D isolates share a recent common ancestor with the members of genogroup E, while the isolates of genogroup F evolved from a recent common ancestor shared with the members of the genogroup B. Our results reveal the wide diversity that exists among EV-A71 isolates and suggest that the number of circulating genogroups is probably underestimated, particularly in developing countries where EV-A71 epidemiology has been poorly studied. PMID:24598878
Evidence of accelerated evolution and ectodermal-specific expression of presumptive BDS toxin cDNAs from Anemonia viridis.

PubMed

Nicosia, Aldo; Maggio, Teresa; Mazzola, Salvatore; Cuttitta, Angela

2013-10-30

Anemonia viridis is a widespread and extensively studied Mediterranean species of sea anemone from which a large number of polypeptide toxins, such as blood depressing substances (BDS) peptides, have been isolated. The first members of this class, BDS-1 and BDS-2, are polypeptides belonging to the β-defensin fold family and were initially described for their antihypertensive and antiviral activities. BDS-1 and BDS-2 are 43 amino acid peptides characterised by three disulfide bonds that act as neurotoxins affecting Kv3.1, Kv3.2 and Kv3.4 channel gating kinetics. In addition, BDS-1 inactivates the Nav1.7 and Nav1.3 channels. The development of a large dataset of A. viridis expressed sequence tags (ESTs) and the identification of 13 putative BDS-like cDNA sequences has attracted interest, especially as scientific and diagnostic tools. A comparison of BDS cDNA sequences showed that the untranslated regions are more conserved than the protein-coding regions. Moreover, the KA/KS ratios calculated for all pairwise comparisons showed values greater than 1, suggesting mechanisms of accelerated evolution. The structures of the BDS homologs were predicted by molecular modelling. All toxins possess similar 3D structures that consist of a triple-stranded antiparallel β-sheet and an additional small antiparallel β-sheet located downstream of the cleavage/maturation site; however, the orientation of the triple-stranded β-sheet appears to differ among the toxins. To characterise the spatial expression profile of the putative BDS cDNA sequences, tissue-specific cDNA libraries, enriched for BDS transcripts, were constructed. In addition, the proper amplification of ectodermal or endodermal markers ensured the tissue specificity of each library. Sequencing randomly selected clones from each library revealed ectodermal-specific expression of ten BDS transcripts, while transcripts of BDS-8, BDS-13, BDS-14 and BDS-15 failed to be retrieved, likely due to under-representation in our cDNA libraries. The calculation of the relative abundance of BDS transcripts in the cDNA libraries revealed that BDS-1, BDS-3, BDS-4, BDS-5 and BDS-6 are the most represented transcripts.
Phylogeny of Banana Streak Virus reveals recent and repetitive endogenization in the genome of its banana host (Musa sp.).

PubMed

Gayral, Philippe; Iskra-Caruana, Marie-Line

2009-07-01

Banana streak virus (BSV) is a plant dsDNA pararetrovirus (family Caulimoviridae, genus badnavirus). Although integration is not an essential step in the BSV replication cycle, the nuclear genome of banana (Musa sp.) contains BSV endogenous pararetrovirus sequences (BSV EPRVs). Some BSV EPRVs are infectious by reconstituting a functional viral genome. Recent studies revealed a large molecular diversity of episomal BSV viruses (i.e., nonintegrated) while others focused on BSV EPRV sequences only. In this study, the evolutionary history of badnavirus integration in banana was inferred from phylogenetic relationships between BSV and BSV EPRVs. The relative evolution rates and selective pressures (d(N)/d(S) ratio) were also compared between endogenous and episomal viral sequences. At least 27 recent independent integration events occurred after the divergence of three banana species, indicating that viral integration is a recent and frequent phenomenon. Relaxation of selective pressure on badnaviral sequences that experienced neutral evolution after integration in the plant genome was recorded. Additionally, a significant decrease (35%) in the EPRV evolution rate was observed compared to BSV, reflecting the difference in the evolution rate between episomal dsDNA viruses and plant genome. The comparison of our results with the evolution rate of the Musa genome and other reverse-transcribing viruses suggests that EPRVs play an active role in episomal BSV diversity and evolution.
The Transcriptome Analysis of Strongyloides stercoralis L3i Larvae Reveals Targets for Intervention in a Neglected Disease

PubMed Central

Marcilla, Antonio; Garg, Gagan; Bernal, Dolores; Ranganathan, Shoba; Forment, Javier; Ortiz, Javier; Muñoz-Antolí, Carla; Dominguez, M. Victoria; Pedrola, Laia; Martinez-Blanch, Juan; Sotillo, Javier; Trelis, Maria; Toledo, Rafael; Esteban, J. Guillermo

2012-01-01

Background Strongyloidiasis is one of the most neglected diseases distributed worldwide with endemic areas in developed countries, where chronic infections are life threatening. Despite its impact, very little is known about the molecular biology of the parasite involved and its interplay with its hosts. Next generation sequencing technologies now provide unique opportunities to rapidly address these questions. Principal Findings Here we present the first transcriptome of the third larval stage of S. stercoralis using 454 sequencing coupled with semi-automated bioinformatic analyses. 253,266 raw sequence reads were assembled into 11,250 contiguous sequences, most of which were novel. 8037 putative proteins were characterized based on homology, gene ontology and/or biochemical pathways. Comparison of the transcriptome of S. strongyloides with those of other nematodes, including S. ratti, revealed similarities in transcription of molecules inferred to have key roles in parasite-host interactions. Enzymatic proteins, like kinases and proteases, were abundant. 1213 putative excretory/secretory proteins were compiled using a new pipeline which included non-classical secretory proteins. Potential drug targets were also identified. Conclusions Overall, the present dataset should provide a solid foundation for future fundamental genomic, proteomic and metabolomic explorations of S. stercoralis, as well as a basis for applied outcomes, such as the development of novel methods of intervention against this neglected parasite. PMID:22389732

Molecular characterization of amino acid deletion in VP1 (1D) protein and novel amino acid substitutions in 3D polymerase protein of foot and mouth disease virus subtype A/Iran87.

PubMed

Esmaelizad, Majid; Jelokhani-Niaraki, Saber; Hashemnejad, Khadije; Kamalzadeh, Morteza; Lotfi, Mohsen

2011-12-01

The nucleotide sequence of the VP1 (1D) and partial 3D polymerase (3D(pol)) coding regions of the foot and mouth disease virus (FMDV) vaccine strain A/Iran87, a highly passaged isolate (~150 passages), was determined and aligned with previously published FMDV serotype A sequences. Overall analysis of the amino acid substitutions revealed that the partial 3D(pol) coding region contained four amino acid alterations. Amino acid sequence comparison of the VP1 coding region of the field isolates revealed deletions in the highly passaged Iranian isolate (A/Iran87). The prominent G-H loop of the FMDV VP1 protein contains the conserved arginine-glycine-aspartic acid (RGD) tripeptide, which is a well-known ligand for a specific cell surface integrin. Despite losing the RGD sequence of the VP1 protein and an Asp(26)→Glu substitution in a beta sheet located within a small groove of the 3D(pol) protein, the virus grew in BHK 21 suspension cell cultures. Since this strain has been used as a vaccine strain, it may be inferred that the RGD deletion has no critical role in virus attachment to the cell during the initiation of infection. It is probable that this FMDV subtype can utilize other pathways for cell attachment.
Analysis of the Genome Structure of the Nonpathogenic Probiotic Escherichia coli Strain Nissle 1917

PubMed Central

Grozdanov, Lubomir; Raasch, Carsten; Schulze, Jürgen; Sonnenborn, Ulrich; Gottschalk, Gerhard; Hacker, Jörg; Dobrindt, Ulrich

2004-01-01

Nonpathogenic Escherichia coli strain Nissle 1917 (O6:K5:H1) is used as a probiotic agent in medicine, mainly for the treatment of various gastroenterological diseases. To gain insight on the genetic level into its properties of colonization and commensalism, this strain's genome structure has been analyzed by three approaches: (i) sequence context screening of tRNA genes as a potential indication of chromosomal integration of horizontally acquired DNA, (ii) sequence analysis of 280 kb of genomic islands (GEIs) coding for important fitness factors, and (iii) comparison of Nissle 1917 genome content with that of other E. coli strains by DNA-DNA hybridization. PCR-based screening of 324 nonpathogenic and pathogenic E. coli isolates of different origins revealed that some chromosomal regions are frequently detectable in nonpathogenic E. coli and also among extraintestinal and intestinal pathogenic strains. Many known fitness factor determinants of strain Nissle 1917 are localized on four GEIs which have been partially sequenced and analyzed. Comparison of these data with the available knowledge of the genome structure of E. coli K-12 strain MG1655 and of uropathogenic E. coli O6 strains CFT073 and 536 revealed structural similarities on the genomic level, especially between the E. coli O6 strains. The lack of defined virulence factors (i.e., alpha-hemolysin, P-fimbrial adhesins, and the semirough lipopolysaccharide phenotype) combined with the expression of fitness factors such as microcins, different iron uptake systems, adhesins, and proteases, which may support its survival and successful colonization of the human gut, most likely contributes to the probiotic character of E. coli strain Nissle 1917. PMID:15292145
A molecular perspective on a complex polymorphic inversion system with cytological evidence of multiply reused breakpoints.

PubMed

Orengo, D J; Puerma, E; Papaceit, M; Segarra, C; Aguadé, M

2015-06-01

Genome sequence comparison across the Drosophila genus revealed that some fixed inversion breakpoints had been multiply reused at this long timescale. Cytological studies of Drosophila inversion polymorphism had previously shown that, also at this shorter timescale, some breakpoints had been multiply reused. The paucity of molecularly characterized polymorphic inversion breakpoints has so far precluded contrasting whether cytologically shared breakpoints of these relatively young inversions are actually reused at the molecular level. The E chromosome of Drosophila subobscura stands out because it presents several inversion complexes. This is the case of the E1+2+9+3 arrangement that originated from the ancestral Est arrangement through the sequential accumulation of four inversions (E1, E2, E9 and E3) sharing some breakpoints. We recently identified the breakpoints of inversions E1 and E2, which allowed establishing reuse at the molecular level of the cytologically shared breakpoint of these inversions. Here, we identified and sequenced the breakpoints of inversions E9 and E3, because they share breakpoints at sections 58D and 64C with those of inversions E1 and E2. This has allowed establishing that E9 and E3 originated through the staggered-break mechanism. Most importantly, sequence comparison has revealed the multiple reuse at the molecular level of the proximal breakpoint (section 58D), which would have been used at least by inversions E2, E9 and E3. In contrast, the distal breakpoint (section 64C) might have been only reused once by inversions E1 and E2, because the distal E3 breakpoint is displaced >70 kb from the other breakpoint limits.
A molecular perspective on a complex polymorphic inversion system with cytological evidence of multiply reused breakpoints

PubMed Central

Orengo, D J; Puerma, E; Papaceit, M; Segarra, C; Aguadé, M

2015-01-01

Genome sequence comparison across the Drosophila genus revealed that some fixed inversion breakpoints had been multiply reused at this long timescale. Cytological studies of Drosophila inversion polymorphism had previously shown that, also at this shorter timescale, some breakpoints had been multiply reused. The paucity of molecularly characterized polymorphic inversion breakpoints has so far precluded contrasting whether cytologically shared breakpoints of these relatively young inversions are actually reused at the molecular level. The E chromosome of Drosophila subobscura stands out because it presents several inversion complexes. This is the case of the E1+2+9+3 arrangement that originated from the ancestral Est arrangement through the sequential accumulation of four inversions (E1, E2, E9 and E3) sharing some breakpoints. We recently identified the breakpoints of inversions E1 and E2, which allowed establishing reuse at the molecular level of the cytologically shared breakpoint of these inversions. Here, we identified and sequenced the breakpoints of inversions E9 and E3, because they share breakpoints at sections 58D and 64C with those of inversions E1 and E2. This has allowed establishing that E9 and E3 originated through the staggered-break mechanism. Most importantly, sequence comparison has revealed the multiple reuse at the molecular level of the proximal breakpoint (section 58D), which would have been used at least by inversions E2, E9 and E3. In contrast, the distal breakpoint (section 64C) might have been only reused once by inversions E1 and E2, because the distal E3 breakpoint is displaced >70 kb from the other breakpoint limits. PMID:25712227
A nucleotide sequence comparison of coxsackievirus B4 isolates from aquatic samples and clinical specimens.

PubMed Central

Hughes, M. S.; Hoey, E. M.; Coyle, P. V.

1993-01-01

Ten coxsackievirus B4 (CVB4) strains isolated from clinical and environmental sources in Northern Ireland in 1985-7, were compared at the nucleotide sequence level. Dideoxynucleotide sequencing of a polymerase chain reaction (PCR) amplified fragment, spanning the VP1/P2A genomic region, classified the isolates into two distinct groups or genotypes as defined by Rico-Hesse and colleagues for poliovirus type 1. Isolates within each group shared approximately 99% sequence identity at the nucleotide level whereas < or = 86% sequence identity was shared between groups. One isolate derived from a clinical specimen in 1987 was grouped with six CVB4 isolates recovered from the aquatic environment in 1986-7. The second group comprised CVB4 isolates from clinical specimens in 1985-6. Both groups were different at the nucleotide level from the prototype strain isolated in 1950. It was concluded that the method could be used to sub-type CVB4 isolates and would be of value in epidemiological studies of CVB4. Predicted amino acid sequences revealed non-conservation of the tyrosine residue at the VP1/P2A cleavage site but were of little value in distinguishing CVB4 variants. PMID:8386098
Effect of the reflectional symmetry on the coherent hole transport across DNA hairpins

NASA Astrophysics Data System (ADS)

Zarea, Mehdi; Berlin, Yuri; Ratner, Mark A.

2017-03-01

The coherent hole transfer in three types of DNA hairpins containing strands with adenine (A) and guanine (G) nucleobases has been studied. The investigated hairpins involve An+1GGAn, AnGAGAn, or (AG)2nA strands that connect the hole donor and hole acceptor located on opposite ends of hairpins. The positive charge transfer from the photo-excited donor to the acceptor is shown to be slower for An+1GGAn in comparison with AnGAGAn and (AG)2nA sequences. We have revealed that this is due to the reflectional symmetry of the last two sequences with respect to the axis passing through the middle base. As has been demonstrated, the symmetry of the sequence structure manifests itself in the reflectional symmetry of the energy eigenstates. In addition, it has been shown that (AG)2nA is the only symmetric sequence with a zero energy state in the middle of the LUMO tight-binding energy band. Based on our theoretical findings, we predict that the hairpin with this sequence should have the fastest coherent hole transfer rate among the class of base sequences studied.
Genome organisation and sequence comparison suggest intraspecies incongruence in M RNA of Watermelon bud necrosis virus.

PubMed

Kumar, Rakesh; Mandal, B; Geetanjali, A S; Jain, R K; Jaiwal, P K

2010-08-01

Watermelon bud necrosis virus (WBNV), a member of the genus Tospovirus, family Bunyaviridae is an important viral pathogen in watermelon cultivation in India. The complete genome sequence properties of WBNV are not available. In the present study, the complete M RNA sequence and the genome organisation of a WBNV isolate infecting watermelon in Delhi (WBNV-wDel) were determined. The M RNA was 4,794 nucleotides (nt) long and potentially coded for a movement protein (NSm) of 34.22 kDa (307 amino acids) on the viral sense strand and a Gn/Gc glycoprotein precursor of 127.15 kDa (1,121 amino acids) on the complementary strand. The two open reading frames were separated by an intergenic region of 402 nt. The 5' and 3' untranslated regions were 55 and 47 nt long, respectively, containing complementary termini typical of tospoviruses. WBNV-wDel was most closely related (79.1% identity) to Groundnut bud necrosis virus, an important tospovirus that occurs in several crops in India, and was different (63.3-75.2% identity) from the other cucurbit-infecting tospoviruses known to occur in Taiwan and Japan. Sequence analysis of NSm and Gn/Gc revealed phylogenetic incongruence between WBNV-wDel and another isolate originating from central India (WBNV-Wm-Som isolate). The Wm-Som isolate showed evolutionary divergence from the wDel isolate in the Gn/Gc protein (74.6% identity) potentially due to recombination with the other tospoviruses that are known to occur in India. This is the first report of a comparison of complete sequences of M RNA of WBNV.
Harnessing NGS and Big Data Optimally: Comparison of miRNA Prediction from Assembled versus Non-assembled Sequencing Data--The Case of the Grass Aegilops tauschii Complex Genome.

PubMed

Budak, Hikmet; Kantar, Melda

2015-07-01

MicroRNAs (miRNAs) are small, endogenous, non-coding RNA molecules that regulate gene expression at the post-transcriptional level. As high-throughput next generation sequencing (NGS) and Big Data rapidly accumulate for various species, efforts for in silico identification of miRNAs intensify. Surprisingly, the effect of the input genomics sequence on the robustness of miRNA prediction was not evaluated in detail to date. In the present study, we performed a homology-based miRNA and isomiRNA prediction of the 5D chromosome of bread wheat progenitor, Aegilops tauschii, using two distinct sequence data sets as input: (1) raw sequence reads obtained from 454-GS FLX Titanium sequencing platform and (2) an assembly constructed from these reads. We also compared this method with a number of available plant sequence datasets. We report here the identification of 62 and 22 miRNAs from raw reads and the assembly, respectively, of which 16 were predicted with high confidence from both datasets. While raw reads promoted sensitivity with the high number of miRNAs predicted, 55% (12 out of 22) of the assembly-based predictions were supported by previous observations, bringing specificity forward compared to the read-based predictions, of which only 37% were supported. Importantly, raw reads could identify several repeat-related miRNAs that could not be detected with the assembly. However, raw reads could not capture 6 miRNAs, for which the stem-loops could only be covered by the relatively longer sequences from the assembly. In summary, the comparison of miRNA datasets obtained by these two strategies revealed that utilization of raw reads, as well as assemblies for in silico prediction, have distinct advantages and disadvantages. Consideration of these important nuances can benefit future miRNA identification efforts in the current age of NGS and Big Data driven life sciences innovation.
Dual-echo ASL based assessment of motor networks: a feasibility study

NASA Astrophysics Data System (ADS)

Storti, Silvia Francesca; Boscolo Galazzo, Ilaria; Pizzini, Francesca B.; Menegaz, Gloria

2018-04-01

Objective. Dual-echo arterial spin labeling (DE-ASL) technique has been recently proposed for the simultaneous acquisition of ASL and blood-oxygenation-level-dependent (BOLD)-functional magnetic resonance imaging (fMRI) data. The assessment of this technique in detecting functional connectivity at rest or during motor and motor imagery tasks is still unexplored both per-se and in comparison with conventional methods. The purpose is to quantify the sensitivity of the DE-ASL sequence with respect to the conventional fMRI sequence (cvBOLD) in detecting brain activations, and to assess and compare the relevance of node features in decoding the network structure. Approach. Thirteen volunteers were scanned acquiring a pseudo-continuous DE-ASL sequence from which the concomitant BOLD (ccBOLD) simultaneously to the ASL can be extracted. The approach consists of two steps: (i) model-based analyses for assessing brain activations at individual and group levels, followed by statistical analysis for comparing the activation elicited by the three sequences under two conditions (motor and motor imagery), respectively; (ii) brain connectivity graph-theoretical analysis for assessing and comparing the network models properties. Main results. Our results suggest that cvBOLD and ccBOLD have comparable sensitivity in detecting the regions involved in the active task, whereas ASL offers a higher degree of co-localization with smaller activation volumes. The connectivity results and the comparative analysis of node features across sequences revealed that there are no strong changes between rest and tasks and that the differences between the sequences are limited to few connections. Significance. Considering the comparable sensitivity of the ccBOLD and cvBOLD sequences in detecting activated brain regions, the results demonstrate that DE-ASL can be successfully applied in functional studies allowing to obtain both ASL and BOLD information within a single sequence. Further, DE-ASL is a powerful technique for research and clinical applications allowing to perform quantitative comparisons as well as to characterize functional connectivity.
Pestoides F, an atypical Yersinia pestis strain from the former Soviet Union.

DOE Office of Scientific and Technical Information (OSTI.GOV)

Garcia, Emilio; Worsham, Patricia; Bearden, S.

2007-01-01

Unlike the classical Yersinia pestis strains, members of an atypical group of Y. pestis from Central Asia, denominated Y. pestis subspecies caucasica (also known as one of several pestoides types), are distinguished by a number of characteristics including their ability to ferment rhamnose and melibiose, their lack of the small plasmid encoding the plasminogen activator (pla) and pesticin, and their exceptionally large variants of the virulence plasmid pMT (encoding murine toxin and capsular antigen). We have obtained the entire genome sequence of Y. pestis Pestoides F, an isolate from the former Soviet Union that has enabled us to carryout amore » comprehensive genome-wide comparison of this organism's genomic content against the six published sequences of Y. pestis and their Y. pseudotuberculosis ancestor. Based on classical glycerol fermentation (+ve) and nitrate reduction (+ve) Y. pestis Pestoides F is an isolate that belongs to the biovar antiqua. This strain is unusual in other characteristics such as the fact that it carries a non-consensus V antigen (lcrV) sequence, and that unlike other Pla(-) strains, Pestoides F retains virulence by the parenteral and aerosol routes. The chromosome of Pestoides F is 4,517,345 bp in size comprising some 3,936 predicted coding sequences, while its pCD and pMT plasmids are 71,507 bp and 137,010 bp in size respectively. Comparison of chromosome-associated genes in Pestoides F with those in the other sequenced Y. pestis strains reveals differences ranging from strain-specific rearrangements, insertions, deletions, single nucleotide polymorphisms, and a unique distribution of insertion sequences. There is a single approximately 7 kb unique region in the chromosome not found in any of the completed Y. pestis strains sequenced to date, but which is present in the Y. pseudotuberculosis ancestor. Taken together, these findings are consistent with Pestoides F being derived from the most ancient lineage of Y. pestis yet sequenced.« less
Pestoides F, and Atypical Yersinia pestis Strain from the Former Soviet Union

DOE Office of Scientific and Technical Information (OSTI.GOV)

Garcia, E; Worsham, P; Bearden, S

2007-01-05

Unlike the classical Yersinia pestis strains, members of an atypical group of Y. pestis from Central Asia, denominated Y. pestis subspecies caucasica (also known as one of several pestoides types), are distinguished by a number of characteristics including their ability to ferment rhamnose and melibiose, their lacking the small plasmid encoding the plasminogen activator (pla) and pesticin, and their exceptionally large variants of the virulence plasmid pMT (encoding murine toxin and capsular antigen). We have obtained the entire genome sequence of Y. pestis Pestoides F, an isolate from the former Soviet Union that has enabled us to carryout a comprehensivemore » genome-wide comparison of this organism's genomic content against the six published sequences of Y. pestis and their Y. pseudotuberculosis ancestor. Based on classical glycerol fermentation (+ve) and nitrate reduction (+ve) Y. pestis Pestoides F is an isolate that belongs to the biovar antiqua. This strain is unusual in other characteristics such as the fact that it carries a non-consensus V antigen (lcrV) sequence, and that unlike other Pla{sup -} strains, Pestoides F retains virulence by the parenteral and aerosol routes. The chromosome of Pestoides F is 4,517,345 bp in size comprising some 3,936 predicted coding sequences, while its pCD and pMT plasmids are 71,507 bp and 137,010 bp in size respectively. Comparison of chromosome-associated genes in Pestoides F with those in the other sequenced Y. pestis strains, reveals a series of differences ranging from strain-specific rearrangements, insertions, deletions, single nucleotide polymorphisms, and a unique distribution of insertion sequences. There is a single {approx}7 kb unique region in the chromosome not found in any of the completed Y. pestis strains sequenced to date, but which is present in the Y. pseudotuberculosis ancestor. Taken together, these findings are consistent with Pestoides F being derived from the most ancient lineage of Y. pestis yet sequenced.« less
Genetic Analyses of the Internal Transcribed Spacer Sequences Suggest Introgression and Duplication in the Medicinal Mushroom Agaricus subrufescens

PubMed Central

Chen, Jie; Moinard, Magalie; Xu, Jianping; Wang, Shouxian; Foulongne-Oriol, Marie; Zhao, Ruilin; Hyde, Kevin D.; Callac, Philippe

2016-01-01

The internal transcribed spacer (ITS) region of the nuclear ribosomal RNA gene cluster is widely used in fungal taxonomy and phylogeographic studies. The medicinal and edible mushroom Agaricus subrufescens has a worldwide distribution with a high level of polymorphism in the ITS region. A previous analysis suggested notable ITS sequence heterogeneity within the wild French isolate CA487. The objective of this study was to investigate the pattern and potential mechanism of ITS sequence heterogeneity within this strain. Using PCR, cloning, and sequencing, we identified three types of ITS sequences, A, B, and C with a balanced distribution, which differed from each other at 13 polymorphic positions. The phylogenetic comparisons with samples from different continents revealed that the type C sequence was similar to those found in Oceanian and Asian specimens of A. subrufescens while types A and B sequences were close to those found in the Americas or in Europe. We further investigated the inheritance of these three ITS sequence types by analyzing their distribution among single-spore isolates from CA487. In this analysis, three co-dominant markers were used firstly to distinguish the homokaryotic offspring from the heterokaryotic offspring. The homokaryotic offspring were then analyzed for their ITS types. Our genetic analyses revealed that types A and B were two alleles segregating at one locus ITSI, while type C was not allelic with types A and B but was located at another unlinked locus ITSII. Furthermore, type C was present in only one of the two constitutive haploid nuclei (n) of the heterokaryotic (n+n) parent CA487. These data suggest that there was a relatively recent introduction of the type C sequence and a duplication of the ITS locus in this strain. Whether other genes were also transferred and duplicated and their impacts on genome structure and stability remain to be investigated. PMID:27228131
Complete genomic sequence of a Tobacco rattle virus isolate from Michigan-grown potatoes.

PubMed

Crosslin, James M; Hamm, Philip B; Kirk, William W; Hammond, Rosemarie W

2010-04-01

Tobacco rattle virus (TRV) causes stem mottle on potato leaves and necrotic arcs and rings in potato tubers, known as corky ringspot disease. Recently, TRV was reported in Michigan potato tubers cv. FL1879 exhibiting corky ringspot disease. Sequence analysis of the RNA-1-encoded 16-kDa gene of the Michigan isolate, designated MI-1, revealed homology to TRV isolates from Florida and Washington. Here, we report the complete genomic sequence of RNA-1 (6,791 nt) and RNA-2 (3,685 nt) of TRV MI-1. RNA-1 is predicted to contain four open reading frames, and the genome structure and phylogenetic analyses of the RNA-1 nucleotide sequence revealed significant homologies to the known sequences of other TRV-1 isolates. The relationships based on the full-length nucleotide sequence were different from than those based on the 16-kDa gene encoded on genomic RNA-1 and reflect sequence variation within a 20-25-aa residue region of the 16-kDa protein. MI-1 RNA-2 is predicted to contain three ORFs, encoding the coat protein (CP), a 37.6-kDa protein (ORF 2b), and a 33.6-kDa protein (ORF 2c). In addition, it contains a region of similarity to the 3' terminus of RNA-1, including a truncated portion of the 16-kDa cistron. Phylogenetic analysis of RNA-2, based on a comparison of nucleotide sequences with other members of the genus Tobravirus, indicates that TRV MI-1 and other North American isolates cluster as a distinct group. TRV M1-1 is only the second North American isolate for which there is a complete sequence of the genome, and it is distinct from the North American isolate TRV ORY. The relationship of the TRV MI-1 isolate to other tobravirus isolates is discussed.
Application of Genotyping during an Extensive Outbreak of Waterborne Giardiasis in Bergen, Norway, during Autumn and Winter 2004†

PubMed Central

Robertson, L. J.; Hermansen, L.; Gjerde, B. K.; Strand, E.; Alvsvåg, J. O.; Langeland, N.

2006-01-01

During the autumn and winter of 2004 and 2005, an extensive outbreak of waterborne giardiasis occurred in Bergen, Norway. Over 1,500 patients were diagnosed with giardiasis. Analysis of water from the implicated source revealed low numbers of Giardia cysts, but the initial contamination event probably occurred up to 10 weeks previously. While sewage leakage from a residential area is now considered to be the probable source of contamination, during the episode waste from one particular septic tank was thought to be a possible source. Genotyping of cysts from the septic tank demonstrated that they were assemblage A cysts, although the sequences were not identical to any previously published sequences. For the β-giardin gene, the closest published subgenotype was subgenotype A3; for the gdh gene, the closest published subgenotype was subgenotype A2. Genotyping of cysts from 21 patient samples revealed that they were assemblage B cysts; thus, the septic tank was unlikely to be the contamination source. Sequencing of the β-giardin and gdh genes from patient samples and a comparison of the sequences gave complex results. For the β-giardin gene, three isolates had sequences identical to subgenotype B3 sequences. However, other isolates had between one and four single-nucleotide polymorphisms (SNPs). For the gdh gene, none of the sequences were identical to the sequence published for subgenotype B3, and the sequences had between one and three SNPs. One isolate, which was identical to subgenotype B3 at the β-giardin gene, was more similar to subgenotype B2 at the gdh gene. Grouping the isolates on the basis of SNPs resulted in different groups for the two genes. The results are discussed in relation to giardiasis in Norway and to other Giardia genotyping studies. PMID:16517674
Algorithm, applications and evaluation for protein comparison by Ramanujan Fourier transform.

PubMed

Zhao, Jian; Wang, Jiasong; Hua, Wei; Ouyang, Pingkai

2015-12-01

The amino acid sequence of a protein determines its chemical properties, chain conformation and biological functions. Protein sequence comparison is of great importance to identify similarities of protein structures and infer their functions. Many properties of a protein correspond to the low-frequency signals within the sequence. Low frequency modes in protein sequences are linked to the secondary structures, membrane protein types, and sub-cellular localizations of the proteins. In this paper, we present Ramanujan Fourier transform (RFT) with a fast algorithm to analyze the low-frequency signals of protein sequences. The RFT method is applied to similarity analysis of protein sequences with the Resonant Recognition Model (RRM). The results show that the proposed fast RFT method on protein comparison is more efficient than commonly used discrete Fourier transform (DFT). RFT can detect common frequencies as significant feature for specific protein families, and the RFT spectrum heat-map of protein sequences demonstrates the information conservation in the sequence comparison. The proposed method offers a new tool for pattern recognition, feature extraction and structural analysis on protein sequences. Copyright © 2015 Elsevier Ltd. All rights reserved.
Cloning and Genomic Organization of a Rhamnogalacturonase Gene from Locally Isolated Strain of Aspergillus niger.

PubMed

Damak, Naourez; Abdeljalil, Salma; Taeib, Noomen Hadj; Gargouri, Ali

2015-08-01

The rhg gene encoding a rhamnogalacturonase was isolated from the novel strain A1 of Aspergillus niger. It consists of an ORF of 1.505 kb encoding a putative protein of 446 amino acids with a predicted molecular mass of 47 kDa, belonging to the family 28 of glycosyl hydrolases. The nature and position of amino acids comprising the active site as well as the three-dimensional structure were well conserved between the A. niger CTM10548 and fungal rhamnogalacturonases. The coding region of the rhg gene is interrupted by three short introns of 56 (introns 1 and 3) and 52 (intron 2) bp in length. The comparison of the peptide sequence with A. niger rhg sequences revealed that the A1 rhg should be an endo-rhamnogalacturonases, more homologous to rhg A than rhg B A. niger known enzymes. The comparison of rhg nucleotide sequence from A. niger A1 with rhg A from A. niger shows several base changes. Most of these changes (59 %) are located at the third base of codons suggesting maintaining the same enzyme function. We used the rhamnogalacturonase A from Aspergillus aculeatus as a template to build a structural model of rhg A1 that adopted a right-handed parallel β-helix.
Genetic divergence analysis of the Common Barn Owl Tyto alba (Scopoli, 1769) and the Short-eared Owl Asio flammeus (Pontoppidan, 1763) from southern Chile using COI sequence

PubMed Central

Colihueque, Nelson; Gantz, Alberto; Rau, Jaime Ricardo; Parraguez, Margarita

2015-01-01

Abstract In this paper new mitochondrial COI sequences of Common Barn Owl Tyto alba (Scopoli, 1769) and Short-eared Owl Asio flammeus (Pontoppidan, 1763) from southern Chile are reported and compared with sequences from other parts of the World. The intraspecific genetic divergence (mean p-distance) was 4.6 to 5.5% for the Common Barn Owl in comparison with specimens from northern Europe and Australasia and 3.1% for the Short-eared Owl with respect to samples from north America, northern Europe and northern Asia. Phylogenetic analyses revealed three distinctive groups for the Common Barn Owl: (i) South America (Chile and Argentina) plus Central and North America, (ii) northern Europe and (iii) Australasia, and two distinctive groups for the Short-eared Owl: (i) South America (Chile and Argentina) and (ii) north America plus northern Europe and northern Asia. The level of genetic divergence observed in both species exceeds the upper limit of intraspecific comparisons reported previously for Strigiformes. Therefore, this suggests that further research is needed to assess the taxonomic status, particularly for the Chilean populations that, to date, have been identified as belonging to these species through traditional taxonomy. PMID:26668551
Illumina sequencing-based analyses of bacterial communities during short-chain fatty-acid production from food waste and sewage sludge fermentation at different pH values.

PubMed

Cheng, Weixiao; Chen, Hong; Yan, ShuHai; Su, Jianqiang

2014-09-01

Short-chain fatty acids (SCFAs) can be produced by primary and waste activated sludge anaerobic fermentation. The yield and product spectrum distribution of SCFAs can be significantly affected by different initial pH values. However, most studies have focused on the physical and chemical aspects of SCFA production by waste activated sludge fermentation at different pH values. Information on the bacterial community structures during acidogenic fermentation is limited. In this study, comparisons of the bacterial communities during the co-substrate fermentation of food wastes and sewage sludge at different pH values were performed using the barcoded Illumina paired-end sequencing method. The results showed that different pH environments harbored a characteristic bacterial community, including sequences related to Lactobacillus, Prevotella, Mitsuokella, Treponema, Clostridium, and Ureibacillus. The most abundant bacterial operational taxonomic units in the different pH environments were those related to carbohydrate-degrading bacteria, which are associated with constituents of co-substrate fermentation. Further analyses showed that during organic matter fermentation, a core microbiota composed of Firmicutes, Proteobacteria, and Bacteroidetes existed. Comparison analyses revealed that the bacterial community during fermentation was significantly affected by the pH, and that the diverse product distribution was related to the shift in bacterial communities.
Long-read sequencing improves assembly of Trichinella genomes 10-fold, revealing substantial synteny between lineages diverged over 7 million years.

PubMed

Thompson, Peter C; Zarlenga, Dante S; Liu, Ming-Yuan; Rosenthal, Benjamin M

2017-09-01

Genome assemblies can form the basis of comparative analyses fostering insight into the evolutionary genetics of a parasite's pathogenicity, host-pathogen interactions, environmental constraints and invasion biology; however, the length and complexity of many parasite genomes has hampered the development of well-resolved assemblies. In order to improve Trichinella genome assemblies, the genome of the sylvatic encapsulated species Trichinella murrelli was sequenced using third-generation, long-read technology and, using syntenic comparisons, scaffolded to a reference genome assembly of Trichinella spiralis, markedly improving both. A high-quality draft assembly for T. murrelli was achieved that totalled 63·2 Mbp, half of which was condensed into 26 contigs each longer than 571 000 bp. When compared with previous assemblies for parasites in the genus, ours required 10-fold fewer contigs, which were five times longer, on average. Better assembly across repetitive regions also enabled resolution of 8 Mbp of previously indeterminate sequence. Furthermore, syntenic comparisons identified widespread scaffold misassemblies in the T. spiralis reference genome. The two new assemblies, organized for the first time into three chromosomal scaffolds, will be valuable resources for future studies linking phenotypic traits within each species to their underlying genetic bases.
Insights into the evolution of Yersinia pestis through whole-genome comparison with Yersinia pseudotuberculosis

DOE Office of Scientific and Technical Information (OSTI.GOV)

Chain, Patrick S. G.; Carniel, E.; Larimer, Frank W

2004-09-01

Yersinia pestis, the causative agent of plague, is a highly uniform clone that diverged recently from the enteric pathogen Yersinia pseudotuberculosis. Despite their close genetic relationship, they differ radically in their pathogenicity and transmission. Here, we report the complete genomic sequence of Y. pseudotuberculosis IP32953 and its use for detailed genome comparisons with available Y. pestis sequences. Analyses of identified differences across a panel of Yersinia isolates from around the world reveal 32 Y. pestis chromosomal genes that, together with the two Y. pestis-specific plasmids, to our knowledge, represent the only new genetic material in Y. pestis acquired since themore » the divergence from Y. pseudotuberculosis. In contrast, 149 other pseudogenes (doubling the previous estimate) and 317 genes absent from Y. pestis were detected, indicating that as many as 13% of Y. pseudotuberculosis genes no longer function in Y. pestis. Extensive insertion sequence-mediated genome rearrangements and reductive evolution through massive gene loss, resulting in elimination and modification of preexisting gene expression pathways, appear to be more important than acquisition of genes in the evolution of Y. pestis. These results provide a sobering example of how a highly virulent epidemic clone can suddenly emerge from a less virulent, closely related progenitor.« less

Serratia symbiotica from the aphid Cinara cedri: a missing link from facultative to obligate insect endosymbiont.

PubMed

Lamelas, Araceli; Gosalbes, María José; Manzano-Marín, Alejandro; Peretó, Juli; Moya, Andrés; Latorre, Amparo

2011-11-01

The genome sequencing of Buchnera aphidicola BCc from the aphid Cinara cedri, which is the smallest known Buchnera genome, revealed that this bacterium had lost its symbiotic role, as it was not able to synthesize tryptophan and riboflavin. Moreover, the biosynthesis of tryptophan is shared with the endosymbiont Serratia symbiotica SCc, which coexists with B. aphidicola in this aphid. The whole-genome sequencing of S. symbiotica SCc reveals an endosymbiont in a stage of genome reduction that is closer to an obligate endosymbiont, such as B. aphidicola from Acyrthosiphon pisum, than to another S. symbiotica, which is a facultative endosymbiont in this aphid, and presents much less gene decay. The comparison between both S. symbiotica enables us to propose an evolutionary scenario of the transition from facultative to obligate endosymbiont. Metabolic inferences of B. aphidicola BCc and S. symbiotica SCc reveal that most of the functions carried out by B. aphidicola in A. pisum are now either conserved in B. aphidicola BCc or taken over by S. symbiotica. In addition, there are several cases of metabolic complementation giving functional stability to the whole consortium and evolutionary preservation of the actors involved.
The three-dimensional structure of "Lonely Guy" from Claviceps purpurea provides insights into the phosphoribohydrolase function of Rossmann fold-containing lysine decarboxylase-like proteins.

PubMed

Dzurová, Lenka; Forneris, Federico; Savino, Simone; Galuszka, Petr; Vrabka, Josef; Frébort, Ivo

2015-08-01

The recently discovered cytokinin (CK)-specific phosphoribohydrolase "Lonely Guy" (LOG) is a key enzyme of CK biosynthesis, converting inactive CK nucleotides into biologically active free bases. We have determined the crystal structures of LOG from Claviceps purpurea (cpLOG) and its complex with the enzymatic product phosphoribose. The structures reveal a dimeric arrangement of Rossmann folds, with the ligands bound to large pockets at the interface between cpLOG monomers. Structural comparisons highlight the homology of cpLOG to putative lysine decarboxylases. Extended sequence analysis enabled identification of a distinguishing LOG sequence signature. Taken together, our data suggest phosphoribohydrolase activity for several proteins of unknown function. © 2015 Wiley Periodicals, Inc.
Genetic analysis of Fasciola isolates from cattle in Korea based on second internal transcribed spacer (ITS-2) sequence of nuclear ribosomal DNA.

PubMed

Choe, Se-Eun; Nguyen, Thuy Thi-Dieu; Kang, Tae-Gyu; Kweon, Chang-Hee; Kang, Seung-Won

2011-09-01

Nuclear ribosomal DNA sequence of the second internal transcribed spacer (ITS-2) has been used efficiently to identify the liver fluke species collected from different hosts and various geographic regions. ITS-2 sequences of 19 Fasciola samples collected from Korean native cattle were determined and compared. Sequence comparison including ITS-2 sequences of isolates from this study and reference sequences from Fasciola hepatica and Fasciola gigantica and intermediate Fasciola in Genbank revealed seven identical variable sites of investigated isolates. Among 19 samples, 12 individuals had ITS-2 sequences completely identical to that of pure F. hepatica, five possessed the sequences identical to F. gigantica type, whereas two shared the sequence of both F. hepatica and F. gigantica. No variations in length and nucleotide composition of ITS-2 sequence were observed within isolates that belonged to F. hepatica or F. gigantica. At the position of 218, five Fasciola containing a single-base substitution (C>T) formed a distinct branch inside the F. gigantica-type group which was similar to those of Asian-origin isolates. The phylogenetic tree of the Fasciola spp. based on complete ITS-2 sequences from this study and other representative isolates in different locations clearly showed that pure F. hepatica, F. gigantica type and intermediate Fasciola were observed. The result also provided additional genetic evidence for the existence of three forms of Fasciola isolated from native cattle in Korea by genetic approach using ITS-2 sequence.
A comparative analysis of the foamy and ortho virus capsid structures reveals an ancient domain duplication.

PubMed

Taylor, William R; Stoye, Jonathan P; Taylor, Ian A

2017-04-04

The Spumaretrovirinae (foamy viruses) and the Orthoretrovirinae (e.g. HIV) share many similarities both in genome structure and the sequences of the core viral encoded proteins, such as the aspartyl protease and reverse transcriptase. Similarity in the gag region of the genome is less obvious at the sequence level but has been illuminated by the recent solution of the foamy virus capsid (CA) structure. This revealed a clear structural similarity to the orthoretrovirus capsids but with marked differences that left uncertainty in the relationship between the two domains that comprise the structure. We have applied protein structure comparison methods in order to try and resolve this ambiguous relationship. These included both the DALI method and the SAP method, with rigorous statistical tests applied to the results of both methods. For this, we employed collections of artificial fold 'decoys' (generated from the pair of native structures being compared) to provide a customised background distribution for each comparison, thus allowing significance levels to be estimated. We have shown that the relationship of the two domains conforms to a simple linear correspondence rather than a domain transposition. These similarities suggest that the origin of both viral capsids was a common ancestor with a double domain structure. In addition, we show that there is also a significant structural similarity between the amino and carboxy domains in both the foamy and ortho viruses. These results indicate that, as well as the duplication of the double domain capsid, there may have been an even more ancient gene-duplication that preceded the double domain structure. In addition, our structure comparison methodology demonstrates a general approach to problems where the components have a high intrinsic level of similarity.
The genome and phenome of the green alga Chloroidium sp. UTEX 3007 reveal adaptive traits for desert acclimatization.

PubMed

Nelson, David R; Khraiwesh, Basel; Fu, Weiqi; Alseekh, Saleh; Jaiswal, Ashish; Chaiboonchoe, Amphun; Hazzouri, Khaled M; O'Connor, Matthew J; Butterfoss, Glenn L; Drou, Nizar; Rowe, Jillian D; Harb, Jamil; Fernie, Alisdair R; Gunsalus, Kristin C; Salehi-Ashtiani, Kourosh

2017-06-17

To investigate the phenomic and genomic traits that allow green algae to survive in deserts, we characterized a ubiquitous species, Chloroidium sp. UTEX 3007 , which we isolated from multiple locations in the United Arab Emirates (UAE). Metabolomic analyses of Chloroidium sp. UTEX 3007 indicated that the alga accumulates a broad range of carbon sources, including several desiccation tolerance-promoting sugars and unusually large stores of palmitate. Growth assays revealed capacities to grow in salinities from zero to 60 g/L and to grow heterotrophically on >40 distinct carbon sources. Assembly and annotation of genomic reads yielded a 52.5 Mbp genome with 8153 functionally annotated genes. Comparison with other sequenced green algae revealed unique protein families involved in osmotic stress tolerance and saccharide metabolism that support phenomic studies. Our results reveal the robust and flexible biology utilized by a green alga to successfully inhabit a desert coastline.
Sequence Analysis of the Genome of Carnation (Dianthus caryophyllus L.)

PubMed Central

Yagi, Masafumi; Kosugi, Shunichi; Hirakawa, Hideki; Ohmiya, Akemi; Tanase, Koji; Harada, Taro; Kishimoto, Kyutaro; Nakayama, Masayoshi; Ichimura, Kazuo; Onozaki, Takashi; Yamaguchi, Hiroyasu; Sasaki, Nobuhiro; Miyahara, Taira; Nishizaki, Yuzo; Ozeki, Yoshihiro; Nakamura, Noriko; Suzuki, Takamasa; Tanaka, Yoshikazu; Sato, Shusei; Shirasawa, Kenta; Isobe, Sachiko; Miyamura, Yoshinori; Watanabe, Akiko; Nakayama, Shinobu; Kishida, Yoshie; Kohara, Mitsuyo; Tabata, Satoshi

2014-01-01

The whole-genome sequence of carnation (Dianthus caryophyllus L.) cv. ‘Francesco’ was determined using a combination of different new-generation multiplex sequencing platforms. The total length of the non-redundant sequences was 568 887 315 bp, consisting of 45 088 scaffolds, which covered 91% of the 622 Mb carnation genome estimated by k-mer analysis. The N50 values of contigs and scaffolds were 16 644 bp and 60 737 bp, respectively, and the longest scaffold was 1 287 144 bp. The average GC content of the contig sequences was 36%. A total of 1050, 13, 92 and 143 genes for tRNAs, rRNAs, snoRNA and miRNA, respectively, were identified in the assembled genomic sequences. For protein-encoding genes, 43 266 complete and partial gene structures excluding those in transposable elements were deduced. Gene coverage was ∼98%, as deduced from the coverage of the core eukaryotic genes. Intensive characterization of the assigned carnation genes and comparison with those of other plant species revealed characteristic features of the carnation genome. The results of this study will serve as a valuable resource for fundamental and applied research of carnation, especially for breeding new carnation varieties. Further information on the genomic sequences is available at http://carnation.kazusa.or.jp. PMID:24344172
Genome-wide comparison and taxonomic relatedness of multiple Xylella fastidiosa strains reveal the occurrence of three subspecies and a new Xylella species.

PubMed

Marcelletti, Simone; Scortichini, Marco

2016-10-01

A total of 21 Xylella fastidiosa strains were assessed by comparing their genomes to infer their taxonomic relationships. The whole-genome-based average nucleotide identity and tetranucleotide frequency correlation coefficient analyses were performed. In addition, a consensus tree based on comparisons of 956 core gene families, and a genome-wide phylogenetic tree and a Neighbor-net network were constructed with 820,088 nucleotides (i.e., approximately 30-33 % of the entire X. fastidiosa genome). All approaches revealed the occurrence of three well-demarcated genetic clusters that represent X. fastidiosa subspecies fastidiosa, multiplex and pauca, with the latter appeared to diverge. We suggest that the proposed but never formally described subspecies 'sandyi' and 'morus' are instead members of the subspecies fastidiosa. These analyses support the view that the Xylella strain isolated from Pyrus pyrifolia in Taiwan is likely to be a new species. A widely used multilocus sequence typing analysis yielded conflicting results.
Phosphoproteomic investigation of a solvent producing bacterium Clostridium acetobutylicum.

PubMed

Bai, Xue; Ji, Zhihong

2012-07-01

In this study, we employed TiO₂ enrichment and high accuracy liquid chromatography-mass spectrometry-mass spectrometry to identify the phosphoproteome of Clostridium acetobutyicum ATCC824 in acidogenesis and solventogenesis. As many as 82 phosphopeptides in 61 proteins, with 107 phosphorylated sites on serine, threonine, or tyrosine, were identified with high confidence. We detected 52 phosphopeptides from 44 proteins in acidogenesis and 70 phosphopeptides from 51 proteins in solventogenesis, respectively. Bioinformatic analysis revealed most of the phosphoproteins located in cytoplasm and participated in carbon metabolism. Based on comparison between the two stages, we found 27 stage-specific phosphorylated proteins (10 in acidogenesis and 17 in solventogenesis), some of which were solvent production-related enzymes and metabolic regulators, showed significantly different phosphorylated status. Further analysis indicated that protein phosphorylation could be involved in the shift of stages or in solvent production pathway directly. Comparison against several other organisms revealed the evolutionary diversity among them on phosphorylation level in spite of their high homology on protein sequence level.
Nucleotide and deduced amino acid sequence of the envelope gene of the Vasilchenko strain of TBE virus; comparison with other flaviviruses.

PubMed

Gritsun, T S; Frolova, T V; Pogodina, V V; Lashkevich, V A; Venugopal, K; Gould, E A

1993-02-01

A strain of tick-borne encephalitis virus known as Vasilchenko (Vs) exhibits relatively low virulence characteristics in monkeys, Syrian hamsters and humans. The gene encoding the envelope glycoprotein of this virus was cloned and sequenced. Alignment of the sequence with those of other known tick-borne flaviviruses and identification of the recognised amino acid genetic marker EHLPTA confirmed its identity as a member of the TBE complex. However, Vs virus was distinguishable from eastern and western tick-borne serotypes by the presence of the sequence AQQ at amino acid positions 232-234 and also by the presence of other specific amino acid substitutions which may be genetic markers for these viruses and could determine their pathogenetic characteristics. When compared with other tick-borne flaviviruses, Vs virus had 12 unique amino acid substitutions including an additional potential glycosylation site at position (315-317). The Vs virus strain shared closest nucleotide and amino acid homology (84.5% and 95.5% respectively) with western and far eastern strains of tick-borne encephalitis virus. Comparison with the far eastern serotype of tick-borne encephalitis virus, by cross-immunoelectrophoresis of Vs virions and PAGE analysis of the extracted virion proteins, revealed differences in surface charge and virus stability that may account for the different virulence characteristics of Vs virus. These results support and enlarge upon previous data obtained from molecular and serological analysis.
Comparative genomics of four closely related Clostridium perfringens bacteriophages reveals variable evolution among core genes with therapeutic potential

PubMed Central

2011-01-01

Background Because biotechnological uses of bacteriophage gene products as alternatives to conventional antibiotics will require a thorough understanding of their genomic context, we sequenced and analyzed the genomes of four closely related phages isolated from Clostridium perfringens, an important agricultural and human pathogen. Results Phage whole-genome tetra-nucleotide signatures and proteomic tree topologies correlated closely with host phylogeny. Comparisons of our phage genomes to 26 others revealed three shared COGs; of particular interest within this core genome was an endolysin (PF01520, an N-acetylmuramoyl-L-alanine amidase) and a holin (PF04531). Comparative analyses of the evolutionary history and genomic context of these common phage proteins revealed two important results: 1) strongly significant host-specific sequence variation within the endolysin, and 2) a protein domain architecture apparently unique to our phage genomes in which the endolysin is located upstream of its associated holin. Endolysin sequences from our phages were one of two very distinct genotypes distinguished by variability within the putative enzymatically-active domain. The shared or core genome was comprised of genes with multiple sequence types belonging to five pfam families, and genes belonging to 12 pfam families, including the holin genes, which were nearly identical. Conclusions Significant genomic diversity exists even among closely-related bacteriophages. Holins and endolysins represent conserved functions across divergent phage genomes and, as we demonstrate here, endolysins can have significant variability and host-specificity even among closely-related genomes. Endolysins in our phage genomes may be subject to different selective pressures than the rest of the genome. These findings may have important implications for potential biotechnological applications of phage gene products. PMID:21631945
Comparative analysis of complete orthologous centromeres from two subspecies of rice reveals rapid variation of centromere organization and structure.

PubMed

Wu, Jianzhong; Fujisawa, Masaki; Tian, Zhixi; Yamagata, Harumi; Kamiya, Kozue; Shibata, Michie; Hosokawa, Satomi; Ito, Yukiyo; Hamada, Masao; Katagiri, Satoshi; Kurita, Kanako; Yamamoto, Mayu; Kikuta, Ari; Machita, Kayo; Karasawa, Wataru; Kanamori, Hiroyuki; Namiki, Nobukazu; Mizuno, Hiroshi; Ma, Jianxin; Sasaki, Takuji; Matsumoto, Takashi

2009-12-01

Centromeres are sites for assembly of the chromosomal structures that mediate faithful segregation at mitosis and meiosis. This function is conserved across species, but the DNA components that are involved in kinetochore formation differ greatly, even between closely related species. To shed light on the nature, evolutionary timing and evolutionary dynamics of rice centromeres, we decoded a 2.25-Mb DNA sequence covering the centromeric region of chromosome 8 of an indica rice variety, 'Kasalath' (Kas-Cen8). Analysis of repetitive sequences in Kas-Cen8 led to the identification of 222 long terminal repeat (LTR)-retrotransposon elements and 584 CentO satellite monomers, which account for 59.2% of the region. A comparison of the Kas-Cen8 sequence with that of japonica rice 'Nipponbare' (Nip-Cen8) revealed that about 66.8% of the Kas-Cen8 sequence was collinear with that of Nip-Cen8. Although the 27 putative genes are conserved between the two subspecies, only 55.4% of the total LTR-retrotransposon elements in 'Kasalath' had orthologs in 'Nipponbare', thus reflecting recent proliferation of a considerable number of LTR-retrotransposons since the divergence of two rice subspecies of indica and japonica within Oryza sativa. Comparative analysis of the subfamilies, time of insertion, and organization patterns of inserted LTR-retrotransposons between the two Cen8 regions revealed variations between 'Kasalath' and 'Nipponbare' in the preferential accumulation of CRR elements, and the expansion of CentO satellite repeats within the core domain of Cen8. Together, the results provide insights into the recent proliferation of LTR-retrotransposons, and the rapid expansion of CentO satellite repeats, underlying the dynamic variation and plasticity of plant centromeres.
Overlapping protective roles for glutathione transferase gene family members in chemical and oxidative stress response in Agrobacterium tumefaciens.

PubMed

Skopelitou, Katholiki; Muleta, Abdi W; Pavli, Ourania; Skaracis, Georgios N; Flemetakis, Emmanouil; Papageorgiou, Anastassios C; Labrou, Nikolaos E

2012-03-01

In the present work, we describe the characterisation of the glutathione transferase (GST) gene family from Agrobacterium tumefaciens C58. A genome survey revealed the presence of eight GST-like proteins in A. tumefaciens (AtuGSTs). Comparison by multiple sequence alignment generated a dendrogram revealing the phylogenetic relationships of AtuGSTs-like proteins. The beta and theta classes identified in other bacterial species are represented by five members in A. tumefaciens C58. In addition, there are three "orphan" sequences that do not fit into any previously recognised GST classes. The eight GST-like genes were cloned, expressed in Escherichia coli and their substrate specificity was determined towards 17 different substrates. The results showed that AtuGSTs catalyse a broad range of reactions, with different members of the family exhibiting quite varied substrate specificity. The 3D structures of AtuGSTs were predicted using molecular modelling. The use of comparative sequence and structural analysis of the AtuGST isoenzymes allowed us to identify local sequence and structural characteristics between different GST isoenzymes and classes. Gene expression profiling was conducted under normal culture conditions as well as under abiotic stress conditions (addition of xenobiotics, osmotic stress and cold and heat shock) to induce and monitor early stress-response mechanisms. The results reveal the constitutive expression of GSTs in A. tumefaciens and a modulation of GST activity after treatments, indicating that AtuGSTs presumably participate in a wide range of functions, many of which are important in counteracting stress conditions. These functions may be relevant to maintaining cellular homeostasis as well as in the direct detoxification of toxic compounds.
Analysis of the Gull Fecal Microbial Community Reveals the Dominance of Catellicoccus marimammalium in Relation to Culturable Enterococci

PubMed Central

Koskey, Amber M.; Fisher, Jenny C.; Traudt, Mary F.; Newton, Ryan J.

2014-01-01

Gulls are prevalent in beach environments and can be a major source of fecal contamination. Gulls have been shown to harbor a high abundance of fecal indicator bacteria (FIB), such as Escherichia coli and enterococci, which can be readily detected as part of routine beach monitoring. Despite the ubiquitous presence of gull fecal material in beach environments, the associated microbial community is relatively poorly characterized. We generated comprehensive microbial community profiles of gull fecal samples using Roche 454 and Illumina MiSeq platforms to investigate the composition and variability of the gull fecal microbial community and to measure the proportion of FIB. Enterococcaceae and Enterobacteriaceae were the two most abundant families in our gull samples. Sequence comparisons between short-read data and nearly full-length 16S rRNA gene clones generated from the same samples revealed Catellicoccus marimammalium as the most numerous taxon among all samples. The identification of bacteria from gull fecal pellets cultured on membrane-Enterococcus indoxyl-β-d-glucoside (mEI) plates showed that the dominant sequences recovered in our sequence libraries did not represent organisms culturable on mEI. Based on 16S rRNA gene sequencing of gull fecal isolates cultured on mEI plates, 98.8% were identified as Enterococcus spp., 1.2% were identified as Streptococcus spp., and none were identified as C. marimammalium. Illumina deep sequencing indicated that gull fecal samples harbor significantly higher proportions of C. marimammalium 16S rRNA gene sequences (>50-fold) relative to typical mEI culturable Enterococcus spp. C. marimammalium therefore can be confidently utilized as a genetic marker to identify gull fecal pollution in the beach environment. PMID:24242244
Microbial diversity in degraded and non-degraded petroleum samples and comparison across oil reservoirs at local and global scales.

PubMed

Sierra-Garcia, Isabel Natalia; Dellagnezze, Bruna M; Santos, Viviane P; Chaves B, Michel R; Capilla, Ramsés; Santos Neto, Eugenio V; Gray, Neil; Oliveira, Valeria M

2017-01-01

Microorganisms have shown their ability to colonize extreme environments including deep subsurface petroleum reservoirs. Physicochemical parameters may vary greatly among petroleum reservoirs worldwide and so do the microbial communities inhabiting these different environments. The present work aimed at the characterization of the microbiota in biodegraded and non-degraded petroleum samples from three Brazilian reservoirs and the comparison of microbial community diversity across oil reservoirs at local and global scales using 16S rRNA clone libraries. The analysis of 620 16S rRNA bacterial and archaeal sequences obtained from Brazilian oil samples revealed 42 bacterial OTUs and 21 archaeal OTUs. The bacterial community from the degraded oil was more diverse than the non-degraded samples. Non-degraded oil samples were overwhelmingly dominated by gammaproteobacterial sequences with a predominance of the genera Marinobacter and Marinobacterium. Comparisons of microbial diversity among oil reservoirs worldwide suggested an apparent correlation of prokaryotic communities with reservoir temperature and depth and no influence of geographic distance among reservoirs. The detailed analysis of the phylogenetic diversity across reservoirs allowed us to define a core microbiome encompassing three bacterial classes (Gammaproteobacteria, Clostridia, and Bacteroidia) and one archaeal class (Methanomicrobia) ubiquitous in petroleum reservoirs and presumably owning the abilities to sustain life in these environments.
Emergence and subsequent functional specialization of kindlins during evolution of cell adhesiveness

PubMed Central

Meller, Julia; Rogozin, Igor B.; Poliakov, Eugenia; Meller, Nahum; Bedanov-Pack, Mark; Plow, Edward F.; Qin, Jun; Podrez, Eugene A.; Byzova, Tatiana V.

2015-01-01

Kindlins are integrin-interacting proteins essential for integrin-mediated cell adhesiveness. In this study, we focused on the evolutionary origin and functional specialization of kindlins as a part of the evolutionary adaptation of cell adhesive machinery. Database searches revealed that many members of the integrin machinery (including talin and integrins) existed before kindlin emergence in evolution. Among the analyzed species, all metazoan lineages—but none of the premetazoans—had at least one kindlin-encoding gene, whereas talin was present in several premetazoan lineages. Kindlin appears to originate from a duplication of the sequence encoding the N-terminal fragment of talin (the talin head domain) with a subsequent insertion of the PH domain of separate origin. Sequence analysis identified a member of the actin filament–associated protein 1 (AFAP1) superfamily as the most likely origin of the kindlin PH domain. The functional divergence between kindlin paralogues was assessed using the sequence swap (chimera) approach. Comparison of kindlin 2 (K2)/kindlin 3 (K3) chimeras revealed that the F2 subdomain, in particular its C-terminal part, is crucial for the differential functional properties of K2 and K3. The presence of this segment enables K2 but not K3 to localize to focal adhesions. Sequence analysis of the C-terminal part of the F2 subdomain of K3 suggests that insertion of a variable glycine-rich sequence in vertebrates contributed to the loss of constitutive K3 targeting to focal adhesions. Thus emergence and subsequent functional specialization of kindlins allowed multicellular organisms to develop additional tissue-specific adaptations of cell adhesiveness. PMID:25540429
Identification of Genes and Pathways Related to Phenol Degradation in Metagenomic Libraries from Petroleum Refinery Wastewater

PubMed Central

Silva, Cynthia C.; Hayden, Helen; Sawbridge, Tim; Mele, Pauline; De Paula, Sérgio O.; Silva, Lívia C. F.; Vidigal, Pedro M. P.; Vicentini, Renato; Sousa, Maíra P.; Torres, Ana Paula R.; Santiago, Vânia M. J.; Oliveira, Valéria M.

2013-01-01

Two fosmid libraries, totaling 13,200 clones, were obtained from bioreactor sludge of petroleum refinery wastewater treatment system. The library screening based on PCR and biological activity assays revealed more than 400 positive clones for phenol degradation. From these, 100 clones were randomly selected for pyrosequencing in order to evaluate the genetic potential of the microorganisms present in wastewater treatment plant for biodegradation, focusing mainly on novel genes and pathways of phenol and aromatic compound degradation. The sequence analysis of selected clones yielded 129,635 reads at an estimated 17-fold coverage. The phylogenetic analysis showed Burkholderiales and Rhodocyclales as the most abundant orders among the selected fosmid clones. The MG-RAST analysis revealed a broad metabolic profile with important functions for wastewater treatment, including metabolism of aromatic compounds, nitrogen, sulphur and phosphorus. The predicted 2,276 proteins included phenol hydroxylases and cathecol 2,3- dioxygenases, involved in the catabolism of aromatic compounds, such as phenol, byphenol, benzoate and phenylpropanoid. The sequencing of one fosmid insert of 33 kb unraveled the gene that permitted the host, Escherichia coli EPI300, to grow in the presence of aromatic compounds. Additionally, the comparison of the whole fosmid sequence against bacterial genomes deposited in GenBank showed that about 90% of sequence showed no identity to known sequences of Proteobacteria deposited in the NCBI database. This study surveyed the functional potential of fosmid clones for aromatic compound degradation and contributed to our knowledge of the biodegradative capacity and pathways of microbial assemblages present in refinery wastewater treatment system. PMID:23637911
Biochemical Characterization of a Mycobacteriophage Derived DnaB Ortholog Reveals New Insight into the Evolutionary Origin of DnaB Helicases

PubMed Central

Bhowmik, Priyanka; Das Gupta, Sujoy K.

2015-01-01

The bacterial replicative helicases known as DnaB are considered to be members of the RecA superfamily. All members of this superfamily, including DnaB, have a conserved C- terminal domain, known as the RecA core. We unearthed a series of mycobacteriophage encoded proteins in which the RecA core domain alone was present. These proteins were phylogenetically related to each other and formed a distinct clade within the RecA superfamily. A mycobacteriophage encoded protein, Wildcat Gp80 that roots deep in the DnaB family, was found to possess a core domain having significant sequence homology (Expect value < 10-5) with members of this novel cluster. This indicated that Wildcat Gp80, and by extrapolation, other members of the DnaB helicase family, may have evolved from a single domain RecA core polypeptide belonging to this novel group. Biochemical investigations confirmed that Wildcat Gp80 was a helicase. Surprisingly, our investigations also revealed that a thioredoxin tagged truncated version of the protein in which the N-terminal sequences were removed was fully capable of supporting helicase activity, although its ATP dependence properties were different. DnaB helicase activity is thus, primarily a function of the RecA core although additional N-terminal sequences may be necessary for fine tuning its activity and stability. Based on sequence comparison and biochemical studies we propose that DnaB helicases may have evolved from single domain RecA core proteins having helicase activities of their own, through the incorporation of additional N-terminal sequences. PMID:26237048
Circulation of Endemic Type 2 Vaccine-Derived Poliovirus in Egypt from 1983 to 1993

PubMed Central

Yang, Chen-Fu; Naguib, Tary; Yang, Su-Ju; Nasr, Eman; Jorba, Jaume; Ahmed, Nahed; Campagnoli, Ray; van der Avoort, Harrie; Shimizu, Hiroyuki; Yoneyama, Tetsuo; Miyamura, Tatsuo; Pallansch, Mark; Kew, Olen

2003-01-01

From 1988 to 1993, 30 cases of poliomyelitis associated with poliovirus type 2 were found in seven governorates of Egypt. Because many of the cases were geographically and temporally clustered and because the case isolates differed antigenically from the vaccine strain, it was initially assumed that the cases signaled the continued circulation of wild type 2 poliovirus. However, comparison of sequences encoding the major capsid protein, VP1 (903 nucleotides), revealed that the isolates were related (93 to 97% nucleotide sequence identity) to the Sabin type 2 oral poliovirus vaccine (OPV) strain and unrelated (<82% nucleotide sequence identity) to the wild type 2 polioviruses previously indigenous to Egypt (last known isolate: 1979) or to any contemporary wild type 2 polioviruses found elsewhere. The rate and pattern of VP1 divergence among the circulating vaccine-derived poliovirus (cVDPV) isolates suggested that all lineages were derived from a single OPV infection that occurred around 1983 and that progeny from the initiating infection circulated for approximately a decade within Egypt along several independent chains of transmission. Complete genomic sequences of an early (1988) and a late (1993) cVDPV isolate revealed that their 5′ untranslated region (5′ UTR) and noncapsid- 3′ UTR sequences were derived from other species C enteroviruses. Circulation of type 2 cVDPVs occurred at a time of low OPV coverage in the affected communities and ceased when OPV coverage rates increased. The potential for cVDPVs to circulate in populations with low immunity to poliovirus has important implications for current and future strategies to eradicate polio worldwide. PMID:12857906
The Genetic Diversity, Haplotype Analysis, and Phylogenetic Relationship of Aedes albopictus (Diptera: Culicidae) Based on the Cytochrome Oxidase 1 Marker: A Malaysian Scenario.

PubMed

Ismail, Nurul-Ain; Adilah-Amrannudin, Nurul; Hamsidi, Mayamin; Ismail, Rodziah; Dom, Nazri Che; Ahmad, Abu Hassan; Mastuki, Mohd Fahmi; Camalxaman, Siti Nazrina

2017-11-07

The global expansion of Ae. albopictus from its native range in Southeast Asia has been implicated in the recent emergence of dengue endemicity in Malaysia. Genetic variability studies of Ae. albopictus are currently lacking in the Malaysian setting, yet are crucial to enhancing the existing vector control strategies. The study was conducted to establish the genetic variability of maternally inherited mitochondrial DNA encoding for cytochrome oxidase subunit 1 (CO1) gene in Ae. albopictus. Twelve localities were selected in the Subang Jaya district based on temporal indices utilizing 120 mosquito samples. Genetic polymorphism and phylogenetic analysis were conducted to unveil the genetic variability and geographic origins of Ae. albopictus. The haplotype network was mapped to determine the genealogical relationship of sequences among groups of population in the Asian region. Comparison of Malaysian CO1 sequences with sequences derived from five Asian countries revealed genetically distinct Ae. albopictus populations. Phylogenetic analysis revealed that all sequences from other Asian countries descended from the same genetic lineage as the Malaysian sequences. Noteworthy, our study highlights the discovery of 20 novel haplotypes within the Malaysian population which to date had not been reported. These findings could help determine the genetic variation of this invasive species, which in turn could possibly improve the current dengue vector surveillance strategies, locally and regionally. © The Authors 2017. Published by Oxford University Press on behalf of Entomological Society of America. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Solution Structure of Acidocin B, a Circular Bacteriocin Produced by Lactobacillus acidophilus M46

PubMed Central

Acedo, Jeella Z.; van Belkum, Marco J.; Lohans, Christopher T.; McKay, Ryan T.; Miskolzie, Mark

2015-01-01

Acidocin B, a bacteriocin produced by Lactobacillus acidophilus M46, was originally reported to be a linear peptide composed of 59 amino acid residues. However, its high sequence similarity to gassericin A, a circular bacteriocin from Lactobacillus gasseri LA39, suggested that acidocin B might be circular as well. Acidocin B was purified from culture supernatant by a series of hydrophobic interaction chromatographic steps. Its circular nature was ascertained by matrix-assisted laser desorption ionization–time of flight (MALDI-TOF) mass spectrometry and tandem mass spectrometry (MS/MS) sequencing. The peptide sequence was found to consist of 58 amino acids with a molecular mass of 5,621.5 Da. The sequence of the acidocin B biosynthetic gene cluster was also determined and showed high nucleotide sequence similarity to that of gassericin A. The nuclear magnetic resonance (NMR) solution structure of acidocin B in sodium dodecyl sulfate micelles was elucidated, revealing that it is composed of four α-helices of similar length that are folded to form a compact, globular bundle with a central pore. This is a three-dimensional structure for a member of subgroup II circular bacteriocins, which are classified based on their isoelectric points of ∼7 or lower. Comparison of acidocin B with carnocyclin A, a subgroup I circular bacteriocin with four α-helices and a pI of 10, revealed differences in the overall folding. The observed variations could be attributed to inherent diversity in their physical properties, which also required the use of different solvent systems for three-dimensional structural elucidation. PMID:25681186

Lelliottia aquatilis sp. nov., isolated from drinking water.

PubMed

Kämpfer, Peter; Glaeser, Stefanie P; Packroff, Gabriele; Behringer, Katja; Exner, Martin; Chakraborty, Trinad; Schmithausen, Ricarda M; Doijad, Swapnil

2018-06-22

Five beige-pigmented, oxidase-negative bacterial isolates, 6331-17 T , 6332-17, 6333-17, 6334-17 and 9827-07, isolated either from a drinking water storage reservoir or drinking water in 2006 and 2017 in Germany, were examined in detail applying by a polyphasic taxonomic approach. Cells of the isolates were rod-shaped and Gram-stain-negative. Comparison of the 16S rRNA gene sequences of these five isolates showed highest sequence similarities to Lelliottia amnigena (99.98 %) and Lelliottia nimipressuralis (99.99 %). Multilocus sequence analyses based on concatenated partial rpoB, gyrB, infB and atpD sequences confirmed the clustering of these isolates with Lelliottia species, but also revealed a clear distinction to the closest related type strains. Analysis of the genome sequences of these isolates indicated >70 % in silico DNA-DNA hybridization and high average nucleotide identities between strains. Nevertheless, they showed only <70 and <95 % similarity to the type strains of these two Lelliottia species. The fatty acid profiles of these isolates were very similar and consisted of the major fatty acids C16:0, C17 : 0cyclo, C15 : 0iso 2-OH/C16 : 1ω7c and C18 : 1ω7c. In addition, physiological/biochemical tests revealed high phenotypic similarity to each other. These cumulative data indicate that these isolates represent a novel Lelliottia species, for which the name Lelliottia aquatilis sp. nov. is proposed, with strain 6331-17 T (=CCM 8846 T =CIP 111609 T =LMG 30560 T ) as the type strain.
Circulation of endemic type 2 vaccine-derived poliovirus in Egypt from 1983 to 1993.

PubMed

Yang, Chen-Fu; Naguib, Tary; Yang, Su-Ju; Nasr, Eman; Jorba, Jaume; Ahmed, Nahed; Campagnoli, Ray; van der Avoort, Harrie; Shimizu, Hiroyuki; Yoneyama, Tetsuo; Miyamura, Tatsuo; Pallansch, Mark; Kew, Olen

2003-08-01

From 1988 to 1993, 30 cases of poliomyelitis associated with poliovirus type 2 were found in seven governorates of Egypt. Because many of the cases were geographically and temporally clustered and because the case isolates differed antigenically from the vaccine strain, it was initially assumed that the cases signaled the continued circulation of wild type 2 poliovirus. However, comparison of sequences encoding the major capsid protein, VP1 (903 nucleotides), revealed that the isolates were related (93 to 97% nucleotide sequence identity) to the Sabin type 2 oral poliovirus vaccine (OPV) strain and unrelated (<82% nucleotide sequence identity) to the wild type 2 polioviruses previously indigenous to Egypt (last known isolate: 1979) or to any contemporary wild type 2 polioviruses found elsewhere. The rate and pattern of VP1 divergence among the circulating vaccine-derived poliovirus (cVDPV) isolates suggested that all lineages were derived from a single OPV infection that occurred around 1983 and that progeny from the initiating infection circulated for approximately a decade within Egypt along several independent chains of transmission. Complete genomic sequences of an early (1988) and a late (1993) cVDPV isolate revealed that their 5' untranslated region (5' UTR) and noncapsid- 3' UTR sequences were derived from other species C enteroviruses. Circulation of type 2 cVDPVs occurred at a time of low OPV coverage in the affected communities and ceased when OPV coverage rates increased. The potential for cVDPVs to circulate in populations with low immunity to poliovirus has important implications for current and future strategies to eradicate polio worldwide.
Comparison of the Compositions of the Stool Microbiotas of Infants Fed Goat Milk Formula, Cow Milk-Based Formula, or Breast Milk

PubMed Central

Lawley, Blair; Munro, Karen; Gowri Pathmanathan, Siva; Zhou, Shao J.; Makrides, Maria; Gibson, Robert A.; Sullivan, Thomas; Prosser, Colin G.; Lowry, Dianne; Hodgkinson, Alison J.

2013-01-01

The aim of the study was to compare the compositions of the fecal microbiotas of infants fed goat milk formula to those of infants fed cow milk formula or breast milk as the gold standard. Pyrosequencing of 16S rRNA gene sequences was used in the analysis of the microbiotas in stool samples collected from 90 Australian babies (30 in each group) at 2 months of age. Beta-diversity analysis of total microbiota sequences and Lachnospiraceae sequences revealed that they were more similar in breast milk/goat milk comparisons than in breast milk/cow milk comparisons. The Lachnospiraceae were mostly restricted to a single species (Ruminococcus gnavus) in breast milk-fed and goat milk-fed babies compared to a more diverse collection in cow milk-fed babies. Bifidobacteriaceae were abundant in the microbiotas of infants in all three groups. Bifidobacterium longum, Bifidobacterium breve, and Bifidobacterium bifidum were the most commonly detected bifidobacterial species. A semiquantitative PCR method was devised to differentiate between B. longum subsp. longum and B. longum subsp. infantis and was used to test stool samples. B. longum subsp. infantis was seldom present in stools, even of breast milk-fed babies. The presence of B. bifidum in the stools of breast milk-fed infants at abundances greater than 10% of the total microbiota was associated with the highest total abundances of Bifidobacteriaceae. When Bifidobacteriaceae abundance was low, Lachnospiraceae abundances were greater. New information about the composition of the fecal microbiota when goat milk formula is used in infant nutrition was thus obtained. PMID:23455335
Comparison of the compositions of the stool microbiotas of infants fed goat milk formula, cow milk-based formula, or breast milk.

PubMed

Tannock, Gerald W; Lawley, Blair; Munro, Karen; Gowri Pathmanathan, Siva; Zhou, Shao J; Makrides, Maria; Gibson, Robert A; Sullivan, Thomas; Prosser, Colin G; Lowry, Dianne; Hodgkinson, Alison J

2013-05-01

The aim of the study was to compare the compositions of the fecal microbiotas of infants fed goat milk formula to those of infants fed cow milk formula or breast milk as the gold standard. Pyrosequencing of 16S rRNA gene sequences was used in the analysis of the microbiotas in stool samples collected from 90 Australian babies (30 in each group) at 2 months of age. Beta-diversity analysis of total microbiota sequences and Lachnospiraceae sequences revealed that they were more similar in breast milk/goat milk comparisons than in breast milk/cow milk comparisons. The Lachnospiraceae were mostly restricted to a single species (Ruminococcus gnavus) in breast milk-fed and goat milk-fed babies compared to a more diverse collection in cow milk-fed babies. Bifidobacteriaceae were abundant in the microbiotas of infants in all three groups. Bifidobacterium longum, Bifidobacterium breve, and Bifidobacterium bifidum were the most commonly detected bifidobacterial species. A semiquantitative PCR method was devised to differentiate between B. longum subsp. longum and B. longum subsp. infantis and was used to test stool samples. B. longum subsp. infantis was seldom present in stools, even of breast milk-fed babies. The presence of B. bifidum in the stools of breast milk-fed infants at abundances greater than 10% of the total microbiota was associated with the highest total abundances of Bifidobacteriaceae. When Bifidobacteriaceae abundance was low, Lachnospiraceae abundances were greater. New information about the composition of the fecal microbiota when goat milk formula is used in infant nutrition was thus obtained.
Analysis of Bacterial Community Structure in Sulfurous-Oil-Containing Soils and Detection of Species Carrying Dibenzothiophene Desulfurization (dsz) Genes

PubMed Central

Duarte, Gabriela Frois; Rosado, Alexandre Soares; Seldin, Lucy; de Araujo, Welington; van Elsas, Jan Dirk

2001-01-01

The selective effects of sulfur-containing hydrocarbons, with respect to changes in bacterial community structure and selection of desulfurizing organisms and genes, were studied in soil. Samples taken from a polluted field soil (A) along a concentration gradient of sulfurous oil and from soil microcosms treated with dibenzothiophene (DBT)-containing petroleum (FSL soil) were analyzed. Analyses included plate counts of total bacteria and of DBT utilizers, molecular community profiling via soil DNA-based PCR-denaturing gradient gel electrophoresis (PCR-DGGE), and detection of genes that encode enzymes involved in the desulfurization of hydrocarbons, i.e., dszA, dszB, and dszC.Data obtained from the A soil showed no discriminating effects of oil levels on the culturable bacterial numbers on either medium used. Generally, counts of DBT degraders were 10- to 100-fold lower than the total culturable counts. However, PCR-DGGE showed that the numbers of bands detected in the molecular community profiles decreased with increasing oil content of the soil. Analysis of the sequences of three prominent bands of the profiles generated with the highly polluted soil samples suggested that the underlying organisms were related to Actinomyces sp., Arthrobacter sp., and a bacterium of uncertain affiliation. dszA, dszB, and dszC genes were present in all A soil samples, whereas a range of unpolluted soils gave negative results in this analysis. Results from the study of FSL soil revealed minor effects of the petroleum-DBT treatment on culturable bacterial numbers and clear effects on the DBT-utilizing communities. The molecular community profiles were largely stable over time in the untreated soil, whereas they showed a progressive change over time following treatment with DBT-containing petroleum. Direct PCR assessment revealed the presence of dszB-related signals in the untreated FSL soil and the apparent selection of dszA- and dszC-related sequences by the petroleum-DBT treatment. PCR-DGGE applied to sequential enrichment cultures in DBT-containing sulfur-free basal salts medium prepared from the A and treated FSL soils revealed the selection of up to 10 distinct bands. Sequencing a subset of these bands provided evidence for the presence of organisms related to Pseudomonas putida, a Pseudomonas sp., Stenotrophomonas maltophilia, and Rhodococcus erythropolis. Several of 52 colonies obtained from the A and FSL soils on agar plates with DBT as the sole sulfur source produced bands that matched the migration of bands selected in the enrichment cultures. Evidence for the presence of dszB in 12 strains was obtained, whereas dszA and dszC genes were found in only 7 and 6 strains, respectively. Most of the strains carrying dszA or dszC were classified as R. erythropolis related, and all revealed the capacity to desulfurize DBT. A comparison of 37 dszA sequences, obtained via PCR from the A and FSL soils, from enrichments of these soils, and from isolates, revealed the great similarity of all sequences to the canonical (R. erythropolis strain IGTS8) dszA sequence and a large degree of internal conservation. The 37 sequences recovered were grouped in three clusters. One group, consisting of 30 sequences, was minimally 98% related to the IGTS8 sequence, a second group of 2 sequences was slightly different, and a third group of 5 sequences was 95% similar. The first two groups contained sequences obtained from both soil types and enrichment cultures (including isolates), but the last consisted of sequences obtained directly from the polluted A soil. PMID:11229891
Lepidopteran HMG-CoA reductase is a potential selective target for pest control

PubMed Central

Li, Yuan-mei; Huang, Juan; Tobe, Stephen S.

2017-01-01

As a consequence of the negative impacts on the environment of some insecticides, discovery of eco-friendly insecticides and target has received global attention in recent years. Sequence alignment and structural comparison of the rate-limiting enzyme HMG-CoA reductase (HMGR) revealed differences between lepidopteran pests and other organisms, which suggested insect HMGR could be a selective insecticide target candidate. Inhibition of JH biosynthesis in vitro confirmed that HMGR inhibitors showed a potent lethal effect on the lepidopteran pest Manduca sexta, whereas there was little effect on JH biosynthesis in Apis mellifera and Diploptera punctata. The pest control application of these inhibitors demonstrated that they can be insecticide candidates with potent ovicidal activity, larvicidal activity and insect growth regulatory effects. The present study has validated that Lepidopteran HMGR can be a potent selective insecticide target, and the HMGR inhibitors (especially type II statins) could be selective insecticide candidates and lead compounds. Furthermore, we demonstrated that sequence alignment, homology modeling and structural comparison may be useful for determining potential enzymes or receptors which can be eco-friendly pesticide targets. PMID:28133568
Lepidopteran HMG-CoA reductase is a potential selective target for pest control.

PubMed

Li, Yuan-Mei; Kai, Zhen-Peng; Huang, Juan; Tobe, Stephen S

2017-01-01

As a consequence of the negative impacts on the environment of some insecticides, discovery of eco-friendly insecticides and target has received global attention in recent years. Sequence alignment and structural comparison of the rate-limiting enzyme HMG-CoA reductase (HMGR) revealed differences between lepidopteran pests and other organisms, which suggested insect HMGR could be a selective insecticide target candidate. Inhibition of JH biosynthesis in vitro confirmed that HMGR inhibitors showed a potent lethal effect on the lepidopteran pest Manduca sexta , whereas there was little effect on JH biosynthesis in Apis mellifera and Diploptera punctata . The pest control application of these inhibitors demonstrated that they can be insecticide candidates with potent ovicidal activity, larvicidal activity and insect growth regulatory effects. The present study has validated that Lepidopteran HMGR can be a potent selective insecticide target, and the HMGR inhibitors (especially type II statins) could be selective insecticide candidates and lead compounds. Furthermore, we demonstrated that sequence alignment, homology modeling and structural comparison may be useful for determining potential enzymes or receptors which can be eco-friendly pesticide targets.
Conservative secondary structure motifs already present in early-stage folding (in silico) as found in serpines family.

PubMed

Brylinski, Michal; Konieczny, Leszek; Kononowicz, Andrzej; Roterman, Irena

2008-03-21

The well-known procedure implemented in ClustalW oriented on the sequence comparison was applied to structure comparison. The consensus sequence as well as consensus structure has been defined for proteins belonging to serpine family. The structure of early stage intermediate was the object for similarity search. The high values of W(sequence) appeared to be accordant with high values of W(structure) making possible structure comparison using common criteria for sequence and structure comparison. Since the early stage structural form has been created according to limited conformational sub-space which does not include the beta-structure (this structure is mediated by C7eq structural form), is particularly important to see, that the C7eq structural form may be treated as the seed for beta-structure present in the final native structure of protein. The applicability of ClustalW procedure to structure comparison makes these two comparisons unified.
Intestinal microbiota composition in fishes is influenced by host ecology and environment.

PubMed

Wong, Sandi; Rawls, John F

2012-07-01

The digestive tracts of vertebrates are colonized by complex assemblages of micro-organisms, collectively called the gut microbiota. Recent studies have revealed important contributions of gut microbiota to vertebrate health and disease, stimulating intense interest in understanding how gut microbial communities are assembled and how they impact host fitness (Sekirov et al. 2010). Although all vertebrates harbour a gut microbiota, current information on microbiota composition and function has been derived primarily from mammals. Comparisons of different mammalian species have revealed intriguing associations between gut microbiota composition and host diet, anatomy and phylogeny (Ley et al. 2008b). However, mammals constitute <10% of all vertebrate species, and it remains unclear whether similar associations exist in more diverse and ancient vertebrate lineages such as fish. In this issue, Sullam et al. (2012) make an important contribution toward identifying factors determining gut microbiota composition in fishes. The authors conducted a detailed meta-analysis of 25 bacterial 16S rRNA gene sequence libraries derived from the intestines of different fish species. To provide a broader context for their analysis, they compared these data sets to a large collection of 16S rRNA gene sequence data sets from diverse free-living and host-associated bacterial communities. Their results suggest that variation in gut microbiota composition in fishes is strongly correlated with species habitat salinity, trophic level and possibly taxonomy. Comparison of data sets from fish intestines and other environments revealed that fish gut microbiota compositions are often similar to those of other animals and contain relatively few free-living environmental bacteria. These results suggest that the gut microbiota composition of fishes is not a simple reflection of the micro-organisms in their local habitat but may result from host-specific selective pressures within the gut (Bevins & Salzman 2011).
Pseudomonas syringae pv. actinidiae Draft Genomes Comparison Reveal Strain-Specific Features Involved in Adaptation and Virulence to Actinidia Species

PubMed Central

Marcelletti, Simone; Ferrante, Patrizia; Petriccione, Milena; Firrao, Giuseppe; Scortichini, Marco

2011-01-01

A recent re-emerging bacterial canker disease incited by Pseudomonas syringae pv. actinidiae (Psa) is causing severe economic losses to Actinidia chinensis and A. deliciosa cultivations in southern Europe, New Zealand, Chile and South Korea. Little is known about the genetic features of this pathovar. We generated genome-wide Illumina sequence data from two Psa strains causing outbreaks of bacterial canker on the A. deliciosa cv. Hayward in Japan (J-Psa, type-strain of the pathovar) and in Italy (I-Psa) in 1984 and 1992, respectively as well as from a Psa strain (I2-Psa) isolated at the beginning of the recent epidemic on A. chinensis cv. Hort16A in Italy. All strains were isolated from typical leaf spot symptoms. The phylogenetic relationships revealed that Psa is more closely related to P. s. pv. theae than to P. avellanae within genomospecies 8. Comparative genomic analyses revealed both relevant intrapathovar variations and putative pathovar-specific genomic regions in Psa. The genomic sequences of J-Psa and I-Psa were very similar. Conversely, the I2-Psa genome encodes four additional effector protein genes, lacks a 50 kb plasmid and the phaseolotoxin gene cluster, argK-tox but has acquired a 160 kb plasmid and putative prophage sequences. Several lines of evidence from the analysis of the genome sequences support the hypothesis that this strain did not evolve from the Psa population that caused the epidemics in 1984–1992 in Japan and Italy but rather is the product of a recent independent evolution of the pathovar actinidiae for infecting Actinidia spp. All Psa strains share the genetic potential for copper resistance, antibiotic detoxification, high affinity iron acquisition and detoxification of nitric oxide of plant origin. Similar to other sequenced phytopathogenic pseudomonads associated with woody plant species, the Psa strains isolated from leaves also display a set of genes involved in the catabolism of plant-derived aromatic compounds. PMID:22132095
Molecular Characterization of the Skate Peripherin/rds Gene: Relationship to Its Orthologues and Paralogues

PubMed Central

Li, Chibo; Ding, Xi-Qin; O’Brien, John; Al-Ubaidi, Muayyad R.

2010-01-01

PURPOSE A great deal of information about functionally significant domains of a protein may be obtained by comparison of primary sequences of gene homologues over a broad phylogenetic base. This study was designed to identify evolutionarily conserved domains of the photoreceptor disc membrane protein peripherin/rds by analysis of the homologue in a primitive vertebrate, the skate. METHODS A skate retinal cDNA library was screened using a mouse peripherin/rds clone. The 5′ and 3′ untranslated regions of the skate peripherin/rds (srds) cDNA were isolated by the rapid amplification of cDNA ends (RACE) approach. The gene structure was characterized by PCR amplification and sequencing of genomic fragments. Northern and Western blot analyses were used to identify srds transcript and protein, respectively. RESULTS A new homologue of peripherin/rds was identified from the skate retinal cDNA library. SRDS is a glycoprotein with a predicted molecular mass of 40.2 kDa. The srds gene consists of two exons and one small intron and transcribes into a single 6-kb message. Phylogenetic analysis places SRDS at the base of peripherin/rds family and near the division of that group and the branch leading to rds-like and rom-1 genes. SRDS protein is 54.5% identical with peripherin/rds across species. Identity is significantly higher (73%) in the intradiscal domains. Sequence comparison revealed the conservation of all residues that have been shown, on mutation, to associate with retinitis pigmentosa and showed conservation of most residues associated with macular dystrophies. Comparison with ROM-1 and other rds-like proteins revealed the presence of a highly conserved domain in the large intradiscal loop. CONCLUSIONS Srds represents the skate orthologue of mammalian peripherin/rds genes. Conservation of most of the residues associated with human retinal diseases indicates that these residues serve important functional roles. The high degree of conservation of a short stretch within the large intradiscal loop also suggests an important function for this domain. PMID:12766040
Endosymbiont diversity and prevalence in herbivorous spider mite populations in South-Western Europe.

PubMed

Zélé, Flore; Santos, Inês; Olivieri, Isabelle; Weill, Mylène; Duron, Olivier; Magalhães, Sara

2018-04-01

Bacterial endosymbionts are known as important players of the evolutionary ecology of their hosts. However, their distribution, prevalence and diversity are still largely unexplored. To this aim, we investigated infections by the most common bacterial reproductive manipulators in herbivorous spider mites of South-Western Europe. Across 16 populations belonging to three Tetranychus species, Wolbachia was the most prevalent (ca. 61%), followed by Cardinium (12%-15%), while only few individuals were infected by Rickettsia (0.9%-3%), and none carried Arsenophonus or Spiroplasma. These endosymbionts are here reported for the first time in Tetranychus evansi and Tetranychus ludeni, and showed variable infection frequencies between and within species, with several cases of coinfections. Moreover, Cardinium was more prevalent in Wolbachia-infected individuals, which suggests facilitation between these symbionts. Finally, sequence comparisons revealed no variation of the Wolbachia wsp and Rickettsia gtlA genes, but some diversity of the Cardinium 16S rRNA, both between and within populations of the three mite species. Some of the Cardinium sequences identified belonged to distantly-related clades, and the lack of association between these sequences and spider mite mitotypes suggests repeated host switching of Cardinium. Overall, our results reveal a complex community of symbionts in this system, opening the path for future studies.
DNA barcoding of human-biting black flies (Diptera: Simuliidae) in Thailand.

PubMed

Pramual, Pairot; Thaijarern, Jiraporn; Wongpakam, Komgrit

2016-12-01

Black flies (Diptera: Simuliidae) are important insect vectors and pests of humans and animals. Accurate identification, therefore, is important for control and management. In this study, we used mitochondrial cytochrome oxidase I (COI) barcoding sequences to test the efficiency of species identification for the human-biting black flies in Thailand. We used human-biting specimens because they enabled us to link information with previous studies involving the immature stages. Three black fly taxa, Simulium nodosum, S. nigrogilvum and S. doipuiense complex, were collected. The S. doipuiense complex was confirmed for the first time as having human-biting habits. The COI sequences revealed considerable genetic diversity in all three species. Comparisons to a COI sequence library of black flies in Thailand and in a public database indicated a high efficiency for specimen identification for S. nodosum and S. nigrogilvum, but this method was not successful for the S. doipuiense complex. Phylogenetic analyses revealed two divergent lineages in the S. doipuiense complex. Human-biting specimens formed a separate clade from other members of this complex. The results are consistent with the Barcoding Index Number System (BINs) analysis that found six BINs in the S. doipuiense complex. Further taxonomic work is needed to clarify the species status of these human-biting specimens. Copyright © 2016 Elsevier B.V. All rights reserved.
Revealing the transcriptomic complexity of switchgrass by PacBio long-read sequencing.

PubMed

Zuo, Chunman; Blow, Matthew; Sreedasyam, Avinash; Kuo, Rita C; Ramamoorthy, Govindarajan Kunde; Torres-Jerez, Ivone; Li, Guifen; Wang, Mei; Dilworth, David; Barry, Kerrie; Udvardi, Michael; Schmutz, Jeremy; Tang, Yuhong; Xu, Ying

2018-01-01

Switchgrass ( Panicum virgatum L.) is an important bioenergy crop widely used for lignocellulosic research. While extensive transcriptomic analyses have been conducted on this species using short read-based sequencing techniques, very little has been reliably derived regarding alternatively spliced (AS) transcripts. We present an analysis of transcriptomes of six switchgrass tissue types pooled together, sequenced using Pacific Biosciences (PacBio) single-molecular long-read technology. Our analysis identified 105,419 unique transcripts covering 43,570 known genes and 8795 previously unknown genes. 45,168 are novel transcripts of known genes. A total of 60,096 AS transcripts are identified, 45,628 being novel. We have also predicted 1549 transcripts of genes involved in cell wall construction and remodeling, 639 being novel transcripts of known cell wall genes. Most of the predicted transcripts are validated against Illumina-based short reads. Specifically, 96% of the splice junction sites in all the unique transcripts are validated by at least five Illumina reads. Comparisons between genes derived from our identified transcripts and the current genome annotation revealed that among the gene set predicted by both analyses, 16,640 have different exon-intron structures. Overall, substantial amount of new information is derived from the PacBio RNA data regarding both the transcriptome and the genome of switchgrass.
Maize endosperm secretes a novel antifungal protein into adjacent maternal tissue.

PubMed

Serna, A; Maitz, M; O'Connell, T; Santandrea, G; Thevissen, K; Tienens, K; Hueros, G; Faleri, C; Cai, G; Lottspeich, F; Thompson, R D

2001-03-01

A series of endosperm transfer layer-specific transcripts has been identified in maize by differential screening of a cDNA library of transcripts at 10 days after pollination. Sequence comparisons revealed among this class of cDNAs a novel, small gene family of highly diverged sequences encoding basal layer antifungal proteins (BAPs). The bap genes mapped to two loci on chromosomes 4 and 10. So far, bap-homologous sequences have been detected only in maize, teosinte and sorghum, and are not present in grasses outside the Andropogoneae tribe. BAP2 is synthesized as a pre-proprotein, and is processed by successive removal of a signal peptide and a 29-residue prodomain. The proprotein can be detected exclusively in microsomal membrane-containing fractions of kernel extracts. Immunolocalization reveals BAP2 to be predominantly located in the placentochalazal cells of the pedicel, adjacent to the basal endosperm transfer layer (BETL) cells, although the BAP2 transcript is found only in the BETL cells. The biological roles of BAP2 propeptide and mature peptide have been investigated by heterologous expression of the proprotein in Escherichia coli, and by tests of its fungistatic activity and that of the fully processed form in vitro. The mature BAP2 peptide exhibits potent broad-range activity against a range of filamentous fungi, including several plant pathogens.
Metagenomic binning of a marine sponge microbiome reveals unity in defense but metabolic specialization.

PubMed

Slaby, Beate M; Hackl, Thomas; Horn, Hannes; Bayer, Kristina; Hentschel, Ute

2017-11-01

Marine sponges are ancient metazoans that are populated by distinct and highly diverse microbial communities. In order to obtain deeper insights into the functional gene repertoire of the Mediterranean sponge Aplysina aerophoba, we combined Illumina short-read and PacBio long-read sequencing followed by un-targeted metagenomic binning. We identified a total of 37 high-quality bins representing 11 bacterial phyla and two candidate phyla. Statistical comparison of symbiont genomes with selected reference genomes revealed a significant enrichment of genes related to bacterial defense (restriction-modification systems, toxin-antitoxin systems) as well as genes involved in host colonization and extracellular matrix utilization in sponge symbionts. A within-symbionts genome comparison revealed a nutritional specialization of at least two symbiont guilds, where one appears to metabolize carnitine and the other sulfated polysaccharides, both of which are abundant molecules in the sponge extracellular matrix. A third guild of symbionts may be viewed as nutritional generalists that perform largely the same metabolic pathways but lack such extraordinary numbers of the relevant genes. This study characterizes the genomic repertoire of sponge symbionts at an unprecedented resolution and it provides greater insights into the molecular mechanisms underlying microbial-sponge symbiosis.
Definition and Analysis of a System for the Automated Comparison of Curriculum Sequencing Algorithms in Adaptive Distance Learning

ERIC Educational Resources Information Center

Limongelli, Carla; Sciarrone, Filippo; Temperini, Marco; Vaste, Giulia

2011-01-01

LS-Lab provides automatic support to comparison/evaluation of the Learning Object Sequences produced by different Curriculum Sequencing Algorithms. Through this framework a teacher can verify the correspondence between the behaviour of different sequencing algorithms and her pedagogical preferences. In fact the teacher can compare algorithms…
Sorting processes with energy-constrained comparisons*

NASA Astrophysics Data System (ADS)

Geissmann, Barbara; Penna, Paolo

2018-05-01

We study very simple sorting algorithms based on a probabilistic comparator model. In this model, errors in comparing two elements are due to (1) the energy or effort put in the comparison and (2) the difference between the compared elements. Such algorithms repeatedly compare and swap pairs of randomly chosen elements, and they correspond to natural Markovian processes. The study of these Markov chains reveals an interesting phenomenon. Namely, in several cases, the algorithm that repeatedly compares only adjacent elements is better than the one making arbitrary comparisons: in the long-run, the former algorithm produces sequences that are "better sorted". The analysis of the underlying Markov chain poses interesting questions as the latter algorithm yields a nonreversible chain, and therefore its stationary distribution seems difficult to calculate explicitly. We nevertheless provide bounds on the stationary distributions and on the mixing time of these processes in several restrictions.
Finding functional features in Saccharomyces genomes by phylogenetic footprinting.

PubMed

Cliften, Paul; Sudarsanam, Priya; Desikan, Ashwin; Fulton, Lucinda; Fulton, Bob; Majors, John; Waterston, Robert; Cohen, Barak A; Johnston, Mark

2003-07-04

The sifting and winnowing of DNA sequence that occur during evolution cause nonfunctional sequences to diverge, leaving phylogenetic footprints of functional sequence elements in comparisons of genome sequences. We searched for such footprints among the genome sequences of six Saccharomyces species and identified potentially functional sequences. Comparison of these sequences allowed us to revise the catalog of yeast genes and identify sequence motifs that may be targets of transcriptional regulatory proteins. Some of these conserved sequence motifs reside upstream of genes with similar functional annotations or similar expression patterns or those bound by the same transcription factor and are thus good candidates for functional regulatory sequences.
Characterization of the telomere complex, TERF1 and TERF2 genes in muntjac species with fusion karyotypes

DOE Office of Scientific and Technical Information (OSTI.GOV)

Hartmann, Nils; Scherthan, Harry

The telomere binding proteins TRF1 and TRF2 maintain and protect chromosome ends and confer karyotypic stability. Chromosome evolution in the genus Muntiacus is characterized by numerous tandem (end-to-end) fusions. To study TRF1 and TRF2 telomere binding proteins in Muntiacus species, we isolated and characterized the TERF1 and -2 genes from Indian muntjac (Muntiacus muntjak vaginalis; 2n = 6 female) and from Chinese muntjac (Muntiacus reveesi; 2n = 46). Expression analysis revealed that both genes are ubiquitously expressed and sequence analysis identified several transcript variants of both TERF genes. Control experiments disclosed a novel testis-specific splice variant of TERF1 in humanmore » testes. Amino acid sequence comparisons demonstrate that Muntiacus TRF1 and in particular TRF2 are highly conserved between muntjac and human. In vivo TRF2-GFP and immuno-staining studies in muntjac cell lines revealed telomeric TRF2 localization, while deletion of the DNA binding domain abrogated this localization, suggesting muntjac TRF2 represents a functional telomere protein. Finally, expression analysis of a set of telomere-related genes revealed their presence in muntjac fibroblasts and testis tissue, which suggests the presence of a conserved telomere complex in muntjacs. However, a deviation from the common theme was noted for the TERT gene, encoding the catalytic subunit of telomerase; TERT expression could not be detected in Indian or Chinese muntjac cDNA or genomic DNA using a series of conserved primers, while TRAP assay revealed functional telomerase in Chinese muntjac testis tissues. This suggests muntjacs may harbor a diverged telomerase sequence.« less

Sequence variations of the human MPDZ gene and association with alcoholism in subjects with European ancestry.

PubMed

Karpyak, Victor M; Kim, Jeong-Hyun; Biernacka, Joanna M; Wieben, Eric D; Mrazek, David A; Black, John L; Choi, Doo-Sup

2009-04-01

Mpdz gene variations are known contributors of acute alcohol withdrawal severity and seizures in mice. To investigate the relevance of these findings for human alcoholism, we resequenced 46 exons, exon-intron boundaries, and 2 kilobases in the 5' region of the human MPDZ gene in 61 subjects with a history of alcohol withdrawal seizures (AWS), 59 subjects with a history of alcohol withdrawal without AWS, and 64 Coriell samples from self-reported nonalcoholic subjects [all European American (EA) ancestry] and compared with the Mpdz sequences of 3 mouse strains with different propensity to AWS. To explore potential associations of the human MPDZ gene with alcoholism and AWS, single SNP and haplotype analyses were performed using 13 common variants. Sixty-seven new, mostly rare variants were discovered in the human MPDZ gene. Sequence comparison revealed that the human gene does not have variations identical to those comprising Mpdz gene haplotype associated with AWS in mice. We also found no significant association between MPDZ haplotypes and AWS in humans. However, a global test of haplotype association revealed a significant difference in haplotype frequencies between alcohol-dependent subjects without AWS and Coriell controls (p = 0.015), suggesting a potential role of MPDZ in alcoholism and/or related phenotypes other than AWS. Haplotype-specific tests for the most common haplotypes (frequency > 0.05), revealed a specific high-risk haplotype (p = 0.006, maximum statistic p = 0.051), containing rs13297480G allele also found to be significantly more prevalent in alcoholics without AWS compared with nonalcoholic Coriell subjects (p = 0.019). Sequencing of MPDZ gene in individuals with EA ancestry revealed no variations in the sites identical to those associated with AWS in mice. Exploratory haplotype and single SNP association analyses suggest a possible association between the MPDZ gene and alcohol dependence but not AWS. Further functional genomic analysis of MPDZ variants and investigation of their association with a broader array of alcoholism-related phenotypes could reveal additional genetic markers of alcoholism.
The mitochondrial genomes of Amphiascoides atopus and Schizopera knabeni (Harpacticoida: Miraciidae) reveal similarities between the copepod orders Harpacticoida and Poecilostomatoida.

PubMed

Easton, Erin E; Darrow, Emily M; Spears, Trisha; Thistle, David

2014-03-15

Members of subclass Copepoda are abundant, diverse, and-as a result of their variety of ecological roles in marine and freshwater environments-important, but their phylogenetic interrelationships are unclear. Recent studies of arthropods have used gene arrangements in the mitochondrial (mt) genome to infer phylogenies, but for copepods, only seven complete mt genomes have been published. These data revealed several within-order and few among-order similarities. To increase the data available for comparisons, we sequenced the complete mt genome (13,831base pairs) of Amphiascoides atopus and 10,649base pairs of the mt genome of Schizopera knabeni (both in the family Miraciidae of the order Harpacticoida). Comparison of our data to those for Tigriopus japonicus (family Harpacticidae, order Harpacticoida) revealed similarities in gene arrangement among these three species that were consistent with those found within and among families of other copepod orders. Comparison of the mt genomes of our species with those known from other copepod orders revealed the arrangement of mt genes of our Harpacticoida species to be more similar to that of Sinergasilus polycolpus (order Poecilostomatoida) than to that of T. japonicus. The similarities between S. polycolpus and our species are the first to be noted across the boundaries of copepod orders and support the possibility that mt-gene arrangement might be used to infer copepod phylogenies. We also found that our two species had extremely truncated transfer RNAs and that gene overlaps occurred much more frequently than has been reported for other copepod mt genomes. Published by Elsevier B.V.
The draft genome of the transgenic tropical fruit tree papaya (Carica papaya Linnaeus)

PubMed Central

Ming, Ray; Hou, Shaobin; Feng, Yun; Yu, Qingyi; Dionne-Laporte, Alexandre; Saw, Jimmy H.; Senin, Pavel; Wang, Wei; Ly, Benjamin V.; Lewis, Kanako L. T.; Salzberg, Steven L.; Feng, Lu; Jones, Meghan R.; Skelton, Rachel L.; Murray, Jan E.; Chen, Cuixia; Qian, Wubin; Shen, Junguo; Du, Peng; Eustice, Moriah; Tong, Eric; Tang, Haibao; Lyons, Eric; Paull, Robert E.; Michael, Todd P.; Wall, Kerr; Rice, Danny W.; Albert, Henrik; Wang, Ming-Li; Zhu, Yun J.; Schatz, Michael; Nagarajan, Niranjan; Acob, Ricelle A.; Guan, Peizhu; Blas, Andrea; Wai, Ching Man; Ackerman, Christine M.; Ren, Yan; Liu, Chao; Wang, Jianmei; Wang, Jianping; Na, Jong-Kuk; Shakirov, Eugene V.; Haas, Brian; Thimmapuram, Jyothi; Nelson, David; Wang, Xiyin; Bowers, John E.; Gschwend, Andrea R.; Delcher, Arthur L.; Singh, Ratnesh; Suzuki, Jon Y.; Tripathi, Savarni; Neupane, Kabi; Wei, Hairong; Irikura, Beth; Paidi, Maya; Jiang, Ning; Zhang, Wenli; Presting, Gernot; Windsor, Aaron; Navajas-Pérez, Rafael; Torres, Manuel J.; Feltus, F. Alex; Porter, Brad; Li, Yingjun; Burroughs, A. Max; Luo, Ming-Cheng; Liu, Lei; Christopher, David A.; Mount, Stephen M.; Moore, Paul H.; Sugimura, Tak; Jiang, Jiming; Schuler, Mary A.; Friedman, Vikki; Mitchell-Olds, Thomas; Shippen, Dorothy E.; dePamphilis, Claude W.; Palmer, Jeffrey D.; Freeling, Michael; Paterson, Andrew H.; Gonsalves, Dennis; Wang, Lei; Alam, Maqsudul

2010-01-01

Papaya, a fruit crop cultivated in tropical and subtropical regions, is known for its nutritional benefits and medicinal applications. Here we report a 3× draft genome sequence of ‘SunUp’ papaya, the first commercial virus-resistant transgenic fruit tree1 to be sequenced. The papaya genome is three times the size of the Arabidopsis genome, but contains fewer genes, including significantly fewer disease-resistance gene analogues. Comparison of the five sequenced genomes suggests a minimal angiosperm gene set of 13,311. A lack of recent genome duplication, atypical of other angiosperm genomes sequenced so far2–5, may account for the smaller papaya gene number in most functional groups. Nonetheless, striking amplifications in gene number within particular functional groups suggest roles in the evolution of tree-like habit, deposition and remobilization of starch reserves, attraction of seed dispersal agents, and adaptation to tropical daylengths. Transgenesis at three locations is closely associated with chloroplast insertions into the nuclear genome, and with topoisomerase I recognition sites. Papaya offers numerous advantages as a system for fruit-tree functional genomics, and this draft genome sequence provides the foundation for revealing the basis of Carica's distinguishing morpho-physiological, medicinal and nutritional properties. PMID:18432245
The draft genome of the transgenic tropical fruit tree papaya (Carica papaya Linnaeus).

PubMed

Ming, Ray; Hou, Shaobin; Feng, Yun; Yu, Qingyi; Dionne-Laporte, Alexandre; Saw, Jimmy H; Senin, Pavel; Wang, Wei; Ly, Benjamin V; Lewis, Kanako L T; Salzberg, Steven L; Feng, Lu; Jones, Meghan R; Skelton, Rachel L; Murray, Jan E; Chen, Cuixia; Qian, Wubin; Shen, Junguo; Du, Peng; Eustice, Moriah; Tong, Eric; Tang, Haibao; Lyons, Eric; Paull, Robert E; Michael, Todd P; Wall, Kerr; Rice, Danny W; Albert, Henrik; Wang, Ming-Li; Zhu, Yun J; Schatz, Michael; Nagarajan, Niranjan; Acob, Ricelle A; Guan, Peizhu; Blas, Andrea; Wai, Ching Man; Ackerman, Christine M; Ren, Yan; Liu, Chao; Wang, Jianmei; Wang, Jianping; Na, Jong-Kuk; Shakirov, Eugene V; Haas, Brian; Thimmapuram, Jyothi; Nelson, David; Wang, Xiyin; Bowers, John E; Gschwend, Andrea R; Delcher, Arthur L; Singh, Ratnesh; Suzuki, Jon Y; Tripathi, Savarni; Neupane, Kabi; Wei, Hairong; Irikura, Beth; Paidi, Maya; Jiang, Ning; Zhang, Wenli; Presting, Gernot; Windsor, Aaron; Navajas-Pérez, Rafael; Torres, Manuel J; Feltus, F Alex; Porter, Brad; Li, Yingjun; Burroughs, A Max; Luo, Ming-Cheng; Liu, Lei; Christopher, David A; Mount, Stephen M; Moore, Paul H; Sugimura, Tak; Jiang, Jiming; Schuler, Mary A; Friedman, Vikki; Mitchell-Olds, Thomas; Shippen, Dorothy E; dePamphilis, Claude W; Palmer, Jeffrey D; Freeling, Michael; Paterson, Andrew H; Gonsalves, Dennis; Wang, Lei; Alam, Maqsudul

2008-04-24

Papaya, a fruit crop cultivated in tropical and subtropical regions, is known for its nutritional benefits and medicinal applications. Here we report a 3x draft genome sequence of 'SunUp' papaya, the first commercial virus-resistant transgenic fruit tree to be sequenced. The papaya genome is three times the size of the Arabidopsis genome, but contains fewer genes, including significantly fewer disease-resistance gene analogues. Comparison of the five sequenced genomes suggests a minimal angiosperm gene set of 13,311. A lack of recent genome duplication, atypical of other angiosperm genomes sequenced so far, may account for the smaller papaya gene number in most functional groups. Nonetheless, striking amplifications in gene number within particular functional groups suggest roles in the evolution of tree-like habit, deposition and remobilization of starch reserves, attraction of seed dispersal agents, and adaptation to tropical daylengths. Transgenesis at three locations is closely associated with chloroplast insertions into the nuclear genome, and with topoisomerase I recognition sites. Papaya offers numerous advantages as a system for fruit-tree functional genomics, and this draft genome sequence provides the foundation for revealing the basis of Carica's distinguishing morpho-physiological, medicinal and nutritional properties.
Biosynthesis of Lipoic Acid in Arabidopsis: Cloning and Characterization of the cDNA for Lipoic Acid Synthase1

PubMed Central

Yasuno, Rie; Wada, Hajime

1998-01-01

Lipoic acid is a coenzyme that is essential for the activity of enzyme complexes such as those of pyruvate dehydrogenase and glycine decarboxylase. We report here the isolation and characterization of LIP1 cDNA for lipoic acid synthase of Arabidopsis. The Arabidopsis LIP1 cDNA was isolated using an expressed sequence tag homologous to the lipoic acid synthase of Escherichia coli. This cDNA was shown to code for Arabidopsis lipoic acid synthase by its ability to complement a lipA mutant of E. coli defective in lipoic acid synthase. DNA-sequence analysis of the LIP1 cDNA revealed an open reading frame predicting a protein of 374 amino acids. Comparisons of the deduced amino acid sequence with those of E. coli and yeast lipoic acid synthase homologs showed a high degree of sequence similarity and the presence of a leader sequence presumably required for import into the mitochondria. Southern-hybridization analysis suggested that LIP1 is a single-copy gene in Arabidopsis. Western analysis with an antibody against lipoic acid synthase demonstrated that this enzyme is located in the mitochondrial compartment in Arabidopsis cells as a 43-kD polypeptide. PMID:9808738
Genetic Identification of Orientobilharzia turkestanicum from Sheep Isolates in Iran.

PubMed

Tabaripour, Reza; Youssefi, Mohammad Reza; Tabaripour, Rabeeh

2015-01-01

Adult worms of Orientobilharzia turkestanicum live in the portal veins, or intestinal veins of cattle, sheep, goat and many other mammals causing orientobilharziasis. Orientobilharziasis causes significant economic losses to livestock industry of Iran. However, there is limited information about genotypes of O. turkestanicum in Iran. In this study, 30 isolates of O. turkestanicum obtained from sheep were characterized by sequencing mitochondrial cytochrome c oxidase subunit 1 (cox1) and nicotinamide adenine dinucleotide dehydrogenase subunit 1 (nad1) gene. The mitochondrial cox1 and nad1 DNA were amplified by polymerase chain reaction (PCR) and then sequenced and compared with O. turkestanicum and that of other members of the Schistosomatidae available in Gen-Bank(™). Phylogenetic relationships between them were re-constructed using the maximum parsimony method. Phylogenetic analyses done in present study placed O. turkestanicum within the Schistosoma genus, and indicates that O. turkestanicum was phylogenetically closer to the African schistosome group than to the Asian schistosome group. Comparison of nad1 and cox1 sequences of O. turkestanicum obtained in this study with corresponding sequences available in Genbank(™) revealed some sequence variations and provided evidence for presence of microvarients in Iran.
Glutamate cysteine ligase (GCL) in the freshwater bivalve Unio tumidus: impact of storage conditions and seasons on activity and identification of partial coding sequence of the catalytic subunit.

PubMed

Coffinet, Stéphanie; Cossu-Leguille, Carole; Rodius, François; Vasseur, Paule

2008-09-01

Glutamate cysteine ligase (GCL; EC 6.3.2.2) is the first enzyme involved in the synthesis of glutathione. A HPLC method with fluorimetric detection was used to measure GCL activity in the gills and the digestive gland of the freshwater bivalve, Unio tumidus. Storage conditions were optimized in order to prevent decrease of GCL activity and consisted in freezing the cytosolic fraction in the presence of protease (1 mM phenylmethylsulfonic fluoric acid) and gamma-glutamyltranspeptidase (1 mM L-serine borate mixture and 0.5 mM acivicin) inhibitors. Seasonal variations of activity in the digestive gland and to a lesser extent in the gills were found with activity increasing in spring compared to winter. No sex differences were revealed. The GCL coding sequence was identified using degenerated primers designed in the highly conserved regions of the catalytic subunit of GCL. The partial sequence identified encoded for 121 amino acids. The comparison of the identified partial coding sequence of U. tumidus with those available from vertebrates and invertebrates indicated that GCL sequence was highly conserved.
'Candidatus Rickettsia mendelii', a novel basal group rickettsia detected in Ixodes ricinus ticks in the Czech Republic.

PubMed

Hajduskova, Eva; Literak, Ivan; Papousek, Ivo; Costa, Francisco B; Novakova, Marketa; Labruna, Marcelo B; Zdrazilova-Dubska, Lenka

2016-04-01

A novel rickettsial sequence in the citrate synthase gltA gene indicating a novel Rickettsia species has been detected in 7 out of 4524 Ixodes ricinus ticks examined within several surveys performed in the Czech Republic from 2005 to 2009. This new Candidatus Rickettsia sp. sequence has been found in 2 nymphs feeding on wild birds (Luscinia megarhynchos and Erithacus rubecula), in a male tick from vegetation, and 4 ticks feeding on a dog (3 males, 1 female tick). Portions of the ompA, ompB, sca4, and htrA genes were not amplifiable in these samples. A maximum likelihood tree of rickettsiae based on comparisons of partial amino acid sequences of citrate synthase and nucleotide sequences of 16S rDNA genes and phylogenetic analysis revealed a basal position of the novel species in the proximity of R. bellii and R. canadensis. The novel species has been named 'Candidatus Rickettsia mendelii' after the founder of genetics, Gregor Mendel. Copyright © 2016 Elsevier GmbH. All rights reserved.
Analysis of the ergosterol biosynthesis pathway cloning, molecular characterization and phylogeny of lanosterol 14 α-demethylase (ERG11) gene of Moniliophthora perniciosa.

PubMed

de Oliveira Ceita, Geruza; Vilas-Boas, Laurival Antônio; Castilho, Marcelo Santos; Carazzolle, Marcelo Falsarella; Pirovani, Carlos Priminho; Selbach-Schnadelbach, Alessandra; Gramacho, Karina Peres; Ramos, Pablo Ivan Pereira; Barbosa, Luciana Veiga; Pereira, Gonçalo Amarante Guimarães; Góes-Neto, Aristóteles

2014-10-01

The phytopathogenic fungus Moniliophthora perniciosa (Stahel) Aime & Philips-Mora, causal agent of witches' broom disease of cocoa, causes countless damage to cocoa production in Brazil. Molecular studies have attempted to identify genes that play important roles in fungal survival and virulence. In this study, sequences deposited in the M. perniciosa Genome Sequencing Project database were analyzed to identify potential biological targets. For the first time, the ergosterol biosynthetic pathway in M. perniciosa was studied and the lanosterol 14α-demethylase gene (ERG11) that encodes the main enzyme of this pathway and is a target for fungicides was cloned, characterized molecularly and its phylogeny analyzed. ERG11 genomic DNA and cDNA were characterized and sequence analysis of the ERG11 protein identified highly conserved domains typical of this enzyme, such as SRS1, SRS4, EXXR and the heme-binding region (HBR). Comparison of the protein sequences and phylogenetic analysis revealed that the M. perniciosa enzyme was most closely related to that of Coprinopsis cinerea.
Analysis of the ergosterol biosynthesis pathway cloning, molecular characterization and phylogeny of lanosterol 14 α-demethylase (ERG11) gene of Moniliophthora perniciosa

PubMed Central

de Oliveira Ceita, Geruza; Vilas-Boas, Laurival Antônio; Castilho, Marcelo Santos; Carazzolle, Marcelo Falsarella; Pirovani, Carlos Priminho; Selbach-Schnadelbach, Alessandra; Gramacho, Karina Peres; Ramos, Pablo Ivan Pereira; Barbosa, Luciana Veiga; Pereira, Gonçalo Amarante Guimarães; Góes-Neto, Aristóteles

2014-01-01

The phytopathogenic fungus Moniliophthora perniciosa (Stahel) Aime & Philips-Mora, causal agent of witches’ broom disease of cocoa, causes countless damage to cocoa production in Brazil. Molecular studies have attempted to identify genes that play important roles in fungal survival and virulence. In this study, sequences deposited in the M. perniciosa Genome Sequencing Project database were analyzed to identify potential biological targets. For the first time, the ergosterol biosynthetic pathway in M. perniciosa was studied and the lanosterol 14α-demethylase gene (ERG11) that encodes the main enzyme of this pathway and is a target for fungicides was cloned, characterized molecularly and its phylogeny analyzed. ERG11 genomic DNA and cDNA were characterized and sequence analysis of the ERG11 protein identified highly conserved domains typical of this enzyme, such as SRS1, SRS4, EXXR and the heme-binding region (HBR). Comparison of the protein sequences and phylogenetic analysis revealed that the M. perniciosa enzyme was most closely related to that of Coprinopsis cinerea. PMID:25505843
The flaA locus of Bacillus subtilis is part of a large operon coding for flagellar structures, motility functions, and an ATPase-like polypeptide.

PubMed Central

Albertini, A M; Caramori, T; Crabb, W D; Scoffone, F; Galizzi, A

1991-01-01

We cloned and sequenced 8.3 kb of Bacillus subtilis DNA corresponding to the flaA locus involved in flagellar biosynthesis, motility, and chemotaxis. The DNA sequence revealed the presence of 10 complete and 2 incomplete open reading frames. Comparison of the deduced amino acid sequences to data banks showed similarities of nine of the deduced products to a number of proteins of Escherichia coli and Salmonella typhimurium for which a role in flagellar functioning has been directly demonstrated. In particular, the sequence data suggest that the flaA operon codes for the M-ring protein, components of the motor switch, and the distal part of the basal-body rod. The gene order is remarkably similar to that described for region III of the enterobacterial flagellar regulon. One of the open reading frames was translated into a protein with 48% amino acid identity to S. typhimurium FliI and 29% identity to the beta subunit of E. coli ATP synthase. PMID:1828465
Extensive sequencing of seven human genomes to characterize benchmark reference materials

PubMed Central

Zook, Justin M.; Catoe, David; McDaniel, Jennifer; Vang, Lindsay; Spies, Noah; Sidow, Arend; Weng, Ziming; Liu, Yuling; Mason, Christopher E.; Alexander, Noah; Henaff, Elizabeth; McIntyre, Alexa B.R.; Chandramohan, Dhruva; Chen, Feng; Jaeger, Erich; Moshrefi, Ali; Pham, Khoa; Stedman, William; Liang, Tiffany; Saghbini, Michael; Dzakula, Zeljko; Hastie, Alex; Cao, Han; Deikus, Gintaras; Schadt, Eric; Sebra, Robert; Bashir, Ali; Truty, Rebecca M.; Chang, Christopher C.; Gulbahce, Natali; Zhao, Keyan; Ghosh, Srinka; Hyland, Fiona; Fu, Yutao; Chaisson, Mark; Xiao, Chunlin; Trow, Jonathan; Sherry, Stephen T.; Zaranek, Alexander W.; Ball, Madeleine; Bobe, Jason; Estep, Preston; Church, George M.; Marks, Patrick; Kyriazopoulou-Panagiotopoulou, Sofia; Zheng, Grace X.Y.; Schnall-Levin, Michael; Ordonez, Heather S.; Mudivarti, Patrice A.; Giorda, Kristina; Sheng, Ying; Rypdal, Karoline Bjarnesdatter; Salit, Marc

2016-01-01

The Genome in a Bottle Consortium, hosted by the National Institute of Standards and Technology (NIST) is creating reference materials and data for human genome sequencing, as well as methods for genome comparison and benchmarking. Here, we describe a large, diverse set of sequencing data for seven human genomes; five are current or candidate NIST Reference Materials. The pilot genome, NA12878, has been released as NIST RM 8398. We also describe data from two Personal Genome Project trios, one of Ashkenazim Jewish ancestry and one of Chinese ancestry. The data come from 12 technologies: BioNano Genomics, Complete Genomics paired-end and LFR, Ion Proton exome, Oxford Nanopore, Pacific Biosciences, SOLiD, 10X Genomics GemCode WGS, and Illumina exome and WGS paired-end, mate-pair, and synthetic long reads. Cell lines, DNA, and data from these individuals are publicly available. Therefore, we expect these data to be useful for revealing novel information about the human genome and improving sequencing technologies, SNP, indel, and structural variant calling, and de novo assembly. PMID:27271295
Complete genome sequence of "Thiodictyon syntrophicum" sp. nov. strain Cad16T, a photolithoautotrophic purple sulfur bacterium isolated from the alpine meromictic Lake Cadagno.

PubMed

Luedin, Samuel M; Pothier, Joël F; Danza, Francesco; Storelli, Nicola; Frigaard, Niels-Ulrik; Wittwer, Matthias; Tonolla, Mauro

2018-01-01

" Thiodictyon syntrophicum" sp. nov. strain Cad16 T is a photoautotrophic purple sulfur bacterium belonging to the family of Chromatiaceae in the class of Gammaproteobacteria . The type strain Cad16 T was isolated from the chemocline of the alpine meromictic Lake Cadagno in Switzerland. Strain Cad16 T represents a key species within this sulfur-driven bacterial ecosystem with respect to carbon fixation. The 7.74-Mbp genome of strain Cad16 T has been sequenced and annotated. It encodes 6237 predicted protein sequences and 59 RNA sequences. Phylogenetic comparison based on 16S rRNA revealed that Thiodictyon elegans strain DSM 232 T the most closely related species. Genes involved in sulfur oxidation, central carbon metabolism and transmembrane transport were found. Noteworthy, clusters of genes encoding the photosynthetic machinery and pigment biosynthesis are found on the 0.48 Mb plasmid pTs485. We provide a detailed insight into the Cad16 T genome and analyze it in the context of the microbial ecosystem of Lake Cadagno.
Occurrence of dsRNA Mycovirus (LeV-FMRI0339) in the Edible Mushroom Lentinula edodes and Meiotic Stability of LeV-FMRI0339 among Monokaryotic Progeny

PubMed Central

Kim, Jung-Mi; Yun, Suk-Hyun; Park, Seung-Moon; Ko, Han-Gyu; Kim, Dae-Hyuk

2013-01-01

dsRNA was found in malformed cultures of Lentinula edodes strain FMRI0339, one of the three most popular sawdust cultivated commercial strains of shiitake, and was also found in healthy-looking fruiting bodies and actively growing mycelia. Cloning of the partial genome of the dsRNA revealed the presence of the RdRp sequence of a novel L. edodes mycovirus (LeV), and sequence comparison of the cloned amplicon showed identical sequences sequence to known RNA-dependent RNA polymerase genes of LeV found in strain HKA. The meiotic stability of dsRNA was examined by measuring the ratio of the presence of dsRNA among sexual monokaryotic progeny. More than 40% of the monokaryotic progeny still contained the dsRNA, indicating the persistence of dsRNA during sexual reproduction. Comparing the mycelia growth of monokaryotic progeny suggested that there appeared to be a tendency toward a lower frequency of virus incidence in actively growing progeny. PMID:25288977
Nonneutral mitochondrial DNA variation in humans and chimpanzees

DOE Office of Scientific and Technical Information (OSTI.GOV)

Nachman, M.W.; Aquadro, C.F.; Brown, W.M.

1996-03-01

We sequenced the NADH dehydrogenase subunit 3 (ND3) gene from a sample of 61 humans, five common chimpanzees, and one gorilla to test whether patterns of mitochondrial DNA (mtDNA) variation are consistent with a neutral model of molecular evolution. Within humans and within chimpanzees, the ratio of replacement to silent nucleotide substitutions was higher than observed in comparisons between species, contrary to neutral expectations. To test the generality of this result, we reanalyzed published human RFLP data from the entire mitochondrial genome. Gains of restriction sites relative to a known human mtDNA sequence were used to infer unambiguous nucleotide substitutions.more » We also compared the complete mtDNA sequences of three humans. Both the RFLP data and the sequence data reveal a higher ratio of replacement to silent nucleotide substitutions within humans than is seen between species. This pattern is observed at most or all human mitochondrial genes and is inconsistent with a strictly neutral model. These data suggest that many mitochondrial protein polymorphisms are slightly deleterious, consistent with studies of human mitochondrial diseases. 59 refs., 2 figs., 8 tabs.« less
Analysis of the genome sequence of the pathogenic Muscovy duck parvovirus strain YY reveals a 14-nucleotide-pair deletion in the inverted terminal repeats.

PubMed

Wang, Jianye; Huang, Yu; Zhou, Mingxu; Zhu, Guoqiang

2016-09-01

Genomic information about Muscovy duck parvovirus is still limited. In this study, the genome of the pathogenic MDPV strain YY was sequenced. The full-length genome of YY is 5075 nucleotides (nt) long, 57 nt shorter than that of strain FM. Sequence alignment indicates that the 5' and 3' inverted terminal repeats (ITR) of strain YY contain a 14-nucleotide-pair deletion in the stem of the palindromic hairpin structure in comparison to strain FM and FZ91-30. The deleted region contains one "E-box" site and one repeated motif with the sequence "TTCCGGT" or "ACCGGAA". Phylogenetic trees constructed based the protein coding genes concordantly showed that YY, together with nine other MDPV isolates from various places, clustered in a separate branch, distinct from the branch formed by goose parvovirus (GPV) strains. These results demonstrate that, despite the distinctive deletion, the YY strain still belongs to the classical MDPV group. Moreover, the deletion of ITR may contribute to the genome evolution of MDPV under immunization pressure.
Comparison of topological clustering within protein networks using edge metrics that evaluate full sequence, full structure, and active site microenvironment similarity.

PubMed

Leuthaeuser, Janelle B; Knutson, Stacy T; Kumar, Kiran; Babbitt, Patricia C; Fetrow, Jacquelyn S

2015-09-01

The development of accurate protein function annotation methods has emerged as a major unsolved biological problem. Protein similarity networks, one approach to function annotation via annotation transfer, group proteins into similarity-based clusters. An underlying assumption is that the edge metric used to identify such clusters correlates with functional information. In this contribution, this assumption is evaluated by observing topologies in similarity networks using three different edge metrics: sequence (BLAST), structure (TM-Align), and active site similarity (active site profiling, implemented in DASP). Network topologies for four well-studied protein superfamilies (enolase, peroxiredoxin (Prx), glutathione transferase (GST), and crotonase) were compared with curated functional hierarchies and structure. As expected, network topology differs, depending on edge metric; comparison of topologies provides valuable information on structure/function relationships. Subnetworks based on active site similarity correlate with known functional hierarchies at a single edge threshold more often than sequence- or structure-based networks. Sequence- and structure-based networks are useful for identifying sequence and domain similarities and differences; therefore, it is important to consider the clustering goal before deciding appropriate edge metric. Further, conserved active site residues identified in enolase and GST active site subnetworks correspond with published functionally important residues. Extension of this analysis yields predictions of functionally determinant residues for GST subgroups. These results support the hypothesis that active site similarity-based networks reveal clusters that share functional details and lay the foundation for capturing functionally relevant hierarchies using an approach that is both automatable and can deliver greater precision in function annotation than current similarity-based methods. © 2015 The Authors Protein Science published by Wiley Periodicals, Inc. on behalf of The Protein Society.
Phylogenetic structure of Leishmania tropica in the new endemic focus Birjand in East Iran in comparison to other Iranian endemic regions.

PubMed

Karamian, Mehdi; Kuhls, Katrin; Hemmati, Mina; Ghatee, Mohammad Amin

2016-06-01

Iran has been identified being among the countries with the highest number of cutaneous leishmaniasis (CL) cases. South Khorasan province in East Iran is an emerging focus of CL. Species identification of sixty clinical samples by ITS1 PCR-RFLP presented evidence for the dominance of Leishmania tropica (90%) in this region. Analysis of the ITS1 sequence of 19 L. tropica isolates revealed seven closely related sequence types. In addition, ITS1 sequences available in GenBank from other Iranian regions were compiled for comparison with the studied isolates. Iranian L. tropica was distributed in two main clusters. All East Iranian sequence types were grouped with strains from foci from Southeast and Central regions in cluster A, showing highly similar sequences. The highest similarity was observed between most L. tropica from East and all isolates from Southeast regions and from Savojbolagh county in Central Iran. Southwest L. tropica was shown to be paraphyletic as the isolates were distributed in both clusters A and B. All Northeastern L. tropica were part of cluster B, however they showed significant heterogeneity and were distributed in different subclusters. Distribution of L. tropica populations was to some extent congruent with genetic lineages of Phlebotomus sergenti in Iran and may be an evidence for parasite-vector co-evolution. Southeast-East L. tropica was also similar to strains from Herat province in Afghanistan at the East border of Iran. This is the first comprehensive study on population structure of L. tropica in Iran that provides a guideline for appropriate sampling for further molecular based epidemiological studies. Copyright © 2016 Elsevier B.V. All rights reserved.
Comparison of topological clustering within protein networks using edge metrics that evaluate full sequence, full structure, and active site microenvironment similarity

PubMed Central

Leuthaeuser, Janelle B; Knutson, Stacy T; Kumar, Kiran; Babbitt, Patricia C; Fetrow, Jacquelyn S

2015-01-01

The development of accurate protein function annotation methods has emerged as a major unsolved biological problem. Protein similarity networks, one approach to function annotation via annotation transfer, group proteins into similarity-based clusters. An underlying assumption is that the edge metric used to identify such clusters correlates with functional information. In this contribution, this assumption is evaluated by observing topologies in similarity networks using three different edge metrics: sequence (BLAST), structure (TM-Align), and active site similarity (active site profiling, implemented in DASP). Network topologies for four well-studied protein superfamilies (enolase, peroxiredoxin (Prx), glutathione transferase (GST), and crotonase) were compared with curated functional hierarchies and structure. As expected, network topology differs, depending on edge metric; comparison of topologies provides valuable information on structure/function relationships. Subnetworks based on active site similarity correlate with known functional hierarchies at a single edge threshold more often than sequence- or structure-based networks. Sequence- and structure-based networks are useful for identifying sequence and domain similarities and differences; therefore, it is important to consider the clustering goal before deciding appropriate edge metric. Further, conserved active site residues identified in enolase and GST active site subnetworks correspond with published functionally important residues. Extension of this analysis yields predictions of functionally determinant residues for GST subgroups. These results support the hypothesis that active site similarity-based networks reveal clusters that share functional details and lay the foundation for capturing functionally relevant hierarchies using an approach that is both automatable and can deliver greater precision in function annotation than current similarity-based methods. PMID:26073648
Characterization of perch rhabdovirus (PRV) in farmed grayling Thymallus thymallus.

PubMed

Gadd, Tuija; Viljamaa-Dirks, Satu; Holopainen, Riikka; Koski, Perttu; Jakava-Viljanen, Miia

2013-10-11

Two Finnish fish farms experienced elevated mortality rates in farmed grayling Thymallus thymallus fry during the summer months, most typically in July. The mortalities occurred during several years and were connected with a few neurological disorders and peritonitis. Virological investigation detected an infection with an unknown rhabdovirus. Based on the entire glycoprotein (G) and partial RNA polymerase (L) gene sequences, the virus was classified as a perch rhabdovirus (PRV). Pairwise comparisons of the G and L gene regions of grayling isolates revealed that all isolates were very closely related, with 99 to 100% nucleotide identity, which suggests the same origin of infection. Phylogenetic analysis demonstrated that they were closely related to the strain isolated from perch Perca fluviatilis and sea trout Salmo trutta trutta caught from the Baltic Sea. The entire G gene sequences revealed that all Finnish grayling isolates, and both the perch and sea trout isolates, were most closely related to a PRV isolated in France in 2004. According to the partial L gene sequences, all of the Finnish grayling isolates were most closely related to the Danish isolate DK5533 from pike. The genetic analysis of entire G gene and partial L gene sequences showed that the Finnish brown trout isolate ka907_87 shared only approximately 67 and 78% identity, respectively, with our grayling isolates. The grayling isolates were also analysed by an immunofluorescence antibody test. This is the first report of a PRV causing disease in grayling in Finland.

Complete genome sequence of mumps viruses isolated from patients with parotitis, pancreatitis and encephalitis in India.

PubMed

Vaidya, Sunil R; Chowdhury, Deepika T; Jadhav, Santoshkumar M; Hamde, Venkat S

2016-04-01

Limited information is available regarding epidemiology of mumps in India. Mumps vaccine is not included in the Universal Immunization Program of India. The complete genome sequences of Indian mumps virus (MuV) isolates are not available, hence this study was performed. Five isolates from bilateral parotitis and pancreatitis patients from Maharashtra, a MuV isolate from unilateral parotitis patient from Tamil Nadu, and a MuV isolate from encephalitis patient from Uttar Pradesh were genotyped by the standard protocol of the World Health Organization and subsequently complete genomes were sequenced. Indian MuV genomes were compared with published MuV genomes, including reference genotypes and eight vaccine strains for the genetic differences. The SH gene analysis revealed that five MuV isolates belonged to genotype C and two belonged to genotype G strains. The percent nucleotide divergence (PND) was 1.1% amongst five MuV genotype C strains and 2.2% amongst two MuV genotype G strains. A comparison with widely used mumps Jeryl Lynn vaccine strain revealed that Indian mumps isolates had 54, 54, 53, 49, 49, 38, and 49 amino acid substitutions in Chennai-2012, Kushinagar-2013, Pune-2008, Osmanabad-2012a, Osmanabad-2012b, Pune-1986 and Pune-2012, respectively. This study reports the complete genome sequences of Indian MuV strains obtained in years 1986, 2008, 2012 and 2013 that may be useful for further studies in India and globally. Copyright © 2016 Elsevier B.V. All rights reserved.
Quantifying the Number of Independent Organelle DNA Insertions in Genome Evolution and Human Health

PubMed Central

Martin, William F.

2017-01-01

Fragments of organelle genomes are often found as insertions in nuclear DNA. These fragments of mitochondrial DNA (numts) and plastid DNA (nupts) are ubiquitous components of eukaryotic genomes. They are, however, often edited out during the genome assembly process, leading to systematic underestimation of their frequency. Numts and nupts, once inserted, can become further fragmented through subsequent insertion of mobile elements or other recombinational events that disrupt the continuity of the inserted sequence relative to the genuine organelle DNA copy. Because numts and nupts are typically identified through sequence comparison tools such as BLAST, disruption of insertions into smaller fragments can lead to systematic overestimation of numt and nupt frequencies. Accurate identification of numts and nupts is important, however, both for better understanding of their role during evolution, and for monitoring their increasingly evident role in human disease. Human populations are polymorphic for 141 numt loci, five numts are causal to genetic disease, and cancer genomic studies are revealing an abundance of numts associated with tumor progression. Here, we report investigation of salient parameters involved in obtaining accurate estimates of numt and nupt numbers in genome sequence data. Numts and nupts from 44 sequenced eukaryotic genomes reveal lineage-specific differences in the number, relative age and frequency of insertional events as well as lineage-specific dynamics of their postinsertional fragmentation. Our findings outline the main technical parameters influencing accurate identification and frequency estimation of numts in genomic studies pertinent to both evolution and human health. PMID:28444372
Massively parallel sequencing of forensic STRs: Considerations of the DNA commission of the International Society for Forensic Genetics (ISFG) on minimal nomenclature requirements.

PubMed

Parson, Walther; Ballard, David; Budowle, Bruce; Butler, John M; Gettings, Katherine B; Gill, Peter; Gusmão, Leonor; Hares, Douglas R; Irwin, Jodi A; King, Jonathan L; Knijff, Peter de; Morling, Niels; Prinz, Mechthild; Schneider, Peter M; Neste, Christophe Van; Willuweit, Sascha; Phillips, Christopher

2016-05-01

The DNA Commission of the International Society for Forensic Genetics (ISFG) is reviewing factors that need to be considered ahead of the adoption by the forensic community of short tandem repeat (STR) genotyping by massively parallel sequencing (MPS) technologies. MPS produces sequence data that provide a precise description of the repeat allele structure of a STR marker and variants that may reside in the flanking areas of the repeat region. When a STR contains a complex arrangement of repeat motifs, the level of genetic polymorphism revealed by the sequence data can increase substantially. As repeat structures can be complex and include substitutions, insertions, deletions, variable tandem repeat arrangements of multiple nucleotide motifs, and flanking region SNPs, established capillary electrophoresis (CE) allele descriptions must be supplemented by a new system of STR allele nomenclature, which retains backward compatibility with the CE data that currently populate national DNA databases and that will continue to be produced for the coming years. Thus, there is a pressing need to produce a standardized framework for describing complex sequences that enable comparison with currently used repeat allele nomenclature derived from conventional CE systems. It is important to discern three levels of information in hierarchical order (i) the sequence, (ii) the alignment, and (iii) the nomenclature of STR sequence data. We propose a sequence (text) string format the minimal requirement of data storage that laboratories should follow when adopting MPS of STRs. We further discuss the variant annotation and sequence comparison framework necessary to maintain compatibility among established and future data. This system must be easy to use and interpret by the DNA specialist, based on a universally accessible genome assembly, and in place before the uptake of MPS by the general forensic community starts to generate sequence data on a large scale. While the established nomenclature for CE-based STR analysis will remain unchanged in the future, the nomenclature of sequence-based STR genotypes will need to follow updated rules and be generated by expert systems that translate MPS sequences to match CE conventions in order to guarantee compatibility between the different generations of STR data. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Mitochondrial D-loop analysis for uncovering the population structure and genetic diversity among the indigenous duck (Anas platyrhynchos) populations of India.

PubMed

Gaur, Uma; Tantia, Madhu Sudan; Mishra, Bina; Bharani Kumar, Settypalli Tirumala; Vijh, Ramesh Kumar; Chaudhury, Ashok

2018-03-01

The indigenous domestic duck (Anas platyrhynchos domestica) which is domesticated from Mallard (Anas platyrhynchos) contributes significantly to poor farming community in coastal and North Eastern regions of India. For conservation and maintenance of indigenous duck populations it is very important to know the existing genetic diversity and population structure. To unravel the population structure and genetic diversity among the five indigenous duck populations of India, the mitochondrial D-loop sequences of 120 ducks were analyzed. The sequence analysis by comparison of mtDNA D-loop region (470 bp) of five Indian duck populations revealed 25 mitochondrial haplotypes. Pairwise F ST value among populations was 0.4243 (p < .01) and the range of nucleotide substitution per site (Dxy) between the five Indian duck populations was 0.00034-0.00555, and the net divergence (Da) was 0-0.00355. The phylogenetic analysis in the present study unveiled three clades. The analysis revealed genetic continuity among ducks of coastal region of the country which formed a separate group from the ducks of the inland area. Both coastal as well as the land birds revealed introgression of the out group breed Khaki Campbell, which is used for breed improvement programs in India. The observations revealed very less selection and a single matrilineal lineage of indigenous domestic ducks.
Characterization of the Agrobacterium vitis pehA gene and comparison of the encoded polygalacturonase with the homologous enzymes from Erwinia carotovora and Ralstonia solanacearum.

PubMed Central

Herlache, T C; Hotchkiss, A T; Burr, T J; Collmer, A

1997-01-01

DNA sequencing of the Agrobacterium vitis pehA gene revealed a predicted protein with an M(r) of 58,000 and significant similarity to the polygalacturonases of two other plant pathogens, Erwinia carotovora and Ralstonia (= Pseudomonas or Burkholderia) solanacearum. Sequencing of the N terminus of the PehA protein demonstrated cleavage of a 34-amino-acid signal peptide from pre-PehA. Mature PehA accumulated primarily in the periplasm of A. vitis and pehA+ Escherichia coli cells during exponential growth. A. vitis PehA released dimers, trimers, and monomers from polygalacturonic acid and caused less electrolyte leakage from potato tuber tissue than did the E. carotovora and R. solanacearum polygalacturonases. PMID:8979363
Comparative genomic analysis of three Leishmania species that cause diverse human disease

PubMed Central

Peacock, Christopher S; Seeger, Kathy; Harris, David; Murphy, Lee; Ruiz, Jeronimo C; Quail, Michael A; Peters, Nick; Adlem, Ellen; Tivey, Adrian; Aslett, Martin; Kerhornou, Arnaud; Ivens, Alasdair; Fraser, Audrey; Rajandream, Marie-Adele; Carver, Tim; Norbertczak, Halina; Chillingworth, Tracey; Hance, Zahra; Jagels, Kay; Moule, Sharon; Ormond, Doug; Rutter, Simon; Squares, Rob; Whitehead, Sally; Rabbinowitsch, Ester; Arrowsmith, Claire; White, Brian; Thurston, Scott; Bringaud, Frédéric; Baldauf, Sandra L; Faulconbridge, Adam; Jeffares, Daniel; Depledge, Daniel P; Oyola, Samuel O; Hilley, James D; Brito, Loislene O; Tosi, Luiz R O; Barrell, Barclay; Cruz, Angela K; Mottram, Jeremy C; Smith, Deborah F; Berriman, Matthew

2008-01-01

Leishmania parasites cause a broad spectrum of clinical disease. Here we report the sequencing of the genomes of two species of Leishmania: Leishmania infantum and Leishmania braziliensis. The comparison of these sequences with the published genome of Leishmania major reveals marked conservation of synteny and identifies only ∼200 genes with a differential distribution between the three species. L. braziliensis, contrary to Leishmania species examined so far, possesses components of a putative RNA-mediated interference pathway, telomere-associated transposable elements and spliced leader–associated SLACS retrotransposons. We show that pseudogene formation and gene loss are the principal forces shaping the different genomes. Genes that are differentially distributed between the species encode proteins implicated in host-pathogen interactions and parasite survival in the macrophage. PMID:17572675
Isolation and characterization of cDNAs encoding wheat 3-hydroxy-3-methylglutaryl coenzyme A reductase.

PubMed Central

Aoyagi, K; Beyou, A; Moon, K; Fang, L; Ulrich, T

1993-01-01

The enzyme 3-hydroxy-3-methylglutaryl coenzyme A reductase (HMGR, EC 1.1.1.34) is a key enzyme in the isoprenoid biosynthetic pathway. We have isolated partial cDNAs from wheat (Triticum aestivum) using the polymerase chain reaction. Comparison of deduced amino acid sequences of these cDNAs shows that they represent a small family of genes that share a high degree of sequence homology among themselves as well as among genes from other organisms including tomato, Arabidopsis, hamster, human, Drosophila, and yeast. Southern blot analysis reveals the presence of at least four genes. Our results concerning the tissue-specific expression as well as developmental regulation of these HMGR cDNAs highlight the important role of this enzyme in the growth and development of wheat. PMID:8108513
The complete mitochondrial genome of domestic sheep, Ovis aries.

PubMed

Hu, Xiao-di; Gao, Li-zhi

2016-01-01

In this study, we report a complete mitochondrial (mt) genome sequence of the Texel ewe, Ovis aries. The total genome is 16,615 bp in length and its overall base composition was estimated to be 33.68% for A, 27.36% for T, 25.86% for C, and 13.10% for G indicating an AT-rich (61.04%) feature in the O. aries mtgenome. It contains a total of 13 protein-coding genes, 22 transfer RNA genes, 2 ribosomal RNA genes and a control region (D-loop region). Comparisons with other publicly available sheep mitogenomes revealed a bunch of nucleotide diversity. This complete mitgenome sequence would enlarge useful genomic information for further studies on sheep evolution and domestication that will enhance germplasm conservation and breeding programs of O. aries.
Progesterone receptor isoforms in the mammary gland of cats and dogs.

PubMed

Gracanin, A; de Gier, J; Zegers, K; Bominaar, M; Rutteman, G R; Schaefers-Okkens, A C; Kooistra, H S; Mol, J A

2012-12-01

Progesterone exerts its effect by binding to specific progesterone receptors (PR) within the cell. In dogs and cats, no data are available on PR isoforms as found in other species. We therefore investigated the sequence of the PR gene and encoded protein in dogs and cats, the expression of PR isoforms in mammary tissue using Western blots and the presence of PR in mammary tissue using immunohistochemistry. Comparison of the amino acid sequence of the canine and feline PR with human PR revealed major differences in the PR-B-specific upstream segment (BUS). However, the essential activation function 3 (AF3) domain was intact in the cat but mutated in the dog. The DNA and ligand-binding domains were highly similar among the species. In cats with fibroadenomatous hyperplasia (FAH), high expression of PR mRNA together with growth hormone (GH), GH receptor (GHR) and IGF-I mRNA was found in comparison with feline mammary carcinomas. Immunohistochemical analysis showed strong nuclear as well as cytoplasmic staining for PR in FAH. Western blot analysis revealed expression of the PR-A and PR-B isoforms in the feline mammary gland. In canine mammary tissue, the most abundant PR staining was found in proliferative zones of the mammary gland. Western blot analyses showed mainly staining for PR-A with lower PR-B staining. It is concluded that in dogs and cats both PR isoforms are expressed. The role of mutations found in the canine PR-B is discussed. © 2012 Blackwell Verlag GmbH.
Genome sequencing and analysis of a highly virulent Vibrio parahaemolyticus strain isolated from the marine environment

NASA Astrophysics Data System (ADS)

Parks, M. C.; Moreno, E.

2016-02-01

Vibrio parahaemolyticus [Vp] is a Gram-negative bacterium and a natural inhabitant of coastal marine ecosystems worldwide. Vp is also a coincidental pathogen of humans. Virulent strains are commonly identified by the presence of the thermostable direct (tdh) or tdh-related (trh) hemolysin genes. However, virulence is multifaceted and many clinical Vp isolates do not carry tdh or trh. In this study, we sequenced and assembled the draft genome of a tdh- and trh-negative environmental isolate (805) shown previously to be highly virulent in zebrafish. To investigate potential mechanisms of virulence, we compared 805 to the clinical V. parahaemolyticus type strain (RIMD2210633). Pairwise comparison revealed the presence of multiple genomic regions including an IncF conjugative pilus (1.3 Kb) and a colicin V plasmid (1.49 Kb). These features are homologous to genomic regions present in clinical V. vulnificus and V. cholerae strains. Genome comparison also revealed the presence of five toxin-antitoxin systems. Isolate 805 likely attained these new features through the lateral acquisition of mobile genomic material - a hypothesis supported by the aberrant GC content of these regions. Colicin V plasmids are a diverse group of IncF plasmids found in invasive bacterial strains. Similarly, an abundance of toxin-antitoxin systems have been linked to virulence in Gram-negative bacteria. Current efforts are focused on characterizing 142 coding features present in 805 but absent from the type strain.
Comparison of Spinach Sex Chromosomes with Sugar Beet Autosomes Reveals Extensive Synteny and Low Recombination at the Male-Determining Locus.

PubMed

Takahata, Satoshi; Yago, Takumi; Iwabuchi, Keisuke; Hirakawa, Hideki; Suzuki, Yutaka; Onodera, Yasuyuki

2016-01-01

Spinach (Spinacia oleracea, 2n = 12) and sugar beet (Beta vulgaris, 2n = 18) are important crop members of the family Chenopodiaceae ss Sugar beet has a basic chromosome number of 9 and a cosexual breeding system, as do most members of the Chenopodiaceae ss. family. By contrast, spinach has a basic chromosome number of 6 and, although certain cultivars and genotypes produce monoecious plants, is considered to be a dioecious species. The loci determining male and monoecious sexual expression were mapped to different loci on the spinach sex chromosomes. In this study, a linkage map with 46 mapped protein-coding sequences was constructed for the spinach sex chromosomes. Comparison of the linkage map with a reference genome sequence of sugar beet revealed that the spinach sex chromosomes exhibited extensive synteny with sugar beet chromosomes 4 and 9. Tightly linked protein-coding genes linked to the male-determining locus in spinach corresponded to genes located in or around the putative pericentromeric and centromeric regions of sugar beet chromosomes 4 and 9, supporting the observation that recombination rates were low in the vicinity of the male-determining locus. The locus for monoecism was confined to a chromosomal segment corresponding to a region of approximately 1.7Mb on sugar beet chromosome 9, which may facilitate future positional cloning of the locus. © The American Genetic Association 2016. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Conserved Noncoding Elements in the Most Distant Genera of Cephalochordates: The Goldilocks Principle

PubMed Central

Yue, Jia-Xing; Kozmikova, Iryna; Ono, Hiroki; Nossa, Carlos W.; Kozmik, Zbynek; Putnam, Nicholas H.; Yu, Jr-Kai; Holland, Linda Z.

2016-01-01

Cephalochordates, the sister group of vertebrates + tunicates, are evolving particularly slowly. Therefore, genome comparisons between two congeners of Branchiostoma revealed so many conserved noncoding elements (CNEs), that it was not clear how many are functional regulatory elements. To more effectively identify CNEs with potential regulatory functions, we compared noncoding sequences of genomes of the most phylogenetically distant cephalochordate genera, Asymmetron and Branchiostoma, which diverged approximately 120–160 million years ago. We found 113,070 noncoding elements conserved between the two species, amounting to 3.3% of the genome. The genomic distribution, target gene ontology, and enriched motifs of these CNEs all suggest that many of them are probably cis-regulatory elements. More than 90% of previously verified amphioxus regulatory elements were re-captured in this study. A search of the cephalochordate CNEs around 50 developmental genes in several vertebrate genomes revealed eight CNEs conserved between cephalochordates and vertebrates, indicating sequence conservation over >500 million years of divergence. The function of five CNEs was tested in reporter assays in zebrafish, and one was also tested in amphioxus. All five CNEs proved to be tissue-specific enhancers. Taken together, these findings indicate that even though Branchiostoma and Asymmetron are distantly related, as they are evolving slowly, comparisons between them are likely optimal for identifying most of their tissue-specific cis-regulatory elements laying the foundation for functional characterizations and a better understanding of the evolution of developmental regulation in cephalochordates. PMID:27412606
Recognition of Yeast Species from Gene Sequence Comparisons

USDA-ARS?s Scientific Manuscript database

This review discusses recognition of yeast species from gene sequence comparisons, which have been responsible for doubling the number of known yeasts over the past decade. The resolution provided by various single gene sequences is examined for both ascomycetous and basidiomycetous species, and th...
Molecular characterisation of Atlantic salmon paramyxovirus (ASPV): A novel paramyxovirus associated with proliferative gill inflammation

USGS Publications Warehouse

Falk, K.; Batts, W.N.; Kvellestad, A.; Kurath, G.; Wiik-Nielsen, J.; Winton, J.R.

2008-01-01

Atlantic salmon paramyxovirus (ASPV) was isolated in 1995 from gills of farmed Atlantic salmon suffering from proliferative gill inflammation. The complete genome sequence of ASPV was determined, revealing a genome 16,968 nucleotides in length consisting of six non-overlapping genes coding for the nucleo- (N), phospho- (P), matrix- (M), fusion- (F), haemagglutinin-neuraminidase- (HN) and large polymerase (L) proteins in the order 3???-N-P-M-F-HN-L-5???. The various conserved features related to virus replication found in most paramyxoviruses were also found in ASPV. These include: conserved and complementary leader and trailer sequences, tri-nucleotide intergenic regions and highly conserved transcription start and stop signal sequences. The P gene expression strategy of ASPV was like that of the respiro-, morbilli- and henipaviruses, which express the P and C proteins from the primary transcript and edit a portion of the mRNA to encode V and W proteins. Sequence similarities among various features related to virus replication, pairwise comparisons of all deduced ASPV protein sequences with homologous regions from other members of the family Paramyxoviridae, and phylogenetic analyses of these amino acid sequences suggested that ASPV was a novel member of the sub-family Paramyxovirinae, most closely related to the respiroviruses. ?? 2008 Elsevier B.V. All rights reserved.
A sequence-based survey of the complex structural organization of tumor genomes

DOE Office of Scientific and Technical Information (OSTI.GOV)

Collins, Colin; Raphael, Benjamin J.; Volik, Stanislav

2008-04-03

The genomes of many epithelial tumors exhibit extensive chromosomal rearrangements. All classes of genome rearrangements can be identified using End Sequencing Profiling (ESP), which relies on paired-end sequencing of cloned tumor genomes. In this study, brain, breast, ovary and prostate tumors along with three breast cancer cell lines were surveyed with ESP yielding the largest available collection of sequence-ready tumor genome breakpoints and providing evidence that some rearrangements may be recurrent. Sequencing and fluorescence in situ hybridization (FISH) confirmed translocations and complex tumor genome structures that include coamplification and packaging of disparate genomic loci with associated molecular heterogeneity. Comparison ofmore » the tumor genomes suggests recurrent rearrangements. Some are likely to be novel structural polymorphisms, whereas others may be bona fide somatic rearrangements. A recurrent fusion transcript in breast tumors and a constitutional fusion transcript resulting from a segmental duplication were identified. Analysis of end sequences for single nucleotide polymorphisms (SNPs) revealed candidate somatic mutations and an elevated rate of novel SNPs in an ovarian tumor. These results suggest that the genomes of many epithelial tumors may be far more dynamic and complex than previously appreciated and that genomic fusions including fusion transcripts and proteins may be common, possibly yielding tumor-specific biomarkers and therapeutic targets.« less
Draft sequencing and comparative genomics of Xylella fastidiosa strains reveal novel biological insights.

PubMed

Bhattacharyya, Anamitra; Stilwagen, Stephanie; Reznik, Gary; Feil, Helene; Feil, William S; Anderson, Iain; Bernal, Axel; D'Souza, Mark; Ivanova, Natalia; Kapatral, Vinayak; Larsen, Niels; Los, Tamara; Lykidis, Athanasios; Selkov, Eugene; Walunas, Theresa L; Purcell, Alexander; Edwards, Rob A; Hawkins, Trevor; Haselkorn, Robert; Overbeek, Ross; Kyrpides, Nikos C; Predki, Paul F

2002-10-01

Draft sequencing is a rapid and efficient method for determining the near-complete sequence of microbial genomes. Here we report a comparative analysis of one complete and two draft genome sequences of the phytopathogenic bacterium, Xylella fastidiosa, which causes serious disease in plants, including citrus, almond, and oleander. We present highlights of an in silico analysis based on a comparison of reconstructions of core biological subsystems. Cellular pathway reconstructions have been used to identify a small number of genes, which are likely to reside within the draft genomes but are not captured in the draft assembly. These represented only a small fraction of all genes and were predominantly large and small ribosomal subunit protein components. By using this approach, some of the inherent limitations of draft sequence can be significantly reduced. Despite the incomplete nature of the draft genomes, it is possible to identify several phage-related genes, which appear to be absent from the draft genomes and not the result of insufficient sequence sampling. This region may therefore identify potential host-specific functions. Based on this first functional reconstruction of a phytopathogenic microbe, we spotlight an unusual respiration machinery as a potential target for biological control. We also predicted and developed a new defined growth medium for Xylella.
Mitochondrial genome sequences and comparative genomics ofPhytophthora ramorum and P. sojae

DOE Office of Scientific and Technical Information (OSTI.GOV)

Martin, Frank N.; Douda, Bensasson; Tyler, Brett M.

The complete sequences of the mitochondrial genomes of theoomycetes of Phytophthora ramorum and P. sojae were determined during thecourse of their complete nuclear genome sequencing (Tyler, et al. 2006).Both are circular, with sizes of 39,314 bp for P. ramorum and 42,975 bpfor P. sojae. Each contains a total of 37 identifiable protein-encodinggenes, 25 or 26 tRNAs (P. sojae and P. ramorum, respectively)specifying19 amino acids, and a variable number of ORFs (7 for P. ramorum and 12for P. sojae) which are potentially additional functional genes.Non-coding regions comprise approximately 11.5 percent and 18.4 percentof the genomes of P. ramorum and P. sojae,more » respectively. Relative to P.sojae, there is an inverted repeat of 1,150 bp in P. ramorum thatincludes an unassigned unique ORF, a tRNA gene, and adjacent non-codingsequences, but otherwise the gene order in both species is identical.Comparisons of these genomes with published sequences of the P. infestansmitochondrial genome reveals a number of similarities, but the gene orderin P. infestans differs in two adjacent locations due to inversions.Sequence alignments of the three genomes indicated sequence conservationranging from 75 to 85 percent and that specific regions were morevariable than others.« less
High-resolution motion compensated MRA in patients with congenital heart disease using extracellular contrast agent at 3 Tesla

PubMed Central

2012-01-01

Background Using first-pass MRA (FP-MRA) spatial resolution is limited by breath-hold duration. In addition, image quality may be hampered by respiratory and cardiac motion artefacts. In order to overcome these limitations an ECG- and navigator-gated high-resolution-MRA sequence (HR-MRA) with slow infusion of extracellular contrast agent was implemented at 3 Tesla for the assessment of congenital heart disease and compared to standard first-pass-MRA (FP-MRA). Methods 34 patients (median age: 13 years) with congenital heart disease (CHD) were prospectively examined on a 3 Tesla system. The CMR-protocol comprised functional imaging, FP- and HR-MRA, and viability imaging. After the acquisition of the FP-MRA sequence using a single dose of extracellular contrast agent the motion compensated HR-MRA sequence with isotropic resolution was acquired while injecting the second single dose, utilizing the timeframe before viability imaging. Qualitative scores for image quality (two independent reviewers) as well as quantitative measurements of vessel sharpness and relative contrast were compared using the Wilcoxon signed-rank test. Quantitative measurements of vessel diameters were compared using the Bland-Altman test. Results The mean image quality score revealed significantly better image quality of the HR-MRA sequence compared to the FP-MRA sequence in all vessels of interest (ascending aorta (AA), left pulmonary artery (LPA), left superior pulmonary vein (LSPV), coronary sinus (CS), and coronary ostia (CO); all p < 0.0001). In comparison to FP-MRA, HR-MRA revealed significantly better vessel sharpness for all considered vessels (AA, LSPV and LPA; all p < 0.0001). The relative contrast of the HR-MRA sequence was less compared to the FP-MRA sequence (AA: p <0.028, main pulmonary artery: p <0.004, LSPV: p <0.005). Both, the results of the intra- and interobserver measurements of the vessel diameters revealed closer correlation and closer 95 % limits of agreement for the HR-MRA. HR-MRA revealed one additional clinical finding, missed by FP-MRA. Conclusions An ECG- and navigator-gated HR-MRA-protocol with infusion of extracellular contrast agent at 3 Tesla is feasible. HR-MRA delivers significantly better image quality and vessel sharpness compared to FP-MRA. It may be integrated into a standard CMR-protocol for patients with CHD without the need for additional contrast agent injection and without any additional examination time. PMID:23107424
The limits of protein sequence comparison?

PubMed Central

Pearson, William R; Sierk, Michael L

2010-01-01

Modern sequence alignment algorithms are used routinely to identify homologous proteins, proteins that share a common ancestor. Homologous proteins always share similar structures and often have similar functions. Over the past 20 years, sequence comparison has become both more sensitive, largely because of profile-based methods, and more reliable, because of more accurate statistical estimates. As sequence and structure databases become larger, and comparison methods become more powerful, reliable statistical estimates will become even more important for distinguishing similarities that are due to homology from those that are due to analogy (convergence). The newest sequence alignment methods are more sensitive than older methods, but more accurate statistical estimates are needed for their full power to be realized. PMID:15919194
Comparative Genome and Proteome Analysis of Anopheles gambiae and Drosophila melanogaster

NASA Astrophysics Data System (ADS)

Zdobnov, Evgeny M.; von Mering, Christian; Letunic, Ivica; Torrents, David; Suyama, Mikita; Copley, Richard R.; Christophides, George K.; Thomasova, Dana; Holt, Robert A.; Subramanian, G. Mani; Mueller, Hans-Michael; Dimopoulos, George; Law, John H.; Wells, Michael A.; Birney, Ewan; Charlab, Rosane; Halpern, Aaron L.; Kokoza, Elena; Kraft, Cheryl L.; Lai, Zhongwu; Lewis, Suzanna; Louis, Christos; Barillas-Mury, Carolina; Nusskern, Deborah; Rubin, Gerald M.; Salzberg, Steven L.; Sutton, Granger G.; Topalis, Pantelis; Wides, Ron; Wincker, Patrick; Yandell, Mark; Collins, Frank H.; Ribeiro, Jose; Gelbart, William M.; Kafatos, Fotis C.; Bork, Peer

2002-10-01

Comparison of the genomes and proteomes of the two diptera Anopheles gambiae and Drosophila melanogaster, which diverged about 250 million years ago, reveals considerable similarities. However, numerous differences are also observed; some of these must reflect the selection and subsequent adaptation associated with different ecologies and life strategies. Almost half of the genes in both genomes are interpreted as orthologs and show an average sequence identity of about 56%, which is slightly lower than that observed between the orthologs of the pufferfish and human (diverged about 450 million years ago). This indicates that these two insects diverged considerably faster than vertebrates. Aligned sequences reveal that orthologous genes have retained only half of their intron/exon structure, indicating that intron gains or losses have occurred at a rate of about one per gene per 125 million years. Chromosomal arms exhibit significant remnants of homology between the two species, although only 34% of the genes colocalize in small ``microsyntenic'' clusters, and major interarm transfers as well as intra-arm shuffling of gene order are detected.

Phylogenetic Analysis of Canine Parvovirus VP2 Gene in China.

PubMed

Yi, L; Tong, M; Cheng, Y; Song, W; Cheng, S

2016-04-01

In this study, a total of 37 samples (58.0%) were found through PCR assay to be positive for canine parvovirus (CPV) of 66 suspected faecal samples of dogs collected from various cities throughout China. Eight CPV isolates could be obtained in the CRFK cell line. The sequencing of the VP2 gene of CPV identified the predominant CPV strain as CPV-2a (Ser297Ala), with two CPV-2b (Ser297Ala). Sequence comparison revealed homologies of 99.3-99.9%, 99.9% and 99.3-99.7% within the CPV 2a isolates, within the CPV 2b isolates and between the CPV 2a and 2b isolates, respectively. In addition, several non-synonymous and synonymous mutations were also recorded. The phylogenetic tree revealed that most of the CPV strains from different areas in China were located in the formation of a large branch, which were grouped together along with the KU143-09 strain from Thailand and followed the same evolution. In this study, we provide an updated molecular characterization of CPV 2 circulation in China. © 2014 Blackwell Verlag GmbH.
Identification of an Astrovirus Commonly Infecting Laboratory Mice in the US and Japan

PubMed Central

Ng, Terry Fei Fan; Kondov, Nikola O.; Hayashimoto, Nobuhito; Uchida, Ritsuki; Cha, Yunhee; Beyer, Ashley I.; Wong, Walt; Pesavento, Patricia A.; Suemizu, Hiroshi; Muench, Marcus O.; Delwart, Eric

2013-01-01

Mice (Mus musculus) are the most commonly used laboratory animals. Viral metagenomics on tissues of immunodeficient mice revealed sequences of a novel mammalian astrovirus. Using PCR, we screened mice from 4 breeders, 4 pharmaceutical companies, 14 research institutes and 30 universities in the US and Japan. Mice from one US breeder tested positive while none from Japanese breeders were positive for MuAstV. Mice in over half of the universities (19/30), institutes (7/14) and pharmaceutical animal facilities (2/4) investigated revealed the presence of MuAstV. Nine mice strains tested positive including both immunodeficient strains (NSG, NOD-SCID, NSG-3GS, C57BL6-Timp-3 −/−, and uPA-NOG) and immunocompetent strains (B6J, ICR, Bash2, BALB/c). Our data indicates that MuAstV has a wide geographical, institutional and host strain distribution. Comparison of the MuAstV RdRp sequences showed numerous mutations indicating ongoing viral divergence in different facilities. This study demonstrates the need for metagenomic screening of laboratory animals to identify adventitious infections that may affect experimental outcomes. PMID:23825590
Evidence of Divergent Amino Acid Usage in Comparative Analyses of R5- and X4-Associated HIV-1 Vpr Sequences

PubMed Central

Antell, Gregory C.; Zhong, Wen; Kercher, Katherine; Passic, Shendra; Williams, Jean; Liu, Yucheng; James, Tony; Jacobson, Jeffrey M.; Szep, Zsofia

2017-01-01

Vpr is an HIV-1 accessory protein that plays numerous roles during viral replication, and some of which are cell type dependent. To test the hypothesis that HIV-1 tropism extends beyond the envelope into the vpr gene, studies were performed to identify the associations between coreceptor usage and Vpr variation in HIV-1-infected patients. Colinear HIV-1 Env-V3 and Vpr amino acid sequences were obtained from the LANL HIV-1 sequence database and from well-suppressed patients in the Drexel/Temple Medicine CNS AIDS Research and Eradication Study (CARES) Cohort. Genotypic classification of Env-V3 sequences as X4 (CXCR4-utilizing) or R5 (CCR5-utilizing) was used to group colinear Vpr sequences. To reveal the sequences associated with a specific coreceptor usage genotype, Vpr amino acid sequences were assessed for amino acid diversity and Jensen-Shannon divergence between the two groups. Five amino acid alphabets were used to comprehensively examine the impact of amino acid substitutions involving side chains with similar physiochemical properties. Positions 36, 37, 41, 89, and 96 of Vpr were characterized by statistically significant divergence across multiple alphabets when X4 and R5 sequence groups were compared. In addition, consensus amino acid switches were found at positions 37 and 41 in comparisons of the R5 and X4 sequence populations. These results suggest an evolutionary link between Vpr and gp120 in HIV-1-infected patients. PMID:28620613
Sequencing of individual chromosomes of plant pathogenic Fusarium oxysporum.

PubMed

Kashiwa, Takeshi; Kozaki, Toshinori; Ishii, Kazuo; Turgeon, B Gillian; Teraoka, Tohru; Komatsu, Ken; Arie, Tsutomu

2017-01-01

A small chromosome in reference isolate 4287 of F. oxysporum f. sp. lycopersici (Fol) has been designated as a 'pathogenicity chromosome' because it carries several pathogenicity related genes such as the Secreted In Xylem (SIX) genes. Sequence assembly of small chromosomes in other isolates, based on a reference genome template, is difficult because of karyotype variation among isolates and a high number of sequences associated with transposable elements. These factors often result in misassembly of sequences, making it unclear whether other isolates possess the same pathogenicity chromosome harboring SIX genes as in the reference isolate. To overcome this difficulty, single chromosome sequencing after Contour-clamped Homogeneous Electric Field (CHEF) separation of chromosomes was performed, followed by de novo assembly of sequences. The assembled sequences of individual chromosomes were consistent with results of probing gels of CHEF separated chromosomes with SIX genes. Individual chromosome sequencing revealed that several SIX genes are located on a single small chromosome in two pathogenic forms of F. oxysporum, beyond the reference isolate 4287, and in the cabbage yellows fungus F. oxysporum f. sp. conglutinans. The particular combination of SIX genes on each small chromosome varied. Moreover, not all SIX genes were found on small chromosomes; depending on the isolate, some were on big chromosomes. This suggests that recombination of chromosomes and/or translocation of SIX genes may occur frequently. Our method improves sequence comparison of small chromosomes among isolates. Copyright © 2016 Elsevier Inc. All rights reserved.
RIKEN Integrated Sequence Analysis (RISA) System—384-Format Sequencing Pipeline with 384 Multicapillary Sequencer

PubMed Central

Shibata, Kazuhiro; Itoh, Masayoshi; Aizawa, Katsunori; Nagaoka, Sumiharu; Sasaki, Nobuya; Carninci, Piero; Konno, Hideaki; Akiyama, Junichi; Nishi, Katsuo; Kitsunai, Tokuji; Tashiro, Hideo; Itoh, Mari; Sumi, Noriko; Ishii, Yoshiyuki; Nakamura, Shin; Hazama, Makoto; Nishine, Tsutomu; Harada, Akira; Yamamoto, Rintaro; Matsumoto, Hiroyuki; Sakaguchi, Sumito; Ikegami, Takashi; Kashiwagi, Katsuya; Fujiwake, Syuji; Inoue, Kouji; Togawa, Yoshiyuki; Izawa, Masaki; Ohara, Eiji; Watahiki, Masanori; Yoneda, Yuko; Ishikawa, Tomokazu; Ozawa, Kaori; Tanaka, Takumi; Matsuura, Shuji; Kawai, Jun; Okazaki, Yasushi; Muramatsu, Masami; Inoue, Yorinao; Kira, Akira; Hayashizaki, Yoshihide

2000-01-01

The RIKEN high-throughput 384-format sequencing pipeline (RISA system) including a 384-multicapillary sequencer (the so-called RISA sequencer) was developed for the RIKEN mouse encyclopedia project. The RISA system consists of colony picking, template preparation, sequencing reaction, and the sequencing process. A novel high-throughput 384-format capillary sequencer system (RISA sequencer system) was developed for the sequencing process. This system consists of a 384-multicapillary auto sequencer (RISA sequencer), a 384-multicapillary array assembler (CAS), and a 384-multicapillary casting device. The RISA sequencer can simultaneously analyze 384 independent sequencing products. The optical system is a scanning system chosen after careful comparison with an image detection system for the simultaneous detection of the 384-capillary array. This scanning system can be used with any fluorescent-labeled sequencing reaction (chain termination reaction), including transcriptional sequencing based on RNA polymerase, which was originally developed by us, and cycle sequencing based on thermostable DNA polymerase. For long-read sequencing, 380 out of 384 sequences (99.2%) were successfully analyzed and the average read length, with more than 99% accuracy, was 654.4 bp. A single RISA sequencer can analyze 216 kb with >99% accuracy in 2.7 h (90 kb/h). For short-read sequencing to cluster the 3′ end and 5′ end sequencing by reading 350 bp, 384 samples can be analyzed in 1.5 h. We have also developed a RISA inoculator, RISA filtrator and densitometer, RISA plasmid preparator which can handle throughput of 40,000 samples in 17.5 h, and a high-throughput RISA thermal cycler which has four 384-well sites. The combination of these technologies allowed us to construct the RISA system consisting of 16 RISA sequencers, which can process 50,000 DNA samples per day. One haploid genome shotgun sequence of a higher organism, such as human, mouse, rat, domestic animals, and plants, can be revealed by seven RISA systems within one month. PMID:11076861
Whole Transcriptome of the Venom Gland from Urodacus yaschenkoi Scorpion

PubMed Central

Juárez-González, Víctor Rivelino; Possani, Lourival D.

2015-01-01

Australian scorpion venoms have been poorly studied, probably because they do not pose an evident threat to humans. In addition, the continent has other medically important venomous animals capable of causing serious health problems. Urodacus yaschenkoi belongs to the most widely distributed family of Australian scorpions (Urodacidae) and it is found all over the continent, making it a useful model system for studying venom composition and evolution. This communication reports the whole set of mRNA transcripts produced by the venom gland. U. yaschenkoi venom is as complex as its overseas counterparts. These transcripts certainly code for several components similar to known scorpion venom components, such as: alpha-KTxs, beta-KTxs, calcins, protease inhibitors, antimicrobial peptides, sodium-channel toxins, toxin-like peptides, allergens, La1-like, hyaluronidases, ribosomal proteins, proteasome components and proteins related to cellular processes. A comparison with the venom gland transcriptome of Centruroides noxius (Buthidae) showed that these two scorpions have similar components related to biological processes, although important differences occur among the venom toxins. In contrast, a comparison with sequences reported for Urodacus manicatus revealed that these two Urodacidae species possess the same subfamily of scorpion toxins. A comparison with sequences of an U. yaschenkoi cDNA library previously reported by our group showed that both techniques are reliable for the description of the venom components, but the whole transcriptome generated with Next Generation Sequencing platform provides sequences of all transcripts expressed. Several of which were identified in the proteome, but many more transcripts were identified including uncommon transcripts. The information reported here constitutes a reference for non-Buthidae scorpion venoms, providing a comprehensive view of genes that are involved in venom production. Further, this work identifies new putative bioactive compounds that could be used to seed research into new pharmacological compounds and increase our understanding of the function of different ion channels. PMID:26020943
Whole Transcriptome of the Venom Gland from Urodacus yaschenkoi Scorpion.

PubMed

Luna-Ramírez, Karen; Quintero-Hernández, Verónica; Juárez-González, Víctor Rivelino; Possani, Lourival D

2015-01-01

Australian scorpion venoms have been poorly studied, probably because they do not pose an evident threat to humans. In addition, the continent has other medically important venomous animals capable of causing serious health problems. Urodacus yaschenkoi belongs to the most widely distributed family of Australian scorpions (Urodacidae) and it is found all over the continent, making it a useful model system for studying venom composition and evolution. This communication reports the whole set of mRNA transcripts produced by the venom gland. U. yaschenkoi venom is as complex as its overseas counterparts. These transcripts certainly code for several components similar to known scorpion venom components, such as: alpha-KTxs, beta-KTxs, calcins, protease inhibitors, antimicrobial peptides, sodium-channel toxins, toxin-like peptides, allergens, La1-like, hyaluronidases, ribosomal proteins, proteasome components and proteins related to cellular processes. A comparison with the venom gland transcriptome of Centruroides noxius (Buthidae) showed that these two scorpions have similar components related to biological processes, although important differences occur among the venom toxins. In contrast, a comparison with sequences reported for Urodacus manicatus revealed that these two Urodacidae species possess the same subfamily of scorpion toxins. A comparison with sequences of an U. yaschenkoi cDNA library previously reported by our group showed that both techniques are reliable for the description of the venom components, but the whole transcriptome generated with Next Generation Sequencing platform provides sequences of all transcripts expressed. Several of which were identified in the proteome, but many more transcripts were identified including uncommon transcripts. The information reported here constitutes a reference for non-Buthidae scorpion venoms, providing a comprehensive view of genes that are involved in venom production. Further, this work identifies new putative bioactive compounds that could be used to seed research into new pharmacological compounds and increase our understanding of the function of different ion channels.
A high quality assembly of the Nile Tilapia (Oreochromis niloticus) genome reveals the structure of two sex determination regions.

PubMed

Conte, Matthew A; Gammerdinger, William J; Bartie, Kerry L; Penman, David J; Kocher, Thomas D

2017-05-02

Tilapias are the second most farmed fishes in the world and a sustainable source of food. Like many other fish, tilapias are sexually dimorphic and sex is a commercially important trait in these fish. In this study, we developed a significantly improved assembly of the tilapia genome using the latest genome sequencing methods and show how it improves the characterization of two sex determination regions in two tilapia species. A homozygous clonal XX female Nile tilapia (Oreochromis niloticus) was sequenced to 44X coverage using Pacific Biosciences (PacBio) SMRT sequencing. Dozens of candidate de novo assemblies were generated and an optimal assembly (contig NG50 of 3.3Mbp) was selected using principal component analysis of likelihood scores calculated from several paired-end sequencing libraries. Comparison of the new assembly to the previous O. niloticus genome assembly reveals that recently duplicated portions of the genome are now well represented. The overall number of genes in the new assembly increased by 27.3%, including a 67% increase in pseudogenes. The new tilapia genome assembly correctly represents two recent vasa gene duplication events that have been verified with BAC sequencing. At total of 146Mbp of additional transposable element sequence are now assembled, a large proportion of which are recent insertions. Large centromeric satellite repeats are assembled and annotated in cichlid fish for the first time. Finally, the new assembly identifies the long-range structure of both a ~9Mbp XY sex determination region on LG1 in O. niloticus, and a ~50Mbp WZ sex determination region on LG3 in the related species O. aureus. This study highlights the use of long read sequencing to correctly assemble recent duplications and to characterize repeat-filled regions of the genome. The study serves as an example of the need for high quality genome assemblies and provides a framework for identifying sex determining genes in tilapia and related fish species.
Exploring halo substructure with giant stars. XIV. The nature of the Triangulum-Andromeda stellar features

DOE Office of Scientific and Technical Information (OSTI.GOV)

Sheffield, Allyson A.; Johnston, Kathryn V.; Majewski, Steven R.

As large-scale stellar surveys have become available over the past decade, the ability to detect and characterize substructures in the Galaxy has increased dramatically. These surveys have revealed the Triangulum-Andromeda (TriAnd) region to be rich with substructures in the distance range 20-30 kpc, and the relation of these features to each other, if any, remains unclear. An exploration using Two Micron All Sky Survey (2MASS) photometry reveals not only the faint sequence in M giants detected by Rocha-Pinto et al. spanning the range 100° < l < 160° and –50° < b < –15°, but, in addition, a second, brightermore » and more densely populated sequence. These sequences are likely associated with the distinct main sequences (MSs) discovered (and labeled TriAnd1 and TriAnd2) by Martin et al. in an optical survey in the direction of M31, where TriAnd2 is the optical counterpart of the fainter red giant branch (RGB)/asymptotic giant branch sequence of Rocha-Pinto et al. Here, the age, distance, and metallicity ranges for TriAnd1 and TriAnd2 are estimated by simultaneously fitting isochrones to the 2MASS RGB tracks and the optical MS/MS turn-off features. The two populations are clearly distinct in age and distance: the brighter sequence (TriAnd1) is younger (6-10 Gyr) and closer (distance of ∼15-21 kpc), whereas the fainter sequence (TriAnd2) is older (10-12 Gyr) and at an estimated distance of ∼24-32 kpc. A comparison with simulations demonstrates that the differences and similarities between TriAnd1 and TriAnd2 can simultaneously be explained if they represent debris originating from the disruption of the same dwarf galaxy, but torn off during two distinct pericentric passages.« less
Purification and sequence analysis of 4-methyl-5-nitrocatechol oxygenase from Burkholderia sp. strain DNT.

PubMed Central

Haigler, B E; Suen, W C; Spain, J C

1996-01-01

4-Methyl-5-nitrocatechol (MNC) is an intermediate in the degradation of 2,4-dinitrotoluene by Burkholderia sp. strain DNT. In the presence of NADPH and oxygen, MNC monooxygenase catalyzes the removal of the nitro group from MNC to form 2-hydroxy-5-methylquinone. The gene (dntB) encoding MNC monooxygenase has been previously cloned and characterized. In order to examine the properties of MNC monooxygenase and to compare it with other enzymes, we sequenced the gene encoding the MNC monooxygenase and purified the enzyme from strain DNT. dntB was localized within a 2.2-kb ApaI DNA fragment. Sequence analysis of this fragment revealed an open reading frame of 1,644 bp with an N-terminal amino acid sequence identical to that of purified MNC monooxygenase from strain DNT. Comparison of the derived amino acid sequences with those of other genes showed that DntB contains the highly conserved ADP and flavin adenine dinucleotide (FAD) binding motifs characteristic of flavoprotein hydroxylases. MNC monooxygenase was purified to homogeneity from strain DNT by anion exchange and gel filtration chromatography. Sodium dodecyl sulfate-polyacrylamide gel electrophoresis revealed a single protein with a molecular weight of 60,200, which is consistent with the size determined from the gene sequence. The native molecular weight determined by gel filtration was 65,000, which indicates that the native enzyme is a monomer. It used either NADH or NADPH as electron donors, and NADPH was the preferred cofactor. The purified enzyme contained 1 mol of FAD per mol of protein, which is also consistent with the detection of an FAD binding motif in the amino acid sequence of DntB. MNC monooxygenase has a narrow substrate specificity. MNC and 4-nitrocatechol are good substrates whereas 3-methyl-4-nitrophenol, 3-methyl-4-nitrocatechol, 4-nitrophenol, 3-nitrophenol, and 4-chlorocatechol were not. These studies suggest that MNC monooxygenase is a flavoprotein that shares some properties with previously studied nitrophenol oxygenases. PMID:8830701
Comparison of ELISA, nested PCR and sequencing and a novel qPCR for detection of Giardia isolates from Jordan.

PubMed

Hijjawi, Nawal; Yang, Rongchang; Hatmal, Ma'mon; Yassin, Yasmeen; Mharib, Taghrid; Mukbel, Rami; Mahmoud, Sameer Alhaj; Al-Shudifat, Abdel-Ellah; Ryan, Una

2018-02-01

Little is known about the prevalence of Giardia duodenalis in human patients in Jordan and all previous studies have used direct microscopy, which lacks sensitivity. The present study developed a novel quantitative PCR (qPCR) assay at the β-giardin (bg) locus and evaluated its use as a frontline test for the diagnosis of giardiasis in comparison with a commercially available ELISA using nested PCR and sequencing of the glutamate dehydrogenase (gdh) locus (gdh nPCR) as the gold standard. A total of 96 human faecal samples were collected from 96 patients suffering from diarrhoea from 5 regions of Jordan and were screened using the ELISA and qPCR. The analytical specificity of the bg qPCR assay revealed no cross-reactions with other genera and detected all the Giardia isolates tested. Analytical sensitivity was 1 Giardia cyst per μl of DNA extract. The overall prevalence of Giardia was 64.6%. The clinical sensitivity and specificity of the bg qPCR was 89.9% and 82.9% respectively compared to 76.5 and 68.0% for the ELISA. This study is the first to compare three different methods (ELISA, bg qPCR, nested PCR and sequencing at the gdh locus) to diagnose Jordanian patients suffering from giardiasis and to analyze their demographic data. Copyright © 2018 Elsevier Inc. All rights reserved.
High-resolution melt analysis to identify and map sequence-tagged site anchor points onto linkage maps: a white lupin (Lupinus albus) map as an exemplar.

PubMed

Croxford, Adam E; Rogers, Tom; Caligari, Peter D S; Wilkinson, Michael J

2008-01-01

* The provision of sequence-tagged site (STS) anchor points allows meaningful comparisons between mapping studies but can be a time-consuming process for nonmodel species or orphan crops. * Here, the first use of high-resolution melt analysis (HRM) to generate STS markers for use in linkage mapping is described. This strategy is rapid and low-cost, and circumvents the need for labelled primers or amplicon fractionation. * Using white lupin (Lupinus albus, x = 25) as a case study, HRM analysis was applied to identify 91 polymorphic markers from expressed sequence tag (EST)-derived and genomic libraries. Of these, 77 generated STS anchor points in the first fully resolved linkage map of the species. The map also included 230 amplified fragment length polymorphisms (AFLP) loci, spanned 1916 cM (84.2% coverage) and divided into the expected 25 linkage groups. * Quantitative trait loci (QTL) analyses performed on the population revealed genomic regions associated with several traits, including the agronomically important time to flowering (tf), alkaloid synthesis and stem height (Ph). Use of HRM-STS markers also allowed us to make direct comparisons between our map and that of the related crop, Lupinus angustifolius, based on the conversion of RFLP, microsatellite and single nucleotide polymorphism (SNP) markers into HRM markers.
Sequencing and analysis of 10,967 full-length cDNA clones from Xenopus laevis and Xenopus tropicalis reveals post-tetraploidization transcriptome remodeling

PubMed Central

Morin, Ryan D.; Chang, Elbert; Petrescu, Anca; Liao, Nancy; Griffith, Malachi; Kirkpatrick, Robert; Butterfield, Yaron S.; Young, Alice C.; Stott, Jeffrey; Barber, Sarah; Babakaiff, Ryan; Dickson, Mark C.; Matsuo, Corey; Wong, David; Yang, George S.; Smailus, Duane E.; Wetherby, Keith D.; Kwong, Peggy N.; Grimwood, Jane; Brinkley, Charles P.; Brown-John, Mabel; Reddix-Dugue, Natalie D.; Mayo, Michael; Schmutz, Jeremy; Beland, Jaclyn; Park, Morgan; Gibson, Susan; Olson, Teika; Bouffard, Gerard G.; Tsai, Miranda; Featherstone, Ruth; Chand, Steve; Siddiqui, Asim S.; Jang, Wonhee; Lee, Ed; Klein, Steven L.; Blakesley, Robert W.; Zeeberg, Barry R.; Narasimhan, Sudarshan; Weinstein, John N.; Pennacchio, Christa Prange; Myers, Richard M.; Green, Eric D.; Wagner, Lukas; Gerhard, Daniela S.; Marra, Marco A.; Jones, Steven J.M.; Holt, Robert A.

2006-01-01

Sequencing of full-insert clones from full-length cDNA libraries from both Xenopus laevis and Xenopus tropicalis has been ongoing as part of the Xenopus Gene Collection Initiative. Here we present 10,967 full ORF verified cDNA clones (8049 from X. laevis and 2918 from X. tropicalis) as a community resource. Because the genome of X. laevis, but not X. tropicalis, has undergone allotetraploidization, comparison of coding sequences from these two clawed (pipid) frogs provides a unique angle for exploring the molecular evolution of duplicate genes. Within our clone set, we have identified 445 gene trios, each comprised of an allotetraploidization-derived X. laevis gene pair and their shared X. tropicalis ortholog. Pairwise dN/dS, comparisons within trios show strong evidence for purifying selection acting on all three members. However, dN/dS ratios between X. laevis gene pairs are elevated relative to their X. tropicalis ortholog. This difference is highly significant and indicates an overall relaxation of selective pressures on duplicated gene pairs. We have found that the paralogs that have been lost since the tetraploidization event are enriched for several molecular functions, but have found no such enrichment in the extant paralogs. Approximately 14% of the paralogous pairs analyzed here also show differential expression indicative of subfunctionalization. PMID:16672307
Multi-species sequence comparison reveals conservation of ghrelin gene-derived splice variants encoding a truncated ghrelin peptide.

PubMed

Seim, Inge; Jeffery, Penny L; Thomas, Patrick B; Walpole, Carina M; Maugham, Michelle; Fung, Jenny N T; Yap, Pei-Yi; O'Keeffe, Angela J; Lai, John; Whiteside, Eliza J; Herington, Adrian C; Chopin, Lisa K

2016-06-01

The peptide hormone ghrelin is a potent orexigen produced predominantly in the stomach. It has a number of other biological actions, including roles in appetite stimulation, energy balance, the stimulation of growth hormone release and the regulation of cell proliferation. Recently, several ghrelin gene splice variants have been described. Here, we attempted to identify conserved alternative splicing of the ghrelin gene by cross-species sequence comparisons. We identified a novel human exon 2-deleted variant and provide preliminary evidence that this splice variant and in1-ghrelin encode a C-terminally truncated form of the ghrelin peptide, termed minighrelin. These variants are expressed in humans and mice, demonstrating conservation of alternative splicing spanning 90 million years. Minighrelin appears to have similar actions to full-length ghrelin, as treatment with exogenous minighrelin peptide stimulates appetite and feeding in mice. Forced expression of the exon 2-deleted preproghrelin variant mirrors the effect of the canonical preproghrelin, stimulating cell proliferation and migration in the PC3 prostate cancer cell line. This is the first study to characterise an exon 2-deleted preproghrelin variant and to demonstrate sequence conservation of ghrelin gene-derived splice variants that encode a truncated ghrelin peptide. This adds further impetus for studies into the alternative splicing of the ghrelin gene and the function of novel ghrelin peptides in vertebrates.
Targeted enrichment of ancient pathogens yielding the pPCP1 plasmid of Yersinia pestis from victims of the Black Death.

PubMed

Schuenemann, Verena J; Bos, Kirsten; DeWitte, Sharon; Schmedes, Sarah; Jamieson, Joslyn; Mittnik, Alissa; Forrest, Stephen; Coombes, Brian K; Wood, James W; Earn, David J D; White, William; Krause, Johannes; Poinar, Hendrik N

2011-09-20

Although investigations of medieval plague victims have identified Yersinia pestis as the putative etiologic agent of the pandemic, methodological limitations have prevented large-scale genomic investigations to evaluate changes in the pathogen's virulence over time. We screened over 100 skeletal remains from Black Death victims of the East Smithfield mass burial site (1348-1350, London, England). Recent methods of DNA enrichment coupled with high-throughput DNA sequencing subsequently permitted reconstruction of ten full human mitochondrial genomes (16 kb each) and the full pPCP1 (9.6 kb) virulence-associated plasmid at high coverage. Comparisons of molecular damage profiles between endogenous human and Y. pestis DNA confirmed its authenticity as an ancient pathogen, thus representing the longest contiguous genomic sequence for an ancient pathogen to date. Comparison of our reconstructed plasmid against modern Y. pestis shows identity with several isolates matching the Medievalis biovar; however, our chromosomal sequences indicate the victims were infected with a Y. pestis variant that has not been previously reported. Our data reveal that the Black Death in medieval Europe was caused by a variant of Y. pestis that may no longer exist, and genetic data carried on its pPCP1 plasmid were not responsible for the purported epidemiological differences between ancient and modern forms of Y. pestis infections.
Automated two-point dixon screening for the evaluation of hepatic steatosis and siderosis: comparison with R2-relaxometry and chemical shift-based sequences.

PubMed

Henninger, B; Zoller, H; Rauch, S; Schocke, M; Kannengiesser, S; Zhong, X; Reiter, G; Jaschke, W; Kremser, C

2015-05-01

To evaluate the automated two-point Dixon screening sequence for the detection and estimated quantification of hepatic iron and fat compared with standard sequences as a reference. One hundred and two patients with suspected diffuse liver disease were included in this prospective study. The following MRI protocol was used: 3D-T1-weighted opposed- and in-phase gradient echo with two-point Dixon reconstruction and dual-ratio signal discrimination algorithm ("screening" sequence); fat-saturated, multi-gradient-echo sequence with 12 echoes; gradient-echo T1 FLASH opposed- and in-phase. Bland-Altman plots were generated and correlation coefficients were calculated to compare the sequences. The screening sequence diagnosed fat in 33, iron in 35 and a combination of both in 4 patients. Correlation between R2* values of the screening sequence and the standard relaxometry was excellent (r = 0.988). A slightly lower correlation (r = 0.978) was found between the fat fraction of the screening sequence and the standard sequence. Bland-Altman revealed systematically lower R2* values obtained from the screening sequence and higher fat fraction values obtained with the standard sequence with a rather high variability in agreement. The screening sequence is a promising method with fast diagnosis of the predominant liver disease. It is capable of estimating the amount of hepatic fat and iron comparable to standard methods. • MRI plays a major role in the clarification of diffuse liver disease. • The screening sequence was introduced for the assessment of diffuse liver disease. • It is a fast and automated algorithm for the evaluation of hepatic iron and fat. • It is capable of estimating the amount of hepatic fat and iron.
Draft genome sequence of marine alphaproteobacterial strain HIMB11, the first cultivated representative of a unique lineage within the Roseobacter clade possessing an unusually small genome

PubMed Central

Durham, Bryndan P.; Grote, Jana; Whittaker, Kerry A.; Bender, Sara J.; Luo, Haiwei; Grim, Sharon L.; Brown, Julia M.; Casey, John R.; Dron, Antony; Florez-Leiva, Lennin; Krupke, Andreas; Luria, Catherine M.; Mine, Aric H.; Nigro, Olivia D.; Pather, Santhiska; Talarmin, Agathe; Wear, Emma K.; Weber, Thomas S.; Wilson, Jesse M.; Church, Matthew J.; DeLong, Edward F.; Karl, David M.; Steward, Grieg F.; Eppley, John M.; Kyrpides, Nikos C.; Schuster, Stephan; Rappé, Michael S.

2014-01-01

Strain HIMB11 is a planktonic marine bacterium isolated from coastal seawater in Kaneohe Bay, Oahu, Hawaii belonging to the ubiquitous and versatile Roseobacter clade of the alphaproteobacterial family Rhodobacteraceae. Here we describe the preliminary characteristics of strain HIMB11, including annotation of the draft genome sequence and comparative genomic analysis with other members of the Roseobacter lineage. The 3,098,747 bp draft genome is arranged in 34 contigs and contains 3,183 protein-coding genes and 54 RNA genes. Phylogenomic and 16S rRNA gene analyses indicate that HIMB11 represents a unique sublineage within the Roseobacter clade. Comparison with other publicly available genome sequences from members of the Roseobacter lineage reveals that strain HIMB11 has the genomic potential to utilize a wide variety of energy sources (e.g. organic matter, reduced inorganic sulfur, light, carbon monoxide), while possessing a reduced number of substrate transporters. PMID:25197450
Draft genome sequence of marine alphaproteobacterial strain HIMB11, the first cultivated representative of a unique lineage within the Roseobacter clade possessing an unusually small genome.

PubMed

Durham, Bryndan P; Grote, Jana; Whittaker, Kerry A; Bender, Sara J; Luo, Haiwei; Grim, Sharon L; Brown, Julia M; Casey, John R; Dron, Antony; Florez-Leiva, Lennin; Krupke, Andreas; Luria, Catherine M; Mine, Aric H; Nigro, Olivia D; Pather, Santhiska; Talarmin, Agathe; Wear, Emma K; Weber, Thomas S; Wilson, Jesse M; Church, Matthew J; DeLong, Edward F; Karl, David M; Steward, Grieg F; Eppley, John M; Kyrpides, Nikos C; Schuster, Stephan; Rappé, Michael S

2014-06-15

Strain HIMB11 is a planktonic marine bacterium isolated from coastal seawater in Kaneohe Bay, Oahu, Hawaii belonging to the ubiquitous and versatile Roseobacter clade of the alphaproteobacterial family Rhodobacteraceae. Here we describe the preliminary characteristics of strain HIMB11, including annotation of the draft genome sequence and comparative genomic analysis with other members of the Roseobacter lineage. The 3,098,747 bp draft genome is arranged in 34 contigs and contains 3,183 protein-coding genes and 54 RNA genes. Phylogenomic and 16S rRNA gene analyses indicate that HIMB11 represents a unique sublineage within the Roseobacter clade. Comparison with other publicly available genome sequences from members of the Roseobacter lineage reveals that strain HIMB11 has the genomic potential to utilize a wide variety of energy sources (e.g. organic matter, reduced inorganic sulfur, light, carbon monoxide), while possessing a reduced number of substrate transporters.
Comparison and correlation of Simple Sequence Repeats distribution in genomes of Brucella species

PubMed Central

Kiran, Jangampalli Adi Pradeep; Chakravarthi, Veeraraghavulu Praveen; Kumar, Yellapu Nanda; Rekha, Somesula Swapna; Kruti, Srinivasan Shanthi; Bhaskar, Matcha

2011-01-01

Computational genomics is one of the important tools to understand the distribution of closely related genomes including simple sequence repeats (SSRs) in an organism, which gives valuable information regarding genetic variations. The central objective of the present study was to screen the SSRs distributed in coding and non-coding regions among different human Brucella species which are involved in a range of pathological disorders. Computational analysis of the SSRs in the Brucella indicates few deviations from expected random models. Statistical analysis also reveals that tri-nucleotide SSRs are overrepresented and tetranucleotide SSRs underrepresented in Brucella genomes. From the data, it can be suggested that over expressed tri-nucleotide SSRs in genomic and coding regions might be responsible in the generation of functional variation of proteins expressed which in turn may lead to different pathogenicity, virulence determinants, stress response genes, transcription regulators and host adaptation proteins of Brucella genomes. Abbreviations SSRs - Simple Sequence Repeats, ORFs - Open Reading Frames. PMID:21738309
Diversity of immunoglobulin lambda light chain gene usage over developmental stages in the horse.

PubMed

Tallmadge, Rebecca L; Tseng, Chia T; Felippe, M Julia B

2014-10-01

To further studies of neonatal immune responses to pathogens and vaccination, we investigated the dynamics of B lymphocyte development and immunoglobulin (Ig) gene diversity. Previously we demonstrated that equine fetal Ig VDJ sequences exhibit combinatorial and junctional diversity levels comparable to those of adult Ig VDJ sequences. Herein, RACE clones from fetal, neonatal, foal, and adult lymphoid tissue were assessed for Ig lambda light chain combinatorial, junctional, and sequence diversity. Remarkably, more lambda variable genes (IGLV) were used during fetal life than later stages and IGLV gene usage differed significantly with time, in contrast to the Ig heavy chain. Junctional diversity measured by CDR3L length was constant over time. Comparison of Ig lambda transcripts to germline revealed significant increases in nucleotide diversity over time, even during fetal life. These results suggest that the Ig lambda light chain provides an additional dimension of diversity to the equine Ig repertoire. Copyright © 2014 Elsevier Ltd. All rights reserved.

Cloning, molecular characterization and heterologous expression of AMY1, an alpha-amylase gene from Cryptococcus flavus.

PubMed

Galdino, Alexsandro S; Ulhoa, Cirano J; Moraes, Lídia Maria P; Prates, Maura V; Bloch, Carlos; Torres, Fernando A G

2008-03-01

A Cryptococcus flavus gene (AMY1) encoding an extracellular alpha-amylase has been cloned. The nucleotide sequence of the cDNA revealed an ORF of 1896 bp encoding for a 631 amino acid polypeptide with high sequence identity with a homologous protein isolated from Cryptococcus sp. S-2. The presence of four conserved signature regions, (I) (144)DVVVNH(149), (II) (235)GLRIDSLQQ(243), (III) (263)GEVFN(267), (IV) (327)FLENQD(332), placed the enzyme in the GH13 alpha-amylase family. Furthermore, sequence comparison suggests that the C. flavusalpha-amylase has a C-terminal starch-binding domain characteristic of the CBM20 family. AMY1 was successfully expressed in Saccharomyces cerevisiae. The time course of amylase secretion in S. cerevisiae resulted in a maximal extracellular amylolytic activity (3.93 U mL(-1)) at 60 h of incubation. The recombinant protein had an apparent molecular mass similar to the native enzyme (c. 67 kDa), part of which was due to N-glycosylation.
Molecular characterization of banana bunchy top virus isolate from Sri Lanka and its genetic relationship with other isolates.

PubMed

Wickramaarachchi, W A R T; Shankarappa, K S; Rangaswamy, K T; Maruthi, M N; Rajapakse, R G A S; Ghosh, Saptarshi

2016-06-01

Bunchy top disease of banana caused by Banana bunchy top virus (BBTV, genus Babuvirus family Nanoviridae) is one of the most important constraints in production of banana in the different parts of the world. Six genomic DNA components of BBTV isolate from Kandy, Sri Lanka (BBTV-K) were amplified by polymerase chain reaction (PCR) with specific primers using total DNA extracted from banana tissues showing typical symptoms of bunchy top disease. The amplicons were of expected size of 1.0-1.1 kb, which were cloned and sequenced. Analysis of sequence data revealed the presence of six DNA components; DNA-R, DNA-U3, DNA-S, DNA-N, DNA-M and DNA-C for Sri Lanka isolate. Comparisons of sequence data of DNA components followed by the phylogenetic analysis, grouped Sri Lanka-(Kandy) isolate in the Pacific Indian Oceans (PIO) group. Sri Lanka-(Kandy) isolate of BBTV is classified a new member of PIO group based on analysis of six components of the virus.
LFQC: a lossless compression algorithm for FASTQ files

PubMed Central

Nicolae, Marius; Pathak, Sudipta; Rajasekaran, Sanguthevar

2015-01-01

Motivation: Next Generation Sequencing (NGS) technologies have revolutionized genomic research by reducing the cost of whole genome sequencing. One of the biggest challenges posed by modern sequencing technology is economic storage of NGS data. Storing raw data is infeasible because of its enormous size and high redundancy. In this article, we address the problem of storage and transmission of large FASTQ files using innovative compression techniques. Results: We introduce a new lossless non-reference based FASTQ compression algorithm named Lossless FASTQ Compressor. We have compared our algorithm with other state of the art big data compression algorithms namely gzip, bzip2, fastqz (Bonfield and Mahoney, 2013), fqzcomp (Bonfield and Mahoney, 2013), Quip (Jones et al., 2012), DSRC2 (Roguski and Deorowicz, 2014). This comparison reveals that our algorithm achieves better compression ratios on LS454 and SOLiD datasets. Availability and implementation: The implementations are freely available for non-commercial purposes. They can be downloaded from http://engr.uconn.edu/rajasek/lfqc-v1.1.zip. Contact: rajasek@engr.uconn.edu PMID:26093148
Detection and quantification of Plasmodium falciparum in blood samples using quantitative nucleic acid sequence-based amplification.

PubMed

Schoone, G J; Oskam, L; Kroon, N C; Schallig, H D; Omar, S A

2000-11-01

A quantitative nucleic acid sequence-based amplification (QT-NASBA) assay for the detection of Plasmodium parasites has been developed. Primers and probes were selected on the basis of the sequence of the small-subunit rRNA gene. Quantification was achieved by coamplification of the RNA in the sample with one modified in vitro RNA as a competitor in a single-tube NASBA reaction. Parasite densities ranging from 10 to 10(8) Plasmodium falciparum parasites per ml could be demonstrated and quantified in whole blood. This is approximately 1,000 times more sensitive than conventional microscopy analysis of thick blood smears. Comparison of the parasite densities obtained by microscopy and QT-NASBA with 120 blood samples from Kenyan patients with clinical malaria revealed that for 112 of 120 (93%) of the samples results were within a 1-log difference. QT-NASBA may be especially useful for the detection of low parasite levels in patients with early-stage malaria and for the monitoring of the efficacy of drug treatment.
Calibrating genomic and allelic coverage bias in single-cell sequencing.

PubMed

Zhang, Cheng-Zhong; Adalsteinsson, Viktor A; Francis, Joshua; Cornils, Hauke; Jung, Joonil; Maire, Cecile; Ligon, Keith L; Meyerson, Matthew; Love, J Christopher

2015-04-16

Artifacts introduced in whole-genome amplification (WGA) make it difficult to derive accurate genomic information from single-cell genomes and require different analytical strategies from bulk genome analysis. Here, we describe statistical methods to quantitatively assess the amplification bias resulting from whole-genome amplification of single-cell genomic DNA. Analysis of single-cell DNA libraries generated by different technologies revealed universal features of the genome coverage bias predominantly generated at the amplicon level (1-10 kb). The magnitude of coverage bias can be accurately calibrated from low-pass sequencing (∼0.1 × ) to predict the depth-of-coverage yield of single-cell DNA libraries sequenced at arbitrary depths. We further provide a benchmark comparison of single-cell libraries generated by multi-strand displacement amplification (MDA) and multiple annealing and looping-based amplification cycles (MALBAC). Finally, we develop statistical models to calibrate allelic bias in single-cell whole-genome amplification and demonstrate a census-based strategy for efficient and accurate variant detection from low-input biopsy samples.
Calibrating genomic and allelic coverage bias in single-cell sequencing

PubMed Central

Francis, Joshua; Cornils, Hauke; Jung, Joonil; Maire, Cecile; Ligon, Keith L.; Meyerson, Matthew; Love, J. Christopher

2016-01-01

Artifacts introduced in whole-genome amplification (WGA) make it difficult to derive accurate genomic information from single-cell genomes and require different analytical strategies from bulk genome analysis. Here, we describe statistical methods to quantitatively assess the amplification bias resulting from whole-genome amplification of single-cell genomic DNA. Analysis of single-cell DNA libraries generated by different technologies revealed universal features of the genome coverage bias predominantly generated at the amplicon level (1–10 kb). The magnitude of coverage bias can be accurately calibrated from low-pass sequencing (~0.1 ×) to predict the depth-of-coverage yield of single-cell DNA libraries sequenced at arbitrary depths. We further provide a benchmark comparison of single-cell libraries generated by multi-strand displacement amplification (MDA) and multiple annealing and looping-based amplification cycles (MALBAC). Finally, we develop statistical models to calibrate allelic bias in single-cell whole-genome amplification and demonstrate a census-based strategy for efficient and accurate variant detection from low-input biopsy samples. PMID:25879913
A Multiple-Sequence Variant of the Multiple-Baseline Design: A Strategy for Analysis of Sequence Effects and Treatment Comparison.

ERIC Educational Resources Information Center

Noell, George H.; Gresham, Frank M.

2001-01-01

Describes design logic and potential uses of a variant of the multiple-baseline design. The multiple-baseline multiple-sequence (MBL-MS) consists of multiple-baseline designs that are interlaced with one another and include all possible sequences of treatments. The MBL-MS design appears to be primarily useful for comparison of treatments taking…
Evaluation of two main RNA-seq approaches for gene quantification in clinical RNA sequencing: polyA+ selection versus rRNA depletion.

PubMed

Zhao, Shanrong; Zhang, Ying; Gamini, Ramya; Zhang, Baohong; von Schack, David

2018-03-19

To allow efficient transcript/gene detection, highly abundant ribosomal RNAs (rRNA) are generally removed from total RNA either by positive polyA+ selection or by rRNA depletion (negative selection) before sequencing. Comparisons between the two methods have been carried out by various groups, but the assessments have relied largely on non-clinical samples. In this study, we evaluated these two RNA sequencing approaches using human blood and colon tissue samples. Our analyses showed that rRNA depletion captured more unique transcriptome features, whereas polyA+ selection outperformed rRNA depletion with higher exonic coverage and better accuracy of gene quantification. For blood- and colon-derived RNAs, we found that 220% and 50% more reads, respectively, would have to be sequenced to achieve the same level of exonic coverage in the rRNA depletion method compared with the polyA+ selection method. Therefore, in most cases we strongly recommend polyA+ selection over rRNA depletion for gene quantification in clinical RNA sequencing. Our evaluation revealed that a small number of lncRNAs and small RNAs made up a large fraction of the reads in the rRNA depletion RNA sequencing data. Thus, we recommend that these RNAs are specifically depleted to improve the sequencing depth of the remaining RNAs.
Molecular cloning of two human liver 3 alpha-hydroxysteroid/dihydrodiol dehydrogenase isoenzymes that are identical with chlordecone reductase and bile-acid binder.

PubMed Central

Deyashiki, Y; Ogasawara, A; Nakayama, T; Nakanishi, M; Miyabe, Y; Sato, K; Hara, A

1994-01-01

Human liver contains two dihydrodiol dehydrogenases, DD2 and DD4, associated with 3 alpha-hydroxysteroid dehydrogenase activity. We have raised polyclonal antibodies that cross-reacted with the two enzymes and isolated two 1.2 kb cDNA clones (C9 and C11) for the two enzymes from a human liver cDNA library using the antibodies. The clones of C9 and C11 contained coding sequences corresponding to 306 and 321 amino acid residues respectively, but lacked 5'-coding regions around the initiation codon. Sequence analyses of several peptides obtained by enzymic and chemical cleavages of the two purified enzymes verified that the C9 and C11 clones encoded DD2 and DD4 respectively, and further indicated that the sequence of DD2 had at least additional 16 residues upward from the N-terminal sequence deduced from the cDNA. There was 82% amino acid sequence identity between the two enzymes, indicating that the enzymes are genetic isoenzymes. A computer-based comparison of the cDNAs of the isoenzymes with the DNA sequence database revealed that the nucleotide and amino acid sequences of DD2 and DD4 are virtually identical with those of human bile-acid binder and human chlordecone reductase cDNAs respectively. Images Figure 1 PMID:8172617
Sequence analysis of the genome of carnation (Dianthus caryophyllus L.).

PubMed

Yagi, Masafumi; Kosugi, Shunichi; Hirakawa, Hideki; Ohmiya, Akemi; Tanase, Koji; Harada, Taro; Kishimoto, Kyutaro; Nakayama, Masayoshi; Ichimura, Kazuo; Onozaki, Takashi; Yamaguchi, Hiroyasu; Sasaki, Nobuhiro; Miyahara, Taira; Nishizaki, Yuzo; Ozeki, Yoshihiro; Nakamura, Noriko; Suzuki, Takamasa; Tanaka, Yoshikazu; Sato, Shusei; Shirasawa, Kenta; Isobe, Sachiko; Miyamura, Yoshinori; Watanabe, Akiko; Nakayama, Shinobu; Kishida, Yoshie; Kohara, Mitsuyo; Tabata, Satoshi

2014-06-01

The whole-genome sequence of carnation (Dianthus caryophyllus L.) cv. 'Francesco' was determined using a combination of different new-generation multiplex sequencing platforms. The total length of the non-redundant sequences was 568,887,315 bp, consisting of 45,088 scaffolds, which covered 91% of the 622 Mb carnation genome estimated by k-mer analysis. The N50 values of contigs and scaffolds were 16,644 bp and 60,737 bp, respectively, and the longest scaffold was 1,287,144 bp. The average GC content of the contig sequences was 36%. A total of 1050, 13, 92 and 143 genes for tRNAs, rRNAs, snoRNA and miRNA, respectively, were identified in the assembled genomic sequences. For protein-encoding genes, 43 266 complete and partial gene structures excluding those in transposable elements were deduced. Gene coverage was ∼ 98%, as deduced from the coverage of the core eukaryotic genes. Intensive characterization of the assigned carnation genes and comparison with those of other plant species revealed characteristic features of the carnation genome. The results of this study will serve as a valuable resource for fundamental and applied research of carnation, especially for breeding new carnation varieties. Further information on the genomic sequences is available at http://carnation.kazusa.or.jp. © The Author 2013. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Molecular evolution of miraculin-like proteins in soybean Kunitz super-family.

PubMed

Selvakumar, Purushotham; Gahloth, Deepankar; Tomar, Prabhat Pratap Singh; Sharma, Nidhi; Sharma, Ashwani Kumar

2011-12-01

Miraculin-like proteins (MLPs) belong to soybean Kunitz super-family and have been characterized from many plant families like Rutaceae, Solanaceae, Rubiaceae, etc. Many of them possess trypsin inhibitory activity and are involved in plant defense. MLPs exhibit significant sequence identity (~30-95%) to native miraculin protein, also belonging to Kunitz super-family compared with a typical Kunitz family member (~30%). The sequence and structure-function comparison of MLPs with that of a classical Kunitz inhibitor have demonstrated that MLPs have evolved to form a distinct group within Kunitz super-family. Sequence analysis of new genes along with available MLP sequences in the literature revealed three major groups for these proteins. A significant feature of Rutaceae MLP type 2 sequences is the presence of phosphorylation motif. Subtle changes are seen in putative reactive loop residues among different MLPs suggesting altered specificities to specific proteases. In phylogenetic analysis, Rutaceae MLP type 1 and type 2 proteins clustered together on separate branches, whereas native miraculin along with other MLPs formed distinct clusters. Site-specific positive Darwinian selection was observed at many sites in both the groups of Rutaceae MLP sequences with most of the residues undergoing positive selection located in loop regions. The results demonstrate the sequence and thereby the structure-function divergence of MLPs as a distinct group within soybean Kunitz super-family due to biotic and abiotic stresses of local environment.
Genome Analysis of the Fruiting Body-Forming Myxobacterium Chondromyces crocatus Reveals High Potential for Natural Product Biosynthesis

PubMed Central

Zaburannyi, Nestor; Bunk, Boyke; Maier, Josef; Overmann, Jörg

2016-01-01

Here, we report the complete genome sequence of the type strain of the myxobacterial genus Chondromyces, Chondromyces crocatus Cm c5. It presents one of the largest prokaryotic genomes featuring a single circular chromosome and no plasmids. Analysis revealed an enlarged set of tRNA genes, along with reduced pressure on preferred codon usage compared to that of other bacterial genomes. The large coding capacity and the plethora of encoded secondary metabolite biosynthetic gene clusters are in line with the capability of Cm c5 to produce an arsenal of antibacterial, antifungal, and cytotoxic compounds. Known pathways of the ajudazol, chondramide, chondrochloren, crocacin, crocapeptin, and thuggacin compound families are complemented by many more natural compound biosynthetic gene clusters in the chromosome. Whole-genome comparison of the fruiting-body-forming type strain (Cm c5, DSM 14714) to an accustomed laboratory strain which has lost this ability (nonfruiting phenotype, Cm c5 fr−) revealed genetic changes in three loci. In addition to the low synteny found with the closest sequenced representative of the same family, Sorangium cellulosum, extensive genetic information duplication and broad application of eukaryotic-type signal transduction systems are hallmarks of this 11.3-Mbp prokaryotic genome. PMID:26773087
Insight into the evolution and origin of leprosy bacilli from the genome sequence of Mycobacterium lepromatosis

PubMed Central

Singh, Pushpendra; Benjak, Andrej; Schuenemann, Verena J.; Herbig, Alexander; Avanzi, Charlotte; Busso, Philippe; Nieselt, Kay; Krause, Johannes; Vera-Cabrera, Lucio; Cole, Stewart T.

2015-01-01

Mycobacterium lepromatosis is an uncultured human pathogen associated with diffuse lepromatous leprosy and a reactional state known as Lucio's phenomenon. By using deep sequencing with and without DNA enrichment, we obtained the near-complete genome sequence of M. lepromatosis present in a skin biopsy from a Mexican patient, and compared it with that of Mycobacterium leprae, which has undergone extensive reductive evolution. The genomes display extensive synteny and are similar in size (∼3.27 Mb). Protein-coding genes share 93% nucleotide sequence identity, whereas pseudogenes are only 82% identical. The events that led to pseudogenization of 50% of the genome likely occurred before divergence from their most recent common ancestor (MRCA), and both M. lepromatosis and M. leprae have since accumulated new pseudogenes or acquired specific deletions. Functional comparisons suggest that M. lepromatosis has lost several enzymes required for amino acid synthesis whereas M. leprae has a defective heme pathway. M. lepromatosis has retained all functions required to infect the Schwann cells of the peripheral nervous system and therefore may also be neuropathogenic. A phylogeographic survey of 227 leprosy biopsies by differential PCR revealed that 221 contained M. leprae whereas only six, all from Mexico, harbored M. lepromatosis. Phylogenetic comparisons indicate that M. lepromatosis is closer than M. leprae to the MRCA, and a Bayesian dating analysis suggests that they diverged from their MRCA approximately 13.9 Mya. Thus, despite their ancient separation, the two leprosy bacilli are remarkably conserved and still cause similar pathologic conditions. PMID:25831531
UFO: a web server for ultra-fast functional profiling of whole genome protein sequences.

PubMed

Meinicke, Peter

2009-09-02

Functional profiling is a key technique to characterize and compare the functional potential of entire genomes. The estimation of profiles according to an assignment of sequences to functional categories is a computationally expensive task because it requires the comparison of all protein sequences from a genome with a usually large database of annotated sequences or sequence families. Based on machine learning techniques for Pfam domain detection, the UFO web server for ultra-fast functional profiling allows researchers to process large protein sequence collections instantaneously. Besides the frequencies of Pfam and GO categories, the user also obtains the sequence specific assignments to Pfam domain families. In addition, a comparison with existing genomes provides dissimilarity scores with respect to 821 reference proteomes. Considering the underlying UFO domain detection, the results on 206 test genomes indicate a high sensitivity of the approach. In comparison with current state-of-the-art HMMs, the runtime measurements show a considerable speed up in the range of four orders of magnitude. For an average size prokaryotic genome, the computation of a functional profile together with its comparison typically requires about 10 seconds of processing time. For the first time the UFO web server makes it possible to get a quick overview on the functional inventory of newly sequenced organisms. The genome scale comparison with a large number of precomputed profiles allows a first guess about functionally related organisms. The service is freely available and does not require user registration or specification of a valid email address.
A Lossy Compression Technique Enabling Duplication-Aware Sequence Alignment

PubMed Central

Freschi, Valerio; Bogliolo, Alessandro

2012-01-01

In spite of the recognized importance of tandem duplications in genome evolution, commonly adopted sequence comparison algorithms do not take into account complex mutation events involving more than one residue at the time, since they are not compliant with the underlying assumption of statistical independence of adjacent residues. As a consequence, the presence of tandem repeats in sequences under comparison may impair the biological significance of the resulting alignment. Although solutions have been proposed, repeat-aware sequence alignment is still considered to be an open problem and new efficient and effective methods have been advocated. The present paper describes an alternative lossy compression scheme for genomic sequences which iteratively collapses repeats of increasing length. The resulting approximate representations do not contain tandem duplications, while retaining enough information for making their comparison even more significant than the edit distance between the original sequences. This allows us to exploit traditional alignment algorithms directly on the compressed sequences. Results confirm the validity of the proposed approach for the problem of duplication-aware sequence alignment. PMID:22518086
Isolation of Propionibacterium acnes among the microbiota of primary endodontic infections with and without intraoral communication.

PubMed

Niazi, Sadia Ambreen; Al Kharusi, Hana Suleiman; Patel, Shanon; Bruce, Kenneth; Beighton, David; Foschi, Federico; Mannocci, Francesco

2016-11-01

The presence of opportunistic pathogens such as Propionibacterium acnes (P. acnes) may contribute to the endodontic pathology. The presence of P. acnes may be influenced by different endodontic conditions. The aims of the study were firstly, to identify P. acnes within the whole cultivable microbiota of primary endodontic infections, to investigate which P. acnes phylotypes predominate in such infections and secondly to determine if the presence of an "open" communication (e.g. a sinus) can be associated with the isolation of P. acnes from the root canal. The predominant cultivable microbiota of 15 primary endodontic lesions (7 without communication with the oral environment and 8 with an open communication) were identified using partial 16S ribosomal RNA (rRNA) gene sequence analysis. The identification of the organism was determined by interrogating the Human Oral Microbiome Database. The P. acnes isolates were typed on the basis of the recA gene sequence comparison. A neighbor-joining tree was constructed using MEGA 4.1 with the inclusion of known recA sequences. There was no difference in the number of species identified from lesions without communication (5.86 ± 3.7) and those with communication (5.37 ± 3.6) (P > 0.05). PCR-based 16S rRNA gene sequencing revealed P. acnes as the most prevalent isolate recovered from lesions with communication. recA gene sequencing revealed two phylogenetic lineages present in lesion with communication, with mainly type I (further split into type IA and type IB) and type II. The presence of P. acnes as opportunistic pathogens has been confirmed and may sustain the traits observed in specific clinical presentations. Clinical management of open lesions may require further disinfection to eliminate opportunistic bacteria.
Determinants of Base-Pair Substitution Patterns Revealed by Whole-Genome Sequencing of DNA Mismatch Repair Defective Escherichia coli.

PubMed

Foster, Patricia L; Niccum, Brittany A; Popodi, Ellen; Townes, Jesse P; Lee, Heewook; MohammedIsmail, Wazim; Tang, Haixu

2018-06-15

Mismatch repair (MMR) is a major contributor to replication fidelity, but its impact varies with sequence context and the nature of the mismatch. Mutation accumulation experiments followed by whole-genome sequencing of MMR-defective E. coli strains yielded ≈30,000 base-pair substitutions, revealing mutational patterns across the entire chromosome. The base-pair substitution spectrum was dominated by A:T > G:C transitions, which occurred predominantly at the center base of 5'N A C3'+5'G T N3' triplets. Surprisingly, growth on minimal medium or at low temperature attenuated these mutations. Mononucleotide runs were also hotspots for base-pair substitutions, and the rate at which these occurred increased with run length. Comparison with ≈2000 base-pair substitutions accumulated in MMR-proficient strains revealed that both kinds of hotspots appeared in the wild-type spectrum and so are likely to be sites of frequent replication errors. In MMR-defective strains transitions were strand biased, occurring twice as often when A and C rather than T and G were on the lagging-strand template. Loss of nucleotide diphosphate kinase increases the cellular concentration of dCTP, which resulted in increased rates of mutations due to misinsertion of C opposite A and T. In an mmr ndk double mutant strain, these mutations were more frequent when the template A and T were on the leading strand, suggesting that lagging-strand synthesis was more error-prone or less well corrected by proofreading than was leading strand synthesis. Copyright © 2018, Genetics.
A first genetic map of date palm (Phoenix dactylifera) reveals long-range genome structure conservation in the palms.

PubMed

Mathew, Lisa S; Spannagl, Manuel; Al-Malki, Ameena; George, Binu; Torres, Maria F; Al-Dous, Eman K; Al-Azwani, Eman K; Hussein, Emad; Mathew, Sweety; Mayer, Klaus F X; Mohamoud, Yasmin Ali; Suhre, Karsten; Malek, Joel A

2014-04-15

The date palm is one of the oldest cultivated fruit trees. It is critical in many ways to cultures in arid lands by providing highly nutritious fruit while surviving extreme heat and environmental conditions. Despite its importance from antiquity, few genetic resources are available for improving the productivity and development of the dioecious date palm. To date there has been no genetic map and no sex chromosome has been identified. Here we present the first genetic map for date palm and identify the putative date palm sex chromosome. We placed ~4000 markers on the map using nearly 1200 framework markers spanning a total of 1293 cM. We have integrated the genetic map, derived from the Khalas cultivar, with the draft genome and placed up to 19% of the draft genome sequence scaffolds onto linkage groups for the first time. This analysis revealed approximately ~1.9 cM/Mb on the map. Comparison of the date palm linkage groups revealed significant long-range synteny to oil palm. Analysis of the date palm sex-determination region suggests it is telomeric on linkage group 12 and recombination is not suppressed in the full chromosome. Based on a modified genotyping-by-sequencing approach we have overcome challenges due to lack of genetic resources and provide the first genetic map for date palm. Combined with the recent draft genome sequence of the same cultivar, this resource offers a critical new tool for date palm biotechnology, palm comparative genomics and a better understanding of sex chromosome development in the palms.
Diversity and distribution of single-stranded DNA phages in the North Atlantic Ocean

PubMed Central

Tucker, Kimberly P; Parsons, Rachel; Symonds, Erin M; Breitbart, Mya

2011-01-01

Knowledge of marine phages is highly biased toward double-stranded DNA (dsDNA) phages; however, recent metagenomic surveys have also identified single-stranded DNA (ssDNA) phages in the oceans. Here, we describe two complete ssDNA phage genomes that were reconstructed from a viral metagenome from 80 m depth at the Bermuda Atlantic Time-series Study (BATS) site in the northwestern Sargasso Sea and examine their spatial and temporal distributions. Both genomes (SARssφ1 and SARssφ2) exhibited similarity to known phages of the Microviridae family in terms of size, GC content, genome organization and protein sequence. PCR amplification of the replication initiation protein (Rep) gene revealed narrow and distinct depth distributions for the newly described ssDNA phages within the upper 200 m of the water column at the BATS site. Comparison of Rep gene sequences obtained from the BATS site over time revealed changes in the diversity of ssDNA phages over monthly time scales, although some nearly identical sequences were recovered from samples collected 4 years apart. Examination of ssDNA phage diversity along transects through the North Atlantic Ocean revealed a positive correlation between genetic distance and geographic distance between sampling sites. Together, the data suggest fundamental differences between the distribution of these ssDNA phages and the distribution of known marine dsDNA phages, possibly because of differences in host range, host distribution, virion stability, or viral evolution mechanisms and rates. Future work needs to elucidate the host ranges for oceanic ssDNA phages and determine their ecological roles in the marine ecosystem. PMID:21124487
New powerful statistics for alignment-free sequence comparison under a pattern transfer model.

PubMed

Liu, Xuemei; Wan, Lin; Li, Jing; Reinert, Gesine; Waterman, Michael S; Sun, Fengzhu

2011-09-07

Alignment-free sequence comparison is widely used for comparing gene regulatory regions and for identifying horizontally transferred genes. Recent studies on the power of a widely used alignment-free comparison statistic D2 and its variants D*2 and D(s)2 showed that their power approximates a limit smaller than 1 as the sequence length tends to infinity under a pattern transfer model. We develop new alignment-free statistics based on D2, D*2 and D(s)2 by comparing local sequence pairs and then summing over all the local sequence pairs of certain length. We show that the new statistics are much more powerful than the corresponding statistics and the power tends to 1 as the sequence length tends to infinity under the pattern transfer model. Copyright © 2011 Elsevier Ltd. All rights reserved.

New Powerful Statistics for Alignment-free Sequence Comparison Under a Pattern Transfer Model

PubMed Central

Liu, Xuemei; Wan, Lin; Li, Jing; Reinert, Gesine; Waterman, Michael S.; Sun, Fengzhu

2011-01-01

Alignment-free sequence comparison is widely used for comparing gene regulatory regions and for identifying horizontally transferred genes. Recent studies on the power of a widely used alignment-free comparison statistic D2 and its variants D2∗ and D2s showed that their power approximates a limit smaller than 1 as the sequence length tends to infinity under a pattern transfer model. We develop new alignment-free statistics based on D2, D2∗ and D2s by comparing local sequence pairs and then summing over all the local sequence pairs of certain length. We show that the new statistics are much more powerful than the corresponding statistics and the power tends to 1 as the sequence length tends to infinity under the pattern transfer model. PMID:21723298
Multiple alignment-free sequence comparison

PubMed Central

Ren, Jie; Song, Kai; Sun, Fengzhu; Deng, Minghua; Reinert, Gesine

2013-01-01

Motivation: Recently, a range of new statistics have become available for the alignment-free comparison of two sequences based on k-tuple word content. Here, we extend these statistics to the simultaneous comparison of more than two sequences. Our suite of statistics contains, first, and , extensions of statistics for pairwise comparison of the joint k-tuple content of all the sequences, and second, , and , averages of sums of pairwise comparison statistics. The two tasks we consider are, first, to identify sequences that are similar to a set of target sequences, and, second, to measure the similarity within a set of sequences. Results: Our investigation uses both simulated data as well as cis-regulatory module data where the task is to identify cis-regulatory modules with similar transcription factor binding sites. We find that although for real data, all of our statistics show a similar performance, on simulated data the Shepp-type statistics are in some instances outperformed by star-type statistics. The multiple alignment-free statistics are more sensitive to contamination in the data than the pairwise average statistics. Availability: Our implementation of the five statistics is available as R package named ‘multiAlignFree’ at be http://www-rcf.usc.edu/∼fsun/Programs/multiAlignFree/multiAlignFreemain.html. Contact: reinert@stats.ox.ac.uk Supplementary information: Supplementary data are available at Bioinformatics online. PMID:23990418
Stress fields and energy of disclination-type defects in zones of localized elastic distortions

NASA Astrophysics Data System (ADS)

Sukhanov, Ivan I.; Tyumentsev, Alexander N.; Ditenberg, Ivan A.

2016-11-01

This paper studies theoretically the elastically deformed state and analyzes deformation mechanisms in nanocrystals in the zones of localized elastic distortions and related disclination-type defects, such as dipole, quadrupole and multipole of partial disclinations. Significant differences in the energies of quadrupole and multipole configurations in comparison with nanodipole are revealed. The mechanism of deformation localization in the field of elastic distortions is proposed, which is a quasi-periodic sequence of formation and relaxation of various disclination ensembles with a periodic change in the energy of the defect.
Comparison of the Exomes of Common Carp (Cyprinus carpio) and Zebrafish (Danio rerio)

PubMed Central

Henkel, Christiaan V.; Dirks, Ron P.; Jansen, Hans J.; Forlenza, Maria; Wiegertjes, Geert F.; Howe, Kerstin; van den Thillart, Guido E.E.J.M.

2012-01-01

Abstract Research on common carp, Cyprinus carpio, is beneficial for zebrafish research because of resources available owing to its large body size, such as the availability of sufficient organ material for transcriptomics, proteomics, and metabolomics. Here we describe the shot gun sequencing of a clonal double-haploid common carp line. The assembly consists of 511891 scaffolds with an N50 of 17 kb, predicting a total genome size of 1.4–1.5 Gb. A detailed analysis of the ten largest scaffolds indicates that the carp genome has a considerably lower repeat coverage than zebrafish, whilst the average intron size is significantly smaller, making it comparable to the fugu genome. The quality of the scaffolding was confirmed by comparisons with RNA deep sequencing data sets and a manual analysis for synteny with the zebrafish, especially the Hox gene clusters. In the ten largest scaffolds analyzed, the synteny of genes is almost complete. Comparisons of predicted exons of common carp with those of the zebrafish revealed only few genes specific for either zebrafish or carp, most of these being of unknown function. This supports the hypothesis of an additional genome duplication event in the carp evolutionary history, which—due to a higher degree of compactness—did not result in a genome larger than that of zebrafish. PMID:22715948
Individual Apostichopus japonicus fecal microbiome reveals a link with polyhydroxybutyrate producers in host growth gaps.

PubMed

Yamazaki, Yohei; Meirelles, Pedro Milet; Mino, Sayaka; Suda, Wataru; Oshima, Kenshiro; Hattori, Masahira; Thompson, Fabiano L; Sakai, Yuichi; Sawabe, Toko; Sawabe, Tomoo

2016-02-24

Gut microbiome shapes various aspects of a host's physiology, but these functions in aquatic animal hosts have yet to be fully investigated. The sea cucumber Apostichopus japonicus Selenka is one such example. The large growth gap in their body size has delayed the development of intensive aquaculture, nevertheless the species is in urgent need of conservation. To understand possible contributions of the gut microbiome to its host's growth, individual fecal microbiome comparisons were performed. High-throughput 16S rRNA sequencing revealed significantly different microbiota in larger and smaller individuals; Rhodobacterales in particular was the most significantly abundant bacterial group in the larger specimens. Further shotgun metagenome of representative samples revealed a significant abundance of microbiome retaining polyhydroxybutyrate (PHB) metabolism genes in the largest individual. The PHB metabolism reads were potentially derived from Rhodobacterales. These results imply a possible link between microbial PHB producers and potential growth promotion in Deuterostomia marine invertebrates.
Hydra meiosis reveals unexpected conservation of structural synaptonemal complex proteins across metazoans.

PubMed

Fraune, Johanna; Alsheimer, Manfred; Volff, Jean-Nicolas; Busch, Karoline; Fraune, Sebastian; Bosch, Thomas C G; Benavente, Ricardo

2012-10-09

The synaptonemal complex (SC) is a key structure of meiosis, mediating the stable pairing (synapsis) of homologous chromosomes during prophase I. Its remarkable tripartite structure is evolutionarily well conserved and can be found in almost all sexually reproducing organisms. However, comparison of the different SC protein components in the common meiosis model organisms Saccharomyces cerevisiae, Arabidopsis thaliana, Caenorhabditis elegans, Drosophila melanogaster, and Mus musculus revealed no sequence homology. This discrepancy challenged the hypothesis that the SC arose only once in evolution. To pursue this matter we focused on the evolution of SYCP1 and SYCP3, the two major structural SC proteins of mammals. Remarkably, our comparative bioinformatic and expression studies revealed that SYCP1 and SYCP3 are also components of the SC in the basal metazoan Hydra. In contrast to previous assumptions, we therefore conclude that SYCP1 and SYCP3 form monophyletic groups of orthologous proteins across metazoans.
Hydra meiosis reveals unexpected conservation of structural synaptonemal complex proteins across metazoans

PubMed Central

Fraune, Johanna; Alsheimer, Manfred; Volff, Jean-Nicolas; Busch, Karoline; Fraune, Sebastian; Bosch, Thomas C. G.; Benavente, Ricardo

2012-01-01

The synaptonemal complex (SC) is a key structure of meiosis, mediating the stable pairing (synapsis) of homologous chromosomes during prophase I. Its remarkable tripartite structure is evolutionarily well conserved and can be found in almost all sexually reproducing organisms. However, comparison of the different SC protein components in the common meiosis model organisms Saccharomyces cerevisiae, Arabidopsis thaliana, Caenorhabditis elegans, Drosophila melanogaster, and Mus musculus revealed no sequence homology. This discrepancy challenged the hypothesis that the SC arose only once in evolution. To pursue this matter we focused on the evolution of SYCP1 and SYCP3, the two major structural SC proteins of mammals. Remarkably, our comparative bioinformatic and expression studies revealed that SYCP1 and SYCP3 are also components of the SC in the basal metazoan Hydra. In contrast to previous assumptions, we therefore conclude that SYCP1 and SYCP3 form monophyletic groups of orthologous proteins across metazoans. PMID:23012415
Molecular characterisation of Sarcocystis lutrae n. sp. and Toxoplasma gondii from the musculature of two Eurasian otters (Lutra lutra) in Norway.

PubMed

Gjerde, Bjørn; Josefsen, Terje D

2015-03-01

Sarcocysts were detected in routinely processed histological sections of skeletal muscle, but not cardiac muscle, of two adult male otters (Lutra lutra; Mustelidae) from northern Norway following their post-mortem examination in 1999 and 2000. The sarcocysts were slender, spindle-shaped, up to 970 μm long and 35-70 μm in greatest diameter. The sarcocyst wall was thin (∼ 0.5 μm) and smooth with no visible protrusions. Portions of unfixed diaphragm of both animals were collected at the autopsies and kept frozen for about 14 years pending further examination. When the study was resumed in 2013, the thawed muscle samples were examined for sarcocysts under a stereo microscope, but none could be found. Genomic DNA was therefore extracted from a total of 36 small pieces of the diaphragm from both otters, and samples found to contain Sarcocystidae DNA were used selectively for PCR amplification and sequencing of the nuclear 18S and 28S ribosomal (r) RNA genes and internal transcribed spacer 1 (ITS1) region, as well as the mitochondrial cytochrome b (cytb) and cytochrome c oxidase subunit 1 (cox1) genes. Sequence comparisons revealed that both otters were infected by the same Sarcocystis sp. and that there was no genetic variation (100 % identity) among sequenced isolates at the 18S and 28S rRNA genes (six identical isolates at both loci) or at cox1 (13 identical isolates). PCR products comprising the ITS1 region, on the other hand, had to be cloned before sequencing due to intraspecific sequence variation. A total of 33 clones were sequenced, and the identities between them were 97.9-99.9 %. These sequences were most similar (93.7-96.0 % identity) to a sequence of Sarcocystis kalvikus from the wolverine in Canada, but the phylogenetic analyses placed all of them as a monophyletic sister group to S. kalvikus. Hence, they were considered to represent a novel species, which was named Sarcocystis lutrae. Sequence comparisons and phylogenetic analyses based on sequences of the 18S and 28S rRNA genes and cox1, for which little or no sequence data were available for S. kalvikus, revealed that S. lutrae otherwise was most closely related to various Sarcocystis spp. using birds or carnivores as intermediate hosts. The cox1 sequences of S. lutrae from the otters were identical to two sequences from an arctic fox, which in a previous study had been assigned to Sarcocystis arctica due to a high identity (99.4 %) with the latter species at this gene and a complete identity with S. arctica at three other loci when using the same DNA samples as templates for PCR reactions. Additional PCR amplifications and sequencing of cox1 (ten sequences) and the ITS1 region (four sequences) using four DNA samples from this fox as templates again generated cox1 sequences exclusively of S. lutrae, but ITS1 sequences of S. arctica, and thus confirmed that this arctic fox had acted as intermediate host for both S. arctica and S. lutrae. Based on the phylogenetic placement of S. lutrae, the geographical location of infected animals (otters, arctic fox) and the distribution of carnivores/raptors which may have interacted with them, the white-tailed eagle (Haliaeetus albicilla) seems to be a possible definitive host of S. lutrae. Some of the muscle samples from both otters were shown to harbour stages of Toxoplasma gondii through PCR amplification and sequencing of the entire ITS1 region (five isolates) and/or the partial cytb (eight isolates) and cox1 (one isolate). These sequences were identical to several previous sequences of T. gondii in GenBank. Thus, both otters had a dual infection with S. lutrae and T. gondii.
Novel Hepatozoon in vertebrates from the southern United States.

PubMed

Allen, Kelly E; Yabsley, Michael J; Johnson, Eileen M; Reichard, Mason V; Panciera, Roger J; Ewing, Sidney A; Little, Susan E

2011-08-01

Novel Hepatozoon spp. sequences collected from previously unrecognized vertebrate hosts in North America were compared with documented Hepatozoon 18S rRNA sequences in an effort to examine phylogenetic relationships between the different Hepatozoon organisms found cycling in nature. An approximately 500-base pair fragment of 18S rDNA common to Hepatozoon spp. and some other apicomplexans was amplified and sequenced from the tissues or blood of 16 vertebrate host species from the southern United States, including 1 opossum (Didelphis virginiana), 2 bobcats (Lynx rufus), 1 domestic cat (Felis catus), 3 coyotes (Canis latrans), 1 gray fox (Urocyon cinereoargenteus), 4 raccoons (Procyon lotor), 1 pet boa constrictor (Boa constrictor imperator), 1 swamp rabbit (Sylvilagus aquaticus), 1 cottontail rabbit (Sylvilagus floridanus), 4 woodrats (Neotoma fuscipes and Neotoma micropus), 3 white-footed mice (Peromyscus leucopus), 8 cotton rats (Sigmodon hispidus), 1 cotton mouse (Peromyscus gossypinus), 1 eastern grey squirrel (Sciurus carolinensis), and 1 woodchuck (Marmota monax). Phylogenetic analyses and comparison with sequences in the existing database revealed distinct groups of Hepatozoon spp., with clusters formed by sequences obtained from scavengers and carnivores (opossum, raccoons, canids, and felids) and those obtained from rodents. Surprisingly, Hepatozoon spp. sequences from wild rabbits were most closely related to sequences obtained from carnivores (97.2% identical), and the sequence from the boa constrictor was most closely related to the rodent cluster (97.4% identical). These data are consistent with recent work identifying prey-predator transmission cycles in Hepatozoon spp. and suggest this pattern may be more common than previously recognized.
Genetic diversity of the DBLalpha region in Plasmodium falciparum var genes among Asia-Pacific isolates.

PubMed

Fowler, Elizabeth V; Peters, Jennifer M; Gatton, Michelle L; Chen, Nanhua; Cheng, Qin

2002-03-01

In Plasmodium falciparum a highly polymorphic multi-copy gene family, var, encodes the variant surface antigen P. falciparum erythrocyte membrane protein 1 (PfEMP1), which has an important role in cytoadherence and immune evasion. Using previously described universal PCR primers for the first Duffy binding-like domain (DBLalpha) of var we analysed the DBLalpha repertoires of Dd2 (originally from Thailand) and eight isolates from the Solomon Islands (n=4), Philippines (n=2), Papua New Guinea (n=1) and Africa (n=1). We found 15-32 unique DBLalpha sequence types among these isolates and estimated detectable DBLalpha repertoire sizes ranging from 33-38 to 52-57 copies per genome. Our data suggest that var gene repertoires generally consist of 40-50 copies per genome. Eighteen DBLalpha sequences appeared in more than one Asia-Pacific isolate with the number of sequences shared between any two isolates ranging from 0 to 6 (mean=2.0 +/-1.6). At the amino acid level DBLalpha sequence similarity within isolates ranged from 45.2 +/- 7.1 to 50.2 +/- 6.9%, and was not significantly different from the DBLalpha amino acid sequence similarity among isolates (P>0.1). Comparisons with published sequences also revealed little overlap among DBLalpha sequences from different regions. High DBLalpha sequence diversity and minimal overlap among these isolates suggest that the global var gene repertoire is immense, and may potentially be selected for by the host's protective immune response to the var gene products, PfEMP1.
Sequence and Characterization of the Ig Heavy Chain Constant and Partial Variable Region of the Mouse Strain 129S11

PubMed Central

Retter, Ida; Chevillard, Christophe; Scharfe, Maren; Conrad, Ansgar; Hafner, Martin; Im, Tschong-Hun; Ludewig, Monika; Nordsiek, Gabriele; Severitt, Simone; Thies, Stephanie; Mauhar, America; Blöcker, Helmut; Müller, Werner; Riblet, Roy

2009-01-01

Although the entire mouse genome has been sequenced, there remain challenges concerning the elucidation of particular complex and polymorphic genomic loci. In the murine Igh locus, different haplotypes exist in different inbred mouse strains. For example, the Ighb haplotype sequence of the Mouse Genome Project strain C57BL/6 differs considerably from the Igha haplotype of BALB/c, which has been widely used in the analyses of Ab responses. We have sequenced and annotated the 3′ half of the Igha locus of 129S1/SvImJ, covering the CH region and approximately half of the VH region. This sequence comprises 128 VH genes, of which 49 are judged to be functional. The comparison of the Igha sequence with the homologous Ighb region from C57BL/6 revealed two major expansions in the germline repertoire of Igha. In addition, we found smaller haplotype-specific differences like the duplication of five VH genes in the Igha locus. We generated a VH allele table by comparing the individual VH genes of both haplotypes. Surprisingly, the number and position of DH genes in the 129S1 strain differs not only from the sequence of C57BL/6 but also from the map published for BALB/c. Taken together, the contiguous genomic sequence of the 3′ part of the Igha locus allows a detailed view of the recent evolution of this highly dynamic locus in the mouse. PMID:17675503
The genome and phenome of the green alga Chloroidium sp. UTEX 3007 reveal adaptive traits for desert acclimatization

PubMed Central

Nelson, David R; Khraiwesh, Basel; Fu, Weiqi; Alseekh, Saleh; Jaiswal, Ashish; Chaiboonchoe, Amphun; Hazzouri, Khaled M; O’Connor, Matthew J; Butterfoss, Glenn L; Drou, Nizar; Rowe, Jillian D; Harb, Jamil; Fernie, Alisdair R; Gunsalus, Kristin C; Salehi-Ashtiani, Kourosh

2017-01-01

To investigate the phenomic and genomic traits that allow green algae to survive in deserts, we characterized a ubiquitous species, Chloroidium sp. UTEX 3007, which we isolated from multiple locations in the United Arab Emirates (UAE). Metabolomic analyses of Chloroidium sp. UTEX 3007 indicated that the alga accumulates a broad range of carbon sources, including several desiccation tolerance-promoting sugars and unusually large stores of palmitate. Growth assays revealed capacities to grow in salinities from zero to 60 g/L and to grow heterotrophically on >40 distinct carbon sources. Assembly and annotation of genomic reads yielded a 52.5 Mbp genome with 8153 functionally annotated genes. Comparison with other sequenced green algae revealed unique protein families involved in osmotic stress tolerance and saccharide metabolism that support phenomic studies. Our results reveal the robust and flexible biology utilized by a green alga to successfully inhabit a desert coastline. DOI: http://dx.doi.org/10.7554/eLife.25783.001 PMID:28623667
Discriminative prediction of mammalian enhancers from DNA sequence

PubMed Central

Lee, Dongwon; Karchin, Rachel; Beer, Michael A.

2011-01-01

Accurately predicting regulatory sequences and enhancers in entire genomes is an important but difficult problem, especially in large vertebrate genomes. With the advent of ChIP-seq technology, experimental detection of genome-wide EP300/CREBBP bound regions provides a powerful platform to develop predictive tools for regulatory sequences and to study their sequence properties. Here, we develop a support vector machine (SVM) framework which can accurately identify EP300-bound enhancers using only genomic sequence and an unbiased set of general sequence features. Moreover, we find that the predictive sequence features identified by the SVM classifier reveal biologically relevant sequence elements enriched in the enhancers, but we also identify other features that are significantly depleted in enhancers. The predictive sequence features are evolutionarily conserved and spatially clustered, providing further support of their functional significance. Although our SVM is trained on experimental data, we also predict novel enhancers and show that these putative enhancers are significantly enriched in both ChIP-seq signal and DNase I hypersensitivity signal in the mouse brain and are located near relevant genes. Finally, we present results of comparisons between other EP300/CREBBP data sets using our SVM and uncover sequence elements enriched and/or depleted in the different classes of enhancers. Many of these sequence features play a role in specifying tissue-specific or developmental-stage-specific enhancer activity, but our results indicate that some features operate in a general or tissue-independent manner. In addition to providing a high confidence list of enhancer targets for subsequent experimental investigation, these results contribute to our understanding of the general sequence structure of vertebrate enhancers. PMID:21875935
Analysis of the complete genome of the first Irkut virus isolate from China: comparison across the Lyssavirus genus.

PubMed

Liu, Ye; Li, Nan; Zhang, Shoufeng; Zhang, Fei; Lian, Hai; Wang, Ying; Zhang, Jinxia; Hu, Rongliang

2013-12-01

The genome of Irkut virus, isolate IRKV-THChina12, the first non-rabies lyssavirus from China (of bat origin), has been completely sequenced. In general, coding and non-coding regions of this viral genome are similar to those of other lyssaviruses. However, alignment of the deduced amino acid sequences of the structural proteins of IRKV-THChina12 with those of other lyssavirus representatives revealed significant variability between viral species. The nucleoprotein and matrix protein were found to be the most conserved, followed by the large protein, glycoprotein and phosphoprotein. Differences in the antigenic sites in glycoprotein may result in only partial protection of the available rabies biologics against Irkut virus, which is of particular concern for pre- and post-exposure rabies prophylaxis. Copyright © 2013 Elsevier Inc. All rights reserved.
Enterocin 96, a Novel Class II Bacteriocin Produced by Enterococcus faecalis WHE 96, Isolated from Munster Cheese▿

PubMed Central

Izquierdo, Esther; Wagner, Camille; Marchioni, Eric; Aoude-Werner, Dalal; Ennahar, Saïd

2009-01-01

Enterococcus faecalis WHE 96, a strain isolated from soft cheese based on its anti-Listeria activity, produced a 5,494-Da bacteriocin that was purified to homogeneity by ultrafiltration and cation-exchange and reversed-phase chromatographies. The amino acid sequence of this bacteriocin, named enterocin 96, was determined by Edman degradation, and its structural gene was sequenced, revealing a double-glycine leader peptide. After a comparison with other bacteriocins, it was shown that enterocin 96 was a new class II bacteriocin that showed very little similarity with known structures. Enterocin 96 was indeed a new bacteriocin belonging to class II bacteriocins. The activity spectrum of enterocin 96 covered a wide range of bacteria, with strong activity against most gram-positive strains but very little or no activity against gram-negative strains. PMID:19411428
Enterocin 96, a novel class II bacteriocin produced by Enterococcus faecalis WHE 96, isolated from Munster cheese.

PubMed

Izquierdo, Esther; Wagner, Camille; Marchioni, Eric; Aoude-Werner, Dalal; Ennahar, Saïd

2009-07-01

Enterococcus faecalis WHE 96, a strain isolated from soft cheese based on its anti-Listeria activity, produced a 5,494-Da bacteriocin that was purified to homogeneity by ultrafiltration and cation-exchange and reversed-phase chromatographies. The amino acid sequence of this bacteriocin, named enterocin 96, was determined by Edman degradation, and its structural gene was sequenced, revealing a double-glycine leader peptide. After a comparison with other bacteriocins, it was shown that enterocin 96 was a new class II bacteriocin that showed very little similarity with known structures. Enterocin 96 was indeed a new bacteriocin belonging to class II bacteriocins. The activity spectrum of enterocin 96 covered a wide range of bacteria, with strong activity against most gram-positive strains but very little or no activity against gram-negative strains.
Molecular analysis of ependymins from the cerebrospinal fluid of the orders Clupeiformes and Salmoniformes: no indication for the existence of an euteleost infradivision.

PubMed

Müller-Schmid, A; Ganss, B; Gorr, T; Hoffmann, W

1993-06-01

Ependymins represent the predominant protein constituents in the cerebrospinal fluid of many teleost fish and they are synthesized in meningeal fibroblasts. Here, we present the ependymin sequences from the herring (Clupea harengus) and the pike (Esox lucius). A comparison of ependymin homologous sequences from three different orders of teleost fish (Salmoniformes, Cypriniformes, and Clupeiformes) revealed the highest similarity between Clupeiformes and Cypriniformes. This result is unexpected because it does not reflect current systematics, in which Clupeiformes belong to a separate infradivision (Clupeomorpha) than Salmoniformes and Cypriniformes (Euteleostei). Furthermore, in Salmoniformes the evolutionary rate of ependymins seems to be accelerated mainly on the protein level. However, considering these inconstant rates, neither neighbor-joining trees nor DNA parsimony methods gave any indication that a separate euteleost infradivision exists.
Re-evaluating microglia expression profiles using RiboTag and cell isolation strategies.

PubMed

Haimon, Zhana; Volaski, Alon; Orthgiess, Johannes; Boura-Halfon, Sigalit; Varol, Diana; Shemer, Anat; Yona, Simon; Zuckerman, Binyamin; David, Eyal; Chappell-Maor, Louise; Bechmann, Ingo; Gericke, Martin; Ulitsky, Igor; Jung, Steffen

2018-06-01

Transcriptome profiling is widely used to infer functional states of specific cell types, as well as their responses to stimuli, to define contributions to physiology and pathophysiology. Focusing on microglia, the brain's macrophages, we report here a side-by-side comparison of classical cell-sorting-based transcriptome sequencing and the 'RiboTag' method, which avoids cell retrieval from tissue context and yields translatome sequencing information. Conventional whole-cell microglial transcriptomes were found to be significantly tainted by artifacts introduced by tissue dissociation, cargo contamination and transcripts sequestered from ribosomes. Conversely, our data highlight the added value of RiboTag profiling for assessing the lineage accuracy of Cre recombinase expression in transgenic mice. Collectively, this study indicates method-based biases, reveals observer effects and establishes RiboTag-based translatome profiling as a valuable complement to standard sorting-based profiling strategies.
Molecular characterization and expression of the M6 gene of grass carp hemorrhage virus (GCHV), an aquareovirus.

PubMed

Qiu, T; Lu, R H; Zhang, J; Zhu, Z Y

2001-07-01

The complete nucleotide sequence of M6 gene of grass carp hemorrhage virus (GCHV) was determined. It is 2039 nucleotides in length and contains a single large open reading frame that could encode a protein of 648 amino acids with predicted molecular mass of 68.7 kDa. Amino acid sequence comparison revealed that the protein encoded by GCHV M6 is closely related to the protein mu1 of mammalian reovirus. The M6 gene, encoding the major outer-capsid protein, was expressed using the pET fusion protein vector in Escherichia coli and detected by Western blotting using chicken anti-GCHV immunoglobulin (IgY). The result indicates that the protein encoded by M6 may share a putative Asn-42-Pro-43 proteolytic cleavage site with mu1.
Klebsormidium flaccidum genome reveals primary factors for plant terrestrial adaptation

PubMed Central

Hori, Koichi; Maruyama, Fumito; Fujisawa, Takatomo; Togashi, Tomoaki; Yamamoto, Nozomi; Seo, Mitsunori; Sato, Syusei; Yamada, Takuji; Mori, Hiroshi; Tajima, Naoyuki; Moriyama, Takashi; Ikeuchi, Masahiko; Watanabe, Mai; Wada, Hajime; Kobayashi, Koichi; Saito, Masakazu; Masuda, Tatsuru; Sasaki-Sekimoto, Yuko; Mashiguchi, Kiyoshi; Awai, Koichiro; Shimojima, Mie; Masuda, Shinji; Iwai, Masako; Nobusawa, Takashi; Narise, Takafumi; Kondo, Satoshi; Saito, Hikaru; Sato, Ryoichi; Murakawa, Masato; Ihara, Yuta; Oshima-Yamada, Yui; Ohtaka, Kinuka; Satoh, Masanori; Sonobe, Kohei; Ishii, Midori; Ohtani, Ryosuke; Kanamori-Sato, Miyu; Honoki, Rina; Miyazaki, Daichi; Mochizuki, Hitoshi; Umetsu, Jumpei; Higashi, Kouichi; Shibata, Daisuke; Kamiya, Yuji; Sato, Naoki; Nakamura, Yasukazu; Tabata, Satoshi; Ida, Shigeru; Kurokawa, Ken; Ohta, Hiroyuki

2014-01-01

The colonization of land by plants was a key event in the evolution of life. Here we report the draft genome sequence of the filamentous terrestrial alga Klebsormidium flaccidum (Division Charophyta, Order Klebsormidiales) to elucidate the early transition step from aquatic algae to land plants. Comparison of the genome sequence with that of other algae and land plants demonstrate that K. flaccidum acquired many genes specific to land plants. We demonstrate that K. flaccidum indeed produces several plant hormones and homologues of some of the signalling intermediates required for hormone actions in higher plants. The K. flaccidum genome also encodes a primitive system to protect against the harmful effects of high-intensity light. The presence of these plant-related systems in K. flaccidum suggests that, during evolution, this alga acquired the fundamental machinery required for adaptation to terrestrial environments. PMID:24865297

Complete mitochondrial genome of Xingguo red carp (Cyprinus carpio var. singuonensis) and purse red carp (Cyprinus carpio var. wuyuanensis).

PubMed

Hu, Guang-Fu; Liu, Xiang-Jiang; Li, Zhong; Liang, Hong-Wei; Hu, Shao-Na; Zou, Gui-Wei

2016-01-01

The complete mitochondrial genomes of Xingguo red carp (Cyprinus carpio var. singuonensis) and purse red carp (Cyprinus carpio var. wuyuanensis) were sequenced. Comparison of these two mitochondrial genomes revealed that the mtDNAs of these two common carp varieties were remarkably similar in genome length, gene order and content, and AT content. However, size variation between these two mitochondrial genomes presented here showed 39 site differences in overall length. About 2 site differences were located in rRNAs, 3 in tRNAs, 3 in the control region, 31 in protein-coding genes. Thirty-one variable bases in the protein-coding regions between the two varieties mitochondrial sequences led to three variable amino acids, which were mainly located in the protein ND5 and ND4.
Genome sequence analyses of two isolates from the recent Escherichia coli outbreak in Germany reveal the emergence of a new pathotype: Entero-Aggregative-Haemorrhagic Escherichia coli (EAHEC).

PubMed

Brzuszkiewicz, Elzbieta; Thürmer, Andrea; Schuldes, Jörg; Leimbach, Andreas; Liesegang, Heiko; Meyer, Frauke-Dorothee; Boelter, Jürgen; Petersen, Heiko; Gottschalk, Gerhard; Daniel, Rolf

2011-12-01

The genome sequences of two Escherichia coli O104:H4 strains derived from two different patients of the 2011 German E. coli outbreak were determined. The two analyzed strains were designated E. coli GOS1 and GOS2 (German outbreak strain). Both isolates comprise one chromosome of approximately 5.31 Mbp and two putative plasmids. Comparisons of the 5,217 (GOS1) and 5,224 (GOS2) predicted protein-encoding genes with various E. coli strains, and a multilocus sequence typing analysis revealed that the isolates were most similar to the entero-aggregative E. coli (EAEC) strain 55989. In addition, one of the putative plasmids of the outbreak strain is similar to pAA-type plasmids of EAEC strains, which contain aggregative adhesion fimbrial operons. The second putative plasmid harbors genes for extended-spectrum β-lactamases. This type of plasmid is widely distributed in pathogenic E. coli strains. A significant difference of the E. coli GOS1 and GOS2 genomes to those of EAEC strains is the presence of a prophage encoding the Shiga toxin, which is characteristic for enterohemorrhagic E. coli (EHEC) strains. The unique combination of genomic features of the German outbreak strain, containing characteristics from pathotypes EAEC and EHEC, suggested that it represents a new pathotype Entero-Aggregative-Haemorrhagic E scherichia c oli (EAHEC).
Mung bean proteins and peptides: nutritional, functional and bioactive properties.

PubMed

Yi-Shen, Zhu; Shuai, Sun; FitzGerald, Richard

2018-01-01

To date, no extensive literature review exists regarding potential uses of mung bean proteins and peptides. As mung bean has long been widely used as a food source, early studies evaluated mung bean nutritional value against the Food and Agriculture Organization of the United Nations (FAO)/the World Health Organization (WHO) amino acids dietary recommendations. The comparison demonstrated mung bean to be a good protein source, except for deficiencies in sulphur-containing amino acids, methionine and cysteine. Methionine and cysteine residues have been introduced into the 8S globulin through protein engineering technology. Subsequently, purified mung bean proteins and peptides have facilitated the study of their structural and functional properties. Two main types of extraction methods have been reported for isolation of proteins and peptides from mung bean flours, permitting sequencing of major proteins present in mung bean, including albumins and globulins (notably 8S globulin). However, the sequence for albumin deposited in the UniProt database differs from other sequences reported in the literature. Meanwhile, a limited number of reports have revealed other useful bioactivities for proteins and hydrolysed peptides, including angiotensin-converting enzyme inhibitory activity, anti-fungal activity and trypsin inhibitory activity. Consequently, several mung bean hydrolysed peptides have served as effective food additives to prevent proteolysis during storage. Ultimately, further research will reveal other nutritional, functional and bioactive properties of mung bean for uses in diverse applications.
Novel Curvularia species from clinical specimens.

PubMed

Madrid, H; da Cunha, K C; Gené, J; Dijksterhuis, J; Cano, J; Sutton, D A; Guarro, J; Crous, P W

2014-12-01

The fungal genus Curvularia includes numerous plant pathogens and some emerging opportunistic pathogens of humans. In a previous study we used morphology and sequences of the nuclear ribosomal internal transcribed spacer region (ITS) and the glyceraldehyde-3-phosphate dehydrogenase (gpd) gene to identify species within a set of 99 clinical Curvularia isolates from the USA. Seventy-two isolates could be identified while the remaining 27 isolates belonged in three unclassified clades that were tentatively labelled Curvularia sp. I, II and III. In the present study, we further assess the taxonomic placement of these isolates using sequences of ITS, gpd, the large subunit rDNA, and the second largest subunit of RNA polymerase II. DNA sequence comparisons with a set of 87 isolates representing 33 Curvularia spp. and members of the closely-related genera Bipolaris and Exserohilum revealed that Curvularia sp. I, II and III represent novel lineages in Curvularia. These lineages are morphologically different from the currently accepted species. In the phylogenetic tree, Curvularia sp. I and sp. III were each split into two distinct lineages. Morphology and phylogeny supported the proposal of five new species, to be named C. americana, C. chlamydospora, C. hominis, C. muehlenbeckiae and C. pseudolunata. The concatenated 4-locus phylogeny revealed the existence of six clades in Curvularia, which are associated with particular morphological features. They were named after representative species, namely americana, eragrostidis, hominis, lunata, spicifera and trifolii.
Mung bean proteins and peptides: nutritional, functional and bioactive properties

PubMed Central

Yi-Shen, Zhu; Shuai, Sun; FitzGerald, Richard

2018-01-01

To date, no extensive literature review exists regarding potential uses of mung bean proteins and peptides. As mung bean has long been widely used as a food source, early studies evaluated mung bean nutritional value against the Food and Agriculture Organization of the United Nations (FAO)/the World Health Organization (WHO) amino acids dietary recommendations. The comparison demonstrated mung bean to be a good protein source, except for deficiencies in sulphur-containing amino acids, methionine and cysteine. Methionine and cysteine residues have been introduced into the 8S globulin through protein engineering technology. Subsequently, purified mung bean proteins and peptides have facilitated the study of their structural and functional properties. Two main types of extraction methods have been reported for isolation of proteins and peptides from mung bean flours, permitting sequencing of major proteins present in mung bean, including albumins and globulins (notably 8S globulin). However, the sequence for albumin deposited in the UniProt database differs from other sequences reported in the literature. Meanwhile, a limited number of reports have revealed other useful bioactivities for proteins and hydrolysed peptides, including angiotensin-converting enzyme inhibitory activity, anti-fungal activity and trypsin inhibitory activity. Consequently, several mung bean hydrolysed peptides have served as effective food additives to prevent proteolysis during storage. Ultimately, further research will reveal other nutritional, functional and bioactive properties of mung bean for uses in diverse applications. PMID:29545737
Genome Comparison of Candida orthopsilosis Clinical Strains Reveals the Existence of Hybrids between Two Distinct Subspecies

PubMed Central

Pryszcz, Leszek P.; Németh, Tibor; Gácser, Attila; Gabaldón, Toni

2014-01-01

The Candida parapsilosis species complex comprises a group of emerging human pathogens of varying virulence. This complex was recently subdivided into three different species: C. parapsilosis sensu stricto, C. metapsilosis, and C. orthopsilosis. Within the latter, at least two clearly distinct subspecies seem to be present among clinical isolates (Type 1 and Type 2). To gain insight into the genomic differences between these subspecies, we undertook the sequencing of a clinical isolate classified as Type 1 and compared it with the available sequence of a Type 2 clinical strain. Unexpectedly, the analysis of the newly sequenced strain revealed a highly heterozygous genome, which we show to be the consequence of a hybridization event between both identified subspecies. This implicitly suggests that C. orthopsilosis is able to mate, a so-far unanswered question. The resulting hybrid shows a chimeric genome that maintains a similar gene dosage from both parental lineages and displays ongoing loss of heterozygosity. Several of the differences found between the gene content in both strains relate to virulent-related families, with the hybrid strain presenting a higher copy number of genes coding for efflux pumps or secreted lipases. Remarkably, two clinical strains isolated from distant geographical locations (Texas and Singapore) are descendants of the same hybrid line, raising the intriguing possibility of a relationship between the hybridization event and the global spread of a virulent clone. PMID:24747362
Phylogenetic shadowing of primate sequences to find functional regions of the human genome.

PubMed

Boffelli, Dario; McAuliffe, Jon; Ovcharenko, Dmitriy; Lewis, Keith D; Ovcharenko, Ivan; Pachter, Lior; Rubin, Edward M

2003-02-28

Nonhuman primates represent the most relevant model organisms to understand the biology of Homo sapiens. The recent divergence and associated overall sequence conservation between individual members of this taxon have nonetheless largely precluded the use of primates in comparative sequence studies. We used sequence comparisons of an extensive set of Old World and New World monkeys and hominoids to identify functional regions in the human genome. Analysis of these data enabled the discovery of primate-specific gene regulatory elements and the demarcation of the exons of multiple genes. Much of the information content of the comprehensive primate sequence comparisons could be captured with a small subset of phylogenetically close primates. These results demonstrate the utility of intraprimate sequence comparisons to discover common mammalian as well as primate-specific functional elements in the human genome, which are unattainable through the evaluation of more evolutionarily distant species.
Evolution of toll-like receptors in the context of terrestrial ungulates and cetaceans diversification.

PubMed

Ishengoma, Edson; Agaba, Morris

2017-02-16

Toll-like receptors (TLRs) are the frontline actors in the innate immune response to various pathogens and are expected to be targets of natural selection in species adapted to habitats with contrasting pathogen burdens. The recent publication of genome sequences of giraffe and okapi together afforded the opportunity to examine the evolution of selected TLRs in broad range of terrestrial ungulates and cetaceans during their complex habitat diversification. Through direct sequence comparisons and standard evolutionary approaches, the extent of nucleotide and protein sequence diversity in seven Toll-like receptors (TLR2, TLR3, TLR4, TLR5, TLR7, TLR9 and TLR10) between giraffe and closely related species was determined. In addition, comparison of the patterning of key TLR motifs and domains between giraffe and related species was performed. The quantification of selection pressure and divergence on TLRs among terrestrial ungulates and cetaceans was also performed. Sequence analysis shows that giraffe has 94-99% nucleotide identity with okapi and cattle for all TLRs analyzed. Variations in the number of Leucine-rich repeats were observed in some of TLRs between giraffe, okapi and cattle. Patterning of key TLR domains did not reveal any significant differences in the domain architecture among giraffe, okapi and cattle. Molecular evolutionary analysis for selection pressure identifies positive selection on key sites for all TLRs examined suggesting that pervasive evolutionary pressure has taken place during the evolution of terrestrial ungulates and cetaceans. Analysis of positively selected sites showed some site to be part of Leucine-rich motifs suggesting functional relevance in species-specific recognition of pathogen associated molecular patterns. Notably, clade analysis reveals significant selection divergence between terrestrial ungulates and cetaceans in viral sensing TLR3. Mapping of giraffe TLR3 key substitutions to the structure of the receptor indicates that at least one of giraffe altered sites coincides with TLR3 residue known to play a critical role in receptor signaling activity. There is overall structural conservation in TLRs among giraffe, okapi and cattle indicating that the mechanism for innate immune response utilizing TLR pathways may not have changed very much during the evolution of these species. However, a broader phylogenetic analysis revealed signatures of adaptive evolution among terrestrial ungulates and cetaceans, including the observed selection divergence in TLR3. This suggests that long term ecological dynamics has led to species-specific innovation and functional variation in the mechanisms mediating innate immunity in terrestrial ungulates and cetaceans.
Saccharothrix ghardaiensis sp. nov., an actinobacterium isolated from Saharan soil.

PubMed

Bouznada, Khaoula; Bouras, Noureddine; Mokrane, Salim; Chaabane Chaouch, Fawzia; Zitouni, Abdelghani; Pötter, Gabriele; Spröer, Cathrin; Klenk, Hans-Peter; Sabaou, Nasserdine

2017-03-01

The taxonomic position of a new Saccharothrix strain, designated MB46 T , isolated from a Saharan soil sample collected in Mzab region (Ghardaïa province, South Algeria) was established following a polyphasic approach. The novel microorganism has morphological and chemical characteristics typical of the members of the genus Saccharothrix and formed a phyletic line at the periphery of the Saccharothrix espanaensis subcluster in the 16S rRNA gene dendrograms. Results of the 16S rRNA gene sequence comparisons revealed that strain MB46 T shares high degrees of similarity with S. espanaensis DSM 44229 T (99.2%), Saccharothrix variisporea DSM 43911 T (98.7%) and Saccharothrix texasensis NRRL B-16134 T (98.6%). However, the new strain exhibited only 12.5-17.5% DNA relatedness to the neighbouring Saccharothrix spp. On the basis of phenotypic characteristics, 16S rRNA gene sequence comparisons and DNA-DNA hybridizations, strain MB46 T is concluded to represent a novel species of the genus Saccharothrix, for which the name Saccharothrix ghardaiensis sp. nov. (type strain MB46 T = DSM 46886 T = CECT 9046 T ) is proposed.
Analysis of transcriptome in hickory (Carya cathayensis), and uncover the dynamics in the hormonal signaling pathway during graft process.

PubMed

Qiu, Lingling; Jiang, Bo; Fang, Jia; Shen, Yike; Fang, Zhongxiang; Rm, Saravana Kumar; Yi, Keke; Shen, Chenjia; Yan, Daoliang; Zheng, Bingsong

2016-11-17

Hickory (Carya cathayensis), a woody plant with high nutritional and economic value, is widely planted in China. Due to its long juvenile phase, grafting is a useful technique for large-scale cultivation of hickory. To reveal the molecular mechanism during the graft process, we sequenced the transcriptomes of graft union in hickory. In our study, six RNA-seq libraries yielded a total of 83,676,860 clean short reads comprising 4.19 Gb of sequence data. A large number of differentially expressed genes (DEGs) at three time points during the graft process were identified. In detail, 777 DEGs in the 7 d vs 0 d (day after grafting) comparison were classified into 11 enriched Gene Ontology (GO) categories, and 262 DEGs in the 14 d vs 0 d comparison were classified into 15 enriched GO categories. Furthermore, an overview of the PPI network was constructed by these DEGs. In addition, 20 genes related to the auxin-and cytokinin-signaling pathways were identified, and some were validated by qRT-PCR analysis. Our comprehensive analysis provides basic information on the candidate genes and hormone signaling pathways involved in the graft process in hickory and other woody plants.
Comparison of biological and genomic characteristics between a newly isolated mink enteritis parvovirus MEV-LHV and an attenuated strain MEV-L.

PubMed

Mao, Yaping; Wang, Jigui; Hou, Qiang; Xi, Ji; Zhang, Xiaomei; Bian, Dawei; Yu, Yongle; Wang, Xi; Liu, Weiquan

2016-06-01

A virus isolated from mink showing clinical signs of enteritis was identified as a high virulent mink enteritis parvovirus (MEV) based on its biological characteristics in vivo and in vitro. Mink, challenged with this strain named MEV-LHV, exhibited severe pathological lesions as compared to those challenged with attenuated strain MEV-L. MEV-LHV also showed higher infection and replication efficiencies in vitro than MEV-L. Sequence of the complete genome of MEV-LHV was determined and analyzed in comparison with those in GenBank, which revealed that MEV-LHV shared high homology with virulent strain MEV SD12/01, whereas MEV-L was closely related to Abashiri and vaccine strain MEVB, and belonged to a different branch of the phylogenetic tree. The genomes of the two strains differed by insertions and deletions in their palindromic termini and specific unique mutations (especially VP2 300) in coding sequences which may be involved in viral replication and pathogenicity. The results of this study provide a better understanding of the biological and genomic characteristics of MEV and identify certain regions and sites that may be involved in viral replication and pathogenicity.
Some cross-linguistic evidence for modulation of implicational universals by language-specific frequency effects in phonological development

PubMed Central

Edwards, Jan; Beckman, Mary E.

2009-01-01

While broad-focus comparisons of consonant inventories across children acquiring different language can suggest that phonological development follows a universal sequence, finer-grained statistical comparisons can reveal systematic differences. This cross-linguistic study of word-initial lingual obstruents examined some effects of language-specific frequencies on consonant mastery. Repetitions of real words were elicited from 2- and 3-year-old children who were monolingual speakers of English, Cantonese, Greek, or Japanese. The repetitions were recorded and transcribed by an adult native speaker for each language. Results found support for both language-universal effects in phonological acquisition and for language-specific influences related to phoneme and phoneme sequence frequency. These results suggest that acquisition patterns that are common across languages arise in two ways. One influence is direct, via the universal constraints imposed by the physiology and physics of speech production and perception, and how these predict which contrasts will be easy and which will be difficult for the child to learn to control. The other influence is indirect, via the way universal principles of ease of perception and production tend to influence the lexicons of many languages through commonly attested sound changes. PMID:19890438
BLAST Ring Image Generator (BRIG): simple prokaryote genome comparisons

PubMed Central

2011-01-01

Background Visualisation of genome comparisons is invaluable for helping to determine genotypic differences between closely related prokaryotes. New visualisation and abstraction methods are required in order to improve the validation, interpretation and communication of genome sequence information; especially with the increasing amount of data arising from next-generation sequencing projects. Visualising a prokaryote genome as a circular image has become a powerful means of displaying informative comparisons of one genome to a number of others. Several programs, imaging libraries and internet resources already exist for this purpose, however, most are either limited in the number of comparisons they can show, are unable to adequately utilise draft genome sequence data, or require a knowledge of command-line scripting for implementation. Currently, there is no freely available desktop application that enables users to rapidly visualise comparisons between hundreds of draft or complete genomes in a single image. Results BLAST Ring Image Generator (BRIG) can generate images that show multiple prokaryote genome comparisons, without an arbitrary limit on the number of genomes compared. The output image shows similarity between a central reference sequence and other sequences as a set of concentric rings, where BLAST matches are coloured on a sliding scale indicating a defined percentage identity. Images can also include draft genome assembly information to show read coverage, assembly breakpoints and collapsed repeats. In addition, BRIG supports the mapping of unassembled sequencing reads against one or more central reference sequences. Many types of custom data and annotations can be shown using BRIG, making it a versatile approach for visualising a range of genomic comparison data. BRIG is readily accessible to any user, as it assumes no specialist computational knowledge and will perform all required file parsing and BLAST comparisons automatically. Conclusions There is a clear need for a user-friendly program that can produce genome comparisons for a large number of prokaryote genomes with an emphasis on rapidly utilising unfinished or unassembled genome data. Here we present BRIG, a cross-platform application that enables the interactive generation of comparative genomic images via a simple graphical-user interface. BRIG is freely available for all operating systems at http://sourceforge.net/projects/brig/. PMID:21824423
BLAST Ring Image Generator (BRIG): simple prokaryote genome comparisons.

PubMed

Alikhan, Nabil-Fareed; Petty, Nicola K; Ben Zakour, Nouri L; Beatson, Scott A

2011-08-08

Visualisation of genome comparisons is invaluable for helping to determine genotypic differences between closely related prokaryotes. New visualisation and abstraction methods are required in order to improve the validation, interpretation and communication of genome sequence information; especially with the increasing amount of data arising from next-generation sequencing projects. Visualising a prokaryote genome as a circular image has become a powerful means of displaying informative comparisons of one genome to a number of others. Several programs, imaging libraries and internet resources already exist for this purpose, however, most are either limited in the number of comparisons they can show, are unable to adequately utilise draft genome sequence data, or require a knowledge of command-line scripting for implementation. Currently, there is no freely available desktop application that enables users to rapidly visualise comparisons between hundreds of draft or complete genomes in a single image. BLAST Ring Image Generator (BRIG) can generate images that show multiple prokaryote genome comparisons, without an arbitrary limit on the number of genomes compared. The output image shows similarity between a central reference sequence and other sequences as a set of concentric rings, where BLAST matches are coloured on a sliding scale indicating a defined percentage identity. Images can also include draft genome assembly information to show read coverage, assembly breakpoints and collapsed repeats. In addition, BRIG supports the mapping of unassembled sequencing reads against one or more central reference sequences. Many types of custom data and annotations can be shown using BRIG, making it a versatile approach for visualising a range of genomic comparison data. BRIG is readily accessible to any user, as it assumes no specialist computational knowledge and will perform all required file parsing and BLAST comparisons automatically. There is a clear need for a user-friendly program that can produce genome comparisons for a large number of prokaryote genomes with an emphasis on rapidly utilising unfinished or unassembled genome data. Here we present BRIG, a cross-platform application that enables the interactive generation of comparative genomic images via a simple graphical-user interface. BRIG is freely available for all operating systems at http://sourceforge.net/projects/brig/.
Yeast mitochondrial threonyl-tRNA synthetase recognizes tRNA isoacceptors by distinct mechanisms and promotes CUN codon reassignment

DOE Office of Scientific and Technical Information (OSTI.GOV)

Ling, Jiqiang; Peterson, Kaitlyn M.; Simonovic, Ivana

2014-03-12

Aminoacyl-tRNA synthetases (aaRSs) ensure faithful translation of mRNA into protein by coupling an amino acid to a set of tRNAs with conserved anticodon sequences. Here, we show that in mitochondria of Saccharomyces cerevisiae, a single aaRS (MST1) recognizes and aminoacylates two natural tRNAs that contain anticodon loops of different size and sequence. Besides a regular ?? with a threonine (Thr) anticodon, MST1 also recognizes an unusual ??, which contains an enlarged anticodon loop and an anticodon triplet that reassigns the CUN codons from leucine to threonine. Our data show that MST1 recognizes the anticodon loop in both tRNAs, but employsmore » distinct recognition mechanisms. The size but not the sequence of the anticodon loop is critical for ?? recognition, whereas the anticodon sequence is essential for aminoacylation of ??. The crystal structure of MST1 reveals that, while lacking the N-terminal editing domain, the enzyme closely resembles the bacterial threonyl-tRNA synthetase (ThrRS). A detailed structural comparison with Escherichia coli ThrRS, which is unable to aminoacylate ??, reveals differences in the anticodon-binding domain that probably allow recognition of the distinct anticodon loops. Finally, our mutational and modeling analyses identify the structural elements in MST1 (e.g., helix {alpha}11) that define tRNA selectivity. Thus, MTS1 exemplifies that a single aaRS can recognize completely divergent anticodon loops of natural isoacceptor tRNAs and that in doing so it facilitates the reassignment of the genetic code in yeast mitochondria.« less
Quantifying the Number of Independent Organelle DNA Insertions in Genome Evolution and Human Health.

PubMed

Hazkani-Covo, Einat; Martin, William F

2017-05-01

Fragments of organelle genomes are often found as insertions in nuclear DNA. These fragments of mitochondrial DNA (numts) and plastid DNA (nupts) are ubiquitous components of eukaryotic genomes. They are, however, often edited out during the genome assembly process, leading to systematic underestimation of their frequency. Numts and nupts, once inserted, can become further fragmented through subsequent insertion of mobile elements or other recombinational events that disrupt the continuity of the inserted sequence relative to the genuine organelle DNA copy. Because numts and nupts are typically identified through sequence comparison tools such as BLAST, disruption of insertions into smaller fragments can lead to systematic overestimation of numt and nupt frequencies. Accurate identification of numts and nupts is important, however, both for better understanding of their role during evolution, and for monitoring their increasingly evident role in human disease. Human populations are polymorphic for 141 numt loci, five numts are causal to genetic disease, and cancer genomic studies are revealing an abundance of numts associated with tumor progression. Here, we report investigation of salient parameters involved in obtaining accurate estimates of numt and nupt numbers in genome sequence data. Numts and nupts from 44 sequenced eukaryotic genomes reveal lineage-specific differences in the number, relative age and frequency of insertional events as well as lineage-specific dynamics of their postinsertional fragmentation. Our findings outline the main technical parameters influencing accurate identification and frequency estimation of numts in genomic studies pertinent to both evolution and human health. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Genome sequencing of the lizard parasite Leishmania tarentolae reveals loss of genes associated to the intracellular stage of human pathogenic species

PubMed Central

Raymond, Frédéric; Boisvert, Sébastien; Roy, Gaétan; Ritt, Jean-François; Légaré, Danielle; Isnard, Amandine; Stanke, Mario; Olivier, Martin; Tremblay, Michel J.; Papadopoulou, Barbara; Ouellette, Marc; Corbeil, Jacques

2012-01-01

The Leishmania tarentolae Parrot-TarII strain genome sequence was resolved to an average 16-fold mean coverage by next-generation DNA sequencing technologies. This is the first non-pathogenic to humans kinetoplastid protozoan genome to be described thus providing an opportunity for comparison with the completed genomes of pathogenic Leishmania species. A high synteny was observed between all sequenced Leishmania species. A limited number of chromosomal regions diverged between L. tarentolae and L. infantum, while remaining syntenic to L. major. Globally, >90% of the L. tarentolae gene content was shared with the other Leishmania species. We identified 95 predicted coding sequences unique to L. tarentolae and 250 genes that were absent from L. tarentolae. Interestingly, many of the latter genes were expressed in the intracellular amastigote stage of pathogenic species. In addition, genes coding for products involved in antioxidant defence or participating in vesicular-mediated protein transport were underrepresented in L. tarentolae. In contrast to other Leishmania genomes, two gene families were expanded in L. tarentolae, namely the zinc metallo-peptidase surface glycoprotein GP63 and the promastigote surface antigen PSA31C. Overall, L. tarentolae's gene content appears better adapted to the promastigote insect stage rather than the amastigote mammalian stage. PMID:21998295
Comparison of microbial DNA enrichment tools for metagenomic whole genome sequencing.

PubMed

Thoendel, Matthew; Jeraldo, Patricio R; Greenwood-Quaintance, Kerryl E; Yao, Janet Z; Chia, Nicholas; Hanssen, Arlen D; Abdel, Matthew P; Patel, Robin

2016-08-01

Metagenomic whole genome sequencing for detection of pathogens in clinical samples is an exciting new area for discovery and clinical testing. A major barrier to this approach is the overwhelming ratio of human to pathogen DNA in samples with low pathogen abundance, which is typical of most clinical specimens. Microbial DNA enrichment methods offer the potential to relieve this limitation by improving this ratio. Two commercially available enrichment kits, the NEBNext Microbiome DNA Enrichment Kit and the Molzym MolYsis Basic kit, were tested for their ability to enrich for microbial DNA from resected arthroplasty component sonicate fluids from prosthetic joint infections or uninfected sonicate fluids spiked with Staphylococcus aureus. Using spiked uninfected sonicate fluid there was a 6-fold enrichment of bacterial DNA with the NEBNext kit and 76-fold enrichment with the MolYsis kit. Metagenomic whole genome sequencing of sonicate fluid revealed 13- to 85-fold enrichment of bacterial DNA using the NEBNext enrichment kit. The MolYsis approach achieved 481- to 9580-fold enrichment, resulting in 7 to 59% of sequencing reads being from the pathogens known to be present in the samples. These results demonstrate the usefulness of these tools when testing clinical samples with low microbial burden using next generation sequencing. Copyright © 2016 Elsevier B.V. All rights reserved.
Cryptosporidium in fish: alternative sequencing approaches and analyses at multiple loci to resolve mixed infections.

PubMed

Paparini, Andrea; Yang, Rongchang; Chen, Linda; Tong, Kaising; Gibson-Kueh, Susan; Lymbery, Alan; Ryan, Una M

2017-11-01

Currently, the systematics, biology and epidemiology of piscine Cryptosporidium species are poorly understood. Here, we compared Sanger ‒ and next-generation ‒ sequencing (NGS), of piscine Cryptosporidium, at the 18S rRNA and actin genes. The hosts comprised 11 ornamental fish species, spanning four orders and eight families. The objectives were: to (i) confirm the rich genetic diversity of the parasite and the high frequency of mixed infections; and (ii) explore the potential of NGS in the presence of complex genetic mixtures. By Sanger sequencing, four main genotypes were obtained at the actin locus, while for the 18S locus, seven genotypes were identified. At both loci, NGS revealed frequent mixed infections, consisting of one highly dominant variant plus substantially rarer genotypes. Both sequencing methods detected novel Cryptosporidium genotypes at both loci, including a novel and highly abundant actin genotype that was identified by both Sanger sequencing and NGS. Importantly, this genotype accounted for 68·9% of all NGS reads from all samples (249 585/362 372). The present study confirms that aquarium fish can harbour a large and unexplored Cryptosporidium genetic diversity. Although commonly used in molecular parasitology studies, nested PCR prevents quantitative comparisons and thwarts the advantages of NGS, when this latter approach is used to investigate multiple infections.
Specific DNA binding of the two chicken Deformed family homeodomain proteins, Chox-1.4 and Chox-a.

PubMed Central

Sasaki, H; Yokoyama, E; Kuroiwa, A

1990-01-01

The cDNA clones encoding two chicken Deformed (Dfd) family homeobox containing genes Chox-1.4 and Chox-a were isolated. Comparison of their amino acid sequences with another chicken Dfd family homeodomain protein and with those of mouse homologues revealed that strong homologies are located in the amino terminal regions and around the homeodomains. Although homologies in other regions were relatively low, some short conserved sequences were also identified. E. coli-made full length proteins were purified and used for the production of specific antibodies and for DNA binding studies. The binding profiles of these proteins to the 5'-leader and 5'-upstream sequences of Chox-1.4 and Chox-a coding regions were analyzed by immunoprecipitation and DNase I footprint assays. These two Chox proteins bound to the same sites in the 5'-flanking sequences of their coding regions with various affinities and their binding affinities to each site were nearly the same. The consensus sequences of the high and low affinity binding sites were TAATGA(C/G) and CTAATTTT, respectively. A clustered binding site was identified in the 5'-upstream of the Chox-a gene, suggesting that this clustered binding site works as a cis-regulatory element for auto- and/or cross-regulation of Chox-a gene expression. Images PMID:1970866

Tracking the Origin and Deciphering the Phylogenetic Relationship of Porcine Epidemic Diarrhea Virus in Ecuador

PubMed Central

Barrera, Maritza; Garrido-Haro, Ana; Vaca, María S.; Granda, Danilo; Acosta-Batallas, Alfredo

2017-01-01

In 2010, new Chinese strains of porcine epidemic diarrhea virus (PEDV), clinically more severe than the classical strains, emerged. These strains were spread to United States in 2013 through an intercontinental transmission from China with further spreading across the world, evidencing the emergent nature of these strains. In the present study, an analysis of PEDV field sequences from Ecuador was conducted by comparing all the PEDV S gene sequences available in the GenBank database. Phylogenetic comparisons and Bayesian phylogeographic inference based on complete S gene sequences were also conducted to track the origin and putative route of PEDV. The sequence from the PED-outbreak in Ecuador was grouped into the clade II of PEDV genogroup 2a together with other sequences of isolates from Mexico, Canada, and United States. The phylogeographic study revealed the emergence of the Chinese PEDV strains, followed by spreading to US in 2013, from US to Korea, and later the introduction of PEDV to Canada, Mexico, and Ecuador directly from the US. The sources of imports of live swine in Ecuador in 2014 were mainly from Chile and US. Thus, this movement of pigs is suggested as the main way for introducing PEDV to Ecuador. PMID:29379796
Barcode Identifiers as a Practical Tool for Reliable Species Assignment of Medically Important Black Yeast Species

PubMed Central

Heinrichs, Guido; de Hoog, G. Sybren

2012-01-01

Herpotrichiellaceous black yeasts and relatives comprise severe pathogens flanked by nonpathogenic environmental siblings. Reliable identification by conventional methods is notoriously difficult. Molecular identification is hampered by the sequence variability in the internal transcribed spacer (ITS) domain caused by difficult-to-sequence homopolymeric regions and by poor taxonomic attribution of sequences deposited in GenBank. Here, we present a potential solution using short barcode identifiers (27 to 50 bp) based on ITS2 ribosomal DNA (rDNA), which allows unambiguous definition of species-specific fragments. Starting from proven sequences of ex-type and authentic strains, we were able to describe 103 identifiers. Multiple BLAST searches of these proposed barcode identifiers in GenBank revealed uniqueness for 100 taxonomic entities, whereas the three remaining identifiers each matched with two entities, but the species of these identifiers could easily be discriminated by differences in the remaining ITS regions. Using the proposed barcode identifiers, a 4.1-fold increase of 100% matches in GenBank was achieved in comparison to the classical approach using the complete ITS sequences. The proposed barcode identifiers will be made accessible for the diagnostic laboratory in a permanently updated online database, thereby providing a highly practical, reliable, and cost-effective tool for identification of clinically important black yeasts and relatives. PMID:22785187
Birth and death of genes linked to chromosomal inversion

PubMed Central

Furuta, Yoshikazu; Kawai, Mikihiko; Yahara, Koji; Takahashi, Noriko; Handa, Naofumi; Tsuru, Takeshi; Oshima, Kenshiro; Yoshida, Masaru; Azuma, Takeshi; Hattori, Masahira; Uchiyama, Ikuo; Kobayashi, Ichizo

2011-01-01

The birth and death of genes is central to adaptive evolution, yet the underlying genome dynamics remain elusive. The availability of closely related complete genome sequences helps to follow changes in gene contents and clarify their relationship to overall genome organization. Helicobacter pylori, bacteria in our stomach, are known for their extreme genome plasticity through mutation and recombination and will make a good target for such an analysis. In comparing their complete genome sequences, we found that gain and loss of genes (loci) for outer membrane proteins, which mediate host interaction, occurred at breakpoints of chromosomal inversions. Sequence comparison there revealed a unique mechanism of DNA duplication: DNA duplication associated with inversion. In this process, a DNA segment at one chromosomal locus is copied and inserted, in an inverted orientation, into a distant locus on the same chromosome, while the entire region between these two loci is also inverted. Recognition of this and three more inversion modes, which occur through reciprocal recombination between long or short sequence similarity or adjacent to a mobile element, allowed reconstruction of synteny evolution through inversion events in this species. These results will guide the interpretation of extensive DNA sequencing results for understanding long- and short-term genome evolution in various organisms and in cancer cells. PMID:21212362
Comparative sequence analysis of a region on human chromosome 13q14, frequently deleted in B-cell chronic lymphocytic leukemia, and its homologous region on mouse chromosome 14.

PubMed

Kapanadze, B; Makeeva, N; Corcoran, M; Jareborg, N; Hammarsund, M; Baranova, A; Zabarovsky, E; Vorontsova, O; Merup, M; Gahrton, G; Jansson, M; Yankovsky, N; Einhorn, S; Oscier, D; Grandér, D; Sangfelt, O

2000-12-15

Previous studies have indicated the presence of a putative tumor suppressor gene on human chromosome 13q14, commonly deleted in patients with B-cell chronic lymphocytic leukemia (B-CLL). We have recently identified a minimally deleted region encompassing parts of two adjacent genes, termed LEU1 and LEU2 (leukemia-associated genes 1 and 2), and several additional transcripts. In addition, 50 kb centromeric to this region we have identified another gene, LEU5/RFP2. To elucidate further the complex genomic organization of this region, we have identified, mapped, and sequenced the homologous region in the mouse. Fluorescence in situ hybridization analysis demonstrated that the region maps to mouse chromosome 14. The overall organization and gene order in this region were found to be highly conserved in the mouse. Sequence comparison between the human deletion hotspot region and its homologous mouse region revealed a high degree of sequence conservation with an overall score of 74%. However, our data also show that in terms of transcribed sequences, only two of those, human LEU2 and LEU5/RFP2, are clearly conserved, strengthening the case for these genes as putative candidate B-CLL tumor suppressor genes.
The Status, Quality, and Expansion of the NIH Full-Length cDNA Project: The Mammalian Gene Collection (MGC)

PubMed Central

2004-01-01

The National Institutes of Health's Mammalian Gene Collection (MGC) project was designed to generate and sequence a publicly accessible cDNA resource containing a complete open reading frame (ORF) for every human and mouse gene. The project initially used a random strategy to select clones from a large number of cDNA libraries from diverse tissues. Candidate clones were chosen based on 5′-EST sequences, and then fully sequenced to high accuracy and analyzed by algorithms developed for this project. Currently, more than 11,000 human and 10,000 mouse genes are represented in MGC by at least one clone with a full ORF. The random selection approach is now reaching a saturation point, and a transition to protocols targeted at the missing transcripts is now required to complete the mouse and human collections. Comparison of the sequence of the MGC clones to reference genome sequences reveals that most cDNA clones are of very high sequence quality, although it is likely that some cDNAs may carry missense variants as a consequence of experimental artifact, such as PCR, cloning, or reverse transcriptase errors. Recently, a rat cDNA component was added to the project, and ongoing frog (Xenopus) and zebrafish (Danio) cDNA projects were expanded to take advantage of the high-throughput MGC pipeline. PMID:15489334
Molecular identification and partial sequence analysis of an aryl hydrocarbon receptor from beluga (Delphinapterus leucas)

DOE Office of Scientific and Technical Information (OSTI.GOV)

Jensen, B.A.; Hahn, M.E.

1995-12-31

The aryl hydrocarbon receptor (AhR) mediates the effects of many common and potentially toxic organic hydrocarbons, including some polychlorinated biphenyls and dioxins. Since small cetaceans often inhabit industrially polluted coastal waters, comparison of the molecular structure and function of this protein in cetaeans with other marine and mammalian species is important for evaluating the sensitivity of cetaceans to these pollutants. An AhR protein has been identified in beluga liver by photoaffinity labeling. In the present study, the authors sought to clone and sequence an AhR cDNA from beluga as a prelude to studying its structure and function, using reverse-transcription polymerasemore » chain reaction (RT-PCR) and degenerate primers, a 515 base pair fragment was amplified, cloned and sequenced, revealing homology to the PAS domain (ligand binding and dimerization region) of AhRs from terrestrial mammals. This portion of the putative beluga AhR has 82% amino acid and 81% nucleotide sequence identity to the mouse AhR, and 63% amino acid and 64% nucleotide sequence identity to an AhR from the marine fish Fundulus heteroclitus. A beluga cDNA library was synthesized and is currently being screened with the PCR-generated fragment to obtain the complete coding sequence. This is the first molecular evidence of AhR presence in cetaceans.« less
A survey and evaluations of histogram-based statistics in alignment-free sequence comparison.

PubMed

Luczak, Brian B; James, Benjamin T; Girgis, Hani Z

2017-12-06

Since the dawn of the bioinformatics field, sequence alignment scores have been the main method for comparing sequences. However, alignment algorithms are quadratic, requiring long execution time. As alternatives, scientists have developed tens of alignment-free statistics for measuring the similarity between two sequences. We surveyed tens of alignment-free k-mer statistics. Additionally, we evaluated 33 statistics and multiplicative combinations between the statistics and/or their squares. These statistics are calculated on two k-mer histograms representing two sequences. Our evaluations using global alignment scores revealed that the majority of the statistics are sensitive and capable of finding similar sequences to a query sequence. Therefore, any of these statistics can filter out dissimilar sequences quickly. Further, we observed that multiplicative combinations of the statistics are highly correlated with the identity score. Furthermore, combinations involving sequence length difference or Earth Mover's distance, which takes the length difference into account, are always among the highest correlated paired statistics with identity scores. Similarly, paired statistics including length difference or Earth Mover's distance are among the best performers in finding the K-closest sequences. Interestingly, similar performance can be obtained using histograms of shorter words, resulting in reducing the memory requirement and increasing the speed remarkably. Moreover, we found that simple single statistics are sufficient for processing next-generation sequencing reads and for applications relying on local alignment. Finally, we measured the time requirement of each statistic. The survey and the evaluations will help scientists with identifying efficient alternatives to the costly alignment algorithm, saving thousands of computational hours. The source code of the benchmarking tool is available as Supplementary Materials. © The Author 2017. Published by Oxford University Press.
Unusual Variability of the Drosophila Melanogaster Ref(2)p Protein Which Controls the Multiplication of Sigma Rhabdovirus

PubMed Central

Dru, P.; Bras, F.; Dezelee, S.; Gay, P.; Petitjean, A. M.; Pierre-Deneubourg, A.; Teninges, D.; Contamine, D.

1993-01-01

The ref(2)P gene of Drosophila melanogaster was identified by the discovery of two alleles, P(o) and P(p), respectively, permissive and restrictive for sigma rhabdovirus multiplication. A surprising variability of this gene was first noticed by the observation of size differences between the transcripts of permissive and restrictive alleles. In this paper, another restrictive allele, P(n), clearly distinct from P(p), is described: it exhibits a weaker antiviral effect than P(p) and differs from P(p) by its molecular structure. Five types of alleles were distinguished on the basis of their molecular structure, as revealed by S1 nuclease analysis of 17 D. melanogaster strains; three alleles were permissive and two restrictive. Comparison of the sequences of four haplotypes revealed numerous point mutations, two deletions (21 and 24 bp) and a complex event involving a 3-bp deletion, all affected the coding region. The unusual variability of the ref(2)P locus was confirmed by the high ratio of amino acid replacements to synonymous mutations (7:1), as compared to that of other genes, such as the Adh (2:42). Nevertheless, nucleotide sequence comparison with the Drosophila erecta ref(2)P gene shows that selective pressures are exerted to maintain the existence of a functional protein. The effects of this high variability on the ref(2)P protein are discussed in relation to its specific antiviral properties and to its function in D. melanogaster, where it is required for male fertility. PMID:8462852
Comparative genomics and the role of lateral gene transfer in the evolution of bovine adapted Streptococcus agalactiae

PubMed Central

Richards, Vincent P.; Lang, Ping; Pavinski Bitar, Paulina D.; Lefébure, Tristan; Schukken, Ynte H.; Zadoks, Ruth N.; Stanhope, Michael J.

2011-01-01

In addition to causing severe invasive infections in humans, Streptococcus agalactiae, or group B Streptococcus (GBS), is also a major cause of bovine mastitis. Here we provide the first genome sequence for S. agalactiae isolated from a cow diagnosed with clinical mastitis (strain FSL S3-026). Comparison to eight S. agalactiae genomes obtained from human disease isolates revealed 183 genes specific to the bovine strain. Subsequent polymerase chain reaction (PCR) screening for the presence/absence of a subset of these loci in additional bovine and human strains revealed strong differentiation between the two groups (Fisher exact test: p < 0.0001). The majority of the bovine strain-specific genes (~85%) clustered tightly into eight genomic islands, suggesting these genes were acquired through lateral gene transfer (LGT). This bovine GBS also contained an unusually high proportion of insertion sequences (4.3% of the total genome), suggesting frequent genomic rearrangement. Comparison to other mastitis-causing species of bacteria provided strong evidence for two cases of interspecies LGT within the shared bovine environment: bovine S. agalactiae with Streptococcus uberis (nisin U operon) and Streptococcus dysgalactiae subsp. dysgalactiae (lactose operon). We also found evidence for LGT, involving the salivaricin operon, between the bovine S. agalactiae strain and either Streptococcus pyogenes or Streptococcus salivarius. Our findings provide insight intomechanismsfacilitatingenvironmentaladaptationandacquisitionofpotential virulence factors, while highlighting both the key role LGT has played in the recent evolution of the bovine S. agalactiae strain, and the importance of LGT among pathogens within a shared environment. PMID:21536150
The mitochondrial genome of the Japanese skeleton shrimp Caprella mutica (Amphipoda: Caprellidea) reveals a unique gene order and shared apomorphic translocations with Gammaridea.

PubMed

Kilpert, Fabian; Podsiadlowski, Lars

2010-06-01

This study presents the complete mitochondrial (mt) genome of the amphipod Caprella mutica, an east-Asian species, which recently invaded the coastal regions of North America, Europe, and New Zealand. It is the first complete sequence of a member of the amphipod subclade Caprellidea. The mt genome has a total length of 15,427 bp and is organized in a circular double-strand molecule. All 37 mt genes are present, including the common set of 22 tRNAs. Particularly noticeable is the duplication of the control region (CR). The additional CR is located between nad6 and cob, and is almost identical to the original one. The most extensive changes in the gene order affect nad5 and a block consisting of trnH, nad4, nad4L, and trnP-all inserted near the original CR. The gene nad5 is also inverted. Furthermore, a comparison with the pancrustacean ground pattern reveals additional changes of individual tRNA genes. Some of these changes are also shared by Metacrangonyx longipes and Parhyale hawaiensis. These arrangements were found only in amphipods and might be considered as apomorphic by character states of Amphipoda. In all the three species, there is good evidence that trnG originated from a rare duplication/remolding event of the adjacent trnW gene. Nevertheless, each of the three available amphipod mitogenome sequences also bears unique rearrangements. C. mutica, however, shows the most extensive rearrangement in comparison with the pancrustacean ground pattern.
Genetic Characterization of a Sinorhizobium meliloti Chromosomal Region Involved in Lipopolysaccharide Biosynthesis

PubMed Central

Lagares, Antonio; Hozbor, Daniela F.; Niehaus, Karsten; Otero, Augusto J. L. Pich; Lorenzen, Jens; Arnold, Walter; Pühler, Alfred

2001-01-01

The genetic characterization of a 5.5-kb chromosomal region of Sinorhizobium meliloti 2011 that contains lpsB, a gene required for the normal development of symbiosis with Medicago spp., is presented. The nucleotide sequence of this DNA fragment revealed the presence of six genes: greA and lpsB, transcribed in the forward direction; and lpsE, lpsD, lpsC, and lrp, transcribed in the reverse direction. Except for lpsB, none of the lps genes were relevant for nodulation and nitrogen fixation. Analysis of the transcriptional organization of lpsB showed that greA and lpsB are part of separate transcriptional units, which is in agreement with the finding of a DNA stretch homologous to a “nonnitrogen” promoter consensus sequence between greA and lpsB. The opposite orientation of lpsB with respect to its first downstream coding sequence, lpsE, indicated that the altered LPS and the defective symbiosis of lpsB mutants are both consequences of a primary nonpolar defect in a single gene. Global sequence comparisons revealed that the greA-lpsB and lrp genes of S. meliloti have a genetic organization similar to that of their homologous loci in R. leguminosarum bv. viciae. In particular, high sequence similarity was found between the translation product of lpsB and a core-related biosynthetic mannosyltransferase of R. leguminosarum bv. viciae encoded by the lpcC gene. The functional relationship between these two genes was demonstrated in genetic complementation experiments in which the S. meliloti lpsB gene restored the wild-type LPS phenotype when introduced into lpcC mutants of R. leguminosarum. These results support the view that S. meliloti lpsB also encodes a mannosyltransferase that participates in the biosynthesis of the LPS core. Evidence is provided for the presence of other lpsB-homologous sequences in several members of the family Rhizobiaceae. PMID:11157937
Comparative Analysis of Genome Sequences Covering the Seven Cronobacter Species

PubMed Central

Cummings, Craig A.; Shih, Rita; Degoricija, Lovorka; Rico, Alain; Brzoska, Pius; Hamby, Stephen E.; Masood, Naqash; Hariri, Sumyya; Sonbol, Hana; Chuzhanova, Nadia; McClelland, Michael; Furtado, Manohar R.; Forsythe, Stephen J.

2012-01-01

Background Species of Cronobacter are widespread in the environment and are occasional food-borne pathogens associated with serious neonatal diseases, including bacteraemia, meningitis, and necrotising enterocolitis. The genus is composed of seven species: C. sakazakii, C. malonaticus, C. turicensis, C. dublinensis, C. muytjensii, C. universalis, and C. condimenti. Clinical cases are associated with three species, C. malonaticus, C. turicensis and, in particular, with C. sakazakii multilocus sequence type 4. Thus, it is plausible that virulence determinants have evolved in certain lineages. Methodology/Principal Findings We generated high quality sequence drafts for eleven Cronobacter genomes representing the seven Cronobacter species, including an ST4 strain of C. sakazakii. Comparative analysis of these genomes together with the two publicly available genomes revealed Cronobacter has over 6,000 genes in one or more strains and over 2,000 genes shared by all Cronobacter. Considerable variation in the presence of traits such as type six secretion systems, metal resistance (tellurite, copper and silver), and adhesins were found. C. sakazakii is unique in the Cronobacter genus in encoding genes enabling the utilization of exogenous sialic acid which may have clinical significance. The C. sakazakii ST4 strain 701 contained additional genes as compared to other C. sakazakii but none of them were known specific virulence-related genes. Conclusions/Significance Genome comparison revealed that pair-wise DNA sequence identity varies between 89 and 97% in the seven Cronobacter species, and also suggested various degrees of divergence. Sets of universal core genes and accessory genes unique to each strain were identified. These gene sequences can be used for designing genus/species specific detection assays. Genes encoding adhesins, T6SS, and metal resistance genes as well as prophages are found in only subsets of genomes and have contributed considerably to the variation of genomic content. Differences in gene content likely contribute to differences in the clinical and environmental distribution of species and sequence types. PMID:23166675
Comparative Analysis of the Complete Plastomes of Apostasia wallichii and Neuwiedia singapureana (Apostasioideae) Reveals Different Evolutionary Dynamics of IR/SSC Boundary among Photosynthetic Orchids.

PubMed

Niu, Zhitao; Pan, Jiajia; Zhu, Shuying; Li, Ludan; Xue, Qingyun; Liu, Wei; Ding, Xiaoyu

2017-01-01

Apostasioideae, consists of only two genera, Apostasia and Neuwiedia , which are mainly distributed in Southeast Asia and northern Australia. The floral structure, taxonomy, biogeography, and genome variation of Apostasioideae have been intensively studied. However, detailed analyses of plastome composition and structure and comparisons with those of other orchid subfamilies have not yet been conducted. Here, the complete plastome sequences of Apostasia wallichii and Neuwiedia singapureana were sequenced and compared with 43 previously published photosynthetic orchid plastomes to characterize the plastome structure and evolution in the orchids. Unlike many orchid plastomes (e.g., Paphiopedilum and Vanilla ), the plastomes of Apostasioideae contain a full set of 11 functional NADH dehydrogenase ( ndh ) genes. The distribution of repeat sequences and simple sequence repeat elements enhanced the view that the mutation rate of non-coding regions was higher than that of coding regions. The 10 loci- ndhA intron, matK-5'trnK , clpP-psbB , rps8-rpl14 , trnT-trnL , 3'trnK-matK , clpP intron , psbK-trnK , trnS-psbC , and ndhF-rpl32 -that had the highest degrees of sequence variability were identified as mutational hotspots for the Apostasia plastome. Furthermore, our results revealed that plastid genes exhibited a variable evolution rate within and among different orchid genus. Considering the diversified evolution of both coding and non-coding regions, we suggested that the plastome-wide evolution of orchid species was disproportional. Additionally, the sequences flanking the inverted repeat/small single copy (IR/SSC) junctions of photosynthetic orchid plastomes were categorized into three types according to the presence/absence of ndh genes. Different evolutionary dynamics for each of the three IR/SSC types of photosynthetic orchid plastomes were also proposed.
Comparative Analysis of the Complete Plastomes of Apostasia wallichii and Neuwiedia singapureana (Apostasioideae) Reveals Different Evolutionary Dynamics of IR/SSC Boundary among Photosynthetic Orchids

PubMed Central

Niu, Zhitao; Pan, Jiajia; Zhu, Shuying; Li, Ludan; Xue, Qingyun; Liu, Wei; Ding, Xiaoyu

2017-01-01

Apostasioideae, consists of only two genera, Apostasia and Neuwiedia, which are mainly distributed in Southeast Asia and northern Australia. The floral structure, taxonomy, biogeography, and genome variation of Apostasioideae have been intensively studied. However, detailed analyses of plastome composition and structure and comparisons with those of other orchid subfamilies have not yet been conducted. Here, the complete plastome sequences of Apostasia wallichii and Neuwiedia singapureana were sequenced and compared with 43 previously published photosynthetic orchid plastomes to characterize the plastome structure and evolution in the orchids. Unlike many orchid plastomes (e.g., Paphiopedilum and Vanilla), the plastomes of Apostasioideae contain a full set of 11 functional NADH dehydrogenase (ndh) genes. The distribution of repeat sequences and simple sequence repeat elements enhanced the view that the mutation rate of non-coding regions was higher than that of coding regions. The 10 loci—ndhA intron, matK-5′trnK, clpP-psbB, rps8-rpl14, trnT-trnL, 3′trnK-matK, clpP intron, psbK-trnK, trnS-psbC, and ndhF-rpl32—that had the highest degrees of sequence variability were identified as mutational hotspots for the Apostasia plastome. Furthermore, our results revealed that plastid genes exhibited a variable evolution rate within and among different orchid genus. Considering the diversified evolution of both coding and non-coding regions, we suggested that the plastome-wide evolution of orchid species was disproportional. Additionally, the sequences flanking the inverted repeat/small single copy (IR/SSC) junctions of photosynthetic orchid plastomes were categorized into three types according to the presence/absence of ndh genes. Different evolutionary dynamics for each of the three IR/SSC types of photosynthetic orchid plastomes were also proposed. PMID:29046685
RNA sequencing analysis to capture the transcriptome landscape during skin ulceration syndrome progression in sea cucumber Apostichopus japonicus.

PubMed

Yang, Aifu; Zhou, Zunchun; Pan, Yongjia; Jiang, Jingwei; Dong, Ying; Guan, Xiaoyan; Sun, Hongjuan; Gao, Shan; Chen, Zhong

2016-06-14

Sea cucumber Apostichopus japonicus is an important economic species in China, which is affected by various diseases; skin ulceration syndrome (SUS) is the most serious. In this study, we characterized the transcriptomes in A. japonicus challenged with Vibrio splendidus to elucidate the changes in gene expression throughout the three stages of SUS progression. RNA sequencing of 21 cDNA libraries from various tissues and developmental stages of SUS-affected A. japonicus yielded 553 million raw reads, of which 542 million high-quality reads were generated by deep-sequencing using the Illumina HiSeq™ 2000 platform. The reference transcriptome comprised a combination of the Illumina reads, 454 sequencing data and Sanger sequences obtained from the public database to generate 93,163 unigenes (average length, 1,052 bp; N50 = 1,575 bp); 33,860 were annotated. Transcriptome comparisons between healthy and SUS-affected A. japonicus revealed greater differences in gene expression profiles in the body walls (BW) than in the intestines (Int), respiratory trees (RT) and coelomocytes (C). Clustering of expression models revealed stable up-regulation as the main pattern occurring in the BW throughout the three stages of SUS progression. Significantly affected pathways were associated with signal transduction, immune system, cellular processes, development and metabolism. Ninety-two differentially expressed genes (DEGs) were divided into four functional categories: attachment/pathogen recognition (17), inflammatory reactions (38), oxidative stress response (7) and apoptosis (30). Using quantitative real-time PCR, twenty representative DEGs were selected to validate the sequencing results. The Pearson's correlation coefficient (R) of the 20 DEGs ranged from 0.811 to 0.999, which confirmed the consistency and accuracy between these two approaches. Dynamic changes in global gene expression occur during SUS progression in A. japonicus. Elucidation of these changes is important in clarifying the molecular mechanisms associated with the development of SUS in sea cucumber.
Validation of Minim typing for fast and accurate discrimination of extended-spectrum, beta-lactamase-producing Klebsiella pneumoniae isolates in tertiary care hospital.

PubMed

Brhelova, Eva; Kocmanova, Iva; Racil, Zdenek; Hanslianova, Marketa; Antonova, Mariya; Mayer, Jiri; Lengerova, Martina

2016-09-01

Minim typing is derived from the multi-locus sequence typing (MLST). It targets the same genes, but sequencing is replaced by high resolution melt analysis. Typing can be performed by analysing six loci (6MelT), four loci (4MelT) or using data from four loci plus sequencing the tonB gene (HybridMelT). The aim of this study was to evaluate Minim typing to discriminate extended-spectrum beta-lactamase producing Klebsiella pneumoniae (ESBL-KLPN) isolates at our hospital. In total, 380 isolates were analyzed. The obtained alleles were assigned according to both the 6MelT and 4MelT typing scheme. In 97 isolates, the tonB gene was sequenced to enable HybridMelT typing. We found that the presented method is suitable to quickly monitor isolates of ESBL-KLPN; results are obtained in less than 2 hours and at a lower cost than MLST. We identified a local ESBL-KLPN outbreak and a comparison of colonizing and invasive isolates revealed a long term colonization of patients with the same strain. Copyright © 2016 Elsevier Inc. All rights reserved.
Epitopes of human testis-specific lactate dehydrogenase deduced from a cDNA sequence

DOE Office of Scientific and Technical Information (OSTI.GOV)

Millan, J.L.; Driscoll, C.E.; LeVan, K.M.

The sequence and structure of human testis-specific L-lactate dehydrogenase (LDHC/sub 4/, LDHX; (L)-lactate:NAD/sup +/ oxidoreductase, EC 1.1.1.27) has been derived from analysis of a complementary DNA (cDNA) clone comprising the complete protein coding region of the enzyme. From the deduced amino acid sequence, human LDHC/sub 4/ is as different from rodent LDHC/sub 4/ (73% homology) as it is from human LDHA/sub 4/ (76% homology) and porcine LDHB/sub 4/ (68% homology). Subunit homologies are consistent with the conclusion that the LDHC gene arose by at least two independent duplication events. Furthermore, the lower degree of homology between mouse and human LDHC/submore » 4/ and the appearance of this isozyme late in evolution suggests a higher rate of mutation in the mammalian LDHC genes than in the LDHA and -B genes. Comparison of exposed amino acid residues of discrete anti-genic determinants of mouse and human LDHC/sub 4/ reveals significant differences. Knowledge of the human LDHC/sub 4/ sequence will help design human-specific peptides useful in the development of a contraceptive vaccine.« less
Complete nucleotide sequences of a new bipartite begomovirus from Malvastrum sp. plants with bright yellow mosaic symptoms in South Texas.

PubMed

Alabi, Olufemi J; Villegas, Cecilia; Gregg, Lori; Murray, K Daniel

2016-06-01

Two isolates of a novel bipartite begomovirus, tentatively named malvastrum bright yellow mosaic virus (MaBYMV), were molecularly characterized from naturally infected plants of the genus Malvastrum showing bright yellow mosaic disease symptoms in South Texas. Six complete DNA-A and five DNA-B genome sequences of MaBYMV obtained from the isolates ranged in length from 2,608 to 2,609 nucleotides (nt) and 2,578 to 2,605 nt, respectively. Both genome segments shared a 178- to 180-nt common region. In pairwise comparisons, the complete DNA-A and DNA-B sequences of MaBYMV were most similar (87-88 % and 79-81 % identity, respectively) and phylogenetically related to the corresponding sequences of sida mosaic Sinaloa virus-[MX-Gua-06]. Further analysis revealed that MaBYMV is a putative recombinant virus, thus supporting the notion that malvaceous hosts may be influencing the evolution of several begomoviruses. The design of new diagnostic primers enabled the detection of MaBYMV in cohorts of Bemisia tabaci collected from symptomatic Malvastrum sp. plants, thus implicating whiteflies as potential vectors of the virus.
The genomes and comparative genomics of Lactobacillus delbrueckii phages.

PubMed

Riipinen, Katja-Anneli; Forsman, Päivi; Alatossava, Tapani

2011-07-01

Lactobacillus delbrueckii phages are a great source of genetic diversity. Here, the genome sequences of Lb. delbrueckii phages LL-Ku, c5 and JCL1032 were analyzed in detail, and the genetic diversity of Lb. delbrueckii phages belonging to different taxonomic groups was explored. The lytic isometric group b phages LL-Ku (31,080 bp) and c5 (31,841 bp) showed a minimum nucleotide sequence identity of 90% over about three-fourths of their genomes. The genomic locations of their lysis modules were unique, and the genomes featured several putative overlapping transcription units of genes. LL-Ku and c5 virions displayed peptidoglycan hydrolytic activity associated with a ~36-kDa protein similar in size to the endolysin. Unexpectedly, the 49,433-bp genome of the prolate phage JCL1032 (temperate, group c) revealed a conserved gene order within its structural genes. Lb. delbrueckii phages representing groups a (a phage LL-H), b and c possessed only limited protein sequence homology. Genomic comparison of LL-Ku and c5 suggested that diversification of Lb. delbrueckii phages is mainly due to insertions, deletions and recombination. For the first time, the complete genome sequences of group b and c Lb. delbrueckii phages are reported.
Identification of novel alleles of the rice blast resistance gene Pi54

NASA Astrophysics Data System (ADS)

Vasudevan, Kumar; Gruissem, Wilhelm; Bhullar, Navreet K.

2015-10-01

Rice blast is one of the most devastating rice diseases and continuous resistance breeding is required to control the disease. The rice blast resistance gene Pi54 initially identified in an Indian cultivar confers broad-spectrum resistance in India. We explored the allelic diversity of the Pi54 gene among 885 Indian rice genotypes that were found resistant in our screening against field mixture of naturally existing M. oryzae strains as well as against five unique strains. These genotypes are also annotated as rice blast resistant in the International Rice Genebank database. Sequence-based allele mining was used to amplify and clone the Pi54 allelic variants. Nine new alleles of Pi54 were identified based on the nucleotide sequence comparison to the Pi54 reference sequence as well as to already known Pi54 alleles. DNA sequence analysis of the newly identified Pi54 alleles revealed several single polymorphic sites, three double deletions and an eight base pair deletion. A SNP-rich region was found between a tyrosine kinase phosphorylation site and the nucleotide binding site (NBS) domain. Together, the newly identified Pi54 alleles expand the allelic series and are candidates for rice blast resistance breeding programs.

Short-term application of dexamethasone on stem cells derived from human gingiva reduces the expression of RUNX2 and β-catenin.

PubMed

Kim, Bo-Bae; Kim, Minji; Park, Yun-Hee; Ko, Youngkyung; Park, Jun-Beom

2017-06-01

Objective Next-generation sequencing was performed to evaluate the effects of short-term application of dexamethasone on human gingiva-derived mesenchymal stem cells. Methods Human gingiva-derived stem cells were treated with a final concentration of 10 -7 M dexamethasone and the same concentration of vehicle control. This was followed by mRNA sequencing and data analysis, gene ontology and pathway analysis, quantitative real-time polymerase chain reaction of mRNA, and western blot analysis of RUNX2 and β-catenin. Results In total, 26,364 mRNAs were differentially expressed. Comparison of the results of dexamethasone versus control at 2 hours revealed that 7 mRNAs were upregulated and 25 mRNAs were downregulated. The application of dexamethasone reduced the expression of RUNX2 and β-catenin in human gingiva-derived mesenchymal stem cells. Conclusion The effects of dexamethasone on stem cells were evaluated with mRNA sequencing, and validation of the expression was performed with qualitative real-time polymerase chain reaction and western blot analysis. The results of this study can provide new insights into the role of mRNA sequencing in maxillofacial areas.
Transcriptome Analysis and Development of SSR Molecular Markers in Glycyrrhiza uralensis Fisch.

PubMed Central

Liu, Yaling; Zhang, Pengfei; Song, Meiling; Hou, Junling; Qing, Mei; Wang, Wenquan; Liu, Chunsheng

2015-01-01

Licorice is an important traditional Chinese medicine with clinical and industrial applications. Genetic resources of licorice are insufficient for analysis of molecular biology and genetic functions; as such, transcriptome sequencing must be conducted for functional characterization and development of molecular markers. In this study, transcriptome sequencing on the Illumina HiSeq 2500 sequencing platform generated a total of 5.41 Gb clean data. De novo assembly yielded a total of 46,641 unigenes. Comparison analysis using BLAST showed that the annotations of 29,614 unigenes were conserved. Further study revealed 773 genes related to biosynthesis of secondary metabolites of licorice, 40 genes involved in biosynthesis of the terpenoid backbone, and 16 genes associated with biosynthesis of glycyrrhizic acid. Analysis of unigenes larger than 1 Kb with a length of 11,702 nt presented 7,032 simple sequence repeats (SSR). Sixty-four of 69 randomly designed and synthesized SSR pairs were successfully amplified, 33 pairs of primers were polymorphism in in Glycyrrhiza uralensis Fisch., Glycyrrhiza inflata Bat., Glycyrrhiza glabra L. and Glycyrrhiza pallidiflora Maxim. This study not only presents the molecular biology data of licorice but also provides a basis for genetic diversity research and molecular marker-assisted breeding of licorice. PMID:26571372
Mitochondrial and Nuclear Ribosomal DNA Evidence Supports the Existence of a New Trichuris Species in the Endangered François’ Leaf-Monkey

PubMed Central

Liu, Guo-Hua; Gasser, Robin B.; Nejsum, Peter; Wang, Yan; Chen, Qiang; Song, Hui-Qun; Zhu, Xing-Quan

2013-01-01

The whipworm of humans, Trichuris trichiura, is responsible for a neglected tropical disease (NTD) of major importance in tropical and subtropical countries of the world. Whipworms also infect animal hosts, including pigs, dogs and non-human primates, cause clinical disease (trichuriasis) similar to that of humans. Although Trichuris species are usually considered to be host specific, it is not clear whether non-human primates are infected with T. trichiura or other species. In the present study, we sequenced the complete mitochondrial (mt) genome as well as the first and second internal transcribed spacers (ITS-1 and ITS-2) of Trichuris from the François’ leaf-monkey (langur), and compared them with homologous sequences from human- and pig-derived Trichuris. In addition, sequence comparison of a conserved mt ribosomal gene among multiple individual whipworms revealed substantial nucleotide differences among these three host species but limited sequence variation within each of them. The molecular data indicate that the monkey-derived whipworm is a separate species from that of humans. Future work should focus on detailed population genetic and morphological studies (by electron microscopy) of whipworms from various non-humans primates and humans. PMID:23840431
Mitochondrial and nuclear ribosomal DNA evidence supports the existence of a new Trichuris species in the endangered françois' leaf-monkey.

PubMed

Liu, Guo-Hua; Gasser, Robin B; Nejsum, Peter; Wang, Yan; Chen, Qiang; Song, Hui-Qun; Zhu, Xing-Quan

2013-01-01

The whipworm of humans, Trichuris trichiura, is responsible for a neglected tropical disease (NTD) of major importance in tropical and subtropical countries of the world. Whipworms also infect animal hosts, including pigs, dogs and non-human primates, cause clinical disease (trichuriasis) similar to that of humans. Although Trichuris species are usually considered to be host specific, it is not clear whether non-human primates are infected with T. trichiura or other species. In the present study, we sequenced the complete mitochondrial (mt) genome as well as the first and second internal transcribed spacers (ITS-1 and ITS-2) of Trichuris from the François' leaf-monkey (langur), and compared them with homologous sequences from human- and pig-derived Trichuris. In addition, sequence comparison of a conserved mt ribosomal gene among multiple individual whipworms revealed substantial nucleotide differences among these three host species but limited sequence variation within each of them. The molecular data indicate that the monkey-derived whipworm is a separate species from that of humans. Future work should focus on detailed population genetic and morphological studies (by electron microscopy) of whipworms from various non-humans primates and humans.
Molecular Characterization and Comparative Sequence Analysis of Defense-Related Gene, Oryza rufipogon Receptor-Like Protein Kinase 1

PubMed Central

Law, Yee-Song; Gudimella, Ranganath; Song, Beng-Kah; Ratnam, Wickneswari; Harikrishna, Jennifer Ann

2012-01-01

Many of the plant leucine rich repeat receptor-like kinases (LRR-RLKs) have been found to regulate signaling during plant defense processes. In this study, we selected and sequenced an LRR-RLK gene, designated as Oryza rufipogon receptor-like protein kinase 1 (OrufRPK1), located within yield QTL yld1.1 from the wild rice Oryza rufipogon (accession IRGC105491). A 2055 bp coding region and two exons were identified. Southern blotting determined OrufRPK1 to be a single copy gene. Sequence comparison with cultivated rice orthologs (OsI219RPK1, OsI9311RPK1 and OsJNipponRPK1, respectively derived from O. sativa ssp. indica cv. MR219, O. sativa ssp. indica cv. 9311 and O. sativa ssp. japonica cv. Nipponbare) revealed the presence of 12 single nucleotide polymorphisms (SNPs) with five non-synonymous substitutions, and 23 insertion/deletion sites. The biological role of the OrufRPK1 as a defense related LRR-RLK is proposed on the basis of cDNA sequence characterization, domain subfamily classification, structural prediction of extra cellular domains, cluster analysis and comparative gene expression. PMID:22942769
Identification, characterization and description of Arcobacter faecis sp. nov., isolated from a human waste septic tank.

PubMed

Whiteduck-Léveillée, Kerri; Whiteduck-Léveillée, Jenni; Cloutier, Michel; Tambong, James T; Xu, Renlin; Topp, Edward; Arts, Michael T; Chao, Jerry; Adam, Zaky; Lévesque, C André; Lapen, David R; Villemur, Richard; Khan, Izhar U H

2016-03-01

A study on the taxonomic classification of Arcobacter species was performed on the cultures isolated from various fecal sources where an Arcobacter strain AF1078(T) from human waste septic tank near Ottawa, Ontario, Canada was characterized using a polyphasic approach. Genetic investigations including 16S rRNA, atpA, cpn60, gyrA, gyrB and rpoB gene sequences of strain AF1078(T) are unique in comparison with other arcobacters. Phylogenetic analysis based on the 16S rRNA gene sequence revealed that the strain is most closely related to Arcobacter lanthieri and Arcobacter cibarius. Analyses of atpA, cpn60, gyrA, gyrB and rpoB gene sequences suggested that strain AF1078(T) formed a phylogenetic lineage independent of other species in the genus. Whole-genome sequence, DNA-DNA hybridization, fatty acid profile and phenotypic analysis further supported the conclusion that strain AF1078(T) represents a novel Arcobacter species, for which the name Arcobacter faecis sp. nov. is proposed, with type strain AF1078(T) (=LMG 28519(T); CCUG 66484(T)). Crown Copyright © 2015. Published by Elsevier GmbH. All rights reserved.
Zygosaccharomyces favi sp. nov., an obligate osmophilic yeast species from bee bread and honey.

PubMed

Čadež, Neža; Fülöp, László; Dlauchy, Dénes; Péter, Gábor

2015-03-01

Five yeast strains representing a hitherto undescribed yeast species were isolated from bee bread and honey in Hungary. They are obligate osmophilic, i.e. they are unable to grow in/on high water activity culture media. Following isogamous conjugation, they form 1-4 spheroid or subspheroid ascospores in persistent asci. The analysis of the sequences of their large subunit rRNA gene D1/D2 domain placed the new species in the Zygosaccharomyces clade. In terms of pairwise sequence similarity, Zygosaccharomyces gambellarensis is the most closely related species. Comparisons of D1/D2, internal transcribed spacer and translation elongation factor-1α (EF-1α) gene sequences of the five strains with that of the type strain of Z. gambellarensis revealed that they represent a new yeast species. The name Zygosaccharomyces favi sp. nov. (type strain: NCAIM Y.01994(T) = CBS 13653(T) = NRRL Y-63719(T) = ZIM 2551(T)) is proposed for this new yeast species, which based on phenotype can be distinguished from related Zygosaccharomyces species by its obligate osmophilic nature. Some intragenomic sequence variability, mainly indels, was detected among the ITS copies of the strains of the new species.
[A family of short retroposons (Squaml) from squamate reptiles (Reptilia: Squamata): structure, evolution and correlation with phylogeny].

PubMed

Kosushkin, S A; Borodulina, O R; Solov'eva, E N; Grechko, V V

2008-01-01

We have isolated and characterised sequences of a SINE family specific for squamate reptiles from a genome of lacertid lizard that we called Squam1. Copies are 360-390 bp in length and share a significant similarity with tRNA gene sequence on its 5'-end. This family was also detected by us in DNA of representatives of varanids, iguanids (anolis), gekkonids, and snakes. No signs of it were found in DNA of mammals, birds, amphibians, and crocodiles. Detailed analysis of primary structure of the retroposons obtained by us from genomic libraries or GenBank sequences was carried out. Most taxa possess 2-3 subfamilies of the SINE in their genomes with specific diagnostic features in their primary structure. Individual variability of copies in different families is about 85% and is just slightly lower on the genera level. Comparison of consensus sequences on family level reveals a high degree of structural similarity with a number of specific apomorphic features which makes it a useful marker of phylogeny for this group of reptiles. Snakes do not show specific affinity to varanids when compared to other lizards, as it was suggested earlier.
Streptococcus caprae sp. nov., isolated from Iberian ibex (Capra pyrenaica hispanica).

PubMed

Vela, A I; Mentaberre, G; Lavín, S; Domínguez, L; Fernández-Garayzábal, J F

2016-01-01

Biochemical and molecular genetic studies were performed on a novel Gram-stain-positive, catalase-negative, coccus-shaped organism isolated from tonsil samples of two Iberian ibexes. The micro-organism was identified as a streptococcal species based on its cellular, morphological and biochemical characteristics. 16S rRNA gene sequence comparison studies confirmed its identification as a member of the genus Streptococcus, but the organism did not correspond to any species of this genus. The nearest phylogenetic relative of the unknown coccus from ibex was Streptococcus porci 2923-03T (96.6 % 16S rRNA gene sequence similarity). Analysis based on rpoB and sodA gene sequences revealed sequence similarity values lower than 86.0 and 83.8 %, respectively, from the type strains of recognized Streptococcus species. The novel bacterial isolate was distinguished from Streptococcus porci and other Streptococcus species using biochemical tests. Based on both phenotypic and phylogenetic findings, it is proposed that the unknown bacterium be classified as representing a novel species of the genus Streptococcus, for which the name Streptococcus caprae sp. nov. is proposed. The type strain is DICM07-02790-1CT ( = CECT 8872T = CCUG 67170T).
Genome of the pitcher plant Cephalotus reveals genetic changes associated with carnivory.

PubMed

Fukushima, Kenji; Fang, Xiaodong; Alvarez-Ponce, David; Cai, Huimin; Carretero-Paulet, Lorenzo; Chen, Cui; Chang, Tien-Hao; Farr, Kimberly M; Fujita, Tomomichi; Hiwatashi, Yuji; Hoshi, Yoshikazu; Imai, Takamasa; Kasahara, Masahiro; Librado, Pablo; Mao, Likai; Mori, Hitoshi; Nishiyama, Tomoaki; Nozawa, Masafumi; Pálfalvi, Gergő; Pollard, Stephen T; Rozas, Julio; Sánchez-Gracia, Alejandro; Sankoff, David; Shibata, Tomoko F; Shigenobu, Shuji; Sumikawa, Naomi; Uzawa, Taketoshi; Xie, Meiying; Zheng, Chunfang; Pollock, David D; Albert, Victor A; Li, Shuaicheng; Hasebe, Mitsuyasu

2017-02-06

Carnivorous plants exploit animals as a nutritional source and have inspired long-standing questions about the origin and evolution of carnivory-related traits. To investigate the molecular bases of carnivory, we sequenced the genome of the heterophyllous pitcher plant Cephalotus follicularis, in which we succeeded in regulating the developmental switch between carnivorous and non-carnivorous leaves. Transcriptome comparison of the two leaf types and gene repertoire analysis identified genetic changes associated with prey attraction, capture, digestion and nutrient absorption. Analysis of digestive fluid proteins from C. follicularis and three other carnivorous plants with independent carnivorous origins revealed repeated co-options of stress-responsive protein lineages coupled with convergent amino acid substitutions to acquire digestive physiology. These results imply constraints on the available routes to evolve plant carnivory.
The crystal structure of choline kinase reveals a eukaryotic protein kinase fold

DOE Office of Scientific and Technical Information (OSTI.GOV)

Peisach, D.; Gee, P.; Kent, K.

2010-03-08

Choline kinase catalyzes the ATP-dependent phosphorylation of choline, the first committed step in the CDP-choline pathway for the biosynthesis of phosphatidylcholine. The 2.0 {angstrom} crystal structure of a choline kinase from C. elegans (CKA-2) reveals that the enzyme is a homodimeric protein with each monomer organized into a two-domain fold. The structure is remarkably similar to those of protein kinases and aminoglycoside phosphotransferases, despite no significant similarity in amino acid sequence. Comparisons to the structures of other kinases suggest that ATP binds to CKA-2 in a pocket formed by highly conserved and catalytically important residues. In addition, a choline bindingmore » site is proposed to be near the ATP binding pocket and formed by several structurally flexible loops.« less
Structure of T7 RNA polymerase complexed to the transcriptional inhibitor T7 lysozyme.

PubMed Central

Jeruzalmi, D; Steitz, T A

1998-01-01

The T7 RNA polymerase-T7 lysozyme complex regulates phage gene expression during infection of Escherichia coli. The 2.8 A crystal structure of the complex reveals that lysozyme binds at a site remote from the polymerase active site, suggesting an indirect mechanism of inhibition. Comparison of the T7 RNA polymerase structure with that of the homologous pol I family of DNA polymerases reveals identities in the catalytic site but also differences specific to RNA polymerase function. The structure of T7 RNA polymerase presented here differs significantly from a previously published structure. Sequence similarities between phage RNA polymerases and those from mitochondria and chloroplasts, when interpreted in the context of our revised model of T7 RNA polymerase, suggest a conserved fold. PMID:9670025
Revised Phylogeny and Novel Horizontally Acquired Virulence Determinants of the Model Soft Rot Phytopathogen Pectobacterium wasabiae SCC3193

PubMed Central

Koskinen, Patrik; Nokso-Koivisto, Jussi; Pasanen, Miia; Broberg, Martin; Plyusnin, Ilja; Törönen, Petri; Holm, Liisa; Pirhonen, Minna; Palva, E. Tapio

2012-01-01

Soft rot disease is economically one of the most devastating bacterial diseases affecting plants worldwide. In this study, we present novel insights into the phylogeny and virulence of the soft rot model Pectobacterium sp. SCC3193, which was isolated from a diseased potato stem in Finland in the early 1980s. Genomic approaches, including proteome and genome comparisons of all sequenced soft rot bacteria, revealed that SCC3193, previously included in the species Pectobacterium carotovorum, can now be more accurately classified as Pectobacterium wasabiae. Together with the recently revised phylogeny of a few P. carotovorum strains and an increasing number of studies on P. wasabiae, our work indicates that P. wasabiae has been unnoticed but present in potato fields worldwide. A combination of genomic approaches and in planta experiments identified features that separate SCC3193 and other P. wasabiae strains from the rest of soft rot bacteria, such as the absence of a type III secretion system that contributes to virulence of other soft rot species. Experimentally established virulence determinants include the putative transcriptional regulator SirB, two partially redundant type VI secretion systems and two horizontally acquired clusters (Vic1 and Vic2), which contain predicted virulence genes. Genome comparison also revealed other interesting traits that may be related to life in planta or other specific environmental conditions. These traits include a predicted benzoic acid/salicylic acid carboxyl methyltransferase of eukaryotic origin. The novelties found in this work indicate that soft rot bacteria have a reservoir of unknown traits that may be utilized in the poorly understood latent stage in planta. The genomic approaches and the comparison of the model strain SCC3193 to other sequenced Pectobacterium strains, including the type strain of P. wasabiae, provides a solid basis for further investigation of the virulence, distribution and phylogeny of soft rot bacteria and, potentially, other bacteria as well. PMID:23133391
Lachnospiraceae and Bacteroidales Alternative Fecal Indicators Reveal Chronic Human Sewage Contamination in an Urban Harbor▿†

PubMed Central

Newton, Ryan J.; VandeWalle, Jessica L.; Borchardt, Mark A.; Gorelick, Marc H.; McLellan, Sandra L.

2011-01-01

The complexity of fecal microbial communities and overlap among human and other animal sources have made it difficult to identify source-specific fecal indicator bacteria. However, the advent of next-generation sequencing technologies now provides increased sequencing power to resolve microbial community composition within and among environments. These data can be mined for information on source-specific phylotypes and/or assemblages of phylotypes (i.e., microbial signatures). We report the development of a new genetic marker for human fecal contamination identified through microbial pyrotag sequence analysis of the V6 region of the 16S rRNA gene. Sequence analysis of 37 sewage samples and comparison with database sequences revealed a human-associated phylotype within the Lachnospiraceae family, which was closely related to the genus Blautia. This phylotype, termed Lachno2, was on average the second most abundant fecal bacterial phylotype in sewage influent samples from Milwaukee, WI. We developed a quantitative PCR (qPCR) assay for Lachno2 and used it along with the qPCR-based assays for human Bacteroidales (based on the HF183 genetic marker), total Bacteroidales spp., and enterococci and the conventional Escherichia coli and enterococci plate count assays to examine the prevalence of fecal and human fecal pollution in Milwaukee's harbor. Both the conventional fecal indicators and the human-associated indicators revealed chronic fecal pollution in the harbor, with significant increases following heavy rain events and combined sewer overflows. The two human-associated genetic marker abundances were tightly correlated in the harbor, a strong indication they target the same source (i.e., human sewage). Human adenoviruses were routinely detected under all conditions in the harbor, and the probability of their occurrence increased by 154% for every 10-fold increase in the human indicator concentration. Both Lachno2 and human Bacteroidales increased specificity to detect sewage compared to general indicators, and the relationship to a human pathogen group suggests that the use of these alternative indicators will improve assessments for human health risks in urban waters. PMID:21803887
Co-circulation of West Nile virus and distinct insect-specific flaviviruses in Turkey.

PubMed

Ergünay, Koray; Litzba, Nadine; Brinkmann, Annika; Günay, Filiz; Sarıkaya, Yasemen; Kar, Sırrı; Örsten, Serra; Öter, Kerem; Domingo, Cristina; Erisoz Kasap, Özge; Özkul, Aykut; Mitchell, Luke; Nitsche, Andreas; Alten, Bülent; Linton, Yvonne-Marie

2017-03-20

Active vector surveillance provides an efficient tool for monitoring the presence or spread of emerging or re-emerging vector-borne viruses. This study was undertaken to investigate the circulation of flaviviruses. Mosquitoes were collected from 58 locations in 10 provinces across the Aegean, Thrace and Mediterranean Anatolian regions of Turkey in 2014 and 2015. Following morphological identification, mosquitoes were pooled and screened by nested and real-time PCR assays. Detected viruses were further characterised by sequencing. Positive pools were inoculated onto cell lines for virus isolation. Next generation sequencing was employed for genomic characterisation of the isolates. A total of 12,711 mosquito specimens representing 15 species were screened in 594 pools. Eleven pools (2%) were reactive in the virus screening assays. Sequencing revealed West Nile virus (WNV) in one Culex pipiens (s.l.) pool from Thrace. WNV sequence corresponded to lineage one clade 1a but clustered distinctly from the Turkish prototype isolate. In 10 pools, insect-specific flaviviruses were characterised as Culex theileri flavivirus in 5 pools of Culex theileri and one pool of Cx. pipiens (s.l.), Ochlerotatus caspius flavivirus in two pools of Aedes (Ochlerotatus) caspius, Flavivirus AV-2011 in one pool of Culiseta annulata, and an undetermined flavivirus in one pool of Uranotaenia unguiculata from the Aegean and Thrace regions. DNA forms or integration of the detected insect-specific flaviviruses were not observed. A virus strain, tentatively named as "Ochlerotatus caspius flavivirus Turkey", was isolated from an Ae. caspius pool in C6/36 cells. The viral genome comprised 10,370 nucleotides with a putative polyprotein of 3,385 amino acids that follows the canonical flavivirus polyprotein organisation. Sequence comparisons and phylogenetic analyses revealed the close relationship of this strain with Ochlerotatus caspius flavivirus from Portugal and Hanko virus from Finland. Several conserved structural and amino acid motifs were identified. We identified WNV and several distinct insect-specific flaviviruses during an extensive biosurveillance study of mosquitoes in various regions of Turkey in 2014 and 2015. Ongoing circulation of WNV is revealed, with an unprecedented genetic diversity. A probable replicating form of an insect flavivirus identified only in DNA form was detected.
K2 and K2*: efficient alignment-free sequence similarity measurement based on Kendall statistics.

PubMed

Lin, Jie; Adjeroh, Donald A; Jiang, Bing-Hua; Jiang, Yue

2018-05-15

Alignment-free sequence comparison methods can compute the pairwise similarity between a huge number of sequences much faster than sequence-alignment based methods. We propose a new non-parametric alignment-free sequence comparison method, called K2, based on the Kendall statistics. Comparing to the other state-of-the-art alignment-free comparison methods, K2 demonstrates competitive performance in generating the phylogenetic tree, in evaluating functionally related regulatory sequences, and in computing the edit distance (similarity/dissimilarity) between sequences. Furthermore, the K2 approach is much faster than the other methods. An improved method, K2*, is also proposed, which is able to determine the appropriate algorithmic parameter (length) automatically, without first considering different values. Comparative analysis with the state-of-the-art alignment-free sequence similarity methods demonstrates the superiority of the proposed approaches, especially with increasing sequence length, or increasing dataset sizes. The K2 and K2* approaches are implemented in the R language as a package and is freely available for open access (http://community.wvu.edu/daadjeroh/projects/K2/K2_1.0.tar.gz). yueljiang@163.com. Supplementary data are available at Bioinformatics online.
Molecular analysis of the split cox1 gene from the Basidiomycota Agrocybe aegerita: relationship of its introns with homologous Ascomycota introns and divergence levels from common ancestral copies.

PubMed

Gonzalez, P; Barroso, G; Labarère, J

1998-10-05

The Basidiomycota Agrocybe aegerita (Aa) mitochondrial cox1 gene (6790 nucleotides), encoding a protein of 527aa (58377Da), is split by four large subgroup IB introns possessing site-specific endonucleases assumed to be involved in intron mobility. When compared to other fungal COX1 proteins, the Aa protein is closely related to the COX1 one of the Basidiomycota Schizophyllum commune (Sc). This clade reveals a relationship with the studied Ascomycota ones, with the exception of Schizosaccharomyces pombe (Sp) which ranges in an out-group position compared with both higher fungi divisions. When comparison is extended to other kingdoms, fungal COX1 sequences are found to be more related to algae and plant ones (more than 57.5% aa similarity) than to animal sequences (53.6% aa similarity), contrasting with the previously established close relationship between fungi and animals, based on comparisons of nuclear genes. The four Aa cox1 introns are homologous to Ascomycota or algae cox1 introns sharing the same location within the exonic sequences. The percentages of identity of the intronic nucleotide sequences suggest a possible acquisition by lateral transfers of ancestral copies or of their derived sequences. These identities extend over the whole intronic sequences, arguing in favor of a transfer of the complete intron rather than a transfer limited to the encoded ORF. The intron i4 shares 74% of identity, at the nucleotidic level, with the Podospora anserina (Pa) intron i14, and up to 90.5% of aa similarity between the encoded proteins, i.e. the highest values reported to date between introns of two phylogenetically distant species. This low divergence argues for a recent lateral transfer between the two species. On the contrary, the low sequence identities (below 36%) observed between Aa i1 and the homologous Sp i1 or Prototheca wickeramii (Pw) i1 suggest a long evolution time after the separation of these sequences. The introns i2 and i3 possessed intermediate percentages of identity with their homologous Ascomycota introns. This is the first report of the complete nucleotide sequence and molecular organization of a mitochondrial cox1 gene of any member of the Basidiomycota division.
Origin of the CMS gene locus in rapeseed cybrid mitochondria: active and inactive recombination produces the complex CMS gene region in the mitochondrial genomes of Brassicaceae.

PubMed

Oshima, Masao; Kikuchi, Rie; Imamura, Jun; Handa, Hirokazu

2010-01-01

CMS (cytoplasmic male sterile) rapeseed is produced by asymmetrical somatic cell fusion between the Brassica napus cv. Westar and the Raphanus sativus Kosena CMS line (Kosena radish). The CMS rapeseed contains a CMS gene, orf125, which is derived from Kosena radish. Our sequence analyses revealed that the orf125 region in CMS rapeseed originated from recombination between the orf125/orfB region and the nad1C/ccmFN1 region by way of a 63 bp repeat. A precise sequence comparison among the related sequences in CMS rapeseed, Kosena radish and normal rapeseed showed that the orf125 region in CMS rapeseed consisted of the Kosena orf125/orfB region and the rapeseed nad1C/ccmFN1 region, even though Kosena radish had both the orf125/orfB region and the nad1C/ccmFN1 region in its mitochondrial genome. We also identified three tandem repeat sequences in the regions surrounding orf125, including a 63 bp repeat, which were involved in several recombination events. Interestingly, differences in the recombination activity for each repeat sequence were observed, even though these sequences were located adjacent to each other in the mitochondrial genome. We report results indicating that recombination events within the mitochondrial genomes are regulated at the level of specific repeat sequences depending on the cellular environment.
Comparison versus reminding.

PubMed

Tullis, Jonathan G; Goldstone, Robert L

2016-01-01

Comparison and reminding have both been shown to support learning and transfer. Comparison is thought to support transfer because it allows learners to disregard non-matching features of superficially different episodes in order to abstract the essential structure of concepts. Remindings promote memory for the individual episodes and generalization because they prompt learners to retrieve earlier episodes during the encoding of later related episodes and to compare across episodes. Across three experiments, we compared the consequences of comparison and reminding on memory and transfer. Participants studied a sequence of related, but superficially different, proverb pairs. In the comparison condition, participants saw proverb pairs presented together and compared their meaning. In the reminding condition, participants viewed proverbs one at a time and retrieved any prior studied proverb that shared the same deep meaning as the current proverb. Experiment 1 revealed that participants in the reminding condition recalled more proverbs than those in the comparison condition. Experiment 2 showed that the mnemonic benefits of reminding persisted over a one-week retention interval. Finally, in Experiment 3, we examined the ability of participants to generalize their remembered information to new items in a task that required participants to identify unstudied proverbs that shared the same meaning as studied proverbs. Comparison led to worse discrimination between proverbs related to studied proverbs and proverbs unrelated to studied proverbs than reminding. Reminding supported better memory for individual instances and transfer to new situations than comparison.
Leishmania naiffi and Leishmania guyanensis reference genomes highlight genome structure and gene evolution in the Viannia subgenus

PubMed Central

Coughlan, Simone; Taylor, Ali Shirley; Feane, Eoghan; Sanders, Mandy; Schonian, Gabriele; Cotton, James A.

2018-01-01

The unicellular protozoan parasite Leishmania causes the neglected tropical disease leishmaniasis, affecting 12 million people in 98 countries. In South America, where the Viannia subgenus predominates, so far only L. (Viannia) braziliensis and L. (V.) panamensis have been sequenced, assembled and annotated as reference genomes. Addressing this deficit in molecular information can inform species typing, epidemiological monitoring and clinical treatment. Here, L. (V.) naiffi and L. (V.) guyanensis genomic DNA was sequenced to assemble these two genomes as draft references from short sequence reads. The methods used were tested using short sequence reads for L. braziliensis M2904 against its published reference as a comparison. This assembly and annotation pipeline identified 70 additional genes not annotated on the original M2904 reference. Phylogenetic and evolutionary comparisons of L. guyanensis and L. naiffi with 10 other Viannia genomes revealed four traits common to all Viannia: aneuploidy, 22 orthologous groups of genes absent in other Leishmania subgenera, elevated TATE transposon copies and a high NADH-dependent fumarate reductase gene copy number. Within the Viannia, there were limited structural changes in genome architecture specific to individual species: a 45 Kb amplification on chromosome 34 was present in all bar L. lainsoni, L. naiffi had a higher copy number of the virulence factor leishmanolysin, and laboratory isolate L. shawi M8408 had a possible minichromosome derived from the 3’ end of chromosome 34. This combination of genome assembly, phylogenetics and comparative analysis across an extended panel of diverse Viannia has uncovered new insights into the origin and evolution of this subgenus and can help improve diagnostics for leishmaniasis surveillance. PMID:29765675

Some links on this page may take you to non-federal websites. Their policies may differ from this site.