significant sequence identity: Topics by Science.gov

Sample records for significant sequence identity

ITS rDNA sequences of Pomphorhynchus laevis (Zoega in Müller, 1776) and P. lucyi Williams and Rogers, 1984 (Acanthocephala: Palaeacanthocephala).

PubMed

Král'ová-Hromadová, Iva; Tietz, David F; Shinn, Andrew P; Spakulová, Marta

2003-10-01

The internal transcribed spacers (ITS-1 and ITS-2) of the ribosomal RNA gene of Pomphorhynchus laevis (Zoega in Müller, 1776) (Acanthocephala) isolated from various fish species across Central and Southern Europe were compared with those of P. lucyi Williams and Rogers, 1984 collected from the largemouth bass Micropterus salmonoides Boulenger from the USA. The nucleotide sequences of ITS regions of P. laevis from minnows Phoxinus phoxinus (L.) and chub Leuciscus cephalus (L.) from two distant localities in the Slovak Republic were found to be 100% identical. The ITS-1 and ITS-2 of P. laevis from chub from the Czech Republic and Italy were also mutually identical, but significantly different from Slovak worms (88.7% identity for ITS-1, 91.3% identity for ITS-2). A fifth sample collected from Barbus tyberinus Bonaparte from Italy was very similar to the sympatric Italian isolate from chub, possessing four nucleotide substitutions in ITS-1 (98.4% identity). The ITS rDNA sequences of P. lucyi differed significantly from those of P. laevis; the values of identity were 51.8-56.1% for ITS-1 and 63.1-65.3% for ITS-2, and were significantly higher than the range of P. laevis within-species variability. The results based on the ITS sequences confirmed the occurrence of strains in P. laevis from Continental Europe which are well defined by molecules but reveal only slight differences in their morphology.
Effects of learning with explicit elaboration on implicit transfer of visuomotor sequence learning.

PubMed

Tanaka, Kanji; Watanabe, Katsumi

2013-08-01

Intervals between stimuli and/or responses have significant influences on sequential learning. In the present study, we investigated whether transfer would occur even when the intervals and the visual configurations in a sequence were drastically changed so that participants did not notice that the required sequences of responses were identical. In the experiment, two (or three) sequential button presses comprised a "set," and nine (or six) consecutive sets comprised a "hyperset." In the first session, participants learned either a 2 × 9 or 3 × 6 hyperset by trial and error until they completed it 20 times without error. In the second block, the 2 × 9 (3 × 6) hyperset was changed into the 3 × 6 (2 × 9) hyperset, resulting in different visual configurations and intervals between stimuli and responses. Participants were assigned into two groups: the Identical and Random groups. In the Identical group, the sequence (i.e., the buttons to be pressed) in the second block was identical to that in the first block. In the Random group, a new hyperset was learned. Even in the Identical group, no participants noticed that the sequences were identical. Nevertheless, a significant transfer of performance occurred. However, in the subsequent experiment that did not require explicit trial-and-error learning in the first session, implicit transfer in the second session did not occur. These results indicate that learning with explicit elaboration strengthens the implicit representation of the sequence order as a whole; this might occur independently of the intervals between elements and enable implicit transfer.
GRIL: genome rearrangement and inversion locator.

PubMed

Darling, Aaron E; Mau, Bob; Blattner, Frederick R; Perna, Nicole T

2004-01-01

GRIL is a tool to automatically identify collinear regions in a set of bacterial-size genome sequences. GRIL uses three basic steps. First, regions of high sequence identity are located. Second, some of these regions are filtered based on user-specified criteria. Finally, the remaining regions of sequence identity are used to define significant collinear regions among the sequences. By locating collinear regions of sequence, GRIL provides a basis for multiple genome alignment using current alignment systems. GRIL also provides a basis for using current inversion distance tools to infer phylogeny. GRIL is implemented in C++ and runs on any x86-based Linux or Windows platform. It is available from http://asap.ahabs.wisc.edu/gril
Large-Scale Concatenation cDNA Sequencing

PubMed Central

Yu, Wei; Andersson, Björn; Worley, Kim C.; Muzny, Donna M.; Ding, Yan; Liu, Wen; Ricafrente, Jennifer Y.; Wentland, Meredith A.; Lennon, Greg; Gibbs, Richard A.

1997-01-01

A total of 100 kb of DNA derived from 69 individual human brain cDNA clones of 0.7–2.0 kb were sequenced by concatenated cDNA sequencing (CCS), whereby multiple individual DNA fragments are sequenced simultaneously in a single shotgun library. The method yielded accurate sequences and a similar efficiency compared with other shotgun libraries constructed from single DNA fragments (>20 kb). Computer analyses were carried out on 65 cDNA clone sequences and their corresponding end sequences to examine both nucleic acid and amino acid sequence similarities in the databases. Thirty-seven clones revealed no DNA database matches, 12 clones generated exact matches (≥98% identity), and 16 clones generated nonexact matches (57%–97% identity) to either known human or other species genes. Of those 28 matched clones, 8 had corresponding end sequences that failed to identify similarities. In a protein similarity search, 27 clone sequences displayed significant matches, whereas only 20 of the end sequences had matches to known protein sequences. Our data indicate that full-length cDNA insert sequences provide significantly more nucleic acid and protein sequence similarity matches than expressed sequence tags (ESTs) for database searching. [All 65 cDNA clone sequences described in this paper have been submitted to the GenBank data library under accession nos. U79240–U79304.] PMID:9110174
Characterization of acid-tolerant H/CO-utilizing methanogenic enrichment cultures from an acidic peat bog in New York State.

PubMed

Bräuer, Suzanna L; Yashiro, Erika; Ueno, Norikiyo G; Yavitt, Joseph B; Zinder, Stephen H

2006-08-01

Two methanogenic cultures were enriched from acidic peat soil using a growth medium buffered to c. pH 5. One culture, 6A, was obtained from peat after incubation with H(2)/CO(2), whereas culture NTA was derived from a 10(-4) dilution of untreated peat into a modified medium. 16S rRNA gene clone libraries from each culture contained one methanogen and two bacterial sequences. The methanogen 16S rRNA gene sequences were 99% identical with each other and belonged to the novel "R-10/Fen cluster" family of the Methanomicrobiales, whereas their mcrA sequences were 96% identical. One bacterial 16S rRNA gene sequence from culture 6A belonged to the Bacteroidetes and showed 99% identity with sequences from methanogenic enrichments from German and Russian bogs. The other sequence belonged to the Firmicutes and was identical to a thick rod-shaped citrate-utilizing organism isolated from culture 6A, the numbers of which decreased when the Ti (III) chelator was switched from citrate to nitrilotriacetate. Bacterial clones from the NTA culture clustered in the Delta- and Betaproteobacteria. Both cultures contained thin rods, presumably the methanogens, as the predominant morphotype, and represent a significant advance in characterization of the novel acidiphilic R-10 family methanogens.
Characterization of the Campylobacter jejuni cryptic plasmid pTIW94 recovered from wild birds in the southeastern United States.

PubMed

Hiett, Kelli L; Rothrock, Michael J; Seal, Bruce S

2013-09-01

The complete nucleotide sequence was determined for a cryptic plasmid, pTIW94, recovered from several Campylobacter jejuni isolates from wild birds in the southeastern United States. pTIW94 is a circular molecule of 3860 nucleotides, with a G+C content (31.0%) similar to that of many Campylobacter spp. genomes. A typical origin of replication, with iteron sequences, was identified upstream of DNA sequences that demonstrated similarity to replication initiation proteins. A total of five open reading frames (ORFs) were identified; two of the five ORFs demonstrated significant similarity to plasmid pCC2228-2 found within Campylobacter coli. These two ORFs were similar to essential replication proteins RepA (100%; 26/26 aa identity) and RepB (95%; 327/346 aa identity). A third identified ORF demonstrated significant similarity (99%; 421/424 aa identity) to the MOB protein from C. coli 67-8, originally recovered from swine. The other two identified ORFs were either similar to hypothetical proteins from other Campylobacter spp., or exhibited no significant similarity to any DNA or protein sequence in the GenBank database. Promoter regions (-35 and -10 signal sites), ribosomal binding sites upstream of ORFs, and stem-loop structures were also identified within the plasmid. These results demonstrate that pTIW94 represents a previously un-reported small cryptic plasmid with unique sequences as well as highly similar sequences to other small plasmids found within Campylobacter spp., and that this cryptic plasmid is present among Campylobacter spp. recovered from different genera of wild birds. Copyright © 2013. Published by Elsevier Inc.
Genome sequence of a distinct watermelon mosaic virus identified from ginseng (Panax ginseng) transcriptome.

PubMed

Park, D; Kim, H; Hahn, Y

Watermelon mosaic virus (WMV) is a member of the genus Potyvirus, which is the largest genus of plant viruses. WMV is a significant pathogen of crop plants, including Cucurbitaceae species. A WMV strain, designated as WMV-Pg, was identified in transcriptome data collected from ginseng (Panax ginseng) root. WMV-Pg showed 84% nucleotide sequence identity and 91% amino acid sequence identity with its closest related virus, WMV-Fr. A phylogenetic analysis of WMV-Pg with other WMVs and soybean mosaic viruses (SMVs) indicated that WMV-Pg is a distinct subtype of the WMV/SMV group of the genus Potyvirus in the family Potyviridae.
A gyrovirus infecting a sea bird

PubMed Central

Li, Linlin; Pesavento, Patricia A.; Gaynor, Anne M.; Duerr, Rebecca S.; Phan, Tung Gia; Zhang, Wen; Deng, Xutao

2015-01-01

We characterized the genome of a highly divergent gyrovirus (GyV8) in the spleen and uropygial gland tissues of a diseased northern fulmar (Fulmarus glacialis), a pelagic bird beached in San Francisco, California. No other exogenous viral sequences could be identified using viral metagenomics. The small circular DNA genome shared no significant nucleotide sequence identity, and only 38–42 % amino acid sequence identity in VP1, with any of the previously identified gyroviruses. GyV8 is the first member of the third major phylogenetic clade of this viral genus and the first gyrovirus detected in an avian species other than chicken. PMID:26036564
Complete Nucleotide Sequence of Watermelon Chlorotic Stunt Virus Originating from Oman

PubMed Central

Khan, Akhtar J.; Akhtar, Sohail; Briddon, Rob W.; Ammara, Um; Al-Matrooshi, Abdulrahman M.; Mansoor, Shahid

2012-01-01

Watermelon chlorotic stunt virus (WmCSV) is a bipartite begomovirus (genus Begomovirus, family Geminiviridae) that causes economic losses to cucurbits, particularly watermelon, across the Middle East and North Africa. Recently squash (Cucurbita moschata) grown in an experimental field in Oman was found to display symptoms such as leaf curling, yellowing and stunting, typical of a begomovirus infection. Sequence analysis of the virus isolated from squash showed 97.6–99.9% nucleotide sequence identity to previously described WmCSV isolates for the DNA A component and 93–98% identity for the DNA B component. Agrobacterium-mediated inoculation to Nicotiana benthamiana resulted in the development of symptoms fifteen days post inoculation. This is the first bipartite begomovirus identified in Oman. Overall the Oman isolate showed the highest levels of sequence identity to a WmCSV isolate originating from Iran, which was confirmed by phylogenetic analysis. This suggests that WmCSV present in Oman has been introduced from Iran. The significance of this finding is discussed. PMID:22852046
Complete nucleotide sequence of watermelon chlorotic stunt virus originating from Oman.

PubMed

Khan, Akhtar J; Akhtar, Sohail; Briddon, Rob W; Ammara, Um; Al-Matrooshi, Abdulrahman M; Mansoor, Shahid

2012-07-01

Watermelon chlorotic stunt virus (WmCSV) is a bipartite begomovirus (genus Begomovirus, family Geminiviridae) that causes economic losses to cucurbits, particularly watermelon, across the Middle East and North Africa. Recently squash (Cucurbita moschata) grown in an experimental field in Oman was found to display symptoms such as leaf curling, yellowing and stunting, typical of a begomovirus infection. Sequence analysis of the virus isolated from squash showed 97.6-99.9% nucleotide sequence identity to previously described WmCSV isolates for the DNA A component and 93-98% identity for the DNA B component. Agrobacterium-mediated inoculation to Nicotiana benthamiana resulted in the development of symptoms fifteen days post inoculation. This is the first bipartite begomovirus identified in Oman. Overall the Oman isolate showed the highest levels of sequence identity to a WmCSV isolate originating from Iran, which was confirmed by phylogenetic analysis. This suggests that WmCSV present in Oman has been introduced from Iran. The significance of this finding is discussed.
DNA sequences of three beta-1,4-endoglucanase genes from Thermomonospora fusca.

PubMed Central

Lao, G; Ghangas, G S; Jung, E D; Wilson, D B

1991-01-01

The DNA sequences of the Thermomonospora fusca genes encoding cellulases E2 and E5 and the N-terminal end of E4 were determined. Each sequence contains an identical 14-bp inverted repeat upstream of the initiation codon. There were no significant homologies between the coding regions of the three genes. The E2 gene is 73% identical to the celA gene from Microbispora bispora, but this was the only homology found with other cellulase genes. E2 belongs to a family of cellulases that includes celA from M. bispora, cenA from Cellulomonas fimi, casA from an alkalophilic Streptomyces strain, and cellobiohydrolase II from Trichoderma reesei. E4 shows 44% identity to an avocado cellulase, while E5 belongs to the Bacillus cellulase family. There were strong similarities between the amino acid sequences of the E2 and E5 cellulose binding domains, and these regions also showed homology with C. fimi and Pseudomonas fluorescens cellulose binding domains. PMID:1904434
Complete Genomic Sequence and Comparative Analysis of the Genome Segments of Sweet Potato Chlorotic Stunt Virus in China

PubMed Central

Qin, Yanhong; Wang, Li; Zhang, Zhenchen; Qiao, Qi; Zhang, Desheng; Tian, Yuting; Wang, Shuang; Wang, Yongjiang; Yan, Zhaoling

2014-01-01

Background Sweet potato chlorotic stunt virus (family Closteroviridae, genus Crinivirus) features a large bipartite, single-stranded, positive-sense RNA genome. To date, only three complete genomic sequences of SPCSV can be accessed through GenBank. SPCSV was first detected from China in 2011, only partial genomic sequences have been determined in the country. No report on the complete genomic sequence and genome structure of Chinese SPCSV isolates or the genetic relation between isolates from China and other countries is available. Methodology/Principal Findings The complete genomic sequences of five isolates from different areas in China were characterized. This study is the first to report the complete genome sequences of SPCSV from whitefly vectors. Genome structure analysis showed that isolates of WA and EA strains from China have the same coding protein as isolates Can181-9 and m2-47, respectively. Twenty cp genes and four RNA1 partial segments were sequenced and analyzed, and the nucleotide identities of complete genomic, cp, and RNA1 partial sequences were determined. Results indicated high conservation among strains and significant differences between WA and EA strains. Genetic analysis demonstrated that, except for isolates from Guangdong Province, SPCSVs from other areas belong to the WA strain. Genome organization analysis showed that the isolates in this study lack the p22 gene. Conclusions/Significance We presented the complete genome sequences of SPCSV in China. Comparison of nucleotide identities and genome structures between these isolates and previously reported isolates showed slight differences. The nucleotide identities of different SPCSV isolates showed high conservation among strains and significant differences between strains. All nine isolates in this study lacked p22 gene. WA strains were more extensively distributed than EA strains in China. These data provide important insights into the molecular variation and genomic structure of SPCSV in China as well as genetic relationships among isolates from China and other countries. PMID:25170926
OrthoANI: An improved algorithm and software for calculating average nucleotide identity.

PubMed

Lee, Imchang; Ouk Kim, Yeong; Park, Sang-Cheol; Chun, Jongsik

2016-02-01

Species demarcation in Bacteria and Archaea is mainly based on overall genome relatedness, which serves a framework for modern microbiology. Current practice for obtaining these measures between two strains is shifting from experimentally determined similarity obtained by DNA-DNA hybridization (DDH) to genome-sequence-based similarity. Average nucleotide identity (ANI) is a simple algorithm that mimics DDH. Like DDH, ANI values between two genome sequences may be different from each other when reciprocal calculations are compared. We compared 63 690 pairs of genome sequences and found that the differences in reciprocal ANI values are significantly high, exceeding 1 % in some cases. To resolve this problem of not being symmetrical, a new algorithm, named OrthoANI, was developed to accommodate the concept of orthology for which both genome sequences were fragmented and only orthologous fragment pairs taken into consideration for calculating nucleotide identities. OrthoANI is highly correlated with ANI (using BLASTn) and the former showed approximately 0.1 % higher values than the latter. In conclusion, OrthoANI provides a more robust and faster means of calculating average nucleotide identity for taxonomic purposes. The standalone software tools are freely available at http://www.ezbiocloud.net/sw/oat.
On the Role of Aggregation Prone Regions in Protein Evolution, Stability, and Enzymatic Catalysis: Insights from Diverse Analyses

PubMed Central

Buck, Patrick M.; Kumar, Sandeep; Singh, Satish K.

2013-01-01

The various roles that aggregation prone regions (APRs) are capable of playing in proteins are investigated here via comprehensive analyses of multiple non-redundant datasets containing randomly generated amino acid sequences, monomeric proteins, intrinsically disordered proteins (IDPs) and catalytic residues. Results from this study indicate that the aggregation propensities of monomeric protein sequences have been minimized compared to random sequences with uniform and natural amino acid compositions, as observed by a lower average aggregation propensity and fewer APRs that are shorter in length and more often punctuated by gate-keeper residues. However, evidence for evolutionary selective pressure to disrupt these sequence regions among homologous proteins is inconsistent. APRs are less conserved than average sequence identity among closely related homologues (≥80% sequence identity with a parent) but APRs are more conserved than average sequence identity among homologues that have at least 50% sequence identity with a parent. Structural analyses of APRs indicate that APRs are three times more likely to contain ordered versus disordered residues and that APRs frequently contribute more towards stabilizing proteins than equal length segments from the same protein. Catalytic residues and APRs were also found to be in structural contact significantly more often than expected by random chance. Our findings suggest that proteins have evolved by optimizing their risk of aggregation for cellular environments by both minimizing aggregation prone regions and by conserving those that are important for folding and function. In many cases, these sequence optimizations are insufficient to develop recombinant proteins into commercial products. Rational design strategies aimed at improving protein solubility for biotechnological purposes should carefully evaluate the contributions made by candidate APRs, targeted for disruption, towards protein structure and activity. PMID:24146608
Complementary DNA sequencing and identification of mRNAs from the venomous gland of Agkistrodon piscivorus leucostoma.

PubMed

Jia, Ying; Cantu, Bruno A; Sánchez, Elda E; Pérez, John C

2008-06-15

To advance our knowledge on the snake venom composition and transcripts expressed in venom gland at the molecular level, we constructed a cDNA library from the venom gland of Agkistrodon piscivorus leucostoma for the generation of expressed sequence tags (ESTs) database. From the randomly sequenced 2112 independent clones, we have obtained ESTs for 1309 (62%) cDNAs, which showed significant deduced amino acid sequence similarity (scores >80) to previously characterized proteins in National Center for Biotechnology Information (NCBI) database. Ribosomal proteins make up 47 clones (2%) and the remaining 756 (36%) cDNAs represent either unknown identity or show BLASTX sequence identity scores of <80 with known GenBank accessions. The most highly expressed gene encoding phospholipase A(2) (PLA(2)) accounting for 35% of A. p. leucostoma venom gland cDNAs was identified and further confirmed by crude venom applied to sodium dodecyl sulfate/polyacrylamide gel electrophoresis (SDS-PAGE) electrophoresis and protein sequencing. A total of 180 representative genes were obtained from the sequence assemblies and deposited to EST database. Clones showing sequence identity to disintegrins, thrombin-like enzymes, hemorrhagic toxins, fibrinogen clotting inhibitors and plasminogen activators were also identified in our EST database. These data can be used to develop a research program that will help us identify genes encoding proteins that are of medical importance or proteins involved in the mechanisms of the toxin venom.
Nucleotide sequencing and identification of some wild mushrooms.

PubMed

Das, Sudip Kumar; Mandal, Aninda; Datta, Animesh K; Gupta, Sudha; Paul, Rita; Saha, Aditi; Sengupta, Sonali; Dubey, Priyanka Kumari

2013-01-01

The rDNA-ITS (Ribosomal DNA Internal Transcribed Spacers) fragment of the genomic DNA of 8 wild edible mushrooms (collected from Eastern Chota Nagpur Plateau of West Bengal, India) was amplified using ITS1 (Internal Transcribed Spacers 1) and ITS2 primers and subjected to nucleotide sequence determination for identification of mushrooms as mentioned. The sequences were aligned using ClustalW software program. The aligned sequences revealed identity (homology percentage from GenBank data base) of Amanita hemibapha [CN (Chota Nagpur) 1, % identity 99 (JX844716.1)], Amanita sp. [CN 2, % identity 98 (JX844763.1)], Astraeus hygrometricus [CN 3, % identity 87 (FJ536664.1)], Termitomyces sp. [CN 4, % identity 90 (JF746992.1)], Termitomyces sp. [CN 5, % identity 99 (GU001667.1)], T. microcarpus [CN 6, % identity 82 (EF421077.1)], Termitomyces sp. [CN 7, % identity 76 (JF746993.1)], and Volvariella volvacea [CN 8, % identity 100 (JN086680.1)]. Although out of 8 mushrooms 4 could be identified up to species level, the nucleotide sequences of the rest may be relevant to further characterization. A phylogenetic tree is constructed using Neighbor-Joining method showing interrelationship between/among the mushrooms. The determined nucleotide sequences of the mushrooms may provide additional information enriching GenBank database aiding to molecular taxonomy and facilitating its domestication and characterization for human benefits.
Identification of a novel bovine enterovirus possessing highly divergent amino acid sequences in capsid protein.

PubMed

Tsuchiaka, Shinobu; Rahpaya, Sayed Samim; Otomaru, Konosuke; Aoki, Hiroshi; Kishimoto, Mai; Naoi, Yuki; Omatsu, Tsutomu; Sano, Kaori; Okazaki-Terashima, Sachiko; Katayama, Yukie; Oba, Mami; Nagai, Makoto; Mizutani, Tetsuya

2017-01-17

Bovine enterovirus (BEV) belongs to the species Enterovirus E or F, genus Enterovirus and family Picornaviridae. Although numerous studies have identified BEVs in the feces of cattle with diarrhea, the pathogenicity of BEVs remains unclear. Previously, we reported the detection of novel kobu-like virus in calf feces, by metagenomics analysis. In the present study, we identified a novel BEV in diarrheal feces collected for that survey. Complete genome sequences were determined by deep sequencing in feces. Secondary RNA structure analysis of the 5' untranslated region (UTR), phylogenetic tree construction and pairwise identity analysis were conducted. The complete genome sequences of BEV were genetically distant from other EVs and the VP1 coding region contained novel and unique amino acid sequences. We named this strain as BEV AN12/Bos taurus/JPN/2014 (referred to as BEV-AN12). According to genome analysis, the genome length of this virus is 7414 nucleotides excluding the poly (A) tail and its genome consists of a 5'UTR, open reading frame encoding a single polyprotein, and 3'UTR. The results of secondary RNA structure analysis showed that in the 5'UTR, BEV-AN12 had an additional clover leaf structure and small stem loop structure, similarly to other BEVs. In pairwise identity analysis, BEV-AN12 showed high amino acid (aa) identities to Enterovirus F in the polyprotein, P2 and P3 regions (aa identity ≥82.4%). Therefore, BEV-AN12 is closely related to Enterovirus F. However, aa sequences in the capsid protein regions, particularly the VP1 encoding region, showed significantly low aa identity to other viruses in genus Enterovirus (VP1 aa identity ≤58.6%). In addition, BEV-AN12 branched separately from Enterovirus E and F in phylogenetic trees based on the aa sequences of P1 and VP1, although it clustered with Enterovirus F in trees based on sequences in the P2 and P3 genome region. We identified novel BEV possessing highly divergent aa sequences in the VP1 coding region in Japan. According to species definition, we proposed naming this strain as "Enterovirus K", which is a novel species within genus Enterovirus. Further genomic studies are needed to understand the pathogenicity of BEVs.
Bean common mosaic virus isolates causing different symptoms in asparagus bean in China differ greatly in the 5'-parts of their genomes.

PubMed

Zheng, Hongying; Chen, Jiong; Chen, Jianping; Adams, Michael J; Hou, Mingsheng

2002-06-01

Potyvirus isolates from asparagus bean ( Vigna sesquipedalis) plants in Zhejiang province, China, caused either rugose and vein banding mosaic symptoms (isolate R) or severe yellowing (isolate Y) in this host, but were otherwise similar in host range. Both isolates were completely sequenced and shown to be isolates of Bean common mosaic virus (BCMV). The complete sequences were 9992 (R) or 10062 (Y) nucleotides long and shared 91.7% identical nucleotides (93.2% identical amino acids) in their genomes and were more distantly related to the BCMV-Peanut stripe virus sequence (PStV). The isolates were much less similar to one another in the 5'-UTR and the N-terminal region of the P1 protein. In the P1, isolate Y was closer to PStV (76.1% identical amino acids) than to isolate R (64.8%). Phylogenetic analyses of the coat protein region showed that the new isolates grouped with other isolates from Vigna spp., forming the blackeye cowpea mosaic strain subgroup of BCMV with 94-98% nucleotides (96-99% amino acids) identical to one another and about 90% identity to other BCMV isolates. Other significant subgroupings amongst published BCMV isolates were detected.
Subgrouping Automata: automatic sequence subgrouping using phylogenetic tree-based optimum subgrouping algorithm.

PubMed

Seo, Joo-Hyun; Park, Jihyang; Kim, Eun-Mi; Kim, Juhan; Joo, Keehyoung; Lee, Jooyoung; Kim, Byung-Gee

2014-02-01

Sequence subgrouping for a given sequence set can enable various informative tasks such as the functional discrimination of sequence subsets and the functional inference of unknown sequences. Because an identity threshold for sequence subgrouping may vary according to the given sequence set, it is highly desirable to construct a robust subgrouping algorithm which automatically identifies an optimal identity threshold and generates subgroups for a given sequence set. To meet this end, an automatic sequence subgrouping method, named 'Subgrouping Automata' was constructed. Firstly, tree analysis module analyzes the structure of tree and calculates the all possible subgroups in each node. Sequence similarity analysis module calculates average sequence similarity for all subgroups in each node. Representative sequence generation module finds a representative sequence using profile analysis and self-scoring for each subgroup. For all nodes, average sequence similarities are calculated and 'Subgrouping Automata' searches a node showing statistically maximum sequence similarity increase using Student's t-value. A node showing the maximum t-value, which gives the most significant differences in average sequence similarity between two adjacent nodes, is determined as an optimum subgrouping node in the phylogenetic tree. Further analysis showed that the optimum subgrouping node from SA prevents under-subgrouping and over-subgrouping. Copyright © 2013. Published by Elsevier Ltd.
How close is close: 16S rRNA sequence identity may not be sufficient to guarantee species identity

NASA Technical Reports Server (NTRS)

Fox, G. E.; Wisotzkey, J. D.; Jurtshuk, P. Jr

1992-01-01

16S rRNA (genes coding for rRNA) sequence comparisons were conducted with the following three psychrophilic strains: Bacillus globisporus W25T (T = type strain) and Bacillus psychrophilus W16AT, and W5. These strains exhibited more than 99.5% sequence identity and within experimental uncertainty could be regarded as identical. Their close taxonomic relationship was further documented by phenotypic similarities. In contrast, previously published DNA-DNA hybridization results have convincingly established that these strains do not belong to the same species if current standards are used. These results emphasize the important point that effective identity of 16S rRNA sequences is not necessarily a sufficient criterion to guarantee species identity. Thus, although 16S rRNA sequences can be used routinely to distinguish and establish relationships between genera and well-resolved species, very recently diverged species may not be recognizable.

Accuracy of taxonomy prediction for 16S rRNA and fungal ITS sequences

PubMed Central

2018-01-01

Prediction of taxonomy for marker gene sequences such as 16S ribosomal RNA (rRNA) is a fundamental task in microbiology. Most experimentally observed sequences are diverged from reference sequences of authoritatively named organisms, creating a challenge for prediction methods. I assessed the accuracy of several algorithms using cross-validation by identity, a new benchmark strategy which explicitly models the variation in distances between query sequences and the closest entry in a reference database. When the accuracy of genus predictions was averaged over a representative range of identities with the reference database (100%, 99%, 97%, 95% and 90%), all tested methods had ≤50% accuracy on the currently-popular V4 region of 16S rRNA. Accuracy was found to fall rapidly with identity; for example, better methods were found to have V4 genus prediction accuracy of ∼100% at 100% identity but ∼50% at 97% identity. The relationship between identity and taxonomy was quantified as the probability that a rank is the lowest shared by a pair of sequences with a given pair-wise identity. With the V4 region, 95% identity was found to be a twilight zone where taxonomy is highly ambiguous because the probabilities that the lowest shared rank between pairs of sequences is genus, family, order or class are approximately equal. PMID:29682424
An Accurate Scalable Template-based Alignment Algorithm

PubMed Central

Gardner, David P.; Xu, Weijia; Miranker, Daniel P.; Ozer, Stuart; Cannone, Jamie J.; Gutell, Robin R.

2013-01-01

The rapid determination of nucleic acid sequences is increasing the number of sequences that are available. Inherent in a template or seed alignment is the culmination of structural and functional constraints that are selecting those mutations that are viable during the evolution of the RNA. While we might not understand these structural and functional, template-based alignment programs utilize the patterns of sequence conservation to encapsulate the characteristics of viable RNA sequences that are aligned properly. We have developed a program that utilizes the different dimensions of information in rCAD, a large RNA informatics resource, to establish a profile for each position in an alignment. The most significant include sequence identity and column composition in different phylogenetic taxa. We have compared our methods with a maximum of eight alternative alignment methods on different sets of 16S and 23S rRNA sequences with sequence percent identities ranging from 50% to 100%. The results showed that CRWAlign outperformed the other alignment methods in both speed and accuracy. A web-based alignment server is available at http://www.rna.ccbb.utexas.edu/SAE/2F/CRWAlign. PMID:24772376
DNA sequence analysis of a 10 624 bp fragment of the left arm of chromosome XV from Saccharomyces cerevisiae reveals a RNA binding protein, a mitochondrial protein, two ribosomal proteins and two new open reading frames.

PubMed

Lafuente, M J; Gamo, F J; Gancedo, C

1996-09-01

We have determined the sequence of a 10624 bp DNA segment located in the left arm of chromosome XV of Saccharomyces cerevisiae. The sequence contains eight open reading frames (ORFs) longer than 100 amino acids. Two of them do not present significant homology with sequences found in the databases. The product of ORF o0553 is identical to the protein encoded by the gene SMF1. Internal to it there is another ORF, o0555 that is apparently expressed. The proteins encoded by ORFs o0559 and o0565 are identical to ribosomal proteins S19.e and L18 respectively. ORF o0550 encodes a protein with an RNA binding signature including RNP motifs and stretches rich in asparagine, glutamine and arginine.
Sarcocystis spp. in domestic sheep in Kunming City, China: prevalence, morphology, and molecular characteristics.

PubMed

Hu, Jun-Jie; Huang, Si; Wen, Tao; Esch, Gerald W; Liang, Yu; Li, Hong-Liang

2017-01-01

Sheep (Ovis aries) are intermediate hosts for at least six named species of Sarcocystis: S. tenella, S. arieticanis, S. gigantea, S. medusiformis, S. mihoensis, and S. microps. Here, only two species, S. tenella and S. arieticanis, were found in 79 of 86 sheep (91.9%) in Kunming, China, based on their morphological characteristics. Four genetic markers, i.e., 18S rRNA gene, 28S rRNA gene, mitochondrial cox1 gene, and ITS-1 region, were sequenced and characterized for the two species of Sarcocystis. Sequences of the three former markers for S. tenella shared high identities with those of S. capracanis in goats, i.e., 99.0%, 98.3%, and 93.6%, respectively; the same three marker sequences of S. arieticanis shared high identities with those of S. hircicanis in goats, i.e., 98.5%, 96.5%, and 92.5%, respectively. No sequences in GenBank were found to significantly resemble the ITS-1 regions of S. tenella and S. arieticanis. Identities of the four genetic markers for S. tenella and S. arieticanis were 96.3%, 95.4%, 82.5%, and 66.2%, respectively. © J.-J. Hu et al., published by EDP Sciences, 2017.
Coming-out across the life course: implications of age and historical context.

PubMed

Floyd, Frank J; Bakeman, Roger

2006-06-01

Effects of age and the calendar year when individuals first self-identified as gay, lesbian, or bisexual on their sexual orientation identity development were examined in a large community sample (N=767, 47% female, 18-74-years-old). These 2 variables were used to examine the timing and sequencing of 7 coming-out experiences: first awareness of same-sex attraction; first sexual experiences with opposite-sex partners; first sexual experiences with same-sex partners; self-identification as gay, lesbian, or bisexual; disclosure to someone other than a parent; disclosure to mother; and disclosure to father. The significant effects of age revealed that self-identification in adolescence as opposed to adulthood was associated with an overall young coming-out trajectory for all milestone experiences, which occurred in both earlier and recent historical contexts. Adolescents as opposed to adult self-identifiers were also more likely to demonstrate identity-centered sequences in which self-identification preceded same-sex sexual experiences, and fewer of these individuals had any heterosexual experience. Significant historical context effects indicated recent trends toward younger disclosure of orientation to others and to parents, greater likelihood of an identity-centered sequence, and younger ages for first heterosexual but not same-sex, sexual experiences. Among women, there was a recent trend toward greater likelihood of having a bisexual identity milestone. In general, the maturational effects were independent of historical context, with the exception that only adolescent self-identifiers who came out recently disclosed to others and to parents at an average age younger than 18 years. These developmental and historical trends expand on the stage-sequential framework to show how the process of sexual orientation identity development is driven by maturational factors as well as social changes.
Molecular characterization of Atractolytocestus sagittatus (Cestoda: Caryophyllidea), monozoic parasite of common carp, and its differentiation from the invasive species Atractolytocestus huronensis.

PubMed

Bazsalovicsová, Eva; Králová-Hromadová, Ivica; Stefka, Jan; Scholz, Tomáš

2012-05-01

Sequence structure of complete internal transcribed spacer 1 and 2 (ITS1 and ITS2) of the ribosomal DNA region and partial mitochondrial cytochrome c oxidase subunit I (cox1) gene sequences were studied in the monozoic tapeworm Atractolytocestus sagittatus (Kulakovskaya et Akhmerov, 1965) (Cestoda: Caryophyllidea), a parasite of common carp (Cyprinus carpio carpio L.). Intraindividual sequence diversity was observed in both ribosomal spacers. In ITS1, a total number of 19 recombinant clones yielded eight different sequence types (pairwise sequence identity, 99.7-100%) which, however, did not resemble the structure typical for divergent intragenomic ITS copies (paralogues). Polymorphism was displayed by several single nucleotide mutations present exclusively in single clones, but variation in the number of short repetitive motifs was not observed. In ITS2, a total of 21 recombinant clones yielded ten different sequence types (pairwise sequence identity, 97.5-100%). They were mostly characterized by a varying number of (TCGT)(n) repeats resulting in assortment of ITS2 sequences into two sequence variants, which reflected the structure specific for ITS paralogues. The third DNA region analysed, mitochondrial cox1 gene (669 bp) was detected to be 100% identical in all studied A. sagittatus individuals. Comparison of molecular data on A. sagittatus with those on Atractolytocestus huronensis Anthony, 1958, an invasive parasite of common carp, has shown that interspecific differences significantly exceeded intraspecific variation in both ribosomal spacers (81.4-82.5% in ITS1, 74.4-75.2% in ITS2) as well as in mitochondrial cox1, which confirms validity of both congeneric tapeworms parasitic in the same fish host.
Purification and characterization of the restriction endonuclease RsrI, an isoschizomer of EcoRI.

PubMed

Greene, P J; Ballard, B T; Stephenson, F; Kohr, W J; Rodriguez, H; Rosenberg, J M; Boyer, H W

1988-08-15

Rhodobacter sphaeroides strain 630 produces restriction enzyme RsrI which is an isoschizomer of EcoRI. We have purified this enzyme and initiated a comparison with the EcoRI endonuclease. The properties of RsrI are consistent with a reaction mechanism similar to that of EcoRI: the position of cleavage within the -GAATTC-site is identical, the MgCl2 optimum for the cleavage is identical, and the pH profile is similar. Methylation of the substrate sequence by the EcoRI methylase protects the site from cleavage by the RsrI endonuclease. RsrI cross-reacts strongly with anti-EcoRI serum indicating three-dimensional structural similarities. We have determined the sequence of 34 N terminal amino acids for RsrI and this sequence possesses significant similarity to the EcoRI N terminus.
Merida virus, a putative novel rhabdovirus discovered in Culex and Ochlerotatus spp. mosquitoes in the Yucatan Peninsula of Mexico.

PubMed

Charles, Jermilia; Firth, Andrew E; Loroño-Pino, Maria A; Garcia-Rejon, Julian E; Farfan-Ale, Jose A; Lipkin, W Ian; Blitvich, Bradley J; Briese, Thomas

2016-04-01

Sequences corresponding to a putative, novel rhabdovirus [designated Merida virus (MERDV)] were initially detected in a pool of Culex quinquefasciatus collected in the Yucatan Peninsula of Mexico. The entire genome was sequenced, revealing 11 798 nt and five major ORFs, which encode the nucleoprotein (N), phosphoprotein (P), matrix protein (M), glycoprotein (G) and RNA-dependent RNA polymerase (L). The deduced amino acid sequences of the N, G and L proteins have no more than 24, 38 and 43 % identity, respectively, to the corresponding sequences of all other known rhabdoviruses, whereas those of the P and M proteins have no significant identity with any sequences in GenBank and their identity is only suggested based on their genome position. Using specific reverse transcription-PCR assays established from the genome sequence, 27 571 C. quinquefasciatus which had been sorted in 728 pools were screened to assess the prevalence of MERDV in nature and 25 pools were found positive. The minimal infection rate (calculated as the number of positive mosquito pools per 1000 mosquitoes tested) was 0.9, and similar for both females and males. Screening another 140 pools of 5484 mosquitoes belonging to four other genera identified positive pools of Ochlerotatus spp. mosquitoes, indicating that the host range is not restricted to C. quinquefasciatus. Attempts to isolate MERDV in C6/36 and Vero cells were unsuccessful. In summary, we provide evidence that a previously undescribed rhabdovirus occurs in mosquitoes in Mexico.
Electron microscopic analysis and structural characterization of novel NADP(H)-containing methanol: N,N'-dimethyl-4-nitrosoaniline oxidoreductases from the gram-positive methylotrophic bacteria Amycolatopsis methanolica and Mycobacterium gastri MB19.

PubMed Central

Bystrykh, L V; Vonck, J; van Bruggen, E F; van Beeumen, J; Samyn, B; Govorukhina, N I; Arfman, N; Duine, J A; Dijkhuizen, L

1993-01-01

The quaternary protein structure of two methanol:N,N'-dimethyl-4-nitrosoaniline (NDMA) oxidoreductases purified from Amycolatopsis methanolica and Mycobacterium gastri MB19 was analyzed by electron microscopy and image processing. The enzymes are decameric proteins (displaying fivefold symmetry) with estimated molecular masses of 490 to 500 kDa based on their subunit molecular masses of 49 to 50 kDa. Both methanol:NDMA oxidoreductases possess a tightly but noncovalently bound NADP(H) cofactor at an NADPH-to-subunit molar ratio of 0.7. These cofactors are redox active toward alcohol and aldehyde substrates. Both enzymes contain significant amounts of Zn2+ and Mg2+ ions. The primary amino acid sequences of the A. methanolica and M. gastri MB19 methanol:NDMA oxidoreductases share a high degree of identity, as indicated by N-terminal sequence analysis (63% identity among the first 27 N-terminal amino acids), internal peptide sequence analysis, and overall amino acid composition. The amino acid sequence analysis also revealed significant similarity to a decameric methanol dehydrogenase of Bacillus methanolicus C1. Images PMID:8449887
Partial Shotgun Sequencing of the Boechera stricta Genome Reveals Extensive Microsynteny and Promoter Conservation with Arabidopsis1[W

PubMed Central

Windsor, Aaron J.; Schranz, M. Eric; Formanová, Nataša; Gebauer-Jung, Steffi; Bishop, John G.; Schnabelrauch, Domenica; Kroymann, Juergen; Mitchell-Olds, Thomas

2006-01-01

Comparative genomics provides insight into the evolutionary dynamics that shape discrete sequences as well as whole genomes. To advance comparative genomics within the Brassicaceae, we have end sequenced 23,136 medium-sized insert clones from Boechera stricta, a wild relative of Arabidopsis (Arabidopsis thaliana). A significant proportion of these sequences, 18,797, are nonredundant and display highly significant similarity (BLASTn e-value ≤ 10−30) to low copy number Arabidopsis genomic regions, including more than 9,000 annotated coding sequences. We have used this dataset to identify orthologous gene pairs in the two species and to perform a global comparison of DNA regions 5′ to annotated coding regions. On average, the 500 nucleotides upstream to coding sequences display 71.4% identity between the two species. In a similar analysis, 61.4% identity was observed between 5′ noncoding sequences of Brassica oleracea and Arabidopsis, indicating that regulatory regions are not as diverged among these lineages as previously anticipated. By mapping the B. stricta end sequences onto the Arabidopsis genome, we have identified nearly 2,000 conserved blocks of microsynteny (bracketing 26% of the Arabidopsis genome). A comparison of fully sequenced B. stricta inserts to their homologous Arabidopsis genomic regions indicates that indel polymorphisms >5 kb contribute substantially to the genome size difference observed between the two species. Further, we demonstrate that microsynteny inferred from end-sequence data can be applied to the rapid identification and cloning of genomic regions of interest from nonmodel species. These results suggest that among diploid relatives of Arabidopsis, small- to medium-scale shotgun sequencing approaches can provide rapid and cost-effective benefits to evolutionary and/or functional comparative genomic frameworks. PMID:16607030
Phylogenetic relationships in three species of canine Demodex mite based on partial sequences of mitochondrial 16S rDNA.

PubMed

Sastre, Natalia; Ravera, Ivan; Villanueva, Sergio; Altet, Laura; Bardagí, Mar; Sánchez, Armand; Francino, Olga; Ferrer, Lluís

2012-12-01

The historical classification of Demodex mites has been based on their hosts and morphological features. Genome sequencing has proved to be a very effective taxonomic tool in phylogenetic studies and has been applied in the classification of Demodex. Mitochondrial 16S rDNA has been demonstrated to be an especially useful marker to establish phylogenetic relationships. To amplify and sequence a segment of the mitochondrial 16S rDNA from Demodex canis and Demodex injai, as well as from the short-bodied mite called, unofficially, D. cornei and to determine their genetic proximity. Demodex mites were examined microscopically and classified as Demodex folliculorum (one sample), D. canis (four samples), D. injai (two samples) or the short-bodied species D. cornei (three samples). DNA was extracted, and a 338 bp fragment of the 16S rDNA was amplified and sequenced. The sequences of the four D. canis mites were identical and shared 99.6 and 97.3% identity with two D. canis sequences available at GenBank. The sequences of the D. cornei isolates were identical and showed 97.8, 98.2 and 99.6% identity with the D. canis isolates. The sequences of the two D. injai isolates were also identical and showed 76.6% identity with the D. canis sequence. Demodex canis and D. injai are two different species, with a genetic distance of 23.3%. It would seem that the short-bodied Demodex mite D. cornei is a morphological variant of D. canis. © 2012 The Authors. Veterinary Dermatology © 2012 ESVD and ACVD.
Mitochondrial DNA Evidence Supports the Hypothesis that Triodontophorus Species Belong to Cyathostominae

PubMed Central

Gao, Yuan; Zhang, Yan; Yang, Xin; Qiu, Jian-Hua; Duan, Hong; Xu, Wen-Wen; Chang, Qiao-Cheng; Wang, Chun-Ren

2017-01-01

Equine strongyles, the significant nematode pathogens of horses, are characterized by high quantities and species abundance, but classification of this group of parasitic nematodes is debated. Mitochondrial (mt) genome DNA data are often used to address classification controversies. Thus, the objectives of this study were to determine the complete mt genomes of three Cyathostominae nematode species (Cyathostomum catinatum, Cylicostephanus minutus, and Poteriostomum imparidentatum) of horses and reconstruct the phylogenetic relationship of Strongylidae with other nematodes in Strongyloidea to test the hypothesis that Triodontophorus spp. belong to Cyathostominae using the mt genomes. The mt genomes of Cy. catinatum, Cs. minutus, and P. imparidentatum were 13,838, 13,826, and 13,817 bp in length, respectively. Complete mt nucleotide sequence comparison of all Strongylidae nematodes revealed that sequence identity ranged from 77.8 to 91.6%. The mt genome sequences of Triodontophorus species had relatively high identity with Cyathostominae nematodes, rather than Strongylus species of the same subfamily (Strongylinae). Comparative analyses of mt genome organization for Strongyloidea nematodes sequenced to date revealed that members of this superfamily possess identical gene arrangements. Phylogenetic analyses using mtDNA data indicated that the Triodontophorus species clustered with Cyathostominae species instead of Strongylus species. The present study first determined the complete mt genome sequences of Cy. catinatum, Cs. minutus, and P. imparidentatum, which will provide novel genetic markers for further studies of Strongylidae taxonomy, population genetics, and systematics. Importantly, sequence comparison and phylogenetic analyses based on mtDNA sequences supported the hypothesis that Triodontophorus belongs to Cyathostominae. PMID:28824575
Bidirectional Retroviral Integration Site PCR Methodology and Quantitative Data Analysis Workflow.

PubMed

Suryawanshi, Gajendra W; Xu, Song; Xie, Yiming; Chou, Tom; Kim, Namshin; Chen, Irvin S Y; Kim, Sanggu

2017-06-14

Integration Site (IS) assays are a critical component of the study of retroviral integration sites and their biological significance. In recent retroviral gene therapy studies, IS assays, in combination with next-generation sequencing, have been used as a cell-tracking tool to characterize clonal stem cell populations sharing the same IS. For the accurate comparison of repopulating stem cell clones within and across different samples, the detection sensitivity, data reproducibility, and high-throughput capacity of the assay are among the most important assay qualities. This work provides a detailed protocol and data analysis workflow for bidirectional IS analysis. The bidirectional assay can simultaneously sequence both upstream and downstream vector-host junctions. Compared to conventional unidirectional IS sequencing approaches, the bidirectional approach significantly improves IS detection rates and the characterization of integration events at both ends of the target DNA. The data analysis pipeline described here accurately identifies and enumerates identical IS sequences through multiple steps of comparison that map IS sequences onto the reference genome and determine sequencing errors. Using an optimized assay procedure, we have recently published the detailed repopulation patterns of thousands of Hematopoietic Stem Cell (HSC) clones following transplant in rhesus macaques, demonstrating for the first time the precise time point of HSC repopulation and the functional heterogeneity of HSCs in the primate system. The following protocol describes the step-by-step experimental procedure and data analysis workflow that accurately identifies and quantifies identical IS sequences.
Sequence determination and analysis of the NSs genes of two tospoviruses.

PubMed

Hallwass, Mariana; Leastro, Mikhail O; Lima, Mirtes F; Inoue-Nagata, Alice K; Resende, Renato O

2012-03-01

The tospoviruses groundnut ringspot virus (GRSV) and zucchini lethal chlorosis virus (ZLCV) cause severe losses in many crops, especially in solanaceous and cucurbit species. In this study, the non-structural NSs gene and the 5'UTRs of these two biologically distinct tospoviruses were cloned and sequenced. The NSs sequence of GRSV and ZLCV were both 1,404 nucleotides long. Pairwise comparison showed that the NSs amino acid sequence of GRSV shared 69.6% identity with that of ZLCV and 75.9% identity with that of TSWV, while the NSs sequence of ZLCV and TSWV shared 67.9% identity. Phylogenetic analysis based on NSs sequences confirmed that these viruses cluster in the American clade.
Genomic library screening for viruses from the human dental plaque revealed pathogen-specific lytic phage sequences.

PubMed

Al-Jarbou, Ahmed Nasser

2012-01-01

Bacterial pathogenesis presents an astounding arsenal of virulence factors that allow them to conquer many different niches throughout the course of infection. Principally fascinating is the fact that some bacterial species are able to induce different diseases by expression of different combinations of virulence factors. Nevertheless, studies aiming at screening for the presence of bacteriophages in humans have been limited. Such screening procedures would eventually lead to identification of phage-encoded properties that impart increased bacterial fitness and/or virulence in a particular niche, and hence, would potentially be used to reverse the course of bacterial infections. As the human oral cavity represents a rich and dynamic ecosystem for several upper respiratory tract pathogens. However, little is known about virus diversity in human dental plaque which is an important reservoir. We applied the culture-independent approach to characterize virus diversity in human dental plaque making a library from a virus DNA fraction amplified using a multiple displacement method and sequenced 80 clones. The resulting sequence showed 44% significant identities to GenBank databases by TBLASTX analysis. TBLAST homology comparisons showed that 66% was viral; 18% eukarya; 10% bacterial; 6% mobile elements. These sequences were sorted into 6 contigs and 45 single sequences in which 4 contigs and a single sequence showed significant identity to a small region of a putative prophage in the Corynebacterium diphtheria genome. These findings interestingly highlight the uniqueness of over half of the sequences, whilst the dominance of a pathogen-specific prophage sequences imply their role in virulence.
Congruency sequence effect in cross-task context: evidence for dimension-specific modulation.

PubMed

Lee, Jaeyong; Cho, Yang Seok

2013-11-01

The congruency sequence effect refers to a reduced congruency effect after incongruent trials relative to congruent trials. This modulation is thought to be, at least in part, due to the control mechanisms resolving conflict. The present study examined the nature of the control mechanisms by having participants perform two different tasks in an alternating way. When participants performed horizontal and vertical Simon tasks in Experiment 1A, and horizontal and vertical spatial Stroop task in Experiment 1B, no congruency sequence effect was obtained between the task congruencies. When the Simon task and spatial Stroop task were performed with different response sets in Experiment 2, no congruency sequence effect was obtained. However, in Experiment 3, in which the participants performed the horizontal Simon and spatial Stroop tasks with an identical response set, a significant congruency sequence effect was obtained between the task congruencies. In Experiment 4, no congruency sequence effect was obtained when participants performed two tasks having different task-irrelevant dimensions with the identical response set. The findings suggest inhibitory processing between the task-irrelevant dimension and response mode after conflict. © 2013 Elsevier B.V. All rights reserved.
Characterization of the Complete Mitochondrial Genome Sequence of Spirometra erinaceieuropaei (Cestoda: Diphyllobothriidae) from China

PubMed Central

Liu, Guo-Hua; Li, Chun; Li, Jia-Yuan; Zhou, Dong-Hui; Xiong, Rong-Chuan; Lin, Rui-Qing; Zou, Feng-Cai; Zhu, Xing-Quan

2012-01-01

Sparganosis, caused by the plerocercoid larvae of members of the genus Spirometra, can cause significant public health problem and considerable economic losses. In the present study, the complete mitochondrial DNA (mtDNA) sequence of Spirometra erinaceieuropaei from China was determined, characterized and compared with that of S. erinaceieuropaei from Japan. The gene arrangement in the mt genome sequences of S. erinaceieuropaei from China and Japan is identical. The identity of the mt genomes was 99.1% between S. erinaceieuropaei from China and Japan, and the complete mtDNA sequence of S. erinaceieuropaei from China is slightly shorter (2 bp) than that from Japan. Phylogenetic analysis of S. erinaceieuropaei with other representative cestodes using two different computational algorithms [Bayesian inference (BI) and maximum likelihood (ML)] based on concatenated amino acid sequences of 12 protein-coding genes, revealed that S. erinaceieuropaei is closely related to Diphyllobothrium spp., supporting classification based on morphological features. The present study determined the complete mtDNA sequences of S. erinaceieuropaei from China that provides novel genetic markers for studying the population genetics and molecular epidemiology of S. erinaceieuropaei in humans and animals. PMID:22553464
Incorporation of unique molecular identifiers in TruSeq adapters improves the accuracy of quantitative sequencing.

PubMed

Hong, Jungeui; Gresham, David

2017-11-01

Quantitative analysis of next-generation sequencing (NGS) data requires discriminating duplicate reads generated by PCR from identical molecules that are of unique origin. Typically, PCR duplicates are identified as sequence reads that align to the same genomic coordinates using reference-based alignment. However, identical molecules can be independently generated during library preparation. Misidentification of these molecules as PCR duplicates can introduce unforeseen biases during analyses. Here, we developed a cost-effective sequencing adapter design by modifying Illumina TruSeq adapters to incorporate a unique molecular identifier (UMI) while maintaining the capacity to undertake multiplexed, single-index sequencing. Incorporation of UMIs into TruSeq adapters (TrUMIseq adapters) enables identification of bona fide PCR duplicates as identically mapped reads with identical UMIs. Using TrUMIseq adapters, we show that accurate removal of PCR duplicates results in improved accuracy of both allele frequency (AF) estimation in heterogeneous populations using DNA sequencing and gene expression quantification using RNA-Seq.
Analysis of drug binding pockets and repurposing opportunities for twelve essential enzymes of ESKAPE pathogens

PubMed Central

Naz, Sadia; Ngo, Tony; Farooq, Umar

2017-01-01

Background The rapid increase in antibiotic resistance by various bacterial pathogens underlies the significance of developing new therapies and exploring different drug targets. A fraction of bacterial pathogens abbreviated as ESKAPE by the European Center for Disease Prevention and Control have been considered a major threat due to the rise in nosocomial infections. Here, we compared putative drug binding pockets of twelve essential and mostly conserved metabolic enzymes in numerous bacterial pathogens including those of the ESKAPE group and Mycobacterium tuberculosis. The comparative analysis will provide guidelines for the likelihood of transferability of the inhibitors from one species to another. Methods Nine bacterial species including six ESKAPE pathogens, Mycobacterium tuberculosis along with Mycobacterium smegmatis and Eschershia coli, two non-pathogenic bacteria, have been selected for drug binding pocket analysis of twelve essential enzymes. The amino acid sequences were obtained from Uniprot, aligned using ICM v3.8-4a and matched against the Pocketome encyclopedia. We used known co-crystal structures of selected target enzyme orthologs to evaluate the location of their active sites and binding pockets and to calculate a matrix of pairwise sequence identities across each target enzyme across the different species. This was used to generate sequence maps. Results High sequence identity of enzyme binding pockets, derived from experimentally determined co-crystallized structures, was observed among various species. Comparison at both full sequence level and for drug binding pockets of key metabolic enzymes showed that binding pockets are highly conserved (sequence similarity up to 100%) among various ESKAPE pathogens as well as Mycobacterium tuberculosis. Enzymes orthologs having conserved binding sites may have potential to interact with inhibitors in similar way and might be helpful for design of similar class of inhibitors for a particular species. The derived pocket alignments and distance-based maps provide guidelines for drug discovery and repurposing. In addition they also provide recommendations for the relevant model bacteria that may be used for initial drug testing. Discussion Comparing ligand binding sites through sequence identity calculation could be an effective approach to identify conserved orthologs as drug binding pockets have shown higher level of conservation among various species. By using this approach we could avoid the problems associated with full sequence comparison. We identified essential metabolic enzymes among ESKAPE pathogens that share high sequence identity in their putative drug binding pockets (up to 100%), of which known inhibitors can potentially antagonize these identical pockets in the various species in a similar manner. PMID:28948099
Analysis of drug binding pockets and repurposing opportunities for twelve essential enzymes of ESKAPE pathogens.

PubMed

Naz, Sadia; Ngo, Tony; Farooq, Umar; Abagyan, Ruben

2017-01-01

The rapid increase in antibiotic resistance by various bacterial pathogens underlies the significance of developing new therapies and exploring different drug targets. A fraction of bacterial pathogens abbreviated as ESKAPE by the European Center for Disease Prevention and Control have been considered a major threat due to the rise in nosocomial infections. Here, we compared putative drug binding pockets of twelve essential and mostly conserved metabolic enzymes in numerous bacterial pathogens including those of the ESKAPE group and Mycobacterium tuberculosis . The comparative analysis will provide guidelines for the likelihood of transferability of the inhibitors from one species to another. Nine bacterial species including six ESKAPE pathogens, Mycobacterium tuberculosis along with Mycobacterium smegmatis and Eschershia coli , two non-pathogenic bacteria, have been selected for drug binding pocket analysis of twelve essential enzymes. The amino acid sequences were obtained from Uniprot, aligned using ICM v3.8-4a and matched against the Pocketome encyclopedia. We used known co-crystal structures of selected target enzyme orthologs to evaluate the location of their active sites and binding pockets and to calculate a matrix of pairwise sequence identities across each target enzyme across the different species. This was used to generate sequence maps. High sequence identity of enzyme binding pockets, derived from experimentally determined co-crystallized structures, was observed among various species. Comparison at both full sequence level and for drug binding pockets of key metabolic enzymes showed that binding pockets are highly conserved (sequence similarity up to 100%) among various ESKAPE pathogens as well as Mycobacterium tuberculosis . Enzymes orthologs having conserved binding sites may have potential to interact with inhibitors in similar way and might be helpful for design of similar class of inhibitors for a particular species. The derived pocket alignments and distance-based maps provide guidelines for drug discovery and repurposing. In addition they also provide recommendations for the relevant model bacteria that may be used for initial drug testing. Comparing ligand binding sites through sequence identity calculation could be an effective approach to identify conserved orthologs as drug binding pockets have shown higher level of conservation among various species. By using this approach we could avoid the problems associated with full sequence comparison. We identified essential metabolic enzymes among ESKAPE pathogens that share high sequence identity in their putative drug binding pockets (up to 100%), of which known inhibitors can potentially antagonize these identical pockets in the various species in a similar manner.

Bradyrhizobium algeriense sp. nov., a novel species isolated from effective nodules of Retama sphaerocarpa from Northeastern Algeria.

PubMed

Ahnia, Hadjira; Bourebaba, Yasmina; Durán, David; Boulila, Farida; Palacios, José M; Rey, Luis; Ruiz-Argüeso, Tomás; Boulila, Abdelghani; Imperial, Juan

2018-04-04

We have characterized genetic, phenotypic and symbiotic properties of bacterial strains previously isolated from nitrogen-fixing nodules of Retama sphaerocarpa from Northern Algeria. Phylogenetic analyses of 16S rRNA genes and three concatenated housekeeping genes, recA, atpD and glnII, placed them in a new divergent group that is proposed to form a new Bradyrhizobium species, Bradyrhizobium algeriense sp. nov. (type strain RST89 T , LMG 27618 and CECT 8363). Based on these phylogenetic markers and on genomic identity data derived from draft genomic sequences, Bradyrhizobium valentinum LmjM3 T , Bradyrhizobium lablabi CCBAU 23086 T , Bradyrhizobium retamae Ro19 T , and Bradyrhizobium jicamae PAC68 T are the closest relatives of B. algeriense RST89 T , with sequence identities of 92-94% and Average Nucleotide Identities (ANIm) under 90%, well below the 95-96% species circumscription threshold. Likewise, a comparison of whole-cell proteomic patterns, estimated by Matrix-Assisted Laser Desorption/Ionization-Time-of-Flight (MALDI-TOF) mass spectrometric analysis, yielded almost identical spectra between B. algeriense strains but significant differences with B. valentinum, Bradyrhizobium paxllaeri, Bradyrhizobium icense, B. lablabi, B. jicamae and B. retamae. A phylogenetic tree based on symbiotic gene nodC revealed that the B. algeriense sequences cluster with sequences from the Bradyrhizobium symbiovar retamae, previously defined with B. retamae strains isolated from Retama monosperma. B. algeriense strains were able to establish effective symbioses with Retama raetam, Lupinus micranthus, Lupinus albus and Genista numidica, but not with Lupinus angustifolius or Glycine max. Copyright © 2018 Elsevier GmbH. All rights reserved.
Merida virus, a putative novel rhabdovirus discovered in Culex and Ochlerotatus spp. mosquitoes in the Yucatan Peninsula of Mexico

PubMed Central

Charles, Jermilia; Firth, Andrew E.; Loroño-Pino, Maria A.; Garcia-Rejon, Julian E.; Farfan-Ale, Jose A.; Lipkin, W. Ian; Briese, Thomas

2016-01-01

Sequences corresponding to a putative, novel rhabdovirus [designated Merida virus (MERDV)] were initially detected in a pool of Culex quinquefasciatus collected in the Yucatan Peninsula of Mexico. The entire genome was sequenced, revealing 11 798 nt and five major ORFs, which encode the nucleoprotein (N), phosphoprotein (P), matrix protein (M), glycoprotein (G) and RNA-dependent RNA polymerase (L). The deduced amino acid sequences of the N, G and L proteins have no more than 24, 38 and 43 % identity, respectively, to the corresponding sequences of all other known rhabdoviruses, whereas those of the P and M proteins have no significant identity with any sequences in GenBank and their identity is only suggested based on their genome position. Using specific reverse transcription-PCR assays established from the genome sequence, 27 571 C. quinquefasciatus which had been sorted in 728 pools were screened to assess the prevalence of MERDV in nature and 25 pools were found positive. The minimal infection rate (calculated as the number of positive mosquito pools per 1000 mosquitoes tested) was 0.9, and similar for both females and males. Screening another 140 pools of 5484 mosquitoes belonging to four other genera identified positive pools of Ochlerotatus spp. mosquitoes, indicating that the host range is not restricted to C. quinquefasciatus. Attempts to isolate MERDV in C6/36 and Vero cells were unsuccessful. In summary, we provide evidence that a previously undescribed rhabdovirus occurs in mosquitoes in Mexico. PMID:26868915
A putative peroxidase cDNA from turnip and analysis of the encoded protein sequence.

PubMed

Romero-Gómez, S; Duarte-Vázquez, M A; García-Almendárez, B E; Mayorga-Martínez, L; Cervantes-Avilés, O; Regalado, C

2008-12-01

A putative peroxidase cDNA was isolated from turnip roots (Brassica napus L. var. purple top white globe) by reverse transcriptase-polymerase chain reaction (RT-PCR) and rapid amplification of cDNA ends (RACE). Total RNA extracted from mature turnip roots was used as a template for RT-PCR, using a degenerated primer designed to amplify the highly conserved distal motif of plant peroxidases. The resulting partial sequence was used to design the rest of the specific primers for 5' and 3' RACE. Two cDNA fragments were purified, sequenced, and aligned with the partial sequence from RT-PCR, and a complete overlapping sequence was obtained and labeled as BbPA (Genbank Accession No. AY423440, named as podC). The full length cDNA is 1167bp long and contains a 1077bp open reading frame (ORF) encoding a 358 deduced amino acid peroxidase polypeptide. The putative peroxidase (BnPA) showed a calculated Mr of 34kDa, and isoelectric point (pI) of 4.5, with no significant identity with other reported turnip peroxidases. Sequence alignment showed that only three peroxidases have a significant identity with BnPA namely AtP29a (84%), and AtPA2 (81%) from Arabidopsis thaliana, and HRPA2 (82%) from horseradish (Armoracia rusticana). Work is in progress to clone this gene into an adequate host to study the specific role and possible biotechnological applications of this alternative peroxidase source.
Facilitated sequence counting and assembly by template mutagenesis

PubMed Central

Levy, Dan; Wigler, Michael

2014-01-01

Presently, inferring the long-range structure of the DNA templates is limited by short read lengths. Accurate template counts suffer from distortions occurring during PCR amplification. We explore the utility of introducing random mutations in identical or nearly identical templates to create distinguishable patterns that are inherited during subsequent copying. We simulate the applications of this process under assumptions of error-free sequencing and perfect mapping, using cytosine deamination as a model for mutation. The simulations demonstrate that within readily achievable conditions of nucleotide conversion and sequence coverage, we can accurately count the number of otherwise identical molecules as well as connect variants separated by long spans of identical sequence. We discuss many potential applications, such as transcript profiling, isoform assembly, haplotype phasing, and de novo genome assembly. PMID:25313059
Automated Identification of Medically Important Bacteria by 16S rRNA Gene Sequencing Using a Novel Comprehensive Database, 16SpathDB▿

PubMed Central

Woo, Patrick C. Y.; Teng, Jade L. L.; Yeung, Juilian M. Y.; Tse, Herman; Lau, Susanna K. P.; Yuen, Kwok-Yung

2011-01-01

Despite the increasing use of 16S rRNA gene sequencing, interpretation of 16S rRNA gene sequence results is one of the most difficult problems faced by clinical microbiologists and technicians. To overcome the problems we encountered in the existing databases during 16S rRNA gene sequence interpretation, we built a comprehensive database, 16SpathDB (http://147.8.74.24/16SpathDB) based on the 16S rRNA gene sequences of all medically important bacteria listed in the Manual of Clinical Microbiology and evaluated its use for automated identification of these bacteria. Among 91 nonduplicated bacterial isolates collected in our clinical microbiology laboratory, 71 (78%) were reported by 16SpathDB as a single bacterial species having >98.0% nucleotide identity with the query sequence, 19 (20.9%) were reported as more than one bacterial species having >98.0% nucleotide identity with the query sequence, and 1 (1.1%) was reported as no match. For the 71 bacterial isolates reported as a single bacterial species, all results were identical to their true identities as determined by a polyphasic approach. For the 19 bacterial isolates reported as more than one bacterial species, all results contained their true identities as determined by a polyphasic approach and all of them had their true identities as the “best match in 16SpathDB.” For the isolate (Gordonibacter pamelaeae) reported as no match, the bacterium has never been reported to be associated with human disease and was not included in the Manual of Clinical Microbiology. 16SpathDB is an automated, user-friendly, efficient, accurate, and regularly updated database for 16S rRNA gene sequence interpretation in clinical microbiology laboratories. PMID:21389154
Complex structural behavior of oligopurine-oligopyrimidine sequence cloned within the supercoiled plasmid.

PubMed Central

Parniewski, P; Galazka, G; Wilk, A; Klysik, J

1989-01-01

Synthetic sequence GATCC(AG)7ATCG(AT)4CG(AG)7 was cloned into plasmid and its structural behavior under the influence of supercoiling was analysed by chemical modification at variety of experimental conditions. It was found that this sequence adopts at least two different non-B conformations depending on -delta and pH values. Moreover, 12 nucleotide long non-pur.pyr spacer region separating two identical (AG)7 blocks does not provide a significant energy barrier protecting against unusual structures formation. Images PMID:2644622
Pairagon: a highly accurate, HMM-based cDNA-to-genome aligner.

PubMed

Lu, David V; Brown, Randall H; Arumugam, Manimozhiyan; Brent, Michael R

2009-07-01

The most accurate way to determine the intron-exon structures in a genome is to align spliced cDNA sequences to the genome. Thus, cDNA-to-genome alignment programs are a key component of most annotation pipelines. The scoring system used to choose the best alignment is a primary determinant of alignment accuracy, while heuristics that prevent consideration of certain alignments are a primary determinant of runtime and memory usage. Both accuracy and speed are important considerations in choosing an alignment algorithm, but scoring systems have received much less attention than heuristics. We present Pairagon, a pair hidden Markov model based cDNA-to-genome alignment program, as the most accurate aligner for sequences with high- and low-identity levels. We conducted a series of experiments testing alignment accuracy with varying sequence identity. We first created 'perfect' simulated cDNA sequences by splicing the sequences of exons in the reference genome sequences of fly and human. The complete reference genome sequences were then mutated to various degrees using a realistic mutation simulator and the perfect cDNAs were aligned to them using Pairagon and 12 other aligners. To validate these results with natural sequences, we performed cross-species alignment using orthologous transcripts from human, mouse and rat. We found that aligner accuracy is heavily dependent on sequence identity. For sequences with 100% identity, Pairagon achieved accuracy levels of >99.6%, with one quarter of the errors of any other aligner. Furthermore, for human/mouse alignments, which are only 85% identical, Pairagon achieved 87% accuracy, higher than any other aligner. Pairagon source and executables are freely available at http://mblab.wustl.edu/software/pairagon/
Taxonomic evaluation of Streptomyces albus and related species using multilocus sequence analysis

USDA-ARS?s Scientific Manuscript database

In phylogenetic analyses of the genus Streptomyces using 16S rRNA gene sequences, Streptomyces albus subsp. albus NRRL B-1811T formed a cluster with 5 other species having identical or nearly identical 16S rRNA gene sequences. Moreover, the morphological and physiological characteristics of these ot...
Detecting authorized and unauthorized genetically modified organisms containing vip3A by real-time PCR and next-generation sequencing.

PubMed

Liang, Chanjuan; van Dijk, Jeroen P; Scholtens, Ingrid M J; Staats, Martijn; Prins, Theo W; Voorhuijzen, Marleen M; da Silva, Andrea M; Arisi, Ana Carolina Maisonnave; den Dunnen, Johan T; Kok, Esther J

2014-04-01

The growing number of biotech crops with novel genetic elements increasingly complicates the detection of genetically modified organisms (GMOs) in food and feed samples using conventional screening methods. Unauthorized GMOs (UGMOs) in food and feed are currently identified through combining GMO element screening with sequencing the DNA flanking these elements. In this study, a specific and sensitive qPCR assay was developed for vip3A element detection based on the vip3Aa20 coding sequences of the recently marketed MIR162 maize and COT102 cotton. Furthermore, SiteFinding-PCR in combination with Sanger, Illumina or Pacific BioSciences (PacBio) sequencing was performed targeting the flanking DNA of the vip3Aa20 element in MIR162. De novo assembly and Basic Local Alignment Search Tool searches were used to mimic UGMO identification. PacBio data resulted in relatively long contigs in the upstream (1,326 nucleotides (nt); 95 % identity) and downstream (1,135 nt; 92 % identity) regions, whereas Illumina data resulted in two smaller contigs of 858 and 1,038 nt with higher sequence identity (>99 % identity). Both approaches outperformed Sanger sequencing, underlining the potential for next-generation sequencing in UGMO identification.
SPARSE: quadratic time simultaneous alignment and folding of RNAs without sequence-based heuristics.

PubMed

Will, Sebastian; Otto, Christina; Miladi, Milad; Möhl, Mathias; Backofen, Rolf

2015-08-01

RNA-Seq experiments have revealed a multitude of novel ncRNAs. The gold standard for their analysis based on simultaneous alignment and folding suffers from extreme time complexity of [Formula: see text]. Subsequently, numerous faster 'Sankoff-style' approaches have been suggested. Commonly, the performance of such methods relies on sequence-based heuristics that restrict the search space to optimal or near-optimal sequence alignments; however, the accuracy of sequence-based methods breaks down for RNAs with sequence identities below 60%. Alignment approaches like LocARNA that do not require sequence-based heuristics, have been limited to high complexity ([Formula: see text] quartic time). Breaking this barrier, we introduce the novel Sankoff-style algorithm 'sparsified prediction and alignment of RNAs based on their structure ensembles (SPARSE)', which runs in quadratic time without sequence-based heuristics. To achieve this low complexity, on par with sequence alignment algorithms, SPARSE features strong sparsification based on structural properties of the RNA ensembles. Following PMcomp, SPARSE gains further speed-up from lightweight energy computation. Although all existing lightweight Sankoff-style methods restrict Sankoff's original model by disallowing loop deletions and insertions, SPARSE transfers the Sankoff algorithm to the lightweight energy model completely for the first time. Compared with LocARNA, SPARSE achieves similar alignment and better folding quality in significantly less time (speedup: 3.7). At similar run-time, it aligns low sequence identity instances substantially more accurate than RAF, which uses sequence-based heuristics. © The Author 2015. Published by Oxford University Press.
MetaSeq: privacy preserving meta-analysis of sequencing-based association studies.

PubMed

Singh, Angad Pal; Zafer, Samreen; Pe'er, Itsik

2013-01-01

Human genetics recently transitioned from GWAS to studies based on NGS data. For GWAS, small effects dictated large sample sizes, typically made possible through meta-analysis by exchanging summary statistics across consortia. NGS studies groupwise-test for association of multiple potentially-causal alleles along each gene. They are subject to similar power constraints and therefore likely to resort to meta-analysis as well. The problem arises when considering privacy of the genetic information during the data-exchange process. Many scoring schemes for NGS association rely on the frequency of each variant thus requiring the exchange of identity of the sequenced variant. As such variants are often rare, potentially revealing the identity of their carriers and jeopardizing privacy. We have thus developed MetaSeq, a protocol for meta-analysis of genome-wide sequencing data by multiple collaborating parties, scoring association for rare variants pooled per gene across all parties. We tackle the challenge of tallying frequency counts of rare, sequenced alleles, for metaanalysis of sequencing data without disclosing the allele identity and counts, thereby protecting sample identity. This apparent paradoxical exchange of information is achieved through cryptographic means. The key idea is that parties encrypt identity of genes and variants. When they transfer information about frequency counts in cases and controls, the exchanged data does not convey the identity of a mutation and therefore does not expose carrier identity. The exchange relies on a 3rd party, trusted to follow the protocol although not trusted to learn about the raw data. We show applicability of this method to publicly available exome-sequencing data from multiple studies, simulating phenotypic information for powerful meta-analysis. The MetaSeq software is publicly available as open source.
PASS2: an automated database of protein alignments organised as structural superfamilies.

PubMed

Bhaduri, Anirban; Pugalenthi, Ganesan; Sowdhamini, Ramanathan

2004-04-02

The functional selection and three-dimensional structural constraints of proteins in nature often relates to the retention of significant sequence similarity between proteins of similar fold and function despite poor sequence identity. Organization of structure-based sequence alignments for distantly related proteins, provides a map of the conserved and critical regions of the protein universe that is useful for the analysis of folding principles, for the evolutionary unification of protein families and for maximizing the information return from experimental structure determination. The Protein Alignment organised as Structural Superfamily (PASS2) database represents continuously updated, structural alignments for evolutionary related, sequentially distant proteins. An automated and updated version of PASS2 is, in direct correspondence with SCOP 1.63, consisting of sequences having identity below 40% among themselves. Protein domains have been grouped into 628 multi-member superfamilies and 566 single member superfamilies. Structure-based sequence alignments for the superfamilies have been obtained using COMPARER, while initial equivalencies have been derived from a preliminary superposition using LSQMAN or STAMP 4.0. The final sequence alignments have been annotated for structural features using JOY4.0. The database is supplemented with sequence relatives belonging to different genomes, conserved spatially interacting and structural motifs, probabilistic hidden markov models of superfamilies based on the alignments and useful links to other databases. Probabilistic models and sensitive position specific profiles obtained from reliable superfamily alignments aid annotation of remote homologues and are useful tools in structural and functional genomics. PASS2 presents the phylogeny of its members both based on sequence and structural dissimilarities. Clustering of members allows us to understand diversification of the family members. The search engine has been improved for simpler browsing of the database. The database resolves alignments among the structural domains consisting of evolutionarily diverged set of sequences. Availability of reliable sequence alignments of distantly related proteins despite poor sequence identity and single-member superfamilies permit better sampling of structures in libraries for fold recognition of new sequences and for the understanding of protein structure-function relationships of individual superfamilies. PASS2 is accessible at http://www.ncbs.res.in/~faculty/mini/campass/pass2.html
Identification, cloning and sequencing of Escherichia coli strain chi1378 (O78:K80) iss gene isolated from poultry colibacillosis in Iran.

PubMed

Derakhshandeh, A; Zahraei Salehi, T; Tadjbakhsh, H; Karimi, V

2009-09-01

To identify, clone and sequence the iss (increased serum survival) gene from E. coli strain chi1378 isolated from Iranian poultry and to predict its protein product, Iss. The iss gene from E. coli strain chi1378 was amplified and cloned into the pTZ57R/T vector and sequenced. From the DNA sequence, the Iss predictive protein was evaluated using bioinformatics. Iss from strain chi1378 had 100% identity with other E. coli serotypes and isolates from different origins and also 98% identity with E. coli O157:H7 Iss protein. Phylogenetic analysis showed no significant different phylogenic groups among E. coli strains. The strong association of predicted Iss protein among different E. coli strains suggests that it could be a good antigen to control and detect avian pathogenic E. coli (APEC). Because the exact pathogenesis and the role of virulence factors are unknown, the Iss protein could be used as a target for vaccination in the future, but further research is required.
Identity processes and coping strategies in college students: short-term longitudinal dynamics and the role of personality.

PubMed

Luyckx, Koen; Klimstra, Theo A; Duriez, Bart; Schwartz, Seth J; Vanhalst, Janne

2012-09-01

Coping strategies and identity processes are hypothesized to influence one another over time. This three-wave longitudinal study (N = 458; 84.9% women) examined, for the first time, how and to what extent identity processes (i.e., commitment making, identification with commitment, exploration in breadth, exploration in depth, and ruminative exploration) and coping strategies (i.e., problem solving, social support seeking, and avoidance) predicted one another over time. Cross-lagged analyses indicated that processes of identity exploration seemed especially to be intertwined with different coping strategies over time, suggesting that identity exploration may resemble problem-solving behavior on the pathway to an achieved identity. Commitment processes were found to be influenced by certain coping strategies, although identification with commitment also negatively influenced avoidance coping. These temporal sequences remained significant when controlling for baseline levels of Big Five personality traits. Hence, evidence was obtained for reciprocal pathways indicating that coping strategies and identity processes reinforce one another over time in college students.
Molecular characterisation of Sarcocystis lutrae n. sp. and Toxoplasma gondii from the musculature of two Eurasian otters (Lutra lutra) in Norway.

PubMed

Gjerde, Bjørn; Josefsen, Terje D

2015-03-01

Sarcocysts were detected in routinely processed histological sections of skeletal muscle, but not cardiac muscle, of two adult male otters (Lutra lutra; Mustelidae) from northern Norway following their post-mortem examination in 1999 and 2000. The sarcocysts were slender, spindle-shaped, up to 970 μm long and 35-70 μm in greatest diameter. The sarcocyst wall was thin (∼ 0.5 μm) and smooth with no visible protrusions. Portions of unfixed diaphragm of both animals were collected at the autopsies and kept frozen for about 14 years pending further examination. When the study was resumed in 2013, the thawed muscle samples were examined for sarcocysts under a stereo microscope, but none could be found. Genomic DNA was therefore extracted from a total of 36 small pieces of the diaphragm from both otters, and samples found to contain Sarcocystidae DNA were used selectively for PCR amplification and sequencing of the nuclear 18S and 28S ribosomal (r) RNA genes and internal transcribed spacer 1 (ITS1) region, as well as the mitochondrial cytochrome b (cytb) and cytochrome c oxidase subunit 1 (cox1) genes. Sequence comparisons revealed that both otters were infected by the same Sarcocystis sp. and that there was no genetic variation (100 % identity) among sequenced isolates at the 18S and 28S rRNA genes (six identical isolates at both loci) or at cox1 (13 identical isolates). PCR products comprising the ITS1 region, on the other hand, had to be cloned before sequencing due to intraspecific sequence variation. A total of 33 clones were sequenced, and the identities between them were 97.9-99.9 %. These sequences were most similar (93.7-96.0 % identity) to a sequence of Sarcocystis kalvikus from the wolverine in Canada, but the phylogenetic analyses placed all of them as a monophyletic sister group to S. kalvikus. Hence, they were considered to represent a novel species, which was named Sarcocystis lutrae. Sequence comparisons and phylogenetic analyses based on sequences of the 18S and 28S rRNA genes and cox1, for which little or no sequence data were available for S. kalvikus, revealed that S. lutrae otherwise was most closely related to various Sarcocystis spp. using birds or carnivores as intermediate hosts. The cox1 sequences of S. lutrae from the otters were identical to two sequences from an arctic fox, which in a previous study had been assigned to Sarcocystis arctica due to a high identity (99.4 %) with the latter species at this gene and a complete identity with S. arctica at three other loci when using the same DNA samples as templates for PCR reactions. Additional PCR amplifications and sequencing of cox1 (ten sequences) and the ITS1 region (four sequences) using four DNA samples from this fox as templates again generated cox1 sequences exclusively of S. lutrae, but ITS1 sequences of S. arctica, and thus confirmed that this arctic fox had acted as intermediate host for both S. arctica and S. lutrae. Based on the phylogenetic placement of S. lutrae, the geographical location of infected animals (otters, arctic fox) and the distribution of carnivores/raptors which may have interacted with them, the white-tailed eagle (Haliaeetus albicilla) seems to be a possible definitive host of S. lutrae. Some of the muscle samples from both otters were shown to harbour stages of Toxoplasma gondii through PCR amplification and sequencing of the entire ITS1 region (five isolates) and/or the partial cytb (eight isolates) and cox1 (one isolate). These sequences were identical to several previous sequences of T. gondii in GenBank. Thus, both otters had a dual infection with S. lutrae and T. gondii.
Characterisation of Potential Antimicrobial Targets in Bacillus spp. I. Aminotransferases and Methionine Regeneration in Bacillus subtilis

DTIC Science & Technology

2002-07-01

DAAT and 45% identical to the Staphylococcus haemolyticus DAAT. The ybgE and ywaA sequences were found in the Illa subfamily, and were 59% identical to...halodurans BH1060 gene product. The two sequences also had a respective 40% and 37% identity to the Staphylococcus aureuts SAV2560 gene product. The 6
Algorithms for optimizing cross-overs in DNA shuffling.

PubMed

He, Lu; Friedman, Alan M; Bailey-Kellogg, Chris

2012-03-21

DNA shuffling generates combinatorial libraries of chimeric genes by stochastically recombining parent genes. The resulting libraries are subjected to large-scale genetic selection or screening to identify those chimeras with favorable properties (e.g., enhanced stability or enzymatic activity). While DNA shuffling has been applied quite successfully, it is limited by its homology-dependent, stochastic nature. Consequently, it is used only with parents of sufficient overall sequence identity, and provides no control over the resulting chimeric library. This paper presents efficient methods to extend the scope of DNA shuffling to handle significantly more diverse parents and to generate more predictable, optimized libraries. Our CODNS (cross-over optimization for DNA shuffling) approach employs polynomial-time dynamic programming algorithms to select codons for the parental amino acids, allowing for zero or a fixed number of conservative substitutions. We first present efficient algorithms to optimize the local sequence identity or the nearest-neighbor approximation of the change in free energy upon annealing, objectives that were previously optimized by computationally-expensive integer programming methods. We then present efficient algorithms for more powerful objectives that seek to localize and enhance the frequency of recombination by producing "runs" of common nucleotides either overall or according to the sequence diversity of the resulting chimeras. We demonstrate the effectiveness of CODNS in choosing codons and allocating substitutions to promote recombination between parents targeted in earlier studies: two GAR transformylases (41% amino acid sequence identity), two very distantly related DNA polymerases, Pol X and β (15%), and beta-lactamases of varying identity (26-47%). Our methods provide the protein engineer with a new approach to DNA shuffling that supports substantially more diverse parents, is more deterministic, and generates more predictable and more diverse chimeric libraries.
Massively parallel sequencing of 124 SNPs included in the precision ID identity panel in three East Asian minority ethnicities.

PubMed

Liu, Jing; Wang, Zheng; He, Guanglin; Zhao, Xueying; Wang, Mengge; Luo, Tao; Li, Chengtao; Hou, Yiping

2018-07-01

Massively parallel sequencing (MPS) technologies can sequence many targeted regions of multiple samples simultaneously and are gaining great interest in the forensic community. The Precision ID Identity Panel contains 90 autosomal SNPs and 34 upper Y-Clade SNPs, which was designed with small amplicons and optimized for forensic degraded or challenging samples. Here, 184 unrelated individuals from three East Asian minority ethnicities (Tibetan, Uygur and Hui) were analyzed using the Precision ID Identity Panel and the Ion PGM System. The sequencing performance and corresponding forensic statistical parameters of this MPS-SNP panel were investigated. The inter-population relationships and substructures among three investigated populations and 30 worldwide populations were further investigated using PCA, MDS, cladogram and STRUCTURE. No significant deviation from Hardy-Weinberg equilibrium (HWE) and Linkage Disequilibrium (LD) tests was observed across all 90 autosomal SNPs. The combined matching probability (CMP) for Tibetan, Uygur and Hui were 2.5880 × 10 -33 , 1.7480 × 10 -35 and 4.6326 × 10 -34 respectively, and the combined power of exclusion (CPE) were 0.999999386152271, 0.999999607712827 and 0.999999696360182 respectively. For 34 Y-SNPs, only 16 haplogroups were obtained, but the haplogroup distributions differ among the three populations. Tibetans from the Sino-Tibetan population and Hui with multiple ethnicities with an admixture population have genetic affinity with East Asian populations, while Uygurs of a Eurasian admixture population have similar genetic components to the South Asian populations and are distributed between East Asian and European populations. The aforementioned results suggest that the Precision ID Identity Panel is informative and polymorphic in three investigated populations and could be used as an effective tool for human forensics. Copyright © 2018 Elsevier B.V. All rights reserved.
The nucleotide sequences of 5S rRNAs from a fern Dryopteris acuminata and a horsetail Equisetum arvense.

PubMed Central

Hori, H; Osawa, S; Takaiwa, F; Sugiura, M

1984-01-01

The nucleotide sequences from two Pteridophyta species, a fern Dryopteris acuminata and a horsetail Equisetum arvense have been determined. These two sequences are more related to those of the Bryophyta species (88% identity on average) than to those of seed plants (84% identity on average). PMID:6538332
ExprAlign - the identification of ESTs in non-model species by alignment of cDNA microarray expression profiles

PubMed Central

2009-01-01

Background Sequence identification of ESTs from non-model species offers distinct challenges particularly when these species have duplicated genomes and when they are phylogenetically distant from sequenced model organisms. For the common carp, an environmental model of aquacultural interest, large numbers of ESTs remained unidentified using BLAST sequence alignment. We have used the expression profiles from large-scale microarray experiments to suggest gene identities. Results Expression profiles from ~700 cDNA microarrays describing responses of 7 major tissues to multiple environmental stressors were used to define a co-expression landscape. This was based on the Pearsons correlation coefficient relating each gene with all other genes, from which a network description provided clusters of highly correlated genes as 'mountains'. We show that these contain genes with known identities and genes with unknown identities, and that the correlation constitutes evidence of identity in the latter. This procedure has suggested identities to 522 of 2701 unknown carp ESTs sequences. We also discriminate several common carp genes and gene isoforms that were not discriminated by BLAST sequence alignment alone. Precision in identification was substantially improved by use of data from multiple tissues and treatments. Conclusion The detailed analysis of co-expression landscapes is a sensitive technique for suggesting an identity for the large number of BLAST unidentified cDNAs generated in EST projects. It is capable of detecting even subtle changes in expression profiles, and thereby of distinguishing genes with a common BLAST identity into different identities. It benefits from the use of multiple treatments or contrasts, and from the large-scale microarray data. PMID:19939286

Complete sequence analysis reveals two distinct poleroviruses infecting cucurbits in China.

PubMed

Xiang, Hai-ying; Shang, Qiao-xia; Han, Cheng-gui; Li, Da-wei; Yu, Jia-lin

2008-01-01

The complete RNA genomes of a Chinese isolate of cucurbit aphid-borne yellows virus (CABYV-CHN) and a new polerovirus tentatively referred to as melon aphid-borne yellows virus (MABYV) were determined. The entire genome of CABYV-CHN shared 89.0% nucleotide sequence identity with the French CABYV isolate. In contrast, nucleotide sequence identities between MABYV and CABYV and other poleroviruses were in the range of 50.7-74.2%, with amino acid sequence identities ranging from 24.8 to 82.9% for individual gene products. We propose that CABYV-CHN is a strain of CABYV and that MABYV is a member of a tentative distinct species within the genus Polerovirus.
TIA: algorithms for development of identity-linked SNP islands for analysis by massively parallel DNA sequencing.

PubMed

Farris, M Heath; Scott, Andrew R; Texter, Pamela A; Bartlett, Marta; Coleman, Patricia; Masters, David

2018-04-11

Single nucleotide polymorphisms (SNPs) located within the human genome have been shown to have utility as markers of identity in the differentiation of DNA from individual contributors. Massively parallel DNA sequencing (MPS) technologies and human genome SNP databases allow for the design of suites of identity-linked target regions, amenable to sequencing in a multiplexed and massively parallel manner. Therefore, tools are needed for leveraging the genotypic information found within SNP databases for the discovery of genomic targets that can be evaluated on MPS platforms. The SNP island target identification algorithm (TIA) was developed as a user-tunable system to leverage SNP information within databases. Using data within the 1000 Genomes Project SNP database, human genome regions were identified that contain globally ubiquitous identity-linked SNPs and that were responsive to targeted resequencing on MPS platforms. Algorithmic filters were used to exclude target regions that did not conform to user-tunable SNP island target characteristics. To validate the accuracy of TIA for discovering these identity-linked SNP islands within the human genome, SNP island target regions were amplified from 70 contributor genomic DNA samples using the polymerase chain reaction. Multiplexed amplicons were sequenced using the Illumina MiSeq platform, and the resulting sequences were analyzed for SNP variations. 166 putative identity-linked SNPs were targeted in the identified genomic regions. Of the 309 SNPs that provided discerning power across individual SNP profiles, 74 previously undefined SNPs were identified during evaluation of targets from individual genomes. Overall, DNA samples of 70 individuals were uniquely identified using a subset of the suite of identity-linked SNP islands. TIA offers a tunable genome search tool for the discovery of targeted genomic regions that are scalable in the population frequency and numbers of SNPs contained within the SNP island regions. It also allows the definition of sequence length and sequence variability of the target region as well as the less variable flanking regions for tailoring to MPS platforms. As shown in this study, TIA can be used to discover identity-linked SNP islands within the human genome, useful for differentiating individuals by targeted resequencing on MPS technologies.
Morphological identification and COI barcodes of adult flies help determine species identities of chironomid larvae (Diptera, Chironomidae).

PubMed

Failla, A J; Vasquez, A A; Hudson, P; Fujimoto, M; Ram, J L

2016-02-01

Establishing reliable methods for the identification of benthic chironomid communities is important due to their significant contribution to biomass, ecology and the aquatic food web. Immature larval specimens are more difficult to identify to species level by traditional morphological methods than their fully developed adult counterparts, and few keys are available to identify the larval species. In order to develop molecular criteria to identify species of chironomid larvae, larval and adult chironomids from Western Lake Erie were subjected to both molecular and morphological taxonomic analysis. Mitochondrial cytochrome c oxidase I (COI) barcode sequences of 33 adults that were identified to species level by morphological methods were grouped with COI sequences of 189 larvae in a neighbor-joining taxon-ID tree. Most of these larvae could be identified only to genus level by morphological taxonomy (only 22 of the 189 sequenced larvae could be identified to species level). The taxon-ID tree of larval sequences had 45 operational taxonomic units (OTUs, defined as clusters with >97% identity or individual sequences differing from nearest neighbors by >3%; supported by analysis of all larval pairwise differences), of which seven could be identified to species or 'species group' level by larval morphology. Reference sequences from the GenBank and BOLD databases assigned six larval OTUs with presumptive species level identifications and confirmed one previously assigned species level identification. Sequences from morphologically identified adults in the present study grouped with and further classified the identity of 13 larval OTUs. The use of morphological identification and subsequent DNA barcoding of adult chironomids proved to be beneficial in revealing possible species level identifications of larval specimens. Sequence data from this study also contribute to currently inadequate public databases relevant to the Great Lakes region, while the neighbor-joining analysis reported here describes the application and confirmation of a useful tool that can accelerate identification and bioassessment of chironomid communities.
Morphological identification and COI barcodes of adult flies help determine species identities of chironomid larvae (Diptera, Chironomidae)

USGS Publications Warehouse

Failla, Andrew Joseph; Vasquez, Adrian Amelio; Hudson, Patrick L.; Fujimoto, Masanori; Ram, Jeffrey L.

2016-01-01

Establishing reliable methods for the identification of benthic chironomid communities is important due to their significant contribution to biomass, ecology and the aquatic food web. Immature larval specimens are more difficult to identify to species level by traditional morphological methods than their fully developed adult counterparts, and few keys are available to identify the larval species. In order to develop molecular criteria to identify species of chironomid larvae, larval and adult chironomids from Western Lake Erie were subjected to both molecular and morphological taxonomic analysis. Mitochondrial cytochrome c oxidase I (COI) barcode sequences of 33 adults that were identified to species level by morphological methods were grouped with COI sequences of 189 larvae in a neighbor-joining taxon-ID tree. Most of these larvae could be identified only to genus level by morphological taxonomy (only 22 of the 189 sequenced larvae could be identified to species level). The taxon-ID tree of larval sequences had 45 operational taxonomic units (OTUs, defined as clusters with >97% identity or individual sequences differing from nearest neighbors by >3%; supported by analysis of all larval pairwise differences), of which seven could be identified to species or ‘species group’ level by larval morphology. Reference sequences from the GenBank and BOLD databases assigned six larval OTUs with presumptive species level identifications and confirmed one previously assigned species level identification. Sequences from morphologically identified adults in the present study grouped with and further classified the identity of 13 larval OTUs. The use of morphological identification and subsequent DNA barcoding of adult chironomids proved to be beneficial in revealing possible species level identifications of larval specimens. Sequence data from this study also contribute to currently inadequate public databases relevant to the Great Lakes region, while the neighbor-joining analysis reported here describes the application and confirmation of a useful tool that can accelerate identification and bioassesment of chironomid communities.
Simultaneous identification of DNA and RNA viruses present in pig faeces using process-controlled deep sequencing.

PubMed

Sachsenröder, Jana; Twardziok, Sven; Hammerl, Jens A; Janczyk, Pawel; Wrede, Paul; Hertwig, Stefan; Johne, Reimar

2012-01-01

Animal faeces comprise a community of many different microorganisms including bacteria and viruses. Only scarce information is available about the diversity of viruses present in the faeces of pigs. Here we describe a protocol, which was optimized for the purification of the total fraction of viral particles from pig faeces. The genomes of the purified DNA and RNA viruses were simultaneously amplified by PCR and subjected to deep sequencing followed by bioinformatic analyses. The efficiency of the method was monitored using a process control consisting of three bacteriophages (T4, M13 and MS2) with different morphology and genome types. Defined amounts of the bacteriophages were added to the sample and their abundance was assessed by quantitative PCR during the preparation procedure. The procedure was applied to a pooled faecal sample of five pigs. From this sample, 69,613 sequence reads were generated. All of the added bacteriophages were identified by sequence analysis of the reads. In total, 7.7% of the reads showed significant sequence identities with published viral sequences. They mainly originated from bacteriophages (73.9%) and mammalian viruses (23.9%); 0.8% of the sequences showed identities to plant viruses. The most abundant detected porcine viruses were kobuvirus, rotavirus C, astrovirus, enterovirus B, sapovirus and picobirnavirus. In addition, sequences with identities to the chimpanzee stool-associated circular ssDNA virus were identified. Whole genome analysis indicates that this virus, tentatively designated as pig stool-associated circular ssDNA virus (PigSCV), represents a novel pig virus. The established protocol enables the simultaneous detection of DNA and RNA viruses in pig faeces including the identification of so far unknown viruses. It may be applied in studies investigating aetiology, epidemiology and ecology of diseases. The implemented process control serves as quality control, ensures comparability of the method and may be used for further method optimization.
Molecular confirmation of Trichomonas gallinae and other parabasalids from Brazil using the 5.8S and ITS-1 rRNA regions.

PubMed

Ecco, Roselene; Preis, Ingred S; Vilela, Daniel A R; Luppi, Marcela M; Malta, Marcelo C C; Beckstead, Robert B; Stimmelmayr, Raphaela; Stimmelmayer, Raphaela; Gerhold, Richard W

2012-11-23

Clinical, gross, and histopathology lesions and molecular characterization of Trichomonas spp. infection were described in two striped owls (Asio (Rhinoptynx) clamator), one American kestrel (Falco sparverius), two green-winged saltators (Saltator similis), and in a toco toucan (Ramphastos toco) from Brazil. These birds presented clinical signs including emaciation, ruffled feathers, abundant salivation and open mouth breathing presumably due to abundant caseous material. Gross lesions were characterized by multifocal yellow friable plaques on the surface of the tongue, pharynx and/or caseous masses partially occluding the laryngeal entrance. In the owls, the caseous material extended into the mandibular muscles and invaded the sinuses of the skull. Histopathologically, marked necrotic and inflammatory lesions were associated with numerous round to oval, pale eosinophilic structures (6-10μm) with basophilic nuclei, consistent with trichomonads. Organisms similar to those described above also were found in the liver of the two green-winged saltators. To the authors' knowledge, this is the first report of trichomonosis in a striped owl and a toco toucan. Sequence analysis of the Trichomonas spp. internal transcribed spacer 1 (ITS-1) region and partial 5.8S of the ribosomal RNA (rRNA) disclosed significant genetic diversity. Two sequences had 100% identity to Trichomonas gallinae, whereas two sequences had a 99% and 92% identity to a Trichomonas vaginalis-like sequence, respectively. One sequence (green-winged saltator 502-08) had a 100% identity to a newly recognized genus Simplicomonas. Copyright © 2012 Elsevier B.V. All rights reserved.
Phylogenetic analysis of simian Plasmodium spp. infecting Anopheles balabacensis Baisas in Sabah, Malaysia

PubMed Central

Manin, Benny O.; Daim, Sylvia; Vythilingam, Indra; Drakeley, Chris

2017-01-01

Background Anopheles balabacensis of the Leucospyrus group has been confirmed as the primary knowlesi malaria vector in Sabah, Malaysian Borneo for some time now. Presently, knowlesi malaria is the only zoonotic simian malaria in Malaysia with a high prevalence recorded in the states of Sabah and Sarawak. Methodology/Principal findings Anopheles spp. were sampled using human landing catch (HLC) method at Paradason village in Kudat district of Sabah. The collected Anopheles were identified morphologically and then subjected to total DNA extraction and polymerase chain reaction (PCR) to detect Plasmodium parasites in the mosquitoes. Identification of Plasmodium spp. was confirmed by sequencing the SSU rRNA gene with species specific primers. MEGA4 software was then used to analyse the SSU rRNA sequences and bulid the phylogenetic tree for inferring the relationship between simian malaria parasites in Sabah. PCR results showed that only 1.61% (23/1,425) of the screened An. balabacensis were infected with one or two of the five simian Plasmodium spp. found in Sabah, viz. Plasmodium coatneyi, P. inui, P. fieldi, P. cynomolgi and P. knowlesi. Sequence analysis of SSU rRNA of Plasmodium isolates showed high percentage of identity within the same Plasmodium sp. group. The phylogenetic tree based on the consensus sequences of P. knowlesi showed 99.7%–100.0% nucleotide identity among the isolates from An. balabacensis, human patients and a long-tailed macaque from the same locality. Conclusions/Significance This is the first study showing high molecular identity between the P. knowlesi isolates from An. balabacensis, human patients and a long-tailed macaque in Sabah. The other common simian Plasmodium spp. found in long-tailed macaques and also detected in An. balabacensis were P. coatneyi, P. inui, P. fieldi and P. cynomolgi. The high percentage identity of nucleotide sequences between the P. knowlesi isolates from the long-tailed macaque, An. balabacensis and human patients suggests a close genetic relationship between the parasites from these hosts. PMID:28968395
Terminal region sequence variations in variola virus DNA.

PubMed

Massung, R F; Loparev, V N; Knight, J C; Totmenin, A V; Chizhikov, V E; Parsons, J M; Safronov, P F; Gutorov, V V; Shchelkunov, S N; Esposito, J J

1996-07-15

Genome DNA terminal region sequences were determined for a Brazilian alastrim variola minor virus strain Garcia-1966 that was associated with an 0.8% case-fatality rate and African smallpox strains Congo-1970 and Somalia-1977 associated with variola major (9.6%) and minor (0.4%) mortality rates, respectively. A base sequence identity of > or = 98.8% was determined after aligning 30 kb of the left- or right-end region sequences with cognate sequences previously determined for Asian variola major strains India-1967 (31% death rate) and Bangladesh-1975 (18.5% death rate). The deduced amino acid sequences of putative proteins of > or = 65 amino acids also showed relatively high identity, although the Asian and African viruses were clearly more related to each other than to alastrim virus. Alastrim virus contained only 10 of 70 proteins that were 100% identical to homologs in Asian strains, and 7 alastrim-specific proteins were noted.
Sequence and structural implications of a bovine corneal keratan sulfate proteoglycan core protein. Protein 37B represents bovine lumican and proteins 37A and 25 are unique

NASA Technical Reports Server (NTRS)

Funderburgh, J. L.; Funderburgh, M. L.; Brown, S. J.; Vergnes, J. P.; Hassell, J. R.; Mann, M. M.; Conrad, G. W.; Spooner, B. S. (Principal Investigator)

1993-01-01

Amino acid sequence from tryptic peptides of three different bovine corneal keratan sulfate proteoglycan (KSPG) core proteins (designated 37A, 37B, and 25) showed similarities to the sequence of a chicken KSPG core protein lumican. Bovine lumican cDNA was isolated from a bovine corneal expression library by screening with chicken lumican cDNA. The bovine cDNA codes for a 342-amino acid protein, M(r) 38,712, containing amino acid sequences identified in the 37B KSPG core protein. The bovine lumican is 68% identical to chicken lumican, with an 83% identity excluding the N-terminal 40 amino acids. Location of 6 cysteine and 4 consensus N-glycosylation sites in the bovine sequence were identical to those in chicken lumican. Bovine lumican had about 50% identity to bovine fibromodulin and 20% identity to bovine decorin and biglycan. About two-thirds of the lumican protein consists of a series of 10 amino acid leucine-rich repeats that occur in regions of calculated high beta-hydrophobic moment, suggesting that the leucine-rich repeats contribute to beta-sheet formation in these proteins. Sequences obtained from 37A and 25 core proteins were absent in bovine lumican, thus predicting a unique primary structure and separate mRNA for each of the three bovine KSPG core proteins.
Conserved features of eukaryotic hsp70 genes revealed by comparison with the nucleotide sequence of human hsp70.

PubMed Central

Hunt, C; Morimoto, R I

1985-01-01

We have determined the nucleotide sequence of the human hsp70 gene and 5' flanking region. The hsp70 gene is transcribed as an uninterrupted primary transcript of 2440 nucleotides composed of a 5' noncoding leader sequence of 212 nucleotides, a 3' noncoding region of 242 nucleotides, and a continuous open reading frame of 1986 nucleotides that encodes a protein with predicted molecular mass of 69,800 daltons. Upstream of the 5' terminus are the canonical TATAAA box, the sequence ATTGG that corresponds in the inverted orientation to the CCAAT motif, and the dyad sequence CTGGAAT/ATTCCCG that shares homology in 12 of 14 positions with the consensus transcription regulatory sequence common to Drosophila heat shock genes. Comparison of the predicted amino acid sequences of human hsp70 with the published sequences of Drosophila hsp70 and Escherichia coli dnaK reveals that human hsp70 is 73% identical to Drosophila hsp70 and 47% identical to E. coli dnaK. Surprisingly, the nucleotide sequences of the human and Drosophila genes are 72% identical and human and E. coli genes are 50% identical, which is more highly conserved than necessary given the degeneracy of the genetic code. The lack of accumulated silent nucleotide substitutions leads us to propose that there may be additional information in the nucleotide sequence of the hsp70 gene or the corresponding mRNA that precludes the maximum divergence allowed in the silent codon positions. PMID:3931075
Diversity of partial RNA-dependent RNA polymerase gene sequences of soybean blotchy mosaic virus isolates from different host-, geographical- and temporal origins.

PubMed

Strydom, Elrea; Pietersen, Gerhard

2018-05-01

Infection of soybean by the plant cytorhabdovirus soybean blotchy mosaic virus (SbBMV) results in significant yield losses in the temperate, lower-lying soybean production regions of South Africa. A 277 bp portion of the RNA-dependent RNA polymerase gene of 66 SbBMV isolates from different: hosts, geographical locations in South Africa, and times of collection (spanning 16 years) were amplified by RT-PCR and sequenced to investigate the genetic diversity of isolates. Phylogenetic reconstruction revealed three main lineages, designated Groups A, B and C, with isolates grouping primarily according to geographic origin. Pairwise nucleotide identities ranged between 85.7% and 100% among all isolates, with isolates in Group A exhibiting the highest degree of sequence identity, and isolates of Groups A and B being more closely related to each other than to those in Group C. This is the first study investigating the genetic diversity of SbBMV.
Identification of the electron transfer flavoprotein as an upregulated enzyme in the benzoate utilization of Desulfotignum balticum.

PubMed

Habe, Hiroshi; Kobuna, Akinori; Hosoda, Akifumi; Kosaka, Tomoyuki; Endoh, Takayuki; Tamura, Hiroto; Yamane, Hisakazu; Nojiri, Hideaki; Omori, Toshio; Watanabe, Kazuya

2009-07-01

Desulfotignum balticum utilizes benzoate coupled to sulfate reduction. Two-dimensional polyacrylamide gel electrophoresis (2D-PAGE) analysis was conducted to detect proteins that increased more after growth on benzoate than on butyrate. A comparison of proteins on 2D gels showed that at least six proteins were expressed. The N-terminal sequences of three proteins exhibited significant identities with the alpha and beta subunits of electron transfer flavoprotein (ETF) from anaerobic aromatic-degraders. By sequence analysis of the fosmid clone insert (37,590 bp) containing the genes encoding the ETF subunits, we identified three genes, whose deduced amino acid sequences showed 58%, 74%, and 62% identity with those of Gmet_2267 (Fe-S oxidoreductase), Gmet_2266 (ETF beta subunit), and Gmet_2265 (ETF alpha subunit) respectively, which exist within the 300-kb genomic island of aromatic-degradation genes from Geobacter metallireducens GS-15. The genes encoding ETF subunits found in this study were upregulated in benzoate utilization.
Contribution of past and future self-defining event networks to personal identity.

PubMed

Demblon, Julie; D'Argembeau, Arnaud

2017-05-01

Personal identity is nourished by memories of significant past experiences and by the imagination of meaningful events that one anticipates to happen in the future. The organisation of such self-defining memories and prospective thoughts in the cognitive system has received little empirical attention, however. In the present study, our aims were to investigate to what extent self-defining memories and future projections are organised in networks of related events, and to determine the nature of the connections linking these events. Our results reveal the existence of self-defining event networks, composed of both memories and future events of similar centrality for identity and characterised by similar identity motives. These self-defining networks expressed a strong internal coherence and frequently organised events in meaningful themes and sequences (i.e., event clusters). Finally, we found that the satisfaction of identity motives in represented events and the presence of clustering across events both contributed to increase in the perceived centrality of events for the sense of identity. Overall, these findings suggest that personal identity is not only nourished by representations of significant past and future events, but also depends on the formation of coherent networks of related events that provide an overarching meaning to specific life experiences.
Phylogenetic analysis of phenotypically characterized Cryptococcus laurentii isolates reveals high frequency of cryptic species.

PubMed

Ferreira-Paim, Kennio; Ferreira, Thatiana Bragine; Andrade-Silva, Leonardo; Mora, Delio Jose; Springer, Deborah J; Heitman, Joseph; Fonseca, Fernanda Machado; Matos, Dulcilena; Melhem, Márcia Souza Carvalho; Silva-Vergara, Mario León

2014-01-01

Although Cryptococcus laurentii has been considered saprophytic and its taxonomy is still being described, several cases of human infections have already reported. This study aimed to evaluate molecular aspects of C. laurentii isolates from Brazil, Botswana, Canada, and the United States. In this study, 100 phenotypically identified C. laurentii isolates were evaluated by sequencing the 18S nuclear ribosomal small subunit rRNA gene (18S-SSU), D1/D2 region of 28S nuclear ribosomal large subunit rRNA gene (28S-LSU), and the internal transcribed spacer (ITS) of the ribosomal region. BLAST searches using 550-bp, 650-bp, and 550-bp sequenced amplicons obtained from the 18S-SSU, 28S-LSU, and the ITS region led to the identification of 75 C. laurentii strains that shared 99-100% identity with C. laurentii CBS 139. A total of nine isolates shared 99% identity with both Bullera sp. VY-68 and C. laurentii RY1. One isolate shared 99% identity with Cryptococcus rajasthanensis CBS 10406, and eight isolates shared 100% identity with Cryptococcus sp. APSS 862 according to the 28S-LSU and ITS regions and designated as Cryptococcus aspenensis sp. nov. (CBS 13867). While 16 isolates shared 99% identity with Cryptococcus flavescens CBS 942 according to the 18S-SSU sequence, only six were confirmed using the 28S-LSU and ITS region sequences. The remaining 10 shared 99% identity with Cryptococcus terrestris CBS 10810, which was recently described in Brazil. Through concatenated sequence analyses, seven sequence types in C. laurentii, three in C. flavescens, one in C. terrestris, and one in the C. aspenensis sp. nov. were identified. Sequencing permitted the characterization of 75% of the environmental C. laurentii isolates from different geographical areas and the identification of seven haplotypes of this species. Among sequenced regions, the increased variability of the ITS region in comparison to the 18S-SSU and 28S-LSU regions reinforces its applicability as a DNA barcode.
Mosaic Graphs and Comparative Genomics in Phage Communities

PubMed Central

Belcaid, Mahdi; Bergeron, Anne

2010-01-01

Abstract Comparing the genomes of two closely related viruses often produces mosaics where nearly identical sequences alternate with sequences that are unique to each genome. When several closely related genomes are compared, the unique sequences are likely to be shared with third genomes, leading to virus mosaic communities. Here we present comparative analysis of sets of Staphylococcus aureus phages that share large identical sequences with up to three other genomes, and with different partners along their genomes. We introduce mosaic graphs to represent these complex recombination events, and use them to illustrate the breath and depth of sequence sharing: some genomes are almost completely made up of shared sequences, while genomes that share very large identical sequences can adopt alternate functional modules. Mosaic graphs also allow us to identify breakpoints that could eventually be used for the construction of recombination networks. These findings have several implications on phage metagenomics assembly, on the horizontal gene transfer paradigm, and more generally on the understanding of the composition and evolutionary dynamics of virus communities. PMID:20874413
Complete nucleotide sequence of a monopartite Begomovirus and associated satellites infecting Carica papaya in Nepal.

PubMed

Shahid, M S; Yoshida, S; Khatri-Chhetri, G B; Briddon, R W; Natsuaki, K T

2013-06-01

Carica papaya (papaya) is a fruit crop that is cultivated mostly in kitchen gardens throughout Nepal. Leaf samples of C. papaya plants with leaf curling, vein darkening, vein thickening, and a reduction in leaf size were collected from a garden in Darai village, Rampur, Nepal in 2010. Full-length clones of a monopartite Begomovirus, a betasatellite and an alphasatellite were isolated. The complete nucleotide sequence of the Begomovirus showed the arrangement of genes typical of Old World begomoviruses with the highest nucleotide sequence identity (>99 %) to an isolate of Ageratum yellow vein virus (AYVV), confirming it as an isolate of AYVV. The complete nucleotide sequence of betasatellite showed greater than 89 % nucleotide sequence identity to an isolate of Tomato leaf curl Java betasatellite originating from Indonesian. The sequence of the alphasatellite displayed 92 % nucleotide sequence identity to Sida yellow vein China alphasatellite. This is the first identification of these components in Nepal and the first time they have been identified in papaya.
A highly conserved N-terminal sequence for teleost vitellogenin with potential value to the biochemistry, molecular biology and pathology of vitellogenesis

USGS Publications Warehouse

Folmar, L.D.; Denslow, N.D.; Wallace, R.A.; LaFleur, G.; Gross, T.S.; Bonomelli, S.; Sullivan, C.V.

1995-01-01

N-terminal amino acid sequences for vitellogenin (Vtg) from six species of teleost fish (striped bass, mummichog, pinfish, brown bullhead, medaka, yellow perch and the sturgeon) are compared with published N-terminal Vtg sequences for the lamprey, clawed frog and domestic chicken. Striped bass and mummichog had 100% identical amino acids between positions 7 and 21, while pinfish, brown bullhead, sturgeon, lamprey, Xenopus and chicken had 87%, 93%, 60%, 47%, 47-60%) for four transcripts and had 40% identical, respectively, with striped bass for the same positions. Partial sequences obtained for medaka and yellow perch were 100% identical between positions 5 to 10. The potential utility of this conserved sequence for studies on the biochemistry, molecular biology and pathology of vitellogenesis is discussed.
Detection and molecular characterization of infectious bronchitis virus isolated from recent outbreaks in broiler flocks in Thailand.

PubMed

Pohuang, Tawatchai; Chansiripornchai, Niwat; Tawatsin, Achara; Sasipreeyajan, Jiroj

2009-09-01

Thirteen field isolates of infectious bronchitis virus (IBV) were isolated from broiler flocks in Thailand between January and June 2008. The 878-bp of the S1 gene covering a hypervariable region was amplified and sequenced. Phylogenetic analysis based on that region revealed that these viruses were separated into two groups (I and II). IBV isolates in group I were not related to other IBV strains published in the GenBank database. Group 1 nucleotide sequence identities were less than 85% and amino acid sequence identities less than 84% in common with IBVs published in the GenBank database. This group likely represents the strains indigenous to Thailand. The isolates in group II showed a close relationship with Chinese IBVs. They had nucleotide sequence identities of 97-98% and amino acid sequence identities 96-98% in common with Chinese IBVs (strain A2, SH and QXIBV). This finding indicated that the recent Thai IBVs evolved separately and at least two groups of viruses are circulating in Thailand.
A Systematic Analysis of the Structures of Heterologously Expressed Proteins and Those from Their Native Hosts in the RCSB PDB Archive.

PubMed

Zhou, Ren-Bin; Lu, Hui-Meng; Liu, Jie; Shi, Jian-Yu; Zhu, Jing; Lu, Qin-Qin; Yin, Da-Chuan

2016-01-01

Recombinant expression of proteins has become an indispensable tool in modern day research. The large yields of recombinantly expressed proteins accelerate the structural and functional characterization of proteins. Nevertheless, there are literature reported that the recombinant proteins show some differences in structure and function as compared with the native ones. Now there have been more than 100,000 structures (from both recombinant and native sources) publicly available in the Protein Data Bank (PDB) archive, which makes it possible to investigate if there exist any proteins in the RCSB PDB archive that have identical sequence but have some difference in structures. In this paper, we present the results of a systematic comparative study of the 3D structures of identical naturally purified versus recombinantly expressed proteins. The structural data and sequence information of the proteins were mined from the RCSB PDB archive. The combinatorial extension (CE), FATCAT-flexible and TM-Align methods were employed to align the protein structures. The root-mean-square distance (RMSD), TM-score, P-value, Z-score, secondary structural elements and hydrogen bonds were used to assess the structure similarity. A thorough analysis of the PDB archive generated five-hundred-seventeen pairs of native and recombinant proteins that have identical sequence. There were no pairs of proteins that had the same sequence and significantly different structural fold, which support the hypothesis that expression in a heterologous host usually could fold correctly into their native forms.
Protein structure determination by exhaustive search of Protein Data Bank derived databases.

PubMed

Stokes-Rees, Ian; Sliz, Piotr

2010-12-14

Parallel sequence and structure alignment tools have become ubiquitous and invaluable at all levels in the study of biological systems. We demonstrate the application and utility of this same parallel search paradigm to the process of protein structure determination, benefitting from the large and growing corpus of known structures. Such searches were previously computationally intractable. Through the method of Wide Search Molecular Replacement, developed here, they can be completed in a few hours with the aide of national-scale federated cyberinfrastructure. By dramatically expanding the range of models considered for structure determination, we show that small (less than 12% structural coverage) and low sequence identity (less than 20% identity) template structures can be identified through multidimensional template scoring metrics and used for structure determination. Many new macromolecular complexes can benefit significantly from such a technique due to the lack of known homologous protein folds or sequences. We demonstrate the effectiveness of the method by determining the structure of a full-length p97 homologue from Trichoplusia ni. Example cases with the MHC/T-cell receptor complex and the EmoB protein provide systematic estimates of minimum sequence identity, structure coverage, and structural similarity required for this method to succeed. We describe how this structure-search approach and other novel computationally intensive workflows are made tractable through integration with the US national computational cyberinfrastructure, allowing, for example, rapid processing of the entire Structural Classification of Proteins protein fragment database.

A Systematic Analysis of the Structures of Heterologously Expressed Proteins and Those from Their Native Hosts in the RCSB PDB Archive

PubMed Central

Zhou, Ren-Bin; Lu, Hui-Meng; Liu, Jie; Shi, Jian-Yu; Zhu, Jing; Lu, Qin-Qin; Yin, Da-Chuan

2016-01-01

Recombinant expression of proteins has become an indispensable tool in modern day research. The large yields of recombinantly expressed proteins accelerate the structural and functional characterization of proteins. Nevertheless, there are literature reported that the recombinant proteins show some differences in structure and function as compared with the native ones. Now there have been more than 100,000 structures (from both recombinant and native sources) publicly available in the Protein Data Bank (PDB) archive, which makes it possible to investigate if there exist any proteins in the RCSB PDB archive that have identical sequence but have some difference in structures. In this paper, we present the results of a systematic comparative study of the 3D structures of identical naturally purified versus recombinantly expressed proteins. The structural data and sequence information of the proteins were mined from the RCSB PDB archive. The combinatorial extension (CE), FATCAT-flexible and TM-Align methods were employed to align the protein structures. The root-mean-square distance (RMSD), TM-score, P-value, Z-score, secondary structural elements and hydrogen bonds were used to assess the structure similarity. A thorough analysis of the PDB archive generated five-hundred-seventeen pairs of native and recombinant proteins that have identical sequence. There were no pairs of proteins that had the same sequence and significantly different structural fold, which support the hypothesis that expression in a heterologous host usually could fold correctly into their native forms. PMID:27517583
Cloning and sequence analysis of the invertase gene INV 1 from the yeast Pichia anomala.

PubMed

Pérez, J A; Rodríguez, J; Rodríguez, L; Ruiz, T

1996-02-01

A genomic library from the yeast Pichia anomala has been constructed and employed to clone the gene encoding the sucrose-hydrolysing enzyme invertase by complementation of a sucrose non-fermenting mutant of Saccharomyces cerevisiae. The cloned gene, INV1, was sequenced and found to encode a polypeptide of 550 amino acids which contained a 22 amino-acid signal sequence and ten potential glycosylation sites. The amino-acid sequence shows significant identity with other yeast invertases and also with Kluyveromyces marxianus inulinase, a yeast beta-fructofuranosidase which has a different substrate specificity. The nucleotide sequences of the 5' and 3' non-coding regions were found to contain several consensus motifs probably involved in the initiation and termination of gene transcription.
Characterization of a variant strain of Norwalk virus from a food-borne outbreak of gastroenteritis on a cruise ship in Hawaii.

PubMed Central

Herwaldt, B L; Lew, J F; Moe, C L; Lewis, D C; Humphrey, C D; Monroe, S S; Pon, E W; Glass, R I

1994-01-01

A gastroenteritis outbreak affecting at least 217 (41%) of 527 passengers on a cruise ship was caused by a variant strain of Norwalk virus (NV) that is related to but distinct from the prototype NV strain. Consumption of fresh-cut fruit served at two buffets was significantly associated with illness (P < or = 0.01), and a significant dose-response relationship was evident between illness and the number of various fresh-cut fruit items eaten. Seven (58%) of 12 paired serum specimens from ill persons demonstrated at least fourfold rises in antibody response to recombinant NV capsid antigen. A 32-nm small round-structured virus was visualized by electron microscopy in 4 (29%) of 14 fecal specimens, but none of the 8 specimens that were examined by an enzyme immunoassay for NV antigen demonstrated antigen. Four (40%) of 10 fecal specimens were positive by reverse transcriptase-PCR by using primer pairs selected from the polymerase region of NV. In a 145-bp region, the PCR product shared only 72% nucleotide sequence identity with the reference NV strain and 77% nucleotide sequence identity with Southampton virus but shared 95% nucleotide sequence identity with UK2 virus, a United Kingdom reference virus strain. In addition, the outbreak virus was serotyped as UK2 virus by solid-phase immune electron microscopy. The genetic and antigenic divergence of the outbreak strain from the reference NV strain highlights the need for more broadly reactive diagnostic assays and for improved understanding of the relatedness of the NV group of agents. Images PMID:8027335
Identification of a Divergent Environmental DNA Sequence Clade Using the Phylogeny of Gregarine Parasites (Apicomplexa) from Crustacean Hosts

PubMed Central

Rueckert, Sonja; Simdyanov, Timur G.; Aleoshin, Vladimir V.; Leander, Brian S.

2011-01-01

Background Environmental SSU rDNA surveys have significantly improved our understanding of microeukaryotic diversity. Many of the sequences acquired using this approach are closely related to lineages previously characterized at both morphological and molecular levels, making interpretation of these data relatively straightforward. Some sequences, by contrast, appear to be phylogenetic orphans and are sometimes inferred to represent “novel lineages” of unknown cellular identity. Consequently, interpretation of environmental DNA surveys of cellular diversity rely on an adequately comprehensive database of DNA sequences derived from identified species. Several major taxa of microeukaryotes, however, are still very poorly represented in these databases, and this is especially true for diverse groups of single-celled parasites, such as gregarine apicomplexans. Methodology/Principal Findings This study attempts to address this paucity of DNA sequence data by characterizing four different gregarine species, isolated from the intestines of crustaceans, at both morphological and molecular levels: Thiriotia pugettiae sp. n. from the graceful kelp crab (Pugettia gracilis), Cephaloidophora cf. communis from two different species of barnacles (Balanus glandula and B. balanus), Heliospora cf. longissima from two different species of freshwater amphipods (Eulimnogammarus verrucosus and E. vittatus), and Heliospora caprellae comb. n. from a skeleton shrimp (Caprella alaskana). SSU rDNA sequences were acquired from isolates of these gregarine species and added to a global apicomplexan alignment containing all major groups of gregarines characterized so far. Molecular phylogenetic analyses of these data demonstrated that all of the gregarines collected from crustacean hosts formed a very strongly supported clade with 48 previously unidentified environmental DNA sequences. Conclusions/Significance This expanded molecular phylogenetic context enabled us to establish a major clade of intestinal gregarine parasites and infer the cellular identities of several previously unidentified environmental SSU rDNA sequences, including several sequences that have formerly been discussed broadly in the literature as a suspected “novel” lineage of eukaryotes. PMID:21483868
Subtype Distribution of Blastocystis Isolates in Sebha, Libya

PubMed Central

Abdulsalam, Awatif M.; Ithoi, Init; Al-Mekhlafi, Hesham M.; Al-Mekhlafi, Abdulsalam M.; Ahmed, Abdulhamid; Surin, Johari

2013-01-01

Background Blastocystis is a genetically diverse and a common intestinal parasite of humans with a controversial pathogenic potential. This study was carried out to identify the Blastocystis subtypes and their association with demographic and socioeconomic factors among outpatients living in Sebha city, Libya. Methods/Findings Blastocystis in stool samples were cultured followed by isolation, PCR amplification of a partial SSU rDNA gene, cloning, and sequencing. The DNA sequences of isolated clones showed 98.3% to 100% identity with the reference Blastocystis isolates from the Genbank. Multiple sequence alignment showed polymorphism from one to seven base substitution and/or insertion/deletion in several groups of non-identical nucleotides clones. Phylogenetic analysis revealed three assemblage subtypes (ST) with ST1 as the most prevalent (51.1%) followed by ST2 (24.4%), ST3 (17.8%) and mixed infections of two concurrent subtypes (6.7%). Blastocystis ST1 infection was significantly associated with female (P = 0.009) and low educational level (P = 0.034). ST2 was also significantly associated with low educational level (P= 0.008) and ST3 with diarrhoea (P = 0.008). Conclusion Phylogenetic analysis of Libyan Blastocystis isolates identified three different subtypes; with ST1 being the predominant subtype and its infection was significantly associated with female gender and low educational level. More extensive studies are needed in order to relate each Blastocystis subtype with clinical symptoms and potential transmission sources in this community. PMID:24376805
Subtype distribution of Blastocystis isolates in Sebha, Libya.

PubMed

Abdulsalam, Awatif M; Ithoi, Init; Al-Mekhlafi, Hesham M; Al-Mekhlafi, Abdulsalam M; Ahmed, Abdulhamid; Surin, Johari

2013-01-01

Blastocystis is a genetically diverse and a common intestinal parasite of humans with a controversial pathogenic potential. This study was carried out to identify the Blastocystis subtypes and their association with demographic and socioeconomic factors among outpatients living in Sebha city, Libya. Blastocystis in stool samples were cultured followed by isolation, PCR amplification of a partial SSU rDNA gene, cloning, and sequencing. The DNA sequences of isolated clones showed 98.3% to 100% identity with the reference Blastocystis isolates from the Genbank. Multiple sequence alignment showed polymorphism from one to seven base substitution and/or insertion/deletion in several groups of non-identical nucleotides clones. Phylogenetic analysis revealed three assemblage subtypes (ST) with ST1 as the most prevalent (51.1%) followed by ST2 (24.4%), ST3 (17.8%) and mixed infections of two concurrent subtypes (6.7%). ST1 infection was significantly associated with female (P = 0.009) and low educational level (P = 0.034). ST2 was also significantly associated with low educational level (P= 0.008) and ST3 with diarrhoea (P = 0.008). Phylogenetic analysis of Libyan Blastocystis isolates identified three different subtypes; with ST1 being the predominant subtype and its infection was significantly associated with female gender and low educational level. More extensive studies are needed in order to relate each Blastocystis subtype with clinical symptoms and potential transmission sources in this community.
Molecular characterization of two genotypes of a new polerovirus infecting brassicas in China.

PubMed

Xiang, Hai-Ying; Dong, Shu-Wei; Shang, Qiao-Xia; Zhou, Cui-Ji; Li, Da-Wei; Yu, Jia-Lin; Han, Cheng-Gui

2011-12-01

The genomic RNA sequences of two genotypes of a brassica-infecting polerovirus from China were determined. Sequence analysis revealed that the virus was closely related to but significantly different from turnip yellows virus (TuYV). This virus and other poleroviruses, including TuYV, had less than 90% amino acid sequence identity in all gene products except the coat protein. Based on the molecular criterion (>10% amino acid sequence difference) for species demarcation in the genus Polerovirus, the virus represents a distinct species for which the name Brassica yellows virus (BrYV) is proposed. Interestingly, there were two genotypes of BrYV, which mainly differed in the 5'-terminal half of the genome.
The bacteria and bacteriophages from a Mesquite Flats site of the Death Valley desert.

PubMed

Prestel, Eric; Regeard, Christophe; Salamitou, Sylvie; Neveu, Julie; Dubow, Michael S

2013-06-01

Arid zones cover over 30 % of the Earth's continental surface. In order to better understand the role of microbes in this type of harsh environment, we isolated and characterized the bacteriophages from samples of the surface sand of the Mesquite Flats region via electron microscopy and DNA sequencing of a select number of cloned phage DNAs. An electron microscopic analysis of the recovered virus-like particles revealed at least 11 apparently different morphotypes sharing structural characteristics of the Caudoviridae family of tailed phages. We found that 36 % of the sequences contained no significant identity (e-value >10(-3)) with sequences in the databases. Pilot sequencing of cloned 16S rRNA genes identified Bacteroidetes and Proteobacteria as the major bacterial groups present in this severe environment. The majority of the 16S rDNA sequences from the total (uncultured) bacterial population displayed ≤96 % identity to 16S rRNA genes in the database, suggesting an unexplored bacterial population likely adapted to a desert environment. In addition, we also isolated and identified 38 cultivable bacterial strains, the majority of which belonged to the genus Bacillus. Mitomycin-C treatment of the cultivable bacteria demonstrated that the vast majority (84 %) contained at least one SOS-inducible prophage.
Sequence of a second gene encoding bovine submaxillary mucin: implication for mucin heterogeneity and cloning.

PubMed

Jiang, W; Woitach, J T; Gupta, D; Bhavanandan, V P

1998-10-20

Secreted epithelial mucins are extremely large and heterogeneous glycoproteins. We report the 5 kilobase DNA sequence of a second gene, BSM2, which encodes bovine submaxillary mucin. The determined nucleotide and deduced amino acid sequences of BSM2 are 95.2% and 92. 2% identical, respectively, to those of the previously described BSM1 gene isolated from the same cow. Further, the five predicted protein domains of the two genes are 100%, 94%, 93%, 77%, and 88% identical. Based on the above results, we propose that expression of multiple homologous core proteins from a single animal is a factor in generating diversity of saccharides in mucins and in providing resistance of the molecules to proteolysis. In addition, this work raises several important issues in mucin cloning such as assembling sequences from seemingly overlapping clones and deducing consensus sequences for nearly identical tandem repeats. Copyright 1998 Academic Press.
Identification of a new Apscaviroid from Japanese persimmon.

PubMed

Nakaune, Ryoji; Nakano, Masaaki

2008-01-01

Three viroid-like sequences were detected from Japanese persimmon (Diospyrus kaki Thunb.) by RT-PCR using primers specific for members of the genus Apscaviroid. Based on the sequences, we determined the complete genomic sequences. Two had 92.1-94.3% sequence identity with citrus viroid OS (CVd-OS) and 91.4-96.3% identity with apple fruit crinkle viroid (AFCVd), respectively. Another one, tentatively named persimmon viroid (PVd), had 396 nucleotides and less than 70% sequence identity with known viroids. The secondary structure of PVd is proposed to be rod-like with extensive base pairing and contains the terminal conserved region and the central conserved region characteristic of the genus Apscaviroid. Moreover, we confirmed that the viroids, including PVd, are graft transmissible from persimmon to persimmon and that persimmon is a natural host of these viroids. According to its molecular and biological properties, PVd should be considered a member of a new species in the genus Apscaviroid.
Genomic Structure of the Luciferase Gene from the Bioluminescent Beetle, Nyctophila cf. Caucasica

PubMed Central

Day, John C.; Chaichi, Mohammad J.; Najafil, Iraj; Whiteley, Andrew S.

2006-01-01

The gene coding for beetle luciferase, the enzyme responsible for bioluminescence in over two thousand coleopteran species has, to date, only been characterized from one Palearctic species of Lampyridae. Here we report the characterization of the luciferase gene from a female beetle of an Iranian lampyrid species, Nyctophila cf. caucasica (Coleoptera:Lampyridae). The luciferase gene was composed of seven exons, coding for 547 amino acids, separated by six introns spanning 1976 bp of genomic DNA. The deduced amino acid sequences of the luciferase gene of N. caucasica showed 98.9% homology to that of the Palearctic species Lampyris noctiluca. Analysis of the 810 bp upstream region of the luciferase gene revealed three TATA boxes and several other consensus transcriptional factor recognition sequences presenting evidence for a putative core promoter region conserved in Lampyrinae from -190 through to -155 upstream of the luciferase start codon. Along with the core promoter region the luciferase gene was compared with orthologous sequences from other lampyrid species and found to have greatest identity to Lampyris turkistanicus and Lampyris noctiluca. The significant sequence identity to the former is discussed in relation to taxonomic issues of Iranian lampyrids. PMID:20298115
A novel begomovirus isolated from sida contains putative cis- and trans-acting replication specificity determinants that have evolved independently in several geographical lineages.

PubMed

Mauricio-Castillo, J A; Torres-Herrera, S I; Cárdenas-Conejo, Y; Pastor-Palacios, G; Méndez-Lozano, J; Argüello-Astorga, G R

2014-09-01

A novel begomovirus isolated from a Sida rhombifolia plant collected in Sinaloa, Mexico, was characterized. The genomic components of sida mosaic Sinaloa virus (SiMSinV) shared highest sequence identity with DNA-A and DNA-B components of chino del tomate virus (CdTV), suggesting a vertical evolutionary relationship between these viruses. However, recombination analysis indicated that a short segment of SiMSinV DNA-A encompassing the plus-strand replication origin and the 5´-proximal 43 codons of the Rep gene was derived from tomato mottle Taino virus (ToMoTV). Accordingly, the putative cis- and trans-acting replication specificity determinants of SiMSinV were identical to those of ToMoTV but differed from those of CdTV. Modeling of the SiMSinV and CdTV Rep proteins revealed significant differences in the region comprising the small β1/β5 sheet element, where five putative DNA-binding specificity determinants (SPDs) of Rep (i.e., amino acid residues 5, 8, 10, 69 and 71) were previously identified. Computer-assisted searches of public databases led to identification of 33 begomoviruses from three continents encoding proteins with SPDs identical to those of the Rep encoded by SiMSinV. Sequence analysis of the replication origins demonstrated that all 33 begomoviruses harbor potential Rep-binding sites identical to those of SiMSinV. These data support the hypothesis that the Rep β1/β5 sheet region determines specificity of this protein for DNA replication origin sequences.
The kinetoplast DNA of the Australian trypanosome, Trypanosoma copemani, shares features with Trypanosoma cruzi and Trypanosoma lewisi.

PubMed

Botero, Adriana; Kapeller, Irit; Cooper, Crystal; Clode, Peta L; Shlomai, Joseph; Thompson, R C Andrew

2018-05-17

Kinetoplast DNA (kDNA) is the mitochondrial genome of trypanosomatids. It consists of a few dozen maxicircles and several thousand minicircles, all catenated topologically to form a two-dimensional DNA network. Minicircles are heterogeneous in size and sequence among species. They present one or several conserved regions that contain three highly conserved sequence blocks. CSB-1 (10 bp sequence) and CSB-2 (8 bp sequence) present lower interspecies homology, while CSB-3 (12 bp sequence) or the Universal Minicircle Sequence is conserved within most trypanosomatids. The Universal Minicircle Sequence is located at the replication origin of the minicircles, and is the binding site for the UMS binding protein, a protein involved in trypanosomatid survival and virulence. Here, we describe the structure and organisation of the kDNA of Trypanosoma copemani, a parasite that has been shown to infect mammalian cells and has been associated with the drastic decline of the endangered Australian marsupial, the woylie (Bettongia penicillata). Deep genomic sequencing showed that T. copemani presents two classes of minicircles that share sequence identity and organisation in the conserved sequence blocks with those of Trypanosoma cruzi and Trypanosoma lewisi. A 19,257 bp partial region of the maxicircle of T. copemani that contained the entire coding region was obtained. Comparative analysis of the T. copemani entire maxicircle coding region with the coding regions of T. cruzi and T. lewisi showed they share 71.05% and 71.28% identity, respectively. The shared features in the maxicircle/minicircle organisation and sequence between T. copemani and T. cruzi/T. lewisi suggest similarities in their process of kDNA replication, and are of significance in understanding the evolution of Australian trypanosomes. Copyright © 2018 The Authors. Published by Elsevier Ltd.. All rights reserved.
Taxonomic evaluation of Streptomyces albus and related species using multilocus sequence analysis and proposals to emend the description of Streptomyces albus and describe Streptomyces pathocidini sp. nov

USDA-ARS?s Scientific Manuscript database

In phylogenetic analyses of the genus Streptomyces using 16S rRNA gene sequences, Streptomyces albus subsp. albus NRRL B-1811T forms a cluster with 5 other species having identical or nearly identical 16S rRNA gene sequences. Moreover, the morphological and physiological characteristics of these oth...
Utility of 16S rDNA Sequencing for Identification of Rare Pathogenic Bacteria.

PubMed

Loong, Shih Keng; Khor, Chee Sieng; Jafar, Faizatul Lela; AbuBakar, Sazaly

2016-11-01

Phenotypic identification systems are established methods for laboratory identification of bacteria causing human infections. Here, the utility of phenotypic identification systems was compared against 16S rDNA identification method on clinical isolates obtained during a 5-year study period, with special emphasis on isolates that gave unsatisfactory identification. One hundred and eighty-seven clinical bacteria isolates were tested with commercial phenotypic identification systems and 16S rDNA sequencing. Isolate identities determined using phenotypic identification systems and 16S rDNA sequencing were compared for similarity at genus and species level, with 16S rDNA sequencing as the reference method. Phenotypic identification systems identified ~46% (86/187) of the isolates with identity similar to that identified using 16S rDNA sequencing. Approximately 39% (73/187) and ~15% (28/187) of the isolates showed different genus identity and could not be identified using the phenotypic identification systems, respectively. Both methods succeeded in determining the species identities of 55 isolates; however, only ~69% (38/55) of the isolates matched at species level. 16S rDNA sequencing could not determine the species of ~20% (37/187) of the isolates. The 16S rDNA sequencing is a useful method over the phenotypic identification systems for the identification of rare and difficult to identify bacteria species. The 16S rDNA sequencing method, however, does have limitation for species-level identification of some bacteria highlighting the need for better bacterial pathogen identification tools. © 2016 Wiley Periodicals, Inc.
Remarkable sequence conservation of the last intron in the PKD1 gene.

PubMed

Rodova, Marianna; Islam, M Rafiq; Peterson, Kenneth R; Calvet, James P

2003-10-01

The last intron of the PKD1 gene (intron 45) was found to have exceptionally high sequence conservation across four mammalian species: human, mouse, rat, and dog. This conservation did not extend to the comparable intron in pufferfish. Pairwise comparisons for intron 45 showed 91% identity (human vs. dog) to 100% identity (mouse vs. rat) for an average for all four species of 94% identity. In contrast, introns 43 and 44 of the PKD1 gene had average pairwise identities of 57% and 54%, and exons 43, 44, and 45 and the coding region of exon 46 had average pairwise identities of 80%, 84%, 82%, and 80%. Intron 45 is 90 to 95 bp in length, with the major region of sequence divergence being in a central 4-bp to 9-bp variable region. RNA secondary structure analysis of intron 45 predicts a branching stem-loop structure in which the central variable region lies in one loop and the putative branch point sequence lies in another loop, suggesting that the intron adopts a specific stem-loop structure that may be important for its removal. Although intron 45 appears to conform to the class of small, G-triplet-containing introns that are spliced by a mechanism utilizing intron definition, its high sequence conservation may be a reflection of constraints imposed by a unique mechanism that coordinates splicing of this last PKD1 intron with polyadenylation.
tRNADB-CE: tRNA gene database well-timed in the era of big sequence data.

PubMed

Abe, Takashi; Inokuchi, Hachiro; Yamada, Yuko; Muto, Akira; Iwasaki, Yuki; Ikemura, Toshimichi

2014-01-01

The tRNA gene data base curated by experts "tRNADB-CE" (http://trna.ie.niigata-u.ac.jp) was constructed by analyzing 1,966 complete and 5,272 draft genomes of prokaryotes, 171 viruses', 121 chloroplasts', and 12 eukaryotes' genomes plus fragment sequences obtained by metagenome studies of environmental samples. 595,115 tRNA genes in total, and thus two times of genes compiled previously, have been registered, for which sequence, clover-leaf structure, and results of sequence-similarity and oligonucleotide-pattern searches can be browsed. To provide collective knowledge with help from experts in tRNA researches, we added a column for enregistering comments to each tRNA. By grouping bacterial tRNAs with an identical sequence, we have found high phylogenetic preservation of tRNA sequences, especially at the phylum level. Since many species-unknown tRNAs from metagenomic sequences have sequences identical to those found in species-known prokaryotes, the identical sequence group (ISG) can provide phylogenetic markers to investigate the microbial community in an environmental ecosystem. This strategy can be applied to a huge amount of short sequences obtained from next-generation sequencers, as showing that tRNADB-CE is a well-timed database in the era of big sequence data. It is also discussed that batch-learning self-organizing-map with oligonucleotide composition is useful for efficient knowledge discovery from big sequence data.
Bottom-up driven involuntary auditory evoked field change: constant sound sequencing amplifies but does not sharpen neural activity.

PubMed

Okamoto, Hidehiko; Stracke, Henning; Lagemann, Lothar; Pantev, Christo

2010-01-01

The capability of involuntarily tracking certain sound signals during the simultaneous presence of noise is essential in human daily life. Previous studies have demonstrated that top-down auditory focused attention can enhance excitatory and inhibitory neural activity, resulting in sharpening of frequency tuning of auditory neurons. In the present study, we investigated bottom-up driven involuntary neural processing of sound signals in noisy environments by means of magnetoencephalography. We contrasted two sound signal sequencing conditions: "constant sequencing" versus "random sequencing." Based on a pool of 16 different frequencies, either identical (constant sequencing) or pseudorandomly chosen (random sequencing) test frequencies were presented blockwise together with band-eliminated noises to nonattending subjects. The results demonstrated that the auditory evoked fields elicited in the constant sequencing condition were significantly enhanced compared with the random sequencing condition. However, the enhancement was not significantly different between different band-eliminated noise conditions. Thus the present study confirms that by constant sound signal sequencing under nonattentive listening the neural activity in human auditory cortex can be enhanced, but not sharpened. Our results indicate that bottom-up driven involuntary neural processing may mainly amplify excitatory neural networks, but may not effectively enhance inhibitory neural circuits.
Molecular characterization of a distinct monopartite begomovirus associated with betasatellites and alphasatellites infecting Pisum sativum in Nepal.

PubMed

Shahid, M S; Pudashini, B J; Khatri-Chhetri, G B; Briddon, R W; Natsuaki, K T

2017-04-01

Pea (Pisum sativum) plants exhibiting leaf distortion, yellowing, stunted growth and reduction in leaf size from Rampur, Nepal were shown to be infected by a begomovirus in association with betasatellites and alphasatellites. The begomovirus associated with the disease showed only low levels of nucleotide sequence identity (<91%) to previously characterized begomoviruses. This finding indicates that the pea samples were infected with an as yet undescribed begomovirus for which the name Pea leaf distortion virus (PLDV) is proposed. Two species of betasatellite were identified in association with PLDV. One group of sequences had high (>78%) nucleotide sequence identity to isolates of Ludwigia leaf distortion betasatellite (LuLDB), and the second group had less than 78% to all other betasatellite sequences. This showed PLDV to be associated with either LuLDB or a previously undescribed betasatellite for which the name Pea leaf distortion betasatellite is proposed. Two types of alphasatellites were identified in the PLDV-infected pea plants. The first type showed high levels of sequence identity to Ageratum yellow vein alphasatellite, and the second type showed high levels of identity to isolates of Sida yellow vein China alphasatellite. These are the first begomovirus, betasatellites and alphasatellites isolated from pea.
Oligonucleotide fingerprinting of rRNA genes for analysis of fungal community composition.

PubMed

Valinsky, Lea; Della Vedova, Gianluca; Jiang, Tao; Borneman, James

2002-12-01

Thorough assessments of fungal diversity are currently hindered by technological limitations. Here we describe a new method for identifying fungi, oligonucleotide fingerprinting of rRNA genes (OFRG). ORFG sorts arrayed rRNA gene (ribosomal DNA [rDNA]) clones into taxonomic clusters through a series of hybridization experiments, each using a single oligonucleotide probe. A simulated annealing algorithm was used to design an OFRG probe set for fungal rDNA. Analysis of 1,536 fungal rDNA clones derived from soil generated 455 clusters. A pairwise sequence analysis showed that clones with average sequence identities of 99.2% were grouped into the same cluster. To examine the accuracy of the taxonomic identities produced by this OFRG experiment, we determined the nucleotide sequences for 117 clones distributed throughout the tree. For all but two of these clones, the taxonomic identities generated by this OFRG experiment were consistent with those generated by a nucleotide sequence analysis. Eighty-eight percent of the clones were affiliated with Ascomycota, while 12% belonged to BASIDIOMYCOTA: A large fraction of the clones were affiliated with the genera Fusarium (404 clones) and Raciborskiomyces (176 clones). Smaller assemblages of clones had high sequence identities to the Alternaria, Ascobolus, Chaetomium, Cryptococcus, and Rhizoctonia clades.

Phylogenetic Analysis of Phenotypically Characterized Cryptococcus laurentii Isolates Reveals High Frequency of Cryptic Species

PubMed Central

Ferreira-Paim, Kennio; Ferreira, Thatiana Bragine; Andrade-Silva, Leonardo; Mora, Delio Jose; Springer, Deborah J.; Heitman, Joseph; Fonseca, Fernanda Machado; Matos, Dulcilena; Melhem, Márcia Souza Carvalho; Silva-Vergara, Mario León

2014-01-01

Background Although Cryptococcus laurentii has been considered saprophytic and its taxonomy is still being described, several cases of human infections have already reported. This study aimed to evaluate molecular aspects of C. laurentii isolates from Brazil, Botswana, Canada, and the United States. Methods In this study, 100 phenotypically identified C. laurentii isolates were evaluated by sequencing the 18S nuclear ribosomal small subunit rRNA gene (18S-SSU), D1/D2 region of 28S nuclear ribosomal large subunit rRNA gene (28S-LSU), and the internal transcribed spacer (ITS) of the ribosomal region. Results BLAST searches using 550-bp, 650-bp, and 550-bp sequenced amplicons obtained from the 18S-SSU, 28S-LSU, and the ITS region led to the identification of 75 C. laurentii strains that shared 99–100% identity with C. laurentii CBS 139. A total of nine isolates shared 99% identity with both Bullera sp. VY-68 and C. laurentii RY1. One isolate shared 99% identity with Cryptococcus rajasthanensis CBS 10406, and eight isolates shared 100% identity with Cryptococcus sp. APSS 862 according to the 28S-LSU and ITS regions and designated as Cryptococcus aspenensis sp. nov. (CBS 13867). While 16 isolates shared 99% identity with Cryptococcus flavescens CBS 942 according to the 18S-SSU sequence, only six were confirmed using the 28S-LSU and ITS region sequences. The remaining 10 shared 99% identity with Cryptococcus terrestris CBS 10810, which was recently described in Brazil. Through concatenated sequence analyses, seven sequence types in C. laurentii, three in C. flavescens, one in C. terrestris, and one in the C. aspenensis sp. nov. were identified. Conclusions Sequencing permitted the characterization of 75% of the environmental C. laurentii isolates from different geographical areas and the identification of seven haplotypes of this species. Among sequenced regions, the increased variability of the ITS region in comparison to the 18S-SSU and 28S-LSU regions reinforces its applicability as a DNA barcode. PMID:25251413
Full Genome Characterisation of Bluetongue Virus Serotype 6 from the Netherlands 2008 and Comparison to Other Field and Vaccine Strains

PubMed Central

Maan, Sushila; Maan, Narender S.; van Rijn, Piet A.; van Gennip, René G. P.; Sanders, Anna; Wright, Isabel M.; Batten, Carrie; Hoffmann, Bernd; Eschbaumer, Michael; Oura, Chris A. L.; Potgieter, Abraham C.; Nomikou, Kyriaki; Mertens, Peter P.C.

2010-01-01

In mid September 2008, clinical signs of bluetongue (particularly coronitis) were observed in cows on three different farms in eastern Netherlands (Luttenberg, Heeten, and Barchem), two of which had been vaccinated with an inactivated BTV-8 vaccine (during May-June 2008). Bluetongue virus (BTV) infection was also detected on a fourth farm (Oldenzaal) in the same area while testing for export. BTV RNA was subsequently identified by real time RT-PCR targeting genome-segment (Seg-) 10, in blood samples from each farm. The virus was isolated from the Heeten sample (IAH “dsRNA virus reference collection” [dsRNA-VRC] isolate number NET2008/05) and typed as BTV-6 by RT-PCR targeting Seg-2. Sequencing confirmed the virus type, showing an identical Seg-2 sequence to that of the South African BTV-6 live-vaccine-strain. Although most of the other genome segments also showed very high levels of identity to the BTV-6 vaccine (99.7 to 100%), Seg-10 showed greatest identity (98.4%) to the BTV-2 vaccine (RSAvvv2/02), indicating that NET2008/05 had acquired a different Seg-10 by reassortment. Although Seg-7 from NET2008/05 was also most closely related to the BTV-6 vaccine (99.7/100% nt/aa identity), the Seg-7 sequence derived from the blood sample of the same animal (NET2008/06) was identical to that of the Netherlands BTV-8 (NET2006/04 and NET2007/01). This indicates that the blood contained two different Seg-7 sequences, one of which (from the BTV-6 vaccine) was selected during virus isolation in cell-culture. The predominance of the BTV-8 Seg-7 in the blood sample suggests that the virus was in the process of reassorting with the northern field strain of BTV-8. Two genome segments of the virus showed significant differences from the BTV-6 vaccine, indicating that they had been acquired by reassortment event with BTV-8, and another unknown parental-strain. However, the route by which BTV-6 and BTV-8 entered northern Europe was not established. PMID:20428242
Finding similar nucleotide sequences using network BLAST searches.

PubMed

Ladunga, Istvan

2009-06-01

The Basic Local Alignment Search Tool (BLAST) is a keystone of bioinformatics due to its performance and user-friendliness. Beginner and intermediate users will learn how to design and submit blastn and Megablast searches on the Web pages at the National Center for Biotechnology Information. We map nucleic acid sequences to genomes, find identical or similar mRNA, expressed sequence tag, and noncoding RNA sequences, and run Megablast searches, which are much faster than blastn. Understanding results is assisted by taxonomy reports, genomic views, and multiple alignments. We interpret expected frequency thresholds, biological significance, and statistical significance. Weak hits provide no evidence, but hints for further analyses. We find genes that may code for homologous proteins by translated BLAST. We reduce false positives by filtering out low-complexity regions. Parsed BLAST results can be integrated into analysis pipelines. Links in the output connect to Entrez, PUBMED, structural, sequence, interaction, and expression databases. This facilitates integration with a wide spectrum of biological knowledge.
SPARSE: quadratic time simultaneous alignment and folding of RNAs without sequence-based heuristics

PubMed Central

Will, Sebastian; Otto, Christina; Miladi, Milad; Möhl, Mathias; Backofen, Rolf

2015-01-01

Motivation: RNA-Seq experiments have revealed a multitude of novel ncRNAs. The gold standard for their analysis based on simultaneous alignment and folding suffers from extreme time complexity of O(n6). Subsequently, numerous faster ‘Sankoff-style’ approaches have been suggested. Commonly, the performance of such methods relies on sequence-based heuristics that restrict the search space to optimal or near-optimal sequence alignments; however, the accuracy of sequence-based methods breaks down for RNAs with sequence identities below 60%. Alignment approaches like LocARNA that do not require sequence-based heuristics, have been limited to high complexity (≥ quartic time). Results: Breaking this barrier, we introduce the novel Sankoff-style algorithm ‘sparsified prediction and alignment of RNAs based on their structure ensembles (SPARSE)’, which runs in quadratic time without sequence-based heuristics. To achieve this low complexity, on par with sequence alignment algorithms, SPARSE features strong sparsification based on structural properties of the RNA ensembles. Following PMcomp, SPARSE gains further speed-up from lightweight energy computation. Although all existing lightweight Sankoff-style methods restrict Sankoff’s original model by disallowing loop deletions and insertions, SPARSE transfers the Sankoff algorithm to the lightweight energy model completely for the first time. Compared with LocARNA, SPARSE achieves similar alignment and better folding quality in significantly less time (speedup: 3.7). At similar run-time, it aligns low sequence identity instances substantially more accurate than RAF, which uses sequence-based heuristics. Availability and implementation: SPARSE is freely available at http://www.bioinf.uni-freiburg.de/Software/SPARSE. Contact: backofen@informatik.uni-freiburg.de Supplementary information: Supplementary data are available at Bioinformatics online. PMID:25838465
Cloning and expression of Bartonella henselae sucB gene encoding an immunogenic dihydrolipoamide succinyltransferase homologous protein.

PubMed

Kabeya, Hidenori; Maruyama, Soichi; Hirano, Kouji; Mikami, Takeshi

2003-01-01

Immunoscreening of a ZAP genomic library of Bartonella henselae strain Houston-1 expressed in Escherichia coli resulted in the isolation of a clone containing 3.5 kb BamHI genomic DNA fragment. This 3.5 kb DNA fragment was found to contain a sequence of a gene encoding a protein with significant homology to the dihydrolipoamide succinyltransferase of Brucella melitensis (sucB). Subsequent cloning and DNA sequence analysis revealed that the deduced amino acid sequence from the cloned gene showed 66.5% identity to SucB protein of B. melitensis, and 43.4 and 47.2% identities to those of Coxiella burnetii and E. coli, respectively. The gene was expressed as a His-Nus A-tagged fusion protein. The recombinant SucB protein (rSucB) was shown to be an immunoreactive protein of about 115 kDa by Western blot analysis with sera from B. henselae-immunized mice. Therefore the rSucB may be a candidate antigen for a specific serological diagnosis of B. henselae infection.
Isolation of Cryptococcus laurentii from Canada Goose guano in rural upstate New York.

PubMed

Filion, Tera; Kidd, Sarah; Aguirre, Karen

2006-11-01

Cryptococcus neoformans and Cryptococcus gattii are etiologic agents of cryptococcal pneumonia and meningitis, potentially lethal syndromes associated with AIDS. A related species, Cryptococcus laurentii, has recently been implicated in several cases of human disease. Guano from Canada Goose (Branta canadensis), an organism that lives closely beside man and inhabits recreational space in rural and suburban areas, might be a significant environmental reservoir of Cryptococcus organisms in non-urban areas. Cryptococcal organisms were isolated from Canada Goose guano from a site in rural northern New York, with identification based upon colony and microscopic morphology, ability to metabolize L: -Dopa to melanin, and positive reaction with a commercial anti-cryptococcal capsular polysaccharide latex bead agglutination test. DNA sequences from five positive isolates were identical to each other, and identical to the ITS1-5.8S-ITS2 sequences of C. laurentii strain CBS7140 (Accession AY315665) across a 511 bp sequence. All five isolates of C. laurentii possess three of the known virulence factors common to cryptococcal organisms that cause human disease: capsule, ability to grow at 37 degrees C, and laccase activity.
Genetic variability of Rickettsia spp. in Ixodes persulcatus ticks from continental and island areas of the Russian Far East.

PubMed

Igolkina, Y; Bondarenko, E; Rar, V; Epikhina, T; Vysochina, N; Pukhovskaya, N; Tikunov, A; Ivanov, L; Golovljova, I; Ivanov, М; Tikunova, N

2016-10-01

Rickettsia spp. are intracellular Gram-negative bacteria transmitted by arthropods. Two potentially pathogenic rickettsiae, Candidatus Rickettsia tarasevichiae and Rickettsia helvetica, have been found in unfed adult Ixodes persulcatus ticks. The aim of this study was to assess the prevalence and genetic variability of Rickettsia spp. in I. persulcatus ticks collected from different locations in the Russian Far East. In total, 604 adult I. persulcatus ticks collected from four sites in the Khabarovsk Territory (continental area) and one site in Sakhalin Island were examined for the presence of Rickettsia spp. by real-time PCR. Nested PCR with species-specific primers and sequencing were used for genotyping of revealed rickettsiae. The overall prevalence of Rickettsia spp. in ticks collected in different sites varied from 67.9 to 90.7%. However, the proportion of different Rickettsia species observed in ticks from Sakhalin Island significantly differed from that in ticks from the Khabarovsk Territory. In Sakhalin Island, R. helvetica prevailed in examined ticks, while Candidatus R. tarasevichiae was predominant in the Khabarovsk Territory. For gltA and ompB gene fragments, the sequences obtained for Candidatus R. tarasevichiae from all studied sites were identical to each other and to the known sequences of this species. According to sequence analysis of gltA, оmpB and sca4 genes, R. helvetica isolates from Sakhalin Island and the Khabarovsk Territory were identical to each other, but they differed from R. helvetica from other regions and from those found in other tick species. For the first time, DNA of pathogenic Rickettsia heilongjiangensis was detected in I. persulcatus ticks in two sites from the Khabarovsk Territory. The gltA, ompA and оmpB gene sequences of R. heilongjiangensis were identical to or had solitary mismatches with the corresponding sequences of R. heilongjiangensis found in other tick species. Copyright © 2016 Elsevier GmbH. All rights reserved.
Analysis of S-RNase alleles of almond (Prunus dulcis): characterization of new sequences, resolution of synonyms and evidence of intragenic recombination.

PubMed

Ortega, Encarnación; Bosković, Radovan I; Sargent, Daniel J; Tobutt, Kenneth R

2006-11-01

Cross-compatibility relationships in almond are controlled by a gametophytically expressed incompatibility system partly mediated by stylar RNases, of which 29 have been reported. To resolve possible synonyms and to provide data for phylogenetic analysis, 21 almond S-RNase alleles were cloned and sequenced from SP (signal peptide region) or C1 (first conserved region) to C5, except for the S29 allele, which could be cloned only from SP to C1. Nineteen sequences (S4, S6, S11-S22, S25-S29)) were potentially new whereas S10 and S24 had previously been published but with different labels. The sequences for S16 and S17 were identical to that for S1, published previously; likewise, S15 was identical to S5. In addition, S4 and S20 were identical, as were S13 and S19. A revised version of the standard table of almond incompatibility genotypes is presented. Several alleles had AT or GA tandem repeats in their introns. Sequences of the 23 distinct newly cloned or already published alleles were aligned. Sliding windows analysis of Ka/Ks identified regions where positive selection may operate; in contrast to the Maloideae, most of the region from the beginning of C3 to the beginning of RC4 appeared not to be under positive selection. Phylogenetic analysis indicated four pairs of alleles had "bootstrap" support > 80%: S5/S10, S4/S8, S11/S24, and S3/S6. Various motifs up to 19 residues long occurred in at least two alleles, and their distributions were consistent with intragenic recombination, as were separate phylogenetic analyses of the 5' and 3' sections. Sequence comparison of phylogenetically related alleles indicated the significance of the region between RC4 and C5 in defining specificity.
Implication of the cause of differences in 3D structures of proteins with high sequence identity based on analyses of amino acid sequences and 3D structures.

PubMed

Matsuoka, Masanari; Sugita, Masatake; Kikuchi, Takeshi

2014-09-18

Proteins that share a high sequence homology while exhibiting drastically different 3D structures are investigated in this study. Recently, artificial proteins related to the sequences of the GA and IgG binding GB domains of human serum albumin have been designed. These artificial proteins, referred to as GA and GB, share 98% amino acid sequence identity but exhibit different 3D structures, namely, a 3α bundle versus a 4β + α structure. Discriminating between their 3D structures based on their amino acid sequences is a very difficult problem. In the present work, in addition to using bioinformatics techniques, an analysis based on inter-residue average distance statistics is used to address this problem. It was hard to distinguish which structure a given sequence would take only with the results of ordinary analyses like BLAST and conservation analyses. However, in addition to these analyses, with the analysis based on the inter-residue average distance statistics and our sequence tendency analysis, we could infer which part would play an important role in its structural formation. The results suggest possible determinants of the different 3D structures for sequences with high sequence identity. The possibility of discriminating between the 3D structures based on the given sequences is also discussed.
Molecular identification of a new begomovirus infecting yellow passion fruit (Passiflora edulis) in Colombia.

PubMed

Vaca-Vaca, Juan Carlos; Carrasco-Lozano, Emerson Clovis; López-López, Karina

2017-02-01

The complete genome sequence of a bipartite begomovirus (genus Begomovirus, family Geminiviridae) infecting yellow passion fruit (Passiflora edulis) in the state of Valle del Cauca (Colombia) has been determined. The complete DNA-A and DNA-B components were determined to be 2600 and 2572 nt in length, respectively. The DNA-A showed the highest nucleotide sequence identity (87.2 %) to bean dwarf mosaic virus (M88179), a begomovirus found in common bean crops in Colombia, and only 77.4 % identity to passion fruit severe leaf distortion virus (FJ972767), a begomovirus identified infecting passion fruit in Brazil. Based on its sequence identity to all other begomoviruses known to date and in accordance with the ICTV species demarcation criterion for the genus Begomovirus (≥91 % sequence identity for the complete DNA-A), the name passion fruit leaf distortion virus is proposed for this new begomovirus. To our knowledge, this is the first report of a bipartite begomovirus affecting passion fruit in Colombia and the second report of a geminivirus affecting this crop worldwide.
Sequence identity and antigenic cross-reactivity of white face hornet venom allergen, also a hyaluronidase, with other proteins.

PubMed

Lu, G; Kochoumian, L; King, T P

1995-03-03

White face hornet (Dolichovespula maculata) venom has three known protein allergens which induce IgE response in susceptible people. They are antigen 5, phospholipase A1, and hyaluronidase, also known as Dol m 5, 1, and 2, respectively. We have cloned Dol m 2, a protein of 331 residues. When expressed in bacteria, a mixture of recombinant Dol m 2 and its fragments was obtained. The fragments were apparently generated by proteolysis of a Met-Met bond at residue 122, as they were not observed for a Dol m 2 mutant with a Leu-Met bond. Dol m 2 has 56% sequence identity with the honey bee venom allergen hyaluronidase and 27% identity with PH-20, a human sperm protein with hyaluronidase activity. A common feature of hornet venom allergens is their sequence identity with other proteins in our environment. We showed previously the sequence identity of Dol m 5 with a plant protein and a mammalian testis protein and of Dol m 1 with mammalian lipases. In BALB/c mice, Dol m 2 and bee hyaluronidase showed cross-reactivity at both antibody and T cell levels. These findings are relevant to some patients' multiple sensitivity to hornet and bee stings.
Optimization of identity operation in NMR spectroscopy via genetic algorithm: Application to the TEDOR experiment

NASA Astrophysics Data System (ADS)

Manu, V. S.; Veglia, Gianluigi

2016-12-01

Identity operation in the form of π pulses is widely used in NMR spectroscopy. For an isolated single spin system, a sequence of even number of π pulses performs an identity operation, leaving the spin state essentially unaltered. For multi-spin systems, trains of π pulses with appropriate phases and time delays modulate the spin Hamiltonian to perform operations such as decoupling and recoupling. However, experimental imperfections often jeopardize the outcome, leading to severe losses in sensitivity. Here, we demonstrate that a newly designed Genetic Algorithm (GA) is able to optimize a train of π pulses, resulting in a robust identity operation. As proof-of-concept, we optimized the recoupling sequence in the transferred-echo double-resonance (TEDOR) pulse sequence, a key experiment in biological magic angle spinning (MAS) solid-state NMR for measuring multiple carbon-nitrogen distances. The GA modified TEDOR (GMO-TEDOR) experiment with improved recoupling efficiency results in a net gain of sensitivity up to 28% as tested on a uniformly 13C, 15N labeled microcrystalline ubiquitin sample. The robust identity operation achieved via GA paves the way for the optimization of several other pulse sequences used for both solid- and liquid-state NMR used for decoupling, recoupling, and relaxation experiments.
Polypeptide having or assisting in carbohydrate material degrading activity and uses thereof

DOEpatents

Schooneveld-Bergmans, Margot Elisabeth Francoise; Heijne, Wilbert Herman Marie; Los, Alrik Pieter

2016-02-16

The invention relates to a polypeptide which comprises the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 76% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 76% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
Polypeptide having beta-glucosidase activity and uses thereof

DOE Office of Scientific and Technical Information (OSTI.GOV)

Schoonneveld-Bergmans, Margot Elisabeth Francoise; Heijne, Wilbert Herman Marie; De Jong, Rene Marcel

The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 96% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 96% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well asmore » the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.« less
Polypeptide having swollenin activity and uses thereof

DOEpatents

Schoonneveld-Bergmans, Margot Elizabeth Francoise; Heijne, Wilbert Herman Marie; Vlasie, Monica D; Damveld, Robbertus Antonius

2015-11-04

The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 73% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 73% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
Polypeptide having beta-glucosidase activity and uses thereof

DOEpatents

Schooneveld-Bergmans, Margot Elisabeth Francoise; Heijne, Wilbert Herman Marie; De Jong, Rene Marcel; Damveld, Robbertus Antonius

2015-09-01

The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 70% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 70% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
Polypeptide having cellobiohydrolase activity and uses thereof

DOEpatents

Sagt, Cornelis Maria Jacobus; Schooneveld-Bergmans, Margot Elisabeth Francoise; Roubos, Johannes Andries; Los, Alrik Pieter

2015-09-15

The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 93% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 93% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
Polypeptide having acetyl xylan esterase activity and uses thereof

DOEpatents

Schoonneveld-Bergmans, Margot Elisabeth Francoise; Heijne, Wilbert Herman Marie; Los, Alrik Pieter

2015-10-20

The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 82% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 82% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
Polypeptide having carbohydrate degrading activity and uses thereof

DOEpatents

Schooneveld-Bergmans, Margot Elisabeth Francoise; Heijne, Wilbert Herman Marie; Vlasie, Monica Diana; Damveld, Robbertus Antonius

2015-08-18

The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 73% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 73% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
Sequence Similarity Presenter: a tool for the graphic display of similarities of long sequences for use in presentations.

PubMed

Fröhlich, K U

1994-04-01

A new method for the presentation of alignments of long sequences is described. The degree of identity for the aligned sequences is averaged for sections of a fixed number of residues. The resulting values are converted to shades of gray, with white corresponding to lack of identity and black corresponding to perfect identity. A sequence alignment is represented as a bar filled with varying shades of gray. The display is compact and allows for a fast and intuitive recognition of the distribution of regions with a high similarity. It is well suited for the presentation of alignments of long sequences, e.g. of protein superfamilies, in plenary lectures. The method is implemented as a HyperCard stack for Apple Macintosh computers. Several options for the modification of the output are available (e.g. background reduction, size of the summation window, consideration of amino acid similarity, inclusion of graphic markers to indicate specific domains). The output is a PostScript file which can be printed, imported as EPS or processed further with Adobe Illustrator.

Isolation of prolactin and growth hormone from the pituitary of the holostean fish Amia calva.

PubMed

Dores, R M; Noso, T; Rand-Weaver, M; Kawauchi, H

1993-06-01

Pituitaries from adult male and female Amia calva (Order Holostei) were acid extracted and fractionated by gel filtration column chromatography and reversed-phase high performance liquid chromatography. This two-step isolation procedure yielded homogeneous pools of Amia prolaction (PRL) and growth hormone (GH). The amino acid composition of both purified polypeptides was determined. Primary sequence analysis of the first 22 positions at the N-terminal of Amia PRL revealed that this region has 63% sequence identity with eel PRL-1. The N-terminal region of Amia PRL lacks the disulfide bridge which is characteristic of tetrapod PRLs. Primary sequence analysis of the first 24 positions at the N-terminal of Amia GH revealed that this region has 62% sequence identity with eel GH and 54% sequence identity with both blue shark GH and sea turtle GH. Based on N-terminal analysis, it appears that Amia PRL and GH are more closely related to teleost PRLs and GHs than they are to tetrapod PRLs and GHs.
Complete mitochondrial genome of the larch hawk moth, Sphinx morio (Lepidoptera: Sphingidae).

PubMed

Kim, Min Jee; Choi, Sei-Woong; Kim, Iksoo

2013-12-01

The larch hawk moth, Sphinx morio, belongs to the lepidopteran family Sphingidae that has long been studied as a family of model insects in a diverse field. In this study, we describe the complete mitochondrial genome (mitogenome) sequences of the species in terms of general genomic features and characteristic short repetitive sequences found in the A + T-rich region. The 15,299-bp-long genome consisted of a typical set of genes (13 protein-coding genes, 2 rRNA genes, and 22 tRNA genes) and one major non-coding A + T-rich region, with the typical arrangement found in Lepidoptera. The 316-bp-long A + T-rich region located between srRNA and tRNA(Met) harbored the conserved sequence blocks that are typically found in lepidopteran insects. Additionally, the A + T-rich region of S. morio contained three characteristic repeat sequences that are rarely found in Lepidoptera: two identical 12-bp repeat, three identical 5-bp-long tandem repeat, and six nearly identical 5-6 bp long repeat sequences.
Capturing diversity of marine heterotrophic protists: one cell at a time

PubMed Central

Heywood, Jane L; Sieracki, Michael E; Bellows, Wendy; Poulton, Nicole J; Stepanauskas, Ramunas

2011-01-01

Recent applications of culture-independent, molecular methods have revealed unexpectedly high diversity in a variety of functional and phylogenetic groups of microorganisms in the ocean. However, none of the existing research tools are free from significant limitations, such as PCR and cloning biases, low phylogenetic resolution and others. Here, we employed novel, single-cell sequencing techniques to assess the composition of small (<10 μm diameter), heterotrophic protists from the Gulf of Maine. Single cells were isolated by flow cytometry, their genomes amplified, and 18S rRNA marker genes were amplified and sequenced. We compared the results to traditional environmental PCR cloning of sorted cells. The diversity of heterotrophic protists was significantly higher in the library of single amplified genomes (SAGs) than in environmental PCR clone libraries of the 18S rRNA gene, obtained from the same coastal sample. Libraries of SAGs, but not clones contained several recently discovered, uncultured groups, including picobiliphytes and novel marine stramenopiles. Clone, but not SAG, libraries contained several large clusters of identical and nearly identical sequences of Dinophyceae, Cercozoa and Stramenopiles. Similar results were obtained using two alternative primer sets, suggesting that PCR biases may not be the only explanation for the observed patterns. Instead, differences in the number of 18S rRNA gene copies among the various protist taxa probably had a significant role in determining the PCR clone composition. These results show that single-cell sequencing has the potential to more accurately assess protistan community composition than previously established methods. In addition, the creation of SAG libraries opens opportunities for the analysis of multiple genes or entire genomes of the uncultured protist groups. PMID:20962875
Parameters of proteome evolution from histograms of amino-acid sequence identities of paralogous proteins

PubMed Central

Axelsen, Jacob Bock; Yan, Koon-Kiu; Maslov, Sergei

2007-01-01

Background The evolution of the full repertoire of proteins encoded in a given genome is mostly driven by gene duplications, deletions, and sequence modifications of existing proteins. Indirect information about relative rates and other intrinsic parameters of these three basic processes is contained in the proteome-wide distribution of sequence identities of pairs of paralogous proteins. Results We introduce a simple mathematical framework based on a stochastic birth-and-death model that allows one to extract some of this information and apply it to the set of all pairs of paralogous proteins in H. pylori, E. coli, S. cerevisiae, C. elegans, D. melanogaster, and H. sapiens. It was found that the histogram of sequence identities p generated by an all-to-all alignment of all protein sequences encoded in a genome is well fitted with a power-law form ~ p-γ with the value of the exponent γ around 4 for the majority of organisms used in this study. This implies that the intra-protein variability of substitution rates is best described by the Gamma-distribution with the exponent α ≈ 0.33. Different features of the shape of such histograms allow us to quantify the ratio between the genome-wide average deletion/duplication rates and the amino-acid substitution rate. Conclusion We separately measure the short-term ("raw") duplication and deletion rates rdup∗, rdel∗ which include gene copies that will be removed soon after the duplication event and their dramatically reduced long-term counterparts rdup, rdel. High deletion rate among recently duplicated proteins is consistent with a scenario in which they didn't have enough time to significantly change their functional roles and thus are to a large degree disposable. Systematic trends of each of the four duplication/deletion rates with the total number of genes in the genome were analyzed. All but the deletion rate of recent duplicates rdel∗ were shown to systematically increase with Ngenes. Abnormally flat shapes of sequence identity histograms observed for yeast and human are consistent with lineages leading to these organisms undergoing one or more whole-genome duplications. This interpretation is corroborated by our analysis of the genome of Paramecium tetraurelia where the p-4 profile of the histogram is gradually restored by the successive removal of paralogs generated in its four known whole-genome duplication events. PMID:18039386
Analysis of 16S-23S intergenic spacer regions of the rRNA operons in Edwardsiella ictaluri and Edwardsiella tarda isolates from fish.

PubMed

Panangala, V S; van Santen, V L; Shoemaker, C A; Klesius, P H

2005-01-01

To analyse interspecies and intraspecies differences based on the 16S-23S rRNA intergenic spacer region (ISR) sequences of the fish pathogens Edwardsiella ictaluri and Edwardsiella tarda. The 16S-23S rRNA spacer regions of 19 Edw. ictaluri and four Edw. tarda isolates from four geographical regions were amplified by PCR with primers complementary to conserved sequences within the flanking 16S-23S rRNA coding sequences. Two products were generated from all isolates, without interspecies or intraspecific size polymorphisms. Sequence analysis of the amplified fragments revealed a smaller ISR of 350 bp, which contained a gene for tRNA(Glu), and a larger ISR of 441 bp, which contained genes for tRNA(Ile) and tRNA(Ala). The sequences of the smaller ISR of different Edw. ictaluri isolates were essentially identical to each other. Partial sequences of larger ISR from several Edw. ictaluri isolates also revealed no differences from the one complete Edw. ictaluri large ISR sequence obtained. The sequences of the smaller ISR of Edw. tarda were 97% identical to the Edw. ictaluri smaller ISR and the larger ISR were 96-98% identical to the Edw. ictaluri larger ISR sequence. The Edw. tarda isolates displayed limited ISR sequence heterogeneity, with > or =97% sequence identity among isolates for both small and large ISR. There is a high degree of size and sequence similarity of 16S-23S ISR both among isolates within Edw. ictaluri and Edw. tarda species and between the two species. Our results confirm a close genetic relationship between Edw. ictaluri and Edw. tarda and the relative homogeneity of Edw. ictaluri isolates compared with Edw. tarda isolates. Because no differences were found in ISR sequences among Edw. ictaluri isolates, sequence analysis of the ISR will not be useful to distinguish isolates of Edw. ictaluri. However, we identified restriction sites that differ between ISR sequences of Edw. ictaluri and Edw. tarda, which will be useful in distinguishing the two species.
Genetic analysis of Fasciola isolates from cattle in Korea based on second internal transcribed spacer (ITS-2) sequence of nuclear ribosomal DNA.

PubMed

Choe, Se-Eun; Nguyen, Thuy Thi-Dieu; Kang, Tae-Gyu; Kweon, Chang-Hee; Kang, Seung-Won

2011-09-01

Nuclear ribosomal DNA sequence of the second internal transcribed spacer (ITS-2) has been used efficiently to identify the liver fluke species collected from different hosts and various geographic regions. ITS-2 sequences of 19 Fasciola samples collected from Korean native cattle were determined and compared. Sequence comparison including ITS-2 sequences of isolates from this study and reference sequences from Fasciola hepatica and Fasciola gigantica and intermediate Fasciola in Genbank revealed seven identical variable sites of investigated isolates. Among 19 samples, 12 individuals had ITS-2 sequences completely identical to that of pure F. hepatica, five possessed the sequences identical to F. gigantica type, whereas two shared the sequence of both F. hepatica and F. gigantica. No variations in length and nucleotide composition of ITS-2 sequence were observed within isolates that belonged to F. hepatica or F. gigantica. At the position of 218, five Fasciola containing a single-base substitution (C>T) formed a distinct branch inside the F. gigantica-type group which was similar to those of Asian-origin isolates. The phylogenetic tree of the Fasciola spp. based on complete ITS-2 sequences from this study and other representative isolates in different locations clearly showed that pure F. hepatica, F. gigantica type and intermediate Fasciola were observed. The result also provided additional genetic evidence for the existence of three forms of Fasciola isolated from native cattle in Korea by genetic approach using ITS-2 sequence.
First description of Grapevine leafroll-associated virus 5 in Argentina and partial genome sequence.

PubMed

Gómez Talquenca, Sebastián; Muñoz, Claudio; Grau, Oscar; Gracia, Olga

2009-02-01

An accession of Vitis vinifera cv. Red Globe from Argentina, was found to be infected with Grapevine leafroll-associated virus-5 by ELISA. It was partially sequenced, and three ORFs, corresponding to HSP70h, HSP90h, and CP, were found. This isolate shares a high aminoacid identity with the previously reported sequence of the virus, and identities between 80% and 90% with previously reported GLRaV-9 and GLRaV-4 isolates. The analysis of the sequence supports the clustering together with GLRaV-4 and GLRV-9 inside the Ampelovirus genus.
Structures of two Arabidopsis thaliana major latex proteins represent novel helix-grip folds

DOE Office of Scientific and Technical Information (OSTI.GOV)

Lytle, Betsy L.; Song, Jikui; de la Cruz, Norberto B.

2009-06-02

Here we report the first structures of two major latex proteins (MLPs) which display unique structural differences from the canonical Bet v 1 fold described earlier. MLP28 (SwissProt/TrEMBL ID Q9SSK9), the product of gene At1g70830.1, and the At1g24000.1 gene product (Swiss- Prot/TrEMBL ID P0C0B0), proteins which share 32% sequence identity, were independently selected as foldspace targets by the Center for Eukaryotic Structural Genomics. The structure of a single domain (residues 17-173) of MLP28 was solved by NMR spectroscopy, while the full-length At1g24000.1 structure was determined by X-ray crystallography. MLP28 displays greater than 30% sequence identity to at least eight MLPsmore » from other species. For example, the MLP28 sequence shares 64% identity to peach Pp-MLP119 and 55% identity to cucumber Csf2.20 In contrast, the At1g24000.1 sequence is highly divergent (see Fig. 1), containing a gap of 33 amino acids when compared with all other known MLPs. Even when the gap is excluded, the sequence identity with MLPs from other species is less than 30%. Unlike some of the MLPs from other species, none of the A. thaliana MLPs have been characterized biochemically. We show by NMR chemical shift mapping that At1g24000.1 binds progesterone, demonstrating that despite its sequence dissimilarity, the hydrophobic binding pocket is conserved and, therefore, may play a role in its biological function and that of the MLP family in general.« less
First isolation of Rickettsia monacensis from a patient in South Korea.

PubMed

Kim, Yeon-Sook; Choi, Yeon-Joo; Lee, Kyung-Min; Ahn, Kyu-Joong; Kim, Heung-Chul; Klein, Terry; Jiang, Ju; Richards, Allen; Park, Kyung-Hee; Jang, Won-Jong

2017-07-01

A Rickettsia sp. was isolated from the blood of a patient with an acute febrile illness using the shell vial technique; the isolate was named CN45Kr and was identified by molecular assay as Rickettsia monacensis, which was first recognized as a pathogen in Spain. Sequencing analysis showed that the gltA sequence of the isolate was identical to that of Rickettsia sp. IRS3. The ompA-5mp fragment sequence showed 100% identity to those of R. monacensis and Rickettsia sp. In56 and ompA-3pA In56 and 100% identity to that of Rickettsia sp. IRS3. The ompB sequence was found to have 99.9% similarity to that of R. monacensis IrR/Munich. This study confirms the pathogenicity of this agent and provides additional information about its geographic distribution. © 2017 The Societies and John Wiley & Sons Australia, Ltd.
Chromobacterium spp. harbour Ambler class A β-lactamases showing high identity with KPC.

PubMed

Gudeta, Dereje Dadi; Bortolaia, Valeria; Jayol, Aurélie; Poirel, Laurent; Nordmann, Patrice; Guardabassi, Luca

2016-06-01

The origin of KPC is unknown. The aim of this study was to detect progenitors of KPC in silico and to functionally verify their β-lactam hydrolysis activity. The sequence of KPC-2 was used to mine the NCBI protein sequence database. The best non-KPC hits were analysed by amino acid (aa) alignment and phylogenetic tree construction. Genes encoding KPC-2 homologues were expressed in Escherichia coli. The carbapenemase activities of the recombinant strains were characterized by the CarbaNP test and UV spectrophotometry and MICs of selected β-lactams were determined. Genes encoding the closest KPC-2 homologues were identified on the chromosome of Chromobacterium piscinae strain ND17 (CRP-1, 76% aa identity), Chromobacterium sp. C-61 (CRS-1, 70% aa identity) and Chromobacterium haemolyticum DSM19808 (CRH-1, 69% aa identity). All three Chromobacterium β-lactamases were phylogenetically more related to KPC than to other Ambler class A β-lactamases. The 27 bp region preceding the start codon of blaCRP-1 displayed high nucleotide identity to the corresponding region upstream from blaKPC (74%). Heterologous expression of blaCRP-1 and to a lesser extent of blaCRH-1 in E. coli significantly increased the MICs of meropenem and most cephalosporins. The CarbaNP test was positive for both recombinant strains, but spectrophotometric analysis confirmed higher carbapenemase activity for CRP-1-producing clones. The recovery of three class A β-lactamases with up to 76% aa identity to KPC from distinct Chromobacterium species is highly indicative of the role played by this genus in the evolution of KPC. © The Author 2016. Published by Oxford University Press on behalf of the British Society for Antimicrobial Chemotherapy. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
A new begomovirus associated with alpha- and betasatellite molecules isolated from Vernonia cinerea in China.

PubMed

Zulfiqar, Awais; Zhang, Jie; Cui, Xiaofeng; Qian, Yajuan; Zhou, Xueping; Xie, Yan

2012-01-01

A begomovirus disease complex associated with Vernonia cinerea showing yellow vein symptoms was studied. The full-length genomic DNA was comprised of 2739 nucleotides (nt) and contained the typical genome structure of begomoviruses. Comparison analysis showed that it shared the highest (78.9%) nucleotide sequence identity with recently characterized Vernonia yellow vein virus (VeYVV) from India. For associated satellites, betasatellite showed the highest nucleotide sequence identity (52.1%) with Vernonia yellow vein virus betasatellite (VeYVVB) and alphasatellite shared the highest sequence identity (70.7%) with Gossypium mustelinium symptomless alphasatellite (GMusSLA). It is a member of a distinct species with cognate alpha- and betasatellites for which the name Vernonia yellow vein Fujian virus (VeYVFjV) is proposed.
NGSCheckMate: software for validating sample identity in next-generation sequencing studies within and across data types.

PubMed

Lee, Sejoon; Lee, Soohyun; Ouellette, Scott; Park, Woong-Yang; Lee, Eunjung A; Park, Peter J

2017-06-20

In many next-generation sequencing (NGS) studies, multiple samples or data types are profiled for each individual. An important quality control (QC) step in these studies is to ensure that datasets from the same subject are properly paired. Given the heterogeneity of data types, file types and sequencing depths in a multi-dimensional study, a robust program that provides a standardized metric for genotype comparisons would be useful. Here, we describe NGSCheckMate, a user-friendly software package for verifying sample identities from FASTQ, BAM or VCF files. This tool uses a model-based method to compare allele read fractions at known single-nucleotide polymorphisms, considering depth-dependent behavior of similarity metrics for identical and unrelated samples. Our evaluation shows that NGSCheckMate is effective for a variety of data types, including exome sequencing, whole-genome sequencing, RNA-seq, ChIP-seq, targeted sequencing and single-cell whole-genome sequencing, with a minimal requirement for sequencing depth (>0.5X). An alignment-free module can be run directly on FASTQ files for a quick initial check. We recommend using this software as a QC step in NGS studies. https://github.com/parklab/NGSCheckMate. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Individual Differences in Face Identity Processing with Fast Periodic Visual Stimulation.

PubMed

Xu, Buyun; Liu-Shuang, Joan; Rossion, Bruno; Tanaka, James

2017-08-01

A growing body of literature suggests that human individuals differ in their ability to process face identity. These findings mainly stem from explicit behavioral tasks, such as the Cambridge Face Memory Test (CFMT). However, it remains an open question whether such individual differences can be found in the absence of an explicit face identity task and when faces have to be individualized at a single glance. In the current study, we tested 49 participants with a recently developed fast periodic visual stimulation (FPVS) paradigm [Liu-Shuang, J., Norcia, A. M., & Rossion, B. An objective index of individual face discrimination in the right occipitotemporal cortex by means of fast periodic oddball stimulation. Neuropsychologia, 52, 57-72, 2014] in EEG to rapidly, objectively, and implicitly quantify face identity processing. In the FPVS paradigm, one face identity (A) was presented at the frequency of 6 Hz, allowing only one gaze fixation, with different face identities (B, C, D) presented every fifth face (1.2 Hz; i.e., AAAABAAAACAAAAD…). Results showed a face individuation response at 1.2 Hz and its harmonics, peaking over occipitotemporal locations. The magnitude of this response showed high reliability across different recording sequences and was significant in all but two participants, with the magnitude and lateralization differing widely across participants. There was a modest but significant correlation between the individuation response amplitude and the performance of the behavioral CFMT task, despite the fact that CFMT and FPVS measured different aspects of face identity processing. Taken together, the current study highlights the FPVS approach as a promising means for studying individual differences in face identity processing.
The tapeworm Atractolytocestus tenuicollis (Cestoda: Caryophyllidea)--a sister species or ancestor of an invasive A. huronensis?

PubMed

Králová-Hromadová, Ivica; Štefka, Jan; Bazsalovicsová, Eva; Bokorová, Silvia; Oros, Mikuláš

2013-10-01

Atractolytocestus tenuicollis (Li, 1964) Xi, Wang, Wu, Gao et Nie, 2009 is a monozoic, non-segmented tapeworm of the order Caryophyllidea, parasitizing exclusively common carp (Cyprinus carpio L.). In the current work, the first molecular data, in particular complete ribosomal internal transcribed spacer 2 (ITS2) and partial mitochondrial cytochrome c oxidase subunit I (cox1) on A. tenuicollis from Niushan Lake, Wuhan, China, are provided. In order to evaluate molecular interrelationships within Atractolytocestus, the data on A. tenuicollis were compared with relevant data on two other congeners, Atractolytocestus huronensis and Atractolytocestus sagittatus. Divergent intragenomic copies (ITS2 paralogues) were detected in the ITS2 ribosomal spacer of A. tenuicollis; the same phenomenon has previously been observed also in two other congeners. ITS2 structure of A. tenuicollis was very similar to that of A. huronensis from Slovakia, USA and UK; overall pairwise sequence identity was 91.7-95.2%. On the other hand, values of sequence identity between A. tenuicollis and A. sagittatus were lower, 69.7-70.9%. Cox1 sequence, analysed in five A. tenuicollis individuals, were 100 % identical and no intraspecific variation was observed. Comparison of A. tenuicollis cox1 with respective sequences of two other Atractolytocestus species showed that the mitochondrial haplotype found in Chinese A. tenuicollis is structurally specific (haplotype 4; Ha4) and differs from all so far determined Atractolytocestus haplotypes (Ha1 and Ha2 for A. huronensis; Ha3 for A. sagittatus). Pairwise sequence identity between A. tenuicollis cox1 haplotype and remaining three haplotypes followed the same pattern as in ITS2. The nucleotide and amino acide (aa) sequence comparison with A. huronensis Ha1 and Ha2 revealed higher sequence identity, 90.3-90.8% (96.9% in aa), while lower values were achieved between A. tenuicollis haplotype and Ha3 of Japanese A. sagittatus-75.2 % (81.9 % in aa). The phylogenetic analyses using cox1, ITS2 and combined cox1 + ITS2 sequences revealed close genetic interrelationship between A. tenuicollis and A. huronensis. Independently of a type of analysis and DNA region used, the topology of obtained trees was always identical; A. tenuicollis formed separate clade with A. huronensis forming a closely related sister group.
Prevalence and genetic characterization of eimeriid coccidia from feces of black-necked cranes, Grus nigricollis.

PubMed

Liang, Yu; Zhao, ZiJiao; Hu, JunJie; Esch, Gerald W; Peng, MingChun; Liu, Qiong; Chen, JinQing

2018-03-01

Disseminated visceral coccidiosis (DVC) is a widely distributed intestinal and extraintestinal disease of cranes caused by eimeriid coccidia and has lethal pathogenicity to several crane species. Here, feces of 164 black-necked cranes collected in Dashanbao Black-necked Crane National Nature Reserve, China, were examined to determine the prevalence of coccidial oocysts. Of the 164 fecal samples, 76 (46.3%) were positive for oocysts of Eimeria, including E. gruis in 59 (35.9%), E. reichenowi in 52 (31.7%), and E. bosquei in 47 (28.7%) by microscopic observation. Sixty-eight (89.5%) of these positive samples included two or more morphologically identifiable species of Eimeria. The nearly full length 18S rRNA gene (18S rRNA; about 1.8 kb) and partial mitochondrial cytochrome c oxidase I gene (COX1; about 1.3 kb) from oocysts of each morphologically distinct species of Eimeria were amplified, sequenced, and analyzed. BLAST searches using these new 18S rRNA sequences for E. gruis, E. reichenowi, or E. bosquei showed the most similar sequences were those of E. gruis (98.7-99.7% identity), E. reichenowi (97.9-100% identity), or E. gruis (98.6-99.6% identity) isolated from different species of Grus. BLAST searches using the new COX1 sequences for the three species of Eimeria showed that no nucleotide sequences of Eimeria and Isospora coccidia in GenBank have more than 83.0% identity with these species. Identities among the new COX1 sequences were 91.8% for E. gruis and E. reichenowi, 94.5% for E. gruis and E. bosquei, and 91.3% for E. reichenowi and E. bosquei. Phylogenetic analysis based on 18S rRNA or COX1 sequences indicated that Eimeria spp. in black-necked cranes were clustered together with other previously identified Eimeria species from different cranes.
Molecular detection and characterization of Hepatozoon spp. in dogs from the central part of Turkey.

PubMed

Aydin, Mehmet Fatih; Sevinc, Ferda; Sevinc, Mutlu

2015-04-01

Canine hepatozoonosis is a tick-borne protozoal disease caused by Hepatozoon spp. Two species of Hepatozoon are currently known to infect dogs as Hepatozoon canis and H. americanum. Although H. canis generally causes a chronic infection with relatively mild clinical alterations compared to H. americanum, infection by H. canis can be life-threatening. The disease is widespread in USA, Africa, Europe, South America, and Asia. To determine the frequency of infection with Hepatozoon spp. in stray dogs from Central Anatolia Region of Turkey, a total of 221 blood samples collected over a three-year period were evaluated by using genus specific Polymerase Chain Reaction (PCR) designed to amplify a fragment of 666bp located in 18 S rRNA gene of Hepatozoon spp. Eight (3.61%) blood samples were positive for Hepatozoon spp. For the classification of species, all positive PCR products were purified with a PCR purification kit and sequenced. Sequencing results of eight representative amplicons indicated that 6 were 98-99% identical to the sequence of H. canis and the other 2 sequences were 95-97% identical to the sequence of Hepatozoon spp. So it was named Hepatozoon sp. MF. A phylogenetic tree was constructed from the sequences of the tick-borne agents identified previously and in this study using the neighbor-joining method. The nucleotide sequences were compared to the H. canis sequences reported in Turkey using the nucleotide Basic Local Alignment Search Tool (BLAST) program. The results of this study are significant in terms of the presence of a novel canine Hepatozoon genotype. Copyright © 2015 Elsevier GmbH. All rights reserved.
Prospecting for viral natural enemies of the fire ant Solenopsis invicta in Argentina.

PubMed

Valles, Steven M; Porter, Sanford D; Calcaterra, Luis A

2018-01-01

Metagenomics and next generation sequencing were employed to discover new virus natural enemies of the fire ant, Solenopsis invicta Buren in its native range (i.e., Formosa, Argentina) with the ultimate goal of testing and releasing new viral pathogens into U.S. S. invicta populations to provide natural, sustainable control of this ant. RNA was purified from worker ants from 182 S. invicta colonies, which was pooled into 4 groups according to location. A library was created from each group and sequenced using Illumina Miseq technology. After a series of winnowing methods to remove S. invicta genes, known S. invicta virus genes, and all other non-virus gene sequences, 61,944 unique singletons were identified with virus identity. These were assembled de novo yielding 171 contiguous sequences with significant identity to non-plant virus genes. Fifteen contiguous sequences exhibited very high expression rates and were detected in all four gene libraries. One contig (Contig_29) exhibited the highest expression level overall and across all four gene libraries. Random amplification of cDNA ends analyses expanded this contiguous sequence yielding a complete virus genome, which we have provisionally named Solenopsis invicta virus 5 (SINV-5). SINV-5 is a positive-sense, single-stranded RNA virus with genome characteristics consistent with insect-infecting viruses from the family Dicistroviridae. Moreover, the replicative genome strand of SINV-5 was detected in worker ants indicating that S. invicta serves as host for the virus. Many additional sequences were identified that are likely of viral origin. These sequences await further investigation to determine their origins and relationship with S. invicta. This study expands knowledge of the RNA virome diversity found within S. invicta populations.
Prospecting for viral natural enemies of the fire ant Solenopsis invicta in Argentina

PubMed Central

Porter, Sanford D.; Calcaterra, Luis A.

2018-01-01

Metagenomics and next generation sequencing were employed to discover new virus natural enemies of the fire ant, Solenopsis invicta Buren in its native range (i.e., Formosa, Argentina) with the ultimate goal of testing and releasing new viral pathogens into U.S. S. invicta populations to provide natural, sustainable control of this ant. RNA was purified from worker ants from 182 S. invicta colonies, which was pooled into 4 groups according to location. A library was created from each group and sequenced using Illumina Miseq technology. After a series of winnowing methods to remove S. invicta genes, known S. invicta virus genes, and all other non-virus gene sequences, 61,944 unique singletons were identified with virus identity. These were assembled de novo yielding 171 contiguous sequences with significant identity to non-plant virus genes. Fifteen contiguous sequences exhibited very high expression rates and were detected in all four gene libraries. One contig (Contig_29) exhibited the highest expression level overall and across all four gene libraries. Random amplification of cDNA ends analyses expanded this contiguous sequence yielding a complete virus genome, which we have provisionally named Solenopsis invicta virus 5 (SINV-5). SINV-5 is a positive-sense, single-stranded RNA virus with genome characteristics consistent with insect-infecting viruses from the family Dicistroviridae. Moreover, the replicative genome strand of SINV-5 was detected in worker ants indicating that S. invicta serves as host for the virus. Many additional sequences were identified that are likely of viral origin. These sequences await further investigation to determine their origins and relationship with S. invicta. This study expands knowledge of the RNA virome diversity found within S. invicta populations. PMID:29466388
Prevalence of Ehrlichia canis, Anaplasma platys, Babesia canis vogeli, Hepatozoon canis, Bartonella vinsonii berkhoffii, and Rickettsia spp. in dogs from Grenada.

PubMed

Yabsley, Michael J; McKibben, John; Macpherson, Calum N; Cattan, Peggy F; Cherry, Natalie A; Hegarty, Barbara C; Breitschwerdt, Edward B; O'Connor, Tom; Chandrashekar, Ramaswamy; Paterson, Tara; Perea, Marta Lanza; Ball, Geoffrey; Friesen, Stanley; Goedde, Jill; Henderson, Brooke; Sylvester, Wayne

2008-02-14

To identify the tick-borne pathogens in dogs from Grenada, we conducted a serologic survey for Ehrlichia canis in 2004 (104 dogs) and a comprehensive serologic and molecular survey for a variety of tick-borne pathogens in 2006 (73 dogs). In 2004 and 2006, 44 and 32 dogs (42.3% and 43.8%) were seropositive for E. canis, respectively. In 2006, several tick-borne pathogens were identified by serology and PCR. DNA of E. canis, Anaplasma platys, Babesia canis vogeli, Hepatozoon canis, and Bartonella sp. were identified in 18 (24.7%), 14 (19.2%), 5 (7%), 5 (7%), and 1 (1.4%) dogs, respectively. Six (8.2%) dogs were seropositive for Bartonella vinsonii subsp. berkhoffii. All dogs were seronegative and PCR-negative for Rickettsia spp. Coinfection with two or three pathogens was observed in eight dogs. Partial 16S rRNA E. canis and A. platys sequences were identical to sequences in GenBank. Partial 18S rRNA gene sequences from the Grenadian H. canis were identical to each other and had one possible mismatch (ambiguous base) from H. canis detected from Spain and Brazil. Grenadian B. c. vogeli sequences were identical to B. c. vogeli from Brazil and Japan. All of the detected pathogens are transmitted, or suspected to be transmitted, by Rhipicephalus sanguineus. Results of this study indicate that dogs from Grenada are infected with multiple tick-borne pathogens; therefore, tick-borne diseases should be included as differentials for dogs exhibiting thrombocytopenia, leukopenia, fever, or lethargy. One pathogen, E. canis, is also of potential public health significance.
An Alternative Approach to "Identification of Unknowns": Designing a Protocol to Verify the Identities of Nitrogen Fixing Bacteria.

PubMed

Martinez-Vaz, Betsy M; Denny, Roxanne; Young, Nevin D; Sadowsky, Michael J

2015-12-01

Microbiology courses often include a laboratory activity on the identification of unknown microbes. This activity consists of providing students with microbial cultures and running biochemical assays to identify the organisms. This approach lacks molecular techniques such as sequencing of genes encoding 16S rRNA, which is currently the method of choice for identification of unknown bacteria. A laboratory activity was developed to teach students how to identify microorganisms using 16S rRNA polymerase chain reaction (PCR) and validate microbial identities using biochemical techniques. We hypothesized that designing an experimental protocol to confirm the identity of a bacterium would improve students' knowledge of microbial identification techniques and the physiological characteristics of bacterial species. Nitrogen-fixing bacteria were isolated from the root nodules of Medicago truncatula and prepared for 16S rRNA PCR analysis. Once DNA sequencing revealed the identity of the organisms, the students designed experimental protocols to verify the identity of rhizobia. An assessment was conducted by analyzing pre- and posttest scores and by grading students' verification protocols and presentations. Posttest scores were higher than pretest scores at or below p = 0.001. Normalized learning gains (G) showed an improvement of students' knowledge of microbial identification methods (LO4, G = 0.46), biochemical properties of nitrogen-fixing bacteria (LO3, G = 0.45), and the events leading to the establishment of nitrogen-fixing symbioses (LO1&2, G = 0.51, G = 0.37). An evaluation of verification protocols also showed significant improvement with a p value of less than 0.001.

The complete genome of klassevirus – a novel picornavirus in pediatric stool

PubMed Central

Greninger, Alexander L; Runckel, Charles; Chiu, Charles Y; Haggerty, Thomas; Parsonnet, Julie; Ganem, Donald; DeRisi, Joseph L

2009-01-01

Background Diarrhea kills 2 million children worldwide each year, yet an etiological agent is not found in approximately 30–50% of cases. Picornaviral genera such as enterovirus, kobuvirus, cosavirus, parechovirus, hepatovirus, teschovirus, and cardiovirus have all been found in human and animal diarrhea. Modern technologies, especially deep sequencing, allow rapid, high-throughput screening of clinical samples such as stool for new infectious agents associated with human disease. Results A pool of 141 pediatric gastroenteritis samples that were previously found to be negative for known diarrheal viruses was subjected to pyrosequencing. From a total of 937,935 sequence reads, a collection of 849 reads distantly related to Aichi virus were assembled and found to comprise 75% of a novel picornavirus genome. The complete genome was subsequently cloned and found to share 52.3% nucleotide pairwise identity and 38.9% amino acid identity to Aichi virus. The low level of sequence identity suggests a novel picornavirus genus which we have designated klassevirus. Blinded screening of 751 stool specimens from both symptomatic and asymptomatic individuals revealed a second positive case of klassevirus infection, which was subsequently found to be from the index case's 11-month old twin. Conclusion We report the discovery of human klassevirus 1, a member of a novel picornavirus genus, in stool from two infants from Northern California. Further characterization and epidemiological studies will be required to establish whether klasseviruses are significant causes of human infection. PMID:19538752
The Pinus taeda genome is characterized by diverse and highly diverged repetitive sequences

PubMed Central

2010-01-01

Background In today's age of genomic discovery, no attempt has been made to comprehensively sequence a gymnosperm genome. The largest genus in the coniferous family Pinaceae is Pinus, whose 110-120 species have extremely large genomes (c. 20-40 Gb, 2N = 24). The size and complexity of these genomes have prompted much speculation as to the feasibility of completing a conifer genome sequence. Conifer genomes are reputed to be highly repetitive, but there is little information available on the nature and identity of repetitive units in gymnosperms. The pines have extensive genetic resources, with approximately 329000 ESTs from eleven species and genetic maps in eight species, including a dense genetic map of the twelve linkage groups in Pinus taeda. Results We present here the Sanger sequence and annotation of ten P. taeda BAC clones and Genome Analyzer II whole genome shotgun (WGS) sequences representing 7.5% of the genome. Computational annotation of ten BACs predicts three putative protein-coding genes and at least fifteen likely pseudogenes in nearly one megabase of sequence. We found three conifer-specific LTR retroelements in the BACs, and tentatively identified at least 15 others based on evidence from the distantly related angiosperms. Alignment of WGS sequences to the BACs indicates that 80% of BAC sequences have similar copies (≥ 75% nucleotide identity) elsewhere in the genome, but only 23% have identical copies (99% identity). The three most common repetitive elements in the genome were identified and, when combined, represent less than 5% of the genome. Conclusions This study indicates that the majority of repeats in the P. taeda genome are 'novel' and will therefore require additional BAC or genomic sequencing for accurate characterization. The pine genome contains a very large number of diverged and probably defunct repetitive elements. This study also provides new evidence that sequencing a pine genome using a WGS approach is a feasible goal. PMID:20609256
Nucleotide sequence of a chickpea chlorotic stunt virus relative that infects pea and faba bean in China.

PubMed

Zhou, Cui-Ji; Xiang, Hai-Ying; Zhuo, Tao; Li, Da-Wei; Yu, Jia-Lin; Han, Cheng-Gui

2012-07-01

We determined the genome sequence of a new polerovirus that infects field pea and faba bean in China. Its entire nucleotide sequence (6021 nt) was most closely related (83.3% identity) to that of an Ethiopian isolate of chickpea chlorotic stunt virus (CpCSV-Eth). With the exception of the coat protein (encoded by ORF3), amino acid sequence identities of all gene products of this virus to those of CpCSV-Eth and other poleroviruses were <90%. This suggests that it is a new member of the genus Polerovirus, and the name pea mild chlorosis virus is proposed.
PSS-3D1D: an improved 3D1D profile method of protein fold recognition for the annotation of twilight zone sequences.

PubMed

Ganesan, K; Parthasarathy, S

2011-12-01

Annotation of any newly determined protein sequence depends on the pairwise sequence identity with known sequences. However, for the twilight zone sequences which have only 15-25% identity, the pair-wise comparison methods are inadequate and the annotation becomes a challenging task. Such sequences can be annotated by using methods that recognize their fold. Bowie et al. described a 3D1D profile method in which the amino acid sequences that fold into a known 3D structure are identified by their compatibility to that known 3D structure. We have improved the above method by using the predicted secondary structure information and employ it for fold recognition from the twilight zone sequences. In our Protein Secondary Structure 3D1D (PSS-3D1D) method, a score (w) for the predicted secondary structure of the query sequence is included in finding the compatibility of the query sequence to the known fold 3D structures. In the benchmarks, the PSS-3D1D method shows a maximum of 21% improvement in predicting correctly the α + β class of folds from the sequences with twilight zone level of identity, when compared with the 3D1D profile method. Hence, the PSS-3D1D method could offer more clues than the 3D1D method for the annotation of twilight zone sequences. The web based PSS-3D1D method is freely available in the PredictFold server at http://bioinfo.bdu.ac.in/servers/ .
Carbohydrate degrading polypeptide and uses thereof

DOEpatents

Sagt, Cornelis Maria Jacobus; Schooneveld-Bergmans, Margot Elisabeth Francoise; Roubos, Johannes Andries; Los, Alrik Pieter

2015-10-20

The invention relates to a polypeptide having carbohydrate material degrading activity which comprises the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1 or SEQ ID NO: 4, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 96% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 96% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional protein and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
Analysis of sequence repeats of proteins in the PDB.

PubMed

Mary Rajathei, David; Selvaraj, Samuel

2013-12-01

Internal repeats in protein sequences play a significant role in the evolution of protein structure and function. Applications of different bioinformatics tools help in the identification and characterization of these repeats. In the present study, we analyzed sequence repeats in a non-redundant set of proteins available in the Protein Data Bank (PDB). We used RADAR for detecting internal repeats in a protein, PDBeFOLD for assessing structural similarity, PDBsum for finding functional involvement and Pfam for domain assignment of the repeats in a protein. Through the analysis of sequence repeats, we found that identity of the sequence repeats falls in the range of 20-40% and, the superimposed structures of the most of the sequence repeats maintain similar overall folding. Analysis sequence repeats at the functional level reveals that most of the sequence repeats are involved in the function of the protein through functionally involved residues in the repeat regions. We also found that sequence repeats in single and two domain proteins often contained conserved sequence motifs for the function of the domain. Copyright © 2013 Elsevier Ltd. All rights reserved.
Identification of four novel HLA-B alleles, B*1590, B*1591, B*2726, and B*4705, from an East African population by high-resolution sequence-based typing.

PubMed

Luo, M; Mao, X; Plummer, F A

2005-02-01

We report here four novel HLA-B alleles, B*1590, B*1591, B*2726, and B*4705, identified from an East African population during sequence-based HLA-B typing. The novel alleles were confirmed by sequencing two separate polymerase chain reaction products, and by molecular cloning and sequencing multiple clones. B*1590 is identical to B*1510 at exon 2 and exon 3, except for a difference (GCCGTC) at codon 158. Sequence differences at codon 152 (GAGGTG) and codon 167 (TGGTCG) differentiate B*1591 from B*1503 at exon 3. B*2726 is identical to B*2708 at exon 2 and exon 3, except for a difference (AAGCAG) at codon 70. B*4705 was identified in three Kenyan women. The allele is identical to B*47010101/02 at exon 2 and exon 3, except for differences at codon 97 (AGGAAT) and codon 99 (TTTTAT). These new alleles have been named by the WHO Nomenclature Committee. Identification of these novel HLA-B alleles reflects the genetic diversity of this East African population.
Differential expression of the virulence-associated protein p57 and characterization of its duplicated gene rosa in virulent and attenuated strains of Renibacterium salmoninarum

USGS Publications Warehouse

O'Farrell, C. L.; Strom, M.S.

1999-01-01

Virulence mechanisms utilized by the salmonid fish pathogen Renibacterium salmoninarum are poorly understood. One potential virulence factor is p57 (also designated MSA for major soluble antigen), an abundant 57 kDa soluble protein that is predominately localized on the bacterial cell surface with significant levels released into the extracellular milieu. Previous studies of an attenuated strain, MT 239, indicated that it differs from virulent strains in the amount of surface-associated p57. In this report, we show overall expression of p57 in R. salmoninarum MT 239 is considerably reduced as compared to a virulent strain, ATCC 33209. The amount of cell-associated p57 is decreased while the level of p57 in the culture supernatant is nearly equivalent between the strains. To determine if lowered amount of cell-associated p57 was due to a sequence defect in p57, a genetic comparison was performed. Two copies of the gene encoding p57 (msa1 and msa2) were found in 33209 and MT 239, as well as in several other virulent isolates. Both copies from 33209 and MT 239 were cloned and sequenced and found to be identical to each other, and identical between the 2 strains. A comparison of msa1 and msa2 within each strain showed that their sequences diverge 40 base pairs 5, to the open reading frame, while sequences 3' to the open reading frame are essentially identical for at least 225 base pairs. Northern blot analysis showed no difference in steady state levels of rosa mRNA between the 2 strains. These data suggest that while cell-surface localization of p57 may be important for R. salmoninarum virulence, the differences in localization, and total p57 expression between 33209 anti MT 239 are not due to differences in rosa sequence or differences in steady state transcript levels.
Genetic diversity and distribution of a distinct strain of Chili leaf curl virus and associated betasatellite infecting tomato and pepper in Oman.

PubMed

Khan, Akhtar J; Akhtar, Sohail; Al-Zaidi, Amal M; Singh, Achuit K; Briddon, Rob W

2013-10-01

Tomato and pepper are widely grown in Oman for local consumption. A countrywide survey was conducted during 2010-2011 to collect samples and assess the diversity of begomoviruses associated with leaf curl disease of tomato and pepper. A virus previously only identified on the Indian subcontinent, chili leaf curl virus (ChLCV), was found associated with tomato and pepper diseases in all vegetable grown areas of Oman. Some of the infected plant samples were also found to contain a betasatellite. A total of 19 potentially full-length begomovirus and eight betasatellite clones were sequenced. The begomovirus clones showed >96% nucleotide sequence identity, showing them to represent a single species. Comparisons to sequences available in the databases showed the highest levels of nucleotide sequence identity (88.0-91.1%) to isolates of the "Pakistan" strain of ChLCV (ChLCV-PK), indicating the virus from Oman to be a distinct strain, for which the name Oman strain (ChLCV-OM) is proposed. An analysis for recombination showed ChLCV-OM likely to have originated by recombination between ChLCV-PK (the major parent), pepper leaf curl Lahore virus and a third strain of ChLCV. The betasatellite sequences obtained were shown to have high levels of identity to isolates of tomato leaf curl betasatellite (ToLCB) previous shown to be present in Oman. For the disease in tomato Koch's postulates were satisfied by Agrobacterium-mediated inoculation of virus and betasatellites clones. This showed the symptoms induced by the virus in the presence of the betasatellite to be enhanced, although viral DNA levels were not affected. ChLCV-OM is the fourth begomovirus identified in tomato in Oman and the first in Capsicum. The significance of these findings is discussed. Copyright © 2013 Elsevier B.V. All rights reserved.
Phylogenetic characterization of Canine Parvovirus VP2 partial sequences from symptomatic dogs samples.

PubMed

Zienius, D; Lelešius, R; Kavaliauskis, H; Stankevičius, A; Šalomskas, A

2016-01-01

The aim of the present study was to detect canine parvovirus (CPV) from faecal samples of clinically ill domestic dogs by polymerase chain reaction (PCR) followed by VP2 gene partial sequencing and molecular characterization of circulating strains in Lithuania. Eleven clinically and antigen-tested positive dog faecal samples, collected during the period of 2014-2015, were investigated by using PCR. The phylogenetic investigations indicated that the Lithuanian CPV VP2 partial sequences (3025-3706 cds) were closely related and showed 99.0-99.9% identity. All Lithuanian sequences were associated with one phylogroup, but grouped in different clusters. Ten of investigated Lithuanian CPV VP2 sequences were closely associated with CPV 2a antigenic variant (99.4% nt identity). Five CPV VP2 sequences from Lithuania were related to CPV-2a, but were rather divergent (6.8 nt differences). Only one CPV VP2 sequence from Lithuania was associated (99.3% nt identity) with CPV-2b VP2 sequences from France, Italy, USA and Korea. The four of eleven investigated Lithuanian dogs with CPV infection symptoms were vaccinated with CPV-2 vaccine, but their VP2 sequences were phylogenetically distantly associated with CPV vaccine strains VP2 sequences (11.5-15.8 nt differences). Ten Lithuanian CPV VP2 sequences had monophyletic relations among the close geographically associated samples, but five of them were rather divergent (1.0% less sequence similarity). The one Lithuanian CPV VP2 sequence was closely related with CPV-2b antigenic variant. All the Lithuanian CPV VP2 partial sequences were conservative and phylogenetically low associated with most commonly used CPV vaccine strains.
Evolution of EF-hand calcium-modulated proteins. IV. Exon shuffling did not determine the domain compositions of EF-hand proteins

NASA Technical Reports Server (NTRS)

Kretsinger, R. H.; Nakayama, S.

1993-01-01

In the previous three reports in this series we demonstrated that the EF-hand family of proteins evolved by a complex pattern of gene duplication, transposition, and splicing. The dendrograms based on exon sequences are nearly identical to those based on protein sequences for troponin C, the essential light chain myosin, the regulatory light chain, and calpain. This validates both the computational methods and the dendrograms for these subfamilies. The proposal of congruence for calmodulin, troponin C, essential light chain, and regulatory light chain was confirmed. There are, however, significant differences in the calmodulin dendrograms computed from DNA and from protein sequences. In this study we find that introns are distributed throughout the EF-hand domain and the interdomain regions. Further, dendrograms based on intron type and distribution bear little resemblance to those based on protein or on DNA sequences. We conclude that introns are inserted, and probably deleted, with relatively high frequency. Further, in the EF-hand family exons do not correspond to structural domains and exon shuffling played little if any role in the evolution of this widely distributed homolog family. Calmodulin has had a turbulent evolution. Its dendrograms based on protein sequence, exon sequence, 3'-tail sequence, intron sequences, and intron positions all show significant differences.
Transcriptional regulation of three EIN3-like genes of carnation (Dianthus caryophyllus L. cv. Improved White Sim) during flower development and upon wounding, pollination, and ethylene exposure.

PubMed

Iordachescu, Mihaela; Verlinden, Sven

2005-08-01

Using a combination of approaches, three EIN3-like (EIL) genes DC-EIL1/2 (AY728191), DC-EIL3 (AY728192), and DC-EIL4 (AY728193) were isolated from carnation (Dianthus caryophyllus) petals. DC-EIL1/2 deduced amino acid sequence shares 98% identity with the previously cloned and characterized carnation DC-EIL1 (AF261654), 62% identity with DC-EIL3, and 60% identity with DC-EIL4. DC-EIL3 deduced amino acid sequence shares 100% identity with a previously cloned carnation gene fragment, Dc106 (CF259543), 61% identity with Dianthus caryophyllus DC-EIL1 (AF261654), and 59% identity with DC-EIL4. DC-EIL4 shared 60% identity with DC-EIL1 (AF261654). Expression analyses performed on vegetative and flower tissues (petals, ovaries, and styles) during growth and development and senescence (natural and ethylene-induced) indicated that the mRNA accumulation of the DC-EIL family of genes in carnation is regulated developmentally and by ethylene. DC-EIL3 mRNA showed significant accumulation upon ethylene exposure, during flower development, and upon pollination in petals and styles. Interestingly, decreasing levels of DC-EIL3 mRNA were found in wounded leaves and ovaries of senescing flowers whenever ethylene levels increased. Flowers treated with sucrose showed a 2 d delay in the accumulation of DC-EIL3 transcripts when compared with control flowers. These observations suggest an important role for DC-EIL3 during growth and development. Changes in DC-EIL1/2 and DC-EIL4 mRNA levels during flower development, and upon ethylene exposure and pollination were very similar. mRNA levels of the DC-EILs in styles of pollinated flowers showed a positive correlation with ethylene production after pollination. The cloning and characterization of the EIN3-like genes in the present study showed their transcriptional regulation not previously observed for EILs.
The complete nucleotide sequence and genome organization of a novel betaflexivirus infecting Citrullus lanatus.

PubMed

Xin, Min; Zhang, Peipei; Liu, Wenwen; Ren, Yingdang; Cao, Mengji; Wang, Xifeng

2017-10-01

The complete nucleotide sequence of a novel positive single-stranded (+ss) RNA virus, tentatively named watermelon virus A (WVA), was determined using a combination of three methods: RNA sequencing, small RNA sequencing, and Sanger sequencing. The full genome of WVA is comprised of 8,372 nucleotides (nt), excluding the poly (A) tail, and contains four open reading frames (ORFs). The largest ORF, ORF1 encodes a putative replication-associated polyprotein (RP) with three conserved domains. ORF2 and ORF4 encode a movement protein (MP) and coat protein (CP), respectively. The putative product encoded by ORF3, of an estimated molecular mass of 25 kDa, has no significant similarity with other proteins. Identity and phylogenetic analysis indicate that WVA is a new virus, closely related to members of the family Betaflexiviridae. However, the final taxonomic allocation of WVA within the family is yet to be determined.
Diverse Antibiotic Resistance Genes in Dairy Cow Manure

PubMed Central

Wichmann, Fabienne; Udikovic-Kolic, Nikolina; Andrew, Sheila; Handelsman, Jo

2014-01-01

ABSTRACT Application of manure from antibiotic-treated animals to crops facilitates the dissemination of antibiotic resistance determinants into the environment. However, our knowledge of the identity, diversity, and patterns of distribution of these antibiotic resistance determinants remains limited. We used a new combination of methods to examine the resistome of dairy cow manure, a common soil amendment. Metagenomic libraries constructed with DNA extracted from manure were screened for resistance to beta-lactams, phenicols, aminoglycosides, and tetracyclines. Functional screening of fosmid and small-insert libraries identified 80 different antibiotic resistance genes whose deduced protein sequences were on average 50 to 60% identical to sequences deposited in GenBank. The resistance genes were frequently found in clusters and originated from a taxonomically diverse set of species, suggesting that some microorganisms in manure harbor multiple resistance genes. Furthermore, amid the great genetic diversity in manure, we discovered a novel clade of chloramphenicol acetyltransferases. Our study combined functional metagenomics with third-generation PacBio sequencing to significantly extend the roster of functional antibiotic resistance genes found in animal gut bacteria, providing a particularly broad resource for understanding the origins and dispersal of antibiotic resistance genes in agriculture and clinical settings. PMID:24757214
The identification and functional annotation of RNA structures conserved in vertebrates

PubMed Central

Seemann, Stefan E.; Mirza, Aashiq H.; Hansen, Claus; Bang-Berthelsen, Claus H.; Garde, Christian; Christensen-Dalsgaard, Mikkel; Torarinsson, Elfar; Yao, Zizhen; Workman, Christopher T.; Pociot, Flemming; Nielsen, Henrik; Tommerup, Niels; Ruzzo, Walter L.; Gorodkin, Jan

2017-01-01

Structured elements of RNA molecules are essential in, e.g., RNA stabilization, localization, and protein interaction, and their conservation across species suggests a common functional role. We computationally screened vertebrate genomes for conserved RNA structures (CRSs), leveraging structure-based, rather than sequence-based, alignments. After careful correction for sequence identity and GC content, we predict ∼516,000 human genomic regions containing CRSs. We find that a substantial fraction of human–mouse CRS regions (1) colocalize consistently with binding sites of the same RNA binding proteins (RBPs) or (2) are transcribed in corresponding tissues. Additionally, a CaptureSeq experiment revealed expression of many of our CRS regions in human fetal brain, including 662 novel ones. For selected human and mouse candidate pairs, qRT-PCR and in vitro RNA structure probing supported both shared expression and shared structure despite low abundance and low sequence identity. About 30,000 CRS regions are located near coding or long noncoding RNA genes or within enhancers. Structured (CRS overlapping) enhancer RNAs and extended 3′ ends have significantly increased expression levels over their nonstructured counterparts. Our findings of transcribed uncharacterized regulatory regions that contain CRSs support their RNA-mediated functionality. PMID:28487280
Infection of Taenia asiatica in a Bai Person in Dali, China.

PubMed

Wang, Li; Luo, Xuenong; Hou, Junling; Guo, Aijiang; Zhang, Shaohua; Li, Hailong; Cai, Xuepeng

2016-02-01

We report here a human case of Taenia asiatica infection which was confirmed by genetic analyses in Dali, China. A patient was found to have symptoms of taeniasis with discharge of tapeworm proglottids. By sequencing of the mitochondrial cytochrome c oxidase subunit 1 (cox1) gene, we observed nucleotide sequence identity of 99% with T. asiatica and 96% with T. saginata. Using the cytochrome b (cytb) gene, 99% identity with T. asiatica and 96% identity with T. saginata were found. Our findings suggest that taeniasis of people in Dali, China may be mainly caused by T. asiatica.
Infection of Taenia asiatica in a Bai Person in Dali, China

PubMed Central

Wang, Li; Luo, Xuenong; Hou, Junling; Guo, Aijiang; Zhang, Shaohua; Li, Hailong; Cai, Xuepeng

2016-01-01

We report here a human case of Taenia asiatica infection which was confirmed by genetic analyses in Dali, China. A patient was found to have symptoms of taeniasis with discharge of tapeworm proglottids. By sequencing of the mitochondrial cytochrome c oxidase subunit 1 (cox1) gene, we observed nucleotide sequence identity of 99% with T. asiatica and 96% with T. saginata. Using the cytochrome b (cytb) gene, 99% identity with T. asiatica and 96% identity with T. saginata were found. Our findings suggest that taeniasis of people in Dali, China may be mainly caused by T. asiatica. PMID:26951981
Precursors of vertebrate peptide antibiotics dermaseptin b and adenoregulin have extensive sequence identities with precursors of opioid peptides dermorphin, dermenkephalin, and deltorphins.

PubMed

Amiche, M; Ducancel, F; Mor, A; Boulain, J C; Menez, A; Nicolas, P

1994-07-08

The dermaseptins are a family of broad spectrum antimicrobial peptides, 27-34 amino acids long, involved in the defense of the naked skin of frogs against microbial invasion. They are the first vertebrate peptides to show lethal effects against the filamentous fungi responsible for severe opportunistic infections accompanying immunodeficiency syndrome and the use of immunosuppressive agents. A cDNA library was constructed from skin poly(A+) RNA of the arboreal frog Phyllomedusa bicolor and screened with an oligonucleotide probe complementary to the COOH terminus of dermaseptin b. Several clones contained a full-length DNA copy of a 443-nucleotide mRNA that encoded a 78-residue dermaseptin b precursor protein. The deduced precursor contained a putative signal sequence at the NH2 terminus, a 20-residue spacer sequence extremely rich (60%) in glutamic and aspartic acids, and a single copy of a dermaseptin b progenitor sequence at the COOH terminus. One clone contained a complete copy of adenoregulin, a 33-residue peptide reported to enhance the binding of agonists to the A1 adenosine receptor. The mRNAs encoding adenoregulin and dermaseptin b were very similar: 70 and 75% nucleotide identities between the 5'- and 3'-untranslated regions, respectively; 91% amino acid identity between the signal peptides; 82% identity between the acidic spacer sequences; and 38% identity between adenoregulin and dermaseptin b. Because adenoregulin and dermaseptin b have similar precursor designs and antimicrobial spectra, adenoregulin should be considered as a new member of the dermaseptin family and alternatively named dermaseptin b II. Preprodermaseptin b and preproadenoregulin have considerable sequence identities to the precursors encoding the opioid heptapeptides dermorphin, dermenkephalin, and deltorphins. This similarity extended into the 5'-untranslated regions of the mRNAs. These findings suggest that the genes encoding the four preproproteins are all members of the same family despite the fact that they encode end products having very different biological activities. These genes might contain a homologous export exon comprising the 5'-untranslated region, the 22-residue signal peptide, the 20-24-residue acidic spacer, and the basic pair Lys-Arg.
Application of Genotyping during an Extensive Outbreak of Waterborne Giardiasis in Bergen, Norway, during Autumn and Winter 2004†

PubMed Central

Robertson, L. J.; Hermansen, L.; Gjerde, B. K.; Strand, E.; Alvsvåg, J. O.; Langeland, N.

2006-01-01

During the autumn and winter of 2004 and 2005, an extensive outbreak of waterborne giardiasis occurred in Bergen, Norway. Over 1,500 patients were diagnosed with giardiasis. Analysis of water from the implicated source revealed low numbers of Giardia cysts, but the initial contamination event probably occurred up to 10 weeks previously. While sewage leakage from a residential area is now considered to be the probable source of contamination, during the episode waste from one particular septic tank was thought to be a possible source. Genotyping of cysts from the septic tank demonstrated that they were assemblage A cysts, although the sequences were not identical to any previously published sequences. For the β-giardin gene, the closest published subgenotype was subgenotype A3; for the gdh gene, the closest published subgenotype was subgenotype A2. Genotyping of cysts from 21 patient samples revealed that they were assemblage B cysts; thus, the septic tank was unlikely to be the contamination source. Sequencing of the β-giardin and gdh genes from patient samples and a comparison of the sequences gave complex results. For the β-giardin gene, three isolates had sequences identical to subgenotype B3 sequences. However, other isolates had between one and four single-nucleotide polymorphisms (SNPs). For the gdh gene, none of the sequences were identical to the sequence published for subgenotype B3, and the sequences had between one and three SNPs. One isolate, which was identical to subgenotype B3 at the β-giardin gene, was more similar to subgenotype B2 at the gdh gene. Grouping the isolates on the basis of SNPs resulted in different groups for the two genes. The results are discussed in relation to giardiasis in Norway and to other Giardia genotyping studies. PMID:16517674
Cellulolytic Bacteria in the Foregut of the Dromedary Camel (Camelus dromedarius)

PubMed Central

Samsudin, Anjas A.; Wright, André-Denis G.

2012-01-01

Foregut digesta from five feral dromedary camels were inoculated into three different enrichment media: cotton thread, filter paper, and neutral detergent fiber. A total of 283 16S rRNA gene sequences were assigned to 33 operational taxonomic units by using 99% species-level identity. LIBSHUFF revealed significant differences in the community composition across all three libraries. PMID:23042173

Cellulolytic bacteria in the foregut of the dromedary camel (Camelus dromedarius).

PubMed

Samsudin, Anjas A; Wright, André-Denis G; Al Jassim, Rafat

2012-12-01

Foregut digesta from five feral dromedary camels were inoculated into three different enrichment media: cotton thread, filter paper, and neutral detergent fiber. A total of 283 16S rRNA gene sequences were assigned to 33 operational taxonomic units by using 99% species-level identity. LIBSHUFF revealed significant differences in the community composition across all three libraries.
Characterization of gonadotrophin-releasing hormone precursor cDNA in the Old World mole-rat Cryptomys hottentotus pretoriae: high degree of identity with the New World guinea pig sequence.

PubMed

Kalamatianos, T; du Toit, L; Hrabovszky, E; Kalló, I; Marsh, P J; Bennett, N C; Coen, C W

2005-05-01

Regulation of pituitary gonadotrophins by the decapeptide gonadotrophin-releasing hormone 1 (GnRH1) is crucial for the development and maintenance of reproductive functions. A common amino acid sequence for this decapeptide, designated as 'mammalian' GnRH, has been identified in all mammals thus far investigated with the exception of the guinea pig, in which there are two amino acid substitutions. Among hystricognath rodents, the members of the family Bathyergidae regulate reproduction in response to diverse cues. Thus, highveld mole-rats (Cryptomys hottentotus pretoriae) are social bathyergids in which breeding is restricted to a particular season in the dominant female, but continuously suppressed in subordinate colony members. Elucidation of reproductive control in these animals will be facilitated by characterization of their GnRH1 gene. A partial sequence of GnRH1 precursor cDNA was isolated and characterized. Comparative analysis revealed the highest degree of identity (86%) to guinea pig GnRH1 precursor mRNA. Nevertheless, the deduced amino acid sequence of the mole-rat decapeptide is identical to the 'mammalian' sequence rather than that of guinea pigs. Successful detection of GnRH1-synthesizing neurones using either a guinea pig GnRH1 riboprobe or an antibody against the 'mammalian' decapeptide is consistent with the guinea pig-like sequence for the precursor and the classic 'mammalian' form for the decapeptide. The high degree of identity in the GnRH1 precursor sequence between this Old World mole-rat and the New World guinea pig is consistent with the theory that caviomorphs and phiomorphs originated from a common ancestral line in the Palaeocene to mid Eocene, some 63-45 million years ago.
Microbial genomic taxonomy

PubMed Central

2013-01-01

A need for a genomic species definition is emerging from several independent studies worldwide. In this commentary paper, we discuss recent studies on the genomic taxonomy of diverse microbial groups and a unified species definition based on genomics. Accordingly, strains from the same microbial species share >95% Average Amino Acid Identity (AAI) and Average Nucleotide Identity (ANI), >95% identity based on multiple alignment genes, <10 in Karlin genomic signature, and > 70% in silico Genome-to-Genome Hybridization similarity (GGDH). Species of the same genus will form monophyletic groups on the basis of 16S rRNA gene sequences, Multilocus Sequence Analysis (MLSA) and supertree analysis. In addition to the established requirements for species descriptions, we propose that new taxa descriptions should also include at least a draft genome sequence of the type strain in order to obtain a clear outlook on the genomic landscape of the novel microbe. The application of the new genomic species definition put forward here will allow researchers to use genome sequences to define simultaneously coherent phenotypic and genomic groups. PMID:24365132
Molecular and phylogenetic characterization of the homoeologous EPSP Synthase genes of allohexaploid wheat, Triticum aestivum (L.).

PubMed

Aramrak, Attawan; Kidwell, Kimberlee K; Steber, Camille M; Burke, Ian C

2015-10-23

5-Enolpyruvylshikimate-3-phosphate synthase (EPSPS) is the sixth and penultimate enzyme in the shikimate biosynthesis pathway, and is the target of the herbicide glyphosate. The EPSPS genes of allohexaploid wheat (Triticum aestivum, AABBDD) have not been well characterized. Herein, the three homoeologous copies of the allohexaploid wheat EPSPS gene were cloned and characterized. Genomic and coding DNA sequences of EPSPS from the three related genomes of allohexaploid wheat were isolated using PCR and inverse PCR approaches from soft white spring "Louise'. Development of genome-specific primers allowed the mapping and expression analysis of TaEPSPS-7A1, TaEPSPS-7D1, and TaEPSPS-4A1 on chromosomes 7A, 7D, and 4A, respectively. Sequence alignments of cDNA sequences from wheat and wheat relatives served as a basis for phylogenetic analysis. The three genomic copies of wheat EPSPS differed by insertion/deletion and single nucleotide polymorphisms (SNPs), largely in intron sequences. RT-PCR analysis and cDNA cloning revealed that EPSPS is expressed from all three genomic copies. However, TaEPSPS-4A1 is expressed at much lower levels than TaEPSPS-7A1 and TaEPSPS-7D1 in wheat seedlings. Phylogenetic analysis of 1190-bp cDNA clones from wheat and wheat relatives revealed that: 1) TaEPSPS-7A1 is most similar to EPSPS from the tetraploid AB genome donor, T. turgidum (99.7 % identity); 2) TaEPSPS-7D1 most resembles EPSPS from the diploid D genome donor, Aegilops tauschii (100 % identity); and 3) TaEPSPS-4A1 resembles EPSPS from the diploid B genome relative, Ae. speltoides (97.7 % identity). Thus, EPSPS sequences in allohexaploid wheat are preserved from the most two recent ancestors. The wheat EPSPS genes are more closely related to Lolium multiflorum and Brachypodium distachyon than to Oryza sativa (rice). The three related EPSPS homoeologues of wheat exhibited conservation of the exon/intron structure and of coding region sequence, but contained significant sequence variation within intron regions. The genome-specific primers developed will enable future characterization of natural and induced variation in EPSPS sequence and expression. This can be useful in investigating new causes of glyphosate herbicide resistance.
Inferring Higher Functional Information for RIKEN Mouse Full-Length cDNA Clones With FACTS

PubMed Central

Nagashima, Takeshi; Silva, Diego G.; Petrovsky, Nikolai; Socha, Luis A.; Suzuki, Harukazu; Saito, Rintaro; Kasukawa, Takeya; Kurochkin, Igor V.; Konagaya, Akihiko; Schönbach, Christian

2003-01-01

FACTS (Functional Association/Annotation of cDNA Clones from Text/Sequence Sources) is a semiautomated knowledge discovery and annotation system that integrates molecular function information derived from sequence analysis results (sequence inferred) with functional information extracted from text. Text-inferred information was extracted from keyword-based retrievals of MEDLINE abstracts and by matching of gene or protein names to OMIM, BIND, and DIP database entries. Using FACTS, we found that 47.5% of the 60,770 RIKEN mouse cDNA FANTOM2 clone annotations were informative for text searches. MEDLINE queries yielded molecular interaction-containing sentences for 23.1% of the clones. When disease MeSH and GO terms were matched with retrieved abstracts, 22.7% of clones were associated with potential diseases, and 32.5% with GO identifiers. A significant number (23.5%) of disease MeSH-associated clones were also found to have a hereditary disease association (OMIM Morbidmap). Inferred neoplastic and nervous system disease represented 49.6% and 36.0% of disease MeSH-associated clones, respectively. A comparison of sequence-based GO assignments with informative text-based GO assignments revealed that for 78.2% of clones, identical GO assignments were provided for that clone by either method, whereas for 21.8% of clones, the assignments differed. In contrast, for OMIM assignments, only 28.5% of clones had identical sequence-based and text-based OMIM assignments. Sequence, sentence, and term-based functional associations are included in the FACTS database (http://facts.gsc.riken.go.jp/), which permits results to be annotated and explored through web-accessible keyword and sequence search interfaces. The FACTS database will be a critical tool for investigating the functional complexity of the mouse transcriptome, cDNA-inferred interactome (molecular interactions), and pathome (pathologies). PMID:12819151
Divergent nuclear 18S rDNA paralogs in a turkey coccidium, Eimeria meleagrimitis, complicate molecular systematics and identification.

PubMed

El-Sherry, Shiem; Ogedengbe, Mosun E; Hafeez, Mian A; Barta, John R

2013-07-01

Multiple 18S rDNA sequences were obtained from two single-oocyst-derived lines of each of Eimeria meleagrimitis and Eimeria adenoeides. After analysing the 15 new 18S rDNA sequences from two lines of E. meleagrimitis and 17 new sequences from two lines of E. adenoeides, there were clear indications that divergent, paralogous 18S rDNA copies existed within the nuclear genome of E. meleagrimitis. In contrast, mitochondrial cytochrome c oxidase subunit I (COI) partial sequences from all lines of a particular Eimeria sp. were identical and, in phylogenetic analyses, COI sequences clustered unambiguously in monophyletic and highly-supported clades specific to individual Eimeria sp. Phylogenetic analysis of the new 18S rDNA sequences from E. meleagrimitis showed that they formed two distinct clades: Type A with four new sequences; and Type B with nine new sequences; both Types A and B sequences were obtained from each of the single-oocyst-derived lines of E. meleagrimitis. Together these rDNA types formed a well-supported E. meleagrimitis clade. Types A and B 18S rDNA sequences from E. meleagrimitis had a mean sequence identity of only 97.4% whereas mean sequence identity within types was 99.1-99.3%. The observed intraspecific sequence divergence among E. meleagrimitis 18S rDNA sequence types was even higher (approximately 2.6%) than the interspecific sequence divergence present between some well-recognized species such as Eimeria tenella and Eimeria necatrix (1.1%). Our observations suggest that, unlike COI sequences, 18S rDNA sequences are not reliable molecular markers to be used alone for species identification with coccidia, although 18S rDNA sequences have clear utility for phylogenetic reconstruction of apicomplexan parasites at the genus and higher taxonomic ranks. Copyright © 2013. Published by Elsevier Ltd.
Analysis of ChimeriVax Japanese Encephalitis Virus envelope for T-cell epitopes and comparison to circulating strain sequences.

PubMed

De Groot, Anne S; Martin, William; Moise, Leonard; Guirakhoo, Farshad; Monath, Thomas

2007-11-19

T-cell epitope variability is associated with viral immune escape and may influence the outcome of vaccination against the highly variable Japanese Encephalitis Virus (JEV). We computationally analyzed the ChimeriVax-JEV vaccine envelope sequence for T helper epitopes that are conserved in 12 circulating JEV strains and discovered 75% conservation among putative epitopes. Among non-identical epitopes, only minor amino acid changes that would not significantly affect HLA-binding were present. Therefore, in most cases, circulating strain epitopes could be restricted by the same HLA and are likely to stimulate a cross-reactive T-cell response. Based on this analysis, we predict no significant abrogation of ChimeriVax-JEV-conferred protection against circulating JEV strains.
Molecular analysis of the split cox1 gene from the Basidiomycota Agrocybe aegerita: relationship of its introns with homologous Ascomycota introns and divergence levels from common ancestral copies.

PubMed

Gonzalez, P; Barroso, G; Labarère, J

1998-10-05

The Basidiomycota Agrocybe aegerita (Aa) mitochondrial cox1 gene (6790 nucleotides), encoding a protein of 527aa (58377Da), is split by four large subgroup IB introns possessing site-specific endonucleases assumed to be involved in intron mobility. When compared to other fungal COX1 proteins, the Aa protein is closely related to the COX1 one of the Basidiomycota Schizophyllum commune (Sc). This clade reveals a relationship with the studied Ascomycota ones, with the exception of Schizosaccharomyces pombe (Sp) which ranges in an out-group position compared with both higher fungi divisions. When comparison is extended to other kingdoms, fungal COX1 sequences are found to be more related to algae and plant ones (more than 57.5% aa similarity) than to animal sequences (53.6% aa similarity), contrasting with the previously established close relationship between fungi and animals, based on comparisons of nuclear genes. The four Aa cox1 introns are homologous to Ascomycota or algae cox1 introns sharing the same location within the exonic sequences. The percentages of identity of the intronic nucleotide sequences suggest a possible acquisition by lateral transfers of ancestral copies or of their derived sequences. These identities extend over the whole intronic sequences, arguing in favor of a transfer of the complete intron rather than a transfer limited to the encoded ORF. The intron i4 shares 74% of identity, at the nucleotidic level, with the Podospora anserina (Pa) intron i14, and up to 90.5% of aa similarity between the encoded proteins, i.e. the highest values reported to date between introns of two phylogenetically distant species. This low divergence argues for a recent lateral transfer between the two species. On the contrary, the low sequence identities (below 36%) observed between Aa i1 and the homologous Sp i1 or Prototheca wickeramii (Pw) i1 suggest a long evolution time after the separation of these sequences. The introns i2 and i3 possessed intermediate percentages of identity with their homologous Ascomycota introns. This is the first report of the complete nucleotide sequence and molecular organization of a mitochondrial cox1 gene of any member of the Basidiomycota division.
Complete genome sequence of lymphocystis disease virus isolated from China.

PubMed

Zhang, Qi-Ya; Xiao, Feng; Xie, Jian; Li, Zheng-Qiu; Gui, Jian-Fang

2004-07-01

Lymphocystis diseases in fish throughout the world have been extensively described. Here we report the complete genome sequence of lymphocystis disease virus isolated in China (LCDV-C), an LCDV isolated from cultured flounder (Paralichthys olivaceus) with lymphocystis disease in China. The LCDV-C genome is 186,250 bp, with a base composition of 27.25% G+C. Computer-assisted analysis revealed 240 potential open reading frames (ORFs) and 176 nonoverlapping putative viral genes, which encode polypeptides ranging from 40 to 1,193 amino acids. The percent coding density is 67%, and the average length of each ORF is 702 bp. A search of the GenBank database using the 176 individual putative genes revealed 103 homologues to the corresponding ORFs of LCDV-1 and 73 potential genes that were not found in LCDV-1 and other iridoviruses. Among the 73 genes, there are 8 genes that contain conserved domains of cellular genes and 65 novel genes that do not show any significant homology with the sequences in public databases. Although a certain extent of similarity between putative gene products of LCDV-C and corresponding proteins of LCDV-1 was revealed, no colinearity was detected when their ORF arrangements and coding strategies were compared to each other, suggesting that a high degree of genetic rearrangements between them has occurred. And a large number of tandem and overlapping repeated sequences were observed in the LCDV-C genome. The deduced amino acid sequence of the major capsid protein (MCP) presents the highest identity to those of LCDV-1 and other iridoviruses among the LCDV-C gene products. Furthermore, a phylogenetic tree was constructed based on the multiple alignments of nine MCP amino acid sequences. Interestingly, LCDV-C and LCDV-1 were clustered together, but their amino acid identity is much less than that in other clusters. The unexpected levels of divergence between their genomes in size, gene organization, and gene product identity suggest that LCDV-C and LCDV-1 shouldn't belong to a same species and that LCDV-C should be considered a species different from LCDV-1.
Complete Genome Sequence of Lymphocystis Disease Virus Isolated from China

PubMed Central

Zhang, Qi-Ya; Xiao, Feng; Xie, Jian; Li, Zheng-Qiu; Gui, Jian-Fang

2004-01-01

Lymphocystis diseases in fish throughout the world have been extensively described. Here we report the complete genome sequence of lymphocystis disease virus isolated in China (LCDV-C), an LCDV isolated from cultured flounder (Paralichthys olivaceus) with lymphocystis disease in China. The LCDV-C genome is 186,250 bp, with a base composition of 27.25% G+C. Computer-assisted analysis revealed 240 potential open reading frames (ORFs) and 176 nonoverlapping putative viral genes, which encode polypeptides ranging from 40 to 1,193 amino acids. The percent coding density is 67%, and the average length of each ORF is 702 bp. A search of the GenBank database using the 176 individual putative genes revealed 103 homologues to the corresponding ORFs of LCDV-1 and 73 potential genes that were not found in LCDV-1 and other iridoviruses. Among the 73 genes, there are 8 genes that contain conserved domains of cellular genes and 65 novel genes that do not show any significant homology with the sequences in public databases. Although a certain extent of similarity between putative gene products of LCDV-C and corresponding proteins of LCDV-1 was revealed, no colinearity was detected when their ORF arrangements and coding strategies were compared to each other, suggesting that a high degree of genetic rearrangements between them has occurred. And a large number of tandem and overlapping repeated sequences were observed in the LCDV-C genome. The deduced amino acid sequence of the major capsid protein (MCP) presents the highest identity to those of LCDV-1 and other iridoviruses among the LCDV-C gene products. Furthermore, a phylogenetic tree was constructed based on the multiple alignments of nine MCP amino acid sequences. Interestingly, LCDV-C and LCDV-1 were clustered together, but their amino acid identity is much less than that in other clusters. The unexpected levels of divergence between their genomes in size, gene organization, and gene product identity suggest that LCDV-C and LCDV-1 shouldn't belong to a same species and that LCDV-C should be considered a species different from LCDV-1. PMID:15194775
The genome sequence of pepper vein yellows virus (family Luteoviridae, genus Polerovirus).

PubMed

Murakami, Ritsuko; Nakashima, Nobuhiko; Hinomoto, Norihide; Kawano, Shinji; Toyosato, Tetsuya

2011-05-01

The complete genome of pepper vein yellows virus (PeVYV) was sequenced using random amplification of RNA samples isolated from vector insects (Aphis gossypii) that had been given access to PeVYV-infected plants. The PeVYV genome consisted of 6244 nucleotides and had a genomic organization characteristic of members of the genus Polerovirus. PeVYV had highest amino acid sequence identities in ORF0 to ORF3 (75.9 - 91.9%) with tobacco vein distorting polerovirus, with which it was only 25.1% identical in ORF5. These sequence comparisons and previously studied biological properties indicate that PeVYV is a distinctly different virus and belongs to a new species of the genus Polerovirus.
Two divergent endo-beta-1,4-glucanase genes exhibit overlapping expression in ripening fruit and abscising flowers.

PubMed Central

Lashbrook, C C; Gonzalez-Bosch, C; Bennett, A B

1994-01-01

Two structurally divergent endo-beta-1,4-glucanase (EGase) cDNAs were cloned from tomato. Although both cDNAs (Cel1 and Cel2) encode potentially glycosylated, basic proteins of 51 to 53 kD and possess multiple amino acid domains conserved in both plant and microbial EGases, Cel1 and Cel2 exhibit only 50% amino acid identity at the overall sequence level. Amino acid sequence comparisons to other plant EGases indicate that tomato Cel1 is most similar to bean abscission zone EGase (68%), whereas Cel2 exhibits greatest sequence identity to avocado fruit EGase (57%). Sequence comparisons suggest the presence of at least two structurally divergent EGase families in plants. Unlike ripening avocado fruit and bean abscission zones in which a single EGase mRNA predominates, EGase expression in tomato reflects the overlapping accumulation of both Cel1 and Cel2 transcripts in ripening fruit and in plant organs undergoing cell separation. Cel1 mRNA contributes significantly to total EGase mRNA accumulation within plant organs undergoing cell separation (abscission zones and mature anthers), whereas Cel2 mRNA is most abundant in ripening fruit. The overlapping expression of divergent EGase genes within a single species may suggest that multiple activities are required for the cooperative disassembly of cell wall components during fruit ripening, floral abscission, and anther dehiscence. PMID:7994180
Isolation and characterization of two cryptic plasmids in the ammonia-oxidizing bacterium Nitrosomonas sp. strain ENI-11.

PubMed

Yamagata, A; Kato, J; Hirota, R; Kuroda, A; Ikeda, T; Takiguchi, N; Ohtake, H

1999-06-01

Two plasmids were discovered in the ammonia-oxidizing bacterium Nitrosomonas sp. strain ENI-11, which was isolated from activated sludge. The plasmids, designated pAYS and pAYL, were relatively small, being approximately 1.9 kb long. They were cryptic plasmids, having no detectable plasmid-linked antibiotic resistance or heavy metal resistance markers. The complete nucleotide sequences of pAYS and pAYL were determined, and their physical maps were constructed. There existed two major open reading frames, ORF1 in pAYS and ORF2 in pAYL, each of which was more than 500 bp long. The predicted product of ORF2 was 28% identical to part of the replication protein of a Bacillus plasmid, pBAA1. However, no significant similarity to any known protein sequences was detected with the predicted product of ORF1. pAYS and pAYL had a highly homologous region, designated HHR, of 262 bp. The overall identity was 98% between the two nucleotide sequences. Interestingly, HHR-homologous sequences were also detected in the genomes of ENI-11 and the plasmidless strain Nitrosomonas europaea IFO14298. Deletion analysis of pAYS and pAYL indicated that HHR, together with either ORF1 or ORF2, was essential for plasmid maintenance in ENI-11. To our knowledge, pAYS and pAYL are the first plasmids found in the ammonia-oxidizing autotrophic bacteria.
Complete genome analysis of jasmine virus T from Jasminum sambac in China.

PubMed

Tang, Yajun; Gao, Fangluan; Yang, Zhen; Wu, Zujian; Yang, Liang

2016-07-01

The genome of a potyvirus (isolate JaVT_FZ) recovered from jasmine (Jasminum sambac L.) showing yellow ringspot symptoms in Fuzhou, China, was sequenced. JaVT_FZ is closely related to seven other potyviruses with completely sequenced genomes, with which it shares 66-70 % nucleotide and 52-56 % amino acid sequence identity. However, the coat protein (CP) gene shares 82-92 % nucleotide and 90-97 % amino acid sequence identity with those of two partially sequenced potyviruses, named jasmine potyvirus T (JaVT-jasmine) and jasmine yellow mosaic potyvirus (JaYMV-India), respectively. This suggests that JaVT_FZ, JaVT-jasmine and JaYMV-India should be regarded as members of a single potyvirus species, for which the name "Jasmine virus T" has priority.
Selection of optimal oligonucleotide probes for microarrays usingmultiple criteria, global alignment and parameter estimation.

DOE Office of Scientific and Technical Information (OSTI.GOV)

Li, Xingyuan; He, Zhili; Zhou, Jizhong

2005-10-30

The oligonucleotide specificity for microarray hybridizationcan be predicted by its sequence identity to non-targets, continuousstretch to non-targets, and/or binding free energy to non-targets. Mostcurrently available programs only use one or two of these criteria, whichmay choose 'false' specific oligonucleotides or miss 'true' optimalprobes in a considerable proportion. We have developed a software tool,called CommOligo using new algorithms and all three criteria forselection of optimal oligonucleotide probes. A series of filters,including sequence identity, free energy, continuous stretch, GC content,self-annealing, distance to the 3'-untranslated region (3'-UTR) andmelting temperature (Tm), are used to check each possibleoligonucleotide. A sequence identity is calculated based onmore » gapped globalalignments. A traversal algorithm is used to generate alignments for freeenergy calculation. The optimal Tm interval is determined based on probecandidates that have passed all other filters. Final probes are pickedusing a combination of user-configurable piece-wise linear functions andan iterative process. The thresholds for identity, stretch and freeenergy filters are automatically determined from experimental data by anaccessory software tool, CommOligo_PE (CommOligo Parameter Estimator).The program was used to design probes for both whole-genome and highlyhomologous sequence data. CommOligo and CommOligo_PE are freely availableto academic users upon request.« less
New Ehrlichia Species Closely Related to Ehrlichia chaffeensis Isolated from Ixodes ovatus Ticks in Japan

PubMed Central

Shibata, Shin-ichiro; Kawahara, Makoto; Rikihisa, Yasuko; Fujita, Hiromi; Watanabe, Yuriko; Suto, Chiharu; Ito, Tadahiko

2000-01-01

Seven Ehrlichia strains (six HF strains and one Anan strain) that were obtained from laboratory mice by intraperitoneally inoculating homogenates of adult Ixodes ovatus collected in Japan were characterized. 16S rRNA sequences of all six HF strains were identical, and the sequences were 99.7, 98.2, and 97.7% identical to those of Anan strain, Ehrlichia chaffeensis (human monocytic ehrlichiosis agent), and E. muris, respectively. Partial GroEL amino acid sequencing also revealed that the six HF strains had identical sequences, which were 99.0, 98.5, and 97.3% identical to those of E. chaffeensis, the Anan strain, and E. canis, respectively. All HF strains were lethal to mice at higher dosages and intraperitoneal inoculation, whereas the Anan or E. muris strain induced only mild clinical signs. Light and electron microscopy of moribund mice inoculated with one of the HF strains revealed severe liver necrosis and the presence of numerous ehrlichial inclusions (morulae) in various organs. The study revealed that members of E. canis genogroup are naturally present in Ixodes ticks. HF strains that can cause severe illness in immunocompetent laboratory mice would be valuable in studying the pathogenesis and the roles of both cellular and humoral immune responses in ehrlichiosis caused by E. canis genogroup. PMID:10747103
Petroleum hydrocarbon contamination, plant identity and arbuscular mycorrhizal fungal (AMF) community determine assemblages of the AMF spore-associated microbes.

PubMed

Iffis, Bachir; St-Arnaud, Marc; Hijri, Mohamed

2016-09-01

The root-associated microbiome is a key determinant of pollutant degradation, soil nutrient availability and plant biomass productivity, but could not be examined in depth prior to recent advances in high-throughput sequencing. Arbuscular mycorrhizal fungi (AMF) form symbioses with the majority of vascular plants. They are known to enhance mineral uptake and promote plant growth and are postulated to influence the processes involved in phytoremediation. Amplicon sequencing approaches have previously shown that petroleum hydrocarbon pollutant (PHP) concentration strongly influences AMF community structure in in situ phytoremediation experiments. We examined how AMF communities and their spore-associated microbiomes were structured within the rhizosphere of three plant species growing spontaneously in three distinct waste decantation basins of a former petrochemical plant. Our results show that the AMF community was only affected by PHP concentrations, while the AMF-associated fungal and bacterial communities were significantly affected by both PHP concentrations and plant species identity. We also found that some AMF taxa were either positively or negatively correlated with some fungal and bacterial groups. Our results suggest that in addition to PHP concentrations and plant species identity, AMF community composition may also shape the community structure of bacteria and fungi associated with AMF spores. © 2016 Society for Applied Microbiology and John Wiley & Sons Ltd.
Indigenous and introduced potyviruses of legumes and Passiflora spp. from Australia: biological properties and comparison of coat protein sequences

USDA-ARS?s Scientific Manuscript database

Coat protein sequences of 33 Potyvirus isolates from legume and Passiflora spp. were sequenced to determine the identity of infecting viruses. Phylogenetic analysis of the sequences revealed the presence of seven distinct virus species....
Molecular cloning of two human liver 3 alpha-hydroxysteroid/dihydrodiol dehydrogenase isoenzymes that are identical with chlordecone reductase and bile-acid binder.

PubMed Central

Deyashiki, Y; Ogasawara, A; Nakayama, T; Nakanishi, M; Miyabe, Y; Sato, K; Hara, A

1994-01-01

Human liver contains two dihydrodiol dehydrogenases, DD2 and DD4, associated with 3 alpha-hydroxysteroid dehydrogenase activity. We have raised polyclonal antibodies that cross-reacted with the two enzymes and isolated two 1.2 kb cDNA clones (C9 and C11) for the two enzymes from a human liver cDNA library using the antibodies. The clones of C9 and C11 contained coding sequences corresponding to 306 and 321 amino acid residues respectively, but lacked 5'-coding regions around the initiation codon. Sequence analyses of several peptides obtained by enzymic and chemical cleavages of the two purified enzymes verified that the C9 and C11 clones encoded DD2 and DD4 respectively, and further indicated that the sequence of DD2 had at least additional 16 residues upward from the N-terminal sequence deduced from the cDNA. There was 82% amino acid sequence identity between the two enzymes, indicating that the enzymes are genetic isoenzymes. A computer-based comparison of the cDNAs of the isoenzymes with the DNA sequence database revealed that the nucleotide and amino acid sequences of DD2 and DD4 are virtually identical with those of human bile-acid binder and human chlordecone reductase cDNAs respectively. Images Figure 1 PMID:8172617
Visualizing bacterial tRNA identity determinants and antideterminants using function logos and inverse function logos

PubMed Central

Freyhult, Eva; Moulton, Vincent; Ardell, David H.

2006-01-01

Sequence logos are stacked bar graphs that generalize the notion of consensus sequence. They employ entropy statistics very effectively to display variation in a structural alignment of sequences of a common function, while emphasizing its over-represented features. Yet sequence logos cannot display features that distinguish functional subclasses within a structurally related superfamily nor do they display under-represented features. We introduce two extensions to address these needs: function logos and inverse logos. Function logos display subfunctions that are over-represented among sequences carrying a specific feature. Inverse logos generalize both sequence logos and function logos by displaying under-represented, rather than over-represented, features or functions in structural alignments. To make inverse logos, a compositional inverse is applied to the feature or function frequency distributions before logo construction, where a compositional inverse is a mathematical transform that makes common features or functions rare and vice versa. We applied these methods to a database of structurally aligned bacterial tDNAs to create highly condensed, birds-eye views of potentially all so-called identity determinants and antideterminants that confer specific amino acid charging or initiator function on tRNAs in bacteria. We recovered both known and a few potentially novel identity elements. Function logos and inverse logos are useful tools for exploratory bioinformatic analysis of structure–function relationships in sequence families and superfamilies. PMID:16473848

Systematic and fully automated identification of protein sequence patterns.

PubMed

Hart, R K; Royyuru, A K; Stolovitzky, G; Califano, A

2000-01-01

We present an efficient algorithm to systematically and automatically identify patterns in protein sequence families. The procedure is based on the Splash deterministic pattern discovery algorithm and on a framework to assess the statistical significance of patterns. We demonstrate its application to the fully automated discovery of patterns in 974 PROSITE families (the complete subset of PROSITE families which are defined by patterns and contain DR records). Splash generates patterns with better specificity and undiminished sensitivity, or vice versa, in 28% of the families; identical statistics were obtained in 48% of the families, worse statistics in 15%, and mixed behavior in the remaining 9%. In about 75% of the cases, Splash patterns identify sequence sites that overlap more than 50% with the corresponding PROSITE pattern. The procedure is sufficiently rapid to enable its use for daily curation of existing motif and profile databases. Third, our results show that the statistical significance of discovered patterns correlates well with their biological significance. The trypsin subfamily of serine proteases is used to illustrate this method's ability to exhaustively discover all motifs in a family that are statistically and biologically significant. Finally, we discuss applications of sequence patterns to multiple sequence alignment and the training of more sensitive score-based motif models, akin to the procedure used by PSI-BLAST. All results are available at httpl//www.research.ibm.com/spat/.
Gene Deletion in Barley Mediated by LTR-retrotransposon BARE

PubMed Central

Shang, Yi; Yang, Fei; Schulman, Alan H.; Zhu, Jinghuan; Jia, Yong; Wang, Junmei; Zhang, Xiao-Qi; Jia, Qiaojun; Hua, Wei; Yang, Jianming; Li, Chengdao

2017-01-01

A poly-row branched spike (prbs) barley mutant was obtained from soaking a two-rowed barley inflorescence in a solution of maize genomic DNA. Positional cloning and sequencing demonstrated that the prbs mutant resulted from a 28 kb deletion including the inflorescence architecture gene HvRA2. Sequence annotation revealed that the HvRA2 gene is flanked by two LTR (long terminal repeat) retrotransposons (BARE) sharing 89% sequence identity. A recombination between the integrase (IN) gene regions of the two BARE copies resulted in the formation of an intact BARE and loss of HvRA2. No maize DNA was detected in the recombination region although the flanking sequences of HvRA2 gene showed over 73% of sequence identity with repetitive sequences on 10 maize chromosomes. It is still unknown whether the interaction of retrotransposons between barley and maize has resulted in the recombination observed in the present study. PMID:28252053
Radiolabeled Escherichia coli heat-stable enterotoxin analogs for in vivo imaging of colorectal cancer

NASA Astrophysics Data System (ADS)

Giblin, M. F.; Sieckman, G. L.; Owen, N. K.; Hoffman, T. J.; Forte, L. R.; Volkert, W. A.

2005-12-01

The human Escherichia coli heat-stable enterotoxin (STh, amino acid sequence N1SSNYCCELCCNPACTGCY19) binds specifically to the guanylate cyclase C (GC-C) receptor, which is present in high density on the apical surface of normal intestinal epithelial cells as well as on the surface of human colon cancer cells. In the current study, two STh analogs were synthesized and evaluated in vitro and in vivo. Both analogs shared identical 6-19 core sequences, and had N-terminal pendant DOTA moieties. The analogs differed in the identity of a 6 amino acid peptide sequence intervening between DOTA and the 6-19 core. In one analog, the peptide was an RGD-containing sequence found in human fibronectin (GRGDSP), while in the other this peptide sequence was randomly scrambled (GRDSGP). The results indicated that the presence of the human fibronectin sequence in the hybrid peptide did not affect tumor localization in vivo.
Opsin cDNA sequences of a UV and green rhodopsin of the satyrine butterfly Bicyclus anynana.

PubMed

Vanhoutte, K J A; Eggen, B J L; Janssen, J J M; Stavenga, D G

2002-11-01

The cDNAs of an ultraviolet (UV) and long-wavelength (LW) (green) absorbing rhodopsin of the bush brown Bicyclus anynana were partially identified. The UV sequence, encoding 377 amino acids, is 76-79% identical to the UV sequences of the papilionids Papilio glaucus and Papilio xuthus and the moth Manduca sexta. A dendrogram derived from aligning the amino acid sequences reveals an equidistant position of Bicyclus between Papilio and Manduca. The sequence of the green opsin cDNA fragment, which encodes 242 amino acids, represents six of the seven transmembrane regions. At the amino acid level, this fragment is more than 80% identical to the corresponding LW opsin sequences of Dryas, Heliconius, Papilio (rhodopsin 2) and Manduca. Whereas three LW absorbing rhodopsins were identified in the papilionid butterflies, only one green opsin was found in B. anynana.
Characterization of apple stem grooving virus and apple chlorotic leaf spot virus identified in a crab apple tree.

PubMed

Li, Yongqiang; Deng, Congliang; Bian, Yong; Zhao, Xiaoli; Zhou, Qi

2017-04-01

Apple stem grooving virus (ASGV), apple chlorotic leaf spot virus (ACLSV), and prunus necrotic ringspot virus (PNRSV) were identified in a crab apple tree by small RNA deep sequencing. The complete genome sequence of ACLSV isolate BJ (ACLSV-BJ) was 7554 nucleotides and shared 67.0%-83.0% nucleotide sequence identity with other ACLSV isolates. A phylogenetic tree based on the complete genome sequence of all available ACLSV isolates showed that ACLSV-BJ clustered with the isolates SY01 from hawthorn, MO5 from apple, and JB, KMS and YH from pear. The complete nucleotide sequence of ASGV-BJ was 6509 nucleotides (nt) long and shared 78.2%-80.7% nucleotide sequence identity with other isolates. ASGV-BJ and the isolate ASGV_kfp clustered together in the phylogenetic tree as an independent clade. Recombination analysis showed that isolate ASGV-BJ was a naturally occurring recombinant.
Common Amino Acid Subsequences in a Universal Proteome—Relevance for Food Science

PubMed Central

Minkiewicz, Piotr; Darewicz, Małgorzata; Iwaniak, Anna; Sokołowska, Jolanta; Starowicz, Piotr; Bucholska, Justyna; Hrynkiewicz, Monika

2015-01-01

A common subsequence is a fragment of the amino acid chain that occurs in more than one protein. Common subsequences may be an object of interest for food scientists as biologically active peptides, epitopes, and/or protein markers that are used in comparative proteomics. An individual bioactive fragment, in particular the shortest fragment containing two or three amino acid residues, may occur in many protein sequences. An individual linear epitope may also be present in multiple sequences of precursor proteins. Although recent recommendations for prediction of allergenicity and cross-reactivity include not only sequence identity, but also similarities in secondary and tertiary structures surrounding the common fragment, local sequence identity may be used to screen protein sequence databases for potential allergens in silico. The main weakness of the screening process is that it overlooks allergens and cross-reactivity cases without identical fragments corresponding to linear epitopes. A single peptide may also serve as a marker of a group of allergens that belong to the same family and, possibly, reveal cross-reactivity. This review article discusses the benefits for food scientists that follow from the common subsequences concept. PMID:26340620
Novel Detection of Coxiella spp., Theileria luwenshuni, and T. ovis Endosymbionts in Deer Keds (Lipoptena fortisetosa).

PubMed

Lee, Seung-Hun; Kim, Kyoo-Tae; Kwon, Oh-Deog; Ock, Younsung; Kim, Taeil; Choi, Donghag; Kwak, Dongmi

2016-01-01

We describe for the first time the detection of Coxiella-like bacteria (CLB), Theileria luwenshuni, and T. ovis endosymbionts in blood-sucking deer keds. Eight deer keds attached to a Korean water deer were identified as Lipoptena fortisetosa (Diptera: Hippoboscidae) by morphological and genetic analyses. Among the endosymbionts assessed, CLB, Theileria luwenshuni, and T. ovis were identified in L. fortisetosa by PCR and nucleotide sequencing. Based on phylogeny, CLB 16S rRNA sequences were classified into clade B, sharing 99.4% identity with CLB from Haemaphysalis longicornis in South Korea. Although the virulence of CLB to vertebrates is still controversial, several studies have reported clinical symptoms in birds due to CLB infections. The 18S rRNA sequences of T. luwenshuni and T. ovis in this study were 98.8-100% identical to those in GenBank, and all of the obtained sequences of T. ovis and T. luwenshuni in this study were 100% identical to each other, respectively. Although further studies are required to positively confirm L. fortisetosa as a biological vector of these pathogens, strong genetic relationships among sequences from this and previous studies suggest potential transmission among mammalian hosts by ticks and keds.
Complete genome sequence of a new begomovirus associated with yellow mosaic disease of Hemidesmus indicus in India.

PubMed

Reddy, M Sreekanth; Kanakala, S; Srinivas, K P; Hema, M; Malathi, V G; Sreenivasulu, P

2014-05-01

The complete DNA A genome of a virus isolate associated with yellow mosaic disease of a medicinal plant, Hemidesmus indicus, from India was cloned and sequenced. The length of DNA A was 2825 nucleotides, 35 nucleotides longer than the unit genome of monopartite begomoviruses. Comparison of the nucleotide sequence of DNA A of the virus isolate with those of other begomoviruses showed maximum sequence identity of 69 % to DNA A of ageratum yellow vein China virus (AYVCNV; AJ558120) and 68 % with tomato yellow leaf curl virus- LBa4 (TYLCV; EF185318), and it formed a distinct clade in phylogenetic analysis. The genome organization of the present virus isolate was found to be similar to that of Old World monopartite begomoviruses. The genome was considered to be monopartite, because association of DNA B and β satellite DNA components was not detected. Based on its sequence identity (<70 %) to all other begomoviruses known to date and ICTV (International Committee on Taxonomy of Viruses) species demarcating criteria (<89 % identity), it is considered a member of a novel begomovirus species, and the tentative name "Hemidesmus yellow mosaic virus" (HeYMV) is proposed.
Phylogenetic Analysis of Theileria annulata Infected Cell Line S15 Iran Vaccine Strain.

PubMed

Habibi, Gh

2012-01-01

Bovine theileriosis results from infection with obligate intracellular protozoa of the genus Theileria. The phylogenetic relationships between two isolates of Theileria annulata, and 36 Theileria spp., as well as 6 outgroup including Babesia spp. and coccidian protozoa were analyzed using the 18S rRNA gene sequence. The target DNA segment was amplified by PCR. The PCR product was used for direct sequencing. The length of the 18S rRNA gene of all Theileria spp. involved in this study was around 1,400 bp. A phylogenetic tree was inferred based on the 18S rRNA gene sequence of the Iran and Iraq isolates, and other species of Theileria available in GenBank. In the constructed tree, Theileria annulata (Iran vaccine strain) was closely related to other T. annulata from Europe, Asia, as well as T. lestoquardi, T. parva and T. taurotragi all in one clade. Phylogenetic analyses based on small subunit ribosomal RNA gene suggested that the percent identity of the sequence of Iran vaccine strain was completely the same as Iraq sequence (100% identical), but the similarity of Iran vaccine strain with other T. annulata reported from China, Spain and Italy determined the 97.9 to 99.9% identity.
Cloning of an avilamycin biosynthetic gene cluster from Streptomyces viridochromogenes Tü57.

PubMed Central

Gaisser, S; Trefzer, A; Stockert, S; Kirschning, A; Bechthold, A

1997-01-01

A 65-kb region of DNA from Streptomyces viridochromogenes Tü57, containing genes encoding proteins involved in the biosynthesis of avilamycins, was isolated. The DNA sequence of a 6.4-kb fragment from this region revealed four open reading frames (ORF1 to ORF4), three of which are fully contained within the sequenced fragment. The deduced amino acid sequence of AviM, encoded by ORF2, shows 37% identity to a 6-methylsalicylic acid synthase from Penicillium patulum. Cultures of S. lividans TK24 and S. coelicolor CH999 containing plasmids with ORF2 on a 5.5-kb PstI fragment were able to produce orsellinic acid, an unreduced version of 6-methylsalicylic acid. The amino acid sequence encoded by ORF3 (AviD) is 62% identical to that of StrD, a dTDP-glucose synthase from S. griseus. The deduced amino acid sequence of AviE, encoded by ORF4, shows 55% identity to a dTDP-glucose dehydratase (StrE) from S. griseus. Gene insertional inactivation experiments of aviE abolished avilamycin production, indicating the involvement of aviE in the biosynthesis of avilamycins. PMID:9335272
Comparative genomic sequence analysis of novel Helicoverpa armigera nucleopolyhedrovirus (NPV) isolated from Kenya and three other previously sequenced Helicoverpa spp. NPVs.

PubMed

Ogembo, Javier Gordon; Caoili, Barbara L; Shikata, Masamitsu; Chaeychomsri, Sudawan; Kobayashi, Michihiro; Ikeda, Motoko

2009-10-01

A newly cloned Helicoverpa armigera nucleopolyhedrovirus (HearNPV) from Kenya, HearNPV-NNg1, has a higher insecticidal activity than HearNPV-G4, which also exhibits lower insecticidal activity than HearNPV-C1. In the search for genes and/or nucleotide sequences that might be involved in the observed virulence differences among Helicoverpa spp. NPVs, the entire genome of NNg1 was sequenced and compared with previously sequenced genomes of G4, C1 and Helicoverpa zea single-nucleocapsid NPV (Hz). The NNg1 genome was 132,425 bp in length, with a total of 143 putative open reading frames (ORFs), and shared high levels of overall amino acid and nucleotide sequence identities with G4, C1 and Hz. Three NNg1 ORFs, ORF5, ORF100 and ORF124, which were shared with C1, were absent in G4 and Hz, while NNg1 and C1 were missing a homologue of G4/Hz ORF5. Another three ORFs, ORF60 (bro-b), ORF119 and ORF120, and one direct repeat sequence (dr) were unique to NNg1. Relative to the overall nucleotide sequence identity, lower sequence identities were observed between NNg1 hrs and the homologous hrs in the other three Helicoverpa spp. NPVs, despite containing the same number of hrs located at essentially the same positions on the genomes. Differences were also observed between NNg1 and each of the other three Helicoverpa spp. NPVs in the diversity of bro genes encoded on the genomes. These results indicate several putative genes and nucleotide sequences that may be responsible for the virulence differences observed among Helicoverpa spp., yet the specific genes and/or nucleotide sequences responsible have not been identified.
Detection and molecular characterization of Babesia, Theileria, and Hepatozoon species in hard ticks collected from Kagoshima, the southern region in Japan.

PubMed

Masatani, Tatsunori; Hayashi, Kei; Andoh, Masako; Tateno, Morihiro; Endo, Yasuyuki; Asada, Masahito; Kusakisako, Kodai; Tanaka, Tetsuya; Gokuden, Mutsuyo; Hozumi, Nodoka; Nakadohzono, Fumiko; Matsuo, Tomohide

2017-06-01

To reveal the distribution of tick-borne parasites, we established a novel nested polymerase chain reaction (PCR) system to detect the most common agents of tick-borne parasitic diseases, namely Babesia, Theileria, and Hepatozoon parasites. We collected host-seeking or animal-feeding ticks in Kagoshima Prefecture, the southernmost region of Kyusyu Island in southwestern Japan. Twenty of the total of 776 tick samples displayed a specific band of the appropriate size (approximately 1.4-1.6kbp) for the 18S rRNA genes in the novel nested PCR (20/776: 2.58%). These PCR products have individual sequences of Babesia spp. (from 8 ticks), Theileria spp. (from 9 ticks: one tick sample including at least two Theileria spp. sequences), and Hepatozoon spp. (from 3 ticks). Phylogenetic analyses revealed that these sequences were close to those of undescribed Babesia spp. detected in feral raccoons in Japan (5 sequences; 3 sequences being identical), Babesia gibsoni-like parasites detected in pigs in China (3 sequences; all sequences being identical), Theileria spp. detected in sika deer in Japan and China (10 sequences; 2 sequences being identical), Hepatozoon canis (one sequence), and Hepatozoon spp. detected in Japanese martens in Japan (two sequences). In summary, we showed that various tick-borne parasites exist in Kagoshima, the southern region in Japan by using the novel nested PCR system. These including undescribed species such as Babesia gibsoni-like parasites previously detected in pigs in China. Importantly, our results revealed new combinations of ticks and protozoan parasites in southern Japan. The results of this study will aid in the recognition of potential parasitic animal diseases caused by tick-borne parasites. Copyright © 2017 Elsevier GmbH. All rights reserved.
Taxonomic evaluation of Streptomyces albus and related species using multilocus sequence analysis and proposals to emend the description of Streptomyces albus and describe Streptomyces pathocidini sp. nov.

PubMed Central

Doroghazi, J. R.; Ju, K.-S.; Metcalf, W. W.

2014-01-01

In phylogenetic analyses of the genus Streptomyces using 16S rRNA gene sequences, Streptomyces albus subsp. albus NRRL B-1811T forms a cluster with five other species having identical or nearly identical 16S rRNA gene sequences. Moreover, the morphological and physiological characteristics of these other species, including Streptomyces almquistii NRRL B-1685T, Streptomyces flocculus NRRL B-2465T, Streptomyces gibsonii NRRL B-1335T and Streptomyces rangoonensis NRRL B-12378T are quite similar. This cluster is of particular taxonomic interest because Streptomyces albus is the type species of the genus Streptomyces. The related strains were subjected to multilocus sequence analysis (MLSA) utilizing partial sequences of the housekeeping genes atpD, gyrB, recA, rpoB and trpB and confirmation of previously reported phenotypic characteristics. The five strains formed a coherent cluster supported by a 100 % bootstrap value in phylogenetic trees generated from sequence alignments prepared by concatenating the sequences of the housekeeping genes, and identical tree topology was observed using various different tree-making algorithms. Moreover, all but one strain, S. flocculus NRRL B-2465T, exhibited identical sequences for all of the five housekeeping gene loci sequenced, but NRRL B-2465T still exhibited an MLSA evolutionary distance of 0.005 from the other strains, a value that is lower than the 0.007 MLSA evolutionary distance threshold proposed for species-level relatedness. These data support a proposal to reclassify S. almquistii, S. flocculus, S. gibsonii and S. rangoonensis as later heterotypic synonyms of S. albus with NRRL B-1811T as the type strain. The MLSA sequence database also demonstrated utility for quickly and conclusively confirming that numerous strains within the ARS Culture Collection had been previously misidentified as subspecies of S. albus and that Streptomyces albus subsp. pathocidicus should be redescribed as a novel species, Streptomyces pathocidini sp. nov., with the type strain NRRL B-24287T. PMID:24277863
The advent of biosimilars: challenges and risks.

PubMed

Müller, Rüdiger; Renner, Christoph; Gabay, Cem; Cassata, Giuseppe; Lohri, Andreas; Hasler, Paul

2014-01-01

Biosimilars represent a new class of medicinal products that will have significant impact on clinical use. They are identical on an amino acid sequence level to existing reference biopharmaceutical products (originals). However, they may exhibit differences on a protein level. This paper provides a brief overview of biosimilar development and describes the risk and challenges that should be considered during the admission of biosimilars into the clinic.
Deep Sequencing Analysis of Apple Infecting Viruses in Korea

PubMed Central

Cho, In-Sook; Igori, Davaajargal; Lim, Seungmo; Choi, Gug-Seoun; Hammond, John; Lim, Hyoun-Sub; Moon, Jae Sun

2016-01-01

Deep sequencing has generated 52 contigs derived from five viruses; Apple chlorotic leaf spot virus (ACLSV), Apple stem grooving virus (ASGV), Apple stem pitting virus (ASPV), Apple green crinkle associated virus (AGCaV), and Apricot latent virus (ApLV) were identified from eight apple samples showing small leaves and/or growth retardation. Nucleotide (nt) sequence identity of the assembled contigs was from 68% to 99% compared to the reference sequences of the five respective viral genomes. Sequences of ASPV and ASGV were the most abundantly represented by the 52 contigs assembled. The presence of the five viruses in the samples was confirmed by RT-PCR using specific primers based on the sequences of each assembled contig. All five viruses were detected in three of the samples, whereas all samples had mixed infections with at least two viruses. The most frequently detected virus was ASPV, followed by ASGV, ApLV, ACLSV, and AGCaV which were withal found in mixed infections in the tested samples. AGCaV was identified in assembled contigs ID 1012480 and 93549, which showed 82% and 78% nt sequence identity with ORF1 of AGCaV isolate Aurora-1. ApLV was identified in three assembled contigs, ID 65587, 1802365, and 116777, which showed 77%, 78%, and 76% nt sequence identity respectively with ORF1 of ApLV isolate LA2. Deep sequencing assay was shown to be a valuable and powerful tool for detection and identification of known and unknown virome in infected apple trees, here identifying ApLV and AGCaV in commercial orchards in Korea for the first time. PMID:27721694
Culturable Rhodobacter and Shewanella species are abundant in estuarine turbidity maxima of the Columbia River

PubMed Central

Bräuer, S. L.; Adams, C.; Kranzler, K.; Murphy, D.; Xu, M.; Zuber, P.; Simon, H. M.; Baptista, A. M.; Tebo, B. M.

2017-01-01

Summary Measurements of dissolved, ascorbate-reducible and total Mn by ICP-OES revealed significantly higher concentrations during estuarine turbidity maxima (ETM) events, compared with non-events in the Columbia River. Most probable number (MPN) counts of Mn-oxidizing or Mn-reducing heterotrophs were not statistically different from that of other heterotrophs (103–104 cells ml−1) when grown in defined media, but counts of Mn oxidizers were significantly lower in nutrient-rich medium (13 cells ml−1). MPN counts of Mn oxidizers were also significantly lower on Mn(III)-pyrophosphate and glycerol (21 cells ml−1). Large numbers of Rhodobacter spp. were cultured from dilutions of 10−2 to 10−5, and many of these were capable of Mn(III) oxidation. Up to c. 30% of the colonies tested LBB positive, and all 77 of the successfully sequenced LBB positive colonies (of varying morphology) yielded sequences related to Rhodobacter spp. qPCR indicated that a cluster of Rhodobacter isolates and closely related strains (95–99% identity) represented approximately 1–3% of the total Bacteria, consistent with clone library results. Copy numbers of SSU rRNA genes for either Rhodobacter spp. or Bacteria were four to eightfold greater during ETM events compared with non-events. Strains of a Shewanella sp. were retrieved from the highest dilutions (10−5) of Mn reducers, and were also capable of Mn oxidation. The SSU rRNA gene sequences from these strains shared a high identity score (98%) with sequences obtained in clone libraries. Our results support previous findings that ETMs are zones with high microbial activity. Results indicated that Shewanella and Rhodobacter species were present in environmentally relevant concentrations, and further demonstrated that a large proportion of culturable bacteria, including Shewanella and Rhodobacter spp., were capable of Mn cycling in vitro. PMID:20977571
Sequence and RT-PCR expression analysis of two peroxidases from Arabidopsis thaliana belonging to a novel evolutionary branch of plant peroxidases.

PubMed

Kjaersgård, I V; Jespersen, H M; Rasmussen, S K; Welinder, K G

1997-03-01

cDNA clones encoding two new Arabidopsis thaliana peroxidases, ATP 1a and ATP 2a, have been identified by searching the Arabidopsis database of expressed sequence tags (dbEST). They represent a novel branch of hitherto uncharacterized plant peroxidases which is only 35% identical in amino acid sequence to the well characterized group of basic plant peroxidases represented by the horseradish (Armoracia rusticana) isoperoxidases HRP C, HRP E5 and the similar Arabidopsis isoperoxidases ATP Ca, ATP Cb, and ATP Ea. However ATP 1a is 87% identical in amino acid sequence to a peroxidase encoded by an mRNA isolated from cotton (Gossypium hirsutum). As cotton and Arabidopsis belong to rather diverse families (Malvaceae and Crucifereae, respectively), in contrast with Arabidopsis and horseradish (both Crucifereae), the high degree of sequence identity indicates that this novel type of peroxidase, albeit of unknown function, is likely to be widespread in plant species. The atp 1 and atp 2 types of cDNA sequences were the most redundant among the 28 different isoperoxidases identified among about 200 peroxidase encoding ESTs. Interestingly, 8 out of totally 38 EST sequences coding for ATP 1 showed three identical nucleotide substitutions. This variant form is designated ATP 1b. Similarly, six out of totally 16 EST sequences coding for ATP 2 showed a number of deletions and nucleotide changes. This variant form is designated ATP 2b. The selected EST clones are full-length and contain coding regions of 993 nucleotides for atp 1a, and 984 nucleotides for atp 2a. These regions show 61% DNA sequence identity. The predicted mature proteins ATP 1a, and ATP 2a are 57% identical in sequence and contain the structurally and functionally important residues, characteristic of the plant peroxidase superfamily. However, they do show two differences of importance to peroxidase catalysis: (1) the asparagine residue linked with the active site distal histidine via hydrogen bonding is absent; (2) an N-glycosylation site is located right at the entrance to the heme channel. The reverse transcriptase polymerase chain reaction (RT-PCR) was used to identify mRNAs coding for ATP 1a/b and ATP 2a/b in germinating seeds, seedlings, roots, leaves, stems, flowers and cell suspension culture using elongation factor 1alpha (EF-1alpha) for the first time as a positive control. Both mRNAs were transcribed at levels comparable to EF-1alpha in all plant tissues investigated which were more than two days old, and in cell suspension culture. In addition, the mRNA coding for ATP 1a/b was found in two day old germinating seeds. The abundant transcription of ATP 1a/b and ATP 2a/b is in line with their many entries in dbEST, and indicates essential roles for these novel peroxidases.
Complete genome sequence of the first human parechovirus type 3 isolated in Taiwan.

PubMed

Chang, Jenn-Tzong; Yang, Chih-Shiang; Chen, Bao-Chen; Chen, Yao-Shen; Chang, Tsung-Hsien

2017-11-01

The first human parechovirus 3 (HPeV3 VGHKS-2007) in Taiwan was identified from a clinical specimen from a male infant. The entire genome of the HPeV3 isolate was sequenced and compared to known HPeV3 sequences. Genome alignment data showed that HPeV3 VGHKS-2007 shares the highest nucleotide identity, 99%, with the Japanese strain of HPeV3 1361K-162589-Yamagata-2008. All HPeV3 isolates possess at least 97% amino acid identity. The analysis of the genome sequence of HPeV3 VGHKS-2007 will facilitate future investigations of the epidemiology and pathogenicity of HPeV3 infection. Copyright © 2017. Published by Elsevier Taiwan LLC.
Genetic variation in potential Giardia vaccine candidates cyst wall protein 2 and α1-giardin.

PubMed

Radunovic, Matej; Klotz, Christian; Saghaug, Christina Skår; Brattbakk, Hans-Richard; Aebischer, Toni; Langeland, Nina; Hanevik, Kurt

2017-08-01

Giardia is a prevalent intestinal parasitic infection. The trophozoite structural protein a1-giardin (a1-g) and the cyst protein cyst wall protein 2 (CWP2) have shown promise as Giardia vaccine antigen candidates in murine models. The present study assesses the genetic diversity of a1-g and CWP2 between and within assemblages A and B in human clinical isolates. a1-g and CWP2 sequences were acquired from 15 Norwegian isolates by PCR amplification and 20 sequences from German cultured isolates by whole genome sequencing. Sequences were aligned to reference genomes from assemblage A2 and B to identify genetic variance. Genetic diversity was found between assemblage A and B reference sequences for both a1-g (90.8% nucleotide identity) and CWP2 (82.5% nucleotide identity). However, for a1-g, this translated into only 3 amino acid (aa) substitutions, while for CWP2 there were 41 aa substitutions, and also one aa deletion. Genetic diversity within assemblage B was larger; nucleotide identity 92.0% for a1-g and 94.3% for CWP2, than within assemblage A (nucleotide identity 99.0% for a1-g and 99.7% for CWP2). For CWP2, the diversity on both nucleotide and protein level was higher in the C-terminal end. Predicted antigenic epitopes were not affected for a1-g, but partially for CWP2. Despite genetic diversity in a1-g, we found aa sequence, characteristics, and antigenicity to be well preserved. CWP2 showed more aa variance and potential antigenic differences. Several CWP2 antigens might be necessary in a future Giardia vaccine to provide cross protection against both Giardia assemblages infecting humans.
Complete nucleotide sequence and genome structure of a Japanese isolate of hibiscus latent Fort Pierce virus, a unique tobamovirus that contains an internal poly(A) region in its 3' end.

PubMed

Yoshida, Tetsuya; Kitazawa, Yugo; Komatsu, Ken; Neriya, Yutaro; Ishikawa, Kazuya; Fujita, Naoko; Hashimoto, Masayoshi; Maejima, Kensaku; Yamaji, Yasuyuki; Namba, Shigetou

2014-11-01

In this study, we detected a Japanese isolate of hibiscus latent Fort Pierce virus (HLFPV-J), a member of the genus Tobamovirus, in a hibiscus plant in Japan and determined the complete sequence and organization of its genome. HLFPV-J has four open reading frames (ORFs), each of which shares more than 98 % nucleotide sequence identity with those of other HLFPV isolates. Moreover, HLFPV-J contains a unique internal poly(A) region of variable length, ranging from 44 to 78 nucleotides, in its 3'-untranslated region (UTR), as is the case with hibiscus latent Singapore virus (HLSV), another hibiscus-infecting tobamovirus. The length of the HLFPV-J genome was 6431 nucleotides, including the shortest internal poly(A) region. The sequence identities of ORFs 1, 2, 3 and 4 of HLFPV-J to other tobamoviruses were 46.6-68.7, 49.9-70.8, 31.0-70.8 and 39.4-70.1 %, respectively, at the nucleotide level and 39.8-75.0, 43.6-77.8, 19.2-70.4 and 31.2-74.2 %, respectively, at the amino acid level. The 5'- and 3'-UTRs of HLFPV-J showed 24.3-58.6 and 13.0-79.8 % identity, respectively, to other tobamoviruses. In particular, when compared to other tobamoviruses, each ORF and UTR of HLFPV-J showed the highest sequence identity to those of HLSV. Phylogenetic analysis showed that HLFPV-J, other HLFPV isolates and HLSV constitute a malvaceous-plant-infecting tobamovirus cluster. These results indicate that the genomic structure of HLFPV-J has unique features similar to those of HLSV. To our knowledge, this is the first report of the complete genome sequence of HLFPV.

A perchlorate sensitive iodide transporter in frogs

PubMed Central

Carr, Deborah L.; Carr, James A.; Willis, Ray E.; Pressley, Thomas A.

2008-01-01

Nucleotide sequence comparisons have identified a gene product in the genome database of African clawed frogs (Xenopus laevis) as a probable member of the solute carrier family of membrane transporters. To confirm its identity as a putative iodide transporter, we examined the function of this sequence after heterologous expression in mammalian cells. A green monkey kidney cell line transfected with the Xenopus nucleotide sequence had significantly greater 125I uptake than sham-transfected control cells. The uptake in carrier-transfected cells was significantly inhibited in the presence of perchlorate, a competitive inhibitor of mammalian Na+/iodide symporter. Tissue distributions of the sequence were also consistent with a role in iodide uptake. The mRNA encoding the carrier was found to be expressed in the thyroid gland, stomach, and kidney of tadpoles from X. laevis, as well as the bullfrog Rana catesbeiana. The ovaries of adult X. laevis also were found to express the carrier. Phylogenetic analysis suggested that the putative X. laevis iodide transporter is orthologous to vertebrate Na+-dependent iodide symporters. We conclude that the amphibian sequence encodes a protein that is indeed a functional Na+/iodide symporter in Xenopus laevis, as well as Rana catesbeiana. PMID:18275962
Effect of multimedia information sequencing on educational outcome in orthodontic training.

PubMed

Aly, Medhat; Willems, Guy; Van Den Noortgate, Wim; Elen, Jan

2012-08-01

The aim of this research was to compare the effectiveness of hierarchical sequencing (HS) versus elaboration sequencing (ES) models in improving educational outcome of clinical knowledge when using instructional multimedia programs in postgraduate orthodontic training. Twenty-four postgraduate and 24 undergraduate dental students participated in this study. The postgraduates were following an orthodontic speciality training programme. The undergraduates were fourth- and fifth-year dental students. Twelve instructional multimedia modules were developed, six logically sequenced (LS) discussing six different orthodontic topics. Another six modules on identical topics were sequenced according to one macro-sequencing (MS) model. The implemented MS model was either HS or ES. The only difference between LS and MS modules was the adopted sequencing model. All participants were assigned into consistent pairs of students and were randomly divided into a test and a control group. In each pair, one student studied the LS module (control group) while the other studied the MS version (test group). Pre- and post-evaluation tests of each pair of participants were performed to measure knowledge, understanding and application of each participant with regard to the discussed topic. A multilevel analysis was conducted to assess the estimated effect of the different sequencing models. The level of significance was set at 0.05. At baseline, no significant differences (P > 0.05) were found in pre-test scores between groups. The HS model showed a significant effect on the scores achieved (P = 0.05). The test group showed a significantly higher estimated probability of correct answers to the questions (P = 0.003) when applying the HS model. The HS model may improve educational outcome when using instructional multimedia programs in postgraduate orthodontic training.
Molecular characterization of two prunus necrotic ringspot virus isolates from Canada.

PubMed

Cui, Hongguang; Hong, Ni; Wang, Guoping; Wang, Aiming

2012-05-01

We determined the entire RNA1, 2 and 3 sequences of two prunus necrotic ringspot virus (PNRSV) isolates, Chr3 from cherry and Pch12 from peach, obtained from an orchard in the Niagara Fruit Belt, Canada. The RNA1, 2 and 3 of the two isolates share nucleotide sequence identities of 98.6%, 98.4% and 94.5%, respectively. Their RNA1- and 2-encoded amino acid sequences are about 98% identical to the corresponding sequences of a cherry isolate, CH57, the only other PNRSV isolate with complete RNA1 and 2 sequences available. Phylogenetic analysis of the coat protein and movement protein encoded by RNA3 of Pch12 and Chr3 and published PNRSV isolates indicated that Chr3 belongs to the PV96 group and Pch12 belongs to the PV32 group.
Thermal adaptation analyzed by comparison of protein sequences from mesophilic and extremely thermophilic Methanococcus species

NASA Technical Reports Server (NTRS)

Haney, P. J.; Badger, J. H.; Buldak, G. L.; Reich, C. I.; Woese, C. R.; Olsen, G. J.

1999-01-01

The genome sequence of the extremely thermophilic archaeon Methanococcus jannaschii provides a wealth of data on proteins from a thermophile. In this paper, sequences of 115 proteins from M. jannaschii are compared with their homologs from mesophilic Methanococcus species. Although the growth temperatures of the mesophiles are about 50 degrees C below that of M. jannaschii, their genomic G+C contents are nearly identical. The properties most correlated with the proteins of the thermophile include higher residue volume, higher residue hydrophobicity, more charged amino acids (especially Glu, Arg, and Lys), and fewer uncharged polar residues (Ser, Thr, Asn, and Gln). These are recurring themes, with all trends applying to 83-92% of the proteins for which complete sequences were available. Nearly all of the amino acid replacements most significantly correlated with the temperature change are the same relatively conservative changes observed in all proteins, but in the case of the mesophile/thermophile comparison there is a directional bias. We identify 26 specific pairs of amino acids with a statistically significant (P < 0.01) preferred direction of replacement.
Do FY antigens act as minor histocompatibility antigens in the graft-versus-host disease paradigm after human leukocyte antigen-identical sibling hematopoietic stem cell transplantation?

PubMed

Sellami, Mohamed Hichem; Chaabane, Manel; Kaabi, Houda; Torjemane, Lamia; Ladeb, Saloua; Ben Othmane, Tarek; Hmida, Slama

2012-03-01

FY antigens are candidate minor histocompatibility antigens relevant to renal allograft rejection, but no data have been reported about their role in graft-versus-host disease (GVHD) incidence after human leukocyte antigen (HLA)-identical siblings hematopoietic stem cell transplantation (HSCT). The aim of this study was to examine the effect of donor/recipient disparity at FY antigens on the incidence of GVHD in Tunisian patients receiving an HLA-identical HSCT. This work enrolled 105 Tunisian pairs of recipients and their HLA-identical sibling donors of HSCs. FY genotyping was performed with the polymerase chain reaction-sequence-specific primer method and donor/recipient disparity for these antigens was analyzed at two levels: incompatibility and nonidentity. The case-control analyses showed no significant correlation between FY disparity and the incidence of either acute or chronic GVHD. Sample size calculation showed that 572 cases and 1716 controls would be necessary to be able to detect a significant association with 80% power and two-sided type I error level of 5% (α=0.05). The lack of association in the studied cohort may be explained by the low immunogenicity of FY antigens in HSCT context, compared with other antigens such as HA-1 and CD31.
FragIdent--automatic identification and characterisation of cDNA-fragments.

PubMed

Seelow, Dominik; Goehler, Heike; Hoffmann, Katrin

2009-03-02

Many genetic studies and functional assays are based on cDNA fragments. After the generation of cDNA fragments from an mRNA sample, their content is at first unknown and must be assigned by sequencing reactions or hybridisation experiments. Even in characterised libraries, a considerable number of clones are wrongly annotated. Furthermore, mix-ups can happen in the laboratory. It is therefore essential to the relevance of experimental results to confirm or determine the identity of the employed cDNA fragments. However, the manual approach for the characterisation of these fragments using BLAST web interfaces is not suited for larger number of sequences and so far, no user-friendly software is publicly available. Here we present the development of FragIdent, an application for the automatic identification of open reading frames (ORFs) within cDNA-fragments. The software performs BLAST analyses to identify the genes represented by the sequences and suggests primers to complete the sequencing of the whole insert. Gene-specific information as well as the protein domains encoded by the cDNA fragment are retrieved from Internet-based databases and included in the output. The application features an intuitive graphical interface and is designed for researchers without any bioinformatics skills. It is suited for projects comprising up to several hundred different clones. We used FragIdent to identify 84 cDNA clones from a yeast two-hybrid experiment. Furthermore, we identified 131 protein domains within our analysed clones. The source code is freely available from our homepage at http://compbio.charite.de/genetik/FragIdent/.
Complete sequence and diversity of a maize-associated Polerovirus in East Africa.

PubMed

Massawe, Deogracious P; Stewart, Lucy R; Kamatenesi, Jovia; Asiimwe, Theodore; Redinbaugh, Margaret G

2018-06-01

Since 2011-2012, Maize lethal necrosis (MLN) has emerged in East Africa, causing massive yield loss and propelling research to identify viruses and virus populations present in maize. As expected, next generation sequencing (NGS) has revealed diverse and abundant viruses from the family Potyviridae, primarily sugarcane mosaic virus (SCMV), and maize chlorotic mottle virus (MCMV) (Tombusviridae), which are known to cause MLN by synergistic co-infection. In addition to these expected viruses, we identified a virus in the genus Polerovirus (family Luteoviridae) in 104/172 samples selected for MLN or other potential virus symptoms from Kenya, Uganda, Rwanda, and Tanzania. This polerovirus (MF974579) nucleotide sequence is 97% identical to maize-associated viruses recently reported in China, termed 'maize yellow mosaic virus' (MaYMV) and maize yellow dwarf virus (MaYMV; KU291101, KU291107, MYDV-RMV2; KT992824); and 99% identical to MaYMV (KY684356) infecting sugarcane and itch grass in Nigeria; 83% identical to a barley-associated polerovirus recently identified in Korea (BVG; KT962089); and 79% identical to the U.S. maize-infecting polerovirus maize yellow dwarf virus (MYDV-RMV; KT992824). Nucleotide sequences from ORF0 of 20 individual East African isolates collected from Kenya, Uganda, Rwanda, and Tanzania shared 98% or higher identity, and were detected in 104/172 (60.5%) of samples collected for virus-like symptoms, indicating extensive prevalence but limited diversity of this virus in East Africa. We refer to this virus as "MYDV-like polerovirus" until symptoms of the virus in maize are known.
Isolation and sequence analysis of the Pseudomonas syringae pv. tomato gene encoding a 2,3-diphosphoglycerate-independent phosphoglyceromutase.

PubMed

Morris, V L; Jackson, D P; Grattan, M; Ainsworth, T; Cuppels, D A

1995-04-01

Pseudomonas syringae pv. tomato DC3481, a Tn5-induced mutant of the tomato pathogen DC3000, cannot grow and elicit disease symptoms on tomato seedlings. It also cannot grow on minimal medium containing malate, citrate, or succinate, three of the major organic acids found in tomatoes. We report here that this mutant also cannot use, as a sole carbon and/or energy source, a wide variety of hexoses and intermediates of hexose catabolism. Uptake studies have shown that DC3481 is not deficient in transport. A 3.8-kb EcoRI fragment of DC3000 DNA, which complements the Tn5 mutation, has been cloned and sequenced. The deduced amino acid sequences of two of the three open reading frames (ORFs) present on this fragment, ORF2 and ORF3, had no significant homology with sequences in the GenBank databases. However, the 510-amino-acid sequence of ORF1, the site of the Tn5 insertion, strongly resembled the deduced amino acid sequences of the Bacillus subtilis and Zea mays genes encoding 2,3-diphosphoglycerate (DPG)-independent phosphoglyceromutase (PGM) (52% identity and 72% similarity and 37% identity and 57% similarity, respectively). PGMs not requiring the cofactor DPG are usually found in plants and algae. Enzyme assays confirmed that P. syringae PGM activity required an intact ORF1. Not only is DC3481 the first PGM-deficient pseudomonad mutant to be described, but the P. syringae pgm gene is the first gram-negative bacterial gene identified that appears to code for a DPG-independent PGM. PGM activity appears essential for the growth and pathogenicity of P. syringae pv. tomato on its host plant.
The glycoprotein genes and gene junctions of the fish rhabdoviruses spring viremia of carp virus and hirame rhabdovirus: Analysis of relationships with other rhabdoviruses

USGS Publications Warehouse

Bjorklund, H.V.; Higman, K.H.; Kurath, G.

1996-01-01

The nucleotide sequences of the glycoprotein genes and all of the internal gene junctions of the fish pathogenic rhabdoviruses spring viremia of carp virus (SVCV) and hirame rhabdovirus (HIRRV) have been determined from cDNA clones generated from viral genomic RNA. The SVCV glycoprotein gene sequence is 1588 nucleotides (nt) long and encodes a 509 amino acid (aa) protein. The HIRRV glycoprotein gene sequence comprises 1612 nt, coding for a 508 aa protein. In sequence comparisons of 15 rhabdovirus glycoproteins, the SVCV glycoprotein gene showed the highest amino acid sequence identity (31.2–33.2%) with vesicular stomatitis New Jersey virus (VSNJV), Chandipura virus (CHPV) and vesicular stomatitis Indiana virus (VSIV). The HIRRV glycoprotein gene showed a very high amino acid sequence identity (74.3%) with the glycoprotein gene of another fish pathogenic rhabdovirus, infectious hematopoietic necrosis virus (IHNV), but no significant similarity with glycoproteins of VSIV or rabies virus (RABV). In phylogenetic analyses SVCV was grouped consistently with VSIV, VSNJV and CHPV in the Vesiculovirus genus of Rhabdoviridae. The fish rhabdoviruses HIRRV, IHNV and viral hemorrhagic septicemia virus (VHSV) showed close relationships with each other, but only very distant relationships with mammalian rhabdoviruses. The gene junctions are highly conserved between SVCV and VSIV, well conserved between IHNV and HIRRV, but not conserved between HIRRV/IHNV and RABV. Based on the combined results we suggest that the fish lyssa-type rhabdoviruses HIRRV, IHNV and VHSV may be grouped in their own genus within the family Rhabdoviridae. Aquarhabdovirus has been proposed for the name of this new genus.
The glycoprotein genes and gene junctions of the fish rhabdoviruses spring viremia of carp virus and hirame rhabdovirus: Analysis of relationships with other rhabdoviruses

USGS Publications Warehouse

Bjorklund, H.V.; Higman, K.H.; Kurath, G.

1996-01-01

The nucleotide sequences of the glycoprotein genes and all of the internal gene junctions of the fish pathogenic rhabdoviruses spring viremia of carp virus (SVCV) and hirame rhabdovirus (HIRRV) have been determined from cDNA clones generated from viral genomic RNA. The SVCV glycoprotein gene sequence is 1588 nucleotides (nt) long and encodes a 509 amino acid (aa) protein. The HIRRV glycoprotein gene sequence comprises 1612 nt, coding for a 508 aa protein. In sequence comparisons of 15 rhabdovirus glycoproteins, the SVCV glycoprotein gene showed the highest amino acid sequence identity (31.2-33.2%) with vesicular stomatitis New Jersey virus (VSNJV), Chandipura virus (CHPV) and vesicular stomatitis Indiana virus (VSIV). The HIRRV glycoprotein gene showed a very high amino acid sequence identity (74.3%) with the glycoprotein gene of another fish pathogenic rhabdovirus, infectious hematopoietic necrosis virus (IHNV), but no significant similarity with glycoproteins of VSIV or rabies virus (RABV). In phylogenetic analyses SVCV was grouped consistently with VSIV, VSNJV and CHPV in the Vesiculovirus genus of Rhabdoviridae. The fish rhabdoviruses HIRRV, IHNV and viral hemorrhagic septicemia virus (VHSV) showed close relationships with each other, but only very distant relationships with mammalian rhabdoviruses. The gene junctions are highly conserved between SVCV and VSIV, well conserved between IHNV and HIRRV, but not conserved between HIRRV/IHNV and RABV. Based on the combined results we suggest that the fish lyssa-type rhabdoviruses HIRRV, IHNV and VHSV may be grouped in their own genus within the family Rhabdoviridae. Aquarhabdovirus has been proposed for the name of this new genus.
Isolation and sequence analysis of the Pseudomonas syringae pv. tomato gene encoding a 2,3-diphosphoglycerate-independent phosphoglyceromutase.

PubMed Central

Morris, V L; Jackson, D P; Grattan, M; Ainsworth, T; Cuppels, D A

1995-01-01

Pseudomonas syringae pv. tomato DC3481, a Tn5-induced mutant of the tomato pathogen DC3000, cannot grow and elicit disease symptoms on tomato seedlings. It also cannot grow on minimal medium containing malate, citrate, or succinate, three of the major organic acids found in tomatoes. We report here that this mutant also cannot use, as a sole carbon and/or energy source, a wide variety of hexoses and intermediates of hexose catabolism. Uptake studies have shown that DC3481 is not deficient in transport. A 3.8-kb EcoRI fragment of DC3000 DNA, which complements the Tn5 mutation, has been cloned and sequenced. The deduced amino acid sequences of two of the three open reading frames (ORFs) present on this fragment, ORF2 and ORF3, had no significant homology with sequences in the GenBank databases. However, the 510-amino-acid sequence of ORF1, the site of the Tn5 insertion, strongly resembled the deduced amino acid sequences of the Bacillus subtilis and Zea mays genes encoding 2,3-diphosphoglycerate (DPG)-independent phosphoglyceromutase (PGM) (52% identity and 72% similarity and 37% identity and 57% similarity, respectively). PGMs not requiring the cofactor DPG are usually found in plants and algae. Enzyme assays confirmed that P. syringae PGM activity required an intact ORF1. Not only is DC3481 the first PGM-deficient pseudomonad mutant to be described, but the P. syringae pgm gene is the first gram-negative bacterial gene identified that appears to code for a DPG-independent PGM. PGM activity appears essential for the growth and pathogenicity of P. syringae pv. tomato on its host plant. PMID:7896694
Identification of a novel vitivirus from grapevines in New Zealand.

PubMed

Blouin, Arnaud G; Keenan, Sandi; Napier, Kathryn R; Barrero, Roberto A; MacDiarmid, Robin M

2018-01-01

We report a sequence of a novel vitivirus from Vitis vinifera obtained using two high-throughput sequencing (HTS) strategies on RNA. The initial discovery from small-RNA sequencing was confirmed by HTS of the total RNA and Sanger sequencing. The new virus has a genome structure similar to the one reported for other vitiviruses, with five open reading frames (ORFs) coding for the conserved domains described for members of that genus. Phylogenetic analysis of the complete genome sequence confirmed its affiliation to the genus Vitivirus, with the closest described viruses being grapevine virus E (GVE) and Agave tequilana leaf virus (ATLV). However, the virus we report is distinct and shares only 51% amino acid sequence identity with GVE in the replicase polyprotein and 66.8% amino acid sequence identity with ATLV in the coat protein. This is well below the threshold determined by the ICTV for species demarcation, and we propose that this virus represents a new species. It is provisionally named "grapevine virus G".
Sequences Associated with Centromere Competency in the Human Genome

PubMed Central

Hayden, Karen E.; Strome, Erin D.; Merrett, Stephanie L.; Lee, Hye-Ran; Rudd, M. Katharine

2013-01-01

Centromeres, the sites of spindle attachment during mitosis and meiosis, are located in specific positions in the human genome, normally coincident with diverse subsets of alpha satellite DNA. While there is strong evidence supporting the association of some subfamilies of alpha satellite with centromere function, the basis for establishing whether a given alpha satellite sequence is or is not designated a functional centromere is unknown, and attempts to understand the role of particular sequence features in establishing centromere identity have been limited by the near identity and repetitive nature of satellite sequences. Utilizing a broadly applicable experimental approach to test sequence competency for centromere specification, we have carried out a genomic and epigenetic functional analysis of endogenous human centromere sequences available in the current human genome assembly. The data support a model in which functionally competent sequences confer an opportunity for centromere specification, integrating genomic and epigenetic signals and promoting the concept of context-dependent centromere inheritance. PMID:23230266
Complete genome sequence of Southern tomato virus naturally infecting tomatoes in Bangladesh using small RNA deep sequencing

USDA-ARS?s Scientific Manuscript database

The complete genome sequence of a Southern tomato virus (STV) isolate on tomato plants in a seed production field in Bangladesh was obtained for the first time using next generation sequencing. The identified isolate STV_BD-13 shares high degree of sequence identity (99%) with several known STV isol...
Complete genome sequence of southern tomato virus identified from China using next generation sequencing

USDA-ARS?s Scientific Manuscript database

Complete genome sequence of a double-stranded RNA (dsRNA) virus, southern tomato virus (STV), on tomatoes in China, was elucidated using small RNAs deep sequencing. The identified STV_CN12 shares 99% sequence identity to other isolates from Mexico, France, Spain, and U.S. This is the first report ...
DNA sequence analysis, expression, distribution, and physiological role of the Xaa-prolyldipeptidyl aminopeptidase gene from Lactobacillus helveticus CNRZ32.

PubMed

Yüksel, G U; Steele, J L

1996-02-01

Lactobacillus helveticus CNRZ32 possesses an Xaa-prolyldipeptidyl aminopeptidase (PepX), which releases amino-terminal dipeptides from peptides containing proline residues in the penultimate position. The PepX gene, designated pepX, from Lb. helveticus CNRZ32 was sequenced. Analysis of the sequence identified a putative 2379-bp pepX open-reading frame, which encodes a polypeptide of 793 amino acid residues with a deduced molecular mass of 88,111 Da. The gene shows significant sequence identity with sequenced pepX genes from lactic acid bacteria. The product of the gene contains a motif that is almost identical with the active-site motif of the serine-dependent PepX from lactococci. The introduction of pepX into Lactococcus lactis LM0230 on either pGK12 (a low-copy-number plasmid vector) or pIL253 (a high-copy-number plasmid vector) did not result in a significant increase in PepX activity, while the introduction of pepX into CNRZ32 on pGK12 resulted in a four-fold increase in PepX activity. Southern hybridization experiments revealed that the pepX gene from CNRZ32 is well conserved in lactobacilli, pediococci and streptococci. The physiological role of PepX during growth in lactobacillus MRS (a rich medium containing protein hydrolysates along with other ingredients) and milk was examined by comparing growth of CNRZ32 and a CNRZ32 PepX-negative derivative. No difference in growth rate or acid production was observed between CNRZ32 and its PepX-negative derivative in MRS. However, the CNRZ32 PepX-negative derivative grew in milk at a reduced specific growth rate when compared to wild-type CNRZ32. Introduction of the cloned PepX determinant into the CNRZ32 PepX-negative derivative resulted in a construct with a specific growth rate similar to that of wild-type CNRZ32.
Candida mesorugosa sp. nov., a novel yeast species similar to Candida rugosa, isolated from a tertiary hospital in Brazil.

PubMed

Chaves, Guilherme M; Terçarioli, Gisela R; Padovan, Ana Carolina B; Rosas, Robert C; Ferreira, Renata C; Melo, Analy S A; Colombo, Arnaldo L

2013-04-01

Candida rugosa is a yeast species that is emerging as a causative agent of invasive infection, particularly in Latin America. Recently, C. pseudorugosa was proposed as a new species closely related to C. rugosa. We evaluated in this investigation the genetic heterogeneity within the C. rugosa species complex. All clinical isolates used in this study were identified phenotypically as C. rugosa but were genotypically different from the C. rugosa type, ATCC 10571. RAPD marker analysis revealed less than 83% similarity between our clinical isolates and the C. rugosa type strain. The D1/D2 region sequences of our clinical isolates showed 98% identity with C. rugosa but only 94-95% identity with C. pseudorugosa. The ITS rDNA sequences of the Brazilian isolates showed 91% identity with the C. rugosa ATCC 10571 ITS sequence. Network and Bayesian analyses of ITS and housekeeping gene sequences separated our clinical isolates into different branches from C. rugosa type strain. These differences are sufficient to reassign our isolates to a distinct species, named C. mesorugosa.
Emergence of patterns in random processes

NASA Astrophysics Data System (ADS)

Newman, William I.; Turcotte, Donald L.; Malamud, Bruce D.

2012-08-01

Sixty years ago, it was observed that any independent and identically distributed (i.i.d.) random variable would produce a pattern of peak-to-peak sequences with, on average, three events per sequence. This outcome was employed to show that randomness could yield, as a null hypothesis for animal populations, an explanation for their apparent 3-year cycles. We show how we can explicitly obtain a universal distribution of the lengths of peak-to-peak sequences in time series and that this can be employed for long data sets as a test of their i.i.d. character. We illustrate the validity of our analysis utilizing the peak-to-peak statistics of a Gaussian white noise. We also consider the nearest-neighbor cluster statistics of point processes in time. If the time intervals are random, we show that cluster size statistics are identical to the peak-to-peak sequence statistics of time series. In order to study the influence of correlations in a time series, we determine the peak-to-peak sequence statistics for the Langevin equation of kinetic theory leading to Brownian motion. To test our methodology, we consider a variety of applications. Using a global catalog of earthquakes, we obtain the peak-to-peak statistics of earthquake magnitudes and the nearest neighbor interoccurrence time statistics. In both cases, we find good agreement with the i.i.d. theory. We also consider the interval statistics of the Old Faithful geyser in Yellowstone National Park. In this case, we find a significant deviation from the i.i.d. theory which we attribute to antipersistence. We consider the interval statistics using the AL index of geomagnetic substorms. We again find a significant deviation from i.i.d. behavior that we attribute to mild persistence. Finally, we examine the behavior of Standard and Poor's 500 stock index's daily returns from 1928-2011 and show that, while it is close to being i.i.d., there is, again, significant persistence. We expect that there will be many other applications of our methodology both to interoccurrence statistics and to time series.
Analysis of the 16S–23S rRNA Gene Internal Transcribed Spacer Region in Klebsiella Species▿

PubMed Central

Wang, Min; Cao, Boyang; Yu, Qunfang; Liu, Lei; Gao, Qili; Wang, Lei; Feng, Lu

2008-01-01

The 16S-23S rRNA gene internal transcribed spacer (ITS) regions of Klebsiella spp., including Klebsiella pneumoniae subsp. pneumoniae, Klebsiella pneumoniae subsp. ozaenae, Klebsiella pneumoniae subsp. rhinoscleromatis, Klebsiella oxytoca, Klebsiella planticola, Klebsiella terrigena, and Klebsiella ornithinolytica, were characterized, and the feasibility of using ITS sequences to discriminate Klebsiella species and subspecies was explored. A total of 336 ITS sequences from 21 representative strains and 11 clinical isolates of Klebsiella were sequenced and analyzed. Three distinct ITS types—ITSnone (without tRNA genes), ITSglu [with a tRNAGlu (UUC) gene], and ITSile+ala [with tRNAIle (GAU) and tRNAAla (UGC) genes]—were detected in all species except for K. pneumoniae subsp. rhinoscleromatis, which has only ITSglu and ITSile+ala. The presence of ITSnone in Enterobacteriaceae had never been reported before. Both the length and the sequence of each ITS type are highly conserved within the species, with identity levels from 0.961 to 1.000 for ITSnone, from 0.967 to 1.000 for ITSglu, and from 0.968 to 1.000 for ITSile+ala. Interspecies sequence identities range from 0.775 to 0.989 for ITSnone, from 0.798 to 0.997 for ITSglu, and from 0.712 to 0.985 for ITSile+ala. Regions with significant interspecies variations but low intraspecies polymorphisms were identified; these may be targeted in the design of probes for the identification of Klebsiella to the species level. Phylogenetic analysis based on ITS regions reveals the relationships among Klebsiella species similarly to that based on 16S rRNA genes. PMID:18753345
Genetic characterization of Strongyloides spp. from captive, semi-captive and wild Bornean orangutans (Pongo pygmaeus) in Central and East Kalimantan, Borneo, Indonesia.

PubMed

Labes, E M; Nurcahyo, W; Wijayanti, N; Deplazes, P; Mathis, A

2011-09-01

Orangutans (Pongo spp.), Asia's only great apes, are threatened in their survival due to habitat loss, hunting and infections. Nematodes of the genus Strongyloides may represent a severe cause of death in wild and captive individuals. In order to better understand which Strongyloides species/subspecies infect orangutans under different conditions, larvae were isolated from fecal material collected in Indonesia from 9 captive, 2 semi-captive and 9 wild individuals, 18 captive groups of Bornean orangutans and from 1 human working with wild orangutans. Genotyping was done at the genomic rDNA locus (part of the 18S rRNA gene and internal transcribed spacer 1, ITS1) by sequencing amplicons. Thirty isolates, including the one from the human, could be identified as S. fuelleborni fuelleborni with 18S rRNA gene identities of 98·5-100%, with a corresponding published sequence. The ITS1 sequences could be determined for 17 of these isolates revealing a huge variability and 2 main clusters without obvious pattern with regard to attributes of the hosts. The ITS1 amplicons of 2 isolates were cloned and sequenced, revealing considerable variability indicative of mixed infections. One isolate from a captive individual was identified as S. stercoralis (18S rRNA) and showed 99% identity (ITS1) with S. stercoralis sequences from geographically distinct locations and host species. The findings are significant with regard to the zoonotic nature of these parasites and might contribute to the conservation of remaining orangutan populations.

How Much Do rRNA Gene Surveys Underestimate Extant Bacterial Diversity?

PubMed

Rodriguez-R, Luis M; Castro, Juan C; Kyrpides, Nikos C; Cole, James R; Tiedje, James M; Konstantinidis, Konstantinos T

2018-03-15

The most common practice in studying and cataloguing prokaryotic diversity involves the grouping of sequences into operational taxonomic units (OTUs) at the 97% 16S rRNA gene sequence identity level, often using partial gene sequences, such as PCR-generated amplicons. Due to the high sequence conservation of rRNA genes, organisms belonging to closely related yet distinct species may be grouped under the same OTU. However, it remains unclear how much diversity has been underestimated by this practice. To address this question, we compared the OTUs of genomes defined at the 97% or 98.5% 16S rRNA gene identity level against OTUs of the same genomes defined at the 95% whole-genome average nucleotide identity (ANI), which is a much more accurate proxy for species. Our results show that OTUs resulting from a 98.5% 16S rRNA gene identity cutoff are more accurate than 97% compared to 95% ANI (90.5% versus 89.9% accuracy) but indistinguishable from any other threshold in the 98.29 to 98.78% range. Even with the more stringent thresholds, however, the 16S rRNA gene-based approach commonly underestimates the number of OTUs by ∼12%, on average, compared to the ANI-based approach (∼14% underestimation when using the 97% identity threshold). More importantly, the degree of underestimation can become 50% or more for certain taxa, such as the genera Pseudomonas , Burkholderia , Escherichia , Campylobacter , and Citrobacter These results provide a quantitative view of the degree of underestimation of extant prokaryotic diversity by 16S rRNA gene-defined OTUs and suggest that genomic resolution is often necessary. IMPORTANCE Species diversity is one of the most fundamental pieces of information for community ecology and conservational biology. Therefore, employing accurate proxies for what a species or the unit of diversity is are cornerstones for a large set of microbial ecology and diversity studies. The most common proxies currently used rely on the clustering of 16S rRNA gene sequences at some threshold of nucleotide identity, typically 97% or 98.5%. Here, we explore how well this strategy reflects the more accurate whole-genome-based proxies and determine the frequency with which the high conservation of 16S rRNA sequences masks substantial species-level diversity. Copyright © 2018 American Society for Microbiology.
Thermophilic cellobiohydrolase

DOEpatents

Sapra, Rajat; Park, Joshua I.; Datta, Supratim; Simmons, Blake A.

2017-04-18

The present invention provides for a composition comprising a polypeptide comprising a first amino acid sequence having at least 70% identity with the amino acid sequence of Csac GH5 wherein said first amino acid sequence has a thermostable or thermophilic cellobiohydrolase (CBH) or exoglucanase activity.
Detection and characterization of viruses of the genus Megalocytivirus in ornamental fish imported into an Australian border quarantine premises: an emerging risk to national biosecurity.

PubMed

Nolan, D; Stephens, F; Crockford, M; Jones, J B; Snow, M

2015-02-01

This report documents an emerging trend of identification of Megalocytivirus-like inclusions in a range of ornamental fish species intercepted during quarantine detention at the Australian border. From September 2012 to February 2013, 5 species of fish that had suffered mortality levels in excess of 25% whilst in the post-entry quarantine and had Megalocytivirus-like inclusion bodies in histological sections were examined by PCR. The fish had been imported from Singapore, Malaysia and Sri Lanka. Ninety-seven of 111 individual fish from affected tanks of fish tested were positive for the presence of Megalocytivirus by PCR. Sequence analysis of representative PCR products revealed an identical sequence of 621 bp in all cases which was identical to a previously characterized Megalocytivirus (Sabah/RAA1/2012 strain BMGIV48). Phylogenetic analysis of available Megalocytivirus major capsid protein (MCP) sequences confirmed the existence of 3 major clades of Megalocytivirus. The virus detected in this study was identified as a member of Genotype II. The broad host range and pathogenicity of megalocytiviruses, coupled to the documented spread of ornamental fish into the environment, render this a significant and emerging biosecurity threat to Australia. © 2014 John Wiley & Sons Ltd.
Improvement and Optimization of Two Engineered Phage Resistance Mechanisms in Lactococcus lactis

PubMed Central

McGrath, Stephen; Fitzgerald, Gerald F.; van Sinderen, Douwe

2001-01-01

Homologous replication module genes were identified for four P335 type phages. DNA sequence analysis revealed that all four phages exhibited more than 90% DNA homology for at least two genes, designated rep2009 and orf17. One of these genes, rep2009, codes for a putative replisome organizer protein and contains an assumed origin of phage DNA replication (ori2009), which was identical for all four phages. DNA fragments representing the ori2009 sequence confer a phage-encoded resistance (Per) phenotype on lactococcal hosts when they are supplied on a high-copy-number vector. Furthermore, cloning multiple copies of the ori2009 sequence was found to increase the effectiveness of the Per phenotype conferred. A number of antisense plasmids targeting specific genes of the replication module were constructed. Two separate plasmids targeting rep2009 and orf17 were found to efficiently inhibit proliferation of all four phages by interfering with intracellular phage DNA replication. These results represent two highly effective strategies for inhibiting bacteriophage proliferation, and they also identify a novel gene, orf17, which appears to be important for phage DNA replication. Furthermore, these results indicate that although the actual mechanisms of DNA replication are very similar, if not identical, for all four phages, expression of the replication genes is significantly different in each case. PMID:11157223
Characterization by Deep Sequencing of Prunus virus T, a Novel Tepovirus Infecting Prunus Species.

PubMed

Marais, Armelle; Faure, Chantal; Mustafayev, Eldar; Barone, Maria; Alioto, Daniela; Candresse, Thierry

2015-01-01

Double-stranded RNAs purified from a cherry tree collected in Italy and a plum tree collected in Azerbaijan were submitted to deep sequencing. Contigs showing weak but significant identity with various members of the family Betaflexiviridae were reconstructed. Sequence comparisons led to the conclusion that the viral isolates identified in the analyzed Prunus plants belong to the same viral species. Their genome organization is similar to that of some members of the family Betaflexiviridae, with three overlapping open reading frames (RNA polymerase, movement protein, and capsid protein). Phylogenetic analyses of the deduced encoded proteins showed a clustering with the sole member of the genus Tepovirus, Potato virus T (PVT). Given these results, the name Prunus virus T (PrVT) is proposed for the new virus. It should be considered as a new member of the genus Tepovirus, even if the level of nucleotide identity with PVT is borderline with the genus demarcation criteria for the family Betaflexiviridae. A reverse-transcription polymerase chain reaction detection assay was developed and allowed the identification of two other PrVT isolates and an estimate of 1% prevalence in the large Prunus collection screened. Due to the mixed infection status of all hosts identified to date, it was not possible to correlate the presence of PrVT with specific symptoms.
Characterization of CoPK02, a Ca2+/calmodulin-dependent protein kinase in mushroom Coprinopsis cinerea.

PubMed

Yamashita, Masashi; Sueyoshi, Noriyuki; Yamada, Hiroki; Katayama, Syouichi; Senga, Yukako; Takenaka, Yasuhiro; Ishida, Atsuhiko; Kameshita, Isamu; Shigeri, Yasushi

2018-04-20

We surveyed genome sequences from the basidiomycetous mushroom Coprinopsis cinerea and isolated a cDNA homologous to CMKA, a calmodulin-dependent protein kinase (CaMK) in Aspergillus nidulans. We designated this sequence, encoding 580 amino acids with a molecular weight of 63,987, as CoPK02. CoPK02 possessed twelve subdomains specific to protein kinases and exhibited 43, 35, 40% identity with rat CaMKI, CaMKII, CaMKIV, respectively, and 40% identity with CoPK12, one of the CaMK orthologs in C. cinerea. CoPK02 showed significant autophosphorylation activity and phosphorylated exogenous proteins in the presence of Ca 2+ /CaM. By the CaM-overlay assay we confirmed that the C-terminal sequence (Trp346-Arg358) was the calmodulin-binding site, and that the binding of Ca 2+ /CaM to CoPK02 was reduced by the autophosphorylation of CoPK02. Since CoPK02 evolved in a different clade from CoPK12, and showed different gene expression compared to that of CoPK32, which is homologous to mitogen-activated protein kinase-activated protein kinase, CoPK02 and CoPK12 might cooperatively regulate Ca 2+ -signaling in C. cinerea.
The identification and functional annotation of RNA structures conserved in vertebrates.

PubMed

Seemann, Stefan E; Mirza, Aashiq H; Hansen, Claus; Bang-Berthelsen, Claus H; Garde, Christian; Christensen-Dalsgaard, Mikkel; Torarinsson, Elfar; Yao, Zizhen; Workman, Christopher T; Pociot, Flemming; Nielsen, Henrik; Tommerup, Niels; Ruzzo, Walter L; Gorodkin, Jan

2017-08-01

Structured elements of RNA molecules are essential in, e.g., RNA stabilization, localization, and protein interaction, and their conservation across species suggests a common functional role. We computationally screened vertebrate genomes for conserved RNA structures (CRSs), leveraging structure-based, rather than sequence-based, alignments. After careful correction for sequence identity and GC content, we predict ∼516,000 human genomic regions containing CRSs. We find that a substantial fraction of human-mouse CRS regions (1) colocalize consistently with binding sites of the same RNA binding proteins (RBPs) or (2) are transcribed in corresponding tissues. Additionally, a CaptureSeq experiment revealed expression of many of our CRS regions in human fetal brain, including 662 novel ones. For selected human and mouse candidate pairs, qRT-PCR and in vitro RNA structure probing supported both shared expression and shared structure despite low abundance and low sequence identity. About 30,000 CRS regions are located near coding or long noncoding RNA genes or within enhancers. Structured (CRS overlapping) enhancer RNAs and extended 3' ends have significantly increased expression levels over their nonstructured counterparts. Our findings of transcribed uncharacterized regulatory regions that contain CRSs support their RNA-mediated functionality. © 2017 Seemann et al.; Published by Cold Spring Harbor Laboratory Press.
Do you remember where sounds, pictures and words came from? The role of the stimulus format in object location memory.

PubMed

Delogu, Franco; Lilla, Christopher C

2017-11-01

Contrasting results in visual and auditory spatial memory stimulate the debate over the role of sensory modality and attention in identity-to-location binding. We investigated the role of sensory modality in the incidental/deliberate encoding of the location of a sequence of items. In 4 separated blocks, 88 participants memorised sequences of environmental sounds, spoken words, pictures and written words, respectively. After memorisation, participants were asked to recognise old from new items in a new sequence of stimuli. They were also asked to indicate from which side of the screen (visual stimuli) or headphone channel (sounds) the old stimuli were presented in encoding. In the first block, participants were not aware of the spatial requirement while, in blocks 2, 3 and 4 they knew that their memory for item location was going to be tested. Results show significantly lower accuracy of object location memory for the auditory stimuli (environmental sounds and spoken words) than for images (pictures and written words). Awareness of spatial requirement did not influence localisation accuracy. We conclude that: (a) object location memory is more effective for visual objects; (b) object location is implicitly associated with item identity during encoding and (c) visual supremacy in spatial memory does not depend on the automaticity of object location binding.
Sinorhizobium meliloti strains TII7 and A5 by Multilocus Sequence Typing (MLST) have chromsomes identical with Rm1021 and form an effective and ineffective symbiosis with Medicago truncatula line Jemalong A17, respectively

USDA-ARS?s Scientific Manuscript database

The strains TII7 and A5 formed an effective and ineffective symbiosis with Medicago truncatula Jemalong A17, respectively. Both were shown to have identical chromsomes with strains Rm1021 and RCR2011 using a Multilocus Sequence Typing method. The 2260 bp segments of DNA stretching from the 3’ end ...
Distant sequences determine 5′ end formation of cox3 transcripts in Arabidopsis thaliana ecotype C24

PubMed Central

Forner, Joachim; Weber, Bärbel; Wiethölter, Caterina; Meyer, Rhonda C.; Binder, Stefan

2005-01-01

The genomic environments and the transcripts of the mitochondrial cox3 gene are investigated in three Arabidopsis thaliana ecotypes. While the proximate 5′ sequences up to nucleotide position −584, the coding regions and the 3′ flanking regions are identical in Columbia (Col), C24 and Landsberg erecta (Ler), genomic variation is detected in regions further upstream. In the mitochondrial DNA of Col, a 1790 bp fragment flanked by a nonanucleotide direct repeat is present beyond position −584 with respect to the ATG. While in Ler only part of this insertion is conserved, this sequence is completely absent in C24, except for a single copy of the nonanucleotide direct repeat. Northern hybridization reveals identical major transcripts in the three ecotypes, but identifies an additional abundant 60 nt larger mRNA species in C24. The extremities of the most abundant mRNA species are identical in the three ecotypes. In C24, an extra major 5′ end is abundant. This terminus and the other major 5′ ends are located in identical sequence regions. Inspection of Atcox3 transcripts in C24/Col hybrids revealed a female inheritance of the mRNA species with the extra 5′ terminus. Thus, a mitochondrially encoded factor determines the generation of an extra 5′ mRNA end. PMID:16107557
Discovery of a novel retrovirus sequence in an Australian native rodent (Melomys burtoni): a putative link between gibbon ape leukemia virus and koala retrovirus.

PubMed

Simmons, Greg; Clarke, Daniel; McKee, Jeff; Young, Paul; Meers, Joanne

2014-01-01

Gibbon ape leukaemia virus (GALV) and koala retrovirus (KoRV) share a remarkably close sequence identity despite the fact that they occur in distantly related mammals on different continents. It has previously been suggested that infection of their respective hosts may have occurred as a result of a species jump from another, as yet unidentified vertebrate host. To investigate possible sources of these retroviruses in the Australian context, DNA samples were obtained from 42 vertebrate species and screened using PCR in order to detect proviral sequences closely related to KoRV and GALV. Four proviral partial sequences totalling 2880 bases which share a strong similarity with KoRV and GALV were detected in DNA from a native Australian rodent, the grassland melomys, Melomys burtoni. We have designated this novel gammaretrovirus Melomys burtoni retrovirus (MbRV). The concatenated nucleotide sequence of MbRV shares 93% identity with the corresponding sequence from GALV-SEATO and 83% identity with KoRV. The geographic ranges of the grassland melomys and of the koala partially overlap. Thus a species jump by MbRV from melomys to koalas is conceivable. However the genus Melomys does not occur in mainland South East Asia and so it appears most likely that another as yet unidentified host was the source of GALV.
An approach for identification of unknown viruses using sequencing-by-hybridization.

PubMed

Katoski, Sarah E; Meyer, Hermann; Ibrahim, Sofi

2015-09-01

Accurate identification of biological threat agents, especially RNA viruses, in clinical or environmental samples can be challenging because the concentration of viral genomic material in a given sample is usually low, viral genomic RNA is liable to degradation, and RNA viruses are extremely diverse. A two-tiered approach was used for initial identification, then full genomic characterization of 199 RNA viruses belonging to virus families Arenaviridae, Bunyaviridae, Filoviridae, Flaviviridae, and Togaviridae. A Sequencing-by-hybridization (SBH) microarray was used to tentatively identify a viral pathogen then, the identity is confirmed by guided next-generation sequencing (NGS). After optimization and evaluation of the SBH and NGS methodologies with various virus species and strains, the approach was used to test the ability to identify viruses in blinded samples. The SBH correctly identified two Ebola viruses in the blinded samples within 24 hr, and by using guided amplicon sequencing with 454 GS FLX, the identities of the viruses in both samples were confirmed. SBH provides at relatively low-cost screening of biological samples against a panel of viral pathogens that can be custom-designed on a microarray. Once the identity of virus is deduced from the highest hybridization signal on the SBH microarray, guided (amplicon) NGS sequencing can be used not only to confirm the identity of the virus but also to provide further information about the strain or isolate, including a potential genetic manipulation. This approach can be useful in situations where natural or deliberate biological threat incidents might occur and a rapid response is required. © 2015 Wiley Periodicals, Inc.
Sequence analysis of infectious pancreatic necrosis virus isolated from Iranian reared rainbow trout (Oncorhynchus mykiss) in 2012.

PubMed

Dadar, Maryam; Peyghan, Rahim; Memari, Hamid Rajabi; Shapouri, Masod Reza Seifi Abad; Hasanzadeh, Reza; Goudarzi, Laleh Moazzami; Vakharia, Vikram N

2013-12-01

Infectious pancreatic necrosis virus (IPNV) is the causal agent of a highly contagious disease that affects many species of fish and shellfish. This virus causes economically significant diseases of farmed rainbow trout, Oncorhynchus mykiss (Walbaum), in Iran, which is often associated with the transmission of pathogens from European resources. In this study, moribund rainbow trout fry samples were collected during an outbreak of IPNV in three different fish farms in north and west provinces of Iran in 2012; and we investigated the full genome sequence of Iranian IPNV and compared it with previously identified IPNV sequences. The sequences of different structural and nonstructural-protein genes were compared to those of other aquatic birnaviruses sequenced to date. Our results show that the Iranian isolate falls within genogroup 5, serotype A2 strain SP, having 99% identity with the strain 1146 from Spain. These results suggest that the Iranian isolate may have originated from Europe.
Complete nucleotide sequence of Alfalfa mosaic virus isolated from alfalfa (Medicago sativa L.) in Argentina.

PubMed

Trucco, Verónica; de Breuil, Soledad; Bejerman, Nicolás; Lenardon, Sergio; Giolitti, Fabián

2014-06-01

The complete nucleotide sequence of an Alfalfa mosaic virus (AMV) isolate infecting alfalfa (Medicago sativa L.) in Argentina, AMV-Arg, was determined. The virus genome has the typical organization described for AMV, and comprises 3,643, 2,593, and 2,038 nucleotides for RNA1, 2 and 3, respectively. The whole genome sequence and each encoding region were compared with those of other four isolates that have been completely sequenced from China, Italy, Spain and USA. The nucleotide identity percentages ranged from 95.9 to 99.1 % for the three RNAs and from 93.7 to 99 % for the protein 1 (P1), protein 2 (P2), movement protein and coat protein (CP) encoding regions, whereas the amino acid identity percentages of these proteins ranged from 93.4 to 99.5 %, the lowest value corresponding to P2. CP sequences of AMV-Arg were compared with those of other 25 available isolates, and the phylogenetic analysis based on the CP gene was carried out. The highest percentage of nucleotide sequence identity of the CP gene was 98.3 % with a Chinese isolate and 98.6 % at the amino acid level with four isolates, two from Italy, one from Brazil and the remaining one from China. The phylogenetic analysis showed that AMV-Arg is closely related to subgroup I of AMV isolates. To our knowledge, this is the first report of a complete nucleotide sequence of AMV from South America and the first worldwide report of complete nucleotide sequence of AMV isolated from alfalfa as natural host.
Complete chloroplast genome sequences of Hordeum vulgare, Sorghum bicolor and Agrostis stolonifera, and comparative analyses with other grass genomes

PubMed Central

Saski, Christopher; Lee, Seung-Bum; Fjellheim, Siri; Guda, Chittibabu; Jansen, Robert K.; Luo, Hong; Tomkins, Jeffrey; Rognli, Odd Arne; Clarke, Jihong Liu

2009-01-01

Comparisons of complete chloroplast genome sequences of Hordeum vulgare, Sorghum bicolor and Agrostis stolonifera to six published grass chloroplast genomes reveal that gene content and order are similar but two microstructural changes have occurred. First, the expansion of the IR at the SSC/IRa boundary that duplicates a portion of the 5′ end of ndhH is restricted to the three genera of the subfamily Pooideae (Agrostis, Hordeum and Triticum). Second, a 6 bp deletion in ndhK is shared by Agrostis, Hordeum, Oryza and Triticum, and this event supports the sister relationship between the subfamilies Erhartoideae and Pooideae. Repeat analysis identified 19–37 direct and inverted repeats 30 bp or longer with a sequence identity of at least 90%. Seventeen of the 26 shared repeats are found in all the grass chloroplast genomes examined and are located in the same genes or intergenic spacer (IGS) regions. Examination of simple sequence repeats (SSRs) identified 16–21 potential polymorphic SSRs. Five IGS regions have 100% sequence identity among Zea mays, Saccharum officinarum and Sorghum bicolor, whereas no spacer regions were identical among Oryza sativa, Triticum aestivum, H. vulgare and A. stolonifera despite their close phylogenetic relationship. Alignment of EST sequences and DNA coding sequences identified six C–U conversions in both Sorghum bicolor and H. vulgare but only one in A. stolonifera. Phylogenetic trees based on DNA sequences of 61 protein-coding genes of 38 taxa using both maximum parsimony and likelihood methods provide moderate support for a sister relationship between the subfamilies Erhartoideae and Pooideae. PMID:17534593
Method of determining the necessary number of observations for video stream documents recognition

NASA Astrophysics Data System (ADS)

Arlazarov, Vladimir V.; Bulatov, Konstantin; Manzhikov, Temudzhin; Slavin, Oleg; Janiszewski, Igor

2018-04-01

This paper discusses a task of document recognition on a sequence of video frames. In order to optimize the processing speed an estimation is performed of stability of recognition results obtained from several video frames. Considering identity document (Russian internal passport) recognition on a mobile device it is shown that significant decrease is possible of the number of observations necessary for obtaining precise recognition result.
Rattlesnake Neurotoxin Structure, Mechanism of Action, Immunology and Molecular Biology

DTIC Science & Technology

1992-09-10

and Kaiser, 1990). Sequencing of the three peptides present in the acidic subunit, two of which are blocked by pyroglutamate , represents a significant...deblock with pyroglutamate aminopeptidase were unsuccessful. The B-chain contained 35 amino acids and showed 91% amino acid identity witn the...similarities of all rattlesnake neurotoxins, showed that the acidic subunit plays more than a chaperone role for the basic subunit and is clearly
Multi-vendor reliability of arterial spin labeling perfusion MRI using a near-identical sequence: implications for multi-center studies.

PubMed

Mutsaerts, Henri J M M; van Osch, Matthias J P; Zelaya, Fernando O; Wang, Danny J J; Nordhøy, Wibeke; Wang, Yi; Wastling, Stephen; Fernandez-Seara, Maria A; Petersen, E T; Pizzini, Francesca B; Fallatah, Sameeha; Hendrikse, Jeroen; Geier, Oliver; Günther, Matthias; Golay, Xavier; Nederveen, Aart J; Bjørnerud, Atle; Groote, Inge R

2015-06-01

A main obstacle that impedes standardized clinical and research applications of arterial spin labeling (ASL), is the substantial differences between the commercial implementations of ASL from major MRI vendors. In this study, we compare a single identical 2D gradient-echo EPI pseudo-continuous ASL (PCASL) sequence implemented on 3T scanners from three vendors (General Electric Healthcare, Philips Healthcare and Siemens Healthcare) within the same center and with the same subjects. Fourteen healthy volunteers (50% male, age 26.4±4.7years) were scanned twice on each scanner in an interleaved manner within 3h. Because of differences in gradient and coil specifications, two separate studies were performed with slightly different sequence parameters, with one scanner used across both studies for comparison. Reproducibility was evaluated by means of quantitative cerebral blood flow (CBF) agreement and inter-session variation, both on a region-of-interest (ROI) and voxel level. In addition, a qualitative similarity comparison of the CBF maps was performed by three experienced neuro-radiologists. There were no CBF differences between vendors in study 1 (p>0.1), but there were CBF differences of 2-19% between vendors in study 2 (p<0.001 in most gray matter ROIs) and 10-22% difference in CBF values obtained with the same vendor between studies (p<0.001 in most gray matter ROIs). The inter-vendor inter-session variation was not significantly larger than the intra-vendor variation in all (p>0.1) but one of the ROIs (p<0.001). This study demonstrates the possibility to acquire comparable cerebral CBF maps on scanners of different vendors. Small differences in sequence parameters can have a larger effect on the reproducibility of ASL than hardware or software differences between vendors. These results suggest that researchers should strive to employ identical labeling and readout strategies in multi-center ASL studies. Copyright © 2015 Elsevier Inc. All rights reserved.
Complete genome sequence of a tomato infecting tomato mottle mosaic virus in New York

USDA-ARS?s Scientific Manuscript database

Complete genome sequence of an emerging isolate of tomato mottle mosaic virus (ToMMV) infecting experimental nicotianan benthamiana plants in up-state New York was obtained using small RNA deep sequencing. ToMMV_NY-13 shared 99% sequence identity to ToMMV isolates from Mexico and Florida. Broader d...
SEAN: SNP prediction and display program utilizing EST sequence clusters.

PubMed

Huntley, Derek; Baldo, Angela; Johri, Saurabh; Sergot, Marek

2006-02-15

SEAN is an application that predicts single nucleotide polymorphisms (SNPs) using multiple sequence alignments produced from expressed sequence tag (EST) clusters. The algorithm uses rules of sequence identity and SNP abundance to determine the quality of the prediction. A Java viewer is provided to display the EST alignments and predicted SNPs.

Molecular evolution of miraculin-like proteins in soybean Kunitz super-family.

PubMed

Selvakumar, Purushotham; Gahloth, Deepankar; Tomar, Prabhat Pratap Singh; Sharma, Nidhi; Sharma, Ashwani Kumar

2011-12-01

Miraculin-like proteins (MLPs) belong to soybean Kunitz super-family and have been characterized from many plant families like Rutaceae, Solanaceae, Rubiaceae, etc. Many of them possess trypsin inhibitory activity and are involved in plant defense. MLPs exhibit significant sequence identity (~30-95%) to native miraculin protein, also belonging to Kunitz super-family compared with a typical Kunitz family member (~30%). The sequence and structure-function comparison of MLPs with that of a classical Kunitz inhibitor have demonstrated that MLPs have evolved to form a distinct group within Kunitz super-family. Sequence analysis of new genes along with available MLP sequences in the literature revealed three major groups for these proteins. A significant feature of Rutaceae MLP type 2 sequences is the presence of phosphorylation motif. Subtle changes are seen in putative reactive loop residues among different MLPs suggesting altered specificities to specific proteases. In phylogenetic analysis, Rutaceae MLP type 1 and type 2 proteins clustered together on separate branches, whereas native miraculin along with other MLPs formed distinct clusters. Site-specific positive Darwinian selection was observed at many sites in both the groups of Rutaceae MLP sequences with most of the residues undergoing positive selection located in loop regions. The results demonstrate the sequence and thereby the structure-function divergence of MLPs as a distinct group within soybean Kunitz super-family due to biotic and abiotic stresses of local environment.
New advances in molecular epizootiology of canine hematic protozoa from Venezuela, Thailand and Spain.

PubMed

Criado-Fornelio, A; Rey-Valeiron, C; Buling, A; Barba-Carretero, J C; Jefferies, R; Irwin, P

2007-03-31

The prevalence of hematozoan infections (Hepatozoon canis and Babesia sp., particularly Babesia canis vogeli) in canids from Venezuela, Thailand and Spain was studied by amplification and sequencing of the 18S rRNA gene. H. canis infections caused simultaneously by two different isolates were confirmed by RFLP analysis in samples from all the geographic regions studied. In Venezuela, blood samples from 134 dogs were surveyed. Babesia infections were found in 2.24% of the dogs. Comparison of sequences of the 18S rRNA gene indicated that protozoan isolates were genetically identical to B. canis vogeli from Japan and Brazil. H. canis infected 44.77 per cent of the dogs. A representative sample of Venezuelan H. canis isolates (21.6% of PCR-positives) was sequenced. Many of them showed 18S rRNA gene sequences identical to H. canis Spain 2, albeit two less frequent genotypes were found in the sample studied. In Thailand, 20 dogs were analyzed. No infections caused by Babesia were diagnosed, whereas 30 per cent of the dogs were positive to hematozoan infection. Two protozoa isolates showing 99.7-100% identity to H. canis Spain 2 were found. In Spain, 250 dogs were studied. B. canis vogeli infected 0.01% of the animals. The sequence of the 18S rRNA gene in Spanish isolates of this protozoa was closely related to those previously deposited in GenBank (> 99% identity). Finally, 20 red foxes were screened for hematozoans employing semi-nested PCR and primers designed to detect Babesia/Theileria. Fifty percent of the foxes were positive to Theileria annae. In addition, it was found that the PCR assay was able as well to detect Hepatozoon infections. Thirty five percent of the foxes were infected with two different H. canis isolates showing 99.8-100% identity to Curupira 1 from Brazil.
Identity of Fasciola spp. in sheep in Egypt.

PubMed

Amer, Said; ElKhatam, Ahmed; Zidan, Shereif; Feng, Yaoyu; Xiao, Lihua

2016-12-01

In Egypt, liver flukes, Fasciola spp. (Digenea: Fasciolidae), have a serious impact on the farming industry and public health. Both Fasciola hepatica and Fasciola gigantica are known to occur in cattle, providing the opportunity for genetic recombination. Little is known on the identity and genetic variability of Fasciola populations in sheep. This study was performed to determine the prevalence of liver flukes in sheep in Menofia Province as a representative area of the delta region in Egypt, as measured by postmortem examination of slaughtered animals at three abattoirs. The identity and genetic variability of Fasciola spp. in slaughtered animals were determined by PCR-sequence analysis of the nuclear ribosomal internal transcribed spacer 1 (ITS1) and the mitochondrial NADH dehydrogenase subunit 1 (nad1) genes. Physical inspection of the liver indicated that 302 of 2058 (14.7%) slaughtered sheep were infected with Fasciola spp. Sequence analysis of the ITS1 and nad1 genes of liver flukes from 17 animals revealed that 11 animals were infected with F. hepatica, four with F. gigantica, and two with both species. Seventy eight of 103 flukes genetically characterized from these animals were F. hepatica, 23 were F. gigantica, and two had ITS1 sequences identical to F. hepatica but nad1 sequences identical to F. gigantica. nad1 sequences of Egyptian isolates of F. gigantica showed pronounced differences from those in the GenBank database. Egyptian F. gigantica haplotypes formed haplogroup D, which clustered in a sister clade with haplogroups A, B and C circulating in Asia, indicating the existence of geographic isolation in the species. Both F. hepatica and F. gigantica are prevalent in sheep in Egypt and an introgressed form of the two occurs as the result of genetic recombination. In addition, a geographically isolated F. gigantica population is present in the country. The importance of these observations in epidemiology of fascioliasis needs to be examined in future studies.
Characterization, genetic diversity, and evolutionary link of Cucumber mosaic virus strain New Delhi from India.

PubMed

Koundal, Vikas; Haq, Qazi Mohd Rizwanul; Praveen, Shelly

2011-02-01

The genome of Cucumber mosaic virus New Delhi strain (CMV-ND) from India, obtained from tomato, was completely sequenced and compared with full genome sequences of 14 known CMV strains from subgroups I and II, for their genetic diversity. Sequence analysis suggests CMV-ND shares maximum sequence identity at the nucleotide level with a CMV strain from Taiwan. Among all 15 strains of CMV, the encoded protein 2b is least conserved, whereas the coat protein (CP) is most conserved. Sequence identity values and phylogram results indicate that CMV-ND belongs to subgroup I. Based on the recombination detection program result, it appears that CMV is prone to recombination, and different RNA components of CMV-ND have evolved differently. Recombinational analysis of all 15 CMV strains detected maximum recombination breakpoints in RNA2; CP showed the least recombination sites.
Genome sequence determination and metagenomic characterization of a Dehalococcoides mixed culture grown on cis-1,2-dichloroethene.

PubMed

Yohda, Masafumi; Yagi, Osami; Takechi, Ayane; Kitajima, Mizuki; Matsuda, Hisashi; Miyamura, Naoaki; Aizawa, Tomoko; Nakajima, Mutsuyasu; Sunairi, Michio; Daiba, Akito; Miyajima, Takashi; Teruya, Morimi; Teruya, Kuniko; Shiroma, Akino; Shimoji, Makiko; Tamotsu, Hinako; Juan, Ayaka; Nakano, Kazuma; Aoyama, Misako; Terabayashi, Yasunobu; Satou, Kazuhito; Hirano, Takashi

2015-07-01

A Dehalococcoides-containing bacterial consortium that performed dechlorination of 0.20 mM cis-1,2-dichloroethene to ethene in 14 days was obtained from the sediment mud of the lotus field. To obtain detailed information of the consortium, the metagenome was analyzed using the short-read next-generation sequencer SOLiD 3. Matching the obtained sequence tags with the reference genome sequences indicated that the Dehalococcoides sp. in the consortium was highly homologous to Dehalococcoides mccartyi CBDB1 and BAV1. Sequence comparison with the reference sequence constructed from 16S rRNA gene sequences in a public database showed the presence of Sedimentibacter, Sulfurospirillum, Clostridium, Desulfovibrio, Parabacteroides, Alistipes, Eubacterium, Peptostreptococcus and Proteocatella in addition to Dehalococcoides sp. After further enrichment, the members of the consortium were narrowed down to almost three species. Finally, the full-length circular genome sequence of the Dehalococcoides sp. in the consortium, D. mccartyi IBARAKI, was determined by analyzing the metagenome with the single-molecule DNA sequencer PacBio RS. The accuracy of the sequence was confirmed by matching it to the tag sequences obtained by SOLiD 3. The genome is 1,451,062 nt and the number of CDS is 1566, which includes 3 rRNA genes and 47 tRNA genes. There exist twenty-eight RDase genes that are accompanied by the genes for anchor proteins. The genome exhibits significant sequence identity with other Dehalococcoides spp. throughout the genome, but there exists significant difference in the distribution RDase genes. The combination of a short-read next-generation DNA sequencer and a long-read single-molecule DNA sequencer gives detailed information of a bacterial consortium. Copyright © 2014 The Society for Biotechnology, Japan. Published by Elsevier B.V. All rights reserved.
MetaGO: Predicting Gene Ontology of Non-homologous Proteins Through Low-Resolution Protein Structure Prediction and Protein-Protein Network Mapping.

PubMed

Zhang, Chengxin; Zheng, Wei; Freddolino, Peter L; Zhang, Yang

2018-03-10

Homology-based transferal remains the major approach to computational protein function annotations, but it becomes increasingly unreliable when the sequence identity between query and template decreases below 30%. We propose a novel pipeline, MetaGO, to deduce Gene Ontology attributes of proteins by combining sequence homology-based annotation with low-resolution structure prediction and comparison, and partner's homology-based protein-protein network mapping. The pipeline was tested on a large-scale set of 1000 non-redundant proteins from the CAFA3 experiment. Under the stringent benchmark conditions where templates with >30% sequence identity to the query are excluded, MetaGO achieves average F-measures of 0.487, 0.408, and 0.598, for Molecular Function, Biological Process, and Cellular Component, respectively, which are significantly higher than those achieved by other state-of-the-art function annotations methods. Detailed data analysis shows that the major advantage of the MetaGO lies in the new functional homolog detections from partner's homology-based network mapping and structure-based local and global structure alignments, the confidence scores of which can be optimally combined through logistic regression. These data demonstrate the power of using a hybrid model incorporating protein structure and interaction networks to deduce new functional insights beyond traditional sequence homology-based referrals, especially for proteins that lack homologous function templates. The MetaGO pipeline is available at http://zhanglab.ccmb.med.umich.edu/MetaGO/. Copyright © 2018. Published by Elsevier Ltd.
Isolation and Characterization of Two Cryptic Plasmids in the Ammonia-Oxidizing Bacterium Nitrosomonas sp. Strain ENI-11

PubMed Central

Yamagata, Akira; Kato, Junichi; Hirota, Ryuichi; Kuroda, Akio; Ikeda, Tsukasa; Takiguchi, Noboru; Ohtake, Hisao

1999-01-01

Two plasmids were discovered in the ammonia-oxidizing bacterium Nitrosomonas sp. strain ENI-11, which was isolated from activated sludge. The plasmids, designated pAYS and pAYL, were relatively small, being approximately 1.9 kb long. They were cryptic plasmids, having no detectable plasmid-linked antibiotic resistance or heavy metal resistance markers. The complete nucleotide sequences of pAYS and pAYL were determined, and their physical maps were constructed. There existed two major open reading frames, ORF1 in pAYS and ORF2 in pAYL, each of which was more than 500 bp long. The predicted product of ORF2 was 28% identical to part of the replication protein of a Bacillus plasmid, pBAA1. However, no significant similarity to any known protein sequences was detected with the predicted product of ORF1. pAYS and pAYL had a highly homologous region, designated HHR, of 262 bp. The overall identity was 98% between the two nucleotide sequences. Interestingly, HHR-homologous sequences were also detected in the genomes of ENI-11 and the plasmidless strain Nitrosomonas europaea IFO14298. Deletion analysis of pAYS and pAYL indicated that HHR, together with either ORF1 or ORF2, was essential for plasmid maintenance in ENI-11. To our knowledge, pAYS and pAYL are the first plasmids found in the ammonia-oxidizing autotrophic bacteria. PMID:10348848
In silico analysis of L-asparaginase from different source organisms.

PubMed

Dwivedi, Vivek Dhar; Mishra, Sarad Kumar

2014-06-01

L-asparaginases are widely distributed enzymes among plants, fungi and bacteria. This enzyme catalyzes the conversion of l-asparagine to l-aspartate and ammonia and to a lesser extent the formation of l-glutamate from l-glutamine. In the present study, forty-five full-length amino acid sequences of L-asparaginases from bacteria, fungi and plants were collected and subjected to multiple sequence alignment (MSA), domain identification, discovering individual amino acid composition, and phylogenetic tree construction. MSA revealed that two glycine residues were identically found in all analyzed species, two glycine residues were also identically found in all the fungal and bacterial sources and three glycine residues were identically found in all plant and bacterial sources while no residue was identically found in plant and fungal L-asparaginases. Two major sequence clusters were constructed by phylogenetic analysis. One cluster contains eleven species of fungi, twelve species of bacteria, and one species of plant, whereas the other one contains fourteen species of plant, four species of fungi and three species bacteria. The amino acid composition result revealed that the average frequency of amino acid alanine is 10.77 percent that is very high in comparison to other amino acids in all analyzed species.
Grapevine virus I, a putative new vitivirus detected in co-infection with grapevine virus G in New Zealand.

PubMed

Blouin, Arnaud G; Chooi, Kar Mun; Warren, Ben; Napier, Kathryn R; Barrero, Roberto A; MacDiarmid, Robin M

2018-05-01

A novel virus, with characteristics of viruses classified within the genus Vitivirus, was identified from a sample of Vitis vinifera cv. Chardonnay in New Zealand. The virus was detected with high throughput sequencing (small RNA and total RNA) and its sequence was confirmed by Sanger sequencing. Its genome is 7507 nt long (excluding the polyA tail) with an organisation similar to that described for other classifiable members of the genus Vitivirus. The closest relative of the virus is grapevine virus E (GVE) with 65% aa identity in ORF1 (65% nt identity) and 63% aa identity in the coat protein (66% nt identity). The relationship with GVE was confirmed with phylogenetic analysis, showing the new virus branching with GVE, Agave tequilina leaf virus and grapevine virus G (GVG). A limited survey revealed the presence of this virus in multiple plants from the same location where the newly described GVG was discovered, and in most cases both viruses were detected as co-infections. The genetic characteristics of this virus suggest it represents an isolate of a new species within the genus Vitivirus and following the current nomenclature, we propose the name "Grapevine virus I".
Vacuolar H[sup +]-ATPase 69-kilodalton catalytic subunit cDNA from developing cotton (Gossypium hirsutum) ovules

DOE Office of Scientific and Technical Information (OSTI.GOV)

Wilkins, T.A.

1993-06-01

This study investigates the molecular events of vacuole ontogeny in rapidly elongated cotton plant cells. Within the DNA coding region, the cotton and carrot cDNA clones exhibit 82.2% nucleotide sequence homology; at the amino acid level cotton and carrot catalytic subunits exhibited 95.7% identity and 2.1% amino acid similarity. When aligned with the analogous sequences from yeast, the cotton protein shared only 60.5% amino acid identity and 12.7% similarity. 10 refs., 1 tab.
Molecular detection of kobuviruses in European roe deer (Capreolus capreolus) in Italy.

PubMed

Di Martino, Barbara; Di Profio, Federica; Melegari, Irene; Di Felice, Elisabetta; Robetto, Serena; Guidetti, Cristina; Orusa, Riccardo; Martella, Vito; Marsilio, Fulvio

2015-08-01

Kobuvirus RNA was found in 6.6 % (13/198) of stool specimens from roe deer (Capreolus capreolus) captured during the regular hunting season. Upon sequence analysis of a fragment of the 3D gene, nine strains displayed the highest nucleotide sequence identity (91.2-97.4 %) to bovine kobuviruses previously detected in either diarrhoeic or asymptomatic calves. Interestingly, four strains were genetically related to the newly discovered caprine kobuviruses (84.2-87.6 % nucleotide identity) identified in black goats in Korea.
Phylogeny of the Genus Flavivirus

PubMed Central

Kuno, Goro; Chang, Gwong-Jen J.; Tsuchiya, K. Richard; Karabatsos, Nick; Cropp, C. Bruce

1998-01-01

We undertook a comprehensive phylogenetic study to establish the genetic relationship among the viruses of the genus Flavivirus and to compare the classification based on molecular phylogeny with the existing serologic method. By using a combination of quantitative definitions (bootstrap support level and the pairwise nucleotide sequence identity), the viruses could be classified into clusters, clades, and species. Our phylogenetic study revealed for the first time that from the putative ancestor two branches, non-vector and vector-borne virus clusters, evolved and from the latter cluster emerged tick-borne and mosquito-borne virus clusters. Provided that the theory of arthropod association being an acquired trait was correct, pairwise nucleotide sequence identity among these three clusters provided supporting data for a possibility that the non-vector cluster evolved first, followed by the separation of tick-borne and mosquito-borne virus clusters in that order. Clades established in our study correlated significantly with existing antigenic complexes. We also resolved many of the past taxonomic problems by establishing phylogenetic relationships of the antigenically unclassified viruses with the well-established viruses and by identifying synonymous viruses. PMID:9420202
Phylogeny of the genus Flavivirus.

PubMed

Kuno, G; Chang, G J; Tsuchiya, K R; Karabatsos, N; Cropp, C B

1998-01-01

We undertook a comprehensive phylogenetic study to establish the genetic relationship among the viruses of the genus Flavivirus and to compare the classification based on molecular phylogeny with the existing serologic method. By using a combination of quantitative definitions (bootstrap support level and the pairwise nucleotide sequence identity), the viruses could be classified into clusters, clades, and species. Our phylogenetic study revealed for the first time that from the putative ancestor two branches, non-vector and vector-borne virus clusters, evolved and from the latter cluster emerged tick-borne and mosquito-borne virus clusters. Provided that the theory of arthropod association being an acquired trait was correct, pairwise nucleotide sequence identity among these three clusters provided supporting data for a possibility that the non-vector cluster evolved first, followed by the separation of tick-borne and mosquito-borne virus clusters in that order. Clades established in our study correlated significantly with existing antigenic complexes. We also resolved many of the past taxonomic problems by establishing phylogenetic relationships of the antigenically unclassified viruses with the well-established viruses and by identifying synonymous viruses.
Some identities of generalized Fibonacci sequence

NASA Astrophysics Data System (ADS)

Chong, Chin-Yoon; Cheah, C. L.; Ho, C. K.

2014-07-01

We introduced the generalized Fibonacci sequence {Un} defined by U0 = 0, U1 = 1, and Un+2 = pUn+1+qUn for all p, q∈Z+ and for all non-negative integers n. In this paper, we obtained some recursive formulas of the sequence.
Complete genome sequence of a new maize-associated cytorhabdovirus

USDA-ARS?s Scientific Manuscript database

A new 11,877 nt cytorhabdovirus sequence with 6 open reading frames has been identified in a maize sample. It shares 50 and 51% genome-wide nucleotide sequence identity with northern cereal mosaic cytorhabdovirus (NCMV) and barley yellow striate mosaic cytorhabdovirus (BYSMV), respectively....
Molecular detection and phylogenetic analysis of Hepatozoon spp. in questing Ixodes ricinus ticks and rodents from Slovakia and Czech Republic.

PubMed

Hamšíková, Zuzana; Silaghi, Cornelia; Rudolf, Ivo; Venclíková, Kristýna; Mahríková, Lenka; Slovák, Mirko; Mendel, Jan; Blažejová, Hana; Berthová, Lenka; Kocianová, Elena; Hubálek, Zdeněk; Schnittger, Leonhard; Kazimírová, Mária

2016-10-01

By amplification and sequencing of 18S rRNA gene fragments, Hepatozoon spp. DNA was detected in 0.08 % (4/5057) and 0.04 % (1/2473) of questing Ixodes ricinus ticks from Slovakia and Czech Republic, respectively. Hepatozoon spp. DNA was also detected in spleen and/or lungs of 4.45 % (27/606) of rodents from Slovakia. Prevalence of infection was significantly higher in Myodes glareolus (11.45 %) than in Apodemus spp. (0.28 %) (P < 0.001). Sequencing of 18S rRNA Hepatozoon spp. gene amplicons from I. ricinus showed 100 % identity with Hepatozoon canis isolates from red foxes or dogs in Europe. Phylogenetic analysis showed that at least two H. canis 18S rRNA genotypes exist in Slovakia of which one was identified also in the Czech Republic. The finding of H. canis in questing I. ricinus suggests the geographical spread of the parasite and a potential role of other ticks as its vectors in areas where Rhipicephalus sanguineus is not endemic. Sequencing of 18S rRNA gene amplicons from M. glareolus revealed the presence of two closely related genetic variants, Hepatozoon sp. SK1 and Hepatozoon sp. SK2, showing 99-100 % identity with isolates from M. glareolus from other European countries. Phylogenetic analysis demonstrates that 18S rRNA variants SK1 and SK2 correspond to previously described genotypes UR1 and UR2 of H. erhardovae, respectively. The isolate from Apodemus flavicollis (Hepatozoon sp. SK3b) was 99 % identical with isolates from reptiles in Africa and Asia. Further studies are necessary to identify the taxonomic status of Hepatozoon spp. parasitizing rodents in Europe and the host-parasite interactions in natural foci.
Metabolism of β-valine via a CoA-dependent ammonia lyase pathway.

PubMed

Otzen, Marleen; Crismaru, Ciprian G; Postema, Christiaan P; Wijma, Hein J; Heberling, Matthew M; Szymanski, Wiktor; de Wildeman, Stefaan; Janssen, Dick B

2015-11-01

Pseudomonas species strain SBV1 can rapidly grow on medium containing β-valine as a sole nitrogen source. The tertiary amine feature of β-valine prevents direct deamination reactions catalyzed by aminotransferases, amino acid dehydrogenases, and amino acid oxidases. However, lyase- or aminomutase-mediated conversions would be possible. To identify enzymes involved in the degradation of β-valine, a PsSBV1 gene library was prepared and used to complement the β-valine growth deficiency of a closely related Pseudomonas strain. This resulted in the identification of a gene encoding β-valinyl-coenzyme A ligase (BvaA) and two genes encoding β-valinyl-CoA ammonia lyases (BvaB1 and BvaB2). The BvaA protein demonstrated high sequence identity to several known phenylacetate CoA ligases. Purified BvaA enzyme did not convert phenyl acetic acid but was able to activate β-valine in an adenosine triphosphate (ATP)- and CoA-dependent manner. The substrate range of the enzyme appears to be narrow, converting only β-valine and to a lesser extent, 3-aminobutyrate and β-alanine. Characterization of BvaB1 and BvaB2 revealed that both enzymes were able to deaminate β-valinyl-CoA to produce 3-methylcrotonyl-CoA, a common intermediate in the leucine degradation pathway. Interestingly, BvaB1 and BvaB2 demonstrated no significant sequence identity to known CoA-dependent ammonia lyases, suggesting they belong to a new family of enzymes. BLAST searches revealed that BvaB1 and BvaB2 show high sequence identity to each other and to several enoyl-CoA hydratases, a class of enzymes that catalyze a similar reaction with water instead of amine as the leaving group.
The context of transcription start site regions is crucial for transcription of a plant tRNA(Lys)(UUU) gene group both in vitro and in vivo.

PubMed

Yukawa, Yasushi; Akama, Kazuhito; Noguchi, Kanta; Komiya, Masaaki; Sugiura, Masahiro

2013-01-10

Nuclear tRNA genes are transcribed by RNA polymerase III. The A- and B-boxes located within the transcribed regions are essential promoter elements for nuclear tRNA gene transcription. The Arabidopsis genome contains ten annotated genes encoding identical tRNA(Lys)(UUU) molecules, which are scattered on the five chromosomes. In this study, we prepared ten tDNA constructs including each of the tRNA(Lys)(UUU) coding sequences with their individual 5' and 3' flanking sequences, and assayed tRNA expression using an in vitro RNA polymerase III-dependent transcription system. Transcription levels differed significantly among the ten genes and two of the tRNA genes were transcribed at a very low level, despite possessing A- and B-boxes identical to those of the other tRNA genes. To examine whether the in vitro results were reproducible in vivo, the 5' flanking sequence of an amber suppressor tRNA gene was then replaced with those of the ten tRNA(Lys) genes. An in vivo experiment based on an amber suppressor tRNA that mediates suppression of a premature amber codon in a β-glucuronidase (GUS) reporter gene in plant tissues generated nearly identical results to those obtained in vitro. Analysis of mutated versions of the amber suppressor tRNA gene, which contained base substitutions around the transcription start site (TSS), showed that the context around the transcription start sites is a crucial determinant for transcription of plant tRNA(Lys)(UUU) both in vitro and in vivo. The above transcription regulation by context around TSS differed between tRNA genes and other Pol III-dependent genes. Copyright © 2012 Elsevier B.V. All rights reserved.
Sequence Variation in the Small-Subunit rRNA Gene of Plasmodium malariae and Prevalence of Isolates with the Variant Sequence in Sichuan, China

PubMed Central

Liu, Qing; Zhu, Shenghua; Mizuno, Sahoko; Kimura, Masatsugu; Liu, Peina; Isomura, Shin; Wang, Xingzhen; Kawamoto, Fumihiko

1998-01-01

By two PCR-based diagnostic methods, Plasmodium malariae infections have been rediscovered at two foci in the Sichuan province of China, a region where no cases of P. malariae have been officially reported for the last 2 decades. In addition, a variant form of P. malariae which has a deletion of 19 bp and seven substitutions of base pairs in the target sequence of the small-subunit (SSU) rRNA gene was detected with high frequency. Alignment analysis of Plasmodium sp. SSU rRNA gene sequences revealed that the 5′ region of the variant sequence is identical to that of P. vivax or P. knowlesi and its 3′ region is identical to that of P. malariae. The same sequence variations were also found in P. malariae isolates collected along the Thai-Myanmar border, suggesting a wide distribution of this variant form from southern China to Southeast Asia. PMID:9774600
Sequence analysis of MHC class I α2 from sockeye salmon (Oncorhynchus nerka).

PubMed

McClelland, Erin K; Ming, Tobi J; Tabata, Amy; Miller, Kristina M

2011-09-01

Most studies assessing adaptive MHC diversity in salmon populations have focused on the classical class II DAB or DAA loci, as these have been most amenable to single PCR amplifications due to their relatively low level of sequence divergence. Herein, we report the characterization of the classical class I UBA α2 locus based on collections taken throughout the species range of sockeye salmon (Oncorhynchus nerka). Through use of multiple lineage-specific primer sets, denaturing gradient gel electrophoresis and sequencing, we identified thirty-four alleles from three highly divergent lineages. Sequence identity between lineages ranged from 30.0% to 56.8% but was relatively high within lineages. Allelic identity within the antigen recognition site (ARS) was greater than for the longer sequence. Global positive selection on UBA was seen at the sequence level (dN:dS = 1.012) with four codons under positive selection and 12 codons under negative selection. Crown Copyright © 2011. Published by Elsevier Ltd. All rights reserved.

Increasing Sequence Diversity with Flexible Backbone Protein Design: The Complete Redesign of a Protein Hydrophobic Core

DOE Office of Scientific and Technical Information (OSTI.GOV)

Murphy, Grant S.; Mills, Jeffrey L.; Miley, Michael J.

2015-10-15

Protein design tests our understanding of protein stability and structure. Successful design methods should allow the exploration of sequence space not found in nature. However, when redesigning naturally occurring protein structures, most fixed backbone design algorithms return amino acid sequences that share strong sequence identity with wild-type sequences, especially in the protein core. This behavior places a restriction on functional space that can be explored and is not consistent with observations from nature, where sequences of low identity have similar structures. Here, we allow backbone flexibility during design to mutate every position in the core (38 residues) of a four-helixmore » bundle protein. Only small perturbations to the backbone, 12 {angstrom}, were needed to entirely mutate the core. The redesigned protein, DRNN, is exceptionally stable (melting point >140C). An NMR and X-ray crystal structure show that the side chains and backbone were accurately modeled (all-atom RMSD = 1.3 {angstrom}).« less
Comparison of the nucleotide and amino acid sequences of the RsrI and EcoRI restriction endonucleases.

PubMed

Stephenson, F H; Ballard, B T; Boyer, H W; Rosenberg, J M; Greene, P J

1989-12-21

The RsrI endonuclease, a type-II restriction endonuclease (ENase) found in Rhodobacter sphaeroides, is an isoschizomer of the EcoRI ENase. A clone containing an 11-kb BamHI fragment was isolated from an R. sphaeroides genomic DNA library by hybridization with synthetic oligodeoxyribonucleotide probes based on the N-terminal amino acid (aa) sequence of RsrI. Extracts of E. coli containing a subclone of the 11-kb fragment display RsrI activity. Nucleotide sequence analysis reveals an 831-bp open reading frame encoding a polypeptide of 277 aa. A 50% identity exists within a 266-aa overlap between the deduced aa sequences of RsrI and EcoRI. Regions of 75-100% aa sequence identity correspond to key structural and functional regions of EcoRI. The type-II ENases have many common properties, and a common origin might have been expected. Nevertheless, this is the first demonstration of aa sequence similarity between ENases produced by different organisms.
PASTA in Penicillin Binding Proteins and Serine/Threonine Kinases: A Recipe of Structural, Dynamic and Binding Properties.

PubMed

Calvanese, Luisa; Falcigno, Lucia; Squeglia, Flavia; D'Auria, Gabriella; Berisio, Rita

2017-11-24

Penicillin binding proteins (PBPs) and Serine Threonine kinases (STPKs) are two classes of bacterial enzymes whose involvement in a series of vital processes in bacterial growth and division is well assessed. Many PBPs and STPKs show linked an ancillary domain named PASTA, whose functional role is not completely deciphered so far. It has been proposed that PASTAs are sensor modules that by binding opportune ligands (i.e. muropeptides) activate the cognate proteins to their functions. However, based on recent data, the sensor annotation sounds true for PASTA from STPKs, and false for PASTA from PBPs. Different PASTA domains, belonging or not to different protein classes, sharing or not appreciable sequence identities, always show identical folds. This survey of the structural, binding and dynamic properties of PASTA domains pursues the reasons why identical topologies may turn in different roles. Amino acid compositions, total charges and distribution of the hydrophobic/hydrophilic patches on the surface, significantly vary among PASTAs from STPKs and PBPs and appear to correlate with different functions. A possible criterion to discriminate between PASTA modules of STPKs or PBPs solely based on their sequences is proposed. Possibly reflecting different species as well as functional roles and evolutionary profile, our routine represents a fast even though approximate method to distinguish between PASTA belonging to different classes. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.org.
The annotation-enriched non-redundant patent sequence databases.

PubMed

Li, Weizhong; Kondratowicz, Bartosz; McWilliam, Hamish; Nauche, Stephane; Lopez, Rodrigo

2013-01-01

The EMBL-European Bioinformatics Institute (EMBL-EBI) offers public access to patent sequence data, providing a valuable service to the intellectual property and scientific communities. The non-redundant (NR) patent sequence databases comprise two-level nucleotide and protein sequence clusters (NRNL1, NRNL2, NRPL1 and NRPL2) based on sequence identity (level-1) and patent family (level-2). Annotation from the source entries in these databases is merged and enhanced with additional information from the patent literature and biological context. Corrections in patent publication numbers, kind-codes and patent equivalents significantly improve the data quality. Data are available through various user interfaces including web browser, downloads via FTP, SRS, Dbfetch and EBI-Search. Sequence similarity/homology searches against the databases are available using BLAST, FASTA and PSI-Search. In this article, we describe the data collection and annotation and also outline major changes and improvements introduced since 2009. Apart from data growth, these changes include additional annotation for singleton clusters, the identifier versioning for tracking entry change and the entry mappings between the two-level databases. Database URL: http://www.ebi.ac.uk/patentdata/nr/
The Annotation-enriched non-redundant patent sequence databases

PubMed Central

Li, Weizhong; Kondratowicz, Bartosz; McWilliam, Hamish; Nauche, Stephane; Lopez, Rodrigo

2013-01-01

The EMBL-European Bioinformatics Institute (EMBL-EBI) offers public access to patent sequence data, providing a valuable service to the intellectual property and scientific communities. The non-redundant (NR) patent sequence databases comprise two-level nucleotide and protein sequence clusters (NRNL1, NRNL2, NRPL1 and NRPL2) based on sequence identity (level-1) and patent family (level-2). Annotation from the source entries in these databases is merged and enhanced with additional information from the patent literature and biological context. Corrections in patent publication numbers, kind-codes and patent equivalents significantly improve the data quality. Data are available through various user interfaces including web browser, downloads via FTP, SRS, Dbfetch and EBI-Search. Sequence similarity/homology searches against the databases are available using BLAST, FASTA and PSI-Search. In this article, we describe the data collection and annotation and also outline major changes and improvements introduced since 2009. Apart from data growth, these changes include additional annotation for singleton clusters, the identifier versioning for tracking entry change and the entry mappings between the two-level databases. Database URL: http://www.ebi.ac.uk/patentdata/nr/ PMID:23396323
Initial Detection and Molecular Characterization of Namaycush Herpesvirus (Salmonid Herpesvirus 5) in Lake Trout.

PubMed

Glenney, Gavin W; Barbash, Patricia A; Coll, John A

2016-03-01

A novel herpesvirus was found by molecular methods in samples of Lake Trout Salvelinus namaycush from Lake Erie, Pennsylvania, and Lake Ontario, Keuka Lake, and Lake Otsego, New York. Based on PCR amplification and partial sequencing of polymerase, terminase, and glycoprotein genes, a number of isolates were identified as a novel virus, which we have named Namaycush herpesvirus (NamHV) salmonid herpesvirus 5 (SalHV5). Phylogenetic analyses of three NamHV genes indicated strong clustering with other members of the genus Salmonivirus, placing these isolates into family Alloherpesviridae. The NamHV isolates were identical in the three partially sequenced genes; however, they varied from other salmonid herpesviruses in nucleotide sequence identity. In all three of the genes sequenced, NamHV shared the highest sequence identity with Atlantic Salmon papillomatosis virus (ASPV; SalHV4) isolated from Atlantic Salmon Salmo salar in northern Europe, including northwestern Russia. These results lead one to believe that NamHV and ASPV have a common ancestor that may have made a relatively recent host jump from Atlantic Salmon to Lake Trout or vice versa. Partial nucleotide sequence comparisons between NamHV and ASPV for the polymerase and glycoprotein genes differ by >5% and >10%, respectively. Additional nucleotide sequence comparisons between NamHV and epizootic epitheliotropic disease virus (EEDV/SalHV3) in the terminase, glycoprotein, and polymerase genes differ by >5%, >20%, and >10%, respectively. Thus, NamHV and EEDV may be occupying discrete ecological niches in Lake Trout. Even though NamHV shared the highest genetic identity with ASPV, each of these viruses has a separate host species, which also implies speciation. Additionally, NamHV has been detected over the last 4 years in four separate water bodies across two states, which suggests that NamHV is a distinct, naturally replicating lineage. This, in combination with a divergence in nucleotide sequence from EEDV, indicates that NamHV is a new species in the genus Salmonivirus. Received April 20, 2015; accepted October 11, 2015.
Complete genome sequence of a novel genotype of squash mosaic virus

USDA-ARS?s Scientific Manuscript database

Complete genome sequence of a novel genotype of Squash mosaic virus (SqMV) infecting squash plants in Spain was obtained using deep sequencing of small ribonucleic acids and assembly. The low nucleotide sequence identities, with 87-88% on RNA1 and 84-86% on RNA2 to known SqMV isolates, suggest a new...
First complete genome sequence of an emerging cucumber green mottle mosaic virus isolate in North America

USDA-ARS?s Scientific Manuscript database

The complete genome sequence (6,423 nt) of an emerging Cucumber green mottle mosaic virus (CGMMV) isolate on cucumber in North America was determined through deep sequencing of sRNA and rapid amplification of cDNA ends. It shares 99% nucleotide sequence identity to the Asian genotype, but only 90% t...
First report of Beet western yellows virus infecting Epiphyllum spp

USDA-ARS?s Scientific Manuscript database

Beet western yellow virus (BWYV) was identified from an orchid cactus (Epiphyllum spp.) hybrid without obvious symptoms by high-throughput sequencing. The nearly complete genomic sequence of 5,458 nucleotides of the virus was determined. The isolate has the highest nucleotide sequence identity (93%)...
A new betasatellite associated with cotton leaf curl Burewala virus infecting tomato in India: influence on symptoms and viral accumulation.

PubMed

Kumar, Jitendra; Gunapati, Samatha; Singh, Sudhir P; Kumar, Abhinav; Lalit, Adarsh; Sharma, Naresh C; Puranik, Rekha; Tuli, Rakesh

2013-06-01

A begomovirus and its associated alpha- and betasatellite were detected in tomato plants affected with leaf curl disease. Based on a nucleotide sequence identity of 99 %, this begomovirus was designated an isolate of cotton leaf curl Burewala virus (CLCuBuV). The alphasatellite exhibited 93 % sequence identity to cotton leaf curl Burewala alphasatellite (CLCuBuA) and is hence referred to here as a variant of CLCuBuA. The detected betasatellite was recombinant in nature and showed 70 % sequence identity to the known betasatellites. Inoculation of healthy tomato with CLCuBuV plus betasatellite, either in the presence or the absence of alphasatellite, led to typical leaf curling, while inoculation with CLCuBuV in the absence of betasatellite resulted in mild symptoms. This confirmed the role of the betasatellite in expression of disease symptoms. We propose to name the newly detected betasatellite tomato leaf curl Hajipur betasatellite (ToLCHJB).
Whole Genome Sequences of Three Treponema pallidum ssp. pertenue Strains: Yaws and Syphilis Treponemes Differ in Less than 0.2% of the Genome Sequence

PubMed Central

Chen, Lei; Pospíšilová, Petra; Strouhal, Michal; Qin, Xiang; Mikalová, Lenka; Norris, Steven J.; Muzny, Donna M.; Gibbs, Richard A.; Fulton, Lucinda L.; Sodergren, Erica; Weinstock, George M.; Šmajs, David

2012-01-01

Background The yaws treponemes, Treponema pallidum ssp. pertenue (TPE) strains, are closely related to syphilis causing strains of Treponema pallidum ssp. pallidum (TPA). Both yaws and syphilis are distinguished on the basis of epidemiological characteristics, clinical symptoms, and several genetic signatures of the corresponding causative agents. Methodology/Principal Findings To precisely define genetic differences between TPA and TPE, high-quality whole genome sequences of three TPE strains (Samoa D, CDC-2, Gauthier) were determined using next-generation sequencing techniques. TPE genome sequences were compared to four genomes of TPA strains (Nichols, DAL-1, SS14, Chicago). The genome structure was identical in all three TPE strains with similar length ranging between 1,139,330 bp and 1,139,744 bp. No major genome rearrangements were found when compared to the four TPA genomes. The whole genome nucleotide divergence (dA) between TPA and TPE subspecies was 4.7 and 4.8 times higher than the observed nucleotide diversity (π) among TPA and TPE strains, respectively, corresponding to 99.8% identity between TPA and TPE genomes. A set of 97 (9.9%) TPE genes encoded proteins containing two or more amino acid replacements or other major sequence changes. The TPE divergent genes were mostly from the group encoding potential virulence factors and genes encoding proteins with unknown function. Conclusions/Significance Hypothetical genes, with genetic differences, consistently found between TPE and TPA strains are candidates for syphilitic treponemes virulence factors. Seventeen TPE genes were predicted under positive selection, and eleven of them coded either for predicted exported proteins or membrane proteins suggesting their possible association with the cell surface. Sequence changes between TPE and TPA strains and changes specific to individual strains represent suitable targets for subspecies- and strain-specific molecular diagnostics. PMID:22292095
rRNA Gene Internal Transcribed Spacer 1 and 2 Sequences of Asexual, Anthropophilic Dermatophytes Related to Trichophyton rubrum

PubMed Central

Summerbell, R. C.; Haugland, R. A.; Li, A.; Gupta, A. K.

1999-01-01

The ribosomal region spanning the two internal transcribed spacer (ITS) regions and the 5.8S ribosomal DNA region was sequenced for asexual, anthropophilic dermatophyte species with morphological similarity to Trichophyton rubrum, as well as for members of the three previously delineated, related major clades in the T. mentagrophytes complex. Representative isolates of T. raubitschekii, T. fischeri, and T. kanei were found to have ITS sequences identical to that of T. rubrum. The ITS sequences of T. soudanense and T. megninii differed from that of T. rubrum by only a small number of base pairs. Their continued status as species, however, appears to meet criteria outlined in the population genetics-based cohesion species concept of A. R. Templeton. The ITS sequence of T. tonsurans differed from that of the biologically distinct T. equinum by only 1 bp, while the ITS sequence of the recently described species T. krajdenii had a sequence identical to that of T. mentagrophytes isolates related to the teleomorph Arthroderma vanbreuseghemii. PMID:10565922
Evidence of three new members of malignant catarrhal fever virus group in Muskox (Ovibos moschatus), Nubian ibex (Capra nubiana), and gemsbok (Oryx gazella)

USGS Publications Warehouse

Li, H.; Gailbreath, K.; Bender, L.C.; West, K.; Keller, J.; Crawford, T.B.

2003-01-01

Six members of the malignant catarrhal fever (MCF) virus group of ruminant rhadinoviruses have been identified to date. Four of these viruses are clearly associated with clinical disease: alcelaphine herpesvirus 1 (AlHV-1) carried by wildebeest (Connochaetes spp.); ovine herpesvirus 2 (OvHV-2), ubiquitous in domestic sheep; caprine herpesvirus 2 (CpHV-2), endemic in domestic goats; and the virus of unknown origin found causing classic MCF in white-tailed deer (Odocoileus virginianus; MCFV-WTD). Using serology and polymerase chain reaction with degenerate primers targeting a portion of the herpesviral DNA polymerase gene, evidence of three previously unrecognized rhadinoviruses in the MCF virus group was found in muskox (Ovibos moschatus), Nubian ibex (Capra nubiana), and gemsbok (South African oryx, Oryx gazella), respectively. Based on sequence alignment, the viral sequence in the muskox is most closely related to MCFV-WTD (81.5% sequence identity) and that in the Nubian ibex is closest to CpHV-2 (89.3% identity). The viral sequence in the gemsbok is most closely related to AlHV-1 (85.1% identity). No evidence of disease association with these viruses has been found. ?? Wildlife Disease Association 2003.
Evolution of the arginase fold and functional diversity

PubMed Central

Dowling, Daniel P.; Costanzo, Luigi Di; Gennadios, Heather A.; Christianson, David W.

2009-01-01

The large number of protein structures deposited in the Protein Data Bank allows for the identification of novel structural superfamilies based on conservation of fold in addition to conservation of amino acid sequence. Since sequence diverges more rapidly than fold in protein evolution, proteins with little or no significant sequence identity are occasionally observed to adopt similar folds, thereby reflecting unanticipated evolutionary relationships. Here, we review the unique α/β fold first observed in the manganese metalloenzyme rat liver arginase, consisting of a parallel 8 stranded β-sheet surrounded by several helices, and its evolutionary relationship with the zinc-requiring and/or iron-requiring histone deacetylases and acetylpolyamine amidohydrolases. Structural comparisons reveal key features of the core α/β fold that contribute to the divergent metal ion specificity and stoichiometry required for the chemical and biological functions of these enzymes. PMID:18360740
Genetic Characteristics of Coronaviruses from Korean Bats in 2016.

PubMed

Lee, Saemi; Jo, Seong-Deok; Son, Kidong; An, Injung; Jeong, Jipseol; Wang, Seung-Jun; Kim, Yongkwan; Jheong, Weonhwa; Oem, Jae-Ku

2018-01-01

Bats have increasingly been recognized as the natural reservoir of severe acute respiratory syndrome (SARS), coronavirus, and other coronaviruses found in mammals. However, little research has been conducted on bat coronaviruses in South Korea. In this study, bat samples (332 oral swabs, 245 fecal samples, 38 urine samples, and 57 bat carcasses) were collected at 33 natural bat habitat sites in South Korea. RT-PCR and sequencing were performed for specific coronavirus genes to identify the bat coronaviruses in different bat samples. Coronaviruses were detected in 2.7% (18/672) of the samples: 13 oral swabs from one species of the family Rhinolophidae, and four fecal samples and one carcass (intestine) from three species of the family Vespertiliodae. To determine the genetic relationships of the 18 sequences obtained in this study and previously known coronaviruses, the nucleotide sequences of a 392-nt region of the RNA-dependent RNA polymerase (RdRp) gene were analyzed phylogenetically. Thirteen sequences belonging to SARS-like betacoronaviruses showed the highest nucleotide identity (97.1-99.7%) with Bat-CoV-JTMC15 reported in China. The other five sequences were most similar to MERS-like betacoronaviruses. Four nucleotide sequences displayed the highest identity (94.1-95.1%) with Bat-CoV-HKU5 from Hong Kong. The one sequence from a carcass showed the highest nucleotide identity (99%) with Bat-CoV-SC2013 from China. These results suggest that careful surveillance of coronaviruses from bats should be continued, because animal and human infections may result from the genetic variants present in bat coronavirus reservoirs.
Homology-based Modeling of Rhodopsin-like Family Members in the Inactive State: Structural Analysis and Deduction of Tips for Modeling and Optimization.

PubMed

Pappalardo, Matteo; Rayan, Mahmoud; Abu-Lafi, Saleh; Leonardi, Martha E; Milardi, Danilo; Guccione, Salvatore; Rayan, Anwar

2017-08-01

Modeling G-Protein Coupled Receptors (GPCRs) is an emergent field of research, since utility of high-quality models in receptor structure-based strategies might facilitate the discovery of interesting drug candidates. The findings from a quantitative analysis of eighteen resolved structures of rhodopsin family "A" receptors crystallized with antagonists and 153 pairs of structures are described. A strategy termed endeca-amino acids fragmentation was used to analyze the structures models aiming to detect the relationship between sequence identity and Root Mean Square Deviation (RMSD) at each trans-membrane-domain. Moreover, we have applied the leave-one-out strategy to study the shiftiness likelihood of the helices. The type of correlation between sequence identity and RMSD was studied using the aforementioned set receptors as representatives of membrane proteins and 98 serine proteases with 4753 pairs of structures as representatives of globular proteins. Data analysis using fragmentation strategy revealed that there is some extent of correlation between sequence identity and global RMSD of 11AA width windows. However, spatial conservation is not always close to the endoplasmic side as was reported before. A comparative study with globular proteins shows that GPCRs have higher standard deviation and higher slope in the graph with correlation between sequence identity and RMSD. The extracted information disclosed in this paper could be incorporated in the modeling protocols while using technique for model optimization and refinement. © 2017 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.
Bradyrhizobium paxllaeri sp. nov. and Bradyrhizobium icense sp. nov., nitrogen-fixing rhizobial symbionts of Lima bean (Phaseolus lunatus L.) in Peru.

PubMed

Durán, David; Rey, Luis; Mayo, Juan; Zúñiga-Dávila, Doris; Imperial, Juan; Ruiz-Argüeso, Tomás; Martínez-Romero, Esperanza; Ormeño-Orrillo, Ernesto

2014-06-01

A group of strains isolated from root nodules of Phaseolus lunatus (Lima bean) in Peru were characterized by genotypic, genomic and phenotypic methods. All strains possessed identical 16S rRNA gene sequences that were 99.9% identical to that of Bradyrhizobium lablabi CCBAU 23086(T). Despite having identical 16S rRNA gene sequences, the Phaseolus lunatus strains could be divided into two clades by sequence analysis of recA, atpD, glnII, dnaK and gyrB genes. The genome sequence of a representative of each clade was obtained and compared to the genomes of closely related species of the genus Bradyrhizobium. Average nucleotide identity values below the species circumscription threshold were obtained when comparing the two clades to each other (88.6%) and with all type strains of the genus Bradyrhizobium (≤92.9%). Phenotypes distinguishing both clades from all described and closely related species of the genus Bradyrhizobium were found. On the basis of the results obtained, two novel species, Bradyrhizobium paxllaeri sp. nov. (type strain LMTR 21(T) = DSM 18454(T) = HAMBI 2911(T)) and Bradyrhizobium icense sp. nov. (type strain LMTR 13(T) = HAMBI 3584(T) = CECT 8509(T) = CNPSo 2583(T)), are proposed to accommodate the uncovered clades of Phaseolus lunatus bradyrhizobia. These species share highly related but distinct nifH and nodC symbiosis genes. © 2014 IUMS.
Cloning and Sequence Analysis of Vibrio halioticoli Genes Encoding Three Types of Polyguluronate Lyase.

PubMed

Sugimura; Sawabe; Ezura

2000-01-01

The alginate lyase-coding genes of Vibrio halioticoli IAM 14596(T), which was isolated from the gut of the abalone Haliotis discus hannai, were cloned using plasmid vector pUC 18, and expressed in Escherichia coli. Three alginate lyase-positive clones, pVHB, pVHC, and pVHE, were obtained, and all clones expressed the enzyme activity specific for polyguluronate. Three genes, alyVG1, alyVG2, and alyVG3, encoding polyguluronate lyase were sequenced: alyVG1 from pVHB was composed of a 1056-bp open reading frame (ORF) encoding 352 amino acid residues; alyVG2 gene from pVHC was composed of a 993-bp ORF encoding 331 amino acid residues; and alyVG3 gene from pVHE was composed of a 705-bp ORF encoding 235 amino acid residues. Comparison of nucleotide and deduced amino acid sequences among AlyVG1, AlyVG2, and AlyVG3 revealed low homologies. The identity value between AlyVG1 and AlyVG2 was 18.7%, and that between AlyVG2 and AlyVG3 was 17.0%. A higher identity value (26.0%) was observed between AlyVG1 and AlyVG3. Sequence comparison among known polyguluronate lyases including AlyVG1, AlyVG2, and AlyVG3 also did not reveal an identical region in these sequences. However, AlyVG1 showed the highest identity value (36.2%) and the highest similarity (73.3%) to AlyA from Klebsiella pneumoniae. A consensus region comprising nine amino acid (YFKAGXYXQ) in the carboxy-terminal region previously reported by Mallisard and colleagues was observed only in AlyVG1 and AlyVG2.
Identification of a third feline Demodex species through partial sequencing of the 16S rDNA and frequency of Demodex species in 74 cats using a PCR assay.

PubMed

Ferreira, Diana; Sastre, Natalia; Ravera, Iván; Altet, Laura; Francino, Olga; Bardagí, Mar; Ferrer, Lluís

2015-08-01

Demodex cati and Demodex gatoi are considered the two Demodex species of cats. However, several reports have identified Demodex mites morphologically different from these two species. The differentiation of Demodex mites is usually based on morphology, but within the same species different morphologies can occur. DNA amplification/sequencing has been used effectively to identify and differentiate Demodex mites in humans, dogs and cats. The aim was to develop a PCR technique to identify feline Demodex mites and use this technique to investigate the frequency of Demodex in cats. Demodex cati, D. gatoi and Demodex mites classified morphologically as the third unnamed feline species were obtained. Hair samples were taken from 74 cats. DNA was extracted; a 330 bp fragment of the 16S rDNA was amplified and sequenced. The sequences of D. cati and D. gatoi shared >98% identity with those published on GenBank. The sequence of the third unnamed species showed 98% identity with a recently published feline Demodex sequence and only 75.2 and 70.9% identity with D. gatoi and D. cati sequences, respectively. Demodex DNA was detected in 19 of 74 cats tested; 11 DNA sequences corresponded to Demodex canis, five to Demodex folliculorum, three to D. cati and two to Demodex brevis. Three Demodex species can be found in cats, because the third unnamed Demodex species is likely to be a distinct species. Apart from D. cati and D. gatoi, DNA from D. canis, D. folliculorum and D. brevis was found on feline skin. © 2015 ESVD and ACVD.
Piscine reovirus: Genomic and molecular phylogenetic analysis from farmed and wild salmonids collected on the Canada/US Pacific Coast

USGS Publications Warehouse

Siah, Ahmed; Morrison, Diane B.; Fringuelli, Elena; Savage, Paul S.; Richmond, Zina; Purcell, Maureen K.; Johns, Robert; Johnson, Stewart C.; Sakasida, Sonja M.

2015-01-01

Piscine reovirus (PRV) is a double stranded non-enveloped RNA virus detected in farmed and wild salmonids. This study examined the phylogenetic relationships among different PRV sequence types present in samples from salmonids in Western Canada and the US, including Alaska (US), British Columbia (Canada) and Washington State (US). Tissues testing positive for PRV were partially sequenced for segment S1, producing 71 sequences that grouped into 10 unique sequence types. Sequence analysis revealed no identifiable geographical or temporal variation among the sequence types. Identical sequence types were found in fish sampled in 2001, 2005 and 2014. In addition, PRV positive samples from fish derived from Alaska, British Columbia and Washington State share identical sequence types. Comparative analysis of the phylogenetic tree indicated that Canada/US Pacific Northwest sequences formed a subgroup with some Norwegian sequence types (group II), distinct from other Norwegian and Chilean sequences (groups I, III and IV). Representative PRV positive samples from farmed and wild fish in British Columbia and Washington State were subjected to genome sequencing using next generation sequencing methods. Individual analysis of each of the 10 partial segments indicated that the Canadian and US PRV sequence types clustered separately from available whole genome sequences of some Norwegian and Chilean sequences for all segments except the segment S4. In summary, PRV was genetically homogenous over a large geographic distance (Alaska to Washington State), and the sequence types were relatively stable over a 13 year period.

Piscine Reovirus: Genomic and Molecular Phylogenetic Analysis from Farmed and Wild Salmonids Collected on the Canada/US Pacific Coast

PubMed Central

Siah, Ahmed; Morrison, Diane B.; Fringuelli, Elena; Savage, Paul; Richmond, Zina; Johns, Robert; Purcell, Maureen K.; Johnson, Stewart C.; Saksida, Sonja M.

2015-01-01

Piscine reovirus (PRV) is a double stranded non-enveloped RNA virus detected in farmed and wild salmonids. This study examined the phylogenetic relationships among different PRV sequence types present in samples from salmonids in Western Canada and the US, including Alaska (US), British Columbia (Canada) and Washington State (US). Tissues testing positive for PRV were partially sequenced for segment S1, producing 71 sequences that grouped into 10 unique sequence types. Sequence analysis revealed no identifiable geographical or temporal variation among the sequence types. Identical sequence types were found in fish sampled in 2001, 2005 and 2014. In addition, PRV positive samples from fish derived from Alaska, British Columbia and Washington State share identical sequence types. Comparative analysis of the phylogenetic tree indicated that Canada/US Pacific Northwest sequences formed a subgroup with some Norwegian sequence types (group II), distinct from other Norwegian and Chilean sequences (groups I, III and IV). Representative PRV positive samples from farmed and wild fish in British Columbia and Washington State were subjected to genome sequencing using next generation sequencing methods. Individual analysis of each of the 10 partial segments indicated that the Canadian and US PRV sequence types clustered separately from available whole genome sequences of some Norwegian and Chilean sequences for all segments except the segment S4. In summary, PRV was genetically homogenous over a large geographic distance (Alaska to Washington State), and the sequence types were relatively stable over a 13 year period. PMID:26536673
Genotypic analysis of Mucor from the platypus in Australia.

PubMed

Connolly, J H; Stodart, B J; Ash, G J

2010-01-01

Mucor amphibiorum is the only pathogen known to cause significant morbidity and mortality in the free-living platypus (Ornithorhynchus anatinus) in Tasmania. Infection has also been reported in free-ranging cane toads (Bufo marinus) and green tree frogs (Litoria caerulea) from mainland Australia but has not been confirmed in platypuses from the mainland. To date, there has been little genotyping specifically conducted on M. amphibiorum. A collection of 21 Mucor isolates representing isolates from the platypus, frogs and toads, and environmental samples were obtained for genotypic analysis. Internal transcribed spacer (ITS) region sequencing and GenBank comparison confirmed the identity of most of the isolates. Representative isolates from infected platypuses formed a clade containing the reference isolates of M. amphibiorum from the Centraal Bureau voor Schimmelcultures repository. The M. amphibiorum isolates showed a close sequence identity with Mucor indicus and consisted of two haplotypes, differentiated by single nucleotide polymorphisms within the ITS1 and ITS2 regions. With the exception of isolate 96-4049, all isolates from platypuses were in one haplotype. Multilocus fingerprinting via the use of intersimple sequence repeats polymerase chain reaction identified 19 genotypes. Two major clusters were evident: 1) M. amphibiorum and Mucor racemosus; and 2) Mucor circinelloides, Mucor ramosissimus, and Mucor fragilis. Seven M. amphibiorum isolates from platypuses were present in two subclusters, with isolate 96-4053 appearing genetically distinct from all other isolates. Isolates classified as M. circinelloides by sequence analysis formed a separate subcluster, distinct from other Mucor spp. The combination of sequencing and multilocus fingerprinting has the potential to provide the tools for rapid identification of M. amphibiorum. Data presented on the diversity of the pathogen and further work in linking genetic diversity to functional diversity will provide critical information for its management in Tasmanian river systems.
Diversity in VP3, NSP3, and NSP4 of rotavirus B detected from Japanese cattle.

PubMed

Hayashi-Miyamoto, Michiko; Murakami, Toshiaki; Minami-Fukuda, Fujiko; Tsuchiaka, Shinobu; Kishimoto, Mai; Sano, Kaori; Naoi, Yuki; Asano, Keigo; Ichimaru, Toru; Haga, Kei; Omatsu, Tsutomu; Katayama, Yukie; Oba, Mami; Aoki, Hiroshi; Shirai, Junsuke; Ishida, Motohiko; Katayama, Kazuhiko; Mizutani, Tetsuya; Nagai, Makoto

2017-04-01

Bovine rotavirus B (RVB) is an etiological agent of diarrhea mostly in adult cattle. Currently, a few sequences of viral protein (VP)1, 2, 4, 6, and 7 and nonstructural protein (NSP)1, 2, and 5 of bovine RVB are available in the DDBJ/EMBL/GenBank databases, and none have been reported for VP3, NSP3, and NSP4. In order to fill this gap in the genetic characterization of bovine RVB strains, we used a metagenomics approach and sequenced and analyzed the complete coding sequences (CDS) of VP3, NSP3, and NSP4 genes, as well as the partial or complete CDS of other genes of RVBs detected from Japanese cattle. VP3, NSP3, and NSP4 of bovine RVBs shared low nucleotide sequence identities (63.3-64.9% for VP3, 65.9-68.2% for NSP3, and 52.6-56.2% for NSP4) with those of murine, human, and porcine RVBs, suggesting that bovine RVBs belong to a novel genotype. Furthermore, significantly low amino acid sequence identities were observed for NSP4 (36.1-39.3%) between bovine RVBs and the RVBs of other species. In contrast, hydrophobic plot analysis of NSP4 revealed profiles similar to those of RVBs of other species and rotavirus A (RVA) strains. Phylogenetic analyses of all gene segments revealed that bovine RVB strains formed a cluster that branched distantly from other RVBs. These results suggest that bovine RVBs have evolved independently from other RVBs but in a similar manner to other rotaviruses. These findings provide insights into the evolution and diversity of RVB strains. Copyright © 2017 Elsevier B.V. All rights reserved.
Evidence for Widespread Reticulate Evolution within Human Duplicons

PubMed Central

Jackson, Michael S. ; Oliver, Karen ; Loveland, Jane ; Humphray, Sean ; Dunham, Ian ; Rocchi, Mariano ; Viggiano, Luigi ; Park, Jonathan P. ; Hurles, Matthew E. ; Santibanez-Koref, Mauro

2005-01-01

Approximately 5% of the human genome consists of segmental duplications that can cause genomic mutations and may play a role in gene innovation. Reticulate evolutionary processes, such as unequal crossing-over and gene conversion, are known to occur within specific duplicon families, but the broader contribution of these processes to the evolution of human duplications remains poorly characterized. Here, we use phylogenetic profiling to analyze multiple alignments of 24 human duplicon families that span >8 Mb of DNA. Our results indicate that none of them are evolving independently, with all alignments showing sharp discontinuities in phylogenetic signal consistent with reticulation. To analyze these results in more detail, we have developed a quartet method that estimates the relative contribution of nucleotide substitution and reticulate processes to sequence evolution. Our data indicate that most of the duplications show a highly significant excess of sites consistent with reticulate evolution, compared with the number expected by nucleotide substitution alone, with 15 of 30 alignments showing a >20-fold excess over that expected. Using permutation tests, we also show that at least 5% of the total sequence shares 100% sequence identity because of reticulation, a figure that includes 74 independent tracts of perfect identity >2 kb in length. Furthermore, analysis of a subset of alignments indicates that the density of reticulation events is as high as 1 every 4 kb. These results indicate that phylogenetic relationships within recently duplicated human DNA can be rapidly disrupted by reticulate evolution. This finding has important implications for efforts to finish the human genome sequence, complicates comparative sequence analysis of duplicon families, and could profoundly influence the tempo of gene-family evolution. PMID:16252241
FunGene: the functional gene pipeline and repository.

PubMed

Fish, Jordan A; Chai, Benli; Wang, Qiong; Sun, Yanni; Brown, C Titus; Tiedje, James M; Cole, James R

2013-01-01

Ribosomal RNA genes have become the standard molecular markers for microbial community analysis for good reasons, including universal occurrence in cellular organisms, availability of large databases, and ease of rRNA gene region amplification and analysis. As markers, however, rRNA genes have some significant limitations. The rRNA genes are often present in multiple copies, unlike most protein-coding genes. The slow rate of change in rRNA genes means that multiple species sometimes share identical 16S rRNA gene sequences, while many more species share identical sequences in the short 16S rRNA regions commonly analyzed. In addition, the genes involved in many important processes are not distributed in a phylogenetically coherent manner, potentially due to gene loss or horizontal gene transfer. While rRNA genes remain the most commonly used markers, key genes in ecologically important pathways, e.g., those involved in carbon and nitrogen cycling, can provide important insights into community composition and function not obtainable through rRNA analysis. However, working with ecofunctional gene data requires some tools beyond those required for rRNA analysis. To address this, our Functional Gene Pipeline and Repository (FunGene; http://fungene.cme.msu.edu/) offers databases of many common ecofunctional genes and proteins, as well as integrated tools that allow researchers to browse these collections and choose subsets for further analysis, build phylogenetic trees, test primers and probes for coverage, and download aligned sequences. Additional FunGene tools are specialized to process coding gene amplicon data. For example, FrameBot produces frameshift-corrected protein and DNA sequences from raw reads while finding the most closely related protein reference sequence. These tools can help provide better insight into microbial communities by directly studying key genes involved in important ecological processes.
Single sea urchin phagocytes express messages of a single sequence from the diverse Sp185/333 gene family in response to bacterial challenge.

PubMed

Majeske, Audrey J; Oren, Matan; Sacchi, Sandro; Smith, L Courtney

2014-12-01

Immune systems in animals rely on fast and efficient responses to a wide variety of pathogens. The Sp185/333 gene family in the purple sea urchin, Strongylocentrotus purpuratus, consists of an estimated 50 (±10) members per genome that share a basic gene structure but show high sequence diversity, primarily due to the mosaic appearance of short blocks of sequence called elements. The genes show significantly elevated expression in three subpopulations of phagocytes responding to marine bacteria. The encoded Sp185/333 proteins are highly diverse and have central effector functions in the immune system. In this study we report the Sp185/333 gene expression in single sea urchin phagocytes. Sea urchins challenged with heat-killed marine bacteria resulted in a typical increase in coelomocyte concentration within 24 h, which included an increased proportion of phagocytes expressing Sp185/333 proteins. Phagocyte fractions enriched from coelomocytes were used in limiting dilutions to obtain samples of single cells that were evaluated for Sp185/333 gene expression by nested RT-PCR. Amplicon sequences showed identical or nearly identical Sp185/333 amplicon sequences in single phagocytes with matches to six known Sp185/333 element patterns, including both common and rare element patterns. This suggested that single phagocytes show restricted expression from the Sp185/333 gene family and infers a diverse, flexible, and efficient response to pathogens. This type of expression pattern from a family of immune response genes in single cells has not been identified previously in other invertebrates. Copyright © 2014 by The American Association of Immunologists, Inc.
A first report and complete genome sequence of alfalfa enamovirus from Sudan

USDA-ARS?s Scientific Manuscript database

A full genome sequence of a viral pathogen, provisionally named alfalfa enamovirus 2 (AEV-2), was reconstructed from short reads obtained by Illumina RNA sequencing of alfalfa sample originating from Sudan. Ambiguous nucleotides in the resultant consensus assembly and identity of the predicted virus...
A three-layered model of primate prefrontal cortex encodes identity and abstract categorical structure of behavioral sequences.

PubMed

Hinaut, Xavier; Dominey, Peter Ford

2011-01-01

Categorical encoding is crucial for mastering large bodies of related sensory-motor experiences, but what is its neural substrate? In an effort to respond to this question, recent single-unit recording studies in the macaque lateral prefrontal cortex (LPFC) have demonstrated two characteristic forms of neural encoding of the sequential structure of the animal's sensory-motor experience. One population of neurons encodes the specific behavioral sequences. A second population of neurons encodes the sequence category (e.g. ABAB, AABB or AAAA) and does not differentiate sequences within the category (Shima, K., Isoda, M., Mushiake, H., Tanji, J., 2007. Categorization of behavioural sequences in the prefrontal cortex. Nature 445, 315-318.). Interestingly these neurons are intermingled in the lateral prefrontal cortex, and not topographically segregated. Thus, LPFC may provide a neurophysiological basis for sensorimotor categorization. Here we report on a neural network simulation study that reproduces and explains these results. We model a cortical circuit composed of three layers (infragranular, granular, and supragranular) of 5*5 leaky integrator neurons with a sigmoidal output function, and we examine 1000 such circuits running in parallel. Crucially the three layers are interconnected with recurrent connections, thus producing a dynamical system that is inherently sensitive to the spatiotemporal structure of the sequential inputs. The model is presented with 11 four-element sequences following Shima et al. We isolated one subpopulation of neurons each of whose activity predicts individual sequences, and a second population that predicts category independent of the specific sequence. We argue that a richly interconnected cortical circuit is capable of internally generating a neural representation of category membership, thus significantly extending the scope of recurrent network computation. In order to demonstrate that these representations can be used to create an explicit categorization capability, we introduced an additional neural structure corresponding to the striatum. We showed that via cortico-striatal plasticity, neurons in the striatum could produce an explicit representation both of the identity of each sequence, and its category membership. Copyright © 2011 Elsevier Ltd. All rights reserved.
Cloning and sequencing of the allophycocyanin genes from Spirulina maxima (Cyanophyta)

NASA Astrophysics Data System (ADS)

Qin, Song; Hiroyuki, Kojima; Yoshikazu, Kawata; Shin-Ichi, Yano; Zeng, Cheng-Kui

1998-03-01

The genes coding for the α-and β-subunit of allophycocyanin ( apcA and apcB) from the cyanophyte Spirulina maxima were cloned and sequenced. The results revealed 44.4% of nucleotide sequence similarity and 30.4% of similarity of deduced amino acid sequence between them. The amino acid sequence identities between S. maxima and S. platensis are 99.4% for α subunit and 100% for β subunit.
Corruption of genomic databases with anomalous sequence.

PubMed

Lamperti, E D; Kittelberger, J M; Smith, T F; Villa-Komaroff, L

1992-06-11

We describe evidence that DNA sequences from vectors used for cloning and sequencing have been incorporated accidentally into eukaryotic entries in the GenBank database. These incorporations were not restricted to one type of vector or to a single mechanism. Many minor instances may have been the result of simple editing errors, but some entries contained large blocks of vector sequence that had been incorporated by contamination or other accidents during cloning. Some cases involved unusual rearrangements and areas of vector distant from the normal insertion sites. Matches to vector were found in 0.23% of 20,000 sequences analyzed in GenBank Release 63. Although the possibility of anomalous sequence incorporation has been recognized since the inception of GenBank and should be easy to avoid, recent evidence suggests that this problem is increasing more quickly than the database itself. The presence of anomalous sequence may have serious consequences for the interpretation and use of database entries, and will have an impact on issues of database management. The incorporated vector fragments described here may also be useful for a crude estimate of the fidelity of sequence information in the database. In alignments with well-defined ends, the matching sequences showed 96.8% identity to vector; when poorer matches with arbitrary limits were included, the aggregate identity to vector sequence was 94.8%.
Gene encoding the group B streptococcal protein R4, its presence in clinical reference laboratory isolates & R4 protein pepsin sensitivity.

PubMed

Smith, B L; Flores, A; Dechaine, J; Krepela, J; Bergdall, A; Ferrieri, P

2004-05-01

R proteins were first identified by Lancefield in group B Streptococcus (GBS) as resistant to trypsin at pH8 and sensitive to pepsin at pH2. The R4 protein found predominantly in type III and some type II and V invasive isolates conforms to these criteria. The Rib protein, although structurally and epidemiologically similar to R4, was reported as resistant to both proteases. We report here the gene encoding the R4 protein from a type III group B streptococcal isolate (76-043) well characterized in our laboratory. Trypsin extracted GBS proteins were assayed for protease sensitivities by double-diffusion Ouchterlony using varying conditions for the enzyme pepsin. Standard haemoglobin assay was used to examine pepsin enzymatic activity. Thirty clinical isolates of varying protein profiles identified by double-diffusion from our reference strain laboratory were screened by PCR and Southern technique. SDS-PAGE gel purified R4 amino acid sequences were determined and used to design oligonucleotide primers for screening a 76-043 genomic library. R4 was sensitive to pepsin at pH2 but appeared resistant at pH4, the reported pH used for Rib. By standard haemoglobin assay and trypsin extract studies of R4 protein, pepsin was shown to be active at pH2, yet easily inactivated; assays of GBS surface proteins are critical at pH2. Of the amino acids initially sequenced from R4, 88 per cent (61/69) showed identity to Rib; the r4 nucleotide sequence was identical to that of rib. All isolates with strong positive protein reactions for R4 were positive in both PCR and Southern technique, whereas isolates expressing alpha, beta, R1/R4, and R5 (BPS) protein profiles were not. Sequenced PCR products aligned with identity to the R4 and Rib nucleotide sequences and confirmed the identity of these proteins and their molecular sequences.
Molecular characterization of nucleopolyhedrovirus of three lepidopteran pests using late expression factor-8 gene.

PubMed

Jose, Jency; Jalali, S K; Shivalingaswamy, T M; Kumar, N K Krishna; Bhatnagar, R; Bandyopadhyay, A

2013-06-01

A PCR based method for detection of viral DNA in nucleopolyhedrovirus of three lepidopterans, Spodoptera litura, Amsacta albistriga and Helicoverpa armigera, was developed by employing the late expression factor-8 (lef-8) gene of three NPV using specific primers. The amplicons of 689, 699 and 665 bp were amplified, respectively, and the nucleotide sequences were submitted to GenBank and the accession numbers were obtained. The sequences of lef-8 gene of S. litura NPV and H. armigera NPV matched with those of their respective references in the GenBank database, thereby confirming their identity, however, the sequence of A. albistriga NPV was the first sequence submitted to the GenBank database. The sequence similarity analysis between the three lef-8 gene of NPV sequenced in the present study revealed that there was no significant similarity between them, however A. albistriga NPV and S. litura NPV were found to be closely related. CLUSTAL alignment of the sequences generated revealed general relatedness among NPVs lef-8 gene. The study confirmed that lef-8 gene can be used for quick and correct discriminatory identification of insect viruses.
HIV drug resistance testing among patients failing second line antiretroviral therapy. Comparison of in-house and commercial sequencing.

PubMed

Chimukangara, Benjamin; Varyani, Bhavini; Shamu, Tinei; Mutsvangwa, Junior; Manasa, Justen; White, Elizabeth; Chimbetete, Cleophas; Luethy, Ruedi; Katzenstein, David

2017-05-01

HIV genotyping is often unavailable in low and middle-income countries due to infrastructure requirements and cost. We compared genotype resistance testing in patients with virologic failure, by amplification of HIV pol gene, followed by "in-house" sequencing and commercial sequencing. Remnant plasma samples from adults and children failing second-line ART were amplified and sequenced using in-house and commercial di-deoxysequencing, and analyzed in Harare, Zimbabwe and at Stanford, U.S.A, respectively. HIV drug resistance mutations were determined using the Stanford HIV drug resistance database. Twenty-six of 28 samples were amplified and 25 were successfully genotyped. Comparison of average percent nucleotide and amino acid identities between 23 pairs sequenced in both laboratories were 99.51 (±0.56) and 99.11 (±0.95), respectively. All pairs clustered together in phylogenetic analysis. Sequencing analysis identified 6/23 pairs with mutation discordances resulting in differences in phenotype, but these did not impact future regimens. The results demonstrate our ability to produce good quality drug resistance data in-house. Despite discordant mutations in some sequence pairs, the phenotypic predictions were not clinically significant. Copyright © 2016 Elsevier B.V. All rights reserved.
Comparisons of Highly Virulent H5N1 Influenza A Viruses Isolated from Humans and Chickens from Hong Kong

PubMed Central

Suarez, David L.; Perdue, Michael L.; Cox, Nancy; Rowe, Thomas; Bender, Catherine; Huang, Jing; Swayne, David E.

1998-01-01

Genes of an influenza A (H5N1) virus from a human in Hong Kong isolated in May 1997 were sequenced and found to be all avian-like (K. Subbarao et al., Science 279:393–395, 1998). Gene sequences of this human isolate were compared to those of a highly pathogenic chicken H5N1 influenza virus isolated from Hong Kong in April 1997. Sequence comparisons of all eight RNA segments from the two viruses show greater than 99% sequence identity between them. However, neither isolate’s gene sequence was closely (>95% sequence identity) related to any other gene sequences found in the GenBank database. Phylogenetic analysis demonstrated that the nucleotide sequences of at least four of the eight RNA segments clustered with Eurasian origin avian influenza viruses. The hemagglutinin gene phylogenetic analysis also included the sequences from an additional three human and two chicken H5N1 virus isolates from Hong Kong, and the isolates separated into two closely related groups. However, no single amino acid change separated the chicken origin and human origin isolates, but they all contained multiple basic amino acids at the hemagglutinin cleavage site, which is associated with a highly pathogenic phenotype in poultry. In experimental intravenous inoculation studies with chickens, all seven viruses were highly pathogenic, killing most birds within 24 h. All infected chickens had virtually identical pathologic lesions, including moderate to severe diffuse edema and interstitial pneumonitis. Viral nucleoprotein was most frequently demonstrated in vascular endothelium, macrophages, heterophils, and cardiac myocytes. Asphyxiation from pulmonary edema and generalized cardiovascular collapse were the most likely pathogenic mechanisms responsible for illness and death. In summary, a small number of changes in hemagglutinin gene sequences defined two closely related subgroups, with both subgroups having human and chicken members, among the seven viruses examined from Hong Kong, and all seven viruses were highly pathogenic in chickens and caused similar lesions in experimental inoculations. PMID:9658115
Molecular variation and distribution of Anopheles fluviatilis (Diptera: Culicidae) complex in Iran.

PubMed

Naddaf, Saied Reza; Razavi, Mohammad Reza; Bahramali, Golnaz

2010-09-01

Anopheles fluviatilis James (Diptera: Culicidae) is one of the known malaria vectors in south and southeastern Iran. Earlier ITS2 sequences analysis of specimens from Iran demonstrated only a single genotype that was identical to species Y in India, which is also the same as species T. We identified 2 haplotypes in the An. fluviatilis populations of Iran based on differences in nucleotide sequences of D3 domain of the 28S locus of ribosomal DNA (rDNA). Comparison of sequence data from 44 Iranian specimens with those publicly available in the Genbank database showed that all of the 28S-D3 sequences from Kazeroun and Khesht regions in Fars Province were identical to the database entry representing species U in India. In other regions, all the individuals showed heterozygosity at the single nucleotide position, which identifies species U and T. It is argued that the 2 species may co-occur in some regions and hybridize; however, the heterozygosity in the 28S-D3 locus was not reflected in ITS2 sequences and this locus for all individuals was identical to species T. This study shows that in a newly diverged species, like members of An. fluviatilis complex, a single molecular marker may not be sufficiently discriminatory to identify all the taxa over a vast geographical area. In addition, other molecular markers may provide more reliable information for species discrimination.
Molecular homogeneity of heat-stable enterotoxins produced by bovine enterotoxigenic Escherichia coli.

PubMed Central

Saeed, A M; Magnuson, N S; Sriranganathan, N; Burger, D; Cosand, W

1984-01-01

Heat-stable enterotoxins (STs) from four strains of bovine enterotoxigenic Escherichia coli representing four serogroups were purified to homogeneity by utilizing previously published purification schemata. Biochemical characterization of the purified STs showed that they met the basic criteria for the heat-stable enterotoxins of E. coli. Amino acid analysis of the purified STs revealed that they were peptides of identical amino acid composition. This composition consisted of 18 residues of 10 different amino acids, 6 of which were cysteine. The amino acid composition of the four ST peptides was identical to that reported for the STs of human and porcine E. coli. In addition, complete sequence analysis of two of the ST peptides and partial sequencing of several others revealed strong homology to the sequences of STs from human and porcine E. coli and to the sequence predicted from the last 18 codons of the transposon Tn1681. There was also substantial homology to the sequence predicted from the ST-coding genetic element of human E. coli, which may indicate the existence of identical bioactive configuration among ST peptides of E. coli strains of various host origins. These data support the hypothesis that STs produced by human, bovine, and porcine E. coli are coded by a closely related genetic element which may have originated from a single, widely disseminated transposon. Images PMID:6376355
RT-PCR and sequence analysis of the full-length fusion protein of Canine Distemper Virus from domestic dogs.

PubMed

Romanutti, Carina; Gallo Calderón, Marina; Keller, Leticia; Mattion, Nora; La Torre, José

2016-02-01

During 2007-2014, 84 out of 236 (35.6%) samples from domestic dogs submitted to our laboratory for diagnostic purposes were positive for Canine Distemper Virus (CDV), as analyzed by RT-PCR amplification of a fragment of the nucleoprotein gene. Fifty-nine of them (70.2%) were from dogs that had been vaccinated against CDV. The full-length gene encoding the Fusion (F) protein of fifteen isolates was sequenced and compared with that of those of other CDVs, including wild-type and vaccine strains. Phylogenetic analysis using the F gene full-length sequences grouped all the Argentinean CDV strains in the SA2 clade. Sequence identity with the Onderstepoort vaccine strain was 89.0-90.6%, and the highest divergence was found in the 135 amino acids corresponding to the F protein signal-peptide, Fsp (64.4-66.7% identity). In contrast, this region was highly conserved among the local strains (94.1-100% identity). One extra putative N-glycosylation site was identified in the F gene of CDV Argentinean strains with respect to the vaccine strain. The present report is the first to analyze full-length F protein sequences of CDV strains circulating in Argentina, and contributes to the knowledge of molecular epidemiology of CDV, which may help in understanding future disease outbreaks. Copyright © 2015 Elsevier B.V. All rights reserved.
Novel molecular approach to define pest species status and tritrophic interactions from historical Bemisia specimens.

PubMed

Tay, W T; Elfekih, S; Polaszek, A; Court, L N; Evans, G A; Gordon, K H J; De Barro, P J

2017-03-27

Museum specimens represent valuable genomic resources for understanding host-endosymbiont/parasitoid evolutionary relationships, resolving species complexes and nomenclatural problems. However, museum collections suffer DNA degradation, making them challenging for molecular-based studies. Here, the mitogenomes of a single 1912 Sri Lankan Bemisia emiliae cotype puparium, and of a 1942 Japanese Bemisia puparium are characterised using a Next-Generation Sequencing approach. Whiteflies are small sap-sucking insects including B. tabaci pest species complex. Bemisia emiliae's draft mitogenome showed a high degree of homology with published B. tabaci mitogenomes, and exhibited 98-100% partial mitochondrial DNA Cytochrome Oxidase I (mtCOI) gene identity with the B. tabaci species known as Asia II-7. The partial mtCOI gene of the Japanese specimen shared 99% sequence identity with the Bemisia 'JpL' genetic group. Metagenomic analysis identified bacterial sequences in both Bemisia specimens, while hymenopteran sequences were also identified in the Japanese Bemisia puparium, including complete mtCOI and rRNA genes, and various partial mtDNA genes. At 88-90% mtCOI sequence identity to Aphelinidae wasps, we concluded that the 1942 Bemisia nymph was parasitized by an Eretmocerus parasitoid wasp. Our approach enables the characterisation of genomes and associated metagenomic communities of museum specimens using 1.5 ng gDNA, and to infer historical tritrophic relationships in Bemisia whiteflies.
Limited Genetic Diversity Preceded Extinction of the Tasmanian Tiger

PubMed Central

Menzies, Brandon R.; Renfree, Marilyn B.; Heider, Thomas; Mayer, Frieder; Hildebrandt, Thomas B.; Pask, Andrew J.

2012-01-01

The Tasmanian tiger or thylacine was the largest carnivorous marsupial when Europeans first reached Australia. Sadly, the last known thylacine died in captivity in 1936. A recent analysis of the genome of the closely related and extant Tasmanian devil demonstrated limited genetic diversity between individuals. While a similar lack of diversity has been reported for the thylacine, this analysis was based on just two individuals. Here we report the sequencing of an additional 12 museum-archived specimens collected between 102 and 159 years ago. We examined a portion of the mitochondrial DNA hyper-variable control region and determined that all sequences were on average 99.5% identical at the nucleotide level. As a measure of accuracy we also sequenced mitochondrial DNA from a mother and two offspring. As expected, these samples were found to be 100% identical, validating our methods. We also used 454 sequencing to reconstruct 2.1 kilobases of the mitochondrial genome, which shared 99.91% identity with the two complete thylacine mitochondrial genomes published previously. Our thylacine genomic data also contained three highly divergent putative nuclear mitochondrial sequences, which grouped phylogenetically with the published thylacine mitochondrial homologs but contained 100-fold more polymorphisms than the conserved fragments. Together, our data suggest that the thylacine population in Tasmania had limited genetic diversity prior to its extinction, possibly as a result of their geographic isolation from mainland Australia approximately 10,000 years ago. PMID:22530022
The "expanding universe" of piroplasms.

PubMed

Criado-Fornelio, A; Gónzalez-del-Río, M A; Buling-Saraña, A; Barba-Carretero, J C

2004-02-06

The present paper is the continuation of our previous studies dealing with the genetic characterization of piroplasmid species found in southern Europe. We report in this work new data concerning sequences of the 18s rRNA gene in Spanish piroplasms not studied (or not totally sequenced) in our former surveys. Molecular data analysis indicated that Spanish Cytauxzoon felis (cat isolate) has 98% identity with Cytauxzoon sp. from Mongolia and 95% identity compared to African C. felis. There are at least two main genetic variants of Babesia caballi in Spain: The first variety (isolate Spain 1) shows a relatively low homology with the African genotype (97% identity). The second variety (represented by two isolates, Spain 2 and Spain 3, differing by a single base) shows high genetic similarity with the African genotype (99.7-100% identity). There are also two genetic variants of Babesia equi (isolates Spain 1 and Spain 2, differing by four bases) in Spain, sharing 99% identity with the African genotype. At least one of them (Spain 1) can infect dogs. All of the phylogenetic analysis procedures employed indicated that Spanish isolates of C. felis, B. caballi (Spain 1) and B. equi (Spain 1 and Spain 2) are genetically different from their African relatives, all those dichotomies showing very high bootstrap support. Nonetheless, the lack of information on their morphology and the fact that the sequences were obtained in a single isolate preclude any conclusion about their definitive taxonomic status.

Investigation of Hydrological Response of Three Identical Artificial Hillslopes at the Landscape Evolution Observatory

NASA Astrophysics Data System (ADS)

Matos, K.; Alves Meira Neto, A.; Troch, P. A. A.; Volkmann, T.

2017-12-01

Hydrological processes at the hillslope scale are complex and heterogeneous, but monitoring hillslopes with a large number of sensors or replicate experimental designs is rarely feasible. The Landscape Evolution Observatory (LEO) at Biosphere 2 consists of three replicated, large (330 m2) artificial hillslopes (East, Center and West) packed with 1-m depth of initially homogeneous, basaltic soil. Each landscape contains a spatially dense network of sensors capable of resolving meter-scale lateral heterogeneity and sub-meter scale vertical heterogeneity in moisture content and water potential, as well as the hillslope-integrated water balance components. A sophisticated irrigation system allows performing controlled forcing experiments. The three hillslopes are thought to be nearly identical, however recent data showed significant differences in discharge and storage behavior. A 45-day periodic-steady-state tracer experiment was conducted in November and December of 2016, where a 3.5-day long, identical irrigation sequence was repeated 15 times. Each sequence's rainfall, runoff, and storage dynamics were recorded, and distributed moisture characteristics were derived using paired moisture content and matric potential data from 496 positions in each hillslope. In order to understand why the three hillslopes behave hydrologically different, we analyzed soil water retention characteristics at various scales ranging from individually paired moisture and matric potential to whole-hillslope soil water retention characteristics. The results confirm the distinct hydrological behavior between the three hillslopes. The East and West hillslopes behave more similar with respect to the release of water. In contrast, the East and Center hillslopes are more similar with respect to their storage behavior. The differences in hillslope behavior arising from three identically built hillslopes are a surprising and beneficial opportunity to explore how differences in small-scale heterogeneity can impact hydrological dynamics at the hillslope scale.
Comparative genomic analysis of bacteriophages specific to the channel catfish pathogen Edwardsiella ictaluri

PubMed Central

2011-01-01

Background The bacterial pathogen Edwardsiella ictaluri is a primary cause of mortality in channel catfish raised commercially in aquaculture farms. Additional treatment and diagnostic regimes are needed for this enteric pathogen, motivating the discovery and characterization of bacteriophages specific to E. ictaluri. Results The genomes of three Edwardsiella ictaluri-specific bacteriophages isolated from geographically distant aquaculture ponds, at different times, were sequenced and analyzed. The genomes for phages eiAU, eiDWF, and eiMSLS are 42.80 kbp, 42.12 kbp, and 42.69 kbp, respectively, and are greater than 95% identical to each other at the nucleotide level. Nucleotide differences were mostly observed in non-coding regions and in structural proteins, with significant variability in the sequences of putative tail fiber proteins. The genome organization of these phages exhibit a pattern shared by other Siphoviridae. Conclusions These E. ictaluri-specific phage genomes reveal considerable conservation of genomic architecture and sequence identity, even with considerable temporal and spatial divergence in their isolation. Their genomic homogeneity is similarly observed among E. ictaluri bacterial isolates. The genomic analysis of these phages supports the conclusion that these are virulent phages, lacking the capacity for lysogeny or expression of virulence genes. This study contributes to our knowledge of phage genomic diversity and facilitates studies on the diagnostic and therapeutic applications of these phages. PMID:21214923
A Systematic Approach for Discovering Novel, Clinically Relevant Bacteria

PubMed Central

Simmon, Keith E.; Fisher, Mark A.

2012-01-01

Sequencing of the 16S rRNA gene (16S) is a reference method for bacterial identification. Its expanded use has led to increased recognition of novel bacterial species. In most clinical laboratories, novel species are infrequently encountered, and their pathogenic potential is often difficult to assess. We reviewed partial 16S sequences from >26,000 clinical isolates, analyzed during February 2006–June 2010, and identified 673 that have <99% sequence identity with valid reference sequences and are thus possibly novel species. Of these 673 isolates, 111 may represent novel genera (<95% identity). Isolates from 95 novel taxa were recovered from multiple patients, indicating possible clinical relevance. Most repeatedly encountered novel taxa belonged to the genera Nocardia (14 novel taxa, 42 isolates) and Actinomyces (12 novel taxa, 52 isolates). This systematic approach for recognition of novel species with potential diagnostic or therapeutic relevance provides a basis for epidemiologic surveys and improvement of sequence databases and may lead to identification of new clinical entities. PMID:22377371
IVisTMSA: Interactive Visual Tools for Multiple Sequence Alignments.

PubMed

Pervez, Muhammad Tariq; Babar, Masroor Ellahi; Nadeem, Asif; Aslam, Naeem; Naveed, Nasir; Ahmad, Sarfraz; Muhammad, Shah; Qadri, Salman; Shahid, Muhammad; Hussain, Tanveer; Javed, Maryam

2015-01-01

IVisTMSA is a software package of seven graphical tools for multiple sequence alignments. MSApad is an editing and analysis tool. It can load 409% more data than Jalview, STRAP, CINEMA, and Base-by-Base. MSA comparator allows the user to visualize consistent and inconsistent regions of reference and test alignments of more than 21-MB size in less than 12 seconds. MSA comparator is 5,200% efficient and more than 40% efficient as compared to BALiBASE c program and FastSP, respectively. MSA reconstruction tool provides graphical user interfaces for four popular aligners and allows the user to load several sequence files at a time. FASTA generator converts seven formats of alignments of unlimited size into FASTA format in a few seconds. MSA ID calculator calculates identity matrix of more than 11,000 sequences with a sequence length of 2,696 base pairs in less than 100 seconds. Tree and Distance Matrix calculation tools generate phylogenetic tree and distance matrix, respectively, using neighbor joining% identity and BLOSUM 62 matrix.
A systematic approach for discovering novel, clinically relevant bacteria.

PubMed

Schlaberg, Robert; Simmon, Keith E; Fisher, Mark A

2012-03-01

Sequencing of the 16S rRNA gene (16S) is a reference method for bacterial identification. Its expanded use has led to increased recognition of novel bacterial species. In most clinical laboratories, novel species are infrequently encountered, and their pathogenic potential is often difficult to assess. We reviewed partial 16S sequences from >26,000 clinical isolates, analyzed during February 2006-June 2010, and identified 673 that have <99% sequence identity with valid reference sequences and are thus possibly novel species. Of these 673 isolates, 111 may represent novel genera (<95% identity). Isolates from 95 novel taxa were recovered from multiple patients, indicating possible clinical relevance. Most repeatedly encountered novel taxa belonged to the genera Nocardia (14 novel taxa, 42 isolates) and Actinomyces (12 novel taxa, 52 isolates). This systematic approach for recognition of novel species with potential diagnostic or therapeutic relevance provides a basis for epidemiologic surveys and improvement of sequence databases and may lead to identification of new clinical entities.
Characterization of Austrian koi herpesvirus samples based on the ORF40 region.

PubMed

Marek, A; Schachner, O; Bilic, I; Hess, M

2010-02-17

Using a PCR that amplifies a region of the thymidine kinase (TK) gene, an epidemic spread of koi herpesvirus (KHV) was determined in koi carps in Austria in 2007. A total of 15 virus samples from different locations in Austria were analyzed to determine their genetic relatedness following PCR and nucleic acid sequencing of the open reading frame 40 (ORF40) region of the KHV genome. ORF40-specific PCR amplification products that were obtained from tissue samples shared 100% nucleotide sequence identity with the published sequence of the Japanese strain of KHV. The ORF40 sequence of one isolate from the UK that was included in the present study was 100% identical with the published sequence of an Israeli strain of KHV. This is the first study that used a larger number of samples and a PCR method, which allowed distinguishing all 3 strains of KHV. The present investigation provides information on the epidemiology of KHV infections in Europe and describes a useful molecular tool for epidemiological studies.
Variability of Actinobacteria, a minor component of rumen microflora.

PubMed

Suľák, M; Sikorová, L; Jankuvová, J; Javorský, P; Pristaš, P

2012-07-01

Actinobacteria (Actinomycetes) are a significant and interesting group of gram-positive bacteria. They are regular, though infrequent, members of the microbial life in the rumen and represent up to 3 % of total rumen bacteria; there is considerable lack of information about ecology and biology of rumen actinobacteria. During the characterization of variability of rumen treponemas using non-cultivation approach, we also noted the variability of rumen actinobacteria. By using Treponema-specific primers a specific 16S rRNA gene library was prepared from cow and sheep rumen total DNA. About 10 % of recombinant clones contained actinobacteria-like sequences. Phylogenetic analyses of 11 clones obtained showed the high variability of actinobacteria in the ruminant digestive system. While some sequences are nearly identical to known sequences of actinobacteria, we detected completely new clusters of actinobacteria-like sequences, representing probably new, as yet undiscovered, group of rumen Actinobacteria. Further research will be necessary for understanding their nature and functions in the rumen.
Recent advances in plant centromere biology.

PubMed

Feng, Chao; Liu, YaLin; Su, HanDong; Wang, HeFei; Birchler, James; Han, FangPu

2015-03-01

The centromere, which is one of the essential parts of a chromosome, controls kinetochore formation and chromosome segregation during mitosis and meiosis. While centromere function is conserved in eukaryotes, the centromeric DNA sequences evolve rapidly and have few similarities among species. The histone H3 variant CENH3 (CENP-A in human), which mostly exists in centromeric nucleosomes, is a universal active centromere mark in eukaryotes and plays an essential role in centromere identity determination. The relationship between centromeric DNA sequences and centromere identity determination is one of the intriguing questions in studying centromere formation. Due to the discoveries in the past decades, including "neocentromeres" and "centromere inactivation", it is now believed that the centromere identity is determined by epigenetic mechanisms. This review will present recent progress in plant centromere biology.
Human centromere genomics: now it's personal.

PubMed

Hayden, Karen E

2012-07-01

Advances in human genomics have accelerated studies in evolution, disease, and cellular regulation. However, centromere sequences, defining the chromosomal interface with spindle microtubules, remain largely absent from ongoing genomic studies and disconnected from functional, genome-wide analyses. This disparity results from the challenge of predicting the linear order of multi-megabase-sized regions that are composed almost entirely of near-identical satellite DNA. Acknowledging these challenges, the field of human centromere genomics possesses the potential to rapidly advance given the availability of individual, or personalized, genome projects matched with the promise of long-read sequencing technologies. Here I review the current genomic model of human centromeres in consideration of those studies involving functional datasets that examine the role of sequence in centromere identity.
Full genome sequences of zebra-borne equine herpesvirus type 1 isolated from zebra, onager and Thomson's gazelle.

PubMed

Guo, Xiaoqin; Izume, Satoko; Okada, Ayaka; Ohya, Kenji; Kimura, Takashi; Fukushi, Hideto

2014-09-01

A strain of equine herpesvirus type 1 (EHV-1) was isolated from zebra. This strain, called "zebra-borne EHV-1", was also isolated from an onager and a gazelle in zoological gardens in U.S.A. The full genome sequences of the 3 strains were determined. They shared 99% identities with each other, while they shared 98% and 95% identities with the horse derived EHV-1 and equine herpesvirus type 9, respectively. Sequence data indicated that the EHV-1 isolated from a polar bear in Germany is one of the zebra-borne EHV-1 and not a recombinant virus. These results indicated that zebra-borne EHV-1 is a subtype of EHV-1.
The complete nucleotide sequence of the barley yellow dwarf GPV isolate from China shows that it is a new member of the genus Polerovirus.

PubMed

Zhang, Wenwei; Cheng, Zhuomin; Xu, Lei; Wu, Maosen; Waterhouse, Peter; Zhou, Guanghe; Li, Shifang

2009-01-01

The complete nucleotide sequence of the ssRNA genome of a Chinese GPV isolate of barley yellow dwarf virus (BYDV) was determined. It comprised 5673 nucleotides, and the deduced genome organization resembled that of members of the genus Polerovirus. It was most closely related to cereal yellow dwarf virus-RPV (77% nt identity over the entire genome; coat protein amino acid identity 79%). The GPV isolate also differs in vector specificity from other BYDV strains. Biological properties, phylogenetic analyses and detailed sequence comparisons suggest that GPV should be considered a member of a new species within the genus, and the name Wheat yellow dwarf virus-GPV is proposed.
Horizontal Transfer of Non-LTR Retrotransposons from Arthropods to Flowering Plants.

PubMed

Gao, Dongying; Chu, Ye; Xia, Han; Xu, Chunming; Heyduk, Karolina; Abernathy, Brian; Ozias-Akins, Peggy; Leebens-Mack, James H; Jackson, Scott A

2018-02-01

Even though lateral movements of transposons across families and even phyla within multicellular eukaryotic kingdoms have been found, little is known about transposon transfer between the kingdoms Animalia and Plantae. We discovered a novel non-LTR retrotransposon, AdLINE3, in a wild peanut species. Sequence comparisons and phylogenetic analyses indicated that AdLINE3 is a member of the RTE clade, originally identified in a nematode and rarely reported in plants. We identified RTE elements in 82 plants, spanning angiosperms to algae, including recently active elements in some flowering plants. RTE elements in flowering plants were likely derived from a single family we refer to as An-RTE. Interestingly, An-RTEs show significant DNA sequence identity with non-LTR retroelements from 42 animals belonging to four phyla. Moreover, the sequence identity of RTEs between two arthropods and two plants was higher than that of homologous genes. Phylogenetic and evolutionary analyses of RTEs from both animals and plants suggest that the An-RTE family was likely transferred horizontally into angiosperms from an ancient aphid(s) or ancestral arthropod(s). Notably, some An-RTEs were recruited as coding sequences of functional genes participating in metabolic or other biochemical processes in plants. This is the first potential example of horizontal transfer of transposons between animals and flowering plants. Our findings help to understand exchanges of genetic material between the kingdom Animalia and Plantae and suggest arthropods likely impacted on plant genome evolution. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Evaluation of a Campylobacter fetus subspecies venerealis real-time quantitative polymerase chain reaction for direct analysis of bovine preputial samples

PubMed Central

Chaban, Bonnie; Chu, Shirley; Hendrick, Steven; Waldner, Cheryl; Hill, Janet E.

2012-01-01

The detection and subspeciation of Campylobacter fetus subsp. venerealis (CFV) from veterinary samples is important for both clinical and economic reasons. Campylobacter fetus subsp. venerealis is the causative agent of bovine genital campylobacteriosis, a venereal disease that can lead to serious reproductive problems in cattle, and strict international regulations require animals and animal products to be CFV-free for trade. This study evaluated methods reported in the literature for CFV detection and reports the translation of an extensively tested CFV-specific polymerase chain reaction (PCR) primer set; including the VenSF/VenSR primers and a real-time, quantitative PCR (qPCR) platform using SYBR Green chemistry. Three methods of preputial sample preparation for direct qPCR were evaluated and a heat lysis DNA extraction method was shown to allow for CFV detection at the level of approximately one cell equivalent per reaction (or 1.0 × 103 CFU/mL) from prepuce. The optimized sample preparation and qPCR protocols were then used to evaluate 3 western Canadian bull cohorts, which included 377 bulls, for CFV. The qPCR assay detected 11 positive bulls for the CFV-specific parA gene target. DNA sequence data confirmed the identity of the amplified product and revealed that positive samples were comprised of 2 sequence types; one identical to previously reported CFV parA gene sequences and one with a 9% sequence divergence. These results add valuable information towards our understanding of an important CFV subspeciation target and offer a significantly improved format for an internationally recognized PCR test. PMID:23277694
Novel magnetite-producing magnetotactic bacteria belonging to the Gammaproteobacteria.

PubMed

Lefèvre, Christopher T; Viloria, Nathan; Schmidt, Marian L; Pósfai, Mihály; Frankel, Richard B; Bazylinski, Dennis A

2012-02-01

Two novel magnetotactic bacteria (MTB) were isolated from sediment and water collected from the Badwater Basin, Death Valley National Park and southeastern shore of the Salton Sea, respectively, and were designated as strains BW-2 and SS-5, respectively. Both organisms are rod-shaped, biomineralize magnetite, and are motile by means of flagella. The strains grow chemolithoautotrophically oxidizing thiosulfate and sulfide microaerobically as electron donors, with thiosulfate oxidized stoichiometrically to sulfate. They appear to utilize the Calvin-Benson-Bassham cycle for autotrophy based on ribulose-1,5-bisphosphate carboxylase/oxygenase (RubisCO) activity and the presence of partial sequences of RubisCO genes. Strains BW-2 and SS-5 biomineralize chains of octahedral magnetite crystals, although the crystals of SS-5 are elongated. Based on 16S rRNA gene sequences, both strains are phylogenetically affiliated with the Gammaproteobacteria class. Strain SS-5 belongs to the order Chromatiales; the cultured bacterium with the highest 16S rRNA gene sequence identity to SS-5 is Thiohalocapsa marina (93.0%). Strain BW-2 clearly belongs to the Thiotrichales; interestingly, the organism with the highest 16S rRNA gene sequence identity to this strain is Thiohalospira alkaliphila (90.2%), which belongs to the Chromatiales. Each strain represents a new genus. This is the first report of magnetite-producing MTB phylogenetically associated with the Gammaproteobacteria. This finding is important in that it significantly expands the phylogenetic diversity of the MTB. Physiology of these strains is similar to other MTB and continues to demonstrate their potential in nitrogen, iron, carbon and sulfur cycling in natural environments.
A novel enterovirus species identified from severe diarrheal goats.

PubMed

Wang, Mingyue; He, Jia; Lu, Haibing; Liu, Yajing; Deng, Yingrui; Zhu, Lisai; Guo, Changming; Tu, Changchun; Wang, Xinping

2017-01-01

The Enterovirus genus of the family of Picornaviridae consists of 9 species of Enteroviruses and 3 species of Rhinoviruses based on the latest virus taxonomy. Those viruses contribute significantly to respiratory and digestive disorders in human and animals. Out of 9 Enterovirus species, Enterovirus E-G are closely related to diseases affecting on livestock industry. While enterovirus infection has been increasingly reported in cattle and swine, the enterovirus infections in small ruminants remain largely unknown. Virology, molecular and bioinformatics methods were employed to characterize a novel enterovirus CEV-JL14 from goats manifesting severe diarrhea with morbidity and mortality respectively up to 84% and 54% in China. CEV-JL14 was defined and proposed as a new Enterovirus species L within the genus of Enterovirus of the family Picornaviridae. CEV-JL14 had a complete genome sequence of 7461 nucleotides with an ORF encoding 2172 amino acids, and shared 77.1% of genomic sequence identity with TB4-OEV, an ovine enterovirus. Comparison of 5'-UTR and structural genes of CEV-JL14 with known Enterovirus species revealed highly genetic variations among CEV-JL14 with known Enterovirus species. VP1 nucleotide sequence identities of CEV-14 were 51.8%-53.5% with those of Enterovirus E and F, 30.9%-65.3% with Enterovirus G, and 43.8-51. 5% with Enterovirus A-D, respectively. CEV-JL14 was proposed as a novel species within the genus of Enterovirus according to the current ICTV demarcation criteria of enteroviruses. CEV-JL14 clustered phylogenetically to neither Enterovirus E and F, nor to Enterovirus G. It was defined and proposed as novel species L within the genus of Enterovirus. This is the first report of caprine enterovirus in China, the first complete genomic sequence of a caprine enterovirus revealed, and the unveiling of significant genetic variations between ovine enterovirus and caprine enterovirus, thus broadening the current understanding of enteroviruses.
A novel enterovirus species identified from severe diarrheal goats

PubMed Central

Liu, Yajing; Deng, Yingrui; Zhu, Lisai; Guo, Changming; Tu, Changchun; Wang, Xinping

2017-01-01

Backgrounds The Enterovirus genus of the family of Picornaviridae consists of 9 species of Enteroviruses and 3 species of Rhinoviruses based on the latest virus taxonomy. Those viruses contribute significantly to respiratory and digestive disorders in human and animals. Out of 9 Enterovirus species, Enterovirus E-G are closely related to diseases affecting on livestock industry. While enterovirus infection has been increasingly reported in cattle and swine, the enterovirus infections in small ruminants remain largely unknown. Methods Virology, molecular and bioinformatics methods were employed to characterize a novel enterovirus CEV-JL14 from goats manifesting severe diarrhea with morbidity and mortality respectively up to 84% and 54% in China. Results CEV-JL14 was defined and proposed as a new Enterovirus species L within the genus of Enterovirus of the family Picornaviridae. CEV-JL14 had a complete genome sequence of 7461 nucleotides with an ORF encoding 2172 amino acids, and shared 77.1% of genomic sequence identity with TB4-OEV, an ovine enterovirus. Comparison of 5’-UTR and structural genes of CEV-JL14 with known Enterovirus species revealed highly genetic variations among CEV-JL14 with known Enterovirus species. VP1 nucleotide sequence identities of CEV-14 were 51.8%-53.5% with those of Enterovirus E and F, 30.9%-65.3% with Enterovirus G, and 43.8–51. 5% with Enterovirus A-D, respectively. CEV-JL14 was proposed as a novel species within the genus of Enterovirus according to the current ICTV demarcation criteria of enteroviruses. Conclusions CEV-JL14 clustered phylogenetically to neither Enterovirus E and F, nor to Enterovirus G. It was defined and proposed as novel species L within the genus of Enterovirus. This is the first report of caprine enterovirus in China, the first complete genomic sequence of a caprine enterovirus revealed, and the unveiling of significant genetic variations between ovine enterovirus and caprine enterovirus, thus broadening the current understanding of enteroviruses. PMID:28376123
Further insight into genetic variation and haplotype diversity of Cherry virus A from China

PubMed Central

Candresse, Thierry; He, Zhen; Li, Shifang; Ma, Yuxin

2017-01-01

Cherry virus A (CVA) infection appears to be prevalent in cherry plantations worldwide. In this study, the diversity of CVA isolates from 31 cherry samples collected from different orchards around Bohai Bay in northeastern China was analyzed. The complete genome of one of these isolates, ChYT52, was found to be 7,434 nt in length excluding the poly (A) tail. It shares between 79.9–98.7% identity with CVA genome sequences in GenBank, while its RdRp core is more divergent (79.1–90.7% nt identity), likely as a consequence of a recombination event. Phylogenetic analysis of ChYT52 genome with CVA genomes in Genbank resulted in at least 7 major clusters plus additional 5 isolates alone at the end of long branches suggesting the existence of further phylogroups diversity in CVA. The genetic diversity of Chinese CVA isolates from 31 samples and GenBank sequences were analyzed in three genomic regions that correspond to the coat protein, the RNA-dependent RNA polymerase core region, and the movement protein genes. With few exceptions likely representing further recombination impact, the trees various trees are largely congruent, indicating that each region provides valuable phylogenetic information. In all cases, the majority of the Chinese CVA isolates clustering in phylogroup I, together with the X82547 reference sequence from Germany. Statistically significant negative values were obtained for Tajima’s D in the three genes for phylogroup I, suggesting that it may be undergoing a period of expansion. There was considerable haplotype diversity in the individual samples and more than half samples contained genetically diverse haplotypes belonging to different phylogroups. In addition, a number of statistically significant recombination events were detected in CVA genomes or in the partial genomic sequences indicating an important contribution of recombination to CVA evolution. This work provides a foundation for elucidation of the epidemiological characteristics and evolutionary history of CVA populations. PMID:29020049
Porcine insulin receptor substrate 4 (IRS4) gene: cloning, polymorphism and association study

USDA-ARS?s Scientific Manuscript database

Using PCR and IPCR techniques we obtained a 4498 bp nucleotide sequence FN424076 encompassing the complete coding sequence of the porcine IRS4 gene and its proximal promoter. The 1269-amino acid porcine protein deduced from the nucleotide sequence shares 92% identity with the human IRS4 and possesse...
Genome Sequences for Five Strains of the Emerging Pathogen Haemophilus haemolyticus

PubMed Central

Jordan, I. King; Conley, Andrew B.; Antonov, Ivan V.; Arthur, Robert A.; Cook, Erin D.; Cooper, Guy P.; Jones, Bernard L.; Knipe, Kristen M.; Lee, Kevin J.; Liu, Xing; Mitchell, Gabriel J.; Pande, Pushkar R.; Petit, Robert A.; Qin, Shaopu; Rajan, Vani N.; Sarda, Shruti; Sebastian, Aswathy; Tang, Shiyuyun; Thapliyal, Racchit; Varghese, Neha J.; Ye, Tianjun; Katz, Lee S.; Wang, Xin; Rowe, Lori; Frace, Michael; Mayer, Leonard W.

2011-01-01

We report the first whole-genome sequences for five strains, two carried and three pathogenic, of the emerging pathogen Haemophilus haemolyticus. Preliminary analyses indicate that these genome sequences encode markers that distinguish H. haemolyticus from its closest Haemophilus relatives and provide clues to the identity of its virulence factors. PMID:21952546
Complete Genome Sequences of Bacillus Phages Janet and OTooleKemple52

PubMed Central

2018-01-01

ABSTRACT We report here the genome sequences of two novel Bacillus cereus group-infecting bacteriophages, Janet and OTooleKemple52. These bacteriophages are double-stranded DNA-containing Myoviridae isolated from soil samples. While their genomes share a high degree of sequence identity with one another, their host preferences are unique. PMID:29748396

Development and application of a PCR assay to detect chicken and turkey parvoviruses in commercial poultry flocks in the United States.

USDA-ARS?s Scientific Manuscript database

Comparative sequence analysis of six independent chicken and turkey parvovirus nonstructural (NS) genes revealed specific genomic regions with 100% nucleotide sequence identity. A PCR assay with primers targeting these conserved genome sequences proved to be highly specific and sensitive to detect p...
Perception and the Temporal Properties of Speech.

DTIC Science & Technology

1993-01-11

conditions. In the embedded condition, phoneme sequences equivalent to these words formed the second syllable of a two-syllable word. In the unembedded ... unembedded in the sequence "warm lips". These priming sequences were based on the sequences used in Experiment 2. Each combinable priming sequence in...unrelated, to the embedded or unembedded prime word. The probes used in this experiment were identical to the ones used in Experiment 2. Subjects were tested
High resolution identity testing of inactivated poliovirus vaccines

PubMed Central

Mee, Edward T.; Minor, Philip D.; Martin, Javier

2015-01-01

Background Definitive identification of poliovirus strains in vaccines is essential for quality control, particularly where multiple wild-type and Sabin strains are produced in the same facility. Sequence-based identification provides the ultimate in identity testing and would offer several advantages over serological methods. Methods We employed random RT-PCR and high throughput sequencing to recover full-length genome sequences from monovalent and trivalent poliovirus vaccine products at various stages of the manufacturing process. Results All expected strains were detected in previously characterised products and the method permitted identification of strains comprising as little as 0.1% of sequence reads. Highly similar Mahoney and Sabin 1 strains were readily discriminated on the basis of specific variant positions. Analysis of a product known to contain incorrect strains demonstrated that the method correctly identified the contaminants. Conclusion Random RT-PCR and shotgun sequencing provided high resolution identification of vaccine components. In addition to the recovery of full-length genome sequences, the method could also be easily adapted to the characterisation of minor variant frequencies and distinction of closely related products on the basis of distinguishing consensus and low frequency polymorphisms. PMID:26049003
Reduced TCOF1 mRNA level in a rhesus macaque with Treacher Collins-like syndrome: further evidence for haploinsufficiency of treacle as the cause of disease.

PubMed

Shows, Kathryn H; Ward, Christy; Summers, Laura; Li, Lin; Ziegler, Gregory R; Hendrickx, Andrew G; Shiang, Rita

2006-02-01

Mutations in the human gene TCOF1 cause a mandibulofacial dysostosis known as Treacher Collins syndrome (TCS). An infant rhesus macaque (Macaca mulatta) that displayed the TCS phenotype was identified at the California National Primate Research Center. The TCOF1 coding region was cloned from a normal rhesus macaque and sequenced. The rhesus macaque homolog of TCOF1 is 91.6% identical in cDNA sequence and 93.8% identical in translated protein sequence compared to human TCOF1. Sequencing of TCOF1 in the TCS-affected rhesus macaque showed no mutations within the coding region or splice sites; however, real-time quantitative PCR showed an 87% reduction of spleen TCOF1 mRNA level in the TCS affected macaque when compared with normal macaque spleen.
Host Cell Virus Entry Mediated by Australian Bat Lyssavirus Envelope G glycoprotein

DTIC Science & Technology

2013-10-24

39 Figure 7. Comparison of the amino acid sequences of Saccolaimus and Pteropus ABLV G mature protein... sequence analysis revealed that the PCR products were identical. Sequence comparisons of the ABLV N and other lyssavirus N proteins showed that ABLV...Saccolaimus flaviventris) (129). Nucleoprotein sequence comparisons revealed that the Saccolaimus N protein shared 96% amino acid homology with the Pteropus
Full-Genome Sequence of Infectious Laryngotracheitis Virus (Gallid Alphaherpesvirus 1) Strain VFAR-043, Isolated in Peru

PubMed Central

Bendezu Eguis, Jorge; Montesinos, Ricardo; Fernández-Díaz, Manolo

2018-01-01

ABSTRACT We report here the first genome sequence of infectious laryngotracheitis virus isolated in Peru from tracheal tissues of layer chickens. The genome showed 99.98% identity to the J2 strain genome sequence. Single nucleotide polymorphisms were detected in five gene-coding sequences related to vaccine development, virus attachment, and viral immune evasion. PMID:29519822
Sequence variations of the human MPDZ gene and association with alcoholism in subjects with European ancestry.

PubMed

Karpyak, Victor M; Kim, Jeong-Hyun; Biernacka, Joanna M; Wieben, Eric D; Mrazek, David A; Black, John L; Choi, Doo-Sup

2009-04-01

Mpdz gene variations are known contributors of acute alcohol withdrawal severity and seizures in mice. To investigate the relevance of these findings for human alcoholism, we resequenced 46 exons, exon-intron boundaries, and 2 kilobases in the 5' region of the human MPDZ gene in 61 subjects with a history of alcohol withdrawal seizures (AWS), 59 subjects with a history of alcohol withdrawal without AWS, and 64 Coriell samples from self-reported nonalcoholic subjects [all European American (EA) ancestry] and compared with the Mpdz sequences of 3 mouse strains with different propensity to AWS. To explore potential associations of the human MPDZ gene with alcoholism and AWS, single SNP and haplotype analyses were performed using 13 common variants. Sixty-seven new, mostly rare variants were discovered in the human MPDZ gene. Sequence comparison revealed that the human gene does not have variations identical to those comprising Mpdz gene haplotype associated with AWS in mice. We also found no significant association between MPDZ haplotypes and AWS in humans. However, a global test of haplotype association revealed a significant difference in haplotype frequencies between alcohol-dependent subjects without AWS and Coriell controls (p = 0.015), suggesting a potential role of MPDZ in alcoholism and/or related phenotypes other than AWS. Haplotype-specific tests for the most common haplotypes (frequency > 0.05), revealed a specific high-risk haplotype (p = 0.006, maximum statistic p = 0.051), containing rs13297480G allele also found to be significantly more prevalent in alcoholics without AWS compared with nonalcoholic Coriell subjects (p = 0.019). Sequencing of MPDZ gene in individuals with EA ancestry revealed no variations in the sites identical to those associated with AWS in mice. Exploratory haplotype and single SNP association analyses suggest a possible association between the MPDZ gene and alcohol dependence but not AWS. Further functional genomic analysis of MPDZ variants and investigation of their association with a broader array of alcoholism-related phenotypes could reveal additional genetic markers of alcoholism.
Communities of arbuscular mycorrhizal fungi in Pyrus pyrifolia var. culta (Japanese pear) and an understory herbaceous plant Plantago asiatica.

PubMed

Yoshimura, Yuko; Ido, Akifumi; Matsumoto, Teruyuki; Yamato, Masahide

2013-01-01

We investigated communities of arbuscular mycorrhizal fungi (AMF) in the fine roots of Pyrus pyrifolia var. culta, and Plantago asiatica to consider the relationship between orchard trees and herbaceous plants in AMF symbioses. The AMF communities were analyzed on the basis of the partial fungal DNA sequences of the nuclear small subunit ribosomal RNA gene (SSU rDNA), which were amplified using the AMF-specific primers AML1 and AML2. Phylogenetic analysis showed that the obtained AMF sequences were divided into 23 phylotypes. Among them, 12 phylotypes included AMF from both host plants, and most of the obtained sequences (689/811) were affiliated to them. Canonical correspondence analysis showed that the host plant species did not have a significant effect on the distribution of AMF phylotypes, whereas the effects of sampling site, soil total C, soil total N and soil-available P were significant. It was also found that the mean observed overlaps of AMF phylotypes between the paired host plants in the same soil cores (27.1% of phylotypes shared) were significantly higher than the mean 1,000 simulated overlaps (14.2%). Furthermore, the same AMF sequences (100% sequence identity) were detected from both host plants in 8/12 soil cores having both roots. Accordingly, we concluded that Py. pyrifolia and Pl. asiatica examined shared some AMF communities, which suggested that understory herbaceous plants may function as AMF inoculum sources for orchard trees.
Communities of Arbuscular Mycorrhizal Fungi in Pyrus pyrifolia var. culta (Japanese pear) and an Understory Herbaceous Plant Plantago asiatica

PubMed Central

Yoshimura, Yuko; Ido, Akifumi; Matsumoto, Teruyuki; Yamato, Masahide

2013-01-01

We investigated communities of arbuscular mycorrhizal fungi (AMF) in the fine roots of Pyrus pyrifolia var. culta, and Plantago asiatica to consider the relationship between orchard trees and herbaceous plants in AMF symbioses. The AMF communities were analyzed on the basis of the partial fungal DNA sequences of the nuclear small subunit ribosomal RNA gene (SSU rDNA), which were amplified using the AMF-specific primers AML1 and AML2. Phylogenetic analysis showed that the obtained AMF sequences were divided into 23 phylotypes. Among them, 12 phylotypes included AMF from both host plants, and most of the obtained sequences (689/811) were affiliated to them. Canonical correspondence analysis showed that the host plant species did not have a significant effect on the distribution of AMF phylotypes, whereas the effects of sampling site, soil total C, soil total N and soil-available P were significant. It was also found that the mean observed overlaps of AMF phylotypes between the paired host plants in the same soil cores (27.1% of phylotypes shared) were significantly higher than the mean 1,000 simulated overlaps (14.2%). Furthermore, the same AMF sequences (100% sequence identity) were detected from both host plants in 8/12 soil cores having both roots. Accordingly, we concluded that Py. pyrifolia and Pl. asiatica examined shared some AMF communities, which suggested that understory herbaceous plants may function as AMF inoculum sources for orchard trees. PMID:23614902
Detection of Co-Infection of Notocactus leninghausii f. cristatus with Six Virus Species in South Korea

PubMed Central

Park, Chung Hwa; Song, Eun Gyeong; Ryu, Ki Hyun

2018-01-01

Co-infection with two virus species was previously reported in some cactus plants. Here, we showed that Notocactus leninghausii f. cristatus can be co-infected with six different viruses: cactus mild mottle virus (CMMoV)-Nl, cactus virus X (CVX)-Nl, pitaya virus X (PiVX)-Nl, rattail cactus necrosis-associated virus (RCNaV)-Nl, schlumbergera virus X (SchVX)-Nl, and zygocactus virus X (ZyVX)-Nl. The coat protein sequences of these viruses were compared with those of previously reported viruses. CMMoV-Nl, CVX-Nl, PiVX-Nl, RCNaV-Nl, SchVX-Nl, and ZyVX-Nl showed the greatest nucleotide sequence homology to CMMoV-Kr (99.8% identity, GenBank accession NC_011803), CVX-Jeju (77.5% identity, GenBank accession LC12841), PiVX-P37 (98.4% identity, GenBank accession NC_024458), RCNaV (99.4% identity, GenBank accession NC_016442), SchVX-K11 (95.7% identity, GenBank accession NC_011659), and ZyVX-B1 (97.9% identity, GenBank accession NC_006059), respectively. This study is the first report of co-infection with six virus species in N. leninghausii f. cristatus in South Korea. PMID:29422789
A cysteine protease encoded by the baculovirus Bombyx mori nuclear polyhedrosis virus.

PubMed Central

Ohkawa, T; Majima, K; Maeda, S

1994-01-01

Sequence analysis of the BamHI F fragment of the genome of Bombyx mori nuclear polyhedrosis virus (BmNPV) revealed an open reading frame whose deduced amino acid sequence had homology to those of cysteine proteases of the papain superfamily. The putative cysteine protease sequence (BmNPV-CP) was 323 amino acids long and showed 35% identity to a cysteine proteinase precursor from Trypanosoma brucei. Of 36 residues conserved among cathepsins B, H, L, and S and papain, 31 were identical in BmNPV-CP. In order to determine the activity and function of the putative cysteine protease, a BmNPV mutant (BmCysPD) was constructed by homologous recombination of the protease gene with a beta-galactosidase gene cassette. BmCysPD-infected BmN cell extracts were significantly reduced in acid protease activity compared with wild-type virus-infected cell extracts. The cysteine protease inhibitor E-64 [trans-epoxysuccinylleucylamido-(4-guanidino)butane] inhibited wild-type virus-expressed protease activity. Deletion of the cysteine protease gene had no significant effect on viral growth or polyhedron production in BmN cells, indicating that the cysteine protease was not essential for viral replication in vitro. However, B. mori larvae infected with BmCysPD showed symptoms different from those of wild-type BmNPV-infected larvae, e.g., less degradation of the body, including fat body cells, white body surface color due presumably to undegraded epidermal cells, and an increase in the number of polyhedra released into the hemolymph. This is the first report of (i) a virus-encoded protease with activity on general substrates and (ii) evidence that a virus-encoded protease may play a role in degradation of infected larvae to facilitate horizontal transmission of the virus. Images PMID:8083997
Molecular characterization of the ribosomal DNA unit of Sarcocystis singaporensis, Sarcocystis zamani and Sarcocystis zuoi from rodents in Thailand

PubMed Central

WATTHANAKAIWAN, Vichan; SUKMAK, Manakorn; HAMARIT, Kriengsak; KAOLIM, Nongnid; WAJJWALKU, Worawidh; MUANGKRAM, Yuttamol

2017-01-01

Sarcocystis species are heteroxenous cyst-forming coccidian protozoan parasites with a wide host range, including rodents. In this study, Sarcocystis spp. samples were isolated from Bandicota indica, Rattus argentiventer, R. tiomanicus and R. norvegicus across five provinces of Thailand. Two major groups of Sarcocystis cysts were determined in this study: large and small cysts. By sequence comparisons and phylogenetic analyses based on the partial sequences of 28S ribosomal DNA, the large cysts showed the highest identity value (99%) with the S. zamani in GenBank database. While the small cysts could be divided into 2 groups of Sarcocystis: S. singaporensis and presupposed S. zuoi. The further analysis on 18S rDNA supported that the 2 isolates (S2 and B6 no.2) were as identified as S. singaporensis shared a high sequence identity with the S. singaporensis in GenBank database and the unidentified Sarcocystis (4 isolates, i.e., B6 no.10, B6 no.12, B10 no.4 and B10 no.7) showed 96.3–99.5% identity to S. zuoi as well as high distinct identity from others Sarcocystis spp. (≤93%). The result indicated that these four samples should be S. zuoi. In this study, we provided complete sequence of internal transcribed spacer 1 (ITS1), 5.8S rDNA and internal transcribed spacer 2 (ITS2) of these three Sarcocystis species and our new primer set could be useful to study the evolution of Sarcocystis. PMID:28701623
Molecular characterization of the ribosomal DNA unit of Sarcocystis singaporensis, Sarcocystis zamani and Sarcocystis zuoi from rodents in Thailand.

PubMed

Watthanakaiwan, Vichan; Sukmak, Manakorn; Hamarit, Kriengsak; Kaolim, Nongnid; Wajjwalku, Worawidh; Muangkram, Yuttamol

2017-08-18

Sarcocystis species are heteroxenous cyst-forming coccidian protozoan parasites with a wide host range, including rodents. In this study, Sarcocystis spp. samples were isolated from Bandicota indica, Rattus argentiventer, R. tiomanicus and R. norvegicus across five provinces of Thailand. Two major groups of Sarcocystis cysts were determined in this study: large and small cysts. By sequence comparisons and phylogenetic analyses based on the partial sequences of 28S ribosomal DNA, the large cysts showed the highest identity value (99%) with the S. zamani in GenBank database. While the small cysts could be divided into 2 groups of Sarcocystis: S. singaporensis and presupposed S. zuoi. The further analysis on 18S rDNA supported that the 2 isolates (S2 and B6 no.2) were as identified as S. singaporensis shared a high sequence identity with the S. singaporensis in GenBank database and the unidentified Sarcocystis (4 isolates, i.e., B6 no.10, B6 no.12, B10 no.4 and B10 no.7) showed 96.3-99.5% identity to S. zuoi as well as high distinct identity from others Sarcocystis spp. (≤93%). The result indicated that these four samples should be S. zuoi. In this study, we provided complete sequence of internal transcribed spacer 1 (ITS1), 5.8S rDNA and internal transcribed spacer 2 (ITS2) of these three Sarcocystis species and our new primer set could be useful to study the evolution of Sarcocystis.
Moussa virus: a new member of the Rhabdoviridae family isolated from Culex decens mosquitoes in Côte d’Ivoire

PubMed Central

Quan, Phenix-Lan; Junglen, Sandra; Tashmukhamedova, Alla; Conlan, Sean; Hutchison, Stephen K.; Kurth, Andreas; Ellerbrok, Heinz; Egholm, Michael; Briese, Thomas; Leendertz, Fabian H.; Ian Lipkin, W

2009-01-01

Characterization of arboviruses at the interface of pristine habitats and anthropogenic landscapes is crucial to comprehensive emergent disease surveillance and forecasting efforts. In context of surveillance campaign in and around a West African rainforest, particles morphologically consistent with rhabdoviruses were identified in cell cultures infected with homogenates of trapped mosquitoes. RNA recovered from these cultures was used to derive the first complete genome sequence of a rhabdovirus isolated from Culex decens mosquitoes in Côte d’Ivoire, tentatively named Moussa virus (MOUV). MOUV shows the classical genome organization of rhabdoviruses, with five open reading frames (ORF) in a linear order. However, sequences show only limited conservation (12–33% identity at amino acid level), and ORF2 and ORF3 have no significant similarity to sequences deposited in GenBank. Phylogenetic analysis indicates a potential new species with distant relationship to Tupaia and Tibrogargan virus. PMID:19804801
Moussa virus: a new member of the Rhabdoviridae family isolated from Culex decens mosquitoes in Côte d'Ivoire.

PubMed

Quan, Phenix-Lan; Junglen, Sandra; Tashmukhamedova, Alla; Conlan, Sean; Hutchison, Stephen K; Kurth, Andreas; Ellerbrok, Heinz; Egholm, Michael; Briese, Thomas; Leendertz, Fabian H; Lipkin, W Ian

2010-01-01

Characterization of arboviruses at the interface of pristine habitats and anthropogenic landscapes is crucial to comprehensive emergent disease surveillance and forecasting efforts. In context of a surveillance campaign in and around a West African rainforest, particles morphologically consistent with rhabdoviruses were identified in cell cultures infected with homogenates of trapped mosquitoes. RNA recovered from these cultures was used to derive the first complete genome sequence of a rhabdovirus isolated from Culex decens mosquitoes in Côte d'Ivoire, tentatively named Moussa virus (MOUV). MOUV shows the classical genome organization of rhabdoviruses, with five open reading frames (ORF) in a linear order. However, sequences show only limited conservation (12-33% identity at amino acid level), and ORF2 and ORF3 have no significant similarity to sequences deposited in GenBank. Phylogenetic analysis indicates a potential new species with distant relationship to Tupaia and Tibrogargan virus.
Plastome Sequence Determination and Comparative Analysis for Members of the Lolium-Festuca Grass Species Complex

PubMed Central

Hand, Melanie L.; Spangenberg, German C.; Forster, John W.; Cogan, Noel O. I.

2013-01-01

Chloroplast genome sequences are of broad significance in plant biology, due to frequent use in molecular phylogenetics, comparative genomics, population genetics, and genetic modification studies. The present study used a second-generation sequencing approach to determine and assemble the plastid genomes (plastomes) of four representatives from the agriculturally important Lolium-Festuca species complex of pasture grasses (Lolium multiflorum, Festuca pratensis, Festuca altissima, and Festuca ovina). Total cellular DNA was extracted from either roots or leaves, was sequenced, and the output was filtered for plastome-related reads. A comparison between sources revealed fewer plastome-related reads from root-derived template but an increase in incidental bacterium-derived sequences. Plastome assembly and annotation indicated high levels of sequence identity and a conserved organization and gene content between species. However, frequent deletions within the F. ovina plastome appeared to contribute to a smaller plastid genome size. Comparative analysis with complete plastome sequences from other members of the Poaceae confirmed conservation of most grass-specific features. Detailed analysis of the rbcL–psaI intergenic region, however, revealed a “hot-spot” of variation characterized by independent deletion events. The evolutionary implications of this observation are discussed. The complete plastome sequences are anticipated to provide the basis for potential organelle-specific genetic modification of pasture grasses. PMID:23550121
Non-redundant patent sequence databases with value-added annotations at two levels

PubMed Central

Li, Weizhong; McWilliam, Hamish; de la Torre, Ana Richart; Grodowski, Adam; Benediktovich, Irina; Goujon, Mickael; Nauche, Stephane; Lopez, Rodrigo

2010-01-01

The European Bioinformatics Institute (EMBL-EBI) provides public access to patent data, including abstracts, chemical compounds and sequences. Sequences can appear multiple times due to the filing of the same invention with multiple patent offices, or the use of the same sequence by different inventors in different contexts. Information relating to the source invention may be incomplete, and biological information available in patent documents elsewhere may not be reflected in the annotation of the sequence. Search and analysis of these data have become increasingly challenging for both the scientific and intellectual-property communities. Here, we report a collection of non-redundant patent sequence databases, which cover the EMBL-Bank nucleotides patent class and the patent protein databases and contain value-added annotations from patent documents. The databases were created at two levels by the use of sequence MD5 checksums. Sequences within a level-1 cluster are 100% identical over their whole length. Level-2 clusters were defined by sub-grouping level-1 clusters based on patent family information. Value-added annotations, such as publication number corrections, earliest publication dates and feature collations, significantly enhance the quality of the data, allowing for better tracking and cross-referencing. The databases are available format: http://www.ebi.ac.uk/patentdata/nr/. PMID:19884134
Non-redundant patent sequence databases with value-added annotations at two levels.

PubMed

Li, Weizhong; McWilliam, Hamish; de la Torre, Ana Richart; Grodowski, Adam; Benediktovich, Irina; Goujon, Mickael; Nauche, Stephane; Lopez, Rodrigo

2010-01-01

The European Bioinformatics Institute (EMBL-EBI) provides public access to patent data, including abstracts, chemical compounds and sequences. Sequences can appear multiple times due to the filing of the same invention with multiple patent offices, or the use of the same sequence by different inventors in different contexts. Information relating to the source invention may be incomplete, and biological information available in patent documents elsewhere may not be reflected in the annotation of the sequence. Search and analysis of these data have become increasingly challenging for both the scientific and intellectual-property communities. Here, we report a collection of non-redundant patent sequence databases, which cover the EMBL-Bank nucleotides patent class and the patent protein databases and contain value-added annotations from patent documents. The databases were created at two levels by the use of sequence MD5 checksums. Sequences within a level-1 cluster are 100% identical over their whole length. Level-2 clusters were defined by sub-grouping level-1 clusters based on patent family information. Value-added annotations, such as publication number corrections, earliest publication dates and feature collations, significantly enhance the quality of the data, allowing for better tracking and cross-referencing. The databases are available format: http://www.ebi.ac.uk/patentdata/nr/.
The influence of phonological priming on variability in articulation

NASA Astrophysics Data System (ADS)

Babel, Molly E.; Munson, Benjamin

2004-05-01

Previous research [Sevald and Dell, Cognition 53, 91-127 (1994)] has found that reiterant sequences of CVC words are produced more quickly when the prime word and target word share VC sequences (i.e., sequences like sit sick) than when they are identical (sequences like sick sick). Even slower production rates are found when primes and targets share a CV sequence (sequences like kick sick). These data have been used to support a model of speech production in which lexical items and their constituent phonemes are activated sequentially. The current experiment investigated whether phonological priming also influences variability in the acoustic characteristics of words. Specifically, we examined whether greater variability in the acoustic characteristics of target words was noted in the CV-related prime context than in the identical-prime context, and whether less variability was noted in the VC-related context. Thirty adult subjects with typical speech, language, and hearing ability produced reiterant two-word sequences that varied in their phonological similarity. The duration, first, and second formant frequencies of the target-words' vowels were measured. Preliminary analyses indicate that phonological priming does not have a systematic effect on variability in these acoustic parameters.
Sequence diversity of wheat mosaic virus isolates.

PubMed

Stewart, Lucy R

2016-02-02

Wheat mosaic virus (WMoV), transmitted by eriophyid wheat curl mites (Aceria tosichella) is the causal agent of High Plains disease in wheat and maize. WMoV and other members of the genus Emaravirus evaded thorough molecular characterization for many years due to the experimental challenges of mite transmission and manipulating multisegmented negative sense RNA genomes. Recently, the complete genome sequence of a Nebraska isolate of WMoV revealed eight segments, plus a variant sequence of the nucleocapsid protein-encoding segment. Here, near-complete and partial consensus sequences of five more WMoV isolates are reported and compared to the Nebraska isolate: an Ohio maize isolate (GG1), a Kansas barley isolate (KS7), and three Ohio wheat isolates (H1, K1, W1). Results show two distinct groups of WMoV isolates: Ohio wheat isolate RNA segments had 84% or lower nucleotide sequence identity to the NE isolate, whereas GG1 and KS7 had 98% or higher nucleotide sequence identity to the NE isolate. Knowledge of the sequence variability of WMoV isolates is a step toward understanding virus biology, and potentially explaining observed biological variation. Published by Elsevier B.V.

Genotypes of Ciprofloxacin-Resistant Klebsiella pneumoniae in Korea and Their Characteristics According to the Genetic Lineages.

PubMed

Park, Dong Jin; Yu, Jin Kyung; Park, Kang Gyun; Park, Yeon-Joon

2015-12-01

We investigated the molecular genotypes of ciprofloxacin-resistant Klebsiella pneumoniae and their characteristics according to the genetic lineages. For 160 K. pneumoniae collected in 2013, ciprofloxacin minimum inhibitory concentrations (MICs) were determined by agar dilution method. The genotypes of ciprofloxacin-resistant K. pneumoniae isolates were determined by multilocus sequence typing (MLST) and wzi gene typing. The presence of plasmid-mediated resistance determinants [qnrA, qnrB, qnrS, aac(6')-Ib-cr, blaCTX-M, and blaSHV] was investigated. The gyrA and parC genes were sequenced. Fifty-seven isolates showed ciprofloxacin resistance. By MLST, four major sequence types (STs) or clonal complexes (CCs), that is, ST307, CC11, CC147, and ST15, were found and the two most prevalent STs were ST307 (14/57, 24.6%) and ST11 (12/57, 21.1%). By wzi gene sequencing, 46 of the 57 isolates could be differentiated. All the ST307 isolates had an identical wzi sequence and harbored qnrB. The majority of them harbored aac(6')-Ib-cr (85.7%) and CTX-M-15 (92.9%). In contrast, 12 ST11 isolates were divided into five sublineages by wzi sequence and qnrB, qnrS, and aac(6')-Ib-cr were carried by nine, seven, and three isolates, respectively. They harbored SHV-type extended-spectrum β-lactamase more frequently than CTX-M-15 (nine and four isolates, respectively). The prevalence of CTX-M-15, qnrB1, and aac(6')-Ib-cr was significantly higher in ST307 than in ST11 (p=0.003, p=0.000, and p=0.002, respectively). Both clones had identical amino acid substitution in gyrA (S83I) and parC (S80I). K. pneumoniae ST307 and ST11 were the two most common clones, and the ST307 isolates were highly homogeneous, suggesting their recent emergence.
Rapid and Accurate Sequencing of Enterovirus Genomes Using MinION Nanopore Sequencer.

PubMed

Wang, Ji; Ke, Yue Hua; Zhang, Yong; Huang, Ke Qiang; Wang, Lei; Shen, Xin Xin; Dong, Xiao Ping; Xu, Wen Bo; Ma, Xue Jun

2017-10-01

Knowledge of an enterovirus genome sequence is very important in epidemiological investigation to identify transmission patterns and ascertain the extent of an outbreak. The MinION sequencer is increasingly used to sequence various viral pathogens in many clinical situations because of its long reads, portability, real-time accessibility of sequenced data, and very low initial costs. However, information is lacking on MinION sequencing of enterovirus genomes. In this proof-of-concept study using Enterovirus 71 (EV71) and Coxsackievirus A16 (CA16) strains as examples, we established an amplicon-based whole genome sequencing method using MinION. We explored the accuracy, minimum sequencing time, discrimination and high-throughput sequencing ability of MinION, and compared its performance with Sanger sequencing. Within the first minute (min) of sequencing, the accuracy of MinION was 98.5% for the single EV71 strain and 94.12%-97.33% for 10 genetically-related CA16 strains. In as little as 14 min, 99% identity was reached for the single EV71 strain, and in 17 min (on average), 99% identity was achieved for 10 CA16 strains in a single run. MinION is suitable for whole genome sequencing of enteroviruses with sufficient accuracy and fine discrimination and has the potential as a fast, reliable and convenient method for routine use. Copyright © 2017 The Editorial Board of Biomedical and Environmental Sciences. Published by China CDC. All rights reserved.
Interaction of the p85 subunit of PI 3-kinase and its N-terminal SH2 domain with a PDGF receptor phosphorylation site: structural features and analysis of conformational changes.

PubMed Central

Panayotou, G; Bax, B; Gout, I; Federwisch, M; Wroblowski, B; Dhand, R; Fry, M J; Blundell, T L; Wollmer, A; Waterfield, M D

1992-01-01

Circular dichroism and fluorescence spectroscopy were used to investigate the structure of the p85 alpha subunit of the PI 3-kinase, a closely related p85 beta protein, and a recombinant SH2 domain-containing fragment of p85 alpha. Significant spectral changes, indicative of a conformational change, were observed on formation of a complex with a 17 residue peptide containing a phosphorylated tyrosine residue. The sequence of this peptide is identical to the sequence surrounding Tyr751 in the kinase-insert region of the platelet-derived growth factor beta-receptor (beta PDGFR). The rotational correlation times measured by fluorescence anisotropy decay indicated that phosphopeptide binding changed the shape of the SH2 domain-containing fragment. The CD and fluorescence spectroscopy data support the secondary structure prediction based on sequence analysis and provide evidence for flexible linker regions between the various domains of the p85 proteins. The significance of these results for SH2 domain-containing proteins is discussed. Images PMID:1330535
Variability Studies of Two Prunus-Infecting Fabaviruses with the Aid of High-Throughput Sequencing

PubMed Central

Sarkisova, Tatiana; Lenz, Ondřej; Přibylová, Jaroslava; Špak, Josef; Lotos, Leonidas; Beta, Christina; Katsiani, Asimina; Candresse, Thierry

2018-01-01

During their lifetime, perennial woody plants are expected to face multiple infection events. Furthermore, multiple genotypes of individual virus species may co-infect the same host. This may eventually lead to a situation where plants harbor complex communities of viral species/strains. Using high-throughput sequencing, we describe co-infection of sweet and sour cherry trees with diverse genomic variants of two closely related viruses, namely prunus virus F (PrVF) and cherry virus F (CVF). Both viruses are most homologous to members of the Fabavirus genus (Secoviridae family). The comparison of CVF and PrVF RNA2 genomic sequences suggests that the two viruses may significantly differ in their expression strategy. Indeed, similar to comoviruses, the smaller genomic segment of PrVF, RNA2, may be translated in two collinear proteins while CVF likely expresses only the shorter of these two proteins. Linked with the observation that identity levels between the coat proteins of these two viruses are significantly below the family species demarcation cut-off, these findings support the idea that CVF and PrVF represent two separate Fabavirus species. PMID:29670059
Koi herpesvirus epizootic in cultured carp and koi, Cyprinus carpio L., in Taiwan.

PubMed

Cheng, L; Chen, C-Y; Tsai, M-A; Wang, P-C; Hsu, J-P; Chern, R-S; Chen, S-C

2011-07-01

Koi herpesvirus (KHV) poses a significant threat to cultured koi and common carp, both Cyprinus carpio L. Since the first reported case in Israel in 1998, KHV has rapidly spread worldwide. This study investigates the spread of KHV to Taiwan by collecting 49 cases of suspected common carp and koi infections from 2003 to 2005 for analysis. Clinical signs included lethargy, anorexia, increased respiratory movements and uncoordinated swimming. Hyperaemia, haemorrhage on body surface and necrotic gill filaments were recorded. Gill epithelial hyperplasia, necrosis and eosinophilic intranuclear inclusion bodies were observed by histological examination, while virions were detected using transmission electron microscopy. By detecting the presence of the KHV thymidine kinase (TK) gene and the KHV 9/5 gene using polymerase chain reaction (PCR), 37 cases were identified as KHV-positive, and the cumulative mortality of infected fish was 70-100%. Positive cases showed identical sequences for the genes analysed, implying that they were of the same origin. For the KHV 9/5 gene sequence, these cases exhibited 100% identity with the Japanese strain (TUMST1, accession number AP008984) and 99% identity with the Israeli (KHV-I, DQ177346) and US (KHV-U, DQ657948) strains. Additionally, a loop-mediated isothermal amplification (LAMP) assay was performed and found to be more sensitive than PCR tests, suggesting its potential use as a rapid diagnostic method for KHV. This is the first epidemiological study of KHV infection in cultured common carp and koi in Taiwan. © 2011 Blackwell Publishing Ltd.
Molecular evidence that the asexual industrial fungus Trichoderma reesei is a clonal derivative of the ascomycete Hypocrea jecorina.

PubMed Central

Kuhls, K; Lieckfeldt, E; Samuels, G J; Kovacs, W; Meyer, W; Petrini, O; Gams, W; Börner, T; Kubicek, C P

1996-01-01

The relationship of the important cellulase producing asexual fungus Trichoderma reesei to its putative teleomorphic (sexual) ancestor Hypocrea jecorina and other species of the Trichoderma sect. Longibrachiatum was studied by PCR-fingerprinting and sequence analyses of the nuclear ribosomal DNA region containing the internal transcribed spacers (ITS-1 and ITS-2) and the 5.8S rRNA gene. The differences in the corresponding ITS sequences allowed a grouping of anamorphic (asexual) species of Trichoderma sect. Longibrachiatum into Trichoderma longibrachiatum, Trichoderma pseudokoningii, and Trichoderma reesei. The sexual species Hypocrea schweinitzii and H. jecorina were also clearly separated from each other. H. jecorina and T. reesei exhibited identical sequences, suggesting close relatedness or even species identity. Intraspecific and interspecific variation in the PCR-fingerprinting patterns supported the differentiation of species based on ITS sequences, the grouping of the strains, and the assignment of these strains to individual species. The variations between T. reesei and H. jecorina were at the same order of magnitude as found between all strains of H. jecorina, but much lower than the observed interspecific variations. Identical ITS sequences and the high similarity of PCR-fingerprinting patterns indicate a very close relationship between T. reesei and H. jecorina, whereas differences of the ITS sequences and the PCR-fingerprinting patterns show a clear phylogenetic distance between T. reesei/H. jecorina and T. longibrachiatum. T. reesei is considered to be an asexual, clonal line derived from a population of the tropical ascomycete H. jecorina. Images Fig. 2 PMID:8755548
Distribution and molecular diversity of three cucurbit-infecting poleroviruses in China.

PubMed

Shang, Qiao-xia; Xiang, Hai-ying; Han, Cheng-gui; Li, Da-wei; Yu, Jia-lin

2009-11-01

Cucurbit aphid-borne yellows virus (CABYV) and Melon aphid-borne yellows virus (MABYV) have been found to be associated with cucurbit yellowing disease in China. Our report identifies for the first time a third distinct polerovirus, tentatively named Suakwa aphid-borne yellows virus (SABYV), infecting Suakwa vegetable sponge. To better understand the distribution and molecular diversity of these three poleroviruses infecting cucurbits, a total of 214 cucurbitaceous crop samples were collected from 25 provinces in China, and were investigated by RT-PCR and sequencing. Of these, 108 samples tested positive for CABYV, while 40 samples from five provinces were positive for MABYV, and SABYV was detected in only 4 samples which were collected in the southern part of China. Forty-one PCR-amplified fragments containing a portion of the RdRp gene, intergenic NCR and CP gene were cloned and sequenced. Sequence comparisons showed that CABYV isolates shared 78.0-79.2% nucleotide sequence identity with MABYV isolates, and 69.7-70.8% with SABYV. Sequence identity between MABYV and SABYV was 73.3-76.5%. In contrast, the nucleotide identities within each species were 93.2-98.7% (CABYV), 98.1-99.9% (MABYV), and 96.1-98.6% (SABYV). Phylogenetic analyses revealed that the polerovirus isolates fit into three distinct groups, corresponding to the three species. The CABYV group could be further divided into two subgroups: the Asia subgroup and the Mediterranean subgroup, based on CP gene and partial RdRp gene sequences. Recombination analysis suggested that MABYV may be a recombinant virus.
First isolation of hirame rhabdovirus from freshwater fish in Europe.

PubMed

Borzym, E; Matras, M; Maj-Paluch, J; Baud, M; De Boisséson, C; Talbi, C; Olesen, N J; Bigarré, L

2014-05-01

A rhabdovirus was isolated in cell culture inoculated with tissue material from diseased grayling, Thymallus thymallus (L.), originating from a fish farm affected by a mortality episode in Poland. Diagnostics tests showed that the virus was not related to novirhabdoviruses known in Europe, nor to vesiculovirus-like species, except perch rhabdovirus (PRhV) with which it shared moderate serological relations. However, RT-PCR with PRhV probes gave negative results. To identify the virus, a random-priming sequence-independent single primer amplification was adopted. Surprisingly, two of the obtained sequences exhibited a high identity (>99%) with hirame rhabdovirus (HIRRV), a novirhabdovirus usually found in fish in marine Asiatic countries, for instance Japan, China and Korea. The full-length sequence of the phosphoprotein gene (P) demonstrated a higher identity of the present isolate with HIRRV from China compared with the Korean isolate. An identical viral sequence was also found in brown trout, Salmo trutta trutta L., affected by mortalities in a second farm in the same region, after a likely contamination from the grayling farm. To our knowledge, this is the first report of HIRRV in Europe, and in two hosts from fresh water that have not been described before as susceptible species. © 2013 John Wiley & Sons Ltd.
Higher order memories for objects encountered in different spatio-temporal contexts in mice: evidence for episodic memory.

PubMed

Dere, Ekrem; Silva, Maria A De Souza; Huston, Joseph P

2004-01-01

The ability to build higher order multi-modal memories comprising information about the spatio-temporal context of events has been termed 'episodic memory'. Deficits in episodic memory are apparent in a number of neuropsychiatric diseases. Unfortunately, the development of animal models of episodic memory has made little progress. Towards the goal of such a model we devised an object exploration task for mice, providing evidence that rodents can associate object, spatial and temporal information. In our task the mice learned the temporal sequence by which identical objects were introduced into two different contexts. The 'what' component of an episodic memory was operationalized via physically distinct objects; the 'where' component through physically different contexts, and, most importantly, the 'when' component via the context-specific inverted sequence in which four objects were presented. Our results suggest that mice are able to recollect the inverted temporal sequence in which identical objects were introduced into two distinct environments. During two consecutive test trials mice showed an inverse context-specific exploration pattern regarding identical objects that were previously encountered with even frequencies. It seems that the contexts served as discriminative stimuli signaling which of the two sequences are decisive during the two test trials.
Genetic diversity of the human head lice, Pediculus humanus capitis, among primary school girls in Saudi Arabia, with reference to their prevalence.

PubMed

Al-Shahrani, Sarah A; Alajmi, Reem A; Ayaad, Tahany H; Al-Shahrani, Mohammed A; Shaurub, El-Sayed H

2017-10-01

The present work aimed at investigating the genetic diversity of the head louse Pediculus humanus capitis (P. humanus capitis) among infested primary school girls at Bisha governorate, Saudi Arabia, based on the sequence of mitochondrial cytochrome b (mt cyt b) gene of 121 P. humanus capitis adults. Additionally, the prevalence of pediculosis capitis was surveyed. The results of sequencing were compared with the sequence of human head lice that are genotyped previously. Phylogenetic tree analysis showed the presence of 100% identity (n = 26) of louse specimens with clade A (prevalent worldwide) of the GenBank data base. Louse individuals (n = 50) showed 99.8% similarity with the same clade A reference having a single base pair difference. Also, a number of 22 louse individuals revealed 99.8% identity with clade B reference (prevalent in North and Central Americas, Europe, and Australia) with individual diversity in two base pairs. Moreover, 14 louse individual sequences revealed 99.4% identity with three base pair differences. It was concluded that moderate pediculosis (~13%) prevailed among the female students of the primary schools. It was age-and hair texture (straight or curly)-dependent. P. humanus capitis prevalence diversity is of clades A and B genotyping.
A novel flavivirus detected in two Aedes spp. collected near the demilitarized zone of the Republic of Korea.

PubMed

Korkusol, Achareeya; Takhampunya, Ratree; Hang, Jun; Jarman, Richard G; Tippayachai, Bousaraporn; Kim, Heung-Chul; Chong, Sung-Tae; Davidson, Silas A; Klein, Terry A

2017-05-01

Flaviviruses comprise a large and diverse group of positive-stranded RNA viruses, including tick-, mosquito- and unknown-vector-borne flaviviruses. A novel flavivirus was detected in pools of Aedes vexans nipponii (n=1) and Aedes esoensis (n=3) collected in 2012 and 2013 near the demilitarized zone (DMZ), Republic of Korea (ROK). Phylogenetic analyses of the NS5, E gene and complete polyprotein coding sequence (CDS) showed that the novel virus fell within the Aedes-borne flaviviruses (ABFVs), with nucleotide identity ranging from 57.8-75.1 %, 46.1-74.2 % and 51.1-76.2 %, respectively. While the novel ABFV was distant from other flaviviruses within the group, it formed a clade with Ilomantsi virus (ILOV). Sequence alignments of the partial NS5 gene, full-length E gene and polyprotein CDS between the novel virus and ILOV showed approximately 76.2 % nucleotide identity and 90 % amino acid identity, respectively. The ABFV identified in Aedes mosquitoes from the ROK is a novel ABFV based on the sequence analyses and is designated as Panmunjeom flavivirus (PANFV).
Detection of a divergent variant of grapevine virus F by next-generation sequencing.

PubMed

Molenaar, Nicholas; Burger, Johan T; Maree, Hans J

2015-08-01

The complete genome sequence of a South African isolate of grapevine virus F (GVF) is presented. It was first detected by metagenomic next-generation sequencing of field samples and validated through direct Sanger sequencing. The genome sequence of GVF isolate V5 consists of 7539 nucleotides and contains a poly(A) tail. It has a typical vitivirus genome arrangement that comprises five open reading frames (ORFs), which share only 88.96 % nucleotide sequence identity with the existing complete GVF genome sequence (JX105428).
HSP70 gene expression in Mytilus galloprovincialis hemocytes is triggered by moderate heat shock and Vibrio anguillarum, but not by V. splendidus or Micrococcus lysodeikticus.

PubMed

Cellura, Cinzia; Toubiana, Mylène; Parrinello, Nicolo; Roch, Philippe

2006-01-01

Complete sequence of HSP70 cDNA from the mussel, Mytilus galloprovincialis was established before quantifying its expression following moderate heat shock or injection of heat-killed bacteria. HSP70 cDNA is comprised of 2378 bp including one ORF of 654 aa, with a predicted 70 bp 5'-UTR and a 343 bp 3'-UTR (GenBank, 18 Jan 05, AY861684). Alignment identity ranged from 89% for Crassostrea ariakensis to 72% for C. virginica. Curiously, HSP70 gene and cDNA sequences from M. galloprovincialis, deposited later (03 and 27 May), show only 73% identity with the present sequence. Meanwhile, characteristic motifs of the HSP70 family were located in conserved positions. Expression of HSP70 gene was quantified on circulating hemocyte mRNA using Q-PCR after RT using random hexaprimers. Housekeeping gene was 28S rRNA. Four stresses were applied: heat shock that consisted of immersing mussels for 90 min at 30 degrees C and returning them to 20 degrees C sea water, one injection of heat-killed Gram-negative bacteria, Vibrio splendidus LGP32, one injection of heat-killed Gram-negative bacteria Vibrio anguillarum, one injection of heat-killed Gram-positive bacteria Micrococcus lysodeikticus. We found no significant modification of 28S rRNA gene expression. Significant increase of 5.2 +/- 0.4 fold the ratio HSP70/28S rRNA was observed 6 h after heat shock and was maximum at 15 h (6.1 +/- 1.1), and still significant after 24 h (1.7 +/- 0.03). Similarly, injecting V. anguillarum resulted in a significant increase of 2.7 +/- 0.1 after 12 h. Expression was maximum after 48 h (5.2 +/- 0.05) and returned to baseline after 72 h. In contrast, injecting V. splendidus or M. lysodeikticus failed to significantly modulate HSP70 gene expression at least during the first 3 days post-injection. Consequently, mussel hemocytes appeared to discriminate between pathogenic and non-pathogenic Vibrios, as well as between Gram-negative and Gram-positive bacteria.
Random Amplification and Pyrosequencing for Identification of Novel Viral Genome Sequences

PubMed Central

Hang, Jun; Forshey, Brett M.; Kochel, Tadeusz J.; Li, Tao; Solórzano, Víctor Fiestas; Halsey, Eric S.; Kuschner, Robert A.

2012-01-01

ssRNA viruses have high levels of genomic divergence, which can lead to difficulty in genomic characterization of new viruses using traditional PCR amplification and sequencing methods. In this study, random reverse transcription, anchored random PCR amplification, and high-throughput pyrosequencing were used to identify orthobunyavirus sequences from total RNA extracted from viral cultures of acute febrile illness specimens. Draft genome sequence for the orthobunyavirus L segment was assembled and sequentially extended using de novo assembly contigs from pyrosequencing reads and orthobunyavirus sequences in GenBank as guidance. Accuracy and continuous coverage were achieved by mapping all reads to the L segment draft sequence. Subsequently, RT-PCR and Sanger sequencing were used to complete the genome sequence. The complete L segment was found to be 6936 bases in length, encoding a 2248-aa putative RNA polymerase. The identified L segment was distinct from previously published South American orthobunyaviruses, sharing 63% and 54% identity at the nucleotide and amino acid level, respectively, with the complete Oropouche virus L segment and 73% and 81% identity at the nucleotide and amino acid level, respectively, with a partial Caraparu virus L segment. The result demonstrated the effectiveness of a sequence-independent amplification and next-generation sequencing approach for obtaining complete viral genomes from total nucleic acid extracts and its use in pathogen discovery. PMID:22468136
Control processes through the suppression of the automatic response activation triggered by task-irrelevant information in the Simon-type tasks.

PubMed

Kim, Sanga; Lee, Sang Ho; Cho, Yang Seok

2015-11-01

The congruency sequence effect, one of the indices of cognitive control, refers to a smaller congruency effect after an incongruent than congruent trial. Although the effect has been found across a variety of conflict tasks, there is not yet agreement on the underlying mechanism. The present study investigated the mechanism underlying cognitive control by using a cross-task paradigm. In Experiments 1, 2, and 3, participants performed a modified Simon task and a spatial Stroop task alternately in a trial-by-trial manner. The task-irrelevant dimension of the two tasks was perceptually and conceptually identical in Experiment 1, whereas it was perceptually different but conceptually identical in Experiment 2. The response sets for both tasks were different in Experiment 3. In Experiment 4, participants performed two Simon tasks with different task-relevant dimensions. In all experiments in which the task-irrelevant dimension and response mode were shared, significant congruency sequence effects were found between the two different congruencies, indicating that Simon-type conflicts were resolved by a control mechanism, which is specific to an abstract task-irrelevant stimulus spatial dimension. Copyright © 2015 Elsevier B.V. All rights reserved.
On the apparent positions of T Tauri stars in the H-R diagram

DOE Office of Scientific and Technical Information (OSTI.GOV)

Kenyon, S.J.; Hartmann, L.W.

1990-01-01

The spread in apparent luminosities of T Tauri stars caused by occultation and emission from protostellar disks is investigated. A random distribution of disk inclination angles, coupled with a plausible range of accretion rates, introduces a significant scatter in apparent luminosities for intrinsically identical stars. The observed dispersion of luminosities for K7-M1 Hayashi track stars thought to have disks in Taurus-Auriga is similar to predictions of the simple accretion disk model, which suggets that age determinations form many pre-main-sequence stars are uncertain. The results also suggest that Stahler's birthline for convective track pre-main-sequence stars may be located at slightly lowermore » luminosities than previously thought. 38 refs.« less
Identification and expression analysis of a novel R-type lectin from the coleopteran beetle, Tenebrio molitor.

PubMed

Kim, Dong Hyun; Patnaik, Bharat Bhusan; Seo, Gi Won; Kang, Seong Min; Lee, Yong Seok; Lee, Bok Luel; Han, Yeon Soo

2013-11-01

We have identified novel ricin-type (R-type) lectin by sequencing of random clones from cDNA library of the coleopteran beetle, Tenebrio molitor. The cDNA sequence is comprised of 495 bp encoding a protein of 164 amino acid residues and shows 49% identity with galectin of Tribolium castaneum. Bioinformatics analysis shows that the amino acid residues from 35 to 162 belong to ricin-type beta-trefoil structure. The transcript was significantly upregulated after early hours of injection with peptidoglycans derived from Gram (+) and Gram (-) bacteria, beta-1, 3 glucan from fungi and an intracellular pathogen, Listeria monocytogenes suggesting putative function in innate immunity. Copyright © 2013 Elsevier Inc. All rights reserved.
SCPRED: accurate prediction of protein structural class for sequences of twilight-zone similarity with predicting sequences.

PubMed

Kurgan, Lukasz; Cios, Krzysztof; Chen, Ke

2008-05-01

Protein structure prediction methods provide accurate results when a homologous protein is predicted, while poorer predictions are obtained in the absence of homologous templates. However, some protein chains that share twilight-zone pairwise identity can form similar folds and thus determining structural similarity without the sequence similarity would be desirable for the structure prediction. The folding type of a protein or its domain is defined as the structural class. Current structural class prediction methods that predict the four structural classes defined in SCOP provide up to 63% accuracy for the datasets in which sequence identity of any pair of sequences belongs to the twilight-zone. We propose SCPRED method that improves prediction accuracy for sequences that share twilight-zone pairwise similarity with sequences used for the prediction. SCPRED uses a support vector machine classifier that takes several custom-designed features as its input to predict the structural classes. Based on extensive design that considers over 2300 index-, composition- and physicochemical properties-based features along with features based on the predicted secondary structure and content, the classifier's input includes 8 features based on information extracted from the secondary structure predicted with PSI-PRED and one feature computed from the sequence. Tests performed with datasets of 1673 protein chains, in which any pair of sequences shares twilight-zone similarity, show that SCPRED obtains 80.3% accuracy when predicting the four SCOP-defined structural classes, which is superior when compared with over a dozen recent competing methods that are based on support vector machine, logistic regression, and ensemble of classifiers predictors. The SCPRED can accurately find similar structures for sequences that share low identity with sequence used for the prediction. The high predictive accuracy achieved by SCPRED is attributed to the design of the features, which are capable of separating the structural classes in spite of their low dimensionality. We also demonstrate that the SCPRED's predictions can be successfully used as a post-processing filter to improve performance of modern fold classification methods.
SCPRED: Accurate prediction of protein structural class for sequences of twilight-zone similarity with predicting sequences

PubMed Central

Kurgan, Lukasz; Cios, Krzysztof; Chen, Ke

2008-01-01

Background Protein structure prediction methods provide accurate results when a homologous protein is predicted, while poorer predictions are obtained in the absence of homologous templates. However, some protein chains that share twilight-zone pairwise identity can form similar folds and thus determining structural similarity without the sequence similarity would be desirable for the structure prediction. The folding type of a protein or its domain is defined as the structural class. Current structural class prediction methods that predict the four structural classes defined in SCOP provide up to 63% accuracy for the datasets in which sequence identity of any pair of sequences belongs to the twilight-zone. We propose SCPRED method that improves prediction accuracy for sequences that share twilight-zone pairwise similarity with sequences used for the prediction. Results SCPRED uses a support vector machine classifier that takes several custom-designed features as its input to predict the structural classes. Based on extensive design that considers over 2300 index-, composition- and physicochemical properties-based features along with features based on the predicted secondary structure and content, the classifier's input includes 8 features based on information extracted from the secondary structure predicted with PSI-PRED and one feature computed from the sequence. Tests performed with datasets of 1673 protein chains, in which any pair of sequences shares twilight-zone similarity, show that SCPRED obtains 80.3% accuracy when predicting the four SCOP-defined structural classes, which is superior when compared with over a dozen recent competing methods that are based on support vector machine, logistic regression, and ensemble of classifiers predictors. Conclusion The SCPRED can accurately find similar structures for sequences that share low identity with sequence used for the prediction. The high predictive accuracy achieved by SCPRED is attributed to the design of the features, which are capable of separating the structural classes in spite of their low dimensionality. We also demonstrate that the SCPRED's predictions can be successfully used as a post-processing filter to improve performance of modern fold classification methods. PMID:18452616
Detection of Grapevine Leafroll-associated virus 7 using real-time qRT-PCR and conventional RT-PCR

USDA-ARS?s Scientific Manuscript database

Nine isolates of Grapevine Leafroll-associated Virus 7 (GLRaV-7) from California have been sequenced to design more sensitive molecular diagnostic tools. These sequences were from the coat protein (CP) and the homologous heat shock protein (hHSP70) genes. Sequence identity among these isolates rang...

Sequences of Zika Virus Genomes from a Pediatric Cohort in Nicaragua.

PubMed

Oldfield, Lauren M; Fedorova, Nadia; Puri, Vinita; Shrivastava, Susmita; Amedeo, Paolo; Durbin, Alan; Rocchi, Iara; Williams, Torrey; Shabman, Reed S; Tan, Gene S; Balmaseda, Angel; Kuan, Guillermina; Saborio, Saira; Gordon, Aubree; Harris, Eva; Pickett, Brett E

2018-06-14

We report here the whole-genome sequence of 11 Zika virus (ZIKV) samples from six pediatric patients in Nicaragua. Serum samples were collected, and ZIKV was isolated in tissue culture. Both serum and virus isolates were sequenced. The consensus ZIKV genomes are greater than 99% identical to each other. Copyright © 2018 Oldfield et al.
Arrays of nucleic acid probes on biological chips

DOEpatents

Chee, Mark; Cronin, Maureen T.; Fodor, Stephen P. A.; Huang, Xiaohua X.; Hubbell, Earl A.; Lipshutz, Robert J.; Lobban, Peter E.; Morris, MacDonald S.; Sheldon, Edward L.

1998-11-17

DNA chips containing arrays of oligonucleotide probes can be used to determine whether a target nucleic acid has a nucleotide sequence identical to or different from a specific reference sequence. The array of probes comprises probes exactly complementary to the reference sequence, as well as probes that differ by one or more bases from the exactly complementary probes.
Complete Genome Sequences of Bacillus Phages Janet and OTooleKemple52.

PubMed

Kent, Brenna; Raymond, Thomas; Mosier, Philip D; Johnson, Allison A

2018-05-10

We report here the genome sequences of two novel Bacillus cereus group-infecting bacteriophages, Janet and OTooleKemple52. These bacteriophages are double-stranded DNA-containing Myoviridae isolated from soil samples. While their genomes share a high degree of sequence identity with one another, their host preferences are unique. Copyright © 2018 Kent et al.
Isolation and genetic characterization of an Actinobacillus pleuropneumoniae serovar K12:O3 strain.

PubMed

Ito, Hiroya; Matsumoto, Atsuko

2015-01-01

An atypical Actinobacillus pleuropneumoniae serovar 12 strain, termed QAS106, was isolated from a clinical case of porcine pleuropneumonia in Japan. An immunodiffusion (ID) test identified the strain as serovar 12. However, the ID test also demonstrated that strain QAS106 shared antigenic determinants with both the serovar 3 and 15 reference strains. Strain QAS106 was positive in the capsular serovar 12-specific polymerase chain reaction (PCR) assay, while the PCR toxin gene profiling and omlA PCR typing assays indicated that strain QAS106 was similar to serovar 3. The nucleotide sequence of the 16S ribosomal DNA (rDNA) of strain QAS106 was identical with that of serovars 3 and 12, but it showed 99.7% identity with that of serovar 15. Nucleotide sequence analysis revealed that genes involved in biosynthesis of the capsular polysaccharide (CPS) of strain QAS106 were identical to those of serovar 12 at the amino acid level. On the other hand, strain QAS106 would express putative proteins involved in the biosynthesis of lipopolysaccharide (LPS) O-polysaccharide (O-PS), the amino acid sequences of which were identical or nearly identical to those of serovars 3 and 15. In conclusion, strain QAS106 should be recognized as K12:O3, even though typical serovar 12 strains are K12:O12. The emergence of an atypical A. pleuropneumoniae serovar 12 strain expressing a rare combination of CPS and O-PS antigens would hamper precise serodiagnosis by the use of either CPS- or LPS-based serodiagnostic methodology alone. © 2014 The Author(s).
Detection and genome characterization of four novel bat hepadnaviruses and a hepevirus in China.

PubMed

Wang, Bo; Yang, Xing-Lou; Li, Wen; Zhu, Yan; Ge, Xing-Yi; Zhang, Li-Biao; Zhang, Yun-Zhi; Bock, Claus-Thomas; Shi, Zheng-Li

2017-02-22

In recent years, novel hepadnaviruses, hepeviruses, hepatoviruses, and hepaciviruses have been discovered in various species of bat around the world, indicating that bats may act as natural reservoirs for these hepatitis viruses. In order to further assess the distribution of hepatitis viruses in bat populations in China, we tested the presence of these hepatitis viruses in our archived bat liver samples that originated from several bat species and various geographical regions in China. A total of 78 bat liver samples (involving two families, five genera, and 17 species of bat) were examined using nested or heminested reverse transcription PCR (RT-PCR) with degenerate primers. Full-length genomic sequences of two virus strains were sequenced followed by phylogenetic analyses. Four samples were positive for hepadnavirus, only one was positive for hepevirus, and none of the samples were positive for hepatovirus or hepacivirus. The hepadnaviruses were discovered in the horseshoe bats, Rhinolophus sinicus and Rhinolophus affinis, and the hepevirus was found in the whiskered bat Myotis davidii. The full-length genomic sequences were determined for one of the two hepadnaviruses identified in R. sinicus (designated BtHBVRs3364) and the hepevirus (designated BtHEVMd2350). A sequence identity analysis indicated that BtHBVRs3364 had the highest degree of identity with a previously reported hepadnavirus from the roundleaf bat, Hipposideros pomona, from China, and BtHEVMd2350 had the highest degree of identity with a hepevirus found in the serotine bat, Eptesicus serotinus, from Germany, but it exhibited high levels of divergence at both the nucleotide and the amino acid levels. This is the first study to report that the Chinese horseshoe bat and the Chinese whiskered bat have been found to carry novel hepadnaviruses and a novel hepevirus, respectively. The discovery of BtHBVRs3364 further supports the significance of host switches evolution while opposing the co-evolutionary theory associated with hepadnaviruses. According to the latest criterion of the International Committee on Taxonomy of Viruses (ICTV), we hypothesize that BtHEVMd2350 represents an independent genotype within the species Orthohepevirus D of the family Hepeviridae.
Experimental induction of proventricular dilatation disease in cockatiels (Nymphicus hollandicus) inoculated with brain homogenates containing avian bornavirus 4

PubMed Central

Gancz, Ady Y; Kistler, Amy L; Greninger, Alexander L; Farnoushi, Yigal; Mechani, Sara; Perl, Shmuel; Berkowitz, Asaf; Perez, Noa; Clubb, Susan; DeRisi, Joseph L; Ganem, Don; Lublin, Avishai

2009-01-01

Background Proventricular dilatation disease (PDD) is a fatal disorder of psittacine birds worldwide. The disease is characterized by lymphoplasmacytic infiltration of the central and peripheral nervous systems, leading to gastrointestinal motility and/or central nervous system dysfunction. Recently, we detected a significant association between avian bornavirus (ABV) infection and clinical signs of PDD in psittacines. However, it remains unclear whether ABV infection actually causes PDD. To address this question, we examined the impact of ABV inoculation on the cockatiel (Nymphicus hollandicus). Results Five cockatiels were inoculated via multiple routes (intramuscular, intraocular, intranasal, and oral) with a brain homogenate derived from either a PDD(+) avian bornavirus 4 (ABV4) (+) case (n = 3 inoculees) or from a PDD(-) ABV(-) control (n = 2 inoculees). The control birds remained free of clinical or pathological signs of PDD, and tested ABV(-) by RT-PCR and immunohistochemistry (IHC). In contrast, all three cockatiels inoculated with ABV4(+) brain homogenate developed gross and microscopic PDD lesions, and two exhibited overt clinical signs. In numerous tissues, ABV RT-PCR and sequence analysis demonstrated the presence of ABV4 RNA nearly identical to that in the inoculum. ABV was detected in the central nervous system of the three ABV-inoculees by IHC. Pyrosequencing to investigate the viral flora in the ABV4(+) inoculum uncovered 7 unique reads sharing 73–100% nucleotide sequence identity with previously identified ABV sequences and 24 reads sharing 40–89% amino acid sequence identity with viruses in the Retroviridae and Astroviridae families. Of these candidate viral species, only ABV RNA was recovered from tissues of the inoculated birds. Conclusion In this study, the clinical and pathological manifestations of PDD were induced by inoculation of cockatiels with brain homogenates containing avian bornavirus 4. By using high throughput pyrosequencing an in-depth view of the viral content of the inoculum was achieved, revealing that of 3 candidate virus families detected, only the presence of ABV RNA correlated with the development of PDD. This study provides evidence of a causal association between ABV4 infection and PDD in cockatiels. PMID:19589169
How to Choose the Suitable Template for Homology Modelling of GPCRs: 5-HT7 Receptor as a Test Case.

PubMed

Shahaf, Nir; Pappalardo, Matteo; Basile, Livia; Guccione, Salvatore; Rayan, Anwar

2016-09-01

G protein-coupled receptors (GPCRs) are a super-family of membrane proteins that attract great pharmaceutical interest due to their involvement in almost every physiological activity, including extracellular stimuli, neurotransmission, and hormone regulation. Currently, structural information on many GPCRs is mainly obtained by the techniques of computer modelling in general and by homology modelling in particular. Based on a quantitative analysis of eighteen antagonist-bound, resolved structures of rhodopsin family "A" receptors - also used as templates to build 153 homology models - it was concluded that a higher sequence identity between two receptors does not guarantee a lower RMSD between their structures, especially when their pair-wise sequence identity (within trans-membrane domain and/or in binding pocket) lies between 25 % and 40 %. This study suggests that we should consider all template receptors having a sequence identity ≤50 % with the query receptor. In fact, most of the GPCRs, compared to the currently available resolved structures of GPCRs, fall within this range and lack a correlation between structure and sequence. When testing suitability for structure-based drug design, it was found that choosing as a template the most similar resolved protein, based on sequence resemblance only, led to unsound results in many cases. Molecular docking analyses were carried out, and enrichment factors as well as attrition rates were utilized as criteria for assessing suitability for structure-based drug design. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
nuID: a universal naming scheme of oligonucleotides for Illumina, Affymetrix, and other microarrays

PubMed Central

Du, Pan; Kibbe, Warren A; Lin, Simon M

2007-01-01

Background Oligonucleotide probes that are sequence identical may have different identifiers between manufacturers and even between different versions of the same company's microarray; and sometimes the same identifier is reused and represents a completely different oligonucleotide, resulting in ambiguity and potentially mis-identification of the genes hybridizing to that probe. Results We have devised a unique, non-degenerate encoding scheme that can be used as a universal representation to identify an oligonucleotide across manufacturers. We have named the encoded representation 'nuID', for nucleotide universal identifier. Inspired by the fact that the raw sequence of the oligonucleotide is the true definition of identity for a probe, the encoding algorithm uniquely and non-degenerately transforms the sequence itself into a compact identifier (a lossless compression). In addition, we added a redundancy check (checksum) to validate the integrity of the identifier. These two steps, encoding plus checksum, result in an nuID, which is a unique, non-degenerate, permanent, robust and efficient representation of the probe sequence. For commercial applications that require the sequence identity to be confidential, we have an encryption schema for nuID. We demonstrate the utility of nuIDs for the annotation of Illumina microarrays, and we believe it has universal applicability as a source-independent naming convention for oligomers. Reviewers This article was reviewed by Itai Yanai, Rong Chen (nominated by Mark Gerstein), and Gregory Schuler (nominated by David Lipman). PMID:17540033
Sequence and features of the tryptophan operon of Vibrio parahemolyticus.

PubMed

Crawford, I P; Han, C Y; Silverman, M

1991-01-01

The nucleotide sequence of the trp operon of the marine enteric bacterium Vibrio parahemolyticus is presented. The gene order E, G, D, C(F), B, A is identical to that of other enterics. The structural genes of the operon are preceded by a long leader region encoding a 41-residue peptide containing five tryptophan residues. The organization of the leader region suggests that transcription of the operon is subject to attenuation control. The promoter-operator region of the V. parahemolyticus trp operon is almost identical to the corresponding promoter-operator of E. coli. The similarities suggest that promoter strength and operator function are identical in the two species, and that transcription initiation is regulated by repression. The operon appears to lack the internal promoter within trpD that is common in terrestrial enteric species.
Post-transcriptional modifications in the small subunit ribosomal RNA from Thermotoga maritima, including presence of a novel modified cytidine

PubMed Central

Guymon, Rebecca; Pomerantz, Steven C.; Ison, J. Nicholas; Crain, Pamela F.; McCloskey, James A.

2007-01-01

Post-transcriptional modifications of RNA are nearly ubiquitous in the principal RNAs involved in translation. However, in the case of rRNA the functional roles of modification are far less established than for tRNA, and are subject to less knowledge in terms of specific nucleoside identities and their sequence locations. Post-transcriptional modifications have been studied in the SSU rRNA from Thermotoga maritima (optimal growth 80°C), one of the most deeply branched organisms in the Eubacterial phylogenetic tree. A total of 10 different modified nucleosides were found, the greatest number reported for bacterial SSU rRNA, occupying a net of ∼14 sequence sites, compared with a similar number of sites recently reported for Thermus thermophilus and 11 for Escherichia coli. The relatively large number of modifications in Thermotoga offers modest support for the notion that thermophile rRNAs are more extensively modified than those from mesophiles. Seven of the Thermotoga modified sites are identical (location and identity) to those in E. coli. An unusual derivative of cytidine was found, designated N-330 (M r 330.117), and was sequenced to position 1404 in the decoding region of the rRNA. It was unexpectedly found to be identical to an earlier reported nucleoside of unknown structure at the same location in the SSU RNA of the archaeal mesophile Haloferax volcanii. PMID:17255199
High levels of p19/nm23 protein in neuroblastoma are associated with advanced stage disease and with N-myc gene amplification.

PubMed Central

Hailat, N; Keim, D R; Melhem, R F; Zhu, X X; Eckerskorn, C; Brodeur, G M; Reynolds, C P; Seeger, R C; Lottspeich, F; Strahler, J R

1991-01-01

The gene encoding a novel protein designated nm23-H1, which was recently identified as identical to the A subunit of nucleotide diphosphate kinase from human erythrocytes, has been proposed to play a role in tumor metastasis suppression. We report that untreated neuroblastoma tumors contain a cellular polypeptide (Mr = 19,000) designated p19, identified in two-dimensional electrophoretic gels, which occurs at significantly higher levels (P = 0.0001) in primary tumors containing amplified N-myc gene. The partial amino acid sequence obtained for p19 is identical to the sequence of the human nm23-H1 protein. An antibody to the A subunit of erythrocyte nucleotide diphosphate kinase reacted exclusively with p19. In this study, significantly higher levels of p19/nm23 occurred in primary neuroblastoma tumors from patients with advanced stages (III and IV) relative to tumors from patients with limited stages (I and II) of the disease. Even among patients with a single copy of the N-myc gene, tumors from patients with stages III and IV had statistically significantly higher levels of p19/nm23 than tumors from patients with stages I and II. Our findings indicate that, in contrast to a proposed role for nm23-H1 as a tumor metastasis suppressor, increased p19/nm23 protein in neuroblastoma is correlated with features of the disease that are associated with aggressive tumors. Therefore, nm23-H1 may have distinct if not opposite roles in different tumors. Images PMID:2056128
Complete Genome Sequence of a CTX-M-15-Producing Escherichia coli Strain from the H30Rx Subclone of Sequence Type 131 from a Patient with Recurrent Urinary Tract Infections, Closely Related to a Lethal Urosepsis Isolate from the Patient’s Sister

PubMed Central

Johnson, Timothy J.; Liu, Cindy M.; Sokurenko, Evgeni; Kisiela, Dagmara I.; Paul, Sandip; Andersen, Paal; Johnson, James R.; Price, Lance B.

2016-01-01

We report here the complete genome sequence, including five plasmid sequences, of Escherichia coli sequence type 131 (ST131) strain JJ1887. The strain was isolated in 2007 in the United States from a patient with recurrent cystitis, whose caregiver sister died from urosepsis caused by a nearly identical strain. PMID:27174264
The complete sequence of Cymbidium mosaic virus from Vanilla fragrans in Hainan, China.

PubMed

He, Zhen; Jiang, Dongmei; Liu, Aiqin; Sang, Liwei; Li, Wenfeng; Li, Shifang

2011-06-01

The complete nucleotide sequence of Cymbidium mosaic virus (CymMV) isolated from vanilla in Hainan province, China was determined for the first time. It comprised 6,224 nucleotides; sequence analysis suggested that the isolate we obtained was a member of the genus Potexvirus, and its sequence shared 86.67-96.61% identities with previously reported sequences. Phylogenetic analysis suggested that CymMV from vanilla fragrans was clustered into subgroup A and the isolates in this subgroup displayed little regional difference.
Improved Model for Predicting the Free Energy Contribution of Dinucleotide Bulges to RNA Duplex Stability.

PubMed

Tomcho, Jeremy C; Tillman, Magdalena R; Znosko, Brent M

2015-09-01

Predicting the secondary structure of RNA is an intermediate in predicting RNA three-dimensional structure. Commonly, determining RNA secondary structure from sequence uses free energy minimization and nearest neighbor parameters. Current algorithms utilize a sequence-independent model to predict free energy contributions of dinucleotide bulges. To determine if a sequence-dependent model would be more accurate, short RNA duplexes containing dinucleotide bulges with different sequences and nearest neighbor combinations were optically melted to derive thermodynamic parameters. These data suggested energy contributions of dinucleotide bulges were sequence-dependent, and a sequence-dependent model was derived. This model assigns free energy penalties based on the identity of nucleotides in the bulge (3.06 kcal/mol for two purines, 2.93 kcal/mol for two pyrimidines, 2.71 kcal/mol for 5'-purine-pyrimidine-3', and 2.41 kcal/mol for 5'-pyrimidine-purine-3'). The predictive model also includes a 0.45 kcal/mol penalty for an A-U pair adjacent to the bulge and a -0.28 kcal/mol bonus for a G-U pair adjacent to the bulge. The new sequence-dependent model results in predicted values within, on average, 0.17 kcal/mol of experimental values, a significant improvement over the sequence-independent model. This model and new experimental values can be incorporated into algorithms that predict RNA stability and secondary structure from sequence.
Rank-order-selective neurons form a temporal basis set for the generation of motor sequences.

PubMed

Salinas, Emilio

2009-04-08

Many behaviors are composed of a series of elementary motor actions that must occur in a specific order, but the neuronal mechanisms by which such motor sequences are generated are poorly understood. In particular, if a sequence consists of a few motor actions, a primate can learn to replicate it from memory after practicing it for just a few trials. How do the motor and premotor areas of the brain assemble motor sequences so fast? The network model presented here reveals part of the solution to this problem. The model is based on experiments showing that, during the performance of motor sequences, some cortical neurons are always activated at specific times, regardless of which motor action is being executed. In the model, a population of such rank-order-selective (ROS) cells drives a layer of downstream motor neurons so that these generate specific movements at different times in different sequences. A key ingredient of the model is that the amplitude of the ROS responses must be modulated by sequence identity. Because of this modulation, which is consistent with experimental reports, the network is able not only to produce multiple sequences accurately but also to learn a new sequence with minimal changes in connectivity. The ROS neurons modulated by sequence identity thus serve as a basis set for constructing arbitrary sequences of motor responses downstream. The underlying mechanism is analogous to the mechanism described in parietal areas for generating coordinate transformations in the spatial domain.
RANK-ORDER-SELECTIVE NEURONS FORM A TEMPORAL BASIS SET FOR THE GENERATION OF MOTOR SEQUENCES

PubMed Central

Salinas, Emilio

2009-01-01

Many behaviors are composed of a series of elementary motor actions that must occur in a specific order, but the neuronal mechanisms by which such motor sequences are generated are poorly understood. In particular, if a sequence consists of a few motor actions, a primate can learn to replicate it from memory after practicing it for just a few trials. How do the motor and premotor areas of the brain assemble motor sequences so fast? The network model presented here reveals part of the solution to this problem. The model is based on experiments showing that, during the performance of motor sequences, some cortical neurons are always activated at specific times, regardless of which motor action is being executed. In the model, a population of such rank-order-selective (ROS) cells drives a layer of downstream motor neurons so that these generate specific movements at different times in different sequences. A key ingredient of the model is that the amplitude of the ROS responses must be modulated by sequence identity. Because of this modulation, which is consistent with experimental reports, the network is able not only to produce multiple sequences accurately but also to learn a new sequence with minimal changes in connectivity. The ROS neurons modulated by sequence identity thus serve as a basis set for constructing arbitrary sequences of motor responses downstream. The underlying mechanism is analogous to the mechanism described in parietal areas for generating coordinate transformations in the spatial domain. PMID:19357265
Global versus Local Regulatory Roles for Lrp-Related Proteins: Haemophilus influenzae as a Case Study

PubMed Central

Friedberg, Devorah; Midkiff, Michael; Calvo, Joseph M.

2001-01-01

Lrp (leucine-responsive regulatory protein) plays a global regulatory role in Escherichia coli, affecting expression of dozens of operons. Numerous lrp-related genes have been identified in different bacteria and archaea, including asnC, an E. coli gene that was the first reported member of this family. Pairwise comparisons of amino acid sequences of the corresponding proteins shows an average sequence identity of only 29% for the vast majority of comparisons. By contrast, Lrp-related proteins from enteric bacteria show more than 97% amino acid identity. Is the global regulatory role associated with E. coli Lrp limited to enteric bacteria? To probe this question we investigated LrfB, an Lrp-related protein from Haemophilus influenzae that shares 75% sequence identity with E. coli Lrp (highest sequence identity among 42 sequences compared). A strain of H. influenzae having an lrfB null allele grew at the wild-type growth rate but with a filamentous morphology. A comparison of two-dimensional (2D) electrophoretic patterns of proteins from parent and mutant strains showed only two differences (comparable studies with lrp+ and lrp E. coli strains by others showed 20 differences). The abundance of LrfB in H. influenzae, estimated by Western blotting experiments, was about 130 dimers per cell (compared to 3,000 dimers per E. coli cell). LrfB expressed in E. coli replaced Lrp as a repressor of the lrp gene but acted only to a limited extent as an activator of the ilvIH operon. Thus, although LrfB resembles Lrp sufficiently to perform some of its functions, its low abundance is consonant with a more local role in regulating but a few genes, a view consistent with the results of the 2D electrophoretic analysis. We speculate that an Lrp having a global regulatory role evolved to help enteric bacteria adapt to their ecological niches and that it is unlikely that Lrp-related proteins in other organisms have a broad regulatory function. PMID:11395465
Cloning and sequence analysis of a cDNA clone coding for the mouse GM2 activator protein.

PubMed Central

Bellachioma, G; Stirling, J L; Orlacchio, A; Beccari, T

1993-01-01

A cDNA (1.1 kb) containing the complete coding sequence for the mouse GM2 activator protein was isolated from a mouse macrophage library using a cDNA for the human protein as a probe. There was a single ATG located 12 bp from the 5' end of the cDNA clone followed by an open reading frame of 579 bp. Northern blot analysis of mouse macrophage RNA showed that there was a single band with a mobility corresponding to a size of 2.3 kb. We deduce from this that the mouse mRNA, in common with the mRNA for the human GM2 activator protein, has a long 3' untranslated sequence of approx. 1.7 kb. Alignment of the mouse and human deduced amino acid sequences showed 68% identity overall and 75% identity for the sequence on the C-terminal side of the first 31 residues, which in the human GM2 activator protein contains the signal peptide. Hydropathicity plots showed great similarity between the mouse and human sequences even in regions of low sequence similarity. There is a single N-glycosylation site in the mouse GM2 activator protein sequence (Asn151-Phe-Thr) which differs in its location from the single site reported in the human GM2 activator protein sequence (Asn63-Val-Thr). Images Figure 1 PMID:7689829
Statistical theory for protein combinatorial libraries. Packing interactions, backbone flexibility, and the sequence variability of a main-chain structure.

PubMed

Kono, H; Saven, J G

2001-02-23

Combinatorial experiments provide new ways to probe the determinants of protein folding and to identify novel folding amino acid sequences. These types of experiments, however, are complicated both by enormous conformational complexity and by large numbers of possible sequences. Therefore, a quantitative computational theory would be helpful in designing and interpreting these types of experiment. Here, we present and apply a statistically based, computational approach for identifying the properties of sequences compatible with a given main-chain structure. Protein side-chain conformations are included in an atom-based fashion. Calculations are performed for a variety of similar backbone structures to identify sequence properties that are robust with respect to minor changes in main-chain structure. Rather than specific sequences, the method yields the likelihood of each of the amino acids at preselected positions in a given protein structure. The theory may be used to quantify the characteristics of sequence space for a chosen structure without explicitly tabulating sequences. To account for hydrophobic effects, we introduce an environmental energy that it is consistent with other simple hydrophobicity scales and show that it is effective for side-chain modeling. We apply the method to calculate the identity probabilities of selected positions of the immunoglobulin light chain-binding domain of protein L, for which many variant folding sequences are available. The calculations compare favorably with the experimentally observed identity probabilities.
Nanopore DNA Sequencing and Genome Assembly on the International Space Station.

PubMed

Castro-Wallace, Sarah L; Chiu, Charles Y; John, Kristen K; Stahl, Sarah E; Rubins, Kathleen H; McIntyre, Alexa B R; Dworkin, Jason P; Lupisella, Mark L; Smith, David J; Botkin, Douglas J; Stephenson, Timothy A; Juul, Sissel; Turner, Daniel J; Izquierdo, Fernando; Federman, Scot; Stryke, Doug; Somasekar, Sneha; Alexander, Noah; Yu, Guixia; Mason, Christopher E; Burton, Aaron S

2017-12-21

We evaluated the performance of the MinION DNA sequencer in-flight on the International Space Station (ISS), and benchmarked its performance off-Earth against the MinION, Illumina MiSeq, and PacBio RS II sequencing platforms in terrestrial laboratories. Samples contained equimolar mixtures of genomic DNA from lambda bacteriophage, Escherichia coli (strain K12, MG1655) and Mus musculus (female BALB/c mouse). Nine sequencing runs were performed aboard the ISS over a 6-month period, yielding a total of 276,882 reads with no apparent decrease in performance over time. From sequence data collected aboard the ISS, we constructed directed assemblies of the ~4.6 Mb E. coli genome, ~48.5 kb lambda genome, and a representative M. musculus sequence (the ~16.3 kb mitochondrial genome), at 100%, 100%, and 96.7% consensus pairwise identity, respectively; de novo assembly of the E. coli genome from raw reads yielded a single contig comprising 99.9% of the genome at 98.6% consensus pairwise identity. Simulated real-time analyses of in-flight sequence data using an automated bioinformatic pipeline and laptop-based genomic assembly demonstrated the feasibility of sequencing analysis and microbial identification aboard the ISS. These findings illustrate the potential for sequencing applications including disease diagnosis, environmental monitoring, and elucidating the molecular basis for how organisms respond to spaceflight.

Diversity of the P2 protein among nontypeable Haemophilus influenzae isolates.

PubMed Central

Bell, J; Grass, S; Jeanteur, D; Munson, R S

1994-01-01

The genes for outer membrane protein P2 of four nontypeable Haemophilus influenzae strains were cloned and sequenced. The derived amino acid sequences were compared with the outer membrane protein P2 sequence from H. influenzae type b MinnA and the sequences of P2 from three additional nontypeable H. influenzae strains. The sequences were 76 to 94% identical. The sequences had regions with considerable variability separated by regions which were highly conserved. The variable regions mapped to putative surface-exposed loops of the protein. PMID:8188390
Genome Sequences of Ilzat and Eleri, Two Phages Isolated Using Microbacterium foliorum NRRL B-24224

PubMed Central

Ali, Ilzat; Jones, Acacia Eleri; Mohamed, Aleem

2018-01-01

ABSTRACT Bacteriophages Ilzat and Eleri are newly isolated Siphoviridae infecting Microbacterium foliorum NRRL B-24224. The phage genomes are similar in length, G+C content, and architecture and share 62.9% nucleotide sequence identity. PMID:29650566
Preservation of protein clefts in comparative models.

PubMed

Piedra, David; Lois, Sergi; de la Cruz, Xavier

2008-01-16

Comparative, or homology, modelling of protein structures is the most widely used prediction method when the target protein has homologues of known structure. Given that the quality of a model may vary greatly, several studies have been devoted to identifying the factors that influence modelling results. These studies usually consider the protein as a whole, and only a few provide a separate discussion of the behaviour of biologically relevant features of the protein. Given the value of the latter for many applications, here we extended previous work by analysing the preservation of native protein clefts in homology models. We chose to examine clefts because of their role in protein function/structure, as they are usually the locus of protein-protein interactions, host the enzymes' active site, or, in the case of protein domains, can also be the locus of domain-domain interactions that lead to the structure of the whole protein. We studied how the largest cleft of a protein varies in comparative models. To this end, we analysed a set of 53507 homology models that cover the whole sequence identity range, with a special emphasis on medium and low similarities. More precisely we examined how cleft quality - measured using six complementary parameters related to both global shape and local atomic environment, depends on the sequence identity between target and template proteins. In addition to this general analysis, we also explored the impact of a number of factors on cleft quality, and found that the relationship between quality and sequence identity varies depending on cleft rank amongst the set of protein clefts (when ordered according to size), and number of aligned residues. We have examined cleft quality in homology models at a range of seq.id. levels. Our results provide a detailed view of how quality is affected by distinct parameters and thus may help the user of comparative modelling to determine the final quality and applicability of his/her cleft models. In addition, the large variability in model quality that we observed within each sequence bin, with good models present even at low sequence identities (between 20% and 30%), indicates that properly developed identification methods could be used to recover good cleft models in this sequence range.
Deep Sequencing Reveals a Divergent Ugandan cassava brown streak virus Isolate from Malawi

PubMed Central

Winter, Stephan; Mukasa, Settumba; Tairo, Fred; Sseruwagi, Peter; Ndunguru, Joseph; Duffy, Siobain

2017-01-01

ABSTRACT Illumina sequencing of RNA from a cassava cutting from northern Malawi produced a genome of Ugandan cassava brown streak virus (UCBSV-MW-NB7_2013). Sequence comparisons revealed stronger similarity to an isolate from nearby Tanzania (93.4% pairwise nucleotide identity) than to those previously reported from Malawi (86.9 to 87.0%). PMID:28818908
First Complete Genome Sequence of Suakwa aphid-borne yellows virus from East Timor

PubMed Central

Maina, Solomon; Edwards, Owain R.; de Almeida, Luis; Ximenes, Abel

2016-01-01

We present here the first complete genomic RNA sequence of the polerovirus Suakwa aphid-borne yellows virus (SABYV), from East Timor. The isolate sequenced came from a virus-infected pumpkin plant. The East Timorese genome had a nucleotide identity of 86.5% with the only other SABYV genome available, which is from Taiwan. PMID:27469955
Genome variability of foot-and-mouth disease virus during the short period of the 2010 epidemic in Japan.

PubMed

Nishi, Tatsuya; Yamada, Manabu; Fukai, Katsuhiko; Shimada, Nobuaki; Morioka, Kazuki; Yoshida, Kazuo; Sakamoto, Kenichi; Kanno, Toru; Yamakawa, Makoto

2017-02-01

Foot-and-mouth disease virus (FMDV) is highly contagious and has a high mutation rate, leading to extensive genetic variation. To investigate how FMDV genetically evolves over a short period of an epidemic after initial introduction into an FMD-free area, whole L-fragment sequences of 104 FMDVs isolated from the 2010 epidemic in Japan, which continued for less than three months were determined and phylogenetically and comparatively analyzed. Phylogenetic analysis of whole L-fragment sequences showed that these isolates were classified into a single group, indicating that FMDV was introduced into Japan in the epidemic via a single introduction. Nucleotide sequences of 104 virus isolates showed more than 99.56% pairwise identity rates without any genetic deletion or insertion, although no sequences were completely identical with each other. These results indicate that genetic substitutions of FMDV occurred gradually and constantly during the epidemic and generation of an extensive mutant virus could have been prevented by rapid eradication strategy. From comparative analysis of variability of each FMDV protein coding region, VP4 and 2C regions showed the highest average identity rates and invariant rates, and were confirmed as highly conserved. In contrast, the protein coding regions VP2 and VP1 were confirmed to be highly variable regions with the lowest average identity rates and invariant rates, respectively. Our data demonstrate the importance of rapid eradication strategy in an FMD epidemic and provide valuable information on the genome variability of FMDV during the short period of an epidemic. Copyright © 2016 The Authors. Published by Elsevier B.V. All rights reserved.
A nucleotide sequence comparison of coxsackievirus B4 isolates from aquatic samples and clinical specimens.

PubMed Central

Hughes, M. S.; Hoey, E. M.; Coyle, P. V.

1993-01-01

Ten coxsackievirus B4 (CVB4) strains isolated from clinical and environmental sources in Northern Ireland in 1985-7, were compared at the nucleotide sequence level. Dideoxynucleotide sequencing of a polymerase chain reaction (PCR) amplified fragment, spanning the VP1/P2A genomic region, classified the isolates into two distinct groups or genotypes as defined by Rico-Hesse and colleagues for poliovirus type 1. Isolates within each group shared approximately 99% sequence identity at the nucleotide level whereas < or = 86% sequence identity was shared between groups. One isolate derived from a clinical specimen in 1987 was grouped with six CVB4 isolates recovered from the aquatic environment in 1986-7. The second group comprised CVB4 isolates from clinical specimens in 1985-6. Both groups were different at the nucleotide level from the prototype strain isolated in 1950. It was concluded that the method could be used to sub-type CVB4 isolates and would be of value in epidemiological studies of CVB4. Predicted amino acid sequences revealed non-conservation of the tyrosine residue at the VP1/P2A cleavage site but were of little value in distinguishing CVB4 variants. PMID:8386098
High resolution identity testing of inactivated poliovirus vaccines.

PubMed

Mee, Edward T; Minor, Philip D; Martin, Javier

2015-07-09

Definitive identification of poliovirus strains in vaccines is essential for quality control, particularly where multiple wild-type and Sabin strains are produced in the same facility. Sequence-based identification provides the ultimate in identity testing and would offer several advantages over serological methods. We employed random RT-PCR and high throughput sequencing to recover full-length genome sequences from monovalent and trivalent poliovirus vaccine products at various stages of the manufacturing process. All expected strains were detected in previously characterised products and the method permitted identification of strains comprising as little as 0.1% of sequence reads. Highly similar Mahoney and Sabin 1 strains were readily discriminated on the basis of specific variant positions. Analysis of a product known to contain incorrect strains demonstrated that the method correctly identified the contaminants. Random RT-PCR and shotgun sequencing provided high resolution identification of vaccine components. In addition to the recovery of full-length genome sequences, the method could also be easily adapted to the characterisation of minor variant frequencies and distinction of closely related products on the basis of distinguishing consensus and low frequency polymorphisms. Copyright © 2015 The Authors. Published by Elsevier Ltd.. All rights reserved.
Molecular characterization of a novel luteovirus infecting apple by next-generation sequencing.

PubMed

Shen, Pan; Tian, Xin; Zhang, Song; Ren, Fang; Li, Ping; Yu, Yun-Qi; Li, Ruhui; Zhou, Changyong; Cao, Mengji

2018-03-01

A new single-stranded positive-sense RNA virus, which shares the highest nucleotide (nt) sequence identity of 53.4% with the genome sequence of cherry-associated luteovirus South Korean isolate (ChALV-SK, genus Luteovirus), was discovered in this work. It is provisionally named apple-associated luteovirus (AaLV). The complete genome sequence of AaLV comprises 5,890 nt and contains eight open reading frames (ORFs), in a very similar arrangement that is typical of members of the genus Luteovirus. When compared with other members of the family Luteoviridae, ORF1 of AaLV was found to encompass another ORF, ORF1a, which encodes a putative 32.9-kDa protein. The ORF1-ORF2 region (RNA-dependent RNA polymerase, RdRP) showed the greatest amino acid (aa) sequence identity (59.7%) to that of cherry-associated luteovirus Czech Republic isolate (ChALV-CZ, genus Luteovirus). The results of genome sequence comparisons and phylogenetic analysis, suggest that AaLV should be a member of a novel species in the genus Luteovirus. To our knowledge, it is the sixth member of the genus Luteovirus reported to naturally infect rosaceous plants.
Cloning and sequencing of the cDNA species for mammalian dimeric dihydrodiol dehydrogenases.

PubMed Central

Arimitsu, E; Aoki, S; Ishikura, S; Nakanishi, K; Matsuura, K; Hara, A

1999-01-01

Cynomolgus and Japanese monkey kidneys, dog and pig livers and rabbit lens contain dimeric dihydrodiol dehydrogenase (EC 1.3.1.20) associated with high carbonyl reductase activity. Here we have isolated cDNA species for the dimeric enzymes by reverse transcriptase-PCR from human intestine in addition to the above five animal tissues. The amino acid sequences deduced from the monkey, pig and dog cDNA species perfectly matched the partial sequences of peptides digested from the respective enzymes of these animal tissues, and active recombinant proteins were expressed in a bacterial system from the monkey and human cDNA species. Northern blot analysis revealed the existence of a single 1.3 kb mRNA species for the enzyme in these animal tissues. The human enzyme shared 94%, 85%, 84% and 82% amino acid identity with the enzymes of the two monkey strains (their sequences were identical), the dog, the pig and the rabbit respectively. The sequences of the primate enzymes consisted of 335 amino acid residues and lacked one amino acid compared with the other animal enzymes. In contrast with previous reports that other types of dihydrodiol dehydrogenase, carbonyl reductases and enzymes with either activity belong to the aldo-keto reductase family or the short-chain dehydrogenase/reductase family, dimeric dihydrodiol dehydrogenase showed no sequence similarity with the members of the two protein families. The dimeric enzyme aligned with low degrees of identity (14-25%) with several prokaryotic proteins, in which 47 residues are strictly or highly conserved. Thus dimeric dihydrodiol dehydrogenase has a primary structure distinct from the previously known mammalian enzymes and is suggested to constitute a novel protein family with the prokaryotic proteins. PMID:10477285
A proposal to rename the hyperthermophile Pyrococcus woesei as Pyrococcus furiosus subsp. woesei.

PubMed

Kanoksilapatham, Wirojne; González, Juan M; Maeder, Dennis L; DiRuggiero, Jocelyne; Robb, Frank T

2004-10-01

Pyrococcus species are hyperthermophilic members of the order Thermococcales, with optimal growth temperatures approaching 100 degrees C. All species grow heterotrophically and produce H2 or, in the presence of elemental sulfur (S(o)), H2S. Pyrococcus woesei and P. furiosus were isolated from marine sediments at the same Vulcano Island beach site and share many morphological and physiological characteristics. We report here that the rDNA operons of these strains have identical sequences, including their intergenic spacer regions and part of the 23S rRNA. Both species grow rapidly and produce H2 in the presence of 0.1% maltose and 10-100 microM sodium tungstate in S(o)-free medium. However, P. woesei shows more extensive autolysis than P. furiosus in the stationary phase. Pyrococcus furiosus and P. woesei share three closely related families of insertion sequences (ISs). A Southern blot performed with IS probes showed extensive colinearity between the genomes of P. woesei and P. furiosus. Cloning and sequencing of ISs that were in different contexts in P. woesei and P. furiosus revealed that the napA gene in P. woesei is disrupted by a type III IS element, whereas in P. furiosus, this gene is intact. A type I IS element, closely linked to the napA gene, was observed in the same context in both P. furiosus and P. woesei genomes. Our results suggest that the IS elements are implicated in genomic rearrangements and reshuffling in these closely related strains. We propose to rename P. woesei a subspecies of P. furiosus based on their identical rDNA operon sequences, many common IS elements that are shared genomic markers, and the observation that all P. woesei nucleotide sequences deposited in GenBank to date are > 99% identical to P. furiosus sequences.
Cloning and Characterization of the Lactococcal Plasmid-Encoded Type II Restriction/Modification System, LlaDII

PubMed Central

Madsen, Annette; Josephsen, Jytte

1998-01-01

The LlaDII restriction/modification (R/M) system was found on the naturally occurring 8.9-kb plasmid pHW393 in Lactococcus lactis subsp. cremoris W39. A 2.4-kb PstI-EcoRI fragment inserted into the Escherichia coli-L. lactis shuttle vector pCI3340 conferred to L. lactis LM2301 and L. lactis SMQ86 resistance against representatives of the three most common lactococcal phage species: 936, P335, and c2. The LlaDII endonuclease was partially purified and found to recognize and cleave the sequence 5′-GC↓NGC-3′, where the arrow indicates the cleavage site. It is thus an isoschizomer of the commercially available restriction endonuclease Fnu4HI. Sequencing of the 2.4-kb PstI-EcoRI fragment revealed two open reading frames arranged tandemly and separated by a 105-bp intergenic region. The endonuclease gene of 543 bp preceded the methylase gene of 954 bp. The deduced amino acid sequence of the LlaDII R/M system showed high homology to that of its only sequenced isoschizomer, Bsp6I from Bacillus sp. strain RFL6, with 41% identity between the endonucleases and 60% identity between the methylases. The genetic organizations of the LlaDII and Bsp6I R/M systems are identical. Both methylases have two recognition sites (5′-GCGGC-3′ and 5′-GCCGC-3′) forming a putative stem-loop structure spanning part of the presumed −35 sequence and part of the intervening region between the −35 and −10 sequences. Alignment of the LlaDII and Bsp6I methylases with other m5C methylases showed that the protein primary structures possessed the same organization. PMID:9647810
SH2-catalytic domain linker heterogeneity influences allosteric coupling across the SFK family.

PubMed

Register, A C; Leonard, Stephen E; Maly, Dustin J

2014-11-11

Src-family kinases (SFKs) make up a family of nine homologous multidomain tyrosine kinases whose misregulation is responsible for human disease (cancer, diabetes, inflammation, etc.). Despite overall sequence homology and identical domain architecture, differences in SH3 and SH2 regulatory domain accessibility and ability to allosterically autoinhibit the ATP-binding site have been observed for the prototypical SFKs Src and Hck. Biochemical and structural studies indicate that the SH2-catalytic domain (SH2-CD) linker, the intramolecular binding epitope for SFK SH3 domains, is responsible for allosterically coupling SH3 domain engagement to autoinhibition of the ATP-binding site through the conformation of the αC helix. As a relatively unconserved region between SFK family members, SH2-CD linker sequence variability across the SFK family is likely a source of nonredundant cellular functions between individual SFKs via its effect on the availability of SH3 and SH2 domains for intermolecular interactions and post-translational modification. Using a combination of SFKs engineered with enhanced or weakened regulatory domain intramolecular interactions and conformation-selective inhibitors that report αC helix conformation, this study explores how SH2-CD sequence heterogeneity affects allosteric coupling across the SFK family by examining Lyn, Fyn1, and Fyn2. Analyses of Fyn1 and Fyn2, isoforms that are identical but for a 50-residue sequence spanning the SH2-CD linker, demonstrate that SH2-CD linker sequence differences can have profound effects on allosteric coupling between otherwise identical kinases. Most notably, a dampened allosteric connection between the SH3 domain and αC helix leads to greater autoinhibitory phosphorylation by Csk, illustrating the complex effects of SH2-CD linker sequence on cellular function.
The human homolog of S. cerevisiae CDC27, CDC27 Hs, is encoded by a highly conserved intronless gene present in multiple copies in the human genome

DOE Office of Scientific and Technical Information (OSTI.GOV)

Devor, E.J.; Dill-Devor, R.M.

1994-09-01

We have obtained a number of unique sequences via PCR amplification of human genomic DNA using degenerate primers under low stringency (42{degrees}C). One of these, an 853 bp product, has been identified as a partial genomic sequence of the human homolog of the S. cerevisiae CDC27 gene, CDC27Hs (GenBank No. U00001). This gene, reported by Turgendreich et al. is also designated EST00556 from Adams et al. We have undertaken a more detailed examination of our sequence, MCP34N, and have found that: 1. the genomic sequence is nearly identical to CDC27Hs over its entire 853 bp length; 2. an MCP34N-specific PCRmore » assay of several non-human primate species reveals amplification products in chimpanzee and gorilla genomes having greater than 90% sequence identity with CDC27Hs; and 3. an MCP34N-specific PCR assay of the BIOS hybrid cell line panel gives a discordancy pattern suggesting multiple loci. Based upon these data, we present the following initial characterization: 1. the complete MCP34N sequence identity with CDC27Hs indicates that the latter is encoded by an intronless gene; 2. CDC27Hs is highly conserved among higher primates; and 3. CDC27Hs is present in multiple copies in the human genome. These characteristics, taken together with those initially reported for CDC27Hs, suggest that this is an old gene that carries out an important but, as yet, unknown function in the human brain.« less
Further delineation of nonhomologous-based recombination and evidence for subtelomeric segmental duplications in 1p36 rearrangements.

PubMed

D'Angelo, Carla S; Gajecka, Marzena; Kim, Chong A; Gentles, Andrew J; Glotzbach, Caron D; Shaffer, Lisa G; Koiffmann, Célia P

2009-06-01

The mechanisms involved in the formation of subtelomeric rearrangements are now beginning to be elucidated. Breakpoint sequencing analysis of 1p36 rearrangements has made important contributions to this line of inquiry. Despite the unique architecture of segmental duplications inherent to human subtelomeres, no common mechanism has been identified thus far and different nonexclusive recombination-repair mechanisms seem to predominate. In order to gain further insights into the mechanisms of chromosome breakage, repair, and stabilization mediating subtelomeric rearrangements in humans, we investigated the constitutional rearrangements of 1p36. Cloning of the breakpoint junctions in a complex rearrangement and three non-reciprocal translocations revealed similarities at the junctions, such as microhomology of up to three nucleotides, along with no significant sequence identity in close proximity to the breakpoint regions. All the breakpoints appeared to be unique and their occurrence was limited to non-repetitive, unique DNA sequences. Several recombination- or cleavage-associated motifs that may promote non-homologous recombination were observed in close proximity to the junctions. We conclude that NHEJ is likely the mechanism of DNA repair that generates these rearrangements. Additionally, two apparently pure terminal deletions were also investigated, and the refinement of the breakpoint regions identified two distinct genomic intervals ~25-kb apart, each containing a series of 1p36 specific segmental duplications with 90-98% identity. Segmental duplications can serve as substrates for ectopic homologous recombination or stimulate genomic rearrangements.
Clinical and microbiological parameters of naturally occurring periodontitis in the non-human primate Macaca mulatta

PubMed Central

Colombo, A. P. V.; Paster, B. J.; Grimaldi, G.; Lourenço, T. G. B.; Teva, A.; Campos-Neto, A.; McCluskey, J.; Kleanthous, H.; Van Dyke, T. E.; Stashenko, P.

2017-01-01

ABSTRACT Background: Non-human primates appear to represent the most faithful model of human disease, but to date the oral microbiome in macaques has not been fully characterized using next-generation sequencing. Objective: In the present study, we characterized the clinical and microbiological features of naturally occurring periodontitis in non-human primates (Macaca mulatta). Design: Clinical parameters of periodontitis including probing pocket depth (PD) and bleeding on probing (BOP) were measured in 40 adult macaques (7–22 yrs), at six sites per tooth. Subgingival plaque was collected from diseased and healthy sites, and subjected to 16S rDNA sequencing and identification at the species or higher taxon level. Results: All macaques had mild periodontitis at minimum, with numerous sites of PD ≥ 4 mm and BOP. A subset (14/40) had moderate-severe disease, with >2 sites with PD ≥ 5mm, deeper mean PD, and more BOP. Animals with mild vs moderate-severe disease were identical in age, suggesting genetic heterogeneity. 16S rDNA sequencing revealed that all macaques had species that were identical to those in humans or closely related to human counterparts, including Porphyromonas gingivalis which was present in all animals. Diseased and healthy sites harboured distinct microbiomes; however there were no significant differences in the microbiomes in moderate-severe vs. mild periodontitis. Conclusions: Naturally occurring periodontitis in older macaques closely resembles human adult periodontitis, thus validating a useful model to evaluate novel anti-microbial therapies. PMID:29805776
Dog leukocyte antigen class II-associated genetic risk testing for immune disorders of dogs: simplified approaches using Pug dog necrotizing meningoencephalitis as a model.

PubMed

Pedersen, Niels; Liu, Hongwei; Millon, Lee; Greer, Kimberly

2011-01-01

A significantly increased risk for a number of autoimmune and infectious diseases in purebred and mixed-breed dogs has been associated with certain alleles or allele combinations of the dog leukocyte antigen (DLA) class II complex containing the DRB1, DQA1, and DQB1 genes. The exact level of risk depends on the specific disease, the alleles in question, and whether alleles exist in a homozygous or heterozygous state. The gold standard for identifying high-risk alleles and their zygosity has involved direct sequencing of the exon 2 regions of each of the 3 genes. However, sequencing and identification of specific alleles at each of the 3 loci are relatively expensive and sequencing techniques are not ideal for additional parentage or identity determination. However, it is often possible to get the same information from sequencing only 1 gene given the small number of possible alleles at each locus in purebred dogs, extensive homozygosity, and tendency for disease-causing alleles at each of the 3 loci to be strongly linked to each other into haplotypes. Therefore, genetic testing in purebred dogs with immune diseases can be often simplified by sequencing alleles at 1 rather than 3 loci. Further simplification of genetic tests for canine immune diseases can be achieved by the use of alternative genetic markers in the DLA class II region that are also strongly linked with the disease genotype. These markers consist of either simple tandem repeats or single nucleotide polymorphisms that are also in strong linkage with specific DLA class II genotypes and/or haplotypes. The current study uses necrotizing meningoencephalitis of Pug dogs as a paradigm to assess simple alternative genetic tests for disease risk. It was possible to attain identical necrotizing meningoencephalitis risk assessments to 3-locus DLA class II sequencing by sequencing only the DQB1 gene, using 3 DLA class II-linked simple tandem repeat markers, or with a small single nucleotide polymorphism array designed to identify breed-specific DQB1 alleles.
Molecular detection and genetic characterization of Babesia, Theileria and Anaplasma amongst apparently healthy sheep and goats in the central region of Turkey.

PubMed

Zhou, Mo; Cao, Shinuo; Sevinc, Ferda; Sevinc, Mutlu; Ceylan, Onur; Ekici, Sepil; Jirapattharasate, Charoonluk; Moumouni, Paul Franck Adjou; Liu, Mingming; Wang, Guanbo; Iguchi, Aiko; Vudriko, Patrick; Suzuki, Hiroshi; Xuan, Xuenan

2017-02-01

Babesia spp., Theileria spp. and Anaplasma spp. are significant tick-borne pathogens of livestock globally. In this study, we investigated the presence and distribution of Babesia ovis, Theileria ovis and Anaplasma ovis in 343 small ruminants (249 sheep and 94 goats) from 13 towns in the Central Anatolia region of Turkey using species-specific PCR assays. The PCR were conducted using the primers based on the B. ovis ssu rRNA (BoSSUrRNA), T. ovis ssu rRNA (ToSSUrRNA) and A. ovis major surface protein 4 (AoMSP4) genes, respectively. Fragments of these genes were sequenced for phylogenetic analysis. PCR results revealed that the overall infections of A. ovis, T. ovis and B. ovis were 60.0%, 35.9% and 5.2%, respectively. Co-infection of the animals with two or three pathogens was detected in 105/343 (30.6%) of the ovine samples. The results of sequence analysis indicated that AoMSP4 were conserved among the Turkish samples, with 100% sequence identity values. In contrast, the BoSSUrRNA and ToSSUrRNA gene sequences were relatively diverse with identity values of 98.54%-99.82% and 99.23%-99.81%, respectively. Phylograms were inferred based on the BoSSUrRNA, ToSSUrRNA and AoMSP4 sequences obtained in this study and those from previous studies. B. ovis isolates from Turkey were found in the same clade as the isolates from other countries in phylogenetic analysis. On the other hand, the Turkish T. ovis isolates in the present study formed a monophyletic grouping with the isolates from other countries in a phylogeny based on ToSSUrRNA sequences. Furthermore, phylogenetic analysis using AoMSP4 sequences showed the presence of three genotypes of A. ovis. This study provides important data for understanding the epidemiology of tick-borne diseases in small ruminants and the degree of genetic heterogeneities among these pathogens in Turkey. To our knowledge, this is the first study on the co-infection of Babesia, Theileria and Anaplasma in sheep and goats in Turkey. Copyright © 2016 Elsevier GmbH. All rights reserved.
Identification of a monopartite begomovirus associated with yellow vein mosaic of Mentha longifolia in Saudi Arabia.

PubMed

Sohrab, Sayed Sartaj; Daur, Ihsanullah

2018-02-01

Mentha is a very important crop grown and used extensively for many purposes in the Kingdom of Saudi Arabia. Begomoviruses are whitefly-transmitted viruses causing serious disease in many important plants exhibiting variable symptoms with significant economic loss globally. During farmers' field survey, yellow vein mosaic disease was observed in Mentha longifolia plants growing near tomato fields in Saudi Arabia. The causative agent was identified in 11 out of 19 samples using begomovirus-specific primers and the association of begomovirus with yellow vein mosaic disease in M. longifolia was confirmed. The full-length viral genome and betasatellite were amplified, cloned, and sequenced bidirectionally. The full DNA-A genome was found to have 2785 nucleotides with 1365 bp-associated betasatellite molecule. An attempt was made to amplify DNA-B, but none of the samples produced any positive amplicon of expected size which indicated the presence of monopartite begomovirus. The sequence identity matrix and phylogenetic analysis, based on full genome showed the highest identity (99.6%) with Tomato yellow leaf curl virus (TYLCV) and in phylogenetic analysis it formed a closed cluster with Tomato leaf curl virus infecting tomato and Corchorus crop in Saudi Arabia. The sequence analysis results of betasatellites showed the highest identity (98.9%) with Tomato yellow leaf curl betasatellites infecting tomato and phylogenetic analysis using betasatellites formed a close cluster with Tomato yellow leaf curl betasatellites infecting tomato and Corchorus crops, which has already been reported to cause yellow vein mosaic and leaf curl disease in many cultivated and weed crops growing in Saudi Arabia. The identified begomovirus associated with yellow vein mosaic disease in mentha could be a mutated strain of TYLCV and tentatively designated as TYLCV-Mentha isolate. Based on published data and latest information, this is the first report of identification of Tomato yellow leaf curl virus associated with yellow vein mosaic disease of M. longifolia from Saudi Arabia.
Structure-Function, Stability, and Chemical Modification of the Cyanobacterial Cytochrome b6f Complex from Nostoc sp. PCC 7120*

PubMed Central

Baniulis, Danas; Yamashita, Eiki; Whitelegge, Julian P.; Zatsman, Anna I.; Hendrich, Michael P.; Hasan, S. Saif; Ryan, Christopher M.; Cramer, William A.

2009-01-01

The crystal structure of the cyanobacterial cytochrome b6f complex has previously been solved to 3.0-Å resolution using the thermophilic Mastigocladus laminosus whose genome has not been sequenced. Several unicellular cyanobacteria, whose genomes have been sequenced and are tractable for mutagenesis, do not yield b6f complex in an intact dimeric state with significant electron transport activity. The genome of Nostoc sp. PCC 7120 has been sequenced and is closer phylogenetically to M. laminosus than are unicellular cyanobacteria. The amino acid sequences of the large core subunits and four small peripheral subunits of Nostoc are 88 and 80% identical to those in the M. laminosus b6f complex. Purified b6f complex from Nostoc has a stable dimeric structure, eight subunits with masses similar to those of M. laminosus, and comparable electron transport activity. The crystal structure of the native b6f complex, determined to a resolution of 3.0Å (PDB id: 2ZT9), is almost identical to that of M. laminosus. Two unique aspects of the Nostoc complex are: (i) a dominant conformation of heme bp that is rotated 180° about the α- and γ-meso carbon axis relative to the orientation in the M. laminosus complex and (ii) acetylation of the Rieske iron-sulfur protein (PetC) at the N terminus, a post-translational modification unprecedented in cyanobacterial membrane and electron transport proteins, and in polypeptides of cytochrome bc complexes from any source. The high spin electronic character of the unique heme cn is similar to that previously found in the b6f complex from other sources. PMID:19189962

Experimental evidence for the ancestry of allotetraploid Trifolium repens and creation of synthetic forms with value for plant breeding.

PubMed

Williams, Warren M; Ellison, Nicholas W; Ansari, Helal A; Verry, Isabelle M; Hussain, S Wajid

2012-04-24

White clover (Trifolium repens) is a ubiquitous weed of the temperate world that through use of improved cultivars has also become the most important legume of grazed pastures world-wide. It has long been suspected to be allotetraploid, but the diploid ancestral species have remained elusive. Putative diploid ancestors were indicated by DNA sequence phylogeny to be T. pallescens and T. occidentale. Here, we use further DNA evidence as well as a combination of molecular cytogenetics (FISH and GISH) and experimental hybridization to test the hypothesis that white clover originated as a hybrid between T. pallescens and T. occidentale. T. pallescens plants were identified with chloroplast trnL intron DNA sequences identical to those of white clover. Similarly, T. occidentale plants with nuclear ITS sequences identical to white clover were also identified. Reciprocal GISH experiments, alternately using labeled genomic DNA probes from each of the putative ancestral species on the same white clover cells, showed that half of the chromosomes hybridized with each probe. F1 hybrids were generated by embryo rescue and these showed strong interspecific chromosome pairing and produced a significant frequency of unreduced gametes, indicating the likely mode of polyploidization. The F1 hybrids are inter-fertile with white clover and function as synthetic white clovers, a valuable new resource for the re-incorporation of ancestral genomes into modern white clover for future plant breeding. Evidence from DNA sequence analyses, molecular cytogenetics, interspecific hybridization and breeding experiments supports the hypothesis that a diploid alpine species (T. pallescens) hybridized with a diploid coastal species (T. occidentale) to generate tetraploid T. repens. The coming together of these two narrowly adapted species (one alpine and the other maritime), along with allotetraploidy, has led to a transgressive hybrid with a broad adaptive range.
The Impact of Mutation and Gene Conversion on the Local Diversification of Antigen Genes in African Trypanosomes

PubMed Central

Gjini, Erida; Haydon, Daniel T.; Barry, J. David; Cobbold, Christina A.

2012-01-01

Patterns of genetic diversity in parasite antigen gene families hold important information about their potential to generate antigenic variation within and between hosts. The evolution of such gene families is typically driven by gene duplication, followed by point mutation and gene conversion. There is great interest in estimating the rates of these processes from molecular sequences for understanding the evolution of the pathogen and its significance for infection processes. In this study, a series of models are constructed to investigate hypotheses about the nucleotide diversity patterns between closely related gene sequences from the antigen gene archive of the African trypanosome, the protozoan parasite causative of human sleeping sickness in Equatorial Africa. We use a hidden Markov model approach to identify two scales of diversification: clustering of sequence mismatches, a putative indicator of gene conversion events with other lower-identity donor genes in the archive, and at a sparser scale, isolated mismatches, likely arising from independent point mutations. In addition to quantifying the respective probabilities of occurrence of these two processes, our approach yields estimates for the gene conversion tract length distribution and the average diversity contributed locally by conversion events. Model fitting is conducted using a Bayesian framework. We find that diversifying gene conversion events with lower-identity partners occur at least five times less frequently than point mutations on variant surface glycoprotein (VSG) pairs, and the average imported conversion tract is between 14 and 25 nucleotides long. However, because of the high diversity introduced by gene conversion, the two processes have almost equal impact on the per-nucleotide rate of sequence diversification between VSG subfamily members. We are able to disentangle the most likely locations of point mutations and conversions on each aligned gene pair. PMID:22735079
Prevalence of Theileria and Babesia species in Tunisian sheep.

PubMed

Rjeibi, Mohamed R; Darghouth, Mohamed A; Gharbi, Mohamed

2016-05-24

In this study, the prevalence of Theileria and Babesia species in sheep was assessed with Giemsastained blood smear examination and polymerase chain reaction to identify the different piroplasms in 270 sheep from three Tunisian bioclimatic zones (north, centre, and south). The overall infection prevalence by Babesia spp. and Theileria spp. in Giemsa-stained blood smears was 2.9% (8/270) and 4.8% (13/270) respectively. The molecular results showed that sheep were more often infected by Theileria ovis than Babesia ovis with an overall prevalence of 16.3% (44/270) and 7.8% (21/270) respectively (p = 0.01). The molecular prevalence by Babesia ovis was significantly higher in females than in males (p < 0.05). According to localities B. ovis was found exclusively in sheep from the centre of Tunisia (Kairouan) whereas Theileria ovis was found in all regions. Infections with T. ovis and B. ovis were confirmed by sequencing. The sequence of T. ovis in this study (accession numbers KM924442) falls into the same clade as T. ovis deposited in GenBank. The T. ovis amplicons (KM924442) showed 99%-100% identities with GenBank sequences. Moreover, comparison of the partial sequences of 18S rRNA gene of B. ovis described in this study (KP670199) revealed 99.4% similarity with B. ovis recently reported in northern Tunisia from sheep and goats. Three nucleotides were different at positions 73 (A/T), 417 (A/T), and 420 (G/T). It also had 99% identity with B. ovis from Spain, Turkey and Iraq. The results suggest a high T. ovis prevalence in Tunisia with a decreasing north-south gradient. This could be correlated to the vector tick distribution.
Structure-Function, Stability, and Chemical Modification of the Cyanobacterial Cytochrome b[subscript 6]f Complex from Nostoc sp. PCC 7120

DOE Office of Scientific and Technical Information (OSTI.GOV)

Baniulis, Danas; Yamashita, Eiki; Whitelegge, Julian P.

2009-06-08

The crystal structure of the cyanobacterial cytochrome b{sub 6}f complex has previously been solved to 3.0-{angstrom} resolution using the thermophilic Mastigocladus laminosus whose genome has not been sequenced. Several unicellular cyanobacteria, whose genomes have been sequenced and are tractable for mutagenesis, do not yield b{sub 6}f complex in an intact dimeric state with significant electron transport activity. The genome of Nostoc sp. PCC 7120 has been sequenced and is closer phylogenetically to M. laminosus than are unicellular cyanobacteria. The amino acid sequences of the large core subunits and four small peripheral subunits of Nostoc are 88 and 80% identical tomore » those in the M. laminosus b{sub 6}f complex. Purified b{sub 6}f complex from Nostoc has a stable dimeric structure, eight subunits with masses similar to those of M. laminosus, and comparable electron transport activity. The crystal structure of the native b{sub 6}f complex, determined to a resolution of 3.0{angstrom} (PDB id: 2ZT9), is almost identical to that of M. laminosus. Two unique aspects of the Nostoc complex are: (i) a dominant conformation of heme b{sub p} that is rotated 180 deg. about the {alpha}- and {gamma}-meso carbon axis relative to the orientation in the M. laminosus complex and (ii) acetylation of the Rieske iron-sulfur protein (PetC) at the N terminus, a post-translational modification unprecedented in cyanobacterial membrane and electron transport proteins, and in polypeptides of cytochrome bc complexes from any source. The high spin electronic character of the unique heme cn is similar to that previously found in the b{sub 6}f complex from other sources.« less
The contribution of alu elements to mutagenic DNA double-strand break repair.

PubMed

Morales, Maria E; White, Travis B; Streva, Vincent A; DeFreece, Cecily B; Hedges, Dale J; Deininger, Prescott L

2015-03-01

Alu elements make up the largest family of human mobile elements, numbering 1.1 million copies and comprising 11% of the human genome. As a consequence of evolution and genetic drift, Alu elements of various sequence divergence exist throughout the human genome. Alu/Alu recombination has been shown to cause approximately 0.5% of new human genetic diseases and contribute to extensive genomic structural variation. To begin understanding the molecular mechanisms leading to these rearrangements in mammalian cells, we constructed Alu/Alu recombination reporter cell lines containing Alu elements ranging in sequence divergence from 0%-30% that allow detection of both Alu/Alu recombination and large non-homologous end joining (NHEJ) deletions that range from 1.0 to 1.9 kb in size. Introduction of as little as 0.7% sequence divergence between Alu elements resulted in a significant reduction in recombination, which indicates even small degrees of sequence divergence reduce the efficiency of homology-directed DNA double-strand break (DSB) repair. Further reduction in recombination was observed in a sequence divergence-dependent manner for diverged Alu/Alu recombination constructs with up to 10% sequence divergence. With greater levels of sequence divergence (15%-30%), we observed a significant increase in DSB repair due to a shift from Alu/Alu recombination to variable-length NHEJ which removes sequence between the two Alu elements. This increase in NHEJ deletions depends on the presence of Alu sequence homeology (similar but not identical sequences). Analysis of recombination products revealed that Alu/Alu recombination junctions occur more frequently in the first 100 bp of the Alu element within our reporter assay, just as they do in genomic Alu/Alu recombination events. This is the first extensive study characterizing the influence of Alu element sequence divergence on DNA repair, which will inform predictions regarding the effect of Alu element sequence divergence on both the rate and nature of DNA repair events.
Dissimilation in the Second Language Acquisition of Mandarin Chinese Tones

ERIC Educational Resources Information Center

Zhang, Hang

2016-01-01

This article extends Optimality Theoretic studies to the research on second language tone phonology. Specifically, this work analyses the acquisition of identical tone sequences in Mandarin Chinese by adult speakers of three non-tonal languages: English, Japanese and Korean. This study finds that the learners prefer not to use identical lexical…
Transitive homology-guided structural studies lead to discovery of Cro proteins with 40% sequence identity but different folds

PubMed Central

Roessler, Christian G.; Hall, Branwen M.; Anderson, William J.; Ingram, Wendy M.; Roberts, Sue A.; Montfort, William R.; Cordes, Matthew H. J.

2008-01-01

Proteins that share common ancestry may differ in structure and function because of divergent evolution of their amino acid sequences. For a typical diverse protein superfamily, the properties of a few scattered members are known from experiment. A satisfying picture of functional and structural evolution in relation to sequence changes, however, may require characterization of a larger, well chosen subset. Here, we employ a “stepping-stone” method, based on transitive homology, to target sequences intermediate between two related proteins with known divergent properties. We apply the approach to the question of how new protein folds can evolve from preexisting folds and, in particular, to an evolutionary change in secondary structure and oligomeric state in the Cro family of bacteriophage transcription factors, initially identified by sequence-structure comparison of distant homologs from phages P22 and λ. We report crystal structures of two Cro proteins, Xfaso 1 and Pfl 6, with sequences intermediate between those of P22 and λ. The domains show 40% sequence identity but differ by switching of α-helix to β-sheet in a C-terminal region spanning ≈25 residues. Sedimentation analysis also suggests a correlation between helix-to-sheet conversion and strengthened dimerization. PMID:18227506
Towards the Rational Design of a Candidate Vaccine against Pregnancy Associated Malaria: Conserved Sequences of the DBL6ε Domain of VAR2CSA

PubMed Central

Badaut, Cyril; Bertin, Gwladys; Rustico, Tatiana; Fievet, Nadine; Massougbodji, Achille; Gaye, Alioune; Deloron, Philippe

2010-01-01

Background Placental malaria is a disease linked to the sequestration of Plasmodium falciparum infected red blood cells (IRBC) in the placenta, leading to reduced materno-fetal exchanges and to local inflammation. One of the virulence factors of P. falciparum involved in cytoadherence to chondroitin sulfate A, its placental receptor, is the adhesive protein VAR2CSA. Its localisation on the surface of IRBC makes it accessible to the immune system. VAR2CSA contains six DBL domains. The DBL6ε domain is the most variable. High variability constitutes a means for the parasite to evade the host immune response. The DBL6ε domain could constitute a very attractive basis for a vaccine candidate but its reported variability necessitates, for antigenic characterisations, identifying and classifying commonalities across isolates. Methodology/Principal Findings Local alignment analysis of the DBL6ε domain had revealed that it is not as variable as previously described. Variability is concentrated in seven regions present on the surface of the DBL6ε domain. The main goal of our work is to classify and group variable sequences that will simplify further research to determine dominant epitopes. Firstly, variable sequences were grouped following their average percent pairwise identity (APPI). Groups comprising many variable sequences sharing low variability were found. Secondly, ELISA experiments following the IgG recognition of a recombinant DBL6ε domain, and of peptides mimicking its seven variable blocks, allowed to determine an APPI cut-off and to isolate groups represented by a single consensus sequence. Conclusions/Significance A new sequence approach is used to compare variable regions in sequences that have extensive segmental gene relationship. Using this approach, the VAR2CSA DBL6 domain is composed of 7 variable blocks with limited polymorphism. Each variable block is composed of a limited number of consensus types. Based on peptide based ELISA, variable blocks with 85% or greater sequence identity are expected to be recognized equally well by antibody and can be considered the same consensus type. Therefore, the analysis of the antibody response against the classified small number of sequences should be helpful to determine epitopes. PMID:20585655
Phylogeny of the Defined Murine Microbiota: Altered Schaedler Flora

PubMed Central

Dewhirst, Floyd E.; Chien, Chih-Ching; Paster, Bruce J.; Ericson, Rebecca L.; Orcutt, Roger P.; Schauer, David B.; Fox, James G.

1999-01-01

The “altered Schaedler flora” (ASF) was developed for colonizing germfree rodents with a standardized microbiota. The purpose of this study was to identify each of the eight ASF strains by 16S rRNA sequence analysis. Three strains were previously identified as Lactobacillus acidophilus (strain ASF 360), Lactobacillus salivarius (strain ASF 361), and Bacteroides distasonis (strain ASF 519) based on phenotypic criteria. 16S rRNA analysis indicated that each of the strains differed from its presumptive identity. The 16S rRNA sequence of strain ASF 361 is essentially identical to the 16S rRNA sequences of the type strains of Lactobacillus murinis and Lactobacillus animalis (both isolated from mice), and all of these strains probably belong to a single species. Strain ASF 360 is a novel lactobacillus that clusters with L. acidophilus and Lactobacillus lactis. Strain ASF 519 falls into an unnamed genus containing [Bacteroides] distasonis, [Bacteroides] merdae, [Bacteroides] forsythus, and CDC group DF-3. This unnamed genus is in the Cytophaga-Flavobacterium-Bacteroides phylum and is most closely related to the genus Porphyromonas. The spiral-shaped strain, strain ASF 457, is in the Flexistipes phylum and exhibits sequence identity with rodent isolates of Robertson. The remaining four ASF strains, which are extremely oxygen-sensitive fusiform bacteria, group phylogenetically with the low-G+C-content gram-positive bacteria (Firmicutes, Bacillus-Clostridium group). ASF 356, ASF 492, and ASF 502 fall into Clostridium cluster XIV of Collins et al. Morphologically, ASF 492 resembles members of this cluster, Roseburia cecicola, and Eubacterium plexicaudatum. The 16S rRNA sequence of ASF 492 is identical to that of E. plexicaudatum. Since the type strain and other viable original isolates of E. plexicaudatum have been lost, strain ASF 492 is a candidate for a neotype strain. Strain ASF 500 branches deeply in the low-G+C-content gram-positive phylogenetic tree but is not closely related to any organisms whose 16S rRNA sequences are currently in the GenBank database. The 16S rRNA sequence information determined in the present study should allow rapid identification of ASF strains and should permit detailed analysis of the interactions of ASF organisms during development of intestinal disease in mice that are coinfected with a variety of pathogenic microorganisms. PMID:10427008
Isolation and characterization of adrenoleukodystrophy protein (ALDP) related sequences in the human genome

DOE Office of Scientific and Technical Information (OSTI.GOV)

Geraghty, M.T.; Stetten, G.; Kearns, W.

1994-09-01

X-linked adrenoleukodystrophy (ALD) is a disorder of peroxisomal {beta}-oxidation of very long chain fatty acids. It presents either as progressive dementia in childhood or as progressive paraparesis in later years. Adrenal insufficiency occurs in both phenotypes. The gene of the ALD protein has been mapped to Xq28 and has recently been cloned and characterized. The ALD protein has significant homology to the peroxisomal membrane protein, PMP70 and belongs to the ATP binding cassette superfamily of transporters. We screened a human genomic library with an ALDP cDNA and isolated 5 different but highly similar clones containing sequences corresponding to the 3{prime}more » end of the ALDP gene. Comparison of the sequences over the region corresponding to exon 9 through the 3{prime} end of the ALDP gene reveals {approximately}96% nucleotide identity in both exonic and intronic regions. Splice sites and open reading frames are maintained. Using both FISH and human-rodent DNA mapping panels, we positively assign these ALDP-related sequences to chromosomes 2, 16 and 22, and provisionally to 1 and 20. Southern blot of primate DNA probed with a partial ALDP cDNA (exon 2-10) shows that expansion of ALDP-related sequences occurred in higher primates (chimp, gorilla and human). Although Northern blots show multiple ALDP-hybridizing transcripts in certain tissues, we have no evidence to date for expression of these ALDP-related sequences. In conclusion, our data show there has been an unusual and recent dispersal to multiple chromosomes of structural gene sequences related to the ALDP gene. The functional significance of these sequences remains to be determined but their existence complicates PCR and mutation analysis of the ALDP gene.« less
Epidemic Keratoconjunctivitis Due to the Novel Hexon-Chimeric-Intermediate 22,37/H8 Human Adenovirus ▿

PubMed Central

Aoki, Koki; Ishiko, Hiroaki; Konno, Tsunetada; Shimada, Yasushi; Hayashi, Akio; Kaneko, Hisatoshi; Ohguchi, Takeshi; Tagawa, Yoshitsugu; Ohno, Shigeaki; Yamazaki, Shudo

2008-01-01

In a 2-month period in 2003, we encountered an outbreak of epidemic keratoconjunctivitis (EKC) in Japan. We detected 67 human adenoviruses (HAdVs) by PCR from eye swabs of patients with EKC at five eye clinics in different parts of Japan. Forty-one of the 67 HAdV DNAs from the swabs were identified as HAdV-37 by phylogenetic analysis using a partial hexon gene sequence. When the restriction patterns of these viral genomes were compared with that of the HAdV-37 prototype strain, one isolate showed a never-before-seen restriction pattern. Within 1 year, we encountered three more EKC cases caused by a genetically identical virus: two nosocomial infections at two different university hospitals and a sporadic infection at an eye clinic. We determined the nucleotide sequences of the full-length hexon and fiber genes of these isolates and compared them to those of the 51 prototype strains. Surprisingly, the sequence of the hexon (ɛ determinant) loop-1 and -2 regions showed the highest nucleotide identity with HAdV-22, a rare EKC isolate. However, the nucleotide sequence of the fiber gene was identical to that of the HAdV-8 prototype strain. 22 We propose that this virus is a new hexon-chimeric intermediate HAdV-22,37/H8, and may be an etiological agent of EKC. PMID:18701656
Phylogenetic analyses indicate little variation among reticuloendotheliosis viruses infecting avian species, including the endangered Attwater's prairie chicken.

PubMed

Bohls, Ryan L; Linares, Jose A; Gross, Shannon L; Ferro, Pam J; Silvy, Nova J; Collisson, Ellen W

2006-08-01

Reticuloendotheliosis virus infection, which typically causes systemic lymphomas and high mortality in the endangered Attwater's prairie chicken, has been described as a major obstacle in repopulation efforts of captive breeding facilities in Texas. Although antigenic relationships among reticuloendotheliosis virus (REV) strains have been previously determined, phylogenetic relationships have not been reported. The pol and env of REV proviral DNA from prairie chickens (PC-R92 and PC-2404), from poxvirus lesions in domestic chickens, the prototype poultry derived REV-A and chick syncytial virus (CSV), and duck derived spleen necrosis virus (SNV) were PCR amplified and sequenced. The 5032bp, that included the pol and most of env genes, of the PC-R92 and REV-A were 98% identical, and nucleotide sequence identities of smaller regions within the pol and env from REV strains examined ranged from 95 to 99% and 93 to 99%, respectively. The putative amino acid sequences were 97-99% identical in the polymerase and 90-98% in the envelope. Phylogenetic analyses of the nucleotide and amino acid sequences indicated the closest relationship among the recent fowl pox-associated chicken isolates, the prairie chicken isolates and the prototype CSV while only the SNV appeared to be distinctly divergent. While the origin of the naturally occurring viruses is not known, the avian poxvirus may be a critical component of transmission of these ubiquitous oncogenic viruses.
Complete genome sequences of cowpea polerovirus 1 and cowpea polerovirus 2 infecting cowpea plants in Burkina Faso.

PubMed

Palanga, Essowè; Martin, Darren P; Galzi, Serge; Zabré, Jean; Bouda, Zakaria; Neya, James Bouma; Sawadogo, Mahamadou; Traore, Oumar; Peterschmitt, Michel; Roumagnac, Philippe; Filloux, Denis

2017-07-01

The full-length genome sequences of two novel poleroviruses found infecting cowpea plants, cowpea polerovirus 1 (CPPV1) and cowpea polerovirus 2 (CPPV2), were determined using overlapping RT-PCR and RACE-PCR. Whereas the 5845-nt CPPV1 genome was most similar to chickpea chlorotic stunt virus (73% identity), the 5945-nt CPPV2 genome was most similar to phasey bean mild yellow virus (86% identity). The CPPV1 and CPPV2 genomes both have a typical polerovirus genome organization. Phylogenetic analysis of the inferred P1-P2 and P3 amino acid sequences confirmed that CPPV1 and CPPV2 are indeed poleroviruses. Four apparently unique recombination events were detected within a dataset of 12 full polerovirus genome sequences, including two events in the CPPV2 genome. Based on the current species demarcation criteria for the family Luteoviridae, we tentatively propose that CPPV1 and CPPV2 should be considered members of novel polerovirus species.
Primary structures of ribosomal proteins from the archaebacterium Halobacterium marismortui and the eubacterium Bacillus stearothermophilus.

PubMed

Arndt, E; Scholzen, T; Krömer, W; Hatakeyama, T; Kimura, M

1991-06-01

Approximately 40 ribosomal proteins from each Halobacterium marismortui and Bacillus stearothermophilus have been sequenced either by direct protein sequence analysis or by DNA sequence analysis of the appropriate genes. The comparison of the amino acid sequences from the archaebacterium H marismortui with the available ribosomal proteins from the eubacterial and eukaryotic kingdoms revealed four different groups of proteins: 24 proteins are related to both eubacterial as well as eukaryotic proteins. Eleven proteins are exclusively related to eukaryotic counterparts. For three proteins only eubacterial relatives-and for another three proteins no counterpart-could be found. The similarities of the halobacterial ribosomal proteins are in general somewhat higher to their eukaryotic than to their eubacterial counterparts. The comparison of B stearothermophilus proteins with their E coli homologues showed that the proteins evolved at different rates. Some proteins are highly conserved with 64-76% identity, others are poorly conserved with only 25-34% identical amino acid residues.
Evidence for Elizabethkingia anophelis transmission from mother to infant, Hong Kong.

PubMed

Lau, Susanna K P; Wu, Alan K L; Teng, Jade L L; Tse, Herman; Curreem, Shirly O T; Tsui, Stephen K W; Huang, Yi; Chen, Jonathan H K; Lee, Rodney A; Yuen, Kwok-Yung; Woo, Patrick C Y

2015-02-01

Elizabethkingia anophelis, recently discovered from mosquito gut, is an emerging bacterium associated with neonatal meningitis and nosocomial outbreaks. However, its transmission route remains unknown. We use rapid genome sequencing to investigate 3 cases of E. anophelis sepsis involving 2 neonates who had meningitis and 1 neonate's mother who had chorioamnionitis. Comparative genomics revealed evidence for perinatal vertical transmission from a mother to her neonate; the 2 isolates from these patients, HKU37 and HKU38, shared essentially identical genome sequences. In contrast, the strain from another neonate (HKU36) was genetically divergent, showing only 78.6% genome sequence identity to HKU37 and HKU38, thus excluding a clonal outbreak. Comparison to genomes from mosquito strains revealed potential metabolic adaptations in E. anophelis under different environments. Maternal infection, not mosquitoes, is most likely the source of neonatal E. anophelis infections. Our findings highlight the power of genome sequencing in gaining rapid insights on transmission and pathogenesis of emerging pathogens.
Cloning and High-Level Expression of α-Galactosidase cDNA from Penicillium purpurogenum

PubMed Central

Shibuya, Hajime; Nagasaki, Hiroaki; Kaneko, Satoshi; Yoshida, Shigeki; Park, Gwi Gun; Kusakabe, Isao; Kobayashi, Hideyuki

1998-01-01

The cDNA coding for Penicillium purpurogenum α-galactosidase (αGal) was cloned and sequenced. The deduced amino acid sequence of the α-Gal cDNA showed that the mature enzyme consisted of 419 amino acid residues with a molecular mass of 46,334 Da. The derived amino acid sequence of the enzyme showed similarity to eukaryotic αGals from plants, animals, yeasts, and filamentous fungi. The highest similarity observed (57% identity) was to Trichoderma reesei AGLI. The cDNA was expressed in Saccharomyces cerevisiae under the control of the yeast GAL10 promoter. Almost all of the enzyme produced was secreted into the culture medium, and the expression level reached was approximately 0.2 g/liter. The recombinant enzyme purified to homogeneity was highly glycosylated, showed slightly higher specific activity, and exhibited properties almost identical to those of the native enzyme from P. purpurogenum in terms of the N-terminal amino acid sequence, thermoactivity, pH profile, and mode of action on galacto-oligosaccharides. PMID:9797312
Short communication: Conservation of Streptococcus uberis adhesion molecule and the sua gene in strains of Streptococcus uberis isolated from geographically diverse areas.

PubMed

Yuan, Ying; Dego, Oudessa Kerro; Chen, Xueyan; Abadin, Eurife; Chan, Shangfeng; Jory, Lauren; Kovacevic, Steven; Almeida, Raul A; Oliver, Stephen P

2014-12-01

The objective was to identify and sequence the sua gene (GenBank no. DQ232760; http://www.ncbi.nlm.nih.gov/genbank/) and detect Streptococcus uberis adhesion molecule (SUAM) expression by Western blot using serum from naturally S. uberis-infected cows in strains of S. uberis isolated in milk from cows with mastitis from geographically diverse areas of the world. All strains evaluated yielded a 4.4-kb sua-containing PCR fragment that was subsequently sequenced. Deduced SUAM AA sequences from those S. uberis strains evaluated shared >97% identity. The pepSUAM sequence located at the N terminus of SUAM was >99% identical among strains of S. uberis. Streptococcus uberis adhesion molecule expression was detected in all strains of S. uberis tested. These results suggest that sua is ubiquitous among strains of S. uberis isolated from diverse geographic locations and that SUAM is immunogenic. Copyright © 2014 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
Diversity of 16S rRNA genes of new Ehrlichia strains isolated from horses with clinical signs of Potomac horse fever.

PubMed

Wen, B; Rikihisa, Y; Fuerst, P A; Chaichanasiriwithaya, W

1995-04-01

Ehrlichia risticii is the causative agent of Potomac horse fever. Variations among the major antigens of different local E. risticii strains have been detected previously. To further assess genetic variability in this species or species complex, the sequences of the 16S rRNA genes of several isolates obtained from sick horses diagnosed as having Potomac horse fever were determined. The sequences of six isolates obtained from Ohio and three isolates obtained from Kentucky were amplified by PCR. Three groups of sequences were identified. The sequences of five of the Ohio isolates were identical to the sequence of the type strain of E. risticii, the Illinois strain. The sequence of one Ohio isolate, isolate 081, was unique; this sequence differed in 10 nucleotides from the sequence of the type strain (level of similarity, 99.3%). The sequences of the three Kentucky isolates were identical to each other, but differed by five bases from the sequence of the type strain (level of similarity, 99.6%). The levels of sequence similarity of isolate 081, the Kentucky isolates, and the type strain to the next most closely related Ehrlichia sp., Ehrlichia sennetsu, were 99.3, 99.2, and 99.2%, respectively. On the basis of the distinct antigenic profiles and the levels of 16S rRNA sequence divergence, isolate 081 is as divergent from the type strain of E. risticii as E. sennetsu is. Therefore, we suggest that strain 081 and the Kentucky isolates may represent two new distinct Ehrlichia species.
The complete mitochondrial genome of the Japanese ornamental koi carp (Cyprinus carpio) and its implication for the history of koi.

PubMed

Mabuchi, Kohji; Song, Hayeun

2014-02-01

Complete mitochondrial genome (mitogenome) sequences were determined for two individuals of Japanese ornamental koi carp. Interestingly, the obtained mitogenomes (16,581 bp) were both completely identical to the recently reported mitogenome of Oujiang color carp from China. Control region (CR) sequences in DNA database demonstrated that more than half (65%) of the koi carp individuals so far reported had partial or complete CR sequences identical to those from Oujiang color carp. These results might suggest that the Japanese koi carp has been originated from Chinese Oujiang color carp, contrary to the belief in Japan that the koi carps have been developed directly from carp stocks in Japan. In any case, the present results emphasize the importance of analyzing Oujiang color carp when studying the origin of koi carp.
Donkey Orchid Symptomless Virus: A Viral ‘Platypus’ from Australian Terrestrial Orchids

PubMed Central

Wylie, Stephen J.; Li, Hua; Jones, Michael G. K.

2013-01-01

Complete and partial genome sequences of two isolates of an unusual new plant virus, designated Donkey orchid symptomless virus (DOSV) were identified using a high-throughput sequencing approach. The virus was identified from asymptomatic plants of Australian terrestrial orchid Diuris longifolia (Common donkey orchid) growing in a remnant forest patch near Perth, western Australia. DOSV was identified from two D. longifolia plants of 264 tested, and from at least one plant of 129 Caladenia latifolia (pink fairy orchid) plants tested. Phylogenetic analysis of the genome revealed open reading frames (ORF) encoding seven putative proteins of apparently disparate origins. A 69-kDa protein (ORF1) that overlapped the replicase shared low identity with MPs of plant tymoviruses (Tymoviridae). A 157-kDa replicase (ORF2) and 22-kDa coat protein (ORF4) shared 32% and 40% amino acid identity, respectively, with homologous proteins encoded by members of the plant virus family Alphaflexiviridae. A 44-kDa protein (ORF3) shared low identity with myosin and an autophagy protein from Squirrelpox virus. A 27-kDa protein (ORF5) shared no identity with described proteins. A 14-kDa protein (ORF6) shared limited sequence identity (26%) over a limited region of the envelope glycoprotein precursor of mammal-infecting Crimea-Congo hemorrhagic fever virus (Bunyaviridae). The putative 25-kDa movement protein (MP) (ORF7) shared limited (27%) identity with 3A-like MPs of members of the plant-infecting Tombusviridae and Virgaviridae. Transmissibility was shown when DOSV systemically infected Nicotiana benthamiana plants. Structure and organization of the domains within the putative replicase of DOSV suggests a common evolutionary origin with ‘potexvirus-like’ replicases of viruses within the Alphaflexiviridae and Tymoviridae, and the CP appears to be ancestral to CPs of allexiviruses (Alphaflexiviridae). The MP shares an evolutionary history with MPs of dianthoviruses, but the other putative proteins are distant from plant viruses. DOSV is not readily classified in current lower order virus taxa. PMID:24223974

A homologue of the defender against the apoptotic death gene (dad1 )in UV-exposed Chlamydomonas cells is downregulated with the onset of programmed cell death.

PubMed

Moharikar, Swati; D'Souza, Jacinta S; Rao, Basuthkar J

2007-03-01

We report here the isolation of a homologue of the potential anti-apoptotic gene, defender against apoptotic death (dad1 )from Chlamydomonas reinhardtii cells.Using polymerase chain reaction (PCR),we investigated its expression in the execution process of programmed cell death (PCD)in UV-C exposed dying C.reinhardtii cells.Reverse- transcriptase (RT)-PCR showed that C.reinhardtii dad1 amplification was drastically reduced in UV-C exposed dying C.reinhardtii cells.We connect the downregulation of dad1 with the upregulation of apoptosis protease activating factor-1 (APAF-1)and the physiological changes that occur in C.reinhardtii cells upon exposure to 12 J/m 2 UV-C in order to show a reciprocal relationship between proapoptotic and inhibitor of apoptosis factors.The temporal changes indicate a correlation between the onset of cell death and dad1 downregulation.The sequence of the PCR product of the cDNA encoding the dad1 homologue was aligned with the annotated dad1 (C_20215)from the Chlamydomonas database (http://genome.jgi-psf.org:8080/annotator/servlet/jgi.annotation.Annotation?pDb=chlre2); Annotation?pDb=chlre2 );this sequence was found to show 100% identity,both at the nucleotide and amino acid level. The 327 bp transcript showed an open reading frame of 87 amino acid residues.The deduced amino acid sequence of the putative C.reinhardtii DAD1 homologue showed 54% identity with Oryza sativa, 56 identity with Drosophila melanogaster, 66% identity with Xenopus laevis, and 64% identity with Homo sapiens,Sus scrofa,Gallus gallus,Rattus norvegicus and Mus musculus.
Identification of cDNAs encoding viper venom hyaluronidases: cross-generic sequence conservation of full-length and unusually short variant transcripts.

PubMed

Harrison, Robert A; Ibison, Frances; Wilbraham, Davina; Wagstaff, Simon C

2007-05-01

The immobilisation of prey by snakes is most efficiently achieved by the rapid dissemination of venom from its site of injection into the blood stream. Hyaluronidase is a common component of snake venoms and has been termed the "venom spreading factor". In the absence of nucleotide or protein sequence data to confirm the functional identity of this venom component, we interrogated a venom gland EST database for the saw-scaled viper, Echis ocellatus (Nigeria), using the gene ontology (GO) term "carbohydrate metabolism". A single hyalurononglucosaminadase-activity matching sequence (EOC00242) was found and used to design PCR primers to acquire the full-length cDNA sequence. Although very different from the bee venom and mammalian hyaluronidase sequences, the E. ocellatus sequence retained all the catalytic, positional and structural residues that characterise this class of carbohydrate metabolising hydrolases. An extraordinarily high level of sequence identity (>95%) was observed in analogous venom gland cDNA sequences isolated (by PCR) from another saw-scaled viper species, E. pyramidum leakeyi (Kenya), and from the sahara horned viper, Cerastes cerastes cerastes (Egypt) and the puff adder, Bitis arietans (Nigeria). Smaller amplicons, lacking hyaluronidase catalytic residues because of 768 bp or 855 bp central deletions, appear to encode either truncated peptides without hyaluronidase activity, or are non-translated transcripts because they lack consensus translation initiating motifs.
Novel Human Adenovirus Causing Nosocomial Epidemic Keratoconjunctivitis▿

PubMed Central

Ishiko, Hiroaki; Shimada, Yasushi; Konno, Tsunetada; Hayashi, Akio; Ohguchi, Takeshi; Tagawa, Yoshitsugu; Aoki, Koki; Ohno, Shigeaki; Yamazaki, Shudo

2008-01-01

In 2000, we encountered cases of nosocomial infections with epidemic keratoconjunctivitis (EKC) at a university hospital in Kobe, in the western part of Japan. Two human adenovirus (HAdV) strains, Kobe-H and Kobe-S, were isolated from patients with nosocomial EKC infection. They were untypeable by existing neutralizing antisera; however, the isolate was neutralized with homologous antisera. We then encountered several cases of EKC due to nosocomial infections in eye clinics in different parts of Japan. A total of 80 HAdVs were isolated from patients with EKC at eight different hospitals. The partial hexon gene sequences of the isolates were determined and compared to those of the prototype strains of 51 serotypes. All isolates had identical partial hexon nucleotide sequences. Phylogenetic analysis classified these isolates into species of HAdV-D. The isolates showed 93.9 to 96.7% nucleotide identity with HAdV-D prototype strains, while all 32 HAdV-D prototype strains ranged from 93.2 to 99.2% identity. The sequences of the loop 2 and fiber knob regions from the representative strain, Kobe-H, were dissimilar in all prototype strains of 51 serotypes. We believe that this virus is a novel serotype of HAdV that causes EKC. PMID:18385435
Detection of Rickettsia helvetica and Candidatus R. tarasevichiae DNA in Ixodes persulcatus ticks collected in Northeastern European Russia (Komi Republic).

PubMed

Kartashov, Mikhail Yu; Glushkova, Ludmila I; Mikryukova, Tamara P; Korabelnikov, Igor V; Egorova, Yulia I; Tupota, Natalia L; Protopopova, Elena V; Konovalova, Svetlana N; Ternovoi, Vladimir A; Loktev, Valery B

2017-06-01

The number of tick-borne infections in the northern European regions of Russia has increased considerably in the last years. In the present study, 676 unfed adult Ixodes persulcatus ticks were collected in the Komi Republic from 2011 to 2013 to study tick-borne rickettsioses. Rickettsia spp. DNA was detected by PCR in 51 (7.6%) ticks. The nucleotide sequence analysis of gltA fragments (765bp) from 51 ticks indicated that 60.8% and 39.2% of the ticks were infected with Rickettsia helvetica and Candidatus R. tarasevichiae, respectively. The gltA fragments showed 100% identity with those of Candidatus R. tarasevichiae previously discovered in Siberia and China, whereas R. helvetica showed 99.9% sequence identity with European isolates. The ompB had 8 nucleotide substitutions, 6 of which resulted in amino acid substitutions. In the sca9 gene, 3 nucleotide substitutions were detected, and only one resulted in amino acid substitution. The smpA, ompW, and β-lactamase genes of R. helvetica also showed a high level of sequence identity. Copyright © 2017 Elsevier GmbH. All rights reserved.
Morphological and Molecular Identification of Globodera pallida Associated with Potato in Idaho

PubMed Central

Skantar, A. M.; Handoo, Z. A.; Carta, L. K.; Chitwood, D. J.

2007-01-01

The identity of a newly discovered population of pale potato cyst nematode Globodera pallida associated with potato in eastern Idaho was established by morphological and molecular methods. Morphometrics of cysts and second-stage juveniles were generally within the expected ranges for G. pallida with some variations noted. The Idaho population and paratype material from Epworth, Lincolnshire, England, both showed variations in tail shape, with bluntly rounded to finely pointed tail termini. Compared to literature values for the paratypes, second-stage juveniles of the Idaho population had a somewhat shorter mean body length, and cysts had a slightly higher mean distance from the anus to the nearest edge of the fenestra. PCR-RFLP of the rDNA ITS region, sequence-specific multiplex PCR and DNA sequence comparisons all confirmed the identity of the Idaho population as G. pallida. The ITS rDNA sequence of the Idaho isolate was identical to those from York, England, and the Netherlands. Species-specific primers that can positively identify the tobacco cyst nematode Globodera tabacum were also developed, providing a new assay for distinguishing this species from G. pallida and the golden potato cyst nematode Globodera rostochiensis. PMID:19259482
Development and application of triple antibody sandwich enzyme-linked immunosorbent assays for begomovirus detection using monoclonal antibodies against Tomato yellow leaf curl Thailand virus.

PubMed

Seepiban, Channarong; Charoenvilaisiri, Saengsoon; Warin, Nuchnard; Bhunchoth, Anjana; Phironrit, Namthip; Phuangrat, Bencharong; Chatchawankanphanich, Orawan; Attathom, Supat; Gajanandana, Oraprapai

2017-05-30

Tomato yellow leaf curl Thailand virus, TYLCTHV, is a begomovirus that causes severe losses of tomato crops in Thailand as well as several countries in Southeast and East Asia. The development of monoclonal antibodies (MAbs) and serological methods for detecting TYLCTHV is essential for epidemiological studies and screening for virus-resistant cultivars. The recombinant coat protein (CP) of TYLCTHV was expressed in Escherichia coli and used to generate MAbs against TYLCTHV through hybridoma technology. The MAbs were characterized and optimized to develop triple antibody sandwich enzyme-linked immunosorbent assays (TAS-ELISAs) for begomovirus detection. The efficiency of TAS-ELISAs for begomovirus detection was evaluated with tomato, pepper, eggplant, okra and cucurbit plants collected from several provinces in Thailand. Molecular identification of begomoviruses in these samples was also performed through PCR and DNA sequence analysis of the CP gene. Two MAbs (M1 and D2) were generated and used to develop TAS-ELISAs for begomovirus detection. The results of begomovirus detection in 147 field samples indicated that MAb M1 reacted with 2 begomovirus species, TYLCTHV and Tobacco leaf curl Yunnan virus (TbLCYnV), whereas MAb D2 reacted with 4 begomovirus species, TYLCTHV, TbLCYnV, Tomato leaf curl New Delhi virus (ToLCNDV) and Squash leaf curl China virus (SLCCNV). Phylogenetic analyses of CP amino acid sequences from these begomoviruses revealed that the CP sequences of begomoviruses recognized by the narrow-spectrum MAb M1 were highly conserved, sharing 93% identity with each other but only 72-81% identity with MAb M1-negative begomoviruses. The CP sequences of begomoviruses recognized by the broad-spectrum MAb D2 demonstrated a wider range of amino acid sequence identity, sharing 78-96% identity with each other and 72-91% identity with those that were not detected by MAb D2. TAS-ELISAs using the narrow-specificity MAb M1 proved highly efficient for the detection of TYLCTHV and TbLCYnV, whereas TAS-ELISAs using the broad-specificity MAb D2 were highly efficient for the detection of TYLCTHV, TbLCYnV, ToLCNDV and SLCCNV. Both newly developed assays allow for sensitive, inexpensive, high-throughput detection of begomoviruses in field plant samples, as well as screening for virus-resistant cultivars.
Genetic diversity of pneumococcal surface protein A in invasive pneumococcal isolates from Korean children, 1991-2016.

PubMed

Yun, Ki Wook; Choi, Eun Hwa; Lee, Hoan Jong

2017-01-01

Pneumococcal surface protein A (PspA) is an important virulence factor of pneumococci and has been investigated as a primary component of a capsular serotype-independent pneumococcal vaccine. Thus, we sought to determine the genetic diversity of PspA to explore its potential as a vaccine candidate. Among the 190 invasive pneumococcal isolates collected from Korean children between 1991 and 2016, two (1.1%) isolates were found to have no pspA by multiple polymerase chain reactions. The full length pspA genes from 185 pneumococcal isolates were sequenced. The length of pspA varied, ranging from 1,719 to 2,301 base pairs with 55.7-100% nucleotide identity. Based on the sequences of the clade-defining regions, 68.7% and 49.7% were in PspA family 2 and clade 3/family 2, respectively. PspA clade types were correlated with genotypes using multilocus sequence typing and divided into several subclades based on diversity analysis of the N-terminal α-helical regions, which showed nucleotide sequence identities of 45.7-100% and amino acid sequence identities of 23.1-100%. Putative antigenicity plots were also diverse among individual clades and subclades. The differences in antigenicity patterns were concentrated within the N-terminal 120 amino acids. In conclusion, the N-terminal α-helical domain, which is known to be the major immunogenic portion of PspA, is genetically variable and should be further evaluated for antigenic differences and cross-reactivity between various PspA types from pneumococcal isolates.
Genome organisation and sequence comparison suggest intraspecies incongruence in M RNA of Watermelon bud necrosis virus.

PubMed

Kumar, Rakesh; Mandal, B; Geetanjali, A S; Jain, R K; Jaiwal, P K

2010-08-01

Watermelon bud necrosis virus (WBNV), a member of the genus Tospovirus, family Bunyaviridae is an important viral pathogen in watermelon cultivation in India. The complete genome sequence properties of WBNV are not available. In the present study, the complete M RNA sequence and the genome organisation of a WBNV isolate infecting watermelon in Delhi (WBNV-wDel) were determined. The M RNA was 4,794 nucleotides (nt) long and potentially coded for a movement protein (NSm) of 34.22 kDa (307 amino acids) on the viral sense strand and a Gn/Gc glycoprotein precursor of 127.15 kDa (1,121 amino acids) on the complementary strand. The two open reading frames were separated by an intergenic region of 402 nt. The 5' and 3' untranslated regions were 55 and 47 nt long, respectively, containing complementary termini typical of tospoviruses. WBNV-wDel was most closely related (79.1% identity) to Groundnut bud necrosis virus, an important tospovirus that occurs in several crops in India, and was different (63.3-75.2% identity) from the other cucurbit-infecting tospoviruses known to occur in Taiwan and Japan. Sequence analysis of NSm and Gn/Gc revealed phylogenetic incongruence between WBNV-wDel and another isolate originating from central India (WBNV-Wm-Som isolate). The Wm-Som isolate showed evolutionary divergence from the wDel isolate in the Gn/Gc protein (74.6% identity) potentially due to recombination with the other tospoviruses that are known to occur in India. This is the first report of a comparison of complete sequences of M RNA of WBNV.
Cloning and molecular characterization of scorpion Buthus martensi venom hyaluronidases: a novel full-length and diversiform noncoding isoforms.

PubMed

Xia, Xichao; Liu, Rongzhi; Li, Yi; Xue, Shipeng; Liu, Qingchun; Jiang, Xiao; Zhang, Wenjuan; Ding, Ke

2014-09-01

Hyaluronidase is a common component of scorpion venom and has been considered as "spreading factor" that promotes a fast penetration of the venom in the anaphylactic reaction. In the current study, a novel full-length of hyaluronidase BmHYI and three noncoding isoforms of BmHYII, BmHYIII and BmHYIV were cloned by using a combined strategy based on peptide sequencing and Rapid Amplification of cDNA Ends (RACE). BmHYI has 410 amino acid residues containing the catalytic, positional and five potential N-glycosylation sites. The deduced protein sequence of BmHYI shares significant identity with venom hyaluronidases from bees and snakes. The phylogenetic analysis showed early divergence and independent evolution of BmHYI from other hyaluronidases. An extraordinarily high level of sequence similarity was detected among four sequences. But, BmHYII, BmHYIII and BmHYIV were short of stop-codon in the open reading frame and poly(A) signal in the 3' end. Copyright © 2014 Elsevier B.V. All rights reserved.
Multi-Donor Longitudinal Antibody Repertoire Sequencing Reveals the Existence of Public Antibody Clonotypes in HIV-1 Infection.

PubMed

Setliff, Ian; McDonnell, Wyatt J; Raju, Nagarajan; Bombardi, Robin G; Murji, Amyn A; Scheepers, Cathrine; Ziki, Rutendo; Mynhardt, Charissa; Shepherd, Bryan E; Mamchak, Alusha A; Garrett, Nigel; Karim, Salim Abdool; Mallal, Simon A; Crowe, James E; Morris, Lynn; Georgiev, Ivelin S

2018-06-13

Characterization of single antibody lineages within infected individuals has provided insights into the development of Env-specific antibodies. However, a systems-level understanding of the humoral response against HIV-1 is limited. Here, we interrogated the antibody repertoires of multiple HIV-infected donors from an infection-naive state through acute and chronic infection using next-generation sequencing. This analysis revealed the existence of "public" antibody clonotypes that were shared among multiple HIV-infected individuals. The HIV-1 reactivity for representative antibodies from an identified public clonotype shared by three donors was confirmed. Furthermore, a meta-analysis of publicly available antibody repertoire sequencing datasets revealed antibodies with high sequence identity to known HIV-reactive antibodies, even in repertoires that were reported to be HIV naive. The discovery of public antibody clonotypes in HIV-infected individuals represents an avenue of significant potential for better understanding antibody responses to HIV-1 infection, as well as for clonotype-specific vaccine development. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.
Kinact: a computational approach for predicting activating missense mutations in protein kinases.

PubMed

Rodrigues, Carlos H M; Ascher, David B; Pires, Douglas E V

2018-05-21

Protein phosphorylation is tightly regulated due to its vital role in many cellular processes. While gain of function mutations leading to constitutive activation of protein kinases are known to be driver events of many cancers, the identification of these mutations has proven challenging. Here we present Kinact, a novel machine learning approach for predicting kinase activating missense mutations using information from sequence and structure. By adapting our graph-based signatures, Kinact represents both structural and sequence information, which are used as evidence to train predictive models. We show the combination of structural and sequence features significantly improved the overall accuracy compared to considering either primary or tertiary structure alone, highlighting their complementarity. Kinact achieved a precision of 87% and 94% and Area Under ROC Curve of 0.89 and 0.92 on 10-fold cross-validation, and on blind tests, respectively, outperforming well established tools (P < 0.01). We further show that Kinact performs equally well on homology models built using templates with sequence identity as low as 33%. Kinact is freely available as a user-friendly web server at http://biosig.unimelb.edu.au/kinact/.
Spatio-temporal Variations of Characteristic Repeating Earthquake Sequences along the Middle America Trench in Mexico

NASA Astrophysics Data System (ADS)

Dominguez, L. A.; Taira, T.; Hjorleifsdottir, V.; Santoyo, M. A.

2015-12-01

Repeating earthquake sequences are sets of events that are thought to rupture the same area on the plate interface and thus provide nearly identical waveforms. We systematically analyzed seismic records from 2001 through 2014 to identify repeating earthquakes with highly correlated waveforms occurring along the subduction zone of the Cocos plate. Using the correlation coefficient (cc) and spectral coherency (coh) of the vertical components as selection criteria, we found a set of 214 sequences whose waveforms exceed cc≥95% and coh≥95%. Spatial clustering along the trench shows large variations in repeating earthquakes activity. Particularly, the rupture zone of the M8.1, 1985 earthquake shows an almost absence of characteristic repeating earthquakes, whereas the Guerrero Gap zone and the segment of the trench close to the Guerrero-Oaxaca border shows a significantly larger number of repeating earthquakes sequences. Furthermore, temporal variations associated to stress changes due to major shows episodes of unlocking and healing of the interface. Understanding the different components that control the location and recurrence time of characteristic repeating sequences is a key factor to pinpoint areas where large megathrust earthquakes may nucleate and consequently to improve the seismic hazard assessment.
Towards comprehensive structural motif mining for better fold annotation in the "twilight zone" of sequence dissimilarity

PubMed Central

Jia, Yi; Huan, Jun; Buhr, Vincent; Zhang, Jintao; Carayannopoulos, Leonidas N

2009-01-01

Background Automatic identification of structure fingerprints from a group of diverse protein structures is challenging, especially for proteins whose divergent amino acid sequences may fall into the "twilight-" or "midnight-" zones where pair-wise sequence identities to known sequences fall below 25% and sequence-based functional annotations often fail. Results Here we report a novel graph database mining method and demonstrate its application to protein structure pattern identification and structure classification. The biologic motivation of our study is to recognize common structure patterns in "immunoevasins", proteins mediating virus evasion of host immune defense. Our experimental study, using both viral and non-viral proteins, demonstrates the efficiency and efficacy of the proposed method. Conclusion We present a theoretic framework, offer a practical software implementation for incorporating prior domain knowledge, such as substitution matrices as studied here, and devise an efficient algorithm to identify approximate matched frequent subgraphs. By doing so, we significantly expanded the analytical power of sophisticated data mining algorithms in dealing with large volume of complicated and noisy protein structure data. And without loss of generality, choice of appropriate compatibility matrices allows our method to be easily employed in domains where subgraph labels have some uncertainty. PMID:19208148
Distinctive acceptor-end structure and other determinants of Escherichia coli tRNAPro identity.

PubMed Central

McClain, W H; Schneider, J; Gabriel, K

1994-01-01

The previously uncharacterized determinants of the specificity of tRNAPro for aminoacylation (tRNAPro identity) were defined by a computer comparison of all Escherichia coli tRNA sequences and tested by a functional analysis of amber suppressor tRNAs in vivo. We determined the amino acid specificity of tRNA by sequencing a suppressed protein and the aminoacylation efficiency of tRNA by examining the steady-state level of aminoacyl-tRNA. On substituting nucleotides derived from the acceptor end and variable pocket of tRNAPro for the corresponding nucleotides in a tRNAPhe gene, the identity of the resulting tRNA changed substantially but incompletely to that of tRNAPro. The redesigned tRNAPhe was weakly active and aminoacyl-tRNA was not detected. Ethyl methanesulfonate mutagenesis of the redesigned tRNAPhe gene produced a mutant with a wobble pair in place of a base pair in the end of the acceptor-stem helix of the transcribed tRNA. This mutant exhibited both a tRNAPro identity and substantial aminoacyl-tRNA. The results speak for the importance of a distinctive conformation in the acceptor-stem helix of tRNAPro for aminoacylation by the prolyl-tRNA synthetase. The anticodon also contributes to tRNAPro identity but is not necessary in vivo. Images PMID:8127693
Systematic analysis of protein identity between Zika virus and other arthropod-borne viruses.

PubMed

Chang, Hsiao-Han; Huber, Roland G; Bond, Peter J; Grad, Yonatan H; Camerini, David; Maurer-Stroh, Sebastian; Lipsitch, Marc

2017-07-01

To analyse the proportions of protein identity between Zika virus and dengue, Japanese encephalitis, yellow fever, West Nile and chikungunya viruses as well as polymorphism between different Zika virus strains. We used published protein sequences for the Zika virus and obtained protein sequences for the other viruses from the National Center for Biotechnology Information (NCBI) protein database or the NCBI virus variation resource. We used BLASTP to find regions of identity between viruses. We quantified the identity between the Zika virus and each of the other viruses, as well as within-Zika virus polymorphism for all amino acid k -mers across the proteome, with k ranging from 6 to 100. We assessed accessibility of protein fragments by calculating the solvent accessible surface area for the envelope and nonstructural-1 (NS1) proteins. In total, we identified 294 Zika virus protein fragments with both low proportion of identity with other viruses and low levels of polymorphisms among Zika virus strains. The list includes protein fragments from all Zika virus proteins, except NS3. NS4A has the highest number (190 k -mers) of protein fragments on the list. We provide a candidate list of protein fragments that could be used when developing a sensitive and specific serological test to detect previous Zika virus infections.
Next-Generation Sequencing-Based Approaches for Mutation Mapping and Identification in Caenorhabditis elegans

PubMed Central

Doitsidou, Maria; Jarriault, Sophie; Poole, Richard J.

2016-01-01

The use of next-generation sequencing (NGS) has revolutionized the way phenotypic traits are assigned to genes. In this review, we describe NGS-based methods for mapping a mutation and identifying its molecular identity, with an emphasis on applications in Caenorhabditis elegans. In addition to an overview of the general principles and concepts, we discuss the main methods, provide practical and conceptual pointers, and guide the reader in the types of bioinformatics analyses that are required. Owing to the speed and the plummeting costs of NGS-based methods, mapping and cloning a mutation of interest has become straightforward, quick, and relatively easy. Removing this bottleneck previously associated with forward genetic screens has significantly advanced the use of genetics to probe fundamental biological processes in an unbiased manner. PMID:27729495
N-terminus conservation in the terminal pigment of phycobilisomes from a prokaryotic and eukaryotic alga. [Porphyridium cruentum; Nostoc

DOE Office of Scientific and Technical Information (OSTI.GOV)

Gantt, E.; Cunningham, F.X. Jr.; Lipschultz, C.A.

1988-04-01

High molecular weight polypeptides from phycobilisomes, believed to be involved in facilitating the energy flow from phycobilisomes to thylakoids, are conserved in the prokaryote Nostoc sp. and the eukaryote Porphyridium cruentum. Partial N-terminal sequence analysis of the phycobilisome-polypeptides of Nostoc (94 kilodalton) and Porphyridium (92 kilodalton) revealed 55% identity in the first 20 residues, but no significant homology with sequences of other phycobiliproteins or phycobilisome-linkers. Polypeptides (94 and 92 kilodalton) from Nostoc thylakoids free of phycobilisomes, previously presumed to be involved in the phycobilisome-thylakoid linkage exhibit the same immunocrossreactivity but are different from the 94 kilodalton-phycobilisome polypeptide by having blockedmore » N-termini and a different amino acid composition.« less
Bacterial Communities in Boreal Forest Mushrooms Are Shaped Both by Soil Parameters and Host Identity

PubMed Central

Pent, Mari; Põldmaa, Kadri; Bahram, Mohammad

2017-01-01

Despite recent advances in understanding the microbiome of eukaryotes, little is known about microbial communities in fungi. Here we investigate the structure of bacterial communities in mushrooms, including common edible ones, with respect to biotic and abiotic factors in the boreal forest. Using a combination of culture-based and Illumina high-throughput sequencing, we characterized the bacterial communities in fruitbodies of fungi from eight genera spanning four orders of the class Agaricomycetes (Basidiomycota). Our results revealed that soil pH followed by fungal identity are the main determinants of the structure of bacterial communities in mushrooms. While almost half of fruitbody bacteria were also detected from soil, the abundance of several bacterial taxa differed considerably between the two environments. The effect of host identity was significant at the fungal genus and order level and could to some extent be ascribed to the distinct bacterial community of the chanterelle, representing Cantharellales—the earliest diverged group of mushroom-forming basidiomycetes. These data suggest that besides the substantial contribution of soil as a major taxa source of bacterial communities in mushrooms, the structure of these communities is also affected by the identity of the host. Thus, bacteria inhabiting fungal fruitbodies may be non-randomly selected from environment based on their symbiotic functions and/or habitat requirements. PMID:28539921
Characterisation of the nicotianamine aminotransferase and deoxymugineic acid synthase genes essential to Strategy II iron uptake in bread wheat (Triticum aestivum L.)

PubMed Central

Johnson, Alexander A. T.

2017-01-01

Iron (Fe) uptake in graminaceous plant species occurs via the release and uptake of Fe-chelating compounds known as mugineic acid family phytosiderophores (MAs). In the MAs biosynthetic pathway, nicotianamine aminotransferase (NAAT) and deoxymugineic acid synthase (DMAS) enzymes catalyse the formation of 2’-deoxymugineic acid (DMA) from nicotianamine (NA). Here we describe the identification and characterisation of six TaNAAT and three TaDMAS1 genes in bread wheat (Triticum aestivum L.). The coding sequences of all six TaNAAT homeologs consist of seven exons with ≥88.0% nucleotide sequence identity and most sequence variation present in the first exon. The coding sequences of the three TaDMAS1 homeologs consist of three exons with ≥97.8% nucleotide sequence identity. Phylogenetic analysis revealed that the TaNAAT and TaDMAS1 proteins are most closely related to the HvNAAT and HvDMAS1 proteins of barley and that there are two distinct groups of TaNAAT proteins—TaNAAT1 and TaNAAT2 –that correspond to the HvNAATA and HvNAATB proteins, respectively. Quantitative reverse transcription-PCR analysis revealed that the TaNAAT2 genes are expressed at highest levels in anther tissues whilst the TaNAAT1 and TaDMAS1 genes are expressed at highest levels in root tissues of bread wheat. Furthermore, the TaNAAT1, TaNAAT2 and TaDMAS1 genes were differentially regulated by plant Fe status and their expression was significantly upregulated in root tissues from day five onwards during a seven-day Fe deficiency treatment. The identification and characterization of the TaNAAT1, TaNAAT2 and TaDMAS1 genes provides a valuable genetic resource for improving bread wheat growth on Fe deficient soils and enhancing grain Fe nutrition. PMID:28475636
An epidemiological survey of bovine Babesia and Theileria parasites in cattle, buffaloes, and sheep in Egypt.

PubMed

Elsify, Ahmed; Sivakumar, Thillaiampalam; Nayel, Mohammed; Salama, Akram; Elkhtam, Ahmed; Rizk, Mohamed; Mosaab, Omar; Sultan, Khaled; Elsayed, Shimaa; Igarashi, Ikuo; Yokoyama, Naoaki

2015-02-01

Cattle, buffaloes, and sheep are the main sources of meat and milk in Egypt, but their productivity is thought to be greatly reduced by hemoprotozoan parasitic diseases. In this study, we analyzed the infection rates of Babesia bovis, Babesia bigemina, Theileria annulata, and Theileria orientalis, using parasite-specific PCR assays in blood-DNA samples sourced from cattle (n=439), buffaloes (n=50), and sheep (n=105) reared in Menoufia, Behera, Giza, and Sohag provinces of Egypt. In cattle, the positive rates of B. bovis, B. bigemina, T. annulata, and T. orientalis were 3.18%, 7.97%, 9.56%, and 0.68%, respectively. On the other hand, B. bovis and T. orientalis were the only parasites detected in buffaloes and each of these parasites was only found in two individual DNA samples (both 2%), while one (0.95%) and two (1.90%) of the sheep samples were positive for B. bovis and B. bigemina, respectively. Sequence analysis showed that the B. bovis Rhoptry Associated Protein-1 and the B. bigemina Apical Membrane Antigen-1 genes were highly conserved among the samples, with 99.3-100% and 95.3-100% sequence identity values, respectively. In contrast, the Egyptian T. annulata merozoite surface antigen-1 gene sequences were relatively diverse (87.8-100% identity values), dispersing themselves across several clades in the phylogenetic tree containing sequences from other countries. Additionally, the T. orientalis Major Piroplasm Surface Protein (MPSP) gene sequences were classified as types 1 and 2. This is the first report of T. orientalis in Egypt, and of type 2 MPSP in buffaloes. Detection of MPSP type 2, which is considered a relatively virulent genotype, suggests that T. orientalis infection may have veterinary and economic significance in Egypt. In conclusion, the present study, which analyzed multiple species of Babesia and Theileria parasites in different livestock animals, may shed an additional light on the epidemiology of hemoprotozoan parasites in Egypt. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.

Synthetic versions of firefly luciferase and Renilla luciferase reporter genes that resist transgene silencing in sugarcane

PubMed Central

2014-01-01

Background Down-regulation or silencing of transgene expression can be a major hurdle to both molecular studies and biotechnology applications in many plant species. Sugarcane is particularly effective at silencing introduced transgenes, including reporter genes such as the firefly luciferase gene. Synthesizing transgene coding sequences optimized for usage in the host plant is one method of enhancing transgene expression and stability. Using specified design rules we have synthesised new coding sequences for both the firefly luciferase and Renilla luciferase reporter genes. We have tested these optimized versions for enhanced levels of luciferase activity and for increased steady state luciferase mRNA levels in sugarcane. Results The synthetic firefly luciferase (luc*) and Renilla luciferase (Renluc*) coding sequences have elevated G + C contents in line with sugarcane codon usage, but maintain 75% identity to the native firefly or Renilla luciferase nucleotide sequences and 100% identity to the protein coding sequences. Under the control of the maize pUbi promoter, the synthetic luc* and Renluc* genes yielded 60x and 15x higher luciferase activity respectively, over the native firefly and Renilla luciferase genes in transient assays on sugarcane suspension cell cultures. Using a novel transient assay in sugarcane suspension cells combining co-bombardment and qRT-PCR, we showed that synthetic luc* and Renluc* genes generate increased transcript levels compared to the native firefly and Renilla luciferase genes. In stable transgenic lines, the luc* transgene generated significantly higher levels of expression than the native firefly luciferase transgene. The fold difference in expression was highest in the youngest tissues. Conclusions We developed synthetic versions of both the firefly and Renilla luciferase reporter genes that resist transgene silencing in sugarcane. These transgenes will be particularly useful for evaluating the expression patterns conferred by existing and newly isolated promoters in sugarcane tissues. The strategies used to design the synthetic luciferase transgenes could be applied to other transgenes that are aggressively silenced in sugarcane. PMID:24708613
Synthetic versions of firefly luciferase and Renilla luciferase reporter genes that resist transgene silencing in sugarcane.

PubMed

Chou, Ting-Chun; Moyle, Richard L

2014-04-08

Down-regulation or silencing of transgene expression can be a major hurdle to both molecular studies and biotechnology applications in many plant species. Sugarcane is particularly effective at silencing introduced transgenes, including reporter genes such as the firefly luciferase gene.Synthesizing transgene coding sequences optimized for usage in the host plant is one method of enhancing transgene expression and stability. Using specified design rules we have synthesised new coding sequences for both the firefly luciferase and Renilla luciferase reporter genes. We have tested these optimized versions for enhanced levels of luciferase activity and for increased steady state luciferase mRNA levels in sugarcane. The synthetic firefly luciferase (luc*) and Renilla luciferase (Renluc*) coding sequences have elevated G + C contents in line with sugarcane codon usage, but maintain 75% identity to the native firefly or Renilla luciferase nucleotide sequences and 100% identity to the protein coding sequences.Under the control of the maize pUbi promoter, the synthetic luc* and Renluc* genes yielded 60x and 15x higher luciferase activity respectively, over the native firefly and Renilla luciferase genes in transient assays on sugarcane suspension cell cultures.Using a novel transient assay in sugarcane suspension cells combining co-bombardment and qRT-PCR, we showed that synthetic luc* and Renluc* genes generate increased transcript levels compared to the native firefly and Renilla luciferase genes.In stable transgenic lines, the luc* transgene generated significantly higher levels of expression than the native firefly luciferase transgene. The fold difference in expression was highest in the youngest tissues. We developed synthetic versions of both the firefly and Renilla luciferase reporter genes that resist transgene silencing in sugarcane. These transgenes will be particularly useful for evaluating the expression patterns conferred by existing and newly isolated promoters in sugarcane tissues. The strategies used to design the synthetic luciferase transgenes could be applied to other transgenes that are aggressively silenced in sugarcane.
Draft Genome Sequences of Two Bacillus thuringiensis Strains and Characterization of a Putative 41.9-kDa Insecticidal Toxin

PubMed Central

Palma, Leopoldo; Muñoz, Delia; Berry, Colin; Murillo, Jesús; Caballero, Primitivo

2014-01-01

In this work, we report the genome sequencing of two Bacillus thuringiensis strains using Illumina next-generation sequencing technology (NGS). Strain Hu4-2, toxic to many lepidopteran pest species and to some mosquitoes, encoded genes for two insecticidal crystal (Cry) proteins, cry1Ia and cry9Ea, and a vegetative insecticidal protein (Vip) gene, vip3Ca2. Strain Leapi01 contained genes coding for seven Cry proteins (cry1Aa, cry1Ca, cry1Da, cry2Ab, cry9Ea and two cry1Ia gene variants) and a vip3 gene (vip3Aa10). A putative novel insecticidal protein gene 1143 bp long was found in both strains, whose sequences exhibited 100% nucleotide identity. The predicted protein showed 57 and 100% pairwise identity to protein sequence 72 from a patented Bt strain (US8318900) and to a putative 41.9-kDa insecticidal toxin from Bacillus cereus, respectively. The 41.9-kDa protein, containing a C-terminal 6× HisTag fusion, was expressed in Escherichia coli and tested for the first time against four lepidopteran species (Mamestra brassicae, Ostrinia nubilalis, Spodoptera frugiperda and S. littoralis) and the green-peach aphid Myzus persicae at doses as high as 4.8 µg/cm2 and 1.5 mg/mL, respectively. At these protein concentrations, the recombinant 41.9-kDa protein caused no mortality or symptoms of impaired growth against any of the insects tested, suggesting that these species are outside the protein’s target range or that the protein may not, in fact, be toxic. While the use of the polymerase chain reaction has allowed a significant increase in the number of Bt insecticidal genes characterized to date, novel NGS technologies promise a much faster, cheaper and efficient screening of Bt pesticidal proteins. PMID:24784323
Independent studies using deep sequencing resolve the same set of core bacterial species dominating gut communities of honey bees.

PubMed

Sabree, Zakee L; Hansen, Allison K; Moran, Nancy A

2012-01-01

Starting in 2003, numerous studies using culture-independent methodologies to characterize the gut microbiota of honey bees have retrieved a consistent and distinctive set of eight bacterial species, based on near identity of the 16S rRNA gene sequences. A recent study [Mattila HR, Rios D, Walker-Sperling VE, Roeselers G, Newton ILG (2012) Characterization of the active microbiotas associated with honey bees reveals healthier and broader communities when colonies are genetically diverse. PLoS ONE 7(3): e32962], using pyrosequencing of the V1-V2 hypervariable region of the 16S rRNA gene, reported finding entirely novel bacterial species in honey bee guts, and used taxonomic assignments from these reads to predict metabolic activities based on known metabolisms of cultivable species. To better understand this discrepancy, we analyzed the Mattila et al. pyrotag dataset. In contrast to the conclusions of Mattila et al., we found that the large majority of pyrotag sequences belonged to clusters for which representative sequences were identical to sequences from previously identified core species of the bee microbiota. On average, they represent 95% of the bacteria in each worker bee in the Mattila et al. dataset, a slightly lower value than that found in other studies. Some colonies contain small proportions of other bacteria, mostly species of Enterobacteriaceae. Reanalysis of the Mattila et al. dataset also did not support a relationship between abundances of Bifidobacterium and of putative pathogens or a significant difference in gut communities between colonies from queens that were singly or multiply mated. Additionally, consistent with previous studies, the dataset supports the occurrence of considerable strain variation within core species, even within single colonies. The roles of these bacteria within bees, or the implications of the strain variation, are not yet clear.
Rapid and accurate pyrosequencing of angiosperm plastid genomes

PubMed Central

Moore, Michael J; Dhingra, Amit; Soltis, Pamela S; Shaw, Regina; Farmerie, William G; Folta, Kevin M; Soltis, Douglas E

2006-01-01

Background Plastid genome sequence information is vital to several disciplines in plant biology, including phylogenetics and molecular biology. The past five years have witnessed a dramatic increase in the number of completely sequenced plastid genomes, fuelled largely by advances in conventional Sanger sequencing technology. Here we report a further significant reduction in time and cost for plastid genome sequencing through the successful use of a newly available pyrosequencing platform, the Genome Sequencer 20 (GS 20) System (454 Life Sciences Corporation), to rapidly and accurately sequence the whole plastid genomes of the basal eudicot angiosperms Nandina domestica (Berberidaceae) and Platanus occidentalis (Platanaceae). Results More than 99.75% of each plastid genome was simultaneously obtained during two GS 20 sequence runs, to an average depth of coverage of 24.6× in Nandina and 17.3× in Platanus. The Nandina and Platanus plastid genomes shared essentially identical gene complements and possessed the typical angiosperm plastid structure and gene arrangement. To assess the accuracy of the GS 20 sequence, over 45 kilobases of sequence were generated for each genome using conventional sequencing. Overall error rates of 0.043% and 0.031% were observed in GS 20 sequence for Nandina and Platanus, respectively. More than 97% of all observed errors were associated with homopolymer runs, with ~60% of all errors associated with homopolymer runs of 5 or more nucleotides and ~50% of all errors associated with regions of extensive homopolymer runs. No substitution errors were present in either genome. Error rates were generally higher in the single-copy and noncoding regions of both plastid genomes relative to the inverted repeat and coding regions. Conclusion Highly accurate and essentially complete sequence information was obtained for the Nandina and Platanus plastid genomes using the GS 20 System. More importantly, the high accuracy observed in the GS 20 plastid genome sequence was generated for a significant reduction in time and cost over traditional shotgun-based genome sequencing techniques, although with approximately half the coverage of previously reported GS 20 de novo genome sequence. The GS 20 should be broadly applicable to angiosperm plastid genome sequencing, and therefore promises to expand the scale of plant genetic and phylogenetic research dramatically. PMID:16934154
Combining phage display with de novo protein sequencing for reverse engineering of monoclonal antibodies.

PubMed

Rickert, Keith W; Grinberg, Luba; Woods, Robert M; Wilson, Susan; Bowen, Michael A; Baca, Manuel

2016-01-01

The enormous diversity created by gene recombination and somatic hypermutation makes de novo protein sequencing of monoclonal antibodies a uniquely challenging problem. Modern mass spectrometry-based sequencing will rarely, if ever, provide a single unambiguous sequence for the variable domains. A more likely outcome is computation of an ensemble of highly similar sequences that can satisfy the experimental data. This outcome can result in the need for empirical testing of many candidate sequences, sometimes iteratively, to identity one which can replicate the activity of the parental antibody. Here we describe an improved approach to antibody protein sequencing by using phage display technology to generate a combinatorial library of sequences that satisfy the mass spectrometry data, and selecting for functional candidates that bind antigen. This approach was used to reverse engineer 2 commercially-obtained monoclonal antibodies against murine CD137. Proteomic data enabled us to assign the majority of the variable domain sequences, with the exception of 3-5% of the sequence located within or adjacent to complementarity-determining regions. To efficiently resolve the sequence in these regions, small phage-displayed libraries were generated and subjected to antigen binding selection. Following enrichment of antigen-binding clones, 2 clones were selected for each antibody and recombinantly expressed as antigen-binding fragments (Fabs). In both cases, the reverse-engineered Fabs exhibited identical antigen binding affinity, within error, as Fabs produced from the commercial IgGs. This combination of proteomic and protein engineering techniques provides a useful approach to simplifying the technically challenging process of reverse engineering monoclonal antibodies from protein material.
Combining phage display with de novo protein sequencing for reverse engineering of monoclonal antibodies

PubMed Central

Rickert, Keith W.; Grinberg, Luba; Woods, Robert M.; Wilson, Susan; Bowen, Michael A.; Baca, Manuel

2016-01-01

ABSTRACT The enormous diversity created by gene recombination and somatic hypermutation makes de novo protein sequencing of monoclonal antibodies a uniquely challenging problem. Modern mass spectrometry-based sequencing will rarely, if ever, provide a single unambiguous sequence for the variable domains. A more likely outcome is computation of an ensemble of highly similar sequences that can satisfy the experimental data. This outcome can result in the need for empirical testing of many candidate sequences, sometimes iteratively, to identity one which can replicate the activity of the parental antibody. Here we describe an improved approach to antibody protein sequencing by using phage display technology to generate a combinatorial library of sequences that satisfy the mass spectrometry data, and selecting for functional candidates that bind antigen. This approach was used to reverse engineer 2 commercially-obtained monoclonal antibodies against murine CD137. Proteomic data enabled us to assign the majority of the variable domain sequences, with the exception of 3–5% of the sequence located within or adjacent to complementarity-determining regions. To efficiently resolve the sequence in these regions, small phage-displayed libraries were generated and subjected to antigen binding selection. Following enrichment of antigen-binding clones, 2 clones were selected for each antibody and recombinantly expressed as antigen-binding fragments (Fabs). In both cases, the reverse-engineered Fabs exhibited identical antigen binding affinity, within error, as Fabs produced from the commercial IgGs. This combination of proteomic and protein engineering techniques provides a useful approach to simplifying the technically challenging process of reverse engineering monoclonal antibodies from protein material. PMID:26852694
Bacterial diversity in the oral cavity of ten healthy individuals

PubMed Central

Bik, Elisabeth M.; Long, Clara Davis; Armitage, Gary C.; Loomer, Peter; Emerson, Joanne; Mongodin, Emmanuel F.; Nelson, Karen E.; Gill, Steven R.; Fraser-Liggett, Claire M.; Relman, David A.

2010-01-01

The composition of the oral microbiota from 10 individuals with healthy oral tissues was determined using culture-independent techniques. From each individual, 26 specimens, each from different oral sites at a single point in time, were collected and pooled. An eleventh pool was constructed using portions of the subgingival specimens from all 10 individuals. The 16S rRNA gene was amplified using broad-range bacterial primers, and clone libraries from the individual and subgingival pools were constructed. From a total of 11 368 high-quality, non-chimeric, near full-length sequences, 247 species-level phylotypes (using a 99% sequence identity threshold) and 9 bacteria phyla were identified. At least 15 bacterial genera were conserved among all 10 individuals, with significant interindividual differences at the species and strain level. Comparisons of these oral bacterial sequences to near full-length sequences found previously in the large intestines and feces of other healthy individuals suggest that the mouth and intestinal tract harbor distinct sets of bacteria. Co-occurrence analysis demonstrated significant segregation of taxa when community membership was examined at the level of genus, but not at the level of species, suggesting that ecologically-significant, competitive interactions are more apparent at a broader taxonomic level than species. This study is one of the more comprehensive, high-resolution analyses of bacterial diversity within the healthy human mouth to date, and highlights the value of tools from macroecology for enhancing our understanding of bacterial ecology in human health. PMID:20336157
Grizzly bear corticosteroid binding globulin: Cloning and serum protein expression.

PubMed

Chow, Brian A; Hamilton, Jason; Alsop, Derek; Cattet, Marc R L; Stenhouse, Gordon; Vijayan, Mathilakath M

2010-06-01

Serum corticosteroid levels are routinely measured as markers of stress in wild animals. However, corticosteroid levels rise rapidly in response to the acute stress of capture and restraint for sampling, limiting its use as an indicator of chronic stress. We hypothesized that serum corticosteroid binding globulin (CBG), the primary transport protein for corticosteroids in circulation, may be a better marker of the stress status prior to capture in grizzly bears (Ursus arctos). To test this, a full-length CBG cDNA was cloned and sequenced from grizzly bear testis and polyclonal antibodies were generated for detection of this protein in bear sera. The deduced nucleotide and protein sequences were 1218 bp and 405 amino acids, respectively. Multiple sequence alignments showed that grizzly bear CBG (gbCBG) was 90% and 83% identical to the dog CBG nucleotide and amino acid sequences, respectively. The affinity purified rabbit gbCBG antiserum detected grizzly bear but not human CBG. There were no sex differences in serum total cortisol concentration, while CBG expression was significantly higher in adult females compared to males. Serum cortisol levels were significantly higher in bears captured by leg-hold snare compared to those captured by remote drug delivery from helicopter. However, serum CBG expression between these two groups did not differ significantly. Overall, serum CBG levels may be a better marker of chronic stress, especially because this protein is not modulated by the stress of capture and restraint in grizzly bears. Copyright 2010 Elsevier Inc. All rights reserved.
Sugarcane transgenics expressing MYB transcription factors show improved glucose release

DOE PAGES

Poovaiah, Charleson R.; Bewg, William P.; Lan, Wu; ...

2016-07-15

In this study, sugarcane, a tropical C4 perennial crop, is capable of producing 30-100 tons or more of biomass per hectare annually. The lignocellulosic residue remaining after sugar extraction is currently underutilized and can provide a significant source of biomass for the production of second-generation bioethanol. As a result, MYB31 and MYB42 were cloned from maize and expressed in sugarcane with and without the UTR sequences. The cloned sequences were 98 and 99 % identical to the published nucleotide sequences. The inclusion of the UTR sequences did not affect any of the parameters tested. There was little difference in plantmore » height and the number of internodes of the MYB-overexpressing sugarcane plants when compared with controls. MYB transgene expression determined by qPCR exhibited continued expression in young and maturing internodes. MYB31 downregulated more genes within the lignin biosynthetic pathway than MYB42. MYB31 and MYB42 expression resulted in decreased lignin content in some lines. All MYB42 plants further analyzed showed significant increases in glucose release by enzymatic hydrolysis in 72 h, whereas only two MYB31 plants released more glucose than control plants. This correlated directly with a significant decrease in acid-insoluble lignin. Soluble sucrose content of the MYB42 transgenic plants did not vary compared to control plants. In conclusion, this study demonstrates the use of MYB transcription factors to improve the production of bioethanol from sugarcane bagasse remaining after sugar extraction.« less
Sugarcane transgenics expressing MYB transcription factors show improved glucose release

DOE Office of Scientific and Technical Information (OSTI.GOV)

Poovaiah, Charleson R.; Bewg, William P.; Lan, Wu

In this study, sugarcane, a tropical C4 perennial crop, is capable of producing 30-100 tons or more of biomass per hectare annually. The lignocellulosic residue remaining after sugar extraction is currently underutilized and can provide a significant source of biomass for the production of second-generation bioethanol. As a result, MYB31 and MYB42 were cloned from maize and expressed in sugarcane with and without the UTR sequences. The cloned sequences were 98 and 99 % identical to the published nucleotide sequences. The inclusion of the UTR sequences did not affect any of the parameters tested. There was little difference in plantmore » height and the number of internodes of the MYB-overexpressing sugarcane plants when compared with controls. MYB transgene expression determined by qPCR exhibited continued expression in young and maturing internodes. MYB31 downregulated more genes within the lignin biosynthetic pathway than MYB42. MYB31 and MYB42 expression resulted in decreased lignin content in some lines. All MYB42 plants further analyzed showed significant increases in glucose release by enzymatic hydrolysis in 72 h, whereas only two MYB31 plants released more glucose than control plants. This correlated directly with a significant decrease in acid-insoluble lignin. Soluble sucrose content of the MYB42 transgenic plants did not vary compared to control plants. In conclusion, this study demonstrates the use of MYB transcription factors to improve the production of bioethanol from sugarcane bagasse remaining after sugar extraction.« less
The complete genome sequence of a south Indian isolate of Rice tungro spherical virus reveals evidence of genetic recombination between distinct isolates.

PubMed

Sailaja, B; Anjum, Najreen; Patil, Yogesh K; Agarwal, Surekha; Malathi, P; Krishnaveni, D; Balachandran, S M; Viraktamath, B C; Mangrauthia, Satendra K

2013-12-01

In this study, complete genome of a south Indian isolate of Rice tungro spherical virus (RTSV) from Andhra Pradesh (AP) was sequenced, and the predicted amino acid sequence was analysed. The RTSV RNA genome consists of 12,171 nt without the poly(A) tail, encoding a putative typical polyprotein of 3,470 amino acids. Furthermore, cleavage sites and sequence motifs of the polyprotein were predicted. Multiple alignment with other RTSV isolates showed a nucleotide sequence identity of 95% to east Indian isolates and 90% to Philippines isolates. A phylogenetic tree based on complete genome sequence showed that Indian isolates clustered together, while Vt6 and PhilA isolates of Philippines formed two separate clusters. Twelve recombination events were detected in RNA genome of RTSV using the Recombination Detection Program version 3. Recombination analysis suggested significant role of 5' end and central region of genome in virus evolution. Further, AP and Odisha isolates appeared as important RTSV isolates involved in diversification of this virus in India through recombination phenomenon. The new addition of complete genome of first south Indian isolate provided an opportunity to establish the molecular evolution of RTSV through recombination analysis and phylogenetic relationship.
Identification of Mycobacterium spp. of veterinary importance using rpoB gene sequencing

PubMed Central

2011-01-01

Background Studies conducted on Mycobacterium spp. isolated from human patients indicate that sequencing of a 711 bp portion of the rpoB gene can be useful in assigning a species identity, particularly for members of the Mycobacterium avium complex (MAC). Given that MAC are important pathogens in livestock, companion animals, and zoo/exotic animals, we were interested in evaluating the use of rpoB sequencing for identification of Mycobacterium isolates of veterinary origin. Results A total of 386 isolates, collected over 2008 - June 2011 from 378 animals (amphibians, reptiles, birds, and mammals) underwent PCR and sequencing of a ~ 711 bp portion of the rpoB gene; 310 isolates (80%) were identified to the species level based on similarity at ≥ 98% with a reference sequence. The remaining 76 isolates (20%) displayed < 98% similarity with reference sequences and were assigned to a clade based on their location in a neighbor-joining tree containing reference sequences. For a subset of 236 isolates that received both 16S rRNA and rpoB sequencing, 167 (70%) displayed a similar species/clade assignation for both sequencing methods. For the remaining 69 isolates, species/clade identities were different with each sequencing method. Mycobacterium avium subsp. hominissuis was the species most frequently isolated from specimens from pigs, cervids, companion animals, cattle, and exotic/zoo animals. Conclusions rpoB sequencing proved useful in identifying Mycobacterium isolates of veterinary origin to clade, species, or subspecies levels, particularly for assemblages (such as the MAC) where 16S rRNA sequencing alone is not adequate to demarcate these taxa. rpoB sequencing can represent a cost-effective identification tool suitable for routine use in the veterinary diagnostic laboratory. PMID:22118247
Chameleon sequences in neurodegenerative diseases.

PubMed

Bahramali, Golnaz; Goliaei, Bahram; Minuchehr, Zarrin; Salari, Ali

2016-03-25

Chameleon sequences can adopt either alpha helix sheet or a coil conformation. Defining chameleon sequences in PDB (Protein Data Bank) may yield to an insight on defining peptides and proteins responsible in neurodegeneration. In this research, we benefitted from the large PDB and performed a sequence analysis on Chameleons, where we developed an algorithm to extract peptide segments with identical sequences, but different structures. In order to find new chameleon sequences, we extracted a set of 8315 non-redundant protein sequences from the PDB with an identity less than 25%. Our data was classified to "helix to strand (HE)", "helix to coil (HC)" and "strand to coil (CE)" alterations. We also analyzed the occurrence of singlet and doublet amino acids and the solvent accessibility in the chameleon sequences; we then sorted out the proteins with the most number of chameleon sequences and named them Chameleon Flexible Proteins (CFPs) in our dataset. Our data revealed that Gly, Val, Ile, Tyr and Phe, are the major amino acids in Chameleons. We also found that there are proteins such as Insulin Degrading Enzyme IDE and GTP-binding nuclear protein Ran (RAN) with the most number of chameleons (640 and 405 respectively). These proteins have known roles in neurodegenerative diseases. Therefore it can be inferred that other CFP's can serve as key proteins in neurodegeneration, and a study on them can shed light on curing and preventing neurodegenerative diseases. Copyright © 2016 Elsevier Inc. All rights reserved.
Chameleon sequences in neurodegenerative diseases

DOE Office of Scientific and Technical Information (OSTI.GOV)

Bahramali, Golnaz; Goliaei, Bahram, E-mail: goliaei@ut.ac.ir; Minuchehr, Zarrin, E-mail: minuchehr@nigeb.ac.ir

2016-03-25

Chameleon sequences can adopt either alpha helix sheet or a coil conformation. Defining chameleon sequences in PDB (Protein Data Bank) may yield to an insight on defining peptides and proteins responsible in neurodegeneration. In this research, we benefitted from the large PDB and performed a sequence analysis on Chameleons, where we developed an algorithm to extract peptide segments with identical sequences, but different structures. In order to find new chameleon sequences, we extracted a set of 8315 non-redundant protein sequences from the PDB with an identity less than 25%. Our data was classified to “helix to strand (HE)”, “helix tomore » coil (HC)” and “strand to coil (CE)” alterations. We also analyzed the occurrence of singlet and doublet amino acids and the solvent accessibility in the chameleon sequences; we then sorted out the proteins with the most number of chameleon sequences and named them Chameleon Flexible Proteins (CFPs) in our dataset. Our data revealed that Gly, Val, Ile, Tyr and Phe, are the major amino acids in Chameleons. We also found that there are proteins such as Insulin Degrading Enzyme IDE and GTP-binding nuclear protein Ran (RAN) with the most number of chameleons (640 and 405 respectively). These proteins have known roles in neurodegenerative diseases. Therefore it can be inferred that other CFP's can serve as key proteins in neurodegeneration, and a study on them can shed light on curing and preventing neurodegenerative diseases.« less
Protecting genomic sequence anonymity with generalization lattices.

PubMed

Malin, B A

2005-01-01

Current genomic privacy technologies assume the identity of genomic sequence data is protected if personal information, such as demographics, are obscured, removed, or encrypted. While demographic features can directly compromise an individual's identity, recent research demonstrates such protections are insufficient because sequence data itself is susceptible to re-identification. To counteract this problem, we introduce an algorithm for anonymizing a collection of person-specific DNA sequences. The technique is termed DNA lattice anonymization (DNALA), and is based upon the formal privacy protection schema of k -anonymity. Under this model, it is impossible to observe or learn features that distinguish one genetic sequence from k-1 other entries in a collection. To maximize information retained in protected sequences, we incorporate a concept generalization lattice to learn the distance between two residues in a single nucleotide region. The lattice provides the most similar generalized concept for two residues (e.g. adenine and guanine are both purines). The method is tested and evaluated with several publicly available human population datasets ranging in size from 30 to 400 sequences. Our findings imply the anonymization schema is feasible for the protection of sequences privacy. The DNALA method is the first computational disclosure control technique for general DNA sequences. Given the computational nature of the method, guarantees of anonymity can be formally proven. There is room for improvement and validation, though this research provides the groundwork from which future researchers can construct genomics anonymization schemas tailored to specific datasharing scenarios.
CRITICA: coding region identification tool invoking comparative analysis

NASA Technical Reports Server (NTRS)

Badger, J. H.; Olsen, G. J.; Woese, C. R. (Principal Investigator)

1999-01-01

Gene recognition is essential to understanding existing and future DNA sequence data. CRITICA (Coding Region Identification Tool Invoking Comparative Analysis) is a suite of programs for identifying likely protein-coding sequences in DNA by combining comparative analysis of DNA sequences with more common noncomparative methods. In the comparative component of the analysis, regions of DNA are aligned with related sequences from the DNA databases; if the translation of the aligned sequences has greater amino acid identity than expected for the observed percentage nucleotide identity, this is interpreted as evidence for coding. CRITICA also incorporates noncomparative information derived from the relative frequencies of hexanucleotides in coding frames versus other contexts (i.e., dicodon bias). The dicodon usage information is derived by iterative analysis of the data, such that CRITICA is not dependent on the existence or accuracy of coding sequence annotations in the databases. This independence makes the method particularly well suited for the analysis of novel genomes. CRITICA was tested by analyzing the available Salmonella typhimurium DNA sequences. Its predictions were compared with the DNA sequence annotations and with the predictions of GenMark. CRITICA proved to be more accurate than GenMark, and moreover, many of its predictions that would seem to be errors instead reflect problems in the sequence databases. The source code of CRITICA is freely available by anonymous FTP (rdp.life.uiuc.edu in/pub/critica) and on the World Wide Web (http:/(/)rdpwww.life.uiuc.edu).
Sequencing of diverse mandarin, pummelo and orange genomes reveals complex history of admixture during citrus domestication

USDA-ARS?s Scientific Manuscript database

Cultivated citrus are selections from, or hybrids of, wild progenitor species whose identities and contributions to citrus domestication remain controversial. Here we sequence and compare citrus genomes—a high-quality reference haploid clementine genome and mandarin, pummelo, sweet-orange and sour-o...
The complete mitochondrial genome of Haliotis laevigata (Gastropoda: Haliotidae) using MiSeq and HiSeq sequencing.

PubMed

Robinson, Nick A; Hall, Nathan E; Ross, Elizabeth M; Cooke, Ira R; Shiel, Brett P; Robinson, Andrew J; Strugnell, Jan M

2016-01-01

The mitochondrial genome of greenlip abalone, Haliotis laevigata, is reported. MiSeq and HiSeq sequencing of one individual was assembled to yield a single 16,545 bp contig. The sequence shares 92% identity to the H. rubra mitochondrial genome (a closely related species that hybridize with H. laevigata in the wild). The sequence will be useful for determining the maternal contribution to hybrid populations, for investigating population structure and stock-enhancement effectiveness.
Some special values of vertices of trees on the suborbital graphs

NASA Astrophysics Data System (ADS)

Deǧer, A. H.; Akbaba, Ü.

2018-01-01

In the present study, the action of a congruence subgroup of S L(2, Z) on ℚ ^ is examined. From this action and its properties, vertices of paths of minimal length on the suborbital graph Fu,N give rise to some special sequence values, that are alternate sequences such as identity, Fibonacci and Lucas sequences. These types of vertices also give rise to special continued fractions, hence from recurrence relations for continued fractions, values of these vertices and values of special sequences were associated.

Structural comparisons of two allelic variants of human placental alkaline phosphatase.

PubMed

Millán, J L; Stigbrand, T; Jörnvall, H

1985-01-01

A simple immunosorbent purification scheme based on monoclonal antibodies has been devised for human placental alkaline phosphatase. The two most common allelic variants, S and F, have similar amino acid compositions with identical N-terminal amino acid sequences through the first 13 residues. Both variants have identical lectin binding properties towards concanavalin A, lentil-lectin, wheat germ agglutinin, phytohemagglutinin and soybean agglutinin, and identical carbohydrate contents as revealed by methylation analysis. CNBr fragments of the variants demonstrate identical high performance liquid chromatography patterns. The carbohydrate containing fragment is different from the 32P-labeled active site fragment and the N-terminal fragment.
Transcriptome analysis in Concholepas concholepas (Gastropoda, Muricidae): mining and characterization of new genomic and molecular markers.

PubMed

Cárdenas, Leyla; Sánchez, Roland; Gomez, Daniela; Fuenzalida, Gonzalo; Gallardo-Escárate, Cristián; Tanguy, Arnaud

2011-09-01

The marine gastropod Concholepas concholepas, locally known as the "loco", is the main target species of the benthonic Chilean fisheries. Genetic and genomic tools are necessary to study the genome of this species in order to understand the molecular basis of its development, growth, and other key traits to improve the management strategies and to identify local adaptation to prevent loss of biodiversity. Here, we use pyrosequencing technologies to generate the first transcriptomic database from adult specimens of the loco. After trimming, a total of 140,756 Expressed Sequence Tag sequences were achieved. Clustering and assembly analysis identified 19,219 contigs and 105,435 singleton sequences. BlastN analysis showed a significant identity with Expressed Sequence Tags of different gastropod species available in public databases. Similarly, BlastX results showed that only 895 out of the total 124,654 had significant hits and may represent novel genes for marine gastropods. From this database, simple sequence repeat motifs were also identified and a total of 38 primer pairs were designed and tested to assess their potential as informative markers and to investigate their cross-species amplification in different related gastropod species. This dataset represents the first publicly available 454 data for a marine gastropod endemic to the southeastern Pacific coast, providing a valuable transcriptomic resource for future efforts of gene discovery and development of functional markers in other marine gastropods. Copyright © 2011 Elsevier B.V. All rights reserved.
General overview on structure prediction of twilight-zone proteins.

PubMed

Khor, Bee Yin; Tye, Gee Jun; Lim, Theam Soon; Choong, Yee Siew

2015-09-04

Protein structure prediction from amino acid sequence has been one of the most challenging aspects in computational structural biology despite significant progress in recent years showed by critical assessment of protein structure prediction (CASP) experiments. When experimentally determined structures are unavailable, the predictive structures may serve as starting points to study a protein. If the target protein consists of homologous region, high-resolution (typically <1.5 Å) model can be built via comparative modelling. However, when confronted with low sequence similarity of the target protein (also known as twilight-zone protein, sequence identity with available templates is less than 30%), the protein structure prediction has to be initiated from scratch. Traditionally, twilight-zone proteins can be predicted via threading or ab initio method. Based on the current trend, combination of different methods brings an improved success in the prediction of twilight-zone proteins. In this mini review, the methods, progresses and challenges for the prediction of twilight-zone proteins were discussed.
TRedD—A database for tandem repeats over the edit distance

PubMed Central

Sokol, Dina; Atagun, Firat

2010-01-01

A ‘tandem repeat’ in DNA is a sequence of two or more contiguous, approximate copies of a pattern of nucleotides. Tandem repeats are common in the genomes of both eukaryotic and prokaryotic organisms. They are significant markers for human identity testing, disease diagnosis, sequence homology and population studies. In this article, we describe a new database, TRedD, which contains the tandem repeats found in the human genome. The database is publicly available online, and the software for locating the repeats is also freely available. The definition of tandem repeats used by TRedD is a new and innovative definition based upon the concept of ‘evolutive tandem repeats’. In addition, we have developed a tool, called TandemGraph, to graphically depict the repeats occurring in a sequence. This tool can be coupled with any repeat finding software, and it should greatly facilitate analysis of results. Database URL: http://tandem.sci.brooklyn.cuny.edu/ PMID:20624712
Unveiling in situ interactions between marine protists and bacteria through single cell sequencing

PubMed Central

Martinez-Garcia, Manuel; Brazel, David; Poulton, Nicole J; Swan, Brandon K; Gomez, Monica Lluesma; Masland, Dashiell; Sieracki, Michael E; Stepanauskas, Ramunas

2012-01-01

Heterotrophic protists are a highly diverse and biogeochemically significant component of marine ecosystems, yet little is known about their species-specific prey preferences and symbiotic interactions in situ. Here we demonstrate how these previously unresolved questions can be addressed by sequencing the eukaryote and bacterial SSU rRNA genes from individual, uncultured protist cells collected from their natural marine environment and sorted by flow cytometry. We detected Pelagibacter ubique in association with a MAST-4 protist, an actinobacterium in association with a chrysophyte and three bacteroidetes in association with diverse protist groups. The presence of identical phylotypes among the putative prey and the free bacterioplankton in the same sample provides evidence for predator–prey interactions. Our results also suggest a discovery of novel symbionts, distantly related to Rickettsiales and the candidate divisions ZB3 and TG2, associated with Cercozoa and Chrysophyta cells. This study demonstrates the power of single cell sequencing to untangle ecological interactions between uncultured protists and prokaryotes. PMID:21938022
The DNA Bank: High-Security Bank Accounts to Protect and Share Your Genetic Identity.

PubMed

den Dunnen, Johan T

2015-07-01

With the cost of genome sequencing decreasing every day, DNA information has the potential of affecting the lives of everyone. Surprisingly, an individual has little knowledge about his own DNA information, can rarely access it, and has hardly any control over its use. This may result in preventable, life-threatening situations, and also significantly inhibits scientific progress. What we urgently need is a "DNA bank," a resource providing a secure personal account where, similar to a financial institution, you can store your DNA sequence. Using this private and secure DNA bank account, you govern your sequence-related business. For any genetic study performed, the data generated must be transferred (paid) to your DNA account. Using your account, you regulate access, knowing for what purpose (informed consent) and only for the genetic data you are willing to share. The DNA account ensures you are in the driver's seat, know what is known, and control what is happening with it. © 2015 WILEY PERIODICALS, INC.
Cloning and characterization of the ddc homolog encoding L-2,4-diaminobutyrate decarboxylase in Enterobacter aerogenes.

PubMed

Yamamoto, S; Mutoh, N; Tsuzuki, D; Ikai, H; Nakao, H; Shinoda, S; Narimatsu, S; Miyoshi, S I

2000-05-01

L-2,4-diaminobutyrate decarboxylase (DABA DC) catalyzes the formation of 1,3-diaminopropane (DAP) from DABA. In the present study, the ddc gene encoding DABA DC from Enterobacter aerogenes ATCC 13048 was cloned and characterized. Determination of the nucleotide sequence revealed an open reading frame of 1470 bp encoding a 53659-Da protein of 490 amino acids, whose deduced NH2-terminal sequence was identical to that of purified DABA DC from E. aerogenes. The deduced amino acid sequence was highly similar to those of Acinetobacter baumannii and Haemophilus influenzae DABA DCs encoded by the ddc genes. The lysine-307 of the E. aerogenes DABA DC was identified as the pyridoxal 5'-phosphate binding residue by site-directed mutagenesis. Furthermore, PCR analysis revealed the distribution of E. aerogenes ddc homologs in some other species of Enterobacteriaceae. Such a relatively wide occurrence of the ddc homologs implies biological significance of DABA DC and its product DAP.
Molecular Simulations of Sequence-Specific Association of Transmembrane Proteins in Lipid Bilayers

NASA Astrophysics Data System (ADS)

Doxastakis, Manolis; Prakash, Anupam; Janosi, Lorant

2011-03-01

Association of membrane proteins is central in material and information flow across the cellular membranes. Amino-acid sequence and the membrane environment are two critical factors controlling association, however, quantitative knowledge on such contributions is limited. In this work, we study the dimerization of helices in lipid bilayers using extensive parallel Monte Carlo simulations with recently developed algorithms. The dimerization of Glycophorin A is examined employing a coarse-grain model that retains a level of amino-acid specificity, in three different phospholipid bilayers. Association is driven by a balance of protein-protein and lipid-induced interactions with the latter playing a major role at short separations. Following a different approach, the effect of amino-acid sequence is studied using the four transmembrane domains of the epidermal growth factor receptor family in identical lipid environments. Detailed characterization of dimer formation and estimates of the free energy of association reveal that these helices present significant affinity to self-associate with certain dimers forming non-specific interfaces.
The ABC transporter Rv1272c of Mycobacterium tuberculosis enhances the import of long-chain fatty acids in Escherichia coli.

PubMed

Martin, Audrey; Daniel, Jaiyanth

2018-02-05

Mycobacterium tuberculosis (Mtb), which causes tuberculosis, is capable of accumulating triacylglycerol (TAG) by utilizing fatty acids from host cells. ATP-binding cassette (ABC) transporters are involved in transport processes in all organisms. Among the classical ABC transporters in Mtb none have been implicated in fatty acid import. Since the transport of fatty acids from the host cell is important for dormancy-associated TAG synthesis in the pathogen, mycobacterial ABC transporter(s) could potentially be involved in this process. Based on sequence identities with a bacterial ABC transporter that mediates fatty acid import for TAG synthesis, we identified Rv1272c, a hitherto uncharacterized ABC-transporter in Mtb that also shows sequence identities with a plant ABC transporter involved in fatty acid transport. We expressed Rv1272c in E. coli and show that it enhances the import of radiolabeled fatty acids. We also show that Rv1272c causes a significant increase in the metabolic incorporation of radiolabeled long-chain fatty acids into cardiolipin, a tetra-acylated phospholipid, and phosphatidylglycerol in E. coli. This is the first report on the function of Rv1272c showing that it displays a long-chain fatty acid transport function. Copyright © 2018 Elsevier Inc. All rights reserved.
Word segmentation in phonemically identical and prosodically different sequences using cochlear implants: A case study.

PubMed

Basirat, Anahita

2017-01-01

Cochlear implant (CI) users frequently achieve good speech understanding based on phoneme and word recognition. However, there is a significant variability between CI users in processing prosody. The aim of this study was to examine the abilities of an excellent CI user to segment continuous speech using intonational cues. A post-lingually deafened adult CI user and 22 normal hearing (NH) subjects segmented phonemically identical and prosodically different sequences in French such as 'l'affiche' (the poster) versus 'la fiche' (the sheet), both [lafiʃ]. All participants also completed a minimal pair discrimination task. Stimuli were presented in auditory-only and audiovisual presentation modalities. The performance of the CI user in the minimal pair discrimination task was 97% in the auditory-only and 100% in the audiovisual condition. In the segmentation task, contrary to the NH participants, the performance of the CI user did not differ from the chance level. Visual speech did not improve word segmentation. This result suggests that word segmentation based on intonational cues is challenging when using CIs even when phoneme/word recognition is very well rehabilitated. This finding points to the importance of the assessment of CI users' skills in prosody processing and the need for specific interventions focusing on this aspect of speech communication.
Comparative proteomic analysis of male and female venoms from the Cuban scorpion Rhopalurus junceus.

PubMed

Rodríguez-Ravelo, Rodolfo; Batista, Cesar V F; Coronas, Fredy I V; Zamudio, Fernando Z; Hernández-Orihuela, Lorena; Espinosa-López, Georgina; Ruiz-Urquiola, Ariel; Possani, Lourival D

2015-12-01

A complete mass spectrometry analysis of venom components from male and female scorpions of the species Rhophalurus junceus of Cuba is reported. In the order of 200 individual molecular masses were identified in both venoms, from which 63 are identical in male and females genders. It means that a significant difference of venom components exists between individuals of different sexes, but the most abundant components are present in both sexes. The relative abundance of identical components is different among the genders. Three well defined groups of different peptides were separated and identified. The first group corresponds to peptides with molecular masses of 1000-2000 Da; the second to peptides with 3500-4500 Da molecular weight, and the third with 6500-8000 Da molecular weights. A total of 86 peptides rich in disulfide bridges were found in the venoms, 27 with three disulfide bridges and 59 with four disulfide bridges. LC-MS/MS analysis allowed the identification and amino acid sequence determination of 31 novel peptides in male venom. Two new putative K(+)-channel peptides were sequences by Edman degradation. They contain 37 amino acid residues, packed by three disulfide bridges and were assigned the systematic numbers: α-KTx 1.18 and α-KTx 2.15. Copyright © 2015 Elsevier Ltd. All rights reserved.
Molecular evolution of type 2 porcine reproductive and respiratory syndrome viruses circulating in Vietnam from 2007 to 2015.

PubMed

Do, Hai Quynh; Trinh, Dinh Thau; Nguyen, Thi Lan; Vu, Thi Thu Hang; Than, Duc Duong; Van Lo, Thi; Yeom, Minjoo; Song, Daesub; Choe, SeEun; An, Dong-Jun; Le, Van Phan

2016-11-17

Porcine respiratory and reproductive syndrome (PRRS) virus is one of the most economically significant pathogens in the Vietnamese swine industry. ORF5, which participates in many functional processes, including virion assembly, entry of the virus into the host cell, and viral adaptation to the host immune response, has been widely used in molecular evolution and phylogeny studies. Knowing of molecular evolution of PRRSV fields strains might contribute to PRRS control in Vietnam. The results showed that phylogenetic analysis indicated that all strains belonged to sub-lineages 8.7 and 5.1. The nucleotide and amino acid identities between strains were 84.5-100% and 82-100%, respectively. Furthermore, the results revealed differences in nucleotide and amino acid identities between the 2 sub-lineage groups. N-glycosylation prediction identified 7 potential N-glycosylation sites and 11 glycotypes. Analyses of the GP5 sequences, revealed 7 sites under positive selective pressure and 25 under negative selective pressure. Phylogenetic analysis based on ORF5 sequence indicated the diversity of PRRSV in Vietnam. Furthermore, the variance of N-glycosylation sites and position under selective pressure were demonstrated. This study expands existing knowledge on the genetic diversity and evolution of PRRSV in Vietnam and assists the effective strategies for PRRS vaccine development in Vietnam.
Evolution of heliobacteria: implications for photosynthetic reaction center complexes

NASA Technical Reports Server (NTRS)

Vermaas, W. F.; Blankenship, R. E. (Principal Investigator)

1994-01-01

The evolutionary position of the heliobacteria, a group of green photosynthetic bacteria with a photosynthetic apparatus functionally resembling Photosystem I of plants and cyanobacteria, has been investigated with respect to the evolutionary relationship to Gram-positive bacteria and cyanobacteria. On the basis of 16S rRNA sequence analysis, the heliobacteria appear to be most closely related to Gram-positive bacteria, but also an evolutionary link to cyanobacteria is evident. Interestingly, a 46-residue domain including the putative sixth membrane-spanning region of the heliobacterial reaction center protein show rather strong similarity (33% identity and 72% similarity) to a region including the sixth membrane-spanning region of the CP47 protein, a chlorophyll-binding core antenna polypeptide of Photosystem II. The N-terminal half of the heliobacterial reaction center polypeptide shows a moderate sequence similarity (22% identity over 232 residues) with the CP47 protein, which is significantly more than the similarity with the Photosystem I core polypeptides in this region. An evolutionary model for photosynthetic reaction center complexes is discussed, in which an ancestral homodimeric reaction center protein (possibly resembling the heliobacterial reaction center protein) with 11 membrane-spanning regions per polypeptide has diverged to give rise to the core of Photosystem I, Photosystem II, and of the photosynthetic apparatus in green, purple, and heliobacteria.
Determination of the DNA-binding kinetics of three related but heteroimmune bacteriophage repressors using EMSA and SPR analysis

PubMed Central

Henriksson-Peltola, Petri; Sehlén, Wilhelmina; Haggård-Ljungquist, Elisabeth

2007-01-01

Bacteriophages P2, P2 Hy dis and WΦ are very similar but heteroimmune Escherichia coli phages. The structural genes show over 96% identity, but the repressors show between 43 and 63% identities. Furthermore, the operators, which contain two directly repeated sequences, vary in sequence, length, location relative to the promoter and spacing between the direct repeats. We have compared the in vivo effects of the wild type and mutated operators on gene expression with the complexes formed between the repressors and their wild type or mutated operators using electrophoretic mobility shift assay (EMSA), and real-time kinetics of the protein–DNA interactions using surface plasmon resonance (SPR) analysis. Using EMSA, the repressors formed different protein–DNA complexes, and only WΦ was significantly affected by point mutations. However, SPR analysis showed a reduced association rate constant and an increased dissociation rate constant for P2 and WΦ operator mutants. The association rate constants of P2 Hy dis was too fast to be determined. The P2 Hy dis dissociation response curves were shown to be triphasic, while both P2 and WΦ C were biphasic. Thus, the kinetics of complex formation and the nature of the complexes formed differ extensively between these very closely related phages. PMID:17412705
Phenotypic variation in a significant spore character in Kudoa (Myxosporea: Multivalvulida) species infecting brain tissue.

PubMed

Burger, Mieke A A; Adlard, Robert D

2010-10-01

Some Kudoa species display variations in the number of polar capsules in spores within an individual pseudocyst. Nonetheless, there is usually a dominant morphotype which forms a significant element of diagnosis. In 2007, a Kudoa isolate from whiting (spores with 5 (dominant) or 6 (minor) polar capsules) was characterized by Burger et al. (2007) as being 100% identical in SSU rDNA to Kudoa yasunagai (spores with 7 polar capsules) from a halibut, despite its obvious morphological differences. The authors hypothesized that either SSU rDNA had reached its level of resolution or that the genetic identity revealed conspecificity. To further investigate these hypotheses, SSU and LSU rDNA sequence data were coupled with principal components, correlation, and regression analyses of morphometric data from different kudoid isolates that infect brain tissue to determine the relationships between spore morphotypes and different kudoid isolates. The trends in morphometrics between the spores of particular isolates were so similar that it was concluded that the molecular results did indicate conspecificity rather than SSU reaching its level of resolution. This phenotypic influence on a significant diagnostic character within the Kudoidae has a major impact on the diagnosis of this, and potentially other, pathogenic species.
Genetic diversity in Trichomonas vaginalis.

PubMed

Meade, John C; Carlton, Jane M

2013-09-01

Recent advances in genetic characterisation of Trichomonas vaginalis isolates show that the extensive clinical variability in trichomoniasis and its disease sequelae are matched by significant genetic diversity in the organism itself, suggesting a connection between the genetic identity of isolates and their clinical manifestations. Indeed, a high degree of genetic heterogeneity in T vaginalis isolates has been observed using multiple genotyping techniques. A unique two-type population structure that is both local and global in distribution has been identified, and there is evidence of recombination within each group, although sexual recombination between the groups appears to be constrained. There is conflicting evidence in these studies for correlations between T vaginalis genetic identity and clinical presentation, metronidazole susceptibility, and the presence of T vaginalis virus, underscoring the need for adoption of a common standard for genotyping the parasite. Moving forward, microsatellite genotyping and multilocus sequence typing are the most robust techniques for future investigations of T vaginalis genotype-phenotype associations.
Identification of largemouth bass virus in the introduced Northern Snakehead inhabiting the Chesapeake Bay watershed.

PubMed

Iwanowicz, L; Densmore, C; Hahn, C; McAllister, P; Odenkirk, J

2013-09-01

The Northern Snakehead Channa argus is an introduced species that now inhabits the Chesapeake Bay. During a preliminary survey for introduced pathogens possibly harbored by these fish in Virginia waters, a filterable agent was isolated from five specimens that produced cytopathic effects in BF-2 cells. Based on PCR amplification and partial sequencing of the major capsid protein (MCP), DNA polymerase (DNApol), and DNA methyltransferase (Mtase) genes, the isolates were identified as Largemouth Bass virus (LMBV). Nucleotide sequences of the MCP (492 bp) and DNApol (419 pb) genes were 100% identical to those of LMBV. The nucleotide sequence of the Mtase (206 bp) gene was 99.5% identical to that of LMBV, and the single nucleotide substitution did not lead to a predicted amino acid coding change. This is the first report of LMBV from the Northern Snakehead, and provides evidence that noncentrarchid fishes may be susceptible to this virus.
Partial structure of the phylloxin gene from the giant monkey frog, Phyllomedusa bicolor: parallel cloning of precursor cDNA and genomic DNA from lyophilized skin secretion.

PubMed

Chen, Tianbao; Gagliardo, Ron; Walker, Brian; Zhou, Mei; Shaw, Chris

2005-12-01

Phylloxin is a novel prototype antimicrobial peptide from the skin of Phyllomedusa bicolor. Here, we describe parallel identification and sequencing of phylloxin precursor transcript (mRNA) and partial gene structure (genomic DNA) from the same sample of lyophilized skin secretion using our recently-described cloning technique. The open-reading frame of the phylloxin precursor was identical in nucleotide sequence to that previously reported and alignment with the nucleotide sequence derived from genomic DNA indicated the presence of a 175 bp intron located in a near identical position to that found in the dermaseptins. The highly-conserved structural organization of skin secretion peptide genes in P. bicolor can thus be extended to include that encoding phylloxin (plx). These data further reinforce our assertion that application of the described methodology can provide robust genomic/transcriptomic/peptidomic data without the need for specimen sacrifice.
Complete sequence of RNA3 of Cucumber mosaic virus isolates infecting Gerbera jamesonii suggests its grouping under IB subgroup.

PubMed

Gautum, K K; Raj, R; Kumar, S; Raj, S K; Roy, R K; Katiyar, R

2014-01-01

The complete RNA3 genome of Cucumber mosaic virus (CMV) was amplified by RT-PCR from three infected gerbera (Gerbera jamesonii) leaf samples exhibiting severe chlorotic mosaic and flower deformation symptoms. The amplicons obtained were cloned sequenced and deposited in GenBank under the accessions JN692495, JX913531 (from cv. Zingaro) and JX888093 (from cv. Silvester). These sequences shared 98-99 % identities to each other and with a strain of CMV-Banana reported from India, and 90-95 % identities with various strains of CMV reported worldwide. Phylogenetic analysis revealed their closest affinity with CMV-Banana strain, and close relationships with several other strains of CMV of subgroup IB. This study provides evidence of subgroup IB CMV causing severe chlorosis and flower deformation in two cultivars (Zingaro and Silvester) of G. jamesonii in India.
Identification of largemouth bass virus in the introduced Northern snakehead inhabiting the Cheasapeake Bay watershed

USGS Publications Warehouse

Iwanowicz, Luke R.; Densmore, Christine L.; Hahn, Cassidy M.; McAllister, Phillip; Odenkirk, John

2013-01-01

The Northern Snakehead Channa argus is an introduced species that now inhabits the Chesapeake Bay. During a preliminary survey for introduced pathogens possibly harbored by these fish in Virginia waters, a filterable agent was isolated from five specimens that produced cytopathic effects in BF-2 cells. Based on PCR amplification and partial sequencing of the major capsid protein (MCP), DNA polymerase (DNApol), and DNA methyltransferase (Mtase) genes, the isolates were identified as Largemouth Bass virus (LMBV). Nucleotide sequences of the MCP (492 bp) and DNApol (419 pb) genes were 100% identical to those of LMBV. The nucleotide sequence of the Mtase (206 bp) gene was 99.5% identical to that of LMBV, and the single nucleotide substitution did not lead to a predicted amino acid coding change. This is the first report of LMBV from the Northern Snakehead, and provides evidence that noncentrarchid fishes may be susceptible to this virus.

Identification of the allergen Psi c 2 from the basidiomycete Psilocybe cubensis as a fungal cyclophilin.

PubMed

Horner, W E; Reese, G; Lehrer, S B

1995-01-01

Basidiospores are a prevalent and frequent cause of respiratory allergies, yet their allergens remain poorly defined; thus, we have attempted a molecular characterization of representative basidiomycete allergens. A Psilocybe cubensis mycelial cDNA library was immunoscreened with patient serum. A clone was isolated that expressed a 23-kD recombinant allergen as a fusion protein and inhibited a 16-kD band (Psi c 2) in immunoprints of P. cubenis extract, indicating antigenic identity. Sequence (cDNA) analysis of the clone indicates homology with cyclophilin and the deduced amino acid sequence of Psi c 2 showed 78% identity and 4% similarity with the amino acid sequence of Schizosaccharomyces pombe cyclophilin. This recombinant allergen is a useful model for epitope analysis of basidiospore allergens and fungal allergen cross-reactivity, and may provide an improved reagent for basidiospore allergy diagnosis and treatment.
Characterization of Zea mays endosperm C-24 sterol methyltransferase: one of two types of sterol methyltransferase in higher plants.

PubMed

Grebenok, R J; Galbraith, D W; Penna, D D

1997-08-01

We report the characterization of a higher-plant C-24 sterol methyltransferase by yeast complementation. A Zea mays endosperm expressed sequence tag (EST) was identified which, upon complete sequencing, showed 46% identity to the yeast C-24 methyltransferase gene (ERG6) and 75% and 37% amino acid identity to recently isolated higher-plant sterol methyltransferases from soybean and Arabidopsis, respectively. When placed under GALA regulation, the Z. mays cDNA functionally complemented the erg6 mutation, restoring ergosterol production and conferring resistance to cycloheximide. Complementation was both plasmid-dependent and galactose-inducible. The Z. mays cDNA clone contains an open reading frame encoding a 40 kDa protein containing motifs common to a large number of S-adenosyl-L-methionine methyltransferases (SMTs). Sequence comparisons and functional studies of the maize, soybean and Arabidopsis cDNAs indicates two types of C-24 SMTs exist in higher plants.
Amino acid sequence of human cholinesterase. Annual report, 30 September 1984-30 September 1985

DOE Office of Scientific and Technical Information (OSTI.GOV)

Lockridge, O.

1985-10-01

The active-site serine residue is located 198 amino acids from the N-terminal. The active-site peptide was isolated from three different genetic types of human serum cholinesterase: from usual, atypical, and atypical-silent genotypes. It was found that the amino acid sequence of the active-site peptide was identical in all three genotypes. Comparison of the complete sequences of cholinesterase from human serum and acetylcholinesterase from the electric organ of Torpedo californica shows an identity of 53%. Cholinesterase is of interest to the Department of Defense because cholinesterase protects against organophosphate poisons of the type used in chemical warfare. The structural results presentedmore » here will serve as the basis for cloning the gene for cholinesterase. The potential uses of large amounts of cholinesterase would be for cleaning up spills of organophosphates and possibly for detoxifying exposed personnel.« less
Isolation, sequence, and characterization of the Cercospora nicotianae phytoene dehydrogenase gene.

PubMed Central

Ehrenshaft, M; Daub, M E

1994-01-01

We have cloned and sequenced the Cercospora nicotianae gene for the carotenoid biosynthetic enzyme phytoene dehydrogenase. Analysis of the derived amino acid sequence revealed it has greater than 50% identity with its counterpart in Neurospora crassa and approximately 30% identity with prokaryotic phytoene dehydrogenases and is related, but more distantly, to phytoene dehydrogenases from plants and cyanobacteria. Our analysis confirms that phytoene dehydrogenase proteins fall into two groups: those from plants and cyanobacteria and those from eukaryotic and noncyanobacter prokaryotic microbes. Southern analysis indicated that the C. nicotianae phytoene dehydrogenase gene is present in a single copy. Extraction of beta-carotene, the sole carotenoid accumulated by C. nicotianae, showed that both light- and dark-grown cultures synthesize carotenoids, but higher levels accumulate in the light. Northern (RNA) analysis of poly(A)+ RNA, however, showed no differential accumulation of phytoene dehydrogenase mRNA between light- and dark-grown fungal cultures. Images PMID:8085820
Effects of a Non-Conservative Sequence on the Properties of β-glucuronidase from Aspergillus terreus Li-20

PubMed Central

Liu, Yanli; Huangfu, Jie; Qi, Feng; Kaleem, Imdad; E, Wenwen; Li, Chun

2012-01-01

We cloned the β-glucuronidase gene (AtGUS) from Aspergillus terreus Li-20 encoding 657 amino acids (aa), which can transform glycyrrhizin into glycyrrhetinic acid monoglucuronide (GAMG) and glycyrrhetinic acid (GA). Based on sequence alignment, the C-terminal non-conservative sequence showed low identity with those of other species; thus, the partial sequence AtGUS(-3t) (1–592 aa) was amplified to determine the effects of the non-conservative sequence on the enzymatic properties. AtGUS and AtGUS(-3t) were expressed in E. coli BL21, producing AtGUS-E and AtGUS(-3t)-E, respectively. At the similar optimum temperature (55°C) and pH (AtGUS-E, 6.6; AtGUS(-3t)-E, 7.0) conditions, the thermal stability of AtGUS(-3t)-E was enhanced at 65°C, and the metal ions Co2+, Ca2+ and Ni2+ showed opposite effects on AtGUS-E and AtGUS(-3t)-E, respectively. Furthermore, Km of AtGUS(-3t)-E (1.95 mM) was just nearly one-seventh that of AtGUS-E (12.9 mM), whereas the catalytic efficiency of AtGUS(-3t)-E was 3.2 fold higher than that of AtGUS-E (7.16 vs. 2.24 mM s−1), revealing that the truncation of non-conservative sequence can significantly improve the catalytic efficiency of AtGUS. Conformational analysis illustrated significant difference in the secondary structure between AtGUS-E and AtGUS(-3t)-E by circular dichroism (CD). The results showed that the truncation of the non-conservative sequence could preferably alter and influence the stability and catalytic efficiency of enzyme. PMID:22347419
[Vertical variability of Pinus sylvestris var. mongolica tree ring delta13C and its relationship with tree ring width in northern Daxing' an Mountains of Northeast China].

PubMed

Shang, Zhi-Yuan; Wang, Jian; Zhang, Wen; Li, Yan-Yan; Cui, Ming-Xing; Chen, Zhen-Ju; Zhao, Xing-Yun

2013-01-01

A measurement was made on the vertical direction tree ring stable carbon isotope ratio (delta13C) and tree ring width of Pinus sylvestris var. mongolica in northern Daxing' an Mountains of Northeast China, with the relationship between the vertical direction variations of the tree ring delta13C and tree ring width analyzed. In the whole ring of xylem, earlywood (EW) and bark endodermis, the delta13C all exhibited an increasing trend from the top to the base at first, with the maximum at the bottom of tree crown, and then, decreased rapidly to the minimum downward. The EW and late-wood (LW) had an increasing ratio of average tree ring width from the base to the top. The average annual sequence of the delta13C in vertical direction had an obvious reverse correspondence with the average annual sequence of tree ring width, and had a trend comparatively in line with the average annual sequence of the tree ring width ratio of EW to LW above tree crown. The variance analysis showed that there existed significant differences in the sequences of tree ring delta13C and ring width in vertical direction, and the magnitude of vertical delta13C variability was basically the same as that of the inter-annual delta13C variability. The year-to-year variation trend of the vertical delta13C sequence was approximately identical. For each sample, the delta13C sequence at the same heights was negatively correlated with the ring width sequence, but the statistical significance differed with tree height.
MALDI Top-Down sequencing: calling N- and C-terminal protein sequences with high confidence and speed.

PubMed

Suckau, Detlev; Resemann, Anja

2009-12-01

The ability to match Top-Down protein sequencing (TDS) results by MALDI-TOF to protein sequences by classical protein database searching was evaluated in this work. Resulting from these analyses were the protein identity, the simultaneous assignment of the N- and C-termini and protein sequences of up to 70 residues from either terminus. In combination with de novo sequencing using the MALDI-TDS data, even fusion proteins were assigned and the detailed sequence around the fusion site was elucidated. MALDI-TDS allowed to efficiently match protein sequences quickly and to validate recombinant protein structures-in particular, protein termini-on the level of undigested proteins.
Complete Genome Sequence of a Common Midwife Toad Virus-Like Ranavirus Associated with Mass Mortalities in Wild Amphibians in the Netherlands

PubMed Central

Hughes, Joseph; Saucedo, Bernardo; Rijks, Jolianne; Kik, Marja; Haenen, Olga L. M.; Engelsma, Marc Y.; Gröne, Andrea; Verheije, M. Helene; Wilkie, Gavin

2014-01-01

A ranavirus associated with mass mortalities in wild water frogs (Pelophylax spp.) and other amphibians in the Netherlands since 2010 was isolated, and its complete genome sequence was determined. The virus has a genome of 107,772 bp and shows 96.5% sequence identity with the common midwife toad virus from Spain. PMID:25540340
Isolation of a complete circular virus genome sequence from an Alaskan black-capped chickadee (Poecile atricapillus) gastrointestinal tract sample.

USGS Publications Warehouse

Hanna, Zachary R.; Runckel, Charles; Fuchs, Jerome; DeRisi, Joseph L.; Mindell, David P.; Van Hemert, Caroline R.; Handel, Colleen M.; Dumbacher, John P.

2015-01-01

We report here the genome sequence of a circular virus isolated from samples of an Alaskan black-capped chickadee (Poecile atricapillus) gastrointestinal tract. The genome is 2,152 bp in length and is most similar (30 to 44.5% amino acid identity) to the genome sequences of other single-stranded DNA (ssDNA) circular viruses belonging to the gemycircularvirus group.
Complete Genome Sequence of Pelosinus fermentans JBW45, a Member of a Remarkably Competitive Group of Negativicutes in the Firmicutes Phylum

DOE PAGES

De León, Kara B.; Utturkar, Sagar M.; Camilleri, Laura B.; ...

2015-09-24

The genome of Pelosinus fermentans JBW45, isolated from a chromium-contaminated site in Hanford, Washington, USA, has been completed with PacBio sequencing. Finally, nine copies of the rRNA gene operon and multiple transposase genes with identical sequences resulted in breaks in the original draft genome and may suggest genomic instability of JBW45.
Draft Genome Sequences of Three Novel Low-Abundance Species Strains Isolated from Kefir Grain.

PubMed

Kim, Yongkyu; Blasche, Sonja; Patil, Kiran R

2017-09-28

We report here the genome sequences of three novel bacterial species strains- Bacillus kefirresidentii Opo, Rothia kefirresidentii KRP, and Streptococcus kefirresidentii YK-isolated from kefir grains collected in Germany. The draft genomes of these isolates were remarkably dissimilar (average nucleotide identities, 77.80%, 89.01%, and 92.10%, respectively) to those of the previously sequenced strains. Copyright © 2017 Kim et al.
Complete amino acid sequence of the myoglobin from the Pacific sei whale, Balaenoptera borealis.

PubMed

Jones, B N; Rothgeb, T M; England, R D; Gurd, F R

1979-04-25

The complete amino acid sequence of the major component myoglobin from Pacific sei whale, Balaenoptera borealis, was determined by specific cleavage of the protein to obtain large peptides which are readily degraded by the automatic sequencer. The acetimidated apomyoglobin was selectively cleaved at its two methionyl residues with cyanogen bromide and at its three arginyl residues by trypsin. From the sequence analysis of four of these peptides and the apomyoglobin, over 75% of the covalent structure of the protein was obtained. The remainder of the primary structure was determined by the sequence analysis of peptides that resulted from further digestion of the amino-terminal and central cyanogen bromide fragments. The amino-terminal fragment was specifically cleaved at its two tryptophanyl residues with N-chlorosuccinimide and the central cyanogen bromide fragment was cleaved at its glutamyl residues with staphylococcal protease and at its single tyrosyl residue with N-bromosuccinimide. The primary structure of this myoglobin proved identical with that from the gray whale but differs from that of the finback whale at four positions, from that of the minke whale at three positions and from the myoglobin of the humpback whale at one position. The above sequence identities and differences reflect the close taxonomic relationship of these five species of Cetacea.
Hepatozoon canis infecting dogs in the State of Espírito Santo, southeastern Brazil.

PubMed

Spolidorio, Mariana G; Labruna, Marcelo B; Zago, Augusto M; Donatele, Dirlei M; Caliari, Késia M; Yoshinari, Natalino H

2009-08-26

From May 2007 to March 2008, blood samples were collected from 92 healthy dogs living in 21 households (17 farms in rural area, and 4 homes in urban area) in 6 counties of the State of Espírito Santo, southeastern Brazil. In addition, ticks were collected from these dogs. A mean of 4.4+/-3.0 dogs (range: 1-12) were sampled per household; 78 and 14 dogs were from rural and urban areas, respectively. Polymerase chain reaction (PCR) designed to amplify fragments of the 18S rDNA gene of Babesia spp or Hepatozoon spp revealed amplicons of the expected size in 20 (21.7%) dogs for Babesia, and 54 (58.7%) dogs for Hepatozoon. All Babesia-positive dogs were also Hepatozoon-positive. Among the 21 households, 15 (71.4%) from 3 counties had at least one PCR-positive dog, including 13 farms (rural area) and 2 homes (urban area). A total of 40 PCR products from the Hepatozoon-PCR, and 19 products from the Babesia-PCR were submitted to DNA sequencing. All generated sequences from Hepatozoon-PCR were identical to each other, and to corresponding 18S rDNA sequences of H. canis in GenBank. Surprisingly, all generated sequences from the Babesia PCR were also identical to corresponding 18S rDNA sequences of H. canis in GenBank. Dogs from 10 rural and 2 urban households were found infested by Rhipicephalus sanguineus ticks. Immature of Amblyomma cajennense ticks were found in dogs from only 4 rural households (also infested by R. sanguineus). All but one household with R. sanguineus-infested dogs had at least one Hepatozoon-infected dog. Statistical analysis showed that the presence of ticks (i.e. R. sanguineus) infesting dogs in the households was significantly (P<0.05) associated with at least one PCR-positive dog. There was no significant association (P>0.05) between PCR-positive dogs and urban or rural households. Canine hepatozoonosis caused by H. canis is a high frequent infection in Espírito Santo, Brazil, where it is possibly vectored by R. sanguineus. Since all infected dogs were found apparently healthy, the pathogenicity of H. canis for dogs in Espírito Santo is yet to be elucidated.
Molecular detection and genetic identification of Babesia bigemina, Theileria annulata, Theileria orientalis and Anaplasma marginale in Turkey.

PubMed

Zhou, Mo; Cao, Shinuo; Sevinc, Ferda; Sevinc, Mutlu; Ceylan, Onur; Moumouni, Paul Franck Adjou; Jirapattharasate, Charoonluk; Liu, Mingming; Wang, Guanbo; Iguchi, Aiko; Vudriko, Patrick; Suzuki, Hiroshi; Xuan, Xuenan

2016-02-01

Babesia spp., Theileria spp. and Anaplasma spp. are significant tick-borne pathogens of livestock globally. In this study, we investigated the presence and distribution of Babesia bigemina, Theileria annulata, Theileria orientalis and Anaplasma marginale in cattle from 6 provinces of Turkey using species-specific PCR assays. The PCR were conducted using the primers based on the B. bigemina rhoptry-associated protein 1a (BbiRAP-1a), T. annulata merozoite surface antigen-1 (Tams-1), T. orientalis major piroplasm surface protein (ToMPSP) and A. marginale major surface protein 4 (AmMSP4) genes, respectively. Fragments of B. bigemina internal transcribed spacer (BbiITS), T. annulata internal transcribed spacer (TaITS), ToMPSP and AmMSP4 genes were sequenced for phylogenetic analysis. PCR results revealed that the overall infections of A. marginale, T. annulata, B. bigemina and T. orientalis were 29.1%, 18.9%, 11.2% and 5.6%, respectively. The co-infection of two or three pathogens was detected in 29/196 (15.1%) of the cattle samples. The results of sequence analysis indicated that BbiRAP-1a, BbiITS, Tams-1, ToMPSP and AmMSP4 were conserved among the Turkish samples, with 99.76%, 99-99.8%, 99.34-99.78%, 96.9-99.61% and 99.42-99.71% sequence identity values, respectively. In contrast, the Turkish TaITS gene sequences were relatively diverse with 92.3-96.63% identity values. B. bigemina isolates from Turkey were found in the same clade as the isolates from other countries in phylogenetic analysis. On the other hand, phylogenetic analysis based on T. annulata ITS sequences revealed significant differences in the genotypes of T. annulata isolates from Turkey. Additionally, the T. orientalis isolates from Turkish samples were classified as MPSP type 3 genotype. This is the first report of type 3 MPSP in Turkey. Moreover, AmMSP4 isolates from Turkey were found in the same clade as the isolates from other countries. This study provides important data for understanding the epidemiology of tick-borne diseases and it is expected to improve approach for diagnosis and control of tick-borne diseases in Turkey. Copyright © 2015 Elsevier GmbH. All rights reserved.
Three closely related herpesviruses are associated with fibropapillomatosis in marine turtles

USGS Publications Warehouse

Quackenbush, S.L.; Work, Thierry M.; Balazs, George H.; Casey, Rufina N.; Rovnak, J.; Chaves, A.; duToit, L.; Baines, J.D.; Parrish, C.R.; Bowser, Paul R.; Casey, James W.

1998-01-01

Green turtle fibropapillomatosis is a neoplastic disease of increasingly significant threat to the survivability of this species. Degenerate PCR primers that target highly conserved regions of genes encoding herpesvirus DNA polymerases were used to amplify a DNA sequence from fibropapillomas and fibromas from Hawaiian and Florida green turtles. All of the tumors tested (n= 23) were found to harbor viral DNA, whereas no viral DNA was detected in skin biopsies from tumor-negative turtles. The tissue distribution of the green turtle herpesvirus appears to be generally limited to tumors where viral DNA was found to accumulate at approximately two to five copies per cell and is occasionally detected, only by PCR, in some tissues normally associated with tumor development. In addition, herpesviral DNA was detected in fibropapillomas from two loggerhead and four olive ridley turtles. Nucleotide sequencing of a 483-bp fragment of the turtle herpesvirus DNA polymerase gene determined that the Florida green turtle and loggerhead turtle sequences are identical and differ from the Hawaiian green turtle sequence by five nucleotide changes, which results in two amino acid substitutions. The olive ridley sequence differs from the Florida and Hawaiian green turtle sequences by 15 and 16 nucleotide changes, respectively, resulting in four amino acid substitutions, three of which are unique to the olive ridley sequence. Our data suggest that these closely related turtle herpesviruses are intimately involved in the genesis of fibropapillomatosis.
Purification, characterization and sequence analysis of Omp50,a new porin isolated from Campylobacter jejuni.

PubMed Central

Bolla, J M; Dé, E; Dorez, A; Pagès, J M

2000-01-01

A novel pore-forming protein identified in Campylobacter was purified by ion-exchange chromatography and named Omp50 according to both its molecular mass and its outer membrane localization. We observed a pore-forming ability of Omp50 after re-incorporation into artificial membranes. The protein induced cation-selective channels with major conductance values of 50-60 pS in 1 M NaCl. N-terminal sequencing allowed us to identify the predicted coding sequence Cj1170c from the Campylobacter jejuni genome database as the corresponding gene in the NCTC 11168 genome sequence. The gene, designated omp50, consists of a 1425 bp open reading frame encoding a deduced 453-amino acid protein with a calculated pI of 5.81 and a molecular mass of 51169.2 Da. The protein possessed a 20-amino acid leader sequence. No significant similarity was found between Omp50 and porin protein sequences already determined. Moreover, the protein showed only weak sequence identity with the major outer-membrane protein (MOMP) of Campylobacter, correlating with the absence of antigenic cross-reactivity between these two proteins. Omp50 is expressed in C. jejuni and Campylobacter lari but not in Campylobacter coli. The gene, however, was detected in all three species by PCR. According to its conformation and functional properties, the protein would belong to the family of outer-membrane monomeric porins. PMID:11104668
Iterative dictionary construction for compression of large DNA data sets.

PubMed

Kuruppu, Shanika; Beresford-Smith, Bryan; Conway, Thomas; Zobel, Justin

2012-01-01

Genomic repositories increasingly include individual as well as reference sequences, which tend to share long identical and near-identical strings of nucleotides. However, the sequential processing used by most compression algorithms, and the volumes of data involved, mean that these long-range repetitions are not detected. An order-insensitive, disk-based dictionary construction method can detect this repeated content and use it to compress collections of sequences. We explore a dictionary construction method that improves repeat identification in large DNA data sets. Our adaptation, COMRAD, of an existing disk-based method identifies exact repeated content in collections of sequences with similarities within and across the set of input sequences. COMRAD compresses the data over multiple passes, which is an expensive process, but allows COMRAD to compress large data sets within reasonable time and space. COMRAD allows for random access to individual sequences and subsequences without decompressing the whole data set. COMRAD has no competitor in terms of the size of data sets that it can compress (extending to many hundreds of gigabytes) and, even for smaller data sets, the results are competitive compared to alternatives; as an example, 39 S. cerevisiae genomes compressed to 0.25 bits per base.
Assessment of Species Diversity and Distribution of an Ancient Diatom Lineage Using a DNA Metabarcoding Approach

PubMed Central

Nanjappa, Deepak; Audic, Stephane; Romac, Sarah; Kooistra, Wiebe H. C. F.; Zingone, Adriana

2014-01-01

Background Continuous efforts to estimate actual diversity and to trace the species distribution and ranges in the natural environments have gone in equal pace with advancements of the technologies in the study of microbial species diversity from microscopic observations to DNA-based barcoding. DNA metabarcoding based on Next Generation Sequencing (NGS) constitutes the latest advancement in these efforts. Here we use NGS data from different sites to investigate the geographic range of six species of the diatom family Leptocylindraceae and to identify possible new taxa within the family. Methodology/Principal Findings We analysed the V4 and V9 regions of the nuclear-encoded SSU rDNA gene region in the NGS database of the European ERA-Biodiversa project BioMarKs, collected in plankton and sediments at six coastal sites in European coastal waters, as well as environmental sequences from the NCBI database. All species known in the family Leptocylindraceae were detected in both datasets, but the much larger Illumina V9 dataset showed a higher species coverage at the various sites than the 454 V4 dataset. Sequences identical or similar to the references of Leptocylindrus aporus, L. convexus, L. danicus/hargravesii and Tenuicylindrus belgicus were found in the Mediterranean Sea, North Atlantic Ocean and Black Sea as well as at locations outside Europe. Instead, sequences identical or close to that of L. minimus were found in the North Atlantic Ocean and the Black Sea but not in the Mediterranean Sea, while sequences belonging to a yet undescribed taxon were encountered only in Oslo Fjord and Baffin Bay. Conclusions/Significance Identification of Leptocylindraceae species in NGS datasets has expanded our knowledge of the species biogeographic distribution and of the overall diversity of this diatom family. Individual species appear to be widespread, but not all of them are found everywhere. Despite the sequencing depth allowed by NGS and the wide geographic area covered by this study, the diversity of this ancient diatom family appears to be low, at least at the level of the marker used in this study. PMID:25133638
Increased risk for Campylobacter jejuni and C. coli infection of pet origin in dog owners and evidence for genetic association between strains causing infection in humans and their pets.

PubMed

Mughini Gras, L; Smid, J H; Wagenaar, J A; Koene, M G J; Havelaar, A H; Friesema, I H M; French, N P; Flemming, C; Galson, J D; Graziani, C; Busani, L; VAN Pelt, W

2013-12-01

We compared Campylobacter jejuni/coli multilocus sequence types (STs) from pets (dogs/cats) and their owners and investigated risk factors for pet-associated human campylobacteriosis using a combined source-attribution and case-control analysis. In total, 132/687 pet stools were Campylobacter-positive, resulting in 499 strains isolated (320 C. upsaliensis/helveticus, 100 C. jejuni, 33 C. hyointestinalis/fetus, 10 C. lari, 4 C. coli, 32 unidentified). There were 737 human and 104 pet C. jejuni/coli strains assigned to 154 and 49 STs, respectively. Dog, particularly puppy, owners were at increased risk of infection with pet-associated STs. In 2/68 cases vs. 0.134/68 expected by chance, a pet and its owner were infected with an identical ST (ST45, ST658). Although common sources of infection and directionality of transmission between pets and humans were unknown, dog ownership significantly increased the risk for pet-associated human C. jejuni/coli infection and isolation of identical strains in humans and their pets occurred significantly more often than expected.
Rapid Mitochondrial Genome Evolution through Invasion of Mobile Elements in Two Closely Related Species of Arbuscular Mycorrhizal Fungi

PubMed Central

Beaudet, Denis; Nadimi, Maryam; Iffis, Bachir; Hijri, Mohamed

2013-01-01

Arbuscular mycorrhizal fungi (AMF) are common and important plant symbionts. They have coenocytic hyphae and form multinucleated spores. The nuclear genome of AMF is polymorphic and its organization is not well understood, which makes the development of reliable molecular markers challenging. In stark contrast, their mitochondrial genome (mtDNA) is homogeneous. To assess the intra- and inter-specific mitochondrial variability in closely related Glomus species, we performed 454 sequencing on total genomic DNA of Glomus sp. isolate DAOM-229456 and we compared its mtDNA with two G. irregulare isolates. We found that the mtDNA of Glomus sp. is homogeneous, identical in gene order and, with respect to the sequences of coding regions, almost identical to G. irregulare. However, certain genomic regions vary substantially, due to insertions/deletions of elements such as introns, mitochondrial plasmid-like DNA polymerase genes and mobile open reading frames. We found no evidence of mitochondrial or cytoplasmic plasmids in Glomus species, and mobile ORFs in Glomus are responsible for the formation of four gene hybrids in atp6, atp9, cox2, and nad3, which are most probably the result of horizontal gene transfer and are expressed at the mRNA level. We found evidence for substantial sequence variation in defined regions of mtDNA, even among closely related isolates with otherwise identical coding gene sequences. This variation makes it possible to design reliable intra- and inter-specific markers. PMID:23637766

Some links on this page may take you to non-federal websites. Their policies may differ from this site.