Sample records for variable region sequence

  1. Length and sequence variability in mitochondrial control region of the milkfish, Chanos chanos.

    PubMed

    Ravago, Rachel G; Monje, Virginia D; Juinio-Meñez, Marie Antonette

    2002-01-01

    Extensive length variability was observed in the mitochondrial control region of the milkfish, Chanos chanos. The nucleotide sequence of the control region and flanking regions was determined. Length variability and heteroplasmy were due to the presence of varying numbers of a 41-bp tandemly repeated sequence and a 48-bp insertion/deletion (indel). The structure and organization of the milkfish control region are similar to those of other teleost fish and vertebrates. However, extensive variation in the copy number of tandem repeats (4-20 copies) and the presence of a relatively large (48-bp) indel are apparently uncommon in teleost fish control region sequences reported to date. High sequence variability of control region peripheral domains indicates the potential utility of selected regions as markers for population-level studies.

  2. Diversity of the P2 protein among nontypeable Haemophilus influenzae isolates.

    PubMed Central

    Bell, J; Grass, S; Jeanteur, D; Munson, R S

    1994-01-01

    The genes for outer membrane protein P2 of four nontypeable Haemophilus influenzae strains were cloned and sequenced. The derived amino acid sequences were compared with the outer membrane protein P2 sequence from H. influenzae type b MinnA and the sequences of P2 from three additional nontypeable H. influenzae strains. The sequences were 76 to 94% identical. The sequences had regions with considerable variability separated by regions which were highly conserved. The variable regions mapped to putative surface-exposed loops of the protein. PMID:8188390

  3. DNA Barcode Sequence Identification Incorporating Taxonomic Hierarchy and within Taxon Variability

    PubMed Central

    Little, Damon P.

    2011-01-01

    For DNA barcoding to succeed as a scientific endeavor an accurate and expeditious query sequence identification method is needed. Although a global multiple–sequence alignment can be generated for some barcoding markers (e.g. COI, rbcL), not all barcoding markers are as structurally conserved (e.g. matK). Thus, algorithms that depend on global multiple–sequence alignments are not universally applicable. Some sequence identification methods that use local pairwise alignments (e.g. BLAST) are unable to accurately differentiate between highly similar sequences and are not designed to cope with hierarchic phylogenetic relationships or within taxon variability. Here, I present a novel alignment–free sequence identification algorithm–BRONX–that accounts for observed within taxon variability and hierarchic relationships among taxa. BRONX identifies short variable segments and corresponding invariant flanking regions in reference sequences. These flanking regions are used to score variable regions in the query sequence without the production of a global multiple–sequence alignment. By incorporating observed within taxon variability into the scoring procedure, misidentifications arising from shared alleles/haplotypes are minimized. An explicit treatment of more inclusive terminals allows for separate identifications to be made for each taxonomic level and/or for user–defined terminals. BRONX performs better than all other methods when there is imperfect overlap between query and reference sequences (e.g. mini–barcode queries against a full–length barcode database). BRONX consistently produced better identifications at the genus–level for all query types. PMID:21857897

  4. Variability and transmission by Aphis glycines of North American and Asian Soybean mosaic virus isolates.

    PubMed

    Domier, L L; Latorre, I J; Steinlage, T A; McCoppin, N; Hartman, G L

    2003-10-01

    The variability of North American and Asian strains and isolates of Soybean mosaic virus was investigated. First, polymerase chain reaction (PCR) products representing the coat protein (CP)-coding regions of 38 SMVs were analyzed for restriction fragment length polymorphisms (RFLP). Second, the nucleotide and predicted amino acid sequence variability of the P1-coding region of 18 SMVs and the helper component/protease (HC/Pro) and CP-coding regions of 25 SMVs were assessed. The CP nucleotide and predicted amino acid sequences were the most similar and predicted phylogenetic relationships similar to those obtained from RFLP analysis. Neither RFLP nor sequence analyses of the CP-coding regions grouped the SMVs by geographical origin. The P1 and HC/Pro sequences were more variable and separated the North American and Asian SMV isolates into two groups similar to previously reported differences in pathogenic diversity of the two sets of SMV isolates. The P1 region was the most informative of the three regions analyzed. To assess the biological relevance of the sequence differences in the HC/Pro and CP coding regions, the transmissibility of 14 SMV isolates by Aphis glycines was tested. All field isolates of SMV were transmitted efficiently by A. glycines, but the laboratory isolates analyzed were transmitted poorly. The amino acid sequences from most, but not all, of the poorly transmitted isolates contained mutations in the aphid transmission-associated DAG and/or KLSC amino acid sequence motifs of CP and HC/Pro, respectively.

  5. Variability among the Most Rapidly Evolving Plastid Genomic Regions is Lineage-Specific: Implications of Pairwise Genome Comparisons in Pyrus (Rosaceae) and Other Angiosperms for Marker Choice

    PubMed Central

    Ter-Voskanyan, Hasmik; Allgaier, Martin; Borsch, Thomas

    2014-01-01

    Plastid genomes exhibit different levels of variability in their sequences, depending on the respective kinds of genomic regions. Genes are usually more conserved while noncoding introns and spacers evolve at a faster pace. While a set of about thirty maximum variable noncoding genomic regions has been suggested to provide universally promising phylogenetic markers throughout angiosperms, applications often require several regions to be sequenced for many individuals. Our project aims to illuminate evolutionary relationships and species-limits in the genus Pyrus (Rosaceae)—a typical case with very low genetic distances between taxa. In this study, we have sequenced the plastid genome of Pyrus spinosa and aligned it to the already available P. pyrifolia sequence. The overall p-distance of the two Pyrus genomes was 0.00145. The intergenic spacers between ndhC–trnV, trnR–atpA, ndhF–rpl32, psbM–trnD, and trnQ–rps16 were the most variable regions, also comprising the highest total numbers of substitutions, indels and inversions (potentially informative characters). Our comparative analysis of further plastid genome pairs with similar low p-distances from Oenothera (representing another rosid), Olea (asterids) and Cymbidium (monocots) showed in each case a different ranking of genomic regions in terms of variability and potentially informative characters. Only two intergenic spacers (ndhF–rpl32 and trnK–rps16) were consistently found among the 30 top-ranked regions. We have mapped the occurrence of substitutions and microstructural mutations in the four genome pairs. High AT content in specific sequence elements seems to foster frequent mutations. We conclude that the variability among the fastest evolving plastid genomic regions is lineage-specific and thus cannot be precisely predicted across angiosperms. The often lineage-specific occurrence of stem-loop elements in the sequences of introns and spacers also governs lineage-specific mutations. 
    Sequencing whole plastid genomes to find markers for evolutionary analyses is therefore particularly useful when overall genetic distances are low. PMID:25405773
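
    The p-distance quoted in this abstract is the proportion of aligned sites at which two sequences differ, so 0.00145 corresponds to roughly 1.45 differences per 1000 compared sites. A minimal sketch of that calculation follows. The function name, the toy sequences, and the choice to skip gap and N columns (pairwise deletion) are assumptions made for illustration; they are not taken from the study.

```python
def p_distance(seq_a: str, seq_b: str) -> float:
    """Proportion of differing sites between two aligned sequences.

    Assumption for this sketch: columns with a gap ('-') or an ambiguous
    base ('N') in either sequence are skipped (pairwise deletion).
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    differences = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a in "-N" or b in "-N":
            continue  # not a comparable site
        compared += 1
        if a != b:
            differences += 1
    if compared == 0:
        raise ValueError("no comparable sites")
    return differences / compared


# Hypothetical aligned fragments (not Pyrus data): 12 comparable sites, 2 differences
toy_a = "ATGCCGTA-ACGT"
toy_b = "ATGCTGTATACGA"
print(p_distance(toy_a, toy_b))
```

    Indels and inversions, which the abstract counts separately as potentially informative characters, are deliberately ignored by this substitution-only measure.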

  6. Identification and verification of hybridoma-derived monoclonal antibody variable region sequences using recombinant DNA technology and mass spectrometry

    USDA-ARS's Scientific Manuscript database

    Antibody engineering requires the identification of antigen binding domains or variable regions (VR) unique to each antibody. It is the VR that define the unique antigen binding properties and proper sequence identification is essential for functional evaluation and performance of recombinant antibo...

  7. The nuclear 18S ribosomal RNA gene as a source of phylogenetic information in the genus Taenia.

    PubMed

    Yan, Hongbin; Lou, Zhongzi; Li, Li; Ni, Xingwei; Guo, Aijiang; Li, Hongmin; Zheng, Yadong; Dyachenko, Viktor; Jia, Wanzhong

    2013-03-01

    Most species of the genus Taenia are of considerable medical and veterinary significance. In this study, complete nuclear 18S rRNA gene sequences were obtained from seven members of genus Taenia [Taenia multiceps, Taenia saginata, Taenia asiatica, Taenia solium, Taenia pisiformis, Taenia hydatigena, and Taenia taeniaeformis] and a phylogeny inferred using these sequences. Most of the variable sites fall within the variable regions, V1-V5. We show that sequences from the nuclear 18S ribosomal RNA gene have considerable promise as sources of phylogenetic information within the genus Taenia. Furthermore, given that almost all the variable sites lie within defined variable portions of that gene, it will be appropriate and economical to sequence only those regions for additional species of Taenia.

  8. Sequences of heavy and light chain variable regions from four bovine immunoglobulins.

    PubMed

    Armour, K L; Tempest, P R; Fawcett, P H; Fernie, M L; King, S I; White, P; Taylor, G; Harris, W J

    1994-12-01

    Oligodeoxyribonucleotide primers based on the 5' ends of bovine IgG1/2 and lambda constant (C) region genes, together with primers encoding conserved amino acids at the N-terminus of mature variable (V) regions from other species, have been used in cDNA and polymerase chain reactions (PCRs) to amplify heavy and light chain V region cDNA from bovine heterohybridomas. The amino acid sequences of VH and V lambda from four bovine immunoglobulins of different specificities are presented.

  9. Cloning and sequence analysis of complementary DNA encoding an aberrantly rearranged human T-cell gamma chain.

    PubMed Central

    Dialynas, D P; Murre, C; Quertermous, T; Boss, J M; Leiden, J M; Seidman, J G; Strominger, J L

    1986-01-01

    Complementary DNA (cDNA) encoding a human T-cell gamma chain has been cloned and sequenced. At the junction of the variable and joining regions, there is an apparent deletion of two nucleotides in the human cDNA sequence relative to the murine gamma-chain cDNA sequence, resulting simultaneously in the generation of an in-frame stop codon and in a translational frameshift. For this reason, the sequence presented here encodes an aberrantly rearranged human T-cell gamma chain. There are several surprising differences between the deduced human and murine gamma-chain amino acid sequences. These include poor homology in the variable region, poor homology in a discrete segment of the constant region precisely bounded by the expected junctions of exon CII, and the presence in the human sequence of five potential sites for N-linked glycosylation. PMID:3458221

  10. Highly conserved intragenic HSV-2 sequences: Results from next-generation sequencing of HSV-2 UL and US regions from genital swabs collected from 3 continents.

    PubMed

    Johnston, Christine; Magaret, Amalia; Roychoudhury, Pavitra; Greninger, Alexander L; Cheng, Anqi; Diem, Kurt; Fitzgibbon, Matthew P; Huang, Meei-Li; Selke, Stacy; Lingappa, Jairam R; Celum, Connie; Jerome, Keith R; Wald, Anna; Koelle, David M

    2017-10-01

    Understanding the variability in circulating herpes simplex virus type 2 (HSV-2) genomic sequences is critical to the development of HSV-2 vaccines. Genital lesion swabs containing ≥ 10^7 log10 copies HSV DNA collected from Africa, the USA, and South America underwent next-generation sequencing, followed by K-mer based filtering and de novo genomic assembly. Sites of heterogeneity within coding regions in unique long and unique short (UL_US) regions were identified. Phylogenetic trees were created using maximum likelihood reconstruction. Among 46 samples from 38 persons, 1468 intragenic base-pair substitutions were identified. The maximum nucleotide distance between strains for concatenated UL_US segments was 0.4%. Phylogeny did not reveal geographic clustering. The most variable proteins had non-synonymous mutations in < 3% of amino acids. Unenriched HSV-2 DNA can undergo next-generation sequencing to identify intragenic variability. The use of clinical swabs for sequencing expands the information that can be gathered directly from these specimens. Copyright © 2017 Elsevier Inc. All rights reserved.

  11. Species composition of the genus Saprolegnia in fin fish aquaculture environments, as determined by nucleotide sequence analysis of the nuclear rDNA ITS regions.

    PubMed

    de la Bastide, Paul Y; Leung, Wai Lam; Hintz, William E

    2015-01-01

    The ITS region of the rDNA gene was compared for Saprolegnia spp. in order to improve our understanding of nucleotide sequence variability within and between species of this genus, determine species composition in Canadian fin fish aquaculture facilities, and to assess the utility of ITS sequence variability in genetic marker development. From a collection of more than 400 field isolates, ITS region nucleotide sequences were studied and it was determined that there was sufficient consistent inter-specific variation to support the designation of species identity based on ITS sequence data. This non-subjective approach to species identification does not rely upon transient morphological features. Phylogenetic analyses comparing our ITS sequences and species designations with data from previous studies generally supported the clade scheme of Diéguez-Uribeondo et al. (2007) and found agreement with the molecular taxonomic cluster system of Sandoval-Sierra et al. (2014). Our Canadian ITS sequence collection will thus contribute to the public database and assist the clarification of Saprolegnia spp. taxonomy. The analysis of ITS region sequence variability facilitated genus- and species-level identification of unknown samples from aquaculture facilities and provided useful information on species composition. A unique ITS-RFLP for the identification of S. parasitica was also described. Copyright © 2014 The British Mycological Society. Published by Elsevier Ltd. All rights reserved.

  12. Epstein-Barr Virus Latent Membrane Protein 1 Genetic Variability in Peripheral Blood B Cells and Oropharyngeal Fluids

    PubMed Central

    Renzette, Nicholas; Somasundaran, Mohan; Brewster, Frank; Coderre, James; Weiss, Eric R.; McManus, Margaret; Greenough, Thomas; Tabak, Barbara; Garber, Manuel; Kowalik, Timothy F.

    2014-01-01

    ABSTRACT We report the diversity of latent membrane protein 1 (LMP1) gene founder sequences and the level of Epstein-Barr virus (EBV) genome variability over time and across anatomic compartments by using virus genomes amplified directly from oropharyngeal wash specimens and peripheral blood B cells during acute infection and convalescence. The intrahost nucleotide variability of the founder virus was 0.02% across the region sequences, and diversity increased significantly over time in the oropharyngeal compartment (P = 0.004). The LMP1 region showing the greatest level of variability in both compartments, and over time, was concentrated within the functional carboxyl-terminal activating regions 2 and 3 (CTAR2 and CTAR3). Interestingly, a deletion in a proline-rich repeat region (amino acids 274 to 289) of EBV commonly reported in EBV sequenced from cancer specimens was not observed in acute infectious mononucleosis (AIM) patients. Taken together, these data highlight the diversity in circulating EBV genomes and its potential importance in disease pathogenesis and vaccine design. IMPORTANCE This study is among the first to leverage an improved high-throughput deep-sequencing methodology to investigate directly from patient samples the degree of diversity in Epstein-Barr virus (EBV) populations and the extent to which viral genome diversity develops over time in the infected host. Significant variability of circulating EBV latent membrane protein 1 (LMP1) gene sequences was observed between cellular and oral wash samples, and this variability increased over time in oral wash samples. The significance of EBV genetic diversity in transmission and disease pathogenesis are discussed. PMID:24429365

  13. Epstein-Barr virus latent membrane protein 1 genetic variability in peripheral blood B cells and oropharyngeal fluids.

    PubMed

    Renzette, Nicholas; Somasundaran, Mohan; Brewster, Frank; Coderre, James; Weiss, Eric R; McManus, Margaret; Greenough, Thomas; Tabak, Barbara; Garber, Manuel; Kowalik, Timothy F; Luzuriaga, Katherine

    2014-04-01

    We report the diversity of latent membrane protein 1 (LMP1) gene founder sequences and the level of Epstein-Barr virus (EBV) genome variability over time and across anatomic compartments by using virus genomes amplified directly from oropharyngeal wash specimens and peripheral blood B cells during acute infection and convalescence. The intrahost nucleotide variability of the founder virus was 0.02% across the region sequences, and diversity increased significantly over time in the oropharyngeal compartment (P = 0.004). The LMP1 region showing the greatest level of variability in both compartments, and over time, was concentrated within the functional carboxyl-terminal activating regions 2 and 3 (CTAR2 and CTAR3). Interestingly, a deletion in a proline-rich repeat region (amino acids 274 to 289) of EBV commonly reported in EBV sequenced from cancer specimens was not observed in acute infectious mononucleosis (AIM) patients. Taken together, these data highlight the diversity in circulating EBV genomes and its potential importance in disease pathogenesis and vaccine design. This study is among the first to leverage an improved high-throughput deep-sequencing methodology to investigate directly from patient samples the degree of diversity in Epstein-Barr virus (EBV) populations and the extent to which viral genome diversity develops over time in the infected host. Significant variability of circulating EBV latent membrane protein 1 (LMP1) gene sequences was observed between cellular and oral wash samples, and this variability increased over time in oral wash samples. The significance of EBV genetic diversity in transmission and disease pathogenesis are discussed.

  14. Multi-region and single-cell sequencing reveal variable genomic heterogeneity in rectal cancer.

    PubMed

    Liu, Mingshan; Liu, Yang; Di, Jiabo; Su, Zhe; Yang, Hong; Jiang, Beihai; Wang, Zaozao; Zhuang, Meng; Bai, Fan; Su, Xiangqian

    2017-11-23

    Colorectal cancer is a heterogeneous group of malignancies with complex molecular subtypes. While colon cancer has been widely investigated, studies on rectal cancer are very limited. Here, we performed multi-region whole-exome sequencing and single-cell whole-genome sequencing to examine the genomic intratumor heterogeneity (ITH) of rectal tumors. We sequenced nine tumor regions and 88 single cells from two rectal cancer patients with tumors of the same molecular classification and characterized their mutation profiles and somatic copy number alterations (SCNAs) at the multi-region and the single-cell levels. A variable extent of genomic heterogeneity was observed between the two patients, and the degree of ITH increased when analyzed on the single-cell level. We found that major SCNAs were early events in cancer development and inherited steadily. Single-cell sequencing revealed mutations and SCNAs which were hidden in bulk sequencing. In summary, we studied the ITH of rectal cancer at regional and single-cell resolution and demonstrated that variable heterogeneity existed in two patients. The mutational scenarios and SCNA profiles of the two treatment-naïve patients of the same molecular subtype are quite different. Our results suggest each tumor possesses its own architecture, which may result in different diagnosis, prognosis, and drug responses. Remarkable ITH exists in the two patients we have studied, providing a preliminary impression of ITH in rectal cancer.

  15. Sequence variation and phylogenetic analysis of envelope glycoprotein of hepatitis G virus.

    PubMed

    Lim, M Y; Fry, K; Yun, A; Chong, S; Linnen, J; Fung, K; Kim, J P

    1997-11-01

    A transfusion-transmissible agent provisionally designated hepatitis G virus (HGV) was recently identified. In this study, we examined the variability of the HGV genome by analysing sequences in the putative envelope region from 72 isolates obtained from diverse geographical sources. The 1561 nucleotide sequence of the E1/E2/NS2a region of HGV was determined from 12 isolates, and compared with three published sequences. The most variability was observed in 400 nucleotides at the N terminus of E2. We next analysed this 400 nucleotide envelope variable region (EV) from an additional 60 HGV isolates. This sequence varied considerably among the 75 isolates, with overall identity ranging from 79.3% to 99.5% at the nucleotide level, and from 83.5% to 100% at the amino acid level. However, hypervariable regions were not identified. Phylogenetic analyses indicated that the 75 HGV isolates belong to a single genotype. A single-tier distribution of evolutionary distances was observed among the 15 E1/E2/NS2a sequences and the 75 EV sequences. In contrast, 11 isolates of HCV were analysed and showed a three-tiered distribution, representing genotypes, subtypes, and isolates. The 75 isolates of HGV fell into four clusters on the phylogenetic tree. Tight geographical clustering was observed among the HGV isolates from Japan and Korea.

  16. Intraspecific ITS Variability in the Kingdom Fungi as Expressed in the International Sequence Databases and Its Implications for Molecular Species Identification

    PubMed Central

    Nilsson, R. Henrik; Kristiansson, Erik; Ryberg, Martin; Hallenberg, Nils; Larsson, Karl-Henrik

    2008-01-01

    The internal transcribed spacer (ITS) region of the nuclear ribosomal repeat unit is the most popular locus for species identification and subgeneric phylogenetic inference in sequence-based mycological research. The region is known to show certain variability even within species, although its intraspecific variability is often held to be limited and clearly separated from interspecific variability. The existence of such a divide between intra- and interspecific variability is implicitly assumed by automated approaches to species identification, but whether intraspecific variability indeed is negligible within the fungal kingdom remains contentious. The present study estimates the intraspecific ITS variability in all fungi presently available to the mycological community through the international sequence databases. Substantial differences were found within the kingdom, and the results are not easily correlated to the taxonomic affiliation or nutritional mode of the taxa considered. No single unifying yet stringent upper limit for intraspecific variability, such as the canonical 3% threshold, appears to be applicable with the desired outcome throughout the fungi. Our results caution against simplified approaches to automated ITS-based species delimitation and reiterate the need for taxonomic expertise in the translation of sequence data into species names. PMID:19204817

  17. Method for altering antibody light chain interactions

    DOEpatents

    Stevens, Fred J.; Stevens, Priscilla Wilkins; Raffen, Rosemarie; Schiffer, Marianne

    2002-01-01

    A method for recombinant antibody subunit dimerization including modifying at least one codon of a nucleic acid sequence to replace an amino acid occurring naturally in the antibody with a charged amino acid at a position in the interface segment of the light polypeptide variable region, the charged amino acid having a first polarity; and modifying at least one codon of the nucleic acid sequence to replace an amino acid occurring naturally in the antibody with a charged amino acid at a position in an interface segment of the heavy polypeptide variable region corresponding to a position in the light polypeptide variable region, the charged amino acid having a second polarity opposite the first polarity. Nucleic acid sequences which code for novel light chain proteins, the latter of which are used in conjunction with the inventive method, are also provided.

  18. SeqFIRE: a web application for automated extraction of indel regions and conserved blocks from protein multiple sequence alignments.

    PubMed

    Ajawatanawong, Pravech; Atkinson, Gemma C; Watson-Haigh, Nathan S; Mackenzie, Bryony; Baldauf, Sandra L

    2012-07-01

    Analyses of multiple sequence alignments generally focus on well-defined conserved sequence blocks, while the rest of the alignment is largely ignored or discarded. This is especially true in phylogenomics, where large multigene datasets are produced through automated pipelines. However, some of the most powerful phylogenetic markers have been found in the variable length regions of multiple alignments, particularly insertions/deletions (indels) in protein sequences. We have developed Sequence Feature and Indel Region Extractor (SeqFIRE) to enable the automated identification and extraction of indels from protein sequence alignments. The program can also extract conserved blocks and identify fast evolving sites using a combination of conservation and entropy. All major variables can be adjusted by the user, allowing them to identify the sets of variables most suited to a particular analysis or dataset. Thus, all major tasks in preparing an alignment for further analysis are combined in a single flexible and user-friendly program. The output includes a numbered list of indels, alignments in NEXUS format with indels annotated or removed and indel-only matrices. SeqFIRE is a user-friendly web application, freely available online at www.seqfire.org/.
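
    The abstract mentions identifying fast-evolving sites from conservation and entropy, and extracting indel regions from an alignment. The sketch below illustrates the general idea with per-column Shannon entropy and a simple gap-column scan. It is not SeqFIRE's implementation: the function names, the toy alignment, and the decision to ignore gaps when computing entropy are all assumptions made for illustration.

```python
import math
from collections import Counter


def column_entropy(column: str) -> float:
    """Shannon entropy (bits) of the residues in one alignment column.

    Assumption for this sketch: gap characters ('-') are ignored.
    """
    residues = [c for c in column if c != "-"]
    if not residues:
        return 0.0
    total = len(residues)
    counts = Counter(residues)
    return sum(-(n / total) * math.log2(n / total) for n in counts.values())


def alignment_entropies(alignment: list[str]) -> list[float]:
    """Entropy of every column in a list of equal-length aligned sequences."""
    if len({len(seq) for seq in alignment}) != 1:
        raise ValueError("all sequences must have the same aligned length")
    return [column_entropy("".join(col)) for col in zip(*alignment)]


def indel_columns(alignment: list[str]) -> list[int]:
    """0-based indices of columns containing at least one gap."""
    return [i for i, col in enumerate(zip(*alignment)) if "-" in col]


# Hypothetical protein alignment: column 3 is an indel, column 5 is fully variable
toy_alignment = ["MKV-LA", "MKVGLS", "MRV-LT"]
print(alignment_entropies(toy_alignment))
print(indel_columns(toy_alignment))
```

    Columns with entropy 0 are fully conserved in this toy data; higher values flag more variable sites, which a user-adjustable threshold could then separate into conserved blocks and variable regions.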

  19. Method to amplify variable sequences without imposing primer sequences

    DOEpatents

    Bradbury, Andrew M.; Zeytun, Ahmet

    2006-11-14

    The present invention provides methods of amplifying target sequences without including regions flanking the target sequence in the amplified product or imposing amplification primer sequences on the amplified product. Also provided are methods of preparing a library from such amplified target sequences.

  20. Phylogenetic Network for European mtDNA

    PubMed Central

    Finnilä, Saara; Lehtonen, Mervi S.; Majamaa, Kari

    2001-01-01

    The sequence in the first hypervariable segment (HVS-I) of the control region has been used as a source of evolutionary information in most phylogenetic analyses of mtDNA. Population genetic inference would benefit from a better understanding of the variation in the mtDNA coding region, but, thus far, complete mtDNA sequences have been rare. We determined the nucleotide sequence in the coding region of mtDNA from 121 Finns, by conformation-sensitive gel electrophoresis and subsequent sequencing and by direct sequencing of the D loop. Furthermore, 71 sequences from our previous reports were included, so that the samples represented all the mtDNA haplogroups present in the Finnish population. We found a total of 297 variable sites in the coding region, which allowed the compilation of unambiguous phylogenetic networks. The D loop harbored 104 variable sites, and, in most cases, these could be localized within the coding-region networks, without discrepancies. Interestingly, many homoplasies were detected in the coding region. Nucleotide variation in the rRNA and tRNA genes was 6%, and that in the third nucleotide positions of structural genes amounted to 22% of that in the HVS-I. The complete networks enabled the relationships between the mtDNA haplogroups to be analyzed. Phylogenetic networks based on the entire coding-region sequence in mtDNA provide a rich source for further population genetic studies, and complete sequences make it easier to differentiate between disease-causing mutations and rare polymorphisms. PMID:11349229

  1. Characterization of the variable-number tandem repeats in vrrA from different Bacillus anthracis isolates

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Jackson, P.J.; Walthers, E.A.; Richmond, K.L.

    1997-04-01

    PCR analysis of 198 Bacillus anthracis isolates revealed a variable region of DNA sequence differing in length among the isolates. Five polymorphisms differed by the presence of two to six copies of the 12-bp tandem repeat 5'-CAATATCAACAA-3'. This variable-number tandem repeat (VNTR) region is located within a larger sequence containing one complete open reading frame that encodes a putative 30-kDa protein. Length variation did not change the reading frame of the encoded protein and only changed the copy number of a 4-amino-acid sequence (QYQQ) from 2 to 6. The structure of the VNTR region suggests that these multiple repeats are generated by recombination or polymerase slippage. Protein structures predicted from the reverse-translated DNA sequence suggest that any structural changes in the encoded protein are confined to the region encoded by the VNTR sequence. Copy number differences in the VNTR region were used to define five different B. anthracis alleles. Characterization of 198 isolates revealed allele frequencies of 6.1, 17.7, 59.6, 5.6, and 11.1% sequentially from shorter to longer alleles. The high degree of polymorphism in the VNTR region provides a criterion for assigning isolates to five allelic categories. There is a correlation between categories and geographic distribution. Such molecular markers can be used to monitor the epidemiology of anthrax outbreaks in domestic and native herbivore populations. 22 refs., 4 figs., 3 tabs.
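
    The relationship between the 12-bp repeat and the QYQQ peptide can be checked directly: the repeat is four codons long, so adding or removing a copy keeps the reading frame. The sketch below translates the repeat with the standard genetic code and counts tandem copies in a sequence. The flanking bases, function names, and the assumption that the repeat is read in frame from its first base are illustrative and not taken from the report.

```python
REPEAT = "CAATATCAACAA"  # 12-bp repeat unit quoted in the abstract

# Only the two codons this repeat uses (standard genetic code)
CODON_TABLE = {"CAA": "Q", "TAT": "Y"}


def translate(dna: str) -> str:
    """Translate complete codons, assuming the frame starts at the first base."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def max_tandem_copies(sequence: str, unit: str = REPEAT) -> int:
    """Largest number of back-to-back copies of `unit` found in `sequence`."""
    best = 0
    start = sequence.find(unit)
    while start != -1:
        copies = 0
        pos = start
        while sequence.startswith(unit, pos):
            copies += 1
            pos += len(unit)
        best = max(best, copies)
        start = sequence.find(unit, start + 1)
    return best


# Hypothetical allele: made-up flanks around four copies of the repeat
toy_allele = "GATTACA" + REPEAT * 4 + "TTGACC"
print(translate(REPEAT))              # QYQQ
print(max_tandem_copies(toy_allele))  # 4
```

    Because each copy adds exactly 12 bp, the shortest (2-copy) and longest (6-copy) alleles described above differ by 48 bp, which is the kind of length difference a PCR fragment-size assay can resolve.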

  2. Comparative and Evolutionary Analyses of Meloidogyne spp. Based on Mitochondrial Genome Sequences

    PubMed Central

    García, Laura Evangelina; Sánchez-Puerta, M. Virginia

    2015-01-01

    Molecular taxonomy and evolution of nematodes have been recently the focus of several studies. Mitochondrial sequences were proposed as an alternative for precise identification of Meloidogyne species, to study intraspecific variability and to follow maternal lineages. We characterized the mitochondrial genomes (mtDNAs) of the root knot nematodes M. floridensis, M. hapla and M. incognita. These were AT rich (81–83%) and highly compact, encoding 12 proteins, 2 rRNAs, and 22 tRNAs. Comparisons with published mtDNAs of M. chitwoodi, M. incognita (another strain) and M. graminicola revealed that they share protein and rRNA gene order but differ in the order of tRNAs. The mtDNAs of M. floridensis and M. incognita were strikingly similar (97–100% identity for all coding regions). In contrast, M. floridensis, M. chitwoodi, M. hapla and M. graminicola showed 65–84% nucleotide identity for coding regions. Variable mitochondrial sequences are potentially useful for evolutionary and taxonomic studies. We developed a molecular taxonomic marker by sequencing a highly-variable ~2 kb mitochondrial region, nad5-cox1, from 36 populations of root-knot nematodes to elucidate relationships within the genus Meloidogyne. Isolates of five species formed monophyletic groups and showed little intraspecific variability. We also present a thorough analysis of the mitochondrial region cox2-rrnS. Phylogenies based on either mitochondrial region had good discrimination power but could not discriminate between M. arenaria, M. incognita and M. floridensis. PMID:25799071

  3. The rDNA ITS region in the lessepsian marine angiosperm Halophila stipulacea (Forssk.) Aschers. (Hydrocharitaceae): intragenomic variability and putative pseudogenic sequences.

    PubMed

    Ruggiero, Maria Valeria; Procaccini, Gabriele

    2004-01-01

    Halophila stipulacea is a dioecious marine angiosperm, widely distributed along the western coasts of the Indian Ocean and the Red Sea. This species is thought to be a Lessepsian immigrant that entered the Mediterranean Sea from the Red Sea after the opening of the Suez Canal (1869). Previous studies have revealed both high phenotypic and genetic variability in Halophila stipulacea populations from the western Mediterranean basin. In order to test the hypothesis of a Lessepsian introduction, we compare genetic polymorphism between putative native (Red Sea) and introduced (Mediterranean) populations through rDNA ITS region (ITS1-5.8S-ITS2) sequence analysis. A high degree of intraindividual variability of ITS sequences was found. Most of the intragenomic polymorphism was due to pseudogenic sequences, present in almost all individuals. Features of ITS functional sequences and pseudogenes are described. Possible causes for the lack of homogenization of ITS paralogues within individuals are discussed.

  4. Estimating Bacterial Diversity for Ecological Studies: Methods, Metrics, and Assumptions

    PubMed Central

    Birtel, Julia; Walser, Jean-Claude; Pichon, Samuel; Bürgmann, Helmut; Matthews, Blake

    2015-01-01

    Methods to estimate microbial diversity have developed rapidly in an effort to understand the distribution and diversity of microorganisms in natural environments. For bacterial communities, the 16S rRNA gene is the phylogenetic marker gene of choice, but most studies select only a specific region of the 16S rRNA to estimate bacterial diversity. Whereas biases derived from DNA extraction, primer choice and PCR amplification are well documented, we here address how the choice of variable region can influence a wide range of standard ecological metrics, such as species richness, phylogenetic diversity, β-diversity and rank-abundance distributions. We have used Illumina paired-end sequencing to estimate the bacterial diversity of 20 natural lakes across Switzerland derived from three trimmed variable 16S rRNA regions (V3, V4, V5). Species richness, phylogenetic diversity, community composition, β-diversity, and rank-abundance distributions differed significantly between 16S rRNA regions. Overall, patterns of diversity quantified by the V3 and V5 regions were more similar to one another than those assessed by the V4 region. Similar results were obtained when analyzing the datasets with different sequence similarity thresholds used during sequence clustering and when the same analysis was used on a reference dataset of sequences from the Greengenes database. In addition we also measured species richness from the same lake samples using ARISA Fingerprinting, but did not find a strong relationship between species richness estimated by Illumina and ARISA. We conclude that the selection of 16S rRNA region significantly influences the estimation of bacterial diversity and species distributions and that caution is warranted when comparing data from different variable regions as well as when using different sequencing techniques. PMID:25915756
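
    Two of the metrics named in this record, species richness and the rank-abundance distribution, can be illustrated with a minimal Python sketch. The input format (a dictionary of per-OTU read counts for one sample) and the function names are assumptions made for illustration; the study's own pipeline is not described here.

```python
# Illustrative sketch only; input format and names are hypothetical.

def species_richness(counts):
    """Number of taxa observed at least once in a sample."""
    return sum(1 for c in counts.values() if c > 0)


def rank_abundance(counts):
    """Relative abundances of observed taxa, sorted from most to least common."""
    total = sum(counts.values())
    return sorted((c / total for c in counts.values() if c > 0), reverse=True)
```

    Computing both functions on the same sample for each amplified region (for example V3, V4 and V5) would give directly comparable per-region summaries.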

  5. Towards the Rational Design of a Candidate Vaccine against Pregnancy Associated Malaria: Conserved Sequences of the DBL6ε Domain of VAR2CSA

    PubMed Central

    Badaut, Cyril; Bertin, Gwladys; Rustico, Tatiana; Fievet, Nadine; Massougbodji, Achille; Gaye, Alioune; Deloron, Philippe

    2010-01-01

    Background Placental malaria is a disease linked to the sequestration of Plasmodium falciparum infected red blood cells (IRBC) in the placenta, leading to reduced materno-fetal exchanges and to local inflammation. One of the virulence factors of P. falciparum involved in cytoadherence to chondroitin sulfate A, its placental receptor, is the adhesive protein VAR2CSA. Its localisation on the surface of IRBC makes it accessible to the immune system. VAR2CSA contains six DBL domains. The DBL6ε domain is the most variable. High variability constitutes a means for the parasite to evade the host immune response. The DBL6ε domain could constitute a very attractive basis for a vaccine candidate but its reported variability necessitates, for antigenic characterisations, identifying and classifying commonalities across isolates. Methodology/Principal Findings Local alignment analysis of the DBL6ε domain revealed that it is not as variable as previously described. Variability is concentrated in seven regions present on the surface of the DBL6ε domain. The main goal of our work is to classify and group variable sequences that will simplify further research to determine dominant epitopes. Firstly, variable sequences were grouped following their average percent pairwise identity (APPI). Groups comprising many variable sequences sharing low variability were found. Secondly, ELISA experiments following the IgG recognition of a recombinant DBL6ε domain, and of peptides mimicking its seven variable blocks, allowed us to determine an APPI cut-off and to isolate groups represented by a single consensus sequence. Conclusions/Significance A new sequence approach is used to compare variable regions in sequences that have extensive segmental gene relationship. Using this approach, the VAR2CSA DBL6 domain is composed of 7 variable blocks with limited polymorphism. Each variable block is composed of a limited number of consensus types. Based on peptide based ELISA, variable blocks with 85% or greater sequence identity are expected to be recognized equally well by antibody and can be considered the same consensus type. Therefore, the analysis of the antibody response against the classified small number of sequences should be helpful to determine epitopes. PMID:20585655
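
    The grouping criterion in this record can be sketched in Python as follows. This is a minimal illustration that assumes the variable-block sequences are already aligned to equal length with no gaps; the function names are hypothetical, and the exact way the authors computed APPI is not stated in the abstract. Only the 85% cut-off comes from the record.

```python
# Illustrative sketch only; assumes pre-aligned, gap-free, equal-length sequences.
from itertools import combinations


def percent_identity(a, b):
    """Percent of aligned positions at which two sequences carry the same residue."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    matches = sum(x == y for x, y in zip(a, b))
    return 100.0 * matches / len(a)


def appi(seqs):
    """Average percent pairwise identity over all unordered pairs in a group."""
    pairs = list(combinations(seqs, 2))
    return sum(percent_identity(a, b) for a, b in pairs) / len(pairs)


def same_consensus_type(a, b, cutoff=85.0):
    """Apply the 85% identity cut-off quoted in the abstract to one pair."""
    return percent_identity(a, b) >= cutoff
```

    With a ten-residue block, one mismatch gives 90% identity (same consensus type under the cut-off) and two mismatches give 80% (different type).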

  6. FUNGAL-SPECIFIC PCR PRIMERS DEVELOPED FOR ANALYSIS OF THE ITS REGION OF ENVIRONMENTAL DNA EXTRACTS

    EPA Science Inventory

    Background The Internal Transcribed Spacer (ITS) regions of fungal ribosomal DNA (rDNA) are highly variable sequences of great importance in distinguishing fungal species by PCR analysis. Previously published PCR primers available for amplifying these sequences from environmenta...

  7. An RNAi in silico approach to find an optimal shRNA cocktail against HIV-1

    PubMed Central

    2010-01-01

    Background HIV-1 can be inhibited by RNA interference in vitro through the expression of short hairpin RNAs (shRNAs) that target conserved genome sequences. In silico shRNA design for HIV has lacked a detailed study of virus variability constituting a possible breaking point in a clinical setting. We designed shRNAs against HIV-1 considering the variability observed in naïve and drug-resistant isolates available at public databases. Methods A Bioperl-based algorithm was developed to automatically scan multiple sequence alignments of HIV, while evaluating the possibility of identifying dominant and subdominant viral variants that could be used as efficient silencing molecules. Student t-test and Bonferroni Dunn correction test were used to assess statistical significance of our findings. Results Our in silico approach identified the most common viral variants within highly conserved genome regions, with a calculated free energy of ≥ -6.6 kcal/mol. This is crucial for strand loading to RISC complex and for a predicted silencing efficiency score, which could be used in combination for achieving over 90% silencing. Resistant and naïve isolate variability revealed that the most frequent shRNA per region targets a maximum of 85% of viral sequences. Adding more divergent sequences maintained this percentage. Specific sequence features that have been found to be related with higher silencing efficiency were hardly accomplished in conserved regions, even when lower entropy values correlated with better scores. We identified a conserved region among most HIV-1 genomes, which meets as many sequence features for efficient silencing. Conclusions HIV-1 variability is an obstacle to achieving absolute silencing using shRNAs designed against a consensus sequence, mainly because there are many functional viral variants. Our shRNA cocktail could be truly effective at silencing dominant and subdominant naïve viral variants. Additionally, resistant isolates might be targeted under specific antiretroviral selective pressure, but in both cases these should be tested exhaustively prior to clinical use. PMID:21172023

  8. Intergenic Sequence Ribotyping using a region neighboring dkgB links genovar to Kauffman-White serotype of Salmonella enterica

    USDA-ARS's Scientific Manuscript database

    Previous research identified that the 5S ribosomal (rrn) gene and associated flanking sequences that are closely linked to the dkgB gene of Salmonella enterica were highly variable between serotypes, but not between subpopulations within the same serotype (PMID: 17005008). The degree of variability ...

  9. The phosphotransferase system-dependent sucrose utilization regulon in enteropathogenic Escherichia coli strains is located in a variable chromosomal region containing iap sequences.

    PubMed

    Treviño-Quintanilla, Luis Gerardo; Escalante, Adelfo; Caro, Alma Delia; Martínez, Alfredo; González, Ricardo; Puente, José Luis; Bolívar, Francisco; Gosset, Guillermo

    2007-01-01

    The capacity to utilize sucrose as a carbon and energy source (Scr(+) phenotype) is a highly variable trait among Escherichia coli strains. In this study, seven enteropathogenic E. coli (EPEC) strains from different sources were studied for their capacity to grow using sucrose. Liquid media cultures showed that all analyzed strains have the Scr(+) phenotype and two distinct groups were defined: one of five and another of two strains displaying doubling times of 67 and 125 min, respectively. The genes conferring the Scr(+) phenotype in one of the fast-growing strains (T19) were cloned and sequenced. Comparative sequence analysis revealed that this strain possesses the scr regulon genes scrKYABR, encoding phosphoenolpyruvate:phosphotransferase system-dependent sucrose transport and utilization activities. Transcript level quantification revealed sucrose-dependent induction of scrK and scrR genes in fast-growing strains, whereas no transcripts were detected in slow-growing strains. Sequence comparison analysis revealed that the scr genes in strain T19 are almost identical to those present in the scr regulon of prototype EPEC E2348/69 and in both strains, the scr genes are inserted in the chromosomal intergenic region of hypothetical genes ygcE and ygcF. Comparison of the ygcE-ygcF intergenic region sequence of strains MG1655, enterohemorrhagic EDL933, uropathogenic ECFT073 and EPEC T19-E2348/69 revealed that the number of extragenic highly repeated iap sequences corresponded to nine, four, two and none, respectively. These results show that the iap sequence-containing chromosomal ygcE-ygcF intergenic region is highly variable in E. coli. Copyright (c) 2007 S. Karger AG, Basel.

  10. rpoB Gene Sequence-Based Identification of Aerobic Gram-Positive Cocci of the Genera Streptococcus, Enterococcus, Gemella, Abiotrophia, and Granulicatella

    PubMed Central

    Drancourt, Michel; Roux, Véronique; Fournier, Pierre-Edouard; Raoult, Didier

    2004-01-01

    We developed a new molecular tool based on rpoB gene (encoding the beta subunit of RNA polymerase) sequencing to identify streptococci. We first sequenced the complete rpoB gene for Streptococcus anginosus, S. equinus, and Abiotrophia defectiva. Sequences were aligned with those of S. pyogenes, S. agalactiae, and S. pneumoniae available in GenBank. Using an in-house analysis program (SVARAP), we identified a 740-bp variable region surrounded by conserved, 20-bp zones and, by using these conserved zones as PCR primer targets, we amplified and sequenced this variable region in an additional 30 Streptococcus, Enterococcus, Gemella, Granulicatella, and Abiotrophia species. This region exhibited 71.2 to 99.3% interspecies homology. We therefore applied our identification system by PCR amplification and sequencing to a collection of 102 streptococci and 60 bacterial isolates belonging to other genera. Amplicons were obtained in streptococci and Bacillus cereus, and sequencing allowed us to make a correct identification of streptococci. Molecular signatures were determined for the discrimination of closely related species within the S. pneumoniae-S. oralis-S. mitis group and the S. agalactiae-S. difficile group. These signatures allowed us to design a S. pneumoniae-specific PCR and sequencing primer pair. PMID:14766807
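
    The idea of a variable region flanked by conserved primer-target zones can be illustrated with a short Python sketch over a gap-free multiple alignment. This is not the SVARAP algorithm, which the abstract does not describe; the function names, the strict "every residue identical" definition of conservation, and the default zone length are assumptions for illustration.

```python
# Illustrative sketch only; not the SVARAP program, whose method is not described here.

def conserved_columns(alignment):
    """True for each alignment column where all sequences share one residue."""
    return [len(set(col)) == 1 for col in zip(*alignment)]


def conserved_zones(alignment, zone_len=20):
    """Start indices of windows of `zone_len` consecutive fully conserved columns."""
    flags = conserved_columns(alignment)
    return [i for i in range(len(flags) - zone_len + 1)
            if all(flags[i:i + zone_len])]


def variable_fraction(alignment, start, end):
    """Fraction of columns in [start, end) that differ between sequences."""
    flags = conserved_columns(alignment)[start:end]
    return flags.count(False) / len(flags)
```

    A candidate marker would then be a stretch with a high variable fraction lying between two conserved zones long enough to serve as primer sites.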

  11. Comparison of the theoretical and real-world evolutionary potential of a genetic circuit

    NASA Astrophysics Data System (ADS)

    Razo-Mejia, M.; Boedicker, J. Q.; Jones, D.; DeLuna, A.; Kinney, J. B.; Phillips, R.

    2014-04-01

    With the development of next-generation sequencing technologies, many large scale experimental efforts aim to map genotypic variability among individuals. This natural variability in populations fuels many fundamental biological processes, ranging from evolutionary adaptation and speciation to the spread of genetic diseases and drug resistance. An interesting and important component of this variability is present within the regulatory regions of genes. As these regions evolve, accumulated mutations lead to modulation of gene expression, which may have consequences for the phenotype. A simple model system where the link between genetic variability, gene regulation and function can be studied in detail is missing. In this article we develop a model to explore how the sequence of the wild-type lac promoter dictates the fold-change in gene expression. The model combines single-base pair resolution maps of transcription factor and RNA polymerase binding energies with a comprehensive thermodynamic model of gene regulation. The model was validated by predicting and then measuring the variability of lac operon regulation in a collection of natural isolates. We then implement the model to analyze the sensitivity of the promoter sequence to the regulatory output, and predict the potential for regulation to evolve due to point mutations in the promoter region.

  12. Change in IgHV Mutational Status of CLL Suggests Origin From Multiple Clones.

    PubMed

    Osman, Afaf; Gocke, Christopher D; Gladstone, Douglas E

    2017-02-01

    Fluorescence in situ hybridization and immunoglobulin (Ig) heavy-chain variable-region (IgHV) mutational status are used to predict outcome in chronic lymphocytic leukemia (CLL). Although DNA aberrations change over time, IgHV sequences and mutational status are considered stable. In a retrospective review, 409 CLL patients, between 2008 and 2015, had IgHV analysis: 56 patients had multiple analyses performed. Seven patients' IgHV results changed: 2 from unmutated to mutated and 5 from mutated to unmutated IgHV sequence. Three concurrently changed their variable heavy-chain sequence. Secondary to allelic exclusion, 2 of the new variable heavy chains produced were biologically nonplausible. The existence of these new nonplausible heavy-chain variable regions suggests either the CLL cancer stem-cell maintains the ability to rearrange a previously silenced IgH allele or more likely that the cancer stem-cell produced at least 2 subclones, suggesting that the CLL cancer stem cell exists before the process of allelic exclusion occurs. Copyright © 2016 Elsevier Inc. All rights reserved.

  13. Setting up a probe based, closed tube real-time PCR assay for focused detection of variable sequence alterations.

    PubMed

    Becságh, Péter; Szakács, Orsolya

    2014-10-01

    During diagnostic workflow when detecting sequence alterations, sometimes it is important to design an algorithm that includes screening and direct tests in combination. Normally the use of direct test, which is mainly sequencing, is limited. There is an increased need for effective screening tests, with "closed tube" during the whole process and therefore decreasing the risk of PCR product contamination. The aim of this study was to design such a closed tube, detection probe based screening assay to detect different kinds of sequence alterations in the exon 11 of the human c-kit gene region. Inside this region there are variable possible deletions and single nucleotide changes. During assay setup, several probe chemistry formats were screened and tested. After some optimization steps the TaqMan probe format was selected.

  14. Diversity in the 18S SSU rRNA V4 hyper-variable region of Theileria spp. in Cape buffalo (Syncerus caffer) and cattle from southern Africa.

    PubMed

    Mans, Ben J; Pienaar, Ronel; Latif, Abdalla A; Potgieter, Fred T

    2011-05-01

    Sequence variation within the 18S SSU rRNA V4 hyper-variable region can affect the accuracy of real-time hybridization probe-based diagnostics for the detection of Theileria spp. infections. This is relevant for assays that use non-specific primers, such as the real-time hybridization assay for T. parva (Sibeko et al. 2008). To assess the effect of sequence variation on this test, the Theileria 18S gene from 62 buffalo and 49 cattle samples was cloned and ∼1000 clones sequenced. Twenty-six genotypes were detected which included known and novel genotypes for the T. buffeli, T. mutans, T. taurotragi and T. velifera clades. A novel genotype related to T. sp. (sable) was also detected in 1 bovine sample. Theileria genotypic diversity was higher in buffalo compared to cattle. Polymorphism within the T. parva hyper-variable region was confirmed by aberrant real-time melting peaks and supported by sequencing of the S5 ribosomal gene. Analysis of the S5 gene suggests that this gene can be a marker for species differentiation. T. parva, T. sp. (buffalo) and T. sp. (bougasvlei) remain the only genotypes amplified by the primer set of the hybridization assay. Therefore, the 18S sequence diversity observed does not seem to affect the current real-time hybridization assay for T. parva.

  15. Assessment of sequence variability in a p23 gene region within and among three genotypes of the Theileria orientalis complex from south-eastern Australia.

    PubMed

    Perera, Piyumali K; Gasser, Robin B; Jabbar, Abdul

    2015-03-01

    Oriental theileriosis is a tick-borne, protozoan disease of cattle caused by one or more genotypes of Theileria orientalis complex. In this study, we assessed sequence variability in a region of the 23kDa piroplasm membrane protein (p23) gene within and among three T. orientalis genotypes (designated buffeli, chitose and ikeda) in south-eastern Australia. Genomic DNA (n=100) was extracted from blood of infected cattle from various locations endemic for oriental theileriosis and tested by polymerase chain reaction (PCR)-coupled mutation scanning (single-strand conformation polymorphism (SSCP)) and targeted sequencing analysis. Eight distinct sequences represented all DNA samples, and three genotypes were found: buffeli (n=3), chitose (3) and ikeda (2). Nucleotide pairwise comparisons among these eight sequences revealed considerably higher variability among the genotypes (6.6-11.7%) than within them (0-1.9%), indicating that the p23 gene region allows the accurate identification of T. orientalis genotypes. In the future, we will combine this gene with other molecular markers to study the genetic structure of T. orientalis populations in Australasia, which will pave the way to establish a highly sensitive and specific PCR-based assay for genotypic diagnosis of infection and for assessing levels of parasitaemia in cattle. Copyright © 2014 Elsevier GmbH. All rights reserved.

  16. B-Bolivia, an Allele of the Maize b1 Gene with Variable Expression, Contains a High Copy Retrotransposon-Related Sequence Immediately Upstream

    PubMed Central

    Selinger, David A.; Chandler, Vicki L.

    2001-01-01

    The maize (Zea mays) b1 gene encodes a transcription factor that regulates the anthocyanin pigment pathway. Of the b1 alleles with distinct tissue-specific expression, B-Peru and B-Bolivia are the only alleles that confer seed pigmentation. B-Bolivia produces variable and weaker seed expression but darker, more regular plant expression relative to B-Peru. Our experiments demonstrated that B-Bolivia is not expressed in the seed when transmitted through the male. When transmitted through the female the proportion of kernels pigmented and the intensity of pigment varied. Molecular characterization of B-Bolivia demonstrated that it shares the first 530 bp of the upstream region with B-Peru, a region sufficient for seed expression. Immediately upstream of 530 bp, B-Bolivia is completely divergent from B-Peru. These sequences share sequence similarity to retrotransposons. Transient expression assays of various promoter constructs identified a 33-bp region in B-Bolivia that can account for the reduced aleurone pigment amounts (40%) observed with B-Bolivia relative to B-Peru. Transgenic plants carrying the B-Bolivia promoter proximal region produced pigmented seeds. Similar to native B-Bolivia, some transgene loci are variably expressed in seeds. In contrast to native B-Bolivia, the transgene loci are expressed in seeds when transmitted through both the male and female. Some transgenic lines produced pigment in vegetative tissues, but the tissue-specificity was different from B-Bolivia, suggesting the introduced sequences do not contain the B-Bolivia plant-specific regulatory sequences. We hypothesize that the chromatin context of the B-Bolivia allele controls its epigenetic seed expression properties, which could be influenced by the adjacent highly repeated retrotransposon sequence. PMID:11244116

  17. Pre-main sequence variables in young cluster Stock 18

    NASA Astrophysics Data System (ADS)

    Sinha, Tirthendu; Sharma, Saurabh; Pandey, Rakesh; Pandey, Anil Kumar

    2018-04-01

    We have carried out multi-epoch deep I band photometry of the open cluster Stock 18 to search for variable stars in star forming regions. In the present study, we identified 65 periodic and 217 non-periodic variable stars. The periods of most of the periodic variables are between 2 hours and 15 days and their magnitude varies between 0.05 and 0.6 mag. We have derived spectral energy distributions for 48 probable pre-main sequence variables. Their average age and mass are 2.7 ± 0.3 Myr and 2.7 ± 0.2 M☉, respectively.

  18. The mitochondrial genome of Pocillopora (Cnidaria: Scleractinia) contains two variable regions: the putative D-loop and a novel ORF of unknown function.

    PubMed

    Flot, Jean-François; Tillier, Simon

    2007-10-15

    The complete mitochondrial genomes of two individuals attributed to different morphospecies of the scleractinian coral genus Pocillopora have been sequenced. Both genomes, respectively 17,415 and 17,422 nt long, share the presence of a previously undescribed ORF encoding a putative protein made up of 302 amino acids and of unknown function. Surprisingly, this ORF turns out to be the second most variable region of the mitochondrial genome (1% nucleotide sequence difference between the two individuals) after the putative control region (1.5% sequence difference). Except for the presence of this ORF and for the location of the putative control region, the mitochondrial genome of Pocillopora is organized in a fashion similar to the other scleractinian coral genomes published to date. For the first time in a cnidarian, a putative second origin of replication is described based on its secondary structure similar to the stem-loop structure of O(L), the origin of L-strand replication in vertebrates.

  19. The bias associated with amplicon sequencing does not affect the quantitative assessment of bacterial community dynamics.

    PubMed

    Ibarbalz, Federico M; Pérez, María Victoria; Figuerola, Eva L M; Erijman, Leonardo

    2014-01-01

    The performance of two sets of primers targeting variable regions of the 16S rRNA gene V1-V3 and V4 was compared in their ability to describe changes of bacterial diversity and temporal turnover in full-scale activated sludge. Duplicate sets of high-throughput amplicon sequencing data of the two 16S rRNA regions shared a collection of core taxa that were observed across a series of twelve monthly samples, although the relative abundance of each taxon was substantially different between regions. A case in point was the changes in the relative abundance of filamentous bacteria Thiothrix, which caused a large effect on diversity indices, but only in the V1-V3 data set. Yet the relative abundance of Thiothrix in the amplicon sequencing data from both regions correlated with the estimation of its abundance determined using fluorescence in situ hybridization. In nonmetric multidimensional analysis samples were distributed along the first ordination axis according to the sequenced region rather than according to sample identities. The dynamics of microbial communities indicated that V1-V3 and the V4 regions of the 16S rRNA gene yielded comparable patterns of: 1) the changes occurring within the communities along fixed time intervals, 2) the slow turnover of activated sludge communities and 3) the rate of species replacement calculated from the taxa-time relationships. The temperature was the only operational variable that showed significant correlation with the composition of bacterial communities over time for the sets of data obtained with both pairs of primers. In conclusion, we show that despite the bias introduced by amplicon sequencing, the variable regions V1-V3 and V4 can be confidently used for the quantitative assessment of bacterial community dynamics, and provide a proper qualitative account of general taxa in the community, especially when the data are obtained over a convenient time window rather than at a single time point.

  20. ITS all right mama: investigating the formation of chimeric sequences in the ITS2 region by DNA metabarcoding analyses of fungal mock communities of different complexities.

    PubMed

    Bjørnsgaard Aas, Anders; Davey, Marie Louise; Kauserud, Håvard

    2017-07-01

    The formation of chimeric sequences can create significant methodological bias in PCR-based DNA metabarcoding analyses. During mixed-template amplification of barcoding regions, chimera formation is frequent and well documented. However, profiling of fungal communities typically uses the more variable rDNA region ITS. Due to a larger research community, tools for chimera detection have been developed mainly for the 16S/18S markers. However, these tools are widely applied to the ITS region without verification of their performance. We examined the rate of chimera formation during amplification and 454 sequencing of the ITS2 region from fungal mock communities of different complexities. We evaluated the chimera detecting ability of two common chimera-checking algorithms: perseus and uchime. Large proportions of the chimeras reported were false positives. No false negatives were found in the data set. Verified chimeras accounted for only 0.2% of the total ITS2 reads, which is considerably less than what is typically reported in 16S and 18S metabarcoding analyses. Verified chimeric 'parent sequences' had significantly higher per cent identity to one another than to random members of the mock communities. Community complexity increased the rate of chimera formation. GC content was higher around the verified chimeric break points, potentially facilitating chimera formation through base pair mismatching in the neighbouring regions of high similarity in the chimeric region. We conclude that the hypervariable nature of the ITS region seems to buffer the rate of chimera formation in comparison with other, less variable barcoding regions, due to shorter regions of high sequence similarity. © 2016 John Wiley & Sons Ltd.

  1. DsaV methyltransferase and its isoschizomers contain a conserved segment that is similar to the segment in HhaI methyltransferase that is in contact with DNA bases.

    PubMed Central

    Gopal, J; Yebra, M J; Bhagwat, A S

    1994-01-01

    The methyltransferase (MTase) in the DsaV restriction-modification system methylates within 5'-CCNGG sequences. We have cloned the gene for this MTase and determined its sequence. The predicted sequence of the MTase protein contains sequence motifs conserved among all cytosine-5 MTases and is most similar to other MTases that methylate CCNGG sequences, namely M.ScrFI and M.SsoII. All three MTases methylate the internal cytosine within their recognition sequence. The 'variable' region within the three enzymes that methylate CCNGG can be aligned with the sequences of two enzymes that methylate CCWGG sequences. Remarkably, two segments within this region contain significant similarity with the region of M.HhaI that is known to contact DNA bases. These alignments suggest that many cytosine-5 MTases are likely to interact with DNA using a similar structural framework. PMID:7971279

  2. Importance of Viral Sequence Length and Number of Variable and Informative Sites in Analysis of HIV Clustering.

    PubMed

    Novitsky, Vlad; Moyo, Sikhulile; Lei, Quanhong; DeGruttola, Victor; Essex, M

    2015-05-01

    To improve the methodology of HIV cluster analysis, we addressed how analysis of HIV clustering is associated with parameters that can affect the outcome of viral clustering. The extent of HIV clustering and tree certainty was compared between 401 HIV-1C near full-length genome sequences and subgenomic regions retrieved from the LANL HIV Database. Sliding window analysis was based on 99 windows of 1,000 bp and 45 windows of 2,000 bp. Potential associations between the extent of HIV clustering and sequence length and the number of variable and informative sites were evaluated. The near full-length genome HIV sequences showed the highest extent of HIV clustering and the highest tree certainty. At the bootstrap threshold of 0.80 in maximum likelihood (ML) analysis, 58.9% of near full-length HIV-1C sequences but only 15.5% of partial pol sequences (ViroSeq) were found in clusters. Among HIV-1 structural genes, pol showed the highest extent of clustering (38.9% at a bootstrap threshold of 0.80), although it was significantly lower than in the near full-length genome sequences. The extent of HIV clustering was significantly higher for sliding windows of 2,000 bp than 1,000 bp. We found a strong association between the sequence length and proportion of HIV sequences in clusters, and a moderate association between the number of variable and informative sites and the proportion of HIV sequences in clusters. In HIV cluster analysis, the extent of detectable HIV clustering is directly associated with the length of viral sequences used, as well as the number of variable and informative sites. Near full-length genome sequences could provide the most informative HIV cluster analysis. Selected subgenomic regions with a high extent of HIV clustering and high tree certainty could also be considered as a second choice.
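
    The sliding-window counting of variable and informative sites can be sketched in Python as below. Several points are assumptions rather than details from the record: "informative" is taken to mean parsimony-informative (at least two states each present in at least two sequences), the alignment is assumed gap-free, and the step size is a free parameter because the abstract gives only the window sizes and window counts.

```python
# Illustrative sketch only; site definitions and step size are assumptions.
from collections import Counter


def is_variable(column):
    """A site is variable if more than one state occurs in the column."""
    return len(set(column)) > 1


def is_parsimony_informative(column):
    """At least two states, each present in at least two sequences."""
    counts = Counter(column)
    return sum(1 for c in counts.values() if c >= 2) >= 2


def sliding_windows(alignment, size, step):
    """Return (start, n_variable, n_informative) for each window."""
    columns = list(zip(*alignment))
    results = []
    for start in range(0, len(columns) - size + 1, step):
        window = columns[start:start + size]
        n_var = sum(is_variable(col) for col in window)
        n_inf = sum(is_parsimony_informative(col) for col in window)
        results.append((start, n_var, n_inf))
    return results
```

    Plotting the per-window counts against the proportion of sequences found in clusters for the same window would be one way to examine the association described in the abstract.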

  3. Importance of Viral Sequence Length and Number of Variable and Informative Sites in Analysis of HIV Clustering

    PubMed Central

    Novitsky, Vlad; Moyo, Sikhulile; Lei, Quanhong; DeGruttola, Victor

    2015-01-01

    To improve the methodology of HIV cluster analysis, we addressed how analysis of HIV clustering is associated with parameters that can affect the outcome of viral clustering. The extent of HIV clustering and tree certainty was compared between 401 HIV-1C near full-length genome sequences and subgenomic regions retrieved from the LANL HIV Database. Sliding window analysis was based on 99 windows of 1,000 bp and 45 windows of 2,000 bp. Potential associations between the extent of HIV clustering and sequence length and the number of variable and informative sites were evaluated. The near full-length genome HIV sequences showed the highest extent of HIV clustering and the highest tree certainty. At the bootstrap threshold of 0.80 in maximum likelihood (ML) analysis, 58.9% of near full-length HIV-1C sequences but only 15.5% of partial pol sequences (ViroSeq) were found in clusters. Among HIV-1 structural genes, pol showed the highest extent of clustering (38.9% at a bootstrap threshold of 0.80), although it was significantly lower than in the near full-length genome sequences. The extent of HIV clustering was significantly higher for sliding windows of 2,000 bp than 1,000 bp. We found a strong association between the sequence length and proportion of HIV sequences in clusters, and a moderate association between the number of variable and informative sites and the proportion of HIV sequences in clusters. In HIV cluster analysis, the extent of detectable HIV clustering is directly associated with the length of viral sequences used, as well as the number of variable and informative sites. Near full-length genome sequences could provide the most informative HIV cluster analysis. Selected subgenomic regions with a high extent of HIV clustering and high tree certainty could also be considered as a second choice. PMID:25560745

  4. Spatial organization of the gastrointestinal microbiota in urban Canada geese

    USGS Publications Warehouse

    Drovetski, Sergei V.; O'Mahoney, Michael; Ransome, Emma J.; Matterson, Kenan O.; Lim, Haw Chuan; Chesser, Terry; Graves, Gary R.

    2018-01-01

    Recent reviews identified the reliance on fecal or cloacal samples as a significant limitation hindering our understanding of the avian gastrointestinal (gut) microbiota and its function. We investigated the microbiota of the esophagus, duodenum, cecum, and colon of a wild urban population of Canada goose (Branta canadensis). From a population sample of 30 individuals, we sequenced the V4 region of the 16S SSU rRNA on an Illumina MiSeq and obtained 8,628,751 sequences with a median of 76,529 per sample. These sequences were assigned to 420 bacterial OTUs and a single archaeon. Firmicutes, Proteobacteria, and Bacteroidetes accounted for 90% of all sequences. Microbiotas from the four gut regions differed significantly in their richness, composition, and variability among individuals. Microbial communities of the esophagus were the most distinctive whereas those of the colon were the least distinctive, reflecting the physical downstream mixing of regional microbiotas. The downstream mixing of regional microbiotas was also responsible for the majority of observed co-occurrence patterns among microbial families. Our results indicate that fecal and cloacal samples inadequately represent the complex patterns of richness, composition, and variability of the gut microbiota and obscure patterns of co-occurrence of microbial lineages.

  5. Mitochondrial sequences of Seriatopora corals show little agreement with morphology and reveal the duplication of a tRNA gene near the control region

    NASA Astrophysics Data System (ADS)

    Flot, J.-F.; Licuanan, W. Y.; Nakano, Y.; Payri, C.; Cruaud, C.; Tillier, S.

    2008-12-01

    The taxonomy of corals of the genus Seriatopora has not previously been studied using molecular sequence markers. As a first step toward a re-evaluation of species boundaries in this genus, mitochondrial sequence variability was analyzed in 51 samples collected from Okinawa, New Caledonia, and the Philippines. Four clusters of sequences were detected that showed little concordance with species currently recognized on a morphological basis. The most likely explanation is that the skeletal characters used for species identification are highly variable (polymorphic or phenotypically plastic); alternative explanations include introgression/hybridization, or deep coalescence and the retention of ancestral mitochondrial polymorphisms. In all individuals sequenced, two copies of trnW were found on either side of the atp8 gene near the putative D-loop, a novel mitochondrial gene arrangement that may have arisen from a duplication of the trnW-atp8 region followed by a deletion of one atp8.

  6. In silico segmentations of lentivirus envelope sequences

    PubMed Central

    Boissin-Quillon, Aurélia; Piau, Didier; Leroux, Caroline

    2007-01-01

    Background: The gene encoding the envelope of lentiviruses exhibits a considerable plasticity, particularly the region which encodes the surface (SU) glycoprotein. Interestingly, mutations do not appear uniformly along the sequence of SU, but they are clustered in restricted areas, called variable (V) regions, which are interspersed with relatively more stable regions, called constant (C) regions. We look for specific signatures of C/V regions, using hidden Markov models constructed with SU sequences of the equine, human, small ruminant and simian lentiviruses. Results: Our models yield clear and accurate delimitations of the C/V regions, when the test set and the training set were made up of sequences of the same lentivirus, but also when they were made up of sequences of different lentiviruses. Interestingly, the models predicted the different regions of lentiviruses such as the bovine and feline lentiviruses, not used in the training set. Models based on composite training sets produce accurate segmentations of sequences of all these lentiviruses. Conclusion: Our results suggest that each C/V region has a specific statistical oligonucleotide composition, and that the C (respectively V) regions of one of these lentiviruses are statistically more similar to the C (respectively V) regions of the other lentiviruses, than to the V (respectively C) regions of the same lentivirus. PMID:17376229
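
    As a rough illustration of hidden Markov model segmentation into constant (C) and variable (V) states, the following Python sketch runs Viterbi decoding on a two-state model. It is a toy under stated assumptions, not the authors' models: the abstract describes oligonucleotide composition, whereas this sketch uses single-nucleotide emissions, and every probability below is made up for demonstration.

```python
import math

# Toy parameters, invented for illustration only.
STATES = ("C", "V")
START = {"C": 0.5, "V": 0.5}
TRANS = {"C": {"C": 0.9, "V": 0.1}, "V": {"C": 0.1, "V": 0.9}}
EMIT = {
    "C": {"A": 0.4, "C": 0.1, "G": 0.1, "T": 0.4},
    "V": {"A": 0.1, "C": 0.4, "G": 0.4, "T": 0.1},
}


def viterbi(seq, states=STATES, start=START, trans=TRANS, emit=EMIT):
    """Return the most probable state path for a non-empty sequence, as a string."""
    # Work in log space to avoid underflow on long sequences.
    score = [{s: math.log(start[s]) + math.log(emit[s][seq[0]]) for s in states}]
    back = []
    for sym in seq[1:]:
        prev, cur, ptr = score[-1], {}, {}
        for s in states:
            best = max(states, key=lambda p: prev[p] + math.log(trans[p][s]))
            cur[s] = prev[best] + math.log(trans[best][s]) + math.log(emit[s][sym])
            ptr[s] = best
        score.append(cur)
        back.append(ptr)
    # Trace back from the best final state.
    path = [max(states, key=lambda s: score[-1][s])]
    for ptr in reversed(back):
        path.append(ptr[path[-1]])
    return "".join(reversed(path))
```

    Each character of the returned string labels one input position, so runs of "C" and "V" delimit the predicted regions. In practice the parameters would be estimated from annotated training sequences rather than set by hand.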

  7. Genome variability of foot-and-mouth disease virus during the short period of the 2010 epidemic in Japan.

    PubMed

    Nishi, Tatsuya; Yamada, Manabu; Fukai, Katsuhiko; Shimada, Nobuaki; Morioka, Kazuki; Yoshida, Kazuo; Sakamoto, Kenichi; Kanno, Toru; Yamakawa, Makoto

    2017-02-01

    Foot-and-mouth disease virus (FMDV) is highly contagious and has a high mutation rate, leading to extensive genetic variation. To investigate how FMDV genetically evolves over a short period of an epidemic after initial introduction into an FMD-free area, whole L-fragment sequences of 104 FMDVs isolated from the 2010 epidemic in Japan, which continued for less than three months, were determined and phylogenetically and comparatively analyzed. Phylogenetic analysis of whole L-fragment sequences showed that these isolates were classified into a single group, indicating that FMDV was introduced into Japan in the epidemic via a single introduction. Nucleotide sequences of 104 virus isolates showed more than 99.56% pairwise identity rates without any genetic deletion or insertion, although no sequences were completely identical with each other. These results indicate that genetic substitutions of FMDV occurred gradually and constantly during the epidemic and that generation of an extensive mutant virus could have been prevented by a rapid eradication strategy. From comparative analysis of variability of each FMDV protein coding region, VP4 and 2C regions showed the highest average identity rates and invariant rates, and were confirmed as highly conserved. In contrast, the protein coding regions VP2 and VP1 were confirmed to be highly variable regions with the lowest average identity rates and invariant rates, respectively. Our data demonstrate the importance of a rapid eradication strategy in an FMD epidemic and provide valuable information on the genome variability of FMDV during the short period of an epidemic. Copyright © 2016 The Authors. Published by Elsevier B.V. All rights reserved.
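
    A pairwise identity rate of the kind reported in this record can be sketched in a few lines of Python. The sketch assumes equal-length sequences with no insertions or deletions (as the abstract reports for these isolates); the function names are hypothetical and this is not the authors' software.

```python
from itertools import combinations


def percent_identity(a, b):
    """Percentage of positions at which two equal-length sequences match."""
    if len(a) != len(b):
        raise ValueError("sequences must have equal length")
    matches = sum(x == y for x, y in zip(a, b))
    return 100.0 * matches / len(a)


def pairwise_identities(seqs):
    """Map each index pair (i, j), i < j, to its percent identity."""
    return {
        (i, j): percent_identity(seqs[i], seqs[j])
        for i, j in combinations(range(len(seqs)), 2)
    }
```

    Taking `min(pairwise_identities(seqs).values())` gives the lowest pairwise identity in a set, which is the kind of lower bound the abstract quotes; averaging the values per coding region would give region-level identity rates.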

  8. Analysis of whole genome sequences of 16 strains of rubella virus from the United States, 1961-2009.

    PubMed

    Abernathy, Emily; Chen, Min-hsin; Bera, Jayati; Shrivastava, Susmita; Kirkness, Ewen; Zheng, Qi; Bellini, William; Icenogle, Joseph

    2013-01-25

    Rubella virus is the causative agent of rubella, a mild rash illness, and a potent teratogenic agent when contracted by a pregnant woman. Global rubella control programs target the reduction and elimination of congenital rubella syndrome. Phylogenetic analysis of partial sequences of rubella viruses has contributed to virus surveillance efforts and played an important role in demonstrating that indigenous rubella viruses have been eliminated in the United States. Sixteen wild-type rubella viruses were chosen for whole genome sequencing. All 16 viruses were collected in the United States from 1961 to 2009 and are from 8 of the 13 known rubella genotypes. Phylogenetic analysis of 30 whole genome sequences produced a maximum likelihood tree giving high bootstrap values for all genotypes except provisional genotype 1a. Comparison of the 16 new complete sequences and 14 previously sequenced wild-type viruses found regions with clusters of variable amino acids. The 5' 250 nucleotides of the genome are more conserved than any other part of the genome. Genotype specific deletions in the untranslated region between the non-structural and structural open reading frames were observed for genotypes 2B and genotype 1G. No evidence was seen for recombination events among the 30 viruses. The analysis presented here is consistent with previous reports on the genetic characterization of rubella virus genomes. Conserved and variable regions were identified and additional evidence for genotype specific nucleotide deletions in the intergenic region was found. Phylogenetic analysis confirmed genotype groupings originally based on structural protein coding region sequences, which provides support for the WHO nomenclature for genetic characterization of wild-type rubella viruses.

  9. Intraspecific variation in Cryptocaryon irritans.

    PubMed

    Diggles, B K; Adlard, R D

    1997-01-01

    Intraspecific variation in the ciliate Cryptocaryon irritans was examined using sequences of the first internal transcribed spacer region (ITS-1) of ribosomal DNA (rDNA) combined with developmental and morphological characters. Amplified rDNA sequences consisting of 151 bases of the flanking 18 S and 5.8 S regions, and the entire ITS-1 region (169 or 170 bases), were determined and compared for 16 isolates of C. irritans from Australia, Israel and the USA. There was one variable base between isolates in the 18 S region and 11 variable bases in the ITS-1 region. Despite their similar morphology, significant sequence variation (4.1% divergence) and developmental differences indicate that Australian C. irritans isolates from estuarine (Moreton Bay) and coral reef (Heron Island) environments are distinct. The Heron Island isolate was genetically closer to morphologically dissimilar isolates from Israel (1.8% divergence) and the USA (2.3% divergence) than it was to the Moreton Bay isolates. Three isolates maintained in our laboratory since February 1994 differed in sequence from earlier laboratory isolates (2.9% to 3.5% divergence), even though all were similar morphologically and originated from the same source. During this time the sequence of the isolates from wild fish in Moreton Bay remained unchanged. These genetic differences indicate the existence of a founder effect in laboratory populations of C. irritans. The genetic variation found here, combined with known morphological and developmental differences, is used to characterise four strains of C. irritans.

  10. IG and TR single chain fragment variable (scFv) sequence analysis: a new advanced functionality of IMGT/V-QUEST and IMGT/HighV-QUEST.

    PubMed

    Giudicelli, Véronique; Duroux, Patrice; Kossida, Sofia; Lefranc, Marie-Paule

    2017-06-26

    IMGT®, the international ImMunoGeneTics information system® ( http://www.imgt.org ), was created in 1989 in Montpellier, France (CNRS and Montpellier University) to manage the huge and complex diversity of the antigen receptors, and is at the origin of immunoinformatics, a science at the interface between immunogenetics and bioinformatics. Immunoglobulins (IG) or antibodies and T cell receptors (TR) are managed and described in the IMGT® databases and tools at the level of receptor, chain and domain. The analysis of the IG and TR variable (V) domain rearranged nucleotide sequences is performed by IMGT/V-QUEST (online since 1997, 50 sequences per batch) and, for next generation sequencing (NGS), by IMGT/HighV-QUEST, the high throughput version of IMGT/V-QUEST (portal begun in 2010, 500,000 sequences per batch). In vitro combinatorial libraries of engineered antibody single chain Fragment variable (scFv) which mimic the in vivo natural diversity of the immune adaptive responses are extensively screened for the discovery of novel antigen binding specificities. However, the analysis of NGS full length scFv (~850 bp) represents a challenge as they contain two V domains connected by a linker and there is no tool for the analysis of two V domains in a single chain. The functionality "Analysis of single chain Fragment variable (scFv)" has been implemented in IMGT/V-QUEST and, for NGS, in IMGT/HighV-QUEST for the analysis of the two V domains of IG and TR scFv. It proceeds in five steps: search for a first closest V-REGION, full characterization of the first V-(D)-J-REGION, then search for a second V-REGION and full characterization of the second V-(D)-J-REGION, and finally linker delimitation. For each sequence or NGS read, positions of the 5'V-DOMAIN, linker and 3'V-DOMAIN in the scFv are provided in the 'V-orientated' sense.
Each V-DOMAIN is fully characterized (gene identification, sequence description, junction analysis, characterization of mutations and amino acid changes). The functionality is generic and can analyse any IG or TR single chain nucleotide sequence containing two V domains, provided that the corresponding species IMGT reference directory is available. The "Analysis of single chain Fragment variable (scFv)" implemented in IMGT/V-QUEST and, for NGS, in IMGT/HighV-QUEST provides the identification and full characterization of the two V domains of full-length scFv (~850 bp) nucleotide sequences from combinatorial libraries. The analysis can also be performed on concatenated paired chains of expressed antigen receptor IG or TR repertoires.

  11. The mini-exon genes of three Phytomonas isolates that differ in plant tissue tropism.

    PubMed

    Sturm, N R; Fernandes, O; Campbell, D A

    1995-08-01

    The tandem mini-exon gene repeat is an ideal diagnostic target for trypanosomatids because it includes sequences that are conserved absolutely coupled with regions of extreme variability. We have exploited these features and the polymerase chain reaction to differentiate Phytomonas strains isolated from phloem, fruit or latex of various host plants. While the transcribed regions are nearly identical, the intergenic sequences are variable in size and content (130-332 base pairs). The mini-exon genes of these phytomonads can therefore be distinguished from each other and from the corresponding genes in insect trypanosomes, with which they are oft confused.

  12. Remarkable sequence conservation of the last intron in the PKD1 gene.

    PubMed

    Rodova, Marianna; Islam, M Rafiq; Peterson, Kenneth R; Calvet, James P

    2003-10-01

    The last intron of the PKD1 gene (intron 45) was found to have exceptionally high sequence conservation across four mammalian species: human, mouse, rat, and dog. This conservation did not extend to the comparable intron in pufferfish. Pairwise comparisons for intron 45 showed 91% identity (human vs. dog) to 100% identity (mouse vs. rat) for an average for all four species of 94% identity. In contrast, introns 43 and 44 of the PKD1 gene had average pairwise identities of 57% and 54%, and exons 43, 44, and 45 and the coding region of exon 46 had average pairwise identities of 80%, 84%, 82%, and 80%. Intron 45 is 90 to 95 bp in length, with the major region of sequence divergence being in a central 4-bp to 9-bp variable region. RNA secondary structure analysis of intron 45 predicts a branching stem-loop structure in which the central variable region lies in one loop and the putative branch point sequence lies in another loop, suggesting that the intron adopts a specific stem-loop structure that may be important for its removal. Although intron 45 appears to conform to the class of small, G-triplet-containing introns that are spliced by a mechanism utilizing intron definition, its high sequence conservation may be a reflection of constraints imposed by a unique mechanism that coordinates splicing of this last PKD1 intron with polyadenylation.

  13. Plastid primers for angiosperm phylogenetics and phylogeography.

    PubMed

    Prince, Linda M

    2015-06-01

    PCR primers are available for virtually every region of the plastid genome. Selection of which primer pairs to use is second only to selection of the genic region. This is particularly true for research at the species/population interface. Primer pairs for 130 regions of the chloroplast genome were evaluated in 12 species distributed across the angiosperms. Likelihood of amplification success was inferred based upon number and location of mismatches to target sequence. Intraspecific sequence variability was evaluated under three different criteria in four species. Many published primer pairs should work across all taxa sampled, with the exception of failure due to genomic reorganization events. Universal barcoding primers were the least likely to work (65% success). The list of most variable regions for use within species has little in common with the lists identified in prior studies. Published primer sequences should amplify a diversity of flowering plant DNAs, even those designed for specific taxonomic groups. "Universal" primers may have extremely limited utility. There was little consistency in likelihood of amplification success for any given publication across lineages or within lineage across publications.

  14. Common 5S rRNA variants are likely to be accepted in many sequence contexts

    NASA Technical Reports Server (NTRS)

    Zhang, Zhengdong; D'Souza, Lisa M.; Lee, Youn-Hyung; Fox, George E.

    2003-01-01

    Over evolutionary time RNA sequences which are successfully fixed in a population are selected from among those that satisfy the structural and chemical requirements imposed by the function of the RNA. These sequences together comprise the structure space of the RNA. In principle, a comprehensive understanding of RNA structure and function would make it possible to enumerate which specific RNA sequences belong to a particular structure space and which do not. We are using bacterial 5S rRNA as a model system to attempt to identify principles that can be used to predict which sequences do or do not belong to the 5S rRNA structure space. One promising idea is the very intuitive notion that frequently seen sequence changes in an aligned data set of naturally occurring 5S rRNAs would be widely accepted in many other 5S rRNA sequence contexts. To test this hypothesis, we first developed well-defined operational definitions for a Vibrio region of the 5S rRNA structure space and what is meant by a highly variable position. Fourteen sequence variants (10 point changes and 4 base-pair changes) were identified in this way, which, by the hypothesis, would be expected to incorporate successfully in any of the known sequences in the Vibrio region. All 14 of these changes were constructed and separately introduced into the Vibrio proteolyticus 5S rRNA sequence where they are not normally found. Each variant was evaluated for its ability to function as a valid 5S rRNA in an E. coli cellular context. It was found that 93% (13/14) of the variants tested are likely valid 5S rRNAs in this context. In addition, seven variants were constructed that, although present in the Vibrio region, did not meet the stringent criteria for a highly variable position. In this case, 86% (6/7) are likely valid. As a control we also examined seven variants that are seldom or never seen in the Vibrio region of 5S rRNA sequence space. In this case only two of seven were found to be potentially valid. 
The results demonstrate that changes that occur multiple times in a local region of RNA sequence space in fact usually will be accepted in any sequence context in that same local region.

  15. TIA: algorithms for development of identity-linked SNP islands for analysis by massively parallel DNA sequencing.

    PubMed

    Farris, M Heath; Scott, Andrew R; Texter, Pamela A; Bartlett, Marta; Coleman, Patricia; Masters, David

    2018-04-11

    Single nucleotide polymorphisms (SNPs) located within the human genome have been shown to have utility as markers of identity in the differentiation of DNA from individual contributors. Massively parallel DNA sequencing (MPS) technologies and human genome SNP databases allow for the design of suites of identity-linked target regions, amenable to sequencing in a multiplexed and massively parallel manner. Therefore, tools are needed for leveraging the genotypic information found within SNP databases for the discovery of genomic targets that can be evaluated on MPS platforms. The SNP island target identification algorithm (TIA) was developed as a user-tunable system to leverage SNP information within databases. Using data within the 1000 Genomes Project SNP database, human genome regions were identified that contain globally ubiquitous identity-linked SNPs and that were responsive to targeted resequencing on MPS platforms. Algorithmic filters were used to exclude target regions that did not conform to user-tunable SNP island target characteristics. To validate the accuracy of TIA for discovering these identity-linked SNP islands within the human genome, SNP island target regions were amplified from 70 contributor genomic DNA samples using the polymerase chain reaction. Multiplexed amplicons were sequenced using the Illumina MiSeq platform, and the resulting sequences were analyzed for SNP variations. 166 putative identity-linked SNPs were targeted in the identified genomic regions. Of the 309 SNPs that provided discerning power across individual SNP profiles, 74 previously undefined SNPs were identified during evaluation of targets from individual genomes. Overall, DNA samples of 70 individuals were uniquely identified using a subset of the suite of identity-linked SNP islands. 
TIA offers a tunable genome search tool for the discovery of targeted genomic regions that are scalable in the population frequency and numbers of SNPs contained within the SNP island regions. It also allows the definition of sequence length and sequence variability of the target region as well as the less variable flanking regions for tailoring to MPS platforms. As shown in this study, TIA can be used to discover identity-linked SNP islands within the human genome, useful for differentiating individuals by targeted resequencing on MPS technologies.

  16. Using msa-2b as a molecular marker for genotyping Mexican isolates of Babesia bovis.

    PubMed

    Genis, Alma D; Perez, Jocelin; Mosqueda, Juan J; Alvarez, Antonio; Camacho, Minerva; Muñoz, Maria de Lourdes; Rojas, Carmen; Figueroa, Julio V

    2009-12-01

    Variable merozoite surface antigens of Babesia bovis are exposed glycoproteins having a role in erythrocyte invasion. Members of this gene family include msa-1 and msa-2 (msa-2c, msa-2a(1), msa-2a(2) and msa-2b). To determine the sequence variation among B. bovis Mexican isolates using msa-2b as a genetic marker, PCR amplicons corresponding to msa-2b were cloned and plasmids carrying the corresponding inserts were purified and sequenced. Comparative analysis of nucleotide and deduced amino acid sequences revealed distinct degrees of variability and identity among the coding gene sequences obtained from 16 geographically different Mexican B. bovis isolates and a reference strain. Clustal-W multiple alignments of the MSA-2b deduced amino acid sequences performed with the 17 B. bovis Mexican isolates, revealed the identification of three genotypes with a distinct set each of amino acid residues present at the variable region: Genotype I represented by the MO7 strain (in vitro culture-derived from the Mexico isolate) as well as RAD, Chiapas-1, Tabasco and Veracruz-3 isolates; Genotype II, represented by the Jalisco, Mexico and Veracruz-2 isolates; and Genotype III comprising the sequences from most of the isolates studied, Tamaulipas-1, Chiapas-2, Guerrero-1, Nayarit, Quintana Roo, Nuevo Leon, Tamaulipas-2, Yucatan and Guerrero-2. Moreover, these three genotypes could be discriminated against each other by using a PCR-RFLP approach. The results suggest that occurrence of indels within the variable region of msa-2b sequences can be useful markers for identifying a particular genotype present in field populations of B. bovis isolated from infected cattle in Mexico.

  17. The Bias Associated with Amplicon Sequencing Does Not Affect the Quantitative Assessment of Bacterial Community Dynamics

    PubMed Central

    Figuerola, Eva L. M.; Erijman, Leonardo

    2014-01-01

    The performance of two sets of primers targeting variable regions of the 16S rRNA gene V1–V3 and V4 was compared in their ability to describe changes of bacterial diversity and temporal turnover in full-scale activated sludge. Duplicate sets of high-throughput amplicon sequencing data of the two 16S rRNA regions shared a collection of core taxa that were observed across a series of twelve monthly samples, although the relative abundance of each taxon was substantially different between regions. A case in point was the changes in the relative abundance of filamentous bacteria Thiothrix, which caused a large effect on diversity indices, but only in the V1–V3 data set. Yet the relative abundance of Thiothrix in the amplicon sequencing data from both regions correlated with the estimation of its abundance determined using fluorescence in situ hybridization. In nonmetric multidimensional analysis samples were distributed along the first ordination axis according to the sequenced region rather than according to sample identities. The dynamics of microbial communities indicated that V1–V3 and the V4 regions of the 16S rRNA gene yielded comparable patterns of: 1) the changes occurring within the communities along fixed time intervals, 2) the slow turnover of activated sludge communities and 3) the rate of species replacement calculated from the taxa–time relationships. The temperature was the only operational variable that showed significant correlation with the composition of bacterial communities over time for the sets of data obtained with both pairs of primers. In conclusion, we show that despite the bias introduced by amplicon sequencing, the variable regions V1–V3 and V4 can be confidently used for the quantitative assessment of bacterial community dynamics, and provide a proper qualitative account of general taxa in the community, especially when the data are obtained over a convenient time window rather than at a single time point. PMID:24923665

  18. Nuclear and mitochondrial rDNA variability in Crinipellis perniciosa from different geographic origins and hosts.

    PubMed

    de Arruda, Maricília C C; Ferreira, Marisa A S V; Miller, Robert N G; Resende, Mário Lúcio V; Felipe, Maria Sueli S

    2003-01-01

    Genetic variability in Crinipellis perniciosa, the causal organism of witches' broom disease in Theobroma cacao, was determined in strains originating from T. cacao and other susceptible host species Heteropterys acutifolia and Solanum lycocarpum in Brazil, in order to clarify host specificity and geographical variability. RFLP analysis of the ribosomal DNA ITS regions (rDNA ITS), and the mitochondrial DNA small subunit ribosomal DNA gene (mtDNA SSU rDNA) did not reveal any genetic variability in 120 tested strains, possibly serving only as species level markers. Genetic variability was observed in the ribosomal DNA IGS spacer region, in terms of IGS size, RFLPs and sequence data. Phylogenetic analyses (using CLUSTAL W, PHYLIP and TREEVIEW) indicated considerable differences between C. perniciosa strains from T. cacao and those from H. acutifolia (85-86%) and S. lycocarpum (95-96%). Sequence differences also indicated that C. perniciosa from T. cacao in Bahia is less variable (98%) when compared to the pathogen on T. cacao in Amazonas (97-98%), perhaps reflecting a recent introduction to T. cacao in Bahia.

  19. A multiple-alignment based primer design algorithm for genetically highly variable DNA targets

    PubMed Central

    2013-01-01

    Background: Primer design for highly variable DNA sequences is difficult, and experimental success requires attention to many interacting constraints. The advent of next-generation sequencing methods allows the investigation of rare variants otherwise hidden deep in large populations, but requires attention to population diversity and primer localization in relatively conserved regions, in addition to recognized constraints typically considered in primer design. Results: Design constraints include degenerate sites to maximize population coverage, matching of melting temperatures, optimizing de novo sequence length, finding optimal bio-barcodes to allow efficient downstream analyses, and minimizing risk of dimerization. To facilitate primer design addressing these and other constraints, we created a novel computer program (PrimerDesign) that automates this complex procedure. We show its powers and limitations and give examples of successful designs for the analysis of HIV-1 populations. Conclusions: PrimerDesign is useful for researchers who want to design DNA primers and probes for analyzing highly variable DNA populations. It can be used to design primers for PCR, RT-PCR, Sanger sequencing, next-generation sequencing, and other experimental protocols targeting highly variable DNA samples. PMID:23965160
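
    The idea of degenerate sites that maximize population coverage can be illustrated with a short Python sketch that collapses an ungapped alignment of a candidate primer site into an IUPAC consensus. This is not the PrimerDesign program or its algorithm; the function names are hypothetical, and the input is assumed to be uppercase A/C/G/T only.

```python
# Standard IUPAC nucleotide ambiguity codes, keyed by the set of bases they cover.
IUPAC = {
    frozenset("A"): "A", frozenset("C"): "C", frozenset("G"): "G", frozenset("T"): "T",
    frozenset("AG"): "R", frozenset("CT"): "Y", frozenset("CG"): "S", frozenset("AT"): "W",
    frozenset("GT"): "K", frozenset("AC"): "M", frozenset("CGT"): "B", frozenset("AGT"): "D",
    frozenset("ACT"): "H", frozenset("ACG"): "V", frozenset("ACGT"): "N",
}


def degenerate_consensus(aligned_sites):
    """Return an IUPAC consensus covering every base seen in each column."""
    return "".join(IUPAC[frozenset(col)] for col in zip(*aligned_sites))


def degeneracy(primer):
    """Number of distinct plain sequences a degenerate primer represents."""
    sizes = {code: len(bases) for bases, code in IUPAC.items()}
    total = 1
    for code in primer:
        total *= sizes[code]
    return total
```

    A lower degeneracy means fewer distinct oligonucleotides in the primer pool, so a design tool would typically weigh coverage against degeneracy alongside the other constraints the abstract lists (melting temperature, dimerization risk).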

  20. Sequence variability of Campylobacter temperate bacteriophages

    PubMed Central

    Clark, Clifford G; Ng, Lai-King

    2008-01-01

    Background: Prophages integrated within the chromosomes of Campylobacter jejuni isolates have been demonstrated very recently. Prior work with Campylobacter temperate bacteriophages, as well as evidence from prophages in other enteric bacteria, suggests these prophages might have a role in the biology and virulence of the organism. However, very little is known about the genetic variability of Campylobacter prophages which, if present, could lead to differential phenotypes in isolates carrying the phages versus those that do not. As a first step in the characterization of C. jejuni prophages, we investigated the distribution of prophage DNA within a C. jejuni population and assessed the DNA and protein sequence variability within a subset of the putative prophages found. Results: Southern blotting of C. jejuni DNA using probes from genes within the three putative prophages of the C. jejuni sequenced strain RM 1221 demonstrated the presence of at least one prophage gene in a large proportion (27/35) of isolates tested. Of these, 15 were positive for 5 or more of the 7 Campylobacter Mu-like phage 1 (CMLP 1, also designated Campylobacter jejuni integrated element 1, or CJIE 1) genes tested. Twelve of these putative prophages were chosen for further analysis. DNA sequencing of a 9,000 to 11,000 nucleotide region of each prophage demonstrated a close homology with CMLP 1 in both gene order and nucleotide sequence. Structural and sequence variability, including short insertions, deletions, and allele replacements, were found within the prophage genomes, some of which would alter the protein products of the ORFs involved. No insertions of novel genes were detected within the sequenced regions. The 12 prophages and RM 1221 had a % G+C very similar to C. jejuni sequenced strains, as well as promoter regions characteristic of C. jejuni. None of the putative prophages were successfully induced and propagated, so it is not known if they were functional or if they represented remnant prophage DNA in the bacterial chromosomes. Conclusion: These putative prophages form a family of phages with conserved sequences, and appear to be adapted to Campylobacter. There was evidence for recombination among groups of prophages, suggesting that the prophages had a mosaic structure. In many of these properties, the Mu-like CMLP 1 homologs characterized in this study resemble temperate bacteriophages of enteric bacteria that are responsible for contributions to virulence and host adaptation. PMID:18366706

  1. Genetic variability in isolates of Chromobacterium violaceum from pulmonary secretion, water, and soil.

    PubMed

    Santini, A C; Magalhães, J T; Cascardo, J C M; Corrêa, R X

    2016-04-28

    Chromobacterium violaceum is a free-living Gram-negative bacillus usually found in the water and soil in tropical regions, which causes infections in humans. Chromobacteriosis is characterized by rapid dissemination and high mortality. The aim of this study was to detect the genetic variability among C. violaceum type strain ATCC 12472, and seven isolates from the environment and one from a pulmonary secretion from a chromobacteriosis patient from Ilhéus, Bahia. The molecular characterization of all samples was performed by polymerase chain reaction (PCR) sequencing and 16S rDNA analysis. Primers specific for two ATCC 12472 pathogenicity genes, hilA and yscD, as well as random amplified polymorphic DNA (RAPD), were used for PCR amplification and comparative sequencing of the products. For a more specific approach, the PCR products of 16S rDNA were digested with restriction enzymes. Seven of the samples, including type-strain ATCC 12472, were amplified by the hilA primers; these were subsequently sequenced. Gene yscD was amplified only in type-strain ATCC 12472. MspI and AluI digestion revealed 16S rDNA polymorphisms. These data allowed the generation of a dendrogram for each analysis. The isolates of C. violaceum have variability in random genomic regions, as demonstrated by RAPD. Also, these isolates have variability in pathogenicity genes, as demonstrated by sequencing and restriction enzyme digestion.

  2. Authentication of an endangered herb Changium smyrnioides from different producing areas based on rDNA ITS sequences and allele-specific PCR.

    PubMed

    Sun, Xiaoqin; Wei, Yanglian; Qin, Minjian; Guo, Qiaosheng; Guo, Jianlin; Zhou, Yifeng; Hang, Yueyu

    2012-03-01

    The rDNA ITS region of 18 samples of Changium smyrnioides from 7 areas and of 2 samples of Chuanminshen violaceum were sequenced and analyzed. The amplified ITS region of the samples, including a partial sequence of ITS1 and complete sequences of 5.8S and ITS2, had a total length of 555 bp. After complete alignment, there were 49 variable sites, of which 45 were informative, when gaps were treated as missing data. Samples of C. smyrnioides from different locations could be identified exactly based on the variable sites. The maximum parsimony (MP) and neighbor joining (NJ) trees constructed from the ITS sequences based on Kimura's two-parameter model showed that the genetic distances of the C. smyrnioides samples from different locations were not always related to their geographical distances. A specific primer set for allele-specific PCR authentication of C. violaceum from Jurong of Jiangsu was designed based on the SNP in the ITS sequence alignment. C. violaceum from the major genuine producing area in Jurong of Jiangsu could be identified exactly and quickly by allele-specific PCR.

  3. Minding the gap: Frequency of indels in mtDNA control region sequence data and influence on population genetic analyses

    USGS Publications Warehouse

    Pearce, J.M.

    2006-01-01

    Insertions and deletions (indels) result in sequences of various lengths when homologous gene regions are compared among individuals or species. Although indels are typically phylogenetically informative, occurrence and incorporation of these characters as gaps in intraspecific population genetic data sets are rarely discussed. Moreover, the impact of gaps on estimates of fixation indices, such as FST, has not been reviewed. Here, I summarize the occurrence and population genetic signal of indels among 60 published studies that involved alignments of multiple sequences from the mitochondrial DNA (mtDNA) control region of vertebrate taxa. Among 30 studies observing indels, an average of 12% of both variable and parsimony-informative sites were composed of these sites. There was no consistent trend between levels of population differentiation and the number of gap characters in a data block. Across all studies, the average influence on estimates of ΦST was small, explaining only an additional 1.8% of among population variance (range 0.0-8.0%). Studies most likely to observe an increase in ΦST with the inclusion of gap characters were those with < 20 variable sites, but a near equal number of studies with few variable sites did not show an increase. In contrast to studies at interspecific levels, the influence of indels for intraspecific population genetic analyses of control region DNA appears small, dependent upon total number of variable sites in the data block, and related to species-specific characteristics and the spatial distribution of mtDNA lineages that contain indels. © 2006 Blackwell Publishing Ltd.

  4. Spatiotemporal attention operator using isotropic contrast and regional homogeneity

    NASA Astrophysics Data System (ADS)

    Palenichka, Roman; Lakhssassi, Ahmed; Zaremba, Marek

    2011-04-01

    A multiscale operator for spatiotemporal isotropic attention is proposed to reliably extract attention points during image sequence analysis. Its consecutive local maxima indicate attention points as the centers of image fragments of variable size with high intensity contrast, region homogeneity, regional shape saliency, and temporal change presence. The scale-adaptive estimation of temporal change (motion) and its aggregation with the regional shape saliency contribute to the accurate determination of attention points in image sequences. Multilocation descriptors of an image sequence are extracted at the attention points in the form of a set of multidimensional descriptor vectors. A fast recursive implementation is also proposed to make the operator's computational complexity independent from the spatial scale size, which is the window size in the spatial averaging filter. Experiments on the accuracy of attention-point detection have proved the operator consistency and its high potential for multiscale feature extraction from image sequences.

  5. Human heavy chain disease protein WIS: implications for the organization of immunoglobulin genes.

    PubMed Central

    Franklin, E C; Prelli, F; Frangione, B

    1979-01-01

    Protein WIS is a human gamma3 heavy (H) chain disease immunoglobulin variant whose amino acid sequence is most readily interpreted by postulating that three residues of the amino terminus are followed by a deletion of most of the variable (VH) domain, which ends at the variable-constant (VC) joining region. Then there is a stretch of eight residues, three of which are unusual, while the other five have striking homology to the VC junction sequence. This is followed by a second deletion, which ends at the beginning of the quadruplicated hinge region. These findings are consistent with mutations resulting in deletions of most of the gene coding for the V region and CH1 domain followed by splicing at the VC joining region and at the hinge. These structural features fit well the notion of genetic discontinuity between V and C genes and also suggest similar mechanisms of excision and splicing in the interdomain regions of the C gene of the heavy chain. PMID:106391

  6. Fast discovery and visualization of conserved regions in DNA sequences using quasi-alignment

    PubMed Central

    2013-01-01

    Background Next Generation Sequencing techniques are producing enormous amounts of biological sequence data and analysis becomes a major computational problem. Currently, most analysis, especially the identification of conserved regions, relies heavily on Multiple Sequence Alignment and its various heuristics such as progressive alignment, whose run time grows with the square of the number and the length of the aligned sequences and requires significant computational resources. In this work, we present a method to efficiently discover regions of high similarity across multiple sequences without performing expensive sequence alignment. The method is based on approximating edit distance between segments of sequences using p-mer frequency counts. Then, efficient high-throughput data stream clustering is used to group highly similar segments into so called quasi-alignments. Quasi-alignments have numerous applications such as identifying species and their taxonomic class from sequences, comparing sequences for similarities, and, as in this paper, discovering conserved regions across related sequences. Results In this paper, we show that quasi-alignments can be used to discover highly similar segments across multiple sequences from related or different genomes efficiently and accurately. Experiments on a large number of unaligned 16S rRNA sequences obtained from the Greengenes database show that the method is able to identify conserved regions which agree with known hypervariable regions in 16S rRNA. Furthermore, the experiments show that the proposed method scales well for large data sets with a run time that grows only linearly with the number and length of sequences, whereas for existing multiple sequence alignment heuristics the run time grows super-linearly. Conclusion Quasi-alignment-based algorithms can detect highly similar regions and conserved areas across multiple sequences. 
Since the run time is linear and the sequences are converted into a compact clustering model, we are able to identify conserved regions fast or even interactively using a standard PC. Our method has many potential applications such as finding characteristic signature sequences for families of organisms and studying conserved and variable regions in, for example, 16S rRNA. PMID:24564200

  7. Fast discovery and visualization of conserved regions in DNA sequences using quasi-alignment.

    PubMed

    Nagar, Anurag; Hahsler, Michael

    2013-01-01

    Next Generation Sequencing techniques are producing enormous amounts of biological sequence data and analysis becomes a major computational problem. Currently, most analysis, especially the identification of conserved regions, relies heavily on Multiple Sequence Alignment and its various heuristics such as progressive alignment, whose run time grows with the square of the number and the length of the aligned sequences and requires significant computational resources. In this work, we present a method to efficiently discover regions of high similarity across multiple sequences without performing expensive sequence alignment. The method is based on approximating edit distance between segments of sequences using p-mer frequency counts. Then, efficient high-throughput data stream clustering is used to group highly similar segments into so called quasi-alignments. Quasi-alignments have numerous applications such as identifying species and their taxonomic class from sequences, comparing sequences for similarities, and, as in this paper, discovering conserved regions across related sequences. In this paper, we show that quasi-alignments can be used to discover highly similar segments across multiple sequences from related or different genomes efficiently and accurately. Experiments on a large number of unaligned 16S rRNA sequences obtained from the Greengenes database show that the method is able to identify conserved regions which agree with known hypervariable regions in 16S rRNA. Furthermore, the experiments show that the proposed method scales well for large data sets with a run time that grows only linearly with the number and length of sequences, whereas for existing multiple sequence alignment heuristics the run time grows super-linearly. Quasi-alignment-based algorithms can detect highly similar regions and conserved areas across multiple sequences. 
Since the run time is linear and the sequences are converted into a compact clustering model, we are able to identify conserved regions fast or even interactively using a standard PC. Our method has many potential applications such as finding characteristic signature sequences for families of organisms and studying conserved and variable regions in, for example, 16S rRNA.
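
    The idea of comparing segments by p-mer frequency counts rather than by alignment can be sketched as follows. This is a minimal illustration only: the abstract does not state the exact distance measure, so the Manhattan distance between count vectors is an assumption here, and the names `pmer_counts` and `pmer_distance` are hypothetical.

```python
from collections import Counter


def pmer_counts(segment, p=3):
    """Count every overlapping substring of length p in a sequence segment."""
    return Counter(segment[i:i + p] for i in range(len(segment) - p + 1))


def pmer_distance(seg_a, seg_b, p=3):
    """Manhattan distance between p-mer count vectors (assumed measure, see lead-in)."""
    counts_a, counts_b = pmer_counts(seg_a, p), pmer_counts(seg_b, p)
    # Counter returns 0 for p-mers missing from one of the segments.
    return sum(abs(counts_a[k] - counts_b[k]) for k in counts_a.keys() | counts_b.keys())


if __name__ == "__main__":
    print(pmer_distance("ACGTACGT", "ACGTACGT"))  # identical segments
    print(pmer_distance("ACGT", "ACGA", p=2))     # segments differing in one base
```

    Because each segment is reduced to a fixed set of counts in a single pass, the cost grows linearly with sequence length, which is consistent with the scaling behaviour the abstract describes.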

  8. Characterization of Lipooligosaccharide-Biosynthetic Loci of Campylobacter jejuni Reveals New Lipooligosaccharide Classes: Evidence of Mosaic Organizations

    PubMed Central

    Parker, Craig T.; Gilbert, Michel; Yuki, Nobuhiro; Endtz, Hubert P.; Mandrell, Robert E.

    2008-01-01

    The lipooligosaccharide (LOS) biosynthesis region is one of the more variable genomic regions between strains of Campylobacter jejuni. Indeed, eight classes of LOS biosynthesis loci have been established previously based on gene content and organization. In this study, we characterize additional classes of LOS biosynthesis loci and analyze various mechanisms that result in changes to LOS structures. To gain further insights into the genomic diversity of C. jejuni LOS biosynthesis region, we sequenced the LOS biosynthesis loci of 15 strains that possessed gene content that was distinct from the eight classes. This analysis identified 11 new classes of LOS loci that exhibited examples of deletions and insertions of genes and cassettes of genes found in other LOS classes or capsular biosynthesis loci leading to mosaic LOS loci. The sequence analysis also revealed both missense mutations leading to “allelic” glycosyltransferases and phase-variable and non-phase-variable gene inactivation by the deletion or insertion of bases. Specifically, we demonstrated that gene inactivation is an important mechanism for altering the LOS structures of strains possessing the same class of LOS biosynthesis locus. Together, these observations suggest that LOS biosynthesis region is a hotspot for genetic exchange and variability, often leading to changes in the LOS produced. PMID:18556784

  9. Comparison of CNVs in Buffalo with other species

    USDA-ARS's Scientific Manuscript database

    Using a read-depth (RD) and a hybrid read-pair, split-read (RAPTR-SV) CNV detection method, we identified over 1425 unique CNVs in 14 water buffalo individuals compared to the cattle genome sequence. Total variable sequence of the CNV regions (CNVR) from the RD method approached 59 megabases (~ 2% of...

  10. Somatic diversification in the heavy chain variable region genes expressed by human autoantibodies bearing a lupus-associated nephritogenic anti-DNA idiotype

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Demaison, C.; Chastagner, P.; Theze, J.

    1994-01-18

    Monoclonal anti-DNA antibodies bearing a lupus nephritis-associated idiotype were derived from five patients with systemic lupus erythematosus (SLE). Genes encoding their heavy (H)-chain variable (VH) regions were cloned and sequenced. When compared with their closest VH germ-line gene relatives, these sequences exhibit a number of silent (S) and replacement (R) substitutions. The ratios of R/S mutations were much higher in the complementarity-determining regions (CDRs) of the antibodies than in the framework regions. Molecular amplification of genomic VH genes and Southern hybridization with somatic CDR2-specific oligonucleotide probes showed that the configuration of the VH genes corresponding to VH sequences in the nephritogenic antibodies is not present in the patient's own germ-line DNA, implying that the B-cell clones underwent somatic mutation in vivo. These findings, together with the characteristics of the diversity and junctional gene elements utilized to form the antibody, indicate that these autoantibodies have been driven through somatic selection processes reminiscent of those that govern antibody responses triggered by exogenous stimuli.
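
    The replacement/silent (R/S) comparison mentioned above can be sketched in a few lines. This is a simplified illustration, not the authors' analysis: it assumes two in-frame, equal-length nucleotide sequences, uses the standard genetic code, and counts changes per codon rather than per nucleotide. The function name `count_r_s` and the example sequences are hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def count_r_s(germline, mutated):
    """Return (replacement, silent) counts of changed codons between two sequences."""
    if len(germline) != len(mutated) or len(germline) % 3 != 0:
        raise ValueError("sequences must be in frame and of equal length")
    replacement = silent = 0
    for i in range(0, len(germline), 3):
        g_codon, m_codon = germline[i:i + 3], mutated[i:i + 3]
        if g_codon == m_codon:
            continue
        if CODON_TABLE[g_codon] == CODON_TABLE[m_codon]:
            silent += 1       # same amino acid: silent (S)
        else:
            replacement += 1  # different amino acid: replacement (R)
    return replacement, silent


if __name__ == "__main__":
    # Made-up codons: GCT->GCC is silent (Ala), AAA->AGA is a replacement (Lys->Arg).
    print(count_r_s("GCTAAAGGT", "GCCAGAGGT"))
```

    Applying such a count separately to CDR and framework codons gives the two R/S ratios that the abstract compares.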

  11. Three ingredients for Improved global aftershock forecasts: Tectonic region, time-dependent catalog incompleteness, and inter-sequence variability

    USGS Publications Warehouse

    Page, Morgan T.; Van Der Elst, Nicholas; Hardebeck, Jeanne L.; Felzer, Karen; Michael, Andrew J.

    2016-01-01

    Following a large earthquake, seismic hazard can be orders of magnitude higher than the long‐term average as a result of aftershock triggering. Because of this heightened hazard, emergency managers and the public demand rapid, authoritative, and reliable aftershock forecasts. In the past, U.S. Geological Survey (USGS) aftershock forecasts following large global earthquakes have been released on an ad hoc basis with inconsistent methods, and in some cases aftershock parameters adapted from California. To remedy this, the USGS is currently developing an automated aftershock product based on the Reasenberg and Jones (1989) method that will generate more accurate forecasts. To better capture spatial variations in aftershock productivity and decay, we estimate regional aftershock parameters for sequences within the García et al. (2012) tectonic regions. We find that regional variations for mean aftershock productivity reach almost a factor of 10. We also develop a method to account for the time‐dependent magnitude of completeness following large events in the catalog. In addition to estimating average sequence parameters within regions, we develop an inverse method to estimate the intersequence parameter variability. This allows for a more complete quantification of the forecast uncertainties and Bayesian updating of the forecast as sequence‐specific information becomes available.
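
    For orientation, the Reasenberg and Jones (1989) model is commonly written as an aftershock rate of 10^(a + b(Mm - M)) (t + c)^(-p) for events of magnitude M or larger at time t after a mainshock of magnitude Mm. The sketch below integrates that rate over a time window. The function name and all parameter values are illustrative placeholders, not the regional estimates from this study.

```python
import math


def expected_aftershocks(mainshock_mag, min_mag, t_start, t_end, a, b, c, p):
    """Expected number of aftershocks with magnitude >= min_mag in [t_start, t_end] (days)."""
    productivity = 10.0 ** (a + b * (mainshock_mag - min_mag))
    if math.isclose(p, 1.0):
        # Integral of (t + c)^-1 is a logarithm.
        time_term = math.log((t_end + c) / (t_start + c))
    else:
        time_term = ((t_end + c) ** (1.0 - p) - (t_start + c) ** (1.0 - p)) / (1.0 - p)
    return productivity * time_term


if __name__ == "__main__":
    # Placeholder parameters chosen only to make the arithmetic easy to follow.
    print(expected_aftershocks(7.0, 5.0, 1.0, 2.0, a=-2.0, b=1.0, c=0.0, p=2.0))
```

    In this form, a regional difference in the productivity parameter a of one unit changes the expected count by a factor of 10, which is the scale of variation the abstract reports between tectonic regions.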

  12. The Complete Nucleotide Sequence of the Human Immunoglobulin Heavy Chain Variable Region Locus

    PubMed Central

    Matsuda, Fumihiko; Ishii, Kazuo; Bourvagnet, Patrice; Kuma, Kei-ichi; Hayashida, Hidenori; Miyata, Takashi; Honjo, Tasuku

    1998-01-01

    The complete nucleotide sequence of the 957-kb DNA of the human immunoglobulin heavy chain variable (VH) region locus was determined and 43 novel VH segments were identified. The region contains 123 VH segments classifiable into seven different families, of which 79 are pseudogenes. Of the 44 VH segments with an open reading frame, 39 are expressed as heavy chain proteins and 1 as mRNA, while the remaining 4 are not found in immunoglobulin cDNAs. Combinatorial diversity of VH region was calculated to be ∼6,000. Conservation of the promoter and recombination signal sequences was observed to be higher in functional VH segments than in pseudogenes. Phylogenetic analysis of 114 VH segments clearly showed clustering of the VH segments of each family. However, an independent branch in the tree contained a single VH, V4-44.1P, sharing similar levels of homology to human VH families and to those of other vertebrates. Comparison between different copies of homologous units that appear repeatedly across the locus clearly demonstrates that dynamic DNA reorganization of the locus took place at least eight times between 133 and 10 million years ago. One nonimmunoglobulin gene of unknown function was identified in the intergenic region. PMID:9841928

  13. Interspecific and intraspecific gene variability in a 1-Mb region containing the highest density of NBS-LRR genes found in the melon genome.

    PubMed

    González, Víctor M; Aventín, Núria; Centeno, Emilio; Puigdomènech, Pere

    2014-12-17

    Plant NBS-LRR resistance genes tend to be found in clusters, which have been shown to be hot spots of genome variability. In melon, half of the 81 predicted NBS-LRR genes group in nine clusters, and a 1 Mb region on linkage group V contains the highest density of R-genes and presence/absence gene polymorphisms found in the melon genome. This region is known to contain the locus of Vat, an agronomically important gene that confers resistance to aphids. However, the presence of duplications makes the sequencing and annotation of R-gene clusters difficult, usually resulting in multi-gapped sequences with higher than average errors. A 1-Mb sequence that contains the largest NBS-LRR gene cluster found in melon was improved using a strategy that combines Illumina paired-end mapping and PCR-based gap closing. Unknown sequence was decreased by 70% while about 3,000 SNPs and small indels were corrected. As a result, the annotations of 18 of a total of 23 NBS-LRR genes found in this region were modified, including additional coding sequences, amino acid changes, correction of splicing boundaries, or fusion of ORFs in common transcription units. A phylogeny analysis of the R-genes and their comparison with syntenic sequences in other cucurbits point to a pattern of local gene amplifications since the diversification of cucurbits from other families, and through speciation within the family. A candidate Vat gene is proposed based on the sequence similarity between a reported Vat gene from a Korean melon cultivar and a sequence fragment previously absent in the unrefined sequence. A sequence refinement strategy allowed substantial improvement of a 1 Mb fragment of the melon genome and the re-annotation of the largest cluster of NBS-LRR gene homologues found in melon. 
Analysis of the cluster revealed that resistance genes have been produced by sequence duplication in adjacent genome locations since the divergence of cucurbits from other close families, and through the process of speciation within the family. A candidate Vat gene was also identified using sequence previously unavailable, which demonstrates the advantages of genome assembly refinements when analyzing complex regions such as those containing clusters of highly similar genes.

  14. Phylogenetic Relationship in Different Commercial Strains of Pleurotus nebrodensis Based on ITS Sequence and RAPD.

    PubMed

    Alam, Nuhu; Shim, Mi Ja; Lee, Min Woong; Shin, Pyeong Gyun; Yoo, Young Bok; Lee, Tae Soo

    2009-09-01

    The molecular phylogeny of nine different commercially cultivated strains of Pleurotus nebrodensis was studied based on their internal transcribed spacer (ITS) region and RAPD. Sequencing of the ITS region of the selected strains revealed that the total length ranged from 592 to 614 bp. The size of ITS1 and ITS2 regions varied among the strains from 219 to 228 bp and 211 to 229 bp, respectively. The sequence of ITS2 was more variable than that of ITS1, and the 5.8S region sequences were identical. The phylogenetic tree of the ITS region sequences indicated that the selected strains were classified into five clusters. The reciprocal homologies of the ITS region sequences ranged from 99 to 100%. The strains were also analyzed by RAPD with 20 arbitrary primers. Twelve primers efficiently amplified the genomic DNA. The sizes of the polymorphic fragments obtained were in the range of 200 to 2000 bp. RAPD and ITS analysis techniques were able to detect genetic variation among the tested strains. Experimental results suggested that the IUM-1381, IUM-3914, IUM-1495 and AY-581431 strains were genetically very similar. Therefore, all IUM and NCBI gene bank strains of P. nebrodensis were genetically the same, with some variations.

  15. Harnessing Gene Conversion in Chicken B Cells to Create a Human Antibody Sequence Repertoire

    PubMed Central

    Schusser, Benjamin; Yi, Henry; Collarini, Ellen J.; Izquierdo, Shelley Mettler; Harriman, William D.; Etches, Robert J.; Leighton, Philip A.

    2013-01-01

    Transgenic chickens expressing human sequence antibodies would be a powerful tool to access human targets and epitopes that have been intractable in mammalian hosts because of tolerance to conserved proteins. To foster the development of the chicken platform, it is beneficial to validate transgene constructs using a rapid, cell culture-based method prior to generating fully transgenic birds. We describe a method for the expression of human immunoglobulin variable regions in the chicken DT40 B cell line and the further diversification of these genes by gene conversion. Chicken VL and VH loci were knocked out in DT40 cells and replaced with human VK and VH genes. To achieve gene conversion of human genes in chicken B cells, synthetic human pseudogene arrays were inserted upstream of the functional human VK and VH regions. Proper expression of chimeric IgM comprised of human variable regions and chicken constant regions is shown. Most importantly, sequencing of DT40 genetic variants confirmed that the human pseudogene arrays contributed to the generation of diversity through gene conversion at both the Igl and Igh loci. These data show that engineered pseudogene arrays produce a diverse pool of human antibody sequences in chicken B cells, and suggest that these constructs will express a functional repertoire of chimeric antibodies in transgenic chickens. PMID:24278246

  16. Analysis of simian immunodeficiency virus sequence variation in tissues of rhesus macaques with simian AIDS.

    PubMed Central

    Kodama, T; Mori, K; Kawahara, T; Ringler, D J; Desrosiers, R C

    1993-01-01

    One rhesus macaque displayed severe encephalomyelitis and another displayed severe enterocolitis following infection with molecularly cloned simian immunodeficiency virus (SIV) strain SIVmac239. Little or no free anti-SIV antibody developed in these two macaques, and they died relatively quickly (4 to 6 months) after infection. Manifestation of the tissue-specific disease in these macaques was associated with the emergence of variants with high replicative capacity for macrophages and primary infection of tissue macrophages. The nature of sequence variation in the central region (vif, vpr, and vpx), the env gene, and the nef long terminal repeat (LTR) region in brain, colon, and other tissues was examined to see whether specific genetic changes were associated with SIV replication in brain or gut. Sequence analysis revealed strong conservation of the intergenic central region, nef, and the LTR. However, analysis of env sequences in these two macaques and one other revealed significant, interesting patterns of sequence variation. (i) Changes in env that were found previously to contribute to the replicative ability of SIVmac for macrophages in culture were present in the tissues of these animals. (ii) The greatest variability was located in the regions between V1 and V2 and from "V3" through C3 in gp120, which are different in location from the variable regions observed previously in animals with strong antibody responses and long-term persistent infection. (iii) The predominant sequence change of D-->N at position 385 in C3 is most surprising, since this change in both SIV and human immunodeficiency virus type 1 has been associated with dramatically diminished affinity for CD4 and replication in vitro. 
(iv) The nature of sequence changes at some positions (146, 178, 345, 385, and "V3") suggests that viral replication in brain and gut may be facilitated by specific sequence changes in env in addition to those that impart a general ability to replicate well in macrophages. These results demonstrate that complex selective pressures, including immune responses and varying cell and tissue specificity, can influence the nature of sequence changes in env. PMID:8411355

  17. The complete mitochondrial genome sequence of the Tibetan red fox (Vulpes vulpes montana).

    PubMed

    Zhang, Jin; Zhang, Honghai; Zhao, Chao; Chen, Lei; Sha, Weilai; Liu, Guangshuai

    2015-01-01

    In this study, the complete mitochondrial genome of the Tibetan red fox (Vulpes vulpes montana) was sequenced for the first time using blood samples obtained from a wild female red fox captured from Lhasa in Tibet, China. The Qinghai-Tibet Plateau is the highest plateau in the world, with an average elevation above 3500 m. Sequence analysis showed that the genome contains a 12S rRNA gene, a 16S rRNA gene, 22 tRNA genes, 13 protein-coding genes and 1 control region (CR). Variable tandem repeats in the CR are the main cause of the length variability of the mitochondrial genome among canids.

  18. Implementing targeted region capture sequencing for the clinical detection of Alagille syndrome: An efficient and cost‑effective method.

    PubMed

    Huang, Tianhong; Yang, Guilin; Dang, Xiao; Ao, Feijian; Li, Jiankang; He, Yizhou; Tang, Qiyuan; He, Qing

    2017-11-01

    Alagille syndrome (AGS) is a highly variable, autosomal dominant disease that affects multiple structures including the liver, heart, eyes, bones and face. Targeted region capture sequencing focuses on a panel of known pathogenic genes and provides a rapid, cost‑effective and accurate method for molecular diagnosis. In a Chinese family, this method was used on the proband and Sanger sequencing was applied to validate the candidate mutation. A de novo heterozygous mutation (c.3254_3255insT p.Leu1085PhefsX24) of the jagged 1 gene was identified as the potential disease‑causing gene mutation. In conclusion, the present study suggested that target region capture sequencing is an efficient, reliable and accurate approach for the clinical diagnosis of AGS. Furthermore, these results expand on the understanding of the pathogenesis of AGS.

  19. Species-specific identification of Dekkera/Brettanomyces yeasts by fluorescently labeled DNA probes targeting the 26S rRNA.

    PubMed

    Röder, Christoph; König, Helmut; Fröhlich, Jürgen

    2007-09-01

    Sequencing of the complete 26S rRNA genes of all Dekkera/Brettanomyces species colonizing different beverages revealed the potential for a specific primer and probe design to support diagnostic PCR approaches and FISH. By analysis of the complete 26S rRNA genes of all five currently known Dekkera/Brettanomyces species (Dekkera bruxellensis, D. anomala, Brettanomyces custersianus, B. nanus and B. naardenensis), several regions with high nucleotide sequence variability yet distinct from the D1/D2 domains were identified. FISH species-specific probes targeting the 26S rRNA gene's most variable regions were designed. Accessibility of probe targets for hybridization was facilitated by the construction of partially complementary 'side'-labeled probes, based on secondary structure models of the rRNA sequences. The specificity and routine applicability of the FISH-based method for yeast identification were tested by analyzing different wine isolates. Investigation of the prevalence of Dekkera/Brettanomyces yeasts in the German viticultural regions Wonnegau, Nierstein and Bingen (Rhinehesse, Rhineland-Palatinate) resulted in the isolation of 37 D. bruxellensis strains from 291 wine samples.

  20. Inter-individual and intragenomic variations in the ITS region of Clonorchis sinensis (Trematoda: Opisthorchiidae) from Russia and Vietnam.

    PubMed

    Tatonova, Yulia V; Chelomina, Galina N; Nguyen, Hung Manh

    2017-11-01

    Here we examined the intraspecific genetic variability of Clonorchis sinensis from Russia and Vietnam using nuclear DNA sequences (the 5.8S gene and two internal transcribed spacers of the ribosomal cluster). Despite the low level of variability in the ITS1 region, this marker has revealed some features of C. sinensis across multiple geographic regions. The genetic diversity levels for the Russian and Vietnamese populations were similar (0.1 and 0.09%, respectively) but were significantly lower than that of C. sinensis from China (0.31%). About half of the sequences of the Chinese (53%) and Korean (47%) populations and about a tenth of the Vietnamese (12%) and Russian (8%) sequences included a 5bp insertion. No sequences with nucleotide substitutions both upstream and downstream of the 5bp insertion were found within the whole data set. The population of northern China had both sequence variants (with substitutions either upstream or downstream of the insertion), while only one of these variants was present at the other localities. The Vietnamese population had a higher frequency of intragenomic polymorphism than the Russian population (69% vs. 46% and 23% vs. 3% at the 114bp and 339bp positions, respectively). These data are discussed in connection with parasite origin and adaptation, and also its invasive capacity and drug-resistance. Copyright © 2017 Elsevier B.V. All rights reserved.

  1. Sequence intrinsic somatic mutation mechanisms contribute to affinity maturation of VRC01-class HIV-1 broadly neutralizing antibodies

    PubMed Central

    Hwang, Joyce K.; Wang, Chong; Du, Zhou; Meyers, Robin M.; Kepler, Thomas B.; Neuberg, Donna; Kwong, Peter D.; Mascola, John R.; Joyce, M. Gordon; Bonsignori, Mattia; Haynes, Barton F.; Yeap, Leng-Siew; Alt, Frederick W.

    2017-01-01

    Variable regions of Ig chains provide the antigen recognition portion of B-cell receptors and derivative antibodies. Ig heavy-chain variable region exons are assembled developmentally from V, D, J gene segments. Each variable region contains three antigen-contacting complementarity-determining regions (CDRs), with CDR1 and CDR2 encoded by the V segment and CDR3 encoded by the V(D)J junction region. Antigen-stimulated germinal center (GC) B cells undergo somatic hypermutation (SHM) of V(D)J exons followed by selection for SHMs that increase antigen-binding affinity. Some HIV-1–infected human subjects develop broadly neutralizing antibodies (bnAbs), such as the potent VRC01-class bnAbs, that neutralize diverse HIV-1 strains. Mature VRC01-class bnAbs, including VRC-PG04, accumulate very high SHM levels, a property that hinders development of vaccine strategies to elicit them. Because many VRC01-class bnAb SHMs are not required for broad neutralization, high overall SHM may be required to achieve certain functional SHMs. To elucidate such requirements, we used a V(D)J passenger allele system to assay, in mouse GC B cells, sequence-intrinsic SHM-targeting rates of nucleotides across substrates representing maturation stages of human VRC-PG04. We identify rate-limiting SHM positions for VRC-PG04 maturation, as well as SHM hotspots and intrinsically frequent deletions associated with SHM. We find that mature VRC-PG04 has low SHM capability due to hotspot saturation but also demonstrate that generation of new SHM hotspots and saturation of existing hotspot regions (e.g., CDR3) does not majorly influence intrinsic SHM in unmutated portions of VRC-PG04 progenitor sequences. We discuss implications of our findings for bnAb affinity maturation mechanisms. PMID:28747530

  2. Distribution of Helicobacter pylori virulence markers in patients with gastroduodenal diseases in a region at high risk of gastric cancer.

    PubMed

    Wang, Ming-yi; Chen, Cheng; Gao, Xiao-zhong; Li, Jie; Yue, Jing; Ling, Feng; Wang, Xiao-chun; Shao, Shi-he

    2013-01-01

    Helicobacter pylori (H. pylori) is a major human pathogen that is responsible for various gastroduodenal diseases. We investigated the prevalence of H. pylori virulence markers in a region at high risk of gastric cancer. One hundred and sixteen H. pylori strains were isolated from patients with gastroduodenal diseases. cagA, the cagA 3' variable region, cagPAI genes, vacA, and dupA genotypes were determined by PCR, and some amplicons of the cagA 3' variable region, cagPAI genes and dupA were sequenced. cagA was detected in all strains. The cagA 3' variable region of 85 strains (73.3%) was amplified, and the sequences of 24 strains were obtained including 22 strains possessing the East Asian-type. The partial cagPAI presented at a higher frequency in chronic gastritis (44.4%) than that of the severe clinical outcomes (9.7%, p < 0.001). The most prevalent vacA genotypes were s1a/m2 (48.3%) and s1c/m2 (13.8%). Thirty-six strains (31.0%) possessed dupA and sequencing of dupA revealed an ORF of 2449-bp. The prevalence of dupA was significantly higher in strains from patients with the severe clinical outcomes (40.3%) than that from chronic gastritis (20.4%, p = 0.02). The high rate of East Asian-type cagA, intact cagPAI, virulent vacA genotypes, and the intact long-type dupA may underlie the high risk of gastric cancer in the region. Copyright © 2013 Elsevier Ltd. All rights reserved.

  3. Mutations in the S gene and in the overlapping reverse transcriptase region in chronic hepatitis B Chinese patients with coexistence of HBsAg and anti-HBs.

    PubMed

    Ding, Feng; Miao, Xi-Li; Li, Yan-Xia; Dai, Jin-Fen; Yu, Hong-Gang

    2016-01-01

    The mechanism underlying the coexistence of hepatitis B surface antigen and antibodies to HBsAg in chronic hepatitis B patients remains unknown. This research aimed to determine the clinical and virological features of the rare pattern. A total of 32 chronic hepatitis B patients infected by HBV genotype C were included: 15 carrying both HBsAg and anti-HBs (group I) and 17 solely positive for HBsAg (group II). S gene and reverse transcriptase region sequences were amplified, sequenced and compared with the reference sequences. The amino acid variability within the major hydrophilic region, especially the "a" determinant region, and within reverse transcriptase for regions overlapping the major hydrophilic region in group I is significantly higher than that in group II. Mutation sI126S/T within the "a" determinant was the most frequent change, and only patients from group I had the sQ129R, sG130N, sF134I, sG145R amino acid changes, which are known to alter immunogenicity. In chronic patients, the concurrent HBsAg/anti-HBs serological profile is associated with an increased aa variability in several key areas of the HBV genome. Additional research on these genetic mutants is needed to clarify their biological significance for viral persistence. Copyright © 2015 Elsevier Editora Ltda. All rights reserved.

  4. Determination of haplotypes at structurally complex regions using emulsion haplotype fusion PCR.

    PubMed

    Tyson, Jess; Armour, John A L

    2012-12-11

    Genotyping and massively-parallel sequencing projects result in a vast amount of diploid data that is only rarely resolved into its constituent haplotypes. It is nevertheless this phased information that is transmitted from one generation to the next and is most directly associated with biological function and the genetic causes of biological effects. Despite progress made in genome-wide sequencing and phasing algorithms and methods, problems assembling (and reconstructing linear haplotypes in) regions of repetitive DNA and structural variation remain. These dynamic and structurally complex regions are often poorly understood from a sequence point of view. Regions such as these that are highly similar in their sequence tend to be collapsed onto the genome assembly. This in turn means downstream determination of the true sequence haplotype in these regions poses a particular challenge. For structurally complex regions, a more focussed approach to assembling haplotypes may be required. In order to investigate reconstruction of spatial information at structurally complex regions, we have used an emulsion haplotype fusion PCR approach to reproducibly link sequences of up to 1kb in length to allow phasing of multiple variants from neighbouring loci, using allele-specific PCR and sequencing to detect the phase. By using emulsion systems linking flanking regions to amplicons within the CNV, this led to the reconstruction of a 59kb haplotype across the DEFA1A3 CNV in HapMap individuals. This study has demonstrated a novel use for emulsion haplotype fusion PCR in addressing the issue of reconstructing structural haplotypes at multiallelic copy variable regions, using the DEFA1A3 locus as an example.

  5. Amino terminal sequence of heavy and light chains from ratfish immunoglobulin.

    PubMed

    De Ioannes, A E; Aguila, H L

    1989-01-01

    The ratfish, Callorhinchus callorhinchus, a representative of the Holocephali, has a natural serum hemagglutinin (Mr 960,000), composed of heavy (Mr 71,000), light (Mr 22,500), and J (Mr 16,000) chains. To approach the mechanisms that generate diversity at this level of evolution, the amino terminal sequence of the heavy and light chains was determined by automated microsequencing. The chains are unblocked and have modest internal sequence heterogeneity. The heavy chains show sequence similarity with the terminal region of the heavy chain from the horned shark, Heterodontus francisci, and other species. In contrast to the heavy chain, the ratfish light chains display low sequence similarity with their shark kappa counterparts. However, their similarity with the variable region of the chicken lambda light chains is about 75%.

  6. [Comparative analysis of variable regions in the genomes of variola virus].

    PubMed

    Babkin, I V; Nepomniashchikh, T S; Maksiutov, R A; Gutorov, V V; Babkina, I N; Shchelkunov, S N

    2008-01-01

    Nucleotide sequences of two extended segments of the terminal variable regions in the variola virus genome were determined. The size of the left segment was 13.5 kbp and of the right, 10.5 kbp. In total, over 540 kbp were sequenced for 22 variola virus strains. The phylogenetic analysis, together with previously published data, allowed us to find the interrelations between 70 variola virus isolates, the character of their clustering, and the degree of intergroup and intragroup variations of the clusters of variola virus strains. The most polymorphic loci of the genome segments studied were determined. It was demonstrated that these loci are localized either to noncoding genome regions or to the regions of destroyed open reading frames, characteristic of the ancestor virus. These loci are promising for development of a strategy for genotyping variola virus strains. Analysis of recombination using various methods demonstrated that, with a single exception, no statistically significant recombinational events in the genomes of the variola virus strains studied were detectable.

  7. Comparison of variable region 3 sequences of human immunodeficiency virus type 1 from infected children with the RNA and DNA sequences of the virus populations of their mothers.

    PubMed Central

    Scarlatti, G; Leitner, T; Halapi, E; Wahlberg, J; Marchisio, P; Clerici-Schoeller, M A; Wigzell, H; Fenyö, E M; Albert, J; Uhlén, M

    1993-01-01

    We have compared the variable region 3 sequences from 10 human immunodeficiency virus type 1 (HIV-1)-infected infants to virus sequences from the corresponding mothers. The sequences were derived from DNA of uncultured peripheral blood mononuclear cells (PBMC), DNA of cultured PBMC, and RNA from serum collected at or shortly after delivery. The infected infants, in contrast to the mothers, harbored homogeneous virus populations. Comparison of sequences from the children and clones derived from DNA of the corresponding mothers showed that the transmitted virus represented either a minor or a major virus population of the mother. In contrast to an earlier study, we found no evidence of selection of minor virus variants during transmission. Furthermore, the transmitted virus variant did not show any characteristic molecular features. In some cases the transmitted virus was more related to the virus RNA population of the mother and in other cases it was more related to the virus DNA population. This suggests that either cell-free or cell-associated virus may be transmitted. These data will help AIDS researchers to understand the mechanism of transmission and to plan strategies for prevention of transmission. PMID:8446584

  8. Full-genome sequences of hepatitis B virus subgenotype D3 isolates from the Brazilian Amazon Region.

    PubMed

    Spitz, Natália; Mello, Francisco C A; Araujo, Natalia Motta

    2015-02-01

    The Brazilian Amazon Region is a highly endemic area for hepatitis B virus (HBV). However, little is known regarding the genetic variability of the strains circulating in this geographical region. Here, we describe the first full-length genomes of HBV isolated in the Brazilian Amazon Region; these genomes are also the first complete HBV subgenotype D3 genomes reported for Brazil. The genomes of the five Brazilian isolates were all 3,182 base pairs in length and the isolates were classified as belonging to subgenotype D3, subtypes ayw2 (n = 3) and ayw3 (n = 2). Phylogenetic analysis suggested that the Brazilian sequences are not likely to be closely related to European D3 sequences. Such results will contribute to further epidemiological and evolutionary studies of HBV.

  9. Variable Copy Number, Intra-Genomic Heterogeneities and Lateral Transfers of the 16S rRNA Gene in Pseudomonas

    PubMed Central

    Bodilis, Josselin; Nsigue-Meilo, Sandrine; Besaury, Ludovic; Quillet, Laurent

    2012-01-01

    Even though the 16S rRNA gene is the most commonly used taxonomic marker in microbial ecology, its poor resolution is still not fully understood at the intra-genus level. In this work, the number of rRNA gene operons, intra-genomic heterogeneities and lateral transfers were investigated at a fine-scale resolution, throughout the Pseudomonas genus. In addition to nineteen sequenced Pseudomonas strains, we determined the 16S rRNA copy number in four other Pseudomonas strains by Southern hybridization and Pulsed-Field Gel Electrophoresis, and studied the intra-genomic heterogeneities by Denaturing Gradient Gel Electrophoresis and sequencing. Although the variable copy number (from four to seven) seems to be correlated with the evolutionary distance, some close strains in the P. fluorescens lineage showed a different number of 16S rRNA genes, whereas all the strains in the P. aeruginosa lineage displayed the same number of genes (four copies). Further study of the intra-genomic heterogeneities revealed that most of the Pseudomonas strains (15 out of 19 strains) had at least two different 16S rRNA alleles. A great difference (5 or 19 nucleotides, essentially grouped near the V1 hypervariable region) was observed only in two sequenced strains. In one of our strains studied (MFY30 strain), we found a difference of 12 nucleotides (grouped in the V3 hypervariable region) between copies of the 16S rRNA gene. Finally, occurrence of partial lateral transfers of the 16S rRNA gene was further investigated in 1803 full-length sequences of Pseudomonas available in the databases. Remarkably, we found that the two most variable regions (the V1 and V3 hypervariable regions) had probably been laterally transferred from another evolutionary distant Pseudomonas strain for at least 48.3 and 41.6% of the 16S rRNA sequences, respectively. In conclusion, we strongly recommend removing these regions of the 16S rRNA gene during the intra-genus diversity studies. PMID:22545126

  10. Combining phage display with de novo protein sequencing for reverse engineering of monoclonal antibodies.

    PubMed

    Rickert, Keith W; Grinberg, Luba; Woods, Robert M; Wilson, Susan; Bowen, Michael A; Baca, Manuel

    2016-01-01

    The enormous diversity created by gene recombination and somatic hypermutation makes de novo protein sequencing of monoclonal antibodies a uniquely challenging problem. Modern mass spectrometry-based sequencing will rarely, if ever, provide a single unambiguous sequence for the variable domains. A more likely outcome is computation of an ensemble of highly similar sequences that can satisfy the experimental data. This outcome can result in the need for empirical testing of many candidate sequences, sometimes iteratively, to identify one which can replicate the activity of the parental antibody. Here we describe an improved approach to antibody protein sequencing by using phage display technology to generate a combinatorial library of sequences that satisfy the mass spectrometry data, and selecting for functional candidates that bind antigen. This approach was used to reverse engineer 2 commercially-obtained monoclonal antibodies against murine CD137. Proteomic data enabled us to assign the majority of the variable domain sequences, with the exception of 3-5% of the sequence located within or adjacent to complementarity-determining regions. To efficiently resolve the sequence in these regions, small phage-displayed libraries were generated and subjected to antigen binding selection. Following enrichment of antigen-binding clones, 2 clones were selected for each antibody and recombinantly expressed as antigen-binding fragments (Fabs). In both cases, the reverse-engineered Fabs exhibited identical antigen binding affinity, within error, as Fabs produced from the commercial IgGs. This combination of proteomic and protein engineering techniques provides a useful approach to simplifying the technically challenging process of reverse engineering monoclonal antibodies from protein material.

  11. Combining phage display with de novo protein sequencing for reverse engineering of monoclonal antibodies

    PubMed Central

    Rickert, Keith W.; Grinberg, Luba; Woods, Robert M.; Wilson, Susan; Bowen, Michael A.; Baca, Manuel

    2016-01-01

    The enormous diversity created by gene recombination and somatic hypermutation makes de novo protein sequencing of monoclonal antibodies a uniquely challenging problem. Modern mass spectrometry-based sequencing will rarely, if ever, provide a single unambiguous sequence for the variable domains. A more likely outcome is computation of an ensemble of highly similar sequences that can satisfy the experimental data. This outcome can result in the need for empirical testing of many candidate sequences, sometimes iteratively, to identify one which can replicate the activity of the parental antibody. Here we describe an improved approach to antibody protein sequencing by using phage display technology to generate a combinatorial library of sequences that satisfy the mass spectrometry data, and selecting for functional candidates that bind antigen. This approach was used to reverse engineer 2 commercially-obtained monoclonal antibodies against murine CD137. Proteomic data enabled us to assign the majority of the variable domain sequences, with the exception of 3–5% of the sequence located within or adjacent to complementarity-determining regions. To efficiently resolve the sequence in these regions, small phage-displayed libraries were generated and subjected to antigen binding selection. Following enrichment of antigen-binding clones, 2 clones were selected for each antibody and recombinantly expressed as antigen-binding fragments (Fabs). In both cases, the reverse-engineered Fabs exhibited identical antigen binding affinity, within error, as Fabs produced from the commercial IgGs. This combination of proteomic and protein engineering techniques provides a useful approach to simplifying the technically challenging process of reverse engineering monoclonal antibodies from protein material. PMID:26852694

  12. Bacterial diversity of Taxus rhizosphere: culture-independent and culture-dependent approaches.

    PubMed

    Hao, Da Cheng; Ge, Guang Bo; Yang, Ling

    2008-07-01

    The regional variability of Taxus rhizosphere bacterial community composition and diversity was studied by comparative analysis of three large 16S rRNA gene clone libraries from the Taxus rhizosphere in different regions of China (subtropical and temperate regions). One hundred and forty-six clones were screened for three libraries. Phylogenetic analysis of 16S rRNA gene sequences demonstrated that the abundance of sequences affiliated with Gammaproteobacteria, Betaproteobacteria, and Actinobacteria was higher in the library from the T. xmedia rhizosphere of the temperate region compared with the subtropical Taxus mairei rhizosphere. On the other hand, Acidobacteria was more abundant in libraries from the subtropical Taxus mairei rhizosphere. Richness estimates and diversity indices of three libraries revealed major differences, indicating a higher richness in the Taxus rhizosphere bacterial communities of the subtropical region and considerable variability in the bacterial community composition within this region. By enrichment culture, a novel Actinobacteria strain DICP16 was isolated from the T. xmedia rhizosphere of the temperate region and was identified as Leifsonia shinshuensis sp. via 16S rRNA gene and gyrase B sequence analyses. DICP16 was able to remove the xylosyl group from 7-xylosyl-10-deacetylbaccatin III and 7-xylosyl-10-deacetylpaclitaxel, thereby making the xylosyltaxanes available as sources of 10-deacetylbaccatin III and the anticancer drug paclitaxel. Taken together, the present studies provide, for the first time, the knowledge of the biodiversity of microorganisms populating Taxus rhizospheres.

  13. Changes in tau phosphorylation in hibernating rodents.

    PubMed

    León-Espinosa, Gonzalo; García, Esther; García-Escudero, Vega; Hernández, Félix; Defelipe, Javier; Avila, Jesús

    2013-07-01

    Tau is a cytoskeletal protein present mainly in the neurons of vertebrates. By comparing the sequence of tau molecule among different vertebrates, it was found that the variability of the N-terminal sequence in tau protein is higher than that of the C-terminal region. The N-terminal region is involved mainly in the binding of tau to cellular membranes, whereas the C-terminal region of the tau molecule contains the microtubule-binding sites. We have compared the sequence of Syrian hamster tau with the sequences of other hibernating and nonhibernating rodents and investigated how differences in the N-terminal region of tau could affect the phosphorylation level and tau binding to cell membranes. We also describe a change, in tau phosphorylation, on a casein kinase 1 (ck1)-dependent site that is found only in hibernating rodents. This ck1 site seems to play an important role in the regulation of tau binding to membranes. Copyright © 2013 Wiley Periodicals, Inc.

  14. Full Genome Sequencing Reveals New Southern African Territories Genotypes Bringing Us Closer to Understanding True Variability of Foot-and-Mouth Disease Virus in Africa

    PubMed Central

    Lasecka-Dykes, Lidia; Wright, Caroline F.; Di Nardo, Antonello; Logan, Grace; Mioulet, Valerie; Jackson, Terry; Tuthill, Tobias J.; Knowles, Nick J.; King, Donald P.

    2018-01-01

    Foot-and-mouth disease virus (FMDV) causes a highly contagious disease of cloven-hooved animals that poses a constant burden on farmers in endemic regions and threatens the livestock industries in disease-free countries. Despite the increased number of publicly available whole genome sequences, FMDV data are biased by the opportunistic nature of sampling. Since whole genomic sequences of Southern African Territories (SAT) are particularly underrepresented, this study sequenced 34 isolates from eastern and southern Africa. Phylogenetic analyses revealed two novel genotypes (that comprised 8/34 of these SAT isolates) which contained unusual 5′ untranslated and non-structural encoding regions. While recombination has occurred between these sequences, phylogeny violation analyses indicated that the high degree of sequence diversity for the novel SAT genotypes has not solely arisen from recombination events. Based on estimates of the timing of ancestral divergence, these data are interpreted as being representative of un-sampled FMDV isolates that have been subjected to geographical isolation within Africa by the effects of the Great African Rinderpest Pandemic (1887–1897), which caused a mass die-out of FMDV-susceptible hosts. These findings demonstrate that further sequencing of African FMDV isolates is likely to reveal more unusual genotypes and will allow for better understanding of natural variability and evolution of FMDV. PMID:29652800

  15. Cloning and molecular characterization of the cDNAs encoding the variable regions of an anti-CD20 monoclonal antibody.

    PubMed

    Shanehbandi, Dariush; Majidi, Jafar; Kazemi, Tohid; Baradaran, Behzad; Aghebati-Maleki, Leili

    2017-01-01

    CD20-based targeting of B-cells in hematologic malignancies and autoimmune disorders is associated with outstanding clinical outcomes. Isolation and characterization of VH and VL cDNAs encoding the variable regions of the heavy and light chains of monoclonal antibodies (MAb) is necessary to produce next generation MAbs and their derivatives such as bispecific antibodies (bsAb) and single-chain variable fragments (scFv). This study was aimed at cloning and characterization of the VH and VL cDNAs from a hybridoma cell line producing an anti-CD20 MAb. VH and VL fragments were amplified, cloned and characterized. Furthermore, amino acid sequences of VH, VL and corresponding complementarity-determining regions (CDR) were determined and compared with those of four approved MAbs including Rituximab (RTX), Ibritumomab tiuxetan, Ofatumumab and GA101. The cloned VH and VL cDNAs were found to be functional and follow a consensus pattern. Amino acid sequences corresponding to the VH and VL fragments also indicated noticeable homologies to those of RTX and Ibritumomab. Furthermore, amino acid sequences of the relating CDRs had remarkable similarities to their counterparts in RTX and Ibritumomab. Successful recovery of VH and VL fragments encourages the development of novel CD20 targeting bsAbs, scFvs, antibody conjugates and T-cells armed with chimeric antigen receptors.

  16. Demonstration of mRNA editing and localization of guide RNA genes in kinetoplast-mitochondria of the plant trypanosomatid Phytomonas serpens.

    PubMed

    Maslov, D A; Hollar, L; Haghighat, P; Nawathean, P

    1998-06-01

    Maxicircle molecules of kDNA in several isolates of Phytomonas were detected by hybridization with the 12S rRNA gene probe from Leishmania tarentolae. The estimated size of maxicircles is isolate-specific and varies from 27 to 36 kb. Fully edited and polyadenylated mRNA for kinetoplast-encoded ribosomal protein S12 (RPS12) was found in the steady-state kinetoplast RNA isolated from Phytomonas serpens strain 1G. Two minicircles (1.45 kb) from this strain were also sequenced. Each minicircle contains two 120 bp conserved regions positioned 180 degrees apart, a region enriched with G and T bases and a variable region. One minicircle encodes a gRNA for the first block of editing of RPS12 mRNA, and the other encodes a gRNA with unknown function. A gRNA gene for the second block of RPS12 was found on a minicircle sequenced previously. On each minicircle, a gRNA gene is located in the variable region in a similar position and orientation with respect to the conserved regions.

  17. Amino acid sequence of the Fv region of a human monoclonal IgM (protein WEA) with antibody activity against 3,4-pyruvylated galactose in Klebsiella polysaccharides K30 and K33.

    PubMed Central

    Goñi, F; Frangione, B

    1983-01-01

    We have determined the amino acid sequence of the Fv [variable heavy (VH) and variable light (VL)] region of a human monoclonal IgM-kappa with antibody activity against 3,4-pyruvylated galactose, isolated from the plasma of patient WEA with Waldenström macroglobulinemia. The VH region has 114 residues, belongs to subgroup III, and has a very short third complementarity-determining region (CDR3), probably due to a small D segment and/or an unusual D-J rearrangement (D, diversity; J, joining). The VL region has 108 residues and belongs to subgroup V kappa I. Compared to other members of the human VHIII and V kappa I families, WEA Fv does not appear to have significant differences within the framework residues but has unique CDRs that might be responsible for the particular antibody activity. Another IgM-kappa (GAL), which has an as-yet-undetermined antibody activity, shares a striking homology in V kappa with WEA, including an identical CDR1. PMID:6410398

  18. Genome Sequences of Akhmeta Virus, an Early Divergent Old World Orthopoxvirus.

    PubMed

    Gao, Jinxin; Gigante, Crystal; Khmaladze, Ekaterine; Liu, Pengbo; Tang, Shiyuyun; Wilkins, Kimberly; Zhao, Kun; Davidson, Whitni; Nakazawa, Yoshinori; Maghlakelidze, Giorgi; Geleishvili, Marika; Kokhreidze, Maka; Carroll, Darin S; Emerson, Ginny; Li, Yu

    2018-05-12

    Annotated whole genome sequences of three isolates of the Akhmeta virus (AKMV), a novel species of orthopoxvirus (OPXV), isolated from the Akhmeta and Vani regions of the country Georgia, are presented and discussed. The AKMV genome is similar in genomic content and structure to that of the cowpox virus (CPXV), but a lower sequence identity was found between AKMV and Old World OPXVs than between other known species of Old World OPXVs. Phylogenetic analysis showed that AKMV diverged prior to other Old World OPXV. AKMV isolates formed a monophyletic clade in the OPXV phylogeny, yet the sequence variability between AKMV isolates was higher than between the monkeypox virus strains in the Congo basin and West Africa. An AKMV isolate from Vani contained approximately six kb sequence in the left terminal region that shared a higher similarity with CPXV than with other AKMV isolates, whereas the rest of the genome was most similar to AKMV, suggesting recombination between AKMV and CPXV in a region containing several host range and virulence genes.

  19. ScaffoldSeq: Software for characterization of directed evolution populations.

    PubMed

    Woldring, Daniel R; Holec, Patrick V; Hackel, Benjamin J

    2016-07-01

    ScaffoldSeq is software designed for the numerous applications, including directed evolution analysis, in which a user generates a population of DNA sequences encoding partially diverse proteins with related functions and would like to characterize the single site and pairwise amino acid frequencies across the population. A common scenario for enzyme maturation, antibody screening, and alternative scaffold engineering involves naïve and evolved populations that contain diversified regions, varying in both sequence and length, within a conserved framework. Analyzing the diversified regions of such populations is facilitated by high-throughput sequencing platforms; however, length variability within these regions (e.g., antibody CDRs) encumbers the alignment process. To overcome this challenge, the ScaffoldSeq algorithm takes advantage of conserved framework sequences to quickly identify diverse regions. Beyond this, unintended biases in sequence frequency are generated throughout the experimental workflow required to evolve and isolate clones of interest prior to DNA sequencing. ScaffoldSeq software uniquely handles this issue by providing tools to quantify and remove background sequences, cluster similar protein families, and dampen the impact of dominant clones. The software produces graphical and tabular summaries for each region of interest, allowing users to evaluate diversity in a site-specific manner as well as identify epistatic pairwise interactions. The code and detailed information are freely available at http://research.cems.umn.edu/hackel. Proteins 2016; 84:869-874. © 2016 Wiley Periodicals, Inc.
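
    The idea of using conserved framework sequences to locate a diversified region, and then tallying single-site residue frequencies, can be illustrated with a minimal Python sketch. This is not ScaffoldSeq's code or its algorithm: the function names (extract_region, site_frequencies, frequencies_by_length), the exact-match flank search, and the grouping of regions by length are all assumptions made purely for illustration.

```python
from collections import Counter


def extract_region(read, left_flank, right_flank):
    """Return the subsequence between two conserved flanks, or None.

    Hypothetical helper: flanks are located by exact string match,
    which real data (with sequencing errors) would not always allow.
    """
    start = read.find(left_flank)
    if start == -1:
        return None
    start += len(left_flank)
    end = read.find(right_flank, start)
    if end == -1:
        return None
    return read[start:end]


def site_frequencies(regions):
    """Per-position residue frequencies for regions of one common length."""
    if not regions:
        return []
    length = len(regions[0])
    if any(len(r) != length for r in regions):
        raise ValueError("all regions must share one length")
    freqs = []
    for i in range(length):
        counts = Counter(r[i] for r in regions)
        total = sum(counts.values())
        freqs.append({res: n / total for res, n in counts.items()})
    return freqs


def frequencies_by_length(reads, left_flank, right_flank):
    """Group extracted regions by length, then tally each group separately.

    Grouping by length sidesteps alignment of length-variable regions;
    this is an illustrative simplification, not the published method.
    """
    by_length = {}
    for read in reads:
        region = extract_region(read, left_flank, right_flank)
        if region is not None:
            by_length.setdefault(len(region), []).append(region)
    return {n: site_frequencies(rs) for n, rs in by_length.items()}


if __name__ == "__main__":
    # Toy reads: conserved flanks "GGG" and "CCC" around a variable stretch.
    reads = ["AAGGGWYCCCTT", "GGGWFCCC", "GGGAWYCCC", "TTTT"]
    for length, freqs in sorted(frequencies_by_length(reads, "GGG", "CCC").items()):
        print(length, freqs)
```

    Reads lacking either flank are simply skipped in this sketch; a practical tool would also need to tolerate mismatches in the framework and to correct for the background and dominant-clone biases that the abstract mentions.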

  20. Characterization and evolution of the mitochondrial DNA control region in hornbills (Bucerotiformes).

    PubMed

    Delport, Wayne; Ferguson, J Willem H; Bloomer, Paulette

    2002-06-01

    We determined the mitochondrial DNA control region sequences of six Bucerotiformes. Hornbills have the typical avian gene order and their control region is similar to other avian control regions in that it is partitioned into three domains: two variable domains that flank a central conserved domain. Two characteristics of the hornbill control region sequence differ from that of other birds. First, domain I is AT rich as opposed to AC rich, and second, the control region is approximately 500 bp longer than that of other birds. Both these deviations from typical avian control region sequence are explainable on the basis of repeat motifs in domain I of the hornbill control region. The repeat motifs probably originated from a duplication of CSB-1 as has been determined in chicken, quail, and snowgoose. Furthermore, the hornbill repeat motifs probably arose before the divergence of hornbills from each other but after the divergence of hornbills from other avian taxa. The mitochondrial control region of hornbills is suitable for both phylogenetic and population studies, with domains I and II probably more suited to population and phylogenetic analyses, respectively.

  1. A complementarity-determining region synthetic peptide acts as a miniantibody and neutralizes human immunodeficiency virus type 1 in vitro.

    PubMed Central

    Levi, M; Sällberg, M; Rudén, U; Herlyn, D; Maruyama, H; Wigzell, H; Marks, J; Wahren, B

    1993-01-01

    A complementarity-determining region (CDR) of the mouse monoclonal antibody (mAb) F58 was constructed with specificity to a neutralization-inducing region of human immunodeficiency virus type 1 (HIV-1). The mAb has its major reactivity to the amino acid sequence I--GPGRA in the V3 viral envelope region. All CDRs including several framework amino acids were synthesized from the sequence deduced by cloning and sequencing mAb F58 heavy- and light-chain variable domains. Peptides derived from the third heavy-chain domain (CDR-H3) alone or in combination with the other CDR sequences competed with F58 mAb for the V3 region. The CDR-H3 peptide was chemically modified by cyclization and then inhibited HIV-1 replication as well as syncytium formation by infected cells. Both the homologous IIIB viral strain to which the F58 mAb was induced and the heterologous SF2 strain were inhibited. This synthetic peptide had unexpectedly potent antiviral activity and may be a potential tool for treatment of HIV-infected persons. PMID:7685100

  2. Tomato (Solanum lycopersicum) variety discrimination and hybridization analysis based on the 5S rRNA region.

    PubMed

    Sun, Yan-Lin; Kang, Ho-Min; Kim, Young-Sik; Baek, Jun-Pill; Zheng, Shi-Lin; Xiang, Jin-Jun; Hong, Soon-Kwan

    2014-05-04

    The tomato (Solanum lycopersicum) is a major vegetable crop worldwide. To satisfy popular demand, more than 500 tomato varieties have been bred. However, a clear variety identification has not been found. Thorough understanding of the phylogenetic relationship and hybridization information of tomato varieties is very important for further variety breeding. Thus, in this study, we collected 26 tomato varieties and attempted to distinguish them based on the 5S rRNA region, which is widely used in the determination of phylogenetic relations. Sequence analysis of the 5S rRNA region suggested that a large number of nucleotide variations exist among tomato varieties. These variable nucleotide sites were also informative regarding hybridization. Chromas sequencing of Yellow Mountain View and Seuwiteuking varieties indicated three and one variable nucleotide sites in the non-transcribed spacer (NTS) of the 5S rRNA region showing hybridization, respectively. Based on a phylogenetic tree constructed using the 5S rRNA sequences, we observed that 16 tomato varieties were divided into three groups at 95% similarity. Rubiking and Sseommeoking, Lang Selection Procedure and Seuwiteuking, and Acorn Gold and Yellow Mountain View exhibited very high identity with their partners. This work will aid variety authentication and provides a basis for further tomato variety breeding.

  3. [Identification of new conserved and variable regions in the 16S rRNA gene of acetic acid bacteria and acetobacteraceae family].

    PubMed

    Chakravorty, S; Sarkar, S; Gachhui, R

    2015-01-01

    The Acetobacteraceae family of the class Alpha Proteobacteria is comprised of high sugar and acid tolerant bacteria. The Acetic Acid Bacteria are the economically most significant group of this family because of their association with food products like vinegar, wine etc. Acetobacteraceae are often hard to culture in laboratory conditions and they also maintain very low abundances in their natural habitats. Thus identification of the organisms in such environments is greatly dependent on modern tools of molecular biology which require a thorough knowledge of specific conserved gene sequences that may act as primers and/or probes. Moreover, unconserved domains in genes also become markers for differentiating closely related genera. In bacteria, the 16S rRNA gene is an ideal candidate for such conserved and variable domains. In order to study the conserved and variable domains of the 16S rRNA gene of Acetic Acid Bacteria and the Acetobacteraceae family, sequences from publicly available databases were aligned and compared. Near complete sequences of the gene were also obtained from Kombucha tea biofilm, a known Acetobacteraceae family habitat, in order to corroborate the domains obtained from the alignment studies. The study indicated that the degree of conservation in the gene is significantly higher among the Acetic Acid Bacteria than the whole Acetobacteraceae family. Moreover, it was also observed that the previously described hypervariable regions V1, V3, V5, V6 and V7 were more or less conserved in the family and the spans of the variable regions are quite distinct as well.

  4. Mind the gap! The mitochondrial control region and its power as a phylogenetic marker in echinoids.

    PubMed

    Bronstein, Omri; Kroh, Andreas; Haring, Elisabeth

    2018-05-30

    In Metazoa, mitochondrial markers are the most commonly used targets for inferring species-level molecular phylogenies due to their extremely low rate of recombination, maternal inheritance, ease of use and fast substitution rate in comparison to nuclear DNA. The mitochondrial control region (CR) is the main non-coding area of the mitochondrial genome and contains the mitochondrial origin of replication and transcription. While sequences of the cytochrome oxidase subunit 1 (COI) and 16S rRNA genes are the prime mitochondrial markers in phylogenetic studies, the highly variable CR is typically ignored and not targeted in such analyses. However, the higher substitution rate of the CR can be harnessed to infer the phylogeny of closely related species, and the use of a non-coding region alleviates biases resulting from both directional and purifying selection. Additionally, complete mitochondrial genome assemblies utilizing next generation sequencing (NGS) data often show exceptionally low coverage at specific regions, including the CR. This can only be resolved by targeted sequencing of this region. Here we provide novel sequence data for the echinoid mitochondrial control region in over 40 species across the echinoid phylogenetic tree. We demonstrate the advantages of directly targeting the CR and adjacent tRNAs to facilitate complementing low coverage NGS data from complete mitochondrial genome assemblies. Finally, we test the performance of this region as a phylogenetic marker both in the lab and in phylogenetic analyses, and demonstrate its superior performance over the other available mitochondrial markers in echinoids. 
    Our target region of the mitochondrial CR (1) facilitates the first thorough investigation of this region across a wide range of echinoid taxa, (2) provides a tool for complementing missing data in NGS experiments, and (3) identifies the CR as a powerful, novel marker for phylogenetic inference in echinoids due to its high variability, lack of selection, and high compatibility across the entire class, outperforming conventional mitochondrial markers.

  5. Determination of haplotypes at structurally complex regions using emulsion haplotype fusion PCR

    PubMed Central

    2012-01-01

    Background: Genotyping and massively-parallel sequencing projects result in a vast amount of diploid data that is only rarely resolved into its constituent haplotypes. It is nevertheless this phased information that is transmitted from one generation to the next and is most directly associated with biological function and the genetic causes of biological effects. Despite progress made in genome-wide sequencing and phasing algorithms and methods, problems assembling (and reconstructing linear haplotypes in) regions of repetitive DNA and structural variation remain. These dynamic and structurally complex regions are often poorly understood from a sequence point of view. Regions such as these that are highly similar in their sequence tend to be collapsed onto the genome assembly. This in turn means downstream determination of the true sequence haplotype in these regions poses a particular challenge. For structurally complex regions, a more focussed approach to assembling haplotypes may be required. Results: In order to investigate reconstruction of spatial information at structurally complex regions, we have used an emulsion haplotype fusion PCR approach to reproducibly link sequences of up to 1kb in length to allow phasing of multiple variants from neighbouring loci, using allele-specific PCR and sequencing to detect the phase. By using emulsion systems linking flanking regions to amplicons within the CNV, this led to the reconstruction of a 59kb haplotype across the DEFA1A3 CNV in HapMap individuals. Conclusion: This study has demonstrated a novel use for emulsion haplotype fusion PCR in addressing the issue of reconstructing structural haplotypes at multiallelic copy variable regions, using the DEFA1A3 locus as an example. PMID:23231411

  6. Fine Analysis of Genetic Diversity of the tpr Gene Family among Treponemal Species, Subspecies and Strains

    PubMed Central

    Centurion-Lara, Arturo; Giacani, Lorenzo; Godornes, Charmie; Molini, Barbara J.; Brinck Reid, Tara; Lukehart, Sheila A.

    2013-01-01

    Background The pathogenic non-cultivable treponemes include three subspecies of Treponema pallidum (pallidum, pertenue, endemicum), T. carateum, T. paraluiscuniculi, and the unclassified Fribourg-Blanc treponeme (Simian isolate). These treponemes are morphologically indistinguishable and antigenically and genetically highly similar, yet cross-immunity is variable or non-existent. Although all of these organisms cause chronic, multistage skin and systemic disease, they have historically been classified by mode of transmission, clinical presentations and host ranges. Whole genome studies underscore the high degree of sequence identity among species, subspecies and strains, pinpointing a limited number of genomic regions for variation. Many of these “hot spots” include members of the tpr gene family, composed of 12 paralogs encoding candidate virulence factors. We hypothesize that the distinct clinical presentations, host specificity, and variable cross-immunity might reside on virulence factors such as the tpr genes. Methodology/Principal Findings Sequence analysis of 11 tpr loci (excluding tprK) from 12 strains demonstrated an impressive heterogeneity, including SNPs, indels, chimeric genes, truncated gene products and large deletions. Comparative analyses of sequences and 3D models of predicted proteins in Subfamily I highlight the striking co-localization of discrete variable regions with predicted surface-exposed loops. A hallmark of Subfamily II is the presence of chimeric genes in the tprG and J loci. Diversity in Subfamily III is limited to tprA and tprL. Conclusions/Significance An impressive sequence variability was found in tpr sequences among the Treponema isolates examined in this study, with most of the variation being consistent within subspecies or species, or between syphilis vs. non-syphilis strains. Variability was seen in the pallidum subspecies, which can be divided into 5 genogroups. 
    These findings support a genetic basis for the classification of these organisms into their respective subspecies and species. Future functional studies will determine whether the identified genetic differences relate to cross-immunity, clinical differences, or host ranges. PMID:23696912

  7. Introduced T cell receptor variable region gene segments recombine in pre-B cells: evidence that B and T cells use a common recombinase.

    PubMed

    Yancopoulos, G D; Blackwell, T K; Suh, H; Hood, L; Alt, F W

    1986-01-31

    We have recently proposed that a common recombinase performs all of the many variable region gene assembly events in B and T cells, and that the specificity of these joining events is mediated by regulating the "accessibility" of the involved gene segments. To test this possibility, we have introduced "accessible" T cell receptor (TCR) variable region gene segments into a pre-B cell line capable of recombining endogenous and transfected immunoglobulin (Ig) variable region gene segments. Although the corresponding "inaccessible" endogenous TCR gene segments do not rearrange in this line or in B cells in general, the introduced TCR gene segments join very frequently and, in fact, closely resemble introduced Ig gene segments in their recombination characteristics. These observations suggest a new role for conventional Ig transcriptional enhancers--recombinational enhancement. Our studies provide insight into additional aspects of the joining mechanism such as N region insertion, aberrant joining, and recombination-recognition sequence requirements for joining.

  8. Phylogenetic analysis of feline immunodeficiency virus strains from naturally infected cats in Belgium and The Netherlands.

    PubMed

    Roukaerts, Inge D M; Theuns, Sebastiaan; Taffin, Elien R L; Daminet, Sylvie; Nauwynck, Hans J

    2015-01-22

    Feline immunodeficiency virus (FIV) is a major pathogen in feline populations worldwide, with seroprevalences up to 26%. Virus strains circulating in domestic cats are subdivided into different phylogenetic clades (A-E), based on the genetic diversity of the V3-V4 region of the env gene. In this report, a phylogenetic analysis of the V3-V4 env region, and a variable region in the gag gene was made for 36 FIV strains isolated in Belgium and The Netherlands. All newly generated gag sequences clustered together with previously known clade A FIV viruses, confirming the dominance of clade A viruses in Northern Europe. The same was true for the obtained env sequences, with only one sample of an unknown env subtype. Overall, the genetic diversity of FIV strains sequenced in this report was low. This indicates a relatively recent introduction of FIV in Belgium and The Netherlands. However, the sample with an unknown env subtype indicates that new introductions of FIV from unknown origin do occur and this will likely increase genetic variability in time. Copyright © 2014 Elsevier B.V. All rights reserved.

  9. The immunoglobulin heavy chain locus of the duck. Genomic organization and expression of D, J, and C region genes.

    PubMed

    Lundqvist, M L; Middleton, D L; Hazard, S; Warr, G W

    2001-12-14

    The region of the duck IgH locus extending from upstream of the proximal diversity (D) segment to downstream of the constant gene cluster has been cloned and mapped. A sequence contig of 48,796 base pairs established that the organization of the genes is D-J(H)-mu-alpha-upsilon. No evidence for a functional homologue (or remnant) of a delta gene was found. The alpha gene is in inverted transcriptional orientation; class switch to IgA expression thus requires inversion of the approximately 27-kilobase pair region that includes both mu and alpha genes. The secreted forms of duck alpha and mu are each encoded by 4 constant region exons, and the hydrophobic C-terminal regions of the membrane receptor forms of alpha and mu are encoded by one and two transmembrane exons, respectively. Putative switch (S) regions were identified for duck mu and upsilon by comparison with chicken Smu and Supsilon sequences and for duck alpha by comparison with mouse Salpha. The duck IgH locus is rich in complex variable number tandem repeats, which occupy approximately 60% of the sequenced region, and occur at a much higher frequency in the IgH locus than in other sequenced regions of the duck genome.

  10. The complete chloroplast genome of Cinnamomum camphora and its comparison with related Lauraceae species.

    PubMed

    Chen, Caihui; Zheng, Yongjie; Liu, Sian; Zhong, Yongda; Wu, Yanfang; Li, Jiang; Xu, Li-An; Xu, Meng

    2017-01-01

    Cinnamomum camphora, a member of the Lauraceae family, is a valuable aromatic and timber tree that is indigenous to the south of China and Japan. All parts of Cinnamomum camphora have secretory cells containing different volatile chemical compounds that are utilized as herbal medicines and essential oils. Here, we reported the complete sequencing of the chloroplast genome of Cinnamomum camphora using Illumina technology. The chloroplast genome of Cinnamomum camphora is 152,570 bp in length and characterized by a relatively conserved quadripartite structure containing a large single copy region of 93,705 bp, a small single copy region of 19,093 bp and two inverted repeat (IR) regions of 19,886 bp. Overall, the genome contained 123 coding regions, of which 15 were repeated in the IR regions. An analysis of chloroplast sequence divergence revealed that the small single copy region was highly variable among the different genera in the Lauraceae family. A total of 40 repeat structures and 83 simple sequence repeats were detected in both the coding and non-coding regions. A phylogenetic analysis indicated that Calycanthus is most closely related to Lauraceae, both being members of Laurales, which forms a sister group to Magnoliids. The complete sequence of the chloroplast of Cinnamomum camphora will aid in in-depth taxonomical studies of the Lauraceae family in the future. The genetic sequence information will also have valuable applications for chloroplast genetic engineering.
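
    The region lengths quoted in this abstract can be checked against the reported genome size, since a quadripartite chloroplast genome is the sum of the large single copy region, the small single copy region and two copies of the inverted repeat. The short Python sketch below performs that check; the variable and function names are illustrative and do not come from the paper.

```python
# Region lengths in bp, taken from the abstract above.
LSC = 93_705             # large single copy region
SSC = 19_093             # small single copy region
IR = 19_886              # one inverted repeat; the genome carries two copies
REPORTED_TOTAL = 152_570  # genome length stated in the abstract


def quadripartite_length(lsc: int, ssc: int, ir: int) -> int:
    """Total length of a quadripartite plastome: LSC + SSC + 2 * IR."""
    return lsc + ssc + 2 * ir


total = quadripartite_length(LSC, SSC, IR)
# Prints the computed total and whether it matches the reported genome length.
print(total, total == REPORTED_TOTAL)
```

    The computed total agrees with the 152,570 bp given in the abstract, so the 19,886 bp figure is the length of each inverted repeat rather than of both combined.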

  11. spa typing for epidemiological surveillance of Staphylococcus aureus.

    PubMed

    Hallin, Marie; Friedrich, Alexander W; Struelens, Marc J

    2009-01-01

    The spa typing method is based on sequencing of the polymorphic X region of the protein A gene (spa), present in all strains of Staphylococcus aureus. The X region is constituted of a variable number of 24-bp repeats flanked by well-conserved regions. This single-locus sequence-based typing method combines a number of technical advantages, such as rapidity, reproducibility, and portability. Moreover, due to its repeat structure, the spa locus simultaneously indexes micro- and macrovariations, enabling the use of spa typing in both local and global epidemiological studies. These studies are facilitated by the establishment of standardized spa type nomenclature and Internet shared databases.
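
    As a rough illustration of the repeat structure described in this abstract, the Python sketch below cuts an X-region sequence into 24-bp units and numbers each distinct unit in order of first appearance. It is a minimal sketch under stated assumptions: the sequences are synthetic, the function names are hypothetical, and the integer IDs are arbitrary labels, not the standardized spa type nomenclature the abstract mentions.

```python
REPEAT_LEN = 24  # repeat unit length stated in the abstract


def split_repeats(x_region: str, repeat_len: int = REPEAT_LEN) -> list[str]:
    """Cut an X-region sequence into consecutive fixed-length repeat units."""
    if len(x_region) % repeat_len != 0:
        raise ValueError("sequence length is not a multiple of the repeat length")
    return [x_region[i:i + repeat_len] for i in range(0, len(x_region), repeat_len)]


def repeat_profile(x_region: str) -> list[int]:
    """Label each repeat with an arbitrary ID assigned in order of first appearance."""
    ids: dict[str, int] = {}
    profile = []
    for unit in split_repeats(x_region):
        if unit not in ids:
            ids[unit] = len(ids) + 1
        profile.append(ids[unit])
    return profile


# Synthetic 24-bp units (not real spa repeats), used only to exercise the code.
unit_a = "ACGT" * 6
unit_b = "GGCC" * 6
toy_x_region = unit_a + unit_b + unit_a

print(repeat_profile(toy_x_region))
```

    Two isolates with the same profile share both the number and the order of repeats; a difference in profile length reflects a gain or loss of whole repeats, while a changed ID at one position reflects a point difference within a repeat.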

  12. Polyomavirus BK non-coding control region rearrangements in health and disease.

    PubMed

    Sharma, Preety M; Gupta, Gaurav; Vats, Abhay; Shapiro, Ron; Randhawa, Parmjeet S

    2007-08-01

    BK virus is an increasingly recognized pathogen in transplanted patients. DNA sequencing of this virus shows considerable genomic variability. To understand the clinical significance of rearrangements in the non-coding control region (NCCR) of BK virus (BKV), we report a meta-analysis of 507 sequences, including 40 sequences generated in our own laboratory, for associations between rearrangements and disease, tissue tropism, geographic origin, and viral genotype. NCCR rearrangements were less frequent in (a) asymptomatic BKV viruria compared to patients with viral nephropathy (1.7% vs. 22.5%), and (b) viral genotype 1 compared to other genotypes (2.4% vs. 11.2%). Rearrangements were commoner in malignancy (78.6%), and Norwegians (45.7%), and less common in East Indians (0%), and Japanese (4.3%). A surprising number of rearranged sequences were reported from mononuclear cells of healthy subjects, whereas most plasma sequences were archetypal. This difference could not be related to potential recombinase activity in lymphocytes, as consensus recombination signal sequences could not be found in the NCCR region. NCCR rearrangements are neither required nor a sufficient condition to produce clinical disease. BKV nephropathy and hemorrhagic cystitis are not associated with any unique NCCR configuration or nucleotide sequence.

  13. Influence of flanking sequences on variability in expression levels of an introduced gene in transgenic tobacco plants.

    PubMed Central

    Dean, C; Jones, J; Favreau, M; Dunsmuir, P; Bedbrook, J

    1988-01-01

    The petunia rbcS gene SSU301 was introduced into tobacco using Agrobacterium tumefaciens-mediated transformation. The time at which rbcS expression was maximal after transfer of the tobacco plants to the greenhouse was determined. The expression level of the SSU301 gene varied up to 9 fold between individual tobacco plants which had been standardized physiologically as much as possible. The presence of adjacent pUC plasmid sequences did not affect the expression of the SSU301 gene. In an attempt to reduce the between-transformant variability in expression, the SSU301 gene was introduced into tobacco surrounded by 10kb of 5' and 13 kb of 3' DNA sequences which normally flank SSU301 in petunia. The longer flanking regions did not reduce the between-transformant variability of SSU301 gene expression. Images PMID:3174450

  14. Diatom centromeres suggest a mechanism for nuclear DNA acquisition

    DOE PAGES

    Diner, Rachel E.; Noddings, Chari M.; Lian, Nathan C.; ...

    2017-07-18

    Centromeres are essential for cell division and growth in all eukaryotes, and knowledge of their sequence and structure guides the development of artificial chromosomes for functional cellular biology studies. Centromeric proteins are conserved among eukaryotes; however, centromeric DNA sequences are highly variable. We combined forward and reverse genetic approaches with chromatin immunoprecipitation to identify centromeres of the model diatom Phaeodactylum tricornutum. We observed 25 unique centromere sequences typically occurring once per chromosome, a finding that helps to resolve nuclear genome organization and indicates monocentric regional centromeres. Diatom centromere sequences contain low-GC content regions but lack repeats or other conserved sequence features. Native and foreign sequences with similar GC content to P. tricornutum centromeres can maintain episomes and recruit the diatom centromeric histone protein CENH3, suggesting nonnative sequences can also function as diatom centromeres. Thus, simple sequence requirements may enable DNA from foreign sources to persist in the nucleus as extrachromosomal episomes, revealing a potential mechanism for organellar and foreign DNA acquisition.

  15. Diatom centromeres suggest a mechanism for nuclear DNA acquisition

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Diner, Rachel E.; Noddings, Chari M.; Lian, Nathan C.

    Centromeres are essential for cell division and growth in all eukaryotes, and knowledge of their sequence and structure guides the development of artificial chromosomes for functional cellular biology studies. Centromeric proteins are conserved among eukaryotes; however, centromeric DNA sequences are highly variable. We combined forward and reverse genetic approaches with chromatin immunoprecipitation to identify centromeres of the model diatom Phaeodactylum tricornutum. We observed 25 unique centromere sequences typically occurring once per chromosome, a finding that helps to resolve nuclear genome organization and indicates monocentric regional centromeres. Diatom centromere sequences contain low-GC content regions but lack repeats or other conserved sequence features. Native and foreign sequences with similar GC content to P. tricornutum centromeres can maintain episomes and recruit the diatom centromeric histone protein CENH3, suggesting nonnative sequences can also function as diatom centromeres. Thus, simple sequence requirements may enable DNA from foreign sources to persist in the nucleus as extrachromosomal episomes, revealing a potential mechanism for organellar and foreign DNA acquisition.

  16. Analysis of variable sites between two complete South China tiger (Panthera tigris amoyensis) mitochondrial genomes.

    PubMed

    Zhang, Wenping; Yue, Bisong; Wang, Xiaofang; Zhang, Xiuyue; Xie, Zhong; Liu, Nonglin; Fu, Wenyuan; Yuan, Yaohua; Chen, Daqing; Fu, Danghua; Zhao, Bo; Yin, Yuzhong; Yan, Xiahui; Wang, Xinjing; Zhang, Rongying; Liu, Jie; Li, Maoping; Tang, Yao; Hou, Rong; Zhang, Zhihe

    2011-10-01

    In order to investigate the mitochondrial genome of Panthera tigris amoyensis, two South China tigers (P25 and P27) were analyzed following 15 cymt-specific primer sets. The entire mtDNA sequence was found to be 16,957 bp and 17,001 bp long for P25 and P27 respectively, and this difference in length between P25 and P27 occurred in the number of tandem repeats in the RS-3 segment of the control region. The structural characteristics of complete P. t. amoyensis mitochondrial genomes were also highly similar to those of P. uncia. Additionally, the rate of point mutation was only 0.3% and a total of 59 variable sites between P25 and P27 were found. Out of the 59 variable sites, 6 were located in 6 different tRNA genes, 6 in the 2 rRNA genes, 7 in non-coding regions (one located between tRNA-Asn and tRNA-Tyr and six in the D-loop), and 40 in 10 protein-coding genes. COI held the largest amount of variable sites (9 sites) and Cytb contained the highest variable rate (0.7%) in the complete sequences. Moreover, out of the 40 variable sites located in 10 protein-coding genes, 12 sites were nonsynonymous.
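
    The counts in this abstract can be cross-checked with a few lines of arithmetic. The Python sketch below tallies the variable sites by genomic category, compares the sum with the reported total, and derives the length difference between the two genomes; the dictionary keys and variable names are illustrative labels, not terms defined by the authors.

```python
# Variable sites per category, as listed in the abstract above.
variable_sites = {
    "tRNA genes": 6,
    "rRNA genes": 6,
    "non-coding regions": 7,   # one between tRNA-Asn and tRNA-Tyr, six in the D-loop
    "protein-coding genes": 40,
}
REPORTED_TOTAL = 59

# Genome lengths in bp for the two tigers.
LENGTH_P25 = 16_957
LENGTH_P27 = 17_001

total_sites = sum(variable_sites.values())
length_difference = LENGTH_P27 - LENGTH_P25

# 12 of the 40 protein-coding variable sites were nonsynonymous.
nonsynonymous_fraction = 12 / variable_sites["protein-coding genes"]

print(total_sites, total_sites == REPORTED_TOTAL)
print(length_difference)
print(nonsynonymous_fraction)
```

    The per-category counts sum to the 59 sites reported, and the two genomes differ by 44 bp, which the abstract attributes to the number of tandem repeats in the RS-3 segment of the control region.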

  17. Analysis of mitochondrial DNA in Bolivian llama, alpaca and vicuna populations: a contribution to the phylogeny of the South American camelids.

    PubMed

    Barreta, J; Gutiérrez-Gil, B; Iñiguez, V; Saavedra, V; Chiri, R; Latorre, E; Arranz, J J

    2013-04-01

    The objectives of this work were to assess the mtDNA diversity of Bolivian South American camelid (SAC) populations and to shed light on the evolutionary relationships between the Bolivian camelids and other populations of SACs. We have analysed two different mtDNA regions: the complete coding region of the MT-CYB gene and 513 bp of the D-loop region. The populations sampled included Bolivian llamas, alpacas and vicunas, and Chilean guanacos. High levels of genetic diversity were observed in the studied populations. In general, MT-CYB was more variable than D-loop. On a species level, the vicunas showed the lowest genetic variability, followed by the guanacos, alpacas and llamas. Phylogenetic analyses performed by including additional available mtDNA sequences from the studied species confirmed the existence of the two monophyletic clades previously described by other authors for guanacos (G) and vicunas (V). Significant levels of mtDNA hybridization were found in the domestic species. Our sequence analyses revealed significant sequence divergence within clade G, and some of the Bolivian llamas grouped with the majority of the southern guanacos. This finding supports the existence of more than the one llama domestication centre in South America previously suggested on the basis of archaeozoological evidence. Additionally, analysis of D-loop sequences revealed two new matrilineal lineages that are distinct from the previously reported G and V clades. The results presented here represent the first report on the population structure and genetic variability of Bolivian camelids and may help to elucidate the complex and dynamic domestication process of SAC populations. © 2012 The Authors, Animal Genetics © 2012 Stichting International Foundation for Animal Genetics.

  18. The Genetic Diversity, Haplotype Analysis, and Phylogenetic Relationship of Aedes albopictus (Diptera: Culicidae) Based on the Cytochrome Oxidase 1 Marker: A Malaysian Scenario.

    PubMed

    Ismail, Nurul-Ain; Adilah-Amrannudin, Nurul; Hamsidi, Mayamin; Ismail, Rodziah; Dom, Nazri Che; Ahmad, Abu Hassan; Mastuki, Mohd Fahmi; Camalxaman, Siti Nazrina

    2017-11-07

    The global expansion of Ae. albopictus from its native range in Southeast Asia has been implicated in the recent emergence of dengue endemicity in Malaysia. Genetic variability studies of Ae. albopictus are currently lacking in the Malaysian setting, yet are crucial to enhancing the existing vector control strategies. The study was conducted to establish the genetic variability of maternally inherited mitochondrial DNA encoding for cytochrome oxidase subunit 1 (CO1) gene in Ae. albopictus. Twelve localities were selected in the Subang Jaya district based on temporal indices utilizing 120 mosquito samples. Genetic polymorphism and phylogenetic analysis were conducted to unveil the genetic variability and geographic origins of Ae. albopictus. The haplotype network was mapped to determine the genealogical relationship of sequences among groups of population in the Asian region. Comparison of Malaysian CO1 sequences with sequences derived from five Asian countries revealed genetically distinct Ae. albopictus populations. Phylogenetic analysis revealed that all sequences from other Asian countries descended from the same genetic lineage as the Malaysian sequences. Noteworthy, our study highlights the discovery of 20 novel haplotypes within the Malaysian population which to date had not been reported. These findings could help determine the genetic variation of this invasive species, which in turn could possibly improve the current dengue vector surveillance strategies, locally and regionally. © The Authors 2017. Published by Oxford University Press on behalf of Entomological Society of America. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  19. Analysis of human papillomavirus 16 E6, E7 genes and Long Control Region in cervical samples from Uruguayan women.

    PubMed

    Ramas, Viviana; Mirazo, Santiago; Bonilla, Sylvia; Ruchansky, Dora; Arbiza, Juan

    2018-05-15

    This study aims to investigate the HPV16 variant distribution by sequence analyses of E6, E7 oncogenes and the Long Control Region (LCR), from cervical cells collected from Uruguayan women, and to reconstruct the phylogenetic relationships among variants. Forty-seven HPV16 variants, obtained from women with HSIL, LSIL, ASCUS and NILM cytological classes were analyzed for LCR and 12 were further studied for E6 and E7. Detailed sequence comparison, genetic heterogeneity analyses and phylogenetic reconstruction were performed. A high variability was observed among LCR sequences, which were distributed in 18 different variants. E6 and E7 sequences exhibited novel non-synonymous substitutions. Uruguayan sequences mainly belonged to the European lineage, and only 5 sequences clustered in non-European branches; 3 of them in the Asian-American and North-American lineage and 2 in an African branch. Additionally, 6 new variants from European and African clusters were identified. HPV16 isolates mainly belonged to the European lineage, though strains from African and Asian-American lineages were also identified. Herein is reported for the first time the distribution and molecular characterization of HPV16 variants from Uruguay, providing novel insights on the molecular epidemiology of this infectious disease in South America. The high variability observed among HPV16 isolates, which mainly belonged to the European lineage, provides an extensive sequence dataset from a country with a high burden of cervical cancer. Copyright © 2018 Elsevier B.V. All rights reserved.

  20. Differential recognition of the ORF2 region in a complete genome sequence of porcine circovirus type 2 (PCV2) isolated from boar bone marrow in Korea.

    PubMed

    Kweon, Chang-Hee; Nguyen, Lien Thi Kim; Yoo, Mi-Sun; Kang, Seung-Won

    2015-09-15

    Porcine circovirus type 2 (PCV2) is the causative agent of post-weaning multisystemic wasting syndrome (PMWS) in swine. Here, a phylogenetic tree was constructed using PCV2 nucleotide sequences derived from the bone marrow of Korean boar and previously reported PCV2 sequences isolated from various countries. PCV2 from Korean boar bone marrow (KC188796) was classified into the group containing PCV2a-Canada and other PCV2 strains from Korea. While the ORF1 region of the PCV2 genome was highly conserved, ORF2 (the capsid protein coding region) was relatively variable. The nucleotide sequences for bone marrow-derived PCV2 were 93.4-99.0% homologous to the other reference sequences. The deduced amino acid sequences for the ORF1 and ORF2 coding regions were 97.4-99.3% and 84.5-97.4% homologous with the other reference strains, respectively, indicating that KC188796 did not differ markedly from the other PCV2 strains. Phylogenetic analysis demonstrated that bone marrow-derived PCV2 was highly similar to PCV2a from Canada and may be related to persistent PCV2 infections in swine. Copyright © 2015 Elsevier B.V. All rights reserved.

  1. Analysis of sequence variability in the macronuclear DNA of Paramecium tetraurelia: A somatic view of the germline

    PubMed Central

    Duret, Laurent; Cohen, Jean; Jubin, Claire; Dessen, Philippe; Goût, Jean-François; Mousset, Sylvain; Aury, Jean-Marc; Jaillon, Olivier; Noël, Benjamin; Arnaiz, Olivier; Bétermier, Mireille; Wincker, Patrick; Meyer, Eric; Sperling, Linda

    2008-01-01

    Ciliates are the only unicellular eukaryotes known to separate germinal and somatic functions. Diploid but silent micronuclei transmit the genetic information to the next sexual generation. Polyploid macronuclei express the genetic information from a streamlined version of the genome but are replaced at each sexual generation. The macronuclear genome of Paramecium tetraurelia was recently sequenced by a shotgun approach, providing access to the gene repertoire. The 72-Mb assembly represents a consensus sequence for the somatic DNA, which is produced after sexual events by reproducible rearrangements of the zygotic genome involving elimination of repeated sequences, precise excision of unique-copy internal eliminated sequences (IES), and amplification of the cellular genes to high copy number. We report use of the shotgun sequencing data (>10^6 reads representing 13× coverage of a completely homozygous clone) to evaluate variability in the somatic DNA produced by these developmental genome rearrangements. Although DNA amplification appears uniform, both of the DNA elimination processes produce sequence heterogeneity. The variability that arises from IES excision allowed identification of hundreds of putative new IESs, compared to 42 that were previously known, and revealed cases of erroneous excision of segments of coding sequences. We demonstrate that IESs in coding regions are under selective pressure to introduce premature termination of translation in case of excision failure. PMID:18256234

  2. Variability and molecular typing of the woody-tree infecting prunus necrotic ringspot ilarvirus.

    PubMed

    Vasková, D; Petrzik, K; Karesová, R

    2000-01-01

    The 3'-part of the movement protein gene, the intergenic region and the complete coat protein gene of sixteen isolates of Prunus necrotic ringspot virus (PNRSV) from five different host species from the Czech Republic were sequenced in order to search for the bases of extensive variability of viroses caused by this pathogen. According to phylogenetic analyses all the 46 isolates sequenced to date split into three main groups, which correlated to a certain extent with their geographic origin. Modelled serological properties showed that all the new isolates belong to one serotype.

  3. Evolution of sfbI Encoding Streptococcal Fibronectin-Binding Protein I: Horizontal Genetic Transfer and Gene Mosaic Structure

    PubMed Central

    Towers, Rebecca J.; Fagan, Peter K.; Talay, Susanne R.; Currie, Bart J.; Sriprakash, Kadaba S.; Walker, Mark J.; Chhatwal, Gursharan S.

    2003-01-01

    Streptococcal fibronectin-binding protein is an important virulence factor involved in colonization and invasion of epithelial cells and tissues by Streptococcus pyogenes. In order to investigate the mechanisms involved in the evolution of sfbI, the sfbI genes from 54 strains were sequenced. Thirty-four distinct alleles were identified. Three principal mechanisms appear to have been involved in the evolution of sfbI. The amino-terminal aromatic amino acid-rich domain is the most variable region and is apparently generated by intergenic recombination of horizontally acquired DNA cassettes, resulting in a genetic mosaic in this region. Two distinct and divergent sequence types that shared only 61 to 70% identity were identified in the central proline-rich region, while variation at the 3′ end of the gene is due to deletion or duplication of defined repeat units. Potential antigenic and functional variabilities in SfbI imply significant selective pressure in vivo with direct implications for the microbial pathogenesis of S. pyogenes. PMID:14662917

  4. Complete mitochondrial genome of Xingguo red carp (Cyprinus carpio var. singuonensis) and purse red carp (Cyprinus carpio var. wuyuanensis).

    PubMed

    Hu, Guang-Fu; Liu, Xiang-Jiang; Li, Zhong; Liang, Hong-Wei; Hu, Shao-Na; Zou, Gui-Wei

    2016-01-01

    The complete mitochondrial genomes of Xingguo red carp (Cyprinus carpio var. singuonensis) and purse red carp (Cyprinus carpio var. wuyuanensis) were sequenced. Comparison of these two mitochondrial genomes revealed that the mtDNAs of these two common carp varieties were remarkably similar in genome length, gene order and content, and AT content. However, size variation between these two mitochondrial genomes presented here showed 39 site differences in overall length. About 2 site differences were located in rRNAs, 3 in tRNAs, 3 in the control region, and 31 in protein-coding genes. Thirty-one variable bases in the protein-coding regions between the two varieties' mitochondrial sequences led to three variable amino acids, which were mainly located in the proteins ND5 and ND4.

  5. Region 4 of Rhizobium etli Primary Sigma Factor (SigA) Confers Transcriptional Laxity in Escherichia coli.

    PubMed

    Santillán, Orlando; Ramírez-Romero, Miguel A; Lozano, Luis; Checa, Alberto; Encarnación, Sergio M; Dávila, Guillermo

    2016-01-01

    Sigma factors are RNA polymerase subunits engaged in promoter recognition and DNA strand separation during transcription initiation in bacteria. Primary sigma factors are responsible for the expression of housekeeping genes and are essential for survival. RpoD, the primary sigma factor of Escherichia coli, a γ-proteobacteria, recognizes consensus promoter sequences highly similar to those of some α-proteobacteria species. Despite this resemblance, RpoD is unable to sustain transcription from most of the α-proteobacterial promoters tested so far. In contrast, we have found that SigA, the primary sigma factor of Rhizobium etli, an α-proteobacteria, is able to transcribe E. coli promoters, although it exhibits only 48% identity (98% coverage) to RpoD. We have called this the transcriptional laxity phenomenon. Here, we show that SigA partially complements the thermo-sensitive deficiency of RpoD285 from E. coli strain UQ285 and that the SigA region σ4 is responsible for this phenotype. Sixteen out of 74 residues (21.6%) within region σ4 are variable between RpoD and SigA. Mutating these residues significantly improves SigA ability to complement E. coli UQ285. Only six of these residues fall into positions already known to interact with promoter DNA and to comprise a helix-turn-helix motif. The remaining variable positions are located on previously unexplored sites inside region σ4, specifically into the first two α-helices of the region. Neither of the variable positions confined to these helices seem to interact directly with promoter sequence; instead, we adduce that these residues participate allosterically by contributing to correct region folding and/or positioning of the HTH motif. We propose that transcriptional laxity is a mechanism for ensuring transcription in spite of naturally occurring mutations from endogenous promoters and/or horizontally transferred DNA sequences, allowing survival and fast environmental adaptation of α-proteobacteria.

  6. Region 4 of Rhizobium etli Primary Sigma Factor (SigA) Confers Transcriptional Laxity in Escherichia coli

    PubMed Central

    Santillán, Orlando; Ramírez-Romero, Miguel A.; Lozano, Luis; Checa, Alberto; Encarnación, Sergio M.; Dávila, Guillermo

    2016-01-01

    Sigma factors are RNA polymerase subunits engaged in promoter recognition and DNA strand separation during transcription initiation in bacteria. Primary sigma factors are responsible for the expression of housekeeping genes and are essential for survival. RpoD, the primary sigma factor of Escherichia coli, a γ-proteobacteria, recognizes consensus promoter sequences highly similar to those of some α-proteobacteria species. Despite this resemblance, RpoD is unable to sustain transcription from most of the α-proteobacterial promoters tested so far. In contrast, we have found that SigA, the primary sigma factor of Rhizobium etli, an α-proteobacteria, is able to transcribe E. coli promoters, although it exhibits only 48% identity (98% coverage) to RpoD. We have called this the transcriptional laxity phenomenon. Here, we show that SigA partially complements the thermo-sensitive deficiency of RpoD285 from E. coli strain UQ285 and that the SigA region σ4 is responsible for this phenotype. Sixteen out of 74 residues (21.6%) within region σ4 are variable between RpoD and SigA. Mutating these residues significantly improves SigA ability to complement E. coli UQ285. Only six of these residues fall into positions already known to interact with promoter DNA and to comprise a helix-turn-helix motif. The remaining variable positions are located on previously unexplored sites inside region σ4, specifically into the first two α-helices of the region. Neither of the variable positions confined to these helices seem to interact directly with promoter sequence; instead, we adduce that these residues participate allosterically by contributing to correct region folding and/or positioning of the HTH motif. We propose that transcriptional laxity is a mechanism for ensuring transcription in spite of naturally occurring mutations from endogenous promoters and/or horizontally transferred DNA sequences, allowing survival and fast environmental adaptation of α-proteobacteria. PMID:27468278

  7. A comparison of chloroplast genome sequences in Aconitum (Ranunculaceae): a traditional herbal medicinal genus

    PubMed Central

    Yao, Gang

    2017-01-01

    The herbal medicinal genus Aconitum L., belonging to the Ranunculaceae family, represents the earliest diverging lineage within the eudicots. It currently comprises two subgenera, A. subgenus Lycoctonum and A. subg. Aconitum. The complete chloroplast (cp) genome sequences were characterized in three species: A. angustius, A. finetianum, and A. sinomontanum in subg. Lycoctonum and compared to other Aconitum species to clarify their phylogenetic relationship and provide molecular information for utilization of Aconitum species particularly in Eastern Asia. The lengths of the chloroplast genome sequences were 156,109 bp in A. angustius, 155,625 bp in A. finetianum and 157,215 bp in A. sinomontanum, with each species possessing 126 genes with 84 protein coding genes (PCGs). While genomic rearrangements were absent, structural variation was detected in the LSC/IR/SSC boundaries. Five pseudogenes were identified, among which Ψrps19 and Ψycf1 were in the LSC/IR/SSC boundaries, Ψrps16 and ΨinfA in the LSC region, and Ψycf15 in the IRb region. The nucleotide variability (Pi) of Aconitum was estimated to be 0.00549, with comparably higher variations in the LSC and SSC than the IR regions. Eight intergenic regions were revealed to be highly variable and a total of 58–62 simple sequence repeats (SSRs) were detected in all three species. More than 80% of SSRs were present in the LSC region. Altogether, 64.41% and 46.81% of SSRs are mononucleotides in subg. Lycoctonum and subg. Aconitum, respectively, while a higher percentage of di-, tri-, tetra-, and penta- SSRs were present in subg. Aconitum. Most species of subg. Aconitum in Eastern Asia were first used for phylogenetic analyses. The availability of the complete cp genome sequences of these species in subg. Lycoctonum will benefit future phylogenetic analyses and aid in germplasm utilization in Aconitum species. PMID:29134154
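
    The nucleotide variability (Pi) quoted in the record above is commonly defined as the average number of pairwise differences per aligned site. The following is a minimal sketch of that definition only, not the authors' pipeline: the function name and the toy alignment are hypothetical, and gaps or ambiguous bases are treated as ordinary characters for simplicity.

```python
from itertools import combinations


def nucleotide_diversity(aligned):
    """Average pairwise differences per site (pi) for aligned, equal-length sequences."""
    if len(aligned) < 2:
        raise ValueError("need at least two sequences")
    length = len(aligned[0])
    if length == 0 or any(len(s) != length for s in aligned):
        raise ValueError("sequences must be non-empty and aligned to the same length")
    pairs = list(combinations(aligned, 2))
    # Count mismatching columns for every pair of sequences.
    total_diffs = sum(sum(a != b for a, b in zip(s1, s2)) for s1, s2 in pairs)
    return total_diffs / (len(pairs) * length)


# Hypothetical toy alignment (not data from the record above).
toy_alignment = ["ACGTACGTAC", "ACGTACGTAT", "ACGAACGTAC"]
pi = nucleotide_diversity(toy_alignment)
print(f"pi = {pi:.4f}")
```

    Applying the same calculation in sliding windows along a genome alignment is one way to locate highly variable stretches such as the intergenic regions mentioned in the abstract.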

  8. A comparison of chloroplast genome sequences in Aconitum (Ranunculaceae): a traditional herbal medicinal genus.

    PubMed

    Kong, Hanghui; Liu, Wanzhen; Yao, Gang; Gong, Wei

    2017-01-01

    The herbal medicinal genus Aconitum L., belonging to the Ranunculaceae family, represents the earliest diverging lineage within the eudicots. It currently comprises two subgenera, A. subgenus Lycoctonum and A. subg. Aconitum. The complete chloroplast (cp) genome sequences were characterized in three species: A. angustius, A. finetianum, and A. sinomontanum in subg. Lycoctonum and compared to other Aconitum species to clarify their phylogenetic relationship and provide molecular information for utilization of Aconitum species particularly in Eastern Asia. The lengths of the chloroplast genome sequences were 156,109 bp in A. angustius, 155,625 bp in A. finetianum and 157,215 bp in A. sinomontanum, with each species possessing 126 genes with 84 protein coding genes (PCGs). While genomic rearrangements were absent, structural variation was detected in the LSC/IR/SSC boundaries. Five pseudogenes were identified, among which Ψrps19 and Ψycf1 were in the LSC/IR/SSC boundaries, Ψrps16 and ΨinfA in the LSC region, and Ψycf15 in the IRb region. The nucleotide variability (Pi) of Aconitum was estimated to be 0.00549, with comparably higher variations in the LSC and SSC than the IR regions. Eight intergenic regions were revealed to be highly variable and a total of 58-62 simple sequence repeats (SSRs) were detected in all three species. More than 80% of SSRs were present in the LSC region. Altogether, 64.41% and 46.81% of SSRs are mononucleotides in subg. Lycoctonum and subg. Aconitum, respectively, while a higher percentage of di-, tri-, tetra-, and penta- SSRs were present in subg. Aconitum. Most species of subg. Aconitum in Eastern Asia were first used for phylogenetic analyses. The availability of the complete cp genome sequences of these species in subg. Lycoctonum will benefit future phylogenetic analyses and aid in germplasm utilization in Aconitum species.

  9. Characteristics, stratigraphic architecture, and time framework of multi-order mixed siliciclastic and carbonate depositional sequences, outcropping Cisco Group (Late Pennsylvanian and Early Permian), Eastern Shelf, north-central Texas, USA

    NASA Astrophysics Data System (ADS)

    Yang, Wan; Kominz, Michelle A.

    2003-01-01

    The Cisco Group on the Eastern Shelf of the Midland Basin is composed of fluvial, deltaic, shelf, shelf-margin, and slope-to-basin carbonate and siliciclastic rocks. Sedimentologic and stratigraphic analyses of 181 meter-to-decimeter-scale depositional sequences exposed in the up-dip shelf indicated that the siliciclastic and carbonate parasequences in the transgressive systems tracts (TST) are thin and upward deepening, whereas those in highstand systems tracts (HST) are thick and upward shallowing. The sequences can be subdivided into five types on the basis of principal lithofacies, and exhibit variable magnitude of facies shift corresponding to variable extents of marine transgression and regression on the shelf. The sequence stacking patterns and their regional persistence suggest a three-level sequence hierarchy controlled by eustasy, whereas local and regional changes in lithology, thickness, and sequence type, magnitude, and absence were controlled by interplay of eustasy, differential shelf subsidence, depositional topography, and pattern of siliciclastic supply. The outcropping Cisco Group is highly incomplete with an estimated 6-11% stratigraphic completeness. The average duration of deposition of the major (third-order) sequences is estimated as 67-102 ka on the up-dip shelf and increases down dip, while the average duration of the major sequence boundaries (SB) is estimated as 831-1066 ka and decreases down dip. The nondepositional and erosional hiatus on the up-dip shelf was represented by lowstand deltaic systems in the basin and slope.
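
    The completeness and duration figures in the record above can be cross-checked with simple arithmetic. The sketch below assumes, as an illustration only, that stratigraphic completeness is the fraction of total time represented by deposition (deposition divided by deposition plus hiatus); the abstract does not state the authors' formula, and the function name is hypothetical.

```python
def completeness(deposition_ka, hiatus_ka):
    """Fraction of elapsed time recorded by deposition (assumed definition)."""
    return deposition_ka / (deposition_ka + hiatus_ka)


# Ranges quoted in the abstract, in thousands of years (ka).
deposition = (67, 102)   # average duration of deposition of the major sequences
hiatus = (831, 1066)     # average duration of the major sequence boundaries

# Pair the extremes to bracket the possible range.
low = completeness(deposition[0], hiatus[1])   # shortest deposition, longest hiatus
high = completeness(deposition[1], hiatus[0])  # longest deposition, shortest hiatus
print(f"completeness range: {100 * low:.1f}% to {100 * high:.1f}%")
```

    Under this assumed definition the bracket rounds to 6% and 11%, which is consistent with the 6-11% completeness quoted in the abstract.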

  10. The complementarity-determining region sequences in IgY antivenom hypervariable regions.

    PubMed

    da Rocha, David Gitirana; Fernandez, Jorge Hernandez; de Almeida, Claudia Maria Costa; da Silva, Claudia Letícia; Magnoli, Fabio Carlos; da Silva, Osmair Élder; da Silva, Wilmar Dias

    2017-08-01

    The data presented in this article are related to the research article entitled "Development of IgY antibodies against anti-snake toxins endowed with highly lethal neutralizing activity" (da Rocha et al., 2017) [1]. Complementarity-determining region (CDR) sequences are variable antibody (Ab) sequences that respond with specificity, duration and strength to identify and bind to antigen (Ag) epitopes. B lymphocytes isolated from hens immunized with Bitis arietans (Ba) and anti-Crotalus durissus terrificus (Cdt) venoms and expressing high specificity, affinity and toxicity neutralizing antibody titers were used as DNA sources. The VLF1, CDR1, CDR2, VLR1 and CDR3 sequences were validated by BLASTp, and values corresponding to IgY VL and VH anti-Ba or anti-Cdt venoms were identified, registered [Gallus gallus IgY Fv Light chain (GU815099)/Gallus gallus IgY Fv Heavy chain (GU815098)] and used for molecular modeling of IgY scFv anti-Ba. The resulting CDR1, CDR2 and CDR3 sequences were combined to construct the three-dimensional structure of the Ab paratope.

  11. Analysis of new isolates reveals new genome organization and a hypervariable region in infectious myonecrosis virus (IMNV).

    PubMed

    Dantas, Márcia Danielle A; Chavante, Suely F; Teixeira, Dárlio Inácio A; Lima, João Paulo M S; Lanza, Daniel C F

    2015-05-04

    Infectious myonecrosis virus (IMNV) has been the cause of many losses in shrimp farming since 2002, when the first myonecrosis outbreak was reported on Brazil's northeast coast. Two additional genomes of Brazilian IMNV isolates collected in 2009 and 2013 were sequenced and analyzed in the present study. The sequencing revealed an extra 643 bp and 22 bp at the 5' and 3' ends of the IMNV genome, respectively, confirming that its actual size is at least 8226 bp. Considering these additional sequences at the genome extremities, ORF1 can start at nt 470, encoding a 1708 aa polyprotein. Computational predictions reveal two stem loops and two pseudoknots at the 5' end and a putative stem loop and a slippery motif located at the 3' end, indicating that these regions can be involved in the start and termination of translation. Through a careful phylogenetic analysis, a higher genetic variability among Brazilian isolates could be observed, compared with Indonesian IMNV isolates. It was also observed that the most variable region of the IMNV genome is located in the first half of ORF1, coinciding with a region which probably encodes the capsid protrusions. The results presented here are a starting point to elucidate the virus's translational regulation and the mechanisms involved in virulence. Copyright © 2015 Elsevier B.V. All rights reserved.

  12. Eurasian otters, Lutra lutra, have a dominant mtDNA haplotype from the Iberian Peninsula to Scandinavia.

    PubMed

    Ferrando, Ainhoa; Ponsà, Montserrat; Marmi, Josep; Domingo-Roura, Xavier

    2004-01-01

    The Eurasian otter, Lutra lutra, has a Palaearctic distribution and has suffered a severe decline throughout Europe during the last century. Previous studies in this and other mustelids have shown reduced levels of variability in mitochondrial DNA, although otter phylogeographic studies were restricted to central-western Europe. In this work we have sequenced 361 bp of the mtDNA control region in 73 individuals from eight countries and added our results to eight sequences available from GenBank and the literature. The range of distribution has been expanded in relation to previous works north towards Scandinavia, east to Russia and Belarus, and south to the Iberian Peninsula. We found a single dominant haplotype in 91.78% of the samples, and six more haplotypes deviating a maximum of two mutations from the dominant haplotype restricted to a single country. Variability was extremely low in western Europe but higher in eastern countries. This, together with the lack of phylogeographical structuring, supports the postglacial recolonization of Europe from a single refugium. The Eurasian otter mtDNA control region has a 220-bp variable minisatellite in Domain III that we sequenced in 29 otters. We found a total of 19 minisatellite haplotypes, but they showed no phylogenetic information.
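
    The dominant-haplotype share reported above is a plain frequency tally. As a hedged illustration, the sketch below uses hypothetical haplotype labels and an inferred count: 67 of the 73 sequenced individuals is the split that reproduces the quoted 91.78%, but the abstract does not give the raw counts, and the assignment of one individual to each minor haplotype is an assumption.

```python
from collections import Counter


def haplotype_frequencies(haplotypes):
    """Return each haplotype's relative frequency, most common first."""
    counts = Counter(haplotypes)
    n = len(haplotypes)
    return {hap: count / n for hap, count in counts.most_common()}


# Hypothetical sample: labels H1..H7 and the per-haplotype counts are assumptions.
sample = ["H1"] * 67 + ["H2", "H3", "H4", "H5", "H6", "H7"]
freqs = haplotype_frequencies(sample)
dominant_pct = round(100 * freqs["H1"], 2)
print(f"dominant haplotype: {dominant_pct}% of {len(sample)} individuals")
```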

  13. Systematic Evaluation of the Dependence of Deoxyribozyme Catalysis on Random Region Length

    PubMed Central

    Velez, Tania E.; Singh, Jaydeep; Xiao, Ying; Allen, Emily C.; Wong, On Yi; Chandra, Madhavaiah; Kwon, Sarah C.; Silverman, Scott K.

    2012-01-01

    Functional nucleic acids are DNA and RNA aptamers that bind targets, or they are deoxyribozymes and ribozymes that have catalytic activity. These functional DNA and RNA sequences can be identified from random-sequence pools by in vitro selection, which requires choosing the length of the random region. Shorter random regions allow more complete coverage of sequence space but may not permit the structural complexity necessary for binding or catalysis. In contrast, longer random regions are sampled incompletely but may allow adoption of more complicated structures that enable function. In this study, we systematically examined random region length (N20 through N60) for two particular deoxyribozyme catalytic activities, DNA cleavage and tyrosine-RNA nucleopeptide linkage formation. For both activities, we previously identified deoxyribozymes using only N40 regions. In the case of DNA cleavage, here we found that shorter N20 and N30 regions allowed robust catalytic function, either by DNA hydrolysis or by DNA deglycosylation and strand scission via β-elimination, whereas longer N50 and N60 regions did not lead to catalytically active DNA sequences. Follow-up selections with N20, N30, and N40 regions revealed an interesting interplay of metal ion cofactors and random region length. Separately, for Tyr-RNA linkage formation, N30 and N60 regions provided catalytically active sequences, whereas N20 was unsuccessful, and the N40 deoxyribozymes were functionally superior (in terms of rate and yield) to N30 and N60. Collectively, the results indicate that with future in vitro selection experiments for DNA and RNA catalysts, and by extension for aptamers, random region length should be an important experimental variable. PMID:23088677
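
    The trade-off described above, between complete coverage of sequence space and structural complexity, follows from the 4^N growth in the number of possible sequences for a random region of length N. The sketch below makes that concrete; the pool size of 10^14 molecules is an assumed order of magnitude for illustration, not a value taken from the abstract, and the function names are hypothetical.

```python
def sequence_space(n):
    """Number of distinct DNA sequences of length n (four bases per position)."""
    return 4 ** n


def max_coverage(n, pool_size):
    """Upper bound on the fraction of sequence space a pool can sample.

    Assumes every molecule in the pool is unique, so this is a best case.
    """
    return min(1.0, pool_size / sequence_space(n))


POOL_SIZE = 10 ** 14  # assumed number of molecules in the starting pool

for n in (20, 30, 40, 50, 60):
    print(f"N{n}: space = {sequence_space(n):.3e}, "
          f"max coverage = {max_coverage(n, POOL_SIZE):.3e}")
```

    Under this assumption an N20 pool can in principle contain every possible sequence, while N40 and longer regions are sampled only sparsely, which matches the qualitative point made in the abstract.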

  14. Phylogenetic analysis of phenotypically characterized Cryptococcus laurentii isolates reveals high frequency of cryptic species.

    PubMed

    Ferreira-Paim, Kennio; Ferreira, Thatiana Bragine; Andrade-Silva, Leonardo; Mora, Delio Jose; Springer, Deborah J; Heitman, Joseph; Fonseca, Fernanda Machado; Matos, Dulcilena; Melhem, Márcia Souza Carvalho; Silva-Vergara, Mario León

    2014-01-01

    Although Cryptococcus laurentii has been considered saprophytic and its taxonomy is still being described, several cases of human infections have already been reported. This study aimed to evaluate molecular aspects of C. laurentii isolates from Brazil, Botswana, Canada, and the United States. In this study, 100 phenotypically identified C. laurentii isolates were evaluated by sequencing the 18S nuclear ribosomal small subunit rRNA gene (18S-SSU), D1/D2 region of 28S nuclear ribosomal large subunit rRNA gene (28S-LSU), and the internal transcribed spacer (ITS) of the ribosomal region. BLAST searches using 550-bp, 650-bp, and 550-bp sequenced amplicons obtained from the 18S-SSU, 28S-LSU, and the ITS region led to the identification of 75 C. laurentii strains that shared 99-100% identity with C. laurentii CBS 139. A total of nine isolates shared 99% identity with both Bullera sp. VY-68 and C. laurentii RY1. One isolate shared 99% identity with Cryptococcus rajasthanensis CBS 10406, and eight isolates shared 100% identity with Cryptococcus sp. APSS 862 according to the 28S-LSU and ITS regions and were designated as Cryptococcus aspenensis sp. nov. (CBS 13867). While 16 isolates shared 99% identity with Cryptococcus flavescens CBS 942 according to the 18S-SSU sequence, only six were confirmed using the 28S-LSU and ITS region sequences. The remaining 10 shared 99% identity with Cryptococcus terrestris CBS 10810, which was recently described in Brazil. Through concatenated sequence analyses, seven sequence types in C. laurentii, three in C. flavescens, one in C. terrestris, and one in the C. aspenensis sp. nov. were identified. Sequencing permitted the characterization of 75% of the environmental C. laurentii isolates from different geographical areas and the identification of seven haplotypes of this species. Among sequenced regions, the increased variability of the ITS region in comparison to the 18S-SSU and 28S-LSU regions reinforces its applicability as a DNA barcode.

  15. Sequence and Characterization of the Ig Heavy Chain Constant and Partial Variable Region of the Mouse Strain 129S11

    PubMed Central

    Retter, Ida; Chevillard, Christophe; Scharfe, Maren; Conrad, Ansgar; Hafner, Martin; Im, Tschong-Hun; Ludewig, Monika; Nordsiek, Gabriele; Severitt, Simone; Thies, Stephanie; Mauhar, America; Blöcker, Helmut; Müller, Werner; Riblet, Roy

    2009-01-01

    Although the entire mouse genome has been sequenced, there remain challenges concerning the elucidation of particular complex and polymorphic genomic loci. In the murine Igh locus, different haplotypes exist in different inbred mouse strains. For example, the Ighb haplotype sequence of the Mouse Genome Project strain C57BL/6 differs considerably from the Igha haplotype of BALB/c, which has been widely used in the analyses of Ab responses. We have sequenced and annotated the 3′ half of the Igha locus of 129S1/SvImJ, covering the CH region and approximately half of the VH region. This sequence comprises 128 VH genes, of which 49 are judged to be functional. The comparison of the Igha sequence with the homologous Ighb region from C57BL/6 revealed two major expansions in the germline repertoire of Igha. In addition, we found smaller haplotype-specific differences like the duplication of five VH genes in the Igha locus. We generated a VH allele table by comparing the individual VH genes of both haplotypes. Surprisingly, the number and position of DH genes in the 129S1 strain differs not only from the sequence of C57BL/6 but also from the map published for BALB/c. Taken together, the contiguous genomic sequence of the 3′ part of the Igha locus allows a detailed view of the recent evolution of this highly dynamic locus in the mouse. PMID:17675503

  16. Systematics of Cladophora spp. (Chlorophyta) from North Carolina, USA, based upon morphology and DNA sequence data with a description of Cladophora subtilissima sp. nov.

    PubMed

    Taylor, Robin L; Bailey, Jeffrey Craig; Freshwater, David Wilson

    2017-06-01

    Identification of Cladophora species is challenging due to conservation of gross morphology, few discrete autapomorphies, and environmental influences on morphology. Twelve species of marine Cladophora were reported from North Carolina waters. Cladophora specimens were collected from inshore and offshore marine waters for DNA sequence and morphological analyses. The nuclear-encoded rRNA internal transcribed spacer regions (ITS) were sequenced for 105 specimens and used in molecular assisted identification. The ITS1 and ITS2 region was highly variable, and sequences were sorted into ITS Sets of Alignable Sequences (SASs). Sequencing of short hyper-variable ITS1 sections from Cladophora type specimens was used to positively identify species represented by SASs when the types were made available. Secondary structures for the ITS1 locus were also predicted for each specimen and compared to predicted structures from Cladophora sequences available in GenBank. Nine ITS SASs were identified and representative specimens chosen for phylogenetic analyses of 18S and 28S rRNA gene sequences to reveal relationships with other Cladophora species. Phylogenetic analyses indicated that marine Cladophorales were polyphyletic and separated into two clades, the Cladophora clade and the "Siphonocladales" clade. Morphological analyses were performed to assess the consistency of character states within species, and complement the DNA sequence analyses. These analyses revealed intra- and interspecific character state variation, and that combined molecular and morphological analyses were required for the identification of species. One new report, Cladophora dotyana, and one new species Cladophora subtilissima sp. nov., were revealed, and increased the biodiversity of North Carolina marine Cladophora to 14 species. © 2017 Phycological Society of America.

  17. Array-Based Rational Design of Short Peptide Probe-Derived from an Anti-TNT Monoclonal Antibody.

    PubMed

    Okochi, Mina; Muto, Masaki; Yanai, Kentaro; Tanaka, Masayoshi; Onodera, Takeshi; Wang, Jin; Ueda, Hiroshi; Toko, Kiyoshi

    2017-10-09

    Complementarity-determining regions (CDRs) are sites on the variable chains of antibodies responsible for binding to specific antigens. In this study, a short peptide probe for recognition of 2,4,6-trinitrotoluene (TNT), was identified by testing sequences derived from the CDRs of an anti-TNT monoclonal antibody. The major TNT-binding site in this antibody was identified in the heavy chain CDR3 by antigen docking simulation and confirmed by an immunoassay using a spot-synthesis based peptide array comprising amino acid sequences of six CDRs in the variable region. A peptide derived from heavy chain CDR3 (RGYSSFIYWF) bound to TNT with a dissociation constant of 1.3 μM measured by surface plasmon resonance. Substitution of selected amino acids with basic residues increased TNT binding while substitution with acidic amino acids decreased affinity, an isoleucine to arginine change showed the greatest improvement of 1.8-fold. The ability to create simple peptide binders of volatile organic compounds from sequence information provided by the immune system in the creation of an immune response will be beneficial for sensor developments in the future.

  18. High levels of diversity characterize mandrill (Mandrillus sphinx) Mhc-DRB sequences.

    PubMed

    Abbott, Kristin M; Wickings, E Jean; Knapp, Leslie A

    2006-08-01

    The major histocompatibility complex (MHC) is highly polymorphic in most primate species studied thus far. The rhesus macaque (Macaca mulatta) has been studied extensively and the Mhc-DRB region demonstrates variability similar to humans. The extent of MHC diversity is relatively unknown for other Old World monkeys (OWM), especially among genera other than Macaca. A molecular survey of the Mhc-DRB region in mandrills (Mandrillus sphinx) revealed extensive variability, suggesting that other OWMs may also possess high levels of Mhc-DRB polymorphism. In the present study, 33 Mhc-DRB loci were identified from only 13 animals. Eleven were wild-born and presumed to be unrelated and two were captive-born twins. Two to seven different sequences were identified for each individual, suggesting that some mandrills may have as many as four Mhc-DRB loci on a single haplotype. From these sequences, representatives of at least six Mhc-DRB loci or lineages were identified. As observed in other primates, some new lineages may have arisen through the process of gene conversion. These findings indicate that mandrills have Mhc-DRB diversity not unlike rhesus macaques and humans.

  19. An Integrated Tool to Study MHC Region: Accurate SNV Detection and HLA Genes Typing in Human MHC Region Using Targeted High-Throughput Sequencing

    PubMed Central

    Liu, Xiao; Xu, Yinyin; Liang, Dequan; Gao, Peng; Sun, Yepeng; Gifford, Benjamin; D’Ascenzo, Mark; Liu, Xiaomin; Tellier, Laurent C. A. M.; Yang, Fang; Tong, Xin; Chen, Dan; Zheng, Jing; Li, Weiyang; Richmond, Todd; Xu, Xun; Wang, Jun; Li, Yingrui

    2013-01-01

    The major histocompatibility complex (MHC) is one of the most variable and gene-dense regions of the human genome. Most studies of the MHC, and associated regions, focus on minor variants and HLA typing, many of which have been demonstrated to be associated with human disease susceptibility and metabolic pathways. However, the detection of variants in the MHC region, and diagnostic HLA typing, still lacks a coherent, standardized, cost effective and high coverage protocol of clinical quality and reliability. In this paper, we presented such a method for the accurate detection of minor variants and HLA types in the human MHC region, using high-throughput, high-coverage sequencing of target regions. A probe set was designed to template upon the 8 annotated human MHC haplotypes, and to encompass the 5 megabases (Mb) of the extended MHC region. We deployed our probes upon three, genetically diverse human samples for probe set evaluation, and sequencing data show that ∼97% of the MHC region, and over 99% of the genes in MHC region, are covered with sufficient depth and good evenness. 98% of genotypes called by this capture sequencing prove consistent with established HapMap genotypes. We have concurrently developed a one-step pipeline for calling any HLA type referenced in the IMGT/HLA database from this target capture sequencing data, which shows over 96% typing accuracy when deployed at 4 digital resolution. This cost-effective and highly accurate approach for variant detection and HLA typing in the MHC region may lend further insight into immune-mediated diseases studies, and may find clinical utility in transplantation medicine research. This one-step pipeline is released for general evaluation and use by the scientific community. PMID:23894464

  20. Complete mitochondrial genome of the frillneck lizard (Chlamydosaurus kingii, Reptilia; Agamidae), another squamate with two control regions.

    PubMed

    Ujvari, Beata; Madsen, Thomas

    2008-10-01

    Using PCR, the complete mitochondrial genome was sequenced in three frillneck lizards (Chlamydosaurus kingii). The mitochondrial genome spanned 16,761 bp. As in other vertebrates, two rRNA genes, 22 tRNA genes and 13 protein coding genes were identified. However, similar to some other squamate reptiles, two control regions (CRI and CRII) were identified, spanning 801 and 812 bp, respectively. Our results were compared with another Australian member of the family Agamidae, the bearded dragon (Pogona vitticeps). The overall base composition of the light-strand sequence largely mirrored that observed in P. vitticeps. Furthermore, similar to P. vitticeps, we observed an insertion 801 bp long between the ND5 and ND6 genes. However, in contrast to P. vitticeps we did not observe a conserved sequence block III region. Based on a comparison among the three frillneck lizards, we also present data on the proportion of variable sites within the major mitochondrial regions.

  1. Pyrosequencing the Canine Faecal Microbiota: Breadth and Depth of Biodiversity

    PubMed Central

    Hand, Daniel; Wallis, Corrin; Colyer, Alison; Penn, Charles W.

    2013-01-01

    Mammalian intestinal microbiota remain poorly understood despite decades of interest and investigation by culture-based and other long-established methodologies. Using high-throughput sequencing technology we now report a detailed analysis of canine faecal microbiota. The study group of animals comprised eleven healthy adult miniature Schnauzer dogs of mixed sex and age, some closely related and all housed in kennel and pen accommodation on the same premises with similar feeding and exercise regimes. DNA was extracted from faecal specimens and subjected to PCR amplification of 16S rDNA, followed by sequencing of the 5′ region that included variable regions V1 and V2. Barcoded amplicons were sequenced by Roche-454 FLX high-throughput pyrosequencing. Sequences were assigned to taxa using the Ribosomal Database Project Bayesian classifier and revealed dominance of Fusobacterium and Bacteroidetes phyla. Differences between animals in the proportions of different taxa, among 10,000 reads per animal, were clear and not supportive of the concept of a “core microbiota”. Despite this variability in prominent genera, littermates were shown to have a more similar faecal microbial composition than unrelated dogs. Diversity of the microbiota was also assessed by assignment of sequence reads into operational taxonomic units (OTUs) at the level of 97% sequence identity. The OTU data were then subjected to rarefaction analysis and determination of Chao1 richness estimates. The data indicated that faecal microbiota comprised possibly as many as 500 to 1500 OTUs. PMID:23382835
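
    The richness measures named in this abstract (rarefaction and Chao1) can be illustrated with a minimal sketch. It assumes the input is simply a list of read counts per OTU; the function names are hypothetical, and the bias-corrected Chao1 form and the analytic (hypergeometric) rarefaction formula used here are common textbook choices, not necessarily the ones the authors applied.

```python
import math


def chao1(otu_counts):
    """Bias-corrected Chao1 richness estimate from per-OTU read counts."""
    counts = [c for c in otu_counts if c > 0]
    s_obs = len(counts)                      # observed OTUs
    f1 = sum(1 for c in counts if c == 1)    # singletons
    f2 = sum(1 for c in counts if c == 2)    # doubletons
    return s_obs + f1 * (f1 - 1) / (2 * (f2 + 1))


def rarefied_richness(otu_counts, depth):
    """Expected number of OTUs seen when subsampling `depth` reads without replacement."""
    counts = [c for c in otu_counts if c > 0]
    total = sum(counts)
    if not 0 <= depth <= total:
        raise ValueError("depth must lie between 0 and the total read count")
    denom = math.comb(total, depth)
    # For each OTU: 1 minus the probability that none of its reads is drawn.
    return sum(1 - math.comb(total - c, depth) / denom for c in counts)


if __name__ == "__main__":
    toy_counts = [10, 5, 1, 1, 1, 2, 2]      # invented counts, not study data
    print(chao1(toy_counts))
    print(rarefied_richness(toy_counts, 10))
```

    Evaluating rarefied_richness over a range of depths gives a rarefaction curve; a curve that is still rising at the sampled depth suggests that more OTUs remain undetected, which is the situation Chao1 tries to quantify.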

  2. Phylogenetic utility, and variability in structure and content, of complete mitochondrial genomes among genetic lineages of the Hawaiian anchialine shrimp Halocaridina rubra Holthuis 1963 (Atyidae:Decapoda).

    PubMed

    Justice, Joshua L; Weese, David A; Santos, Scott Ross

    2016-07-01

    The Atyidae are caridean shrimp possessing hair-like setae on their claws and are important contributors to ecological services in tropical and temperate fresh and brackish water ecosystems. Complete mitochondrial genomes have only been reported from five of the 449 species in the family, thus limiting understanding of mitochondrial genome evolution and the phylogenetic utility of complete mitochondrial sequences in the Atyidae. Here, comparative analyses of complete mitochondrial genomes from eight genetic lineages of Halocaridina rubra, an atyid endemic to the anchialine ecosystem of the Hawaiian Archipelago, are presented. Although gene number, order, and orientation were syntenic among genomes, three regions were identified and further quantified where conservation was substantially lower: (1) high length and sequence variability in the tRNA-Lys and tRNA-Asp intergenic region; (2) a 317-bp insertion between the NAD6 and CytB genes confined to a single lineage and representing a partial duplication of CytB; and (3) the putative control region. Phylogenetic analyses utilizing complete mitochondrial sequences provided new insights into relationships among the H. rubra genetic lineages, with the topology of one clade correlating to the geologic sequence of the islands. However, deeper nodes in the phylogeny lacked bootstrap support. Overall, our results from H. rubra suggest intra-specific mitochondrial genomic diversity could be underestimated across the Metazoa since the vast majority of complete genomes are from just a single individual of a species.

  3. Genetic structure and evolution of natural populations of viruses causing the tomato yellow leaf curl disease in Spain.

    PubMed

    Font, María Isabel; Rubio, Luis; Martínez-Culebras, Pedro Vicente; Jordá, Concepción

    2007-09-01

    The population structure and genetic variation of two begomoviruses: tomato yellow leaf curl Sardinia virus (TYLCSV) and tomato yellow leaf curl virus (TYLCV) in tomato crops of Spain were studied from 1997 until 2001. Restriction digestion of a genomic region comprised of the CP coat protein gene (CPR) of 358 TYLC virus isolates enabled us to classify them into 14 haplotypes. Nucleotide sequences of two genomic regions: CPR, and the surrounding intergenic region (SIR) were determined for at least two isolates per haplotype. SIR was more variable than CPR and showed multiple recombination events whereas no recombination was detected within CPR. In all geographic regions except Murcia, the population was, or evolved to be composed of one predominant haplotype with a low genetic diversity (<0.0180). In Murcia, two successive changes of the predominant haplotype were observed in the best studied population. Phylogenetic analysis showed that the TYLCSV sequences determined clustered with sequences obtained from the GenBank of other TYLCSV Spanish isolates which were clearly separated from TYLCSV Italian isolates. Most of our TYLCV sequences were similar to those of isolates from Japan and Portugal, and the sequences obtained from TYLCV isolates from the Canary island of Lanzarote were similar to those of Caribbean TYLCV isolates.

  4. Genetic Relatedness among Hepatitis A Virus Strains Associated with Food-Borne Outbreaks

    PubMed Central

    Vaughan, Gilberto; Xia, Guoliang; Forbi, Joseph C.; Purdy, Michael A.; Rossi, Lívia Maria Gonçalves; Spradling, Philip R.; Khudyakov, Yury E.

    2013-01-01

    The genetic characterization of hepatitis A virus (HAV) strains is commonly accomplished by sequencing subgenomic regions, such as the VP1/P2B junction. The HAV genome is not extensively variable, thus presenting opportunity for sharing sequences of subgenomic regions among genetically unrelated isolates. The degree of misrepresentation of phylogenetic relationships by subgenomic regions is especially important for tracking transmissions. Here, we analyzed whole-genome (WG) sequences of 101 HAV strains identified from 4 major multi-state, food-borne outbreaks of hepatitis A in the United States and from 14 non-outbreak-related HAV strains that shared identical VP1/P2B sequences with the outbreak strains. Although HAV strains with an identical VP1/P2B sequence were specific to each outbreak, WG sequences were different, with genetic diversity reaching 0.31% (mean 0.09%). Evaluation of different subgenomic regions did not identify any other section of the HAV genome that could accurately represent phylogenetic relationships observed using WG sequences. The identification of 2–3 dominant HAV strains in 3 out of 4 outbreaks indicates contamination of the implicated food items with a heterogeneous HAV population. However, analysis of intra-host HAV variants from eight patients involved in one outbreak showed that only a single sequence variant established infection in each patient. Four non-outbreak strains were found closely related to strains from 2 outbreaks, whereas ten were genetically different from the outbreak strains. Thus, accurate tracking of HAV strains can be accomplished using HAV WG sequences, while short subgenomic regions are useful for identification of transmissions only among cases with known epidemiological association. PMID:24223112
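
    A diversity figure of the kind quoted above (a maximum and a mean percentage) can be sketched as the uncorrected pairwise distance between aligned genomes. This is an illustrative assumption: the abstract does not state which distance measure was used, and the function names and toy sequences below are hypothetical.

```python
from itertools import combinations


def p_distance(a, b):
    """Percentage of aligned positions at which two sequences differ."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    differences = sum(x != y for x, y in zip(a, b))
    return 100.0 * differences / len(a)


def diversity_summary(seqs):
    """Return (maximum, mean) pairwise p-distance over all sequence pairs."""
    distances = [p_distance(a, b) for a, b in combinations(seqs, 2)]
    if not distances:
        raise ValueError("need at least two sequences")
    return max(distances), sum(distances) / len(distances)


if __name__ == "__main__":
    toy = ["AAAA", "AAAT", "AATT"]  # invented sequences, not HAV data
    print(diversity_summary(toy))
```

    Running the same summary on a short subgenomic slice and on the full alignment shows how strains that look identical over the slice can still differ genome-wide.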

  5. Genetic differences between blood- and brain-derived viral sequences from human immunodeficiency virus type 1-infected patients: evidence of conserved elements in the V3 region of the envelope protein of brain-derived sequences.

    PubMed Central

    Korber, B T; Kunstman, K J; Patterson, B K; Furtado, M; McEvilly, M M; Levy, R; Wolinsky, S M

    1994-01-01

    Human immunodeficiency virus type 1 (HIV-1) sequences were generated from blood and from brain tissue obtained by stereotactic biopsy from six patients undergoing a diagnostic neurosurgical procedure. Proviral DNA was directly amplified by nested PCR, and 8 to 36 clones from each sample were sequenced. Phylogenetic analysis of intrapatient envelope V3-V5 region HIV-1 DNA sequence sets revealed that brain viral sequences were clustered relative to the blood viral sequences, suggestive of tissue-specific compartmentalization of the virus in four of the six cases. In the other two cases, the blood and brain virus sequences were intermingled in the phylogenetic analyses, suggesting trafficking of virus between the two tissues. Slide-based PCR-driven in situ hybridization of two of the patients' brain biopsy samples confirmed our interpretation of the intrapatient phylogenetic analyses. Interpatient V3 region brain-derived sequence distances were significantly less than blood-derived sequence distances. Relative to the tip of the loop, the set of brain-derived viral sequences had a tendency towards negative or neutral charge compared with the set of blood-derived viral sequences. Entropy calculations were used as a measure of the variability at each position in alignments of blood and brain viral sequences. A relatively conserved set of positions was found, with a significantly lower entropy in the brain- than in the blood-derived viral sequences. These sites constitute a brain "signature pattern," or a noncontiguous set of amino acids in the V3 region conserved in viral sequences derived from brain tissue. This brain-derived signature pattern was also well preserved among isolates previously characterized in vitro as macrophage tropic. Macrophage-monocyte tropism may be the biological constraint that results in the conservation of the viral brain signature pattern. PMID:7933130
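
    The per-position entropy measure mentioned above can be sketched as Shannon entropy computed column by column over an alignment. This is a minimal illustration under stated assumptions: the sequences are already aligned to equal length, gap characters are counted as an ordinary symbol, and the function names and toy peptides are hypothetical rather than taken from the study.

```python
import math
from collections import Counter


def column_entropy(column):
    """Shannon entropy (bits) of the symbols observed in one alignment column."""
    counts = Counter(column)
    total = len(column)
    return sum(-(n / total) * math.log2(n / total) for n in counts.values())


def alignment_entropy(seqs):
    """Entropy at every position of a set of equal-length aligned sequences."""
    if len({len(s) for s in seqs}) != 1:
        raise ValueError("all sequences must have the same aligned length")
    return [column_entropy(col) for col in zip(*seqs)]


if __name__ == "__main__":
    brain = ["GPGR", "GPGR", "GPGR", "GPGK"]   # invented toy alignments
    blood = ["GPGR", "GLGK", "GPGQ", "APGR"]
    print(alignment_entropy(brain))
    print(alignment_entropy(blood))
```

    A position with entropy 0 is fully conserved in that sequence set; comparing the two per-position profiles highlights sites that are conserved in one set but variable in the other.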

  6. The Mitochondrial Cytochrome Oxidase Subunit I Gene Occurs on a Minichromosome with Extensive Heteroplasmy in Two Species of Chewing Lice, Geomydoecus aurei and Thomomydoecus minor

    PubMed Central

    Pietan, Lucas L.; Spradling, Theresa A.

    2016-01-01

    In animals, mitochondrial DNA (mtDNA) typically occurs as a single circular chromosome with 13 protein-coding genes and 22 tRNA genes. The various species of lice examined previously, however, have shown mitochondrial genome rearrangements with a range of chromosome sizes and numbers. Our research demonstrates that the mitochondrial genomes of two species of chewing lice found on pocket gophers, Geomydoecus aurei and Thomomydoecus minor, are fragmented with the 1,536 base-pair (bp) cytochrome-oxidase subunit I (cox1) gene occurring as the only protein-coding gene on a 1,916–1,964 bp minicircular chromosome in the two species, respectively. The cox1 gene of T. minor begins with an atypical start codon, while that of G. aurei does not. Components of the non-protein coding sequence of G. aurei and T. minor include a tRNA (isoleucine) gene, inverted repeat sequences consistent with origins of replication, and an additional non-coding region that is smaller than the non-coding sequence of other lice with such fragmented mitochondrial genomes. Sequences of cox1 minichromosome clones for each species reveal extensive length and sequence heteroplasmy in both coding and noncoding regions. The highly variable non-gene regions of G. aurei and T. minor have little sequence similarity with one another except for a 19-bp region of phylogenetically conserved sequence with unknown function. PMID:27589589

  7. Mouse Vk gene classification by nucleic acid sequence similarity.

    PubMed

    Strohal, R; Helmberg, A; Kroemer, G; Kofler, R

    1989-01-01

    Analyses of immunoglobulin (Ig) variable (V) region gene usage in the immune response, estimates of V gene germline complexity, and other nucleic acid hybridization-based studies depend on the extent to which such genes are related (i.e., sequence similarity) and their organization in gene families. While mouse Igh heavy chain V region (VH) gene families are relatively well-established, a corresponding systematic classification of Igk light chain V region (Vk) genes has not been reported. The present analysis, in the course of which we reviewed the known extent of the Vk germline gene repertoire and Vk gene usage in a variety of responses to foreign and self antigens, provides a classification of mouse Vk genes in gene families composed of members with greater than 80% overall nucleic acid sequence similarity. This classification differed in several aspects from that of VH genes: only some Vk gene families were as clearly separated (by greater than 25% sequence dissimilarity) as typical VH gene families; most Vk gene families were closely related and, in several instances, members from different families were very similar (greater than 80%) over large sequence portions; frequently, classification by nucleic acid sequence similarity diverged from existing classifications based on amino-terminal protein sequence similarity. Our data have implications for Vk gene analyses by nucleic acid hybridization and describe potentially important differences in sequence organization between VH and Vk genes.
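
    Grouping genes into families by a similarity threshold, as described above, can be sketched with single-linkage clustering on percent identity. The sketch assumes pre-aligned sequences of equal length and links any two genes whose identity exceeds the threshold; the abstract does not specify the clustering rule, so single linkage, the function names and the toy gene labels are all illustrative assumptions.

```python
def percent_identity(a, b):
    """Percentage of identical positions between two aligned sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    return 100.0 * sum(x == y for x, y in zip(a, b)) / len(a)


def group_families(seqs, threshold=80.0):
    """Single-linkage grouping: genes joined if identity is above the threshold."""
    names = list(seqs)
    parent = {name: name for name in names}

    def find(name):
        # Follow parent links to the family representative, compressing the path.
        while parent[name] != name:
            parent[name] = parent[parent[name]]
            name = parent[name]
        return name

    for i, a in enumerate(names):
        for b in names[i + 1:]:
            if percent_identity(seqs[a], seqs[b]) > threshold:
                parent[find(a)] = find(b)

    families = {}
    for name in names:
        families.setdefault(find(name), []).append(name)
    return sorted(families.values())


if __name__ == "__main__":
    toy = {                      # hypothetical labels and sequences
        "Vk_a": "ACGTACGTAC",
        "Vk_b": "ACGTACGTAA",
        "Vk_c": "TTTTTTTTTT",
    }
    print(group_families(toy))
```

    Single linkage can chain together families whose members are similar only through intermediates, which mirrors the abstract's observation that some Vk families are not cleanly separated.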

  8. Conformational divergence in the HA-33/HA-17 trimer of serotype C and D botulinum toxin complex.

    PubMed

    Sagane, Yoshimasa; Hayashi, Shintaro; Akiyama, Tomonori; Matsumoto, Takashi; Hasegawa, Kimiko; Yamano, Akihito; Suzuki, Tomonori; Niwa, Koichi; Watanabe, Toshihiro; Yajima, Shunsuke

    2016-08-05

    Clostridium botulinum produces a large toxin complex (L-TC) comprising botulinum neurotoxin associated with auxiliary nontoxic proteins. A complex of 33- and 17-kDa hemagglutinins (an HA-33/HA-17 trimer) enhances L-TC transport across the intestinal epithelial cell layer via binding HA-33 to a sugar on the cell surface. At least two subtypes of serotype C/D HA-33 exhibit differing preferences for the sugars sialic acid and galactose. Here, we compared the three-dimensional structures of the galactose-binding HA-33 and HA-33/HA-17 trimers produced by the C-Yoichi strain. Comparisons of serotype C/D HA-33 sequences reveal a variable region with relatively low sequence similarity across the C. botulinum strains; the variability of this region may influence the manner of sugar-recognition by HA-33. Crystal structures of sialic acid- and galactose-binding HA-33 are broadly similar in appearance. However, small-angle X-ray scattering revealed distinct solution structures for HA-33/HA-17 trimers. A structural change in the C-terminal variable region of HA-33 might cause a dramatic shift in the conformation and sugar-recognition mode of HA-33/HA-17 trimer. Copyright © 2016 Elsevier Inc. All rights reserved.

  9. Spatiotemporal Phylogenetic Analysis and Molecular Characterisation of Infectious Bursal Disease Viruses Based on the VP2 Hyper-Variable Region

    PubMed Central

    Dolz, Roser; Valle, Rosa; Perera, Carmen L.; Bertran, Kateri; Frías, Maria T.; Majó, Natàlia; Ganges, Llilianne; Pérez, Lester J.

    2013-01-01

    Background: Infectious bursal disease is a highly contagious and acute viral disease caused by the infectious bursal disease virus (IBDV); it affects all major poultry producing areas of the world. The current study was designed to rigorously measure the global phylogeographic dynamics of IBDV strains to gain insight into viral population expansion as well as the emergence, spread and pattern of the geographical structure of very virulent IBDV (vvIBDV) strains. Methodology/Principal Findings: Sequences of the hyper-variable region of the VP2 (HVR-VP2) gene from IBDV strains isolated from diverse geographic locations were obtained from the GenBank database; Cuban sequences were obtained in the current work. All sequences were analysed by Bayesian phylogeographic analysis, implemented in the Bayesian Evolutionary Analysis Sampling Trees (BEAST), Bayesian Tip-association Significance testing (BaTS) and Spatial Phylogenetic Reconstruction of Evolutionary Dynamics (SPREAD) software packages. Selection pressure on the HVR-VP2 was also assessed. The phylogeographic association-trait analysis showed that viruses sampled from individual countries tend to cluster together, suggesting a geographic pattern for IBDV strains. Spatial analysis from this study revealed that strains carrying sequences that were linked to increased virulence of IBDV appeared in Iran in 1981 and spread to Western Europe (Belgium) in 1987, Africa (Egypt) around 1990, East Asia (China and Japan) in 1993, the Caribbean Region (Cuba) by 1995 and South America (Brazil) around 2000. Selection pressure analysis showed that several codons in the HVR-VP2 region were under purifying selection. Conclusions/Significance: To our knowledge, this work is the first study applying the Bayesian phylogeographic reconstruction approach to analyse the emergence and spread of vvIBDV strains worldwide. PMID:23805195

  10. Spatiotemporal Phylogenetic Analysis and Molecular Characterisation of Infectious Bursal Disease Viruses Based on the VP2 Hyper-Variable Region.

    PubMed

    Alfonso-Morales, Abdulahi; Martínez-Pérez, Orlando; Dolz, Roser; Valle, Rosa; Perera, Carmen L; Bertran, Kateri; Frías, Maria T; Majó, Natàlia; Ganges, Llilianne; Pérez, Lester J

    2013-01-01

    Infectious bursal disease is a highly contagious and acute viral disease caused by the infectious bursal disease virus (IBDV); it affects all major poultry producing areas of the world. The current study was designed to rigorously measure the global phylogeographic dynamics of IBDV strains to gain insight into viral population expansion as well as the emergence, spread and pattern of the geographical structure of very virulent IBDV (vvIBDV) strains. Sequences of the hyper-variable region of the VP2 (HVR-VP2) gene from IBDV strains isolated from diverse geographic locations were obtained from the GenBank database; Cuban sequences were obtained in the current work. All sequences were analysed by Bayesian phylogeographic analysis, implemented in the Bayesian Evolutionary Analysis Sampling Trees (BEAST), Bayesian Tip-association Significance testing (BaTS) and Spatial Phylogenetic Reconstruction of Evolutionary Dynamics (SPREAD) software packages. Selection pressure on the HVR-VP2 was also assessed. The phylogeographic association-trait analysis showed that viruses sampled from individual countries tend to cluster together, suggesting a geographic pattern for IBDV strains. Spatial analysis from this study revealed that strains carrying sequences that were linked to increased virulence of IBDV appeared in Iran in 1981 and spread to Western Europe (Belgium) in 1987, Africa (Egypt) around 1990, East Asia (China and Japan) in 1993, the Caribbean Region (Cuba) by 1995 and South America (Brazil) around 2000. Selection pressure analysis showed that several codons in the HVR-VP2 region were under purifying selection. To our knowledge, this work is the first study applying the Bayesian phylogeographic reconstruction approach to analyse the emergence and spread of vvIBDV strains worldwide.

  11. Genetic analysis of human immunodeficiency virus type 1 envelope V3 region isolates from mothers and infants after perinatal transmission.

    PubMed Central

    Ahmad, N; Baroudy, B M; Baker, R C; Chappey, C

    1995-01-01

    The human immunodeficiency virus type 1 (HIV-1) sequences from variable region 3 (V3) of the envelope gene were analyzed from seven infected mother-infant pairs following perinatal transmission. The V3 region sequences directly derived from the DNA of the uncultured peripheral blood mononuclear cells from infected mothers displayed a heterogeneous population. In contrast, the infants' sequences were less diverse than those of their mothers. In addition, the sequences from the younger infants' peripheral blood mononuclear cell DNA were more homogeneous than the older infants' sequences. All infants' sequences were different but displayed patterns similar to those seen in their mothers. In the mother-infant pair sequences analyzed, a minor genotype or subtype found in the mothers predominated in their infants. The conserved N-linked glycosylation site proximal to the first cysteine of the V3 loop was absent only in one infant's sequence set and in some variants of two other infants' sequences. Furthermore, the HIV-1 sequences of the epidemiologically linked mother-infant pairs were closer than the sequences of epidemiologically unlinked individuals, suggesting that the sequence comparison of mother-infant pairs done in order to identify genetic variants transmitted from mother to infant could be performed even in older infants. There was no evidence for transmission of a major genotype or multiple genotypes from mother to infant. In conclusion, a minor genotype of maternal virus is transmitted to the infants, and this finding could be useful in developing strategies to prevent maternal transmission of HIV-1 by means of perinatal interventions. PMID:7815476

  12. Effect of sequence-dependent rigidity on plectoneme localization in dsDNA

    NASA Astrophysics Data System (ADS)

    Medalion, Shlomi; Rabin, Yitzhak

    2016-04-01

    We use Monte-Carlo simulations to study the effect of variable rigidity on plectoneme formation and localization in supercoiled double-stranded DNA. We show that the presence of soft sequences increases the number of plectoneme branches and that the edges of the branches tend to be localized at these sequences. We propose an experimental approach to test our results in vitro, and discuss the possible role played by plectoneme localization in the search process of transcription factors for their targets (promoter regions) on the bacterial genome.

  13. Genetic variations in regions of bovine and bovine-like enteroviral 5'UTR from cattle, Indian bison and goat feces.

    PubMed

    Kosoltanapiwat, Nathamon; Yindee, Marnoch; Chavez, Irwin Fernandez; Leaungwutiwong, Pornsawan; Adisakwattana, Poom; Singhasivanon, Pratap; Thawornkuno, Charin; Thippornchai, Narin; Rungruengkitkun, Amporn; Soontorn, Juthamas; Pearsiriwuttipong, Sasipan

    2016-01-25

    Bovine enteroviruses (BEV) are members of the genus Enterovirus in the family Picornaviridae. They are predominantly isolated from cattle feces, but also are detected in feces of other animals, including goats and deer. These viruses are found in apparently healthy animals, as well as in animals with clinical signs and several studies reported recently suggest a potential role of BEV in causing disease in animals. In this study, we surveyed the presence of BEV in domestic and wild animals in Thailand, and assessed their genetic variability. Viral RNA was extracted from fecal samples of cattle, domestic goats, Indian bison (gaurs), and deer. The 5' untranslated region (5'UTR) was amplified by nested reverse transcription-polymerase chain reaction (RT-PCR) with primers specific to BEV 5'UTR. PCR products were sequenced and analyzed phylogenetically using the neighbor-joining algorithm to observe genetic variations in regions of the bovine and bovine-like enteroviral 5'UTR found in this study. BEV and BEV-like sequences were detected in the fecal samples of cattle (40/60, 67 %), gaurs (3/30, 10 %), and goats (11/46, 24 %). Phylogenetic analyses of the partial 5'UTR sequences indicated that different BEV variants (both EV-E and EV-F species) co-circulated in the domestic cattle, whereas the sequences from gaurs and goats clustered according to the animal species, suggesting that these viruses are host species-specific. Varieties of BEV and BEV-like 5'UTR sequences were detected in fecal samples from both domestic and wild animals. To our knowledge, this is the first report of the genetic variability of BEV in Thailand.

  14. Genetic diversity based on 28S rDNA sequences among populations of Culex quinquefasciatus collected at different locations in Tamil Nadu, India.

    PubMed

    Sakthivelkumar, S; Ramaraj, P; Veeramani, V; Janarthanan, S

    2015-09-01

    The aim of the present study was to detect any genetic variability among populations of Culex quinquefasciatus, which would be a valuable tool in the management of mosquito control programmes. In the present study, populations of Cx. quinquefasciatus collected at different locations in Tamil Nadu were analyzed for their genetic variation based on 28S rDNA D2 region nucleotide sequences. A high degree of genetic polymorphism was detected in the sequences of the D2 region of 28S rDNA on the predicted secondary structures in spite of high nucleotide sequence similarity. The findings based on secondary structure using rDNA sequences suggested the existence of a complex genotypic diversity of the Cx. quinquefasciatus populations collected at different locations of Tamil Nadu, India. This complexity in genetic diversity in a single mosquito species collected at different locations is considered an important issue with regard to the influence and nature of the vector potential of these mosquitoes.

  15. Genetic diversity of the captive Asian tapir population in Thailand, based on mitochondrial control region sequence data and the comparison of its nucleotide structure with Brazilian tapir.

    PubMed

    Muangkram, Yuttamol; Amano, Akira; Wajjwalku, Worawidh; Pinyopummintr, Tanu; Thongtip, Nikorn; Kaolim, Nongnid; Sukmak, Manakorn; Kamolnorranath, Sumate; Siriaroonrat, Boripat; Tipkantha, Wanlaya; Maikaew, Umaporn; Thomas, Warisara; Polsrila, Kanda; Dongsaard, Kwanreaun; Sanannu, Saowaphang; Wattananorrasate, Anuwat

    2017-07-01

    The Asian tapir (Tapirus indicus) has been classified as Endangered on the IUCN Red List of Threatened Species (2008). Genetic diversity data provide important information for the management of captive breeding and conservation of this species. We analyzed mitochondrial control region (CR) sequences from 37 captive Asian tapirs in Thailand. Multiple alignments of the full-length CR sequences, 1268 bp in size, comprised three domains as described in other mammal species. Analysis of 16 parsimony-informative variable sites revealed 11 haplotypes. Furthermore, the phylogenetic analysis using a median-joining network clearly showed three clades correlated with our earlier cytochrome b gene study in this endangered species. The repetitive motif is located between the first and second conserved sequence blocks, similar to the Brazilian tapir. The most polymorphic site was located in the extended termination associated sequences domain. The results could be applied to future genetic management, both in captivity and in the wild, aimed at maintaining stable populations.
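
    Counting parsimony-informative sites and haplotypes, as reported above, can be sketched as follows. The sketch uses the standard definition of a parsimony-informative site (at least two different states, each present in at least two sequences) and treats every distinct full-length sequence as a haplotype. It assumes aligned sequences, ignores any symbol other than A, C, G or T when classifying sites, and uses hypothetical function names and toy data.

```python
from collections import Counter


def parsimony_informative_sites(seqs):
    """Return 0-based indices of parsimony-informative alignment columns."""
    if len({len(s) for s in seqs}) != 1:
        raise ValueError("all sequences must have the same aligned length")
    sites = []
    for index, column in enumerate(zip(*seqs)):
        counts = Counter(base for base in column if base in "ACGT")
        # Informative: two or more states that each occur in two or more sequences.
        if sum(1 for n in counts.values() if n >= 2) >= 2:
            sites.append(index)
    return sites


def haplotype_counts(seqs):
    """Map each distinct sequence (haplotype) to the number of individuals carrying it."""
    return Counter(seqs)


if __name__ == "__main__":
    toy = ["ACGT", "ACGA", "TCGA", "TCTT", "ACGT"]  # invented sequences
    print(parsimony_informative_sites(toy))
    print(len(haplotype_counts(toy)))
```

    A column where a variant appears in only one individual is variable but not parsimony-informative, which is why the two counts in an abstract like this one can differ.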

  16. Assessing the intra-species genetic variability in the clonal pathogen Campylobacter fetus: CRISPRs are highly polymorphic DNA markers.

    PubMed

    Calleros, Lucía; Betancor, Laura; Iraola, Gregorio; Méndez, Alejandra; Morsella, Claudia; Paolicchi, Fernando; Silveyra, Silvia; Velilla, Alejandra; Pérez, Ruben

    2017-01-01

    Campylobacter fetus is a Gram-negative, microaerophilic bacterium that infects animals and humans. The subspecies Campylobacter fetus subsp. fetus (Cff) affects a broad range of vertebrate hosts and induces abortion in cows and sheep. Campylobacter fetus subsp. venerealis (Cfv) is restricted to cattle and causes the endemic disease bovine genital campylobacteriosis, which triggers reproductive problems and is responsible for major economic losses. Campylobacter fetus subsp. testudinum (Cft) has been isolated mostly from apparently healthy reptiles belonging to different species but also from ill snakes and humans. Genotypic differentiation of Cff and Cfv is difficult, and epidemiological information is scarce because there are few methods to study the genetic diversity of the strains. We analyze the efficacy of MLST, ribosomal sequences (23S gene and internal spacer region), and CRISPRs to assess the genetic variability of C. fetus in bovine and human isolates. Sequences retrieved from complete genomes were included in the analysis for comparative purposes. MLST and ribosomal sequences had scarce or null variability, while the CRISPR-cas system structure and the sequence of CRISPR1 locus showed remarkable diversity. None of the sequences here analyzed provided evidence of a genetic differentiation of Cff and Cfv in bovine isolates. Comparison of bovine and human isolates with Cft strains showed a striking divergence. Inter-host differences raise the possibility of determining the original host of human infections using CRISPR sequences. CRISPRs are the most variable sequences analyzed in C. fetus so far, and constitute excellent representatives of a dynamic fraction of the genome. CRISPR typing is a promising tool to characterize isolates and to track the source and transmission route of C. fetus infections. Copyright © 2016 Elsevier B.V. All rights reserved.

  17. Quantitation of base substitutions in eukaryotic 5S rRNA: selection for the maintenance of RNA secondary structure.

    PubMed

    Curtiss, W C; Vournakis, J N

    1984-01-01

    Eukaryotic 5S rRNA sequences from 34 diverse species were compared by the following method: (1) The sequences were aligned; (2) the positions of substitutions were located by comparison of all possible pairs of sequences; (3) the substitution sites were mapped to an assumed general base pairing model; and (4) the R-Y model of base stacking was used to study stacking pattern relationships in the structure. An analysis of the sequence and structure variability in each region of the molecule is presented. It was found that the degree of base substitution varies over a wide range, from absolute conservation to occurrence of over 90% of the possible observable substitutions. The substitutions are located primarily in stem regions of the 5S rRNA secondary structure. More than 88% of the substitutions in helical regions maintain base pairing. The disruptive substitutions are primarily located at the edges of helical regions, resulting in shortening of the helical regions and lengthening of the adjacent nonpaired regions. Base stacking patterns determined by the R-Y model are mapped onto the general secondary structure. Intrastrand and interstrand stacking could stabilize alternative coaxial structures and limit the conformational flexibility of nonpaired regions. Two short contiguous regions are 100% conserved in all species. This may reflect evolutionary constraints imposed at the DNA level by the requirement for binding of a 5S gene transcription initiation factor during gene expression.
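
    Step (3) of the method above, mapping substitutions onto a base-pairing model and asking whether pairing is maintained, can be sketched for a single pair of aligned sequences. This is an illustrative assumption-laden example: the secondary structure is given as a list of paired 0-based positions, G-U wobble pairs are counted as pairing, and the function name and toy hairpin are hypothetical.

```python
# Watson-Crick pairs plus the G-U wobble pair (counting wobble is an assumption).
PAIRS = {("A", "U"), ("U", "A"), ("G", "C"), ("C", "G"), ("G", "U"), ("U", "G")}


def pairing_maintained(reference, other, base_pairs):
    """Count helical base pairs that changed and how many of those still pair.

    Returns (changed, kept): `changed` is the number of pairs in which `other`
    differs from `reference` at either position; `kept` is how many of those
    changed pairs can still form a base pair in `other`.
    """
    if len(reference) != len(other):
        raise ValueError("sequences must be aligned to the same length")
    changed = kept = 0
    for i, j in base_pairs:
        if reference[i] != other[i] or reference[j] != other[j]:
            changed += 1
            if (other[i], other[j]) in PAIRS:
                kept += 1
    return changed, kept


if __name__ == "__main__":
    stem = [(0, 6), (1, 5)]               # toy hairpin: positions 0-6 and 1-5 pair
    print(pairing_maintained("GGAAACC", "GAAAAUC", stem))  # compensatory change
    print(pairing_maintained("GGAAACC", "GGAAAAC", stem))  # disruptive change
```

    Summing these counts over all sequence pairs and dividing kept by changed gives a fraction comparable in spirit to the share of helical substitutions that maintain base pairing reported in the abstract.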

  18. Study on the association between drug‑resistance and gene mutations of the active efflux pump acrAB‑tolC gene and its regulatory genes.

    PubMed

    Ma, Quan-Ping; Su, Liang; Liu, Jing-Wen; Yao, Ming-Xiao; Yuan, Guang-Ying

    2018-06-01

    The aim of the present study was to investigate the correlation between the multi‑drug resistance of Shigella flexneri and the drug‑resistant gene cassette carried by integrons, and to detect the associations between drug‑resistance and gene mutations of the active efflux pump acrAB‑tolC gene and its regulatory genes, including marOR, acrR and soxS. A total of 158 isolates were obtained from the stool samples of 1,026 children with diarrhoea aged 14 years old between May 2012 and October 2015 in Henan. The K‑B method was applied for the determination of drug resistance of Shigella flexneri, and polymerase chain reaction amplification was used for class 1, 2 and 3 integrase genes. Enzyme digestion and sequence analysis were performed for the variable regions of positive strains. Based on the drug sensitivity assessment, multi‑drug resistant strains that were resistant to five or more antibiotics, and sensitive strains were selected for amplification. Their active efflux pump genes, acrA and acrB, and regulatory genes, marOR, acrR and soxS, were selected for sequencing. The results revealed that 91.1% of the 158 strains were multi‑resistant to ampicillin, chloramphenicol, tetracycline and streptomycin, and 69.6% of the strains were multi‑resistant to sulfamethoxazole/trimethoprim. The resistance to ceftazidime, ciprofloxacin and levofloxacin was <32.9%. All strains (100%) were sensitive to cefoxitin, cefoperazone/sulbactam and imipenem. The rate of the class 1 integron positivity was 91.9% (144/158). Among these class 1 integron‑positive strains, 18 strains exhibited the resistance gene cassette dfrV in the variable region of the strain, four strains exhibited dfrA17‑aadA5 in the variable region and 140 strains exhibited blaOXA‑30‑aadA1 in the variable region. Four strains showed no resistance gene in the variable regions. The rate of class 2 integron positivity was 86.1% (136/158), and all positive strains harboured the dfrA1‑sat1‑aadA resistance gene cassette in the variable region. The class 3 integrase gene was not detected in these strains. The gene sequencing showed the deletion of the bases CATT at sites 36, 37, 38 and 39 in the marOR gene, which is a regulatory gene of the active efflux pump, AcrAB‑TolC. Taken together, the multi‑drug resistance of Shigella flexneri was closely associated with gene mutations of class 1 and 2 integrons and the marOR gene.

  19. A comprehensive analysis of three Asiatic black bear mitochondrial genomes (subspecies ussuricus, formosanus and mupinensis), with emphasis on the complete mtDNA sequence of Ursus thibetanus ussuricus (Ursidae).

    PubMed

    Hwang, Dae-Sik; Ki, Jang-Seu; Jeong, Dong-Hyuk; Kim, Bo-Hyun; Lee, Bae-Keun; Han, Sang-Hoon; Lee, Jae-Seong

    2008-08-01

    In the present paper, we describe the mitochondrial genome sequence of the Asiatic black bear (Ursus thibetanus ussuricus), with particular emphasis on the control region (CR), and compare it with other bear mitochondrial genomes to examine molecular relationships among the bears. The mitochondrial genome sequence of U. thibetanus ussuricus was 16,700 bp in size with mostly conserved structures (e.g. 13 protein-coding genes, two rRNA genes, 22 tRNA genes). The CR consisted of several typical conserved domains such as F, E, D, and C boxes, and a conserved sequence block. Nucleotide sequences and the repeated motifs in the CR were different among the bear species, and their copy numbers were also variable according to populations, even within F1 generations of U. thibetanus ussuricus. Comparative analyses showed that the CR D1 region was highly informative for the discrimination of the bear family. These findings suggest that nucleotide sequences of both repeated motifs and CR D1 in the bear family are good markers for species discrimination.

  20. Mechanisms of haplotype divergence at the RGA08 nucleotide-binding leucine-rich repeat gene locus in wild banana (Musa balbisiana).

    PubMed

    Baurens, Franc-Christophe; Bocs, Stéphanie; Rouard, Mathieu; Matsumoto, Takashi; Miller, Robert N G; Rodier-Goud, Marguerite; MBéguié-A-MBéguié, Didier; Yahiaoui, Nabila

    2010-07-16

    Comparative sequence analysis of complex loci such as resistance gene analog clusters allows estimating the degree of sequence conservation and mechanisms of divergence at the intraspecies level. In banana (Musa sp.), two diploid wild species Musa acuminata (A genome) and Musa balbisiana (B genome) contribute to the polyploid genome of many cultivars. The M. balbisiana species is associated with vigour and tolerance to pests and disease and little is known on the genome structure and haplotype diversity within this species. Here, we compare two genomic sequences of 253 and 223 kb corresponding to two haplotypes of the RGA08 resistance gene analog locus in M. balbisiana "Pisang Klutuk Wulung" (PKW). Sequence comparison revealed two regions of contrasting features. The first is a highly colinear gene-rich region where the two haplotypes diverge only by single nucleotide polymorphisms and two repetitive element insertions. The second corresponds to a large cluster of RGA08 genes, with 13 and 18 predicted RGA genes and pseudogenes spread over 131 and 152 kb respectively on each haplotype. The RGA08 cluster is enriched in repetitive element insertions, in duplicated non-coding intergenic sequences including low complexity regions and shows structural variations between haplotypes. Although some allelic relationships are retained, a large diversity of RGA08 genes occurs in this single M. balbisiana genotype, with several RGA08 paralogs specific to each haplotype. The RGA08 gene family has evolved by mechanisms of unequal recombination, intragenic sequence exchange and diversifying selection. An unequal recombination event taking place between duplicated non-coding intergenic sequences resulted in a different RGA08 gene content between haplotypes pointing out the role of such duplicated regions in the evolution of RGA clusters. Based on the synonymous substitution rate in coding sequences, we estimated a 1 million year divergence time for these M. balbisiana haplotypes. A large RGA08 gene cluster identified in wild banana corresponds to a highly variable genomic region between haplotypes surrounded by conserved flanking regions. High level of sequence identity (70 to 99%) of the genic and intergenic regions suggests a recent and rapid evolution of this cluster in M. balbisiana.

  1. Phylogenetic Analysis of Phenotypically Characterized Cryptococcus laurentii Isolates Reveals High Frequency of Cryptic Species

    PubMed Central

    Ferreira-Paim, Kennio; Ferreira, Thatiana Bragine; Andrade-Silva, Leonardo; Mora, Delio Jose; Springer, Deborah J.; Heitman, Joseph; Fonseca, Fernanda Machado; Matos, Dulcilena; Melhem, Márcia Souza Carvalho; Silva-Vergara, Mario León

    2014-01-01

    Background Although Cryptococcus laurentii has been considered saprophytic and its taxonomy is still being described, several cases of human infections have already been reported. This study aimed to evaluate molecular aspects of C. laurentii isolates from Brazil, Botswana, Canada, and the United States. Methods In this study, 100 phenotypically identified C. laurentii isolates were evaluated by sequencing the 18S nuclear ribosomal small subunit rRNA gene (18S-SSU), D1/D2 region of 28S nuclear ribosomal large subunit rRNA gene (28S-LSU), and the internal transcribed spacer (ITS) of the ribosomal region. Results BLAST searches using 550-bp, 650-bp, and 550-bp sequenced amplicons obtained from the 18S-SSU, 28S-LSU, and the ITS region led to the identification of 75 C. laurentii strains that shared 99–100% identity with C. laurentii CBS 139. A total of nine isolates shared 99% identity with both Bullera sp. VY-68 and C. laurentii RY1. One isolate shared 99% identity with Cryptococcus rajasthanensis CBS 10406, and eight isolates shared 100% identity with Cryptococcus sp. APSS 862 according to the 28S-LSU and ITS regions and were designated as Cryptococcus aspenensis sp. nov. (CBS 13867). While 16 isolates shared 99% identity with Cryptococcus flavescens CBS 942 according to the 18S-SSU sequence, only six were confirmed using the 28S-LSU and ITS region sequences. The remaining 10 shared 99% identity with Cryptococcus terrestris CBS 10810, which was recently described in Brazil. Through concatenated sequence analyses, seven sequence types in C. laurentii, three in C. flavescens, one in C. terrestris, and one in the C. aspenensis sp. nov. were identified. Conclusions Sequencing permitted the characterization of 75% of the environmental C. laurentii isolates from different geographical areas and the identification of seven haplotypes of this species. Among sequenced regions, the increased variability of the ITS region in comparison to the 18S-SSU and 28S-LSU regions reinforces its applicability as a DNA barcode. PMID:25251413

  2. Bacterial community composition in different sediments from the Eastern Mediterranean Sea: a comparison of four 16S ribosomal DNA clone libraries.

    PubMed

    Polymenakou, Paraskevi N; Bertilsson, Stefan; Tselepides, Anastasios; Stephanou, Euripides G

    2005-10-01

    The regional variability of sediment bacterial community composition and diversity was studied by comparative analysis of four large 16S ribosomal DNA (rDNA) clone libraries from sediments in different regions of the Eastern Mediterranean Sea (Thermaikos Gulf, Cretan Sea, and South Ionian Sea). Amplified rDNA restriction analysis of 664 clones from the libraries indicated that the rDNA richness and evenness was high: for example, a near-1:1 relationship among screened clones and number of unique restriction patterns when up to 190 clones were screened for each library. Phylogenetic analysis of 207 bacterial 16S rDNA sequences from the sediment libraries demonstrated that Gamma-, Delta-, and Alphaproteobacteria, Holophaga/Acidobacteria, Planctomycetales, Actinobacteria, Bacteroidetes, and Verrucomicrobia were represented in all four libraries. A few clones also grouped with the Betaproteobacteria, Nitrospirae, Spirochaetales, Chlamydiae, Firmicutes, and candidate division OP11. The abundance of sequences affiliated with Gammaproteobacteria was higher in libraries from shallow sediments in the Thermaikos Gulf (30 m) and the Cretan Sea (100 m) compared to the deeper South Ionian station (2790 m). Most sequences in the four sediment libraries clustered with uncultured 16S rDNA phylotypes from marine habitats, and many of the closest matches were clones from hydrocarbon seeps, benzene-mineralizing consortia, sulfate reducers, sulfur oxidizers, and ammonia oxidizers. LIBSHUFF statistics of 16S rDNA gene sequences from the four libraries revealed major differences, indicating either a very high richness in the sediment bacterial communities or considerable variability in bacterial community composition among regions, or both.

  3. Comparative Analysis of the Complete Plastomes of Apostasia wallichii and Neuwiedia singapureana (Apostasioideae) Reveals Different Evolutionary Dynamics of IR/SSC Boundary among Photosynthetic Orchids.

    PubMed

    Niu, Zhitao; Pan, Jiajia; Zhu, Shuying; Li, Ludan; Xue, Qingyun; Liu, Wei; Ding, Xiaoyu

    2017-01-01

    Apostasioideae consists of only two genera, Apostasia and Neuwiedia, which are mainly distributed in Southeast Asia and northern Australia. The floral structure, taxonomy, biogeography, and genome variation of Apostasioideae have been intensively studied. However, detailed analyses of plastome composition and structure and comparisons with those of other orchid subfamilies have not yet been conducted. Here, the complete plastome sequences of Apostasia wallichii and Neuwiedia singapureana were sequenced and compared with 43 previously published photosynthetic orchid plastomes to characterize the plastome structure and evolution in the orchids. Unlike many orchid plastomes (e.g., Paphiopedilum and Vanilla), the plastomes of Apostasioideae contain a full set of 11 functional NADH dehydrogenase (ndh) genes. The distribution of repeat sequences and simple sequence repeat elements enhanced the view that the mutation rate of non-coding regions was higher than that of coding regions. The 10 loci (ndhA intron, matK-5'trnK, clpP-psbB, rps8-rpl14, trnT-trnL, 3'trnK-matK, clpP intron, psbK-trnK, trnS-psbC, and ndhF-rpl32) that had the highest degrees of sequence variability were identified as mutational hotspots for the Apostasia plastome. Furthermore, our results revealed that plastid genes exhibited a variable evolution rate within and among different orchid genera. Considering the diversified evolution of both coding and non-coding regions, we suggested that the plastome-wide evolution of orchid species was disproportional. Additionally, the sequences flanking the inverted repeat/small single copy (IR/SSC) junctions of photosynthetic orchid plastomes were categorized into three types according to the presence/absence of ndh genes. Different evolutionary dynamics for each of the three IR/SSC types of photosynthetic orchid plastomes were also proposed.

  4. Comparative Analysis of the Complete Plastomes of Apostasia wallichii and Neuwiedia singapureana (Apostasioideae) Reveals Different Evolutionary Dynamics of IR/SSC Boundary among Photosynthetic Orchids

    PubMed Central

    Niu, Zhitao; Pan, Jiajia; Zhu, Shuying; Li, Ludan; Xue, Qingyun; Liu, Wei; Ding, Xiaoyu

    2017-01-01

    Apostasioideae consists of only two genera, Apostasia and Neuwiedia, which are mainly distributed in Southeast Asia and northern Australia. The floral structure, taxonomy, biogeography, and genome variation of Apostasioideae have been intensively studied. However, detailed analyses of plastome composition and structure and comparisons with those of other orchid subfamilies have not yet been conducted. Here, the complete plastome sequences of Apostasia wallichii and Neuwiedia singapureana were sequenced and compared with 43 previously published photosynthetic orchid plastomes to characterize the plastome structure and evolution in the orchids. Unlike many orchid plastomes (e.g., Paphiopedilum and Vanilla), the plastomes of Apostasioideae contain a full set of 11 functional NADH dehydrogenase (ndh) genes. The distribution of repeat sequences and simple sequence repeat elements enhanced the view that the mutation rate of non-coding regions was higher than that of coding regions. The 10 loci (ndhA intron, matK-5′trnK, clpP-psbB, rps8-rpl14, trnT-trnL, 3′trnK-matK, clpP intron, psbK-trnK, trnS-psbC, and ndhF-rpl32) that had the highest degrees of sequence variability were identified as mutational hotspots for the Apostasia plastome. Furthermore, our results revealed that plastid genes exhibited a variable evolution rate within and among different orchid genera. Considering the diversified evolution of both coding and non-coding regions, we suggested that the plastome-wide evolution of orchid species was disproportional. Additionally, the sequences flanking the inverted repeat/small single copy (IR/SSC) junctions of photosynthetic orchid plastomes were categorized into three types according to the presence/absence of ndh genes. Different evolutionary dynamics for each of the three IR/SSC types of photosynthetic orchid plastomes were also proposed. PMID:29046685

  5. Ultra-barcoding in cacao (Theobroma spp.; Malvaceae) using whole chloroplast genomes and nuclear ribosomal DNA.

    PubMed

    Kane, Nolan; Sveinsson, Saemundur; Dempewolf, Hannes; Yang, Ji Yong; Zhang, Dapeng; Engels, Johannes M M; Cronk, Quentin

    2012-02-01

    To reliably identify lineages below the species level such as subspecies or varieties, we propose an extension to DNA-barcoding using next-generation sequencing to produce whole organellar genomes and substantial nuclear ribosomal sequence. Because this method uses much longer versions of the traditional DNA-barcoding loci in the plastid and ribosomal DNA, we call our approach ultra-barcoding (UBC). We used high-throughput next-generation sequencing to scan the genome and generate reliable sequence of high copy number regions. Using this method, we examined whole plastid genomes as well as nearly 6000 bases of nuclear ribosomal DNA sequences for nine genotypes of Theobroma cacao and an individual of the related species T. grandiflorum, as well as an additional publicly available whole plastid genome of T. cacao. All individuals of T. cacao examined were uniquely distinguished, and evidence of reticulation and gene flow was observed. Sequence variation was observed in some of the canonical barcoding regions between species, but other regions of the chloroplast were more variable both within species and between species, as were ribosomal spacers. Furthermore, no single region provides the level of data available using the complete plastid genome and rDNA. Our data demonstrate that UBC is a viable, increasingly cost-effective approach for reliably distinguishing varieties and even individual genotypes of T. cacao. This approach shows great promise for applications where very closely related or interbreeding taxa must be distinguished.

  6. Re-sequencing regions of the ovine Y chromosome in domestic and wild sheep reveals novel paternal haplotypes.

    PubMed

    Meadows, J R S; Kijas, J W

    2009-02-01

    The male-specific region of the ovine Y chromosome (MSY) remains poorly characterized, yet sequence variants from this region have the potential to reveal the wild progenitor of domestic sheep or examples of domestic and wild paternal introgression. The 5' promoter region of the sex-determining gene SRY was re-sequenced using a subset of wild sheep including bighorn (Ovis canadensis), thinhorn (Ovis dalli spp.), urial (Ovis vignei), argali (Ovis ammon), mouflon (Ovis musimon) and domestic sheep (Ovis aries). Seven novel SNPs (oY2-oY8) were revealed; these were polymorphic between but not within species. Re-sequencing and fragment analysis was applied to the MSY microsatellite SRYM18. It contains a complex compound repeat structure and sequencing of three novel size fragments revealed that a pentanucleotide element remained fixed, whilst a dinucleotide element displayed variability within species. Comparison of the sequence between species revealed that urial and argali sheep grouped more closely to the mouflon and domestic breeds than the pachyceriforms (bighorn and thinhorn). SNP and microsatellite data were combined to define six previously undetected haplotypes. Analysis revealed the mouflon as the only species to share a haplotype with domestic sheep, consistent with its status as a feral domesticate that has undergone male-mediated exchange with domestic animals. A comparison of the remaining wild species and domestic sheep revealed that O. aries is free from signatures of wild sheep introgression.

  7. Genetic characterization of UCS region of Pneumocystis jirovecii and construction of allelic profiles of Indian isolates based on sequence typing at three regions.

    PubMed

    Gupta, Rashmi; Mirdha, Bijay Ranjan; Guleria, Randeep; Kumar, Lalit; Luthra, Kalpana; Agarwal, Sanjay Kumar; Sreenivas, Vishnubhatla

    2013-01-01

    Pneumocystis jirovecii is an opportunistic pathogen that causes severe pneumonia in immunocompromised patients. To study the genetic diversity of P. jirovecii in India the upstream conserved sequence (UCS) region of Pneumocystis genome was amplified, sequenced and genotyped from a set of respiratory specimens obtained from 50 patients with a positive result for nested mitochondrial large subunit ribosomal RNA (mtLSU rRNA) PCR during the years 2005-2008. Of these 50 cases, 45 showed a positive PCR for UCS region. Variations in the tandem repeats in UCS region were characterized by sequencing all the positive cases. Of the 45 cases, one case showed five repeats, 11 cases showed four repeats, 29 cases showed three repeats and four cases showed two repeats. By running amplified DNA from all these cases on a high-resolution gel, mixed infection was observed in 12 cases (26.7%, 12/45). Forty three of 45 cases included in this study had previously been typed at mtLSU rRNA and internal transcribed spacer (ITS) region by our group. In the present study, the genotypes at those two regions were combined with UCS repeat patterns to construct allelic profiles of 43 cases. A total of 36 allelic profiles were observed in 43 isolates indicating high genetic variability. A statistically significant association was observed between mtLSU rRNA genotype 1, ITS type Ea and UCS repeat pattern 4. Copyright © 2012 Elsevier B.V. All rights reserved.

  8. Genetic polymorphisms in the amino acid transporters LAT1 and LAT2 in relation to the pharmacokinetics and side effects of melphalan.

    PubMed

    Kühne, Annett; Kaiser, Rolf; Schirmer, Markus; Heider, Ulrike; Muhlke, Sabine; Niere, Wiebke; Overbeck, Tobias; Hohloch, Karin; Trümper, Lorenz; Sezer, Orhan; Brockmöller, Jürgen

    2007-07-01

    Melphalan is widely used in the treatment of multiple myeloma. Pharmacokinetics of this alkylating drug shows high inter-individual variability. As melphalan is a phenylalanine derivative, the pharmacokinetic variability may be determined by genetic polymorphisms in the L-type amino acid transporters LAT1 (SLC7A5) and LAT2 (SLC7A8). Pharmacokinetics were analysed in 64 patients after first administration of intravenous melphalan. Severity of side effects was documented according to WHO criteria. Genomic DNA was analysed for polymorphisms in LAT1 and LAT2 by sequencing of the entire coding region, intron-exon boundaries and 2 kb upstream promoter region. Selected polymorphisms in the common heavy chain of both transporters, the protein 4F2hc (SLC3A2), were analysed by single nucleotide primer extension. Melphalan pharmacokinetics was highly variable with up to 6.2-fold differences in total clearance. A total of 44 polymorphisms were identified in LAT1 and 21 polymorphisms in LAT2. From all variants, only five were in the coding region and only one heterozygous non-synonymous polymorphism (Ala94Thr) was found in LAT2. Numerous polymorphisms were found in the LAT1 and LAT2 5'-flanking regions but did not correlate with expression of the respective genes. No significant correlations could be observed between the polymorphisms in 4F2hc, LAT1, and LAT2 with melphalan pharmacokinetics or with melphalan side effects. The study confirmed that these transporter genes are highly conserved, particularly in the coding sequences. Genetic variation in 4F2hc, LAT1, and LAT2 does not appear to be a major cause of inter-individual variability in pharmacokinetics and of adverse reactions to melphalan.

  9. African Swine Fever Virus Isolate, Georgia, 2007

    PubMed Central

    Rowlands, Rebecca J.; Michaud, Vincent; Heath, Livio; Hutchings, Geoff; Oura, Chris; Vosloo, Wilna; Dwarka, Rahana; Onashvili, Tinatin; Albina, Emmanuel

    2008-01-01

    African swine fever (ASF) is widespread in Africa but is rarely introduced to other continents. In June 2007, ASF was confirmed in the Caucasus region of Georgia, and it has since spread to neighboring countries. DNA fragments amplified from the genome of the isolates from domestic pigs in Georgia in 2007 were sequenced and compared with other ASF virus (ASFV) isolates to establish the genotype of the virus. Sequences were obtained from 4 genome regions, including part of the gene B646L that encodes the p72 capsid protein, the complete E183L and CP204L genes, which encode the p54 and p30 proteins and the variable region of the B602L gene. Analysis of these sequences indicated that the Georgia 2007 isolate is closely related to isolates belonging to genotype II, which is circulating in Mozambique, Madagascar, and Zambia. One possibility for the spread of disease to Georgia is that pigs were fed ASFV-contaminated pork brought in on ships and, subsequently, the disease was disseminated throughout the region. PMID:19046509

  10. Sequence of a cDNA encoding pancreatic preprosomatostatin-22.

    PubMed Central

    Magazin, M; Minth, C D; Funckes, C L; Deschenes, R; Tavianini, M A; Dixon, J E

    1982-01-01

    We report the nucleotide sequence of a precursor to somatostatin that upon proteolytic processing may give rise to a hormone of 22 amino acids. The nucleotide sequence of a cDNA from the channel catfish (Ictalurus punctatus) encodes a precursor to somatostatin that is 105 amino acids (Mr, 11,500). The cDNA coding for somatostatin-22 consists of 36 nucleotides in the 5' untranslated region, 315 nucleotides that code for the precursor to somatostatin-22, 269 nucleotides at the 3' untranslated region, and a variable length of poly(A). The putative preprohormone contains a sequence of hydrophobic amino acids at the amino terminus that has the properties of a "signal" peptide. A connecting sequence of approximately 57 amino acids is followed by a single Arg-Arg sequence, which immediately precedes the hormone. Somatostatin-22 is homologous to somatostatin-14 in 7 of the 14 amino acids, including the Phe-Trp-Lys sequence. Hybridization selection of mRNA, followed by its translation in a wheat germ cell-free system, resulted in the synthesis of a single polypeptide having a molecular weight of approximately 10,000 as estimated on Na-DodSO4/polyacrylamide gels. PMID:6127673

  11. HIA: a genome mapper using hybrid index-based sequence alignment.

    PubMed

    Choi, Jongpill; Park, Kiejung; Cho, Seong Beom; Chung, Myungguen

    2015-01-01

    A number of alignment tools have been developed to align sequencing reads to the human reference genome. The scale of information from next-generation sequencing (NGS) experiments, however, is increasing rapidly. Recent studies based on NGS technology have routinely produced exome or whole-genome sequences from several hundreds or thousands of samples. To accommodate the increasing need of analyzing very large NGS data sets, it is necessary to develop faster, more sensitive and accurate mapping tools. HIA uses two indices, a hash table index and a suffix array index. The hash table performs direct lookup of a q-gram, and the suffix array performs very fast lookup of variable-length strings by exploiting binary search. We observed that combining hash table and suffix array (hybrid index) is much faster than the suffix array method for finding a substring in the reference sequence. Here, we define the matching region (MR) as the longest common substring between a reference and a read, and the candidate alignment region (CAR) as a list of MRs that are close to each other. The hybrid index is used to find CARs between a reference and a read. We found that aligning only the unmatched regions in the CAR is much faster than aligning the whole CAR. In benchmark analysis, HIA outperformed the other aligners in mapping speed, without significant loss of mapping accuracy. Our experiments show that the hybrid of hash table and suffix array is useful in terms of speed for mapping NGS reads to the human reference genome sequence. In conclusion, our tool is appropriate for aligning massive data sets generated by NGS.
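
    The two lookups described in this abstract can be illustrated with a minimal Python sketch. It is not HIA's implementation: the function names, the naive suffix array construction, and the rule of using the hash table only when the pattern length equals q are all assumptions made for illustration.

```python
from collections import defaultdict


def build_qgram_index(text, q):
    """Hash table: map every q-gram of text to its start positions."""
    index = defaultdict(list)
    for i in range(len(text) - q + 1):
        index[text[i:i + q]].append(i)
    return index


def build_suffix_array(text):
    """Naive suffix array (sorts full suffixes); fine for a toy reference only."""
    return sorted(range(len(text)), key=lambda i: text[i:])


def suffix_array_find(text, sa, pattern):
    """Binary search the suffix array for every occurrence of pattern."""
    m = len(pattern)
    # Lower bound: first suffix whose m-character prefix is >= pattern.
    lo, hi = 0, len(sa)
    while lo < hi:
        mid = (lo + hi) // 2
        if text[sa[mid]:sa[mid] + m] < pattern:
            lo = mid + 1
        else:
            hi = mid
    start = lo
    # Upper bound: first suffix whose m-character prefix is > pattern.
    hi = len(sa)
    while lo < hi:
        mid = (lo + hi) // 2
        if text[sa[mid]:sa[mid] + m] <= pattern:
            lo = mid + 1
        else:
            hi = mid
    return sorted(sa[start:lo])


def hybrid_find(text, sa, qindex, q, pattern):
    """Hypothetical dispatch: direct hash lookup for q-grams, suffix array otherwise."""
    if len(pattern) == q:
        return list(qindex.get(pattern, []))
    return suffix_array_find(text, sa, pattern)
```

    In this sketch the hash table answers fixed-length queries in a single dictionary lookup, while the suffix array handles patterns of any length at the cost of a binary search.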

  12. Bacterial CRISPR Regions: General Features and their Potential for Epidemiological Molecular Typing Studies.

    PubMed

    Karimi, Zahra; Ahmadi, Ali; Najafi, Ali; Ranjbar, Reza

    2018-01-01

    CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) loci as novel and applicable regions in prokaryotic genomes have gained great attention in the post-genomics era. These unique regions are diverse in number and sequence composition in different pathogenic bacteria and thereby can be a suitable candidate for molecular epidemiology and genotyping studies. Furthermore, the arrayed structure of CRISPR loci (several unique repeats spaced with the variable sequence) and associated cas genes act as an active prokaryotic immune system against viral replication and conjugative elements. This property can be used as a tool for RNA editing in bioengineering studies. The aim of this review was to survey some details about the history, nature, and potential applications of CRISPR arrays in both genetic engineering and bacterial genotyping studies.

  13. Molecular Variability Among Isolates of Prunus Necrotic Ringspot Virus from Different Prunus spp.

    PubMed

    Aparicio, F; Myrta, A; Di Terlizzi, B; Pallás, V

    1999-11-01

    Viral sequences amplified by polymerase chain reaction from 25 isolates of Prunus necrotic ringspot virus (PNRSV), varying in the symptomatology they cause in six different Prunus spp., were analyzed for restriction fragment polymorphisms. Most of the isolates could be discriminated by using a combination of three different restriction enzymes. The nucleotide sequences of the RNA 4 of 15 of these isolates were determined. Sequence comparisons and phylogenetic analyses of the RNA 4 and coat proteins (CPs) revealed that all of the isolates clustered into three different groups, represented by three previously sequenced PNRSV isolates: PV32, PE5, and PV96. The PE5-type group was characterized by a 5' untranslated region that was clearly different from that of the other two groups. The PV32-type group was characterized by an extra hexanucleotide consisting of a duplication of the six immediately preceding nucleotides. Although most of the variability was observed in the first third of the CP, the amino acid residues in this region, which were previously thought to be functionally important in the replication cycle of the virus, were strictly conserved. No clear correlation with the type of symptom or host specificity could be observed. The validity of this grouping was confirmed when other isolates recently characterized by other authors were included in these analyses.

  14. Historical connectivity, contemporary isolation and local adaptation in a widespread but discontinuously distributed species endemic to Taiwan, Rhododendron oldhamii (Ericaceae)

    PubMed Central

    Hsieh, Y-C; Chung, J-D; Wang, C-N; Chang, C-T; Chen, C-Y; Hwang, S-Y

    2013-01-01

    Elucidation of the evolutionary processes that constrain or facilitate adaptive divergence is a central goal in evolutionary biology, especially in non-model organisms. We tested whether changes in dynamics of gene flow (historical vs contemporary) caused population isolation and examined local adaptation in response to environmental selective forces in fragmented Rhododendron oldhamii populations. Variation in 26 expressed sequence tag-simple sequence repeat loci from 18 populations in Taiwan was investigated by examining patterns of genetic diversity, inbreeding, geographic structure, recent bottlenecks, and historical and contemporary gene flow. Selection associated with environmental variables was also examined. Bayesian clustering analysis revealed four regional population groups of north, central, south and southeast with significant genetic differentiation. Historical bottlenecks beginning 9168–13,092 years ago and ending 1584–3504 years ago were revealed by estimates using approximate Bayesian computation for all four regional samples analyzed. Recent migration within and across geographic regions was limited. However, major dispersal sources were found within geographic regions. Altitudinal clines of allelic frequencies of environmentally associated positively selected outliers were found, indicating adaptive divergence. Our results point to a transition from historical population connectivity toward contemporary population isolation and divergence on a regional scale. Spatial and temporal dispersal differences may have resulted in regional population divergence and local adaptation associated with environmental variables, which may have played roles as selective forces at a regional scale. PMID:23591517

  15. rpoB-Based Identification of Nonpigmented and Late-Pigmenting Rapidly Growing Mycobacteria

    PubMed Central

    Adékambi, Toïdi; Colson, Philippe; Drancourt, Michel

    2003-01-01

    Nonpigmented and late-pigmenting rapidly growing mycobacteria (RGM) are increasingly isolated in clinical microbiology laboratories. Their accurate identification remains problematic because classification is labor-intensive work and because new taxa are not often incorporated into classification databases. Also, 16S rRNA gene sequence analysis underestimates RGM diversity and does not distinguish between all taxa. We determined the complete nucleotide sequence of the rpoB gene, which encodes the bacterial β subunit of the RNA polymerase, for 20 RGM type strains. After using in-house software which analyzes and graphically represents variability stretches of 60 bp along the nucleotide sequence, our analysis focused on a 723-bp variable region exhibiting 83.9 to 97% interspecies similarity and 0 to 1.7% intraspecific divergence. Primer pair Myco-F-Myco-R was designed as a tool for both PCR amplification and sequencing of this region for molecular identification of RGM. This tool was used for identification of 63 RGM clinical isolates previously identified at the species level on the basis of phenotypic characteristics and by 16S rRNA gene sequence analysis. Of 63 clinical isolates, 59 (94%) exhibited <2% partial rpoB gene sequence divergence from 1 of 20 species under study and were regarded as correctly identified at the species level. Mycobacterium abscessus and Mycobacterium mucogenicum isolates were clearly distinguished from Mycobacterium chelonae; Mycobacterium mageritense isolates were clearly distinguished from “Mycobacterium houstonense.” Four isolates were not identified at the species level because they exhibited >3% partial rpoB gene sequence divergence from the corresponding type strain; they belonged to three taxa related to M. mucogenicum, Mycobacterium smegmatis, and Mycobacterium porcinum. For M. abscessus and M. mucogenicum, this partial sequence yielded a high genetic heterogeneity within the clinical isolates. We conclude that molecular identification by analysis of the 723-bp rpoB sequence is a rapid and accurate tool for identification of RGM. PMID:14662964
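
The windowed variability scan and the percent-divergence comparison mentioned in this abstract can be illustrated with a minimal Python sketch. It is not the authors' in-house software: the function names, the treatment of gaps and ambiguous bases, and the toy sequences are assumptions made for illustration only.

```python
from typing import List, Tuple


def window_variability(aligned: List[str], window: int = 60,
                       step: int = 60) -> List[Tuple[int, float]]:
    """Return (start, fraction of variable columns) for each window.

    Assumes the sequences are already aligned to equal length.
    """
    if not aligned:
        raise ValueError("need at least one sequence")
    length = len(aligned[0])
    if any(len(s) != length for s in aligned):
        raise ValueError("sequences must be aligned to equal length")

    # A column counts as variable when more than one distinct base occurs.
    # Gaps ('-') and ambiguous bases ('N') are ignored (an assumption).
    variable = []
    for column in zip(*aligned):
        bases = {b.upper() for b in column if b.upper() not in "-N"}
        variable.append(len(bases) > 1)

    results = []
    for start in range(0, length - window + 1, step):
        fraction = sum(variable[start:start + window]) / window
        results.append((start, fraction))
    return results


def percent_divergence(a: str, b: str) -> float:
    """Percentage of mismatched positions between two aligned sequences."""
    if len(a) != len(b) or not a:
        raise ValueError("sequences must be aligned and non-empty")
    mismatches = sum(x.upper() != y.upper() for x, y in zip(a, b))
    return 100.0 * mismatches / len(a)


if __name__ == "__main__":
    toy = ["AAAACCCC", "AAAACCGT"]          # hypothetical toy alignment
    print(window_variability(toy, window=4, step=4))
    print(percent_divergence(toy[0], toy[1]))
```

With a real alignment, the default 60-bp window matches the stretch length named in the abstract, and a divergence value from `percent_divergence` could be compared against the 2% and 3% cut-offs reported there.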

  16. Next-generation genomic shotgun sequencing indicates greater genetic variability in the mitochondria of Hypophthalmichthys molitrix relative to H. nobilis from the Mississippi River, USA and provides tools for research and detection

    USGS Publications Warehouse

    Miller, John J; Eackles, Michael S.; Stauffer, Jay R; King, Timothy L.

    2015-01-01

    We characterized variation within the mitochondrial genomes of the invasive silver carp (Hypophthalmichthys molitrix) and bighead carp (H. nobilis) from the Mississippi River drainage by mapping our Next-Generation sequences to their publicly available genomes. Variant detection resulted in 338 single-nucleotide polymorphisms for H. molitrix and 39 for H. nobilis. The much greater genetic variation in H. molitrix mitochondria relative to H. nobilis may be indicative of a greater North American female effective population size of the former. When variation was quantified by gene, many tRNA loci appear to have little or no variability based on our results whereas protein-coding regions were more frequently polymorphic. These results provide biologists with additional regions of DNA to be used as markers to study the invasion dynamics of these species.

  17. Design, Construction and Evaluation of 1a/JFH1 HCV Chimera by Replacing the Intergenotypic Variable Region

    PubMed Central

    Ghasemi, Faezeh; Ghayour-Mobarhan, Majid; Pasdar, Alireza; Pourianfar, Hamid; Reza Aghasadeghi, Mohammad; Gouklani, Hamed; Meshkat, Zahra

    2016-01-01

    Background: The E2 glycoprotein is an important encoded hepatitis C virus (HCV) protein that contains three different variable regions. Objectives: The aim of the present study was to construct an HCV 1a/JFH1 chimeric virus by replacing the intergenotypic variable region (igVR) fragment of the highly variable region of the E2 gene of the Japanese fulminant hepatitis genotype 2a JFH1 virus with a similar region of HCV genotype 1a. This chimera was produced as a model virus with the ability to be cultured. We analyzed the adapted virus and the variations of nucleic acids within it. Methods: Specific primers were designed for the igVR of HCV genotype 1a followed by the overlap-PCR method for the synthesis of the desired DNA fragment. The amplified igVR-1a chimera gene and pFL-J6/JFH were digested by KpnI and BsiWI restriction enzymes, and the fragment was ligated into pFL-J6/JFH. The recombinant vector was transformed into Escherichia coli JM109 strain competent cells. All clones were confirmed by colony PCR using specific primers, and the confirmed recombinant vector was sequenced. The recombinant vector was targeted for RNA synthesis by T7 RNA polymerase enzyme. RNA transfection was performed in the Huh7.5 cell line. Virus production in several passages and the evaluated viral load were studied using quantitative real-time PCR and ELISA methods. After 30 passages, the RNA virus was extracted and cloned in pcDNA3.1 vector, and was then sequenced. Results: Quantitative real-time PCR results showed 11,292,514 copies/mL of chimeric virus production in cell culture. The virus production was confirmed using ELISA, which showed a virus core production of 808.2 pg/mL. The results of cloning and sequencing showed that some of the nucleic acids in the chimera virus were changed, affecting the viral behavior in the cell culture. Conclusions: Real-time PCR and ELISA showed high levels of production of 1a/JFH1 chimeric HCV in the Huh7.5 cell culture. The constructed virus can be used for future studies, including the development of new HCV drugs and vaccines. PMID:27882063

  18. Bi-exponential T2 analysis of healthy and diseased Achilles tendons: an in vivo preliminary magnetic resonance study and correlation with clinical score.

    PubMed

    Juras, Vladimir; Apprich, Sebastian; Szomolanyi, Pavol; Bieri, Oliver; Deligianni, Xeni; Trattnig, Siegfried

    2013-10-01

    To compare mono- and bi-exponential T2 analysis in healthy and degenerated Achilles tendons using a recently introduced magnetic resonance variable-echo-time sequence (vTE) for T2 mapping. Ten volunteers and ten patients were included in the study. A variable-echo-time sequence was used with 20 echo times. Images were post-processed with both techniques, mono- and bi-exponential [T2 m, short T2 component (T2 s) and long T2 component (T2 l)]. The number of mono- and bi-exponentially decaying pixels in each region of interest was expressed as a ratio (B/M). Patients were clinically assessed with the Achilles Tendon Rupture Score (ATRS), and these values were correlated with the T2 values. The means for both T2 m and T2 s were statistically significantly different between patients and volunteers; however, for T2 s, the P value was lower. In patients, the Pearson correlation coefficient between ATRS and T2 s was -0.816 (P = 0.007). The proposed variable-echo-time sequence can be successfully used as an alternative method to UTE sequences with some added benefits, such as a short imaging time along with relatively high resolution and minimised blurring artefacts, and minimised susceptibility artefacts and chemical shift artefacts. Bi-exponential T2 calculation is superior to mono-exponential in terms of statistical significance for the diagnosis of Achilles tendinopathy.
    • Magnetic resonance imaging offers new insight into healthy and diseased Achilles tendons
    • Bi-exponential T2 calculation in Achilles tendons is more beneficial than mono-exponential
    • A short T2 component correlates strongly with clinical score
    • Variable echo time sequences successfully used instead of ultrashort echo time sequences

  19. Accurate, Rapid Taxonomic Classification of Fungal Large-Subunit rRNA Genes

    PubMed Central

    Liu, Kuan-Liang; Porras-Alfaro, Andrea; Eichorst, Stephanie A.

    2012-01-01

    Taxonomic and phylogenetic fingerprinting based on sequence analysis of gene fragments from the large-subunit rRNA (LSU) gene or the internal transcribed spacer (ITS) region is becoming an integral part of fungal classification. The lack of an accurate and robust classification tool trained by a validated sequence database for taxonomic placement of fungal LSU genes is a severe limitation in taxonomic analysis of fungal isolates or large data sets obtained from environmental surveys. Using a hand-curated set of 8,506 fungal LSU gene fragments, we determined the performance characteristics of a naïve Bayesian classifier across multiple taxonomic levels and compared the classifier performance to that of a sequence similarity-based (BLASTN) approach. The naïve Bayesian classifier was computationally more rapid (>460-fold with our system) than the BLASTN approach, and it provided equal or superior classification accuracy. Classifier accuracies were compared using sequence fragments of 100 bp and 400 bp and two different PCR primer anchor points to mimic sequence read lengths commonly obtained using current high-throughput sequencing technologies. Accuracy was higher with 400-bp sequence reads than with 100-bp reads. It was also significantly affected by sequence location across the 1,400-bp test region. The highest accuracy was obtained across either the D1 or D2 variable region. The naïve Bayesian classifier provides an effective and rapid means to classify fungal LSU sequences from large environmental surveys. The training set and tool are publicly available through the Ribosomal Database Project (http://rdp.cme.msu.edu/classifier/classifier.jsp). PMID:22194300
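
The idea of a naïve Bayesian classifier working on sequence words can be sketched in a few lines of Python. This is a simplified illustration, not the published tool: the class name `KmerNaiveBayes`, the word size `k=4`, the smoothing constants, the uniform prior over taxa, and the toy training sequences are all assumptions.

```python
import math
from collections import defaultdict


def kmers(seq, k):
    """Return the set of overlapping words of length k in a sequence."""
    seq = seq.upper()
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


class KmerNaiveBayes:
    """Toy word-presence naive Bayes classifier (illustrative only)."""

    def __init__(self, k=4):
        self.k = k
        # taxon -> word -> number of training sequences containing the word
        self.counts = defaultdict(lambda: defaultdict(int))
        # taxon -> number of training sequences
        self.totals = defaultdict(int)

    def train(self, seq, taxon):
        self.totals[taxon] += 1
        for word in kmers(seq, self.k):
            self.counts[taxon][word] += 1

    def classify(self, seq):
        """Return the taxon with the highest summed log-probability."""
        words = kmers(seq, self.k)
        best, best_score = None, -math.inf
        for taxon, n in self.totals.items():
            # Smoothed estimate of P(word present | taxon); a uniform
            # prior over taxa is assumed, so it is left out of the score.
            score = sum(
                math.log((self.counts[taxon].get(w, 0) + 0.5) / (n + 1))
                for w in words
            )
            if score > best_score:
                best, best_score = taxon, score
        return best


if __name__ == "__main__":
    clf = KmerNaiveBayes(k=4)
    clf.train("ACGTACGTACGT", "taxon_A")        # hypothetical training data
    clf.train("TTTTGGGGTTTTGGGG", "taxon_B")
    print(clf.classify("ACGTACG"))
    print(clf.classify("TTGGGGTT"))
```

Because the score is a sum of per-word log-probabilities, classification needs no alignment against the reference set, which is one plausible reason such a classifier can run much faster than a similarity search.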

  20. Mitochondrial DNA variation of indigenous goats in Narok and Isiolo counties of Kenya.

    PubMed

    Kibegwa, F M; Githui, K E; Jung'a, J O; Badamana, M S; Nyamu, M N

    2016-06-01

    Phylogenetic relationships among and genetic variability within 60 goats from two different indigenous breeds in Narok and Isiolo counties in Kenya and 22 published goat samples were analysed using mitochondrial control region sequences. The results showed that there were 54 polymorphic sites in a 481-bp sequence and 29 haplotypes were determined. The mean haplotype diversity and nucleotide diversity were 0.981 ± 0.006 and 0.019 ± 0.001, respectively. The phylogenetic analysis in combination with goat haplogroup reference sequences from GenBank showed that all goat sequences were clustered into two haplogroups (A and G), of which haplogroup A was the commonest in the two populations. A very high percentage (99.90%) of the genetic variation was distributed within the regions, and a smaller percentage (0.10%) was distributed among regions, as revealed by the analysis of molecular variance (AMOVA). These AMOVA results showed that the divergence between regions was not statistically significant. We concluded that the high levels of intrapopulation diversity in Isiolo and Narok goats and the weak phylogeographic structuring suggested that there existed strong gene flow among goat populations, probably caused by extensive transportation of goats in history. © 2015 Blackwell Verlag GmbH.
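
The two summary statistics reported in this abstract have standard definitions that can be written out as a short Python sketch: haplotype diversity as n/(n-1) times one minus the sum of squared haplotype frequencies, and nucleotide diversity as the mean number of pairwise differences per site. The function names and the toy sequences are assumptions; the abstract does not state which software the authors used.

```python
from collections import Counter
from itertools import combinations


def haplotype_diversity(seqs):
    """Nei's haplotype diversity: n/(n-1) * (1 - sum of squared frequencies)."""
    n = len(seqs)
    if n < 2:
        raise ValueError("need at least two sequences")
    freqs = [count / n for count in Counter(seqs).values()]
    return n / (n - 1) * (1 - sum(p * p for p in freqs))


def nucleotide_diversity(seqs):
    """Mean pairwise differences per site over all pairs of aligned sequences."""
    if len(seqs) < 2:
        raise ValueError("need at least two sequences")
    length = len(seqs[0])
    if any(len(s) != length for s in seqs):
        raise ValueError("sequences must be aligned to equal length")
    pairs = list(combinations(seqs, 2))
    diffs = sum(sum(a != b for a, b in zip(s, t)) for s, t in pairs)
    return diffs / (len(pairs) * length)


if __name__ == "__main__":
    toy = ["AAAA", "AAAA", "AAAT", "AATT"]   # hypothetical toy sample
    print(round(haplotype_diversity(toy), 4))
    print(round(nucleotide_diversity(toy), 4))
```

A haplotype diversity close to 1, as reported here, means that two randomly drawn individuals almost always carry different haplotypes.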

  1. Intrinsic challenges in ancient microbiome reconstruction using 16S rRNA gene amplification.

    PubMed

    Ziesemer, Kirsten A; Mann, Allison E; Sankaranarayanan, Krithivasan; Schroeder, Hannes; Ozga, Andrew T; Brandt, Bernd W; Zaura, Egija; Waters-Rist, Andrea; Hoogland, Menno; Salazar-García, Domingo C; Aldenderfer, Mark; Speller, Camilla; Hendy, Jessica; Weston, Darlene A; MacDonald, Sandy J; Thomas, Gavin H; Collins, Matthew J; Lewis, Cecil M; Hofman, Corinne; Warinner, Christina

    2015-11-13

    To date, characterization of ancient oral (dental calculus) and gut (coprolite) microbiota has been primarily accomplished through a metataxonomic approach involving targeted amplification of one or more variable regions in the 16S rRNA gene. Specifically, the V3 region (E. coli 341-534) of this gene has been suggested as an excellent candidate for ancient DNA amplification and microbial community reconstruction. However, in practice this metataxonomic approach often produces highly skewed taxonomic frequency data. In this study, we use non-targeted (shotgun metagenomics) sequencing methods to better understand skewed microbial profiles observed in four ancient dental calculus specimens previously analyzed by amplicon sequencing. Through comparisons of microbial taxonomic counts from paired amplicon (V3 U341F/534R) and shotgun sequencing datasets, we demonstrate that extensive length polymorphisms in the V3 region are a consistent and major cause of differential amplification leading to taxonomic bias in ancient microbiome reconstructions based on amplicon sequencing. We conclude that systematic amplification bias confounds attempts to accurately reconstruct microbiome taxonomic profiles from 16S rRNA V3 amplicon data generated using universal primers. Because in silico analysis indicates that alternative 16S rRNA hypervariable regions will present similar challenges, we advocate for the use of a shotgun metagenomics approach in ancient microbiome reconstructions.

  2. Intrinsic challenges in ancient microbiome reconstruction using 16S rRNA gene amplification

    PubMed Central

    Ziesemer, Kirsten A.; Mann, Allison E.; Sankaranarayanan, Krithivasan; Schroeder, Hannes; Ozga, Andrew T.; Brandt, Bernd W.; Zaura, Egija; Waters-Rist, Andrea; Hoogland, Menno; Salazar-García, Domingo C.; Aldenderfer, Mark; Speller, Camilla; Hendy, Jessica; Weston, Darlene A.; MacDonald, Sandy J.; Thomas, Gavin H.; Collins, Matthew J.; Lewis, Cecil M.; Hofman, Corinne; Warinner, Christina

    2015-01-01

    To date, characterization of ancient oral (dental calculus) and gut (coprolite) microbiota has been primarily accomplished through a metataxonomic approach involving targeted amplification of one or more variable regions in the 16S rRNA gene. Specifically, the V3 region (E. coli 341–534) of this gene has been suggested as an excellent candidate for ancient DNA amplification and microbial community reconstruction. However, in practice this metataxonomic approach often produces highly skewed taxonomic frequency data. In this study, we use non-targeted (shotgun metagenomics) sequencing methods to better understand skewed microbial profiles observed in four ancient dental calculus specimens previously analyzed by amplicon sequencing. Through comparisons of microbial taxonomic counts from paired amplicon (V3 U341F/534R) and shotgun sequencing datasets, we demonstrate that extensive length polymorphisms in the V3 region are a consistent and major cause of differential amplification leading to taxonomic bias in ancient microbiome reconstructions based on amplicon sequencing. We conclude that systematic amplification bias confounds attempts to accurately reconstruct microbiome taxonomic profiles from 16S rRNA V3 amplicon data generated using universal primers. Because in silico analysis indicates that alternative 16S rRNA hypervariable regions will present similar challenges, we advocate for the use of a shotgun metagenomics approach in ancient microbiome reconstructions. PMID:26563586

  3. DNA Sequence Analysis of Sry Alleles (Subgenus Mus) Implicates Misregulation as the Cause of C57bl/6j-Y(pos) Sex Reversal and Defines the Sry Functional Unit

    PubMed Central

    Albrecht, K. H.; Eicher, E. M.

    1997-01-01

    The Sry (sex determining region, Y chromosome) open reading frame from mice representing four species of the genus Mus was sequenced in an effort to understand the conditional dysfunction of some M. domesticus Sry alleles when present on the C57BL/6J inbred strain genetic background and to delimit the functionally important protein regions. Twenty-two Sry alleles were sequenced, most from wild-derived Y chromosomes, including 11 M. domesticus alleles, seven M. musculus alleles and two alleles each from the related species M. spicilegus and M. spretus. We found that the HMG domain (high mobility group DNA binding domain) and the unique regions are well conserved, while the glutamine repeat cluster (GRC) region is quite variable. No correlation was found between the predicted protein isoforms and the ability of a Sry allele to allow differentiation of ovarian tissue when on the C57BL/6J genetic background, strongly suggesting that the cause of this sex reversal is not the Sry protein itself, but rather the regulation of SRY expression. Furthermore, our interspecies sequence analysis provides compelling evidence that the M. musculus and M. domesticus SRY functional domain is contained in the first 143 amino acids, which includes the HMG domain and adjacent unique region (UR-2). PMID:9383069

  4. Database-independent Protein Sequencing (DiPS) Enables Full-length de Novo Protein and Antibody Sequence Determination.

    PubMed

    Savidor, Alon; Barzilay, Rotem; Elinger, Dalia; Yarden, Yosef; Lindzen, Moshit; Gabashvili, Alexandra; Adiv Tal, Ophir; Levin, Yishai

    2017-06-01

    Traditional "bottom-up" proteomic approaches use proteolytic digestion, LC-MS/MS, and database searching to elucidate peptide identities and their parent proteins. Protein sequences absent from the database cannot be identified, and even if present in the database, complete sequence coverage is rarely achieved even for the most abundant proteins in the sample. Thus, sequencing of unknown proteins such as antibodies or constituents of metaproteomes remains a challenging problem. To date, there is no available method for full-length protein sequencing, independent of a reference database, in high throughput. Here, we present Database-independent Protein Sequencing, a method for unambiguous, rapid, database-independent, full-length protein sequencing. The method is a novel combination of non-enzymatic, semi-random cleavage of the protein, LC-MS/MS analysis, peptide de novo sequencing, extraction of peptide tags, and their assembly into a consensus sequence using an algorithm named "Peptide Tag Assembler." As proof-of-concept, the method was applied to samples of three known proteins representing three size classes and to a previously un-sequenced, clinically relevant monoclonal antibody. Excluding leucine/isoleucine and glutamic acid/deamidated glutamine ambiguities, end-to-end full-length de novo sequencing was achieved with 99-100% accuracy for all benchmarking proteins and the antibody light chain. Accuracy of the sequenced antibody heavy chain, including the entire variable region, was also 100%, but there was a 23-residue gap in the constant region sequence. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.

  5. Complete genome sequence of a new enamovirus from Argentina infecting alfalfa plants showing dwarfism symptoms.

    PubMed

    Bejerman, Nicolás; Giolitti, Fabián; Trucco, Verónica; de Breuil, Soledad; Dietzgen, Ralf G; Lenardon, Sergio

    2016-07-01

    Alfalfa dwarf disease, probably caused by synergistic interactions of mixed virus infections, is a major and emergent disease that threatens alfalfa production in Argentina. Deep sequencing of diseased alfalfa plant samples from the central region of Argentina resulted in the identification of a new virus genome resembling enamoviruses in sequence and genome structure. Phylogenetic analysis suggests that it is a new member of the genus Enamovirus, family Luteoviridae. The virus is tentatively named "alfalfa enamovirus 1" (AEV-1). The availability of the AEV-1 genome sequence will make it possible to assess the genetic variability of this virus and to construct an infectious clone to investigate its role in alfalfa dwarfism disease.

  6. Sequence diversity among badnavirus isolates infecting black pepper and related species in India.

    PubMed

    Bhat, A I; Sasi, Shina; Revathy, K A; Deeshma, K P; Saji, K V

    2014-01-01

    The badnavirus, piper yellow mottle virus (PYMoV) is known to infect black pepper (Piper nigrum), betelvine (P. betle) and Indian long pepper (P. longum) in India and other parts of the world. The occurrence and variability of PYMoV or other badnaviruses in other species of Piper have not been reported so far. We have analysed sequence variability in the conserved putative reverse transcriptase (RT)/ribonuclease H (RNase H) coding region of the virus using specific badnavirus primers from 13 virus isolates of black pepper collected from different cultivars and regions and one isolate each from 23 other species of Piper. Of these, four species failed to produce the expected amplicon, while amplicons from four other species showed more similarity to plant sequences than to badnaviruses. Of the remaining, isolates from black pepper, P. argyrophyllum, P. attenuatum, P. barberi, P. betle, P. colubrinum, P. galeatum, P. longum, P. ornatum, P. sarmentosum and P. trichostachyon showed an identity of >85% at the nucleotide and >90% at the amino acid level with PYMoV, indicating that they are isolates of PYMoV. On the other hand, high sequence variability of 21-43% at the nucleotide and 17-46% at the amino acid level compared to PYMoV was found among isolates infecting P. bababudani, P. chaba, P. peepuloides, P. mullesua and P. thomsonii, suggesting the presence of new badnaviruses. Phylogenetic analyses showed close clustering of all PYMoV isolates that were well separated from other known badnaviruses. This is the first report of occurrence of PYMoV in eight Piper spp and likely occurrence of four new species in five Piper spp.

  7. Variability among Cucurbitaceae species (melon, cucumber and watermelon) in a genomic region containing a cluster of NBS-LRR genes.

    PubMed

    Morata, Jordi; Puigdomènech, Pere

    2017-02-08

    Cucurbitaceae species contain a significantly lower number of genes coding for proteins with similarity to plant resistance genes belonging to the NBS-LRR family than other plant species of similar genome size. A large proportion of these genes are organized in clusters that appear to be hotspots of variability. The genomes of the Cucurbitaceae species measured until now are intermediate in size (between 350 and 450 Mb) and they apparently have not undergone any genome duplications beside those at the origin of eudicots. The cluster containing the largest number of NBS-LRR genes has previously been analyzed in melon and related species and showed a high degree of interspecific and intraspecific variability. It was of interest to study whether similar behavior occurred in other clusters of the same family of genes. The cluster of NBS-LRR genes located in melon chromosome 9 was analyzed and compared with the syntenic regions in other cucurbit genomes. This is the second cluster in number within this species and it contains nine sequences with a NBS-LRR annotation including two genes, Fom1 and Prv, providing resistance against Fusarium and Papaya ring-spot virus (PRSV). The variability within the melon species appears to consist essentially of single nucleotide polymorphisms. Clusters of similar genes are present in the syntenic regions of the two species of Cucurbitaceae that were sequenced, cucumber and watermelon. Most of the genes in the syntenic clusters can be aligned between species and a hypothesis of generation of the cluster is proposed. The number of genes in the watermelon cluster is similar to that in melon while a higher number of genes (12) is present in cucumber, a species with a smaller genome than melon. After comparing genome resequencing data of 115 cucumber varieties, deletion of a group of genes is observed in a group of varieties of Indian origin. Clusters of genes coding for NBS-LRR proteins in cucurbits appear to have specific variability in different regions of the genome and between different species. This observation is in favour of considering that the adaptation of plant species to changing environments is based upon the variability that may occur at any location in the genome and that has been produced by specific mechanisms of sequence variation acting on plant genomes. This information could be useful both to understand the evolution of species and for plant breeding.

  8. A population study of the minicircles in Trypanosoma cruzi: predicting guide RNAs in the absence of empirical RNA editing.

    PubMed

    Thomas, Sean; Martinez, L L Isadora Trejo; Westenberger, Scott J; Sturm, Nancy R

    2007-05-24

    The structurally complex network of minicircles and maxicircles comprising the mitochondrial DNA of kinetoplastids mirrors the complexity of the RNA editing process that is required for faithful expression of encrypted maxicircle genes. Although a few of the guide RNAs that direct this editing process have been discovered on maxicircles, guide RNAs are mostly found on the minicircles. The nuclear and maxicircle genomes have been sequenced and assembled for Trypanosoma cruzi, the causative agent of Chagas disease; however, the complement of 1.4-kb minicircles, carrying four guide RNA genes per molecule in this parasite, has been less thoroughly characterised. Fifty-four CL Brener and 53 Esmeraldo strain minicircle sequence reads were extracted from T. cruzi whole genome shotgun sequencing data. With these sequences and all published T. cruzi minicircle sequences, 108 unique guide RNAs from all known T. cruzi minicircle sequences and two guide RNAs from the CL Brener maxicircle were predicted using a local alignment algorithm and mapped onto predicted or experimentally determined sequences of edited maxicircle open reading frames. For half of the sequences no statistically significant guide RNA could be assigned. Likely positions of these unidentified gRNAs in T. cruzi minicircle sequences are estimated using a simple Hidden Markov Model. With the local alignment predictions as a standard, the HMM had an ~85% chance of correctly identifying at least 20 nucleotides of guide RNA from a given minicircle sequence. Inter-minicircle recombination was documented. Variable regions contain species-specific areas of distinct nucleotide preference. Two maxicircle guide RNA genes were found. The identification of new minicircle sequences and the further characterization of all published minicircles are presented, including the first observation of recombination between minicircles. Extrapolation suggests a level of 4% recombinants in the population, supporting a relatively high recombination rate that may serve to minimize the persistence of gRNA pseudogenes. Characteristic nucleotide preferences observed within variable regions provide potential clues regarding the transcription and maturation of T. cruzi guide RNAs. Based on these preferences, a method of predicting T. cruzi guide RNAs using only primary minicircle sequence data was created.
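
The abstract above mentions a local alignment algorithm without describing it. As a generic illustration of local alignment, the following Python sketch implements Smith-Waterman scoring; the function name, the match, mismatch and gap values, and the toy sequences are assumptions and are not taken from the study, whose scoring scheme is not given here.

```python
def local_alignment_score(a, b, match=2, mismatch=-1, gap=-2):
    """Best Smith-Waterman local alignment score between two sequences.

    Uses a linear gap penalty and keeps only two rows of the
    dynamic-programming matrix, so memory grows with len(b) only.
    """
    prev = [0] * (len(b) + 1)
    best = 0
    for i in range(1, len(a) + 1):
        curr = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            # Resetting to 0 lets a new local alignment start at any cell.
            curr[j] = max(0, diag, prev[j] + gap, curr[j - 1] + gap)
            best = max(best, curr[j])
        prev = curr
    return best


if __name__ == "__main__":
    # Hypothetical toy sequences sharing the short stretch "ACG".
    print(local_alignment_score("TTTACGTTT", "GGACGGG"))
```

A predictor built on such scores would additionally need a significance threshold, which is consistent with the abstract's remark that no statistically significant guide RNA could be assigned for half of the sequences.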

  9. Parallel computation of genome-scale RNA secondary structure to detect structural constraints on human genome.

    PubMed

    Kawaguchi, Risa; Kiryu, Hisanori

    2016-05-06

    RNA secondary structure around splice sites is known to assist normal splicing by promoting spliceosome recognition. However, analyzing the structural properties of entire intronic regions or pre-mRNA sequences has been difficult hitherto, owing to serious experimental and computational limitations, such as low read coverage and numerical problems. Our novel software, "ParasoR", is designed to run on a computer cluster and enables the exact computation of various structural features of long RNA sequences under the constraint of maximal base-pairing distance. ParasoR divides dynamic programming (DP) matrices into smaller pieces, such that each piece can be computed by a separate computer node without losing the connectivity information between the pieces. ParasoR directly computes the ratios of DP variables to avoid the reduction of numerical precision caused by the cancellation of a large number of Boltzmann factors. The structural preferences of mRNAs computed by ParasoR show a high concordance with those determined by high-throughput sequencing analyses. Using ParasoR, we investigated the global structural preferences of transcribed regions in the human genome. A genome-wide folding simulation indicated that transcribed regions are significantly more structural than intergenic regions after removing repeat sequences and k-mer frequency bias. In particular, we observed a highly significant preference for base pairing over entire intronic regions as compared to their antisense sequences, as well as to intergenic regions. A comparison between pre-mRNAs and mRNAs showed that coding regions become more accessible after splicing, indicating constraints for translational efficiency. Such changes are correlated with gene expression levels, as well as GC content, and are enriched among genes associated with cytoskeleton and kinase functions. We have shown that ParasoR is very useful for analyzing the structural properties of long RNA sequences such as mRNAs, pre-mRNAs, and long non-coding RNAs whose lengths can be more than a million bases in the human genome. In our analyses, transcribed regions including introns are indicated to be subject to various types of structural constraints that cannot be explained from simple sequence composition biases. ParasoR is freely available at https://github.com/carushi/ParasoR .

  10. Complete coding regions of the prototypes enterovirus B93 and C95: phylogenetic analyses of the P1 and P3 regions of EV-B and EV-C strains.

    PubMed

    Junttila, N; Lévêque, N; Magnius, L O; Kabue, J P; Muyembe-Tamfum, J J; Maslin, J; Lina, B; Norder, H

    2015-03-01

    Complete coding regions were sequenced for two new enterovirus genomes: EV-B93 previously identified by VP1 sequencing, derived from a child with acute flaccid paralysis in the Democratic Republic of Congo; and EV-C95 from a French soldier with acute gastroenteritis in Djibouti. The EV-B93 P1 had more than 30% nucleotide divergence from other EV-B types, with highest similarity to E-15 and EV-B80. The P1 nucleotide sequence of EV-C95 was most similar, 71%, to CV-A21. Complete coding regions for the new enteroviruses were compared with those of 135 EV-B and 176 EV-C strains representing all types available in GenBank. When strains from the same outbreak or strains isolated during the same year in the same geographical region were excluded, 27 of the 58 EV-B, and 16 of the 23 EV-C types were represented by more than one sequence. However, for EV-B the P3 sequences formed three clades mainly according to origin or time of isolation, irrespective of type, while for EV-C the P3 sequences segregated mainly according to disease manifestation, with most strains causing paralysis, including polioviruses, forming one clade, and strains causing respiratory illness forming another. There was no intermixing of types between these two clades, apart from two EV-C96 strains. The EV-B P3 sequences had lower inter-clade and higher intra-clade variability as compared to the EV-C sequences, which may explain why inter-clade recombinations are more frequent in EV-B. Further analysis of more isolates may shed light on the role of recombinations in the evolution of EV-B in geographical context. © 2014 Wiley Periodicals, Inc.

  11. Genetic diversity of Pinus nigra Arn. populations in Southern Spain and Northern Morocco revealed by inter-simple sequence repeat profiles.

    PubMed

    Rubio-Moraga, Angela; Candel-Perez, David; Lucas-Borja, Manuel E; Tiscar, Pedro A; Viñegla, Benjamin; Linares, Juan C; Gómez-Gómez, Lourdes; Ahrazem, Oussama

    2012-01-01

    Eight Pinus nigra Arn. populations from Southern Spain and Northern Morocco were examined using inter-simple sequence repeat markers to characterize the genetic variability amongst populations. Pair-wise population genetic distance ranged from 0.031 to 0.283, with a mean of 0.150 between populations. The highest inter-population average distance was between PaCU from Cuenca and YeCA from Cazorla, while the lowest distance was between TaMO from Morocco and MA Sierra Mágina populations. Analysis of molecular variance (AMOVA) and Nei's genetic diversity analyses revealed higher genetic variation within the same population than among different populations. Genetic differentiation (Gst) was 0.233. Cuenca showed the highest Nei's genetic diversity followed by the Moroccan region, Sierra Mágina, and Cazorla region. However, clustering of populations was not in accordance with their geographical locations. Principal component analysis showed the presence of two major groups (Group 1 contained all populations from Cuenca while Group 2 contained populations from Cazorla, Sierra Mágina and Morocco), while Bayesian analysis revealed the presence of three clusters. The low genetic diversity observed in PaCU and YeCA is probably a consequence of inappropriate management since no estimation of genetic variability was performed before the silvicultural treatments. Data indicates that the inter-simple sequence repeat (ISSR) method is sufficiently informative and powerful to assess genetic variability among populations of P. nigra.

  12. Genetic Diversity of Pinus nigra Arn. Populations in Southern Spain and Northern Morocco Revealed By Inter-Simple Sequence Repeat Profiles †

    PubMed Central

    Rubio-Moraga, Angela; Candel-Perez, David; Lucas-Borja, Manuel E.; Tiscar, Pedro A.; Viñegla, Benjamin; Linares, Juan C.; Gómez-Gómez, Lourdes; Ahrazem, Oussama

    2012-01-01

    Eight Pinus nigra Arn. populations from Southern Spain and Northern Morocco were examined using inter-simple sequence repeat markers to characterize the genetic variability amongst populations. Pair-wise population genetic distance ranged from 0.031 to 0.283, with a mean of 0.150 between populations. The highest inter-population average distance was between PaCU from Cuenca and YeCA from Cazorla, while the lowest distance was between TaMO from Morocco and MA Sierra Mágina populations. Analysis of molecular variance (AMOVA) and Nei’s genetic diversity analyses revealed higher genetic variation within the same population than among different populations. Genetic differentiation (Gst) was 0.233. Cuenca showed the highest Nei’s genetic diversity followed by the Moroccan region, Sierra Mágina, and Cazorla region. However, clustering of populations was not in accordance with their geographical locations. Principal component analysis showed the presence of two major groups—Group 1 contained all populations from Cuenca while Group 2 contained populations from Cazorla, Sierra Mágina and Morocco—while Bayesian analysis revealed the presence of three clusters. The low genetic diversity observed in PaCU and YeCA is probably a consequence of inappropriate management since no estimation of genetic variability was performed before the silvicultural treatments. Data indicates that the inter-simple sequence repeat (ISSR) method is sufficiently informative and powerful to assess genetic variability among populations of P. nigra. PMID:22754321

  13. Detection of viral sequences in archival spinal cords from fatal cases of poliomyelitis in 1951-1952.

    PubMed

    Rekand, Tiina; Male, Rune; Myking, Andreas O; Nygaard, Svein J T; Aarli, Johan A; Haarr, Lars; Langeland, Nina

    2003-12-01

    Poliovirus (PV) subjected to genetic characterization is often isolated from faecal carriage. Such virus is not necessarily identical to the virus causing paralytic disease since genetic modifications may occur during replication outside the nervous system. We have searched for poliovirus genomes in the 14 fatal cases occurring during the last epidemics in Norway in 1951-1952. A method was developed for isolation and analysis of poliovirus RNA from formalin-fixed and paraffin-embedded archival tissue. RNA was purified by incubation with Chelex-100 and heating, followed by treatment with proteinase K and chloroform extraction. Viral sequences were amplified by a reverse transcriptase-polymerase chain reaction (RT-PCR), and the products were subjected to TA cloning and sequenced. RNA from the beta-actin gene, as a control, was identified in 13 cases, while sequences specific for poliovirus were obtained in 11 cases. The sequences from the 2C region of poliovirus were rather conserved while those in the 5'-untranslated region were variable. The developed method should also be suitable for other genetic studies of old archival material.

  14. Next Generation Semiconductor Based Sequencing of the Donkey (Equus asinus) Genome Provided Comparative Sequence Data against the Horse Genome and a Few Millions of Single Nucleotide Polymorphisms

    PubMed Central

    Bertolini, Francesca; Scimone, Concetta; Geraci, Claudia; Schiavo, Giuseppina; Utzeri, Valerio Joe; Chiofalo, Vincenzo; Fontanesi, Luca

    2015-01-01

    Few studies have investigated the donkey (Equus asinus) at the whole genome level so far. Here, we sequenced the genome of two male donkeys using a next generation semiconductor based sequencing platform (the Ion Proton sequencer) and compared the obtained sequence information with the available donkey draft genome (and the Illumina reads from which it originated) and with the EquCab2.0 assembly of the horse genome. Moreover, the Ion Torrent Personal Genome Analyzer was used to sequence reduced representation libraries (RRL) obtained from a DNA pool including donkeys of different breeds (Grigio Siciliano, Ragusano and Martina Franca). The number of next generation sequencing reads aligned with the EquCab2.0 horse genome was larger than the number aligned with the draft donkey genome. This was due to the larger N50 for contigs and scaffolds of the horse genome. Nucleotide divergence between E. caballus and E. asinus was estimated to be ~0.52-0.57%. Regions with low nucleotide divergence were identified in several autosomal chromosomes and in the whole chromosome X. These regions might be evolutionarily important in equids. Comparing Y-chromosome regions, we identified variants that could be useful to track donkey paternal lineages. Moreover, about 4.8 million single nucleotide polymorphisms (SNPs) in the donkey genome were identified and annotated by combining sequencing data from Ion Proton (whole genome sequencing) and Ion Torrent (RRL) runs with Illumina reads. A higher density of SNPs was present in regions homologous to horse chromosome 12, in which several studies reported a high frequency of copy number variants. The SNPs we identified constitute a first resource useful to describe variability at the population genomic level in E. asinus and to establish monitoring systems for the conservation of donkey genetic resources. PMID:26151450

  15. Next Generation Semiconductor Based Sequencing of the Donkey (Equus asinus) Genome Provided Comparative Sequence Data against the Horse Genome and a Few Millions of Single Nucleotide Polymorphisms.

    PubMed

    Bertolini, Francesca; Scimone, Concetta; Geraci, Claudia; Schiavo, Giuseppina; Utzeri, Valerio Joe; Chiofalo, Vincenzo; Fontanesi, Luca

    2015-01-01

    Few studies have investigated the donkey (Equus asinus) at the whole genome level so far. Here, we sequenced the genome of two male donkeys using a next generation semiconductor based sequencing platform (the Ion Proton sequencer) and compared the obtained sequence information with the available donkey draft genome (and the Illumina reads from which it originated) and with the EquCab2.0 assembly of the horse genome. Moreover, the Ion Torrent Personal Genome Analyzer was used to sequence reduced representation libraries (RRL) obtained from a DNA pool including donkeys of different breeds (Grigio Siciliano, Ragusano and Martina Franca). The number of next generation sequencing reads aligned with the EquCab2.0 horse genome was larger than the number aligned with the draft donkey genome. This was due to the larger N50 for contigs and scaffolds of the horse genome. Nucleotide divergence between E. caballus and E. asinus was estimated to be ~0.52-0.57%. Regions with low nucleotide divergence were identified in several autosomal chromosomes and in the whole chromosome X. These regions might be evolutionarily important in equids. Comparing Y-chromosome regions, we identified variants that could be useful to track donkey paternal lineages. Moreover, about 4.8 million single nucleotide polymorphisms (SNPs) in the donkey genome were identified and annotated by combining sequencing data from Ion Proton (whole genome sequencing) and Ion Torrent (RRL) runs with Illumina reads. A higher density of SNPs was present in regions homologous to horse chromosome 12, in which several studies reported a high frequency of copy number variants. The SNPs we identified constitute a first resource useful to describe variability at the population genomic level in E. asinus and to establish monitoring systems for the conservation of donkey genetic resources.

  16. Worldwide genetic variability of the Duffy binding protein: insights into Plasmodium vivax vaccine development.

    PubMed

    Nóbrega de Sousa, Taís; Carvalho, Luzia Helena; Alves de Brito, Cristiana Ferreira

    2011-01-01

    The dependence of Plasmodium vivax on invasion mediated by Duffy binding protein (DBP) makes this protein a prime candidate for development of a vaccine. However, the development of a DBP-based vaccine might be hampered by the high variability of the protein ligand (DBP(II)), known to bias the immune response toward a specific DBP variant. Here, the hypothesis being investigated is that the analysis of the worldwide DBP(II) sequences will allow us to determine the minimum number of haplotypes (MNH) to be included in a DBP-based vaccine of broad coverage. For that, all available DBP(II) sequences were compiled, and the MNH was based on the most frequent nonsynonymous single nucleotide polymorphisms, the majority of which mapped to B and T cell epitopes. A preliminary analysis of DBP(II) genetic diversity from eight malaria-endemic countries estimated that between two and six DBP haplotypes (17 in total) would target at least 50% of the parasite population circulating in each endemic region. Aiming to avoid region-specific haplotypes, we next analyzed the MNH that broadly covers the worldwide parasite population. The results demonstrated that seven haplotypes would be required to cover around 60% of the DBP(II) sequences available. Trying to validate these selected haplotypes per country, we found that five out of the eight countries will be covered by the MNH (67% of parasite populations, range 48-84%). In addition, to identify related subgroups of DBP(II) sequences we used a Bayesian clustering algorithm. The algorithm grouped all DBP(II) sequences into six populations that were independent of geographic origin, with ancestral populations present in different proportions in each country. In conclusion, in this first attempt to undertake a global analysis of DBP(II) variability, the results suggest that the development of a DBP-based vaccine should consider multi-haplotype strategies; otherwise a putative P. vivax vaccine may not target some parasite populations.

  17. Deep sequencing of hepatitis C virus hypervariable region 1 reveals no correlation between genetic heterogeneity and antiviral treatment outcome

    PubMed Central

    2014-01-01

    Background: Hypervariable region 1 (HVR1), contained within the envelope protein 2 (E2) gene, is the most variable part of the HCV genome, and its translation product is a major target for the host immune response. Variability within HVR1 may facilitate evasion of the immune response and could affect treatment outcome. The aim of the study was to analyze the impact of HVR1 heterogeneity, assessed by sensitive ultra-deep sequencing, on the outcome of PEG-IFN-α (pegylated interferon α) and ribavirin treatment. Methods: HVR1 sequences were amplified from pretreatment serum samples of 25 patients infected with genotype 1b HCV (12 responders and 13 non-responders) and were subjected to pyrosequencing (GS Junior, 454/Roche). Reads were corrected for sequencing error using ShoRAH software, while population reconstruction was done using three different minimal variant frequency cut-offs of 1%, 2% and 5%. Statistical analysis was done using Mann–Whitney and Fisher’s exact tests. Results: Complexity, Shannon entropy, nucleotide diversity per site, genetic distance and the number of genetic substitutions were not significantly different between responders and non-responders when analyzing viral populations at any of the three frequencies (≥1%, ≥2% and ≥5%). When a clonal sample was used to determine pyrosequencing error, 4% of reads were found to be incorrect and the most abundant erroneous variant was present at a frequency of 1.48%. Use of ShoRAH reduced the sequencing error to 1%, with the most abundant erroneous variant present at a frequency of 0.5%. Conclusions: While deep sequencing revealed complex genetic heterogeneity of HVR1 in chronic hepatitis C patients, there was no correlation between treatment outcome and any of the analyzed quasispecies parameters. PMID:25016390
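
    One of the quasispecies parameters named above, Shannon entropy, can be computed from variant frequencies after applying a minimal frequency cut-off. The sketch below is illustrative only: the function name, the read counts, and the choice to renormalise frequencies after filtering are assumptions, and the abstract does not say which entropy normalisation the authors used.

```python
import math


def shannon_entropy(variant_counts, min_freq=0.01):
    """Shannon entropy (natural log) of variant frequencies.

    Variants whose frequency is below min_freq are dropped, and the
    remaining frequencies are renormalised before the entropy is computed.
    """
    total = sum(variant_counts.values())
    kept = {v: c for v, c in variant_counts.items() if c / total >= min_freq}
    kept_total = sum(kept.values())
    if kept_total == 0:
        return 0.0
    freqs = [c / kept_total for c in kept.values()]
    return -sum(f * math.log(f) for f in freqs)


# Hypothetical read counts per HVR1 variant (not data from the study)
counts = {"variant_A": 900, "variant_B": 60, "variant_C": 30, "variant_D": 10}
for cutoff in (0.01, 0.02, 0.05):
    print(cutoff, round(shannon_entropy(counts, cutoff), 4))
```

    Raising the cut-off removes low-frequency variants, which may be sequencing errors, and so lowers the apparent entropy of the population.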

  18. Molecular basis of length polymorphism in the human zeta-globin gene complex.

    PubMed Central

    Goodbourn, S E; Higgs, D R; Clegg, J B; Weatherall, D J

    1983-01-01

    The length polymorphism between the human zeta-globin gene and its pseudogene is caused by an allele-specific variation in the copy number of a tandemly repeating 36-base-pair sequence. This sequence is related to a tandemly repeated 14-base-pair sequence in the 5' flanking region of the human insulin gene, which is known to cause length polymorphism, and to a repetitive sequence in intervening sequence (IVS) 1 of the pseudo-zeta-globin gene. Evidence is presented that the latter is also of variable length, probably because of differences in the copy number of the tandem repeat. The homology between the three length polymorphisms may be an indication of the presence of a more widespread group of related sequences in the human genome, which might be useful for generalized linkage studies. PMID:6308667

  19. Structure of Infaunal Communities on the Beaufort Sea Shelf and Slope: Insights from Morphological and Environmental DNA Sequencing Approaches

    NASA Astrophysics Data System (ADS)

    Hardy, S. M.; Bik, H.; Walker, A.; Sharma, J.; Blanchard, A.

    2016-02-01

    Rapid change is occurring in the Arctic concurrently with increased human activity, yet our knowledge of the structure and function of high-Arctic sediment communities is still rudimentary. The Beaufort Sea is particularly poorly sampled, and largely unexplored at slope depths, providing little information with which to assess the impacts of petroleum exploration activities now beginning in this area. We are investigating diversity and community structure of meio- and macrobenthic infauna on the continental shelf and slope of the Beaufort Sea across a range of depths (50 to 1000 m) using traditional taxonomic and environmental DNA sequencing approaches, and comparing results to additional sites in the adjacent NE Chukchi Sea petroleum lease-sale area. The Beaufort slope is topographically complex and characterized by an east-west gradient in benthic habitat characteristics, with heavy input of terrestrial organic matter particularly in the region of the Mackenzie River delta. Warmer, saltier subsurface Atlantic water masses impact benthic communities at mid-slope depths, likely influencing turnover in community structure observed with depth. Food resources are variable across the region, with very high sediment chlorophyll concentrations at 350 m depth in some areas. Differences in nematode assemblages were detected across the Beaufort Sea shelf/slope, across depths within the Beaufort Sea, and between the Beaufort and adjacent NE Chukchi Sea. These differences were apparent in both morphological and environmental sequencing data. Macrofaunal communities showed variable community structure among transects, with high abundance and high dominance in polychaete assemblages coincident with the chlorophyll maximum. Sequencing data also revealed an abundance of protists in sediments which have been mostly ignored in studies of ecosystem dynamics in this region, and may represent an important component of the food web.

  20. Intraspecific variation between the ITS sequences of Toxocara canis, Toxocara cati and Toxascaris leonina from different host species in south-western Poland.

    PubMed

    Fogt-Wyrwas, R; Mizgajska-Wiktor, H; Pacoń, J; Jarosz, W

    2013-12-01

    Some parasitic nematodes can inhabit different definitive hosts, which raises the question of the intraspecific variability of the nematode genotype affecting their preferences to choose particular species as hosts. Additionally, the issue of a possible intraspecific DNA microheterogeneity in specimens from different parts of the world seems to be interesting, especially from the evolutionary point of view. The problem was analysed in three related species - Toxocara canis, Toxocara cati and Toxascaris leonina - specimens originating from Central Europe (Poland). Using specific primers for species identification, internal transcribed spacer (ITS)-1 and ITS-2 regions were amplified and then sequenced. The sequences obtained were compared with sequences previously described for specimens originating from other geographical locations. No differences in nucleotide sequences were established in T. canis isolated from two different hosts (dogs and foxes). A comparison of ITS sequences of T. canis from Poland with sequences deposited in GenBank showed that the scope of intraspecific variability of the species did not exceed 0.4%, while in T. cati the differences did not exceed 2%. Significant differences were found in T. leonina, where ITS-1 differed by 3% and ITS-2 by as much as 7.4% in specimens collected from foxes in Poland and dogs in Australia. Such scope of differences in the nucleotide sequence seems to exceed the intraspecific variation of the species.
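
    Percentage differences of the kind reported above are typically obtained by comparing aligned sequences site by site. A minimal sketch, assuming the two sequences are already aligned and that gapped positions are skipped; the function name and toy sequences are hypothetical, and the abstract does not state how the authors handled gaps.

```python
def percent_difference(seq_a, seq_b):
    """Percentage of differing sites between two aligned sequences.

    Positions where either sequence has a gap ('-') are skipped.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    differing = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a == "-" or b == "-":
            continue
        compared += 1
        if a != b:
            differing += 1
    if compared == 0:
        raise ValueError("no comparable sites")
    return 100.0 * differing / compared


# Toy aligned fragments (hypothetical, not real ITS sequences)
print(percent_difference("ACGTACGTAC", "ACGTACGTAT"))
```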

  1. Variability and genetic structure of the population of watermelon mosaic virus infecting melon in Spain.

    PubMed

    Moreno, I M; Malpica, J M; Díaz-Pendón, J A; Moriones, E; Fraile, A; García-Arenal, F

    2004-01-05

    The genetic structure of the population of Watermelon mosaic virus (WMV) in Spain was analysed by the biological and molecular characterisation of isolates sampled from its main host plant, melon. The population was a highly homogeneous one, built of a single pathotype, and comprising isolates closely related genetically. There was indication of temporal replacement of genotypes, but not of spatial structure of the population. Analyses of nucleotide sequences in three genomic regions, that is, in the cistrons for the P1, cylindrical inclusion (CI) and capsid (CP) proteins, showed lower similar values of nucleotide diversity for the P1 than for the CI or CP cistrons. The CI protein and the CP were under tighter evolutionary constraints than the P1 protein. Also, for the CI and CP cistrons, but not for the P1 cistron, two groups of sequences, defining two genetic strains, were apparent. Thus, different genomic regions of WMV show different evolutionary dynamics. Interestingly, for the CI and CP cistrons, sequences were clustered into two regions of the sequence space, defining the two strains above, and no intermediary sequences were identified. Recombinant isolates were found, accounting for at least 7% of the population. These recombinants presented two interesting features: (i) crossover points were detected between the analysed regions in the CI and CP cistrons, but not between those in the P1 and CI cistrons, (ii) crossover points were not observed within the analysed coding regions for the P1, CI or CP proteins. This indicates strong selection against isolates with recombinant proteins, even when originated from closely related strains. Hence, data indicate that genotypes of WMV, generated by mutation or recombination, outside of acceptable, discrete, regions in the evolutionary space, are eliminated from the virus population by negative selection.

  2. The Relation between Recombination Rate and Patterns of Molecular Evolution and Variation in Drosophila melanogaster

    PubMed Central

    Campos, José L.; Halligan, Daniel L.; Haddrill, Penelope R.; Charlesworth, Brian

    2014-01-01

    Genetic recombination associated with sexual reproduction increases the efficiency of natural selection by reducing the strength of Hill–Robertson interference. Such interference can be caused either by selective sweeps of positively selected alleles or by background selection (BGS) against deleterious mutations. Its consequences can be studied by comparing patterns of molecular evolution and variation in genomic regions with different rates of crossing over. We carried out a comprehensive study of the benefits of recombination in Drosophila melanogaster, both by contrasting five independent genomic regions that lack crossing over with the rest of the genome and by comparing regions with different rates of crossing over, using data on DNA sequence polymorphisms from an African population that is geographically close to the putatively ancestral population for the species, and on sequence divergence from a related species. We observed reductions in sequence diversity in noncrossover (NC) regions that are inconsistent with the effects of hard selective sweeps in the absence of recombination. Overall, the observed patterns suggest that the recombination rate experienced by a gene is positively related to an increase in the efficiency of both positive and purifying selection. The results are consistent with a BGS model with interference among selected sites in NC regions, and joint effects of BGS, selective sweeps, and a past population expansion on variability in regions of the genome that experience crossing over. In such crossover regions, the X chromosome exhibits a higher rate of adaptive protein sequence evolution than the autosomes, implying a Faster-X effect. PMID:24489114

  3. Bacterial CRISPR Regions: General Features and their Potential for Epidemiological Molecular Typing Studies

    PubMed Central

    Karimi, Zahra; Ahmadi, Ali; Najafi, Ali; Ranjbar, Reza

    2018-01-01

    Introduction: CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) loci, as novel and applicable regions in prokaryotic genomes, have gained great attention in the post-genomics era. Methods: These unique regions are diverse in number and sequence composition in different pathogenic bacteria and thereby can be a suitable candidate for molecular epidemiology and genotyping studies. Results: Furthermore, the arrayed structure of CRISPR loci (several repeats interspaced with variable sequences) and associated cas genes act as an active prokaryotic immune system against viral replication and conjugative elements. This property can be used as a tool for RNA editing in bioengineering studies. Conclusion: The aim of this review was to survey some details about the history, nature, and potential applications of CRISPR arrays in both genetic engineering and bacterial genotyping studies. PMID:29755603

  4. Cloning, expression and phylogenetic analysis of Hemolin, from the Chinese oak silkmoth, Antheraea pernyi.

    PubMed

    Li, Wenli; Terenius, Olle; Hirai, Makoto; Nilsson, Anders S; Faye, Ingrid

    2005-01-01

    The Chinese oak silk moth Antheraea pernyi is an important silk producer. To understand microbial resistance of this moth, we cloned Hemolin, encoding a multifunctional immune protein belonging to the immunoglobulin superfamily, and examined the expression in gonads and fat body. The ApHemolin amino acid sequence was compared to other Hemolin sequences in order to predict functional sites. Several sites were conserved; among them a phosphate binding site, which according to 3D structure modelling does not appear in neuroglian, the phylogenetically closest related protein. In addition, two conserved KDG sequences in the C-C' loop of immunoglobulin domains 1 and 3, give rise to gamma-turns, which is a common motif in the C'-C'' loop of the hypervariable region L2 in vertebrate immunoglobulins. The comparisons also show variable regions of specific interest for future studies of hemolin and its interaction with microbial entities.

  5. Plant centromere organization: a dynamic structure with conserved functions.

    PubMed

    Ma, Jianxin; Wing, Rod A; Bennetzen, Jeffrey L; Jackson, Scott A

    2007-03-01

    Although the structural features of centromeres from most multicellular eukaryotes remain to be characterized, recent analyses of the complete sequences of two centromeric regions of rice, together with data from Arabidopsis thaliana and maize, have illuminated the considerable size variation and sequence divergence of plant centromeres. Despite the severe suppression of meiotic chromosomal exchange in centromeric and pericentromeric regions of rice, the centromere core shows high rates of unequal homologous recombination in the absence of chromosomal exchange, resulting in frequent and extensive DNA rearrangement. Not only is the sequence of centromeric tandem and non-tandem repeats highly variable but also the copy number, spacing, order and orientation, providing ample natural variation as the basis for selection of superior centromere performance. This review article focuses on the structural and evolutionary dynamics of plant centromere organization and the potential molecular mechanisms responsible for the rapid changes of centromeric components.

  6. Mechanisms of haplotype divergence at the RGA08 nucleotide-binding leucine-rich repeat gene locus in wild banana (Musa balbisiana)

    PubMed Central

    2010-01-01

    Background: Comparative sequence analysis of complex loci such as resistance gene analog clusters allows estimating the degree of sequence conservation and mechanisms of divergence at the intraspecies level. In banana (Musa sp.), two diploid wild species Musa acuminata (A genome) and Musa balbisiana (B genome) contribute to the polyploid genome of many cultivars. The M. balbisiana species is associated with vigour and tolerance to pests and disease and little is known on the genome structure and haplotype diversity within this species. Here, we compare two genomic sequences of 253 and 223 kb corresponding to two haplotypes of the RGA08 resistance gene analog locus in M. balbisiana "Pisang Klutuk Wulung" (PKW). Results: Sequence comparison revealed two regions of contrasting features. The first is a highly colinear gene-rich region where the two haplotypes diverge only by single nucleotide polymorphisms and two repetitive element insertions. The second corresponds to a large cluster of RGA08 genes, with 13 and 18 predicted RGA genes and pseudogenes spread over 131 and 152 kb respectively on each haplotype. The RGA08 cluster is enriched in repetitive element insertions, in duplicated non-coding intergenic sequences including low complexity regions and shows structural variations between haplotypes. Although some allelic relationships are retained, a large diversity of RGA08 genes occurs in this single M. balbisiana genotype, with several RGA08 paralogs specific to each haplotype. The RGA08 gene family has evolved by mechanisms of unequal recombination, intragenic sequence exchange and diversifying selection. An unequal recombination event taking place between duplicated non-coding intergenic sequences resulted in a different RGA08 gene content between haplotypes pointing out the role of such duplicated regions in the evolution of RGA clusters. Based on the synonymous substitution rate in coding sequences, we estimated a 1 million year divergence time for these M. balbisiana haplotypes. Conclusions: A large RGA08 gene cluster identified in wild banana corresponds to a highly variable genomic region between haplotypes surrounded by conserved flanking regions. High level of sequence identity (70 to 99%) of the genic and intergenic regions suggests a recent and rapid evolution of this cluster in M. balbisiana. PMID:20637079
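
    The divergence-time estimate mentioned above is the kind of figure usually obtained from the molecular clock relation T = Ks / (2r), where Ks is the number of synonymous substitutions per synonymous site between the two sequences and r is the substitution rate per site per year. The abstract does not give the formula or the inputs, so the sketch below is an assumption: both numeric values are hypothetical and were chosen only so that the result falls near one million years.

```python
def divergence_time_years(ks, rate_per_site_per_year):
    """Time since two lineages diverged, T = Ks / (2 * r).

    The factor 2 accounts for substitutions accumulating
    independently along both lineages.
    """
    if rate_per_site_per_year <= 0:
        raise ValueError("rate must be positive")
    return ks / (2.0 * rate_per_site_per_year)


# Hypothetical inputs (not values reported in the study)
ks_example = 0.009      # synonymous substitutions per synonymous site
rate_example = 4.5e-9   # substitutions per site per year
print(f"{divergence_time_years(ks_example, rate_example):.2e} years")
```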

  7. Global Sea-level Changes Revealed in the Sediments of the Canterbury Basin, New Zealand: IODP Expedition 317

    NASA Astrophysics Data System (ADS)

    McHugh, C. M.; Fulthorpe, C.; Blum, P.; Rios, J.; Chow, Y.; Mishkin, K.

    2012-12-01

    Continental margins are composed of thick sedimentary sections that preserve the record of local processes modulated by global sea-level (eustatic) changes and climate. Understanding this regional variability permits us to extract the eustatic record. Integrated Ocean Drilling Program Expedition 317 drilled four sites in the offshore Canterbury Basin, eastern South Island of New Zealand, in water depths of 85 m to 320 m. One of the objectives of the expedition was to understand the influence of eustasy on continental margin sedimentation and to test the concepts of sequence stratigraphy. A high-resolution multiproxy approach that involves geochemical elemental analyses, lithostratigraphy and biostratigraphy is applied to understand the margin's sedimentation for the past ~5 million years. Multichannel seismic data (EW00-01 survey) provide a seismic sequence stratigraphic framework against which to interpret the multiproxy data. The mid- to late Pleistocene sedimentation is characterized by variable lithologies and changing facies. However, elemental compositions and facies follow predictable patterns within seismic sequences. Oxygen isotope measurements for the latest Pleistocene indicate that 100 ky Milankovitch astronomical forcing controlled this variability. In contrast, Pliocene and early Pleistocene sediments are composed of repetitive siliciclastic and carbonate mud lithologies with less facies variability. Results of our analyses suggest that repetitive alternations of green and gray mud were deposited during warmer and cooler periods, respectively. Oxygen isotopes suggest that this cyclicity may reflect 40 ky Milankovitch forcing. Ocean Drilling Program Legs 150 and 174A drilled on the New Jersey continental margin with similar objectives to those of Expedition 317. Results from this northern and southern hemisphere drilling reveal that eustasy, controlled by Milankovitch forcing, strongly influences margin sedimentation and the formation of basin-wide unconformities. However, the correlation between eustasy and seismic sequence formation is not always one-to-one. High sedimentation rates in the Pleistocene offshore Canterbury Basin record a one-to-one correlation between glacioeustasy and seismic sequences, and in some sequences possibly a higher order frequency. But this is not the case for offshore New Jersey, where accumulation rates were lower and only the uppermost seismic sequences represent 100 ky cycles. Furthermore, Pliocene sedimentation in the Canterbury Basin was also controlled by eustasy, but does not show a one-to-one correlation between Milankovitch cycles and seismic stratigraphy. Northern and southern hemisphere comparisons provide a powerful tool to better understand controls on regional sedimentation and extract a global signal.

  8. Surface display of a massively variable lipoprotein by a Legionella diversity-generating retroelement.

    PubMed

    Arambula, Diego; Wong, Wenge; Medhekar, Bob A; Guo, Huatao; Gingery, Mari; Czornyj, Elizabeth; Liu, Minghsun; Dey, Sanghamitra; Ghosh, Partho; Miller, Jeff F

    2013-05-14

    Diversity-generating retroelements (DGRs) are a unique family of retroelements that confer selective advantages to their hosts by facilitating localized DNA sequence evolution through a specialized error-prone reverse transcription process. We characterized a DGR in Legionella pneumophila, an opportunistic human pathogen that causes Legionnaires' disease. The L. pneumophila DGR is found within a horizontally acquired genomic island, and it can theoretically generate 10^26 unique nucleotide sequences in its target gene, legionella determinent target A (ldtA), creating a repertoire of 10^19 distinct proteins. Expression of the L. pneumophila DGR resulted in transfer of DNA sequence information from a template repeat to a variable repeat (VR) accompanied by adenine-specific mutagenesis of progeny VRs at the 3' end of ldtA. ldtA encodes a twin-arginine translocated lipoprotein that is anchored in the outer leaflet of the outer membrane, with its C-terminal variable region surface exposed. Related DGRs were identified in L. pneumophila clinical isolates that encode unique target proteins with homologous VRs, demonstrating the adaptability of DGR components. This work characterizes a DGR that diversifies a bacterial protein and confirms the hypothesis that DGR-mediated mutagenic homing occurs through a conserved mechanism. Comparative bioinformatics predicts that surface display of massively variable proteins is a defining feature of a subset of bacterial DGRs.
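
    The theoretical repertoire size quoted above follows from simple combinatorics if one assumes (an assumption made here, not a statement from the abstract) that adenine-specific mutagenesis lets each adenine-templated position in the variable repeat take any of the four nucleotides independently. The sketch below back-calculates how many such positions would give a repertoire of about 10 to the power 26; the function names are hypothetical.

```python
import math


def repertoire_size(n_variable_positions, alphabet_size=4):
    """Number of distinct sequences if each position varies independently."""
    return alphabet_size ** n_variable_positions


def positions_needed(target_size, alphabet_size=4):
    """Number of independent positions giving the target repertoire size."""
    return math.log(target_size) / math.log(alphabet_size)


# Roughly 43 freely variable positions would be needed under this assumption
print(round(positions_needed(1e26), 1))
print(f"{repertoire_size(43):.1e}")
```

    The protein repertoire is smaller than the nucleotide repertoire because several codons encode the same amino acid.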

  9. PanACEA: a bioinformatics tool for the exploration and visualization of bacterial pan-chromosomes.

    PubMed

    Clarke, Thomas H; Brinkac, Lauren M; Inman, Jason M; Sutton, Granger; Fouts, Derrick E

    2018-06-27

    Bacterial pan-genomes, comprised of conserved and variable genes across multiple sequenced bacterial genomes, allow for identification of genomic regions that are phylogenetically discriminating or functionally important. Pan-genomes consist of large amounts of data, which can restrict researchers' ability to locate and analyze these regions. Multiple software packages are available to visualize pan-genomes, but currently their ability to address these concerns is limited by using only pre-computed data sets, prioritizing core over variable gene clusters, or by not accounting for pan-chromosome positioning in the viewer. We introduce PanACEA (Pan-genome Atlas with Chromosome Explorer and Analyzer), which utilizes locally-computed interactive web-pages to view ordered pan-genome data. It consists of multi-tiered, hierarchical display pages that extend from pan-chromosomes to both core and variable regions to single genes. Regions and genes are functionally annotated to allow for rapid searching and visual identification of regions of interest, with the option that user-supplied genomic phylogenies and metadata can be incorporated. PanACEA's memory and time requirements are within the capacities of standard laptops. The capability of PanACEA as a research tool is demonstrated by highlighting a variable region important in differentiating strains of Enterobacter hormaechei. PanACEA can rapidly translate the results of pan-chromosome programs into an intuitive and interactive visual representation. It will empower researchers to visually explore and identify regions of the pan-chromosome that are most biologically interesting, and to obtain publication quality images of these regions.

  10. Sequence information gain based motif analysis.

    PubMed

    Maynou, Joan; Pairó, Erola; Marco, Santiago; Perera, Alexandre

    2015-11-09

    The detection of regulatory regions in candidate sequences is essential for the understanding of the regulation of a particular gene and the mechanisms involved. This paper proposes a novel methodology based on information theoretic metrics for finding regulatory sequences in promoter regions. This methodology (SIGMA) has been tested on genomic sequence data for Homo sapiens and Mus musculus. SIGMA has been compared with different publicly available alternatives for motif detection, such as MEME/MAST, Biostrings (Bioconductor package), MotifRegressor, and previous work such as Qresiduals projections or information theoretic based detectors. Comparative results, in the form of Receiver Operating Characteristic curves, show how, in 70% of the studied Transcription Factor Binding Sites, the SIGMA detector has a better performance and behaves more robustly than the methods compared, while having a similar computational time. The performance of SIGMA can be explained by its parametric simplicity in the modelling of the non-linear co-variability in the binding motif positions. Sequence Information Gain based Motif Analysis is a generalisation of a non-linear model of the cis-regulatory sequences detection based on Information Theory. This generalisation allows us to detect transcription factor binding sites with maximum performance disregarding the covariability observed in the positions of the training set of sequences. SIGMA is freely available to the public at http://b2slab.upc.edu.
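
    As background for the information-theoretic metrics mentioned in this abstract, the sketch below computes the classic per-position information content of a set of aligned binding sites. It is not the SIGMA algorithm: it assumes a uniform background over A, C, G and T and treats positions as independent, so it ignores the co-variability between positions that the abstract says SIGMA models. The function name and the example sites are hypothetical.

```python
import math
from collections import Counter


def column_information(sequences, alphabet="ACGT"):
    """Per-position information content (bits) of equal-length aligned sites.

    Assumes a uniform background; positions are treated independently.
    """
    length = len(sequences[0])
    if any(len(s) != length for s in sequences):
        raise ValueError("sequences must be aligned to equal length")
    max_bits = math.log2(len(alphabet))  # 2 bits for DNA
    info = []
    for i in range(length):
        counts = Counter(s[i] for s in sequences)
        total = sum(counts.values())
        # Shannon entropy of the observed base frequencies in this column
        entropy = -sum((c / total) * math.log2(c / total) for c in counts.values())
        info.append(max_bits - entropy)
    return info


# Hypothetical toy alignment of four binding sites
sites = ["TATAAT", "TATAAT", "TATGAT", "TACAAT"]
bits = column_information(sites)
for position, value in enumerate(bits):
    print(position, round(value, 3))
```

    Fully conserved columns reach the 2-bit maximum, while columns with mixed bases score lower. A detector that also captures dependencies between columns needs a richer model than this one.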

  11. Depositional facies, environments and sequence stratigraphic interpretation of the Middle Triassic-Lower Cretaceous (pre-Late Albian) succession in Arif El-Naga anticline, northeast Sinai, Egypt

    NASA Astrophysics Data System (ADS)

    El-Azabi, M. H.; El-Araby, A.

    2005-01-01

    The Middle Triassic-Lower Cretaceous (pre-Late Albian) succession of Arif El-Naga anticline comprises various distinctive facies and environments that are connected with eustatic relative sea-level changes, local/regional tectonism, variable sediment influx and base-level changes. It displays six unconformity-bounded depositional sequences. The Triassic deposits are divided into a lower clastic facies (early Middle Triassic sequence) and an upper carbonate unit (late Middle- and latest Middle/early Late Triassic sequences). The early Middle Triassic sequence consists of sandstone with shale/mudstone interbeds that formed under variable regimes, ranging from braided fluvial, lower shoreface to beach foreshore. The marine part of this sequence marks retrogradational and progradational parasequences of transgressive- and highstand systems tract deposits respectively. Deposition has taken place under warm semi-arid climate and a steady supply of clastics. The late Middle- and latest Middle/early Late Triassic sequences are carbonate facies developed on an extensive shallow marine shelf under dry-warm climate. The late Middle Triassic sequence includes retrogradational shallow subtidal oyster rudstone and progradational lower intertidal lime-mudstone parasequences that define the transgressive- and highstand systems tracts respectively. It terminates with upper intertidal oncolitic packstone with bored upper surface. The next latest Middle/early Late Triassic sequence is marked by lime-mudstone, packstone/grainstone and algal stromatolitic bindstone with minor shale/mudstone. These lower intertidal/shallow subtidal deposits of a transgressive-systems tract are followed upward by progradational highstand lower intertidal lime-mudstone deposits. The overlying Jurassic deposits encompass two different sequences. 
    The Lower Jurassic sequence is made up of intercalating lower intertidal lime-mudstone and wave-dominated beach foreshore sandstone which formed during a short period of rising sea-level with a relative increase in clastic supply. The Middle-Upper Jurassic sequence is represented by cycles of cross-bedded sandstone topped with thin mudstone that accumulated by northerly flowing braided-streams accompanying regional uplift of the Arabo-Nubian shield. It is succeeded by another regressive fluvial sequence of Early Cretaceous age due to a major eustatic sea-level fall. The Lower Cretaceous sequence is dominated by sandy braided-river deposits with minor overbank fines and basal debris flow conglomerate.

  12. DISTINCT ANTIBODY SPECIES: STRUCTURAL DIFFERENCES CREATING THERAPEUTIC OPPORTUNITIES

    PubMed Central

    Muyldermans, Serge; Smider, Vaughn V.

    2016-01-01

    Antibodies have been a remarkably successful class of molecules for binding a large number of antigens in therapeutic, diagnostic, and research applications. Typical antibodies derived from mouse or human sources use the surface formed by complementarity determining regions (CDRs) on the variable regions of the heavy chain/light chain heterodimer, which typically forms a relatively flat binding surface. Alternative species, particularly camelids and bovines, provide a unique paradigm for antigen recognition through novel domains which form the antigen binding paratope. For camelids, heavy chain antibodies bind antigen with only a single heavy chain variable region, in the absence of light chains. In bovines, ultralong CDR-H3 regions form an independently folding minidomain, which protrudes from the surface of the antibody and is diverse in both its sequence and disulfide patterns. The atypical paratopes of camelids and bovines potentially provide the ability to interact with different epitopes, particularly recessed or concave surfaces, compared to traditional antibodies. PMID:26922135

  13. HLA genotyping by next-generation sequencing of complementary DNA.

    PubMed

    Segawa, Hidenobu; Kukita, Yoji; Kato, Kikuya

    2017-11-28

    Genotyping of the human leucocyte antigen (HLA) is indispensable for various medical treatments. However, unambiguous genotyping is technically challenging due to high polymorphism of the corresponding genomic region. Next-generation sequencing is changing the landscape of genotyping. In addition to high throughput of data, its additional advantage is that DNA templates are derived from single molecules, which is a strong merit for the phasing problem. Although most currently developed technologies use genomic DNA, use of cDNA could enable genotyping with reduced costs in data production and analysis. We thus developed an HLA genotyping system based on next-generation sequencing of cDNA. Each HLA gene was divided into 3 or 4 target regions subjected to PCR amplification and subsequent sequencing with Ion Torrent PGM. The sequence data were then subjected to an automated analysis. The principle of the analysis was to construct candidate sequences generated from all possible combinations of variable bases and arrange them in decreasing order of the number of reads. Upon collecting candidate sequences from all target regions, 2 haplotypes were usually assigned. Cases not assigned 2 haplotypes were forwarded to 4 additional processes: selection of candidate sequences applying more stringent criteria, removal of artificial haplotypes, selection of candidate sequences with a relaxed threshold for sequence matching, and countermeasure for incomplete sequences in the HLA database. The genotyping system was evaluated using 30 samples; the overall accuracy was 97.0% at the field 3 level and 98.3% at the G group level. With one sample, genotyping of DPB1 was not completed due to short read size. We then developed a method for complete sequencing of individual molecules of the DPB1 gene, using the molecular barcode technology. The performance of the automatic genotyping system was comparable to that of systems developed in previous studies. 
    Thus, next-generation sequencing of cDNA is a viable option for HLA genotyping.
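
    The analysis principle stated in this abstract (build candidate sequences from all combinations of variable bases, then order them by read support) can be illustrated with the minimal sketch below. It is an assumption-laden toy, not the authors' pipeline: reads are taken to be pre-aligned and of equal length, a base counts as variable at a position when it appears in at least a hypothetical `min_fraction` of reads, and support is the number of reads matching a candidate exactly. All names and data are illustrative.

```python
from collections import Counter
from itertools import product


def rank_candidates(reads, min_fraction=0.2):
    """Enumerate candidate sequences and rank them by exact read support."""
    length = len(reads[0])
    if any(len(r) != length for r in reads):
        raise ValueError("reads must be aligned to equal length")
    per_site = []
    for i in range(length):
        counts = Counter(r[i] for r in reads)
        # Keep bases frequent enough to be treated as real variation, not noise
        kept = sorted(b for b, c in counts.items() if c / len(reads) >= min_fraction)
        per_site.append(kept)
    # Every combination of the retained bases is a candidate sequence
    candidates = ("".join(bases) for bases in product(*per_site))
    support = Counter(reads)
    # Decreasing read count; ties broken alphabetically for determinism
    return sorted(((c, support[c]) for c in candidates), key=lambda x: (-x[1], x[0]))


# Hypothetical reads from one target region: two true haplotypes plus one chimera
reads = ["ACGT"] * 5 + ["ATGA"] * 4 + ["ACGA"]
ranked = rank_candidates(reads)
for sequence, count in ranked:
    print(sequence, count)
```

    In this toy input the two best-supported candidates correspond to the two haplotypes, and the combinations with little or no support fall to the bottom of the list. The number of combinations grows exponentially with the number of variable positions, so a practical implementation would need to bound or prune the candidate set.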

  14. Non coding extremities of the seven influenza virus type C vRNA segments: effect on transcription and replication by the type C and type A polymerase complexes

    PubMed Central

    Crescenzo-Chaigne, Bernadette; Barbezange, Cyril; van der Werf, Sylvie

    2008-01-01

    Background The transcription/replication of the influenza viruses implicate the terminal nucleotide sequences of viral RNA, which comprise sequences at the extremities conserved among the genomic segments as well as variable 3' and 5' non-coding (NC) regions. The plasmid-based system for the in vivo reconstitution of functional ribonucleoproteins, upon expression of viral-like RNAs together with the nucleoprotein and polymerase proteins has been widely used to analyze transcription/replication of influenza viruses. It was thus shown that the type A polymerase could transcribe and replicate type A, B, or C vRNA templates whereas neither type B nor type C polymerases were able to transcribe and replicate type A templates efficiently. Here we studied the importance of the NC regions from the seven segments of type C influenza virus for efficient transcription/replication by the type A and C polymerases. Results The NC sequences of the seven genomic segments of the type C influenza virus C/Johannesburg/1/66 strain were found to be more variable in length than those of the type A and B viruses. The levels of transcription/replication of viral-like vRNAs harboring the NC sequences of the respective type C virus segments flanking the CAT reporter gene were comparable in the presence of either type C or type A polymerase complexes except for the NS and PB2-like vRNAs. For the NS-like vRNA, the transcription/replication level was higher after introduction of a U residue at position 6 in the 5' NC region as for all other segments. For the PB2-like vRNA the CAT expression level was particularly reduced with the type C polymerase. Analysis of mutants of the 5' NC sequence in the PB2-like vRNA, the shortest 5' NC sequence among the seven segments, showed that additional sequences within the PB2 ORF were essential for the efficiency of transcription but not replication by the type C polymerase complex. 
    Conclusion In the context of a PB2-like reporter vRNA template, the sequence upstream of the polyU stretch plays a role in the transcription/replication process by the type C polymerase complex. PMID:18973655

  15. Global sequence variation in the histidine-rich proteins 2 and 3 of Plasmodium falciparum: implications for the performance of malaria rapid diagnostic tests

    PubMed Central

    2010-01-01

    Background Accurate diagnosis is essential for prompt and appropriate treatment of malaria. While rapid diagnostic tests (RDTs) offer great potential to improve malaria diagnosis, the sensitivity of RDTs has been reported to be highly variable. One possible factor contributing to variable test performance is the diversity of parasite antigens. This is of particular concern for Plasmodium falciparum histidine-rich protein 2 (PfHRP2)-detecting RDTs since PfHRP2 has been reported to be highly variable in isolates of the Asia-Pacific region. Methods The pfhrp2 exon 2 fragment from 458 isolates of P. falciparum collected from 38 countries was amplified and sequenced. For a subset of 80 isolates, the exon 2 fragment of histidine-rich protein 3 (pfhrp3) was also amplified and sequenced. DNA sequence and statistical analysis of the variation observed in these genes was conducted. The potential impact of the pfhrp2 variation on RDT detection rates was examined by analysing the relationship between sequence characteristics of this gene and the results of the WHO product testing of malaria RDTs: Round 1 (2008), for 34 PfHRP2-detecting RDTs. Results Sequence analysis revealed extensive variations in the number and arrangement of various repeats encoded by the genes in parasite populations world-wide. However, no statistically robust correlation between gene structure and RDT detection rate for P. falciparum parasites at 200 parasites per microlitre was identified. Conclusions The results suggest that despite extreme sequence variation, diversity of PfHRP2 does not appear to be a major cause of RDT sensitivity variation. PMID:20470441

  16. Genetic Characterization of Circulating African Swine Fever Viruses in Nigeria (2007-2015).

    PubMed

    Luka, P D; Achenbach, J E; Mwiine, F N; Lamien, C E; Shamaki, D; Unger, H; Erume, J

    2017-10-01

    Sequencing and analysis of three discrete genome regions of African swine fever viruses (ASFV) from archival samples collected in 2007-2011 and active and passive surveillance between 2012 and 2015 in Nigeria were carried out. Analysis was conducted by genotyping of three single-copy African swine fever (ASF) genes. The E183L and B646L genes that encode structural proteins p54 and p72, respectively, were utilized to delineate genotypes before intragenotypic resolution by characterization of the tetrameric amino acid repeat region within the hypervariable central variable region of the B602L gene. The results showed no variation in the p72 and p54 gene regions sequenced. Phylogeny of p72 sequences revealed that all the Nigerian isolates belonged to genotype I, while that of the p54 recovered the Ia genotype. Analysis of B602L gene revealed the differences in the number of tetrameric repeats. Four new variants (Tet-15, Tet-17a, Tet-17b and Tet-48) were recovered, while a fifth variant (Tet-20) was the most widely distributed in the country displacing Tet-36 reported previously in 2003-2006. The viruses responsible for ASF outbreaks in Nigeria are from very closely related but mutated variants of the virus that have been circulating since 1997. A practical implication of the genetic variability of the Nigerian viral isolates in this study is the need for continuous sampling and analysis of circulating viruses, which will provide epidemiological information on the evolution of ASFV in the field versus new incursion for informed strategic control of the disease in the country. © 2016 Blackwell Verlag GmbH.

  17. Comprehensive characterization of immunoglobulin gene rearrangements in patients with chronic lymphocytic leukaemia

    PubMed Central

    René, Céline; Prat, Nathalie; Thuizat, Audrey; Broctawik, Mélanie; Avinens, Odile; Eliaou, Jean-François

    2014-01-01

    Previous studies have suggested a geographical pattern of immunoglobulin rearrangement in chronic lymphocytic leukaemia (CLL), which could be a result of genetic background or an environmental antigen. However, the characteristics of Ig rearrangements in the population from the South of France have not yet been established. Here, we studied CLL B-cell repertoire and mutational pattern in a Southern French cohort of patients using an in-house protocol for whole sequencing of the rearranged immunoglobulin heavy-chain genes. The previously described biased usage of variable, diversity and joining genes between the mutated and unmutated groups was found in our population. However, variable gene frequencies are more in accordance with those observed in the Mediterranean patients. We found that the third complementarity-determining region (CDR) length was higher in unmutated sequences, because of bias in the diversity and joining genes usage and not due to the N diversity. Mutations found in CLL followed the features of canonical somatic hypermutation mechanism: preference of targeting for activation-induced cytidine deaminase and polymerase motifs, base change bias for transitions and more replacement mutations occurring in CDRs than in framework regions. Surprisingly, localization of activation-induced cytidine deaminase motifs onto the variable gene showed a preference for framework regions. The study of the characteristics at the age of diagnosis showed no difference in clinical outcome, but suggested a tendency of increased replacement and transition-over-transversion mutations and a longer third CDR length in older patients. PMID:24725733

  18. Genotype differentiation of Agamid Adenovirus 1 in bearded dragons (Pogona vitticeps) in the USA by hexon gene sequence.

    PubMed

    Parkin, Derek B; Archer, Linda L; Childress, April L; Wellehan, James F X

    2009-07-01

    Bearded dragons (Pogona vitticeps) are popular pets in the United States. Agamid Adenovirus 1 (AgAdV1) is an important infectious agent of bearded dragons. The only AgAdV1 sequences available to date are from a highly conserved region of the DNA polymerase gene. Degenerate primers were designed to amplify a variable region of the AgAdV1 hexon gene for sequencing. Genetic differences were identified within the hexon gene of 17 bearded dragons from 4 collections. Much less diversity was present in the polymerase gene. Bayesian analysis of the hexon nucleotide alignment identified two larger groups and two isolates that did not tightly cluster with these two groups. Multiple genotypes were identified within collections, and individual genotypes were seen in different collections. Three bearded dragons appeared to be infected by multiple strains. These findings show that this hexon region is useful for AgAdV1 genotyping, which can be used epidemiologically as well as in future investigations of AgAdV1 evolution and clinical implications of strain differences.

  19. Reconstitution of wild type viral DNA in simian cells transfected with early and late SV40 defective genomes.

    PubMed

    O'Neill, F J; Gao, Y; Xu, X

    1993-11-01

    The DNAs of polyomaviruses ordinarily exist as a single circular molecule of approximately 5000 base pairs. Variants of SV40, BKV and JCV have been described which contain two complementing defective DNA molecules. These defectives, which form a bipartite genome structure, contain either the viral early region or the late region. The defectives have the unique property of being able to tolerate variable sized reiterations of regulatory and terminus region sequences, and portions of the coding region. They can also exchange coding region sequences with other polyomaviruses. It has been suggested that the bipartite genome structure might be a stage in the evolution of polyomaviruses which can uniquely sustain genome and sequence diversity. However, it is not known if the regulatory and terminus region sequences are highly mutable. Also, it is not known if the bipartite genome structure is reversible and what the conditions might be which would favor restoration of the monomolecular genome structure. We addressed the first question by sequencing the reiterated regulatory and terminus regions of E- and L-SV40 DNAs. This revealed a large number of mutations in the regulatory regions of the defective genomes, including deletions, insertions, rearrangements and base substitutions. We also detected insertions and base substitutions in the T-antigen gene. We addressed the second question by introducing into permissive simian cells, E- and L-SV40 genomes which had been engineered to contain only a single regulatory region. Analysis of viral DNA from transfected cells demonstrated recombined genomes containing a wild type monomolecular DNA structure. However, the complete defectives, containing reiterated regulatory regions, could often compete away the wild type genomes. The recombinant monomolecular genomes were isolated, cloned and found to be infectious. 
    All of the DNA alterations identified in one of the regulatory regions of E-SV40 DNA were present in the recombinant monomolecular genomes. These and other findings indicate that the bipartite genome state can sustain many mutations which wtSV40 cannot directly sustain. However, the mutations can later be introduced into the wild type genomes when the E- and L-SV40 DNAs recombine to generate a new monomolecular genome structure.

  20. De Novo Assembly of Human Herpes Virus Type 1 (HHV-1) Genome, Mining of Non-Canonical Structures and Detection of Novel Drug-Resistance Mutations Using Short- and Long-Read Next Generation Sequencing Technologies

    PubMed Central

    Karamitros, Timokratis; Piorkowska, Renata; Katzourakis, Aris; Magiorkinis, Gkikas; Mbisa, Jean Lutamyo

    2016-01-01

    Human herpesvirus type 1 (HHV-1) has a large double-stranded DNA genome of approximately 152 kbp that is structurally complex and GC-rich. This makes the assembly of HHV-1 whole genomes from short-read sequencing data technically challenging. To improve the assembly of HHV-1 genomes we have employed a hybrid genome assembly protocol using data from two sequencing technologies: the short-read Roche 454 and the long-read Oxford Nanopore MinION sequencers. We sequenced 18 HHV-1 cell culture-isolated clinical specimens collected from immunocompromised patients undergoing antiviral therapy. The susceptibility of the samples to several antivirals was determined by plaque reduction assay. Hybrid genome assembly resulted in a decrease in the number of contigs in 6 out of 7 samples and an increase in N(G)50 and N(G)75 of all 7 samples sequenced by both technologies. The approach also enhanced the detection of non-canonical contigs including a rearrangement between the unique (UL) and repeat (T/IRL) sequence regions of one sample that was not detectable by assembly of 454 reads alone. We detected several known and novel resistance-associated mutations in UL23 and UL30 genes. Genome-wide genetic variability ranged from <1% to 53% of amino acids in each gene exhibiting at least one substitution within the pool of samples. The UL23 gene had one of the highest genetic variabilities at 35.2% in keeping with its role in development of drug resistance. The assembly of accurate, full-length HHV-1 genomes will be useful in determining genetic determinants of drug resistance, virulence, pathogenesis and viral evolution. The numerous, complex repeat regions of the HHV-1 genome currently remain a barrier towards this goal. PMID:27309375
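
    The N(G)50 and N(G)75 statistics reported in this abstract follow a standard definition that the sketch below illustrates: sort contigs from longest to shortest and report the length of the contig at which the running total first reaches the chosen fraction of the assembly size (N50, N75) or of a reference genome size (NG50, NG75). The contig lengths are invented for illustration, and the 152,000 bp genome size is taken from the "approximately 152 kbp" stated in the abstract. The function name is hypothetical.

```python
def nx(lengths, fraction=0.5, genome_size=None):
    """Return Nx (or NGx when genome_size is given) for a list of contig lengths.

    fraction=0.5 gives N50/NG50; fraction=0.75 gives N75/NG75.
    Returns 0 if the contigs never reach the required cumulative length.
    """
    reference_total = genome_size if genome_size is not None else sum(lengths)
    threshold = fraction * reference_total
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if running >= threshold:
            return length
    return 0


# Hypothetical assembly of four contigs (bp)
contigs = [80_000, 40_000, 20_000, 10_000]
n50 = nx(contigs, 0.5)
n75 = nx(contigs, 0.75)
ng50 = nx(contigs, 0.5, genome_size=152_000)
ng75 = nx(contigs, 0.75, genome_size=152_000)
print(n50, n75, ng50, ng75)
```

    Because NGx is normalized by the genome size rather than the assembly size, it allows assemblies of different total length to be compared on the same footing, which matters when comparing short-read-only and hybrid assemblies of the same sample.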

  1. De Novo Assembly of Human Herpes Virus Type 1 (HHV-1) Genome, Mining of Non-Canonical Structures and Detection of Novel Drug-Resistance Mutations Using Short- and Long-Read Next Generation Sequencing Technologies.

    PubMed

    Karamitros, Timokratis; Harrison, Ian; Piorkowska, Renata; Katzourakis, Aris; Magiorkinis, Gkikas; Mbisa, Jean Lutamyo

    2016-01-01

    Human herpesvirus type 1 (HHV-1) has a large double-stranded DNA genome of approximately 152 kbp that is structurally complex and GC-rich. This makes the assembly of HHV-1 whole genomes from short-read sequencing data technically challenging. To improve the assembly of HHV-1 genomes we have employed a hybrid genome assembly protocol using data from two sequencing technologies: the short-read Roche 454 and the long-read Oxford Nanopore MinION sequencers. We sequenced 18 HHV-1 cell culture-isolated clinical specimens collected from immunocompromised patients undergoing antiviral therapy. The susceptibility of the samples to several antivirals was determined by plaque reduction assay. Hybrid genome assembly resulted in a decrease in the number of contigs in 6 out of 7 samples and an increase in N(G)50 and N(G)75 of all 7 samples sequenced by both technologies. The approach also enhanced the detection of non-canonical contigs including a rearrangement between the unique (UL) and repeat (T/IRL) sequence regions of one sample that was not detectable by assembly of 454 reads alone. We detected several known and novel resistance-associated mutations in UL23 and UL30 genes. Genome-wide genetic variability ranged from <1% to 53% of amino acids in each gene exhibiting at least one substitution within the pool of samples. The UL23 gene had one of the highest genetic variabilities at 35.2% in keeping with its role in development of drug resistance. The assembly of accurate, full-length HHV-1 genomes will be useful in determining genetic determinants of drug resistance, virulence, pathogenesis and viral evolution. The numerous, complex repeat regions of the HHV-1 genome currently remain a barrier towards this goal.

  2. Chromosomal Mapping of Repetitive DNA Sequences in the Genus Bryconamericus (Characidae) and DNA Barcoding to Differentiate Populations.

    PubMed

    Santos, Angélica Rossotti Dos; Usso, Mariana Campaner; Gouveia, Juceli Gonzalez; Araya-Jaime, Cristian; Frantine-Silva, Wilson; Giuliano-Caetano, Lucia; Foresti, Fausto; Dias, Ana Lúcia

    2017-06-01

    The mapping of repetitive DNA sites by fluorescence in situ hybridization has been widely used for karyotype studies in different species of fish, especially when dealing with related species or even genera presenting high chromosome variability. This study analyzed three populations of Bryconamericus, with diploid number preserved, but with different karyotype formulae. Bryconamericus ecai, from the Forquetinha river/RS, presented three new cytotypes, increasing the number of karyotype forms to seven in this population. Two other populations of Bryconamericus sp. from the Vermelho stream/PR and Cambuta river/PR exhibited interpopulation variation. The chromosome mapping of rDNA sites revealed unique markings among the three populations, showing inter- and intrapopulation variability located in the terminal region. The molecular analysis using DNA barcoding complementing the cytogenetic analysis also showed differentiation among the three populations. The U2 small nuclear DNA repetitive sequence exhibited conserved features, being located in the interstitial region of a single chromosome pair. This is the first report on its occurrence in the genus Bryconamericus. Data obtained revealed a karyotype variability already assigned to the genus, along with polymorphism of ribosomal sites, demonstrating that this group of fish may be undergoing a divergent evolutionary process, constituting a substantive model for studies of chromosomal evolution.

  3. Classification of Culturable Bifidobacterial Population from Colonic Samples of Wild Pigs (Sus scrofa) Based on Three Molecular Genetic Methods.

    PubMed

    Pechar, Radko; Killer, Jiří; Mekadim, Chahrazed; Geigerová, Martina; Rada, Vojtěch

    2017-11-01

    The occurrence of bifidobacteria, known as health-promoting probiotic microorganisms, in the digestive tract of wild pigs (Sus scrofa) has not yet been examined. One hundred forty-nine fructose-6-phosphate phosphoketolase positive bacterial strains were isolated from the colonic content of twenty-two wild pigs originating from four localities in Czechia. Based on the PCR-DGGE technique targeting the variable V3 region of the 16S rRNA genes, strains were initially differentiated into four groups represented by: (i) probably a new Bifidobacterium species (89 strains), (ii) B. boum/B. thermophilum/B. thermacidophilum subsp. porcinum/B. thermacidophilum subsp. thermacidophilum (sub)species (49 strains), (iii) Pseudoscardovia suis (7 strains), and (iv) B. pseudolongum subsp. globosum/B. pseudolongum subsp. pseudolongum (4 strains), respectively. Given that the DGGE technique did not allow differentiation of the representatives of thermophilic bifidobacteria and B. pseudolongum subspecies, strains were further classified by the 16S rRNA and thrS gene sequences. Primers targeting the variable regions of the latter gene were designed to be applicable in identification and phylogeny of the Bifidobacteriaceae family. The 16S rRNA-derived phylogenetic study classified members of the first group into five subgroups in a separate cluster of thermophilic bifidobacteria. Comparable results were obtained by the thrS-derived phylogenetic analysis. Remarkably, variability among thrS sequences was higher than among 16S rRNA gene sequences. Overall, the application of molecular genetic techniques allowed identification of a new Bifidobacterium phylotype which is predominant in the digestive tract of the examined wild pigs.

  4. Epidemiology and Genetic Characterization of Hepatitis A Virus Genotype IIA

    PubMed Central

    Desbois, Delphine; Couturier, Elisabeth; Mackiewicz, Vincent; Graube, Arielle; Letort, Marie-José; Dussaix, Elisabeth; Roque-Afonso, Anne-Marie

    2010-01-01

    Three hepatitis A virus (HAV) genotypes, I, II, and III, divided into subtypes A and B, infect humans. Genotype I is the most frequently reported, while genotype II is hardly ever isolated, and its genetic diversity is unknown. From 2002 to 2007, a French epidemiological survey of HAV identified 6 IIA isolates, mostly from patients who did not travel abroad. The possible African origin of IIA strains was investigated by screening the 2008 mandatory notification records of HAV infection: 171 HAV strains from travelers to West Africa and Morocco were identified. Genotyping was performed by sequencing of the VP1/2A junction in 68 available sera. Entire P1 and 5′ untranslated regions of IIA strains were compared to reference sequences of other genotypes. The screening retrieved 5 imported IIA isolates. An additional autochthonous case and 2 more African cases were identified in 2008 and 2009, respectively. A total of 14 IIA isolates (8 African and 6 autochthonous) were analyzed. IIA sequences presented lower nucleotide and amino acid variability than other genotypes. The highest variability was observed in the N-terminal region of VP1, while for other genotypes the highest variability was observed at the VP1/2A junction. Phylogenetic analysis identified 2 clusters, one gathering all African and two autochthonous cases and a second including only autochthonous isolates. In conclusion, most IIA strains isolated in France are imported by travelers returning from West Africa. However, the unexplained contamination mode of autochthonous cases suggests another, still to be discovered geographical origin or a French reservoir to be explored. PMID:20592136

  5. Deletion mutants of Harvey ras p21 protein reveal the absolute requirement of at least two distant regions for GTP-binding and transforming activities.

    PubMed Central

    Lacal, J C; Anderson, P S; Aaronson, S A

    1986-01-01

    Deletions of small sequences from the viral Harvey ras gene have been generated, and resulting ras p21 mutants have been expressed in Escherichia coli. Purification of each deleted protein allowed the in vitro characterization of GTP-binding, GTPase and autokinase activity of the proteins. Microinjection of the highly purified proteins into quiescent NIH/3T3 cells, as well as transfection experiments utilizing a long terminal repeat (LTR)-containing vector, were utilized to analyze the biological activity of the deleted proteins. Two small regions located at 6-23 and 152-165 residues are shown to be absolutely required for in vitro and in vivo activities of the ras product. By contrast, the variable region comprising amino acids 165-184 was shown not to be necessary for either in vitro or in vivo activities. Thus, we demonstrate that: (i) amino acid sequences at positions 5-23 and 152-165 of ras p21 protein are probably directly involved in the GTP-binding activity; (ii) GTP-binding is required for the transforming activity of ras p21 and by extension for the normal function of the proto-oncogene product; and (iii) the variable region at the C-terminal end of the ras p21 molecule from amino acids 165 to 184 is not required for transformation. PMID:3011420

  6. Effects of Darwinian Selection and Mutability on Rate of Broadly Neutralizing Antibody Evolution during HIV-1 Infection

    PubMed Central

    Sheng, Zizhang; Schramm, Chaim A.; Connors, Mark; Morris, Lynn; Mascola, John R.; Kwong, Peter D.; Shapiro, Lawrence

    2016-01-01

    Accumulation of somatic mutations in antibody variable regions is critical for antibody affinity maturation, with HIV-1 broadly neutralizing antibodies (bnAbs) generally requiring years to develop. We recently found that the rate at which mutations accumulate decreases over time, but the mechanism governing this slowing is unclear. In this study, we investigated whether natural selection and/or mutability of the antibody variable region contributed significantly to the observed decrease in rate. We used longitudinally sampled sequences of immunoglobulin transcripts of single lineages from each of 3 donors, as determined by next generation sequencing. We estimated the evolutionary rates of the complementarity determining regions (CDRs), which are most significant for functional selection, and found they evolved about 1.5- to 2-fold faster than the framework regions. We also analyzed the presence of AID hotspots and coldspots at different points in lineage development and observed an average decrease in mutability of less than 10 percent over time. Altogether, the correlation between Darwinian selection strength and evolutionary rate trended toward significance, especially for CDRs, but cannot fully explain the observed changes in evolutionary rate. Changes in the mutability modulated by AID hotspots and coldspots correlated only weakly with evolutionary rates. The combined effects of Darwinian selection and mutability contribute substantially to, but do not fully explain, evolutionary rate change for HIV-1-targeting bnAb lineages. PMID:27191167

  7. An Avian Basal Ganglia-Forebrain Circuit Contributes Differentially to Syllable Versus Sequence Variability of Adult Bengalese Finch Song

    PubMed Central

    Hampton, Cara M.; Sakata, Jon T.; Brainard, Michael S.

    2009-01-01

    Behavioral variability is important for motor skill learning but continues to be present and actively regulated even in well-learned behaviors. In adult songbirds, two types of song variability can persist and are modulated by social context: variability in syllable structure and variability in syllable sequencing. The degree to which the control of both types of adult variability is shared or distinct remains unknown. The output of a basal ganglia-forebrain circuit, LMAN (the lateral magnocellular nucleus of the anterior nidopallium), has been implicated in song variability. For example, in adult zebra finches, neurons in LMAN actively control the variability of syllable structure. It is unclear, however, whether LMAN contributes to variability in adult syllable sequencing because sequence variability in adult zebra finch song is minimal. In contrast, Bengalese finches retain variability in both syllable structure and syllable sequencing into adulthood. We analyzed the effects of LMAN lesions on the variability of syllable structure and sequencing and on the social modulation of these forms of variability in adult Bengalese finches. We found that lesions of LMAN significantly reduced the variability of syllable structure but not of syllable sequencing. We also found that LMAN lesions eliminated the social modulation of the variability of syllable structure but did not detect significant effects on the modulation of sequence variability. These results show that LMAN contributes differentially to syllable versus sequence variability of adult song and suggest that these forms of variability are regulated by distinct neural pathways. PMID:19357331

  8. Effects of legacy nuclear waste on the compositional diversity and distributions of sulfate-reducing bacteria in a terrestrial subsurface aquifer.

    PubMed

    Bagwell, Christopher E; Liu, Xuaduan; Wu, Liyou; Zhou, Jizhong

    2006-03-01

    The impact of legacy nuclear waste on the compositional diversity and distribution of sulfate-reducing bacteria in a heavily contaminated subsurface aquifer was examined. dsrAB clone libraries were constructed and restriction fragment length polymorphism (RFLP) analysis used to evaluate genetic variation between sampling wells. Principal component analysis identified nickel, nitrate, technetium, and organic carbon as the primary variables contributing to well-to-well geochemical variability, although comparative sequence analysis showed the sulfate-reducing bacteria community structure to be consistent throughout contaminated and uncontaminated regions of the aquifer. Only 3% of recovered dsrAB gene sequences showed apparent membership to the Deltaproteobacteria. The remainder of recovered sequences may represent novel, deep-branching lineages that, to our knowledge, do not presently contain any cultivated members; although corresponding phylotypes have recently been reported from several different marine ecosystems. These findings imply resiliency and adaptability of sulfate-reducing bacteria to extremes in environmental conditions, although the possibility for horizontal transfer of dsrAB is also discussed.

  9. Characterization of Samples Identified as Hepatitis C Virus Genotype 1 without Subtype by Abbott RealTime HCV Genotype II Assay Using the New Abbott HCV Genotype Plus RUO Test.

    PubMed

    Mokhtari, Camelia; Ebel, Anne; Reinhardt, Birgit; Merlin, Sandra; Proust, Stéphanie; Roque-Afonso, Anne-Marie

    2016-02-01

    Hepatitis C virus (HCV) genotyping continues to be relevant for therapeutic strategies. Some samples are reported as genotype 1 (gt 1) without subtype by the Abbott RealTime HCV Genotype II (GT II) test. To characterize such samples further, the Abbott HCV Genotype Plus RUO (Plus) assay, which targets the core region for gt 1a, gt 1b, and gt 6 detection, was evaluated as a reflex test in reference to NS5B or 5'-untranslated region (UTR)/core region sequencing. Of 3,626 routine samples, results of gt 1 without subtype were received for 171 samples (4.7%), accounting for 11.5% of gt 1 specimens. The Plus assay and sequencing were applied to 98 of those samples. NS5B or 5'-UTR/core region sequencing was successful for 91/98 specimens (92.9%). Plus assay and sequencing results were concordant for 87.9% of specimens (80/91 samples). Sequencing confirmed Plus assay results for 82.6%, 85.7%, 100%, and 89.3% of gt 1a, gt 1b, gt 6, and non-gt 1a/1b/6 results, respectively. Notably, 12 gt 6 samples that had been identified previously as gt 1 without subtype were assigned correctly here; for 25/28 samples reported as "not detected" by the Plus assay, sequencing identified the samples as gt 1 with subtypes other than 1a/1b. The genetic variability of HCV continues to present challenges for the current genotyping platforms regardless of the applied methodology. Samples identified by the GT II assay as gt 1 without subtype can be further resolved and reliably characterized by the new Plus assay. Copyright © 2016 Mokhtari et al.

  10. Complete nucleotide sequence and genome structure of a Japanese isolate of hibiscus latent Fort Pierce virus, a unique tobamovirus that contains an internal poly(A) region in its 3' end.

    PubMed

    Yoshida, Tetsuya; Kitazawa, Yugo; Komatsu, Ken; Neriya, Yutaro; Ishikawa, Kazuya; Fujita, Naoko; Hashimoto, Masayoshi; Maejima, Kensaku; Yamaji, Yasuyuki; Namba, Shigetou

    2014-11-01

    In this study, we detected a Japanese isolate of hibiscus latent Fort Pierce virus (HLFPV-J), a member of the genus Tobamovirus, in a hibiscus plant in Japan and determined the complete sequence and organization of its genome. HLFPV-J has four open reading frames (ORFs), each of which shares more than 98 % nucleotide sequence identity with those of other HLFPV isolates. Moreover, HLFPV-J contains a unique internal poly(A) region of variable length, ranging from 44 to 78 nucleotides, in its 3'-untranslated region (UTR), as is the case with hibiscus latent Singapore virus (HLSV), another hibiscus-infecting tobamovirus. The length of the HLFPV-J genome was 6431 nucleotides, including the shortest internal poly(A) region. The sequence identities of ORFs 1, 2, 3 and 4 of HLFPV-J to other tobamoviruses were 46.6-68.7, 49.9-70.8, 31.0-70.8 and 39.4-70.1 %, respectively, at the nucleotide level and 39.8-75.0, 43.6-77.8, 19.2-70.4 and 31.2-74.2 %, respectively, at the amino acid level. The 5'- and 3'-UTRs of HLFPV-J showed 24.3-58.6 and 13.0-79.8 % identity, respectively, to other tobamoviruses. In particular, when compared to other tobamoviruses, each ORF and UTR of HLFPV-J showed the highest sequence identity to those of HLSV. Phylogenetic analysis showed that HLFPV-J, other HLFPV isolates and HLSV constitute a malvaceous-plant-infecting tobamovirus cluster. These results indicate that the genomic structure of HLFPV-J has unique features similar to those of HLSV. To our knowledge, this is the first report of the complete genome sequence of HLFPV.

  11. The nucleotide sequence of the entire ribosomal DNA operon and the structure of the large subunit rRNA of Giardia muris.

    PubMed

    van Keulen, H; Gutell, R R; Campbell, S R; Erlandsen, S L; Jarroll, E L

    1992-10-01

    The total nucleotide sequence of the rDNA of Giardia muris, an intestinal protozoan parasite of rodents, has been determined. The repeat unit is 7668 basepairs (bp) in size and consists of a spacer of 3314 bp, a small-subunit rRNA (SSU-rRNA) gene of 1429, and a large-subunit rRNA (LSU-rRNA) gene of 2698 bp. The spacer contains long direct repeats and is heterogeneous in size. The LSU-rRNA of G. muris was compared to that of the human intestinal parasite Giardia duodenalis, to the bird parasite Giardia ardeae, and to that of Escherichia coli. The LSU-rRNA has a size comparable to the 23S rRNA of E. coli but shows structural features typical for eukaryotes. Some variable regions are typically small and account for the overall smaller size of this rRNA. The structure of the G. muris LSU-rRNA is similar to that of the other Giardia rRNA, but each rRNA has characteristic features residing in a number of variable regions.

  12. Immunoglobulin kappa light chain gene promoter and enhancer are not responsible for B-cell restricted gene rearrangement.

    PubMed Central

    Goodhardt, M; Babinet, C; Lutfalla, G; Kallenbach, S; Cavelier, P; Rougeon, F

    1989-01-01

    We have produced transgenic mice which synthesize chimeric mouse-rabbit immunoglobulin (Ig) kappa light chains following in vivo recombination of an injected unrearranged kappa gene. The exogenous gene construct contained a mouse germ-line kappa variable (V kappa) gene segment, the mouse germ-line joining (J kappa) locus including the enhancer, and the rabbit b9 constant (C kappa) region. A high level of V-J recombination of the kappa transgene was observed in spleen of the transgenic mice. Surprisingly, a particularly high degree of variability in the exact site of recombination and the presence of non germ-line encoded nucleotides (N-regions) were found at the V-J junction of the rearranged kappa transgene. Furthermore, unlike endogenous kappa genes, rearrangement of the exogenous gene occurred in T-cells of the transgenic mice. These results show that additional sequences, other than the heptamer-nonamer signal sequences and the promoter and enhancer elements, are required to obtain stage- and lineage-specific regulation of Ig kappa light chain gene rearrangement in vivo. PMID:2508061

  13. Molecular Strain Typing of Mycobacterium tuberculosis: a Review of Frequently Used Methods

    PubMed Central

    2016-01-01

    Tuberculosis, caused by the bacterium Mycobacterium tuberculosis, remains one of the most serious global health problems. Molecular typing of M. tuberculosis has been used for various epidemiologic purposes as well as for clinical management. Currently, many techniques are available to type M. tuberculosis. Choosing the most appropriate technique in accordance with the existing laboratory conditions and the specific features of the geographic region is important. Insertion sequence IS6110-based restriction fragment length polymorphism (RFLP) analysis is considered the gold standard for the molecular epidemiologic investigations of tuberculosis. However, other polymerase chain reaction-based methods such as spacer oligonucleotide typing (spoligotyping), which detects 43 spacer sequence-interspersing direct repeats (DRs) in the genomic DR region; mycobacterial interspersed repetitive units–variable number tandem repeats, (MIRU-VNTR), which determines the number and size of tandem repetitive DNA sequences; repetitive-sequence-based PCR (rep-PCR), which provides high-throughput genotypic fingerprinting of multiple Mycobacterium species; and the recently developed genome-based whole genome sequencing methods demonstrate similar discriminatory power and greater convenience. This review focuses on techniques frequently used for the molecular typing of M. tuberculosis and discusses their general aspects and applications. PMID:27709842

  14. Molecular Strain Typing of Mycobacterium tuberculosis: a Review of Frequently Used Methods.

    PubMed

    Ei, Phyu Win; Aung, Wah Wah; Lee, Jong Seok; Choi, Go Eun; Chang, Chulhun L

    2016-11-01

    Tuberculosis, caused by the bacterium Mycobacterium tuberculosis, remains one of the most serious global health problems. Molecular typing of M. tuberculosis has been used for various epidemiologic purposes as well as for clinical management. Currently, many techniques are available to type M. tuberculosis. Choosing the most appropriate technique in accordance with the existing laboratory conditions and the specific features of the geographic region is important. Insertion sequence IS6110-based restriction fragment length polymorphism (RFLP) analysis is considered the gold standard for the molecular epidemiologic investigations of tuberculosis. However, other polymerase chain reaction-based methods such as spacer oligonucleotide typing (spoligotyping), which detects 43 spacer sequence-interspersing direct repeats (DRs) in the genomic DR region; mycobacterial interspersed repetitive units-variable number tandem repeats, (MIRU-VNTR), which determines the number and size of tandem repetitive DNA sequences; repetitive-sequence-based PCR (rep-PCR), which provides high-throughput genotypic fingerprinting of multiple Mycobacterium species; and the recently developed genome-based whole genome sequencing methods demonstrate similar discriminatory power and greater convenience. This review focuses on techniques frequently used for the molecular typing of M. tuberculosis and discusses their general aspects and applications.

  15. Myxobolus cerebralis internal transcribed spacer 1 (ITS-1) sequences support recent spread of the parasite to North America and within Europe

    USGS Publications Warehouse

    Whipps, Christopher M.; El-Matbouli, M.; Hedrick, R.P.; Blazer, V.; Kent, M.L.

    2004-01-01

    Molecular approaches for resolving relationships among the Myxozoa have relied mainly on small subunit (SSU) ribosomal DNA (rDNA) sequence analysis. This region of the gene is generally used for higher phylogenetic studies, and the conservative nature of this gene may make it inadequate for intraspecific comparisons. Previous intraspecific studies of Myxobolus cerebralis based on molecular analyses reported that the sequence of SSU rDNA and the internal transcribed spacer (ITS) were highly conserved in representatives of the parasite from North America and Europe. Considering that the ITS is usually a more variable region than the SSU, we reanalyzed available sequences on GenBank and obtained sequences from other M. cerebralis representatives from the states of California and West Virginia in the USA and from Germany and Russia. With the exception of 7 base pairs, most of the sequence designated as ITS-1 in GenBank was a highly conserved portion of the rDNA near the 3-prime end of the SSU region. Nonetheless, the additional ITS-1 sequences obtained from the available geographic representatives were well conserved. It is unlikely that we would have observed virtually identical ITS-1 sequences between European and American M. cerebralis samples had it spread naturally over time, particularly when compared to the variation seen between isolates of another myxozoan (Kudoa thyrsites) that has most likely spread naturally. These data further support the hypothesis that the current distribution of M. cerebralis in North America is a result of recent introductions followed by dispersal via anthropogenic means, largely through the stocking of infected trout for sport fishing.

  16. PCR Primers for Metazoan Nuclear 18S and 28S Ribosomal DNA Sequences

    PubMed Central

    Machida, Ryuji J.; Knowlton, Nancy

    2012-01-01

    Background: Metagenetic analyses, which amplify and sequence target marker DNA regions from environmental samples, are increasingly employed to assess the biodiversity of communities of small organisms. Using this approach, our understanding of microbial diversity has expanded greatly. In contrast, only a few studies using this approach to characterize metazoan diversity have been reported, despite the fact that many metazoan species are small and difficult to identify or are undescribed. One of the reasons for this discrepancy is the availability of universal primers for the target taxa. In microbial studies, analysis of the 16S ribosomal DNA is standard. In contrast, the best gene for metazoan metagenetics is less clear. In the present study, we have designed primers that amplify the nuclear 18S and 28S ribosomal DNA sequences of most metazoan species with the goal of providing effective approaches for metagenetic analyses of metazoan diversity in environmental samples, with a particular emphasis on marine biodiversity. Methodology/Principal Findings: Conserved regions suitable for designing PCR primers were identified using 14,503 and 1,072 metazoan sequences of the nuclear 18S and 28S rDNA regions, respectively. The sequence similarity of both these newly designed and the previously reported primers to the target regions of these primers were compared for each phylum to determine the expected amplification efficacy. The nucleotide diversity of the flanking regions of the primers was also estimated for genera or higher taxonomic groups of 11 phyla to determine the variable regions within the genes. Conclusions/Significance: The identified nuclear ribosomal DNA primers (five primer pairs for 18S and eleven for 28S) and the results of the nucleotide diversity analyses provide options for primer combinations for metazoan metagenetic analyses. Additionally, advantages and disadvantages of not only the 18S and 28S ribosomal DNA, but also other marker regions as targets for metazoan metagenetic analyses, are discussed. PMID:23049971

  17. [Using IRAP markers for analysis of genetic variability in populations of resource and rare species of plants].

    PubMed

    Boronnikova, S V; Kalendar', R N

    2010-01-01

    Species-specific LTR retrotransposons were first cloned in five rare relic species of drug plants located in the Perm' region. Sequences of LTR retrotransposons were used for PCR analysis based on amplification of repeated sequences from LTR or other sites of retrotransposons (IRAP). Genetic diversity was studied in six populations of rare relic species of plants Adonis vernalis L. by means of the IRAP method; 125 polymorphic IRAP-markers were analyzed. Parameters for DNA polymorphism and genetic diversity of A. vernalis populations were determined.

  18. Highly conserved CDR3 region in circulating CD4+Vβ5+ T cells may be associated with cytotoxic activity in Chagas disease

    PubMed Central

    Menezes, C A S; Sullivan, A K; Falta, M T; Mack, D G; Freed, B M; Rocha, M O C; Gollob, K J; Fontenot, A P; Dutra, W O

    2012-01-01

    Human infection with Trypanosoma cruzi leads to Chagas disease, which presents as several different clinical conditions ranging from an asymptomatic form to a severe dilated cardiomyopathy. Several studies have demonstrated that T cells play a critical role in the development of cardiac pathology, as well as in immunoregulation during chronic disease. However, the mechanisms that drive protective or pathogenic T cell response are not known. We have shown that CD4+ T cells from chagasic patients preferentially express T cell receptor (TCR) β-chain variable region (Vβ) 5. The aim of this work was to determine whether T cells expressing this particular Vβ region displayed variable or restricted CDR3 sequences, as an indicator of the nature of the stimulus leading to the activation of these T cells in vivo. Additionally, we aimed to evaluate phenotypic characteristics of these cells that might be associated with pathology. CDR3 junctional region sequencing of Vβ5·1 expressing CD4+ T cells revealed the occurrence of a highly homologous CDR3 region with conserved TCR Jβ region usage among patients with cardiac, but not indeterminate, Chagas disease. Moreover, correlation analysis indicated that the frequency of CD4+Vβ5·1+ cells is associated with granzyme A expression, suggesting that these cells might display cytotoxic function. Together these results provide new insight into T cell recognition of antigens involved in Chagas disease and suggest that these cells may be implicated in the pathogenesis of chagasic cardiomyopathy. PMID:22774985

  19. Small RNA sequencing in cells and exosomes identifies eQTLs and 14q32 as a region of active export

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Tsang, Emily K.; Abell, Nathan S.; Li, Xin

    Exosomes are small extracellular vesicles that carry heterogeneous cargo, including RNA, between cells. Increasing evidence suggests that exosomes are important mediators of intercellular communication and biomarkers of disease. Despite this, the variability of exosomal RNA between individuals has not been well quantified. To assess this variability, we sequenced the small RNA of cells and exosomes from a 17-member family. Across individuals, we show that selective export of miRNAs occurs not only at the level of specific transcripts, but that a cluster of 74 mature miRNAs on chromosome 14q32 is massively exported in exosomes while mostly absent from cells. We also observe more interindividual variability between exosomal samples than between cellular ones and identify four miRNA expression quantitative trait loci shared between cells and exosomes. Lastly, our findings indicate that genomically colocated miRNAs can be exported together and highlight the variability in exosomal miRNA levels between individuals as relevant for exosome use as diagnostics.

  20. Small RNA sequencing in cells and exosomes identifies eQTLs and 14q32 as a region of active export

    DOE PAGES

    Tsang, Emily K.; Abell, Nathan S.; Li, Xin; ...

    2016-10-31

    Exosomes are small extracellular vesicles that carry heterogeneous cargo, including RNA, between cells. Increasing evidence suggests that exosomes are important mediators of intercellular communication and biomarkers of disease. Despite this, the variability of exosomal RNA between individuals has not been well quantified. To assess this variability, we sequenced the small RNA of cells and exosomes from a 17-member family. Across individuals, we show that selective export of miRNAs occurs not only at the level of specific transcripts, but that a cluster of 74 mature miRNAs on chromosome 14q32 is massively exported in exosomes while mostly absent from cells. We also observe more interindividual variability between exosomal samples than between cellular ones and identify four miRNA expression quantitative trait loci shared between cells and exosomes. Lastly, our findings indicate that genomically colocated miRNAs can be exported together and highlight the variability in exosomal miRNA levels between individuals as relevant for exosome use as diagnostics.

  1. Complete genome sequence of a novel Plum pox virus strain W isolate determined by 454 pyrosequencing.

    PubMed

    Sheveleva, Anna; Kudryavtseva, Anna; Speranskaya, Anna; Belenikin, Maxim; Melnikova, Natalia; Chirkov, Sergei

    2013-10-01

    The near-complete (99.7 %) genome sequence of a novel Russian Plum pox virus (PPV) isolate Pk, belonging to the strain Winona (W), has been determined by 454 pyrosequencing with the exception of the thirty-one 5'-terminal nucleotides. This region was amplified using 5'RACE kit and sequenced by the Sanger method. Genomic RNA released from immunocaptured PPV particles was employed for generation of cDNA library using TransPlex Whole transcriptome amplification kit (WTA2, Sigma-Aldrich). The entire Pk genome has identity level of 92.8-94.5 % when compared to the complete nucleotide sequences of other PPV-W isolates (W3174, LV-141pl, LV-145bt, and UKR 44189), confirming a high degree of variability within the PPV-W strain. The isolates Pk and LV-141pl are most closely related. The Pk has been found in a wild plum (Prunus domestica) in a new region of Russia indicating widespread dissemination of the PPV-W strain in the European part of the former USSR.

  2. Haplotype Phasing and Inheritance of Copy Number Variants in Nuclear Families

    PubMed Central

    Palta, Priit; Kaplinski, Lauris; Nagirnaja, Liina; Veidenberg, Andres; Möls, Märt; Nelis, Mari; Esko, Tõnu; Metspalu, Andres; Laan, Maris; Remm, Maido

    2015-01-01

    DNA copy number variants (CNVs) that alter the copy number of a particular DNA segment in the genome play an important role in human phenotypic variability and disease susceptibility. A number of CNVs overlapping with genes have been shown to confer risk to a variety of human diseases thus highlighting the relevance of addressing the variability of CNVs at a higher resolution. So far, it has not been possible to deterministically infer the allelic composition of different haplotypes present within the CNV regions. We have developed a novel computational method, called PiCNV, which enables to resolve the haplotype sequence composition within CNV regions in nuclear families based on SNP genotyping microarray data. The algorithm allows to i) phase normal and CNV-carrying haplotypes in the copy number variable regions, ii) resolve the allelic copies of rearranged DNA sequence within the haplotypes and iii) infer the heritability of identified haplotypes in trios or larger nuclear families. To our knowledge this is the first program available that can deterministically phase null, mono-, di-, tri- and tetraploid genotypes in CNV loci. We applied our method to study the composition and inheritance of haplotypes in CNV regions of 30 HapMap Yoruban trios and 34 Estonian families. For 93.6% of the CNV loci, PiCNV enabled to unambiguously phase normal and CNV-carrying haplotypes and follow their transmission in the corresponding families. Furthermore, allelic composition analysis identified the co-occurrence of alternative allelic copies within 66.7% of haplotypes carrying copy number gains. We also observed less frequent transmission of CNV-carrying haplotypes from parents to children compared to normal haplotypes and identified an emergence of several de novo deletions and duplications in the offspring. PMID:25853576

  3. Haplotype phasing and inheritance of copy number variants in nuclear families.

    PubMed

    Palta, Priit; Kaplinski, Lauris; Nagirnaja, Liina; Veidenberg, Andres; Möls, Märt; Nelis, Mari; Esko, Tõnu; Metspalu, Andres; Laan, Maris; Remm, Maido

    2015-01-01

    DNA copy number variants (CNVs) that alter the copy number of a particular DNA segment in the genome play an important role in human phenotypic variability and disease susceptibility. A number of CNVs overlapping with genes have been shown to confer risk to a variety of human diseases thus highlighting the relevance of addressing the variability of CNVs at a higher resolution. So far, it has not been possible to deterministically infer the allelic composition of different haplotypes present within the CNV regions. We have developed a novel computational method, called PiCNV, which enables to resolve the haplotype sequence composition within CNV regions in nuclear families based on SNP genotyping microarray data. The algorithm allows to i) phase normal and CNV-carrying haplotypes in the copy number variable regions, ii) resolve the allelic copies of rearranged DNA sequence within the haplotypes and iii) infer the heritability of identified haplotypes in trios or larger nuclear families. To our knowledge this is the first program available that can deterministically phase null, mono-, di-, tri- and tetraploid genotypes in CNV loci. We applied our method to study the composition and inheritance of haplotypes in CNV regions of 30 HapMap Yoruban trios and 34 Estonian families. For 93.6% of the CNV loci, PiCNV enabled to unambiguously phase normal and CNV-carrying haplotypes and follow their transmission in the corresponding families. Furthermore, allelic composition analysis identified the co-occurrence of alternative allelic copies within 66.7% of haplotypes carrying copy number gains. We also observed less frequent transmission of CNV-carrying haplotypes from parents to children compared to normal haplotypes and identified an emergence of several de novo deletions and duplications in the offspring.

  4. Development of an Analysis Pipeline Characterizing Multiple Hypervariable Regions of 16S rRNA Using Mock Samples.

    PubMed

    Barb, Jennifer J; Oler, Andrew J; Kim, Hyung-Suk; Chalmers, Natalia; Wallen, Gwenyth R; Cashion, Ann; Munson, Peter J; Ames, Nancy J

    2016-01-01

    There is much speculation on which hypervariable region provides the highest bacterial specificity in 16S rRNA sequencing. The optimum solution to prevent bias and to obtain a comprehensive view of complex bacterial communities would be to sequence the entire 16S rRNA gene; however, this is not possible with second generation standard library design and short-read next-generation sequencing technology. This paper examines a new process using seven hypervariable or V regions of the 16S rRNA (six amplicons: V2, V3, V4, V6-7, V8, and V9) processed simultaneously on the Ion Torrent Personal Genome Machine (Life Technologies, Grand Island, NY). Four mock samples were amplified using the 16S Ion Metagenomics Kit™ (Life Technologies) and their sequencing data is subjected to a novel analytical pipeline. Results are presented at family and genus level. The Kullback-Leibler divergence (DKL), a measure of the departure of the computed from the nominal bacterial distribution in the mock samples, was used to infer which region performed best at the family and genus levels. Three different hypervariable regions, V2, V4, and V6-7, produced the lowest divergence compared to the known mock sample. The V9 region gave the highest (worst) average DKL while the V4 gave the lowest (best) average DKL. In addition to having a high DKL, the V9 region in both the forward and reverse directions performed the worst finding only 17% and 53% of the known family level and 12% and 47% of the genus level bacteria, while results from the forward and reverse V4 region identified all 17 family level bacteria. The results of our analysis have shown that our sequencing methods using 6 hypervariable regions of the 16S rRNA and subsequent analysis is valid. This method also allowed for the assessment of how well each of the variable regions might perform simultaneously. Our findings will provide the basis for future work intended to assess microbial abundance at different time points throughout a clinical protocol.
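
    The Kullback-Leibler comparison described in this abstract can be illustrated with a minimal sketch. It is not the authors' pipeline. It assumes the divergence is taken from the nominal (known mock) distribution to the computed one, uses base-2 logarithms, and adds a small pseudocount so that taxa missed by a region do not produce a logarithm of zero. The function name, the pseudocount and the taxon abundances are all hypothetical.

```python
import math

def kl_divergence(nominal, computed, pseudocount=1e-6):
    """D_KL(nominal || computed) in bits for two taxon -> abundance dicts.

    Assumption: a pseudocount is added to every taxon and both profiles are
    renormalised, so taxa absent from one profile remain comparable.
    """
    taxa = set(nominal) | set(computed)
    p = {t: nominal.get(t, 0.0) + pseudocount for t in taxa}
    q = {t: computed.get(t, 0.0) + pseudocount for t in taxa}
    p_total = sum(p.values())
    q_total = sum(q.values())
    return sum(
        (p[t] / p_total) * math.log2((p[t] / p_total) / (q[t] / q_total))
        for t in taxa
    )

# Hypothetical mock community and two hypothetical per-region estimates.
mock = {"Family_A": 0.5, "Family_B": 0.5}
close_region = {"Family_A": 0.45, "Family_B": 0.55}  # recovers both families
poor_region = {"Family_A": 1.0}                      # misses Family_B entirely

print(kl_divergence(mock, close_region))
print(kl_divergence(mock, poor_region))
```

    Under these assumptions a region that misses known taxa yields a larger divergence than one that recovers them with slightly skewed proportions, which matches the abstract's reading of lower DKL as better.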

  5. Mitochondrial genome sequences and comparative genomics of Phytophthora ramorum and P. sojae

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Martin, Frank N.; Douda, Bensasson; Tyler, Brett M.

    The complete sequences of the mitochondrial genomes of the oomycetes of Phytophthora ramorum and P. sojae were determined during the course of their complete nuclear genome sequencing (Tyler, et al. 2006). Both are circular, with sizes of 39,314 bp for P. ramorum and 42,975 bp for P. sojae. Each contains a total of 37 identifiable protein-encoding genes, 25 or 26 tRNAs (P. sojae and P. ramorum, respectively) specifying 19 amino acids, and a variable number of ORFs (7 for P. ramorum and 12 for P. sojae) which are potentially additional functional genes. Non-coding regions comprise approximately 11.5 percent and 18.4 percent of the genomes of P. ramorum and P. sojae, respectively. Relative to P. sojae, there is an inverted repeat of 1,150 bp in P. ramorum that includes an unassigned unique ORF, a tRNA gene, and adjacent non-coding sequences, but otherwise the gene order in both species is identical. Comparisons of these genomes with published sequences of the P. infestans mitochondrial genome reveals a number of similarities, but the gene order in P. infestans differs in two adjacent locations due to inversions. Sequence alignments of the three genomes indicated sequence conservation ranging from 75 to 85 percent and that specific regions were more variable than others.

  6. Length variation and sequence divergence in mitochondrial control region of Schizothoracine (Teleostei: Cyperinidae) species.

    PubMed

    Syed, Mudasir Ahmad; Bhat, Farooz Ahmad; Balkhi, Masood-ul Hassan; Bhat, Bilal Ahmad

    2016-01-01

    Schizothoracine fish, commonly called snow trouts, inhabit the entire network of snow- and spring-fed cool waters of Kashmir, India. Of more than 10 species reported earlier, only five species have been found; these include Schizothorax niger, Schizothorax esocinus, Schizothorax plagiostomus, Schizothorax curvifrons and Schizothorax labiatus. The reported relationships among these species are contradictory. To understand the evolutionary relation of these species, we examined the sequence information of the mitochondrial D-loop of 25 individuals representing five species. Sequence alignment showed the D-loop region to be highly variable, and length variation was observed in a di-nucleotide (TA)n microsatellite between and within species. Interestingly, all these species have a (TA)n microsatellite not associated with longer tandem repeats at the 3' end of the mitochondrial control region and do not show heteroplasmy. Our analysis also indicates the presence of four conserved sequence blocks (CSB), CSB-D, CSB-1, CSB-II and CSB-III, four termination-associated sequence (TAS) motifs and a 15-bp pyrimidine block within the mitochondrial control region, that are highly conserved within genus Schizothorax when compared with other species. The phylogenetic analyses carried out by Maximum likelihood (ML), Neighbor Joining (NJ) and Bayesian inference (BI) generated almost identical results. The resultant BI tree showed a close genetic relationship of all the five species and supports two distinct groupings of S. esocinus species. Besides the species relation, the presence of length variation in tandem repeats is attributed to differences in predicting the stability of secondary structures. The role of CSBs and TASs, reported so far as main regulatory signals, would explain the conservation of these elements in evolution.

  7. Merozoite surface protein-1 genetic diversity in Plasmodium malariae and Plasmodium brasilianum from Brazil.

    PubMed

    Guimarães, Lilian O; Wunderlich, Gerhard; Alves, João M P; Bueno, Marina G; Röhe, Fabio; Catão-Dias, José L; Neves, Amanda; Malafronte, Rosely S; Curado, Izilda; Domingues, Wilson; Kirchgatter, Karin

    2015-11-16

    The merozoite surface protein 1 (MSP1) gene encodes the major surface antigen of invasive forms of the Plasmodium erythrocytic stages and is considered a candidate vaccine antigen against malaria. Due to its polymorphisms, MSP1 is also useful for strain discrimination and constitutes a good genetic marker. Sequence diversity in MSP1 has been analyzed in field isolates of three human parasites: P. falciparum, P. vivax, and P. ovale. However, the extent of variation in another human parasite, P. malariae, remains unknown. This parasite shows widespread, uneven distribution in tropical and subtropical regions throughout South America, Asia, and Africa. Interestingly, it is genetically indistinguishable from P. brasilianum, a parasite known to infect New World monkeys in Central and South America. Specific fragments (1 to 5) covering 60 % of the MSP1 gene (mainly the putatively polymorphic regions) were amplified by PCR in isolates of P. malariae and P. brasilianum from different geographic origins and hosts. Sequencing of the PCR-amplified products or cloned PCR fragments was performed and the sequences were used to construct a phylogenetic tree by the maximum likelihood method. Data were computed to give insights into the evolutionary and phylogenetic relationships of these parasites. Except for fragment 4, the sequences from all fragments were previously unpublished. The most polymorphic gene region was fragment 2, and in samples where this region lacks polymorphism, all other regions are also identical. The low variability of the P. malariae msp1 sequences of these isolates and the identification of the same haplotype in isolates collected many years apart at different locations are compatible with a low transmission rate. We also found greater diversity among P. brasilianum isolates compared with P. malariae ones. Lastly, the sequences were segregated according to their geographic origins and hosts, showing a strong genetic and geographic structure.
    Our data show that there is a low level of sequence diversity and a possible absence of allelic dimorphism of MSP1 in these parasites as opposed to other Plasmodium species. P. brasilianum strains apparently show greater divergence in comparison to P. malariae; thus, P. malariae could derive from P. brasilianum, as has been proposed.

  8. Typing of Intimin Genes in Human and Animal Enterohemorrhagic and Enteropathogenic Escherichia coli: Characterization of a New Intimin Variant

    PubMed Central

    Oswald, E.; Schmidt, H.; Morabito, S.; Karch, H.; Marchès, O.; Caprioli, A.

    2000-01-01

    Enteropathogenic Escherichia coli (EPEC) and enterohemorrhagic E. coli (EHEC) produce the characteristic “attaching and effacing” (A/E) lesion of the brush border. Intimin, an outer membrane protein encoded by eae, is responsible for the tight association of both pathogens with the host cell. Several eae genes have been cloned from different EPEC and EHEC strains isolated from humans and animals. These sequences are conserved in the N-terminal region but highly variable in the last C-terminal 280 amino acids (aa), where the cell binding activity is localized. Based on these considerations, we developed a panel of specific primers to investigate the eae heterogeneity of the variable 3′ region by using PCR amplification. We then investigated the distribution of the known intimin types in a large collection of EPEC and EHEC strains isolated from humans and different animal species. The existence of a yet-unknown family of intimin was suspected because several EHEC strains, isolated from humans and cattle, did not react with any of the specific primer pairs, although these strains were eae positive when primers amplifying the conserved 5′ end were used. We then cloned and sequenced the eae present in one of these strains (EHEC of serotype O103:H2) and subsequently designed a PCR primer that specifically recognizes the variable 3′ region of this new intimin type. This intimin, referred to as “ɛ,” was present in human and bovine EHEC strains of serogroups O8, O11, O45, O103, O121, and O165. Intimin ɛ is the largest intimin cloned to date (948 aa) and shares the greatest overall sequence identity with intimin β, although analysis of the last C-terminal 280 aa suggests a greater similarity with intimins α and γ. PMID:10603369

  9. Mutations in the E2 and NS5A regions in patients infected with hepatitis C virus genotype 1a and their correlation with response to treatment.

    PubMed

    Yahoo, Neda; Sabahi, Farzaneh; Shahzamani, Kiana; Malboobi, Mohamad Ali; Jabbari, Hossain; Sharifi, Houshang; Mousavi-Fard, Seyed Hossein; Merat, Shahin

    2011-08-01

    Heterogeneity of subgenomic regions of hepatitis C virus (HCV) may be associated with response to interferon (IFN) therapy. The amino acid sequences of the PKR/eIF-2α phosphorylation homology domain (pePHD), IFN sensitivity determining region (ISDR), PKR binding domain (PKRBD), and variable region 3 (V3) were studied in 19 patients before and after 4 weeks of treatment. All patients were infected with HCV genotype 1a and were treated with pegylated-IFN and ribavirin. Thirteen patients achieved sustained viral response (responders) and six failed to clear viral RNA (nonresponders). The amino acid sequences in the pePHD and ISDR were identical in responders and nonresponders. However, amino acid substitution at position 2252 of PKRBD was significantly different between responders and nonresponders (P = 0.044). A larger number of mutations were observed in the V3 region of responders (P < 0.001). In this region, the amino acid in position 2364 differed between responders and nonresponders (responders: aspartic acid and serine, nonresponders: asparagine, P = 0.018). The amino acid sequences in the regions which were studied did not change after 4 weeks of treatment. It is concluded that the presence of specific amino acids in position 2252 of PKRBD and position 2364 of V3 might be associated with clinical response to IFN. Copyright © 2011 Wiley-Liss, Inc.

  10. The occurrence of non-pulsating stars in the γ Dor and δ Sct pulsation instability regions: Results from Kepler quarter 14–17 data

    DOE PAGES

    Guzik, J. A.; Bradley, P. A.; Jackiewicz, J.; ...

    2015-04-21

    In this study, the high precision long time-series photometry of the NASA Kepler spacecraft provides an excellent means to discover and characterize variability in main-sequence stars, and to make progress in interpreting the pulsations to derive stellar interior structure and test stellar models. For stars of spectral types A–F, the Kepler data revealed a number of surprises, such as more hybrid pulsating δ Sct and γ Dor pulsators than expected, pulsators lying outside of the instability regions predicted by theory, and stars that were expected to pulsate, but showed no variability. In our 2013 Astronomical Review article, we discussed the statistics of variability for 633 faint (Kepler magnitude 14–16) spectral type A–F stars observed by Kepler during Quarters 6–13 (June 2010–June 2012).

  11. The occurrence of non-pulsating stars in the γ Dor and δ Sct pulsation instability regions: Results from Kepler quarter 14–17 data

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Guzik, J. A.; Bradley, P. A.; Jackiewicz, J.

    In this study, the high precision long time-series photometry of the NASA Kepler spacecraft provides an excellent means to discover and characterize variability in main-sequence stars, and to make progress in interpreting the pulsations to derive stellar interior structure and test stellar models. For stars of spectral types A–F, the Kepler data revealed a number of surprises, such as more hybrid pulsating δ Sct and γ Dor pulsators than expected, pulsators lying outside of the instability regions predicted by theory, and stars that were expected to pulsate, but showed no variability. In our 2013 Astronomical Review article, we discussed the statistics of variability for 633 faint (Kepler magnitude 14–16) spectral type A–F stars observed by Kepler during Quarters 6–13 (June 2010–June 2012).

  12. Development and evaluation of specific PCR primers targeting the ribosomal DNA-internal transcribed spacer (ITS) region of peritrich ciliates in environmental samples

    NASA Astrophysics Data System (ADS)

    Su, Lei; Zhang, Qianqian; Gong, Jun

    2017-07-01

    Peritrich ciliates are highly diverse and can be important bacterial grazers in aquatic ecosystems. Morphological identifications of peritrich species and assemblages in the environment are time-consuming and expertise-demanding. In this study, two peritrich-specific PCR primers were newly designed to amplify a fragment including the internal transcribed spacer (ITS) region of ribosomal DNA (rDNA) from environmental samples. The primers showed high specificity in silico, and in tests with peritrich isolates and environmental DNA. Application of these primers in clone library construction and sequencing yielded exclusively sequences of peritrichs for water and sediment samples. We also found that the ITS1, ITS2, ITS, D1 region of 28S rDNA, and ITS+D1 region co-varied with, and were generally more variable than, the V9 region of 18S rDNA in peritrichs. The newly designed specific primers thus provide additional tools to study the molecular diversity, community composition, and phylogeography of these ecologically important protists in different systems.

  13. Sequence variability in three mitochondrial genes among four roundworm species from wild animals in China.

    PubMed

    Chang, Qiao-Cheng; Gao, Jun-Feng; Sheng, Zhong-Hua; Lou, Yan; Zheng, Xu; Wang, Chun-Ren

    2015-02-01

    Sequence variability in three mitochondrial DNA (mtDNA) regions, namely portions of cytochrome c oxidase subunit 1 (pcox1), NADH dehydrogenase subunit 1 (pnad1) and NADH dehydrogenase subunit 4 (pnad4), was examined for Toxocara canis, Baylisascaris transfuga, Ascaris suum and Parascaris equorum from Canis lupus, Ursus thibetanus, Sus scrofa and Equus burchelli in China. The lengths of the sequences of pcox1, pnad1 and pnad4 were 711 bp, 648 bp and 666 bp, respectively. No intra-species differences were detected in pcox1 for the four examined ascarid species, in pnad1 for T. canis, A. suum and P. equorum, and in pnad4 for B. transfuga and P. equorum. Sequence differences in pnad4 for six roundworm samples of T. canis and P. equorum were 0-0.1% and 0-0.3%, respectively, and were 0-0.3% in pnad1 for six roundworm isolates of B. transfuga. The inter-specific sequence differences among four species were 8.7-12.4% for pcox1, 13.9-17.7% for pnad1, and 14.0-25.7% for pnad4. Phylogenetic analyses suggested that the three mtDNA fragments could be used to identify ascarid species in families Ascarididae and Toxocaridae.

  14. Full-length genome sequences of porcine epidemic diarrhoea virus strain CV777; Use of NGS to analyse genomic and sub-genomic RNAs

    PubMed Central

    Rasmussen, Thomas Bruun; Boniotti, Maria Beatrice; Papetti, Alice; Grasland, Béatrice; Frossard, Jean-Pierre; Dastjerdi, Akbar; Hulst, Marcel; Hanke, Dennis; Pohlmann, Anne; Blome, Sandra; van der Poel, Wim H. M.; Steinbach, Falko; Blanchard, Yannick; Lavazza, Antonio; Bøtner, Anette

    2018-01-01

    Porcine epidemic diarrhoea virus, strain CV777, was initially characterized in 1978 as the causative agent of a disease first identified in the UK in 1971. This coronavirus has been widely distributed among laboratories and has been passaged both within pigs and in cell culture. To determine the variability between different stocks of the PEDV strain CV777, sequencing of the full-length genome (ca. 28 kb) has been performed in 6 different laboratories, using different protocols. Not surprisingly, each of the full genome sequences was distinct from the others and from the reference sequence (Accession number AF353511), but they are >99% identical. Unique and shared differences between sequences were identified. The coding region for the surface-exposed spike protein showed the highest proportion of variability, including both point mutations and small deletions. The predicted expression of the ORF3 gene product was more dramatically affected in three different variants of this virus through either loss of the initiation codon or gain of a premature termination codon. The genome of one isolate had a substantially rearranged 5´-terminal sequence. This rearrangement was validated through the analysis of sub-genomic mRNAs from infected cells. It is clearly important to know the features of the specific sample of CV777 being used for experimental studies. PMID:29494671

  15. The Clusters AgeS Experiment (CASE). Variable Stars in the Field of the Globular Cluster NGC 3201

    NASA Astrophysics Data System (ADS)

    Kaluzny, J.; Rozyczka, M.; Thompson, I. B.; Narloch, W.; Mazur, B.; Pych, W.; Schwarzenberg-Czerny, A.

    2016-01-01

    The field of the globular cluster NGC 3201 was monitored between 1998 and 2009 in a search for variable stars. BV light curves were obtained for 152 periodic or likely periodic variables, fifty-seven of which are new detections. Thirty-seven newly detected variables are proper motion members of the cluster. Among them we found seven detached or semi-detached eclipsing binaries, four contact binaries, and eight SX Phe pulsators. Four of the eclipsing binaries are located in the turnoff region, one on the lower main sequence and the remaining two slightly above the subgiant branch. Two contact systems are blue stragglers, and another two reside in the turnoff region. In the blue straggler region a total of 266 objects were found, of which 140 are proper motion (PM) members of NGC 3201, and another nineteen are field stars. Seventy-eight of the remaining objects for which we do not have PM data are located within the half-light radius from the center of the cluster, and most of them are likely genuine blue stragglers. Four variable objects in our field of view were found to coincide with X-ray sources: three chromospherically active stars and a quasar at a redshift z≈0.5.

  16. [Polymorphism of KPI-A genes from plants of the subgenus Potatoe (sect. Petota, Estolonifera and Lycopersicum) and subgenus Solanum].

    PubMed

    Krinitsyna, A A; Mel'nikova, N V; Belenikin, M S; Poltronieri, P; Santino, A; Kudriavtseva, A V; Savilova, A M; Speranskaia, A S

    2013-01-01

    Kunitz-type proteinase inhibitor proteins of group A (KPI-A) are involved in the protection of potato plants from pathogens and pests. Although sequences of a large number of the KPI-A genes from different species of cultivated potato (Solanum tuberosum subsp. tuberosum) and a few genes from tomato (Solanum lycopersicum) are known to date, information about the allelic diversity of these genes in other species of the genus Solanum is lacking. In our work, the consensus sequences of the KPI-A genes were established in two species of subgenus Potatoe sect. Petota (Solanum tuberosum subsp. andigenum--5 genes and Solanum stoloniferum--2 genes) and in the subgenus Solanum (Solanum nigrum--5 genes) by amplification, cloning, sequencing and subsequent analysis. The determined sequences of KPI-A genes were 97-100% identical to known sequences of the cultivated potato of sect. Petota (cultivated potato Solanum tuberosum subsp. tuberosum) and sect. Etuberosum (S. palustre). The interspecific variability of these genes did not exceed the intraspecific variability for all studied species except Solanum lycopersicum. The distribution of highly variable and conserved sequences in the mature protein-encoding regions was uniform for all investigated KPI-A genes. However, our attempts to amplify the homologous genes using the same primers and the genomes of Solanum dulcamarum, Solanum lycopersicum and Mandragora officinarum resulted in no product formation. Phylogenetic analysis of KPI-A diversity showed that the sequences of S. lycopersicum form an independent cluster, whereas KPI-A of S. nigrum and species of sect. Etuberosum and sect. Petota are closely related and do not form species-specific subclusters.
    Although Solanum nigrum is resistant to all known races of the oomycete Phytophthora infestans, which causes one of the most economically important diseases of solanaceous plants, the amino acid sequences encoded by the KPI-A genes of its genome differ little or not at all from those of cultivated potatoes affected by P. infestans.

  17. Community Structures of Fecal Bacteria in Cattle from Different Animal Feeding Operations

    PubMed Central

    Shanks, Orin C.; Kelty, Catherine A.; Archibeque, Shawn; Jenkins, Michael; Newton, Ryan J.; McLellan, Sandra L.; Huse, Susan M.; Sogin, Mitchell L.

    2011-01-01

    The fecal microbiome of cattle plays a critical role not only in animal health and productivity but also in food safety, pathogen shedding, and the performance of fecal pollution detection methods. Unfortunately, most published molecular surveys fail to provide adequate detail about variability in the community structures of fecal bacteria within and across cattle populations. Using massively parallel pyrosequencing of a hypervariable region of the rRNA coding region, we profiled the fecal microbial communities of cattle from six different feeding operations where cattle were subjected to consistent management practices for a minimum of 90 days. We obtained a total of 633,877 high-quality sequences from the fecal samples of 30 adult beef cattle (5 individuals per operation). Sequence-based clustering and taxonomic analyses indicate less variability within a population than between populations. Overall, bacterial community composition correlated significantly with fecal starch concentrations, largely reflected in changes in the Bacteroidetes, Proteobacteria, and Firmicutes populations. In addition, network analysis demonstrated that annotated sequences clustered by management practice and fecal starch concentration, suggesting that the structures of bovine fecal bacterial communities can be dramatically different in different animal feeding operations, even at the phylum and family taxonomic levels, and that the feeding operation is a more important determinant of the cattle microbiome than is the geographic location of the feedlot. PMID:21378055

  18. Immunoglobulin from Antarctic fish species of Rajidae family.

    PubMed

    Coscia, Maria Rosaria; Cocca, Ennio; Giacomelli, Stefano; Cuccaro, Fausta; Oreste, Umberto

    2012-03-01

    Immunoglobulins (Ig) of Chondroichthyes have been extensively studied in sharks; in contrast, in skates investigations on Ig remain scarce and fragmentary despite the high occurrence of skates in all of the major oceans of the world. To focus on Rajidae Igμ, the most abundant heavy chain isotype, we have chosen the Antarctic species Bathyraja eatonii, Bathyraja albomaculata, Bathyraja brachyurops, and Amblyraja georgiana which live at high latitudes in the Southern Ocean, and at very low temperatures. We prepared mRNA from the spleen of individuals of each species and performed RT-PCR experiments using two oligonucleotides designed on the alignment of various elasmobranch Igμ heavy chain sequences available in GenBank. The PCR products, about 1400-nt long, were cloned and sequenced. Nucleotide sequence identities calculated for the constant region domains ranged from 88.5% to 97.5% between species, and from 91.1% to 99.7% within species. In a distance tree, including also Raja erinacea sequences, two major branches were obtained, one containing Arhynchobatinae sequences, the other one Rajinae sequences. Four presumptive D gene segments were identified in the region of the VH/D/JH recombination; two different D segments were often found in the same sequence. Moreover, 5-15 genomic fragments of different lengths, carrying the gene locus encoding Igμ chain were revealed by Southern blotting analysis. B. eatonii amino acid sequences were analyzed for the positional diversity by Shannon entropy analysis, showing CH4 as the most conserved domain, and CH3 as the most variable one. B. eatonii CDR3 region length varied between 11 and 15 amino acid residues; the mean length (13.4 aa) was greater than that of Leucoraja eglanteria sequences (7.7 aa). An alignment of representative sequences of Antarctic species and R. erinacea showed that more cysteine residues not involved in the intradomain disulfide bridges were present in Antarctic species. 
Copyright © 2011 Elsevier B.V. All rights reserved.
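
    The positional diversity analysis mentioned in the abstract above can be illustrated with a minimal sketch of per-position Shannon entropy over an alignment. This is not the authors' code: the function names and the toy alignment are hypothetical, and the sketch assumes equal-length, already aligned sequences with entropy measured in bits.

```python
import math
from collections import Counter


def column_entropy(column):
    """Shannon entropy (in bits) of the residues in one alignment column."""
    counts = Counter(column)
    total = len(column)
    return -sum((n / total) * math.log2(n / total) for n in counts.values())


def positional_entropy(aligned_seqs):
    """Entropy at each position of equal-length aligned sequences."""
    if len({len(s) for s in aligned_seqs}) != 1:
        raise ValueError("sequences must be aligned to the same length")
    # zip(*seqs) walks the alignment column by column.
    return [column_entropy(col) for col in zip(*aligned_seqs)]


# Toy alignment with made-up residues (illustration only, not study data).
toy_alignment = ["ACDE", "ACDF", "ACEG", "ACEH"]
entropies = positional_entropy(toy_alignment)
```

    Under these assumptions, an invariant position scores zero and the score rises as more residues share a position, so averaging the values over a domain gives one simple way to rank domains from most conserved to most variable.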

  19. Prevalence of human pegivirus-1 and sequence variability of its E2 glycoprotein estimated from screening donors of fetal stem cell-containing material.

    PubMed

    Vitrenko, Yakov; Kostenko, Iryna; Kulebyakina, Kateryna; Sorochynska, Khrystyna

    2017-08-31

    Human pegivirus-1 (HPgV-1) is a member of the Flaviviridae family whose genomic organization and mode of cellular entry are similar to those of hepatitis C virus (HCV). The E2 glycoprotein of HPgV-1 is the principal mediator in the virus-cell interaction and as such harbors most of HPgV-1's antigenic determinants. HPgV-1 persists in blood cell precursors, which are increasingly used for cell therapy. We studied HPgV-1 prevalence in a large cohort of females donating fetal tissues for clinical use. PCR was used for screening and estimation of viral load in viremic plasma and fetal samples. Sequence analysis was performed for portions of the 5'-untranslated and E2 regions of HPgV-1 purified from donor plasmas. Sequencing was followed by phylogenetic analysis. HPgV-1 was detected in 13.7% of plasmas, 5.0% of fetal tissues and 5.4% of chorions, exceeding the prevalence of HCV in these types of samples. Transmission of HPgV-1 occurred in 25.8% of traceable mother-chorion-fetal tissue triads. For HPgV-1-positive donors, a high viral load in plasma appears to be a prerequisite for transmission. However, about one third of fetal samples acquired infection from non-viremic individuals. Sequencing of the 5'-untranslated region assigned most HPgV-1 samples to genotype 2a. At the same time, a portion of the E2 sequence provided much weaker support for this grouping, apparently due to its higher variability. Polymorphisms were detected in important structural and antigenic motifs of E2. HPgV-1 is efficiently transmitted to the fetus at early embryonic stages. High variability in E2 may pose a risk of generating pathogenic subtypes. Although HPgV-1 is considered benign and is no longer tested for mandatorily in blood banks, the virus may have adverse effects at target niches if delivered with an infected graft upon cell transplantation. This argues for the necessity of HPgV-1 testing of cell samples intended for clinical use.

  20. Recombination, rearrangement, reshuffling, and divergence in a centromeric region of rice.

    PubMed

    Ma, Jianxin; Bennetzen, Jeffrey L

    2006-01-10

    Centromeres have many unusual biological properties, including kinetochore attachment and severe repression of local meiotic recombination. These properties are partly an outcome, partly a cause, of unusual DNA structure in the centromeric region. Although several plant and animal genomes have been sequenced, most centromere sequences have not been completed or analyzed in depth. To shed light on the unique organization, variability, and evolution of centromeric DNA, detailed analysis of a 1.97-Mb sequence that includes centromere 8 (CEN8) of japonica rice was undertaken. Thirty-three long-terminal repeat (LTR)-retrotransposon families (including 11 previously unknown) were identified in the CEN8 region, totaling 245 elements and fragments that account for 67% of the region. The ratio of solo LTRs to intact elements in the CEN8 region is approximately 0.9:1, compared with approximately 2.2:1 in noncentromeric regions of rice. However, the ratio of solo LTRs to intact elements in the core of the CEN8 region (approximately 2.5:1) is higher than in any other region investigated in rice, suggesting a hotspot for unequal recombination. Comparison of the CEN8 region of japonica and its orthologous segments from indica rice indicated that approximately 15% of the intact retrotransposons and solo LTRs were inserted into CEN8 after the divergence of japonica and indica from a common ancestor, compared with approximately 50% for previously studied euchromatic regions. Frequent DNA rearrangements were observed in the CEN8 region, including a 212-kb subregion that was found to be composed of three rearranged tandem repeats. Phylogenetic analysis also revealed recent segmental duplication and extensive rearrangement and reshuffling of the CentO satellite repeats.

  1. Genealogical analyses of rabies virus strains from Brazil based on N gene alleles.

    PubMed Central

    Heinemann, M. B.; Fernandes-Matioli, F. M. C.; Cortez, A.; Soares, R. M.; Sakamoto, S. M.; Bernardi, F.; Ito, F. H.; Madeira, A. M. B. N.; Richtzenhain, L. J.

    2002-01-01

    Thirty rabies virus isolates from cows and vampire bats from different regions of São Paulo State, Southeastern Brazil and three rabies vaccines were studied genetically. The analysis was based on direct sequencing of PCR-amplified products of 600 nucleotides coding for the amino terminus of nucleoprotein gene. The sequences were checked to verify their genealogical and evolutionary relationships and possible implication for health programmes. Statistical data indicated that there were no significant genetic differences between samples isolated from distinct hosts, from different geographical regions and between samples collected in the last two decades. According to the HKA test, the variability observed in the sequences is probably due to genetic drift. Since changes in genetic material may produce modifications in the protein responsible for immunogenicity of virus, which may eventually cause vaccine failure in herds, we suggest that continuous efforts in monitoring genetic diversity in rabies virus field strains, in relation to vaccine strains, must be conducted. PMID:12113496

  2. Monitoring of canine parvovirus (CPV) strains detected in vaccinated puppies in Brazil.

    PubMed

    Castro, T X; Costa, E M; Leite, J P; Labarthe, N V; Cubel Garcia, R C N

    2011-04-01

    The objective of this study was to investigate, by partial sequencing of the VP2 protein gene, the variability of CPV detected in 37 fecal samples collected from vaccinated puppies with enteritis. Laboratory diagnosis of CPV was confirmed by HA/HI and PCR and, for sequencing analyses, two different regions of the VP2 gene were amplified by PCR. From 1995 to 2004, all strains were characterized as CPV-2a. After that, both CPV-2a and CPV-2b were detected. All CPV-2a strains showed a non-synonymous mutation at residue 297 (Ser→Ala). A synonymous substitution at AA 574 was also observed in 15/37 samples. Our findings indicate that the cases of vaccine failure are most likely not associated with the mutations detected in the sequenced regions. However, monitoring of the mutations that lead to new CPV strains is essential to determine whether current vaccines will continue to provide protection against future variants. Copyright © 2010 Elsevier Ltd. All rights reserved.

  3. Clonal origins of Vibrio cholerae O1 El Tor strains, Papua New Guinea, 2009-2011.

    PubMed

    Horwood, Paul F; Collins, Deirdre; Jonduo, Marinjho H; Rosewell, Alexander; Dutta, Samir R; Dagina, Rosheila; Ropa, Berry; Siba, Peter M; Greenhill, Andrew R

    2011-11-01

    We used multilocus sequence typing and variable number tandem repeat analysis to determine the clonal origins of Vibrio cholerae O1 El Tor strains from an outbreak of cholera that began in 2009 in Papua New Guinea. The epidemic is ongoing, and transmission risk is elevated within the Pacific region.

  4. Data characterizing the chloroplast genomes of extinct and endangered Hawaiian endemic mints (Lamiaceae) and their close relatives.

    PubMed

    Welch, Andreanna J; Collins, Katherine; Ratan, Aakrosh; Drautz-Moses, Daniela I; Schuster, Stephan C; Lindqvist, Charlotte

    2016-06-01

    These data are presented in support of a plastid phylogenomic analysis of the recent radiation of the Hawaiian endemic mints (Lamiaceae), and their close relatives in the genus Stachys, "The quest to resolve recent radiations: Plastid phylogenomics of extinct and endangered Hawaiian endemic mints (Lamiaceae)" [1]. Here we describe the chloroplast genome sequences for 12 mint taxa. Data presented include summaries of gene content and length for these taxa, structural comparison of the mint chloroplast genomes with published sequences from other species in the order Lamiales, and comparisons of variability among three Hawaiian taxa vs. three outgroup taxa. Finally, we provide a list of 108 primer pairs targeting the most variable regions within this group and designed specifically for amplification of DNA extracted from degraded herbarium material.

  5. Rapid isolation of microsatellite DNAs and identification of polymorphic mitochondrial DNA regions in the fish rotan (Perccottus glenii) invading European Russia

    USGS Publications Warehouse

    King, Timothy L.; Eackles, Michael S.; Reshetnikov, Andrey N.

    2015-01-01

    Human-mediated translocations and subsequent large-scale colonization by the invasive fish rotan (Perccottus glenii Dybowski, 1877; Perciformes, Odontobutidae), also known as Amur or Chinese sleeper, has resulted in dramatic transformations of small lentic ecosystems. However, no detailed genetic information exists on population structure, levels of effective movement, or relatedness among geographic populations of P. glenii within the European part of the range. We used massively parallel genomic DNA shotgun sequencing on the semiconductor-based Ion Torrent Personal Genome Machine (PGM) sequencing platform to identify nuclear microsatellite and mitochondrial DNA sequences in P. glenii from European Russia. Here we describe the characterization of nine nuclear microsatellite loci, ascertain levels of allelic diversity, heterozygosity, and demographic status of P. glenii collected from Ilev, Russia, one of several initial introduction points in European Russia. In addition, we mapped sequence reads to the complete P. glenii mitochondrial DNA sequence to identify polymorphic regions. Nuclear microsatellite markers developed for P. glenii yielded sufficient genetic diversity to: (1) produce unique multilocus genotypes; (2) elucidate structure among geographic populations; and (3) provide unique perspectives for analysis of population sizes and historical demographics. Among 4.9 million filtered P. glenii Ion Torrent PGM sequence reads, 11,304 mapped to the mitochondrial genome (NC_020350). This resulted in 100 % coverage of this genome to a mean coverage depth of 102X. A total of 130 variable sites were observed between the publicly available genome from China and the studied composite mitochondrial genome. Among these, 82 were diagnostic and monomorphic between the mitochondrial genomes and distributed among 15 genome regions. The polymorphic sites (N = 48) were distributed among 11 mitochondrial genome regions. 
Our results also indicate that sequence reads generated from two three-hour runs on the Ion Torrent PGM can generate a sufficient number of nuclear and mitochondrial markers to improve understanding of the evolutionary and ecological dynamics of non-model and in particular, invasive species.

  6. Regional and temporal variability of melts during a Cordilleran magma pulse: Age and chemical evolution of the Jurassic arc, eastern Mojave Desert, California

    USGS Publications Warehouse

    Barth, A.P.; Wooden, J.L.; Miller, David; Howard, Keith A.; Fox, Lydia; Schermer, Elizabeth R.; Jacobson, C.E.

    2017-01-01

    Intrusive rock sequences in the central and eastern Mojave Desert segment of the Jurassic Cordilleran arc of the western United States record regional and temporal variations in magmas generated during the second prominent pulse of Mesozoic continental arc magmatism. U/Pb zircon ages provide temporal control for describing variations in rock and zircon geochemistry that reflect differences in magma source components. These source signatures are discernible through mixing and fractionation processes associated with magma ascent and emplacement. The oldest well-dated Jurassic rocks defining initiation of the Jurassic pulse are a 183 Ma monzodiorite and a 181 Ma ignimbrite. Early to Middle Jurassic intrusive rocks comprising the main stage of magmatism include two high-K calc-alkalic groups: to the north, the deformed 183–172 Ma Fort Irwin sequence and contemporaneous rocks in the Granite and Clipper Mountains, and to the south, the 167–164 Ma Bullion sequence. A Late Jurassic suite of shoshonitic, alkali-calcic intrusive rocks, the Bristol Mountains sequence, ranges in age from 164 to 161 Ma and was emplaced as the pulse began to wane. Whole-rock and zircon trace-element geochemistry defines a compositionally coherent Jurassic arc with regional and secular variations in melt compositions. The arc evolved through the magma pulse by progressively greater input of old cratonic crust and lithospheric mantle into the arc magma system, synchronous with progressive regional crustal thickening.

  7. Lineage-Specific Biology Revealed by a Finished Genome Assembly of the Mouse

    PubMed Central

    Hillier, LaDeana W.; Zody, Michael C.; Goldstein, Steve; She, Xinwe; Bult, Carol J.; Agarwala, Richa; Cherry, Joshua L.; DiCuccio, Michael; Hlavina, Wratko; Kapustin, Yuri; Meric, Peter; Maglott, Donna; Birtle, Zoë; Marques, Ana C.; Graves, Tina; Zhou, Shiguo; Teague, Brian; Potamousis, Konstantinos; Churas, Christopher; Place, Michael; Herschleb, Jill; Runnheim, Ron; Forrest, Daniel; Amos-Landgraf, James; Schwartz, David C.; Cheng, Ze; Lindblad-Toh, Kerstin; Eichler, Evan E.; Ponting, Chris P.

    2009-01-01

    The mouse (Mus musculus) is the premier animal model for understanding human disease and development. Here we show that a comprehensive understanding of mouse biology is only possible with the availability of a finished, high-quality genome assembly. The finished clone-based assembly of the mouse strain C57BL/6J reported here has over 175,000 fewer gaps and over 139 Mb more of novel sequence, compared with the earlier MGSCv3 draft genome assembly. In a comprehensive analysis of this revised genome sequence, we are now able to define 20,210 protein-coding genes, over a thousand more than predicted in the human genome (19,042 genes). In addition, we identified 439 long, non–protein-coding RNAs with evidence for transcribed orthologs in human. We analyzed the complex and repetitive landscape of 267 Mb of sequence that was missing or misassembled in the previously published assembly, and we provide insights into the reasons for its resistance to sequencing and assembly by whole-genome shotgun approaches. Duplicated regions within newly assembled sequence tend to be of more recent ancestry than duplicates in the published draft, correcting our initial understanding of recent evolution on the mouse lineage. These duplicates appear to be largely composed of sequence regions containing transposable elements and duplicated protein-coding genes; of these, some may be fixed in the mouse population, but at least 40% of segmentally duplicated sequences are copy number variable even among laboratory mouse strains. Mouse lineage-specific regions contain 3,767 genes drawn mainly from rapidly-changing gene families associated with reproductive functions. The finished mouse genome assembly, therefore, greatly improves our understanding of rodent-specific biology and allows the delineation of ancestral biological functions that are shared with human from derived functions that are not. PMID:19468303

  8. Phylogeny and variability of Colletotrichum truncatum associated with soybean anthracnose in Brazil.

    PubMed

    Rogério, F; Ciampi-Guillardi, M; Barbieri, M C G; Bragança, C A D; Seixas, C D S; Almeida, A M R; Massola, N S

    2017-02-01

    Fungal diseases are among the main factors limiting high yields of soybean crop. Colletotrichum isolates from soybean plants with anthracnose symptoms were studied from different regions and time periods in Brazil using molecular, morphological and pathogenic analyses. Bayesian phylogenetic inference of GAPDH, HIS3 and ITS-5.8S rDNA sequences, the morphologies of colony and conidia, and inoculation tests on seeds and seedlings were performed. All isolates clustered only with Colletotrichum truncatum species in three well-separated clusters. Intraspecific genetic diversity revealed 27 distinct haplotypes in 51 fungal isolates; some of which were identical to C. truncatum sequences from other regions around the world, while others were related to alternative hosts. Conidia were falcate, hyaline, unicellular and aseptate, formed in acervuli, with variable dimensions. Despite being pathogenic to seedlings by both inoculation methods, variation was observed in the aggressiveness of the tested isolates, which was not correlated with genetic variation. The identification of C. truncatum in the sampled isolates was evidenced as being the only causal agent of soybean anthracnose in Brazil until 2007, with relevant genetic, morphological and pathogenic variability as well as a broad geographical origin. The wide distribution of the predominant C. truncatum haplotype indicated the existence of a highly efficient mechanism of pathogen dispersal over long distances, reinforcing the role of seeds as the primary source of disease inoculum. The characterization and distribution of Colletotrichum species in soybean-producing regions in Brazil is fundamental for understanding the disease epidemiology and for ensuring effective control strategies against anthracnose. © 2016 The Society for Applied Microbiology.

  9. Protein consensus-based surface engineering (ProCoS): a computer-assisted method for directed protein evolution.

    PubMed

    Shivange, Amol V; Hoeffken, Hans Wolfgang; Haefner, Stefan; Schwaneberg, Ulrich

    2016-12-01

    Protein consensus-based surface engineering (ProCoS) is a simple and efficient method for directed protein evolution combining computational analysis and molecular biology tools to engineer protein surfaces. ProCoS is based on the hypothesis that conserved residues originated from a common ancestor and that these residues are crucial for the function of a protein, whereas highly variable regions (situated on the surface of a protein) can be targeted for surface engineering to maximize performance. ProCoS comprises four main steps: (i) identification of conserved and highly variable regions; (ii) protein sequence design by substituting residues in the highly variable regions, and gene synthesis; (iii) in vitro DNA recombination of synthetic genes; and (iv) screening for active variants. ProCoS is a simple method for surface mutagenesis in which multiple sequence alignment is used for selection of surface residues based on a structural model. To demonstrate the technique's utility for directed evolution, the surface of a phytase enzyme from Yersinia mollaretii (Ymphytase) was subjected to ProCoS. Screening just 1050 clones from ProCoS engineering-guided mutant libraries yielded an enzyme with 34 amino acid substitutions. The surface-engineered Ymphytase exhibited 3.8-fold higher pH stability (at pH 2.8 for 3 h) and retained 40% of the enzyme's specific activity (400 U/mg) compared with the wild-type Ymphytase. The pH stability might be attributed to a significantly increased (20 percentage points; from 9% to 29%) number of negatively charged amino acids on the surface of the engineered phytase.
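
Step (i) of the abstract above, identifying conserved and highly variable regions from a multiple sequence alignment, can be illustrated with a minimal Python sketch. The abstract does not say how ProCoS scores columns, so per-column Shannon entropy is used here purely as an assumed stand-in; the function names, the threshold, and the toy alignment are all hypothetical.

```python
import math
from collections import Counter

def column_entropy(column):
    """Shannon entropy (bits) of the residues in one alignment column."""
    counts = Counter(column)
    total = len(column)
    return -sum((n / total) * math.log2(n / total) for n in counts.values())

def classify_columns(alignment, threshold=1.0):
    """Label each column 'conserved' (entropy 0), 'variable'
    (entropy >= threshold, an assumed cutoff) or 'intermediate'."""
    labels = []
    for column in zip(*alignment):  # iterate over alignment columns
        h = column_entropy(column)
        if h == 0:
            labels.append("conserved")
        elif h >= threshold:
            labels.append("variable")
        else:
            labels.append("intermediate")
    return labels

# Hypothetical toy alignment of four equal-length sequences (not phytase data).
toy_alignment = ["MKTA", "MRTA", "MQTG", "MKTA"]
labels = classify_columns(toy_alignment)
```

In a real workflow the variable columns would still need to be filtered against a structural model to keep only surface-exposed positions, as the abstract describes; that step is not sketched here.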

  10. Investigation of bacterial and archaeal communities: novel protocols using modern sequencing by Illumina MiSeq and traditional DGGE-cloning.

    PubMed

    Kraková, Lucia; Šoltys, Katarína; Budiš, Jaroslav; Grivalský, Tomáš; Ďuriš, František; Pangallo, Domenico; Szemes, Tomáš

    2016-09-01

    Different protocols based on Illumina high-throughput DNA sequencing and denaturing gradient gel electrophoresis (DGGE)-cloning were developed and applied for investigating hot spring related samples. The study was focused on three target genes: archaeal and bacterial 16S rRNA and mcrA of methanogenic microflora. Shorter read lengths of the currently most popular technology of sequencing by Illumina do not allow analysis of the complete 16S rRNA region, or of longer gene fragments, as was the case of Sanger sequencing. Here, we demonstrate that there is no need for special indexed or tailed primer sets dedicated to short variable regions of 16S rRNA since the presented approach allows the analysis of complete bacterial 16S rRNA amplicons (V1-V9) and longer archaeal 16S rRNA and mcrA sequences. Sample augmented with transposon is represented by a set of approximately 300 bp long fragments that can be easily sequenced by Illumina MiSeq. Furthermore, a low proportion of chimeric sequences was observed. DGGE-cloning based strategies were performed combining semi-nested PCR, DGGE and clone library construction. Comparing both investigation methods, a certain degree of complementarity was observed confirming that the DGGE-cloning approach is not obsolete. Novel protocols were created for several types of laboratories, utilizing the traditional DGGE technique or using the most modern Illumina sequencing.

  11. Desmoglein 4 diversity and correlation analysis with coat color in goat.

    PubMed

    E, G X; Zhao, Y J; Ma, Y H; Cao, G L; He, J N; Na, R S; Zhao, Z Q; Jiang, C D; Zhang, J H; Arlvd, S; Chen, L P; Qiu, X Y; Hu, W; Huang, Y F

    2016-03-04

    Desmoglein 4 (DSG4) has an important role in the development of wool traits in domestic animals. The full-length DSG4 gene, which contains 3918 bp, a complete open-reading-frame, and encodes a 1040-amino acid protein, was amplified from Liaoning cashmere goat. The sequence was compared with that of DSG4 from other animals and the results show that the DSG4 coding region is consistent with interspecies conservation. Thirteen single-nucleotide polymorphisms (SNPs) were identified in a highly variable region of DSG4, and one SNP (M-1, G>T) was significantly correlated with white and black coat color in goat. Haplotype distribution of the highly variable region of DSG4 was assessed in 179 individuals from seven goat breeds to investigate its association with coat color and its differentiation among populations. However, no significant result was obtained to indicate that DSG4 haplotypes are related to goat coat color.

  12. Conserved Structural Elements in the V3 Crown of HIV-1 gp120

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Jiang, X.; Burke, V; Totrov, M

    2010-01-01

    Binding of the third variable region (V3) of the HIV-1 envelope glycoprotein gp120 to the cell-surface coreceptors CCR5 or CXCR4 during viral entry suggests that there are conserved structural elements in this sequence-variable region. These conserved elements could serve as epitopes to be targeted by a vaccine against HIV-1. Here we perform a systematic structural analysis of representative human anti-V3 monoclonal antibodies in complex with V3 peptides, revealing that the crown of V3 has four conserved structural elements: an arch, a band, a hydrophobic core and the peptide backbone. These are either unaffected by or are subject to minimal sequence variation. As these regions are targeted by cross-clade neutralizing human antibodies, they provide a blueprint for the design of vaccine immunogens that could elicit broadly cross-reactive protective antibodies.

  13. BAUM: improving genome assembly by adaptive unique mapping and local overlap-layout-consensus approach.

    PubMed

    Wang, Anqi; Wang, Zhanyu; Li, Zheng; Li, Lei M

    2018-06-15

    It is highly desirable to assemble genomes of high continuity and consistency at low cost. The current bottleneck of draft genome continuity using the second generation sequencing (SGS) reads is primarily caused by uncertainty among repetitive sequences. Even though the single-molecule real-time sequencing technology is very promising to overcome the uncertainty issue, its relatively high cost and error rate add burden on budget or computation. Many long-read assemblers take the overlap-layout-consensus (OLC) paradigm, which is less sensitive to sequencing errors, heterozygosity and variability of coverage. However, current assemblers of SGS data do not sufficiently take advantage of the OLC approach. Aiming at minimizing uncertainty, the proposed method, BAUM, breaks the whole genome into regions by adaptive unique mapping; then the local OLC is used to assemble each region in parallel. BAUM can (i) perform reference-assisted assembly based on the genome of a close species or (ii) improve the results of existing assemblies that are obtained based on short or long sequencing reads. The tests on two eukaryote genomes, a wild rice Oryza longistaminata and a parrot Melopsittacus undulatus, show that BAUM achieved substantial improvement on genome size and continuity. Besides, BAUM reconstructed a considerable amount of repetitive regions that failed to be assembled by existing short read assemblers. We also propose statistical approaches to control the uncertainty in different steps of BAUM. http://www.zhanyuwang.xin/wordpress/index.php/2017/07/21/baum. Supplementary data are available at Bioinformatics online.

  14. Large-scale sequence and structural comparisons of human naive and antigen-experienced antibody repertoires.

    PubMed

    DeKosky, Brandon J; Lungu, Oana I; Park, Daechan; Johnson, Erik L; Charab, Wissam; Chrysostomou, Constantine; Kuroda, Daisuke; Ellington, Andrew D; Ippolito, Gregory C; Gray, Jeffrey J; Georgiou, George

    2016-05-10

    Elucidating how antigen exposure and selection shape the human antibody repertoire is fundamental to our understanding of B-cell immunity. We sequenced the paired heavy- and light-chain variable regions (VH and VL, respectively) from large populations of single B cells combined with computational modeling of antibody structures to evaluate sequence and structural features of human antibody repertoires at unprecedented depth. Analysis of a dataset comprising 55,000 antibody clusters from CD19(+)CD20(+)CD27(-) IgM-naive B cells, >120,000 antibody clusters from CD19(+)CD20(+)CD27(+) antigen-experienced B cells, and >2,000 RosettaAntibody-predicted structural models across three healthy donors led to a number of key findings: (i) VH and VL gene sequences pair in a combinatorial fashion without detectable pairing restrictions at the population level; (ii) certain VH:VL gene pairs were significantly enriched or depleted in the antigen-experienced repertoire relative to the naive repertoire; (iii) antigen selection increased antibody paratope net charge and solvent-accessible surface area; and (iv) public heavy-chain third complementarity-determining region (CDR-H3) antibodies in the antigen-experienced repertoire showed signs of convergent paired light-chain genetic signatures, including shared light-chain third complementarity-determining region (CDR-L3) amino acid sequences and/or Vκ,λ-Jκ,λ genes. The data reported here address several longstanding questions regarding antibody repertoire selection and development and provide a benchmark for future repertoire-scale analyses of antibody responses to vaccination and disease.

  15. Mycelial Propagation and Molecular Phylogenetic Relationships of Commercially Cultivated Agrocybe cylindracea based on ITS Sequences and RAPD

    PubMed Central

    Alam, Nuhu; Kim, Jeong Hwa; Shim, Mi Ja; Lee, U Youn

    2010-01-01

    This study evaluated the optimal vegetative growth conditions and molecular phylogenetic relationships of eleven strains of Agrocybe cylindracea collected from different ecological regions of Korea, China and Taiwan. The optimal temperature and pH for mycelial growth were observed at 25℃ and 6. Potato dextrose agar and Hennerberg were the favorable media for vegetative growth, whereas glucose tryptone was unfavorable. Dextrin, maltose, and fructose were the most effective carbon sources. The most suitable nitrogen sources were arginine and glycine, whereas methionine, alanine, histidine, and urea were least effective for the mycelial propagation of A. cylindracea. The internal transcribed spacer (ITS) regions of rDNA were amplified using PCR. The sequence of ITS2 was more variable than that of ITS1, while the 5.8S sequences were identical. The reciprocal homologies of the ITS sequences ranged from 98 to 100%. The strains were also analyzed by random amplification of polymorphic DNA (RAPD) using 20 arbitrary primers. Fifteen primers efficiently amplified the genomic DNA. The average number of polymorphic bands observed per primer was 3.8. The numbers of amplified bands varied based on the primers and strains, with polymorphic fragments ranging from 0.1 to 2.9 kb. The results of RAPD analysis were similar to the ITS region sequences. The results revealed that RAPD and ITS techniques were well suited for detecting the genetic diversity of all A. cylindracea strains tested. PMID:23956633

  16. Human papillomavirus type 18 variant lineages in United States populations characterized by sequence analysis of LCR-E6, E2, and L1 regions.

    PubMed

    Arias-Pulido, Hugo; Peyton, Cheri L; Torrez-Martínez, Norah; Anderson, D Nelson; Wheeler, Cosette M

    2005-07-20

    While HPV 16 variant lineages have been well characterized, the knowledge about HPV 18 variants is limited. In this study, HPV 18 nucleotide variations in the E2 hinge region were characterized by sequence analysis in 47 control and 51 tumor specimens. Fifty of these specimens were randomly selected for sequencing of an LCR-E6 segment and 20 samples representative of LCR-E6 and E2 sequence variants were examined across the L1 region. A total of 2770 nucleotides per HPV 18 variant genome were considered in this study. HPV 18 variant nucleotides were linked among all gene segments analyzed and grouped into three main branches: Asian-American (AA), European (E), and African (Af). These three branches were equally distributed among controls and cases and when stratified by Hispanic and non-Hispanic ethnicities. Among invasive cervical cancer cases, no significant differences in the three HPV variant branches were observed among ethnic groups or when stratified by histopathology (squamous vs. adenocarcinoma). The Af branch showed the greatest nucleotide variability when compared to the HPV 18 reference sequence and was more closely related to HPV 45 than either AA or E branches. Our data also characterize nucleotide and amino acid variations in the L1 capsid gene among HPV 18 variants, which may be relevant to vaccine strategies and subsequent studies of naturally occurring HPV 18 variants. Several novel HPV 18 nucleotide variations were identified in this study.

  17. Scaling Linguistic Characterization of Precipitation Variability

    NASA Astrophysics Data System (ADS)

    Primo, C.; Gutierrez, J. M.

    2003-04-01

    Rainfall variability is influenced by changes in the aggregation of daily rainfall. This problem is of great importance for hydrological, agricultural and ecological applications. Rainfall averages, or accumulations, are widely used as standard climatic parameters. However, different aggregation schemes may lead to the same average or accumulated values. In this paper we present a fractal method to characterize different aggregation schemes. The method provides scaling exponents characterizing weekly or monthly rainfall patterns for a given station. To this aim, we establish an analogy with linguistic analysis, considering precipitation as a discrete variable (e.g., rain, no rain). Each weekly, or monthly, symbolic precipitation sequence of observed precipitation is then considered as a "word" (in this case, a binary word) which defines a specific weekly rainfall pattern. Thus, each site defines a "language" characterized by the words observed in that site during a period representative of the climatology. Then, the more variable the observed weekly precipitation sequences, the more complex the obtained language. To characterize these languages, we first applied Zipf's method, obtaining scaling histograms of rank ordered frequencies. However, to obtain significant exponents, the scaling must be maintained over several orders of magnitude, requiring long sequences of daily precipitation which are not available at particular stations. Thus this analysis is not suitable for applications involving particular stations (such as regionalization). Then, we introduce an alternative fractal method applicable to data from local stations. The so-called Chaos-Game method uses Iterated Function Systems (IFS) for graphically representing rainfall languages, in a way that complex languages define complex graphical patterns. The box-counting dimension and the entropy of the resulting patterns are used as linguistic parameters to quantitatively characterize the complexity of the patterns. We illustrate the high climatological discrimination power of the linguistic parameters in the Iberian peninsula, when compared with other standard techniques (such as seasonal mean accumulated precipitation). As an example, standard and linguistic parameters are used as inputs for a clustering regionalization method, comparing the resulting clusters.
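
The "word" analogy in the abstract above can be made concrete with a minimal Python sketch that covers only the first, Zipf-style step: a daily rain/no-rain record is cut into weekly binary words, and the words are ranked by relative frequency. The Chaos-Game/IFS representation and the box-counting dimension are not reproduced here. The function names and the 28-day series are hypothetical illustrations, not the authors' code or data.

```python
from collections import Counter

def weekly_words(daily_rain, word_len=7):
    """Split a daily rain/no-rain series (1/0) into non-overlapping
    binary words of word_len days; a trailing partial week is dropped."""
    n_words = len(daily_rain) // word_len
    return [
        "".join(str(int(bool(d))) for d in daily_rain[i * word_len:(i + 1) * word_len])
        for i in range(n_words)
    ]

def rank_frequencies(words):
    """Return (word, relative frequency) pairs, most frequent first."""
    counts = Counter(words)
    total = sum(counts.values())
    return [(w, c / total) for w, c in counts.most_common()]

# Hypothetical 28-day record (1 = rain, 0 = no rain), not station data.
daily = [0, 0, 0, 0, 0, 0, 0,
         1, 0, 0, 0, 0, 0, 0,
         0, 0, 0, 0, 0, 0, 0,
         1, 1, 0, 0, 0, 0, 0]
words = weekly_words(daily)
ranked = rank_frequencies(words)
```

A Zipf-type exponent would then come from the slope of log frequency against log rank. As the abstract notes, that fit needs scaling over several orders of magnitude, which a record this short cannot provide.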

  18. Variable stars around selected open clusters in the VVV area: Young Stellar Objects

    NASA Astrophysics Data System (ADS)

    Medina, Nicolas; Borissova, Jura; Bayo, Amelia; Kurtev, Radostin; Lucas, Philip

    2017-09-01

    Time-varying phenomena are one of the most substantial sources of astrophysical information, and have led to many fundamental discoveries in modern astronomy. We have developed an automated tool to search and analyze variable sources in the near infrared Ks band, using the data from the Vista Variables in the Vía Láctea (VVV) ESO Public Survey ([5, 8]). One of our main goals is to investigate the Young Stellar Objects (YSOs) in the Galactic star forming regions, looking for variability and new pre-main sequence star clusters. Here we present the newly discovered YSOs within some selected stellar clusters in our Galaxy.

  19. Amino acid and structural variability of Yersinia pestis LcrV protein

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Anisimov, A P; Dentovskaya, S V; Panfertsev, E A

    2009-11-09

    The LcrV protein is a multifunctional virulence factor and protective antigen of the plague bacterium which is generally conserved between the epidemic strains of Yersinia pestis. They investigated the diversity in the LcrV sequences among non-epidemic Y. pestis strains which have a limited virulence in selected animal models and for humans. Sequencing of lcrV genes from ten Y. pestis strains belonging to different phylogenetic groups (subspecies) showed that the LcrV proteins possess four major variable hotspots at positions 18, 72, 273, and 324-326. These major variations, together with other minor substitutions in amino acid sequences, allowed them to classify the LcrV alleles into five sequence types (A-E). They observed that the strains of different Y. pestis subspecies can have the same type of LcrV, and different types of LcrV can exist within the same natural plague focus. The LcrV polymorphisms were structurally analyzed by comparing the modeled structures of LcrV from all available strains. All changes except one occurred either in flexible regions or on the surface of the protein, but local chemical properties (i.e. those of a hydrophobic, hydrophilic, amphipathic, or charged nature) were conserved across all of the strains. Polymorphisms in flexible and surface regions are likely subject to less selective pressure, and have a limited impact on the structure. In contrast, the substitution of tryptophan at position 113 with either glutamic acid or glycine likely has a serious influence on the regional structure of the protein, and these mutations might have an effect on the function of LcrV. The polymorphisms at positions 18, 72 and 273 were accountable for differences in oligomerization of LcrV. The importance of the latter property in emergence of epidemic strains of Y. pestis during evolution of this pathogen will need to be further investigated.

  20. Complete Sequencing of pNDM-HK Encoding NDM-1 Carbapenemase from a Multidrug-Resistant Escherichia coli Strain Isolated in Hong Kong

    PubMed Central

    Ho, Pak Leung; Lo, Wai U.; Yeung, Man Kiu; Lin, Chi Ho; Chow, Kin Hung; Ang, Irene; Tong, Amy Hin Yan; Bao, Jessie Yun-Juan; Lok, Si; Lo, Janice Yee Chi

    2011-01-01

    Background: The emergence of plasmid-mediated carbapenemases, such as NDM-1 in Enterobacteriaceae, is a major public health issue. Since they mediate resistance to virtually all β-lactam antibiotics and there is often co-resistance to other antibiotic classes, the therapeutic options for infections caused by these organisms are very limited. Methodology: We characterized the first NDM-1 producing E. coli isolate recovered in Hong Kong. The plasmid encoding the metallo-β-lactamase gene was sequenced. Principal Findings: The plasmid, pNDM-HK, readily transferred to E. coli J53 at high frequencies. It belongs to the broad host range IncL/M incompatibility group and is 88803 bp in size. Sequence alignment showed that pNDM-HK has a 55 kb backbone which shared 97% homology with pEL60 originating from the plant pathogen, Erwinia amylovora in Lebanon and a 28.9 kb variable region. The plasmid backbone includes the mucAB genes mediating ultraviolet light resistance. The 28.9 kb region has a composite transposon-like structure which includes intact or truncated genes associated with resistance to β-lactams (bla TEM-1, bla NDM-1, Δbla DHA-1), aminoglycosides (aacC2, armA), sulphonamides (sul1) and macrolides (mel, mph2). It also harbors the following mobile elements: IS26, ISCR1, tnpU, tnpAcp2, tnpD, ΔtnpATn1 and insL. Certain blocks within the 28.9 kb variable region had homology with the corresponding sequences in the widely disseminated plasmids, pCTX-M3, pMUR050 and pKP048 originating from bacteria in Poland in 1996, in Spain in 2002 and in China in 2006, respectively. Significance: The genetic support of NDM-1 gene suggests that it has evolved through complex pathways. The association with broad host range plasmid and multiple mobile genetic elements explain its observed horizontal mobility in multiple bacterial taxa. PMID:21445317

  1. Variability of Disk Emission in Pre-Main Sequence and Related Stars. I. HD 31648 and HD 163296 - Isolated Herbig Ae Stars Driving Herbig-Haro Flows

    NASA Technical Reports Server (NTRS)

    Sitko, Michael L.; Carpenter, William J.; Kimes, Robin L.; Lynch, David K.; Russell, Ray W.; Rudy, Richard J.; Mazuk, Stephan M.; Venturini, Catherine C.; Puetter, Richard C.; Grady, Carol A.

    2007-01-01

    Infrared photometry and spectroscopy covering a time span of a quarter century are presented for HD 31648 (MWC 480) and HD 163296 (MWC 275). Both are isolated Herbig Ae stars that exhibit signs of active accretion, including driving bipolar flows with embedded Herbig-Haro (HH) objects. HD 163296 was found to be relatively quiescent photometrically in its inner disk region, with the exception of a major increase in emitted flux in a broad wavelength region centered near 3 μm in 2002. In contrast, HD 31648 has exhibited sporadic changes in the entire 3-13 μm region throughout this span of time. In both stars the changes in the 1-5 μm flux indicate structural changes in the region of the disk near the dust sublimation zone, possibly causing its distance from the star to vary with time. Repeated thermal cycling through this region will result in the preferential survival of large grains, and an increase in the degree of crystallinity. The variability observed in these objects has important consequences for the interpretation of other types of observations. For example, source variability will compromise models based on interferometry measurements unless the interferometry observations are accompanied by nearly-simultaneous photometric data.

  2. Variability of Disk Emission in Pre-Main-Sequence and Related Stars. I. HD 31648 and HD 163296: Isolated Herbig Ae Stars Driving Herbig-Haro Flows

    NASA Astrophysics Data System (ADS)

    Sitko, Michael L.; Carpenter, William J.; Kimes, Robin L.; Wilde, J. Leon; Lynch, David K.; Russell, Ray W.; Rudy, Richard J.; Mazuk, Stephan M.; Venturini, Catherine C.; Puetter, Richard C.; Grady, Carol A.; Polomski, Elisha F.; Wisnewski, John P.; Brafford, Suellen M.; Hammel, H. B.; Perry, R. Brad

    2008-05-01

    Infrared photometry and spectroscopy covering a time span of a quarter-century are presented for HD 31648 (MWC 480) and HD 163296 (MWC 275). Both are isolated Herbig Ae stars that exhibit signs of active accretion, including driving bipolar flows with embedded Herbig-Haro (HH) objects. HD 163296 was found to be relatively quiescent photometrically in its inner disk region, with the exception of a major increase in emitted flux in a broad wavelength region centered near 3 μm in 2002. In contrast, HD 31648 has exhibited sporadic changes in the entire 3-13 μm region throughout this span of time. In both stars, the changes in the 1-5 μm flux indicate structural changes in the region of the disk near the dust sublimation zone, possibly causing its distance from the star to vary with time. Repeated thermal cycling through this region will result in the preferential survival of large grains, and an increase in the degree of crystallinity. The variability observed in these objects has important consequences for the interpretation of other types of observations. For example, source variability will compromise models based on interferometry measurements unless the interferometry observations are accompanied by nearly simultaneous photometric data.

  3. In Silico Prediction Analysis of Idiotope-Driven T–B Cell Collaboration in Multiple Sclerosis

    PubMed Central

    Høglund, Rune A.; Lossius, Andreas; Johansen, Jorunn N.; Homan, Jane; Benth, Jūratė Šaltytė; Robins, Harlan; Bogen, Bjarne; Bremel, Robert D.; Holmøy, Trygve

    2017-01-01

    Memory B cells acting as antigen-presenting cells are believed to be important in multiple sclerosis (MS), but the antigen they present remains unknown. We hypothesized that B cells may activate CD4+ T cells in the central nervous system of MS patients by presenting idiotopes from their own immunoglobulin variable regions on human leukocyte antigen (HLA) class II molecules. Here, we use bioinformatics prediction analysis of B cell immunoglobulin variable regions from 11 MS patients and 6 controls with other inflammatory neurological disorders (OINDs), to assess whether the prerequisites for such idiotope-driven T–B cell collaboration are present. Our findings indicate that idiotopes from the complementarity determining region (CDR) 3 of MS patients on average have high predicted affinities for disease associated HLA-DRB1*15:01 molecules and are predicted to be endosomally processed by cathepsin S and L in positions that allow such HLA binding to occur. Additionally, complementarity determining region 3 sequences from cerebrospinal fluid (CSF) B cells from MS patients contain on average more rare T cell-exposed motifs that could potentially escape tolerance and stimulate CD4+ T cells than CSF B cells from OIND patients. Many of these features were associated with preferential use of the IGHV4 gene family by CSF B cells from MS patients. This is the first study to combine high-throughput sequencing of patient immune repertoires with large-scale prediction analysis and provides key indicators for future in vitro and in vivo analyses. PMID:29038659

  4. In Silico Prediction Analysis of Idiotope-Driven T-B Cell Collaboration in Multiple Sclerosis.

    PubMed

    Høglund, Rune A; Lossius, Andreas; Johansen, Jorunn N; Homan, Jane; Benth, Jūratė Šaltytė; Robins, Harlan; Bogen, Bjarne; Bremel, Robert D; Holmøy, Trygve

    2017-01-01

    Memory B cells acting as antigen-presenting cells are believed to be important in multiple sclerosis (MS), but the antigen they present remains unknown. We hypothesized that B cells may activate CD4+ T cells in the central nervous system of MS patients by presenting idiotopes from their own immunoglobulin variable regions on human leukocyte antigen (HLA) class II molecules. Here, we use bioinformatics prediction analysis of B cell immunoglobulin variable regions from 11 MS patients and 6 controls with other inflammatory neurological disorders (OINDs), to assess whether the prerequisites for such idiotope-driven T-B cell collaboration are present. Our findings indicate that idiotopes from the complementarity determining region (CDR) 3 of MS patients on average have high predicted affinities for disease associated HLA-DRB1*15:01 molecules and are predicted to be endosomally processed by cathepsin S and L in positions that allow such HLA binding to occur. Additionally, complementarity determining region 3 sequences from cerebrospinal fluid (CSF) B cells from MS patients contain on average more rare T cell-exposed motifs that could potentially escape tolerance and stimulate CD4+ T cells than CSF B cells from OIND patients. Many of these features were associated with preferential use of the IGHV4 gene family by CSF B cells from MS patients. This is the first study to combine high-throughput sequencing of patient immune repertoires with large-scale prediction analysis and provides key indicators for future in vitro and in vivo analyses.

  5. Identification of Human Papillomavirus Type 16 L1 Surface Loops Required for Neutralization by Human Sera

    PubMed Central

    Carter, Joseph J.; Wipf, Greg C.; Madeleine, Margaret M.; Schwartz, Stephen M.; Koutsky, Laura A.; Galloway, Denise A.

    2006-01-01

    The variable surface loops on human papillomavirus (HPV) virions required for type-specific neutralization by human sera remain poorly defined. To determine which loops are required for neutralization, a series of hybrid virus-like particles (VLPs) were used to adsorb neutralizing activity from HPV type 16 (HPV16)-reactive human sera before being tested in an HPV16 pseudovirion neutralization assay. The hybrid VLPs used were composed of L1 sequences of either HPV16 or HPV31, on which one or two regions were replaced with homologous sequences from the other type. The regions chosen for substitution were the five known loops that form surface epitopes recognized by monoclonal antibodies and two additional variable regions between residues 400 and 450. Pretreatment of human sera, previously found to react to HPV16 VLPs in enzyme-linked immunosorbent assays, with wild-type HPV16 VLPs and hybrid VLPs that retained the neutralizing epitopes reduced or eliminated the ability of sera to inhibit pseudovirus infection in vitro. Surprisingly, substitution of a single loop often ablated the ability of VLPs to adsorb neutralizing antibodies from human sera. However, for all sera tested, multiple surface loops were found to be important for neutralizing activity. Three regions, defined by loops DE, FG, and HI, were most frequently identified as being essential for binding by neutralizing antibodies. These observations are consistent with the existence of multiple neutralizing epitopes on the HPV virion surface. PMID:16641259

  6. Identification of human papillomavirus type 16 L1 surface loops required for neutralization by human sera.

    PubMed

    Carter, Joseph J; Wipf, Greg C; Madeleine, Margaret M; Schwartz, Stephen M; Koutsky, Laura A; Galloway, Denise A

    2006-05-01

    The variable surface loops on human papillomavirus (HPV) virions required for type-specific neutralization by human sera remain poorly defined. To determine which loops are required for neutralization, a series of hybrid virus-like particles (VLPs) were used to adsorb neutralizing activity from HPV type 16 (HPV16)-reactive human sera before being tested in an HPV16 pseudovirion neutralization assay. The hybrid VLPs used were composed of L1 sequences of either HPV16 or HPV31, on which one or two regions were replaced with homologous sequences from the other type. The regions chosen for substitution were the five known loops that form surface epitopes recognized by monoclonal antibodies and two additional variable regions between residues 400 and 450. Pretreatment of human sera, previously found to react to HPV16 VLPs in enzyme-linked immunosorbent assays, with wild-type HPV16 VLPs and hybrid VLPs that retained the neutralizing epitopes reduced or eliminated the ability of sera to inhibit pseudovirus infection in vitro. Surprisingly, substitution of a single loop often ablated the ability of VLPs to adsorb neutralizing antibodies from human sera. However, for all sera tested, multiple surface loops were found to be important for neutralizing activity. Three regions, defined by loops DE, FG, and HI, were most frequently identified as being essential for binding by neutralizing antibodies. These observations are consistent with the existence of multiple neutralizing epitopes on the HPV virion surface.

  7. HBV Genotypic Variability in Cuba

    PubMed Central

    Loureiro, Carmen L.; Aguilar, Julio C.; Aguiar, Jorge; Muzio, Verena; Pentón, Eduardo; Garcia, Daymir; Guillen, Gerardo; Pujol, Flor H.

    2015-01-01

    The genetic diversity of HBV in human population is often a reflection of its genetic admixture. The aim of this study was to explore the genotypic diversity of HBV in Cuba. The S genomic region of Cuban HBV isolates was sequenced and for selected isolates the complete genome or precore-core sequence was analyzed. The most frequent genotype was A (167/250, 67%), mainly A2 (149, 60%) but also A1 and one A4. A total of 77 isolates were classified as genotype D (31%), with co-circulation of several subgenotypes (56 D4, 2 D1, 5 D2, 7 D3/6 and 7 D7). Three isolates belonged to genotype E, two to H and one to B3. Complete genome sequence analysis of selected isolates confirmed the phylogenetic analysis performed with the S region. Mutations or polymorphisms in precore region were more common among genotype D compared to genotype A isolates. The HBV genotypic distribution in this Caribbean island correlates with the Y lineage genetic background of the population, where a European and African origin prevails. HBV genotypes E, B3 and H isolates might represent more recent introductions. PMID:25742179
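
    The genotype counts and percentages quoted above can be cross-checked with a few lines of arithmetic. The sketch below uses only the figures reported in the abstract; the variable and function names are illustrative assumptions, not part of the study.

```python
# Genotype counts as reported in the abstract above.
genotype_counts = {"A": 167, "D": 77, "E": 3, "H": 2, "B3": 1}

# Genotype D subgenotype counts as reported in the abstract above.
d_subgenotypes = {"D4": 56, "D1": 2, "D2": 5, "D3/6": 7, "D7": 7}

# Total number of classified isolates implied by the genotype counts.
total = sum(genotype_counts.values())


def percent(count, total):
    """Return the share of `count` in `total` as a whole-number percentage."""
    return round(100 * count / total)


# Whole-number percentage of isolates assigned to each genotype.
summary = {genotype: percent(n, total) for genotype, n in genotype_counts.items()}

for genotype, share in summary.items():
    print(f"Genotype {genotype}: {genotype_counts[genotype]}/{total} ({share}%)")
```

    Under these figures the genotype counts add up to the 250 isolates used as the denominator, and the subgenotype counts add up to the 77 genotype D isolates.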

  8. Automated subtyping of HIV-1 genetic sequences for clinical and surveillance purposes: performance evaluation of the new REGA version 3 and seven other tools.

    PubMed

    Pineda-Peña, Andrea-Clemencia; Faria, Nuno Rodrigues; Imbrechts, Stijn; Libin, Pieter; Abecasis, Ana Barroso; Deforche, Koen; Gómez-López, Arley; Camacho, Ricardo J; de Oliveira, Tulio; Vandamme, Anne-Mieke

    2013-10-01

    To investigate differences in pathogenesis, diagnosis and resistance pathways between HIV-1 subtypes, an accurate subtyping tool for large datasets is needed. We aimed to evaluate the performance of automated subtyping tools to classify the different subtypes and circulating recombinant forms using pol, the most sequenced region in clinical practice. We also present the upgraded version 3 of the Rega HIV subtyping tool (REGAv3). HIV-1 pol sequences (PR+RT) for 4674 patients retrieved from the Portuguese HIV Drug Resistance Database, and 1872 pol sequences trimmed from full-length genomes retrieved from the Los Alamos database were classified with statistical-based tools such as COMET, jpHMM and STAR; similarity-based tools such as NCBI and Stanford; and phylogenetic-based tools such as REGA version 2 (REGAv2), REGAv3, and SCUEAL. The performance of these tools, for pol, and for PR and RT separately, was compared in terms of reproducibility, sensitivity and specificity with respect to the gold standard which was manual phylogenetic analysis of the pol region. The sensitivity and specificity for subtypes B and C was more than 96% for seven tools, but was variable for other subtypes such as A, D, F and G. With regard to the most common circulating recombinant forms (CRFs), the sensitivity and specificity for CRF01_AE was ~99% with statistical-based tools, with phylogenetic-based tools and with Stanford, one of the similarity based tools. CRF02_AG was correctly identified for more than 96% by COMET, REGAv3, Stanford and STAR. All the tools reached a specificity of more than 97% for most of the subtypes and the two main CRFs (CRF01_AE and CRF02_AG). Other CRFs were identified only by COMET, REGAv2, REGAv3, and SCUEAL and with variable sensitivity. When analyzing sequences for PR and RT separately, the performance for PR was generally lower and variable between the tools. Similarity and statistical-based tools were 100% reproducible, but this was lower for phylogenetic-based tools such as REGA (~99%) and SCUEAL (~96%). REGAv3 had an improved performance for subtype B and CRF02_AG compared to REGAv2 and is now able to also identify all epidemiologically relevant CRFs. In general the best performing tools, in alphabetical order, were COMET, jpHMM, REGAv3, and SCUEAL when analyzing pure subtypes in the pol region, and COMET and REGAv3 when analyzing most of the CRFs. Based on this study, we recommend to confirm subtyping with 2 well performing tools, and be cautious with the interpretation of short sequences. Copyright © 2013 The Authors. Published by Elsevier B.V. All rights reserved.

  9. Characterization of Aftershock Sequences from Large Strike-Slip Earthquakes Along Geometrically Complex Faults

    NASA Astrophysics Data System (ADS)

    Sexton, E.; Thomas, A.; Delbridge, B. G.

    2017-12-01

    Large earthquakes often exhibit complex slip distributions and occur along non-planar fault geometries, resulting in variable stress changes throughout the region of the fault hosting aftershocks. To better discern the role of geometric discontinuities on aftershock sequences, we compare areas of enhanced and reduced Coulomb failure stress and mean stress for systematic differences in the time dependence and productivity of these aftershock sequences. In strike-slip faults, releasing structures, including stepovers and bends, experience an increase in both Coulomb failure stress and mean stress during an earthquake, promoting fluid diffusion into the region and further failure. Conversely, Coulomb failure stress and mean stress decrease in restraining bends and stepovers in strike-slip faults, and fluids diffuse away from these areas, discouraging failure. We examine spatial differences in seismicity patterns along structurally complex strike-slip faults which have hosted large earthquakes, such as the 1992 Mw 7.3 Landers, the 2010 Mw 7.2 El Mayor-Cucapah, the 2014 Mw 6.0 South Napa, and the 2016 Mw 7.0 Kumamoto events. We characterize the behavior of these aftershock sequences with the Epidemic Type Aftershock-Sequence Model (ETAS). In this statistical model, the total occurrence rate of aftershocks induced by an earthquake is λ(t) = λ_0 + \sum_{i:t_i
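
    The rate expression in this record breaks off mid-formula. For orientation only, a commonly used temporal form of the ETAS conditional intensity is sketched below; it is a standard textbook parameterization and not necessarily the one the authors used. Apart from λ(t), λ_0 and t_i, every symbol is an assumption not present in the record: M_i is the magnitude of event i, M_c a reference (cutoff) magnitude, and K, α, c and p are fitted productivity and decay parameters.

```latex
\lambda(t) = \lambda_0 + \sum_{i:\, t_i < t} K \, e^{\alpha (M_i - M_c)} \, (t - t_i + c)^{-p}
```

    In this form λ_0 is the background rate, and each earlier event i contributes an Omori-type decaying term whose size grows exponentially with its magnitude.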

  10. A specific indel marker for the Philippines Schistosoma japonicum revealed by analysis of mitochondrial genome sequences.

    PubMed

    Li, Juan; Chen, Fen; Sugiyama, Hiromu; Blair, David; Lin, Rui-Qing; Zhu, Xing-Quan

    2015-07-01

    In the present study, near-complete mitochondrial (mt) genome sequences for Schistosoma japonicum from different regions in the Philippines and Japan were amplified and sequenced. Comparisons among S. japonicum from the Philippines, Japan, and China revealed a geographically based length difference in mt genomes, but the mt genomic organization and gene arrangement were the same. Sequence differences among samples from the Philippines and all samples from the three endemic areas were 0.57-2.12 and 0.76-3.85 %, respectively. The most variable part of the mt genome was the non-coding region. In the coding portion of the genome, protein-coding genes varied more than rRNA genes and tRNAs. The near-complete mt genome sequences for Philippine specimens were identical in length (14,091 bp) which was 4 bp longer than those of S. japonicum samples from Japan and China. This indel provides a unique genetic marker for S. japonicum samples from the Philippines. Phylogenetic analyses based on the concatenated amino acids of 12 protein-coding genes showed that samples of S. japonicum clustered according to their geographical origins. The identified mitochondrial indel marker will be useful for tracing the source of S. japonicum infection in humans and animals in Southeast Asia.

  11. The sequence of camelpox virus shows it is most closely related to variola virus, the cause of smallpox.

    PubMed

    Gubser, Caroline; Smith, Geoffrey L

    2002-04-01

    Camelpox virus (CMPV) and variola virus (VAR) are orthopoxviruses (OPVs) that share several biological features and cause high mortality and morbidity in their single host species. The sequence of a virulent CMPV strain was determined; it is 202182 bp long, with inverted terminal repeats (ITRs) of 6045 bp and has 206 predicted open reading frames (ORFs). As for other poxviruses, the genes are tightly packed with little non-coding sequence. Most genes within 25 kb of each terminus are transcribed outwards towards the terminus, whereas genes within the centre of the genome are transcribed from either DNA strand. The central region of the genome contains genes that are highly conserved in other OPVs and 87 of these are conserved in all sequenced chordopoxviruses. In contrast, genes towards either terminus are more variable and encode proteins involved in host range, virulence or immunomodulation. In some cases, these are broken versions of genes found in other OPVs. The relationship of CMPV to other OPVs was analysed by comparisons of DNA and predicted protein sequences, repeats within the ITRs and arrangement of ORFs within the terminal regions. Each comparison gave the same conclusion: CMPV is the closest known virus to variola virus, the cause of smallpox.

  12. Barcode Identifiers as a Practical Tool for Reliable Species Assignment of Medically Important Black Yeast Species

    PubMed Central

    Heinrichs, Guido; de Hoog, G. Sybren

    2012-01-01

    Herpotrichiellaceous black yeasts and relatives comprise severe pathogens flanked by nonpathogenic environmental siblings. Reliable identification by conventional methods is notoriously difficult. Molecular identification is hampered by the sequence variability in the internal transcribed spacer (ITS) domain caused by difficult-to-sequence homopolymeric regions and by poor taxonomic attribution of sequences deposited in GenBank. Here, we present a potential solution using short barcode identifiers (27 to 50 bp) based on ITS2 ribosomal DNA (rDNA), which allows unambiguous definition of species-specific fragments. Starting from proven sequences of ex-type and authentic strains, we were able to describe 103 identifiers. Multiple BLAST searches of these proposed barcode identifiers in GenBank revealed uniqueness for 100 taxonomic entities, whereas the three remaining identifiers each matched with two entities, but the species of these identifiers could easily be discriminated by differences in the remaining ITS regions. Using the proposed barcode identifiers, a 4.1-fold increase of 100% matches in GenBank was achieved in comparison to the classical approach using the complete ITS sequences. The proposed barcode identifiers will be made accessible for the diagnostic laboratory in a permanently updated online database, thereby providing a highly practical, reliable, and cost-effective tool for identification of clinically important black yeasts and relatives. PMID:22785187

  13. Molecular genetic and morphological analyses of the African wild dog (Lycaon pictus).

    PubMed

    Girman, D J; Kat, P W; Mills, M G; Ginsberg, J R; Borner, M; Wilson, V; Fanshawe, J H; Fitzgibbon, C; Lau, L M; Wayne, R K

    1993-01-01

    African wild dog populations have declined precipitously during the last 100 years in eastern Africa. The possible causes of this decline include a reduction in prey abundance and habitat; disease; and loss of genetic variability accompanied by inbreeding depression. We examined the levels of genetic variability and distinctiveness among populations of African wild dogs using mitochondrial DNA (mtDNA) restriction site and sequence analyses and multivariate analysis of cranial and dental measurements. Our results indicate that the genetic variability of eastern African wild dog populations is comparable to that of southern Africa and similar to levels of variability found in other large canids. Southern and eastern populations of wild dogs show about 1% divergence in mtDNA sequence and form two monophyletic assemblages containing three mtDNA genotypes each. No genotypes are shared between the two regions. With one exception, all wild dogs examined from zoos had southern African genotypes. Morphological analysis supports the distinction of eastern and southern African wild dog populations, and we suggest they should be considered separate subspecies. An eastern African wild dog breeding program should be initiated to ensure preservation of the eastern African form and to slow the loss of genetic variability that, while not yet apparent, will inevitably occur if wild populations continue to decline. Finally, we examined the phylogenetic relationships of wild dogs to other wolf-like canids through analysis of 736 base pairs (bp) of cytochrome b sequence and showed wild dogs to belong to a phylogenetically distinct lineage of the wolf-like canids.

  14. Early Miocene sequence development across the New Jersey margin

    USGS Publications Warehouse

    Monteverde, D.H.; Mountain, Gregory S.; Miller, K.G.

    2008-01-01

    Sequence stratigraphy provides an understanding of the interplay between eustasy, sediment supply and accommodation in the sedimentary construction of passive margins. We used this approach to follow the early to middle Miocene growth of the New Jersey margin and analyse the connection between relative changes of sea level and variable sediment supply. Eleven candidate sequence boundaries were traced in high-resolution multi-channel seismic profiles across the inner margin and matched to geophysical log signatures and lithologic changes in ODP Leg 150X onshore coreholes. Chronologies at these drill sites were then used to assign ages to the intervening seismic sequences. We conclude that the regional and global correlation of early Miocene sequences suggests a dominant role of global sea-level change but margin progradation was controlled by localized sediment contribution and that local conditions played a large role in sequence formation and preservation. Lowstand deposits were regionally restricted and their locations point to both single and multiple sediment sources. The distribution of highstand deposits, by contrast, documents redistribution by along shelf currents. We find no evidence that sea level fell below the elevation of the clinoform rollover, and the existence of extensive lowstand deposits seaward of this inflection point indicates efficient cross-shelf sediment transport mechanisms despite the apparent lack of well-developed fluvial drainage. © 2008 The Authors. Journal compilation © 2008 Blackwell Publishing.

  15. Regularized rare variant enrichment analysis for case-control exome sequencing data.

    PubMed

    Larson, Nicholas B; Schaid, Daniel J

    2014-02-01

    Rare variants have recently garnered an immense amount of attention in genetic association analysis. However, unlike methods traditionally used for single marker analysis in GWAS, rare variant analysis often requires some method of aggregation, since single marker approaches are poorly powered for typical sequencing study sample sizes. Advancements in sequencing technologies have rendered next-generation sequencing platforms a realistic alternative to traditional genotyping arrays. Exome sequencing in particular not only provides base-level resolution of genetic coding regions, but also a natural paradigm for aggregation via genes and exons. Here, we propose the use of penalized regression in combination with variant aggregation measures to identify rare variant enrichment in exome sequencing data. In contrast to marginal gene-level testing, we simultaneously evaluate the effects of rare variants in multiple genes, focusing on gene-based least absolute shrinkage and selection operator (LASSO) and exon-based sparse group LASSO models. By using gene membership as a grouping variable, the sparse group LASSO can be used as a gene-centric analysis of rare variants while also providing a penalized approach toward identifying specific regions of interest. We apply extensive simulations to evaluate the performance of these approaches with respect to specificity and sensitivity, comparing these results to multiple competing marginal testing methods. Finally, we discuss our findings and outline future research. © 2013 WILEY PERIODICALS, INC.
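
    As a reading aid, the sparse group LASSO penalty mentioned above can be written out. The block below is a sketch of one widely used parameterization for a case-control (logistic) outcome, not necessarily the exact objective the authors fitted. All symbols are assumptions introduced here: ℓ(β) is the logistic log-likelihood, β^{(g)} is the sub-vector of coefficients for the p_g aggregated variant features in gene g, G is the number of genes, λ ≥ 0 sets the overall penalty strength, and α ∈ [0, 1] balances the two penalty terms.

```latex
\hat{\beta} = \arg\min_{\beta} \left\{ -\ell(\beta)
  + \lambda \left[ (1 - \alpha) \sum_{g=1}^{G} \sqrt{p_g} \, \bigl\lVert \beta^{(g)} \bigr\rVert_2
  + \alpha \, \lVert \beta \rVert_1 \right] \right\}
```

    Setting α = 1 recovers the ordinary LASSO, α = 0 gives the group LASSO that selects whole genes, and intermediate values allow sparsity both across genes and among features within a selected gene.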

  16. Quantification of peptides from immunoglobulin constant and variable regions by LC-MRM MS for assessment of multiple myeloma patients.

    PubMed

    Remily-Wood, Elizabeth R; Benson, Kaaron; Baz, Rachid C; Chen, Y Ann; Hussein, Mohamad; Hartley-Brown, Monique A; Sprung, Robert W; Perez, Brianna; Liu, Richard Z; Yoder, Sean J; Teer, Jamie K; Eschrich, Steven A; Koomen, John M

    2014-10-01

    Quantitative MS assays for Igs are compared with existing clinical methods in samples from patients with plasma cell dyscrasias, for example, multiple myeloma (MM). Using LC-MS/MS data, Ig constant region peptides, and transitions were selected for LC-MRM MS. Quantitative assays were used to assess Igs in serum from 83 patients. RNA sequencing and peptide-based LC-MRM are used to define peptides for quantification of the disease-specific Ig. LC-MRM assays quantify serum levels of Igs and their isoforms (IgG1-4, IgA1-2, IgM, IgD, and IgE, as well as kappa (κ) and lambda (λ) light chains). LC-MRM quantification has been applied to single samples from a patient cohort and a longitudinal study of an IgE patient undergoing treatment, to enable comparison with existing clinical methods. Proof-of-concept data for defining and monitoring variable region peptides are provided using the H929 MM cell line and two MM patients. LC-MRM assays targeting constant region peptides determine the type and isoform of the involved Ig and quantify its expression; the LC-MRM approach has improved sensitivity compared with the current clinical method, but slightly higher inter-assay variability. Detection of variable region peptides is a promising way to improve Ig quantification, which could produce a dramatic increase in sensitivity over existing methods, and could further complement current clinical techniques. © 2014 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  17. Analysis of blaCTX-M-Carrying Plasmids from Escherichia coli Isolates Collected in the BfT-GermVet Study

    PubMed Central

    Schink, Anne-Kathrin; Kadlec, Kristina; Schwarz, Stefan

    2011-01-01

    In this study, 417 Escherichia coli isolates from defined disease conditions of companion and farm animals collected in the BfT-GermVet study were investigated for the presence of extended-spectrum β-lactamase (ESBL) genes. Three ESBL-producing E. coli isolates were identified among the 100 ampicillin-resistant isolates. The E. coli isolates 168 and 246, of canine and porcine origins, respectively, harbored blaCTX-M-1, and the canine isolate 913 harbored blaCTX-M-15, as confirmed by PCR and sequence analysis. The isolates 168 and 246 belonged to the novel multilocus sequence typing (MLST) types ST1576 and ST1153, respectively, while isolate 913 had the MLST type ST410. The ESBL genes were located on structurally related IncN plasmids in isolates 168 and 246 and on an IncF plasmid in isolate 913. The blaCTX-M-1 upstream regions of plasmids pCTX168 and pCTX246 were similar, whereas the downstream regions showed structural differences. The genetic environment of the blaCTX-M-15 gene on plasmid pCTX913 differed distinctly from that of both blaCTX-M-1 genes. Detailed sequence analysis showed that the integration of insertion sequences, as well as interplasmid recombination events, accounted for the structural variability in the blaCTX-M gene regions. PMID:21685166

  18. Use of DNA barcodes to identify flowering plants.

    PubMed

    Kress, W John; Wurdack, Kenneth J; Zimmer, Elizabeth A; Weigt, Lee A; Janzen, Daniel H

    2005-06-07

    Methods for identifying species by using short orthologous DNA sequences, known as "DNA barcodes," have been proposed and initiated to facilitate biodiversity studies, identify juveniles, associate sexes, and enhance forensic analyses. The cytochrome c oxidase 1 sequence, which has been found to be widely applicable in animal barcoding, is not appropriate for most species of plants because of a much slower rate of cytochrome c oxidase 1 gene evolution in higher plants than in animals. We therefore propose the nuclear internal transcribed spacer region and the plastid trnH-psbA intergenic spacer as potentially usable DNA regions for applying barcoding to flowering plants. The internal transcribed spacer is the most commonly sequenced locus used in plant phylogenetic investigations at the species level and shows high levels of interspecific divergence. The trnH-psbA spacer, although short (approximately 450 bp), is the most variable plastid region in angiosperms and is easily amplified across a broad range of land plants. Comparison of the total plastid genomes of tobacco and deadly nightshade, enhanced with trials on widely divergent angiosperm taxa, including closely related species in seven plant families and a group of species sampled from a local flora encompassing 50 plant families (for a total of 99 species, 80 genera, and 53 families), suggests that the sequences in this pair of loci have the potential to discriminate among the largest number of plant species for barcoding purposes.

  19. Analysis of sequence variation among smeDEF multi drug efflux pump genes and flanking DNA from defined 16S rRNA subgroups of clinical Stenotrophomonas maltophilia isolates.

    PubMed

    Gould, Virginia C; Okazaki, Aki; Howe, Robin A; Avison, Matthew B

    2004-08-01

    To determine the level of variation in the smeDEF efflux pump and smeT transcriptional regulator genes among three defined 16S rRNA sequence subgroups of clinical Stenotrophomonas maltophilia isolates. smeDEF sequencing used a PCR genome walking approach. Determination of the sequence surrounding smeDEF used a flanking primer PCR method and specific primers anchored in smeD or smeF together with random primers. smeDEF is chromosomal and located in the same position in the chromosome in all three subgroups of isolates. Flanking smeD is a gene, smeT, encoding a putative transcriptional repressor for smeDEF. Variation at these loci among the isolates is considerably lower (up to 10%) than at intrinsic beta-lactamase loci (up to 30%) in the same isolates, implying greater functional constraint. The smeD-smeT intergenic region contains a highly conserved section, which maps with previously predicted promoter/operator regions, and a hypervariable untranslated region, which can be used to subgroup clinical isolates. These data provide further evidence that it is possible to group clinical isolates of the inherently variable species, S. maltophilia, based on genotypic properties. Isolate D457, in which most work concerning smeDEF expression has been performed, does not fall into S. maltophilia subgroup A, which is the most typical.

  20. Complete sequence of Tvv1, a family of Ty 1 copia-like retrotransposons of Vitis vinifera L., reconstituted by chromosome walking.

    PubMed

    Pelsy, F.; Merdinoglu, D.

    2002-09-01

    A chromosome-walking strategy was used to sequence and characterize retrotransposons in the grapevine genome. The reconstitution of a family of retroelements, named Tvv1, was achieved by six successive steps. These elements share a single, highly conserved open reading frame 4,153 nucleotides-long, putatively encoding the gag, pro, int, rt and rh proteins. Comparison of the Tvv1 open reading frame coding potential with those of drosophila copia and tobacco Tnt1, revealed that Tvv1 is closely related to Ty 1 copia-like retrotransposons. A highly variable untranslated leader region, upstream of the open reading frame, allowed us to differentiate Tvv1 variants, which represent a family of at least 28 copies, in varying sizes. This internal region is flanked by two long terminal repeats in direct orientation, sized between 149 and 157 bp. Among elements theoretically sized from 4,970 to 5,550 bp, we describe the full-length sequence of a reference element Tvv1-1, 5,343 nucleotides-long. The full-length sequence of Tvv1-1 compared to pea PDR1 shows a 53.3% identity. In addition, both elements contain long terminal repeats of nearly the same size in which the U5 region could be entirely absent. Therefore, we assume that Tvv1 and PDR1 could constitute a particular class of short LTRs retroelements.

  1. Compound haplotypes at Xp11.23 and human population growth in Eurasia.

    PubMed

    Alonso, S; Armour, J A L

    2004-09-01

    To investigate patterns of diversity and the evolutionary history of Eurasians, we have sequenced a 2.8 kb region at Xp11.23 in a sample of African and Eurasian chromosomes. This region is in a long intron of CLCN5 and is immediately flanked by a highly variable minisatellite, DXS255, and a human-specific Ta0 LINE. Compared to Africans, Eurasians showed a marked reduction in sequence diversity. The main Euro-Asiatic haplotype seems to be the ancestral haplotype for the whole sample. Coalescent simulations, including recombination and exponential growth, indicate a median length of strong linkage disequilibrium, up to approximately 9 kb for this area. The Ka/Ks ratio between the coding sequence of human CLCN5 and its mouse orthologue is much less than 1. This implies that the region sequenced is unlikely to be under the strong influence of positive selective processes on CLCN5, mutations in which have been associated with disorders such as Dent's disease. In contrast, a scenario based on a population bottleneck and exponential growth seems a more likely explanation for the reduced diversity observed in Eurasians. Coalescent analysis and linked minisatellite diversity (which reaches a gene diversity value greater than 98% in Eurasians) suggest an estimated age of origin of the Euro-Asiatic diversity compatible with a recent out-of-Africa model for colonization of Eurasia by modern Homo sapiens.

  2. Low-copy nuclear primers and ycf1 primers in Cactaceae.

    PubMed

    Franck, Alan R; Cochrane, Bruce J; Garey, James R

    2012-10-01

    To increase the number of variable regions available for phylogenetic study in the Cactaceae, primers were developed for a portion of the plastid ycf1 gene and intron-spanning regions of two low-copy nuclear genes (isi1, nhx1). Primers were tested on several families within Caryophyllales, focusing on the Cactaceae. Gel electrophoresis indicated positive amplification in most samples. Sequences of these three regions (isi1, nhx1, ycf1) from Harrisia exhibited variation similar to or greater than two plastid regions (atpB-rbcL intergenic spacer and rpl16 intron). The isi, nhx, and ycf1 primers amplify phylogenetically useful information applicable to the Cactaceae and other families in the Caryophyllales.

  3. Highly sensitive and unbiased approach for elucidating antibody repertoires

    PubMed Central

    Lin, Sherry G.; Ba, Zhaoqing; Du, Zhou; Zhang, Yu; Hu, Jiazhi; Alt, Frederick W.

    2016-01-01

    Developing B lymphocytes undergo V(D)J recombination to assemble germ-line V, D, and J gene segments into exons that encode the antigen-binding variable region of Ig heavy (H) and light (L) chains. IgH and IgL chains associate to form the B-cell receptor (BCR), which, upon antigen binding, activates B cells to secrete BCR as an antibody. Each of the huge number of clonally independent B cells expresses a unique set of IgH and IgL variable regions. The ability of V(D)J recombination to generate vast primary B-cell repertoires results from a combinatorial assortment of large numbers of different V, D, and J segments, coupled with diversification of the junctions between them to generate the complementarity-determining region 3 (CDR3) for antigen contact. Approaches to evaluate in depth the content of primary antibody repertoires and, ultimately, to study how they are further molded by secondary mutation and affinity maturation processes are of great importance to the B-cell development, vaccine, and antibody fields. We now describe an unbiased, sensitive, and readily accessible assay, referred to as high-throughput genome-wide translocation sequencing-adapted repertoire sequencing (HTGTS-Rep-seq), to quantify antibody repertoires. HTGTS-Rep-seq quantitatively identifies the vast majority of IgH and IgL V(D)J exons, including their unique CDR3 sequences, from progenitor and mature mouse B lineage cells via the use of specific J primers. HTGTS-Rep-seq also accurately quantifies DJH intermediates and V(D)J exons in either productive or nonproductive configurations. HTGTS-Rep-seq should be useful for studies of human samples, including clonal B-cell expansions, and also for following antibody affinity maturation processes. PMID:27354528

  4. Clonal Origins of Vibrio cholerae O1 El Tor Strains, Papua New Guinea, 2009–2011

    PubMed Central

    Collins, Deirdre; Jonduo, Marinjho H.; Rosewell, Alexander; Dutta, Samir R.; Dagina, Rosheila; Ropa, Berry; Siba, Peter M.; Greenhill, Andrew R.

    2011-01-01

    We used multilocus sequence typing and variable number tandem repeat analysis to determine the clonal origins of Vibrio cholerae O1 El Tor strains from an outbreak of cholera that began in 2009 in Papua New Guinea. The epidemic is ongoing, and transmission risk is elevated within the Pacific region. PMID:22099099

  5. The complete chloroplast genome sequence of Actinidia arguta using the PacBio RS II platform

    PubMed Central

    Lin, Miaomiao; Qi, Xiujuan; Chen, Jinyong; Sun, Leiming; Zhong, Yunpeng; Fang, Jinbao; Hu, Chungen

    2018-01-01

    Actinidia arguta is the most basal species in a phylogenetically and economically important genus in the family Actinidiaceae. To better understand the molecular basis of the Actinidia arguta chloroplast (cp), we sequenced the complete cp genome from A. arguta using Illumina and PacBio RS II sequencing technologies. The cp genome from A. arguta was 157,611 bp in length and composed of a pair of 24,232 bp inverted repeats (IRs) separated by a 20,463 bp small single copy region (SSC) and an 88,684 bp large single copy region (LSC). Overall, the cp genome contained 113 unique genes. The cp genomes from A. arguta and three other Actinidia species from GenBank were subjected to a comparative analysis. Indel mutation events and high frequencies of base substitution were identified, and the accD and ycf2 genes showed a high degree of variation within Actinidia. Forty-seven simple sequence repeats (SSRs) and 155 repetitive structures were identified, further demonstrating the rapid evolution in Actinidia. The cp genome analysis and the identification of variable loci provide vital information for understanding the evolution and function of the chloroplast and for characterizing Actinidia population genetics. PMID:29795601
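
    The region lengths reported in this abstract can be cross-checked with a short calculation. The sketch below assumes the usual quadripartite layout of a circular chloroplast genome (one LSC, one SSC, and two IR copies); the function name is illustrative and not taken from the paper.

```python
# Region lengths in bp, copied from the abstract above.
LSC = 88_684   # large single copy region
SSC = 20_463   # small single copy region
IR = 24_232    # one inverted repeat; the genome carries a pair
REPORTED_TOTAL = 157_611


def quadripartite_length(lsc: int, ssc: int, ir: int) -> int:
    """Total genome length assuming LSC + SSC + two IR copies (hypothetical helper)."""
    return lsc + ssc + 2 * ir


total = quadripartite_length(LSC, SSC, IR)
print(total, total == REPORTED_TOTAL)  # prints: 157611 True
```

    The four regions sum exactly to the reported genome length, so the figures in the abstract are internally consistent.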

  6. Origin and distribution of Sporothrix globosa causing sapronoses in Asia.

    PubMed

    Moussa, Tarek A A; Kadasa, Naif M S; Al Zahrani, Hassan S; Ahmed, Sarah Abdallah; Feng, Peiying; Gerrits van den Ende, Albertus H G; Zhang, Yu; Kano, Rui; Li, Fuqiu; Li, Shanshan; Song, Yang; Dong, Bilin; Rossato, Luana; Dolatabadi, Somayeh; Hoog, Sybren de

    2017-05-01

    The aim of the study was to evaluate the main sources and epidemiological patterns and speculate on the evolutionary origin of Sporothrix globosa in Asia. Case and case series literature on sporotrichosis in Asia from January 2007 onwards were reviewed using meta-analysis. Phylogenetic analysis of relevant S. globosa was carried out on the basis of concatenated sequences of ITS, TEF3 and CAL. A haplotype network of CAL sequences of 281 Sporothrix isolates was analysed to determine the population structure of S. globosa. Nearly all cases of sporotrichosis caused by S. globosa in Asia were human. In contrast to the remaining pathogenic Sporothrix species, feline transmission was exceptional; nearly all regional cat-associated cases were caused by Sporothrix schenckii. While the latter species was highly variable and showed recombination, S. globosa seemed to be a clonal offshoot, as was Sporothrix brasiliensis. The origin of the segregants was located in an area of high variability in S. schenckii with a relatively high frequency of Asian strains. In Asia, S. globosa was the prevalent species. The low diversity of S. globosa suggested a recent divergence with a founder effect of low variability from the variable ancestral species, S. schenckii.

  7. Metacommunity analysis of amoeboid protists in grassland soils

    PubMed Central

    Fiore-Donno, Anna Maria; Weinert, Jan; Wubet, Tesfaye; Bonkowski, Michael

    2016-01-01

    This study reveals the diversity and distribution of two major ubiquitous groups of soil amoebae, the genus Acanthamoeba and the Myxomycetes (plasmodial slime-moulds) that are rarely, if ever, recovered in environmental sampling studies. We analyzed 150 grassland soil samples from three Biodiversity Exploratories study regions in Germany. We developed specific primers targeting the V2 variable region in the first part of the small subunit of the ribosomal RNA gene for high-throughput pyrotag sequencing. From ca. 1 million reads, applying very stringent filtering and clustering parameters to avoid overestimation of the diversity, we obtained 273 acanthamoebal and 338 myxomycete operational taxonomic units (OTUs, 96% similarity threshold). This number is consistent with the genetic diversity known in the two investigated lineages, but unequalled to date by any environmental sampling study. Only very few OTUs were identical to already known sequences. Strikingly different OTU assemblages were found between the three German regions (PerMANOVA p-value = 0.001) and even between sites of the same region (multiple-site Simpson-based similarity indices <0.4), showing steep biogeographical gradients. PMID:26750872
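
    The abstract reports OTUs at a 96% similarity threshold but does not name the clustering method. As a rough illustration only, the sketch below uses greedy centroid clustering on pre-aligned, equal-length reads; the function names, the identity measure, and the greedy strategy are all assumptions and should not be read as the authors' pipeline.

```python
def identity(a: str, b: str) -> float:
    """Fraction of matching positions between two equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("sketch assumes pre-aligned, equal-length reads")
    return sum(x == y for x, y in zip(a, b)) / len(a)


def greedy_otus(reads, threshold=0.96):
    """Assign each read to the first centroid it matches at >= threshold.

    A read that matches no existing centroid founds a new OTU
    (hypothetical greedy scheme, not the method used in the study).
    """
    centroids, otus = [], []
    for read in reads:
        for i, centroid in enumerate(centroids):
            if identity(read, centroid) >= threshold:
                otus[i].append(read)
                break
        else:
            centroids.append(read)
            otus.append([read])
    return otus


# Toy reads: one mismatch in 50 positions (98% identity) joins the first OTU,
# five mismatches (90% identity) start a second one.
base = "A" * 50
near = "C" + "A" * 49
far = "C" * 5 + "A" * 45
print(len(greedy_otus([base, near, far])))  # prints: 2
```

    Greedy clustering depends on input order, which is one reason stringent filtering before clustering matters when the aim is to avoid overestimating diversity.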

  8. Metacommunity analysis of amoeboid protists in grassland soils.

    PubMed

    Fiore-Donno, Anna Maria; Weinert, Jan; Wubet, Tesfaye; Bonkowski, Michael

    2016-01-11

    This study reveals the diversity and distribution of two major ubiquitous groups of soil amoebae, the genus Acanthamoeba and the Myxomycetes (plasmodial slime-moulds) that are rarely, if ever, recovered in environmental sampling studies. We analyzed 150 grassland soil samples from three Biodiversity Exploratories study regions in Germany. We developed specific primers targeting the V2 variable region in the first part of the small subunit of the ribosomal RNA gene for high-throughput pyrotag sequencing. From ca. 1 million reads, applying very stringent filtering and clustering parameters to avoid overestimation of the diversity, we obtained 273 acanthamoebal and 338 myxomycete operational taxonomic units (OTUs, 96% similarity threshold). This number is consistent with the genetic diversity known in the two investigated lineages, but unequalled to date by any environmental sampling study. Only very few OTUs were identical to already known sequences. Strikingly different OTU assemblages were found between the three German regions (PerMANOVA p-value = 0.001) and even between sites of the same region (multiple-site Simpson-based similarity indices <0.4), showing steep biogeographical gradients.

  9. Salmonella enterica Prophage Sequence Profiles Reflect Genome Diversity and Can Be Used for High Discrimination Subtyping.

    PubMed

    Mottawea, Walid; Duceppe, Marc-Olivier; Dupras, Andrée A; Usongo, Valentine; Jeukens, Julie; Freschi, Luca; Emond-Rheault, Jean-Guillaume; Hamel, Jeremie; Kukavica-Ibrulj, Irena; Boyle, Brian; Gill, Alexander; Burnett, Elton; Franz, Eelco; Arya, Gitanjali; Weadge, Joel T; Gruenheid, Samantha; Wiedmann, Martin; Huang, Hongsheng; Daigle, France; Moineau, Sylvain; Bekal, Sadjia; Levesque, Roger C; Goodridge, Lawrence D; Ogunremi, Dele

    2018-01-01

    Non-typhoidal Salmonella is a leading cause of foodborne illness worldwide. Prompt and accurate identification of the sources of Salmonella responsible for disease outbreaks is crucial to minimize infections and eliminate ongoing sources of contamination. Current subtyping tools including single nucleotide polymorphism (SNP) typing may be inadequate, in some instances, to provide the required discrimination among epidemiologically unrelated Salmonella strains. Prophage genes represent the majority of the accessory genes in bacterial genomes and have potential to be used as high discrimination markers in Salmonella. In this study, the prophage sequence diversity in different Salmonella serovars and genetically related strains was investigated. Using whole genome sequences of 1,760 isolates of S. enterica representing 151 Salmonella serovars and 66 closely related bacteria, prophage sequences were identified from assembled contigs using PHASTER. We detected 154 different prophages in S. enterica genomes. Prophage sequences were highly variable among S. enterica serovars with a median ± interquartile range (IQR) of 5 ± 3 prophage regions per genome. While some prophage sequences were highly conserved among the strains of specific serovars, few regions were lineage specific. Therefore, strains belonging to each serovar could be clustered separately based on their prophage content. Analysis of S. Enteritidis isolates from seven outbreaks generated distinct prophage profiles for each outbreak. Taken altogether, the diversity of the prophage sequences correlates with genome diversity. Prophage repertoires provide an additional marker for differentiating S. enterica subtypes during foodborne outbreaks.

  10. Oligonucleotide indexing of DNA barcodes: identification of tuna and other scombrid species in food products.

    PubMed

    Botti, Sara; Giuffra, Elisabetta

    2010-08-23

    DNA barcodes are a global standard for species identification and have countless applications in the medical, forensic and alimentary fields, but few barcoding methods work efficiently in samples in which DNA is degraded, e.g. foods and archival specimens. This limits the choice of target regions harbouring a sufficient number of diagnostic polymorphisms. The method described here uses existing PCR and sequencing methodologies to detect mitochondrial DNA polymorphisms in complex matrices such as foods. The reported application allowed discrimination among 17 fish species of the Scombridae family with high commercial interest, such as mackerels, bonitos and tunas, which are often present in processed seafood. The approach can be easily upgraded with the release of new genetic diversity information to increase the range of detected species. Cocktails of primers are designed for PCR using publicly available sequences of the target region. They are composed of a fixed 5' region and of variable 3' cocktail portions that allow amplification of any member of a group of species of interest. The population of short amplicons is directly sequenced and indexed using primers containing a longer 5' region and the non-polymorphic portion of the cocktail portion. A 226 bp region of CytB was selected as target after collection and screening of 148 online sequences; 85 SNPs were found, of which 75 were present in at least two sequences. Primers were also designed for two shorter sub-fragments that could be amplified from highly degraded samples. The test was used on 103 samples of seafood (canned tuna and scomber, tuna salad, tuna sauce) and could successfully detect the presence of different or additional species that were not identified on the labelling of canned tuna, tuna salad and sauce samples. The described method is largely independent of the degree of degradation of the DNA source and can thus be applied to processed seafood.
    Moreover, the method is highly flexible: publicly available sequence information on mitochondrial genomes is rapidly increasing for most species, facilitating the choice of target sequences and the improvement of resolution of the test. This is particularly important for discrimination of marine and aquaculture species for which genome information is still limited.

  11. Phylogenetics of the phlebotomine sand fly group Verrucarum (Diptera: Psychodidae: Lutzomyia).

    PubMed

    Cohnstaedt, Lee W; Beati, Lorenza; Caceres, Abraham G; Ferro, Cristina; Munstermann, Leonard E

    2011-06-01

    Within the sand fly genus Lutzomyia, the Verrucarum species group contains several of the principal vectors of American cutaneous leishmaniasis and human bartonellosis in the Andean region of South America. The group encompasses 40 species for which the taxonomic status, phylogenetic relationships, and role of each species in disease transmission remain unresolved. Mitochondrial cytochrome c oxidase I (COI) phylogenetic analysis of a 667-bp fragment supported the morphological classification of the Verrucarum group into series. Genetic sequences from seven species were grouped in well-supported monophyletic lineages. Four species, however, clustered in two paraphyletic lineages that indicate conspecificity--the Lutzomyia longiflocosa-Lutzomyia sauroida pair and the Lutzomyia quasitownsendi-Lutzomyia torvida pair. COI sequences were also evaluated as a taxonomic tool based on interspecific genetic variability within the Verrucarum group and the intraspecific variability of one of its members, Lutzomyia verrucarum, across its known distribution.

  12. Structural classification of CDR-H3 revisited: a lesson in antibody modeling.

    PubMed

    Kuroda, Daisuke; Shirai, Hiroki; Kobori, Masato; Nakamura, Haruki

    2008-11-15

    Among the six complementarity-determining regions (CDRs) in the variable domains of an antibody, the third CDR of the heavy chain (CDR-H3), which lies in the center of the antigen-binding site, plays a particularly important role in antigen recognition. CDR-H3 shows significant variability in its length, sequence, and structure. Although difficult, model building of this segment is the most critical step in antibody modeling. Since our first proposal of the "H3-rules," which classify CDR-H3 structure based on amino acid sequence, the number of experimentally determined antibody structures has increased. Here, we revise these H3-rules and propose an improved classification scheme for CDR-H3 structure modeling. In addition, we determine the common features of CDR-H3 in antibody drugs as well as discuss the concept of "antibody druggability," which can be applied as an indicator of antibody evaluation during drug discovery.

  13. Identification of a novel 15.5 kb SHOX deletion associated with marked intrafamilial phenotypic variability and analysis of its molecular origin.

    PubMed

    Alexandrou, Angelos; Papaevripidou, Ioannis; Tsangaras, Kyriakos; Alexandrou, Ioanna; Tryfonidis, Marios; Christophidou-Anastasiadou, Violetta; Zamba-Papanicolaou, Eleni; Koumbaris, George; Neocleous, Vassos; Phylactou, Leonidas A; Skordis, Nicos; Tanteles, George A; Sismani, Carolina

    2016-12-01

    Haploinsufficiency of the short stature homeobox-containing gene (SHOX) has been shown to result in a spectrum of phenotypes ranging from Leri-Weill dyschondrosteosis (LWD) at the more severe end to SHOX-related short stature at the milder end of the spectrum. Most alterations are whole gene deletions, point mutations within the coding region, or microdeletions in its flanking sequences. Here, we present the clinical and molecular data as well as the potential molecular mechanism underlying a novel microdeletion, causing a variable SHOX-related haploinsufficiency disorder in a three-generation family. The phenotype resembles that of LWD in females; in males, however, the phenotypic expression is milder. The 15523-bp SHOX intragenic deletion, encompassing exons 3-6, was initially detected by array-CGH, followed by MLPA analysis. Sequencing of the breakpoints indicated an Alu recombination-mediated deletion (ARMD) as the potential causative mechanism.

  14. Phylogenetics of the Phlebotomine Sand Fly Group Verrucarum (Diptera: Psychodidae: Lutzomyia)

    PubMed Central

    Cohnstaedt, Lee W.; Beati, Lorenza; Caceres, Abraham G.; Ferro, Cristina; Munstermann, Leonard E.

    2011-01-01

    Within the sand fly genus Lutzomyia, the Verrucarum species group contains several of the principal vectors of American cutaneous leishmaniasis and human bartonellosis in the Andean region of South America. The group encompasses 40 species for which the taxonomic status, phylogenetic relationships, and role of each species in disease transmission remain unresolved. Mitochondrial cytochrome c oxidase I (COI) phylogenetic analysis of a 667-bp fragment supported the morphological classification of the Verrucarum group into series. Genetic sequences from seven species were grouped in well-supported monophyletic lineages. Four species, however, clustered in two paraphyletic lineages that indicate conspecificity—the Lutzomyia longiflocosa–Lutzomyia sauroida pair and the Lutzomyia quasitownsendi–Lutzomyia torvida pair. COI sequences were also evaluated as a taxonomic tool based on interspecific genetic variability within the Verrucarum group and the intraspecific variability of one of its members, Lutzomyia verrucarum, across its known distribution. PMID:21633028

  15. Characterization and phylogenetic analysis of the swine leukocyte antigen 3 gene from Korean native pigs.

    PubMed

    Chung, H Y; Choi, Y C; Park, H N

    2015-05-18

    We investigated the phylogenetic relationships between pig breeds, compared the genetic similarity between humans and pigs, and provided basic genetic information on Korean native pigs (KNPs), using genetic variants of the swine leukocyte antigen 3 (SLA-3) gene. Primers were based on sequences from GenBank (accession Nos. AF464010 and AF464009). Polymerase chain reaction analysis amplified approximately 1727 bp of segments, which contained 1086 bp of coding regions and 641 bp of the 3'- and 5'-untranslated regions. Bacterial artificial chromosome clones of miniature pigs were used for sequencing the SLA-3 genomic region, which was 3114 bp in total length, including the coding (1086 bp) and non-coding (2028 bp) regions. Sequence analysis detected 53 single nucleotide polymorphisms (SNPs), based on a minor allele frequency greater than 0.01, which is low compared with other pig breeds, and the results suggest that there is low genetic variability in KNPs. Comparative analysis revealed that humans possess approximately three times more genetic variation than do pigs. Approximately 71% of SNPs in exons 2 and 3 were detected in KNPs, and exon 5 in humans is a highly polymorphic region. Newly identified sequences of SLA-3 using KNPs were submitted to GenBank (accession No. DQ992512-18). Cluster analysis revealed that KNPs were grouped according to three major alleles: SLA-3*0502 (DQ992518), SLA-3*0302 (DQ992513 and DQ992516), and SLA-3*0303 (DQ992512, DQ992514, DQ992515, and DQ992517). Alignments revealed that humans have a relatively close genetic relationship with pigs and chimpanzees. The information provided by this study may be useful in KNP management.

  16. Hybridization drives evolution of apomicts in Rubus subgenus Rubus: evidence from microsatellite markers.

    PubMed

    Šarhanová, Petra; Sharbel, Timothy F; Sochor, Michal; Vašut, Radim J; Dancák, Martin; Trávnícek, Bohumil

    2017-08-01

    Rubus subgenus Rubus is a group of mostly apomictic and polyploid species with a complicated taxonomy and history of ongoing hybridization. The only polyploid series with prevailing sexuality is the series Glandulosi, although the apomictic series Discolores and Radula also retain a high degree of sexuality, which is influenced by environmental conditions and/or pollen donors. The aim of this study is to detect sources of genetic variability, determine the origin of apomictic taxa and validate microsatellite markers by cloning and sequencing. A total of 206 individuals from two central European regions were genotyped for 11 nuclear microsatellite loci and the chloroplast trnL-trnF region. Microsatellite alleles were further sequenced in order to determine the exact repeat number and to detect size homoplasy due to insertions/deletions in flanking regions. The results confirm that apomictic microspecies of ser. Radula are derived from crosses between sexual series Glandulosi and apomictic series Discolores, whereby the apomict acts as pollen donor. Each apomictic microspecies is derived from a single distinct genotype differing from the parental taxa, suggesting stabilized clonal reproduction. Intraspecific variation within apomicts is considerably low compared with sexual series Glandulosi, and reflects somatic mutation accumulation. While facultative apomicts produce clonal offspring, sexual species are the conduits of origin for new genetically different apomictic lineages. One of the main driving forces of evolution and speciation in the highly apomictic subgenus Rubus in central Europe is sexuality in the series Glandulosi. Palaeovegetation data suggest that initial hybridizations took place over different time periods in the two studied regions, and that the successful origin and spread of apomictic microspecies of the series Radula took place over several millennia.
    Additionally, the cloning and sequencing show that standard evaluations of microsatellite repeat numbers underestimate genetic variability considering homoplasy in allele size.

  17. Decreased mutation frequencies among immunoglobulin G variable region genes during viremic HIV-1 infection.

    PubMed

    Bowers, Elisabeth; Scamurra, Ronald W; Asrani, Anil; Beniguel, Lydie; MaWhinney, Samantha; Keays, Kathryne M; Thurn, Joseph R; Janoff, Edward N

    2014-01-01

    HIV-1 infection is complicated by high rates of opportunistic infections against which specific antibodies contribute to immune defense. Antibody function depends on somatic hypermutation (SHM) of variable regions of immunoglobulin heavy chain genes (VH-D-J). We characterized the frequency of SHM in expressed IgG mRNA immunoglobulin transcripts from control and HIV-1-infected patients. We compared utilization of genes in the most prominent VH family (VH3) and mutation frequencies and patterns of cDNA from VH3-IgG genes from 10 seronegative control subjects and 21 patients with HIV-1 infection (6 without and 15 patients with detectable plasma viremia). Unique IgG VH3 family cDNA sequences (n = 1,565) were PCR amplified, cloned, and sequenced from blood. Sequences were analyzed using online (Vbase) and in-house immunoglobulin alignment resources. Mutation frequencies in the antigen-binding hypervariable complementarity determining regions (CDR1/2) of IgG class-switched B cells were lower among viremic HIV-1-infected patients vs. controls for nucleotides (CDR1/2: 10±5% vs. 13.5±6%, p = 0.03) and amino acids (CDR: 20%±10 vs. 25%±12, p = 0.02) and in structural framework regions. Mutation patterns were similar among groups. The most common VH3 gene, VH3-23, was utilized less frequently among viremic HIV-1-infected patients (p = 0.03), and overall, mutation frequencies were decreased in nearly all VH3 genes compared with controls. B cells from HIV-1-infected patients show decreased mutation frequencies, especially in antigen-binding VH3 CDR genes, and selective defects in gene utilization. Similar mutation patterns suggest defects in the quantity, but not quality, of mutator activity. Lower levels of SHM in IgG class-switched B cells from HIV-1-infected patients may contribute to the increased risk of opportunistic infections and impaired humoral responses to preventative vaccines.

  18. Optical photometric variable stars towards the Galactic H II region NGC 2282

    NASA Astrophysics Data System (ADS)

    Dutta, Somnath; Mondal, Soumen; Joshi, Santosh; Jose, Jessy; Das, Ramkrishna; Ghosh, Supriyo

    2018-05-01

    We report here CCD I-band time series photometry of a young (2-5 Myr) cluster NGC 2282, in order to identify and understand the variability of pre-main-sequence (PMS) stars. The I-band photometry, down to ~20.5 mag, enables us to probe the variability towards the lower mass end (~0.1 M⊙) of PMS stars. From the light curves of 1627 stars, we identified 62 new photometric variable candidates. Their association with the region was established from Hα emission and infrared (IR) excess. Among 62 variables, 30 young variables exhibit Hα emission, near-IR (NIR)/mid-IR (MIR) excess or both and are candidate members of the cluster. Out of 62 variables, 41 are periodic variables, with a rotation rate ranging from 0.2-7 d. The period distribution exhibits a median period at ~1 d, as in many young clusters (e.g. NGC 2264, ONC, etc.), but it follows a unimodal distribution, unlike others that have bimodality, with slow rotators peaking at ~6-8 d. To investigate the rotation-disc and variability-disc connection, we derived the NIR excess from Δ(I - K) and the MIR excess from Spitzer [3.6]-[4.5] μm data. No conclusive evidence of slow rotation with the presence of discs around stars and fast rotation for discless stars is obtained from our periodic variables. A clear increasing trend of the variability amplitude with IR excess is found for all variables.

  19. Phylogenetic and Genome-Wide Deep-Sequencing Analyses of Canine Parvovirus Reveal Co-Infection with Field Variants and Emergence of a Recent Recombinant Strain

    PubMed Central

    Pérez, Ruben; Calleros, Lucía; Marandino, Ana; Sarute, Nicolás; Iraola, Gregorio; Grecco, Sofia; Blanc, Hervé; Vignuzzi, Marco; Isakov, Ofer; Shomron, Noam; Carrau, Lucía; Hernández, Martín; Francia, Lourdes; Sosa, Katia; Tomás, Gonzalo; Panzera, Yanina

    2014-01-01

    Canine parvovirus (CPV), a fast-evolving single-stranded DNA virus, comprises three antigenic variants (2a, 2b, and 2c) with different frequencies and genetic variability among countries. The contribution of co-infection and recombination to the genetic variability of CPV is far from being fully elucidated. Here we took advantage of a natural CPV population, recently formed by the convergence of divergent CPV-2c and CPV-2a strains, to study co-infection and recombination. Complete sequences of the viral coding region of CPV-2a and CPV-2c strains from 40 samples were generated and analyzed using phylogenetic tools. Two samples showed co-infection and were further analyzed by deep sequencing. The sequence profile of one of the samples revealed the presence of CPV-2c and CPV-2a strains that differed at 29 nucleotides. The other sample included a minor CPV-2a strain (13.3% of the viral population) and a major recombinant strain (86.7%). The recombinant strain arose from inter-genotypic recombination between CPV-2c and CPV-2a strains within the VP1/VP2 gene boundary. Our findings highlight the importance of deep-sequencing analysis to provide a better understanding of CPV molecular diversity. PMID:25365348

  20. Characterization of expressed sequence tag-derived simple sequence repeat markers for Aspergillus flavus: emphasis on variability of isolates from the southern United States.

    PubMed

    Wang, Xinwang; Wadl, Phillip A; Wood-Jones, Alicia; Windham, Gary; Trigiano, Robert N; Scruggs, Mary; Pilgrim, Candace; Baird, Richard

    2012-12-01

    Simple sequence repeat (SSR) markers were developed from the Aspergillus flavus expressed sequence tag (EST) database to conduct an analysis of genetic relationships of Aspergillus isolates from numerous host species and geographical regions, but primarily from the United States. Twenty-nine primers were designed from 362 tri-nucleotide EST-SSR sequences. Eighteen polymorphic loci were used to genotype 96 Aspergillus species isolates. The number of alleles detected per locus ranged from 2 to 24 with a mean of 8.2 alleles. Haploid diversity ranged from 0.28 to 0.91. A genetic distance matrix was used to perform principal coordinates analysis (PCA) and to generate dendrograms using the unweighted pair group method with arithmetic mean (UPGMA). Two principal coordinates explained more than 75% of the total variation among the isolates. One clade was identified for A. flavus isolates (n = 87) with the other Aspergillus species (n = 7) using PCA, but five distinct clusters were present when the other taxa were excluded from the analysis. Six groups were noted when the EST-SSR data were compared using UPGMA. However, the latter PCA or UPGMA comparison resulted in no direct associations with host species, geographical region or aflatoxin production. Furthermore, there was no direct correlation to visible morphological features such as sclerotial types. The isolates from the Mississippi Delta region, which contained the largest percentage of isolates, did not show any unusual clustering except for isolates K32, K55, and 199. Further studies of these three isolates are warranted to evaluate their pathogenicity, aflatoxin production potential, additional gene sequences (e.g., RPB2), and morphological comparisons.
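
    For readers unfamiliar with UPGMA, the dendrogram-building step mentioned in this abstract can be sketched in a few lines. This is a minimal pure-Python illustration of the general algorithm, not the software the authors used; the isolate labels and distances are invented for the example.

```python
from itertools import combinations


def upgma(labels, dist):
    """Build a nested-tuple tree from pairwise distances with UPGMA.

    labels: list of leaf names.
    dist:   dict mapping frozenset({a, b}) -> distance for every pair.
    """
    clusters = {name: 1 for name in labels}  # cluster -> number of leaves
    d = dict(dist)
    while len(clusters) > 1:
        # Merge the closest pair of current clusters.
        a, b = min(combinations(clusters, 2), key=lambda p: d[frozenset(p)])
        na, nb = clusters.pop(a), clusters.pop(b)
        merged = (a, b)
        # New distances are size-weighted averages, which is equivalent to
        # the arithmetic mean over all leaf pairs.
        for c in clusters:
            d[frozenset((merged, c))] = (
                na * d[frozenset((a, c))] + nb * d[frozenset((b, c))]
            ) / (na + nb)
        clusters[merged] = na + nb
    return next(iter(clusters))


# Hypothetical distances among three made-up isolates.
toy = {
    frozenset(("iso1", "iso2")): 2.0,
    frozenset(("iso1", "iso3")): 6.0,
    frozenset(("iso2", "iso3")): 6.0,
}
tree = upgma(["iso1", "iso2", "iso3"], toy)
print(tree)
```

    In the toy data iso1 and iso2 are closest, so they are joined first and iso3 attaches to that pair at the root.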

  1. Flagellin diversity in Clostridium botulinum groups I and II: a new strategy for strain identification.

    PubMed

    Paul, Catherine J; Twine, Susan M; Tam, Kevin J; Mullen, James A; Kelly, John F; Austin, John W; Logan, Susan M

    2007-05-01

    Strains of Clostridium botulinum are traditionally identified by botulinum neurotoxin type; however, identification of an additional target for typing would improve differentiation. Isolation of flagellar filaments and analysis by sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE) showed that C. botulinum produced multiple flagellin proteins. Nano-liquid chromatography-tandem mass spectrometry (nLC-MS/MS) analysis of in-gel tryptic digests identified peptides in all flagellin bands that matched two homologous tandem flagellin genes identified in the C. botulinum Hall A genome. Designated flaA1 and flaA2, these open reading frames encode the major structural flagellins of C. botulinum. Colony PCR and sequencing of flaA1/A2 variable regions classified 80 environmental and clinical strains into group I or group II and clustered isolates into 12 flagellar types. Flagellar type was distinct from neurotoxin type, and epidemiologically related isolates clustered together. Sequencing a larger PCR product, obtained during amplification of flaA1/A2 from type E strain Bennett identified a second flagellin gene, flaB. LC-MS analysis confirmed that flaB encoded a large type E-specific flagellin protein, and the predicted molecular mass for FlaB matched that observed by SDS-PAGE. In contrast, the molecular mass of FlaA was 2 to 12 kDa larger than the mass predicted by the flaA1/A2 sequence of a given strain, suggesting that FlaA is posttranslationally modified. While identification of FlaB, and the observation by SDS-PAGE of different masses of the FlaA proteins, showed the flagellin proteins of C. botulinum to be diverse, the presence of the flaA1/A2 gene in all strains examined facilitates single locus sequence typing of C. botulinum using the flagellin variable region.

  2. Symbiosis Island Shuffling with Abundant Insertion Sequences in the Genomes of Extra-Slow-Growing Strains of Soybean Bradyrhizobia

    PubMed Central

    Iida, Takayuki; Itakura, Manabu; Anda, Mizue; Sugawara, Masayuki; Isawa, Tsuyoshi; Okubo, Takashi; Sato, Shusei; Chiba-Kakizaki, Kaori

    2015-01-01

    Extra-slow-growing bradyrhizobia from root nodules of field-grown soybeans harbor abundant insertion sequences (ISs) and are termed highly reiterated sequence-possessing (HRS) strains. We analyzed the genome organization of HRS strains with the focus on IS distribution and symbiosis island structure. Using pulsed-field gel electrophoresis, we consistently detected several plasmids (0.07 to 0.4 Mb) in the HRS strains (NK5, NK6, USDA135, 2281, USDA123, and T2), whereas no plasmids were detected in the non-HRS strain USDA110. The chromosomes of the six HRS strains (9.7 to 10.7 Mb) were larger than that of USDA110 (9.1 Mb). Using MiSeq sequences of 6 HRS and 17 non-HRS strains mapped to the USDA110 genome, we found that the copy numbers of ISRj1, ISRj2, ISFK1, IS1632, ISB27, ISBj8, and IS1631 were markedly higher in HRS strains. Whole-genome sequencing showed that the HRS strain NK6 had four small plasmids (136 to 212 kb) and a large chromosome (9,780 kb). Strong colinearity was found between 7.4-Mb core regions of the NK6 and USDA110 chromosomes. USDA110 symbiosis islands corresponded mainly to five small regions (S1 to S5) within two variable regions, V1 (0.8 Mb) and V2 (1.6 Mb), of the NK6 chromosome. The USDA110 nif gene cluster (nifDKENXSBZHQW-fixBCX) was split into two regions, S2 and S3, where ISRj1-mediated rearrangement occurred between nifS and nifB. ISs were also scattered in NK6 core regions, and ISRj1 insertion often disrupted some genes important for survival and environmental responses. These results suggest that HRS strains of soybean bradyrhizobia were subjected to IS-mediated symbiosis island shuffling and core genome degradation. PMID:25862225

  3. Genome of Horsepox Virus

    PubMed Central

    Tulman, E. R.; Delhon, G.; Afonso, C. L.; Lu, Z.; Zsak, L.; Sandybaev, N. T.; Kerembekova, U. Z.; Zaitsev, V. L.; Kutish, G. F.; Rock, D. L.

    2006-01-01

    Here we present the genomic sequence of horsepox virus (HSPV) isolate MNR-76, an orthopoxvirus (OPV) isolated in 1976 from diseased Mongolian horses. The 212-kbp genome contained 7.5-kbp inverted terminal repeats and lacked extensive terminal tandem repetition. HSPV contained 236 open reading frames (ORFs) with similarity to those in other OPVs, with those in the central 100-kbp region most conserved relative to other OPVs. Phylogenetic analysis of the conserved region indicated that HSPV is closely related to sequenced isolates of vaccinia virus (VACV) and rabbitpox virus, clearly grouping together these VACV-like viruses. Fifty-four HSPV ORFs likely represented fragments of 25 orthologous OPV genes, including in the central region the only known fragmented form of an OPV ribonucleotide reductase large subunit gene. In terminal genomic regions, HSPV lacked full-length homologues of genes variably fragmented in other VACV-like viruses but was unique in fragmentation of the homologue of VACV strain Copenhagen B6R, a gene intact in other known VACV-like viruses. Notably, HSPV contained in terminal genomic regions 17 kbp of OPV-like sequence absent in known VACV-like viruses, including fragments of genes intact in other OPVs and approximately 1.4 kb of sequence present only in cowpox virus (CPXV). HSPV also contained seven full-length genes fragmented or missing in other VACV-like viruses, including intact homologues of the CPXV strain GRI-90 D2L/I4R CrmB and D13L CD30-like tumor necrosis factor receptors, D3L/I3R and C1L ankyrin repeat proteins, B19R kelch-like protein, D7L BTB/POZ domain protein, and B22R variola virus B22R-like protein. These results indicated that HSPV contains unique genomic features likely contributing to a unique virulence/host range phenotype. They also indicated that while closely related to known VACV-like viruses, HSPV contains additional, potentially ancestral sequences absent in other VACV-like viruses. PMID:16940536

  4. Comparative Molecular and Morphological Variation Analysis of Siderastrea (Anthozoa, Scleractinia) Reveals the Presence of Siderastrea stellata in the Gulf of Mexico.

    PubMed

    García, Norberto A Colín; Campos, Jorge E; Musi, José L Tello; Forsman, Zac H; Muñoz, Jorge L Montero; Reyes, Alejandro Monsalvo; González, Jesús E Arias

    2017-02-01

    The genus Siderastrea exhibits high levels of morphological variability. Some of its species share similar morphological characteristics with congeners, making their identification difficult. Siderastrea stellata has been reported as an intermediary of S. siderea and S. radians in the Brazilian reef ecosystem. In an earlier study conducted in Mexico, we detected Siderastrea colonies with morphological features that were not consistent with some siderastreid species previously reported in the Gulf of Mexico. Thus, we performed a combined morphological and molecular analysis to identify Siderastrea species boundaries from the Gulf of Mexico. Some colonies presented high morphologic variability, with characteristics that corresponded to Siderastrea stellata. Molecular analysis, using the nuclear ITS and ITS2 region, corroborated the morphological results, revealing low genetic variability between S. radians and S. stellata. Since the ITS sequences did not distinguish between Siderastrea species, we used the ITS2 region to differentiate S. stellata from S. radians. This is the first report of Siderastrea stellata and its variability in the Gulf of Mexico that is supported by morphological and molecular analyses.

  5. Analyses of mitochondrial amino acid sequence datasets support the proposal that specimens of Hypodontus macropi from three species of macropodid hosts represent distinct species

    PubMed Central

    2013-01-01

    Background: Hypodontus macropi is a common intestinal nematode of a range of kangaroos and wallabies (macropodid marsupials). Based on previous multilocus enzyme electrophoresis (MEE) and nuclear ribosomal DNA sequence data sets, H. macropi has been proposed to be a complex of species. To test this proposal using independent molecular data, we sequenced the whole mitochondrial (mt) genomes of individuals of H. macropi from three different species of hosts (Macropus robustus robustus, Thylogale billardierii and Macropus [Wallabia] bicolor) as well as that of Macropicola ocydromi (a related nematode), and undertook a comparative analysis of the amino acid sequence datasets derived from these genomes. Results: The mt genomes sequenced by next-generation (454) technology from H. macropi from the three host species varied from 13,634 bp to 13,699 bp in size. Pairwise comparisons of the amino acid sequences predicted from these three mt genomes revealed differences of 5.8% to 18%. Phylogenetic analysis of the amino acid sequence data sets using Bayesian Inference (BI) showed that H. macropi from the three different host species formed distinct, well-supported clades. In addition, sliding window analysis of the mt genomes defined variable regions for future population genetic studies of H. macropi in different macropodid hosts and geographical regions around Australia. Conclusions: The present analyses of inferred mt protein sequence datasets clearly supported the hypothesis that H. macropi from M. robustus robustus, M. bicolor and T. billardierii represent distinct species. PMID:24261823
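
    The sliding window analysis mentioned in this record can be illustrated with a minimal sketch. It assumes a gap-free alignment of equal-length sequences and scores each window by its fraction of variable columns; the function name, window size and step are illustrative choices and are not taken from the study, which may have used a different variability measure.

```python
def sliding_window_variability(alignment, window=300, step=50):
    """Yield (start, fraction of variable columns) for each window of an alignment."""
    length = len(alignment[0])
    if any(len(seq) != length for seq in alignment):
        raise ValueError("all sequences must have the same aligned length")
    # A column counts as variable when it holds more than one distinct character.
    variable = [len(set(column)) > 1 for column in zip(*alignment)]
    for start in range(0, length - window + 1, step):
        yield start, sum(variable[start:start + window]) / window


# Toy alignment (not real mt genome data): columns 5 and 7 differ between sequences.
toy = ["AAAACCCCGGGG", "AAAACTCCGGGG", "AAAACTCAGGGG"]
profile = list(sliding_window_variability(toy, window=4, step=4))
print(profile)  # [(0, 0.0), (4, 0.5), (8, 0.0)]
```

    Windows with the highest fraction would be the candidate variable regions; gap characters, if present, are treated here like any other character.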

  6. [Analysis of COX1 sequences of Taenia isolates from four areas of Guangxi].

    PubMed

    Yang, Yi-Chao; Ou-Yang, Yi; Su, Ai-Rong; Wan, Xiao-Ling; Li, Shu-Lin

    2012-06-01

    To analyze the COX1 sequences of Taenia isolates from four areas of Guangxi Zhuang Autonomous Region, and to understand the distribution of Taenia asiatica in Guangxi. Patients with taeniasis in Luzhai, Rongshui, Tiandong and Sanjiang in Guangxi were treated by deworming, and the Taenia isolates were collected. Cytochrome c oxidase subunit 1 (COX1) sequences of these isolates were amplified by PCR, and the PCR products were sequenced by T-A clone sequencing. The homogeneities and genetic distances were calculated and analyzed, and phylogenetic trees were constructed using software. Meanwhile, the COX1 sequences of the isolates from the 4 areas were compared separately with the sequences of Taenia species in GenBank. The COX1 sequences of the 5 Taenia isolates collected had the same length of 444 bp. There were 5 variable positions between the Luzhai isolate and Taenia asiatica; the homogeneity was 98.87% and their genetic distance was 0.011. The phylogenetic tree analysis revealed that the Luzhai isolate and Taenia asiatica were located at the same node and were closely related. The homogeneity between Rongshui isolate A and Taenia solium was 100%, while the homogeneities of Rongshui isolate B with Taenia saginata and Taenia asiatica were 98.20% and 96.17%, respectively. The homogeneities of the Tiandong and Sanjiang isolates with Taenia solium were 99.55% and 96.40%, respectively, and the genetic distances were 0.005 and 0.037, respectively. The homogeneity between the Luzhai isolate and Taenia saginata was 96.40%. Taenia asiatica exists in Luzhai, and Taenia solium and Taenia saginata coexist in Rongshui, Guangxi Zhuang Autonomous Region.
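
    The reported figures for the Luzhai isolate can be checked with simple arithmetic: 5 variable positions over 444 bp gives (444 - 5) / 444 = 98.87% and an uncorrected distance of 5 / 444 = 0.011. The sketch below computes both values for two aligned sequences. It assumes the uncorrected p-distance; the abstract does not state which distance model was used, and a model-corrected distance can differ slightly. The sequences are synthetic placeholders, not COX1 data.

```python
def identity_and_p_distance(seq_a, seq_b):
    """Return (percent identity, p-distance) for two aligned, equal-length sequences."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    if not seq_a:
        raise ValueError("sequences must be non-empty")
    mismatches = sum(1 for a, b in zip(seq_a, seq_b) if a != b)
    p_distance = mismatches / len(seq_a)
    return 100.0 * (1.0 - p_distance), p_distance


# Synthetic 444-bp pair differing at 5 positions (placeholder data).
reference = "ACGT" * 111
mutated = list(reference)
for pos in (10, 100, 200, 300, 400):
    mutated[pos] = "A" if mutated[pos] != "A" else "C"
query = "".join(mutated)

identity, distance = identity_and_p_distance(reference, query)
print(f"{identity:.2f}% identity, p-distance {distance:.3f}")
# prints: 98.87% identity, p-distance 0.011
```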

  7. LandScape: a simple method to aggregate p-values and other stochastic variables without a priori grouping.

    PubMed

    Wiuf, Carsten; Schaumburg-Müller Pallesen, Jonatan; Foldager, Leslie; Grove, Jakob

    2016-08-01

    In many areas of science it is customary to perform many, potentially millions, of tests simultaneously. To gain statistical power it is common to group tests based on a priori criteria such as predefined regions or by sliding windows. However, it is not straightforward to choose grouping criteria and the results might depend on the chosen criteria. Methods that summarize, or aggregate, test statistics or p-values, without relying on a priori criteria, are therefore desirable. We present a simple method to aggregate a sequence of stochastic variables, such as test statistics or p-values, into fewer variables without assuming a priori defined groups. We provide different ways to evaluate the significance of the aggregated variables based on theoretical considerations and resampling techniques, and show that under certain assumptions the FWER is controlled in the strong sense. Validity of the method was demonstrated using simulations and real data analyses. Our method may be a useful supplement to standard procedures relying on evaluation of test statistics individually. Moreover, by being agnostic and not relying on predefined selected regions, it might be a practical alternative to conventionally used methods of aggregation of p-values over regions. The method is implemented in Python and freely available online (through GitHub, see the Supplementary information).
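
    For context, the conventional sliding-window aggregation that this record contrasts with its own method can be sketched as follows. This is not the LandScape algorithm, whose details the abstract does not give. It combines the p-values in each fixed window with Fisher's method, assuming independent tests and p-values strictly greater than zero; the function names and the window size are illustrative.

```python
import math


def fisher_combined_p(p_values):
    """Combine independent p-values with Fisher's method (chi-square, 2n d.o.f.)."""
    n = len(p_values)
    half_stat = -sum(math.log(p) for p in p_values)  # X/2, where X = -2 * sum(ln p)
    # For an even number of degrees of freedom (2n) the chi-square survival
    # function has the closed form exp(-X/2) * sum_{i<n} (X/2)^i / i!.
    return math.exp(-half_stat) * sum(
        half_stat ** i / math.factorial(i) for i in range(n)
    )


def windowed_fisher(p_values, window):
    """Return one combined p-value per sliding window of fixed size."""
    return [
        fisher_combined_p(p_values[i:i + window])
        for i in range(len(p_values) - window + 1)
    ]


# Made-up p-values: a weak signal spread over neighbouring tests.
example = [0.8, 0.05, 0.05, 0.6]
combined = windowed_fisher(example, window=2)
print([round(p, 4) for p in combined])
```

    The result depends on the chosen window size, which is the a priori grouping choice the record's method aims to avoid.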

  8. Surface Diversity in Mycoplasma agalactiae Is Driven by Site-Specific DNA Inversions within the vpma Multigene Locus

    PubMed Central

    Glew, Michelle D.; Marenda, Marc; Rosengarten, Renate; Citti, Christine

    2002-01-01

    The ruminant pathogen Mycoplasma agalactiae possesses a family of abundantly expressed variable surface lipoproteins called Vpmas. Phenotypic switches between Vpma members have previously been correlated with DNA rearrangements within a locus of vpma genes and are proposed to play an important role in disease pathogenesis. In this study, six vpma genes were characterized in the M. agalactiae type strain PG2. All vpma genes clustered within an 8-kb region and shared highly conserved 5′ untranslated regions, lipoprotein signal sequences, and short N-terminal sequences. Analyses of the vpma loci from consecutive clonal isolates showed that vpma DNA rearrangements were site specific and that cleavage and strand exchange occurred within a minimal region of 21 bp located within the 5′ untranslated region of all vpma genes. This process controlled expression of vpma genes by effectively linking the open reading frame (ORF) of a silent gene to a unique active promoter sequence within the locus. An ORF (xer1) immediately adjacent to one end of the vpma locus did not undergo rearrangement and had significant homology to a distinct subset of genes belonging to the λ integrase family of site-specific xer recombinases. It is proposed that xer1 codes for a site-specific recombinase that is not involved in chromosome dimer resolution but rather is responsible for the observed vpma-specific recombination in M. agalactiae. PMID:12374833

  9. Stereophysicochemical variability plots highlight conserved antigenic areas in Flaviviruses

    PubMed Central

    Schein, Catherine H; Zhou, Bin; Braun, Werner

    2005-01-01

    Background: Flaviviruses, which include Dengue (DV) and West Nile (WN), mutate in response to immune system pressure. Identifying escape mutants, variant progeny that replicate in the presence of neutralizing antibodies, is a common way to identify functionally important residues of viral proteins. However, the mutations typically occur at variable positions on the viral surface that are not essential for viral replication. Methods are needed to determine the true targets of the neutralizing antibodies. Results: Stereophysicochemical variability plots (SVPs), 3-D images of protein structures colored according to variability, as determined by our PCPMer program, were used to visualize residues conserved in their physical chemical properties (PCPs) near escape mutant positions. The analysis showed 1) that escape mutations in the flavivirus envelope protein are variable residues by our criteria and 2) two escape mutants found at the same position in many flaviviruses sit above clusters of conserved residues from different regions of the linear sequence. Conservation patterns in T-cell epitopes in the NS3 protease suggest a similar mechanism of immune system evasion. Conclusion: The SVPs add another dimension to structurally defining the binding sites of neutralizing antibodies. They provide a useful aid for determining antigenically important regions and designing vaccines. PMID:15845145

  10. Large-scale sequence and structural comparisons of human naive and antigen-experienced antibody repertoires

    PubMed Central

    DeKosky, Brandon J.; Lungu, Oana I.; Park, Daechan; Johnson, Erik L.; Charab, Wissam; Chrysostomou, Constantine; Kuroda, Daisuke; Ellington, Andrew D.; Ippolito, Gregory C.; Gray, Jeffrey J.; Georgiou, George

    2016-01-01

    Elucidating how antigen exposure and selection shape the human antibody repertoire is fundamental to our understanding of B-cell immunity. We sequenced the paired heavy- and light-chain variable regions (VH and VL, respectively) from large populations of single B cells combined with computational modeling of antibody structures to evaluate sequence and structural features of human antibody repertoires at unprecedented depth. Analysis of a dataset comprising 55,000 antibody clusters from CD19+CD20+CD27− IgM-naive B cells, >120,000 antibody clusters from CD19+CD20+CD27+ antigen–experienced B cells, and >2,000 RosettaAntibody-predicted structural models across three healthy donors led to a number of key findings: (i) VH and VL gene sequences pair in a combinatorial fashion without detectable pairing restrictions at the population level; (ii) certain VH:VL gene pairs were significantly enriched or depleted in the antigen-experienced repertoire relative to the naive repertoire; (iii) antigen selection increased antibody paratope net charge and solvent-accessible surface area; and (iv) public heavy-chain third complementarity-determining region (CDR-H3) antibodies in the antigen-experienced repertoire showed signs of convergent paired light-chain genetic signatures, including shared light-chain third complementarity-determining region (CDR-L3) amino acid sequences and/or Vκ,λ–Jκ,λ genes. The data reported here address several longstanding questions regarding antibody repertoire selection and development and provide a benchmark for future repertoire-scale analyses of antibody responses to vaccination and disease. PMID:27114511

  11. Indel Group in Genomes (IGG) Molecular Genetic Markers

    PubMed Central

    Burkart-Waco, Diana; Kuppu, Sundaram; Britt, Anne; Chetelat, Roger

    2016-01-01

    Genetic markers are essential when developing or working with genetically variable populations. Indel Group in Genomes (IGG) markers are primer pairs that amplify single-locus sequences that differ in size for two or more alleles. They are attractive for their ease of use for rapid genotyping and their codominant nature. Here, we describe a heuristic algorithm that uses a k-mer-based approach to search two or more genome sequences to locate polymorphic regions suitable for designing candidate IGG marker primers. As input to the IGG pipeline software, the user provides genome sequences and the desired amplicon sizes and size differences. Primer sequences flanking polymorphic insertions/deletions are produced as output. IGG marker files for three sets of genomes, Solanum lycopersicum/Solanum pennellii, Arabidopsis (Arabidopsis thaliana) Columbia-0/Landsberg erecta-0 accessions, and S. lycopersicum/S. pennellii/Solanum tuberosum (three-way polymorphic) are included. PMID:27436831
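
    The general idea of a k-mer-based search for size-polymorphic regions can be shown with a toy sketch. This is not the IGG pipeline's algorithm. It assumes two colinear sequences, uses k-mers that occur exactly once in each as shared anchors, and reports neighbouring anchor pairs whose spacing differs; the function names, `k` and `min_diff` are hypothetical.

```python
from collections import defaultdict


def unique_kmer_positions(seq, k):
    """Map each k-mer that occurs exactly once in seq to its start position."""
    positions = defaultdict(list)
    for i in range(len(seq) - k + 1):
        positions[seq[i:i + k]].append(i)
    return {kmer: pos[0] for kmer, pos in positions.items() if len(pos) == 1}


def candidate_indels(genome_a, genome_b, k=8, min_diff=3):
    """Return (a_start, a_end, b_start, b_end, size_diff) for anchor pairs whose spacing differs."""
    a_pos = unique_kmer_positions(genome_a, k)
    b_pos = unique_kmer_positions(genome_b, k)
    anchors = sorted((a_pos[kmer], b_pos[kmer]) for kmer in a_pos.keys() & b_pos.keys())
    hits = []
    for (a1, b1), (a2, b2) in zip(anchors, anchors[1:]):
        if b2 <= b1:
            continue  # anchors are not colinear; skipped in this toy sketch
        diff = (a2 - a1) - (b2 - b1)
        if abs(diff) >= min_diff:
            # The region spans both flanking anchors, where primers could sit.
            hits.append((a1, a2 + k, b1, b2 + k, diff))
    return hits


# Toy sequences: genome_a carries a 6-bp insertion relative to genome_b.
genome_b = "ACGTTGCA" + "TCAGATCC"
genome_a = "ACGTTGCA" + "GGGGGG" + "TCAGATCC"
hits = candidate_indels(genome_a, genome_b, k=4, min_diff=3)
print(hits)  # [(4, 18, 4, 12, 6)]
```

    Here the reported region is 14 bp in `genome_a` and 8 bp in `genome_b`, so an amplicon spanning it would differ in size by 6 bp between the two alleles.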

  12. Characterization of the Fb-Nof Transposable Element of Drosophila Melanogaster

    PubMed Central

    Harden, N.; Ashburner, M.

    1990-01-01

    FB-NOF is a composite transposable element of Drosophila melanogaster. It is composed of foldback sequences, of variable length, which flank a 4-kb NOF sequence with 308-bp inverted repeat termini. The NOF sequence could potentially code for a 120-kD polypeptide. The FB-NOF element is responsible for unstable mutations of the white gene (w(c) and w(DZL)) and is associated with the large TEs of G. Ising. Although most strains of D. melanogaster have 20-30 sites of FB insertion, FB-NOF elements are usually rare; many strains lack this composite element or have only one copy of it. A few strains, including w(DZL) and Basc, have many (8-21) copies of FB-NOF, and these show a tendency to insert at "hot-spots." These strains also have an increased number of FB elements. The DNA sequence of the NOF region associated with TE146(Z) has been determined. PMID:2174013

  13. Sequencing of hepatitis C virus for detection of resistance to direct-acting antiviral therapy: A systematic review.

    PubMed

    Bartlett, Sofia R; Grebely, Jason; Eltahla, Auda A; Reeves, Jacqueline D; Howe, Anita Y M; Miller, Veronica; Ceccherini-Silberstein, Francesca; Bull, Rowena A; Douglas, Mark W; Dore, Gregory J; Harrington, Patrick; Lloyd, Andrew R; Jacka, Brendan; Matthews, Gail V; Wang, Gary P; Pawlotsky, Jean-Michel; Feld, Jordan J; Schinkel, Janke; Garcia, Federico; Lennerstrand, Johan; Applegate, Tanya L

    2017-07-01

    The significance of the clinical impact of direct-acting antiviral (DAA) resistance-associated substitutions (RASs) in hepatitis C virus (HCV) on treatment failure is unclear. No standardized methods or guidelines for detection of DAA RASs in HCV exist. To facilitate further evaluations of the impact of DAA RASs in HCV, we conducted a systematic review of RAS sequencing protocols, compiled a comprehensive public library of sequencing primers, and provided expert guidance on the most appropriate methods to screen and identify RASs. The development of standardized RAS sequencing protocols is complicated due to a high genetic variability and the need for genotype- and subtype-specific protocols for multiple regions. We have identified several limitations of the available methods and have highlighted areas requiring further research and development. The development, validation, and sharing of standardized methods for all genotypes and subtypes should be a priority. (Hepatology Communications 2017;1:379-390).

  14. Phenotype/genotype correlation in a case series of Stargardt's patients identifies novel mutations in the ABCA4 gene.

    PubMed

    Gemenetzi, M; Lotery, A J

    2013-11-01

    To investigate phenotypic variability in terms of best-corrected visual acuity (BCVA) in patients with Stargardt disease (STGD) and confirmed ABCA4 mutations. Entire coding region analysis of the ABCA4 gene by direct sequencing of seven patients with clinical findings of STGD seen in the Retina Clinics of Southampton Eye Unit between 2002 and 2011. Phenotypic variables recorded were BCVA, fluorescein angiographic appearance, electrophysiology, and visual fields. All patients had heterozygous amino acid-changing variants (missense mutations) in the ABCA4 gene. A splice sequence change was found in a 30-year-old patient with severely affected vision. Two novel sequence changes were identified: a missense mutation in a mildly affected 44-year-old patient and a frameshift mutation in a severely affected 34-year-old patient. The identified ABCA4 mutations were compatible with the resulting phenotypes in terms of BCVA. Higher BCVAs were recorded in patients with missense mutations. Sequence changes predicted to have a more deleterious effect on protein function resulted in a more severe phenotype. This case series of STGD patients demonstrates novel genotype/phenotype correlations, which may be useful for counselling of patients. This information may prove useful in selection of candidates for clinical trials in ABCA4 disease.

  15. Statistical inference of the generation probability of T-cell receptors from sequence repertoires.

    PubMed

    Murugan, Anand; Mora, Thierry; Walczak, Aleksandra M; Callan, Curtis G

    2012-10-02

    Stochastic rearrangement of germline V-, D-, and J-genes to create variable coding sequence for certain cell surface receptors is at the origin of immune system diversity. This process, known as "VDJ recombination", is implemented via a series of stochastic molecular events involving gene choices and random nucleotide insertions between, and deletions from, genes. We use large sequence repertoires of the variable CDR3 region of human CD4+ T-cell receptor beta chains to infer the statistical properties of these basic biochemical events. Because any given CDR3 sequence can be produced in multiple ways, the probability distribution of hidden recombination events cannot be inferred directly from the observed sequences; we therefore develop a maximum likelihood inference method to achieve this end. To separate the properties of the molecular rearrangement mechanism from the effects of selection, we focus on nonproductive CDR3 sequences in T-cell DNA. We infer the joint distribution of the various generative events that occur when a new T-cell receptor gene is created. We find a rich picture of correlation (and absence thereof), providing insight into the molecular mechanisms involved. The generative event statistics are consistent between individuals, suggesting a universal biochemical process. Our probabilistic model predicts the generation probability of any specific CDR3 sequence by the primitive recombination process, allowing us to quantify the potential diversity of the T-cell repertoire and to understand why some sequences are shared between individuals. We argue that the use of formal statistical inference methods, of the kind presented in this paper, will be essential for quantitative understanding of the generation and evolution of diversity in the adaptive immune system.

  16. Genetic variability among Schistosoma japonicum isolates from the Philippines, Japan and China revealed by sequence analysis of three mitochondrial genes.

    PubMed

    Chen, Fen; Li, Juan; Sugiyama, Hiromu; Zhou, Dong-Hui; Song, Hui-Qun; Zhao, Guang-Hui; Zhu, Xing-Quan

    2015-02-01

    The present study examined sequence variability in the mitochondrial (mt) protein-coding genes cytochrome b (cytb), NADH dehydrogenase subunits 2 and 6 (nad2 and nad6) among 24 isolates of Schistosoma japonicum from different endemic regions in the Philippines, Japan and China. The complete cytb, nad2 and nad6 genes were amplified and sequenced separately from individual schistosome. Sequence variations for isolates from the Philippines were 0-0.5% for cytb, 0-0.6% for nad2, and 0-0.9% for nad6. Variation was 0-0.5%, 0.1-0.8%, 0-0.7% for corresponding genes for schistosome samples from mainland China. For worms in Japan, genetic variations were 0-0.2%, 0.1-0.2% and 0 for the three genes, respectively. Sequence variations were 0-1.0%, 0-1.8% and 0-1.1% for cytb, nad2 and nad6, respectively, among schistosome isolates from different geographical strains in the Philippines, Japan and China. Of the three countries, lowest sequence variations were found between isolates from mainland China and the Philippines and highest were detected between Japan and the Philippines in three mtDNA genes. Phylogenetic analyses based on the combined sequences of cytb, nad2 and nad6 revealed that all isolates in the Philippines clustered together sistered to samples from Yunnan and Zhejiang provinces in China, while isolates from Yamanashi in Japan were in a solitary clade. These results demonstrated the usefulness of the combined three mtDNA sequences for studying genetic diversity and population structure among S. japonicum isolates from the Philippines, China and Japan.

  17. Genetic analysis of West Nile virus isolates from an outbreak in Idaho, United States, 2006-2007.

    PubMed

    Grinev, Andriyan; Chancey, Caren; Añez, Germán; Ball, Christopher; Winkelman, Valerie; Williamson, Phillip; Foster, Gregory A; Stramer, Susan L; Rios, Maria

    2013-09-23

    West Nile virus (WNV) appeared in the U.S. in 1999 and has since become endemic, with yearly summer epidemics causing tens of thousands of cases of serious disease over the past 14 years. Analysis of WNV strains isolated during the 2006-2007 epidemic seasons demonstrates that a new genetic variant had emerged coincidentally with an intense outbreak in Idaho during 2006. The isolates belonging to the new variant carry a 13 nt deletion, termed ID-Δ13, located at the variable region of the 3'UTR, and are genetically related. The analysis of deletions and insertions in the 3'UTR of two major lineages of WNV revealed the presence of conserved repeats and two indel motifs in the variable region of the 3'UTR. One human and two bird isolates from the Idaho 2006-2007 outbreaks were sequenced using Illumina technology and within-host variability was analyzed. Continued monitoring of new genetic variants is important for public health as WNV continues to evolve.

  18. Discriminating plants using the DNA barcode rbcLb: an appraisal based on a large data set.

    PubMed

    Dong, Wenpan; Cheng, Tao; Li, Changhao; Xu, Chao; Long, Ping; Chen, Chumming; Zhou, Shiliang

    2014-03-01

    The ideal DNA barcode for plants remains to be discovered, and the candidate barcode rbcL has been met with considerable skepticism since its proposal. In fact, the variability within this gene has never been fully explored across all plant groups from algae to flowering plants, and its performance as a barcode has not been adequately tested. By analysing all of the rbcL sequences currently available in GenBank, we attempted to determine how well a region of rbcL performs as a barcode in species discrimination. We found that the rbcLb region was more variable than the frequently used rbcLa region. Both universal and plant group-specific primers were designed to amplify rbcLb, and the performance of rbcLa and rbcLb was tested in several ways. Using BLAST, both regions successfully identified all families and nearly all genera; however, the successful species identification rates varied significantly among plant groups, ranging from 24.58% to 85.50% for rbcLa and from 36.67% to 90.89% for rbcLb. Successful species discrimination ranged from 5.19% to 96.33% for rbcLa and from 22.09% to 98.43% for rbcLb in species-rich families, and from 0 to 88.73% for rbcLa and from 2.04% to 100% for rbcLb in species-rich genera. Both regions performed better for lower plants than for higher plants, although rbcLb performed significantly better than rbcLa overall, particularly for angiosperms. Considering the applicability across plants, easy and unambiguous alignment, high primer universality, high sequence quality and high species discrimination power for lower plants, we suggest rbcLb as a universal plant barcode. © 2013 John Wiley & Sons Ltd.

  19. Effects of historical climate change, habitat connectivity, and vicariance on genetic structure and diversity across the range of the Red Tree Vole (Phenacomys longicaudus) in the Pacific Northwest United States

    USGS Publications Warehouse

    Miller, Mark P.; Bellinger, R.M.; Forsman, E.D.; Haig, Susan M.

    2006-01-01

    Phylogeographical analyses conducted in the Pacific Northwestern United States have often revealed concordant patterns of genetic diversity among taxa. These studies demonstrate distinct North/South genetic discontinuities that have been attributed to Pleistocene glaciation. We examined phylogeographical patterns of red tree voles (Phenacomys longicaudus) in western Oregon by analysing mitochondrial control region sequences for 169 individuals from 18 areas across the species' range. Cytochrome b sequences were also analysed from a subset of our samples to confirm the presence of major haplotype groups. Phylogenetic network analyses suggested the presence of two haplotype groups corresponding to northern and southern regions of P. longicaudus' range. Spatial genetic analyses (SAMOVA and Genetic Landscape Shapes) of control region sequences demonstrated a primary genetic discontinuity separating northern and southern sampling areas, while a secondary discontinuity separated northern sampling areas into eastern and western groups divided by the Willamette Valley. The North/South discontinuity likely corresponds to a region of secondary contact between lineages rather than an overt barrier. Although the Cordilleran ice sheet (maximum approximately 12 000 years ago) did not move southward to directly affect the region occupied by P. longicaudus, climate change during glaciation fragmented the forest landscape that it inhabits. Signatures of historical fragmentation were reflected by positive associations between latitude and variables such as Tajima's D and patterns associated with location-specific alleles. Genetic distances between southern sampling areas were smaller, suggesting that forest fragmentation was reduced in southern vs. northern regions.

  20. Transposon Insertions, Structural Variations, and SNPs Contribute to the Evolution of the Melon Genome.

    PubMed

    Sanseverino, Walter; Hénaff, Elizabeth; Vives, Cristina; Pinosio, Sara; Burgos-Paz, William; Morgante, Michele; Ramos-Onsins, Sebastián E; Garcia-Mas, Jordi; Casacuberta, Josep Maria

    2015-10-01

    The availability of extensive databases of crop genome sequences should allow analysis of crop variability at an unprecedented scale, which should have an important impact in plant breeding. However, up to now the analysis of genetic variability at the whole-genome scale has been mainly restricted to single nucleotide polymorphisms (SNPs). This is a strong limitation as structural variation (SV) and transposon insertion polymorphisms are frequent in plant species and have had an important mutational role in crop domestication and breeding. Here, we present the first comprehensive analysis of melon genetic diversity, which includes a detailed analysis of SNPs, SV, and transposon insertion polymorphisms. The variability found among seven melon varieties representing the species diversity and including wild accessions and highly bred lines is relatively high, due in part to the marked divergence of some lineages. The diversity is distributed nonuniformly across the genome, being lower at the extremes of the chromosomes and higher in the pericentromeric regions, which is compatible with the effect of purifying selection and recombination forces over functional regions. Additionally, this variability is greatly reduced among elite varieties, probably due to selection during breeding. We have found some chromosomal regions showing a high differentiation of the elite varieties versus the rest, which could be considered as strongly selected candidate regions. Our data also suggest that transposons and SV may be at the origin of an important fraction of the variability in melon, which highlights the importance of analyzing all types of genetic variability to understand crop genome evolution. © The Author 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  1. Comparative Genome Analysis of Ciprofloxacin-Resistant Pseudomonas aeruginosa Reveals Genes Within Newly Identified High Variability Regions Associated With Drug Resistance Development

    PubMed Central

    Su, Hsun-Cheng; Khatun, Jainab; Kanavy, Dona M.

    2013-01-01

    The alarming rise of ciprofloxacin-resistant Pseudomonas aeruginosa has been reported in several clinical studies. Though the mutation of resistance genes and their role in drug resistance has been researched, the process by which the bacterium acquires high-level resistance is still not well understood. How does the genomic evolution of P. aeruginosa affect resistance development? Could the exposure of antibiotics to the bacteria enrich genomic variants that lead to the development of resistance, and if so, how are these variants distributed through the genome? To answer these questions, we performed 454 pyrosequencing and a whole genome analysis both before and after exposure to ciprofloxacin. The comparative sequence data revealed 93 unique resistance strain variation sites, which included a mutation in the DNA gyrase subunit A gene. We generated variation-distribution maps comparing the wild and resistant types, and isolated 19 candidates from three discrete resistance-associated high variability regions that had available transposon mutants, to perform a ciprofloxacin exposure assay. Of these region candidates with transposon disruptions, 79% (15/19) showed a reduction in the ability to gain high-level resistance, suggesting that genes within these high variability regions might enrich for certain functions associated with resistance development. PMID:23808957
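
    The variation-distribution maps mentioned in this abstract can be pictured as counts of variant sites per genomic window. The following is a minimal sketch of that idea, not the authors' method: the window size, the threshold, the function names, and the toy positions are all assumptions made for illustration. Only the 15/19 ratio comes from the abstract.

```python
def variant_density(positions, genome_length, window=10_000):
    """Count variant sites in fixed, non-overlapping windows (0-based positions)."""
    n_windows = -(-genome_length // window)  # ceiling division
    counts = [0] * n_windows
    for pos in positions:
        if not 0 <= pos < genome_length:
            raise ValueError(f"position {pos} is outside the genome")
        counts[pos // window] += 1
    return counts


def high_variability_windows(counts, min_sites):
    """Indices of windows holding at least `min_sites` variant sites."""
    return [i for i, c in enumerate(counts) if c >= min_sites]


# Hypothetical variant positions, invented for this example.
toy_positions = [120, 450, 900, 15_200, 15_300, 15_950, 15_990, 42_000]
counts = variant_density(toy_positions, genome_length=50_000, window=10_000)
hot = high_variability_windows(counts, min_sites=3)

# The one figure taken from the abstract: 15 of 19 candidates, reported as 79%.
fraction_reduced = 15 / 19
print(counts, hot, f"{fraction_reduced:.0%}")
```

    A real analysis would likely use overlapping windows and a statistical threshold rather than a fixed count; the fixed cutoff here only keeps the sketch short.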

  2. The skin microbiome in healthy and allergic dogs.

    PubMed

    Rodrigues Hoffmann, Aline; Patterson, Adam P; Diesel, Alison; Lawhon, Sara D; Ly, Hoai Jaclyn; Elkins Stephenson, Christine; Mansell, Joanne; Steiner, Jörg M; Dowd, Scot E; Olivry, Thierry; Suchodolski, Jan S

    2014-01-01

    Changes in the microbial populations on the skin of animals have traditionally been evaluated using conventional microbiology techniques. The sequencing of bacterial 16S rRNA genes has revealed that the human skin is inhabited by a highly diverse and variable microbiome that had previously not been demonstrated by culture-based methods. The goals of this study were to describe the microbiome inhabiting different areas of the canine skin, and to compare the skin microbiome of healthy and allergic dogs. DNA extracted from superficial skin swabs from healthy (n = 12) and allergic dogs (n = 6) from different regions of haired skin and mucosal surfaces was used for 454-pyrosequencing of the 16S rRNA gene. Principal coordinates analysis revealed clustering for the different skin sites across all dogs, with some mucosal sites and the perianal regions clustering separately from the haired skin sites. The rarefaction analysis revealed high individual variability between samples collected from healthy dogs and between the different skin sites. Higher species richness and microbial diversity were observed in the samples from haired skin when compared to mucosal surfaces or mucocutaneous junctions. In all examined regions, the most abundant phylum and family identified in the different regions of skin and mucosal surfaces were Proteobacteria and Oxalobacteriaceae. The skin of allergic dogs had lower species richness when compared to the healthy dogs. The allergic dogs had lower proportions of the Betaproteobacteria Ralstonia spp. when compared to the healthy dogs. The study demonstrates that the skin of dogs is inhabited by much more rich and diverse microbial communities than previously thought using culture-based methods. Our sequence data reveal high individual variability between samples collected from different patients. Differences in species richness were also seen between healthy and allergic dogs, with allergic dogs having lower species richness when compared to healthy dogs.

  3. The Skin Microbiome in Healthy and Allergic Dogs

    PubMed Central

    Rodrigues Hoffmann, Aline; Patterson, Adam P.; Diesel, Alison; Lawhon, Sara D.; Ly, Hoai Jaclyn; Stephenson, Christine Elkins; Mansell, Joanne; Steiner, Jörg M.; Dowd, Scot E.; Olivry, Thierry; Suchodolski, Jan S.

    2014-01-01

    Background: Changes in the microbial populations on the skin of animals have traditionally been evaluated using conventional microbiology techniques. The sequencing of bacterial 16S rRNA genes has revealed that the human skin is inhabited by a highly diverse and variable microbiome that had previously not been demonstrated by culture-based methods. The goals of this study were to describe the microbiome inhabiting different areas of the canine skin, and to compare the skin microbiome of healthy and allergic dogs. Methodology/Principal Findings: DNA extracted from superficial skin swabs from healthy (n = 12) and allergic dogs (n = 6) from different regions of haired skin and mucosal surfaces was used for 454-pyrosequencing of the 16S rRNA gene. Principal coordinates analysis revealed clustering for the different skin sites across all dogs, with some mucosal sites and the perianal regions clustering separately from the haired skin sites. The rarefaction analysis revealed high individual variability between samples collected from healthy dogs and between the different skin sites. Higher species richness and microbial diversity were observed in the samples from haired skin when compared to mucosal surfaces or mucocutaneous junctions. In all examined regions, the most abundant phylum and family identified in the different regions of skin and mucosal surfaces were Proteobacteria and Oxalobacteriaceae. The skin of allergic dogs had lower species richness when compared to the healthy dogs. The allergic dogs had lower proportions of the Betaproteobacteria Ralstonia spp. when compared to the healthy dogs. Conclusions/Significance: The study demonstrates that the skin of dogs is inhabited by much more rich and diverse microbial communities than previously thought using culture-based methods. Our sequence data reveal high individual variability between samples collected from different patients. Differences in species richness were also seen between healthy and allergic dogs, with allergic dogs having lower species richness when compared to healthy dogs. PMID:24421875

  4. Sequence diversity within the reovirus S2 gene: reovirus genes reassort in nature, and their termini are predicted to form a panhandle motif.

    PubMed Central

    Chapell, J D; Goral, M I; Rodgers, S E; dePamphilis, C W; Dermody, T S

    1994-01-01

    To better understand genetic diversity within mammalian reoviruses, we determined S2 nucleotide and deduced sigma 2 amino acid sequences of nine reovirus strains and compared these sequences with those of prototype strains of the three reovirus serotypes. The S2 gene and sigma 2 protein are highly conserved among the four type 1, one type 2, and seven type 3 strains studied. Phylogenetic analyses based on S2 nucleotide sequences of the 12 reovirus strains indicate that diversity within the S2 gene is independent of viral serotype. Additionally, we found marked topological differences between phylogenetic trees generated from S1 and S2 gene nucleotide sequences of the seven type 3 strains. These results demonstrate that reovirus S1 and S2 genes have distinct evolutionary histories, thus providing phylogenetic evidence for lateral transfer of reovirus genes in nature. When variability among the 12 sigma 2-encoding S2 nucleotide sequences was analyzed at synonymous positions, we found that approximately 60 nucleotides at the 5' terminus and 30 nucleotides at the 3' terminus were markedly conserved in comparison with other sigma 2-encoding regions of S2. Predictions of RNA secondary structures indicate that the more conserved S2 sequences participate in the formation of an extended region of duplex RNA interrupted by a pair of stem-loops. Among the 12 deduced sigma 2 amino acid sequences examined, substitutions were observed at only 11% of amino acid positions. This finding suggests that constraints on the structure or function of sigma 2, perhaps in part because of its location in the virion core, have limited sequence diversity within this protein. PMID:8289378

  5. Antigenic Variation in the Lyme Spirochete: Insights into Recombinational Switching with a Suggested Role for Error-Prone Repair.

    PubMed

    Verhey, Theodore B; Castellanos, Mildred; Chaconas, George

    2018-05-29

    The Lyme disease spirochete, Borrelia burgdorferi, uses antigenic variation as a strategy to evade the host's acquired immune response. New variants of surface-localized VlsE are generated efficiently by unidirectional recombination from 15 unexpressed vls cassettes into the vlsE locus. Using algorithms to analyze switching from vlsE sequencing data, we characterize a population of over 45,000 inferred recombination events generated during mouse infection. We present evidence for clustering of these recombination events within the population and along the vlsE gene, a role for the direct repeats flanking the variable region in vlsE, and the importance of sequence homology in determining the location of recombination, despite RecA's dispensability. Finally, we report that non-templated sequence variation is strongly associated with recombinational switching and occurs predominantly at the 5' end of conversion tracts. This likely results from an error-prone repair mechanism operational during recombinational switching that elevates the mutation rate > 5,000-fold in switched regions. Copyright © 2018 The Author(s). Published by Elsevier Inc. All rights reserved.

  6. UniPrime2: a web service providing easier Universal Primer design.

    PubMed

    Boutros, Robin; Stokes, Nicola; Bekaert, Michaël; Teeling, Emma C

    2009-07-01

    The UniPrime2 web server is a publicly available online resource which automatically designs large sets of universal primers when given a gene reference ID or Fasta sequence input by a user. UniPrime2 works by automatically retrieving and aligning homologous sequences from GenBank, identifying regions of conservation within the alignment, and generating suitable primers that can be used to amplify variable genomic regions. In essence, UniPrime2 is a suite of publicly available software packages (Blastn, T-Coffee, GramAlign, Primer3), which reduces the laborious process of primer design, by integrating these programs into a single software pipeline. Hence, UniPrime2 differs from previous primer design web services in that all steps are automated, linked, saved and phylogenetically delimited, only requiring a single user-defined gene reference ID or input sequence. We provide an overview of the web service and wet-laboratory validation of the primers generated. The system is freely accessible at: http://uniprime.batlab.eu. UniPrime2 is licenced under a Creative Commons Attribution Noncommercial-Share Alike 3.0 Licence.
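
    The step of "identifying regions of conservation within the alignment" can be illustrated with a per-column conservation score followed by a scan for conserved runs. This is a rough sketch under stated assumptions, not UniPrime2's actual code: the scoring rule, the threshold, the minimum block length, the function names, and the toy alignment are all hypothetical.

```python
def column_conservation(alignment):
    """Fraction of sequences sharing the most common character in each column."""
    length = len(alignment[0])
    if any(len(seq) != length for seq in alignment):
        raise ValueError("all aligned sequences must have the same length")
    scores = []
    for column in zip(*alignment):
        most_common = max(column.count(ch) for ch in set(column))
        scores.append(most_common / len(column))
    return scores


def conserved_blocks(scores, threshold=1.0, min_len=4):
    """Half-open (start, end) intervals where every column scores >= threshold."""
    blocks, start = [], None
    for i, score in enumerate(scores + [-1.0]):  # sentinel closes a trailing run
        if score >= threshold and start is None:
            start = i
        elif score < threshold and start is not None:
            if i - start >= min_len:
                blocks.append((start, i))
            start = None
    return blocks


# Hypothetical three-sequence alignment, invented for this example.
toy_alignment = [
    "ACGTACGGTTCAGCTA",
    "ACGTACGATACAGCTA",
    "ACGTACGCTGCAGCTA",
]
blocks = conserved_blocks(column_conservation(toy_alignment))
print(blocks)  # conserved flanks that could host primers around a variable core
```

    Gap characters are treated like any other character in this sketch; a production pipeline would handle gaps and ambiguity codes explicitly before handing candidate regions to a primer design tool.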

  7. Molecular genetic diversity of the Saccharomyces yeasts in Taiwan: Saccharomyces arboricola, Saccharomyces cerevisiae and Saccharomyces kudriavzevii.

    PubMed

    Naumov, Gennadi I; Lee, Ching-Fu; Naumova, Elena S

    2013-01-01

    Genetic hybridization, sequence and karyotypic analyses of natural Saccharomyces yeasts isolated in different regions of Taiwan revealed three biological species: Saccharomyces arboricola, Saccharomyces cerevisiae and Saccharomyces kudriavzevii. Intraspecies variability of the D1/D2 and ITS1 rDNA sequences was detected among S. cerevisiae and S. kudriavzevii isolates. According to molecular and genetic analyses, the cosmopolitan species S. cerevisiae and S. kudriavzevii contain local divergent populations in Taiwan, Malaysia and Japan. Six of the seven known Saccharomyces species are documented in East Asia: S. arboricola, S. bayanus, S. cerevisiae, S. kudriavzevii, S. mikatae, and S. paradoxus.

  8. Dissociable Effects on Birdsong of Androgen Signaling in Cortex-Like Brain Regions of Canaries

    PubMed Central

    2017-01-01

    The neural basis of how learned vocalizations change during development and in adulthood represents a major challenge facing cognitive neuroscience. This plasticity in the degree to which learned vocalizations can change in both humans and songbirds is linked to the actions of sex steroid hormones during ontogeny but also in adulthood in the context of seasonal changes in birdsong. We investigated the role of steroid hormone signaling in the brain on distinct features of birdsong using adult male canaries (Serinus canaria), which show extensive seasonal vocal plasticity as adults. Specifically, we bilaterally implanted the potent androgen receptor antagonist flutamide in two key brain regions that control birdsong. We show that androgen signaling in the motor cortical-like brain region, the robust nucleus of the arcopallium (RA), controls syllable and trill bandwidth stereotypy, while not significantly affecting higher order features of song such as syllable-type usage (i.e., how many times each syllable type is used) or syllable sequences. In contrast, androgen signaling in the premotor cortical-like brain region, HVC (proper name), controls song variability by increasing the variability of syllable-type usage and syllable sequences, while having no effect on syllable or trill bandwidth stereotypy. Other aspects of song, such as the duration of trills and the number of syllables per song, were also differentially affected by androgen signaling in HVC versus RA. These results implicate androgens in regulating distinct features of complex motor output in a precise and nonredundant manner. SIGNIFICANCE STATEMENT: Vocal plasticity is linked to the actions of sex steroid hormones, but the precise mechanisms are unclear. We investigated this question in adult male canaries (Serinus canaria), which show extensive vocal plasticity throughout their life. We show that androgens in two cortex-like vocal control brain regions regulate distinct aspects of vocal plasticity. For example, in HVC (proper name), androgens regulate variability in syntax but not phonology, whereas androgens in the robust nucleus of the arcopallium (RA) regulate variability in phonology but not syntax. Temporal aspects of song were also differentially affected by androgen signaling in HVC versus RA. Thus, androgen signaling may reduce vocal plasticity by acting in a nonredundant and precise manner in the brain. PMID:28821656

  9. High resolution telescope and spectrograph observations of solar fine structure in the 1600 A region

    NASA Technical Reports Server (NTRS)

    Cook, J. W.; Brueckner, G. E.; Bartoe, J.-D. F.

    1983-01-01

    High spatial resolution spectroheliograms of the 1600 A region obtained during the HRTS rocket flight of 1978 February 13 are presented. The morphology, fine structure, and temporal behavior of emission bright points (BPs) in active and quiet regions are illustrated. In quiet regions, network elements persist as morphological units, although individual BPs may vary in intensity while usually lasting the flight duration. In cell centers, the BPs are highly variable on a 1 minute time scale. BPs in plages remain more constant in brightness over the observing sequence. BPs cover less than 4 percent of the quiet surface. The lifetime and degree of packing of BPs vary with the local strength of the magnetic field.

  10. Reliable cloning of functional antibody variable domains from hybridomas and spleen cell repertoires employing a reengineered phage display system.

    PubMed

    Krebber, A; Bornhauser, S; Burmester, J; Honegger, A; Willuda, J; Bosshard, H R; Plückthun, A

    1997-02-14

    A prerequisite for the use of recombinant antibody technologies starting from hybridomas or immune repertoires is the reliable cloning of functional immunoglobulin genes. For this purpose, a standard phage display system was optimized for robustness, vector stability, tight control of scFv-delta geneIII expression, primer usage for PCR amplification of variable region genes, scFv assembly strategy and subsequent directional cloning using a single rare cutting restriction enzyme. This integrated cloning, screening and selection system allowed us to rapidly obtain antigen binding scFvs derived from spleen-cell repertoires of mice immunized with ampicillin as well as from all hybridoma cell lines tested to date. As representative examples, cloning of monoclonal antibodies against a his tag, leucine zippers, the tumor marker EGP-2 and the insecticide DDT is presented. Several hybridomas whose genes could not be cloned in previous experimental setups, but were successfully obtained with the present system, expressed high amounts of aberrant heavy and light chain mRNAs, which were amplified by PCR and greatly exceeded the amount of binding antibody sequences. These contaminating variable region genes were successfully eliminated by employing the optimized phage display system, thus avoiding time consuming sequencing of non-binding scFv genes. To maximize soluble expression of functional scFvs subsequent to cloning, a compatible vector series to simplify modification, detection, multimerization and rapid purification of recombinant antibody fragments was constructed.

  11. Reservoirs of Listeria Species in Three Environmental Ecosystems

    PubMed Central

    Linke, Kristina; Rückerl, Irene; Brugger, Katharina; Karpiskova, Renata; Walland, Julia; Muri-Klinger, Sonja; Tichy, Alexander; Wagner, Martin

    2014-01-01

    Soil and water are suggested to represent pivotal niches for the transmission of Listeria monocytogenes to plant material, animals, and the food chain. In the present study, 467 soil and 68 water samples were collected in 12 distinct geological and ecological sites in Austria from 2007 to 2009. Listeria was present in 30% and 26% of the investigated soil and water samples, respectively. Generally, the most dominant species in soil and water samples were Listeria seeligeri, L. innocua, and L. ivanovii. The human- and animal-pathogenic L. monocytogenes was isolated exclusively from 6% of soil samples in regions A (mountainous region) and B (meadow). Distinct ecological preferences were observed for L. seeligeri and L. ivanovii, which were more often isolated from wildlife reserve region C (Lake Neusiedl) and from sites in proximity to wild and domestic ruminants (region A). The higher L. monocytogenes detection and antibiotic resistance rates in regions A and B could be explained by the proximity to agricultural land and urban environment. L. monocytogenes multilocus sequence typing corroborated this evidence since sequence type 37 (ST37), ST91, ST101, and ST517 were repeatedly isolated from regions A and B over several months. A higher L. monocytogenes detection and strain variability was observed during flooding of the river Schwarza (region A) and Danube (region B) in September 2007, indicating dispersion via watercourses. PMID:25002422

  12. Genetic Variability of Beauveria bassiana and a DNA Marker for Environmental Monitoring of a Highly Virulent Isolate Against Cosmopolites sordidus.

    PubMed

    Ferri, D V; Munhoz, C F; Neves, P M O; Ferracin, L M; Sartori, D; Vieira, M L C; Fungaro, M H P

    2012-12-01

    The banana weevil Cosmopolites sordidus (Germar) is one of a number of pests that attack banana crops. The use of the entomopathogenic fungus Beauveria bassiana as a biological control agent for this pest may contribute towards reducing the application of chemical insecticides on banana crops. In this study, the genetic variability of a collection of Brazilian isolates of B. bassiana was evaluated. Samples were obtained from various geographic regions of Brazil, and from different hosts of the Curculionidae family. Based on the DNA fingerprints generated by RAPD and AFLP, we found that 92 and 88 % of the loci were polymorphic, respectively. The B. bassiana isolates were attributed to two genotypic clusters based on the RAPD data, and to three genotypic clusters, when analyzed with AFLP. The nucleotide sequences of nuclear ribosomal DNA intergenic spacers confirmed that all isolates are in fact B. bassiana. Analysis of molecular variance showed that variability among the isolates was not correlated with geographic origin or hosts. A RAPD-specific marker for isolate CG 1024, which is highly virulent to C. sordidus, was cloned and sequenced. Based on the sequences obtained, specific PCR primers BbasCG1024F (5'-TGC GGC TGA GGA GGA CT-3') and BbasCG1024R (5'-TGC GGC TGA GTG TAG AAC-3') were designed for detecting and monitoring this isolate in the field.
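
    As a quick sanity check on the two primers quoted above, their length, GC content, and a rule-of-thumb melting temperature can be computed directly from the sequences. This is a minimal sketch: the Wallace rule (2 °C per A/T, 4 °C per G/C) is only a rough approximation for short oligonucleotides and is not stated in the abstract, and the helper names are assumptions.

```python
from os.path import commonprefix

# Primer sequences as given in the abstract, with the spacing removed.
PRIMERS = {
    "BbasCG1024F": "TGCGGCTGAGGAGGACT",
    "BbasCG1024R": "TGCGGCTGAGTGTAGAAC",
}


def gc_fraction(seq):
    """Fraction of G and C bases in the primer."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def wallace_tm(seq):
    """Rule-of-thumb melting temperature in degrees C: 2*(A+T) + 4*(G+C)."""
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    return 2 * at + 4 * gc


for name, seq in PRIMERS.items():
    print(name, len(seq), f"GC={gc_fraction(seq):.1%}", f"Tm~{wallace_tm(seq)} C")

# Length of the 5' stretch the two primers have in common.
shared_5prime = len(commonprefix(list(PRIMERS.values())))
print("shared 5' bases:", shared_5prime)
```

    The shared 5' stretch is simply what the two published sequences have in common; the abstract does not explain it, so no interpretation is attached here.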

  13. Translating working memory into action: behavioral and neural evidence for using motor representations in encoding visuo-spatial sequences.

    PubMed

    Langner, Robert; Sternkopf, Melanie A; Kellermann, Tanja S; Grefkes, Christian; Kurth, Florian; Schneider, Frank; Zilles, Karl; Eickhoff, Simon B

    2014-07-01

    The neurobiological organization of action-oriented working memory is not well understood. To elucidate the neural correlates of translating visuo-spatial stimulus sequences into delayed (memory-guided) sequential actions, we measured brain activity using functional magnetic resonance imaging while participants encoded sequences of four to seven dots appearing on fingers of a left or right schematic hand. After variable delays, sequences were to be reproduced with the corresponding fingers. Recall became less accurate with longer sequences and was initiated faster after long delays. Across both hands, encoding and recall activated bilateral prefrontal, premotor, superior and inferior parietal regions as well as the basal ganglia, whereas hand-specific activity was found (albeit to a lesser degree during encoding) in contralateral premotor, sensorimotor, and superior parietal cortex. Activation differences after long versus short delays were restricted to motor-related regions, indicating that rehearsal during long delays might have facilitated the conversion of the memorandum into concrete motor programs at recall. Furthermore, basal ganglia activity during encoding selectively predicted correct recall. Taken together, the results suggest that to-be-reproduced visuo-spatial sequences are encoded as prospective action representations (motor intentions), possibly in addition to retrospective sensory codes. Overall, our study supports and extends multi-component models of working memory, highlighting the notion that sensory input can be coded in multiple ways depending on what the memorandum is to be used for. Copyright © 2013 Wiley Periodicals, Inc.

  14. Sequencing of the variable region of rpsB to discriminate between Streptococcus pneumoniae and other streptococcal species.

    PubMed

    Wyllie, Anne L; Pannekoek, Yvonne; Bovenkerk, Sandra; van Engelsdorp Gastelaars, Jody; Ferwerda, Bart; van de Beek, Diederik; Sanders, Elisabeth A M; Trzciński, Krzysztof; van der Ende, Arie

    2017-09-01

    The vast majority of streptococci colonizing the human upper respiratory tract are commensals, only sporadically implicated in disease. Of these, the most pathogenic is Mitis group member, Streptococcus pneumoniae. Phenotypic and genetic similarities between streptococci can cause difficulties in species identification. Using ribosomal S2-gene sequences extracted from whole-genome sequences published from 501 streptococci, we developed a method to identify streptococcal species. We validated this method on non-pneumococcal isolates cultured from cases of severe streptococcal disease (n = 101) and from carriage (n = 103), and on non-typeable pneumococci from asymptomatic individuals (n = 17) and on whole-genome sequences of 1157 pneumococcal isolates from meningitis in the Netherlands. Following this, we tested 221 streptococcal isolates in molecular assays originally assumed specific for S. pneumoniae, targeting cpsA, lytA, piaB, ply, Spn9802, zmpC and capsule-type-specific genes. Cluster analysis of S2-sequences showed grouping according to species in line with published phylogenies of streptococcal core genomes. S2-typing convincingly distinguished pneumococci from non-pneumococcal species (99.2% sensitivity, 100% specificity). Molecular assays targeting regions of lytA and piaB were 100% specific for S. pneumoniae, whereas assays targeting cpsA, ply, Spn9802, zmpC and selected serotype-specific assays (but not capsular sequence typing) showed a lack of specificity. False positive results were over-represented in species associated with carriage, although no particular confounding signal was unique for carriage isolates. © 2017 The Authors.

  15. Sequencing of the variable region of rpsB to discriminate between Streptococcus pneumoniae and other streptococcal species

    PubMed Central

    Pannekoek, Yvonne; Bovenkerk, Sandra; van Engelsdorp Gastelaars, Jody; Ferwerda, Bart; van de Beek, Diederik; Sanders, Elisabeth A. M.; Trzciński, Krzysztof; van der Ende, Arie

    2017-01-01

    The vast majority of streptococci colonizing the human upper respiratory tract are commensals, only sporadically implicated in disease. Of these, the most pathogenic is Mitis group member, Streptococcus pneumoniae. Phenotypic and genetic similarities between streptococci can cause difficulties in species identification. Using ribosomal S2-gene sequences extracted from whole-genome sequences published from 501 streptococci, we developed a method to identify streptococcal species. We validated this method on non-pneumococcal isolates cultured from cases of severe streptococcal disease (n = 101) and from carriage (n = 103), and on non-typeable pneumococci from asymptomatic individuals (n = 17) and on whole-genome sequences of 1157 pneumococcal isolates from meningitis in the Netherlands. Following this, we tested 221 streptococcal isolates in molecular assays originally assumed specific for S. pneumoniae, targeting cpsA, lytA, piaB, ply, Spn9802, zmpC and capsule-type-specific genes. Cluster analysis of S2-sequences showed grouping according to species in line with published phylogenies of streptococcal core genomes. S2-typing convincingly distinguished pneumococci from non-pneumococcal species (99.2% sensitivity, 100% specificity). Molecular assays targeting regions of lytA and piaB were 100% specific for S. pneumoniae, whereas assays targeting cpsA, ply, Spn9802, zmpC and selected serotype-specific assays (but not capsular sequence typing) showed a lack of specificity. False positive results were over-represented in species associated with carriage, although no particular confounding signal was unique for carriage isolates. PMID:28931649
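
    For readers less familiar with the sensitivity and specificity figures quoted above, the arithmetic behind such percentages is sketched below. The confusion-matrix counts are invented purely to illustrate the formulas; the abstract does not report the underlying counts, and the function names are assumptions.

```python
def sensitivity(true_pos, false_neg):
    """Share of true target-species isolates that the assay identifies."""
    return true_pos / (true_pos + false_neg)


def specificity(true_neg, false_pos):
    """Share of non-target isolates that the assay correctly rejects."""
    return true_neg / (true_neg + false_pos)


# Hypothetical counts, NOT taken from the study.
toy_sensitivity = sensitivity(true_pos=124, false_neg=1)
toy_specificity = specificity(true_neg=200, false_pos=0)
print(f"sensitivity={toy_sensitivity:.1%}, specificity={toy_specificity:.0%}")
```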

  16. The Clusters AgeS Experiment (CASE). Variable Stars in the Field of the Globular Cluster M12

    NASA Astrophysics Data System (ADS)

    Kaluzny, J.; Thompson, I. B.; Narloch, W.; Pych, W.; Rozyczka, M.

    2015-09-01

    The field of the globular cluster M12 (NGC 6218) was monitored between 1995 and 2009 in a search for variable stars. BV light curves were obtained for thirty-six periodic or likely periodic variable stars. Thirty-four of these are new detections. Among the latter we identified twenty proper-motion members of the cluster: six detached or semi-detached eclipsing binaries, five contact binaries, five SX Phe pulsators, and three yellow stragglers. Two of the eclipsing binaries are located in the turnoff region, one on the lower main sequence and the remaining three among the blue stragglers. Two contact systems are blue stragglers, and the remaining three reside in the turnoff region. In the blue straggler region a total of 103 objects were found, of which 42 are proper motion members of M12, and another four are field stars. 55 of the remaining objects are located within two core radii from the center of the cluster, and as such they are likely genuine blue stragglers. We also report the discoveries of a radial color gradient of M12, and the shortest period among contact systems in globular clusters in general.

  17. A new high molecular weight immunoglobulin class from the carcharhine shark: implications for the properties of the primordial immunoglobulin.

    PubMed

    Berstein, R M; Schluter, S F; Shen, S; Marchalonis, J J

    1996-04-16

    All immunoglobulins and T-cell receptors throughout phylogeny share regions of highly conserved amino acid sequence. To identify possible primitive immunoglobulins and immunoglobulin-like molecules, we utilized 3' RACE (rapid amplification of cDNA ends) and a highly conserved constant region consensus amino acid sequence to isolate a new immunoglobulin class from the sandbar shark Carcharhinus plumbeus. The immunoglobulin, termed IgW, in its secreted form consists of 782 amino acids and is expressed in both the thymus and the spleen. The molecule overall most closely resembles mu chains of the skate and human and a new putative antigen binding molecule isolated from the nurse shark (NAR). The full-length IgW chain has a variable region resembling human and shark heavy-chain (VH) sequences and a novel joining segment containing the WGXGT motif characteristic of H chains. However, unlike any other H-chain-type molecule, it contains six constant (C) domains. The first C domain contains the cysteine residue characteristic of C mu1 that would allow dimerization with a light (L) chain. The fourth and sixth domains also contain comparable cysteines that would enable dimerization with other H chains or homodimerization. Comparison of the sequences of IgW V and C domains shows homology greater than that found in comparisons among VH and C mu or VL or CL, thereby suggesting that IgW may retain features of the primordial immunoglobulin in evolution.

  18. Epitope mapping of the variable repetitive region with the MB antigen of Ureaplasma urealyticum.

    PubMed Central

    Zheng, X; Lau, K; Frazier, M; Cassell, G H; Watson, H L

    1996-01-01

    One of the major surface structures of Ureaplasma urealyticum recognized by antibodies of patients during infection is the MB antigen. Previously, we showed by Western blot (immunoblot) analysis that any one of the anti-MB monoclonal antibodies (MAbs) 3B1.5, 5B1.1, and 10C6.6 could block the binding of patient antibodies to MB. Subsequent DNA sequencing revealed that a unique six-amino-acid direct tandem repeat region composed the carboxy two-thirds of this antigen. In the present study, using antibody-reactive peptide scanning of this repeat region, we demonstrated that the amino acids defining the epitopes for MAbs 3B1.5, 5B1.1, and 10C6.6 are EQP, GK, and KEQPA, respectively. Peptide scanning analysis of an infected patient's serum antibody response showed that the dominant epitope was defined by the sequence PAGK. Mapping of these continuous epitopes revealed overlap between all MAb and patient polyclonal antibody binding sites, thus explaining the ability of a single MAb to apparently block all polyclonal antibody binding sites. We also show that a single amino acid difference in the sequence of the repeats of serovars 3 and 14 accounts for the lack of reactivity with serovar 14 of two of the serovar 3-specific MAbs. Finally, the data demonstrate the need to obtain the sequences of the mba genes of all serovars before an effective serovar-specific antibody detection method can be developed. PMID:8914774
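
    The overlap between the monoclonal and patient epitopes can be made concrete by locating each epitope on a tandem repeat and comparing the spans. This is a sketch only: the abstract does not give the repeat sequence, so the six-residue unit PAGKEQ used below is an assumption, chosen because a tandem array of it contains all four reported epitopes. The helper names are also hypothetical.

```python
ASSUMED_REPEAT_UNIT = "PAGKEQ"  # assumption, not stated in the abstract
tandem = ASSUMED_REPEAT_UNIT * 3

MAB_EPITOPES = {"3B1.5": "EQP", "5B1.1": "GK", "10C6.6": "KEQPA"}
PATIENT_EPITOPE = "PAGK"


def occurrences(seq, motif):
    """Half-open (start, end) spans of every occurrence of motif in seq."""
    spans, start = [], seq.find(motif)
    while start != -1:
        spans.append((start, start + len(motif)))
        start = seq.find(motif, start + 1)
    return spans


def spans_overlap(a, b):
    """True when two half-open intervals share at least one position."""
    return a[0] < b[1] and b[0] < a[1]


patient_spans = occurrences(tandem, PATIENT_EPITOPE)
overlap_with_patient = {
    mab: any(
        spans_overlap(m, p)
        for m in occurrences(tandem, epitope)
        for p in patient_spans
    )
    for mab, epitope in MAB_EPITOPES.items()
}
print(overlap_with_patient)
```

    Under this assumed repeat, each monoclonal epitope shares at least one residue position with a PAGK site, which is the kind of overlap the abstract describes.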

  19. Rapid Mitochondrial Genome Evolution through Invasion of Mobile Elements in Two Closely Related Species of Arbuscular Mycorrhizal Fungi

    PubMed Central

    Beaudet, Denis; Nadimi, Maryam; Iffis, Bachir; Hijri, Mohamed

    2013-01-01

    Arbuscular mycorrhizal fungi (AMF) are common and important plant symbionts. They have coenocytic hyphae and form multinucleated spores. The nuclear genome of AMF is polymorphic and its organization is not well understood, which makes the development of reliable molecular markers challenging. In stark contrast, their mitochondrial genome (mtDNA) is homogeneous. To assess the intra- and inter-specific mitochondrial variability in closely related Glomus species, we performed 454 sequencing on total genomic DNA of Glomus sp. isolate DAOM-229456 and we compared its mtDNA with two G. irregulare isolates. We found that the mtDNA of Glomus sp. is homogeneous, identical in gene order and, with respect to the sequences of coding regions, almost identical to G. irregulare. However, certain genomic regions vary substantially, due to insertions/deletions of elements such as introns, mitochondrial plasmid-like DNA polymerase genes and mobile open reading frames. We found no evidence of mitochondrial or cytoplasmic plasmids in Glomus species, and mobile ORFs in Glomus are responsible for the formation of four gene hybrids in atp6, atp9, cox2, and nad3, which are most probably the result of horizontal gene transfer and are expressed at the mRNA level. We found evidence for substantial sequence variation in defined regions of mtDNA, even among closely related isolates with otherwise identical coding gene sequences. This variation makes it possible to design reliable intra- and inter-specific markers. PMID:23637766

  20. Rapid mitochondrial genome evolution through invasion of mobile elements in two closely related species of arbuscular mycorrhizal fungi.

    PubMed

    Beaudet, Denis; Nadimi, Maryam; Iffis, Bachir; Hijri, Mohamed

    2013-01-01

    Arbuscular mycorrhizal fungi (AMF) are common and important plant symbionts. They have coenocytic hyphae and form multinucleated spores. The nuclear genome of AMF is polymorphic and its organization is not well understood, which makes the development of reliable molecular markers challenging. In stark contrast, their mitochondrial genome (mtDNA) is homogeneous. To assess the intra- and inter-specific mitochondrial variability in closely related Glomus species, we performed 454 sequencing on total genomic DNA of Glomus sp. isolate DAOM-229456 and we compared its mtDNA with two G. irregulare isolates. We found that the mtDNA of Glomus sp. is homogeneous, identical in gene order and, with respect to the sequences of coding regions, almost identical to G. irregulare. However, certain genomic regions vary substantially, due to insertions/deletions of elements such as introns, mitochondrial plasmid-like DNA polymerase genes and mobile open reading frames. We found no evidence of mitochondrial or cytoplasmic plasmids in Glomus species, and mobile ORFs in Glomus are responsible for the formation of four gene hybrids in atp6, atp9, cox2, and nad3, which are most probably the result of horizontal gene transfer and are expressed at the mRNA level. We found evidence for substantial sequence variation in defined regions of mtDNA, even among closely related isolates with otherwise identical coding gene sequences. This variation makes it possible to design reliable intra- and inter-specific markers.

  1. Use of DNA barcodes to identify flowering plants

    PubMed Central

    Kress, W. John; Wurdack, Kenneth J.; Zimmer, Elizabeth A.; Weigt, Lee A.; Janzen, Daniel H.

    2005-01-01

    Methods for identifying species by using short orthologous DNA sequences, known as “DNA barcodes,” have been proposed and initiated to facilitate biodiversity studies, identify juveniles, associate sexes, and enhance forensic analyses. The cytochrome c oxidase 1 sequence, which has been found to be widely applicable in animal barcoding, is not appropriate for most species of plants because of a much slower rate of cytochrome c oxidase 1 gene evolution in higher plants than in animals. We therefore propose the nuclear internal transcribed spacer region and the plastid trnH-psbA intergenic spacer as potentially usable DNA regions for applying barcoding to flowering plants. The internal transcribed spacer is the most commonly sequenced locus used in plant phylogenetic investigations at the species level and shows high levels of interspecific divergence. The trnH-psbA spacer, although short (≈450-bp), is the most variable plastid region in angiosperms and is easily amplified across a broad range of land plants. Comparison of the total plastid genomes of tobacco and deadly nightshade enhanced with trials on widely divergent angiosperm taxa, including closely related species in seven plant families and a group of species sampled from a local flora encompassing 50 plant families (for a total of 99 species, 80 genera, and 53 families), suggest that the sequences in this pair of loci have the potential to discriminate among the largest number of plant species for barcoding purposes. PMID:15928076

  2. Adler hantavirus, a new genetic variant of Tula virus identified in Major's pine voles (Microtus majori) sampled in southern European Russia.

    PubMed

    Tkachenko, Evgeniy A; Witkowski, Peter T; Radosa, Lukas; Dzagurova, Tamara K; Okulova, Nataliya M; Yunicheva, Yulia V; Vasilenko, Ludmila; Morozov, Vyacheslav G; Malkin, Gennadiy A; Krüger, Detlev H; Klempa, Boris

    2015-01-01

    Although at least 30 novel hantaviruses have been recently discovered in novel hosts such as shrews, moles and even bats, hantaviruses (family Bunyaviridae, genus Hantavirus) are primarily known as rodent-borne human pathogens. Here we report the identification of a novel hantavirus variant associated with a rodent host, Major's pine vole (Microtus majori). Altogether 36 hantavirus PCR-positive Major's pine voles were identified in the Krasnodar region of southern European Russia within the years 2008-2011. Initial partial L-segment sequence analysis revealed novel hantavirus sequences. Moreover, we found a single common vole (Microtus arvalis) infected with Tula virus (TULV). Complete S- and M-segment coding sequences were determined from 11 Major's pine voles originating from 8 trapping sites and subjected to phylogenetic analyses. The data obtained show that Major's pine vole is a newly recognized hantavirus reservoir host. The newfound virus, provisionally called Adler hantavirus (ADLV), is closely related to TULV. Based on amino acid differences to TULV (5.6-8.2% for nucleocapsid protein, 9.4-9.5% for glycoprotein precursor) we propose to consider ADLV as a genotype of TULV. Occurrence of ADLV and TULV in the same region suggests that ADLV is not only a geographical variant of TULV but a host-specific genotype. High intra-cluster nucleotide sequence variability (up to 18%) and geographic clustering indicate long-term presence of the virus in this region. Copyright © 2014. Published by Elsevier B.V.

  3. Extensive Variation and Sub-Structuring in Lineage A mtDNA in Indian Sheep: Genetic Evidence for Domestication of Sheep in India

    PubMed Central

    Singh, Sachin; Kumar Jr, Satish; Kolte, Atul P.; Kumar, Satish

    2013-01-01

    Previous studies on mitochondrial DNA analysis of sheep from different regions of the world have revealed the presence of two major (A and B) and three minor (C, D and E) maternal lineages. Lineage A is more frequent in Asia and lineage B is more abundant in regions other than Asia. We have analyzed mitochondrial DNA sequences of 330 sheep from 12 different breeds of India. Neighbor-joining analysis revealed lineages A, B and C in Indian sheep. Surprisingly, a multidimensional scaling plot based on FST values of control region mtDNA sequences showed significant breed differentiation, in contrast to the poor geographical structuring reported earlier in this species. The breed differentiation in Indian sheep was essentially due to the variable contribution of the two major lineages to different breeds, and to sub-structuring of lineage A, the latter possibly resulting from genetic drift. Nucleotide diversity of this lineage was higher in Indian sheep (0.014 ± 0.007) as compared to that of sheep from other regions of the world (0.009 ± 0.005 to 0.01 ± 0.005). Reduced median network analysis of control region and cytochrome b gene sequences of Indian sheep, when analyzed along with available published sequences of sheep from other regions of the world, showed that several haplotypes of lineage A were exclusive to Indian sheep. Given the high nucleotide diversity in Indian sheep and the poor sharing of lineage A haplotypes between Indian and non-Indian sheep, we propose that lineage A sheep were also domesticated east of the Near East, possibly in the Indian sub-continent. Finally, our data provide support that lineage B and additional lineage A haplotypes of sheep might have been introduced to the Indian sub-continent from the Near East, probably by an ancient sea trade route. PMID:24244282
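
    The nucleotide diversity values quoted in this abstract are averages of pairwise differences per site. A minimal sketch of that calculation follows, assuming pre-aligned sequences of equal length and treating every mismatching column (including gaps) as a difference. The function name and the toy sequences are hypothetical and are not data from the study.

```python
from itertools import combinations


def nucleotide_diversity(seqs):
    """Mean proportion of differing sites over all pairs of aligned sequences."""
    if len(seqs) < 2:
        raise ValueError("need at least two sequences")
    length = len(seqs[0])
    if length == 0 or any(len(s) != length for s in seqs):
        raise ValueError("sequences must be non-empty and aligned to equal length")

    total = 0.0
    n_pairs = 0
    for a, b in combinations(seqs, 2):
        diffs = sum(1 for x, y in zip(a, b) if x != y)
        total += diffs / length
        n_pairs += 1
    return total / n_pairs


# Toy alignment, invented for illustration only.
toy = ["ACGTACGTAC", "ACGTACGTAT", "ACGAACGTAC"]
print(f"pi = {nucleotide_diversity(toy):.4f}")
```

    Published estimates usually come from dedicated population-genetics software that also handles gaps, missing data and sampling variance; this sketch shows only the core average.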

  4. The complete chloroplast genome sequence of strawberry (Fragaria × ananassa Duch.) and comparison with related species of Rosaceae

    PubMed Central

    Cheng, Hui; Li, Jinfeng; Zhang, Hong; Cai, Binhua; Gao, Zhihong

    2017-01-01

    Compared with other members of the family Rosaceae, the chloroplast genomes of Fragaria species exhibit low variation, and this situation has limited phylogenetic analyses; thus, complete chloroplast genome sequencing of Fragaria species is needed. In this study, we sequenced the complete chloroplast genome of F. × ananassa ‘Benihoppe’ using the Illumina HiSeq 2500-PE150 platform and then performed a combination of de novo assembly and reference-guided mapping of contigs to generate complete chloroplast genome sequences. The chloroplast genome exhibits a typical quadripartite structure with a pair of inverted repeats (IRs, 25,936 bp) separated by large (LSC, 85,531 bp) and small (SSC, 18,146 bp) single-copy (SC) regions. The length of the F. × ananassa ‘Benihoppe’ chloroplast genome is 155,549 bp, representing the smallest Fragaria chloroplast genome observed to date. The genome encodes 112 unique genes, comprising 78 protein-coding genes, 30 tRNA genes and four rRNA genes. Comparative analysis of the overall nucleotide sequence identity among ten complete chloroplast genomes confirmed that for both coding and non-coding regions in Rosaceae, SC regions exhibit higher sequence variation than IRs. The Ka/Ks ratio of most genes was less than 1, suggesting that most genes are under purifying selection. Moreover, the mVISTA results also showed a high degree of conservation in genome structure, gene order and gene content in Fragaria, particularly among three octoploid strawberries which were F. × ananassa ‘Benihoppe’, F. chiloensis (GP33) and F. virginiana (O477). However, when the sequences of the coding and non-coding regions of F. × ananassa ‘Benihoppe’ were compared in detail with those of F. chiloensis (GP33) and F. virginiana (O477), a number of SNPs and InDels were revealed by MEGA 7. 
    Six non-coding regions (trnK-matK, trnS-trnG, atpF-atpH, trnC-petN, trnT-psbD and trnP-psaJ) with a percentage of variable sites greater than 1% and no less than five parsimony-informative sites were identified and may be useful for phylogenetic analysis of the genus Fragaria. PMID:29038765
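
    The selection criterion in this abstract (more than 1% variable sites and at least five parsimony-informative sites) can be sketched as a column-wise count over an alignment. The sketch assumes the usual definition of a parsimony-informative site (at least two states, each present in at least two sequences) and ignores gaps and ambiguity codes. The function names and the toy alignment are hypothetical. The study used MEGA 7, not this code.

```python
from collections import Counter


def site_stats(alignment):
    """Return (variable sites, parsimony-informative sites, percent variable)."""
    length = len(alignment[0])
    if length == 0 or any(len(s) != length for s in alignment):
        raise ValueError("sequences must be non-empty and aligned to equal length")

    variable = 0
    informative = 0
    for column in zip(*alignment):
        # Count only unambiguous bases; gaps and N are skipped.
        counts = Counter(base for base in column if base in "ACGT")
        if len(counts) >= 2:
            variable += 1
            # Informative: at least two states each seen in two or more sequences.
            if sum(1 for n in counts.values() if n >= 2) >= 2:
                informative += 1
    return variable, informative, 100.0 * variable / length


def is_candidate_region(alignment, min_pct=1.0, min_informative=5):
    """Apply the thresholds quoted in the abstract to one aligned region."""
    _, informative, pct = site_stats(alignment)
    return pct > min_pct and informative >= min_informative


# Toy alignment, invented for illustration only.
toy = ["ACGTTCGT", "ACGTACGA", "ACCTACGA", "ACCTACGT"]
print(site_stats(toy))
```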

  5. Cloud-based adaptive exon prediction for DNA analysis.

    PubMed

    Putluri, Srinivasareddy; Zia Ur Rahman, Md; Fathima, Shaik Yasmeen

    2018-02-01

    Cloud computing offers significant research and economic benefits to healthcare organisations. Cloud services provide a safe place for storing and managing large amounts of sensitive data. Under the conventional flow of gene information, gene sequence laboratories send out raw and inferred information via the Internet to several sequence libraries. DNA sequencing storage costs can be minimised by use of a cloud service. In this study, the authors put forward a novel genomic informatics system using Amazon Cloud Services, where genomic sequence information is stored and accessed for processing. Accurate identification of exon regions in a DNA sequence is a key task in bioinformatics, which helps in disease identification and drug design. The three-base periodicity property of exons forms the basis of all exon identification techniques. Adaptive signal processing techniques are found to be promising in comparison with several other methods. Several adaptive exon predictors (AEPs) are developed using the variable normalised least mean square algorithm and its maximum normalised variants to reduce computational complexity. Finally, performance evaluation of the various AEPs is done based on measures such as sensitivity, specificity and precision, using various standard genomic datasets taken from the National Center for Biotechnology Information genomic sequence database.
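
    The three-base periodicity mentioned in this abstract can be illustrated without the adaptive filters the authors describe. The sketch below swaps in a plain discrete Fourier measure: it converts a sequence into four binary indicator tracks (one per base) and sums their spectral power at a frequency of one cycle per three bases. This is not the authors' normalised least mean square predictor. The function names, window size and toy sequences are assumptions made for illustration.

```python
import cmath


def period3_power(seq):
    """Spectral power at 1/3 cycles per base, summed over A, C, G, T indicators.

    The result is divided by len(seq)**2 so that windows of different
    lengths are comparable.
    """
    n = len(seq)
    if n == 0:
        raise ValueError("empty sequence")
    total = 0.0
    for base in "ACGT":
        acc = 0j
        for i, ch in enumerate(seq):
            if ch == base:
                acc += cmath.exp(-2j * cmath.pi * i / 3)
        total += abs(acc) ** 2
    return total / (n * n)


def sliding_period3(seq, window=30, step=3):
    """Return (start, power) pairs for successive windows along the sequence."""
    return [(start, period3_power(seq[start:start + window]))
            for start in range(0, len(seq) - window + 1, step)]


# Toy sequence, invented for illustration: a codon-like repeat between homopolymer runs.
toy = "A" * 30 + "ATG" * 10 + "A" * 30
for start, power in sliding_period3(toy):
    print(start, round(power, 3))
```

    Windows that fall on the codon-like repeat give a much larger value than the flanking homopolymer runs. Exon predictors threshold a track of this kind, and the adaptive approaches in the abstract aim to obtain it at lower computational cost.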

  6. Sequence Analysis and Domain Motifs in the Porcine Skin Decorin Glycosaminoglycan Chain

    PubMed Central

    Zhao, Xue; Yang, Bo; Solakylidirim, Kemal; Joo, Eun Ji; Toida, Toshihiko; Higashi, Kyohei; Linhardt, Robert J.; Li, Lingyun

    2013-01-01

    Decorin proteoglycan is comprised of a core protein containing a single O-linked dermatan sulfate/chondroitin sulfate glycosaminoglycan (GAG) chain. Although the sequence of the decorin core protein is determined by the gene encoding its structure, the structure of its GAG chain is determined in the Golgi. The recent application of modern MS to bikunin, a far simpler chondroitin sulfate proteoglycan, suggests that it has a single or small number of defined sequences. On this basis, a similar approach was undertaken to sequence the much larger and more structurally complex dermatan sulfate/chondroitin sulfate GAG chain of porcine skin decorin. This approach resulted in information on the consistency/variability of its linkage region at the reducing end of the GAG chain, its iduronic acid-rich domain, glucuronic acid-rich domain, and non-reducing end. A general motif for the porcine skin decorin GAG chain was established. A single small decorin GAG chain was sequenced using MS/MS analysis. The data obtained in the study suggest that the decorin GAG chain has a small or a limited number of sequences. PMID:23423381

  7. Detection of Plasmodium sp. in capybara.

    PubMed

    dos Santos, Leonilda Correia; Curotto, Sandra Mara Rotter; de Moraes, Wanderlei; Cubas, Zalmir Silvino; Costa-Nascimento, Maria de Jesus; de Barros Filho, Ivan Roque; Biondo, Alexander Welker; Kirchgatter, Karin

    2009-07-07

    In the present study, we have microscopically and molecularly surveyed blood samples from 11 captive capybaras (Hydrochaeris hydrochaeris) from the Sanctuary Zoo for Plasmodium sp. infection. One animal was positive on blood smear by light microscopy. Polymerase chain reaction was carried out using a nested genus-specific protocol, which uses oligonucleotides from conserved sequences flanking a variable sequence region in the small subunit ribosomal RNA (ssrRNA) of all Plasmodium organisms. This revealed three positive animals. Products from two samples were purified and sequenced. The results showed less than 1% divergence between the two capybara sequences. When compared with GenBank sequences, a 55% similarity was obtained to Toxoplasma gondii and a higher similarity (73-77.2%) was found to ssrRNAs from Plasmodium species that infect reptiles, birds, rodents, and human beings. The most similar Plasmodium sequence was from Plasmodium mexicanum, which infects lizards of North America, where around 78% identity was found. This work is the first report of Plasmodium in capybaras, and due to the low similarity with other Plasmodium species, we suggest it is a new species, which in the future could be named "Plasmodium hydrochaeri".

  8. A comparative analysis of exome capture.

    PubMed

    Parla, Jennifer S; Iossifov, Ivan; Grabill, Ian; Spector, Mona S; Kramer, Melissa; McCombie, W Richard

    2011-09-29

    Human exome resequencing using commercial target capture kits has been and is being used for sequencing large numbers of individuals to search for variants associated with various human diseases. We rigorously evaluated the capabilities of two solution exome capture kits. These analyses help clarify the strengths and limitations of those data as well as systematically identify variables that should be considered in the use of those data. Each exome kit performed well at capturing the targets it was designed to capture, which mainly correspond to the consensus coding sequences (CCDS) annotations of the human genome. In addition, based on their respective targets, each capture kit coupled with high coverage Illumina sequencing produced highly accurate nucleotide calls. However, other databases, such as the Reference Sequence collection (RefSeq), define the exome more broadly, and so, not surprisingly, the exome kits did not capture these additional regions. Commercial exome capture kits provide a very efficient way to sequence select areas of the genome at very high accuracy. Here we provide the data to help guide critical analyses of sequencing data derived from these products.

  9. Nucleotide sequence of an exceptionally long 5.8S ribosomal RNA from Crithidia fasciculata.

    PubMed Central

    Schnare, M N; Gray, M W

    1982-01-01

    In Crithidia fasciculata, a trypanosomatid protozoan, the large ribosomal subunit contains five small RNA species (e, f, g, i, j) in addition to 5S rRNA [Gray, M.W. (1981) Mol. Cell. Biol. 1, 347-357]. The complete primary sequence of species i is shown here to be pAACGUGUmCGCGAUGGAUGACUUGGCUUCCUAUCUCGUUGA ... AGAmACGCAGUAAAGUGCGAUAAGUGGUApsiCAAUUGmCAGAAUCAUUCAAUUACCGAAUCUUUGAACGAAACGG ... CGCAUGGGAGAAGCUCUUUUGAGUCAUCCCCGUGCAUGCCAUAUUCUCCAmGUGUCGAA(C)OH. This sequence establishes that species i is a 5.8S rRNA, despite its exceptional length (171-172 nucleotides). The extra nucleotides in C. fasciculata 5.8S rRNA are located in a region whose primary sequence and length are highly variable among 5.8S rRNAs, but which is capable of forming a stable hairpin loop structure (the "G+C-rich hairpin"). The sequence of C. fasciculata 5.8S rRNA is no more closely related to that of another protozoan, Acanthamoeba castellanii, than it is to representative 5.8S rRNA sequences from the other eukaryotic kingdoms, emphasizing the deep phylogenetic divisions that seem to exist within the Kingdom Protista. Images PMID:7079176

  10. Recombination-dependent replication and gene conversion homogenize repeat sequences and diversify plastid genome structure.

    PubMed

    Ruhlman, Tracey A; Zhang, Jin; Blazier, John C; Sabir, Jamal S M; Jansen, Robert K

    2017-04-01

    There is a misinterpretation in the literature regarding the variable orientation of the small single copy region of plastid genomes (plastomes). The common phenomenon of small and large single copy inversion, hypothesized to occur through intramolecular recombination between inverted repeats (IR) in a circular, single unit-genome, in fact, more likely occurs through recombination-dependent replication (RDR) of linear plastome templates. If RDR can be primed through both intra- and intermolecular recombination, then this mechanism could not only create inversion isomers of so-called single copy regions, but also an array of alternative sequence arrangements. We used Illumina paired-end and PacBio single-molecule real-time (SMRT) sequences to characterize repeat structure in the plastome of Monsonia emarginata (Geraniaceae). We used OrgConv and inspected nucleotide alignments to infer ancestral nucleotides and identify gene conversion among repeats and mapped long (>1 kb) SMRT reads against the unit-genome assembly to identify alternative sequence arrangements. Although M. emarginata lacks the canonical IR, we found that large repeats (>1 kilobase; kb) represent ∼22% of the plastome nucleotide content. Among the largest repeats (>2 kb), we identified GC-biased gene conversion and mapping filtered, long SMRT reads to the M. emarginata unit-genome assembly revealed alternative, substoichiometric sequence arrangements. We offer a model based on RDR and gene conversion between long repeated sequences in the M. emarginata plastome and provide support that both intra-and intermolecular recombination between large repeats, particularly in repeat-rich plastomes, varies unit-genome structure while homogenizing the nucleotide sequence of repeats. © 2017 Botanical Society of America.

  11. The population structure of Vibrio cholerae from the Chandigarh Region of Northern India.

    PubMed

    Abd El Ghany, Moataz; Chander, Jagadish; Mutreja, Ankur; Rashid, Mamoon; Hill-Cawthorne, Grant A; Ali, Shahjahan; Naeem, Raeece; Thomson, Nicholas R; Dougan, Gordon; Pain, Arnab

    2014-07-01

    Cholera infection continues to be a threat to global public health. The current cholera pandemic associated with Vibrio cholerae El Tor has now been ongoing for over half a century. Thirty-eight V. cholerae El Tor isolates associated with a cholera outbreak in 2009 from the Chandigarh region of India were characterised by a combination of microbiology, molecular typing and whole-genome sequencing. The genomic analysis indicated that two clones of V. cholerae circulated in the region and caused disease during this time. These clones fell into two distinct sub-clades that map independently onto wave 3 of the phylogenetic tree of seventh pandemic V. cholerae El Tor. Sequence analyses of the cholera toxin gene, the Vibrio seventh Pandemic Island II (VSPII) and SXT element correlated with this phylogenetic position of the two clades on the El Tor tree. The clade 2 isolates, characterized by a drug-resistant profile and the expression of a distinct cholera toxin, are closely related to the recent V. cholerae isolated elsewhere, including Haiti, but fell on a distinct branch of the tree, showing they were independent outbreaks. Multi-Locus Sequence Typing (MLST) distinguished two sequence types among the 38 isolates that did not correspond to the clades defined by whole-genome sequencing. Multi-Locus Variable-length tandem-nucleotide repeat Analysis (MLVA) identified 16 distinct clusters. The use of whole-genome sequencing enabled the identification of two clones of V. cholerae that circulated during the 2009 Chandigarh outbreak. These clones harboured a similar structure of ICEVchHai1 but differed mainly in the structure of CTX phage and VSPII. The limited capacity of MLST and MLVA to discriminate between the clones that circulated in the 2009 Chandigarh outbreak highlights the value of whole-genome sequencing as a route to the identification of further genetic markers to subtype V. cholerae isolates.

  12. Aftershock Forecasting: Recent Developments and Lessons from the 2016 M5.8 Pawnee, Oklahoma, Earthquake

    NASA Astrophysics Data System (ADS)

    Michael, A. J.; Field, E. H.; Hardebeck, J.; Llenos, A. L.; Milner, K. R.; Page, M. T.; Perry, S. C.; van der Elst, N.; Wein, A. M.

    2016-12-01

    After the Mw 5.8 Pawnee, Oklahoma, earthquake of September 3, 2016 the USGS issued a series of aftershock forecasts for the next month and year. These forecasts were aimed at the emergency response community, those making decisions about well operations in the affected region, and the general public. The forecasts were generated manually using methods planned for automatically released Operational Aftershock Forecasts. The underlying method is from Reasenberg and Jones (Science, 1989) with improvements recently published in Page et al. (BSSA, 2016), implemented in a JAVA Graphical User Interface and presented in a template that is under development. The methodological improvements include initial models based on the tectonic regime as defined by Garcia et al. (BSSA, 2012) and the inclusion of both uncertainty in the clustering parameters and natural random variability. We did not utilize the time-dependent magnitude of completeness model from Page et al. because it applies only to teleseismic events recorded by NEIC. The parameters for Garcia's Generic Active Continental Region underestimated the modified-Omori decay parameter and underestimated the aftershock rate by a factor of 2. And the sequence following the Mw 5.7 Prague, Oklahoma, earthquake of November 6, 2011 was about 3 to 4 times more productive than the Pawnee sequence. The high productivity for these potentially induced sequences is consistent with an increase in productivity in Oklahoma since 2009 (Llenos and Michael, BSSA, 2013) and makes a general tectonic model inapplicable to sequences in this region. Soon after the mainshock occurred, the forecasts relied on the sequence specific parameters. After one month, the Omori decay parameter p is less than one, implying a very long-lived sequence. However, the decay parameter is known to be biased low at early times due to secondary aftershock triggering, and the p-value determined early in the sequence may be inaccurate for long-term forecasting.
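
    The Reasenberg and Jones method named in this abstract models the rate of aftershocks of magnitude M or larger at time t after a mainshock of magnitude Mm as 10^(a + b(Mm - M)) (t + c)^(-p). The sketch below integrates that rate over a forecast window and converts the expected count to a probability under a Poisson assumption. The default parameter values are the generic California values published with the 1989 method and are used only as placeholders; they are not the Oklahoma sequence-specific parameters, which the abstract does not report. The function names are hypothetical.

```python
import math


def expected_aftershocks(mainshock_mag, min_mag, t_start, t_end,
                         a=-1.67, b=0.91, c=0.05, p=1.08):
    """Expected number of aftershocks >= min_mag between t_start and t_end (days).

    Integrates 10**(a + b*(Mm - M)) * (t + c)**(-p) over the time window.
    Default a, b, c, p are generic placeholder values, not fitted to any sequence.
    """
    if t_end < t_start or t_start < 0:
        raise ValueError("require 0 <= t_start <= t_end")
    productivity = 10.0 ** (a + b * (mainshock_mag - min_mag))
    if abs(p - 1.0) < 1e-12:
        time_term = math.log((t_end + c) / (t_start + c))
    else:
        time_term = ((t_end + c) ** (1.0 - p) - (t_start + c) ** (1.0 - p)) / (1.0 - p)
    return productivity * time_term


def probability_of_one_or_more(expected_count):
    """Poisson probability of at least one event given the expected count."""
    return 1.0 - math.exp(-expected_count)


# Illustration only: M >= 4 aftershocks in days 30-365 after an M 5.8 mainshock,
# comparing the placeholder decay exponent with a hypothetical p below one.
for p_value in (1.08, 0.9):
    n = expected_aftershocks(5.8, 4.0, 30.0, 365.0, p=p_value)
    print(p_value, n, probability_of_one_or_more(n))
```

    Lowering p with the other parameters fixed raises the expected count at late times, which is why a decay parameter biased low early in a sequence matters for long-term forecasts.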

  13. Characteristics of the nuclear (18S, 5.8S, 28S and 5S) and mitochondrial (12S and 16S) rRNA genes of Apis mellifera (Insecta: Hymenoptera): structure, organization, and retrotransposable elements

    PubMed Central

    Gillespie, J J; Johnston, J S; Cannone, J J; Gutell, R R

    2006-01-01

    As an accompanying manuscript to the release of the honey bee genome, we report the entire sequence of the nuclear (18S, 5.8S, 28S and 5S) and mitochondrial (12S and 16S) ribosomal RNA (rRNA)-encoding gene sequences (rDNA) and related internally and externally transcribed spacer regions of Apis mellifera (Insecta: Hymenoptera: Apocrita). Additionally, we predict secondary structures for the mature rRNA molecules based on comparative sequence analyses with other arthropod taxa and reference to recently published crystal structures of the ribosome. In general, the structures of honey bee rRNAs are in agreement with previously predicted rRNA models from other arthropods in core regions of the rRNA, with little additional expansion in non-conserved regions. Our multiple sequence alignments are made available on several public databases and provide a preliminary establishment of a global structural model of all rRNAs from the insects. Additionally, we provide conserved stretches of sequences flanking the rDNA cistrons that comprise the externally transcribed spacer regions (ETS) and part of the intergenic spacer region (IGS), including several repetitive motifs. Finally, we report the occurrence of retrotransposition in the nuclear large subunit rDNA, as R2 elements are present in the usual insertion points found in other arthropods. Interestingly, functional R1 elements usually present in the genomes of insects were not detected in the honey bee rRNA genes. The reverse transcriptase products of the R2 elements are deduced from their putative open reading frames and structurally aligned with those from another hymenopteran insect, the jewel wasp Nasonia (Pteromalidae). Stretches of conserved amino acids shared between Apis and Nasonia are illustrated and serve as potential sites for primer design, as target amplicons within these R2 elements may serve as novel phylogenetic markers for Hymenoptera. 
    Given the impending completion of the sequencing of the Nasonia genome, we expect our report eventually to shed light on the evolution of the hymenopteran genome within higher insects, particularly regarding the relative maintenance of conserved rDNA genes, related variable spacer regions and retrotransposable elements. PMID:17069639

  14. The complete Einstein Observatory X-ray survey of the Orion Nebula region.

    NASA Technical Reports Server (NTRS)

    Gagne, Marc; Caillault, Jean-Pierre

    1994-01-01

    We have analyzed archival Einstein Observatory images of a roughly 4.5 square degree region centered on the Orion Nebula. In all, 245 distinct X-ray sources have been detected in six High Resolution Imager (HRI) and 17 Imaging Proportional Counter (IPC) observations. An optical database of over 2700 stars has been assembled to search for candidate counterparts to the X-ray sources. Roughly half the X-ray sources are identified with a single Orion Nebula cluster member. The 10 main-sequence O6-B5 cluster stars detected in Orion have X-ray activity levels comparable to field O and B stars. X-ray emission has also been detected in the direction of four main-sequence late-B and early-A type stars. Since the mechanisms producing X-rays in late-type coronae and early-type winds cannot operate in the late-B and early-A type atmospheres, we argue that the observed X-rays, with L(sub X) approximately = 3 x 10(exp 30) ergs/s, are probably produced in the coronae of unseen late-type binary companions. Over 100 X-ray sources have been associated with late-type pre-main sequence stars. The upper envelope of X-ray activity rises sharply from mid-F to late-G, with L(sub x)/L(sub bol) in the range 10(exp -4) to 2 x 10(exp -3) for stars later than approximately G7. We have looked for variability of the late-type cluster members on timescales of a day to a year and find that 1/4 of the stars show significantly variable X-ray emission. A handful of the late-type stars have published rotational periods and spectroscopic rotational velocities; however, we see no correlation between X-ray activity and rotation. Thus, for this sample of pre-main-sequence stars, the large dispersion in X-ray activity does not appear to be caused by the dispersion in rotation, in contrast with results obtained for low-mass main-sequence stars in the Pleiades and pre-main-sequence stars in Taurus-Auriga.

  15. The Microbial Ferrous Wheel in a Neutral pH Groundwater Seep

    PubMed Central

    Roden, Eric E.; McBeth, Joyce M.; Blöthe, Marco; Percak-Dennett, Elizabeth M.; Fleming, Emily J.; Holyoke, Rebecca R.; Luther, George W.; Emerson, David; Schieber, Juergen

    2012-01-01

    Evidence for microbial Fe redox cycling was documented in a circumneutral pH groundwater seep near Bloomington, Indiana. Geochemical and microbiological analyses were conducted at two sites, a semi-consolidated microbial mat and a floating puffball structure. In situ voltammetric microelectrode measurements revealed steep opposing gradients of O2 and Fe(II) at both sites, similar to other groundwater seep and sedimentary environments known to support microbial Fe redox cycling. The puffball structure showed an abrupt increase in dissolved Fe(II) just at its surface (∼5 cm depth), suggesting an internal Fe(II) source coupled to active Fe(III) reduction. Most probable number enumerations detected microaerophilic Fe(II)-oxidizing bacteria (FeOB) and dissimilatory Fe(III)-reducing bacteria (FeRB) at densities of 10^2 to 10^5 cells mL^-1 in samples from both sites. In vitro Fe(III) reduction experiments revealed the potential for immediate reduction (no lag period) of native Fe(III) oxides. Conventional full-length 16S rRNA gene clone libraries were compared with high throughput barcode sequencing of the V1, V4, or V6 variable regions of 16S rRNA genes in order to evaluate the extent to which new sequencing approaches could provide enhanced insight into the composition of Fe redox cycling microbial community structure. The composition of the clone libraries suggested a lithotroph-dominated microbial community centered around taxa related to known FeOB (e.g., Gallionella, Sideroxydans, Aquabacterium). Sequences related to recognized FeRB (e.g., Rhodoferax, Aeromonas, Geobacter, Desulfovibrio) were also well-represented. Overall, sequences related to known FeOB and FeRB accounted for 88 and 59% of total clone sequences in the mat and puffball libraries, respectively. Taxa identified in the barcode libraries showed partial overlap with the clone libraries, but were not always consistent across different variable regions and sequencing platforms. However, the barcode libraries provided confirmation of key clone library results (e.g., the predominance of Betaproteobacteria) and an expanded view of lithotrophic microbial community composition. PMID:22783228

  16. HLA-E coding and 3' untranslated region variability determined by next-generation sequencing in two West-African population samples.

    PubMed

    Castelli, Erick C; Mendes-Junior, Celso T; Sabbagh, Audrey; Porto, Iane O P; Garcia, André; Ramalho, Jaqueline; Lima, Thálitta H A; Massaro, Juliana D; Dias, Fabrício C; Collares, Cristhianna V A; Jamonneau, Vincent; Bucheton, Bruno; Camara, Mamadou; Donadi, Eduardo A

    2015-12-01

    HLA-E is a non-classical Human Leucocyte Antigen class I gene with immunomodulatory properties. Whereas HLA-E expression usually occurs at low levels, it is widely distributed amongst human tissues, has the ability to bind self and non-self antigens and to interact with NK cells and T lymphocytes, being important for immunosurveillance and also for fighting against infections. HLA-E is usually the most conserved locus among all class I genes. However, most of the previous studies evaluating HLA-E variability sequenced only a few exons or genotyped known polymorphisms. Here we report a strategy to evaluate HLA-E variability by next-generation sequencing (NGS) that might be applied to other HLA loci, and present the HLA-E haplotype diversity considering the segment encoding the entire HLA-E mRNA (including 5'UTR, introns and the 3'UTR) in two African population samples, Susu from Guinea-Conakry and Lobi from Burkina Faso. Our results indicate that (a) the HLA-E gene is indeed conserved, encoding mainly two different protein molecules; (b) Africans do present several unknown HLA-E alleles presenting synonymous mutations; (c) the HLA-E 3'UTR is quite polymorphic and (d) haplotypes in the HLA-E 3'UTR are in close association with HLA-E coding alleles. NGS has proved to be an important tool for data generation for future studies evaluating variability in non-classical MHC genes. Copyright © 2015 American Society for Histocompatibility and Immunogenetics. Published by Elsevier Inc. All rights reserved.

  17. Diversity and population-genetic properties of copy number variations and multicopy genes in cattle

    PubMed Central

    Bickhart, Derek M.; Xu, Lingyang; Hutchison, Jana L.; Cole, John B.; Null, Daniel J.; Schroeder, Steven G.; Song, Jiuzhou; Garcia, Jose Fernando; Sonstegard, Tad S.; Van Tassell, Curtis P.; Schnabel, Robert D.; Taylor, Jeremy F.; Lewin, Harris A.; Liu, George E.

    2016-01-01

    The diversity and population genetics of copy number variation (CNV) in domesticated animals are not well understood. In this study, we analysed 75 genomes of major taurine and indicine cattle breeds (including Angus, Brahman, Gir, Holstein, Jersey, Limousin, Nelore, and Romagnola), sequenced to 11-fold coverage to identify 1,853 non-redundant CNV regions. Supported by high validation rates in array comparative genomic hybridization (CGH) and qPCR experiments, these CNV regions accounted for 3.1% (87.5 Mb) of the cattle reference genome, representing a significant increase over previous estimates of the area of the genome that is copy number variable (∼2%). Further population genetics and evolutionary genomics analyses based on these CNVs revealed the population structures of the cattle taurine and indicine breeds and uncovered potential diversely selected CNVs near important functional genes, including AOX1, ASZ1, GAT, GLYAT, and KRTAP9-1. Additionally, 121 CNV gene regions were found to be either breed specific or differentially variable across breeds, such as RICTOR in dairy breeds and PNPLA3 in beef breeds. In contrast, clusters of the PRP and PAG genes were found to be duplicated in all sequenced animals, suggesting that subfunctionalization, neofunctionalization, or overdominance play roles in diversifying those fertility-related genes. These CNV results provide a new glimpse into the diverse selection histories of cattle breeds and a basis for correlating structural variation with complex traits in the future. PMID:27085184

  18. J Genes for Heavy Chain Immunoglobulins of Mouse

    NASA Astrophysics Data System (ADS)

    Newell, Nanette; Richards, Julia E.; Tucker, Philip W.; Blattner, Frederick R.

    1980-09-01

    A 15.8-kilobase pair fragment of BALB/c mouse liver DNA, cloned in the Charon 4Aλ phage vector system, was shown to contain the μ heavy chain constant region (CHμ) gene for the mouse immunoglobulin M. In addition, this fragment of DNA contains at least two J genes, used to code for the carboxyl terminal portion of heavy chain variable regions. These genes are located in genomic DNA about eight kilobase pairs to the 5' side of the CHμ gene. The complete nucleotide sequence of a 1120-base pair stretch of DNA that includes the two J genes has been determined.

  19. Genetic diversity and molecular evolution of Naga King Chili inferred from internal transcribed spacer sequence of nuclear ribosomal DNA.

    PubMed

    Kehie, Mechuselie; Kumaria, Suman; Devi, Khumuckcham Sangeeta; Tandon, Pramod

    2016-02-01

    Sequences of the Internal Transcribed Spacer (ITS1-5.8S-ITS2) of nuclear ribosomal DNAs were explored to study the genetic diversity and molecular evolution of Naga King Chili. Our study indicated the occurrence of nucleotide polymorphism and haplotypic diversity in the ITS regions. The present study demonstrated that the variability of ITS1 with respect to nucleotide diversity and sequence polymorphism exceeded that of ITS2. Sequence analysis of the 5.8S gene revealed a highly conserved region in all the accessions of Naga King Chili. However, a strongly phylogenetically informative feature of this species is a distinct 13 bp deletion in the 5.8S gene, which discriminated Naga King Chili from the rest of the Capsicum sp. Neutrality test results implied neutral variation, and the population seems to be evolving at drift-mutation equilibrium and free from directed selection pressure. Furthermore, mismatch analysis showed a multimodal curve, indicating a demographic equilibrium. Phylogenetic relationships revealed by Median Joining Network (MJN) analysis denoted a clear discrimination of Naga King Chili from its closest sister species (Capsicum chinense and Capsicum frutescens). The absence of a star-like network of haplotypes suggested an ancient population expansion of this chili.

  20. Sequence heterogeneities of genes encoding 16S rRNAs in Paenibacillus polymyxa detected by temperature gradient gel electrophoresis.

    PubMed Central

    Nübel, U; Engelen, B; Felske, A; Snaidr, J; Wieshuber, A; Amann, R I; Ludwig, W; Backhaus, H

    1996-01-01

    Sequence heterogeneities in 16S rRNA genes from individual strains of Paenibacillus polymyxa were detected by sequence-dependent separation of PCR products by temperature gradient gel electrophoresis (TGGE). A fragment of the 16S rRNA genes, comprising variable regions V6 to V8, was used as a target sequence for amplifications. PCR products from P. polymyxa (type strain) emerged as a well-defined pattern of bands in the gradient gel. Six plasmids with different inserts, individually demonstrating the migration characteristics of single bands of the pattern, were obtained by cloning the PCR products. Their sequences were analyzed as a representative sample of the total heterogeneity. A total of 10 variant nucleotide positions in the fragment of 347 bp was observed, with all substitutions conserving the relevant secondary structures of the V6 and V8 regions in the RNA molecules. Hybridizations with specifically designed probes demonstrated different chromosomal locations of the respective rRNA genes. Amplifications of reverse-transcribed rRNA from ribosome preparations, as well as whole-cell hybridizations, revealed a predominant representation of particular sequences in ribosomes of exponentially growing laboratory cultures. Different strains of P. polymyxa showed not only remarkably differing patterns of PCR products in TGGE analysis but also discriminative whole-cell labeling with the designed oligonucleotide probes, indicating the different representation of individual sequences in active ribosomes. Our results demonstrate the usefulness of TGGE for the structural analysis of heterogeneous rRNA genes together with their expression, stress problems of the generation of meaningful data for 16S rRNA sequences and probe designs, and might have consequences for evolutionary concepts. PMID:8824607

  1. The hypervariable region 1 protein of hepatitis C virus broadly reactive with sera of patients with chronic hepatitis C has a similar amino acid sequence with the consensus sequence.

    PubMed

    Watanabe, K; Yoshioka, K; Ito, H; Ishigami, M; Takagi, K; Utsunomiya, S; Kobayashi, M; Kishimoto, H; Yano, M; Kakumu, S

    1999-11-10

    Hypervariable region 1 (HVR1) proteins of hepatitis C virus (HCV) have been reported to react broadly with sera of patients with HCV infection. However, the variability of the broad reactivity of individual HVR1 proteins has not been elucidated. We assessed the reactivity of 25 different HVR1 proteins (genotype 1b) with sera of 81 patients with HCV infection (genotype 1b) by Western blot. HVR1 proteins reacted with 2-60 sera. The number of sera reactive with each HVR1 protein significantly correlated with the number of amino acid residues identical to the consensus sequence defined by Puntoriero et al. (G. Puntoriero, A. Lahm, S. Zucchelli, B. B. Ercole, R. Tafi, M. Penzzanera, M. U. Mondelli, R. Cortese, A. Tramontano, G. Galfre', and A. Nicosia. 1998. EMBO J. 17, 3521-3533) (r = 0.561, P < 0.005). The most widely reactive HVR1 protein, 12-22, had a sequence similar to the consensus sequence. The peptide with the C-terminal 13-amino-acid sequence of HVR1 protein 12-22 (NH2-CSFTSLFTPGPSQK) was injected into rabbits as an immunogen. The rabbit immune sera reacted with 9 of 25 HVR1 proteins of genotype 1b including HVR1 protein 12-22 and with 3 of 12 proteins of genotype 2a. These results indicate that the HVR1 protein broadly reactive with patients' sera has a sequence similar to the consensus sequence, can induce broadly reactive sera, and could be one of the candidate immunogens in a prophylactic vaccine against HCV. Copyright 1999 Academic Press.

  2. Characterization of the two intra-individual sequence variants in the 18S rRNA gene in the plant parasitic nematode, Rotylenchulus reniformis.

    PubMed

    Nyaku, Seloame T; Sripathi, Venkateswara R; Kantety, Ramesh V; Gu, Yong Q; Lawrence, Kathy; Sharma, Govind C

    2013-01-01

    The 18S rRNA gene is fundamental to cellular and organismal protein synthesis and, because of its stable persistence through generations, it is also used in phylogenetic analysis among taxa. Sequence variation in this gene within a single species is rare, but it has been observed in a few metazoan organisms. More frequently, it has been reported in the non-transcribed spacer region. Here, we have identified two sequence variants within the near full coding region of the 18S rRNA gene from a single reniform nematode (RN) Rotylenchulus reniformis, labeled as reniform nematode variant 1 (RN_VAR1) and variant 2 (RN_VAR2). All sequences from three of the four isolates had both RN variants in their sequences; however, isolate 13B had only the RN variant 2 sequence. Specific variable base sites (96 or 5.5%) were found within the 18S rRNA gene that can clearly distinguish the two 18S rDNA variants of RN, in 11 (25.0%) and 33 (75.0%) of the 44 RN clones, for RN_VAR1 and RN_VAR2, respectively. Neighbor-joining trees show that RN_VAR1 is very similar to the previously existing R. reniformis sequence in GenBank, while the RN_VAR2 sequence is more divergent. This is the first report of the identification of two major variants of the 18S rRNA gene in the same single RN; it documents the specific base variation between the two variants and hypothesizes on the simultaneous co-existence of these two variants for this gene.

  3. Characterization of the Two Intra-Individual Sequence Variants in the 18S rRNA Gene in the Plant Parasitic Nematode, Rotylenchulus reniformis

    PubMed Central

    Nyaku, Seloame T.; Sripathi, Venkateswara R.; Kantety, Ramesh V.; Gu, Yong Q.; Lawrence, Kathy; Sharma, Govind C.

    2013-01-01

    The 18S rRNA gene is fundamental to cellular and organismal protein synthesis and, because of its stable persistence through generations, it is also used in phylogenetic analysis among taxa. Sequence variation in this gene within a single species is rare, but it has been observed in a few metazoan organisms. More frequently, it has been reported in the non-transcribed spacer region. Here, we have identified two sequence variants within the near full coding region of the 18S rRNA gene from a single reniform nematode (RN) Rotylenchulus reniformis, labeled as reniform nematode variant 1 (RN_VAR1) and variant 2 (RN_VAR2). All sequences from three of the four isolates had both RN variants in their sequences; however, isolate 13B had only the RN variant 2 sequence. Specific variable base sites (96 or 5.5%) were found within the 18S rRNA gene that can clearly distinguish the two 18S rDNA variants of RN, in 11 (25.0%) and 33 (75.0%) of the 44 RN clones, for RN_VAR1 and RN_VAR2, respectively. Neighbor-joining trees show that RN_VAR1 is very similar to the previously existing R. reniformis sequence in GenBank, while the RN_VAR2 sequence is more divergent. This is the first report of the identification of two major variants of the 18S rRNA gene in the same single RN; it documents the specific base variation between the two variants and hypothesizes on the simultaneous co-existence of these two variants for this gene. PMID:23593343

  4. Classification of circulation type sequences applied to snow avalanches over the eastern Pyrenees (Andorra and Catalonia)

    NASA Astrophysics Data System (ADS)

    Esteban, Pere; Beck, Christoph; Philipp, Andreas

    2010-05-01

    Using data associated with accidents or damages caused by snow avalanches over the eastern Pyrenees (Andorra and Catalonia), several atmospheric circulation type catalogues have been obtained. For this purpose, different circulation type classification methods based on Principal Component Analysis (T-mode and S-mode using the extreme scores) and on optimization procedures (Improved K-means and SANDRA) were applied. Considering the characteristics of the phenomena studied, not only single-day circulation patterns were taken into account but also sequences of circulation types of varying length. Thus different classifications with different numbers of types and for different sequence lengths were obtained using the different classification methods. Simple between-type variability, within-type variability, and outlier detection procedures have been applied for selecting the best result concerning snow avalanche type classifications. Furthermore, days without occurrence of the hazards were also related to the avalanche centroids using pattern correlations, facilitating the calculation of the anomalies between hazardous and non-hazardous days, and also frequencies of occurrence of hazardous events for each circulation type. Finally, the catalogues considered statistically the best are evaluated using avalanche forecasters' expert knowledge. A consistent explanation of snow avalanche occurrence by means of circulation sequences is obtained, but always considering results from classifications with different sequence lengths. This work has been developed in the framework of the COST Action 733 (Harmonisation and Applications of Weather Type Classifications for European regions).

  5. Comparative Analysis of Four Buckwheat Species Based on Morphology and Complete Chloroplast Genome Sequences.

    PubMed

    Wang, Cheng-Long; Ding, Meng-Qi; Zou, Chen-Yan; Zhu, Xue-Mei; Tang, Yu; Zhou, Mei-Liang; Shao, Ji-Rong

    2017-07-26

    Buckwheat is a nutritious and economically important crop belonging to Polygonaceae, Fagopyrum. To better understand the mutation patterns and evolutionary trends in the chloroplast (cp) genome of buckwheat, and to find a sufficient number of variable regions to explore the phylogenetic relationships of this genus, two complete cp genomes of buckwheat, Fagopyrum dibotrys (F. dibotrys) and Fagopyrum luojishanense (F. luojishanense), were sequenced, and two other Fagopyrum cp genomes were used for comparative analysis. Morphological analysis showed that the main differences among these buckwheat species were height, leaf shape, seeds and flower type. F. luojishanense was easily distinguishable from the cultivated species. Although F. dibotrys and the two cultivated species have some similarities, they differ in habit and component contents. The cp genome of F. dibotrys was 159,320 bp while that of F. luojishanense was 159,265 bp. 48 and 61 SSRs were found in F. dibotrys and F. luojishanense, respectively. Meanwhile, 10 highly variable regions among these buckwheat species were located precisely. The phylogenetic relationships among four Fagopyrum species based on complete cp genomes were shown. The results suggested that F. dibotrys is more closely related to Fagopyrum tataricum. These data provided valuable genetic information for Fagopyrum species identification, taxonomy, phylogenetic study and molecular breeding.

  6. Mitochondrial DNA diversity of the Amerindian populations living in the Andean Piedmont of Bolivia: Chimane, Moseten, Aymara and Quechua.

    PubMed

    Corella, Alfons; Bert, Francesc; Pérez-Pérez, Alejandro; Gené, Manel; Turbón, Daniel

    2007-01-01

    Chimane, Moseten, Aymara and Quechua are Amerindian populations living in the Bolivian Piedmont, a characteristic ecoregion between the eastern slope of the Andean mountains and the Amazonian Llanos de Moxos. In both neighbouring areas, dense and complex societies have developed over the centuries. The Piedmont area is especially interesting from a human peopling perspective since there is no clear evidence regarding the genetic influence and peculiarities of these populations. This land has been used extensively as a territory of economic and cultural exchange between the Andes and Amazonia; however, Chimane and Moseten populations have been sufficiently isolated from their neighbour groups to be recognized as distinct populations. Genetic information suggests that evolutionary processes, such as genetic drift, natural selection and genetic admixture have formed the history of the Piedmont populations. The objective of this study is to characterize the genetic diversity of the Piedmont populations, analysing the sequence variability of the HVR-I control region in the mitochondrial DNA (mtDNA). Haplogroup mtDNA data available from the whole of Central and South America were utilized to determine the relationship of the Piedmont populations with other Amerindian populations. Hair pulls were obtained in situ, and DNA from non-related individuals was extracted using a standard Chelex 100 method. A 401 bp DNA fragment of HVR-I region was amplified using standard procedures. Two independent 401 and 328 bp DNA fragments were sequenced separately for each sample. The sequence analyses included mismatch distribution and mean pairwise differences, median network analyses, AMOVA and principal component analyses. The genetic diversity of DNA sequences was measured and compared with other South Amerindian populations.
The genetic diversity of 401 nucleotide mtDNA sequences, in the hypervariable Control Region, from positions 16 000-16 400, was characterized in a sample of 46 Amerindians living in the Piedmont area in the Beni Department of Bolivia. The results obtained indicate that the genetic diversity in the area is higher than that observed in other American groups living in much larger areas and that, despite the reduced size of the studied area, the human groups analysed show high levels of inter-group variability. In addition, results show that Amerindian populations living in the Piedmont are genetically more closely related to Andean than to Amazonian populations.
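
The mismatch distribution and mean pairwise differences named in this abstract can be illustrated with a minimal Python sketch. It assumes the sequences are already aligned to equal length; the function names and the toy alignment are hypothetical and are not taken from the study.

```python
from collections import Counter
from itertools import combinations


def pairwise_differences(seq_a, seq_b):
    """Count mismatching sites between two aligned, equal-length sequences."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    return sum(1 for a, b in zip(seq_a, seq_b) if a != b)


def mismatch_distribution(sequences):
    """Map each difference count to the number of sequence pairs showing it."""
    return Counter(
        pairwise_differences(a, b) for a, b in combinations(sequences, 2)
    )


def mean_pairwise_differences(sequences):
    """Average number of differing sites over all unordered pairs."""
    dist = mismatch_distribution(sequences)
    n_pairs = sum(dist.values())
    return sum(k * count for k, count in dist.items()) / n_pairs


# Hypothetical toy alignment (assumption), not data from the study.
toy_alignment = ["ACGTACGT", "ACGTACGA", "ACGAACGA", "TCGAACGA"]
print(dict(mismatch_distribution(toy_alignment)))
print(mean_pairwise_differences(toy_alignment))
```

Plotting the difference counts against their frequencies gives the mismatch curve; whether it is unimodal or multimodal is what such analyses usually interpret.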

  7. Morphological and molecular characterization of Seuratascaris numidica (Seurat, 1917) (Ascaridida: Ascarididae).

    PubMed

    Chen, Hui-Xia; Zhang, Kuang; Zhang, Lu-Ping; Li, Liang

    2018-03-26

    Seuratascaris numidica (Seurat, 1917) is a specialized nematode species parasitizing amphibians only. In the present study, the detailed morphology of this poorly known species was studied using light and scanning electron microscopy based on newly collected material from Hoplobatrachus chinensis (Osbeck) (Amphibia: Anura) in China. We found that the relative length of the intestinal caecum in our male specimens (representing 68.4-71.1% of oesophageal length) is slightly longer than the previously reported data (not over 60.0% of oesophageal length). Our SEM observations also revealed the presence of ca. 64-76 small conical denticles on each lip. In addition, Angusticaecum wuyiensis Wang, 1981, collected from Rana schmackeri Boettger (Amphibia: Anura) from Wuyi Mountain in Fujian Province, China, was considered a new synonym of S. numidica. The ITS and cox1 regions of S. numidica were also sequenced for the first time, and no nucleotide variability was detected in either region. The supplementary morphological and molecular data obtained herein (especially the ITS and cox1 sequences) are extremely important and useful for determining the morphological variability, population genetics and phylogenetic position of S. numidica in the future.

  8. First insight into dead wood protistan diversity: a molecular sampling of bright-spored Myxomycetes (Amoebozoa, slime-moulds) in decaying beech logs.

    PubMed

    Clissmann, Fionn; Fiore-Donno, Anna Maria; Hoppe, Björn; Krüger, Dirk; Kahl, Tiemo; Unterseher, Martin; Schnittler, Martin

    2015-06-01

    Decaying wood hosts a large diversity of seldom investigated protists. Environmental sequencing offers novel insights into communities, but has rarely been applied to saproxylic protists. We investigated the diversity of bright-spored wood-inhabiting Myxomycetes by environmental sequencing. Myxomycetes have a complex life cycle culminating in the formation of mainly macroscopic fruiting bodies, highly variable in shape and colour that are often found on decaying logs. Our hypothesis was that diversity of bright-spored Myxomycetes would increase with decay. DNA was extracted from wood chips collected from 17 beech logs of varying decay stages from the Hainich-Dün region in Central Germany. We obtained 260 partial small subunit ribosomal RNA gene sequences of bright-spored Myxomycetes that were assembled into 29 OTUs, of which 65% were less than 98% similar to those in the existing database. The OTU richness revealed by molecular analysis surpassed that of a parallel inventory of fruiting bodies. We tested several environmental variables and identified pH, rather than decay stage, as the main structuring factor of myxomycete distribution. © FEMS 2015. All rights reserved.
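
Grouping sequences into OTUs, as described in this abstract, is often done by similarity-threshold clustering. The Python sketch below shows one simple greedy variant under stated assumptions: sequences are pre-aligned to equal length, and the 98% cutoff is a placeholder, since the abstract does not state which clustering threshold or software the authors used. All names are hypothetical.

```python
def percent_identity(seq_a, seq_b):
    """Percent of matching positions between two aligned, equal-length sequences."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    matches = sum(1 for a, b in zip(seq_a, seq_b) if a == b)
    return 100.0 * matches / len(seq_a)


def greedy_otus(sequences, threshold=98.0):
    """Assign each sequence to the first OTU whose seed it matches at >= threshold.

    The threshold default is an assumption for illustration only.
    """
    otus = []  # each OTU is a list; its first member acts as the seed
    for seq in sequences:
        for otu in otus:
            if percent_identity(seq, otu[0]) >= threshold:
                otu.append(seq)
                break
        else:
            # No existing seed was similar enough, so this sequence seeds a new OTU.
            otus.append([seq])
    return otus


# Hypothetical toy sequences of length 100 (assumption), not data from the study.
seed = "A" * 100
near = "A" * 99 + "C"            # 99% identical to seed
far = "A" * 95 + "C" * 5         # 95% identical to seed
far_near = "A" * 95 + "CCCCG"    # 99% identical to far
print(len(greedy_otus([seed, near, far, far_near])))
```

Greedy seeding depends on input order, which is one reason dedicated clustering tools sort sequences (for example by abundance or length) before clustering.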

  9. CSI 2264: Simultaneous optical and X-ray variability in pre-main sequence stars. I. Time resolved X-ray spectral analysis during optical dips and accretion bursts in stars with disks

    NASA Astrophysics Data System (ADS)

    Guarcello, M. G.; Flaccomio, E.; Micela, G.; Argiroffi, C.; Sciortino, S.; Venuti, L.; Stauffer, J.; Rebull, L.; Cody, A. M.

    2017-06-01

    Context. Pre-main sequence stars are variable sources. The main mechanisms responsible for their variability are variable extinction, unsteady accretion, and rotational modulation of both hot and dark photospheric spots and X-ray-active regions. In stars with disks, this variability is related to the morphology of the inner circumstellar region (≤0.1 AU) and that of the photosphere and corona, all impossible to be spatially resolved with present-day techniques. This has been the main motivation for the Coordinated Synoptic Investigation of NGC 2264, a set of simultaneous observations of NGC 2264 with 15 different telescopes. Aims: In this paper, we focus on the stars with disks. We analyze the X-ray spectral properties extracted during optical bursts and dips in order to unveil the nature of these phenomena. Stars without disks are studied in a companion paper. Methods: We analyze simultaneous CoRoT and Chandra/ACIS-I observations to search for coherent optical and X-ray flux variability in stars with disks. Then, stars are analyzed in two different samples. In stars with variable extinction, we look for a simultaneous increase of optical extinction and X-ray absorption during the optical dips; in stars with accretion bursts, we search for soft X-ray emission and increasing X-ray absorption during the bursts. Results: We find evidence for coherent optical and X-ray flux variability among the stars with variable extinction. In 9 of the 24 stars with optical dips, we observe a simultaneous increase of X-ray absorption and optical extinction. In seven dips, it is possible to calculate the NH/AV ratio in order to infer the composition of the obscuring material. In 5 of the 20 stars with optical accretion bursts, we observe increasing soft X-ray emission during the bursts that we associate to the emission of accreting gas. 
It is not surprising that these properties are not observed in all the stars with dips and bursts, since favorable geometric configurations are required. Conclusions: The observed variable absorption during the dips is mainly due to dust-free material in accretion streams. In stars with accretion bursts, we observe, on average, a larger soft X-ray spectral component not observed in non-accreting stars.

  10. Sequence and Secondary Structure of the Mitochondrial Small-Subunit rRNA V4, V6, and V9 Domains Reveal Highly Species-Specific Variations within the Genus Agrocybe

    PubMed Central

    Gonzalez, Patrice; Labarère, Jacques

    1998-01-01

    A comparative study of variable domains V4, V6, and V9 of the mitochondrial small-subunit (SSU) rRNA was carried out with the genus Agrocybe by PCR amplification of 42 wild isolates belonging to 10 species, Agrocybe aegerita, Agrocybe dura, Agrocybe chaxingu, Agrocybe erebia, Agrocybe firma, Agrocybe praecox, Agrocybe paludosa, Agrocybe pediades, Agrocybe alnetorum, and Agrocybe vervacti. Sequencing of the PCR products showed that the three domains in the isolates belonging to the same species were the same length and had the same sequence, while variations were found among the 10 species. Alignment of the sequences showed that nucleotide motifs encountered in the smallest sequence of each variable domain were also found in the largest sequence, indicating that the sequences evolved by insertion-deletion events. Determination of the secondary structure of each domain revealed that the insertion-deletion events commonly occurred in regions not directly involved in the secondary structure (i.e., the loops). Moreover, conserved sequences ranging from 4 to 25 nucleotides long were found at the beginning and end of each domain and could constitute genus-specific sequences. Comparisons of the V4, V6, and V9 secondary structures resulted in identification of the following four groups: (i) group I, which was characterized by the presence of additional P23-1 and P23-3 helices in the V4 domain and the lack of the P49-1 helix in V9 and included A. aegerita, A. chaxingu, and A. erebia; (ii) group II, which had the P23-3 helix in V4 and the P49-1 helix in V9 and included A. pediades; (iii) group III, which did not have additional helices in V4, had the P49-1 helix in V9 and included A. paludosa, A. firma, A. alnetorum, and A. praecox; and (iv) group IV, which lacked both the V4 additional helices and the P49-1 helix in V9 and included A. vervacti and A. dura. This grouping of species was supported by the structure of a consensus tree based on the variable domain sequences. 
The conservation of the sequences of the V4, V6, and V9 domains of the mitochondrial SSU rRNA within species and the high degree of interspecific variation found in the Agrocybe species studied open the way for these sequences to be used as specific molecular markers of the Basidiomycota. PMID:9797259

  11. Sequence and secondary structure of the mitochondrial small-subunit rRNA V4, V6, and V9 domains reveal highly species-specific variations within the genus Agrocybe.

    PubMed

    Gonzalez, P; Labarère, J

    1998-11-01

    A comparative study of variable domains V4, V6, and V9 of the mitochondrial small-subunit (SSU) rRNA was carried out with the genus Agrocybe by PCR amplification of 42 wild isolates belonging to 10 species, Agrocybe aegerita, Agrocybe dura, Agrocybe chaxingu, Agrocybe erebia, Agrocybe firma, Agrocybe praecox, Agrocybe paludosa, Agrocybe pediades, Agrocybe alnetorum, and Agrocybe vervacti. Sequencing of the PCR products showed that the three domains in the isolates belonging to the same species were the same length and had the same sequence, while variations were found among the 10 species. Alignment of the sequences showed that nucleotide motifs encountered in the smallest sequence of each variable domain were also found in the largest sequence, indicating that the sequences evolved by insertion-deletion events. Determination of the secondary structure of each domain revealed that the insertion-deletion events commonly occurred in regions not directly involved in the secondary structure (i.e., the loops). Moreover, conserved sequences ranging from 4 to 25 nucleotides long were found at the beginning and end of each domain and could constitute genus-specific sequences. Comparisons of the V4, V6, and V9 secondary structures resulted in identification of the following four groups: (i) group I, which was characterized by the presence of additional P23-1 and P23-3 helices in the V4 domain and the lack of the P49-1 helix in V9 and included A. aegerita, A. chaxingu, and A. erebia; (ii) group II, which had the P23-3 helix in V4 and the P49-1 helix in V9 and included A. pediades; (iii) group III, which did not have additional helices in V4, had the P49-1 helix in V9 and included A. paludosa, A. firma, A. alnetorum, and A. praecox; and (iv) group IV, which lacked both the V4 additional helices and the P49-1 helix in V9 and included A. vervacti and A. dura. This grouping of species was supported by the structure of a consensus tree based on the variable domain sequences. 
The conservation of the sequences of the V4, V6, and V9 domains of the mitochondrial SSU rRNA within species and the high degree of interspecific variation found in the Agrocybe species studied open the way for these sequences to be used as specific molecular markers of the Basidiomycota.

  12. Mutation scanning in a single and a stacked genetically modified (GM) event by real-time PCR and high resolution melting (HRM) analysis.

    PubMed

    Ben Ali, Sina-Elisabeth; Madi, Zita Erika; Hochegger, Rupert; Quist, David; Prewein, Bernhard; Haslberger, Alexander G; Brandes, Christian

    2014-10-31

    Genetic mutations must be avoided during the production and use of seeds. In the European Union (EU), Directive 2001/18/EC requires any DNA construct introduced via transformation to be stable. Establishing genetic stability is critical for the approval of genetically modified organisms (GMOs). In this study, genetic stability of two GMOs was examined using high resolution melting (HRM) analysis and real-time polymerase chain reaction (PCR) employing Scorpion primers for amplification. The genetic variability of the transgenic insert and that of the flanking regions in a single oilseed rape variety (GT73) and a stacked maize (MON88017×MON810) was studied. The GT73 and the 5' region of MON810 showed no instabilities in the examined regions. However, two out of 100 analyzed samples carried a heterozygous point mutation in the 3' region of MON810 in the stacked variety. These results were verified by direct sequencing of the amplified PCR products as well as by sequencing of cloned PCR fragments. The occurrence of the mutation suggests that the 5' region is more suitable than the 3' region for the quantification of MON810. The identification of the single nucleotide polymorphism (SNP) in a stacked event is in contrast to the results of earlier studies of the same MON810 region in a single event where no DNA polymorphism was found.

  13. Chicken immunoglobulin gamma-heavy chains: limited VH gene repertoire, combinatorial diversification by D gene segments and evolution of the heavy chain locus.

    PubMed

    Parvari, R; Avivi, A; Lentner, F; Ziv, E; Tel-Or, S; Burstein, Y; Schechter, I

    1988-03-01

    cDNA clones encoding the variable and constant regions of chicken immunoglobulin (Ig) gamma-chains were obtained from spleen cDNA libraries. Southern blots of kidney DNA show that the variable region sequences of eight cDNA clones reveal the same set of bands corresponding to approximately 30 cross-hybridizing VH genes of one subgroup. Since the VH clones were randomly selected, it is likely that the bulk of chicken H-chains are encoded by a single VH subgroup. Nucleotide sequence determinations of two cDNA clones reveal VH, D, JH and the constant region. The VH segments are closely related to each other (83% homology) as expected for VH of the same subgroup. The JHs are 15 residues long and differ by one amino acid. The Ds differ markedly in sequence (20% homology) and size (10 and 20 residues). These findings strongly indicate multiple (at least two) D genes which by a combinatorial joining mechanism diversify the H-chains, a mechanism which is not operative in the chicken L-chain locus. The most notable among the chicken Igs is the so-called 7S IgG because its H-chain differs in many important aspects from any mammalian IgG. The sequence of the C gamma cDNA reported here resolves this issue. The chicken C gamma is 426 residues long with four CH domains (unlike mammalian C gamma which has three CH domains) and it shows 25% homology to the chicken C mu. The chicken C gamma is most related to the mammalian C epsilon in length, the presence of four CH domains and the distribution of cysteines in the CH1 and CH2 domains. We propose that the unique chicken C gamma is the ancestor of the mammalian C epsilon and C gamma subclasses, and discuss the evolution of the H-chain locus from that of chicken with presumably three genes (mu, gamma, alpha) to the mammalian loci with 8-10 H-chain genes.

  14. Rabies in the arctic fox population, Svalbard, Norway.

    PubMed

    Mørk, Torill; Bohlin, Jon; Fuglei, Eva; Åsbakk, Kjetil; Tryland, Morten

    2011-10-01

    Arctic foxes, 620 that were trapped and 22 found dead on Svalbard, Norway (1996-2004), as well as 10 foxes trapped in Nenets, North-West Russia (1999), were tested for rabies virus antigen in brain tissue by standard direct fluorescent antibody test. Rabies antigen was found in two foxes from Svalbard and in three from Russia. Blood samples from 515 of the fox carcasses were screened for rabies antibodies with negative result. Our results, together with a previous screening (1980-1989, n=817) indicate that the prevalence of rabies in Svalbard has remained low or that the virus has not been enzootic in the arctic fox population since the first reported outbreak in 1980. Brain tissues from four arctic foxes (one from Svalbard, three from Russia) in which rabies virus antigen was detected were further analyzed by reverse-transcriptase polymerase chain reaction direct amplicon sequencing and phylogenetic analysis. Sequences were compared to corresponding sequences from rabies virus isolates from other arctic regions. The Svalbard isolate and two of the Russian isolates were identical (310 nucleotides), whereas the third Russian isolate differed in six nucleotide positions. However, when translated into amino acid sequences, none of these substitutions produced changes in the amino acid sequence. These findings suggest that the spread of rabies virus to Svalbard was likely due to migration of arctic foxes over sea ice from Russia to Svalbard. Furthermore, when compared to other Arctic rabies virus isolates, a high degree of homology was found, suggesting a high contact rate between arctic fox populations from different arctic regions. The high degree of homology also indicates that other, and more variable, regions of the genome than this part of the nucleoprotein gene should be used to distinguish Arctic rabies virus isolates for epidemiologic purposes.
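
    The distinction drawn in this abstract, nucleotide substitutions that leave the amino acid sequence unchanged, can be illustrated with a minimal Python sketch. The two sequences below are invented toy data (not the 310-nucleotide rabies amplicons), and the codon table is deliberately partial, covering only the codons used here.

```python
# Toy illustration: count nucleotide differences between two aligned coding
# sequences and check whether any of them alters the encoded amino acids.
# Sequences and the partial codon table are hypothetical examples.

CODON_TABLE = {
    "GCT": "A", "GCC": "A", "GCA": "A", "GCG": "A",  # alanine
    "CTT": "L", "CTC": "L", "CTA": "L", "CTG": "L",  # leucine
    "AAA": "K", "AAG": "K",                          # lysine
    "GAT": "D", "GAC": "D",                          # aspartate
}


def translate(seq):
    """Translate an in-frame nucleotide sequence with the partial table above."""
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


def compare(seq_a, seq_b):
    """Return (nucleotide differences, amino acid differences)."""
    if len(seq_a) != len(seq_b) or len(seq_a) % 3 != 0:
        raise ValueError("sequences must be aligned, equal-length and in frame")
    nt_diffs = sum(a != b for a, b in zip(seq_a, seq_b))
    aa_diffs = sum(a != b for a, b in zip(translate(seq_a), translate(seq_b)))
    return nt_diffs, aa_diffs


reference = "GCTCTTAAAGAT"
variant = "GCCCTGAAGGAC"  # third-position changes only

nt_diffs, aa_diffs = compare(reference, variant)
print(f"{nt_diffs} nucleotide differences, {aa_diffs} amino acid differences")
```

    In this toy case every substitution falls on a third codon position, so both sequences translate to the same peptide; such synonymous changes are why a more variable genome region is needed to tell closely related isolates apart.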

  15. Computational study of β-N-acetylhexosaminidase from Talaromyces flavus, a glycosidase with high substrate flexibility.

    PubMed

    Kulik, Natallia; Slámová, Kristýna; Ettrich, Rüdiger; Křen, Vladimír

    2015-01-28

    β-N-Acetylhexosaminidase (GH20) from the filamentous fungus Talaromyces flavus, previously identified as a prominent enzyme in the biosynthesis of modified glycosides, lacks a high-resolution three-dimensional structure so far. Despite high sequence identity to previously reported Aspergillus oryzae and Penicillium oxalicum β-N-acetylhexosaminidases, this enzyme tolerates substrate modification significantly better. Understanding of key structural features, prediction of effective mutants and potential substrate characteristics prior to their synthesis are of general interest. Computational methods including homology modeling and molecular dynamics simulations were applied to shed light on the structure-activity relationship in the enzyme. Primary sequence analysis revealed some variable regions able to influence differences in substrate affinity of hexosaminidases. Moreover, docking in combination with subsequent molecular dynamics simulations of C-6 modified glycosides enabled us to identify the structural features required for accommodation and processing of these bulky substrates in the active site of hexosaminidase from T. flavus. To assess the reliability of predictions on the basis of the reported model, all results were confronted with available experimental data that demonstrated the principal correctness of the predictions as well as the model. The main variable regions in β-N-acetylhexosaminidases determining differences in modified substrate affinity are located close to the active site entrance and engage two loops. Differences in primary sequence and the spatial arrangement of these loops and their interplay with active site amino acids, reflected by interaction energies and dynamics, account for the different catalytic activity and substrate specificity of the various fungal and bacterial β-N-acetylhexosaminidases.

  16. HLA-E regulatory and coding region variability and haplotypes in a Brazilian population sample.

    PubMed

    Ramalho, Jaqueline; Veiga-Castelli, Luciana C; Donadi, Eduardo A; Mendes-Junior, Celso T; Castelli, Erick C

    2017-11-01

    The HLA-E gene is characterized by low but wide expression in different tissues. HLA-E is considered a conserved gene, being one of the least polymorphic class I HLA genes. The HLA-E molecule interacts with Natural Killer cell receptors and T lymphocyte receptors, and might activate or inhibit immune responses depending on the peptide associated with HLA-E and on the receptors with which HLA-E interacts. Variable sites within the HLA-E regulatory and coding segments may influence the gene function by modifying its expression pattern or encoded molecule, thus influencing its interaction with receptors and the peptide. Here we propose an approach to evaluate the gene structure, haplotype pattern and the complete HLA-E variability, including regulatory (promoter and 3'UTR) and coding segments (with introns), by using massively parallel sequencing. We investigated the variability of 420 samples from a very admixed population such as Brazilians by using this approach. Considering a segment of about 7kb, 63 variable sites were detected, arranged into 75 extended haplotypes. We detected 37 different promoter sequences (but few frequent ones), 27 different coding sequences (15 representing new HLA-E alleles) and 12 haplotypes at the 3'UTR segment, two of them presenting a summed frequency of 90%. Despite the number of coding alleles, they encode mainly two different full-length molecules, known as E*01:01 and E*01:03, which correspond to about 90% of all. In addition, unlike what has been previously observed for other non-classical HLA genes, the relationship among the HLA-E promoter, coding and 3'UTR haplotypes is not straightforward because the same promoter and 3'UTR haplotypes were many times associated with different HLA-E coding haplotypes. These data reinforce the presence of only two main full-length HLA-E molecules encoded by the many HLA-E alleles detected in our population sample. In addition, these data indicate that the distal HLA-E promoter is by far the most variable segment. Further analyses involving the binding of transcription factors and non-coding RNAs, as well as the HLA-E expression in different tissues, are necessary to evaluate whether these variable sites at regulatory segments (or even at the coding sequence) may influence the gene expression profile. Copyright © 2017 Elsevier Ltd. All rights reserved.

  17. The use of reference strand-mediated conformational analysis for the study of cheetah (Acinonyx jubatus) feline leucocyte antigen class II DRB polymorphisms.

    PubMed

    Drake, G J C; Kennedy, L J; Auty, H K; Ryvar, R; Ollier, W E R; Kitchener, A C; Freeman, A R; Radford, A D

    2004-01-01

    There is now considerable evidence to suggest the cheetah (Acinonyx jubatus) has limited genetic diversity. However, the extent of this and its significance to the fitness of the cheetah population, both in the wild and captivity, is the subject of some debate. This reflects the difficulty associated with establishing a direct link between low variability at biologically significant loci and deleterious aspects of phenotype in this, and other, species. Attempts to study one such region, the feline leucocyte antigen (FLA), are hampered by a general reliance on cloning and sequencing which is expensive, labour-intensive, subject to PCR artefact and always likely to underestimate true variability. In this study we have applied reference strand-mediated conformational analysis (RSCA) to determine the FLA-DRB phenotypes of 25 cheetahs. This technique was rapid, repeatable and less prone to polymerase chain reaction (PCR)-induced sequence artefacts associated with cloning. Individual cheetahs were shown to have up to three FLA-DRB genes. A total of five alleles were identified (DRB*ha14-17 and DRB*gd01) distributed among four genotypes. Fifteen cheetahs were DRB*ha14/ha15/ha16/ha17, three were DRB*ha15/ha16/ha17, six were DRB*ha14/ha16/ha17 and one was DRB*ha14/ha15/ha16/ha17/gd01. Sequence analysis of DRB*gd01 suggested it was a recombinant of DRB*ha16 and DRB*ha17. Generation of new alleles is difficult to document, and the clear demonstration of such an event is unusual. This study confirms further the limited genetic variability of the cheetah at a biologically significant region. RSCA will facilitate large-scale studies that will be needed to correlate genetic diversity at such loci with population fitness in the cheetah and other species.

  18. Distinctive mitochondrial genome of Calanoid copepod Calanus sinicus with multiple large non-coding regions and reshuffled gene order: Useful molecular markers for phylogenetic and population studies

    PubMed Central

    2011-01-01

    Background Copepods are highly diverse and abundant, resulting in extensive ecological radiation in marine ecosystems. Calanus sinicus dominates continental shelf waters in the northwest Pacific Ocean and plays an important role in the local ecosystem by linking primary production to higher trophic levels. A lack of effective molecular markers has hindered phylogenetic and population genetic studies concerning copepods. As they are genome-level informative, mitochondrial DNA sequences can be used as markers for population genetic studies and phylogenetic studies. Results The mitochondrial genome of C. sinicus is distinct from other arthropods owing to the concurrence of multiple non-coding regions and a reshuffled gene arrangement. Further particularities in the mitogenome of C. sinicus include low A + T-content, symmetrical nucleotide composition between strands, abbreviated stop codons for several PCGs and extended lengths of the genes atp6 and atp8 relative to other copepods. The monophyletic Copepoda should be placed within the Vericrustacea. The close affinity between Cyclopoida and Poecilostomatoida suggests reassigning the latter as subordinate to the former. Monophyly of Maxillopoda is rejected. Within the alignment of 11 C. sinicus mitogenomes, there are 397 variable sites harbouring three 'hotspot' variable sites and three microsatellite loci. Conclusion The occurrence of the circular subgenomic fragment during laboratory assays suggests that special caution should be taken when sequencing mitogenomes using long PCR. Such a phenomenon may provide additional evidence of mitochondrial DNA recombination, which appears to have been a prerequisite for shaping the present mitochondrial profile of C. sinicus during its evolution. The lack of synapomorphic gene arrangements among copepods has cast doubt on the utility of gene order as a useful molecular marker for deep phylogenetic analysis. However, mitochondrial genomic sequences have been valuable markers for resolving phylogenetic issues concerning copepods. The variable site maps of C. sinicus mitogenomes provide a solid foundation for population genetic studies. PMID:21269523

  19. Genotyping Toxoplasma gondii with the B1 Gene in Naturally Infected Sheep from an Endemic Region in the Pacific Coast of Mexico.

    PubMed

    Martínez-Flores, Williams Arony; Palma-García, José Manuel; Caballero-Ortega, Heriberto; Del Viento-Camacho, Alejandra; López-Escamilla, Eduardo; Martínez-Hernández, Fernando; Vinuesa, Pablo; Correa, Dolores; Maravilla, Pablo

    2017-07-01

    Toxoplasma gondii is a protozoan parasite with a broad ecological valence, which has been detected in a wide range of hosts and landscapes. Although the genus is considered monospecific, in recent years it has been demonstrated to exhibit more genetic variability than previously known. In Mexico, there are few genotyping studies, which suggest that classical, autochthonous, and atypical strains are circulating. The goal of this study was to describe T. gondii genetic diversity in naturally infected sheep from Colima, Mexico. This is a good site to study ecological aspects of this parasite since it is located between the Nearctic and Neotropical ecozones and it includes domestic and wild risks for transmission. We analyzed 305 tissue samples of semicaptive sheep from six coastal and central zones of Colima and border zones of Michoacán. We used an 803 bp amplicon of the B1 gene to genotype T. gondii and seroprevalence was determined by ELISA. Indexes for genetic diversity and genetic differentiation were calculated and compared with reference strains from North America (NA) and South America (SA). Twenty-three tissue samples were positive for the B1 gene by PCR, which were sequenced. Crude prevalence was 24.4%. The genetic analysis showed 16 variable sites along the 803 bp region that grouped all sequences into 13 haplotypes in the phylogenetic tree. Bayesian and haplotype network analysis showed nine new B1-types, of which three were frequent and six had unique alleles. Comparisons among sequence sets revealed that the Mexican population had lower differentiation than SA and an intermediate genetic variability between South America and North America. The B1 gene analysis showed new T. gondii haplotypes in naturally infected sheep; therefore, this marker could be initially used in molecular screening studies to identify potentially virulent genotypes of this parasite using natural host samples directly.
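
    The two quantities reported in this abstract, variable sites along an aligned region and the haplotypes they define, can be sketched in a few lines of Python. The alignment below is invented toy data (five 10-base sequences), not the 803 bp B1 amplicons, and the function names are illustrative only.

```python
# Toy illustration: find variable sites in an alignment and group identical
# sequences into haplotypes. All data here are hypothetical.
from collections import Counter


def variable_sites(alignment):
    """Return 0-based column indices where more than one base is observed."""
    length = len(alignment[0])
    if any(len(seq) != length for seq in alignment):
        raise ValueError("all sequences must have the same aligned length")
    return [i for i in range(length) if len({seq[i] for seq in alignment}) > 1]


def haplotypes(alignment):
    """Count how many times each distinct sequence (haplotype) occurs."""
    return Counter(alignment)


toy_alignment = [
    "ACGTACGTAC",
    "ACGTACGTAC",
    "ACGAACGTAC",
    "ACGAACGTTC",
    "ACGTACGTTC",
]

sites = variable_sites(toy_alignment)
haps = haplotypes(toy_alignment)
print(f"{len(sites)} variable sites at columns {sites}")
print(f"{len(haps)} haplotypes among {len(toy_alignment)} sequences")
```

    Here two variable columns are enough to separate five sequences into four haplotypes, which is the same kind of relationship the abstract reports at larger scale.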

  20. Identification of antigen-specific human monoclonal antibodies using high-throughput sequencing of the antibody repertoire.

    PubMed

    Liu, Ju; Li, Ruihua; Liu, Kun; Li, Liangliang; Zai, Xiaodong; Chi, Xiangyang; Fu, Ling; Xu, Junjie; Chen, Wei

    2016-04-22

    High-throughput sequencing of the antibody repertoire provides a large number of antibody variable region sequences that can be used to generate human monoclonal antibodies. However, current screening methods for identifying antigen-specific antibodies are inefficient. In the present study, we developed an antibody clone screening strategy based on clone dynamics and relative frequency, and used it to identify antigen-specific human monoclonal antibodies. Enzyme-linked immunosorbent assay showed that at least 52% of putative positive immunoglobulin heavy chains composed antigen-specific antibodies. Combining information on dynamics and relative frequency improved identification of positive clones and elimination of negative clones, and increased the credibility of putative positive clones. Therefore, the screening strategy could simplify the subsequent experimental screening and may facilitate the generation of antigen-specific antibodies. Copyright © 2016 Elsevier Inc. All rights reserved.

  1. A genetic variant in the LDLR promoter is responsible for part of the LDL-cholesterol variability in primary hypercholesterolemia

    PubMed Central

    2014-01-01

    Background GWAS have consistently revealed that LDLR locus variability influences LDL-cholesterol in the general population. Severe LDLR mutations are responsible for familial hypercholesterolemia (FH). However, most primary hypercholesterolemias are polygenic diseases. Although cis-regulatory regions might be the cause of LDL-cholesterol variability, an extensive analysis of the LDLR distal promoter has not yet been performed. We hypothesized that genetic variants in this region are responsible for the LDLR association with LDL-cholesterol found in GWAS. Methods Four-hundred seventy-seven unrelated subjects with polygenic hypercholesterolemia (PH) and without causative FH-mutations and 525 normolipemic subjects were selected. A 3103 bp segment from LDLR (-625 to +2468) was sequenced in 125 subjects with PH. All subjects were genotyped for 4 SNPs (rs17242346, rs17242739, rs17248720 and rs17249120) predicted to be potentially involved in transcription regulation by in silico analysis. EMSA and luciferase assays were carried out for the rs17248720 variant. Multivariable linear regression analysis using LDL-cholesterol levels as the dependent variable was done in order to find the variables that were independently associated with LDL-cholesterol. Results The sequencing of the 125 PH subjects did not show variants with minor allele frequency ≥ 10%. The T-allele from g.3131C > T (rs17248720) had frequencies of 9% (PH) and 16.4% (normolipemic), p < 0.00001. Studies of this variant with EMSA and luciferase assays showed a higher affinity for transcription factors and an increase of 2.5 times in LDLR transcriptional activity (T-allele vs C-allele). In multivariate analysis, this polymorphism together with lipoprotein(a) and age explained ≈ 10% of LDL-cholesterol variability. Conclusion Our results suggest that the T-allele at the g.3131C > T SNP is associated with LDL-cholesterol levels, and explains part of the LDL-cholesterol variability. As a plausible cause, the T-allele produces an increase in LDLR transcriptional activity and lower LDL-cholesterol levels. PMID:24708769

  2. Generation and Characterization of HIV-1 Transmitted and Founder Virus Consensus Sequence from Intravenous Drug Users in Xinjiang, China.

    PubMed

    Li, Fan; Ma, Liying; Feng, Yi; Hu, Jing; Ni, Na; Ruan, Yuhua; Shao, Yiming

    2017-06-01

    HIV-1 transmission in intravenous drug users (IDUs) has been characterized by high genetic multiplicity and suggests a greater challenge for HIV-1 infection blocking. We investigated a total of 749 sequences of full-length gp160 gene obtained by single genome sequencing (SGS) from 22 HIV-1 early infected IDUs in Xinjiang province, northwest China, and generated a transmitted and founder virus (T/F virus) consensus sequence (IDU.CON). The T/F virus was classified as subtype CRF07_BC and predicted to be CCR5-tropic virus. The variable region (V1, V2, and V4 loop) of IDU.CON showed length variation compared with the heterosexual T/F virus consensus sequence (HSX.CON) and homosexual T/F virus consensus sequence (MSM.CON). A total of 26 N-linked glycosylation sites were discovered in the IDU.CON sequence, which is fewer than in MSM.CON and HSX.CON. Characterization of T/F virus from IDUs highlights the genetic make-up and complexity of virus near the moment of transmission or in early infection preceding systemic dissemination and is important toward the development of effective HIV-1 preventive methods, including vaccines.

  3. Length-independent structural similarities enrich the antibody CDR canonical class model.

    PubMed

    Nowak, Jaroslaw; Baker, Terry; Georges, Guy; Kelm, Sebastian; Klostermann, Stefan; Shi, Jiye; Sridharan, Sudharsan; Deane, Charlotte M

    2016-01-01

    Complementarity-determining regions (CDRs) are antibody loops that make up the antigen binding site. Here, we show that all CDR types have structurally similar loops of different lengths. Based on these findings, we created length-independent canonical classes for the non-H3 CDRs. Our length variable structural clusters show strong sequence patterns suggesting either that they evolved from the same original structure or result from some form of convergence. We find that our length-independent method not only clusters a larger number of CDRs, but also predicts canonical class from sequence better than the standard length-dependent approach. To demonstrate the usefulness of our findings, we predicted cluster membership of CDR-L3 sequences from 3 next-generation sequencing datasets of the antibody repertoire (over 1,000,000 sequences). Using the length-independent clusters, we can structurally classify an additional 135,000 sequences, which represents a ∼20% improvement over the standard approach. This suggests that our length-independent canonical classes might be a highly prevalent feature of antibody space, and could substantially improve our ability to accurately predict the structure of novel CDRs identified by next-generation sequencing.

  4. Intraspecific Variation and Phylogenetic Relationships Are Revealed by ITS1 Secondary Structure Analysis and Single-Nucleotide Polymorphism in Ganoderma lucidum

    PubMed Central

    Pei, Haisheng; Chen, Zhou; Tan, Xiaoyan; Hu, Jing; Yang, Bin; Sun, Junshe

    2017-01-01

    Ganoderma lucidum is a typical polypore fungus used for traditional Chinese medical purposes. The taxonomic delimitation of Ganoderma lucidum is still debated. In this study, we sequenced seven internal transcribed spacer (ITS) sequences of Ganoderma lucidum strains and annotated the ITS1 and ITS2 regions. Phylogenetic analysis of ITS1 differentiated the strains into three geographic groups. Groups 1–3 originated from Europe, tropical Asia, and eastern Asia, respectively. In contrast, ITS2 could only differentiate the strains into two groups, in which Group 2, originating from tropical Asia, clustered with Groups 1 and 3, originating from Europe and eastern Asia. By determining the secondary structures of the ITS1 sequences, these three groups exhibited similar structures with a conserved central core and differing helices. Compared to Group 2, Groups 1 and 3 of ITS2 sequences shared similar structures, differing in helix 4. Large-scale evaluation of ITS1 and ITS2 both showed that the majority of subgroups in the same group shared similar structures. Further Weblogo analysis of ITS1 sequences revealed two main variable regions located in helix 2 in which C/T or A/G substitutions frequently occurred, and ITS1 exhibited more nucleotide variation than ITS2. ITS1 multi-alignment of seven spawn strains and culture tests indicated that a single-nucleotide polymorphism (SNP) site at position 180 correlated with strain antagonism. The HZ, TK and 203 fusion strains of Ganoderma lucidum had a T at position 180, whereas other strains exhibiting antagonism, including DB, RB, JQ, and YS, had a C. Taken together, compared to the ITS2 region, the ITS1 region could differentiate Ganoderma lucidum into three geographic origins based on phylogenetic analysis and secondary structure prediction. In addition, a SNP in ITS1 could delineate Ganoderma lucidum strains at the intraspecific level. These findings will be implemented to improve species quality control in the Ganoderma industry. PMID:28056060

  5. Intracellular diversity of the V4 and V9 regions of the 18S rRNA in marine protists (radiolarians) assessed by high-throughput sequencing.

    PubMed

    Decelle, Johan; Romac, Sarah; Sasaki, Eriko; Not, Fabrice; Mahé, Frédéric

    2014-01-01

    Metabarcoding is a powerful tool for exploring microbial diversity in the environment, but its accurate interpretation is impeded by diverse technical (e.g. PCR and sequencing errors) and biological biases (e.g. intra-individual polymorphism) that remain poorly understood. To help interpret environmental metabarcoding datasets, we investigated the intracellular diversity of the V4 and V9 regions of the 18S rRNA gene from Acantharia and Nassellaria (radiolarians) using 454 pyrosequencing. Individual cells of radiolarians were isolated, and PCRs were performed with generalist primers to amplify the V4 and V9 regions. Different denoising procedures were employed to filter the pyrosequenced raw amplicons (Acacia, AmpliconNoise, Linkage method). For each of the six isolated cells, an average of 541 V4 and 562 V9 amplicons assigned to radiolarians were obtained, from which one numerically dominant sequence and several minor variants were found. At the 97% identity threshold, a diversity metric commonly used in environmental surveys, up to 5 distinct OTUs were detected in a single cell. However, most amplicons grouped within a single OTU whereas other OTUs contained very few amplicons. Different analytical methods provided evidence that most minor variants forming different OTUs correspond to PCR and sequencing artifacts. Duplicate PCR and sequencing from the same DNA extract of a single cell had only 9 to 16% of unique amplicons in common, and alignment visualization of V4 and V9 amplicons showed that most minor variants contained substitutions in highly-conserved regions. We conclude that intracellular variability of the 18S rRNA in radiolarians is very limited despite its multi-copy nature and the existence of multiple nuclei in these protists. Our study recommends some technical guidelines to conservatively discard artificial amplicons from metabarcoding datasets, and thus properly assess the diversity and richness of protists in the environment.
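
    The 97% identity criterion mentioned in this abstract can be illustrated with a minimal Python sketch of greedy, seed-based OTU assignment. This is a simplification under stated assumptions: sequences are already aligned and of equal length, the input is ordered with the dominant sequence first, and the toy amplicons are invented. It does not reproduce the denoising procedures named in the abstract (Acacia, AmpliconNoise, Linkage method).

```python
# Toy illustration of grouping amplicons into OTUs at a 97% identity threshold.
# Sequences are hypothetical, pre-aligned and equal in length.

def identity(a, b):
    """Fraction of matching positions between two aligned sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must have the same aligned length")
    return sum(x == y for x, y in zip(a, b)) / len(a)


def greedy_otus(seqs, threshold=0.97):
    """Put each sequence in the first OTU whose seed it matches at >= threshold."""
    otus = []  # each OTU is a list whose first element is its seed
    for seq in seqs:
        for otu in otus:
            if identity(seq, otu[0]) >= threshold:
                otu.append(seq)
                break
        else:
            otus.append([seq])  # no seed was close enough: start a new OTU
    return otus


def mutate(seq, positions):
    """Substitute the base at each given position (helper for toy data)."""
    swap = {"A": "C", "C": "G", "G": "T", "T": "A"}
    chars = list(seq)
    for p in positions:
        chars[p] = swap[chars[p]]
    return "".join(chars)


dominant = "ACGT" * 25                                # 100-base dominant sequence
minor_a = mutate(dominant, [10, 50])                  # 98% identical to dominant
minor_b = mutate(dominant, [5, 20, 40, 60, 80])       # 95% identical to dominant

amplicons = [dominant] * 8 + [minor_a, minor_b]
otus = greedy_otus(amplicons)
sizes = [len(otu) for otu in otus]
print(f"{len(otus)} OTUs with sizes {sizes}")
```

    With this toy input, nearly all amplicons fall into one OTU and a single divergent variant forms a second, very small OTU, mirroring the pattern the abstract describes of one dominant OTU plus a few sparsely populated ones.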

  6. Intraspecific Variation and Phylogenetic Relationships Are Revealed by ITS1 Secondary Structure Analysis and Single-Nucleotide Polymorphism in Ganoderma lucidum.

    PubMed

    Zhang, Xiuqing; Xu, Zhangyang; Pei, Haisheng; Chen, Zhou; Tan, Xiaoyan; Hu, Jing; Yang, Bin; Sun, Junshe

    2017-01-01

    Ganoderma lucidum is a typical polypore fungus used for traditional Chinese medical purposes. The taxonomic delimitation of Ganoderma lucidum is still debated. In this study, we sequenced seven internal transcribed spacer (ITS) sequences of Ganoderma lucidum strains and annotated the ITS1 and ITS2 regions. Phylogenetic analysis of ITS1 differentiated the strains into three geographic groups. Groups 1-3 originated from Europe, tropical Asia, and eastern Asia, respectively. In contrast, ITS2 could only differentiate the strains into two groups, in which Group 2, originating from tropical Asia, clustered with Groups 1 and 3, originating from Europe and eastern Asia. By determining the secondary structures of the ITS1 sequences, these three groups exhibited similar structures with a conserved central core and differing helices. Compared to Group 2, Groups 1 and 3 of ITS2 sequences shared similar structures, differing in helix 4. Large-scale evaluation of ITS1 and ITS2 both showed that the majority of subgroups in the same group shared similar structures. Further Weblogo analysis of ITS1 sequences revealed two main variable regions located in helix 2 in which C/T or A/G substitutions frequently occurred, and ITS1 exhibited more nucleotide variation than ITS2. ITS1 multi-alignment of seven spawn strains and culture tests indicated that a single-nucleotide polymorphism (SNP) site at position 180 correlated with strain antagonism. The HZ, TK and 203 fusion strains of Ganoderma lucidum had a T at position 180, whereas other strains exhibiting antagonism, including DB, RB, JQ, and YS, had a C. Taken together, compared to the ITS2 region, the ITS1 region could differentiate Ganoderma lucidum into three geographic origins based on phylogenetic analysis and secondary structure prediction. In addition, a SNP in ITS1 could delineate Ganoderma lucidum strains at the intraspecific level. These findings will be implemented to improve species quality control in the Ganoderma industry.

  7. Intragenomic sequence variation at the ITS1 - ITS2 region and at the 18S and 28S nuclear ribosomal DNA genes of the New Zealand mud snail, Potamopyrgus antipodarum (Hydrobiidae: mollusca)

    USGS Publications Warehouse

    Hoy, Marshal S.; Rodriguez, Rusty J.

    2013-01-01

    Molecular genetic analysis was conducted on two populations of the invasive non-native New Zealand mud snail (Potamopyrgus antipodarum), one from a freshwater ecosystem in Devil's Lake (Oregon, USA) and the other from an ecosystem of higher salinity in the Columbia River estuary (Hammond Harbor, Oregon, USA). To elucidate potential genetic differences between the two populations, three segments of nuclear ribosomal DNA (rDNA), the ITS1-ITS2 regions and the 18S and 28S rDNA genes were cloned and sequenced. Variant sequences within each individual were found in all three rDNA segments. Folding models were utilized for secondary structure analysis and results indicated that there were many sequences which contained structure-altering polymorphisms, which suggests they could be nonfunctional pseudogenes. In addition, analysis of molecular variance (AMOVA) was used for hierarchical analysis of genetic variance to estimate variation within and among populations and within individuals. AMOVA revealed significant variation in the ITS region between the populations and among clones within individuals, while in the 5.8S rDNA significant variation was revealed among individuals within the two populations. High levels of intragenomic variation were found in the ITS regions, which are known to be highly variable in many organisms. More interestingly, intragenomic variation was also found in the 18S and 28S rDNA, which has rarely been observed in animals and is so far unreported in Mollusca. We postulate that in these P. antipodarum populations the effects of concerted evolution are diminished due to the fact that not all of the rDNA genes in their polyploid genome should be essential for sustaining cellular function. This could lead to a lessening of selection pressures, allowing mutations to accumulate in some copies, changing them into variant sequences.                   

  8. Earthquake recurrence and risk assessment in circum-Pacific seismic gaps

    USGS Publications Warehouse

    Thatcher, W.

    1989-01-01

    The development of the concept of seismic gaps, regions of low earthquake activity where large events are expected, has been one of the notable achievements of seismology and plate tectonics. Its application to long-term earthquake hazard assessment continues to be an active field of seismological research. Here I have surveyed well documented case histories of repeated rupture of the same segment of circum-Pacific plate boundary and characterized their general features. I find that variability in fault slip and spatial extent of great earthquakes rupturing the same plate boundary segment is typical rather than exceptional but sequences of major events fill identified seismic gaps with remarkable order. Earthquakes are concentrated late in the seismic cycle and occur with increasing size and magnitude. Furthermore, earthquake rupture starts near zones of concentrated moment release, suggesting that high-slip regions control the timing of recurrent events. The absence of major earthquakes early in the seismic cycle indicates a more complex behaviour for lower-slip regions, which may explain the observed cycle-to-cycle diversity of gap-filling sequences. © 1989 Nature Publishing Group.

  9. Molecular characterization of feline calicivirus variants from multicat household and public animal shelter in Rio de Janeiro, Brazil.

    PubMed

    Pereira, Joylson de Jesus; Baumworcel, Natasha; Fioretti, Júlia Monassa; Domingues, Cinthya Fonseca; Moraes, Laís Fernandes de; Marinho, Robson Dos Santos Souza; Vieira, Maria Clara Rodrigues; Pinto, Ana Maria Viana; de Castro, Tatiana Xavier

    2018-02-28

    The aim of this study was to perform the molecular characterization of conserved and variable regions of the feline calicivirus capsid gene in order to investigate the molecular diversity of variants in the Brazilian cat population. Twenty-six conjunctival samples from cats living in five public short-term animal shelters and three multicat life-long households were analyzed. Fifteen cats had conjunctivitis, three had oral ulceration, eight had respiratory signs (cough, sneeze and nasal discharge) and nine were asymptomatic. Feline caliciviruses were isolated in CRFK cells and characterized by reverse transcription PCR targeting both conserved and variable regions of open reading frame 2. The amplicons obtained were sequenced. A phylogenetic analysis along with most of the prototypes available in the GenBank database and an amino acid analysis were performed. Phylogenetic analysis based on both conserved and variable regions revealed two clusters with aLRT values of 1.00 and 0.98 respectively, and the variants from this study belong to feline calicivirus genogroup I. No association between geographical distribution and/or clinical signs and clustering in the phylogenetic tree was observed. The variants circulating in the public short-term animal shelters demonstrated high variability because the relatively rapid turnover of carrier cats constantly introduced multiple viruses into these locations over time. Copyright © 2018 Sociedade Brasileira de Microbiologia. Published by Elsevier Editora Ltda. All rights reserved.

  10. New Perspectives on the Role of Internal Variability in Regional Climate Change and Climate Model Evaluation

    NASA Astrophysics Data System (ADS)

    Deser, C.

    2017-12-01

    Natural climate variability occurs over a wide range of time and space scales as a result of processes intrinsic to the atmosphere, the ocean, and their coupled interactions. Such internally generated climate fluctuations pose significant challenges for the identification of externally forced climate signals such as those driven by volcanic eruptions or anthropogenic increases in greenhouse gases. This challenge is exacerbated for regional climate responses evaluated from short (< 50 years) data records. The limited duration of the observations also places strong constraints on how well the spatial and temporal characteristics of natural climate variability are known, especially on multi-decadal time scales. The observational constraints, in turn, pose challenges for evaluation of climate models, including their representation of internal variability and assessing the accuracy of their responses to natural and anthropogenic radiative forcings. A promising new approach to climate model assessment is the advent of large (10-100 member) "initial-condition" ensembles of climate change simulations with individual models. Such ensembles allow for accurate determination, and straightforward separation, of externally forced climate signals and internal climate variability on regional scales. The range of climate trajectories in a given model ensemble results from the fact that each simulation represents a particular sequence of internal variability superimposed upon a common forced response. This makes clear that nature's single realization is only one of many that could have unfolded. This perspective leads to a rethinking of approaches to climate model evaluation that incorporate observational uncertainty due to limited sampling of internal variability. Illustrative examples across a range of well-known climate phenomena including ENSO, volcanic eruptions, and anthropogenic climate change will be discussed.
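    The separation described in this abstract, a common forced response plus a member-specific sequence of internal variability, can be illustrated with a minimal sketch. It assumes the ensemble mean is used as the estimate of the forced response, which is a common convention but is not stated in the abstract; the function name `decompose_ensemble` and the toy numbers are hypothetical and are not data from the study.

```python
from statistics import mean


def decompose_ensemble(members):
    """Split each ensemble member into a shared forced part and a residual.

    members: list of equally long time series (one list per simulation).
    Returns (forced, internal), where forced is the ensemble mean at each
    time step (assumed estimate of the forced response) and internal holds
    each member's departure from that mean.
    """
    n_times = len(members[0])
    if any(len(m) != n_times for m in members):
        raise ValueError("all members must cover the same time steps")
    forced = [mean(m[t] for m in members) for t in range(n_times)]
    internal = [[x - f for x, f in zip(m, forced)] for m in members]
    return forced, internal


# Toy three-member ensemble (invented values, for illustration only).
toy = [
    [0.1, 0.4, 0.9],
    [0.3, 0.2, 1.1],
    [0.2, 0.6, 1.0],
]
forced, internal = decompose_ensemble(toy)
print("forced estimate:", forced)
print("member 0 internal part:", internal[0])
```

    With only a few members the ensemble mean still contains residual internal variability, which is one reason the abstract emphasizes large (10-100 member) ensembles.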

  11. The Clusters AgeS Experiment (CASE). Variable Stars in the Field of the Globular Cluster NGC 6362

    NASA Astrophysics Data System (ADS)

    Kaluzny, J.; Thompson, I. B.; Rozyczka, M.; Pych, W.; Narloch, W.

    2014-12-01

    The field of the globular cluster NGC 6362 was monitored between 1995 and 2009 in a search for variable stars. BV light curves were obtained for 69 periodic variable stars including 34 known RR Lyr stars, 10 known objects of other types and 25 newly detected variable stars. Among the latter we identified 18 proper-motion members of the cluster: seven detached eclipsing binaries (DEBs), six SX Phe stars, two W UMa binaries, two spotted red giants, and a very interesting eclipsing binary composed of two red giants - the first example of such a system found in a globular cluster. Five of the DEBs are located at the turnoff region, and the remaining two are redward of the lower main sequence. Eighty-four objects from the central 9×9 arcmin² of the cluster were found in the region of cluster blue stragglers. Of these 70 are proper motion (PM) members of NGC 6362 (including all SX Phe and two W UMa stars), and five are field stars. The remaining nine objects lacking PM information are located at the very core of the cluster, and as such they are likely genuine blue stragglers.

  12. Genetic variability and evolutionary dynamics of viruses of the family Closteroviridae

    PubMed Central

    Rubio, Luis; Guerri, José; Moreno, Pedro

    2013-01-01

    RNA viruses have a great potential for genetic variation, rapid evolution and adaptation. Characterization of the genetic variation of viral populations provides relevant information on the processes involved in virus evolution and epidemiology and it is crucial for designing reliable diagnostic tools and developing efficient and durable disease control strategies. Here we performed an updated analysis of sequences available in GenBank and reviewed present knowledge on the genetic variability and evolutionary processes of viruses of the family Closteroviridae. Several factors have shaped the genetic structure and diversity of closteroviruses. (1) A strong negative selection seems to be responsible for the high genetic stability in space and time for some viruses. (2) Long distance migration, probably by human transport of infected propagative plant material, has resulted in genetically similar virus isolates being found in distant geographical regions. (3) Recombination between divergent sequence variants has generated new genotypes and plays an important role in the evolution of some viruses of the family Closteroviridae. (4) Interaction between virus strains or between different viruses in mixed infections may alter accumulation of certain strains. (5) Host change or virus transmission by insect vectors induced changes in the viral population structure due to positive selection of sequence variants with higher fitness for host-virus or vector-virus interaction (adaptation) or by genetic drift due to random selection of sequence variants during the population bottleneck associated with the transmission process. PMID:23805130

  13. Efficiency of ITS Sequences for DNA Barcoding in Passiflora (Passifloraceae)

    PubMed Central

    Giudicelli, Giovanna Câmara; Mäder, Geraldo; de Freitas, Loreta Brandão

    2015-01-01

    DNA barcoding is a technique for discriminating and identifying species using short, variable, and standardized DNA regions. Here, we tested for the first time the performance of plastid and nuclear regions as DNA barcodes in Passiflora. This genus is largely variable, with more than 900 species of high ecological, commercial, and ornamental importance. We analyzed 1034 accessions of 222 species representing the four subgenera of Passiflora and evaluated the effectiveness of five plastid regions and three nuclear datasets currently employed as DNA barcodes in plants using barcoding gap, similarity-based, and tree-based methods. The plastid regions were able to identify less than 45% of species, whereas the nuclear datasets were efficient for more than 50% using “best match” and “best close match” methods of TaxonDNA software. All subgenera presented higher interspecific pairwise distances and did not fully overlap with the intraspecific distance, and similarity-based methods showed better results than tree-based methods. The nuclear ribosomal internal transcribed spacer 1 (ITS1) region presented a higher discrimination power than the other datasets and also showed other desirable characteristics as a DNA barcode for this genus. Therefore, we suggest that this region should be used as a starting point to identify Passiflora species. PMID:25837628

  14. Investigation of the Short-Time Variability of Tropical Tropospheric Ozone

    NASA Technical Reports Server (NTRS)

    Randriambelo, Tantely; Baray, Jean-Luc; Baldy, Serge; Thompson, Anne M.; Oltmans, Samuel; Keckhut, Philippe

    2003-01-01

    Since 1998, a ground based tropospheric ozone lidar has been running at Reunion Island and has been involved with a daily measurement campaign that was performed in the latter part of the biomass burning season, during November-December 1999. The averaged ozone profile obtained during November-December 1999 agrees well with the averaged ozone profile obtained from ozonesondes launched at Reunion during November-December (1992-2001). Comparing weekly sonde launches (part of the Southern Hemisphere Additional Ozonesondes: SHADOZ program) with the daily ground-based lidar observations shows that some striking features of the day to day variability profiles are not observed in the sonde measurements. Ozone profiles respond to the nature of disturbances which vary from one day to the next. The vertical ozone distribution at Reunion is examined as a function of prevailing atmospheric circulation. Backtrajectories show that most of the enhanced ozone crossed over biomass burning and convectively active regions in Madagascar and the southern African continent. The analyses of the meteorological data show that ozone stratification profiles are in agreement with the movement of the synoptic situations in November-December 1999. Three different sequences of transport are explained using wind fields. The first sequence, from 23 to 25 November, is characterized by northerly transport; in the second sequence, from 26 to 30 November, the air masses are influenced by meridional transport. The third sequence, from 2 to 6 December, is characterized by westerly transport associated with the subtropical jet stream. The large standard deviations of lidar profiles in the middle and upper troposphere are in agreement with the upper wind variabilities which evidence passing ridge and trough disturbances. 
During the transition period between the dry season and the wet season, multiple ozone sources including stratosphere-troposphere exchanges, convection and biomass burning contribute to tropospheric ozone at Reunion Island through sporadic events characterized by a large spatial and temporal variability.

  15. Variation in recombination rate may bias human genetic disease mapping studies.

    PubMed

    Boyle, A Susannah; Noor, Mohamed A F

    2004-11-01

    The availability of the human genome sequence and variability information (as from the International HapMap project) will enhance our ability to map genetic disorders and choose targets for therapeutic intervention. However, several factors, such as regional variation in recombination rate, can bias conclusions from genetic mapping studies. Here, we examine the impact of regional variation in recombination rate across the human genome. Through computer simulations and literature surveys, we conclude that genetic disorders have been mapped to regions of low recombination more often than expected if such diseases were randomly distributed across the genome. This concentration in low recombination regions may be an artifact, and disorders appearing to be caused by a few genes of large effect may be polygenic. Future genetic mapping studies should be conscious of this potential complication by noting the regional recombination rate of regions implicated in diseases.

  16. [Study on ITS sequences of Aconitum vilmorinianum and its medicinal adulterant].

    PubMed

    Zhang, Xiao-nan; Du, Chun-hua; Fu, De-huan; Gao, Li; Zhou, Pei-jun; Wang, Li

    2012-09-01

    To analyze and compare the ITS sequences of Aconitum vilmorinianum and its medicinal adulterant Aconitum austroyunnanense. Total genomic DNA was extracted from sample materials by an improved CTAB method; ITS sequences of samples were amplified using PCR systems, directly sequenced and analyzed using the software DNAStar, ClustalX1.81 and MEGA 4.0. 299 consistent sites, 19 variable sites and 13 informative sites were found in ITS1 sequences, 162 consistent sites, 2 variable sites and 1 informative site were found in 5.8S sequences, 217 consistent sites, 3 variable sites and 1 informative site were found in ITS2 sequences. Base transition and transversion was not found only in 5.8S sequences, 2 sites transition and 1 site transversion were found in ITS1 sequences, only 1 site transversion was found in ITS2 sequences comparing the ITS sequences data matrix. By analyzing the ITS sequences data matrix from 2 populations of Aconitum vilmorinianum and 3 populations of Aconitum austroyunnanense, we found a stable informative site at the 596th base in ITS2 sequences: in all the samples of Aconitum vilmorinianum the base was C, and in all the samples of Aconitum austroyunnanense the base was A. Aconitum vilmorinianum and Aconitum austroyunnanense can be identified by their characters of ITS sequences, and the variable sites in ITS1 sequences are more than in ITS2 sequences.
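    The site categories reported in this abstract (consistent, variable, informative) can be illustrated with a small sketch. It assumes the usual definitions: a variable site shows more than one base across the alignment, and a parsimony-informative site has at least two bases that each occur in at least two sequences. The function name `classify_sites` and the toy alignment are hypothetical; the study itself used DNAStar, ClustalX and MEGA, whose exact handling of gaps and ambiguous bases may differ.

```python
from collections import Counter


def classify_sites(alignment):
    """Count conserved, variable and parsimony-informative columns.

    alignment: list of equal-length aligned sequences (strings).
    Gaps ('-') and 'N' are ignored within a column (an assumption).
    Informative sites are counted as a subset of variable sites.
    """
    if len({len(seq) for seq in alignment}) != 1:
        raise ValueError("all aligned sequences must have the same length")
    conserved = variable = informative = 0
    for column in zip(*alignment):
        counts = Counter(base for base in column if base not in "-N")
        if len(counts) <= 1:
            conserved += 1
            continue
        variable += 1
        # Informative: at least two states, each present in >= 2 sequences.
        if sum(1 for n in counts.values() if n >= 2) >= 2:
            informative += 1
    return conserved, variable, informative


# Toy alignment (invented, for illustration only).
toy = ["ACGTA", "ACGTC", "ACTTC", "ACTTA"]
print(classify_sites(toy))  # (3, 2, 2)
```

    A site such as the one reported at base 596, fixed for C in one species and A in the other, is the kind of column this count would flag as informative when both species are sampled more than once.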

  17. Idiopathic thrombocytopenic purpura in two mothers of children with DiGeorge sequence: A new component manifestation of deletion 22q11?

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Levy, A.; Philip, N.; Michel, G.

    1997-04-14

    The phenotypic spectrum caused by the microdeletion of chromosome 22q11 region is known to be variable. Nearly all patients with DiGeorge sequence (DGS) and approximately 60% of patients with velocardiofacial syndrome exhibit the deletion. Recent papers have reported various congenital defects in patients with 22q11 deletions. Conversely, some patients have minimal clinical expression. Ten to 25% of parents of patients with DGS exhibit the deletion and are nearly asymptomatic. Two female patients carrying a 22q11 microdeletion and presenting with idiopathic thrombocytopenic purpura are reported. Both had children with typical manifestations of DGS. 12 refs., 4 figs., 1 tab.

  18. Forty keratin-associated beta-proteins (beta-keratins) form the hard layers of scales, claws, and adhesive pads in the green anole lizard, Anolis carolinensis.

    PubMed

    Dalla Valle, Luisa; Nardi, Alessia; Bonazza, Giulia; Zucal, Chiara; Zuccal, Chiara; Emera, Deena; Alibardi, Lorenzo

    2010-01-15

    Using bioinformatic methods we have detected the genes of 40 keratin-associated beta-proteins (KAbetaPs) (beta-keratins) from the first available draft genome sequence of a reptile, the lizard Anolis carolinensis (Broad Institute, Boston). All genes are clustered in a single but not yet identified chromosomal locus, and contain a single intron of variable length. 5'-RACE and RT-PCR analyses using RNA from different epidermal regions show tissue-specific expression of different transcripts. These results were confirmed from the analysis of the A. carolinensis EST libraries (Broad Institute). Most deduced proteins are 12-16 kDa with a pI of 7.5-8.5. Two genes encoding putative proteins of 40 and 45 kDa are also present. Despite variability in amino acid sequences, four main subfamilies can be described. The largest subfamily includes proteins high in glycine, a small subfamily contains proteins high in cysteine, a third large subfamily contains proteins high in cysteine and glycine, and the fourth, smallest subfamily comprises proteins low in cysteine and glycine. An inner region of high amino acid identity is the most constant characteristic of these proteins and maps to a region with two to three close beta-folds in the proteins. This beta-fold region is responsible for the formation of filaments of the corneous material in all types of scales in this species. Phylogenetic analysis shows that A. carolinensis KAbetaPs are more similar to those of other lepidosaurians (snake, lizard, and gecko lizard) than to those of archosaurians (chick and crocodile) and turtles. (c) 2009 Wiley-Liss, Inc.

  19. MOSAIC: an online database dedicated to the comparative genomics of bacterial strains at the intra-species level.

    PubMed

    Chiapello, Hélène; Gendrault, Annie; Caron, Christophe; Blum, Jérome; Petit, Marie-Agnès; El Karoui, Meriem

    2008-11-27

    The recent availability of complete sequences for numerous closely related bacterial genomes opens up new challenges in comparative genomics. Several methods have been developed to align complete genomes at the nucleotide level but their use and the biological interpretation of results are not straightforward. It is therefore necessary to develop new resources to access, analyze, and visualize genome comparisons. Here we present recent developments on MOSAIC, a generalist comparative bacterial genome database. This database provides the bacteriologist community with easy access to comparisons of complete bacterial genomes at the intra-species level. The strategy we developed for comparison allows us to define two types of regions in bacterial genomes: backbone segments (i.e., regions conserved in all compared strains) and variable segments (i.e., regions that are either specific to or variable in one of the aligned genomes). Definition of these segments at the nucleotide level allows precise comparative and evolutionary analyses of both coding and non-coding regions of bacterial genomes. Such work is easily performed using the MOSAIC Web interface, which allows browsing and graphical visualization of genome comparisons. The MOSAIC database now includes 493 pairwise comparisons and 35 multiple maximal comparisons representing 78 bacterial species. Genome conserved regions (backbones) and variable segments are presented in various formats for further analysis. A graphical interface allows visualization of aligned genomes and functional annotations. The MOSAIC database is available online at http://genome.jouy.inra.fr/mosaic.

  20. Oligo kernels for datamining on biological sequences: a case study on prokaryotic translation initiation sites

    PubMed Central

    Meinicke, Peter; Tech, Maike; Morgenstern, Burkhard; Merkl, Rainer

    2004-01-01

    Background Kernel-based learning algorithms are among the most advanced machine learning methods and have been successfully applied to a variety of sequence classification tasks within the field of bioinformatics. Conventional kernels utilized so far do not provide an easy interpretation of the learnt representations in terms of positional and compositional variability of the underlying biological signals. Results We propose a kernel-based approach to datamining on biological sequences. With our method it is possible to model and analyze positional variability of oligomers of any length in a natural way. On one hand this is achieved by mapping the sequences to an intuitive but high-dimensional feature space, well-suited for interpretation of the learnt models. On the other hand, by means of the kernel trick we can provide a general learning algorithm for that high-dimensional representation because all required statistics can be computed without performing an explicit feature space mapping of the sequences. By introducing a kernel parameter that controls the degree of position-dependency, our feature space representation can be tailored to the characteristics of the biological problem at hand. A regularized learning scheme enables application even to biological problems for which only small sets of example sequences are available. Our approach includes a visualization method for transparent representation of characteristic sequence features. Thereby importance of features can be measured in terms of discriminative strength with respect to classification of the underlying sequences. To demonstrate and validate our concept on a biochemically well-defined case, we analyze E. coli translation initiation sites in order to show that we can find biologically relevant signals. For that case, our results clearly show that the Shine-Dalgarno sequence is the most important signal upstream of a start codon. 
The variability in position and composition we found for that signal is in accordance with previous biological knowledge. We also find evidence for signals downstream of the start codon, previously introduced as transcriptional enhancers. These signals are mainly characterized by occurrences of adenine in a region of about 4 nucleotides next to the start codon. Conclusions We showed that the oligo kernel can provide a valuable tool for the analysis of relevant signals in biological sequences. In the case of translation initiation sites we could clearly deduce the most discriminative motifs and their positional variation from example sequences. Attractive features of our approach are its flexibility with respect to oligomer length and position conservation. By means of these two parameters oligo kernels can easily be adapted to different biological problems. PMID:15511290
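    The idea of a kernel with a tunable degree of position-dependency can be sketched as follows. This is only an illustration in the spirit of the abstract, not the authors' implementation: it assumes that two occurrences of the same oligomer at positions p and q contribute a Gaussian weight exp(-(p - q)^2 / (4 sigma^2)), and it omits any constant normalization factor. The names `oligo_positions`, `oligo_kernel`, `k` and `sigma` are hypothetical.

```python
import math
from collections import defaultdict


def oligo_positions(seq, k):
    """Map each length-k oligomer in seq to the list of its start positions."""
    positions = defaultdict(list)
    for i in range(len(seq) - k + 1):
        positions[seq[i:i + k]].append(i)
    return positions


def oligo_kernel(s, t, k=3, sigma=1.0):
    """Position-weighted count of shared oligomers between s and t.

    Small sigma: only oligomers at (almost) identical positions count.
    Large sigma: positions matter little, approaching a plain k-mer
    co-occurrence count.
    """
    pos_s = oligo_positions(s, k)
    pos_t = oligo_positions(t, k)
    total = 0.0
    for oligo, starts in pos_s.items():
        for p in starts:
            for q in pos_t.get(oligo, ()):
                total += math.exp(-((p - q) ** 2) / (4.0 * sigma ** 2))
    return total


# Toy sequences containing an AGGAGG-like motif (invented examples).
a = "TTAGGAGGTTATG"
b = "TAGGAGGTTTATG"
for sigma in (0.1, 1.0, 10.0):
    print(sigma, round(oligo_kernel(a, b, k=3, sigma=sigma), 3))
```

    Because the function is a plain similarity score between two sequences, it could in principle be supplied to any kernel-based learner as a precomputed Gram matrix; whether that matches the regularized learning scheme used in the paper is not specified in the abstract.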

  1. Chromosomal Copy Number Variation in Saccharomyces pastorianus Is Evidence for Extensive Genome Dynamics in Industrial Lager Brewing Strains.

    PubMed

    van den Broek, M; Bolat, I; Nijkamp, J F; Ramos, E; Luttik, M A H; Koopman, F; Geertman, J M; de Ridder, D; Pronk, J T; Daran, J-M

    2015-09-01

    Lager brewing strains of Saccharomyces pastorianus are natural interspecific hybrids originating from the spontaneous hybridization of Saccharomyces cerevisiae and Saccharomyces eubayanus. Over the past 500 years, S. pastorianus has been domesticated to become one of the most important industrial microorganisms. Production of lager-type beers requires a set of essential phenotypes, including the ability to ferment maltose and maltotriose at low temperature, the production of flavors and aromas, and the ability to flocculate. Understanding of the molecular basis of complex brewing-related phenotypic traits is a prerequisite for rational strain improvement. While genome sequences have been reported, the variability and dynamics of S. pastorianus genomes have not been investigated in detail. Here, using deep sequencing and chromosome copy number analysis, we showed that S. pastorianus strain CBS1483 exhibited extensive aneuploidy. This was confirmed by quantitative PCR and by flow cytometry. As a direct consequence of this aneuploidy, a massive number of sequence variants was identified, leading to at least 1,800 additional protein variants in S. pastorianus CBS1483. Analysis of eight additional S. pastorianus strains revealed that the previously defined group I strains showed comparable karyotypes, while group II strains showed large interstrain karyotypic variability. Comparison of three strains with nearly identical genome sequences revealed substantial chromosome copy number variation, which may contribute to strain-specific phenotypic traits. The observed variability of lager yeast genomes demonstrates that systematic linking of genotype to phenotype requires a three-dimensional genome analysis encompassing physical chromosomal structures, the copy number of individual chromosomes or chromosomal regions, and the allelic variation of copies of individual genes. Copyright © 2015, van den Broek et al.

  2. Chromosomal Copy Number Variation in Saccharomyces pastorianus Is Evidence for Extensive Genome Dynamics in Industrial Lager Brewing Strains

    PubMed Central

    van den Broek, M.; Bolat, I.; Nijkamp, J. F.; Ramos, E.; Luttik, M. A. H.; Koopman, F.; Geertman, J. M.; de Ridder, D.; Pronk, J. T.

    2015-01-01

    Lager brewing strains of Saccharomyces pastorianus are natural interspecific hybrids originating from the spontaneous hybridization of Saccharomyces cerevisiae and Saccharomyces eubayanus. Over the past 500 years, S. pastorianus has been domesticated to become one of the most important industrial microorganisms. Production of lager-type beers requires a set of essential phenotypes, including the ability to ferment maltose and maltotriose at low temperature, the production of flavors and aromas, and the ability to flocculate. Understanding of the molecular basis of complex brewing-related phenotypic traits is a prerequisite for rational strain improvement. While genome sequences have been reported, the variability and dynamics of S. pastorianus genomes have not been investigated in detail. Here, using deep sequencing and chromosome copy number analysis, we showed that S. pastorianus strain CBS1483 exhibited extensive aneuploidy. This was confirmed by quantitative PCR and by flow cytometry. As a direct consequence of this aneuploidy, a massive number of sequence variants was identified, leading to at least 1,800 additional protein variants in S. pastorianus CBS1483. Analysis of eight additional S. pastorianus strains revealed that the previously defined group I strains showed comparable karyotypes, while group II strains showed large interstrain karyotypic variability. Comparison of three strains with nearly identical genome sequences revealed substantial chromosome copy number variation, which may contribute to strain-specific phenotypic traits. The observed variability of lager yeast genomes demonstrates that systematic linking of genotype to phenotype requires a three-dimensional genome analysis encompassing physical chromosomal structures, the copy number of individual chromosomes or chromosomal regions, and the allelic variation of copies of individual genes. PMID:26150454

  3. Lineage divergence detected in the malaria vector Anopheles marajoara (Diptera: Culicidae) in Amazonian Brazil

    PubMed Central

    2010-01-01

    Background Cryptic species complexes are common among anophelines. Previous phylogenetic analysis based on the complete mtDNA COI gene sequences detected paraphyly in the Neotropical malaria vector Anopheles marajoara. The "Folmer region" detects a single taxon using a 3% divergence threshold. Methods To test the paraphyletic hypothesis and examine the utility of the Folmer region, genealogical trees based on a concatenated (white + 3' COI sequences) dataset and pairwise differentiation of COI fragments were examined. The population structure and demographic history were based on partial COI sequences for 294 individuals from 14 localities in Amazonian Brazil. 109 individuals from 12 localities were sequenced for the nDNA white gene, and 57 individuals from 11 localities were sequenced for the ribosomal DNA (rDNA) internal transcribed spacer 2 (ITS2). Results Distinct A. marajoara lineages were detected by combined genealogical analysis and were also supported among COI haplotypes using a median joining network and AMOVA, with time since divergence during the Pleistocene (<100,000 ya). COI sequences at the 3' end were more variable, demonstrating significant pairwise differentiation (3.82%) compared to the more moderate 2.92% detected by the Folmer region. Lineage 1 was present in all localities, whereas lineage 2 was restricted mainly to the west. Mismatch distributions for both lineages were bimodal, likely due to multiple colonization events and spatial expansion (~798 - 81,045 ya). There appears to be gene flow within, not between lineages, and a partial barrier was detected near Rio Jari in Amapá state, separating western and eastern populations. In contrast, both nDNA data sets (white gene sequences with or without the retention of the 4th intron, and ITS2 sequences and length) detected a single A. marajoara lineage. 
Conclusions Strong support for combined data with significant differentiation detected in the COI and absent in the nDNA suggest that the divergence is recent, and detectable only by the faster evolving mtDNA. A within subgenus threshold of >2% may be more appropriate among sister taxa in cryptic anopheline complexes than the standard 3%. Differences in demographic history and climatic changes may have contributed to mtDNA lineage divergence in A. marajoara. PMID:20929572
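    The threshold comparison discussed in this abstract can be made concrete with a short sketch. It assumes an uncorrected pairwise distance (the percentage of differing sites between two aligned sequences); the study may have used a model-corrected distance instead, so this is illustrative only. The function names `p_distance` and `is_divergent` and the toy sequences are hypothetical; the values 2.92 and 3.82 and the 3% and 2% thresholds are taken from the abstract.

```python
def p_distance(a, b):
    """Percentage of differing sites between two aligned sequences.

    Columns with a gap ('-') or 'N' in either sequence are skipped
    (an assumption about missing-data handling).
    """
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    compared = diffs = 0
    for x, y in zip(a, b):
        if x in "-N" or y in "-N":
            continue
        compared += 1
        if x != y:
            diffs += 1
    if compared == 0:
        raise ValueError("no comparable sites")
    return 100.0 * diffs / compared


def is_divergent(divergence_pct, threshold_pct):
    """True when the divergence exceeds the chosen threshold."""
    return divergence_pct > threshold_pct


# Toy pair: 1 difference in 20 sites.
print(p_distance("ACGTACGTACGTACGTACGT", "ACGTACGTACGTACGTACGA"))

# Divergence values reported in the abstract against both thresholds.
for value in (2.92, 3.82):
    print(value, "exceeds 3%:", is_divergent(value, 3.0),
          "| exceeds 2%:", is_divergent(value, 2.0))
```

    Under these assumptions the 2.92% Folmer-region value falls below the standard 3% threshold but above the proposed 2% threshold, which is the contrast the conclusions draw.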

  4. The first complete chloroplast genome of the Genistoid legume Lupinus luteus: evidence for a novel major lineage-specific rearrangement and new insights regarding plastome evolution in the legume family

    PubMed Central

    Martin, Guillaume E.; Rousseau-Gueutin, Mathieu; Cordonnier, Solenn; Lima, Oscar; Michon-Coudouel, Sophie; Naquin, Delphine; de Carvalho, Julie Ferreira; Aïnouche, Malika; Salmon, Armel; Aïnouche, Abdelkader

    2014-01-01

    Background and Aims To date chloroplast genomes are available only for members of the non-protein amino acid-accumulating clade (NPAAA) Papilionoid lineages in the legume family (i.e. Millettioids, Robinoids and the ‘inverted repeat-lacking clade’, IRLC). It is thus very important to sequence plastomes from other lineages in order to better understand the unusual evolution observed in this model flowering plant family. To this end, the plastome of a lupine species, Lupinus luteus, was sequenced to represent the Genistoid lineage, a noteworthy but poorly studied legume group. Methods The plastome of L. luteus was reconstructed using Roche-454 and Illumina next-generation sequencing. Its structure, repetitive sequences, gene content and sequence divergence were compared with those of other Fabaceae plastomes. PCR screening and sequencing were performed in other allied legumes in order to determine the origin of a large inversion identified in L. luteus. Key Results The first sequenced Genistoid plastome (L. luteus: 155 894 bp) resulted in the discovery of a 36-kb inversion, embedded within the already known 50-kb inversion in the large single-copy (LSC) region of the Papilionoideae. This inversion occurs at the base or soon after the Genistoid emergence, and most probably resulted from a flip–flop recombination between identical 29-bp inverted repeats within two trnS genes. Comparative analyses of the chloroplast gene content of L. luteus vs. Fabaceae and extra-Fabales plastomes revealed the loss of the plastid rpl22 gene, and its functional relocation to the nucleus was verified using lupine transcriptomic data. An investigation into the evolutionary rate of coding and non-coding sequences among legume plastomes resulted in the identification of remarkably variable regions. Conclusions This study resulted in the discovery of a novel, major 36-kb inversion, specific to the Genistoids. 
Chloroplast mutational hotspots were also identified, which contain novel and potentially informative regions for molecular evolutionary studies at various taxonomic levels in the legumes. Taken together, the results provide new insights into the evolutionary landscape of the legume plastome. PMID:24769537

  5. Variability and repertoire size of T-cell receptor V alpha gene segments.

    PubMed

    Becker, D M; Pattern, P; Chien, Y; Yokota, T; Eshhar, Z; Giedlin, M; Gascoigne, N R; Goodnow, C; Wolf, R; Arai, K

    The immune system of higher organisms is composed largely of two distinct cell types, B lymphocytes and T lymphocytes, each of which is independently capable of recognizing an enormous number of distinct entities through their antigen receptors; surface immunoglobulin in the case of the former, and the T-cell receptor (TCR) in the case of the latter. In both cell types, the genes encoding the antigen receptors consist of multiple gene segments which recombine during maturation to produce many possible peptides. One striking difference between B- and T-cell recognition that has not yet been resolved by the structural data is the fact that T cells generally require a major histocompatibility determinant together with an antigen whereas, in most cases, antibodies recognize antigen alone. Recently, we and others have found that a series of TCR V beta gene sequences show conservation of many of the same residues that are conserved between heavy- and light-chain immunoglobulin V regions, and these V beta sequences are predicted to have an immunoglobulin-like secondary structure. To extend these studies, we have isolated and sequenced eight additional alpha-chain complementary cDNA clones and compared them with published sequences. Analyses of these sequences, reported here, indicate that V alpha regions have many of the characteristics of V beta gene segments but differ in that they almost always occur as cross-hybridizing gene families. We conclude that there may be very different selective pressures operating on V alpha and V beta sequences and that the V alpha repertoire may be considerably larger than that of V beta.

  6. RAPD and Internal Transcribed Spacer Sequence Analyses Reveal Zea nicaraguensis as a Section Luxuriantes Species Close to Zea luxurians

    PubMed Central

    Wang, Pei; Lu, Yanli; Zheng, Mingmin; Rong, Tingzhao; Tang, Qilin

    2011-01-01

    The genetic relationship of a newly discovered teosinte from Nicaragua, Zea nicaraguensis with waterlogging tolerance, was determined based on randomly amplified polymorphic DNA (RAPD) markers and the internal transcribed spacer (ITS) sequences of nuclear ribosomal DNA using 14 accessions from Zea species. RAPD analysis showed that a total of 5,303 fragments were produced by 136 random decamer primers, of which 84.86% were polymorphic. RAPD-based UPGMA analysis demonstrated that the genus Zea can be divided into section Luxuriantes including Zea diploperennis, Zea luxurians, Zea perennis and Zea nicaraguensis, and section Zea including Zea mays ssp. mexicana, Zea mays ssp. parviglumis, Zea mays ssp. huehuetenangensis and Zea mays ssp. mays. ITS sequence analysis showed the lengths of the entire ITS region of the 14 taxa in Zea varied from 597 to 605 bp. The average GC content was 67.8%. In addition to the insertion/deletions, 78 variable sites were recorded in the total ITS region with 47 in ITS1, 5 in 5.8S, and 26 in ITS2. Sequences of these taxa were analyzed with neighbor-joining (NJ) and maximum parsimony (MP) methods to construct the phylogenetic trees, selecting Tripsacum dactyloides L. as the outgroup. The phylogenetic relationships of Zea species inferred from the ITS sequences are highly concordant with the RAPD evidence that resolved two major subgenus clades. Both RAPD and ITS sequence analyses indicate that Zea nicaraguensis is more closely related to Zea luxurians than to the other teosintes and cultivated maize, and should be regarded as a section Luxuriantes species. PMID:21525982

  7. Amino acid similarities and divergences in the small surface proteins of genotype C hepatitis B viruses between nucleos(t)ide analogue-naïve and lamivudine-treated patients with chronic hepatitis B.

    PubMed

    Ding, Hai; Liu, Baoming; Zhao, Chengyu; Yang, Jingxian; Yan, Chunhui; Yan, Ling; Zhuang, Hui; Li, Tong

    2014-02-01

    Entire C-genotype small hepatitis B surface (SHBs) sequences were isolated from 139 nucleos(t)ide analogue (NA)-naïve and 74 lamivudine (LMV)-treated chronic hepatitis B (CHB) patients. The conservation and variability of all 226 amino acids (AAs) within the sequences were determined individually, revealing a significantly higher mutant isolate rate and mutation frequency in the LMV-treated cohort than in the NA-naïve one (P=0.009 and 0.0001, respectively). Three absolutely conserved fragments (s16-s19, s176-s181 and s185-s188) and seven moderately conserved regions (a few AA sites acquiring increased variability after LMV-treatment) were identified. The significant mutation rate increase after LMV-treatment occurred primarily in the major hydrophilic region (except the 'a' determinant) and transmembrane domain 3/4, but not in other upstream functional regions of SHBs. With little influence on immune escape-associated mutation frequencies within the 'a' determinant, LMV-monotherapy significantly induced classical LMVr-associated mirror changes sE164D/rtV173L, sI195M/rtM204V and sW196L/S/rtM204I, as well as non-classical ones sG44E/rtS53N, sT47K/A/rtH55R/Q and sW182stop/rtV191I outside the 'a' determinant. Interestingly, another newly-identified truncation mutation, sC69stop/rtS78T, decreased from 7.91% (11/139) in the NA-naïve cohort to 2.70% (2/74) in the LMV-treated one. Altogether, the altered AA conservation and diversity in SHBs sequences after LMV-treatment in genotype-C HBV infection might shed new light on how LMV-therapy affects SHBs variant evolution and antigenicity. Copyright © 2013 Elsevier B.V. All rights reserved.

  8. Regulation of pathogenicity in hop stunt viroid-related group II citrus viroids.

    PubMed

    Reanwarakorn, K; Semancik, J S

    1998-12-01

    Nucleotide sequences were determined for two hop stunt viroid-related Group II citrus viroids characterized as either a cachexia disease non-pathogenic variant (CVd-IIa) or a pathogenic variant (CVd-IIb). Sequence identity between the two variants of 95.6% indicated a conserved genome with the principal region of nucleotide difference clustered in the variable (V) domain. Full-length viroid RT-PCR cDNA products were cloned into plasmid SP72. Viroid cDNA clones as well as derived RNA transcripts were transmissible to citron (Citrus medica L.) and Luffa aegyptiaca Mill. To determine the locus of cachexia pathogenicity as well as symptom expression in Luffa, chimeric viroid cDNA clones were constructed from segments of either the left terminal, pathogenic and conserved (T1-P-C) domains or the conserved, variable and right terminal (C-V-T2) domains of CVd-IIa or CVd-IIb in reciprocal exchanges. Symptoms induced by the various chimeric constructs on the two bioassay hosts reflected the differential response observed with CVd-IIa and -IIb. Constructs with the C-V-T2 domains region from clone-IIa induced severe symptoms on Luffa typical of CVd-IIa, but were non-symptomatic on mandarin as a bioassay host for the cachexia disease. Constructs with the same region (C-V-T2) from the clone-IIb genome induced only mild symptoms on Luffa, but produced a severe reaction on mandarin, as observed for CVd-IIb. Specific site-directed mutations were introduced into the V domain of the CVd-IIa clone to construct viroid cDNA clones with either partial or complete conversions to the CVd-IIb sequence. With the introduction of six site-specific changes into the V domain of the clone-IIa genome, cachexia pathogenicity was acquired as well as a moderation of severe symptoms on Luffa.

  9. Discovery of novel targets for multi-epitope vaccines: Screening of HIV-1 genomes using association rule mining

    PubMed Central

    Paul, Sinu; Piontkivska, Helen

    2009-01-01

    Background: Studies have shown that in the genome of human immunodeficiency virus (HIV-1) regions responsible for interactions with the host's immune system, namely, cytotoxic T-lymphocyte (CTL) epitopes tend to cluster together in relatively conserved regions. On the other hand, "epitope-less" regions or regions with relatively low density of epitopes tend to be more variable. However, very little is known about relationships among epitopes from different genes, in other words, whether particular epitopes from different genes would occur together in the same viral genome. To identify CTL epitopes in different genes that co-occur in HIV genomes, association rule mining was used. Results: Using a set of 189 best-defined HIV-1 CTL/CD8+ epitopes from 9 different protein-coding genes, as described by Frahm, Linde & Brander (2007), we examined the complete genomic sequences of 62 reference HIV sequences (including 13 subtypes and sub-subtypes with approximately 4 representative sequences for each subtype or sub-subtype, and 18 circulating recombinant forms). The results showed that despite inclusion of recombinant sequences that would be expected to break up associations of epitopes in different genes when two different genomes are recombined, there exist particular combinations of epitopes (epitope associations) that occur repeatedly across the world-wide population of HIV-1. For example, Pol epitope LFLDGIDKA is found to be significantly associated with epitopes GHQAAMQML and FLKEKGGL from Gag and Nef, respectively, and this association rule is observed even among circulating recombinant forms. Conclusion: We have identified CTL epitope combinations co-occurring in HIV-1 genomes including different subtypes and recombinant forms. Such co-occurrence has important implications for design of complex vaccines (multi-epitope vaccines) and/or drugs that would target multiple HIV-1 regions at once and, thus, may be expected to overcome challenges associated with viral escape. PMID:19580659
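
    Association rule mining, as named in this abstract, scores a rule "A implies B" by its support (the fraction of genomes containing both items) and its confidence (the fraction of genomes containing A that also contain B). The sketch below applies that idea to pairs of epitopes. The function name, thresholds, placeholder labels E1 to E3 and the presence/absence table are all hypothetical; the abstract does not state which mining algorithm, software or thresholds the authors used.

```python
from itertools import combinations


def pair_rules(genomes: dict[str, set[str]], min_support: float, min_confidence: float):
    """Return pairwise rules (lhs, rhs, support, confidence) above both thresholds."""
    n = len(genomes)
    counts: dict[frozenset, int] = {}
    for epitopes in genomes.values():
        # Count single epitopes and every unordered pair present in this genome.
        for e in epitopes:
            counts[frozenset([e])] = counts.get(frozenset([e]), 0) + 1
        for a, b in combinations(sorted(epitopes), 2):
            key = frozenset([a, b])
            counts[key] = counts.get(key, 0) + 1

    rules = []
    for key, both in counts.items():
        if len(key) != 2:
            continue
        support = both / n
        if support < min_support:
            continue
        a, b = sorted(key)
        # Each pair yields two candidate rules, one per direction.
        for lhs, rhs in ((a, b), (b, a)):
            confidence = both / counts[frozenset([lhs])]
            if confidence >= min_confidence:
                rules.append((lhs, rhs, support, confidence))
    return sorted(rules)


# Hypothetical presence table: genome id -> epitopes found in that genome.
toy_genomes = {
    "g1": {"E1", "E2", "E3"},
    "g2": {"E1", "E2"},
    "g3": {"E1", "E2"},
    "g4": {"E3"},
}
for rule in pair_rules(toy_genomes, min_support=0.5, min_confidence=0.9):
    print(rule)
```

    A full analysis would also consider rules over more than two epitopes, for example with an Apriori-style search, which this pairwise sketch leaves out.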

  10. Microbial Diversity of Acidic Hot Spring (Kawah Hujan B) in Geothermal Field of Kamojang Area, West Java-Indonesia

    PubMed Central

    Aditiawati, Pingkan; Yohandini, Heni; Madayanti, Fida; Akhmaloka

    2009-01-01

    Microbial communities in an acidic hot spring, namely Kawah Hujan B, at Kamojang geothermal field, West Java-Indonesia were examined using culture-dependent and culture-independent strategies. Chemical analysis of the hot spring water showed characteristics of acidic-sulfate geothermal activity, with high sulfate concentrations and low pH values (pH 1.8 to 1.9). The microbial community present in the spring was characterized by 16S rRNA gene analysis combined with denaturing gradient gel electrophoresis (DGGE). The majority of the sequences recovered by the culture-independent method were closely related to the phyla Crenarchaeota and Proteobacteria. However, detailed comparison among the members of Crenarchaeota showed some sequence variation relative to the published data, especially in the hypervariable and variable regions. In addition, the sequences could not be assigned to a specific genus. Meanwhile, the 16S rDNA sequences from culture-dependent samples were mostly close to Firmicutes and gamma Proteobacteria. PMID:19440252

  11. A new species of Drepanocephalus Dietz, 1909 (Digenea: Echinostomatidae) from the double-crested cormorant Phalacrocorax auritus (Lesson) (Aves: Phalacrocoracidae) in North America.

    PubMed

    Kudlai, Olena; Kostadinova, Aneta; Pulis, Eric E; Tkach, Vasyl V

    2015-03-01

    Drepanocephalus auritus n. sp. is described based on specimens from the double-crested cormorant Phalacrocorax auritus (Lesson) in North America. The new species differs from its congeners in its very narrow, elongate body, long uterine field and widely separated testes. Sequences of the nuclear rRNA gene cluster, spanning the 3' end of the nuclear ribosomal 18S rRNA gene, internal transcribed spacer region (ITS1+5.8S gene+ITS2) and partial 28S gene (2,345 bp), were identical in specimens collected from North Dakota, Minnesota and Mississippi, USA. Sequences of the 651 bp long fragment of the mitochondrial cox1 gene exhibited very low intraspecific variability (< 1%). Comparisons of the newly-generated sequences with those available in the GenBank indicate that the sequences from North America published under the name D. spathans Dietz, 1909 in fact represent D. auritus n. sp.

  12. Tracking B-Cell Repertoires and Clonal Histories in Normal and Malignant Lymphocytes.

    PubMed

    Weston-Bell, Nicola J; Cowan, Graeme; Sahota, Surinder S

    2017-01-01

    Methods for tracking B-cell repertoires and clonal history in normal and malignant B-cells based on immunoglobulin variable region (IGV) gene analysis have developed rapidly with the advent of massive parallel next-generation sequencing (mpNGS) protocols. mpNGS permits a depth of analysis of IGV genes not hitherto feasible, and presents challenges of bioinformatics analysis, which can be readily met by current pipelines. This strategy offers a potential resolution of B-cell usage at a depth that may capture fully the natural state, in a given biological setting. Conventional methods based on RT-PCR amplification and Sanger sequencing are also available where mpNGS is not accessible. Each method offers distinct advantages. Conventional methods for IGV gene sequencing are readily adaptable to most laboratories and provide an ease of analysis to capture salient features of B-cell use. This chapter describes two methods in detail for analysis of IGV genes, mpNGS and conventional RT-PCR with Sanger sequencing.

  13. Automated sequence-specific protein NMR assignment using the memetic algorithm MATCH.

    PubMed

    Volk, Jochen; Herrmann, Torsten; Wüthrich, Kurt

    2008-07-01

    MATCH (Memetic Algorithm and Combinatorial Optimization Heuristics) is a new memetic algorithm for automated sequence-specific polypeptide backbone NMR assignment of proteins. MATCH employs local optimization for tracing partial sequence-specific assignments within a global, population-based search environment, where the simultaneous application of local and global optimization heuristics guarantees high efficiency and robustness. MATCH thus makes combined use of the two predominant concepts in use for automated NMR assignment of proteins. Dynamic transition and inherent mutation are new techniques that enable automatic adaptation to variable quality of the experimental input data. The concept of dynamic transition is incorporated in all major building blocks of the algorithm, where it enables switching between local and global optimization heuristics at any time during the assignment process. Inherent mutation restricts the intrinsically required randomness of the evolutionary algorithm to those regions of the conformation space that are compatible with the experimental input data. Using intact and artificially deteriorated APSY-NMR input data of proteins, MATCH performed sequence-specific resonance assignment with high efficiency and robustness.

  14. Microbial diversity of acidic hot spring (kawah hujan B) in geothermal field of kamojang area, west java-indonesia.

    PubMed

    Aditiawati, Pingkan; Yohandini, Heni; Madayanti, Fida; Akhmaloka

    2009-01-01

    Microbial communities in an acidic hot spring, namely Kawah Hujan B, at Kamojang geothermal field, West Java-Indonesia were examined using culture-dependent and culture-independent strategies. Chemical analysis of the hot spring water showed characteristics of acidic-sulfate geothermal activity, with high sulfate concentrations and low pH values (pH 1.8 to 1.9). The microbial community present in the spring was characterized by 16S rRNA gene analysis combined with denaturing gradient gel electrophoresis (DGGE). The majority of the sequences recovered by the culture-independent method were closely related to the phyla Crenarchaeota and Proteobacteria. However, detailed comparison among the members of Crenarchaeota showed some sequence variation relative to the published data, especially in the hypervariable and variable regions. In addition, the sequences could not be assigned to a specific genus. Meanwhile, the 16S rDNA sequences from culture-dependent samples were mostly close to Firmicutes and gamma Proteobacteria.

  15. Probing the phylogenetic relationships of a few newly recorded intertidal zoanthids of Gujarat coast (India) with mtDNA COI sequences.

    PubMed

    Joseph, Sneha; Poriya, Paresh; Kundu, Rahul

    2016-11-01

    The present study reports the phylogenetic relationship of six zoanthid species belonging to three genera, Isaurus, Palythoa, and Zoanthus, identified using systematic computational analysis of mtDNA gene sequences. All six species are first records from the coasts of Kathiawar Peninsula, India. Genus Isaurus is represented by Isaurus tuberculatus, genus Zoanthus is represented by Zoanthus kuroshio and Zoanthus sansibaricus, while genus Palythoa is represented by Palythoa tuberculosa, P. sp. JVK-2006 and Palythoa heliodiscus. Results of the present study revealed that among the various species observed along the coastline, sequence similarity ranged from a minimum of 96% to a maximum of 99%. An interspecific divergence of 1-4% and negligible intraspecific divergence were observed. These results not only highlighted the efficiency of the COI gene region in species identification but also demonstrated the genetic variability of zoanthids along the Saurashtra coastline of the west coast of India.
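
    Percent divergence and percent similarity between two aligned sequences are complementary: an uncorrected divergence (p-distance) of 1-4% corresponds to 96-99% identity. The sketch below computes the uncorrected p-distance for one aligned pair. The function name and toy sequences are hypothetical, and the abstract does not say which distance model the authors applied, so the uncorrected measure is an assumption.

```python
def p_distance(a: str, b: str) -> float:
    """Proportion of differing sites between two aligned sequences.

    Sites where either sequence has a gap or an ambiguous base are ignored.
    """
    if len(a) != len(b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    differences = 0
    for x, y in zip(a.upper(), b.upper()):
        if x not in "ACGT" or y not in "ACGT":
            continue
        compared += 1
        if x != y:
            differences += 1
    if compared == 0:
        raise ValueError("no comparable sites")
    return differences / compared


# Hypothetical toy pair: one substitution in ten comparable sites.
d = p_distance("ACGTACGTAC", "ACGTACGTAT")
print(f"divergence {100 * d:.1f}%, identity {100 * (1 - d):.1f}%")
```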

  16. Multigene-based analyses on evolutionary phylogeny of two controversial ciliate orders: Pleuronematida and Loxocephalida (Protista, Ciliophora, Oligohymenophorea).

    PubMed

    Gao, Feng; Katz, Laura A; Song, Weibo

    2013-07-01

    Relationships among members of the ciliate subclass Scuticociliatia (Ciliophora, Oligohymenophorea) are largely unresolved. Phylogenetic studies of its orders Pleuronematida and Loxocephalida were initially based on small subunit ribosomal RNA gene (SSU-rDNA) analyses of a limited number of taxa. Here we characterized 37 sequences (SSU-rDNA, ITS-5.8S and LSU-rDNA) from 21 taxonomically controversial members of these orders. Phylogenetic trees constructed to assess the inter- and intra-generic relationships of pleuronematids and loxocephalids reveal the following: (1) the order Loxocephalida and its two families Loxocephalidae and Cinetochilidae are not monophyletic when more taxa are added; (2) the core pleuronematids are divided into two fully supported clades, however, the order Pleuronematida is not monophyletic because Cyclidium glaucoma is closer to Thigmotrichida; (3) the family Pleuronematidae and the genus Schizocalyptra are monophyletic, though rDNA sequences of Pleuronema species are highly variable; (4) Pseudoplatynematum and Sathrophilus are closely related to the subclass Astomatia, while Cinetochilum forms a monophyletic group with the subclass Apostomatia; and (5) Hippocomos falls in the order Pleuronematida and is closely related to Eurystomatellidae and Cyclidium plouneouri. Further, in an effort to provide a better resolution of evolutionary relationships, the secondary structures of ITS2 transcripts and the variable region 4 (V4) of the small subunit ribosomal RNA (SSU-rRNA) are predicted, revealing that ITS2 structures are conserved at the order level while V4 region structures are more variable than ITS2 structures. Copyright © 2013 Elsevier Inc. All rights reserved.

  17. Inferring the expression variability of human transposable element-derived exons by linear model analysis of deep RNA sequencing data.

    PubMed

    Zhang, Wensheng; Edwards, Andrea; Fan, Wei; Fang, Zhide; Deininger, Prescott; Zhang, Kun

    2013-08-28

    The exonization of transposable elements (TEs) has proven to be a significant mechanism for the creation of novel exons. Existing knowledge of the retention patterns of TE exons in mRNAs was mainly established by the analysis of Expressed Sequence Tag (EST) data and microarray data. This study seeks to validate and extend previous studies on the expression of TE exons by an integrative statistical analysis of high throughput RNA sequencing data. We collected 26 RNA-seq datasets spanning multiple tissues and cancer types. The exon-level digital expressions (indicating retention rates in mRNAs) were quantified by a double normalized measure, called the rescaled RPKM (Reads Per Kilobase of exon model per Million mapped reads). We analyzed the distribution profiles and the variability (across samples and between tissue/disease groups) of TE exon expressions, and compared them with those of other constitutive or cassette exons. We inferred the effects of four genomic factors, including the location, length, cognate TE family and TE nucleotide proportion (RTE, see Methods section) of a TE exon, on the exons' expression level and expression variability. We also investigated the biological implications of an assembly of highly-expressed TE exons. Our analysis confirmed prior studies from the following four aspects. First, with relatively high expression variability, most TE exons in mRNAs, especially those without exact counterparts in the UCSC RefSeq (Reference Sequence) gene tables, demonstrate low but still detectable expression levels in most tissue samples. Second, the TE exons in coding DNA sequences (CDSs) are less highly expressed than those in 3' (5') untranslated regions (UTRs). Third, the exons derived from chronologically ancient repeat elements, such as MIRs, tend to be highly expressed in comparison with those derived from younger TEs. Fourth, the previously observed negative relationship between the lengths of exons and the inclusion levels in transcripts is also true for exonized TEs. Furthermore, our study resulted in several novel findings. They include: (1) for the TE exons with non-zero expression and as shown in most of the studied biological samples, a high TE nucleotide proportion leads to their lower retention rates in mRNAs; (2) the considered genomic features (i.e. a continuous variable such as the exon length or a category indicator such as 3'UTR) influence the expression level and the expression variability (CV) of TE exons in an inverse manner; (3) not only the exons derived from Alu elements but also the exons from the TEs of other families were preferentially established in zinc finger (ZNF) genes.
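
    The two quantities this abstract relies on can be illustrated with their standard definitions: RPKM normalizes an exon's read count by exon length (in kilobases) and by sequencing depth (in millions of mapped reads), and the coefficient of variation (CV) is the standard deviation divided by the mean. The sketch below shows only these textbook forms. The additional rescaling that gives the authors' "rescaled RPKM" is not described in the abstract and is not reproduced here, and the use of the sample standard deviation for CV is an assumption. All names and numbers are hypothetical.

```python
import statistics


def rpkm(exon_reads: int, exon_length_bp: int, total_mapped_reads: int) -> float:
    """Standard RPKM: reads per kilobase of exon per million mapped reads."""
    if exon_length_bp <= 0 or total_mapped_reads <= 0:
        raise ValueError("exon length and total mapped reads must be positive")
    return exon_reads * 1e9 / (exon_length_bp * total_mapped_reads)


def coefficient_of_variation(values: list[float]) -> float:
    """CV = sample standard deviation / mean (assumes a positive mean)."""
    mean = statistics.mean(values)
    if mean <= 0:
        raise ValueError("CV is only meaningful for a positive mean")
    return statistics.stdev(values) / mean


# Hypothetical numbers: 200 reads on a 1,000 bp exon in a 20-million-read library.
print(rpkm(200, 1000, 20_000_000))
# Hypothetical expression values of one exon across eight samples.
print(round(coefficient_of_variation([2, 4, 4, 4, 5, 5, 7, 9]), 4))
```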

  18. Molecular genetic analysis of the V kappa Ser group associated with two mouse light chain genetic markers. Complementary DNA cloning and southern hybridization analysis

    PubMed Central

    1985-01-01

    Previous studies (21) have shown that two mouse kappa light (L) chain variable (V) region polymorphisms, the IB-peptide and Efla markers, reflect expression of a characteristic group of V kappa regions, called V kappa Ser, by some inbred strains and not others. Expression of V kappa Ser is controlled by a locus on chromosome 6, the chromosome that contains the kappa locus. To further characterize this V kappa group and begin to analyze the basis for its strain-specific expression, full-length complementary DNA (cDNA) copies were produced of L chain mRNA from the M75 myeloma that had been induced in the C.C58 strain of mice, and which produces a V kappa Ser L chain. The C.C58 strain is congenic with BALB/cAn, differing in the region of chromosome 6 that controls expression of the V kappa polymorphisms and the Lyt-2 and Lyt-3 T cell alloantigens. The complete nucleotide sequence of this cloned cDNA was determined and compared with the nucleotide sequences of the most closely related BALB/c myeloma L chains known. Results indicated significant differences throughout the variable region, but particularly toward the 5' portion of the sequence. A probe corresponding to 200 bp of the 5' end of the cloned V kappa Ser cDNA was used in Southern hybridizations of restriction digests of liver DNA from a number of inbred, recombinant, and recombinant inbred strains. Under stringent hybridization conditions, one strongly-hybridizing fragment was observed in Bam HI, Hind III, and Eco RI digests, and based on the size of the fragments, strains could be organized into two groups. The presence of strongly hybridizing Bam HI, Hind III, and Eco RI fragments of 3.2, 2.8, and 2.1 kb, respectively, was found to correlate completely with expression by the strain of the IB-peptide and Efla markers. All nonexpressor strains yielded hybridizing fragments of 7.8, 8.4, and 2.8 kb, respectively. Possible explanations for strain-specific expression of V kappa Ser-associated phenotypic markers are discussed. PMID:3926938

  19. Genomic comparison of multi-drug resistant invasive and colonizing Acinetobacter baumannii isolated from diverse human body sites reveals genomic plasticity.

    PubMed

    Sahl, Jason W; Johnson, J Kristie; Harris, Anthony D; Phillippy, Adam M; Hsiao, William W; Thom, Kerri A; Rasko, David A

    2011-06-04

    Acinetobacter baumannii has recently emerged as a significant global pathogen, with a surprisingly rapid acquisition of antibiotic resistance and spread within hospitals and health care institutions. This study examines the genomic content of three A. baumannii strains isolated from distinct body sites. Isolates from blood, peri-anal, and wound sources were examined in an attempt to identify genetic features that could be correlated to each isolation source. Pulsed-field gel electrophoresis, multi-locus sequence typing and antibiotic resistance profiles demonstrated genotypic and phenotypic variation. Each isolate was sequenced to high-quality draft status, which allowed for comparative genomic analyses with existing A. baumannii genomes. A high resolution, whole genome alignment method detailed the phylogenetic relationships of sequenced A. baumannii and found no correlation between phylogeny and body site of isolation. This method identified genomic regions unique to both those isolates found on the surface of the skin or in wounds, termed colonization isolates, and those identified from body fluids, termed invasive isolates; these regions may play a role in the pathogenesis and spread of this important pathogen. A PCR-based screen of 74 A. baumannii isolates demonstrated that these unique genes are not exclusive to either phenotype or isolation source; however, a conserved genomic region exclusive to all sequenced A. baumannii was identified and verified. The results of the comparative genome analysis and PCR assay show that A. baumannii is a diverse and genomically variable pathogen that appears to have the potential to cause a range of human disease regardless of the isolation source.

  20. Recognition of prokaryotic and eukaryotic promoters using convolutional deep learning neural networks.

    PubMed

    Umarov, Ramzan Kh; Solovyev, Victor V

    2017-01-01

    Accurate computational identification of promoters remains a challenge as these key DNA regulatory regions have variable structures composed of functional motifs that provide gene-specific initiation of transcription. In this paper we utilize Convolutional Neural Networks (CNN) to analyze sequence characteristics of prokaryotic and eukaryotic promoters and build their predictive models. We trained a similar CNN architecture on promoters of five distant organisms: human, mouse, plant (Arabidopsis), and two bacteria (Escherichia coli and Bacillus subtilis). We found that a CNN trained on the sigma70 subclass of Escherichia coli promoters gives an excellent classification of promoter and non-promoter sequences (Sn = 0.90, Sp = 0.96, CC = 0.84). The Bacillus subtilis promoter identification CNN model achieves Sn = 0.91, Sp = 0.95, and CC = 0.86. For human, mouse and Arabidopsis promoters we employed CNNs for identification of two well-known promoter classes (TATA and non-TATA promoters). CNN models nicely recognize these complex functional regions. For human promoters the Sn/Sp/CC accuracy of prediction reached 0.95/0.98/0.90 for TATA and 0.90/0.98/0.89 for non-TATA promoter sequences, respectively. For Arabidopsis we observed Sn/Sp/CC of 0.95/0.97/0.91 (TATA) and 0.94/0.94/0.86 (non-TATA). Thus, the developed CNN models, implemented in the CNNProm program, demonstrated the ability of the deep learning approach to grasp complex promoter sequence characteristics and achieve significantly higher accuracy compared to previously developed promoter prediction programs. We also propose a random substitution procedure to discover positionally conserved promoter functional elements. As the suggested approach does not require knowledge of any specific promoter features, it can be easily extended to identify promoters and other complex functional regions in sequences of many other and especially newly sequenced genomes. The CNNProm program is available to run at web server http://www.softberry.com.
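
    The three scores reported in this abstract can be computed from a confusion matrix. The sketch below assumes the usual definitions, which the abstract itself does not spell out: Sn as sensitivity TP/(TP+FN), Sp as specificity TN/(TN+FP), and CC as the Matthews correlation coefficient. The function name and the counts are hypothetical and are not the paper's data. Because CC depends on the ratio of promoter to non-promoter test sequences, these toy counts are not expected to reproduce the published CC values.

```python
import math


def sn_sp_cc(tp: int, tn: int, fp: int, fn: int) -> tuple[float, float, float]:
    """Return (sensitivity, specificity, Matthews correlation coefficient)."""
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("need at least one positive and one negative example")
    sn = tp / (tp + fn)
    sp = tn / (tn + fp)
    denominator = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    # By convention the coefficient is set to 0 when the denominator vanishes.
    cc = (tp * tn - fp * fn) / denominator if denominator else 0.0
    return sn, sp, cc


# Hypothetical balanced test set: 100 promoters and 100 non-promoters.
sn, sp, cc = sn_sp_cc(tp=90, tn=96, fp=4, fn=10)
print(f"Sn={sn:.2f} Sp={sp:.2f} CC={cc:.4f}")
```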

  1. Sequencing of emerging canine distemper virus strain reveals new distinct genetic lineage in the United States associated with disease in wildlife and domestic canine populations.

    PubMed

    Riley, Matthew C; Wilkes, Rebecca P

    2015-12-18

    Recent outbreaks of canine distemper have prompted examination of strains from clinical samples submitted to the University of Tennessee College of Veterinary Medicine (UTCVM) Clinical Virology Lab. We previously described a new strain of CDV that significantly diverged from all genotypes reported to date including America 2, the genotype proposed to be the main lineage currently circulating in the US. The aim of this study was to determine when this new strain appeared and how widespread it is in animal populations, given that it has also been detected in fully vaccinated adult dogs. Additionally, we sequenced complete viral genomes to characterize the strain and determine if variation is confined to known variable regions of the genome or if the changes are also present in more conserved regions. Archived clinical samples were genotyped using real-time RT-PCR amplification and sequencing. The genomes of two unrelated viruses from a dog and fox each from a different state were sequenced and aligned with previously published genomes. Phylogenetic analysis was performed using coding, non-coding and genome-length sequences. Virus neutralization assays were used to evaluate potential antigenic differences between this strain and a vaccine strain and mixed ANOVA test was used to compare the titers. Genotyping revealed this strain first appeared in 2011 and was detected in dogs from multiple states in the Southeast region of the United States. It was the main strain detected among the clinical samples that were typed from 2011-2013, including wildlife submissions. Genome sequencing demonstrated that it is highly conserved within a new lineage and preliminary serologic testing showed significant differences in neutralizing antibody titers between this strain and the strain commonly used in vaccines. This new strain represents an emerging CDV in domestic dogs in the US, may be associated with a stable reservoir in the wildlife population, and could facilitate vaccine escape.

  2. Partial sequencing analysis of the NS5B region confirmed the predominance of hepatitis C virus genotype 1 infection in Jeddah, Saudi Arabia.

    PubMed

    El Hadad, Sahar; Al-Hamdan, Hesa; Linjawi, Sabah

    2017-01-01

    Chronic hepatitis C virus (HCV) infection and its progression are major health problems that many countries including Saudi Arabia are facing. Determination of HCV genotypes and subgenotypes is critical for epidemiological and clinical analysis and aids in the determination of the ideal treatment strategy that needs to be followed and the expected therapy response. Although HCV infection has been identified as the second most predominant type of hepatitis in Saudi Arabia, little is known about the molecular epidemiology and genetic variability of HCV circulating in the Jeddah province of Saudi Arabia. The aim of this study was to determine the dominance of various HCV genotypes and subgenotypes circulating in Jeddah using partial sequencing of the NS5B region. To the best of our knowledge, this is the first study of its kind in Saudi Arabia. To characterize HCV genotypes and subgenotypes, serum samples from 56 patients with chronic HCV infection were collected and subjected to partial NS5B gene amplification and sequence analysis. Phylogenetic analysis of the NS5B partial sequences revealed that HCV/1 was the predominant genotype (73%), followed by HCV/4 (24.49%) and HCV/3 (2.04%). Moreover, pairwise analysis also confirmed these results based on the average specific nucleotide distance identity: ±0.112, ±0.112, and ±0.179 for HCV/1, HCV/4, and HCV/3, respectively, without any interference between genotypes. Notably, the phylogenetic tree of the HCV/1 subgenotypes revealed that all the isolates (100%) from the present study belonged to the HCV/1a subgenotype. Our findings also revealed similarities in the nucleotide sequences between HCV circulating in Saudi Arabia and those circulating in countries such as Morocco, Egypt, Canada, India, Pakistan, and France. These results indicated that determination of HCV genotypes and subgenotypes based on partial sequence analysis of the NS5B region is accurate and reliable for HCV subtype determination.

  3. Screening and Identification of a Phage Display Derived Peptide That Specifically Binds to the CD44 Protein Region Encoded by Variable Exons.

    PubMed

    Zhang, Dan; Jia, Huan; Li, Weiming; Hou, Yingchun; Lu, Shaoying; He, Shuixiang

    2016-01-01

    CD44, especially the isoforms with variable exons (CD44v), is a promising biomarker for the detection of cancer. To develop a CD44v-specific probe, we screened a 7-mer phage peptide library against the CD44v3-v10 protein using an improved subtractive method. The consensus sequences with the highest frequency (designated CV-1) emerged after four rounds of panning. The binding affinity and specificity of the CV-1 phage and the synthesized peptide for the region of CD44 encoded by the variable exons were confirmed using enzyme-linked immunosorbent assay and competitive inhibition assays. Furthermore, the binding of the CV-1 probe to gastric cancer cells and tissues was validated using immunofluorescence and immunohistochemistry assays. CV-1 sensitively and specifically bound to CD44v on cancer cells and tissues. Thus, CV-1 has the potential to serve as a promising probe for cancer molecular imaging and target therapy. © 2015 Society for Laboratory Automation and Screening.

  4. Genetic Analysis of West Nile Virus Isolates from an Outbreak in Idaho, United States, 2006–2007

    PubMed Central

    Grinev, Andriyan; Chancey, Caren; Añez, Germán; Ball, Christopher; Winkelman, Valerie; Williamson, Phillip; Foster, Gregory A.; Stramer, Susan L.; Rios, Maria

    2013-01-01

    West Nile virus (WNV) appeared in the U.S. in 1999 and has since become endemic, with yearly summer epidemics causing tens of thousands of cases of serious disease over the past 14 years. Analysis of WNV strains isolated during the 2006–2007 epidemic seasons demonstrates that a new genetic variant had emerged coincidentally with an intense outbreak in Idaho during 2006. The isolates belonging to the new variant carry a 13 nt deletion, termed ID-Δ13, located at the variable region of the 3′UTR, and are genetically related. The analysis of deletions and insertions in the 3′UTR of two major lineages of WNV revealed the presence of conserved repeats and two indel motifs in the variable region of the 3′UTR. One human and two bird isolates from the Idaho 2006–2007 outbreaks were sequenced using Illumina technology and within-host variability was analyzed. Continued monitoring of new genetic variants is important for public health as WNV continues to evolve. PMID:24065039

  5. Complete nucleotide sequence of pig (Sus scrofa) mitochondrial genome and dating evolutionary divergence within Artiodactyla.

    PubMed

    Lin, C S; Sun, Y L; Liu, C Y; Yang, P C; Chang, L C; Cheng, I C; Mao, S J; Huang, M C

    1999-08-05

    The complete nucleotide sequence of the pig (Sus scrofa) mitochondrial genome, containing 16,613 bp, is presented in this report. The genome length is not fixed because the displacement loop (D-loop) contains a variable number of tandem repeats of the motif 5'-CGTGCGTACA. Genes for the 12S and 16S rRNAs, 22 tRNAs, and 13 protein-coding regions are found. The genome carries very few intergenic nucleotides, with several instances of overlap between protein-coding or tRNA genes, except in the D-loop region. To evaluate the possible evolutionary relationships between Artiodactyla and Cetacea, the nucleotide substitutions and amino acid sequences of 13 protein-coding genes were aligned by pairwise comparisons of the pig, cow, and fin whale. By comparing these sequences, we suggest that there is a closer relationship between the pig and cow than between either of these species and the fin whale. In addition, the accumulation of transversions and gaps in pig 12S and 16S rRNA genes was compared with that in other eutherian species, including cow, fin whale, human, horse, and harbor seal. The results also reveal a close phylogenetic relationship between pig and cow, as compared to fin whale and others. Thus, according to the sequence differences of mitochondrial rRNA genes in eutherian species, the evolutionary separation of pig and cow occurred about 53-60 million years ago.
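
    As an illustration of how a variable number of tandem repeats could be tallied in a D-loop sequence, the following minimal sketch counts the longest run of back-to-back copies of a motif. The function name and the toy sequence are hypothetical; only the motif 5'-CGTGCGTACA comes from the abstract, and this is not the authors' method.

```python
def max_tandem_copies(seq, motif):
    """Return the largest number of back-to-back copies of `motif` found in `seq`."""
    seq, motif = seq.upper(), motif.upper()
    m = len(motif)
    if m == 0:
        raise ValueError("motif must not be empty")
    best = 0
    # Try every start position; count how many consecutive copies begin there.
    for i in range(len(seq) - m + 1):
        copies = 0
        j = i
        while seq.startswith(motif, j):
            copies += 1
            j += m
        best = max(best, copies)
    return best


# Toy sequence (hypothetical, not real pig mtDNA): three copies flanked by other bases.
toy = "TT" + "CGTGCGTACA" * 3 + "GG"
print(max_tandem_copies(toy, "CGTGCGTACA"))
```

    Scanning every start position keeps the sketch simple and correct for self-overlapping motifs, at the cost of some redundant comparisons on long sequences.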

  6. Determinants of High Titer in Recombinant Porcine Endogenous Retroviruses

    PubMed Central

    Harrison, Ian; Takeuchi, Yasuhiro; Bartosch, Birke; Stoye, Jonathan P.

    2004-01-01

    Porcine endogenous retroviruses (PERVs) pose a potential stumbling block for therapeutic xenotransplantation, with the greatest threat coming from viruses generated by recombination between members of the PERV subgroup A (PERV-A) and PERV-C families (PERV-A/C recombinants). PERV-A and PERV-B have been shown to infect human cells in culture, albeit with low titers. PERV-C has a more restricted host range and cannot infect human cells. A recombinant PERV-A/C virus (PERV-A14/220) contains the PERV-A sequence between the end of pol and the middle of the SU region in env. The remaining sequence is derived from PERV-C. PERV-A14/220 is approximately 500-fold more infectious than PERV-A. To determine the molecular basis for the increased infectivity of PERV-A14/220, we have made a series of vector constructs. The primary determinant for the enhanced replicative potential of the recombinant virus appeared to be the env gene. Using a series of chimeric env genes, we could identify two determinants of high infectivity; one was an isoleucine to valine substitution at position 140 between variable regions A and B, and the other lies within the proline rich region. Taken together, these results show that the novel juxtaposition of env gene sequences enhanced the infectivity of PERV-A14/220 for human cells, perhaps by stabilization of the envelope glycoprotein or increased receptor binding. PMID:15564495

  7. Revealing glacier flow and surge dynamics from animated satellite image sequences: examples from the Karakoram

    NASA Astrophysics Data System (ADS)

    Paul, F.

    2015-04-01

    Although animated images are very popular on the Internet, they have so far found only limited use for glaciological applications. With long time-series of satellite images becoming increasingly available and glaciers being well recognized for their rapid changes and variable flow dynamics, animated sequences of multiple satellite images reveal glacier dynamics in a time-lapse mode, making the otherwise slow changes of glacier movement visible and understandable for a wide public. For this study animated image sequences were created from freely available image quick-looks of orthorectified Landsat scenes for four regions in the central Karakoram mountain range. The animations play automatically in a web-browser and might help to demonstrate glacier flow dynamics for educational purposes. The animations revealed highly complex patterns of glacier flow and surge dynamics over a 15-year time period (1998-2013). In contrast to other regions, surging glaciers in the Karakoram are often small (around 10 km²), steep, debris free, and advance for several years at comparably low annual rates (a few hundred m a⁻¹). The advance periods of individual glaciers are generally out of phase, indicating a limited climatic control on their dynamics. On the other hand, nearly all other glaciers in the region are either stable or slightly advancing, indicating balanced or even positive mass budgets over the past few years to decades.

  8. Molecular characterization of baculovirus Bombyx mori nucleopolyhedrovirus polyhedron mutants.

    PubMed

    Katsuma, S; Noguchi, Y; Shimada, T; Nagata, M; Kobayashi, M; Maeda, S

    1999-01-01

    Four newly isolated and two previously isolated polyhedron mutants of Bombyx mori nucleopolyhedrovirus (BmNPV) were studied. Two polyhedron deficient mutants, #126 and #136, produced small uncrystallized particles of polyhedrin in the nuclei and cytoplasm of infected cells. Mutant #211 produced a large number of variably sized polyhedra in the nucleus and #220 produced a few large cuboidal polyhedra in the nucleus. Mutant #24 and #128 were previously isolated BmNPV mutants. Mutant #24 could not produce polyhedrin mRNA and polyhedra produced by mutant #128 lacked oral infectivity. Nucleotide sequence analysis indicated that five mutants (#126, #136, #211, #220 and #128) had amino acid substitutions in polyhedrin and mutant #24 had a point mutation only in the promoter region of the polyhedrin gene. Cotransfection experiments showed that the altered phenotypes were due to the mutations found in the polyhedrin gene regions. In mutants #126 and #136, amino acid sequences of the nuclear localization signal of polyhedrin were identical to those of wild-type BmNPV, suggesting that this sequence was necessary but not sufficient for nuclear localization of polyhedrin. Electron microscopic observation revealed that fewer occluded virions were contained in polyhedra of #128 and #220.

  9. Molecular Identification of Malassezia Species in Patients with Malassezia folliculitis in Sfax, Tunisia.

    PubMed

    Cheikhrouhou, F; Guidara, R; Masmoudi, A; Trabelsi, H; Neji, S; Sellami, H; Makni, F; Ayadi, A

    2017-06-01

    Malassezia folliculitis is caused by the invasion of hair follicles by large numbers of Malassezia cells. Several Malassezia studies still rely on culture, morphology and biochemical techniques. The aim of this study was to identify Malassezia species isolated from patients diagnosed with folliculitis at the Parasitology and Mycology Laboratory of Sfax University Hospital, and to explore the genetic diversity of Malassezia by using PCR-RFLP and PCR-sequencing targeting the rDNA region of the Malassezia genome. Specimens were taken from 27 patients with Malassezia folliculitis. For the molecular identification, PCR amplification of the 26S rDNA D1/D2 region was carried out using the Malup and Maldown primers, and three restriction enzymes (BanI, MspI and HaeII) were used for RFLP analysis. The nucleotide sequence of each isolate was compared to those in NCBI GenBank by using the BLASTN algorithm. Three species of Malassezia yeasts were identified among the 31 Malassezia strains isolated: M. globosa (83.9%), M. sympodialis (12.9%) and M. furfur (3.2%). The sequence analysis of M. globosa showed six genotypes. There is high genotypic variability among the M. globosa strains colonizing patients with folliculitis.

  10. Nonparametric Bayesian clustering to detect bipolar methylated genomic loci.

    PubMed

    Wu, Xiaowei; Sun, Ming-An; Zhu, Hongxiao; Xie, Hehuang

    2015-01-16

    With recent developments in sequencing technology, a large number of genome-wide DNA methylation studies have generated massive amounts of bisulfite sequencing data. The analysis of DNA methylation patterns helps researchers understand epigenetic regulatory mechanisms. Highly variable methylation patterns reflect stochastic fluctuations in DNA methylation, whereas well-structured methylation patterns imply deterministic methylation events. Among these methylation patterns, bipolar patterns are important as they may originate from allele-specific methylation (ASM) or cell-specific methylation (CSM). Utilizing nonparametric Bayesian clustering followed by hypothesis testing, we have developed a novel statistical approach to identify bipolar methylated genomic regions in bisulfite sequencing data. Simulation studies demonstrate that the proposed method achieves good performance in terms of specificity and sensitivity. We used the method to analyze data from mouse brain and human blood methylomes. The bipolar methylated segments detected are highly consistent with the differentially methylated regions identified by using purified cell subsets. Bipolar DNA methylation often indicates epigenetic heterogeneity caused by ASM or CSM. With allele-specific events filtered out or appropriately taken into account, our proposed approach sheds light on the identification of cell-specific genes/pathways under strong epigenetic control in a heterogeneous cell population.

  11. Genetic variability of Echinococcus granulosus complex in various geographical populations of Iran inferred by mitochondrial DNA sequences.

    PubMed

    Spotin, Adel; Mahami-Oskouei, Mahmoud; Harandi, Majid Fasihi; Baratchian, Mehdi; Bordbar, Ali; Ahmadpour, Ehsan; Ebrahimi, Sahar

    2017-01-01

    To investigate the genetic variability and population structure of the Echinococcus granulosus complex, 79 isolates were sequenced from different host species (human, dog, camel, goat, sheep and cattle) from various geographical sub-populations of Iran (Northwestern, Northern, and Southeastern). In addition, 36 sequences of the mitochondrial cytochrome c oxidase subunit 1 (cox1) gene from other geographical populations (Western, Southeastern and Central Iran) were retrieved directly from the GenBank database. The confirmed isolates were grouped as G1 genotype (n=92), G6 genotype (n=14), G3 genotype (n=8) and G2 genotype (n=1). Fifty unique haplotypes were identified based on the analyzed cox1 sequences. A parsimonious network of the sequence haplotypes displayed star-like features in the overall population, with IR23 (n=22; 19.1%) as the most common haplotype. According to the analysis of molecular variance (AMOVA) test, the high haplotype diversity of the E. granulosus complex indicated that the total genetic variability lay within populations, while nucleotide diversity was low in all populations. Neutrality indices of cox1 (Tajima's D and Fu's Fs tests) were negative in the Western-Northwestern, Northern and Southeastern populations, indicating significant divergence from neutrality, and positive but not significant in Central isolates. The pairwise fixation index (Fst), used as a measure of gene flow, was generally low for all populations (0.00647-0.15198). Statistically, the Fst values indicate that Echinococcus sensu stricto (genotypes G1-G3) populations are not genetically well differentiated among the geographical regions of Iran. To appraise the hypothetical evolutionary scenario, further study is needed to analyze concatenated mitogenomes, and a panel of single-locus nuclear markers should be considered across wider areas of Iran and neighboring countries. Copyright © 2016 Elsevier B.V. All rights reserved.
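
    The haplotype diversity statistic mentioned in the abstract can be illustrated with a minimal sketch of Nei's estimator, h = n/(n-1) * (1 - sum of squared haplotype frequencies). The function name and the toy haplotype labels are hypothetical and are not data from the study; the abstract does not state which software or estimator the authors used.

```python
from collections import Counter


def haplotype_diversity(haplotypes):
    """Nei's haplotype diversity: h = n/(n-1) * (1 - sum(p_i ** 2))."""
    n = len(haplotypes)
    if n < 2:
        raise ValueError("need at least two sequences")
    counts = Counter(haplotypes)
    # p_i is the sample frequency of haplotype i.
    sum_sq = sum((c / n) ** 2 for c in counts.values())
    return n / (n - 1) * (1.0 - sum_sq)


# Toy labels (hypothetical): two copies of H1, one each of H2 and H3.
sample = ["H1", "H1", "H2", "H3"]
print(round(haplotype_diversity(sample), 4))
```

    A value near 1 means almost every sampled sequence carries a different haplotype; a value of 0 means all sequences share one haplotype.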

  12. Evaluation of composition and individual variability of rumen microbiota in yaks by 16S rRNA high-throughput sequencing technology.

    PubMed

    Guo, Wei; Li, Ying; Wang, Lizhi; Wang, Jiwen; Xu, Qin; Yan, Tianhai; Xue, Bai

    2015-08-01

    The Yak (Bos grunniens) is a unique species of ruminant animals that is important to agriculture of the Tibetan plateau, and has a complex intestinal microbial community. The objective of the present study was to characterize the composition and individual variability of microbiota in the rumen of yaks using 16S rRNA gene high-throughput sequencing technique. Rumen samples used in the present study were obtained from grazing adult male yaks (n = 6) in a commercial farm in Ganzi Autonomous Prefecture of Sichuan Province, China. Universal prokaryote primers were used to target the V4-V5 hypervariable region of 16S rRNA gene. A total of 7200 operational taxonomic units (OTUs) were obtained after sequence filtering and chimera removal. Within these OTUs, 0.56% belonged to Archaea (40 OTUs), 7.19% to unassigned species (518 OTUs), and the remaining OTUs (6642) in all samples were of bacterial origin. When examining the community structure of bacteria, we identified 23 phyla within 159 families after taxonomic summarization. Bacteroidetes and Firmicutes were the predominant phyla accounting for 39.68% (SD = 0.05) and 45.90% (SD = 0.06), respectively. Moreover, 3764 OTUs were identified as shared OTUs (i.e. represented in all yaks) and belonged to 35 genera, exhibiting highly variable abundance across individual samples. Phylogenetic placement of these genera across individual samples was examined. In addition, we evaluated the distance among the 6 rumen samples by adding taxon phylogeny using UniFrac, representing 24.1% of average distance. In summary, the current study reveals a shared rumen microbiome and phylogenetic lineage and presents novel information on composition and individual variability of the bacterial community in the rumen of yaks. Copyright © 2015. Published by Elsevier Ltd.

  13. Flagellin Diversity in Clostridium botulinum Groups I and II: a New Strategy for Strain Identification

    PubMed Central

    Paul, Catherine J.; Twine, Susan M.; Tam, Kevin J.; Mullen, James A.; Kelly, John F.; Austin, John W.; Logan, Susan M.

    2007-01-01

    Strains of Clostridium botulinum are traditionally identified by botulinum neurotoxin type; however, identification of an additional target for typing would improve differentiation. Isolation of flagellar filaments and analysis by sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE) showed that C. botulinum produced multiple flagellin proteins. Nano-liquid chromatography-tandem mass spectrometry (nLC-MS/MS) analysis of in-gel tryptic digests identified peptides in all flagellin bands that matched two homologous tandem flagellin genes identified in the C. botulinum Hall A genome. Designated flaA1 and flaA2, these open reading frames encode the major structural flagellins of C. botulinum. Colony PCR and sequencing of flaA1/A2 variable regions classified 80 environmental and clinical strains into group I or group II and clustered isolates into 12 flagellar types. Flagellar type was distinct from neurotoxin type, and epidemiologically related isolates clustered together. Sequencing a larger PCR product, obtained during amplification of flaA1/A2 from type E strain Bennett identified a second flagellin gene, flaB. LC-MS analysis confirmed that flaB encoded a large type E-specific flagellin protein, and the predicted molecular mass for FlaB matched that observed by SDS-PAGE. In contrast, the molecular mass of FlaA was 2 to 12 kDa larger than the mass predicted by the flaA1/A2 sequence of a given strain, suggesting that FlaA is posttranslationally modified. While identification of FlaB, and the observation by SDS-PAGE of different masses of the FlaA proteins, showed the flagellin proteins of C. botulinum to be diverse, the presence of the flaA1/A2 gene in all strains examined facilitates single locus sequence typing of C. botulinum using the flagellin variable region. PMID:17351097

  14. StralSV: assessment of sequence variability within similar 3D structures and application to polio RNA-dependent RNA polymerase.

    PubMed

    Zemla, Adam T; Lang, Dorothy M; Kostova, Tanya; Andino, Raul; Ecale Zhou, Carol L

    2011-06-02

    Most of the currently used methods for protein function prediction rely on sequence-based comparisons between a query protein and those for which a functional annotation is provided. A serious limitation of sequence similarity-based approaches for identifying residue conservation among proteins is the low confidence in assigning residue-residue correspondences among proteins when the level of sequence identity between the compared proteins is poor. Multiple sequence alignment methods are more satisfactory--still, they cannot provide reliable results at low levels of sequence identity. Our goal in the current work was to develop an algorithm that could help overcome these difficulties by facilitating the identification of structurally (and possibly functionally) relevant residue-residue correspondences between compared protein structures. Here we present StralSV (structure-alignment sequence variability), a new algorithm for detecting closely related structure fragments and quantifying residue frequency from tight local structure alignments. We apply StralSV in a study of the RNA-dependent RNA polymerase of poliovirus, and we demonstrate that the algorithm can be used to determine regions of the protein that are relatively unique, or that share structural similarity with proteins that would be considered distantly related. By quantifying residue frequencies among many residue-residue pairs extracted from local structural alignments, one can infer potential structural or functional importance of specific residues that are determined to be highly conserved or that deviate from a consensus. We further demonstrate that considerable detailed structural and phylogenetic information can be derived from StralSV analyses. StralSV is a new structure-based algorithm for identifying and aligning structure fragments that have similarity to a reference protein. 
    StralSV analysis can be used to quantify residue-residue correspondences and identify residues that may be of particular structural or functional importance, as well as unusual or unexpected residues at a given sequence position. StralSV is provided as a web service at http://proteinmodel.org/AS2TS/STRALSV/.

  15. Cloud-based adaptive exon prediction for DNA analysis

    PubMed Central

    Putluri, Srinivasareddy; Fathima, Shaik Yasmeen

    2018-01-01

    Cloud computing offers significant research and economic benefits to healthcare organisations. Cloud services provide a safe place for storing and managing large amounts of sensitive data. Under the conventional flow of gene information, gene sequence laboratories send out raw and inferred information via the Internet to several sequence libraries. DNA sequence storage costs can be minimised by using a cloud service. In this study, the authors put forward a novel genomic informatics system using Amazon Cloud Services, where genomic sequence information is stored and accessed for processing. Accurate identification of exon regions in a DNA sequence is a key task in bioinformatics, which helps in disease identification and drug design. The three-base periodicity property of exons forms the basis of all exon identification techniques. Adaptive signal processing techniques were found to be promising in comparison with several other methods. Several adaptive exon predictors (AEPs) are developed using the variable normalised least mean square algorithm and its maximum normalised variants to reduce computational complexity. Finally, the performance of the various AEPs is evaluated based on measures such as sensitivity, specificity and precision using various standard genomic datasets taken from the National Center for Biotechnology Information genomic sequence database. PMID:29515813
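
    The three-base periodicity property can be illustrated with a classical, non-adaptive measure: the power of the discrete Fourier transform at frequency 1/3, summed over the four base-indicator sequences. The sketch below is only that baseline measure, not the adaptive exon predictors described in the abstract, and the function name is hypothetical.

```python
import cmath


def period3_power(seq):
    """Sum over A, C, G, T of |DFT at frequency 1/3|^2 of each base's 0/1 indicator sequence."""
    seq = seq.upper()
    total = 0.0
    for base in "ACGT":
        # Only positions holding `base` contribute (indicator value 1).
        coeff = sum(
            cmath.exp(-2j * cmath.pi * n / 3)
            for n, ch in enumerate(seq)
            if ch == base
        )
        total += abs(coeff) ** 2
    return total


# A strictly codon-periodic toy string scores high; a homopolymer scores near zero.
print(period3_power("ATG" * 10) > period3_power("A" * 30))
```

    In practice such a measure is usually evaluated in a sliding window along the sequence, so that peaks mark candidate coding regions; the window length would be a further assumption.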

  16. Movement goals and feedback and feedforward control mechanisms in speech production

    PubMed Central

    Perkell, Joseph S.

    2010-01-01

    Studies of speech motor control are described that support a theoretical framework in which fundamental control variables for phonemic movements are multi-dimensional regions in auditory and somatosensory spaces. Auditory feedback is used to acquire and maintain auditory goals and in the development and function of feedback and feedforward control mechanisms. Several lines of evidence support the idea that speakers with more acute sensory discrimination acquire more distinct goal regions and therefore produce speech sounds with greater contrast. Feedback modification findings indicate that fluently produced sound sequences are encoded as feedforward commands, and feedback control serves to correct mismatches between expected and produced sensory consequences. PMID:22661828

  17. Identification and chromosomal localization of Atm, the mouse homolog of the ataxia-telangiectasia gene.

    PubMed

    Pecker, I; Avraham, K B; Gilbert, D J; Savitsky, K; Rotman, G; Harnik, R; Fukao, T; Schröck, E; Hirotsune, S; Tagle, D A; Collins, F S; Wynshaw-Boris, A; Ried, T; Copeland, N G; Jenkins, N A; Shiloh, Y; Ziv, Y

    1996-07-01

    Atm, the mouse homolog of the human ATM gene defective in ataxia-telangiectasia (A-T), has been identified. The entire coding sequence of the Atm transcript was cloned and found to contain an open reading frame encoding a protein of 3066 amino acids with 84% overall identity and 91% similarity to the human ATM protein. Variable levels of expression of Atm were observed in different tissues. Fluorescence in situ hybridization and linkage analysis located the Atm gene on mouse chromosome 9, band 9C, in a region homologous to the ATM region on human chromosome 11q22-q23.

  18. Examining regional variability in work ethic within Mexico: Individual difference or shared value.

    PubMed

    Arciniega, Luis M; Woehr, David J; Del Rincón, Germán A

    2018-02-19

    Despite the acceptance of work ethic as an important individual difference, little research has examined the extent to which work ethic may reflect shared environmental or socio-economic factors. This research addresses this concern by examining the influence of geographic proximity on the work ethic experienced by 254 employees from Mexico, working in 11 different cities in the Northern, Central and Southern regions of the country. Using a sequence of complementary analyses to assess the main source of variance on seven dimensions of work ethic, our results indicate that work ethic is most appropriately considered at the individual level. © 2018 International Union of Psychological Science.

  19. [Differentiation of geographic biovariants of smallpox virus by PCR].

    PubMed

    Babkin, I V; Babkina, I N

    2010-01-01

    Comparative analysis of the amino acid and nucleotide sequences of ORFs located in extended segments of the terminal variable regions of the variola virus (VARV) genome identified a promising locus for genotyping the virus by geographic origin: ORF O1L of VARV. Primers were designed for PCR amplification of a fragment of this ORF, which makes it possible to distinguish the South America-Western Africa genotype from other VARV strains. Subsequent RFLP analysis reliably differentiated Asian strains from African strains (except Western Africa isolates). This method has been tested using 16 VARV strains from various geographic regions. The developed approach is simple, fast and reliable.

  20. Movement goals and feedback and feedforward control mechanisms in speech production.

    PubMed

    Perkell, Joseph S

    2012-09-01

    Studies of speech motor control are described that support a theoretical framework in which fundamental control variables for phonemic movements are multi-dimensional regions in auditory and somatosensory spaces. Auditory feedback is used to acquire and maintain auditory goals and in the development and function of feedback and feedforward control mechanisms. Several lines of evidence support the idea that speakers with more acute sensory discrimination acquire more distinct goal regions and therefore produce speech sounds with greater contrast. Feedback modification findings indicate that fluently produced sound sequences are encoded as feedforward commands, and feedback control serves to correct mismatches between expected and produced sensory consequences.

  1. Analysis of Post-Fire Vegetation Recovery in the Mediterranean Basin using MODIS Derived Vegetation Indices

    NASA Astrophysics Data System (ADS)

    Hawtree, Daniel; San Miguel, Jesus; Sedano, Fernando; Kempeneers, Pieter

    2010-05-01

    The Mediterranean basin region is highly susceptible to wildfire, with approximately 60,000 individual fires and half a million ha of natural vegetation burnt per year. Of particular concern in this region is the impact of repeated wildfires on the ability of natural lands to return to a pre-fire state, and the possibility of desertification of semi-arid areas. Given these concerns, understanding the temporal patterns of vegetation recovery is important for the management of environmental resources in the region. Vegetation indices derived from remote sensing data are a valuable tool for evaluating these recovery patterns. Previous research on post-fire vegetation recovery conducted in this region has found significant variability in recovery times across different study sites. It is unclear what the primary variables affecting the differences in the rates of recovery are, and whether any geographic patterns of behavior exist across the Mediterranean basin. That research has primarily been conducted using indices derived from Landsat imagery. However, no extensive analysis of vegetation regeneration for large regions has been published, and vegetation recovery has not yet been assessed on the basis of medium-spatial-resolution imagery such as that of MODIS. This study examines the temporal pattern of vegetation recovery in a number of fire sites in the Mediterranean basin, using data derived from MODIS 16-day composite vegetation indices. The intent is to develop a more complete picture of the temporal sequence of vegetation recovery, and to evaluate what additional factors impact variations in the recovery sequence. In addition, this study evaluates the utility of using MODIS-derived vegetation indices for regeneration studies, and compares the findings to earlier studies which rely on Landsat data. Wildfires occurring between the years 2000 and 2004 were considered as potential study sites for this research.
    Using the EFFIS dataset, all wildfires covering an area of at least 1,000 ha were identified. The land cover / land use of these large fire sites was then evaluated using the CORINE land-cover data set, and the sites dominated primarily by natural vegetation were identified. Once these candidate sites were identified, a subset was selected across a range of locations and site characteristics for post-fire recovery analysis. To evaluate the post-fire recovery sequence in these locations, time-series of NDVI, EVI, and LAI were derived using 250 meter resolution MODIS data (MOD13Q). The vegetation index values were then compared to pre-fire values to determine recovery relative to the pre-fire vegetative state. The variability in rates of recovery is then considered with respect to moisture availability, vegetation type, and local site conditions to evaluate whether any patterns of recovery can be determined.
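
    The comparison of post-fire index values with pre-fire values can be sketched as follows, assuming recovery is expressed as the ratio of each post-fire value to a single pre-fire reference. The NDVI formula is standard; the function names, the 0.9 threshold and the sample values are hypothetical illustrations, not the study's procedure or data.

```python
def ndvi(nir, red):
    """Normalized Difference Vegetation Index from near-infrared and red reflectance."""
    if nir + red == 0:
        raise ValueError("nir + red must be non-zero")
    return (nir - red) / (nir + red)


def relative_recovery(post_fire_series, pre_fire_value):
    """Ratio of each post-fire index value to the pre-fire value (1.0 = pre-fire level)."""
    if pre_fire_value == 0:
        raise ValueError("pre-fire value must be non-zero")
    return [v / pre_fire_value for v in post_fire_series]


def first_recovered_index(post_fire_series, pre_fire_value, threshold=0.9):
    """Index of the first composite reaching `threshold` of the pre-fire value, or None.

    The default threshold of 0.9 is an arbitrary assumption for illustration.
    """
    for i, ratio in enumerate(relative_recovery(post_fire_series, pre_fire_value)):
        if ratio >= threshold:
            return i
    return None


# Hypothetical NDVI values for successive composites after a fire.
post = [0.20, 0.40, 0.55, 0.60]
print(first_recovered_index(post, pre_fire_value=0.60))
```

    Using a ratio to the pre-fire state lets sites with different baseline greenness be compared on one scale; a real analysis would also need to account for seasonal cycles in the index.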

  2. Characterization of emergent HIV resistance in treatment-naive subjects enrolled in a vicriviroc phase 2 trial.

    PubMed

    McNicholas, Paul; Wei, Yi; Whitcomb, Jeannette; Greaves, Wayne; Black, Todd A; Tremblay, Cecile L; Strizki, Julie M

    2010-05-15

    Vicriviroc is a C-C motif chemokine receptor 5 (CCR5) antagonist that is in clinical development for the treatment of human immunodeficiency virus type 1 (HIV-1) infection. This study explored the molecular basis for the development of phenotypically resistant virus. HIV-1 RNA from treatment-naive subjects who experienced virological failure in a phase 2 dose-finding trial was evaluated for coreceptor usage and susceptibility. For viruses that exhibited reduced susceptibility to vicriviroc, envelope clones were phenotypically and genotypically characterized. Twenty-six vicriviroc-treated subjects experienced virological failure; for 24 the virus remained CCR5-tropic, and 2 had dual/X4 virus. Reduced susceptibility to vicriviroc, manifested as decreases in the maximum percent inhibition value (no increase in median inhibitory concentration), was detected in 4 of the 26 subjects who experienced virological failure. Clonal analysis of envelopes in samples from these 4 subjects revealed multiple sequence changes in gp160, principally within the variable domain 1/variable domain 2, variable domain 3, and variable domain 4 loops. However, no consistent pattern of mutations was observed across subjects. In this study, only a small proportion of treatment failures were associated with tropism changes or reduced susceptibility to vicriviroc. Genotypic analysis of cloned env sequences revealed no specific mutational pattern associated with reduced susceptibility to vicriviroc, although numerous changes were observed in the variable domain 3 loop and in other regions of gp160.

  3. Identification of Colletotrichum spp. isolated from strawberry in Zhejiang Province and Shanghai City, China

    PubMed Central

    Xie, Liu; Zhang, Jing-ze; Wan, Yao; Hu, Dong-wei

    2010-01-01

    Strawberry anthracnose, caused by Colletotrichum spp., is a major disease of cultivated strawberry. This study identifies 31 isolates of Colletotrichum spp. which cause strawberry anthracnose in Zhejiang Province and Shanghai City, China. Eleven isolates were identified as C. acutatum, 10 as C. gloeosporioides and 10 as C. fragariae based on morphological characteristics, phylogenetic and sequence analyses. Species-specific polymerase chain reaction (PCR) and enzyme digestion further confirmed the identification of the Colletotrichum spp., demonstrating that these three species are currently the causal agents of strawberry anthracnose in the studied regions. Based on analysis of rDNA internal transcribed spacers (ITS) sequences, sequences of all C. acutatum were identical, and little genetic variability was observed between C. fragariae and C. gloeosporioides. However, the conservative nature of the MvnI specific site from isolates of C. gloeosporioides was confirmed, and this site could be used to differentiate C. gloeosporioides from C. fragariae. PMID:20043353

  4. Mutation of domain III and domain VI in L gene conserved domain of Nipah virus

    NASA Astrophysics Data System (ADS)

    Jalani, Siti Aishah; Ibrahim, Nazlina

    2016-11-01

    Nipah virus (NiV) is the etiologic agent of a respiratory illness and causes fatal encephalitis in humans. The NiV L protein subunit is thought to be responsible for the majority of the enzymatic activities involved in viral transcription and replication. The L protein, which is the viral RNA-dependent RNA polymerase, has high sequence homology among negative-sense RNA viruses. In negative-stranded RNA viruses, six conserved domains (domains I-VI) have been identified by sequence alignment. The domains are separated by variable regions, which suggests that the protein consists of concatenated functional domains. To directly address the roles of domains III and VI, site-directed mutations were constructed by substituting bases at sequence positions 2497, 2500, 5528 and 5532. Each mutated L gene can be used in future studies to test its expression by in vitro translation.

  5. Structural determinants of nuclear export signal orientation in binding to exportin CRM1

    DOE PAGES

    Fung, Ho Yee Joyce; Fu, Szu-Chin; Brautigam, Chad A.; ...

    2015-09-08

    The Chromosome Region of Maintenance 1 (CRM1) protein mediates nuclear export of hundreds of proteins through recognition of their nuclear export signals (NESs), which are highly variable in sequence and structure. The plasticity of the CRM1-NES interaction is not well understood, as there are many NES sequences that seem incompatible with structures of the NES-bound CRM1 groove. Crystal structures of CRM1 bound to two different NESs with unusual sequences showed the NES peptides binding the CRM1 groove in the opposite orientation (minus) to that of previously studied NESs (plus). A comparison of minus and plus NESs identified structural and sequence determinants for NES orientation. The binding of NESs to CRM1 in both orientations results in a large expansion in NES consensus patterns and therefore a corresponding expansion of potential NESs in the proteome.

  6. Natural selection of the major histocompatibility complex (Mhc) in Hawaiian honeycreepers (Drepanidinae)

    USGS Publications Warehouse

    Jarvi, S.I.; Tarr, C.L.; Mcintosh, C.E.; Atkinson, C.T.; Fleischer, R.C.

    2004-01-01

    The native Hawaiian honeycreepers represent a classic example of adaptive radiation and speciation, but currently face one of the highest extinction rates in the world. Although multiple factors have likely influenced the fate of Hawaiian birds, the relatively recent introduction of avian malaria is thought to be a major factor limiting honeycreeper distribution and abundance. We have initiated genetic analyses of class II β chain Mhc genes in four species of honeycreepers using methods that eliminate the possibility of sequencing mosaic variants formed by cloning heteroduplexed polymerase chain reaction products. Phylogenetic analyses group the honeycreeper Mhc sequences into two distinct clusters. Variation within one cluster is high, with dN > dS and levels of diversity similar to other studies of Mhc (B system) genes in birds. The second cluster is nearly invariant and includes sequences from honeycreepers (Fringillidae), a sparrow (Emberizidae) and a blackbird (Emberizidae). This highly conserved cluster appears reminiscent of the independently segregating Rfp-Y system of genes defined in chickens. The notion that balancing selection operates at the Mhc in the honeycreepers is supported by trans-species polymorphism and strikingly high dN/dS ratios at codons putatively involved in peptide interaction. Mitochondrial DNA control region sequences were invariant in the i'iwi, but were highly variable in the 'amakihi. By contrast, levels of variability of class II β chain Mhc sequence codons that are hypothesized to be directly involved in peptide interactions appear comparable between i'iwi and 'amakihi. In the i'iwi, natural selection may have maintained variation within the Mhc, even in the face of what appears to be a genetic bottleneck.

  7. Identification of copy number variation in French dairy and beef breeds using next-generation sequencing.

    PubMed

    Letaief, Rabia; Rebours, Emmanuelle; Grohs, Cécile; Meersseman, Cédric; Fritz, Sébastien; Trouilh, Lidwine; Esquerré, Diane; Barbieri, Johanna; Klopp, Christophe; Philippe, Romain; Blanquet, Véronique; Boichard, Didier; Rocha, Dominique; Boussaha, Mekki

    2017-10-24

    Copy number variations (CNV) are known to play a major role in genetic variability and disease pathogenesis in several species including cattle. In this study, we report the identification and characterization of CNV in eight French beef and dairy breeds using whole-genome sequence data from 200 animals. Bioinformatics analyses to search for CNV were carried out using four different but complementary tools and we validated a subset of the CNV by both in silico and experimental approaches. We report the identification and localization of 4178 putative deletion-only, duplication-only and CNV regions, which cover 6% of the bovine autosomal genome; they were validated by two in silico approaches and/or experimentally validated using array-based comparative genomic hybridization and single nucleotide polymorphism genotyping arrays. The size of these variants ranged from 334 bp to 7.7 Mb, with an average size of ~ 54 kb. Of these 4178 variants, 3940 were deletions, 67 were duplications and 171 corresponded to both deletions and duplications, which were defined as potential CNV regions. Gene content analysis revealed that, among these variants, 1100 deletions and duplications encompassed 1803 known genes, which affect a wide spectrum of molecular functions, and 1095 overlapped with known QTL regions. Our study is a large-scale survey of CNV in eight French dairy and beef breeds. These CNV will be useful to study the link between genetic variability and economically important traits, and to improve our knowledge on the genomic architecture of cattle.

  8. Standardized quantitative measurements of wrist cartilage in healthy humans using 3T magnetic resonance imaging

    PubMed Central

    Zink, Jean-Vincent; Souteyrand, Philippe; Guis, Sandrine; Chagnaud, Christophe; Fur, Yann Le; Militianu, Daniela; Mattei, Jean-Pierre; Rozenbaum, Michael; Rosner, Itzhak; Guye, Maxime; Bernard, Monique; Bendahan, David

    2015-01-01

    AIM: To quantify the wrist cartilage cross-sectional area in humans from a 3D magnetic resonance imaging (MRI) dataset and to assess the corresponding reproducibility. METHODS: The study was conducted in 14 healthy volunteers (6 females and 8 males) between 30 and 58 years old and devoid of articular pain. Subjects were asked to lie down in the supine position with the right hand positioned above the pelvic region on top of a home-built rigid platform attached to the scanner bed. The wrist was wrapped with a flexible surface coil. MRI investigations were performed at 3T (Verio-Siemens) using volume interpolated breath hold examination (VIBE) and dual echo steady state (DESS) MRI sequences. Cartilage cross-sectional area (CSA) was measured on a slice of interest selected from a 3D dataset of the entire carpus and metacarpal-phalangeal areas on the basis of anatomical criteria using conventional image processing radiology software. Cartilage cross-sectional areas between opposite bones in the carpal region were manually selected and quantified using a thresholding method. RESULTS: Cartilage CSA measurements performed on a selected predefined slice were 292.4 ± 39 mm² using the VIBE sequence and slightly lower, 270.4 ± 50.6 mm², with the DESS sequence. The inter (14.1%) and intra (2.4%) subject variability was similar for both MRI methods. The coefficients of variation computed for the repeated measurements were also comparable for the VIBE (2.4%) and the DESS (4.8%) sequences. The carpus length averaged over the group was 37.5 ± 2.8 mm with a 7.45% between-subjects coefficient of variation. Of note, wrist cartilage CSA measured with either the VIBE or the DESS sequences was linearly related to the carpal bone length. The variability between subjects was significantly reduced to 8.4% when the CSA was normalized with respect to the carpal bone length. CONCLUSION: The ratio between wrist cartilage CSA and carpal bone length is a highly reproducible standardized measurement which normalizes the natural diversity between individuals. PMID:26396941
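
    The coefficients of variation quoted in this record can be illustrated with a minimal Python sketch. It assumes the usual definition, CV = 100 × SD / mean; the function names are hypothetical and the raw-sample values in the usage line are invented purely for illustration, not taken from the study.

```python
import statistics


def cv_from_summary(mean, sd):
    """Coefficient of variation (percent) from a reported mean and SD."""
    if mean == 0:
        raise ValueError("mean must be non-zero")
    return 100.0 * sd / mean


def cv_from_samples(values):
    """Coefficient of variation (percent) from raw measurements.

    Uses the sample standard deviation (n - 1 denominator); this is an
    assumption, since the abstract does not say which estimator was used.
    """
    return cv_from_summary(statistics.mean(values), statistics.stdev(values))


if __name__ == "__main__":
    # Carpus length reported in the abstract: 37.5 +/- 2.8 mm.
    print(round(cv_from_summary(37.5, 2.8), 2))
    # Invented measurements, for illustration only.
    print(round(cv_from_samples([10, 12, 14]), 2))
```

    Applied to the reported carpus length, this gives a value close to the 7.45% between-subjects figure in the abstract; the small difference presumably reflects rounding of the published mean and SD.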

  9. Scaling exponents for ordered maxima

    DOE PAGES

    Ben-Naim, E.; Krapivsky, P. L.; Lemons, N. W.

    2015-12-22

    We study extreme value statistics of multiple sequences of random variables. For each sequence with N variables, independently drawn from the same distribution, the running maximum is defined as the largest variable to date. We compare the running maxima of m independent sequences and investigate the probability $S_N$ that the maxima are perfectly ordered, that is, the running maximum of the first sequence is always larger than that of the second sequence, which is always larger than the running maximum of the third sequence, and so on. The probability $S_N$ is universal: it does not depend on the distribution from which the random variables are drawn. For two sequences, $S_N \sim N^{-1/2}$, and in general, the decay is algebraic, $S_N \sim N^{-\sigma_m}$, for large N. We analytically obtain the exponent $\sigma_3 \cong 1.302931$ as the root of a transcendental equation. Moreover, the exponents $\sigma_m$ grow with m, and we show that $\sigma_m \sim m$ for large m.
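
    The ordering probability described in this record can be estimated numerically. The following is a minimal Monte Carlo sketch in Python, not the authors' analytical method; the function name, the choice of a uniform distribution, and the trial counts are all assumptions made for illustration.

```python
import random


def ordered_probability(n_steps, n_seqs=2, trials=20000, seed=0):
    """Estimate the probability that the running maxima of n_seqs
    independent i.i.d. sequences stay strictly ordered (first > second
    > ...) at every one of n_steps steps.

    Variables are drawn uniformly on [0, 1); the abstract states that
    the result does not depend on the distribution.
    """
    rng = random.Random(seed)
    survived = 0
    for _ in range(trials):
        maxima = [float("-inf")] * n_seqs
        ordered = True
        for _ in range(n_steps):
            # Draw one new variable per sequence and update its running maximum.
            for k in range(n_seqs):
                x = rng.random()
                if x > maxima[k]:
                    maxima[k] = x
            # The ordering must hold after every step, not only at the end.
            if any(maxima[k] <= maxima[k + 1] for k in range(n_seqs - 1)):
                ordered = False
                break
        if ordered:
            survived += 1
    return survived / trials


if __name__ == "__main__":
    for n in (4, 16, 64):
        print(n, ordered_probability(n))
```

    For two sequences, quadrupling N should roughly halve the estimate if the decay follows the stated power law with exponent 1/2; the agreement is only approximate at small N and is limited by sampling noise.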

  10. Investigating the diversity of the 18S SSU rRNA hyper-variable region of Theileria in cattle and Cape buffalo (Syncerus caffer) from southern Africa using a next generation sequencing approach.

    PubMed

    Mans, Ben J; Pienaar, Ronel; Ratabane, John; Pule, Boitumelo; Latif, Abdalla A

    2016-07-01

    Molecular classification and systematics of the Theileria are based on the analysis of the 18S rRNA gene. Reverse line blot or conventional sequencing approaches have disadvantages in the study of 18S rRNA diversity, and a next-generation 454 sequencing approach was investigated. The 18S rRNA gene was amplified using RLB primers coupled to 96 unique sequence identifiers (MIDs). Theileria-positive samples from African buffalo (672) and cattle (480) from southern Africa were combined in batches of 96 and sequenced using the GS Junior 454 sequencer to produce 825711 informative sequences. Sequences were extracted based on MIDs and analysed to identify Theileria genotypes. Genotypes observed in buffalo and cattle were confirmed in the current study, while no new genotypes were discovered. Genotypes showed specific geographic distributions, most probably linked with vector distributions. Host specificity of buffalo- and cattle-specific genotypes was confirmed, and prevalence data as well as relative parasitemia trends indicate a preference for different hosts. Mixed infections are common, with African buffalo carrying more genotypes than cattle. Associative or exclusion co-infection profiles were observed between genotypes that may have implications for speciation and systematics: specifically, that more Theileria species may exist in cattle and buffalo than currently recognized. Analysis of primers used for Theileria parva diagnostics indicates that no new genotypes will be amplified by the current primer sets, confirming their specificity. T. parva SNP variants that occur in the 18S rRNA hypervariable region were confirmed. A next-generation sequencing approach is useful in obtaining comprehensive knowledge regarding 18S rRNA diversity and prevalence for the Theileria, allowing for the assessment of systematics and diagnostic assays based on the 18S gene. Copyright © 2016 Elsevier GmbH. All rights reserved.
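
    The step of extracting sequences by MID can be pictured with a small demultiplexing sketch in Python. This is an illustration under simplifying assumptions, not the pipeline used in the study: the tag sequences and sample names are hypothetical placeholders, and only exact matches at the start of a read are accepted, whereas real pipelines typically tolerate sequencing errors in the tag.

```python
from collections import defaultdict


def demultiplex(reads, mids):
    """Group reads by their leading identifier tag.

    reads: iterable of read sequences (strings).
    mids:  dict mapping tag sequence -> sample name (hypothetical values).
    Returns a dict of sample name -> list of reads with the tag removed.
    Reads matching no tag are kept unchanged under "unassigned".
    """
    bins = defaultdict(list)
    for read in reads:
        for tag, sample in mids.items():
            if read.startswith(tag):
                bins[sample].append(read[len(tag):])
                break
        else:
            # No tag matched this read.
            bins["unassigned"].append(read)
    return dict(bins)


if __name__ == "__main__":
    # Placeholder tags and reads, invented for illustration.
    example_mids = {"AAAACCCC": "sample_A", "GGGGTTTT": "sample_B"}
    example_reads = ["AAAACCCCACGT", "GGGGTTTTTTAA", "CCCCCCCCACGT"]
    print(demultiplex(example_reads, example_mids))
```

    Keeping unmatched reads in a separate bin, rather than discarding them, makes it easy to check how much data is lost to tag errors.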

  11. Genetic analysis of Fasciola isolates from cattle in Korea based on second internal transcribed spacer (ITS-2) sequence of nuclear ribosomal DNA.

    PubMed

    Choe, Se-Eun; Nguyen, Thuy Thi-Dieu; Kang, Tae-Gyu; Kweon, Chang-Hee; Kang, Seung-Won

    2011-09-01

    Nuclear ribosomal DNA sequence of the second internal transcribed spacer (ITS-2) has been used efficiently to identify the liver fluke species collected from different hosts and various geographic regions. ITS-2 sequences of 19 Fasciola samples collected from Korean native cattle were determined and compared. Sequence comparison including ITS-2 sequences of isolates from this study and reference sequences from Fasciola hepatica and Fasciola gigantica and intermediate Fasciola in GenBank revealed seven identical variable sites of investigated isolates. Among 19 samples, 12 individuals had ITS-2 sequences completely identical to that of pure F. hepatica, five possessed the sequences identical to F. gigantica type, whereas two shared the sequence of both F. hepatica and F. gigantica. No variations in length and nucleotide composition of ITS-2 sequence were observed within isolates that belonged to F. hepatica or F. gigantica. At position 218, five Fasciola containing a single-base substitution (C>T) formed a distinct branch inside the F. gigantica-type group which was similar to those of Asian-origin isolates. The phylogenetic tree of the Fasciola spp. based on complete ITS-2 sequences from this study and other representative isolates in different locations clearly showed that pure F. hepatica, F. gigantica type and intermediate Fasciola were observed. The result also provided additional genetic evidence, based on the ITS-2 sequence, for the existence of three forms of Fasciola in native cattle in Korea.

  12. Integrity of immunoglobulin variable regions is supported by GANP during AID-induced somatic hypermutation in germinal center B cells

    PubMed Central

    Eid, Mohammed Mansour Abbas; Shimoda, Mayuko; Singh, Shailendra Kumar; Almofty, Sarah Ameen; Pham, Phuong; Goodman, Myron F.; Maeda, Kazuhiko; Sakaguchi, Nobuo

    2017-01-01

    Immunoglobulin affinity maturation depends on somatic hypermutation (SHM) in immunoglobulin variable (IgV) regions initiated by activation-induced cytidine deaminase (AID). AID induces transition mutations by C→U deamination on both strands, causing C:G→T:A. Error-prone repairs of U by base excision and mismatch repairs (MMRs) create transversion mutations at C/G and mutations at A/T sites. In Neuberger’s model, it remained to be clarified how transition/transversion repair is regulated. We investigate the role of AID-interacting GANP (germinal center-associated nuclear protein) in the IgV SHM profile. GANP enhances transition mutation of the non-transcribed strand G and reduces mutation at A, restricted to GYW of the AID hotspot motif. It reduces DNA polymerase η hotspot mutations associated with MMRs followed by uracil-DNA glycosylase. Mutation comparison between IgV complementary and framework regions (FWRs) by Bayesian statistical estimation demonstrates that GANP supports the preservation of IgV FWR genomic sequences. GANP works to maintain antibody structure by reducing drastic changes in the IgV FWR in affinity maturation. PMID:28541550

  13. Mutation Scanning in a Single and a Stacked Genetically Modified (GM) Event by Real-Time PCR and High Resolution Melting (HRM) Analysis

    PubMed Central

    Ben Ali, Sina-Elisabeth; Madi, Zita Erika; Hochegger, Rupert; Quist, David; Prewein, Bernhard; Haslberger, Alexander G.; Brandes, Christian

    2014-01-01

    Genetic mutations must be avoided during the production and use of seeds. In the European Union (EU), Directive 2001/18/EC requires any DNA construct introduced via transformation to be stable. Establishing genetic stability is critical for the approval of genetically modified organisms (GMOs). In this study, genetic stability of two GMOs was examined using high resolution melting (HRM) analysis and real-time polymerase chain reaction (PCR) employing Scorpion primers for amplification. The genetic variability of the transgenic insert and that of the flanking regions in a single oilseed rape variety (GT73) and a stacked maize (MON88017 × MON810) was studied. The GT73 and the 5' region of MON810 showed no instabilities in the examined regions. However, two out of 100 analyzed samples carried a heterozygous point mutation in the 3' region of MON810 in the stacked variety. These results were verified by direct sequencing of the amplified PCR products as well as by sequencing of cloned PCR fragments. The occurrence of the mutation suggests that the 5' region is more suitable than the 3' region for the quantification of MON810. The identification of the single nucleotide polymorphism (SNP) in a stacked event is in contrast to the results of earlier studies of the same MON810 region in a single event where no DNA polymorphism was found. PMID:25365178

  14. Preliminary notes on dual relevance of ITS sequences and pigments in Hygrocybe taxonomy.

    PubMed

    Babos, M; Halász, K; Zagyva, T; Zöld-Balogh, A; Szegő, D; Bratek, Z

    2011-06-01

    The relationships based on ITS sequences of 48 Hygrocybe s.l. specimens were studied and compared with previously described taxonomic groups. Our specimens formed two well-separated genetic groups. The first one includes the species characterized by vivid yellow and red colours, while species belonging to other clades were pallid or pale brown, and in most cases with pink or olive tones. This separation is supported by the presence of muscaflavin pigments among some species referred to Hygrocybe (Bresinsky & Kronawitter 1986). The subgenera distinguished by morphological features can be relatively well recognized on phylogenetic trees; however, the majority of sections were not supported. Variability in the ITS region of Hygrocybe species is unusually high. In some cases sequences differed by more than 25%, and the lengths of ITS regions also showed large differences. Taxa that were considered as closely related, e.g. the H. conica aggregate, were found to have identical or highly similar sequences. Our results seem to confirm the taxonomic concept of Bresinsky (2008), who proposed the division of the genus Hygrocybe. Hence H. calyptriformis and all examined members of subg. Gliophorus (H. irrigata, H. laeta, H. nitrata, H. psittacina) and subg. Cuphophyllus could be excluded from the genus Hygrocybe s.str. Based on these results, further research using DNA markers at the intergeneric level is suggested to re-evaluate the taxonomy of former Hygrocybe species.

  15. Phylogenetic evidence for multiple intertypic recombinations in enterovirus B81 strains isolated in Tibet, China

    PubMed Central

    Hu, Lan; Zhang, Yong; Hong, Mei; Zhu, Shuangli; Yan, Dongmei; Wang, Dongyan; Li, Xiaolei; Zhu, Zhen; Tsewang; Xu, Wenbo

    2014-01-01

    Enterovirus B81 (EV-B81) is a newly identified serotype within the species enterovirus B (EV-B). To date, only eight nucleotide sequences of EV-B81 have been published and only one full-length genome sequence (the prototype strain) has been made available in the GenBank database. Here, we report the full-length genome sequences of two EV-B81 strains isolated in the Tibet Autonomous Region of China during acute flaccid paralysis surveillance activities, and we also conducted an antibody seroprevalence study in two prefectures of Tibet. The sequence comparison and phylogenetic dendrogram analysis revealed high variability among the global EV-B81 strains and frequent intertypic recombination in the non-structural protein region of EV-B serotypes, suggesting high genetic diversity of EV-B81. However, low positive rates and low titers of neutralizing antibodies against EV-B81 were detected. Nearly 68% of children under the age of five had no neutralizing antibodies against EV-B81. Hence, the extent of transmission and the exposure of the population to this EV type are very limited. Although little is known about the biological and pathogenic properties of EV-B81, because the limited number of isolates has allowed little research in this field, our study provides basic information for further studies of EV-B81. PMID:25112835

  16. Pneumocystis jirovecii multilocus genotyping profiles in patients from Portugal and Spain.

    PubMed

    Esteves, F; Montes-Cano, M A; de la Horra, C; Costa, M C; Calderón, E J; Antunes, F; Matos, O

    2008-04-01

    Pneumonia caused by the opportunistic organism Pneumocystis jirovecii is a clinically important infection affecting AIDS and other immunocompromised patients. The present study aimed to compare and characterise the frequency pattern of DNA sequences from the P. jirovecii mitochondrial large-subunit rRNA (mtLSU rRNA) gene, the dihydropteroate synthase (DHPS) gene and the internal transcribed spacer (ITS) regions of the nuclear rRNA operon in specimens from Lisbon (Portugal) and Seville (Spain). Total DNA was extracted and used for specific molecular sequence analysis of the three loci. In both populations, mtLSU rRNA gene analysis revealed an overall prevalence of genotype 1. In the Portuguese population, genotype 2 was the second most common, followed by genotype 3. Inversely, in the Spanish population, genotype 3 was the second most common, followed by genotype 2. The DHPS wild-type sequence was the genotype observed most frequently in both populations, and the DHPS genotype frequency pattern was identical to distribution patterns revealed in other European studies. ITS types showed a significant diversity in both populations because of the high sequence variability in these genomic regions. The most prevalent ITS type in the Portuguese population was Eg, followed by Cg. In contrast to other European studies, Bi was the most common ITS type in the Spanish samples, followed by Eg. A statistically significant association between mtLSU rRNA genotype 1 and ITS type Eg was revealed.

  17. Effects of a Transposable Element Insertion on Alcohol Dehydrogenase Expression in Drosophila Melanogaster

    PubMed Central

    Dunn, R. C.; Laurie, C. C.

    1995-01-01

    Variation in the DNA sequence and level of alcohol dehydrogenase (Adh) gene expression in Drosophila melanogaster have been studied to determine what types of DNA polymorphisms contribute to phenotypic variation in natural populations. The Adh gene, like many others, shows a high level of variability in both DNA sequence and quantitative level of expression. A number of transposable element insertions occur in the Adh region and one of these, a copia insertion in the 5' flanking region, is associated with unusually low Adh expression. To determine whether this insertion (called RI42) causes the low expression level, the insertion was excised from the cloned RI42 Adh gene and the effect was assessed by P-element transformation. Removal of this insertion causes a threefold increase in the level of ADH, clearly showing that it contributes to the naturally occurring variation in expression at this locus. Removal of all but one LTR also causes a threefold increase, indicating that the mechanism is not a simple sequence disruption. Furthermore, this copia insertion, which is located between the two Adh promoters and their upstream enhancer sequences, has differential effects on the levels of proximal and distal transcripts. Finally, a test for the possible modifying effects of two suppressor loci, su(w(a)) and su(f), on this insertional mutation was negative, in contrast to a previous report in the literature. PMID:7498745

  18. Leishmania tropica isolates from non-healed and healed patients in Iran: A molecular typing and phylogenetic analysis.

    PubMed

    Bamorovat, Mehdi; Sharifi, Iraj; Mohammadi, Mohammad Ali; Eybpoosh, Sana; Nasibi, Saeid; Aflatoonian, Mohammad Reza; Khosravi, Ahmad

    2018-03-01

    The precise identification of the parasite species causing leishmaniasis is essential for selecting proper treatment modality. The present study aims to compare the nucleotide variations of the ITS1, 7SL RNA, and Hsp70 sequences between non-healed and healed anthroponotic cutaneous leishmaniasis (ACL) patients in major foci in Iran. A case-control study was carried out from September 2015 to October 2016 in the cities of Kerman and Bam, in the southeast of Iran. Randomly selected skin-scraping lesions of 40 patients (20 non-healed and 20 healed) were examined and the organisms were grown in a culture medium. Promastigotes were collected by centrifugation and kept for further molecular examinations. The extracted DNA was amplified and sequenced. After global sequence alignment with BioEdit software, maximum likelihood phylogenetic analysis was performed in PhyML for typing of Leishmania isolates. Nucleotide composition of each genetic region was also compared between non-healed and healed patients. Our results showed that all isolates belonged to the Leishmania tropica complex, with their genetic composition in the ITS1 region being different among non-healed and healed patients. 7SL RNA and Hsp70 regions were genetically identical between both groups. Variability in nucleotide patterns observed between both groups in the ITS1 region may serve to encourage future research on the function of these polymorphisms and may improve our understanding of the role of parasite genome properties on patients' response to Leishmania treatment. Our results also do not support future use of 7SL RNA and Hsp70 regions of the parasite for comparative genomic analyses. Copyright © 2018 Elsevier Ltd. All rights reserved.

  19. Multiplex Amplification Refractory Mutation System Polymerase Chain Reaction (ARMS-PCR) for diagnosis of natural infection with canine distemper virus

    PubMed Central

    2010-01-01

    Background: Canine distemper virus (CDV) is present worldwide and produces a lethal systemic infection of wild and domestic Canidae. Pre-existing antibodies acquired from vaccination or previous CDV infection might interfere with the interpretation of a serologic diagnosis method. In addition, due to the high similarity of nucleic acid sequences between wild-type CDV and the new vaccine strain, current PCR-derived methods cannot be applied for the definite confirmation of CD infection. Hence, it is worthwhile to develop a simple and rapid nucleotide-based assay that differentiates wild-type CDV, which causes disease, from attenuated CDVs present after vaccination. High-frequency variations have been found in the region spanning from the 3'-untranslated region (UTR) of the matrix (M) gene to the fusion (F) gene (designated M-F UTR) in a few CDV strains. To establish a differential diagnosis assay, an amplification refractory mutation analysis was established based on the highly variable M-F UTR and F regions. Results: Frequent polymorphisms were found scattered throughout the M-F UTR region; the nucleic acid identity between local strains and vaccine strains ranged from 82.5% to 93.8%. A tract of AAA residues located 35 nucleotides downstream of the F gene start codon, highly conserved in three vaccine strains, was replaced with TGC in the local strains; this served as the target sequence for the design of discrimination primers. The method established in the present study successfully differentiated seven Taiwanese CDV field isolates, all belonging to the Asia-1 lineage, from vaccine strains. Conclusions: The method described herein would be useful for several clinical applications, such as confirmation of natural CDV infection, evaluation of vaccination status and verification of the circulating viral genotypes. PMID:20534175

  20. Multiplex Amplification Refractory Mutation System Polymerase Chain Reaction (ARMS-PCR) for diagnosis of natural infection with canine distemper virus.

    PubMed

    Chulakasian, Songkhla; Lee, Min-Shiuh; Wang, Chi-Young; Chiou, Shyan-Song; Lin, Kuan-Hsun; Lin, Fong-Yuan; Hsu, Tien-Huan; Wong, Min-Liang; Chang, Tien-Jye; Hsu, Wei-Li

    2010-06-10

    Canine distemper virus (CDV) is present worldwide and produces a lethal systemic infection of wild and domestic Canidae. Pre-existing antibodies acquired from vaccination or previous CDV infection might interfere with the interpretation of a serologic diagnosis method. In addition, due to the high similarity of nucleic acid sequences between wild-type CDV and the new vaccine strain, current PCR-derived methods cannot be applied for the definite confirmation of CD infection. Hence, it is worthwhile to develop a simple and rapid nucleotide-based assay that differentiates wild-type CDV, which causes disease, from attenuated CDVs present after vaccination. High-frequency variations have been found in the region spanning from the 3'-untranslated region (UTR) of the matrix (M) gene to the fusion (F) gene (designated M-F UTR) in a few CDV strains. To establish a differential diagnosis assay, an amplification refractory mutation analysis was established based on the highly variable M-F UTR and F regions. Frequent polymorphisms were found scattered throughout the M-F UTR region; the nucleic acid identity between local strains and vaccine strains ranged from 82.5% to 93.8%. A tract of AAA residues located 35 nucleotides downstream of the F gene start codon, highly conserved in three vaccine strains, was replaced with TGC in the local strains; this served as the target sequence for the design of discrimination primers. The method established in the present study successfully differentiated seven Taiwanese CDV field isolates, all belonging to the Asia-1 lineage, from vaccine strains. The method described herein would be useful for several clinical applications, such as confirmation of natural CDV infection, evaluation of vaccination status and verification of the circulating viral genotypes.

  1. HIV1 V3 loop hypermutability is enhanced by the guanine usage bias in the part of env gene coding for it.

    PubMed

    Khrustalev, Vladislav Victorovich

    2009-01-01

    Guanine is the most mutable nucleotide in HIV genes because of frequently occurring G to A transitions, which are caused by cytosine deamination in viral DNA minus strands catalyzed by APOBEC enzymes. Distribution of guanine between three codon positions should influence the probability for G to A mutation to be nonsynonymous (to occur in first or second codon position). We discovered that nucleotide sequences of env genes coding for third variable regions (V3 loops) of gp120 from HIV1 and HIV2 have different kinds of guanine usage biases. In the HIV1 reference strain and 100 additionally analyzed HIV1 strains the guanine usage bias in V3 loop coding regions (2G>1G>3G) should lead to elevated occurrence rates of nonsynonymous G to A transitions. In the HIV2 reference strain and 100 other HIV2 strains guanine usage bias in V3 loop coding regions (3G>2G>1G) should protect V3 loops from hypermutability. According to the HIV1 and HIV2 V3 alignment, insertion of the sequence enriched with 2G (21 codons in length) occurred during the evolution of the HIV1 predecessor, while insertion of the different sequence enriched with 3G (19 codons in length) occurred during the evolution of the HIV2 predecessor. The higher the level of 3G in the V3 coding region, the lower the occurrence rate of immune-escape mutations should be. This hypothesis was tested in this study by comparing the guanine usage in V3 loop coding regions from HIV1 fast and slow progressors. All calculations have been performed by our algorithms "VVK In length", "VVK Dinucleotides" and "VVK Consensus" (www.barkovsky.hotmail.ru).
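
    The quantities 1G, 2G and 3G in this record denote guanine usage at the first, second and third codon positions. The following minimal Python sketch shows how such values could be computed; it is not the authors' "VVK" software, and the function name, the per-codon normalisation, and the toy sequence are assumptions made for illustration.

```python
def guanine_by_codon_position(seq):
    """Return the fraction of codons carrying G at positions 1, 2 and 3.

    seq is assumed to be an in-frame coding sequence starting at codon
    position 1; trailing bases that do not fill a codon are ignored.
    """
    seq = seq.upper().replace("U", "T")
    usable = len(seq) - len(seq) % 3
    n_codons = usable // 3
    if n_codons == 0:
        raise ValueError("sequence must contain at least one full codon")
    counts = [0, 0, 0]
    for i in range(usable):
        if seq[i] == "G":
            counts[i % 3] += 1
    return {
        "1G": counts[0] / n_codons,
        "2G": counts[1] / n_codons,
        "3G": counts[2] / n_codons,
    }


if __name__ == "__main__":
    # Toy sequence (codons GGA, AGC, TTG), invented for illustration.
    print(guanine_by_codon_position("GGAAGCTTG"))
```

    Comparing the three values for a V3-coding region would show whether its bias resembles the 2G>1G>3G or the 3G>2G>1G pattern described in the abstract.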

  2. National Earthquake Information Center Seismic Event Detections on Multiple Scales

    NASA Astrophysics Data System (ADS)

    Patton, J.; Yeck, W. L.; Benz, H.; Earle, P. S.; Soto-Cordero, L.; Johnson, C. E.

    2017-12-01

    The U.S. Geological Survey National Earthquake Information Center (NEIC) monitors seismicity on local, regional, and global scales using automatic picks from more than 2,000 near-real time seismic stations. This presents unique challenges in automated event detection due to the high variability in data quality, network geometries and density, and distance-dependent variability in observed seismic signals. To lower the overall detection threshold while minimizing false detection rates, NEIC has begun to test the incorporation of new detection and picking algorithms, including multiband (Lomax et al., 2012) and kurtosis (Baillard et al., 2014) pickers, and a new Bayesian associator (Glass 3.0). The Glass 3.0 associator allows for simultaneous processing of variably scaled detection grids, each with a unique set of nucleation criteria (e.g., nucleation threshold, minimum associated picks, nucleation phases) to meet specific monitoring goals. We test the efficacy of these new tools on event detection in networks of various scales and geometries, compare our results with previous catalogs, and discuss lessons learned. For example, we find that on local and regional scales, rapid nucleation of small events may require event nucleation with both P and higher-amplitude secondary phases (e.g., S or Lg). We provide examples of the implementation of a scale-independent associator for an induced seismicity sequence (local-scale), a large aftershock sequence (regional-scale), and for monitoring global seismicity. Baillard, C., Crawford, W. C., Ballu, V., Hibert, C., & Mangeney, A. (2014). An automatic kurtosis-based P-and S-phase picker designed for local seismic networks. Bulletin of the Seismological Society of America, 104(1), 394-409. Lomax, A., Satriano, C., & Vassallo, M. (2012). Automatic picker developments and optimization: FilterPicker - a robust, broadband picker for real-time seismic monitoring and earthquake early-warning, Seism. Res. Lett., 83, 531-540, doi: 10.1785/gssrl.83.3.531.

  3. The genealogy of sequences containing multiple sites subject to strong selection in a subdivided population.

    PubMed Central

    Nordborg, Magnus; Innan, Hideki

    2003-01-01

    A stochastic model for the genealogy of a sample of recombining sequences containing one or more sites subject to selection in a subdivided population is described. Selection is incorporated by dividing the population into allelic classes and then conditioning on the past sizes of these classes. The past allele frequencies at the selected sites are thus treated as parameters rather than as random variables. The purpose of the model is not to investigate the dynamics of selection, but to investigate effects of linkage to the selected sites on the genealogy of the surrounding chromosomal region. This approach is useful for modeling strong selection, when it is natural to parameterize the past allele frequencies at the selected sites. Several models of strong balancing selection are used as examples, and the effects on the pattern of neutral polymorphism in the chromosomal region are discussed. We focus in particular on the statistical power to detect balancing selection when it is present. PMID:12663556

  4. The genealogy of sequences containing multiple sites subject to strong selection in a subdivided population.

    PubMed

    Nordborg, Magnus; Innan, Hideki

    2003-03-01

    A stochastic model for the genealogy of a sample of recombining sequences containing one or more sites subject to selection in a subdivided population is described. Selection is incorporated by dividing the population into allelic classes and then conditioning on the past sizes of these classes. The past allele frequencies at the selected sites are thus treated as parameters rather than as random variables. The purpose of the model is not to investigate the dynamics of selection, but to investigate effects of linkage to the selected sites on the genealogy of the surrounding chromosomal region. This approach is useful for modeling strong selection, when it is natural to parameterize the past allele frequencies at the selected sites. Several models of strong balancing selection are used as examples, and the effects on the pattern of neutral polymorphism in the chromosomal region are discussed. We focus in particular on the statistical power to detect balancing selection when it is present.

  5. A novel program to design siRNAs simultaneously effective to highly variable virus genomes.

    PubMed

    Lee, Hui Sun; Ahn, Jeonghyun; Jun, Eun Jung; Yang, Sanghwa; Joo, Chul Hyun; Kim, Yoo Kyum; Lee, Heuiran

    2009-07-10

    A major concern of antiviral therapy using small interfering RNAs (siRNAs) targeting RNA viral genomes is high sequence diversity and mutation rate due to genetic instability. To overcome this problem, it is indispensable to design siRNAs targeting highly conserved regions. We thus designed CAPSID (Convenient Application Program for siRNA Design), a novel bioinformatics program to identify siRNAs targeting highly conserved regions within RNA viral genomes. From a set of input RNAs of diverse sequences, CAPSID rapidly searches conserved patterns and suggests highly potent siRNA candidates in a hierarchical manner. To validate the usefulness of this novel program, we investigated the antiviral potency of universal siRNA for various Human enterovirus B (HEB) serotypes. Assessment of antiviral efficacy using HeLa cells clearly demonstrates that HEB-specific siRNAs exhibit protective effects against all HEBs examined. These findings strongly indicate that CAPSID can be applied to select universal antiviral siRNAs against highly divergent viral genomes.
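
    The idea of searching a set of diverse input sequences for conserved patterns can be illustrated with a very small sketch. This is not CAPSID's algorithm, which the abstract does not describe; it simply intersects the fixed-length substrings (k-mers) of every input sequence. The function names, the toy sequences, and the window length of 8 are assumptions chosen for illustration (real siRNA targets are longer).

```python
from typing import List, Set


def kmers(seq: str, k: int) -> Set[str]:
    """Return the set of all length-k substrings of seq."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def conserved_kmers(seqs: List[str], k: int) -> List[str]:
    """Return the k-mers present in every input sequence, sorted alphabetically."""
    if not seqs:
        return []
    shared = kmers(seqs[0], k)
    for s in seqs[1:]:
        # Keep only k-mers that also occur in this sequence.
        shared &= kmers(s, k)
    return sorted(shared)


# Hypothetical toy "genomes" sharing one 8-nt window (illustrative data only).
genomes = [
    "AUGGCUACGUUAGC",
    "CCGGCUACGUAAGC",
    "AAGGCUACGUCCGC",
]
print(conserved_kmers(genomes, 8))  # ['GGCUACGU']
```

    Exact matching is the simplest possible criterion. A practical tool would presumably also tolerate a limited number of mismatches and rank candidates by predicted potency, neither of which is attempted here.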

  6. The influence of phonological priming on variability in articulation

    NASA Astrophysics Data System (ADS)

    Babel, Molly E.; Munson, Benjamin

    2004-05-01

    Previous research [Sevald and Dell, Cognition 53, 91-127 (1994)] has found that reiterant sequences of CVC words are produced more quickly when the prime word and target word share VC sequences (i.e., sequences like sit sick) than when they are identical (sequences like sick sick). Even slower production rates are found when primes and targets share a CV sequence (sequences like kick sick). These data have been used to support a model of speech production in which lexical items and their constituent phonemes are activated sequentially. The current experiment investigated whether phonological priming also influences variability in the acoustic characteristics of words. Specifically, we examined whether greater variability in the acoustic characteristics of target words was noted in the CV-related prime context than in the identical-prime context, and whether less variability was noted in the VC-related context. Thirty adult subjects with typical speech, language, and hearing ability produced reiterant two-word sequences that varied in their phonological similarity. The duration, first, and second formant frequencies of the target-words' vowels were measured. Preliminary analyses indicate that phonological priming does not have a systematic effect on variability in these acoustic parameters.

  7. Genomic and phenotypic variation in epidemic-spanning Salmonella enterica serovar Enteritidis isolates

    PubMed Central

    2009-01-01

    Background: Salmonella enterica serovar Enteritidis (S. Enteritidis) has caused major epidemics of gastrointestinal infection in many different countries. In this study we investigate genome divergence and pathogenic potential in S. Enteritidis isolated before, during and after an epidemic in Uruguay. Results: 266 S. Enteritidis isolates were genotyped using RAPD-PCR and a selection were subjected to PFGE analysis. From these, 29 isolates spanning different periods, genetic profiles and sources of isolation were assayed for their ability to infect human epithelial cells and subjected to comparative genomic hybridization using a Salmonella pan-array and the sequenced strain S. Enteritidis PT4 P125109 as reference. Six other isolates from distant countries were included as external comparators. Two hundred and thirty three chromosomal genes as well as the virulence plasmid were found as variable among S. Enteritidis isolates. Ten out of the 16 chromosomal regions that varied between different isolates correspond to phage-like regions. The 2 oldest pre-epidemic isolates lack phage SE20 and harbour other phage encoded genes that are absent in the sequenced strain. Besides variation in prophage, we found variation in genes involved in metabolism and bacterial fitness. Five epidemic strains lack the complete Salmonella virulence plasmid. Significantly, strains with indistinguishable genetic patterns still showed major differences in their ability to infect epithelial cells, indicating that the approach used was insufficient to detect the genetic basis of this differential behaviour. Conclusion: The recent epidemic of S. Enteritidis infection in Uruguay has been driven by the introduction of closely related strains of phage type 4 lineage. Our results confirm previous reports demonstrating a high degree of genetic homogeneity among S. Enteritidis isolates. However, 10 of the regions of variability described here are for the first time reported as being variable in S. Enteritidis. In particular, the oldest pre-epidemic isolates carry phage-associated genetic regions not previously reported in S. Enteritidis. Overall, our results support the view that phages play a crucial role in the generation of genetic diversity in S. Enteritidis and that phage SE20 may be a key marker for the emergence of particular isolates capable of causing epidemics. PMID:19922635

  8. Systematic analysis and evolution of 5S ribosomal DNA in metazoans.

    PubMed

    Vierna, J; Wehner, S; Höner zu Siederdissen, C; Martínez-Lage, A; Marz, M

    2013-11-01

    Several studies on 5S ribosomal DNA (5S rDNA) have been focused on a subset of the following features in mostly one organism: number of copies, pseudogenes, secondary structure, promoter and terminator characteristics, genomic arrangements, types of non-transcribed spacers and evolution. In this work, we systematically analyzed 5S rDNA sequence diversity in available metazoan genomes, and showed organism-specific and evolutionary-conserved features. Putatively functional sequences (12,766) from 97 organisms allowed us to identify general features of this multigene family in animals. Interestingly, we show that each mammal species has a highly conserved (housekeeping) 5S rRNA type and many variable ones. The genomic organization of 5S rDNA is still under debate. Here, we report the occurrence of several paralog 5S rRNA sequences in 58 of the examined species, and a flexible genome organization of 5S rDNA in animals. We found heterogeneous 5S rDNA clusters in several species, supporting the hypothesis of an exchange of 5S rDNA from one locus to another. A rather high degree of variation of upstream, internal and downstream putative regulatory regions appears to characterize metazoan 5S rDNA. We systematically studied the internal promoters and described three different types of termination signals, as well as variable distances between the coding region and the typical termination signal. Finally, we present a statistical method for detection of linkage among noncoding RNA (ncRNA) gene families. This method showed no evolutionary-conserved linkage among 5S rDNAs and any other ncRNA genes within Metazoa, even though we found 5S rDNA to be linked to various ncRNAs in several clades.

  9. Systematic analysis and evolution of 5S ribosomal DNA in metazoans

    PubMed Central

    Vierna, J; Wehner, S; Höner zu Siederdissen, C; Martínez-Lage, A; Marz, M

    2013-01-01

    Several studies on 5S ribosomal DNA (5S rDNA) have been focused on a subset of the following features in mostly one organism: number of copies, pseudogenes, secondary structure, promoter and terminator characteristics, genomic arrangements, types of non-transcribed spacers and evolution. In this work, we systematically analyzed 5S rDNA sequence diversity in available metazoan genomes, and showed organism-specific and evolutionary-conserved features. Putatively functional sequences (12 766) from 97 organisms allowed us to identify general features of this multigene family in animals. Interestingly, we show that each mammal species has a highly conserved (housekeeping) 5S rRNA type and many variable ones. The genomic organization of 5S rDNA is still under debate. Here, we report the occurrence of several paralog 5S rRNA sequences in 58 of the examined species, and a flexible genome organization of 5S rDNA in animals. We found heterogeneous 5S rDNA clusters in several species, supporting the hypothesis of an exchange of 5S rDNA from one locus to another. A rather high degree of variation of upstream, internal and downstream putative regulatory regions appears to characterize metazoan 5S rDNA. We systematically studied the internal promoters and described three different types of termination signals, as well as variable distances between the coding region and the typical termination signal. Finally, we present a statistical method for detection of linkage among noncoding RNA (ncRNA) gene families. This method showed no evolutionary-conserved linkage among 5S rDNAs and any other ncRNA genes within Metazoa, even though we found 5S rDNA to be linked to various ncRNAs in several clades. PMID:23838690

  10. Automatic Cloud Classification from Multi-Spectral Satellite Data Over Oceanic Regions

    DTIC Science & Technology

    1992-01-14

    parameters the first two colors used are, blue for low values and dark green for high parameter values. If a third class is identified, the intermediate...intermediate yellow and high dark green classes. The color sequence blue-yellow-light green- dark green, then characterizes the low to high parameter value...to light green then to dark green correspond to superpixels of increasing (from low to high) variability in their altitude, (see Table V.3). When the

  11. Exploring variations of earthquake moment on patches with heterogeneous strength

    NASA Astrophysics Data System (ADS)

    Lin, Y. Y.; Lapusta, N.

    2016-12-01

    Finite-fault inversions show that earthquake slip is typically non-uniform over the ruptured region, likely due to heterogeneity of the earthquake source. Observations also show that events from the same fault area can have the same source duration but different magnitude ranging from 0.0 to 2.0 (Lin et al., GJI, 2016). Strong heterogeneity in strength over a patch could provide a potential explanation of such behavior, with the event duration controlled by the size of the patch and event magnitude determined by how much of the patch area has been ruptured. To explore this possibility, we numerically simulate earthquake sequences on a rate-and-state fault, with a seismogenic patch governed by steady-state velocity-weakening friction surrounded by a steady-state velocity-strengthening region. The seismogenic patch contains strong variations in strength due to variable normal stress. Our long-term simulations of slip in this model indeed generate sequences of earthquakes of various magnitudes. In some seismic events, dynamic rupture cannot overcome areas with higher normal strength, and smaller events result. When the higher-strength areas are loaded by previous slip and rupture, larger events result, as expected. Our current work is directed towards exploring a range of such models, determining the variability in the seismic moment that they can produce, and determining the observable properties of the resulting events.
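
    To give a sense of how much seismic moment a magnitude spread of 0.0 to 2.0 represents, the sketch below uses the standard moment-magnitude relation Mw = (2/3)(log10 M0 - 9.1), with M0 in newton-metres. Treating the magnitudes quoted in the abstract as moment magnitudes is an assumption made here for illustration, and the function names are hypothetical.

```python
import math


def moment_from_magnitude(mw: float) -> float:
    """Seismic moment M0 in N*m for a given moment magnitude Mw."""
    return 10 ** (1.5 * mw + 9.1)


def magnitude_from_moment(m0: float) -> float:
    """Moment magnitude Mw for a seismic moment M0 given in N*m."""
    return (2.0 / 3.0) * (math.log10(m0) - 9.1)


# Two magnitude units correspond to a factor of 10**(1.5 * 2) in moment.
ratio = moment_from_magnitude(2.0) / moment_from_magnitude(0.0)
print(f"moment ratio between Mw 2.0 and Mw 0.0: {ratio:.0f}")
```

    Under that assumption, events of equal duration on the same patch would differ in moment by roughly three orders of magnitude, which is the scale of variability the heterogeneous-strength model is meant to explain.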

  12. The making of the minibody: an engineered beta-protein for the display of conformationally constrained peptides.

    PubMed

    Tramontano, A; Bianchi, E; Venturini, S; Martin, F; Pessi, A; Sollazzo, M

    1994-03-01

    Conformationally constraining selectable peptides onto a suitable scaffold that enables their conformation to be predicted or readily determined by experimental techniques would considerably boost the drug discovery process by reducing the gap between the discovery of a peptide lead and the design of a peptidomimetic with a more desirable pharmacological profile. With this in mind, we designed the minibody, a 61-residue beta-protein aimed at retaining some desirable features of immunoglobulin variable domains, such as tolerance to sequence variability in selected regions of the protein and predictability of the main chain conformation of the same regions, based on the 'canonical structures' model. To test the ability of the minibody scaffold to support functional sites we also designed a metal binding version of the protein by suitably choosing the sequences of its loops. The minibody was produced both by chemical synthesis and expression in E. coli and characterized by size exclusion chromatography, UV CD (circular dichroism) spectroscopy and metal binding activity. All our data supported the model, but a more detailed structural characterization of the molecule was impaired by its low solubility. We were able to overcome this problem both by further mutagenesis of the framework and by addition of a solubilizing motif. The minibody is being used to select constrained human IL-6 peptidic ligands from a library displayed on the surface of the f1 bacteriophage.

  13. Multiple introductions of serotype O foot-and-mouth disease viruses into East Asia in 2010–2011

    PubMed Central

    2013-01-01

    Foot-and-mouth disease virus (FMDV) is a highly contagious and genetically variable virus. Sporadic introductions of this virus into FMD-free countries may cause outbreaks with devastating consequences. In 2010 and 2011, incursions of the FMDV O/SEA/Mya-98 strain, normally restricted to countries in mainland Southeast Asia, caused extensive outbreaks across East Asia. In this study, 12 full genome FMDV sequences for representative samples collected from the People’s Republic of China (PR China) including the Hong Kong Special Administrative Region (SAR), the Republic of Korea, the Democratic People’s Republic of Korea, Japan, Mongolia and The Russian Federation were generated and compared with additional contemporary sequences from viruses within this lineage. These complete genomes were 8119 to 8193 nucleotides in length and differed at 1181 sites, sharing a nucleotide identity ≥ 91.0% and an amino acid identity ≥ 96.6%. An unexpected deletion of 70 nucleotides within the 5′-untranslated region which resulted in a shorter predicted RNA stem-loop for the S-fragment was revealed in two sequences from PR China and Hong Kong SAR and five additional related samples from the region. Statistical parsimony and Bayesian phylogenetic analysis provide evidence that these outbreaks in East Asia were generated by two independent introductions of the O/SEA/Mya-98 lineage sometime between August 2008 and March 2010. The rapid emergence of these viruses from Southeast Asia highlights the importance of adopting approaches to closely monitor the spread of this lineage that now poses a threat to livestock industries in other regions. PMID:24007643

  14. Multiple introductions of serotype O foot-and-mouth disease viruses into East Asia in 2010-2011.

    PubMed

    Valdazo-González, Begoña; Timina, Anna; Scherbakov, Alexey; Abdul-Hamid, Nor Faizah; Knowles, Nick J; King, Donald P

    2013-09-05

    Foot-and-mouth disease virus (FMDV) is a highly contagious and genetically variable virus. Sporadic introductions of this virus into FMD-free countries may cause outbreaks with devastating consequences. In 2010 and 2011, incursions of the FMDV O/SEA/Mya-98 strain, normally restricted to countries in mainland Southeast Asia, caused extensive outbreaks across East Asia. In this study, 12 full genome FMDV sequences for representative samples collected from the People's Republic of China (PR China) including the Hong Kong Special Administrative Region (SAR), the Republic of Korea, the Democratic People's Republic of Korea, Japan, Mongolia and The Russian Federation were generated and compared with additional contemporary sequences from viruses within this lineage. These complete genomes were 8119 to 8193 nucleotides in length and differed at 1181 sites, sharing a nucleotide identity ≥ 91.0% and an amino acid identity ≥ 96.6%. An unexpected deletion of 70 nucleotides within the 5'-untranslated region which resulted in a shorter predicted RNA stem-loop for the S-fragment was revealed in two sequences from PR China and Hong Kong SAR and five additional related samples from the region. Statistical parsimony and Bayesian phylogenetic analysis provide evidence that these outbreaks in East Asia were generated by two independent introductions of the O/SEA/Mya-98 lineage sometime between August 2008 and March 2010. The rapid emergence of these viruses from Southeast Asia highlights the importance of adopting approaches to closely monitor the spread of this lineage that now poses a threat to livestock industries in other regions.
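
    The nucleotide and amino acid identity figures quoted above are pairwise percentages over aligned sequences. A minimal sketch of such a calculation follows. The abstract does not say how gaps were treated, so skipping any column that contains a gap is an assumption, as are the function name and the toy sequences.

```python
def percent_identity(a: str, b: str) -> float:
    """Percent identity between two aligned sequences of equal length.

    Columns where either sequence has a gap ('-') are skipped (an assumed
    convention; other tools count gaps as mismatches).
    """
    if len(a) != len(b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for x, y in zip(a, b):
        if x == "-" or y == "-":
            continue
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable columns")
    return 100.0 * matches / compared


# Illustrative toy alignment: one mismatch in ten compared columns.
print(percent_identity("ACGTACGTAC", "ACGTTCGTAC"))  # 90.0
```

    Because the choice of gap convention changes the denominator, identity values from different tools are only comparable when the convention is the same.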

  15. Revealing glacier flow and surge dynamics from animated satellite image sequences: examples from the Karakoram

    NASA Astrophysics Data System (ADS)

    Paul, F.

    2015-11-01

    Although animated images are very popular on the internet, they have so far found only limited use for glaciological applications. With long time series of satellite images becoming increasingly available and glaciers being well recognized for their rapid changes and variable flow dynamics, animated sequences of multiple satellite images reveal glacier dynamics in a time-lapse mode, making the otherwise slow changes of glacier movement visible and understandable to the wider public. For this study, animated image sequences were created for four regions in the central Karakoram mountain range over a 25-year time period (1990-2015) from freely available image quick-looks of orthorectified Landsat scenes. The animations play automatically in a web browser and reveal highly complex patterns of glacier flow and surge dynamics that are difficult to obtain by other methods. In contrast to other regions, surging glaciers in the Karakoram are often small (10 km² or less), steep, debris-free, and advance for several years to decades at relatively low annual rates (about 100 m a⁻¹). These characteristics overlap with those of non-surge-type glaciers, making a clear identification difficult. However, as in other regions, the surging glaciers in the central Karakoram also show sudden increases of flow velocity and mass waves travelling down glacier. The surges of individual glaciers are generally out of phase, indicating a limited climatic control on their dynamics. On the other hand, nearly all other glaciers in the region are either stable or slightly advancing, indicating balanced or even positive mass budgets over the past few decades.

  16. cDNA sequences and organization of IgM heavy chain genes in two holostean fish.

    PubMed

    Wilson, M R; van Ravenstein, E; Miller, N W; Clem, L W; Middleton, D L; Warr, G W

    1995-01-01

    Immunoglobulin M heavy chain (mu) sequences of two holostean fish, the bowfin, Amia calva, and the longnose gar, Lepisosteus osseus, were amplified from spleen mRNA by RACE-PCR, cloned, and sequenced. Each mu chain showed the conserved four constant domain structure typical of a secreted mu chain. Southern blot analyses with specific heavy chain variable (VH) and constant (CH) region probes suggest that both fish possess an IgH locus that resembles that of the teleosts, amphibians, and mammals in its organization. The overall sequence similarity of gar and bowfin mu chains was 60% and 48% at the nucleotide and amino acid levels, respectively, while similarity to the mu chains of teleosts and elasmobranchs was lower. The bowfin mu chain possesses a distinctive proline-rich sequence at the C mu 1/C mu 2 boundary; a shorter proline-rich sequence is present at this position in the gar mu chain. Both gar and bowfin show, in their C mu 4 sequences, motifs that could serve as cryptic splice donor sites for the production of mRNA encoding the membrane-bound form of the mu chains, and the bowfin also shows a potential cryptic splice donor site in the C mu 3 exon.

  17. Foreign Plastid Sequences in Plant Mitochondria are Frequently Acquired Via Mitochondrion-to-Mitochondrion Horizontal Transfer

    PubMed Central

    Gandini, C. L.; Sanchez-Puerta, M. V.

    2017-01-01

    Angiosperm mitochondrial genomes (mtDNA) exhibit variable quantities of alien sequences. Many of these sequences are acquired by intracellular gene transfer (IGT) from the plastid. In addition, frequent events of horizontal gene transfer (HGT) between mitochondria of different species also contribute to their expanded genomes. In contrast, alien sequences are rarely found in plastid genomes. Most of the plant-to-plant HGT events involve mitochondrion-to-mitochondrion transfers. Occasionally, foreign sequences in mtDNAs are plastid-derived (MTPT), raising questions about their origin, frequency, and mechanism of transfer. The rising number of complete mtDNAs allowed us to address these questions. We identified 15 new foreign MTPTs, increasing significantly the number of those previously reported. One out of five of the angiosperm species analyzed contained at least one foreign MTPT, suggesting a remarkable frequency of HGT among plants. By analyzing the flanking regions of the foreign MTPTs, we found strong evidence for mt-to-mt transfers in 65% of the cases. We hypothesize that plastid sequences were initially acquired by the native mtDNA via IGT and then transferred to a distantly-related plant via mitochondrial HGT, rather than directly from a foreign plastid to the mitochondrial genome. Finally, we describe three novel putative cases of mitochondrial-derived sequences among angiosperm plastomes. PMID:28262720

  18. Reconstruction of structural evolution in the trnL intron P6b loop of symbiotic Nostoc (Cyanobacteria).

    PubMed

    Olsson, Sanna; Kaasalainen, Ulla; Rikkinen, Jouko

    2012-02-01

    In this study we reconstruct the structural evolution of the hyper-variable P6b region of the group I trnLeu intron in a monophyletic group of lichen-symbiotic Nostoc strains and establish it as a useful marker in the phylogenetic analysis of these organisms. The studied cyanobacteria occur as photosynthetic and/or nitrogen-fixing symbionts in lichen species of the diverse Nephroma guild. Phylogenetic analyses and secondary structure reconstructions are used to improve the understanding of the replication mechanisms in the P6b stem-loop and to explain the observed distribution patterns of indels. The variants of the P6b region in the Nostoc clade studied consist of different combinations of five sequence modules. The distribution of indels together with the ancestral character reconstruction performed enables the interpretation of the evolution of each sequence module. Our results indicate that the indel events are usually associated with single nucleotide changes in the P6b region and have occurred several times independently. In spite of their homoplasy, they provide phylogenetic information for closely related taxa. Thus we recognize that features of the P6b region can be used as molecular markers for species identification and phylogenetic studies involving symbiotic Nostoc cyanobacteria.

  19. Epicardial distribution of ST segment and T wave changes produced by stimulation of intrathoracic ganglia or cardiopulmonary nerves in dogs.

    PubMed

    Savard, P; Cardinal, R; Nadeau, R A; Armour, J A

    1991-06-01

    Sixty-three ventricular epicardial electrograms were recorded simultaneously in 8 atropinized dogs during stimulation of acutely decentralized intrathoracic autonomic ganglia or cardiopulmonary nerves. Three variables were measured: (1) isochronal maps representing the epicardial activation sequence, (2) maps depicting changes in areas under the QRS complex and T wave (regional inhomogeneity of repolarization), and (3) local and total QT intervals. Neural stimulations did not alter the activation sequence but induced changes in the magnitude and polarity of the ST segments and T waves as well as in QRST areas. Stimulation of the same neural structure in different dogs induced electrical changes with different amplitudes and in different regions of the ventricles, except for the ventral lateral cardiopulmonary nerve which usually affected the dorsal wall of the left ventricle. Greatest changes occurred when the right recurrent, left intermediate medial, left caudal pole, left ventral lateral cardiopulmonary nerves and stellate ganglia were stimulated. Local QT durations either decreased or did not change, whereas total QT duration as measured using a root-mean-square signal did not change, indicating the regional nature of repolarization changes. Taken together, these data indicate that intrathoracic efferent sympathetic neurons can induce regional inhomogeneity of repolarization without prolonging the total QT interval.

  20. Effects of the Laramide Structures on the Regional Distribution of Tight-Gas Sandstone in the Upper Mesaverde Group, Uinta Basin, Utah

    NASA Astrophysics Data System (ADS)

    Sitaula, R. P.; Aschoff, J.

    2013-12-01

    Regional-scale sequence stratigraphic correlation, well log analysis, syntectonic unconformity mapping, isopach maps, and depositional environment maps of the upper Mesaverde Group (UMG) in the Uinta basin, Utah suggest higher accommodation in the northeastern part (Natural Buttes area) and local development of lacustrine facies due to increased subsidence caused by uplift of the San Rafael Swell (SRS) in the southern part and the Uinta Uplift in the northern part. Recently discovered lacustrine facies in the Natural Buttes area are completely different from the dominant fluvial facies in outcrops along the Book Cliffs and could have implications for a significant amount of tight-gas sand production from this area. Data used for sequence stratigraphic correlation, isopach maps and depositional environmental maps include > 100 well logs, 20 stratigraphic profiles, 35 sandstone thin sections and 10 outcrop-based gamma ray profiles. Seven 4th order depositional sequences (~0.5 my duration) are identified and correlated within the UMG. Correlation was constructed using a combination of fluvial facies and stacking patterns in outcrops, chert-pebble conglomerates and tidally influenced strata. These surfaces were extrapolated into the subsurface by matching GR profiles. GR well logs and the core log of the Natural Buttes area show intervals of coarsening upward patterns suggesting possible lacustrine intervals that might contain high TOC. Locally, younger sequences are completely truncated across the SRS whereas older sequences are truncated and thinned toward the SRS. The cycles of truncation and thinning represent phases of SRS uplift. Thinning possibly related to the Uinta Uplift is also observed in the northwestern part. Paleocurrents are consistent with the interpretation of periodic segmentation and deflection of sedimentation. Regional paleocurrents are generally E-NE-directed in Sequences 1-4, and N-directed in Sequences 5-7. From isopach maps and paleocurrent direction it can be interpreted that uplift of the SRS changed the route of sediment supply from west to southwest. Locally, paleocurrents are highly variable near the SRS, further suggesting that the UMG basin-fill was partitioned by uplift of the SRS. Sandstone composition analysis also suggests that uplift of the SRS caused the source rocks of the upper sequences to differ from those of the lower sequences. In conclusion, we suggest that the Uinta basin was episodically partitioned during the deposition of the UMG due to uplift of Laramide structures in the basin and that accommodation was localized in the northeastern part. Understanding of structural controls on accommodation, sedimentation patterns and depositional environments will aid prediction of the best-producing gas reservoirs.
