unmutated ighv sequence: Topics by Science.gov

Sample records for unmutated ighv sequence

Change in IgHV Mutational Status of CLL Suggests Origin From Multiple Clones.

PubMed

Osman, Afaf; Gocke, Christopher D; Gladstone, Douglas E

2017-02-01

Fluorescence in situ hybridization and immunoglobulin (Ig) heavy-chain variable-region (IgHV) mutational status are used to predict outcome in chronic lymphocytic leukemia (CLL). Although DNA aberrations change over time, IgHV sequences and mutational status are considered stable. In a retrospective review, 409 CLL patients, between 2008 and 2015, had IgHV analysis: 56 patients had multiple analyses performed. Seven patients' IgHV results changed: 2 from unmutated to mutated and 5 from mutated to unmutated IgHV sequence. Three concurrently changed their variable heavy-chain sequence. Secondary to allelic exclusion, 2 of the new variable heavy chains produced were biologically nonplausible. The existence of these new nonplausible heavy-chain variable regions suggests either the CLL cancer stem-cell maintains the ability to rearrange a previously silenced IgH allele or more likely that the cancer stem-cell produced at least 2 subclones, suggesting that the CLL cancer stem cell exists before the process of allelic exclusion occurs. Copyright © 2016 Elsevier Inc. All rights reserved.
Intraclonal Cell Expansion and Selection Driven by B Cell Receptor in Chronic Lymphocytic Leukemia

PubMed Central

Colombo, Monica; Cutrona, Giovanna; Reverberi, Daniele; Fabris, Sonia; Neri, Antonino; Fabbi, Marina; Quintana, Giovanni; Quarta, Giovanni; Ghiotto, Fabio; Fais, Franco; Ferrarini, Manlio

2011-01-01

The mutational status of the immunoglobulin heavy-chain variable region (IGHV) genes utilized by chronic lymphocytic leukemia (CLL) clones defines two disease subgroups. Patients with unmutated IGHV have a more aggressive disease and a worse outcome than patients with cells having somatic IGHV gene mutations. Moreover, up to 30% of the unmutated CLL clones exhibit very similar or identical B cell receptors (BcR), often encoded by the same IG genes. These “stereotyped” BcRs have been classified into defined subsets. The presence of an IGHV gene somatic mutation and the utilization of a skewed gene repertoire compared with normal B cells together with the expression of stereotyped receptors by unmutated CLL clones may indicate stimulation/selection by antigenic epitopes. This antigenic stimulation may occur prior to or during neoplastic transformation, but it is unknown whether this stimulation/selection continues after leukemogenesis has ceased. In this study, we focused on seven CLL cases with stereotyped BcR Subset #8 found among a cohort of 700 patients; in six, the cells expressed IgG and utilized IGHV4-39 and IGKV1-39/IGKV1D-39 genes, as reported for Subset #8 BcR. One case exhibited special features, including expression of IgM or IgG by different subclones consequent to an isotype switch, allelic inclusion at the IGH locus in the IgM-expressing cells and a particular pattern of cytogenetic lesions. Collectively, the data indicate a process of antigenic stimulation/selection of the fully transformed CLL cells leading to the expansion of the Subset #8 IgG-bearing subclone. PMID:21541442
14q deletions are associated with trisomy 12, NOTCH1 mutations and unmutated IGHV genes in chronic lymphocytic leukemia and small lymphocytic lymphoma.

PubMed

Cosson, Adrien; Chapiro, Elise; Belhouachi, Nabila; Cung, Hong-Anh; Keren, Boris; Damm, Frederik; Algrin, Caroline; Lefebvre, Christine; Fert-Ferrer, Sandra; Luquet, Isabelle; Gachard, Nathalie; Mugneret, Francine; Terre, Christine; Collonge-Rame, Marie-Agnes; Michaux, Lucienne; Radford-Weiss, Isabelle; Talmant, Pascaline; Veronese, Lauren; Nadal, Nathalie; Struski, Stephanie; Barin, Carole; Helias, Catherine; Lafage, Marina; Lippert, Eric; Auger, Nathalie; Eclache, Virginie; Roos-Weil, Damien; Leblond, Veronique; Settegrana, Catherine; Maloum, Karim; Davi, Frederic; Merle-Beral, Helene; Lesty, Claude; Nguyen-Khac, Florence

2014-08-01

Deletions of the long arm of chromosome 14 [del(14q)] are rare but recurrently observed in mature B-cell neoplasms, particularly in chronic lymphocytic leukemia (CLL). To further characterize this aberration, we studied 81 cases with del(14q): 54 of CLL and 27 of small lymphocytic lymphoma (SLL), the largest reported series to date. Using karyotype and fluorescence in situ hybridization (FISH), the most frequent additional abnormality was trisomy 12 (tri12), observed in 28/79 (35%) cases, followed by del13q14 (12/79, 15%), delTP53 (11/80, 14%), delATM (5/79, 6%), and del6q21 (3/76, 4%). IGHV genes were unmutated in 41/53 (77%) patients, with a high frequency of IGHV1-69 (21/52, 40%). NOTCH1 gene was mutated in 14/45 (31%) patients. There was no significant difference in cytogenetic and molecular abnormalities between CLL and SLL. Investigations using FISH and SNP-array demonstrated the heterogeneous size of the 14q deletions. However, a group with the same del(14)(q24.1q32.33) was identified in 48% of cases. In this group, tri12 (P = 0.004) and NOTCH1 mutations (P = 0.02) were significantly more frequent than in the other patients. In CLL patients with del(14q), median treatment-free survival (TFS) was 27 months. In conclusion, del(14q) is associated with tri12 and with pejorative prognostic factors: unmutated IGHV genes (with over-representation of the IGHV1-69 repertoire), NOTCH1 mutations, and a short TFS. © 2014 Wiley Periodicals, Inc.
The Number of Overlapping AID Hotspots in Germline IGHV Genes Is Inversely Correlated with Mutation Frequency in Chronic Lymphocytic Leukemia.

PubMed

Yuan, Chaohui; Chu, Charles C; Yan, Xiao-Jie; Bagnara, Davide; Chiorazzi, Nicholas; MacCarthy, Thomas

2017-01-01

The targeting of mutations by Activation-Induced Deaminase (AID) is a key step in generating antibody diversity at the Immunoglobulin (Ig) loci but is also implicated in B-cell malignancies such as chronic lymphocytic leukemia (CLL). AID has previously been shown to preferentially deaminate WRC (W = A/T, R = A/G) hotspots. WGCW sites, which contain an overlapping WRC hotspot on both DNA strands, mutate at much higher frequency than single hotspots. Human Ig heavy chain (IGHV) genes differ in terms of WGCW numbers, ranging from 4 for IGHV3-48*03 to as many as 12 in IGHV1-69*01. An absence of V-region mutations in CLL patients ("IGHV unmutated", or U-CLL) is associated with a poorer prognosis compared to "IGHV mutated" (M-CLL) patients. The reasons for this difference are still unclear, but it has been noted that particular IGHV genes associate with U-CLL vs M-CLL. For example, patients with IGHV1-69 clones tend to be U-CLL with a poor prognosis, whereas patients with IGHV3-30 tend to be M-CLL and have a better prognosis. Another distinctive feature of CLL is that ~30% of (mostly poor prognosis) patients can be classified into "stereotyped" subsets, each defined by HCDR3 similarity, suggesting selection, possibly for a self-antigen. We analyzed >1000 IGHV genes from CLL patients and found a highly significant statistical relationship between the number of WGCW hotspots in the germline V-region and the observed mutation frequency in patients. However, paradoxically, this correlation was inverse, with V-regions with more WGCW hotspots being less likely to be mutated, i.e., more likely to be U-CLL. The number of WGCW hotspots, in particular, is more strongly correlated with mutation frequency than either non-overlapping (WRC) hotspots or more general models of mutability derived from somatic hypermutation data. Furthermore, this correlation is not observed in sequences from the B cell repertoires of normal individuals and those with autoimmune diseases.
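The hotspot definitions in this abstract (WRC with W = A/T and R = A/G, and the overlapping WGCW site) can be illustrated with a short motif counter. This is only a sketch under stated assumptions: the function names are hypothetical, the input is assumed to be a plain nucleotide string, and WRC is counted on the given strand only. It is not the authors' analysis pipeline.

```python
import re

# Lookahead patterns so that overlapping occurrences are all counted.
# W = A/T, R = A/G, as defined in the abstract.
WRC_PATTERN = re.compile(r"(?=[AT][AG]C)")
WGCW_PATTERN = re.compile(r"(?=[AT]GC[AT])")


def count_wrc(seq: str) -> int:
    """Count WRC hotspots on the given strand (hypothetical helper)."""
    return sum(1 for _ in WRC_PATTERN.finditer(seq.upper()))


def count_wgcw(seq: str) -> int:
    """Count WGCW sites (hypothetical helper).

    WGCW is its own reverse complement, so each site carries a WRC
    motif on both strands (WGC on this strand, WGC on the opposite one).
    """
    return sum(1 for _ in WGCW_PATTERN.finditer(seq.upper()))


if __name__ == "__main__":
    toy = "AGCAGCTTTGGC"  # toy sequence, not a real IGHV germline
    print("WRC :", count_wrc(toy))
    print("WGCW:", count_wgcw(toy))
```

Applied to germline V-region sequences, a counter like this would yield the per-gene WGCW numbers that the study relates to mutation frequency.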
Major prognostic value of complex karyotype in addition to TP53 and IGHV mutational status in first-line chronic lymphocytic leukemia.

PubMed

Le Bris, Yannick; Struski, Stéphanie; Guièze, Romain; Rouvellat, Caroline; Prade, Naïs; Troussard, Xavier; Tournilhac, Olivier; Béné, Marie C; Delabesse, Eric; Ysebaert, Loïc

2017-12-01

Chronic lymphocytic leukemia (CLL) is a lymphoproliferative disorder of remarkable heterogeneity as demonstrated by cytogenetics and molecular analyses. Complex karyotype (CK), TP53 deletions and/or mutations (TP53 disruption), IGHV mutational status, and, more recently, recurrent somatic mutations have been identified as prognostic markers in CLL. In a cohort of 110 patients with CLL treated with first-line fludarabine, cyclophosphamide, and rituximab treatment compared with 33 untreated (watch and wait) patients with CLL, we report more frequent complex karyotypes (34 vs 15%; P = .05), unmutated IGHV (70 vs 21%; P < .0001), ATM deletion (25 vs 6%, P = .02), and NOTCH mutation (3 vs 17%, P = .04). Among treated patients, 39 relapsed during the follow-up period. These patients were characterized before treatment by a higher incidence of trisomy 12 (38 vs 11%, P < .001) and TP53 disruption (31 vs 4%, P = .0002). A significantly shorter 5-year overall survival was found for treated patients with CK (72.4 vs 85.8%; P = .007), unmutated IGHV (70 vs 100%; P = .04), or TP53 disruption (55.7 vs 82.7%; P < .0001). Three risk groups were defined based on the status of TP53 disruption or unmutated IGHV, which differed significantly in terms of 5-year overall survival. Moreover, the presence of CK impacted pejoratively 5-year overall survival and progression-free survival in all these 3 groups. Conventional karyotyping therefore appears to be of value, CK being an additional factor, undetectable in classical FISH, in patients with CLL at the stage when therapy becomes required. Copyright © 2016 John Wiley & Sons, Ltd.
An extensive molecular cytogenetic characterization in high-risk chronic lymphocytic leukemia identifies karyotype aberrations and TP53 disruption as predictors of outcome and chemorefractoriness

PubMed Central

Cavallari, Maurizio; Quaglia, Francesca Maria; Lista, Enrico; Urso, Antonio; Guardalben, Emanuele; Martinelli, Sara; Saccenti, Elena; Bassi, Cristian; Lupini, Laura; Bardi, Maria Antonella; Volta, Eleonora; Tammiso, Elisa; Melandri, Aurora; Negrini, Massimo

2017-01-01

We investigated whether karyotype analysis and mutational screening by next generation sequencing could predict outcome in 101 newly diagnosed chronic lymphocytic leukemia patients with high-risk features, as defined by the presence of unmutated IGHV gene and/or 11q22/17p13 deletion by FISH and/or TP53 mutations. Cytogenetic analysis showed favorable findings (normal karyotype and isolated 13q14 deletion) in 30 patients, unfavorable (complex karyotype and/or 17p13/11q22 deletion) in 34 cases and intermediate (all other abnormalities) in 36 cases. A complex karyotype was present in 21 patients. Mutations were detected in 56 cases and were associated with unmutated IGHV status (p = 0.040) and complex karyotype (p = 0.047). TP53 disruption (i.e. TP53 mutations and/or 17p13 deletion by FISH) correlated with the presence of ≥ 2 mutations (p = 0.001) and a complex karyotype (p = 0.012). By multivariate analysis, an advanced Binet stage (p < 0.001) and an unfavorable karyotype (p = 0.001) predicted a shorter time to first treatment. TP53 disruption (p = 0.019) and the unfavorable karyotype (p = 0.028) predicted a worse overall survival. A shorter time to chemorefractoriness was associated with TP53 disruption (p = 0.001) and unfavorable karyotype (p = 0.025). Patients with both unfavorable karyotype and TP53 disruption presented a dismal outcome (median overall survival and time to chemorefractoriness of 28.7 and 15.0 months, respectively). In conclusion, karyotype analysis refines risk stratification in high-risk CLL patients and could identify a subset of patients with highly unfavorable outcome requiring alternative treatments. PMID:28427204
Chronic lymphocytic leukemia: A prognostic model comprising only two biomarkers (IGHV mutational status and FISH cytogenetics) separates patients with different outcome and simplifies the CLL-IPI.

PubMed

Delgado, Julio; Doubek, Michael; Baumann, Tycho; Kotaskova, Jana; Molica, Stefano; Mozas, Pablo; Rivas-Delgado, Alfredo; Morabito, Fortunato; Pospisilova, Sarka; Montserrat, Emili

2017-04-01

Rai and Binet staging systems are important to predict the outcome of patients with chronic lymphocytic leukemia (CLL) but do not reflect the biologic diversity of the disease nor predict response to therapy, which ultimately shape patients' outcome. We devised a biomarkers-only CLL prognostic system based on the two most important prognostic parameters in CLL (i.e., IGHV mutational status and fluorescence in situ hybridization [FISH] cytogenetics), separating three different risk groups: (1) low-risk (mutated IGHV + no adverse FISH cytogenetics [del(17p), del(11q)]); (2) intermediate-risk (either unmutated IGHV or adverse FISH cytogenetics) and (3) high-risk (unmutated IGHV + adverse FISH cytogenetics). In 524 unselected subjects with CLL, the 10-year overall survival was 82% (95% CI 76%-88%), 52% (45%-62%), and 27% (17%-42%) for the low-, intermediate-, and high-risk groups, respectively. Patients with low-risk comprised around 50% of the series and had a life expectancy comparable to the general population. The prognostic model was fully validated in two independent cohorts, including 417 patients representative of general CLL population and 337 patients with Binet stage A CLL. The model had a similar discriminatory value as the CLL-IPI. Moreover, it applied to all patients with CLL independently of age, and separated patients with different risk within Rai or Binet clinical stages. The biomarkers-only CLL prognostic system presented here simplifies the CLL-IPI and could be useful in daily practice and to stratify patients in clinical trials. © 2017 Wiley Periodicals, Inc.
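The three-group rule described in this abstract can be written as a small function. This is a minimal sketch: the function and parameter names are hypothetical, and `adverse_fish` is assumed to mean del(17p) or del(11q) was found, as listed in the abstract.

```python
def biomarker_risk_group(ighv_unmutated: bool, adverse_fish: bool) -> str:
    """Assign the biomarkers-only risk group described in the abstract.

    ighv_unmutated: True if the IGHV gene is unmutated.
    adverse_fish:   True if FISH shows del(17p) or del(11q).
    Both names are illustrative, not taken from the paper.
    """
    adverse_count = int(ighv_unmutated) + int(adverse_fish)
    # 0 adverse markers -> low, exactly 1 -> intermediate, both -> high
    return ("low", "intermediate", "high")[adverse_count]


if __name__ == "__main__":
    for unmutated in (False, True):
        for fish in (False, True):
            group = biomarker_risk_group(unmutated, fish)
            print(f"unmutated={unmutated!s:5} adverse_fish={fish!s:5} -> {group}")
```

Counting adverse markers rather than nesting conditions keeps the mapping to the three groups explicit: zero, one, or two adverse findings.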
Mutation Pattern of Paired Immunoglobulin Heavy and Light Variable Domains in Chronic Lymphocytic Leukemia B Cells

PubMed Central

Ghiotto, Fabio; Marcatili, Paolo; Tenca, Claudya; Calevo, Maria Grazia; Yan, Xiao-Jie; Albesiano, Emilia; Bagnara, Davide; Colombo, Monica; Cutrona, Giovanna; Chu, Charles C; Morabito, Fortunato; Bruno, Silvia; Ferrarini, Manlio; Tramontano, Anna; Fais, Franco; Chiorazzi, Nicholas

2011-01-01

B-cell chronic lymphocytic leukemia (CLL) patients display leukemic clones bearing either germline or somatically mutated immunoglobulin heavy variable (IGHV) genes. Most information on CLL immunoglobulins (Igs), such as the definition of stereotyped B-cell receptors (BCRs), was derived from germline unmutated Igs. In particular, detailed studies on the distribution and nature of mutations in paired heavy- and light-chain domains of CLL clones bearing mutated Igs are lacking. To address the somatic hyper-mutation dynamics of CLL Igs, we analyzed the mutation pattern of paired IGHV–diversity-joining (IGHV-D-J) and immunoglobulin kappa/lambda variable-joining (IGK/LV-J) rearrangements of 193 leukemic clones that displayed ≥2% mutations in at least one of the two immunoglobulin variable (IGV) genes (IGHV and/or IGK/LV). The relationship between the mutation frequency in IGHV and IGK/LV complementarity determining regions (CDRs) and framework regions (FRs) was evaluated by correlation analysis. Replacement (R) mutation frequency within IGK/LV chain CDRs correlated significantly with mutation frequency of paired IGHV CDRs in λ but not κ isotype CLL clones. CDRs of IGKV-J rearrangements displayed a lower percentage of R mutations than IGHVs. The frequency/pattern of mutations in kappa CLL Igs differed also from that in κ-expressing normal B cells described in the literature. Instead, the mutation frequency within the FRs of IGHV and either IGKV or IGLV was correlated. Notably, the amount of diversity introduced by replaced amino acids was comparable between IGHVs and IGKVs. The data indicate a different mutation pattern between κ and λ isotype CLL clones and suggest an antigenic selection that, in κ samples, operates against CDR variation. PMID:21785810
Clonal evolution in chronic lymphocytic leukemia: analysis of correlations with IGHV mutational status, NOTCH1 mutations and clinical significance.

PubMed

López, Cristina; Delgado, Julio; Costa, Dolors; Villamor, Neus; Navarro, Alba; Cazorla, Maite; Gómez, Cándida; Arias, Amparo; Muñoz, Concha; Cabezas, Sandra; Baumann, Tycho; Rozman, María; Aymerich, Marta; Colomer, Dolors; Pereira, Arturo; Cobo, Francesc; López-Guillermo, Armando; Campo, Elías; Carrió, Ana

2013-10-01

Chronic lymphocytic leukemia (CLL) is a lymphoproliferative disorder characterized by a highly variable clinical course. The most common chromosomal abnormalities in CLL, using conventional and molecular cytogenetics, are trisomy 12, del(13)(q14), del(11)(q22-23), del(17)(p13), and del(6)(q21). Whereas prognostic markers such as IGHV mutational status remain stable during the course of the disease, chromosomal aberrations may be acquired over time. The aim of this study was to determine the incidence and biological significance of clonal evolution (CE) using conventional and molecular cytogenetics and its relationship with prognostic markers such as CD38, ZAP70, and the mutational status of IGHV and NOTCH1. One hundred and forty-three untreated CLL patients were included in the study. The median time interval between analyses was 32 months (range 6-156 months). Forty-seven patients (33%) had CE as evidenced by detection of new cytogenetic abnormalities during follow-up. CE was not correlated with high expression of ZAP70, unmutated IGHV genes or NOTCH1 mutations. Multivariate analysis revealed that CE and IGHV mutation status had a significant impact on TFS. The combination of conventional and molecular cytogenetics increased the detection of CE, this phenomenon probably being a reflection of genomic instability and conferring a more aggressive clinical course. Copyright © 2013 Wiley Periodicals, Inc.
Integrated CLL Scoring System, a New and Simple Index to Predict Time to Treatment and Overall Survival in Patients With Chronic Lymphocytic Leukemia.

PubMed

Visentin, Andrea; Facco, Monica; Frezzato, Federica; Castelli, Monica; Trimarco, Valentina; Martini, Veronica; Gattazzo, Cristina; Severin, Filippo; Chiodin, Giorgia; Martines, Annalisa; Bonaldi, Laura; Gianesello, Ilaria; Pagnin, Elisa; Boscaro, Elisa; Piazza, Francesco; Zambello, Renato; Semenzato, Gianpietro; Trentin, Livio

2015-10-01

Several prognostic factors have been identified to predict the outcome of patients with chronic lymphocytic leukemia (CLL), but only a few studies have analyzed several markers together. Taking advantage of a population of 608 patients, we identified the strongest prognostic markers of survival and, subsequently, in a cohort of 212 patients we integrated data of cytogenetic lesions, IGHV mutational status, and CD38 expression in a new and easy scoring system we called the integrated CLL scoring system (ICSS). ICSS defines 3 groups of risk: (1) low risk (patients with 13q(-) or normal fluorescence in-situ hybridization analysis results, mutated IGHV, and CD38); (2) high risk (all 11q(-) or 17p(-) patients and/or all unmutated IGHV and CD38(+) patients); and (3) intermediate risk (all remaining patients). Using only these 3 already available prognostic factors, we were able to properly redefine patients and better predict the clinical course of the disease. ICSS could become a useful tool for CLL patients' management. Copyright © 2015 Elsevier Inc. All rights reserved.
CLL Cells Respond to B-Cell Receptor Stimulation with a MicroRNA/mRNA Signature Associated with MYC Activation and Cell Cycle Progression

PubMed Central

Pede, Valerie; Rombout, Ans; Vermeire, Jolien; Naessens, Evelien; Mestdagh, Pieter; Robberecht, Nore; Vanderstraeten, Hanne; Van Roy, Nadine; Vandesompele, Jo; Speleman, Frank; Philippé, Jan; Verhasselt, Bruno

2013-01-01

Chronic lymphocytic leukemia (CLL) is a disease with variable clinical outcome. Several prognostic factors such as the immunoglobulin heavy chain variable genes (IGHV) mutation status are linked to the B-cell receptor (BCR) complex, supporting a role for triggering the BCR in vivo in the pathogenesis. The miRNA profile upon stimulation and correlation with IGHV mutation status is however unknown. To evaluate the transcriptional response of peripheral blood CLL cells upon BCR stimulation in vitro, miRNA and mRNA expression was measured using hybridization arrays and qPCR. We found both IGHV mutated and unmutated CLL cells to respond with increased expression of MYC and other genes associated with BCR activation, and a phenotype of cell cycle progression. Genome-wide expression studies showed hsa-miR-132-3p/hsa-miR-212 miRNA cluster induction associated with a set of downregulated genes, enriched for genes modulated by BCR activation and amplified by Myc. We conclude that BCR triggering of CLL cells induces a transcriptional response of genes associated with BCR activation, enhanced cell cycle entry and progression and suggest that part of the transcriptional profiles linked to IGHV mutation status observed in isolated peripheral blood are not cell intrinsic but rather secondary to in vivo BCR stimulation. PMID:23560086
Targeted next-generation sequencing in chronic lymphocytic leukemia: a high-throughput yet tailored approach will facilitate implementation in a clinical setting.

PubMed

Sutton, Lesley-Ann; Ljungström, Viktor; Mansouri, Larry; Young, Emma; Cortese, Diego; Navrkalova, Veronika; Malcikova, Jitka; Muggen, Alice F; Trbusek, Martin; Panagiotidis, Panagiotis; Davi, Frederic; Belessi, Chrysoula; Langerak, Anton W; Ghia, Paolo; Pospisilova, Sarka; Stamatopoulos, Kostas; Rosenquist, Richard

2015-03-01

Next-generation sequencing has revealed novel recurrent mutations in chronic lymphocytic leukemia, particularly in patients with aggressive disease. Here, we explored targeted re-sequencing as a novel strategy to assess the mutation status of genes with prognostic potential. To this end, we utilized HaloPlex targeted enrichment technology and designed a panel including nine genes: ATM, BIRC3, MYD88, NOTCH1, SF3B1 and TP53, which have been linked to the prognosis of chronic lymphocytic leukemia, and KLHL6, POT1 and XPO1, which are less characterized but were found to be recurrently mutated in various sequencing studies. A total of 188 chronic lymphocytic leukemia patients with poor prognostic features (unmutated IGHV, n=137; IGHV3-21 subset #2, n=51) were sequenced on the HiSeq 2000 and data were analyzed using well-established bioinformatics tools. Using a conservative cutoff of 10% for the mutant allele, we found that 114/180 (63%) patients carried at least one mutation, with mutations in ATM, BIRC3, NOTCH1, SF3B1 and TP53 accounting for 149/177 (84%) of all mutations. We selected 155 mutations for Sanger validation (variant allele frequency, 10-99%) and 93% (144/155) of mutations were confirmed; notably, all 11 discordant variants had a variant allele frequency between 11-27%, hence at the detection limit of conventional Sanger sequencing. Technical precision was assessed by repeating the entire HaloPlex procedure for 63 patients; concordance was found for 77/82 (94%) mutations. In summary, this study demonstrates that targeted next-generation sequencing is an accurate and reproducible technique potentially suitable for routine screening, eventually as a stand-alone test without the need for confirmation by Sanger sequencing. Copyright© Ferrata Storti Foundation.
Clinical significance of productive immunoglobulin heavy chain gene rearrangements in childhood acute lymphoblastic leukemia.

PubMed

Katsibardi, Katerina; Braoudaki, Maria; Papathanasiou, Chrissa; Karamolegou, Kalliopi; Tzortzatou-Stathopoulou, Fotini

2011-09-01

We analyzed the CDR3 region of 80 children with B-cell acute lymphoblastic leukemia (B-ALL) using the ImMunoGeneTics Information System and JOINSOLVER. In total, 108 IGH@ rearrangements were analyzed. Most of them (75.3%) were non-productive. IGHV@ segments proximal to IGHD-IGHJ@ were preferentially rearranged (45.3%). Increased utilization of IGHV3 segments IGHV3-13 (11.3%) and IGHV3-15 (9.3%), IGHD3 (30.5%), and IGHJ4 (34%) was noted. In pro-B ALL more frequent were IGHV3-11 (33.3%) and IGHV6-1 (33.3%), IGHD2-21 (50%), IGHJ4 (50%), and IGHJ6 (50%) segments. Shorter CDR3 length was observed in IGHV@6, IGHD7, and IGHJ1 segments, whereas increased CDR3 length was related to IGHV3, IGHD2, and IGHJ4 segments. Increased risk of relapse was found in patients with productive sequences. Specifically, the relapse-free survival rate at 5 years in patients with productive sequences at diagnosis was 75% (standard error [SE] ±9%), whereas in patients with non-productive sequences it was 97% (SE ±1.92%) (p-value =0.0264). Monoclonality and oligoclonality were identified in 81.2% and 18.75% cases at diagnosis, respectively. Sequence analysis revealed IGHV@ to IGHDJ joining only in 6.6% cases with oligoclonality. The majority (75%) of relapsed patients had monoclonal IGH@ rearrangements. The preferential utilization of IGHV@ segments proximal to IGHDJ depended on their location on the IGHV@ locus. Molecular mechanisms occurring during IGH@ rearrangement might play an essential role in childhood ALL prognosis. In our study, the productivity of the rearranged sequences at diagnosis proved to be a significant prognostic factor.
Targeting Stereotyped B Cell Receptors from Chronic Lymphocytic Leukemia Patients with Synthetic Antigen Surrogates.

PubMed

Sarkar, Mohosin; Liu, Yun; Qi, Junpeng; Peng, Haiyong; Morimoto, Jumpei; Rader, Christoph; Chiorazzi, Nicholas; Kodadek, Thomas

2016-04-01

Chronic lymphocytic leukemia (CLL) is a disease in which a single B-cell clone proliferates relentlessly in peripheral lymphoid organs, bone marrow, and blood. DNA sequencing experiments have shown that about 30% of CLL patients have stereotyped antigen-specific B-cell receptors (BCRs) with a high level of sequence homology in the variable domains of the heavy and light chains. These include many of the most aggressive cases that have IGHV-unmutated BCRs whose sequences have not diverged significantly from the germ line. This suggests a personalized therapy strategy in which a toxin or immune effector function is delivered selectively to the pathogenic B-cells but not to healthy B-cells. To execute this strategy, serum-stable, drug-like compounds able to target the antigen-binding sites of most or all patients in a stereotyped subset are required. We demonstrate here the feasibility of this approach with the discovery of selective, high affinity ligands for CLL BCRs of the aggressive, stereotyped subset 7P that cross-react with the BCRs of several CLL patients in subset 7P, but not with BCRs from patients outside this subset. © 2016 by The American Society for Biochemistry and Molecular Biology, Inc.
Progranulin Is a Novel Independent Predictor of Disease Progression and Overall Survival in Chronic Lymphocytic Leukemia

PubMed Central

Göbel, Maria; Eisele, Lewin; Möllmann, Michael; Hüttmann, Andreas; Johansson, Patricia; Scholtysik, René; Bergmann, Manuela; Busch, Raymonde; Döhner, Hartmut; Hallek, Michael; Seiler, Till; Stilgenbauer, Stephan; Klein-Hitpass, Ludger; Dührsen, Ulrich; Dürig, Jan

2013-01-01

Progranulin (Pgrn) is an 88 kDa secreted protein with pleiotropic functions including regulation of cell cycle progression, cell motility, wound repair and tumorigenesis. Using microarray-based gene expression profiling, we have recently demonstrated that the gene for Pgrn, granulin (GRN), is expressed at significantly higher levels in aggressive CD38+ZAP-70+ than in indolent CD38−ZAP-70− chronic lymphocytic leukemia (CLL) cases. Here, we measured Pgrn plasma concentrations by enzyme-linked immunosorbent assay (ELISA) in the Essen CLL cohort of 131 patients and examined Pgrn for association with established prognostic markers and clinical outcome. We found that high Pgrn plasma levels were strongly associated with adverse risk factors including unmutated IGHV status, expression of CD38 and ZAP-70, poor risk cytogenetics (11q-, 17p-) as detected by fluorescence in situ hybridization (FISH) and high Binet stage. Pgrn as well as the aforementioned risk factors were prognostic for time to first treatment and overall survival in this series. Importantly, these results could be confirmed in the independent multicentric CLL1 cohort of untreated Binet stage A patients (n = 163). Here, multivariate analysis of time to first treatment revealed that high risk Pgrn (HR = 2.06, 95%-CI = 1.13–3.76, p = 0.018), unmutated IGHV status (HR = 5.63, 95%-CI = 3.05–10.38, p<0.001), high risk as defined by the study protocol (HR = 2.06, 95%-CI = 1.09–3.89, p = 0.026) but not poor risk cytogenetics were independent prognostic markers. In summary, our results suggest that Pgrn is a novel, robust and independent prognostic marker in CLL that can be easily measured by ELISA. PMID:24009671
Immunoglobulin heavy variable (IGHV) genes and alleles: new entities, new names and implications for research and prognostication in chronic lymphocytic leukaemia.

PubMed

Xochelli, Aliki; Agathangelidis, Andreas; Kavakiotis, Ioannis; Minga, Evangelia; Sutton, Lesley Ann; Baliakas, Panagiotis; Chouvarda, Ioanna; Giudicelli, Véronique; Vlahavas, Ioannis; Maglaveras, Nikos; Bonello, Lisa; Trentin, Livio; Tedeschi, Alessandra; Panagiotidis, Panagiotis; Geisler, Christian; Langerak, Anton W; Pospisilova, Sarka; Jelinek, Diane F; Oscier, David; Chiorazzi, Nicholas; Darzentas, Nikos; Davi, Fred; Ghia, Paolo; Rosenquist, Richard; Hadzidimitriou, Anastasia; Belessi, Chrysoula; Lefranc, Marie-Paule; Stamatopoulos, Kostas

2015-01-01

Next generation sequencing studies in Homo sapiens have identified novel immunoglobulin heavy variable (IGHV) genes and alleles necessitating changes in the international ImMunoGeneTics information system (IMGT) GENE-DB and reference directories of IMGT/V-QUEST. In chronic lymphocytic leukaemia (CLL), the somatic hypermutation (SHM) status of the clonotypic rearranged IGHV gene is strongly associated with patient outcome. Correct determination of this parameter strictly depends on the comparison of the nucleotide sequence of the clonotypic rearranged IGHV gene with that of the closest germline counterpart. Consequently, changes in the reference directories could, in principle, affect the correct interpretation of the IGHV mutational status in CLL. To this end, we analyzed 8066 productive IG heavy chain (IGH) rearrangement sequences from our consortium both before and after the latest update of the IMGT/V-QUEST reference directory. Differences were identified in 405 cases (5 % of the cohort). In 291/405 sequences (71.9 %), changes concerned only the IGHV gene or allele name, whereas a change in the percent germline identity (%GI) was noted in 114/405 (28.1 %) sequences; in 50/114 (43.8 %) sequences, changes in the %GI led to a change in the mutational set. In conclusion, recent changes in the IMGT reference directories affected the interpretation of SHM in a sizeable number of IGH rearrangement sequences from CLL patients. This indicates that both physicians and researchers should consider a re-evaluation of IG sequence data, especially for those IGH rearrangement sequences that, up to date, have a GI close to 98 %, where caution is warranted.
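The dependence of mutational status on percent germline identity (%GI) can be illustrated with a toy calculation. This is only a sketch under stated assumptions: the two sequences are assumed to be already aligned and of equal length, the function names are hypothetical, and the 98 % cutoff is the threshold conventionally used in CLL (at or above 98 % identity is called unmutated), which the abstract alludes to but does not define. Real analyses rely on IMGT/V-QUEST rather than a direct string comparison.

```python
def percent_germline_identity(rearranged: str, germline: str) -> float:
    """Percent identity between two pre-aligned, equal-length sequences.

    Hypothetical helper for illustration; no alignment is performed.
    """
    if len(rearranged) != len(germline) or not germline:
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(a == b for a, b in zip(rearranged.upper(), germline.upper()))
    return 100.0 * matches / len(germline)


def mutational_status(identity_percent: float, cutoff: float = 98.0) -> str:
    """Classify by %GI; the 98% default cutoff is an assumed convention."""
    return "unmutated" if identity_percent >= cutoff else "mutated"


if __name__ == "__main__":
    germline = "ACGT" * 25          # toy 100-nt germline, not a real IGHV gene
    clone = "CA" + germline[2:]     # same sequence with 2 mismatches
    gi = percent_germline_identity(clone, germline)
    print(f"%GI = {gi:.1f} -> {mutational_status(gi)}")
```

Because the classification flips at a single threshold, a change of one germline reference allele can move a sequence with %GI near 98 % across the cutoff, which is the situation the abstract warns about.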
Risk adjusted therapy in chronic lymphocytic leukemia: a phase II cancer trials Ireland (CTRIAL-IE [ICORG 07-01]) study of fludarabine, cyclophosphamide, and rituximab therapy evaluating response adapted, abbreviated frontline therapy with FCR in non-del(17p) CLL.

PubMed

Appleby, Niamh; O'Brien, David; Quinn, Fiona M; Smyth, Liam; Kelly, Johanna; Parker, Imelda; Scott, Kathleen; Cahill, Mary R; Crotty, Gerard; Enright, Helen; Hennessy, Brian; Hodgson, Andrew; Leahy, Maeve; O'Leary, Hilary; O'Dwyer, Michael; Hayat, Amjad; Vandenberghe, Elisabeth A

2018-06-01

Minimal residual disease negative complete response (MRD-negative CR) provides an early marker for time to treatment failure (TTF) in CLL treated with fludarabine, cyclophosphamide, and rituximab (FCR). MRD was assessed after four FCR cycles (FCR4); MRD-negative CR patients discontinued treatment. Fifty-two patients (35M; 17F) were enrolled. Eighteen (18/52; 34.6%) patients reached MRD-negative CR after FCR4 and 29/52 (55.8%) were MRD-negative CR at end of treatment (EOT). Median TTF was 71.1 months (95% CI 61.3-84.1 months), with median overall survival not reached. Mutated immunoglobulin heavy chain gene rearrangements (IGHV) were associated with early MRD-negative remissions, translating into prolonged TTF. Unmutated-IGHV, mutated-SF3B1 and mutated-NOTCH1 were associated with shortened TTF. No TTF difference was observed between patients in MRD-negative CR after four versus six cycles (82.2 versus 85.3 months, p = .6306). Abbreviated FCR therapy is effective for patients achieving early MRD-negative remissions. Interim MRD assessment assists in personalizing therapy and reducing chemotherapy-associated toxicity.
Gain of the short arm of chromosome 2 (2p) is a frequent recurring chromosome aberration in untreated chronic lymphocytic leukemia (CLL) at advanced stages.

PubMed

Chapiro, Elise; Leporrier, Nathalie; Radford-Weiss, Isabelle; Bastard, Christian; Mossafa, Hossein; Leroux, Dominique; Tigaud, Isabelle; De Braekeleer, Marc; Terré, Christine; Brizard, Françoise; Callet-Bauchu, Evelyne; Struski, Stéphanie; Veronese, Lauren; Fert-Ferrer, Sandra; Taviaux, Sylvie; Lesty, Claude; Davi, Frédéric; Merle-Béral, Hélène; Bernard, Olivier A; Sutton, Laurent; Raynaud, Sophie D; Nguyen-Khac, Florence

2010-01-01

Using array-based CGH, we identified 2p gain in 22/78 (28%) untreated Binet stages B/C CLL, which was the second most frequent copy number change after 13q deletion. It never occurred as a sole abnormality and was associated with other changes (6q deletion; 1p gain). The region of 2p gain frequently included two oncogenes, REL and MYCN. All patients with gain of REL were unmutated for IGHV (p=0.03). Gain of MYCN was associated with increased mRNA expression (p=0.005), suggesting a pathogenic role for MYCN. Gain of 2p appears to be a marker of progression and may contribute to the poor prognosis. 2009 Elsevier Ltd. All rights reserved.
Chronic lymphocytic leukemia antibodies with a common stereotypic rearrangement recognize nonmuscle myosin heavy chain IIA

PubMed Central

Catera, Rosa; Hatzi, Katerina; Yan, Xiao-Jie; Zhang, Lu; Wang, Xiao Bo; Fales, Henry M.; Allen, Steven L.; Kolitz, Jonathan E.; Rai, Kanti R.; Chiorazzi, Nicholas

2008-01-01

Leukemic B lymphocytes of a large group of unrelated chronic lymphocytic leukemia (CLL) patients express an unmutated heavy chain immunoglobulin variable (V) region encoded by IGHV1-69, IGHD3-16, and IGHJ3 with nearly identical heavy and light chain complementarity-determining region 3 sequences. The likelihood that these patients developed CLL clones with identical antibody V regions randomly is highly improbable and suggests selection by a common antigen. Monoclonal antibodies (mAbs) from this stereotypic subset strongly bind cytoplasmic structures in HEp-2 cells. Therefore, HEp-2 cell extracts were immunoprecipitated with recombinant stereotypic subset-specific CLL mAbs, revealing a major protein band at approximately 225 kDa that was identified by mass spectrometry as nonmuscle myosin heavy chain IIA (MYHIIA). Reactivity of the stereotypic mAbs with MYHIIA was confirmed by Western blot and immunofluorescence colocalization with anti-MYHIIA antibody. Treatments that alter MYHIIA amounts and cytoplasmic localization resulted in a corresponding change in binding to these mAbs. The appearance of MYHIIA on the surface of cells undergoing stress or apoptosis suggests that CLL mAb may generally bind molecules exposed as a consequence of these events. Binding of CLL mAb to MYHIIA could promote the development, survival, and expansion of these leukemic cells. PMID:18812466

The histone methyltransferase EZH2 as a novel prosurvival factor in clinically aggressive chronic lymphocytic leukemia.

PubMed

Papakonstantinou, Nikos; Ntoufa, Stavroula; Chartomatsidou, Elisavet; Kotta, Konstantia; Agathangelidis, Andreas; Giassafaki, Lefki; Karamanli, Tzeni; Bele, Panagiota; Moysiadis, Theodoros; Baliakas, Panagiotis; Sutton, Lesley Ann; Stavroyianni, Niki; Anagnostopoulos, Achilles; Makris, Antonios M; Ghia, Paolo; Rosenquist, Richard; Stamatopoulos, Kostas

2016-06-14

The histone methyltransferase EZH2 induces gene repression through trimethylation of histone H3 at lysine 27 (H3K27me3). EZH2 overexpression has been reported in many types of cancer and associated with poor prognosis. Here we investigated the expression and functionality of EZH2 in chronic lymphocytic leukemia (CLL). Aggressive cases with unmutated IGHV genes (U-CLL) displayed significantly higher EZH2 expression compared to indolent CLL cases with mutated IGHV genes (M-CLL); furthermore, in U-CLL EZH2 expression was upregulated with disease progression. Within U-CLL, EZH2high cases harbored significantly fewer (p = 0.033) TP53 gene abnormalities compared to EZH2low cases. EZH2high cases displayed high H3K27me3 levels and increased viability suggesting that EZH2 is functional and likely confers a survival advantage to CLL cells. This argument was further supported by siRNA-mediated downmodulation of EZH2 which resulted in increased apoptosis. Notably, at the intraclonal level, cell proliferation was significantly associated with EZH2 expression. Treatment of primary CLL cells with EZH2 inhibitors induced downregulation of H3K27me3 levels leading to increased cell apoptosis. In conclusion, EZH2 is overexpressed in adverse-prognosis CLL and associated with increased cell survival and proliferation. Pharmacologic inhibition of EZH2 catalytic activity promotes apoptosis, highlighting EZH2 as a novel potential therapeutic target for specific subgroups of patients with CLL.
The histone methyltransferase EZH2 as a novel prosurvival factor in clinically aggressive chronic lymphocytic leukemia

PubMed Central

Chartomatsidou, Elisavet; Kotta, Konstantia; Agathangelidis, Andreas; Giassafaki, Lefki; Karamanli, Tzeni; Bele, Panagiota; Moysiadis, Theodoros; Baliakas, Panagiotis; Sutton, Lesley Ann; Stavroyianni, Niki; Anagnostopoulos, Achilles; Makris, Antonios M.; Ghia, Paolo; Rosenquist, Richard; Stamatopoulos, Kostas

2016-01-01

The histone methyltransferase EZH2 induces gene repression through trimethylation of histone H3 at lysine 27 (H3K27me3). EZH2 overexpression has been reported in many types of cancer and associated with poor prognosis. Here we investigated the expression and functionality of EZH2 in chronic lymphocytic leukemia (CLL). Aggressive cases with unmutated IGHV genes (U-CLL) displayed significantly higher EZH2 expression compared to indolent CLL cases with mutated IGHV genes (M-CLL); furthermore, in U-CLL EZH2 expression was upregulated with disease progression. Within U-CLL, EZH2high cases harbored significantly fewer (p = 0.033) TP53 gene abnormalities compared to EZH2low cases. EZH2high cases displayed high H3K27me3 levels and increased viability suggesting that EZH2 is functional and likely confers a survival advantage to CLL cells. This argument was further supported by siRNA-mediated downmodulation of EZH2 which resulted in increased apoptosis. Notably, at the intraclonal level, cell proliferation was significantly associated with EZH2 expression. Treatment of primary CLL cells with EZH2 inhibitors induced downregulation of H3K27me3 levels leading to increased cell apoptosis. In conclusion, EZH2 is overexpressed in adverse-prognosis CLL and associated with increased cell survival and proliferation. Pharmacologic inhibition of EZH2 catalytic activity promotes apoptosis, highlighting EZH2 as a novel potential therapeutic target for specific subgroups of patients with CLL. PMID:27191993
Novel Method for High-Throughput Full-Length IGHV-D-J Sequencing of the Immune Repertoire from Bulk B-Cells with Single-Cell Resolution.

PubMed

Vergani, Stefano; Korsunsky, Ilya; Mazzarello, Andrea Nicola; Ferrer, Gerardo; Chiorazzi, Nicholas; Bagnara, Davide

2017-01-01

Efficient and accurate high-throughput DNA sequencing of the adaptive immune receptor repertoire (AIRR) is necessary to study immune diversity in healthy subjects and disease-related conditions. The high complexity and diversity of the AIRR, coupled with the limited amount of starting material, which can compromise identification of the full biological diversity, make such sequencing particularly challenging. AIRR sequencing protocols often fail to fully capture the sampled AIRR diversity, especially for samples containing restricted numbers of B lymphocytes. Here, we describe a library preparation method for immunoglobulin sequencing that results in an exhaustive full-length repertoire where virtually every sampled B-cell is sequenced. This maximizes the likelihood of identifying and quantifying the entire IGHV-D-J repertoire of a sample, including the detection of rearrangements present in only one cell in the starting population. The methodology establishes the importance of circumventing genetic material dilution in the preamplification phases and incorporates the use of certain described concepts: (1) balancing the starting material amount and depth of sequencing, (2) avoiding IGHV gene-specific amplification, and (3) using Unique Molecular Identifiers. Together, this methodology is highly efficient, in particular for detecting rare rearrangements in the sampled population and when only a limited amount of starting material is available.
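As a rough illustration of the third concept, one common way Unique Molecular Identifiers are used is to group reads by their UMI and build a per-position majority consensus, so that each original molecule is counted once. The sketch below assumes reads arrive as (UMI, sequence) pairs and that reads sharing a UMI are already aligned at their start. The function name and input format are hypothetical, and this is not the authors' pipeline.

```python
from collections import Counter, defaultdict


def collapse_by_umi(reads):
    """Collapse (umi, sequence) pairs into one consensus sequence per UMI.

    Assumption: reads sharing a UMI start at the same position, so a simple
    per-position majority vote over the shortest read length is meaningful.
    """
    groups = defaultdict(list)
    for umi, seq in reads:
        groups[umi].append(seq)

    consensus = {}
    for umi, seqs in groups.items():
        length = min(len(s) for s in seqs)
        consensus[umi] = "".join(
            Counter(s[i] for s in seqs).most_common(1)[0][0]
            for i in range(length)
        )
    return consensus


# Toy reads: three copies of one molecule (one carrying an error) plus a singleton.
reads = [
    ("AAC", "ACGT"),
    ("AAC", "ACGT"),
    ("AAC", "ACTT"),
    ("GGT", "TTGA"),
]
print(collapse_by_umi(reads))
```

In this toy input the singleton UMI is retained as its own molecule, which mirrors the abstract's emphasis on detecting rearrangements present in only one cell.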
The Number of Overlapping AID Hotspots in Germline IGHV Genes Is Inversely Correlated with Mutation Frequency in Chronic Lymphocytic Leukemia

PubMed Central

Yuan, Chaohui; Chu, Charles C.; Yan, Xiao-Jie; Bagnara, Davide; Chiorazzi, Nicholas

2017-01-01

The targeting of mutations by Activation-Induced Deaminase (AID) is a key step in generating antibody diversity at the Immunoglobulin (Ig) loci but is also implicated in B-cell malignancies such as chronic lymphocytic leukemia (CLL). AID has previously been shown to preferentially deaminate WRC (W = A/T, R = A/G) hotspots. WGCW sites, which contain an overlapping WRC hotspot on both DNA strands, mutate at much higher frequency than single hotspots. Human Ig heavy chain (IGHV) genes differ in terms of WGCW numbers, ranging from 4 for IGHV3-48*03 to as many as 12 in IGHV1-69*01. An absence of V-region mutations in CLL patients (“IGHV unmutated”, or U-CLL) is associated with a poorer prognosis compared to “IGHV mutated” (M-CLL) patients. The reasons for this difference are still unclear, but it has been noted that particular IGHV genes associate with U-CLL vs M-CLL. For example, patients with IGHV1-69 clones tend to be U-CLL with a poor prognosis, whereas patients with IGHV3-30 tend to be M-CLL and have a better prognosis. Another distinctive feature of CLL is that ~30% of (mostly poor prognosis) patients can be classified into “stereotyped” subsets, each defined by HCDR3 similarity, suggesting selection, possibly for a self-antigen. We analyzed >1000 IGHV genes from CLL patients and found a highly significant statistical relationship between the number of WGCW hotspots in the germline V-region and the observed mutation frequency in patients. However, paradoxically, this correlation was inverse, with V-regions with more WGCW hotspots being less likely to be mutated, i.e., more likely to be U-CLL. The number of WGCW hotspots, in particular, is more strongly correlated with mutation frequency than either non-overlapping (WRC) hotspots or more general models of mutability derived from somatic hypermutation data. Furthermore, this correlation is not observed in sequences from the B cell repertoires of normal individuals and those with autoimmune diseases. PMID:28125682
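The motif definitions above (W = A/T, R = A/G) translate directly into pattern matching. The sketch below counts WRC and WGCW sites in a nucleotide string using zero-width lookahead so that overlapping sites are all counted. It scans the given strand only. WGCW reads as WGCW on the complementary strand too, so its count does not depend on which strand is scanned, whereas the WRC count covers the given strand alone. The function name and the toy sequence are hypothetical, and this is not the authors' analysis code.

```python
import re

# Zero-width lookahead lets the regex engine report overlapping motif starts.
WRC = re.compile(r"(?=[AT][AG]C)")      # W = A/T, R = A/G, then C
WGCW = re.compile(r"(?=[AT]GC[AT])")    # overlapping hotspot: WRC on both strands


def count_hotspots(seq: str) -> dict:
    """Count WRC (given strand only) and WGCW motif start positions in seq."""
    s = seq.upper()
    return {
        "WRC": sum(1 for _ in WRC.finditer(s)),
        "WGCW": sum(1 for _ in WGCW.finditer(s)),
    }


# Toy sequence, not a real IGHV germline gene.
print(count_hotspots("AGCTAACGGAGCA"))  # {'WRC': 3, 'WGCW': 2}
```

Applied to germline V-region sequences, a count of this kind is the per-gene quantity the abstract relates to mutation frequency.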
miR-150 influences B-cell receptor signaling in chronic lymphocytic leukemia by regulating expression of GAB1 and FOXP1

PubMed Central

Mraz, Marek; Chen, Liguang; Rassenti, Laura Z.; Ghia, Emanuela M.; Li, Hongying; Jepsen, Kristen; Smith, Erin N.; Messer, Karen; Frazer, Kelly A.; Kipps, Thomas J.

2014-01-01

We examined the microRNAs (miRNAs) expressed in chronic lymphocytic leukemia (CLL) and identified miR-150 as the most abundant, but with leukemia cell expression levels that varied among patients. CLL cells that expressed ζ-chain–associated protein of 70 kDa (ZAP-70) or that used unmutated immunoglobulin heavy chain variable (IGHV) genes, each had a median expression level of miR-150 that was significantly lower than that of ZAP-70–negative CLL cells or those that used mutated IGHV genes. In samples stratified for expression of miR-150, CLL cells with low-level miR-150 expressed relatively higher levels of forkhead box P1 (FOXP1) and GRB2-associated binding protein 1 (GAB1), genes with 3′ untranslated regions having evolutionary-conserved binding sites for miR-150. High-level expression of miR-150 could repress expression of these genes, which encode proteins that enhance B-cell receptor signaling, a putative CLL-growth/survival signal. Also, high-level expression of miR-150 was a significant independent predictor of longer treatment-free survival or overall survival, whereas an inverse association was observed for high-level expression of GAB1 or FOXP1 for overall survival. This study demonstrates that expression of miR-150 can influence the relative expression of GAB1 and FOXP1 and the signaling potential of the B-cell receptor, thereby possibly accounting for the noted association of expression of miR-150 and disease outcome. PMID:24787006
miR-150 influences B-cell receptor signaling in chronic lymphocytic leukemia by regulating expression of GAB1 and FOXP1.

PubMed

Mraz, Marek; Chen, Liguang; Rassenti, Laura Z; Ghia, Emanuela M; Li, Hongying; Jepsen, Kristen; Smith, Erin N; Messer, Karen; Frazer, Kelly A; Kipps, Thomas J

2014-07-03

We examined the microRNAs (miRNAs) expressed in chronic lymphocytic leukemia (CLL) and identified miR-150 as the most abundant, but with leukemia cell expression levels that varied among patients. CLL cells that expressed ζ-chain-associated protein of 70 kDa (ZAP-70) or that used unmutated immunoglobulin heavy chain variable (IGHV) genes, each had a median expression level of miR-150 that was significantly lower than that of ZAP-70-negative CLL cells or those that used mutated IGHV genes. In samples stratified for expression of miR-150, CLL cells with low-level miR-150 expressed relatively higher levels of forkhead box P1 (FOXP1) and GRB2-associated binding protein 1 (GAB1), genes with 3' untranslated regions having evolutionary-conserved binding sites for miR-150. High-level expression of miR-150 could repress expression of these genes, which encode proteins that enhance B-cell receptor signaling, a putative CLL-growth/survival signal. Also, high-level expression of miR-150 was a significant independent predictor of longer treatment-free survival or overall survival, whereas an inverse association was observed for high-level expression of GAB1 or FOXP1 for overall survival. This study demonstrates that expression of miR-150 can influence the relative expression of GAB1 and FOXP1 and the signaling potential of the B-cell receptor, thereby possibly accounting for the noted association of expression of miR-150 and disease outcome. © 2014 by The American Society of Hematology.
Diffuse large B-cell lymphoma (Richter syndrome) in patients with chronic lymphocytic leukaemia (CLL): a cohort study of newly diagnosed patients.

PubMed

Parikh, Sameer A; Rabe, Kari G; Call, Timothy G; Zent, Clive S; Habermann, Thomas M; Ding, Wei; Leis, Jose F; Schwager, Susan M; Hanson, Curtis A; Macon, William R; Kay, Neil E; Slager, Susan L; Shanafelt, Tait D

2013-09-01

Nearly all information about patients with chronic lymphocytic leukaemia (CLL) who develop diffuse large B-cell lymphoma [Richter syndrome (RS)] is derived from retrospective case series or patients treated on clinical trials. We used the Mayo Clinic CLL Database to identify patients with newly diagnosed CLL between January 2000 and July 2011. Individuals who developed biopsy-proven RS during follow-up were identified. After a median follow-up of 4 years, 37/1641 (2·3%) CLL patients developed RS. The rate of RS was approximately 0·5%/year. Risk of RS was associated with advanced Rai stage at diagnosis (P < 0·001), high-risk genetic abnormalities on fluorescence in situ hybridization (P < 0·0001), unmutated IGHV (P = 0·003), and expression of ZAP70 (P = 0·02) and CD38 (P = 0·001). The rate of RS doubled in patients after treatment for CLL (1%/year). Stereotyped B-cell receptors (odds-ratio = 4·2; P = 0·01) but not IGHV4-39 family usage was associated with increased risk of RS. Treatment with combination of purine analogues and alkylating agents increased the risk of RS three-fold (odds-ratio = 3·26, P = 0·0003). Median survival after RS diagnosis was 2·1 years. The RS prognosis score stratified patients into three risk groups with median survivals of 0·5 years, 2·1 years and not reached. Both underlying characteristics of the CLL clone and subsequent CLL therapy influence the risk of RS. Survival after RS remains poor and new therapies are needed. © 2013 John Wiley & Sons Ltd.
Dysregulation of H/ACA ribonucleoprotein components in chronic lymphocytic leukemia.

PubMed

Dos Santos, Patricia Carolina; Panero, Julieta; Stanganelli, Carmen; Palau Nagore, Virginia; Stella, Flavia; Bezares, Raimundo; Slavutsky, Irma

2017-01-01

Telomeres are protective repeats of TTAGGG sequences located at the end of human chromosomes. They are essential to maintain chromosomal integrity and genome stability. Telomerase is a ribonucleoprotein complex containing an internal RNA template (hTR) and a catalytic subunit (hTERT). The human hTR gene consists of three major domains; among them the H/ACA domain is essential for telomere biogenesis. H/ACA ribonucleoprotein (RNP) complex is composed of four evolutionary conserved proteins, including dyskerin (encoded by DKC1 gene), NOP10, NHP2 and GAR1. In this study, we have evaluated the expression profile of the H/ACA RNP complex genes: DKC1, NOP10, NHP2 and GAR1, as well as hTERT and hTR mRNA levels, in patients with chronic lymphocytic leukemia (CLL). Results were correlated with the number and type of genetic alteration detected by conventional cytogenetics and FISH (fluorescence in situ hybridization), IGHV (immunoglobulin heavy chain variable region) mutational status, telomere length (TL) and clinico-pathological characteristics of patients. Our results showed significantly decreased expression of GAR1, NOP10, DKC1 and hTR, as well as increased mRNA levels of hTERT in patients compared to controls (p≤0.04). Positive correlations between the expression of GAR1-NHP2, GAR1-NOP10, and NOP10-NHP2 (p<0.0001) were observed. The analysis taking into account prognostic factors showed significantly increased expression of the hTERT gene in unmutated-IGHV cases compared to mutated-CLL patients (p = 0.0185). The comparisons among FISH groups exhibited increased expression of DKC1 in cases with two or more alterations with respect to no abnormalities, trisomy 12 and del13q14, and of NHP2 and NOP10 compared to those with del13q14 (p = 0.03). The analysis according to TL showed significantly increased expression of hTERT (p = 0.0074) and DKC1 (p = 0.0036) in patients with short telomeres compared to those with long TL. No association between gene expression and clinical parameters was found. Our results suggest a role for these telomere associated genes in genomic instability and telomere dysfunction in CLL.
Expansion of the Preimmune Antibody Repertoire by Junctional Diversity in Bos taurus

PubMed Central

Liljavirta, Jenni; Niku, Mikael; Pessa-Morikawa, Tiina; Ekman, Anna; Iivanainen, Antti

2014-01-01

Cattle have a limited range of immunoglobulin genes which are further diversified by antigen independent somatic hypermutation in fetuses. Junctional diversity generated during somatic recombination contributes to antibody diversity but its relative significance has not been comprehensively studied. We have investigated the importance of terminal deoxynucleotidyl transferase (TdT) -mediated junctional diversity to the bovine immunoglobulin repertoire. We also searched for new bovine heavy chain diversity (IGHD) genes as the information of the germline sequences is essential to define the junctional boundaries between gene segments. New heavy chain variable genes (IGHV) were explored to address the gene usage in the fetal recombinations. Our bioinformatics search revealed five new IGHD genes, which included the longest IGHD reported so far, 154 bp. By genomic sequencing we found 26 new IGHV sequences that represent potentially new IGHV genes or allelic variants. Sequence analysis of immunoglobulin heavy chain cDNA libraries of fetal bone marrow, ileum and spleen showed 0 to 36 nontemplated N-nucleotide additions between variable, diversity and joining genes. A maximum of 8 N nucleotides were also identified in the light chains. The junctional base profile was biased towards A and T nucleotide additions (64% in heavy chain VD, 52% in heavy chain DJ and 61% in light chain VJ junctions) in contrast to the high G/C content which is usually observed in mice. Sequence analysis also revealed extensive exonuclease activity, providing additional diversity. B-lymphocyte specific TdT expression was detected in bovine fetal bone marrow by reverse transcription-qPCR and immunofluorescence. These results suggest that TdT-mediated junctional diversity and exonuclease activity contribute significantly to the size of the cattle preimmune antibody repertoire already in the fetal period. PMID:24926997
Biological and clinical characterization of recurrent 14q deletions in CLL and other mature B-cell neoplasms.

PubMed

Reindl, Lena; Bacher, Ulrike; Dicker, Frank; Alpermann, Tamara; Kern, Wolfgang; Schnittger, Susanne; Haferlach, Torsten; Haferlach, Claudia

2010-10-01

14q-deletions have been repeatedly described in mature B-cell neoplasms, but not yet characterized in a larger cohort. Based on chromosome banding analysis, the present study identified 47 del(14q) cases in 3054 mature B-cell neoplasms (1·5%) (chronic lymphocytic leukaemia [CLL]: 1·9%; CLL/prolymphocytic leukaemia [PL]: 9·0%; others: 0·2%). Interphase fluorescence in situ hybridization was performed with probes for 14q22.1, 14q24.1, 14q32.33, and IGH@ (14q32.3). The del(14q) had heterogeneous size but showed a breakpoint cluster at the centromeric site in 14q24.1 (62% of cases). At the telomeric side, the most frequent breakpoint was within the IGH@ locus (14q32.3) between IGH@ 3'-flanking and IGHV (IgVH) probes (45%). In 16 cases (34%), breakpoints occurred within 14q24.1 and 14q32.3. Eighty-one percent of del(14q) cases showed 1-3 additional cytogenetic alterations (in 45%, +12), and 56% were IGHV-unmutated. In all cases (16/16) with breakpoints in 14q24.1 and 14q32.3, a B-CLL immunophenotype was found. Clinical follow-up in 32 del(14q) patients was compared to 383 CLL and CLL/PL patients without del(14q). While 3-year-overall survival did not differ significantly, time to treatment was significantly shorter in the del(14q) cohort (21·0 months vs. 80·1 months, P = 0·015). In conclusion, the del(14q) is a rare recurrent alteration in diverse mature B-cell neoplasms, shows variable size but distinct clustering of breakpoints, and is associated with short time to treatment. © 2010 Blackwell Publishing Ltd.
B cell gene signature with massive intrahepatic production of antibodies to hepatitis B core antigen in hepatitis B virus-associated acute liver failure.

PubMed

Farci, Patrizia; Diaz, Giacomo; Chen, Zhaochun; Govindarajan, Sugantha; Tice, Ashley; Agulto, Liane; Pittaluga, Stefania; Boon, Denali; Yu, Claro; Engle, Ronald E; Haas, Mark; Simon, Richard; Purcell, Robert H; Zamboni, Fausto

2010-05-11

Hepatitis B virus (HBV)-associated acute liver failure (ALF) is a dramatic clinical syndrome due to a sudden loss of hepatic cells leading to multiorgan failure. The mechanisms whereby HBV induces ALF are unknown. Here, we show that liver tissue collected at the time of liver transplantation in two patients with HBV-associated ALF is characterized by an overwhelming B cell response apparently centered in the liver with massive accumulation of plasma cells secreting IgG and IgM, accompanied by complement deposition. We demonstrate that the molecular target of these antibodies is the hepatitis B core antigen (HBcAg); that these antibodies display a restricted variable heavy chain (V(H)) repertoire and lack somatic mutations; and that these two unrelated individuals with ALF use an identical predominant V(H) gene with unmutated variable domain (IGHV1-3) for both IgG and IgM anti-HBc antibodies, indicating that HBcAg is the target of a germline human V(H) gene. These data suggest that humoral immunity may exert a primary role in the pathogenesis of HBV-associated ALF.
Real world outcomes and management strategies for venetoclax-treated chronic lymphocytic leukemia patients in the United States.

PubMed

Mato, Anthony R; Thompson, Meghan; Allan, John N; Brander, Danielle M; Pagel, John M; Ujjani, Chaitra S; Hill, Brian T; Lamanna, Nicole; Lansigan, Frederick; Jacobs, Ryan; Shadman, Mazyar; Skarbnik, Alan P; Pu, Jeffrey J; Barr, Paul M; Sehgal, Alison R; Cheson, Bruce D; Zent, Clive S; Tuncer, Hande H; Schuster, Stephen J; Pickens, Peter V; Shah, Nirav N; Goy, Andre; Winter, Allison M; Garcia, Christine; Kennard, Kaitlin; Isaac, Krista; Dorsey, Colleen; Gashonia, Lisa M; Singavi, Arun K; Roeker, Lindsey E; Zelenetz, Andrew; Williams, Annalynn; Howlett, Christina; Weissbrot, Hanna; Ali, Naveed; Khajavian, Sirin; Sitlinger, Andrea; Tranchito, Eve; Rhodes, Joanna; Felsenfeld, Joshua; Bailey, Neil; Patel, Bhavisha; Burns, Timothy F; Yacur, Melissa; Malhotra, Mansi; Svoboda, Jakub; Furman, Richard R; Nabhan, Chadi

2018-06-07

Venetoclax is a BCL2 inhibitor approved for 17p-deleted relapsed/refractory chronic lymphocytic leukemia with activity following kinase inhibitors. We conducted a multicenter retrospective cohort analysis of patients with CLL treated with venetoclax to describe outcomes, toxicities, and treatment selection following venetoclax discontinuation. A total of 141 chronic lymphocytic leukemia patients were included (98% relapsed/refractory). Median age at venetoclax initiation was 67 years (range 37-91), median prior therapies was 3 (0-11), 81% unmutated IGHV, 45% del(17p), and 26.8% complex karyotype (≥ 3 abnormalities). Prior to venetoclax initiation, 89% received a B-cell receptor antagonist. For tumor lysis syndrome prophylaxis, 93% received allopurinol, 92% normal saline, and 45% rasburicase. Dose escalation to the maximum recommended dose of 400 mg daily was achieved in 85% of patients. Adverse events of interest included neutropenia in 47.4%, thrombocytopenia in 36%, tumor lysis syndrome in 13.4%, neutropenic fever in 11.6%, and diarrhea in 7.3%. The overall response rate to venetoclax was 72% (19.4% complete remission). With a median follow up of 7 months, median progression free survival and overall survival for the entire cohort have not been reached. To date, 41 venetoclax treated patients have discontinued therapy and 24 have received a subsequent therapy, most commonly ibrutinib. In the largest clinical experience of venetoclax-treated chronic lymphocytic leukemia patients, the majority successfully completed and maintained a maximum recommended dose. Response rates and duration of response appear comparable to clinical trial data. Venetoclax was active in patients with mutations known to confer ibrutinib resistance. Optimal sequencing of newer chronic lymphocytic leukemia therapies requires further study. Copyright © 2018, Ferrata Storti Foundation.
Molecular analysis of immunoglobulin variable genes supports a germinal center experienced normal counterpart in primary cutaneous diffuse large B-cell lymphoma, leg-type.

PubMed

Pham-Ledard, Anne; Prochazkova-Carlotti, Martina; Deveza, Mélanie; Laforet, Marie-Pierre; Beylot-Barry, Marie; Vergier, Béatrice; Parrens, Marie; Feuillard, Jean; Merlio, Jean-Philippe; Gachard, Nathalie

2017-11-01

Immunophenotype of primary cutaneous diffuse large B-cell lymphoma, leg-type (PCLBCL-LT) suggests a germinal center-experienced B lymphocyte (BCL2+ MUM1+ BCL6+/-). As the maturation history of a B cell is "imprinted" during B-cell development on the immunoglobulin gene sequence, we studied the structure and sequence of the variable part of the genes (IGHV, IGLV, IGKV), immunoglobulin surface expression and features of class switching in order to determine the PCLBCL-LT cell of origin. Clonality analysis with the BIOMED2 protocol and VH leader primers was done on DNA extracted from frozen skin biopsies on retrospective samples from 14 patients. The clonal DNA IGHV sequence of the tumor was aligned and compared with the closest germline sequence and homology percentage was calculated. Superantigen binding sites were studied. Features of selection pressure were evaluated with the multinomial Lossos model. A functional monoclonal sequence was observed in 14 cases as determined for IGHV (10), IGLV (2) or IGKV (3). IGV mutation rates were high (>5%) in all cases but one (median:15.5%), with conservation of superantigen binding sites. Features of selection pressure were identified in 11/12 interpretable cases, more frequently negative (75%) than positive (25%). Intraclonal variation was detected in 3 of 8 tumor specimens with a low rate of mutations. Surface immunoglobulin was an IgM in 12/12 cases. FISH analysis of IGHM locus, deleted during class switching, showed heterozygous IGHM gene deletion in half of cases. The genomic PCR analysis confirmed the deletions within the switch μ region. IGV sequences were highly mutated but functional, with negative features of selection pressure suggesting one or more germinal center passage(s) with somatic hypermutation, but with conservation of superantigen (SpA) binding sites. Genetic features of class switch were observed, but on the non functional allele and co-existing with primary isotype IgM expression. These data suggest that the cell of origin is a germinal center-experienced, superantigen-selected B cell, in a stage between germinal center B-cell and plasma cell. Copyright © 2017 Japanese Society for Investigative Dermatology. Published by Elsevier B.V. All rights reserved.
Germline-encoded neutralization of a Staphylococcus aureus virulence factor by the human antibody repertoire.

PubMed

Yeung, Yik Andy; Foletti, Davide; Deng, Xiaodi; Abdiche, Yasmina; Strop, Pavel; Glanville, Jacob; Pitts, Steven; Lindquist, Kevin; Sundar, Purnima D; Sirota, Marina; Hasa-Moreno, Adela; Pham, Amber; Melton Witt, Jody; Ni, Irene; Pons, Jaume; Shelton, David; Rajpal, Arvind; Chaparro-Riggers, Javier

2016-11-18

Staphylococcus aureus is both an important pathogen and a human commensal. To explore this ambivalent relationship between host and microbe, we analysed the memory humoral response against IsdB, a protein involved in iron acquisition, in four healthy donors. Here we show that in all donors a heavily biased use of two immunoglobulin heavy chain germlines generated high affinity (pM) antibodies that neutralize the two IsdB NEAT domains, IGHV4-39 for NEAT1 and IGHV1-69 for NEAT2. In contrast to the typical antibody/antigen interactions, the binding is primarily driven by the germline-encoded hydrophobic CDRH-2 motifs of IGHV1-69 and IGHV4-39, with a binding mechanism nearly identical for each antibody derived from different donors. Our results suggest that IGHV1-69 and IGHV4-39, while part of the adaptive immune system, may have evolved under selection pressure to encode a binding motif innately capable of recognizing and neutralizing a structurally conserved protein domain involved in pathogen iron acquisition.
Recombinant antibodies encoded by IGHV1-69 react with pUL32, a phosphoprotein of cytomegalovirus and B-cell superantigen

PubMed Central

Steininger, Christoph; Widhopf, George F.; Ghia, Emanuela M.; Morello, Christopher S.; Vanura, Katrina; Sanders, Rebecca; Spector, Deborah; Guiney, Don; Jäger, Ulrich

2012-01-01

Leukemia cells from patients with chronic lymphocytic leukemia (CLL) express a highly restricted immunoglobulin heavy variable chain (IGHV) repertoire, suggesting that a limited set of antigens reacts with leukemic cells. Here, we evaluated the reactivity of a panel of different CLL recombinant antibodies (rAbs) encoded by the most commonly expressed IGHV genes with a panel of selected viral and bacterial pathogens. Six different CLL rAbs encoded by IGHV1-69 or IGHV3-21, but not a CLL rAb encoded by IGHV4-39 genes, reacted with a single protein of human cytomegalovirus (CMV). The CMV protein was identified as the large structural phosphoprotein pUL32. In contrast, none of the CLL rAbs bound to any other structure of CMV, adenovirus serotype 2, Salmonella enterica serovar Typhimurium, or of cells used for propagation of these microorganisms. Monoclonal antibodies or humanized rAbs of irrelevant specificity to pUL32 did not react with any of the proteins present in the different lysates. Still, rAbs encoded by a germ line IGHV1-69 51p1 allele from CMV-seropositive and -negative adults also reacted with pUL32. The observed reactivity of multiple different CLL rAbs and natural antibodies from CMV-seronegative adults with pUL32 is consistent with the properties of a superantigen. PMID:22234695
Overlapping hotspots in CDRs are critical sites for V region diversification.

PubMed

Wei, Lirong; Chahwan, Richard; Wang, Shanzhi; Wang, Xiaohua; Pham, Phuong T; Goodman, Myron F; Bergman, Aviv; Scharff, Matthew D; MacCarthy, Thomas

2015-02-17

Activation-induced deaminase (AID) mediates the somatic hypermutation (SHM) of Ig variable (V) regions that is required for the affinity maturation of the antibody response. An intensive analysis of a published database of somatic hypermutations that arose in the IGHV3-23*01 human V region expressed in vivo by human memory B cells revealed that the focus of mutations in complementarity-determining region (CDR)1 and CDR2 coincided with a combination of overlapping AGCT hotspots, the absence of AID cold spots, and an abundance of polymerase eta hotspots. If the overlapping hotspots in CDR1 or CDR2 did not undergo mutation, the frequency of mutations throughout the V region was reduced. To model this result, we examined the mutation of human IGHV3-23*01 biochemically and in the endogenous heavy chain locus of Ramos B cells. Deep sequencing revealed that IGHV3-23*01 in Ramos cells accumulates AID-induced mutations primarily in the AGCT in CDR2, which was also the most frequent site of mutation in vivo. Replacing the overlapping hotspots in CDR1 and CDR2 with neutral or cold motifs resulted in a reduction in mutations within the modified motifs and, to some degree, throughout the V region. In addition, some of the overlapping hotspots in the CDRs were at sites in which replacement mutations could change the structure of the CDR loops. Our analysis suggests that the local sequence environment of the V region, and especially of CDR1 and CDR2, is highly evolved to recruit mutations to key residues in the CDRs of the IgV region.
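Locating AGCT motifs in a V-region sequence, as described in this abstract, can be sketched as follows. The toy sequence and the region coordinates are hypothetical placeholders, not the IGHV3-23*01 sequence or its real CDR boundaries, and the helper names are illustrative only.

```python
def find_motif(seq: str, motif: str = "AGCT") -> list[int]:
    """Return 0-based start positions of every occurrence of motif in seq."""
    seq = seq.upper()
    motif = motif.upper()
    positions = []
    start = seq.find(motif)
    while start != -1:
        positions.append(start)
        start = seq.find(motif, start + 1)
    return positions


def motifs_in_region(positions: list[int], region_start: int,
                     region_end: int, motif_len: int = 4) -> list[int]:
    """Keep motif starts that lie fully inside the half-open interval [region_start, region_end)."""
    return [p for p in positions
            if p >= region_start and p + motif_len <= region_end]


# Hypothetical toy sequence and a hypothetical "CDR" window at positions 6..13.
toy_seq = "GGAGCTTCAGCTAAGCTG"
hits = find_motif(toy_seq)
cdr_hits = motifs_in_region(hits, 6, 14)
print("AGCT starts:", hits)
print("AGCT starts inside the toy window:", cdr_hits)
```

AGCT is its own reverse complement, so a single scan of one strand finds the motif on both strands; counting other hotspot or cold-spot motifs would need additional patterns that this sketch does not attempt.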
Genome-wide DNA methylation profiling integrated with gene expression profiling identifies PAX9 as a novel prognostic marker in chronic lymphocytic leukemia.

PubMed

Rani, Lata; Mathur, Nitin; Gupta, Ritu; Gogia, Ajay; Kaur, Gurvinder; Dhanjal, Jaspreet Kaur; Sundar, Durai; Kumar, Lalit; Sharma, Atul

2017-01-01

In chronic lymphocytic leukemia (CLL), epigenomic and genomic studies have expanded the existing knowledge about the disease biology and led to the identification of potential biomarkers relevant for implementation of personalized medicine. In this study, an attempt has been made to examine and integrate the global DNA methylation changes with the gene expression profile and their impact on clinical outcome in early stage CLL patients. The integration of the DNA methylation profile (n = 14) with the gene expression profile (n = 21) revealed 142 genes as hypermethylated-downregulated and 62 genes as hypomethylated-upregulated in early stage CLL patients compared to CD19+ B-cells from healthy individuals. The mRNA expression levels of 17 genes identified as differentially methylated and/or differentially expressed were further examined in early stage CLL patients (n = 93) by quantitative real time PCR (RQ-PCR). Significant differences were observed in the mRNA expression of MEIS1, PMEPA1, SOX7, SPRY1, CDK6, TBX2, and SPRY2 genes in CLL cells as compared to B-cells from healthy individuals. The analysis in the IGHV mutation based categories (Unmutated = 39, Mutated = 54) revealed significantly higher mRNA expression of CRY1 and PAX9 genes in the IGHV unmutated subgroup (p < 0.001). The relative risk of treatment initiation was significantly higher among patients with high expression of CRY1 (RR = 1.91, p = 0.005) or PAX9 (RR = 1.87, p = 0.001). High expression of the CRY1 (HR: 3.53, p < 0.001) or PAX9 (HR: 3.14, p < 0.001) gene was significantly associated with shorter time to first treatment. High expression of the PAX9 gene (HR: 3.29, 95% CI 1.172-9.272, p = 0.016) was also predictive of shorter overall survival in CLL. The DNA methylation changes associated with mRNA expression of CRY1 and PAX9 genes allow risk stratification of early stage CLL patients. This comprehensive analysis supports the concept that the epigenetic changes along with the altered expression of genes have the potential to predict clinical outcome in early stage CLL patients.
Development of a Bioinformatics Framework for the Detection of Gene Conversion and the Analysis of Combinatorial Diversity in Immunoglobulin Heavy Chains in Four Cattle Breeds.

PubMed

Walther, Stefanie; Tietze, Manfred; Czerny, Claus-Peter; König, Sven; Diesterbeck, Ulrike S

2016-01-01

We have developed a new bioinformatics framework for the analysis of rearranged bovine heavy chain immunoglobulin (Ig) variable regions by combining and refining widely used alignment algorithms. This bioinformatics framework allowed us to investigate alignments of heavy chain framework regions (FRHs) and the separate alignments of FRHs and heavy chain complementarity determining regions (CDRHs) to determine their germline origin in the four cattle breeds Aubrac, German Black Pied, German Simmental, and Holstein Friesian. Now it is also possible to specifically analyze Ig heavy chains possessing exceptionally long CDR3Hs. In order to gain more insight into breed specific differences in Ig combinatorial diversity, somatic hypermutations and putative gene conversions of IgG, we compared the dominantly transcribed variable (IGHV), diversity (IGHD), and joining (IGHJ) segments and their recombination in the four cattle breeds. The analysis revealed the use of 15 different IGHV segments, 21 IGHD segments, and two IGHJ segments with significant different transcription levels within the breeds. Furthermore, there are preferred rearrangements within the three groups of CDR3H lengths. In the sequences of group 2 (CDR3H lengths (L) of 11-47 amino acid residues (aa)) a higher number of recombination was observed than in sequences of group 1 (L≤10 aa) and 3 (L≥48 aa). The combinatorial diversity of germline IGHV, IGHD, and IGHJ-segments revealed 162 rearrangements that were significantly different. The few preferably rearranged gene segments within group 3 CDR3H regions may indicate specialized antibodies because this length is unique in cattle. The most important finding of this study, which was enabled by using the bioinformatics framework, is the discovery of strong evidence for gene conversion as a rare event using pseudogenes fulfilling all definitions for this particular diversification mechanism.
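The three CDR3H length groups defined in this abstract (group 1: L≤10 aa, group 2: 11 to 47 aa, group 3: L≥48 aa) can be expressed as a small classifier. This is only a sketch of the grouping rule; the function name and the example lengths are hypothetical and do not come from the study's data.

```python
from collections import Counter


def cdr3h_group(length_aa: int) -> int:
    """Assign a CDR3H length (in amino acids) to group 1, 2 or 3."""
    if length_aa < 0:
        raise ValueError("length must be non-negative")
    if length_aa <= 10:
        return 1
    if length_aa <= 47:
        return 2
    return 3


# Hypothetical CDR3H lengths, chosen to hit each group boundary.
toy_lengths = [8, 10, 11, 25, 47, 48, 61]
group_counts = Counter(cdr3h_group(n) for n in toy_lengths)
print(dict(sorted(group_counts.items())))
```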
Development of a Bioinformatics Framework for the Detection of Gene Conversion and the Analysis of Combinatorial Diversity in Immunoglobulin Heavy Chains in Four Cattle Breeds

PubMed Central

Czerny, Claus-Peter; König, Sven; Diesterbeck, Ulrike S.

2016-01-01

We have developed a new bioinformatics framework for the analysis of rearranged bovine heavy chain immunoglobulin (Ig) variable regions by combining and refining widely used alignment algorithms. This bioinformatics framework allowed us to investigate alignments of heavy chain framework regions (FRHs) and the separate alignments of FRHs and heavy chain complementarity determining regions (CDRHs) to determine their germline origin in the four cattle breeds Aubrac, German Black Pied, German Simmental, and Holstein Friesian. Now it is also possible to specifically analyze Ig heavy chains possessing exceptionally long CDR3Hs. In order to gain more insight into breed specific differences in Ig combinatorial diversity, somatic hypermutations and putative gene conversions of IgG, we compared the dominantly transcribed variable (IGHV), diversity (IGHD), and joining (IGHJ) segments and their recombination in the four cattle breeds. The analysis revealed the use of 15 different IGHV segments, 21 IGHD segments, and two IGHJ segments with significant different transcription levels within the breeds. Furthermore, there are preferred rearrangements within the three groups of CDR3H lengths. In the sequences of group 2 (CDR3H lengths (L) of 11–47 amino acid residues (aa)) a higher number of recombination was observed than in sequences of group 1 (L≤10 aa) and 3 (L≥48 aa). The combinatorial diversity of germline IGHV, IGHD, and IGHJ-segments revealed 162 rearrangements that were significantly different. The few preferably rearranged gene segments within group 3 CDR3H regions may indicate specialized antibodies because this length is unique in cattle. The most important finding of this study, which was enabled by using the bioinformatics framework, is the discovery of strong evidence for gene conversion as a rare event using pseudogenes fulfilling all definitions for this particular diversification mechanism. PMID:27828971
Evaluation of 230 patients with relapsed/refractory deletion 17p chronic lymphocytic leukaemia treated with ibrutinib from 3 clinical trials.

PubMed

Jones, Jeffrey; Mato, Anthony; Coutre, Steven; Byrd, John C; Furman, Richard R; Hillmen, Peter; Osterborg, Anders; Tam, Constantine; Stilgenbauer, Stephan; Wierda, William G; Heerema, Nyla A; Eckert, Karl; Clow, Fong; Zhou, Cathy; Chu, Alvina D; James, Danelle F; O'Brien, Susan M

2018-06-05

Patients with chronic lymphocytic leukaemia/small lymphocytic lymphoma (CLL/SLL) with deletion 17p [del(17p)] have poor outcomes with chemoimmunotherapy. Ibrutinib is indicated for the treatment of CLL/SLL, including del(17p) CLL/SLL, and allows for treatment without chemotherapy. This integrated analysis was performed to evaluate outcomes in 230 patients with relapsed/refractory del(17p) CLL/SLL from three ibrutinib studies. With a median of 2 prior therapies (range, 1-12), 18% and 79% of evaluable patients had del(11q) or unmutated IGHV, respectively. With a median follow-up of 28 months, overall response rate was 85% and estimated 30-month progression-free and overall survival rates were 57% [95% confidence interval (CI) 50-64] and 69% (95% CI 61-75), respectively. Patients with normal lactate dehydrogenase or no bulky disease had the most favourable survival outcomes. Sustained haematological improvements in haemoglobin, platelet count and absolute neutrophil count occurred in 61%, 67% and 70% of patients with baseline cytopenias, respectively. New onset severe cytopenias and infections decreased in frequency over time. Progression-free and overall survival with ibrutinib surpass those of other therapies for patients with del(17p) CLL/SLL. These results provide further evidence of the robust clinical activity of ibrutinib in difficult-to-treat CLL/SLL populations. © 2018 The Authors. British Journal of Haematology published by John Wiley & Sons Ltd.

Idelalisib, an inhibitor of phosphatidylinositol 3-kinase p110δ, for relapsed/refractory chronic lymphocytic leukemia

PubMed Central

Byrd, John C.; Coutre, Steven E.; Benson, Don M.; Flinn, Ian W.; Wagner-Johnston, Nina D.; Spurgeon, Stephen E.; Kahl, Brad S.; Bello, Celeste; Webb, Heather K.; Johnson, Dave M.; Peterman, Sissy; Li, Daniel; Jahn, Thomas M.; Lannutti, Brian J.; Ulrich, Roger G.; Yu, Albert S.; Miller, Langdon L.; Furman, Richard R.

2014-01-01

In a phase 1 trial, idelalisib (GS-1101, CAL-101), a selective inhibitor of the lipid kinase PI3Kδ, was evaluated in 54 patients with relapsed/refractory chronic lymphocytic leukemia (CLL) with adverse characteristics including bulky lymphadenopathy (80%), extensive prior therapy (median 5 [range 2-14] prior regimens), treatment-refractory disease (70%), unmutated IGHV (91%), and del17p and/or TP53 mutations (24%). Patients were treated at 6 dose levels of oral idelalisib (range 50-350 mg once or twice daily) and remained on continuous therapy while deriving clinical benefit. Idelalisib-mediated inhibition of PI3Kδ led to abrogation of Akt phosphorylation in patient CLL cells and significantly reduced serum levels of CLL-related chemokines. The most commonly observed grade ≥3 adverse events were pneumonia (20%), neutropenic fever (11%), and diarrhea (6%). Idelalisib treatment resulted in nodal responses in 81% of patients. The overall response rate was 72%, with 39% of patients meeting the criteria for partial response per IWCLL 2008 and 33% meeting the recently updated criteria of PR with treatment-induced lymphocytosis. The median progression-free survival for all patients was 15.8 months. This study demonstrates the clinical utility of inhibiting the PI3Kδ pathway with idelalisib. Our findings support the further development of idelalisib in patients with CLL. These trials were registered at clinicaltrials.gov as #NCT00710528 and #NCT01090414. PMID:24615777
A phase 2 study of idelalisib plus rituximab in treatment-naïve older patients with chronic lymphocytic leukemia

PubMed Central

Lamanna, Nicole; Kipps, Thomas J.; Flinn, Ian; Zelenetz, Andrew D.; Burger, Jan A.; Keating, Michael; Mitra, Siddhartha; Holes, Leanne; Yu, Albert S.; Johnson, David M.; Miller, Langdon L.; Kim, Yeonhee; Dansey, Roger D.; Dubowy, Ronald L.; Coutre, Steven E.

2015-01-01

Idelalisib is a first-in-class oral inhibitor of PI3Kδ that has shown substantial activity in patients with relapsed/refractory chronic lymphocytic leukemia (CLL). To evaluate idelalisib as initial therapy, 64 treatment-naïve older patients with CLL or small lymphocytic leukemia (median age, 71 years; range, 65-90) were treated with rituximab 375 mg/m2 weekly ×8 and idelalisib 150 mg twice daily continuously for 48 weeks. Patients completing 48 weeks without progression could continue to receive idelalisib on an extension study. The median time on treatment was 22.4 months (range, 0.8-45.8+). The overall response rate (ORR) was 97%, including 19% complete responses. The ORR was 100% in patients with del(17p)/TP53 mutations and 97% in those with unmutated IGHV. Progression-free survival was 83% at 36 months. The most frequent (>30%) adverse events (any grade) were diarrhea (including colitis) (64%), rash (58%), pyrexia (42%), nausea (38%), chills (36%), cough (33%), and fatigue (31%). Elevated alanine transaminase/aspartate transaminase was seen in 67% of patients (23% grade ≥3). The combination of idelalisib and rituximab was highly active, resulting in durable disease control in treatment-naïve older patients with CLL. These results support the further development of idelalisib as initial treatment of CLL. This study is registered at ClinicalTrials.gov as #NCT01203930. PMID:26472751
MALT1 Inhibition Is Efficacious in Both Naïve and Ibrutinib-Resistant Chronic Lymphocytic Leukemia.

PubMed

Saba, Nakhle S; Wong, Deanna H; Tanios, Georges; Iyer, Jessica R; Lobelle-Rich, Patricia; Dadashian, Eman L; Liu, Delong; Fontan, Lorena; Flemington, Erik K; Nichols, Cydney M; Underbayev, Chingiz; Safah, Hana; Melnick, Ari; Wiestner, Adrian; Herman, Sarah E M

2017-12-15

The clinical efficacy displayed by ibrutinib in chronic lymphocytic leukemia (CLL) has been challenged by the frequent emergence of resistant clones. The ibrutinib target, Bruton's tyrosine kinase (BTK), is essential for B-cell receptor signaling, and most resistant cases carry mutations in BTK or PLCG2, a downstream effector target of BTK. Recent findings show that MI-2, a small molecule inhibitor of the para-caspase MALT1, is effective in preclinical models of another type of BCR pathway-dependent lymphoma. We therefore studied the activity of MI-2 against CLL and ibrutinib-resistant CLL. Treatment of CLL cells in vitro with MI-2 inhibited MALT1 proteolytic activity, reduced BCR and NF-κB signaling, inhibited nuclear translocation of RelB and p50, and decreased Bcl-xL levels. MI-2 selectively induced dose- and time-dependent apoptosis in CLL cells, sparing normal B lymphocytes. Furthermore, MI-2 abrogated survival signals provided by stromal cells and BCR cross-linking and was effective against CLL cells harboring features associated with poor outcomes, including 17p deletion and unmutated IGHV. Notably, MI-2 was effective against CLL cells collected from patients harboring mutations conferring resistance to ibrutinib. Overall, our findings provide a preclinical rationale for the clinical development of MALT1 inhibitors in CLL, in particular for ibrutinib-resistant forms of this disease. Cancer Res; 77(24); 7038-48. ©2017 American Association for Cancer Research.
Normal serum protein electrophoresis and mutated IGHV genes detect very slowly evolving chronic lymphocytic leukemia patients.

PubMed

Chauzeix, Jasmine; Laforêt, Marie-Pierre; Deveza, Mélanie; Crowther, Liam; Marcellaud, Elodie; Derouault, Paco; Lia, Anne-Sophie; Boyer, François; Bargues, Nicolas; Olombel, Guillaume; Jaccard, Arnaud; Feuillard, Jean; Gachard, Nathalie; Rizzo, David

2018-05-09

More than 35 years after the Binet classification, there is still a need for simple prognostic markers in chronic lymphocytic leukemia (CLL). Here, we studied the treatment-free survival (TFS) impact of normal serum protein electrophoresis (SPE) at diagnosis. One hundred twelve patients with CLL were analyzed. The main prognostic factors (Binet stage; lymphocytosis; IGHV mutation status; TP53, SF3B1, NOTCH1, and BIRC3 mutations; and cytogenetic abnormalities) were studied. The frequencies of IGHV mutation status, cytogenetic abnormalities, and TP53, SF3B1, NOTCH1, and BIRC3 mutations were not significantly different between normal and abnormal SPE. Normal SPE was associated with Binet stage A, nonprogressive disease for 6 months, lymphocytosis below 30 G/L, and the absence of the IGHV3-21 gene rearrangement which is associated with poor prognosis. The TFS of patients with normal SPE was significantly longer than that of patients with abnormal SPE (log-rank test: P = 0.0015, with 51% untreated patients at 5.6 years and a perfect plateau afterward vs. a median TFS at 2.64 years for abnormal SPE with no plateau). Multivariate analysis using two different Cox models and bootstrapping showed that normal SPE was an independent good prognostic marker for either Binet stage, lymphocytosis, or IGHV mutation status. TFS was further increased when both normal SPE and mutated IGHV were present (log-rank test: P = 0.008, median not reached, plateau at 5.6 years and 66% untreated patients). A comparison with other prognostic markers suggested that normal SPE could reflect slowly advancing CLL disease. Altogether, our results show that a combination of normal SPE and mutated IGHV genes defines a subgroup of patients with CLL who evolve very slowly and who might never need treatment. © 2018 The Authors. Cancer Medicine published by John Wiley & Sons Ltd.
Dielectrophoretic isolation and detection of cancer-related circulating cell-free DNA biomarkers from blood and plasma

PubMed Central

Sonnenberg, Avery; Marciniak, Jennifer Y.; Skowronski, Elaine A.; Manouchehri, Sareh; Rassenti, Laura; Ghia, Emanuela M.; Widhopf, George F.; Kipps, Thomas J.; Heller, Michael J.

2014-01-01

Conventional methods for the isolation of cancer-related circulating cell-free (ccf) DNA from patient blood (plasma) are time consuming and laborious. A DEP approach utilizing a microarray device now allows rapid isolation of ccf-DNA directly from a small volume of unprocessed blood. In this study, the DEP device is used to compare the ccf-DNA isolated directly from whole blood and plasma from 11 chronic lymphocytic leukemia (CLL) patients and one normal individual. Ccf-DNA from both blood and plasma samples was separated into DEP high-field regions, after which cells (blood), proteins, and other biomolecules were removed by a fluidic wash. The concentrated ccf-DNA was detected on-chip by fluorescence, and then eluted for PCR and DNA sequencing. The complete process from blood to PCR required less than 10 min; an additional 15 min was required to obtain plasma from whole blood. Ccf-DNA from the equivalent of 5 µL of CLL blood and 5 µL of plasma was amplified by PCR using Ig heavy-chain variable (IGHV) specific primers to identify the unique IGHV gene expressed by the leukemic B-cell clone. The PCR and DNA sequencing results obtained by DEP from all 11 CLL blood samples and from 8 of the 11 CLL plasma samples were exactly comparable to the DNA sequencing results obtained from genomic DNA isolated from CLL patient leukemic B cells (gold standard). PMID:24723219
Dielectrophoretic isolation and detection of cancer-related circulating cell-free DNA biomarkers from blood and plasma.

PubMed

Sonnenberg, Avery; Marciniak, Jennifer Y; Skowronski, Elaine A; Manouchehri, Sareh; Rassenti, Laura; Ghia, Emanuela M; Widhopf, George F; Kipps, Thomas J; Heller, Michael J

2014-07-01

Conventional methods for the isolation of cancer-related circulating cell-free (ccf) DNA from patient blood (plasma) are time consuming and laborious. A DEP approach utilizing a microarray device now allows rapid isolation of ccf-DNA directly from a small volume of unprocessed blood. In this study, the DEP device is used to compare the ccf-DNA isolated directly from whole blood and plasma from 11 chronic lymphocytic leukemia (CLL) patients and one normal individual. Ccf-DNA from both blood and plasma samples was separated into DEP high-field regions, after which cells (blood), proteins, and other biomolecules were removed by a fluidic wash. The concentrated ccf-DNA was detected on-chip by fluorescence, and then eluted for PCR and DNA sequencing. The complete process from blood to PCR required less than 10 min; an additional 15 min was required to obtain plasma from whole blood. Ccf-DNA from the equivalent of 5 μL of CLL blood and 5 μL of plasma was amplified by PCR using Ig heavy-chain variable (IGHV) specific primers to identify the unique IGHV gene expressed by the leukemic B-cell clone. The PCR and DNA sequencing results obtained by DEP from all 11 CLL blood samples and from 8 of the 11 CLL plasma samples were exactly comparable to the DNA sequencing results obtained from genomic DNA isolated from CLL patient leukemic B cells (gold standard). © 2014 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Monoclonal B-cell lymphocytosis in healthy blood donors: an unexpectedly common finding.

PubMed

Shim, Youn K; Rachel, Jane M; Ghia, Paolo; Boren, Jeff; Abbasi, Fatima; Dagklis, Antonis; Venable, Geri; Kang, Jiyeon; Degheidy, Heba; Plapp, Fred V; Vogt, Robert F; Menitove, Jay E; Marti, Gerald E

2014-02-27

Circulating monoclonal B cells may be detected in healthy adults, a condition called monoclonal B-cell lymphocytosis (MBL). MBL has also been identified in donated blood, but no systematic study of blood donors has been reported. Using sensitive and specific laboratory methods, we detected MBL in 149 (7.1%; 95% confidence interval, 6.0% to 8.3%) of 2098 unique donors ages 45 years or older in a Midwestern US regional blood center between 2010 and 2011. Most of the 149 donors had low-count MBL, including 99 chronic lymphocytic leukemia-like (66.4%), 22 atypical (14.8%), and 19 CD5(-) (12.8%) immunophenotypes. However, 5 donors (3.4%) had B-cell clonal counts above 500 cells per µL, including 3 with 1693 to 2887 cells per µL; the clone accounted for nearly all their circulating B cells. Four donors (2.7%) had 2 distinct MBL clones. Of 51 MBL samples in which immunoglobulin heavy chain (IGH)V-D-J genotypes could be determined, 71% and 29% used IGHV3- and IGHV4-family genes, respectively. Sequencing revealed 82% with somatic hypermutation, whereas 18% had >98% germ-line identity, including 5 with entirely germ-line sequences. In conclusion, MBL prevalence is much higher in blood donors than previously reported, and although uncommon, the presence of high-count MBL warrants further investigations to define the biological fate of the transfused cells in recipients.
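The prevalence figure in this abstract (149 of 2098 donors) can be reproduced approximately with a short calculation. The abstract does not say which interval method the authors used, so the Wilson score interval below is an assumption of this sketch and its bounds may differ from the reported 6.0% to 8.3% in the last digit.

```python
import math


def wilson_interval(successes: int, n: int, z: float = 1.959964) -> tuple[float, float]:
    """Wilson score interval for a binomial proportion (z = 1.959964 gives ~95%)."""
    if n <= 0 or not 0 <= successes <= n:
        raise ValueError("need n > 0 and 0 <= successes <= n")
    p = successes / n
    denom = 1 + z * z / n
    center = (p + z * z / (2 * n)) / denom
    half_width = z * math.sqrt(p * (1 - p) / n + z * z / (4 * n * n)) / denom
    return center - half_width, center + half_width


# Counts taken from the abstract: 149 MBL-positive donors out of 2098.
prevalence = 149 / 2098
low, high = wilson_interval(149, 2098)
print(f"prevalence = {prevalence:.1%}, approx. 95% CI = {low:.1%} to {high:.1%}")
```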
5' Rapid Amplification of cDNA Ends and Illumina MiSeq Reveals B Cell Receptor Features in Healthy Adults, Adults With Chronic HIV-1 Infection, Cord Blood, and Humanized Mice.

PubMed

Waltari, Eric; Jia, Manxue; Jiang, Caroline S; Lu, Hong; Huang, Jing; Fernandez, Cristina; Finzi, Andrés; Kaufmann, Daniel E; Markowitz, Martin; Tsuji, Moriya; Wu, Xueling

2018-01-01

Using 5' rapid amplification of cDNA ends, Illumina MiSeq, and basic flow cytometry, we systematically analyzed the expressed B cell receptor (BCR) repertoire in 14 healthy adult PBMCs, 5 HIV-1+ adult PBMCs, 5 cord blood samples, and 3 HIS-CD4/B mice, examining the full-length variable region of μ, γ, α, κ, and λ chains for V-gene usage, somatic hypermutation (SHM), and CDR3 length. Adding to the known repertoire of healthy adults, Illumina MiSeq consistently detected small fractions of reads with high mutation frequencies including hypermutated μ reads, and reads with long CDR3s. Additionally, the less studied IgA repertoire displayed similar characteristics to that of IgG. Compared to healthy adults, the five HIV-1 chronically infected adults displayed elevated mutation frequencies for all μ, γ, α, κ, and λ chains examined and slightly longer CDR3 lengths for γ, α, and λ. To evaluate the reconstituted human BCR sequences in a humanized mouse model, we analyzed cord blood and HIS-CD4/B mice, which all lacked the typical SHM seen in the adult reference. Furthermore, MiSeq revealed identical unmutated IgM sequences derived from separate cell aliquots, thus for the first time demonstrating rare clonal members of unmutated IgM B cells by sequencing.
No evidence for the use of DIR, D–D fusions, chromosome 15 open reading frames or VH replacement in the peripheral repertoire was found on application of an improved algorithm, JointML, to 6329 human immunoglobulin H rearrangements

PubMed Central

Ohm-Laursen, Line; Nielsen, Morten; Larsen, Stine R; Barington, Torben

2006-01-01

Antibody diversity is created by imprecise joining of the variable (V), diversity (D) and joining (J) gene segments of the heavy and light chain loci. Analysis of rearrangements is complicated by somatic hypermutations and uncertainty concerning the sources of gene segments and the precise way in which they recombine. It has been suggested that D genes with irregular recombination signal sequences (DIR) and chromosome 15 open reading frames (OR15) can replace conventional D genes, that two D genes or inverted D genes may be used and that the repertoire can be further diversified by heavy chain V gene (VH) replacement. Safe conclusions require large, well-defined sequence samples and algorithms minimizing stochastic assignment of segments. Two computer programs were developed for analysis of heavy chain joints. JointHMM is a profile hidden Markov model, while JointML is a maximum-likelihood-based method taking the lengths of the joint and the mutational status of the VH gene into account. The programs were applied to a set of 6329 clonally unrelated rearrangements. A conventional D gene was found in 80% of unmutated sequences and 64% of mutated sequences, while D-gene assignment was kept below 5% in artificial (randomly permutated) rearrangements. No evidence for the use of DIR, OR15, multiple D genes or VH replacements was found, while inverted D genes were used in less than 1‰ of the sequences. JointML was shown to have a higher predictive performance for D-gene assignment in mutated and unmutated sequences than four other publicly available programs. An online version 1.0 of JointML is available at http://www.cbs.dtu.dk/services/VDJsolver. PMID:17005006
Monoclonal B-cell lymphocytosis in healthy blood donors: an unexpectedly common finding

PubMed Central

Rachel, Jane M.; Ghia, Paolo; Boren, Jeff; Abbasi, Fatima; Dagklis, Antonis; Venable, Geri; Kang, Jiyeon; Degheidy, Heba; Plapp, Fred V.; Vogt, Robert F.; Menitove, Jay E.; Marti, Gerald E.

2014-01-01

Circulating monoclonal B cells may be detected in healthy adults, a condition called monoclonal B-cell lymphocytosis (MBL). MBL has also been identified in donated blood, but no systematic study of blood donors has been reported. Using sensitive and specific laboratory methods, we detected MBL in 149 (7.1%; 95% confidence interval, 6.0% to 8.3%) of 2098 unique donors ages 45 years or older in a Midwestern US regional blood center between 2010 and 2011. Most of the 149 donors had low-count MBL, including 99 chronic lymphocytic leukemia–like (66.4%), 22 atypical (14.8%), and 19 CD5– (12.8%) immunophenotypes. However, 5 donors (3.4%) had B-cell clonal counts above 500 cells per µL, including 3 with 1693 to 2887 cells per µL; the clone accounted for nearly all their circulating B cells. Four donors (2.7%) had 2 distinct MBL clones. Of 51 MBL samples in which immunoglobulin heavy chain (IGH)V-D-J genotypes could be determined, 71% and 29% used IGHV3- and IGHV4-family genes, respectively. Sequencing revealed 82% with somatic hypermutation, whereas 18% had >98% germ-line identity, including 5 with entirely germ-line sequences. In conclusion, MBL prevalence is much higher in blood donors than previously reported, and although uncommon, the presence of high-count MBL warrants further investigations to define the biological fate of the transfused cells in recipients. PMID:24345750
Genetics and Prognostication in Splenic Marginal Zone Lymphoma: Revelations from Deep Sequencing

PubMed Central

Gibson, Jane; Wang, Jun; Walewska, Renata; Parker, Helen; Parker, Anton; Davis, Zadie; Gardiner, Anne; McIver-Brown, Neil; Kalpadakis, Christina; Xochelli, Aliki; Anagnostopoulos, Achilles; Fazi, Claudia; de Castro, David Gonzalez; Dearden, Claire; Pratt, Guy; Rosenquist, Richard; Ashton-Key, Margaret; Forconi, Francesco; Collins, Andrew; Ghia, Paolo; Matutes, Estella; Pangalis, Gerassimos; Stamatopoulos, Kostas; Oscier, David; Strefford, Jonathan C

2015-01-01

Purpose: Mounting evidence supports the clinical significance of gene mutations and immunogenetic features in common mature B-cell malignancies. Experimental Design: We undertook a detailed characterization of the genetic background of splenic marginal zone lymphoma (SMZL), using targeted re-sequencing and explored potential clinical implications in a multinational cohort of 175 SMZL patients. Results: We identified recurrent mutations in TP53 (16%), KLF2 (12%), NOTCH2 (10%), TNFAIP3 (7%), MLL2 (11%), MYD88 (7%) and ARID1A (6%), all genes known to be targeted by somatic mutation in SMZL. KLF2 mutations were early, clonal events, enriched in patients with del(7q) and IGHV1-2*04 B-cell receptor immunoglobulins, and were associated with a short median time-to-first-treatment (0.12 vs. 1.11 yrs; P=0.01). In multivariate analysis mutations in NOTCH2 (HR 2.12, 95% CI 1.02-4.4, P=0.044) and 100% germline IGHV gene identity (HR 2.19, 95% CI 1.05-4.55, P=0.036) were independent markers of short time-to-first-treatment, while TP53 mutations were an independent marker of short overall survival (HR 2.36, 95% CI 1.08-5.2, P=0.03). Conclusion: We identify key associations between gene mutations and clinical outcome, demonstrating for the first time that NOTCH2 and TP53 gene mutations are independent markers of reduced treatment-free and overall survival, respectively. PMID:25779943
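The hazard ratios, confidence intervals and P values reported in this abstract can be checked against each other with a short calculation. The sketch assumes a Wald-type interval that is symmetric on the log scale, which the abstract does not state; the function name is hypothetical, and the result is only a consistency check, not a reanalysis.

```python
import math


def wald_z_and_p(hr: float, ci_low: float, ci_high: float,
                 z_crit: float = 1.959964) -> tuple[float, float]:
    """Recover an approximate z statistic and two-sided P value from a hazard ratio and its 95% CI."""
    # Standard error of log(HR), assuming the CI is log(HR) +/- z_crit * SE.
    se = (math.log(ci_high) - math.log(ci_low)) / (2 * z_crit)
    z = math.log(hr) / se
    # Two-sided normal tail probability.
    p = math.erfc(abs(z) / math.sqrt(2))
    return z, p


# Values taken from the abstract.
z_notch2, p_notch2 = wald_z_and_p(2.12, 1.02, 4.4)
z_germline, p_germline = wald_z_and_p(2.19, 1.05, 4.55)
print(f"NOTCH2: z = {z_notch2:.2f}, p = {p_notch2:.3f}")
print(f"100% germline IGHV: z = {z_germline:.2f}, p = {p_germline:.3f}")
```

Under this assumption the recovered P values land close to the reported 0.044 and 0.036; small differences are expected because the published hazard ratios and interval bounds are rounded.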
Structural Determination of the Broadly Reactive Anti-IGHV1-69 Anti-idiotypic Antibody G6 and Its Idiotope.

PubMed

Avnir, Yuval; Prachanronarong, Kristina L; Zhang, Zhen; Hou, Shurong; Peterson, Eric C; Sui, Jianhua; Zayed, Hatem; Kurella, Vinodh B; McGuire, Andrew T; Stamatatos, Leonidas; Hilbert, Brendan J; Bohn, Markus-Frederik; Kowalik, Timothy F; Jensen, Jeffrey D; Finberg, Robert W; Wang, Jennifer P; Goodall, Margaret; Jefferis, Roy; Zhu, Quan; Kurt Yilmaz, Nese; Schiffer, Celia A; Marasco, Wayne A

2017-12-12

The heavy chain IGHV1-69 germline gene exhibits a high level of polymorphism and shows biased use in protective antibody (Ab) responses to infections and vaccines. It is also highly expressed in several B cell malignancies and autoimmune diseases. G6 is an anti-idiotypic monoclonal Ab that selectively binds to IGHV1-69 heavy chain germline gene 51p1 alleles that have been implicated in these Ab responses and disease processes. Here, we determine the co-crystal structure of humanized G6 (hG6.3) in complex with anti-influenza hemagglutinin stem-directed broadly neutralizing Ab D80. The core of the hG6.3 idiotope is a continuous string of CDR-H2 residues starting with M53 and ending with N58. G6 binding studies demonstrate the remarkable breadth of binding to 51p1 IGHV1-69 Abs with diverse CDR-H3, light chain, and antigen binding specificities. These studies detail the broad expression of the G6 cross-reactive idiotype (CRI) that further define its potential role in precision medicine. Copyright © 2017 The Authors. Published by Elsevier Inc. All rights reserved.
Rapid Electrokinetic Isolation of Cancer-Related Circulating Cell-Free DNA Directly from Blood

PubMed Central

Sonnenberg, Avery; Marciniak, Jennifer Y.; Rassenti, Laura; Ghia, Emanuela M.; Skowronski, Elaine A.; Manouchehri, Sareh; McCanna, James; Widhopf, George F.; Kipps, Thomas J.; Heller, Michael J.

2014-01-01

BACKGROUND: Circulating cell-free DNA (ccf-DNA) is becoming an important biomarker for cancer diagnostics and therapy monitoring. The isolation of ccf-DNA from plasma as a “liquid biopsy” may begin to replace more invasive tissue biopsies for the detection and analysis of cancer-related mutations. Conventional methods for the isolation of ccf-DNA from plasma are costly, time-consuming, and complex, preventing the use of ccf-DNA biomarkers for point-of-care diagnostics and limiting other biomedical research applications. METHODS: We used an AC electrokinetic device to rapidly isolate ccf-DNA from 25 μL unprocessed blood. ccf-DNA from 15 chronic lymphocytic leukemia (CLL) patients and 3 healthy individuals was separated into dielectrophoretic (DEP) high-field regions, after which other blood components were removed by a fluidic wash. Concentrated ccf-DNA was detected by fluorescence and eluted for quantification, PCR, and DNA sequencing. The complete process, blood to PCR, required <10 min. ccf-DNA was amplified by PCR with immunoglobulin heavy chain variable region (IGHV)-specific primers to identify the unique IGHV gene expressed by the leukemic B-cell clone, and then sequenced. RESULTS: PCR and DNA sequencing results obtained by DEP from 25 μL CLL blood matched results obtained by use of conventional methods for ccf-DNA isolation from 1 mL plasma and for genomic DNA isolation from CLL patient leukemic B cells isolated from 15–20 mL blood. CONCLUSIONS: Rapid isolation of ccf-DNA directly from a drop of blood will advance disease-related biomarker research, accelerate the transition from tissue to liquid biopsies, and enable point-of-care diagnostic systems for patient monitoring. PMID:24270796
Rapid electrokinetic isolation of cancer-related circulating cell-free DNA directly from blood.

PubMed

Sonnenberg, Avery; Marciniak, Jennifer Y; Rassenti, Laura; Ghia, Emanuela M; Skowronski, Elaine A; Manouchehri, Sareh; McCanna, James; Widhopf, George F; Kipps, Thomas J; Heller, Michael J

2014-03-01

Circulating cell-free DNA (ccf-DNA) is becoming an important biomarker for cancer diagnostics and therapy monitoring. The isolation of ccf-DNA from plasma as a "liquid biopsy" may begin to replace more invasive tissue biopsies for the detection and analysis of cancer-related mutations. Conventional methods for the isolation of ccf-DNA from plasma are costly, time-consuming, and complex, preventing the use of ccf-DNA biomarkers for point-of-care diagnostics and limiting other biomedical research applications. We used an AC electrokinetic device to rapidly isolate ccf-DNA from 25 μL unprocessed blood. ccf-DNA from 15 chronic lymphocytic leukemia (CLL) patients and 3 healthy individuals was separated into dielectrophoretic (DEP) high-field regions, after which other blood components were removed by a fluidic wash. Concentrated ccf-DNA was detected by fluorescence and eluted for quantification, PCR, and DNA sequencing. The complete process, blood to PCR, required <10 min. ccf-DNA was amplified by PCR with immunoglobulin heavy chain variable region (IGHV)-specific primers to identify the unique IGHV gene expressed by the leukemic B-cell clone, and then sequenced. PCR and DNA sequencing results obtained by DEP from 25 μL CLL blood matched results obtained by use of conventional methods for ccf-DNA isolation from 1 mL plasma and for genomic DNA isolation from CLL patient leukemic B cells isolated from 15-20 mL blood. Rapid isolation of ccf-DNA directly from a drop of blood will advance disease-related biomarker research, accelerate the transition from tissue to liquid biopsies, and enable point-of-care diagnostic systems for patient monitoring.
Analysis of the IgV(H) somatic mutations in splenic marginal zone lymphoma defines a group of unmutated cases with frequent 7q deletion and adverse clinical course.

PubMed

Algara, Patricia; Mateo, Marisol S; Sanchez-Beato, Margarita; Mollejo, Manuela; Navas, Immaculada C; Romero, Lourdes; Solé, Francesc; Salido, Marta; Florensa, Lourdes; Martínez, Pedro; Campo, Elias; Piris, Miguel A

2002-02-15

This study aimed to correlate the frequency of somatic mutations in the IgV(H) gene and the use of specific segments in the V(H) repertoire with the clinical and characteristic features of a series of 35 cases of splenic marginal zone lymphoma (SMZL). The cases were studied by seminested polymerase chain reaction by using primers from the FR1 and J(H) region. The results showed unexpected molecular heterogeneity in this entity, with 49% unmutated cases (less than 2% somatic mutations). The 7q31 deletions and a shorter overall survival were more frequent in this group. Additionally a high percentage (18 of 40 sequences) of SMZL cases showed usage of the V(H)1-2 segment, thereby emphasizing the singularity of this neoplasia, suggesting that this tumor derives from a highly selected B-cell population and encouraging the search for specific antigens that are pathogenically relevant in the genesis or progression of this tumor.
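The "unmutated" label above rests on a simple rule: fewer than 2% somatic mutations relative to the germline V(H) segment, i.e. more than 98% germline identity. A minimal sketch of that rule follows. It assumes the sequences are already aligned to equal length, and the function names and toy sequences are hypothetical; real analyses rely on dedicated immunoglobulin alignment tools rather than a position-by-position comparison.

```python
def percent_identity(query: str, germline: str) -> float:
    """Percent of aligned, ungapped positions where the query matches germline."""
    if len(query) != len(germline):
        raise ValueError("sequences must be pre-aligned to equal length")
    pairs = [(q, g) for q, g in zip(query.upper(), germline.upper())
             if q != "-" and g != "-"]
    if not pairs:
        raise ValueError("no comparable positions")
    matches = sum(q == g for q, g in pairs)
    return 100.0 * matches / len(pairs)

def mutational_status(identity: float, cutoff: float = 98.0) -> str:
    # Assumption: identity strictly above the cutoff (under 2% mutations) is "unmutated".
    return "unmutated" if identity > cutoff else "mutated"

# Hypothetical 100-nt toy sequences, not real IGHV genes.
germline = "ACGT" * 25
one_mismatch = germline[:99] + "A"      # last base changed: 99% identity
three_mismatches = "TTT" + germline[3:]  # first three bases changed: 97% identity

for name, seq in [("one_mismatch", one_mismatch), ("three_mismatches", three_mismatches)]:
    ident = percent_identity(seq, germline)
    print(name, f"{ident:.1f}%", mutational_status(ident))
```

How a sequence sitting exactly at the cutoff is classified differs between studies, which is why the comparison is kept in one clearly marked place.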
Efficacy of bendamustine and rituximab as first salvage treatment in chronic lymphocytic leukemia and indirect comparison with ibrutinib: a GIMEMA, ERIC and UK CLL FORUM study.

PubMed

Cuneo, Antonio; Follows, George; Rigolin, Gian Matteo; Piciocchi, Alfonso; Tedeschi, Alessandra; Trentin, Livio; Medina Perez, Angeles; Coscia, Marta; Laurenti, Luca; Musuraca, Gerardo; Farina, Lucia; Rivas Delgado, Alfredo; Orlandi, Ester Maria; Galieni, Piero; Mauro, Francesca Romana; Visco, Carlo; Amendola, Angela; Billio, Atto; Marasca, Roberto; Chiarenza, Annalisa; Meneghini, Vittorio; Ilariucci, Fiorella; Marchetti, Monia; Molica, Stefano; Re, Francesca; Gaidano, Gianluca; Gonzalez, Marcos; Forconi, Francesco; Ciolli, Stefania; Cortelezzi, Agostino; Montillo, Marco; Smolej, Lukas; Schuh, Anna; Eyre, Toby A; Kennedy, Ben; Bowles, Kris M; Vignetti, Marco; de la Serna, Javier; Moreno, Carol; Foà, Robin; Ghia, Paolo

2018-04-19

We performed an observational study on the efficacy of bendamustine and rituximab as first salvage regimen in chronic lymphocytic leukemia. In an intention-to-treat analysis including 237 patients, the median progression free survival was 25 months. The presence of del(17p), unmutated IGHV and advanced stage were associated with a shorter progression free survival at multivariate analysis. The median time-to-next treatment was 31.3 months. Front-line treatment with a chemoimmunotherapy regimen was the only predictive factor for a shorter time to next treatment at multivariate analysis. The median overall survival was 74.5 months. Advanced Binet stage (i.e. III-IV or C) and resistant disease were the only parameters significantly associated with a shorter OS. Grade 3-5 infections were recorded in 6.3% of patients. A matched-adjusted indirect comparison with ibrutinib given second-line within named patient programs in the United Kingdom and in Italy was carried out with overall survival as objective endpoint. When restricting the analysis to patients with intact 17p who had received chemoimmunotherapy in first line, the overall survival did not differ between patients treated with ibrutinib (63% alive at 36 months) and patients treated with BR (74.4% alive at 36 months). BR is an efficacious first salvage regimen in chronic lymphocytic leukemia in a real-life population, including the elderly and unfit patients. BR and ibrutinib may be equally effective in terms of overall survival when used as first salvage treatment in patients without 17p deletion. ClinicalTrials.gov identifier: NCT02491398. Copyright © 2018, Ferrata Storti Foundation.
The dual Syk/JAK inhibitor cerdulatinib antagonises B-cell receptor and microenvironmental signaling in chronic lymphocytic leukemia

PubMed Central

Blunt, Matthew D; Koehrer, Stefan; Dobson, Rachel; Larrayoz, Marta; Wilmore, Sarah; Hayman, Alice; Parnell, Jack; Smith, Lindsay; Davies, Andrew; Johnson, Peter W; Conley, Pamela B; Pandey, Anjali; Strefford, Jon C; Stevenson, Freda K; Packham, Graham; Forconi, Francesco; Coffey, Greg; Burger, Jan A; Steele, Andrew J

2017-01-01

Purpose: B-cell receptor (BCR)-associated kinase inhibitors such as ibrutinib have revolutionised the treatment of chronic lymphocytic leukemia (CLL). However, these agents are not curative and resistance is already emerging in a proportion of patients. Interleukin-4 (IL-4), expressed in CLL lymph nodes, can augment BCR-signalling and reduce the effectiveness of BCR-kinase inhibitors. Therefore simultaneous targeting of the IL-4- and BCR-signalling pathways by cerdulatinib, a novel dual Syk/JAK inhibitor currently in clinical trials (NCT01994382), may improve treatment responses in patients. Experimental Design: PBMCs from CLL patients were treated with cerdulatinib alone or in combination with venetoclax. Cell death, chemokine and cell signalling assays were performed and analysed by flow cytometry, immunoblotting, Q-PCR and ELISA as indicated. Results: At concentrations achievable in patients, cerdulatinib inhibited BCR- and IL-4-induced downstream signalling in CLL cells using multiple read-outs and prevented anti-IgM- and nurse-like cell (NLC)-mediated CCL3/CCL4 production. Cerdulatinib induced apoptosis of CLL cells, in a time- and concentration-dependent manner, and particularly in IGHV unmutated samples with greater BCR-signalling capacity and response to IL-4, or samples expressing higher levels of sIgM, CD49d+ or ZAP70+. Cerdulatinib overcame anti-IgM, IL-4/CD40L or NLC-mediated protection by preventing upregulation of MCL-1 and BCL-XL; however, BCL-2 expression was unaffected. Furthermore, in samples treated with IL-4/CD40L, cerdulatinib synergised with venetoclax in vitro to induce greater apoptosis than either drug alone. Conclusion: Cerdulatinib is a promising therapeutic for the treatment of CLL either alone or in combination with venetoclax, with the potential to target critical survival pathways in this currently incurable disease. PMID:27697994
Del(20q) in patients with chronic lymphocytic leukemia: a therapy-related abnormality involving lymphoid or myeloid cells.

PubMed

Yin, C Cameron; Tang, Guilin; Lu, Gary; Feng, Xiaoli; Keating, Michael J; Medeiros, L Jeffrey; Abruzzo, Lynne V

2015-08-01

Deletion 20q (Del(20q)), a common cytogenetic abnormality in myeloid neoplasms, is rare in chronic lymphocytic leukemia. We report 64 patients with chronic lymphocytic leukemia and del(20q), as the sole abnormality in 40, a stemline abnormality in 21, and a secondary abnormality in 3 cases. Fluorescence in situ hybridization (FISH) analysis revealed an additional high-risk abnormality, del(11q) or del(17p), in 25/64 (39%) cases. In most cases, the leukemic cells showed atypical cytologic features, unmutated IGHV (immunoglobulin heavy-chain variable region) genes, and ZAP70 positivity. The del(20q) was detected only after chemotherapy in all 27 cases with initial karyotypes available. With a median follow-up of 90 months, 30 patients (47%) died, most as a direct consequence of chronic lymphocytic leukemia. Eight patients developed a therapy-related myeloid neoplasm, seven with a complex karyotype. Combined morphologic and FISH analysis for del(20q) performed in 12 cases without morphologic evidence of a myeloid neoplasm localized the del(20q) to the chronic lymphocytic leukemia cells in 5 (42%) cases, and to myeloid/erythroid cells in 7 (58%) cases. The del(20q) was detected in myeloid cells in all 4 cases of myelodysplastic syndrome. In aggregate, these data indicate that chronic lymphocytic leukemia with del(20q) acquired after therapy is heterogeneous. In cases with morphologic evidence of dysplasia, the del(20q) likely resides in the myeloid lineage. However, in cases without morphologic evidence of dysplasia, the del(20q) may represent clonal evolution and disease progression. Combining morphologic analysis with FISH for del(20q) or performing FISH on immunomagnetically selected sub-populations to localize the cell population with this abnormality may help guide patient management.
Single-agent ibrutinib in treatment-naïve and relapsed/refractory chronic lymphocytic leukemia: a 5-year experience

PubMed Central

Furman, Richard R.; Coutre, Steven; Flinn, Ian W.; Burger, Jan A.; Blum, Kristie; Sharman, Jeff; Wierda, William; Jones, Jeffrey; Zhao, Weiqiang; Heerema, Nyla A.; Johnson, Amy J.; Luan, Ying; James, Danelle F.; Chu, Alvina D.; Byrd, John C.

2018-01-01

We previously reported durable responses and manageable safety of ibrutinib from a 3-year follow-up of treatment-naïve (TN) older patients (≥65 years of age) and relapsed/refractory (R/R) patients with chronic lymphocytic leukemia/small lymphocytic lymphoma (CLL/SLL). We now report on long-term efficacy and safety with median follow-up of 5 years in this patient population with TN (N = 31) and R/R (N = 101) CLL/SLL. With the current 5-year follow-up, ibrutinib continues to yield a high overall response rate of 89%, with complete response rates increasing over time to 29% in TN patients and 10% in R/R patients. The median progression-free survival (PFS) was not reached in TN patients. The 5-year PFS rate was 92% in TN patients and 44% in R/R patients. Median PFS in R/R patients was 51 months; in those with del(11q), del(17p), and unmutated IGHV, it was 51, 26, and 43 months, respectively, demonstrating long-term efficacy of ibrutinib in some high-risk subgroups. Survival outcomes were less robust for R/R patients with del(17p) and those who received more prior therapies. The onset of grade ≥3 cytopenias, such as neutropenia and thrombocytopenia, decreased over time. Treatment-limiting adverse events were more frequent during the first year compared with subsequent periods. These results demonstrate sustained efficacy and acceptable tolerability of ibrutinib over an extended time, providing the longest experience for Bruton tyrosine kinase inhibitor treatment in patients with CLL/SLL. These trials were registered at www.clinicaltrials.gov as #NCT01105247 and #NCT01109069. PMID:29437592
Del(20q) in patients with chronic lymphocytic leukemia: A therapy-related abnormality involving lymphoid or myeloid cells

PubMed Central

Yin, C. Cameron; Tang, Guilin; Lu, Gary; Feng, Xiaoli; Keating, Michael J.; Medeiros, L. Jeffrey; Abruzzo, Lynne V.

2015-01-01

Del(20q), a common cytogenetic abnormality in myeloid neoplasms, is rare in chronic lymphocytic leukemia. We report 64 patients with chronic lymphocytic leukemia and del(20q), as the sole abnormality in 40, a stemline abnormality in 21, and a secondary abnormality in 3 cases. FISH analysis revealed an additional high-risk abnormality, del(11q) or del(17p), in 27/64 (42%) cases. In most cases, the leukemic cells showed atypical cytologic features, unmutated IGHV genes and ZAP70 positivity. The del(20q) was detected only after chemotherapy in all 27 cases with initial karyotypes available. With a median follow-up of 90 months, 30 patients (47%) died, most as a direct consequence of chronic lymphocytic leukemia. Eight patients developed a therapy-related myeloid neoplasm, seven with a complex karyotype. Combined morphologic and FISH analysis for del(20q) performed in 12 cases without morphologic evidence of a myeloid neoplasm localized the del(20q) to the chronic lymphocytic leukemia cells in 5 (42%) cases, and to myeloid/erythroid cells in 7 (58%) cases. The del(20q) was detected in myeloid cells in all 4 cases of myelodysplastic syndrome. In aggregate, these data indicate that chronic lymphocytic leukemia with del(20q) acquired after therapy is heterogeneous. In cases with morphologic evidence of dysplasia, the del(20q) likely resides in the myeloid lineage. However, in cases without morphologic evidence of dysplasia, the del(20q) may represent clonal evolution and disease progression. Combining morphologic analysis with FISH for del(20q) or performing FISH on immunomagnetically-selected subpopulations to localize the cell population with this abnormality may help guide patient management. PMID:25953391

ZAP-70 compared with immunoglobulin heavy-chain gene mutation status as a predictor of disease progression in chronic lymphocytic leukemia.

PubMed

Rassenti, Laura Z; Huynh, Lang; Toy, Tracy L; Chen, Liguang; Keating, Michael J; Gribben, John G; Neuberg, Donna S; Flinn, Ian W; Rai, Kanti R; Byrd, John C; Kay, Neil E; Greaves, Andrew; Weiss, Arthur; Kipps, Thomas J

2004-08-26

The course of chronic lymphocytic leukemia (CLL) is variable. In aggressive disease, the CLL cells usually express an unmutated immunoglobulin heavy-chain variable-region gene (IgV(H)) and the 70-kD zeta-associated protein (ZAP-70), whereas in indolent disease, the CLL cells usually express mutated IgV(H) but lack expression of ZAP-70. We evaluated the CLL B cells from 307 patients with CLL for ZAP-70 and mutations in the rearranged IgV(H) gene. We then investigated the association between the results and the time from diagnosis to initial therapy. We found that ZAP-70 was expressed above a defined threshold level in 117 of the 164 patients with an unmutated IgV(H) gene (71 percent), but in only 24 of the 143 patients with a mutated IgV(H) gene (17 percent, P<0.001). Among the patients with ZAP-70-positive CLL cells, the median time from diagnosis to initial therapy in those who had an unmutated IgV(H) gene (2.8 years) was not significantly different from the median time in those who had a mutated IgV(H) gene (4.2 years, P=0.07). However, the median time from diagnosis to initial treatment in each of these groups was significantly shorter than the time in patients with ZAP-70-negative CLL cells who had either mutated or unmutated IgV(H) genes (P<0.001). The median time from diagnosis to initial therapy among patients who did not have ZAP-70 was 11.0 years in those with a mutated IgV(H) gene and 7.1 years in those with an unmutated IgV(H) gene (P<0.001). Although the presence of an unmutated IgV(H) gene is strongly associated with the expression of ZAP-70, ZAP-70 is a stronger predictor of the need for treatment in B-cell CLL. Copyright 2004 Massachusetts Medical Society
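The association between ZAP-70 expression and IgV(H) status reported above (117 of 164 unmutated versus 24 of 143 mutated cases) can be reproduced as a 2x2 table. The abstract does not name the statistical test, so the Pearson chi-square test without continuity correction used here is an assumption, and `chi_square_2x2` is an illustrative helper, not the authors' method.

```python
import math

def chi_square_2x2(a: int, b: int, c: int, d: int) -> tuple[float, float]:
    """Pearson chi-square (1 degree of freedom, no continuity correction).

    Table layout: [[a, b], [c, d]]. For 1 df the upper-tail probability
    equals erfc(sqrt(chi2 / 2)).
    """
    n = a + b + c + d
    chi2 = n * (a * d - b * c) ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))
    p_value = math.erfc(math.sqrt(chi2 / 2))
    return chi2, p_value

# Rows: unmutated (164 patients), mutated (143 patients).
# Columns: ZAP-70 positive, ZAP-70 negative (negatives derived by subtraction).
chi2, p_value = chi_square_2x2(117, 164 - 117, 24, 143 - 24)
print(f"unmutated ZAP-70+: {117 / 164:.0%}, mutated ZAP-70+: {24 / 143:.0%}")
print(f"chi-square = {chi2:.1f}, p = {p_value:.2e}")
```

Under these assumptions the p-value is far below 0.001, in line with the significance level the abstract reports.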
Comprehensive characterization of immunoglobulin gene rearrangements in patients with chronic lymphocytic leukaemia

PubMed Central

René, Céline; Prat, Nathalie; Thuizat, Audrey; Broctawik, Mélanie; Avinens, Odile; Eliaou, Jean-François

2014-01-01

Previous studies have suggested a geographical pattern of immunoglobulin rearrangement in chronic lymphocytic leukaemia (CLL), which could result from genetic background or an environmental antigen. However, the characteristics of Ig rearrangements in the population from the South of France have not yet been established. Here, we studied the CLL B-cell repertoire and mutational pattern in a Southern French cohort of patients using an in-house protocol for whole sequencing of the rearranged immunoglobulin heavy-chain genes. The previously described biased usage of variable, diversity and joining genes between the mutated and unmutated groups was found in our population. However, variable gene frequencies are more in accordance with those observed in Mediterranean patients. We found that the third complementarity-determining region (CDR) length was higher in unmutated sequences, because of bias in the diversity and joining gene usage and not due to N diversity. Mutations found in CLL followed the features of the canonical somatic hypermutation mechanism: preferential targeting of activation-induced cytidine deaminase and polymerase motifs, a base-change bias towards transitions, and more replacement mutations in CDRs than in framework regions. Surprisingly, localization of activation-induced cytidine deaminase motifs onto the variable gene showed a preference for framework regions. The study of the characteristics at the age of diagnosis showed no difference in clinical outcome, but suggested a tendency towards increased replacement and transition-over-transversion mutations and a longer third CDR in older patients. PMID:24725733
Pseudo-Peritoneal Carcinomatosis Presentation of a Crystal-Storing Histiocytosis With an Unmutated Monoclonal κ Light Chain

PubMed Central

Aline-Fardin, Aude; Bender, Sebastien; Fabiani, Bettina; Buob, David; Brahimi, Said; Verpont, Marie Christine; Mothy, Mohamad; Ronco, Pierre; Boffa, Jean Jacques; Aucouturier, Pierre; Garderet, Laurent

2015-01-01

Crystal-storing histiocytosis (CSH) is a rare complication of monoclonal gammopathies caused by accumulation of crystalline material inside macrophages, and it may result in a variety of clinical manifestations depending on the involved organs. Although immunoglobulin κ light chains (LCs) seem to be the most frequent pathogenic component, very few molecular data are currently available. A 69-year-old man presented with a very poor performance status. Remarkable features were mesenteric lymph node enlargement and proteinuria, including a monoclonal κ LC. Light and electron microscopy studies revealed the presence of crystals within macrophages in the lymph nodes, bone marrow, and kidney, leading to the diagnosis of CSH. The pathogenic κ LC variable domain sequence was identical to the germline Vk3-20*01/Jk2*01 gene segments, without any somatic mutation, suggesting an extra-follicular B cell proliferation. The patient was successfully treated with 4 cycles of bortezomib and dexamethasone. After a 12-month follow-up, he remains in hematological and renal remission. CSH may present as pseudo-peritoneal carcinomatosis and relate to a monoclonal κ LC encoded by an unmutated gene. Bortezomib-based therapy proved efficacious in this case. PMID:26266355
IGHV1-69-Encoded Antibodies Expressed in Chronic Lymphocytic Leukemia React with Malondialdehyde–Acetaldehyde Adduct, an Immunodominant Oxidation-Specific Epitope

PubMed Central

Amir, Shahzada; Hartvigsen, Karsten; Hansen, Lotte F.; Woelkers, Douglas; Tsimikas, Sotirios; Binder, Christoph J.; Kipps, Thomas J.; Witztum, Joseph L.

2013-01-01

The immunoglobulins expressed by chronic lymphocytic leukemia (CLL) B cells are highly restricted, suggesting they are selected for binding either self or foreign antigen. Of the immunoglobulin heavy-chain variable (IGHV) genes expressed in CLL, IGHV1-69 is the most common, and often is expressed with little or no somatic mutation, and restricted IGHD and IGHJ gene usage. We found that antibodies encoded by one particular IGHV1-69 subset, designated CLL69C, with the HCDR3 encoded by the IGHD3-3 gene in reading frame 2 and IGHJ6, specifically bound to oxidation-specific epitopes (OSE), which are products of enhanced lipid peroxidation and a major target of innate natural antibodies. Specifically, CLL69C bound immunodominant OSE adducts termed MAA (malondialdehyde–acetaldehyde-adducts), which are found on apoptotic cells, inflammatory tissues, and atherosclerotic lesions. It also reacted specifically with MAA-specific peptide mimotopes. Light chain shuffling indicated that non-stochastically paired L chain of IGLV3-9 contributes to the antigen binding of CLL69C. A nearly identical CLL69C Ig heavy chain was identified from an MAA-enriched umbilical cord phage displayed Fab library, and a derived Fab with the same HCDR3 rearrangement displayed identical MAA-binding properties. These data support the concept that OSE (MAA-epitopes), which are ubiquitous products of inflammation, may play a role in clonal selection and expansion of CLL B cells. PMID:23840319
Identification and validation of biomarkers of IgV(H) mutation status in chronic lymphocytic leukemia using microfluidics quantitative real-time polymerase chain reaction technology.

PubMed

Abruzzo, Lynne V; Barron, Lynn L; Anderson, Keith; Newman, Rachel J; Wierda, William G; O'brien, Susan; Ferrajoli, Alessandra; Luthra, Madan; Talwalkar, Sameer; Luthra, Rajyalakshmi; Jones, Dan; Keating, Michael J; Coombes, Kevin R

2007-09-01

To develop a model incorporating relevant prognostic biomarkers for untreated chronic lymphocytic leukemia patients, we re-analyzed the raw data from four published gene expression profiling studies. We selected 88 candidate biomarkers linked to immunoglobulin heavy-chain variable region gene (IgV(H)) mutation status and produced a reliable and reproducible microfluidics quantitative real-time polymerase chain reaction array. We applied this array to a training set of 29 purified samples from previously untreated patients. In an unsupervised analysis, the samples clustered into two groups. Using a cutoff point of 2% homology to the germline IgV(H) sequence, one group contained all 14 IgV(H)-unmutated samples; the other contained all 15 mutated samples. We confirmed the differential expression of 37 of the candidate biomarkers using two-sample t-tests. Next, we constructed 16 different models to predict IgV(H) mutation status and evaluated their performance on an independent test set of 20 new samples. Nine models correctly classified 11 of 11 IgV(H)-mutated cases and eight of nine IgV(H)-unmutated cases, with some models using three to seven genes. Thus, we can classify cases with 95% accuracy based on the expression of as few as three genes.
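The 95% figure above follows directly from the test-set counts in the abstract (11 of 11 mutated and 8 of 9 unmutated cases classified correctly). A small worked check, with variable names chosen only for illustration:

```python
# Test-set results for the best-performing models, as reported in the abstract.
correct_mutated, total_mutated = 11, 11
correct_unmutated, total_unmutated = 8, 9

total_cases = total_mutated + total_unmutated
accuracy = (correct_mutated + correct_unmutated) / total_cases

# Per-class rates show where the single misclassification falls.
mutated_rate = correct_mutated / total_mutated
unmutated_rate = correct_unmutated / total_unmutated

print(f"overall accuracy: {accuracy:.0%} ({correct_mutated + correct_unmutated}/{total_cases})")
print(f"mutated correctly called: {mutated_rate:.0%}")
print(f"unmutated correctly called: {unmutated_rate:.1%}")
```

With only 20 test samples, one misclassified case shifts overall accuracy by 5 percentage points, so the headline number carries considerable uncertainty.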
Association between a common immunoglobulin heavy chain allele and rheumatic heart disease risk in Oceania

PubMed Central

Parks, Tom; Mirabel, Mariana M.; Kado, Joseph; Auckland, Kathryn; Nowak, Jaroslaw; Rautanen, Anna; Mentzer, Alexander J.; Marijon, Eloi; Jouven, Xavier; Perman, Mai Ling; Cua, Tuliana; Kauwe, John K.; Allen, John B.; Taylor, Henry; Robson, Kathryn J.; Deane, Charlotte M.; Steer, Andrew C.; Hill, Adrian V. S.; Allen, Lori; Allen, Marvin; Braunstein, Corinne; Colquhoun, Samantha M.; Jewine, Aurélia; Ah Kee, Maureen; Kumar, Rina; John Martin, William; Mataika, Reapi; Nadra, Marie; Nadu, Shahin; Naseri, Take; Noël, Baptiste; Simon, Nathalie; Ward, Brenton

2017-01-01

The indigenous populations of the South Pacific experience a high burden of rheumatic heart disease (RHD). Here we report a genome-wide association study (GWAS) of RHD susceptibility in 2,852 individuals recruited in eight Oceanian countries. Stratifying by ancestry, we analysed genotyped and imputed variants in Melanesians (607 cases and 1,229 controls) before follow-up of suggestive loci in three further ancestral groups: Polynesians, South Asians and Mixed or other populations (totalling 399 cases and 617 controls). We identify a novel susceptibility signal in the immunoglobulin heavy chain (IGH) locus centring on a haplotype of nonsynonymous variants in the IGHV4-61 gene segment corresponding to the IGHV4-61*02 allele. We show each copy of IGHV4-61*02 is associated with a 1.4-fold increase in the risk of RHD (odds ratio 1.43, 95% confidence intervals 1.27–1.61, P=4.1 × 10−9). These findings provide new insight into the role of germline variation in the IGH locus in disease susceptibility. PMID:28492228
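The odds ratio, confidence interval and P value reported above are mutually consistent, which can be checked by backing out the standard error of the log odds ratio from the interval. This sketch assumes a normal approximation on the log scale (a Wald interval) and works from the rounded figures in the abstract, so it recovers the P value only to the right order of magnitude. The function name is hypothetical.

```python
import math

def wald_p_from_ci(odds_ratio: float, ci_low: float, ci_high: float,
                   z_crit: float = 1.96) -> tuple[float, float, float]:
    """Recover SE(log OR), z statistic and two-sided p from an OR and its 95% CI."""
    se = (math.log(ci_high) - math.log(ci_low)) / (2 * z_crit)
    z = math.log(odds_ratio) / se
    p_value = math.erfc(abs(z) / math.sqrt(2))  # two-sided normal tail
    return se, z, p_value

# Rounded values from the abstract: OR 1.43, 95% CI 1.27 to 1.61.
se, z, p_value = wald_p_from_ci(1.43, 1.27, 1.61)
print(f"SE(log OR) = {se:.3f}, z = {z:.2f}, p = {p_value:.1e}")
```

The recovered P value is of the order of 10^-9, matching the scale of the reported 4.1 × 10−9; exact agreement is not expected because the inputs are rounded to two decimals.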
Inhibition of maternal embryonic leucine zipper kinase with OTSSP167 displays potent anti-leukemic effects in chronic lymphocytic leukemia.

PubMed

Zhang, Ya; Zhou, Xiangxiang; Li, Ying; Xu, Yangyang; Lu, Kang; Li, Peipei; Wang, Xin

2018-06-12

TP53 pathway defects contribute to therapy resistance and adverse clinical outcome in chronic lymphocytic leukemia (CLL), which represents an unmet clinical need with few therapeutic options. Maternal embryonic leucine zipper kinase (MELK) is a novel oncogene, which plays crucial roles in mitotic progression and stem cell maintenance. OTSSP167, an orally administered inhibitor targeting MELK, is currently in a phase I/II clinical trial in patients with advanced breast cancer and acute myeloid leukemia. Yet, no investigation to date has elucidated the oncogenic role of MELK or the effects of OTSSP167 in CLL. Previous studies confirmed that MELK inhibition abrogated cancer cell survival via the p53 signaling pathway. Thus, we aimed to determine the biological function of MELK and the therapeutic potential of OTSSP167 in CLL. Herein, MELK over-expression was observed in CLL cells, and correlated with higher WBC count, advanced stage, elevated LDH, increased β2-MG level, unmutated IGHV, positive ZAP-70, deletion of 17p13 and inferior prognosis of CLL patients. In accordance with functional enrichment analyses in gene expression profiling, CLL cells with depletion or inhibition of MELK exhibited impaired cell proliferation, enhanced fast-onset apoptosis, induced G2/M arrest, attenuated cell chemotaxis and promoted sensitivity to fludarabine and ibrutinib. Conversely, gain-of-function assays showed increased cell proliferation and cell chemotaxis. In addition, OTSSP167 treatment reduced phosphorylation of AKT and ERK1/2. It decreased FoxM1 phosphorylation and expression of FoxM1, cyclin B1 and CDK1, while up-regulating p53 and p21 expression. Taken together, MELK served as a candidate therapeutic target in CLL. OTSSP167 exhibits potent anti-tumor activities in CLL cells, highlighting a novel molecule-based strategy for leukemic interventions.
ZAP-70 staining in chronic lymphocytic leukemia.

PubMed

Villamor, Neus

2005-05-01

Chronic lymphocytic leukemia (CLL) is the most common chronic leukemia in Western countries. The disease has an extremely variable clinical course, and several prognostic features have been identified to assess individual risk. The configuration of the immunoglobulin variable heavy-chain gene (IgV(H)) is a strong predictor of the outcome. CLL patients with unmutated IgV(H) status have an aggressive clinical course and a short survival. Unfortunately, analysis of IgV(H) gene configuration is not available in most clinical laboratories. A small number of genes are differentially expressed between unmutated IgV(H) and mutated IgV(H) clinical forms of CLL. One of these genes is ZAP-70, which is detected in leukemic cells from patients with the unmutated IgV(H) form of CLL. Flow cytometry presents advantages over other methods to detect ZAP-70, and its quantification by flow cytometry has proved its predictive value. This unit focuses on protocols to quantify ZAP-70 by flow cytometry in CLL.
Focused Evolution of HIV-1 Neutralizing Antibodies Revealed by Structures and Deep Sequencing

DOE Office of Scientific and Technical Information (OSTI.GOV)

Wu, Xueling; Zhou, Tongqing; Zhu, Jiang

2013-03-04

Antibody VRC01 is a human immunoglobulin that neutralizes about 90% of HIV-1 isolates. To understand how such broadly neutralizing antibodies develop, we used x-ray crystallography and 454 pyrosequencing to characterize additional VRC01-like antibodies from HIV-1-infected individuals. Crystal structures revealed a convergent mode of binding for diverse antibodies to the same CD4-binding-site epitope. A functional genomics analysis of expressed heavy and light chains revealed common pathways of antibody-heavy chain maturation, confined to the IGHV1-2*02 lineage, involving dozens of somatic changes, and capable of pairing with different light chains. Broadly neutralizing HIV-1 immunity associated with VRC01-like antibodies thus involves the evolution of antibodies to a highly affinity-matured state required to recognize an invariant viral structure, with lineages defined from thousands of sequences providing a genetic roadmap of their development.
Fundamental characteristics of the expressed immunoglobulin VH and VL repertoire in different canine breeds in comparison with those of humans and mice.

PubMed

Steiniger, Sebastian C J; Dunkle, William E; Bammert, Gary F; Wilson, Thomas L; Krishnan, Abhiram; Dunham, Steven A; Ippolito, Gregory C; Bainbridge, Graeme

2014-05-01

Complementarity determining regions (CDR) are responsible for binding antigen and provide substantial diversity to the antibody repertoire, with VH CDR3 of the immunoglobulin variable heavy (VH) domain playing a dominant role. In this study, we examined 1200 unique canine VH and 500 unique variable light (VL) sequences of large and small canine breeds derived from peripheral B cells. Unlike the human and murine repertoire, the canine repertoire is heavily dominated by the Canis lupus familiaris IGHV1 subgroup, evolutionarily closest to the human IGHV3 subgroup. Our studies clearly show that the productive canine repertoire of all analyzed breeds shows similarities to both human and mouse; however, there are distinct differences in terms of VH CDR3 length and amino acid paratope composition. In comparison with the human and murine antibody repertoire, canine VH CDR3 regions are shorter in length than the human counterparts, but longer than the murine VH CDR3. Similar to corresponding human and mouse VH CDR3, the amino acids at the base of the VH CDR3 loop are strictly conserved. For identical CDR positions, there were significant changes in chemical paratope composition. Similar to human and mouse repertoires, the neutral amino acids tyrosine, glycine and serine dominate the canine VH CDR3 interval (comprising 35%) although the interval is nonetheless relatively depleted of tyrosine when compared to human and mouse. Furthermore, canine VH CDR3 displays an overrepresentation of the neutral amino acid threonine and the negatively charged aspartic acid while proline content is similar to that in the human repertoire. In general, the canine repertoire shows a bias towards small, negatively charged amino acids. Overall, this analysis suggests that functional canine therapeutic antibodies can be obtained from human and mouse sequences by methods of speciation and affinity maturation. Copyright © 2014 Elsevier Ltd. All rights reserved.
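The composition figure quoted above (tyrosine, glycine and serine together making up about 35% of the canine VH CDR3 interval) is a pooled residue fraction. The following is a minimal sketch of how such a fraction could be computed; the function name `residue_fraction` and the two example CDR3 strings are hypothetical and are not data from the study.

```python
from collections import Counter

def residue_fraction(cdr3_seqs, residues="YGS"):
    """Fraction of all residues in the pooled CDR3 sequences that belong to `residues`."""
    counts = Counter("".join(cdr3_seqs))
    total = sum(counts.values())
    if total == 0:
        return 0.0
    return sum(counts[r] for r in residues) / total

# Hypothetical CDR3 amino acid sequences, for illustration only
example = ["ARDYGSSWYFDV", "AKGGYSTDY"]
print(f"Y+G+S fraction: {residue_fraction(example):.2f}")
```

Pooling residues across sequences weights longer CDR3s more heavily; averaging per-sequence fractions would be an equally reasonable choice, and the abstract does not say which was used.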
Clinical and molecular predictors of disease severity and survival in chronic lymphocytic leukemia.

PubMed

Weinberg, J Brice; Volkheimer, Alicia D; Chen, Youwei; Beasley, Bethany E; Jiang, Ning; Lanasa, Mark C; Friedman, Daphne; Vaccaro, Gina; Rehder, Catherine W; Decastro, Carlos M; Rizzieri, David A; Diehl, Louis F; Gockerman, Jon P; Moore, Joseph O; Goodman, Barbara K; Levesque, Marc C

2007-12-01

Several parameters may predict disease severity and overall survival in chronic lymphocytic leukemia (CLL). The purpose of our study of 190 CLL patients was to compare immunoglobulin heavy chain variable region (IgV(H)) mutation status, cytogenetic abnormalities, and leukemia cell CD38 and Zap-70 to older, traditional parameters. We also wanted to construct a simple, inexpensive prognosis score that would significantly predict time-to-treatment (TTT) and survival in patients at the time of diagnosis and help practicing clinicians. In univariate analyses, patients with higher clinical stage, higher leukocyte count at diagnosis, shorter leukocyte doubling time, elevated serum lactate dehydrogenase (LDH), unmutated IgV(H) genes, and higher CD38 had a shorter overall survival and TTT. CLL cell Zap-70 expression was higher in patients with unmutated IgV(H), and those with higher Zap-70 tended to have shorter survival. IgV(H)4-34 and IgV(H)1-69 were the most common IgV(H) genes used (16 and 12%, respectively). Of those with IgV(H)1-69, 86% had unmutated IgV(H) and had a significantly shorter TTT. A cytogenetic abnormality was noted in 71% of the patients tested. Patients with 11q22 del and 17p13 del or complex abnormalities were significantly more likely to have unmutated IgV(H). We found that a prognostic score constructed using modified Rai stage, cellular CD38, and serum LDH (parameters easily obtained clinically) significantly predicted TTT and survival in patients at the time of diagnosis and performed as well or better than models using the newer markers.
Production of individualized V gene databases reveals high levels of immunoglobulin genetic diversity

NASA Astrophysics Data System (ADS)

Corcoran, Martin M.; Phad, Ganesh E.; Bernat, Néstor Vázquez; Stahl-Hennig, Christiane; Sumida, Noriyuki; Persson, Mats A. A.; Martin, Marcel; Hedestam, Gunilla B. Karlsson

2016-12-01

Comprehensive knowledge of immunoglobulin genetics is required to advance our understanding of B cell biology. Validated immunoglobulin variable (V) gene databases are close to completion only for human and mouse. We present a novel computational approach, IgDiscover, that identifies germline V genes from expressed repertoires to a specificity of 100%. IgDiscover uses a cluster identification process to produce candidate sequences that, once filtered, results in individualized germline V gene databases. IgDiscover was tested in multiple species, validated by genomic cloning and cross library comparisons and produces comprehensive gene databases even where limited genomic sequence is available. IgDiscover analysis of the allelic content of the Indian and Chinese-origin rhesus macaques reveals high levels of immunoglobulin gene diversity in this species. Further, we describe a novel human IGHV3-21 allele and confirm significant gene differences between Balb/c and C57BL6 mouse strains, demonstrating the power of IgDiscover as a germline V gene discovery tool.
Production of individualized V gene databases reveals high levels of immunoglobulin genetic diversity

PubMed Central

Corcoran, Martin M.; Phad, Ganesh E.; Bernat, Néstor Vázquez; Stahl-Hennig, Christiane; Sumida, Noriyuki; Persson, Mats A.A.; Martin, Marcel; Hedestam, Gunilla B. Karlsson

2016-01-01

Comprehensive knowledge of immunoglobulin genetics is required to advance our understanding of B cell biology. Validated immunoglobulin variable (V) gene databases are close to completion only for human and mouse. We present a novel computational approach, IgDiscover, that identifies germline V genes from expressed repertoires to a specificity of 100%. IgDiscover uses a cluster identification process to produce candidate sequences that, once filtered, results in individualized germline V gene databases. IgDiscover was tested in multiple species, validated by genomic cloning and cross library comparisons and produces comprehensive gene databases even where limited genomic sequence is available. IgDiscover analysis of the allelic content of the Indian and Chinese-origin rhesus macaques reveals high levels of immunoglobulin gene diversity in this species. Further, we describe a novel human IGHV3-21 allele and confirm significant gene differences between Balb/c and C57BL6 mouse strains, demonstrating the power of IgDiscover as a germline V gene discovery tool. PMID:27995928
What do somatic hypermutation and class switch recombination teach us about chronic lymphocytic leukaemia pathogenesis?

PubMed

Oppezzo, P; Dighiero, G

2005-01-01

B-CLL cells express CD5 and IgM/IgD and thus have a mantle zone-like phenotype of naive cells, which, in normal conditions express unmutated Ig genes. However, recent studies have shown that 50%-70% of CLL harbour somatic mutations of VH genes, as if they had matured in a lymphoid follicle. Interestingly, the presence or absence of somatic hypermutation (SHM) process is associated with the use of particular VH genes. Particular alleles of the VH1-69 gene and the VH4-39 gene are preferentially expressed in an unmutated form, while VH4-34 or the majority of VH3 family genes frequently contain somatic mutations. The fact that some genes like VH1-69 and VH3-07 recombine this VH segment to particular JH segments and the restricted use of CDR3 sequences by CLLs expressing the VH4-39 gene suggest that the observed differences in BCR structure in B-CLL could result from selection by distinct antigenic epitopes. It is currently unclear whether this putative antigen-driven process could occur prior to leukaemic transformation and/or that the precursors were transformed into leukaemic cells at distinct maturational stages. The mutational profile of Ig genes has been shown to be associated with disease prognosis. These results could favour the idea that CLL could correspond to two different diseases that look alike in morphologic and phenotypic terms. In CLL with mutated Ig genes, the proliferating B cell may have transited through germinal centres, the physiologic site of hypermutation, whereas in CLL with unmutated Ig genes the malignant B cell may derive from a pre-germinal centre naïve B cell. 
Despite these clinical and molecular differences, recent studies on gene expression profiling of B-CLL cells showed that CLL is characterized by a common gene expression signature that is irrespective of Ig mutational status and differs from other lymphoid cancers and normal lymphoid subpopulations, suggesting that CLL cases share a common mechanism of transformation and/or cell of origin. Activation induced cytidine deaminase (AID) plays a key role in SHM and class switch recombination (CSR). However, the mechanisms accounting for AID action and control of its expression remain unclear. In a recent work we have shown that in contrast to normal circulating B-cells, AID transcripts are expressed constitutively in CLL patients undergoing active CSR, but interestingly this expression occurs predominately in unmutated CLL B-cells. These data favour the view that AID protein may act differentially on CSR and SHM pathways, but the role-played by AID in both processes remains to be elucidated. Recent work indicates that AID is expressed in a small fraction of tumoral cells, which could suggest that this small fraction of cells may correspond to B-CLL cells that would have recently experienced an AID-inducing stimulus occurring in a specific microenvironment.
Targeting BCL2 with Venetoclax in Relapsed Chronic Lymphocytic Leukemia.

PubMed

Roberts, Andrew W; Davids, Matthew S; Pagel, John M; Kahl, Brad S; Puvvada, Soham D; Gerecitano, John F; Kipps, Thomas J; Anderson, Mary Ann; Brown, Jennifer R; Gressick, Lori; Wong, Shekman; Dunbar, Martin; Zhu, Ming; Desai, Monali B; Cerri, Elisa; Heitner Enschede, Sari; Humerickhouse, Rod A; Wierda, William G; Seymour, John F

2016-01-28

New treatments have improved outcomes for patients with relapsed chronic lymphocytic leukemia (CLL), but complete remissions remain uncommon. Venetoclax has a distinct mechanism of action; it targets BCL2, a protein central to the survival of CLL cells. We conducted a phase 1 dose-escalation study of daily oral venetoclax in patients with relapsed or refractory CLL or small lymphocytic lymphoma (SLL) to assess safety, pharmacokinetic profile, and efficacy. In the dose-escalation phase, 56 patients received active treatment in one of eight dose groups that ranged from 150 to 1200 mg per day. In an expansion cohort, 60 additional patients were treated with a weekly stepwise ramp-up in doses as high as 400 mg per day. The majority of the study patients had received multiple previous treatments, and 89% had poor prognostic clinical or genetic features. Venetoclax was active at all dose levels. Clinical tumor lysis syndrome occurred in 3 of 56 patients in the dose-escalation cohort, with one death. After adjustments to the dose-escalation schedule, clinical tumor lysis syndrome did not occur in any of the 60 patients in the expansion cohort. Other toxic effects included mild diarrhea (in 52% of the patients), upper respiratory tract infection (in 48%), nausea (in 47%), and grade 3 or 4 neutropenia (in 41%). A maximum tolerated dose was not identified. Among the 116 patients who received venetoclax, 92 (79%) had a response. Response rates ranged from 71 to 79% among patients in subgroups with an adverse prognosis, including those with resistance to fludarabine, those with chromosome 17p deletions (deletion 17p CLL), and those with unmutated IGHV. Complete remissions occurred in 20% of the patients, including 5% who had no minimal residual disease on flow cytometry. The 15-month progression-free survival estimate for the 400-mg dose groups was 69%. 
Selective targeting of BCL2 with venetoclax had a manageable safety profile and induced substantial responses in patients with relapsed CLL or SLL, including those with poor prognostic features. (Funded by AbbVie and Genentech; ClinicalTrials.gov number, NCT01328626.).
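The headline percentages in the venetoclax abstract above follow directly from the stated counts. The snippet below is a minimal sketch of that arithmetic; the helper name `pct` is illustrative, and only counts given in the abstract are used.

```python
def pct(numerator: int, denominator: int) -> int:
    """Percentage rounded to the nearest whole number."""
    return round(100 * numerator / denominator)

# Counts as stated in the abstract
total_treated = 56 + 60                     # dose-escalation plus expansion cohort
overall_response = pct(92, total_treated)   # patients with a response
tls_escalation = pct(3, 56)                 # clinical tumor lysis syndrome, dose-escalation cohort

print(total_treated, overall_response, tls_escalation)  # 116 79 5
```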
Minimal Disease Assessment in the Treatment of Children and Adolescents with Intermediate-Risk (Stage III/IV) B-Cell Non-Hodgkin Lymphoma: A Children’s Oncology Group Report

PubMed Central

Shiramizu, Bruce; Goldman, Stanton; Kusao, Ian; Agsalda, Melissa; Lynch, James; Smith, Lynette; Harrison, Lauren; Morris, Erin; Gross, Thomas G.; Sanger, Warren; Perkins, Sherrie; Cairo, Mitchell S.

2011-01-01

Children/adolescents with mature B-cell non-Hodgkin lymphoma (B-NHL) have an excellent prognosis but relapses still occur. While chromosomal aberrations and/or clonal immunoglobulin (Ig) gene rearrangements may indicate risk of failure, a more universal approach was developed to detect minimal disease (MD). Children/adolescents with intermediate-risk B-NHL were treated with French-British-American/Lymphome Malins de Burkitt 96 (FAB/LMB96) B4 modified chemotherapy and rituximab. Specimens from diagnosis, end of induction (EOI), and end of therapy (EOT) were assayed for MD. Initial specimens were screened for IGHV family usage with primer pools followed by individual primers to identify MD. Thirty-two diagnostic/staging specimens screened positive with primer pools and unique IGHV family primers were identified. Two patients relapsed; first relapse (4 months from diagnosis) was MD-positive at EOI, the second (36 months from diagnosis) was MD-positive at EOT. At EOI, recurrent rates were similar between the MRD-positive and MRD-negative patients (p=0.40). At EOT, only 13/32 patients had MRD data available with 1 relapse in the MRD-positive group and no recurrences in the MRD-negative group (p=0.077). The study demonstrated molecular-disseminated disease in which IgIGHV primer pools could be used to assess MD. This feasibility study supports future investigations to assess the validity and significance of MD screening in a larger cohort of patients with intermediate-risk mature B-NHL. PMID:21496005
Minimal disease assessment in the treatment of children and adolescents with intermediate-risk (Stage III/IV) B-cell non-Hodgkin lymphoma: a children's oncology group report.

PubMed

Shiramizu, Bruce; Goldman, Stanton; Kusao, Ian; Agsalda, Melissa; Lynch, James; Smith, Lynette; Harrison, Lauren; Morris, Erin; Gross, Thomas G; Sanger, Warren; Perkins, Sherrie; Cairo, Mitchell S

2011-06-01

Children/adolescents with mature B-cell non-Hodgkin lymphoma (B-NHL) have an excellent prognosis but relapses still occur. While chromosomal aberrations and/or clonal immunoglobulin (Ig) gene rearrangements may indicate risk of failure, a more universal approach was developed to detect minimal disease (MD). Children/adolescents with intermediate-risk B-NHL were treated with French-British-American/Lymphome Malins de Burkitt 96 (FAB/LMB96) B4 modified chemotherapy and rituximab. Specimens from diagnosis, end of induction (EOI), and end of therapy (EOT) were assayed for MD. Initial specimens were screened for IGHV family usage with primer pools followed by individual primers to identify MD. Thirty-two diagnostic/staging specimens screened positive with primer pools and unique IGHV family primers were identified. Two patients relapsed; first relapse (4 months from diagnosis) was MD-positive at EOI, the second (36 months from diagnosis) was MD-positive at EOT. At EOI, recurrent rates were similar between the MRD-positive and MRD-negative patients (P = 0·40). At EOT, only 13/32 patients had MRD data available with one relapse in the MRD-positive group and no recurrences in the MRD-negative group (P = 0·077). The study demonstrated molecular-disseminated disease in which IgIGHV primer pools could be used to assess MD. This feasibility study supports future investigations to assess the validity and significance of MD screening in a larger cohort of patients with intermediate-risk mature B-NHL. © 2011 Blackwell Publishing Ltd.
Activated Allogeneic NK Cells Preferentially Kill Poor Prognosis B-Cell Chronic Lymphocytic Leukemia Cells.

PubMed

Sánchez-Martínez, Diego; Lanuza, Pilar M; Gómez, Natalia; Muntasell, Aura; Cisneros, Elisa; Moraru, Manuela; Azaceta, Gemma; Anel, Alberto; Martínez-Lostao, Luis; Villalba, Martin; Palomera, Luis; Vilches, Carlos; García Marco, José A; Pardo, Julián

2016-01-01

Mutational status of TP53 together with expression of wild-type (wt) IGHV represent the most widely accepted biomarkers, establishing a very poor prognosis in B-cell chronic lymphocytic leukemia (B-CLL) patients. Adoptive cell therapy using allogeneic HLA-mismatched natural killer (NK) cells has emerged as an effective and safe alternative in the treatment of acute myeloid and lymphoid leukemias that do not respond to traditional therapies. We have described that allogeneic activated NK cells eliminate hematological cancer cell lines with multidrug resistance acquired by mutations in the apoptotic machinery. This effect depends on the activation protocol, with B-lymphoblastoid cell lines (LCLs) being the most effective stimulus to activate NK cells. Here, we have further analyzed the molecular determinants involved in allogeneic NK cell recognition and elimination of B-CLL cells, including the expression of ligands of the main NK cell-activating receptors (NKG2D and NCRs) and HLA mismatch. We present preliminary data suggesting that B-CLL susceptibility significantly correlates with HLA mismatch between NK cell donor and B-CLL patient. Moreover, we show that the sensitivity of B-CLL cells to NK cells depends on the prognosis based on TP53 and IGHV mutational status. Cells from patients with worse prognosis (mutated TP53 and wt IGHV) are the most susceptible to activated NK cells. Hence, B-CLL prognosis may predict the efficacy of allogeneic activated NK cells, and, thus, NK cell transfer represents a good alternative to treat poor prognosis B-CLL patients who present a very short life expectancy due to lack of effective treatments.
Genomic V exons from whole genome shotgun data in reptiles.

PubMed

Olivieri, D N; von Haeften, B; Sánchez-Espinel, C; Faro, J; Gambón-Deza, F

2014-08-01

Reptiles and mammals diverged over 300 million years ago, creating two parallel evolutionary lineages amongst terrestrial vertebrates. In reptiles, two main evolutionary lines emerged: one gave rise to Squamata, while the other gave rise to Testudines, Crocodylia, and Aves. In this study, we determined the genomic variable (V) exons from whole genome shotgun sequencing (WGS) data in reptiles corresponding to the three main immunoglobulin (IG) loci and the four main T cell receptor (TR) loci. We show that Squamata lack the TRG and TRD genes, and snakes lack the IGKV genes. In representative species of Testudines and Crocodylia, the seven major IG and TR loci are maintained. As in mammals, genes of the IG loci can be grouped into well-defined IMGT clans through a multi-species phylogenetic analysis. We show that the reptilian IGHV and IGLV genes are distributed amongst the established mammalian clans, while their IGKV genes are found within a single clan, nearly exclusive from the mammalian sequences. The reptilian and mammalian TRAV genes cluster into six common evolutionary clades (since IMGT clans have not been defined for TR). In contrast, the reptilian TRBV genes cluster into three clades, which have few mammalian members. In this locus, the V exon sequences from mammals appear to have undergone different evolutionary diversification processes that occurred outside these shared reptilian clans. These sequences can be obtained in a freely available public repository (http://vgenerepertoire.org).
IgV peptide mapping of native Ro60 autoantibody proteomes in primary Sjögren's syndrome reveals molecular markers of Ro/La diversification.

PubMed

Wang, Jing J; Al Kindi, Mahmood A; Colella, Alex D; Dykes, Lukah; Jackson, Michael W; Chataway, Tim K; Reed, Joanne H; Gordon, Tom P

2016-12-01

We have used high-resolution mass spectrometry to sequence precipitating anti-Ro60 proteomes from sera of patients with primary Sjögren's syndrome and compare immunoglobulin variable-region (IgV) peptide signatures in Ro/La autoantibody subsets. Anti-Ro60 were purified by elution from native Ro60-coated ELISA plates and subjected to combined de novo amino acid sequencing and database matching. Monospecific anti-Ro60 Igs comprised dominant public and minor private sets of IgG1 kappa and lambda restricted heavy and light chains. Specific IgV amino acid substitutions stratified anti-Ro60 from anti-Ro60/La responses, providing a molecular fingerprint of Ro60/La determinant spreading and suggesting that different forms of Ro60 antigen drive these responses. Sequencing of linked anti-Ro52 proteomes from individual patients and comparison with their anti-Ro60 partners revealed sharing of a dominant IGHV3-23/IGKV3-20 paired clonotype but with divergent IgV mutational signatures. In summary, anti-Ro60 IgV peptide mapping provides insights into Ro/La autoantibody diversification and reveals serum-based molecular markers of humoral Ro60 autoimmunity. Copyright © 2016 Elsevier Inc. All rights reserved.
Diversity, cellular origin and autoreactivity of antibody-secreting cell expansions in acute Systemic Lupus Erythematosus

PubMed Central

Tipton, Christopher M; Fucile, Christopher F; Darce, Jaime; Chida, Asiya; Ichikawa, Travis; Gregoretti, Ivan; Schieferl, Sandra; Hom, Jennifer; Jenks, Scott; Feldman, Ron J; Mehr, Ramit; Wei, Chungwen; Lee, F. Eun-Hyung; Cheung, Wan Cheung; Rosenberg, Alexander F; Sanz, Iñaki

2015-01-01

Acute SLE courses with antibody-secreting cells (ASC) surges whose origin, diversity, and contribution to serum autoantibodies remain unknown. Deep sequencing, autoantibody proteome and single-cell analysis demonstrated highly diversified ASC punctuated by VH4-34 clones that produce dominant serum autoantibodies. A fraction of ASC clones contained unmutated autoantibodies, a finding consistent with differentiation outside the germinal centers. A substantial ASC segment derived from a distinct subset of newly activated naïve cells of significant clonality that persist in the circulation for several months. Thus, selection of SLE autoreactivities occurred during polyclonal activation with prolonged recruitment of recently activated naïve B cells. These findings shed light into SLE pathogenesis, help explain the benefit of anti-B cell agents and facilitate the design of future therapies. PMID:26006014
Recombinant host cells and media for ethanol production

DOEpatents

Wood, Brent E; Ingram, Lonnie O; Yomano, Lorraine P; York, Sean W

2014-02-18

Disclosed are recombinant host cells suitable for degrading an oligosaccharide that have been optimized for growth and production of high yields of ethanol, and methods of making and using these cells. The invention further provides minimal media comprising urea-like compounds for economical production of ethanol by recombinant microorganisms. Recombinant host cells in accordance with the invention are modified by gene mutation to eliminate genes responsible for the production of unwanted products other than ethanol, thereby increasing the yield of ethanol produced from the oligosaccharides, relative to unmutated parent strains. The new and improved strains of recombinant bacteria are capable of superior ethanol productivity and yield when grown under conditions suitable for fermentation in minimal growth media containing inexpensive reagents. Systems optimized for ethanol production combine a selected optimized minimal medium with a recombinant host cell optimized for use in the selected medium. Preferred systems are suitable for efficient ethanol production by simultaneous saccharification and fermentation (SSF) using lignocellulose as an oligosaccharide source. The invention also provides novel isolated polynucleotide sequences, polypeptide sequences, vectors and antibodies.
Igs as Substrates for Transglutaminase 2: Implications for Autoantibody Production in Celiac Disease

PubMed Central

Fleur du Pré, M.; Di Niro, Roberto; Sollid, Ludvig M.

2015-01-01

Autoantibodies specific for the enzyme transglutaminase 2 (TG2) are a hallmark of the gluten-sensitive enteropathy celiac disease. Production of the Abs is strictly dependent on exposure to dietary gluten proteins, thus raising the question how a foreign Ag (gluten) can induce an autoimmune response. It has been suggested that TG2-reactive B cells are activated by gluten-reactive T cells following receptor-mediated uptake of TG2–gluten complexes. In this study, we propose a revised model that is based on the ability of the BCR to serve as a substrate to TG2 and become cross-linked to gluten-derived peptides. We show that TG2-specific IgD molecules are preferred in the reaction and that binding of TG2 via a common epitope targeted by cells using the IgH variable gene segment (IGHV)5–51 results in more efficient cross-linking. Based on these findings we hypothesize that IgD-expressing B cells using IGHV5–51 are preferentially activated, and we suggest that this property can explain the previously reported low number of somatic mutations as well as the overrepresentation of IGHV5–51 among TG2-specific plasma cells in the celiac lesion. The model also couples gluten peptide uptake by TG2-reactive B cells directly to peptide deamidation, which is necessary for the activation of gluten-reactive T cells. It thereby provides a link between gluten deamidation, T cell activation, and the production of TG2-specific Abs. These are all key events in the development of celiac disease, and by connecting them the model may explain why the same enzyme that catalyzes gluten deamidation is also an autoantigen, something that is hardly coincidental. PMID:26503953
Methylation status regulates lipoprotein lipase expression in chronic lymphocytic leukemia.

PubMed

Abreu, Cecilia; Moreno, Pilar; Palacios, Florencia; Borge, Mercedes; Morande, Pablo; Landoni, Ana Inés; Gabus, Raul; Dighiero, Guillermo; Giordano, Mirta; Gamberale, Romina; Oppezzo, Pablo

2013-08-01

Among different prognostic factors in chronic lymphocytic leukemia (CLL), we previously demonstrated that lipoprotein lipase (LPL) is associated with an unmutated immunoglobulin profile and clinical poor outcome. Despite the usefulness of LPL for CLL prognosis, its functional role and the molecular mechanism regulating its expression are still open questions. Interaction of CLL B-cells with the tissue microenvironment favors disease progression by promoting malignant B-cell growth. Since tissue methylation can be altered by environmental factors, we investigated the methylation status of the LPL gene and the possibility that overexpression could be associated with microenvironment signals. Our results show that a demethylated state of the LPL gene is responsible for its anomalous expression in unmutated CLL cases and that this expression is dependent on microenvironment signals. Overall, this work proposes that an epigenetic mechanism, triggered by the microenvironment, regulates LPL expression in CLL disease.
Diversion of HIV-1 Vaccine-induced Immunity by gp41-Microbiota Cross-reactive Antibodies

PubMed Central

Williams, Wilton B; Liao, Hua-Xin; Moody, M. Anthony; Kepler, Thomas B.; Alam, S Munir; Gao, Feng; Wiehe, Kevin; Trama, Ashley M.; Jones, Kathryn; Zhang, Ruijun; Song, Hongshuo; Marshall, Dawn J; Whitesides, John F; Sawatzki, Kaitlin; Hua, Axin; Liu, Pinghuang; Tay, Matthew Z; Seaton, Kelly; Shen, Xiaoying; Foulger, Andrew; Lloyd, Krissey E.; Parks, Robert; Pollara, Justin; Ferrari, Guido; Yu, Jae-Sung; Vandergrift, Nathan; Montefiori, David C.; Sobieszczyk, Magdalena E; Hammer, Scott; Karuna, Shelly; Gilbert, Peter; Grove, Doug; Grunenberg, Nicole; McElrath, Julie; Mascola, John R.; Koup, Richard A; Corey, Lawrence; Nabel, Gary J.; Morgan, Cecilia; Churchyard, Gavin; Maenza, Janine; Keefer, Michael; Graham, Barney S.; Baden, Lindsey R.; Tomaras, Georgia D.; Haynes, Barton F.

2015-01-01

A HIV-1 DNA prime-recombinant Adenovirus Type 5 (rAd5) boost vaccine failed to protect from HIV-1 acquisition. We studied the nature of the vaccine-induced antibody (Ab) response to HIV-1 envelope (Env). HIV-1-reactive plasma Ab titers were higher to Env gp41 than gp120, and repertoire analysis demonstrated that 93% of HIV-1-reactive Abs from memory B cells was to Env gp41. Vaccine-induced gp41-reactive monoclonal antibodies (mAbs) were non-neutralizing, and frequently polyreactive with host and environmental antigens including intestinal microbiota (IM). Next generation sequencing of an IGHV repertoire prior to vaccination revealed an Env-IM cross-reactive Ab that was clonally-related to a subsequent vaccine-induced gp41-reactive Ab. Thus, HIV-1 Env DNA-rAd5 vaccine induced a dominant IM-polyreactive, non-neutralizing gp41-reactive Ab repertoire response that was associated with no vaccine efficacy. PMID:26229114
Cell Cycle Reprogramming for PI3K Inhibition Overrides Relapse-Specific C481S BTK Mutation Revealed by Longitudinal Functional Genomics in Mantle Cell Lymphoma

PubMed Central

Chiron, David; Di Liberto, Maurizio; Martin, Peter; Huang, Xiangao; Sharman, Jeff; Blecua, Pedro; Mathew, Susan; Vijay, Priyanka; Eng, Ken; Ali, Siraj; Johnson, Amy; Chang, Betty; Ely, Scott; Elemento, Olivier; Mason, Christopher E.; Leonard, John P.; Chen-Kiang, Selina

2014-01-01

Despite the unprecedented clinical activity of the Bruton’s tyrosine kinase inhibitor ibrutinib in MCL, acquired-resistance is common. By longitudinal integrative whole-exome and whole-transcriptome sequencing and targeted sequencing, we identified the first relapse-specific C481S mutation at the ibrutinib-binding site of BTK in MCL cells at progression following a durable response. This mutation enhanced BTK and AKT activation and tissue-specific proliferation of resistant MCL cells driven by CDK4 activation. It was absent, however, in patients with primary-resistance or progression following transient response to ibrutinib, suggesting alternative mechanisms of resistance. Through synergistic induction of PIK3IP1 and inhibition of PI3K-AKT activation, prolonged early G1 arrest induced by PD 0332991 (palbociclib) inhibition of CDK4 sensitized resistant lymphoma cells to ibrutinib killing when BTK was unmutated, and to PI3K inhibitors independent of C481S mutation. These data identify a genomic basis for acquired-ibrutinib resistance in MCL and suggest a strategy to override both primary- and acquired-ibrutinib resistance. PMID:25082755
Stabilised DNA secondary structures with increasing transcription localise hypermutable bases for somatic hypermutation in IGHV3-23.

PubMed

Duvvuri, Bhargavi; Duvvuri, Venkata R; Wu, Jianhong; Wu, Gillian E

2012-07-01

Somatic hypermutation (SHM) mediated by activation-induced cytidine deaminase (AID) is a transcription-coupled mechanism most responsible for generating high affinity antibodies. An issue remaining enigmatic in SHM is how AID is preferentially targeted during transcription to hypermutable bases in its substrates (WRC motifs) on both DNA strands. AID targets only single stranded DNA. By modelling the dynamical behaviour of IGHV3-23 DNA, a commonly used human variable gene segment, we observed that hypermutable bases on the non-transcribed strand are paired whereas those on transcribed strand are mostly unpaired. Hypermutable bases (both paired and unpaired) are made accessible to AID in stabilised secondary structures formed with increasing transcription levels. This observation provides a rationale for the hypermutable bases on both the strands of DNA being targeted to a similar extent despite having differences in unpairedness. We propose that increasing transcription and RNAP II stalling resulting in the formation and stabilisation of stem-loop structures with AID hotspots in negatively supercoiled region can localise the hypermutable bases of both strands of DNA, to AID-mediated SHM.
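To make the motif notation in the abstract above concrete, here is a minimal sketch that locates AID hotspot cytosines on both strands of a sequence. It assumes only the IUPAC meaning of WRC (W = A or T, R = A or G) and the fact that a WRC on the opposite strand reads as GYW (Y = C or T) on the given strand. The function name and the toy sequence are hypothetical, and the sketch does not model secondary structure, transcription, or anything else from the authors' analysis.

```python
import re


def find_wrc_hotspots(seq):
    """Return two lists of 0-based positions on the given (top) strand:
    the hotspot C of each WRC motif, and the G that pairs with the
    hotspot C of each WRC motif on the opposite strand (GYW on top)."""
    seq = seq.upper()
    # Zero-width lookahead so that overlapping motifs are all reported.
    top = [m.start() + 2 for m in re.finditer(r"(?=[AT][AG]C)", seq)]
    # Reverse complement of WRC is GYW; the targeted base is the leading G.
    bottom = [m.start() for m in re.finditer(r"(?=G[CT][AT])", seq)]
    return top, bottom


# Hypothetical toy sequence for illustration; this is not IGHV3-23.
example = "AGCTGGTAACAGCT"
top_hits, bottom_hits = find_wrc_hotspots(example)
print("top-strand hotspot C positions:", top_hits)
print("bottom-strand hotspot positions (G on top):", bottom_hits)
```

Note that the palindromic AGCT contains a hotspot on both strands at adjacent positions, which is why it appears in both lists.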
Analysis of a Clonal Lineage of HIV-1 Envelope V2/V3 Conformational Epitope-Specific Broadly Neutralizing Antibodies and Their Inferred Unmutated Common Ancestors

PubMed Central

Bonsignori, Mattia; Hwang, Kwan-Ki; Chen, Xi; Tsao, Chun-Yen; Morris, Lynn; Gray, Elin; Marshall, Dawn J.; Crump, John A.; Kapiga, Saidi H.; Sam, Noel E.; Sinangil, Faruk; Pancera, Marie; Yongping, Yang; Zhang, Baoshan; Zhu, Jiang; Kwong, Peter D.; O'Dell, Sijy; Mascola, John R.; Wu, Lan; Nabel, Gary J.; Phogat, Sanjay; Seaman, Michael S.; Whitesides, John F.; Moody, M. Anthony; Kelsoe, Garnett; Yang, Xinzhen; Sodroski, Joseph; Shaw, George M.; Montefiori, David C.; Kepler, Thomas B.; Tomaras, Georgia D.; Alam, S. Munir; Liao, Hua-Xin; Haynes, Barton F.

2011-01-01

V2/V3 conformational epitope antibodies that broadly neutralize HIV-1 (PG9 and PG16) have been recently described. Since an elicitation of previously known broadly neutralizing antibodies has proven elusive, the induction of antibodies with such specificity is an important goal for HIV-1 vaccine development. A critical question is which immunogens and vaccine formulations might be used to trigger and drive the development of memory B cell precursors with V2/V3 conformational epitope specificity. In this paper we identified a clonal lineage of four V2/V3 conformational epitope broadly neutralizing antibodies (CH01 to CH04) from an African HIV-1-infected broad neutralizer and inferred their common reverted unmutated ancestor (RUA) antibodies. While conformational epitope antibodies rarely bind recombinant Env monomers, a screen of 32 recombinant envelopes for binding to the CH01 to CH04 antibodies showed monoclonal antibody (MAb) binding to the E.A244 gp120 Env and to chronic Env AE.CM243; MAbs CH01 and CH02 also bound to transmitted/founder Env B.9021. CH01 to CH04 neutralized 38% to 49% of a panel of 91 HIV-1 tier 2 pseudoviruses, while the RUAs neutralized only 16% of HIV-1 isolates. Although the reverted unmutated ancestors showed restricted neutralizing activity, they retained the ability to bind to the E.A244 gp120 HIV-1 envelope with an affinity predicted to trigger B cell development. Thus, E.A244, B.9021, and AE.CM243 Envs are three potential immunogen candidates for studies aimed at defining strategies to induce V2/V3 conformational epitope-specific antibodies. PMID:21795340
Immunological aspects in chronic lymphocytic leukemia (CLL) development.

PubMed

García-Muñoz, Ricardo; Galiacho, Verónica Roldan; Llorente, Luis

2012-07-01

Chronic lymphocytic leukemia (CLL) is unique among B cell malignancies in that the malignant clones can feature either somatically mutated or unmutated IGVH genes. CLL cells that express unmutated immunoglobulin variable domains likely underwent final development prior to their entry into the germinal center, whereas those that express mutated variable domains likely transited through the germinal center and then underwent final development. Regardless, the cellular origin of CLL remains unknown. The aim of this review is to summarize immunological aspects involved in this process and to provide insights about the complex biology and pathogenesis of this disease. We propose a mechanistic hypothesis to fit the origin of B-CLL clones into our current picture of normal B cell development. In particular, we suggest that unmutated CLL arises from normal B cells with self-reactivity for apoptotic bodies that have undergone receptor editing, CD5 expression, and anergic processes in the bone marrow. Similarly, mutated CLL would arise from cells that, while acquiring self-reactivity for autoantigens (including apoptotic bodies) in germinal centers, are also still subject to tolerization mechanisms, including receptor editing and anergy. We believe that CLL is a proliferation of B lymphocytes selected during clonal expansion through multiple encounters with (auto)antigens, despite the fact that they differ in their state of activation and maturation. Autoantigens and microbial pathogens activate BCR signaling and promote tolerogenic mechanisms such as receptor editing/revision, anergy, CD5+ expression, and somatic hypermutation in CLL B cells. The result of these tolerogenic mechanisms is the survival of CLL B cell clones with similar surface markers and homogeneous gene expression signatures. We suggest that both immunophenotypic surface markers and homogeneous gene expression might represent the evidence of several attempts to re-educate self-reactive B cells.
Prolonged lymphocytosis during ibrutinib therapy is associated with distinct molecular characteristics and does not indicate a suboptimal response to therapy

PubMed Central

Smucker, Kelly; Smith, Lisa L.; Lozanski, Arletta; Zhong, Yiming; Ruppert, Amy S.; Lucas, David; Williams, Katie; Zhao, Weiqiang; Rassenti, Laura; Ghia, Emanuela; Kipps, Thomas J.; Mantel, Rose; Jones, Jeffrey; Flynn, Joseph; Maddocks, Kami; O’Brien, Susan; Furman, Richard R.; James, Danelle F.; Clow, Fong; Lozanski, Gerard; Johnson, Amy J.; Byrd, John C.

2014-01-01

The Bruton’s tyrosine kinase (BTK) inhibitor ibrutinib has outstanding activity in patients with chronic lymphocytic leukemia. Most patients experience lymphocytosis, representing lymphocyte egress from nodal compartments. This resolves within 8 months in the majority of patients, but a subgroup has lymphocytosis lasting >12 months. Here we report a detailed characterization of patients with persistent lymphocytosis during ibrutinib therapy. Signaling evaluation showed that while BTK is inhibited, downstream mediators of B-cell receptor (BCR) signaling are activated in persistent lymphocytes. These cells cannot be stimulated through the BCR and do not show evidence of target gene activation. Flow cytometry for κ and λ expression, IGHV sequencing, Zap-70 methylation, and targeted gene sequencing in these patients are identical at baseline and later time points, suggesting that persistent lymphocytes do not represent clonal evolution. In vitro treatment with targeted kinase inhibitors shows that they are not addicted to a single survival pathway. Finally, progression-free survival is not inferior for patients with prolonged lymphocytosis vs those with traditional responses. Thus, prolonged lymphocytosis is common following ibrutinib treatment, likely represents the persistence of a quiescent clone, and does not predict a subgroup of patients likely to relapse early. PMID:24415539
Whole-genome sequencing identifies recurrent mutations in chronic lymphocytic leukaemia

PubMed Central

Puente, Xose S.; Pinyol, Magda; Quesada, Víctor; Conde, Laura; Ordóñez, Gonzalo R.; Villamor, Neus; Escaramis, Georgia; Jares, Pedro; Beà, Sílvia; González-Díaz, Marcos; Bassaganyas, Laia; Baumann, Tycho; Juan, Manel; López-Guerra, Mónica; Colomer, Dolors; Tubío, José M. C.; López, Cristina; Navarro, Alba; Tornador, Cristian; Aymerich, Marta; Rozman, María; Hernández, Jesús M.; Puente, Diana A.; Freije, José M. P.; Velasco, Gloria; Gutiérrez-Fernández, Ana; Costa, Dolors; Carrió, Anna; Guijarro, Sara; Enjuanes, Anna; Hernández, Lluís; Yagüe, Jordi; Nicolás, Pilar; Romeo-Casabona, Carlos M.; Himmelbauer, Heinz; Castillo, Ester; Dohm, Juliane C.; de Sanjosé, Silvia; Piris, Miguel A.; de Alava, Enrique; Miguel, Jesús San; Royo, Romina; Gelpí, Josep L.; Torrents, David; Orozco, Modesto; Pisano, David G.; Valencia, Alfonso; Guigó, Roderic; Bayés, Mónica; Heath, Simon; Gut, Marta; Klatt, Peter; Marshall, John; Raine, Keiran; Stebbings, Lucy A.; Futreal, P. Andrew; Stratton, Michael R.; Campbell, Peter J.; Gut, Ivo; López-Guillermo, Armando; Estivill, Xavier; Montserrat, Emili; López-Otín, Carlos; Campo, Elías

2012-01-01

Chronic lymphocytic leukaemia (CLL), the most frequent leukaemia in adults in Western countries, is a heterogeneous disease with variable clinical presentation and evolution. Two major molecular subtypes can be distinguished, characterized respectively by a high or low number of somatic hypermutations in the variable region of immunoglobulin genes. The molecular changes leading to the pathogenesis of the disease are still poorly understood. Here we performed whole-genome sequencing of four cases of CLL and identified 46 somatic mutations that potentially affect gene function. Further analysis of these mutations in 363 patients with CLL identified four genes that are recurrently mutated: notch 1 (NOTCH1), exportin 1 (XPO1), myeloid differentiation primary response gene 88 (MYD88) and kelch-like 6 (KLHL6). Mutations in MYD88 and KLHL6 are predominant in cases of CLL with mutated immunoglobulin genes, whereas NOTCH1 and XPO1 mutations are mainly detected in patients with unmutated immunoglobulins. The patterns of somatic mutation, supported by functional and clinical analyses, strongly indicate that the recurrent NOTCH1, MYD88 and XPO1 mutations are oncogenic changes that contribute to the clinical evolution of the disease. To our knowledge, this is the first comprehensive analysis of CLL combining whole-genome sequencing with clinical characteristics and clinical outcomes. It highlights the usefulness of this approach for the identification of clinically relevant mutations in cancer. PMID:21642962
Longitudinal analysis of the peripheral B cell repertoire reveals unique effects of immunization with a new influenza virus strain.

PubMed

Cortina-Ceballos, Bernardo; Godoy-Lozano, Elizabeth Ernestina; Téllez-Sosa, Juan; Ovilla-Muñoz, Marbella; Sámano-Sánchez, Hugo; Aguilar-Salgado, Andrés; Gómez-Barreto, Rosa Elena; Valdovinos-Torres, Humberto; López-Martínez, Irma; Aparicio-Antonio, Rodrigo; Rodríguez, Mario H; Martínez-Barnetche, Jesús

2015-11-25

Despite the potential to produce antibodies that can neutralize different viruses (heterotypic neutralization), it is not known why vaccination against influenza induces protection predominantly against the utilized viral strains (homotypic response). Identification of structural patterns of the B cell repertoire associated with heterotypic neutralization may help to identify relevant epitopes for a universal vaccine against influenza. Blood samples were collected from volunteers immunized with 2008/2009 trivalent inactivated vaccine (TIV), pandemic H1N1 (pdmH1N1) monovalent inactivated vaccine (MIV) and the 2014/2015 TIV. Neutralization was assessed by hemagglutination and microneutralization test. IgG V(H) amplicons derived from peripheral blood RNA from pre-immune and 7 days post vaccination were subjected to 454-Roche sequencing. Full reconstruction of the sampled repertoires was done with ImmunediveRsity. The TIV induced a predominantly homotypic neutralizing serologic response, while the 09 MIV induced a heterotypic neutralizing seroconversion in 17% of the individuals. Both the 08/09 and the 14/15 TIV were associated with a reduction in clonotypic diversity, whereas 09 MIV was the opposite. Moreover, TIV and MIV induced distinctive patterns of IGHV segment use that are consistent with B cell selection by conserved antigenic determinants shared by the pre-pandemic and the pandemic strains. However, low somatic hypermutation rates in IgG after 09 MIV immunization, but not after 08/09 and 14/15 TIV immunization were observed. Furthermore, no evidence of the original antigenic sin was found in the same individuals after vaccination with the three vaccines. Immunization with a new influenza virus strain (2009 pdmH1N1) induced unique effects in the peripheral B cell repertoire clonal structure, a stereotyped response involving distinctive IGHV segment use and low somatic hypermutation levels. These parameters were contrastingly different to those observed in response to pre-pandemic and post-pandemic vaccination, and may be the result of clonal selection of common antigenic determinants, as well as germinal center-independent responses that wane as the pandemic strain becomes seasonal. Our findings may contribute to the understanding of the structural and cellular basis required to develop a universal influenza vaccine.
Igs expressed by chronic lymphocytic leukemia B cells show limited binding-site structure variability.

PubMed

Marcatili, Paolo; Ghiotto, Fabio; Tenca, Claudya; Chailyan, Anna; Mazzarello, Andrea N; Yan, Xiao-Jie; Colombo, Monica; Albesiano, Emilia; Bagnara, Davide; Cutrona, Giovanna; Morabito, Fortunato; Bruno, Silvia; Ferrarini, Manlio; Chiorazzi, Nicholas; Tramontano, Anna; Fais, Franco

2013-06-01

Ag selection has been suggested to play a role in chronic lymphocytic leukemia (CLL) pathogenesis, but no large-scale analysis has been performed so far on the structure of the Ag-binding sites (ABSs) of leukemic cell Igs. We sequenced both H and L chain V(D)J rearrangements from 366 CLL patients and modeled their three-dimensional structures. The resulting ABS structures were clustered into a small number of discrete sets, each containing ABSs with similar shapes and physicochemical properties. This structural classification correlates well with other known prognostic factors such as Ig mutation status and recurrent (stereotyped) receptors, but it shows a better prognostic value, at least in the case of one structural cluster for which clinical data were available. These findings suggest, for the first time, to our knowledge, on the basis of a structural analysis of the Ab-binding sites, that selection by a finite quota of antigenic structures operates on most CLL cases, whether mutated or unmutated.
Cryptochrome-1 expression: a new prognostic marker in B-cell chronic lymphocytic leukemia.

PubMed

Lewintre, Eloisa Jantus; Martín, Cristina Reinoso; Ballesteros, Carlos García; Montaner, David; Rivera, Rosa Farrás; Mayans, José Ramón; García-Conde, Javier

2009-02-01

Chronic lymphocytic leukemia is an adult-onset leukemia with a heterogeneous clinical behavior. When chronic lymphocytic leukemia cases were divided on the basis of IgV(H) mutational status, widely differing clinical courses were revealed. Since IgV(H) sequencing is difficult to perform in a routine diagnostic laboratory, finding a surrogate for IgV(H) mutational status seems an important priority. In the present study, we proposed the use of Cryptochrome-1 as a new prognostic marker in early-stage chronic lymphocytic leukemia. Seventy patients (Binet stage A, without treatment) were included in the study. We correlated Cryptochrome-1 mRNA with well established prognostic markers such as IgV(H) mutations, ZAP70, LPL or CD38 expression and chromosomal abnormalities. High Cryptochrome-1 expression correlated with IgV(H) unmutated samples. In addition, Cryptochrome-1 was a valuable predictor of disease progression in early-stage chronic lymphocytic leukemia, therefore it can be introduced in clinical practice with the advantage of a simplified method of quantification.
Co-evolution of a broadly neutralizing HIV-1 antibody and founder virus

PubMed Central

Liao, Hua-Xin; Lynch, Rebecca; Zhou, Tongqing; Gao, Feng; Alam, S. Munir; Boyd, Scott D.; Fire, Andrew Z.; Roskin, Krishna M.; Schramm, Chaim A.; Zhang, Zhenhai; Zhu, Jiang; Shapiro, Lawrence; Mullikin, James C.; Gnanakaran, S.; Hraber, Peter; Wiehe, Kevin; Kelsoe, Garnett; Yang, Guang; Xia, Shi-Mao; Montefiori, David C.; Parks, Robert; Lloyd, Krissey E.; Scearce, Richard M.; Soderberg, Kelly A.; Cohen, Myron; Kaminga, Gift; Louder, Mark K.; Tran, Lillan M.; Chen, Yue; Cai, Fangping; Chen, Sheri; Moquin, Stephanie; Du, Xiulian; Joyce, Gordon M.; Srivatsan, Sanjay; Zhang, Baoshan; Zheng, Anqi; Shaw, George M.; Hahn, Beatrice H.; Kepler, Thomas B.; Korber, Bette T.M.; Kwong, Peter D.; Mascola, John R.; Haynes, Barton F.

2013-01-01

Current HIV-1 vaccines elicit strain-specific neutralizing antibodies. However, cross-reactive neutralizing antibodies arise in ~20% of HIV-1-infected individuals, and details of their generation could provide a roadmap for effective vaccination. Here we report the isolation, evolution and structure of a broadly neutralizing antibody from an African donor followed from time of infection. The mature antibody, CH103, neutralized ~55% of HIV-1 isolates, and its co-crystal structure with gp120 revealed a novel loop-based mechanism of CD4-binding site recognition. Virus and antibody gene sequencing revealed concomitant virus evolution and antibody maturation. Notably, the CH103-lineage unmutated common ancestor avidly bound the transmitted/founder HIV-1 envelope glycoprotein, and evolution of antibody neutralization breadth was preceded by extensive viral diversification in and near the CH103 epitope. These data elucidate the viral and antibody evolution leading to induction of a lineage of HIV-1 broadly neutralizing antibodies and provide insights into strategies to elicit similar antibodies via vaccination. PMID:23552890
SDM-Assist software to design site-directed mutagenesis primers introducing “silent” restriction sites

PubMed Central

2013-01-01

Background: Over the past decades site-directed mutagenesis (SDM) has become an indispensable tool for biological structure-function studies. In principle, SDM uses modified primer pairs in a PCR reaction to introduce a mutation in a cDNA insert. DpnI digestion of the reaction mixture is used to eliminate template copies before amplification in E. coli; however, this process is inefficient resulting in un-mutated clones which can only be distinguished from mutant clones by sequencing. Results: We have developed a program – ‘SDM-Assist’ which creates SDM primers adding a specific identifier: through additional silent mutations a restriction site is included or a previous one removed which allows for highly efficient identification of ‘mutated clones’ by a simple restriction digest. Conclusions: The direct identification of SDM clones will save time and money for researchers. SDM-Assist also scores the primers based on factors such as Tm, GC content and secondary structure allowing for simplified selection of optimal primer pairs. PMID:23522286
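The idea of a "silent" restriction site can be illustrated with a minimal sketch: slide a recognition sequence along an in-frame coding sequence and keep only the placements that leave the translated protein unchanged. This is not SDM-Assist's algorithm, which is not described in the abstract. The function names, the toy coding sequence, and the choice of the EcoRI site (GAATTC) are assumptions for illustration, and primer design and scoring (Tm, GC content, secondary structure) are deliberately left out.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate an in-frame DNA coding sequence into a protein string."""
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds) - 2, 3))


def silent_site_variants(cds, site):
    """Return (position, mutant) pairs where overwriting the sequence with
    the recognition site at that position changes the DNA but not the protein."""
    cds, site = cds.upper(), site.upper()
    protein = translate(cds)
    hits = []
    for pos in range(len(cds) - len(site) + 1):
        mutant = cds[:pos] + site + cds[pos + len(site):]
        # Skip placements where the site is already present or the protein changes.
        if mutant != cds and translate(mutant) == protein:
            hits.append((pos, mutant))
    return hits


# Hypothetical toy sequence encoding M-E-F-G; EcoRI recognizes GAATTC.
for pos, mutant in silent_site_variants("ATGGAGTTTGGC", "GAATTC"):
    print(pos, mutant)
```

A clone carrying the intended mutation plus such a silent site could then be told apart from an un-mutated clone by a restriction digest rather than by sequencing, which is the benefit the abstract describes.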
Relevance of the immunoglobulin VH somatic mutation status in patients with chronic lymphocytic leukemia treated with fludarabine, cyclophosphamide, and rituximab (FCR) or related chemoimmunotherapy regimens.

PubMed

Lin, Katherine I; Tam, Constantine S; Keating, Michael J; Wierda, William G; O'Brien, Susan; Lerner, Susan; Coombes, Kevin R; Schlette, Ellen; Ferrajoli, Alessandra; Barron, Lynn L; Kipps, Thomas J; Rassenti, Laura; Faderl, Stefan; Kantarjian, Hagop; Abruzzo, Lynne V

2009-04-02

Although immunoglobulin V(H) mutation status (IgV(H) MS) is prognostic in patients with chronic lymphocytic leukemia (CLL) who are treated with alkylating agents or single-agent fludarabine, its significance in the era of chemoimmunotherapy is not known. We determined the IgV(H) somatic mutation status (MS) in 177 patients enrolled in a phase 2 study of fludarabine, cyclophosphamide, and rituximab (FCR) and in 127 patients treated with subsequent chemoimmunotherapy protocols. IgV(H) MS did not impact significantly on the complete remission (CR) rate of patients receiving FCR or related regimens. However, CR duration was significantly shorter in patients with CLL that used unmutated IgV(H) than those whose CLL used mutated IgV(H) (TTP 47% vs 82% at 6 years, P < .001). In a multivariate model considering all baseline characteristics, IgV(H) MS emerged as the only determinant of remission duration (hazard ratio 3.8, P < .001). Our results suggest that postremission interventions should be targeted toward patients with unmutated IgV(H) status.
An Unmutated IgM Response to the Vi Polysaccharide of Salmonella Typhi Contributes to Protective Immunity in a Murine Model of Typhoid.

PubMed

Pandya, Kalgi D; Palomo-Caturla, Isabel; Walker, Justin A; K Sandilya, Vijay; Zhong, Zhijiu; Alugupalli, Kishore R

2018-06-15

T cell-dependent B cell responses typically develop in germinal centers. Abs generated during such responses are isotype switched and have a high affinity to the Ag because of somatic hypermutation of Ab genes. B cell responses to purified polysaccharides are T cell independent and do not result in the formation of bona fide germinal centers, and the dominant Ab isotype produced during such responses is IgM with very few or no somatic mutations. Activation-induced cytidine deaminase (AID) is required for both somatic hypermutation and Ig isotype switching in humans and mice. To test the extent to which unmutated polysaccharide-specific IgM confers protective immunity, we immunized wildtype and AID-/- mice with either heat-killed Salmonella enterica serovar Typhi (S. Typhi) or purified Vi polysaccharide (ViPS). We found that wildtype and AID-/- mice immunized with heat-killed S. Typhi generated similar anti-ViPS IgM responses. As expected, wildtype, but not AID-/- mice generated ViPS-specific IgG. However, the differences in the Ab-dependent killing of S. Typhi mediated by the classical pathway of complement activation were not statistically significant. In ViPS-immunized wildtype and AID-/- mice, the ViPS-specific IgM levels and S. Typhi bactericidal Ab titers at 7 but not at 28 d postimmunization were also comparable. To test the protective immunity conferred by these immunizations, mice were challenged with a chimeric S. Typhimurium strain expressing ViPS. Compared with their naive counterparts, immunized wildtype and AID-/- mice exhibited significantly reduced bacterial burden regardless of the route of infection. These data indicate that an unmutated IgM response to ViPS contributes to protective immunity to S. Typhi. Copyright © 2018 by The American Association of Immunologists, Inc.
Identifying EGFR-Expressed Cells and Detecting EGFR Multi-Mutations at Single-Cell Level by Microfluidic Chip

NASA Astrophysics Data System (ADS)

Li, Ren; Zhou, Mingxing; Li, Jine; Wang, Zihua; Zhang, Weikai; Yue, Chunyan; Ma, Yan; Peng, Hailin; Wei, Zewen; Hu, Zhiyuan

2018-03-01

EGFR mutation companion diagnostics have proved crucial for the efficacy of tyrosine kinase inhibitor targeted cancer therapies. Uncovering multiple mutations that occur in a minority of EGFR-mutated cells, which may be masked by noise from the majority of un-mutated cells, is becoming an urgent clinical requirement. Here we present the validation of a microfluidic-chip-based method for detecting EGFR multi-mutations at the single-cell level. By trapping and immunofluorescently imaging single cells in specifically designed silicon microwells, the EGFR-expressing cells were easily identified. By lysing single cells in situ, the cell lysates of EGFR-expressing cells were retrieved without cross-contamination. Because noise from cells without EGFR expression was excluded, simple and cost-effective Sanger sequencing, rather than expensive deep sequencing of the whole cell population, could be used to discover multi-mutations. We verified the new method by precisely detecting the three most important EGFR drug-related mutations in a sample in which EGFR-mutated cells account for only a small percentage of the whole cell population. The microfluidic chip is capable of revealing not only the existence of specific EGFR multi-mutations, but also other valuable single-cell-level information: in which specific cells the mutations occurred, and whether different mutations coexist in the same cells. This microfluidic chip constitutes a promising way to make simple and cost-effective Sanger sequencing a routine test before targeted cancer therapy.
Analysis of immunoglobulin heavy and light chain variable genes in post-transplant lymphoproliferative disorders.

PubMed

Capello, Daniela; Cerri, Michaela; Muti, Giuliana; Lucioni, Marco; Oreste, Pierluigi; Gloghini, Annunziata; Berra, Eva; Deambrogi, Clara; Franceschetti, Silvia; Rossi, Davide; Alabiso, Oscar; Morra, Enrica; Rambaldi, Alessandro; Carbone, Antonino; Paulli, Marco; Gaidano, Gianluca

2006-12-01

Post-transplant lymphoproliferative disorders (PTLD) derive from antigen-experienced B-cells and represent a major complication of solid organ transplantation. We characterized usage, mutation frequency and mutation pattern of immunoglobulin variable (IGV) gene rearrangements in 50 PTLD (polymorphic PTLD, n=10; diffuse large B-cell lymphoma, n=35; and Burkitt/Burkitt-like lymphoma, n=5). Among PTLD yielding clonal IGV amplimers, a functional IGV heavy chain (IGHV) rearrangement was found in 40/50 (80.0%) cases, whereas a potentially functional IGV light chain rearrangement was identified in 36/46 (78.3%) PTLD. By combining IGHV and IGV light chain rearrangements, 10/50 (20.0%) PTLD carried crippling mutations, precluding expression of a functional B-cell receptor (BCR). Immunohistochemistry showed detectable expression of IG light chains in only 18/43 (41.9%) PTLD. Failure to detect a functional IGV rearrangement associated with lack of IGV expression. Our data suggest that a large fraction of PTLD arise from germinal centre (GC)-experienced B-cells that display impaired BCR. Since a functional BCR is required for normal B-cell survival during GC transit, PTLD development may implicate rescue from apoptosis and expansion of B-cells that have failed the GC reaction. The high frequency of IGV loci inactivation appears to be a peculiar feature of PTLD among immunodeficiency-associated lymphoproliferations.

Chronic lymphocytic leukemia with isochromosome 17q: An aggressive subgroup associated with TP53 mutations and complex karyotypes.

PubMed

Collado, Rosa; Puiggros, Anna; López-Guerrero, José Antonio; Calasanz, Ma José; Larráyoz, Ma José; Ivars, David; García-Casado, Zaida; Abella, Eugènia; Orero, Ma Teresa; Talavera, Elisabet; Oliveira, Ana Carla; Hernández-Rivas, Jesús Ma; Hernández-Sánchez, María; Luño, Elisa; Valiente, Alberto; Grau, Javier; Portal, Inmaculada; Gardella, Santiago; Salgado, Anna Camino; Giménez, Ma Teresa; Ardanaz, Ma Teresa; Campeny, Andrea; Hernández, José Julio; Álvarez, Sara; Espinet, Blanca; Carbonell, Félix

2017-11-28

Although isochromosome 17q [i(17q)] is frequently detected in hematological malignancies, few studies have assessed its clinical role in chronic lymphocytic leukemia (CLL). We recruited a cohort of 22 CLL patients with i(17q) and described their biological characteristics, mutational status of the genes TP53 and IGHV and genomic complexity. Furthermore, we analyzed the impact of the type of cytogenetic anomaly bearing the TP53 defect on the outcome of CLL patients and compared the progression-free survival (PFS) and overall survival (OS) of i(17q) cases with those of a group of 38 CLL patients harboring other 17p aberrations. We detected IGHV somatic hypermutation in all assessed patients, and TP53 mutations were observed in 71.4% of the cases. Patients with i(17q) were more commonly associated with complex karyotypes (CK) and tended to have a poorer OS than patients with other anomalies affecting 17p13 (median OS, 44 vs 120 months, P = 0.084). Regarding chromosomal alterations, significant differences in the median OS were found among groups (P = 0.044). In conclusion, our findings provide new insights regarding i(17q) in CLL and show a subgroup with adverse prognostic features. Copyright © 2017 Elsevier B.V. All rights reserved.
Cytogenetic correlates of TET2 mutations in 199 patients with myeloproliferative neoplasms

PubMed Central

Hussein, Kebede; Abdel-Wahab, Omar; Lasho, Terra L.; Van Dyke, Daniel L.; Levine, Ross L.; Hanson, Curtis A.; Pardanani, Animesh; Tefferi, Ayalew

2015-01-01

TET2 is a putative tumor suppressor gene located at chromosome 4q24. TET2 mutations were recently described in several myeloid neoplasms but correlations with cytogenetic findings have not been studied. Among a recently described cohort of patients with myeloproliferative neoplasms (MPN) who underwent TET2 mutation analysis, 199 had information on karyotype at diagnosis or time of TET2 testing: 71 polycythemia vera (PV), 55 primary myelofibrosis (PMF), 43 essential thrombocythemia (ET), 13 post-PV MF, 7 post-ET MF, and 10 blast phase MPN. Forty-eight patients (24%) exhibited abnormal karyotype: 15 favorable (sole 20q-, 13q-, or +9), 8 unfavorable (complex karyotype or sole +8), and 25 “other” cytogenetic abnormalities. We found no significant difference either in the incidence or type of cytogenetic abnormalities between TET2 mutated (n = 25) and unmutated (n = 174) cases. Seventy-nine patients, including 14 with TET2 mutations, underwent follow-up cytogenetic testing and the findings were again not affected by TET2 mutational status. We conclude that TET2 mutated MPN patients are not cytogenetically different from their TET2 unmutated counterparts. PMID:19957346
Centromere Transcription: Means and Motive.

PubMed

Duda, Zachary; Trusiak, Sarah; O'Neill, Rachel

2017-01-01

The chromosome biology field at large has benefited from studies of the cell cycle components, protein cascades and genomic landscape that are required for centromere identity, assembly and stable transgenerational inheritance. Research over the past 20 years has challenged the classical descriptions of a centromere as a stable, unmutable, and transcriptionally silent chromosome component. Instead, based on studies from a broad range of eukaryotic species, including yeast, fungi, plants, and animals, the centromere has been redefined as one of the more dynamic areas of the eukaryotic genome, requiring coordination of protein complex assembly, chromatin assembly, and transcriptional activity in a cell cycle specific manner. What has emerged from more recent studies is the realization that the transcription of specific types of nucleic acids is a key process in defining centromere integrity and function. To illustrate the transcriptional landscape of centromeres across eukaryotes, we focus this review on how transcripts interact with centromere proteins, when in the cell cycle centromeric transcription occurs, and what types of sequences are being transcribed. Utilizing data from broadly different organisms, a picture emerges that places centromeric transcription as an integral component of centromere function.
Camelid Ig V genes reveal significant human homology not seen in therapeutic target genes, providing for a powerful therapeutic antibody platform

PubMed Central

Klarenbeek, Alex; Mazouari, Khalil El; Desmyter, Aline; Blanchetot, Christophe; Hultberg, Anna; de Jonge, Natalie; Roovers, Rob C; Cambillau, Christian; Spinelli, Sylvia; Del-Favero, Jurgen; Verrips, Theo; de Haard, Hans J; Achour, Ikbel

2015-01-01

Camelid immunoglobulin variable (IGV) regions were found homologous to their human counterparts; however, the germline V repertoires of camelid heavy and light chains are still incomplete and their therapeutic potential is only beginning to be appreciated. We therefore leveraged the publicly available HTG and WGS databases of Lama pacos and Camelus ferus to retrieve the germline repertoire of V genes using human IGV genes as reference. In addition, we amplified IGKV and IGLV genes to uncover the V germline repertoire of Lama glama and sequenced BAC clones covering part of the Lama pacos IGK and IGL loci. Our in silico analysis showed that camelid counterparts of all human IGKV and IGLV families and most IGHV families could be identified, based on canonical structure and sequence homology. Interestingly, this sequence homology seemed largely restricted to the Ig V genes and was far less apparent in other genes: 6 therapeutically relevant target genes differed significantly from their human orthologs. This contributed to efficient immunization of llamas with the human proteins CD70, MET, interleukin (IL)-1β and IL-6, resulting in large panels of functional antibodies. The in silico predicted human-homologous canonical folds of camelid-derived antibodies were confirmed by X-ray crystallography solving the structure of 2 selected camelid anti-CD70 and anti-MET antibodies. These antibodies showed identical fold combinations as found in the corresponding human germline V families, yielding binding site structures closely similar to those occurring in human antibodies. In conclusion, our results indicate that active immunization of camelids can be a powerful therapeutic antibody platform. PMID:26018625
In Silico Prediction Analysis of Idiotope-Driven T–B Cell Collaboration in Multiple Sclerosis

PubMed Central

Høglund, Rune A.; Lossius, Andreas; Johansen, Jorunn N.; Homan, Jane; Benth, Jūratė Šaltytė; Robins, Harlan; Bogen, Bjarne; Bremel, Robert D.; Holmøy, Trygve

2017-01-01

Memory B cells acting as antigen-presenting cells are believed to be important in multiple sclerosis (MS), but the antigen they present remains unknown. We hypothesized that B cells may activate CD4+ T cells in the central nervous system of MS patients by presenting idiotopes from their own immunoglobulin variable regions on human leukocyte antigen (HLA) class II molecules. Here, we use bioinformatics prediction analysis of B cell immunoglobulin variable regions from 11 MS patients and 6 controls with other inflammatory neurological disorders (OINDs), to assess whether the prerequisites for such idiotope-driven T–B cell collaboration are present. Our findings indicate that idiotopes from the complementarity determining region (CDR) 3 of MS patients on average have high predicted affinities for disease associated HLA-DRB1*15:01 molecules and are predicted to be endosomally processed by cathepsin S and L in positions that allow such HLA binding to occur. Additionally, complementarity determining region 3 sequences from cerebrospinal fluid (CSF) B cells from MS patients contain on average more rare T cell-exposed motifs that could potentially escape tolerance and stimulate CD4+ T cells than CSF B cells from OIND patients. Many of these features were associated with preferential use of the IGHV4 gene family by CSF B cells from MS patients. This is the first study to combine high-throughput sequencing of patient immune repertoires with large-scale prediction analysis and provides key indicators for future in vitro and in vivo analyses. PMID:29038659
In Silico Prediction Analysis of Idiotope-Driven T-B Cell Collaboration in Multiple Sclerosis.

PubMed

Høglund, Rune A; Lossius, Andreas; Johansen, Jorunn N; Homan, Jane; Benth, Jūratė Šaltytė; Robins, Harlan; Bogen, Bjarne; Bremel, Robert D; Holmøy, Trygve

2017-01-01

Memory B cells acting as antigen-presenting cells are believed to be important in multiple sclerosis (MS), but the antigen they present remains unknown. We hypothesized that B cells may activate CD4+ T cells in the central nervous system of MS patients by presenting idiotopes from their own immunoglobulin variable regions on human leukocyte antigen (HLA) class II molecules. Here, we use bioinformatics prediction analysis of B cell immunoglobulin variable regions from 11 MS patients and 6 controls with other inflammatory neurological disorders (OINDs), to assess whether the prerequisites for such idiotope-driven T-B cell collaboration are present. Our findings indicate that idiotopes from the complementarity determining region (CDR) 3 of MS patients on average have high predicted affinities for disease associated HLA-DRB1*15:01 molecules and are predicted to be endosomally processed by cathepsin S and L in positions that allow such HLA binding to occur. Additionally, complementarity determining region 3 sequences from cerebrospinal fluid (CSF) B cells from MS patients contain on average more rare T cell-exposed motifs that could potentially escape tolerance and stimulate CD4+ T cells than CSF B cells from OIND patients. Many of these features were associated with preferential use of the IGHV4 gene family by CSF B cells from MS patients. This is the first study to combine high-throughput sequencing of patient immune repertoires with large-scale prediction analysis and provides key indicators for future in vitro and in vivo analyses.
Molecular characterization of neoplastic and normal "sister" lymphoblastoid B-cell lines from chronic lymphocytic leukemia.

PubMed

Lanemo Myhrinder, Anna; Hellqvist, Eva; Bergh, Ann-Charlotte; Jansson, Mattias; Nilsson, Kenneth; Hultman, Per; Jonasson, Jon; Buhl, Anne Mette; Bredo Pedersen, Lone; Jurlander, Jesper; Klein, Eva; Weit, Nicole; Herling, Marco; Rosenquist, Richard; Rosén, Anders

2013-08-01

Chronic lymphocytic leukemia (CLL) B-cells resemble self-renewing CD5+ B-cells carrying auto/xeno-antigen-reactive B-cell receptors (BCRs) and multiple innate pattern-recognition receptors, such as Toll-like receptors and scavenger receptors. Integration of signals from BCRs with multiple surface membrane receptors determines whether the cells will be proliferating, anergic or apoptotic. To better understand the role of antigen in leukemogenesis, CLL cell lines producing monoclonal antibodies (mAbs) will facilitate structural analysis of antigens and supply DNA for genetic studies. We present here a comprehensive genotypic and phenotypic characterization of available CLL and normal B-cell-derived lymphoblastoid cell lines (LCLs) from the same individuals (n = 17). Authenticity and verification studies of CLL-patient origin were done by IGHV sequencing, fluorescence in situ hybridization (FISH) and DNA/short tandem repeat (STR) fingerprinting. Innate B-cell features, i.e. natural Ab production and CD5 receptors, were present in most CLL cell lines, but in none of the normal LCLs. This panel of immortalized CLL-derived cell lines is a valuable reference representing a renewable source of authentic Abs and DNA.
Distinct cellular pathways select germline-encoded and somatically mutated antibodies into immunological memory

PubMed Central

Kaji, Tomohiro; Ishige, Akiko; Hikida, Masaki; Taka, Junko; Hijikata, Atsushi; Kubo, Masato; Nagashima, Takeshi; Takahashi, Yoshimasa; Kurosaki, Tomohiro; Okada, Mariko; Ohara, Osamu

2012-01-01

One component of memory in the antibody system is long-lived memory B cells selected for the expression of somatically mutated, high-affinity antibodies in the T cell–dependent germinal center (GC) reaction. A puzzling observation has been that the memory B cell compartment also contains cells expressing unmutated, low-affinity antibodies. Using conditional Bcl6 ablation, we demonstrate that these cells are generated through proliferative expansion early after immunization in a T cell–dependent but GC-independent manner. They soon become resting and long-lived and display a novel distinct gene expression signature which distinguishes memory B cells from other classes of B cells. GC-independent memory B cells are later joined by somatically mutated GC descendants at roughly equal proportions and these two types of memory cells efficiently generate adoptive secondary antibody responses. Deletion of T follicular helper (Tfh) cells significantly reduces the generation of mutated, but not unmutated, memory cells early on in the response. Thus, B cell memory is generated along two fundamentally distinct cellular differentiation pathways. One pathway is dedicated to the generation of high-affinity somatic antibody mutants, whereas the other preserves germ line antibody specificities and may prepare the organism for rapid responses to antigenic variants of the invading pathogen. PMID:23027924
Quantitative protein expression analysis of CLL B cells from mutated and unmutated IgV(H) subgroups using acid-cleavable isotope-coded affinity tag reagents.

PubMed

Barnidge, David R; Jelinek, Diane F; Muddiman, David C; Kay, Neil E

2005-01-01

Relative protein expression levels were compared in leukemic B cells from two patients with chronic lymphocytic leukemia (CLL) having either mutated (M-CLL) or unmutated (UM-CLL) immunoglobulin variable heavy chain genes (IgV(H)). Cells were separated into cytosol and membrane protein fractions then labeled with acid-cleavable ICAT reagents (cICAT). Labeled proteins were digested with trypsin then subjected to SCX and affinity chromatography followed by LC-ESI-MS/MS analysis on a linear ion trap mass spectrometer. A total of 9 proteins from the cytosol fraction and 4 from the membrane fraction showed a 3-fold or greater difference between M-CLL and UM-CLL and a subset of these were examined by Western blot where results concurred with cICAT abundance ratios. The abundance of one of the proteins in particular, the mitochondrial membrane protein cytochrome c oxidase subunit COX G was examined in 6 M-CLL and 6 UM-CLL patients using western blot and results showed significantly greater levels (P < 0.001) in M-CLL patients vs UM-CLL patients. These results demonstrate that stable isotope labeling and mass spectrometry can complement 2D gel electrophoresis and gene microarray technologies for identifying putative and perhaps unique prognostic markers in CLL.
Primary cutaneous B-cell lymphoma is associated with somatically hypermutated immunoglobulin variable genes and frequent use of VH1-69 and VH4-59 segments.

PubMed

Perez, M; Pacchiarotti, A; Frontani, M; Pescarmona, E; Caprini, E; Lombardo, G A; Russo, G; Faraggiana, T

2010-03-01

Accurate assessment of the somatic mutational status of clonal immunoglobulin variable region (IgV) genes is relevant in elucidating tumour cell origin in B-cell lymphoma; virgin B cells bear unmutated IgV genes, while germinal centre and postfollicular B cells carry mutated IgV genes. Furthermore, biases in the IgV repertoire and distribution pattern of somatic mutations indicate a possible antigen role in the pathogenesis of B-cell malignancies. This work investigates the cellular origin and antigenic selection in primary cutaneous B-cell lymphoma (PCBCL). We analysed the nucleotide sequence of clonal IgV heavy-chain gene (IgVH) rearrangements in 51 cases of PCBCL (25 follicle centre, 19 marginal zone and seven diffuse large B-cell lymphoma, leg-type) and compared IgVH sequences with their closest germline segment in the GenBank database. Molecular data were then correlated with histopathological features. We showed that all but one of the 51 IgVH sequences analysed exhibited extensive somatic hypermutations. The detected mutation rate ranged from 1.6% to 21%, with a median rate of 9.8% and was independent of PCBCL histotype. Calculation of antigen-selection pressure showed that 39% of the mutated IgVH genes displayed a number of replacement mutations and silent mutations in a pattern consistent with antigenic selection. Furthermore, two segments, VH1-69 (12%) and VH4-59 (14%), were preferentially used in our case series. Data indicate that neoplastic B cells of PCBCL have experienced germinal centre reaction and also suggest that the involvement of IgVH genes is not entirely random in PCBCL and that common antigen epitopes could be pathologically relevant in cutaneous lymphomagenesis.
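The mutation rate reported in the abstract above is obtained by comparing each IgVH sequence with its closest germline segment. A minimal Python sketch of that kind of calculation follows. It assumes the two sequences have already been aligned to equal length (with `-` for gaps), and the function name `mutation_rate` and the toy sequences are hypothetical illustrations, not the authors' pipeline or real IGHV data.

```python
def mutation_rate(sequence: str, germline: str) -> float:
    """Return the percentage of comparable aligned positions that differ.

    Assumption: both strings are pre-aligned and of equal length.
    Positions holding a gap ('-') or an ambiguous base ('N') in either
    string are skipped rather than counted as mutations.
    """
    if len(sequence) != len(germline):
        raise ValueError("sequences must be pre-aligned to equal length")

    compared = 0
    mismatches = 0
    for s, g in zip(sequence.upper(), germline.upper()):
        if s in "-N" or g in "-N":
            continue  # not a comparable position
        compared += 1
        if s != g:
            mismatches += 1

    if compared == 0:
        raise ValueError("no comparable positions")
    return 100.0 * mismatches / compared


# Toy example (hypothetical 10-nt strings): one mismatch in ten positions.
toy_rate = mutation_rate("ACGTACGTAC", "ACGTACGTAA")
```

Dedicated tools perform the germline assignment and alignment in practice; this sketch covers only the final percentage step.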
Sequence intrinsic somatic mutation mechanisms contribute to affinity maturation of VRC01-class HIV-1 broadly neutralizing antibodies

PubMed Central

Hwang, Joyce K.; Wang, Chong; Du, Zhou; Meyers, Robin M.; Kepler, Thomas B.; Neuberg, Donna; Kwong, Peter D.; Mascola, John R.; Joyce, M. Gordon; Bonsignori, Mattia; Haynes, Barton F.; Yeap, Leng-Siew; Alt, Frederick W.

2017-01-01

Variable regions of Ig chains provide the antigen recognition portion of B-cell receptors and derivative antibodies. Ig heavy-chain variable region exons are assembled developmentally from V, D, J gene segments. Each variable region contains three antigen-contacting complementarity-determining regions (CDRs), with CDR1 and CDR2 encoded by the V segment and CDR3 encoded by the V(D)J junction region. Antigen-stimulated germinal center (GC) B cells undergo somatic hypermutation (SHM) of V(D)J exons followed by selection for SHMs that increase antigen-binding affinity. Some HIV-1–infected human subjects develop broadly neutralizing antibodies (bnAbs), such as the potent VRC01-class bnAbs, that neutralize diverse HIV-1 strains. Mature VRC01-class bnAbs, including VRC-PG04, accumulate very high SHM levels, a property that hinders development of vaccine strategies to elicit them. Because many VRC01-class bnAb SHMs are not required for broad neutralization, high overall SHM may be required to achieve certain functional SHMs. To elucidate such requirements, we used a V(D)J passenger allele system to assay, in mouse GC B cells, sequence-intrinsic SHM-targeting rates of nucleotides across substrates representing maturation stages of human VRC-PG04. We identify rate-limiting SHM positions for VRC-PG04 maturation, as well as SHM hotspots and intrinsically frequent deletions associated with SHM. We find that mature VRC-PG04 has low SHM capability due to hotspot saturation but also demonstrate that generation of new SHM hotspots and saturation of existing hotspot regions (e.g., CDR3) does not majorly influence intrinsic SHM in unmutated portions of VRC-PG04 progenitor sequences. We discuss implications of our findings for bnAb affinity maturation mechanisms. PMID:28747530
IgVH gene analysis suggests that peritoneal B cells do not contribute to the gut immune system in man.

PubMed

Boursier, Laurent; Farstad, Inger Nina; Mellembakken, Jan Roar; Brandtzaeg, Per; Spencer, Jo

2002-09-01

The contribution of peritoneal B cells to the intestinal lamina propria plasma cell population is well documented in mice, but unknown in humans. We have analyzed immunoglobulin (Ig) genes of human peritoneal B cells, because such genes show distinctive characteristics in mucosal B cells, particularly highly mutated variable regions. Here, we report the characteristics of variable region genes used by IgM, IgA and IgG in peritoneal cells. We focused on the properties of IgV(H)4-34 to allow comparisons of like-with-like between different isotypes and cells from different immune compartments. We observed that the IgM genes were mostly unmutated, and that the mutated subset had fewer mutations than would be expected in a mucosal B cell population. Likewise, the IgV(H)4-34 genes used by IgA and IgG from peritoneal B cells had significantly lower numbers of mutations than observed in the mucosal counterparts. Other trends observed, while not reaching statistical significance, followed the trend of peripheral B cells. The peritoneal B cell population had more IgA1 than IgA2 sequences, and there was no dominance of J(H)4 in the IgA from peritoneum or spleen, in contrast to the mucosal sequences. Overall, this study suggested that human peritoneal B cells are either peripheral or mixed in origin; they are unlikely to represent an inductive compartment for the mucosal B cell system.
Molecular dynamics reveal BCR-ABL1 polymutants as a unique mechanism of resistance to PAN-BCR-ABL1 kinase inhibitor therapy

PubMed Central

Gibbons, Don L.; Pricl, Sabrina; Posocco, Paola; Laurini, Erik; Fermeglia, Maurizio; Sun, Hanshi; Talpaz, Moshe; Donato, Nicholas; Quintás-Cardama, Alfonso

2014-01-01

The acquisition of mutations within the BCR-ABL1 kinase domain is frequently associated with tyrosine kinase inhibitor (TKI) failure in chronic myeloid leukemia. Sensitive sequencing techniques have revealed a high prevalence of compound BCR-ABL1 mutations (polymutants) in patients failing TKI therapy. To investigate the molecular consequences of such complex mutant proteins with regards to TKI resistance, we determined by cloning techniques the presence of polymutants in a cohort of chronic-phase patients receiving imatinib followed by dasatinib therapy. The analysis revealed a high frequency of polymutant BCR-ABL1 alleles even after failure of frontline imatinib, and also the progressive exhaustion of the pool of unmutated BCR-ABL1 alleles over the course of sequential TKI therapy. Molecular dynamics analyses of the most frequent polymutants in complex with TKIs revealed the basis of TKI resistance. Modeling of BCR-ABL1 in complex with the potent pan-BCR-ABL1 TKI ponatinib highlighted potentially effective therapeutic strategies for patients carrying these recalcitrant and complex BCR-ABL1 mutant proteins while unveiling unique mechanisms of escape to ponatinib therapy. PMID:24550512
LDOC1 mRNA is differentially expressed in chronic lymphocytic leukemia and predicts overall survival in untreated patients

PubMed Central

Duzkale, Hatice; Schweighofer, Carmen D.; Coombes, Kevin R.; Barron, Lynn L.; Ferrajoli, Alessandra; O'Brien, Susan; Wierda, William G.; Pfeifer, John; Majewski, Tadeusz; Czerniak, Bogdan A.; Jorgensen, Jeffrey L.; Medeiros, L. Jeffrey; Freireich, Emil J; Keating, Michael J.

2011-01-01

We previously identified LDOC1 as one of the most significantly differentially expressed genes in untreated chronic lymphocytic leukemia (CLL) patients with respect to the somatic mutation status of the immunoglobulin heavy-chain variable region genes. However, little is known about the normal function of LDOC1, its contribution to the pathophysiology of CLL, or its prognostic significance. In this study, we have investigated LDOC1 mRNA expression in a large cohort of untreated CLL patients, as well as in normal peripheral blood B-cell (NBC) subsets and primary B-cell lymphoma samples. We have confirmed that LDOC1 is dramatically down-regulated in mutated CLL cases compared with unmutated cases, and have identified a new splice variant, LDOC1S. We show that LDOC1 is expressed in NBC subsets (naive > memory), suggesting that it may play a role in normal B-cell development. It is also expressed in primary B-cell lymphoma samples, in which its expression is associated with somatic mutation status. In CLL, we show that high levels of LDOC1 correlate with biomarkers of poor prognosis, including cytogenetic markers, unmutated somatic mutation status, and ZAP70 expression. Finally, we demonstrate that LDOC1 mRNA expression is an excellent predictor of overall survival in untreated CLL patients. PMID:21310924
BCR ligation induced by IgM stimulation results in gene expression and functional changes only in IgV(H) unmutated chronic lymphocytic leukemia (CLL) cells.

PubMed

Guarini, Anna; Chiaretti, Sabina; Tavolaro, Simona; Maggio, Roberta; Peragine, Nadia; Citarella, Franca; Ricciardi, Maria Rosaria; Santangelo, Simona; Marinelli, Marilisa; De Propris, Maria Stefania; Messina, Monica; Mauro, Francesca Romana; Del Giudice, Ilaria; Foà, Robert

2008-08-01

Chronic lymphocytic leukemia (CLL) patients exhibit a variable clinical course. To investigate the association between clinicobiologic features and responsiveness of CLL cells to anti-IgM stimulation, we evaluated gene expression changes and modifications in cell-cycle distribution, proliferation, and apoptosis of IgV(H) mutated (M) and unmutated (UM) samples upon BCR cross-linking. Unsupervised analysis highlighted a different response profile to BCR stimulation between UM and M samples. Supervised analysis identified several genes modulated exclusively in the UM cases upon BCR cross-linking. Functional gene groups, including signal transduction, transcription, cell-cycle regulation, and cytoskeleton organization, were up-regulated upon stimulation in UM cases. Cell-cycle and proliferation analyses confirmed that IgM cross-linking induced a significant progression into the G(1) phase and a moderate increase of proliferative activity exclusively in UM patients. Moreover, we observed only a small reduction in the percentage of subG(0/1) cells, without changes in apoptosis, in UM cases; contrariwise, a significant increase of apoptotic levels was observed in stimulated cells from M cases. These results document that a differential genotypic and functional response to BCR ligation between IgV(H) M and UM cases is operational in CLL, indicating that response to antigenic stimulation plays a pivotal role in disease progression.
A unique proteomic profile on surface IgM ligation in unmutated chronic lymphocytic leukemia

PubMed Central

Perrot, Aurore; Pionneau, Cédric; Nadaud, Sophie; Davi, Frédéric; Leblond, Véronique; Jacob, Frédéric; Merle-Béral, Hélène; Herbrecht, Raoul; Béné, Marie-Christine; Gribben, John G.; Vallat, Laurent

2011-01-01

Chronic lymphocytic leukemia (CLL) is characterized by a highly variable clinical course with 2 extreme subsets: indolent, ZAP70− and mutated immunoglobulin heavy chain gene (M-CLL); and aggressive, ZAP70+ and unmutated immunoglobulin heavy chain (UM-CLL). Given the long-term suspicion of antigenic stimulation as a primum movens in the disease, the role of the B-cell receptor has been extensively studied in various experimental settings; albeit scarcely in a comparative dynamic proteomic approach. Here we use a quantitative 2-dimensional fluorescence difference gel electrophoresis technology to compare 48 proteomic profiles of the 2 CLL subsets before and after anti-IgM ligation. Differentially expressed proteins were subsequently identified by mass spectrometry. We show that unstimulated M- and UM-CLL cells display distinct proteomic profiles. Furthermore, anti-IgM stimulation induces a specific proteomic response, more pronounced in the more aggressive CLL. Statistical analyses demonstrate several significant protein variations according to stimulation conditions. Finally, we identify an intermediate form of M-CLL cells, with an indolent profile (ZAP70−) but sharing aggressive proteomic profiles with UM-CLL cells. Collectively, this first quantitative and dynamic proteome analysis of CLL further dissects the complex molecular pathway after B-cell receptor stimulation and depicts distinct proteomic profiles, which could lead to novel molecular stratification of the disease. PMID:21602524
High expression of AID and active class switch recombination might account for a more aggressive disease in unmutated CLL patients: link with an activated microenvironment in CLL disease.

PubMed

Palacios, Florencia; Moreno, Pilar; Morande, Pablo; Abreu, Cecilia; Correa, Agustín; Porro, Valentina; Landoni, Ana Ines; Gabus, Raul; Giordano, Mirta; Dighiero, Guillermo; Pritsch, Otto; Oppezzo, Pablo

2010-06-03

Interaction of chronic lymphocytic leukemia (CLL) B cells with tissue microenvironment has been suggested to favor disease progression by promoting malignant B-cell growth. Previous work has shown expression in peripheral blood (PB) of CLL B cells of activation-induced cytidine deaminase (AID) among CLL patients with an unmutated (UM) profile of immunoglobulin genes and with ongoing class switch recombination (CSR) process. Because AID expression results from interaction with activated tissue microenvironment, we speculated whether the small subset with ongoing CSR is responsible for high levels of AID expression and could be derived from this particular microenvironment. In this work, we quantified AID expression and ongoing CSR in PB of 50 CLL patients and characterized the expression of different molecules related to microenvironment interaction. Our results show that among UM patients (1) high AID expression is restricted to the subpopulation of tumoral cells undergoing CSR; (2) this small subset expresses high levels of proliferation, antiapoptotic and progression markers (Ki-67, c-myc, Bcl-2, CD49d, and CCL3/4 chemokines). Overall, this work outlines the importance of a cellular subset in PB of UM CLL patients with a poor clinical outcome, high AID levels, and ongoing CSR, whose presence might be a hallmark of a recent contact with the microenvironment.
TET2 mutations in B cells of patients affected by angioimmunoblastic T-cell lymphoma.

PubMed

Schwartz, Friederike H; Cai, Qian; Fellmann, Eva; Hartmann, Sylvia; Mäyränpää, Mikko I; Karjalainen-Lindsberg, Marja-Liisa; Sundström, Christer; Scholtysik, René; Hansmann, Martin-Leo; Küppers, Ralf

2017-06-01

Angioimmunoblastic T-cell lymphomas (AITLs) frequently carry mutations in the TET2 and IDH2 genes. TET2 mutations represent early genetic lesions as they had already been detected in haematopoietic precursor cells of AITL patients. We show by analysis of whole-tissue sections and microdissected PD1+ cells that the frequency of TET2-mutated AITL is presumably even higher than reported (12/13 cases in our collection; 92%). In two-thirds of informative AITLs (6/9), a fraction of B cells was also TET2-mutated. Investigation of four AITLs by TET2 and IGHV gene sequencing of single microdissected B cells showed that between 10% and 60% of polyclonal B cells in AITL lymph nodes harboured the identical TET2 mutations of the respective T-cell lymphoma clone. Thus, TET2-mutated haematopoietic precursor cells in AITL patients not only give rise to the T-cell lymphoma but also generate a large population of mutated mature B cells. Future studies will show whether this is a reason why AITL patients frequently also develop B-cell lymphomas. Copyright © 2017 Pathological Society of Great Britain and Ireland. Published by John Wiley & Sons, Ltd.
LPL is the strongest prognostic factor in a comparative analysis of RNA-based markers in early chronic lymphocytic leukemia.

PubMed

Kaderi, Mohd Arifin; Kanduri, Meena; Buhl, Anne Mette; Sevov, Marie; Cahill, Nicola; Gunnarsson, Rebeqa; Jansson, Mattias; Smedby, Karin Ekström; Hjalgrim, Henrik; Jurlander, Jesper; Juliusson, Gunnar; Mansouri, Larry; Rosenquist, Richard

2011-08-01

The expression levels of LPL, ZAP70, TCL1A, CLLU1 and MCL1 have recently been proposed as prognostic factors in chronic lymphocytic leukemia. However, few studies have systematically compared these different RNA-based markers. Using real-time quantitative PCR, we measured the mRNA expression levels of these genes in unsorted samples from 252 newly diagnosed chronic lymphocytic leukemia patients and correlated our data with established prognostic markers (for example Binet stage, CD38, IGHV gene mutational status and genomic aberrations) and clinical outcome. High expression levels of all RNA-based markers, except MCL1, predicted shorter overall survival and time to treatment, with LPL being the most significant. In multivariate analysis including the RNA-based markers, LPL expression was the only independent prognostic marker for overall survival and time to treatment. When studying LPL expression and the established markers, LPL expression retained its independent prognostic strength for overall survival. All of the RNA-based markers, albeit with varying ability, added prognostic information to established markers, with LPL expression giving the most significant results. Notably, high LPL expression predicted a worse outcome in good-prognosis subgroups, such as patients with mutated IGHV genes, Binet stage A, CD38 negativity or favorable cytogenetics. In particular, the combination of LPL expression and CD38 could further stratify Binet stage A patients. LPL expression is the strongest RNA-based prognostic marker in chronic lymphocytic leukemia that could potentially be applied to predict outcome in the clinical setting, particularly in the large group of patients with favorable prognosis.
Partial versus Productive Immunoglobulin Heavy Locus Rearrangements in Chronic Lymphocytic Leukemia: Implications for B-Cell Receptor Stereotypy

PubMed Central

Tsakou, Eugenia; Agathagelidis, Andreas; Boudjoghra, Myriam; Raff, Thorsten; Dagklis, Antonis; Chatzouli, Maria; Smilevska, Tatjana; Bourikas, George; Merle-Beral, Helene; Manioudaki-Kavallieratou, Eleni; Anagnostopoulos, Achilles; Brüggemann, Monika; Davi, Frederic; Stamatopoulos, Kostas; Belessi, Chrysoula

2012-01-01

The frequent occurrence of stereotyped heavy complementarity-determining region 3 (VH CDR3) sequences among unrelated cases with chronic lymphocytic leukemia (CLL) is widely taken as evidence for antigen selection. Stereotyped VH CDR3 sequences are often defined by the selective association of certain immunoglobulin heavy diversity (IGHD) genes in specific reading frames with certain immunoglobulin heavy joining (IGHJ) genes. To gain insight into the mechanisms underlying VH CDR3 restrictions and also determine the developmental stage when restrictions in VH CDR3 are imposed, we analyzed partial IGHD-IGHJ rearrangements (D-J) in 829 CLL cases and compared the productively rearranged D-J joints (that is, in-frame junctions without junctional stop codons) to (a) the productive immunoglobulin heavy variable (IGHV)-IGHD-IGHJ rearrangements (V-D-J) from the same cases and (b) 174 D-J rearrangements from 160 precursor B-cell acute lymphoblastic leukemia cases (pre-B acute lymphoblastic leukemia [ALL]). Partial D-J rearrangements were detected in 272/829 CLL cases (32.8%). Sequence analysis was feasible in 238 of 272 D-J rearrangements; 198 of 238 (83.2%) were productively rearranged. The D-J joints in CLL did not differ significantly from those in pre-B ALL, except for higher frequency of the IGHD7-27 and IGHJ6 genes in the latter. Among CLL carrying productively rearranged D-J, comparison of the IGHD gene repertoire in productive V-D-J versus D-J revealed the following: (a) overuse of IGHD reading frames encoding hydrophilic peptides among V-D-J and (b) selection of the IGHD3-3 and IGHD6-19 genes in V-D-J junctions. These results document that the IGHD and IGHJ gene biases in the CLL expressed VH CDR3 repertoire are not stochastic but are directed by selection operating at the immunoglobulin protein level. PMID:21968789
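The abstract above defines a productively rearranged D-J joint as an in-frame junction without junctional stop codons. The sketch below shows that test in Python under simplifying assumptions: the junction is supplied as a nucleotide string that already starts in the intended reading frame, and "in-frame" is reduced to a length divisible by three. The function names and the toy junctions are hypothetical and do not reproduce the authors' analysis. The last two lines only recompute the percentages quoted in the abstract.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def is_productive(junction: str) -> bool:
    """Simplified productivity test for a junction nucleotide string.

    Assumptions: the string begins at codon position 1, and 'in-frame'
    means its length is a non-zero multiple of three.
    """
    seq = junction.upper()
    if len(seq) == 0 or len(seq) % 3 != 0:
        return False  # out of frame under the stated assumption
    codons = (seq[i:i + 3] for i in range(0, len(seq), 3))
    return not any(codon in STOP_CODONS for codon in codons)


def percent(part: int, whole: int) -> float:
    """Percentage rounded to one decimal place, as reported in the abstract."""
    return round(100.0 * part / whole, 1)


# Counts taken from the abstract.
dj_detected = percent(272, 829)    # partial D-J rearrangements among CLL cases
dj_productive = percent(198, 238)  # productive joints among sequenced D-J
```

For example, the hypothetical junction `GGGTACTGGTAC` passes the test, while `GGGTAGTGG` fails because its second codon is a stop codon.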

[Expression of BAG3 Gene in Acute Myeloid Leukemia and Its Prognostic Value].

PubMed

Zhu, Hua-Yuan; Fu, Yuan; Wu, Wei; Xu, Jia-Dai; Chen, Ting-Mei; Qiao, Chun; Li, Jian-Yong; Liu, Peng

2015-08-01

To investigate the expression of BAG3 gene in acute myeloid leukemia (AML) and its prognostic value. Real-time quantitative RT-PCR was used to detect the expression of BAG3 mRNA in 88 previously untreated AML patients. The correlation of BAG3 expression level with clinical characteristics and known prognostic markers of AML was analyzed. In 88 patients with AML, the expression of BAG3 mRNA in NPM1 mutated AML patients was obviously lower than that in NPM1 unmutated patients (P = 0.018). The expression level of BAG3 mRNA was not related to clinical parameters, such as age, sex, FAB subtype, WBC count, and extramedullary presentation, or to prognostic factors including cytogenetics, FLT3-ITD, c-kit and CEBPα mutation status (P > 0.05). The expression level of BAG3 had no obvious effect on complete remission (CR) of patients in first treatment. The expression level of BAG3 in non-M3 patients was higher than that in relapsed patients (P = 0.036). The expression level of BAG3 had no effect on overall survival (OS) of patients. The expression level of BAG3 does not correlate with known prognostic markers of AML; only the expression level of BAG3 in NPM1 mutated patients is lower than that in NPM1 unmutated patients. The expression level of BAG3 has no effect on OS of AML patients, and BAG3 cannot be defined as a prognostic marker in AML.
Activation of the PI3K/AKT pathway by microRNA-22 results in CLL B-cell proliferation.

PubMed

Palacios, F; Abreu, C; Prieto, D; Morande, P; Ruiz, S; Fernández-Calero, T; Naya, H; Libisch, G; Robello, C; Landoni, A I; Gabus, R; Dighiero, G; Oppezzo, P

2015-01-01

Chronic lymphocytic leukemia (CLL) is characterized by accumulation of clonal B cells arrested in G0/G1 stages that coexist, in different proportions, with proliferative B cells. Understanding the crosstalk between the proliferative subsets and their milieu could provide clues on CLL biology. We previously identified one of these subpopulations in the peripheral blood from unmutated patients that appears to be a hallmark of a progressive disease. Aiming to characterize the molecular mechanism underlying this proliferative behavior, we performed gene expression analysis comparing the global mRNA and microRNA expression of this leukemic subpopulation with that of its quiescent counterparts. Our results suggest that proliferation of this fraction depends on microRNA-22 overexpression that induces phosphatase and tensin homolog downregulation and phosphoinositide 3-kinase (PI3K)/AKT pathway activation. Transfection experiments demonstrated that miR-22 overexpression in CLL B cells switches on PI3K/AKT, leading to downregulation of p27(-Kip1) and overexpression of Survivin and Ki-67 proteins. We also demonstrated that this pathway could be triggered by microenvironment signals like CD40 ligand/interleukin-4 and, more importantly, that this regulatory loop is also present in lymph nodes from progressive unmutated patients. Altogether, these results underline the key role of PI3K/AKT pathway in the generation of the CLL proliferative pool and provide additional rationale for the usage of PI3K inhibitors.
Primary myelofibrosis with or without mutant MPL: comparison of survival and clinical features involving 603 patients.

PubMed

Pardanani, A; Guglielmelli, P; Lasho, T L; Pancrazzi, A; Finke, C M; Vannucchi, A M; Tefferi, A

2011-12-01

MPL and JAK2V617F mutation analysis was performed in 603 patients with primary myelofibrosis (PMF) seen at the Mayo Clinic, USA (n=329) or University of Florence, Italy (n=274). Mutant MPL was detected in 49 (8.1%) patients and JAK2V617F in 350 (58%); 4 patients showed both mutations. MPLW515L/K was the commonest mutation; 2 patients showed novel mutations (L513ins and Q516-P518insAAAA). The US and Italy patient cohorts were separately analyzed for comparison of survival and clinical features between MPL-mutated, JAK2-mutated and JAK2/MPL-unmutated cases. JAK2/MPL-unmutated patients were significantly younger than their JAK2-mutated counterparts, in both patient cohorts (P<0.01). In the Florence only cohort, the presence of mutant MPL was associated with older age (P<0.01) and constitutional symptoms (P=0.04) and JAK2V617F with higher hemoglobin (P<0.01) and leukocyte (P=0.03) count; neither patient cohort showed significant associations with platelet count, hemoglobin <10 g/dl, abnormal/unfavorable karyotype, spleen size or prognostic score distribution. To date, 240 deaths and 79 leukemic transformations have been documented among all 603 study patients. Multivariable analysis disclosed no significant difference in overall or leukemia-free survival between the three molecular subgroups. We conclude that the presence of mutant MPL has narrow and inconsistent phenotypic effect in PMF and does not influence overall or leukemia-free survival.
Loss-of-function CARD8 mutation causes NLRP3 inflammasome activation and Crohn's disease.

PubMed

Mao, Liming; Kitani, Atsushi; Similuk, Morgan; Oler, Andrew J; Albenberg, Lindsey; Kelsen, Judith; Aktay, Atiye; Quezado, Martha; Yao, Michael; Montgomery-Recht, Kim; Fuss, Ivan J; Strober, Warren

2018-05-01

In these studies, we evaluated the contribution of the NLRP3 inflammasome to Crohn's disease (CD) in a kindred containing individuals having a missense mutation in CARD8, a protein known to inhibit this inflammasome. Whole exome sequencing and PCR studies identified the affected individuals as having a V44I mutation in a single allele of the T60 isoform of CARD8. The serum levels of IL-1β in the affected individuals were increased compared with those in healthy controls, and their peripheral monocytes produced increased amounts of IL-1β when stimulated by NLRP3 activators. Immunoblot studies probing the basis of these findings showed that mutated T60 CARD8 failed to downregulate the NLRP3 inflammasome because it did not bind to NLRP3 and inhibit its oligomerization. In addition, these studies showed that mutated T60 CARD8 exerted a dominant-negative effect by its capacity to bind to and form oligomers with unmutated T60 or T48 CARD8 that impeded their binding to NLRP3. Finally, inflammasome activation studies revealed that intact but not mutated CARD8 prevented NLRP3 deubiquitination and serine dephosphorylation. CD due to a CARD8 mutation was not effectively treated by anti-TNF-α, but did respond to IL-1β inhibitors. Thus, patients with anti-TNF-α-resistant CD may respond to this treatment option.
A Low Frequency of Losses in 11q Chromosome Is Associated with Better Outcome and Lower Rate of Genomic Mutations in Patients with Chronic Lymphocytic Leukemia

PubMed Central

Rodríguez-Vicente, Ana Eugenia; Grossmann, Vera; Collado, Rosa; Heras, Cecilia; Puiggros, Anna; Martín, Ana África; Puig, Noemí; Benito, Rocío; Robledo, Cristina; Delgado, Julio; González, Teresa; Queizán, José Antonio; Galende, Josefina; de la Fuente, Ignacio; Martín-Núñez, Guillermo; Alonso, José María; Abrisqueta, Pau; Luño, Elisa; Marugán, Isabel; González-Gascón, Isabel; Bosch, Francesc; Kohlmann, Alexander; González, Marcos; Espinet, Blanca; Hernández-Rivas, Jesús María

2015-01-01

To analyze the impact of the 11q deleted (11q-) cells in CLL patients on the time to first therapy (TFT) and overall survival (OS), 2,493 patients with CLL were studied. 242 patients (9.7%) had 11q-. Fluorescence in situ hybridization (FISH) studies showed a threshold of 40% of deleted cells to be optimal for showing clinical differences in terms of TFT and OS within 11q- CLLs. In patients with ≥40% of losses in 11q (11q-H) (74%), the median TFT was 19 months compared with 44 months in CLL patients with <40% del(11q) (11q-L) (P<0.0001). In the multivariate analysis, only the presence of 11q-L, mutated IGHV status, early Binet stage and absence of extended lymphadenopathy were associated with longer TFT. Patients with 11q-H had an OS of 90 months, while in the 11q-L group the OS was not reached (P = 0.008). The absence of splenomegaly (P = 0.02), low LDH (P = 0.018) or β2M (P = 0.006), and the presence of 11q-L (P = 0.003) were associated with a longer OS. In addition, to detect the presence of mutations in the ATM, TP53, NOTCH1, SF3B1, MYD88, FBXW7, XPO1 and BIRC3 genes, a select cohort of CLL patients with losses in 11q was sequenced by next-generation sequencing of amplicons. Eighty percent of CLLs with 11q- showed mutations, and fewer patients with low frequencies of 11q- had mutations among the genes examined (50% vs 94.1%, P = 0.023). In summary, CLL patients with <40% of 11q- had a long TFT and OS that could be associated with the presence of fewer mutated genes. PMID:26630574
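The 40% cut-off described in this abstract amounts to a simple grouping rule, sketched below under stated assumptions. The function names `classify_11q` and `percent` are hypothetical, the input is assumed to be the percentage of cells carrying the 11q deletion by FISH, and the sketch says nothing about how the study itself handled borderline or repeated measurements.

```python
def classify_11q(percent_deleted: float, threshold: float = 40.0) -> str:
    """Label an 11q- case as '11q-H' (>= threshold) or '11q-L' (< threshold).

    `percent_deleted` is assumed to be the percentage of cells with the
    11q deletion; the default threshold of 40 comes from the abstract.
    """
    if not 0.0 <= percent_deleted <= 100.0:
        raise ValueError("percent_deleted must be between 0 and 100")
    return "11q-H" if percent_deleted >= threshold else "11q-L"


def percent(part: int, whole: int) -> float:
    """Percentage of `part` in `whole`, rounded to one decimal place."""
    return round(100 * part / whole, 1)


# Hypothetical cases, not patient data from the study.
for value in (12.0, 39.9, 40.0, 85.0):
    print(value, classify_11q(value))

# Share of 11q- cases in the cohort, recomputed from the abstract's counts.
print(percent(242, 2493))
```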
A Low Frequency of Losses in 11q Chromosome Is Associated with Better Outcome and Lower Rate of Genomic Mutations in Patients with Chronic Lymphocytic Leukemia.

PubMed

Hernández, José Ángel; Hernández-Sánchez, María; Rodríguez-Vicente, Ana Eugenia; Grossmann, Vera; Collado, Rosa; Heras, Cecilia; Puiggros, Anna; Martín, Ana África; Puig, Noemí; Benito, Rocío; Robledo, Cristina; Delgado, Julio; González, Teresa; Queizán, José Antonio; Galende, Josefina; de la Fuente, Ignacio; Martín-Núñez, Guillermo; Alonso, José María; Abrisqueta, Pau; Luño, Elisa; Marugán, Isabel; González-Gascón, Isabel; Bosch, Francesc; Kohlmann, Alexander; González, Marcos; Espinet, Blanca; Hernández-Rivas, Jesús María

2015-01-01

To analyze the impact of the 11q deleted (11q-) cells in CLL patients on the time to first therapy (TFT) and overall survival (OS), 2,493 patients with CLL were studied. 242 patients (9.7%) had 11q-. Fluorescence in situ hybridization (FISH) studies showed a threshold of 40% of deleted cells to be optimal for showing clinical differences in terms of TFT and OS within 11q- CLLs. In patients with ≥40% of losses in 11q (11q-H) (74%), the median TFT was 19 months compared with 44 months in CLL patients with <40% del(11q) (11q-L) (P<0.0001). In the multivariate analysis, only the presence of 11q-L, mutated IGHV status, early Binet stage and absence of extended lymphadenopathy were associated with longer TFT. Patients with 11q-H had an OS of 90 months, while in the 11q-L group the OS was not reached (P = 0.008). The absence of splenomegaly (P = 0.02), low LDH (P = 0.018) or β2M (P = 0.006), and the presence of 11q-L (P = 0.003) were associated with a longer OS. In addition, to detect the presence of mutations in the ATM, TP53, NOTCH1, SF3B1, MYD88, FBXW7, XPO1 and BIRC3 genes, a select cohort of CLL patients with losses in 11q was sequenced by next-generation sequencing of amplicons. Eighty percent of CLLs with 11q- showed mutations, and fewer patients with low frequencies of 11q- had mutations among the genes examined (50% vs 94.1%, P = 0.023). In summary, CLL patients with <40% of 11q- had a long TFT and OS that could be associated with the presence of fewer mutated genes.
Cell-cycle reprogramming for PI3K inhibition overrides a relapse-specific C481S BTK mutation revealed by longitudinal functional genomics in mantle cell lymphoma.

PubMed

Chiron, David; Di Liberto, Maurizio; Martin, Peter; Huang, Xiangao; Sharman, Jeff; Blecua, Pedro; Mathew, Susan; Vijay, Priyanka; Eng, Ken; Ali, Siraj; Johnson, Amy; Chang, Betty; Ely, Scott; Elemento, Olivier; Mason, Christopher E; Leonard, John P; Chen-Kiang, Selina

2014-09-01

Despite the unprecedented clinical activity of the Bruton tyrosine kinase (BTK) inhibitor ibrutinib in mantle cell lymphoma (MCL), acquired resistance is common. By longitudinal integrative whole-exome and whole-transcriptome sequencing and targeted sequencing, we identified the first relapse-specific C481S mutation at the ibrutinib binding site of BTK in MCL cells at progression following a durable response. This mutation enhanced BTK and AKT activation and tissue-specific proliferation of resistant MCL cells driven by CDK4 activation. It was absent, however, in patients with primary resistance or progression following transient response to ibrutinib, suggesting alternative mechanisms of resistance. Through synergistic induction of PIK3IP1 and inhibition of PI3K-AKT activation, prolonged early G1 arrest induced by PD 0332991 (palbociclib) inhibition of CDK4 sensitized resistant lymphoma cells to ibrutinib killing when BTK was unmutated, and to PI3K inhibitors independent of C481S mutation. These data identify a genomic basis for acquired ibrutinib resistance in MCL and suggest a strategy to override both primary and acquired ibrutinib resistance. We have discovered the first relapse-specific BTK mutation in patients with MCL with acquired resistance, but not primary resistance, to ibrutinib, and demonstrated a rationale for targeting the proliferative resistant MCL cells by inhibiting CDK4 and the cell cycle in combination with ibrutinib in the presence of BTK(WT) or a PI3K inhibitor independent of BTK mutation. As drug resistance remains a major challenge and CDK4 and PI3K are dysregulated at a high frequency in human cancers, targeting CDK4 in genome-based combination therapy represents a novel approach to lymphoma and cancer therapy. Cancer Discov; 4(9); 1022-35. ©2014 AACR. This article is highlighted in the In This Issue feature, p. 973. ©2014 American Association for Cancer Research.
Current Treatment of Chronic Lymphocytic Leukemia.

PubMed

Jamroziak, Krzysztof; Puła, Bartosz; Walewski, Jan

2017-01-01

A number of new treatment options have recently emerged for chronic lymphocytic leukemia (CLL) patients, including the Bruton's tyrosine kinase (BTK) inhibitor ibrutinib, phosphatidylinositol-3-kinase (PI3K) delta isoform inhibitor idelalisib combined with rituximab, the Bcl-2 antagonist venetoclax, and the new anti-CD20 antibodies obinutuzumab and ofatumumab. Most of these agents are already included into treatment algorithms defined by international practice guidelines, but more clinical investigations are needed to answer still remaining questions. Ibrutinib was proven as a primary choice for patients with the TP53 gene deletion/mutation, who otherwise have no active treatment available. Idelalisib with rituximab is also an active therapy, but due to increased risk of serious infections, its use in first-line treatment is limited to patients for whom ibrutinib is not an option. A new indication for ibrutinib was recently approved for older patients with comorbidities, as an alternative to the already existing indication for chlorambucil with obinutuzumab. The use of kinase inhibitors is already well established in recurrent/refractory disease. Immunochemotherapy with fludarabine, cyclophosphamide, rituximab (FCR) remains a major first-line option for many CLL patients without the TP53 gene deletion/mutation, and who have no significant comorbidities or history of infections, and is particularly effective in patients with favorable features including mutated IGHV status. There are a number of issues regarding novel therapies for CLL that need further investigation such as optimum duration of treatment with kinase inhibitors, appropriate sequencing of novel agents, mechanisms of resistance to inhibitors and response to class switching after treatment failure, along with the potential role of combinations of targeted agents.
Efficient generation of monoclonal antibodies from single rhesus macaque antibody secreting cells.

PubMed

Meng, Weixu; Li, Leike; Xiong, Wei; Fan, Xuejun; Deng, Hui; Bett, Andrew J; Chen, Zhifeng; Tang, Aimin; Cox, Kara S; Joyce, Joseph G; Freed, Daniel C; Thoryk, Elizabeth; Fu, Tong-Ming; Casimiro, Danilo R; Zhang, Ningyan; A Vora, Kalpit; An, Zhiqiang

2015-01-01

Nonhuman primates (NHPs) are used as a preclinical model for vaccine development, and the antibody profiles to experimental vaccines in NHPs can provide critical information for both vaccine design and translation to clinical efficacy. However, an efficient protocol for generating monoclonal antibodies from single antibody secreting cells of NHPs is currently lacking. In this study we established a robust protocol for cloning immunoglobulin (IG) variable domain genes from single rhesus macaque (Macaca mulatta) antibody secreting cells. A sorting strategy was developed using a panel of molecular markers (CD3, CD19, CD20, surface IgG, intracellular IgG, CD27, Ki67 and CD38) to identify the kinetics of B cell response after vaccination. Specific primers for the rhesus macaque IG genes were designed and validated using cDNA isolated from macaque peripheral blood mononuclear cells. Cloning efficiency was averaged at 90% for variable heavy (VH) and light (VL) domains, and 78.5% of the clones (n = 335) were matched VH and VL pairs. Sequence analysis revealed that diverse IGHV subgroups (for VH) and IGKV and IGLV subgroups (for VL) were represented in the cloned antibodies. The protocol was tested in a study using an experimental dengue vaccine candidate. About 26.6% of the monoclonal antibodies cloned from the vaccinated rhesus macaques react with the dengue vaccine antigens. These results validate the protocol for cloning monoclonal antibodies in response to vaccination from single macaque antibody secreting cells, which have general applicability for determining monoclonal antibody profiles in response to other immunogens or vaccine studies of interest in NHPs.
Functional Pathway Analysis Using SCNP of FLT3 Receptor Pathway Deregulation in AML Provides Prognostic Information Independent from Mutational Status

PubMed Central

Cesano, Alessandra; Putta, Santosh; Rosen, David B.; Cohen, Aileen C.; Gayko, Urte; Mathi, Kavita; Woronicz, John; Hawtin, Rachael E.; Cripe, Larry; Sun, Zhuoxin; Tallman, Martin S.; Paietta, Elisabeth

2013-01-01

FMS-like tyrosine kinase 3 receptor (FLT3) internal tandem duplication (ITD) mutations result in constitutive activation of this receptor and have been shown to increase the risk of relapse in patients with acute myeloid leukemia (AML); however, substantial heterogeneity in clinical outcomes still exists within both the ITD mutated and unmutated AML subgroups, suggesting alternative mechanisms of disease relapse not accounted by FLT3 mutational status. Single cell network profiling (SCNP) is a multiparametric flow cytometry based assay that simultaneously measures, in a quantitative fashion and at the single cell level, both extracellular surface marker levels and changes in intracellular signaling proteins in response to extracellular modulators. We previously reported an initial characterization of FLT3 ITD-mediated signaling using SCNP. Herein SCNP was applied sequentially to two separate cohorts of samples collected from elderly AML patients at diagnosis. In the first (training) study, AML samples carrying unmutated, wild-type FLT3 (FLT3 WT) displayed a wide range of induced signaling, with a fraction having signaling profiles comparable to FLT3 ITD AML samples. Conversely, the FLT3 ITD AML samples displayed more homogeneous induced signaling, with the exception of patients with low (<40%) mutational load, which had profiles comparable to FLT3 WT AML samples. This observation was then confirmed in an independent (verification) cohort. Data from the second cohort were also used to assess the association between SCNP data and disease-free survival (DFS) in the context of FLT3 and nucleophosmin (NPM1) mutational status among patients who achieved complete remission (CR) to induction chemotherapy. The combination of SCNP read outs together with FLT3 and NPM1 molecular status improved the DFS prediction accuracy of the latter. Taken together, these results emphasize the value of comprehensive functional assessment of biologically relevant signaling pathways in AML as a basis for the development of highly predictive tests for guidance of post-remission therapy. PMID:23431389
Chronic lymphocytic leukemia patients exposed to ionizing radiation due to the Chernobyl NPP accident--with focus on immunoglobulin heavy chain gene analysis.

PubMed

Abramenko, Iryna; Bilous, Nadia; Chumak, Anatoliy; Davidova, Ekaterina; Kryachok, Iryna; Martina, Zoya; Nechaev, Stanislav; Dyagil, Iryna; Bazyka, Dmytriy; Bebeshko, Vladimir

2008-04-01

Clinical data and immunoglobulin variable heavy chain (IgVH) gene configuration were analyzed in 47 CLL patients exposed to ionizing radiation (IR) due to the Chernobyl NPP accident and in 141 non-exposed patients. Clean-up workers of the second quarter of 1986 (n=19) were singled out as a separate group with the highest proportion of unmutated cases (94.4%), increased usage of the IgVH1-69 (33.3%) and IgVH3-21 (16.7%) genes, and a high frequency of secondary solid tumors (6 cases) and Richter transformation (4 cases). These preliminary data suggest that CLL in the contingent most affected by the Chernobyl NPP accident might have some specific features.
Response to lenalidomide in myelodysplastic syndromes with del(5q): influence of cytogenetics and mutations.

PubMed

Mallo, Mar; Del Rey, Mónica; Ibáñez, Mariam; Calasanz, M José; Arenillas, Leonor; Larráyoz, M José; Pedro, Carmen; Jerez, Andrés; Maciejewski, Jaroslaw; Costa, Dolors; Nomdedeu, Meritxell; Diez-Campelo, María; Lumbreras, Eva; González-Martínez, Teresa; Marugán, Isabel; Such, Esperanza; Cervera, José; Cigudosa, Juan C; Alvarez, Sara; Florensa, Lourdes; Hernández, Jesús M; Solé, Francesc

2013-07-01

Lenalidomide is an effective drug in low-risk myelodysplastic syndromes (MDS) with isolated del(5q), although not all patients respond. Studies have suggested a role for TP53 mutations and karyotype complexity in disease progression and outcome. In order to assess the impact of complex karyotypes on treatment response and disease progression in 52 lenalidomide-treated patients with del(5q) MDS, conventional G-banding cytogenetics (CC), single nucleotide polymorphism array (SNP-A), and genomic sequencing methods were used. SNP-A analysis (with control sample, lymphocytes CD3+, in 30 cases) revealed 5q losses in all cases. Other recurrent abnormalities were infrequent and were not associated with lenalidomide responsiveness. Low karyotype complexity (by CC) and a high baseline platelet count (>280 × 10⁹/l) were associated with the achievement of haematological response (P = 0·020, P = 0·013 respectively). Unmutated TP53 status showed a tendency for haematological response (P = 0·061). Complete cytogenetic response was not observed in any of the mutated TP53 cases. By multivariate analysis, the most important predictor for lenalidomide treatment failure was a platelet count <280 × 10⁹/l (Odds Ratio = 6·17, P = 0·040). This study reveals the importance of a low baseline platelet count, karyotypic complexity and TP53 mutational status for response to lenalidomide treatment. It supports the molecular study of TP53 in MDS patients treated with lenalidomide. © 2013 John Wiley & Sons Ltd.
Natural autoantibodies: from 'horror autotoxicus' to 'gnothi seauton'.

PubMed

Avrameas, S

1991-05-01

The immune system of normal unimmunized animals is characterized by the presence of B cells synthesizing and secreting mainly polyreactive, but also monoreactive, IgM and IgG natural antibodies that can react with a variety of self constituents. These antibodies, like the autoantibodies appearing in several immunopathological states, use the same genetic elements as the antibodies directed against environmental antigens, and seem to be encoded by unmutated germ-line genes. Accumulating evidence indicates that these natural auto-antibodies exert various biological roles, both related and unrelated to the immune system. In this article, Stratis Avrameas proposes that natural auto-antibodies, by interacting with the large number of self constituents present in an organism, establish an extensive dynamic network that contributes to the general homeostasis of the organism.
Wilms’ Tumor 1 Gene Mutations Independently Predict Poor Outcome in Adults With Cytogenetically Normal Acute Myeloid Leukemia: A Cancer and Leukemia Group B Study

PubMed Central

Paschka, Peter; Marcucci, Guido; Ruppert, Amy S.; Whitman, Susan P.; Mrózek, Krzysztof; Maharry, Kati; Langer, Christian; Baldus, Claudia D.; Zhao, Weiqiang; Powell, Bayard L.; Baer, Maria R.; Carroll, Andrew J.; Caligiuri, Michael A.; Kolitz, Jonathan E.; Larson, Richard A.; Bloomfield, Clara D.

2008-01-01

Purpose To analyze the prognostic impact of Wilms’ tumor 1 (WT1) gene mutations in cytogenetically normal acute myeloid leukemia (CN-AML). Patients and Methods We studied 196 adults younger than 60 years with newly diagnosed primary CN-AML, who were treated similarly on Cancer and Leukemia Group B (CALGB) protocols 9621 and 19808, for WT1 mutations in exons 7 and 9. The patients also were assessed for the presence of FLT3 internal tandem duplications (FLT3-ITD), FLT3 tyrosine kinase domain mutations (FLT3-TKD), MLL partial tandem duplications (MLL-PTD), NPM1 and CEBPA mutations, and for the expression levels of ERG and BAALC. Results Twenty-one patients (10.7%) harbored WT1 mutations. Complete remission rates were not significantly different between patients with WT1 mutations and those with unmutated WT1 (P = .36; 76% v 84%). Patients with WT1 mutations had worse disease-free survival (DFS; P < .001; 3-year rates, 13% v 50%) and overall survival (OS; P < .001; 3-year rates, 10% v 56%) than patients with unmutated WT1. In multivariable analyses, WT1 mutations independently predicted worse DFS (P = .009; hazard ratio [HR] = 2.7) when controlling for CEBPA mutational status, ERG expression level, and FLT3-ITD/NPM1 molecular-risk group (ie, FLT3-ITDnegative/NPM1mutated as low risk v FLT3-ITDpositive and/or NPM1wild-type as high risk). WT1 mutations also independently predicted worse OS (P < .001; HR = 3.2) when controlling for CEBPA mutational status, FLT3-ITD/NPM1 molecular-risk group, and white blood cell count. Conclusion We report the first evidence that WT1 mutations independently predict extremely poor outcome in intensively treated, younger patients with CN-AML. Future trials should include testing for WT1 mutations as part of molecularly based risk assessment and risk-adapted treatment stratification of patients with CN-AML. PMID:18559874
Impact of genotype on leukaemic transformation in polycythaemia vera and essential thrombocythaemia.

PubMed

Alvarez-Larrán, Alberto; Senín, Alicia; Fernández-Rodríguez, Concepción; Pereira, Arturo; Arellano-Rodrigo, Eduardo; Gómez, Montse; Ferrer-Marin, Francisca; Martínez-López, Joaquín; Camacho, Laura; Colomer, Dolors; Angona, Anna; Navarro, Blanca; Cervantes, Francisco; Besses, Carlos; Bellosillo, Beatriz; Hernández-Boluda, Juan Carlos

2017-09-01

The influence of driver mutations on leukaemic transformation was analysed in 1747 patients with polycythaemia vera or essential thrombocythaemia. With a median follow-up of 7·2 years, 349 patients died and 62 progressed to acute leukaemia or myelodysplastic syndrome. Taking death as a competing risk, CALR genotype was associated with a lower risk of transformation [subdistribution hazard ratio (SHR): 0·13, 95% confidence interval (CI): 0·2-0·9, P = 0·039], whereas JAK2 V617F showed borderline significance for higher risk (SHR: 2·05, 95% CI: 0·9-4·6, P = 0·09). Myelofibrotic transformation increased leukaemic risk, except in CALR-mutated patients. Next generation sequencing of 51 genes at the time of transformation showed additional mutations (median number: 3; range: 1-5) in 25 out of 29 (86%) assessable cases. Mutations (median: 1; range: 1-3) were detected in 67% of paired samples from the chronic phase. Leukaemia appeared in a JAK2 V617F negative clone in 17 (58%) cases, eleven of them being previously JAK2 V617F-positive. JAK2 V617F-mutated leukaemia was significantly associated with complex karyotype and acquisition of TP53 mutations, whereas EZH2 and RUNX1 mutations were more frequent in JAK2 V617F-negative leukaemia. Survival was longer in JAK2 V617F-unmutated leukaemia (343 days vs. 95 days, P = 0·003). In conclusion, CALR genotype is associated with a lower risk of leukaemic transformation. Leukaemia arising in a JAK2 V617F-negative clone is TP53 independent and shows better survival. © 2017 John Wiley & Sons Ltd.
Cell-intrinsic determinants of ibrutinib-induced apoptosis in Chronic Lymphocytic Leukemia

PubMed Central

Amin, Nisar A.; Balasubramanian, Sriram; Saiya-Cork, Kamlai; Shedden, Kerby; Hu, Nan; Malek, Sami N.

2016-01-01

Purpose Ibrutinib, a Bruton’s tyrosine kinase (BTK) inhibitor, is approved for the treatment of relapsed CLL and CLL with del17p. Mechanistically, ibrutinib interferes with BCR signaling as well as multiple CLL cell to microenvironment interactions. Given the importance of ibrutinib in the management of CLL, a deeper understanding of factors governing sensitivity and resistance is warranted. Experimental Design We studied 48 longitudinally sampled paired CLL samples, 42 of which were procured before and after standard CLL chemotherapies, and characterized them for well-studied CLL molecular traits as well as by whole exome sequencing and SNP 6.0 array profiling. We exposed these samples to 0.25 μM – 5 μM of ibrutinib ex vivo and measured apoptosis fractions as well as BCR signaling by immunoblotting. We disrupted TP53 in HG3, PGA1 and PG-EBV cell lines and measured BCR signaling and ibrutinib responses. Results CLL samples demonstrated a surprisingly wide range of ex vivo sensitivities to ibrutinib with IC50 values ranging from 0.4 μM – 9.7 μM. Unmutated IGVH status, elevated ZAP70 expression and trisomy 12 were associated with heightened sensitivity to ibrutinib treatment. Five CLL samples were substantially more resistant to ibrutinib following relapse from chemotherapy; of these, three had acquired a del17p/TP53 mutated status. A validation sample of 15 CLL carrying TP53 mutations, of which 13 carried both del17p and a TP53 mutation confirmed substantially less sensitivity to ibrutinib-induced apoptosis. Conclusions This study identifies that CLL harboring del17p/TP53 mutated cells are substantially less sensitive to ibrutinib-induced apoptosis than del17p/TP53 wild type cells. PMID:27535981
Case report: Concomitant Chronic Lymphocytic Leukaemia and Cytogenetically Normal de novo Acute Leukaemia in a Patient.

PubMed

Kajtár, Béla; Rajnics, Péter; Egyed, Miklós; Alizadeh, Hussain

2015-01-01

The simultaneous occurrence of acute myeloid leukaemia with untreated chronic lymphocytic leukemia is extremely rare. We report a case of a 74-year-old man who was evaluated for macrocytic anaemia. Based on the morphology and immunophenotyping analysis of peripheral blood, a diagnosis of chronic lymphocytic leukemia was established. Subsequently, the bone marrow examination revealed the presence of two distinct, coexisting CLL and AML clones. Cytogenetic and molecular genetic analysis detected deletion 13q14.3 and unmutated immunoglobulin variable heavy-chain in the CLL clone, only. The AML and CLL clones did not share clonality, and the AML did not involve the peripheral blood. A diagnosis of cytogenetically normal de novo AML occurring concurrently with untreated CLL has not been reported previously in English literature. © 2015 by the Association of Clinical Scientists, Inc.
Growth and gene expression are predominantly controlled by distinct regions of the human IL-4 receptor.

PubMed

Ryan, J J; McReynolds, L J; Keegan, A; Wang, L H; Garfein, E; Rothman, P; Nelms, K; Paul, W E

1996-02-01

IL-4 causes hematopoietic cells to proliferate and express a series of genes, including CD23. We examined whether IL-4-mediated growth, as measured by 4PS phosphorylation, and gene induction were similarly controlled. Studies of M12.4.1 cells expressing human IL-4R truncation mutants indicated that the region between amino acids 557-657 is necessary for full gene expression, which correlated with Stat6 DNA binding activity. This region was not required for 4PS phosphorylation. Tyrosine-to-phenylalanine mutations in the interval between amino acids 557-657 revealed that as long as one tyrosine remained unmutated, CD23 was fully induced. When all three tyrosines were mutated, the receptor was unable to induce CD23. The results indicate that growth regulation and gene expression are principally controlled by distinct regions of IL-4R.
Pre-clinical evidence of PIM kinase inhibitor activity in BCR-ABL1 unmutated and mutated Philadelphia chromosome-positive (Ph+) leukemias

PubMed Central

Curi, Dany A.; Beauchamp, Elspeth M.; Blyth, Gavin T.; Arslan, Ahmet Dirim; Donato, Nicholas J.; Giles, Francis J.; Altman, Jessica K.; Platanias, Leonidas C.

2015-01-01

We investigated the efficacy of targeting the PIM kinase pathway in Philadelphia chromosome-positive (Ph+) leukemias. We provide evidence that inhibition of PIM, with the pan-PIM inhibitor SGI-1776, results in suppression of classic PIM effectors and also elements of the mTOR pathway, suggesting interplay between PIM and mTOR signals. Our data demonstrate that PIM inhibition enhances the effects of imatinib mesylate on Ph+ leukemia cells. We also found that PIM inhibition results in suppression of leukemic cell proliferation and induction of apoptosis of Ph+ leukemia cells, including those resistant to imatinib mesylate. Importantly, inhibition of PIM results in enhanced suppression of primary leukemic progenitors from patients with CML. Altogether these findings suggest that pharmacological PIM targeting may provide a unique therapeutic approach for the treatment of Ph+ leukemias. PMID:26375673
Immunoglobulin heavy chain V-D-J gene rearrangement and mutational status in Uruguayan patients with chronic lymphocytic leukemia.

PubMed

Bianchi, Sergio; Moreno, Pilar; Landoni, Ana Inés; Naya, Hugo; Oppezzo, Pablo; Dighiero, Guillermo; Gabús, Raúl; Pritsch, Otto

2010-11-01

B-cell chronic lymphocytic leukemia (CLL) is characterized by the accumulation of long-lived circulating clonal leukemic B-cells, although the etiopathogenesis remains unclear. The incidence of CLL is variable in different regions around the world. While it is the most frequent chronic leukemia in Western countries, it has a low incidence in Asia. In this work we have investigated the immunoglobulin heavy chain gene rearrangements and mutational status in 80 Uruguayan patients with CLL, and compared these results with those obtained in other geographic regions. Our results demonstrate that Uruguayan patients with CLL display an IGHV gene usage which resembles that observed in Mediterranean countries and exhibits certain differences compared with Brazilian and Asian series, as expected, considering the ethnic basis of the Uruguayan population. This suggests that genetic influences could be important in the development and etiopathogenesis of CLL, but larger studies are necessary to substantiate this possibility.

DOE Office of Scientific and Technical Information (OSTI.GOV)

Ying, Tianlei; Prabakaran, Ponraj; Du, Lanying

The MERS-CoV is an emerging virus, which already infected more than 1,300 humans with high (~36%) mortality. Here, we show that m336, an exceptionally potent human anti-MERS-CoV antibody, is almost germline with only one somatic mutation in the heavy chain. The structure of Fab m336 in complex with the MERS-CoV receptor-binding domain reveals that its IGHV1-69-derived heavy chain provides more than 85% binding surface and that its epitope almost completely overlaps with the receptor-binding site. Analysis of antibodies from 69 healthy humans suggests an important role of the V(D)J recombination-generated junctional and allele-specific residues for achieving high affinity of binding at such low levels of somatic hypermutation. Our results also have important implications for development of vaccine immunogens based on the newly identified m336 epitope as well as for elucidation of mechanisms of neutralization by m336-like antibodies and their elicitation in vivo.
Lenalidomide maintenance after first-line therapy for high-risk chronic lymphocytic leukaemia (CLLM1): final results from a randomised, double-blind, phase 3 study.

PubMed

Fink, Anna Maria; Bahlo, Jasmin; Robrecht, Sandra; Al-Sawaf, Othman; Aldaoud, Ali; Hebart, Holger; Jentsch-Ullrich, Kathleen; Dörfel, Steffen; Fischer, Kirsten; Wendtner, Clemens-Martin; Nösslinger, Thomas; Ghia, Paolo; Bosch, Francesc; Kater, Arnon P; Döhner, Hartmut; Kneba, Michael; Kreuzer, Karl-Anton; Tausch, Eugen; Stilgenbauer, Stephan; Ritgen, Matthias; Böttcher, Sebastian; Eichhorst, Barbara; Hallek, Michael

2017-10-01

The combined use of genetic markers and detectable minimal residual disease identifies patients with chronic lymphocytic leukaemia with poor outcome after first-line chemoimmunotherapy. We aimed to assess lenalidomide maintenance therapy in these high-risk patients. In this randomised, double-blind, phase 3 study (CLLM1; CLL Maintenance 1 of the German CLL Study Group), patients older than 18 years and diagnosed with immunophenotypically confirmed chronic lymphocytic leukaemia with active disease, who responded to chemoimmunotherapy 2-5 months after completion of first-line therapy and who were assessed as having a high risk for an early progression with at least a partial response after four or more cycles of first-line chemoimmunotherapy, were eligible if they had high minimal residual disease levels or intermediate levels combined with an unmutated IGHV gene status or TP53 alterations. Patients were randomly assigned (2:1) to receive either lenalidomide (5 mg) or placebo. Randomisation was done with a fixed block size of three, and was stratified according to the minimal residual disease level achieved after first-line therapy. Maintenance was started with 5 mg daily, and was escalated to the target dose of 15 mg. If tolerated, medication was administered until disease progression. The primary endpoint was progression-free survival according to an independent review. The pre-planned interim analysis done by intention to treat was done after 20% of the calculated progression-free survival events. This study is registered with ClinicalTrials.gov, number NCT01556776; treatment in the lenalidomide group is still ongoing. Between July 5, 2012, and March 15, 2016, 468 previously untreated patients with chronic lymphocytic leukaemia were screened for the study; 379 (81%) were not eligible. 
Recruitment was closed prematurely due to poor accrual after 89 of 200 planned patients were randomly assigned: 60 (67%) enrolled patients were assigned to the lenalidomide group and 29 (33%) to the placebo group, of whom 56 (63%) received lenalidomide and 29 (33%) placebo, with a median of 11·0 (IQR 4·5-20·5) treatment cycles at data cutoff. After a median observation time of 17·9 months (IQR 9·1-28·1), the hazard ratio for progression-free survival assessed by an independent review was 0·168 (95% CI 0·074-0·379). Median progression-free survival was 13·3 months (95% CI 9·9-19·7) in the placebo group and not reached (95% CI 32·3-not evaluable) in the lenalidomide group. The most frequent adverse events were skin disorders (35 patients [63%] in the lenalidomide group vs eight patients [28%] in the placebo group), gastrointestinal disorders (34 [61%] vs eight [28%]), infections (30 [54%] vs 19 [66%]), haematological toxicity (28 [50%] vs five [17%]), and general disorders (28 [50%] vs nine [31%]). One fatal adverse event was reported in each of the treatment groups (one [2%] patient with fatal acute lymphocytic leukaemia in the lenalidomide group and one [3%] patient with fatal multifocal leukoencephalopathy in the placebo group). Lenalidomide is an efficacious maintenance therapy reducing the relative risk of progression in first-line patients with chronic lymphocytic leukaemia who do not achieve minimal residual disease negative disease state following chemoimmunotherapy approaches. The toxicity seems to be acceptable considering the poor prognosis of the eligible patients. The trial independently confirms the clinical significance of a novel, minimal residual disease-based algorithm to predict short progression-free survival, which might be incorporated in future clinical trials to identify candidates for additional maintenance treatment. Funding: Celgene Corporation. Copyright © 2017 Elsevier Ltd. All rights reserved.
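The allocation scheme described in this abstract (2:1 assignment, fixed block size of three, stratified by minimal residual disease level) can be illustrated with a short Python sketch. This is only an illustration under stated assumptions, not the trial's actual randomisation system: the function name, the stratum labels, the seed, and the number of patients per stratum are all hypothetical.

```python
import random


def block_randomise(n_patients, rng, block=("lenalidomide", "lenalidomide", "placebo")):
    """Return n_patients assignments drawn from shuffled blocks of three (2:1)."""
    assignments = []
    while len(assignments) < n_patients:
        shuffled = list(block)
        rng.shuffle(shuffled)  # random order inside each fixed-size block
        assignments.extend(shuffled)
    return assignments[:n_patients]


# Stratification: one independent allocation list per stratum.
# The stratum labels and the seed are hypothetical placeholders.
rng = random.Random(2012)
schedule = {
    stratum: block_randomise(9, rng)
    for stratum in ("high MRD", "intermediate MRD")
}
```

Because every complete block contains two lenalidomide slots and one placebo slot, the 2:1 ratio holds within each stratum after every third patient, which is the point of using a fixed block size.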
JAK2 V617F, MPL, and CALR mutations in essential thrombocythaemia and major thrombotic complications: a single-institute retrospective analysis.

PubMed

Pósfai, Éva; Marton, Imelda; Király, Péter Attila; Kotosz, Balázs; Kiss-László, Zsuzsanna; Széll, Márta; Borbényi, Zita

2015-07-01

Thrombo-haemorrhagic events are the main cause of morbidity and mortality in essential thrombocythaemia (ET). The aim of this study was to estimate the incidence of thrombotic events and the impact of the JAK2V617F, MPL (W515L, W515K, W515R, W515A and S505N) and CALR (type-1, type-2) mutations in 101 ET patients (72 females and 29 males with a mean age of 61 years) diagnosed in a Southern Hungarian regional academic centre. The incidence of major thrombosis was 13.86%. Sixty percent of the patients carried the JAK2V617F mutation. The MPL mutations were analysed by sequencing, and W515L was the only one identified, with an incidence of 3.96%. A type-2 CALR mutation was identified in 3 of the patients who had JAK2/MPL-unmutated ET. Statistical analyses revealed that the JAK2V617F mutation was associated with significantly increased platelet count (p = 0.042), haemoglobin (p < 0.001), red blood cell count (p < 0.001) and haematocrit (p < 0.001), and with hepatomegaly (p = 0.045), at diagnosis compared with JAK2V617F-negative counterparts; however, there was no significant association between the JAK2V617F mutation status (relative risk: 1.297, 95% CI 0.395-4.258; p = 0.668) and subsequent thrombotic complications. The impact of JAK2V617F, MPL W515L and CALR mutations on the clinical findings at the diagnosis of ET was obvious, but their statistically significant role in the prediction of thrombotic events could not be proven in this study. Our results indirectly support the concept that, besides the quantitative and qualitative changes in the platelets, the mechanisms leading to thrombosis are more complex and multifactorial.
Cell-Intrinsic Determinants of Ibrutinib-Induced Apoptosis in Chronic Lymphocytic Leukemia.

PubMed

Amin, Nisar A; Balasubramanian, Sriram; Saiya-Cork, Kamlai; Shedden, Kerby; Hu, Nan; Malek, Sami N

2017-02-15

Purpose: Ibrutinib, a Bruton tyrosine kinase (BTK) inhibitor, is approved for the treatment of relapsed chronic lymphocytic leukemia (CLL) and CLL with del17p. Mechanistically, ibrutinib interferes with B-cell receptor (BCR) signaling as well as multiple CLL cell-to-microenvironment interactions. Given the importance of ibrutinib in the management of CLL, a deeper understanding of factors governing sensitivity and resistance is warranted. Experimental Design: We studied 48 longitudinally sampled paired CLL samples, 42 of which were procured before and after standard CLL chemotherapies, and characterized them for well-studied CLL molecular traits as well as by whole-exome sequencing and SNP 6.0 array profiling. We exposed these samples to 0.25 to 5 μmol/L of ibrutinib ex vivo and measured apoptosis fractions as well as BCR signaling by immunoblotting. We disrupted TP53 in HG3, PGA1, and PG-EBV cell lines and measured BCR signaling and ibrutinib responses. Results: CLL samples demonstrated a surprisingly wide range of ex vivo sensitivities to ibrutinib, with IC50 values ranging from 0.4 to 9.7 μmol/L. Unmutated IGVH status, elevated ZAP70 expression, and trisomy 12 were associated with heightened sensitivity to ibrutinib treatment. Five CLL samples were substantially more resistant to ibrutinib following relapse from chemotherapy; of these, three had acquired a del17p/TP53-mutated status. A validation sample of 15 CLL carrying TP53 mutations, of which 13 carried both del17p and a TP53 mutation, confirmed substantially less sensitivity to ibrutinib-induced apoptosis. Conclusions: This study identifies that CLL harboring del17p/TP53-mutated cells are substantially less sensitive to ibrutinib-induced apoptosis than del17p/TP53 wild-type cells. Clin Cancer Res; 23(4); 1049-59. ©2016 American Association for Cancer Research.
The Florida manatee (Trichechus manatus latirostris) immunoglobulin heavy chain suggests the importance of clan III variable segments in repertoire diversity

USGS Publications Warehouse

Breaux, Breanna; Deiss, Thaddeus C.; Chen, Patricia L.; Cruz-Schneider, Maria Paula; Sena, Leonardo; Hunter, Margaret E.; Bonde, Robert K.; Criscitiello, Michael F.

2017-01-01

Manatees are a vulnerable, charismatic sentinel species from the evolutionarily divergent Afrotheria. Manatee health and resistance to infectious disease is of great concern to conservation groups, but little is known about their immune system. To develop manatee-specific tools for monitoring health, we first must have a general knowledge of how the immunoglobulin heavy (IgH) chain locus is organized and transcriptionally expressed. Using the genomic scaffolds of the Florida manatee (Trichechus manatus latirostris), we characterized the potential IgH segmental diversity and constant region isotypic diversity and performed the first Afrotherian repertoire analysis. The Florida manatee has low V(D)J combinatorial diversity (3744 potential combinations) and few constant region isotypes. They also lack clan III V segments, which may have caused reduced VH segment numbers. However, we found productive somatic hypermutation concentrated in the complementarity determining regions. In conclusion, manatees have limited IGHV clan and combinatorial diversity. This suggests that clan III V segments are essential for maintaining IgH locus diversity.
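The "potential combinations" figure quoted above is a product of functional segment counts: each V segment can in principle pair with each D and each J segment. A minimal Python sketch of that arithmetic follows. The abstract does not report the individual segment counts, so the values used here (26 V, 24 D, 6 J) are placeholders that are not taken from the study; they were picked only because their product equals the reported total of 3744.

```python
def combinatorial_diversity(n_v, n_d, n_j):
    """Number of distinct V-D-J pairings, ignoring junctional diversity."""
    return n_v * n_d * n_j


# Placeholder segment counts (NOT from the study), chosen so that the
# product matches the 3744 potential combinations quoted in the abstract.
example_total = combinatorial_diversity(n_v=26, n_d=24, n_j=6)
print(example_total)
```

This count covers combinatorial diversity only. Junctional additions and deletions, and the somatic hypermutation the authors describe, add diversity beyond this product.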
Junctional and allele-specific residues are critical for MERS-CoV neutralization by an exceptionally potent germline-like antibody

DOE PAGES

Ying, Tianlei; Prabakaran, Ponraj; Du, Lanying; ...

2015-09-15

The MERS-CoV is an emerging virus, which already infected more than 1,300 humans with high (~36%) mortality. Here, we show that m336, an exceptionally potent human anti-MERS-CoV antibody, is almost germline with only one somatic mutation in the heavy chain. The structure of Fab m336 in complex with the MERS-CoV receptor-binding domain reveals that its IGHV1-69-derived heavy chain provides more than 85% binding surface and that its epitope almost completely overlaps with the receptor-binding site. Analysis of antibodies from 69 healthy humans suggests an important role of the V(D)J recombination-generated junctional and allele-specific residues for achieving high affinity of binding at such low levels of somatic hypermutation. Our results also have important implications for development of vaccine immunogens based on the newly identified m336 epitope as well as for elucidation of mechanisms of neutralization by m336-like antibodies and their elicitation in vivo.
Activation-induced cytidine deaminase (AID) is strongly expressed in the fetal bovine ileal Peyer's patch and spleen and is associated with expansion of the primary antibody repertoire in the absence of exogenous antigens.

PubMed

Liljavirta, J; Ekman, A; Knight, J S; Pernthaner, A; Iivanainen, A; Niku, M

2013-09-01

Due to a limited range of immunoglobulin (Ig) genes, cattle and several other domestic animals rely on postrecombinatorial amplification of the primary repertoire. We report that activation-induced cytidine deaminase (AID) is strongly expressed in the fetal bovine ileal Peyer's patch and spleen but not in fetal bone marrow. The numbers of IGHV (immunoglobulin heavy chain variable) mutations correlate with AID expression. The mutational profile in the fetuses is similar to postnatal and immunized calves, with targeting of complementarity-determining region (CDR) over framework region (FR), preference of replacement over silent mutations in CDRs but not in FRs, and targeting of the AID hotspot motif RGYW/WRCY. Statistical analysis indicates negative selection on FRs and positive selection on CDRs. Our results suggest that AID-mediated somatic hypermutation and selection take place in bovine fetuses, implying a role for AID in the diversification of the primary antibody repertoire in the absence of exogenous antigens.
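The AID hotspot motif named above, RGYW and its reverse complement WRCY, uses IUPAC ambiguity codes (R = A or G, Y = C or T, W = A or T). As a rough illustration of how such motifs can be located in a nucleotide sequence, the following Python sketch scans a string with a regular expression. It is not the authors' analysis pipeline; the function name and the example sequence are hypothetical.

```python
import re

# RGYW -> [AG]G[CT][AT]; WRCY -> [AT][AG]C[CT].
# The lookahead lets overlapping motif occurrences all be reported.
HOTSPOT = re.compile(r"(?=([AG]G[CT][AT]|[AT][AG]C[CT]))")


def hotspot_positions(seq):
    """Return (0-based start, 4-mer) for every RGYW/WRCY occurrence in seq."""
    return [(m.start(), m.group(1)) for m in HOTSPOT.finditer(seq.upper())]


# Hypothetical toy sequence: AGCT satisfies both RGYW and WRCY.
print(hotspot_positions("ccagctcc"))
```

A mutation analysis would then compare how often observed mutations fall inside these positions versus elsewhere, which is a separate statistical step not shown here.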
The Florida manatee (Trichechus manatus latirostris) immunoglobulin heavy chain suggests the importance of clan III variable segments in repertoire diversity.

PubMed

Breaux, Breanna; Deiss, Thaddeus C; Chen, Patricia L; Cruz-Schneider, Maria Paula; Sena, Leonardo; Hunter, Margaret E; Bonde, Robert K; Criscitiello, Michael F

2017-07-01

Manatees are a vulnerable, charismatic sentinel species from the evolutionarily divergent Afrotheria. Manatee health and resistance to infectious disease is of great concern to conservation groups, but little is known about their immune system. To develop manatee-specific tools for monitoring health, we first must have a general knowledge of how the immunoglobulin heavy (IgH) chain locus is organized and transcriptionally expressed. Using the genomic scaffolds of the Florida manatee (Trichechus manatus latirostris), we characterized the potential IgH segmental diversity and constant region isotypic diversity and performed the first Afrotherian repertoire analysis. The Florida manatee has low V(D)J combinatorial diversity (3744 potential combinations) and few constant region isotypes. They also lack clan III V segments, which may have caused reduced VH segment numbers. However, we found productive somatic hypermutation concentrated in the complementarity determining regions. In conclusion, manatees have limited IGHV clan and combinatorial diversity. This suggests that clan III V segments are essential for maintaining IgH locus diversity. Copyright © 2017 Elsevier Ltd. All rights reserved.
FcγRIIb expression in early stage chronic lymphocytic leukemia.

PubMed

Bosch, Rosa; Mora, Alba; Vicente, Eva Puy; Ferrer, Gerardo; Jansà, Sonia; Damle, Rajendra; Gorlatov, Sergey; Rai, Kanti; Montserrat, Emili; Nomdedeu, Josep; Pratcorona, Marta; Blanco, Laura; Saavedra, Silvana; Garrido, Ana; Esquirol, Albert; Garcia, Irene; Granell, Miquel; Martino, Rodrigo; Delgado, Julio; Sierra, Jorge; Chiorazzi, Nicholas; Moreno, Carol

2017-11-01

In normal B-cells, B-cell antigen receptor (BCR) signaling can be negatively regulated by the low-affinity receptor FcγRIIb (CD32b). To better understand the role of FcγRIIb in chronic lymphocytic leukemia (CLL), we correlated its expression on 155 samples from newly-diagnosed Binet A patients with clinical characteristics and outcome. FcγRIIb expression was similar in normal B-cells and leukemic cells, and was heterogeneous among patients and within CLL clones. FcγRIIb expression did not correlate with well-known prognostic markers [disease stage, serum beta-2 microglobulin (B2M), IGHV mutational status, expression of ZAP-70 and CD38, and cytogenetics] except for a weak concordance with CD49d. Moreover, patients with low FcγRIIb expression (69/155, 44.5%) required therapy earlier than those with high FcγRIIb expression (86/155, 55.5%) (median 151.4 months vs. not reached; p=.071). These results encourage further investigation on the role of FcγRIIb in CLL biology and prognostic significance in larger series of patients.
Developmental pathway for potent V1V2-directed HIV-neutralizing antibodies

PubMed Central

Doria-Rose, Nicole A.; Schramm, Chaim A.; Gorman, Jason; Moore, Penny L.; Bhiman, Jinal N.; DeKosky, Brandon J.; Ernandes, Michael J.; Georgiev, Ivelin S.; Kim, Helen J.; Pancera, Marie; Staupe, Ryan P.; Altae-Tran, Han R.; Bailer, Robert T.; Crooks, Ema T.; Cupo, Albert; Druz, Aliaksandr; Garrett, Nigel J.; Hoi, Kam H.; Kong, Rui; Louder, Mark K.; Longo, Nancy S.; McKee, Krisha; Nonyane, Molati; O’Dell, Sijy; Roark, Ryan S.; Rudicell, Rebecca S.; Schmidt, Stephen D.; Sheward, Daniel J.; Soto, Cinque; Wibmer, Constantinos Kurt; Yang, Yongping; Zhang, Zhenhai; Mullikin, James C.; Binley, James M.; Sanders, Rogier W.; Wilson, Ian A.; Moore, John P.; Ward, Andrew B.; Georgiou, George; Williamson, Carolyn; Abdool Karim, Salim S.; Morris, Lynn; Kwong, Peter D.; Shapiro, Lawrence; Mascola, John R.

2015-01-01

Antibodies capable of neutralizing HIV-1 often target variable regions 1 and 2 (V1V2) of the HIV-1 envelope, but the mechanism of their elicitation has been unclear. Here we define the developmental pathway by which such antibodies are generated and acquire the requisite molecular characteristics for neutralization. Twelve somatically related neutralizing antibodies (CAP256-VRC26.01-12) were isolated from CAPRISA-donor CAP256; each antibody contained the protruding tyrosine-sulfated, anionic antigen-binding loop (CDR H3) characteristic of this category of antibodies. Their unmutated ancestor emerged between weeks 30–38 post-infection with a 35-residue CDR H3, and neutralized the virus that superinfected this individual 15 weeks after initial infection. Improved neutralization breadth occurred by week 59 with modest affinity maturation, and was preceded by extensive diversification of the virus population. HIV-1 V1V2-directed neutralizing antibodies can thus develop relatively rapidly through initial selection of B cells with a long CDR H3, and limited subsequent somatic hypermutation, an important vaccine insight. PMID:24590074
Analysis of chronic lymphotic leukemia transcriptomic profile: differences between molecular subgroups.

PubMed

Jantus Lewintre, Eloisa; Reinoso Martín, Cristina; Montaner, David; Marín, Miguel; José Terol, María; Farrás, Rosa; Benet, Isabel; Calvete, Juan J; Dopazo, Joaquín; García-Conde, Javier

2009-01-01

B cell chronic lymphocytic leukemia (CLL) is a lymphoproliferative disorder with a variable clinical course. Patients with an unmutated immunoglobulin heavy chain variable region (IgV(H)) gene show shorter progression-free and overall survival than patients with a mutated IgV(H) gene. In addition, BCL6 mutations identify a subgroup of patients with high risk of progression. Gene expression was analysed in 36 early-stage patients using high-density microarrays. Around 150 differentially expressed genes were found according to IgV(H) mutational status, whereas no difference was found according to BCL6 mutations. Functional profiling methods allowed us to distinguish KEGG and gene ontology terms showing coordinated gene expression changes across subgroups of CLL. We validated a set of differentially expressed genes according to IgV(H) status, scoring them as putative prognostic markers in CLL. Among them, CRY1, LPL, CD82 and DUSP22 performed at least as well as ZAP70, which is currently the most widely used surrogate marker of IgV(H) status.
Initial antibodies binding to HIV-1 gp41 in acutely infected subjects are polyreactive and highly mutated

PubMed Central

Chen, Xi; Munshaw, Supriya; Zhang, Ruijun; Marshall, Dawn J.; Vandergrift, Nathan; Whitesides, John F.; Lu, Xiaozhi; Yu, Jae-Sung; Hwang, Kwan-Ki; Gao, Feng; Markowitz, Martin; Heath, Sonya L.; Bar, Katharine J.; Goepfert, Paul A.; Montefiori, David C.; Shaw, George C.; Alam, S. Munir; Margolis, David M.; Denny, Thomas N.; Boyd, Scott D.; Marshal, Eleanor; Egholm, Michael; Simen, Birgitte B.; Hanczaruk, Bozena; Fire, Andrew Z.; Voss, Gerald; Kelsoe, Garnett; Tomaras, Georgia D.; Moody, M. Anthony; Kepler, Thomas B.

2011-01-01

The initial antibody response to HIV-1 is targeted to envelope (Env) gp41, and is nonneutralizing and ineffective in controlling viremia. To understand the origins and characteristics of gp41-binding antibodies produced shortly after HIV-1 transmission, we isolated and studied gp41-reactive plasma cells from subjects acutely infected with HIV-1. The frequencies of somatic mutations were relatively high in these gp41-reactive antibodies. Reverted unmutated ancestors of gp41-reactive antibodies derived from subjects acutely infected with HIV-1 frequently did not react with autologous HIV-1 Env; however, these antibodies were polyreactive and frequently bound to host or bacterial antigens. In one large clonal lineage of gp41-reactive antibodies, reactivity to HIV-1 Env was acquired only after somatic mutations. Polyreactive gp41-binding antibodies were also isolated from uninfected individuals. These data suggest that the majority of gp41-binding antibodies produced after acute HIV-1 infection are cross-reactive responses generated by stimulating memory B cells that have previously been activated by non–HIV-1 antigens. PMID:21987658
Trial and error: how the unclonable human mitochondrial genome was cloned in yeast.

PubMed

Bigger, Brian W; Liao, Ai-Yin; Sergijenko, Ana; Coutelle, Charles

2011-11-01

Development of a human mitochondrial gene delivery vector is a critical step in the ability to treat diseases arising from mutations in mitochondrial DNA. Although we have previously cloned the mouse mitochondrial genome in its entirety and developed it as a mitochondrial gene therapy vector, the human mitochondrial genome has been dubbed unclonable in E. coli, due to regions of instability in the D-loop and tRNA(Thr) gene. We tested multi- and single-copy vector systems for cloning human mitochondrial DNA in E. coli and Saccharomyces cerevisiae, including transformation-associated recombination. Human mitochondrial DNA is unclonable in E. coli and cannot be retained in multi- or single-copy vectors under any conditions. It was, however, possible to clone and stably maintain the entire human mitochondrial genome in yeast as long as a single-copy centromeric plasmid was used. D-loop and tRNA(Thr) were both stable and unmutated. This is the first report of cloning the entire human mitochondrial genome and the first step in developing a gene delivery vehicle for human mitochondrial gene therapy.
Somatic Hypermutation-Induced Changes in the Structure and Dynamics of HIV-1 Broadly Neutralizing Antibodies.

PubMed

Davenport, Thaddeus M; Gorman, Jason; Joyce, M Gordon; Zhou, Tongqing; Soto, Cinque; Guttman, Miklos; Moquin, Stephanie; Yang, Yongping; Zhang, Baoshan; Doria-Rose, Nicole A; Hu, Shiu-Lok; Mascola, John R; Kwong, Peter D; Lee, Kelly K

2016-08-02

Antibody somatic hypermutation (SHM) and affinity maturation enhance antigen recognition by modifying antibody paratope structure to improve its complementarity with the target epitope. SHM-induced changes in paratope dynamics may also contribute to antibody maturation, but direct evidence of this is limited. Here, we examine two classes of HIV-1 broadly neutralizing antibodies (bNAbs) for SHM-induced changes in structure and dynamics, and delineate the effects of these changes on interactions with the HIV-1 envelope glycoprotein (Env). In combination with new and existing structures of unmutated and affinity matured antibody Fab fragments, we used hydrogen/deuterium exchange with mass spectrometry to directly measure Fab structural dynamics. Changes in antibody structure and dynamics were positioned to improve complementarity with Env, with changes in dynamics primarily observed at the paratope peripheries. We conclude that SHM optimizes paratope complementarity to conserved HIV-1 epitopes and restricts the mobility of paratope-peripheral residues to minimize clashes with variable features on HIV-1 Env. Copyright © 2016 Elsevier Ltd. All rights reserved.
Antibody Light-Chain-Restricted Recognition of the Site of Immune Pressure in the RV144 HIV-1 Vaccine Trial Is Phylogenetically Conserved

DOE PAGES

Wiehe, Kevin; Easterhoff, David; Luo, Kan; ...

2014-11-29

In HIV-1, the ability to mount antibody responses to conserved, neutralizing epitopes is critical for protection. Here we have studied the light chain usage of human and rhesus macaque antibodies targeted to a dominant region of the HIV-1 envelope second variable (V2) region involving lysine (K) 169, the site of immune pressure in the RV144 vaccine efficacy trial. We found that humans and rhesus macaques used orthologous lambda variable gene segments encoding a glutamic acid-aspartic acid (ED) motif for K169 recognition. Structure determination of an unmutated ancestor antibody demonstrated that the V2 binding site was preconfigured for ED motif-mediated recognition prior to maturation. Thus, light chain usage for recognition of the site of immune pressure in the RV144 trial is highly conserved across species. In conclusion, these data indicate that the HIV-1 K169-recognizing ED motif has persisted over the diversification between rhesus macaques and humans, suggesting an evolutionary advantage of this antibody recognition mode.
Antibody Light-Chain-Restricted Recognition of the Site of Immune Pressure in the RV144 HIV-1 Vaccine Trial Is Phylogenetically Conserved

DOE Office of Scientific and Technical Information (OSTI.GOV)

Wiehe, Kevin; Easterhoff, David; Luo, Kan

In HIV-1, the ability to mount antibody responses to conserved, neutralizing epitopes is critical for protection. Here we have studied the light chain usage of human and rhesus macaque antibodies targeted to a dominant region of the HIV-1 envelope second variable (V2) region involving lysine (K) 169, the site of immune pressure in the RV144 vaccine efficacy trial. We found that humans and rhesus macaques used orthologous lambda variable gene segments encoding a glutamic acid-aspartic acid (ED) motif for K169 recognition. Structure determination of an unmutated ancestor antibody demonstrated that the V2 binding site was preconfigured for ED motif-mediated recognition prior to maturation. Thus, light chain usage for recognition of the site of immune pressure in the RV144 trial is highly conserved across species. In conclusion, these data indicate that the HIV-1 K169-recognizing ED motif has persisted over the diversification between rhesus macaques and humans, suggesting an evolutionary advantage of this antibody recognition mode.
A Human Anti-Insulin IgG Autoantibody Apparently Arises Through Clonal Selection from an Insulin-Specific “Germ-Line” Natural Antibody Template

PubMed Central

Ichiyoshi, Yuji; Zhou, Min; Casali, Paolo

2015-01-01

We analyzed the structural correlates underlying the insulin-dependent selection of the specific anti-insulin IgG1 κ mAb13-producing cell clone, derived from a patient with insulin-dependent diabetes mellitus treated with recombinant human insulin. First, we cloned the germ-line genes that putatively gave rise to the expressed VH and Vκ segments and used them to generate the full (unmutated) “germ-line revertant” of the “wild-type” (somatically mutated) mAb13, using recombinant PCR methods and an in vitro human Cγ1 and Cκ expression system. The full “germ-line revertant” bound insulin specifically and in a dose-saturable fashion, but with a relative avidity (Avrel) more than three-fold lower than that of its wild-type counterpart (Avrel, 1.69 × 10⁻⁸ vs 4.91 × 10⁻⁹ g/μl). Second, we established, by reassorting wild-type and germ-line revertant forms of the mAb13 VH and Vκ segments, that the increased Avrel for insulin of mAb13 when compared with its full “germ-line revertant” counterpart was entirely dependent on the mutations in the VH, not those in the Vκ chain. Third, we determined, by site-directed mutagenesis experiments, that of the three mutations in the mAb13 VH segment (Ser→Gly, Ser→Thr, and Ser→Arg at positions 31, 56, and 58, respectively), only Arg58 was crucial in increasing the mAb13 Avrel (from 1.44 × 10⁻⁸ to 5.14 × 10⁻⁹ g/μl) and affinity (Kd, from 189 to 59 nM) for insulin. The affinity enhancement mediated by the VH segment Arg58 residue reflected about a threefold decrease in dissociation rate constant (Koff, from 4.92 × 10⁻³ to 1.54 × 10⁻³ s⁻¹) but not an increase in association rate constant (Kon, from 2.60 × 10⁴ to 2.61 × 10⁴ M⁻¹ s⁻¹), and it contrasted with the complete loss of insulin binding resulting from the substitution of the VH segment Asn52 by Lys.
The present findings suggest that human insulin, a self Ag, has the potential to recruit a natural autoantibody-producing cell precursor expressing a specific surface receptor for Ag in unmutated configuration, and drive it through affinity maturation. They also show that binding of insulin by such a receptor can be enhanced or completely abrogated by a single amino acid change. PMID:7995943
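The kinetic constants quoted in this abstract are consistent with the standard relation Kd = Koff / Kon for a simple one-to-one binding model. The short Python check below reproduces the reported Kd values from the reported rate constants. The function and variable names are illustrative only, and the one-to-one model is an assumption of this sketch rather than something the abstract states.

```python
def kd_nanomolar(koff_per_s, kon_per_molar_s):
    """Equilibrium dissociation constant Kd = Koff / Kon, converted to nM."""
    return koff_per_s / kon_per_molar_s * 1e9


# Rate constants as quoted in the abstract (Koff in 1/s, Kon in 1/(M*s)).
kd_without_arg58 = kd_nanomolar(4.92e-3, 2.60e4)  # about 189 nM
kd_with_arg58 = kd_nanomolar(1.54e-3, 2.61e4)     # about 59 nM
koff_fold_change = 4.92e-3 / 1.54e-3              # about 3.2-fold slower dissociation

print(round(kd_without_arg58), round(kd_with_arg58), round(koff_fold_change, 1))
```

Since Kon is essentially unchanged, nearly all of the roughly threefold gain in affinity comes from the slower dissociation rate, which matches the abstract's interpretation.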
Combining Protein and Strain Engineering for the Production of Glyco-Engineered Horseradish Peroxidase C1A in Pichia pastoris

PubMed Central

Capone, Simona; Ćorajević, Lejla; Bonifert, Günther; Murth, Patrick; Maresch, Daniel; Altmann, Friedrich; Herwig, Christoph; Spadiut, Oliver

2015-01-01

Horseradish peroxidase (HRP), conjugated to antibodies and lectins, is widely used in medical diagnostics. Since recombinant production of the enzyme is difficult, HRP isolated from plant is used for these applications. Production in the yeast Pichia pastoris (P. pastoris), the most promising recombinant production platform to date, causes hyperglycosylation of HRP, which in turn complicates conjugation to antibodies and lectins. In this study we combined protein and strain engineering to obtain an active and stable HRP variant with reduced surface glycosylation. We combined four mutations, each being beneficial for either catalytic activity or thermal stability, and expressed this enzyme variant as well as the unmutated wildtype enzyme in both a P. pastoris benchmark strain and a strain where the native α-1,6-mannosyltransferase (OCH1) was knocked out. Considering productivity in the bioreactor as well as enzyme activity and thermal stability, the mutated HRP variant produced in the P. pastoris benchmark strain turned out to be interesting for medical diagnostics. This variant shows considerable catalytic activity and thermal stability and is less glycosylated, which might allow more controlled and efficient conjugation to antibodies and lectins. PMID:26404235
Functional anergy in a subpopulation of naive B cells from healthy humans that express autoreactive immunoglobulin receptors.

PubMed

Duty, J Andrew; Szodoray, Peter; Zheng, Nai-Ying; Koelsch, Kristi A; Zhang, Qingzhao; Swiatkowski, Mike; Mathias, Melissa; Garman, Lori; Helms, Christina; Nakken, Britt; Smith, Kenneth; Farris, A Darise; Wilson, Patrick C

2009-01-16

Self-reactive B cells not controlled by receptor editing or clonal deletion may become anergic. We report that fully mature human B cells negative for surface IgM and retaining only IgD are autoreactive and functionally attenuated (referred to as naive IgD(+)IgM(-) B cells [B(ND)]). These B(ND) cells typically make up 2.5% of B cells in the peripheral blood, have antibody variable region genes in germline (unmutated) configuration, and, by all current measures, are fully mature. Analysis of 95 recombinant antibodies expressed from the variable genes of single B(ND) cells demonstrated that they are predominantly autoreactive, binding to HEp-2 cell antigens and DNA. Upon B cell receptor cross-linkage, B(ND) cells have a reduced capacity to mobilize intracellular calcium or phosphorylate tyrosines, demonstrating that they are anergic. However, intense stimulation causes B(ND) cells to fully respond, suggesting that these cells could be the precursors of autoantibody secreting plasma cells in autoimmune diseases such as systemic lupus erythematosus or rheumatoid arthritis. This is the first identification of a distinct mature human B cell subset that is naturally autoreactive and controlled by the tolerizing mechanism of functional anergy.
Structural insights into the interaction of human IgG1 with FcγRI: no direct role of glycans in binding

DOE Office of Scientific and Technical Information (OSTI.GOV)

Oganesyan, Vaheh, E-mail: oganesyanv@medimmune.com; Mazor, Yariv; Yang, Chunning

In an effort to identify the critical structural features responsible for the high-affinity interaction of IgG1 Fc with FcγRI, the structure of the corresponding complex was solved at a resolution of 2.4 Å. The three-dimensional structure of a human IgG1 Fc fragment bound to wild-type human FcγRI is reported. The structure of the corresponding complex was solved at a resolution of 2.4 Å using molecular replacement; this is the highest resolution achieved for an unmutated FcγRI molecule. This study highlights the critical structural and functional role played by the second extracellular subdomain of FcγRI. It also explains the long-known major energetic contribution of the Fc ‘LLGG’ motif at positions 234–237, and particularly of Leu235, via a ‘lock-and-key’ mechanism. Finally, a previously held belief is corrected and a differing view is offered on the recently proposed direct role of Fc carbohydrates in the corresponding interaction. Structural evidence is provided that such glycan-related effects are strictly indirect.

Chronic lymphocytic leukaemia

PubMed Central

Kipps, Thomas J.; Stevenson, Freda K.; Wu, Catherine J.; Croce, Carlo M.; Packham, Graham; Wierda, William G.; O’Brien, Susan; Gribben, John; Rai, Kanti

2017-01-01

Chronic lymphocytic leukaemia (CLL) is a malignancy of CD5+ B cells that is characterized by the accumulation of small, mature-appearing lymphocytes in the blood, marrow and lymphoid tissues. Signalling via surface immunoglobulin, which constitutes the major part of the B cell receptor, and several genetic alterations play a part in CLL pathogenesis, in addition to interactions between CLL cells and other cell types, such as stromal cells, T cells and nurse-like cells in the lymph nodes. The clinical progression of CLL is heterogeneous and ranges from patients who require treatment soon after diagnosis to others who do not require therapy for many years, if at all. Several factors, including the immunoglobulin heavy-chain variable region gene (IGHV) mutational status, genomic changes, patient age and the presence of comorbidities, should be considered when defining the optimal management strategies, which include chemotherapy, chemoimmunotherapy and/or drugs targeting B cell receptor signalling or inhibitors of apoptosis, such as BCL-2. Research on the biology of CLL has profoundly enhanced our ability to identify patients who are at higher risk for disease progression and our capacity to treat patients with drugs that selectively target distinctive phenotypic or physiological features of CLL. How these and other advances have shaped our current understanding and treatment of patients with CLL is the subject of this Primer. PMID:28102226
Characterization of Escherichia coli K1 colominic acid-specific murine antibodies that are cross-protective against Neisseria meningitidis groups B, C, and Y.

PubMed

Park, In Ho; Lin, Jisheng; Choi, Ji Eun; Shin, Jeon-Soo

2014-06-01

The capsular polysaccharide (PS) of Neisseria meningitidis serogroup B (NMGB) is α(2-8)-linked N-acetylneuraminic acid (Neu5Ac), which is almost identical to the O-acetylated colominic acid (CA) of Escherichia coli K1. Although E. coli K1 has long been known to elicit cross-protective antibodies against NMGB, limited information on these highly cross-reactive antibodies is available. In the present study, six new monoclonal antibodies (mAbs) specific to both E. coli K1 CA and NMGB PS were produced by immunizing Balb/c mice with E. coli K1, and their serological and molecular properties were characterized, together with 12 previously reported hybridoma mAbs. Among the bactericidal mAbs against NMGB, both HmenB5 and HmenB18, which are genetically identical though of different mouse origins, were able to kill serogroup C and Y meningococci. Based on SPR sensorgrams, the binding affinity of HmenB18 for PS was suggested to be associated with at least two different binding forces: the polyanionicity of Neu5Ac and an interaction with the O-acetyl groups of Neu5Ac. Molecular analysis showed that similar to most mAbs presenting a few restricted V region germline genes, the V region genes of HmenB18 were 97.9% and 98.6% identical to the closest IGHV1-14*01 and IGLV15-103*01 germline gene alleles, respectively, and V-D-J editing in this mAb generated an unusually long VH-CDR3 sequence (17 amino acid residues), containing one basic arginine, two hydrophobic isoleucine residues and a 'YAMDY' motif. Models of the mAb combining sites demonstrate that most of the mAbs exhibited a wide, shallow groove with a high overall positive charge, as seen in mAb735, which is specific for a polyanionic helical epitope. In contrast, the combining site of HmenB18 was shown to be wide but to present a relatively weak positive charge, consistent with the extensive recognition by HmenB18 of the various structural epitopes formed with the Neu5Ac residue and its O-acetylation.
Copyright © 2014 Elsevier Ltd. All rights reserved.
Structure of an N276-Dependent HIV-1 Neutralizing Antibody Targeting a Rare V5 Glycan Hole Adjacent to the CD4 Binding Site.

PubMed

Wibmer, Constantinos Kurt; Gorman, Jason; Anthony, Colin S; Mkhize, Nonhlanhla N; Druz, Aliaksandr; York, Talita; Schmidt, Stephen D; Labuschagne, Phillip; Louder, Mark K; Bailer, Robert T; Abdool Karim, Salim S; Mascola, John R; Williamson, Carolyn; Moore, Penny L; Kwong, Peter D; Morris, Lynn

2016-11-15

All HIV-1-infected individuals develop strain-specific neutralizing antibodies to their infecting virus, which in some cases mature into broadly neutralizing antibodies. Defining the epitopes of strain-specific antibodies that overlap conserved sites of vulnerability might provide mechanistic insights into how broadly neutralizing antibodies arise. We previously described an HIV-1 clade C-infected donor, CAP257, who developed broadly neutralizing plasma antibodies targeting an N276 glycan-dependent epitope in the CD4 binding site. The initial CD4 binding site response potently neutralized the heterologous tier 2 clade B viral strain RHPA, which was used to design resurfaced gp120 antigens for single-B-cell sorting. Here we report the isolation and structural characterization of CAP257-RH1, an N276 glycan-dependent CD4 binding site antibody representative of the early CD4 binding site plasma response in donor CAP257. The cocrystal structure of CAP257-RH1 bound to RHPA gp120 revealed critical interactions with the N276 glycan, loop D, and V5, but not with aspartic acid 368, similarly to HJ16 and 179NC75. The CAP257-RH1 monoclonal antibody was derived from the immunoglobulin-variable IGHV3-33 and IGLV3-10 genes and neutralized RHPA but not the transmitted/founder virus from donor CAP257. Its narrow neutralization breadth was attributed to a binding angle that was incompatible with glycosylated V5 loops present in almost all HIV-1 strains, including the CAP257 transmitted/founder virus. Deep sequencing of autologous CAP257 viruses, however, revealed minority variants early in infection that lacked V5 glycans. These glycan-free V5 loops are unusual holes in the glycan shield that may have been necessary for initiating this N276 glycan-dependent CD4 binding site B-cell lineage. The conserved CD4 binding site on gp120 is a major target for HIV-1 vaccine design, but key events in the elicitation and maturation of different antibody lineages to this site remain elusive. 
Studies have shown that strain-specific antibodies can evolve into broadly neutralizing antibodies or in some cases act as helper lineages. Therefore, characterizing the epitopes of strain-specific antibodies may help to inform the design of HIV-1 immunogens to elicit broadly neutralizing antibodies. In this study, we isolate a narrowly neutralizing N276 glycan-dependent antibody and use X-ray crystallography and viral deep sequencing to describe how gp120 lacking glycans in V5 might have elicited these early glycan-dependent CD4 binding site antibodies. These data highlight how glycan holes can play a role in the elicitation of B-cell lineages targeting the CD4 binding site. Copyright © 2016 Wibmer et al.
Structure of an N276-Dependent HIV-1 Neutralizing Antibody Targeting a Rare V5 Glycan Hole Adjacent to the CD4 Binding Site

DOE Office of Scientific and Technical Information (OSTI.GOV)

Wibmer, Constantinos Kurt; Gorman, Jason; Anthony, Colin S.

ABSTRACT All HIV-1-infected individuals develop strain-specific neutralizing antibodies to their infecting virus, which in some cases mature into broadly neutralizing antibodies. Defining the epitopes of strain-specific antibodies that overlap conserved sites of vulnerability might provide mechanistic insights into how broadly neutralizing antibodies arise. We previously described an HIV-1 clade C-infected donor, CAP257, who developed broadly neutralizing plasma antibodies targeting an N276 glycan-dependent epitope in the CD4 binding site. The initial CD4 binding site response potently neutralized the heterologous tier 2 clade B viral strain RHPA, which was used to design resurfaced gp120 antigens for single-B-cell sorting. Here we report the isolation and structural characterization of CAP257-RH1, an N276 glycan-dependent CD4 binding site antibody representative of the early CD4 binding site plasma response in donor CAP257. The cocrystal structure of CAP257-RH1 bound to RHPA gp120 revealed critical interactions with the N276 glycan, loop D, and V5, but not with aspartic acid 368, similarly to HJ16 and 179NC75. The CAP257-RH1 monoclonal antibody was derived from the immunoglobulin-variable IGHV3-33 and IGLV3-10 genes and neutralized RHPA but not the transmitted/founder virus from donor CAP257. Its narrow neutralization breadth was attributed to a binding angle that was incompatible with glycosylated V5 loops present in almost all HIV-1 strains, including the CAP257 transmitted/founder virus. Deep sequencing of autologous CAP257 viruses, however, revealed minority variants early in infection that lacked V5 glycans. These glycan-free V5 loops are unusual holes in the glycan shield that may have been necessary for initiating this N276 glycan-dependent CD4 binding site B-cell lineage.
IMPORTANCE The conserved CD4 binding site on gp120 is a major target for HIV-1 vaccine design, but key events in the elicitation and maturation of different antibody lineages to this site remain elusive. Studies have shown that strain-specific antibodies can evolve into broadly neutralizing antibodies or in some cases act as helper lineages. Therefore, characterizing the epitopes of strain-specific antibodies may help to inform the design of HIV-1 immunogens to elicit broadly neutralizing antibodies. In this study, we isolate a narrowly neutralizing N276 glycan-dependent antibody and use X-ray crystallography and viral deep sequencing to describe how gp120 lacking glycans in V5 might have elicited these early glycan-dependent CD4 binding site antibodies. These data highlight how glycan holes can play a role in the elicitation of B-cell lineages targeting the CD4 binding site.
Structure of an N276-Dependent HIV-1 Neutralizing Antibody Targeting a Rare V5 Glycan Hole Adjacent to the CD4 Binding Site

PubMed Central

Wibmer, Constantinos Kurt; Gorman, Jason; Anthony, Colin S.; Mkhize, Nonhlanhla N.; Druz, Aliaksandr; York, Talita; Schmidt, Stephen D.; Labuschagne, Phillip; Louder, Mark K.; Bailer, Robert T.; Abdool Karim, Salim S.; Mascola, John R.; Williamson, Carolyn; Moore, Penny L.

2016-01-01

ABSTRACT All HIV-1-infected individuals develop strain-specific neutralizing antibodies to their infecting virus, which in some cases mature into broadly neutralizing antibodies. Defining the epitopes of strain-specific antibodies that overlap conserved sites of vulnerability might provide mechanistic insights into how broadly neutralizing antibodies arise. We previously described an HIV-1 clade C-infected donor, CAP257, who developed broadly neutralizing plasma antibodies targeting an N276 glycan-dependent epitope in the CD4 binding site. The initial CD4 binding site response potently neutralized the heterologous tier 2 clade B viral strain RHPA, which was used to design resurfaced gp120 antigens for single-B-cell sorting. Here we report the isolation and structural characterization of CAP257-RH1, an N276 glycan-dependent CD4 binding site antibody representative of the early CD4 binding site plasma response in donor CAP257. The cocrystal structure of CAP257-RH1 bound to RHPA gp120 revealed critical interactions with the N276 glycan, loop D, and V5, but not with aspartic acid 368, similarly to HJ16 and 179NC75. The CAP257-RH1 monoclonal antibody was derived from the immunoglobulin-variable IGHV3-33 and IGLV3-10 genes and neutralized RHPA but not the transmitted/founder virus from donor CAP257. Its narrow neutralization breadth was attributed to a binding angle that was incompatible with glycosylated V5 loops present in almost all HIV-1 strains, including the CAP257 transmitted/founder virus. Deep sequencing of autologous CAP257 viruses, however, revealed minority variants early in infection that lacked V5 glycans. These glycan-free V5 loops are unusual holes in the glycan shield that may have been necessary for initiating this N276 glycan-dependent CD4 binding site B-cell lineage. 
IMPORTANCE The conserved CD4 binding site on gp120 is a major target for HIV-1 vaccine design, but key events in the elicitation and maturation of different antibody lineages to this site remain elusive. Studies have shown that strain-specific antibodies can evolve into broadly neutralizing antibodies or in some cases act as helper lineages. Therefore, characterizing the epitopes of strain-specific antibodies may help to inform the design of HIV-1 immunogens to elicit broadly neutralizing antibodies. In this study, we isolate a narrowly neutralizing N276 glycan-dependent antibody and use X-ray crystallography and viral deep sequencing to describe how gp120 lacking glycans in V5 might have elicited these early glycan-dependent CD4 binding site antibodies. These data highlight how glycan holes can play a role in the elicitation of B-cell lineages targeting the CD4 binding site. PMID:27581986
Self-reactive VH4-34–expressing IgG B cells recognize commensal bacteria

PubMed Central

Glauzy, Salomé; Ng, Yen-Shing; Chamberlain, Nicolas; Massad, Christopher; Isnardi, Isabelle; Uzel, Gulbu; Holland, Steven M.; Picard, Capucine

2017-01-01

The germline immunoglobulin (Ig) variable heavy chain 4–34 (VH4-34) gene segment encodes in humans intrinsically self-reactive antibodies that recognize I/i carbohydrates expressed by erythrocytes with a specific motif in their framework region 1 (FWR1). VH4-34–expressing clones are common in the naive B cell repertoire but are rarely found in IgG memory B cells from healthy individuals. In contrast, CD27+IgG+ B cells from patients genetically deficient for IRAK4 or MYD88, which mediate the function of Toll-like receptors (TLRs) except TLR3, contained VH4-34–expressing clones and showed decreased somatic hypermutation frequencies. In addition, VH4-34–encoded IgGs from IRAK4- and MYD88-deficient patients often displayed an unmutated FWR1 motif, revealing that these antibodies still recognize I/i antigens, whereas their healthy donor counterparts harbored FWR1 mutations abolishing self-reactivity. However, this paradoxical self-reactivity correlated with these VH4-34–encoded IgG clones binding commensal bacteria antigens. Hence, B cells expressing germline-encoded self-reactive VH4-34 antibodies may represent an innate-like B cell population specialized in the containment of commensal bacteria when gut barriers are breached. PMID:28500047
Effector Gene Suites in Some Soil Isolates of Fusarium oxysporum Are Not Sufficient Predictors of Vascular Wilt in Tomato.

PubMed

Jelinski, Nicolas A; Broz, Karen; Jonkers, Wilfried; Ma, Li-Jun; Kistler, H Corby

2017-07-01

Seventy-four Fusarium oxysporum soil isolates were assayed for known effector genes present in an F. oxysporum f. sp. lycopersici race 3 tomato wilt strain (FOL MN-25) obtained from the same fields in Manatee County, Florida. Based on the presence or absence of these genes, four haplotypes were defined, two of which represented 96% of the surveyed isolates. These two most common effector haplotypes contained either all or none of the assayed race 3 effector genes. We hypothesized that soil isolates with all surveyed effector genes, similar to FOL MN-25, would be pathogenic toward tomato, whereas isolates lacking all effectors would be nonpathogenic. However, inoculation experiments revealed that presence of the effector genes alone was not sufficient to ensure pathogenicity on tomato. Interestingly, a nonpathogenic isolate containing the full suite of unmutated effector genes (FOS 4-4) appears to have undergone a chromosomal rearrangement yet remains vegetatively compatible with FOL MN-25. These observations confirm the highly dynamic nature of the F. oxysporum genome and support the conclusion that pathogenesis among free-living populations of F. oxysporum is a complex process. Therefore, the presence of effector genes alone may not be an accurate predictor of pathogenicity among soil isolates of F. oxysporum.
Pleural/pericardic effusions during dasatinib treatment: incidence, management and risk factors associated to their development.

PubMed

Breccia, Massimo; Alimena, Giuliana

2010-09-01

Despite the beneficial effect of imatinib treatment in chronic myeloid leukemia patients, some patients develop resistance and/or intolerance and need a switch to second-generation tyrosine kinase inhibitors. Dasatinib is indicated for chronic myeloid leukemia patients with resistance or intolerance to imatinib; it is 325-fold more potent than imatinib and is active in both mutated and unmutated resistant patients. Pleural/pericardic effusions are frequent complications during treatment with dasatinib, and usually are reported to require dose reduction or drug discontinuation. Changing the dasatinib regimen from 70 mg twice daily to 100 mg once daily reduces the risk of pleural effusions. In this article, we review the incidence of the phenomenon observed in different dasatinib trials (Phase I-III) and the currently suggested management. We also describe the identified pathogenetic mechanisms related to their development and discuss the associated risk factors. The aim of this paper is to provide healthcare professionals with clear guidance on the management of pleural effusions associated with dasatinib treatment. Recommendations are based on the published data and clinical experience from a number of different centers. Evidence from the literature indicates that, with adequate management and monitoring of patients with predisposing factors, pleural effusions can be easily managed.
The novel anticancer agent JNJ-26854165 is active in chronic myeloid leukemic cells with unmutated BCR/ABL and T315I mutant BCR/ABL through promoting proteosomal degradation of BCR/ABL proteins.

PubMed

You, Liangshun; Liu, Hui; Huang, Jian; Xie, Wanzhuo; Wei, Jueying; Ye, Xiujin; Qian, Wenbin

2017-01-31

Chronic myeloid leukemia (CML) is a clonal malignant disease caused by the expression of BCR/ABL. MDM2 (human homolog of the murine double minute-2) inhibitors such as Nutlin-3 have been shown to induce apoptosis in a p53-dependent manner in CML cells and sensitize cells to Imatinib. Here, we demonstrate that JNJ-26854165, an inhibitor of MDM2, inhibits proliferation and triggers cell death in a p53-independent manner in various BCR/ABL-expressing cells, which include primary leukemic cells from patients with CML blast crisis and cells expressing the Imatinib-resistant T315I BCR/ABL mutant. The response to JNJ-26854165 is associated with the downregulation of BCR/ABL in a manner dependent on proteasome activation. Moreover, in all tested CML cells, with the exception of T315I-mutant cells, combining JNJ-26854165 and the tyrosine kinase inhibitor (TKI) Imatinib or PD180970 leads to a synergistic effect. In conclusion, our results suggest that JNJ-26854165, used either alone or in combination with TKIs, represents a promising novel targeted approach to overcome TKI resistance and improve patient outcome in CML.
Expression of CALR mutants causes mpl-dependent thrombocytosis in zebrafish.

PubMed

Lim, K-H; Chang, Y-C; Chiang, Y-H; Lin, H-C; Chang, C-Y; Lin, C-S; Huang, L; Wang, W-T; Gon-Shen Chen, C; Chou, W-C; Kuo, Y-Y

2016-10-07

CALR mutations are identified in about 30% of JAK2/MPL-unmutated myeloproliferative neoplasms (MPNs) including essential thrombocythemia (ET) and primary myelofibrosis. Although the molecular pathogenesis of CALR mutations leading to MPNs has been studied using in vitro cell line models, how mutant CALR may affect developmental hematopoiesis remains unknown. Here we took advantage of the zebrafish model to examine the effects of mutant CALR on early hematopoiesis and model human CALR-mutated MPNs. We identified three zebrafish genes orthologous to human CALR, referred to as calr, calr3a and calr3b. The expression of CALR-del52 and CALR-ins5 mutants caused an increase in the hematopoietic stem/progenitor cells followed by thrombocytosis without affecting normal angiogenesis. The expression of CALR mutants also perturbed early developmental hematopoiesis in zebrafish. Importantly, morpholino knockdown of mpl but not epor or csf3r could significantly attenuate the effects of mutant CALR. Furthermore, the expression of mutant CALR caused jak-stat signaling activation in zebrafish that could be blocked by JAK inhibitors (ruxolitinib and fedratinib). These findings showed that mutant CALR activates jak-stat signaling through an mpl-dependent mechanism to mediate pathogenic thrombopoiesis in zebrafish, and illustrated that the signaling machinery related to mutant CALR tumorigenesis is conserved between human and zebrafish.
Morphologic identification of atypical chronic lymphocytic leukemia by digital microscopy.

PubMed

Marionneaux, S; Maslak, P; Keohane, E M

2014-08-01

Atypical chronic lymphocytic leukemia (aCLL) is a morphologic variant found in approximately 25% of patients with chronic lymphocytic leukemia (CLL). Although aCLL has a more aggressive course compared to typical CLL (tCLL), it is not usually reported. This retrospective study used digital microscopy to morphologically classify CLL patients as aCLL or tCLL, and determined the prevalence of prognostic markers in each group. CellaVision AB (Lund, Sweden) was used to evaluate lymphocyte morphology on archived blood films of 97 CLL patients, and results of their prognostic marker analysis at diagnosis were obtained. The unpaired t-test, Chi-square, or Fisher's Exact test were used for statistical analysis. 27% of CLL cases were morphologically classified as aCLL. The aCLL group had a higher prevalence of trisomy 12, unmutated IgVH, and CD38 expression (markers associated with poor prognosis), and a lower prevalence of 13q14 deletions compared to tCLL; this was statistically significant. Using digital imaging to identify aCLL is feasible, economical, and may provide clinically relevant prognostic information at diagnosis and during periodic monitoring. Further study of a larger number of patients is needed to assess the clinical utility of reporting aCLL morphology. © 2013 John Wiley & Sons Ltd.
Overexpressed BAG3 is a potential therapeutic target in chronic lymphocytic leukemia.

PubMed

Zhu, Huayuan; Wu, Wei; Fu, Yuan; Shen, Wenyi; Miao, Kourong; Hong, Min; Xu, Wei; Young, Ken H; Liu, Peng; Li, Jianyong

2014-03-01

Bcl-2-associated athanogene 3 (BAG3), a member of the BAG family, has been shown to sustain cell survival and underlie resistance to chemotherapy in human neoplastic cells. We aimed to determine the exact role and underlying mechanisms of BAG3 in human chronic lymphocytic leukemia (CLL). One hundred human CLL samples and 20 normal B-cell samples from healthy controls were collected. We measured the BAG3 expression in these cells and explored its relationship with known prognostic factors for CLL. The roles of BAG3 in cell apoptosis and migration were evaluated by small interfering RNA-mediated knockdown of BAG3 in primary CLL cells. We showed that BAG3 expression level was increased in CLL cells compared with normal B cells. Moreover, BAG3 expression was particularly upregulated in CD38 positive, unmutated immunoglobulin heavy-chain patients and those with lymphadenopathy and/or splenomegaly. Importantly, patients with increased BAG3 expression level have poor overall survival in subgroups with positive ZAP-70 or those without any "p53 abnormality". In addition, knockdown of BAG3 expression resulted in an increased apoptotic ratio and decreased migration in primary CLL cells. Our data indicate that BAG3 is a marker of poor prognosis in specific subgroups of CLL patients and may be a potential therapeutic target for this disease.
Calreticulin mutation-specific immunostaining in myeloproliferative neoplasms: pathogenetic insight and diagnostic value

PubMed Central

Vannucchi, A M; Rotunno, G; Bartalucci, N; Raugei, G; Carrai, V; Balliu, M; Mannarelli, C; Pacilli, A; Calabresi, L; Fjerza, R; Pieri, L; Bosi, A; Manfredini, R; Guglielmelli, P

2014-01-01

Mutations in the gene calreticulin (CALR) occur in the majority of JAK2- and MPL-unmutated patients with essential thrombocythemia (ET) and primary myelofibrosis (PMF); identifying CALR mutations contributes to the diagnostic pathway of ET and PMF. CALR mutations are heterogeneous and span exon 9, but all result in a novel common protein C terminus. We developed a polyclonal antibody against a 17-amino-acid peptide derived from mutated calreticulin that was used for immunostaining of bone marrow biopsies. We show that this antibody specifically recognized patients harboring different types of CALR mutation with no staining in healthy controls and JAK2- or MPL-mutated ET and PMF. The labeling was mostly localized in megakaryocytes, whereas myeloid and erythroid cells showed faint staining, suggesting a preferential expression of calreticulin in megakaryocytes. Megakaryocytic-restricted expression of calreticulin was also demonstrated using an antibody against wild-type calreticulin and by measuring the levels of calreticulin RNA by gene expression analysis. Immunostaining using an antibody specific for mutated calreticulin may become a rapid, simple and cost-effective method for identifying CALR-mutated patients complementing molecular analysis; furthermore, the labeling pattern supports the preferential expansion of megakaryocytic cell lineage as a result of CALR mutation in an immature hematopoietic stem cell. PMID:24618731
PARP1 expression, activity and ex vivo sensitivity to the PARP inhibitor, talazoparib (BMN 673), in chronic lymphocytic leukaemia

PubMed Central

Herriott, Ashleigh; Tudhope, Susan J.; Junge, Gesa; Rodrigues, Natalie; Patterson, Miranda J.; Woodhouse, Laura; Lunec, John; Hunter, Jill E.; Mulligan, Evan A.; Cole, Michael; Allinson, Lisa M.; Wallis, Jonathan P.; Marshall, Scott; Wang, Evelyn; Curtin, Nicola J.; Willmore, Elaine

2015-01-01

In chronic lymphocytic leukemia (CLL), mutation and loss of p53 and ATM abrogate DNA damage signalling and predict poorer response and shorter survival. We hypothesised that poly (ADP-ribose) polymerase (PARP) activity, which is crucial for repair of DNA breaks induced by oxidative stress or chemotherapy, may be an additional predictive biomarker and a target for therapy with PARP inhibitors. We measured PARP activity in 109 patient-derived CLL samples, which varied widely (192 – 190052 pmol PAR/10⁶ cells) compared to that seen in healthy volunteer lymphocytes (2451 – 7519 pmol PAR/10⁶ cells). PARP activity was associated with PARP1 protein expression and endogenous PAR levels. PARP activity was not associated with p53 or ATM loss, Binet stage, IGHV mutational status or survival, but correlated with Bcl-2 and Rel A (an NF-kB subunit). Levels of 8-hydroxy-2′-deoxyguanosine in DNA (a marker of oxidative damage) were not associated with PAR levels or PARP activity. The potent PARP inhibitor, talazoparib (BMN 673), inhibited CD40L-stimulated proliferation of CLL cells at nM concentrations, independently of Binet stage or p53/ATM function. PARP activity is highly variable in CLL and correlates with stress-induced proteins. Proliferating CLL cells (including those with p53 or ATM loss) are highly sensitive to the PARP inhibitor talazoparib. PMID:26539646
Immunostimulatory oligonucleotide-induced metaphase cytogenetics detect chromosomal aberrations in 80% of CLL patients: A study of 132 CLL cases with correlation to FISH, IgVH status, and CD38 expression.

PubMed

Dicker, Frank; Schnittger, Susanne; Haferlach, Torsten; Kern, Wolfgang; Schoch, Claudia

2006-11-01

Compared with fluorescence in situ hybridization (FISH), conventional metaphase cytogenetics play only a minor prognostic role in chronic lymphocytic leukemia (CLL) so far, due to technical problems resulting from limited proliferation of CLL cells in vitro. Here, we present a simple method for in vitro stimulation of CLL cells that overcomes this limitation. In our unselected patient population, 125 of 132 cases could be successfully stimulated for metaphase generation by culture with the immunostimulatory CpG-oligonucleotide DSP30 plus interleukin 2. Of 125 cases, 101 showed chromosomal aberrations. The aberration rate is comparable to the rate detected by parallel interphase FISH. In 47 patients, conventional cytogenetics detected additional aberrations not detected by FISH analysis. A complex aberrant karyotype, defined as one having at least 3 aberrations, was detected in 30 of 125 patients, compared with only one such case as defined by FISH. Conventional cytogenetics frequently detected balanced and unbalanced translocations. A significant correlation of the poor-prognosis unmutated IgV(H) status with unbalanced translocations and of the likewise poor-prognosis CD38 expression to balanced translocations and complex aberrant karyotype was found. We demonstrate that FISH analysis underestimates the complexity of chromosomal aberrations in CLL. Therefore, conventional cytogenetics may define subgroups of patients with high risk of progression.
Circulating CD21low B cells in common variable immunodeficiency resemble tissue homing, innate-like B cells

PubMed Central

Rakhmanov, Mirzokhid; Keller, Baerbel; Gutenberger, Sylvia; Foerster, Christian; Hoenig, Manfred; Driessen, Gertjan; van der Burg, Mirjam; van Dongen, Jacques J.; Wiech, Elisabeth; Visentini, Marcella; Quinti, Isabella; Prasse, Antje; Voelxen, Nadine; Salzer, Ulrich; Goldacker, Sigune; Fisch, Paul; Eibel, Hermann; Schwarz, Klaus; Peter, Hans-Hartmut; Warnatz, Klaus

2009-01-01

The homeostasis of circulating B cell subsets in the peripheral blood of healthy adults is well regulated, but in disease it can be severely disturbed. Thus, a subgroup of patients with common variable immunodeficiency (CVID) presents with an extraordinary expansion of an unusual B cell population characterized by the low expression of CD21. CD21low B cells are polyclonal, unmutated IgM+IgD+ B cells but carry a highly distinct gene expression profile which differs from conventional naïve B cells. Interestingly, while clearly not representing a memory population, they do share several features with the recently defined memory-like tissue, Fc receptor-like 4 positive B cell population in the tonsils of healthy donors. CD21low B cells show signs of previous activation and proliferation in vivo, while exhibiting defective calcium signaling and poor proliferation in response to B cell receptor stimulation. CD21low B cells express decreased amounts of homeostatic but increased levels of inflammatory chemokine receptors. This might explain their preferential homing to peripheral tissues like the bronchoalveolar space of CVID or the synovium of rheumatoid arthritis patients. Therefore, as a result of the close resemblance to the gene expression profile, phenotype, function and preferential tissue homing of murine B1 B cells, we suggest that CD21low B cells represent a human innate-like B cell population. PMID:19666505
Prognostic relevance of oxidative stress measurement in chronic lymphocytic leukaemia.

PubMed

D'Arena, Giovanni; Vitale, Candida; Perbellini, Omar; Coscia, Marta; La Rocca, Francesco; Ruggieri, Vitalba; Visco, Carlo; Di Minno, Nicola Matteo Dario; Innocenti, Idanna; Pizza, Vincenzo; Deaglio, Silvia; Di Minno, Giovanni; Giudice, Aldo; Calapai, Gioacchino; Musto, Pellegrino; Laurenti, Luca; Iorio, Eugenio Luigi

2017-10-01

This study evaluated the prognostic significance of oxidative stress (OS) and antioxidant defence status measurement in patients with chronic lymphocytic leukaemia (CLL). The d-ROMs test and BAP test were performed at diagnosis in 165 patients with CLL and correlated with clinical-biological features and prognosis. Increased oxidative damage (d-ROMs test) and reduced antioxidant potential (BAP test) were found in CLL patients compared with normal controls (P<.0001). CLL patients with higher d-ROMs values had higher numbers of circulating white blood cells and lymphocytes, and higher values of β2-microglobulin. Higher d-ROMs values were found in female patients (P=.0003), in patients with unmutated IgVH (P=.04), unfavourable cytogenetics (P=.002) and more advanced clinical stage (P=.002). Higher BAP test values were found in patients expressing CD49d (P=.01) and with more advanced clinical stage (P=.004). At a median follow-up of 48 months, CLL patients with d-ROMs ≥418 CARR U were found to have a shorter time to first treatment (TFT) (P=.0002), and a reduced survival (P=.006) than others. CLL patients with BAP test values ≥2155 μmol/L had a shorter TFT (P=.008) and a shorter survival (P=.003). OS can affect CLL patients by concomitantly increasing reactive oxygen metabolites production and decreasing antioxidant defences. © 2017 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
MFI ratio estimation of ZAP-70 in B-CLL by flow cytometry can be improved by considering the isotype-matched antibody signal.

PubMed

Marquez, M-E; Deglesne, P-A; Suarez, G; Romano, E

2011-04-01

The IgV(H) mutational status of B-cell chronic lymphocytic leukemia (B-CLL) is of prognostic value. Expression of ZAP-70 in B-CLL is a surrogate marker for IgV(H) unmutated (UM). As determination of IgV(H) mutational status involves a methodology currently unavailable for most clinical laboratories, it is important to have available a reliable technique for ZAP-70 estimation in B-CLL. Flow cytometry (FC) is a convenient technique for this purpose. However, there is still no adequate way for data analysis, which would prevent the assignment of false positive or negative expression. We have modified the currently most accepted technique, which uses the ratio of the mean fluorescent index (MFI) of B-CLL to T cells. The MFI for parallel antibody isotype staining is subtracted from the ZAP-70 MFI of both B-CLL and T cells. We validated this technique comparing the results obtained for ZAP-70 expression by FC with those obtained with quantitative PCR for the same patients. We applied the technique in a series of 53 patients. With this modification, a better correlation between ZAP-70 expression and IgV(H) UM was obtained. Thus, the MFI ratio B-CLL/T cell corrected by isotype is a reliable analysis technique to estimate ZAP-70 expression in B-CLL. © 2010 Blackwell Publishing Ltd.
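The isotype correction described in the abstract above can be sketched as a small calculation. This is only an illustration of the stated idea (subtract the isotype-control MFI from the ZAP-70 MFI of both B-CLL and T cells, then take the ratio); the function name, argument names, and sample values are hypothetical, and the abstract gives no positivity cut-off, so none is applied here.

```python
def corrected_mfi_ratio(zap70_bcll, isotype_bcll, zap70_t, isotype_t):
    """Isotype-corrected ZAP-70 MFI ratio of B-CLL cells to T cells.

    All four arguments are mean fluorescence values (hypothetical names);
    the isotype-control signal is subtracted from each ZAP-70 signal
    before the B-CLL/T-cell ratio is taken.
    """
    bcll_signal = zap70_bcll - isotype_bcll
    t_signal = zap70_t - isotype_t
    if t_signal <= 0:
        # Without a positive T-cell reference signal the ratio is meaningless.
        raise ValueError("T-cell ZAP-70 signal must exceed its isotype control")
    return bcll_signal / t_signal


# Made-up illustrative values, not data from the study.
uncorrected = 120.0 / 450.0
corrected = corrected_mfi_ratio(120.0, 20.0, 450.0, 50.0)
print(f"uncorrected={uncorrected:.3f} corrected={corrected:.3f}")
```

With these made-up numbers the correction changes the ratio, which shows how background staining could shift a sample relative to whatever threshold a laboratory uses.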
Lower Prevalence of Alzheimer's Disease among Tibetans: Association with Religious and Genetic Factors.

PubMed

Huang, Fukai; Shang, Ying; Luo, Yuandai; Wu, Peng; Huang, Xue; Tan, Xiaohui; Lu, Xingyi; Zhen, Lifang; Hu, Xianda

2016-01-01

The prevalence of dementia differs among racial groups, the highest prevalence being in Latin America (8.5%) compared to sub-Saharan African regions (2-4%). The most common type of dementia is Alzheimer's disease (AD). This study aimed to estimate the prevalence of AD in the Qinghai-Tibet plateau and to investigate the related factors. This was a cross-sectional, multistage cluster sampling design survey. Data were collected from May 2014 to September 2014 from 4,060 Tibetans aged >60 years. Participants underwent clinical examinations and neuropsychological evaluations. MALDI-TOF was used to test the genotypes of CLU, TFAM, TP53INP1, IGHV1-67, CR1, ApoE, and BIN1. Logistic regression models were used to ascertain the associations with AD. The prevalence of AD among Tibetan individuals aged >60 years was 1.33% (95% CI: 0.98-1.69). The CLU haplotypes AA+GA (odds ratio (OR) = 4.483; 95% CI: 1.069-18.792) of rs2279590 were correlated with AD. The CLU haplotypes GG+GC (OR = 0.184; 95% CI: 0.038-0.888) of rs9331888 and kowtow (OR = 0.203; 95% CI 0.046-0.896) were negatively correlated with AD. A low prevalence of AD was found in Tibetans from the Qinghai-Tibet plateau. Multivariate analysis might suggest that regular "mind-body" religious meditative activities may be negatively associated with AD in this population, as well as the CLU genotype at rs9331888.
Distinct Activities of Glycolytic Enzymes Identify Chronic Lymphocytic Leukemia Patients with a more Aggressive Course and Resistance to Chemo-Immunotherapy.

PubMed

Gdynia, Georg; Robak, Tadeusz; Kopitz, Jürgen; Heller, Anette; Grekova, Svetlana; Duglova, Katarina; Laukemper, Gloria; Heinzel-Gutenbrunner, Monika; Gutenbrunner, Cornelius; Roth, Wilfried; Ho, Anthony D; Schirmacher, Peter; Schmitt, Michael; Dreger, Peter; Sellner, Leopold

2018-06-05

A higher capacity to grow under hypoxic conditions can lead to a more aggressive behavior of tumor cells. Determining tumor activity under hypoxia may identify chronic lymphocytic leukemia (CLL) with aggressive clinical course and predict response to chemo-immunotherapy (CIT). A metabolic score was generated by determining pyruvate kinase and lactate dehydrogenase, key enzymes of glycolysis, ex vivo in primary CLL samples under normoxic and hypoxic conditions. This score was further correlated with clinical endpoints and response to CIT in 96 CLL patients. 45 patients were classified as metabolic high risk (HR), 51 as low risk (LR). Treatment-free survival (TFS) was significantly shorter in HR patients (median 394 vs 723 days, p = .021). 15 HR patients and 14 LR patients received CIT after sample acquisition. HR patients had a significantly shorter progression-free survival after treatment compared to LR patients (median 216 days vs not reached, p = .008). Multivariate analysis evaluating age, IGHV, TP53 deletion or mutation and 11q22-23 deletion besides the capacity of tumor cells to grow under severe hypoxic conditions identified the metabolic profile as the strongest independent risk factor for shorter TFS (hazard ratio 2.37, p = .011). The metabolic risk can provide prognostic and predictive information complementary to genetic biomarkers and identify patients who might benefit from alternative treatment approaches. Copyright © 2018 The Authors. Published by Elsevier B.V. All rights reserved.

Structural insights into humanization of anti-tissue factor antibody 10H10.

PubMed

Teplyakov, Alexey; Obmolova, Galina; Malia, Thomas J; Raghunathan, Gopalan; Martinez, Christian; Fransson, Johan; Edwards, Wilson; Connor, Judith; Husovsky, Matthew; Beck, Heena; Chi, Ellen; Fenton, Sandra; Zhou, Hong; Almagro, Juan Carlos; Gilliland, Gary L

Murine antibody 10H10 raised against human tissue factor is unique in that it blocks the signaling pathway, and thus inhibits angiogenesis and tumor growth without interfering with coagulation. As a potential therapeutic, the antibody was humanized in a two-step procedure. Antigen-binding loops were grafted onto selected human frameworks and the resulting chimeric antibody was subjected to affinity maturation by using phage display libraries. The results of humanization were analyzed from the structural perspective through comparison of the structure of a humanized variant with the parental mouse antibody. This analysis revealed several hot spots in the framework region that appear to affect antigen binding, and therefore should be considered in human germline selection. In addition, some positions in the Vernier zone, e.g., residue 71 in the heavy chain, that are traditionally thought to be crucial appear to tolerate amino acid substitutions without any effect on binding. Several humanized variants were produced using both short and long forms of complementarity-determining region (CDR) H2 following the difference in the Kabat and Martin definitions. Comparison of such pairs indicated consistently higher thermostability of the variants with short CDR H2. Analysis of the binding data in relation to the structures singled out the ImMunoGeneTics information system® germline IGHV1-2*01 as dubious owing to two potentially destabilizing mutations as compared to the other alleles of the same germline and to other human germlines.
A clinical-molecular prognostic model to predict survival in patients with post polycythemia vera and post essential thrombocythemia myelofibrosis.

PubMed

Passamonti, F; Giorgino, T; Mora, B; Guglielmelli, P; Rumi, E; Maffioli, M; Rambaldi, A; Caramella, M; Komrokji, R; Gotlib, J; Kiladjian, J J; Cervantes, F; Devos, T; Palandri, F; De Stefano, V; Ruggeri, M; Silver, R T; Benevolo, G; Albano, F; Caramazza, D; Merli, M; Pietra, D; Casalone, R; Rotunno, G; Barbui, T; Cazzola, M; Vannucchi, A M

2017-12-01

Polycythemia vera (PV) and essential thrombocythemia (ET) are myeloproliferative neoplasms with variable risk of evolution into post-PV and post-ET myelofibrosis, from now on referred to as secondary myelofibrosis (SMF). No specific tools have been defined for risk stratification in SMF. To develop a prognostic model for predicting survival, we studied 685 JAK2, CALR, and MPL annotated patients with SMF. Median survival of the whole cohort was 9.3 years (95% CI: 8-not reached-NR-). Through penalized Cox regressions we identified negative predictors of survival and according to beta risk coefficients we assigned 2 points to hemoglobin level <11 g/dl, to circulating blasts ⩾3%, and to CALR-unmutated genotype, 1 point to platelet count <150 × 10^9/l and to constitutional symptoms, and 0.15 points to any year of age. Myelofibrosis Secondary to PV and ET-Prognostic Model (MYSEC-PM) allocated SMF patients into four risk categories with different survival (P<0.0001): low (median survival NR; 133 patients), intermediate-1 (9.3 years, 95% CI: 8.1-NR; 245 patients), intermediate-2 (4.4 years, 95% CI: 3.2-7.9; 126 patients), and high risk (2 years, 95% CI: 1.7-3.9; 75 patients). Finally, we found that the MYSEC-PM represents the most appropriate tool for SMF decision-making to be used in clinical and trial settings.
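The point assignment listed in the abstract above can be written out as simple arithmetic. The sketch below only sums the points as stated there; the function and parameter names are hypothetical, and because the abstract does not give the score cut-offs that separate the four risk categories, the sketch deliberately stops at the raw score.

```python
def mysec_pm_score(hemoglobin_g_dl, blasts_pct, calr_unmutated,
                   platelets_e9_per_l, constitutional_symptoms, age_years):
    """Sum MYSEC-PM points as described in the abstract (names are illustrative)."""
    score = 0.0
    if hemoglobin_g_dl < 11:        # 2 points: hemoglobin < 11 g/dl
        score += 2
    if blasts_pct >= 3:             # 2 points: circulating blasts >= 3%
        score += 2
    if calr_unmutated:              # 2 points: CALR-unmutated genotype
        score += 2
    if platelets_e9_per_l < 150:    # 1 point: platelets < 150 x 10^9/l
        score += 1
    if constitutional_symptoms:     # 1 point: constitutional symptoms
        score += 1
    score += 0.15 * age_years       # 0.15 points per year of age
    return score


# Hypothetical patient: Hb 10 g/dl, 1% blasts, CALR-unmutated,
# platelets 200 x 10^9/l, no constitutional symptoms, 60 years old.
print(round(mysec_pm_score(10.0, 1.0, True, 200.0, False, 60), 2))
```

For this hypothetical patient the score is 2 (hemoglobin) + 2 (CALR-unmutated) + 0.15 × 60 (age) = 13. Mapping a score to a risk category would require the thresholds from the full publication.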
Chromosome aberrations detected by conventional karyotyping using novel mitogens in chronic lymphocytic leukemia: Clinical and biologic correlations.

PubMed

Rigolin, Gian Matteo; del Giudice, Ilaria; Formigaro, Luca; Saccenti, Elena; Martinelli, Sara; Cavallari, Maurizio; Lista, Enrico; Tammiso, Elisa; Volta, Eleonora; Lupini, Laura; Bassi, Cristian; Bardi, Antonella; Sofritti, Olga; Daghia, Giulia; Cavazzini, Francesco; Marinelli, Marilisa; Tavolaro, Simona; Guarini, Anna; Negrini, Massimo; Foà, Robin; Cuneo, Antonio

2015-12-01

To clarify whether karyotype aberrations (KA) involving regions not covered by the standard fluorescence in situ hybridization (FISH) panel have independent prognostic relevance, we evaluated KA by conventional cytogenetics in a learning cohort (LC; n = 166) and a validation cohort (VC; n = 250) of untreated chronic lymphocytic leukemia (CLL) patients. In the VC, novel mitogens were used to improve metaphase generation and TP53, NOTCH1, and SF3B1 mutations were assessed. KA undetected by FISH were found in 35 and 35% of the cases in the LC and VC, respectively. In addition to FISH, KA allowed reclassification of 23 and 26% of cases in the LC and VC, respectively, into a higher cytogenetic risk group. By multivariate analysis, both in the LC and VC, KA other than isolated 13q deletion correlated with a shorter time to first treatment (TFT; P < 0.001 and 0.003, respectively), while a complex karyotype predicted a worse overall survival (OS, P = 0.015 and 0.010, respectively). In the VC, where a comprehensive biologic assessment was performed, a shorter TFT was also predicted by stage (P < 0.001), IGHV mutational status (P = 0.05), and del(17p)/TP53 mutations (P = 0.033) while stage (P = 0.023) and del(17p)/TP53 mutations (P = 0.024) independently predicted a shorter OS. FISH results did not independently impact on TFT and OS, in the LC and VC cohorts; this was also the case for NOTCH1 and SF3B1 mutations in the VC. We suggest that in CLL, conventional karyotyping with novel mitogens could be more effective than FISH for the detection of KA allowing for a more precise refinement of prognosis. © 2015 Wiley Periodicals, Inc.
B lymphocyte "original sin" in the bone marrow enhances islet autoreactivity in type 1 diabetes-prone nonobese diabetic mice.

PubMed

Henry-Bonami, Rachel A; Williams, Jonathan M; Rachakonda, Amita B; Karamali, Mariam; Kendall, Peggy L; Thomas, James W

2013-06-15

Effective central tolerance is required to control the large extent of autoreactivity normally present in the developing B cell repertoire. Insulin-reactive B cells are required for type 1 diabetes in the NOD mouse, because engineered mice lacking this population are protected from disease. The Cg-Tg(Igh-6/Igh-V125)2Jwt/JwtJ (VH125Tg) model is used to define this population, which is found with increased frequency in the periphery of NOD mice versus nonautoimmune C57BL/6 VH125Tg mice; however, the ontogeny of this disparity is unknown. To better understand the origins of these pernicious B cells, anti-insulin B cells were tracked during development in the polyclonal repertoire of VH125Tg mice. An increased proportion of insulin-binding B cells is apparent in NOD mice at the earliest point of Ag commitment in the bone marrow. Two predominant L chains were identified in B cells that bind heterologous insulin. Interestingly, Vκ4-57-1 polymorphisms that confer a CDR3 Pro-Pro motif enhance self-reactivity in VH125Tg/NOD mice. Despite binding circulating autoantigen in vivo, anti-insulin B cells transition from the parenchyma to the sinusoids in the bone marrow of NOD mice and enter the periphery unimpeded. Anti-insulin B cells expand at the site of autoimmune attack in the pancreas and correlate with increased numbers of IFN-γ-producing cells in the repertoire. These data identify the failure to cull autoreactive B cells in the bone marrow as the primary source of anti-insulin B cells in NOD mice and suggest that dysregulation of central tolerance permits their escape into the periphery to promote disease.
Development of the neonatal B and T cell repertoire in swine: implications for comparative and veterinary immunology.

PubMed

Butler, John E; Sinkora, Marek; Wertz, Nancy; Holtmeier, Wolfgang; Lemke, Caitlin D

2006-01-01

Birth in all higher vertebrates is at the center of the critical window of development in which newborns transition from dependence on innate immunity to dependence on their own adaptive immunity, with passive maternal immunity bridging this transition. Therefore we have studied immunological development through fetal and early neonatal life. In swine, B cells appear earlier in fetal development than T cells. B cell development begins in the yolk sac at the 20th day of gestation (DG20), progresses to fetal liver at DG30 and after DG45 continues in bone marrow. The first wave of developing T cells is gammadelta cells expressing a monomorphic Vdelta rearrangement. Thereafter, alphabeta T cells predominate and at birth, at least 19 TRBV subgroups are expressed, 17 of which appear highly homologous with those in humans. In contrast to the T cell repertoire and unlike humans and mice, the porcine pre-immune VH (IGHV-D-J) repertoire is highly restricted, depending primarily on CDR3 for diversity. The V-KAPPA (IGKV-J) repertoire and apparently also the V-LAMBDA (IGLV-J) repertoire, are also restricted. Diversification of the pre-immune B cell repertoire of swine and the ability to respond to both T-dependent and T-independent antigen depend on colonization of the gut after birth, in which colonizing bacteria stimulate with Toll-like receptor ligands, especially bacterial DNA. This may explain the link between repertoire diversification and the anatomical location of primary lymphoid tissue like the ileal Peyer's patches. Improper development of adaptive immunity can be caused by infectious agents like the porcine reproductive and respiratory syndrome virus that causes immune dysregulation resulting in immunological injury and autoimmunity.
New evidence for positive selection helps explain the paternal age effect observed in achondroplasia

PubMed Central

Shinde, Deepali N.; Elmer, Dominik P.; Calabrese, Peter; Boulanger, Jérôme; Arnheim, Norman; Tiemann-Boege, Irene

2013-01-01

There are certain de novo germline mutations associated with genetic disorders whose mutation rates per generation are orders of magnitude higher than the genome average. Moreover, these mutations occur exclusively in the male germ line and older men have a higher probability of having an affected child than younger ones, known as the paternal age effect (PAE). The classic example of a genetic disorder exhibiting a PAE is achondroplasia, caused predominantly by a single-nucleotide substitution (c.1138G>A) in FGFR3. To elucidate what mechanisms might be driving the high frequency of this mutation in the male germline, we examined the spatial distribution of the c.1138G>A substitution in a testis from an 80-year-old unaffected man. Using a technology based on bead-emulsion amplification, we were able to measure mutation frequencies in 192 individual pieces of the dissected testis with a false-positive rate lower than 2.7 × 10^-6. We observed that most mutations are clustered in a few pieces with 95% of all mutations occurring in 27% of the total testis. Using computational simulations, we rejected the model proposing an elevated mutation rate per cell division at this nucleotide site. Instead, we determined that the observed mutation distribution fits a germline selection model, where mutant spermatogonial stem cells have a proliferative advantage over unmutated cells. Combined with data on several other PAE mutations, our results support the idea that the PAE, associated with a number of Mendelian disorders, may be explained primarily by a selective mechanism. PMID:23740942
Comprehensive genetic characterization of CLL: a study on 506 cases analysed with chromosome banding analysis, interphase FISH, IgV(H) status and immunophenotyping.

PubMed

Haferlach, C; Dicker, F; Schnittger, S; Kern, W; Haferlach, T

2007-12-01

In CLL, data from chromosome banding analysis (CBA) have been scarce due to the low proliferative activity of CLL cells in vitro. We improved the cultivation technique using an immunostimulatory CpG-oligonucleotide DSP30 and IL-2. A total of 506 CLL samples were analysed with CBA and interphase FISH using probes for the detection of trisomy 12, IgH rearrangements and deletions of 6q21, 11q22.3 (ATM), 13q14 (D13S25 and D13S319) and 17p13 (TP53). A total of 500 of 506 (98.8%) cases were successfully stimulated for metaphase generation and are subject to this study. Aberrations were detected in 415 of 500 (83.0%) cases by CBA and in 392 of 500 (78.4%) cases by FISH. CBA detected 832 abnormalities and FISH only 502. Therefore, CBA offers important information in addition to FISH. (1) CLL is characterized mainly by genomic imbalances and reciprocal translocations are rare. (2) A subgroup with complex aberrant karyotype (16.4%) is identified which is associated with an unmutated IgV(H) status and CD38 expression (P=0.034 and 0.02, respectively). (3) Additional abnormalities are detectable providing new biological insights into different CLL subclasses revealing a much more heterogeneous pattern of cytogenetic abnormalities than assumed so far based on FISH data only. Therefore, prospective clinical trials should evaluate the prognostic impact of newly available CBA data.
Influenza virus resistance to human neutralizing antibodies.

PubMed

Crowe, James E

2012-01-01

The human antibody repertoire has an exceptionally large capacity to recognize new or changing antigens through combinatorial and junctional diversity established at the time of V(D)J recombination and through somatic hypermutation. Influenza viruses exhibit a relentless capacity to escape the human antibody response by altering the amino acids of their surface proteins in hypervariable domains that exhibit a high level of structural plasticity. Both parties in this high-stakes game of shape shifting drive structural evolution of their functional proteins (the B cell receptor/antibody on one side and the viral hemagglutinin and neuraminidase proteins on the other) using error-prone polymerase systems. It is likely that most of the genetic mutations that occur in these systems are deleterious, resulting in the failure of the B cell or virus with mutations to propagate in the immune repertoire or viral quasispecies. A subset of mutations is tolerated in functional surface proteins that enter the B cell or virus progeny pool. In both cases, selection occurs in the population of mutated and unmutated species. In cases where the functional avidity of the B cell receptor is increased significantly, that clone may be selected for preferential expansion. In contrast, an influenza virus that "escapes" the inhibitory effect of secreted antibodies may represent a high proportion of the progeny virus in that host. The recent paper by O'Donnell et al. [C. D. O'Donnell et al., mBio 3(3):e00120-12, 2012] identifies a mechanism for antibody resistance that does not require escape from binding but rather achieves a greater efficiency in replication.
Analysis of IgV gene mutations in B cell chronic lymphocytic leukaemia according to antigen-driven selection identifies subgroups with different prognosis and usage of the canonical somatic hypermutation machinery.

PubMed

Degan, Massimo; Bomben, Riccardo; Bo, Michele Dal; Zucchetto, Antonella; Nanni, Paola; Rupolo, Maurizio; Steffan, Agostino; Attadia, Vincenza; Ballerini, Pier Ferruccio; Damiani, Daniela; Pucillo, Carlo; Poeta, Giovanni Del; Colombatti, Alfonso; Gattei, Valter

2004-07-01

Cases of B-cell chronic lymphocytic leukaemia (B-CLL) with mutated (M) IgV(H) genes have a better prognosis than unmutated (UM) cases. We analysed the IgV(H) mutational status of B-CLL according to the features of a canonical somatic hypermutation (SHM) process, correlating this data with survival. In a series of 141 B-CLLs, 124 cases were examined for IgV(H) gene per cent mutations and skewing of replacement/silent mutations in the framework/complementarity-determining regions as evidence of antigen-driven selection; this identified three B-CLL subsets: significantly mutated (sM), with evidence of antigen-driven selection, not significantly mutated (nsM) and UM, without such evidence and IgV(H) gene per cent mutations above or below the 2% cut-off. sM B-CLL patients had longer survival within the good prognosis subgroup that had more than 2% mutations of IgV(H) genes. sM, nsM and UM B-CLL were also characterized for the biased usage of IgV(H) families, intraclonal IgV(H) gene diversification, preference of mutations to target-specific nucleotides or hotspots, and for the expression of enzymes involved in SHM (translesion DNA polymerase zeta and eta and activation-induced cytidine deaminase). These findings indicate the activation of a canonical SHM process in nsM and sM B-CLLs and underscore the role of the antigen in defining the specific clinical and biological features of B-CLL.
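The 2% cut-off mentioned in the abstract above can be illustrated with a minimal calculation of per cent mutation against a germline sequence. This is a sketch under stated assumptions: the sequences are assumed to be already aligned and of equal length, the function names are hypothetical, and treating exactly 2% as unmutated is an assumption based on the abstract's wording ("more than 2% mutations"). It does not attempt the replacement/silent skewing analysis used to separate the sM and nsM subsets.

```python
def percent_mutation(germline, observed):
    """Per cent of positions differing between two pre-aligned sequences."""
    if len(germline) != len(observed) or not germline:
        raise ValueError("sequences must be non-empty and of equal length")
    mismatches = sum(1 for g, o in zip(germline, observed) if g != o)
    return 100.0 * mismatches / len(germline)


def ighv_status(pct_mutation, cutoff=2.0):
    """Label a clone relative to the cut-off (boundary handling is assumed)."""
    return "mutated" if pct_mutation > cutoff else "unmutated"


# Toy 100-nucleotide sequences, not real IGHV genes.
germline = "ACGT" * 25
one_change = "T" + germline[1:]
three_changes = list(germline)
for pos in (0, 4, 8):
    three_changes[pos] = "T"
three_changes = "".join(three_changes)

print(ighv_status(percent_mutation(germline, one_change)))
print(ighv_status(percent_mutation(germline, three_changes)))
```

In practice the germline reference would come from an alignment against a curated germline database, and insertions or deletions would need handling that this sketch omits.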
Bruton tyrosine kinase inhibition in chronic lymphocytic leukemia.

PubMed

Maddocks, Kami; Jones, Jeffrey A

2016-04-01

Chronic lymphocytic leukemia (CLL) is the most common adult leukemia and remains incurable outside of the setting of allogeneic stem cell transplant. While the standard therapy for both initial and relapsed CLL has traditionally included monoclonal antibody therapy in combination with chemotherapy, there are patients with high-risk disease features including unmutated IgVH, del(11q22) and del(17p13) that are associated with poor overall responses to these therapies with short time to relapse and shortened overall survival. Additionally, many of these therapies have a high rate of infectious toxicity in a population already at increased risk. Targeting the B-cell receptor (BCR) signaling pathway has emerged as a promising therapeutic advance in a variety of B-cell malignancies, including CLL. Bruton agammaglobulinemia tyrosine kinase (Btk) is a tyrosine kinase in the BCR pathway critical to the survival of both normal and malignant B cells and inhibition of this kinase has been shown to block the progression of CLL. Ibrutinib, a first in class oral inhibitor of Btk, has shown promise as a very effective agent in the treatment of CLL (in both relapsed and upfront therapy, alone and in combination with other therapies, and in patients of all-risk disease), which has led to its approval in relapsed CLL and as frontline therapy in patients with the high-risk del(17p13) disease. Several studies are ongoing to evaluate the efficacy and safety of ibrutinib in combination with chemotherapy as frontline treatment for CLL and investigation into newer-generation Btk inhibitors is also underway. Copyright © 2016 Elsevier Inc. All rights reserved.
Systemic mastocytosis in adults: 2013 update on diagnosis, risk stratification, and management.

PubMed

Pardanani, Animesh

2013-07-01

Systemic mastocytosis (SM) results from a clonal proliferation of abnormal mast cells (MC) in one or more extracutaneous organs. The major criterion is presence of multifocal clusters of morphologically abnormal MC in the bone marrow. Minor diagnostic criteria include elevated serum tryptase level, abnormal MC expression of CD25 and/or CD2, and presence of KITD816V. The 2008 World Health Organization (WHO) classification of SM has been shown to be prognostically relevant. Classification of SM patients into indolent SM (ISM), aggressive SM (ASM), SM associated with a clonal non-MC lineage disease (SM-AHNMD) and mast cell leukemia (MCL) subgroups is a useful first step in establishing prognosis. SM treatment is generally palliative. ISM patients have a normal life expectancy and receive symptom-directed therapy; infrequently, cytoreductive therapy may be indicated for refractory symptoms. ASM patients have disease-related organ dysfunction; interferon-α (±corticosteroids) can control dermatological, hematological, gastrointestinal, skeletal, and mediator-release symptoms, but is hampered by poor tolerability. Similarly, cladribine has broad therapeutic activity, with particular utility when rapid MC debulking is indicated; the main toxicity is myelosuppression. Imatinib has a therapeutic role in the presence of an imatinib-sensitive KIT mutation or in KITD816-unmutated patients. Treatment of SM-AHNMD is governed primarily by the non-MC neoplasm; hydroxyurea has modest utility in this setting. Dasatinib's in vitro anti-KITD816V activity has not translated into significant therapeutic activity in most SM patients. In contrast, recently updated data confirm midostaurin's significant anti-MC activity in patients with advanced SM. Copyright © 2013 Wiley Periodicals, Inc.
Systemic mastocytosis in adults: 2012 Update on diagnosis, risk stratification, and management.

PubMed

Pardanani, Animesh

2012-04-01

Systemic mastocytosis (SM) results from a clonal proliferation of abnormal mast cells (MC) in one or more extra-cutaneous organs. The major criterion is presence of multifocal clusters of morphologically abnormal MC in the bone marrow. Minor diagnostic criteria include elevated serum tryptase level, abnormal MC expression of CD25 and/or CD2, and presence of KITD816V. The prognostic relevance of the 2008 World Health Organization (WHO) classification of SM has recently been confirmed. Classification of SM patients into indolent SM (ISM), aggressive SM (ASM), SM associated with a clonal non-MC lineage disease (SM-AHNMD) and mast cell leukemia (MCL) subgroups is a useful first step in establishing prognosis. SM treatment is generally palliative. ISM patients have a normal life expectancy and receive symptom-directed therapy; infrequently, cytoreductive therapy may be indicated for refractory symptoms. ASM patients have disease-related organ dysfunction; interferon-α (±corticosteroids) can control dermatological, hematological, gastrointestinal, skeletal, and mediator-release symptoms, but is hampered by poor tolerability. Similarly, cladribine has broad therapeutic activity, with particular utility when rapid MC debulking is indicated; the main toxicity is myelosuppression. Imatinib has a therapeutic role in the presence of an imatinib-sensitive KIT mutation or in KITD816-unmutated patients. Treatment of SM-AHNMD is governed primarily by the non-MC neoplasm; hydroxyurea has modest utility in this setting. Dasatinib's in vitro anti-KITD816V activity has not translated into significant therapeutic activity in most SM patients. In contrast, preliminary data suggest that midostaurin may produce significant decreases in MC burden in some patients. Copyright © 2012 Wiley Periodicals, Inc.
Systemic mastocytosis in adults: 2015 update on diagnosis, risk stratification, and management.

PubMed

Pardanani, Animesh

2015-03-01

Systemic mastocytosis (SM) results from a clonal proliferation of abnormal mast cells (MC) in one or more extracutaneous organs. The major criterion is presence of multifocal clusters of morphologically abnormal MC in the bone marrow. Minor diagnostic criteria include elevated serum tryptase level, abnormal MC expression of CD25 and/or CD2, and presence of KITD816V. The 2008 World Health Organization classification of SM has been shown to be prognostically relevant. Classification of SM patients into indolent SM (ISM), aggressive SM (ASM), SM associated with a clonal non-MC lineage disease (SM-AHNMD), and mast cell leukemia (MCL) subgroups is a useful first step in establishing prognosis. SM treatment is generally palliative. ISM patients have a normal life expectancy and receive symptom-directed therapy; infrequently, cytoreductive therapy may be indicated for refractory symptoms. ASM patients have disease-related organ dysfunction; interferon-α (+/-corticosteroids) can control dermatological, hematological, gastrointestinal, skeletal, and mediator-release symptoms, but is hampered by poor tolerability. Similarly, cladribine has broad therapeutic activity, with particular utility when rapid MC debulking is indicated; the main toxicity is myelosuppression. Imatinib has a therapeutic role in the presence of an imatinib-sensitive KIT mutation or in KITD816-unmutated patients. Treatment of SM-AHNMD is governed primarily by the non-MC neoplasm; hydroxyurea has modest utility in this setting; there is a role for allogeneic stem cell transplantation in select cases. Investigational Drugs: Recent data confirms midostaurin's significant anti-MC activity in patients with advanced SM. © 2015 Wiley Periodicals, Inc.
Systemic mastocytosis in adults: 2011 update on diagnosis, risk stratification, and management.

PubMed

Pardanani, Animesh

2011-04-01

Systemic mastocytosis (SM) results from a clonal proliferation of abnormal mast cells (MC) in one or more extracutaneous organs. The major criterion is presence of multifocal clusters of morphologically abnormal MC in the bone marrow. Minor diagnostic criteria include elevated serum tryptase level, abnormal MC expression of CD25 and/or CD2, and presence of KITD816V. The prognostic relevance of the 2008 World Health Organization (WHO) classification of SM has recently been confirmed. Classification of SM patients into indolent SM (ISM), aggressive SM (ASM), SM associated with a clonal non-MC lineage disease (SM-AHNMD), and mast cell leukemia (MCL) subgroups is a useful first step in establishing prognosis. SM treatment is generally palliative. ISM patients have a normal life expectancy and receive symptom-directed therapy; infrequently, cytoreductive therapy may be indicated for refractory symptoms. ASM patients have disease-related organ dysfunction; interferon-α (±corticosteroids) can control dermatological, hematological, gastrointestinal, skeletal, and mediator-release symptoms, but is hampered by poor tolerability. Similarly, cladribine has broad therapeutic activity, with particular utility when rapid MC debulking is indicated; the main toxicity is myelosuppression. Imatinib has a therapeutic role in the presence of an imatinib-sensitive KIT mutation or in KITD816-unmutated patients. Treatment of SM-AHNMD is governed primarily by the non-MC neoplasm; hydroxyurea has modest utility in this setting. Dasatinib's in vitro anti-KITD816V activity has not translated into significant therapeutic activity in most SM patients. In contrast, preliminary data suggest that midostaurin may produce significant decreases in MC burden in some patients. Copyright © 2011 Wiley-Liss, Inc.
Mechanisms of proton relay and product release by Class A β-lactamase at ultrahigh resolution

DOE Office of Scientific and Technical Information (OSTI.GOV)

Lewandowski, Eric M.; Lethbridge, Kathryn G.; Sanishvili, Ruslan

The beta-lactam antibiotics inhibit penicillin-binding proteins (PBPs) by forming a stable, covalent, acyl-enzyme complex. During the evolution from PBPs to Class A beta-lactamases, the beta-lactamases acquired Glu166 to activate a catalytic water and cleave the acyl-enzyme bond. Here we present three product complex crystal structures of CTX-M-14 Class A beta-lactamase with a ruthenocene-conjugated penicillin (a 0.85 angstrom resolution structure of E166A mutant complexed with the penilloate product, a 1.30 angstrom resolution complex structure of the same mutant with the penicilloate product, and a 1.18 angstrom resolution complex structure of S70G mutant with a penicilloate product epimer), shedding light on the catalytic mechanisms and product inhibition of PBPs and Class A beta-lactamases. The E166A-penilloate complex captured the hydrogen bonding network following the protonation of the leaving group and, for the first time, unambiguously shows that the ring nitrogen donates a proton to Ser130, which in turn donates a proton to Lys73. These observations indicate that in the absence of Glu166, the equivalent lysine would be neutral in PBPs and therefore capable of serving as the general base to activate the catalytic serine. Together with previous results, this structure suggests a common proton relay network shared by Class A beta-lactamases and PBPs, from the catalytic serine to the lysine, and ultimately to the ring nitrogen. Additionally, the E166A-penicilloate complex reveals previously unseen conformational changes of key catalytic residues during the release of the product, and is the first structure to capture the hydrolyzed product in the presence of an unmutated catalytic serine.
Recommendations of the SFH (French Society of Haematology) for the diagnosis, treatment and follow-up of hairy cell leukaemia.

PubMed

Cornet, Edouard; Delmer, Alain; Feugier, Pierre; Garnache-Ottou, Francine; Ghez, David; Leblond, Véronique; Levy, Vincent; Maloisel, Frédéric; Re, Daniel; Zini, Jean-Marc; Troussard, Xavier

2014-12-01

Hairy cell leukaemia (HCL) is a rare haematological malignancy, with approximately 175 new incident cases in France. Diagnosis is based on a careful examination of the blood smear and immunophenotyping of the tumour cells, with a panel of four markers being used specifically to screen for hairy cells (CD11c, CD25, CD103 and CD123). In 2011, the V600E mutation of the BRAF gene in exon 15 was identified in HCL; it is present in HCL but absent in the variant form of HCL (HCL-v) and in splenic red pulp lymphoma (SRPL), two entities related to HCL. The management of patients with HCL has changed in recent years. A poorer response to purine nucleoside analogues (PNAs) is observed in patients with more marked leukocytosis, bulky splenomegaly, an unmutated immunoglobulin variable heavy chain (IgVH) gene profile, use of VH4-34 or with TP53 mutations. We present the recommendations of a group of 11 experts belonging to a number of French hospitals. This group met in November 2013 to examine the criteria for managing patients with HCL. The ideas and proposals of the group are based on a critical analysis of the recommendations already published in the literature and on an analysis of the practices of clinical haematology departments with experience in managing these patients. The first-line treatment uses purine analogues: cladribine or pentostatin. The role of BRAF inhibitors, whether or not combined with MEK inhibitors, is discussed. The panel of French experts proposed recommendations to manage patients with HCL, which can be used in daily practice.
Serum TK levels in CLL identify Binet stage A patients within biologically defined prognostic subgroups most likely to undergo disease progression.

PubMed

Matthews, Christine; Catherwood, Mark A; Morris, T C M; Kettle, Paul J; Drake, Mary B; Gilmore, William S; Alexander, H Denis

2006-10-01

Serum thymidine kinase (TK) levels have been shown to be correlated with survival in many malignancies, including chronic lymphocytic leukaemia (CLL). This study was designed to investigate associations between TK levels and other prognostic markers, in newly and previously diagnosed Binet stage A patients. Furthermore, the use of serum TK measurement to identify subcategories of disease within those defined by IgV(H) mutational status, gene usage and chromosomal aberrations was investigated. Ninety-one CLL patients were enrolled. Serum TK levels were measured using a radioenzyme assay. IgV(H) mutational status and V(H) gene usage were determined using BIOMED-2 primers and protocol. Recurring chromosomal abnormalities were detected by interphase fluorescent in situ hybridisation (FISH). Flow cytometry and reverse transcriptase polymerase chain reaction (RT-PCR) determined CD38 and Zap-70 expression, respectively. Significantly higher serum TK levels were found in IgV(H) unmutated, compared with IgV(H) mutated, patients (P < 0.001). Elevated TK levels were also found in patients with CD38 and Zap-70 positivity (P = 0.004, P < 0.001, respectively), short lymphocyte doubling time (LDT) (P = 0.044) and poor or intermediate prognosis chromosomal aberrations (P < 0.001). A TK level of >8.5 U/L best identified patients with progressive disease. Elevated TK levels could identify patients categorised, at diagnosis, into good prognosis subgroups by the various biological markers (mutated IgV(H), good prognosis chromosomal aberrations, Zap-70(-) and CD38(-)) who subsequently showed disease progression. Additionally, patients with V(H)3-21 gene usage showed high TK levels, irrespective of mutational status, and serum TK measurement retained predictive power as disease progressed in all subcategories studied.
CD38 expression and immunoglobulin variable region mutations are independent prognostic variables in chronic lymphocytic leukemia, but CD38 expression may vary during the course of the disease.

PubMed

Hamblin, Terry J; Orchard, Jenny A; Ibbotson, Rachel E; Davis, Zadie; Thomas, Peter W; Stevenson, Freda K; Oscier, David G

2002-02-01

Although the presence or absence of somatic mutations in the immunoglobulin variable region (IgV(H)) genes in chronic lymphocytic leukemia (B-CLL) identifies subtypes with very different prognoses, the assay is technically complex and unavailable to most laboratories. CD38 expression has been suggested as a surrogate marker for the 2 subtypes. IgV(H) mutations and CD38 expression in 145 patients with B-CLL with a long follow-up were compared. The 2 assays gave discordant results in 41 patients (28.3%). Multivariate analysis demonstrated that Binet stage, IgV(H) mutations and CD38 were independent prognostic indicators. Median survival time in patients whose cells had unmutated IgV(H) genes and expressed CD38 was 8 years; in those with mutated IgV(H) genes not expressing CD38, it was 26 years. For those with discordant results, median survival time was 15 years. Thus, although CD38 expression does not identify the same 2 subsets as IgV(H) mutations in CLL, it is an independent risk factor that can be used with IgV(H) mutations and clinical stage to select patients with B-CLL with the worst prognoses. Using cryopreserved cells taken at intervals during the course of the disease, however, changes of CD38 expression over time were demonstrated in 10 of 41 patients. Causes of the variation of CD38 expression require further study. Additional prospective studies are required for comparing CD38 expression with other prognostic factors and for taking sequential measurements during the course of the disease.
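The grouping and the discordance rate reported in this abstract can be reproduced with a few lines of arithmetic. The following is a minimal Python sketch, not the authors' analysis: the function name `risk_group` and the group labels are illustrative assumptions, and only the counts (41 of 145) and the median survival times (8, 26 and 15 years) come from the abstract.

```python
# Median survival (years) per group, as quoted in the abstract.
# The group labels are hypothetical names chosen for this sketch.
MEDIAN_SURVIVAL_YEARS = {
    "unmutated_cd38_pos": 8,
    "mutated_cd38_neg": 26,
    "discordant": 15,
}


def risk_group(ighv_mutated: bool, cd38_positive: bool) -> str:
    """Combine the two markers into the three groups the abstract describes."""
    if not ighv_mutated and cd38_positive:
        return "unmutated_cd38_pos"
    if ighv_mutated and not cd38_positive:
        return "mutated_cd38_neg"
    # Either mutated with CD38 expression, or unmutated without it.
    return "discordant"


# Discordance rate: 41 discordant patients out of 145 compared.
DISCORDANCE_PCT = round(100 * 41 / 145, 1)

if __name__ == "__main__":
    group = risk_group(ighv_mutated=False, cd38_positive=True)
    print(group, MEDIAN_SURVIVAL_YEARS[group], DISCORDANCE_PCT)
```

The two markers are kept as separate inputs because the abstract reports them as independent prognostic variables rather than as surrogates for each other.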
mTORC1 Inhibition Induces Resistance to Methotrexate and 6-Mercaptopurine in Ph+ and Ph-like B-ALL.

PubMed

Vo, Thanh-Trang T; Lee, J Scott; Nguyen, Duc; Lui, Brandon; Pandori, William; Khaw, Andrew; Mallya, Sharmila; Lu, Mengrou; Müschen, Markus; Konopleva, Marina; Fruman, David A

2017-09-01

Elevated activity of mTOR is associated with poor prognosis and higher incidence of relapse in B-cell acute lymphoblastic leukemia (B-ALL). Thus, ongoing clinical trials are testing mTOR inhibitors in combination with chemotherapy in B-ALL. However, the combination of mTOR inhibitors with standard of care chemotherapy drugs has not been studied extensively in high-risk B-ALL subtypes. Therefore, we tested whether mTOR inhibition can augment the efficacy of current chemotherapy agents in Ph+ and Ph-like B-ALL models. Surprisingly, inhibiting mTOR complex 1 (mTORC1) protected B-ALL cells from killing by methotrexate and 6-mercaptopurine, two antimetabolite drugs used in maintenance chemotherapy. The cytoprotective effects correlated with decreased cell-cycle progression and were recapitulated using cell-cycle inhibitors, palbociclib or aphidicolin. Dasatinib, a tyrosine kinase inhibitor currently used in Ph+ patients, inhibits ABL kinase upstream of mTOR. Dasatinib resistance is mainly caused by ABL kinase mutations, but is also observed in a subset of ABL unmutated cases. We identified dasatinib-resistant Ph+ cell lines and patient samples in which dasatinib can effectively reduce ABL kinase activity and mTORC1 signaling without causing cell death. In these cases, dasatinib protected leukemia cells from killing by 6-mercaptopurine. Using xenograft models, we observed that mTOR inhibition or dasatinib increased the numbers of leukemia cells that emerge after cessation of chemotherapy treatment. These results demonstrate that inhibitors targeting mTOR or upstream signaling nodes should be used with caution when combined with chemotherapeutic agents that rely on cell-cycle progression to kill B-ALL cells. Mol Cancer Ther; 16(9); 1942-53. ©2017 AACR.
Missense mutations located in structural p53 DNA-binding motifs are associated with extremely poor survival in chronic lymphocytic leukemia.

PubMed

Trbusek, Martin; Smardova, Jana; Malcikova, Jitka; Sebejova, Ludmila; Dobes, Petr; Svitakova, Miluse; Vranova, Vladimira; Mraz, Marek; Francova, Hana Skuhrova; Doubek, Michael; Brychtova, Yvona; Kuglik, Petr; Pospisilova, Sarka; Mayer, Jiri

2011-07-01

There is a distinct connection between TP53 defects and poor prognosis in chronic lymphocytic leukemia (CLL). It remains unclear whether patients harboring TP53 mutations represent a homogenous prognostic group. We evaluated the survival of patients with CLL and p53 defects identified at our institution by p53 yeast functional assay and complementary interphase fluorescence in situ hybridization analysis detecting del(17p) from 2003 to 2010. A defect of the TP53 gene was identified in 100 of 550 patients. p53 mutations were strongly associated with the deletion of 17p and the unmutated IgVH locus (both P < .001). Survival assessed from the time of abnormality detection was significantly reduced in patients with both missense (P < .001) and nonmissense p53 mutations (P = .004). In addition, patients harboring missense mutation located in p53 DNA-binding motifs (DBMs), structurally well-defined parts of the DNA-binding domain, manifested a clearly shorter median survival (12 months) compared with patients having missense mutations outside DBMs (41 months; P = .002) or nonmissense alterations (36 months; P = .005). The difference in survival was similar in the analysis limited to patients harboring mutation accompanied by del(17p) and was also confirmed in a subgroup harboring TP53 defect at diagnosis. The patients with p53 DBMs mutation (at diagnosis) also manifested a short median time to first therapy (TTFT; 1 month). The substantially worse survival and the short TTFT suggest a strong mutated p53 gain-of-function phenotype in patients with CLL with DBMs mutations. The impact of p53 DBMs mutations on prognosis and response to therapy should be analyzed in investigative clinical trials.

A novel positive allosteric modulator of the GABAA receptor: the action of (+)-ROD188

PubMed Central

Thomet, Urs; Baur, Roland; Razet, Rodolphe; Dodd, Robert H; Furtmüller, Roman; Sieghart, Werner; Sigel, Erwin

2000-01-01

(+)-ROD188 was synthesized in the search for novel ligands of the GABA binding site. It shares some structural similarity with bicuculline. (+)-ROD188 failed to displace [3H]-muscimol in binding studies and failed to induce channel opening in recombinant rat α1β2γ2 GABAA receptors functionally expressed in Xenopus oocytes. (+)-ROD188 allosterically stimulated GABA-induced currents. Displacement of [3H]-Ro15-1788 indicated a low affinity action at the benzodiazepine binding site. In functional studies, stimulation by (+)-ROD188 showed little sensitivity to the presence of 1 μM of the benzodiazepine antagonist Ro 15-1788, and (+)-ROD188 also stimulated currents mediated by α1β2, indicating a major mechanism of action different from that of benzodiazepines. Allosteric stimulation by (+)-ROD188 was similar in α1β2N265S as in unmutated α1β2, while that by loreclezole was strongly reduced. (+)-ROD188 also strongly stimulated currents elicited by either pentobarbital or 5α-pregnan-3α-ol-20-one (3α-OH-DHP), in line with a mode of action different from that of barbiturates or neurosteroids as channel agonists. Stimulation by (+)-ROD188 was largest in α6β2γ2 (α6β2γ2>>α1β2γ2=α5β2γ2>α2β2γ2=α3β2γ2), indicating a unique subunit isoform specificity. Miniature inhibitory postsynaptic currents (mIPSC) in cultures of rat hippocampal neurons, caused by spontaneous release of GABA, showed a prolonged decay time in the presence of 30 μM (+)-ROD188, indicating an enhanced synaptic inhibitory transmission. PMID:11030736
Systemic mastocytosis in adults: 2017 update on diagnosis, risk stratification and management.

PubMed

Pardanani, Animesh

2016-11-01

Disease overview: Systemic mastocytosis (SM) results from a clonal proliferation of abnormal mast cells (MC) in one or more extra-cutaneous organs. The major criterion is presence of multifocal clusters of morphologically abnormal MC in the bone marrow. Minor diagnostic criteria include elevated serum tryptase level, abnormal MC expression of CD25 and/or CD2, and presence of KITD816V. Risk stratification: The 2008 World Health Organization (WHO) classification of SM has been shown to be prognostically relevant. Classification of SM patients into indolent SM (ISM), aggressive SM (ASM), SM associated with a clonal non-MC lineage disease (SM-AHNMD) and mast cell leukemia (MCL) subgroups is a useful first step in establishing prognosis. SM treatment is generally palliative. ISM patients have a normal life expectancy and receive symptom-directed therapy; infrequently, cytoreductive therapy may be indicated for refractory symptoms. ASM patients have disease-related organ dysfunction; interferon-α (±corticosteroids) can control dermatological, hematological, gastrointestinal, skeletal and mediator-release symptoms, but is hampered by poor tolerability. Similarly, cladribine has broad therapeutic activity, with particular utility when rapid MC debulking is indicated; the main toxicity is myelosuppression. Imatinib has a therapeutic role in the presence of an imatinib-sensitive KIT mutation or in KITD816-unmutated patients. Treatment of SM-AHNMD is governed primarily by the non-MC neoplasm; hydroxyurea has modest utility in this setting; there is a role for allogeneic stem cell transplantation in select cases. Investigational drugs: Recent data confirm midostaurin's significant anti-MC activity in patients with advanced SM. Am. J. Hematol. 91:1147-1159, 2016. © 2016 Wiley Periodicals, Inc.
Heterosubtypic Neutralizing Monoclonal Antibodies Cross-Protective against H5N1 and H1N1 Recovered from Human IgM+ Memory B Cells

PubMed Central

Throsby, Mark; van den Brink, Edward; Jongeneelen, Mandy; Poon, Leo L. M.; Alard, Philippe; Cornelissen, Lisette; Bakker, Arjen; Cox, Freek; van Deventer, Els; Guan, Yi; Cinatl, Jindrich; ter Meulen, Jan; Lasters, Ignace; Carsetti, Rita; Peiris, Malik; de Kruif, John; Goudsmit, Jaap

2008-01-01

Background: The hemagglutinin (HA) glycoprotein is the principal target of protective humoral immune responses to influenza virus infections but such antibody responses only provide efficient protection against a narrow spectrum of HA antigenic variants within a given virus subtype. Avian influenza viruses such as H5N1 are currently panzootic and pose a pandemic threat. These viruses are antigenically diverse and protective strategies need to cross protect against diverse viral clades. Furthermore, there are 16 different HA subtypes and no certainty the next pandemic will be caused by an H5 subtype, thus it is important to develop prophylactic and therapeutic interventions that provide heterosubtypic protection. Methods and Findings: Here we describe a panel of 13 monoclonal antibodies (mAbs) recovered from combinatorial display libraries that were constructed from human IgM+ memory B cells of recent (seasonal) influenza vaccinees. The mAbs have broad heterosubtypic neutralizing activity against antigenically diverse H1, H2, H5, H6, H8 and H9 influenza subtypes. Restriction to variable heavy chain gene IGHV1-69 in the high affinity mAb panel was associated with binding to a conserved hydrophobic pocket in the stem domain of HA. The most potent antibody (CR6261) was protective in mice when given before and after lethal H5N1 or H1N1 challenge. Conclusions: The human monoclonal CR6261 described in this study could be developed for use as a broad spectrum agent for prophylaxis or treatment of human or avian influenza infections without prior strain characterization. Moreover, the CR6261 epitope could be applied in targeted vaccine strategies or in the design of novel antivirals. Finally, our approach of screening the IgM+ memory repertoire could be applied to identify conserved and functionally relevant targets on other rapidly evolving pathogens. PMID:19079604
Boosting of HIV envelope CD4 binding site antibodies with long variable heavy third complementarity determining region in the randomized double blind RV305 HIV-1 vaccine trial

PubMed Central

Ackerman, Margaret; Saunders, Kevin O.; Pollara, Justin; Vandergrift, Nathan; Parks, Rob; Michael, Nelson L.; O’Connell, Robert J.; Vasan, Sandhya; Rerks-Ngarm, Supachai; Kaewkungwal, Jaranit; Pitisuttithum, Punnee; Nitayaphan, Sorachai; Sinangil, Faruk; Phogat, Sanjay; Alam, S. Munir; Liao, Hua-Xin; Ferrari, Guido; Seaman, Michael S.; Montefiori, David C.; Harrison, Stephen C.; Haynes, Barton F.

2017-01-01

The canary pox vector and gp120 vaccine (ALVAC-HIV and AIDSVAX B/E gp120) in the RV144 HIV-1 vaccine trial conferred an estimated 31% vaccine efficacy. Although the vaccine Env AE.A244 gp120 is antigenic for the unmutated common ancestor of V1V2 broadly neutralizing antibody (bnAbs), no plasma bnAb activity was induced. The RV305 (NCT01435135) HIV-1 clinical trial was a placebo-controlled randomized double-blinded study that assessed the safety and efficacy of vaccine boosting on B cell repertoires. HIV-1-uninfected RV144 vaccine recipients were reimmunized 6–8 years later with AIDSVAX B/E gp120 alone, ALVAC-HIV alone, or a combination of ALVAC-HIV and AIDSVAX B/E gp120 in the RV305 trial. Env-specific post-RV144 and RV305 boost memory B cell VH mutation frequencies increased from 2.9% post-RV144 to 6.7% post-RV305. The vaccine was well tolerated with no adverse events reports. While post-boost plasma did not have bnAb activity, the vaccine boosts expanded a pool of envelope CD4 binding site (bs)-reactive memory B cells with long third heavy chain complementarity determining regions (HCDR3) whose germline precursors and affinity matured B cell clonal lineage members neutralized the HIV-1 CRF01 AE tier 2 (difficult to neutralize) primary isolate, CNE8. Electron microscopy of two of these antibodies bound with near-native gp140 trimers showed that they recognized an open conformation of the Env trimer. Although late boosting of RV144 vaccinees expanded a novel pool of neutralizing B cell clonal lineages, we hypothesize that boosts with stably closed trimers would be necessary to elicit antibodies with greater breadth of tier 2 HIV-1 strains. Trial Registration: ClinicalTrials.gov NCT01435135 PMID:28235027
Variations in the detection of ZAP-70 in chronic lymphocytic leukemia: Comparison with IgV(H) mutation analysis.

PubMed

Sheikholeslami, M R; Jilani, I; Keating, M; Uyeji, J; Chen, K; Kantarjian, H; O'Brien, S; Giles, F; Albitar, M

2006-07-15

Lack of immunoglobulin heavy chain genes (IgV(H)) mutation in patients with chronic lymphocytic leukemia (CLL) is associated with rapid disease progression and shorter survival. The zeta-chain (T-cell receptor) associated protein kinase 70 kDa (ZAP-70) has been reported to be a surrogate marker for IgV(H) mutation status, and its expression in leukemic cells correlates with unmutated IgV(H). However, ZAP-70 detection by flow cytometry varies significantly depending on the antibodies used, the method of performing the assay, and the condition of the cells in the specimen. The clinical value of ZAP-70 testing when samples are shipped under poorly controlled conditions is not known. Furthermore, testing in a research environment may differ from testing in a routine clinical laboratory. We validated an assay for ZAP-70 by comparing results with clinical outcome and the mutation status of the IgV(H). Using stored samples, we show significant correlation between ZAP-70 expression and clinical outcome as well as IgV(H) mutation at a cut-off point of 15%. While positive samples (>15% positivity) remain positive when kept in the laboratory environment for 48 h after initial testing, results obtained from samples from CLL patients tested after shipping at room temperature for routine testing showed no correlation with IgV(H) mutation status when the 15% cut-off was used. In these samples, a cut-point of 10% correlated with the IgV(H) mutation (P = 0.0001). These data suggest that although ZAP-70 positivity correlates with IgV(H) mutation status and survival, variations in sample handling and preparation may influence results. We show that IgV(H) mutation results, unlike ZAP-70, remain correlated with CD38 expression and beta-2 microglobulin in shipped samples, and ZAP-70 testing should not be used as the sole criterion for stratifying patients for therapy. (c) 2006 International Society for Analytical Cytology.
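The handling-dependent cut-offs described in this abstract can be illustrated with a small sketch. It is written in Python under stated assumptions: the function name `zap70_positive` and the `shipped` flag are hypothetical, and treating the 10% cut-point as a strict "greater than" threshold is an assumption, since the abstract only states the strict form for the 15% cut-off.

```python
# Cut-offs (percent ZAP-70-positive cells) quoted in the abstract.
LAB_CUTOFF_PCT = 15.0      # stored or laboratory-kept samples
SHIPPED_CUTOFF_PCT = 10.0  # samples shipped at room temperature


def zap70_positive(percent_positive: float, shipped: bool) -> bool:
    """Call a sample ZAP-70 positive using the cut-off for its handling.

    Assumption: both cut-offs are applied as strict inequalities.
    """
    if not 0.0 <= percent_positive <= 100.0:
        raise ValueError("percent_positive must be between 0 and 100")
    cutoff = SHIPPED_CUTOFF_PCT if shipped else LAB_CUTOFF_PCT
    return percent_positive > cutoff


if __name__ == "__main__":
    # The same reading is called differently depending on sample handling.
    print(zap70_positive(12.0, shipped=False))
    print(zap70_positive(12.0, shipped=True))
```

A reading between the two cut-offs changes classification with handling, which is consistent with the abstract's caution against using ZAP-70 as the sole stratification criterion.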
CD73 expression identifies a subset of IgM+ antigen-experienced cells with memory attributes that is T cell and CD40 signalling dependent.

PubMed

D'Souza, Lucas; Gupta, Sneh Lata; Bal, Vineeta; Rath, Satyajit; George, Anna

2017-12-01

B-cell memory was long characterized as isotype-switched, somatically mutated and germinal centre (GC)-derived. However, it is now clear that the memory pool is a complex mixture that includes unswitched and unmutated cells. Further, expression of CD73, CD80 and CD273 has allowed the categorization of B-cell memory into multiple subsets, with combinatorial expression of the markers increasing with GC progression, isotype-switching and acquisition of somatic mutations. We have extended these findings to determine whether these markers can be used to identify IgM memory phenotypically as arising from T-dependent versus T-independent responses. We report that CD73 expression identifies a subset of antigen-experienced IgM+ cells that share attributes of functional B-cell memory. This subset is reduced in the spleens of T-cell-deficient and CD40-deficient mice, and in mixed marrow chimeras made with mutant and wild-type marrow, the proportion of CD73+ IgM memory is restored in the T-cell-deficient donor compartment but not in the CD40-deficient donor compartment, indicating that CD40 ligation is involved in its generation. We also report that CD40 signalling supports optimal expression of CD73 on splenic T cells and age-associated B cells (ABCs), but not on other immune cells such as neutrophils, marginal zone B cells, peritoneal cavity B-1 B cells and regulatory T and B cells. Our data indicate that in addition to promoting GC-associated memory generation during B-cell differentiation, CD40-signalling can influence the composition of the unswitched memory B-cell pool. They also raise the possibility that a fraction of ABCs may represent T-cell-dependent IgM memory. © 2017 John Wiley & Sons Ltd.
Boosting of HIV envelope CD4 binding site antibodies with long variable heavy third complementarity determining region in the randomized double blind RV305 HIV-1 vaccine trial.

PubMed

Easterhoff, David; Moody, M Anthony; Fera, Daniela; Cheng, Hao; Ackerman, Margaret; Wiehe, Kevin; Saunders, Kevin O; Pollara, Justin; Vandergrift, Nathan; Parks, Rob; Kim, Jerome; Michael, Nelson L; O'Connell, Robert J; Excler, Jean-Louis; Robb, Merlin L; Vasan, Sandhya; Rerks-Ngarm, Supachai; Kaewkungwal, Jaranit; Pitisuttithum, Punnee; Nitayaphan, Sorachai; Sinangil, Faruk; Tartaglia, James; Phogat, Sanjay; Kepler, Thomas B; Alam, S Munir; Liao, Hua-Xin; Ferrari, Guido; Seaman, Michael S; Montefiori, David C; Tomaras, Georgia D; Harrison, Stephen C; Haynes, Barton F

2017-02-01

The canary pox vector and gp120 vaccine (ALVAC-HIV and AIDSVAX B/E gp120) in the RV144 HIV-1 vaccine trial conferred an estimated 31% vaccine efficacy. Although the vaccine Env AE.A244 gp120 is antigenic for the unmutated common ancestor of V1V2 broadly neutralizing antibody (bnAbs), no plasma bnAb activity was induced. The RV305 (NCT01435135) HIV-1 clinical trial was a placebo-controlled randomized double-blinded study that assessed the safety and efficacy of vaccine boosting on B cell repertoires. HIV-1-uninfected RV144 vaccine recipients were reimmunized 6-8 years later with AIDSVAX B/E gp120 alone, ALVAC-HIV alone, or a combination of ALVAC-HIV and AIDSVAX B/E gp120 in the RV305 trial. Env-specific post-RV144 and RV305 boost memory B cell VH mutation frequencies increased from 2.9% post-RV144 to 6.7% post-RV305. The vaccine was well tolerated with no adverse events reports. While post-boost plasma did not have bnAb activity, the vaccine boosts expanded a pool of envelope CD4 binding site (bs)-reactive memory B cells with long third heavy chain complementarity determining regions (HCDR3) whose germline precursors and affinity matured B cell clonal lineage members neutralized the HIV-1 CRF01 AE tier 2 (difficult to neutralize) primary isolate, CNE8. Electron microscopy of two of these antibodies bound with near-native gp140 trimers showed that they recognized an open conformation of the Env trimer. Although late boosting of RV144 vaccinees expanded a novel pool of neutralizing B cell clonal lineages, we hypothesize that boosts with stably closed trimers would be necessary to elicit antibodies with greater breadth of tier 2 HIV-1 strains. ClinicalTrials.gov NCT01435135.
Mechanisms of proton relay and product release by Class A β-lactamase at ultrahigh resolution.

PubMed

Lewandowski, Eric M; Lethbridge, Kathryn G; Sanishvili, Ruslan; Skiba, Joanna; Kowalski, Konrad; Chen, Yu

2018-01-01

The β-lactam antibiotics inhibit penicillin-binding proteins (PBPs) by forming a stable, covalent, acyl-enzyme complex. During the evolution from PBPs to Class A β-lactamases, the β-lactamases acquired Glu166 to activate a catalytic water and cleave the acyl-enzyme bond. Here we present three product complex crystal structures of CTX-M-14 Class A β-lactamase with a ruthenocene-conjugated penicillin (a 0.85 Å resolution structure of E166A mutant complexed with the penilloate product, a 1.30 Å resolution complex structure of the same mutant with the penicilloate product, and a 1.18 Å resolution complex structure of S70G mutant with a penicilloate product epimer), shedding light on the catalytic mechanisms and product inhibition of PBPs and Class A β-lactamases. The E166A-penilloate complex captured the hydrogen bonding network following the protonation of the leaving group and, for the first time, unambiguously shows that the ring nitrogen donates a proton to Ser130, which in turn donates a proton to Lys73. These observations indicate that in the absence of Glu166, the equivalent lysine would be neutral in PBPs and therefore capable of serving as the general base to activate the catalytic serine. Together with previous results, this structure suggests a common proton relay network shared by Class A β-lactamases and PBPs, from the catalytic serine to the lysine, and ultimately to the ring nitrogen. Additionally, the E166A-penicilloate complex reveals previously unseen conformational changes of key catalytic residues during the release of the product, and is the first structure to capture the hydrolyzed product in the presence of an unmutated catalytic serine. Structural data are available in the PDB database under the accession numbers 5TOP, 5TOY, and 5VLE. © 2017 Federation of European Biochemical Societies.
The xenoantibody response and immunoglobulin gene expression profile of cynomolgus monkeys transplanted with hDAF-transgenic porcine hearts.

PubMed

Zahorsky-Reeves, Joanne L; Kearns-Jonker, Mary K; Lam, Tuan T; Jackson, Jeremy R; Morris, Randall E; Starnes, Vaughn A; Cramer, Donald V

2007-03-01

Recent work has indicated a role for anti-Gal alpha 1-3Gal (Gal) and anti-non-Gal xenoantibodies in the primate humoral rejection response against human decay-accelerating factor (hDAF) transgenic pig organs. Our laboratory has shown that anti-porcine xenograft antibodies in humans and non-human primates are encoded by a small number of germline IgV(H) progenitors. In this study, we extended our analysis to identify the IgV(H) genes encoding xenoantibodies in immunosuppressed cynomolgus monkeys (Macaca fascicularis) transplanted with hDAF-transgenic pig organs. Three immunosuppressed monkeys underwent heterotopic heart transplantation with hDAF porcine heart xenografts. Two of three animals were given GAS914, a poly-L-lysine derivative shown to bind to anti-Gal xenoantibodies and neutralize them. One animal rejected its heart at post-operative day (POD) 39; a second animal rejected the transplanted heart at POD 78. The third monkey was euthanized on POD 36 but the heart was not rejected. Peripheral blood leukocytes (PBL) and serum were obtained from each animal before and at multiple time points after transplantation. We analyzed the immune response by enzyme-linked immunosorbent assay (ELISA) to confirm whether anti-Gal or anti-non-Gal xenoantibodies were induced after graft placement. Immunoglobulin heavy-chain gene (V(H)) cDNA libraries were then produced and screened. We generated soluble single-chain antibodies (scFv) to establish the binding specificity of the cloned immunoglobulin genes. Despite immunosuppression, which included the use of the polymer GAS914, the two animals that rejected their hearts showed elevated levels of cytotoxic anti-pig red blood cell (RBC) antibodies and anti-pig aortic endothelial cell (PAEC) antibodies. The monkey that did not reject its graft showed a decline in serum anti-RBC, anti-PAEC, and anti-Gal xenoantibodies when compared with pre-transplant levels. 
A V(H)3 family gene with a high level of sequence similarity to an allele of V(H)3-11, designated V(H)3-11(cyno), was expressed at elevated levels in the monkey that was not given GAS914 and whose graft was not rejected until POD 78. IgM but not IgG xenoantibodies directed at N-acetyl lactosamine (a precursor of the Gal epitope) were also induced in this animal. We produced soluble scFv from this new gene to determine whether this antibody could bind to the Gal carbohydrate, and demonstrated that this protein was capable of blocking the binding of human serum xenoantibody to Gal oligosaccharide, as had previously been shown with human V(H)3-11 scFv. DAF-transgenic organs transplanted into cynomolgus monkeys induce anti-Gal and anti-non-Gal xenoantibody responses mediated by both IgM and IgG xenoantibodies. Anti-non-Gal xenoantibodies are induced at high levels in animals treated with GAS914. Antibodies that bind to the Gal carbohydrate and to N-acetyl lactosamine are induced in the absence of GAS914 treatment. The animal whose heart remained beating for 78 days demonstrated increased usage of an antibody encoded by a germline progenitor that is structurally related to, but distinct from, IGHV3-11. This antibody binds to the Gal carbohydrate but does not induce the rapid rejection of the xenograft when expressed at high levels as early as day 8 post-transplantation.
16p11.2–p12.2 duplication syndrome; a genomic condition differentiated from euchromatic variation of 16p11.2

PubMed Central

Barber, John C K; Hall, Victoria; Maloney, Viv K; Huang, Shuwen; Roberts, Angharad M; Brady, Angela F; Foulds, Nicki; Bewes, Beverley; Volleth, Marianne; Liehr, Thomas; Mehnert, Karl; Bateman, Mark; White, Helen

2013-01-01

Chromosome 16 contains multiple copy number variations (CNVs) that predispose to genomic disorders. Here, we differentiate pathogenic duplications of 16p11.2–p12.2 from microscopically similar euchromatic variants of 16p11.2. Patient 1 was a girl of 18 with autism, moderate intellectual disability, behavioural difficulties, dysmorphic features and a 7.71-Mb (megabase pair) duplication (16:21 521 005–29 233 146). Patient 2 had a 7.81-Mb duplication (16:21 382 561–29 191 527), speech delay and obsessional behaviour as a boy and, as an adult, short stature, macrocephaly and mild dysmorphism. The duplications contain 65 coding genes of which Polo-like kinase 1 (PLK1) has the highest likelihood of being haploinsufficient and, by implication, a triplosensitive gene. An additional 1.11-Mb CNV of 10q11.21 in Patient 1 was a possible modifier containing the G-protein-regulated inducer of neurite growth 2 (GPRIN2) gene. In contrast, the euchromatic variants in Patients 3 and 4 were amplifications from a 945-kb region containing non-functional immunoglobulin heavy chain (IGHV), hect domain pseudogene (HERC2P4) and TP53-inducible target gene 3 (TP53TG3) loci in proximal 16p11.2 (16:31 953 353–32 898 635). Paralogous pyrosequencing gave a total copy number of 3–8 in controls and 8 to >10 in Patients 3 and 4. The 16p11.2–p12.2 duplication syndrome is a recurrent genomic disorder with a variable phenotype including developmental delay, dysmorphic features, mild to severe intellectual disability, autism, obsessive or stereotyped behaviour, short stature and anomalies of the hands and fingers. It is important to differentiate pathogenic 16p11.2–p12.2 duplications from harmless, microscopically similar euchromatic variants of proximal 16p11.2, especially at prenatal diagnosis. PMID:22828807
MUC4 is negatively regulated through the Wnt/β-catenin pathway via the Notch effector Hath1 in colorectal cancer

PubMed Central

Pai, Priya; Rachagani, Satyanarayana; Dhawan, Punita; Sheinin, Yuri M.; Mallya, Kavita; Pothuraju, Ramesh; Batra, Surinder K.

2016-01-01

MUC4 is a transmembrane mucin lining the normal colonic epithelium. The aberrant/de novo over-expression of MUC4 is well documented in malignancies of the pancreas, ovary and breast. However, studies have reported the loss of MUC4 expression in the majority of colorectal cancers (CRCs). A MUC4 promoter analysis showed the presence of three putative TCF/LEF sites, implying a possible regulation by the Wnt/β-catenin pathway, which has been shown to drive CRC progression. Thus, the objective of our study was to determine whether MUC4 is regulated by β-catenin in CRC. We first knocked down (KD) β-catenin in three CRC cell lines; LS180, HCT-8 and HCT116, which resulted in increased MUC4 transcript and MUC4 protein. Additionally, the overexpression of stabilized mutant β-catenin in LS180 and HCT-8 resulted in a decrease in MUC4 expression. Immunohistochemistry (IHC) of mouse colon tissue harboring tubular adenomas and high grade dysplasia showed dramatically reduced Muc4 in lesions relative to adjacent normal tissue, with increased cytosolic/nuclear β-catenin. Luciferase assays with the complete MUC4 promoter construct p3778 showed increased MUC4 promoter luciferase activity in the absence of β-catenin (KD). Mutation of all three putative TCF/LEF sites showed that MUC4 promoter luciferase activity was increased relative to the un-mutated promoter. Interestingly, it was observed that MUC4 expressing CRC cell lines also expressed high levels of Hath1, a transcription factor repressed by both active Wnt/β-catenin and Notch signaling. The KD of β-catenin and/or treatment with a Notch γ-secretase inhibitor, Dibenzazepine (DBZ) resulted in increased Hath1 and MUC4 in LS180, HCT-8 and HCT116. Furthermore, overexpression of Hath1 in HCT-8 and LS180 caused increased MUC4 transcript and MUC4 protein. Taken together, our results indicate that the Wnt/β-catenin pathway suppresses the Notch pathway effector Hath1, resulting in reduced MUC4 in CRC. PMID:27551331
MUC4 is negatively regulated through the Wnt/β-catenin pathway via the Notch effector Hath1 in colorectal cancer.

PubMed

Pai, Priya; Rachagani, Satyanarayana; Dhawan, Punita; Sheinin, Yuri M; Macha, Muzafar A; Qazi, Asif Khurshid; Chugh, Seema; Ponnusamy, Moorthy P; Mallya, Kavita; Pothuraju, Ramesh; Batra, Surinder K

2016-05-01

MUC4 is a transmembrane mucin lining the normal colonic epithelium. The aberrant/de novo over-expression of MUC4 is well documented in malignancies of the pancreas, ovary and breast. However, studies have reported the loss of MUC4 expression in the majority of colorectal cancers (CRCs). A MUC4 promoter analysis showed the presence of three putative TCF/LEF sites, implying a possible regulation by the Wnt/β-catenin pathway, which has been shown to drive CRC progression. Thus, the objective of our study was to determine whether MUC4 is regulated by β-catenin in CRC. We first knocked down (KD) β-catenin in three CRC cell lines; LS180, HCT-8 and HCT116, which resulted in increased MUC4 transcript and MUC4 protein. Additionally, the overexpression of stabilized mutant β-catenin in LS180 and HCT-8 resulted in a decrease in MUC4 expression. Immunohistochemistry (IHC) of mouse colon tissue harboring tubular adenomas and high grade dysplasia showed dramatically reduced Muc4 in lesions relative to adjacent normal tissue, with increased cytosolic/nuclear β-catenin. Luciferase assays with the complete MUC4 promoter construct p3778 showed increased MUC4 promoter luciferase activity in the absence of β-catenin (KD). Mutation of all three putative TCF/LEF sites showed that MUC4 promoter luciferase activity was increased relative to the un-mutated promoter. Interestingly, it was observed that MUC4 expressing CRC cell lines also expressed high levels of Hath1, a transcription factor repressed by both active Wnt/β-catenin and Notch signaling. The KD of β-catenin and/or treatment with a Notch γ-secretase inhibitor, Dibenzazepine (DBZ) resulted in increased Hath1 and MUC4 in LS180, HCT-8 and HCT116. Furthermore, overexpression of Hath1 in HCT-8 and LS180 caused increased MUC4 transcript and MUC4 protein. Taken together, our results indicate that the Wnt/β-catenin pathway suppresses the Notch pathway effector Hath1, resulting in reduced MUC4 in CRC.
Serum level of CD26 predicts time to first treatment in early B-chronic lymphocytic leukemia.

PubMed

Molica, Stefano; Digiesi, Giovanna; Mirabelli, Rosanna; Cutrona, Giovanna; Antenucci, Anna; Molica, Matteo; Giannarelli, Diana; Sperduti, Isabella; Morabito, Fortunato; Neri, Antonino; Baldini, Luca; Ferrarini, Manlio

2009-09-01

We analyzed the correlation between well-established biological parameters of prognostic relevance in B-cell chronic lymphocytic leukemia (CLL) [i.e. mutational status of the immunoglobulin heavy chain variable region (IgV(H)), ZAP-70- and CD38-expression] and serum levels of CD26 (dipeptidyl peptidase IV, DPP IV) by evaluating the impact of these variables on the time to first treatment (TFT) in a series of 69 previously untreated Binet stage A B-cell CLL patients. By using a commercial ELISA we found that, with the exception of a borderline significance for ZAP-70 (P = 0.07) and CD38 (P = 0.08), circulating levels of CD26 did not correlate with either Rai substages (P = 0.520) or other biomarkers [beta2-microglobulin (P = 0.933), LDH (P = 0.101), mutational status of IgV(H) (P = 0.320)]. Maximally selected log-rank statistic plots identified a CD26 serum concentration of 371 ng/mL as the best cut-off. This threshold allowed the identification of two subsets of patients with CD26 serum levels higher and lower than 371 ng/mL respectively, whose clinical outcome was different with respect to TFT (i.e. 46% and 71% at 5 yr respectively; P = 0.005). Along with higher serum levels of CD26, the univariate Cox proportional hazard model identified absence of mutation in IgV(H) (P < 0.0001) as predictor of shorter TFT. As in multivariate analysis all these parameters maintained their discriminating power (mutational status of IgV(H), P < 0.0001; soluble CD26, P = 0.02), their combined effect on clinical outcome was assessed. When three groups were considered: (1) Low-risk group (n = 31), patients with concordant IgVH(mut) and low level of soluble CD26; (2) intermediate risk group (n = 26), patients with discordant pattern; (3) high-risk group (n = 12), patients with concordant IgVH(unmut) and high level of soluble CD26, differences in the TFT were statistically significant, with a TFT at 5 yr of respectively 88%, 51% and 43% (P < 0.0001). 
Our results indicate that in early B-cell CLL a biological profile including, among other parameters, soluble CD26 may provide a useful insight into the complex interrelationship of prognostic variables. Furthermore, CD26 along with mutational status of IgV(H) can be adequately used to predict the clinical behavior of patients with low-risk disease.
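The three-group stratification described in this abstract can be sketched in a few lines of Python. This is only an illustration of the grouping rule, not the authors' analysis code: the `Patient` class, the `risk_group` function, and the decision to treat a value exactly at the cut-off as "high" are assumptions made here; only the 371 ng/mL cut-off and the group definitions come from the abstract.

```python
from dataclasses import dataclass

# Cut-off for serum CD26 reported in the abstract (ng/mL).
CD26_CUTOFF_NG_ML = 371.0


@dataclass
class Patient:
    """Hypothetical record holding the two markers used for grouping."""
    ighv_mutated: bool   # True if the IgV(H) gene is mutated
    cd26_ng_ml: float    # serum CD26 concentration in ng/mL


def risk_group(patient: Patient) -> str:
    """Return 'low', 'intermediate' or 'high' following the abstract's rule."""
    # Assumption: a value exactly at the cut-off counts as high.
    high_cd26 = patient.cd26_ng_ml >= CD26_CUTOFF_NG_ML
    if patient.ighv_mutated and not high_cd26:
        return "low"           # concordant: mutated IgV(H), low CD26
    if not patient.ighv_mutated and high_cd26:
        return "high"          # concordant: unmutated IgV(H), high CD26
    return "intermediate"      # discordant pattern


if __name__ == "__main__":
    for p in (Patient(True, 250.0), Patient(True, 400.0), Patient(False, 500.0)):
        print(p, "->", risk_group(p))
```

The function only assigns a label; the 5-year TFT figures quoted above are cohort results and cannot be derived from it.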
Clinical effect of stereotyped B-cell receptor immunoglobulins in chronic lymphocytic leukaemia: a retrospective multicentre study.

PubMed

Baliakas, Panagiotis; Hadzidimitriou, Anastasia; Sutton, Lesley-Ann; Minga, Eva; Agathangelidis, Andreas; Nichelatti, Michele; Tsanousa, Athina; Scarfò, Lydia; Davis, Zadie; Yan, Xiao-Jie; Shanafelt, Tait; Plevova, Karla; Sandberg, Yorick; Vojdeman, Fie Juhl; Boudjogra, Myriam; Tzenou, Tatiana; Chatzouli, Maria; Chu, Charles C; Veronese, Silvio; Gardiner, Anne; Mansouri, Larry; Smedby, Karin E; Pedersen, Lone Bredo; van Lom, Kirsten; Giudicelli, Véronique; Francova, Hana Skuhrova; Nguyen-Khac, Florence; Panagiotidis, Panagiotis; Juliusson, Gunnar; Angelis, Lefteris; Anagnostopoulos, Achilles; Lefranc, Marie-Paule; Facco, Monica; Trentin, Livio; Catherwood, Mark; Montillo, Marco; Geisler, Christian H; Langerak, Anton W; Pospisilova, Sarka; Chiorazzi, Nicholas; Oscier, David; Jelinek, Diane F; Darzentas, Nikos; Belessi, Chrysoula; Davi, Frederic; Rosenquist, Richard; Ghia, Paolo; Stamatopoulos, Kostas

2014-11-01

About 30% of cases of chronic lymphocytic leukaemia (CLL) carry quasi-identical B-cell receptor immunoglobulins and can be assigned to distinct stereotyped subsets. Although preliminary evidence suggests that B-cell receptor immunoglobulin stereotypy is relevant from a clinical viewpoint, this aspect has never been explored in a systematic manner or in a cohort of adequate size that would enable clinical conclusions to be drawn. For this retrospective, multicentre study, we analysed 8593 patients with CLL for whom immunogenetic data were available. These patients were followed up in 15 academic institutions throughout Europe (in Czech Republic, Denmark, France, Greece, Italy, Netherlands, Sweden, and the UK) and the USA, and data were collected between June 1, 2012, and June 7, 2013. We retrospectively assessed the clinical implications of CLL B-cell receptor immunoglobulin stereotypy, with a particular focus on 14 major stereotyped subsets comprising cases expressing unmutated (U-CLL) or mutated (M-CLL) immunoglobulin heavy chain variable genes. The primary outcome of our analysis was time to first treatment, defined as the time between diagnosis and date of first treatment. 2878 patients were assigned to a stereotyped subset, of which 1122 patients belonged to one of 14 major subsets. Stereotyped subsets showed significant differences in terms of age, sex, disease burden at diagnosis, CD38 expression, and cytogenetic aberrations of prognostic significance. Patients within a specific subset generally followed the same clinical course, whereas patients in different stereotyped subsets, despite having the same immunoglobulin heavy variable gene and displaying similar immunoglobulin mutational status, showed substantially different times to first treatment. 
By integrating B-cell receptor immunoglobulin stereotypy (for subsets 1, 2, and 4) into the well established Döhner cytogenetic prognostic model, we showed that these subsets, which collectively account for around 7% of all cases of CLL and represent both U-CLL and M-CLL, constituted separate clinical entities, ranging from very indolent (subset 4) to aggressive disease (subsets 1 and 2). The molecular classification of chronic lymphocytic leukaemia based on B-cell receptor immunoglobulin stereotypy improves the Döhner hierarchical model and refines prognostication beyond immunoglobulin mutational status, with potential implications for clinical decision making, especially within prospective clinical trials. Funding: European Union; General Secretariat for Research and Technology of Greece; AIRC; Italian Ministry of Health; AIRC Regional Project with Fondazione CARIPARO and CARIVERONA; Regione Veneto on Chronic Lymphocytic Leukemia; Nordic Cancer Union; Swedish Cancer Society; Swedish Research Council; and National Cancer Institute (NIH). Copyright © 2014 Elsevier Ltd. All rights reserved.
Genomic Profile and Pathologic Features of Diffuse Large B-Cell Lymphoma Subtype of Methotrexate-associated Lymphoproliferative Disorder in Rheumatoid Arthritis Patients.

PubMed

Carreras, Joaquim; Yukie Kikuti, Yara; Miyaoka, Masashi; Hiraiwa, Shinichiro; Tomita, Sakura; Ikoma, Haruka; Kondo, Yusuke; Shiraiwa, Sawako; Ando, Kiyoshi; Sato, Shinji; Suzuki, Yasuo; Miura, Ikuo; Roncador, Giovanna; Nakamura, Naoya

2018-05-05

Rheumatoid arthritis patients often develop the diffuse large B-cell lymphoma subtype of methotrexate-associated lymphoproliferative disorder (DLBCL). We characterized the genomic profile and pathologic characteristics of 20 biopsies using an integrative approach. DLBCL was associated with extranodal involvement, a high/high-intermediate international prognostic index in 53% of cases, and responded to MTX withdrawal. The phenotype was nongerminal center B-cell in 85% of samples and Epstein-Barr encoding region (EBER) positive in 65%, with a high proliferation index and intermediate MYC expression levels. The immune microenvironment showed high numbers of CD8 cytotoxic T lymphocytes and CD163 M2 macrophages with an (CD163/CD68) M2 ratio of 3.6. Its genomic profile was characterized by 3p12.1-q25.31, 6p25.3, 8q23.1-q24.3, and 12p13.33-q24.33 gains, 6q22.31-q24.1 and 13q21.33-q34 losses, and 1p36.11-p35.3 copy neutral loss-of-heterozygosity. This profile was closer to nongerminal center B-cell DLBCL not-otherwise-specified, but with characteristic 3q, 12q, and 20p gains and lower 9p losses (P<0.05). We successfully verified array results using fluorescent DNA in situ hybridization on PLOD2, MYC, WNT1, and BCL2. Protein immunohistochemistry revealed that DLBCL expressed high IRF4 (6p25.3) and SELPLG (12q24.11) levels, intermediate TNFRSF14 (1p36.32; the exons 1 to 3 were unmutated), BTLA (3q13.2), PLOD2 (3q24), KLHL6 (3q27.1), and MYC (8q24.21) levels, and low AICDA (12p13.31) and EFNB2 (13q33.3) levels. The correlation between the DNA copy number and protein immunohistochemistry was confirmed for BTLA, PLOD2, and EFNB2. The characteristics of EBER-positive versus EBER-negative cases were similar, with the exception of specific changes: EBER-positive cases had higher numbers of CD163 M2 macrophages and FOXP3 regulatory T lymphocytes, high programmed cell death 1 ligand 1 expression levels, slightly fewer genomic changes, and 3q and 4p focal gains. 
In conclusion, DLBCL has a characteristic genomic profile with 3q and 12 gains, 13q loss, different expression levels of relevant pathogenic biomarkers, and a microenvironment with high numbers of cytotoxic T lymphocytes and M2 macrophages.
Is sequence awareness mandatory for perceptual sequence learning: An assessment using a pure perceptual sequence learning design.

PubMed

Deroost, Natacha; Coomans, Daphné

2018-02-01

We examined the role of sequence awareness in a pure perceptual sequence learning design. Participants had to react to the target's colour that changed according to a perceptual sequence. By varying the mapping of the target's colour onto the response keys, motor responses changed randomly. The effect of sequence awareness on perceptual sequence learning was determined by manipulating the learning instructions (explicit versus implicit) and assessing the amount of sequence awareness after the experiment. In the explicit instruction condition (n = 15), participants were instructed to intentionally search for the colour sequence, whereas in the implicit instruction condition (n = 15), they were left uninformed about the sequenced nature of the task. Sequence awareness after the sequence learning task was tested by means of a questionnaire and the process-dissociation-procedure. The results showed that the instruction manipulation had no effect on the amount of perceptual sequence learning. Based on their report to have actively applied their sequence knowledge during the experiment, participants were subsequently regrouped in a sequence strategy group (n = 14, of which 4 participants from the implicit instruction condition and 10 participants from the explicit instruction condition) and a no-sequence strategy group (n = 16, of which 11 participants from the implicit instruction condition and 5 participants from the explicit instruction condition). Only participants of the sequence strategy group showed reliable perceptual sequence learning and sequence awareness. These results indicate that perceptual sequence learning depends upon the continuous employment of strategic cognitive control processes on sequence knowledge. Sequence awareness is suggested to be a necessary but not sufficient condition for perceptual learning to take place. Copyright © 2018 Elsevier B.V. All rights reserved.
RIKEN Integrated Sequence Analysis (RISA) System—384-Format Sequencing Pipeline with 384 Multicapillary Sequencer

PubMed Central

Shibata, Kazuhiro; Itoh, Masayoshi; Aizawa, Katsunori; Nagaoka, Sumiharu; Sasaki, Nobuya; Carninci, Piero; Konno, Hideaki; Akiyama, Junichi; Nishi, Katsuo; Kitsunai, Tokuji; Tashiro, Hideo; Itoh, Mari; Sumi, Noriko; Ishii, Yoshiyuki; Nakamura, Shin; Hazama, Makoto; Nishine, Tsutomu; Harada, Akira; Yamamoto, Rintaro; Matsumoto, Hiroyuki; Sakaguchi, Sumito; Ikegami, Takashi; Kashiwagi, Katsuya; Fujiwake, Syuji; Inoue, Kouji; Togawa, Yoshiyuki; Izawa, Masaki; Ohara, Eiji; Watahiki, Masanori; Yoneda, Yuko; Ishikawa, Tomokazu; Ozawa, Kaori; Tanaka, Takumi; Matsuura, Shuji; Kawai, Jun; Okazaki, Yasushi; Muramatsu, Masami; Inoue, Yorinao; Kira, Akira; Hayashizaki, Yoshihide

2000-01-01

The RIKEN high-throughput 384-format sequencing pipeline (RISA system) including a 384-multicapillary sequencer (the so-called RISA sequencer) was developed for the RIKEN mouse encyclopedia project. The RISA system consists of colony picking, template preparation, sequencing reaction, and the sequencing process. A novel high-throughput 384-format capillary sequencer system (RISA sequencer system) was developed for the sequencing process. This system consists of a 384-multicapillary auto sequencer (RISA sequencer), a 384-multicapillary array assembler (CAS), and a 384-multicapillary casting device. The RISA sequencer can simultaneously analyze 384 independent sequencing products. The optical system is a scanning system chosen after careful comparison with an image detection system for the simultaneous detection of the 384-capillary array. This scanning system can be used with any fluorescent-labeled sequencing reaction (chain termination reaction), including transcriptional sequencing based on RNA polymerase, which was originally developed by us, and cycle sequencing based on thermostable DNA polymerase. For long-read sequencing, 380 out of 384 sequences (99.2%) were successfully analyzed and the average read length, with more than 99% accuracy, was 654.4 bp. A single RISA sequencer can analyze 216 kb with >99% accuracy in 2.7 h (90 kb/h). For short-read sequencing to cluster the 3′ end and 5′ end sequencing by reading 350 bp, 384 samples can be analyzed in 1.5 h. We have also developed a RISA inoculator, RISA filtrator and densitometer, RISA plasmid preparator which can handle throughput of 40,000 samples in 17.5 h, and a high-throughput RISA thermal cycler which has four 384-well sites. The combination of these technologies allowed us to construct the RISA system consisting of 16 RISA sequencers, which can process 50,000 DNA samples per day. 
One haploid genome shotgun sequence of a higher organism, such as human, mouse, rat, domestic animals, and plants, can be revealed by seven RISA systems within one month. PMID:11076861
Synchronized excitability in a network enables generation of internal neuronal sequences

PubMed Central

Wang, Yingxue; Roth, Zachary; Pastalkova, Eva

2016-01-01

Hippocampal place field sequences are supported by sensory cues and network internal mechanisms. In contrast, sharp-wave (SPW) sequences, theta sequences, and episode field sequences are internally generated. The relationship of these sequences to memory is unclear. SPW sequences have been shown to support learning and have been assumed to also support episodic memory. Conversely, we demonstrate these SPW sequences were present in trained rats even after episodic memory was impaired and after other internal sequences – episode field and theta sequences – were eliminated. SPW sequences did not support memory despite continuing to ‘replay’ all task-related sequences – place-field and episode field sequences. Sequence replay occurred selectively during synchronous increases of population excitability -- SPWs. Similarly, theta sequences depended on the presence of repeated synchronized waves of excitability – theta oscillations. Thus, we suggest that either intermittent or rhythmic synchronized changes of excitability trigger sequential firing of neurons, which in turn supports learning and/or memory. DOI: http://dx.doi.org/10.7554/eLife.20697.001 PMID:27677848
Method for identifying and quantifying nucleic acid sequence aberrations

DOEpatents

Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

1998-01-01

A method for detecting nucleic acid sequence aberrations by detecting nucleic acid sequences having both a first and a second nucleic acid sequence type, the presence of the first and second sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. The method uses a first hybridization probe which includes a nucleic acid sequence that is complementary to a first sequence type and a first complexing agent capable of attaching to a second complexing agent and a second hybridization probe which includes a nucleic acid sequence that selectively hybridizes to the second nucleic acid sequence type over the first sequence type and includes a detectable marker for detecting the second hybridization probe.
Method for identifying and quantifying nucleic acid sequence aberrations

DOEpatents

Lucas, J.N.; Straume, T.; Bogen, K.T.

1998-07-21

A method is disclosed for detecting nucleic acid sequence aberrations by detecting nucleic acid sequences having both a first and a second nucleic acid sequence type, the presence of the first and second sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. The method uses a first hybridization probe which includes a nucleic acid sequence that is complementary to a first sequence type and a first complexing agent capable of attaching to a second complexing agent and a second hybridization probe which includes a nucleic acid sequence that selectively hybridizes to the second nucleic acid sequence type over the first sequence type and includes a detectable marker for detecting the second hybridization probe. 11 figs.

Sequencing artifacts in the type A influenza databases and attempts to correct them.

PubMed

Suarez, David L; Chester, Nikki; Hatfield, Jason

2014-07-01

There are over 276 000 influenza gene sequences in public databases, with the quality of the sequences determined by the contributor. As part of a high school class project, influenza sequences with possible errors were identified in the public databases based on the size of the gene being longer than expected, with the hypothesis that these sequences would have an error. Students contacted sequence submitters alerting them of the possible sequence issue(s) and requested that the suspect sequence(s) be corrected as appropriate. Type A influenza viruses were screened, and gene segments longer than the accepted size were identified for further analysis. Attention was placed on sequences with additional nucleotides upstream or downstream of the highly conserved non-coding ends of the viral segments. A total of 1081 sequences were identified that met this criterion. Three types of errors were commonly observed: non-influenza primer sequence was not removed from the sequence; PCR product was cloned and plasmid sequence was included in the sequence; and Taq polymerase added an adenine at the end of the PCR product. Internal insertions of nucleotide sequence were also commonly observed, but in many cases it was unclear if the sequence was correct or actually contained an error. A total of 215 sequences, or 22.8% of the suspect sequences, were corrected in the public databases in the first year of the student project. Unfortunately 138 additional sequences with possible errors were added to the databases in the second year. Additional awareness of the need for data integrity of sequences submitted to public databases is needed to fully reap the benefits of these large data sets. © 2014 The Authors. Influenza and Other Respiratory Viruses Published by John Wiley & Sons Ltd.
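The length-based screen described in this abstract could look roughly like the following Python sketch. It is an illustration under stated assumptions, not the project's actual tooling: `flag_overlong`, the record layout, and the expected lengths in the example are hypothetical placeholders (they are deliberately tiny toy values, not real influenza segment sizes).

```python
def flag_overlong(records, expected_length):
    """Flag sequences that are longer than the accepted size for their segment.

    records: iterable of (sequence_id, segment_name, sequence) tuples.
    expected_length: dict mapping segment_name -> accepted length in nucleotides
                     (supplied by the caller; hypothetical values in the demo).
    Returns a list of (sequence_id, excess_nucleotides) for suspect entries.
    """
    flagged = []
    for seq_id, segment, sequence in records:
        limit = expected_length.get(segment)
        if limit is None:
            continue  # no accepted size known for this segment, so skip it
        excess = len(sequence) - limit
        if excess > 0:
            flagged.append((seq_id, excess))
    return flagged


if __name__ == "__main__":
    # Toy data only: lengths are placeholders chosen to keep the example short.
    toy_expected = {"M": 10, "NS": 8}
    toy_records = [
        ("seq1", "M", "ACGTACGTAC"),     # exactly the accepted size
        ("seq2", "M", "ACGTACGTACA"),    # one extra nucleotide at the 3' end
        ("seq3", "NS", "ACGTACGTGGGG"),  # four extra nucleotides
    ]
    print(flag_overlong(toy_records, toy_expected))
```

A screen like this only nominates candidates; as the abstract notes, deciding whether extra nucleotides are primer, plasmid, or a genuine insertion still requires inspection of each sequence.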
First complete genome sequence of infectious laryngotracheitis virus

PubMed Central

2011-01-01

Background: Infectious laryngotracheitis virus (ILTV) is an alphaherpesvirus that causes acute respiratory disease in chickens worldwide. To date, only one complete genomic sequence of ILTV has been reported. This sequence was generated by concatenating partial sequences from six different ILTV strains. Thus, the full genomic sequence of a single (individual) strain of ILTV has not been determined previously. This study aimed to use high throughput sequencing technology to determine the complete genomic sequence of a live attenuated vaccine strain of ILTV. Results: The complete genomic sequence of the Serva vaccine strain of ILTV was determined, annotated and compared to the concatenated ILTV reference sequence. The genome size of the Serva strain was 152,628 bp, with a G + C content of 48%. A total of 80 predicted open reading frames were identified. The Serva strain had 96.5% DNA sequence identity with the concatenated ILTV sequence. Notably, the concatenated ILTV sequence was found to lack four large regions of sequence, including 528 bp and 594 bp of sequence in the UL29 and UL36 genes, respectively, and two copies of a 1,563 bp sequence in the repeat regions. Considerable differences in the size of the predicted translation products of 4 other genes (UL54, UL30, UL37 and UL38) were also identified. More than 530 single-nucleotide polymorphisms (SNPs) were identified. Most SNPs were located within three genomic regions, corresponding to sequence from the SA-2 ILTV vaccine strain in the concatenated ILTV sequence. Conclusions: This is the first complete genomic sequence of an individual ILTV strain. This sequence will facilitate future comparative genomic studies of ILTV by providing an appropriate reference sequence for the sequence analysis of other ILTV strains. PMID:21501528
Locating Sequence on FPC Maps and Selecting a Minimal Tiling Path

PubMed Central

Engler, Friedrich W.; Hatfield, James; Nelson, William; Soderlund, Carol A.

2003-01-01

This study discusses three software tools: the first two aid in integrating sequence with an FPC physical map, and the third automatically selects a minimal tiling path given genomic draft sequence and BAC end sequences. The first tool, FSD (FPC Simulated Digest), takes a sequenced clone and adds it back to the map based on a fingerprint generated by an in silico digest of the clone. This allows verification of sequenced clone positions and the integration of sequenced clones that were not originally part of the FPC map. The second tool, BSS (Blast Some Sequence), takes a query sequence and positions it on the map based on sequence associated with the clones in the map. BSS has multiple uses as follows: (1) When the query is a file of marker sequences, they can be added as electronic markers. (2) When the query is draft sequence, the results of BSS can be used to close gaps in a sequenced clone or the physical map. (3) When the query is a sequenced clone and the target is BAC end sequences, one may select the next clone for sequencing using both sequence comparison results and map location. (4) When the query is whole-genome draft sequence and the target is BAC end sequences, the results can be used to select many clones for a minimal tiling path at once. The third tool, pickMTP, automates the majority of this last usage of BSS. Results are presented using the rice FPC map, BAC end sequences, and whole-genome shotgun from Syngenta. PMID:12915486
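The idea of an in silico digest, which FSD uses to produce a fingerprint for a sequenced clone, can be illustrated with a short Python sketch. This is not FSD's code: the function name `in_silico_digest`, the choice of the HindIII recognition site (AAGCTT, cut after the first base), and the treatment of the clone as a linear sequence are all assumptions made here, since the abstract does not state which enzyme or conventions the tool uses.

```python
def in_silico_digest(sequence, site="AAGCTT", cut_offset=1):
    """Return the sorted fragment lengths from cutting a linear sequence.

    site: recognition sequence (assumed here: HindIII, AAGCTT).
    cut_offset: number of bases into the site where the cut falls
                (assumed here: 1, i.e. A^AGCTT).
    """
    sequence = sequence.upper()
    cut_positions = []
    pos = sequence.find(site)
    while pos != -1:
        cut_positions.append(pos + cut_offset)
        pos = sequence.find(site, pos + 1)  # continue after this match start
    # Fragment boundaries: start of sequence, every cut, end of sequence.
    bounds = [0] + cut_positions + [len(sequence)]
    return sorted(end - start for start, end in zip(bounds, bounds[1:]) if end > start)


if __name__ == "__main__":
    clone = "GGG" + "AAGCTT" + "CCCC" + "AAGCTT" + "TT"  # toy 21-bp "clone"
    print(in_silico_digest(clone))  # fragment sizes form the simulated fingerprint
```

The sorted list of fragment sizes plays the role of the fingerprint; placing the clone on a map would then mean comparing that list with the fingerprints of clones already in the map, a step this sketch does not attempt.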
Program for Editing Spacecraft Command Sequences

NASA Technical Reports Server (NTRS)

Gladden, Roy; Waggoner, Bruce; Kordon, Mark; Hashemi, Mahnaz; Hanks, David; Salcedo, Jose

2006-01-01

Sequence Translator, Editor, and Expander Resource (STEER) is a computer program that facilitates construction of sequences and blocks of sequences (hereafter denoted generally as sequence products) for commanding a spacecraft. STEER also provides mechanisms for translating among various sequence product types and quickly expanding activities of a given sequence in chronological order for review and analysis of the sequence. To date, construction of sequence products has generally been done by use of such clumsy mechanisms as text-editor programs, translating among sequence product types has been challenging, and expanding sequences to time-ordered lists has involved arduous processes of converting sequence products to "real" sequences and running them through Class-A software (defined, loosely, as flight and ground software critical to a spacecraft mission). Also, heretofore, generating sequence products in standard formats has been troublesome because precise formatting and syntax are required. STEER alleviates these issues by providing a graphical user interface containing intuitive fields in which the user can enter the necessary information. The STEER expansion function provides a "quick and dirty" means of seeing how a sequence and sequence block would expand into a chronological list, without the need to use Class-A software.
Multimodal sequence learning.

PubMed

Kemény, Ferenc; Meier, Beat

2016-02-01

While sequence learning research models complex phenomena, previous studies have mostly focused on unimodal sequences. The goal of the current experiment is to put implicit sequence learning into a multimodal context: to test whether it can operate across different modalities. We used the Task Sequence Learning paradigm to test whether sequence learning varies across modalities, and whether participants are able to learn multimodal sequences. Our results show that implicit sequence learning is very similar regardless of the source modality. However, the presence of correlated task and response sequences was required for learning to take place. The experiment provides new evidence for implicit sequence learning of abstract conceptual representations. In general, the results suggest that correlated sequences are necessary for implicit sequence learning to occur. Moreover, they show that elements from different modalities can be automatically integrated into one unitary multimodal sequence. Copyright © 2015 Elsevier B.V. All rights reserved.
A trace display and editing program for data from fluorescence based sequencing machines.

PubMed

Gleeson, T; Hillier, L

1991-12-11

'Ted' (Trace editor) is a graphical editor for sequence and trace data from automated fluorescence sequencing machines. It provides facilities for viewing sequence and trace data (in top or bottom strand orientation), for editing the base sequence, for automated or manual trimming of the head (vector) and tail (uncertain data) from the sequence, for vertical and horizontal trace scaling, for keeping a history of sequence editing, and for output of the edited sequence. Ted has been used extensively in the C.elegans genome sequencing project, both as a stand-alone program and integrated into the Staden sequence assembly package, and has greatly aided in the efficiency and accuracy of sequence editing. It runs in the X windows environment on Sun workstations and is available from the authors. Ted currently supports sequence and trace data from the ABI 373A and Pharmacia A.L.F. sequencers.
Sequences show rapid motor transfer and spatial translation in the oculomotor system.

PubMed

Stainer, Matthew J; Carpenter, R H S; Brotchie, Peter; Anderson, Andrew J

2016-07-01

Every day we perform learnt sequences of actions that seem to happen almost without awareness. It has been argued that for learning such sequences parallel learning networks exist - one using spatial coordinates and one using motor coordinates - with sequence acquisition involving a progressive shift from the former to the latter as a sequence is rehearsed. When sequences are interrupted by an out-of-sequence target, there is a delay in the response to the target, and so here we transiently interrupt oculomotor sequences to probe the influence of oculomotor rehearsal and spatial coordinates in sequence acquisition. For our main experiments, we used a repeating sequence eight targets in length that was first learnt either using saccadic eye movements (left/right), manual responses (left/right or up/down) or as a sequence of colour (blue/red) requiring no motor response. The sequence was immediately repeated for saccadic eye movements, during which the influence of an out-of-sequence target (an interruption) was assessed. When a sequence is learnt beforehand in an abstract way (for example, as a sequence of colours or of orthogonally mapped manual responses), interruptions are immediately disruptive to latency, suggesting neither motor rehearsal nor specific spatial coordinates are essential for encoding sequences of actions and that sequences - no matter how they are encoded - can be rapidly translated into oculomotor coordinates. The magnitude of a disruption does, however, correspond to how well a sequence is learnt: introducing an interruption to an extended sequence before it was reliably learnt reduces the magnitude of the latency disruption. Copyright © 2016 Elsevier Ltd. All rights reserved.
Subgrouping Automata: automatic sequence subgrouping using phylogenetic tree-based optimum subgrouping algorithm.

PubMed

Seo, Joo-Hyun; Park, Jihyang; Kim, Eun-Mi; Kim, Juhan; Joo, Keehyoung; Lee, Jooyoung; Kim, Byung-Gee

2014-02-01

Sequence subgrouping for a given sequence set can enable various informative tasks such as the functional discrimination of sequence subsets and the functional inference of unknown sequences. Because an identity threshold for sequence subgrouping may vary according to the given sequence set, it is highly desirable to construct a robust subgrouping algorithm which automatically identifies an optimal identity threshold and generates subgroups for a given sequence set. To this end, an automatic sequence subgrouping method, named 'Subgrouping Automata', was constructed. Firstly, a tree analysis module analyzes the structure of the tree and calculates all possible subgroups in each node. A sequence similarity analysis module calculates the average sequence similarity for all subgroups in each node. A representative sequence generation module finds a representative sequence using profile analysis and self-scoring for each subgroup. For all nodes, average sequence similarities are calculated and 'Subgrouping Automata' searches for a node showing the statistically maximum sequence similarity increase using Student's t-value. A node showing the maximum t-value, which gives the most significant difference in average sequence similarity between two adjacent nodes, is determined as the optimum subgrouping node in the phylogenetic tree. Further analysis showed that the optimum subgrouping node from Subgrouping Automata (SA) prevents under-subgrouping and over-subgrouping. Copyright © 2013. Published by Elsevier Ltd.
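The node-selection step described above can be sketched as follows. This is an illustration under stated assumptions, not the published algorithm: the abstract does not give the exact test statistic, so Welch's form of the t-value is used here, and the function names and the per-node similarity values are hypothetical.

```python
from math import sqrt
from statistics import mean, variance


def welch_t(a, b):
    """t-value for the increase in mean similarity from sample a to sample b
    (Welch's unequal-variance form; an assumption, see lead-in)."""
    return (mean(b) - mean(a)) / sqrt(variance(a) / len(a) + variance(b) / len(b))


def optimum_node(similarities_by_node):
    """Return (index of the node with the largest t-value relative to the
    preceding node, list of all t-values between adjacent nodes)."""
    t_values = [
        welch_t(similarities_by_node[i], similarities_by_node[i + 1])
        for i in range(len(similarities_by_node) - 1)
    ]
    best = max(range(len(t_values)), key=t_values.__getitem__)
    return best + 1, t_values


# Hypothetical average subgroup similarities at four successive tree nodes.
similarities_by_node = [
    [0.30, 0.32, 0.31, 0.29],
    [0.33, 0.35, 0.34, 0.32],
    [0.70, 0.72, 0.71, 0.69],
    [0.73, 0.75, 0.74, 0.72],
]
best_node, t_values = optimum_node(similarities_by_node)
print(best_node, [round(t, 2) for t in t_values])
```

In this toy data the similarity jumps sharply between the second and third nodes, so the third node (index 2) is selected.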
Large-Scale Concatenation cDNA Sequencing

PubMed Central

Yu, Wei; Andersson, Björn; Worley, Kim C.; Muzny, Donna M.; Ding, Yan; Liu, Wen; Ricafrente, Jennifer Y.; Wentland, Meredith A.; Lennon, Greg; Gibbs, Richard A.

1997-01-01

A total of 100 kb of DNA derived from 69 individual human brain cDNA clones of 0.7–2.0 kb were sequenced by concatenated cDNA sequencing (CCS), whereby multiple individual DNA fragments are sequenced simultaneously in a single shotgun library. The method yielded accurate sequences and a similar efficiency compared with other shotgun libraries constructed from single DNA fragments (>20 kb). Computer analyses were carried out on 65 cDNA clone sequences and their corresponding end sequences to examine both nucleic acid and amino acid sequence similarities in the databases. Thirty-seven clones revealed no DNA database matches, 12 clones generated exact matches (≥98% identity), and 16 clones generated nonexact matches (57%–97% identity) to either known human or other species genes. Of those 28 matched clones, 8 had corresponding end sequences that failed to identify similarities. In a protein similarity search, 27 clone sequences displayed significant matches, whereas only 20 of the end sequences had matches to known protein sequences. Our data indicate that full-length cDNA insert sequences provide significantly more nucleic acid and protein sequence similarity matches than expressed sequence tags (ESTs) for database searching. [All 65 cDNA clone sequences described in this paper have been submitted to the GenBank data library under accession nos. U79240–U79304.] PMID:9110174
Finding functional features in Saccharomyces genomes by phylogenetic footprinting.

PubMed

Cliften, Paul; Sudarsanam, Priya; Desikan, Ashwin; Fulton, Lucinda; Fulton, Bob; Majors, John; Waterston, Robert; Cohen, Barak A; Johnston, Mark

2003-07-04

The sifting and winnowing of DNA sequence that occur during evolution cause nonfunctional sequences to diverge, leaving phylogenetic footprints of functional sequence elements in comparisons of genome sequences. We searched for such footprints among the genome sequences of six Saccharomyces species and identified potentially functional sequences. Comparison of these sequences allowed us to revise the catalog of yeast genes and identify sequence motifs that may be targets of transcriptional regulatory proteins. Some of these conserved sequence motifs reside upstream of genes with similar functional annotations or similar expression patterns or those bound by the same transcription factor and are thus good candidates for functional regulatory sequences.
Homogeneity of the 16S rDNA sequence among geographically disparate isolates of Taylorella equigenitalis

PubMed Central

Matsuda, M; Tazumi, A; Kagawa, S; Sekizuka, T; Murayama, O; Moore, JE; Millar, BC

2006-01-01

Background At present, six accessible sequences of 16S rDNA from Taylorella equigenitalis (T. equigenitalis) are available, whose sequence differences occur at a few nucleotide positions. Thus it is important to determine these sequences from additional strains in other countries, if possible, in order to clarify any anomalies regarding 16S rDNA sequence heterogeneity. Here, we clone and sequence the approximate full-length 16S rDNA from additional strains of T. equigenitalis isolated in Japan, Australia and France and compare these sequences to the existing published sequences. Results Clarification of any anomalies regarding 16S rDNA sequence heterogeneity of T. equigenitalis was carried out. When the approximate full-length 16S rDNA from 17 strains of T. equigenitalis isolated in Japan, Australia and France was cloned, sequenced and compared, nucleotide sequence differences were demonstrated at six loci in the 1,469-nucleotide sequence. Moreover, 12 polymorphic sites occurred among 23 sequences of the 16S rDNA, including the six reference sequences. Conclusion High sequence similarity (99.5% or more) was observed throughout, except from nucleotide positions 138 to 501 where substitutions and deletions were noted. PMID:16398935
Use of the MinION nanopore sequencer for rapid sequencing of avian influenza virus isolates

USDA-ARS's Scientific Manuscript database

A relatively new sequencing technology, the MinION nanopore sequencer, provides a platform that is smaller, faster, and cheaper than existing Next Generation Sequence (NGS) technologies. The MinION sequences individual strands of DNA and can produce millions of sequencing reads. The cost of the s...
Feedback shift register sequences versus uniformly distributed random sequences for correlation chromatography

NASA Technical Reports Server (NTRS)

Kaljurand, M.; Valentin, J. R.; Shao, M.

1996-01-01

Two alternative input sequences are commonly employed in correlation chromatography (CC). They are sequences derived according to the algorithm of the feedback shift register (i.e., pseudo random binary sequences (PRBS)) and sequences derived by using the uniform random binary sequences (URBS). These two sequences are compared. By applying the "cleaning" data processing technique to the correlograms that result from these sequences, we show that when the PRBS is used the S/N of the correlogram is much higher than the one resulting from using URBS.
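The difference between the two input sequences can be made concrete with a small sketch. The register length and feedback taps below are assumptions (the abstract specifies neither): a 4-bit register with the recurrence a[n+4] = a[n] XOR a[n+1], which yields a maximal-length sequence of period 15. The comparison sequence is drawn from Python's `random` module with an arbitrary seed.

```python
import random


def prbs_lfsr(taps, nbits, seed=1):
    """One full period (2**nbits - 1 bits) of a Fibonacci feedback shift register.
    `taps` are the bit positions XORed together to form the feedback bit."""
    state = seed
    out = []
    for _ in range(2 ** nbits - 1):
        out.append(state & 1)
        feedback = 0
        for t in taps:
            feedback ^= (state >> t) & 1
        state = (state >> 1) | (feedback << (nbits - 1))
    return out


def circular_autocorrelation(bits):
    """Circular autocorrelation of the sequence mapped to +1/-1."""
    x = [1 if b else -1 for b in bits]
    n = len(x)
    return [sum(x[i] * x[(i + k) % n] for i in range(n)) for k in range(n)]


prbs = prbs_lfsr(taps=(0, 1), nbits=4)
prbs_corr = circular_autocorrelation(prbs)

rng = random.Random(0)  # arbitrary seed, for repeatability only
urbs = [rng.randint(0, 1) for _ in range(len(prbs))]
urbs_corr = circular_autocorrelation(urbs)

print("PRBS off-peak autocorrelation:", prbs_corr[1:])
print("URBS off-peak autocorrelation:", urbs_corr[1:])
```

For a maximal-length shift register sequence every off-peak autocorrelation value equals -1, whereas the values for a uniformly random sequence fluctuate from shift to shift. That flat off-peak behaviour is one plausible reason for the cleaner correlograms reported for PRBS, although the abstract itself does not state the mechanism.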
Process of labeling specific chromosomes using recombinant repetitive DNA

DOEpatents

Moyzis, R.K.; Meyne, J.

1988-02-12

Chromosome preferential nucleotide sequences are first determined from a library of recombinant DNA clones having families of repetitive sequences. Library clones are identified with a low homology with a sequence of repetitive DNA families to which the first clones respectively belong and variant sequences are then identified by selecting clones having a pattern of hybridization with genomic DNA dissimilar to the hybridization pattern shown by the respective families. In another embodiment, variant sequences are selected from a sequence of a known repetitive DNA family. The selected variant sequence is classified as chromosome specific, chromosome preferential, or chromosome nonspecific. Sequences which are classified as chromosome preferential are further sequenced and regions are identified having a low homology with other regions of the chromosome preferential sequence or with known sequences of other family members and consensus sequences of the repetitive DNA families for the chromosome preferential sequences. The selected low homology regions are then hybridized with chromosomes to determine those low homology regions hybridized with a specific chromosome under normal stringency conditions.
MRO Sequence Checking Tool

NASA Technical Reports Server (NTRS)

Fisher, Forest; Gladden, Roy; Khanampornpan, Teerapat

2008-01-01

The MRO Sequence Checking Tool program, mro_check, automates significant portions of the MRO (Mars Reconnaissance Orbiter) sequence checking procedure. Though MRO has similar checks to the ODY's (Mars Odyssey) Mega Check tool, the checks needed for MRO are unique to the MRO spacecraft. The MRO sequence checking tool automates the majority of the sequence validation procedure and check lists that are used to validate the sequences generated by MRO MPST (mission planning and sequencing team). The tool performs more than 50 different checks on the sequence. The automation varies from summarizing data about the sequence needed for visual verification of the sequence, to performing automated checks on the sequence and providing a report for each step. To allow for the addition of new checks as needed, this tool is built in a modular fashion.
Comparison of next generation sequencing technologies for transcriptome characterization

PubMed Central

2009-01-01

Background We have developed a simulation approach to help determine the optimal mixture of sequencing methods for the most complete and cost-effective transcriptome sequencing. We compared simulation results for traditional capillary sequencing with "Next Generation" (NG) ultra high-throughput technologies. The simulation model was parameterized using mappings of 130,000 cDNA sequence reads to the Arabidopsis genome (NCBI Accession SRA008180.19). We also generated 454-GS20 sequences and de novo assemblies for the basal eudicot California poppy (Eschscholzia californica) and the magnoliid avocado (Persea americana) using a variety of methods for cDNA synthesis. Results The Arabidopsis reads tagged more than 15,000 genes, including new splice variants and extended UTR regions. Of the total 134,791 reads (13.8 MB), 119,518 (88.7%) mapped exactly to known exons, while 1,117 (0.8%) mapped to introns, 11,524 (8.6%) spanned annotated intron/exon boundaries, and 3,066 (2.3%) extended beyond the end of annotated UTRs. Sequence-based inference of relative gene expression levels correlated significantly with microarray data. As expected, NG sequencing of normalized libraries tagged more genes than non-normalized libraries, although non-normalized libraries yielded more full-length cDNA sequences. The Arabidopsis data were used to simulate additional rounds of NG and traditional EST sequencing, and various combinations of each. Our simulations suggest a combination of FLX and Solexa sequencing for optimal transcriptome coverage at modest cost. We have also developed ESTcalc (http://fgp.huck.psu.edu/NG_Sims/ngsim.pl), an online webtool that allows users to explore the results of this study by specifying individualized costs and sequencing characteristics. Conclusion NG sequencing technologies are a highly flexible set of platforms that can be scaled to suit different project goals. In terms of sequence coverage alone, NG sequencing is a dramatic advance over capillary-based sequencing, but NG sequencing also presents significant challenges in assembly and sequence accuracy due to short read lengths, method-specific sequencing errors, and the absence of physical clones. These problems may be overcome by hybrid sequencing strategies using a mixture of sequencing methodologies, by new assemblers, and by sequencing more deeply. Sequencing and microarray outcomes from multiple experiments suggest that our simulator will be useful for guiding NG transcriptome sequencing projects in a wide range of organisms. PMID:19646272
Piscine reovirus: Genomic and molecular phylogenetic analysis from farmed and wild salmonids collected on the Canada/US Pacific Coast

USGS Publications Warehouse

Siah, Ahmed; Morrison, Diane B.; Fringuelli, Elena; Savage, Paul S.; Richmond, Zina; Purcell, Maureen K.; Johns, Robert; Johnson, Stewart C.; Saksida, Sonja M.

2015-01-01

Piscine reovirus (PRV) is a double stranded non-enveloped RNA virus detected in farmed and wild salmonids. This study examined the phylogenetic relationships among different PRV sequence types present in samples from salmonids in Western Canada and the US, including Alaska (US), British Columbia (Canada) and Washington State (US). Tissues testing positive for PRV were partially sequenced for segment S1, producing 71 sequences that grouped into 10 unique sequence types. Sequence analysis revealed no identifiable geographical or temporal variation among the sequence types. Identical sequence types were found in fish sampled in 2001, 2005 and 2014. In addition, PRV positive samples from fish derived from Alaska, British Columbia and Washington State share identical sequence types. Comparative analysis of the phylogenetic tree indicated that Canada/US Pacific Northwest sequences formed a subgroup with some Norwegian sequence types (group II), distinct from other Norwegian and Chilean sequences (groups I, III and IV). Representative PRV positive samples from farmed and wild fish in British Columbia and Washington State were subjected to genome sequencing using next generation sequencing methods. Individual analysis of each of the 10 partial segments indicated that the Canadian and US PRV sequence types clustered separately from available whole genome sequences of some Norwegian and Chilean sequences for all segments except the segment S4. In summary, PRV was genetically homogenous over a large geographic distance (Alaska to Washington State), and the sequence types were relatively stable over a 13 year period.
Piscine Reovirus: Genomic and Molecular Phylogenetic Analysis from Farmed and Wild Salmonids Collected on the Canada/US Pacific Coast

PubMed Central

Siah, Ahmed; Morrison, Diane B.; Fringuelli, Elena; Savage, Paul; Richmond, Zina; Johns, Robert; Purcell, Maureen K.; Johnson, Stewart C.; Saksida, Sonja M.

2015-01-01

Piscine reovirus (PRV) is a double stranded non-enveloped RNA virus detected in farmed and wild salmonids. This study examined the phylogenetic relationships among different PRV sequence types present in samples from salmonids in Western Canada and the US, including Alaska (US), British Columbia (Canada) and Washington State (US). Tissues testing positive for PRV were partially sequenced for segment S1, producing 71 sequences that grouped into 10 unique sequence types. Sequence analysis revealed no identifiable geographical or temporal variation among the sequence types. Identical sequence types were found in fish sampled in 2001, 2005 and 2014. In addition, PRV positive samples from fish derived from Alaska, British Columbia and Washington State share identical sequence types. Comparative analysis of the phylogenetic tree indicated that Canada/US Pacific Northwest sequences formed a subgroup with some Norwegian sequence types (group II), distinct from other Norwegian and Chilean sequences (groups I, III and IV). Representative PRV positive samples from farmed and wild fish in British Columbia and Washington State were subjected to genome sequencing using next generation sequencing methods. Individual analysis of each of the 10 partial segments indicated that the Canadian and US PRV sequence types clustered separately from available whole genome sequences of some Norwegian and Chilean sequences for all segments except the segment S4. In summary, PRV was genetically homogenous over a large geographic distance (Alaska to Washington State), and the sequence types were relatively stable over a 13 year period. PMID:26536673
Operating characteristics of the implicit learning system supporting serial interception sequence learning.

PubMed

Sanchez, Daniel J; Reber, Paul J

2012-04-01

The memory system that supports implicit perceptual-motor sequence learning relies on brain regions that operate separately from the explicit, medial temporal lobe memory system. The implicit learning system therefore likely has distinct operating characteristics and information processing constraints. To attempt to identify the limits of the implicit sequence learning mechanism, participants performed the serial interception sequence learning (SISL) task with covertly embedded repeating sequences that were much longer than most previous studies: ranging from 30 to 60 (Experiment 1) and 60 to 90 (Experiment 2) items in length. Robust sequence-specific learning was observed for sequences up to 80 items in length, extending the known capacity of implicit sequence learning. In Experiment 3, 12-item repeating sequences were embedded among increasing amounts of irrelevant nonrepeating sequences (from 20 to 80% of training trials). Despite high levels of irrelevant trials, learning occurred across conditions. A comparison of learning rates across all three experiments found a surprising degree of constancy in the rate of learning regardless of sequence length or embedded noise. Sequence learning appears to be constant with the logarithm of the number of sequence repetitions practiced during training. The consistency in learning rate across experiments and conditions implies that the mechanisms supporting implicit sequence learning are not capacity-constrained by very long sequences nor adversely affected by high rates of irrelevant sequences during training.
[Study on ITS sequences of Aconitum vilmorinianum and its medicinal adulterant].

PubMed

Zhang, Xiao-nan; Du, Chun-hua; Fu, De-huan; Gao, Li; Zhou, Pei-jun; Wang, Li

2012-09-01

To analyze and compare the ITS sequences of Aconitum vilmorinianum and its medicinal adulterant Aconitum austroyunnanense. Total genomic DNA was extracted from sample materials by an improved CTAB method; ITS sequences of the samples were amplified by PCR, directly sequenced, and analyzed using the software DNAStar, ClustalX1.81 and MEGA 4.0. In the ITS1 sequences, 299 consistent sites, 19 variable sites and 13 informative sites were found; in the 5.8S sequences, 162 consistent sites, 2 variable sites and 1 informative site; and in the ITS2 sequences, 217 consistent sites, 3 variable sites and 1 informative site. Comparison of the ITS sequence data matrix showed that only the 5.8S sequences lacked base transitions and transversions; 2 transition sites and 1 transversion site were found in the ITS1 sequences, and only 1 transversion site was found in the ITS2 sequences. By analyzing the ITS sequence data matrix from 2 populations of Aconitum vilmorinianum and 3 populations of Aconitum austroyunnanense, we found a stable informative site at the 596th base in ITS2 sequences: in all the samples of Aconitum vilmorinianum the base was C, and in all the samples of Aconitum austroyunnanense the base was A. Aconitum vilmorinianum and Aconitum austroyunnanense can be identified by the characters of their ITS sequences, and the ITS1 sequences contain more variable sites than the ITS2 sequences.
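The site categories reported above can be illustrated with a short sketch that classifies the columns of an alignment. The five toy sequences and the function name are hypothetical, not the study's data. A site is counted as parsimony-informative when at least two different bases each occur in at least two sequences, and informative sites are treated as a subset of the variable sites, which is an assumption about how the abstract's counts were made.

```python
from collections import Counter


def classify_sites(alignment):
    """Count (conserved, variable, parsimony-informative) columns in an
    alignment of equal-length sequences. Informative sites are a subset
    of the variable sites. Gaps and ambiguity codes are ignored."""
    conserved = variable = informative = 0
    for column in zip(*alignment):
        counts = Counter(base for base in column if base in "ACGT")
        if len(counts) <= 1:
            conserved += 1
            continue
        variable += 1
        # Informative: two or more bases that each appear at least twice.
        if sum(1 for c in counts.values() if c >= 2) >= 2:
            informative += 1
    return conserved, variable, informative


# Hypothetical alignment: two samples of one species, three of another.
alignment = [
    "ACGTC",
    "ACGTC",
    "ACGAA",
    "ACTAA",
    "ACGAA",
]
result = classify_sites(alignment)
print(result)
```

In the toy alignment the last two columns separate the first two samples from the other three, which is the same kind of pattern as the stable C/A site reported for the two Aconitum species.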

Long-range barcode labeling-sequencing

DOE Office of Scientific and Technical Information (OSTI.GOV)

Chen, Feng; Zhang, Tao; Singh, Kanwar K.

Methods for sequencing single large DNA molecules by clonal multiple displacement amplification using barcoded primers. Sequences are binned based on barcode sequences and sequenced using a microdroplet-based method for sequencing large polynucleotide templates to enable assembly of haplotype-resolved complex genomes and metagenomes.
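The binning step can be sketched in a few lines. The layout assumed here, a fixed-length barcode at the start of each read, is an illustration only; the abstract does not describe the read structure, and the function name and reads are hypothetical.

```python
from collections import defaultdict


def bin_by_barcode(reads, barcode_length):
    """Group reads by their leading barcode and strip the barcode
    from each binned read. Reads too short to carry an insert are skipped."""
    bins = defaultdict(list)
    for read in reads:
        if len(read) <= barcode_length:
            continue
        bins[read[:barcode_length]].append(read[barcode_length:])
    return dict(bins)


# Hypothetical reads with 4-base barcodes.
reads = ["ACGTTTTTGGGG", "ACGTCCCCAAAA", "TTAGGGGGCCCC", "ACG"]
bins = bin_by_barcode(reads, 4)
print(bins)
```

Reads that share a barcode are then assumed to derive from the same original molecule, so each bin could be assembled separately.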
Coordinate cytokine regulatory sequences

DOEpatents

Frazer, Kelly A.; Rubin, Edward M.; Loots, Gabriela G.

2005-05-10

The present invention provides CNS sequences that regulate the cytokine gene expression, expression cassettes and vectors comprising or lacking the CNS sequences, host cells and non-human transgenic animals comprising the CNS sequences or lacking the CNS sequences. The present invention also provides methods for identifying compounds that modulate the functions of CNS sequences as well as methods for diagnosing defects in the CNS sequences of patients.
Advantages of genome sequencing by long-read sequencer using SMRT technology in medical area.

PubMed

Nakano, Kazuma; Shiroma, Akino; Shimoji, Makiko; Tamotsu, Hinako; Ashimine, Noriko; Ohki, Shun; Shinzato, Misuzu; Minami, Maiko; Nakanishi, Tetsuhiro; Teruya, Kuniko; Satou, Kazuhito; Hirano, Takashi

2017-07-01

PacBio RS II is the first commercialized third-generation DNA sequencer able to sequence a single molecule DNA in real-time without amplification. PacBio RS II's sequencing technology is novel and unique, enabling the direct observation of DNA synthesis by DNA polymerase. PacBio RS II confers four major advantages compared to other sequencing technologies: long read lengths, high consensus accuracy, a low degree of bias, and simultaneous capability of epigenetic characterization. These advantages surmount the obstacle of sequencing genomic regions such as high/low G+C, tandem repeat, and interspersed repeat regions. Moreover, PacBio RS II is ideal for whole genome sequencing, targeted sequencing, complex population analysis, RNA sequencing, and epigenetics characterization. With PacBio RS II, we have sequenced and analyzed the genomes of many species, from viruses to humans. Herein, we summarize and review some of our key genome sequencing projects, including full-length viral sequencing, complete bacterial genome and almost-complete plant genome assemblies, and long amplicon sequencing of a disease-associated gene region. We believe that PacBio RS II is not only an effective tool for use in the basic biological sciences but also in the medical/clinical setting.
Sequence verification of synthetic DNA by assembly of sequencing reads

PubMed Central

Wilson, Mandy L.; Cai, Yizhi; Hanlon, Regina; Taylor, Samantha; Chevreux, Bastien; Setubal, João C.; Tyler, Brett M.; Peccoud, Jean

2013-01-01

Gene synthesis attempts to assemble user-defined DNA sequences with base-level precision. Verifying the sequences of construction intermediates and the final product of a gene synthesis project is a critical part of the workflow, yet one that has received the least attention. Sequence validation is equally important for other kinds of curated clone collections. Ensuring that the physical sequence of a clone matches its published sequence is a common quality control step performed at least once over the course of a research project. GenoREAD is a web-based application that breaks the sequence verification process into two steps: the assembly of sequencing reads and the alignment of the resulting contig with a reference sequence. GenoREAD can determine if a clone matches its reference sequence. Its sophisticated reporting features help identify and troubleshoot problems that arise during the sequence verification process. GenoREAD has been experimentally validated on thousands of gene-sized constructs from an ORFeome project, and on longer sequences including whole plasmids and synthetic chromosomes. Comparing GenoREAD results with those from manual analysis of the sequencing data demonstrates that GenoREAD tends to be conservative in its diagnostic. GenoREAD is available at www.genoread.org. PMID:23042248
The Genome Sequencer FLX System--longer reads, more applications, straight forward bioinformatics and more complete data sets.

PubMed

Droege, Marcus; Hill, Brendon

2008-08-31

The Genome Sequencer FLX System (GS FLX), powered by 454 Sequencing, is a next-generation DNA sequencing technology featuring a unique mix of long reads, exceptional accuracy, and ultra-high throughput. It has been proven to be the most versatile of all currently available next-generation sequencing technologies, supporting many high-profile studies in over seven application categories. GS FLX users have pursued innovative research in de novo sequencing, re-sequencing of whole genomes and target DNA regions, metagenomics, and RNA analysis. 454 Sequencing is a powerful tool for human genetics research, having recently re-sequenced the genome of an individual human, currently re-sequencing the complete human exome and targeted genomic regions using the NimbleGen sequence capture process, and having detected low-frequency somatic mutations linked to cancer.
Method to amplify variable sequences without imposing primer sequences

DOEpatents

Bradbury, Andrew M.; Zeytun, Ahmet

2006-11-14

The present invention provides methods of amplifying target sequences without including regions flanking the target sequence in the amplified product or imposing amplification primer sequences on the amplified product. Also provided are methods of preparing a library from such amplified target sequences.
Sequence Complexity of Chromosome 3 in Caenorhabditis elegans

PubMed Central

Pierro, Gaetano

2012-01-01

The nucleotide sequence complexity of chromosome 3 of Caenorhabditis elegans (C. elegans) is studied. The complexity of these sequences is compared with that of some random sequences. Moreover, by using some parameters related to complexity, such as fractal dimension, frequency and the indicator matrix, a first classification of sequences of C. elegans is given. In particular, the sequences with the highest and lowest fractal values are singled out. It is shown that the intrinsic nature of the low fractal dimension sequences has many common features with the random sequences. PMID:22919380
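Two of the quantities named above can be sketched under a stated assumption: the abstract does not define the indicator matrix, so the common definition is used here, in which entry (i, j) is 1 when positions i and j carry the same nucleotide and 0 otherwise. The function names and the toy sequence are hypothetical.

```python
from collections import Counter


def indicator_matrix(seq):
    """Binary matrix whose (i, j) entry is 1 when seq[i] == seq[j]."""
    return [[1 if a == b else 0 for b in seq] for a in seq]


def nucleotide_frequencies(seq):
    """Relative frequency of each of the four nucleotides."""
    counts = Counter(seq)
    return {base: counts[base] / len(seq) for base in "ACGT"}


seq = "ACGTAC"
matrix = indicator_matrix(seq)
freqs = nucleotide_frequencies(seq)
# Fraction of matrix entries equal to 1; equals the sum of squared frequencies.
density = sum(map(sum, matrix)) / len(seq) ** 2

for row in matrix:
    print("".join(str(v) for v in row))
print(freqs, density)
```

A fractal dimension estimate would be computed from the pattern of ones in such a matrix; that step is omitted here because the abstract does not give the formula.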
Chromosome specific repetitive DNA sequences

DOEpatents

Moyzis, Robert K.; Meyne, Julianne

1991-01-01

A method is provided for determining specific nucleotide sequences useful in forming a probe which can identify specific chromosomes, preferably through in situ hybridization within the cell itself. In one embodiment, chromosome preferential nucleotide sequences are first determined from a library of recombinant DNA clones having families of repetitive sequences. Library clones are identified with a low homology with a sequence of repetitive DNA families to which the first clones respectively belong and variant sequences are then identified by selecting clones having a pattern of hybridization with genomic DNA dissimilar to the hybridization pattern shown by the respective families. In another embodiment, variant sequences are selected from a sequence of a known repetitive DNA family. The selected variant sequence is classified as chromosome specific, chromosome preferential, or chromosome nonspecific. Sequences which are classified as chromosome preferential are further sequenced and regions are identified having a low homology with other regions of the chromosome preferential sequence or with known sequences of other family me This invention is the result of a contract with the Department of Energy (Contract No. W-7405-ENG-36).
Children's discrimination of vowel sequences

NASA Astrophysics Data System (ADS)

Coady, Jeffry A.; Kluender, Keith R.; Evans, Julia

2003-10-01

Children's ability to discriminate sequences of steady-state vowels was investigated. Vowels (as in "beet," "bat," "bought," and "boot") were synthesized at durations of 40, 80, 160, 320, 640, and 1280 ms. Four different vowel sequences were created by concatenating different orders of vowels for each duration, separated by 10-ms intervening silence. Thus, sequences differed in vowel order and duration (rate). Sequences were 12 s in duration, with amplitude ramped linearly over the first and last 2 s. Sequence pairs included both same (identical sequences) and different trials (sequences with vowels in different orders). Sequences with vowels of equal duration were presented on individual trials. Children aged 7;0 to 10;6 listened to pairs of sequences (with 100 ms between sequences) and responded whether sequences sounded the same or different. Results indicate that children are best able to discriminate sequences of intermediate-duration vowels, typical of conversational speaking rate. Children were less accurate with both shorter and longer vowels. Results are discussed in terms of auditory processing (shortest vowels) and memory (longest vowels). [Research supported by NIDCD DC-05263, DC-04072, and DC-005650.]
Individual sequences in large sets of gene sequences may be distinguished efficiently by combinations of shared sub-sequences

PubMed Central

Gibbs, Mark J; Armstrong, John S; Gibbs, Adrian J

2005-01-01

Background Most current DNA diagnostic tests for identifying organisms use specific oligonucleotide probes that are complementary in sequence to, and hence only hybridise with the DNA of one target species. By contrast, in traditional taxonomy, specimens are usually identified by 'dichotomous keys' that use combinations of characters shared by different members of the target set. Using one specific character for each target is the least efficient strategy for identification. Using combinations of shared bisectionally-distributed characters is much more efficient, and this strategy is most efficient when they separate the targets in a progressively binary way. Results We have developed a practical method for finding minimal sets of sub-sequences that identify individual sequences, and could be targeted by combinations of probes, so that the efficient strategy of traditional taxonomic identification could be used in DNA diagnosis. The sizes of minimal sub-sequence sets depended mostly on sequence diversity and sub-sequence length and interactions between these parameters. We found that 201 distinct cytochrome oxidase subunit-1 (CO1) genes from moths (Lepidoptera) were distinguished using only 15 sub-sequences 20 nucleotides long, whereas only 8–10 sub-sequences 6–10 nucleotides long were required to distinguish the CO1 genes of 92 species from the 9 largest orders of insects. Conclusion The presence/absence of sub-sequences in a set of gene sequences can be used like the questions in a traditional dichotomous taxonomic key; hybridisation probes complementary to such sub-sequences should provide a very efficient means for identifying individual species, subtypes or genotypes. Sequence diversity and sub-sequence length are the major factors that determine the numbers of distinguishing sub-sequences in any set of sequences. PMID:15817134
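The idea of identifying sequences by the presence or absence of shared sub-sequences can be illustrated with a small sketch. The abstract does not say how the authors search for minimal sets, so the greedy selection below is a swapped-in heuristic, and the function names and the toy sequences are hypothetical. A greedy choice is not guaranteed to find the smallest possible set.

```python
from itertools import combinations


def kmers(seq, k):
    """Return the set of all sub-sequences of length k in seq."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def greedy_distinguishing_kmers(seqs, k):
    """Greedily pick k-mers until every pair of sequences differs in the
    presence/absence of at least one chosen k-mer (illustrative heuristic)."""
    presence = {}
    for idx, seq in enumerate(seqs):
        for km in kmers(seq, k):
            presence.setdefault(km, set()).add(idx)

    unresolved = set(combinations(range(len(seqs)), 2))
    chosen = []
    while unresolved:
        best, best_split = None, set()
        for km in sorted(presence):  # sorted for a deterministic result
            members = presence[km]
            split = {(a, b) for (a, b) in unresolved
                     if (a in members) != (b in members)}
            if len(split) > len(best_split):
                best, best_split = km, split
        if best is None:
            raise ValueError("sequences cannot be distinguished at this k")
        chosen.append(best)
        unresolved -= best_split
    return chosen


def signature(seq, chosen):
    """Presence/absence pattern of the chosen k-mers, used like a key."""
    return tuple(km in seq for km in chosen)


# Toy data (hypothetical): four sequences separated by two shared 3-mers.
toy = ["ACGTAC", "ACGTTT", "GGGTAC", "GGGTTT"]
probes = greedy_distinguishing_kmers(toy, 3)
signatures = [signature(s, probes) for s in toy]
```

Each chosen sub-sequence here is shared by half of the toy sequences, which is the progressively binary separation the abstract describes as most efficient.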
Comparison of an In Vitro Diagnostic Next-Generation Sequencing Assay with Sanger Sequencing for HIV-1 Genotypic Resistance Testing.

PubMed

Tzou, Philip L; Ariyaratne, Pramila; Varghese, Vici; Lee, Charlie; Rakhmanaliev, Elian; Villy, Carolin; Yee, Meiqi; Tan, Kevin; Michel, Gerd; Pinsky, Benjamin A; Shafer, Robert W

2018-06-01

The ability of next-generation sequencing (NGS) technologies to detect low frequency HIV-1 drug resistance mutations (DRMs) not detected by dideoxynucleotide Sanger sequencing has potential advantages for improved patient outcomes. We compared the performance of an in vitro diagnostic (IVD) NGS assay, the Sentosa SQ HIV genotyping assay for HIV-1 genotypic resistance testing, with Sanger sequencing on 138 protease/reverse transcriptase (RT) and 39 integrase sequences. The NGS assay used a 5% threshold for reporting low-frequency variants. The level of complete plus partial nucleotide sequence concordance between Sanger sequencing and NGS was 99.9%. Among the 138 protease/RT sequences, a mean of 6.4 DRMs was identified by both Sanger and NGS, a mean of 0.5 DRM was detected by NGS alone, and a mean of 0.1 DRM was detected by Sanger sequencing alone. Among the 39 integrase sequences, a mean of 1.6 DRMs was detected by both Sanger sequencing and NGS and a mean of 0.15 DRM was detected by NGS alone. Compared with Sanger sequencing, NGS estimated higher levels of resistance to one or more antiretroviral drugs for 18.2% of protease/RT sequences and 5.1% of integrase sequences. There was little evidence for technical artifacts in the NGS sequences, but the G-to-A hypermutation was detected in three samples. In conclusion, the IVD NGS assay evaluated in this study was highly concordant with Sanger sequencing. At the 5% threshold for reporting minority variants, NGS appeared to attain a modestly increased sensitivity for detecting low-frequency DRMs without compromising sequence accuracy. Copyright © 2018 American Society for Microbiology.
Application of Quaternion in improving the quality of global sequence alignment scores for an ambiguous sequence target in Streptococcus pneumoniae DNA

NASA Astrophysics Data System (ADS)

Lestari, D.; Bustamam, A.; Novianti, T.; Ardaneswari, G.

2017-07-01

A DNA sequence can be defined as a succession of letters representing the order of nucleotides within DNA, written with the four base codes adenine (A), guanine (G), cytosine (C), and thymine (T). The precise sequence is determined using DNA sequencing methods and technologies, which have been developed since the 1970s and have since become advanced, high-throughput technologies. DNA sequencing has greatly accelerated biological and medical research and discovery. However, in some cases DNA sequencing produces ambiguous results, in which it is difficult to determine whether a given code is A, T, G, or C. To address this problem, in this study we introduce another representation of DNA codes, namely the Quaternion Q = (PA, PT, PG, PC), where PA, PT, PG, PC are the probabilities that the bases A, T, G, C appear in Q, and PA + PT + PG + PC = 1. Using Quaternion representations, we construct an improved scoring matrix for global sequence alignment by applying a dot product method. This scoring matrix produces better-quality match and mismatch scores between two DNA base codes. In the implementation, we applied the Needleman-Wunsch global sequence alignment algorithm in Octave to analyze our target sequence, which contains some ambiguous sequence data. The subject sequences are DNA sequences of Streptococcus pneumoniae families obtained from GenBank, while the target DNA sequence was received from our collaborator's database. We found that the Quaternion representations improve the quality of the sequence alignment score, and we conclude that the target DNA sequence has maximum similarity with Streptococcus pneumoniae.
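A minimal sketch of the probability-vector representation and a dot-product score inside Needleman-Wunsch is given below. It is written in Python, not the Octave used in the study. The uniform probabilities assigned to ambiguity codes, the way the dot product is turned into a match/mismatch score, and the linear gap penalty are all assumptions made for illustration, not the authors' scoring matrix.

```python
# Probability vectors in the order (PA, PT, PG, PC); each sums to 1.
# Assumption: an ambiguity code spreads probability uniformly over its bases.
BASE_VECTORS = {
    "A": (1.0, 0.0, 0.0, 0.0),
    "T": (0.0, 1.0, 0.0, 0.0),
    "G": (0.0, 0.0, 1.0, 0.0),
    "C": (0.0, 0.0, 0.0, 1.0),
    "R": (0.5, 0.0, 0.5, 0.0),      # A or G
    "Y": (0.0, 0.5, 0.0, 0.5),      # T or C
    "N": (0.25, 0.25, 0.25, 0.25),  # any base
}


def pair_score(a, b, match=1.0, mismatch=-1.0):
    """Expected score of aligning codes a and b: the dot product is the
    probability that both codes denote the same base (assumed weighting)."""
    p_same = sum(x * y for x, y in zip(BASE_VECTORS[a], BASE_VECTORS[b]))
    return match * p_same + mismatch * (1.0 - p_same)


def needleman_wunsch_score(s, t, gap=-1.0):
    """Global alignment score with a linear gap penalty, row by row."""
    prev = [j * gap for j in range(len(t) + 1)]
    for i, a in enumerate(s, 1):
        cur = [i * gap]
        for j, b in enumerate(t, 1):
            cur.append(max(prev[j - 1] + pair_score(a, b),  # align a with b
                           prev[j] + gap,                   # gap in t
                           cur[j - 1] + gap))               # gap in s
        prev = cur
    return prev[-1]


exact = needleman_wunsch_score("ACGT", "ACGT")
ambiguous = needleman_wunsch_score("ACGT", "ACNT")
```

With this weighting an ambiguous code is penalized less than a definite mismatch and rewarded less than a definite match, which is the qualitative behavior the abstract attributes to the dot-product scoring.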
Sequence Bundles: a novel method for visualising, discovering and exploring sequence motifs

PubMed Central

2014-01-01

Background We introduce Sequence Bundles--a novel data visualisation method for representing multiple sequence alignments (MSAs). We identify and address key limitations of the existing bioinformatics data visualisation methods (i.e. the Sequence Logo) by enabling Sequence Bundles to give salient visual expression to sequence motifs and other data features, which would otherwise remain hidden. Methods For the development of Sequence Bundles we employed research-led information design methodologies. Sequences are encoded as uninterrupted, semi-opaque lines plotted on a 2-dimensional reconfigurable grid. Each line represents a single sequence. The thickness and opacity of the stack at each residue in each position indicates the level of conservation and the lines' curved paths expose patterns in correlation and functionality. Several MSAs can be visualised in a composite image. The Sequence Bundles method is designed to favour a tangible, continuous and intuitive display of information. Results We have developed a software demonstration application for generating a Sequence Bundles visualisation of MSAs provided for the BioVis 2013 redesign contest. A subsequent exploration of the visualised line patterns allowed for the discovery of a number of interesting features in the dataset. Reported features include the extreme conservation of sequences displaying a specific residue and bifurcations of the consensus sequence. Conclusions Sequence Bundles is a novel method for visualisation of MSAs and the discovery of sequence motifs. It can aid in generating new insight and hypothesis making. Sequence Bundles is well disposed for future implementation as an interactive visual analytics software, which can complement existing visualisation tools. PMID:25237395
Sequential addition of short DNA oligos in DNA-polymerase-based synthesis reactions

DOEpatents

Gardner, Shea N [San Leandro, CA]; Mariella, Jr., Raymond P.; Christian, Allen T [Tracy, CA]; Young, Jennifer A [Berkeley, CA]; Clague, David S [Livermore, CA]

2011-01-18

A method of fabricating a DNA molecule of user-defined sequence. The method comprises the steps of preselecting a multiplicity of DNA sequence segments that will comprise the DNA molecule of user-defined sequence, separating the DNA sequence segments temporally, and combining the multiplicity of DNA sequence segments with at least one polymerase enzyme wherein the multiplicity of DNA sequence segments join to produce the DNA molecule of user-defined sequence. Sequence segments may be of length n, where n is an even or odd integer. In one embodiment the length of desired hybridizing overlap is specified by the user and the sequences and the protocol for combining them are guided by computational (bioinformatics) predictions. In one embodiment sequence segments are combined from multiple reading frames to span the same region of a sequence, so that multiple desired hybridizations may occur with different overlap lengths. In one embodiment starting sequence fragments are of different lengths, n, n+1, n+2, etc.
Program Synthesizes UML Sequence Diagrams

NASA Technical Reports Server (NTRS)

Barry, Matthew R.; Osborne, Richard N.

2006-01-01

A computer program called "Rational Sequence" generates Unified Modeling Language (UML) sequence diagrams of a target Java program running on a Java virtual machine (JVM). Rational Sequence thereby performs a reverse engineering function that aids in the design documentation of the target Java program. Whereas the construction of sequence diagrams was previously a tedious manual process, Rational Sequence generates UML sequence diagrams automatically from the running Java code.
Probabilistic Motor Sequence Yields Greater Offline and Less Online Learning than Fixed Sequence

PubMed Central

Du, Yue; Prashad, Shikha; Schoenbrun, Ilana; Clark, Jane E.

2016-01-01

It is well acknowledged that motor sequences can be learned quickly through online learning. Subsequently, the initial acquisition of a motor sequence is boosted or consolidated by offline learning. However, little is known whether offline learning can drive the fast learning of motor sequences (i.e., initial sequence learning in the first training session). To examine offline learning in the fast learning stage, we asked four groups of young adults to perform the serial reaction time (SRT) task with either a fixed or probabilistic sequence and with or without preliminary knowledge (PK) of the presence of a sequence. The sequence and PK were manipulated to emphasize either procedural (probabilistic sequence; no preliminary knowledge (NPK)) or declarative (fixed sequence; with PK) memory that were found to either facilitate or inhibit offline learning. In the SRT task, there were six learning blocks with a 2 min break between each consecutive block. Throughout the session, stimuli followed the same fixed or probabilistic pattern except in Block 5, in which stimuli appeared in a random order. We found that PK facilitated the learning of a fixed sequence, but not a probabilistic sequence. In addition to overall learning measured by the mean reaction time (RT), we examined the progressive changes in RT within and between blocks (i.e., online and offline learning, respectively). It was found that the two groups who performed the fixed sequence, regardless of PK, showed greater online learning than the other two groups who performed the probabilistic sequence. The groups who performed the probabilistic sequence, regardless of PK, did not display online learning, as indicated by a decline in performance within the learning blocks. However, they did demonstrate remarkably greater offline improvement in RT, which suggests that they are learning the probabilistic sequence offline. 
These results suggest that in the SRT task, the fast acquisition of a motor sequence is driven by concurrent online and offline learning. In addition, as the acquisition of a probabilistic sequence requires greater procedural memory compared to the acquisition of a fixed sequence, our results suggest that offline learning is more likely to take place in a procedural sequence learning task. PMID:26973502
Probabilistic Motor Sequence Yields Greater Offline and Less Online Learning than Fixed Sequence.

PubMed

Du, Yue; Prashad, Shikha; Schoenbrun, Ilana; Clark, Jane E

2016-01-01

It is well acknowledged that motor sequences can be learned quickly through online learning. Subsequently, the initial acquisition of a motor sequence is boosted or consolidated by offline learning. However, little is known whether offline learning can drive the fast learning of motor sequences (i.e., initial sequence learning in the first training session). To examine offline learning in the fast learning stage, we asked four groups of young adults to perform the serial reaction time (SRT) task with either a fixed or probabilistic sequence and with or without preliminary knowledge (PK) of the presence of a sequence. The sequence and PK were manipulated to emphasize either procedural (probabilistic sequence; no preliminary knowledge (NPK)) or declarative (fixed sequence; with PK) memory that were found to either facilitate or inhibit offline learning. In the SRT task, there were six learning blocks with a 2 min break between each consecutive block. Throughout the session, stimuli followed the same fixed or probabilistic pattern except in Block 5, in which stimuli appeared in a random order. We found that PK facilitated the learning of a fixed sequence, but not a probabilistic sequence. In addition to overall learning measured by the mean reaction time (RT), we examined the progressive changes in RT within and between blocks (i.e., online and offline learning, respectively). It was found that the two groups who performed the fixed sequence, regardless of PK, showed greater online learning than the other two groups who performed the probabilistic sequence. The groups who performed the probabilistic sequence, regardless of PK, did not display online learning, as indicated by a decline in performance within the learning blocks. However, they did demonstrate remarkably greater offline improvement in RT, which suggests that they are learning the probabilistic sequence offline. 
These results suggest that in the SRT task, the fast acquisition of a motor sequence is driven by concurrent online and offline learning. In addition, as the acquisition of a probabilistic sequence requires greater procedural memory compared to the acquisition of a fixed sequence, our results suggest that offline learning is more likely to take place in a procedural sequence learning task.
Identification of human chromosome 22 transcribed sequences with ORF expressed sequence tags

PubMed Central

de Souza, Sandro J.; Camargo, Anamaria A.; Briones, Marcelo R. S.; Costa, Fernando F.; Nagai, Maria Aparecida; Verjovski-Almeida, Sergio; Zago, Marco A.; Andrade, Luis Eduardo C.; Carrer, Helaine; El-Dorry, Hamza F. A.; Espreafico, Enilza M.; Habr-Gama, Angelita; Giannella-Neto, Daniel; Goldman, Gustavo H.; Gruber, Arthur; Hackel, Christine; Kimura, Edna T.; Maciel, Rui M. B.; Marie, Suely K. N.; Martins, Elizabeth A. L.; Nóbrega, Marina P.; Paçó-Larson, Maria Luisa; Pardini, Maria Inês M. C.; Pereira, Gonçalo G.; Pesquero, João Bosco; Rodrigues, Vanderlei; Rogatto, Silvia R.; da Silva, Ismael D. C. G.; Sogayar, Mari C.; de Fátima Sonati, Maria; Tajara, Eloiza H.; Valentini, Sandro R.; Acencio, Marcio; Alberto, Fernando L.; Amaral, Maria Elisabete J.; Aneas, Ivy; Bengtson, Mário Henrique; Carraro, Dirce M.; Carvalho, Alex F.; Carvalho, Lúcia Helena; Cerutti, Janete M.; Corrêa, Maria Lucia C.; Costa, Maria Cristina R.; Curcio, Cyntia; Gushiken, Tsieko; Ho, Paulo L.; Kimura, Elza; Leite, Luciana C. C.; Maia, Gustavo; Majumder, Paromita; Marins, Mozart; Matsukuma, Adriana; Melo, Analy S. A.; Mestriner, Carlos Alberto; Miracca, Elisabete C.; Miranda, Daniela C.; Nascimento, Ana Lucia T. O.; Nóbrega, Francisco G.; Ojopi, Élida P. B.; Pandolfi, José Rodrigo C.; Pessoa, Luciana Gilbert; Rahal, Paula; Rainho, Claudia A.; da Ro's, Nancy; de Sá, Renata G.; Sales, Magaly M.; da Silva, Neusa P.; Silva, Tereza C.; da Silva, Wilson; Simão, Daniel F.; Sousa, Josane F.; Stecconi, Daniella; Tsukumo, Fernando; Valente, Valéria; Zalcberg, Heloisa; Brentani, Ricardo R.; Reis, Luis F. L.; Dias-Neto, Emmanuel; Simpson, Andrew J. G.

2000-01-01

Transcribed sequences in the human genome can be identified with confidence only by alignment with sequences derived from cDNAs synthesized from naturally occurring mRNAs. We constructed a set of 250,000 cDNAs that represent partial expressed gene sequences and that are biased toward the central coding regions of the resulting transcripts. They are termed ORF expressed sequence tags (ORESTES). The 250,000 ORESTES were assembled into 81,429 contigs. Of these, 1,181 (1.45%) were found to match sequences in chromosome 22 with at least one ORESTES contig for 162 (65.6%) of the 247 known genes, for 67 (44.6%) of the 150 related genes, and for 45 of the 148 (30.4%) EST-predicted genes on this chromosome. Using a set of stringent criteria to validate our sequences, we identified a further 219 previously unannotated transcribed sequences on chromosome 22. Of these, 171 were in fact also defined by EST or full length cDNA sequences available in GenBank but not utilized in the initial annotation of the first human chromosome sequence. Thus despite representing less than 15% of all expressed human sequences in the public databases at the time of the present analysis, ORESTES sequences defined 48 transcribed sequences on chromosome 22 not defined by other sequences. All of the transcribed sequences defined by ORESTES coincided with DNA regions predicted as encoding exons by genscan (http://genes.mit.edu/GENSCAN.html). PMID:11070084
The recurrence sequences via Sylvester matrices

NASA Astrophysics Data System (ADS)

Karaduman, Erdal; Deveci, Ömür

2017-07-01

In this work, we define the Pell-Jacobsthal-Sylvester sequence and the Jacobsthal-Pell-Sylvester sequence by using the Sylvester matrices obtained from the characteristic polynomials of the Pell and Jacobsthal sequences, and then we study the sequences defined modulo m. We also obtain the cyclic groups and the semigroups from the generating matrices of these sequences when read modulo m, and we derive the relationships among the orders of the cyclic groups and the periods of the sequences. Furthermore, we redefine the Pell-Jacobsthal-Sylvester sequence and the Jacobsthal-Pell-Sylvester sequence by means of the elements of the groups, and then we examine them in finite groups.
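The two ingredients named in the abstract can be sketched as follows: the Sylvester matrix of the Pell and Jacobsthal characteristic polynomials (x^2 - 2x - 1 and x^2 - x - 2, from the standard recurrences P(n+2) = 2P(n+1) + P(n) and J(n+2) = J(n+1) + 2J(n)), and the period of a second-order recurrence read modulo m. The abstract does not give the definition of the Pell-Jacobsthal-Sylvester sequence itself, so it is not reproduced here; the function names are hypothetical.

```python
def sylvester_matrix(p, q):
    """Sylvester matrix of polynomials p and q, each given as a coefficient
    list from the highest degree down, e.g. x^2 - 2x - 1 -> [1, -2, -1]."""
    m, n = len(p) - 1, len(q) - 1   # degrees of p and q
    size = m + n
    rows = []
    for shift in range(n):          # n shifted copies of p
        rows.append([0] * shift + p + [0] * (size - m - 1 - shift))
    for shift in range(m):          # m shifted copies of q
        rows.append([0] * shift + q + [0] * (size - n - 1 - shift))
    return rows


def period_mod(a, b, m):
    """Period of x(n+2) = a*x(n+1) + b*x(n) modulo m, starting from (0, 1).
    Returns None when the start pair never recurs (not purely periodic)."""
    start = (0, 1 % m)
    state = start
    for n in range(1, m * m + 1):   # at most m*m distinct state pairs
        state = (state[1], (a * state[1] + b * state[0]) % m)
        if state == start:
            return n
    return None


PELL = [1, -2, -1]        # x^2 - 2x - 1
JACOBSTHAL = [1, -1, -2]  # x^2 - x - 2

S = sylvester_matrix(PELL, JACOBSTHAL)
pell_period_mod_3 = period_mod(2, 1, 3)
jacobsthal_period_mod_3 = period_mod(1, 2, 3)
```

The Jacobsthal recurrence is not invertible modulo an even m, so `period_mod(1, 2, 2)` returns `None`: the sequence becomes periodic only after a pre-period.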
BAC sequencing using pooled methods.

PubMed

Saski, Christopher A; Feltus, F Alex; Parida, Laxmi; Haiminen, Niina

2015-01-01

Shotgun sequencing and assembly of a large, complex genome can be expensive, and accurately reconstructing the true genome sequence can be challenging. Repetitive DNA arrays, paralogous sequences, polyploidy, and heterozygosity are the main factors that plague de novo genome sequencing projects, which typically result in highly fragmented assemblies from which it is difficult to extract biological meaning. Targeted, sub-genomic sequencing offers complexity reduction by removing distal segments of the genome and a systematic mechanism for exploring prioritized genomic content through BAC sequencing. If one isolates and sequences the genome fraction that encodes the relevant biological information, then it is possible to reduce the overall cost and effort of sequencing a targeted genomic segment. This chapter describes the sub-genome assembly protocol for an organism based upon a BAC tiling path derived from a genome-scale physical map or from fine mapping using BACs to target sub-genomic regions. Methods that are described include BAC isolation and mapping, DNA sequencing, and sequence assembly.

Long-range correlations and charge transport properties of DNA sequences

NASA Astrophysics Data System (ADS)

Liu, Xiao-liang; Ren, Yi; Xie, Qiong-tao; Deng, Chao-sheng; Xu, Hui

2010-04-01

By using Hurst's analysis and transfer approach, the rescaled range functions and Hurst exponents of human chromosome 22 and enterobacteria phage lambda DNA sequences are investigated and the transmission coefficients, Landauer resistances and Lyapunov coefficients of finite segments based on above genomic DNA sequences are calculated. In a comparison with quasiperiodic and random artificial DNA sequences, we find that λ-DNA exhibits anticorrelation behavior characterized by a Hurst exponent 0.5
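For readers unfamiliar with Hurst's analysis, a rough sketch of a rescaled-range (R/S) estimate on a DNA string follows. The numeric mapping (purines to +1, pyrimidines to -1), the window sizes, and the function names are assumptions for illustration; the abstract does not state which mapping or windows the authors used. An exponent near 0.5 indicates an uncorrelated sequence, and the seeded random sequence serves only as a reference case.

```python
import math
import random


def to_walk_steps(dna):
    """Assumed mapping: purines (A, G) -> +1, pyrimidines (C, T) -> -1."""
    return [1 if base in "AG" else -1 for base in dna.upper() if base in "ACGT"]


def rescaled_range(chunk):
    """R/S of one window: range of cumulative deviations over the std dev."""
    n = len(chunk)
    mean = sum(chunk) / n
    cum, lowest, highest = 0.0, 0.0, 0.0
    for x in chunk:
        cum += x - mean
        lowest, highest = min(lowest, cum), max(highest, cum)
    std = math.sqrt(sum((x - mean) ** 2 for x in chunk) / n)
    return (highest - lowest) / std if std > 0 else None


def hurst_exponent(series, window_sizes):
    """Slope of log(mean R/S) against log(window size), by least squares."""
    xs, ys = [], []
    for n in window_sizes:
        values = []
        for start in range(0, len(series) - n + 1, n):  # non-overlapping windows
            rs = rescaled_range(series[start:start + n])
            if rs is not None:
                values.append(rs)
        if values:
            xs.append(math.log(n))
            ys.append(math.log(sum(values) / len(values)))
    mx, my = sum(xs) / len(xs), sum(ys) / len(ys)
    return (sum((x - mx) * (y - my) for x, y in zip(xs, ys))
            / sum((x - mx) ** 2 for x in xs))


# Reference case: an uncorrelated random artificial sequence.
random.seed(0)
random_dna = "".join(random.choice("ACGT") for _ in range(4096))
h_random = hurst_exponent(to_walk_steps(random_dna), [16, 32, 64, 128, 256, 512])
```

On short windows the R/S estimate is known to be biased somewhat above 0.5 even for uncorrelated data, so estimates should be compared against a random reference of the same length instead of against 0.5 directly.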
Single molecule sequencing of the M13 virus genome without amplification

PubMed Central

Zhao, Luyang; Deng, Liwei; Li, Gailing; Jin, Huan; Cai, Jinsen; Shang, Huan; Li, Yan; Wu, Haomin; Xu, Weibin; Zeng, Lidong; Zhang, Renli; Zhao, Huan; Wu, Ping; Zhou, Zhiliang; Zheng, Jiao; Ezanno, Pierre; Yang, Andrew X.; Yan, Qin; Deem, Michael W.; He, Jiankui

2017-01-01

Next generation sequencing (NGS) has revolutionized life sciences research. However, GC bias and costly, time-intensive library preparation make NGS an ill fit for increasing sequencing demands in the clinic. A new class of third-generation sequencing platforms has arrived to meet this need, capable of directly measuring DNA and RNA sequences at the single-molecule level without amplification. Here, we use the new GenoCare single-molecule sequencing platform from Direct Genomics to sequence the genome of the M13 virus. Our platform detects single-molecule fluorescence by total internal reflection microscopy, with sequencing-by-synthesis chemistry. We sequenced the genome of M13 to a depth of 316x, with 100% coverage. We determined a consensus sequence accuracy of 100%. In contrast to GC bias inherent to NGS results, we demonstrated that our single-molecule sequencing method yields minimal GC bias. PMID:29253901
Single molecule sequencing of the M13 virus genome without amplification.

PubMed

Zhao, Luyang; Deng, Liwei; Li, Gailing; Jin, Huan; Cai, Jinsen; Shang, Huan; Li, Yan; Wu, Haomin; Xu, Weibin; Zeng, Lidong; Zhang, Renli; Zhao, Huan; Wu, Ping; Zhou, Zhiliang; Zheng, Jiao; Ezanno, Pierre; Yang, Andrew X; Yan, Qin; Deem, Michael W; He, Jiankui

2017-01-01

Next generation sequencing (NGS) has revolutionized life sciences research. However, GC bias and costly, time-intensive library preparation make NGS an ill fit for increasing sequencing demands in the clinic. A new class of third-generation sequencing platforms has arrived to meet this need, capable of directly measuring DNA and RNA sequences at the single-molecule level without amplification. Here, we use the new GenoCare single-molecule sequencing platform from Direct Genomics to sequence the genome of the M13 virus. Our platform detects single-molecule fluorescence by total internal reflection microscopy, with sequencing-by-synthesis chemistry. We sequenced the genome of M13 to a depth of 316x, with 100% coverage. We determined a consensus sequence accuracy of 100%. In contrast to GC bias inherent to NGS results, we demonstrated that our single-molecule sequencing method yields minimal GC bias.
DNA sequencing using polymerase substrate-binding kinetics

PubMed Central

Previte, Michael John Robert; Zhou, Chunhong; Kellinger, Matthew; Pantoja, Rigo; Chen, Cheng-Yao; Shi, Jin; Wang, BeiBei; Kia, Amirali; Etchin, Sergey; Vieceli, John; Nikoomanzar, Ali; Bomati, Erin; Gloeckner, Christian; Ronaghi, Mostafa; He, Molly Min

2015-01-01

Next-generation sequencing (NGS) has transformed genomic research by decreasing the cost of sequencing. However, whole-genome sequencing is still costly and complex for diagnostics purposes. In the clinical space, targeted sequencing has the advantage of allowing researchers to focus on specific genes of interest. Routine clinical use of targeted NGS mandates inexpensive instruments, fast turnaround time and an integrated and robust workflow. Here we demonstrate a version of the Sequencing by Synthesis (SBS) chemistry that potentially can become a preferred targeted sequencing method in the clinical space. This sequencing chemistry uses natural nucleotides and is based on real-time recording of the differential polymerase/DNA-binding kinetics in the presence of correct or mismatch nucleotides. This ensemble SBS chemistry has been implemented on an existing Illumina sequencing platform with integrated cluster amplification. We discuss the advantages of this sequencing chemistry for targeted sequencing as well as its limitations for other applications. PMID:25612848
Memory and learning with rapid audiovisual sequences

PubMed Central

Keller, Arielle S.; Sekuler, Robert

2015-01-01

We examined short-term memory for sequences of visual stimuli embedded in varying multisensory contexts. In two experiments, subjects judged the structure of the visual sequences while disregarding concurrent, but task-irrelevant auditory sequences. Stimuli were eight-item sequences in which varying luminances and frequencies were presented concurrently and rapidly (at 8 Hz). Subjects judged whether the final four items in a visual sequence identically replicated the first four items. Luminances and frequencies in each sequence were either perceptually correlated (Congruent) or were unrelated to one another (Incongruent). Experiment 1 showed that, despite encouragement to ignore the auditory stream, subjects' categorization of visual sequences was strongly influenced by the accompanying auditory sequences. Moreover, this influence tracked the similarity between a stimulus's separate audio and visual sequences, demonstrating that task-irrelevant auditory sequences underwent a considerable degree of processing. Using a variant of Hebb's repetition design, Experiment 2 compared musically trained subjects and subjects who had little or no musical training on the same task as used in Experiment 1. Test sequences included some that intermittently and randomly recurred, which produced better performance than sequences that were generated anew for each trial. The auditory component of a recurring audiovisual sequence influenced musically trained subjects more than it did other subjects. This result demonstrates that stimulus-selective, task-irrelevant learning of sequences can occur even when such learning is an incidental by-product of the task being performed. PMID:26575193
Memory and learning with rapid audiovisual sequences.

PubMed

Keller, Arielle S; Sekuler, Robert

2015-01-01

We examined short-term memory for sequences of visual stimuli embedded in varying multisensory contexts. In two experiments, subjects judged the structure of the visual sequences while disregarding concurrent, but task-irrelevant auditory sequences. Stimuli were eight-item sequences in which varying luminances and frequencies were presented concurrently and rapidly (at 8 Hz). Subjects judged whether the final four items in a visual sequence identically replicated the first four items. Luminances and frequencies in each sequence were either perceptually correlated (Congruent) or were unrelated to one another (Incongruent). Experiment 1 showed that, despite encouragement to ignore the auditory stream, subjects' categorization of visual sequences was strongly influenced by the accompanying auditory sequences. Moreover, this influence tracked the similarity between a stimulus's separate audio and visual sequences, demonstrating that task-irrelevant auditory sequences underwent a considerable degree of processing. Using a variant of Hebb's repetition design, Experiment 2 compared musically trained subjects and subjects who had little or no musical training on the same task as used in Experiment 1. Test sequences included some that intermittently and randomly recurred, which produced better performance than sequences that were generated anew for each trial. The auditory component of a recurring audiovisual sequence influenced musically trained subjects more than it did other subjects. This result demonstrates that stimulus-selective, task-irrelevant learning of sequences can occur even when such learning is an incidental by-product of the task being performed.
SeqTrim: a high-throughput pipeline for pre-processing any type of sequence read

PubMed Central

2010-01-01

Background High-throughput automated sequencing has enabled an exponential growth rate of sequencing data. This requires increasing sequence quality and reliability in order to avoid database contamination with artefactual sequences. The arrival of pyrosequencing enhances this problem and necessitates customisable pre-processing algorithms. Results SeqTrim has been implemented both as a Web and as a standalone command line application. Already-published and newly-designed algorithms have been included to identify sequence inserts, to remove low quality, vector, adaptor, low complexity and contaminant sequences, and to detect chimeric reads. The availability of several input and output formats allows its inclusion in sequence processing workflows. Due to its specific algorithms, SeqTrim outperforms other pre-processors implemented as Web services or standalone applications. It performs equally well with sequences from EST libraries, SSH libraries, genomic DNA libraries and pyrosequencing reads and does not lead to over-trimming. Conclusions SeqTrim is an efficient pipeline designed for pre-processing of any type of sequence read, including next-generation sequencing. It is easily configurable and provides a friendly interface that allows users to know what happened with sequences at every pre-processing stage, and to verify pre-processing of an individual sequence if desired. The recommended pipeline reveals more information about each sequence than previously described pre-processors and can discard more sequencing or experimental artefacts. PMID:20089148
Efficient Identification of Murine M2 Macrophage Peptide Targeting Ligands by Phage Display and Next-Generation Sequencing.

PubMed

Liu, Gary W; Livesay, Brynn R; Kacherovsky, Nataly A; Cieslewicz, Maryelise; Lutz, Emi; Waalkes, Adam; Jensen, Michael C; Salipante, Stephen J; Pun, Suzie H

2015-08-19

Peptide ligands are used to increase the specificity of drug carriers to their target cells and to facilitate intracellular delivery. One method to identify such peptide ligands, phage display, enables high-throughput screening of peptide libraries for ligands binding to therapeutic targets of interest. However, conventional methods for identifying target binders in a library by Sanger sequencing are low-throughput, labor-intensive, and provide a limited perspective (<0.01%) of the complete sequence space. Moreover, the small sample space can be dominated by nonspecific, preferentially amplifying "parasitic sequences" and plastic-binding sequences, which may lead to the identification of false positives or exclude the identification of target-binding sequences. To overcome these challenges, we employed next-generation Illumina sequencing to couple high-throughput screening and high-throughput sequencing, enabling more comprehensive access to the phage display library sequence space. In this work, we define the hallmarks of binding sequences in next-generation sequencing data, and develop a method that identifies several target-binding phage clones for murine, alternatively activated M2 macrophages with a high (100%) success rate: sequences and binding motifs were reproducibly present across biological replicates; binding motifs were identified across multiple unique sequences; and an unselected, amplified library accurately filtered out parasitic sequences. In addition, we validate the Multiple Em for Motif Elicitation tool as an efficient and principled means of discovering binding sequences.
Integration of Temporal and Ordinal Information During Serial Interception Sequence Learning

PubMed Central

Gobel, Eric W.; Sanchez, Daniel J.; Reber, Paul J.

2011-01-01

The expression of expert motor skills typically involves learning to perform a precisely timed sequence of movements (e.g., language production, music performance, athletic skills). Research examining incidental sequence learning has previously relied on a perceptually-cued task that gives participants exposure to repeating motor sequences but does not require timing of responses for accuracy. Using a novel perceptual-motor sequence learning task, learning a precisely timed cued sequence of motor actions is shown to occur without explicit instruction. Participants learned a repeating sequence through practice and showed sequence-specific knowledge via a performance decrement when switched to an unfamiliar sequence. In a second experiment, the integration of representation of action order and timing sequence knowledge was examined. When either action order or timing sequence information was selectively disrupted, performance was reduced to levels similar to completely novel sequences. Unlike prior sequence-learning research that has found timing information to be secondary to learning action sequences, when the task demands require accurate action and timing information, an integrated representation of these types of information is acquired. These results provide the first evidence for incidental learning of fully integrated action and timing sequence information in the absence of an independent representation of action order, and suggest that this integrative mechanism may play a material role in the acquisition of complex motor skills. PMID:21417511
Method and apparatus for biological sequence comparison

DOEpatents

Marr, T.G.; Chang, W.I.

1997-12-23

A method and apparatus are disclosed for comparing biological sequences from a known source of sequences, with a subject (query) sequence. The apparatus takes as input a set of target similarity levels (such as evolutionary distances in units of PAM), and finds all fragments of known sequences that are similar to the subject sequence at each target similarity level, and are long enough to be statistically significant. The invention device filters out fragments from the known sequences that are too short, or have a lower average similarity to the subject sequence than is required by each target similarity level. The subject sequence is then compared only to the remaining known sequences to find the best matches. The filtering member divides the subject sequence into overlapping blocks, each block being sufficiently large to contain a minimum-length alignment from a known sequence. For each block, the filter member compares the block with every possible short fragment in the known sequences and determines a best match for each comparison. The determined set of short fragment best matches for the block provide an upper threshold on alignment values. Regions of a certain length from the known sequences that have a mean alignment value upper threshold greater than a target unit score are concatenated to form a union. The current block is compared to the union and provides an indication of best local alignment with the subject sequence. 5 figs.
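The block-division step described above can be illustrated with a minimal sketch. The abstract does not state how block size and overlap are chosen, so the rule used here (step = block length minus minimum alignment length plus one) is an assumption that simply guarantees every window of the minimum alignment length falls entirely inside at least one block. The function and parameter names are hypothetical.

```python
def overlapping_blocks(sequence, block_len, min_align_len):
    """Yield (start, block) pairs covering `sequence` with overlapping blocks.

    Assumed rule (not taken from the patent): consecutive blocks are offset by
    block_len - min_align_len + 1, so any window of length min_align_len lies
    wholly within some block.
    """
    if not 0 < min_align_len <= block_len:
        raise ValueError("need 0 < min_align_len <= block_len")
    step = block_len - min_align_len + 1
    # The last block must start early enough to contain the final window.
    last_start = max(len(sequence) - min_align_len, 0)
    for start in range(0, last_start + 1, step):
        yield start, sequence[start:start + block_len]


if __name__ == "__main__":
    for start, block in overlapping_blocks("ACGTACGTAC", block_len=6, min_align_len=3):
        print(start, block)
```

Each block could then be passed to whatever short-fragment comparison is used; that comparison is outside the scope of this sketch.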
Method and apparatus for biological sequence comparison

DOEpatents

Marr, Thomas G.; Chang, William I-Wei

1997-01-01

A method and apparatus for comparing biological sequences from a known source of sequences, with a subject (query) sequence. The apparatus takes as input a set of target similarity levels (such as evolutionary distances in units of PAM), and finds all fragments of known sequences that are similar to the subject sequence at each target similarity level, and are long enough to be statistically significant. The invention device filters out fragments from the known sequences that are too short, or have a lower average similarity to the subject sequence than is required by each target similarity level. The subject sequence is then compared only to the remaining known sequences to find the best matches. The filtering member divides the subject sequence into overlapping blocks, each block being sufficiently large to contain a minimum-length alignment from a known sequence. For each block, the filter member compares the block with every possible short fragment in the known sequences and determines a best match for each comparison. The determined set of short fragment best matches for the block provide an upper threshold on alignment values. Regions of a certain length from the known sequences that have a mean alignment value upper threshold greater than a target unit score are concatenated to form a union. The current block is compared to the union and provides an indication of best local alignment with the subject sequence.
A Deep-Coverage Tomato BAC Library and Prospects Toward Development of an STC Framework for Genome Sequencing

PubMed Central

Budiman, Muhammad A.; Mao, Long; Wood, Todd C.; Wing, Rod A.

2000-01-01

Recently a new strategy using BAC end sequences as sequence-tagged connectors (STCs) was proposed for whole-genome sequencing projects. In this study, we present the construction and detailed characterization of a 15.0 haploid genome equivalent BAC library for the cultivated tomato, Lycopersicon esculentum cv. Heinz 1706. The library contains 129,024 clones with an average insert size of 117.5 kb and a chloroplast content of 1.11%. BAC end sequences from 1490 ends were generated and analyzed as a preliminary evaluation for using this library to develop an STC framework to sequence the tomato genome. A total of 1205 BAC end sequences (80.9%) were obtained, with an average length of 360 high-quality bases, and were searched against the GenBank database. Using a cutoff expectation value of <10^-6, and combining the results from BLASTN, BLASTX, and TBLASTX searches, 24.3% of the BAC end sequences were similar to known sequences, of which almost half (48.7%) share sequence similarities to retrotransposons and 7% to known genes. Some of the transposable element sequences were the first reported in tomato, such as sequences similar to maize transposon Activator (Ac) ORF and tobacco pararetrovirus-like sequences. Interestingly, there were no BAC end sequences similar to the highly repeated TGRI and TGRII elements. However, the majority (70.3%) of STCs did not share significant sequence similarities to any sequences in GenBank at either the DNA or predicted protein levels, indicating that a large portion of the tomato genome is still unknown. Our data demonstrate that this BAC library is suitable for developing an STC database to sequence the tomato genome. The advantages of developing an STC framework for whole-genome sequencing of tomato are discussed. [The BAC end sequences described in this paper have been deposited in the GenBank data library under accession nos. AQ367111–AQ368361.] PMID:10645957
The use of coded PCR primers enables high-throughput sequencing of multiple homolog amplification products by 454 parallel sequencing.

PubMed

Binladen, Jonas; Gilbert, M Thomas P; Bollback, Jonathan P; Panitz, Frank; Bendixen, Christian; Nielsen, Rasmus; Willerslev, Eske

2007-02-14

The invention of the Genome Sequencer 20 DNA Sequencing System (454 parallel sequencing platform) has enabled the rapid and high-volume production of sequence data. Until now, however, individual emulsion PCR (emPCR) reactions and subsequent sequencing runs have been unable to combine template DNA from multiple individuals, as homologous sequences cannot be subsequently assigned to their original sources. We use conventional PCR with 5'-nucleotide tagged primers to generate homologous DNA amplification products from multiple specimens, followed by sequencing through the high-throughput Genome Sequencer 20 DNA Sequencing System (GS20, Roche/454 Life Sciences). Each DNA sequence is subsequently traced back to its individual source through 5' tag analysis. We demonstrate that this new approach enables the assignment of virtually all the generated DNA sequences to the correct source once sequencing anomalies are accounted for (misassignment rate <0.4%). Therefore, the method enables accurate sequencing and assignment of homologous DNA sequences from multiple sources in a single high-throughput GS20 run. We observe a bias in the distribution of the differently tagged primers that is dependent on the 5' nucleotide of the tag. In particular, primers 5' labelled with a cytosine are heavily overrepresented among the final sequences, while those 5' labelled with a thymine are strongly underrepresented. A weaker bias also exists with regard to the distribution of the sequences as sorted by the second nucleotide of the dinucleotide tags. As the results are based on a single GS20 run, the general applicability of the approach requires confirmation. However, our experiments demonstrate that 5' primer tagging is a useful method in which the sequencing power of the GS20 can be applied to PCR-based assays of multiple homologous PCR products. The new approach will be of value to a broad range of research areas, such as those of comparative genomics, complete mitochondrial analyses, population genetics, and phylogenetics.
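The idea of tracing each read back to its source by its 5' tag can be sketched as follows. This is only an illustration under stated assumptions: the tag-to-specimen table, the specimen names, and the exact-match rule are hypothetical, and the handling of sequencing anomalies mentioned in the abstract is not modelled.

```python
def assign_reads_by_tag(reads, tag_to_source, tag_len=2):
    """Group reads by their 5' tag and strip the tag from each assigned read.

    `tag_to_source` is a hypothetical lookup table (tag -> specimen label).
    Reads whose leading `tag_len` bases match no known tag are returned
    separately rather than guessed at.
    """
    assigned = {source: [] for source in tag_to_source.values()}
    unassigned = []
    for read in reads:
        tag, insert = read[:tag_len].upper(), read[tag_len:]
        source = tag_to_source.get(tag)
        if source is None:
            unassigned.append(read)
        else:
            assigned[source].append(insert)
    return assigned, unassigned


if __name__ == "__main__":
    # Hypothetical dinucleotide tags and specimen labels.
    tags = {"CA": "specimen_1", "TG": "specimen_2"}
    print(assign_reads_by_tag(["CAACGT", "TGGGCC", "NNACGT"], tags))
```

Counting the reads per tag from the returned dictionary would be one simple way to look for the tag-dependent representation bias the abstract reports.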
Sequential addition of short DNA oligos in DNA-polymerase-based synthesis reactions

DOEpatents

Gardner, Shea N; Mariella, Jr., Raymond P; Christian, Allen T; Young, Jennifer A; Clague, David S

2013-06-25

A method of preselecting a multiplicity of DNA sequence segments that will comprise the DNA molecule of user-defined sequence, separating the DNA sequence segments temporally, and combining the multiplicity of DNA sequence segments with at least one polymerase enzyme wherein the multiplicity of DNA sequence segments join to produce the DNA molecule of user-defined sequence. Sequence segments may be of length n, where n is an odd integer. In one embodiment the length of desired hybridizing overlap is specified by the user and the sequences and the protocol for combining them are guided by computational (bioinformatics) predictions. In one embodiment sequence segments are combined from multiple reading frames to span the same region of a sequence, so that multiple desired hybridizations may occur with different overlap lengths.
Repetitive sequences in plant nuclear DNA: types, distribution, evolution and function.

PubMed

Mehrotra, Shweta; Goyal, Vinod

2014-08-01

Repetitive DNA sequences are a major component of eukaryotic genomes and may account for up to 90% of the genome size. They can be divided into minisatellite, microsatellite and satellite sequences. Satellite DNA sequences are considered to be a fast-evolving component of eukaryotic genomes, comprising tandemly-arrayed, highly-repetitive and highly-conserved monomer sequences. The monomer unit of satellite DNA is 150-400 base pairs (bp) in length. Repetitive sequences may be species- or genus-specific, and may be centromeric or subtelomeric in nature. They exhibit cohesive and concerted evolution caused by molecular drive, leading to high sequence homogeneity. Repetitive sequences accumulate variations in sequence and copy number during evolution, hence they are important tools for taxonomic and phylogenetic studies, and are known as "tuning knobs" in evolution. Therefore, knowledge of repetitive sequences assists our understanding of the organization, evolution and behavior of eukaryotic genomes. Repetitive sequences have cytoplasmic, cellular and developmental effects and play a role in chromosomal recombination. In the post-genomics era, with the introduction of next-generation sequencing technology, it is possible to evaluate complex genomes for analyzing repetitive sequences and deciphering the yet unknown functional potential of repetitive sequences. Copyright © 2014 The Authors. Production and hosting by Elsevier Ltd. All rights reserved.
Why barcode? High-throughput multiplex sequencing of mitochondrial genomes for molecular systematics.

PubMed

Timmermans, M J T N; Dodsworth, S; Culverwell, C L; Bocak, L; Ahrens, D; Littlewood, D T J; Pons, J; Vogler, A P

2010-11-01

Mitochondrial genome sequences are important markers for phylogenetics but taxon sampling remains sporadic because of the great effort and cost required to acquire full-length sequences. Here, we demonstrate a simple, cost-effective way to sequence the full complement of protein coding mitochondrial genes from pooled samples using the 454/Roche platform. Multiplexing was achieved without the need for expensive indexing tags ('barcodes'). The method was trialled with a set of long-range polymerase chain reaction (PCR) fragments from 30 species of Coleoptera (beetles) sequenced in a 1/16th sector of a sequencing plate. Long contigs were produced from the pooled sequences with sequencing depths ranging from ∼10 to 100× per contig. Species identity of individual contigs was established via three 'bait' sequences matching disparate parts of the mitochondrial genome obtained by conventional PCR and Sanger sequencing. This proved that assembly of contigs from the sequencing pool was correct. Our study produced sequences for 21 nearly complete and seven partial sets of protein coding mitochondrial genes. Combined with existing sequences for 25 taxa, an improved estimate of basal relationships in Coleoptera was obtained. The procedure could be employed routinely for mitochondrial genome sequencing at the species level, to provide improved species 'barcodes' that currently use the cox1 gene only.
Novel methodologies for spectral classification of exon and intron sequences

NASA Astrophysics Data System (ADS)

Kwan, Hon Keung; Kwan, Benjamin Y. M.; Kwan, Jennifer Y. Y.

2012-12-01

Digital processing of a nucleotide sequence requires it to be mapped to a numerical sequence in which the choice of nucleotide to numeric mapping affects how well its biological properties can be preserved and reflected from nucleotide domain to numerical domain. Digital spectral analysis of nucleotide sequences unfolds a period-3 power spectral value which is more prominent in an exon sequence as compared to that of an intron sequence. The success of a period-3 based exon and intron classification depends on the choice of a threshold value. The main purposes of this article are to introduce novel codes for 1-sequence numerical representations for spectral analysis and compare them to existing codes to determine appropriate representation, and to introduce novel thresholding methods for more accurate period-3 based exon and intron classification of an unknown sequence. The main findings of this study are summarized as follows: Among sixteen 1-sequence numerical representations, the K-Quaternary Code I offers an attractive performance. A windowed 1-sequence numerical representation (with window length of 9, 15, and 24 bases) offers a possible speed gain over non-windowed 4-sequence Voss representation which increases as sequence length increases. A winner threshold value (chosen from the best among two defined threshold values and one other threshold value) offers a top precision for classifying an unknown sequence of specified fixed lengths. An interpolated winner threshold value applicable to an unknown and arbitrary length sequence can be estimated from the winner threshold values of fixed length sequences with a comparable performance. In general, precision increases as sequence length increases. The study contributes an effective spectral analysis of nucleotide sequences to better reveal embedded properties, and has potential applications in improved genome annotation.
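The period-3 spectral value mentioned above can be illustrated with a minimal sketch based on the 4-sequence Voss (binary indicator) representation that the abstract uses as its baseline. The novel 1-sequence codes and the thresholding methods of the article are not reproduced here; the function name is hypothetical, and the spectrum is evaluated directly at frequency 1/3 rather than through a full DFT.

```python
import cmath


def period3_power(seq):
    """Total spectral power at frequency 1/3 over the four Voss indicator sequences.

    For each base, the indicator sequence is 1 where that base occurs and 0
    elsewhere; its Fourier coefficient at frequency 1/3 is the sum of
    exp(-2*pi*i*n/3) over the positions n where the base occurs.
    """
    seq = seq.upper()
    total = 0.0
    for base in "ACGT":
        coeff = sum(
            cmath.exp(-2j * cmath.pi * n / 3)
            for n, b in enumerate(seq)
            if b == base
        )
        total += abs(coeff) ** 2
    return total


if __name__ == "__main__":
    print(period3_power("ATG" * 10))  # strongly codon-periodic toy sequence
    print(period3_power("A" * 30))    # no period-3 structure
```

A classifier in the spirit of the abstract would compare this value (usually normalized by sequence length or average spectral power) against a threshold; how that threshold is chosen is exactly what the article investigates.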
Sequence and facies architecture of the upper Blackhawk Formation and the Lower Castlegate Sandstone (Upper Cretaceous), Book Cliffs, Utah, USA

NASA Astrophysics Data System (ADS)

Yoshida, S.

2000-11-01

High-frequency stratigraphic sequences that comprise the Desert Member of the Blackhawk Formation, the Lower Castlegate Sandstone, and the Buck Tongue in the Green River area of Utah display changes in sequence architecture from marine deposits to marginal marine deposits to an entirely nonmarine section. Facies and sequence architecture differ above and below the regionally extensive Castlegate sequence boundary, which separates two low-frequency (10^6-year cyclicity) sequences. Below this surface, high-frequency sequences are identified and interpreted as comprising the highstand systems tract of the low-frequency Blackhawk sequence. Each high-frequency sequence has a local incised valley system on top of the wave-dominated delta, and coastal plain to shallow marine deposits are preserved. Above the Castlegate sequence boundary, in contrast, a regionally extensive sheet sandstone of fluvial to estuarine origin with laterally continuous internal erosional surfaces occurs. These deposits above the Castlegate sequence boundary are interpreted as the late lowstand to early transgressive systems tracts of the low-frequency Castlegate sequence. The base-level changes that generated both the low- and high-frequency sequences are attributed to crustal response to fluctuations in compressive intraplate stress on two different time scales. The low-frequency stratigraphic sequences are attributed to changes in the long-term regional subsidence rate and regional tilting of foreland basin fill. High-frequency sequences probably reflect the response of anisotropic basement to tectonism. Sequence architecture changes rapidly across the faulted margin of the underlying Paleozoic Paradox Basin. The high-frequency sequences are deeply eroded and stack above the Paradox Basin, but display less relief and become conformable updip. These features indicate that the area above the Paradox Basin was more prone to vertical structural movements during formation of the Blackhawk-Lower Castlegate succession.
Equally parsimonious pathways through an RNA sequence space are not equally likely

NASA Technical Reports Server (NTRS)

Lee, Y. H.; DSouza, L. M.; Fox, G. E.

1997-01-01

An experimental system for determining the potential ability of sequences resembling 5S ribosomal RNA (rRNA) to perform as functional 5S rRNAs in vivo in the Escherichia coli cellular environment was devised previously. Presumably, the only 5S rRNA sequences that would have been fixed by ancestral populations are ones that were functionally valid, and hence the actual historical paths taken through RNA sequence space during 5S rRNA evolution would have most likely utilized valid sequences. Herein, we examine the potential validity of all sequence intermediates along alternative equally parsimonious trajectories through RNA sequence space which connect two pairs of sequences that had previously been shown to behave as valid 5S rRNAs in E. coli. The first trajectory requires a total of four changes. The 14 sequence intermediates provide 24 apparently equally parsimonious paths by which the transition could occur. The second trajectory involves three changes, six intermediate sequences, and six potentially equally parsimonious paths. In total, only eight of the 20 sequence intermediates were found to be clearly invalid. As a consequence of the position of these invalid intermediates in the sequence space, seven of the 30 possible paths consisted of exclusively valid sequences. In several cases, the apparent validity/invalidity of the intermediate sequences could not be anticipated on the basis of current knowledge of the 5S rRNA structure. This suggests that the interdependencies in RNA sequence space may be more complex than currently appreciated. If ancestral sequences predicted by parsimony are to be regarded as actual historical sequences, then the present results would suggest that they should also satisfy a validity requirement and that, in at least limited cases, this conjecture can be tested experimentally.
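The path and intermediate counts quoted above follow from simple combinatorics, which a short sketch can reproduce. It assumes that each trajectory consists of distinct single-site changes, each applied exactly once with no reversals, so a path is an ordering of the changes and an intermediate is a non-empty, proper subset of them. The function name is hypothetical.

```python
from itertools import permutations


def paths_and_intermediates(n_changes):
    """Count equally parsimonious paths and distinct intermediate sequences.

    Assumes n_changes independent single-site changes, each made once.
    A path is one ordering of the changes; an intermediate is identified by
    the set of changes already made (excluding the start and end sequences).
    """
    paths = list(permutations(range(n_changes)))
    intermediates = set()
    for order in paths:
        changed = set()
        for site in order[:-1]:  # the final change reaches the end sequence
            changed.add(site)
            intermediates.add(frozenset(changed))
    return len(paths), len(intermediates)


if __name__ == "__main__":
    for n in (4, 3):
        print(n, paths_and_intermediates(n))
```

Under these assumptions the counts are n! paths and 2^n - 2 intermediates, which gives 24 paths and 14 intermediates for four changes and 6 paths and 6 intermediates for three, matching the totals of 30 paths and 20 intermediates in the abstract.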
Aftershock occurrence rate decay for individual sequences and catalogs

NASA Astrophysics Data System (ADS)

Nyffenegger, Paul A.

One of the earliest observations of the Earth's seismicity is that the rate of aftershock occurrence decays with time according to a power law commonly known as modified Omori-law (MOL) decay. However, the physical reasons for aftershock occurrence and the empirical decay in rate remain unclear despite numerous models that yield similar rate decay behavior. Key problems in relating the observed empirical relationship to the physical conditions of the mainshock and fault are the lack of studies including small magnitude mainshocks and the lack of uniformity between studies. We use simulated aftershock sequences to investigate the factors which influence the maximum likelihood (ML) estimate of the Omori-law p value, the parameter describing aftershock occurrence rate decay, for both individual aftershock sequences and "stacked" or superposed sequences. Generally the ML estimate of p is accurate, but since the ML estimated uncertainty is unaffected by whether the sequence resembles an MOL model, a goodness-of-fit test such as the Anderson-Darling statistic is necessary. While stacking aftershock sequences permits the study of entire catalogs and sequences with small aftershock populations, stacking introduces artifacts. The p value for stacked sequences is approximately equal to the mean of the individual sequence p values. We apply single-link cluster analysis to identify all aftershock sequences from eleven regional seismicity catalogs. We observe two new mathematically predictable empirical relationships for the distribution of aftershock sequence populations. The average properties of aftershock sequences are not correlated with tectonic environment, but aftershock populations and p values do show a depth dependence. The p values show great variability with time, and large values or changes in p sometimes precede major earthquakes. Studies of teleseismic earthquake catalogs over the last twenty years have led seismologists to question seismicity models and aftershock sequence decay for deep sequences. For seven exceptional deep sequences, we conclude that MOL decay adequately describes these sequences, and little difference exists compared to shallow sequences. However, they do include larger aftershock populations compared to most deep sequences. These results imply that p values for deep sequences are larger than those for intermediate depth sequences.
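For readers unfamiliar with the decay law, the abstract does not write it out; the standard form of the modified Omori law is n(t) = K / (t + c)^p, where t is time since the mainshock and K, c, and p are fitted parameters. The sketch below evaluates that rate and its closed-form integral (the expected number of aftershocks in a time window). The function names are hypothetical, and the maximum likelihood estimation and goodness-of-fit testing discussed in the abstract are not implemented.

```python
import math


def omori_rate(t, K, c, p):
    """Aftershock rate n(t) = K / (t + c)**p at time t after the mainshock."""
    return K / (t + c) ** p


def omori_expected_count(t_start, t_end, K, c, p):
    """Expected number of aftershocks between t_start and t_end.

    This is the integral of omori_rate over the window; the p == 1 case
    integrates to a logarithm and is handled separately.
    """
    if abs(p - 1.0) < 1e-12:
        return K * math.log((t_end + c) / (t_start + c))
    return K * ((t_start + c) ** (1 - p) - (t_end + c) ** (1 - p)) / (p - 1)


if __name__ == "__main__":
    # Illustrative parameter values only; not taken from any catalog.
    print(omori_rate(1.0, K=100.0, c=0.1, p=1.1))
    print(omori_expected_count(0.0, 30.0, K=100.0, c=0.1, p=1.1))
```

Larger p values correspond to faster decay of the aftershock rate, which is why p is the parameter the study compares across depth and tectonic setting.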

"First generation" automated DNA sequencing technology.

PubMed

Slatko, Barton E; Kieleczawa, Jan; Ju, Jingyue; Gardner, Andrew F; Hendrickson, Cynthia L; Ausubel, Frederick M

2011-10-01

Beginning in the 1980s, automation of DNA sequencing has greatly increased throughput, reduced costs, and enabled large projects to be completed more easily. The development of automation technology paralleled the development of other aspects of DNA sequencing: better enzymes and chemistry, separation and imaging technology, sequencing protocols, robotics, and computational advancements (including base-calling algorithms with quality scores, database developments, and sequence analysis programs). Despite the emergence of high-throughput sequencing platforms, automated Sanger sequencing technology remains useful for many applications. This unit provides background and a description of the "First-Generation" automated DNA sequencing technology. It also includes protocols for using the current Applied Biosystems (ABI) automated DNA sequencing machines. © 2011 by John Wiley & Sons, Inc.
Site directed recombination

DOEpatents

Jurka, Jerzy W.

1997-01-01

Enhanced homologous recombination is obtained by employing a consensus sequence which has been found to be associated with integration of repeat sequences, such as Alu and ID. The consensus sequence or sequence having a single transition mutation determines one site of a double break which allows for high efficiency of integration at the site. By introducing single or double stranded DNA having the consensus sequence flanking region joined to a sequence of interest, one can reproducibly direct integration of the sequence of interest at one or a limited number of sites. In this way, specific sites can be identified and homologous recombination achieved at the site by employing a second flanking sequence associated with a sequence proximal to the 3'-nick.
The sequence of sequencers: The history of sequencing DNA

PubMed Central

Heather, James M.; Chain, Benjamin

2016-01-01

Determining the order of nucleic acid residues in biological samples is an integral component of a wide variety of research applications. Over the last fifty years large numbers of researchers have applied themselves to the production of techniques and technologies to facilitate this feat, sequencing DNA and RNA molecules. This time-scale has witnessed tremendous changes, moving from sequencing short oligonucleotides to millions of bases, from struggling towards the deduction of the coding sequence of a single gene to rapid and widely available whole genome sequencing. This article traverses those years, iterating through the different generations of sequencing technology, highlighting some of the key discoveries, researchers, and sequences along the way. PMID:26554401
Integrating alignment-based and alignment-free sequence similarity measures for biological sequence classification.

PubMed

Borozan, Ivan; Watt, Stuart; Ferretti, Vincent

2015-05-01

Alignment-based sequence similarity searches, while accurate for some types of sequences, can produce incorrect results when used on more divergent but functionally related sequences that have undergone the sequence rearrangements observed in many bacterial and viral genomes. Here, we propose a classification model that exploits the complementary nature of alignment-based and alignment-free similarity measures with the aim to improve the accuracy with which DNA and protein sequences are characterized. Our model classifies sequences using a combined sequence similarity score calculated by adaptively weighting the contribution of different sequence similarity measures. Weights are determined independently for each sequence in the test set and reflect the discriminatory ability of individual similarity measures in the training set. Because the similarity between some sequences is determined more accurately with one type of measure rather than another, our classifier allows different sets of weights to be associated with different sequences. Using five different similarity measures, we show that our model significantly improves the classification accuracy over the current composition- and alignment-based models, when predicting the taxonomic lineage for both short viral sequence fragments and complete viral sequences. We also show that our model can be used effectively for the classification of reads from a real metagenome dataset as well as protein sequences. All the datasets and the code used in this study are freely available at https://collaborators.oicr.on.ca/vferretti/borozan_csss/csss.html. Contact: ivan.borozan@gmail.com. Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press.
Integrating alignment-based and alignment-free sequence similarity measures for biological sequence classification

PubMed Central

Borozan, Ivan; Watt, Stuart; Ferretti, Vincent

2015-01-01

Motivation: Alignment-based sequence similarity searches, while accurate for some types of sequences, can produce incorrect results when used on more divergent but functionally related sequences that have undergone the sequence rearrangements observed in many bacterial and viral genomes. Here, we propose a classification model that exploits the complementary nature of alignment-based and alignment-free similarity measures with the aim to improve the accuracy with which DNA and protein sequences are characterized. Results: Our model classifies sequences using a combined sequence similarity score calculated by adaptively weighting the contribution of different sequence similarity measures. Weights are determined independently for each sequence in the test set and reflect the discriminatory ability of individual similarity measures in the training set. Because the similarity between some sequences is determined more accurately with one type of measure rather than another, our classifier allows different sets of weights to be associated with different sequences. Using five different similarity measures, we show that our model significantly improves the classification accuracy over the current composition- and alignment-based models, when predicting the taxonomic lineage for both short viral sequence fragments and complete viral sequences. We also show that our model can be used effectively for the classification of reads from a real metagenome dataset as well as protein sequences. Availability and implementation: All the datasets and the code used in this study are freely available at https://collaborators.oicr.on.ca/vferretti/borozan_csss/csss.html. Contact: ivan.borozan@gmail.com Supplementary information: Supplementary data are available at Bioinformatics online. PMID:25573913
Studying long 16S rDNA sequences with ultrafast-metagenomic sequence classification using exact alignments (Kraken).

PubMed

Valenzuela-González, Fabiola; Martínez-Porchas, Marcel; Villalpando-Canchola, Enrique; Vargas-Albores, Francisco

2016-03-01

Ultrafast-metagenomic sequence classification using exact alignments (Kraken) is a novel approach to classify 16S rDNA sequences. The classifier is based on mapping short sequences to the lowest common ancestor and performing alignments to form subtrees with specific weights in each taxon node. This study aimed to evaluate the classification performance of Kraken with long 16S rDNA random environmental sequences produced by cloning and then Sanger sequenced. A total of 480 clones were isolated and expanded, and 264 of these clones formed contigs (1352 ± 153 bp). The same sequences were analyzed using the Ribosomal Database Project (RDP) classifier. Deeper classification performance was achieved by Kraken than by the RDP: 73% of the contigs were classified up to the species or variety levels, whereas 67% of these contigs were classified no further than the genus level by the RDP. The results also demonstrated that unassembled sequences analyzed by Kraken provide similar or even deeper information. Moreover, sequences that did not form contigs, which are usually discarded by other programs, provided meaningful information when analyzed by Kraken. Finally, it appears that the assembly step for Sanger sequences can be eliminated when using Kraken. Kraken accumulates the information of both sequence senses, providing additional elements for the classification. In conclusion, the results demonstrate that Kraken is an excellent choice for use in the taxonomic assignment of sequences obtained by Sanger sequencing or based on third generation sequencing, of which the main goal is to generate larger sequences. Copyright © 2016 Elsevier B.V. All rights reserved.
Rapid identification of causative species in patients with Old World leishmaniasis.

PubMed Central

Minodier, P; Piarroux, R; Gambarelli, F; Joblet, C; Dumon, H

1997-01-01

Conventional methods for the identification of species of Leishmania parasite causing infections have limitations. By using a DNA-based alternative, the present study tries to develop a new tool for this purpose. Thirty-three patients living in Marseilles (in the south of France) were suffering from visceral or cutaneous leishmaniasis. DNA of the parasite in clinical samples (bone marrow, peripheral blood, or skin) from these patients was amplified by PCR and directly sequenced. The sequences observed were compared to those of 30 strains of the genus causing Old World leishmaniasis collected in Europe, Africa, or Asia. In the analysis of the sequences of the strains, two different sequence patterns for Leishmania infantum, one sequence for Leishmania donovani, one sequence for Leishmania major, two sequences for Leishmania tropica, and one sequence for Leishmania aethiopica were obtained. Four sequences were observed among the strains from the patients: one was similar to the sequence for the L. major strains, two were identical to the sequences for the L. infantum strains, and the last sequence was not observed within the strains but had a high degree of homology with the sequences of the L. infantum and L. donovani strains. The L. infantum strains from all immunocompetent patients had the same sequence. The L. infantum strains from immunodeficient patients suffering from visceral leishmaniasis had three different sequences. This fact might signify that some variants of L. infantum acquire pathogenicity exclusively in immunocompromised patients. To dispense with the sequencing step, a restriction assay with HaeIII was used. Some restriction patterns might support genetic exchanges in members of the genus Leishmania. PMID:9316906
Science sequence design

NASA Technical Reports Server (NTRS)

Koskela, P. E.; Bollman, W. E.; Freeman, J. E.; Helton, M. R.; Reichert, R. J.; Travers, E. S.; Zawacki, S. J.

1973-01-01

The activities of the following members of the Navigation Team are recorded: the Science Sequence Design Group, responsible for preparing the final science sequence designs; the Advanced Sequence Planning Group, responsible for sequence planning; and the Science Recommendation Team (SRT) representatives, responsible for conducting the necessary sequence design interfaces with the teams during the mission. The interface task included science support in both advance planning and daily operations. Science sequences designed during the mission are also discussed.
The first genome sequences of human bocaviruses from Vietnam

PubMed Central

Thanh, Tran Tan; Van, Hoang Minh Tu; Hong, Nguyen Thi Thu; Nhu, Le Nguyen Truc; Anh, Nguyen To; Tuan, Ha Manh; Hien, Ho Van; Tuong, Nguyen Manh; Kien, Trinh Trung; Khanh, Truong Huu; Nhan, Le Nguyen Thanh; Hung, Nguyen Thanh; Chau, Nguyen Van Vinh; Thwaites, Guy; van Doorn, H. Rogier; Tan, Le Van

2017-01-01

As part of an ongoing effort to generate complete genome sequences of hand, foot and mouth disease-causing enteroviruses directly from clinical specimens, two complete coding sequences and two partial genomic sequences of human bocavirus 1 (n=3) and 2 (n=1) were co-amplified and sequenced, representing the first genome sequences of human bocaviruses from Vietnam. The sequences may aid future study aiming at understanding the evolution of the virus. PMID:28090592
Meeting the challenges of non-referenced genome assembly from short-read sequence data

Treesearch

M. Parks; A. Liston; R. Cronn

2010-01-01

Massively parallel sequencing technologies (MPST) offer unprecedented opportunities for novel sequencing projects. MPST, while offering tremendous sequencing capacity, are typically most effective in resequencing projects (as opposed to the sequencing of novel genomes) due to the fact that sequence is returned in relatively short reads. Nonetheless, there is great...
Experimental investigation of an RNA sequence space

NASA Technical Reports Server (NTRS)

Lee, Youn-Hyung; Dsouza, Lisa; Fox, George E.

1993-01-01

Modern rRNAs are the historic consequence of an ongoing evolutionary exploration of a sequence space. These extant sequences belong to a special subset of the sequence space that is comprised only of those primary sequences that can validly perform the biological function(s) required of the particular RNA. If it were possible to readily identify all such valid sequences, stochastic predictions could be made about the relative likelihood of various evolutionary pathways available to an RNA. Herein an experimental system which can assess whether a particular sequence is likely to have validity as a eubacterial 5S rRNA is described. A total of ten naturally occurring, and hence known to be valid, sequences and two point mutants of unknown validity were used to test the usefulness of the approach. Nine of the ten valid sequences tested positive whereas both mutants tested as clearly defective. The tenth valid sequence gave results that would be interpreted as reflecting a borderline status were the answer not known. These results demonstrate that it is possible to experimentally determine which sequences in local regions of the sequence space are potentially valid 5S rRNAs.
On the role of the SMA in the discrete sequence production task: a TMS study. Transcranial Magnetic Stimulation.

PubMed

Verwey, Willem B; Lammens, Robin; van Honk, Jack

2002-01-01

Participants practiced two discrete six-key sequences for a total of 420 trials. The 1 x 6 sequence had a unique order of key presses while the 2 x 3 sequence involved repetition of a three-key segment. Both sequences showed a long interkey interval halfway through the sequence indicating hierarchical sequence control in that not only the 2 x 3 but also the 1 x 6 sequence was executed as two successive motor chunks. Besides, the second part of both sequences was executed faster than the first part. This supports the earlier notion of a motor processor executing the elements of familiar motor chunks and a cognitive processor triggering either these motor chunks or individual sequence elements. Low-frequency, off-line transcranial magnetic stimulation (TMS) of the supplementary motor area (SMA) counteracted normal improvement with practice of key presses at all sequence positions. Together, these results are in line with the notion that with moderate practice, the SMA executes short sequence fragments that are concatenated by other brain structures.
First-order and higher order sequence learning in specific language impairment.

PubMed

Clark, Gillian M; Lum, Jarrad A G

2017-02-01

A core claim of the procedural deficit hypothesis of specific language impairment (SLI) is that the disorder is associated with poor implicit sequence learning. This study investigated whether implicit sequence learning problems in SLI are present for first-order conditional (FOC) and higher order conditional (HOC) sequences. Twenty-five children with SLI and 27 age-matched, nonlanguage-impaired children completed 2 serial reaction time tasks. On 1 version, the sequence to be implicitly learnt comprised a FOC sequence and on the other a HOC sequence. Results showed that the SLI group learned the HOC sequence (η p ² = .285, p = .005) but not the FOC sequence (η p ² = .099, p = .118). The control group learned both sequences (FOC η p ² = .497, HOC η p ² = .465, ps < .001). The SLI group's difficulty learning the FOC sequence is consistent with the procedural deficit hypothesis. However, the study provides new evidence that multiple mechanisms may underpin the learning of FOC and HOC sequences. (PsycINFO Database Record (c) 2017 APA, all rights reserved).
Identification of random nucleic acid sequence aberrations using dual capture probes which hybridize to different chromosome regions

DOEpatents

Lucas, J.N.; Straume, T.; Bogen, K.T.

1998-03-24

A method is provided for detecting nucleic acid sequence aberrations using two immobilization steps. According to the method, a nucleic acid sequence aberration is detected by detecting nucleic acid sequences having both a first nucleic acid sequence type (e.g., from a first chromosome) and a second nucleic acid sequence type (e.g., from a second chromosome), the presence of the first and the second nucleic acid sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. In the method, immobilization of a first hybridization probe is used to isolate a first set of nucleic acids in the sample which contain the first nucleic acid sequence type. Immobilization of a second hybridization probe is then used to isolate a second set of nucleic acids from within the first set of nucleic acids which contain the second nucleic acid sequence type. The second set of nucleic acids are then detected, their presence indicating the presence of a nucleic acid sequence aberration. 14 figs.
Identification of random nucleic acid sequence aberrations using dual capture probes which hybridize to different chromosome regions

DOEpatents

Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

1998-01-01

A method is provided for detecting nucleic acid sequence aberrations using two immobilization steps. According to the method, a nucleic acid sequence aberration is detected by detecting nucleic acid sequences having both a first nucleic acid sequence type (e.g., from a first chromosome) and a second nucleic acid sequence type (e.g., from a second chromosome), the presence of the first and the second nucleic acid sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. In the method, immobilization of a first hybridization probe is used to isolate a first set of nucleic acids in the sample which contain the first nucleic acid sequence type. Immobilization of a second hybridization probe is then used to isolate a second set of nucleic acids from within the first set of nucleic acids which contain the second nucleic acid sequence type. The second set of nucleic acids are then detected, their presence indicating the presence of a nucleic acid sequence aberration.
Digital RNA sequencing minimizes sequence-dependent bias and amplification noise with optimized single-molecule barcodes

PubMed Central

Shiroguchi, Katsuyuki; Jia, Tony Z.; Sims, Peter A.; Xie, X. Sunney

2012-01-01

RNA sequencing (RNA-Seq) is a powerful tool for transcriptome profiling, but is hampered by sequence-dependent bias and inaccuracy at low copy numbers intrinsic to exponential PCR amplification. We developed a simple strategy for mitigating these complications, allowing truly digital RNA-Seq. Following reverse transcription, a large set of barcode sequences is added in excess, and nearly every cDNA molecule is uniquely labeled by random attachment of barcode sequences to both ends. After PCR, we applied paired-end deep sequencing to read the two barcodes and cDNA sequences. Rather than counting the number of reads, RNA abundance is measured based on the number of unique barcode sequences observed for a given cDNA sequence. We optimized the barcodes to be unambiguously identifiable, even in the presence of multiple sequencing errors. This method allows counting with single-copy resolution despite sequence-dependent bias and PCR-amplification noise, and is analogous to digital PCR but amenable to quantifying a whole transcriptome. We demonstrated transcriptome profiling of Escherichia coli with more accurate and reproducible quantification than conventional RNA-Seq. PMID:22232676
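The counting principle in the record above, unique barcode pairs rather than raw reads, can be shown in a few lines. This is a minimal sketch, not the authors' pipeline: the read tuples, barcode strings, and function name are hypothetical, and barcode error correction (which the optimized barcodes are designed to allow) is left out.

```python
from collections import defaultdict


def count_molecules(reads):
    """Return {cdna: (read_count, unique_barcode_pair_count)}.

    reads is an iterable of (left_barcode, right_barcode, cdna) tuples.
    The read count is inflated by PCR duplicates; the number of distinct
    barcode pairs estimates the number of original cDNA molecules.
    """
    barcode_pairs = defaultdict(set)
    read_counts = defaultdict(int)
    for left, right, cdna in reads:
        barcode_pairs[cdna].add((left, right))
        read_counts[cdna] += 1
    return {
        cdna: (read_counts[cdna], len(pairs))
        for cdna, pairs in barcode_pairs.items()
    }


# Hypothetical reads: geneA was amplified unevenly, geneB was not.
example_reads = [
    ("AAC", "GTT", "geneA"),
    ("AAC", "GTT", "geneA"),  # PCR duplicate of the first molecule
    ("AAC", "GTT", "geneA"),  # PCR duplicate of the first molecule
    ("CCA", "TGG", "geneA"),
    ("CCA", "TGG", "geneA"),  # PCR duplicate of the second molecule
    ("GAT", "ATC", "geneB"),
    ("TTG", "CAA", "geneB"),
]

counts = count_molecules(example_reads)
```

In the example both genes come out at two molecules, although geneA has more than twice as many reads; the difference is amplification noise rather than abundance.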
Deep sequencing of evolving pathogen populations: applications, errors, and bioinformatic solutions

PubMed Central

2014-01-01

Deep sequencing harnesses the high throughput nature of next generation sequencing technologies to generate population samples, treating information contained in individual reads as meaningful. Here, we review applications of deep sequencing to pathogen evolution. Pioneering deep sequencing studies from the virology literature are discussed, such as whole genome Roche-454 sequencing analyses of the dynamics of the rapidly mutating pathogens hepatitis C virus and HIV. Extension of the deep sequencing approach to bacterial populations is then discussed, including the impacts of emerging sequencing technologies. While it is clear that deep sequencing has unprecedented potential for assessing the genetic structure and evolutionary history of pathogen populations, bioinformatic challenges remain. We summarise current approaches to overcoming these challenges, in particular methods for detecting low frequency variants in the context of sequencing error and reconstructing individual haplotypes from short reads. PMID:24428920
ORFer--retrieval of protein sequences and open reading frames from GenBank and storage into relational databases or text files.

PubMed

Büssow, Konrad; Hoffmann, Steve; Sievert, Volker

2002-12-19

Functional genomics involves the parallel experimentation with large sets of proteins. This requires management of large sets of open reading frames as a prerequisite of the cloning and recombinant expression of these proteins. A Java program was developed for retrieval of protein and nucleic acid sequences and annotations from NCBI GenBank, using the XML sequence format. Annotations retrieved by ORFer include sequence name, organism and also the completeness of the sequence. The program has a graphical user interface, although it can be used in a non-interactive mode. For protein sequences, the program also extracts the open reading frame sequence, if available, and checks its correct translation. ORFer accepts user input in the form of single or lists of GenBank GI identifiers or accession numbers. It can be used to extract complete sets of open reading frames and protein sequences from any kind of GenBank sequence entry, including complete genomes or chromosomes. Sequences are either stored with their features in a relational database or can be exported as text files in Fasta or tab-delimited format. The ORFer program is freely available at http://www.proteinstrukturfabrik.de/orfer. The ORFer program allows for fast retrieval of DNA sequences, protein sequences and their open reading frames and sequence annotations from GenBank. Furthermore, storage of sequences and features in a relational database is supported. Such a database can supplement a laboratory information system (LIMS) with appropriate sequence information.
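The step in which an open reading frame is checked against its protein translation can be sketched as follows. ORFer itself is a Java program and its internals are not described in the record above, so this Python sketch only illustrates the idea; the function names and example sequences are hypothetical, and the standard genetic code is assumed.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(orf):
    """Translate a DNA open reading frame up to the first stop codon.

    Codons containing ambiguous bases are translated as 'X'.
    """
    orf = orf.upper()
    protein = []
    for i in range(0, len(orf) - 2, 3):
        amino_acid = CODON_TABLE.get(orf[i:i + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def orf_matches_protein(orf, protein):
    """True if the ORF is a whole number of codons and encodes the protein."""
    return len(orf) % 3 == 0 and translate(orf) == protein.upper()


# Hypothetical record: a 12-nucleotide ORF annotated with the protein MAK.
is_consistent = orf_matches_protein("ATGGCCAAATAA", "MAK")
```

A check of this kind flags frameshifts or mismatched annotations before a sequence is stored or exported.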
Targeted Re-Sequencing Emulsion PCR Panel for Myopathies: Results in 94 Cases.

PubMed

Punetha, Jaya; Kesari, Akanchha; Uapinyoying, Prech; Giri, Mamta; Clarke, Nigel F; Waddell, Leigh B; North, Kathryn N; Ghaoui, Roula; O'Grady, Gina L; Oates, Emily C; Sandaradura, Sarah A; Bönnemann, Carsten G; Donkervoort, Sandra; Plotz, Paul H; Smith, Edward C; Tesi-Rocha, Carolina; Bertorini, Tulio E; Tarnopolsky, Mark A; Reitter, Bernd; Hausmanowa-Petrusewicz, Irena; Hoffman, Eric P

2016-05-27

Molecular diagnostics in the genetic myopathies often requires testing of the largest and most complex transcript units in the human genome (DMD, TTN, NEB). Iteratively targeting single genes for sequencing has traditionally entailed high costs and long turnaround times. Exome sequencing has begun to supplant single targeted genes, but there are concerns regarding coverage and needed depth of the very large and complex genes that frequently cause myopathies. The objective was to evaluate the efficiency of next-generation sequencing technologies in providing molecular diagnostics for patients with previously undiagnosed myopathies. We tested a targeted re-sequencing approach, using a 45 gene emulsion PCR myopathy panel, with subsequent sequencing on the Illumina platform in 94 undiagnosed patients. We compared the targeted re-sequencing approach to exome sequencing for 10 of these patients studied. We detected likely pathogenic mutations in 33 out of 94 patients with a molecular diagnostic rate of approximately 35%. The remaining patients showed variants of unknown significance (35/94 patients) or no mutations detected in the 45 genes tested (26/94 patients). Mutation detection rates for targeted re-sequencing vs. whole exome were similar in both methods; however exome sequencing showed better distribution of reads and fewer exon dropouts. Given that costs of highly parallel re-sequencing and whole exome sequencing are similar, and that exome sequencing now takes considerably less laboratory processing time than targeted re-sequencing, we recommend exome sequencing as the standard approach for molecular diagnostics of myopathies.
Shotgun Protein Sequencing with Meta-contig Assembly

PubMed Central

Guthals, Adrian; Clauser, Karl R.; Bandeira, Nuno

2012-01-01

Full-length de novo sequencing from tandem mass (MS/MS) spectra of unknown proteins such as antibodies or proteins from organisms with unsequenced genomes remains a challenging open problem. Conventional algorithms designed to individually sequence each MS/MS spectrum are limited by incomplete peptide fragmentation or low signal to noise ratios and tend to result in short de novo sequences at low sequencing accuracy. Our shotgun protein sequencing (SPS) approach was developed to ameliorate these limitations by first finding groups of unidentified spectra from the same peptides (contigs) and then deriving a consensus de novo sequence for each assembled set of spectra (contig sequences). But whereas SPS enables much more accurate reconstruction of de novo sequences longer than can be recovered from individual MS/MS spectra, it still requires error-tolerant matching to homologous proteins to group smaller contig sequences into full-length protein sequences, thus limiting its effectiveness on sequences from poorly annotated proteins. Using low and high resolution CID and high resolution HCD MS/MS spectra, we address this limitation with a Meta-SPS algorithm designed to overlap and further assemble SPS contigs into Meta-SPS de novo contig sequences extending as long as 100 amino acids at over 97% accuracy without requiring any knowledge of homologous protein sequences. We demonstrate Meta-SPS using distinct MS/MS data sets obtained with separate enzymatic digestions and discuss how the remaining de novo sequencing limitations relate to MS/MS acquisition settings. PMID:22798278

Shotgun protein sequencing with meta-contig assembly.

PubMed

Guthals, Adrian; Clauser, Karl R; Bandeira, Nuno

2012-10-01

Full-length de novo sequencing from tandem mass (MS/MS) spectra of unknown proteins such as antibodies or proteins from organisms with unsequenced genomes remains a challenging open problem. Conventional algorithms designed to individually sequence each MS/MS spectrum are limited by incomplete peptide fragmentation or low signal to noise ratios and tend to result in short de novo sequences at low sequencing accuracy. Our shotgun protein sequencing (SPS) approach was developed to ameliorate these limitations by first finding groups of unidentified spectra from the same peptides (contigs) and then deriving a consensus de novo sequence for each assembled set of spectra (contig sequences). But whereas SPS enables much more accurate reconstruction of de novo sequences longer than can be recovered from individual MS/MS spectra, it still requires error-tolerant matching to homologous proteins to group smaller contig sequences into full-length protein sequences, thus limiting its effectiveness on sequences from poorly annotated proteins. Using low and high resolution CID and high resolution HCD MS/MS spectra, we address this limitation with a Meta-SPS algorithm designed to overlap and further assemble SPS contigs into Meta-SPS de novo contig sequences extending as long as 100 amino acids at over 97% accuracy without requiring any knowledge of homologous protein sequences. We demonstrate Meta-SPS using distinct MS/MS data sets obtained with separate enzymatic digestions and discuss how the remaining de novo sequencing limitations relate to MS/MS acquisition settings.
Divergent nuclear 18S rDNA paralogs in a turkey coccidium, Eimeria meleagrimitis, complicate molecular systematics and identification.

PubMed

El-Sherry, Shiem; Ogedengbe, Mosun E; Hafeez, Mian A; Barta, John R

2013-07-01

Multiple 18S rDNA sequences were obtained from two single-oocyst-derived lines of each of Eimeria meleagrimitis and Eimeria adenoeides. After analysing the 15 new 18S rDNA sequences from two lines of E. meleagrimitis and 17 new sequences from two lines of E. adenoeides, there were clear indications that divergent, paralogous 18S rDNA copies existed within the nuclear genome of E. meleagrimitis. In contrast, mitochondrial cytochrome c oxidase subunit I (COI) partial sequences from all lines of a particular Eimeria sp. were identical and, in phylogenetic analyses, COI sequences clustered unambiguously in monophyletic and highly-supported clades specific to individual Eimeria sp. Phylogenetic analysis of the new 18S rDNA sequences from E. meleagrimitis showed that they formed two distinct clades: Type A with four new sequences; and Type B with nine new sequences; both Types A and B sequences were obtained from each of the single-oocyst-derived lines of E. meleagrimitis. Together these rDNA types formed a well-supported E. meleagrimitis clade. Types A and B 18S rDNA sequences from E. meleagrimitis had a mean sequence identity of only 97.4% whereas mean sequence identity within types was 99.1-99.3%. The observed intraspecific sequence divergence among E. meleagrimitis 18S rDNA sequence types was even higher (approximately 2.6%) than the interspecific sequence divergence present between some well-recognized species such as Eimeria tenella and Eimeria necatrix (1.1%). Our observations suggest that, unlike COI sequences, 18S rDNA sequences are not reliable molecular markers to be used alone for species identification with coccidia, although 18S rDNA sequences have clear utility for phylogenetic reconstruction of apicomplexan parasites at the genus and higher taxonomic ranks. Copyright © 2013. Published by Elsevier Ltd.
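The within-type and between-type identity figures in the record above are means of pairwise comparisons, and divergence is simply 100 minus identity (97.4% identity corresponds to about 2.6% divergence). A minimal sketch of that bookkeeping is given below; the aligned sequences are invented, the function names are hypothetical, and columns with a gap in either sequence are ignored, which is one common convention and an assumption here.

```python
from itertools import combinations, product


def percent_identity(a, b):
    """Percent identity of two aligned sequences, skipping gapped columns."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    compared = [(x, y) for x, y in zip(a, b) if x != "-" and y != "-"]
    if not compared:
        raise ValueError("no ungapped columns to compare")
    matches = sum(1 for x, y in compared if x == y)
    return 100 * matches / len(compared)


def mean_identity_within(group):
    """Mean pairwise identity among the sequences of one type."""
    pairs = list(combinations(group, 2))
    return sum(percent_identity(a, b) for a, b in pairs) / len(pairs)


def mean_identity_between(group_1, group_2):
    """Mean pairwise identity between sequences of two different types."""
    pairs = list(product(group_1, group_2))
    return sum(percent_identity(a, b) for a, b in pairs) / len(pairs)


# Hypothetical aligned fragments standing in for two paralog types.
type_a = ["ACGTACGTAC", "ACGTACGTAT"]
type_b = ["TCGTACGTGG"]

within_a = mean_identity_within(type_a)
between_ab = mean_identity_between(type_a, type_b)
divergence_ab = 100 - between_ab
```

When the between-type mean falls well below the within-type mean, as in the toy data, the copies behave like distinct lineages even though they come from one genome.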
Existence of host-related DNA sequences in the schistosome genome.

PubMed

Iwamura, Y; Irie, Y; Kominami, R; Nara, T; Yasuraoka, K

1991-06-01

DNA sequences homologous to the mouse intracisternal A particle and endogenous type C retrovirus were detected in the DNAs of Schistosoma japonicum adults and S. mansoni eggs. Furthermore, other kinds of repetitive sequences in the host genome such as mouse type 1 Alu sequence (B1), mouse type 2 Alu sequence (B2) and mo-2 sequence, a mouse mini-satellite, were also detected in the DNAs from adults and eggs of S. japonicum and eggs of S. mansoni. Almost all of the sequences described above were absent in the DNAs of S. mansoni adults. The DNA fingerprints of schistosomes, using the mo-2 sequence, were indistinguishable from each other and resembled those of their murine hosts. Moreover, the mo-2 sequence was hypermethylated in the DNAs of schistosomes and its amount was variable in them. These facts indicate that host-related sequences are actually present in schistosomes and that the mo-2 repetitive sequence probably exists extrachromosomally.
The complete CDS of the prion protein (PRNP) gene of African lion (Panthera leo).

PubMed

Maj, Andrzej; Spellman, Garth M; Sarver, Shane K

2008-04-01

We provide the complete PRNP CDS sequence for the African lion, which is different from the previously published sequence and more similar to other carnivore sequences. The newly obtained prion protein sequence differs from the domestic cat sequence at three amino acid positions and contains only four octapeptide repeats. We recommend that this sequence be used as the reference sequence for future studies of the PRNP gene for this species.
Tidying Up International Nucleotide Sequence Databases: Ecological, Geographical and Sequence Quality Annotation of ITS Sequences of Mycorrhizal Fungi

PubMed Central

Tedersoo, Leho; Abarenkov, Kessy; Nilsson, R. Henrik; Schüssler, Arthur; Grelet, Gwen-Aëlle; Kohout, Petr; Oja, Jane; Bonito, Gregory M.; Veldre, Vilmar; Jairus, Teele; Ryberg, Martin; Larsson, Karl-Henrik; Kõljalg, Urmas

2011-01-01

Sequence analysis of the ribosomal RNA operon, particularly the internal transcribed spacer (ITS) region, provides a powerful tool for identification of mycorrhizal fungi. The sequence data deposited in the International Nucleotide Sequence Databases (INSD) are, however, unfiltered for quality and are often poorly annotated with metadata. To detect chimeric and low-quality sequences and assign the ectomycorrhizal fungi to phylogenetic lineages, fungal ITS sequences were downloaded from INSD, aligned within family-level groups, and examined through phylogenetic analyses and BLAST searches. By combining the fungal sequence database UNITE and the annotation and search tool PlutoF, we also added metadata from the literature to these accessions. Altogether 35,632 sequences belonged to mycorrhizal fungi or originated from ericoid and orchid mycorrhizal roots. Of these sequences, 677 were considered chimeric and 2,174 of low read quality. Information detailing country of collection, geographical coordinates, interacting taxon and isolation source were supplemented to cover 78.0%, 33.0%, 41.7% and 96.4% of the sequences, respectively. These annotated sequences are publicly available via UNITE (http://unite.ut.ee/) for downstream biogeographic, ecological and taxonomic analyses. In European Nucleotide Archive (ENA; http://www.ebi.ac.uk/ena/), the annotated sequences have a special link-out to UNITE. We intend to expand the data annotation to additional genes and all taxonomic groups and functional guilds of fungi. PMID:21949797
The rapid evolution of molecular genetic diagnostics in neuromuscular diseases.

PubMed

Volk, Alexander E; Kubisch, Christian

2017-10-01

The development of massively parallel sequencing (MPS) has revolutionized molecular genetic diagnostics in monogenic disorders. The present review gives a brief overview of different MPS-based approaches used in clinical diagnostics of neuromuscular disorders (NMDs) and highlights their advantages and limitations. MPS-based approaches like gene panel sequencing, (whole) exome sequencing, (whole) genome sequencing, and RNA sequencing have been used to identify the genetic cause in NMDs. Although gene panel sequencing has evolved as a standard test for heterogeneous diseases, it is still debated, mainly because of financial issues and unsolved problems of variant interpretation, whether genome sequencing (and to a lesser extent also exome sequencing) of single patients can already be regarded as routine diagnostics. However, it has been shown that the inclusion of parents and additional family members often leads to a substantial increase in the diagnostic yield in exome-wide/genome-wide MPS approaches. In addition, MPS-based RNA sequencing is just entering the research and diagnostic scene. Next-generation sequencing increasingly enables the detection of the genetic cause in highly heterogeneous diseases like NMDs in an efficient and affordable way. Gene panel sequencing and family-based exome sequencing have proven to be potent and cost-efficient diagnostic tools. Although clinical validation and interpretation of genome sequencing is still challenging, diagnostic RNA sequencing represents a promising tool to bypass some hurdles of diagnostics using genomic DNA.
The use of sequence-based SSR mining for the development of a vast collection of microsatellites in Aquilegia formosa

Treesearch

Brandon Schlautman; Vera Pfeiffer; Juan Zalapa; Johanne Brunet

2014-01-01

Numerous microsatellite markers were developed for Aquilegia formosa from sequences deposited within the Expressed Sequence Tag (EST), Genomic Survey Sequence (GSS), and Nucleotide databases in NCBI. Microsatellites (SSRs) were identified and primers were designed for 9 SSR containing sequences in the Nucleotide database, 3803 sequences in the EST...
Sequence repeats and protein structure

NASA Astrophysics Data System (ADS)

Hoang, Trinh X.; Trovato, Antonio; Seno, Flavio; Banavar, Jayanth R.; Maritan, Amos

2012-11-01

Repeats are frequently found in known protein sequences. The level of sequence conservation in tandem repeats correlates with their propensities to be intrinsically disordered. We employ a coarse-grained model of a protein with a two-letter amino acid alphabet, hydrophobic (H) and polar (P), to examine the sequence-structure relationship in the realm of repeated sequences. A fraction of repeated sequences comprises a distinct class of bad folders, whose folding temperatures are much lower than those of random sequences. Imperfection in sequence repetition improves the folding properties of the bad folders while deteriorating those of the good folders. Our results may explain why nature has utilized repeated sequences for their versatility and especially to design functional proteins that are intrinsically unstructured at physiological temperatures.
The sequence of sequencers: The history of sequencing DNA.

PubMed

Heather, James M; Chain, Benjamin

2016-01-01

Determining the order of nucleic acid residues in biological samples is an integral component of a wide variety of research applications. Over the last fifty years large numbers of researchers have applied themselves to the production of techniques and technologies to facilitate this feat, sequencing DNA and RNA molecules. This time-scale has witnessed tremendous changes, moving from sequencing short oligonucleotides to millions of bases, from struggling towards the deduction of the coding sequence of a single gene to rapid and widely available whole genome sequencing. This article traverses those years, iterating through the different generations of sequencing technology, highlighting some of the key discoveries, researchers, and sequences along the way. Copyright © 2015 The Authors. Published by Elsevier Inc. All rights reserved.
An improved model for whole genome phylogenetic analysis by Fourier transform.

PubMed

Yin, Changchuan; Yau, Stephen S-T

2015-10-07

DNA sequence similarity comparison is one of the major steps in computational phylogenetic studies. The sequence comparison of closely related DNA sequences and genomes is usually performed by multiple sequence alignments (MSA). While the MSA method is accurate for some types of sequences, it may produce incorrect results when DNA sequences have undergone rearrangements, as in many bacterial and viral genomes. It is also limited by its computational complexity for comparing large volumes of data. Previously, we proposed an alignment-free method that exploits the full information contents of DNA sequences by Discrete Fourier Transform (DFT), but still with some limitations. Here, we present a significantly improved method for the similarity comparison of DNA sequences by DFT. In this method, we map DNA sequences into 2-dimensional (2D) numerical sequences and then apply DFT to transform the 2D numerical sequences into frequency domain. In the 2D mapping, the nucleotide composition of a DNA sequence is a determinant factor and the 2D mapping reduces the nucleotide composition bias in distance measure, thus improving the similarity measure of DNA sequences. To compare the DFT power spectra of DNA sequences with different lengths, we propose an improved even scaling algorithm to extend shorter DFT power spectra to the longest length of the underlying sequences. After the DFT power spectra are evenly scaled, the spectra are in the same dimensionality of the Fourier frequency space, then the Euclidean distances of full Fourier power spectra of the DNA sequences are used as the dissimilarity metrics. The improved DFT method, with increased computational performance by 2D numerical representation, can be applicable to any DNA sequences of different length ranges. We assess the accuracy of the improved DFT similarity measure in hierarchical clustering of different DNA sequences including simulated and real datasets. The method yields accurate and reliable phylogenetic trees and demonstrates that the improved DFT dissimilarity measure is an efficient and effective similarity measure of DNA sequences. Due to its high efficiency and accuracy, the proposed DFT similarity measure is successfully applied on phylogenetic analysis for individual genes and large whole bacterial genomes. Copyright © 2015 Elsevier Ltd. All rights reserved.
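The pipeline in the record above (2D numerical mapping, DFT power spectrum, scaling of spectra to a common length, Euclidean distance) can be sketched in plain Python. Several details are assumptions, because the abstract does not give them: the particular 2D coordinates assigned to each nucleotide (represented here as complex numbers), the use of linear interpolation as a stand-in for the even scaling step, and all function names. The naive O(n²) transform is used only to keep the sketch dependency-free; an FFT would be used in practice.

```python
import cmath
import math

# Assumed 2D mapping: each nucleotide is a unit point on one axis,
# stored as a complex number (x + iy). The paper's mapping may differ.
MAPPING = {
    "A": complex(0, -1),
    "T": complex(0, 1),
    "C": complex(-1, 0),
    "G": complex(1, 0),
}


def power_spectrum(seq):
    """Naive DFT power spectrum |X(k)|^2 of the mapped sequence."""
    x = [MAPPING[base] for base in seq.upper()]
    n = len(x)
    spectrum = []
    for k in range(n):
        total = sum(
            x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n)
        )
        spectrum.append(abs(total) ** 2)
    return spectrum


def stretch(spectrum, m):
    """Stretch a spectrum of length n to length m by linear interpolation.

    This stands in for the even scaling step and is an assumption.
    """
    n = len(spectrum)
    if n == m:
        return list(spectrum)
    stretched = []
    for k in range(m):
        q = k * (n - 1) / (m - 1)        # position in the original spectrum
        low = int(math.floor(q))
        high = min(low + 1, n - 1)
        stretched.append(
            spectrum[low] + (q - low) * (spectrum[high] - spectrum[low])
        )
    return stretched


def dft_distance(seq_a, seq_b):
    """Euclidean distance between length-matched power spectra."""
    spectrum_a = power_spectrum(seq_a)
    spectrum_b = power_spectrum(seq_b)
    m = max(len(spectrum_a), len(spectrum_b))
    spectrum_a = stretch(spectrum_a, m)
    spectrum_b = stretch(spectrum_b, m)
    return math.sqrt(
        sum((u - v) ** 2 for u, v in zip(spectrum_a, spectrum_b))
    )


# Hypothetical sequences of different lengths.
distance = dft_distance("ACGTACGTAC", "ACGTTCGT")
```

A matrix of such pairwise distances is what a hierarchical clustering routine would take as input to build a tree; no alignment is involved at any step.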
Kit for detecting nucleic acid sequences using competitive hybridization probes

DOEpatents

Lucas, Joe N.; Straume, Tore; Bogen, Kenneth T.

2001-01-01

A kit is provided for detecting a target nucleic acid sequence in a sample, the kit comprising: a first hybridization probe which includes a nucleic acid sequence that is sufficiently complementary to selectively hybridize to a first portion of the target sequence, the first hybridization probe including a first complexing agent for forming a binding pair with a second complexing agent; and a second hybridization probe which includes a nucleic acid sequence that is sufficiently complementary to selectively hybridize to a second portion of the target sequence to which the first hybridization probe does not selectively hybridize, the second hybridization probe including a detectable marker; a third hybridization probe which includes a nucleic acid sequence that is sufficiently complementary to selectively hybridize to a first portion of the target sequence, the third hybridization probe including the same detectable marker as the second hybridization probe; and a fourth hybridization probe which includes a nucleic acid sequence that is sufficiently complementary to selectively hybridize to a second portion of the target sequence to which the third hybridization probe does not selectively hybridize, the fourth hybridization probe including the first complexing agent for forming a binding pair with the second complexing agent; wherein the first and second hybridization probes are capable of simultaneously hybridizing to the target sequence and the third and fourth hybridization probes are capable of simultaneously hybridizing to the target sequence, the detectable marker is not present on the first or fourth hybridization probes and the first, second, third, and fourth hybridization probes each include a competitive nucleic acid sequence which is sufficiently complementary to a third portion of the target sequence that the competitive sequences of the first, second, third, and fourth hybridization probes compete with each other to hybridize to the third portion of the target sequence.
PFAAT version 2.0: a tool for editing, annotating, and analyzing multiple sequence alignments.

PubMed

Caffrey, Daniel R; Dana, Paul H; Mathur, Vidhya; Ocano, Marco; Hong, Eun-Jong; Wang, Yaoyu E; Somaroo, Shyamal; Caffrey, Brian E; Potluri, Shobha; Huang, Enoch S

2007-10-11

By virtue of their shared ancestry, homologous sequences are similar in their structure and function. Consequently, multiple sequence alignments are routinely used to identify trends that relate to function. This type of analysis is particularly productive when it is combined with structural and phylogenetic analysis. Here we describe the release of PFAAT version 2.0, a tool for editing, analyzing, and annotating multiple sequence alignments. Support for multiple annotations is a key component of this release as it provides a framework for most of the new functionalities. The sequence annotations are accessible from the alignment and tree, where they are typically used to label sequences or hyperlink them to related databases. Sequence annotations can be created manually or extracted automatically from UniProt entries. Once a multiple sequence alignment is populated with sequence annotations, sequences can be easily selected and sorted through a sophisticated search dialog. The selected sequences can be further analyzed using statistical methods that explicitly model relationships between the sequence annotations and residue properties. Residue annotations are accessible from the alignment viewer and are typically used to designate binding sites or properties for a particular residue. Residue annotations are also searchable, and allow one to quickly select alignment columns for further sequence analysis, e.g. computing percent identities. Other features include: novel algorithms to compute sequence conservation, mapping conservation scores to a 3D structure in Jmol, displaying secondary structure elements, and sorting sequences by residue composition. PFAAT provides a framework whereby end-users can specify knowledge for a protein family in the form of annotation. The annotations can be combined with sophisticated analysis to test hypotheses that relate to sequence, structure and function.
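The percent-identity computation over selected alignment columns mentioned above can be illustrated with a minimal Python sketch. This is not PFAAT code: the function name `percent_identity`, the gap character, and the choice to skip columns where either sequence has a gap are assumptions made for illustration.

```python
def percent_identity(seq_a, seq_b, columns=None, gap="-"):
    """Percent identity between two rows of an alignment.

    Columns in which either sequence has a gap are skipped (an assumed
    convention; other tools normalize differently). If `columns` is given,
    only those 0-based alignment columns are compared.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("aligned sequences must have equal length")
    cols = range(len(seq_a)) if columns is None else columns
    compared = 0
    matches = 0
    for i in cols:
        a, b = seq_a[i], seq_b[i]
        if a == gap or b == gap:
            continue
        compared += 1
        if a == b:
            matches += 1
    return 100.0 * matches / compared if compared else 0.0
```

Restricting `columns` to, say, the positions annotated as a binding site gives an identity figure for that region only, which is the kind of column-wise selection the abstract describes.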
Molecular cloning and nucleotide sequence of the alpha and beta subunits of allophycocyanin from the cyanelle genome of Cyanophora paradoxa.

PubMed Central

Bryant, D A; de Lorimier, R; Lambert, D H; Dubbs, J M; Stirewalt, V L; Stevens, S E; Porter, R D; Tam, J; Jay, E

1985-01-01

The genes for the alpha- and beta-subunit apoproteins of allophycocyanin (AP) were isolated from the cyanelle genome of Cyanophora paradoxa and subjected to nucleotide sequence analysis. The AP beta-subunit apoprotein gene was localized to a 7.8-kilobase-pair Pst I restriction fragment from cyanelle DNA by hybridization with a tetradecameric oligonucleotide probe. Sequence analysis using that oligonucleotide and its complement as primers for the dideoxy chain-termination sequencing method confirmed the presence of both AP alpha- and beta-subunit genes on this restriction fragment. Additional oligonucleotide primers were synthesized as sequencing progressed and were used to determine rapidly the nucleotide sequence of a 1336-base-pair region of this cloned fragment. This strategy allowed the sequencing to be completed without a detailed restriction map and without extensive and time-consuming subcloning. The sequenced region contains two open reading frames whose deduced amino acid sequences are 81-85% homologous to cyanobacterial and red algal AP subunits whose amino acid sequences have been determined. The two open reading frames are in the same orientation and are separated by 39 base pairs. AP alpha is 5' to AP beta and both coding sequences are preceded by a polypurine, Shine-Dalgarno-type sequence. Sequences upstream from AP alpha closely resemble the Escherichia coli consensus promoter sequences and also show considerable homology to promoter sequences for several chloroplast-encoded psbA genes. A 56-base-pair palindromic sequence downstream from the AP beta gene could play a role in the termination of transcription or translation. The allophycocyanin apoprotein subunit genes are located on the large single-copy region of the cyanelle genome. PMID:2987916
Sequence dependent aggregation of peptides and fibril formation

NASA Astrophysics Data System (ADS)

Hung, Nguyen Ba; Le, Duy-Manh; Hoang, Trinh X.

2017-09-01

Deciphering the links between amino acid sequence and amyloid fibril formation is key for understanding protein misfolding diseases. Here we use Monte Carlo simulations to study the aggregation of short peptides in a coarse-grained model with hydrophobic-polar (HP) amino acid sequences and correlated side chain orientations for hydrophobic contacts. A significant heterogeneity is observed in the aggregate structures and in the thermodynamics of aggregation for systems of different HP sequences and different numbers of peptides. Fibril-like ordered aggregates are found for several sequences that contain the common HPH pattern, while other sequences may form helix bundles or disordered aggregates. A wide variation of the aggregation transition temperatures among sequences, even among those of the same hydrophobic fraction, indicates that not all sequences undergo aggregation at a presumable physiological temperature. The transition is found to be the most cooperative for sequences forming fibril-like structures. For a fibril-prone sequence, it is shown that fibril formation follows the nucleation and growth mechanism. Interestingly, a binary mixture of peptides of an aggregation-prone and a non-aggregation-prone sequence shows the association and conversion of the latter to the fibrillar structure. Our study highlights the role of a sequence in selecting fibril-like aggregates and also the impact of a structural template on fibril formation by peptides of unrelated sequences.
The primary structure of the Saccharomyces cerevisiae gene for 3-phosphoglycerate kinase.

PubMed Central

Hitzeman, R A; Hagie, F E; Hayflick, J S; Chen, C Y; Seeburg, P H; Derynck, R

1982-01-01

The DNA sequence of the gene for the yeast glycolytic enzyme, 3-phosphoglycerate kinase (PGK), has been obtained by sequencing part of a 3.1 kbp HindIII fragment obtained from the yeast genome. The structural gene sequence corresponds to a reading frame of 1251 bp coding for 416 amino acids with no intervening DNA sequences. The amino acid sequence is approximately 65 percent homologous with human and horse PGK protein sequences and is in general agreement with the published protein sequence for yeast PGK. As for other highly expressed structural genes in yeast, the coding sequence is highly codon biased with 95 percent of the amino acids coded for by a select 25 codons (out of 61 possible). Besides structural DNA sequence, 291 bp of 5'-flanking sequence and 286 bp of 3'-flanking sequence were determined. Transcription starts 36 nucleotides upstream from the translational start and stops 86-93 nucleotides downstream from the translational stop. These results suggest a non-polyadenylated mRNA length of 1373 to 1380 nucleotides, which is consistent with the observed length of 1500 nucleotides for polyadenylated PGK mRNA. A sequence TATATATAAA is found at 145 nucleotides upstream from the translational start. This sequence resembles the TATAAA box that is possibly associated with RNA polymerase II binding. PMID:6296791
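The length figures in this abstract can be checked with a few lines of arithmetic. The sketch below assumes that the 1251-bp reading frame includes the stop codon (417 codons, 416 of them coding); that is an interpretation, not something the abstract states, and the variable names are illustrative.

```python
CODING_BP = 1251          # reading frame length reported in the abstract
UPSTREAM_NT = 36          # transcription start, upstream of the translational start
DOWNSTREAM_NT = (86, 93)  # transcription stop, downstream of the translational stop

# Assumption: the reading frame includes the stop codon.
codons = CODING_BP // 3       # 417 codons
amino_acids = codons - 1      # 416 residues once the stop codon is excluded

# Non-polyadenylated mRNA = 5' untranslated part + coding frame + 3' untranslated part.
mrna_length = tuple(UPSTREAM_NT + CODING_BP + d for d in DOWNSTREAM_NT)  # (1373, 1380)
```

Under that assumption the numbers agree with the 416 amino acids and the 1373 to 1380 nucleotide mRNA length given in the abstract.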
Statistical properties of filtered pseudorandom digital sequences formed from the sum of maximum-length sequences

NASA Technical Reports Server (NTRS)

Wallace, G. R.; Weathers, G. D.; Graf, E. R.

1973-01-01

The statistics of filtered pseudorandom digital sequences called hybrid-sum sequences, formed from the modulo-two sum of several maximum-length sequences, are analyzed. The results indicate that a relation exists between the statistics of the filtered sequence and the characteristic polynomials of the component maximum length sequences. An analysis procedure is developed for identifying a large group of sequences with good statistical properties for applications requiring the generation of analog pseudorandom noise. By use of the analysis approach, the filtering process is approximated by the convolution of the sequence with a sum of unit step functions. A parameter reflecting the overall statistical properties of filtered pseudorandom sequences is derived. This parameter is called the statistical quality factor. A computer algorithm to calculate the statistical quality factor for the filtered sequences is presented, and the results for two examples of sequence combinations are included. The analysis reveals that the statistics of the signals generated with the hybrid-sum generator are potentially superior to the statistics of signals generated with maximum-length generators. Furthermore, fewer calculations are required to evaluate the statistics of a large group of hybrid-sum generators than are required to evaluate the statistics of the same size group of approximately equivalent maximum-length sequences.
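The construction of a hybrid-sum sequence (the modulo-two sum of several maximum-length sequences) can be sketched in Python. The two characteristic polynomials used here, x^3 + x + 1 and x^4 + x + 1, the seeds, and the function names are illustrative choices, not the generators analyzed in the report; the filtering step and the statistical quality factor are not modeled.

```python
def lfsr(taps, seed, length):
    """Generate `length` bits from a linear feedback shift register.

    The next bit is the XOR of the current state bits at the positions in
    `taps` (index 0 is the oldest bit). With a primitive characteristic
    polynomial and a nonzero seed the output is a maximum-length sequence.
    """
    state = list(seed)
    out = []
    for _ in range(length):
        out.append(state[0])
        new_bit = 0
        for t in taps:
            new_bit ^= state[t]
        state = state[1:] + [new_bit]
    return out


def hybrid_sum(sequences):
    """Bitwise modulo-two sum of equal-length bit sequences."""
    return [sum(bits) % 2 for bits in zip(*sequences)]


N = 210
# s[n+3] = s[n+1] XOR s[n]  (x^3 + x + 1, period 7)
m3 = lfsr(taps=(0, 1), seed=(1, 0, 0), length=N)
# s[n+4] = s[n+1] XOR s[n]  (x^4 + x + 1, period 15)
m4 = lfsr(taps=(0, 1), seed=(1, 0, 0, 0), length=N)
hybrid = hybrid_sum([m3, m4])
```

Because the component periods 7 and 15 are coprime, the summed sequence repeats after 105 bits, which is one reason such combinations give long sequences from short registers.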
Effects of the Ion PGM™ Hi-Q™ sequencing chemistry on sequence data quality.

PubMed

Churchill, Jennifer D; King, Jonathan L; Chakraborty, Ranajit; Budowle, Bruce

2016-09-01

Massively parallel sequencing (MPS) offers substantial improvements over current forensic DNA typing methodologies such as increased resolution, scalability, and throughput. The Ion PGM™ is a promising MPS platform for analysis of forensic biological evidence. The system employs a sequencing-by-synthesis chemistry on a semiconductor chip that measures a pH change due to the release of hydrogen ions as nucleotides are incorporated into the growing DNA strands. However, implementation of MPS into forensic laboratories requires a robust chemistry. Ion Torrent's Hi-Q™ Sequencing Chemistry was evaluated to determine if it could improve on the quality of the generated sequence data in association with selected genetic marker targets. The whole mitochondrial genome and the HID-Ion STR 10-plex panel were sequenced on the Ion PGM™ system with the Ion PGM™ Sequencing 400 Kit and the Ion PGM™ Hi-Q™ Sequencing Kit. Concordance, coverage, strand balance, noise, and deletion ratios were assessed in evaluating the performance of the Ion PGM™ Hi-Q™ Sequencing Kit. The results indicate that reliable, accurate data are generated and that sequencing through homopolymeric regions can be improved with the use of Ion Torrent's Hi-Q™ Sequencing Chemistry. Overall, the quality of the generated sequencing data supports the potential for use of the Ion PGM™ in forensic genetic laboratories.
Fundamental Bounds for Sequence Reconstruction from Nanopore Sequencers.

PubMed

Magner, Abram; Duda, Jarosław; Szpankowski, Wojciech; Grama, Ananth

2016-06-01

Nanopore sequencers are emerging as promising new platforms for high-throughput sequencing. As with other technologies, sequencer errors pose a major challenge for their effective use. In this paper, we present a novel information theoretic analysis of the impact of insertion-deletion (indel) errors in nanopore sequencers. In particular, we consider the following problems: (i) for given indel error characteristics and rate, what is the probability of accurate reconstruction as a function of sequence length; (ii) using replicated extrusion (the process of passing a DNA strand through the nanopore), what is the number of replicas needed to accurately reconstruct the true sequence with high probability? Our results provide a number of important insights: (i) the probability of accurate reconstruction of a sequence from a single sample in the presence of indel errors tends quickly (i.e., exponentially) to zero as the length of the sequence increases; and (ii) replicated extrusion is an effective technique for accurate reconstruction. We show that for typical distributions of indel errors, the required number of replicas is a slow function (polylogarithmic) of sequence length - implying that through replicated extrusion, we can sequence large reads using nanopore sequencers. Moreover, we show that in certain cases, the required number of replicas can be related to information-theoretic parameters of the indel error distributions.
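The exponential decay described in result (i) can be illustrated with a deliberately simplified toy model, which is not the information-theoretic analysis of the paper. It assumes each position independently suffers an indel with probability `p`, so a single read of length `n` is indel-free with probability (1 - p)^n; the function name and this independence assumption are ours.

```python
def prob_indel_free(n, p):
    """Probability that a read of length n contains no indel, assuming
    independent per-position indel probability p (toy model only)."""
    if not 0.0 <= p <= 1.0:
        raise ValueError("p must be a probability")
    return (1.0 - p) ** n


# The probability shrinks geometrically as the read gets longer.
for n in (10, 100, 1000):
    print(n, prob_indel_free(n, 0.05))
```

Even in this crude model a modest per-base error rate makes an error-free long read very unlikely, which is the motivation for combining several replicas of the same strand.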
AlignMe—a membrane protein sequence alignment web server

PubMed Central

Stamm, Marcus; Staritzbichler, René; Khafizov, Kamil; Forrest, Lucy R.

2014-01-01

We present a web server for pair-wise alignment of membrane protein sequences, using the program AlignMe. The server makes available two operational modes of AlignMe: (i) sequence to sequence alignment, taking two sequences in fasta format as input, combining information about each sequence from multiple sources and producing a pair-wise alignment (PW mode); and (ii) alignment of two multiple sequence alignments to create family-averaged hydropathy profile alignments (HP mode). For the PW sequence alignment mode, four different optimized parameter sets are provided, each suited to pairs of sequences with a specific similarity level. These settings utilize different types of inputs: (position-specific) substitution matrices, secondary structure predictions and transmembrane propensities from transmembrane predictions or hydrophobicity scales. In the second (HP) mode, each input multiple sequence alignment is converted into a hydrophobicity profile averaged over the provided set of sequence homologs; the two profiles are then aligned. The HP mode enables qualitative comparison of transmembrane topologies (and therefore potentially of 3D folds) of two membrane proteins, which can be useful if the proteins have low sequence similarity. In summary, the AlignMe web server provides user-friendly access to a set of tools for analysis and comparison of membrane protein sequences. Access is available at http://www.bioinfo.mpg.de/AlignMe PMID:24753425
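The first step of the HP mode, converting a multiple sequence alignment into a family-averaged hydrophobicity profile, can be sketched as follows. This is an illustration rather than AlignMe's implementation: the use of the Kyte-Doolittle scale, the treatment of gaps (ignored in the average), the absence of window smoothing, and the function name are all assumptions.

```python
# Kyte-Doolittle hydropathy values for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def averaged_hydropathy_profile(alignment, scale=KYTE_DOOLITTLE):
    """Average hydropathy per alignment column over all aligned homologs.

    `alignment` is a list of equal-length aligned sequences. Gaps and
    unknown residues are skipped; a column with no scorable residue
    yields None.
    """
    profile = []
    for column in zip(*alignment):
        values = [scale[res] for res in column if res in scale]
        profile.append(sum(values) / len(values) if values else None)
    return profile
```

Two such profiles, one per protein family, would then be aligned against each other; that second step is not shown here.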
Quantitative phenotyping via deep barcode sequencing.

PubMed

Smith, Andrew M; Heisler, Lawrence E; Mellor, Joseph; Kaper, Fiona; Thompson, Michael J; Chee, Mark; Roth, Frederick P; Giaever, Guri; Nislow, Corey

2009-10-01

Next-generation DNA sequencing technologies have revolutionized diverse genomics applications, including de novo genome sequencing, SNP detection, chromatin immunoprecipitation, and transcriptome analysis. Here we apply deep sequencing to genome-scale fitness profiling to evaluate yeast strain collections in parallel. This method, Barcode analysis by Sequencing, or "Bar-seq," outperforms the current benchmark barcode microarray assay in terms of both dynamic range and throughput. When applied to a complex chemogenomic assay, Bar-seq quantitatively identifies drug targets, with performance superior to the benchmark microarray assay. We also show that Bar-seq is well-suited for a multiplex format. We completely re-sequenced and re-annotated the yeast deletion collection using deep sequencing, found that approximately 20% of the barcodes and common priming sequences varied from expectation, and used this revised list of barcode sequences to improve data quality. Together, this new assay and analysis routine provide a deep-sequencing-based toolkit for identifying gene-environment interactions on a genome-wide scale.
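The core counting step behind a barcode-sequencing assay can be sketched in Python. This is a hypothetical minimal version, not the authors' pipeline: it assumes the barcode occupies the first `barcode_len` bases of each read, requires exact matches, adds a pseudocount of 1, and omits normalization for sequencing depth. All names are illustrative.

```python
import math
from collections import Counter


def count_barcodes(reads, barcode_to_strain, barcode_len):
    """Count reads per strain by exact match of the read's leading barcode."""
    counts = Counter()
    for read in reads:
        strain = barcode_to_strain.get(read[:barcode_len])
        if strain is not None:
            counts[strain] += 1
    return counts


def log2_ratios(control, treated, pseudocount=1):
    """log2(treated / control) per strain; negative values suggest the strain
    was depleted under treatment."""
    strains = set(control) | set(treated)
    return {
        s: math.log2((treated[s] + pseudocount) / (control[s] + pseudocount))
        for s in strains
    }
```

The abstract's observation that about 20% of barcodes differed from expectation is a reminder that the `barcode_to_strain` table itself has to be verified by sequencing; exact matching against an outdated table would silently discard those reads.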

Polypeptide having or assisting in carbohydrate material degrading activity and uses thereof

DOEpatents

Schooneveld-Bergmans, Margot Elisabeth Francoise; Heijne, Wilbert Herman Marie; Los, Alrik Pieter

2016-02-16

The invention relates to a polypeptide which comprises the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 76% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 76% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
Polypeptide having beta-glucosidase activity and uses thereof

DOE Office of Scientific and Technical Information (OSTI.GOV)

Schoonneveld-Bergmans, Margot Elisabeth Francoise; Heijne, Wilbert Herman Marie; De Jong, Rene Marcel

The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 96% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 96% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
Polypeptide having swollenin activity and uses thereof

DOEpatents

Schoonneveld-Bergmans, Margot Elizabeth Francoise; Heijne, Wilbert Herman Marie; Vlasie, Monica D; Damveld, Robbertus Antonius

2015-11-04

The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 73% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 73% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
Whole genome sequence analysis of unidentified genetically modified papaya for development of a specific detection method.

PubMed

Nakamura, Kosuke; Kondo, Kazunari; Akiyama, Hiroshi; Ishigaki, Takumi; Noguchi, Akio; Katsumata, Hiroshi; Takasaki, Kazuto; Futo, Satoshi; Sakata, Kozue; Fukuda, Nozomi; Mano, Junichi; Kitta, Kazumi; Tanaka, Hidenori; Akashi, Ryo; Nishimaki-Mogami, Tomoko

2016-08-15

Identification of transgenic sequences in an unknown genetically modified (GM) papaya (Carica papaya L.) by whole genome sequence analysis was demonstrated. Whole genome sequence data were generated for a GM-positive fresh papaya fruit commodity detected in monitoring using real-time polymerase chain reaction (PCR). The sequences obtained were mapped against an open database for papaya genome sequence. Transgenic construct- and event-specific sequences were identified as a GM papaya developed to resist infection from a Papaya ringspot virus. Based on the transgenic sequences, a specific real-time PCR detection method for GM papaya applicable to various food commodities was developed. Whole genome sequence analysis enabled identifying unknown transgenic construct- and event-specific sequences in GM papaya and development of a reliable method for detecting them in papaya food commodities. Copyright © 2016 Elsevier Ltd. All rights reserved.
Polypeptide having beta-glucosidase activity and uses thereof

DOEpatents

Schooneveld-Bergmans, Margot Elisabeth Francoise; Heijne, Wilbert Herman Marie; De Jong, Rene Marcel; Damveld, Robbertus Antonius

2015-09-01

The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 70% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 70% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
Polypeptide having cellobiohydrolase activity and uses thereof

DOEpatents

Sagt, Cornelis Maria Jacobus; Schooneveld-Bergmans, Margot Elisabeth Francoise; Roubos, Johannes Andries; Los, Alrik Pieter

2015-09-15

The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 93% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 93% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
Polypeptide having acetyl xylan esterase activity and uses thereof

DOEpatents

Schoonneveld-Bergmans, Margot Elisabeth Francoise; Heijne, Wilbert Herman Marie; Los, Alrik Pieter

2015-10-20

The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 82% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 82% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
Polypeptide having carbohydrate degrading activity and uses thereof

DOEpatents

Schooneveld-Bergmans, Margot Elisabeth Francoise; Heijne, Wilbert Herman Marie; Vlasie, Monica Diana; Damveld, Robbertus Antonius

2015-08-18

The invention relates to a polypeptide comprising the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 73% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 73% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional polypeptide and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
GATA: A graphic alignment tool for comparative sequence analysis

DOE Office of Scientific and Technical Information (OSTI.GOV)

Nix, David A.; Eisen, Michael B.

2005-01-01

Several problems exist with current methods used to align DNA sequences for comparative sequence analysis. Most dynamic programming algorithms assume that conserved sequence elements are collinear. This assumption appears valid when comparing orthologous protein coding sequences. Functional constraints on proteins provide strong selective pressure against sequence inversions, and minimize sequence duplications and feature shuffling. For non-coding sequences this collinearity assumption is often invalid. For example, enhancers contain clusters of transcription factor binding sites that change in number, orientation, and spacing during evolution yet the enhancer retains its activity. Dotplot analysis is often used to estimate non-coding sequence relatedness. Yet dot plots do not actually align sequences and thus cannot account well for base insertions or deletions. Moreover, they lack an adequate statistical framework for comparing sequence relatedness and are limited to pairwise comparisons. Lastly, dot plots and dynamic programming text outputs fail to provide an intuitive means for visualizing DNA alignments.
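The dot-plot approach that the abstract criticizes can be shown in a minimal Python sketch, which also makes its limitation visible. This is a generic exact-match windowed dot plot, not GATA's method; the function name and the default window size are assumptions.

```python
def dot_plot(seq_a, seq_b, window=3):
    """Return (i, j) pairs where the `window`-long substrings starting at
    position i in seq_a and position j in seq_b are identical."""
    hits = []
    for i in range(len(seq_a) - window + 1):
        for j in range(len(seq_b) - window + 1):
            if seq_a[i:i + window] == seq_b[j:j + window]:
                hits.append((i, j))
    return hits


print(dot_plot("ACGTAC", "TTACGT"))
```

Hits that share the same value of j - i lie on one diagonal and mark a collinear match. Rearranged or shifted elements fall on different diagonals, and nothing in the plot itself joins them into an alignment or scores them statistically, which is the gap the abstract points out.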
Deep Sequencing to Identify the Causes of Viral Encephalitis

PubMed Central

Chan, Benjamin K.; Wilson, Theodore; Fischer, Kael F.; Kriesel, John D.

2014-01-01

Deep sequencing allows for a rapid, accurate characterization of microbial DNA and RNA sequences in many types of samples. Deep sequencing (also called next generation sequencing or NGS) is being developed to assist with the diagnosis of a wide variety of infectious diseases. In this study, seven frozen brain samples from deceased subjects with recent encephalitis were investigated. RNA from each sample was extracted, randomly reverse transcribed and sequenced. The sequence analysis was performed in a blinded fashion and confirmed with pathogen-specific PCR. This analysis successfully identified measles virus sequences in two brain samples and herpes simplex virus type-1 sequences in three brain samples. No pathogen was identified in the other two brain specimens. These results were concordant with pathogen-specific PCR and partially concordant with prior neuropathological examinations, demonstrating that deep sequencing can accurately identify viral infections in frozen brain tissue. PMID:24699691
Method for phosphorothioate antisense DNA sequencing by capillary electrophoresis with UV detection.

PubMed

Froim, D; Hopkins, C E; Belenky, A; Cohen, A S

1997-11-01

The progress of antisense DNA therapy demands development of reliable and convenient methods for sequencing short single-stranded oligonucleotides. A method of phosphorothioate antisense DNA sequencing analysis using UV detection coupled to capillary electrophoresis (CE) has been developed based on a modified chain termination sequencing method. The proposed method reduces the sequencing cost since it uses affordable CE-UV instrumentation and requires no labeling with minimal sample processing before analysis. Cycle sequencing with ThermoSequenase generates quantities of sequencing products that are readily detectable by UV. Discrimination of undesired components from sequencing products in the reaction mixture, previously accomplished by fluorescent or radioactive labeling, is now achieved by bringing concentrations of undesired components below the UV detection range which yields a 'clean', well defined sequence. UV detection coupled with CE offers additional conveniences for sequencing since it can be accomplished with commercially available CE-UV equipment and is readily amenable to automation.
Method for phosphorothioate antisense DNA sequencing by capillary electrophoresis with UV detection.

PubMed Central

Froim, D; Hopkins, C E; Belenky, A; Cohen, A S

1997-01-01

The progress of antisense DNA therapy demands development of reliable and convenient methods for sequencing short single-stranded oligonucleotides. A method of phosphorothioate antisense DNA sequencing analysis using UV detection coupled to capillary electrophoresis (CE) has been developed based on a modified chain termination sequencing method. The proposed method reduces the sequencing cost since it uses affordable CE-UV instrumentation and requires no labeling with minimal sample processing before analysis. Cycle sequencing with ThermoSequenase generates quantities of sequencing products that are readily detectable by UV. Discrimination of undesired components from sequencing products in the reaction mixture, previously accomplished by fluorescent or radioactive labeling, is now achieved by bringing concentrations of undesired components below the UV detection range which yields a 'clean', well defined sequence. UV detection coupled with CE offers additional conveniences for sequencing since it can be accomplished with commercially available CE-UV equipment and is readily amenable to automation. PMID:9336449
Orthogonal Polynomials Associated with Complementary Chain Sequences

NASA Astrophysics Data System (ADS)

Behera, Kiran Kumar; Sri Ranga, A.; Swaminathan, A.

2016-07-01

Using the minimal parameter sequence of a given chain sequence, we introduce the concept of complementary chain sequences, which we view as perturbations of chain sequences. Using the relation between these complementary chain sequences and the corresponding Verblunsky coefficients, the para-orthogonal polynomials and the associated Szegő polynomials are analyzed. Two illustrations, one involving Gaussian hypergeometric functions and the other involving Carathéodory functions are also provided. A connection between these two illustrations by means of complementary chain sequences is also observed.
The sequence measurement system of the IR camera

NASA Astrophysics Data System (ADS)

Geng, Ai-hui; Han, Hong-xia; Zhang, Hai-bo

2011-08-01

IR cameras are widely used in optoelectronic tracking, optoelectronic measurement, fire control, and optoelectronic countermeasures, but the output timing sequence of most IR cameras used in projects is complex, and the timing documentation supplied by the manufacturer is not detailed. Because subsequent image transmission and image processing systems need the detailed timing sequence of the IR camera, a sequence measurement system for IR cameras was designed and a detailed measurement procedure for the camera in use was carried out. FPGA programming combined with SignalTap online observation was applied in the sequence measurement system; the precise timing of the IR camera's output signal was obtained, and detailed documentation of the IR camera was supplied to the subsequent image transmission system, image processing system, and so on. The sequence measurement system comprises a CameraLink input interface, an LVDS input interface, an FPGA, a CameraLink output interface, and other parts, of which the FPGA is the key component. The system accepts both CameraLink and LVDS video signals, and because image processing cards and image memory cards generally use CameraLink as their input interface, the output of the sequence measurement system was designed as a CameraLink interface. The system thus measures the IR camera's timing sequence and, for some cameras, also performs interface conversion. Inside the FPGA, the sequence measurement program, pixel clock modification, SignalTap file configuration, and SignalTap online observation are integrated to achieve precise measurement of the IR camera.
The sequence measurement program, written in Verilog and combined with online observation in the SignalTap tool, counts the number of lines in one frame and the number of pixels in one line, and also measures the line offset and row offset of the image. Given the complex timing of the IR camera's output signal, the sequence measurement system accurately measures the timing sequence of the camera used in the project, supplies detailed timing documentation to subsequent systems such as the image processing and image transmission systems, and reports the concrete parameters of fval, lval, pixclk, line offset, and row offset. Experiments show that the sequence measurement system obtains precise measurement results and works stably, laying a foundation for the subsequent systems.
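The counting logic described in this abstract can be mimicked in software. The following is a minimal Python sketch, not the authors' Verilog: it assumes fval (frame valid) and lval (line valid) are available as lists of 0/1 samples taken once per pixel clock, and the function name measure_frame and the quantity first_line_delay (clocks from the rising edge of fval to the first rising edge of lval) are hypothetical stand-ins for the counters and offsets the abstract mentions.

```python
def measure_frame(fval, lval):
    """Count lines and pixels per line in the first frame of a sampled signal.

    fval, lval: equal-length sequences of 0/1, one sample per pixel clock
    (an assumed software stand-in for signals observed inside the FPGA).
    Returns (lines, pixels_per_line, first_line_delay).
    """
    lines = 0
    pixels_per_line = []
    first_line_delay = None
    in_frame = False
    prev_f = prev_l = 0
    clocks = 0  # pixel clocks elapsed since the frame started
    for f, l in zip(fval, lval):
        if f and not prev_f:
            # Rising edge of fval: a frame starts here.
            in_frame, clocks, prev_l = True, 0, 0
        elif in_frame and not f:
            # Falling edge of fval: the frame is complete.
            break
        if in_frame:
            if l and not prev_l:
                # Rising edge of lval: a new line starts.
                lines += 1
                pixels_per_line.append(0)
                if first_line_delay is None:
                    first_line_delay = clocks
            if l:
                pixels_per_line[-1] += 1  # one valid pixel per clock
            clocks += 1
        prev_f, prev_l = f, l
    return lines, pixels_per_line, first_line_delay
```

Feeding it a synthetic frame of three lines of four pixels each, with one idle clock before the first line, should report three lines, four pixels per line, and a delay of one clock.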
Evaluation of 16S rRNA amplicon sequencing using two next-generation sequencing technologies for phylogenetic analysis of the rumen bacterial community in steers

USDA-ARS's Scientific Manuscript database

Next generation sequencing technologies have vastly changed the approach of sequencing of the 16S rRNA gene for studies in microbial ecology. Three distinct technologies are available for large-scale 16S sequencing. All three are subject to biases introduced by sequencing error rates, amplificatio...
Noncoding sequence classification based on wavelet transform analysis: part I

NASA Astrophysics Data System (ADS)

Paredes, O.; Strojnik, M.; Romo-Vázquez, R.; Vélez Pérez, H.; Ranta, R.; Garcia-Torales, G.; Scholl, M. K.; Morales, J. A.

2017-09-01

DNA sequences in the human genome can be divided into coding and noncoding sequences. Coding sequences are those that are read during transcription. The identification of coding sequences has been widely reported in the literature because of their much-studied periodicity. Noncoding sequences represent the majority of the human genome. They play an important role in gene regulation and in differentiation among cells. However, noncoding sequences do not exhibit periodicities that correlate with their functions. The ENCODE (Encyclopedia of DNA Elements) and Epigenomic Roadmap projects have cataloged the human noncoding sequences by specific function. We study characteristics of noncoding sequences with wavelet analysis of genomic signals.
Full genome sequence of Rocio virus reveal substantial variations from the prototype Rocio virus SPH 34675 sequence.

PubMed

Setoh, Yin Xiang; Amarilla, Alberto A; Peng, Nias Y; Slonchak, Andrii; Periasamy, Parthiban; Figueiredo, Luiz T M; Aquino, Victor H; Khromykh, Alexander A

2018-01-01

Rocio virus (ROCV) is an arbovirus belonging to the genus Flavivirus, family Flaviviridae. We present an updated sequence of ROCV strain SPH 34675 (GenBank: AY632542.4), the only available full genome sequence prior to this study. Using next-generation sequencing of the entire genome, we reveal substantial sequence variation from the prototype sequence, with 30 nucleotide differences amounting to 14 amino acid changes, as well as significant changes to predicted 3'UTR RNA structures. Our results present an updated and corrected sequence of a potential emerging human-virulent flavivirus uniquely indigenous to Brazil (GenBank: MF461639).
What is a melody? On the relationship between pitch and brightness of timbre.

PubMed

Cousineau, Marion; Carcagno, Samuele; Demany, Laurent; Pressnitzer, Daniel

2013-01-01

Previous studies showed that the perceptual processing of sound sequences is more efficient when the sounds vary in pitch than when they vary in loudness. We show here that sequences of sounds varying in brightness of timbre are processed with the same efficiency as pitch sequences. The sounds used consisted of two simultaneous pure tones one octave apart, and the listeners' task was to make same/different judgments on pairs of sequences varying in length (one, two, or four sounds). In one condition, brightness of timbre was varied within the sequences by changing the relative level of the two pure tones. In other conditions, pitch was varied by changing fundamental frequency, or loudness was varied by changing the overall level. In all conditions, only two possible sounds could be used in a given sequence, and these two sounds were equally discriminable. When sequence length increased from one to four, discrimination performance decreased substantially for loudness sequences, but to a smaller extent for brightness sequences and pitch sequences. In the latter two conditions, sequence length had a similar effect on performance. These results suggest that the processes dedicated to pitch and brightness analysis, when probed with a sequence-discrimination task, share unexpected similarities.
Prefrontal neural correlates of memory for sequences.

PubMed

Averbeck, Bruno B; Lee, Daeyeol

2007-02-28

The sequence of actions appropriate to solve a problem often needs to be discovered by trial and error and recalled in the future when faced with the same problem. Here, we show that when monkeys had to discover and then remember a sequence of decisions across trials, ensembles of prefrontal cortex neurons reflected the sequence of decisions the animal would make throughout the interval between trials. This signal could reflect either an explicit memory process or a sequence-planning process that begins far in advance of the actual sequence execution. This finding extended to error trials such that, when the neural activity during the intertrial interval specified the wrong sequence, the animal also attempted to execute an incorrect sequence. More specifically, we used a decoding analysis to predict the sequence the monkey was planning to execute at the end of the fore-period, just before sequence execution. When this analysis was applied to error trials, we were able to predict where in the sequence the error would occur, up to three movements into the future. This suggests that prefrontal neural activity can retain information about sequences between trials, and that regardless of whether information is remembered correctly or incorrectly, the prefrontal activity veridically reflects the animal's action plan.

Local alignment of two-base encoded DNA sequence

PubMed Central

Homer, Nils; Merriman, Barry; Nelson, Stanley F

2009-01-01

Background DNA sequence comparison is based on optimal local alignment of two sequences using a similarity score. However, some new DNA sequencing technologies do not directly measure the base sequence, but rather an encoded form, such as the two-base encoding considered here. In order to compare such data to a reference sequence, the data must be decoded into sequence. The decoding is deterministic, but the possibility of measurement errors requires searching among all possible error modes and resulting alignments to achieve an optimal balance of fewer errors versus greater sequence similarity. Results We present an extension of the standard dynamic programming method for local alignment, which simultaneously decodes the data and performs the alignment, maximizing a similarity score based on a weighted combination of errors and edits, and allowing an affine gap penalty. We also present simulations that demonstrate the performance characteristics of our two base encoded alignment method and contrast those with standard DNA sequence alignment under the same conditions. Conclusion The new local alignment algorithm for two-base encoded data has substantial power to properly detect and correct measurement errors while identifying underlying sequence variants, and facilitating genome re-sequencing efforts based on this form of sequence data. PMID:19508732
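To make the decoding problem in this abstract concrete, here is a minimal Python sketch of a two-base code. It assumes the XOR-style color code commonly described for two-base (color space) sequencing, which the abstract itself does not spell out, and the function names are hypothetical. It does not implement the authors' alignment algorithm; it only shows why a single measurement error corrupts every base decoded after it, whereas a true single-base variant changes exactly two adjacent colors.

```python
BASES = "ACGT"  # 2-bit codes: A=0, C=1, G=2, T=3 (assumed convention)

def encode_colors(seq):
    """Return (first_base, colors); each color is the XOR of adjacent base codes."""
    codes = [BASES.index(b) for b in seq]
    return seq[0], [a ^ b for a, b in zip(codes, codes[1:])]

def decode_colors(first_base, colors):
    """Deterministically rebuild the base sequence from the first base and colors."""
    code = BASES.index(first_base)
    out = [first_base]
    for c in colors:
        code ^= c  # each color tells how the base changes from its predecessor
        out.append(BASES[code])
    return "".join(out)

first, colors = encode_colors("ACGT")       # colors == [1, 3, 1]
clean = decode_colors(first, colors)        # "ACGT"
corrupted = decode_colors(first, [1, 2, 1]) # one color error -> "ACTG"
```

Under this code, changing the second base of ACGT to T gives colors [3, 1, 1], which differ from [1, 3, 1] in two adjacent positions. That signature is what lets an aligner separate real variants from isolated measurement errors.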
Application of representational difference analysis to identify genomic differences between Bradyrhizobium elkanii and B. japonicum species.

PubMed

Soares, René Arderius; Passaglia, Luciane Maria Pereira

2010-10-01

Bradyrhizobium elkanii is successfully used in the formulation of commercial inoculants and, together with B. japonicum, it fully supplies the plant nitrogen demands. Despite the similarity between B. japonicum and B. elkanii species, several works demonstrated genetic and physiological differences between them. In this work Representational Difference Analysis (RDA) was used for genomic comparison between B. elkanii SEMIA 587, a crop inoculant strain, and B. japonicum USDA 110, a reference strain. Two hundred sequences were obtained. From these, 46 sequences belonged exclusively to the genome of B. elkanii strain, and 154 showed similarity to sequences from B. japonicum genome. From the 46 sequences with no similarity to sequences from B. japonicum, 39 showed no similarity to sequences in public databases and seven showed similarity to sequences of genes coding for known proteins. These seven sequences were divided in three groups: similar to sequences from other Bradyrhizobium strains, similar to sequences from other nitrogen-fixing bacteria, and similar to sequences from non nitrogen-fixing bacteria. These new sequences could be used as DNA markers in order to investigate the rates of genetic material gain and loss in natural Bradyrhizobium strains.
Augmented brain function by coordinated reset stimulation with slowly varying sequences.

PubMed

Zeitler, Magteld; Tass, Peter A

2015-01-01

Several brain disorders are characterized by abnormally strong neuronal synchrony. Coordinated Reset (CR) stimulation was developed to selectively counteract abnormal neuronal synchrony by desynchronization. For this, phase resetting stimuli are delivered to different subpopulations in a timely coordinated way. In neural networks with spike timing-dependent plasticity CR stimulation may eventually lead to an anti-kindling, i.e., an unlearning of abnormal synaptic connectivity and abnormal synchrony. The spatiotemporal sequence by which all stimulation sites are stimulated exactly once is called the stimulation site sequence, or briefly sequence. So far, in simulations, pre-clinical and clinical applications CR was applied either with fixed sequences or rapidly varying sequences (RVS). In this computational study we show that appropriate repetition of the sequence with occasional random switching to the next sequence may significantly improve the anti-kindling effect of CR. To this end, a sequence is applied many times before randomly switching to the next sequence. This new method is called SVS CR stimulation, i.e., CR with slowly varying sequences. In a neuronal network with strong short-range excitatory and weak long-range inhibitory dynamic couplings SVS CR stimulation turns out to be superior to CR stimulation with fixed sequences or RVS.
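The three sequence regimes named in this abstract (fixed, rapidly varying, and slowly varying) can be sketched as a stimulation schedule. The Python below is a rough illustration under assumptions not stated in the abstract: a new sequence is drawn as a uniformly random permutation of the sites, and switching happens after a fixed number of repetitions. The function name svs_schedule and its parameters are hypothetical, and no neuronal dynamics are modeled.

```python
import random

def svs_schedule(n_sites, n_cycles, repeats, seed=0):
    """Return one site sequence per stimulation cycle.

    Each sequence is a permutation of range(n_sites), so every site is
    stimulated exactly once per cycle. The same sequence is reused for
    `repeats` consecutive cycles before a new one is drawn at random.
    """
    rng = random.Random(seed)
    sites = list(range(n_sites))
    schedule = []
    current = sites
    for cycle in range(n_cycles):
        if cycle % repeats == 0:
            current = rng.sample(sites, n_sites)  # new random permutation
        schedule.append(list(current))
    return schedule

slow = svs_schedule(n_sites=4, n_cycles=6, repeats=3)   # slowly varying
rapid = svs_schedule(n_sites=4, n_cycles=6, repeats=1)  # new sequence every cycle
fixed = svs_schedule(n_sites=4, n_cycles=6, repeats=6)  # one sequence throughout
```

With repeats set to 1 the schedule behaves like the rapidly varying case, and with repeats at least n_cycles it reduces to a fixed sequence, so the single parameter spans the regimes the study compares.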
Augmented brain function by coordinated reset stimulation with slowly varying sequences

PubMed Central

Zeitler, Magteld; Tass, Peter A.

2015-01-01

Several brain disorders are characterized by abnormally strong neuronal synchrony. Coordinated Reset (CR) stimulation was developed to selectively counteract abnormal neuronal synchrony by desynchronization. For this, phase resetting stimuli are delivered to different subpopulations in a timely coordinated way. In neural networks with spike timing-dependent plasticity CR stimulation may eventually lead to an anti-kindling, i.e., an unlearning of abnormal synaptic connectivity and abnormal synchrony. The spatiotemporal sequence by which all stimulation sites are stimulated exactly once is called the stimulation site sequence, or briefly sequence. So far, in simulations, pre-clinical and clinical applications CR was applied either with fixed sequences or rapidly varying sequences (RVS). In this computational study we show that appropriate repetition of the sequence with occasional random switching to the next sequence may significantly improve the anti-kindling effect of CR. To this end, a sequence is applied many times before randomly switching to the next sequence. This new method is called SVS CR stimulation, i.e., CR with slowly varying sequences. In a neuronal network with strong short-range excitatory and weak long-range inhibitory dynamic couplings SVS CR stimulation turns out to be superior to CR stimulation with fixed sequences or RVS. PMID:25873867
A Reference Viral Database (RVDB) To Enhance Bioinformatics Analysis of High-Throughput Sequencing for Novel Virus Detection

PubMed Central

Goodacre, Norman; Aljanahi, Aisha; Nandakumar, Subhiksha; Mikailov, Mike

2018-01-01

ABSTRACT Detection of distantly related viruses by high-throughput sequencing (HTS) is bioinformatically challenging because of the lack of a public database containing all viral sequences, without abundant nonviral sequences, which can extend runtime and obscure viral hits. Our reference viral database (RVDB) includes all viral, virus-related, and virus-like nucleotide sequences (excluding bacterial viruses), regardless of length, and with overall reduced cellular sequences. Semantic selection criteria (SEM-I) were used to select viral sequences from GenBank, resulting in a first-generation viral database (VDB). This database was manually and computationally reviewed, resulting in refined, semantic selection criteria (SEM-R), which were applied to a new download of updated GenBank sequences to create a second-generation VDB. Viral entries in the latter were clustered at 98% by CD-HIT-EST to reduce redundancy while retaining high viral sequence diversity. The viral identity of the clustered representative sequences (creps) was confirmed by BLAST searches in NCBI databases and HMMER searches in PFAM and DFAM databases. The resulting RVDB contained a broad representation of viral families, sequence diversity, and a reduced cellular content; it includes full-length and partial sequences and endogenous nonretroviral elements, endogenous retroviruses, and retrotransposons. Testing of RVDBv10.2, with an in-house HTS transcriptomic data set indicated a significantly faster run for virus detection than interrogating the entirety of the NCBI nonredundant nucleotide database, which contains all viral sequences but also nonviral sequences. RVDB is publically available for facilitating HTS analysis, particularly for novel virus detection. It is meant to be updated on a regular basis to include new viral sequences added to GenBank. 
IMPORTANCE To facilitate bioinformatics analysis of high-throughput sequencing (HTS) data for the detection of both known and novel viruses, we have developed a new reference viral database (RVDB) that provides a broad representation of different virus species from eukaryotes by including all viral, virus-like, and virus-related sequences (excluding bacteriophages), regardless of their size. In particular, RVDB contains endogenous nonretroviral elements, endogenous retroviruses, and retrotransposons. Sequences were clustered to reduce redundancy while retaining high viral sequence diversity. A particularly useful feature of RVDB is the reduction of cellular sequences, which can enhance the run efficiency of large transcriptomic and genomic data analysis and increase the specificity of virus detection. PMID:29564396
A Reference Viral Database (RVDB) To Enhance Bioinformatics Analysis of High-Throughput Sequencing for Novel Virus Detection.

PubMed

Goodacre, Norman; Aljanahi, Aisha; Nandakumar, Subhiksha; Mikailov, Mike; Khan, Arifa S

2018-01-01

Detection of distantly related viruses by high-throughput sequencing (HTS) is bioinformatically challenging because of the lack of a public database containing all viral sequences, without abundant nonviral sequences, which can extend runtime and obscure viral hits. Our reference viral database (RVDB) includes all viral, virus-related, and virus-like nucleotide sequences (excluding bacterial viruses), regardless of length, and with overall reduced cellular sequences. Semantic selection criteria (SEM-I) were used to select viral sequences from GenBank, resulting in a first-generation viral database (VDB). This database was manually and computationally reviewed, resulting in refined, semantic selection criteria (SEM-R), which were applied to a new download of updated GenBank sequences to create a second-generation VDB. Viral entries in the latter were clustered at 98% by CD-HIT-EST to reduce redundancy while retaining high viral sequence diversity. The viral identity of the clustered representative sequences (creps) was confirmed by BLAST searches in NCBI databases and HMMER searches in PFAM and DFAM databases. The resulting RVDB contained a broad representation of viral families, sequence diversity, and a reduced cellular content; it includes full-length and partial sequences and endogenous nonretroviral elements, endogenous retroviruses, and retrotransposons. Testing of RVDBv10.2, with an in-house HTS transcriptomic data set indicated a significantly faster run for virus detection than interrogating the entirety of the NCBI nonredundant nucleotide database, which contains all viral sequences but also nonviral sequences. RVDB is publically available for facilitating HTS analysis, particularly for novel virus detection. It is meant to be updated on a regular basis to include new viral sequences added to GenBank. 
IMPORTANCE To facilitate bioinformatics analysis of high-throughput sequencing (HTS) data for the detection of both known and novel viruses, we have developed a new reference viral database (RVDB) that provides a broad representation of different virus species from eukaryotes by including all viral, virus-like, and virus-related sequences (excluding bacteriophages), regardless of their size. In particular, RVDB contains endogenous nonretroviral elements, endogenous retroviruses, and retrotransposons. Sequences were clustered to reduce redundancy while retaining high viral sequence diversity. A particularly useful feature of RVDB is the reduction of cellular sequences, which can enhance the run efficiency of large transcriptomic and genomic data analysis and increase the specificity of virus detection.
Memory for sequences of events impaired in typical aging.

PubMed

Allen, Timothy A; Morris, Andrea M; Stark, Shauna M; Fortin, Norbert J; Stark, Craig E L

2015-03-01

Typical aging is associated with diminished episodic memory performance. To improve our understanding of the fundamental mechanisms underlying this age-related memory deficit, we previously developed an integrated, cross-species approach to link converging evidence from human and animal research. This novel approach focuses on the ability to remember sequences of events, an important feature of episodic memory. Unlike existing paradigms, this task is nonspatial, nonverbal, and can be used to isolate different cognitive processes that may be differentially affected in aging. Here, we used this task to make a comprehensive comparison of sequence memory performance between younger (18-22 yr) and older adults (62-86 yr). Specifically, participants viewed repeated sequences of six colored, fractal images and indicated whether each item was presented "in sequence" or "out of sequence." Several out of sequence probe trials were used to provide a detailed assessment of sequence memory, including: (i) repeating an item from earlier in the sequence ("Repeats"; e.g., AB A: DEF), (ii) skipping ahead in the sequence ("Skips"; e.g., AB D: DEF), and (iii) inserting an item from a different sequence into the same ordinal position ("Ordinal Transfers"; e.g., AB 3: DEF). We found that older adults performed as well as younger controls when tested on well-known and predictable sequences, but were severely impaired when tested using novel sequences. Importantly, overall sequence memory performance in older adults steadily declined with age, a decline not detected with other measures (RAVLT or BPS-O). We further characterized this deficit by showing that performance of older adults was severely impaired on specific probe trials that required detailed knowledge of the sequence (Skips and Ordinal Transfers), and was associated with a shift in their underlying mnemonic representation of the sequences. 
Collectively, these findings provide unambiguous evidence that the capacity to remember sequences of events is fundamentally affected by typical aging. © 2015 Allen et al.; Published by Cold Spring Harbor Laboratory Press.
Identification of a precursor genomic segment that provided a sequence unique to glycophorin B and E genes

DOE Office of Scientific and Technical Information (OSTI.GOV)

Onda, M.; Kudo, S.; Fukuda, M.

Human glycophorin A, B, and E (GPA, GPB, and GPE) genes belong to a gene family located at the long arm of chromosome 4. These three genes are homologous from the 5'-flanking sequence to the Alu sequence, which is 1 kb downstream from the exon encoding the transmembrane domain. Analysis of the Alu sequence and flanking direct repeat sequences suggested that the GPA gene most closely resembles the ancestral gene, whereas the GPB and GPE genes arose by homologous recombination within the Alu sequence, acquiring 3' sequences from an unrelated precursor genomic segment. Here the authors describe the identification of this putative precursor genomic segment. A human genomic library was screened by using the sequence of the 3' region of the GPB gene as a probe. The genomic clones isolated were found to contain an Alu sequence that appeared to be involved in the recombination. Downstream from the Alu sequence, the nucleotide sequence of the precursor genomic segment is almost identical to that of the GPB or GPE gene. In contrast, the upstream sequence of the genomic segment differs entirely from that of the GPA, GPB, and GPE genes. Conservation of the direct repeats flanking the Alu sequence of the genomic segment strongly suggests that the sequence of this genomic segment has been maintained during evolution. This identified genomic segment was found to reside downstream from the GPA gene by both gene mapping and in situ chromosomal localization. The precursor genomic segment was also identified in the orangutan genome, which is known to lack GPB and GPE genes. These results indicate that one of the duplicated ancestral glycophorin genes acquired a unique 3' sequence by unequal crossing-over through its Alu sequence and the further downstream Alu sequence present in the duplicated gene. Further duplication and divergence of this gene yielded the GPB and GPE genes. 37 refs., 5 figs.
Integrated sequence stratigraphy of the postimpact sediments from the Eyreville core holes, Chesapeake Bay impact structure inner basin

USGS Publications Warehouse

Browning, J.V.; Miller, K.G.; McLaughlin, P.P.; Edwards, L.E.; Kulpecz, A.A.; Powars, D.S.; Wade, B.S.; Feigenson, M.D.; Wright, J.D.

2009-01-01

The Eyreville core holes provide the first continuously cored record of postimpact sequences from within the deepest part of the central Chesapeake Bay impact crater. We analyzed the upper Eocene to Pliocene postimpact sediments from the Eyreville A and C core holes for lithology (semiquantitative measurements of grain size and composition), sequence stratigraphy, and chronostratigraphy. Age is based primarily on Sr isotope stratigraphy supplemented by biostratigraphy (dinocysts, nannofossils, and planktonic foraminifers); age resolution is approximately ±0.5 Ma for early Miocene sequences and approximately ±1.0 Ma for younger and older sequences. Eocene-lower Miocene sequences are subtle, upper middle to lower upper Miocene sequences are more clearly distinguished, and upper Miocene-Pliocene sequences display a distinct facies pattern within sequences. We recognize two upper Eocene, two Oligocene, nine Miocene, three Pliocene, and one Pleistocene sequence and correlate them with those in New Jersey and Delaware. The upper Eocene through Pleistocene strata at Eyreville record changes from: (1) rapidly deposited, extremely fine-grained Eocene strata that probably represent two sequences deposited in a deep (>200 m) basin; to (2) highly dissected Oligocene (two very thin sequences) to lower Miocene (three thin sequences) with a long hiatus; to (3) a thick, rapidly deposited (43-73 m/Ma), very fine-grained, biosiliceous middle Miocene (16.5-14 Ma) section divided into three sequences (V5-V3) deposited in middle neritic paleoenvironments; to (4) a 4.5-Ma-long hiatus (12.8-8.3 Ma); to (5) sandy, shelly upper Miocene to Pliocene strata (8.3-2.0 Ma) divided into six sequences deposited in shelf and shoreface environments; and, last, to (6) a sandy middle Pleistocene paralic sequence (~400 ka). The Eyreville cores thus record the filling of a deep impact-generated basin where the timing of sequence boundaries is heavily influenced by eustasy. © 2009 The Geological Society of America.
Are commercial providers a viable option for clinical bacterial sequencing?

PubMed

Raven, Kathy; Blane, Beth; Churcher, Carol; Parkhill, Julian; Peacock, Sharon J

2018-04-05

Bacterial whole-genome sequencing in the clinical setting has the potential to bring major improvements to infection control and clinical practice. Sequencing instruments are not currently available in the majority of routine microbiology laboratories worldwide, but an alternative is to use external sequencing providers. To foster discussion around this we investigated whether send-out services were a viable option. Four providers offering MiSeq sequencing were selected based on cost and evaluated based on the service provided and sequence data quality. DNA was prepared from five methicillin-resistant Staphylococcus aureus (MRSA) isolates, four of which were investigated during a previously published outbreak in the UK together with a reference MRSA isolate (ST22 HO 5096 0412). Cost of sequencing per isolate ranged from £155 to £342 and turnaround times from DNA postage to arrival of sequence data ranged from 12 to 63 days. Comparison of commercially generated genomes against the original sequence data demonstrated very high concordance, with no more than one single nucleotide polymorphism (SNP) difference on core genome mapping between the original sequences and the new sequence for all four providers. Multilocus sequence type could not be assigned based on assembly for the two cheapest sequence providers due to fragmented assemblies probably caused by a lower output of sequence data per isolate. Our results indicate that external providers returned highly accurate genome data, but that improvements are required in turnaround time to make this a viable option for use in clinical practice.
Universal sequence map (USM) of arbitrary discrete sequences

PubMed Central

2002-01-01

Background For over a decade the idea of representing biological sequences in a continuous coordinate space has maintained its appeal but not been fully realized. The basic idea is that any sequence of symbols may define trajectories in the continuous space conserving all its statistical properties. Ideally, such a representation would allow scale independent sequence analysis – without the context of fixed memory length. A simple example would consist of being able to infer the homology between two sequences solely by comparing the coordinates of any two homologous units. Results We have successfully identified such an iterative function for bijective mapping ψ of discrete sequences into objects of continuous state space that enable scale-independent sequence analysis. The technique, named Universal Sequence Mapping (USM), is applicable to sequences with an arbitrary length and arbitrary number of unique units and generates a representation where map distance estimates sequence similarity. The novel USM procedure is based on earlier work by these and other authors on the properties of Chaos Game Representation (CGR). The latter enables the representation of 4 unit type sequences (like DNA) as an order free Markov Chain transition table. The properties of USM are illustrated with test data and can be verified for other data by using the accompanying web-based tool: http://bioinformatics.musc.edu/~jonas/usm/. Conclusions USM is shown to enable a statistical mechanics approach to sequence analysis. The scale independent representation frees sequence analysis from the need to assume a memory length in the investigation of syntactic rules. PMID:11895567
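The iterative mapping idea can be illustrated with the classic Chaos Game Representation for DNA, on which the abstract says USM is based. The Python sketch below is only the well-known CGR iteration (move halfway toward the corner assigned to each symbol), not the USM procedure itself. The corner assignment, the starting point, and the function names are assumptions chosen for illustration.

```python
# Assumed corner assignment on the unit square; conventions vary between papers.
CORNERS = {"A": (0.0, 0.0), "C": (0.0, 1.0), "G": (1.0, 1.0), "T": (1.0, 0.0)}

def cgr_coordinates(seq, start=(0.5, 0.5)):
    """Return the CGR point reached after each symbol of seq."""
    x, y = start
    points = []
    for symbol in seq:
        cx, cy = CORNERS[symbol]
        x, y = (x + cx) / 2.0, (y + cy) / 2.0  # move halfway to the corner
        points.append((x, y))
    return points

def map_distance(p, q):
    """Largest coordinate difference between two CGR points."""
    return max(abs(p[0] - q[0]), abs(p[1] - q[1]))

a = cgr_coordinates("ACGTTGCA")[-1]
b = cgr_coordinates("TTTTTGCA")[-1]
d = map_distance(a, b)  # the two sequences share their last 5 symbols
```

Because each step halves the distance to the previous point, two sequences that end in the same k symbols land within 2^-k of each other in every coordinate. This is the sense in which map distance can estimate similarity without fixing a memory length in advance.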
Database-independent Protein Sequencing (DiPS) Enables Full-length de Novo Protein and Antibody Sequence Determination.

PubMed

Savidor, Alon; Barzilay, Rotem; Elinger, Dalia; Yarden, Yosef; Lindzen, Moshit; Gabashvili, Alexandra; Adiv Tal, Ophir; Levin, Yishai

2017-06-01

Traditional "bottom-up" proteomic approaches use proteolytic digestion, LC-MS/MS, and database searching to elucidate peptide identities and their parent proteins. Protein sequences absent from the database cannot be identified, and even if present in the database, complete sequence coverage is rarely achieved even for the most abundant proteins in the sample. Thus, sequencing of unknown proteins such as antibodies or constituents of metaproteomes remains a challenging problem. To date, there is no available method for full-length protein sequencing, independent of a reference database, in high throughput. Here, we present Database-independent Protein Sequencing, a method for unambiguous, rapid, database-independent, full-length protein sequencing. The method is a novel combination of non-enzymatic, semi-random cleavage of the protein, LC-MS/MS analysis, peptide de novo sequencing, extraction of peptide tags, and their assembly into a consensus sequence using an algorithm named "Peptide Tag Assembler." As proof-of-concept, the method was applied to samples of three known proteins representing three size classes and to a previously un-sequenced, clinically relevant monoclonal antibody. Excluding leucine/isoleucine and glutamic acid/deamidated glutamine ambiguities, end-to-end full-length de novo sequencing was achieved with 99-100% accuracy for all benchmarking proteins and the antibody light chain. Accuracy of the sequenced antibody heavy chain, including the entire variable region, was also 100%, but there was a 23-residue gap in the constant region sequence. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.
Sequence quality analysis tool for HIV type 1 protease and reverse transcriptase.

PubMed

Delong, Allison K; Wu, Mingham; Bennett, Diane; Parkin, Neil; Wu, Zhijin; Hogan, Joseph W; Kantor, Rami

2012-08-01

Access to antiretroviral therapy is increasing globally and drug resistance evolution is anticipated. Currently, protease (PR) and reverse transcriptase (RT) sequence generation is increasing, including the use of in-house sequencing assays, and quality assessment prior to sequence analysis is essential. We created a computational HIV PR/RT Sequence Quality Analysis Tool (SQUAT) that runs in the R statistical environment. Sequence quality thresholds are calculated from a large dataset (46,802 PR and 44,432 RT sequences) from the published literature (http://hivdb.Stanford.edu). Nucleic acid sequences are read into SQUAT, identified, aligned, and translated. Nucleic acid sequences are flagged if they have >five 1-2-base insertions; >one 3-base insertion; >one deletion; >six PR or >18 RT ambiguous bases; >three consecutive PR or >four RT nucleic acid mutations; >zero stop codons; >three PR or >six RT ambiguous amino acids; >three consecutive PR or >four RT amino acid mutations; >zero unique amino acids; or <0.5% or >15% genetic distance from another submitted sequence. Thresholds are user modifiable. SQUAT output includes a summary report with detailed comments for troubleshooting of flagged sequences, histograms of pairwise genetic distances, neighbor joining phylogenetic trees, and aligned nucleic and amino acid sequences. SQUAT is a stand-alone, free, web-independent tool to ensure use of high-quality HIV PR/RT sequences in interpretation and reporting of drug resistance, while increasing awareness and expertise and facilitating troubleshooting of potentially problematic sequences.
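The threshold-based flagging described in this abstract can be sketched as a simple rule table. The Python below is an illustration only and is not SQUAT's code (SQUAT runs in R). It covers three of the listed rules, using the cut-offs quoted in the abstract, and assumes the per-sequence counts have already been computed. The names DEFAULT_THRESHOLDS and flag_sequence are hypothetical.

```python
# Upper limits taken from the abstract; a sequence is flagged when a count
# exceeds its limit. Only a subset of the listed rules is shown here.
DEFAULT_THRESHOLDS = {
    "PR": {"ambiguous_bases": 6, "stop_codons": 0, "deletions": 1},
    "RT": {"ambiguous_bases": 18, "stop_codons": 0, "deletions": 1},
}

def flag_sequence(counts, gene, thresholds=DEFAULT_THRESHOLDS):
    """Return the names of the rules that a sequence violates.

    counts: dict of precomputed counts for one sequence, for example
            {"ambiguous_bases": 7, "stop_codons": 0}.
    gene:   "PR" or "RT"; the limits differ between the two regions.
    """
    limits = thresholds[gene]
    return [name for name, limit in limits.items() if counts.get(name, 0) > limit]

pr_flags = flag_sequence({"ambiguous_bases": 7}, "PR")  # exceeds the PR limit of six
rt_flags = flag_sequence({"ambiguous_bases": 7}, "RT")  # within the RT limit of 18
```

Keeping the limits in a plain dictionary mirrors the abstract's note that thresholds are user modifiable: a caller can pass an adjusted copy rather than editing the function.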
Effects of pre- and pro-sequence of thaumatin on the secretion by Pichia pastoris.

PubMed

Ide, Nobuyuki; Masuda, Tetsuya; Kitabatake, Naofumi

2007-11-23

Thaumatin is a 22-kDa sweet-tasting protein containing eight disulfide bonds. When thaumatin is expressed in Pichia pastoris using the thaumatin cDNA fused with both the alpha-factor signal sequence and the Kex2 protease cleavage site from Saccharomyces cerevisiae, the N-terminal sequence of the secreted thaumatin molecule is not processed correctly. To examine the role of the thaumatin cDNA-encoded N-terminal pre-sequence and C-terminal pro-sequence on the processing of thaumatin and efficiency of thaumatin production in P. pastoris, four expression plasmids with different pre-sequence and pro-sequence were constructed and transformed into P. pastoris. The transformants containing pre-thaumatin gene that has the native plant signal, secreted thaumatin molecules in the medium. The N-terminal amino acid sequence of the secreted thaumatin molecule was processed correctly. The production yield of thaumatin was not affected by the C-terminal pro-sequence, and the pro-sequence was not processed in P. pastoris, indicating that pro-sequence is not necessary for thaumatin synthesis.
From Conventional to Next Generation Sequencing of Epstein-Barr Virus Genomes.

PubMed

Kwok, Hin; Chiang, Alan Kwok Shing

2016-02-24

Genomic sequences of Epstein-Barr virus (EBV) have been of interest because the virus is associated with cancers, such as nasopharyngeal carcinoma, and conditions such as infectious mononucleosis. The progress of whole-genome EBV sequencing has been limited by the inefficiency and cost of the first-generation sequencing technology. With the advancement of next-generation sequencing (NGS) and target enrichment strategies, an increasing number of EBV genomes have been published. These genomes were sequenced using different approaches, either with or without EBV DNA enrichment. This review provides an overview of the EBV genomes published to date, and a description of the sequencing technology and bioinformatic analyses employed in generating these sequences. We further explored ways through which the quality of sequencing data can be improved, such as using DNA oligos for capture hybridization, and longer insert size and read length in the sequencing runs. These advances will enable large-scale genomic sequencing of EBV which will facilitate a better understanding of the genetic variations of EBV in different geographic regions and discovery of potentially pathogenic variants in specific diseases.
Mass spectrometry-based protein identification by integrating de novo sequencing with database searching.

PubMed

Wang, Penghao; Wilson, Susan R

2013-01-01

Mass spectrometry-based protein identification is a very challenging task. The main identification approaches include de novo sequencing and database searching. Both approaches have shortcomings, so an integrative approach has been developed. The integrative approach firstly infers partial peptide sequences, known as tags, directly from tandem spectra through de novo sequencing, and then puts these sequences into a database search to see if a close peptide match can be found. However the current implementation of this integrative approach has several limitations. Firstly, simplistic de novo sequencing is applied and only very short sequence tags are used. Secondly, most integrative methods apply an algorithm similar to BLAST to search for exact sequence matches and do not accommodate sequence errors well. Thirdly, by applying these methods the integrated de novo sequencing makes a limited contribution to the scoring model which is still largely based on database searching. We have developed a new integrative protein identification method which can integrate de novo sequencing more efficiently into database searching. Evaluated on large real datasets, our method outperforms popular identification methods.
Metagenome assembly through clustering of next-generation sequencing data using protein sequences.

PubMed

Sim, Mikang; Kim, Jaebum

2015-02-01

The study of environmental microbial communities, called metagenomics, has gained a lot of attention because of the recent advances in next-generation sequencing (NGS) technologies. Microbes play a critical role in changing their environments, and the mode of their effect can be solved by investigating metagenomes. However, the difficulty of metagenomes, such as the combination of multiple microbes and different species abundance, makes metagenome assembly tasks more challenging. In this paper, we developed a new metagenome assembly method by utilizing protein sequences, in addition to the NGS read sequences. Our method (i) builds read clusters by using mapping information against available protein sequences, and (ii) creates contig sequences by finding consensus sequences through probabilistic choices from the read clusters. By using simulated NGS read sequences from real microbial genome sequences, we evaluated our method in comparison with four existing assembly programs. We found that our method could generate relatively long and accurate metagenome assemblies, indicating that the idea of using protein sequences, as a guide for the assembly, is promising. Copyright © 2015 Elsevier B.V. All rights reserved.
Representation of DNA sequences in genetic codon context with applications in exon and intron prediction.

PubMed

Yin, Changchuan

2015-04-01

To apply digital signal processing (DSP) methods to analyze DNA sequences, the sequences first must be specially mapped into numerical sequences. Thus, effective numerical mappings of DNA sequences play key roles in the effectiveness of DSP-based methods such as exon prediction. Despite numerous mappings of symbolic DNA sequences to numerical series, the existing mapping methods do not include the genetic coding features of DNA sequences. We present a novel numerical representation of DNA sequences using genetic codon context (GCC) in which the numerical values are optimized by simulated annealing to maximize the 3-periodicity signal to noise ratio (SNR). The optimized GCC representation is then applied in exon and intron prediction using a Short-Time Fourier Transform (STFT) approach. The results show the GCC method enhances the SNR values of exon sequences and thus increases the accuracy of predicting protein coding regions in genomes compared with the commonly used 4D binary representation. In addition, this study offers a novel way to reveal specific features of DNA sequences by optimizing numerical mappings of symbolic DNA sequences.
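To make the 3-periodicity signal-to-noise ratio concrete, the following Python sketch computes it for the baseline 4D binary (indicator) representation mentioned in the abstract, not for the GCC mapping itself. The SNR definition used here, power at frequency N/3 divided by the mean power over the non-zero frequencies, is a common convention and is an assumption about what the authors computed; the function names are hypothetical.

```python
import cmath


def indicator_power(seq, k):
    """Sum over A, C, G, T of |X_b[k]|^2, where X_b is the DFT of the
    0/1 indicator sequence marking the positions of base b."""
    n = len(seq)
    total = 0.0
    for base in "ACGT":
        coeff = sum(
            cmath.exp(-2j * cmath.pi * k * i / n)
            for i, symbol in enumerate(seq)
            if symbol == base
        )
        total += abs(coeff) ** 2
    return total


def period3_snr(seq):
    """Power at frequency N/3 divided by the mean power over k = 1..N-1."""
    n = len(seq)
    if n < 6 or n % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3 and at least 6")
    peak = indicator_power(seq, n // 3)
    mean = sum(indicator_power(seq, k) for k in range(1, n)) / (n - 1)
    return peak / mean if mean > 0 else 0.0
```

For a perfectly periodic toy input such as "ATG" repeated ten times, all non-zero-frequency power sits at k = N/3 and k = 2N/3, so the ratio evaluates to 14.5. The direct DFT is quadratic in sequence length and is meant only for short illustrative windows.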
An efficient approach to BAC based assembly of complex genomes.

PubMed

Visendi, Paul; Berkman, Paul J; Hayashi, Satomi; Golicz, Agnieszka A; Bayer, Philipp E; Ruperao, Pradeep; Hurgobin, Bhavna; Montenegro, Juan; Chan, Chon-Kit Kenneth; Staňková, Helena; Batley, Jacqueline; Šimková, Hana; Doležel, Jaroslav; Edwards, David

2016-01-01

There has been an exponential growth in the number of genome sequencing projects since the introduction of next generation DNA sequencing technologies. Genome projects have increasingly involved assembly of whole genome data which produces inferior assemblies compared to traditional Sanger sequencing of genomic fragments cloned into bacterial artificial chromosomes (BACs). While whole genome shotgun sequencing using next generation sequencing (NGS) is relatively fast and inexpensive, this method is extremely challenging for highly complex genomes, where polyploidy or high repeat content confounds accurate assembly, or where a highly accurate 'gold' reference is required. Several attempts have been made to improve genome sequencing approaches by incorporating NGS methods, with variable success. We present the application of a novel BAC sequencing approach which combines indexed pools of BACs, Illumina paired read sequencing, a sequence assembler specifically designed for complex BAC assembly, and a custom bioinformatics pipeline. We demonstrate this method by sequencing and assembling BAC cloned fragments from bread wheat and sugarcane genomes. We demonstrate that our assembly approach is accurate, robust, cost effective and scalable, with applications for complete genome sequencing in large and complex genomes.
A Case Study into Microbial Genome Assembly Gap Sequences and Finishing Strategies.

PubMed

Utturkar, Sagar M; Klingeman, Dawn M; Hurt, Richard A; Brown, Steven D

2017-01-01

This study characterized regions of DNA which remained unassembled by either PacBio or Illumina sequencing technologies for seven bacterial genomes. Two genomes were manually finished using bioinformatics and PCR/Sanger sequencing approaches and regions not assembled by automated software were analyzed. Gaps present within Illumina assemblies mostly correspond to repetitive DNA regions such as multiple rRNA operon sequences. PacBio gap sequences were evaluated for several properties such as GC content, read coverage, gap length, ability to form strong secondary structures, and corresponding annotations. Our hypothesis that strong secondary DNA structures blocked DNA polymerases and contributed to gap sequences was not accepted. PacBio assemblies had few limitations overall and gaps were explained as a cumulative effect of lower than average sequence coverage and repetitive sequences at contig termini. An important aspect of the present study is the compilation of biological features that interfered with assembly, which included active transposons, multiple plasmid sequences, phage DNA integration, and large sequence duplication. Our targeted genome finishing approach and systematic evaluation of the unassembled DNA will be useful for others looking to close, finish, and polish microbial genome sequences.

Sequence memory based on coherent spin-interaction neural networks.

PubMed

Xia, Min; Wong, W K; Wang, Zhijie

2014-12-01

Sequence information processing, for instance, sequence memory, plays an important role in many functions of the brain. In the workings of the human brain, the steady-state period is alterable. However, in the existing sequence memory models using heteroassociations, the steady-state period cannot be changed in the sequence recall. In this work, a novel neural network model for sequence memory with controllable steady-state period based on coherent spin-interaction is proposed. In the proposed model, neurons fire collectively in a phase-coherent manner, which lets a neuron group respond differently to different patterns and also lets different neuron groups respond differently to one pattern. The simulation results demonstrating the performance of the sequence memory are presented. By introducing a new coherent spin-interaction sequence memory model, the steady-state period can be controlled by dimension parameters and the overlap between the input pattern and the stored patterns. The sequence storage capacity is enlarged by coherent spin interaction compared with the existing sequence memory models. Furthermore, the sequence storage capacity has an exponential relationship to the dimension of the neural network.
A Next-Generation Sequencing Primer—How Does It Work and What Can It Do?

PubMed Central

Alekseyev, Yuriy O.; Fazeli, Roghayeh; Yang, Shi; Basran, Raveen; Miller, Nancy S.

2018-01-01

Next-generation sequencing refers to a high-throughput technology that determines the nucleic acid sequences and identifies variants in a sample. The technology has been introduced into clinical laboratory testing and produces test results for precision medicine. Since next-generation sequencing is relatively new, graduate students, medical students, pathology residents, and other physicians may benefit from a primer to provide a foundation about basic next-generation sequencing methods and applications, as well as specific examples where it has had diagnostic and prognostic utility. Next-generation sequencing technology grew out of advances in multiple fields to produce a sophisticated laboratory test with tremendous potential. Next-generation sequencing may be used in the clinical setting to look for specific genetic alterations in patients with cancer, diagnose inherited conditions such as cystic fibrosis, and detect and profile microbial organisms. This primer will review DNA sequencing technology, the commercialization of next-generation sequencing, and clinical uses of next-generation sequencing. Specific applications where next-generation sequencing has demonstrated utility in oncology are provided. PMID:29761157
Towards predicting the encoding capability of MR fingerprinting sequences.

PubMed

Sommer, K; Amthor, T; Doneva, M; Koken, P; Meineke, J; Börnert, P

2017-09-01

Sequence optimization and appropriate sequence selection is still an unmet need in magnetic resonance fingerprinting (MRF). The main challenge in MRF sequence design is the lack of an appropriate measure of the sequence's encoding capability. To find such a measure, three different candidates for judging the encoding capability have been investigated: local and global dot-product-based measures judging dictionary entry similarity as well as a Monte Carlo method that evaluates the noise propagation properties of an MRF sequence. Consistency of these measures for different sequence lengths as well as the capability to predict actual sequence performance in both phantom and in vivo measurements was analyzed. While the dot-product-based measures yielded inconsistent results for different sequence lengths, the Monte Carlo method was in a good agreement with phantom experiments. In particular, the Monte Carlo method could accurately predict the performance of different flip angle patterns in actual measurements. The proposed Monte Carlo method provides an appropriate measure of MRF sequence encoding capability and may be used for sequence optimization. Copyright © 2017 Elsevier Inc. All rights reserved.
Use of sequence-independent-single-primer-amplification (SISPA) for whole genome sequencing using Illumina MiSeq platform for avian influenza virus, Newcastle disease virus, and infectious bronchitis virus

USDA-ARS's Scientific Manuscript database

Over the past decade, Next Generation Sequencing (NGS) technologies, also called deep sequencing, have continued to evolve, increasing capacity and lowering the cost necessary for large genome sequencing projects. One of the advantages of NGS platforms is the possibility to sequence the samples with...
New Sequences with Low Correlation and Large Family Size

NASA Astrophysics Data System (ADS)

Zeng, Fanxin

In direct-sequence code-division multiple-access (DS-CDMA) communication systems and direct-sequence ultra wideband (DS-UWB) radios, sequences with low correlation and large family size are important for reducing multiple access interference (MAI) and accepting more active users, respectively. In this paper, a new collection of families of sequences of length $p^n-1$, which includes three constructions, is proposed. The maximum number of cyclically distinct families without GMW sequences in each construction is $\varphi(p^n-1)/n \cdot \varphi(p^m-1)/m$, where $p$ is a prime number, $n$ is an even number, and $n=2m$, and these sequences can be binary or polyphase depending upon choice of the parameter $p$. In Construction I, there are $p^n$ distinct sequences within each family and the new sequences have at most $d+2$ nontrivial periodic correlation values $\{-p^m-1, -1, p^m-1, 2p^m-1, \ldots, dp^m-1\}$. In Construction II, the new sequences have large family size $p^{2n}$ and possibly take the nontrivial correlation values in $\{-p^m-1, -1, p^m-1, 2p^m-1, \ldots, (3d-4)p^m-1\}$. In Construction III, the new sequences possess the largest family size $p^{(d-1)n}$ and have at most $2d$ correlation levels $\{-p^m-1, -1, p^m-1, 2p^m-1, \ldots, (2d-2)p^m-1\}$. Three constructions are near-optimal with respect to the Welch bound because the values of their Welch-Ratios are moderate, $\mathrm{WR} \approx d$, $\mathrm{WR} \approx 3d-4$ and $\mathrm{WR} \approx 2d-2$, respectively. Each family in Constructions I, II and III contains a GMW sequence. In addition, Helleseth sequences and Niho sequences are special cases in Constructions I and III, and their restriction conditions to the integers $m$ and $n$, $p^m \not\equiv 2 \pmod{3}$ and $n \equiv 0 \pmod{4}$, respectively, are removed in our sequences. Our sequences in Construction III include the sequences with Niho type decimation $3 \cdot 2^m - 2$, too. Finally, some open questions are pointed out and an example that illustrates the performance of these sequences is given.
Fast discovery and visualization of conserved regions in DNA sequences using quasi-alignment

PubMed Central

2013-01-01

Background: Next Generation Sequencing techniques are producing enormous amounts of biological sequence data and analysis becomes a major computational problem. Currently, most analysis, especially the identification of conserved regions, relies heavily on Multiple Sequence Alignment and its various heuristics such as progressive alignment, whose run time grows with the square of the number and the length of the aligned sequences and requires significant computational resources. In this work, we present a method to efficiently discover regions of high similarity across multiple sequences without performing expensive sequence alignment. The method is based on approximating edit distance between segments of sequences using p-mer frequency counts. Then, efficient high-throughput data stream clustering is used to group highly similar segments into so called quasi-alignments. Quasi-alignments have numerous applications such as identifying species and their taxonomic class from sequences, comparing sequences for similarities, and, as in this paper, discovering conserved regions across related sequences. Results: In this paper, we show that quasi-alignments can be used to discover highly similar segments across multiple sequences from related or different genomes efficiently and accurately. Experiments on a large number of unaligned 16S rRNA sequences obtained from the Greengenes database show that the method is able to identify conserved regions which agree with known hypervariable regions in 16S rRNA. Furthermore, the experiments show that the proposed method scales well for large data sets with a run time that grows only linearly with the number and length of sequences, whereas for existing multiple sequence alignment heuristics the run time grows super-linearly. Conclusion: Quasi-alignment-based algorithms can detect highly similar regions and conserved areas across multiple sequences. Since the run time is linear and the sequences are converted into a compact clustering model, we are able to identify conserved regions fast or even interactively using a standard PC. Our method has many potential applications such as finding characteristic signature sequences for families of organisms and studying conserved and variable regions in, for example, 16S rRNA. PMID:24564200
Fast discovery and visualization of conserved regions in DNA sequences using quasi-alignment.

PubMed

Nagar, Anurag; Hahsler, Michael

2013-01-01

Next Generation Sequencing techniques are producing enormous amounts of biological sequence data and analysis becomes a major computational problem. Currently, most analysis, especially the identification of conserved regions, relies heavily on Multiple Sequence Alignment and its various heuristics such as progressive alignment, whose run time grows with the square of the number and the length of the aligned sequences and requires significant computational resources. In this work, we present a method to efficiently discover regions of high similarity across multiple sequences without performing expensive sequence alignment. The method is based on approximating edit distance between segments of sequences using p-mer frequency counts. Then, efficient high-throughput data stream clustering is used to group highly similar segments into so called quasi-alignments. Quasi-alignments have numerous applications such as identifying species and their taxonomic class from sequences, comparing sequences for similarities, and, as in this paper, discovering conserved regions across related sequences. In this paper, we show that quasi-alignments can be used to discover highly similar segments across multiple sequences from related or different genomes efficiently and accurately. Experiments on a large number of unaligned 16S rRNA sequences obtained from the Greengenes database show that the method is able to identify conserved regions which agree with known hypervariable regions in 16S rRNA. Furthermore, the experiments show that the proposed method scales well for large data sets with a run time that grows only linearly with the number and length of sequences, whereas for existing multiple sequence alignment heuristics the run time grows super-linearly. Quasi-alignment-based algorithms can detect highly similar regions and conserved areas across multiple sequences. 
Since the run time is linear and the sequences are converted into a compact clustering model, we are able to identify conserved regions fast or even interactively using a standard PC. Our method has many potential applications such as finding characteristic signature sequences for families of organisms and studying conserved and variable regions in, for example, 16S rRNA.
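The idea of comparing sequence segments through p-mer frequency counts instead of alignment can be sketched in a few lines of Python. This is an illustrative sketch only: the function names are hypothetical, and the choice of Manhattan distance between count vectors is an assumption, since the abstract does not state which distance the authors use.

```python
from collections import Counter


def pmer_counts(segment, p=3):
    """Count every overlapping substring of length p in the segment."""
    return Counter(segment[i:i + p] for i in range(len(segment) - p + 1))


def pmer_distance(a, b, p=3):
    """Manhattan distance between the p-mer count vectors of two segments.

    Used here as a cheap, alignment-free stand-in for edit distance.
    """
    counts_a, counts_b = pmer_counts(a, p), pmer_counts(b, p)
    return sum(
        abs(counts_a[mer] - counts_b[mer])
        for mer in counts_a.keys() | counts_b.keys()
    )
```

Counting p-mers takes time linear in segment length, which is consistent with the linear run time reported in the abstract. Segments with a small distance would then be candidates for the same cluster in the stream-clustering step, which is not shown here.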
Introduction of the hybcell-based compact sequencing technology and comparison to state-of-the-art methodologies for KRAS mutation detection.

PubMed

Zopf, Agnes; Raim, Roman; Danzer, Martin; Niklas, Norbert; Spilka, Rita; Pröll, Johannes; Gabriel, Christian; Nechansky, Andreas; Roucka, Markus

2015-03-01

The detection of KRAS mutations in codons 12 and 13 is critical for anti-EGFR therapy strategies; however, only those methodologies with high sensitivity, specificity, and accuracy as well as the best cost and turnaround balance are suitable for routine daily testing. Here we compared the performance of compact sequencing using the novel hybcell technology with 454 next-generation sequencing (454-NGS), Sanger sequencing, and pyrosequencing, using an evaluation panel of 35 specimens. A total of 32 mutations and 10 wild-type cases were reported using 454-NGS as the reference method. Specificity ranged from 100% for Sanger sequencing to 80% for pyrosequencing. Sanger sequencing and hybcell-based compact sequencing achieved a sensitivity of 96%, whereas pyrosequencing had a sensitivity of 88%. Accuracy was 97% for Sanger sequencing, 85% for pyrosequencing, and 94% for hybcell-based compact sequencing. Quantitative results were obtained for 454-NGS and hybcell-based compact sequencing data, resulting in a significant correlation (r = 0.914). Whereas pyrosequencing and Sanger sequencing were not able to detect multiple mutated cell clones within one tumor specimen, 454-NGS and the hybcell-based compact sequencing detected multiple mutations in two specimens. Our comparison shows that the hybcell-based compact sequencing is a valuable alternative to state-of-the-art methodologies used for detection of clinically relevant point mutations.
tRNADB-CE: tRNA gene database well-timed in the era of big sequence data.

PubMed

Abe, Takashi; Inokuchi, Hachiro; Yamada, Yuko; Muto, Akira; Iwasaki, Yuki; Ikemura, Toshimichi

2014-01-01

The tRNA gene database curated by experts, "tRNADB-CE" (http://trna.ie.niigata-u.ac.jp), was constructed by analyzing 1,966 complete and 5,272 draft genomes of prokaryotes, 171 viruses', 121 chloroplasts', and 12 eukaryotes' genomes plus fragment sequences obtained by metagenome studies of environmental samples. In total, 595,115 tRNA genes, twice as many as previously compiled, have been registered, for which sequence, clover-leaf structure, and results of sequence-similarity and oligonucleotide-pattern searches can be browsed. To provide collective knowledge with help from experts in tRNA research, we added a column for registering comments to each tRNA. By grouping bacterial tRNAs with an identical sequence, we have found high phylogenetic preservation of tRNA sequences, especially at the phylum level. Since many species-unknown tRNAs from metagenomic sequences have sequences identical to those found in species-known prokaryotes, the identical sequence group (ISG) can provide phylogenetic markers to investigate the microbial community in an environmental ecosystem. This strategy can be applied to a huge amount of short sequences obtained from next-generation sequencers, showing that tRNADB-CE is a well-timed database in the era of big sequence data. We also discuss how a batch-learning self-organizing map with oligonucleotide composition is useful for efficient knowledge discovery from big sequence data.
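The identical sequence group (ISG) idea described above amounts to grouping records by exact sequence identity. A minimal Python sketch follows; the function name and the record layout of (source label, sequence) pairs are hypothetical and are not taken from tRNADB-CE.

```python
from collections import defaultdict


def identical_sequence_groups(records):
    """Group source labels by exact (case-insensitive) sequence identity.

    `records` is an iterable of (source_label, sequence) pairs. Returns a
    dict mapping each upper-cased sequence to a sorted list of the labels
    that carry it.
    """
    groups = defaultdict(set)
    for source, sequence in records:
        groups[sequence.upper()].add(source)
    return {seq: sorted(sources) for seq, sources in groups.items()}
```

In this framing, a metagenomic read of unknown origin that lands in a group together with sequences from known species inherits a candidate phylogenetic assignment from the other members of the group.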
Automated Sanger Analysis Pipeline (ASAP): A Tool for Rapidly Analyzing Sanger Sequencing Data with Minimum User Interference.

PubMed

Singh, Aditya; Bhatia, Prateek

2016-12-01

Sanger sequencing platforms, such as Applied Biosystems instruments, generate chromatogram files. Generally, for one region of a sequence, both forward and reverse primers are used to sequence that area, so that there are two sequences that need to be aligned and a consensus generated before mutation detection studies. This work is cumbersome and takes time, especially if the gene is large with many exons. Hence, we devised a rapid automated command system to filter, build, and align consensus sequences and also optionally extract exonic regions, translate them in all frames, and perform an amino acid alignment starting from raw sequence data within a very short time. With its full capabilities, the Automated Sanger Analysis Pipeline (ASAP) is able to read "*.ab1" chromatogram files through a command line interface, convert them to the FASTQ format, trim the low-quality regions, reverse-complement the reverse sequence, create a consensus sequence, extract the exonic regions using a reference exonic sequence, translate the sequence in all frames, and align the nucleic acid and amino acid sequences to reference nucleic acid and amino acid sequences, respectively. All files are created and can be used for further analysis. ASAP is available as a Python 3.x executable at https://github.com/aditya-88/ASAP. The version described in this paper is 0.28.
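Two of the steps listed above, reverse-complementing the reverse read and building a consensus, can be illustrated with a short Python sketch. This is not ASAP's code: the function names are hypothetical, and the consensus rule (keep a base where both reads agree, write N otherwise) is a deliberately naive assumption that also presumes the two trimmed reads already cover exactly the same region.

```python
# Translation table mapping each nucleotide to its complement.
_COMPLEMENT = str.maketrans("ACGTNacgtn", "TGCANtgcan")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def naive_consensus(forward, reverse):
    """Combine a forward read with the reverse complement of a reverse read.

    Positions where the two reads agree keep the base; disagreements become
    'N'. Both reads must span the same region and have equal length.
    """
    flipped = reverse_complement(reverse)
    if len(flipped) != len(forward):
        raise ValueError("reads must have the same length after trimming")
    return "".join(
        f if f == r else "N"
        for f, r in zip(forward.upper(), flipped.upper())
    )
```

A real pipeline would align the two reads and weigh base qualities from the chromatogram before calling a consensus; the equal-length requirement here simply keeps the sketch self-contained.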
TaxI: a software tool for DNA barcoding using distance methods

PubMed Central

Steinke, Dirk; Vences, Miguel; Salzburger, Walter; Meyer, Axel

2005-01-01

DNA barcoding is a promising approach to the diagnosis of biological diversity in which DNA sequences serve as the primary key for information retrieval. Most existing software for evolutionary analysis of DNA sequences was designed for phylogenetic analyses and, hence, those algorithms do not offer appropriate solutions for the rapid, but precise analyses needed for DNA barcoding, and are also unable to process the often large comparative datasets. We developed a flexible software tool for DNA taxonomy, named TaxI. This program calculates sequence divergences between a query sequence (taxon to be barcoded) and each sequence of a dataset of reference sequences defined by the user. Because the analysis is based on separate pairwise alignments this software is also able to work with sequences characterized by multiple insertions and deletions that are difficult to align in large sequence sets (i.e. thousands of sequences) by multiple alignment algorithms because of computational restrictions. Here, we demonstrate the utility of this approach with two datasets of fish larvae and juveniles from Lake Constance and juvenile land snails under different models of sequence evolution. Sets of ribosomal 16S rRNA sequences, characterized by multiple indels, performed as well as or better than cox1 sequence sets in assigning sequences to species, demonstrating the suitability of rRNA genes for DNA barcoding. PMID:16214755
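The distance-based assignment described above can be sketched in Python as a query compared against each reference separately. This is not TaxI's implementation: the function names are hypothetical, the uncorrected p-distance is only one of the possible divergence measures, and the sketch assumes each query/reference pair has already been pairwise aligned to equal length with '-' marking gaps.

```python
def p_distance(a, b):
    """Uncorrected p-distance: fraction of differing sites between two
    aligned sequences, skipping any site where either one has a gap."""
    if len(a) != len(b):
        raise ValueError("aligned sequences must have equal length")
    compared = [
        (x, y) for x, y in zip(a.upper(), b.upper())
        if x != "-" and y != "-"
    ]
    if not compared:
        raise ValueError("no gap-free sites to compare")
    return sum(x != y for x, y in compared) / len(compared)


def closest_reference(query, references):
    """Return the name of the reference with the smallest p-distance.

    `references` maps a reference name to its sequence, aligned to `query`.
    """
    return min(references, key=lambda name: p_distance(query, references[name]))
```

Skipping gapped sites is one simple way to keep indel-rich markers such as 16S rRNA comparable across pairs; other treatments of gaps would give different divergence values.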
Sequencing, Analysis, and Annotation of Expressed Sequence Tags for Camelus dromedarius

PubMed Central

Al-Swailem, Abdulaziz M.; Shehata, Maher M.; Abu-Duhier, Faisel M.; Al-Yamani, Essam J.; Al-Busadah, Khalid A.; Al-Arawi, Mohammed S.; Al-Khider, Ali Y.; Al-Muhaimeed, Abdullah N.; Al-Qahtani, Fahad H.; Manee, Manee M.; Al-Shomrani, Badr M.; Al-Qhtani, Saad M.; Al-Harthi, Amer S.; Akdemir, Kadir C.; Otu, Hasan H.

2010-01-01

Despite its economic, cultural, and biological importance, there has not been a large scale sequencing project to date for Camelus dromedarius. With the goal of sequencing complete DNA of the organism, we first established and sequenced camel EST libraries, generating 70,272 reads. Following trimming, chimera check, repeat masking, cluster and assembly, we obtained 23,602 putative gene sequences, out of which over 4,500 potentially novel or fast evolving gene sequences do not carry any homology to other available genomes. Functional annotation of sequences with similarities in nucleotide and protein databases has been obtained using Gene Ontology classification. Comparison to available full length cDNA sequences and Open Reading Frame (ORF) analysis of camel sequences that exhibit homology to known genes show more than 80% of the contigs with an ORF>300 bp and ∼40% hits extending to the start codons of full length cDNAs suggesting successful characterization of camel genes. Similarity analyses are done separately for different organisms including human, mouse, bovine, and rat. The accompanying web portal, CAGBASE (http://camel.kacst.edu.sa/), hosts a relational database containing annotated EST sequences and analysis tools with the possibility to add sequences from the public domain. We anticipate our results to provide a home base for genomic studies of camel and other comparative studies enabling a starting point for whole genome sequencing of the organism. PMID:20502665
Multiplexed fragaria chloroplast genome sequencing

Treesearch

W. Njuguna; A. Liston; R. Cronn; N.V. Bassil

2010-01-01

A method to sequence multiple chloroplast genomes using ultra high throughput sequencing technologies was recently described. Complete chloroplast genome sequences can resolve phylogenetic relationships at low taxonomic levels and identify informative point mutations and indels. The objective of this research was to sequence multiple Fragaria...
Detection of a divergent variant of grapevine virus F by next-generation sequencing.

PubMed

Molenaar, Nicholas; Burger, Johan T; Maree, Hans J

2015-08-01

The complete genome sequence of a South African isolate of grapevine virus F (GVF) is presented. It was first detected by metagenomic next-generation sequencing of field samples and validated through direct Sanger sequencing. The genome sequence of GVF isolate V5 consists of 7539 nucleotides and contains a poly(A) tail. It has a typical vitivirus genome arrangement that comprises five open reading frames (ORFs), which share only 88.96 % nucleotide sequence identity with the existing complete GVF genome sequence (JX105428).
Nuclear counterparts of the cytoplasmic mitochondrial 12S rRNA gene: a problem of ancient DNA and molecular phylogenies.

PubMed

van der Kuyl, A C; Kuiken, C L; Dekker, J T; Perizonius, W R; Goudsmit, J

1995-06-01

Monkey mummy bones and teeth originating from the North Saqqara Baboon Galleries (Egypt), soft tissue from a mummified baboon in a museum collection, and nineteenth/twentieth-century skin fragments from mangabeys were used for DNA extraction and PCR amplification of part of the mitochondrial 12S rRNA gene. Sequences aligning with the 12S rRNA gene were recovered but were only distantly related to contemporary monkey mitochondrial 12S rRNA sequences. However, many of these sequences were identical or closely related to human nuclear DNA sequences resembling mitochondrial 12S rRNA (isolated from a cell line depleted in mitochondria) and therefore have to be considered contamination. Subsequently in a separate study we were able to recover genuine mitochondrial 12S rRNA sequences from many extant species of nonhuman Old World primates and sequences closely resembling the human nuclear integrations. Analysis of all sequences by the neighbor-joining (NJ) method indicated that mitochondrial DNA sequences and their nuclear counterparts can be divided into two distinct clusters. One cluster contained all contemporary cytoplasmic mitochondrial DNA sequences and approximately half of the monkey nuclear mitochondrial-like sequences. A second cluster contained most human nuclear sequences and the other half of monkey nuclear sequences with a separate branch leading to human and gorilla mitochondrial and nuclear sequences. Sequences recovered from ancient materials were equally divided between the two clusters. These results constitute a warning when working with ancient DNA or performing phylogenetic analysis using mitochondrial DNA as a target sequence: Nuclear counterparts of mitochondrial genes may lead to faulty interpretation of results.
Three-dimensional sampling perfection with application-optimised contrasts using a different flip angle evolutions sequence for routine imaging of the spine: preliminary experience

PubMed Central

Tins, B; Cassar-Pullicino, V; Haddaway, M; Nachtrab, U

2012-01-01

Objectives: The bulk of spinal imaging is still performed with conventional two-dimensional sequences. This study assesses the suitability of three-dimensional sampling perfection with application-optimised contrasts using a different flip angle evolutions (SPACE) sequence for routine spinal imaging. Methods: 62 MRI examinations of the spine were evaluated by 2 examiners in consensus for the depiction of anatomy and presence of artefact. We noted pathologies that might be missed using the SPACE sequence only or the SPACE and a sagittal T1 weighted sequence. The reference standards were sagittal and axial T1 weighted and T2 weighted sequences. At a later date the evaluation was repeated by one of the original examiners and an additional examiner. Results: There was good agreement of the single evaluations and consensus evaluation for the conventional sequences: κ>0.8, confidence interval (CI)>0.6–1.0. For the SPACE sequence, depiction of anatomy was very good for 84% of cases, with high interobserver agreement, but there was poor interobserver agreement for other cases. For artefact assessment of SPACE, κ=0.92, CI=0.92–1.0. The SPACE sequence was superior to conventional sequences for depiction of anatomy and artefact resistance. The SPACE sequence occasionally missed bone marrow oedema. In conjunction with sagittal T1 weighted sequences, no abnormality was missed. The isotropic SPACE sequence was superior to conventional sequences in imaging difficult anatomy such as in scoliosis and spondylolysis. Conclusion: The SPACE sequence allows excellent assessment of anatomy owing to high spatial resolution and resistance to artefact. The sensitivity for bone marrow abnormalities is limited. PMID:22374284
Three-dimensional sampling perfection with application-optimised contrasts using a different flip angle evolutions sequence for routine imaging of the spine: preliminary experience.

PubMed

Tins, B; Cassar-Pullicino, V; Haddaway, M; Nachtrab, U

2012-08-01

The bulk of spinal imaging is still performed with conventional two-dimensional sequences. This study assesses the suitability of three-dimensional sampling perfection with application-optimised contrasts using a different flip angle evolutions (SPACE) sequence for routine spinal imaging. 62 MRI examinations of the spine were evaluated by 2 examiners in consensus for the depiction of anatomy and presence of artefact. We noted pathologies that might be missed using the SPACE sequence only or the SPACE and a sagittal T1 weighted sequence. The reference standards were sagittal and axial T1 weighted and T2 weighted sequences. At a later date the evaluation was repeated by one of the original examiners and an additional examiner. There was good agreement of the single evaluations and consensus evaluation for the conventional sequences: κ>0.8, confidence interval (CI)>0.6-1.0. For the SPACE sequence, depiction of anatomy was very good for 84% of cases, with high interobserver agreement, but there was poor interobserver agreement for other cases. For artefact assessment of SPACE, κ=0.92, CI=0.92-1.0. The SPACE sequence was superior to conventional sequences for depiction of anatomy and artefact resistance. The SPACE sequence occasionally missed bone marrow oedema. In conjunction with sagittal T1 weighted sequences, no abnormality was missed. The isotropic SPACE sequence was superior to conventional sequences in imaging difficult anatomy such as in scoliosis and spondylolysis. The SPACE sequence allows excellent assessment of anatomy owing to high spatial resolution and resistance to artefact. The sensitivity for bone marrow abnormalities is limited.
ChromatoGate: A Tool for Detecting Base Mis-Calls in Multiple Sequence Alignments by Semi-Automatic Chromatogram Inspection

PubMed Central

Alachiotis, Nikolaos; Vogiatzi, Emmanouella; Pavlidis, Pavlos; Stamatakis, Alexandros

2013-01-01

Automated DNA sequencers generate chromatograms that contain raw sequencing data. They also generate data that translates the chromatograms into molecular sequences of A, C, G, T, or N (undetermined) characters. Since chromatogram translation programs frequently introduce errors, a manual inspection of the generated sequence data is required. As sequence numbers and lengths increase, visual inspection and manual correction of chromatograms and corresponding sequences on a per-peak and per-nucleotide basis becomes an error-prone, time-consuming, and tedious process. Here, we introduce ChromatoGate (CG), an open-source software that accelerates and partially automates the inspection of chromatograms and the detection of sequencing errors for bidirectional sequencing runs. To provide users full control over the error correction process, a fully automated error correction algorithm has not been implemented. Initially, the program scans a given multiple sequence alignment (MSA) for potential sequencing errors, assuming that each polymorphic site in the alignment may be attributed to a sequencing error with a certain probability. The guided MSA assembly procedure in ChromatoGate detects chromatogram peaks of all characters in an alignment that lead to polymorphic sites, given a user-defined threshold. The threshold value represents the sensitivity of the sequencing error detection mechanism. After this pre-filtering, the user only needs to inspect a small number of peaks in every chromatogram to correct sequencing errors. Finally, we show that correcting sequencing errors is important, because population genetic and phylogenetic inferences can be misled by MSAs with uncorrected mis-calls. Our experiments indicate that estimates of population mutation rates can be affected two- to three-fold by uncorrected errors. PMID:24688709
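The pre-filtering idea described above (scan an alignment for polymorphic sites, then inspect only the suspicious base calls) can be illustrated with a small sketch. This is not ChromatoGate's algorithm: the function name, the `max_minor_fraction` parameter, and the rule of flagging characters whose column frequency is at or below that fraction are all assumptions chosen for illustration.

```python
from collections import Counter

def flag_suspect_sites(msa, max_minor_fraction=0.2):
    """Return (column, row, base) for rare characters at polymorphic columns.

    Assumption: a base call is 'suspect' when its frequency within its
    alignment column is <= max_minor_fraction. A real tool would then show
    the chromatogram peak for each flagged call to the user.
    """
    if not msa or len({len(seq) for seq in msa}) != 1:
        raise ValueError("alignment must be non-empty with equal-length rows")
    n_rows = len(msa)
    flagged = []
    for col in range(len(msa[0])):
        counts = Counter(seq[col] for seq in msa)
        if len(counts) == 1:
            continue  # monomorphic column, nothing to inspect
        for row, seq in enumerate(msa):
            if counts[seq[col]] / n_rows <= max_minor_fraction:
                flagged.append((col, row, seq[col]))
    return flagged
```

Lowering the threshold flags fewer calls for manual inspection; raising it makes the screen more sensitive at the cost of more peaks to review.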
PET-Tool: a software suite for comprehensive processing and managing of Paired-End diTag (PET) sequence data.

PubMed

Chiu, Kuo Ping; Wong, Chee-Hong; Chen, Qiongyu; Ariyaratne, Pramila; Ooi, Hong Sain; Wei, Chia-Lin; Sung, Wing-Kin Ken; Ruan, Yijun

2006-08-25

We recently developed the Paired End diTag (PET) strategy for efficient characterization of mammalian transcriptomes and genomes. The paired-end nature of short PET sequences derived from long DNA fragments raised a new set of bioinformatics challenges, including how to extract PETs from raw sequence reads and how to map PETs correctly yet efficiently to reference genome sequences. To accommodate and streamline data analysis of the large volume of PET sequences generated from each PET experiment, an automated PET data-processing pipeline is desirable. We designed an integrated computation program package, PET-Tool, to automatically process PET sequences and map them to the genome sequences. The Tool was implemented as a web-based application composed of four modules: the Extractor module for PET extraction; the Examiner module for analytic evaluation of PET sequence quality; the Mapper module for locating PET sequences in the genome sequences; and the Project Manager module for data organization. The performance of PET-Tool was evaluated through the analyses of 2.7 million PET sequences. It was demonstrated that PET-Tool is accurate and efficient in extracting PET sequences and removing artifacts from large-volume datasets. Using optimized mapping criteria, over 70% of quality PET sequences were mapped specifically to the genome sequences. On a 2.4 GHz Linux machine, it takes approximately six hours to process one million PETs from extraction to mapping. The speed, accuracy, and comprehensiveness have proved that PET-Tool is an important and useful component in PET experiments, and can be extended to accommodate other related analyses of paired-end sequences. The Tool also provides user-friendly functions for data quality checks and a system for multi-layer data management.

Making sense of deep sequencing

PubMed Central

Goldman, D.; Domschke, K.

2016-01-01

This review, the first of an occasional series, tries to make sense of the concepts and uses of deep sequencing of polynucleic acids (DNA and RNA). Deep sequencing, synonymous with next-generation sequencing, high-throughput sequencing and massively parallel sequencing, includes whole genome sequencing but is more often and diversely applied to specific parts of the genome captured in different ways, for example the highly expressed portion of the genome known as the exome and portions of the genome that are epigenetically marked either by DNA methylation, the binding of proteins including histones, or that are in different configurations and thus more or less accessible to enzymes that cleave DNA. Deep sequencing of RNA (RNASeq) reverse-transcribed to complementary DNA is invaluable for measuring RNA expression and detecting changes in RNA structure. Important concepts in deep sequencing include the length and depth of sequence reads, mapping and assembly of reads, sequencing error, haplotypes, and the propensity of deep sequencing, as with other types of ‘big data’, to generate large numbers of errors, requiring monitoring for methodologic biases and strategies for replication and validation. Deep sequencing yields a unique genetic fingerprint that can be used to identify a person, and a trove of predictors of genetic medical diseases. Deep sequencing to identify epigenetic events including changes in DNA methylation and RNA expression can reveal the history and impact of environmental exposures. Because of the power of sequencing to identify and deliver biomedically significant information about a person and their blood relatives, it creates ethical dilemmas and practical challenges in research and clinical care, for example the decision and procedures to report incidental findings that will increasingly and frequently be discovered. PMID:24925306
A statistical method for the detection of variants from next-generation resequencing of DNA pools.

PubMed

Bansal, Vikas

2010-06-15

Next-generation sequencing technologies have enabled the sequencing of several human genomes in their entirety. However, the routine resequencing of complete genomes remains infeasible. The massive capacity of next-generation sequencers can be harnessed for sequencing specific genomic regions in hundreds to thousands of individuals. Sequencing-based association studies are currently limited by the low level of multiplexing offered by sequencing platforms. Pooled sequencing represents a cost-effective approach for studying rare variants in large populations. To utilize the power of DNA pooling, it is important to accurately identify sequence variants from pooled sequencing data. Detection of rare variants from pooled sequencing represents a different challenge than detection of variants from individual sequencing. We describe a novel statistical approach, CRISP [Comprehensive Read analysis for Identification of Single Nucleotide Polymorphisms (SNPs) from Pooled sequencing] that is able to identify both rare and common variants by using two approaches: (i) comparing the distribution of allele counts across multiple pools using contingency tables and (ii) evaluating the probability of observing multiple non-reference base calls due to sequencing errors alone. Information about the distribution of reads between the forward and reverse strands and the size of the pools is also incorporated within this framework to filter out false variants. Validation of CRISP on two separate pooled sequencing datasets generated using the Illumina Genome Analyzer demonstrates that it can detect 80-85% of SNPs identified using individual sequencing while achieving a low false discovery rate (3-5%). Comparison with previous methods for pooled SNP detection demonstrates the significantly lower false positive and false negative rates for CRISP. Implementation of this method is available at http://polymorphism.scripps.edu/~vbansal/software/CRISP/.
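The second approach mentioned above, evaluating how likely it is that several non-reference base calls arise from sequencing errors alone, can be sketched with a simple binomial tail probability. This is only an illustration: the abstract does not give CRISP's actual error model, and the function name, the single uniform `error_rate`, and the independence of errors across reads are assumptions.

```python
from math import comb

def error_only_pvalue(n_reads: int, n_alt: int, error_rate: float) -> float:
    """P(X >= n_alt) for X ~ Binomial(n_reads, error_rate).

    Assumption: each read independently shows a non-reference base with
    probability error_rate when no true variant is present.
    """
    if not 0 <= n_alt <= n_reads:
        raise ValueError("n_alt must be between 0 and n_reads")
    if not 0.0 <= error_rate <= 1.0:
        raise ValueError("error_rate must be a probability")
    return sum(
        comb(n_reads, k) * error_rate**k * (1.0 - error_rate) ** (n_reads - k)
        for k in range(n_alt, n_reads + 1)
    )
```

A small value suggests that the observed non-reference calls are hard to explain by errors alone; in practice the per-base quality scores and strand information described in the abstract would refine this crude estimate.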
Sequence modelling and an extensible data model for genomic database

DOE Office of Scientific and Technical Information (OSTI.GOV)

Li, Peter Wei-Der

1992-01-01

The Human Genome Project (HGP) plans to sequence the human genome by the beginning of the next century. It will generate DNA sequences of more than 10 billion bases and complex marker sequences (maps) of more than 100 million markers. All of this information will be stored in database management systems (DBMSs). However, existing data models do not have the abstraction mechanism for modelling sequences and existing DBMSs do not have operations for complex sequences. This work addresses the problem of sequence modelling in the context of the HGP and the more general problem of an extensible object data model that can incorporate the sequence model as well as existing and future data constructs and operators. First, we proposed a general sequence model that is application and implementation independent. This model is used to capture the sequence information found in the HGP at the conceptual level. In addition, abstract and biological sequence operators are defined for manipulating the modelled sequences. Second, we combined many features of semantic and object oriented data models into an extensible framework, which we called the "Extensible Object Model", to address the need for a modelling framework for incorporating the sequence data model with other types of data constructs and operators. This framework is based on the conceptual separation between constructors and constraints. We then used this modelling framework to integrate the constructs for the conceptual sequence model. The Extensible Object Model is also defined with a graphical representation, which is useful as a tool for database designers. Finally, we defined a query language to support this model and implemented a query processor to demonstrate the feasibility of the extensible framework and the usefulness of the conceptual sequence model.
The fast changing landscape of sequencing technologies and their impact on microbial genome assemblies and annotation.

PubMed

Mavromatis, Konstantinos; Land, Miriam L; Brettin, Thomas S; Quest, Daniel J; Copeland, Alex; Clum, Alicia; Goodwin, Lynne; Woyke, Tanja; Lapidus, Alla; Klenk, Hans Peter; Cottingham, Robert W; Kyrpides, Nikos C

2012-01-01

The emergence of next generation sequencing (NGS) has provided the means for rapid and high throughput sequencing and data generation at low cost, while concomitantly creating a new set of challenges. The number of available assembled microbial genomes continues to grow rapidly and their quality reflects the quality of the sequencing technology used, but also of the analysis software employed for assembly and annotation. In this work, we have explored the quality of the microbial draft genomes across various sequencing technologies. We have compared the draft and finished assemblies of 133 microbial genomes sequenced at the Department of Energy-Joint Genome Institute and finished at the Los Alamos National Laboratory using a variety of combinations of sequencing technologies, reflecting the transition of the institute from Sanger-based sequencing platforms to NGS platforms. The quality of the public assemblies and of the associated gene annotations was evaluated using various metrics. Results obtained with the different sequencing technologies, as well as their effects on downstream processes, were analyzed. Our results demonstrate that the Illumina HiSeq 2000 sequencing system, the primary sequencing technology currently used for de novo genome sequencing and assembly at JGI, has various advantages in terms of total sequence throughput and cost, but it also introduces challenges for the downstream analyses. In all cases, assembly results, although on average of high quality, need to be viewed critically, and sources of error in them should be considered prior to analysis. These data follow the evolution of microbial sequencing and downstream processing at the JGI from draft genome sequences with large gaps corresponding to missing genes of significant biological role to assemblies with multiple small gaps (Illumina) and finally to assemblies that generate almost complete genomes (Illumina+PacBio).
International interlaboratory study comparing single organism 16S rRNA gene sequencing data: Beyond consensus sequence comparisons

PubMed Central

Olson, Nathan D.; Lund, Steven P.; Zook, Justin M.; Rojas-Cornejo, Fabiola; Beck, Brian; Foy, Carole; Huggett, Jim; Whale, Alexandra S.; Sui, Zhiwei; Baoutina, Anna; Dobeson, Michael; Partis, Lina; Morrow, Jayne B.

2015-01-01

This study presents the results from an interlaboratory sequencing study for which we developed a novel high-resolution method for comparing data from different sequencing platforms for a multi-copy, paralogous gene. The combination of PCR amplification and 16S ribosomal RNA gene (16S rRNA) sequencing has revolutionized bacteriology by enabling rapid identification, frequently without the need for culture. To assess variability between laboratories in sequencing 16S rRNA, six laboratories sequenced the gene encoding the 16S rRNA from Escherichia coli O157:H7 strain EDL933 and Listeria monocytogenes serovar 4b strain NCTC11994. Participants performed sequencing methods and protocols available in their laboratories: Sanger sequencing, Roche 454 pyrosequencing®, or Ion Torrent PGM®. The sequencing data were evaluated on three levels: (1) identity of biologically conserved position, (2) ratio of 16S rRNA gene copies featuring identified variants, and (3) the collection of variant combinations in a set of 16S rRNA gene copies. The same set of biologically conserved positions was identified for each sequencing method. Analytical methods using Bayesian and maximum likelihood statistics were developed to estimate variant copy ratios, which describe the ratio of nucleotides at each identified biologically variable position, as well as the likely set of variant combinations present in 16S rRNA gene copies. Our results indicate that estimated variant copy ratios at biologically variable positions were only reproducible for high throughput sequencing methods. Furthermore, the likely variant combination set was only reproducible with increased sequencing depth and longer read lengths. We also demonstrate novel methods for evaluating variable positions when comparing multi-copy gene sequence data from multiple laboratories generated using multiple sequencing technologies. PMID:27077030
Partial gene sequences for the A subunit of methyl-coenzyme M reductase (mcrI) as a phylogenetic tool for the family Methanosarcinaceae

NASA Technical Reports Server (NTRS)

Springer, E.; Sachs, M. S.; Woese, C. R.; Boone, D. R.

1995-01-01

Representatives of the family Methanosarcinaceae were analyzed phylogenetically by comparing partial sequences of their methyl-coenzyme M reductase (mcrI) genes. A 490-bp fragment from the A subunit of the gene was selected, amplified by the PCR, cloned, and sequenced for each of 25 strains belonging to the Methanosarcinaceae. The sequences obtained were aligned with the corresponding portions of five previously published sequences, and all of the sequences were compared to determine phylogenetic distances by Fitch distance matrix methods. We prepared analogous trees based on 16S rRNA sequences; these trees corresponded closely to the mcrI trees, although the mcrI sequences of pairs of organisms had 3.01 ± 0.541 times more changes than the respective pairs of 16S rRNA sequences, suggesting that the mcrI fragment evolved about three times more rapidly than the 16S rRNA gene. The qualitative similarity of the mcrI and 16S rRNA trees suggests that transfer of genetic information between dissimilar organisms has not significantly affected these sequences, although we found inconsistencies between some mcrI distances that we measured and previously published DNA reassociation data. It is unlikely that multiple mcrI isogenes were present in the organisms that we examined, because we found no major discrepancies in multiple determinations of mcrI sequences from the same organism. Our primers for the PCR also match analogous sites in the previously published mcrII sequences, but all of the sequences that we obtained from members of the Methanosarcinaceae were more closely related to mcrI sequences than to mcrII sequences, suggesting that members of the Methanosarcinaceae do not have distinct mcrII genes.
DNA sequence analysis of ARS elements from chromosome III of Saccharomyces cerevisiae: identification of a new conserved sequence.

PubMed Central

Palzkill, T G; Oliver, S G; Newlon, C S

1986-01-01

Four fragments of Saccharomyces cerevisiae chromosome III DNA which carry ARS elements have been sequenced. Each fragment contains multiple copies of sequences that have at least 10 out of 11 bases of homology to a previously reported 11 bp core consensus sequence. A survey of these new ARS sequences and previously reported sequences revealed the presence of an additional 11 bp conserved element located on the 3' side of the T-rich strand of the core consensus. Subcloning analysis as well as deletion and transposon insertion mutagenesis of ARS fragments support a role for 3' conserved sequence in promoting ARS activity. PMID:3529036
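The "at least 10 out of 11 bases" criterion amounts to a sliding-window search that tolerates one mismatch. The sketch below shows that search in a hedged form: the function name is hypothetical, only the given strand is scanned, and the consensus is passed in by the caller because the abstract does not spell out the 11 bp sequence.

```python
def near_matches(seq: str, consensus: str, max_mismatches: int = 1):
    """Return (start, window, mismatches) for windows within max_mismatches.

    Assumption: consensus is a plain string without degenerate (IUPAC)
    symbols; a degenerate consensus would need a per-position match table.
    """
    seq = seq.upper()
    consensus = consensus.upper()
    k = len(consensus)
    hits = []
    for start in range(len(seq) - k + 1):
        window = seq[start:start + k]
        mismatches = sum(a != b for a, b in zip(window, consensus))
        if mismatches <= max_mismatches:
            hits.append((start, window, mismatches))
    return hits
```

For an 11-mer, `max_mismatches=1` corresponds to the 10-of-11 threshold; scanning the reverse complement as well would be needed to cover both strands.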
Single-cell genomic sequencing using Multiple Displacement Amplification.

PubMed

Lasken, Roger S

2007-10-01

Single microbial cells can now be sequenced using DNA amplified by the Multiple Displacement Amplification (MDA) reaction. The few femtograms of DNA in a bacterium are amplified into micrograms of high molecular weight DNA suitable for DNA library construction and Sanger sequencing. The MDA-generated DNA also performs well when used directly as template for pyrosequencing by the 454 Life Sciences method. While MDA from single cells loses some of the genomic sequence, this approach will greatly accelerate the pace of sequencing from uncultured microbes. The genetically linked sequences from single cells are also a powerful tool to be used in guiding genomic assembly of shotgun sequences of multiple organisms from environmental DNA extracts (metagenomic sequences).
MALDI Top-Down sequencing: calling N- and C-terminal protein sequences with high confidence and speed.

PubMed

Suckau, Detlev; Resemann, Anja

2009-12-01

The ability to match Top-Down protein sequencing (TDS) results by MALDI-TOF to protein sequences by classical protein database searching was evaluated in this work. These analyses yielded the protein identity, the simultaneous assignment of the N- and C-termini, and protein sequences of up to 70 residues from either terminus. In combination with de novo sequencing using the MALDI-TDS data, even fusion proteins were assigned and the detailed sequence around the fusion site was elucidated. MALDI-TDS made it possible to match protein sequences quickly and efficiently and to validate recombinant protein structures, in particular protein termini, at the level of undigested proteins.
Computational analysis of sequence selection mechanisms.

PubMed

Meyerguz, Leonid; Grasso, Catherine; Kleinberg, Jon; Elber, Ron

2004-04-01

Mechanisms leading to gene variations are responsible for the diversity of species and are important components of the theory of evolution. One constraint on gene evolution is that of protein foldability; the three-dimensional shapes of proteins must be thermodynamically stable. We explore the impact of this constraint and calculate properties of foldable sequences using 3660 structures from the Protein Data Bank. We seek a selection function that receives sequences as input, and outputs survival probability based on sequence fitness to structure. We compute the number of sequences that match a particular protein structure with energy lower than the native sequence, the density of the number of sequences, the entropy, and the "selection" temperature. The mechanism of structure selection for sequences longer than 200 amino acids is approximately universal. For shorter sequences, it is not. We speculate on concrete evolutionary mechanisms that show this behavior.
Nonparametric Combinatorial Sequence Models

NASA Astrophysics Data System (ADS)

Wauthier, Fabian L.; Jordan, Michael I.; Jojic, Nebojsa

This work considers biological sequences that exhibit combinatorial structures in their composition: groups of positions of the aligned sequences are "linked" and covary as one unit across sequences. If multiple such groups exist, complex interactions can emerge between them. Sequences of this kind arise frequently in biology but methodologies for analyzing them are still being developed. This paper presents a nonparametric prior on sequences which allows combinatorial structures to emerge and which induces a posterior distribution over factorized sequence representations. We carry out experiments on three sequence datasets which indicate that combinatorial structures are indeed present and that combinatorial sequence models can more succinctly describe them than simpler mixture models. We conclude with an application to MHC binding prediction which highlights the utility of the posterior distribution induced by the prior. By integrating out the posterior our method compares favorably to leading binding predictors.
Real-Time DNA Sequencing in the Antarctic Dry Valleys Using the Oxford Nanopore Sequencer

PubMed Central

Johnson, Sarah S.; Zaikova, Elena; Goerlitz, David S.; Bai, Yu; Tighe, Scott W.

2017-01-01

The ability to sequence DNA outside of the laboratory setting has enabled novel research questions to be addressed in the field in diverse areas, ranging from environmental microbiology to viral epidemics. Here, we demonstrate the application of offline DNA sequencing of environmental samples using a hand-held nanopore sequencer in a remote field location: the McMurdo Dry Valleys, Antarctica. Sequencing was performed using a MK1B MinION sequencer from Oxford Nanopore Technologies (ONT; Oxford, United Kingdom) that was equipped with software to operate without internet connectivity. One-direction (1D) genomic libraries were prepared using portable field techniques on DNA isolated from desiccated microbial mats. By adequately insulating the sequencer and laptop, it was possible to run the sequencing protocol for up to 2½ h under arduous conditions. PMID:28337073
WebLogo: A Sequence Logo Generator

PubMed Central

Crooks, Gavin E.; Hon, Gary; Chandonia, John-Marc; Brenner, Steven E.

2004-01-01

WebLogo generates sequence logos, graphical representations of the patterns within a multiple sequence alignment. Sequence logos provide a richer and more precise description of sequence similarity than consensus sequences and can rapidly reveal significant features of the alignment otherwise difficult to perceive. Each logo consists of stacks of letters, one stack for each position in the sequence. The overall height of each stack indicates the sequence conservation at that position (measured in bits), whereas the height of symbols within the stack reflects the relative frequency of the corresponding amino or nucleic acid at that position. WebLogo has been enhanced recently with additional features and options, to provide a convenient and highly configurable sequence logo generator. A command line interface and the complete, open WebLogo source code are available for local installation and customization. PMID:15173120
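The stack and letter heights described above follow from the Shannon entropy of each alignment column. The sketch below computes them for a single column under simplifying assumptions: the function name is made up for illustration, and the small-sample correction that logo software may apply is deliberately omitted.

```python
from collections import Counter
from math import log2

def logo_heights(column: str, alphabet_size: int = 4):
    """Return (stack_height_bits, {symbol: letter_height_bits}) for one column.

    Stack height = log2(alphabet_size) - entropy of the observed frequencies.
    Each letter's height is its relative frequency times the stack height.
    Assumption: no small-sample correction; alphabet_size is 4 for nucleic
    acids and 20 for proteins.
    """
    if not column:
        raise ValueError("column must contain at least one symbol")
    n = len(column)
    freqs = {symbol: count / n for symbol, count in Counter(column).items()}
    entropy = -sum(f * log2(f) for f in freqs.values())
    stack = log2(alphabet_size) - entropy
    return stack, {symbol: f * stack for symbol, f in freqs.items()}
```

A perfectly conserved nucleotide column reaches the maximum of 2 bits, while a column with all four bases at equal frequency has height 0.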
Single-cell genome sequencing at ultra-high-throughput with microfluidic droplet barcoding.

PubMed

Lan, Freeman; Demaree, Benjamin; Ahmed, Noorsher; Abate, Adam R

2017-07-01

The application of single-cell genome sequencing to large cell populations has been hindered by technical challenges in isolating single cells during genome preparation. Here we present single-cell genomic sequencing (SiC-seq), which uses droplet microfluidics to isolate, fragment, and barcode the genomes of single cells, followed by Illumina sequencing of pooled DNA. We demonstrate ultra-high-throughput sequencing of >50,000 cells per run in a synthetic community of Gram-negative and Gram-positive bacteria and fungi. The sequenced genomes can be sorted in silico based on characteristic sequences. We use this approach to analyze the distributions of antibiotic-resistance genes, virulence factors, and phage sequences in microbial communities from an environmental sample. The ability to routinely sequence large populations of single cells will enable the de-convolution of genetic heterogeneity in diverse cell populations.
A measurement of disorder in binary sequences

NASA Astrophysics Data System (ADS)

Gong, Longyan; Wang, Haihong; Cheng, Weiwen; Zhao, Shengmei

2015-03-01

We propose a complex quantity, AL, to characterize the degree of disorder of L-length binary symbolic sequences. As examples, we apply it to typical random and deterministic sequences. One kind of random sequence is generated from a periodic binary sequence and the other is generated from the logistic map. The deterministic sequences are the Fibonacci and Thue-Morse sequences. In these analyzed sequences, we find that the modulus of AL, denoted by |AL|, is (statistically) equivalent to the Boltzmann entropy, the metric entropy, the conditional block entropy and/or other quantities, so it is a useful quantitative measure of disorder. It can serve as a fruitful index to discern which sequence is more disordered. Moreover, there is one and only one value of |AL| for the overall disorder characteristics. It has extremely low computational cost and can be easily realized experimentally. For all these reasons, we believe that the proposed measure of disorder is a valuable complement to existing ones for symbolic sequences.
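The abstract does not define AL, so it cannot be reproduced here. As a hedged illustration of the kind of comparison described, the sketch below generates the Thue-Morse and Fibonacci sequences and computes a plain block (Shannon) entropy, one of the reference quantities the abstract names. All function names are hypothetical.

```python
import math
from collections import Counter

def thue_morse(n):
    """First n terms: parity of the number of 1 bits in each index."""
    return [bin(i).count("1") % 2 for i in range(n)]

def fibonacci_word(n):
    """First n symbols of the Fibonacci word via the substitution 0 -> 01, 1 -> 0."""
    word = [0]
    while len(word) < n:
        word = [s for c in word for s in ((0, 1) if c == 0 else (0,))]
    return word[:n]

def block_entropy(seq, k):
    """Shannon entropy (bits) of the overlapping length-k blocks in seq."""
    blocks = Counter(tuple(seq[i:i + k]) for i in range(len(seq) - k + 1))
    total = sum(blocks.values())
    return -sum(c / total * math.log2(c / total) for c in blocks.values())

periodic = [0, 1] * 500
for name, seq in [("periodic", periodic),
                  ("Fibonacci", fibonacci_word(1000)),
                  ("Thue-Morse", thue_morse(1000))]:
    print(name, round(block_entropy(seq, 3), 3))
```

A lower block entropy means fewer distinct local patterns, which is one simple way to rank sequences by disorder.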
Quantitative phenotyping via deep barcode sequencing

PubMed Central

Smith, Andrew M.; Heisler, Lawrence E.; Mellor, Joseph; Kaper, Fiona; Thompson, Michael J.; Chee, Mark; Roth, Frederick P.; Giaever, Guri; Nislow, Corey

2009-01-01

Next-generation DNA sequencing technologies have revolutionized diverse genomics applications, including de novo genome sequencing, SNP detection, chromatin immunoprecipitation, and transcriptome analysis. Here we apply deep sequencing to genome-scale fitness profiling to evaluate yeast strain collections in parallel. This method, Barcode analysis by Sequencing, or “Bar-seq,” outperforms the current benchmark barcode microarray assay in terms of both dynamic range and throughput. When applied to a complex chemogenomic assay, Bar-seq quantitatively identifies drug targets, with performance superior to the benchmark microarray assay. We also show that Bar-seq is well-suited for a multiplex format. We completely re-sequenced and re-annotated the yeast deletion collection using deep sequencing, found that ∼20% of the barcodes and common priming sequences varied from expectation, and used this revised list of barcode sequences to improve data quality. Together, this new assay and analysis routine provide a deep-sequencing-based toolkit for identifying gene–environment interactions on a genome-wide scale. PMID:19622793
Resurgence of Integrated Behavioral Units

PubMed Central

Bachá-Méndez, Gustavo; Reid, Alliston K; Mendoza-Soylovna, Adela

2007-01-01

Two experiments with rats examined the dynamics of well-learned response sequences when reinforcement contingencies were changed. Both experiments contained four phases, each of which reinforced a 2-response sequence of lever presses until responding was stable. The contingencies then were shifted to a new reinforced sequence until responding was again stable. Extinction-induced resurgence of previously reinforced, and then extinguished, heterogeneous response sequences was observed in all subjects in both experiments. These sequences were demonstrated to be integrated behavioral units, controlled by processes acting at the level of the entire sequence. Response-level processes were also simultaneously operative. Errors in sequence production were strongly influenced by the terminal, not the initial, response in the currently reinforced sequence, but not by the previously reinforced sequence. These studies demonstrate that sequence-level and response-level processes can operate simultaneously in integrated behavioral units. Resurgence and the development of integrated behavioral units may be dissociated; thus the observation of one does not necessarily imply the other. PMID:17345948
Effect of Next-Generation Exome Sequencing Depth for Discovery of Diagnostic Variants.

PubMed

Kim, Kyung; Seong, Moon-Woo; Chung, Won-Hyong; Park, Sung Sup; Leem, Sangseob; Park, Won; Kim, Jihyun; Lee, KiYoung; Park, Rae Woong; Kim, Namshin

2015-06-01

Sequencing depth, which is directly related to the cost and time required for the generation, processing, and maintenance of next-generation sequencing data, is an important factor in the practical utilization of such data in clinical fields. Unfortunately, identifying an exome sequencing depth adequate for clinical use is a challenge that has not been addressed extensively. Here, we investigate the effect of exome sequencing depth on the discovery of sequence variants for clinical use. Toward this, we sequenced ten germ-line blood samples from breast cancer patients on the Illumina platform GAII(x) at a high depth of ~200×. We observed that most function-related diverse variants in the human exonic regions could be detected at a sequencing depth of 120×. Furthermore, investigation using a diagnostic gene set showed that the number of clinical variants identified using exome sequencing reached a plateau at an average sequencing depth of about 120×. Moreover, the phenomena were consistent across the breast cancer samples.
Carbohydrate degrading polypeptide and uses thereof

DOEpatents

Sagt, Cornelis Maria Jacobus; Schooneveld-Bergmans, Margot Elisabeth Francoise; Roubos, Johannes Andries; Los, Alrik Pieter

2015-10-20

The invention relates to a polypeptide having carbohydrate material degrading activity which comprises the amino acid sequence set out in SEQ ID NO: 2 or an amino acid sequence encoded by the nucleotide sequence of SEQ ID NO: 1 or SEQ ID NO: 4, or a variant polypeptide or variant polynucleotide thereof, wherein the variant polypeptide has at least 96% sequence identity with the sequence set out in SEQ ID NO: 2 or the variant polynucleotide encodes a polypeptide that has at least 96% sequence identity with the sequence set out in SEQ ID NO: 2. The invention features the full length coding sequence of the novel gene as well as the amino acid sequence of the full-length functional protein and functional equivalents of the gene or the amino acid sequence. The invention also relates to methods for using the polypeptide in industrial processes. Also included in the invention are cells transformed with a polynucleotide according to the invention suitable for producing these proteins.
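A threshold such as "at least 96% sequence identity" presupposes a way of computing percent identity. The sketch below is one common formulation, under stated assumptions: the two sequences are already aligned to equal length with `-` as the gap character, and the denominator is the number of alignment columns. The patent may define identity differently, and the function name and sequences are hypothetical.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent of alignment columns in which both sequences carry the same residue."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same length")
    # Ignore columns where both sequences have a gap.
    columns = [(a, b) for a, b in zip(aligned_a, aligned_b)
               if not (a == gap and b == gap)]
    matches = sum(1 for a, b in columns if a == b and a != gap)
    return 100.0 * matches / len(columns)

# Made-up example: one substitution in ten aligned residues.
identity = percent_identity("ACDEFGHIKL", "ACDEFGHIKV")
print(identity, identity >= 96.0)
```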

High-Resolution Sequence-Function Mapping of Full-Length Proteins

PubMed Central

Kowalsky, Caitlin A.; Klesmith, Justin R.; Stapleton, James A.; Kelly, Vince; Reichkitzer, Nolan; Whitehead, Timothy A.

2015-01-01

Comprehensive sequence-function mapping involves detailing the fitness contribution of every possible single mutation to a gene by comparing the abundance of each library variant before and after selection for the phenotype of interest. Deep sequencing of library DNA allows frequency reconstruction for tens of thousands of variants in a single experiment, yet the short read lengths of current sequencers make it challenging to probe genes encoding full-length proteins. Here we extend the scope of sequence-function maps to entire protein sequences with a modular, universal sequence tiling method. We demonstrate the approach with both growth-based selections and FACS screening, offer parameters and best practices that simplify design of experiments, and present analytical solutions to normalize data across independent selections. Using this protocol, sequence-function maps covering full sequences can be obtained in four to six weeks. Best practices introduced in this manuscript are fully compatible with, and complementary to, other recently published sequence-function mapping protocols. PMID:25790064
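Comparing variant abundance before and after selection is often expressed as a log2 enrichment ratio relative to wild type. The sketch below shows that generic calculation only; it is not the normalization derived in the paper. The pseudocount, the function name, the variant labels, and the read counts are all assumptions for illustration.

```python
import math

def log2_enrichment(pre_counts, post_counts, wild_type="WT", pseudocount=0.5):
    """Log2 change in variant frequency across selection, relative to wild type."""
    pre_total = sum(pre_counts.values())
    post_total = sum(post_counts.values())

    def ratio(variant):
        # The pseudocount avoids taking log of zero for variants lost in selection.
        pre = (pre_counts.get(variant, 0) + pseudocount) / pre_total
        post = (post_counts.get(variant, 0) + pseudocount) / post_total
        return math.log2(post / pre)

    wt = ratio(wild_type)
    return {v: ratio(v) - wt for v in pre_counts}

# Made-up read counts before and after selection.
pre = {"WT": 1000, "A5G": 100, "L10P": 100}
post = {"WT": 1000, "A5G": 200, "L10P": 10}
for variant, score in log2_enrichment(pre, post).items():
    print(variant, round(score, 2))
```

Positive scores mark variants enriched relative to wild type, and negative scores mark depleted ones.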
On the joint spectral density of bivariate random sequences. Thesis Technical Report No. 21

NASA Technical Reports Server (NTRS)

Aalfs, David D.

1995-01-01

For univariate random sequences, the power spectral density acts like a probability density function of the frequencies present in the sequence. This dissertation extends that concept to bivariate random sequences. For this purpose, a function called the joint spectral density is defined that represents a joint probability weighing of the frequency content of pairs of random sequences. Given a pair of random sequences, the joint spectral density is not uniquely determined in the absence of any constraints. Two approaches to constraining the sequences are suggested: (1) assume the sequences are the margins of some stationary random field, (2) assume the sequences conform to a particular model that is linked to the joint spectral density. For both approaches, the properties of the resulting sequences are investigated in some detail, and simulation is used to corroborate theoretical results. It is concluded that under either of these two constraints, the joint spectral density can be computed from the non-stationary cross-correlation.
Applications of Single-Cell Sequencing for Multiomics.

PubMed

Xu, Yungang; Zhou, Xiaobo

2018-01-01

Single-cell sequencing interrogates the sequence or chromatin information from individual cells with advanced next-generation sequencing technologies. It provides a higher resolution of cellular differences and a better understanding of the underlying genetic and epigenetic mechanisms of an individual cell in the context of its survival and adaptation to microenvironment. However, it is more challenging to perform single-cell sequencing and downstream data analysis, owing to the minimal amount of starting materials, sample loss, and contamination. In addition, due to the picogram level of the amount of nucleic acids used, heavy amplification is often needed during sample preparation of single-cell sequencing, resulting in the uneven coverage, noise, and inaccurate quantification of sequencing data. All these unique properties raise challenges in and thus high demands for computational methods that specifically fit single-cell sequencing data. We here comprehensively survey the current strategies and challenges for multiple single-cell sequencing, including single-cell transcriptome, genome, and epigenome, beginning with a brief introduction to multiple sequencing techniques for single cells.
Innovative /ye/ and /we/ sequences in recent loans in Japanese

NASA Astrophysics Data System (ADS)

Vance, Timothy; Matsugu, Yuka

2005-04-01

The GV sequences /ye/ and /we/ do not occur in Japanese except perhaps in recent loans. Katakana spellings of the relevant loans in authoritative dictionaries are inconsistent, and it is not clear whether native speakers treat them as containing the GV sequences /ye/ and /we/ or as containing the VV sequences /ie/ and /ue/. Native speakers of Japanese with minimal exposure to spoken English were recorded producing some relevant loans in response to picture prompts. The same speakers were also recorded producing some native words containing uncontroversial /ie/ and /ue/ sequences. All the productions are being analyzed acoustically to determine whether they show the expected contrast between GV and VV sequences. A VV sequence is disyllabic (and bimoraic) and should therefore have greater duration and more gradual formant movements than a monosyllabic (and monomoraic) GV sequence. Utterance-initially, a VV sequence should have a LH pitch pattern and should be preceded by a nondistinctive glottal stop, whereas a GV sequence should have a H pitch pattern and should have smooth onset.
[The principle and application of the single-molecule real-time sequencing technology].

PubMed

Yanhu, Liu; Lu, Wang; Li, Yu

2015-03-01

The last decade witnessed the explosive development of third-generation sequencing strategies, including single-molecule real-time sequencing (SMRT), true single-molecule sequencing (tSMS™) and single-molecule nanopore DNA sequencing. In this review, we summarize the principle, performance and application of the SMRT sequencing technology. Compared with the traditional Sanger method and next-generation sequencing (NGS) technologies, the SMRT approach has several advantages, including long read length, high speed, freedom from PCR amplification and the capability of direct detection of epigenetic modifications. However, its low accuracy, most of which results from insertions and deletions, is a notable disadvantage, so the raw sequence data need to be corrected before assembly. To date, SMRT is a good fit for de novo genomic sequencing and high-quality assembly of small genomes. In the future, it is expected to play an important role in epigenetics, transcriptomic sequencing, and assemblies of large genomes.
Life Cycle Evolution and Systematics of Campanulariid Hydrozoans

DTIC Science & Technology

2004-09-01

kit according to manufacturer’s protocol. Purified PCR product was cycle-sequenced using either Big Dye 2 or 3 sequencing chemistry (ABI), following...ethidium bromide and purified with PCR purification kits (Qiagen). Purified products were cycle-sequenced with either Big Dye 2 or 3 sequencing chemistry...PCR purification kit (Qiagen). The purified product was cycle-sequenced using Big Dye 2 sequencing chemistry (ABI) following the manufacturer’s
What is a melody? On the relationship between pitch and brightness of timbre

PubMed Central

Cousineau, Marion; Carcagno, Samuele; Demany, Laurent; Pressnitzer, Daniel

2014-01-01

Previous studies showed that the perceptual processing of sound sequences is more efficient when the sounds vary in pitch than when they vary in loudness. We show here that sequences of sounds varying in brightness of timbre are processed with the same efficiency as pitch sequences. The sounds used consisted of two simultaneous pure tones one octave apart, and the listeners’ task was to make same/different judgments on pairs of sequences varying in length (one, two, or four sounds). In one condition, brightness of timbre was varied within the sequences by changing the relative level of the two pure tones. In other conditions, pitch was varied by changing fundamental frequency, or loudness was varied by changing the overall level. In all conditions, only two possible sounds could be used in a given sequence, and these two sounds were equally discriminable. When sequence length increased from one to four, discrimination performance decreased substantially for loudness sequences, but to a smaller extent for brightness sequences and pitch sequences. In the latter two conditions, sequence length had a similar effect on performance. These results suggest that the processes dedicated to pitch and brightness analysis, when probed with a sequence-discrimination task, share unexpected similarities. PMID:24478638
Microbial community analysis of the hypersaline water of the Dead Sea using high-throughput amplicon sequencing.

PubMed

Jacob, Jacob H; Hussein, Emad I; Shakhatreh, Muhamad Ali K; Cornelison, Christopher T

2017-10-01

Amplicon sequencing using next-generation technology (bTEFAP®) has been utilized in describing the diversity of Dead Sea microbiota. The investigated area is a well-known salt lake in the western part of Jordan found in the lowest geographical location in the world (more than 420 m below sea level) and characterized by extreme salinity (approximately 34%) in addition to other extreme conditions (low pH, unique ionic composition different from sea water). DNA was extracted from Dead Sea water. A total of 314,310 small subunit RNA (SSU rRNA) sequences were parsed, and 288,452 sequences were then clustered. For alpha diversity analysis, the sample was rarefied to 3,000 sequences. The Shannon-Wiener index curve plot reached a plateau at approximately 3,000 sequences, indicating that sequencing depth was sufficient to capture the full scope of microbial diversity. Archaea were found to dominate the sequences (52%), whereas Bacteria constituted 45% of the sequences. Altogether, prokaryotic sequences (which constitute 97% of all sequences) were found to predominate. The findings expand on previous studies by using high-throughput amplicon sequencing to describe the microbial community in an environment which in recent years has been shown to hide some interesting diversity. © 2017 The Authors. MicrobiologyOpen published by John Wiley & Sons Ltd.
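The rarefaction and Shannon-Wiener steps mentioned above can be sketched as follows. This is a minimal illustration, not the study's pipeline: it uses the natural logarithm, subsamples without replacement, and takes the domain-level percentages from the abstract as stand-in counts, whereas a real analysis would presumably work on finer taxonomic units. The function names and the seed are hypothetical.

```python
import math
import random
from collections import Counter

def shannon_index(counts):
    """Shannon-Wiener index H' = -sum(p * ln p) over non-empty categories."""
    total = sum(counts)
    return -sum(c / total * math.log(c / total) for c in counts if c > 0)

def rarefy(counts_by_taxon, depth, seed=0):
    """Subsample `depth` reads without replacement to equalize sampling effort."""
    rng = random.Random(seed)
    pool = [taxon for taxon, c in counts_by_taxon.items() for _ in range(c)]
    return Counter(rng.sample(pool, depth))

# Stand-in counts per 100 reads, from the percentages quoted in the abstract.
sample = {"Archaea": 52, "Bacteria": 45, "Other": 3}
subsample = rarefy(sample, depth=50)
print(subsample)
print(round(shannon_index(sample.values()), 3))
print(round(shannon_index(subsample.values()), 3))
```

Plotting the index against increasing rarefaction depth gives the kind of curve whose plateau the abstract uses to argue that sequencing depth was sufficient.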
JANE: efficient mapping of prokaryotic ESTs and variable length sequence reads on related template genomes

PubMed Central

2009-01-01

Background: ESTs or variable sequence reads can be available in prokaryotic studies well before a complete genome is known. Use cases include (i) transcriptome studies or (ii) single cell sequencing of bacteria. Without suitable software their further analysis and mapping would have to await finalization of the corresponding genome. Results: The tool JANE rapidly maps ESTs or variable sequence reads in prokaryotic sequencing and transcriptome efforts to related template genomes. It provides an easy-to-use graphics interface for information retrieval and a toolkit for EST or nucleotide sequence function prediction. Furthermore, we developed for rapid mapping an enhanced sequence alignment algorithm which reassembles and evaluates high scoring pairs provided from the BLAST algorithm. Rapid assembly on and replacement of the template genome by sequence reads or mapped ESTs is achieved. This is illustrated (i) by data from Staphylococci as well as from a Blattabacteria sequencing effort, (ii) mapping single cell sequencing reads is shown for poribacteria to sister phylum representative Rhodopirellula baltica SH1. The algorithm has been implemented in a web-server accessible at http://jane.bioapps.biozentrum.uni-wuerzburg.de. Conclusion: Rapid prokaryotic EST mapping or mapping of sequence reads is achieved applying JANE even without knowing the cognate genome sequence. PMID:19943962
Sedimentary sequence evolution in a Foredeep basin: Eastern Venezuela

DOE Office of Scientific and Technical Information (OSTI.GOV)

Bejarano, C.; Funes, D.; Sarzalho, S.

1996-08-01

Well log-seismic sequence stratigraphy analysis in the Eastern Venezuela Foreland Basin leads to study of the evolution of sedimentary sequences onto the Cretaceous-Paleocene passive margin. This basin comprises two different foredeep sub-basins: the older Guarico sub-basin to the west and the younger Maturin sub-basin to the east. A foredeep switching between these two sub-basins is observed at 12.5 m.y. Seismic interpretation and well log sections across the study area show sedimentary sequences with transgressive sands and coastal onlaps to the east-southeast for the Guarico sub-basin, as well as truncations below the switching sequence (12.5 m.y.), and the Maturin sub-basin shows apparent coastal onlaps to the west-northwest, as well as a marine onlap (deeper water) in the west, where it starts to establish. Sequence stratigraphy analysis of these sequences with well logs allowed the study of the evolution of the stratigraphic section from Paleocene to middle Miocene (68.0-12.0 m.y.). On the basis of well log patterns, the sequences were divided into regressive-transgressive-regressive sedimentary cycles caused by changes in relative sea level. Facies distributions were analyzed and the sequences were divided into simple sequences or sub-sequences of a higher frequency than third-order depositional sequences.
FOUNTAIN: A JAVA open-source package to assist large sequencing projects

PubMed Central

Buerstedde, Jean-Marie; Prill, Florian

2001-01-01

Background: Better automation, lower cost per reaction and a heightened interest in comparative genomics have led to a dramatic increase in DNA sequencing activities. Although the large sequencing projects of specialized centers are supported by in-house bioinformatics groups, many smaller laboratories face difficulties managing the appropriate processing and storage of their sequencing output. The challenges include documentation of clones, templates and sequencing reactions, and the storage, annotation and analysis of the large number of generated sequences. Results: We describe here a new program, named FOUNTAIN, for the management of large sequencing projects. FOUNTAIN uses the JAVA computer language and data storage in a relational database. Starting with a collection of sequencing objects (clones), the program generates and stores information related to the different stages of the sequencing project using a web browser interface for user input. The generated sequences are subsequently imported and annotated based on BLAST searches against the public databases. In addition, simple algorithms to cluster sequences and determine putative polymorphic positions are implemented. Conclusions: A simple, but flexible and scalable software package is presented to facilitate data generation and storage for large sequencing projects. FOUNTAIN is open source and largely platform and database independent, and we hope it will be improved and extended in a community effort. PMID:11591214
Sequence-specific bias correction for RNA-seq data using recurrent neural networks.

PubMed

Zhang, Yao-Zhong; Yamaguchi, Rui; Imoto, Seiya; Miyano, Satoru

2017-01-25

The recent success of deep learning techniques in machine learning and artificial intelligence has stimulated a great deal of interest among bioinformaticians, who now wish to bring the power of deep learning to bear on a host of bioinformatical problems. Deep learning is ideally suited for biological problems that require automatic or hierarchical feature representation for biological data when prior knowledge is limited. In this work, we address the sequence-specific bias correction problem for RNA-seq data using Recurrent Neural Networks (RNNs) to model nucleotide sequences without pre-determining sequence structures. The sequence-specific bias of a read is then calculated based on the sequence probabilities estimated by RNNs, and used in the estimation of gene abundance. We explore the application of two popular RNN recurrent units for this task and demonstrate that RNN-based approaches provide a flexible way to model nucleotide sequences without knowledge of predetermined sequence structures. Our experiments show that training an RNN-based nucleotide sequence model is efficient and that RNN-based bias correction methods compare well with the state-of-the-art sequence-specific bias correction method on the commonly used MAQC-III data set. RNNs provide an alternative and flexible way to calculate sequence-specific bias without explicitly pre-determining sequence structures.
Intra-Genomic Internal Transcribed Spacer Region Sequence Heterogeneity and Molecular Diagnosis in Clinical Microbiology.

PubMed

Zhao, Ying; Tsang, Chi-Ching; Xiao, Meng; Cheng, Jingwei; Xu, Yingchun; Lau, Susanna K P; Woo, Patrick C Y

2015-10-22

Internal transcribed spacer region (ITS) sequencing is the most extensively used technology for accurate molecular identification of fungal pathogens in clinical microbiology laboratories. Intra-genomic ITS sequence heterogeneity, which makes fungal identification based on direct sequencing of PCR products difficult, has rarely been reported in pathogenic fungi. During the process of performing ITS sequencing on 71 yeast strains isolated from various clinical specimens, direct sequencing of the PCR products showed ambiguous sequences in six of them. After cloning the PCR products into plasmids for sequencing, interpretable sequencing electropherograms could be obtained. For each of the six isolates, 10-49 clones were selected for sequencing and two to seven intra-genomic ITS copies were detected. The identities of these six isolates were confirmed to be Candida glabrata (n=2), Pichia (Candida) norvegensis (n=2), Candida tropicalis (n=1) and Saccharomyces cerevisiae (n=1). Multiple sequence alignment revealed that one to four intra-genomic ITS polymorphic sites were present in the six isolates, and all these polymorphic sites were located in the ITS1 and/or ITS2 regions. We report and describe the first evidence of intra-genomic ITS sequence heterogeneity in four different pathogenic yeasts, which occurred exclusively in the ITS1 and ITS2 spacer regions for the six isolates in this study.
Intra-Genomic Internal Transcribed Spacer Region Sequence Heterogeneity and Molecular Diagnosis in Clinical Microbiology

PubMed Central

Zhao, Ying; Tsang, Chi-Ching; Xiao, Meng; Cheng, Jingwei; Xu, Yingchun; Lau, Susanna K. P.; Woo, Patrick C. Y.

2015-01-01

Internal transcribed spacer region (ITS) sequencing is the most extensively used technology for accurate molecular identification of fungal pathogens in clinical microbiology laboratories. Intra-genomic ITS sequence heterogeneity, which makes fungal identification based on direct sequencing of PCR products difficult, has rarely been reported in pathogenic fungi. During the process of performing ITS sequencing on 71 yeast strains isolated from various clinical specimens, direct sequencing of the PCR products showed ambiguous sequences in six of them. After cloning the PCR products into plasmids for sequencing, interpretable sequencing electropherograms could be obtained. For each of the six isolates, 10–49 clones were selected for sequencing and two to seven intra-genomic ITS copies were detected. The identities of these six isolates were confirmed to be Candida glabrata (n = 2), Pichia (Candida) norvegensis (n = 2), Candida tropicalis (n = 1) and Saccharomyces cerevisiae (n = 1). Multiple sequence alignment revealed that one to four intra-genomic ITS polymorphic sites were present in the six isolates, and all these polymorphic sites were located in the ITS1 and/or ITS2 regions. We report and describe the first evidence of intra-genomic ITS sequence heterogeneity in four different pathogenic yeasts, which occurred exclusively in the ITS1 and ITS2 spacer regions for the six isolates in this study. PMID:26506340
New families of site-specific repetitive DNA sequences that comprise constitutive heterochromatin of the Syrian hamster (Mesocricetus auratus, Cricetinae, Rodentia).

PubMed

Yamada, Kazuhiko; Kamimura, Eikichi; Kondo, Mariko; Tsuchiya, Kimiyuki; Nishida-Umehara, Chizuko; Matsuda, Yoichi

2006-02-01

We molecularly cloned new families of site-specific repetitive DNA sequences from BglII- and EcoRI-digested genomic DNA of the Syrian hamster (Mesocricetus auratus, Cricetinae, Rodentia) and characterized them by chromosome in situ hybridization and filter hybridization. They were classified into six different types of repetitive DNA sequence families according to chromosomal distribution and genome organization. The hybridization patterns of the sequences were consistent with the distribution of C-positive bands and/or Hoechst-stained heterochromatin. The centromeric major satellite DNA and sex chromosome-specific and telomeric region-specific repetitive sequences were conserved in the same genus (Mesocricetus) but divergent in different genera. The chromosome-2-specific sequence was conserved in two genera, Mesocricetus and Cricetulus, and a low copy number of repetitive sequences on the heterochromatic chromosome arms were conserved in the subfamily Cricetinae but not in the subfamily Calomyscinae. By contrast, the other type of repetitive sequences on the heterochromatic chromosome arms, which had sequence similarities to a LINE sequence of rodents, was conserved through the three subfamilies, Cricetinae, Calomyscinae and Murinae. The nucleotide divergence of the repetitive sequences of heterochromatin was well correlated with the phylogenetic relationships of the Cricetinae species, and each sequence has been independently amplified and diverged in the same genome.
Identification of Genomic Insertion and Flanking Sequence of G2-EPSPS and GAT Transgenes in Soybean Using Whole Genome Sequencing Method.

PubMed

Guo, Bingfu; Guo, Yong; Hong, Huilong; Qiu, Li-Juan

2016-01-01

Molecular characterization of the sequence flanking an exogenous fragment insertion is essential for safety assessment and labeling of genetically modified organisms (GMOs). In this study, the T-DNA insertion sites and flanking sequences were identified in two newly developed transgenic glyphosate-tolerant soybeans, GE-J16 and ZH10-6, based on the whole genome sequencing (WGS) method. More than 22.4 Gb of sequence data (∼21× coverage) for each line was generated on the Illumina HiSeq 2500 platform. The junction reads mapped to boundaries of T-DNA and flanking sequences in these two events were identified by comparing all sequencing reads with the soybean reference genome and the sequence of the transgenic vector. The putative insertion loci and flanking sequences were further confirmed by PCR amplification, Sanger sequencing, and co-segregation analysis. All these analyses supported that exogenous T-DNA fragments were integrated at positions Chr19: 50543767-50543792 and Chr17: 7980527-7980541 in these two transgenic lines. Identification of genomic insertion sites of the G2-EPSPS and GAT transgenes will facilitate the utilization of their glyphosate-tolerant traits in soybean breeding programs. These results also demonstrated that WGS was a cost-effective and rapid method for identifying sites of T-DNA insertions and flanking sequences in soybean.
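The relationship between sequence yield and coverage quoted above is simple arithmetic: mean coverage is total sequenced bases divided by genome size. The sketch below checks the order of magnitude, assuming a soybean genome size of roughly 1.1 Gb. That figure is an assumption not stated in the abstract, and the exact value the authors used may differ.

```python
def mean_coverage(total_bases, genome_size):
    """Average number of times each genome position is sequenced."""
    return total_bases / genome_size

# Assumption: approximate soybean genome size in base pairs (not from the abstract).
SOYBEAN_GENOME_BP = 1.1e9
# From the abstract: more than 22.4 Gb of sequence per line.
TOTAL_BASES = 22.4e9

print(round(mean_coverage(TOTAL_BASES, SOYBEAN_GENOME_BP), 1))
```

With these inputs the result is close to the roughly 21× reported, and the remaining difference depends on the genome size assumed.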
Geoseq: a tool for dissecting deep-sequencing datasets.

PubMed

Gurtowski, James; Cancio, Anthony; Shah, Hardik; Levovitz, Chaya; George, Ajish; Homann, Robert; Sachidanandam, Ravi

2010-10-12

Datasets generated on deep-sequencing platforms have been deposited in various public repositories such as the Gene Expression Omnibus (GEO), Sequence Read Archive (SRA) hosted by the NCBI, or the DNA Data Bank of Japan (DDBJ). Despite being rich data sources, they have not been used much due to the difficulty in locating and analyzing datasets of interest. Geoseq (http://geoseq.mssm.edu) provides a new method of analyzing short reads from deep sequencing experiments. Instead of mapping the reads to reference genomes or sequences, Geoseq maps a reference sequence against the sequencing data. It is web-based, and holds pre-computed data from public libraries. The analysis reduces the input sequence to tiles and measures the coverage of each tile in a sequence library through the use of suffix arrays. The user can upload custom target sequences or use gene/miRNA names for the search and get back results as plots and spreadsheet files. Geoseq organizes the public sequencing data using a controlled vocabulary, allowing identification of relevant libraries by organism, tissue and type of experiment. Analysis of small sets of sequences against deep-sequencing datasets, as well as identification of public datasets of interest, is simplified by Geoseq. We applied Geoseq to (a) identify differential isoform expression in mRNA-seq datasets, (b) identify miRNAs (microRNAs) in libraries and mature and star sequences in miRNAs, and (c) identify potentially mis-annotated miRNAs. The ease of using Geoseq for these analyses suggests its utility and uniqueness as an analysis tool.
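The tiling idea, reducing the query sequence to tiles and measuring how often each tile occurs in a read library, can be sketched as below. This is not Geoseq's implementation: a plain hash-based substring index stands in for the suffix arrays named in the abstract, only exact matches are counted, and the function names, tile length, and sequences are hypothetical.

```python
from collections import Counter

def tile(reference, tile_len, step=1):
    """Split the reference into overlapping tiles of fixed length."""
    return [reference[i:i + tile_len]
            for i in range(0, len(reference) - tile_len + 1, step)]

def tile_coverage(reference, reads, tile_len):
    """Count exact occurrences of each reference tile across all reads."""
    index = Counter()
    for read in reads:
        # Index every substring of length tile_len found in the library.
        for i in range(len(read) - tile_len + 1):
            index[read[i:i + tile_len]] += 1
    return [(t, index[t]) for t in tile(reference, tile_len)]

# Made-up reference and read library.
reference = "ACGTAC"
reads = ["ACGT", "GTAC", "TTTT"]
for t, count in tile_coverage(reference, reads, tile_len=4):
    print(t, count)
```

Building the index once per library and querying it with the tiles of many references reflects the "map the reference against the data" direction described above. A suffix array serves the same lookup role with lower memory use on large libraries.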
Novel primers for complete mitochondrial cytochrome b gene sequencing in mammals

USGS Publications Warehouse

Naidu, Ashwin; Fitak, Robert R.; Munguia-Vega, Adrian; Culver, Melanie

2011-01-01

Sequence-based species identification relies on the extent and integrity of sequence data available in online databases such as GenBank. When identifying species from a sample of unknown origin, partial DNA sequences obtained from the sample are aligned against existing sequences in databases. When the sequence from the matching species is not present in the database, high-scoring alignments with closely related sequences might produce unreliable results on species identity. For species identification in mammals, the cytochrome b (cyt b) gene has been identified to be highly informative; thus, large amounts of reference sequence data from the cyt b gene are much needed. To enhance availability of cyt b gene sequence data on a large number of mammalian species in GenBank and other such publicly accessible online databases, we identified a primer pair for complete cyt b gene sequencing in mammals. Using this primer pair, we successfully PCR amplified and sequenced the complete cyt b gene from 40 of 44 mammalian species representing 10 orders of mammals. We submitted 40 complete, correctly annotated, cyt b protein coding sequences to GenBank. To our knowledge, this is the first single primer pair to amplify the complete cyt b gene in a broad range of mammalian species. This primer pair can be used for the addition of new cyt b gene sequences and to enhance data available on species represented in GenBank. The availability of novel and complete gene sequences as high-quality reference data can improve the reliability of sequence-based species identification.
Chronology of Eocene-Miocene sequences on the New Jersey shallow shelf: implications for regional, interregional, and global correlations

USGS Publications Warehouse

Browning, James V.; Miller, Kenneth G.; Sugarman, Peter J.; Barron, John; McCarthy, Francine M.G.; Kulhanek, Denise K.; Katz, Miriam E.; Feigenson, Mark D.

2013-01-01

Integrated Ocean Drilling Program Expedition 313 continuously cored and logged latest Eocene to early-middle Miocene sequences at three sites (M27, M28, and M29) on the inner-middle continental shelf offshore New Jersey, providing an opportunity to evaluate the ages, global correlations, and significance of sequence boundaries. We provide a chronology for these sequences using integrated strontium isotopic stratigraphy and biostratigraphy (primarily calcareous nannoplankton, diatoms, and dinocysts [dinoflagellate cysts]). Despite challenges posed by shallow-water sediments, age resolution is typically ±0.5 m.y. and in many sequences is as good as ±0.25 m.y. Three Oligocene sequences were sampled at Site M27 on sequence bottomsets. Fifteen early to early-middle Miocene sequences were dated at Sites M27, M28, and M29 across clinothems in topsets, foresets (where the sequences are thickest), and bottomsets. A few sequences have coarse (∼1 m.y.) or little age constraint due to barren zones; we constrain the age estimates of these less well dated sequences by applying the principle of superposition, i.e., sediments above sequence boundaries in any site are younger than the sediments below the sequence boundaries at other sites. Our age control provides constraints on the timing of deposition in the clinothem; sequences on the topsets are generally the youngest in the clinothem, whereas the bottomsets generally are the oldest. The greatest amount of time is represented on foresets, although we have no evidence for a correlative conformity. 
Our chronology provides a baseline for regional and interregional correlations and sea-level reconstructions: (1) we correlate a major increase in sedimentation rate precisely with the timing of the middle Miocene climate changes associated with the development of a permanent East Antarctic Ice Sheet; and (2) the timing of sequence boundaries matches the deep-sea oxygen isotopic record, implicating glacioeustasy as a major driver for forming sequence boundaries.
Memory for sequences of events impaired in typical aging

PubMed Central

Allen, Timothy A.; Morris, Andrea M.; Stark, Shauna M.; Fortin, Norbert J.

2015-01-01

Typical aging is associated with diminished episodic memory performance. To improve our understanding of the fundamental mechanisms underlying this age-related memory deficit, we previously developed an integrated, cross-species approach to link converging evidence from human and animal research. This novel approach focuses on the ability to remember sequences of events, an important feature of episodic memory. Unlike existing paradigms, this task is nonspatial, nonverbal, and can be used to isolate different cognitive processes that may be differentially affected in aging. Here, we used this task to make a comprehensive comparison of sequence memory performance between younger (18–22 yr) and older adults (62–86 yr). Specifically, participants viewed repeated sequences of six colored, fractal images and indicated whether each item was presented “in sequence” or “out of sequence.” Several out of sequence probe trials were used to provide a detailed assessment of sequence memory, including: (i) repeating an item from earlier in the sequence (“Repeats”; e.g., ABADEF), (ii) skipping ahead in the sequence (“Skips”; e.g., ABDDEF), and (iii) inserting an item from a different sequence into the same ordinal position (“Ordinal Transfers”; e.g., AB3DEF). We found that older adults performed as well as younger controls when tested on well-known and predictable sequences, but were severely impaired when tested using novel sequences. Importantly, overall sequence memory performance in older adults steadily declined with age, a decline not detected with other measures (RAVLT or BPS-O). We further characterized this deficit by showing that performance of older adults was severely impaired on specific probe trials that required detailed knowledge of the sequence (Skips and Ordinal Transfers), and was associated with a shift in their underlying mnemonic representation of the sequences. 
Collectively, these findings provide unambiguous evidence that the capacity to remember sequences of events is fundamentally affected by typical aging. PMID:25691514

Theta oscillations promote temporal sequence learning.

PubMed

Crivelli-Decker, Jordan; Hsieh, Liang-Tien; Clarke, Alex; Ranganath, Charan

2018-05-17

Many theoretical models suggest that neural oscillations play a role in learning or retrieval of temporal sequences, but the extent to which oscillations support sequence representation remains unclear. To address this question, we used scalp electroencephalography (EEG) to examine oscillatory activity over learning of different object sequences. Participants made semantic decisions on each object as they were presented in a continuous stream. For three "Consistent" sequences, the order of the objects was always fixed. Activity during Consistent sequences was compared to "Random" sequences that consisted of the same objects presented in a different order on each repetition. Over the course of learning, participants made faster semantic decisions to objects in Consistent, as compared to objects in Random sequences. Thus, participants were able to use sequence knowledge to predict upcoming items in Consistent sequences. EEG analyses revealed decreased oscillatory power in the theta (4-7 Hz) band at frontal sites following decisions about objects in Consistent sequences, as compared with objects in Random sequences. The theta power difference between Consistent and Random only emerged in the second half of the task, as participants were more effectively able to predict items in Consistent sequences. Moreover, we found increases in parieto-occipital alpha (10-13 Hz) and beta (14-28 Hz) power during the pre-response period for objects in Consistent sequences, relative to objects in Random sequences. Linear mixed effects modeling revealed that single trial theta oscillations were related to reaction time for future objects in a sequence, whereas beta and alpha oscillations were only predictive of reaction time on the current trial. These results indicate that theta and alpha/beta activity preferentially relate to future and current events, respectively. 
More generally our findings highlight the importance of band-specific neural oscillations in the learning of temporal order information. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.
Application of Modified Spin-Echo–based Sequences for Hepatic MR Elastography: Evaluation, Comparison with the Conventional Gradient-Echo Sequence, and Preliminary Clinical Experience

PubMed Central

Mariappan, Yogesh K.; Dzyubak, Bogdan; Glaser, Kevin J.; Venkatesh, Sudhakar K.; Sirlin, Claude B.; Hooker, Jonathan; McGee, Kiaran P.

2017-01-01

Purpose: To (a) evaluate modified spin-echo (SE) magnetic resonance (MR) elastographic sequences for acquiring MR images with improved signal-to-noise ratio (SNR) in patients in whom the standard gradient-echo (GRE) MR elastographic sequence yields low hepatic signal intensity and (b) compare the stiffness values obtained with these sequences with those obtained with the conventional GRE sequence. Materials and Methods: This HIPAA-compliant retrospective study was approved by the institutional review board; the requirement to obtain informed consent was waived. Data obtained with modified SE and SE echo-planar imaging (EPI) MR elastographic pulse sequences with short echo times were compared with those obtained with the conventional GRE MR elastographic sequence in two patient cohorts, one that exhibited adequate liver signal intensity and one that exhibited low liver signal intensity. Shear stiffness values obtained with the three sequences in 130 patients with successful GRE-based examinations were retrospectively tested for statistical equivalence by using a 5% margin. In 47 patients in whom GRE examinations were considered to have failed because of low SNR, the SNR and confidence level with the SE-based sequences were compared with those with the GRE sequence. Results: The results of this study helped confirm the equivalence of SE MR elastography and SE-EPI MR elastography to GRE MR elastography (P = .0212 and P = .0001, respectively). The SE and SE-EPI MR elastographic sequences provided substantially improved SNR and stiffness inversion confidence level in 47 patients in whom GRE MR elastography had failed. Conclusion: Modified SE-based MR elastographic sequences provide higher SNR MR elastographic data and reliable stiffness measurements; thus, they enable quantification of stiffness in patients in whom the conventional GRE MR elastographic sequence failed owing to low signal intensity. 
The equivalence of the three sequences indicates that the current diagnostic thresholds are applicable to SE MR elastographic sequences for assessing liver fibrosis. © RSNA, 2016 PMID:27509543
(Pea)nuts and bolts of visual narrative: Structure and meaning in sequential image comprehension

PubMed Central

Cohn, Neil; Paczynski, Martin; Jackendoff, Ray; Holcomb, Phillip J.; Kuperberg, Gina R.

2012-01-01

Just as syntax differentiates coherent sentences from scrambled word strings, the comprehension of sequential images must also use a cognitive system to distinguish coherent narrative sequences from random strings of images. We conducted experiments analogous to two classic studies of language processing to examine the contributions of narrative structure and semantic relatedness to processing sequential images. We compared four types of comic strips: 1) Normal sequences with both structure and meaning, 2) Semantic Only sequences (in which the panels were related to a common semantic theme, but had no narrative structure), 3) Structural Only sequences (narrative structure but no semantic relatedness), and 4) Scrambled sequences of randomly-ordered panels. In Experiment 1, participants monitored for target panels in sequences presented panel-by-panel. Reaction times were slowest to panels in Scrambled sequences, intermediate in both Structural Only and Semantic Only sequences, and fastest in Normal sequences. This suggests that both semantic relatedness and narrative structure offer advantages to processing. Experiment 2 measured ERPs to all panels across the whole sequence. The N300/N400 was largest to panels in both the Scrambled and Structural Only sequences, intermediate in Semantic Only sequences and smallest in the Normal sequences. This implies that a combination of narrative structure and semantic relatedness can facilitate semantic processing of upcoming panels (as reflected by the N300/N400). Also, panels in the Scrambled sequences evoked a larger left-lateralized anterior negativity than panels in the Structural Only sequences. This localized effect was distinct from the N300/N400, and appeared despite the fact that these two sequence types were matched on local semantic relatedness between individual panels. These findings suggest that sequential image comprehension uses a narrative structure that may be independent of semantic relatedness. 
Altogether, we argue that the comprehension of visual narrative is guided by an interaction between structure and meaning. PMID:22387723
A robust and cost-effective approach to sequence and analyze complete genomes of small RNA viruses

USDA-ARS's Scientific Manuscript database

Background: Next-generation sequencing (NGS) allows ultra-deep sequencing of nucleic acids. The use of sequence-independent amplification of viral nucleic acids without utilization of target-specific primers provides advantages over traditional sequencing methods and allows detection of unsuspected ...
Genome Wide Characterization of Simple Sequence Repeats in Cucumber

USDA-ARS's Scientific Manuscript database

The whole genome sequence of the cucumber cultivar Gy14 was recently sequenced at 15× coverage with the Roche 454 Titanium technology. The microsatellite DNA sequences (simple sequence repeats, SSRs) in the assembled scaffolds were computationally explored and characterized. A total of 112,073 SSRs ...
Ion Torrent Semiconductor Sequencing Allows Rapid, Low Cost Sequencing of the Human Exome (7th Annual SFAF Meeting, 2012)

ScienceCinema

Jenkins, David

2018-01-10

David Jenkins on "Ion Torrent semiconductor sequencing allows rapid, low-cost sequencing of the human exome" at the 2012 Sequencing, Finishing, Analysis in the Future Meeting held June 5-7, 2012 in Santa Fe, New Mexico.
Ion Torrent Semiconductor Sequencing Allows Rapid, Low Cost Sequencing of the Human Exome (7th Annual SFAF Meeting, 2012)

DOE Office of Scientific and Technical Information (OSTI.GOV)

Jenkins, David

David Jenkins on "Ion Torrent semiconductor sequencing allows rapid, low-cost sequencing of the human exome" at the 2012 Sequencing, Finishing, Analysis in the Future Meeting held June 5-7, 2012 in Santa Fe, New Mexico.
Genome sequence of Phytophthora ramorum: implications for management

Treesearch

Brett Tyler; Sucheta Tripathy; Nik Grunwald; Kurt Lamour; Kelly Ivors; Matteo Garbelotto; Daniel Rokhsar; Nik Putnam; Igor Grigoriev; Jeffrey Boore

2006-01-01

A draft genome sequence has been determined for Phytophthora ramorum, together with a draft sequence of the soybean pathogen Phytophthora sojae. The P. ramorum genome was sequenced to a depth of 7-fold coverage, while the P. sojae genome was sequenced to a depth of 9-fold coverage. The genome...
Teaching Task Sequencing via Verbal Mediation.

ERIC Educational Resources Information Center

Rusch, Frank R.; And Others

1987-01-01

Verbal sequence training was used to teach a moderately mentally retarded woman to sequence job-related tasks. Learning to say the tasks in the proper sequence resulted in the employee performing her tasks in that sequence, and the employee was capable of mediating her own work behavior when scheduled changes occurred. (Author/JDD)
Sequencing Adventure Activities: A New Perspective.

ERIC Educational Resources Information Center

Bisson, Christian

Sequencing in adventure education involves putting activities in an order appropriate to the needs of the group. Contrary to the common assumption that each adventure sequence is unique, a review of literature concerning five sequencing models reveals a certain universality. These models present sequences that move through four phases: group…
Application of population sequencing (POPSEQ) for ordering and inputting genotyping-by-sequencing markers in hexaploid wheat

USDA-ARS's Scientific Manuscript database

The advancement of next-generation sequencing technologies in conjunction with new bioinformatics tools enabled fine-tuning of sequence-based high resolution mapping strategies for complex genomes. Although genotyping-by-sequencing (GBS) provides a large number of markers, its application for assoc...
77 FR 65537 - Requirements for Patent Applications Containing Nucleotide Sequence and/or Amino Acid Sequence...

Federal Register 2010, 2011, 2012, 2013, 2014

2012-10-29

... DEPARTMENT OF COMMERCE Patent and Trademark Office Requirements for Patent Applications Containing Nucleotide Sequence and/or Amino Acid Sequence Disclosures ACTION: Proposed collection; comment request... Patent applications that contain nucleotide and/or amino acid sequence disclosures must include a copy of...
A Glance at Microsatellite Motifs from 454 Sequencing Reads of Watermelon Genomic DNA

USDA-ARS's Scientific Manuscript database

A single 454 (Life Sciences Sequencing Technology) run of Charleston Gray watermelon (Citrullus lanatus var. lanatus) genomic DNA was performed and sequence data were assembled. A large scale identification of simple sequence repeat (SSR) was performed and SSR sequence data were used for the develo...
Multi-platform next-generation sequencing of the domestic turkey (Meleagris gallopavo) genome assembly and analysis

USDA-ARS's Scientific Manuscript database

Next-generation sequencing technologies were used to rapidly and efficiently sequence the genome of the domestic turkey (Meleagris gallopavo). The current genome assembly (~1.1 Gb) includes 917 Mb of sequence assigned to chromosomes. Innate heterozygosity of the sequenced bird allowed discovery of...
Bellerophon: A program to detect chimeric sequences in multiple sequence alignments

DOE Office of Scientific and Technical Information (OSTI.GOV)

Huber, Thomas; Faulkner, Geoffrey; Hugenholtz, Philip

2003-12-23

Bellerophon is a program for detecting chimeric sequences in multiple sequence datasets by an adaption of partial treeing analysis. Bellerophon was specifically developed to detect 16S rRNA gene chimeras in PCR-clone libraries of environmental samples but can be applied to other nucleotide sequence alignments.
A Code Division Multiple Access Communication System for the Low Frequency Band.

DTIC Science & Technology

1983-04-01

frequency channels; spread-spectrum communication; complex sequences; orthogonal codes; impulsive noise ... their transmissions with signature sequences. Our LF/CDMA scheme is different in that each user’s signature sequence set consists of M orthogonal sequences and thus log2 M ...
A vision for ubiquitous sequencing

PubMed Central

Erlich, Yaniv

2015-01-01

Genomics has recently celebrated reaching the $1000 genome milestone, making affordable DNA sequencing a reality. With this goal successfully completed, the next goal of the sequencing revolution can be sequencing sensors—miniaturized sequencing devices that are manufactured for real-time applications and deployed in large quantities at low costs. The first part of this manuscript envisions applications that will benefit from moving the sequencers to the samples in a range of domains. In the second part, the manuscript outlines the critical barriers that need to be addressed in order to reach the goal of ubiquitous sequencing sensors. PMID:26430149
Data compression of discrete sequence: A tree based approach using dynamic programming

NASA Technical Reports Server (NTRS)

Shivaram, Gurusrasad; Seetharaman, Guna; Rao, T. R. N.

1994-01-01

A dynamic programming based approach for data compression of a 1D sequence is presented. The compression of an input sequence of size N to that of a smaller size k is achieved by dividing the input sequence into k subsequences and replacing the subsequences by their respective average values. The partitioning of the input sequence is carried out with the intention of reducing the mean squared error in the reconstructed sequence. The complexity involved in finding the partitions which would result in such an optimal compressed sequence is reduced by using the dynamic programming approach, which is presented.
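The partitioning step described in this abstract can be sketched as a standard dynamic program over segment boundaries. The version below is an illustrative reading, not the report's exact formulation: it assumes contiguous segments, minimizes the total squared reconstruction error (equivalent to minimizing the mean squared error for fixed N), and runs in O(k·N²) time; the function name and return layout are hypothetical.

```python
def compress_by_segment_means(seq, k):
    """Split `seq` into k contiguous segments minimizing total squared error.

    Returns (bounds, means, error): half-open (start, end) index pairs,
    the mean of each segment, and the total squared reconstruction error.
    """
    n = len(seq)
    if not 1 <= k <= n:
        raise ValueError("k must satisfy 1 <= k <= len(seq)")

    # Prefix sums of values and squared values give O(1) segment costs.
    s1, s2 = [0.0], [0.0]
    for x in seq:
        s1.append(s1[-1] + x)
        s2.append(s2[-1] + x * x)

    def sse(i, j):
        """Squared error of replacing seq[i:j] by its mean."""
        total = s1[j] - s1[i]
        return (s2[j] - s2[i]) - total * total / (j - i)

    inf = float("inf")
    # cost[m][j]: best error for the first j items using m segments.
    cost = [[inf] * (n + 1) for _ in range(k + 1)]
    cut = [[0] * (n + 1) for _ in range(k + 1)]
    cost[0][0] = 0.0
    for m in range(1, k + 1):
        for j in range(m, n + 1):
            for i in range(m - 1, j):  # last segment is seq[i:j]
                candidate = cost[m - 1][i] + sse(i, j)
                if candidate < cost[m][j]:
                    cost[m][j] = candidate
                    cut[m][j] = i

    # Walk the stored cut points backwards to recover the segments.
    bounds, j = [], n
    for m in range(k, 0, -1):
        i = cut[m][j]
        bounds.append((i, j))
        j = i
    bounds.reverse()
    means = [(s1[j] - s1[i]) / (j - i) for i, j in bounds]
    return bounds, means, cost[k][n]


bounds, means, error = compress_by_segment_means([1, 1, 1, 5, 5, 9, 9, 9], 3)
```

In this toy input the three plateaus are recovered exactly, so the compressed form is the three means plus the segment boundaries.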
Whole-genome sequencing for comparative genomics and de novo genome assembly.

PubMed

Benjak, Andrej; Sala, Claudia; Hartkoorn, Ruben C

2015-01-01

Next-generation sequencing technologies for whole-genome sequencing of mycobacteria are rapidly becoming an attractive alternative to more traditional sequencing methods. In particular this technology is proving useful for genome-wide identification of mutations in mycobacteria (comparative genomics) as well as for de novo assembly of whole genomes. Next-generation sequencing however generates a vast quantity of data that can only be transformed into a usable and comprehensible form using bioinformatics. Here we describe the methodology one would use to prepare libraries for whole-genome sequencing, and the basic bioinformatics to identify mutations in a genome following Illumina HiSeq or MiSeq sequencing, as well as de novo genome assembly following sequencing using Pacific Biosciences (PacBio).
Library construction for next-generation sequencing: Overviews and challenges

PubMed Central

Head, Steven R.; Komori, H. Kiyomi; LaMere, Sarah A.; Whisenant, Thomas; Van Nieuwerburgh, Filip; Salomon, Daniel R.; Ordoukhanian, Phillip

2014-01-01

High-throughput sequencing, also known as next-generation sequencing (NGS), has revolutionized genomic research. In recent years, NGS technology has steadily improved, with costs dropping and the number and range of sequencing applications increasing exponentially. Here, we examine the critical role of sequencing library quality and consider important challenges when preparing NGS libraries from DNA and RNA sources. Factors such as the quantity and physical characteristics of the RNA or DNA source material as well as the desired application (i.e., genome sequencing, targeted sequencing, RNA-seq, ChIP-seq, RIP-seq, and methylation) are addressed in the context of preparing high quality sequencing libraries. In addition, the current methods for preparing NGS libraries from single cells are also discussed. PMID:24502796

PCR Amplification Strategies towards full-length HIV-1 Genome sequencing.

PubMed

Liu, Chao Chun; Ji, Hezhao

2018-06-26

The advent of next generation sequencing has enabled greater resolution of viral diversity and improved feasibility of full viral genome sequencing allowing routine HIV-1 full genome sequencing in both research and diagnostic settings. Regardless of the sequencing platform selected, successful PCR amplification of the HIV-1 genome is essential for sequencing template preparation. As such, full HIV-1 genome amplification is a crucial step in dictating the successful and reliable sequencing downstream. Here we reviewed existing PCR protocols leading to HIV-1 full genome sequencing. In addition to the discussion on basic considerations on relevant PCR design, the advantages as well as the pitfalls of published protocols were reviewed. Copyright© Bentham Science Publishers; For any queries, please email at epub@benthamscience.org.
RDNAnalyzer: A tool for DNA secondary structure prediction and sequence analysis.

PubMed

Afzal, Muhammad; Shahid, Ahmad Ali; Shehzadi, Abida; Nadeem, Shahid; Husnain, Tayyab

2012-01-01

RDNAnalyzer is an innovative computer-based tool designed for DNA secondary structure prediction and sequence analysis. It can randomly generate a DNA sequence, or the user can upload sequences of interest in RAW format. It uses and extends the Nussinov dynamic programming algorithm and has various applications in sequence analysis. It predicts the DNA secondary structure and base pairings. It also provides tools for sequence analyses routinely performed by biological scientists, such as DNA replication, reverse complement generation, transcription, translation, sequence-specific information (total number of nucleotide bases and ATGC base contents along with their respective percentages), and a sequence cleaner. RDNAnalyzer is a unique tool developed in Microsoft Visual Studio 2008 using Microsoft Visual C# and Windows Presentation Foundation, and it provides a user-friendly environment for sequence analysis. It is freely available at http://www.cemb.edu.pk/sw.html. Abbreviations: RDNAnalyzer - Random DNA Analyser, GUI - Graphical user interface, XAML - Extensible Application Markup Language.
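For readers unfamiliar with the Nussinov algorithm named in this abstract, the following Python sketch shows its textbook form: a dynamic program that maximizes the number of complementary base pairs. It is not RDNAnalyzer's code (the tool is written in C# and is said to extend the algorithm); the Watson-Crick pairing set for DNA and the `min_loop` parameter are assumptions made for illustration.

```python
def nussinov_max_pairs(seq, min_loop=3):
    """Maximum number of nested complementary pairs in a DNA string.

    `min_loop` is the minimum number of unpaired bases required between
    two paired positions (an assumed hairpin constraint).
    """
    pairs = {("A", "T"), ("T", "A"), ("G", "C"), ("C", "G")}
    n = len(seq)
    if n == 0:
        return 0
    # dp[i][j]: best pair count for the subsequence seq[i..j] inclusive.
    dp = [[0] * n for _ in range(n)]
    for span in range(min_loop + 1, n):
        for i in range(n - span):
            j = i + span
            best = dp[i][j - 1]  # case 1: position j stays unpaired
            # case 2: position j pairs with some t, splitting the interval
            for t in range(i, j - min_loop):
                if (seq[t], seq[j]) in pairs:
                    left = dp[i][t - 1] if t > i else 0
                    best = max(best, left + 1 + dp[t + 1][j - 1])
            dp[i][j] = best
    return dp[0][n - 1]


hairpin_pairs = nussinov_max_pairs("GGGAAACCC")
```

Recovering the actual pairings would need a traceback over `dp`, which is omitted here to keep the sketch short.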
Long sequence correlation coprocessor

NASA Astrophysics Data System (ADS)

Gage, Douglas W.

1994-09-01

A long sequence correlation coprocessor (LSCC) accelerates the bitwise correlation of arbitrarily long digital sequences by calculating in parallel the correlation score for 16, for example, adjacent bit alignments between two binary sequences. The LSCC integrated circuit is incorporated into a computer system with memory storage buffers and a separate general purpose computer processor which serves as its controller. Each of the LSCC's set of sequential counters simultaneously tallies a separate correlation coefficient. During each LSCC clock cycle, computer enable logic associated with each counter compares one bit of a first sequence with one bit of a second sequence to increment the counter if the bits are the same. A shift register assures that the same bit of the first sequence is simultaneously compared to different bits of the second sequence to simultaneously calculate the correlation coefficient by the different counters to represent different alignments of the two sequences.
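A software analogue of the counting scheme described here may help make it concrete. The sketch below is an assumption-laden illustration, not the patented circuit: each entry of `counters` plays the role of one hardware counter, the outer loop stands in for clock cycles, and the inner loop (which the hardware would perform in parallel) compares the same bit of the first sequence with differently shifted bits of the second.

```python
def correlation_scores(first, second, n_alignments=16):
    """Count matching bits between `first` and `second` at adjacent offsets.

    counters[k] is the number of positions i where first[i] == second[i + k].
    """
    if len(second) < len(first) + n_alignments - 1:
        raise ValueError("second sequence too short for the requested alignments")
    counters = [0] * n_alignments
    for i, bit in enumerate(first):        # one simulated clock cycle per bit
        for k in range(n_alignments):      # done in parallel in hardware
            if bit == second[i + k]:
                counters[k] += 1
    return counters


scores = correlation_scores([1, 0, 1, 1], [0, 1, 0, 1, 1, 0], n_alignments=3)
```

Here the offset of 1 yields a perfect score of 4, because the first sequence appears verbatim in the second starting at index 1.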
It’s More Than Stamp Collecting: How Genome Sequencing Can Unify Biological Research

PubMed Central

Richards, Stephen

2015-01-01

The availability of reference genome sequences, especially the human reference, has revolutionized the study of biology. However, whilst the genomes of some species have been fully sequenced, a wide range of biological problems still cannot be effectively studied for lack of genome sequence information. Here, I identify neglected areas of biology and describe how both targeted species sequencing and more broad taxonomic surveys of the tree of life can address important biological questions. I enumerate the significant benefits that would accrue from sequencing a broader range of taxa, as well as discuss the technical advances in sequencing and assembly methods that would allow for wide-ranging application of whole-genome analysis. Finally, I suggest that in addition to “Big Science” survey initiatives to sequence the tree of life, a modified infrastructure-funding paradigm would better support reference genome sequence generation for research communities most in need. PMID:26003218
It's more than stamp collecting: how genome sequencing can unify biological research.

PubMed

Richards, Stephen

2015-07-01

The availability of reference genome sequences, especially the human reference, has revolutionized the study of biology. However, while the genomes of some species have been fully sequenced, a wide range of biological problems still cannot be effectively studied for lack of genome sequence information. Here, I identify neglected areas of biology and describe how both targeted species sequencing and more broad taxonomic surveys of the tree of life can address important biological questions. I enumerate the significant benefits that would accrue from sequencing a broader range of taxa, as well as discuss the technical advances in sequencing and assembly methods that would allow for wide-ranging application of whole-genome analysis. Finally, I suggest that in addition to 'big science' survey initiatives to sequence the tree of life, a modified infrastructure-funding paradigm would better support reference genome sequence generation for research communities most in need. Copyright © 2015 Elsevier Ltd. All rights reserved.
Sequence information signal processor

DOEpatents

Peterson, John C.; Chow, Edward T.; Waterman, Michael S.; Hunkapillar, Timothy J.

1999-01-01

An electronic circuit is used to compare two sequences, such as genetic sequences, to determine which alignment of the sequences produces the greatest similarity. The circuit includes a linear array of series-connected processors, each of which stores a single element from one of the sequences and compares that element with each successive element in the other sequence. For each comparison, the processor generates a scoring parameter that indicates which segment ending at those two elements produces the greatest degree of similarity between the sequences. The processor uses the scoring parameter to generate a similar scoring parameter for a comparison between the stored element and the next successive element from the other sequence. The processor also delivers the scoring parameter to the next processor in the array for use in generating a similar scoring parameter for another pair of elements. The electronic circuit determines which processor and alignment of the sequences produce the scoring parameter with the highest value.
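The per-element scoring described in this abstract resembles a local-alignment recurrence, in which each cell holds the best score of any aligned segment ending at that pair of elements. The sketch below uses the Smith-Waterman recurrence as a stand-in; the abstract does not name the recurrence, and the match, mismatch, and gap values are arbitrary assumptions. One sequence is held fixed (as if stored one element per processor) while the other is streamed past it.

```python
def best_local_alignment_score(stored, streamed, match=2, mismatch=-1, gap=-1):
    """Highest local-alignment score between two sequences (score only)."""
    prev = [0] * (len(stored) + 1)  # scores for the previous streamed element
    best = 0
    for y in streamed:              # each streamed element visits every "processor"
        cur = [0]
        for i, x in enumerate(stored, start=1):
            diagonal = prev[i - 1] + (match if x == y else mismatch)
            score = max(0, diagonal, prev[i] + gap, cur[i - 1] + gap)
            cur.append(score)
            best = max(best, score)
        prev = cur
    return best


shared_segment_score = best_local_alignment_score("TTACGTT", "GGACGGG")
```

In the toy call the only shared segment is `ACG`, so the best score is three matches at 2 points each.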
Identification of Sequence Specificity of 5-Methylcytosine Oxidation by Tet1 Protein with High-Throughput Sequencing.

PubMed

Kizaki, Seiichiro; Chandran, Anandhakumar; Sugiyama, Hiroshi

2016-03-02

Tet (ten-eleven translocation) family proteins have the ability to oxidize 5-methylcytosine (mC) to 5-hydroxymethylcytosine (hmC), 5-formylcytosine (fC), and 5-carboxycytosine (caC). However, the oxidation reaction of Tet is not understood completely. Evaluation of genomic-level epigenetic changes by Tet protein requires unbiased identification of the highly selective oxidation sites. In this study, we used high-throughput sequencing to investigate the sequence specificity of mC oxidation by Tet1. A 6.6×10^4-member mC-containing random DNA-sequence library was constructed. The library was subjected to Tet-reactive pulldown followed by high-throughput sequencing. Analysis of the obtained sequence data identified the Tet1-reactive sequences. We identified mCpG as a highly reactive sequence of Tet1 protein. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Rapid Threat Organism Recognition Pipeline

DOE Office of Scientific and Technical Information (OSTI.GOV)

Williams, Kelly P.; Solberg, Owen D.; Schoeniger, Joseph S.

2013-05-07

The RAPTOR computational pipeline identifies microbial nucleic acid sequences present in sequence data from clinical samples. It takes as input raw short-read genomic sequence data (in particular, the type generated by the Illumina sequencing platforms) and outputs taxonomic evaluation of detected microbes in various human-readable formats. This software was designed to assist in the diagnosis or characterization of infectious disease, by detecting pathogen sequences in nucleic acid sequence data from clinical samples. It has also been applied in the detection of algal pathogens, when algal biofuel ponds became unproductive. RAPTOR first trims and filters genomic sequence reads based on quality and related considerations, then performs a quick alignment to the human (or other host) genome to filter out host sequences, then performs a deeper search against microbial genomes. Alignment to a protein sequence database is optional. Alignment results are summarized and placed in a taxonomic framework using the Lowest Common Ancestor algorithm.
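The final step mentioned in this abstract, placing alignment results in a taxonomy with a Lowest Common Ancestor rule, can be sketched in a few lines of Python. This is not RAPTOR's code: the child-to-parent dictionary, the function names, and the small toy taxonomy are assumptions for illustration, and a single shared root is assumed.

```python
def lineage(taxon, parent):
    """Path from `taxon` up to the root, following child -> parent links."""
    path = [taxon]
    while taxon in parent:
        taxon = parent[taxon]
        path.append(taxon)
    return path


def lowest_common_ancestor(taxa, parent):
    """Most specific taxon that is an ancestor of (or equal to) every input taxon."""
    taxa = list(taxa)
    if not taxa:
        raise ValueError("at least one taxon is required")
    common = lineage(taxa[0], parent)  # ordered from most to least specific
    for taxon in taxa[1:]:
        ancestors = set(lineage(taxon, parent))
        common = [node for node in common if node in ancestors]
    return common[0]


# Toy taxonomy (hypothetical, heavily abbreviated).
toy_parent = {
    "Escherichia coli": "Escherichia",
    "Escherichia albertii": "Escherichia",
    "Escherichia": "Enterobacteriaceae",
    "Salmonella enterica": "Salmonella",
    "Salmonella": "Enterobacteriaceae",
    "Enterobacteriaceae": "Bacteria",
}
read_assignment = lowest_common_ancestor(
    ["Escherichia coli", "Salmonella enterica"], toy_parent
)
```

A read that aligns equally well to both species is thus reported at the family level rather than being assigned arbitrarily to one of them.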
Next-Generation Sequencing Platforms

NASA Astrophysics Data System (ADS)

Mardis, Elaine R.

2013-06-01

Automated DNA sequencing instruments embody an elegant interplay among chemistry, engineering, software, and molecular biology and have built upon Sanger's founding discovery of dideoxynucleotide sequencing to perform once-unfathomable tasks. Combined with innovative physical mapping approaches that helped to establish long-range relationships between cloned stretches of genomic DNA, fluorescent DNA sequencers produced reference genome sequences for model organisms and for the reference human genome. New types of sequencing instruments that permit amazing acceleration of data-collection rates for DNA sequencing have been developed. The ability to generate genome-scale data sets is now transforming the nature of biological inquiry. Here, I provide an historical perspective of the field, focusing on the fundamental developments that predated the advent of next-generation sequencing instruments and providing information about how these instruments work, their application to biological research, and the newest types of sequencers that can extract data from single DNA molecules.
Contributions from associative and explicit sequence knowledge to the execution of discrete keying sequences.

PubMed

Verwey, Willem B

2015-05-01

Research has provided many indications that highly practiced 6-key sequences are carried out in a chunking mode in which key-specific stimuli past the first are largely ignored. When in such sequences a deviating stimulus occasionally occurs at an unpredictable location, participants fall back to responding to individual stimuli (Verwey & Abrahamse, 2012). The observation that in such a situation execution still benefits from prior practice has been attributed to the possibility to operate in an associative mode. To better understand the contribution to the execution of keying sequences of motor chunks, associative sequence knowledge and also of explicit sequence knowledge, the present study tested three alternative accounts for the earlier finding of an execution rate increase at the end of 6-key sequences performed in the associative mode. The results provide evidence that the earlier observed execution rate increase can be attributed to the use of explicit sequence knowledge. In the present experiment this benefit was limited to sequences that are executed at the moderately fast rates of the associative mode, and occurred at both the earlier and final elements of the sequences. Copyright © 2015 Elsevier B.V. All rights reserved.
A Case Study into Microbial Genome Assembly Gap Sequences and Finishing Strategies

DOE Office of Scientific and Technical Information (OSTI.GOV)

Utturkar, Sagar M.; Klingeman, Dawn M.; Hurt, Jr., Richard A.

This study characterized regions of DNA which remained unassembled by either PacBio or Illumina sequencing technologies for seven bacterial genomes. Two genomes were manually finished using bioinformatics and PCR/Sanger sequencing approaches and regions not assembled by automated software were analyzed. Gaps present within Illumina assemblies mostly correspond to repetitive DNA regions such as multiple rRNA operon sequences. PacBio gap sequences were evaluated for several properties such as GC content, read coverage, gap length, ability to form strong secondary structures, and corresponding annotations. Our hypothesis that strong secondary DNA structures blocked DNA polymerases and contributed to gap sequences was not accepted. PacBio assemblies had few limitations overall and gaps were explained as the cumulative effect of lower than average sequence coverage and repetitive sequences at contig termini. An important aspect of the present study is the compilation of biological features that interfered with assembly and included active transposons, multiple plasmid sequences, phage DNA integration, and large sequence duplication. Furthermore, our targeted genome finishing approach and systematic evaluation of the unassembled DNA will be useful for others looking to close, finish, and polish microbial genome sequences.
Value of a single-shot turbo spin-echo pulse sequence for assessing the architecture of the subarachnoid space and the constitutive nature of cerebrospinal fluid.

PubMed

Pease, Anthony; Sullivan, Stacey; Olby, Natasha; Galano, Heather; Cerda-Gonzalez, Sophia; Robertson, Ian D; Gavin, Patrick; Thrall, Donald

2006-01-01

Three case history reports are presented to illustrate the value of the single-shot turbo spin-echo pulse sequence for assessment of the subarachnoid space. The use of the single-shot turbo spin-echo pulse sequence, which is a heavily T2-weighted sequence, allows for a rapid, noninvasive evaluation of the subarachnoid space by using the high signal from cerebrospinal fluid. This sequence can be completed in seconds rather than the several minutes required for a T2-fast spin-echo sequence. Unlike the standard T2-fast spin-echo sequence, a single-shot turbo spin-echo pulse sequence also provides qualitative information about the protein and the cellular content of the cerebrospinal fluid, such as in patients with inflammatory debris or hemorrhage in the cerebrospinal fluid. Although the resolution of the single-shot turbo spin-echo pulse sequence images is relatively poor compared with more conventional sequences, the qualitative information about the subarachnoid space and cerebrospinal fluid and the rapid acquisition time, make it a useful sequence to include in standard protocols of spinal magnetic resonance imaging.
Protein sequence annotation in the genome era: the annotation concept of SWISS-PROT+TREMBL.

PubMed

Apweiler, R; Gateau, A; Contrino, S; Martin, M J; Junker, V; O'Donovan, C; Lang, F; Mitaritonna, N; Kappus, S; Bairoch, A

1997-01-01

SWISS-PROT is a curated protein sequence database which strives to provide a high level of annotation, a minimal level of redundancy and a high level of integration with other databases. Ongoing genome sequencing projects have dramatically increased the number of protein sequences to be incorporated into SWISS-PROT. Since we do not want to dilute the quality standards of SWISS-PROT by incorporating sequences without proper sequence analysis and annotation, we cannot speed up the incorporation of new incoming data indefinitely. However, as we also want to make the sequences available as fast as possible, we introduced TREMBL (TRanslation of EMBL nucleotide sequence database), a supplement to SWISS-PROT. TREMBL consists of computer-annotated entries in SWISS-PROT format derived from the translation of all coding sequences (CDS) in the EMBL nucleotide sequence database, except for CDS already included in SWISS-PROT. While TREMBL is already of immense value, its computer-generated annotation does not match the quality of SWISS-PROT's. The main difference is in the protein functional information attached to sequences. With this in mind, we are dedicating substantial effort to develop and apply computer methods to enhance the functional information attached to TREMBL entries.
rpoB Gene Sequencing for Identification of Corynebacterium Species

PubMed Central

Khamis, Atieh; Raoult, Didier; La Scola, Bernard

2004-01-01

The genus Corynebacterium is a heterogeneous group of species comprising human and animal pathogens and environmental bacteria. It is defined on the basis of several phenotypic characters and the results of DNA-DNA relatedness and, more recently, 16S rRNA gene sequencing. However, the 16S rRNA gene is not polymorphic enough to ensure reliable phylogenetic studies and needs to be completely sequenced for accurate identification. The almost complete rpoB sequences of 56 Corynebacterium species were determined by both PCR and genome walking methods. In all cases the percent similarities between different species were lower than those observed by 16S rRNA gene sequencing, even for those species with degrees of high similarity. Several clusters supported by high bootstrap values were identified. In order to propose a method for strain identification which does not require sequencing of the complete rpoB sequence (approximately 3,500 bp), we identified an area with a high degree of polymorphism, bordered by conserved sequences that can be used as universal primers for PCR amplification and sequencing. The sequence of this fragment (434 to 452 bp) allows accurate species identification and may be used in the future for routine sequence-based identification of Corynebacterium species. PMID:15364970
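The percent-similarity comparison described in the record above can be illustrated with a minimal sketch. It assumes two sequences that have already been aligned to the same length, with gaps written as "-"; the function name `percent_identity` and the example strings are hypothetical and are not taken from the study.

```python
def percent_identity(seq_a, seq_b):
    """Percent identity over aligned columns where neither sequence has a gap."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    matches = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a == "-" or b == "-":
            continue  # skip gap columns (an assumption of this sketch)
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable positions")
    return 100.0 * matches / compared


# Hypothetical 8-column alignment with one mismatch.
print(percent_identity("ACGTACGT", "ACGTTCGT"))
```

How gap columns are counted varies between tools, so values from this sketch may differ slightly from those reported by a given alignment program.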
A safe an easy method for building consensus HIV sequences from 454 massively parallel sequencing data.

PubMed

Fernández-Caballero Rico, Jose Ángel; Chueca Porcuna, Natalia; Álvarez Estévez, Marta; Mosquera Gutiérrez, María Del Mar; Marcos Maeso, María Ángeles; García, Federico

2018-02-01

To show how to generate a consensus sequence from the information of massive parallel sequences data obtained from routine HIV anti-retroviral resistance studies, and that may be suitable for molecular epidemiology studies. Paired Sanger (Trugene-Siemens) and next-generation sequencing (NGS) (454 GSJunior-Roche) HIV RT and protease sequences from 62 patients were studied. NGS consensus sequences were generated using Mesquite, using 10%, 15%, and 20% thresholds. Molecular evolutionary genetics analysis (MEGA) was used for phylogenetic studies. At a 10% threshold, NGS-Sanger sequences from 17/62 patients were phylogenetically related, with a median bootstrap value of 88% (IQR 83.5-95.5). Association increased to 36/62 sequences, with a median bootstrap value of 94% (IQR 85.5-98), using a 15% threshold. Maximum association was at the 20% threshold, with 61/62 sequences associated, and a median bootstrap value of 99% (IQR 98-100). A safe method is presented to generate consensus sequences from HIV-NGS data at 20% threshold, which will prove useful for molecular epidemiological studies. Copyright © 2016 Elsevier España, S.L.U. and Sociedad Española de Enfermedades Infecciosas y Microbiología Clínica. All rights reserved.
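One way to read the thresholds in the record above is sketched below. It assumes that, at each aligned position, every base whose frequency reaches the threshold is kept, and that positions with more than one kept base are written as an IUPAC ambiguity code. This interpretation, the function name `threshold_consensus`, and the toy reads are assumptions made for illustration; the abstract does not describe how Mesquite was configured.

```python
from collections import Counter

# Standard IUPAC nucleotide ambiguity codes, keyed by the set of bases they represent.
IUPAC = {
    frozenset("A"): "A", frozenset("C"): "C", frozenset("G"): "G", frozenset("T"): "T",
    frozenset("AG"): "R", frozenset("CT"): "Y", frozenset("CG"): "S",
    frozenset("AT"): "W", frozenset("GT"): "K", frozenset("AC"): "M",
    frozenset("CGT"): "B", frozenset("AGT"): "D", frozenset("ACT"): "H",
    frozenset("ACG"): "V", frozenset("ACGT"): "N",
}


def threshold_consensus(aligned_reads, threshold=0.20):
    """Consensus of equal-length aligned reads; bases at or above threshold are kept."""
    if not aligned_reads:
        return ""
    length = len(aligned_reads[0])
    if any(len(read) != length for read in aligned_reads):
        raise ValueError("reads must be aligned to the same length")
    consensus = []
    for column in zip(*(read.upper() for read in aligned_reads)):
        counts = Counter(base for base in column if base in "ACGT")
        total = sum(counts.values())
        if total == 0:
            consensus.append("N")  # no informative base in this column
            continue
        kept = frozenset(b for b, n in counts.items() if n / total >= threshold)
        if not kept:
            # No base reaches a high threshold: fall back to the most frequent base.
            kept = frozenset(counts.most_common(1)[0][0])
        consensus.append(IUPAC[kept])
    return "".join(consensus)


# Hypothetical reads: a minority variant (T, 30%) at the second position.
reads = ["ACGT"] * 7 + ["ATGT"] * 3
print(threshold_consensus(reads, threshold=0.20))
print(threshold_consensus(reads, threshold=0.50))
```

Under this reading, a lower threshold admits more minority variants into the consensus as ambiguity codes, while a higher threshold keeps only the dominant base at most positions.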
Method for isolating chromosomal DNA in preparation for hybridization in suspension

DOEpatents

Lucas, Joe N.

2000-01-01

A method is provided for detecting nucleic acid sequence aberrations using two immobilization steps. According to the method, a nucleic acid sequence aberration is detected by detecting nucleic acid sequences having both a first nucleic acid sequence type (e.g., from a first chromosome) and a second nucleic acid sequence type (e.g., from a second chromosome), the presence of the first and the second nucleic acid sequence type on the same nucleic acid sequence indicating the presence of a nucleic acid sequence aberration. In the method, immobilization of a first hybridization probe is used to isolate a first set of nucleic acids in the sample which contain the first nucleic acid sequence type. Immobilization of a second hybridization probe is then used to isolate a second set of nucleic acids from within the first set of nucleic acids which contain the second nucleic acid sequence type. The second set of nucleic acids are then detected, their presence indicating the presence of a nucleic acid sequence aberration. Chromosomal DNA in a sample containing cell debris is prepared for hybridization in suspension by treating the mixture with RNase. The treated DNA can also be fixed prior to hybridization.
Megabase sequencing of human genome by ordered-shotgun-sequencing (OSS) strategy

NASA Astrophysics Data System (ADS)

Chen, Ellson Y.

1997-05-01

So far we have used the OSS strategy to sequence over 2 megabases of DNA in large-insert clones from regions of human X chromosomes with different characteristic levels of GC content. The method starts by randomly fragmenting a BAC, YAC or PAC to 8-12 kb pieces and subcloning those into lambda phage. Insert-ends of these clones are sequenced and overlapped to create a partial map. Complete sequencing is then done on a minimal tiling path of selected subclones, recursively focusing on those at the edges of contigs to facilitate mergers of clones across the entire target. To reduce manual labor, PCR processes have been adapted to prepare sequencing templates throughout the entire operation. The streamlined process can thus lend itself to further automation. The OSS approach is suitable for large-scale genomic sequencing, providing considerable flexibility in the choice of subclones or regions for more or less intensive sequencing. For example, subclones containing contaminating host cell DNA or cloning vector can be recognized and ignored with minimal sequencing effort; regions overlapping a neighboring clone already sequenced need not be redone; and segments containing tandem repeats or long repetitive sequences can be spotted early on and targeted for additional attention.
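The idea of choosing a minimal tiling path of subclones can be sketched as a greedy interval cover. The sketch assumes that each subclone has already been placed on the target as a half-open coordinate range, which is a simplification of the partial map built from insert-end sequences; the function name `minimal_tiling_path`, the clone names, and the coordinates are all hypothetical.

```python
def minimal_tiling_path(clones, target_start, target_end):
    """Greedy minimal cover of [target_start, target_end).

    clones: list of (name, start, end) tuples with half-open coordinates.
    Returns the chosen clones in order along the target.
    """
    ordered = sorted(clones, key=lambda clone: clone[1])
    path = []
    covered = target_start
    i = 0
    while covered < target_end:
        best = None
        # Among clones starting at or before the covered edge, take the one
        # that extends furthest to the right.
        while i < len(ordered) and ordered[i][1] <= covered:
            if best is None or ordered[i][2] > best[2]:
                best = ordered[i]
            i += 1
        if best is None or best[2] <= covered:
            raise ValueError(f"gap in clone coverage at position {covered}")
        path.append(best)
        covered = best[2]
    return path


# Hypothetical subclones on a 30 kb target (coordinates in kb).
clones = [("a", 0, 10), ("b", 5, 12), ("c", 8, 20), ("d", 18, 30), ("e", 19, 25)]
print([name for name, _, _ in minimal_tiling_path(clones, 0, 30)])
```

The greedy choice yields a cover with the fewest clones for a fixed set of placed intervals. The recursive focus on contig edges described in the record is not modeled here.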
The future scalability of pH-based genome sequencers: A theoretical perspective

NASA Astrophysics Data System (ADS)

Go, Jonghyun; Alam, Muhammad A.

2013-10-01

Sequencing of the human genome is an essential prerequisite for personalized medicine and early prognosis of various genetic diseases. The state-of-the-art, high-throughput genome sequencing technologies provide improved sequencing; however, their reliance on relatively expensive optical detection schemes has prevented widespread adoption of the technology in routine care. In contrast, the recently announced pH-based electronic genome sequencers achieve fast sequencing at low cost because of their compatibility with current microelectronics technology. While the progress in technology development has been rapid, the physics of the sequencing chips and the potential for future scaling (and therefore, cost reduction) remain unexplored. In this article, we develop a theoretical framework and a scaling theory to explain the principle of operation of the pH-based sequencing chips and use the framework to explore various perceived scaling limits of the technology related to signal to noise ratio, well-to-well crosstalk, and sequencing accuracy. We also address several limitations inherent to the key steps of pH-based genome sequencers, which are widely shared by many other sequencing platforms in the market but have not been properly explained so far.
Pulse sequence programming in a dynamic visual environment: SequenceTree.

PubMed

Magland, Jeremy F; Li, Cheng; Langham, Michael C; Wehrli, Felix W

2016-01-01

To describe SequenceTree, an open source, integrated software environment for implementing MRI pulse sequences and, ideally, exporting them to actual MRI scanners. The software is a user-friendly alternative to vendor-supplied pulse sequence design and editing tools and is suited for programmers and nonprogrammers alike. The integrated user interface was programmed using the Qt4/C++ toolkit. As parameters and code are modified, the pulse sequence diagram is automatically updated within the user interface. Several aspects of pulse programming are handled automatically, allowing users to focus on higher-level aspects of sequence design. Sequences can be simulated using a built-in Bloch equation solver and then exported for use on a Siemens MRI scanner. Ideally, other types of scanners will be supported in the future. SequenceTree has been used for 8 years in our laboratory and elsewhere and has contributed to more than 50 peer-reviewed publications in areas such as cardiovascular imaging, solid state and nonproton NMR, MR elastography, and high-resolution structural imaging. SequenceTree is an innovative, open source, visual pulse sequence environment for MRI combining simplicity with flexibility and is ideal both for advanced users and users with limited programming experience. © 2015 Wiley Periodicals, Inc.
Investigation of Human Cancers for Retrovirus by Low-Stringency Target Enrichment and High-Throughput Sequencing.

PubMed

Vinner, Lasse; Mourier, Tobias; Friis-Nielsen, Jens; Gniadecki, Robert; Dybkaer, Karen; Rosenberg, Jacob; Langhoff, Jill Levin; Cruz, David Flores Santa; Fonager, Jannik; Izarzugaza, Jose M G; Gupta, Ramneek; Sicheritz-Ponten, Thomas; Brunak, Søren; Willerslev, Eske; Nielsen, Lars Peter; Hansen, Anders Johannes

2015-08-19

Although nearly one fifth of all human cancers have an infectious aetiology, the causes for the majority of cancers remain unexplained. Despite the enormous data output from high-throughput shotgun sequencing, viral DNA in a clinical sample typically constitutes a proportion of host DNA that is too small to be detected. Sequence variation among virus genomes complicates application of sequence-specific, and highly sensitive, PCR methods. Therefore, we aimed to develop and characterize a method that permits sensitive detection of sequences despite considerable variation. We demonstrate that our low-stringency in-solution hybridization method enables detection of <100 viral copies. Furthermore, distantly related proviral sequences may be enriched by orders of magnitude, enabling discovery of hitherto unknown viral sequences by high-throughput sequencing. The sensitivity was sufficient to detect retroviral sequences in clinical samples. We used this method to conduct an investigation for novel retrovirus in samples from three cancer types. In accordance with recent studies our investigation revealed no retroviral infections in human B-cell lymphoma cells, cutaneous T-cell lymphoma or colorectal cancer biopsies. Nonetheless, our generally applicable method makes sensitive detection possible and permits sequencing of distantly related sequences from complex material.

Modeling genome coverage in single-cell sequencing

PubMed Central

Daley, Timothy; Smith, Andrew D.

2014-01-01

Motivation: Single-cell DNA sequencing is necessary for examining genetic variation at the cellular level, which remains hidden in bulk sequencing experiments. But because such experiments begin with very small amounts of starting material, the amount of information that is obtained from a single-cell sequencing experiment is highly sensitive to the choice of protocol employed and variability in library preparation. In particular, the fraction of the genome represented in single-cell sequencing libraries exhibits extreme variability due to quantitative biases in amplification and loss of genetic material. Results: We propose a method to predict the genome coverage of a deep sequencing experiment using information from an initial shallow sequencing experiment mapped to a reference genome. The observed coverage statistics are used in a non-parametric empirical Bayes Poisson model to estimate the gain in coverage from deeper sequencing. This approach allows researchers to know statistical features of deep sequencing experiments without actually sequencing deeply, providing a basis for optimizing and comparing single-cell sequencing protocols or screening libraries. Availability and implementation: The method is available as part of the preseq software package. Source code is available at http://smithlabresearch.org/preseq. Contact: andrewds@usc.edu Supplementary information: Supplementary material is available at Bioinformatics online. PMID:25107873
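To give a feel for how coverage statistics from a shallow experiment can be extrapolated, the sketch below uses the classical Good-Toulmin estimator, a well-known non-parametric empirical Bayes result under a Poisson model. It is offered only as a simplified illustration: it is not the preseq implementation, the abstract does not state the exact estimator used, and the histogram values are made up. The estimator is only stable for extrapolation up to a doubling of sequencing effort (t ≤ 1).

```python
def good_toulmin_new_bases(count_histogram, t):
    """Expected number of newly covered bases after t-fold additional sequencing.

    count_histogram: dict mapping j -> n_j, the number of genome positions
        covered by exactly j reads in the initial experiment.
    t: additional sequencing effort as a fraction of the initial experiment.
    Uses the alternating sum: sum over j >= 1 of (-1)**(j + 1) * t**j * n_j.
    """
    if not 0 <= t <= 1:
        raise ValueError("this simple estimator is only reliable for 0 <= t <= 1")
    return sum(
        (-1) ** (j + 1) * (t ** j) * n_j
        for j, n_j in count_histogram.items()
        if j >= 1
    )


# Hypothetical shallow-coverage histogram: positions seen once, twice, three times.
histogram = {1: 100, 2: 40, 3: 10}
print(good_toulmin_new_bases(histogram, 1.0))   # doubling the sequencing effort
print(good_toulmin_new_bases(histogram, 0.5))   # 50% more sequencing
```

The alternating series diverges quickly for larger t, so extrapolating well beyond the initial depth requires a stabilized form of the estimator, which this sketch does not attempt.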
A Case Study into Microbial Genome Assembly Gap Sequences and Finishing Strategies

DOE PAGES

Utturkar, Sagar M.; Klingeman, Dawn M.; Hurt, Jr., Richard A.; ...

2017-07-18

This study characterized regions of DNA which remained unassembled by either PacBio or Illumina sequencing technologies for seven bacterial genomes. Two genomes were manually finished using bioinformatics and PCR/Sanger sequencing approaches and regions not assembled by automated software were analyzed. Gaps present within Illumina assemblies mostly correspond to repetitive DNA regions such as multiple rRNA operon sequences. PacBio gap sequences were evaluated for several properties such as GC content, read coverage, gap length, ability to form strong secondary structures, and corresponding annotations. Our hypothesis that strong secondary DNA structures blocked DNA polymerases and contributed to gap sequences was not accepted. PacBio assemblies had few limitations overall and gaps were explained as the cumulative effect of lower than average sequence coverage and repetitive sequences at contig termini. An important aspect of the present study is the compilation of biological features that interfered with assembly and included active transposons, multiple plasmid sequences, phage DNA integration, and large sequence duplication. Furthermore, our targeted genome finishing approach and systematic evaluation of the unassembled DNA will be useful for others looking to close, finish, and polish microbial genome sequences.
A Case Study into Microbial Genome Assembly Gap Sequences and Finishing Strategies

PubMed Central

Utturkar, Sagar M.; Klingeman, Dawn M.; Hurt, Richard A.; Brown, Steven D.

2017-01-01

This study characterized regions of DNA which remained unassembled by either PacBio or Illumina sequencing technologies for seven bacterial genomes. Two genomes were manually finished using bioinformatics and PCR/Sanger sequencing approaches and regions not assembled by automated software were analyzed. Gaps present within Illumina assemblies mostly correspond to repetitive DNA regions such as multiple rRNA operon sequences. PacBio gap sequences were evaluated for several properties such as GC content, read coverage, gap length, ability to form strong secondary structures, and corresponding annotations. Our hypothesis that strong secondary DNA structures blocked DNA polymerases and contributed to gap sequences was not accepted. PacBio assemblies had few limitations overall and gaps were explained as the cumulative effect of lower than average sequence coverage and repetitive sequences at contig termini. An important aspect of the present study is the compilation of biological features that interfered with assembly and included active transposons, multiple plasmid sequences, phage DNA integration, and large sequence duplication. Our targeted genome finishing approach and systematic evaluation of the unassembled DNA will be useful for others looking to close, finish, and polish microbial genome sequences. PMID:28769883
Reproducibility of Illumina platform deep sequencing errors allows accurate determination of DNA barcodes in cells.

PubMed

Beltman, Joost B; Urbanus, Jos; Velds, Arno; van Rooij, Nienke; Rohr, Jan C; Naik, Shalin H; Schumacher, Ton N

2016-04-02

Next generation sequencing (NGS) of amplified DNA is a powerful tool to describe genetic heterogeneity within cell populations that can both be used to investigate the clonal structure of cell populations and to perform genetic lineage tracing. For applications in which both abundant and rare sequences are biologically relevant, the relatively high error rate of NGS techniques complicates data analysis, as it is difficult to distinguish rare true sequences from spurious sequences that are generated by PCR or sequencing errors. This issue, for instance, applies to cellular barcoding strategies that aim to follow the amount and type of offspring of single cells, by supplying these with unique heritable DNA tags. Here, we use genetic barcoding data from the Illumina HiSeq platform to show that straightforward read threshold-based filtering of data is typically insufficient to filter out spurious barcodes. Importantly, we demonstrate that specific sequencing errors occur at an approximately constant rate across different samples that are sequenced in parallel. We exploit this observation by developing a novel approach to filter out spurious sequences. Application of our new method demonstrates its value in the identification of true sequences amongst spurious sequences in biological data sets.
Analysis and Functional Annotation of an Expressed Sequence Tag Collection for Tropical Crop Sugarcane

PubMed Central

Vettore, André L.; da Silva, Felipe R.; Kemper, Edson L.; Souza, Glaucia M.; da Silva, Aline M.; Ferro, Maria Inês T.; Henrique-Silva, Flavio; Giglioti, Éder A.; Lemos, Manoel V.F.; Coutinho, Luiz L.; Nobrega, Marina P.; Carrer, Helaine; França, Suzelei C.; Bacci, Maurício; Goldman, Maria Helena S.; Gomes, Suely L.; Nunes, Luiz R.; Camargo, Luis E.A.; Siqueira, Walter J.; Van Sluys, Marie-Anne; Thiemann, Otavio H.; Kuramae, Eiko E.; Santelli, Roberto V.; Marino, Celso L.; Targon, Maria L.P.N.; Ferro, Jesus A.; Silveira, Henrique C.S.; Marini, Danyelle C.; Lemos, Eliana G.M.; Monteiro-Vitorello, Claudia B.; Tambor, José H.M.; Carraro, Dirce M.; Roberto, Patrícia G.; Martins, Vanderlei G.; Goldman, Gustavo H.; de Oliveira, Regina C.; Truffi, Daniela; Colombo, Carlos A.; Rossi, Magdalena; de Araujo, Paula G.; Sculaccio, Susana A.; Angella, Aline; Lima, Marleide M.A.; de Rosa, Vicente E.; Siviero, Fábio; Coscrato, Virginia E.; Machado, Marcos A.; Grivet, Laurent; Di Mauro, Sonia M.Z.; Nobrega, Francisco G.; Menck, Carlos F.M.; Braga, Marilia D.V.; Telles, Guilherme P.; Cara, Frank A.A.; Pedrosa, Guilherme; Meidanis, João; Arruda, Paulo

2003-01-01

To contribute to our understanding of the genome complexity of sugarcane, we undertook a large-scale expressed sequence tag (EST) program. More than 260,000 cDNA clones were partially sequenced from 26 standard cDNA libraries generated from different sugarcane tissues. After the processing of the sequences, 237,954 high-quality ESTs were identified. These ESTs were assembled into 43,141 putative transcripts. Of the assembled sequences, 35.6% presented no matches with existing sequences in public databases. A global analysis of the whole SUCEST data set indicated that 14,409 assembled sequences (33% of the total) contained at least one cDNA clone with a full-length insert. Annotation of the 43,141 assembled sequences associated almost 50% of the putative identified sugarcane genes with protein metabolism, cellular communication/signal transduction, bioenergetics, and stress responses. Inspection of the translated assembled sequences for conserved protein domains revealed 40,821 amino acid sequences with 1415 Pfam domains. Reassembling the consensus sequences of the 43,141 transcripts revealed a 22% redundancy in the first assembling. This indicated that possibly 33,620 unique genes had been identified and indicated that >90% of the sugarcane expressed genes were tagged. PMID:14613979
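The percentages quoted in the record above can be checked against its own counts with a few lines of arithmetic. The variable names are illustrative; all numbers come from the abstract.

```python
assembled_transcripts = 43_141   # putative transcripts after the first assembly
unique_genes = 33_620            # estimate after reassembling the consensus sequences
with_full_length_clone = 14_409  # assembled sequences with a full-length insert

# Redundancy of the first assembly: the fraction of transcripts that collapse on reassembly.
redundancy = 1 - unique_genes / assembled_transcripts
# Fraction of assembled sequences containing at least one full-length cDNA clone.
full_length_fraction = with_full_length_clone / assembled_transcripts

print(f"redundancy: {redundancy:.1%}")
print(f"full-length fraction: {full_length_fraction:.1%}")
```

Both values round to the figures given in the abstract (22% and 33%).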
Levels of integration in cognitive control and sequence processing in the prefrontal cortex.

PubMed

Bahlmann, Jörg; Korb, Franziska M; Gratton, Caterina; Friederici, Angela D

2012-01-01

Cognitive control is necessary to flexibly act in changing environments. Sequence processing is needed in language comprehension to build the syntactic structure in sentences. Functional imaging studies suggest that sequence processing engages the left ventrolateral prefrontal cortex (PFC). In contrast, cognitive control processes additionally recruit bilateral rostral lateral PFC regions. The present study aimed to investigate these two types of processes in one experimental paradigm. Sequence processing was manipulated using two different sequencing rules varying in complexity. Cognitive control was varied with different cue-sets that determined the choice of a sequencing rule. Univariate analyses revealed distinct PFC regions for the two types of processing (i.e. sequence processing: left ventrolateral PFC and cognitive control processing: bilateral dorsolateral and rostral PFC). Moreover, in a common brain network (including left lateral PFC and intraparietal sulcus) no interaction between sequence and cognitive control processing was observed. In contrast, a multivariate pattern analysis revealed an interaction of sequence and cognitive control processing, such that voxels in left lateral PFC and parietal cortex showed different tuning functions for tasks involving different sequencing and cognitive control demands. These results suggest that the difference between the process of rule selection (i.e. cognitive control) and the process of rule-based sequencing (i.e. sequence processing) find their neuronal underpinnings in distinct activation patterns in lateral PFC. Moreover, the combination of rule selection and rule sequencing can shape the response of neurons in lateral PFC and parietal cortex.
Non-Cartesian Balanced SSFP Pulse Sequences for Real-Time Cardiac MRI

PubMed Central

Feng, Xue; Salerno, Michael; Kramer, Christopher M.; Meyer, Craig H.

2015-01-01

Purpose: To develop a new spiral-in/out balanced steady-state free precession (bSSFP) pulse sequence for real-time cardiac MRI and compare it with radial and spiral-out techniques. Methods: Non-Cartesian sampling strategies are efficient and robust to motion and thus have important advantages for real-time bSSFP cine imaging. This study describes a new symmetric spiral-in/out sequence with intrinsic gradient moment compensation and SSFP refocusing at TE=TR/2. In-vivo real-time cardiac imaging studies were performed to compare radial, spiral-out, and spiral-in/out bSSFP pulse sequences. Furthermore, phase-based fat-water separation taking advantage of the refocusing mechanism of the spiral-in/out bSSFP sequence was also studied. Results: The image quality of the spiral-out and spiral-in/out bSSFP sequences was improved with off-resonance and k-space trajectory correction. The spiral-in/out bSSFP sequence had the highest SNR, CNR, and image quality ratings, with the spiral-out bSSFP sequence second in each category and the radial bSSFP sequence third. The spiral-in/out bSSFP sequence provides separated fat and water images with no additional scan time. Conclusions: In this work a new spiral-in/out bSSFP sequence was developed and tested. The superiority of spiral bSSFP sequences over the radial bSSFP sequence in terms of SNR and reduced artifacts was demonstrated in real-time MRI of cardiac function without image acceleration. PMID:25960254
Levels of Integration in Cognitive Control and Sequence Processing in the Prefrontal Cortex

PubMed Central

Bahlmann, Jörg; Korb, Franziska M.; Gratton, Caterina; Friederici, Angela D.

2012-01-01

Cognitive control is necessary to flexibly act in changing environments. Sequence processing is needed in language comprehension to build the syntactic structure in sentences. Functional imaging studies suggest that sequence processing engages the left ventrolateral prefrontal cortex (PFC). In contrast, cognitive control processes additionally recruit bilateral rostral lateral PFC regions. The present study aimed to investigate these two types of processes in one experimental paradigm. Sequence processing was manipulated using two different sequencing rules varying in complexity. Cognitive control was varied with different cue-sets that determined the choice of a sequencing rule. Univariate analyses revealed distinct PFC regions for the two types of processing (i.e. sequence processing: left ventrolateral PFC and cognitive control processing: bilateral dorsolateral and rostral PFC). Moreover, in a common brain network (including left lateral PFC and intraparietal sulcus) no interaction between sequence and cognitive control processing was observed. In contrast, a multivariate pattern analysis revealed an interaction of sequence and cognitive control processing, such that voxels in left lateral PFC and parietal cortex showed different tuning functions for tasks involving different sequencing and cognitive control demands. These results suggest that the difference between the process of rule selection (i.e. cognitive control) and the process of rule-based sequencing (i.e. sequence processing) find their neuronal underpinnings in distinct activation patterns in lateral PFC. Moreover, the combination of rule selection and rule sequencing can shape the response of neurons in lateral PFC and parietal cortex. PMID:22952762
Rapid Identification of Sequences for Orphan Enzymes to Power Accurate Protein Annotation

PubMed Central

Ojha, Sunil; Watson, Douglas S.; Bomar, Martha G.; Galande, Amit K.; Shearer, Alexander G.

2013-01-01

The power of genome sequencing depends on the ability to understand what those genes and their protein products actually do. The automated methods used to assign functions to putative proteins in newly sequenced organisms are limited by the size of our library of proteins with both known function and sequence. Unfortunately this library grows slowly, lagging well behind the rapid increase in novel protein sequences produced by modern genome sequencing methods. One potential source for rapidly expanding this functional library is the “back catalog” of enzymology – “orphan enzymes,” those enzymes that have been characterized and yet lack any associated sequence. There are hundreds of orphan enzymes in the Enzyme Commission (EC) database alone. In this study, we demonstrate how this orphan enzyme “back catalog” is a fertile source for rapidly advancing the state of protein annotation. Starting from three orphan enzyme samples, we applied mass-spectrometry based analysis and computational methods (including sequence similarity networks, sequence and structural alignments, and operon context analysis) to rapidly identify the specific sequence for each orphan while avoiding the most time- and labor-intensive aspects of typical sequence identifications. We then used these three new sequences to more accurately predict the catalytic function of 385 previously uncharacterized or misannotated proteins. We expect that this kind of rapid sequence identification could be efficiently applied on a larger scale to make enzymology’s “back catalog” another powerful tool to drive accurate genome annotation. PMID:24386392
Improving performance of DS-CDMA systems using chaotic complex Bernoulli spreading codes

NASA Astrophysics Data System (ADS)

Farzan Sabahi, Mohammad; Dehghanfard, Ali

2014-12-01

The most important goal of a spread spectrum communication system is to protect communication signals against interference and exploitation of information by unintended listeners. In fact, low probability of detection and low probability of intercept are two important parameters for increasing the performance of the system. In Direct Sequence Code Division Multiple Access (DS-CDMA) systems, these properties are achieved by multiplying the data by spreading sequences. Chaotic sequences, with their particular properties, have numerous applications in constructing spreading codes. Using a one-dimensional Bernoulli chaotic sequence as a spreading code has been proposed previously in the literature. The main feature of this sequence is its negative auto-correlation at a lag of 1, which, with proper design, leads to an increase in the efficiency of communication systems based on these codes. On the other hand, employing complex chaotic sequences as spreading sequences has also been discussed in several papers. In this paper, the use of two-dimensional Bernoulli chaotic sequences as spreading codes is proposed. The performance of multi-user synchronous and asynchronous DS-CDMA systems is evaluated by applying these sequences under Additive White Gaussian Noise (AWGN) and fading channels. Simulation results indicate improved performance in comparison with conventional spreading codes such as Gold codes as well as similar complex chaotic spreading sequences. Like one-dimensional Bernoulli chaotic sequences, the proposed sequences also have negative auto-correlation. In addition, the proposed method makes it possible to construct complex sequences with lower average cross-correlation.
Rapid and Accurate Sequencing of Enterovirus Genomes Using MinION Nanopore Sequencer.

PubMed

Wang, Ji; Ke, Yue Hua; Zhang, Yong; Huang, Ke Qiang; Wang, Lei; Shen, Xin Xin; Dong, Xiao Ping; Xu, Wen Bo; Ma, Xue Jun

2017-10-01

Knowledge of an enterovirus genome sequence is very important in epidemiological investigation to identify transmission patterns and ascertain the extent of an outbreak. The MinION sequencer is increasingly used to sequence various viral pathogens in many clinical situations because of its long reads, portability, real-time accessibility of sequenced data, and very low initial costs. However, information is lacking on MinION sequencing of enterovirus genomes. In this proof-of-concept study using Enterovirus 71 (EV71) and Coxsackievirus A16 (CA16) strains as examples, we established an amplicon-based whole genome sequencing method using MinION. We explored the accuracy, minimum sequencing time, discrimination and high-throughput sequencing ability of MinION, and compared its performance with Sanger sequencing. Within the first minute (min) of sequencing, the accuracy of MinION was 98.5% for the single EV71 strain and 94.12%-97.33% for 10 genetically-related CA16 strains. In as little as 14 min, 99% identity was reached for the single EV71 strain, and in 17 min (on average), 99% identity was achieved for 10 CA16 strains in a single run. MinION is suitable for whole genome sequencing of enteroviruses with sufficient accuracy and fine discrimination and has the potential as a fast, reliable and convenient method for routine use. Copyright © 2017 The Editorial Board of Biomedical and Environmental Sciences. Published by China CDC. All rights reserved.
Rapid identification of sequences for orphan enzymes to power accurate protein annotation.

PubMed

Ramkissoon, Kevin R; Miller, Jennifer K; Ojha, Sunil; Watson, Douglas S; Bomar, Martha G; Galande, Amit K; Shearer, Alexander G

2013-01-01

The power of genome sequencing depends on the ability to understand what those genes and their protein products actually do. The automated methods used to assign functions to putative proteins in newly sequenced organisms are limited by the size of our library of proteins with both known function and sequence. Unfortunately this library grows slowly, lagging well behind the rapid increase in novel protein sequences produced by modern genome sequencing methods. One potential source for rapidly expanding this functional library is the "back catalog" of enzymology--"orphan enzymes," those enzymes that have been characterized and yet lack any associated sequence. There are hundreds of orphan enzymes in the Enzyme Commission (EC) database alone. In this study, we demonstrate how this orphan enzyme "back catalog" is a fertile source for rapidly advancing the state of protein annotation. Starting from three orphan enzyme samples, we applied mass-spectrometry-based analysis and computational methods (including sequence similarity networks, sequence and structural alignments, and operon context analysis) to rapidly identify the specific sequence for each orphan while avoiding the most time- and labor-intensive aspects of typical sequence identifications. We then used these three new sequences to more accurately predict the catalytic function of 385 previously uncharacterized or misannotated proteins. We expect that this kind of rapid sequence identification could be efficiently applied on a larger scale to make enzymology's "back catalog" another powerful tool to drive accurate genome annotation.
Comparison of the quality of different magnetic resonance image sequences of multiple myeloma.

PubMed

Sun, Zhao-yong; Zhang, Hai-bo; Li, Shuo; Wang, Yun; Xue, Hua-dan; Jin, Zheng-yu

2015-02-01

To compare the image quality of T1WI fat phase, T1WI water phase, short time inversion recovery (STIR) sequence, and diffusion weighted imaging (DWI) sequence in the evaluation of multiple myeloma (MM). A total of 20 MM patients were enrolled in this study. All patients underwent scanning with coronal T1WI fat phase, coronal T1WI water phase, coronal STIR sequence, and axial DWI sequence. The image quality of the four sequences was evaluated. The image was divided into seven sections (head and neck, chest, abdomen, pelvis, thigh, leg, and foot), and the signal-to-noise ratio (SNR) of each section was measured at 7 segments (skull, spine, pelvis, humerus, femur, tibia and fibula, and ribs). In addition, 20 active MM lesions were selected, and the contrast-to-noise ratio (CNR) of each scan sequence was calculated. The average image quality scores of T1WI fat phase, T1WI water phase, STIR sequence, and DWI sequence were 4.19 ± 0.70, 4.16 ± 0.73, 3.89 ± 0.70, and 3.76 ± 0.68, respectively. The image quality scores of T1-fat phase and T1-water phase were significantly higher than those of STIR (P=0.000 and P=0.001) and DWI sequence (both P=0.000); however, there was no significant difference between T1-fat and T1-water phase (P=0.723) or between STIR and DWI sequence (P=0.167). The SNR of T1WI fat phase was significantly higher than those of the other three sequences (all P=0.000), and there was no significant difference among the other three sequences (all P>0.05). Although the CNR of the DWI sequence was slightly higher than those of the other three sequences, there was no significant difference among them (all P>0.05). Imaging with T1WI fat phase, T1WI water phase, STIR sequence, and DWI sequence each has certain advantages, and they should be combined in the diagnosis of MM.
Evaluating the protein coding potential of exonized transposable element sequences

PubMed Central

Piriyapongsa, Jittima; Rutledge, Mark T; Patel, Sanil; Borodovsky, Mark; Jordan, I King

2007-01-01

Background: Transposable element (TE) sequences, once thought to be merely selfish or parasitic members of the genomic community, have been shown to contribute a wide variety of functional sequences to their host genomes. Analysis of complete genome sequences has turned up numerous cases where TE sequences have been incorporated as exons into mRNAs, and it is widely assumed that such 'exonized' TEs encode protein sequences. However, the extent to which TE-derived sequences actually encode proteins is unknown and a matter of some controversy. We have tried to address this outstanding issue from two perspectives: (i) by evaluating ascertainment biases related to the search methods used to uncover TE-derived protein coding sequences (CDS) and (ii) through a probabilistic codon-frequency based analysis of the protein coding potential of TE-derived exons. Results: We compared the ability of three classes of sequence similarity search methods to detect TE-derived sequences among data sets of experimentally characterized proteins: (1) a profile-based hidden Markov model (HMM) approach, (2) BLAST methods and (3) RepeatMasker. Profile-based methods are more sensitive and more selective than the other methods evaluated. However, the application of profile-based search methods to the detection of TE-derived sequences among well-curated experimentally characterized protein data sets did not turn up many more cases than had been previously detected and nowhere near as many cases as recent genome-wide searches have. We observed that the different search methods used were complementary in the sense that they yielded largely non-overlapping sets of hits and differed in their ability to recover known cases of TE-derived CDS. The probabilistic analysis of TE-derived exon sequences indicates that these sequences have low protein coding potential on average. 
In particular, non-autonomous TEs that do not encode protein sequences, such as Alu elements, are frequently exonized but unlikely to encode protein sequences. Conclusion: The exaptation of the numerous TE sequences found in exons as bona fide protein coding sequences may prove to be far less common than has been suggested by the analysis of complete genomes. We hypothesize that many exonized TE sequences actually function as post-transcriptional regulators of gene expression, rather than coding sequences, which may act through a variety of double stranded RNA related regulatory pathways. Indeed, their relatively high copy numbers and similarity to sequences dispersed throughout the genome suggest that exonized TE sequences could serve as master regulators with a wide scope of regulatory influence. Reviewers: This article was reviewed by Itai Yanai, Kateryna D. Makova, Melissa Wilson (nominated by Kateryna D. Makova) and Cedric Feschotte (nominated by John M. Logsdon Jr.). PMID:18036258
In silico Analysis of 2085 Clones from a Normalized Rat Vestibular Periphery 3′ cDNA Library

PubMed Central

Roche, Joseph P.; Cioffi, Joseph A.; Kwitek, Anne E.; Erbe, Christy B.; Popper, Paul

2005-01-01

The inserts from 2400 cDNA clones isolated from a normalized Rattus norvegicus vestibular periphery cDNA library were sequenced and characterized. The Wackym-Soares vestibular 3′ cDNA library was constructed from the saccular and utricular maculae, the ampullae of all three semicircular canals and Scarpa's ganglia containing the somata of the primary afferent neurons, microdissected from 104 male and female rats. The inserts from 2400 randomly selected clones were sequenced from the 5′ end. Each sequence was analyzed using the BLAST algorithm compared to the Genbank nonredundant, rat genome, mouse genome and human genome databases to search for high homology alignments. Of the initial 2400 clones, 315 (13%) were found to be of poor quality and did not yield useful information, and therefore were eliminated from the analysis. Of the remaining 2085 sequences, 918 (44%) were found to represent 758 unique genes having useful annotations that were identified in databases within the public domain or in the published literature; these sequences were designated as known characterized sequences. A further 1141 sequences (55%) aligned with 1011 unique sequences that had no useful annotations and were designated as known but uncharacterized sequences. Of the remaining 26 sequences (1%), 24 aligned with rat genomic sequences, but none matched previously described rat expressed sequence tags or mRNAs. No significant alignment to the rat or human genomic sequences could be found for the remaining 2 sequences. Of the 2085 sequences analyzed, 86% were singletons. The known, characterized sequences were analyzed with the FatiGO online data-mining tool (http://fatigo.bioinfo.cnio.es/) to identify level 5 biological process gene ontology (GO) terms for each alignment and to group alignments with similar or identical GO terms. Numerous genes were identified that have not been previously shown to be expressed in the vestibular system. 
Further characterization of the novel cDNA sequences may lead to the identification of genes with vestibular-specific functions. Continued analysis of the rat vestibular periphery transcriptome should provide new insights into vestibular function and generate new hypotheses. Physiological studies are necessary to further elucidate the roles of the identified genes and novel sequences in vestibular function. PMID:16103642
Functional region prediction with a set of appropriate homologous sequences-an index for sequence selection by integrating structure and sequence information with spatial statistics

PubMed Central

2012-01-01

Background The detection of conserved residue clusters on a protein structure is one of the effective strategies for the prediction of functional protein regions. Various methods, such as Evolutionary Trace, have been developed based on this strategy. In such approaches, the conserved residues are identified through comparisons of homologous amino acid sequences. Therefore, the selection of homologous sequences is a critical step. It is empirically known that a certain degree of sequence divergence in the set of homologous sequences is required for the identification of conserved residues. However, the development of a method to select homologous sequences appropriate for the identification of conserved residues has not been sufficiently addressed. An objective and general method to select appropriate homologous sequences is desired for the efficient prediction of functional regions. Results We have developed a novel index to select the sequences appropriate for the identification of conserved residues, and implemented the index within our method to predict the functional regions of a protein. The implementation of the index improved the performance of the functional region prediction. The index represents the degree of conserved residue clustering on the tertiary structure of the protein. For this purpose, the structure and sequence information were integrated within the index by the application of spatial statistics. Spatial statistics is a field of statistics in which not only the attributes but also the geometrical coordinates of the data are considered simultaneously. Higher degrees of clustering generate larger index scores. We adopted the set of homologous sequences with the highest index score, under the assumption that the best prediction accuracy is obtained when the degree of clustering is the maximum. 
The set of sequences selected by the index led to higher functional region prediction performance than the sets of sequences selected by other sequence-based methods. Conclusions Appropriate homologous sequences are selected automatically and objectively by the index. Such sequence selection improved the performance of functional region prediction. As far as we know, this is the first approach in which spatial statistics have been applied to protein analyses. Such integration of structure and sequence information would be useful for other bioinformatics problems. PMID:22643026
BioNano genome mapping of individual chromosomes supports physical mapping and sequence assembly in complex plant genomes.

PubMed

Staňková, Helena; Hastie, Alex R; Chan, Saki; Vrána, Jan; Tulpová, Zuzana; Kubaláková, Marie; Visendi, Paul; Hayashi, Satomi; Luo, Mingcheng; Batley, Jacqueline; Edwards, David; Doležel, Jaroslav; Šimková, Hana

2016-07-01

The assembly of a reference genome sequence of bread wheat is challenging due to its specific features such as the genome size of 17 Gbp, polyploid nature and prevalence of repetitive sequences. BAC-by-BAC sequencing based on chromosomal physical maps, adopted by the International Wheat Genome Sequencing Consortium as the key strategy, reduces problems caused by the genome complexity and polyploidy, but the repeat content still hampers the sequence assembly. Availability of a high-resolution genomic map to guide sequence scaffolding and validate physical map and sequence assemblies would be highly beneficial to obtaining an accurate and complete genome sequence. Here, we chose the short arm of chromosome 7D (7DS) as a model to demonstrate for the first time that it is possible to couple chromosome flow sorting with genome mapping in nanochannel arrays and create a de novo genome map of a wheat chromosome. We constructed a high-resolution chromosome map composed of 371 contigs with an N50 of 1.3 Mb. Long DNA molecules achieved by our approach facilitated chromosome-scale analysis of repetitive sequences and revealed a ~800-kb array of tandem repeats intractable to current DNA sequencing technologies. Anchoring 7DS sequence assemblies obtained by clone-by-clone sequencing to the 7DS genome map provided a valuable tool to improve the BAC-contig physical map and validate sequence assembly on a chromosome-arm scale. Our results indicate that creating genome maps for the whole wheat genome in a chromosome-by-chromosome manner is feasible and that they will be an affordable tool to support the production of improved pseudomolecules. © 2016 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.
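The N50 statistic quoted in this abstract (371 contigs with an N50 of 1.3 Mb) is the contig length at which the cumulative length of contigs, taken from longest to shortest, first reaches half of the total. A minimal sketch of that calculation follows; the function name `n50` and the example lengths are illustrative assumptions, not data from the study.

```python
def n50(lengths):
    """Return the length L such that contigs of length >= L cover at least half the total."""
    if not lengths:
        raise ValueError("need at least one contig length")
    half_total = sum(lengths) / 2
    running = 0
    # Walk from the longest contig down, accumulating length until half is covered.
    for length in sorted(lengths, reverse=True):
        running += length
        if running >= half_total:
            return length


# Hypothetical contig lengths (arbitrary units), not taken from the paper.
example_lengths = [100, 80, 60, 40, 20]
example_n50 = n50(example_lengths)
```

For the hypothetical lengths above, the total is 300, and the two longest contigs (100 + 80) are the first to reach 150, so the N50 is 80.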
Automated two-point dixon screening for the evaluation of hepatic steatosis and siderosis: comparison with R2-relaxometry and chemical shift-based sequences.

PubMed

Henninger, B; Zoller, H; Rauch, S; Schocke, M; Kannengiesser, S; Zhong, X; Reiter, G; Jaschke, W; Kremser, C

2015-05-01

To evaluate the automated two-point Dixon screening sequence for the detection and estimated quantification of hepatic iron and fat compared with standard sequences as a reference. One hundred and two patients with suspected diffuse liver disease were included in this prospective study. The following MRI protocol was used: 3D-T1-weighted opposed- and in-phase gradient echo with two-point Dixon reconstruction and dual-ratio signal discrimination algorithm ("screening" sequence); fat-saturated, multi-gradient-echo sequence with 12 echoes; gradient-echo T1 FLASH opposed- and in-phase. Bland-Altman plots were generated and correlation coefficients were calculated to compare the sequences. The screening sequence diagnosed fat in 33, iron in 35 and a combination of both in 4 patients. Correlation between R2* values of the screening sequence and the standard relaxometry was excellent (r = 0.988). A slightly lower correlation (r = 0.978) was found between the fat fraction of the screening sequence and the standard sequence. Bland-Altman revealed systematically lower R2* values obtained from the screening sequence and higher fat fraction values obtained with the standard sequence with a rather high variability in agreement. The screening sequence is a promising method with fast diagnosis of the predominant liver disease. It is capable of estimating the amount of hepatic fat and iron comparable to standard methods. • MRI plays a major role in the clarification of diffuse liver disease. • The screening sequence was introduced for the assessment of diffuse liver disease. • It is a fast and automated algorithm for the evaluation of hepatic iron and fat. • It is capable of estimating the amount of hepatic fat and iron.
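The Bland-Altman comparison mentioned in this abstract summarizes agreement between two measurement methods by the mean of the paired differences (the bias) and limits of agreement at the bias plus or minus 1.96 standard deviations of the differences. The sketch below assumes that conventional definition; the function name `bland_altman` and the paired values are hypothetical and are not data from the study.

```python
from statistics import mean, stdev


def bland_altman(method_a, method_b):
    """Return (bias, lower limit, upper limit) for paired measurements."""
    if len(method_a) != len(method_b):
        raise ValueError("measurements must be paired")
    diffs = [a - b for a, b in zip(method_a, method_b)]
    bias = mean(diffs)                 # systematic offset between the methods
    half_width = 1.96 * stdev(diffs)   # uses the sample standard deviation
    return bias, bias - half_width, bias + half_width


# Hypothetical paired readings from a screening and a reference method.
screening = [10, 20, 30, 40]
reference = [12, 21, 33, 42]

bias, lower, upper = bland_altman(screening, reference)
```

A negative bias in this sketch means the first method reads systematically lower than the second, which is the kind of pattern the abstract reports for R2* values from the screening sequence.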
Probabilistic topic modeling for the analysis and classification of genomic sequences

PubMed Central

2015-01-01

Background: Studies on genomic sequences for classification and taxonomic identification have a leading role in the biomedical field and in the analysis of biodiversity. These studies focus on the so-called barcode genes, representing a well defined region of the whole genome. Recently, alignment-free techniques have been gaining importance because they are able to overcome the drawbacks of sequence alignment techniques. In this paper a new alignment-free method for DNA sequence clustering and classification is proposed. The method is based on k-mer representation and text mining techniques. Methods: The presented method is based on Probabilistic Topic Modeling, a statistical technique originally proposed for text documents. Probabilistic topic models are able to find in a document corpus the topics (recurrent themes) characterizing classes of documents. This technique, applied to DNA sequences representing the documents, exploits the frequency of fixed-length k-mers and builds a generative model for a training group of sequences. This generative model, obtained through the Latent Dirichlet Allocation (LDA) algorithm, is then used to classify a large set of genomic sequences. Results and conclusions: We performed classification of over 7000 16S DNA barcode sequences taken from the Ribosomal Database Project (RDP) repository, training probabilistic topic models. The proposed method is compared to the RDP tool and the Support Vector Machine (SVM) classification algorithm in an extensive set of trials using both complete sequences and short sequence snippets (from 400 bp to 25 bp). Our method reaches very similar results to the RDP classifier and SVM for complete sequences. The most interesting results are obtained when short sequence snippets are considered. In these conditions the proposed method outperforms RDP and SVM with ultra short sequences and it exhibits a smooth decrease of performance, at every taxonomic level, when the sequence length is decreased. PMID:25916734
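The k-mer representation that this abstract builds on treats each DNA sequence as a "document" whose "words" are its overlapping fixed-length substrings. A minimal sketch of that counting step is shown below; the function name `kmer_counts`, the choice of k = 3, and the example sequence are assumptions for illustration, and the topic-model (LDA) training itself is not shown.

```python
from collections import Counter


def kmer_counts(sequence, k):
    """Count every overlapping substring of length k in a sequence."""
    if k <= 0:
        raise ValueError("k must be positive")
    sequence = sequence.upper()
    # A sequence of length n has n - k + 1 overlapping k-mers (none if n < k).
    return Counter(sequence[i:i + k] for i in range(len(sequence) - k + 1))


# Hypothetical toy sequence, not a real barcode.
counts = kmer_counts("ACGTACGT", 3)
```

The resulting count vector per sequence is the kind of bag-of-words input a topic model consumes; word order beyond length k is discarded, which is what makes the approach alignment-free.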
Phylogenetic characterization of a biogas plant microbial community integrating clone library 16S-rDNA sequences and metagenome sequence data obtained by 454-pyrosequencing.

PubMed

Kröber, Magdalena; Bekel, Thomas; Diaz, Naryttza N; Goesmann, Alexander; Jaenicke, Sebastian; Krause, Lutz; Miller, Dimitri; Runte, Kai J; Viehöver, Prisca; Pühler, Alfred; Schlüter, Andreas

2009-06-01

The phylogenetic structure of the microbial community residing in a fermentation sample from a production-scale biogas plant fed with maize silage, green rye and liquid manure was analysed by an integrated approach using clone library sequences and metagenome sequence data obtained by 454-pyrosequencing. Sequencing of 109 clones from a bacterial and an archaeal 16S-rDNA amplicon library revealed that the obtained nucleotide sequences are similar but not identical to 16S-rDNA database sequences derived from different anaerobic environments including digestors and bioreactors. Most of the bacterial 16S-rDNA sequences could be assigned to the phylum Firmicutes with the most abundant class Clostridia and to the class Bacteroidetes, whereas most archaeal 16S-rDNA sequences cluster close to the methanogen Methanoculleus bourgensis. Further sequences of the archaeal library most probably represent so far non-characterised species within the genus Methanoculleus. A similar result was derived from phylogenetic analysis of mcrA clone sequences. The mcrA gene encodes the alpha-subunit of methyl-coenzyme-M reductase, which is involved in the final step of methanogenesis. BLASTn analysis applying stringent settings resulted in assignment of 16S-rDNA metagenome sequence reads to 62 16S-rDNA amplicon sequences, thus enabling abundance estimations for 16S-rDNA clone library sequences. Ribosomal Database Project (RDP) Classifier processing of metagenome 16S-rDNA reads revealed abundance of the phyla Firmicutes, Bacteroidetes and Euryarchaeota and the orders Clostridiales, Bacteroidales and Methanomicrobiales. Moreover, a large fraction of 16S-rDNA metagenome reads could not be assigned to lower taxonomic ranks, demonstrating that numerous microorganisms in the analysed fermentation sample of the biogas plant are still unclassified or unknown.

How Many Protein Sequences Fold to a Given Structure? A Coevolutionary Analysis.

PubMed

Tian, Pengfei; Best, Robert B

2017-10-17

Quantifying the relationship between protein sequence and structure is key to understanding the protein universe. A fundamental measure of this relationship is the total number of amino acid sequences that can fold to a target protein structure, known as the "sequence capacity," which has been suggested as a proxy for how designable a given protein fold is. Although sequence capacity has been extensively studied using lattice models and theory, numerical estimates for real protein structures are currently lacking. In this work, we have quantitatively estimated the sequence capacity of 10 proteins with a variety of different structures using a statistical model based on residue-residue co-evolution to capture the variation of sequences from the same protein family. Remarkably, we find that even for the smallest protein folds, such as the WW domain, the number of foldable sequences is extremely large, exceeding the Avogadro constant. In agreement with earlier theoretical work, the calculated sequence capacity is positively correlated with the size of the protein, or better, the density of contacts. This allows the absolute sequence capacity of a given protein to be approximately predicted from its structure. On the other hand, the relative sequence capacity, i.e., normalized by the total number of possible sequences, is an extremely tiny number and is strongly anti-correlated with the protein length. Thus, although there may be more foldable sequences for larger proteins, it will be much harder to find them. Lastly, we have correlated the evolutionary age of proteins in the CATH database with their sequence capacity as predicted by our model. The results suggest a trade-off between the opposing requirements of high designability and the likelihood of a novel fold emerging by chance. Published by Elsevier Inc.
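The distinction this abstract draws between absolute and relative sequence capacity can be made concrete with a small calculation in log space: the relative capacity is the number of foldable sequences divided by the total number of possible sequences, which for a chain of a given length over 20 amino acids is 20 raised to that length. The function name `log10_relative_capacity` and the example length of 35 residues are assumptions for illustration; the abstract states only that the capacity of small folds exceeds the Avogadro constant.

```python
import math


def log10_relative_capacity(log10_capacity, length, alphabet_size=20):
    """log10 of (foldable sequences / all possible sequences of this length)."""
    # All possible sequences: alphabet_size ** length, handled in log space
    # to avoid astronomically large integers.
    return log10_capacity - length * math.log10(alphabet_size)


# Assumed inputs: a capacity equal to the Avogadro constant (about 6.022e23)
# and a hypothetical 35-residue chain.
log10_avogadro = math.log10(6.022e23)
example = log10_relative_capacity(log10_avogadro, 35)
```

Under these assumed inputs the result is strongly negative, consistent with the abstract's point that even an enormous absolute number of foldable sequences is a tiny fraction of sequence space, and that the fraction shrinks as the chain gets longer.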
Phylogenetic characterization of Canine Parvovirus VP2 partial sequences from symptomatic dogs samples.

PubMed

Zienius, D; Lelešius, R; Kavaliauskis, H; Stankevičius, A; Šalomskas, A

2016-01-01

The aim of the present study was to detect canine parvovirus (CPV) in faecal samples of clinically ill domestic dogs by polymerase chain reaction (PCR), followed by partial sequencing of the VP2 gene and molecular characterization of circulating strains in Lithuania. Eleven dog faecal samples that were clinically and antigen-tested positive, collected during 2014-2015, were investigated by PCR. The phylogenetic investigations indicated that the Lithuanian CPV VP2 partial sequences (3025-3706 cds) were closely related and showed 99.0-99.9% identity. All Lithuanian sequences were associated with one phylogroup, but grouped in different clusters. Ten of the investigated Lithuanian CPV VP2 sequences were closely associated with the CPV-2a antigenic variant (99.4% nt identity). Five CPV VP2 sequences from Lithuania were related to CPV-2a, but were rather divergent (6.8 nt differences). Only one CPV VP2 sequence from Lithuania was associated (99.3% nt identity) with CPV-2b VP2 sequences from France, Italy, USA and Korea. Four of the eleven investigated Lithuanian dogs with CPV infection symptoms were vaccinated with CPV-2 vaccine, but their VP2 sequences were only distantly related phylogenetically to the VP2 sequences of CPV vaccine strains (11.5-15.8 nt differences). Ten Lithuanian CPV VP2 sequences had monophyletic relations among the geographically close samples, but five of them were rather divergent (1.0% less sequence similarity). One Lithuanian CPV VP2 sequence was closely related to the CPV-2b antigenic variant. All the Lithuanian CPV VP2 partial sequences were conserved and showed low phylogenetic association with the most commonly used CPV vaccine strains.
Classification of G-protein coupled receptors based on a rich generation of convolutional neural network, N-gram transformation and multiple sequence alignments.

PubMed

Li, Man; Ling, Cheng; Xu, Qi; Gao, Jingyang

2018-02-01

Sequence classification is crucial in predicting the function of newly discovered sequences. In recent years, prediction for the growing scale and diversity of sequences has relied heavily on machine-learning algorithms. To improve prediction accuracy, these algorithms must confront the key challenge of extracting valuable features. In this work, we propose a feature-enhanced protein classification approach that combines multiple sequence alignment algorithms, an N-gram probabilistic language model and a deep learning technique. The idea behind the proposed method is that if each group of sequences can be represented by one feature sequence, composed of homologous sites, there should be less loss when the sequence is rebuilt after a more relevant sequence is added to the group. On this basis, the question of whether a query sequence belongs to a group of sequences can be recast as calculating the probability that the new feature sequence evolves from the original one. The proposed work focuses on the hierarchical classification of G-protein Coupled Receptors (GPCRs), which begins by extracting the feature sequences from the multiple sequence alignment results of the GPCR sub-subfamilies. The N-gram model is then applied to construct the input vectors. Finally, these vectors are fed into a convolutional neural network to make a prediction. The experimental results show that the proposed method provides significant performance improvements. The classification error rate of the proposed method is reduced by at least 4.67% (family level I) and 5.75% (family level II), in comparison with the current state-of-the-art methods. The implementation of the proposed work is freely available at: https://github.com/alanFchina/CNN .
Genome sequence determination and metagenomic characterization of a Dehalococcoides mixed culture grown on cis-1,2-dichloroethene.

PubMed

Yohda, Masafumi; Yagi, Osami; Takechi, Ayane; Kitajima, Mizuki; Matsuda, Hisashi; Miyamura, Naoaki; Aizawa, Tomoko; Nakajima, Mutsuyasu; Sunairi, Michio; Daiba, Akito; Miyajima, Takashi; Teruya, Morimi; Teruya, Kuniko; Shiroma, Akino; Shimoji, Makiko; Tamotsu, Hinako; Juan, Ayaka; Nakano, Kazuma; Aoyama, Misako; Terabayashi, Yasunobu; Satou, Kazuhito; Hirano, Takashi

2015-07-01

A Dehalococcoides-containing bacterial consortium that performed dechlorination of 0.20 mM cis-1,2-dichloroethene to ethene in 14 days was obtained from the sediment mud of a lotus field. To obtain detailed information on the consortium, the metagenome was analyzed using the short-read next-generation sequencer SOLiD 3. Matching the obtained sequence tags with the reference genome sequences indicated that the Dehalococcoides sp. in the consortium was highly homologous to Dehalococcoides mccartyi CBDB1 and BAV1. Sequence comparison with the reference sequence constructed from 16S rRNA gene sequences in a public database showed the presence of Sedimentibacter, Sulfurospirillum, Clostridium, Desulfovibrio, Parabacteroides, Alistipes, Eubacterium, Peptostreptococcus and Proteocatella in addition to Dehalococcoides sp. After further enrichment, the members of the consortium were narrowed down to almost three species. Finally, the full-length circular genome sequence of the Dehalococcoides sp. in the consortium, D. mccartyi IBARAKI, was determined by analyzing the metagenome with the single-molecule DNA sequencer PacBio RS. The accuracy of the sequence was confirmed by matching it to the tag sequences obtained by SOLiD 3. The genome is 1,451,062 nt and the number of CDS is 1566, which includes 3 rRNA genes and 47 tRNA genes. There are twenty-eight RDase genes that are accompanied by the genes for anchor proteins. The genome exhibits significant sequence identity with other Dehalococcoides spp. throughout the genome, but there is a significant difference in the distribution of RDase genes. The combination of a short-read next-generation DNA sequencer and a long-read single-molecule DNA sequencer gives detailed information on a bacterial consortium. Copyright © 2014 The Society for Biotechnology, Japan. Published by Elsevier B.V. All rights reserved.
CDSbank: taxonomy-aware extraction, selection, renaming and formatting of protein-coding DNA or amino acid sequences.

PubMed

Hazes, Bart

2014-02-28

Protein-coding DNA sequences and their corresponding amino acid sequences are routinely used to study relationships between sequence, structure, function, and evolution. The rapidly growing size of sequence databases increases the power of such comparative analyses but it makes it more challenging to prepare high quality sequence data sets with control over redundancy, quality, completeness, formatting, and labeling. Software tools for some individual steps in this process exist but manual intervention remains a common and time consuming necessity. CDSbank is a database that stores both the protein-coding DNA sequence (CDS) and amino acid sequence for each protein annotated in Genbank. CDSbank also stores Genbank feature annotation, a flag to indicate incomplete 5' and 3' ends, full taxonomic data, and a heuristic to rank the scientific interest of each species. This rich information allows fully automated data set preparation with a level of sophistication that aims to meet or exceed manual processing. Defaults ensure ease of use for typical scenarios while allowing great flexibility when needed. Access is via a free web server at http://hazeslab.med.ualberta.ca/CDSbank/. CDSbank presents a user-friendly web server to download, filter, format, and name large sequence data sets. Common usage scenarios can be accessed via pre-programmed default choices, while optional sections give full control over the processing pipeline. Particular strengths are: extract protein-coding DNA sequences just as easily as amino acid sequences, full access to taxonomy for labeling and filtering, awareness of incomplete sequences, and the ability to take one protein sequence and extract all synonymous CDS or identical protein sequences in other species. Finally, CDSbank can also create labeled property files to, for instance, annotate or re-label phylogenetic trees.
PIMS sequencing extension: a laboratory information management system for DNA sequencing facilities.

PubMed

Troshin, Peter V; Postis, Vincent Lg; Ashworth, Denise; Baldwin, Stephen A; McPherson, Michael J; Barton, Geoffrey J

2011-03-07

Facilities that provide a service for DNA sequencing typically support large numbers of users and experiment types. The cost of services is often reduced by the use of liquid handling robots but the efficiency of such facilities is hampered because the software for such robots does not usually integrate well with the systems that run the sequencing machines. Accordingly, there is a need for software systems capable of integrating different robotic systems and managing sample information for DNA sequencing services. In this paper, we describe an extension to the Protein Information Management System (PIMS) that is designed for DNA sequencing facilities. The new version of PIMS has a user-friendly web interface and integrates all aspects of the sequencing process, including sample submission, handling and tracking, together with capture and management of the data. The PIMS sequencing extension has been in production since July 2009 at the University of Leeds DNA Sequencing Facility. It has completely replaced manual data handling and simplified the tasks of data management and user communication. Samples from 45 groups have been processed with an average throughput of 10000 samples per month. The current version of the PIMS sequencing extension works with Applied Biosystems 3130XL 96-well plate sequencer and MWG 4204 or Aviso Theonyx liquid handling robots, but is readily adaptable for use with other combinations of robots. PIMS has been extended to provide a user-friendly and integrated data management solution for DNA sequencing facilities that is accessed through a normal web browser and allows simultaneous access by multiple users as well as facility managers. The system integrates sequencing and liquid handling robots, manages the data flow, and provides remote access to the sequencing results. The software is freely available, for academic users, from http://www.pims-lims.org/.
Nanopore Technology: A Simple, Inexpensive, Futuristic Technology for DNA Sequencing.

PubMed

Gupta, P D

2016-10-01

In health care, the importance of DNA sequencing has been fully established. Sanger capillary electrophoresis DNA sequencing is time-consuming and cumbersome, and hence expensive. Because of its versatility, DNA sequencing has recently become a household name, and there is therefore an urgent need for a simple, fast, inexpensive DNA sequencing technology. At the beginning of this century, efforts were made and nanopore DNA sequencing technology was developed; it is still in its infancy, but it is nevertheless a futuristic technology.
Computation of repetitions and regularities of biologically weighted sequences.

PubMed

Christodoulakis, M; Iliopoulos, C; Mouchard, L; Perdikuri, K; Tsakalidis, A; Tsichlas, K

2006-01-01

Biological weighted sequences are used extensively in molecular biology as profiles for protein families, in the representation of binding sites and often for the representation of sequences produced by a shotgun sequencing strategy. In this paper, we address three fundamental problems in the area of biologically weighted sequences: (i) computation of repetitions, (ii) pattern matching, and (iii) computation of regularities. Our algorithms can be used as basic building blocks for more sophisticated algorithms applied on weighted sequences.
A rapid and cost-effective method for sequencing pooled cDNA clones by using a combination of transposon insertion and Gateway technology.

PubMed

Morozumi, Takeya; Toki, Daisuke; Eguchi-Ogawa, Tomoko; Uenishi, Hirohide

2011-09-01

Large-scale cDNA-sequencing projects require an efficient strategy for mass sequencing. Here we describe a method for sequencing pooled cDNA clones using a combination of transposon insertion and Gateway technology. Our method reduces the number of shotgun clones that are unsuitable for reconstruction of cDNA sequences, and has the advantage of reducing the total costs of the sequencing project.
Effects of informed consent for individual genome sequencing on relevant knowledge.

PubMed

Kaphingst, K A; Facio, F M; Cheng, M-R; Brooks, S; Eidem, H; Linn, A; Biesecker, B B; Biesecker, L G

2012-11-01

Increasing availability of individual genomic information suggests that patients will need knowledge about genome sequencing to make informed decisions, but prior research is limited. In this study, we examined genome sequencing knowledge before and after informed consent among 311 participants enrolled in the ClinSeq™ sequencing study. An exploratory factor analysis of knowledge items yielded two factors (sequencing limitations knowledge; sequencing benefits knowledge). In multivariable analysis, high pre-consent sequencing limitations knowledge scores were significantly related to education [odds ratio (OR): 8.7, 95% confidence interval (CI): 2.45-31.10 for post-graduate education, and OR: 3.9; 95% CI: 1.05, 14.61 for college degree compared with less than college degree] and race/ethnicity (OR: 2.4, 95% CI: 1.09, 5.38 for non-Hispanic Whites compared with other racial/ethnic groups). Mean values increased significantly between pre- and post-consent for the sequencing limitations knowledge subscale (6.9-7.7, p < 0.0001) and sequencing benefits knowledge subscale (7.0-7.5, p < 0.0001); increase in knowledge did not differ by sociodemographic characteristics. This study highlights gaps in genome sequencing knowledge and underscores the need to target educational efforts toward participants with less education or from minority racial/ethnic groups. The informed consent process improved genome sequencing knowledge. Future studies could examine how genome sequencing knowledge influences informed decision making. © 2012 John Wiley & Sons A/S.
Next-Generation Sequencing of the Chrysanthemum nankingense (Asteraceae) Transcriptome Permits Large-Scale Unigene Assembly and SSR Marker Discovery

PubMed Central

Wang, Haibin; Jiang, Jiafu; Chen, Sumei; Qi, Xiangyu; Peng, Hui; Li, Pirui; Song, Aiping; Guan, Zhiyong; Fang, Weimin; Liao, Yuan; Chen, Fadi

2013-01-01

Background Simple sequence repeats (SSRs) are ubiquitous in eukaryotic genomes. Chrysanthemum is one of the largest genera in the Asteraceae family. Only a few Chrysanthemum expressed sequence tag (EST) sequences have been acquired to date, so the number of available EST-SSR markers is very low. Methodology/Principal Findings Illumina paired-end sequencing technology produced over 53 million sequencing reads from C. nankingense mRNA. The subsequent de novo assembly yielded 70,895 unigenes, of which 45,789 (64.59%) unigenes showed similarity to the sequences in the NCBI database. Out of 45,789 sequences, 107 have hits to the Chrysanthemum Nr protein database; 679 and 277 sequences have hits to the database of Helianthus and Lactuca species, respectively. MISA software identified a large number of putative EST-SSRs, allowing 1,788 primer pairs to be designed from the de novo transcriptome sequence and a further 363 from archival EST sequence. Among 100 primer pairs randomly chosen, 81 markers have amplicons and 20 are polymorphic for genotype analysis in Chrysanthemum. The results showed that most (but not all) of the assays were transferable across species and that they exposed a significant amount of allelic diversity. Conclusions/Significance SSR markers acquired by transcriptome sequencing are potentially useful for marker-assisted breeding and genetic analysis in the genus Chrysanthemum and its related genera. PMID:23626799
Learning of goal-relevant and -irrelevant complex visual sequences in human V1.

PubMed

Rosenthal, Clive R; Mallik, Indira; Caballero-Gaudes, Cesar; Sereno, Martin I; Soto, David

2018-06-12

Learning and memory are supported by a network involving the medial temporal lobe and linked neocortical regions. Emerging evidence indicates that primary visual cortex (i.e., V1) may contribute to recognition memory, but this has been tested only with a single visuospatial sequence as the target memorandum. The present study used functional magnetic resonance imaging to investigate whether human V1 can support the learning of multiple, concurrent complex visual sequences involving discontinuous (second-order) associations. Two peripheral, goal-irrelevant but structured sequences of orientated gratings appeared simultaneously in fixed locations of the right and left visual fields alongside a central, goal-relevant sequence that was in the focus of spatial attention. Pseudorandom sequences were introduced at multiple intervals during the presentation of the three structured visual sequences to provide an online measure of sequence-specific knowledge at each retinotopic location. We found that a network involving the precuneus and V1 was involved in learning the structured sequence presented at central fixation, whereas right V1 was modulated by repeated exposure to the concurrent structured sequence presented in the left visual field. The same result was not found in left V1. These results indicate for the first time that human V1 can support the learning of multiple concurrent sequences involving complex discontinuous inter-item associations, even peripheral sequences that are goal-irrelevant. Copyright © 2018. Published by Elsevier Inc.
Biological sequence compression algorithms.

PubMed

Matsumoto, T; Sadakane, K; Imai, H

2000-01-01

Today, more and more DNA sequences are becoming available. Information about DNA sequences is stored in molecular biology databases. The size and importance of these databases will continue to grow, so this information must be stored or communicated efficiently. Furthermore, sequence compression can be used to define similarities between biological sequences. Standard compression algorithms such as gzip or compress cannot compress DNA sequences and only expand them in size. On the other hand, CTW (Context Tree Weighting Method) can compress DNA sequences to less than two bits per symbol. These algorithms do not use special structures of biological sequences. Two characteristic structures of DNA sequences are known: palindromes, or reverse complements, and approximate repeats. Several specific algorithms for DNA sequences that use these structures can compress them to less than two bits per symbol. In this paper, we improve CTW so that it can exploit the characteristic structures of DNA sequences. Before encoding the next symbol, the algorithm searches for an approximate repeat and a palindrome using hashing and dynamic programming. If there is a palindrome or an approximate repeat of sufficient length, our algorithm represents it by its length and distance. With this preprocessing, the new program achieves a slightly higher compression ratio than existing DNA-oriented compression algorithms. We also describe a new compression algorithm for protein sequences.
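The "two bits per symbol" figure in this abstract is the trivial baseline for a four-letter alphabet (log2 4 = 2), and the "palindromes" it mentions are reverse complements. The Python sketch below illustrates only those two ideas; it is not the CTW-based algorithm of the paper, and the function names are hypothetical.

```python
# Illustrative sketch only: the 2-bit baseline and reverse complements.
# This is NOT the CTW-based compressor described in the abstract.

COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}
CODES = {"A": 0, "C": 1, "G": 2, "T": 3}


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an uppercase A/C/G/T string."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def is_reverse_palindrome(seq: str) -> bool:
    """True if the sequence equals its own reverse complement."""
    return seq == reverse_complement(seq)


def pack_2bit(seq: str) -> bytes:
    """Pack a sequence at exactly two bits per base (baseline encoding)."""
    value = 0
    for base in seq:
        value = (value << 2) | CODES[base]
    n_bytes = (2 * len(seq) + 7) // 8  # round up to whole bytes
    return value.to_bytes(n_bytes, "big")


if __name__ == "__main__":
    print(reverse_complement("AACG"))
    print(is_reverse_palindrome("GAATTC"))
    print(len(pack_2bit("ACGT" * 4)), "bytes for 16 bases")
```

A compressor only beats this baseline if it finds structure, such as the repeats and reverse complements the abstract describes, that lets it spend fewer than two bits on some symbols.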
A novel, privacy-preserving cryptographic approach for sharing sequencing data

PubMed Central

Cassa, Christopher A; Miller, Rachel A; Mandl, Kenneth D

2013-01-01

Objective DNA samples are often processed and sequenced in facilities external to the point of collection. These samples are routinely labeled with patient identifiers or pseudonyms, allowing for potential linkage to identity and private clinical information if intercepted during transmission. We present a cryptographic scheme to securely transmit externally generated sequence data which does not require any patient identifiers, public key infrastructure, or the transmission of passwords. Materials and methods This novel encryption scheme cryptographically protects participant sequence data using a shared secret key that is derived from a unique subset of an individual’s genetic sequence. This scheme requires access to a subset of an individual’s genetic sequence to acquire full access to the transmitted sequence data, which helps to prevent sample mismatch. Results We validate that the proposed encryption scheme is robust to sequencing errors, population uniqueness, and sibling disambiguation, and provides sufficient cryptographic key space. Discussion Access to a set of an individual’s genotypes and a mutually agreed cryptographic seed is needed to unlock the full sequence, which provides additional sample authentication and authorization security. We present modest fixed and marginal costs to implement this transmission architecture. Conclusions It is possible for genomics researchers who sequence participant samples externally to protect the transmission of sequence data using unique features of an individual’s genetic sequence. PMID:23125421
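The core idea in this abstract, deriving a shared secret from a subset of an individual's genotypes plus a mutually agreed seed, can be sketched with a standard key-derivation function. The Python sketch below is an assumption-laden illustration, not the authors' scheme: the marker identifiers, the canonical encoding, the use of PBKDF2, and the iteration count are all hypothetical choices, and unlike the published scheme it has no tolerance for sequencing errors.

```python
# Illustrative sketch only: derive a symmetric key from genotype calls and a
# shared seed. Marker IDs, encoding, and PBKDF2 parameters are assumptions,
# not details taken from the paper.
import hashlib


def derive_key(genotypes: dict, seed: bytes, iterations: int = 100_000) -> bytes:
    """Derive a 32-byte key from {marker_id: genotype_call} and a shared seed."""
    # Sort by marker so both parties build the same byte string regardless
    # of the order in which genotypes were read.
    canonical = "|".join(f"{marker}={call}" for marker, call in sorted(genotypes.items()))
    return hashlib.pbkdf2_hmac("sha256", canonical.encode("utf-8"), seed, iterations)


if __name__ == "__main__":
    calls = {"marker_001": "AG", "marker_002": "CC", "marker_003": "TT"}  # hypothetical
    key = derive_key(calls, seed=b"agreed-seed")
    print(key.hex())
```

Because the key depends on the genotypes themselves, a recipient holding the wrong sample's genotypes derives a different key, which is the sample-mismatch protection the abstract refers to.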
Zseq: An Approach for Preprocessing Next-Generation Sequencing Data.

PubMed

Alkhateeb, Abedalrhman; Rueda, Luis

2017-08-01

Next-generation sequencing technology generates a huge number of reads (short sequences), which contain a vast amount of genomic data. The sequencing process, however, comes with artifacts. Preprocessing of sequences is mandatory for further downstream analysis. We present Zseq, a linear method that identifies the most informative genomic sequences and reduces the number of biased sequences, sequence duplications, and ambiguous nucleotides. Zseq finds the complexity of the sequences by counting the number of unique k-mers in each sequence as its corresponding score and also takes into account other factors such as ambiguous nucleotides or high GC-content percentage in k-mers. Based on a z-score threshold, Zseq sweeps through the sequences again and filters those with a z-score less than the user-defined threshold. The Zseq algorithm provides a better mapping rate and reduces the number of ambiguous bases significantly in comparison with other methods. Evaluation of the filtered reads has been conducted by aligning the reads and assembling the transcripts using the reference genome as well as de novo assembly. The assembled transcripts show a better discriminative ability to separate cancer and normal samples in comparison with another state-of-the-art method. Moreover, de novo assembled transcripts from the reads filtered by Zseq have longer genomic sequences than other tested methods. Estimation of the cutoff threshold using labeling rules is also introduced, with optimistic results.
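The scoring and filtering steps described above can be sketched in a few lines of Python. This is a simplified illustration, not the Zseq implementation: it scores each read by its number of unique k-mers, skips k-mers containing the ambiguous base N, and ignores the GC-content factor mentioned in the abstract. The function names and default parameters are hypothetical.

```python
# Illustrative sketch only: unique k-mer scoring with z-score filtering.
# Simplified relative to Zseq (no GC-content handling); names are assumptions.
from statistics import mean, pstdev


def unique_kmer_count(read: str, k: int) -> int:
    """Count distinct k-mers in a read, skipping k-mers that contain 'N'."""
    kmers = (read[i:i + k] for i in range(len(read) - k + 1))
    return len({kmer for kmer in kmers if "N" not in kmer})


def zscore_filter(reads, k: int = 3, threshold: float = -1.0):
    """Keep reads whose complexity z-score is at least the threshold."""
    scores = [unique_kmer_count(read, k) for read in reads]
    mu = mean(scores)
    sigma = pstdev(scores)
    if sigma == 0:  # all reads equally complex: nothing to filter
        return list(reads)
    return [read for read, s in zip(reads, scores) if (s - mu) / sigma >= threshold]


if __name__ == "__main__":
    reads = ["ACGTACGGTCA", "AAAAAAAAAAA", "ACGGTCATGCA", "TGCATCGATGA"]
    print(zscore_filter(reads))  # the low-complexity poly-A read is dropped
```

Scoring takes one pass over the reads and filtering takes a second, which matches the "sweeps through the sequences again" description in the abstract.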
Optimal choice of word length when comparing two Markov sequences using a χ²-statistic.

PubMed

Bai, Xin; Tang, Kujin; Ren, Jie; Waterman, Michael; Sun, Fengzhu

2017-10-03

Alignment-free sequence comparison using counts of word patterns (grams, k-tuples) has become an active research topic due to the large amount of sequence data from the new sequencing technologies. Genome sequences are frequently modelled by Markov chains and the likelihood ratio test or the corresponding approximate χ²-statistic has been suggested to compare two sequences. However, it is not known how to best choose the word length k in such studies. We develop an optimal strategy to choose k by maximizing the statistical power of detecting differences between two sequences. Let the orders of the Markov chains for the two sequences be r1 and r2, respectively. We show through both simulations and theoretical studies that the optimal k = max(r1, r2) + 1 for both long sequences and next generation sequencing (NGS) read data. The orders of the Markov chains may be unknown and several methods have been developed to estimate the orders of Markov chains based on both long sequences and NGS reads. We study the power loss of the statistics when the estimated orders are used. It is shown that the power loss is minimal for some of the estimators of the orders of Markov chains. Our studies provide guidelines on choosing the optimal word length for the comparison of Markov sequences.
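The word-length rule stated in this abstract is easy to apply once the Markov orders are known or estimated. The Python sketch below shows the rule together with the k-word counting that alignment-free statistics start from. It does not compute the χ²-statistic itself, and the function names are hypothetical.

```python
# Illustrative sketch only: the k = max(r1, r2) + 1 rule from the abstract,
# plus plain k-word counting. The chi-square statistic is not implemented.
from collections import Counter


def optimal_word_length(r1: int, r2: int) -> int:
    """Word length suggested by the abstract for Markov orders r1 and r2."""
    return max(r1, r2) + 1


def word_counts(seq: str, k: int) -> Counter:
    """Count all overlapping words of length k in a sequence."""
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


if __name__ == "__main__":
    k = optimal_word_length(1, 2)  # e.g. a first-order vs a second-order chain
    print(k, word_counts("ACGACGTACG", k).most_common(2))
```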
Phylo-mLogo: an interactive and hierarchical multiple-logo visualization tool for alignment of many sequences

PubMed Central

Shih, Arthur Chun-Chieh; Lee, DT; Peng, Chin-Lin; Wu, Yu-Wei

2007-01-01

Background When aligning several hundreds or thousands of sequences, such as epidemic virus sequences or homologous/orthologous sequences of some big gene families, to reconstruct the epidemiological history or their phylogenies, how to analyze and visualize the alignment results of many sequences has become a new challenge for computational biologists. Although there are several tools available for visualization of very long sequence alignments, few of them are applicable to the alignments of many sequences. Results A multiple-logo alignment visualization tool, called Phylo-mLogo, is presented in this paper. Phylo-mLogo calculates the variabilities and homogeneities of alignment sequences by base frequencies or entropies. Different from the traditional representations of sequence logos, Phylo-mLogo not only displays the global logo patterns of the whole alignment of multiple sequences, but also demonstrates their local homologous logos for each clade hierarchically. In addition, Phylo-mLogo also allows the user to focus only on the analysis of some important, structurally or functionally constrained sites in the alignment selected by the user or by built-in automatic calculation. Conclusion With Phylo-mLogo, the user can symbolically and hierarchically visualize hundreds of aligned sequences simultaneously and easily check the changes of their amino acid sites when analyzing many homologous/orthologous or influenza virus sequences. More information of Phylo-mLogo can be found at URL . PMID:17319966
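The abstract states that variability is calculated from base frequencies or entropies. A per-column Shannon entropy, the quantity underlying conventional sequence logos, can be sketched in Python as follows. This is an illustration under that assumption, not Phylo-mLogo's own code, and the function name is hypothetical.

```python
# Illustrative sketch only: Shannon entropy per alignment column.
# Not taken from Phylo-mLogo; the exact measure used there may differ.
import math
from collections import Counter


def column_entropies(alignment):
    """Return the Shannon entropy (in bits) of each column of an alignment."""
    length = len(alignment[0])
    if any(len(row) != length for row in alignment):
        raise ValueError("all aligned sequences must have the same length")
    entropies = []
    for column in zip(*alignment):
        total = len(column)
        counts = Counter(column)
        # p * log2(1/p) summed over the symbols observed in this column
        entropies.append(sum((c / total) * math.log2(total / c) for c in counts.values()))
    return entropies


if __name__ == "__main__":
    aln = ["ACGT", "ACGA", "ACCA", "ATCT"]
    for position, h in enumerate(column_entropies(aln), start=1):
        print(position, round(h, 3))
```

Applying the same function to the rows of a single clade rather than the whole alignment gives the kind of per-clade, local view the abstract describes.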
Method and Apparatus for Evaluating the Visual Quality of Processed Digital Video Sequences

NASA Technical Reports Server (NTRS)

Watson, Andrew B. (Inventor)

2002-01-01

A Digital Video Quality (DVQ) apparatus and method that incorporate a model of human visual sensitivity to predict the visibility of artifacts. The DVQ method and apparatus are used for the evaluation of the visual quality of processed digital video sequences and for adaptively controlling the bit rate of the processed digital video sequences without compromising the visual quality. The DVQ apparatus minimizes the required amount of memory and computation. The input to the DVQ apparatus is a pair of color image sequences: an original (R) non-compressed sequence, and a processed (T) sequence. Both sequences (R) and (T) are sampled, cropped, and subjected to color transformations. The sequences are then subjected to blocking and discrete cosine transformation, and the results are transformed to local contrast. The next step is a time filtering operation which implements the human sensitivity to different time frequencies. The results are converted to threshold units by dividing each discrete cosine transform coefficient by its respective visual threshold. At the next stage the two sequences are subtracted to produce an error sequence. The error sequence is subjected to a contrast masking operation, which also depends upon the reference sequence (R). The masked errors can be pooled in various ways to illustrate the perceptual error over various dimensions, and the pooled error can be converted to a visual quality measure.
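Three of the later stages described above (conversion to threshold units, subtraction of the two sequences, and pooling of the errors) can be sketched in Python. This is a heavily simplified illustration, not the patented DVQ method: it omits sampling, cropping, color transformation, the discrete cosine transform itself, temporal filtering, and contrast masking. The Minkowski pooling rule and its exponent `beta` are assumptions chosen for the example.

```python
# Illustrative sketch only: threshold units, error sequence, and pooling.
# Minkowski pooling and the beta value are assumptions; most DVQ stages
# (DCT, temporal filtering, contrast masking) are omitted.


def to_threshold_units(coefficients, thresholds):
    """Divide each coefficient by its visual threshold."""
    return [c / t for c, t in zip(coefficients, thresholds)]


def pooled_error(reference, processed, thresholds, beta: float = 4.0) -> float:
    """Pool the threshold-unit differences between two coefficient lists."""
    ref = to_threshold_units(reference, thresholds)
    proc = to_threshold_units(processed, thresholds)
    errors = [p - r for p, r in zip(proc, ref)]  # error sequence (T minus R)
    return sum(abs(e) ** beta for e in errors) ** (1.0 / beta)


if __name__ == "__main__":
    reference = [10.0, 4.0, 2.0]   # hypothetical coefficients for sequence R
    processed = [9.0, 5.0, 2.0]    # hypothetical coefficients for sequence T
    thresholds = [2.0, 1.0, 0.5]   # hypothetical visual thresholds
    print(pooled_error(reference, processed, thresholds))
```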
Facies analysis and sequence stratigraphic framework of upper Campanian strata (Neslen and Mount Garfield formations, Bluecastle Tongue of the Castlegate sandstone, and Mancos shale), Eastern Book cliffs, Colorado and Utah

USGS Publications Warehouse

Kirschbaum, Mark A.; Hettinger, Robert D.

2004-01-01

Facies and sequence-stratigraphic analysis identifies six high-resolution sequences within upper Campanian strata across about 120 miles of the Book Cliffs in western Colorado and eastern Utah. The six sequences are named after prominent sandstone units and include, in ascending order, upper Sego sequence, Neslen sequence, Corcoran sequence, Buck Canyon/lower Cozzette sequence, upper Cozzette sequence, and Cozzette/Rollins sequence. A seventh sequence, the Bluecastle sequence, is present in the extreme western part of the study area. Facies analysis documents deepening- and shallowing-upward successions, parasequence stacking patterns, downlap in subsurface cross sections, facies dislocations, basinward shifts in facies, and truncation of strata. All six sequences display major incision into shoreface deposits of the Sego Sandstone and sandstones of the Corcoran and Cozzette Members of the Mount Garfield Formation. The incised surfaces represent sequence-boundary unconformities that allowed bypass of sediment to lowstand shorelines that are either attached to the older highstand shorelines or are detached from the older highstand shorelines and located southeast of the main study area. The sequence boundary unconformities represent valley incisions that were cut during successive lowstands of relative sea level. The overlying valley-fill deposits generally consist of tidally influenced strata deposited during an overall base level rise. Transgressive surfaces can be traced or projected over, or locally into, estuarine deposits above and landward of their associated shoreface deposits. Maximum flooding surfaces can be traced or projected landward from offshore strata into, or above, coastal-plain deposits. With the exception of the Cozzette/Rollins sequence, the majority of coal-bearing coastal-plain strata was deposited before maximum flooding and is therefore within the transgressive systems tracts.
Maximum flooding was followed by strong progradation of parasequences and low preservation potential of coastal-plain strata within the highstand systems tract. The large incised valleys, lack of transgressive retrogradational parasequences, strong progradational nature of highstand parasequences, and low preservation of coastal-plain strata in the highstand systems tracts argue for relatively low accommodation space during deposition of the Sego, Corcoran, and Cozzette sequences. The Buck Canyon/Cozzette and Cozzette/Rollins sequences contrast with other sequences in that the preservation of retrogradational parasequences and the development of large estuaries coincident with maximum flooding indicate a relative increase in accommodation space during deposition of these strata. Following maximum flooding, the Buck Canyon/Cozzette sequence follows the pattern of the other sequences, but the Cozzette/Rollins sequence exhibits a contrasting offlapping pattern with development of offshore clinoforms that downlap and eventually parallel its maximum flooding surface. This highstand systems tract preserves a thick coal-bearing section where the Rollins Sandstone Member of the Mount Garfield Formation parasequences prograde out of the study area, stepping up as much as 800 ft stratigraphically over a distance of about 90 miles. This progradational stacking pattern indicates a higher accommodation space and increased sedimentation rate compared to the previous sequences.
Exploiting long read sequencing technologies to establish high quality highly contiguous pig reference genome assemblies

USDA-ARS's Scientific Manuscript database

The current pig reference genome sequence (Sscrofa10.2) was established using Sanger sequencing and following the clone-by-clone hierarchical shotgun sequencing approach used in the public human genome project. However, as sequence coverage was low (4-6x) the resulting assembly was only of draft qua...
