Sample records for database oriented variant

  1. Benchmarking distributed data warehouse solutions for storing genomic variant information

    PubMed Central

    Wiewiórka, Marek S.; Wysakowicz, Dawid P.; Okoniewski, Michał J.

    2017-01-01

    Genomic-based personalized medicine encompasses storing, analysing and interpreting genomic variants as its central issues. At a time when thousands of patients' sequenced exomes and genomes are becoming available, there is a growing need for efficient database storage and querying. The answer could be the application of modern distributed storage systems and query engines. However, the application of large genomic variant databases to this problem has not been sufficiently explored so far in the literature. To investigate the effectiveness of modern columnar storage [column-oriented Database Management System (DBMS)] and query engines, we have developed a prototypic genomic variant data warehouse, populated with large generated content of genomic variants and phenotypic data. Next, we have benchmarked the performance of a number of combinations of distributed storages and query engines on a set of SQL queries that address biological questions essential for both research and medical applications. In addition, a non-distributed, analytical database (MonetDB) has been used as a baseline. Comparison of query execution times confirms that distributed data warehousing solutions outperform classic relational DBMSs. Moreover, pre-aggregation and further denormalization of data, which reduce the number of distributed join operations, improve query performance by several orders of magnitude. Most distributed back-ends offer good performance for complex analytical queries, while the Optimized Row Columnar (ORC) format paired with Presto and Parquet with Spark 2 query engines provide, on average, the lowest execution times. Apache Kudu, on the other hand, is the only solution that guarantees sub-second performance for simple genome range queries returning a small subset of data, where low-latency response is expected, while still offering decent performance for running analytical queries. In summary, research and clinical applications that require the storage and analysis of variants from thousands of samples can benefit from the scalability and performance of distributed data warehouse solutions. Database URL: https://github.com/ZSI-Bio/variantsdwh PMID:29220442
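
    The two query patterns this abstract contrasts, a low-latency genome range lookup and an analytical query served from pre-aggregated data, can be sketched as follows. This is a minimal illustration only: it uses Python's built-in sqlite3 module (a single-node row store, not one of the benchmarked engines), and the table names, columns, and rows are hypothetical rather than taken from the variantsdwh schema.

```python
import sqlite3


def build_demo_db():
    """Create an in-memory database with a hypothetical variant-call table."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE variant_calls ("
        "sample_id TEXT, chrom TEXT, pos INTEGER, ref TEXT, alt TEXT)"
    )
    rows = [
        ("S1", "1", 1000, "A", "G"),
        ("S2", "1", 1000, "A", "G"),
        ("S1", "1", 5000, "C", "T"),
        ("S3", "2", 1500, "G", "A"),
    ]
    conn.executemany("INSERT INTO variant_calls VALUES (?, ?, ?, ?, ?)", rows)
    # Pre-aggregation: store one row per distinct variant with its carrier
    # count, so frequency lookups need no GROUP BY or join at query time.
    conn.execute(
        "CREATE TABLE variant_counts AS "
        "SELECT chrom, pos, ref, alt, COUNT(DISTINCT sample_id) AS n_samples "
        "FROM variant_calls GROUP BY chrom, pos, ref, alt"
    )
    conn.execute("CREATE INDEX idx_counts_range ON variant_counts (chrom, pos)")
    return conn


def range_query(conn, chrom, start, end):
    """Return pre-aggregated variants with start <= pos <= end on one chromosome."""
    cur = conn.execute(
        "SELECT chrom, pos, ref, alt, n_samples FROM variant_counts "
        "WHERE chrom = ? AND pos BETWEEN ? AND ? ORDER BY pos",
        (chrom, start, end),
    )
    return cur.fetchall()


conn = build_demo_db()
print(range_query(conn, "1", 900, 2000))
```

    The design point is that the range query touches only the small, indexed summary table; whether that gives sub-second latency at scale depends on the storage engine, which is what the benchmark in the abstract measures.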

  2. Public variant databases: liability?

    PubMed

    Thorogood, Adrian; Cook-Deegan, Robert; Knoppers, Bartha Maria

    2017-07-01

    Public variant databases support the curation, clinical interpretation, and sharing of genomic data, thus reducing harmful errors or delays in diagnosis. As variant databases are increasingly relied on in the clinical context, there is concern that negligent variant interpretation will harm patients and attract liability. This article explores the evolving legal duties of laboratories, public variant databases, and physicians in clinical genomics and recommends a governance framework for databases to promote responsible data sharing. Genet Med advance online publication 15 December 2016.

  3. Public variant databases: liability?

    PubMed Central

    Thorogood, Adrian; Cook-Deegan, Robert; Knoppers, Bartha Maria

    2017-01-01

    Public variant databases support the curation, clinical interpretation, and sharing of genomic data, thus reducing harmful errors or delays in diagnosis. As variant databases are increasingly relied on in the clinical context, there is concern that negligent variant interpretation will harm patients and attract liability. This article explores the evolving legal duties of laboratories, public variant databases, and physicians in clinical genomics and recommends a governance framework for databases to promote responsible data sharing. Genet Med advance online publication 15 December 2016. PMID:27977006

  4. Evaluating the quality of Marfan genotype-phenotype correlations in existing FBN1 databases.

    PubMed

    Groth, Kristian A; Von Kodolitsch, Yskert; Kutsche, Kerstin; Gaustadnes, Mette; Thorsen, Kasper; Andersen, Niels H; Gravholt, Claus H

    2017-07-01

    Genetic FBN1 testing is pivotal for confirming the clinical diagnosis of Marfan syndrome. In an effort to evaluate variant causality, FBN1 databases are often used. We evaluated the current databases regarding FBN1 variants and validated associated phenotype records with a new Marfan syndrome geno-phenotyping tool called the Marfan score. We evaluated four databases (UMD-FBN1, ClinVar, the Human Gene Mutation Database (HGMD), and Uniprot) containing 2,250 FBN1 variants supported by 4,904 records presented in 307 references. The Marfan score calculated for phenotype data from the records quantified variant associations with Marfan syndrome phenotype. We calculated a Marfan score for 1,283 variants, of which we confirmed the database diagnosis of Marfan syndrome in 77.1%. This represented only 35.8% of the total registered variants; 18.5-33.3% (UMD-FBN1 versus HGMD) of variants associated with Marfan syndrome in the databases could not be confirmed by the recorded phenotype. FBN1 databases can be imprecise and incomplete. Data should be used with caution when evaluating FBN1 variants. At present, the UMD-FBN1 database seems to be the biggest and best curated; therefore, it is the most comprehensive database. However, the need for better genotype-phenotype curated databases is evident, and we hereby present such a database. Genet Med advance online publication 01 December 2016.

  5. Comparison of locus-specific databases for BRCA1 and BRCA2 variants reveals disparity in variant classification within and among databases.

    PubMed

    Vail, Paris J; Morris, Brian; van Kan, Aric; Burdett, Brianna C; Moyes, Kelsey; Theisen, Aaron; Kerr, Iain D; Wenstrup, Richard J; Eggington, Julie M

    2015-10-01

    Genetic variants of uncertain clinical significance (VUSs) are a common outcome of clinical genetic testing. Locus-specific variant databases (LSDBs) have been established for numerous disease-associated genes as a research tool to facilitate the interpretation of genetic sequence variants via aggregated data. If LSDBs are to be used for clinical practice, consistent and transparent criteria regarding the deposition and interpretation of variants are vital, as variant classifications are often used to make important and irreversible clinical decisions. In this study, we performed a retrospective analysis of 2,017 consecutive BRCA1 and BRCA2 genetic variants identified from 24,650 consecutive patient samples referred to our laboratory to establish an unbiased dataset representative of the types of variants seen in the US patient population, submitted by clinicians and researchers for BRCA1 and BRCA2 testing. We compared the clinical classifications of these variants among five publicly accessible BRCA1 and BRCA2 variant databases: BIC, ClinVar, HGMD (paid version), LOVD, and the UMD databases. Our results show substantial disparity of variant classifications among publicly accessible databases. Furthermore, it appears that discrepant classifications are not the result of a single outlier but widespread disagreement among databases. This study also shows that databases sometimes favor a clinical classification when current best practice guidelines (ACMG/AMP/CAP) would suggest an uncertain classification. Although LSDBs have been well established for research applications, our results suggest several challenges preclude their wider use in clinical practice.

  6. Mutation databases for inherited renal disease: are they complete, accurate, clinically relevant, and freely available?

    PubMed

    Savige, Judy; Dagher, Hayat; Povey, Sue

    2014-07-01

    This study examined whether gene-specific DNA variant databases for inherited diseases of the kidney fulfilled the Human Variome Project recommendations of being complete, accurate, clinically relevant and freely available. A recent review identified 60 inherited renal diseases caused by mutations in 132 genes. The disease name, MIM number, gene name, together with "mutation" or "database," were used to identify web-based databases. Fifty-nine diseases (98%) due to mutations in 128 genes had a variant database. Altogether there were 349 databases (a median of 3 per gene, range 0-6), but no gene had two databases with the same number of variants, and 165 (50%) databases included fewer than 10 variants. About half the databases (180, 54%) had been updated in the previous year. Few (77, 23%) were curated by "experts" but these included nine of the 11 with the most variants. Even fewer databases (41, 12%) included clinical features apart from the name of the associated disease. Most (223, 67%) could be accessed without charge, including those for 50 genes (40%) with the maximum number of variants. Future efforts should focus on encouraging experts to collaborate on a single database for each gene affected in inherited renal disease, including both unpublished variants and clinical phenotypes. © 2014 WILEY PERIODICALS, INC.

  7. Difficulties in diagnosing Marfan syndrome using current FBN1 databases.

    PubMed

    Groth, Kristian A; Gaustadnes, Mette; Thorsen, Kasper; Østergaard, John R; Jensen, Uffe Birk; Gravholt, Claus H; Andersen, Niels H

    2016-01-01

    The diagnostic criteria of Marfan syndrome (MFS) highlight the importance of an FBN1 mutation test in diagnosing MFS. As genetic sequencing becomes better, cheaper, and more accessible, the expected increase in the number of genetic tests will become evident, resulting in numerous genetic variants that need to be evaluated for disease-causing effects based on database information. The aim of this study was to evaluate genetic variants in four databases and review the relevant literature. We assessed background data on 23 common variants registered in ESP6500 and classified as causing MFS in the Human Gene Mutation Database (HGMD). We evaluated data in four variant databases (HGMD, UMD-FBN1, ClinVar, and UniProt) according to the diagnostic criteria for MFS and compared the results with the classification of each variant in the four databases. None of the 23 variants was clearly associated with MFS, even though all classifications in the databases stated otherwise. A genetic diagnosis of MFS cannot reliably be based on current variant databases because they contain incorrectly interpreted conclusions on variants. Variants must be evaluated by time-consuming review of the background material in the databases and by combining these data with expert knowledge on MFS. This is a major problem because we expect even more genetic test results in the near future as a result of the reduced cost and process time for next-generation sequencing. Genet Med 18 1, 98-102.

  8. A Bioinformatics Workflow for Variant Peptide Detection in Shotgun Proteomics

    PubMed Central

    Li, Jing; Su, Zengliu; Ma, Ze-Qiang; Slebos, Robbert J. C.; Halvey, Patrick; Tabb, David L.; Liebler, Daniel C.; Pao, William; Zhang, Bing

    2011-01-01

    Shotgun proteomics data analysis usually relies on database search. However, commonly used protein sequence databases do not contain information on protein variants and thus prevent variant peptides and proteins from being identified. Including known coding variations into protein sequence databases could help alleviate this problem. Based on our recently published human Cancer Proteome Variation Database, we have created a protein sequence database that comprehensively annotates thousands of cancer-related coding variants collected in the Cancer Proteome Variation Database as well as noncancer-specific ones from the Single Nucleotide Polymorphism Database (dbSNP). Using this database, we then developed a data analysis workflow for variant peptide identification in shotgun proteomics. The high risk of false positive variant identifications was addressed by a modified false discovery rate estimation method. Analysis of colorectal cancer cell lines SW480, RKO, and HCT-116 revealed a total of 81 peptides that contain either noncancer-specific or cancer-related variations. Twenty-three out of 26 variants randomly selected from the 81 were confirmed by genomic sequencing. We further applied the workflow to data sets from three individual colorectal tumor specimens. A total of 204 distinct variant peptides were detected, and five carried known cancer-related mutations. Each individual showed a specific pattern of cancer-related mutations, suggesting potential use of this type of information for personalized medicine. Compatibility of the workflow has been tested with four popular database search engines including Sequest, Mascot, X!Tandem, and MyriMatch. In summary, we have developed a workflow that effectively uses existing genomic data to enable variant peptide detection in proteomics. PMID:21389108

  9. DNA variant databases improve test accuracy and phenotype prediction in Alport syndrome.

    PubMed

    Savige, Judy; Ars, Elisabet; Cotton, Richard G H; Crockett, David; Dagher, Hayat; Deltas, Constantinos; Ding, Jie; Flinter, Frances; Pont-Kingdon, Genevieve; Smaoui, Nizar; Torra, Roser; Storey, Helen

    2014-06-01

    X-linked Alport syndrome is a form of progressive renal failure caused by pathogenic variants in the COL4A5 gene. More than 700 variants have been described and a further 400 are estimated to be known to individual laboratories but are unpublished. The major genetic testing laboratories for X-linked Alport syndrome worldwide have established a Web-based database for published and unpublished COL4A5 variants (https://grenada.lumc.nl/LOVD2/COL4A/home.php?select_db=COL4A5). This conforms with the recommendations of the Human Variome Project: it uses the Leiden Open Variation Database (LOVD) format, describes variants according to the human reference sequence with standardized nomenclature, indicates likely pathogenicity and associated clinical features, and credits the submitting laboratory. The database includes non-pathogenic and recurrent variants, and is linked to another COL4A5 mutation database and relevant bioinformatics sites. Access is free. Increasing the number of COL4A5 variants in the public domain helps patients, diagnostic laboratories, clinicians, and researchers. The database improves the accuracy and efficiency of genetic testing because its variants are already categorized for pathogenicity. The description of further COL4A5 variants and clinical associations will improve our ability to predict phenotype and our understanding of collagen IV biochemistry. The database for X-linked Alport syndrome represents a model for databases in other inherited renal diseases.

  10. The Clinical Next-Generation Sequencing Database: A Tool for the Unified Management of Clinical Information and Genetic Variants to Accelerate Variant Pathogenicity Classification.

    PubMed

    Nishio, Shin-Ya; Usami, Shin-Ichi

    2017-03-01

    Recent advances in next-generation sequencing (NGS) have given rise to new challenges due to the difficulties in variant pathogenicity interpretation and large dataset management, including many kinds of public population databases as well as public or commercial disease-specific databases. Here, we report a new database development tool, named the "Clinical NGS Database," for improving clinical NGS workflow through the unified management of variant information and clinical information. This database software offers a two-feature approach to variant pathogenicity classification. The first of these approaches is a phenotype similarity-based approach. This database allows the easy comparison of the detailed phenotype of each patient with the average phenotype of the same gene mutation at the variant or gene level. It is also possible to browse patients with the same gene mutation quickly. The other approach is a statistical approach to variant pathogenicity classification based on the use of the odds ratio for comparisons between the case and the control for each inheritance mode (families with apparently autosomal dominant inheritance vs. control, and families with apparently autosomal recessive inheritance vs. control). A number of case studies are also presented to illustrate the utility of this database. © 2016 The Authors. Human Mutation published by Wiley Periodicals, Inc.
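
    The statistical approach mentioned in this abstract rests on a case-versus-control odds ratio. The abstract does not give the exact computation, so the following is only a sketch of the standard 2x2 odds ratio; the function name, the example counts, and the 0.5 continuity correction for empty cells are assumptions, not details of the Clinical NGS Database.

```python
def odds_ratio(case_carriers, case_total, control_carriers, control_total):
    """Odds ratio of carrying a variant in cases versus controls (2x2 table)."""
    a = case_carriers                      # cases with the variant
    b = case_total - case_carriers         # cases without the variant
    c = control_carriers                   # controls with the variant
    d = control_total - control_carriers   # controls without the variant
    # Assumption: apply a 0.5 continuity correction to every cell when any
    # cell is zero, so the ratio stays finite for variants absent in controls.
    if 0 in (a, b, c, d):
        a, b, c, d = (x + 0.5 for x in (a, b, c, d))
    return (a * d) / (b * c)


# Hypothetical counts: 10 of 100 case families vs. 2 of 200 controls.
print(odds_ratio(10, 100, 2, 200))
```

    In the setting the abstract describes, such a ratio would be computed separately per inheritance mode, each against the same control group.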

  11. Identification of Alternative Splice Variants Using Unique Tryptic Peptide Sequences for Database Searches.

    PubMed

    Tran, Trung T; Bollineni, Ravi C; Strozynski, Margarita; Koehler, Christian J; Thiede, Bernd

    2017-07-07

    Alternative splicing is a mechanism in eukaryotes by which different forms of mRNAs are generated from the same gene. Identification of alternative splice variants requires the identification of peptides specific for alternative splice forms. For this purpose, we generated a human database that contains only unique tryptic peptides specific for alternative splice forms from Swiss-Prot entries. Using this database allows easy access to splice variant-specific peptide sequences that match MS data. Furthermore, we combined this database without alternative splice variant-1-specific peptides with human Swiss-Prot. This combined database can be used as a general database for searching LC-MS data. LC-MS data derived from in-solution digests of two different cell lines (LNCaP, HeLa) and phosphoproteomics studies were analyzed using these two databases. Several nonalternative splice variant-1-specific peptides were found in both cell lines, and some of them seemed to be cell-line-specific. Control and apoptotic phosphoproteomes from Jurkat T cells revealed several nonalternative splice variant-1-specific peptides, and some of them showed clear quantitative differences between the two states.
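
    The idea of keeping only tryptic peptides that are unique to one splice form can be sketched as below. This is an illustration under stated assumptions, not the authors' pipeline: it uses the common trypsin rule (cleave after K or R unless the next residue is P), ignores missed cleavages, checks uniqueness only among the isoforms passed in, and the sequences are invented toy strings.

```python
import re


def tryptic_peptides(sequence, min_len=6):
    """In silico trypsin digest: cleave after K or R, but not before P."""
    pieces = re.split(r"(?<=[KR])(?!P)", sequence)
    # Drop short fragments (and the empty piece after a C-terminal K/R).
    return {p for p in pieces if len(p) >= min_len}


def isoform_specific_peptides(isoforms, min_len=6):
    """Map each isoform name to the peptides found in no other isoform."""
    digests = {name: tryptic_peptides(seq, min_len) for name, seq in isoforms.items()}
    specific = {}
    for name, peptides in digests.items():
        others = set().union(*(p for n, p in digests.items() if n != name))
        specific[name] = peptides - others
    return specific


# Hypothetical isoforms that differ only in the middle segment.
toy_isoforms = {
    "iso1": "MAAAAAKCCCCCCRDDDDDDK",
    "iso2": "MAAAAAKEEEEEERDDDDDDK",
}
print(isoform_specific_peptides(toy_isoforms))
```

    A database built this way contains only the distinguishing peptides, so any match in a search points to a specific splice form.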

  12. The UCL low-density lipoprotein receptor gene variant database: pathogenicity update

    PubMed Central

    Futema, Marta; Whittall, Ros; Taylor-Beadling, Alison; Williams, Maggie; den Dunnen, Johan T; Humphries, Steve E

    2017-01-01

    Background: Familial hypercholesterolaemia (OMIM 143890) is most frequently caused by variations in the low-density lipoprotein receptor (LDLR) gene. Predicting whether novel variants are pathogenic may not be straightforward, especially for missense and synonymous variants. In 2013, the Association of Clinical Genetic Scientists published guidelines for the classification of variants, with categories 1 and 2 representing clearly not or unlikely pathogenic, respectively, 3 representing variants of unknown significance (VUS), and 4 and 5 representing likely to be or clearly pathogenic, respectively. Here, we update the University College London (UCL) LDLR variant database according to these guidelines. Methods: PubMed searches and alerts were used to identify novel LDLR variants for inclusion in the database. Standard in silico tools were used to predict potential pathogenicity. Variants were designated as class 4/5 only when the predictions from the different programs were concordant and as class 3 when predictions were discordant. Results: The updated database (http://www.lovd.nl/LDLR) now includes 2925 curated variants, representing 1707 independent events. All 129 nonsense variants, 337 small frame-shifting and 117/118 large rearrangements were classified as 4 or 5. Of the 795 missense variants, 115 were in classes 1 and 2, 605 in class 4 and 75 in class 3. 111/181 intronic variants, 4/34 synonymous variants and 14/37 promoter variants were assigned to classes 4 or 5. Overall, 112 (7%) of reported variants were class 3. Conclusions: This study updates the LDLR variant database and identifies a number of reported VUS where additional family and in vitro studies will be required to confirm or refute their pathogenicity. PMID:27821657
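
    The concordance rule in the Methods (class 4/5 only when all in silico predictions agree, class 3 when they disagree) can be expressed as a short function. This is a sketch: the label strings and function name are hypothetical, and mapping unanimous benign calls to classes 1/2 is an assumption the abstract does not state. The last lines check that the reported missense counts add up to the stated total.

```python
def class_from_predictions(predictions):
    """Assign a pathogenicity class group from a list of in silico tool calls.

    Each call is expected to be the string 'pathogenic' or 'benign'.
    """
    calls = set(predictions)
    if calls == {"pathogenic"}:
        return "4/5"  # all tools concordant for pathogenicity
    if calls == {"benign"}:
        return "1/2"  # assumption: unanimous benign calls map to classes 1/2
    return "3"        # discordant (or missing) predictions: unknown significance


print(class_from_predictions(["pathogenic", "pathogenic", "pathogenic"]))
print(class_from_predictions(["pathogenic", "benign", "pathogenic"]))

# Consistency check on the missense counts reported in the Results.
missense_by_class = {"1/2": 115, "4": 605, "3": 75}
print(sum(missense_by_class.values()) == 795)
```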

  13. A comprehensive global genotype-phenotype database for rare diseases.

    PubMed

    Trujillano, Daniel; Oprea, Gabriela-Elena; Schmitz, Yvonne; Bertoli-Avella, Aida M; Abou Jamra, Rami; Rolfs, Arndt

    2017-01-01

    The ability to discover genetic variants in a patient runs far ahead of the ability to interpret them. Databases with accurate descriptions of the causal relationship between the variants and the phenotype are valuable since these are critical tools in clinical genetic diagnostics. Here, we introduce a comprehensive and global genotype-phenotype database focusing on rare diseases. This database (CentoMD®) is a browser-based tool that enables access to a comprehensive, independently curated system utilizing stringent high-quality criteria and a quickly growing repository of genetic and human phenotype ontology (HPO)-based clinical information. Its main goals are to aid the evaluation of genetic variants, to enhance the validity of the genetic analytical workflow, to increase the quality of genetic diagnoses, and to improve evaluation of treatment options for patients with hereditary diseases. The database software correlates clinical information from consented patients and probands of different geographical backgrounds with a large dataset of genetic variants and, when available, biomarker information. An automated follow-up tool is incorporated that informs all users whenever a variant classification has changed. These unique features fully embedded in a CLIA/CAP-accredited quality management system allow appropriate data quality and enhanced patient safety. More than 100,000 genetically screened individuals are documented in the database, resulting in more than 470 million variant detections. Approximately 57% of the clinically relevant and uncertain variants in the database are novel. Notably, 3% of the genetic variants identified and previously reported in the literature as being associated with a particular rare disease were reclassified, based on internal evidence, as clinically irrelevant. The database offers a comprehensive summary of the clinical validity and causality of detected gene variants with their associated phenotypes, and is a valuable tool for identifying new disease genes through the correlation of novel genetic variants with specific, well-defined phenotypes.

  14. Clinical Variant Classification: A Comparison of Public Databases and a Commercial Testing Laboratory.

    PubMed

    Gradishar, William; Johnson, KariAnne; Brown, Krystal; Mundt, Erin; Manley, Susan

    2017-07-01

    There is a growing move to consult public databases following receipt of a genetic test result from a clinical laboratory; however, the well-documented limitations of these databases call into question how often clinicians will encounter discordant variant classifications that may introduce uncertainty into patient management. Here, we evaluate discordance in BRCA1 and BRCA2 variant classifications between a single commercial testing laboratory and a public database commonly consulted in clinical practice. BRCA1 and BRCA2 variant classifications were obtained from ClinVar and compared with the classifications from a reference laboratory. Full concordance and discordance were determined for variants whose ClinVar entries were of the same pathogenicity (pathogenic, benign, or uncertain). Variants with conflicting ClinVar classifications were considered partially concordant if ≥1 of the listed classifications agreed with the reference laboratory classification. Four thousand two hundred and fifty unique BRCA1 and BRCA2 variants were available for analysis. Overall, 73.2% of classifications were fully concordant and 12.3% were partially concordant. The remaining 14.5% of variants had discordant classifications, most of which had a definitive classification (pathogenic or benign) from the reference laboratory compared with an uncertain classification in ClinVar (14.0%). Here, we show that discrepant classifications between a public database and single reference laboratory potentially account for 26.7% of variants in BRCA1 and BRCA2. The time and expertise required of clinicians to research these discordant classifications call into question the practicality of checking all test results against a database and suggest that discordant classifications should be interpreted with these limitations in mind. With the increasing use of clinical genetic testing for hereditary cancer risk, accurate variant classification is vital to ensuring appropriate medical management.
There is a growing move to consult public databases following receipt of a genetic test result from a clinical laboratory; however, we show that up to 26.7% of variants in BRCA1 and BRCA2 have discordant classifications between ClinVar and a reference laboratory. The findings presented in this paper serve as a note of caution regarding the utility of database consultation. © AlphaMed Press 2017.
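
    The concordance categories defined in this abstract can be written as a small decision function. This is a sketch of the stated definitions only; the function name and label strings are hypothetical, and treating conflicting ClinVar entries that never match the laboratory call as discordant is an assumption, since the abstract does not cover that case.

```python
def concordance(clinvar_classes, lab_class):
    """Compare the ClinVar classifications of one variant with a laboratory call.

    clinvar_classes: list of 'pathogenic', 'benign', or 'uncertain' entries.
    lab_class: the reference laboratory's classification for the same variant.
    """
    distinct = set(clinvar_classes)
    if distinct == {lab_class}:
        return "full"        # all ClinVar entries agree with the laboratory
    if len(distinct) > 1 and lab_class in distinct:
        return "partial"     # conflicting entries, at least one agrees
    # Assumption: everything else, including conflicting entries with no
    # match, counts as discordant.
    return "discordant"


print(concordance(["pathogenic", "pathogenic"], "pathogenic"))
print(concordance(["pathogenic", "uncertain"], "pathogenic"))
print(concordance(["uncertain"], "benign"))
```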

  15. HbVar: A relational database of human hemoglobin variants and thalassemia mutations at the globin gene server.

    PubMed

    Hardison, Ross C; Chui, David H K; Giardine, Belinda; Riemer, Cathy; Patrinos, George P; Anagnou, Nicholas; Miller, Webb; Wajcman, Henri

    2002-03-01

    We have constructed a relational database of hemoglobin variants and thalassemia mutations, called HbVar, which can be accessed on the web at http://globin.cse.psu.edu. Extensive information is recorded for each variant and mutation, including a description of the variant and associated pathology, hematology, electrophoretic mobility, methods of isolation, stability information, ethnic occurrence, structure studies, functional studies, and references. The initial information was derived from books by Dr. Titus Huisman and colleagues [Huisman et al., 1996, 1997, 1998]. The current database is updated regularly with the addition of new data and corrections to previous data. Queries can be formulated based on fields in the database. Tables of common categories of variants, such as all those involving the alpha1-globin gene (HBA1) or all those that result in high oxygen affinity, are maintained by automated queries on the database. Users can formulate more precise queries, such as identifying "all beta-globin variants associated with instability and found in Scottish populations." This new database should be useful for clinical diagnosis as well as in fundamental studies of hemoglobin biochemistry, globin gene regulation, and human sequence variation at these loci. Copyright 2002 Wiley-Liss, Inc.

  16. Description and analysis of genetic variants in French hereditary breast and ovarian cancer families recorded in the UMD-BRCA1/BRCA2 databases.

    PubMed

    Caputo, Sandrine; Benboudjema, Louisa; Sinilnikova, Olga; Rouleau, Etienne; Béroud, Christophe; Lidereau, Rosette

    2012-01-01

    BRCA1 and BRCA2 are the two main genes responsible for predisposition to breast and ovarian cancers, as a result of protein-inactivating monoallelic mutations. It remains to be established whether many of the variants identified in these two genes, so-called unclassified/unknown variants (UVs), contribute to the disease phenotype or are simply neutral variants (or polymorphisms). Given the clinical importance of establishing their status, a nationwide effort to annotate these UVs was launched by laboratories belonging to the French GGC consortium (Groupe Génétique et Cancer), leading to the creation of the UMD-BRCA1/BRCA2 databases (http://www.umd.be/BRCA1/ and http://www.umd.be/BRCA2/). These databases have been endorsed by the French National Cancer Institute (INCa) and are designed to collect all variants detected in France, whether causal, neutral or UV. They differ from other BRCA databases in that they contain co-occurrence data for all variants. Using these data, the GGC French consortium has been able to classify certain UVs also contained in other databases. In this article, we report some novel UVs not contained in the BIC database and explore their impact in cancer predisposition based on a structural approach.

  17. CFTR-France, a national relational patient database for sharing genetic and phenotypic data associated with rare CFTR variants.

    PubMed

    Claustres, Mireille; Thèze, Corinne; des Georges, Marie; Baux, David; Girodon, Emmanuelle; Bienvenu, Thierry; Audrezet, Marie-Pierre; Dugueperoux, Ingrid; Férec, Claude; Lalau, Guy; Pagin, Adrien; Kitzis, Alain; Thoreau, Vincent; Gaston, Véronique; Bieth, Eric; Malinge, Marie-Claire; Reboul, Marie-Pierre; Fergelot, Patricia; Lemonnier, Lydie; Mekki, Chadia; Fanen, Pascale; Bergougnoux, Anne; Sasorith, Souphatta; Raynal, Caroline; Bareil, Corinne

    2017-10-01

    Most of the 2,000 variants identified in the CFTR (cystic fibrosis transmembrane regulator) gene are rare or private. Their interpretation is hampered by the lack of available data and resources, making patient care and genetic counseling challenging. We developed a patient-based database dedicated to the annotations of rare CFTR variants in the context of their cis- and trans-allelic combinations. Based on almost 30 years of experience of CFTR testing, CFTR-France (https://cftr.iurc.montp.inserm.fr/cftr) currently compiles 16,819 variant records from 4,615 individuals with cystic fibrosis (CF) or CFTR-RD (related disorders), fetuses with ultrasound bowel anomalies, newborns awaiting clinical diagnosis, and asymptomatic compound heterozygotes. For each of the 736 different variants reported in the database, patient characteristics and genetic information (other variations in cis or in trans) have been thoroughly checked by a dedicated curator. Combining updated clinical, epidemiological, in silico, or in vitro functional data helps with the interpretation of unclassified variants and the reassessment of misclassified ones. This comprehensive CFTR database is now an invaluable tool for diagnostic laboratories gathering information on rare variants, especially in the context of genetic counseling, prenatal and preimplantation genetic diagnosis. CFTR-France is thus highly complementary to the international database CFTR2 focused so far on the most common CF-causing alleles. © 2017 Wiley Periodicals, Inc.

  18. Comparison and optimization of in silico algorithms for predicting the pathogenicity of sodium channel variants in epilepsy.

    PubMed

    Holland, Katherine D; Bouley, Thomas M; Horn, Paul S

    2017-07-01

    Variants in neuronal voltage-gated sodium channel α-subunit genes SCN1A, SCN2A, and SCN8A are common in early onset epileptic encephalopathies and other autosomal dominant childhood epilepsy syndromes. However, in clinical practice, missense variants are often classified as variants of uncertain significance when they are identified but heritability cannot be determined. Genetic testing reports often include results of computational tests to estimate pathogenicity and the frequency of that variant in population-based databases. The objective of this work was to enhance clinicians' understanding of results by (1) determining how effectively computational algorithms predict epileptogenicity of sodium channel (SCN) missense variants; (2) optimizing their predictive capabilities; and (3) determining if epilepsy-associated SCN variants are present in population-based databases. This will help clinicians better understand indeterminate SCN test results in people with epilepsy. Pathogenic, likely pathogenic, and benign variants in SCNs were identified using databases of sodium channel variants. Benign variants were also identified from population-based databases. Eight algorithms commonly used to predict pathogenicity were compared. In addition, logistic regression was used to determine if a combination of algorithms could better predict pathogenicity. Based on American College of Medical Genetics criteria, 440 variants were classified as pathogenic or likely pathogenic and 84 were classified as benign or likely benign. Twenty-eight variants previously associated with epilepsy were present in population-based gene databases. The output provided by most computational algorithms had a high sensitivity but low specificity with an accuracy of 0.52-0.77. Accuracy could be improved by adjusting the threshold for pathogenicity. Using this adjustment, the Mendelian Clinically Applicable Pathogenicity (M-CAP) algorithm had an accuracy of 0.90 and a combination of algorithms increased the accuracy to 0.92. Potentially pathogenic variants are present in population-based sources. Most computational algorithms overestimate pathogenicity; however, a weighted combination of several algorithms increased classification accuracy to >0.90. Wiley Periodicals, Inc. © 2017 International League Against Epilepsy.
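
    The threshold adjustment described in this abstract can be illustrated with a minimal sketch. The scores, labels, and the default cutoff of 0.2 below are invented for illustration and are not taken from the study; the function names are hypothetical. The sketch computes sensitivity, specificity, and accuracy at a given cutoff, then picks the cutoff that maximizes accuracy on the toy data.

```python
def confusion_metrics(scores, labels, threshold):
    """Return (sensitivity, specificity, accuracy) when a score >= threshold
    is called pathogenic. labels are True for known pathogenic variants."""
    tp = fp = tn = fn = 0
    for score, is_pathogenic in zip(scores, labels):
        predicted = score >= threshold
        if predicted and is_pathogenic:
            tp += 1
        elif predicted and not is_pathogenic:
            fp += 1
        elif not predicted and is_pathogenic:
            fn += 1
        else:
            tn += 1
    sensitivity = tp / (tp + fn) if (tp + fn) else 0.0
    specificity = tn / (tn + fp) if (tn + fp) else 0.0
    accuracy = (tp + tn) / len(labels)
    return sensitivity, specificity, accuracy


def best_threshold(scores, labels):
    """Try every observed score as a cutoff and return the one with the
    highest accuracy (the lowest such cutoff if several tie)."""
    candidates = sorted(set(scores))
    return max(candidates, key=lambda t: confusion_metrics(scores, labels, t)[2])


# Hypothetical toy data: eight variants with a predictor score and a known class.
scores = [0.10, 0.30, 0.35, 0.40, 0.60, 0.70, 0.80, 0.90]
labels = [False, False, False, True, False, True, True, True]

default_metrics = confusion_metrics(scores, labels, 0.2)   # assumed default cutoff
tuned = best_threshold(scores, labels)
tuned_metrics = confusion_metrics(scores, labels, tuned)
print(default_metrics, tuned, tuned_metrics)
```

    On this toy data the low default cutoff gives high sensitivity but low specificity, the same pattern the abstract reports, and raising the cutoff improves accuracy. A real evaluation would tune the cutoff on one set of variants and report accuracy on a separate set.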

  19. Korean Variant Archive (KOVA): a reference database of genetic variations in the Korean population.

    PubMed

    Lee, Sangmoon; Seo, Jihae; Park, Jinman; Nam, Jae-Yong; Choi, Ahyoung; Ignatius, Jason S; Bjornson, Robert D; Chae, Jong-Hee; Jang, In-Jin; Lee, Sanghyuk; Park, Woong-Yang; Baek, Daehyun; Choi, Murim

    2017-06-27

    Despite efforts to interrogate human genome variation through large-scale databases, systematic preference toward populations of Caucasian descent has resulted in unintended reduction of power in studying non-Caucasians. Here we report a compilation of coding variants from 1,055 healthy Korean individuals (KOVA; Korean Variant Archive). The samples were sequenced to a mean depth of 75x, yielding 101 singleton variants per individual. Population genetics analysis demonstrates that the Korean population is a distinct ethnic group comparable to other discrete ethnic groups in Africa and Europe, providing a rationale for such independent genomic datasets. Indeed, KOVA conferred 22.8% increased variant filtering power in addition to Exome Aggregation Consortium (ExAC) when used on Korean exomes. Functional assessment of nonsynonymous variants supported the presence of purifying selection in Koreans. Analysis of copy number variants detected 5.2 deletions and 10.3 amplifications per individual with an increased fraction of novel variants among smaller and rarer copy number variable segments. We also report a list of germline variants that are associated with increased tumor susceptibility. This catalog can function as a critical addition to the pre-existing variant databases in pursuing genetic studies of Korean individuals.

  20. The Finnish disease heritage database (FinDis) update-a database for the genes mutated in the Finnish disease heritage brought to the next-generation sequencing era.

    PubMed

    Polvi, Anne; Linturi, Henna; Varilo, Teppo; Anttonen, Anna-Kaisa; Byrne, Myles; Fokkema, Ivo F A C; Almusa, Henrikki; Metzidis, Anthony; Avela, Kristiina; Aula, Pertti; Kestilä, Marjo; Muilu, Juha

    2013-11-01

    The Finnish Disease Heritage Database (FinDis) (http://findis.org) was originally published in 2004 as a centralized information resource for rare monogenic diseases enriched in the Finnish population. The FinDis database originally contained 405 causative variants for 30 diseases. At the time, the FinDis database was a comprehensive collection of data, but since 1994, a large amount of new information has emerged, making the necessity to update the database evident. We collected information and updated the database to contain genes and causative variants for 35 diseases, including six more genes and more than 1,400 additional disease-causing variants. Information for causative variants for each gene is collected under the LOVD 3.0 platform, enabling easy updating. The FinDis portal provides a centralized resource and user interface to link information on each disease and gene with variant data in the LOVD 3.0 platform. The software written to achieve this has been open-sourced and made available on GitHub (http://github.com/findis-db), allowing biomedical institutions in other countries to present their national data in a similar way, and to both contribute to, and benefit from, standardized variation data. The updated FinDis portal provides a unique resource to assist patient diagnosis, research, and the development of new cures. © 2013 WILEY PERIODICALS, INC.

  1. Monogenic diabetes syndromes: Locus‐specific databases for Alström, Wolfram, and Thiamine‐responsive megaloblastic anemia

    PubMed Central

    Astuti, Dewi; Sabir, Ataf; Fulton, Piers; Zatyka, Malgorzata; Williams, Denise; Hardy, Carol; Milan, Gabriella; Favaretto, Francesca; Yu‐Wai‐Man, Patrick; Rohayem, Julia; López de Heredia, Miguel; Hershey, Tamara; Tranebjaerg, Lisbeth; Chen, Jian‐Hua; Chaussenot, Annabel; Nunes, Virginia; Marshall, Bess; McAfferty, Susan; Tillmann, Vallo; Maffei, Pietro; Paquis‐Flucklinger, Veronique; Geberhiwot, Tarekign; Mlynarski, Wojciech; Parkinson, Kay; Picard, Virginie; Bueno, Gema Esteban; Dias, Renuka; Arnold, Amy; Richens, Caitlin; Paisey, Richard; Urano, Fumihiko; Semple, Robert; Sinnott, Richard

    2017-01-01

    We developed a variant database for diabetes syndrome genes, using the Leiden Open Variation Database platform, containing observed phenotypes matched to the genetic variations. We populated it with 628 published disease‐associated variants (December 2016) for: WFS1 (n = 309), CISD2 (n = 3), ALMS1 (n = 268), and SLC19A2 (n = 48) for Wolfram type 1, Wolfram type 2, Alström, and Thiamine‐responsive megaloblastic anemia syndromes, respectively; and included 23 previously unpublished novel germline variants in WFS1 and 17 variants in ALMS1. We then investigated genotype–phenotype relations for the WFS1 gene. The presence of biallelic loss‐of‐function variants predicted Wolfram syndrome defined by insulin‐dependent diabetes and optic atrophy, with a sensitivity of 79% (95% CI 75%–83%) and specificity of 92% (83%–97%). The presence of minor loss‐of‐function variants in WFS1 predicted isolated diabetes, isolated deafness, or isolated congenital cataracts without development of the full syndrome (sensitivity 100% [93%–100%]; specificity 78% [73%–82%]). The ability to provide a prognostic prediction based on genotype will lead to improvements in patient care and counseling. The development of the database as a repository for monogenic diabetes gene variants will allow prognostic predictions for other diabetes syndromes as next‐generation sequencing expands the repertoire of genotypes and phenotypes. The database is publicly available online at https://lovd.euro-wabb.org. PMID:28432734

  2. The role of attachment styles in regulating the effects of dopamine on the behavior of salespersons

    PubMed Central

    Verbeke, Willem; Bagozzi, Richard P.; van den Berg, Wouter E.

    2014-01-01

    Two classic strategic orientations have been found to pervade the behavior of modern salespersons: a sales orientation (SO) where salespersons use deception or guile to get customers to buy even if they do not need a product, and a customer orientation (CO) where salespersons first attempt to discover the customer's needs and adjust their product and selling approach to meet those needs. Study 1 replicates recent research and finds that the Taq A1 variant of the DRD2 gene is not related to either sales or CO, whereas the 7-repeat variant of the DRD4 gene is related to CO but not SO. Study 2 investigates gene × phenotype explanations of orientation of salespersons, drawing upon recent research in molecular genetics and biological/psychological attachment theory. The findings show that attachment style regulates the effects of DRD2 on CO, such that greater avoidant attachment styles lead to higher CO for persons with the A2/A2 variant but neither the A1/A2 nor A1/A1 variants. Likewise, attachment style regulates the effects of DRD4 on CO, such that greater avoidant attachment styles lead to higher CO for persons with the 7-repeat variant but not other variants. No effects were found on a SO, and secure and anxious attachment styles did not function as moderators. PMID:24550811

  3. BRCA Share: A Collection of Clinical BRCA Gene Variants.

    PubMed

    Béroud, Christophe; Letovsky, Stanley I; Braastad, Corey D; Caputo, Sandrine M; Beaudoux, Olivia; Bignon, Yves Jean; Bressac-De Paillerets, Brigitte; Bronner, Myriam; Buell, Crystal M; Collod-Béroud, Gwenaëlle; Coulet, Florence; Derive, Nicolas; Divincenzo, Christina; Elzinga, Christopher D; Garrec, Céline; Houdayer, Claude; Karbassi, Izabela; Lizard, Sarab; Love, Angela; Muller, Danièle; Nagan, Narasimhan; Nery, Camille R; Rai, Ghadi; Revillion, Françoise; Salgado, David; Sévenet, Nicolas; Sinilnikova, Olga; Sobol, Hagay; Stoppa-Lyonnet, Dominique; Toulas, Christine; Trautman, Edwin; Vaur, Dominique; Vilquin, Paul; Weymouth, Katelyn S; Willis, Alecia; Eisenberg, Marcia; Strom, Charles M

    2016-12-01

    As next-generation sequencing increases access to human genetic variation, the challenge of determining clinical significance of variants becomes ever more acute. Germline variants in the BRCA1 and BRCA2 genes can confer substantial lifetime risk of breast and ovarian cancer. Assessment of variant pathogenicity is a vital part of clinical genetic testing for these genes. A database of clinical observations of BRCA variants is a critical resource in that process. This article describes BRCA Share™, a database created by a unique international alliance of academic centers and commercial testing laboratories. By integrating the content of the Universal Mutation Database generated by the French Unicancer Genetic Group with the testing results of two large commercial laboratories, Quest Diagnostics and Laboratory Corporation of America (LabCorp), BRCA Share™ has assembled one of the largest publicly accessible collections of BRCA variants currently available. Although access is available to academic researchers without charge, commercial participants in the project are required to pay a support fee and contribute their data. The fees fund the ongoing curation effort, as well as planned experiments to functionally characterize variants of uncertain significance. BRCA Share™ databases can therefore be considered as models of successful data sharing between private companies and the academic world. © 2016 WILEY PERIODICALS, INC.

  4. dbWGFP: a database and web server of human whole-genome single nucleotide variants and their functional predictions.

    PubMed

    Wu, Jiaxin; Wu, Mengmeng; Li, Lianshuo; Liu, Zhuo; Zeng, Wanwen; Jiang, Rui

    2016-01-01

    Recent advances in next-generation sequencing technology have enabled the fast and low-cost detection of genetic variants across the entire human genome, making whole-genome sequencing a growing trend in the study of disease-causing genetic variants. Nevertheless, a repository that collects predictions of functionally damaging effects of human genetic variants is still lacking, though it has been well recognized that such predictions play a central role in the analysis of whole-genome sequencing data. To fill this gap, we developed a database named dbWGFP (a database and web server of human whole-genome single nucleotide variants and their functional predictions) that contains functional predictions and annotations of nearly 8.58 billion possible human whole-genome single nucleotide variants. Specifically, this database integrates 48 functional predictions calculated by 17 popular computational methods and 44 valuable annotations obtained from various data sources. Standalone software, user-friendly query services and free downloads of this database are available at http://bioinfo.au.tsinghua.edu.cn/dbwgfp. dbWGFP provides a valuable resource for the analysis of whole-genome sequencing, exome sequencing and SNP array data, thereby complementing existing data sources and computational resources in deciphering genetic bases of human inherited diseases. © The Author(s) 2016. Published by Oxford University Press.

  5. Genetic variants of the DNA repair genes from Exome Aggregation Consortium (EXAC) database: significance in cancer.

    PubMed

    Das, Raima; Ghosh, Sankar Kumar

    2017-04-01

    The DNA repair pathway is a primary defense system that eliminates a wide variety of DNA damage. Any deficiency in it is likely to cause the chromosomal instability that leads to cell malfunctioning and tumorigenesis. Genetic polymorphisms in DNA repair genes have demonstrated a significant association with cancer risk. Our study attempts to give a glimpse of the overall scenario of the germline polymorphisms in the DNA repair genes by taking into account the Exome Aggregation Consortium (ExAC) database as well as the Human Gene Mutation Database (HGMD) for evaluating the disease link, particularly in cancer. It has been found that the ExAC DNA repair dataset (which consists of 228 DNA repair genes) comprises 30.4% missense, 12.5% dbSNP reported and 3.2% ClinVar significant variants. 27% of all the missense variants have the deleterious SIFT score of 0.00 and 6% of variants carry the most damaging Polyphen-2 score of 1.00, thus affecting the protein structure and function. However, as per HGMD, only a fraction (1.2%) of ExAC DNA repair variants was found to be cancer-related, indicating that the remaining variants reported in both databases need to be further analyzed. This, in turn, may provide an increased spectrum of the reported cancer linked variants in the DNA repair genes present in the ExAC database. Moreover, further in silico functional assay of the identified vital cancer-associated variants, which is essential to get their actual biological significance, may shed some light on the field of targeted drug development in the near future. Copyright © 2017. Published by Elsevier B.V.

  6. Application of a 5-tiered scheme for standardized classification of 2,360 unique mismatch repair gene variants in the InSiGHT locus-specific database.

    PubMed

    Thompson, Bryony A; Spurdle, Amanda B; Plazzer, John-Paul; Greenblatt, Marc S; Akagi, Kiwamu; Al-Mulla, Fahd; Bapat, Bharati; Bernstein, Inge; Capellá, Gabriel; den Dunnen, Johan T; du Sart, Desiree; Fabre, Aurelie; Farrell, Michael P; Farrington, Susan M; Frayling, Ian M; Frebourg, Thierry; Goldgar, David E; Heinen, Christopher D; Holinski-Feder, Elke; Kohonen-Corish, Maija; Robinson, Kristina Lagerstedt; Leung, Suet Yi; Martins, Alexandra; Moller, Pal; Morak, Monika; Nystrom, Minna; Peltomaki, Paivi; Pineda, Marta; Qi, Ming; Ramesar, Rajkumar; Rasmussen, Lene Juel; Royer-Pokora, Brigitte; Scott, Rodney J; Sijmons, Rolf; Tavtigian, Sean V; Tops, Carli M; Weber, Thomas; Wijnen, Juul; Woods, Michael O; Macrae, Finlay; Genuardi, Maurizio

    2014-02-01

    The clinical classification of hereditary sequence variants identified in disease-related genes directly affects clinical management of patients and their relatives. The International Society for Gastrointestinal Hereditary Tumours (InSiGHT) undertook a collaborative effort to develop, test and apply a standardized classification scheme to constitutional variants in the Lynch syndrome-associated genes MLH1, MSH2, MSH6 and PMS2. Unpublished data submission was encouraged to assist in variant classification and was recognized through microattribution. The scheme was refined by multidisciplinary expert committee review of the clinical and functional data available for variants, applied to 2,360 sequence alterations, and disseminated online. Assessment using validated criteria altered classifications for 66% of 12,006 database entries. Clinical recommendations based on transparent evaluation are now possible for 1,370 variants that were not obviously protein truncating from nomenclature. This large-scale endeavor will facilitate the consistent management of families suspected to have Lynch syndrome and demonstrates the value of multidisciplinary collaboration in the curation and classification of variants in public locus-specific databases.

  7. Application of a five-tiered scheme for standardized classification of 2,360 unique mismatch repair gene variants lodged on the InSiGHT locus-specific database

    PubMed Central

    Plazzer, John-Paul; Greenblatt, Marc S.; Akagi, Kiwamu; Al-Mulla, Fahd; Bapat, Bharati; Bernstein, Inge; Capellá, Gabriel; den Dunnen, Johan T.; du Sart, Desiree; Fabre, Aurelie; Farrell, Michael P.; Farrington, Susan M.; Frayling, Ian M.; Frebourg, Thierry; Goldgar, David E.; Heinen, Christopher D.; Holinski-Feder, Elke; Kohonen-Corish, Maija; Robinson, Kristina Lagerstedt; Leung, Suet Yi; Martins, Alexandra; Moller, Pal; Morak, Monika; Nystrom, Minna; Peltomaki, Paivi; Pineda, Marta; Qi, Ming; Ramesar, Rajkumar; Rasmussen, Lene Juel; Royer-Pokora, Brigitte; Scott, Rodney J.; Sijmons, Rolf; Tavtigian, Sean V.; Tops, Carli M.; Weber, Thomas; Wijnen, Juul; Woods, Michael O.; Macrae, Finlay; Genuardi, Maurizio

    2015-01-01

    Clinical classification of sequence variants identified in hereditary disease genes directly affects clinical management of patients and their relatives. The International Society for Gastrointestinal Hereditary Tumours (InSiGHT) undertook a collaborative effort to develop, test and apply a standardized classification scheme to constitutional variants in the Lynch Syndrome genes MLH1, MSH2, MSH6 and PMS2. Unpublished data submission was encouraged to assist variant classification, and recognized by microattribution. The scheme was refined by multidisciplinary expert committee review of clinical and functional data available for variants, applied to 2,360 sequence alterations, and disseminated online. Assessment using validated criteria altered classifications for 66% of 12,006 database entries. Clinical recommendations based on transparent evaluation are now possible for 1,370 variants not obviously protein-truncating from nomenclature. This large-scale endeavor will facilitate consistent management of suspected Lynch Syndrome families, and demonstrates the value of multidisciplinary collaboration for curation and classification of variants in public locus-specific databases. PMID:24362816

  8. Standards for Clinical Grade Genomic Databases.

    PubMed

    Yohe, Sophia L; Carter, Alexis B; Pfeifer, John D; Crawford, James M; Cushman-Vokoun, Allison; Caughron, Samuel; Leonard, Debra G B

    2015-11-01

    Next-generation sequencing performed in a clinical environment must meet clinical standards, which requires reproducibility of all aspects of the testing. Clinical-grade genomic databases (CGGDs) are required to classify a variant and to assist in the professional interpretation of clinical next-generation sequencing. Applying quality laboratory standards to the reference databases used for sequence-variant interpretation presents a new challenge for validation and curation. The objective was to define CGGD and the categories of information contained in CGGDs and to frame recommendations for the structure and use of these databases in clinical patient care. Members of the College of American Pathologists Personalized Health Care Committee reviewed the literature and existing state of genomic databases and developed a framework for guiding CGGD development in the future. Clinical-grade genomic databases may provide different types of information. This work group defined 3 layers of information in CGGDs: clinical genomic variant repositories, genomic medical data repositories, and genomic medicine evidence databases. The layers are differentiated by the types of genomic and medical information contained and the utility in assisting with clinical interpretation of genomic variants. Clinical-grade genomic databases must meet specific standards regarding submission, curation, and retrieval of data, as well as the maintenance of privacy and security. These organizing principles for CGGDs should serve as a foundation for future development of specific standards that support the use of such databases for patient care.

  9. LitVar: a semantic search engine for linking genomic variant data in PubMed and PMC.

    PubMed

    Allot, Alexis; Peng, Yifan; Wei, Chih-Hsuan; Lee, Kyubum; Phan, Lon; Lu, Zhiyong

    2018-05-14

    The identification and interpretation of genomic variants play a key role in the diagnosis of genetic diseases and related research. These tasks increasingly rely on accessing relevant manually curated information from domain databases (e.g. SwissProt or ClinVar). However, due to the sheer volume of medical literature and high cost of expert curation, curated variant information in existing databases is often incomplete and out-of-date. In addition, the same genetic variant can be mentioned in publications with various names (e.g. 'A146T' versus 'c.436G>A' versus 'rs121913527'). A search in PubMed using only one name usually cannot retrieve all relevant articles for the variant of interest. Hence, to help scientists, healthcare professionals, and database curators find the most up-to-date published variant research, we have developed LitVar for the search and retrieval of standardized variant information. In addition, LitVar uses advanced text mining techniques to compute and extract relationships between variants and other associated entities such as diseases and chemicals/drugs. LitVar is publicly available at https://www.ncbi.nlm.nih.gov/CBBresearch/Lu/Demo/LitVar.
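
    The synonym problem described in this abstract can be shown with a small sketch. This is not LitVar's method, which relies on text mining; it is only a plain lookup table, assuming the three names quoted in the abstract refer to one variant. The article texts, the SYNONYMS table, and the function names are hypothetical.

```python
import re

# Hypothetical synonym table: canonical identifier -> names seen in the literature.
# The three names are the ones quoted in the abstract.
SYNONYMS = {
    "rs121913527": ["A146T", "c.436G>A", "rs121913527"],
}


def build_pattern(names):
    """Compile one regex matching any of the names as a whole token."""
    # Longest names first, so a longer name is never shadowed by a shorter one.
    escaped = sorted((re.escape(n) for n in names), key=len, reverse=True)
    return re.compile(
        r"(?<![A-Za-z0-9])(?:" + "|".join(escaped) + r")(?![A-Za-z0-9])"
    )


def find_articles(variant_id, articles):
    """Return IDs of articles that mention the variant under any known name."""
    pattern = build_pattern(SYNONYMS[variant_id])
    return [aid for aid, text in articles.items() if pattern.search(text)]


# Invented article snippets for illustration only.
articles = {
    "doc1": "Patients carrying A146T were followed for two years.",
    "doc2": "Sequencing detected c.436G>A in three samples.",
    "doc3": "Genotyping of rs121913527 was performed by PCR.",
    "doc4": "The neighbouring substitution A146V was also observed.",
}

single_name_hits = [aid for aid, text in articles.items() if "A146T" in text]
all_name_hits = find_articles("rs121913527", articles)
print(single_name_hits, all_name_hits)
```

    Searching with one name finds a single snippet, while searching with all synonyms finds three, which mirrors the retrieval gap the abstract describes.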

  10. Database for Parkinson Disease Mutations and Rare Variants

    DTIC Science & Technology

    2016-09-01

    AWARD NUMBER: W81XWH-14-1-0097 TITLE: “ Database for Parkinson Disease Mutations and Rare Variants” PRINCIPAL INVESTIGATOR: JEFFERY M. VANCE...TO THE ABOVE ADDRESS. 1. REPORT DATE September 2016 2. REPORT TYPE FINAL 3. DATES COVERED 1 Jul 2014 – 30 Jun 2016 4. TITLE AND SUBTITLE Database ...For Parkinson Disease (PD) specifically, the variant databases currently available are incomplete, don’t assess impact and/or are not equipped to

  11. GETPrime: a gene- or transcript-specific primer database for quantitative real-time PCR.

    PubMed

    Gubelmann, Carine; Gattiker, Alexandre; Massouras, Andreas; Hens, Korneel; David, Fabrice; Decouttere, Frederik; Rougemont, Jacques; Deplancke, Bart

    2011-01-01

    The vast majority of genes in humans and other organisms undergo alternative splicing, yet the biological function of splice variants is still very poorly understood in large part because of the lack of simple tools that can map the expression profiles and patterns of these variants with high sensitivity. High-throughput quantitative real-time polymerase chain reaction (qPCR) is an ideal technique to accurately quantify nucleic acid sequences including splice variants. However, currently available primer design programs do not distinguish between splice variants and also differ substantially in overall quality, functionality or throughput mode. Here, we present GETPrime, a primer database supported by a novel platform that uniquely combines and automates several features critical for optimal qPCR primer design. These include the consideration of all gene splice variants to enable either gene-specific (covering the majority of splice variants) or transcript-specific (covering one splice variant) expression profiling, primer specificity validation, automated best primer pair selection according to strict criteria and graphical visualization of the latter primer pairs within their genomic context. GETPrime primers have been extensively validated experimentally, demonstrating high transcript specificity in complex samples. Thus, the free-access, user-friendly GETPrime database allows fast primer retrieval and visualization for genes or groups of genes of most common model organisms, and is available at http://updepla1srv1.epfl.ch/getprime/. Database URL: http://deplanckelab.epfl.ch.

  12. GETPrime: a gene- or transcript-specific primer database for quantitative real-time PCR

    PubMed Central

    Gubelmann, Carine; Gattiker, Alexandre; Massouras, Andreas; Hens, Korneel; David, Fabrice; Decouttere, Frederik; Rougemont, Jacques; Deplancke, Bart

    2011-01-01

    The vast majority of genes in humans and other organisms undergo alternative splicing, yet the biological function of splice variants is still very poorly understood in large part because of the lack of simple tools that can map the expression profiles and patterns of these variants with high sensitivity. High-throughput quantitative real-time polymerase chain reaction (qPCR) is an ideal technique to accurately quantify nucleic acid sequences including splice variants. However, currently available primer design programs do not distinguish between splice variants and also differ substantially in overall quality, functionality or throughput mode. Here, we present GETPrime, a primer database supported by a novel platform that uniquely combines and automates several features critical for optimal qPCR primer design. These include the consideration of all gene splice variants to enable either gene-specific (covering the majority of splice variants) or transcript-specific (covering one splice variant) expression profiling, primer specificity validation, automated best primer pair selection according to strict criteria and graphical visualization of the latter primer pairs within their genomic context. GETPrime primers have been extensively validated experimentally, demonstrating high transcript specificity in complex samples. Thus, the free-access, user-friendly GETPrime database allows fast primer retrieval and visualization for genes or groups of genes of most common model organisms, and is available at http://updepla1srv1.epfl.ch/getprime/. Database URL: http://deplanckelab.epfl.ch. PMID:21917859

  13. Cube texture formation during the early stages of recrystallization of Al-1%wt.Mn and AA1050 aluminium alloys

    NASA Astrophysics Data System (ADS)

    Miszczyk, M. M.; Paul, H.

    2015-08-01

    The cube texture formation during primary recrystallization was analysed in plane strain deformed samples of a commercial AA1050 alloy and an Al-1%wt.Mn model alloy single crystal of the Goss{110}<001> orientation. The textures were measured with the use of X-ray diffraction and scanning electron microscopy equipped with an electron backscattered diffraction facility. After recrystallization of the Al-1%wt.Mn single crystal, the texture of the recrystallized grains was dominated by four variants of the S{123}<634> orientation. The cube grains were only sporadically detected by the SEM/EBSD system. Nevertheless, an increased density of <111> poles corresponding to the cube orientation was observed. The latter was connected with the superposition of four variants of the S{123}<634> orientation. This indicates that the cube texture after the recrystallization was a ‘compromise texture’. In the case of the recrystallized AA1050 alloy, the strong cube texture results from both the increased density of the particular <111> poles of the four variants of the S orientation and the ∼40°(∼<111>)-type rotation. The first mechanism transforms the Sdef-oriented areas into Srex ones, whereas the second transforms the near S-oriented, as-deformed areas into near cube-oriented grains.

  14. DaMold: A data-mining platform for variant annotation and visualization in molecular diagnostics research.

    PubMed

    Pandey, Ram Vinay; Pabinger, Stephan; Kriegner, Albert; Weinhäusel, Andreas

    2017-07-01

    Next-generation sequencing (NGS) has become a powerful and efficient tool for routine mutation screening in clinical research. As each NGS test yields hundreds of variants, the current challenge is to meaningfully interpret the data and select potential candidates. Analyzing each variant while manually investigating several relevant databases to collect specific information is a cumbersome and time-consuming process, and it requires expertise and familiarity with these databases. Thus, a tool that can seamlessly annotate variants with clinically relevant databases under one common interface would be of great help for variant annotation, cross-referencing, and visualization. This tool would allow variants to be processed in an automated and high-throughput manner and facilitate the investigation of variants in several genome browsers. Several analysis tools are available for raw sequencing-read processing and variant identification, but an automated variant filtering, annotation, cross-referencing, and visualization tool is still lacking. To fulfill these requirements, we developed DaMold, a Web-based, user-friendly tool that can filter and annotate variants and can access and compile information from 37 resources. It is easy to use, provides flexible input options, and accepts variants from NGS and Sanger sequencing as well as hotspots in VCF and BED formats. DaMold is available as an online application at http://damold.platomics.com/index.html, and as a Docker container and virtual machine at https://sourceforge.net/projects/damold/. © 2017 Wiley Periodicals, Inc.

  15. Spanish personal name variations in national and international biomedical databases: implications for information retrieval and bibliometric studies

    PubMed Central

    Ruiz-Pérez, R.; López-Cózar, E. Delgado; Jiménez-Contreras, E.

    2002-01-01

    Objectives: The study sought to investigate how Spanish names are handled by national and international databases and to identify mistakes that can undermine the usefulness of these databases for locating and retrieving works by Spanish authors. Methods: The authors sampled 172 articles published by authors from the University of Granada Medical School between 1987 and 1996 and analyzed the variations in how each of their names was indexed in Science Citation Index (SCI), MEDLINE, and Índice Médico Español (IME). The number and types of variants that appeared for each author's name were recorded and compared across databases to identify inconsistencies in indexing practices. We analyzed the relationship between variability (number of variants of an author's name) and productivity (number of items the name was associated with as an author), the consequences for retrieval of information, and the most frequent indexing structures used for Spanish names. Results: The proportion of authors who appeared under more than one name was 48.1% in SCI, 50.7% in MEDLINE, and 69.0% in IME. Productivity correlated directly with variability: more than 50% of the authors listed on five to ten items appeared under more than one name in any given database, and close to 100% of the authors listed on more than ten items appeared under two or more variants. Productivity correlated inversely with retrievability: as the number of variants for a name increased, the number of items retrieved under each variant decreased. For the most highly productive authors, the number of items retrieved under each variant tended toward one. The most frequent indexing methods varied between databases. In MEDLINE and IME, names were indexed correctly as “first surname second surname, first name initial middle name initial” (if present) in 41.7% and 49.5% of the records, respectively. However, in SCI, the most frequent method was “first surname, first name initial second name initial” (48.0% of the records) and first surname and second surname run together, first name initial (18.3%). Conclusions: Retrievability on the basis of author's name was poor in all three databases. Each database uses accurate indexing methods, but these methods fail to result in consistency or coherence for specific entries. The likely causes of inconsistency are: (1) use by authors of variants of their names during their publication careers, (2) lack of authority control in all three databases, (3) the use of an inappropriate indexing method for Spanish names in SCI, (4) authors' inconsistent behaviors, and (5) possible editorial interventions by some journals. We offer some suggestions as to how to avert the proliferation of author name variants in the databases. PMID:12398248
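
    The variability measure used in this abstract (number of name variants per author, and the share of authors indexed under more than one name) can be computed as in the following sketch. The records and names are invented for illustration and do not come from the study; the function name is hypothetical.

```python
from collections import defaultdict


def variant_stats(records):
    """records: iterable of (author_id, indexed_name) pairs, one per indexed item.
    Returns (variants per author, share of authors with more than one variant)."""
    names = defaultdict(set)
    for author_id, indexed_name in records:
        names[author_id].add(indexed_name)
    counts = {author: len(variants) for author, variants in names.items()}
    multi = sum(1 for c in counts.values() if c > 1)
    share = multi / len(counts) if counts else 0.0
    return counts, share


# Hypothetical index entries for four authors in one database.
records = [
    ("a1", "Garcia Lopez J"),
    ("a1", "Garcia J"),
    ("a1", "GarciaLopez J"),
    ("a2", "Martin Ruiz A"),
    ("a2", "Martin Ruiz A"),
    ("a3", "Sanchez P"),
    ("a4", "Perez Gomez M"),
    ("a4", "Perez M"),
]

counts, share = variant_stats(records)
print(counts, share)
```

    Here two of the four invented authors appear under more than one name, giving a share of 0.5. Running the same count per database is what allows figures such as those reported for SCI, MEDLINE, and IME to be compared.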

  16. Human Variome Project Quality Assessment Criteria for Variation Databases.

    PubMed

    Vihinen, Mauno; Hancock, John M; Maglott, Donna R; Landrum, Melissa J; Schaafsma, Gerard C P; Taschner, Peter

    2016-06-01

    Numerous databases containing information about DNA, RNA, and protein variations are available. Gene-specific variant databases (locus-specific variation databases, LSDBs) are typically curated and maintained for single genes or groups of genes for a certain disease(s). These databases are widely considered as the most reliable information source for a particular gene/protein/disease, but it should also be made clear they may have widely varying contents, infrastructure, and quality. Quality is very important to evaluate because these databases may affect health decision-making, research, and clinical practice. The Human Variome Project (HVP) established a Working Group for Variant Database Quality Assessment. The basic principle was to develop a simple system that nevertheless provides a good overview of the quality of a database. The HVP quality evaluation criteria that resulted are divided into four main components: data quality, technical quality, accessibility, and timeliness. This report elaborates on the developed quality criteria and how implementation of the quality scheme can be achieved. Examples are provided for the current status of the quality items in two different databases, BTKbase, an LSDB, and ClinVar, a central archive of submissions about variants and their clinical significance. © 2016 WILEY PERIODICALS, INC.

  17. DBATE: database of alternative transcripts expression.

    PubMed

    Bianchi, Valerio; Colantoni, Alessio; Calderone, Alberto; Ausiello, Gabriele; Ferrè, Fabrizio; Helmer-Citterich, Manuela

    2013-01-01

    The use of high-throughput RNA sequencing technology (RNA-seq) allows whole transcriptome analysis, providing an unbiased and unabridged view of alternative transcript expression. Coupling splicing variant-specific expression with its functional inference is still an open and difficult issue for which we created the DataBase of Alternative Transcripts Expression (DBATE), a web-based repository storing expression values and functional annotation of alternative splicing variants. We processed 13 large RNA-seq panels from human healthy tissues and in disease conditions, reporting expression levels and functional annotations gathered and integrated from different sources for each splicing variant, using a variant-specific annotation transfer pipeline. The possibility to perform complex queries by cross-referencing different functional annotations permits the retrieval of desired subsets of splicing variant expression values that can be visualized in several ways, from simple to more informative. DBATE is intended as a novel tool to help appreciate how, and possibly why, the transcriptome expression is shaped. DATABASE URL: http://bioinformatica.uniroma2.it/DBATE/.

  18. The Israeli National Genetic database: a 10-year experience.

    PubMed

    Zlotogora, Joël; Patrinos, George P

    2017-03-16

    The Israeli National and Ethnic Mutation database ( http://server.goldenhelix.org/israeli ) was launched in September 2006 on the ETHNOS software to include clinically relevant genomic variants reported among Jewish and Arab Israeli patients. In 2016, the database was reviewed and corrected according to ClinVar ( https://www.ncbi.nlm.nih.gov/clinvar ) and ExAC ( http://exac.broadinstitute.org ) database entries. The present article summarizes some key aspects from the development and continuous update of the database over a 10-year period, which could serve as a paradigm of successful database curation for other similar resources. In September 2016, there were 2444 entries in the database, 890 among Jews, 1376 among Israeli Arabs, and 178 entries among Palestinian Arabs, corresponding to an ~4× data content increase compared to when originally launched. While the Israeli Arab population is much smaller than the Jewish population, the number of pathogenic variants causing recessive disorders reported in the database is higher among Arabs (934) than among Jews (648). Nevertheless, the number of pathogenic variants classified as founder mutations in the database is smaller among Arabs (175) than among Jews (192). In 2016, the entire database content was compared to that of other databases such as ClinVar and ExAC. A significant difference in the percentage of pathogenic variants from the Israeli genetic database that were present in ExAC was observed between the Jewish population (31.8%) and the Israeli Arab population (20.6%). The Israeli genetic database was launched in 2006 on the ETHNOS software and has been available online ever since. It allows querying the database according to the disorder and the ethnicity; however, many other features are not available, in particular the possibility to search according to the name of the gene. In addition, due to the technical limitations of the previous ETHNOS software, new features and data are not included in the present online version of the database, and an upgrade is currently ongoing.

  19. Probing the Orientation of Surface-Immobilized Protein G B1 Using ToF-SIMS, Sum Frequency Generation, and NEXAFS Spectroscopy

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    L Baugh; T Weidner; J Baio

    2011-12-31

    The ability to orient active proteins on surfaces is a critical aspect of many medical technologies. An important related challenge is characterizing protein orientation in these surface films. This study uses a combination of time-of-flight secondary ion mass spectrometry (ToF-SIMS), sum frequency generation (SFG) vibrational spectroscopy, and near-edge X-ray absorption fine structure (NEXAFS) spectroscopy to characterize the orientation of surface-immobilized Protein G B1, a rigid 6 kDa domain that binds the Fc fragment of IgG. Two Protein G B1 variants with a single cysteine introduced at either end were immobilized via the cysteine thiol onto maleimide-oligo(ethylene glycol)-functionalized gold and bare gold substrates. X-ray photoelectron spectroscopy was used to measure the amount of immobilized protein, and ToF-SIMS was used to measure the amino acid composition of the exposed surface of the protein films and to confirm covalent attachment of protein thiol to the substrate maleimide groups. SFG and NEXAFS were used to characterize the ordering and orientation of peptide or side chain bonds. On both substrates and for both cysteine positions, ToF-SIMS data showed enrichment of mass peaks from amino acids located at the end of the protein opposite to the cysteine surface position as compared with nonspecifically immobilized protein, indicating end-on protein orientations. Orientation on the maleimide substrate was enhanced by increasing pH (7.0-9.5) and salt concentration (0-1.5 M NaCl). SFG spectral peaks characteristic of ordered α-helix and β-sheet elements were observed for both variants but not for cysteine-free wild type protein on the maleimide surface. The phase of the α-helix and β-sheet peaks indicated a predominantly upright orientation for both variants, consistent with an end-on protein binding configuration. Polarization dependence of the NEXAFS signal from the N 1s to π* transition of β-sheet peptide bonds also indicated protein ordering, with an estimated tilt angle of inner β-strands of 40-50° for both variants (one variant more tilted than the other), consistent with SFG results. The combined results demonstrate the power of using complementary techniques to probe protein orientation on surfaces.

  20. Ferroelasticity and domain physics in two-dimensional transition metal dichalcogenide monolayers.

    PubMed

    Li, Wenbin; Li, Ju

    2016-02-24

    Monolayers of transition metal dichalcogenides can exist in several structural polymorphs, including 2H, 1T and 1T'. The low-symmetry 1T' phase has three orientation variants, resulting from the three equivalent directions of Peierls distortion in the parental 1T phase. Using first-principles calculations, we predict that mechanical strain can switch the relative thermodynamic stability between the orientation variants of the 1T' phase. We find that such strain-induced variant switching only requires a few percent elastic strain, which is eminently achievable experimentally with transition metal dichalcogenide monolayers. Calculations indicate that the transformation barrier associated with such variant switching is small (<0.2 eV per chemical formula unit), suggesting that strain-induced variant switching can happen under laboratory conditions. Monolayers of transition metal dichalcogenides with 1T' structure therefore have the potential to be ferroelastic and shape memory materials with interesting domain physics.

  1. Ferroelasticity and domain physics in two-dimensional transition metal dichalcogenide monolayers

    DOE PAGES

    Li, Wenbin; Li, Ju

    2016-02-24

    Monolayers of transition metal dichalcogenides can exist in several structural polymorphs, including 2H, 1T and 1T'. The low-symmetry 1T' phase has three orientation variants, resulting from the three equivalent directions of Peierls distortion in the parental 1T phase. Using first-principles calculations, we predict that mechanical strain can switch the relative thermodynamic stability between the orientation variants of the 1T' phase. We find that such strain-induced variant switching only requires a few percent elastic strain, which is eminently achievable experimentally with transition metal dichalcogenide monolayers. Calculations indicate that the transformation barrier associated with such variant switching is small (<0.2 eV per chemical formula unit), suggesting that strain-induced variant switching can happen under laboratory conditions. Furthermore, monolayers of transition metal dichalcogenides with 1T' structure therefore have the potential to be ferroelastic and shape memory materials with interesting domain physics.

  2. LenVarDB: database of length-variant protein domains.

    PubMed

    Mutt, Eshita; Mathew, Oommen K; Sowdhamini, Ramanathan

    2014-01-01

    Protein domains are functionally and structurally independent modules, which add to the functional variety of proteins. This array of functional diversity has been enabled by evolutionary changes, such as amino acid substitutions or insertions or deletions, occurring in these protein domains. Length variations (indels) can introduce changes at structural, functional and interaction levels. LenVarDB (freely available at http://caps.ncbs.res.in/lenvardb/) traces these length variations, starting from structure-based sequence alignments in our Protein Alignments organized as Structural Superfamilies (PASS2) database, across 731 structural classification of proteins (SCOP)-based protein domain superfamilies connected to 2 730 625 sequence homologues. Alignment of sequence homologues corresponding to a structural domain is available, starting from a structure-based sequence alignment of the superfamily. Orientation of the length-variant (indel) regions in protein domains can be visualized by mapping them on the structure and on the alignment. Knowledge about location of length variations within protein domains and their visual representation will be useful in predicting changes within structurally or functionally relevant sites, which may ultimately regulate protein function. Non-technical summary: Evolutionary changes bring about natural changes to proteins that may be found in many organisms. Such changes could be reflected as amino acid substitutions or insertions-deletions (indels) in protein sequences. LenVarDB is a database that provides an early overview of observed length variations that were set among 731 protein families and after examining >2 million sequences. Indels are followed up to observe if they are close to the active site such that they can affect the activity of proteins. Inclusion of such information can aid the design of bioengineering experiments.
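    The length variations described above show up as runs of gap characters in an aligned sequence. The following is a minimal Python sketch of locating such runs, assuming a plain gapped alignment string with "-" as the gap symbol. The function name gap_runs and the two toy sequences are hypothetical illustrations; this is not LenVarDB's or PASS2's actual pipeline.

```python
from typing import List, Tuple


def gap_runs(aligned: str, gap: str = "-") -> List[Tuple[int, int]]:
    """Return half-open (start, end) column ranges of consecutive gap characters."""
    runs: List[Tuple[int, int]] = []
    start = None  # column where the current gap run began, if any
    for i, ch in enumerate(aligned):
        if ch == gap:
            if start is None:
                start = i
        elif start is not None:
            runs.append((start, i))
            start = None
    if start is not None:  # alignment ends inside a gap run
        runs.append((start, len(aligned)))
    return runs


# Hypothetical toy alignment (both rows have the same number of columns).
reference = "MKT--AYIAKQR"
homologue = "MKTLLAYI--QR"

# Gaps in the reference row mark an insertion in the homologue;
# gaps in the homologue row mark a deletion relative to the reference.
print(gap_runs(reference))  # [(3, 5)]
print(gap_runs(homologue))  # [(8, 10)]
```

    Column ranges found this way could then be mapped onto a structure or an alignment view, which is the kind of visualization the abstract mentions.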

  3. GTRAC: fast retrieval from compressed collections of genomic variants

    PubMed Central

    Tatwawadi, Kedar; Hernaez, Mikel; Ochoa, Idoia; Weissman, Tsachy

    2016-01-01

    Motivation: The dramatic decrease in the cost of sequencing has resulted in the generation of huge amounts of genomic data, as evidenced by projects such as the UK10K and the Million Veteran Project, with the number of sequenced genomes ranging in the order of 10 K to 1 M. Due to the large redundancies among genomic sequences of individuals from the same species, most of the medical research deals with the variants in the sequences as compared with a reference sequence, rather than with the complete genomic sequences. Consequently, millions of genomes represented as variants are stored in databases. These databases are constantly updated and queried to extract information such as the common variants among individuals or groups of individuals. Previous algorithms for compression of this type of databases lack efficient random access capabilities, rendering querying the database for particular variants and/or individuals extremely inefficient, to the point where compression is often relinquished altogether. Results: We present a new algorithm for this task, called GTRAC, that achieves significant compression ratios while allowing fast random access over the compressed database. For example, GTRAC is able to compress a Homo sapiens dataset containing 1092 samples in 1.1 GB (compression ratio of 160), while allowing for decompression of specific samples in less than a second and decompression of specific variants in 17 ms. GTRAC uses and adapts techniques from information theory, such as a specialized Lempel-Ziv compressor, and tailored succinct data structures. Availability and Implementation: The GTRAC algorithm is available for download at: https://github.com/kedartatwawadi/GTRAC Contact: kedart@stanford.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:27587665
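    As a rough reading of the figures quoted above, the Python sketch below back-computes the uncompressed size from the 1.1 GB compressed size and the ratio of 160. It assumes that "compression ratio" means uncompressed size divided by compressed size; the helper name uncompressed_size_gb is hypothetical and not part of GTRAC.

```python
def uncompressed_size_gb(compressed_gb: float, ratio: float) -> float:
    """Uncompressed size, assuming ratio = uncompressed / compressed."""
    return compressed_gb * ratio


samples = 1092
compressed_gb = 1.1
total_gb = uncompressed_size_gb(compressed_gb, 160)

print(f"{total_gb:.0f} GB uncompressed in total")                    # 176 GB
print(f"{total_gb / samples * 1000:.0f} MB per sample uncompressed")  # 161 MB
print(f"{compressed_gb / samples * 1000:.1f} MB per sample compressed")  # 1.0 MB
```

    Under that assumption, each sample occupies about 1 MB in compressed form, which gives a sense of why sub-second decompression of a single sample is plausible.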

  4. GTRAC: fast retrieval from compressed collections of genomic variants.

    PubMed

    Tatwawadi, Kedar; Hernaez, Mikel; Ochoa, Idoia; Weissman, Tsachy

    2016-09-01

    The dramatic decrease in the cost of sequencing has resulted in the generation of huge amounts of genomic data, as evidenced by projects such as the UK10K and the Million Veteran Project, with the number of sequenced genomes ranging in the order of 10 K to 1 M. Due to the large redundancies among genomic sequences of individuals from the same species, most of the medical research deals with the variants in the sequences as compared with a reference sequence, rather than with the complete genomic sequences. Consequently, millions of genomes represented as variants are stored in databases. These databases are constantly updated and queried to extract information such as the common variants among individuals or groups of individuals. Previous algorithms for compression of this type of databases lack efficient random access capabilities, rendering querying the database for particular variants and/or individuals extremely inefficient, to the point where compression is often relinquished altogether. We present a new algorithm for this task, called GTRAC, that achieves significant compression ratios while allowing fast random access over the compressed database. For example, GTRAC is able to compress a Homo sapiens dataset containing 1092 samples in 1.1 GB (compression ratio of 160), while allowing for decompression of specific samples in less than a second and decompression of specific variants in 17 ms. GTRAC uses and adapts techniques from information theory, such as a specialized Lempel-Ziv compressor, and tailored succinct data structures. The GTRAC algorithm is available for download at: https://github.com/kedartatwawadi/GTRAC CONTACT: kedart@stanford.edu Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  5. Detection of alternative splice variants at the proteome level in Aspergillus flavus.

    PubMed

    Chang, Kung-Yen; Georgianna, D Ryan; Heber, Steffen; Payne, Gary A; Muddiman, David C

    2010-03-05

    Identification of proteins from proteolytic peptides or intact proteins plays an essential role in proteomics. Researchers use search engines to match the acquired peptide sequences to the target proteins. However, search engines depend on protein databases to provide candidates for consideration. Alternative splicing (AS), the mechanism where the exon of pre-mRNAs can be spliced and rearranged to generate distinct mRNA and therefore protein variants, enables higher eukaryotic organisms, with only a limited number of genes, to have the requisite complexity and diversity at the proteome level. Multiple alternative isoforms from one gene often share common segments of sequences. However, many protein databases only include a limited number of isoforms to keep minimal redundancy. As a result, the database search might not identify a target protein even with high quality tandem MS data and accurate intact precursor ion mass. We computationally predicted an exhaustive list of putative isoforms of Aspergillus flavus proteins from 20 371 expressed sequence tags to investigate whether an alternative splicing protein database can assign a greater proportion of mass spectrometry data. The newly constructed AS database provided 9807 new alternatively spliced variants in addition to 12 832 previously annotated proteins. The searches of the existing tandem MS spectra data set using the AS database identified 29 new proteins encoded by 26 genes. Nine fungal genes appeared to have multiple protein isoforms. In addition to the discovery of splice variants, the AS database also showed potential to improve genome annotation. In summary, the introduction of an alternative splicing database helps identify more proteins and unveils more information about a proteome.

  6. Novel LOVD databases for hereditary breast cancer and colorectal cancer genes in the Chinese population.

    PubMed

    Pan, Min; Cong, Peikuan; Wang, Yue; Lin, Changsong; Yuan, Ying; Dong, Jian; Banerjee, Santasree; Zhang, Tao; Chen, Yanling; Zhang, Ting; Chen, Mingqing; Hu, Peter; Zheng, Shu; Zhang, Jin; Qi, Ming

    2011-12-01

    The Human Variome Project (HVP) is an international consortium of clinicians, geneticists, and researchers from over 30 countries, aiming to facilitate the establishment and maintenance of standards, systems, and infrastructure for the worldwide collection and sharing of all genetic variations affecting human disease. The HVP-China Node will build new and supplement existing databases of genetic diseases. As the first effort, we have created a novel variant database of BRCA1 and BRCA2, mismatch repair genes (MMR), and APC genes for breast cancer, Lynch syndrome, and familial adenomatous polyposis (FAP), respectively, in the Chinese population using the Leiden Open Variation Database (LOVD) format. We searched PubMed and some Chinese search engines to collect all the variants of these genes in the Chinese population that have already been detected and reported. There are some differences in the gene variants between the Chinese population and that of other ethnicities. The database is available online at http://www.genomed.org/LOVD/. Our database will appear to users who survey other LOVD databases (e.g., by Google search, or by NCBI GeneTests search). Remote submissions are accepted, and the information is updated monthly. © 2011 Wiley Periodicals, Inc.

  7. MARRVEL: Integration of Human and Model Organism Genetic Resources to Facilitate Functional Annotation of the Human Genome.

    PubMed

    Wang, Julia; Al-Ouran, Rami; Hu, Yanhui; Kim, Seon-Young; Wan, Ying-Wooi; Wangler, Michael F; Yamamoto, Shinya; Chao, Hsiao-Tuan; Comjean, Aram; Mohr, Stephanie E; Perrimon, Norbert; Liu, Zhandong; Bellen, Hugo J

    2017-06-01

    One major challenge encountered with interpreting human genetic variants is the limited understanding of the functional impact of genetic alterations on biological processes. Furthermore, there remains an unmet demand for an efficient survey of the wealth of information on human homologs in model organisms across numerous databases. To efficiently assess the large volume of publicly available information, it is important to provide a concise summary of the most relevant information in a rapid user-friendly format. To this end, we created MARRVEL (model organism aggregated resources for rare variant exploration). MARRVEL is a publicly available website that integrates information from six human genetic databases and seven model organism databases. For any given variant or gene, MARRVEL displays information from OMIM, ExAC, ClinVar, Geno2MP, DGV, and DECIPHER. Importantly, it curates model organism-specific databases to concurrently display a concise summary regarding the human gene homologs in budding and fission yeast, worm, fly, fish, mouse, and rat on a single webpage. Experiment-based information on tissue expression, protein subcellular localization, biological process, and molecular function for the human gene and homologs in the seven model organisms are arranged into a concise output. Hence, rather than visiting multiple separate databases for variant and gene analysis, users can obtain important information by searching once through MARRVEL. Altogether, MARRVEL dramatically improves efficiency and accessibility to data collection and facilitates analysis of human genes and variants by cross-disciplinary integration of 18 million records available in public databases to facilitate clinical diagnosis and basic research. Copyright © 2017 American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.

  8. CYP21A2 mutation update: Comprehensive analysis of databases and published genetic variants.

    PubMed

    Simonetti, Leandro; Bruque, Carlos D; Fernández, Cecilia S; Benavides-Mori, Belén; Delea, Marisol; Kolomenski, Jorge E; Espeche, Lucía D; Buzzalino, Noemí D; Nadra, Alejandro D; Dain, Liliana

    2018-01-01

    Congenital adrenal hyperplasia (CAH) is a group of autosomal recessive disorders of adrenal steroidogenesis. Disorders in steroid 21-hydroxylation account for over 95% of patients with CAH. Clinically, the 21-hydroxylase deficiency has been classified in a broad spectrum of clinical forms, ranging from severe or classical, to mild late onset or non-classical. Known allelic variants in the disease-causing CYP21A2 gene are spread among different sources. Until recently, most variants reported have been identified in the clinical setting, which presumably biases described variants to pathogenic ones, as those found in the CYPAlleles database. Nevertheless, a large number of variants are being described in massive genome projects, many of which are found in dbSNP, but lack functional implications and/or their phenotypic effect. In this work, we gathered a total of 1,340 GVs in the CYP21A2 gene, from which 899 variants were unique and 230 have an effect on human health, and compiled all this information in an integrated database. We also connected CYP21A2 sequence information to phenotypic effects for all available mutations, including double mutants in cis. Data compiled in the present work could help physicians in the genetic counseling of families affected with 21-hydroxylase deficiency. © 2017 Wiley Periodicals, Inc.

  9. CanvasDB: a local database infrastructure for analysis of targeted- and whole genome re-sequencing projects

    PubMed Central

    Ameur, Adam; Bunikis, Ignas; Enroth, Stefan; Gyllensten, Ulf

    2014-01-01

    CanvasDB is an infrastructure for management and analysis of genetic variants from massively parallel sequencing (MPS) projects. The system stores SNP and indel calls in a local database, designed to handle very large datasets, to allow for rapid analysis using simple commands in R. Functional annotations are included in the system, making it suitable for direct identification of disease-causing mutations in human exome- (WES) or whole-genome sequencing (WGS) projects. The system has a built-in filtering function implemented to simultaneously take into account variant calls from all individual samples. This enables advanced comparative analysis of variant distribution between groups of samples, including detection of candidate causative mutations within family structures and genome-wide association by sequencing. In most cases, these analyses are executed within just a matter of seconds, even when there are several hundreds of samples and millions of variants in the database. We demonstrate the scalability of canvasDB by importing the individual variant calls from all 1092 individuals present in the 1000 Genomes Project into the system, over 4.4 billion SNPs and indels in total. Our results show that canvasDB makes it possible to perform advanced analyses of large-scale WGS projects on a local server. Database URL: https://github.com/UppsalaGenomeCenter/CanvasDB PMID:25281234
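    The filtering idea described above, taking variant calls from all samples into account at once, can be sketched with plain set operations. The Python example below is a minimal illustration under stated assumptions: the function shared_case_only, the Variant tuple layout, and the toy family data are all hypothetical. CanvasDB itself is driven from R against a local database, and this is not its API.

```python
from typing import Dict, Iterable, Set, Tuple

# Hypothetical variant key: (chromosome, position, reference allele, alternate allele)
Variant = Tuple[str, int, str, str]


def shared_case_only(
    calls: Dict[str, Set[Variant]],
    cases: Iterable[str],
    controls: Iterable[str],
) -> Set[Variant]:
    """Variants called in every case sample and in none of the control samples."""
    case_ids = list(cases)
    if not case_ids:
        return set()
    shared = set.intersection(*(calls[s] for s in case_ids))
    seen_in_controls = set().union(*(calls[s] for s in controls))
    return shared - seen_in_controls


# Toy family: two affected siblings and an unaffected mother (made-up data).
calls = {
    "child": {("chr1", 100, "A", "G"), ("chr2", 200, "C", "T")},
    "sibling": {("chr1", 100, "A", "G"), ("chr2", 200, "C", "T"), ("chr3", 300, "G", "A")},
    "mother": {("chr2", 200, "C", "T")},
}

print(shared_case_only(calls, ["child", "sibling"], ["mother"]))
# {('chr1', 100, 'A', 'G')}
```

    In a real system this comparison would run inside the database over millions of rows; the in-memory sets here only show the logic of the comparison.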

  10. CanvasDB: a local database infrastructure for analysis of targeted- and whole genome re-sequencing projects.

    PubMed

    Ameur, Adam; Bunikis, Ignas; Enroth, Stefan; Gyllensten, Ulf

    2014-01-01

    CanvasDB is an infrastructure for management and analysis of genetic variants from massively parallel sequencing (MPS) projects. The system stores SNP and indel calls in a local database, designed to handle very large datasets, to allow for rapid analysis using simple commands in R. Functional annotations are included in the system, making it suitable for direct identification of disease-causing mutations in human exome- (WES) or whole-genome sequencing (WGS) projects. The system has a built-in filtering function implemented to simultaneously take into account variant calls from all individual samples. This enables advanced comparative analysis of variant distribution between groups of samples, including detection of candidate causative mutations within family structures and genome-wide association by sequencing. In most cases, these analyses are executed within just a matter of seconds, even when there are several hundreds of samples and millions of variants in the database. We demonstrate the scalability of canvasDB by importing the individual variant calls from all 1092 individuals present in the 1000 Genomes Project into the system, over 4.4 billion SNPs and indels in total. Our results show that canvasDB makes it possible to perform advanced analyses of large-scale WGS projects on a local server. Database URL: https://github.com/UppsalaGenomeCenter/CanvasDB. © The Author(s) 2014. Published by Oxford University Press.

  11. The curation of genetic variants: difficulties and possible solutions.

    PubMed

    Pandey, Kapil Raj; Maden, Narendra; Poudel, Barsha; Pradhananga, Sailendra; Sharma, Amit Kumar

    2012-12-01

    The curation of genetic variants from biomedical articles is required for various clinical and research purposes. Nowadays, establishment of variant databases that include overall information about variants is becoming quite popular. These databases have immense utility, serving as a user-friendly information storehouse of variants for information seekers. While manual curation is the gold standard method for curation of variants, it can turn out to be time-consuming on a large scale, thus necessitating automation. Curation of variants described in biomedical literature may not be straightforward, mainly due to various nomenclature and expression issues. Though current trends in paper writing on variants are inclined to the standard nomenclature such that variants can easily be retrieved, we have a massive store of variants in the literature that are present as non-standard names, and the online search engines that are predominantly used may not be capable of finding them. For effective curation of variants, knowledge about the overall process of curation, nature and types of difficulties in curation, and ways to tackle the difficulties during the task are crucial. Only by effective curation can variants be correctly interpreted. This paper presents the process and difficulties of curation of genetic variants with possible solutions and suggestions from our work experience in the field including literature support. The paper also highlights aspects of interpretation of genetic variants and the importance of writing papers on variants following standard and retrievable methods. Copyright © 2012. Published by Elsevier Ltd.

  12. The Curation of Genetic Variants: Difficulties and Possible Solutions

    PubMed Central

    Pandey, Kapil Raj; Maden, Narendra; Poudel, Barsha; Pradhananga, Sailendra; Sharma, Amit Kumar

    2012-01-01

    The curation of genetic variants from biomedical articles is required for various clinical and research purposes. Nowadays, establishment of variant databases that include overall information about variants is becoming quite popular. These databases have immense utility, serving as a user-friendly information storehouse of variants for information seekers. While manual curation is the gold standard method for curation of variants, it can turn out to be time-consuming on a large scale, thus necessitating automation. Curation of variants described in biomedical literature may not be straightforward, mainly due to various nomenclature and expression issues. Though current trends in paper writing on variants are inclined to the standard nomenclature such that variants can easily be retrieved, we have a massive store of variants in the literature that are present as non-standard names, and the online search engines that are predominantly used may not be capable of finding them. For effective curation of variants, knowledge about the overall process of curation, nature and types of difficulties in curation, and ways to tackle the difficulties during the task are crucial. Only by effective curation can variants be correctly interpreted. This paper presents the process and difficulties of curation of genetic variants with possible solutions and suggestions from our work experience in the field including literature support. The paper also highlights aspects of interpretation of genetic variants and the importance of writing papers on variants following standard and retrievable methods. PMID:23317699

  13. GALT protein database: querying structural and functional features of GALT enzyme.

    PubMed

    d'Acierno, Antonio; Facchiano, Angelo; Marabotti, Anna

    2014-09-01

    Knowledge of the impact of variations on protein structure can enhance the comprehension of the mechanisms of genetic diseases related to that protein. Here, we present a new version of GALT Protein Database, a Web-accessible data repository for the storage and interrogation of structural effects of variations of the enzyme galactose-1-phosphate uridylyltransferase (GALT), the impairment of which leads to classic Galactosemia, a rare genetic disease. This new version of this database now contains the models of 201 missense variants of GALT enzyme, including heterozygous variants, and it allows users not only to retrieve information about the missense variations affecting this protein, but also to investigate their impact on substrate binding, intersubunit interactions, stability, and other structural features. In addition, it allows the interactive visualization of the models of variants collected into the database. We have developed additional tools to improve the use of the database by nonspecialized users. This Web-accessible database (http://bioinformatica.isa.cnr.it/GALT/GALT2.0) represents a model of tools potentially suitable for application to other proteins that are involved in human pathologies and that are subjected to genetic variations. © 2014 WILEY PERIODICALS, INC.

  14. Multiple endocrine neoplasia type 1 (MEN1): An update of 208 new germline variants reported in the last nine years.

    PubMed

    Concolino, Paola; Costella, Alessandra; Capoluongo, Ettore

    2016-01-01

    This review will focus on the germline MEN1 mutations that have been reported in patients with MEN1 and other hereditary endocrine disorders from 2007 to September 2015. A comprehensive review regarding the analysis of 1336 MEN1 mutations reported in the first decade following the gene's identification was performed by Lemos and Thakker in 2008. No other similar papers are available in the literature apart from these data. We also checked the list of Locus-Specific DataBases (LSDBs) and found five free online MEN1 mutation databases. 151 articles from the NCBI PubMed literature database were read and evaluated and a total of 75 MEN1 variants were found. In addition, 67, 22 and 44 novel MEN1 variants were obtained from the ClinVar, MEN1 at Café Variome and HGMD (The Human Gene Mutation Database) databases, respectively. A final careful analysis of MEN1 mutations affecting the coding region was performed. Copyright © 2016 Elsevier Inc. All rights reserved.

  15. A rare variant of the mtDNA HVS1 sequence in the hairs of Napoléon's family.

    PubMed

    Lucotte, Gérard

    2010-10-04

    This paper describes the finding of a rare variant in the sequence of the hypervariable segment (HVS1) of mitochondrial DNA (mtDNA) extracted from two preserved hairs, authenticated as belonging to the French Emperor Napoléon I (Napoléon Bonaparte). This rare variant is a mutation that changes the base C to T at position 16,184 (16184C→T), and it constitutes the only mutation found in this HVS1 sequence. This mutation is rare, because it was not found in a reference database (P < 0.05). In a personal database (M. Pala) comprising 37,000 different sequences, the 16184C→T mutation was found in only three samples; thus in this database the mutation frequency was about 0.00008 (3/37,000, or roughly 0.008%). This mutation 16184C→T was also the only variant found subsequently in the HVS1 sequences of mtDNAs extracted from Napoléon's mother (Letizia) and from his youngest sister (Caroline), confirming that this mutation is maternally inherited. This 16184C→T variant could be used for genetic verification to authenticate any doubtful material and determine whether it should indeed be attributed to Napoléon.

  16. A rare variant of the mtDNA HVS1 sequence in the hairs of Napoléon's family

    PubMed Central

    2010-01-01

    This paper describes the finding of a rare variant in the sequence of the hypervariable segment (HVS1) of mitochondrial DNA (mtDNA) extracted from two preserved hairs, authenticated as belonging to the French Emperor Napoléon I (Napoléon Bonaparte). This rare variant is a mutation that changes the base C to T at position 16,184 (16184C→T), and it constitutes the only mutation found in this HVS1 sequence. This mutation is rare, because it was not found in a reference database (P < 0.05). In a personal database (M. Pala) comprising 37,000 different sequences, the 16184C→T mutation was found in only three samples; thus in this database the mutation frequency was about 0.00008 (3/37,000, or roughly 0.008%). This mutation 16184C→T was also the only variant found subsequently in the HVS1 sequences of mtDNAs extracted from Napoléon's mother (Letizia) and from his youngest sister (Caroline), confirming that this mutation is maternally inherited. This 16184C→T variant could be used for genetic verification to authenticate any doubtful material and determine whether it should indeed be attributed to Napoléon. PMID:21092341

  17. Harmonizing the interpretation of genetic variants across the world: the Malaysian experience.

    PubMed

    Hassan, Nik Norliza Nik; Plazzer, John-Paul; Smith, Timothy D; Halim-Fikri, Hashim; Macrae, Finlay; Zubaidi, A A L; Zilfalil, Bin Alwi

    2016-02-26

    Databases for gene variants are very useful for sharing genetic data and to facilitate the understanding of the genetic basis of diseases. This report summarises the issues surrounding the development of the Malaysian Human Variome Project Country Node. The focus is on human germline variants. Somatic variants, mitochondrial variants and other types of genetic variation have corresponding databases which are not covered here, as they have specific issues that do not necessarily apply to germline variations. The ethical, legal, social issues, intellectual property, ownership of the data, information technology implementation, and efforts to improve the standards and systems used in data sharing are discussed. An overarching framework such as provided by the Human Variome Project to co-ordinate activities is invaluable. Country Nodes, such as MyHVP, enable human gene variation associated with human diseases to be collected, stored and shared by all disciplines (clinicians, molecular biologists, pathologists, bioinformaticians) for a consistent interpretation of genetic variants locally and across the world.

  18. The Spectrum of Pedagogical Orientations of Malawian and South African Physical Science Teachers towards Inquiry

    ERIC Educational Resources Information Center

    Ramnarain, Umesh; Nampota, Dorothy; Schuster, David

    2016-01-01

    This study investigated and compared the pedagogical orientations of physical sciences teachers in Malawi and South Africa towards inquiry or direct methods of science teaching. Pedagogical orientation has been theorized as a component of pedagogical content knowledge. Orientations were characterized along a spectrum of two variants of inquiry and…

  19. Nominal ISOMERs (Incorrect Spellings Of Medicines Eluding Researchers)-variants in the spellings of drug names in PubMed: a database review.

    PubMed

    Ferner, Robin E; Aronson, Jeffrey K

    2016-12-14

    Objective: To examine how misspellings of drug names could impede searches for published literature. Design: Database review. Data source: PubMed. Review methods: The study included 30 drug names that are commonly misspelt on prescription charts in hospitals in Birmingham, UK (test set), and 30 control names randomly chosen from a hospital formulary (control set). The following definitions were used: standard names (the international non-proprietary names), variant names (deviations in spelling from standard names that are not themselves standard names in English language nomenclature), and hidden reference variants (variant spellings that identified publications in textword (tw) searches of PubMed or other databases, and which were not identified by textword searches for the standard names). Variant names were generated from standard names by applying letter substitutions, omissions, additions, transpositions, duplications, deduplications, and combinations of these. Searches were carried out in PubMed (30 June 2016) for "standard name[tw]" and "variant name[tw] NOT standard name[tw]." Results: The 30 standard names of drugs in the test set gave 325 979 hits in total, and 160 hidden reference variants gave 3872 hits (1.17%). The standard names of the control set gave 470 064 hits, and 79 hidden reference variants gave 766 hits (0.16%). Letter substitutions (particularly i to y and vice versa) and omissions together accounted for 2924 (74%) of the variants. Amitriptyline (8530 hits) yielded 18 hidden reference variants (179 (2.1%) hits). Names ending in "in," "ine," or "micin" were commonly misspelt. Failing to search for hidden reference variants of "gentamicin," "amitriptyline," "mirtazapine," and "trazodone" would miss at least 19 systematic reviews. A hidden reference variant related to Christmas, "No-el", was rare; variants of "X-miss" were rarer. Conclusion: When performing searches, researchers should include misspellings of drug names among their search terms. Published by the BMJ Publishing Group Limited.

  20. New workflow for classification of genetic variants' pathogenicity applied to hereditary recurrent fevers by the International Study Group for Systemic Autoinflammatory Diseases (INSAID).

    PubMed

    Van Gijn, Marielle E; Ceccherini, Isabella; Shinar, Yael; Carbo, Ellen C; Slofstra, Mariska; Arostegui, Juan I; Sarrabay, Guillaume; Rowczenio, Dorota; Omoyinmi, Ebun; Balci-Peynircioglu, Banu; Hoffman, Hal M; Milhavet, Florian; Swertz, Morris A; Touitou, Isabelle

    2018-03-29

    Hereditary recurrent fevers (HRFs) are rare inflammatory diseases sharing similar clinical symptoms and effectively treated with anti-inflammatory biological drugs. Accurate diagnosis of HRF relies heavily on genetic testing. This study aimed to obtain an experts' consensus on the clinical significance of gene variants in four well-known HRF genes: MEFV, TNFRSF1A, NLRP3 and MVK. We configured a MOLGENIS web platform to share and analyse pathogenicity classifications of the variants and to manage a consensus-based classification process. Four experts in HRF genetics submitted independent classifications of 858 variants. Classifications were driven to consensus by recruiting four more expert opinions and by targeting discordant classifications in five iterative rounds. Consensus classification was reached for 804/858 variants (94%). None of the unsolved variants (6%) remained with opposite classifications (eg, pathogenic vs benign). New mutational hotspots were found in all genes. We noted a lower pathogenic variant load and a higher fraction of variants with unknown or unsolved clinical significance in the MEFV gene. Applying a consensus-driven process on the pathogenicity assessment of experts yielded rapid classification of almost all variants of four HRF genes. The high-throughput database will profoundly assist clinicians and geneticists in the diagnosis of HRFs. The configured MOLGENIS platform and consensus evolution protocol are usable for assembly of other variant pathogenicity databases. The MOLGENIS software is available for reuse at http://github.com/molgenis/molgenis; the specific HRF configuration is available at http://molgenis.org/said/. The HRF pathogenicity classifications will be published on the INFEVERS database at https://fmf.igh.cnrs.fr/ISSAID/infevers/. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2018. All rights reserved. No commercial use is permitted unless otherwise expressly granted.

  1. Nominal ISOMERs (Incorrect Spellings Of Medicines Eluding Researchers)—variants in the spellings of drug names in PubMed: a database review

    PubMed Central

    Aronson, Jeffrey K

    2016-01-01

    Objective To examine how misspellings of drug names could impede searches for published literature. Design Database review. Data source PubMed. Review methods The study included 30 drug names that are commonly misspelt on prescription charts in hospitals in Birmingham, UK (test set), and 30 control names randomly chosen from a hospital formulary (control set). The following definitions were used: standard names—the international non-proprietary names, variant names—deviations in spelling from standard names that are not themselves standard names in English language nomenclature, and hidden reference variants—variant spellings that identified publications in textword (tw) searches of PubMed or other databases, and which were not identified by textword searches for the standard names. Variant names were generated from standard names by applying letter substitutions, omissions, additions, transpositions, duplications, deduplications, and combinations of these. Searches were carried out in PubMed (30 June 2016) for “standard name[tw]” and “variant name[tw] NOT standard name[tw].” Results The 30 standard names of drugs in the test set gave 325 979 hits in total, and 160 hidden reference variants gave 3872 hits (1.17%). The standard names of the control set gave 470 064 hits, and 79 hidden reference variants gave 766 hits (0.16%). Letter substitutions (particularly i to y and vice versa) and omissions together accounted for 2924 (74%) of the variants. Amitriptyline (8530 hits) yielded 18 hidden reference variants (179 (2.1%) hits). Names ending in “in,” “ine,” or “micin” were commonly misspelt. Failing to search for hidden reference variants of “gentamicin,” “amitriptyline,” “mirtazapine,” and “trazodone” would miss at least 19 systematic reviews. A hidden reference variant related to Christmas, “No-el”, was rare; variants of “X-miss” were rarer. Conclusion When performing searches, researchers should include misspellings of drug names among their search terms. 
PMID:27974346
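
The variant-generation step described in this abstract can be illustrated with a short sketch. It is not the authors' procedure: the function names `spelling_variants` and `pubmed_query` are hypothetical, only single edits are generated (omissions, duplications, adjacent transpositions, and i/y substitutions), and letter additions and combined edits are left out for brevity. Deduplication of a doubled letter is covered by the omission step.

```python
def spelling_variants(name, substitutions=(("i", "y"), ("y", "i"))):
    """Return single-edit misspellings of a drug name (hypothetical helper)."""
    name = name.lower()
    variants = set()
    for i, ch in enumerate(name):
        # Omission of one letter (also collapses a doubled letter).
        variants.add(name[:i] + name[i + 1:])
        # Duplication of one letter.
        variants.add(name[:i] + ch + ch + name[i + 1:])
        # Letter substitution, by default i -> y and y -> i.
        for old, new in substitutions:
            if ch == old:
                variants.add(name[:i] + new + name[i + 1:])
        # Transposition of two adjacent letters.
        if i + 1 < len(name):
            variants.add(name[:i] + name[i + 1] + ch + name[i + 2:])
    # The standard name itself is not a variant.
    variants.discard(name)
    return variants


def pubmed_query(variant, standard):
    """Build a textword query of the form quoted in the abstract."""
    return f"{variant}[tw] NOT {standard}[tw]"


if __name__ == "__main__":
    for v in sorted(spelling_variants("gentamicin"))[:5]:
        print(pubmed_query(v, "gentamicin"))
```

Each generated query would still have to be run against PubMed and checked by hand, since many generated strings are not misspellings anyone has actually published.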

  2. Meta-analysis of CHEK2 1100delC variant and colorectal cancer susceptibility.

    PubMed

    Xiang, He-ping; Geng, Xiao-ping; Ge, Wei-wei; Li, He

    2011-11-01

    Cell cycle checkpoint kinase 2 (CHEK2) gene has been inconsistently associated with colorectal cancer (CRC), particularly the 1100delC variant. To generate large-scale evidence on whether the CHEK2 1100delC variant is associated with CRC susceptibility we have conducted a meta-analysis. Data were collected from the following electronic databases: PubMed, Excerpta Medica Database and Chinese Biomedical Literature Database, with the last report up to November 2010. The odds ratio (OR) and its 95% confidence interval (95% CI) were used to assess the strength of association. We evaluated the contrast of carriers versus non-carriers. Meta-analysis was performed in a fixed/random effect model by using the software Review Manager 4.2. A total of six studies including 4194 cases and 10,010 controls based on the search criteria were involved in this meta-analysis. A significant association of the CHEK2 1100delC variant with unselected CRC was found (OR=2.11, 95% CI=1.41-3.16, P=0.0003). We also found an association of the CHEK2 1100delC variant with familial CRC (OR=2.80, 95% CI=1.74-4.51, P<0.0001). However, the association was not established for sporadic CRC (OR=1.45, 95% CI=0.49-4.30, P=0.50). This meta-analysis demonstrates that the CHEK2 1100delC variant may be an important CRC-predisposing gene, which increases CRC risk. Copyright © 2011. Published by Elsevier Ltd.
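
As an illustration of the effect measure used here, the sketch below computes an odds ratio and its 95% confidence interval for a single carriers versus non-carriers 2x2 table, using the standard log-odds (Woolf) approximation. The counts are made up for the example and are not taken from the meta-analysis; the pooling of several studies under a fixed or random effect model, which the authors did in Review Manager, is not shown.

```python
import math


def odds_ratio_ci(case_carriers, case_noncarriers,
                  control_carriers, control_noncarriers, z=1.96):
    """Odds ratio with a Wald-type confidence interval from a 2x2 table."""
    a, b, c, d = (case_carriers, case_noncarriers,
                  control_carriers, control_noncarriers)
    odds_ratio = (a * d) / (b * c)
    # Standard error of ln(OR) under the Woolf approximation.
    se_log_or = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(odds_ratio) - z * se_log_or)
    upper = math.exp(math.log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper


if __name__ == "__main__":
    # Hypothetical counts: 20/100 carriers among cases, 10/100 among controls.
    or_, lo, hi = odds_ratio_ci(20, 80, 10, 90)
    print(f"OR={or_:.2f}, 95% CI={lo:.2f}-{hi:.2f}")
```

An interval that includes 1, as for the sporadic CRC subgroup in the abstract, means the data do not establish an association at the 5% level.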

  3. EMEN2: An Object Oriented Database and Electronic Lab Notebook

    PubMed Central

    Rees, Ian; Langley, Ed; Chiu, Wah; Ludtke, Steven J.

    2013-01-01

    Transmission electron microscopy and associated methods such as single particle analysis, 2-D crystallography, helical reconstruction and tomography, are highly data-intensive experimental sciences, which also have substantial variability in experimental technique. Object-oriented databases present an attractive alternative to traditional relational databases for situations where the experiments themselves are continually evolving. We present EMEN2, an easy to use object-oriented database with a highly flexible infrastructure originally targeted for transmission electron microscopy and tomography, which has been extended to be adaptable for use in virtually any experimental science. It is a pure object-oriented database designed for easy adoption in diverse laboratory environments, and does not require professional database administration. It includes a full featured, dynamic web interface in addition to APIs for programmatic access. EMEN2 installations currently support roughly 800 scientists worldwide with over 1/2 million experimental records and over 20 TB of experimental data. The software is freely available with complete source. PMID:23360752

  4. Microstructural development inside the stress induced martensite variant in a Ti-Ni-Nb shape memory alloy

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Zheng, Y.F.; Cai, W.; Zhang, J.X.

    2000-04-03

    The microstructural development inside the stress induced martensite (SIM) variants in Ti-Ni-Nb alloy with various degrees of deformation has been revealed by electron microscopic observations. The orientation relationship between the SIM and the parent phase has been found: $[1\bar{1}0]_M \parallel [11\bar{1}]_{B2}$, with $(001)_M$ 5° away from $(101)_{B2}$. The lattice invariant shear of the SIM variants at the slightly deformed stage is dominantly the $(11\bar{1})$ Type I twin. Besides the ordinary slip, the adjustment and development of the internal secondary twinning from the $(11\bar{1})$ Type I twin to the $\langle 011 \rangle$ Type II or $(011)$ Type I twin, the $(001)$ compound twin and the $(111)$ Type I twin happen concurrently or in combination inside the SIM variants with further deformation. The corresponding deformation mechanisms include stress induced reorientation of SIM substructural bands by the most favorably oriented twin system, stress induced migration of the SIM substructural boundary through internal twinning and stress induced injection of foreign SIM variant to the preexisting substructural bands.

  5. Identification of Candidate Gene Variants in Korean MODY Families by Whole-Exome Sequencing.

    PubMed

    Shim, Ye Jee; Kim, Jung Eun; Hwang, Su-Kyeong; Choi, Bong Seok; Choi, Byung Ho; Cho, Eun-Mi; Jang, Kyoung Mi; Ko, Cheol Woo

    2015-01-01

    To date, 13 genes causing maturity-onset diabetes of the young (MODY) have been identified. However, there is a big discrepancy in the genetic locus between Asian and Caucasian patients with MODY. Thus, we conducted whole-exome sequencing in Korean MODY families to identify causative gene variants. Six MODY probands and their family members were included. Variants in the dbSNP135 and TIARA databases for Koreans and the variants with minor allele frequencies >0.5% of the 1000 Genomes database were excluded. We selected only the functional variants (gain of stop codon, frameshifts and nonsynonymous single-nucleotide variants) and conducted a case-control comparison in the family members. The selected variants were scanned for the previously introduced gene set implicated in glucose metabolism. Three variants c.620C>T:p.Thr207Ile in PTPRD, c.559C>G:p.Gln187Glu in SYT9, and c.1526T>G:p.Val509Gly in WFS1 were respectively identified in 3 families. We could not find any disease-causative alleles of known MODY 1-13 genes. Based on the predictive program, Thr207Ile in PTPRD was considered pathogenic. Whole-exome sequencing is a valuable method for the genetic diagnosis of MODY. Further evaluation is necessary about the role of PTPRD, SYT9 and WFS1 in normal insulin release from pancreatic beta cells. © 2015 S. Karger AG, Basel.

  6. Deciphering Variability of PKD1 and PKD2 in an Italian Cohort of 643 Patients with Autosomal Dominant Polycystic Kidney Disease (ADPKD)

    PubMed Central

    Carrera, Paola; Calzavara, Silvia; Magistroni, Riccardo; den Dunnen, Johan T.; Rigo, Francesca; Stenirri, Stefania; Testa, Francesca; Messa, Piergiorgio; Cerutti, Roberta; Scolari, Francesco; Izzi, Claudia; Edefonti, Alberto; Negrisolo, Susanna; Benetti, Elisa; Alibrandi, Maria Teresa Sciarrone; Manunta, Paolo; Boletta, Alessandra; Ferrari, Maurizio

    2016-01-01

    Autosomal Dominant Polycystic Kidney Disease (ADPKD) is the most common hereditary kidney disease. We analysed PKD1 and PKD2, in a large cohort of 440 unrelated Italian patients with ADPKD and 203 relatives by direct sequencing and MLPA. Molecular and detailed phenotypic data have been collected and submitted to the PKD1/PKD2 LOVD database. This is the first large retrospective study in Italian patients, describing 701 variants, 249 (35.5%) already associated with ADPKD and 452 (64.5%) novel. According to the criteria adopted, the overall detection rate was 80% (352/440). Novel variants with uncertain significance were found in 14% of patients. Among patients with pathogenic variants, in 301 (85.5%) the disease is associated with PKD1, 196 (55.7%) truncating, 81 (23%) non truncating, 24 (6.8%) IF indels, and in 51 (14.5%) with PKD2. Our results outline the high allelic heterogeneity of variants, complicated by the presence of variants of uncertain significance as well as of multiple variants in the same subject. Classification of novel variants may be particularly cumbersome having an important impact on the genetic counselling. Our study confirms the importance to improve the assessment of variant pathogenicity for ADPKD; to this point databasing of both clinical and molecular data is crucial. PMID:27499327

  7. Object-oriented structures supporting remote sensing databases

    NASA Technical Reports Server (NTRS)

    Wichmann, Keith; Cromp, Robert F.

    1995-01-01

    Object-oriented databases show promise for modeling the complex interrelationships pervasive in scientific domains. To examine the utility of this approach, we have developed an Intelligent Information Fusion System based on this technology, and applied it to the problem of managing an active repository of remotely-sensed satellite scenes. The design and implementation of the system is compared and contrasted with conventional relational database techniques, followed by a presentation of the underlying object-oriented data structures used to enable fast indexing into the data holdings.

  8. Multilevel biological characterization of exomic variants at the protein level significantly improves the identification of their deleterious effects.

    PubMed

    Raimondi, Daniele; Gazzo, Andrea M; Rooman, Marianne; Lenaerts, Tom; Vranken, Wim F

    2016-06-15

    There are now many predictors capable of identifying the likely phenotypic effects of single nucleotide variants (SNVs) or short in-frame Insertions or Deletions (INDELs) on the increasing amount of genome sequence data. Most of these predictors focus on SNVs and use a combination of features related to sequence conservation, biophysical, and/or structural properties to link the observed variant to either neutral or disease phenotype. Despite notable successes, the mapping between genetic variants and their phenotypic effects is riddled with levels of complexity that are not yet fully understood and that are often not taken into account in the predictions, despite their promise of significantly improving the prediction of deleterious mutants. We present DEOGEN, a novel variant effect predictor that can handle both missense SNVs and in-frame INDELs. By integrating information from different biological scales and mimicking the complex mixture of effects that lead from the variant to the phenotype, we obtain significant improvements in the variant-effect prediction results. Next to the typical variant-oriented features based on the evolutionary conservation of the mutated positions, we added a collection of protein-oriented features that are based on functional aspects of the gene affected. We cross-validated DEOGEN on 36 825 polymorphisms, 20 821 deleterious SNVs, and 1038 INDELs from SwissProt. The multilevel contextualization of each (variant, protein) pair in DEOGEN provides a 10% improvement of MCC with respect to current state-of-the-art tools. The software and the data presented here are publicly available at http://ibsquare.be/deogen. Contact: wvranken@vub.ac.be. Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved.
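
The MCC reported above is the Matthews correlation coefficient, a standard summary of a binary confusion matrix that stays informative when the two classes are unbalanced. A minimal sketch of the formula follows; the function name `mcc` and the convention of returning 0.0 when a marginal total is zero are choices made for this example, not details of DEOGEN.

```python
import math


def mcc(tp, tn, fp, fn):
    """Matthews correlation coefficient from confusion-matrix counts."""
    denominator = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denominator == 0:
        # Assumed convention: undefined cases are reported as 0.0.
        return 0.0
    return (tp * tn - fp * fn) / denominator


if __name__ == "__main__":
    # Made-up counts for a deleterious-versus-neutral classifier.
    print(round(mcc(tp=80, tn=90, fp=10, fn=20), 3))
```

The value ranges from -1 (every prediction wrong) through 0 (no better than chance) to 1 (every prediction right).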

  9. BISQUE: locus- and variant-specific conversion of genomic, transcriptomic and proteomic database identifiers.

    PubMed

    Meyer, Michael J; Geske, Philip; Yu, Haiyuan

    2016-05-15

    Biological sequence databases are integral to efforts to characterize and understand biological molecules and share biological data. However, when analyzing these data, scientists are often left holding disparate biological currency: molecular identifiers from different databases. For downstream applications that require converting the identifiers themselves, there are many resources available, but analyzing associated loci and variants can be cumbersome if data is not given in a form amenable to particular analyses. Here we present BISQUE, a web server and customizable command-line tool for converting molecular identifiers and their contained loci and variants between different database conventions. BISQUE uses a graph traversal algorithm to generalize the conversion process for residues in the human genome, genes, transcripts and proteins, allowing for conversion across classes of molecules and in all directions through an intuitive web interface and a URL-based web service. BISQUE is freely available via the web using any major web browser (http://bisque.yulab.org/). Source code is available in a public GitHub repository (https://github.com/hyulab/BISQUE). Contact: haiyuan.yu@cornell.edu. Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved.
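
To make the idea of graph traversal for identifier conversion concrete, the sketch below treats identifier types as nodes and direct conversions as edges, then uses breadth-first search to find the shortest chain of conversions between two types. This is only an illustration of the general technique: the graph `CONVERSIONS`, its node names, and the function `conversion_path` are hypothetical and do not describe BISQUE's actual data model or code.

```python
from collections import deque

# Hypothetical conversion graph: each key can be converted directly
# into any of the identifier types listed in its value.
CONVERSIONS = {
    "gene": ["transcript"],
    "transcript": ["gene", "protein"],
    "protein": ["transcript", "external_protein"],
    "external_protein": ["protein"],
}


def conversion_path(graph, source, target):
    """Return the shortest list of identifier types from source to target."""
    queue = deque([[source]])
    seen = {source}
    while queue:
        path = queue.popleft()
        node = path[-1]
        if node == target:
            return path
        for neighbour in graph.get(node, []):
            if neighbour not in seen:
                seen.add(neighbour)
                queue.append(path + [neighbour])
    # No chain of conversions connects the two types.
    return None


if __name__ == "__main__":
    print(conversion_path(CONVERSIONS, "gene", "external_protein"))
```

Because the edges run in both directions, the same search handles conversion "in all directions", and adding a new identifier type only requires adding its direct neighbours to the graph.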

  10. Identification of Inherited Retinal Disease-Associated Genetic Variants in 11 Candidate Genes.

    PubMed

    Astuti, Galuh D N; van den Born, L Ingeborgh; Khan, M Imran; Hamel, Christian P; Bocquet, Béatrice; Manes, Gaël; Quinodoz, Mathieu; Ali, Manir; Toomes, Carmel; McKibbin, Martin; El-Asrag, Mohammed E; Haer-Wigman, Lonneke; Inglehearn, Chris F; Black, Graeme C M; Hoyng, Carel B; Cremers, Frans P M; Roosing, Susanne

    2018-01-10

    Inherited retinal diseases (IRDs) display an enormous genetic heterogeneity. Whole exome sequencing (WES) recently identified genes that were mutated in a small proportion of IRD cases. Consequently, finding a second case or family carrying pathogenic variants in the same candidate gene often is challenging. In this study, we searched for novel candidate IRD gene-associated variants in isolated IRD families, assessed their causality, and searched for novel genotype-phenotype correlations. Whole exome sequencing was performed in 11 probands affected with IRDs. Homozygosity mapping data was available for five cases. Variants with minor allele frequencies ≤ 0.5% in public databases were selected as candidate disease-causing variants. These variants were ranked based on their: (a) presence in a gene that was previously implicated in IRD; (b) minor allele frequency in the Exome Aggregation Consortium database (ExAC); (c) in silico pathogenicity assessment using the combined annotation dependent depletion (CADD) score; and (d) interaction of the corresponding protein with known IRD-associated proteins. Twelve unique variants were found in 11 different genes in 11 IRD probands. Novel autosomal recessive and dominant inheritance patterns were found for variants in Small Nuclear Ribonucleoprotein U5 Subunit 200 (SNRNP200) and Zinc Finger Protein 513 (ZNF513), respectively. Using our pathogenicity assessment, a variant in DEAH-Box Helicase 32 (DHX32) was the top ranked novel candidate gene to be associated with IRDs, followed by eight medium and lower ranked candidate genes. The identification of candidate disease-associated sequence variants in 11 single families underscores the notion that the previously identified IRD-associated genes collectively carry > 90% of the defects implicated in IRDs. To identify multiple patients or families with variants in the same gene and thereby provide extra proof for pathogenicity, worldwide data sharing is needed.
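
The filter-then-rank step described above can be sketched as follows. The abstract does not say how the four criteria were weighted, so the ordering used here (known IRD gene first, then higher CADD score, then lower allele frequency) is an assumption, the protein-interaction criterion is omitted, and the `Variant` records with their gene labels and values are invented for the example.

```python
from dataclasses import dataclass


@dataclass
class Variant:
    gene: str              # placeholder gene label
    maf: float             # minor allele frequency as a fraction (0.005 = 0.5%)
    cadd: float            # CADD score; higher suggests more deleterious
    known_ird_gene: bool   # gene previously implicated in IRD


def select_and_rank(variants, maf_cutoff=0.005):
    """Keep variants with MAF <= cutoff, then sort by the assumed priorities."""
    rare = [v for v in variants if v.maf <= maf_cutoff]
    return sorted(rare, key=lambda v: (not v.known_ird_gene, -v.cadd, v.maf))


# Invented example records.
candidates = [
    Variant("GENE_A", 0.02, 30.0, True),     # too common, filtered out
    Variant("GENE_B", 0.001, 12.0, False),
    Variant("GENE_C", 0.004, 25.0, False),
    Variant("GENE_D", 0.0, 18.0, True),
]

if __name__ == "__main__":
    for v in select_and_rank(candidates):
        print(v.gene, v.maf, v.cadd)
```

Changing the sort key is enough to try a different prioritisation, for example putting allele frequency ahead of the CADD score.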

  11. UMD-USHbases: a comprehensive set of databases to record and analyse pathogenic mutations and unclassified variants in seven Usher syndrome causing genes.

    PubMed

    Baux, David; Faugère, Valérie; Larrieu, Lise; Le Guédard-Méreuze, Sandie; Hamroun, Dalil; Béroud, Christophe; Malcolm, Sue; Claustres, Mireille; Roux, Anne-Françoise

    2008-08-01

    Using the Universal Mutation Database (UMD) software, we have constructed "UMD-USHbases", a set of relational databases of nucleotide variations for seven genes involved in Usher syndrome (MYO7A, CDH23, PCDH15, USH1C, USH1G, USH3A and USH2A). Mutations in the Usher syndrome type I causing genes are also recorded in non-syndromic hearing loss cases and mutations in USH2A in non-syndromic retinitis pigmentosa. Usher syndrome provides a particular challenge for molecular diagnostics because of the clinical and molecular heterogeneity. As many mutations are missense changes, and all the genes also contain apparently non-pathogenic polymorphisms, well-curated databases are crucial for accurate interpretation of pathogenicity. Tools are provided to assess the pathogenicity of mutations, including conservation of amino acids and analysis of splice-sites. Reference amino acid alignments are provided. Apparently non-pathogenic variants in patients with Usher syndrome, at both the nucleotide and amino acid level, are included. The UMD-USHbases currently contain more than 2,830 entries including disease causing mutations, unclassified variants or non-pathogenic polymorphisms identified in over 938 patients. In addition to data collected from 89 publications, 15 novel mutations identified in our laboratory are recorded in MYO7A (6), CDH23 (8), or PCDH15 (1) genes. Information is given on the relative involvement of the seven genes, the number and distribution of variants in each gene. UMD-USHbases give access to a software package that provides specific routines and optimized multicriteria research and sorting tools. These databases should assist clinicians and geneticists seeking information about mutations responsible for Usher syndrome.

  12. Large-scale mass spectrometric detection of variant peptides resulting from non-synonymous nucleotide differences

    PubMed Central

    Sheynkman, Gloria M.; Shortreed, Michael R.; Frey, Brian L.; Scalf, Mark; Smith, Lloyd M.

    2013-01-01

    Each individual carries thousands of non-synonymous single nucleotide variants (nsSNVs) in their genome, each corresponding to a single amino acid polymorphism (SAP) in the encoded proteins. It is important to be able to directly detect and quantify these variations at the protein level in order to study post-transcriptional regulation, differential allelic expression, and other important biological processes. However, such variant peptides are not generally detected in standard proteomic analyses, due to their absence from the generic databases that are employed for mass spectrometry searching. Here, we extend previous work that demonstrated the use of customized SAP databases constructed from sample-matched RNA-Seq data. We collected deep coverage RNA-Seq data from the Jurkat cell line, compiled the set of nsSNVs that are expressed, used this information to construct a customized SAP database, and searched it against deep coverage shotgun MS data obtained from the same sample. This approach enabled detection of 421 SAP peptides mapping to 395 nsSNVs. We compared these peptides to peptides identified from a large generic search database containing all known nsSNVs (dbSNP) and found that more than 70% of the SAP peptides from this dbSNP-derived search were not supported by the RNA-Seq data, and thus are likely false positives. Next, we increased the SAP coverage from the RNA-Seq derived database by utilizing multiple protease digestions, thereby increasing variant detection to 695 SAP peptides mapping to 504 nsSNV sites. These detected SAP peptides corresponded to moderate to high abundance transcripts (30+ transcripts per million, TPM). The SAP peptides included 192 allelic pairs; the relative expression levels of the two alleles were evaluated for 51 of those pairs, and found to be comparable in all cases. PMID:24175627

  13. eMelanoBase: an online locus-specific variant database for familial melanoma.

    PubMed

    Fung, David C Y; Holland, Elizabeth A; Becker, Therese M; Hayward, Nicholas K; Bressac-de Paillerets, Brigitte; Mann, Graham J

    2003-01-01

    A proportion of melanoma-prone individuals in both familial and non-familial contexts has been shown to carry inactivating mutations in either CDKN2A or, rarely, CDK4. CDKN2A is a complex locus that encodes two unrelated proteins from alternately spliced transcripts that are read in different frames. The alpha transcript (exons 1alpha, 2, and 3) produces the p16INK4A cyclin-dependent kinase inhibitor, while the beta transcript (exons 1beta and 2) is translated as p14ARF, a stabilizing factor of p53 levels through binding to MDM2. Mutations in exon 2 can impair both polypeptides and insertions and deletions in exons 1alpha, 1beta, and 2, which can theoretically generate p16INK4A-p14ARF fusion proteins. No online database currently takes into account all the consequences of these genotypes, a situation compounded by some problematic previous annotations of CDKN2A-related sequences and descriptions of their mutations. As an initiative of the international Melanoma Genetics Consortium, we have therefore established a database of germline variants observed in all loci implicated in familial melanoma susceptibility. Such a comprehensive, publicly accessible database is an essential foundation for research on melanoma susceptibility and its clinical application. Our database serves two types of data as defined by HUGO. The core dataset includes the nucleotide variants on the genomic and transcript levels, amino acid variants, and citation. The ancillary dataset includes keyword description of events at the transcription and translation levels and epidemiological data. The application that handles users' queries was designed in the model-view-controller architecture and was implemented in Java. The object-relational database schema was deduced using functional dependency analysis. We hereby present our first functional prototype of eMelanoBase. The service is accessible via the URL www.wmi.usyd.edu.au:8080/melanoma.html. Copyright 2002 Wiley-Liss, Inc.

  14. DRUMS: a human disease related unique gene mutation search engine.

    PubMed

    Li, Zuofeng; Liu, Xingnan; Wen, Jingran; Xu, Ye; Zhao, Xin; Li, Xuan; Liu, Lei; Zhang, Xiaoyan

    2011-10-01

    With the completion of the human genome project and the development of new methods for gene variant detection, the integration of mutation data and its phenotypic consequences has become more important than ever. Among all available resources, locus-specific databases (LSDBs) curate one or more specific genes' mutation data along with high-quality phenotypes. Although some genotype-phenotype data from LSDBs have been integrated into central databases, little effort has been made to integrate all these data by a search engine approach. In this work, we have developed the disease related unique gene mutation search engine (DRUMS), a search engine for human disease related unique gene mutations, as a convenient tool for biologists or physicians to retrieve gene variant and related phenotype information. Gene variant and phenotype information were stored in a gene-centred relational database. Moreover, the relationships between mutations and diseases were indexed by the uniform resource identifier from an LSDB or another central database. By querying DRUMS, users can access the most popular mutation databases under one interface. DRUMS could be treated as a domain specific search engine. By using web crawling, indexing, and searching technologies, it provides a competitively efficient interface for searching and retrieving mutation data and their relationships to diseases. The present system is freely accessible at http://www.scbit.org/glif/new/drums/index.html. © 2011 Wiley-Liss, Inc.

  15. Mutation Update for GNE Gene Variants Associated with GNE Myopathy

    PubMed Central

    Celeste, Frank V.; Vilboux, Thierry; Ciccone, Carla; de Dios, John Karl; Malicdan, May Christine V.; Leoyklang, Petcharat; McKew, John C.; Gahl, William A.; Carrillo-Carrasco, Nuria; Huizing, Marjan

    2014-01-01

    The GNE gene encodes the rate-limiting, bifunctional enzyme of sialic acid biosynthesis, UDP-N-acetylglucosamine 2-epimerase/N-acetylmannosamine kinase (GNE). Biallelic GNE mutations underlie GNE myopathy, an adult-onset progressive myopathy. GNE myopathy-associated GNE mutations are predominantly missense, resulting in reduced, but not absent, GNE enzyme activities. The exact pathomechanism of GNE myopathy remains unknown, but likely involves aberrant (muscle) sialylation. Here we summarize 154 reported and novel GNE variants associated with GNE myopathy, including 122 missense, 11 nonsense, 14 insertion/deletions and 7 intronic variants. All variants were deposited in the online GNE variation database (http://www.dmd.nl/nmdb2/home.php?select_db=GNE). We report the predicted effects on protein function of all variants as well as the predicted effects on epimerase and/or kinase enzymatic activities of selected variants. By analyzing exome sequence databases, we identified three frequently occurring, unreported GNE missense variants/polymorphisms, important for future sequence interpretations. Based on allele frequencies, we estimate the world-wide prevalence of GNE myopathy to be ~ 4–21/1,000,000. This previously unrecognized high prevalence confirms suspicions that many patients may escape diagnosis. Awareness among physicians for GNE myopathy is essential for the identification of new patients, which is required for better understanding of the disorder’s pathomechanism and for the success of ongoing treatment trials. PMID:24796702

  16. VAS: A Vision Advisor System combining agents and object-oriented databases

    NASA Technical Reports Server (NTRS)

    Eilbert, James L.; Lim, William; Mendelsohn, Jay; Braun, Ron; Yearwood, Michael

    1994-01-01

    A model-based approach to identifying and finding the orientation of non-overlapping parts on a tray has been developed. The part models contain both exact and fuzzy descriptions of part features, and are stored in an object-oriented database. Full identification of the parts involves several interacting tasks, each of which is handled by a distinct agent. Using fuzzy information stored in the model allowed part features that were essentially at the noise level to be extracted and used for identification. This was done by focusing attention on the portion of the part where the feature must be found if the current hypothesis of the part ID is correct. In going from one set of parts to another, the only thing that needs to be changed is the database of part models. This work is part of an effort in developing a Vision Advisor System (VAS) that combines agents and object-oriented databases.

  17. The Saccharomyces Genome Database Variant Viewer

    PubMed Central

    Sheppard, Travis K.; Hitz, Benjamin C.; Engel, Stacia R.; Song, Giltae; Balakrishnan, Rama; Binkley, Gail; Costanzo, Maria C.; Dalusag, Kyla S.; Demeter, Janos; Hellerstedt, Sage T.; Karra, Kalpana; Nash, Robert S.; Paskov, Kelley M.; Skrzypek, Marek S.; Weng, Shuai; Wong, Edith D.; Cherry, J. Michael

    2016-01-01

    The Saccharomyces Genome Database (SGD; http://www.yeastgenome.org) is the authoritative community resource for the Saccharomyces cerevisiae reference genome sequence and its annotation. In recent years, we have moved toward increased representation of sequence variation and allelic differences within S. cerevisiae. The publication of numerous additional genomes has motivated the creation of new tools for their annotation and analysis. Here we present the Variant Viewer: a dynamic open-source web application for the visualization of genomic and proteomic differences. Multiple sequence alignments have been constructed across high quality genome sequences from 11 different S. cerevisiae strains and stored in the SGD. The alignments and summaries are encoded in JSON and used to create a two-tiered dynamic view of the budding yeast pan-genome, available at http://www.yeastgenome.org/variant-viewer. PMID:26578556

  18. Screening of Variations in CD22 Gene in Children with B-Precursor Acute Lymphoblastic Leukemia.

    PubMed

    Aslar Oner, Deniz; Akin, Dilara Fatma; Sipahi, Kadir; Mumcuoglu, Mine; Ezer, Ustun; Kürekci, A Emin; Akar, Nejat

    2016-09-01

    CD22 is expressed on the surface of B-cell lineage cells from the early progenitor stage of pro-B cell until terminal differentiation to mature B cells. It plays a role in signal transduction and as a regulator of B-cell receptor signaling in B-cell development. We aimed to screen exons 9-14 of the CD22 gene, which is a mutational hot spot region in B-precursor acute lymphoblastic leukemia (pre-B ALL) patients, to find possible genetic variants that could play a role in the pathogenesis of pre-B ALL in Turkish children. This study included 109 Turkish children with pre-B ALL who were diagnosed at Losante Hospital for Children with Leukemia. Genomic DNA was extracted from both peripheral blood and bone marrow leukocytes. Gene amplification was performed with PCR, and all samples were screened for the variants by single strand conformation polymorphism. Samples showing band shifts were sequenced on an automated sequencer. In our patient group a total of 9 variants were identified in the CD22 gene by sequencing: a novel variant in intron 10 (T2199G); a missense variant in exon 12; 5 intronic variants between exon 12 and intron 13; a novel intronic variant (C2424T); and a synonymous variant in exon 13. Thirteen of 109 children (11.9%) carried the T2199G novel intronic variant located in intron 10, and 17 of 109 children (15.6%) carried the C2424T novel intronic variant. Novel variants in the CD22 gene that are not present in the Human Gene Mutation Database or the NCBI SNP database were found in children with pre-B ALL in Turkey.

  19. Reliability database development for use with an object-oriented fault tree evaluation program

    NASA Technical Reports Server (NTRS)

    Heger, A. Sharif; Harringtton, Robert J.; Koen, Billy V.; Patterson-Hine, F. Ann

    1989-01-01

    A description is given of the development of a fault-tree analysis method using object-oriented programming. In addition, the authors discuss the programs that have been developed or are under development to connect a fault-tree analysis routine to a reliability database. To assess the performance of the routines, a relational database simulating one of the nuclear power industry databases has been constructed. For a realistic assessment of the results of this project, the use of one of the existing nuclear power reliability databases is planned.

  20. Expanded national database collection and data coverage in the FINDbase worldwide database for clinically relevant genomic variation allele frequencies

    PubMed Central

    Viennas, Emmanouil; Komianou, Angeliki; Mizzi, Clint; Stojiljkovic, Maja; Mitropoulou, Christina; Muilu, Juha; Vihinen, Mauno; Grypioti, Panagiota; Papadaki, Styliani; Pavlidis, Cristiana; Zukic, Branka; Katsila, Theodora; van der Spek, Peter J.; Pavlovic, Sonja; Tzimas, Giannis; Patrinos, George P.

    2017-01-01

    FINDbase (http://www.findbase.org) is a comprehensive data repository that records the prevalence of clinically relevant genomic variants in various populations worldwide, such as pathogenic variants leading mostly to monogenic disorders and pharmacogenomics biomarkers. The database also records the incidence of rare genetic diseases in various populations, all in well-distinct data modules. Here, we report extensive data content updates in all data modules, with direct implications to clinical pharmacogenomics. Also, we report significant new developments in FINDbase, namely (i) the release of a new version of the ETHNOS software that catalyzes development and curation of national/ethnic genetic databases, (ii) the migration of all FINDbase data content into 90 distinct national/ethnic mutation databases, all built around Microsoft's PivotViewer (http://www.getpivot.com) software, (iii) new data visualization tools and (iv) the interrelation of FINDbase with the DruGeVar database, with direct implications in clinical pharmacogenomics. The abovementioned updates further enhance the impact of FINDbase as a key resource for Genomic Medicine applications. PMID:27924022

  1. Growth mechanism of extension twin variants during annealing of pure magnesium: An ‘ex situ’ electron backscattered diffraction investigation

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Sabat, R.K.

    Pure magnesium was subjected to plastic deformation through CSM (continuous stiffness measurement) indentation followed by annealing at 200 °C for 30 min. After annealing, no nucleation of new grains was observed either at the twin–twin intersections or at the multiple twin variants of a grain. Significant growth of off-basal twin orientation compared to basal twin orientation was observed in the sample after annealing and is attributed to the partially coherent nature of the twin boundary in the latter case. Further, growth of twins was independent of the strain distribution between parent and twinned grains. - Highlights: • An ‘ex situ’ EBSD of pure Mg during annealing was investigated. • Nucleation of no new grains was observed. • Significant growth of off-basal twin orientation was observed. • Growth of twins may be attributed to the partially coherent nature of the twin boundary.

  2. MitBASE : a comprehensive and integrated mitochondrial DNA database. The present status

    PubMed Central

    Attimonelli, M.; Altamura, N.; Benne, R.; Brennicke, A.; Cooper, J. M.; D’Elia, D.; Montalvo, A. de; Pinto, B. de; De Robertis, M.; Golik, P.; Knoop, V.; Lanave, C.; Lazowska, J.; Licciulli, F.; Malladi, B. S.; Memeo, F.; Monnerot, M.; Pasimeni, R.; Pilbout, S.; Schapira, A. H. V.; Sloof, P.; Saccone, C.

    2000-01-01

    MitBASE is an integrated and comprehensive database of mitochondrial DNA data which collects, under a single interface, databases for Plant, Vertebrate, Invertebrate, Human, Protist and Fungal mtDNA and a Pilot database on nuclear genes involved in mitochondrial biogenesis in Saccharomyces cerevisiae. MitBASE reports all available information from different organisms and from intraspecies variants and mutants. Data have been drawn from the primary databases and from the literature; value-adding information has been structured, e.g., editing information on protist mtDNA genomes, pathological information for human mtDNA variants, etc. The different databases, some of which are structured using commercial packages (Microsoft Access, File Maker Pro) while others use a flat-file format, have been integrated under ORACLE. Ad hoc retrieval systems have been devised for some of the above listed databases, taking into account their peculiarities. The database is resident at the EBI and is available at the following site: http://www3.ebi.ac.uk/Research/Mitbase/mitbase.pl . The impact of this project is intended for both basic and applied research. The study of mitochondrial genetic diseases and mitochondrial DNA intraspecies diversity are key topics in several biotechnological fields. The database has been funded within the EU Biotechnology programme. PMID:10592207

  3. SPAX - PAX with Super-Pages

    NASA Astrophysics Data System (ADS)

    Bößwetter, Daniel

    Much has been written about the pros and cons of column-orientation as a means to speed up read-mostly analytic workloads in relational databases. In this paper we try to dissect the primitive mechanisms of a database that help express the coherence of tuples and present a novel way of organizing relational data in order to exploit the advantages of both the row-oriented and the column-oriented world. As we go, we break with yet another bad habit of databases, namely the equal granularity of reads and writes, which leads us to the introduction of consecutive clusters of disk pages called super-pages.
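
    The abstract above does not spell out the SPAX layout itself, so the following is only a minimal sketch of the general PAX idea that the title builds on: rows are first grouped into pages (as in a row store), and inside each page the values are regrouped by attribute into "minipages" (as in a column store). The three-attribute row type, the page size and all function names are assumptions made for illustration; super-pages are not modelled.

```python
from typing import Any, List, Sequence, Tuple

Row = Tuple[Any, ...]  # hypothetical tuple type; one value per attribute


def paginate(rows: Sequence[Row], rows_per_page: int) -> List[List[Row]]:
    """Row-oriented step: cut the table into fixed-size pages of whole rows."""
    return [list(rows[i:i + rows_per_page])
            for i in range(0, len(rows), rows_per_page)]


def to_pax_page(page_rows: Sequence[Row]) -> List[List[Any]]:
    """Column-oriented step inside one page: one minipage per attribute."""
    return [list(column) for column in zip(*page_rows)]


def from_pax_page(minipages: Sequence[Sequence[Any]]) -> List[Row]:
    """Rebuild the rows of a page; all attributes of a row live in the same page."""
    return [tuple(values) for values in zip(*minipages)]


rows = [(1, "a", 0.5), (2, "b", 1.5), (3, "c", 2.5)]
pages = paginate(rows, rows_per_page=2)
pax_pages = [to_pax_page(page) for page in pages]
```

    In this sketch a scan over a single attribute touches one minipage per page, while reconstructing a full row never needs more than one page, which is the trade-off between the row-oriented and column-oriented worlds that the abstract refers to.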

  4. Human Chromosome Y and Haplogroups; introducing YDHS Database.

    PubMed

    Tiirikka, Timo; Moilanen, Jukka S

    2015-12-01

    As the high throughput sequencing efforts generate more biological information, scientists from different disciplines are interpreting the polymorphisms that make us unique. In addition, there is an increasing trend among the general public to research their own genealogy, find distant relatives and to know more about their biological background. Commercial vendors are providing analyses of mitochondrial and Y-chromosomal markers for such purposes. Clearly, an easy-to-use free interface to the existing data on the identified variants would be in the interest of the general public and professionals less familiar with the field. Here we introduce a novel metadatabase YDHS that aims to provide such an interface for Y-chromosomal DNA (Y-DNA) haplogroups and sequence variants. The database uses the ISOGG Y-DNA tree as the source of mutations and haplogroups, and by using genomic positions of the mutations the database links them to genes and other biological entities. YDHS contains analysis tools for deeper Y-SNP analysis. YDHS addresses the shortage of Y-DNA related databases. We have tested our database using a set of different cases from literature ranging from infertility to autism. The database is at http://www.semanticgen.net/ydhs. Y-chromosomal DNA (Y-DNA) haplogroups and sequence variants have not been in the scientific limelight, excluding certain specialized fields like forensics, mainly because there is not much freely available information or it is scattered in different sources. However, as we have demonstrated, Y-SNPs do play a role in various cases on the haplogroup level and it is possible to create a free Y-DNA dedicated bioinformatics resource.

  5. The Clock Is Ticking: Library Orientation as Puzzle Room

    ERIC Educational Resources Information Center

    Reade, Tripp

    2017-01-01

    Tripp Reade is the school librarian at Cardinal Gibbons High School in Raleigh, North Carolina. This article describes how he redesigned his school's library orientation program after learning about escape rooms and a variant known as puzzle rooms. Puzzle rooms present players with a set of challenges to solve; they require "teamwork,…

  6. A Toolkit for Active Object-Oriented Databases with Application to Interoperability

    NASA Technical Reports Server (NTRS)

    King, Roger

    1996-01-01

    In our original proposal we stated that our research would 'develop a novel technology that provides a foundation for collaborative information processing.' The essential ingredient of this technology is the notion of 'deltas,' which are first-class values representing collections of proposed updates to a database. The Heraclitus framework provides a variety of algebraic operators for building up, combining, inspecting, and comparing deltas. Deltas can be directly applied to the database to yield a new state, or used 'hypothetically' in queries against the state that would arise if the delta were applied. The central point here is that the step of elevating deltas to 'first-class' citizens in database programming languages will yield tremendous leverage on the problem of supporting updates in collaborative information processing. In short, our original intention was to develop the theoretical and practical foundation for a technology based on deltas in an object-oriented database context, develop a toolkit for active object-oriented databases, and apply this toward collaborative information processing.
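
    As a rough illustration of what treating deltas as first-class values can look like, the sketch below models a database state as a plain dictionary and a delta as an immutable value that can be combined with another delta, applied to yield a new state, or used hypothetically in a query. The class and method names (`Delta`, `then`, `apply`, `query_when`) are invented for this example and are not taken from the Heraclitus framework, whose actual operators the abstract does not list.

```python
from dataclasses import dataclass, field
from typing import Any, Callable, Dict, FrozenSet, Mapping

State = Dict[str, Any]  # hypothetical database state: key -> value


@dataclass(frozen=True)
class Delta:
    """A first-class value describing proposed updates (illustrative only)."""
    upserts: Mapping[str, Any] = field(default_factory=dict)
    deletes: FrozenSet[str] = frozenset()

    def apply(self, state: State) -> State:
        """Return the new state; the input state is left untouched."""
        new_state = {k: v for k, v in state.items() if k not in self.deletes}
        new_state.update(self.upserts)
        return new_state

    def then(self, later: "Delta") -> "Delta":
        """Combine two deltas; on conflict the later delta wins."""
        upserts = {k: v for k, v in self.upserts.items()
                   if k not in later.deletes}
        upserts.update(later.upserts)
        deletes = (self.deletes - set(later.upserts)) | later.deletes
        return Delta(upserts, frozenset(deletes))


def query_when(state: State, delta: Delta,
               query: Callable[[State], Any]) -> Any:
    """Evaluate a query against the state that would arise if delta were applied."""
    return query(delta.apply(state))


state = {"a": 1, "b": 2}
d1 = Delta({"c": 3}, frozenset({"a"}))   # insert c, delete a
d2 = Delta({"a": 10})                    # re-insert a with a new value
combined = d1.then(d2)
```

    Because `apply` returns a fresh dictionary, `query_when` can answer "what if" questions without committing anything, and `combined.apply(state)` gives the same result as applying `d1` and then `d2` in sequence.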

  7. A Toolkit for Active Object-Oriented Databases with Application to Interoperability

    NASA Technical Reports Server (NTRS)

    King, Roger

    1996-01-01

    In our original proposal we stated that our research would 'develop a novel technology that provides a foundation for collaborative information processing.' The essential ingredient of this technology is the notion of 'deltas,' which are first-class values representing collections of proposed updates to a database. The Heraclitus framework provides a variety of algebraic operators for building up, combining, inspecting, and comparing deltas. Deltas can be directly applied to the database to yield a new state, or used 'hypothetically' in queries against the state that would arise if the delta were applied. The central point here is that the step of elevating deltas to 'first-class' citizens in database programming languages will yield tremendous leverage on the problem of supporting updates in collaborative information processing. In short, our original intention was to develop the theoretical and practical foundation for a technology based on deltas in an object-oriented database context, develop a toolkit for active object-oriented databases, and apply this toward collaborative information processing.

  8. Filovirus RefSeq Entries: Evaluation and Selection of Filovirus Type Variants, Type Sequences, and Names

    PubMed Central

    Kuhn, Jens H.; Andersen, Kristian G.; Bào, Yīmíng; Bavari, Sina; Becker, Stephan; Bennett, Richard S.; Bergman, Nicholas H.; Blinkova, Olga; Bradfute, Steven; Brister, J. Rodney; Bukreyev, Alexander; Chandran, Kartik; Chepurnov, Alexander A.; Davey, Robert A.; Dietzgen, Ralf G.; Doggett, Norman A.; Dolnik, Olga; Dye, John M.; Enterlein, Sven; Fenimore, Paul W.; Formenty, Pierre; Freiberg, Alexander N.; Garry, Robert F.; Garza, Nicole L.; Gire, Stephen K.; Gonzalez, Jean-Paul; Griffiths, Anthony; Happi, Christian T.; Hensley, Lisa E.; Herbert, Andrew S.; Hevey, Michael C.; Hoenen, Thomas; Honko, Anna N.; Ignatyev, Georgy M.; Jahrling, Peter B.; Johnson, Joshua C.; Johnson, Karl M.; Kindrachuk, Jason; Klenk, Hans-Dieter; Kobinger, Gary; Kochel, Tadeusz J.; Lackemeyer, Matthew G.; Lackner, Daniel F.; Leroy, Eric M.; Lever, Mark S.; Mühlberger, Elke; Netesov, Sergey V.; Olinger, Gene G.; Omilabu, Sunday A.; Palacios, Gustavo; Panchal, Rekha G.; Park, Daniel J.; Patterson, Jean L.; Paweska, Janusz T.; Peters, Clarence J.; Pettitt, James; Pitt, Louise; Radoshitzky, Sheli R.; Ryabchikova, Elena I.; Saphire, Erica Ollmann; Sabeti, Pardis C.; Sealfon, Rachel; Shestopalov, Aleksandr M.; Smither, Sophie J.; Sullivan, Nancy J.; Swanepoel, Robert; Takada, Ayato; Towner, Jonathan S.; van der Groen, Guido; Volchkov, Viktor E.; Volchkova, Valentina A.; Wahl-Jensen, Victoria; Warren, Travis K.; Warfield, Kelly L.; Weidmann, Manfred; Nichol, Stuart T.

    2014-01-01

    Sequence determination of complete or coding-complete genomes of viruses is becoming common practice for supporting the work of epidemiologists, ecologists, virologists, and taxonomists. Sequencing duration and costs are rapidly decreasing, sequencing hardware is under modification for use by non-experts, and software is constantly being improved to simplify sequence data management and analysis. Thus, analysis of virus disease outbreaks on the molecular level is now feasible, including characterization of the evolution of individual virus populations in single patients over time. The increasing accumulation of sequencing data creates a management problem for the curators of commonly used sequence databases and an entry retrieval problem for end users. Therefore, utilizing the data to their fullest potential will require setting nomenclature and annotation standards for virus isolates and associated genomic sequences. The National Center for Biotechnology Information’s (NCBI’s) RefSeq is a non-redundant, curated database for reference (or type) nucleotide sequence records that supplies source data to numerous other databases. Building on recently proposed templates for filovirus variant naming [<virus name> (<strain>)/<isolation host-suffix>/<country of sampling>/<year of sampling>/<genetic variant designation>-<isolate designation>], we report consensus decisions from a majority of past and currently active filovirus experts on the eight filovirus type variants and isolates to be represented in RefSeq, their final designations, and their associated sequences. PMID:25256396

  9. PredictSNP2: A Unified Platform for Accurately Evaluating SNP Effects by Exploiting the Different Characteristics of Variants in Distinct Genomic Regions

    PubMed Central

    Brezovský, Jan

    2016-01-01

    An important message taken from human genome sequencing projects is that the human population exhibits approximately 99.9% genetic similarity. Variations in the remaining parts of the genome determine our identity, trace our history and reveal our heritage. The precise delineation of phenotypically causal variants plays a key role in providing accurate personalized diagnosis, prognosis, and treatment of inherited diseases. Several computational methods for achieving such delineation have been reported recently. However, their ability to pinpoint potentially deleterious variants is limited by the fact that their mechanisms of prediction do not account for the existence of different categories of variants. Consequently, their output is biased towards the variant categories that are most strongly represented in the variant databases. Moreover, most such methods provide numeric scores but not binary predictions of the deleteriousness of variants or confidence scores that would be more easily understood by users. We have constructed three datasets covering different types of disease-related variants, which were divided across five categories: (i) regulatory, (ii) splicing, (iii) missense, (iv) synonymous, and (v) nonsense variants. These datasets were used to develop category-optimal decision thresholds and to evaluate six tools for variant prioritization: CADD, DANN, FATHMM, FitCons, FunSeq2 and GWAVA. This evaluation revealed some important advantages of the category-based approach. The results obtained with the five best-performing tools were then combined into a consensus score. Additional comparative analyses showed that in the case of missense variations, protein-based predictors perform better than DNA sequence-based predictors. A user-friendly web interface was developed that provides easy access to the five tools’ predictions, and their consensus scores, in a user-understandable format tailored to the specific features of different categories of variations. 
    To enable comprehensive evaluation of variants, the predictions are complemented with annotations from eight databases. The web server is freely available to the community at http://loschmidt.chemi.muni.cz/predictsnp2. PMID:27224906

  10. PredictSNP2: A Unified Platform for Accurately Evaluating SNP Effects by Exploiting the Different Characteristics of Variants in Distinct Genomic Regions.

    PubMed

    Bendl, Jaroslav; Musil, Miloš; Štourač, Jan; Zendulka, Jaroslav; Damborský, Jiří; Brezovský, Jan

    2016-05-01

    An important message taken from human genome sequencing projects is that the human population exhibits approximately 99.9% genetic similarity. Variations in the remaining parts of the genome determine our identity, trace our history and reveal our heritage. The precise delineation of phenotypically causal variants plays a key role in providing accurate personalized diagnosis, prognosis, and treatment of inherited diseases. Several computational methods for achieving such delineation have been reported recently. However, their ability to pinpoint potentially deleterious variants is limited by the fact that their mechanisms of prediction do not account for the existence of different categories of variants. Consequently, their output is biased towards the variant categories that are most strongly represented in the variant databases. Moreover, most such methods provide numeric scores but not binary predictions of the deleteriousness of variants or confidence scores that would be more easily understood by users. We have constructed three datasets covering different types of disease-related variants, which were divided across five categories: (i) regulatory, (ii) splicing, (iii) missense, (iv) synonymous, and (v) nonsense variants. These datasets were used to develop category-optimal decision thresholds and to evaluate six tools for variant prioritization: CADD, DANN, FATHMM, FitCons, FunSeq2 and GWAVA. This evaluation revealed some important advantages of the category-based approach. The results obtained with the five best-performing tools were then combined into a consensus score. Additional comparative analyses showed that in the case of missense variations, protein-based predictors perform better than DNA sequence-based predictors. A user-friendly web interface was developed that provides easy access to the five tools' predictions, and their consensus scores, in a user-understandable format tailored to the specific features of different categories of variations. 
    To enable comprehensive evaluation of variants, the predictions are complemented with annotations from eight databases. The web server is freely available to the community at http://loschmidt.chemi.muni.cz/predictsnp2.

  11. Direct evidence of detwinning in polycrystalline Ni-Mn-Ga ferromagnetic shape memory alloys during deformation.

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Nie, Z. H.; Lin Peng, R.; Johansson, S.

    2008-01-01

    In situ time-of-flight neutron diffraction and high-energy x-ray diffraction techniques were used to reveal the preferred reselection of martensite variants through a detwinning process in polycrystalline Ni-Mn-Ga ferromagnetic shape memory alloys under uniaxial compressive stress. The variant reorientation via detwinning during loading can be explained by considering the influence of external stress on the grain/variant orientation-dependent distortion energy. These direct observations of detwinning provide a good understanding of the deformation mechanisms in shape memory alloys.

  12. A proposed clinical decision support architecture capable of supporting whole genome sequence information.

    PubMed

    Welch, Brandon M; Loya, Salvador Rodriguez; Eilbeck, Karen; Kawamoto, Kensaku

    2014-04-04

    Whole genome sequence (WGS) information may soon be widely available to help clinicians personalize the care and treatment of patients. However, considerable barriers exist, which may hinder the effective utilization of WGS information in a routine clinical care setting. Clinical decision support (CDS) offers a potential solution to overcome such barriers and to facilitate the effective use of WGS information in the clinic. However, genomic information is complex and will require significant considerations when developing CDS capabilities. As such, this manuscript lays out a conceptual framework for a CDS architecture designed to deliver WGS-guided CDS within the clinical workflow. To handle the complexity and breadth of WGS information, the proposed CDS framework leverages service-oriented capabilities and orchestrates the interaction of several independently-managed components. These independently-managed components include the genome variant knowledge base, the genome database, the CDS knowledge base, a CDS controller and the electronic health record (EHR). A key design feature is that genome data can be stored separately from the EHR. This paper describes in detail: (1) each component of the architecture; (2) the interaction of the components; and (3) how the architecture attempts to overcome the challenges associated with WGS information. We believe that service-oriented CDS capabilities will be essential to using WGS information for personalized medicine.

  13. A Proposed Clinical Decision Support Architecture Capable of Supporting Whole Genome Sequence Information

    PubMed Central

    Welch, Brandon M.; Rodriguez Loya, Salvador; Eilbeck, Karen; Kawamoto, Kensaku

    2014-01-01

    Whole genome sequence (WGS) information may soon be widely available to help clinicians personalize the care and treatment of patients. However, considerable barriers exist, which may hinder the effective utilization of WGS information in a routine clinical care setting. Clinical decision support (CDS) offers a potential solution to overcome such barriers and to facilitate the effective use of WGS information in the clinic. However, genomic information is complex and will require significant considerations when developing CDS capabilities. As such, this manuscript lays out a conceptual framework for a CDS architecture designed to deliver WGS-guided CDS within the clinical workflow. To handle the complexity and breadth of WGS information, the proposed CDS framework leverages service-oriented capabilities and orchestrates the interaction of several independently-managed components. These independently-managed components include the genome variant knowledge base, the genome database, the CDS knowledge base, a CDS controller and the electronic health record (EHR). A key design feature is that genome data can be stored separately from the EHR. This paper describes in detail: (1) each component of the architecture; (2) the interaction of the components; and (3) how the architecture attempts to overcome the challenges associated with WGS information. We believe that service-oriented CDS capabilities will be essential to using WGS information for personalized medicine. PMID:25411644

  14. A benchmark study of scoring methods for non-coding mutations.

    PubMed

    Drubay, Damien; Gautheret, Daniel; Michiels, Stefan

    2018-05-15

    Detailed knowledge of coding sequences has led to different candidate models for pathogenic variant prioritization. Several deleteriousness scores have been proposed for the non-coding part of the genome, but no large-scale comparison has been carried out to date to assess their performance. We compared the leading scoring tools (CADD, FATHMM-MKL, Funseq2 and GWAVA) and some recent competitors (DANN, SNP and SOM scores) for their ability to discriminate assumed pathogenic variants from assumed benign variants (using the ClinVar, COSMIC and 1000 genomes project databases). Using the ClinVar benchmark, CADD was the best tool for detecting the pathogenic variants that are mainly located in protein coding gene regions. Using the COSMIC benchmark, FATHMM-MKL, GWAVA and SOMliver outperformed the other tools for pathogenic variants that are typically located in lincRNAs, pseudogenes and other parts of the non-coding genome. However, all tools had low precision, which could potentially be improved by future non-coding genome feature discoveries. These results may have been influenced by the presence of potential benign variants in the COSMIC database. The development of a gold standard as consistent as ClinVar for these regions will be necessary to confirm our tool ranking. The Snakemake, C++ and R codes are freely available from https://github.com/Oncostat/BenchmarkNCVTools and supported on Linux. Contact: damien.drubay@gustaveroussy.fr or stefan.michiels@gustaveroussy.fr. Supplementary data are available at Bioinformatics online.

  15. Large scale database scrubbing using object oriented software components.

    PubMed

    Herting, R L; Barnes, M R

    1998-01-01

    Now that case managers, quality improvement teams, and researchers use medical databases extensively, the ability to share and disseminate such databases while maintaining patient confidentiality is paramount. A process called scrubbing addresses this problem by removing personally identifying information while keeping the integrity of the medical information intact. Scrubbing entire databases, containing multiple tables, requires that the implicit relationships between data elements in different tables of the database be maintained. To address this issue we developed DBScrub, a Java program that interfaces with any JDBC compliant database and scrubs the database while maintaining the implicit relationships within it. DBScrub uses a small number of highly configurable object-oriented software components to carry out the scrubbing. We describe the structure of these software components and how they maintain the implicit relationships within the database.
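
    The idea of scrubbing several tables while keeping their implicit relationships intact can be sketched as follows. This is only an illustration in Python, not DBScrub itself (which the abstract describes as a Java program working over JDBC). The function names, the in-memory table layout and the keyed-hash pseudonym scheme are all assumptions made for the example.

```python
import hashlib
import hmac


def pseudonym(value: str, secret: bytes) -> str:
    """Map an identifier to a stable keyed pseudonym (hypothetical scheme)."""
    digest = hmac.new(secret, value.encode("utf-8"), hashlib.sha256).hexdigest()
    return "P" + digest[:12]


def scrub_tables(tables, id_columns, drop_columns, secret):
    """Return scrubbed copies of all tables.

    Columns in drop_columns are removed outright. Columns in id_columns are
    replaced by the same pseudonym wherever the same value occurs, so rows in
    different tables that referred to one person still line up afterwards.
    """
    scrubbed = {}
    for name, rows in tables.items():
        cleaned_rows = []
        for row in rows:
            cleaned = {}
            for column, value in row.items():
                if column in drop_columns:
                    continue  # directly identifying field: remove it
                if column in id_columns:
                    cleaned[column] = pseudonym(str(value), secret)
                else:
                    cleaned[column] = value  # medical content stays intact
            cleaned_rows.append(cleaned)
        scrubbed[name] = cleaned_rows
    return scrubbed


# Toy data: two tables implicitly linked through patient_id.
tables = {
    "patients": [{"patient_id": "12345", "name": "Jane Doe", "age": 54}],
    "visits": [{"patient_id": "12345", "diagnosis": "I10"}],
}
result = scrub_tables(
    tables,
    id_columns={"patient_id"},
    drop_columns={"name"},
    secret=b"demo-key",
)
```

    Because the pseudonym depends only on the original value and the secret, a join on patient_id gives the same row pairs before and after scrubbing, which is the property the abstract calls maintaining implicit relationships.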

  16. A case study for a digital seabed database: Bohai Sea engineering geology database

    NASA Astrophysics Data System (ADS)

    Tianyun, Su; Shikui, Zhai; Baohua, Liu; Ruicai, Liang; Yanpeng, Zheng; Yong, Wang

    2006-07-01

    This paper discusses the design of the ORACLE-based Bohai Sea engineering geology database structure, covering requirements analysis, conceptual structure analysis, logical structure analysis, physical structure analysis and security design. In the study, we used the object-oriented Unified Modeling Language (UML) to model the conceptual structure of the database and used the data management functions provided by the object-oriented and relational database ORACLE to organize and manage the storage space and improve its security. By this means, the database can provide rapid and effective data storage, maintenance and querying to satisfy the application requirements of the Bohai Sea Oilfield Paradigm Area Information System.

  17. Determining object orientation with a hierarchical database of binary synthetic discriminant function filters

    NASA Technical Reports Server (NTRS)

    Reid, Max B.; Ma, Paul W.; Downie, John D.

    1990-01-01

    An optical correlation-based system is demonstrated which recognizes an object and determines its angular orientation by traversing a hierarchical data base of binary filters. The data-base architecture is made possible by the development of binary synthetic discriminant function filters.

  18. Prototyping Visual Database Interface by Object-Oriented Language

    DTIC Science & Technology

    1988-06-01

    approach is to use object-oriented programming. Object-oriented languages are characterized by three criteria [Ref. 4:p. 1.2.1]: - encapsulation of...made it a sub-class of our DMWindow.Cls, which is discussed later in this chapter. This extension to the application had to be integrated with our... abnormal behaviors similar to Korth’s discussion of pitfalls in relational database designing. Even extensions like GEM [Ref. 8] that are powerful and

  19. Standardisation of the FAERS database: a systematic approach to manually recoding drug name variants.

    PubMed

    Wong, Carmen K; Ho, Samuel S; Saini, Bandana; Hibbs, David E; Fois, Romano A

    2015-07-01

    The US Food and Drug Administration Adverse Event Reporting System (FAERS), one of the world's largest spontaneous reporting systems, is difficult to use because of report duplication and a lack of standardisation in the recording of drug names. Unresolved data quality issues may distort statistical analyses, rendering the results difficult to interpret when detecting and monitoring adverse effects of pharmaceutical products. The aim of this study was to develop and implement a data cleaning protocol to identify and resolve drug nomenclature issues. The key 'data treatment' plan involved standardising drug names held in the FAERS database. Four million five hundred and six thousand five hundred and seventy-seven Individual Safety Reports submitted to the FAERS between 1 January 2003 and 31 August 2012 were included in this study. OpenRefine was used to standardise drug name variants in the database such that they were consistent with international non-proprietary nomenclature defined by the World Health Organisation Anatomical Therapeutic Chemical classification. Drug variants where generic constituents could not be confidently determined, undecipherable drug names and non-medicinal products were retained verbatim. After the standardisation process, more than 16 611 916 drug entries were cleaned to their relevant international non-proprietary name. The cleaned drug table comprised 71 858 drug name variants and includes both standardised and original terms. Ninety-nine per cent of drug names were standardised using this method. The millions of reports enclosed in the FAERS contain valuable information that is of interest to pharmacovigilance, toxicology and post-marketing surveillance researchers. With the standardisation of the drug nomenclature, the database can be better utilised by research groups around the world. Copyright © 2015 John Wiley & Sons, Ltd.

  20. Pose-variant facial expression recognition using an embedded image system

    NASA Astrophysics Data System (ADS)

    Song, Kai-Tai; Han, Meng-Ju; Chang, Shuo-Hung

    2008-12-01

    In recent years, one of the most attractive research areas in human-robot interaction has been automated facial expression recognition. By recognizing facial expressions, a pet robot can interact with humans in a more natural manner. In this study, we focus on the facial pose-variant problem. A novel method is proposed in this paper to recognize pose-variant facial expressions. After locating the face position in an image frame, the active appearance model (AAM) is applied to track facial features. Fourteen feature points are extracted to represent the variation of facial expressions. The distances between feature points are defined as the feature values. These feature values are sent to a support vector machine (SVM) for facial expression determination. The pose-variant facial expression is classified into happiness, neutral, sadness, surprise or anger. Furthermore, in order to evaluate the performance for practical applications, this study also built a low resolution database (160x120 pixels) using a CMOS image sensor. Experimental results show that the recognition rate is 84% with the self-built database.
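
    The step of turning tracked feature points into distance-based feature values can be illustrated with a short Python sketch. The abstract does not say which point pairs are used, so taking all pairwise Euclidean distances is an assumption here, as are the function name and the placeholder coordinates. The SVM stage is not shown.

```python
from itertools import combinations
from math import dist


def distance_features(points):
    """Return the Euclidean distance for every unordered pair of points.

    Assumption: all pairs are used. For 14 points this gives
    14 * 13 / 2 = 91 feature values.
    """
    return [dist(p, q) for p, q in combinations(points, 2)]


# Placeholder coordinates standing in for 14 tracked facial feature points.
points = [(float(i), float(i % 3)) for i in range(14)]
features = distance_features(points)
```

    In a full pipeline the resulting vector would be passed to a classifier such as an SVM. Normalising the distances by a reference length (for example, the distance between the eyes) is a common way to reduce sensitivity to image scale, although the abstract does not state whether that was done.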

  1. The Saccharomyces Genome Database Variant Viewer.

    PubMed

    Sheppard, Travis K; Hitz, Benjamin C; Engel, Stacia R; Song, Giltae; Balakrishnan, Rama; Binkley, Gail; Costanzo, Maria C; Dalusag, Kyla S; Demeter, Janos; Hellerstedt, Sage T; Karra, Kalpana; Nash, Robert S; Paskov, Kelley M; Skrzypek, Marek S; Weng, Shuai; Wong, Edith D; Cherry, J Michael

    2016-01-04

    The Saccharomyces Genome Database (SGD; http://www.yeastgenome.org) is the authoritative community resource for the Saccharomyces cerevisiae reference genome sequence and its annotation. In recent years, we have moved toward increased representation of sequence variation and allelic differences within S. cerevisiae. The publication of numerous additional genomes has motivated the creation of new tools for their annotation and analysis. Here we present the Variant Viewer: a dynamic open-source web application for the visualization of genomic and proteomic differences. Multiple sequence alignments have been constructed across high quality genome sequences from 11 different S. cerevisiae strains and stored in the SGD. The alignments and summaries are encoded in JSON and used to create a two-tiered dynamic view of the budding yeast pan-genome, available at http://www.yeastgenome.org/variant-viewer. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

  2. Nomenclature- and Database-Compatible Names for the Two Ebola Virus Variants that Emerged in Guinea and the Democratic Republic of the Congo in 2014

    PubMed Central

    Kuhn, Jens H.; Andersen, Kristian G.; Baize, Sylvain; Bào, Yīmíng; Bavari, Sina; Berthet, Nicolas; Blinkova, Olga; Brister, J. Rodney; Clawson, Anna N.; Fair, Joseph; Gabriel, Martin; Garry, Robert F.; Gire, Stephen K.; Goba, Augustine; Gonzalez, Jean-Paul; Günther, Stephan; Happi, Christian T.; Jahrling, Peter B.; Kapetshi, Jimmy; Kobinger, Gary; Kugelman, Jeffrey R.; Leroy, Eric M.; Maganga, Gael Darren; Mbala, Placide K.; Moses, Lina M.; Muyembe-Tamfum, Jean-Jacques; N’Faly, Magassouba; Nichol, Stuart T.; Omilabu, Sunday A.; Palacios, Gustavo; Park, Daniel J.; Paweska, Janusz T.; Radoshitzky, Sheli R.; Rossi, Cynthia A.; Sabeti, Pardis C.; Schieffelin, John S.; Schoepp, Randal J.; Sealfon, Rachel; Swanepoel, Robert; Towner, Jonathan S.; Wada, Jiro; Wauquier, Nadia; Yozwiak, Nathan L.; Formenty, Pierre

    2014-01-01

    In 2014, Ebola virus (EBOV) was identified as the etiological agent of a large and still expanding outbreak of Ebola virus disease (EVD) in West Africa and a much more confined EVD outbreak in Middle Africa. Epidemiological and evolutionary analyses confirmed that all cases of both outbreaks are connected to a single introduction each of EBOV into human populations and that both outbreaks are not directly connected. Coding-complete genomic sequence analyses of isolates revealed that the two outbreaks were caused by two novel EBOV variants, and initial clinical observations suggest that neither of them should be considered strains. Here we present consensus decisions on naming for both variants (West Africa: “Makona”, Middle Africa: “Lomela”) and provide database-compatible full, shortened, and abbreviated names that are in line with recently established filovirus sub-species nomenclatures. PMID:25421896

  3. Ordinal feature selection for iris and palmprint recognition.

    PubMed

    Sun, Zhenan; Wang, Libin; Tan, Tieniu

    2014-09-01

    Ordinal measures have been demonstrated as an effective feature representation model for iris and palmprint recognition. However, ordinal measures are a general concept of image analysis and numerous variants with different parameter settings, such as location, scale, orientation, and so on, can be derived to construct a huge feature space. This paper proposes a novel optimization formulation for ordinal feature selection with successful applications to both iris and palmprint recognition. The objective function of the proposed feature selection method has two parts, i.e., misclassification error of intra and interclass matching samples and weighted sparsity of ordinal feature descriptors. Therefore, the feature selection aims to achieve an accurate and sparse representation of ordinal measures. The optimization is subject to a number of linear inequality constraints, which require that all intra and interclass matching pairs are well separated with a large margin. Ordinal feature selection is formulated as a linear programming (LP) problem so that a solution can be efficiently obtained even on a large-scale feature pool and training database. Extensive experimental results demonstrate that the proposed LP formulation is advantageous over existing feature selection methods, such as mRMR, ReliefF, Boosting, and Lasso for biometric recognition, reporting state-of-the-art accuracy on CASIA and PolyU databases.

  4. Who's Gonna Pay the Piper for Free Online Databases?

    ERIC Educational Resources Information Center

    Jacso, Peter

    1996-01-01

    Discusses new pricing models for some online services and considers the possibilities for the traditional online database market. Topics include multimedia music databases, including copyright implications; other retail-oriented databases; and paying for free databases with advertising. (LRW)

  5. Genetic polymorphisms of pharmacogenomic VIP variants in the Kyrgyz population from northwest China.

    PubMed

    Yunus, Zulfiya; Liu, Lijun; Wang, Hong; Zhang, Le; Li, Xiaolan; Geng, Tingting; Kang, Longli; Jin, Tianbo; Chen, Chao

    2013-10-15

    Pharmacogenomic variant information is well known for major human populations; however, this information is less commonly studied in minorities. In the present study, we genotyped 85 very important pharmacogenetic (VIP) variants (selected from the PharmGKB database) in the Kyrgyz population and compared our data with four other major human populations: Han Chinese in Beijing, China (CHB), the Japanese in Tokyo, Japan (JPT), a northern and western Europe population (CEU), and the Yoruba in Ibadan, Nigeria (YRI). There were 13, 12 and 16 of the selected VIP variant genotype frequencies in the Kyrgyz which differed from those of the CHB, JPT and CEU, respectively (p<0.005). In the YRI, there were 32 different variants, compared to the Kyrgyz (p<0.005). Genotype frequencies of ADH1B, AHR, CYP3A5, PTGS2, VDR, and VKORC1 in the Kyrgyz differed widely from those in the four populations. Haplotype analyses also showed differences among the Kyrgyz and the other four populations. Our results complement the information provided by the database of pharmacogenomics on Kyrgyz. We provide a theoretical basis for safer drug administration and individualized treatment plans for the Kyrgyz. We also provide a template for the study of pharmacogenomics in various ethnic minority groups in China. © 2013 Elsevier B.V. All rights reserved.

  6. [Phenotypic and genotypic spectra of patients with glucose-6-phosphate dehydrogenase deficiency gene known pathogenic variants: a single-center study].

    PubMed

    Chen, X; Yang, L; Wang, H J; Wu, B B; Lu, Y L; Dong, X R; Zhou, W H

    2018-05-02

    Objective: To analyze the hotspots of known pathogenic disease-causing variants of glucose-6-phosphate dehydrogenase (G6PD) and the phenotype spectrum of neonatal patients with known pathogenic disease-causing variants of G6PD. Methods: The known pathogenic disease-causing variants of G6PD were collected from the Human Gene Mutation Database. Screening was performed for these variants among the 7 966 cases (2 357 neonatal, 5 609 non-neonatal) in the sequencing database of the Molecular Diagnosis Center, Children's Hospital of Fudan University. All these samples were from patients suspected of having a genetic disorder. The database contained Whole Exon Sequencing data and Clinical Exon Sequencing data. We identified the patients with known pathogenic disease-causing variants of G6PD, and analyzed the hotspots of G6PD and the phenotype spectrum of neonatal patients with known pathogenic disease-causing variants of G6PD. Results: (1) Among the next generation sequencing data of the 7 966 samples, 86 samples (1.1%) were detected as positive for the known pathogenic disease-causing variants of G6PD (positive sample set). In the positive sample set, 51 patients (33 males, 18 females) were newborn babies. Forty-three patients (26 males, 17 females) had the enzyme activity data of G6PD. (2) Among the 86 samples, Arg463His, Arg459Leu, Leu342Phe, Val291Met were the leading 4 disease-causing variants found in 72 samples (84%). (3) Male neonatal patients with the same variants had statistically significant differences in enzyme activity: among 13 patients with Arg463His, enzyme activity of 9 patients was ranked as grade Ⅲ, 1 case ranked as Ⅳ, 3 cases had no activity data; among 10 patients with Arg459Leu, enzyme activity of 4 patients was ranked as Ⅱ, 4 cases ranked as Ⅲ, 2 cases had no activity data; among 2 patients with His32Arg, enzyme activity of one patient was ranked as Ⅱ, another was Ⅲ. Male neonatal patients with the same mutation and enzyme activity also had statistically significant differences in phenotype spectrum: among 9 patients with Arg463His and level Ⅲ enzyme activity, 6 presented hyperbilirubinemia, 2 met the criteria for exchange transfusion therapy, 2 showed hemolysis; among 4 patients with Arg459Leu and level Ⅱ enzyme activity, 3 presented hyperbilirubinemia; among 4 patients with Arg459Leu and level Ⅲ enzyme activity, 2 presented hyperbilirubinemia, 1 met the standard of exchange transfusion therapy; among 3 patients with Val291Met and level Ⅲ enzyme activity, 1 presented hyperbilirubinemia. Conclusions: Arg463His, Arg459Leu, Leu342Phe, Val291Met were the hotspot variants of G6PD. Patients with the same G6PD variants and sex present different phenotypes; patients with the same G6PD variants, sex and enzyme activity also present different phenotypes.

  7. Content based image retrieval using local binary pattern operator and data mining techniques.

    PubMed

    Vatamanu, Oana Astrid; Frandeş, Mirela; Lungeanu, Diana; Mihalaş, Gheorghe-Ioan

    2015-01-01

    Content based image retrieval (CBIR) concerns the retrieval of similar images from image databases, using feature vectors extracted from images. These feature vectors globally define the visual content present in an image, defined by e.g., texture, colour, shape, and spatial relations between vectors. Herein, we propose the definition of feature vectors using the Local Binary Pattern (LBP) operator. A study was performed in order to determine the optimum LBP variant for the general definition of image feature vectors. The chosen LBP variant is then subsequently used to build an ultrasound image database, and a database with images obtained from Wireless Capsule Endoscopy. The image indexing process is optimized using data clustering techniques for images belonging to the same class. Finally, the proposed indexing method is compared to the classical indexing technique, which is nowadays widely used.
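
    For readers unfamiliar with the Local Binary Pattern operator, a minimal Python sketch of the basic 3x3 form is given below. The abstract does not say which LBP variant was finally chosen, so this is the textbook operator only. The neighbour ordering, the function names and the 256-bin histogram used as a feature vector are illustrative assumptions.

```python
def lbp_code(image, r, c):
    """Basic 3x3 LBP code for the pixel at row r, column c.

    Each of the 8 neighbours contributes one bit: 1 if its grey value is
    greater than or equal to the centre value, otherwise 0. Neighbours are
    visited clockwise starting at the top-left corner (an arbitrary but
    fixed convention chosen for this sketch).
    """
    center = image[r][c]
    offsets = [(-1, -1), (-1, 0), (-1, 1), (0, 1),
               (1, 1), (1, 0), (1, -1), (0, -1)]
    code = 0
    for bit, (dr, dc) in enumerate(offsets):
        if image[r + dr][c + dc] >= center:
            code |= 1 << bit
    return code


def lbp_histogram(image):
    """256-bin histogram of LBP codes over all interior pixels."""
    hist = [0] * 256
    for r in range(1, len(image) - 1):
        for c in range(1, len(image[0]) - 1):
            hist[lbp_code(image, r, c)] += 1
    return hist


# Toy 3x3 grey-level patch; only the centre pixel has a full neighbourhood.
patch = [
    [6, 5, 2],
    [7, 6, 1],
    [9, 8, 7],
]
code = lbp_code(patch, 1, 1)
hist = lbp_histogram(patch)
```

    The histogram, rather than the per-pixel codes, is what would typically serve as the image feature vector for indexing and comparison.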

  8. Graphical user interfaces for symbol-oriented database visualization and interaction

    NASA Astrophysics Data System (ADS)

    Brinkschulte, Uwe; Siormanolakis, Marios; Vogelsang, Holger

    1997-04-01

    In this approach, two basic services designed for the engineering of computer based systems are combined: a symbol-oriented man-machine-service and a high speed database-service. The man-machine service is used to build graphical user interfaces (GUIs) for the database service; these interfaces are stored using the database service. The idea is to create a GUI-builder and a GUI-manager for the database service based upon the man-machine service using the concept of symbols. With user-definable and predefined symbols, database contents can be visualized and manipulated in a very flexible and intuitive way. Using the GUI-builder and GUI-manager, users can build and operate their own graphical user interface for a given database according to their needs without writing a single line of code.

  9. STOPGAP: a database for systematic target opportunity assessment by genetic association predictions.

    PubMed

    Shen, Judong; Song, Kijoung; Slater, Andrew J; Ferrero, Enrico; Nelson, Matthew R

    2017-09-01

    We developed the STOPGAP (Systematic Target OPportunity assessment by Genetic Association Predictions) database, an extensive catalog of human genetic associations mapped to effector gene candidates. STOPGAP draws on a variety of publicly available GWAS associations, linkage disequilibrium (LD) measures, functional genomic and variant annotation sources. Algorithms were developed to merge the association data, partition associations into non-overlapping LD clusters, map variants to genes and produce a variant-to-gene score used to rank the relative confidence among potential effector genes. This database can be used for a multitude of investigations into the genes and genetic mechanisms underlying inter-individual variation in human traits, as well as supporting drug discovery applications. Shell, R, Perl and Python scripts and STOPGAP R data files (version 2.5.1 at publication) are available at https://github.com/StatGenPRD/STOPGAP. Some of the most useful STOPGAP fields can be queried through an R Shiny web application at http://stopgapwebapp.com. Contact: matthew.r.nelson@gsk.com. Supplementary data are available at Bioinformatics online. © The Author (2017). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com

  10. Mutation update of transcription factor genes FOXE3, HSF4, MAF, and PITX3 causing cataracts and other developmental ocular defects.

    PubMed

    Anand, Deepti; Agrawal, Smriti A; Slavotinek, Anne; Lachke, Salil A

    2018-04-01

    Mutations in the transcription factor genes FOXE3, HSF4, MAF, and PITX3 cause congenital lens defects including cataracts that may be accompanied by defects in other components of the eye or in nonocular tissues. We comprehensively describe here all the variants in FOXE3, HSF4, MAF, and PITX3 genes linked to human developmental defects. A total of 52 variants for FOXE3, 18 variants for HSF4, 20 variants for MAF, and 19 variants for PITX3 identified so far in isolated cases or within families are documented. This effort reveals FOXE3, HSF4, MAF, and PITX3 to have 33, 16, 18, and 7 unique causal mutations, respectively. Loss-of-function mutant animals for these genes have served to model the pathobiology of the associated human defects, and we discuss the currently known molecular function of these genes, particularly with emphasis on their role in ocular development. Finally, we make the detailed FOXE3, HSF4, MAF, and PITX3 variant information available in the Leiden Online Variation Database (LOVD) platform at https://www.LOVD.nl/FOXE3, https://www.LOVD.nl/HSF4, https://www.LOVD.nl/MAF, and https://www.LOVD.nl/PITX3. Thus, this article informs on key variants in transcription factor genes linked to cataract, aphakia, corneal opacity, glaucoma, microcornea, microphthalmia, anterior segment mesenchymal dysgenesis, and Ayme-Gripp syndrome, and facilitates their access through Web-based databases. © 2018 Wiley Periodicals, Inc.

  11. TransAtlasDB: an integrated database connecting expression data, metadata and variants

    PubMed Central

    Adetunji, Modupeore O; Lamont, Susan J; Schmidt, Carl J

    2018-01-01

    Abstract High-throughput transcriptome sequencing (RNAseq) is the universally applied method for target-free transcript identification and gene expression quantification, generating huge amounts of data. The constraint of accessing such data and interpreting results can be a major impediment in postulating suitable hypotheses; thus, an innovative storage solution that addresses limitations such as hard disk storage requirements, efficiency and reproducibility is paramount. By offering a uniform data storage and retrieval mechanism, various data can be compared and easily investigated. We present a sophisticated system, TransAtlasDB, which incorporates a hybrid architecture of both relational and NoSQL databases for fast and efficient data storage, processing and querying of large datasets from transcript expression analysis with corresponding metadata, as well as gene-associated variants (such as SNPs) and their predicted gene effects. TransAtlasDB provides a data model for accurate storage of the large amount of data derived from RNAseq analysis, and also methods of interacting with the database, either via command-line data management workflows written in Perl, with useful functionalities that simplify data storage and, possibly, manipulation of the massive amounts of data generated from RNAseq analysis, or through the web interface. The database application is currently modeled to handle analysis data from agricultural species, and will be expanded to include more species groups. Overall, TransAtlasDB aims to serve as an accessible repository for the large, complex results data files derived from RNAseq gene expression profiling and variant analysis. Database URL: https://modupeore.github.io/TransAtlasDB/ PMID:29688361

  12. VarioML framework for comprehensive variation data representation and exchange.

    PubMed

    Byrne, Myles; Fokkema, Ivo Fac; Lancaster, Owen; Adamusiak, Tomasz; Ahonen-Bishopp, Anni; Atlan, David; Béroud, Christophe; Cornell, Michael; Dalgleish, Raymond; Devereau, Andrew; Patrinos, George P; Swertz, Morris A; Taschner, Peter Em; Thorisson, Gudmundur A; Vihinen, Mauno; Brookes, Anthony J; Muilu, Juha

    2012-10-03

    Sharing of data about variation and the associated phenotypes is a critical need, yet variant information can be arbitrarily complex, making a single standard vocabulary elusive and re-formatting difficult. Complex standards have proven too time-consuming to implement. The GEN2PHEN project addressed these difficulties by developing a comprehensive data model for capturing biomedical observations, Observ-OM, and building the VarioML format around it. VarioML pairs a simplified open specification for describing variants, with a toolkit for adapting the specification into one's own research workflow. Straightforward variant data can be captured, federated, and exchanged with no overhead; more complex data can be described, without loss of compatibility. The open specification enables push-button submission to gene variant databases (LSDBs) e.g., the Leiden Open Variation Database, using the Cafe Variome data publishing service, while VarioML bidirectionally transforms data between XML and web-application code formats, opening up new possibilities for open source web applications building on shared data. A Java implementation toolkit makes VarioML easily integrated into biomedical applications. VarioML is designed primarily for LSDB data submission and transfer scenarios, but can also be used as a standard variation data format for JSON and XML document databases and user interface components. VarioML is a set of tools and practices improving the availability, quality, and comprehensibility of human variation information. It enables researchers, diagnostic laboratories, and clinics to share that information with ease, clarity, and without ambiguity.

  13. VarioML framework for comprehensive variation data representation and exchange

    PubMed Central

    2012-01-01

    Background Sharing of data about variation and the associated phenotypes is a critical need, yet variant information can be arbitrarily complex, making a single standard vocabulary elusive and re-formatting difficult. Complex standards have proven too time-consuming to implement. Results The GEN2PHEN project addressed these difficulties by developing a comprehensive data model for capturing biomedical observations, Observ-OM, and building the VarioML format around it. VarioML pairs a simplified open specification for describing variants, with a toolkit for adapting the specification into one's own research workflow. Straightforward variant data can be captured, federated, and exchanged with no overhead; more complex data can be described, without loss of compatibility. The open specification enables push-button submission to gene variant databases (LSDBs) e.g., the Leiden Open Variation Database, using the Cafe Variome data publishing service, while VarioML bidirectionally transforms data between XML and web-application code formats, opening up new possibilities for open source web applications building on shared data. A Java implementation toolkit makes VarioML easily integrated into biomedical applications. VarioML is designed primarily for LSDB data submission and transfer scenarios, but can also be used as a standard variation data format for JSON and XML document databases and user interface components. Conclusions VarioML is a set of tools and practices improving the availability, quality, and comprehensibility of human variation information. It enables researchers, diagnostic laboratories, and clinics to share that information with ease, clarity, and without ambiguity. PMID:23031277

  14. GAVIN: Gene-Aware Variant INterpretation for medical sequencing.

    PubMed

    van der Velde, K Joeri; de Boer, Eddy N; van Diemen, Cleo C; Sikkema-Raddatz, Birgit; Abbott, Kristin M; Knopperts, Alain; Franke, Lude; Sijmons, Rolf H; de Koning, Tom J; Wijmenga, Cisca; Sinke, Richard J; Swertz, Morris A

    2017-01-16

    We present Gene-Aware Variant INterpretation (GAVIN), a new method that accurately classifies variants for clinical diagnostic purposes. Classifications are based on gene-specific calibrations of allele frequencies from the ExAC database, likely variant impact using SnpEff, and estimated deleteriousness based on CADD scores for >3000 genes. In a benchmark on 18 clinical gene sets, we achieve a sensitivity of 91.4% and a specificity of 76.9%. This accuracy is unmatched by 12 other tools. We provide GAVIN as an online MOLGENIS service to annotate VCF files and as an open source executable for use in bioinformatic pipelines. It can be found at http://molgenis.org/gavin .

  15. Compression of Index Term Dictionary in an Inverted-File-Oriented Database: Some Effective Algorithms.

    ERIC Educational Resources Information Center

    Wisniewski, Janusz L.

    1986-01-01

    Discussion of a new method of index term dictionary compression in an inverted-file-oriented database highlights a technique of word coding, which generates short fixed-length codes obtained from the index terms themselves by analysis of monogram and bigram statistical distributions. Substantial savings in communication channel utilization are…

  16. Recent developments in Cope-type hydroamination reactions of hydroxylamine and hydrazine derivatives.

    PubMed

    Beauchemin, André M

    2013-11-07

    Cope-type hydroaminations are versatile for the direct amination of alkenes, alkynes and allenes using hydroxylamines and hydrazine derivatives. These reactions occur via a concerted, 5-membered cyclic transition state that is the microscopic reverse of the Cope elimination. This article focuses on recent developments, including intermolecular variants, directed reactions, and asymmetric variants using aldehydes as tethering catalysts, and their applications in target-oriented synthesis.

  17. Economic Questions Raised in Iraq’s New Constitution

    DTIC Science & Technology

    2005-11-01

    oriented.[7] The economic sections of the Constitution agreed upon toward the end of August 2005 set a somewhat different tone than in earlier...of economic systems across the various regions. For example, slightly different variants of free market capitalism across the various U.S. states...three major regions of Iraq adopting variants of these three somewhat different and potentially competing economic systems. Whether or not these

  18. Phase Transition and Texture Evolution in the Ni-Mn-Ga Ferromagnetic Shape-Memory Alloys Studied by a Neutron Diffraction Technique

    NASA Astrophysics Data System (ADS)

    Nie, Z. H.; Wang, Y. D.; Wang, G. Y.; Richardson, J. W.; Wang, G.; Liu, Y. D.; Liaw, P. K.; Zuo, L.

    2008-12-01

    The phase transition and influence of the applied stress on the texture evolution in the as-cast Ni-Mn-Ga ferromagnetic shape-memory alloys were studied by the time-of-flight (TOF) neutron diffraction technique. The neutron diffraction experiments were performed on the General Purpose Powder Diffractometer (Argonne National Laboratory). Inverse pole figures were determined from the neutron data for characterizing the orientation distributions and variant selections of polycrystalline Ni-Mn-Ga alloys subjected to different uniaxial compression deformations. Texture analyses reveal that the initial texture for the parent phase in the as-cast specimen was composed of $\{001\}\langle 100\rangle$, $\{001\}\langle 110\rangle$, $\{011\}\langle 100\rangle$, and $\{011\}\langle 110\rangle$, which was weakened after the compression deformation. Moreover, a strong preferred selection of martensitic-twin variants ($\{110\}\langle 001\rangle$ and $\{100\}\langle 001\rangle$) was observed in the transformed martensite after a compression stress applied on the parent phase along the cylindrical axis of the specimens. The preferred selection of variants can be well explained by considering the grain/variant-orientation-dependent Bain-distortion energy.

  19. Force generation within tissues during development

    NASA Astrophysics Data System (ADS)

    Kasza, Karen

    During embryonic development, multicellular tissues physically change shape, move, and grow. Changes in epithelial tissue organization are often accomplished by local movements of cells that are driven largely by forces generated by the motor protein myosin II. These forces are patterned to orient cell movements, resulting in changes in tissue shape and organization to build functional tissues and organs. To investigate the mechanisms of force generation in vivo, we use the fruit fly embryo as a model system. Spatial patterns of forces orient cell movements to drive rapid tissue elongation along the head-to-tail axis of the embryo. I will describe how studying embryos generated with engineered myosin variants provides insight into where, when, and how forces are generated to efficiently reorganize tissues. We found that a myosin variant that is locked into the active or "on" state accelerates cell movements, while two mutant myosin variants associated with human disease produce slowed cell movement. These myosin variants all disrupt tissue elongation, but live imaging and biophysical measurements reveal distinct effects on myosin organization and dynamics within cells and uncover mechanisms that control the spatial and temporal patterns of force generation. These studies shed light not only on how defects in force generation contribute to disease but also on physical principles at work in active, living materials.

  20. Intrinsic magnetic properties of L10 FeNi obtained from meteorite NWA 6259

    NASA Astrophysics Data System (ADS)

    Poirier, Eric; Pinkerton, Frederick E.; Kubic, Robert; Mishra, Raja K.; Bordeaux, Nina; Mubarok, Arif; Lewis, Laura H.; Goldstein, Joseph I.; Skomski, Ralph; Barmak, Katayun

    2015-05-01

    FeNi having the tetragonal L10 crystal structure is a promising new rare-earth-free permanent magnet material. Laboratory synthesis is challenging; however, tetragonal L10 FeNi (the mineral "tetrataenite") has been characterized using specimens found in nickel-iron meteorites. Most notably, the meteorite NWA 6259 recovered from Northwest Africa is 95 vol. % tetrataenite with a composition of 43 at. % Ni. Hysteresis loops were measured as a function of sample orientation on a specimen cut from NWA 6259 in order to rigorously deduce the intrinsic hard magnetic properties of its L10 phase. Electron backscatter diffraction showed that NWA 6259 is strongly textured, containing L10 grains oriented along any one of the three equivalent cubic directions of the parent fcc structure. The magnetic structure was modeled as a superposition of the three orthonormal uniaxial variants. By simultaneously fitting first-quadrant magnetization data for 13 different orientations of the sample with respect to the applied field direction, the intrinsic magnetic properties were estimated to be saturation magnetization 4πMs = 14.7 kG and anisotropy field Ha = 14.4 kOe. The anisotropy constant K = 0.84 MJ/m³ is somewhat smaller than the value K = 1.3 MJ/m³ obtained by earlier researchers from nominally equiatomic FeNi prepared by neutron irradiation accompanied by annealing in a magnetic field, suggesting that higher Ni content (fewer Fe antisite defects) may improve the anisotropy. The fit also indicated that NWA 6259 contains one dominant variant (62% by volume), the remainder of the sample being a second variant, and the third variant being absent altogether.

  1. Intrinsic magnetic properties of L1(0) FeNi obtained from meteorite NWA 6259

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Poirier, E; Pinkerton, FE; Kubic, R

    2015-05-07

    FeNi having the tetragonal L1(0) crystal structure is a promising new rare-earth-free permanent magnet material. Laboratory synthesis is challenging; however, tetragonal L1(0) FeNi (the mineral "tetrataenite") has been characterized using specimens found in nickel-iron meteorites. Most notably, the meteorite NWA 6259 recovered from Northwest Africa is 95 vol.% tetrataenite with a composition of 43 at.% Ni. Hysteresis loops were measured as a function of sample orientation on a specimen cut from NWA 6259 in order to rigorously deduce the intrinsic hard magnetic properties of its L1(0) phase. Electron backscatter diffraction showed that NWA 6259 is strongly textured, containing L1(0) grains oriented along any one of the three equivalent cubic directions of the parent fcc structure. The magnetic structure was modeled as a superposition of the three orthonormal uniaxial variants. By simultaneously fitting first-quadrant magnetization data for 13 different orientations of the sample with respect to the applied field direction, the intrinsic magnetic properties were estimated to be saturation magnetization 4πMs = 14.7 kG and anisotropy field Ha = 14.4 kOe. The anisotropy constant K = 0.84 MJ/m³ is somewhat smaller than the value K = 1.3 MJ/m³ obtained by earlier researchers from nominally equiatomic FeNi prepared by neutron irradiation accompanied by annealing in a magnetic field, suggesting that higher Ni content (fewer Fe antisite defects) may improve the anisotropy. The fit also indicated that NWA 6259 contains one dominant variant (62% by volume), the remainder of the sample being a second variant, and the third variant being absent altogether. (C) 2015 AIP Publishing LLC.

  2. IMPACT web portal: oncology database integrating molecular profiles with actionable therapeutics.

    PubMed

    Hintzsche, Jennifer D; Yoo, Minjae; Kim, Jihye; Amato, Carol M; Robinson, William A; Tan, Aik Choon

    2018-04-20

    With the advancement of next generation sequencing technology, researchers are now able to identify important variants and structural changes in DNA and RNA in cancer patient samples. With this information, we can now correlate specific variants and/or structural changes with actionable therapeutics known to inhibit these variants. We introduce the creation of the IMPACT Web Portal, a new online resource that connects molecular profiles of tumors to approved drugs, investigational therapeutics and pharmacogenetics associated drugs. IMPACT Web Portal contains a total of 776 drugs connected to 1326 target genes and 435 target variants, fusion, and copy number alterations. The online IMPACT Web Portal allows users to search for various genetic alterations and connects them to three levels of actionable therapeutics. The results are categorized into 3 levels: Level 1 contains approved drugs separated into two groups; Level 1A contains approved drugs with variant specific information while Level 1B contains approved drugs with gene level information. Level 2 contains drugs currently in oncology clinical trials. Level 3 provides pharmacogenetic associations between approved drugs and genes. IMPACT Web Portal allows for sequencing data to be linked to actionable therapeutics for translational and drug repurposing research. The IMPACT Web Portal online resource allows users to query genes and variants to approved and investigational drugs. We envision that this resource will be a valuable database for personalized medicine and drug repurposing. IMPACT Web Portal is freely available for non-commercial use at http://tanlab.ucdenver.edu/IMPACT .
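    The three-level categorization described in this abstract can be sketched in a few lines of Python. This is only an illustration of the scheme as summarized above, not the portal's actual implementation: the `DrugLink` record, its field names, and the order in which the rules are checked are all assumptions made for the example, and the drug and gene names are placeholders.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional


class Level(Enum):
    L1A = "approved drug, variant-specific information"
    L1B = "approved drug, gene-level information"
    L2 = "drug currently in oncology clinical trials"
    L3 = "pharmacogenetic association (approved drug and gene)"


@dataclass(frozen=True)
class DrugLink:
    """Hypothetical record linking a drug to a gene or variant."""
    drug: str
    gene: str
    approved: bool
    variant_specific: bool = False
    in_oncology_trial: bool = False
    pharmacogenetic: bool = False


def classify(link: DrugLink) -> Optional[Level]:
    """Assign a level following the abstract's description.

    Assumption: pharmacogenetic links are checked first, so an approved
    drug flagged as pharmacogenetic lands in Level 3, not Level 1.
    """
    if link.approved and link.pharmacogenetic:
        return Level.L3
    if link.approved:
        return Level.L1A if link.variant_specific else Level.L1B
    if link.in_oncology_trial:
        return Level.L2
    return None  # not actionable under this sketch


# Placeholder records, not real drug-gene data.
examples = [
    DrugLink("drug_a", "GENE1", approved=True, variant_specific=True),
    DrugLink("drug_b", "GENE1", approved=True),
    DrugLink("drug_c", "GENE2", approved=False, in_oncology_trial=True),
    DrugLink("drug_d", "GENE3", approved=True, pharmacogenetic=True),
]
levels = [classify(e) for e in examples]
```

    Keeping the levels in an enumeration makes it straightforward to group query results by level before display.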

  3. Assessment of epithelial sodium channel variants in nonwhite cystic fibrosis patients with non-diagnostic CFTR genotypes.

    PubMed

    Brennan, Marie-Luise; Pique, Lynn M; Schrijver, Iris

    2016-01-01

    Several lines of evidence suggest a role for the epithelial sodium channel (ENaC) in cystic fibrosis (CF). The purpose of our study was to assess the contribution of genetic variants in the ENaC subunits (α, β, γ) in nonwhite CF patients in whom CFTR molecular testing has been non-diagnostic. Samples were obtained from patients who were nonwhite and whose molecular CFTR testing did not identify two mutations. Sequencing of the SCNN1A, B, and G genes was performed and variants assessed for pathogenicity and association with CF using databases, protein and splice site mutation analysis software, and literature review. We identified four nonsynonymous amino acid variants in SCNN1A, three in SCNN1B and one in SCNN1G. There was no convincing evidence of pathogenicity. Whereas all have been reported in the dbSNP database, only p.Ala334Thr, p.Val573Ile, and p.Thr663Ala in SCNN1A, p.Gly442Val in SCNN1B and p.Gly183Ser in SCNN1G were previously reported in ENaC genetic studies of CF or CF-like patients. Synonymous substitutions were also observed but novel synonymous variants were not detected. There is no conclusive association of ENaC genetic variants with CF in nonwhite CF patients. Copyright © 2015 European Cystic Fibrosis Society. Published by Elsevier B.V. All rights reserved.

  4. Rare missense variants in POT1 predispose to familial cutaneous malignant melanoma

    PubMed Central

    Shi, Jianxin; Yang, Xiaohong R.; Ballew, Bari; Rotunno, Melissa; Calista, Donato; Fargnoli, Maria Concetta; Ghiorzo, Paola; Paillerets, Brigitte Bressac-de; Nagore, Eduardo; Avril, Marie Francoise; Caporaso, Neil E.; McMaster, Mary L.; Cullen, Michael; Wang, Zhaoming; Zhang, Xijun; Bruno, William; Pastorino, Lorenza; Queirolo, Paola; Banuls-Roca, Jose; Garcia-Casado, Zaida; Vaysse, Amaury; Mohamdi, Hamida; Riazalhosseini, Yasser; Foglio, Mario; Jouenne, Fanélie; Hua, Xing; Hyland, Paula L.; Yin, Jinhu; Vallabhaneni, Haritha; Chai, Weihang; Minghetti, Paola; Pellegrini, Cristina; Ravichandran, Sarangan; Eggermont, Alexander; Lathrop, Mark; Peris, Ketty; Scarra, Giovanna Bianchi; Landi, Giorgio; Savage, Sharon A.; Sampson, Joshua N.; He, Ji; Yeager, Meredith; Goldin, Lynn R.; Demenais, Florence; Chanock, Stephen J.; Tucker, Margaret A.; Goldstein, Alisa M.; Liu, Yie; Landi, Maria Teresa

    2014-01-01

    Although CDKN2A is the most frequent high-risk melanoma susceptibility gene, the underlying genetic factors for most melanoma-prone families remain unknown. Using whole exome sequencing, we identified a rare variant that arose as a founder mutation in the telomere shelterin POT1 gene (g.7:124493086 C>T, Ser270Asn) in five unrelated melanoma-prone families from Romagna, Italy. Carriers of this variant had increased telomere length and elevated fragile telomeres suggesting that this variant perturbs telomere maintenance. Two additional rare POT1 variants were identified in all cases sequenced in two other Italian families, yielding a frequency of POT1 variants comparable to that of CDKN2A mutations in this population. These variants were not found in public databases or in 2,038 genotyped Italian controls. We also identified two rare recurrent POT1 variants in American and French familial melanoma cases. Our findings suggest that POT1 is a major susceptibility gene for familial melanoma in several populations. PMID:24686846

  5. Generalized Database Management System Support for Numeric Database Environments.

    ERIC Educational Resources Information Center

    Dominick, Wayne D.; Weathers, Peggy G.

    1982-01-01

    This overview of potential for utilizing database management systems (DBMS) within numeric database environments highlights: (1) major features, functions, and characteristics of DBMS; (2) applicability to numeric database environment needs and user needs; (3) current applications of DBMS technology; and (4) research-oriented and…

  6. CHASM and SNVBox: toolkit for detecting biologically important single nucleotide mutations in cancer.

    PubMed

    Wong, Wing Chung; Kim, Dewey; Carter, Hannah; Diekhans, Mark; Ryan, Michael C; Karchin, Rachel

    2011-08-01

    Thousands of cancer exomes are currently being sequenced, yielding millions of non-synonymous single nucleotide variants (SNVs) of possible relevance to disease etiology. Here, we provide a software toolkit to prioritize SNVs based on their predicted contribution to tumorigenesis. It includes a database of precomputed, predictive features covering all positions in the annotated human exome and can be used either stand-alone or as part of a larger variant discovery pipeline. MySQL database, source code and binaries are freely available for academic/government use at http://wiki.chasmsoftware.org. Source in Python and C++. Requires 32 or 64-bit Linux system (tested on Fedora Core 8, 10, 11 and Ubuntu 10), 2.5 ≤ Python < 3.0, MySQL server > 5.0, 60 GB available hard disk space (50 MB for software and data files, 40 GB for MySQL database dump when uncompressed), 2 GB of RAM.

  7. Rationale and uses of a public HIV drug-resistance database.

    PubMed

    Shafer, Robert W

    2006-09-15

    Knowledge regarding the drug resistance of human immunodeficiency virus (HIV) is critical for surveillance of drug resistance, development of antiretroviral drugs, and management of infections with drug-resistant viruses. Such knowledge is derived from studies that correlate genetic variation in the targets of therapy with the antiretroviral treatments received by persons from whom the variant was obtained (genotype-treatment), with drug-susceptibility data on genetic variants (genotype-phenotype), and with virological and clinical response to a new treatment regimen (genotype-outcome). An HIV drug-resistance database is required to represent, store, and analyze the diverse forms of data underlying our knowledge of drug resistance and to make these data available to the broad community of researchers studying drug resistance in HIV and clinicians using HIV drug-resistance tests. Such genotype-treatment, genotype-phenotype, and genotype-outcome correlations are contained in the Stanford HIV RT and Protease Sequence Database and have specific usefulness.

  8. Bedside Back to Bench: Building Bridges between Basic and Clinical Genomic Research.

    PubMed

    Manolio, Teri A; Fowler, Douglas M; Starita, Lea M; Haendel, Melissa A; MacArthur, Daniel G; Biesecker, Leslie G; Worthey, Elizabeth; Chisholm, Rex L; Green, Eric D; Jacob, Howard J; McLeod, Howard L; Roden, Dan; Rodriguez, Laura Lyman; Williams, Marc S; Cooper, Gregory M; Cox, Nancy J; Herman, Gail E; Kingsmore, Stephen; Lo, Cecilia; Lutz, Cathleen; MacRae, Calum A; Nussbaum, Robert L; Ordovas, Jose M; Ramos, Erin M; Robinson, Peter N; Rubinstein, Wendy S; Seidman, Christine; Stranger, Barbara E; Wang, Haoyi; Westerfield, Monte; Bult, Carol

    2017-03-23

    Genome sequencing has revolutionized the diagnosis of genetic diseases. Close collaborations between basic scientists and clinical genomicists are now needed to link genetic variants with disease causation. To facilitate such collaborations, we recommend prioritizing clinically relevant genes for functional studies, developing reference variant-phenotype databases, adopting phenotype description standards, and promoting data sharing. Published by Elsevier Inc.

  9. Bedside Back to Bench: Building Bridges between Basic and Clinical Genomic Research

    PubMed Central

    Manolio, Teri A.; Fowler, Douglas M.; Starita, Lea M.; Haendel, Melissa A.; MacArthur, Daniel G.; Biesecker, Leslie G.; Worthey, Elizabeth; Chisholm, Rex L.; Green, Eric D.; Jacob, Howard J.; McLeod, Howard L.; Roden, Dan; Rodriguez, Laura Lyman; Williams, Marc S.; Cooper, Gregory M.; Cox, Nancy J.; Herman, Gail E.; Kingsmore, Stephen; Lo, Cecilia; Lutz, Cathleen; MacRae, Calum A.; Nussbaum, Robert L.; Ordovas, Jose M.; Ramos, Erin M.; Robinson, Peter N.; Rubinstein, Wendy S.; Seidman, Christine; Stranger, Barbara E.; Wang, Haoyi; Westerfield, Monte; Bult, Carol

    2017-01-01

    Genome sequencing has revolutionized the diagnosis of genetic diseases. Close collaborations between basic scientists and clinical genomicists are now needed to link genetic variants with disease causation. To facilitate such collaborations, we recommend prioritizing clinically relevant genes for functional studies, developing reference variant-phenotype databases, adopting phenotype description standards, and promoting data sharing. PMID:28340351

  10. Using diverse U.S. beef cattle genomes to identify missense mutations in EPAS1, a gene associated with pulmonary hypertension

    USDA-ARS's Scientific Manuscript database

    The availability of whole genome sequence (WGS) data has made it possible to discover protein variants in silico. However, existing bovine WGS databases do not show data in a form conducive to protein variant analysis, and tend to underrepresent the breadth of genetic diversity in U.S. beef cattle...

  11. Mutation screening in the Greek population and evaluation of NLGN3 and NLGN4X genes causal factors for autism.

    PubMed

    Volaki, Konstantina; Pampanos, Andreas; Kitsiou-Tzeli, Sophia; Vrettou, Christina; Oikonomakis, Vasilis; Sofocleous, Christalena; Kanavakis, Emmanuel

    2013-10-01

    Molecular and neurobiological evidence for the involvement of neuroligins (particularly NLGN3 and NLGN4X genes) in autistic disorder is accumulating. However, previous mutation screening studies on these two genes have yielded controversial results. The present study explores, for the first time, the contribution of NLGN3 and NLGN4X genetic variants in Greek patients with autistic disorder. We analyzed the full exonic sequence of NLGN3 and NLGN4X genes in 40 patients strictly fulfilling the Diagnostic and Statistical Manual of Mental Disorders, 4th ed. criteria for autistic disorder. We identified nine nucleotide changes in NLGN4X--one probable causative mutation (p.K378R) previously reported by our research group, one novel variant (c.-206G>C), one nonvalidated single nucleotide polymorphism (SNP, rs111953947), and six known human SNPs reported in the SNP database--and one known human SNP in NLGN3 also reported in the SNP database. The variants identified are expected to be benign. However, they should be investigated in the context of variants in interacting cellular pathways to assess their contribution to the etiology of autism.

  12. Targeted mutation screening panels expose systematic population bias in detection of cystic fibrosis risk.

    PubMed

    Lim, Regine M; Silver, Ari J; Silver, Maxwell J; Borroto, Carlos; Spurrier, Brett; Petrossian, Tanya C; Larson, Jessica L; Silver, Lee M

    2016-02-01

    Carrier screening for mutations contributing to cystic fibrosis (CF) is typically accomplished with panels composed of variants that are clinically validated primarily in patients of European descent. This approach has created a static genetic and phenotypic profile for CF. An opportunity now exists to reevaluate the disease profile of CFTR at a global population level. CFTR allele and genotype frequencies were obtained from a nonpatient cohort with more than 60,000 unrelated personal genomes collected by the Exome Aggregation Consortium. Likely disease-contributing mutations were identified with the use of public database annotations and computational tools. We identified 131 previously described and likely pathogenic variants and another 210 untested variants with a high probability of causing protein damage. None of the current genetic screening panels or existing CFTR mutation databases covered a majority of deleterious variants in any geographical population outside of Europe. Both clinical annotation and mutation coverage by commercially available targeted screening panels for CF are strongly biased toward detection of reproductive risk in persons of European descent. South and East Asian populations are severely underrepresented, in part because of a definition of disease that preferences the phenotype associated with European-typical CFTR alleles.

  13. Identification of Rare Variants in TNNI3 with Atrial Fibrillation in a Chinese GeneID Population

    PubMed Central

    Wang, Chuchu; Wu, Manman; Qian, Jin; Li, Bin; Tu, Xin; Xu, Chengqi; Li, Sisi; Chen, Shanshan; Zhao, Yuanyuan; Huang, Yufeng; Shi, Lisong; Cheng, Xiang; Liao, Yuhua; Chen, Qiuyun; Xia, Yunlong; Yao, Wei; Wu, Gang; Cheng, Mian; Wang, Qing K.

    2015-01-01

    Despite advances by genome-wide association studies (GWAS), much of the heritability of common human diseases remains missing, a phenomenon referred to as ‘missing heritability’. One potential cause for ‘missing heritability’ is the rare susceptibility variants overlooked by GWAS. Atrial fibrillation (AF) is the most common arrhythmia seen at hospitals and increases risk of stroke by 5-fold and doubles risk of heart failure and sudden death. Here we studied one large Chinese family with AF and hypertrophic cardiomyopathy (HCM). Whole-exome sequencing analysis identified a mutation in TNNI3, R186Q, that co-segregated with the disease in the family, but did not exist in >1,583 controls, suggesting that R186Q causes AF and HCM. High-resolution melting curve analysis and direct DNA sequence analysis were then used to screen mutations in all exons and exon-intron boundaries of TNNI3 in a panel of 1,127 unrelated AF patients and 1,583 non-AF subjects. Four novel missense variants were identified in TNNI3, including E64G, M154L, E187G and D196G in four independent AF patients, but no variant was found in 1,583 non-AF subjects. None of the variants were found in public databases, including the ExAC Browser database with 60,706 exomes. These data suggest that rare TNNI3 variants are associated with AF (P=0.03). TNNI3 encodes troponin I, a key regulator of the contraction-relaxation function of cardiac muscle and was not previously implicated in AF. Thus, this study may identify a new biological pathway for the pathogenesis of AF and provides evidence to support the rare variant hypothesis for missing heritability. PMID:26169204

  14. Diversity and impact of rare variants in genes encoding the platelet G protein-coupled receptors.

    PubMed

    Jones, Matthew L; Norman, Jane E; Morgan, Neil V; Mundell, Stuart J; Lordkipanidzé, Marie; Lowe, Gillian C; Daly, Martina E; Simpson, Michael A; Drake, Sian; Watson, Steve P; Mumford, Andrew D

    2015-04-01

    Platelet responses to activating agonists are influenced by common population variants within or near G protein-coupled receptor (GPCR) genes that affect receptor activity. However, the impact of rare GPCR gene variants is unknown. We describe the rare single nucleotide variants (SNVs) in the coding and splice regions of 18 GPCR genes in 7,595 exomes from the 1,000-genomes and Exome Sequencing Project databases and in 31 cases with inherited platelet function disorders (IPFDs). In the population databases, the GPCR gene target regions contained 740 SNVs (318 synonymous, 410 missense, 7 stop gain and 6 splice region) of which 70 % had global minor allele frequency (MAF) < 0.05 %. Functional annotation using six computational algorithms, experimental evidence and structural data identified 156/740 (21 %) SNVs as potentially damaging to GPCR function, most commonly in regions encoding the transmembrane and C-terminal intracellular receptor domains. In 31 index cases with IPFDs (Gi-pathway defect n=15; secretion defect n=11; thromboxane pathway defect n=3 and complex defect n=2) there were 256 SNVs in the target regions of 15 stimulatory platelet GPCRs (34 unique; 12 with MAF< 1 % and 22 with MAF≥ 1 %). These included rare variants predicting R122H, P258T and V207A substitutions in the P2Y12 receptor that were annotated as potentially damaging, but only partially explained the platelet function defects in each case. Our data highlight that potentially damaging variants in platelet GPCR genes have low individual frequencies, but are collectively abundant in the population. Potentially damaging variants are also present in pedigrees with IPFDs and may contribute to complex laboratory phenotypes.
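    The split of unique variants into MAF < 1 % and MAF ≥ 1 % groups reported above amounts to a simple threshold partition. A minimal Python sketch follows, assuming allele frequencies are held as fractions in a dictionary; the function name, the variant identifiers, and the frequencies are hypothetical and are not taken from the study.

```python
def partition_by_maf(mafs, threshold=0.01):
    """Split variants into (rare, common) by minor allele frequency.

    `mafs` maps a variant identifier to its MAF expressed as a fraction,
    so the default threshold of 0.01 corresponds to 1 %.
    """
    rare = {v: f for v, f in mafs.items() if f < threshold}
    common = {v: f for v, f in mafs.items() if f >= threshold}
    return rare, common


# Illustrative values only; not data from the abstract.
example_mafs = {"var_a": 0.0004, "var_b": 0.01, "var_c": 0.2}
rare, common = partition_by_maf(example_mafs)
```

    A variant sitting exactly on the threshold is counted as common here, matching the "MAF ≥ 1 %" wording.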

  15. Study on the crystallographic orientation relationship and formation mechanism of reversed austenite in economical Cr12 super martensitic stainless steel

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Ye, Dong; Li, Shaohong; Li, Jun

    Effect of carbides and crystallographic orientation relationship on the formation mechanism of reversed austenite of economical Cr12 super martensitic stainless steel (SMSS) has been investigated mainly by transmission electron microscopy (TEM) and electron backscatter diffraction (EBSD). The results indicate that the M₂₃C₆ precipitation and the formation of the reversed austenite have the interaction effect during tempering process in SMSS. The reversed austenite forms intensively at the sub-block boundary and the lath boundary within a misorientation range of 0–60°. M₂₃C₆ has the same crystallographic orientation relationship with reversed austenite. There are two different kinds of formation modes for reversed austenite. One is a nondiffusional shear reversion; the other is a diffusion transformation. Both are strictly limited by crystallographic orientation relationship. The austenite variants are limited to two kinds within one packet and five kinds within one prior austenite grain. Highlights: • Reversed austenite forms at martensite boundaries with misorientation of 0–60°. • M₂₃C₆ precipitation and reversed austenite formation have the interaction effect. • Two austenite variants with different orientations can be formed inside a packet. • Two reversed austenite formation modes: shear reversion; diffusion transformation.

  16. Analysis of the Association between Catechol-O-Methyltransferase Val158Met and Male Sexual Orientation.

    PubMed

    Yu, Wei; Tu, Dan; Hong, Fuchang; Wang, Jing; Liu, Xiaoli; Cai, Yumao; Xu, Ruiwei; Zhao, Guanglu; Wang, Feng; Pan, Hong; Wu, Shinan; Feng, Tiejian; Wang, Binbin

    2015-09-01

    Male sexual orientation is thought to have a genetic component. However, previous studies have failed to generate positive results from among candidate genes. Catechol-O-methyltransferase (COMT), located on chromosome 22, has six exons, spans 27 kb, and encodes a protein of 271 amino acids. COMT has an important role in regulating the embryonic levels of catecholamine neurotransmitters (such as dopamine, norepinephrine, and epinephrine) and estrogens. COMT is also thought to be related to sexual orientation. This study aimed to investigate the relationship between the COMT Val158Met variant and male sexual orientation. We performed association analysis of the COMT gene single nucleotide polymorphism, Val158Met, in 409 homosexual cases and 387 heterosexual control Chinese men. COMT polymorphism status was determined using a polymerase chain reaction-based assay. Polymerase chain reaction was performed to genotype the COMT Val158Met polymorphism. The frequency differences of the genotype and allele distributions between the male homosexual and control groups were compared. Significant differences, both in genotype and alleles, between male homosexual individuals and controls indicated a genetic component related to male homosexuality. The Val allele recessive model could be an interrelated genetic model of the cause of male homosexuality. The COMT Val158Met variant might be associated with male sexual orientation and a recessive model was suggested. © 2015 International Society for Sexual Medicine.

  17. Seshat: A Web service for accurate annotation, validation, and analysis of TP53 variants generated by conventional and next-generation sequencing.

    PubMed

    Tikkanen, Tuomas; Leroy, Bernard; Fournier, Jean Louis; Risques, Rosa Ana; Malcikova, Jitka; Soussi, Thierry

    2018-07-01

    Accurate annotation of genomic variants in human diseases is essential to allow personalized medicine. Assessment of somatic and germline TP53 alterations has now reached the clinic and is required in several circumstances such as the identification of the most effective cancer therapy for patients with chronic lymphocytic leukemia (CLL). Here, we present Seshat, a Web service for annotating TP53 information derived from sequencing data. A flexible framework allows the use of standard file formats such as Mutation Annotation Format (MAF) or Variant Call Format (VCF), as well as common TXT files. Seshat performs accurate variant annotations using the Human Genome Variation Society (HGVS) nomenclature and the stable TP53 genomic reference provided by the Locus Reference Genomic (LRG). In addition, using the 2017 release of the UMD_TP53 database, Seshat provides multiple statistical information for each TP53 variant including database frequency, functional activity, or pathogenicity. The information is delivered in standardized output tables that minimize errors and facilitate comparison of mutational data across studies. Seshat is a beneficial tool to interpret the ever-growing TP53 sequencing data generated by multiple sequencing platforms and it is freely available via the TP53 Website, http://p53.fr or directly at http://vps338341.ovh.net/. © 2018 Wiley Periodicals, Inc.

  18. Factors influencing success of clinical genome sequencing across a broad spectrum of disorders

    PubMed Central

    Lise, Stefano; Broxholme, John; Cazier, Jean-Baptiste; Rimmer, Andy; Kanapin, Alexander; Lunter, Gerton; Fiddy, Simon; Allan, Chris; Aricescu, A. Radu; Attar, Moustafa; Babbs, Christian; Becq, Jennifer; Beeson, David; Bento, Celeste; Bignell, Patricia; Blair, Edward; Buckle, Veronica J; Bull, Katherine; Cais, Ondrej; Cario, Holger; Chapel, Helen; Copley, Richard R; Cornall, Richard; Craft, Jude; Dahan, Karin; Davenport, Emma E; Dendrou, Calliope; Devuyst, Olivier; Fenwick, Aimée L; Flint, Jonathan; Fugger, Lars; Gilbert, Rodney D; Goriely, Anne; Green, Angie; Greger, Ingo H.; Grocock, Russell; Gruszczyk, Anja V; Hastings, Robert; Hatton, Edouard; Higgs, Doug; Hill, Adrian; Holmes, Chris; Howard, Malcolm; Hughes, Linda; Humburg, Peter; Johnson, David; Karpe, Fredrik; Kingsbury, Zoya; Kini, Usha; Knight, Julian C; Krohn, Jonathan; Lamble, Sarah; Langman, Craig; Lonie, Lorne; Luck, Joshua; McCarthy, Davis; McGowan, Simon J; McMullin, Mary Frances; Miller, Kerry A; Murray, Lisa; Németh, Andrea H; Nesbit, M Andrew; Nutt, David; Ormondroyd, Elizabeth; Oturai, Annette Bang; Pagnamenta, Alistair; Patel, Smita Y; Percy, Melanie; Petousi, Nayia; Piazza, Paolo; Piret, Sian E; Polanco-Echeverry, Guadalupe; Popitsch, Niko; Powrie, Fiona; Pugh, Chris; Quek, Lynn; Robbins, Peter A; Robson, Kathryn; Russo, Alexandra; Sahgal, Natasha; van Schouwenburg, Pauline A; Schuh, Anna; Silverman, Earl; Simmons, Alison; Sørensen, Per Soelberg; Sweeney, Elizabeth; Taylor, John; Thakker, Rajesh V; Tomlinson, Ian; Trebes, Amy; Twigg, Stephen RF; Uhlig, Holm H; Vyas, Paresh; Vyse, Tim; Wall, Steven A; Watkins, Hugh; Whyte, Michael P; Witty, Lorna; Wright, Ben; Yau, Chris; Buck, David; Humphray, Sean; Ratcliffe, Peter J; Bell, John I; Wilkie, Andrew OM; Bentley, David; Donnelly, Peter; McVean, Gilean

    2015-01-01

    To assess factors influencing the success of whole genome sequencing for mainstream clinical diagnosis, we sequenced 217 individuals from 156 independent cases across a broad spectrum of disorders in whom prior screening had identified no pathogenic variants. We quantified the number of candidate variants identified using different strategies for variant calling, filtering, annotation and prioritisation. We found that jointly calling variants across samples, filtering against both local and external databases, deploying multiple annotation tools and using familial transmission above biological plausibility contributed to accuracy. Overall, we identified disease causing variants in 21% of cases, rising to 34% (23/68) for Mendelian disorders and 57% (8/14) in trios. We also discovered 32 potentially clinically actionable variants in 18 genes unrelated to the referral disorder, though only four were ultimately considered reportable. Our results demonstrate the value of genome sequencing for routine clinical diagnosis, but also highlight many outstanding challenges. PMID:25985138

  19. Ioss IO Subsystem

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Sjaardema, Gregory; Bauer, David; Erik, & Illescas

    2017-01-06

    The Ioss is a database-independent package for providing an object-oriented, abstract interface to IO capabilities for a finite element application; and concrete database interfaces which provide input and/or output to exodusII, xdmf, generated, and heartbeat database formats. The Ioss provides an object-oriented C++-based IO interface for a finite element application code. The application code can perform all IO operations through the Ioss interface which is typically at a higher abstraction level than the concrete database formats. The Ioss then performs the needed operations to translate the finite element data to the specific format required by the concrete database implementations. The Ioss currently supports interfaces to exodusII, xdmf, generated, and heartbeat formats, but additional formats can be easily added.

  20. GENEASE: Real time bioinformatics tool for multi-omics and disease ontology exploration, analysis and visualization.

    PubMed

    Ghandikota, Sudhir; Hershey, Gurjit K Khurana; Mersha, Tesfaye B

    2018-03-24

    Advances in high-throughput sequencing technologies have made it possible to generate multiple omics data at an unprecedented rate and scale. The accumulation of these omics data far outpaces the rate at which biologists can mine and generate new hypotheses to test experimentally. There is an urgent need to develop a myriad of powerful tools to efficiently and effectively search and filter these resources to address specific post-GWAS functional genomics questions. However, to date, these resources are scattered across several databases and often lack a unified portal for data annotation and analytics. In addition, existing tools to analyze and visualize these databases are highly fragmented, requiring researchers to access multiple applications and perform manual interventions for each gene or variant in an ad hoc fashion until all the questions are answered. In this study, we present GENEASE, a web-based one-stop bioinformatics tool designed to not only query and explore multi-omics and phenotype databases (e.g., GTEx, ClinVar, dbGaP, GWAS Catalog, ENCODE, Roadmap Epigenomics, KEGG, Reactome, Gene and Phenotype Ontology) in a single web interface but also to perform seamless post genome-wide association downstream functional and overlap analysis for non-coding regulatory variants. GENEASE accesses over 50 different databases in the public domain, including model organism-specific databases, to facilitate gene/variant and disease exploration, enrichment and overlap analysis in real time. It is a user-friendly tool with a point-and-click interface containing links for support information including a user manual and examples. GENEASE can be accessed freely at http://research.cchmc.org/mershalab/genease_new/login.html. Contact: Tesfaye.Mersha@cchmc.org, Sudhir.Ghandikota@cchmc.org. Supplementary data are available at Bioinformatics online.

  1. Scripps Genome ADVISER: Annotation and Distributed Variant Interpretation SERver

    PubMed Central

    Pham, Phillip H.; Shipman, William J.; Erikson, Galina A.; Schork, Nicholas J.; Torkamani, Ali

    2015-01-01

    Interpretation of human genomes is a major challenge. We present the Scripps Genome ADVISER (SG-ADVISER) suite, which aims to fill the gap between data generation and genome interpretation by performing holistic, in-depth, annotations and functional predictions on all variant types and effects. The SG-ADVISER suite includes a de-identification tool, a variant annotation web-server, and a user interface for inheritance and annotation-based filtration. SG-ADVISER allows users with no bioinformatics expertise to manipulate large volumes of variant data with ease – without the need to download large reference databases, install software, or use a command line interface. SG-ADVISER is freely available at genomics.scripps.edu/ADVISER. PMID:25706643

  2. Integrating heterogeneous databases in clustered medical care environments using object-oriented technology

    NASA Astrophysics Data System (ADS)

    Thakore, Arun K.; Sauer, Frank

    1994-05-01

    The organization of modern medical care environments into disease-related clusters, such as a cancer center, a diabetes clinic, etc., has the side-effect of introducing multiple heterogeneous databases, often containing similar information, within the same organization. This heterogeneity fosters incompatibility and prevents the effective sharing of data amongst applications at different sites. Although integration of heterogeneous databases is now feasible, in the medical arena this is often an ad hoc process, not founded on proven database technology or formal methods. In this paper we illustrate the use of a high-level object-oriented semantic association method to model information found in different databases into an integrated conceptual global model that integrates the databases. We provide examples from the medical domain to illustrate an integration approach resulting in a consistent global view, without attacking the autonomy of the underlying databases.

  3. Hb Mozhaisk [β92(F8)His→Arg; HBB: c.278A>G] as a De Novo Mutation in a Child of Mixed Ethnic Origins.

    PubMed

    Benzoni, Elena; Giannone, Valentina; Michetti, Laura; Seia, Manuela; Cavalleri, Laura; Curcio, Cristina

    Approximately 150 variants described in the HbVar database have been found to be unstable and about 80.0% of these are on the β-globin gene. We describe the case of a 3-year-old child who presented at the emergency room with fever and asthenia. Hematological data suggested severe hemolytic anemia. Sequencing of the β-globin gene revealed the mutation HBB: c.278A>G at codon 92 in a heterozygous state, reported as Hb Mozhaisk in the HbVar database. Other family members did not have Hb Mozhaisk, thus, this variant is due to a de novo mutation. Because of the rarity of this globin variant, we believe it is important to report similar cases, to have a more complete phenotype description of the pathology and define an adequate reproductive risk for couples, considering the dominant inheritance pattern (hence an inheritance risk of 50.0%).

  4. Integrating 400 million variants from 80,000 human samples with extensive annotations: towards a knowledge base to analyze disease cohorts.

    PubMed

    Hakenberg, Jörg; Cheng, Wei-Yi; Thomas, Philippe; Wang, Ying-Chih; Uzilov, Andrew V; Chen, Rong

    2016-01-08

    Data from a plethora of high-throughput sequencing studies is readily available to researchers, providing genetic variants detected in a variety of healthy and disease populations. While each individual cohort helps gain insights into polymorphic and disease-associated variants, a joint perspective can be more powerful in identifying polymorphisms, rare variants, disease-associations, genetic burden, somatic variants, and disease mechanisms. We have set up a Reference Variant Store (RVS) containing variants observed in a number of large-scale sequencing efforts, such as 1000 Genomes, ExAC, Scripps Wellderly, UK10K; various genotyping studies; and disease association databases. RVS holds extensive annotations pertaining to affected genes, functional impacts, disease associations, and population frequencies. RVS currently stores 400 million distinct variants observed in more than 80,000 human samples. RVS facilitates cross-study analysis to discover novel genetic risk factors, gene-disease associations, potential disease mechanisms, and actionable variants. Due to its large reference populations, RVS can also be employed for variant filtration and gene prioritization. A web interface to public datasets and annotations in RVS is available at https://rvs.u.hpc.mssm.edu/.

  5. Population genetics of chronic kidney disease: the evolving story of APOL1.

    PubMed

    Wasser, Walter G; Tzur, Shay; Wolday, Dawit; Adu, Dwomoa; Baumstein, Donald; Rosset, Saharon; Skorecki, Karl

    2012-01-01

    Advances in human genome sequencing and generation of public databases of genomic diversity enable nephrologists to re-examine the genetics of common, complex kidney diseases. Non-diabetic kidney diseases prevalent in African ancestry populations and the allelic variation described in chromosome 22q12.3 is one such illustrative example. Newly available genomic database information enabled research groups to discover common functional DNA sequence risk variants in the APOL1 gene. These variants (termed G1 and G2) evolved to confer protection from a species of trypanosomal infection and thus achieved high prominence in many geographic regions of Africa and have been carried over to African diaspora communities worldwide. Since these discoveries two years ago, new insights have been gained: localization of APOL1 in normal and disease kidney tissues; influence of the APOL1 variants on the histopathology of HIV kidney disease; possible association with kidney transplant durability; onset of kidney failure at a younger age; association with blood lipid concentrations; more precise geographic localization of individuals with these variants to western and southern African ancestry; and the absence of the variants and kidney disease predisposition in Ethiopians. The definition of APOL1 nephropathy also confirms the long-held assumption by many clinicians that kidney disease attributed to hypertension in African populations represents an underlying glomerulopathy. Still awaited is the delineation of the biologic mechanisms of cellular injury related to these variants, to provide biologic proof of the APOL1 association and to provide potential targets for preventive and therapeutic intervention.

  6. An object-oriented approach to the management of meteorological and hydrological data

    NASA Technical Reports Server (NTRS)

    Graves, S. J.; Williams, S. F.; Criswell, E. A.

    1990-01-01

    An interface to several meteorological and hydrological databases has been developed that enables researchers to access and interrelate data efficiently through a customized menu system. By extending a relational database system with object-oriented concepts, each user or group of users may have different 'views' of the data, allowing access to data in customized ways without altering the organization of the database. An application to COHMEX and WetNet, two earth science projects within NASA Marshall Space Flight Center's Earth Science and Applications Division, is described.

  7. United States Air Force Summer Research Program -- 1993. Volume 4. Rome Laboratory

    DTIC Science & Technology

    1993-12-01

    H., eds., Object-Oriented Concepts, Databases, and Applications, Addison-Wesley, Reading, MA, 1989. [Lano91] Lano, K., "Z++, An Object-Orientated...1433 46.92 60 TCP janus.rl.af.mil mensa.rl.af.mil 1433 2611 The Target Filter Manager responds to requests for data and accesses the target database. A...[Figure 12. Contour plot of antenna pattern, QC2 algorithm; horizontal axis: azimuth] UPDATING PROBABILISTIC DATABASES Michael A

  8. Software - Naval Oceanography Portal

    Science.gov Websites

    USNO Earth Orientation software page. Listed sections: Search databases; Auxiliary Software; Supporting Software; Form Folder; Earth Orientation Matrix Calculator.

  9. Ultrasound imaging of the thenar motor branch of the median nerve: a cadaveric study.

    PubMed

    Petrover, David; Bellity, Jonathan; Vigan, Marie; Nizard, Remy; Hakime, Antoine

    2017-11-01

    Anatomic variations of the median nerve (MN) increase the risk of iatrogenic injury during carpal tunnel release surgery. We investigated whether high-frequency ultrasonography could identify anatomic variations of the MN and its thenar motor branch (MBMN) in the carpal tunnel. For each volar wrist of healthy non-embalmed cadavers, the type of MN variant (Lanz classification), course and orientation of the MBMN, and presence of hypertrophic muscles were scored by 18-MHz ultrasound and then by dissection. MBMN was identified by ultrasound in all 30 wrists (15 subjects). By dissection, type 1, 2 and 3 variants were found in 84%, 3%, and 13% of wrists, respectively. Ultrasound had good agreement with dissection in identifying the variant type (kappa =0.9). With both techniques, extra-, sub-, and transligamentous courses were recorded in 65%, 31%, and 4% of cases, respectively. With both techniques, the bifid nerve, hypertrophic muscles, and bilateral symmetry for variant type were identified in 13.3%, 13.3%, and 86.7% of wrists, respectively. Agreement between ultrasound and dissection was excellent for the MBMN course and orientation (kappa =1). Ultrasound can be used reliably to identify anatomic variations of the MN and MBMN. It could be a useful tool before carpal tunnel release surgery. • Ultrasound can identify variations of the motor branch of the median nerve. • Ultrasound mapping should be used prior to carpal tunnel release surgery. • All sub-, extra-, and transligamentous courses were accurately identified. • Type 3 variants (bifid nerve), hypertrophic muscles, and bilateral symmetry were accurately identified.
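    The agreement statistic quoted in the record above (kappa) can be illustrated with a minimal sketch of Cohen's kappa for two raters. The function name and the label lists are hypothetical and made up for illustration; they are not the study's data, and the abstract does not say which software the authors used.

```python
from collections import Counter
from typing import Hashable, Sequence


def cohens_kappa(rater_a: Sequence[Hashable], rater_b: Sequence[Hashable]) -> float:
    """Cohen's kappa: agreement between two raters, corrected for chance."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("both raters must label the same non-empty set of items")
    n = len(rater_a)
    # Observed agreement: fraction of items given the same label by both raters.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Chance agreement: product of the raters' marginal label frequencies.
    count_a, count_b = Counter(rater_a), Counter(rater_b)
    labels = set(count_a) | set(count_b)
    expected = sum(count_a[k] * count_b[k] for k in labels) / (n * n)
    if expected == 1.0:
        # Both raters used a single identical label; agreement is trivially perfect.
        return 1.0
    return (observed - expected) / (1.0 - expected)


# Illustrative labels only (e.g. variant type scored by two methods).
method_1 = [1, 1, 2, 2]
method_2 = [1, 1, 2, 2]
print(cohens_kappa(method_1, method_2))
```

    A kappa of 1 means the two methods agree on every item; a value near 0 means agreement no better than chance.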

  10. Asynchronous Data Retrieval from an Object-Oriented Database

    NASA Astrophysics Data System (ADS)

    Gilbert, Jonathan P.; Bic, Lubomir

    We present an object-oriented semantic database model which, similar to other object-oriented systems, combines the virtues of four concepts: the functional data model, a property inheritance hierarchy, abstract data types and message-driven computation. The main emphasis is on the last of these four concepts. We describe generic procedures that permit queries to be processed in a purely message-driven manner. A database is represented as a network of nodes and directed arcs, in which each node is a logical processing element, capable of communicating with other nodes by exchanging messages. This eliminates the need for shared memory and for centralized control during query processing. Hence, the model is suitable for implementation on a multiprocessor computer architecture, consisting of large numbers of loosely coupled processing elements.

  11. Empowered genome community: leveraging a bioinformatics platform as a citizen-scientist collaboration tool.

    PubMed

    Wendelsdorf, Katherine; Shah, Sohela

    2015-09-01

    There is an ongoing effort in the biomedical research community to leverage Next Generation Sequencing (NGS) technology to identify genetic variants that affect our health. The main challenge facing researchers is getting enough samples from individuals, either sick or healthy, to be able to reliably identify the few variants that are causal for a phenotype among all other variants typically seen among individuals. At the same time, more and more individuals are having their genome sequenced either out of curiosity or to identify the cause of an illness. These individuals may benefit from a way to view and understand their data. QIAGEN's Ingenuity Variant Analysis is an online application that allows users with and without extensive bioinformatics training to incorporate information from published experiments, genetic databases, and a variety of statistical models to identify variants, from a long list of candidates, that are most likely causal for a phenotype as well as annotate variants with what is already known about them in the literature and databases. Ingenuity Variant Analysis is also an information sharing platform where users may exchange samples and analyses. The Empowered Genome Community (EGC) is a new program in which QIAGEN is making this online tool freely available to any individual who wishes to analyze their own genetic sequence. EGC members are then able to make their data available to other Ingenuity Variant Analysis users to be used in research. Here we present and describe the Empowered Genome Community in detail. We also present a preliminary, proof-of-concept study that utilizes the 200 genomes currently available through the EGC. The goal of this program is to allow individuals to access and understand their own data as well as facilitate citizen-scientist collaborations that can drive research forward and spur quality scientific dialogue in the general public.

  12. The PBII gene of the human salivary proline-rich protein P-B produces another protein, Q504X8, with an opiorphin homolog, QRGPR.

    PubMed

    Saitoh, Eiichi; Sega, Takuya; Imai, Akane; Isemura, Satoko; Kato, Tetsuo; Ochiai, Akihito; Taniguchi, Masayuki

    2018-04-01

    The NCBI gene database and human-transcriptome database for alternative splicing were used to determine the expression of mRNAs for P-B (SMR3B) and a variant form of P-B. The translational product from the former mRNA was identified as the protein named P-B, whereas that from the latter has not yet been elucidated. In the present study, we investigated the expression of P-B and its variant form at the protein level. To identify the variant protein of P-B, (1) cationic proteins with a higher isoelectric point in human pooled whole saliva were purified by two-dimensional liquid chromatography; (2) the peptide fragments generated by in-solution trypsin digestion of all proteins were separated and analyzed by MALDI-TOF-MS; and (3) the presence or absence of P-B in individual saliva was examined by 15% SDS-PAGE. The peptide sequences (I 37 PPPYSCTPNMNNCSR 52 , C 53 HHHHKRHHYPCNYCFCYPK 72 , R 59 HHYPCNYCFCYPK 72 and H 60 HYPCNYCFCYPK 72 ) present in the variant protein of P-B were identified. The peptide sequence (G 6 PYPPGPLAPPQPFGPGFVPPPPPPPYGPGR 36 ) in P-B (or the variant) and sequence (I 37 PPPPPAPYGPGIFPPPPPQP 57 ) in P-B were identified. The sum of the sequences identified indicated a 91.23% sequence identity for P-B and 79.76% for the variant. P-B was present in the saliva of some individuals but absent from that of others. The variant protein is produced by excising a non-canonical intron (CC-AC pair) from the 3'-noncoding sequence of the PBII gene. Both P-B and the variant are subject to proteolysis in the oral cavity. Copyright © 2018 Elsevier Ltd. All rights reserved.

  13. Gender Variance and Sexual Orientation Among Male Spirit Mediums in Myanmar.

    PubMed

    Coleman, Eli; Allen, Mariette Pathy; Ford, Jessie V

    2018-05-01

    This article describes the gender identity, gender expression, and sexual orientation of male spirit mediums in Myanmar. Our analysis is based on ethnographic work, field observation, and 10 semi-structured interviews. These observations were conducted from 2010 to 2015, mostly in Mandalay, with some fieldwork in Yangon and Bagan. The focus of this investigation was specifically on achout (gender variant individuals) who were spirit mediums (nat kadaw). Semi-structured interviews explored the ways that participants understood their gender identity, gender expression, and sexuality in relation to their work as spirit mediums and broader social life. Myanmar remains quite a homophobic and transphobic culture but is undergoing rapid economic and social change. Therefore, it provides an interesting context to study how safe spaces are produced for sexual/gender minorities amidst broader social change. We find that, through the animistic belief structure, there is a growing space for gender nonconforming people, gender variant, and same-sex-oriented individuals (achout) to neutralize their stigmatized status and attain a level of respect and economic advantage. Their ability to become nat kadaw (mediums of spirits) mitigates or trumps their stigmatized status.

  14. Fully invariant wavelet enhanced minimum average correlation energy filter for object recognition in cluttered and occluded environments

    NASA Astrophysics Data System (ADS)

    Tehsin, Sara; Rehman, Saad; Riaz, Farhan; Saeed, Omer; Hassan, Ali; Khan, Muazzam; Alam, Muhammad S.

    2017-05-01

    A fully invariant system helps in resolving difficulties in object detection when camera or object orientation and position are unknown. In this paper, the proposed correlation filter based mechanism provides the capability to suppress noise, clutter and occlusion. Minimum Average Correlation Energy (MACE) filter yields sharp correlation peaks while considering the controlled correlation peak value. Difference of Gaussian (DOG) Wavelet has been added at the preprocessing stage in proposed filter design that facilitates target detection in orientation variant cluttered environment. Logarithmic transformation is combined with a DOG composite minimum average correlation energy filter (WMACE), capable of producing sharp correlation peaks despite any kind of geometric distortion of target object. The proposed filter has shown improved performance over some of the other variant correlation filters which are discussed in the result section.

  15. Interaction of birth order, handedness, and sexual orientation in the Kinsey interview data.

    PubMed

    Bogaert, Anthony F; Blanchard, Ray; Crosthwait, Lesley E

    2007-10-01

    Recent evidence indicates that 2 of the most consistently observed correlates of men's sexual orientation--handedness and older brothers--may be linked interactively in their prediction of men's sexual orientation. In this article, the authors studied the relationship among handedness, older brothers, and men's sexual orientation in the large and historically significant database originally compiled by Alfred C. Kinsey and his colleagues (A. C. Kinsey, W. B. Pomeroy, & C. E. Martin, 1948). The results demonstrated that handedness moderates the relationship between older brothers and sexual orientation. Specifically, older brothers increased the odds of homosexuality in right-handers only; in non-righthanders, older brothers did not affect the odds of homosexuality. These results refine the possible biological explanations reported to underlie both the handedness and older brother relationships to men's sexual orientation. These results also suggest that biological explanations of men's sexual orientation are likely relevant across time, as the Kinsey data comprise an older cohort relative to modern samples. (PsycINFO Database Record (c) 2007 APA, all rights reserved).

  16. BlackOPs: increasing confidence in variant detection through mappability filtering.

    PubMed

    Cabanski, Christopher R; Wilkerson, Matthew D; Soloway, Matthew; Parker, Joel S; Liu, Jinze; Prins, Jan F; Marron, J S; Perou, Charles M; Hayes, D Neil

    2013-10-01

    Identifying variants using high-throughput sequencing data is currently a challenge because true biological variants can be indistinguishable from technical artifacts. One source of technical artifact results from incorrectly aligning experimentally observed sequences to their true genomic origin ('mismapping') and inferring differences in mismapped sequences to be true variants. We developed BlackOPs, an open-source tool that simulates experimental RNA-seq and DNA whole exome sequences derived from the reference genome, aligns these sequences by custom parameters, detects variants and outputs a blacklist of positions and alleles caused by mismapping. Blacklists contain thousands of artifact variants that are indistinguishable from true variants and, for a given sample, are expected to be almost completely false positives. We show that these blacklist positions are specific to the alignment algorithm and read length used, and BlackOPs allows users to generate a blacklist specific to their experimental setup. We queried the dbSNP and COSMIC variant databases and found numerous variants indistinguishable from mapping errors. We demonstrate how filtering against blacklist positions reduces the number of potential false variants using an RNA-seq glioblastoma cell line data set. In summary, accounting for mapping-caused variants tuned to experimental setups reduces false positives and, therefore, improves genome characterization by high-throughput sequencing.
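    The blacklist filtering step described in the record above can be sketched as follows. This is a minimal illustration under stated assumptions, not BlackOPs itself: the `Variant` type, the `filter_blacklisted` function, and the example calls are hypothetical, and a blacklist entry is assumed to be a (chromosome, position, alternate allele) triple, following the abstract's "positions and alleles".

```python
from typing import Iterable, List, NamedTuple


class Variant(NamedTuple):
    """Hypothetical minimal variant record: chromosome, 1-based position, alternate allele."""
    chrom: str
    pos: int
    alt: str


def filter_blacklisted(calls: Iterable[Variant], blacklist: Iterable[Variant]) -> List[Variant]:
    """Return the calls whose (chrom, pos, alt) does not appear in the blacklist."""
    blocked = set(blacklist)  # set membership keeps the filter linear in the number of calls
    return [v for v in calls if v not in blocked]


# Made-up example data, for illustration only.
calls = [Variant("chr1", 100, "A"), Variant("chr1", 250, "T"), Variant("chr2", 42, "G")]
blacklist = [Variant("chr1", 250, "T")]  # assumed to come from a mismapping simulation
kept = filter_blacklisted(calls, blacklist)
print(kept)
```

    Matching on the allele as well as the position means a different alternate allele at a blacklisted position is still retained. Because the abstract notes that blacklists are specific to the aligner and read length, the same blacklist should not be reused across experimental setups.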

  17. A survey of commercial object-oriented database management systems

    NASA Technical Reports Server (NTRS)

    Atkins, John

    1992-01-01

    The object-oriented data model is the culmination of over thirty years of database research. Initially, database research focused on the need to provide information in a consistent and efficient manner to the business community. Early data models such as the hierarchical model and the network model met the goal of consistent and efficient access to data and were substantial improvements over simple file mechanisms for storing and accessing data. However, these models required highly skilled programmers to provide access to the data. Consequently, in the early 70's E.F. Codd, an IBM research computer scientist, proposed a new data model based on the simple mathematical notion of the relation. This model is known as the Relational Model. In the relational model, data is represented in flat tables (or relations) which have no physical or internal links between them. The simplicity of this model fostered the development of powerful but relatively simple query languages that now made data directly accessible to the general database user. Except for large, multi-user database systems, a database professional was in general no longer necessary. Database professionals found that traditional data in the form of character data, dates, and numeric data were easily represented and managed via the relational model. Commercial relational database management systems proliferated and performance of relational databases improved dramatically. However, there was a growing community of potential database users whose needs were not met by the relational model. These users needed to store data with data types not available in the relational model and required a far richer modelling environment than that provided by the relational model. Indeed, the complexity of the objects to be represented in the model mandated a new approach to database technology. The Object-Oriented Model was the result.

  18. Towards the Architecture of an Instructional Multimedia Database.

    ERIC Educational Resources Information Center

    Verhagen, Plin W.; Bestebreurtje, R.

    1994-01-01

    Discussion of multimedia databases in education focuses on the development of an adaptable database in The Netherlands that uses optical storage media to hold the audiovisual components. Highlights include types of applications; types of users; accessibility; adaptation; an object-oriented approach; levels of the database architecture; and…

  19. The Effects of Purpose Orientations on Recent High School Graduates' College Application Decisions

    ERIC Educational Resources Information Center

    Sharma, Gitima; Kim, Jungnam; Bryan, Julia

    2017-01-01

    Using the 2002 Educational Longitudinal Study database, the authors examined the different types of purpose orientations amongst a nationally representative sample of adolescents and the effect of these purpose orientations on high school graduates' college application decisions. Results indicated four types of purpose orientations: career,…

  20. A searchable, whole genome resource designed for protein variant analysis in diverse lineages of U.S. beef cattle

    USDA-ARS's Scientific Manuscript database

    A key feature of a gene's function is the variety of protein isoforms it encodes in a population. However, the genetic diversity in bovine whole genome databases tends to be underrepresented because these databases contain an abundance of sequence from the most influential sires. Our first aim was ...

  1. Biomedical Requirements for High Productivity Computing Systems

    DTIC Science & Technology

    2005-04-01

    server at http://www.ncbi.nlm.nih.gov/BLAST/. There are many variants of BLAST, including: 1. BLASTN - Compares a DNA query to a DNA database. Searches ...database (3 reading frames from each strand of the DNA) searching. 4. TBLASTN - Compares a protein query to a DNA database, in the 6 possible...the molecular during this phase. After eliminating molecules that could not match the query, an atom-by-atom search for the molecules is conducted

  2. Mutation databases and other online sites as a resource for transfusion medicine: history and attributes.

    PubMed

    Blumenfeld, Olga O

    2002-04-01

    Recent advances in molecular biology and technology have provided evidence, at a molecular level, for long-known observations that the human genome is not unique but is characterized by individual sequence variation. At the present time, documentation of genetic variation occurring in a large number of genes is increasing exponentially. The characterization of alleles that encode a variety of blood group antigens has been particularly fruitful for transfusion medicine. Phenotypic variation, as identified by the serologic study of blood group variants, is required to identify the presence of a variant allele. Many of the other alleles currently recorded have been selected and identified on the basis of inherited disease traits. New approaches document single nucleotide polymorphisms that occur throughout the genome and best show how the DNA sequence varies in the human population. The primary data dealing with variant alleles or more general genomic variation are scattered throughout the scientific literature and only within the last few years has information begun to be organized into databases. This article provides guidance on how to access those databases online as a source of information about genetic variation for purposes of molecular, clinical, and diagnostic medicine, research, and teaching. The attributes of the sites are described. A more detailed view of the database dealing specifically with alleles of genes encoding the blood group antigens includes a brief preliminary analysis of the molecular basis for observed polymorphisms. Other online sites that may be particularly useful to the transfusion medicine readership as well as a brief historical account are also presented. Copyright 2002, Elsevier Science (USA). All rights reserved.

  3. MECP2 variation in Rett syndrome-An overview of current coverage of genetic and phenotype data within existing databases.

    PubMed

    Townend, Gillian S; Ehrhart, Friederike; van Kranen, Henk J; Wilkinson, Mark; Jacobsen, Annika; Roos, Marco; Willighagen, Egon L; van Enckevort, David; Evelo, Chris T; Curfs, Leopold M G

    2018-04-27

    Rett syndrome (RTT) is a monogenic rare disorder that causes severe neurological problems. In most cases, it results from a loss-of-function mutation in the gene encoding methyl-CpG-binding protein 2 (MECP2). Currently, about 900 unique MECP2 variations (benign and pathogenic) have been identified and it is suspected that the different mutations contribute to different levels of disease severity. For researchers and clinicians, it is important that genotype-phenotype information is available to identify disease-causing mutations for diagnosis, to aid in clinical management of the disorder, and to provide counseling for parents. In this study, 13 genotype-phenotype databases were surveyed for their general functionality and availability of RTT-specific MECP2 variation data. For each database, we investigated findability and interoperability alongside practical user functionality, and type and amount of genetic and phenotype data. The main conclusions are that, as well as being challenging to find these databases and specific MECP2 variants held within, interoperability is as yet poorly developed and requires effort to search across databases. Nevertheless, we found several thousand online database entries for MECP2 variations and their associated phenotypes, diagnosis, or predicted variant effects, which is a good starting point for researchers and clinicians who want to provide, annotate, and use the data. © 2018 The Authors. Human Mutation published by Wiley Periodicals, Inc.

  4. Experimental evidence of stress-field-induced selection of variants in Ni-Mn-Ga ferromagnetic shape-memory alloys

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Wang, Y. D.; Key Laboratory for Anisotropy and Texture of Materials; Brown, D. W.

    2007-05-01

    The in situ time-of-flight neutron-diffraction measurements captured well the martensitic transformation behavior of the Ni-Mn-Ga ferromagnetic shape-memory alloys under uniaxial stress fields. We found that a small uniaxial stress applied during phase transformation dramatically disturbed the distribution of variants in the product phase. The observed changes in the distributions of variants may be explained by considering the role of the minimum distortion energy of the Bain transformation in the effective partition among the variants belonging to the same orientation of parent phase. It was also found that transformation kinetics under various stress fields follows the scale law. The present investigations provide the fundamental approach for scaling the evolution of microstructures in martensitic transitions, which is of general interest to the condensed matter community.

  5. Common variants in Mendelian kidney disease genes and their association with renal function.

    PubMed

    Parsa, Afshin; Fuchsberger, Christian; Köttgen, Anna; O'Seaghdha, Conall M; Pattaro, Cristian; de Andrade, Mariza; Chasman, Daniel I; Teumer, Alexander; Endlich, Karlhans; Olden, Matthias; Chen, Ming-Huei; Tin, Adrienne; Kim, Young J; Taliun, Daniel; Li, Man; Feitosa, Mary; Gorski, Mathias; Yang, Qiong; Hundertmark, Claudia; Foster, Meredith C; Glazer, Nicole; Isaacs, Aaron; Rao, Madhumathi; Smith, Albert V; O'Connell, Jeffrey R; Struchalin, Maksim; Tanaka, Toshiko; Li, Guo; Hwang, Shih-Jen; Atkinson, Elizabeth J; Lohman, Kurt; Cornelis, Marilyn C; Johansson, Asa; Tönjes, Anke; Dehghan, Abbas; Couraki, Vincent; Holliday, Elizabeth G; Sorice, Rossella; Kutalik, Zoltan; Lehtimäki, Terho; Esko, Tõnu; Deshmukh, Harshal; Ulivi, Sheila; Chu, Audrey Y; Murgia, Federico; Trompet, Stella; Imboden, Medea; Kollerits, Barbara; Pistis, Giorgio; Harris, Tamara B; Launer, Lenore J; Aspelund, Thor; Eiriksdottir, Gudny; Mitchell, Braxton D; Boerwinkle, Eric; Schmidt, Helena; Hofer, Edith; Hu, Frank; Demirkan, Ayse; Oostra, Ben A; Turner, Stephen T; Ding, Jingzhong; Andrews, Jeanette S; Freedman, Barry I; Giulianini, Franco; Koenig, Wolfgang; Illig, Thomas; Döring, Angela; Wichmann, H-Erich; Zgaga, Lina; Zemunik, Tatijana; Boban, Mladen; Minelli, Cosetta; Wheeler, Heather E; Igl, Wilmar; Zaboli, Ghazal; Wild, Sarah H; Wright, Alan F; Campbell, Harry; Ellinghaus, David; Nöthlings, Ute; Jacobs, Gunnar; Biffar, Reiner; Ernst, Florian; Homuth, Georg; Kroemer, Heyo K; Nauck, Matthias; Stracke, Sylvia; Völker, Uwe; Völzke, Henry; Kovacs, Peter; Stumvoll, Michael; Mägi, Reedik; Hofman, Albert; Uitterlinden, Andre G; Rivadeneira, Fernando; Aulchenko, Yurii S; Polasek, Ozren; Hastie, Nick; Vitart, Veronique; Helmer, Catherine; Wang, Jie Jin; Stengel, Bénédicte; Ruggiero, Daniela; Bergmann, Sven; Kähönen, Mika; Viikari, Jorma; Nikopensius, Tiit; Province, Michael; Colhoun, Helen; Doney, Alex; Robino, Antonietta; Krämer, Bernhard K; Portas, Laura; Ford, Ian; Buckley, Brendan M; Adam, Martin; Thun, Gian-Andri; Paulweber, Bernhard; Haun, Margot; Sala, Cinzia; Mitchell, Paul; Ciullo, Marina; Vollenweider, Peter; Raitakari, Olli; Metspalu, Andres; Palmer, Colin; Gasparini, Paolo; Pirastu, Mario; Jukema, J Wouter; Probst-Hensch, Nicole M; Kronenberg, Florian; Toniolo, Daniela; Gudnason, Vilmundur; Shuldiner, Alan R; Coresh, Josef; Schmidt, Reinhold; Ferrucci, Luigi; van Duijn, Cornelia M; Borecki, Ingrid; Kardia, Sharon L R; Liu, Yongmei; Curhan, Gary C; Rudan, Igor; Gyllensten, Ulf; Wilson, James F; Franke, Andre; Pramstaller, Peter P; Rettig, Rainer; Prokopenko, Inga; Witteman, Jacqueline; Hayward, Caroline; Ridker, Paul M; Bochud, Murielle; Heid, Iris M; Siscovick, David S; Fox, Caroline S; Kao, W Linda; Böger, Carsten A

    2013-12-01

    Many common genetic variants identified by genome-wide association studies for complex traits map to genes previously linked to rare inherited Mendelian disorders. A systematic analysis of common single-nucleotide polymorphisms (SNPs) in genes responsible for Mendelian diseases with kidney phenotypes has not been performed. We thus developed a comprehensive database of genes for Mendelian kidney conditions and evaluated the association between common genetic variants within these genes and kidney function in the general population. Using the Online Mendelian Inheritance in Man database, we identified 731 unique disease entries related to specific renal search terms and confirmed a kidney phenotype in 218 of these entries, corresponding to mutations in 258 genes. We interrogated common SNPs (minor allele frequency >5%) within these genes for association with the estimated GFR in 74,354 European-ancestry participants from the CKDGen Consortium. However, the top four candidate SNPs (rs6433115 at LRP2, rs1050700 at TSC1, rs249942 at PALB2, and rs9827843 at ROBO2) did not achieve significance in a stage 2 meta-analysis performed in 56,246 additional independent individuals, indicating that these common SNPs are not associated with estimated GFR. The effect of less common or rare variants in these genes on kidney function in the general population and disease-specific cohorts requires further research.

  6. In-situ neutron diffraction study of martensitic variant redistribution in polycrystalline Ni-Mn-Ga alloy under cyclic thermo-mechanical treatment

    NASA Astrophysics Data System (ADS)

    Li, Zongbin; Zhang, Yudong; Esling, Claude; Gan, Weimin; Zou, Naifu; Zhao, Xiang; Zuo, Liang

    2014-07-01

    The influences of uniaxial compressive stress on martensitic transformation were studied on a polycrystalline Ni-Mn-Ga bulk alloy prepared by directional solidification. Based upon the integrated in-situ neutron diffraction measurements, direct experimental evidence was obtained on the variant redistribution of seven-layered modulated (7M) martensite, triggered by external uniaxial compression during martensitic transformation. Large anisotropic lattice strain, induced by the cyclic thermo-mechanical treatment, has led to the microstructure modification by forming martensitic variants with a strong ⟨0 1 0⟩7M preferential orientation along the loading axis. As a result, the saturation of magnetization became easier to reach.

  7. In-situ neutron diffraction study of martensitic variant redistribution in polycrystalline Ni-Mn-Ga alloy under cyclic thermo-mechanical treatment

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Li, Zongbin; Zou, Naifu; Zhao, Xiang

    2014-07-14

    The influences of uniaxial compressive stress on martensitic transformation were studied on a polycrystalline Ni-Mn-Ga bulk alloy prepared by directional solidification. Based upon the integrated in-situ neutron diffraction measurements, direct experimental evidence was obtained on the variant redistribution of seven-layered modulated (7M) martensite, triggered by external uniaxial compression during martensitic transformation. Large anisotropic lattice strain, induced by the cyclic thermo-mechanical treatment, has led to the microstructure modification by forming martensitic variants with a strong ⟨0 1 0⟩7M preferential orientation along the loading axis. As a result, the saturation of magnetization became easier to reach.

  8. VIRUS NOMENCLATURE BELOW THE SPECIES LEVEL: A STANDARDIZED NOMENCLATURE FOR LABORATORY ANIMAL-ADAPTED STRAINS AND VARIANTS OF VIRUSES ASSIGNED TO THE FAMILY FILOVIRIDAE

    PubMed Central

    Kuhn, Jens H.; Bao, Yiming; Bavari, Sina; Becker, Stephan; Bradfute, Steven; Brister, J. Rodney; Bukreyev, Alexander A.; Caì, Yíngyún; Chandran, Kartik; Davey, Robert A.; Dolnik, Olga; Dye, John M.; Enterlein, Sven; Gonzalez, Jean-Paul; Formenty, Pierre; Freiberg, Alexander N.; Hensley, Lisa E.; Honko, Anna N.; Ignatyev, Georgy M.; Jahrling, Peter B.; Johnson, Karl M.; Klenk, Hans-Dieter; Kobinger, Gary; Lackemeyer, Matthew G.; Leroy, Eric M.; Lever, Mark S.; Lofts, Loreen L.; Mühlberger, Elke; Netesov, Sergey V.; Olinger, Gene G.; Palacios, Gustavo; Patterson, Jean L.; Paweska, Janusz T.; Pitt, Louise; Radoshitzky, Sheli R.; Ryabchikova, Elena I.; Saphire, Erica Ollmann; Shestopalov, Aleksandr M.; Smither, Sophie J.; Sullivan, Nancy J.; Swanepoel, Robert; Takada, Ayato; Towner, Jonathan S.; van der Groen, Guido; Volchkov, Viktor E.; Wahl-Jensen, Victoria; Warren, Travis K.; Warfield, Kelly L.; Weidmann, Manfred; Nichol, Stuart T.

    2013-01-01

    The International Committee on Taxonomy of Viruses (ICTV) organizes the classification of viruses into taxa, but is not responsible for the nomenclature for taxa members. International expert groups, such as the ICTV Study Groups, recommend the classification and naming of viruses and their strains, variants, and isolates. The ICTV Filoviridae Study Group has recently introduced an updated classification and nomenclature for filoviruses. Subsequently, and together with numerous other filovirus experts, a consistent nomenclature for their natural genetic variants and isolates was developed that aims at simplifying the retrieval of sequence data from electronic databases. This is a first important step toward a viral genome annotation standard as sought by the US National Center for Biotechnology Information (NCBI). Here, this work is extended to include filoviruses obtained in the laboratory by artificial selection through passage in laboratory hosts. The previously developed template for natural filovirus genetic variant naming ( ///-) is retained, but it is proposed to adapt the type of information added to each field for laboratory animal-adapted variants. For instance, the full-length designation of an Ebola virus Mayinga variant adapted at the State Research Center for Virology and Biotechnology “Vector” to cause disease in guinea pigs after seven passages would be akin to “Ebola virus VECTOR/C.porcellus-lab/COD/1976/Mayinga-GPA-P7”. As was proposed for the names of natural filovirus variants, we suggest using the full-length designation in databases, as well as in the method section of publications. Shortened designations (such as “EBOV VECTOR/C.por/COD/76/May-GPA-P7”) and abbreviations (such as “EBOV/May-GPA-P7”) could be used in the remainder of the text depending on how critical it is to convey information contained in the full-length name. “EBOV” would suffice if only one EBOV strain/variant/isolate is addressed. PMID:23358612
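
    The slash-delimited designation quoted in the abstract can be assembled mechanically from its parts. The following is a minimal sketch only: the class and field names are hypothetical labels inferred from the single example in the abstract, not terms taken from the nomenclature proposal itself.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LabAdaptedFilovirusName:
    # All field names are hypothetical labels inferred from the abstract's example.
    virus: str       # e.g. "Ebola virus"
    institute: str   # where the adaptation was performed, e.g. "VECTOR"
    host: str        # laboratory host tag, e.g. "C.porcellus-lab"
    country: str     # country code of the original isolate, e.g. "COD"
    year: int        # year of the original isolate
    variant: str     # natural variant the adapted virus derives from
    adaptation: str  # adaptation/passage tag, e.g. "GPA-P7"

    def full_length(self) -> str:
        """Join the parts in the order seen in the abstract's example."""
        return (
            f"{self.virus} {self.institute}/{self.host}/"
            f"{self.country}/{self.year}/{self.variant}-{self.adaptation}"
        )


example = LabAdaptedFilovirusName(
    virus="Ebola virus",
    institute="VECTOR",
    host="C.porcellus-lab",
    country="COD",
    year=1976,
    variant="Mayinga",
    adaptation="GPA-P7",
)
print(example.full_length())
```

    Keeping the parts as separate fields, rather than storing only the joined string, is one way to make database retrieval by host, year, or variant straightforward.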

  9. Omitted data in randomized controlled trials for anxiety and depression: A systematic review of the inclusion of sexual orientation and gender identity.

    PubMed

    Heck, Nicholas C; Mirabito, Lucas A; LeMaire, Kelly; Livingston, Nicholas A; Flentje, Annesa

    2017-01-01

    The current study examined the frequency with which randomized controlled trials (RCTs) of behavioral and psychological interventions for anxiety and depression include data pertaining to participant sexual orientation and nonbinary gender identities. Using systematic review methodology, the databases PubMed and PsycINFO were searched to identify RCTs published in 2004, 2009, and 2014. Random selections of 400 articles per database per year (2,400 articles in total) were considered for inclusion in the review. Articles meeting inclusion criteria were read and coded by the research team to identify whether the trial reported data pertaining to participant sexual orientation and nonbinary gender identities. Additional trial characteristics were also identified and indexed in our database (e.g., sample size, funding source). Of the 232 articles meeting inclusion criteria, only 1 reported participants' sexual orientation, and zero articles included nonbinary gender identities. A total of 52,769 participants were represented in the trials, 93 of which were conducted in the United States, and 43 acknowledged the National Institutes of Health as a source of funding. Despite known mental health disparities on the basis of sexual orientation and nonbinary gender identification, researchers evaluating interventions for anxiety and depression are not reporting on these important demographic characteristics. Reporting practices must change to ensure that our interventions generalize to lesbian, gay, bisexual, and transgender persons. (PsycINFO Database Record (c) 2017 APA, all rights reserved).

  10. Knowledge Discovery in Variant Databases Using Inductive Logic Programming

    PubMed Central

    Nguyen, Hoan; Luu, Tien-Dao; Poch, Olivier; Thompson, Julie D.

    2013-01-01

    Understanding the effects of genetic variation on the phenotype of an individual is a major goal of biomedical research, especially for the development of diagnostics and effective therapeutic solutions. In this work, we describe the use of a recent knowledge discovery from database (KDD) approach using inductive logic programming (ILP) to automatically extract knowledge about human monogenic diseases. We extracted background knowledge from MSV3d, a database of all human missense variants mapped to 3D protein structure. In this study, we identified 8,117 mutations in 805 proteins with known three-dimensional structures that were known to be involved in human monogenic disease. Our results help to improve our understanding of the relationships between structural, functional or evolutionary features and deleterious mutations. Our inferred rules can also be applied to predict the impact of any single amino acid replacement on the function of a protein. The interpretable rules are available at http://decrypthon.igbmc.fr/kd4v/. PMID:23589683

  11. Knowledge discovery in variant databases using inductive logic programming.

    PubMed

    Nguyen, Hoan; Luu, Tien-Dao; Poch, Olivier; Thompson, Julie D

    2013-01-01

    Understanding the effects of genetic variation on the phenotype of an individual is a major goal of biomedical research, especially for the development of diagnostics and effective therapeutic solutions. In this work, we describe the use of a recent knowledge discovery from database (KDD) approach using inductive logic programming (ILP) to automatically extract knowledge about human monogenic diseases. We extracted background knowledge from MSV3d, a database of all human missense variants mapped to 3D protein structure. In this study, we identified 8,117 mutations in 805 proteins with known three-dimensional structures that were known to be involved in human monogenic disease. Our results help to improve our understanding of the relationships between structural, functional or evolutionary features and deleterious mutations. Our inferred rules can also be applied to predict the impact of any single amino acid replacement on the function of a protein. The interpretable rules are available at http://decrypthon.igbmc.fr/kd4v/.

  12. CHASM and SNVBox: toolkit for detecting biologically important single nucleotide mutations in cancer

    PubMed Central

    Carter, Hannah; Diekhans, Mark; Ryan, Michael C.; Karchin, Rachel

    2011-01-01

    Summary: Thousands of cancer exomes are currently being sequenced, yielding millions of non-synonymous single nucleotide variants (SNVs) of possible relevance to disease etiology. Here, we provide a software toolkit to prioritize SNVs based on their predicted contribution to tumorigenesis. It includes a database of precomputed, predictive features covering all positions in the annotated human exome and can be used either stand-alone or as part of a larger variant discovery pipeline. Availability and Implementation: MySQL database, source code and binaries freely available for academic/government use at http://wiki.chasmsoftware.org. Source in Python and C++. Requires 32 or 64-bit Linux system (tested on Fedora Core 8, 10, 11 and Ubuntu 10), 2.5* ≤ Python < 3.0*, MySQL server >5.0, 60 GB available hard disk space (50 MB for software and data files, 40 GB for MySQL database dump when uncompressed), 2 GB of RAM. Contact: karchin@jhu.edu Supplementary Information: Supplementary data are available at Bioinformatics online. PMID:21685053

  13. CancerDR: cancer drug resistance database.

    PubMed

    Kumar, Rahul; Chaudhary, Kumardeep; Gupta, Sudheer; Singh, Harinder; Kumar, Shailesh; Gautam, Ankur; Kapoor, Pallavi; Raghava, Gajendra P S

    2013-01-01

    Cancer therapies are limited by the development of drug resistance, and mutations in drug targets are one of the main reasons for developing acquired resistance. Adequate knowledge of these mutations in drug targets would help to design effective personalized therapies. Keeping this in mind, we have developed a database, "CancerDR", which provides information on 148 anti-cancer drugs and their pharmacological profiling across 952 cancer cell lines. CancerDR provides comprehensive information about each drug target that includes (i) sequence of natural variants, (ii) mutations, (iii) tertiary structure, and (iv) alignment profile of mutants/variants. A number of web-based tools have been integrated in CancerDR. This database will be very useful for identification of genetic alterations in genes encoding drug targets, and in turn the residues responsible for drug resistance. CancerDR allows users to identify promiscuous drug molecules that can kill a wide range of cancer cells. CancerDR is freely accessible at http://crdd.osdd.net/raghava/cancerdr/

  14. The genetic validation of heterogeneity in schizophrenia.

    PubMed

    Tsutsumi, Atsushi; Glatt, Stephen J; Kanazawa, Tetsufumi; Kawashige, Seiya; Uenishi, Hiroyuki; Hokyo, Akira; Kaneko, Takao; Moritani, Makiko; Kikuyama, Hiroki; Koh, Jun; Matsumura, Hitoshi; Yoneda, Hiroshi

    2011-10-07

    Schizophrenia is a heritable disorder; however, a clear genetic architecture has not been detected. To overcome this state of uncertainty, the SZGene database has been established by including all published case-control genetic association studies appearing in peer-reviewed journals. In the current study, we aimed to determine if genetic variants strongly suggested by SZGene are associated with risk of schizophrenia in our case-control samples of Japanese ancestry. In addition, by employing the additive model for aggregating the effect of seven variants, we aimed to verify the genetic heterogeneity of schizophrenia diagnosed by an operative diagnostic manual, the DSM-IV. Each positively suggested genetic polymorphism was ranked according to its p-value, then the seven top-ranked variants (p < 0.0005) were selected from DRD2, DRD4, GRIN2B, TPH1, MTHFR, and DTNBP1 (February, 2007). A total of 407 schizophrenia cases and 384 controls participated in this study. To aggregate the vulnerability of the disorder based on the participants' genetic information, we calculated the "risk-index" by adding the number of genetic risk factors. No statistically significant deviation between cases and controls was observed in the genetic risk-index derived from all seven variants on the top-ranked polymorphisms. In fact, the average risk-index score in the schizophrenia group (6.5 ± 1.57) was slightly lower than among controls (6.6 ± 1.39). The current work illustrates the difficulty in identifying universal and definitive risk-conferring polymorphisms for schizophrenia. Our sample size was small, so we cannot preclude the possibility that some or all of these variants are minor risk factors for schizophrenia in the Japanese population. It is also important to aggregate the updated positive variants in the SZGene database when the replication work is conducted.
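
    The additive "risk-index" described in this abstract amounts to a per-participant sum over the selected variants. The sketch below assumes that each variant contributes its risk-allele count (0, 1 or 2), which the abstract does not state explicitly; the variant labels and genotypes are hypothetical and are not study data.

```python
from statistics import mean


def risk_index(risk_allele_counts: dict[str, int]) -> int:
    """Additive risk index: the sum of risk-allele counts over all variants.

    Assumption: each variant is coded as 0, 1 or 2 copies of the risk allele.
    """
    for variant, count in risk_allele_counts.items():
        if count not in (0, 1, 2):
            raise ValueError(f"{variant}: expected 0, 1 or 2 risk alleles, got {count}")
    return sum(risk_allele_counts.values())


# Hypothetical participants, each genotyped at seven placeholder variants.
participant_a = {f"variant_{i}": c for i, c in enumerate([1, 0, 2, 1, 1, 0, 1], start=1)}
participant_b = {f"variant_{i}": c for i, c in enumerate([2, 1, 1, 0, 1, 1, 2], start=1)}

scores = [risk_index(participant_a), risk_index(participant_b)]
print(scores, mean(scores))
```

    Comparing the mean index between cases and controls, as the abstract reports, would then be a standard two-group comparison on these per-participant scores.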

  15. Brute-Force Approach for Mass Spectrometry-Based Variant Peptide Identification in Proteogenomics without Personalized Genomic Data

    NASA Astrophysics Data System (ADS)

    Ivanov, Mark V.; Lobas, Anna A.; Levitsky, Lev I.; Moshkovskii, Sergei A.; Gorshkov, Mikhail V.

    2018-02-01

    In a proteogenomic approach based on tandem mass spectrometry analysis of proteolytic peptide mixtures, customized exome or RNA-seq databases are employed for identifying protein sequence variants. However, the problem of variant peptide identification without personalized genomic data is important for a variety of applications. Following the recent proposal by Chick et al. (Nat. Biotechnol. 33, 743-749, 2015) on the feasibility of such variant peptide search, we evaluated two available approaches based on the previously suggested "open" search and the "brute-force" strategy. To improve the efficiency of these approaches, we propose an algorithm for exclusion of false variant identifications from the search results involving analysis of modifications mimicking single amino acid substitutions. Also, we propose a de novo based scoring scheme for assessment of identified point mutations. In the scheme, the search engine analyzes y-type fragment ions in MS/MS spectra to confirm the location of the mutation in the variant peptide sequence.

  16. Object-oriented parsing of biological databases with Python.

    PubMed

    Ramu, C; Gemünd, C; Gibson, T J

    2000-07-01

    While database activities in the biological area are increasing rapidly, rather little is done in the area of parsing them in a simple and object-oriented way. We present here an elegant, simple yet powerful way of parsing biological flat-file databases. We have taken EMBL, SWISS-PROT and GENBANK as examples. EMBL and SWISS-PROT do not differ much in the format structure. GENBANK has a format structure very different from that of EMBL and SWISS-PROT. Extracting the desired fields in an entry (for example a sub-sequence with an associated feature) for later analysis is a constant need in the biological sequence-analysis community: this is illustrated with tools to make new splice-site databases. The interface to the parser is abstract in the sense that the access to all the databases is independent from their different formats, since parsing instructions are hidden.
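
    The idea of a format-independent parser interface can be illustrated as follows. This is a minimal sketch and not the authors' implementation: the class names are hypothetical, the sample entries are invented, and only the EMBL-style convention of two-letter line codes with a "//" entry terminator is handled.

```python
from abc import ABC, abstractmethod
from typing import Iterator

Entry = dict[str, list[str]]


class FlatFileParser(ABC):
    """Abstract interface: callers iterate over entries without knowing the format."""

    @abstractmethod
    def parse(self, text: str) -> Iterator[Entry]:
        ...


class LineCodeParser(FlatFileParser):
    """Parser for EMBL-style records: a two-letter code, then data; '//' ends an entry."""

    def parse(self, text: str) -> Iterator[Entry]:
        entry: Entry = {}
        for line in text.splitlines():
            if line.startswith("//"):
                if entry:
                    yield entry
                entry = {}
                continue
            code, data = line[:2], line[5:].strip()
            if code.strip():  # lines without a line code (e.g. sequence data) are skipped
                entry.setdefault(code, []).append(data)
        if entry:  # tolerate a final entry that lacks a terminator
            yield entry


# Invented sample text, for illustration only.
SAMPLE = (
    "ID   TEST1\n"
    "AC   X00001;\n"
    "DE   Hypothetical example entry\n"
    "//\n"
    "ID   TEST2\n"
    "AC   X00002;\n"
    "//\n"
)

parser: FlatFileParser = LineCodeParser()
entries = list(parser.parse(SAMPLE))
print([e["ID"][0] for e in entries])
```

    A parser for a differently structured format would subclass the same interface, so code that consumes entries would not need to change.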

  17. Intrinsic magnetic properties of L1₀ FeNi obtained from meteorite NWA 6259

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Poirier, Eric; Pinkerton, Frederick E.; Kubic, Robert

    2015-05-07

    FeNi having the tetragonal L1₀ crystal structure is a promising new rare-earth-free permanent magnet material. Laboratory synthesis is challenging; however, tetragonal L1₀ FeNi (the mineral “tetrataenite”) has been characterized using specimens found in nickel-iron meteorites. Most notably, the meteorite NWA 6259 recovered from Northwest Africa is 95 vol. % tetrataenite with a composition of 43 at. % Ni. Hysteresis loops were measured as a function of sample orientation on a specimen cut from NWA 6259 in order to rigorously deduce the intrinsic hard magnetic properties of its L1₀ phase. Electron backscatter diffraction showed that NWA 6259 is strongly textured, containing L1₀ grains oriented along any one of the three equivalent cubic directions of the parent fcc structure. The magnetic structure was modeled as a superposition of the three orthonormal uniaxial variants. By simultaneously fitting first-quadrant magnetization data for 13 different orientations of the sample with respect to the applied field direction, the intrinsic magnetic properties were estimated to be saturation magnetization 4πM_s = 14.7 kG and anisotropy field H_a = 14.4 kOe. The anisotropy constant K = 0.84 MJ/m³ is somewhat smaller than the value K = 1.3 MJ/m³ obtained by earlier researchers from nominally equiatomic FeNi prepared by neutron irradiation accompanied by annealing in a magnetic field, suggesting that higher Ni content (fewer Fe antisite defects) may improve the anisotropy. The fit also indicated that NWA 6259 contains one dominant variant (62% by volume), the remainder of the sample being a second variant, and the third variant being absent altogether.
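
    The three reported quantities can be cross-checked with the standard uniaxial relation K = M_s H_a / 2. The abstract does not say which relation was used, so that is an assumption here; the function name is hypothetical and the calculation is only a consistency check in CGS units.

```python
import math


def anisotropy_constant_mj_per_m3(four_pi_ms_kg: float, h_a_koe: float) -> float:
    """Uniaxial anisotropy constant K = M_s * H_a / 2, returned in MJ/m^3.

    Assumption: the simple uniaxial relation H_a = 2K / M_s applies.
    """
    m_s = four_pi_ms_kg * 1e3 / (4 * math.pi)  # saturation magnetization, emu/cm^3
    h_a = h_a_koe * 1e3                        # anisotropy field, Oe
    k_erg_per_cm3 = 0.5 * m_s * h_a            # erg/cm^3
    return k_erg_per_cm3 * 1e-7                # 1 erg/cm^3 = 0.1 J/m^3 = 1e-7 MJ/m^3


k = anisotropy_constant_mj_per_m3(four_pi_ms_kg=14.7, h_a_koe=14.4)
print(f"K = {k:.2f} MJ/m^3")
```

    With the abstract's values for saturation magnetization and anisotropy field, this agrees with the reported K = 0.84 MJ/m³ to two decimal places.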

  18. iMETHYL: an integrative database of human DNA methylation, gene expression, and genomic variation.

    PubMed

    Komaki, Shohei; Shiwa, Yuh; Furukawa, Ryohei; Hachiya, Tsuyoshi; Ohmomo, Hideki; Otomo, Ryo; Satoh, Mamoru; Hitomi, Jiro; Sobue, Kenji; Sasaki, Makoto; Shimizu, Atsushi

    2018-01-01

    We launched an integrative multi-omics database, iMETHYL (http://imethyl.iwate-megabank.org). iMETHYL provides whole-DNA methylation (~24 million autosomal CpG sites), whole-genome (~9 million single-nucleotide variants), and whole-transcriptome (>14 000 genes) data for CD4+ T-lymphocytes, monocytes, and neutrophils collected from approximately 100 subjects. These data were obtained from whole-genome bisulfite sequencing, whole-genome sequencing, and whole-transcriptome sequencing, making iMETHYL a comprehensive database.

  19. Reference genotype and exome data from an Australian Aboriginal population for health-based research

    PubMed Central

    Tang, Dave; Anderson, Denise; Francis, Richard W.; Syn, Genevieve; Jamieson, Sarra E.; Lassmann, Timo; Blackwell, Jenefer M.

    2016-01-01

    Genetic analyses, including genome-wide association studies and whole exome sequencing (WES), provide powerful tools for the analysis of complex and rare genetic diseases. To date there are no reference data for Aboriginal Australians to underpin the translation of health-based genomic research. Here we provide a catalogue of variants called after sequencing the exomes of 72 Aboriginal individuals to a depth of 20X coverage in ∼80% of the sequenced nucleotides. We determined 320,976 single nucleotide variants (SNVs) and 47,313 insertions/deletions using the Genome Analysis Toolkit. We had previously genotyped a subset of the Aboriginal individuals (70/72) using the Illumina Omni2.5 BeadChip platform and found ~99% concordance at overlapping sites, which suggests high quality genotyping. Finally, we compared our SNVs to six publicly available variant databases, such as dbSNP and the Exome Sequencing Project, and 70,115 of our SNVs did not overlap any of the single nucleotide polymorphic sites in all the databases. Our data set provides a useful reference point for genomic studies on Aboriginal Australians. PMID:27070114

  20. Reference genotype and exome data from an Australian Aboriginal population for health-based research.

    PubMed

    Tang, Dave; Anderson, Denise; Francis, Richard W; Syn, Genevieve; Jamieson, Sarra E; Lassmann, Timo; Blackwell, Jenefer M

    2016-04-12

    Genetic analyses, including genome-wide association studies and whole exome sequencing (WES), provide powerful tools for the analysis of complex and rare genetic diseases. To date there are no reference data for Aboriginal Australians to underpin the translation of health-based genomic research. Here we provide a catalogue of variants called after sequencing the exomes of 72 Aboriginal individuals to a depth of 20X coverage in ∼80% of the sequenced nucleotides. We determined 320,976 single nucleotide variants (SNVs) and 47,313 insertions/deletions using the Genome Analysis Toolkit. We had previously genotyped a subset of the Aboriginal individuals (70/72) using the Illumina Omni2.5 BeadChip platform and found ~99% concordance at overlapping sites, which suggests high quality genotyping. Finally, we compared our SNVs to six publicly available variant databases, such as dbSNP and the Exome Sequencing Project, and 70,115 of our SNVs did not overlap any of the single nucleotide polymorphic sites in all the databases. Our data set provides a useful reference point for genomic studies on Aboriginal Australians.

  1. Assigning Main Orientation to an EOH Descriptor on Multispectral Images.

    PubMed

    Li, Yong; Shi, Xiang; Wei, Lijun; Zou, Junwei; Chen, Fang

    2015-07-01

    This paper proposes an approach to compute an EOH (edge-oriented histogram) descriptor with main orientation. EOH has a better matching ability than SIFT (scale-invariant feature transform) on multispectral images, but does not assign a main orientation to keypoints. Alternatively, it tends to assign the same main orientation to every keypoint, e.g., zero degrees. This limits EOH to matching keypoints between images of translation misalignment only. Observing this limitation, we propose assigning to keypoints the main orientation that is computed with PIIFD (partial intensity invariant feature descriptor). In the proposed method, SIFT keypoints are detected from images as the extrema of difference of Gaussians, and every keypoint is assigned to the main orientation computed with PIIFD. Then, EOH is computed for every keypoint with respect to its main orientation. In addition, an implementation variant is proposed for fast computation of the EOH descriptor. Experimental results show that the proposed approach performs more robustly than the original EOH on image pairs that have a rotation misalignment.

  2. Crystallographic features of the martensitic transformation and their impact on variant organization in the intermetallic compound Ni50Mn38Sb12 studied by SEM/EBSD.

    PubMed

    Zhang, Chunyang; Zhang, Yudong; Esling, Claude; Zhao, Xiang; Zuo, Liang

    2017-09-01

    The mechanical and magnetic properties of Ni-Mn-Sb intermetallic compounds are closely related to the martensitic transformation and martensite variant organization. However, studies of these issues are very limited. Thus, a thorough crystallographic investigation of the martensitic transformation orientation relationship (OR), the transformation deformation and their impact on the variant organization of an Ni50Mn38Sb12 alloy using scanning electron microscopy/electron backscatter diffraction (SEM/EBSD) was conducted in this work. It is shown that the martensite variants are hierarchically organized into plates, each possessing four distinct twin-related variants, and the plates into plate colonies, each containing four distinct plates delimited by compatible and incompatible plate interfaces. Such a characteristic organization is produced by the martensitic transformation. It is revealed that the transformation obeys the Pitsch relation ({0[Formula: see text]} A // {2[Formula: see text]} M and 〈0[Formula: see text]1〉 A // 〈[Formula: see text]2〉 M ; the subscripts A and M refer to austenite and martensite, respectively). The type I twinning plane K 1 of the intra-plate variants and the compatible plate interface plane correspond to the respective orientation relationship planes {0[Formula: see text]} A and {0[Formula: see text]} A of austenite. The three {0[Formula: see text]} A planes possessed by each pair of compatible plates, one corresponding to the compatible plate interface and the other two to the variants in the two plates, are interrelated by 60° and belong to a single 〈11[Formula: see text]〉 A axis zone. The {0[Formula: see text]} A planes representing the two pairs of compatible plates in each plate colony belong to two 〈11[Formula: see text]〉 A axis zones having one {0[Formula: see text]} A plane in common. This common plane defines the compatible plate interfaces of the two pairs of plates. The transformation strains to form the variants in the compatible plates are compatible and demonstrate an edge-to-edge character. Thus, such plates should nucleate and grow simultaneously. On the other hand, the strains to form the variants in the incompatible plates are incompatible, so they nucleate and grow separately until they meet during the transformation. The results of the present work provide comprehensive information on the martensitic transformation of Ni-Mn-Sb intermetallic compounds and its impact on martensite variant organization.

  3. Using the genome aggregation database, computational pathogenicity prediction tools, and patch clamp heterologous expression studies to demote previously published long QT syndrome type 1 mutations from pathogenic to benign.

    PubMed

    Clemens, Daniel J; Lentino, Anne R; Kapplinger, Jamie D; Ye, Dan; Zhou, Wei; Tester, David J; Ackerman, Michael J

    2018-04-01

    Mutations in the KCNQ1-encoded Kv7.1 potassium channel cause long QT syndrome (LQTS) type 1 (LQT1). It has been suggested that ∼10%-20% of rare LQTS case-derived variants in the literature may have been published erroneously as LQT1-causative mutations and may be "false positives." The purpose of this study was to determine which previously published KCNQ1 case variants are likely false positives. A list of all published, case-derived KCNQ1 missense variants (MVs) was compiled. The occurrence of each MV within the Genome Aggregation Database (gnomAD) was assessed. Eight in silico tools were used to predict each variant's pathogenicity. Case-derived variants that were either (1) too frequently found in gnomAD or (2) absent in gnomAD but predicted to be pathogenic by ≤2 tools were considered potential false positives. Three of these variants were characterized functionally using whole-cell patch clamp technique. Overall, there were 244 KCNQ1 case-derived MVs. Of these, 29 (12%) were seen in ≥10 individuals in gnomAD and are demotable. However, 157 of 244 MVs (64%) were absent in gnomAD. Of these, 7 (4%) were predicted to be pathogenic by ≤2 tools, 3 of which we characterized functionally. There was no significant difference in current density between heterozygous KCNQ1-F127L, -P477L, or -L619M variant-containing channels compared to KCNQ1-WT. This study offers preliminary evidence for the demotion of 32 (13%) previously published LQT1 MVs. Of these, 29 were demoted because of their frequent sighting in gnomAD. Additionally, in silico analysis and in vitro functional studies have facilitated the demotion of 3 ultra-rare MVs (F127L, P477L, L619M). Copyright © 2017 Heart Rhythm Society. Published by Elsevier Inc. All rights reserved.
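
The two demotion criteria described in this abstract can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the `Variant` class, its field names, and the function name are hypothetical, and the only values taken from the abstract are the thresholds (seen in at least 10 gnomAD individuals, or absent from gnomAD and called pathogenic by at most 2 of the 8 tools) and the reported counts.

```python
from dataclasses import dataclass


@dataclass
class Variant:
    # Hypothetical record layout, not the authors' data model.
    name: str
    gnomad_carriers: int   # individuals carrying the variant in gnomAD (0 = absent)
    tools_pathogenic: int  # number of the 8 in silico tools predicting pathogenicity


def is_potential_false_positive(v: Variant, min_carriers: int = 10, max_tools: int = 2) -> bool:
    """Apply the two criteria stated in the abstract."""
    # Criterion 1: too frequently found in gnomAD.
    if v.gnomad_carriers >= min_carriers:
        return True
    # Criterion 2: absent from gnomAD and predicted pathogenic by <= 2 tools.
    return v.gnomad_carriers == 0 and v.tools_pathogenic <= max_tools


def percent(part: int, whole: int) -> int:
    """Rounded percentage, used to re-check the proportions quoted in the abstract."""
    return round(100 * part / whole)


# Re-check of the reported proportions.
print(percent(29, 244), percent(157, 244), percent(7, 157), percent(32, 244))
```

A variant that is present in gnomAD but in fewer than 10 individuals meets neither criterion in this sketch, which matches the wording of the abstract but may not reflect how the authors handled such cases.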

  4. Variability of Creatine Metabolism Genes in Children with Autism Spectrum Disorder.

    PubMed

    Cameron, Jessie M; Levandovskiy, Valeriy; Roberts, Wendy; Anagnostou, Evdokia; Scherer, Stephen; Loh, Alvin; Schulze, Andreas

    2017-07-31

    Creatine deficiency syndrome (CDS) comprises three separate enzyme deficiencies with overlapping clinical presentations: arginine:glycine amidinotransferase (GATM gene, glycine amidinotransferase), guanidinoacetate methyltransferase (GAMT gene), and creatine transporter deficiency (SLC6A8 gene, solute carrier family 6 member 8). CDS presents with developmental delays/regression, intellectual disability, speech and language impairment, autistic behaviour, epileptic seizures, treatment-refractory epilepsy, and extrapyramidal movement disorders; symptoms that are also evident in children with autism. The objective of the study was to test the hypothesis that genetic variability in creatine metabolism genes is associated with autism. We sequenced the GATM, GAMT and SLC6A8 genes in 166 patients with autism (coding sequence, introns and adjacent untranslated regions). A total of 29, 16 and 25 variants were identified in each gene, respectively. Four variants were novel in GATM, and 5 in SLC6A8 (not present in the 1000 Genomes, Exome Sequencing Project (ESP) or Exome Aggregation Consortium (ExAC) databases). A single variant in each gene was identified as non-synonymous, and computationally predicted to be potentially damaging. Nine variants in GATM were shown to have a lower minor allele frequency (MAF) in the autism population than in the 1000 Genomes database, specifically in the East Asian population (Fisher's exact test). Two variants also had lower MAFs in the European population. In summary, there were no apparent associations of variants in GAMT and SLC6A8 genes with autism. The suggestion that some specific GATM gene variants may have a lower association with autism is an observation that would need to be corroborated in a larger group of autism patients, and in sub-populations of Asian ethnicities. Overall, our findings suggest that the genetic variability of creatine synthesis/transport is unlikely to play a part in the pathogenesis of autism spectrum disorder (ASD) in children.

  5. Magnetoencephalography of frontotemporal dementia: spatiotemporally localized changes during semantic decisions

    PubMed Central

    Nestor, Peter J.; Hodges, John R.; Rowe, James B.

    2011-01-01

    Behavioural variant frontotemporal dementia is a neurodegenerative disorder with dysfunction and atrophy of the frontal lobes leading to changes in personality, behaviour, empathy, social conduct and insight, with relative preservation of language and memory. As novel treatments begin to emerge, biomarkers of frontotemporal dementia will become increasingly important, including functionally relevant neuroimaging indices of the neurophysiological basis of cognition. We used magnetoencephalography to examine behavioural variant frontotemporal dementia using a semantic decision task that elicits both frontal and temporal activity in healthy people. Twelve patients with behavioural variant frontotemporal dementia (age 50–75) and 16 matched controls made categorical semantic judgements about 400 pictures during continuous magnetoencephalography. Distributed source analysis was used to compare patients and controls. The patients had normal early responses to picture confrontation, indicating intact visual processing. However, a predominantly posterior set of regions including temporoparietal cortex showed reduced source activity 250–310 ms after stimulus onset, in proportion to behavioural measures of semantic association. In contrast, a left frontoparietal network showed reduced source activity at 550–650 ms, proportional to patients’ deficits in attention and orientation. This late deficit probably reflects impairment in the neural substrate of goal-oriented decision making. The results demonstrate behaviourally relevant neural correlates of semantic processing and decision making in behavioural variant frontotemporal dementia, and show for the first time that magnetoencephalography can be used to study cognitive systems in the context of frontotemporal dementia. PMID:21840892

  6. Computing and Communications Infrastructure for Network-Centric Warfare: Exploiting COTS, Assuring Performance

    DTIC Science & Technology

    2004-06-01

    remote databases, has seen little vendor acceptance. Each database (Oracle, DB2, MySQL, etc.) has its own client-server protocol. Therefore each...existing standards – SQL, X.500/LDAP, FTP, etc. • View information dissemination as selective replication – State-oriented vs. message-oriented...allowing the application to start. The resource management system would serve as a broker to the resources, making sure that resources are not

  7. GetData: A filesystem-based, column-oriented database format for time-ordered binary data

    NASA Astrophysics Data System (ADS)

    Wiebe, Donald V.; Netterfield, Calvin B.; Kisner, Theodore S.

    2015-12-01

    The GetData Project is the reference implementation of the Dirfile Standards, a filesystem-based, column-oriented database format for time-ordered binary data. Dirfiles provide a fast, simple format for storing and reading data, suitable for both quicklook and analysis pipelines. GetData provides a C API and bindings exist for various other languages. GetData is distributed under the terms of the GNU Lesser General Public License.

  8. Design and Implementation of an Interface Editor for the Amadeus Multi-Relational Database Front-end System

    DTIC Science & Technology

    1993-03-25

    application of Object-Oriented Programming (OOP) and Human-Computer Interface (HCI) design principles. Knowledge gained from each topic has been incorporated...through the application of Object-Oriented Programming (OOP) and Human-Computer Interface (HCI) design principles. Knowledge gained from each topic has...programming and Human-Computer Interface (HCI) design. Knowledge gained from each is applied to the design of a Form-based interface for database data

  9. Orientation Dependence of Functional Properties in Heterophase Single Crystals of the Ti36.5Ni51.0Hf12.5 and Ti48.5Ni51.5 Alloys

    NASA Astrophysics Data System (ADS)

    Panchenko, E. Yu.; Chumlyakov, Yu. I.; Surikov, N. Yu.; Tagiltsev, A. I.; Vetoshkina, N. G.; Osipovich, K. S.; Maier, H.; Sehitoglu, H.

    2016-03-01

    The features of orientation dependence of stress-induced thermoelastic B2-(R)-B19'-martensitic transformations in single crystals of the Ti48.5Ni51.5 and Ni51.0Ti36.5Hf12.5 (at.%) alloys, which contain disperse particles of the Ti3Ni4 and H-phase, respectively, are revealed along with those of their shape-memory effects (SME) and superelasticity (SE). It is experimentally demonstrated that irrespective of the crystal structure of disperse particles measuring more than 100 nm, for their volume fraction f > 16% there is a weaker orientation dependence of the reversible strain in the cases of manifestation of SME and SE. In the orientations of Class I, wherein martensitic detwinning introduces a considerable contribution into transformation strain, the values of SME |ɛSME| and SE |ɛSE| decrease by over a factor of two compared to the theoretical lattice strain value |ɛtr0| for a B2-B19'-transformation and the experimental values of reversible strain for quenched TiNi crystals. In the orientations of Class 2, wherein detwinning of the martensite is suppressed as is the case in quenched single-phase single crystals, the reversible strain is maintained close to its theoretical value |ɛtr0|. Micromechanical models of interaction between the martensite and the disperse particles are proposed, which account for the weaker orientation dependence of |ɛSME| and |ɛSE| due to suppression of detwinning of the B19'-martensite crystals by the particles and a transition from a single-variant evolution of the stress-induced martensitic transformations to a multiple-variant evolution of transformations in the cases of increased size of the particles and their larger volume fractions.

  10. Controlling Protein Surface Orientation by Strategic Placement of Oligo-Histidine Tags

    PubMed Central

    2017-01-01

    We report oriented immobilization of proteins using the standard hexahistidine (His6)-Ni2+:NTA (nitrilotriacetic acid) methodology, which we systematically tuned to give control of surface coverage. Fluorescence microscopy and surface plasmon resonance measurements of self-assembled monolayers (SAMs) of red fluorescent proteins (TagRFP) showed that binding strength increased by 1 order of magnitude for each additional His6-tag on the TagRFP proteins. All TagRFP variants with His6-tags located on only one side of the barrel-shaped protein yielded a 1.5 times higher surface coverage compared to variants with His6-tags on opposite sides of the so-called β-barrel. Time-resolved fluorescence anisotropy measurements supported by polarized infrared spectroscopy verified that the orientation (and thus coverage and functionality) of proteins on surfaces can be controlled by strategic placement of a His6-tag on the protein. Molecular dynamics simulations show how the differently tagged proteins reside at the surface in “end-on” and “side-on” orientations with each His6-tag contributing to binding. Also, not every dihistidine subunit in a given His6-tag forms a full coordination bond with the Ni2+:NTA SAMs, which varied with the position of the His6-tag on the protein. At equal valency but different tag positions on the protein, differences in binding were caused by probing for Ni2+:NTA moieties and by additional electrostatic interactions between different fractions of the β-barrel structure and charged NTA moieties. Potential of mean force calculations indicate there is no specific single-protein interaction mode that provides a clear preferential surface orientation, suggesting that the experimentally measured preference for the end-on orientation is a supra-protein, not a single-protein, effect. PMID:28850777

  11. Ordinal measures for iris recognition.

    PubMed

    Sun, Zhenan; Tan, Tieniu

    2009-12-01

    Images of a human iris contain rich texture information useful for identity authentication. A key and still open issue in iris recognition is how best to represent such textural information using a compact set of features (iris features). In this paper, we propose using ordinal measures for iris feature representation with the objective of characterizing qualitative relationships between iris regions rather than precise measurements of iris image structures. Such a representation may lose some image-specific information, but it achieves a good trade-off between distinctiveness and robustness. We show that ordinal measures are intrinsic features of iris patterns and largely invariant to illumination changes. Moreover, compactness and low computational complexity of ordinal measures enable highly efficient iris recognition. Ordinal measures are a general concept useful for image analysis and many variants can be derived for ordinal feature extraction. In this paper, we develop multilobe differential filters to compute ordinal measures with flexible intralobe and interlobe parameters such as location, scale, orientation, and distance. Experimental results on three public iris image databases demonstrate the effectiveness of the proposed ordinal feature models.
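
The idea of an ordinal measure (keeping only which of two image regions is brighter, not by how much) can be illustrated with a small pure-Python sketch. It is a simplified two-lobe stand-in for the multilobe differential filters named in the abstract, not the authors' method; the function names, the square-region layout, and the toy image are all assumptions made for illustration.

```python
def region_mean(img, top, left, size):
    """Mean intensity of a size x size block whose top-left corner is (top, left)."""
    vals = [p for row in img[top:top + size] for p in row[left:left + size]]
    return sum(vals) / len(vals)


def ordinal_code(img, lobe_pairs, size=2):
    """One bit per region pair: 1 if the first region is brighter than the second."""
    return [
        1 if region_mean(img, *a, size) > region_mean(img, *b, size) else 0
        for a, b in lobe_pairs
    ]


def hamming_distance(code_a, code_b):
    """Fraction of differing bits between two equal-length ordinal codes."""
    return sum(x != y for x, y in zip(code_a, code_b)) / len(code_a)


# Toy 4x4 "image" and three hypothetical region pairs, given as (top, left) corners.
image = [
    [10, 20, 200, 210],
    [30, 40, 220, 230],
    [90, 80, 50, 60],
    [70, 100, 40, 30],
]
pairs = [((0, 0), (0, 2)), ((2, 0), (2, 2)), ((0, 0), (2, 0))]

# A monotonic illumination change (gain and offset) leaves the ordering intact.
brighter = [[2 * p + 10 for p in row] for row in image]

print(ordinal_code(image, pairs), ordinal_code(brighter, pairs))
```

Because only the sign of each comparison is stored, the code is unchanged by the gain-and-offset change in the example, which is the kind of illumination robustness the abstract describes.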

  12. Change of magnetic domain structure by mechanically induced twin boundary motion in Ni-Mn-Ga single crystal

    NASA Astrophysics Data System (ADS)

    Kopecký, Vít; Heczko, Oleg

    2017-10-01

    The single variant state exhibits usual labyrinth and band magnetic domains depending on orientation of easy magnetization axis. By the passage of single twin boundary induced by mechanical stress the rake and granular domain patterns are formed. These domain patterns are further modified by repeated passage of the twin boundary resulting in similar domain patterns in the sample even though the orientation of the magnetization is different.

  13. A comprehensive SNP and indel imputability database.

    PubMed

    Duan, Qing; Liu, Eric Yi; Croteau-Chonka, Damien C; Mohlke, Karen L; Li, Yun

    2013-02-15

    Genotype imputation has become an indispensable step in genome-wide association studies (GWAS). Imputation accuracy, which directly influences downstream analysis, has been shown to improve with re-sequencing-based reference panels; however, this comes at the cost of high computational burden due to the huge number of potentially imputable markers (tens of millions) discovered through sequencing a large number of individuals. Therefore, there is an increasing need for access to imputation quality information without actually conducting imputation. To facilitate this process, we have established a publicly available SNP and indel imputability database, aiming to provide direct access to imputation accuracy information for markers identified by the 1000 Genomes Project across four major populations and covering multiple GWAS genotyping platforms. SNP and indel imputability information can be retrieved through a user-friendly interface by providing the ID(s) of the desired variant(s) or by specifying the desired genomic region. The query results can be refined by selecting relevant GWAS genotyping platform(s). This is the first database providing variant imputability information specific to each continental group and to each genotyping platform. In Filipino individuals from the Cebu Longitudinal Health and Nutrition Survey, our database can achieve an area under the receiver-operating characteristic curve of 0.97, 0.91, 0.88 and 0.79 for markers with minor allele frequency >5%, 3-5%, 1-3% and 0.5-1%, respectively. Specifically, by filtering out 48.6% of markers (corresponding to a reduction of up to 48.6% in computational costs for actual imputation) based on the imputability information in our database, we can remove 77%, 58%, 51% and 42% of the poorly imputed markers at the cost of only 0.3%, 0.8%, 1.5% and 4.6% of the well-imputed markers with minor allele frequency >5%, 3-5%, 1-3% and 0.5-1%, respectively. http://www.unc.edu/∼yunmli/imputability.html

  14. Using semantic data modeling techniques to organize an object-oriented database for extending the mass storage model

    NASA Technical Reports Server (NTRS)

    Campbell, William J.; Short, Nicholas M., Jr.; Roelofs, Larry H.; Dorfman, Erik

    1991-01-01

    A methodology for optimizing organization of data obtained by NASA earth and space missions is discussed. The methodology uses a concept based on semantic data modeling techniques implemented in a hierarchical storage model. The modeling is used to organize objects in mass storage devices, relational database systems, and object-oriented databases. The semantic data modeling at the metadata record level is examined, including the simulation of a knowledge base and semantic metadata storage issues. The semantic data model hierarchy and its application for efficient data storage is addressed, as is the mapping of the application structure to the mass storage.

  15. Long-range order in InAsSb

    NASA Astrophysics Data System (ADS)

    Jen, H. R.; Ma, K. Y.; Stringfellow, G. B.

    1989-03-01

    Results are presented of transmission electron diffraction (TED) observations, demonstrating, for the first time, a CuPt-type ordering in InAs(1-x)Sb(x) alloys, over a wide range of x values (from x = 0.22 to 0.88). The InAsSb alloys were prepared by OMVPE on (001) oriented undoped InSb or InAs substrates. The ordering-induced spots on the TED patterns show the highest intensity for x of about 0.5 and the lowest intensity toward each binary end compound. Only two of the four variants are formed during growth. In some areas, the degree of order for these two variants, 1/2(-1 1 1) and 1/2(1 -1 1), is equal, and in other areas, one variant dominates.

  16. Optical mass memories

    NASA Technical Reports Server (NTRS)

    Bailey, G. A.

    1976-01-01

    Optical and magnetic variants in the design of trillion-bit read/write memories are compared and tabulated. Components and materials suitable for a random access read/write nonmoving memory system are examined, with preference given to holography and photoplastic materials. Advantages and deficiencies of photoplastics are reviewed. Holographic page composer design, essential features of an optical memory with no moving parts, fiche-oriented random access memory design, and materials suitable for an efficient photoplastic fiche are considered. The optical variants offer advantages in lower volume and weight at data transfer rates near 1 Mbit/sec, but power drain is of the same order as for the magnetic variants (tape memory, disk memory). The mechanical properties of photoplastic film materials still leave much to be desired.

  17. Space Launch System Booster Separation Aerodynamic Database Development and Uncertainty Quantification

    NASA Technical Reports Server (NTRS)

    Chan, David T.; Pinier, Jeremy T.; Wilcox, Floyd J., Jr.; Dalle, Derek J.; Rogers, Stuart E.; Gomez, Reynaldo J.

    2016-01-01

    The development of the aerodynamic database for the Space Launch System (SLS) booster separation environment has presented many challenges because of the complex physics of the flow around three independent bodies due to proximity effects and jet interactions from the booster separation motors and the core stage engines. This aerodynamic environment is difficult to simulate in a wind tunnel experiment and also difficult to simulate with computational fluid dynamics. The database is further complicated by the high dimensionality of the independent variable space, which includes the orientation of the core stage, the relative positions and orientations of the solid rocket boosters, and the thrust levels of the various engines. Moreover, the clearance between the core stage and the boosters during the separation event is sensitive to the aerodynamic uncertainties of the database. This paper will present the development process for Version 3 of the SLS booster separation aerodynamic database and the statistics-based uncertainty quantification process for the database.

  18. Characterization of pathogenic SORL1 genetic variants for association with Alzheimer’s disease: a clinical interpretation strategy

    PubMed Central

    Holstege, Henne; van der Lee, Sven J; Hulsman, Marc; Wong, Tsz Hang; van Rooij, Jeroen GJ; Weiss, Marjan; Louwersheimer, Eva; Wolters, Frank J; Amin, Najaf; Uitterlinden, André G; Hofman, Albert; Ikram, M Arfan; van Swieten, John C; Meijers-Heijboer, Hanne; van der Flier, Wiesje M; Reinders, Marcel JT; van Duijn, Cornelia M; Scheltens, Philip

    2017-01-01

    Accumulating evidence suggests that genetic variants in the SORL1 gene are associated with Alzheimer disease (AD), but a strategy to identify which variants are pathogenic is lacking. In a discovery sample of 115 SORL1 variants detected in 1908 Dutch AD cases and controls, we identified the variant characteristics associated with SORL1 variant pathogenicity. Findings were replicated in an independent sample of 103 SORL1 variants detected in 3193 AD cases and controls. In a combined sample of the discovery and replication samples, comprising 181 unique SORL1 variants, we developed a strategy to classify SORL1 variants into five subtypes ranging from pathogenic to benign. We tested this pathogenicity screen in SORL1 variants reported in two independent published studies. SORL1 variant pathogenicity is defined by the Combined Annotation Dependent Depletion (CADD) score and the minor allele frequency (MAF) reported by the Exome Aggregation Consortium (ExAC) database. Variants predicted to be strongly damaging (CADD score >30) that are extremely rare (ExAC-MAF <1 × 10⁻⁵) increased AD risk 12-fold (95% CI 4.2–34.3; P=5 × 10⁻⁹). Protein-truncating SORL1 mutations were all unknown to ExAC and occurred exclusively in AD cases. More common SORL1 variants (ExAC-MAF ≥1 × 10⁻⁵) were not associated with increased AD risk, even when predicted to be strongly damaging. Findings were independent of gender and the APOE-ε4 allele. High-risk SORL1 variants were observed in a substantial proportion of the AD cases analyzed (2%). Based on their effect size, we propose to consider high-risk SORL1 variants next to variants in APOE, PSEN1, PSEN2 and APP for personalized risk assessments in clinical practice. PMID:28537274

  19. MSeqDR: A Centralized Knowledge Repository and Bioinformatics Web Resource to Facilitate Genomic Investigations in Mitochondrial Disease.

    PubMed

    Shen, Lishuang; Diroma, Maria Angela; Gonzalez, Michael; Navarro-Gomez, Daniel; Leipzig, Jeremy; Lott, Marie T; van Oven, Mannis; Wallace, Douglas C; Muraresku, Colleen Clarke; Zolkipli-Cunningham, Zarazuela; Chinnery, Patrick F; Attimonelli, Marcella; Zuchner, Stephan; Falk, Marni J; Gai, Xiaowu

    2016-06-01

    MSeqDR is the Mitochondrial Disease Sequence Data Resource, a centralized and comprehensive genome and phenome bioinformatics resource built by the mitochondrial disease community to facilitate clinical diagnosis and research investigations of individual patient phenotypes, genomes, genes, and variants. A central Web portal (https://mseqdr.org) integrates community knowledge from expert-curated databases with genomic and phenotype data shared by clinicians and researchers. MSeqDR also functions as a centralized application server for Web-based tools to analyze data across both mitochondrial and nuclear DNA, including investigator-driven whole exome or genome dataset analyses through MSeqDR-Genesis. MSeqDR-GBrowse genome browser supports interactive genomic data exploration and visualization with custom tracks relevant to mtDNA variation and mitochondrial disease. MSeqDR-LSDB is a locus-specific database that currently manages 178 mitochondrial diseases, 1,363 genes associated with mitochondrial biology or disease, and 3,711 pathogenic variants in those genes. MSeqDR Disease Portal allows hierarchical tree-style disease exploration to evaluate their unique descriptions, phenotypes, and causative variants. Automated genomic data submission tools are provided that capture ClinVar compliant variant annotations. PhenoTips will be used for phenotypic data submission on deidentified patients using human phenotype ontology terminology. The development of a dynamic informed patient consent process to guide data access is underway to realize the full potential of these resources. © 2016 WILEY PERIODICALS, INC.

  20. MSeqDR: A Centralized Knowledge Repository and Bioinformatics Web Resource to Facilitate Genomic Investigations in Mitochondrial Disease

    PubMed Central

    Shen, Lishuang; Diroma, Maria Angela; Gonzalez, Michael; Navarro-Gomez, Daniel; Leipzig, Jeremy; Lott, Marie T.; van Oven, Mannis; Wallace, Douglas C.; Muraresku, Colleen Clarke; Zolkipli-Cunningham, Zarazuela; Chinnery, Patrick F.; Attimonelli, Marcella; Zuchner, Stephan

    2016-01-01

    MSeqDR is the Mitochondrial Disease Sequence Data Resource, a centralized and comprehensive genome and phenome bioinformatics resource built by the mitochondrial disease community to facilitate clinical diagnosis and research investigations of individual patient phenotypes, genomes, genes, and variants. A central Web portal (https://mseqdr.org) integrates community knowledge from expert-curated databases with genomic and phenotype data shared by clinicians and researchers. MSeqDR also functions as a centralized application server for Web-based tools to analyze data across both mitochondrial and nuclear DNA, including investigator-driven whole exome or genome dataset analyses through MSeqDR-Genesis. MSeqDR-GBrowse supports interactive genomic data exploration and visualization with custom tracks relevant to mtDNA variation and disease. MSeqDR-LSDB is a locus specific database that currently manages 178 mitochondrial diseases, 1,363 genes associated with mitochondrial biology or disease, and 3,711 pathogenic variants in those genes. MSeqDR Disease Portal allows hierarchical tree-style disease exploration to evaluate their unique descriptions, phenotypes, and causative variants. Automated genomic data submission tools are provided that capture ClinVar-compliant variant annotations. PhenoTips is used for phenotypic data submission on de-identified patients using human phenotype ontology terminology. Development of a dynamic informed patient consent process to guide data access is underway to realize the full potential of these resources. PMID:26919060

  1. Common Variants in Mendelian Kidney Disease Genes and Their Association with Renal Function

    PubMed Central

    Fuchsberger, Christian; Köttgen, Anna; O’Seaghdha, Conall M.; Pattaro, Cristian; de Andrade, Mariza; Chasman, Daniel I.; Teumer, Alexander; Endlich, Karlhans; Olden, Matthias; Chen, Ming-Huei; Tin, Adrienne; Kim, Young J.; Taliun, Daniel; Li, Man; Feitosa, Mary; Gorski, Mathias; Yang, Qiong; Hundertmark, Claudia; Foster, Meredith C.; Glazer, Nicole; Isaacs, Aaron; Rao, Madhumathi; Smith, Albert V.; O’Connell, Jeffrey R.; Struchalin, Maksim; Tanaka, Toshiko; Li, Guo; Hwang, Shih-Jen; Atkinson, Elizabeth J.; Lohman, Kurt; Cornelis, Marilyn C.; Johansson, Åsa; Tönjes, Anke; Dehghan, Abbas; Couraki, Vincent; Holliday, Elizabeth G.; Sorice, Rossella; Kutalik, Zoltan; Lehtimäki, Terho; Esko, Tõnu; Deshmukh, Harshal; Ulivi, Sheila; Chu, Audrey Y.; Murgia, Federico; Trompet, Stella; Imboden, Medea; Kollerits, Barbara; Pistis, Giorgio; Harris, Tamara B.; Launer, Lenore J.; Aspelund, Thor; Eiriksdottir, Gudny; Mitchell, Braxton D.; Boerwinkle, Eric; Schmidt, Helena; Hofer, Edith; Hu, Frank; Demirkan, Ayse; Oostra, Ben A.; Turner, Stephen T.; Ding, Jingzhong; Andrews, Jeanette S.; Freedman, Barry I.; Giulianini, Franco; Koenig, Wolfgang; Illig, Thomas; Döring, Angela; Wichmann, H.-Erich; Zgaga, Lina; Zemunik, Tatijana; Boban, Mladen; Minelli, Cosetta; Wheeler, Heather E.; Igl, Wilmar; Zaboli, Ghazal; Wild, Sarah H.; Wright, Alan F.; Campbell, Harry; Ellinghaus, David; Nöthlings, Ute; Jacobs, Gunnar; Biffar, Reiner; Ernst, Florian; Homuth, Georg; Kroemer, Heyo K.; Nauck, Matthias; Stracke, Sylvia; Völker, Uwe; Völzke, Henry; Kovacs, Peter; Stumvoll, Michael; Mägi, Reedik; Hofman, Albert; Uitterlinden, Andre G.; Rivadeneira, Fernando; Aulchenko, Yurii S.; Polasek, Ozren; Hastie, Nick; Vitart, Veronique; Helmer, Catherine; Wang, Jie Jin; Stengel, Bénédicte; Ruggiero, Daniela; Bergmann, Sven; Kähönen, Mika; Viikari, Jorma; Nikopensius, Tiit; Province, Michael; Colhoun, Helen; Doney, Alex; Robino, Antonietta; Krämer, Bernhard K.; Portas, Laura; Ford, Ian; Buckley, Brendan M.; 
Adam, Martin; Thun, Gian-Andri; Paulweber, Bernhard; Haun, Margot; Sala, Cinzia; Mitchell, Paul; Ciullo, Marina; Vollenweider, Peter; Raitakari, Olli; Metspalu, Andres; Palmer, Colin; Gasparini, Paolo; Pirastu, Mario; Jukema, J. Wouter; Probst-Hensch, Nicole M.; Kronenberg, Florian; Toniolo, Daniela; Gudnason, Vilmundur; Shuldiner, Alan R.; Coresh, Josef; Schmidt, Reinhold; Ferrucci, Luigi; van Duijn, Cornelia M.; Borecki, Ingrid; Kardia, Sharon L.R.; Liu, Yongmei; Curhan, Gary C.; Rudan, Igor; Gyllensten, Ulf; Wilson, James F.; Franke, Andre; Pramstaller, Peter P.; Rettig, Rainer; Prokopenko, Inga; Witteman, Jacqueline; Hayward, Caroline; Ridker, Paul M.; Bochud, Murielle; Heid, Iris M.; Siscovick, David S.; Fox, Caroline S.; Kao, W. Linda; Böger, Carsten A.

    2013-01-01

    Many common genetic variants identified by genome-wide association studies for complex traits map to genes previously linked to rare inherited Mendelian disorders. A systematic analysis of common single-nucleotide polymorphisms (SNPs) in genes responsible for Mendelian diseases with kidney phenotypes has not been performed. We thus developed a comprehensive database of genes for Mendelian kidney conditions and evaluated the association between common genetic variants within these genes and kidney function in the general population. Using the Online Mendelian Inheritance in Man database, we identified 731 unique disease entries related to specific renal search terms and confirmed a kidney phenotype in 218 of these entries, corresponding to mutations in 258 genes. We interrogated common SNPs (minor allele frequency >5%) within these genes for association with the estimated GFR in 74,354 European-ancestry participants from the CKDGen Consortium. However, the top four candidate SNPs (rs6433115 at LRP2, rs1050700 at TSC1, rs249942 at PALB2, and rs9827843 at ROBO2) did not achieve significance in a stage 2 meta-analysis performed in 56,246 additional independent individuals, indicating that these common SNPs are not associated with estimated GFR. The effect of less common or rare variants in these genes on kidney function in the general population and disease-specific cohorts requires further research. PMID:24029420

  2. Promising Variants of Initiation of Martensitic γ - α Transformation in Iron Alloys by a Couple of Elastic Waves

    NASA Astrophysics Data System (ADS)

    Kashchenko, M. P.; Chashchina, V. G.

    2016-01-01

    Variants of initiation of growth of crystals of α-martensite by couples of elastic waves propagating in directions <001>γ and <110>γ in single crystals of Fe31Ni are suggested. The dynamic theory is used to show that the expected orientations of habit planes {110}γ, {001}γ and {559}γ differ from the typical {3 10 15}γ. Possible features of tetragonality of martensite crystals are discussed. The power of the sources of ultrasound required for initiation of γ - α martensitic transformation is estimated.

  3. An Object-Oriented Collection of Minimum Degree Algorithms: Design, Implementation, and Experiences

    NASA Technical Reports Server (NTRS)

    Kumfert, Gary; Pothen, Alex

    1999-01-01

    The multiple minimum degree (MMD) algorithm and its variants have enjoyed 20+ years of research and progress in generating fill-reducing orderings for sparse, symmetric positive definite matrices. Although conceptually simple, efficient implementations of these algorithms are deceptively complex and highly specialized. In this case study, we present an object-oriented library that implements several recent minimum degree-like algorithms. We discuss how object-oriented design forces us to decompose these algorithms in a different manner than earlier codes and demonstrate how this impacts the flexibility and efficiency of our C++ implementation. We compare the performance of our code against other implementations in C or Fortran.

  4. NAHR-mediated copy-number variants in a clinical population: mechanistic insights into both genomic disorders and Mendelizing traits.

    PubMed

    Dittwald, Piotr; Gambin, Tomasz; Szafranski, Przemyslaw; Li, Jian; Amato, Stephen; Divon, Michael Y; Rodríguez Rojas, Lisa Ximena; Elton, Lindsay E; Scott, Daryl A; Schaaf, Christian P; Torres-Martinez, Wilfredo; Stevens, Abby K; Rosenfeld, Jill A; Agadi, Satish; Francis, David; Kang, Sung-Hae L; Breman, Amy; Lalani, Seema R; Bacino, Carlos A; Bi, Weimin; Milosavljevic, Aleksandar; Beaudet, Arthur L; Patel, Ankita; Shaw, Chad A; Lupski, James R; Gambin, Anna; Cheung, Sau Wai; Stankiewicz, Pawel

    2013-09-01

    We delineated and analyzed directly oriented paralogous low-copy repeats (DP-LCRs) in the most recent version of the human haploid reference genome. The computationally defined DP-LCRs were cross-referenced with our chromosomal microarray analysis (CMA) database of 25,144 patients subjected to genome-wide assays. This computationally guided approach to the empirically derived large data set allowed us to investigate genomic rearrangement relative frequencies and identify new loci for recurrent nonallelic homologous recombination (NAHR)-mediated copy-number variants (CNVs). The most commonly observed recurrent CNVs were NPHP1 duplications (233), CHRNA7 duplications (175), and 22q11.21 deletions (DiGeorge/velocardiofacial syndrome, 166). In the ∼25% of CMA cases for which parental studies were available, we identified 190 de novo recurrent CNVs. In this group, the most frequently observed events were deletions of 22q11.21 (48), 16p11.2 (autism, 34), and 7q11.23 (Williams-Beuren syndrome, 11). Several features of DP-LCRs, including length, distance between NAHR substrate elements, DNA sequence identity (fraction matching), GC content, and concentration of the homologous recombination (HR) hot spot motif 5'-CCNCCNTNNCCNC-3', correlate with the frequencies of the recurrent CNV events. Four novel adjacent DP-LCR-flanked and NAHR-prone regions, involving 2q12.2q13, were elucidated in association with novel genomic disorders. Our study quantitates genome architectural features responsible for NAHR-mediated genomic instability and further elucidates the role of NAHR in human disease.

  5. Examining Reuse in LaSRS++-Based Projects

    NASA Technical Reports Server (NTRS)

    Madden, Michael M.

    2001-01-01

    NASA Langley Research Center (LaRC) developed the Langley Standard Real-Time Simulation in C++ (LaSRS++) to consolidate all software development for its simulation facilities under one common framework. A common framework promised a decrease in the total development effort for a new simulation by encouraging software reuse. To judge the success of LaSRS++ in this regard, reuse metrics were extracted from 11 aircraft models. Three methods that employ static analysis of the code were used to identify the reusable components. For the method that provides the best estimate, reuse levels fall between 66% and 95% indicating a high degree of reuse. Additional metrics provide insight into the extent of the foundation that LaSRS++ provides to new simulation projects. When creating variants of an aircraft, LaRC developers use object-oriented design to manage the aircraft as a reusable resource. Variants modify the aircraft for a research project or embody an alternate configuration of the aircraft. The variants inherit from the aircraft model. The variants use polymorphism to extend or redefine aircraft behaviors to meet the research requirements or to match the alternate configuration. Reuse level metrics were extracted from 10 variants. Reuse levels of aircraft by variants were 60% - 99%.
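
    The inheritance and polymorphism pattern described above can be sketched as follows. This is a minimal illustration in Python, not LaSRS++ code (LaSRS++ itself is written in C++); the class names `Aircraft` and `ResearchVariant`, their methods, and the numeric values are all hypothetical.

```python
class Aircraft:
    """Hypothetical base aircraft model; not an actual LaSRS++ class."""

    def __init__(self, name):
        self.name = name

    def max_thrust(self):
        # Arbitrary illustrative value in arbitrary units.
        return 100

    def describe(self):
        # Calls max_thrust() polymorphically, so variants reuse this method as-is.
        return f"{self.name}: max thrust {self.max_thrust()}"


class ResearchVariant(Aircraft):
    """Hypothetical variant that inherits from the aircraft model."""

    def max_thrust(self):
        # Redefines an existing behavior for an alternate configuration.
        return super().max_thrust() + 20

    def research_payload(self):
        # Extends the model with a behavior the base aircraft does not have.
        return "instrumentation pod"


base = Aircraft("baseline")
variant = ResearchVariant("research variant")
for model in (base, variant):
    print(model.describe())
```

    In this sketch the variant reuses `describe` unchanged and overrides only `max_thrust`, which is the kind of reuse the reported metrics are meant to measure.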

  6. Representing metabolic pathway information: an object-oriented approach.

    PubMed

    Ellis, L B; Speedie, S M; McLeish, R

    1998-01-01

    The University of Minnesota Biocatalysis/Biodegradation Database (UM-BBD) is a website providing information and dynamic links for microbial metabolic pathways, enzyme reactions, and their substrates and products. The Compound, Organism, Reaction and Enzyme (CORE) object-oriented database management system was developed to contain and serve this information. CORE was developed using Java, an object-oriented programming language, and PSE persistent object classes from Object Design, Inc. CORE dynamically generates descriptive web pages for reactions, compounds and enzymes, and reconstructs ad hoc pathway maps starting from any UM-BBD reaction. CORE code is available from the authors upon request. CORE is accessible through the UM-BBD at: http://www.labmed.umn.edu/umbbd/index.html.

  7. India Allele Finder: a web-based annotation tool for identifying common alleles in next-generation sequencing data of Indian origin.

    PubMed

    Zhang, Jimmy F; James, Francis; Shukla, Anju; Girisha, Katta M; Paciorkowski, Alex R

    2017-06-27

    We built India Allele Finder, an online searchable database and command line tool, that gives researchers access to variant frequencies of Indian Telugu individuals, using publicly available fastq data from the 1000 Genomes Project. Access to appropriate population-based genomic variant annotation can accelerate the interpretation of genomic sequencing data. In particular, exome analysis of individuals of Indian descent will identify population variants not reflected in European exomes, complicating genomic analysis for such individuals. India Allele Finder offers improved ease-of-use to investigators seeking to identify and annotate sequencing data from Indian populations. We describe the use of India Allele Finder to identify common population variants in a disease quartet whole exome dataset, reducing the number of candidate single nucleotide variants from 84 to 7. India Allele Finder is freely available to investigators to annotate genomic sequencing data from Indian populations. Use of India Allele Finder allows efficient identification of population variants in genomic sequencing data, and is an example of a population-specific annotation tool that simplifies analysis and encourages international collaboration in genomics research.

  8. Two new γ chain variants: Hb F-Augusta GA [(G)γ59(E3)Lys → Arg; HBG2: c.179A > G] and Hb F-Port Royal-II [(A)γ125(H3)Glu → Ala; HBG1: c.377A > C].

    PubMed

    Kutlar, Ferdane; Ameri, Afshin; Patel, Niren H; Zhuang, Lina; Johnson, Lee E; Cheng, Michael L; Kutlar, Abdullah

    2014-01-01

    The total number of hemoglobin (Hb) variants so far reported to the HbVar database is 1598 (April 9 2014) and 130 of them are fetal Hb variants. Fetal Hb are categorized as two different subunits, (G)γ- and (A)γ-globin chains, and γ chain variants can be observed in both subunits. There are 72 (G)γ- and 58 (A)γ-globin chain variants. Most of them are clinically silent and detected during newborn screening programs in the USA and outside the USA. In this report, we discuss the molecular characteristics and diagnostic difficulties of two new γ-globin chain variants found in an African American baby with no clinical symptoms. One is a new (G)γ-globin chain variant, Hb F-Augusta GA [(G)γ59(E3)Lys → Arg; HBG2: c.179A > G] and the other one is Hb F-Port Royal-II [(A)γ125(H3)Glu → Ala; HBG1: c.377A > C].

  9. Different Relative Orientation of Static and Alternative Magnetic Fields and Cress Roots Direction of Growth Changes Their Gravitropic Reaction

    NASA Astrophysics Data System (ADS)

    Sheykina, Nadiia; Bogatina, Nina

    The following variants of root orientation relative to the static and alternating components of the magnetic field were studied. In the first variant, the static magnetic field was directed parallel to the gravitation vector and the alternating magnetic field perpendicular to the static one; the roots were directed perpendicular to both field components and to the gravitation vector. In this variant, negative gravitropism of the cress roots was observed. In the second variant, the static magnetic field was directed parallel to the gravitation vector and the alternating magnetic field perpendicular to the static one; the roots were directed parallel to the alternating magnetic field. In the third variant, the alternating magnetic field was directed parallel to the gravitation vector and the static magnetic field perpendicular to the gravitation vector; the roots were directed perpendicular to both field components and to the gravitation vector. In the fourth variant, the alternating magnetic field was directed parallel to the gravitation vector and the static magnetic field perpendicular to the gravitation vector; the roots were directed parallel to the static magnetic field. In all cases studied, the frequency of the alternating magnetic field was equal to the cyclotron frequency of Ca ions. In the second, third and fourth variants, gravitropism was positive, but the gravitropic reaction speeds differed. In the second and fourth variants, the gravitropic reaction speed coincided, within error limits, with the gravitropic reaction speed under Earth's conditions. In the third variant, the gravitropic reaction was slowed substantially.

  10. Clinical Views: Object-Oriented Views for Clinical Databases

    PubMed Central

    Portoni, Luisa; Combi, Carlo; Pinciroli, Francesco

    1998-01-01

    We present here a prototype of a clinical information system for the archiving and the management of multimedia and temporally-oriented clinical data related to PTCA patients. The system is based on an object-oriented DBMS and supports multiple views and view schemas on patients' data. Remote data access is supported too.

  11. Applying an Object-Oriented Database Model to a Scientific Database Problem: Managing Experimental Data at CEBAF.

    NASA Astrophysics Data System (ADS)

    Ehlmann, Bryon K.

    Current scientific experiments are often characterized by massive amounts of very complex data and the need for complex data analysis software. Object-oriented database (OODB) systems have the potential of improving the description of the structure and semantics of this data and of integrating the analysis software with the data. This dissertation results from research to enhance OODB functionality and methodology to support scientific databases (SDBs) and, more specifically, to support a nuclear physics experiments database for the Continuous Electron Beam Accelerator Facility (CEBAF). This research to date has identified a number of problems related to the practical application of OODB technology to the conceptual design of the CEBAF experiments database and other SDBs: the lack of a generally accepted OODB design methodology, the lack of a standard OODB model, the lack of a clear conceptual level in existing OODB models, and the limited support in existing OODB systems for many common object relationships inherent in SDBs. To address these problems, the dissertation describes an Object-Relationship Diagram (ORD) and an Object-oriented Database Definition Language (ODDL) that provide tools that allow SDB design and development to proceed systematically and independently of existing OODB systems. These tools define multi-level, conceptual data models for SDB design, which incorporate a simple notation for describing common types of relationships that occur in SDBs. ODDL allows these relationships and other desirable SDB capabilities to be supported by an extended OODB system. A conceptual model of the CEBAF experiments database is presented in terms of ORDs and the ODDL to demonstrate their functionality and use and provide a foundation for future development of experimental nuclear physics software using an OODB approach.

  12. Challenges and disparities in the application of personalized genomic medicine to populations with African ancestry

    PubMed Central

    Kessler, Michael D.; Yerges-Armstrong, Laura; Taub, Margaret A.; Shetty, Amol C.; Maloney, Kristin; Jeng, Linda Jo Bone; Ruczinski, Ingo; Levin, Albert M.; Williams, L. Keoki; Beaty, Terri H.; Mathias, Rasika A.; Barnes, Kathleen C.; Boorgula, Meher Preethi; Campbell, Monica; Chavan, Sameer; Ford, Jean G.; Foster, Cassandra; Gao, Li; Hansel, Nadia N.; Horowitz, Edward; Huang, Lili; Ortiz, Romina; Potee, Joseph; Rafaels, Nicholas; Scott, Alan F.; Vergara, Candelaria; Gao, Jingjing; Hu, Yijuan; Johnston, Henry Richard; Qin, Zhaohui S.; Padhukasahasram, Badri; Dunston, Georgia M.; Faruque, Mezbah U.; Kenny, Eimear E.; Gietzen, Kimberly; Hansen, Mark; Genuario, Rob; Bullis, Dave; Lawley, Cindy; Deshpande, Aniket; Grus, Wendy E.; Locke, Devin P.; Foreman, Marilyn G.; Avila, Pedro C.; Grammer, Leslie; Kim, Kwang-YounA; Kumar, Rajesh; Schleimer, Robert; Bustamante, Carlos; De La Vega, Francisco M.; Gignoux, Chris R.; Shringarpure, Suyash S.; Musharoff, Shaila; Wojcik, Genevieve; Burchard, Esteban G.; Eng, Celeste; Gourraud, Pierre-Antoine; Hernandez, Ryan D.; Lizee, Antoine; Pino-Yanes, Maria; Torgerson, Dara G.; Szpiech, Zachary A.; Torres, Raul; Nicolae, Dan L.; Ober, Carole; Olopade, Christopher O.; Olopade, Olufunmilayo; Oluwole, Oluwafemi; Arinola, Ganiyu; Song, Wei; Abecasis, Goncalo; Correa, Adolfo; Musani, Solomon; Wilson, James G.; Lange, Leslie A.; Akey, Joshua; Bamshad, Michael; Chong, Jessica; Fu, Wenqing; Nickerson, Deborah; Reiner, Alexander; Hartert, Tina; Ware, Lorraine B.; Bleecker, Eugene; Meyers, Deborah; Ortega, Victor E.; Pissamai, Maul R. N.; Trevor, Maul R. N.; Watson, Harold; Araujo, Maria Ilma; Oliveira, Ricardo Riccio; Caraballo, Luis; Marrugo, Javier; Martinez, Beatriz; Meza, Catherine; Ayestas, Gerardo; Herrera-Paz, Edwin Francisco; Landaverde-Torres, Pamela; Erazo, Said Omar Leiva; Martinez, Rosella; Mayorga, Alvaro; Mayorga, Luis F.; Mejia-Mejia, Delmy-Aracely; Ramos, Hector; Saenz, Allan; Varela, Gloria; Vasquez, Olga Marina; Ferguson, Trevor; Knight-Madden, Jennifer; Samms-Vaughan, Maureen; Wilks, Rainford J.; Adegnika, Akim; Ateba-Ngoa, Ulysse; Yazdanbakhsh, Maria; O'Connor, Timothy D.

    2016-01-01

    To characterize the extent and impact of ancestry-related biases in precision genomic medicine, we use 642 whole-genome sequences from the Consortium on Asthma among African-ancestry Populations in the Americas (CAAPA) project to evaluate typical filters and databases. We find significant correlations between estimated African ancestry proportions and the number of variants per individual in all variant classification sets but one. The source of these correlations is highlighted in more detail by looking at the interaction between filtering criteria and the ClinVar and Human Gene Mutation databases. ClinVar's correlation, representing African ancestry-related bias, has changed over time amidst monthly updates, with the most extreme switch happening between March and April of 2014 (r=0.733 to r=−0.683). We identify 68 SNPs as the major drivers of this change in correlation. As long as ancestry-related bias when using these clinical databases is minimally recognized, the genetics community will face challenges with implementation, interpretation and cost-effectiveness when treating minority populations. PMID:27725664

  13. A systematic approach to assessing the clinical significance of genetic variants.

    PubMed

    Duzkale, H; Shen, J; McLaughlin, H; Alfares, A; Kelly, M A; Pugh, T J; Funke, B H; Rehm, H L; Lebo, M S

    2013-11-01

    Molecular genetic testing informs diagnosis, prognosis, and risk assessment for patients and their family members. Recent advances in low-cost, high-throughput DNA sequencing and computing technologies have enabled the rapid expansion of genetic test content, resulting in dramatically increased numbers of DNA variants identified per test. To address this challenge, our laboratory has developed a systematic approach to thorough and efficient assessments of variants for pathogenicity determination. We first search for existing data in publications and databases including internal, collaborative and public resources. We then perform full evidence-based assessments through statistical analyses of observations in the general population and disease cohorts, evaluation of experimental data from in vivo or in vitro studies, and computational predictions of potential impacts of each variant. Finally, we weigh all evidence to reach an overall conclusion on the potential for each variant to be disease causing. In this report, we highlight the principles of variant assessment, address the caveats and pitfalls, and provide examples to illustrate the process. By sharing our experience and providing a framework for variant assessment, including access to a freely available customizable tool, we hope to help move towards standardized and consistent approaches to variant assessment. © 2013 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  14. Mutation Update of ARSA and PSAP Genes Causing Metachromatic Leukodystrophy.

    PubMed

    Cesani, Martina; Lorioli, Laura; Grossi, Serena; Amico, Giulia; Fumagalli, Francesca; Spiga, Ivana; Filocamo, Mirella; Biffi, Alessandra

    2016-01-01

    Metachromatic leukodystrophy is a neurodegenerative disorder characterized by progressive demyelination. The disease is caused by variants in the ARSA gene, which codes for the lysosomal enzyme arylsulfatase A, or, more rarely, in the PSAP gene, which codes for the activator protein saposin B. In this Mutation Update, an extensive review of all the ARSA- and PSAP-causative variants published in the literature to date, accounting for a total of 200 ARSA and 10 PSAP allele types, is presented. The detailed ARSA and PSAP variant lists are freely available on the Leiden Online Variation Database (LOVD) platform at http://www.LOVD.nl/ARSA and http://www.LOVD.nl/PSAP, respectively. © 2015 WILEY PERIODICALS, INC.

  15. Initial experiences with building a health care infrastructure based on Java and object-oriented database technology.

    PubMed

    Dionisio, J D; Sinha, U; Dai, B; Johnson, D B; Taira, R K

    1999-01-01

    A multi-tiered telemedicine system based on Java and object-oriented database technology has yielded a number of practical insights and experiences on their effectiveness and suitability as implementation bases for a health care infrastructure. The advantages and drawbacks to their use, as seen within the context of the telemedicine system's development, are discussed. Overall, these technologies deliver on their early promise, with a few remaining issues that are due primarily to their relative newness.

  16. Heterogenous database integration in a physician workstation.

    PubMed

    Annevelink, J; Young, C Y; Tang, P C

    1991-01-01

    We discuss the integration of a variety of data and information sources in a Physician Workstation (PWS), focusing on the integration of data from DHCP, the Veteran Administration's Distributed Hospital Computer Program. We designed a logically centralized, object-oriented data-schema, used by end users and applications to explore the data accessible through an object-oriented database using a declarative query language. We emphasize the use of procedural abstraction to transparently integrate a variety of information sources into the data schema.

  17. Heterogenous database integration in a physician workstation.

    PubMed Central

    Annevelink, J.; Young, C. Y.; Tang, P. C.

    1991-01-01

    We discuss the integration of a variety of data and information sources in a Physician Workstation (PWS), focusing on the integration of data from DHCP, the Veteran Administration's Distributed Hospital Computer Program. We designed a logically centralized, object-oriented data-schema, used by end users and applications to explore the data accessible through an object-oriented database using a declarative query language. We emphasize the use of procedural abstraction to transparently integrate a variety of information sources into the data schema. PMID:1807624

  18. Image Engine: an object-oriented multimedia database for storing, retrieving and sharing medical images and text.

    PubMed Central

    Lowe, H. J.

    1993-01-01

    This paper describes Image Engine, an object-oriented, microcomputer-based, multimedia database designed to facilitate the storage and retrieval of digitized biomedical still images, video, and text using inexpensive desktop computers. The current prototype runs on Apple Macintosh computers and allows network database access via peer to peer file sharing protocols. Image Engine supports both free text and controlled vocabulary indexing of multimedia objects. The latter is implemented using the TView thesaurus model developed by the author. The current prototype of Image Engine uses the National Library of Medicine's Medical Subject Headings (MeSH) vocabulary (with UMLS Meta-1 extensions) as its indexing thesaurus. PMID:8130596

  19. Conjunctival malignant melanoma: A rare variant and review of important diagnostic and therapeutic considerations

    PubMed Central

    Albreiki, Danah H.; Gilberg, Steven M.; Farmer, James P.

    2012-01-01

    Malignant melanoma of the conjunctiva is a relatively infrequent neoplasm that can be associated with significant morbidity and cause diagnostic difficulty to both the ophthalmologist and pathologist. We herein describe the first reported case in North American and European databases of a rare variant, signet ring cell melanoma, arising in the background of primary acquired melanosis (PAM), and use this case as a review of important diagnostic and therapeutic considerations when faced with this condition. PMID:23960986

  20. Ankle fracture spur sign is pathognomonic for a variant ankle fracture.

    PubMed

    Hinds, Richard M; Garner, Matthew R; Lazaro, Lionel E; Warner, Stephen J; Loftus, Michael L; Birnbaum, Jacqueline F; Burket, Jayme C; Lorich, Dean G

    2015-02-01

    The hyperplantarflexion variant ankle fracture is composed of a posterior tibial lip fracture with posterolateral and posteromedial fracture fragments separated by a vertical fracture line. This infrequently reported injury pattern often includes an associated "spur sign" or double cortical density at the inferomedial tibial metaphysis. The objective of this study was to quantitatively establish the association of the ankle fracture spur sign with the hyperplantarflexion variant ankle fracture. Our clinical database of operative ankle fractures was retrospectively reviewed for the incidence of hyperplantarflexion variant and nonvariant ankle fractures as determined by assessment of injury radiographs, preoperative advanced imaging, and intraoperative observation. Injury radiographs were then evaluated for the presence of the spur sign, and association between the spur sign and variant fractures was analyzed. The incidence of the hyperplantarflexion variant fracture among all ankle fractures was 6.7% (43/640). The spur sign was present in 79% (34/43) of variant fractures and absent in all nonvariant fractures, conferring a specificity of 100% in identifying variant fractures. Positive predictive value and negative predictive value were 100% and 99%, respectively. The ankle fracture spur sign was pathognomonic for the hyperplantarflexion variant ankle fracture. It is important to identify variant fractures preoperatively as patient positioning, operative approach, and fixation construct of variant fractures often differ from those employed for osteosynthesis of nonvariant fractures. Identification of the spur sign should prompt acquisition of advanced imaging to formulate an appropriate operative plan to address the variant fracture pattern. Level III, retrospective comparative study. © The Author(s) 2014.
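
    The diagnostic statistics quoted above follow from a 2x2 table that can be rebuilt from the reported counts. The sketch below is an illustrative recomputation, not the authors' analysis; it assumes that "absent in all nonvariant fractures" means zero false positives among the 597 nonvariant fractures (640 minus 43), and the function name `diagnostic_metrics` is hypothetical.

```python
# Counts taken from the abstract.
total_fractures = 640
variant_fractures = 43
spur_sign_in_variant = 34

# Derived 2x2 table (assumption: no spur sign in any nonvariant fracture).
nonvariant_fractures = total_fractures - variant_fractures
true_pos = spur_sign_in_variant
false_neg = variant_fractures - spur_sign_in_variant
false_pos = 0
true_neg = nonvariant_fractures - false_pos


def diagnostic_metrics(tp, fp, fn, tn):
    """Return standard diagnostic test metrics as fractions."""
    return {
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
        "ppv": tp / (tp + fp),
        "npv": tn / (tn + fn),
    }


incidence = variant_fractures / total_fractures
metrics = diagnostic_metrics(true_pos, false_pos, false_neg, true_neg)

print(f"incidence: {incidence:.1%}")
for name, value in metrics.items():
    print(f"{name}: {value:.1%}")
```

    Under these assumptions the sensitivity is 34/43 (about 79%) and the negative predictive value is 597/606 (about 99%), matching the reported figures.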

  1. Changes in classification of genetic variants in BRCA1 and BRCA2.

    PubMed

    Kast, Karin; Wimberger, Pauline; Arnold, Norbert

    2018-02-01

    Classification of variants of unknown significance (VUS) in the breast cancer genes BRCA1 and BRCA2 changes with accumulating evidence for clinical relevance. In most cases down-staging towards neutral variants without clinical significance is possible. We searched the database of the German Consortium for Hereditary Breast and Ovarian Cancer (GC-HBOC) for changes in classification of genetic variants as an update to our earlier publication on genetic variants in the Centre of Dresden. Changes between 2015 and 2017 were recorded. In the group of variants of unclassified significance (VUS, Class 3, uncertain), only changes of classification towards neutral genetic variants were noted. In BRCA1, 25% of the Class 3 variants (n = 2/8) changed to Class 2 (likely benign) and Class 1 (benign). In BRCA2, in 50% of the Class 3 variants (n = 16/32), a change to Class 2 (n = 10/16) or Class 1 (n = 6/16) was observed. No change in classification was noted in Class 4 (likely pathogenic) and Class 5 (pathogenic) genetic variants in both genes. No up-staging from Class 1, Class 2 or Class 3 to more clinical significance was observed. All variants with a change in classification in our cohort were down-staged towards no clinical significance by a panel of experts of the German Consortium for Hereditary Breast and Ovarian Cancer (GC-HBOC). Prevention in families with Class 3 variants should be based on pedigree based risks and should not be guided by the presence of a VUS.

  2. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Chang, Y.L.; Chen, P.Y.; Tsai, Y.T.

    The crystallography of lenticular martensite, which formed in coarse austenite grains (size about 80 μm) after subzero treatment at −196 °C (liquid nitrogen) for different holding times, was investigated using electron backscatter diffraction (EBSD). For the sample treated with 15 min of isothermal holding, more than 50 martensite plates (with a thickness larger than 1 μm) that formed within a coarse austenite grain were employed to obtain the pole figures. The pole figures clearly indicated that the individual plate of lenticular martensite approximately adopted the Kurdjumov–Sachs (K–S) orientation relationship with respect to the austenite matrix. For the sample treated with 30 s of isothermal holding, a few martensite plates that formed in variant pairings in a coarse austenite grain were analyzed. The results showed that zigzag couplings (including spear couplings), the major product of plate martensite, had an absolute dominance of a specific variant pair (V1/V17). The orientation gradient within a lenticular martensite plate was also measured using convergent beam electron diffraction (CBED). The evidence strongly suggests that the spread in diffracted intensity within pole figures is related to the misorientation gradient within the lenticular martensite plate. - Highlights: • The orientation relationship between lenticular martensite and austenite was investigated by pole figures via Electron Backscatter Diffraction (EBSD). • The initial stage of lenticular martensite formation was investigated, excluding interference from hard impingement. • In addition to EBSD, convergent beam electron diffraction (CBED) was used to measure the misorientation angle from the midrib to the untwinned region in lenticular martensite plate. • Zigzag couplings (including spear couplings), the major product of plate martensite, had an absolute dominance of a specific variant pair (V1/V17).

  3. amamutdb.no: A relational database for MAN2B1 allelic variants that compiles genotypes, clinical phenotypes, and biochemical and structural data of mutant MAN2B1 in α-mannosidosis.

    PubMed

    Riise Stensland, Hilde Monica Frostad; Frantzen, Gabrio; Kuokkanen, Elina; Buvang, Elisabeth Kjeldsen; Klenow, Helle Bagterp; Heikinheimo, Pirkko; Malm, Dag; Nilssen, Øivind

    2015-06-01

    α-Mannosidosis is an autosomal recessive lysosomal storage disorder caused by mutations in the MAN2B1 gene, encoding lysosomal α-mannosidase. The disorder is characterized by a range of clinical phenotypes of which the major manifestations are mental impairment, hearing impairment, skeletal changes, and immunodeficiency. Here, we report an α-mannosidosis mutation database, amamutdb.no, which has been constructed as a publicly accessible online resource for recording and analyzing MAN2B1 variants (http://amamutdb.no). Our aim has been to offer structured and relational information on MAN2B1 mutations and genotypes along with associated clinical phenotypes. Classifying missense mutations, as pathogenic or benign, is a challenge. Therefore, they have been given special attention as we have compiled all available data that relate to their biochemical, functional, and structural properties. The α-mannosidosis mutation database is comprehensive and relational in the sense that information can be retrieved and compiled across datasets; hence, it will facilitate diagnostics and increase our understanding of the clinical and molecular aspects of α-mannosidosis. We believe that the amamutdb.no structure and architecture will be applicable for the development of databases for any monogenic disorder. © 2015 WILEY PERIODICALS, INC.
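
    The sense in which such a database is "relational" (information retrieved and compiled across datasets) can be illustrated with a small join. The sketch below uses Python's built-in `sqlite3` module; the table names, column names, and placeholder rows are hypothetical and do not reflect the actual amamutdb.no schema or its contents.

```python
import sqlite3

# In-memory database with a hypothetical three-table layout.
conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE variant (
        id INTEGER PRIMARY KEY,
        name TEXT,
        classification TEXT
    );
    CREATE TABLE patient (
        id INTEGER PRIMARY KEY,
        phenotype TEXT
    );
    CREATE TABLE genotype (
        patient_id INTEGER REFERENCES patient(id),
        variant_id INTEGER REFERENCES variant(id)
    );
    """
)

# Placeholder rows only; these are not real variants or patients.
conn.executemany(
    "INSERT INTO variant VALUES (?, ?, ?)",
    [(1, "variant_A", "pathogenic"), (2, "variant_B", "benign")],
)
conn.executemany(
    "INSERT INTO patient VALUES (?, ?)",
    [(1, "placeholder phenotype 1"), (2, "placeholder phenotype 2")],
)
conn.executemany(
    "INSERT INTO genotype VALUES (?, ?)",
    [(1, 1), (2, 2)],
)

# Compile phenotype and variant information across tables with joins.
rows = conn.execute(
    """
    SELECT p.id, p.phenotype, v.name
    FROM genotype AS g
    JOIN patient AS p ON p.id = g.patient_id
    JOIN variant AS v ON v.id = g.variant_id
    WHERE v.classification = 'pathogenic'
    ORDER BY p.id
    """
).fetchall()

for row in rows:
    print(row)
conn.close()
```

    Keeping genotypes in a separate linking table is one common way to let the same variant be associated with many patients without duplicating its annotation.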

  4. Impact of EML4-ALK Variant on Resistance Mechanisms and Clinical Outcomes in ALK-Positive Lung Cancer.

    PubMed

    Lin, Jessica J; Zhu, Viola W; Yoda, Satoshi; Yeap, Beow Y; Schrock, Alexa B; Dagogo-Jack, Ibiayi; Jessop, Nicholas A; Jiang, Ginger Y; Le, Long P; Gowen, Kyle; Stephens, Philip J; Ross, Jeffrey S; Ali, Siraj M; Miller, Vincent A; Johnson, Melissa L; Lovly, Christine M; Hata, Aaron N; Gainor, Justin F; Iafrate, Anthony J; Shaw, Alice T; Ou, Sai-Hong Ignatius

    2018-04-20

    Purpose Advanced anaplastic lymphoma kinase (ALK) fusion-positive non-small-cell lung cancers (NSCLCs) are effectively treated with ALK tyrosine kinase inhibitors (TKIs). However, clinical outcomes in these patients vary, and the benefit of TKIs is limited as a result of acquired resistance. Emerging data suggest that the ALK fusion variant may affect clinical outcome, but the molecular basis for this association is unknown. Patients and Methods We identified 129 patients with ALK-positive NSCLC with known ALK variants. ALK resistance mutations and clinical outcomes on ALK TKIs were retrospectively evaluated according to ALK variant. A Foundation Medicine data set of 577 patients with ALK-positive NSCLC was also examined. Results The most frequent ALK variants were EML4-ALK variant 1 in 55 patients (43%) and variant 3 in 51 patients (40%). We analyzed 77 tumor biopsy specimens from patients with variants 1 and 3 who had progressed on an ALK TKI. ALK resistance mutations were significantly more common in variant 3 than in variant 1 (57% v 30%; P = .023). In particular, ALK G1202R was more common in variant 3 than in variant 1 (32% v 0%; P < .001). Analysis of the Foundation Medicine database revealed similar associations of variant 3 with ALK resistance mutation and with G1202R (P = .010 and .015, respectively). Among patients treated with the third-generation ALK TKI lorlatinib, variant 3 was associated with a significantly longer progression-free survival than variant 1 (hazard ratio, 0.31; 95% CI, 0.12 to 0.79; P = .011). Conclusion Specific ALK variants may be associated with the development of ALK resistance mutations, particularly G1202R, and provide a molecular link between variant and clinical outcome. ALK variant thus represents a potentially important factor in the selection of next-generation ALK inhibitors.

  5. Constraints on Biological Mechanism from Disease Comorbidity Using Electronic Medical Records and Database of Genetic Variants

    PubMed Central

    Bagley, Steven C.; Sirota, Marina; Chen, Richard; Butte, Atul J.; Altman, Russ B.

    2016-01-01

    Patterns of disease co-occurrence that deviate from statistical independence may represent important constraints on biological mechanism, which sometimes can be explained by shared genetics. In this work we study the relationship between disease co-occurrence and commonly shared genetic architecture of disease. Records of pairs of diseases were combined from two different electronic medical systems (Columbia, Stanford), and compared to a large database of published disease-associated genetic variants (VARIMED); data on 35 disorders were available across all three sources, which include medical records for over 1.2 million patients and variants from over 17,000 publications. Based on the sources in which they appeared, disease pairs were categorized as having predominant clinical, genetic, or both kinds of manifestations. Confounding effects of age on disease incidence were controlled for by only comparing diseases when they fall in the same cluster of similarly shaped incidence patterns. We find that disease pairs that are overrepresented in both electronic medical record systems and in VARIMED come from two main disease classes, autoimmune and neuropsychiatric. We furthermore identify specific genes that are shared within these disease groups. PMID:27115429
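
    As a rough illustration of what "deviation from statistical independence" can mean for a disease pair, the sketch below compares the observed number of patients with both diseases against the number expected if the two diseases occurred independently. The function name and all counts are hypothetical and are not taken from the study; the age-based clustering the abstract describes is not modelled here.

```python
def cooccurrence_ratio(n_total, n_a, n_b, n_both):
    """Observed / expected co-occurrence of diseases A and B.

    Under independence, P(A and B) = P(A) * P(B), so the expected number of
    patients with both diseases is n_a * n_b / n_total. A ratio well above 1
    means the pair co-occurs more often than independence would predict.
    """
    if min(n_total, n_a, n_b) <= 0:
        raise ValueError("counts must be positive")
    expected = n_a * n_b / n_total
    return n_both / expected


# Hypothetical counts, for illustration only.
ratio = cooccurrence_ratio(n_total=100_000, n_a=2_000, n_b=5_000, n_both=300)
print(ratio)  # 3.0: three times the co-occurrence expected under independence
```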

  6. Constraints on Biological Mechanism from Disease Comorbidity Using Electronic Medical Records and Database of Genetic Variants.

    PubMed

    Bagley, Steven C; Sirota, Marina; Chen, Richard; Butte, Atul J; Altman, Russ B

    2016-04-01

    Patterns of disease co-occurrence that deviate from statistical independence may represent important constraints on biological mechanism, which sometimes can be explained by shared genetics. In this work we study the relationship between disease co-occurrence and commonly shared genetic architecture of disease. Records of pairs of diseases were combined from two different electronic medical systems (Columbia, Stanford), and compared to a large database of published disease-associated genetic variants (VARIMED); data on 35 disorders were available across all three sources, which include medical records for over 1.2 million patients and variants from over 17,000 publications. Based on the sources in which they appeared, disease pairs were categorized as having predominant clinical, genetic, or both kinds of manifestations. Confounding effects of age on disease incidence were controlled for by only comparing diseases when they fall in the same cluster of similarly shaped incidence patterns. We find that disease pairs that are overrepresented in both electronic medical record systems and in VARIMED come from two main disease classes, autoimmune and neuropsychiatric. We furthermore identify specific genes that are shared within these disease groups.

  7. Concept-oriented indexing of video databases: toward semantic sensitive retrieval and browsing.

    PubMed

    Fan, Jianping; Luo, Hangzai; Elmagarmid, Ahmed K

    2004-07-01

    Digital video now plays an important role in medical education, health care, telemedicine and other medical applications. Several content-based video retrieval (CBVR) systems have been proposed in the past, but they still suffer from the following challenging problems: semantic gap, semantic video concept modeling, semantic video classification, and concept-oriented video database indexing and access. In this paper, we propose a novel framework that makes some advances toward the final goal of solving these problems. Specifically, the framework includes: 1) a semantic-sensitive video content representation framework that uses principal video shots to enhance the quality of features; 2) semantic video concept interpretation that uses a flexible mixture model to bridge the semantic gap; 3) a novel semantic video-classifier training framework that integrates feature selection, parameter estimation, and model selection seamlessly in a single algorithm; and 4) a concept-oriented video database organization technique that uses a domain-dependent concept hierarchy to enable semantic-sensitive video retrieval and browsing.

  8. An ECG storage and retrieval system embedded in client server HIS utilizing object-oriented DB.

    PubMed

    Wang, C; Ohe, K; Sakurai, T; Nagase, T; Kaihara, S

    1996-02-01

    In the University of Tokyo Hospital, the improved client server HIS has been applied to clinical practice and physicians can order prescription, laboratory examination, ECG examination and radiographic examination, etc. directly by themselves and read results of these examinations, except medical signal waves, schema and image, on UNIX workstations. Recently, we designed and developed an ECG storage and retrieval system embedded in the client server HIS utilizing object-oriented database to take the first step in dealing with digitized signal, schema and image data and show waves, graphics, and images directly to physicians by the client server HIS. The system was developed based on object-oriented analysis and design, and implemented with object-oriented database management system (OODMS) and C++ programming language. In this paper, we describe the ECG data model, functions of the storage and retrieval system, features of user interface and the result of its implementation in the HIS.

  9. EFHC1 variants in juvenile myoclonic epilepsy: reanalysis according to NHGRI and ACMG guidelines for assigning disease causality.

    PubMed

    Bailey, Julia N; Patterson, Christopher; de Nijs, Laurence; Durón, Reyna M; Nguyen, Viet-Huong; Tanaka, Miyabi; Medina, Marco T; Jara-Prado, Aurelio; Martínez-Juárez, Iris E; Ochoa, Adriana; Molina, Yolli; Suzuki, Toshimitsu; Alonso, María E; Wight, Jenny E; Lin, Yu-Chen; Guilhoto, Laura; Targas Yacubian, Elza Marcia; Machado-Salas, Jesús; Daga, Andrea; Yamakawa, Kazuhiro; Grisar, Thierry M; Lakaye, Bernard; Delgado-Escueta, Antonio V

    2017-02-01

    EFHC1 variants are the most common mutations in inherited myoclonic and grand mal clonic-tonic-clonic (CTC) convulsions of juvenile myoclonic epilepsy (JME). We reanalyzed 54 EFHC1 variants associated with epilepsy from 17 cohorts based on National Human Genome Research Institute (NHGRI) and American College of Medical Genetics and Genomics (ACMG) guidelines for interpretation of sequence variants. We calculated Bayesian LOD scores for variants in coinheritance, unconditional exact tests and odds ratios (OR) in case-control associations, allele frequencies in genome databases, and predictions for conservation/pathogenicity. We reviewed whether variants damage EFHC1 functions, whether efhc1 -/- KO mice recapitulate CTC convulsions and "microdysgenesis" neuropathology, and whether supernumerary synaptic and dendritic phenotypes can be rescued in the fly model when EFHC1 is overexpressed. We rated strengths of evidence and applied ACMG combinatorial criteria for classifying variants. Nine variants were classified as "pathogenic," 14 as "likely pathogenic," 9 as "benign," and 2 as "likely benign." Twenty variants of unknown significance had an insufficient number of ancestry-matched controls, but ORs exceeded 5 when compared with racial/ethnic-matched Exome Aggregation Consortium (ExAC) controls. NHGRI gene-level evidence and variant-level evidence establish EFHC1 as the first non-ion channel microtubule-associated protein whose mutations disturb R-type VDCC and TRPM2 calcium currents in overgrown synapses and dendrites within abnormally migrated dislocated neurons, thus explaining CTC convulsions and "microdysgenesis" neuropathology of JME. Genet Med 19(2), 144-156.

  10. Role of GLI2 in hypopituitarism phenotype.

    PubMed

    Arnhold, Ivo J P; França, Marcela M; Carvalho, Luciani R; Mendonca, Berenice B; Jorge, Alexander A L

    2015-06-01

    GLI2 is a zinc-finger transcription factor involved in the Sonic Hedgehog pathway. Gli2 mutant mice have hypoplastic anterior and absent posterior pituitary glands. We reviewed the literature for patients with hypopituitarism and alterations in GLI2. Twenty-five patients (16 families) had heterozygous truncating mutations, and the phenotype frequently included GH deficiency, a small anterior pituitary lobe and an ectopic/undescended posterior pituitary lobe on magnetic resonance imaging and postaxial polydactyly. The inheritance pattern was autosomal dominant with incomplete penetrance and variable expressivity. The mutation was frequently inherited from an asymptomatic parent. Eleven patients had heterozygous non-synonymous GLI2 variants that were classified as variants of unknown significance, because they were either absent from or had a frequency lower than 0.001 in the databases. In these patients, the posterior pituitary was also ectopic, but none had polydactyly. A third group of variants found in patients with hypopituitarism were considered benign because their frequency was ≥ 0.001 in the databases. GLI2 is a large and polymorphic gene, and sequencing may identify variants that are difficult to interpret. Incomplete penetrance implies the participation of other genetic and/or environmental factors. An interaction between Gli2 mutations and prenatal ethanol exposure has been demonstrated in mouse dysmorphology. In conclusion, a relatively high frequency of GLI2 mutations and variants was identified in patients with congenital GH deficiency without other brain defects, and most of these patients presented with combined pituitary hormone deficiency and an ectopic posterior pituitary lobe. Future studies may clarify the relative role and frequency of GLI2 alterations in the aetiology of hypopituitarism. © 2015 Society for Endocrinology.
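
    The frequency rule quoted in this abstract for non-synonymous variants (absent from the databases or below 0.001: unknown significance; at or above 0.001: benign) can be written as a short sketch. The function and constant names are hypothetical, and the sketch covers only this one rule; truncating mutations are treated as a separate group in the abstract and are not handled here.

```python
from typing import Optional

FREQUENCY_CUTOFF = 0.001  # threshold quoted in the abstract


def classify_by_frequency(allele_frequency: Optional[float]) -> str:
    """Apply the database-frequency rule to one non-synonymous variant.

    allele_frequency is None when the variant is absent from the databases.
    """
    if allele_frequency is None or allele_frequency < FREQUENCY_CUTOFF:
        return "variant of unknown significance"
    return "benign"


# Hypothetical inputs, for illustration only.
for frequency in (None, 0.0004, 0.001, 0.03):
    print(frequency, "->", classify_by_frequency(frequency))
```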

  11. Comprehensive mutational analysis of the M13 major coat protein: improved scaffolds for C-terminal phage display.

    PubMed

    Held, Heike A; Sidhu, Sachdev S

    2004-07-09

    A peptide was fused to the C terminus of the M13 bacteriophage major coat protein (P8), and libraries of P8 mutants were screened to select for variants that displayed the peptide with high efficiency. Over 600 variants were sequenced to compile a comprehensive database of P8 sequence diversity compatible with assembly into the wild-type phage coat. The database reveals that, while the alpha-helical P8 molecule was highly tolerant to mutations, certain functional epitopes were required for efficient incorporation. Three hydrophobic epitopes were located approximately equidistantly along the length of the alpha-helix. In addition, a positively charged epitope was required directly opposite the most C-terminal hydrophobic epitope and on the same side as the other two epitopes. Both ends of the protein were highly tolerant to mutations, consistent with the use of P8 as a scaffold for both N and C-terminal phage display. Further rounds of selection were used to enrich for P8 variants that supported higher levels of C-terminal peptide display. The largest improvements in display resulted from mutations around the junction between P8 and the C-terminal linker, and additional mutations in the N-terminal region were selected for further improvements in display. The best P8 variants improved C-terminal display more than 100-fold relative to the wild-type, and these variants could support the simultaneous display of N and C-terminal fusions. These findings provide information on the requirements for filamentous phage coat assembly, and provide improved scaffolds for phage display technology. Copyright 2004 Elsevier Ltd.

  12. APADB: a database for alternative polyadenylation and microRNA regulation events

    PubMed Central

    Müller, Sören; Rycak, Lukas; Afonso-Grunz, Fabian; Winter, Peter; Zawada, Adam M.; Damrath, Ewa; Scheider, Jessica; Schmäh, Juliane; Koch, Ina; Kahl, Günter; Rotter, Björn

    2014-01-01

    Alternative polyadenylation (APA) is a widespread mechanism that contributes to the sophisticated dynamics of gene regulation. Approximately 50% of all protein-coding human genes harbor multiple polyadenylation (PA) sites; their selective and combinatorial use gives rise to transcript variants with differing lengths of their 3′ untranslated region (3′UTR). Shortened variants escape UTR-mediated regulation by microRNAs (miRNAs), especially in cancer, where global 3′UTR shortening accelerates disease progression, dedifferentiation and proliferation. Here we present APADB, a database of vertebrate PA sites determined by 3′ end sequencing, using massive analysis of complementary DNA ends. APADB provides (A)PA sites for coding and non-coding transcripts of human, mouse and chicken genes. For human and mouse, several tissue types, including different cancer specimens, are available. APADB records the loss of predicted miRNA binding sites and visualizes next-generation sequencing reads that support each PA site in a genome browser. The database tables can either be browsed according to organism and tissue or alternatively searched for a gene of interest. APADB is the largest database of APA in human, chicken and mouse. The stored information provides experimental evidence for thousands of PA sites and APA events. APADB combines 3′ end sequencing data with prediction algorithms of miRNA binding sites, allowing further improvement of these prediction algorithms. Current databases lack correct information about 3′UTR lengths, especially for chicken, and APADB provides necessary information to close this gap. Database URL: http://tools.genxpro.net/apadb/ PMID:25052703

  13. Investigating Architectural Issues in Neuromorphic Computing

    DTIC Science & Technology

    2009-06-01

    An example of this is Diffusion Tensor Imaging (DTI), a variant of fMRI, which detects water diffusion. DTI is routinely applied at medical...model computed for a subfield positioned over a section of the silhouette dog’s hind leg. The illustrated angles roughly correspond to orientation

  14. Synthesis of spatially variant lattices.

    PubMed

    Rumpf, Raymond C; Pazos, Javier

    2012-07-02

    It is often desired to functionally grade and/or spatially vary a periodic structure like a photonic crystal or metamaterial, yet no general method for doing this has been offered in the literature. A straightforward procedure is described here that allows many properties of the lattice to be spatially varied at the same time while producing a final lattice that is still smooth and continuous. Properties include unit cell orientation, lattice spacing, fill fraction, and more. This adds many degrees of freedom to a design such as spatially varying the orientation to exploit directional phenomena. The method is not a coordinate transformation technique so it can more easily produce complicated and arbitrary spatial variance. To demonstrate, the algorithm is used to synthesize a spatially variant self-collimating photonic crystal to flow a Gaussian beam around a 90° bend. The performance of the structure was confirmed through simulation and it showed virtually no scattering around the bend that would have arisen if the lattice had defects or discontinuities.

  15. Systematic meta-analyses and field synopsis of genetic association studies in colorectal adenomas

    PubMed Central

    Montazeri, Zahra; Theodoratou, Evropi; Nyiraneza, Christine; Timofeeva, Maria; Chen, Wanjing; Svinti, Victoria; Sivakumaran, Shanya; Gresham, Gillian; Cubitt, Laura; Carvajal-Carmona, Luis; Bertagnolli, Monica M; Zauber, Ann G; Tomlinson, Ian; Farrington, Susan M; Dunlop, Malcolm G; Campbell, Harry; Little, Julian

    2018-01-01

    Background Low penetrance genetic variants, primarily single nucleotide polymorphisms, have substantial influence on colorectal cancer (CRC) susceptibility. Most CRCs develop from colorectal adenomas (CRA). Here, we report the first comprehensive field synopsis that catalogues all genetic association studies on CRA, with a parallel online database (http://www.chs.med.ed.ac.uk/CRAgene/). Methods We performed a systematic review, reviewing 9750 titles and then extracted data from 130 publications reporting on 181 polymorphisms in 74 genes. We conducted meta-analyses to derive summary effect estimates for 37 polymorphisms in 26 genes. We applied the Venice criteria and Bayesian False Discovery Probability (BFDP) to assess the levels of the credibility of associations. Results We considered the association with the rs6983267 variant at 8q24 as “highly credible”, reaching genome wide statistical significance in at least one meta-analysis model. We identified “less credible” associations (higher heterogeneity, lower statistical power, BFDP>0.02) with a further four variants of four independent genes: MTHFR c.677C>T p.A222V (rs1801133), TP53 c.215C>G p.R72P (rs1042522), NQO1 c.559C>T p.P187S (rs1800566), and NAT1 alleles imputed as fast acetylator genotypes. For the remaining 32 variants of 22 genes for which positive associations with CRA risk have been previously reported, the meta-analyses revealed no credible evidence to support these as true associations. Conclusions The limited number of credible associations between low penetrance genetic variants and CRA reflects the lower volume of evidence and associated lack of statistical power to detect associations of the magnitude typically observed for genetic variants and chronic diseases. The CRAgene database provides context for CRA genetic association data and will help inform future research directions. PMID:26451011
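
    The abstract does not say which meta-analysis models were used to derive the summary effect estimates. As one common possibility, the sketch below pools per-study odds ratios with fixed-effect inverse-variance weighting on the log scale. The choice of model, the function name and the input numbers are all assumptions made for illustration.

```python
import math


def pooled_odds_ratio(odds_ratios, standard_errors):
    """Fixed-effect inverse-variance pooling of odds ratios.

    standard_errors holds the standard error of log(OR) for each study.
    Returns (pooled OR, lower 95% bound, upper 95% bound).
    """
    weights = [1.0 / se ** 2 for se in standard_errors]
    log_ors = [math.log(value) for value in odds_ratios]
    total_weight = sum(weights)
    pooled_log = sum(w * x for w, x in zip(weights, log_ors)) / total_weight
    pooled_se = math.sqrt(1.0 / total_weight)
    z = 1.96  # normal quantile for a two-sided 95% interval
    return (
        math.exp(pooled_log),
        math.exp(pooled_log - z * pooled_se),
        math.exp(pooled_log + z * pooled_se),
    )


# Hypothetical study results, for illustration only.
estimate, low, high = pooled_odds_ratio([1.10, 1.25, 1.18], [0.08, 0.12, 0.10])
print(f"pooled OR {estimate:.2f} (95% CI {low:.2f} to {high:.2f})")
```

    More precise studies (smaller standard errors) receive larger weights, so they pull the pooled estimate toward their own value.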

  16. The BioLexicon: a large-scale terminological resource for biomedical text mining

    PubMed Central

    2011-01-01

    Background Due to the rapidly expanding body of biomedical literature, biologists require increasingly sophisticated and efficient systems to help them to search for relevant information. Such systems should account for the multiple written variants used to represent biomedical concepts, and allow the user to search for specific pieces of knowledge (or events) involving these concepts, e.g., protein-protein interactions. Such functionality requires access to detailed information about words used in the biomedical literature. Existing databases and ontologies often have a specific focus and are oriented towards human use. Consequently, biological knowledge is dispersed amongst many resources, which often do not attempt to account for the large and frequently changing set of variants that appear in the literature. Additionally, such resources typically do not provide information about how terms relate to each other in texts to describe events. Results This article provides an overview of the design, construction and evaluation of a large-scale lexical and conceptual resource for the biomedical domain, the BioLexicon. The resource can be exploited by text mining tools at several levels, e.g., part-of-speech tagging, recognition of biomedical entities, and the extraction of events in which they are involved. As such, the BioLexicon must account for real usage of words in biomedical texts. In particular, the BioLexicon gathers together different types of terms from several existing data resources into a single, unified repository, and augments them with new term variants automatically extracted from biomedical literature. Extraction of events is facilitated through the inclusion of biologically pertinent verbs (around which events are typically organized) together with information about typical patterns of grammatical and semantic behaviour, which are acquired from domain-specific texts. 
    In order to foster interoperability, the BioLexicon is modelled using the Lexical Markup Framework, an ISO standard. Conclusions The BioLexicon contains over 2.2 M lexical entries and over 1.8 M terminological variants, as well as over 3.3 M semantic relations, including over 2 M synonymy relations. Its exploitation can benefit both application developers and users. We demonstrate some such benefits by describing integration of the resource into a number of different tools, and evaluating improvements in performance that this can bring. PMID:21992002

  17. The BioLexicon: a large-scale terminological resource for biomedical text mining.

    PubMed

    Thompson, Paul; McNaught, John; Montemagni, Simonetta; Calzolari, Nicoletta; del Gratta, Riccardo; Lee, Vivian; Marchi, Simone; Monachini, Monica; Pezik, Piotr; Quochi, Valeria; Rupp, C J; Sasaki, Yutaka; Venturi, Giulia; Rebholz-Schuhmann, Dietrich; Ananiadou, Sophia

    2011-10-12

    Due to the rapidly expanding body of biomedical literature, biologists require increasingly sophisticated and efficient systems to help them to search for relevant information. Such systems should account for the multiple written variants used to represent biomedical concepts, and allow the user to search for specific pieces of knowledge (or events) involving these concepts, e.g., protein-protein interactions. Such functionality requires access to detailed information about words used in the biomedical literature. Existing databases and ontologies often have a specific focus and are oriented towards human use. Consequently, biological knowledge is dispersed amongst many resources, which often do not attempt to account for the large and frequently changing set of variants that appear in the literature. Additionally, such resources typically do not provide information about how terms relate to each other in texts to describe events. This article provides an overview of the design, construction and evaluation of a large-scale lexical and conceptual resource for the biomedical domain, the BioLexicon. The resource can be exploited by text mining tools at several levels, e.g., part-of-speech tagging, recognition of biomedical entities, and the extraction of events in which they are involved. As such, the BioLexicon must account for real usage of words in biomedical texts. In particular, the BioLexicon gathers together different types of terms from several existing data resources into a single, unified repository, and augments them with new term variants automatically extracted from biomedical literature. Extraction of events is facilitated through the inclusion of biologically pertinent verbs (around which events are typically organized) together with information about typical patterns of grammatical and semantic behaviour, which are acquired from domain-specific texts. 
    In order to foster interoperability, the BioLexicon is modelled using the Lexical Markup Framework, an ISO standard. The BioLexicon contains over 2.2 M lexical entries and over 1.8 M terminological variants, as well as over 3.3 M semantic relations, including over 2 M synonymy relations. Its exploitation can benefit both application developers and users. We demonstrate some such benefits by describing integration of the resource into a number of different tools, and evaluating improvements in performance that this can bring.

  18. A Database Practicum for Teaching Database Administration and Software Development at Regis University

    ERIC Educational Resources Information Center

    Mason, Robert T.

    2013-01-01

    This research paper compares a database practicum at the Regis University College for Professional Studies (CPS) with technology oriented practicums at other universities. Successful andragogy for technology courses can motivate students to develop a genuine interest in the subject, share their knowledge with peers and can inspire students to…

  19. A Database Design and Development Case: NanoTEK Networks

    ERIC Educational Resources Information Center

    Ballenger, Robert M.

    2010-01-01

    This case provides a real-world project-oriented case study for students enrolled in a management information systems, database management, or systems analysis and design course in which database design and development are taught. The case consists of a business scenario to provide background information and details of the unique operating…

  20. Fine structure characterization of martensite/austenite constituent in low-carbon low-alloy steel by transmission electron forward scatter diffraction.

    PubMed

    Li, C W; Han, L Z; Luo, X M; Liu, Q D; Gu, J F

    2016-11-01

    Transmission electron forward scatter diffraction and other characterization techniques were used to investigate the fine structure and the variant relationship of the martensite/austenite (M/A) constituent of the granular bainite in low-carbon low-alloy steel. The results demonstrated that the M/A constituents were distributed in clusters throughout the bainitic ferrite. Lath martensite was the main component of the M/A constituent, where the relationship between the martensite variants was consistent with the Nishiyama-Wassermann orientation relationship and only three variants were found in the M/A constituent, suggesting that the variants had formed in the M/A constituent according to a specific mechanism. Furthermore, the Σ3 boundaries in the M/A constituent were much longer than their counterparts in the bainitic ferrite region. The results indicate that transmission electron forward scatter diffraction is an effective method of crystallographic analysis for nanolaths in M/A constituents. © 2016 The Authors Journal of Microscopy © 2016 Royal Microscopical Society.

  1. Rapid functional analysis of computationally complex rare human IRF6 gene variants using a novel zebrafish model.

    PubMed

    Li, Edward B; Truong, Dawn; Hallett, Shawn A; Mukherjee, Kusumika; Schutte, Brian C; Liao, Eric C

    2017-09-01

    Large-scale sequencing efforts have captured a rapidly growing catalogue of genetic variations. However, the accurate establishment of gene variant pathogenicity remains a central challenge in translating personal genomics information to clinical decisions. Interferon Regulatory Factor 6 (IRF6) gene variants are significant genetic contributors to orofacial clefts. Although approximately three hundred IRF6 gene variants have been documented, their effects on protein functions remain difficult to interpret. Here, we demonstrate the protein functions of human IRF6 missense gene variants could be rapidly assessed in detail by their abilities to rescue the irf6 -/- phenotype in zebrafish through variant mRNA microinjections at the one-cell stage. The results revealed many missense variants previously predicted by traditional statistical and computational tools to be loss-of-function and pathogenic retained partial or full protein function and rescued the zebrafish irf6 -/- periderm rupture phenotype. Through mRNA dosage titration and analysis of the Exome Aggregation Consortium (ExAC) database, IRF6 missense variants were grouped by their abilities to rescue at various dosages into three functional categories: wild type function, reduced function, and complete loss-of-function. This sensitive and specific biological assay was able to address the nuanced functional significances of IRF6 missense gene variants and overcome many limitations faced by current statistical and computational tools in assigning variant protein function and pathogenicity. Furthermore, it unlocked the possibility for characterizing yet undiscovered human IRF6 missense gene variants from orofacial cleft patients, and illustrated a generalizable functional genomics paradigm in personalized medicine.

  2. ARACHNID: A prototype object-oriented database tool for distributed systems

    NASA Technical Reports Server (NTRS)

    Younger, Herbert; Oreilly, John; Frogner, Bjorn

    1994-01-01

    This paper discusses the results of a Phase 2 SBIR project sponsored by NASA and performed by MIMD Systems, Inc. A major objective of this project was to develop specific concepts for improved performance in accessing large databases. An object-oriented and distributed approach was used for the general design, while a geographical decomposition was used as a specific solution. The resulting software framework is called ARACHNID. The Faint Source Catalog developed by NASA was the initial database testbed. This is a database of many gigabytes, where an order of magnitude improvement in query speed is being sought. This database contains faint infrared point sources obtained from telescope measurements of the sky. A geographical decomposition of this database is an attractive approach to dividing it into pieces. Each piece can then be searched on individual processors with only a weak data linkage between the processors being required. As a further demonstration of the concepts implemented in ARACHNID, a tourist information system is discussed. This version of ARACHNID is the commercial result of the project. It is a distributed, networked, database application where speed, maintenance, and reliability are important considerations. This paper focuses on the design concepts and technologies that form the basis for ARACHNID.
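
    The abstract does not describe how ARACHNID actually divides the catalog. As a hedged sketch of the general idea of geographical decomposition, the code below splits point sources into equal-width right-ascension bands and answers a range query by touching only the bands that overlap the range. The banding scheme, function names and sample sources are all assumptions for illustration.

```python
from collections import defaultdict


def partition_by_ra(sources, n_pieces):
    """Split (ra_deg, dec_deg, name) tuples into equal-width RA bands."""
    width = 360.0 / n_pieces
    pieces = defaultdict(list)
    for ra, dec, name in sources:
        index = min(int((ra % 360.0) / width), n_pieces - 1)
        pieces[index].append((ra, dec, name))
    return pieces


def query_ra_range(pieces, n_pieces, ra_min, ra_max):
    """Names of sources with ra_min <= ra <= ra_max.

    Assumes 0 <= ra_min <= ra_max < 360. Only overlapping bands are visited;
    in a distributed setting each band could live on its own processor.
    """
    width = 360.0 / n_pieces
    first = int(ra_min / width)
    last = min(int(ra_max / width), n_pieces - 1)
    hits = []
    for index in range(first, last + 1):
        for ra, dec, name in pieces.get(index, []):
            if ra_min <= ra <= ra_max:
                hits.append(name)
    return hits


# Hypothetical sources, for illustration only.
catalog = [(10.0, 5.0, "a"), (100.0, -20.0, "b"), (190.0, 40.0, "c"), (355.0, 0.0, "d")]
bands = partition_by_ra(catalog, n_pieces=4)
print(query_ra_range(bands, 4, 90.0, 200.0))  # ['b', 'c']
```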

  3. Systematic documentation and analysis of human genetic variation in hemoglobinopathies using the microattribution approach.

    PubMed

    Giardine, Belinda; Borg, Joseph; Higgs, Douglas R; Peterson, Kenneth R; Philipsen, Sjaak; Maglott, Donna; Singleton, Belinda K; Anstee, David J; Basak, A Nazli; Clark, Barnaby; Costa, Flavia C; Faustino, Paula; Fedosyuk, Halyna; Felice, Alex E; Francina, Alain; Galanello, Renzo; Gallivan, Monica V E; Georgitsi, Marianthi; Gibbons, Richard J; Giordano, Piero C; Harteveld, Cornelis L; Hoyer, James D; Jarvis, Martin; Joly, Philippe; Kanavakis, Emmanuel; Kollia, Panagoula; Menzel, Stephan; Miller, Webb; Moradkhani, Kamran; Old, John; Papachatzopoulou, Adamantia; Papadakis, Manoussos N; Papadopoulos, Petros; Pavlovic, Sonja; Perseu, Lucia; Radmilovic, Milena; Riemer, Cathy; Satta, Stefania; Schrijver, Iris; Stojiljkovic, Maja; Thein, Swee Lay; Traeger-Synodinos, Jan; Tully, Ray; Wada, Takahito; Waye, John S; Wiemann, Claudia; Zukic, Branka; Chui, David H K; Wajcman, Henri; Hardison, Ross C; Patrinos, George P

    2011-03-20

    We developed a series of interrelated locus-specific databases to store all published and unpublished genetic variation related to hemoglobinopathies and thalassemia and implemented microattribution to encourage submission of unpublished observations of genetic variation to these public repositories. A total of 1,941 unique genetic variants in 37 genes, encoding globins and other erythroid proteins, are currently documented in these databases, with reciprocal attribution of microcitations to data contributors. Our project provides the first example of implementing microattribution to incentivise submission of all known genetic variation in a defined system. It has demonstrably increased the reporting of human variants, leading to a comprehensive online resource for systematically describing human genetic variation in the globin genes and other genes contributing to hemoglobinopathies and thalassemias. The principles established here will serve as a model for other systems and for the analysis of other common and/or complex human genetic diseases.

  4. Palm-Vein Classification Based on Principal Orientation Features

    PubMed Central

    Zhou, Yujia; Liu, Yaqin; Feng, Qianjin; Yang, Feng; Huang, Jing; Nie, Yixiao

    2014-01-01

    Personal recognition using palm–vein patterns has emerged as a promising alternative for human recognition because of its uniqueness, stability, live body identification, flexibility, and difficulty to cheat. With the expanding application of palm–vein pattern recognition, the corresponding growth of the database has resulted in a long response time. To shorten the response time of identification, this paper proposes a simple and useful classification for palm–vein identification based on principal direction features. In the registration process, the Gaussian-Radon transform is adopted to extract the orientation matrix and then compute the principal direction of a palm–vein image based on the orientation matrix. The database can be classified into six bins based on the value of the principal direction. In the identification process, the principal direction of the test sample is first extracted to ascertain the corresponding bin. One-by-one matching with the training samples is then performed in the bin. To improve recognition efficiency while maintaining better recognition accuracy, two neighborhood bins of the corresponding bin are continuously searched to identify the input palm–vein image. Evaluation experiments are conducted on three different databases, namely, PolyU, CASIA, and the database of this study. Experimental results show that the searching range of one test sample in PolyU, CASIA and our database by the proposed method for palm–vein identification can be reduced to 14.29%, 14.50%, and 14.28%, with retrieval accuracy of 96.67%, 96.00%, and 97.71%, respectively. With 10,000 training samples in the database, the execution time of the identification process by the traditional method is 18.56 s, while that by the proposed approach is 3.16 s. The experimental results confirm that the proposed approach is more efficient than the traditional method, especially for a large database. PMID:25383715
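
    The bin-based search described above can be sketched as follows. This is not the authors' implementation: it assumes principal directions lie in [0, 180) degrees and are split into six equal bins, that neighbouring bins wrap around, and that neighbours are consulted only when the home bin gives no match. The feature representation and the distance function are placeholders supplied by the caller.

```python
N_BINS = 6
BIN_WIDTH = 180.0 / N_BINS  # assumes directions lie in [0, 180) degrees


def bin_index(direction_deg):
    """Map a principal direction to one of the six bins."""
    return int((direction_deg % 180.0) // BIN_WIDTH) % N_BINS


def build_bins(templates):
    """templates: iterable of (label, direction_deg, features)."""
    bins = [[] for _ in range(N_BINS)]
    for label, direction, features in templates:
        bins[bin_index(direction)].append((label, features))
    return bins


def identify(bins, direction_deg, features, distance, threshold):
    """Search the sample's own bin first, then its two neighbouring bins."""
    home = bin_index(direction_deg)
    for index in (home, (home - 1) % N_BINS, (home + 1) % N_BINS):
        for label, stored in bins[index]:
            if distance(features, stored) <= threshold:
                return label
    return None  # no match in the three bins searched


# Hypothetical one-number "features", for illustration only.
enrolled = build_bins([("alice", 10.0, 1.0), ("bob", 40.0, 5.0), ("carol", 100.0, 9.0)])
print(identify(enrolled, 29.0, 5.2, lambda a, b: abs(a - b), 0.5))  # bob
```

    In the example, the test sample falls in the first bin but its match sits just across the bin boundary, which is the case the neighbour search is meant to catch.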

  5. Efficient hemodynamic event detection utilizing relational databases and wavelet analysis

    NASA Technical Reports Server (NTRS)

    Saeed, M.; Mark, R. G.

    2001-01-01

    Development of a temporal query framework for time-oriented medical databases has hitherto been a challenging problem. We describe a novel method for the detection of hemodynamic events in multiparameter trends utilizing wavelet coefficients in a MySQL relational database. Storage of the wavelet coefficients allowed for a compact representation of the trends, and provided robust descriptors for the dynamics of the parameter time series. A data model was developed to allow for simplified queries along several dimensions and time scales. Of particular importance, the data model and wavelet framework allowed for queries to be processed with minimal table-join operations. A web-based search engine was developed to allow for user-defined queries. Typical queries required between 0.01 and 0.02 seconds, with at least two orders of magnitude improvement in speed over conventional queries. This powerful and innovative structure will facilitate research on large-scale time-oriented medical databases.
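
    As a rough sketch of the idea of storing wavelet coefficients in a relational table and querying them without joins: the example below uses an unnormalised Haar transform and Python's built-in `sqlite3` module. Both are stand-ins chosen for illustration; the abstract names MySQL and does not state which wavelet was used, and the table layout and function names here are hypothetical.

```python
import sqlite3


def haar_details(signal):
    """Return {level: [detail coefficients]} for a signal of length 2**k.

    Each step replaces pairs (a, b) with their average and keeps the
    half-difference (a - b) / 2 as the detail coefficient.
    """
    details = {}
    approx = list(signal)
    level = 1
    while len(approx) > 1:
        pairs = list(zip(approx[0::2], approx[1::2]))
        details[level] = [(a - b) / 2.0 for a, b in pairs]
        approx = [(a + b) / 2.0 for a, b in pairs]
        level += 1
    return details


def store(conn, parameter, signal):
    """Write every detail coefficient of one trend into a single table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS wavelet "
        "(parameter TEXT, level INTEGER, position INTEGER, coeff REAL)"
    )
    rows = [
        (parameter, level, pos, c)
        for level, coeffs in haar_details(signal).items()
        for pos, c in enumerate(coeffs)
    ]
    conn.executemany("INSERT INTO wavelet VALUES (?, ?, ?, ?)", rows)


def find_events(conn, parameter, level, min_abs_coeff):
    """Single-table query: large coefficients at one time scale."""
    cur = conn.execute(
        "SELECT position, coeff FROM wavelet "
        "WHERE parameter = ? AND level = ? AND ABS(coeff) >= ? "
        "ORDER BY position",
        (parameter, level, min_abs_coeff),
    )
    return cur.fetchall()
```

    A sudden drop in a trend shows up as a large detail coefficient at the matching level and position, so an event query reduces to a filter on one table.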

  6. Building a genome database using an object-oriented approach.

    PubMed

    Barbasiewicz, Anna; Liu, Lin; Lang, B Franz; Burger, Gertraud

    2002-01-01

    GOBASE is a relational database that integrates data associated with mitochondria and chloroplasts. The most important data in GOBASE, i.e., molecular sequences and taxonomic information, are obtained from the public sequence data repository at the National Center for Biotechnology Information (NCBI), and are validated by our experts. Maintaining a curated genomic database comes with a towering labor cost, due to the sheer volume of available genomic sequences and the plethora of annotation errors and omissions in records retrieved from public repositories. Here we describe our approach to increase automation of the database population process, thereby reducing manual intervention. As a first step, we used Unified Modeling Language (UML) to construct a list of potential errors. Each case was evaluated independently, and an expert solution was devised, and represented as a diagram. Subsequently, the UML diagrams were used as templates for writing object-oriented automation programs in the Java programming language.

  7. Embedding CLIPS in a database-oriented diagnostic system

    NASA Technical Reports Server (NTRS)

    Conway, Tim

    1990-01-01

    This paper describes the integration of C Language Production Systems (CLIPS) into a powerful portable maintenance aid (PMA) system used for flightline diagnostics. The current diagnostic target of the system is the Garrett GTCP85-180L, a gas turbine engine used as an Auxiliary Power Unit (APU) on some C-130 military transport aircraft. This project is a database oriented approach to a generic diagnostic system. CLIPS is used for 'many-to-many' pattern matching within the diagnostics process. Patterns are stored in database format, and CLIPS code is generated by a 'compilation' process on the database. Multiple CLIPS rule sets and working memories (in sequence) are supported and communication between the rule sets is achieved via the export and import commands. Work is continuing on using CLIPS in other portions of the diagnostic system and in re-implementing the diagnostic system in the Ada language.

  8. Beyond Same-Sex Attraction: Gender-Variant-Based Victimization Is Associated with Suicidal Behavior and Substance Use for Other-Sex Attracted Adolescents

    PubMed Central

    Chen, Peter Y.; Cigularov, Konstantin P.; Tomazic, Rocco G.

    2015-01-01

    Gender-variant-based victimization is victimization based on the way others perceive an individual to convey masculine, feminine, and androgynous characteristics through their appearance, mannerisms, and behaviors. Previous work identifies gender-variant-based victimization as a risk factor for health-risking outcomes among same-sex attracted youths. The current study seeks to examine this relationship among other-sex attracted youths and same-sex attracted youths, and determine if gender-variant-based victimization is similarly or differentially associated with poor outcomes between these two groups. Anonymous data from a school-based survey of 2,438 racially diverse middle and high school students in the Eastern U.S. were examined. For other-sex attracted adolescents, gender-variant-based victimization was associated with higher odds of suicidal thoughts and behaviors, regular use of cigarettes, and drug use. When compared to same-sex attracted adolescents, the harmful relationship between gender-variant-based victimization and each of these outcomes was similar in nature. These findings suggest that gender-variant-based victimization has potentially serious implications for the psychological wellbeing and substance use of other-sex attracted adolescents, not just same-sex attracted adolescents, supporting the need to address gender expression as a basis for victimization separate from sexuality- or gender-minority status. The impact that gender-variant-based victimization has on all adolescents should not be overlooked in research and interventions aimed at addressing sexual orientation-based and gender-variant-based victimization, substance use, and suicide prevention. PMID:26068796

  9. A polarimetric scattering database for non-spherical ice particles at microwave wavelengths

    NASA Astrophysics Data System (ADS)

    Lu, Yinghui; Jiang, Zhiyuan; Aydin, Kultegin; Verlinde, Johannes; Clothiaux, Eugene E.; Botta, Giovanni

    2016-10-01

    The atmospheric science community has entered a period in which electromagnetic scattering properties at microwave frequencies of realistically constructed ice particles are necessary for making progress on a number of fronts. One front includes retrieval of ice-particle properties and signatures from ground-based, airborne, and satellite-based radar and radiometer observations. Another front is evaluation of model microphysics by application of forward operators to their outputs and comparison to observations during case study periods. Yet a third front is data assimilation, where again forward operators are applied to databases of ice-particle scattering properties and the results compared to observations, with their differences leading to corrections of the model state. Over the past decade investigators have developed databases of ice-particle scattering properties at microwave frequencies and made them openly available. Motivated by and complementing these earlier efforts, a database containing polarimetric single-scattering properties of various types of ice particles at millimeter to centimeter wavelengths is presented. While the database presented here contains only single-scattering properties of ice particles in a fixed orientation, ice-particle scattering properties are computed for many different directions of the radiation incident on them. These results are useful for understanding the dependence of ice-particle scattering properties on ice-particle orientation with respect to the incident radiation. For ice particles that are small compared to the wavelength, the number of incident directions of the radiation is sufficient to compute reasonable estimates of their (randomly) orientation-averaged scattering properties. 
    This database is complementary to earlier ones in that it contains complete (polarimetric) scattering property information for each ice particle - 44 plates, 30 columns, 405 branched planar crystals, 660 aggregates, and 640 conical graupel - and direction of incident radiation but is limited to four frequencies (X-, Ku-, Ka-, and W-bands), does not include temperature dependencies of the single-scattering properties, and does not include scattering properties averaged over randomly oriented ice particles. Rules for constructing the morphologies of ice particles from one database to the next often differ; consequently, analyses that incorporate all of the different databases will contain the most variability, while illuminating important differences between them. Publication of this database is in support of future analyses of this nature and comes with the hope that doing so helps contribute to the development of a database standard for ice-particle scattering properties, like the NetCDF (Network Common Data Form) CF (Climate and Forecast) or NetCDF CF/Radial metadata conventions.

  10. A service-oriented data access control model

    NASA Astrophysics Data System (ADS)

    Meng, Wei; Li, Fengmin; Pan, Juchen; Song, Song; Bian, Jiali

    2017-01-01

    The development of mobile computing, cloud computing, and distributed computing meets growing individual service needs. In complex application systems, ensuring real-time, dynamic, and fine-grained data access control is an urgent problem. After analyzing common data access control models, the paper proposes a service-oriented access control model built on the mandatory access control model. By treating system services as subjects and database data as objects, the model defines access levels and access identification for subjects and objects, and ensures that system services access databases securely.
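
    One way to picture a service-as-subject, data-as-object check is sketched below. The abstract does not give the model's actual rules, so the decision logic here (a "no read up" level comparison plus an identifier match) is an assumption borrowed from classic mandatory access control, and the names `Service`, `DataObject`, and `can_read` are hypothetical.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Service:
    """Subject: a system service with an access level and identifiers."""
    name: str
    level: int
    identifiers: frozenset = field(default_factory=frozenset)


@dataclass(frozen=True)
class DataObject:
    """Object: a piece of database data with a level and one identifier."""
    name: str
    level: int
    identifier: str


def can_read(service, data):
    """Assumed rule: level must dominate and the identifier must match."""
    return service.level >= data.level and data.identifier in service.identifiers
```

    For example, a service at level 2 holding the identifier "finance" could read a level-2 "finance" record but neither a level-3 "finance" record nor a level-1 "hr" record.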

  11. E-MSD: an integrated data resource for bioinformatics.

    PubMed

    Velankar, S; McNeil, P; Mittard-Runte, V; Suarez, A; Barrell, D; Apweiler, R; Henrick, K

    2005-01-01

    The Macromolecular Structure Database (MSD) group (http://www.ebi.ac.uk/msd/) continues to enhance the quality and consistency of macromolecular structure data in the worldwide Protein Data Bank (wwPDB) and to work towards the integration of various bioinformatics data resources. One of the major obstacles to the improved integration of structural databases such as MSD and sequence databases like UniProt is the absence of up to date and well-maintained mapping between corresponding entries. We have worked closely with the UniProt group at the EBI to clean up the taxonomy and sequence cross-reference information in the MSD and UniProt databases. This information is vital for the reliable integration of the sequence family databases such as Pfam and Interpro with the structure-oriented databases of SCOP and CATH. This information has been made available to the eFamily group (http://www.efamily.org.uk/) and now forms the basis of the regular interchange of information between the member databases (MSD, UniProt, Pfam, Interpro, SCOP and CATH). This exchange of annotation information has enriched the structural information in the MSD database with annotation from wider sequence-oriented resources. This work was carried out under the 'Structure Integration with Function, Taxonomy and Sequences (SIFTS)' initiative (http://www.ebi.ac.uk/msd-srv/docs/sifts) in the MSD group.

  12. A comparative study of six European databases of medically oriented Web resources.

    PubMed

    Abad García, Francisca; González Teruel, Aurora; Bayo Calduch, Patricia; de Ramón Frias, Rosa; Castillo Blasco, Lourdes

    2005-10-01

    The paper describes six European medically oriented databases of Web resources, pertaining to five quality-controlled subject gateways, and compares their performance. The characteristics, coverage, procedure for selecting Web resources, record structure, searching possibilities, and existence of user assistance were described for each database. Performance indicators for each database were obtained by means of searches carried out using the key words, "myocardial infarction." Most of the databases originated in the 1990s in an academic or library context and include all types of Web resources of an international nature. Five databases use Medical Subject Headings. The number of fields per record varies between three and nineteen. The language of the search interfaces is mostly English, and some of them allow searches in other languages. In some databases, the search can be extended to Pubmed. Organizing Medical Networked Information, Catalogue et Index des Sites Médicaux Francophones, and Diseases, Disorders and Related Topics produced the best results. The usefulness of these databases as quick reference resources is clear. In addition, their lack of content overlap means that, for the user, they complement each other. Their continued survival faces three challenges: the instability of the Internet, maintenance costs, and lack of use in spite of their potential usefulness.

  13. BRCA1/2 missense mutations and the value of in-silico analyses.

    PubMed

    Sadowski, Carolin E; Kohlstedt, Daniela; Meisel, Cornelia; Keller, Katja; Becker, Kerstin; Mackenroth, Luisa; Rump, Andreas; Schröck, Evelin; Wimberger, Pauline; Kast, Karin

    2017-11-01

    The clinical implications of genetic variants in BRCA1/2 in healthy and affected individuals are considerable. Variant interpretation, however, is especially challenging for missense variants. The majority of them are classified as variants of unknown clinical significance (VUS). Computational (in-silico) predictive programs are easy to access, but represent only one tool out of a wide range of complemental approaches to classify VUS. With this single-center study, we aimed to evaluate the impact of in-silico analyses in a spectrum of different BRCA1/2 missense variants. We conducted mutation analysis of BRCA1/2 in 523 index patients with suspected hereditary breast and ovarian cancer (HBOC). Classification of the genetic variants was performed according to the German Consortium (GC)-HBOC database. Additionally, all missense variants were classified by the following three in-silico prediction tools: SIFT, Mutation Taster (MT2) and PolyPhen2 (PPH2). Overall, 201 different variants, 68 of which were missense variants, were ranked as pathogenic, neutral, or unknown. The classification of missense variants by in-silico tools resulted in a higher proportion of pathogenic mutations (25% vs. 13.2%) compared to the GC-HBOC classification. Altogether, more than fifty percent (38/68, 55.9%) of missense variants were ranked differently. Sensitivity of in-silico tools for mutation prediction was 88.9% (PPH2), 100% (SIFT) and 100% (MT2). We found a relevant discrepancy in variant classification by using in-silico prediction tools, resulting in potential overestimation and/or underestimation of cancer risk. More reliable, notably gene-specific, prediction tools and functional tests are needed to improve clinical counseling. Copyright © 2017 Elsevier Masson SAS. All rights reserved.

  14. Principles and Recommendations for Standardizing the Use of the Next-Generation Sequencing Variant File in Clinical Settings.

    PubMed

    Lubin, Ira M; Aziz, Nazneen; Babb, Lawrence J; Ballinger, Dennis; Bisht, Himani; Church, Deanna M; Cordes, Shaun; Eilbeck, Karen; Hyland, Fiona; Kalman, Lisa; Landrum, Melissa; Lockhart, Edward R; Maglott, Donna; Marth, Gabor; Pfeifer, John D; Rehm, Heidi L; Roy, Somak; Tezak, Zivana; Truty, Rebecca; Ullman-Cullere, Mollie; Voelkerding, Karl V; Worthey, Elizabeth A; Zaranek, Alexander W; Zook, Justin M

    2017-05-01

    A national workgroup convened by the Centers for Disease Control and Prevention identified principles and made recommendations for standardizing the description of sequence data contained within the variant file generated during the course of clinical next-generation sequence analysis for diagnosing human heritable conditions. The specifications for variant files were initially developed to be flexible with regard to content representation to support a variety of research applications. This flexibility permits variation with regard to how sequence findings are described and this depends, in part, on the conventions used. For clinical laboratory testing, this poses a problem because these differences can compromise the capability to compare sequence findings among laboratories to confirm results and to query databases to identify clinically relevant variants. To provide for a more consistent representation of sequence findings described within variant files, the workgroup made several recommendations that considered alignment to a common reference sequence, variant caller settings, use of genomic coordinates, and gene and variant naming conventions. These recommendations were considered with regard to the existing variant file specifications presently used in the clinical setting. Adoption of these recommendations is anticipated to reduce the potential for ambiguity in describing sequence findings and facilitate the sharing of genomic data among clinical laboratories and other entities. Copyright © 2017 American Society for Investigative Pathology and the Association for Molecular Pathology. Published by Elsevier Inc. All rights reserved.

  15. Report of a Novel SHOX Missense Variant in a Boy With Short Stature and His Mother With Leri–Weill Dyschondrosteosis

    PubMed Central

    Lucchetti, Laura; Prontera, Paolo; Mencarelli, Amedea; Sallicandro, Ester; Mencarelli, Annalisa; Cofini, Marta; Leonardi, Alberto; Stangoni, Gabriela; Penta, Laura; Esposito, Susanna

    2018-01-01

    Heterozygous mutations in the SHOX gene or in the upstream and downstream enhancer elements are associated with 2–22% of cases of idiopathic short stature (OMIM #300582) and with 60% of cases of Leri–Weill dyschondrosteosis (OMIM #127300) with which female subjects are generally more severely affected. Approximately 80–90% of SHOX pathogenic variants are deletions or duplications, and the remaining 10–20% are point mutations that primarily give rise to missense variants. The clinical interpretation of novel variants, particularly missense variants, can be challenging and can remain of uncertain significance. Here, we describe a novel missense variant (c.1044 G>T, p.Arg118Met) in a Moroccan boy with a disproportionately short stature and without any radiological traits or bone deformities and in his mother, who had a disproportionately short stature and a Madelung deformity. This variant has not been reported to date in the updated SHOX allelic variant or Human Gene Mutation Databases nor is it listed as a polymorphism in the ExAC browser, dbSNP, or 1000G. This mutation was predicted to be deleterious by three different bioinformatics tools since it modifies an amino acid in a highly conserved DNA-binding domain of the SHOX protein. Based on this evidence, the patient was treated with recombinant human growth hormone. PMID:29692759

  16. Integrated sequence analysis pipeline provides one-stop solution for identifying disease-causing mutations.

    PubMed

    Hu, Hao; Wienker, Thomas F; Musante, Luciana; Kalscheuer, Vera M; Kahrizi, Kimia; Najmabadi, Hossein; Ropers, H Hilger

    2014-12-01

    Next-generation sequencing has greatly accelerated the search for disease-causing defects, but even for experts the data analysis can be a major challenge. To facilitate the data processing in a clinical setting, we have developed a novel medical resequencing analysis pipeline (MERAP). MERAP assesses the quality of sequencing, and has optimized capacity for calling variants, including single-nucleotide variants, insertions and deletions, copy-number variation, and other structural variants. MERAP identifies polymorphic and known causal variants by filtering against public domain databases, and flags nonsynonymous and splice-site changes. MERAP uses a logistic model to estimate the causal likelihood of a given missense variant. MERAP considers the relevant information such as phenotype and interaction with known disease-causing genes. MERAP compares favorably with GATK, one of the widely used tools, because of its higher sensitivity for detecting indels, its easy installation, and its economical use of computational resources. Upon testing more than 1,200 individuals with mutations in known and novel disease genes, MERAP proved highly reliable, as illustrated here for five families with disease-causing variants. We believe that the clinical implementation of MERAP will expedite the diagnostic process of many disease-causing defects. © 2014 WILEY PERIODICALS, INC.

  17. A Bioinformatics Approach to the Identification of Variants Associated with Type 1 and Type 2 Diabetes Mellitus that Reside in Functionally Validated miRNAs Binding Sites.

    PubMed

    Ghaedi, Hamid; Bastami, Milad; Jahani, Mohammad Mehdi; Alipoor, Behnam; Tabasinezhad, Maryam; Ghaderi, Omar; Nariman-Saleh-Fam, Ziba; Mirfakhraie, Reza; Movafagh, Abolfazl; Omrani, Mir Davood; Masotti, Andrea

    2016-06-01

    The present work is aimed at finding variants associated with Type 1 and Type 2 diabetes mellitus (DM) that reside in functionally validated miRNAs binding sites and that can have a functional role in determining diabetes and related pathologies. Using bioinformatics analyses we obtained a database of validated polymorphic miRNA binding sites which has been intersected with genes related to DM or to variants associated and/or in linkage disequilibrium (LD) with it and is reported in genome-wide association studies (GWAS). The workflow we followed allowed us to find variants associated with DM that also reside in functional miRNA binding sites. These data have been demonstrated to have a functional role by impairing the functions of genes implicated in biological processes linked to DM. In conclusion, our work emphasized the importance of SNPs located in miRNA binding sites. The results discussed in this work may constitute the basis of further works aimed at finding functional candidates and variants affecting protein structure and function, transcription factor binding sites, and non-coding epigenetic variants, contributing to widen the knowledge about the pathogenesis of this important disease.

  18. MaizeGDB update: New tools, data, and interface for the maize model organism database

    USDA-ARS's Scientific Manuscript database

    MaizeGDB is a highly curated, community-oriented database and informatics service to researchers focused on the crop plant and model organism Zea mays ssp. mays. Although some form of the maize community database has existed over the last 25 years, there have only been two major releases. In 1991, ...

  19. Proteomic characterization of histone variants in the mouse testis by mass spectrometry-based top-down analysis.

    PubMed

    Kwak, Ho-Geun; Dohmae, Naoshi

    2016-11-15

    Various histones, including testis-specific histones, exist during spermatogenesis and some of them have been reported to play a key role in chromatin remodeling. Mass spectrometry (MS)-based characterization has become the important step to understand histone structures. Although individual histones or partial histone variant groups have been characterized, the comprehensive analysis of histone variants has not yet been conducted in the mouse testis. Here, we present the comprehensive separation and characterization of histone variants from mouse testes by a top-down approach using MS. Histone variants were successfully separated on a reversed phase column using high performance liquid chromatography (HPLC) with an ion-pairing reagent. Increasing concentrations of testis-specific histones were observed in the mouse testis and some somatic histones increased in the epididymis. Specifically, the increase of mass abundance in H3.2 in the epididymis was inversely proportional to the decrease in H3t in the testis, which was approximately 80%. The top-down characterization of intact histone variants in the mouse testis was performed using LC-MS/MS. The masses of separated histone variants and their expected post-translation modifications were calculated by performing deconvolution with information taken from the database. TH2A, TH2B and H3t were characterized by MS/MS fragmentation. Our approach provides comprehensive knowledge for identification of histone variants in the mouse testis that will contribute to the structural and functional research of histone variants during spermatogenesis.

  20. Detailed genetic characteristics of an international large cohort of patients with Stargardt disease: ProgStar study report 8.

    PubMed

    Fujinami, Kaoru; Strauss, Rupert W; Chiang, John Pei-Wen; Audo, Isabelle S; Bernstein, Paul S; Birch, David G; Bomotti, Samantha M; Cideciyan, Artur V; Ervin, Ann-Margret; Marino, Meghan J; Sahel, José-Alain; Mohand-Said, Saddek; Sunness, Janet S; Traboulsi, Elias I; West, Sheila; Wojciechowski, Robert; Zrenner, Eberhart; Michaelides, Michel; Scholl, Hendrik P N

    2018-06-20

    To describe the genetic characteristics of the cohort enrolled in the international multicentre progression of Stargardt disease 1 (STGD1) studies (ProgStar) and to determine geographic differences based on the allele frequency. 345 participants with a clinical diagnosis of STGD1 and harbouring at least one disease-causing ABCA4 variant were enrolled from 9 centres in the USA and Europe. All variants were reviewed and in silico analysis was performed including allele frequency in public databases and pathogenicity predictions. Participants with multiple likely pathogenic variants were classified into four national subgroups (USA, UK, France, Germany), with subsequent comparison analysis of the allele frequency for each prevalent allele. 211 likely pathogenic variants were identified in the total cohort, including missense (63%), splice site alteration (18%), stop (9%) and others. 50 variants were novel. Exclusively missense variants were detected in 139 (50%) of 279 patients with multiple pathogenic variants. The three most prevalent variants of these patients with multiple pathogenic variants were p.G1961E (15%), p.G863A (7%) and c.5461-10 T>C (5%). Subgroup analysis revealed a statistically significant difference between the four recruiting nations in the allele frequency of nine variants. There is a large spectrum of ABCA4 sequence variants, including 50 novel variants, in a well-characterised cohort thereby further adding to the unique allelic heterogeneity in STGD1. Approximately half of the cohort harbours missense variants only, indicating a relatively mild phenotype of the ProgStar cohort. There are significant differences in allele frequencies between nations, although the three most prevalent variants are shared as frequent variants. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2018. All rights reserved. No commercial use is permitted unless otherwise expressly granted.

  1. The Role of IMAT Solutions for Training Development at the Royal Netherlands Air Force. IMAT Follow-up Research Part 1

    DTIC Science & Technology

    2005-09-01

    e.g. the transformation of a fragment to an instructional fragment. "* IMAT Database: A Jasmine ® database is used as central database in IMAT for the...storage of fragments. This is an object-oriented relational database. Jasmine ® was, amongst other factors, chosen for its ability to handle multimedia...to the Jasmine ® database, which is used in IMAT as central database. 3.1.1.1 Ontologies In IMAT, the proposed solution on problems with information

  2. Ellipsometry with polarisation analysis at cryogenic temperatures inside a vacuum chamber

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Bauer, S.; Grees, B.; Spitzer, D.

    2013-12-15

    In this paper we describe a new variant of null ellipsometry to determine thicknesses and optical properties of thin films on a substrate at cryogenic temperatures. In the PCSA arrangement of ellipsometry the polarizer and the compensator are placed before the substrate and the analyzer after it. Usually, in the null ellipsometry the polarizer and the analyzer are rotated to find the searched minimum in intensity. In our variant we rotate the polarizer and the compensator instead, both being placed in the incoming beam before the substrate. Therefore the polarisation analysis of the reflected beam can be realized by an analyzer at fixed orientation. We developed this method for investigations of thin cryogenic films inside a vacuum chamber where the analyzer and detector had to be placed inside the cold shield at a temperature of T≈ 90 K close to the substrate. All other optical components were installed at the incoming beam line outside the vacuum chamber, including all components which need to be rotated during the measurements. Our null ellipsometry variant has been tested with condensed krypton films on a highly oriented pyrolytic graphite substrate (HOPG) at a temperature of T≈ 25 K. We show that it is possible to determine the indices of refraction of condensed krypton and of the HOPG substrate as well as thickness of krypton films with reasonable accuracy.

  3. Text Mining Genotype-Phenotype Relationships from Biomedical Literature for Database Curation and Precision Medicine.

    PubMed

    Singhal, Ayush; Simmons, Michael; Lu, Zhiyong

    2016-11-01

    The practice of precision medicine will ultimately require databases of genes and mutations for healthcare providers to reference in order to understand the clinical implications of each patient's genetic makeup. Although the highest quality databases require manual curation, text mining tools can facilitate the curation process, increasing accuracy, coverage, and productivity. However, to date there are no available text mining tools that offer high-accuracy performance for extracting such triplets from biomedical literature. In this paper we propose a high-performance machine learning approach to automate the extraction of disease-gene-variant triplets from biomedical literature. Our approach is unique because we identify the genes and protein products associated with each mutation from not just the local text content, but from a global context as well (from the Internet and from all literature in PubMed). Our approach also incorporates protein sequence validation and disease association using a novel text-mining-based machine learning approach. We extract disease-gene-variant triplets from all abstracts in PubMed related to a set of ten important diseases (breast cancer, prostate cancer, pancreatic cancer, lung cancer, acute myeloid leukemia, Alzheimer's disease, hemochromatosis, age-related macular degeneration (AMD), diabetes mellitus, and cystic fibrosis). We then evaluate our approach in two ways: (1) a direct comparison with the state of the art using benchmark datasets; (2) a validation study comparing the results of our approach with entries in a popular human-curated database (UniProt) for each of the previously mentioned diseases. In the benchmark comparison, our full approach achieves a 28% improvement in F1-measure (from 0.62 to 0.79) over the state-of-the-art results. For the validation study with UniProt Knowledgebase (KB), we present a thorough analysis of the results and errors. 
    Across all diseases, our approach returned 272 triplets (disease-gene-variant) that overlapped with entries in UniProt and 5,384 triplets without overlap in UniProt. Analysis of the overlapping triplets and of a stratified sample of the non-overlapping triplets revealed accuracies of 93% and 80% for the respective categories (cumulative accuracy, 77%). We conclude that our process represents an important and broadly applicable improvement to the state of the art for curation of disease-gene-variant relationships.
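
    For readers unfamiliar with the metric quoted in this abstract, the F1-measure is the harmonic mean of precision and recall. The short sketch below shows the formula and a relative-improvement calculation; the function names are hypothetical, and the only figures taken from the abstract are the two F1 values (0.62 and 0.79).

```python
def f1_score(precision, recall):
    """Harmonic mean of precision and recall (0.0 if both are zero)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


def relative_improvement(baseline, new):
    """Fractional gain of `new` over `baseline`."""
    return (new - baseline) / baseline


gain = relative_improvement(0.62, 0.79)
print(f"{gain:.1%}")  # prints 27.4%
```

    With the rounded F1 values the gain works out to about 27%; the 28% stated in the abstract was presumably computed from unrounded scores.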

  4. Genetic polymorphisms associated with heart failure: A literature review.

    PubMed

    Guo, Mengqi; Guo, Guanlun; Ji, Xiaoping

    2016-02-01

    To review possible associations reported between genetic variants and the risk, therapeutic response and prognosis of heart failure. Electronic databases (PubMed, Web of Science and CNKI) were systematically searched for relevant papers, published between January 1995 and February 2015. Eighty-two articles covering 29 genes and 39 polymorphisms were identified. Genetic association studies of heart failure have been highly controversial. There may be interaction or synergism of several genetic variants that together result in the ultimate pathological phenotype for heart failure. © The Author(s) 2016.

  5. Clinical Interpretation and Implications of Whole-Genome Sequencing

    PubMed Central

    Dewey, Frederick E.; Grove, Megan E.; Pan, Cuiping; Goldstein, Benjamin A.; Bernstein, Jonathan A.; Chaib, Hassan; Merker, Jason D.; Goldfeder, Rachel L.; Enns, Gregory M.; David, Sean P.; Pakdaman, Neda; Ormond, Kelly E.; Caleshu, Colleen; Kingham, Kerry; Klein, Teri E.; Whirl-Carrillo, Michelle; Sakamoto, Kenneth; Wheeler, Matthew T.; Butte, Atul J.; Ford, James M.; Boxer, Linda; Ioannidis, John P. A.; Yeung, Alan C.; Altman, Russ B.; Assimes, Themistocles L.; Snyder, Michael; Ashley, Euan A.; Quertermous, Thomas

    2014-01-01

    IMPORTANCE Whole-genome sequencing (WGS) is increasingly applied in clinical medicine and is expected to uncover clinically significant findings regardless of sequencing indication. OBJECTIVES To examine coverage and concordance of clinically relevant genetic variation provided by WGS technologies; to quantitate inherited disease risk and pharmacogenomic findings in WGS data and resources required for their discovery and interpretation; and to evaluate clinical action prompted by WGS findings. DESIGN, SETTING, AND PARTICIPANTS An exploratory study of 12 adult participants recruited at Stanford University Medical Center who underwent WGS between November 2011 and March 2012. A multidisciplinary team reviewed all potentially reportable genetic findings. Five physicians proposed initial clinical follow-up based on the genetic findings. MAIN OUTCOMES AND MEASURES Genome coverage and sequencing platform concordance in different categories of genetic disease risk, person-hours spent curating candidate disease-risk variants, interpretation agreement between trained curators and disease genetics databases, burden of inherited disease risk and pharmacogenomic findings, and burden and interrater agreement of proposed clinical follow-up. RESULTS Depending on sequencing platform, 10% to 19% of inherited disease genes were not covered to accepted standards for single nucleotide variant discovery. Genotype concordance was high for previously described single nucleotide genetic variants (99%-100%) but low for small insertion/deletion variants (53%-59%). Curation of 90 to 127 genetic variants in each participant required a median of 54 minutes (range, 5-223 minutes) per genetic variant, resulted in moderate classification agreement between professionals (Gross κ, 0.52; 95% CI, 0.40-0.64), and reclassified 69% of genetic variants cataloged as disease causing in mutation databases to variants of uncertain or lesser significance.
    Two to 6 personal disease-risk findings were discovered in each participant, including 1 frameshift deletion in the BRCA1 gene implicated in hereditary breast and ovarian cancer. Physician review of sequencing findings prompted consideration of a median of 1 to 3 initial diagnostic tests and referrals per participant, with fair interrater agreement about the suitability of WGS findings for clinical follow-up (Fleiss κ, 0.24; P < .001). CONCLUSIONS AND RELEVANCE In this exploratory study of 12 volunteer adults, the use of WGS was associated with incomplete coverage of inherited disease genes, low reproducibility of detection of genetic variation with the highest potential clinical effects, and uncertainty about clinically reportable findings. In certain cases, WGS will identify clinically actionable genetic variants warranting early medical intervention. These issues should be considered when determining the role of WGS in clinical medicine. PMID:24618965
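
    Interrater agreement in this record is summarized with Fleiss' κ. A minimal sketch of the standard Fleiss' κ computation follows, assuming every finding is rated by the same number of raters. The rating counts are hypothetical and do not reproduce the study data.

```python
from typing import List


def fleiss_kappa(counts: List[List[int]]) -> float:
    """Fleiss' kappa for a table where counts[i][j] is the number of raters
    who assigned item i to category j. Every row must sum to the same
    number of raters (at least 2)."""
    n_items = len(counts)
    n_raters = sum(counts[0])
    if n_raters < 2 or any(sum(row) != n_raters for row in counts):
        raise ValueError("each item needs the same number of raters (>= 2)")
    n_categories = len(counts[0])

    # Observed agreement: mean over items of the fraction of agreeing rater pairs.
    per_item = [
        (sum(c * c for c in row) - n_raters) / (n_raters * (n_raters - 1))
        for row in counts
    ]
    p_observed = sum(per_item) / n_items

    # Chance agreement from the overall category proportions.
    proportions = [
        sum(row[j] for row in counts) / (n_items * n_raters)
        for j in range(n_categories)
    ]
    p_expected = sum(p * p for p in proportions)

    if p_expected == 1.0:
        return float("nan")  # all ratings in one category: kappa is undefined
    return (p_observed - p_expected) / (1.0 - p_expected)


# Hypothetical example: 5 raters judge 4 findings as
# [suitable for follow-up, not suitable for follow-up].
ratings = [[5, 0], [3, 2], [1, 4], [4, 1]]
print(f"Fleiss kappa = {fleiss_kappa(ratings):.3f}")
```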

  6. Histone Lysine Methylases and Demethylases in the Landscape of Human Developmental Disorders.

    PubMed

    Faundes, Víctor; Newman, William G; Bernardini, Laura; Canham, Natalie; Clayton-Smith, Jill; Dallapiccola, Bruno; Davies, Sally J; Demos, Michelle K; Goldman, Amy; Gill, Harinder; Horton, Rachel; Kerr, Bronwyn; Kumar, Dhavendra; Lehman, Anna; McKee, Shane; Morton, Jenny; Parker, Michael J; Rankin, Julia; Robertson, Lisa; Temple, I Karen; Banka, Siddharth

    2018-01-04

    Histone lysine methyltransferases (KMTs) and demethylases (KDMs) underpin gene regulation. Here we demonstrate that variants causing haploinsufficiency of KMTs and KDMs are frequently encountered in individuals with developmental disorders. Using a combination of human variation databases and existing animal models, we determine 22 KMTs and KDMs as additional candidates for dominantly inherited developmental disorders. We show that KMTs and KDMs that are associated with, or are candidates for, dominant developmental disorders tend to have a higher level of transcription, longer canonical transcripts, more interactors, and a higher number and more types of post-translational modifications than other KMTs and KDMs. We provide evidence to firmly associate KMT2C, ASH1L, and KMT5B haploinsufficiency with dominant developmental disorders. Whereas KMT2C or ASH1L haploinsufficiency results in a predominantly neurodevelopmental phenotype with occasional physical anomalies, KMT5B mutations cause an overgrowth syndrome with intellectual disability. We further expand the phenotypic spectrum of KMT2B-related disorders and show that some individuals can have severe developmental delay without dystonia at least until mid-childhood. Additionally, we describe a recessive histone lysine-methylation defect caused by homozygous or compound heterozygous KDM5B variants and resulting in a recognizable syndrome with developmental delay, facial dysmorphism, and camptodactyly. Collectively, these results emphasize the significance of histone lysine methylation in normal human development and the importance of this process in human developmental disorders. Our results demonstrate that systematic clinically oriented pathway-based analysis of genomic data can accelerate the discovery of rare genetic disorders. Copyright © 2017 American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.

  7. Spatially Resolved Mid-IR Spectra from Meteorites; Linking Composition, Crystallographic Orientation and Spectra on the Micro-Scale

    NASA Astrophysics Data System (ADS)

    Stephen, N. R.

    2016-08-01

    IR spectroscopy is used to infer composition of extraterrestrial bodies, comparing bulk spectra to databases of separate mineral phases. We extract spatially resolved meteorite-specific spectra from achondrites with respect to zonation and orientation.

  8. Cardiological database management system as a mediator to clinical decision support.

    PubMed

    Pappas, C; Mavromatis, A; Maglaveras, N; Tsikotis, A; Pangalos, G; Ambrosiadou, V

    1996-03-01

    An object-oriented medical database management system is presented for a typical cardiologic center, facilitating epidemiological trials. Object-oriented analysis and design were used for the system design, offering advantages for the integrity and extendibility of medical information systems. The system was developed using object-oriented design and programming methodology, the C++ language and the Borland Paradox Relational Data Base Management System on an MS-Windows NT environment. Particular attention was paid to system compatibility, portability, the ease of use, and the suitable design of the patient record so as to support the decisions of medical personnel in cardiovascular centers. The system was designed to accept complex, heterogeneous, distributed data in various formats and from different kinds of examinations such as Holter, Doppler and electrocardiography.

  9. Lack of association between the P413L variant of chromogranin B and ALS risk or age at onset: a meta-analysis.

    PubMed

    Yang, Xinglong; Li, Shimei; Xing, Dongmei; Li, Peiyun; Li, Ci; Qi, Ling; Xu, Yanming; Ren, Hui

    2018-02-01

    Amyotrophic lateral sclerosis (ALS), the most common motor neuron disease, is thought to result from interaction of genetic and environmental risk factors. Whether the potentially functional exonic P413L variant in the chromogranin B gene influences ALS risk and age at onset is controversial. We meta-analysed studies assessing the association between the P413L variant and ALS risk or age at ALS onset indexed in Web of Science, PubMed, Embase, Chinese National Knowledge Infrastructure, Wanfang, and SinoMed databases. Five case-control studies were analysed, involving 2639 patients with sporadic ALS, 201 with familial ALS and 3381 controls. No association was detected between risk of either ALS type and the CT + TT genotype or T-allele of the P413L variant. Age at ALS onset was similar between carriers and non-carriers of the T-allele. The available evidence suggests that the P413L variant of chromogranin B is not associated with ALS risk or age at ALS onset. These results should be validated in large, well-designed studies.

  10. ActiveDriverDB: human disease mutations and genome variation in post-translational modification sites of proteins

    PubMed Central

    Krassowski, Michal; Paczkowska, Marta; Cullion, Kim; Huang, Tina; Dzneladze, Irakli; Ouellette, B F Francis; Yamada, Joseph T; Fradet-Turcotte, Amelie

    2018-01-01

    Abstract Interpretation of genetic variation is needed for deciphering genotype-phenotype associations, mechanisms of inherited disease, and cancer driver mutations. Millions of single nucleotide variants (SNVs) in human genomes are known and thousands are associated with disease. An estimated 21% of disease-associated amino acid substitutions corresponding to missense SNVs are located in protein sites of post-translational modifications (PTMs), chemical modifications of amino acids that extend protein function. ActiveDriverDB is a comprehensive human proteo-genomics database that annotates disease mutations and population variants through the lens of PTMs. We integrated >385,000 published PTM sites with ∼3.6 million substitutions from The Cancer Genome Atlas (TCGA), the ClinVar database of disease genes, and human genome sequencing projects. The database includes site-specific interaction networks of proteins, upstream enzymes such as kinases, and drugs targeting these enzymes. We also predicted network-rewiring impact of mutations by analyzing gains and losses of kinase-bound sequence motifs. ActiveDriverDB provides detailed visualization, filtering, browsing and searching options for studying PTM-associated mutations. Users can upload mutation datasets interactively and use our application programming interface in pipelines. Integrative analysis of mutations and PTMs may help decipher molecular mechanisms of phenotypes and disease, as exemplified by case studies of TP53, BRCA2 and VHL. The open-source database is available at https://www.ActiveDriverDB.org. PMID:29126202

  11. TMC-SNPdb: an Indian germline variant database derived from whole exome sequences.

    PubMed

    Upadhyay, Pawan; Gardi, Nilesh; Desai, Sanket; Sahoo, Bikram; Singh, Ankita; Togar, Trupti; Iyer, Prajish; Prasad, Ratnam; Chandrani, Pratik; Gupta, Sudeep; Dutt, Amit

    2016-01-01

    Cancer is predominantly a somatic disease. A mutant allele present in a cancer cell genome is considered somatic when it is absent from the paired normal genome and from public SNP databases. However, the current build of dbSNP, the most comprehensive public SNP database, inadequately represents several non-European Caucasian populations, which limits cancer genomic analyses of data from these populations. We present the Tata Memorial Centre-SNP DataBase (TMC-SNPdb) as the first open-source, flexible, upgradable, and freely available SNP database (accessible through dbSNP build 149 and ANNOVAR), representing 114 309 unique germline variants generated from whole exome data of 62 normal samples derived from cancer patients of Indian origin. The TMC-SNPdb is presented with a companion subtraction tool that can be executed from the command line or through an easy-to-use graphical user interface, with the ability to deplete additional Indian population specific SNPs over and above dbSNP and 1000 Genomes databases. Using an institutionally generated whole exome data set of 132 samples of Indian origin, we demonstrate that TMC-SNPdb could deplete 42, 33 and 28% of false positive somatic events post dbSNP depletion in Indian origin tongue, gallbladder, and cervical cancer samples, respectively. Beyond cancer somatic analyses, we anticipate utility of the TMC-SNPdb in several Mendelian germline diseases. In addition to dbSNP build 149 and ANNOVAR, the TMC-SNPdb along with the subtraction tool is available for download in the public domain at the following: Database URL: http://www.actrec.gov.in/pi-webpages/AmitDutt/TMCSNP/TMCSNPdp.html. © The Author(s) 2016. Published by Oxford University Press.
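
    The depletion step described in this record can be pictured as a set subtraction. The sketch below is not the TMC-SNPdb subtraction tool. It assumes variants are compared by exact (chromosome, position, reference allele, alternate allele) keys, and all names and example records are hypothetical.

```python
from typing import Iterable, List, NamedTuple, Set


class Variant(NamedTuple):
    chrom: str
    pos: int
    ref: str
    alt: str


def deplete_germline(candidates: Iterable[Variant], *germline_sets: Set[Variant]) -> List[Variant]:
    """Drop candidate somatic variants that appear in any germline resource."""
    known_germline: Set[Variant] = set().union(*germline_sets)
    return [v for v in candidates if v not in known_germline]


# Hypothetical inputs: candidate somatic calls plus two germline resources.
candidates = [
    Variant("chr1", 1000, "A", "G"),
    Variant("chr2", 2000, "C", "T"),
    Variant("chr3", 3000, "G", "A"),
]
public_snps = {Variant("chr1", 1000, "A", "G")}      # stands in for a public SNP database
population_snps = {Variant("chr2", 2000, "C", "T")}  # stands in for a population-specific set

remaining = deplete_germline(candidates, public_snps, population_snps)
print(remaining)
```

    Passing the population-specific set as an extra argument mirrors the idea of depleting variants over and above the public databases.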

  12. Meta-analysis of gene-level associations for rare variants based on single-variant statistics.

    PubMed

    Hu, Yi-Juan; Berndt, Sonja I; Gustafsson, Stefan; Ganna, Andrea; Hirschhorn, Joel; North, Kari E; Ingelsson, Erik; Lin, Dan-Yu

    2013-08-08

    Meta-analysis of genome-wide association studies (GWASs) has led to the discoveries of many common variants associated with complex human diseases. There is a growing recognition that identifying "causal" rare variants also requires large-scale meta-analysis. The fact that association tests with rare variants are performed at the gene level rather than at the variant level poses unprecedented challenges in the meta-analysis. First, different studies may adopt different gene-level tests, so the results are not compatible. Second, gene-level tests require multivariate statistics (i.e., components of the test statistic and their covariance matrix), which are difficult to obtain. To overcome these challenges, we propose to perform gene-level tests for rare variants by combining the results of single-variant analysis (i.e., p values of association tests and effect estimates) from participating studies. This simple strategy is possible because of an insight that multivariate statistics can be recovered from single-variant statistics, together with the correlation matrix of the single-variant test statistics, which can be estimated from one of the participating studies or from a publicly available database. We show both theoretically and numerically that the proposed meta-analysis approach provides accurate control of the type I error and is as powerful as joint analysis of individual participant data. This approach accommodates any disease phenotype and any study design and produces all commonly used gene-level tests. An application to the GWAS summary results of the Genetic Investigation of ANthropometric Traits (GIANT) consortium reveals rare and low-frequency variants associated with human height. The relevant software is freely available. Copyright © 2013 The American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.
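
    One common way to write the reconstruction this record describes is sketched here for a weighted burden test. It assumes the score statistic of each variant can be approximated as beta/se^2 with variance 1/se^2, and that the correlation matrix of the single-variant test statistics is supplied externally. This is an illustrative approximation, not necessarily the exact formulation or software of the paper, and all names are hypothetical.

```python
import math
from typing import List, Optional, Sequence, Tuple


def burden_from_single_variant(
    betas: Sequence[float],
    ses: Sequence[float],
    corr: Sequence[Sequence[float]],
    weights: Optional[Sequence[float]] = None,
) -> Tuple[float, float]:
    """Gene-level burden statistic rebuilt from per-variant effect estimates.

    Assumed approximations: U_j = beta_j / se_j^2, Var(U_j) = 1 / se_j^2,
    Cov(U_j, U_k) = corr[j][k] / (se_j * se_k).
    Returns (chi-square statistic with 1 df, p value).
    """
    m = len(betas)
    w: List[float] = list(weights) if weights is not None else [1.0] * m

    # Weighted sum of approximate score statistics.
    weighted_score = sum(w[j] * betas[j] / ses[j] ** 2 for j in range(m))

    # Variance of the weighted sum: w' V w.
    variance = sum(
        w[j] * w[k] * corr[j][k] / (ses[j] * ses[k])
        for j in range(m)
        for k in range(m)
    )

    statistic = weighted_score ** 2 / variance
    # Survival function of a chi-square with 1 df: P(Z^2 > t) = erfc(sqrt(t / 2)).
    p_value = math.erfc(math.sqrt(statistic / 2.0))
    return statistic, p_value


# Hypothetical single-variant results for three rare variants in one gene.
betas = [0.30, 0.25, 0.10]
ses = [0.15, 0.20, 0.18]
corr = [[1.0, 0.1, 0.0], [0.1, 1.0, 0.05], [0.0, 0.05, 1.0]]
stat, p = burden_from_single_variant(betas, ses, corr)
print(f"burden chi-square = {stat:.3f}, p = {p:.4f}")
```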

  13. Spectrum of PAH gene variants among a population of Han Chinese patients with phenylketonuria from northern China.

    PubMed

    Liu, Ning; Huang, Qiuying; Li, Qingge; Zhao, Dehua; Li, Xiaole; Cui, Lixia; Bai, Ying; Feng, Yin; Kong, Xiangdong

    2017-10-05

    Phenylketonuria (PKU), which primarily results from a deficiency of phenylalanine hydroxylase (PAH), is one of the most common inherited inborn errors of metabolism that impairs postnatal cognitive development. The incidence of various PAH variations differs by race and ethnicity. The aim of the present study was to characterize the PAH gene variants of a Han population from Northern China. In total, 655 PKU patients and their families were recruited for this study; each proband was diagnosed both clinically and biochemically with phenylketonuria. Subjects were sequentially screened for single-base variants and exon deletions or duplications within PAH via direct Sanger sequencing and multiplex ligation-dependent probe amplification (MLPA). A spectrum of 174 distinct PAH variants was identified: 152 previously documented variants and 22 novel variants. While single-base variants were distributed throughout the 13 exons, they were particularly concentrated in exons 7 (33.3%), 11 (14.2%), 6 (13.2%), 12 (11.0%), 3 (10.4%), and 5 (4.4%). The predominant variant was p.Arg243Gln (17.7%), followed by Ex6-96A>G (8.3%), p.Val399= (6.4%), p.Arg53His (4.7%), p.Tyr356* (4.7%), p.Arg241Cys (4.6%), p.Arg413Pro (4.6%), p.Arg111* (4.4%), and c.442-1G>A (3.4%). Notably, two patients were also identified as carrying de novo variants. The composition of PAH gene variants in this Han population from Northern China was distinct from those of other ethnic groups. As such, the construction of a PAH gene variant database for Northern China is necessary to lay a foundation for genetic-based diagnoses, prenatal diagnoses, and population screening.

  14. Gamma-aminobutyric acid A receptor, α-2 (GABRA2) variants as individual markers for alcoholism: a meta-analysis.

    PubMed

    Zintzaras, Elias

    2012-08-01

    The genetic association studies (GAS) published to date on the association between variants in the GABRA2 gene and alcoholism have produced inconclusive results. To interpret these results, a meticulous meta-analysis of all available studies was carried out. The PubMed database and the HuGE Navigator were searched for published GAS relating variants in the GABRA2 gene to susceptibility to alcoholism. Then, the GAS were synthesized to decrease the uncertainty of estimated genetic risk effects. The risk effects were estimated on the basis of the odds ratio (OR) of the allele contrast and the generalized odds ratio (OR(G)), a model-free approach. Cumulative and recursive cumulative meta-analyses (CMA) were also carried out to investigate the trend and stability of effect sizes as evidence accumulates. Fourteen variants investigated in eight studies were analyzed. Significant associations were derived for four variants either for the allele contrast or for the OR(G). In particular, the variants rs279858 and rs279845 showed marginal significance for OR(G): OR(G)=1.27 (1.01-1.60) and OR(G)=1.49 (1.02-2.19), respectively. Also, the variants rs567926 and rs279844 showed significance for the allele contrast: OR=1.24 (1.06-1.46) and OR=1.23 (1.08-1.43), respectively; the OR(G) produced similar results. The variant rs279858 produced a large heterogeneity between studies. CMA showed a trend of an association only for the variant rs567926. Recursive CMA indicated that more evidence is needed to conclude on the status of significance of all variants. There is evidence that variants in the GABRA2 gene are associated with alcoholism. However, the present findings should be interpreted with caution.
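
    A minimal sketch of the allele-contrast odds ratio and of a cumulative meta-analysis is given here. The abstract does not state the pooling model, so fixed-effect inverse-variance pooling on the log-OR scale is an assumption, as are the function names and the allele counts, which are hypothetical.

```python
import math
from typing import List, Sequence, Tuple


def allele_contrast_log_or(case_alt: int, case_ref: int, ctrl_alt: int, ctrl_ref: int) -> Tuple[float, float]:
    """Log odds ratio and its standard error from a 2x2 table of allele counts."""
    log_or = math.log((case_alt * ctrl_ref) / (case_ref * ctrl_alt))
    se = math.sqrt(1 / case_alt + 1 / case_ref + 1 / ctrl_alt + 1 / ctrl_ref)
    return log_or, se


def pool_fixed_effect(estimates: Sequence[Tuple[float, float]]) -> Tuple[float, float]:
    """Inverse-variance pooled log-OR and standard error (fixed-effect assumption)."""
    weights = [1.0 / se ** 2 for _, se in estimates]
    pooled = sum(w * log_or for w, (log_or, _) in zip(weights, estimates)) / sum(weights)
    return pooled, math.sqrt(1.0 / sum(weights))


def cumulative_meta_analysis(estimates: Sequence[Tuple[float, float]]) -> List[Tuple[float, float, float]]:
    """Pooled OR with 95% CI after each successive study, in the order given."""
    results = []
    for k in range(1, len(estimates) + 1):
        log_or, se = pool_fixed_effect(estimates[:k])
        results.append((
            math.exp(log_or),
            math.exp(log_or - 1.96 * se),
            math.exp(log_or + 1.96 * se),
        ))
    return results


# Hypothetical allele counts per study, ordered by publication year:
# (case_alt, case_ref, ctrl_alt, ctrl_ref)
studies = [(120, 280, 90, 310), (200, 400, 180, 420), (75, 225, 70, 230)]
estimates = [allele_contrast_log_or(*s) for s in studies]
for odds_ratio, lower, upper in cumulative_meta_analysis(estimates):
    print(f"OR={odds_ratio:.2f} ({lower:.2f}-{upper:.2f})")
```

    Watching how the pooled estimate and its interval move as each study is added is the idea behind checking the trend and stability of effect sizes.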

  15. Texture formation in FePt thin films via thermal stress management

    NASA Astrophysics Data System (ADS)

    Rasmussen, P.; Rui, X.; Shield, J. E.

    2005-05-01

    The transformation variant of the fcc to fct transformation in FePt thin films was tailored by controlling the stresses in the thin films, thereby allowing selection of in- or out-of-plane c-axis orientation. FePt thin films were deposited at ambient temperature on several substrates with differing coefficients of thermal expansion relative to the FePt, which generated thermal stresses during the ordering heat treatment. X-ray diffraction analysis revealed preferential out-of-plane c-axis orientation for FePt films deposited on substrates with a similar coefficient of thermal expansion, and random orientation for FePt films deposited on substrates with a very low coefficient of thermal expansion, which is consistent with theoretical analysis when considering residual stresses.

  16. An integrated database-pipeline system for studying single nucleotide polymorphisms and diseases.

    PubMed

    Yang, Jin Ok; Hwang, Sohyun; Oh, Jeongsu; Bhak, Jong; Sohn, Tae-Kwon

    2008-12-12

    Studies on the relationship between disease and genetic variations such as single nucleotide polymorphisms (SNPs) are important. Genetic variations can cause disease by influencing important biological regulation processes. Despite the need to analyze correlations between SNPs and disease, most existing databases provide information only on functional variants at specific locations on the genome, or deal with only a few genes associated with disease. There is no combined resource to widely support gene-, SNP-, and disease-related information, and to capture relationships among such data. Therefore, we developed an integrated database-pipeline system for studying SNPs and diseases. To implement the pipeline system for the integrated database, we first unified complicated and redundant disease terms and gene names using the Unified Medical Language System (UMLS) for classification and noun modification, and the HUGO Gene Nomenclature Committee (HGNC) and NCBI gene databases. Next, we collected and integrated representative databases for three categories of information. For genes and proteins, we examined the NCBI mRNA, UniProt, UCSC Table Track and MitoDat databases. For genetic variants we used the dbSNP, JSNP, ALFRED, and HGVbase databases. For disease, we employed OMIM, GAD, and HGMD databases. The database-pipeline system provides a disease thesaurus, including genes and SNPs associated with disease. The search results for these categories are available on the web page http://diseasome.kobic.re.kr/, and a genome browser is also available to highlight findings, as well as to permit the convenient review of potentially deleterious SNPs among genes strongly associated with specific diseases and clinical phenotypes. Our system is designed to capture the relationships between SNPs associated with disease and disease-causing genes.
    The integrated database-pipeline provides a list of candidate genes and SNP markers for evaluation in both epidemiological and molecular biological approaches to disease-gene association studies. Furthermore, researchers can then semi-automatically select the data set for association studies while considering the relationships between genetic variation and diseases. The database can also make disease-association studies more economical and facilitate an understanding of the processes that cause disease. Currently, the database contains 14,674 SNP records and 109,715 gene records associated with human diseases and it is updated at regular intervals.

  17. MODELING DISPERSANT INTERACTIONS WITH OIL SPILLS

    EPA Science Inventory

    EPA is developing a model called the EPA Research Object-Oriented Oil Spill Model (ERO3S) and associated databases to simulate the impacts of dispersants on oil slicks. Because there are features of oil slicks that align naturally with major concepts of object-oriented programmi...

  18. Orienteering: An Annotated Bibliography = Orientierungslauf: Eine kommentierte Bibliographie.

    ERIC Educational Resources Information Center

    Seiler, Roland, Ed.; Hartmann, Wolfgang, Ed.

    1994-01-01

    Annotated bibliography of 220 books, monographs, and journal articles on orienteering published 1984-94, from SPOLIT database of the Federal Institute of Sport Science (Cologne, Germany). Annotations in English or German. Ten sections including psychological, physiological, health, sociological, and environmental aspects; training and coaching;…

  19. Query by forms: User-oriented relational database retrieving system and its application in analysis of experiment data

    NASA Astrophysics Data System (ADS)

    Skotniczny, Zbigniew

    1989-12-01

    The Query by Forms (QbF) system is a user-oriented interactive tool for querying large relational databases with minimal query definition cost. The system was worked out under the assumption that the user's time and effort for defining the needed queries is the most severe bottleneck. The system may be applied in any Rdb/VMS database system and is recommended for specific information systems of any project where end-user queries cannot be foreseen. The tool is dedicated to specialists in an application domain who have to analyze data maintained in a database from any needed point of view and who do not need to know commercial database languages. The paper presents the system developed as a compromise between its functionality and usability. User-system communication via a menu-driven "tree-like" structure of screen-forms, which produces a query definition and execution, is discussed in detail. Output of query results (printed reports and graphics) is also discussed. Finally, the paper shows one application of QbF to a HERA-project.

  20. Databases in the Area of Pharmacogenetics

    PubMed Central

    Sim, Sarah C.; Altman, Russ B.; Ingelman-Sundberg, Magnus

    2012-01-01

    In the area of pharmacogenetics and personalized health care it is obvious that databases, providing important information of the occurrence and consequences of variant genes encoding drug metabolizing enzymes, drug transporters, drug targets, and other proteins of importance for drug response or toxicity, are of critical value for scientists, physicians, and industry. The primary outcome of the pharmacogenomic field is the identification of biomarkers that can predict drug toxicity and drug response, thereby individualizing and improving drug treatment of patients. The drug in question and the polymorphic gene exerting the impact are the main issues to be searched for in the databases. Here, we review the databases that provide useful information in this respect, of benefit for the development of the pharmacogenomic field. PMID:21309040

  1. Group-based variant calling leveraging next-generation supercomputing for large-scale whole-genome sequencing studies.

    PubMed

    Standish, Kristopher A; Carland, Tristan M; Lockwood, Glenn K; Pfeiffer, Wayne; Tatineni, Mahidhar; Huang, C Chris; Lamberth, Sarah; Cherkas, Yauheniya; Brodmerkel, Carrie; Jaeger, Ed; Smith, Lance; Rajagopal, Gunaretnam; Curran, Mark E; Schork, Nicholas J

    2015-09-22

    Next-generation sequencing (NGS) technologies have become much more efficient, allowing whole human genomes to be sequenced faster and cheaper than ever before. However, processing the raw sequence reads associated with NGS technologies requires care and sophistication in order to draw compelling inferences about phenotypic consequences of variation in human genomes. It has been shown that different approaches to variant calling from NGS data can lead to different conclusions. Ensuring appropriate accuracy and quality in variant calling can come at a computational cost. We describe our experience implementing and evaluating a group-based approach to calling variants on large numbers of whole human genomes. We explore the influence of many factors that may impact the accuracy and efficiency of group-based variant calling, including group size, the biogeographical backgrounds of the individuals who have been sequenced, and the computing environment used. We make efficient use of the Gordon supercomputer cluster at the San Diego Supercomputer Center by incorporating job-packing and parallelization considerations into our workflow while calling variants on 437 whole human genomes generated as part of a large association study. We ultimately find that our workflow resulted in high-quality variant calls in a computationally efficient manner. We argue that studies like ours should motivate further investigations combining hardware-oriented advances in computing systems with algorithmic developments to tackle emerging 'big data' problems in biomedical research brought on by the expansion of NGS technologies.

  2. Origin of steps in magnetization loops of martensitic Ni-Mn-Ga films on MgO(001)

    NASA Astrophysics Data System (ADS)

    Laptev, Aleksej; Lebecki, Kristof; Welker, Gesa; Luo, Yuansu; Samwer, Konrad; Fonin, Mikhail

    2016-09-01

    We study the temperature dependent magnetization properties of (010)-oriented Ni-Mn-Ga epitaxial films on MgO(001) substrates. In the martensitic phase, we observe pronounced abrupt slope changes in the magnetization loops for all studied samples. Our experimental findings are discussed in conjunction with the micromagnetic simulations, revealing that the characteristic magnetization behavior is governed solely by the magnetization switching within the specific martensitic variant pattern, and no reorientation of twin variants is involved in the process. Our study emphasizes the important role of the magnetostatic interactions in the magnetization behavior of magnetic shape memory alloy thin films.

  3. E-MSD: an integrated data resource for bioinformatics

    PubMed Central

    Velankar, S.; McNeil, P.; Mittard-Runte, V.; Suarez, A.; Barrell, D.; Apweiler, R.; Henrick, K.

    2005-01-01

    The Macromolecular Structure Database (MSD) group (http://www.ebi.ac.uk/msd/) continues to enhance the quality and consistency of macromolecular structure data in the worldwide Protein Data Bank (wwPDB) and to work towards the integration of various bioinformatics data resources. One of the major obstacles to the improved integration of structural databases such as MSD and sequence databases like UniProt is the absence of up to date and well-maintained mapping between corresponding entries. We have worked closely with the UniProt group at the EBI to clean up the taxonomy and sequence cross-reference information in the MSD and UniProt databases. This information is vital for the reliable integration of the sequence family databases such as Pfam and Interpro with the structure-oriented databases of SCOP and CATH. This information has been made available to the eFamily group (http://www.efamily.org.uk/) and now forms the basis of the regular interchange of information between the member databases (MSD, UniProt, Pfam, Interpro, SCOP and CATH). This exchange of annotation information has enriched the structural information in the MSD database with annotation from wider sequence-oriented resources. This work was carried out under the ‘Structure Integration with Function, Taxonomy and Sequences (SIFTS)’ initiative (http://www.ebi.ac.uk/msd-srv/docs/sifts) in the MSD group. PMID:15608192

  4. Comprehensive splicing functional analysis of DNA variants of the BRCA2 gene by hybrid minigenes

    PubMed Central

    2012-01-01

    Introduction The underlying pathogenic mechanism of a large fraction of DNA variants of disease-causing genes is the disruption of the splicing process. We aimed to investigate the effect on splicing of the BRCA2 variants c.8488-1G>A (exon 20) and c.9026_9030del (exon 23), as well as 41 BRCA2 variants reported in the Breast Cancer Information Core (BIC) mutation database. Methods DNA variants were analyzed with the splicing prediction programs NNSPLICE and Human Splicing Finder. Functional analyses of candidate variants were performed by lymphocyte RT-PCR and/or hybrid minigene assays. Forty-one BIC variants of exons 19, 20, 23 and 24 were bioinformatically selected and generated by PCR-mutagenesis of the wild type minigenes. Results Lymphocyte RT-PCR of c.8488-1G>A showed intron 19 retention and a 12-nucleotide deletion in exon 20, whereas c.9026_9030del did not show any splicing anomaly. Minigene analysis of c.8488-1G>A displayed the aforementioned aberrant isoforms but also exon 20 skipping. We further evaluated the splicing outcomes of 41 variants of four BRCA2 exons by minigene analysis. Eighteen variants presented splicing aberrations. Most variants (78.9%) disrupted the natural splice sites, whereas four altered putative enhancers/silencers and had a weak effect. Fluorescent RT-PCR of minigenes accurately detected 14 RNA isoforms generated by cryptic site usage, exon skipping and intron retention events. Fourteen variants showed total splicing disruptions and were predicted to truncate or eliminate essential domains of BRCA2. Conclusions A relevant proportion of BRCA2 variants are correlated with splicing disruptions, indicating that RNA analysis is a valuable tool to assess the pathogenicity of a particular DNA change. The minigene system is a straightforward and robust approach to detect variants with an impact on splicing and contributes to a better knowledge of this gene expression step. PMID:22632462

  5. The Variant p.(Arg183Trp) in SPTLC2 Causes Late-Onset Hereditary Sensory Neuropathy.

    PubMed

    Suriyanarayanan, Saranya; Auranen, Mari; Toppila, Jussi; Paetau, Anders; Shcherbii, Maria; Palin, Eino; Wei, Yu; Lohioja, Tarja; Schlotter-Weigel, Beate; Schön, Ulrike; Abicht, Angela; Rautenstrauss, Bernd; Tyynismaa, Henna; Walter, Maggie C; Hornemann, Thorsten; Ylikallio, Emil

    2016-03-01

    Hereditary sensory and autonomic neuropathy 1 (HSAN1) is an autosomal dominant disorder that can be caused by variants in SPTLC1 or SPTLC2, encoding subunits of serine palmitoyl-CoA transferase. Disease variants alter the enzyme's substrate specificity and lead to accumulation of neurotoxic 1-deoxysphingolipids. We describe two families with autosomal dominant HSAN1C caused by a new variant in SPTLC2, c.547C>T, p.(Arg183Trp). The variant changed a conserved amino acid and was not found in public variant databases. All patients had a relatively mild progressive distal sensory impairment, with onset after age 50. Small fibers were affected early, leading to abnormalities on quantitative sensory testing. Sural biopsy revealed a severe chronic axonal neuropathy with subtotal loss of myelinated axons, relatively preserved number of non-myelinated fibers and no signs of regeneration. Skin biopsy with PGP9.5 labeling showed lack of intraepidermal nerve endings early in the disease. Motor manifestations developed later in the disease course, but there was no evidence of autonomic involvement. Patients had elevated serum 1-deoxysphingolipids, and the variant protein produced elevated amounts of 1-deoxysphingolipids in vitro, which proved the pathogenicity of the variant. Our results expand the genetic spectrum of HSAN1C and provide further detail about the clinical characteristics. Sequencing of SPTLC2 should be considered in all patients presenting with mild late-onset sensory-predominant small or large fiber neuropathy.

  6. UniPROBE, update 2011: expanded content and search tools in the online database of protein-binding microarray data on protein-DNA interactions.

    PubMed

    Robasky, Kimberly; Bulyk, Martha L

    2011-01-01

    The Universal PBM Resource for Oligonucleotide-Binding Evaluation (UniPROBE) database is a centralized repository of information on the DNA-binding preferences of proteins as determined by universal protein-binding microarray (PBM) technology. Each entry for a protein (or protein complex) in UniPROBE provides the quantitative preferences for all possible nucleotide sequence variants ('words') of length k ('k-mers'), as well as position weight matrix (PWM) and graphical sequence logo representations of the k-mer data. In this update, we describe >130% expansion of the database content, incorporation of a protein BLAST (blastp) tool for finding protein sequence matches in UniPROBE, the introduction of UniPROBE accession numbers and additional database enhancements. The UniPROBE database is available at http://uniprobe.org.

  7. Rapid storage and retrieval of genomic intervals from a relational database system using nested containment lists

    PubMed Central

    Wiley, Laura K.; Sivley, R. Michael; Bush, William S.

    2013-01-01

    Efficient storage and retrieval of genomic annotations based on range intervals is necessary, given the amount of data produced by next-generation sequencing studies. The indexing strategies of relational database systems (such as MySQL) greatly inhibit their use in genomic annotation tasks. This has led to the development of stand-alone applications that are dependent on flat-file libraries. In this work, we introduce MyNCList, an implementation of the NCList data structure within a MySQL database. MyNCList enables the storage, update and rapid retrieval of genomic annotations from the convenience of a relational database system. Range-based annotations of 1 million variants are retrieved in under a minute, making this approach feasible for whole-genome annotation tasks. Database URL: https://github.com/bushlab/mynclist PMID:23894185
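The nested containment list idea behind this record can be illustrated with a small in-memory sketch. This is not the MyNCList code, which lives inside MySQL; the function names `build_nclist` and `query_nclist` and the half-open interval convention are assumptions made for illustration only.

```python
def build_nclist(intervals):
    """Build a nested containment list from half-open (start, end) intervals.

    Hypothetical helper for illustration; not taken from MyNCList.
    """
    # Sort by start ascending, then end descending, so that a containing
    # interval always comes before the intervals it contains.
    ordered = sorted(intervals, key=lambda iv: (iv[0], -iv[1]))
    top = []    # top-level nodes; each node is (start, end, children)
    stack = []  # chain of nested nodes that may contain the next interval
    for start, end in ordered:
        node = (start, end, [])
        # Starts are non-decreasing, so containment only depends on the end.
        while stack and end > stack[-1][1]:
            stack.pop()
        (stack[-1][2] if stack else top).append(node)
        stack.append(node)
    return top


def query_nclist(nodes, qstart, qend):
    """Return every stored interval that overlaps [qstart, qend)."""
    # Within one sibling list no interval contains another, so ends are
    # strictly increasing and a binary search finds the first candidate.
    lo, hi = 0, len(nodes)
    while lo < hi:
        mid = (lo + hi) // 2
        if nodes[mid][1] <= qstart:
            lo = mid + 1
        else:
            hi = mid
    hits = []
    while lo < len(nodes) and nodes[lo][0] < qend:
        start, end, children = nodes[lo]
        hits.append((start, end))
        # Children lie inside their parent, so they are only worth
        # checking when the parent itself overlaps the query.
        hits.extend(query_nclist(children, qstart, qend))
        lo += 1
    return hits


# Toy annotation intervals (made-up coordinates).
annotations = [(1, 10), (2, 5), (3, 4), (6, 8), (12, 20), (15, 18)]
nclist = build_nclist(annotations)
overlapping = query_nclist(nclist, 4, 7)
```

The sketch keeps the structure in Python lists; a relational implementation would instead have to store the parent-child nesting in tables, which is the part the record attributes to MyNCList.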

  8. Rapid storage and retrieval of genomic intervals from a relational database system using nested containment lists.

    PubMed

    Wiley, Laura K; Sivley, R Michael; Bush, William S

    2013-01-01

    Efficient storage and retrieval of genomic annotations based on range intervals is necessary, given the amount of data produced by next-generation sequencing studies. The indexing strategies of relational database systems (such as MySQL) greatly inhibit their use in genomic annotation tasks. This has led to the development of stand-alone applications that are dependent on flat-file libraries. In this work, we introduce MyNCList, an implementation of the NCList data structure within a MySQL database. MyNCList enables the storage, update and rapid retrieval of genomic annotations from the convenience of a relational database system. Range-based annotations of 1 million variants are retrieved in under a minute, making this approach feasible for whole-genome annotation tasks. Database URL: https://github.com/bushlab/mynclist.

  9. Coffin-Siris syndrome and the BAF complex: genotype-phenotype study in 63 patients.

    PubMed

    Santen, Gijs W E; Aten, Emmelien; Vulto-van Silfhout, Anneke T; Pottinger, Caroline; van Bon, Bregje W M; van Minderhout, Ivonne J H M; Snowdowne, Ronelle; van der Lans, Christian A C; Boogaard, Merel; Linssen, Margot M L; Vijfhuizen, Linda; van der Wielen, Michiel J R; Vollebregt, M J Ellen; Breuning, Martijn H; Kriek, Marjolein; van Haeringen, Arie; den Dunnen, Johan T; Hoischen, Alexander; Clayton-Smith, Jill; de Vries, Bert B A; Hennekam, Raoul C M; van Belzen, Martine J

    2013-11-01

    De novo germline variants in several components of the SWI/SNF-like BAF complex can cause Coffin-Siris syndrome (CSS), Nicolaides-Baraitser syndrome (NCBRS), and nonsyndromic intellectual disability. We screened 63 patients with a clinical diagnosis of CSS for these genes (ARID1A, ARID1B, SMARCA2, SMARCA4, SMARCB1, and SMARCE1) and identified pathogenic variants in 45 (71%) patients. We found a high proportion of variants in ARID1B (68%). All four pathogenic variants in ARID1A appeared to be mosaic. By using all variants from the Exome Variant Server as test data, we were able to classify variants in ARID1A, ARID1B, and SMARCB1 reliably as being pathogenic or nonpathogenic. For SMARCA2, SMARCA4, and SMARCE1 several variants in the EVS remained unclassified, underlining the importance of parental testing. We have entered all variant and clinical information in LOVD-powered databases to facilitate further genotype-phenotype correlations, as these will become increasingly important because of the uptake of targeted and untargeted next generation sequencing in diagnostics. The emerging phenotype-genotype correlation is that SMARCB1 patients have the most marked physical phenotype and severe cognitive and growth delay. The variability in phenotype seems most marked in ARID1A and ARID1B patients. Distal limb anomalies are most marked in ARID1A patients and least in SMARCB1 patients. Numbers are small, however, and larger series are needed to confirm this correlation. © 2013 WILEY PERIODICALS, INC.

  10. A comparison of cataloged variation between International HapMap Consortium and 1000 Genomes Project data.

    PubMed

    Buchanan, Carrie C; Torstenson, Eric S; Bush, William S; Ritchie, Marylyn D

    2012-01-01

    Since publication of the human genome in 2003, geneticists have been interested in risk variant associations to resolve the etiology of traits and complex diseases. The International HapMap Consortium undertook an effort to catalog all common variation across the genome (variants with a minor allele frequency (MAF) of at least 5% in one or more ethnic groups). HapMap along with advances in genotyping technology led to genome-wide association studies which have identified common variants associated with many traits and diseases. In 2008 the 1000 Genomes Project aimed to sequence 2500 individuals and identify rare variants and 99% of variants with a MAF of <1%. To determine whether the 1000 Genomes Project includes all the variants in HapMap, we examined the overlap between single nucleotide polymorphisms (SNPs) genotyped in the two resources using merged phase II/III HapMap data and low coverage pilot data from 1000 Genomes. Comparison of the two data sets showed that approximately 72% of HapMap SNPs were also found in 1000 Genomes Project pilot data. After filtering out HapMap variants with a MAF of <5% (separately for each population), 99% of HapMap SNPs were found in 1000 Genomes data. Not all variants cataloged in HapMap are also cataloged in 1000 Genomes. This could affect decisions about which resource to use for SNP queries, rare variant validation, or imputation. Both the HapMap and 1000 Genomes Project databases are useful resources for human genetics, but it is important to understand the assumptions made and filtering strategies employed by these projects.
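The overlap comparison described in this record comes down to a set intersection, optionally restricted by a minor allele frequency (MAF) threshold. The sketch below uses made-up SNP identifiers and frequencies; the function name `overlap_fraction` and the toy data are assumptions for illustration, not the authors' pipeline or data.

```python
def overlap_fraction(reference_maf, other_snps, min_maf=0.0):
    """Fraction of reference SNPs (with MAF >= min_maf) also present in other_snps.

    reference_maf: dict mapping SNP id -> minor allele frequency
    other_snps:    set of SNP ids catalogued in the second resource
    """
    kept = {snp for snp, maf in reference_maf.items() if maf >= min_maf}
    if not kept:
        return 0.0
    return len(kept & other_snps) / len(kept)


# Toy data only: identifiers and frequencies are invented.
hapmap_like = {"rs1": 0.30, "rs2": 0.02, "rs3": 0.12, "rs4": 0.01, "rs5": 0.45}
kg_like = {"rs1", "rs3", "rs5", "rs9"}

all_overlap = overlap_fraction(hapmap_like, kg_like)
common_overlap = overlap_fraction(hapmap_like, kg_like, min_maf=0.05)
```

In this toy case the overlap rises once rare variants are filtered out, which mirrors the direction of the effect reported in the abstract; the actual study applied the MAF filter separately for each population.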

  11. Analysis of RNA-Seq datasets reveals enrichment of tissue-specific splice variants for nuclear envelope proteins.

    PubMed

    Capitanchik, Charlotte; Dixon, Charles; Swanson, Selene K; Florens, Laurence; Kerr, Alastair R W; Schirmer, Eric C

    2018-06-18

    Nuclear envelopathies/laminopathies yield tissue-specific pathologies, yet arise from mutation of ubiquitously-expressed genes. One possible explanation of this tissue specificity is that tissue-specific partners become disrupted from larger complexes, but a little investigated alternate hypothesis is that the mutated proteins themselves have tissue-specific splice variants. Here, we analyze RNA-Seq datasets to identify muscle-specific splice variants of nuclear envelope genes that could be relevant to the study of laminopathies, particularly muscular dystrophies, that are not currently annotated in sequence databases. Notably, we found novel isoforms or tissue-specificity of isoforms for: Lap2, linked to cardiomyopathy; Nesprin 2, linked to Emery-Dreifuss muscular dystrophy and Lmo7, a regulator of the emerin gene that is linked to Emery-Dreifuss muscular dystrophy. Interestingly, the muscle-specific exon in Lmo7 is rich in serine phosphorylation motifs, suggesting an important regulatory function. Evidence for muscle-specific splice variants in non-nuclear envelope proteins linked to other muscular dystrophies was also found. Tissue-specific variants were also indicated for several nucleoporins including Nup54, Nup133, Nup153 and Nup358/RanBP2. We confirmed expression of novel Lmo7 and RanBP2 variants with RT-PCR and found that specific knockdown of the Lmo7 variant caused a reduction in myogenic index during mouse C2C12 myogenesis. Global analysis revealed an enrichment of tissue-specific splice variants for nuclear envelope proteins in general compared to the rest of the genome, suggesting that splice variants contribute to regulating its tissue-specific functions.

  12. Efficient Privacy-Enhancing Techniques for Medical Databases

    NASA Astrophysics Data System (ADS)

    Schartner, Peter; Schaffer, Martin

    In this paper, we introduce an alternative to using linkable unique health identifiers: locally generated system-wide unique digital pseudonyms. The presented techniques are based on a novel technique called collision-free number generation, which is discussed in the introductory part of the article. Afterwards, attention is paid to two specific variants of collision-free number generation: one based on the RSA Problem and the other based on the Elliptic Curve Discrete Logarithm Problem. Finally, two applications are sketched: centralized medical records and anonymous medical databases.

  13. Type of iconicity matters in the vocabulary development of signing children.

    PubMed

    Ortega, Gerardo; Sümer, Beyza; Özyürek, Aslı

    2017-01-01

    Recent research on signed as well as spoken language shows that the iconic features of the target language might play a role in language development. Here, we ask further whether different types of iconic depictions modulate children's preferences for certain types of sign-referent links during vocabulary development in sign language. Results from a picture description task indicate that lexical signs with 2 possible variants are used in different proportions by deaf signers from different age groups. While preschool and school-age children favored variants representing actions associated with their referent (e.g., a writing hand for the sign PEN), adults preferred variants representing the perceptual features of those objects (e.g., upward index finger representing a thin, elongated object for the sign PEN). Deaf parents interacting with their children, however, used action- and perceptual-based variants in equal proportion and favored action variants more than adults signing to other adults. We propose that when children are confronted with 2 variants for the same concept, they initially prefer action-based variants because they give them the opportunity to link a linguistic label to familiar schemas linked to their action/motor experiences. Our results echo findings showing a bias for action-based depictions in the development of iconic co-speech gestures suggesting a modality bias for such representations during development. (PsycINFO Database Record (c) 2016 APA, all rights reserved).

  14. Data structures and organisation: Special problems in scientific applications

    NASA Astrophysics Data System (ADS)

    Read, Brian J.

    1989-12-01

    In this paper we discuss and offer answers to the following questions: What, really, are the benefits of databases in physics? Are scientific databases essentially different from conventional ones? What are the drawbacks of a commercial database management system for use with scientific data? Do they outweigh the advantages? Do database systems have adequate graphics facilities, or is a separate graphics package necessary? SQL as a standard language has deficiencies, but what are they for scientific data in particular? Indeed, is the relational model appropriate anyway? Or should we turn to object-oriented databases?

  15. Orientation and mobility training for partially-sighted older adults using an identification cane: a systematic review

    PubMed Central

    Ballemans, Judith; Kempen, Gertrudis IJM; Zijlstra, GA Rixt

    2011-01-01

    Objective: This study aimed to provide an overview of the development, content, feasibility, and effectiveness of existing orientation and mobility training programmes in the use of the identification cane. Data sources: A systematic bibliographic database search in PubMed, PsychInfo, ERIC, CINAHL and the Cochrane Library was performed, in combination with expert consultation (n = 42; orientation and mobility experts) and hand-searching of reference lists. Review methods: Selection criteria included a description of the development, the content, the feasibility, or the effectiveness of orientation and mobility training in the use of the identification cane. Two reviewers independently agreed on eligibility and methodological quality. A narrative/qualitative data analysis method was applied to extract data from obtained documents. Results: The sensitive database search and hand-searching of reference lists revealed 248 potentially relevant abstracts. None met the eligibility criteria. Expert consultation resulted in the inclusion of six documents in which the information presented on the orientation and mobility training in the use of the identification cane was incomplete and of low methodological quality. Conclusion: Our review of the literature showed a lack of well-described protocols and studies on orientation and mobility training in identification cane use. PMID:21795405

  16. Mandatory pooling as a supplement to risk-adjusted capitation payments in a competitive health insurance market.

    PubMed

    Van Barneveld, E M; Lamers, L M; van Vliet, R C; van de Ven, W P

    1998-07-01

    Risk-adjusted capitation payments (RACPs) to competing health insurers are an essential element of market-oriented health care reforms in many countries. RACPs based on demographic variables only are insufficient, because they leave ample room for cream skimming. However, the implementation of improved RACPs does not appear to be straightforward. A solution might be to supplement imperfect RACPs with a form of mandatory pooling that reduces the incentives for cream skimming. In a previous paper it was concluded that high-risk pooling (HRP) is a promising supplement to RACPs. The purpose of this paper is to compare HRP with two other main variants of mandatory pooling. These variants are called excess-of-loss (EOL) and proportional pooling (PP). Each variant includes ex post compensations to insurers for some members which depend to various degrees on actually incurred costs. Therefore, these pooling variants reduce the incentives for cream skimming which are inherent in imperfect RACPs, but they also reduce the incentives for efficiency and cost containment. As a rough measure of the latter incentives we use the percentage of total costs for which an insurer is at risk. This paper analyzes which of the three main pooling variants yields the greatest reduction of incentives for cream skimming given such a percentage. The results show that HRP is the most effective of the three pooling variants.

  17. Global Soil Respiration: Interaction with Environmental Variables and Response to Climate Change

    NASA Astrophysics Data System (ADS)

    Jian, J.; Steele, M.

    2016-12-01

    Background, methods, objectives: Terrestrial ecosystems take up around 1.7 Pg C per year; however, the role of terrestrial ecosystems as a carbon sink may change to carbon source by 2050, as a result of positive feedback of soil respiration response to global warming. Nevertheless, limited evidence shows that soil carbon is decreasing and the role of terrestrial ecosystems is changing under warming. One possibility is the positive feedback may slow due to the acclimation of soil respiration as a result of decreasing temperature sensitivity (Q10) with warming. To verify and quantify the uncertainty in soil carbon cycling and feedbacks to climate change, we assembled soil respiration observations from 1961 to 2014 from 724 publications into a monthly global soil respiration database (MSRDB), which included 13482 soil respiration measurements together with 38 other ancillary measurements from 538 sites. Using this database we examined macroscale variation in the relationship between soil respiration and air temperature, precipitation, leaf area index and soil properties. We also quantified global soil respiration, the sources of uncertainty, and its feedback to warming based on climate region-oriented models with variant Q10 function. Results and Conclusions: Our results showed substantial heterogeneity in the relationship between soil respiration and environmental factors across different climate regions. For example, soil respiration was strongly related to vegetation (via leaf area index) in colder regions, but not in tropical regions. Only in tropical and arid regions did soil properties explain any variation in soil respiration. Global annual mean soil respiration from 1961 to 2014 was estimated to be 72.41 Pg C yr-1 based on the monthly global soil respiration database, 25 Pg lower than that estimated from a yearly soil respiration database. By using the variable Q10 models, we estimated that global soil respiration increased at a rate of 0.03 Pg C yr-1 from 1961 to 2014, smaller than in previous studies (0.1 Pg C yr-1). The substantial variations in these relationships suggest that the regional scale is important for understanding and predicting global carbon cycling and how it responds to climate change.
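For readers unfamiliar with the temperature sensitivity term in this record, Q10 is conventionally the factor by which a rate changes for a 10 °C rise in temperature. The sketch below shows that standard constant-Q10 relationship only; it is not the region-specific variable-Q10 model of the study, and the function names, reference temperature and example values are assumptions for illustration.

```python
def q10_respiration(temp_c, r_ref, q10, t_ref=10.0):
    """Respiration rate at temp_c given the rate r_ref at reference temperature t_ref.

    Standard constant-Q10 form: R(T) = R_ref * Q10 ** ((T - T_ref) / 10).
    """
    return r_ref * q10 ** ((temp_c - t_ref) / 10.0)


def estimate_q10(rate_1, temp_1, rate_2, temp_2):
    """Estimate Q10 from two rate measurements taken at different temperatures."""
    return (rate_2 / rate_1) ** (10.0 / (temp_2 - temp_1))


# Hypothetical numbers: a rate of 2.0 units at 10 °C with Q10 = 2
# doubles to 4.0 units at 20 °C.
rate_at_20 = q10_respiration(20.0, r_ref=2.0, q10=2.0)
fitted_q10 = estimate_q10(1.0, 10.0, 4.0, 30.0)
```

A declining Q10 with warming, as the abstract discusses, would correspond to passing a smaller `q10` value at higher temperatures rather than a single constant.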

  18. Class in the Class: Sharing Bukowski's Class with Community College Students

    ERIC Educational Resources Information Center

    Hiraldo, Carlos

    2008-01-01

    Faculty members take pride in the great diversity of students attending LaGuardia Community College. Their students self-identify with various nationalities, races, religions, ethnicities, and sexual orientations. Not only do students adopt diverse identity markers, but they also come to their classroom with variant skill levels. It is difficult…

  19. TAPAS: tools to assist the targeted protein quantification of human alternative splice variants.

    PubMed

    Yang, Jae-Seong; Sabidó, Eduard; Serrano, Luis; Kiel, Christina

    2014-10-15

    In proteomes of higher eukaryotes, many alternative splice variants can only be detected by their shared peptides. This makes it highly challenging to use peptide-centric mass spectrometry to distinguish and to quantify protein isoforms resulting from alternative splicing events. We have developed two complementary algorithms based on linear mathematical models to efficiently compute a minimal set of shared and unique peptides needed to quantify a set of isoforms and splice variants. Further, we developed a statistical method to estimate the splice variant abundances based on stable isotope labeled peptide quantities. The algorithms and databases are integrated in a web-based tool, and we have experimentally tested the limits of our quantification method using spiked proteins and cell extracts. The TAPAS server is available at URL http://davinci.crg.es/tapas/. Contact: luis.serrano@crg.eu or christina.kiel@crg.eu. Supplementary data are available at Bioinformatics online. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  20. Twinning induced by the rhombohedral to orthorhombic phase transition in lanthanum gallate (LaGaO3)

    NASA Astrophysics Data System (ADS)

    Wang, W. L.; Lu, H. Y.

    2006-10-01

    Phase-transformation-induced twins in pressureless-sintered lanthanum gallate (LaGaO3) ceramics have been analysed using transmission electron microscopy (TEM). Twins are induced by solid-state phase transformation upon cooling from the rhombohedral (r, $R\bar{3}c$) to orthorhombic (o, $Pnma$) symmetry at ~145°C. Three types of transformation twins, {101}_o, {121}_o, and {123}_o, were found in grains containing multiple domains that represent orientation variants. Three orthorhombic orientation variants were distinguished from the transformation domains converged into a triple junction. These twins are of the reflection type, as confirmed by tilting experiments in the microscope. Although not related by a group-subgroup relation, the transformation twins generated by the phase transition from rhombohedral to orthorhombic are consistent with those derived from taking the cubic $Pm\bar{3}m$ aristotype of the lowest common supergroup symmetry as an intermediate metastable structure. The r→o phase transition, first order in nature, may have occurred by a diffusionless, martensitic-type or discontinuous nucleation and growth mechanism.

  1. MaizeGDB: New tools and resource

    USDA-ARS's Scientific Manuscript database

    MaizeGDB, the USDA-ARS genetics and genomics database, is a highly curated, community-oriented informatics service to researchers focused on the crop plant and model organism Zea mays. MaizeGDB facilitates maize research by curating, integrating, and maintaining a database that serves as the central...

  2. Novel GREM1 Variations in Sub-Saharan African Patients With Cleft Lip and/or Cleft Palate.

    PubMed

    Gowans, Lord Jephthah Joojo; Oseni, Ganiyu; Mossey, Peter A; Adeyemo, Wasiu Lanre; Eshete, Mekonen A; Busch, Tamara D; Donkor, Peter; Obiri-Yeboah, Solomon; Plange-Rhule, Gyikua; Oti, Alexander A; Owais, Arwa; Olaitan, Peter B; Aregbesola, Babatunde S; Oginni, Fadekemi O; Bello, Seidu A; Audu, Rosemary; Onwuamah, Chika; Agbenorku, Pius; Ogunlewe, Mobolanle O; Abdur-Rahman, Lukman O; Marazita, Mary L; Adeyemo, A A; Murray, Jeffrey C; Butali, Azeez

    2018-05-01

    Cleft lip and/or cleft palate (CL/P) are congenital anomalies of the face and have multifactorial etiology, with both environmental and genetic risk factors playing crucial roles. Though at least 40 loci have attained genomewide significant association with nonsyndromic CL/P, these loci largely reside in noncoding regions of the human genome, and subsequent resequencing studies of neighboring candidate genes have revealed only a limited number of etiologic coding variants. The present study was conducted to identify etiologic coding variants in GREM1, a locus that has been shown to be largely associated with cleft of both lip and soft palate. We resequenced DNA from 397 sub-Saharan Africans with CL/P and 192 controls using Sanger sequencing. Following analyses of the sequence data, we observed 2 novel coding variants in GREM1. These variants were not found in the 192 African controls and have never been previously reported in any public genetic variant database that includes more than 5000 combined African and African American controls or from the CL/P literature. The novel variants include p.Pro164Ser in an individual with soft palate cleft only and p.Gly61Asp in an individual with bilateral cleft lip and palate. The proband with the p.Gly61Asp GREM1 variant is a van der Woude (VWS) case who also has an etiologic variant in the IRF6 gene. Our study demonstrated that there is a low number of etiologic coding variants in GREM1, confirming earlier suggestions that variants in regulatory elements may largely account for the association between this locus and CL/P.

  3. The functional spectrum of low-frequency coding variation.

    PubMed

    Marth, Gabor T; Yu, Fuli; Indap, Amit R; Garimella, Kiran; Gravel, Simon; Leong, Wen Fung; Tyler-Smith, Chris; Bainbridge, Matthew; Blackwell, Tom; Zheng-Bradley, Xiangqun; Chen, Yuan; Challis, Danny; Clarke, Laura; Ball, Edward V; Cibulskis, Kristian; Cooper, David N; Fulton, Bob; Hartl, Chris; Koboldt, Dan; Muzny, Donna; Smith, Richard; Sougnez, Carrie; Stewart, Chip; Ward, Alistair; Yu, Jin; Xue, Yali; Altshuler, David; Bustamante, Carlos D; Clark, Andrew G; Daly, Mark; DePristo, Mark; Flicek, Paul; Gabriel, Stacey; Mardis, Elaine; Palotie, Aarno; Gibbs, Richard

    2011-09-14

    Rare coding variants constitute an important class of human genetic variation, but are underrepresented in current databases that are based on small population samples. Recent studies show that variants altering amino acid sequence and protein function are enriched at low variant allele frequency, 2 to 5%, but because of insufficient sample size it is not clear if the same trend holds for rare variants below 1% allele frequency. The 1000 Genomes Exon Pilot Project has collected deep-coverage exon-capture data in roughly 1,000 human genes, for nearly 700 samples. Although medical whole-exome projects are currently afoot, this is still the deepest reported sampling of a large number of human genes with next-generation technologies. According to the goals of the 1000 Genomes Project, we created effective informatics pipelines to process and analyze the data, and discovered 12,758 exonic SNPs, 70% of them novel, and 74% below 1% allele frequency in the seven population samples we examined. Our analysis confirms that coding variants below 1% allele frequency show increased population-specificity and are enriched for functional variants. This study represents a large step toward detecting and interpreting low frequency coding variation, clearly lays out technical steps for effective analysis of DNA capture data, and articulates functional and population properties of this important class of genetic variation.

  4. Genetic investigation of 100 heart genes in sudden unexplained death victims in a forensic setting

    PubMed Central

    Christiansen, Sofie Lindgren; Hertz, Christin Løth; Ferrero-Miliani, Laura; Dahl, Morten; Weeke, Peter Ejvin; LuCamp; Ottesen, Gyda Lolk; Frank-Hansen, Rune; Bundgaard, Henning; Morling, Niels

    2016-01-01

    In forensic medicine, one-third of the sudden deaths remain unexplained after medico-legal autopsy. A major proportion of these sudden unexplained deaths (SUD) are considered to be caused by inherited cardiac diseases. Sudden cardiac death (SCD) may be the first manifestation of these diseases. The purpose of this study was to explore the yield of next-generation sequencing of genes associated with SCD in a cohort of SUD victims. We investigated 100 genes associated with cardiac diseases in 61 young (1–50 years) SUD cases. DNA was captured with the Haloplex target enrichment system and sequenced using an Illumina MiSeq. The identified genetic variants were evaluated and classified as likely, unknown or unlikely to have a functional effect. The criteria for this classification were based on the literature, databases, conservation and prediction of the effect of the variant. We found that 21 (34%) individuals carried variants with a likely functional effect. Ten (40%) of these variants were located in genes associated with cardiomyopathies and 15 (60%) of the variants in genes associated with cardiac channelopathies. Nineteen individuals carried variants with unknown functional effect. Our findings indicate that broad genetic investigation of SUD victims increases the diagnostic outcome, and the investigation should comprise genes involved in both cardiomyopathies and cardiac channelopathies. PMID:27650965

  5. Pan-cancer analysis reveals technical artifacts in TCGA germline variant calls.

    PubMed

    Buckley, Alexandra R; Standish, Kristopher A; Bhutani, Kunal; Ideker, Trey; Lasken, Roger S; Carter, Hannah; Harismendy, Olivier; Schork, Nicholas J

    2017-06-12

    Cancer research to date has largely focused on somatically acquired genetic aberrations. In contrast, the degree to which germline, or inherited, variation contributes to tumorigenesis remains unclear, possibly due to a lack of accessible germline variant data. Here we called germline variants on 9618 cases from The Cancer Genome Atlas (TCGA) database representing 31 cancer types. We identified batch effects affecting loss of function (LOF) variant calls that can be traced back to differences in the way the sequence data were generated both within and across cancer types. Overall, LOF indel calls were more sensitive to technical artifacts than LOF Single Nucleotide Variant (SNV) calls. In particular, whole genome amplification of DNA prior to sequencing led to an artificially increased burden of LOF indel calls, which confounded association analyses relating germline variants to tumor type despite stringent indel filtering strategies. The samples affected by these technical artifacts include all acute myeloid leukemia and practically all ovarian cancer samples. We demonstrate how technical artifacts induced by whole genome amplification of DNA can lead to false positive germline-tumor type associations and suggest TCGA whole genome amplified samples be used with caution. This study draws attention to the need to be sensitive to problems associated with a lack of uniformity in data generation in TCGA data.

  6. Genetic investigation of 100 heart genes in sudden unexplained death victims in a forensic setting.

    PubMed

    Christiansen, Sofie Lindgren; Hertz, Christin Løth; Ferrero-Miliani, Laura; Dahl, Morten; Weeke, Peter Ejvin; LuCamp; Ottesen, Gyda Lolk; Frank-Hansen, Rune; Bundgaard, Henning; Morling, Niels

    2016-12-01

    In forensic medicine, one-third of the sudden deaths remain unexplained after medico-legal autopsy. A major proportion of these sudden unexplained deaths (SUD) are considered to be caused by inherited cardiac diseases. Sudden cardiac death (SCD) may be the first manifestation of these diseases. The purpose of this study was to explore the yield of next-generation sequencing of genes associated with SCD in a cohort of SUD victims. We investigated 100 genes associated with cardiac diseases in 61 young (1-50 years) SUD cases. DNA was captured with the Haloplex target enrichment system and sequenced using an Illumina MiSeq. The identified genetic variants were evaluated and classified as likely, unknown or unlikely to have a functional effect. The criteria for this classification were based on the literature, databases, conservation and prediction of the effect of the variant. We found that 21 (34%) individuals carried variants with a likely functional effect. Ten (40%) of these variants were located in genes associated with cardiomyopathies and 15 (60%) of the variants in genes associated with cardiac channelopathies. Nineteen individuals carried variants with unknown functional effect. Our findings indicate that broad genetic investigation of SUD victims increases the diagnostic outcome, and the investigation should comprise genes involved in both cardiomyopathies and cardiac channelopathies.

  7. Differences in Transcriptional Activity of Human Papillomavirus Type 6 Molecular Variants in Recurrent Respiratory Papillomatosis

    PubMed Central

    Measso do Bonfim, Caroline; Simão Sobrinho, João; Lacerda Nogueira, Rodrigo; Salgado Kupper, Daniel; Cardoso Pereira Valera, Fabiana; Lacerda Nogueira, Maurício; Villa, Luisa Lina; Rahal, Paula; Sichero, Laura

    2015-01-01

    A significant proportion of recurrent respiratory papillomatosis (RRP) is caused by human papillomavirus type 6 (HPV-6). The long control region (LCR) contains cis-elements for regulation of transcription. Our aim was to characterize LCR HPV-6 variants in RRP cases, compare promoter activity of these isolates and search for cellular transcription factors (TFs) that could explain the differences observed. The complete LCR from 13 RRP was analyzed. Transcriptional activity of 5 variants was compared using luciferase assays. Differences in putative TFs binding sites among variants were revealed using the TRANSFAC database. Chromatin immunoprecipitation (ChIP) and luciferase assays were used to evaluate TF binding and impact upon transcription, respectively. Juvenile-onset RRP cases harbored exclusively HPV-6vc related variants, whereas among adult-onset cases HPV-6a variants were more prevalent. The HPV-6vc reference was more transcriptionally active than the HPV-6a reference. Active FOXA1, ELF1 and GATA1 binding sites overlap variable nucleotide positions among isolates and influenced LCR activity. Furthermore, our results support a crucial role for ELF1 on transcriptional downregulation. We identified TFs implicated in the regulation of HPV-6 early gene expression. Many of these factors are mutated in cancer or are putative cancer biomarkers, and must be further studied. PMID:26151558

  8. regSNPs: a strategy for prioritizing regulatory single nucleotide substitutions

    PubMed Central

    Teng, Mingxiang; Ichikawa, Shoji; Padgett, Leah R.; Wang, Yadong; Mort, Matthew; Cooper, David N.; Koller, Daniel L.; Foroud, Tatiana; Edenberg, Howard J.; Econs, Michael J.; Liu, Yunlong

    2012-01-01

    Motivation: One of the fundamental questions in genetics study is to identify functional DNA variants that are responsible for a disease or phenotype of interest. Results from large-scale genetics studies, such as genome-wide association studies (GWAS), and the availability of high-throughput sequencing technologies provide opportunities in identifying causal variants. Despite the technical advances, informatics methodologies need to be developed to prioritize thousands of variants for potential causative effects. Results: We present regSNPs, an informatics strategy that integrates several established bioinformatics tools, for prioritizing regulatory SNPs, i.e. the SNPs in the promoter regions that potentially affect phenotype through changing transcription of downstream genes. Compared to existing tools, regSNPs has two distinct features. It considers degenerative features of binding motifs by calculating the differences on the binding affinity caused by the candidate variants and integrates potential phenotypic effects of various transcription factors. When tested by using the disease-causing variants documented in the Human Gene Mutation Database, regSNPs showed mixed performance on various diseases. regSNPs predicted three SNPs that can potentially affect bone density in a region detected in an earlier linkage study. Potential effects of one of the variants were validated using luciferase reporter assay. Contact: yunliu@iupui.edu Supplementary information: Supplementary data are available at Bioinformatics online PMID:22611130

  9. Exome analysis of a family with Wolff-Parkinson-White syndrome identifies a novel disease locus.

    PubMed

    Bowles, Neil E; Jou, Chuanchau J; Arrington, Cammon B; Kennedy, Brett J; Earl, Aubree; Matsunami, Norisada; Meyers, Lindsay L; Etheridge, Susan P; Saarel, Elizabeth V; Bleyl, Steven B; Yost, H Joseph; Yandell, Mark; Leppert, Mark F; Tristani-Firouzi, Martin; Gruber, Peter J

    2015-12-01

    Wolff-Parkinson-White (WPW) syndrome is a common cause of supraventricular tachycardia that carries a risk of sudden cardiac death. To date, mutations in only one gene, PRKAG2, which encodes the 5'-AMP-activated protein kinase subunit γ-2, have been identified as causative for WPW. DNA samples from five members of a family with WPW were analyzed by exome sequencing. We applied recently designed prioritization strategies (VAAST/pedigree VAAST) coupled with an ontology-based algorithm (Phevor) that reduced the number of potentially damaging variants to 10: a variant in KCNE2 previously associated with Long QT syndrome was also identified. Of these 11 variants, only MYH6 p.E1885K segregated with the WPW phenotype in all affected individuals and was absent in 10 unaffected family members. This variant was predicted to be damaging by in silico methods and is not present in the 1,000 genome and NHLBI exome sequencing project databases. Screening of a replication cohort of 47 unrelated WPW patients did not identify other likely causative variants in PRKAG2 or MYH6. MYH6 variants have been identified in patients with atrial septal defects, cardiomyopathies, and sick sinus syndrome. Our data highlight the pleiotropic nature of phenotypes associated with defects in this gene. © 2015 Wiley Periodicals, Inc.

  10. Exome Analysis of a Family with Wolff–Parkinson–White Syndrome Identifies a Novel Disease Locus

    PubMed Central

    Bowles, Neil E.; Jou, Chuanchau J.; Arrington, Cammon B.; Kennedy, Brett J.; Earl, Aubree; Matsunami, Norisada; Meyers, Lindsay L.; Etheridge, Susan P.; Saarel, Elizabeth V.; Bleyl, Steven B.; Yost, H. Joseph; Yandell, Mark; Leppert, Mark F.; Tristani-Firouzi, Martin; Gruber, Peter J.

    2016-01-01

    Wolff–Parkinson–White (WPW) syndrome is a common cause of supraventricular tachycardia that carries a risk of sudden cardiac death. To date, mutations in only one gene, PRKAG2, which encodes the 5’-AMP-activated protein kinase subunit γ-2, have been identified as causative for WPW. DNA samples from five members of a family with WPW were analyzed by exome sequencing. We applied recently designed prioritization strategies (VAAST/pedigree VAAST) coupled with an ontology-based algorithm (Phevor) that reduced the number of potentially damaging variants to 10: a variant in KCNE2 previously associated with Long QT syndrome was also identified. Of these 11 variants, only MYH6 p.E1885K segregated with the WPW phenotype in all affected individuals and was absent in 10 unaffected family members. This variant was predicted to be damaging by in silico methods and is not present in the 1,000 genome and NHLBI exome sequencing project databases. Screening of a replication cohort of 47 unrelated WPW patients did not identify other likely causative variants in PRKAG2 or MYH6. MYH6 variants have been identified in patients with atrial septal defects, cardiomyopathies, and sick sinus syndrome. Our data highlight the pleiotropic nature of phenotypes associated with defects in this gene. PMID:26284702

  11. Association between cytochrome CYP17A1, CYP3A4, and CYP3A43 polymorphisms and prostate cancer risk and aggressiveness in a Korean study population

    PubMed Central

    Han, Jun Hyun; Lee, Yong Seong; Kim, Hae Jong; Lee, Shin Young; Myung, Soon Chul

    2015-01-01

    In this study, we evaluated genetic variants of the androgen metabolism genes CYP17A1, CYP3A4, and CYP3A43 to determine whether they play a role in the development of prostate cancer (PCa) in Korean men. The study population included 240 pathologically diagnosed cases of PCa and 223 age-matched controls. Among the 789 single-nucleotide polymorphism (SNP) database variants detected, 129 were reported in two Asian groups (Han Chinese and Japanese) in the HapMap database. Only 21 polymorphisms of CYP17A1, CYP3A4, and CYP3A43 were selected based on linkage disequilibrium in Asians (r2 = 1), locations (SNPs in exons were preferred), and amino acid changes and were assessed. In addition, we performed haplotype analysis for the 21 SNPs in CYP17A1, CYP3A4, and CYP3A43 genes. To determine the association between genotype and haplotype distributions of patients and controls, logistic analyses were carried out, controlling for age. Twelve sequence variants and five major haplotypes were identified in CYP17A1. Five sequence variants and two major haplotypes were identified in CYP3A4. Four sequence variants and four major haplotypes were observed in CYP3A43. CYP17A1 haplotype-2 (Ht-2) (odds ratio [OR], 1.51; 95% confidence interval [CI], 1.04–2.18) was associated with PCa susceptibility. CYP3A4 Ht-2 (OR: 1.87; 95% CI: 1.02–3.43) was associated with PCa metastatic potential according to tumor stage. rs17115149 (OR: 1.96; 95% CI: 1.04–3.68) and CYP17A1 Ht-4 (OR: 2.01; 95% CI: 1.07–4.11) showed a significant association with histologic aggressiveness according to Gleason score. Genetic variants of CYP17A1 and CYP3A4 may play a role in the development of PCa in Korean men. PMID:25337833

  12. Association between cytochrome CYP17A1, CYP3A4, and CYP3A43 polymorphisms and prostate cancer risk and aggressiveness in a Korean study population.

    PubMed

    Han, Jun Hyun; Lee, Yong Seong; Kim, Hae Jong; Lee, Shin Young; Myung, Soon Chul

    2015-01-01

    In this study, we evaluated genetic variants of the androgen metabolism genes CYP17A1, CYP3A4, and CYP3A43 to determine whether they play a role in the development of prostate cancer (PCa) in Korean men. The study population included 240 pathologically diagnosed cases of PCa and 223 age-matched controls. Among the 789 single-nucleotide polymorphism (SNP) database variants detected, 129 were reported in two Asian groups (Han Chinese and Japanese) in the HapMap database. Only 21 polymorphisms of CYP17A1, CYP3A4, and CYP3A43 were selected based on linkage disequilibrium in Asians (r2 = 1), locations (SNPs in exons were preferred), and amino acid changes and were assessed. In addition, we performed haplotype analysis for the 21 SNPs in CYP17A1, CYP3A4, and CYP3A43 genes. To determine the association between genotype and haplotype distributions of patients and controls, logistic analyses were carried out, controlling for age. Twelve sequence variants and five major haplotypes were identified in CYP17A1. Five sequence variants and two major haplotypes were identified in CYP3A4. Four sequence variants and four major haplotypes were observed in CYP3A43. CYP17A1 haplotype-2 (Ht-2) (odds ratio [OR], 1.51; 95% confidence interval [CI], 1.04-2.18) was associated with PCa susceptibility. CYP3A4 Ht-2 (OR: 1.87; 95% CI: 1.02-3.43) was associated with PCa metastatic potential according to tumor stage. rs17115149 (OR: 1.96; 95% CI: 1.04-3.68) and CYP17A1 Ht-4 (OR: 2.01; 95% CI: 1.07-4.11) showed a significant association with histologic aggressiveness according to Gleason score. Genetic variants of CYP17A1 and CYP3A4 may play a role in the development of PCa in Korean men.

  13. The ACTN3 R577X variant in sprint and strength performance

    PubMed Central

    Kim, Hyeoijin; Song, Keon-Hyoung; Kim, Chul-Hyun

    2014-01-01

    [Purpose] The aim of this study is to examine the association between the distribution of ACTN3 genotypes and alleles in power, speed, and strength-oriented athletics. [Methods] ACTN3 genotyping was carried out for a total of 975 Korean participants: top-level sprinters (n = 58), top-level strength athletes (n = 63), and healthy controls (n = 854). [Results] Genetic associations were evaluated by chi-square test or Fisher’s exact test. In the power-oriented group composed of sprinters and strength athletes, the frequency of the XX genotype was significantly underrepresented (11.6%) in comparison to its representation in the control group (11.6% versus 19.1%, P < 0.05). When the power-oriented group was divided into strength-oriented and speed-oriented groups, no significant difference in the ACTN3 XX genotype was found between the strength-oriented athletes and the controls (15.9% versus 19.1%, P < 0.262). Only the speed-oriented athletes showed significant differences in the frequency distributions of the ACTN3 XX genotype (6.9% versus 19.1%, P < 0.05) from that of the controls. [Conclusion] The ACTN3 genotype seems to mainly affect sports performance and especially speed. PMID:25671201

  14. The ACTN3 R577X variant in sprint and strength performance.

    PubMed

    Kim, Hyeoijin; Song, Keon-Hyoung; Kim, Chul-Hyun

    2014-12-01

    The aim of this study is to examine the association between the distribution of ACTN3 genotypes and alleles in power, speed, and strength-oriented athletics. ACTN3 genotyping was carried out for a total of 975 Korean participants: top-level sprinters (n = 58), top-level strength athletes (n = 63), and healthy controls (n = 854). Genetic associations were evaluated by chi-square test or Fisher's exact test. In the power-oriented group composed of sprinters and strength athletes, the frequency of the XX genotype was significantly underrepresented (11.6%) in comparison to its representation in the control group (11.6% versus 19.1%, P < 0.05). When the power-oriented group was divided into strength-oriented and speed-oriented groups, no significant difference in the ACTN3 XX genotype was found between the strength-oriented athletes and the controls (15.9% versus 19.1%, P < 0.262). Only the speed-oriented athletes showed significant differences in the frequency distributions of the ACTN3 XX genotype (6.9% versus 19.1%, P < 0.05) from that of the controls. The ACTN3 genotype seems to mainly affect sports performance and especially speed.
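
    As an illustration of the chi-square comparison described above, the following minimal sketch recomputes the speed-oriented versus control contrast. The genotype counts are not given in the abstract; they are reconstructed here from the reported percentages and group sizes (an assumption), and the function name is hypothetical. The test shown is a plain Pearson chi-square without continuity correction, which may differ from the exact procedure the authors used.

```python
def chi_square_2x2(a, b, c, d):
    """Pearson chi-square statistic (no continuity correction) for the table [[a, b], [c, d]]."""
    n = a + b + c + d
    row1, row2 = a + b, c + d
    col1, col2 = a + c, b + d
    return n * (a * d - b * c) ** 2 / (row1 * row2 * col1 * col2)


# Assumption: XX counts reconstructed from the reported percentages and group sizes.
sprinters_total, controls_total = 58, 854
sprinters_xx = round(0.069 * sprinters_total)   # about 4 sprinters with XX
controls_xx = round(0.191 * controls_total)     # about 163 controls with XX

chi2 = chi_square_2x2(
    sprinters_xx, sprinters_total - sprinters_xx,
    controls_xx, controls_total - controls_xx,
)

# Critical value of the chi-square distribution for 1 degree of freedom at alpha = 0.05.
CRITICAL_005_DF1 = 3.841
significant = chi2 > CRITICAL_005_DF1

print(f"chi-square = {chi2:.3f}, significant at 0.05: {significant}")
```

    Under these reconstructed counts the statistic exceeds the 0.05 critical value, which is in line with the reported P < 0.05 for the speed-oriented group.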

  15. A resource oriented web service for environmental modeling

    NASA Astrophysics Data System (ADS)

    Ferencik, Ioan

    2013-04-01

    Environmental modeling is a widely adopted practice in the study of natural phenomena. Environmental models can be difficult to build and use and thus sharing them within the community is an important aspect. The most common approach to share a model is to expose it as a web service. In practice the interaction with this web service is cumbersome due to the lack of a standardized contract and the complexity of the model being exposed. In this work we investigate the use of a resource oriented approach in exposing environmental models as web services. We view a model as a layered resource built atop the object concept from Object Oriented Programming, augmented with persistence capabilities provided by an embedded object database to keep track of its state and implementing the four basic principles of resource oriented architectures: addressability, statelessness, representation and uniform interface. For implementation we use exclusively open source software: Django framework, dyBase object oriented database and Python programming language. We developed a generic framework of resources structured into a hierarchy of types and consequently extended this typology with resources specific to the domain of environmental modeling. To test our web service we used cURL, a robust command-line based web client.

  16. A Virtual "Hello": A Web-Based Orientation to the Library.

    ERIC Educational Resources Information Center

    Borah, Eloisa Gomez

    1997-01-01

    Describes the development of Web-based library services and resources available at the Rosenfeld Library of the Anderson Graduate School of Management at University of California at Los Angeles. Highlights include library orientation sessions; virtual tours of the library; a database of basic business sources; and research strategies, including…

  17. On Inference Rules of Logic-Based Information Retrieval Systems.

    ERIC Educational Resources Information Center

    Chen, Patrick Shicheng

    1994-01-01

    Discussion of relevance and the needs of the users in information retrieval focuses on a deductive object-oriented approach and suggests eight inference rules for the deduction. Highlights include characteristics of a deductive object-oriented system, database and data modeling language, implementation, and user interface. (Contains 24…

  18. Palmprint Recognition Across Different Devices.

    PubMed

    Jia, Wei; Hu, Rong-Xiang; Gui, Jie; Zhao, Yang; Ren, Xiao-Ming

    2012-01-01

    In this paper, the problem of Palmprint Recognition Across Different Devices (PRADD) is investigated, which has not been well studied so far. Since there is no publicly available PRADD image database, we created a non-contact PRADD image database containing 12,000 grayscale images captured from 100 subjects using three devices, i.e., one digital camera and two smart-phones. Due to the non-contact image acquisition used, rotation and scale changes between different images captured from the same palm are inevitable. We propose a robust method to calculate the palm width, which can be effectively used for scale normalization of palmprints. On this PRADD image database, we evaluate the recognition performance of three different methods, i.e., subspace learning method, correlation method, and orientation coding based method, respectively. Experimental results show that orientation coding based methods achieved promising recognition performance for PRADD.

  19. Palmprint Recognition across Different Devices

    PubMed Central

    Jia, Wei; Hu, Rong-Xiang; Gui, Jie; Zhao, Yang; Ren, Xiao-Ming

    2012-01-01

    In this paper, the problem of Palmprint Recognition Across Different Devices (PRADD) is investigated, which has not been well studied so far. Since there is no publicly available PRADD image database, we created a non-contact PRADD image database containing 12,000 grayscale images captured from 100 subjects using three devices, i.e., one digital camera and two smart-phones. Due to the non-contact image acquisition used, rotation and scale changes between different images captured from the same palm are inevitable. We propose a robust method to calculate the palm width, which can be effectively used for scale normalization of palmprints. On this PRADD image database, we evaluate the recognition performance of three different methods, i.e., subspace learning method, correlation method, and orientation coding based method, respectively. Experimental results show that orientation coding based methods achieved promising recognition performance for PRADD. PMID:22969380

  20. Genetic polymorphisms of pharmacogenomic VIP variants in the Yi population from China.

    PubMed

    Yan, Mengdan; Li, Dianzhen; Zhao, Guige; Li, Jing; Niu, Fanglin; Li, Bin; Chen, Peng; Jin, Tianbo

    2018-03-30

    Drug response and target therapeutic dosage are different among individuals. The variability is largely genetically determined. With the development of pharmacogenetics and pharmacogenomics, widespread research has provided us with a wealth of information on drug-related genetic polymorphisms, and the very important pharmacogenetic (VIP) variants have been identified for the major populations around the world whereas less is known regarding minorities in China, including the Yi ethnic group. Our research aims to screen the potential genetic variants in Yi population on pharmacogenomics and provide a theoretical basis for future medication guidance. In the present study, 80 VIP variants (selected from the PharmGKB database) were genotyped in 100 unrelated and healthy Yi adults recruited for our research. Through statistical analysis, we made a comparison between the Yi and 11 other populations listed in the HapMap database for significant SNPs detection. Two specific SNPs were subsequently enrolled in an observation on global allele distribution with the frequencies downloaded from ALlele FREquency Database. Moreover, F-statistics (Fst), genetic structure and phylogenetic tree analyses were conducted for determination of genetic similarity between the 12 ethnic groups. Using the χ2 tests, rs1128503 (ABCB1), rs7294 (VKORC1), rs9934438 (VKORC1), rs1540339 (VDR) and rs689466 (PTGS2) were identified as the significantly different loci for further analysis. The global allele distribution revealed that the allele "A" of rs1540339 and rs9934438 were more frequent in Yi people, which was consistent with most populations in East Asia. F-statistics (Fst), genetic structure and phylogenetic tree analyses demonstrated that the Yi and CHD shared the closest relationship on their genetic backgrounds. Additionally, Yi was considered similar to the Han people from Shaanxi province among the domestic ethnic populations in China. Our results demonstrated significant differences on several polymorphic SNPs and supplement the pharmacogenomic information for the Yi population, which could provide new strategies for optimizing clinical medication in accordance with the genetic determinants of drug toxicity and efficacy. Copyright © 2018 Elsevier B.V. All rights reserved.

  1. Competitive region orientation code for palmprint verification and identification

    NASA Astrophysics Data System (ADS)

    Tang, Wenliang

    2015-11-01

    Orientation features of the palmprint have been widely investigated in coding-based palmprint-recognition methods. Conventional orientation-based coding methods usually used discrete filters to extract the orientation feature of palmprint. However, in real operations, the orientations of the filter usually are not consistent with the lines of the palmprint. We thus propose a competitive region orientation-based coding method. Furthermore, an effective weighted balance scheme is proposed to improve the accuracy of the extracted region orientation. Compared with conventional methods, the region orientation of the palmprint extracted using the proposed method can precisely and robustly describe the orientation feature of the palmprint. Extensive experiments on the baseline PolyU and multispectral palmprint databases are performed and the results show that the proposed method achieves a promising performance in comparison to conventional state-of-the-art orientation-based coding methods in both palmprint verification and identification.

  2. Object-Oriented Approach to Integrating Database Semantics. Volume 4.

    DTIC Science & Technology

    1987-12-01

    schemata for; 1. Object Classification Schema -- Entities 2. Object Structure and Relationship Schema -- Relations 3. Operation Classification and... relationships are represented in a database is non-intuitive for naive users. *It is difficult to access and combine information in multiple databases. In this...from the CURRENT-.CLASSES table. Choosing a selected item de-selects it. Choose 0 to exit. 1. STUDENTS 2. CURRENT-.CLASSES 3. MANAGMNT -.CLASS

  3. Condensing Massive Satellite Datasets For Rapid Interactive Analysis

    NASA Astrophysics Data System (ADS)

    Grant, G.; Gallaher, D. W.; Lv, Q.; Campbell, G. G.; Fowler, C.; LIU, Q.; Chen, C.; Klucik, R.; McAllister, R. A.

    2015-12-01

    Our goal is to enable users to interactively analyze massive satellite datasets, identifying anomalous data or values that fall outside of thresholds. To achieve this, the project seeks to create a derived database containing only the most relevant information, accelerating the analysis process. The database is designed to be an ancillary tool for the researcher, not an archival database to replace the original data. This approach is aimed at improving performance by reducing the overall size by way of condensing the data. The primary challenges of the project include: - The nature of the research question(s) may not be known ahead of time. - The thresholds for determining anomalies may be uncertain. - Problems associated with processing cloudy, missing, or noisy satellite imagery. - The contents and method of creation of the condensed dataset must be easily explainable to users. The architecture of the database will reorganize spatially-oriented satellite imagery into temporally-oriented columns of data (a.k.a., "data rods") to facilitate time-series analysis. The database itself is an open-source parallel database, designed to make full use of clustered server technologies. A demonstration of the system capabilities will be shown. Applications for this technology include quick-look views of the data, as well as the potential for on-board satellite processing of essential information, with the goal of reducing data latency.
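
    The reorganization from spatially-oriented imagery into temporally-oriented "data rods", followed by a threshold check, can be sketched as below. This is only an illustration under stated assumptions: the abstract does not describe the actual schema or database, so the function names, the in-memory dictionary layout, the use of None for missing or cloudy pixels, and the sample values are all hypothetical.

```python
def to_data_rods(frames):
    """Turn a time-ordered list of 2D grids into one time series ("rod") per pixel.

    frames[t][r][c] is the value of pixel (r, c) at time step t.
    The result maps (r, c) to the list of values over time.
    """
    rows, cols = len(frames[0]), len(frames[0][0])
    return {
        (r, c): [frame[r][c] for frame in frames]
        for r in range(rows)
        for c in range(cols)
    }


def anomalous_times(rod, low, high):
    """Return the time indices whose value falls outside [low, high].

    Missing observations (None, e.g. cloudy pixels) are skipped rather than flagged.
    """
    return [t for t, v in enumerate(rod) if v is not None and not (low <= v <= high)]


# Hypothetical 2x2 imagery over three time steps; None marks a missing pixel.
frames = [
    [[1.0, 2.0], [3.0, 4.0]],
    [[1.1, 9.5], [None, 4.2]],
    [[0.9, 2.1], [3.1, 4.1]],
]

rods = to_data_rods(frames)
print(rods[(0, 1)])                             # time series for pixel (0, 1)
print(anomalous_times(rods[(0, 1)], 0.0, 5.0))  # time steps outside the thresholds
```

    Storing one rod per pixel means a time-series query touches a single entry instead of scanning every image, which is the motivation the abstract gives for the temporally-oriented layout.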

  4. Object-orientated DBMS techniques for time-oriented medical record.

    PubMed

    Pinciroli, F; Combi, C; Pozzi, G

    1992-01-01

    In implementing time-orientated medical record (TOMR) management systems, use of a relational model played a big role. Many applications have been developed to extend query and data manipulation languages to temporal aspects of information. Our experience in developing TOMR revealed some deficiencies inside the relational model, such as: (a) abstract data type definition; (b) unified view of data, at a programming level; (c) management of temporal data; (d) management of signals and images. We identified some first topics to face by an object-orientated approach to database design. This paper describes the first steps in designing and implementing a TOMR by an object-orientated DBMS.

  5. Object-oriented analysis and design of an ECG storage and retrieval system integrated with an HIS.

    PubMed

    Wang, C; Ohe, K; Sakurai, T; Nagase, T; Kaihara, S

    1996-03-01

    For a hospital information system, object-oriented methodology plays an increasingly important role, especially for the management of digitized data, e.g., the electrocardiogram, electroencephalogram, electromyogram, spirogram, X-ray, CT and histopathological images, which are not yet computerized in most hospitals. As a first step in an object-oriented approach to hospital information management and storing medical data in an object-oriented database, we connected electrocardiographs to a hospital network and established the integration of ECG storage and retrieval systems with a hospital information system. In this paper, the object-oriented analysis and design of the ECG storage and retrieval systems is reported.

  6. Imprecision and Uncertainty in the UFO Database Model.

    ERIC Educational Resources Information Center

    Van Gyseghem, Nancy; De Caluwe, Rita

    1998-01-01

    Discusses how imprecision and uncertainty are dealt with in the UFO (Uncertainty and Fuzziness in an Object-oriented) database model. Such information is expressed by means of possibility distributions, and modeled by means of the proposed concept of "role objects." The role objects model uncertain, tentative information about objects,…

  7. Genomic Approach to Understand the Association of DNA Repair with Longevity and Healthy Aging Using Genomic Databases of Oldest-Old Population

    PubMed Central

    Kim, Hyun Soo

    2018-01-01

    The aged population is increasing worldwide due to the aging process that is inevitable. Accordingly, longevity and healthy aging have been spotlighted to promote social contribution of the aged population. Many studies in the past few decades have reported the process of aging and longevity, emphasizing the importance of maintaining genomic stability in exceptionally long-lived populations. The underlying reason for longevity remains unclear due to its complexity involving multiple factors. With advances in sequencing technology and human genome-associated approaches, population-based genomic studies are increasing. In this review, we summarize recent longevity and healthy aging studies of human population focusing on DNA repair as a major factor in maintaining genome integrity. To keep pace with recent growth in genomic research, aging- and longevity-associated genomic databases are also briefly introduced. To suggest novel approaches to investigate longevity-associated genetic variants related to DNA repair using genomic databases, gene set analysis was conducted, focusing on DNA repair- and longevity-associated genes. Their biological networks were additionally analyzed to grasp major factors containing genetic variants of human longevity and healthy aging in DNA repair mechanisms. In summary, this review emphasizes DNA repair activity in human longevity and suggests an approach to conduct DNA repair-associated genomic study on human healthy aging.

  8. Nonlinear unitary transformations of space-variant polarized light fields from self-induced geometric-phase optical elements

    NASA Astrophysics Data System (ADS)

    Kravets, Nina; Brasselet, Etienne

    2018-01-01

    We propose to couple the optical orientational nonlinearities of liquid crystals with their ability to self-organize to tailor them to control space-variant-polarized optical fields in a nonlinear manner. Experimental demonstration is made using a liquid crystal light valve that behaves like a light-driven geometric phase optical element. We also unveil two original nonlinear optical processes, namely self-induced separability and nonseparability. These results contribute to the advancement of nonlinear singular optics that is still in its infancy despite 25 years of effort, which may foster the development of nonlinear protocols to manipulate high-dimensional optical information both in the classical and quantum regimes.

  9. Germline contamination and leakage in whole genome somatic single nucleotide variant detection.

    PubMed

    Sendorek, Dorota H; Caloian, Cristian; Ellrott, Kyle; Bare, J Christopher; Yamaguchi, Takafumi N; Ewing, Adam D; Houlahan, Kathleen E; Norman, Thea C; Margolin, Adam A; Stuart, Joshua M; Boutros, Paul C

    2018-01-31

    The clinical sequencing of cancer genomes to personalize therapy is becoming routine across the world. However, concerns over patient re-identification from these data lead to questions about how tightly access should be controlled. It is not thought to be possible to re-identify patients from somatic variant data. However, somatic variant detection pipelines can mistakenly identify germline variants as somatic ones, a process called "germline leakage". The rate of germline leakage across different somatic variant detection pipelines is not well-understood, and it is uncertain whether or not somatic variant calls should be considered re-identifiable. To fill this gap, we quantified germline leakage across 259 sets of whole-genome somatic single nucleotide variant (SNVs) predictions made by 21 teams as part of the ICGC-TCGA DREAM Somatic Mutation Calling Challenge. The median somatic SNV prediction set contained 4325 somatic SNVs and leaked one germline polymorphism. The level of germline leakage was inversely correlated with somatic SNV prediction accuracy and positively correlated with the amount of infiltrating normal cells. The specific germline variants leaked differed by tumour and algorithm. To aid in quantitation and correction of leakage, we created a tool, called GermlineFilter, for use in public-facing somatic SNV databases. The potential for patient re-identification from leaked germline variants in somatic SNV predictions has led to divergent open data access policies, based on different assessments of the risks. Indeed, a single, well-publicized re-identification event could reshape public perceptions of the values of genomic data sharing. We find that modern somatic SNV prediction pipelines have low germline-leakage rates, which can be further reduced, especially for cloud-sharing, using pre-filtering software.
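
    The idea of quantifying and pre-filtering germline leakage can be illustrated with a simple set comparison. This is not the GermlineFilter implementation, whose internals the abstract does not describe; the function name, the (chromosome, position, ref, alt) tuple representation, and the example variants are hypothetical and chosen only to show the principle.

```python
def split_germline_leakage(somatic_calls, germline_calls):
    """Separate predicted somatic variants into leaked germline variants and retained calls.

    Each variant is a (chromosome, position, ref, alt) tuple. A somatic call counts as
    "leaked" when the identical variant is present in the patient's germline call set.
    """
    germline = set(germline_calls)
    leaked = [v for v in somatic_calls if v in germline]
    kept = [v for v in somatic_calls if v not in germline]
    return leaked, kept


# Hypothetical call sets for one tumour/normal pair.
germline_calls = [("chr1", 12345, "A", "G"), ("chr2", 67890, "C", "T")]
somatic_calls = [
    ("chr1", 12345, "A", "G"),   # also in the germline set, so it leaked
    ("chr7", 55555, "G", "A"),
    ("chr17", 77777, "T", "C"),
]

leaked, kept = split_germline_leakage(somatic_calls, germline_calls)
leak_rate = len(leaked) / len(somatic_calls)
print(f"leaked {len(leaked)} of {len(somatic_calls)} calls ({leak_rate:.1%})")
```

    Publishing only the retained calls corresponds to the pre-filtering step the abstract recommends before sharing somatic SNV predictions.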

  10. The Personal Genome Project Canada: findings from whole genome sequences of the inaugural 56 participants

    PubMed Central

    Reuter, Miriam S.; Walker, Susan; Thiruvahindrapuram, Bhooma; Whitney, Joe; Cohn, Iris; Sondheimer, Neal; Yuen, Ryan K.C.; Trost, Brett; Paton, Tara A.; Pereira, Sergio L.; Herbrick, Jo-Anne; Wintle, Richard F.; Merico, Daniele; Howe, Jennifer; MacDonald, Jeffrey R.; Lu, Chao; Nalpathamkalam, Thomas; Sung, Wilson W.L.; Wang, Zhuozhi; Patel, Rohan V.; Pellecchia, Giovanna; Wei, John; Strug, Lisa J.; Bell, Sherilyn; Kellam, Barbara; Mahtani, Melanie M.; Bassett, Anne S.; Bombard, Yvonne; Weksberg, Rosanna; Shuman, Cheryl; Cohn, Ronald D.; Stavropoulos, Dimitri J.; Bowdin, Sarah; Hildebrandt, Matthew R.; Wei, Wei; Romm, Asli; Pasceri, Peter; Ellis, James; Ray, Peter; Meyn, M. Stephen; Monfared, Nasim; Hosseini, S. Mohsen; Joseph-George, Ann M.; Keeley, Fred W.; Cook, Ryan A.; Fiume, Marc; Lee, Hin C.; Marshall, Christian R.; Davies, Jill; Hazell, Allison; Buchanan, Janet A.; Szego, Michael J.; Scherer, Stephen W.

    2018-01-01

    BACKGROUND: The Personal Genome Project Canada is a comprehensive public data resource that integrates whole genome sequencing data and health information. We describe genomic variation identified in the initial recruitment cohort of 56 volunteers. METHODS: Volunteers were screened for eligibility and provided informed consent for open data sharing. Using blood DNA, we performed whole genome sequencing and identified all possible classes of DNA variants. A genetic counsellor explained the implication of the results to each participant. RESULTS: Whole genome sequencing of the first 56 participants identified 207 662 805 sequence variants and 27 494 copy number variations. We analyzed a prioritized disease-associated data set (n = 1606 variants) according to standardized guidelines, and interpreted 19 variants in 14 participants (25%) as having obvious health implications. Six of these variants (e.g., in BRCA1 or mosaic loss of an X chromosome) were pathogenic or likely pathogenic. Seven were risk factors for cancer, cardiovascular or neurobehavioural conditions. Four other variants — associated with cancer, cardiac or neurodegenerative phenotypes — remained of uncertain significance because of discrepancies among databases. We also identified a large structural chromosome aberration and a likely pathogenic mitochondrial variant. There were 172 recessive disease alleles (e.g., 5 individuals carried mutations for cystic fibrosis). Pharmacogenomics analyses revealed another 3.9 potentially relevant genotypes per individual. INTERPRETATION: Our analyses identified a spectrum of genetic variants with potential health impact in 25% of participants. When also considering recessive alleles and variants with potential pharmacologic relevance, all 56 participants had medically relevant findings. Although access is mostly limited to research, whole genome sequencing can provide specific and novel information with the potential of major impact for health care. PMID:29431110

  11. A comparison of cataloged variation between International HapMap Consortium and 1000 Genomes Project data

    PubMed Central

    Buchanan, Carrie C; Torstenson, Eric S; Bush, William S

    2012-01-01

    Background Since publication of the human genome in 2003, geneticists have been interested in risk variant associations to resolve the etiology of traits and complex diseases. The International HapMap Consortium undertook an effort to catalog all common variation across the genome (variants with a minor allele frequency (MAF) of at least 5% in one or more ethnic groups). HapMap along with advances in genotyping technology led to genome-wide association studies which have identified common variants associated with many traits and diseases. In 2008 the 1000 Genomes Project aimed to sequence 2500 individuals and identify rare variants and 99% of variants with a MAF of <1%. Methods To determine whether the 1000 Genomes Project includes all the variants in HapMap, we examined the overlap between single nucleotide polymorphisms (SNPs) genotyped in the two resources using merged phase II/III HapMap data and low coverage pilot data from 1000 Genomes. Results Comparison of the two data sets showed that approximately 72% of HapMap SNPs were also found in 1000 Genomes Project pilot data. After filtering out HapMap variants with a MAF of <5% (separately for each population), 99% of HapMap SNPs were found in 1000 Genomes data. Conclusions Not all variants cataloged in HapMap are also cataloged in 1000 Genomes. This could affect decisions about which resource to use for SNP queries, rare variant validation, or imputation. Both the HapMap and 1000 Genomes Project databases are useful resources for human genetics, but it is important to understand the assumptions made and filtering strategies employed by these projects. PMID:22319179
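
    The overlap comparison in the Methods and Results above can be sketched as follows. This is a simplified illustration, not the authors' pipeline: it assumes a single population (the study filtered separately for each population), and the function name, SNP identifiers, and MAF values are hypothetical.

```python
def overlap_fraction(hapmap_maf, kg_ids, min_maf=0.0):
    """Fraction of HapMap SNPs with MAF >= min_maf that also appear in kg_ids.

    hapmap_maf maps a SNP identifier to its minor allele frequency;
    kg_ids is the set of identifiers seen in the second resource.
    """
    kept = [snp for snp, maf in hapmap_maf.items() if maf >= min_maf]
    if not kept:
        return 0.0
    found = sum(1 for snp in kept if snp in kg_ids)
    return found / len(kept)


# Hypothetical toy data for one population.
hapmap_maf = {"snp1": 0.30, "snp2": 0.12, "snp3": 0.02, "snp4": 0.01}
kg_ids = {"snp1", "snp2", "snp4"}

all_fraction = overlap_fraction(hapmap_maf, kg_ids)
common_fraction = overlap_fraction(hapmap_maf, kg_ids, min_maf=0.05)
```

    With these toy values the unfiltered overlap is 3 of 4 SNPs, and it rises to 2 of 2 once SNPs with a MAF below 5% are removed, mirroring the direction of the reported change from about 72% to 99%.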

  12. Network Analysis of Sequence-Function Relationships and Exploration of Sequence Space of TEM β-Lactamases.

    PubMed

    Zeil, Catharina; Widmann, Michael; Fademrecht, Silvia; Vogel, Constantin; Pleiss, Jürgen

    2016-05-01

    The Lactamase Engineering Database (www.LacED.uni-stuttgart.de) was developed to facilitate the classification and analysis of TEM β-lactamases. The current version contains 474 TEM variants. Two hundred fifty-nine variants form a large scale-free network of highly connected point mutants. The network was divided into three subnetworks which were enriched by single phenotypes: one network with predominantly 2be and two networks with 2br phenotypes. Fifteen positions were found to be highly variable, contributing to the majority of the observed variants. Since it is expected that a considerable fraction of the theoretical sequence space is functional, the currently sequenced 474 variants represent only the tip of the iceberg of functional TEM β-lactamase variants which form a huge natural reservoir of highly interconnected variants. Almost 50% of the variants are part of a quartet. Thus, two single mutations that result in functional enzymes can be combined into a functional protein. Most of these quartets consist of the same phenotype, or the mutations are additive with respect to the phenotype. By predicting quartets from triplets, 3,916 unknown variants were constructed. Eighty-seven variants complement multiple quartets and therefore have a high probability of being functional. The construction of a TEM β-lactamase network and subsequent analyses by clustering and quartet prediction are valuable tools to gain new insights into the viable sequence space of TEM β-lactamases and to predict their phenotype. The highly connected sequence space of TEM β-lactamases is ideally suited to network analysis and demonstrates the strengths of network analysis over tree reconstruction methods. Copyright © 2016, American Society for Microbiology. All Rights Reserved.
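
    The idea of predicting quartets from triplets can be sketched in Python under stated assumptions. The abstract does not give the authors' exact procedure, so this sketch assumes a quartet consists of a base variant, two variants that each add one point mutation to it, and the variant carrying both mutations; it handles only the case where the combined variant is the missing corner. Variants are modelled as frozensets of mutation labels, and the labels themselves are placeholders.

```python
from itertools import combinations


def predict_missing_corners(known_variants):
    """Map each predicted variant to the number of triplets supporting it.

    A triplet here is a known base variant plus two known variants that each
    add exactly one mutation to it; the predicted variant combines both.
    """
    known = set(known_variants)
    support = {}
    for base in known:
        # Known variants that extend the base by exactly one mutation.
        singles = [v for v in known if len(v) == len(base) + 1 and base < v]
        for v1, v2 in combinations(singles, 2):
            candidate = v1 | v2
            if candidate not in known:
                support[candidate] = support.get(candidate, 0) + 1
    return support


# Placeholder mutation labels; not taken from the database.
known = {
    frozenset(),                    # reference sequence
    frozenset({"mutA"}),
    frozenset({"mutB"}),
    frozenset({"mutC"}),
    frozenset({"mutA", "mutB"}),    # this quartet is already complete
}
predicted = predict_missing_corners(known)
```

    Here two unknown variants are predicted, each supported by one triplet. A support count above one would correspond to a predicted variant that complements multiple quartets.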

  13. The Personal Genome Project Canada: findings from whole genome sequences of the inaugural 56 participants.

    PubMed

    Reuter, Miriam S; Walker, Susan; Thiruvahindrapuram, Bhooma; Whitney, Joe; Cohn, Iris; Sondheimer, Neal; Yuen, Ryan K C; Trost, Brett; Paton, Tara A; Pereira, Sergio L; Herbrick, Jo-Anne; Wintle, Richard F; Merico, Daniele; Howe, Jennifer; MacDonald, Jeffrey R; Lu, Chao; Nalpathamkalam, Thomas; Sung, Wilson W L; Wang, Zhuozhi; Patel, Rohan V; Pellecchia, Giovanna; Wei, John; Strug, Lisa J; Bell, Sherilyn; Kellam, Barbara; Mahtani, Melanie M; Bassett, Anne S; Bombard, Yvonne; Weksberg, Rosanna; Shuman, Cheryl; Cohn, Ronald D; Stavropoulos, Dimitri J; Bowdin, Sarah; Hildebrandt, Matthew R; Wei, Wei; Romm, Asli; Pasceri, Peter; Ellis, James; Ray, Peter; Meyn, M Stephen; Monfared, Nasim; Hosseini, S Mohsen; Joseph-George, Ann M; Keeley, Fred W; Cook, Ryan A; Fiume, Marc; Lee, Hin C; Marshall, Christian R; Davies, Jill; Hazell, Allison; Buchanan, Janet A; Szego, Michael J; Scherer, Stephen W

    2018-02-05

    The Personal Genome Project Canada is a comprehensive public data resource that integrates whole genome sequencing data and health information. We describe genomic variation identified in the initial recruitment cohort of 56 volunteers. Volunteers were screened for eligibility and provided informed consent for open data sharing. Using blood DNA, we performed whole genome sequencing and identified all possible classes of DNA variants. A genetic counsellor explained the implication of the results to each participant. Whole genome sequencing of the first 56 participants identified 207 662 805 sequence variants and 27 494 copy number variations. We analyzed a prioritized disease-associated data set (n = 1606 variants) according to standardized guidelines, and interpreted 19 variants in 14 participants (25%) as having obvious health implications. Six of these variants (e.g., in BRCA1 or mosaic loss of an X chromosome) were pathogenic or likely pathogenic. Seven were risk factors for cancer, cardiovascular or neurobehavioural conditions. Four other variants - associated with cancer, cardiac or neurodegenerative phenotypes - remained of uncertain significance because of discrepancies among databases. We also identified a large structural chromosome aberration and a likely pathogenic mitochondrial variant. There were 172 recessive disease alleles (e.g., 5 individuals carried mutations for cystic fibrosis). Pharmacogenomics analyses revealed another 3.9 potentially relevant genotypes per individual. Our analyses identified a spectrum of genetic variants with potential health impact in 25% of participants. When also considering recessive alleles and variants with potential pharmacologic relevance, all 56 participants had medically relevant findings. Although access is mostly limited to research, whole genome sequencing can provide specific and novel information with the potential of major impact for health care. © 2018 Joule Inc. or its licensors.

  14. Examining rare and low-frequency genetic variants previously associated with lone or familial forms of atrial fibrillation in an electronic medical record system: a cautionary note.

    PubMed

    Weeke, Peter; Denny, Joshua C; Basterache, Lisa; Shaffer, Christian; Bowton, Erica; Ingram, Christie; Darbar, Dawood; Roden, Dan M

    2015-02-01

    Studies in individuals or small kindreds have implicated rare variants in 25 different genes in lone and familial atrial fibrillation (AF) using linkage and segregation analysis, functional characterization, and rarity in public databases. Here, we used a cohort of 20 204 patients of European or African ancestry with electronic medical records and exome chip data to compare the frequency of AF among carriers and noncarriers of these rare variants. The exome chip included 19 of 115 rare variants, in 9 genes, previously associated with lone or familial AF. Using validated algorithms querying a combination of clinical notes, structured billing codes, ECG reports, and procedure codes, we identified 1056 AF cases (>18 years) and 19 148 non-AF controls (>50 years) with available genotype data on the Illumina HumanExome BeadChip v.1.0 in the Vanderbilt electronic medical record-linked DNA repository, BioVU. Known correlations between AF and common variants at 4q25 were replicated. None of the 19 variants previously associated with AF were over-represented among AF cases (P>0.1 for all), and the frequency of variant carriers among non-AF controls was >0.1% for 14 of 19. Repeat analyses using non-AF controls aged >60 (n=14 904), >70 (n=9670), and >80 (n=4729) years did not influence these findings. Rare variants previously implicated in lone or familial forms of AF present on the exome chip are detected at low frequencies in a general population but are not associated with AF. These findings emphasize the need for caution when ascribing variants as pathogenic or causative. © 2014 American Heart Association, Inc.

  15. Fine-Mapping of Common Genetic Variants Associated with Colorectal Tumor Risk Identified Potential Functional Variants

    PubMed Central

    Gala, Manish; Abecasis, Goncalo; Bezieau, Stephane; Brenner, Hermann; Butterbach, Katja; Caan, Bette J.; Carlson, Christopher S.; Casey, Graham; Chang-Claude, Jenny; Conti, David V.; Curtis, Keith R.; Duggan, David; Gallinger, Steven; Haile, Robert W.; Harrison, Tabitha A.; Hayes, Richard B.; Hoffmeister, Michael; Hopper, John L.; Hudson, Thomas J.; Jenkins, Mark A.; Küry, Sébastien; Le Marchand, Loic; Leal, Suzanne M.; Newcomb, Polly A.; Nickerson, Deborah A.; Potter, John D.; Schoen, Robert E.; Schumacher, Fredrick R.; Seminara, Daniela; Slattery, Martha L.; Hsu, Li; Chan, Andrew T.; White, Emily; Berndt, Sonja I.; Peters, Ulrike

    2016-01-01

    Genome-wide association studies (GWAS) have identified many common single nucleotide polymorphisms (SNPs) associated with colorectal cancer risk. These SNPs may tag correlated variants with biological importance. Fine-mapping around GWAS loci can facilitate detection of functional candidates and additional independent risk variants. We analyzed 11,900 cases and 14,311 controls in the Genetics and Epidemiology of Colorectal Cancer Consortium and the Colon Cancer Family Registry. To fine-map genomic regions containing all known common risk variants, we imputed high-density genetic data from the 1000 Genomes Project. We tested single-variant associations with colorectal tumor risk for all variants spanning genomic regions 250-kb upstream or downstream of 31 GWAS-identified SNPs (index SNPs). We queried the University of California, Santa Cruz Genome Browser to examine evidence for biological function. Index SNPs did not show the strongest association signals with colorectal tumor risk in their respective genomic regions. Bioinformatics analysis of SNPs showing smaller P-values in each region revealed 21 functional candidates in 12 loci (5q31.1, 8q24, 11q13.4, 11q23, 12p13.32, 12q24.21, 14q22.2, 15q13, 18q21, 19q13.1, 20p12.3, and 20q13.33). We did not observe evidence of additional independent association signals in GWAS-identified regions. Our results support the utility of integrating data from comprehensive fine-mapping with expanding publicly available genomic databases to help clarify GWAS associations and identify functional candidates that warrant more onerous laboratory follow-up. Such efforts may aid the eventual discovery of disease-causing variant(s). PMID:27379672

  16. BigQ: a NoSQL based framework to handle genomic variants in i2b2.

    PubMed

    Gabetta, Matteo; Limongelli, Ivan; Rizzo, Ettore; Riva, Alberto; Segagni, Daniele; Bellazzi, Riccardo

    2015-12-29

    Precision medicine requires the tight integration of clinical and molecular data. To this end, it is mandatory to define proper technological solutions able to manage the overwhelming amount of high throughput genomic data needed to test associations between genomic signatures and human phenotypes. The i2b2 Center (Informatics for Integrating Biology and the Bedside) has developed a widely internationally adopted framework to use existing clinical data for discovery research that can help the definition of precision medicine interventions when coupled with genetic data. i2b2 can be significantly advanced by designing efficient management solutions of Next Generation Sequencing data. We developed BigQ, an extension of the i2b2 framework, which integrates patient clinical phenotypes with genomic variant profiles generated by Next Generation Sequencing. A visual programming i2b2 plugin allows retrieving variants belonging to the patients in a cohort by applying filters on genomic variant annotations. We report an evaluation of the query performance of our system on more than 11 million variants, showing that the implemented solution scales linearly in terms of query time and disk space with the number of variants. In this paper we describe a new i2b2 web service composed of an efficient and scalable document-based database that manages annotations of genomic variants and of a visual programming plug-in designed to dynamically perform queries on clinical and genetic data. The system therefore allows managing the fast growing volume of genomic variants and can be used to integrate heterogeneous genomic annotations.

  17. Characterization of Novel Missense Variants of SERPINA1 Gene Causing Alpha-1 Antitrypsin Deficiency.

    PubMed

    Matamala, Nerea; Lara, Beatriz; Gomez-Mariano, Gema; Martínez, Selene; Retana, Diana; Fernandez, Taiomara; Silvestre, Ramona Angeles; Belmonte, Irene; Rodriguez-Frias, Francisco; Vilar, Marçal; Sáez, Raquel; Iturbe, Igor; Castillo, Silvia; Molina-Molina, María; Texido, Anna; Tirado-Conde, Gema; Lopez-Campos, Jose Luis; Posada, Manuel; Blanco, Ignacio; Janciauskiene, Sabina; Martinez-Delgado, Beatriz

    2018-06-01

    The SERPINA1 gene is highly polymorphic, with more than 100 variants described in databases. SERPINA1 encodes the alpha-1 antitrypsin (AAT) protein, and severe deficiency of AAT is a major contributor to pulmonary emphysema and liver diseases. In Spanish patients with AAT deficiency, we identified seven new variants of the SERPINA1 gene involving amino acid substitutions in different exons: PiSDonosti (S+Ser14Phe), PiTijarafe (Ile50Asn), PiSevilla (Ala58Asp), PiCadiz (Glu151Lys), PiTarragona (Phe227Cys), PiPuerto Real (Thr249Ala), and PiValencia (Lys328Glu). We examined the characteristics of these variants and the putative association with the disease. Mutant proteins were overexpressed in HEK293T cells, and AAT expression, polymerization, degradation, and secretion, as well as antielastase activity, were analyzed by periodic acid-Schiff staining, Western blotting, pulse-chase, and elastase inhibition assays. When overexpressed, S+S14F, I50N, A58D, F227C, and T249A variants formed intracellular polymers and did not secrete AAT protein. Both the E151K and K328E variants secreted AAT protein and did not form polymers, although K328E showed intracellular retention and reduced antielastase activity. We conclude that deficient variants may be more frequent than previously thought and that their discovery is possible only by the complete sequencing of the gene and subsequent functional characterization. Better knowledge of SERPINA1 variants would improve diagnosis and management of individuals with AAT deficiency.

  18. Reconciling newborn screening and a novel splice variant in BTD associated with partial biotinidase deficiency: A BabySeq Project case report.

    PubMed

    Murry, Jaclyn B; Machini, Kalotina; Ceyhan-Birsoy, Ozge; Kritzer, Amy; Krier, Joel B; Lebo, Matthew S; Fayer, Shawn; Genetti, Casie A; Vannoy, Grace E; Yu, Timothy W; Agrawal, Pankaj B; Parad, Richard B; Holm, Ingrid A; McGuire, Amy L; Green, Robert C; Beggs, Alan H; Rehm, Heidi L; The BabySeq Project

    2018-05-04

    Here, we report a newborn female infant from the well-baby cohort of the BabySeq Project who was identified with compound heterozygous BTD gene variants. The two identified variants included a well-established pathogenic variant (c.1612C>T, p.Arg538Cys) that causes profound biotinidase deficiency (BTD) in homozygosity. In addition, a novel splice variant (c.44+1G>A, p.?) was identified in the invariant splice donor region of intron 1, potentially predictive of loss of function. The novel variant was predicted to impact splicing of exon 1; however, given the absence of any reported pathogenic variants in exon 1 and the presence of alternative splicing with exon 1 absent in most tissues in the GTEx database, we assigned an initial classification of uncertain significance. Follow-up medical record review of state mandated newborn screen (NBS) results revealed an initial out-of-range biotinidase activity level. Levels from a repeat NBS sample barely passed cut-off into the normal range. To determine whether the infant was biotinidase deficient, subsequent diagnostic enzyme activity testing was performed, confirming partial BTD, and resulted in a change of management for this patient. This led to reclassification of the novel splice variant based on these results. In conclusion, combining the genetic and NBS results together prompted clinical follow-up that confirmed partial biotinidase deficiency, and informed this novel splice site's reclassification emphasizing the importance of combining iterative genetic and phenotypic evaluations. Cold Spring Harbor Laboratory Press.

  19. Missing data in substance abuse research? Researchers’ reporting practices of sexual orientation and gender identity

    PubMed Central

    Bacca, Cristina L.; Cochran, Bryan N.

    2014-01-01

    Background Lesbian, gay, bisexual, and transgender individuals are at higher risk for substance use and substance use disorders than heterosexual individuals and are more likely to seek substance use treatment, yet sexual orientation and gender identity are frequently not reported in the research literature. The purpose of this study was to identify if sexual orientation and gender identity are being reported in the recent substance use literature, and if this has changed over time. Method The PsycINFO and PubMed databases were searched for articles released in 2007 and 2012 using the term “substance abuse” and 200 articles were randomly selected from each time period and database. Articles were coded for the presence or absence of sexual orientation and gender identity information. Results Participants’ sexual orientation was reported in 3.0% and 4.9% of the 2007 and 2.3% and 6.5% of the 2012 sample, in PsycINFO and PubMed sample articles, respectively, while non-binary gender identity was reported in 0% and 1.0% of the 2007 sample and 2.3% and 1.9% of the 2012 PsycINFO and PubMed sample articles. There were no differences in rates of reporting over time. Conclusions Sexual orientation and gender identity are rarely reported in the substance abuse literature, and there has not been a change in reporting practices between 2007 and 2012. Recommendations for future investigators in reporting sexual orientation and gender identity are included. PMID:25496705

  20. Missing data in substance abuse research? Researchers' reporting practices of sexual orientation and gender identity.

    PubMed

    Flentje, Annesa; Bacca, Cristina L; Cochran, Bryan N

    2015-02-01

    Lesbian, gay, bisexual, and transgender individuals are at higher risk for substance use and substance use disorders than heterosexual individuals and are more likely to seek substance use treatment, yet sexual orientation and gender identity are frequently not reported in the research literature. The purpose of this study was to identify if sexual orientation and gender identity are being reported in the recent substance use literature, and if this has changed over time. The PsycINFO and PubMed databases were searched for articles released in 2007 and 2012 using the term "substance abuse" and 200 articles were randomly selected from each time period and database. Articles were coded for the presence or absence of sexual orientation and gender identity information. Participants' sexual orientation was reported in 3.0% and 4.9% of the 2007 and 2.3% and 6.5% of the 2012 sample, in PsycINFO and PubMed sample articles, respectively, while non-binary gender identity was reported in 0% and 1.0% of the 2007 sample and 2.3% and 1.9% of the 2012 PsycINFO and PubMed sample articles. There were no differences in rates of reporting over time. Sexual orientation and gender identity are rarely reported in the substance abuse literature, and there has not been a change in reporting practices between 2007 and 2012. Recommendations for future investigators in reporting sexual orientation and gender identity are included. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.

  1. Real-world clinical applicability of pathogenicity predictors assessed on SERPINA1 mutations in alpha-1-antitrypsin deficiency.

    PubMed

    Giacopuzzi, Edoardo; Laffranchi, Mattia; Berardelli, Romina; Ravasio, Viola; Ferrarotti, Ilaria; Gooptu, Bibek; Borsani, Giuseppe; Fra, Annamaria

    2018-06-07

    The growth of publicly available data informing upon genetic variations, mechanisms of disease and disease sub-phenotypes offers great potential for personalised medicine. Computational approaches are likely required to assess large numbers of novel genetic variants. However, the integration of genetic, structural and pathophysiological data still represents a challenge for computational predictions and their clinical use. We addressed these issues for alpha-1-antitrypsin deficiency, a disease mediated by mutations in the SERPINA1 gene encoding alpha-1-antitrypsin. We compiled a comprehensive database of SERPINA1 coding mutations and assigned them apparent pathological relevance based upon available data. 'Benign' and 'Pathogenic' mutations were used to assess performance of 31 pathogenicity predictors. Well-performing algorithms clustered the subset of variants known to be severely pathogenic with high scores. Eight new mutations identified in the ExAC database and achieving high scores were selected for characterisation in cell models and showed secretory deficiency and polymer formation, supporting the predictive power of our computational approach. The behaviour of the pathogenic new variants and consistent outliers were rationalised by considering the protein structural context and residue conservation. These findings highlight the potential of computational methods to provide meaningful predictions of the pathogenic significance of novel mutations and identify areas for further investigation. This article is protected by copyright. All rights reserved.

  2. Three-dimensional spatial analysis of missense variants in RTEL1 identifies pathogenic variants in patients with Familial Interstitial Pneumonia.

    PubMed

    Sivley, R Michael; Sheehan, Jonathan H; Kropski, Jonathan A; Cogan, Joy; Blackwell, Timothy S; Phillips, John A; Bush, William S; Meiler, Jens; Capra, John A

    2018-01-23

    Next-generation sequencing of individuals with genetic diseases often detects candidate rare variants in numerous genes, but determining which are causal remains challenging. We hypothesized that the spatial distribution of missense variants in protein structures contains information about function and pathogenicity that can help prioritize variants of unknown significance (VUS) and elucidate the structural mechanisms leading to disease. To illustrate this approach in a clinical application, we analyzed 13 candidate missense variants in regulator of telomere elongation helicase 1 (RTEL1) identified in patients with Familial Interstitial Pneumonia (FIP). We curated pathogenic and neutral RTEL1 variants from the literature and public databases. We then used homology modeling to construct a 3D structural model of RTEL1 and mapped known variants into this structure. We next developed a pathogenicity prediction algorithm based on proximity to known disease causing and neutral variants and evaluated its performance with leave-one-out cross-validation. We further validated our predictions with segregation analyses, telomere lengths, and mutagenesis data from the homologous XPD protein. Our algorithm for classifying RTEL1 VUS based on spatial proximity to pathogenic and neutral variation accurately distinguished 7 known pathogenic from 29 neutral variants (ROC AUC = 0.85) in the N-terminal domains of RTEL1. Pathogenic proximity scores were also significantly correlated with effects on ATPase activity (Pearson r = -0.65, p = 0.0004) in XPD, a related helicase. Applying the algorithm to 13 VUS identified from sequencing of RTEL1 from patients predicted five out of six disease-segregating VUS to be pathogenic. We provide structural hypotheses regarding how these mutations may disrupt RTEL1 ATPase and helicase function. Spatial analysis of missense variation accurately classified candidate VUS in RTEL1 and suggests how such variants cause disease. Incorporating spatial proximity analyses into other pathogenicity prediction tools may improve accuracy for other genes and genetic diseases.

  3. A systematic review and meta-analysis of genetic association studies for the role of inflammation and the immune system in diabetic nephropathy

    PubMed Central

    Tziastoudi, Maria; Hadjigeorgiou, Georgios M.; Stravodimos, Konstantinos; Zintzaras, Elias

    2017-01-01

    Abstract Background: Despite the certain contribution of metabolic and haemodynamic factors in diabetic nephropathy (DN), many lines of evidence highlight the role of immunologic and inflammatory mechanisms. To elucidate the contribution of the immune system in the development of DN, we explored the contribution of gene variants (polymorphisms) in relevant pathophysiologic pathways. Methods: We selected six major pathways related to immune response from the Kyoto Encyclopaedia of Genes and Genomes database and thereafter we traced all available genetic association studies (GASs) involving gene variants in these pathways from PubMed and HuGE Navigator. Finally, we used meta-analytic methods for synthesizing the results of the GASs. Results: One hundred three GASs were retrieved that included 443 variants from 75 genes. Of those variants, 138 were meta-analysed and 61 produced significant results; seven variants were investigated in single GASs and showed significant association. Variants in CCL2, CCR5, IL6, IL8, EPO, IL1A, IL1B, IL100, IL1RN, GHRL, MMP9, TGFB1, VEGFA, MMP3, MMP12, IL12RB1, PRKCE, TNF and TNFRSF19 genes were associated with an increased risk of DN. Conclusions: There is evidence that variants related with immunologic response affect the course of DN. However, the present results should be interpreted with caution since the current number of available GASs is limited. PMID:28616206

  4. Cloud-Based NoSQL Open Database of Pulmonary Nodules for Computer-Aided Lung Cancer Diagnosis and Reproducible Research.

    PubMed

    Ferreira Junior, José Raniery; Oliveira, Marcelo Costa; de Azevedo-Marques, Paulo Mazzoncini

    2016-12-01

    Lung cancer is the leading cause of cancer-related deaths in the world, and its main manifestation is pulmonary nodules. Detection and classification of pulmonary nodules are challenging tasks that must be done by qualified specialists, but image interpretation errors make those tasks difficult. In order to aid radiologists in those hard tasks, it is important to integrate the computer-based tools with the lesion detection, pathology diagnosis, and image interpretation processes. However, computer-aided diagnosis research faces the problem of not having enough shared medical reference data for the development, testing, and evaluation of computational methods for diagnosis. In order to minimize this problem, this paper presents a public nonrelational document-oriented cloud-based database of pulmonary nodules characterized by 3D texture attributes, identified by experienced radiologists and classified in nine different subjective characteristics by the same specialists. Our goal with the development of this database is to improve computer-aided lung cancer diagnosis and pulmonary nodule detection and classification research through the deployment of this database in a cloud Database as a Service framework. Pulmonary nodule data was provided by the Lung Image Database Consortium and Image Database Resource Initiative (LIDC-IDRI), image descriptors were acquired by a volumetric texture analysis, and the database schema was developed using a document-oriented Not only Structured Query Language (NoSQL) approach. The proposed database now holds 379 exams, 838 nodules, and 8237 images (4029 CT scans and 4208 manually segmented nodules), and it is hosted in a MongoDB instance on a cloud infrastructure.

  5. Dictionary learning-based CT detection of pulmonary nodules

    NASA Astrophysics Data System (ADS)

    Wu, Panpan; Xia, Kewen; Zhang, Yanbo; Qian, Xiaohua; Wang, Ge; Yu, Hengyong

    2016-10-01

    Segmentation of lung features is one of the most important steps for computer-aided detection (CAD) of pulmonary nodules with computed tomography (CT). However, irregular shapes, complicated anatomical background and poor pulmonary nodule contrast make CAD a very challenging problem. Here, we propose a novel scheme for feature extraction and classification of pulmonary nodules through dictionary learning from training CT images, which does not require accurately segmented pulmonary nodules. Specifically, two classification-oriented dictionaries and one background dictionary are learnt to solve a two-category problem. In terms of the classification-oriented dictionaries, we calculate sparse coefficient matrices to extract intrinsic features for pulmonary nodule classification. The support vector machine (SVM) classifier is then designed to optimize the performance. Our proposed methodology is evaluated with the lung image database consortium and image database resource initiative (LIDC-IDRI) database, and the results demonstrate that the proposed strategy is promising.

  6. Experimental quantum private queries with linear optics

    NASA Astrophysics Data System (ADS)

    de Martini, Francesco; Giovannetti, Vittorio; Lloyd, Seth; Maccone, Lorenzo; Nagali, Eleonora; Sansoni, Linda; Sciarrino, Fabio

    2009-07-01

    The quantum private query is a quantum cryptographic protocol to recover information from a database, preserving both user and data privacy: the user can test whether someone has retained information on which query was asked and the database provider can test the amount of information released. Here we discuss a variant of the quantum private query algorithm that admits a simple linear optical implementation: it employs the photon’s momentum (or time slot) as address qubits and its polarization as bus qubit. A proof-of-principle experimental realization is implemented.

  7. Database for Parkinson Disease Mutations and Rare Variants

    DTIC Science & Technology

    2015-07-01


  8. Mapping Creativity: Creativity Measurements Network Analysis

    ERIC Educational Resources Information Center

    Pinheiro, Igor Reszka; Cruz, Roberto Moraes

    2014-01-01

    This article borrowed network analysis tools to discover how the construct formed by the set of all measures of creativity configures itself. To this end, using a variant of the meta-analytical method, a database was compiled simulating 42,381 responses to 974 variables centered on 64 creativity measures. Results, although preliminary, indicate…

  9. Benefits of an Object-oriented Database Representation for Controlled Medical Terminologies

    PubMed Central

    Gu, Huanying; Halper, Michael; Geller, James; Perl, Yehoshua

    1999-01-01

    Objective: Controlled medical terminologies (CMTs) have been recognized as important tools in a variety of medical informatics applications, ranging from patient-record systems to decision-support systems. Controlled medical terminologies are typically organized in semantic network structures consisting of tens to hundreds of thousands of concepts. This overwhelming size and complexity can be a serious barrier to their maintenance and widespread utilization. The authors propose the use of object-oriented databases to address the problems posed by the extensive scope and high complexity of most CMTs for maintenance personnel and general users alike. Design: The authors present a methodology that allows an existing CMT, modeled as a semantic network, to be represented as an equivalent object-oriented database. Such a representation is called an object-oriented health care terminology repository (OOHTR). Results: The major benefit of an OOHTR is its schema, which provides an important layer of structural abstraction. Using the high-level view of a CMT afforded by the schema, one can gain insight into the CMT's overarching organization and begin to better comprehend it. The authors' methodology is applied to the Medical Entities Dictionary (MED), a large CMT developed at Columbia-Presbyterian Medical Center. Examples of how the OOHTR schema facilitated updating, correcting, and improving the design of the MED are presented. Conclusion: The OOHTR schema can serve as an important abstraction mechanism for enhancing comprehension of a large CMT, and thus promotes its usability. PMID:10428002

  10. Database computing in HEP

    NASA Technical Reports Server (NTRS)

    Day, C. T.; Loken, S.; Macfarlane, J. F.; May, E.; Lifka, D.; Lusk, E.; Price, L. E.; Baden, A.; Grossman, R.; Qin, X.

    1992-01-01

    The major SSC experiments are expected to produce up to 1 Petabyte of data per year each. Once the primary reconstruction is completed by farms of inexpensive processors, I/O becomes a major factor in further analysis of the data. We believe that the application of database techniques can significantly reduce the I/O performed in these analyses. We present examples of such I/O reductions in prototypes based on relational and object-oriented databases of CDF data samples.

  11. ClinGen--the Clinical Genome Resource.

    PubMed

    Rehm, Heidi L; Berg, Jonathan S; Brooks, Lisa D; Bustamante, Carlos D; Evans, James P; Landrum, Melissa J; Ledbetter, David H; Maglott, Donna R; Martin, Christa Lese; Nussbaum, Robert L; Plon, Sharon E; Ramos, Erin M; Sherry, Stephen T; Watson, Michael S

    2015-06-04

    On autopsy, a patient is found to have hypertrophic cardiomyopathy. The patient’s family pursues genetic testing that shows a “likely pathogenic” variant for the condition on the basis of a study in an original research publication. Given the dominant inheritance of the condition and the risk of sudden cardiac death, other family members are tested for the genetic variant to determine their risk. Several family members test negative and are told that they are not at risk for hypertrophic cardiomyopathy and sudden cardiac death, and those who test positive are told that they need to be regularly monitored for cardiomyopathy on echocardiography. Five years later, during a routine clinic visit of one of the genotype-positive family members, the cardiologist queries a database for current knowledge on the genetic variant and discovers that the variant is now interpreted as “likely benign” by another laboratory that uses more recently derived population-frequency data. A newly available testing panel for additional genes that are implicated in hypertrophic cardiomyopathy is initiated on an affected family member, and a different variant is found that is determined to be pathogenic. Family members are retested, and one member who previously tested negative is now found to be positive for this new variant. An immediate clinical workup detects evidence of cardiomyopathy, and an intracardiac defibrillator is implanted to reduce the risk of sudden cardiac death.

  12. Proposal for Implementing Multi-User Database (MUD) Technology in an Academic Library.

    ERIC Educational Resources Information Center

    Filby, A. M. Iliana

    1996-01-01

    Explores the use of MOO (multi-user object oriented) virtual environments in academic libraries to enhance reference services. Highlights include the development of multi-user database (MUD) technology from gaming to non-recreational settings; programming issues; collaborative MOOs; MOOs as distinguished from other types of virtual reality; audio…

  13. Epitaxial growth of (111)-oriented BaTiO3/SrTiO3 perovskite superlattices on Pt(111)/Ti/Al2O3(0001) substrates

    NASA Astrophysics Data System (ADS)

    Panomsuwan, Gasidit; Takai, Osamu; Saito, Nagahiro

    2013-09-01

    Symmetric BaTiO3/SrTiO3 (BTO/STO) superlattices (SLs) were epitaxially grown on Pt(111)/Ti/Al2O3(0001) substrates with various modulation periods (Λ = 4.8 - 48 nm) using double ion beam sputter deposition. The BTO/STO SLs exhibit high (111) orientation with two in-plane orientation variants related by a 180° rotation along the [111]Pt axis. The BTO layer is under an in-plane compressive state, whereas the STO layer is under an in-plane tensile state due to the effect of lattice mismatch. A remarkable enhancement of dielectric constant is observed for the SL with relatively small modulation period, which is attributed to both the interlayer biaxial strain effect and the Maxwell-Wagner effect.

  14. The effect of grain orientation on nanoindentation behavior of model austenitic alloy Fe-20Cr-25Ni

    DOE PAGES

    Chen, Tianyi; Tan, Lizhen; Lu, Zizhe; ...

    2017-07-26

    Instrumented nanoindentation was used in this paper to investigate the hardness, elastic modulus, and creep behavior of an austenitic Fe-20Cr-25Ni model alloy at room temperature, with the indented grain orientation being the variant. The samples indented close to the {111} surfaces exhibited the highest hardness and modulus. However, nanoindentation creep tests showed the greatest tendency for creep in the {111} indented samples, compared with the samples indented close to the {001} and {101} surfaces. Scanning electron microscopy and cross-sectional transmission electron microscopy revealed slip bands and dislocations in all samples. The slip band patterns on the indented surfaces were influenced by the grain orientations. Deformation twinning was observed only under the {001} indented surfaces. Finally, microstructural analysis and molecular dynamics modeling correlated the anisotropic nanoindentation-creep behavior with the different dislocation substructures formed during indentation, which resulted from the dislocation reactions of certain active slip systems that are determined by the indented grain orientations.

  15. Establishment of an international database for genetic variants in esophageal cancer.

    PubMed

    Vihinen, Mauno

    2016-10-01

    The establishment of a database has been suggested in order to collect, organize, and distribute genetic information about esophageal cancer. The World Organization for Specialized Studies on Diseases of the Esophagus and the Human Variome Project will be in charge of a central database of information about esophageal cancer-related variations from publications, databases, and laboratories; in addition to genetic details, clinical parameters will also be included. The aim will be to get all the central players in research, clinical, and commercial laboratories to contribute. The database will follow established recommendations and guidelines. The database will require a team of dedicated curators with different backgrounds. Numerous layers of systematics will be applied to facilitate computational analyses. The data items will be extensively integrated with other information sources. The database will be distributed as open access to ensure exchange of the data with other databases. Variations will be reported in relation to reference sequences on three levels--DNA, RNA, and protein--whenever applicable. In the first phase, the database will concentrate on genetic variations including both somatic and germline variations for susceptibility genes. Additional types of information can be integrated at a later stage. © 2016 New York Academy of Sciences.

  16. Italian Present-day Stress Indicators: IPSI Database

    NASA Astrophysics Data System (ADS)

    Mariucci, M. T.; Montone, P.

    2017-12-01

    In Italy, research on the contemporary stress field has been under way at the Istituto Nazionale di Geofisica e Vulcanologia (INGV) since the 1990s, with local and regional scale studies. Over the years many data have been analysed and collected; they are now organized and available online for easy end use. The IPSI (Italian Present-day Stress Indicators) database is the first geo-referenced repository of information on the crustal present-day stress field, maintained at INGV through a web application database and website developed by Gabriele Tarabusi. Data consist of horizontal stress orientations analysed and compiled in a standardized format and quality-ranked for reliability and comparability on a global scale with other databases. Our first database release includes 855 data records updated to December 2015. Here we present an updated version that will be released in 2018, after new earthquake data entry up to December 2017. The IPSI web site (http://ipsi.rm.ingv.it/) allows users to access data on a standard map viewer and easily choose which data (category and/or quality) to plot. The main information for each element (type, quality, orientation) can be viewed by hovering over the related symbol, and all the information appears on clicking the element. Basic information on the different data types, tectonic regime assignment, and quality ranking method is available in pop-up windows. Data records can be downloaded in some common formats; it is also possible to download a file directly usable with SHINE, a web-based application to interpolate stress orientations (http://shine.rm.ingv.it). IPSI is mainly conceived for those interested in studying the characteristics of the Italian peninsula and its surroundings, although Italian data are part of the World Stress Map (http://www.world-stress-map.org/), as evidenced by many links that redirect to this database for more details on standard practices in this field.

  17. Correspondence Effects for Objects with Opposing Left and Right Protrusions

    ERIC Educational Resources Information Center

    Cho, Dongbin; Proctor, Robert W.

    2011-01-01

    Choice reactions to a property of an object stimulus are often faster when the location of a graspable part of the object corresponds with the location of a keypress response than when it does not, a phenomenon called the object-based Simon effect. Experiments 1-3 examined this effect for variants of teapot stimuli that were oriented to the left…

  18. Learning to Be (In)Variant: Combining Prior Knowledge and Experience to Infer Orientation Invariance in Object Recognition

    ERIC Educational Resources Information Center

    Austerweil, Joseph L.; Griffiths, Thomas L.; Palmer, Stephen E.

    2017-01-01

    How does the visual system recognize images of a novel object after a single observation despite possible variations in the viewpoint of that object relative to the observer? One possibility is comparing the image with a prototype for invariance over a relevant transformation set (e.g., translations and dilations). However, invariance over…

  19. The Cultivation of a Prosocial Value Orientation through Community Service: An Examination of Organizational Context, Social Facilitation, and Duration

    ERIC Educational Resources Information Center

    Horn, Aaron S.

    2012-01-01

    Community service is widely regarded as a fundamental experience in preparation for good citizenship, but it remains unclear whether common variants of service are consequential for civic outcomes. This study examines changes in the relative importance assigned to prosocial and egoistic values associated with service through different types of…

  20. Information model construction of MES oriented to mechanical blanking workshop

    NASA Astrophysics Data System (ADS)

    Wang, Jin-bo; Wang, Jin-ye; Yue, Yan-fang; Yao, Xue-min

    2016-11-01

    Manufacturing Execution System (MES) is one of the crucial technologies for implementing informatization management in manufacturing enterprises, and the construction of its information model is the basis of MES database development. Based on an analysis of the manufacturing process information in a mechanical blanking workshop and the information requirements of each MES function module, the IDEF1X method was adopted to construct the information model of an MES oriented to the mechanical blanking workshop. A detailed description of the data structure features of each MES function module and their logical relationships was given from the point of view of information relationships, which laid the foundation for the design of the MES database.

  1. Mechanistic Insights into Archaeal and Human Argonaute Substrate Binding and Cleavage Properties

    PubMed Central

    Willkomm, Sarah; Zander, Adrian; Grohmann, Dina; Restle, Tobias

    2016-01-01

    Argonaute (Ago) proteins from all three domains of life are key players in processes that specifically regulate cellular nucleic acid levels. Some of these Ago proteins, among them human Argonaute2 (hAgo2) and Ago from the archaeal organism Methanocaldococcus jannaschii (MjAgo), are able to cleave nucleic acid target strands that are recognised via an Ago-associated complementary guide strand. Here we present an in-depth kinetic side-by-side analysis of hAgo2 and MjAgo guide and target substrate binding as well as target strand cleavage, which enabled us to disclose similarities and differences in the mechanistic pathways as a function of the chemical nature of the substrate. Testing all possible guide-target combinations (i.e. RNA/RNA, RNA/DNA, DNA/RNA and DNA/DNA) with both Ago variants we demonstrate that the molecular mechanism of substrate association is highly conserved among archaeal-eukaryotic Argonautes. Furthermore, we show that hAgo2 binds RNA and DNA guide strands in the same fashion. On the other hand, despite striking homology between the two Ago variants, MjAgo cannot orientate guide RNA substrates in a way that allows interaction with the target DNA in a cleavage-compatible orientation. PMID:27741323

  2. A particle swarm optimization variant with an inner variable learning strategy.

    PubMed

    Wu, Guohua; Pedrycz, Witold; Ma, Manhao; Qiu, Dishan; Li, Haifeng; Liu, Jin

    2014-01-01

    Although Particle Swarm Optimization (PSO) has demonstrated competitive performance in solving global optimization problems, it exhibits some limitations when dealing with optimization problems with high dimensionality and complex landscape. In this paper, we integrate some problem-oriented knowledge into the design of a certain PSO variant. The resulting novel PSO algorithm with an inner variable learning strategy (PSO-IVL) is particularly efficient for optimizing functions with symmetric variables. Symmetric variables of the optimized function have to satisfy a certain quantitative relation. Based on this knowledge, the inner variable learning (IVL) strategy helps the particle to inspect the relation among its inner variables, determine the exemplar variable for all other variables, and then make each variable learn from the exemplar variable in terms of their quantitative relations. In addition, we design a new trap detection and jumping out strategy to help particles escape from local optima. The trap detection operation is employed at the level of individual particles whereas the trap jumping out strategy is adaptive in its nature. Experimental simulations completed for some representative optimization functions demonstrate the excellent performance of PSO-IVL. The effectiveness of PSO-IVL underscores the usefulness of augmenting evolutionary algorithms with problem-oriented domain knowledge.
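
    For readers unfamiliar with the baseline that PSO-IVL extends, the following is a minimal sketch of plain global-best PSO. It does not implement the inner variable learning step or the trap detection strategy, whose details the abstract does not give; the parameter values (`w`, `c1`, `c2`, swarm size, iteration count) and the sphere test function are assumptions chosen for illustration.

```python
import random


def sphere(x):
    """Test objective: sum of squares, with its minimum of 0 at the origin."""
    return sum(v * v for v in x)


def pso(objective, dim, bounds, n_particles=20, n_iters=200,
        w=0.7, c1=1.5, c2=1.5, seed=0):
    """Plain global-best PSO (no IVL step, no trap detection)."""
    rng = random.Random(seed)
    lo, hi = bounds
    pos = [[rng.uniform(lo, hi) for _ in range(dim)] for _ in range(n_particles)]
    vel = [[0.0] * dim for _ in range(n_particles)]
    pbest = [p[:] for p in pos]                   # personal best positions
    pbest_val = [objective(p) for p in pos]       # personal best values
    g = min(range(n_particles), key=lambda i: pbest_val[i])
    gbest, gbest_val = pbest[g][:], pbest_val[g]  # swarm-wide best

    for _ in range(n_iters):
        for i in range(n_particles):
            for d in range(dim):
                r1, r2 = rng.random(), rng.random()
                # Inertia + pull toward personal best + pull toward global best.
                vel[i][d] = (w * vel[i][d]
                             + c1 * r1 * (pbest[i][d] - pos[i][d])
                             + c2 * r2 * (gbest[d] - pos[i][d]))
                # Move, then clamp the coordinate to the search bounds.
                pos[i][d] = min(hi, max(lo, pos[i][d] + vel[i][d]))
            val = objective(pos[i])
            if val < pbest_val[i]:
                pbest[i], pbest_val[i] = pos[i][:], val
                if val < gbest_val:
                    gbest, gbest_val = pos[i][:], val
    return gbest, gbest_val


best_x, best_val = pso(sphere, dim=3, bounds=(-5.0, 5.0))
```

    In this baseline every coordinate is updated independently; the IVL idea described above would add a step in which coordinates learn from an exemplar coordinate according to a known relation among the variables.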

  3. Mental-orientation: A new approach to assessing patients across the Alzheimer's disease spectrum.

    PubMed

    Peters-Founshtein, Gregory; Peer, Michael; Rein, Yanai; Kahana Merhavi, Shlomzion; Meiner, Zeev; Arzy, Shahar

    2018-05-21

    This study aims to assess the role of mental-orientation in the diagnosis of mild cognitive impairment and Alzheimer's disease using a novel task. A behavioral study (Experiment 1) compared the mental-orientation task to standard neuropsychological tests in patients across the Alzheimer's disease spectrum. A functional MRI study (Experiment 2) in young adults compared activations evoked by the mental-orientation and standard-orientation tasks as well as their overlap with brain regions susceptible to Alzheimer's disease pathology. The mental-orientation task differentiated mild cognitively impaired and healthy controls at 95% accuracy, while the Addenbrooke's Cognitive Examination, Mini-Mental State Examination and standard-orientation achieved 74%, 70% and 50% accuracy, respectively. Functional MRI revealed the mental-orientation task to preferentially recruit brain regions exhibiting early Alzheimer's-related atrophy, unlike the standard-orientation test. Mental-orientation is suggested to play a key role in Alzheimer's disease, and consequently in early detection and follow-up of patients along the Alzheimer's disease spectrum. (PsycINFO Database Record (c) 2018 APA, all rights reserved).

  4. Influence of Hydrogen and Number of Particle Variants on Ordinary and Two-Way Shape Memory Effects in Ti-Ni Single Crystals

    NASA Astrophysics Data System (ADS)

    Kireeva, I. V.; Platonova, Yu. N.; Chumlyakov, Yu. I.

    2017-02-01

    The ordinary and two-way shape memory effects (SMEs) are investigated for $[\bar{1}12]$ single crystals of Ti-51.3Ni (at.%) alloy aged at 823 K for 1.5 h in free state and under tensile stress of 150 MPa without hydrogen and after saturation by hydrogen. It is established that without hydrogen in $[\bar{1}12]$ single crystals with one and four variants of Ti3Ni4 particles the maximum magnitude of the ordinary SME is 1.9-2.6% under the external stress σext = 250 MPa. Under σext > 250 MPa, crystals are destroyed. The magnitude of the two-way SME caused by the B2-R-B19' MT equal to 1.1% at σext = 0 is observed in $[\bar{1}12]$ single crystals with one variant of Ti3Ni4 particles. The physical reason for the observed two-way SME is the internal compressive stresses oriented along the $[\bar{1}12]$ directions arising from one variant of Ti3Ni4 particles as a result of aging under tensile stress of 150 MPa. It is established that hydrogen does not influence the TR temperature, reduces the plasticity, and suppresses the two-way SME. The suppression of two-way SME in the $[\bar{1}12]$ single crystals of the Ti-51.3Ni (at.%) alloy with one variant of Ti3Ni4 particles is caused by shielding of stress fields from one variant of Ti3Ni4 particles and multiple nucleation of R- and B19' martensite variants under loading with saturation by hydrogen.

  5. Houston Methodist Variant Viewer: An Application to Support Clinical Laboratory Interpretation of Next-generation Sequencing Data for Cancer

    PubMed Central

    Christensen, Paul A.; Ni, Yunyun; Bao, Feifei; Hendrickson, Heather L.; Greenwood, Michael; Thomas, Jessica S.; Long, S. Wesley; Olsen, Randall J.

    2017-01-01

    Introduction: Next-generation-sequencing (NGS) is increasingly used in clinical and research protocols for patients with cancer. NGS assays are routinely used in clinical laboratories to detect mutations bearing on cancer diagnosis, prognosis and personalized therapy. A typical assay may interrogate 50 or more gene targets that encompass many thousands of possible gene variants. Analysis of NGS data in cancer is a labor-intensive process that can become overwhelming to the molecular pathologist or research scientist. Although commercial tools for NGS data analysis and interpretation are available, they are often costly, lack key functionality or cannot be customized by the end user. Methods: To facilitate NGS data analysis in our clinical molecular diagnostics laboratory, we created a custom bioinformatics tool termed Houston Methodist Variant Viewer (HMVV). HMVV is a Java-based solution that integrates sequencing instrument output, bioinformatics analysis, storage resources and end user interface. Results: Compared to the predicate method used in our clinical laboratory, HMVV markedly simplifies the bioinformatics workflow for the molecular technologist and facilitates the variant review by the molecular pathologist. Importantly, HMVV reduces time spent researching the biological significance of the variants detected, standardizes the online resources used to perform the variant investigation and assists generation of the annotated report for the electronic medical record. HMVV also maintains a searchable variant database, including the variant annotations generated by the pathologist, which is useful for downstream quality improvement and research projects. Conclusions: HMVV is a clinical grade, low-cost, feature-rich, highly customizable platform that we have made available for continued development by the pathology informatics community. PMID:29226007

  6. MERRF Classification: Implications for Diagnosis and Clinical Trials.

    PubMed

    Finsterer, Josef; Zarrouk-Mahjoub, Sinda; Shoffner, John M

    2018-03-01

    Given the etiologic heterogeneity of disease classification using clinical phenomenology, we employed contemporary criteria to classify variants associated with myoclonic epilepsy with ragged-red fibers (MERRF) syndrome and to assess the strength of evidence of gene-disease associations. Standardized approaches are used to clarify the definition of MERRF, which is essential for patient diagnosis, patient classification, and clinical trial design. Systematic literature and database search with application of standardized assessment of gene-disease relationships using modified Smith criteria and of variants reported to be associated with MERRF using modified Yarham criteria. Review of available evidence supports a gene-disease association for two MT-tRNAs and for POLG. Using modified Smith criteria, definitive evidence of a MERRF gene-disease association is identified for MT-TK. Strong gene-disease evidence is present for MT-TL1 and POLG. Functional assays that directly associate variants with oxidative phosphorylation impairment were critical to mtDNA variant classification. In silico analysis was of limited utility to the assessment of individual MT-tRNA variants. With the use of contemporary classification criteria, several mtDNA variants previously reported as pathogenic or possibly pathogenic are reclassified as neutral variants. MERRF is primarily an MT-TK disease, with pathogenic variants in this gene accounting for ~90% of MERRF patients. Although MERRF is phenotypically and genotypically heterogeneous, myoclonic epilepsy is the clinical feature that distinguishes MERRF from other categories of mitochondrial disorders. Given its low frequency in mitochondrial disorders, myoclonic epilepsy is not explained simply by an impairment of cellular energetics. Although MERRF phenocopies can occur in other genes, additional data are needed to establish a MERRF disease-gene association. This approach to MERRF emphasizes standardized classification rather than clinical phenomenology, thus improving patient diagnosis and clinical trial design. Copyright © 2017 Elsevier Inc. All rights reserved.

  7. Low Frequency Variants, Collapsed Based on Biological Knowledge, Uncover Complexity of Population Stratification in 1000 Genomes Project Data

    PubMed Central

    Moore, Carrie B.; Wallace, John R.; Wolfe, Daniel J.; Frase, Alex T.; Pendergrass, Sarah A.; Weiss, Kenneth M.; Ritchie, Marylyn D.

    2013-01-01

    Analyses investigating low frequency variants have the potential for explaining additional genetic heritability of many complex human traits. However, the natural frequencies of rare variation between human populations strongly confound genetic analyses. We have applied a novel collapsing method to identify biological features with low frequency variant burden differences in thirteen populations sequenced by the 1000 Genomes Project. Our flexible collapsing tool utilizes expert biological knowledge from multiple publicly available database sources to direct feature selection. Variants were collapsed according to genetically driven features, such as evolutionary conserved regions, regulatory regions, genes, and pathways. We have conducted an extensive comparison of low frequency variant burden differences (MAF<0.03) between populations from 1000 Genomes Project Phase I data. We found that on average 26.87% of gene bins, 35.47% of intergenic bins, 42.85% of pathway bins, 14.86% of ORegAnno regulatory bins, and 5.97% of evolutionary conserved regions show statistically significant differences in low frequency variant burden across populations from the 1000 Genomes Project. The proportion of bins with significant differences in low frequency burden depends on the ancestral similarity of the two populations compared and types of features tested. Even closely related populations had notable differences in low frequency burden, but fewer differences than populations from different continents. Furthermore, conserved or functionally relevant regions had fewer significant differences in low frequency burden than regions under less evolutionary constraint. This degree of low frequency variant differentiation across diverse populations and feature elements highlights the critical importance of considering population stratification in the new era of DNA sequencing and low frequency variant genomic analyses. PMID:24385916
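
    The collapsing step described above can be sketched as follows: variants below the MAF cutoff are grouped into feature bins, and each sample receives a per-bin burden count. This is a rough illustration only. The function name, the data layout, and the toy variants, bins, and genotypes are all hypothetical, and the statistical comparison between populations performed by the authors' tool is not shown.

```python
from collections import defaultdict

MAF_THRESHOLD = 0.03  # low-frequency cutoff quoted in the abstract (MAF < 0.03)


def collapse_burden(variants, variant_to_bins, genotypes):
    """Per-sample count of minor alleles at low-frequency variants, per bin.

    variants:        {variant_id: minor allele frequency}
    variant_to_bins: {variant_id: [bin names]}, e.g. genes or pathways
    genotypes:       {sample_id: {variant_id: minor-allele count 0, 1 or 2}}
    """
    burden = {sample: defaultdict(int) for sample in genotypes}
    for variant_id, maf in variants.items():
        if maf >= MAF_THRESHOLD:
            continue  # common variant: left out of the collapsed bins
        for sample, calls in genotypes.items():
            count = calls.get(variant_id, 0)
            if count:
                for bin_name in variant_to_bins.get(variant_id, []):
                    burden[sample][bin_name] += count
    return {sample: dict(bins) for sample, bins in burden.items()}


# Toy input, invented purely for illustration (not 1000 Genomes data).
toy_variants = {"v1": 0.01, "v2": 0.20, "v3": 0.02}
toy_bins = {"v1": ["GENE_A"], "v2": ["GENE_A"], "v3": ["GENE_A", "PATHWAY_X"]}
toy_genotypes = {
    "s1": {"v1": 1, "v2": 2, "v3": 0},
    "s2": {"v1": 0, "v3": 2},
}
toy_burden = collapse_burden(toy_variants, toy_bins, toy_genotypes)
```

    A variant may belong to several bins at once (a gene and a pathway, for example), which is why the mapping holds a list of bin names per variant.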

  8. Orientation Modeling for Amateur Cameras by Matching Image Line Features and Building Vector Data

    NASA Astrophysics Data System (ADS)

    Hung, C. H.; Chang, W. C.; Chen, L. C.

    2016-06-01

    With the popularity of geospatial applications, database updating is becoming increasingly important because the environment changes over time. Imagery provides a low-cost and efficient way to update the database. Three-dimensional objects can be measured by space intersection using conjugate image points and the orientation parameters of cameras. However, precise orientation parameters are not always available for light amateur cameras, because precision GPS and IMU units are costly and heavy. To automate data updating, the correspondence between object vector data and the image may be built to improve the accuracy of direct georeferencing. This study contains four major parts: (1) back-projection of object vector data, (2) extraction of image feature lines, (3) object-image feature line matching, and (4) line-based orientation modeling. In order to construct the correspondence of features between an image and a building model, the building vector features were back-projected onto the image using the initial camera orientation from GPS and IMU. Image line features were extracted from the imagery. Afterwards, the matching procedure was done by assessing the similarity between the extracted image features and the back-projected ones. Then, the fourth part utilized line features in orientation modeling. The line-based orientation modeling was performed by the integration of line parametric equations into collinearity condition equations. The experimental data included images with 0.06 m resolution acquired by a Canon EOS 5D Mark II camera on a Microdrones MD4-1000 UAV. Experimental results indicate that 2.1 pixel accuracy may be reached, which is equivalent to 0.12 m in the object space.

  9. Identification of a novel valosin-containing protein polymorphism in late-onset Alzheimer's disease.

    PubMed

    Kaleem, M; Zhao, A; Hamshere, M; Myers, A J

    2007-01-01

    Recently, mutations in the valosin-containing protein gene (VCP) were found to be causative for a rare form of dementia [Watts GDJ, et al.: Nat Genet 2004;36:377-381]. This gene lies within a region on the genome that has been linked to late onset Alzheimer's disease (LOAD) [Myers A, et al.: Am J Med Genet 2002;114:233-242]. In this study, we investigated whether variation within VCP could account for the LOAD linkage peak on chromosome 9. We sequenced 188 individuals from the set of sibling pairs we had used to obtain the linkage results for chromosome 9 to look for novel polymorphisms that could explain the linkage signal. Any variant that was found was then typed in 2 additional sets of neuropathologically confirmed samples to look for associations with Alzheimer's disease. We found 2 variants when we sequenced VCP. One was a novel rare variant (R92H) and the other is already reported within the publicly available databases (rs10972300). Neither explained the chromosome 9 linkage signal for LOAD. We have found a novel rare variant within the VCP gene, but we did not find a variant that could explain the linkage signal for LOAD on chromosome 9. Copyright (c) 2007 S. Karger AG, Basel.

  10. Lost and forgotten? Orientation versus memory in Alzheimer's disease and frontotemporal dementia.

    PubMed

    Yew, Belinda; Alladi, Suvarna; Shailaja, Mekala; Hodges, John R; Hornberger, Michael

    2013-01-01

    Recent studies suggest that significant memory problems are not specific to Alzheimer's disease (AD) but can be also observed in other neurodegenerative conditions, such as behavioral variant frontotemporal dementia (bvFTD). We investigated whether orientation (spatial & temporal) information is a better diagnostic marker for AD compared to memory and whether the atrophy correlates of orientation and memory differ. A large sample (n = 190) of AD patients (n = 73), bvFTD patients (n = 54), and healthy controls (n = 63) underwent testing. A subset of the patients (n = 72) underwent structural imaging using voxel-based morphometry analysis of magnetic resonance brain imaging. Orientation and memory scores from the Addenbrooke's Cognitive Examination showed that AD patients had impaired orientation and memory, while bvFTD patients performed at control level for orientation but had impaired memory. A logistic regression showed that 78% of patients could be classified on the basis of orientation and memory scores alone at clinic presentation. Voxel-based morphometry analysis was conducted using orientation and memory scores as covariates, which showed that the neural correlates for orientation and memory also dissociated, with posterior hippocampus cortex being related to orientation in AD, while the anterior hippocampus was associated with memory performance in the AD and bvFTD patients. Orientation and memory measures discriminate AD and bvFTD to a high degree and tap into different hippocampal regions. Disorientation and posterior hippocampal atrophy therefore appear specific to AD and will allow clinicians to discriminate AD patients from other neurodegenerative conditions with similar memory deficits at clinic presentation.

  11. Single nucleotide variants and InDels identified from whole-genome re-sequencing of Guzerat, Gyr, Girolando and Holstein cattle breeds.

    PubMed

    Stafuzza, Nedenia Bonvino; Zerlotini, Adhemar; Lobo, Francisco Pereira; Yamagishi, Michel Eduardo Beleza; Chud, Tatiane Cristina Seleguim; Caetano, Alexandre Rodrigues; Munari, Danísio Prado; Garrick, Dorian J; Machado, Marco Antonio; Martins, Marta Fonseca; Carvalho, Maria Raquel; Cole, John Bruce; Barbosa da Silva, Marcos Vinicius Gualberto

    2017-01-01

    Whole-genome re-sequencing, alignment and annotation analyses were undertaken for 12 sires representing four important cattle breeds in Brazil: Guzerat (multi-purpose), Gyr, Girolando and Holstein (dairy production). A total of approximately 4.3 billion reads from an Illumina HiSeq 2000 sequencer generated for each animal 10.7 to 16.4-fold genome coverage. A total of 27,441,279 single nucleotide variations (SNVs) and 3,828,041 insertions/deletions (InDels) were detected in the samples, of which 2,557,670 SNVs and 883,219 InDels were novel. The submission of these genetic variants to the dbSNP database significantly increased the number of known variants, particularly for the indicine genome. The concordance rate between genotypes obtained using the Bovine HD BeadChip array and the same variants identified by sequencing was about 99.05%. The annotation of variants identified numerous non-synonymous SNVs and frameshift InDels which could affect phenotypic variation. Functional enrichment analysis was performed and revealed that variants in the olfactory transduction pathway were overrepresented in all four cattle breeds, while the ECM-receptor interaction pathway was overrepresented in Girolando and Guzerat breeds, the ABC transporters pathway was overrepresented only in Holstein breed, and the metabolic pathways were overrepresented only in Gyr breed. The genetic variants discovered here provide a rich resource to help identify potential genomic markers and their associated molecular mechanisms that impact economically important traits for Gyr, Girolando, Guzerat and Holstein breeding programs.

  12. Single nucleotide variants and InDels identified from whole-genome re-sequencing of Guzerat, Gyr, Girolando and Holstein cattle breeds

    PubMed Central

    Lobo, Francisco Pereira; Yamagishi, Michel Eduardo Beleza; Chud, Tatiane Cristina Seleguim; Caetano, Alexandre Rodrigues; Munari, Danísio Prado; Garrick, Dorian J.; Machado, Marco Antonio; Martins, Marta Fonseca; Carvalho, Maria Raquel; Cole, John Bruce; Barbosa da Silva, Marcos Vinicius Gualberto

    2017-01-01

    Whole-genome re-sequencing, alignment and annotation analyses were undertaken for 12 sires representing four important cattle breeds in Brazil: Guzerat (multi-purpose), Gyr, Girolando and Holstein (dairy production). A total of approximately 4.3 billion reads from an Illumina HiSeq 2000 sequencer generated for each animal 10.7 to 16.4-fold genome coverage. A total of 27,441,279 single nucleotide variations (SNVs) and 3,828,041 insertions/deletions (InDels) were detected in the samples, of which 2,557,670 SNVs and 883,219 InDels were novel. The submission of these genetic variants to the dbSNP database significantly increased the number of known variants, particularly for the indicine genome. The concordance rate between genotypes obtained using the Bovine HD BeadChip array and the same variants identified by sequencing was about 99.05%. The annotation of variants identified numerous non-synonymous SNVs and frameshift InDels which could affect phenotypic variation. Functional enrichment analysis was performed and revealed that variants in the olfactory transduction pathway were overrepresented in all four cattle breeds, while the ECM-receptor interaction pathway was overrepresented in Girolando and Guzerat breeds, the ABC transporters pathway was overrepresented only in Holstein breed, and the metabolic pathways were overrepresented only in Gyr breed. The genetic variants discovered here provide a rich resource to help identify potential genomic markers and their associated molecular mechanisms that impact economically important traits for Gyr, Girolando, Guzerat and Holstein breeding programs. PMID:28323836

  13. Efficient analysis of mouse genome sequences reveal many nonsense variants

    PubMed Central

    Steeland, Sophie; Timmermans, Steven; Van Ryckeghem, Sara; Hulpiau, Paco; Saeys, Yvan; Van Montagu, Marc; Vandenbroucke, Roosmarijn E.; Libert, Claude

    2016-01-01

    Genetic polymorphisms in coding genes play an important role when using mouse inbred strains as research models. They have been shown to influence research results, explain phenotypical differences between inbred strains, and increase the amount of interesting gene variants present in the many available inbred lines. SPRET/Ei is an inbred strain derived from Mus spretus that has ∼1% sequence difference with the C57BL/6J reference genome. We obtained a listing of all SNPs and insertions/deletions (indels) present in SPRET/Ei from the Mouse Genomes Project (Wellcome Trust Sanger Institute) and processed these data to obtain an overview of all transcripts having nonsynonymous coding sequence variants. We identified 8,883 unique variants affecting 10,096 different transcripts from 6,328 protein-coding genes, which is about 28% of all coding genes. Because only a subset of these variants results in drastic changes in proteins, we focused on variations that are nonsense mutations that ultimately resulted in a gain of a stop codon. These genes were identified by in silico changing the C57BL/6J coding sequences to the SPRET/Ei sequences, converting them to amino acid (AA) sequences, and comparing the AA sequences. All variants and transcripts affected were also stored in a database, which can be browsed using a SPRET/Ei M. spretus variants web tool (www.spretus.org), including a manual. We validated the tool by demonstrating the loss of function of three proteins predicted to be severely truncated, namely Fas, IRAK2, and IFNγR1. PMID:27147605
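
    The in silico step described above (substitute the strain's bases into the reference coding sequence, translate both, and compare the proteins) can be sketched roughly as follows. This is a minimal illustration, not the authors' pipeline: the function names, the 0-based position convention, and the toy sequence are all assumptions, and only single-base substitutions are handled.

```python
# Standard genetic code, built from the canonical TCAG codon ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def apply_snvs(cds, snvs):
    """Return a copy of cds with (0-based position, alt base) substitutions applied."""
    seq = list(cds)
    for pos, alt in snvs:
        seq[pos] = alt
    return "".join(seq)


def gains_stop(ref_cds, snvs):
    """True if the substituted sequence encodes a shorter protein than the reference."""
    return len(translate(apply_snvs(ref_cds, snvs))) < len(translate(ref_cds))


# Hypothetical toy coding sequence (Met-Trp-Lys-stop), not real mouse data.
reference = "ATGTGGAAATAA"
print(gains_stop(reference, [(5, "A")]))  # TGG -> TGA introduces a stop codon
print(gains_stop(reference, [(8, "G")]))  # AAA -> AAG is synonymous
```

    Comparing protein lengths rather than individual codons keeps the check valid when several variants fall in the same transcript, which is one plausible reason for translating the whole sequence.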

  14. Improved genetic counseling in Alport syndrome by new variants of COL4A5 gene.

    PubMed

    Fernandez-Rosado, Francisco; Campos, Ana; Alvarez-Cubero, Maria Jesus; Ruiz, Ana; Entrala-Bernal, Carmen

    2015-07-01

    Genetic databases are currently needed to offer better genetic assistance to patients with certain syndromes, especially those with X-linked inheritance patterns (such as Alport syndrome), because of the high probability of having descendants affected by the disease. We describe the first reported case of the COL4A5 missense mutation c.1499 G>T in a 16-year-old girl confirmed to be affected by Alport syndrome after genetic counseling. Next-generation sequencing allowed this mutation to be discovered and an accurate clinical treatment to be offered to this patient. Current scientific understanding of genetic syndromes suggests the high importance of updated databases and of including variants of unknown significance related to clinical cases. Such updating could give patients a better opportunity for diagnosis and for genetic and clinical counseling. This is even more important for women planning to start a family, who need correct genetic counseling regarding the risk posed to offspring and to inform the decision to undergo prenatal testing. © 2015 Asian Pacific Society of Nephrology.

  15. sapFinder: an R/Bioconductor package for detection of variant peptides in shotgun proteomics experiments.

    PubMed

    Wen, Bo; Xu, Shaohang; Sheynkman, Gloria M; Feng, Qiang; Lin, Liang; Wang, Quanhui; Xu, Xun; Wang, Jun; Liu, Siqi

    2014-11-01

    Single nucleotide variations (SNVs) located within a reading frame can result in single amino acid polymorphisms (SAPs), leading to alteration of the corresponding amino acid sequence as well as function of a protein. Accurate detection of SAPs is an important issue in proteomic analysis at the experimental and bioinformatic level. Herein, we present sapFinder, an R software package, for detection of the variant peptides based on tandem mass spectrometry (MS/MS)-based proteomics data. This package automates the construction of variation-associated databases from public SNV repositories or sample-specific next-generation sequencing (NGS) data and the identification of SAPs through database searching, post-processing and generation of HTML-based report with visualized interface. sapFinder is implemented as a Bioconductor package in R. The package and the vignette can be downloaded at http://bioconductor.org/packages/devel/bioc/html/sapFinder.html and are provided under a GPL-2 license. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  16. Robust Optical Recognition of Cursive Pashto Script Using Scale, Rotation and Location Invariant Approach

    PubMed Central

    Ahmad, Riaz; Naz, Saeeda; Afzal, Muhammad Zeshan; Amin, Sayed Hassan; Breuel, Thomas

    2015-01-01

    The presence of a large number of unique shapes called ligatures in cursive languages, along with variations due to scaling, orientation and location provides one of the most challenging pattern recognition problems. Recognition of the large number of ligatures is often a complicated task in oriental languages such as Pashto, Urdu, Persian and Arabic. Research on cursive script recognition often ignores the fact that scaling, orientation, location and font variations are common in printed cursive text. Therefore, these variations are not included in image databases and in experimental evaluations. This research uncovers challenges faced by Arabic cursive script recognition in a holistic framework by considering Pashto as a test case, because Pashto language has larger alphabet set than Arabic, Persian and Urdu. A database containing 8000 images of 1000 unique ligatures having scaling, orientation and location variations is introduced. In this article, a feature space based on scale invariant feature transform (SIFT) along with a segmentation framework has been proposed for overcoming the above mentioned challenges. The experimental results show a significantly improved performance of proposed scheme over traditional feature extraction techniques such as principal component analysis (PCA). PMID:26368566

  17. Genovar: a detection and visualization tool for genomic variants.

    PubMed

    Jung, Kwang Su; Moon, Sanghoon; Kim, Young Jin; Kim, Bong-Jo; Park, Kiejung

    2012-05-08

    Along with single nucleotide polymorphisms (SNPs), copy number variation (CNV) is considered an important source of genetic variation associated with disease susceptibility. Despite the importance of CNV, the tools currently available for its analysis often produce false positive results due to limitations such as low resolution of array platforms, platform specificity, and the type of CNV. To resolve this problem, spurious signals must be separated from true signals by visual inspection. None of the previously reported CNV analysis tools support this function and the simultaneous visualization of comparative genomic hybridization arrays (aCGH) and sequence alignment. The purpose of the present study was to develop a useful program for the efficient detection and visualization of CNV regions that enables the manual exclusion of erroneous signals. A JAVA-based stand-alone program called Genovar was developed. To ascertain whether a detected CNV region is a novel variant, Genovar compares the detected CNV regions with previously reported CNV regions using the Database of Genomic Variants (DGV, http://projects.tcag.ca/variation) and the Single Nucleotide Polymorphism Database (dbSNP). The current version of Genovar is capable of visualizing genomic data from sources such as the aCGH data file and sequence alignment format files. Genovar is freely accessible and provides a user-friendly graphic user interface (GUI) to facilitate the detection of CNV regions. The program also provides comprehensive information to help in the elimination of spurious signals by visual inspection, making Genovar a valuable tool for reducing false positive CNV results. http://genovar.sourceforge.net/.
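
    The novelty check described above, comparing a detected CNV region against previously reported regions, comes down to an interval-overlap test. The sketch below is only an illustration of that idea: the `Region` type, the half-open coordinate convention, and the 50% coverage threshold are assumptions and are not taken from Genovar itself.

```python
from typing import NamedTuple


class Region(NamedTuple):
    chrom: str
    start: int  # inclusive
    end: int    # exclusive


def overlap_fraction(query, known):
    """Fraction of the query region covered by the known region."""
    if query.chrom != known.chrom:
        return 0.0
    shared = min(query.end, known.end) - max(query.start, known.start)
    return max(shared, 0) / (query.end - query.start)


def is_novel(query, known_regions, min_fraction=0.5):
    """Treat a CNV as novel if no known region covers at least min_fraction of it."""
    return all(overlap_fraction(query, k) < min_fraction for k in known_regions)


# Hypothetical coordinates standing in for previously reported CNV regions.
known = [Region("chr1", 1000, 5000)]
print(is_novel(Region("chr1", 4000, 6000), known))  # half covered, so not novel
print(is_novel(Region("chr1", 4500, 6500), known))  # a quarter covered, so novel
```

    A linear scan is enough for a handful of regions; for a genome-wide catalogue, sorting the known regions or using an interval tree would be the usual next step.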

  18. SSTAR, a Stand-Alone Easy-To-Use Antimicrobial Resistance Gene Predictor.

    PubMed

    de Man, Tom J B; Limbago, Brandi M

    2016-01-01

    We present the easy-to-use Sequence Search Tool for Antimicrobial Resistance, SSTAR. It combines a locally executed BLASTN search against a customizable database with an intuitive graphical user interface for identifying antimicrobial resistance (AR) genes from genomic data. Although the database is initially populated from a public repository of acquired resistance determinants (i.e., ARG-ANNOT), it can be customized for particular pathogen groups and resistance mechanisms. For instance, outer membrane porin sequences associated with carbapenem resistance phenotypes can be added, and known intrinsic mechanisms can be included. Unique about this tool is the ability to easily detect putative new alleles and truncated versions of existing AR genes. Variants and potential new alleles are brought to the attention of the user for further investigation. For instance, SSTAR is able to identify modified or truncated versions of porins, which may be of great importance in carbapenemase-negative carbapenem-resistant Enterobacteriaceae. SSTAR is written in Java and is therefore platform independent and compatible with both Windows and Unix operating systems. SSTAR and its manual, which includes a simple installation guide, are freely available from https://github.com/tomdeman-bio/Sequence-Search-Tool-for-Antimicrobial-Resistance-SSTAR-. IMPORTANCE Whole-genome sequencing (WGS) is quickly becoming a routine method for identifying genes associated with antimicrobial resistance (AR). However, for many microbiologists, the use and analysis of WGS data present a substantial challenge. We developed SSTAR, software with a graphical user interface that enables the identification of known AR genes from WGS and has the unique capacity to easily detect new variants of known AR genes, including truncated protein variants. Current software solutions do not notify the user when genes are truncated and, therefore, likely nonfunctional, which makes phenotype predictions less accurate. 
    SSTAR users can apply any AR database of interest as a reference comparator and can manually add genes that impact resistance, even if such genes are not resistance determinants per se (e.g., porins and efflux pumps).

  19. BTKbase, mutation database for X-linked agammaglobulinemia (XLA).

    PubMed Central

    Vihinen, M; Brandau, O; Brandén, L J; Kwan, S P; Lappalainen, I; Lester, T; Noordzij, J G; Ochs, H D; Ollila, J; Pienaar, S M; Riikonen, P; Saha, B K; Smith, C I

    1998-01-01

    X-linked agammaglobulinemia (XLA) is an immunodeficiency caused by mutations in the gene coding for Bruton's agammaglobulinemia tyrosine kinase (BTK). A database (BTKbase) of BTK mutations has been compiled and the recent update lists 463 mutation entries from 406 unrelated families showing 303 unique molecular events. In addition to mutations, the database also lists variants or polymorphisms. Each patient is given a unique patient identity number (PIN). Information is included regarding the phenotype including symptoms. Mutations in all the five domains of BTK have been noticed to cause the disease, the most common event being missense mutations. The mutations appear almost uniformly throughout the molecule and frequently affect CpG sites that code for arginine residues. The putative structural implications of all the missense mutations are given in the database. The improved version of the registry having a number of new features is available at http://www.helsinki.fi/science/signal/btkbase.html PMID:9399844

  20. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Chen, Tianyi; Tan, Lizhen; Lu, Zizhe

    Instrumented nanoindentation was used in this paper to investigate the hardness, elastic modulus, and creep behavior of an austenitic Fe-20Cr-25Ni model alloy at room temperature, with the indented grain orientation being the variant. The samples indented close to the {111} surfaces exhibited the highest hardness and modulus. However, nanoindentation creep tests showed the greatest tendency for creep in the {111} indented samples, compared with the samples indented close to the {001} and {101} surfaces. Scanning electron microscopy and cross-sectional transmission electron microscopy revealed slip bands and dislocations in all samples. The slip band patterns on the indented surfaces were influenced by the grain orientations. Deformation twinning was observed only under the {001} indented surfaces. Finally, microstructural analysis and molecular dynamics modeling correlated the anisotropic nanoindentation-creep behavior with the different dislocation substructures formed during indentation, which resulted from the dislocation reactions of certain active slip systems that are determined by the indented grain orientations.

  1. X-Linked Glomerulopathy Due to COL4A5 Founder Variant.

    PubMed

    Barua, Moumita; John, Rohan; Stella, Lorenzo; Li, Weili; Roslin, Nicole M; Sharif, Bedra; Hack, Saidah; Lajoie-Starkell, Ginette; Schwaderer, Andrew L; Becknell, Brian; Wuttke, Matthias; Köttgen, Anna; Cattran, Daniel; Paterson, Andrew D; Pei, York

    2018-03-01

    Alport syndrome is a rare hereditary disorder caused by rare variants in 1 of 3 genes encoding for type IV collagen. Rare variants in COL4A5 on chromosome Xq22 cause X-linked Alport syndrome, which accounts for ∼80% of the cases. Alport syndrome has a variable clinical presentation, including progressive kidney failure, hearing loss, and ocular defects. Exome sequencing performed in 2 affected related males with an undefined X-linked glomerulopathy characterized by global and segmental glomerulosclerosis, mesangial hypercellularity, and vague basement membrane immune complex deposition revealed a COL4A5 sequence variant, a substitution of a thymine by a guanine at nucleotide 665 (c.T665G; rs281874761) of the coding DNA predicted to lead to a cysteine to phenylalanine substitution at amino acid 222, which was not seen in databases cataloguing natural human genetic variation, including dbSNP138, 1000 Genomes Project release version 01-11-2004, Exome Sequencing Project 21-06-2014, or ExAC 01-11-2014. Review of the literature identified 2 additional families with the same COL4A5 variant leading to similar atypical histopathologic features, suggesting a unique pathologic mechanism initiated by this specific rare variant. Homology modeling suggests that the substitution alters the structural and dynamic properties of the type IV collagen trimer. Genetic analysis comparing members of the 3 families indicated a distant relationship with a shared haplotype, implying a founder effect. Crown Copyright © 2017. Published by Elsevier Inc. All rights reserved.

  2. Broad phenotypes in heterozygous NR5A1 46,XY patients with a disorder of sex development: an oligogenic origin?

    PubMed

    Camats, Núria; Fernández-Cancio, Mónica; Audí, Laura; Schaller, André; Flück, Christa E

    2018-06-11

    SF-1/NR5A1 is a transcriptional regulator of adrenal and gonadal development. NR5A1 disease-causing variants cause disorders of sex development (DSD) and adrenal failure, but most affected individuals show a broad DSD/reproductive phenotype only. Most NR5A1 variants show in vitro pathogenic effects, but not when tested in heterozygote state together with wild-type NR5A1 as usually seen in patients. Thus, the genotype-phenotype correlation for NR5A1 variants remains an unsolved question. We analyzed heterozygous 46,XY SF-1/NR5A1 patients by whole exome sequencing and used an algorithm for data analysis based on selected project-specific DSD- and SF-1-related genes. The variants detected were evaluated for their significance in literature, databases and checked in silico using webtools. We identified 19 potentially deleterious variants (one to seven per patient) in 18 genes in four 46,XY DSD subjects carrying heterozygous NR5A1 disease-causing variants. We constructed a scheme of all these hits within the landscape of currently known genes involved in male sex determination and differentiation. Our results suggest that the broad phenotype in these heterozygous NR5A1 46,XY DSD subjects may well be explained by an oligogenic mode of inheritance, in which multiple hits, individually non-deleterious, may contribute to a DSD phenotype unique to each heterozygous SF-1/NR5A1 individual.

  3. Outcomes of Technical Variant Liver Transplantation versus Whole Liver Transplantation for Pediatric Patients: A Meta-Analysis.

    PubMed

    Ye, Hui; Zhao, Qiang; Wang, Yufang; Wang, Dongping; Zheng, Zhouying; Schroder, Paul Michael; Lu, Yao; Kong, Yuan; Liang, Wenhua; Shang, Yushu; Guo, Zhiyong; He, Xiaoshun

    2015-01-01

    To overcome the shortage of appropriate-sized whole liver grafts for children, technical variant liver transplantation has been practiced for decades. We perform a meta-analysis to compare the survival rates and incidence of surgical complications between pediatric whole liver transplantation and technical variant liver transplantation. To identify relevant studies up to January 2014, we searched PubMed/Medline, Embase, and Cochrane library databases. The primary outcomes measured were patient and graft survival rates, and the secondary outcomes were the incidence of surgical complications. The outcomes were pooled using a fixed-effects model or random-effects model. The one-year, three-year, five-year patient survival rates and one-year, three-year graft survival rates were significantly higher in whole liver transplantation than technical variant liver transplantation (OR = 1.62, 1.90, 1.65, 1.78, and 1.62, respectively, p<0.05). There was no significant difference in five-year graft survival rate between the two groups (OR = 1.47, p = 0.10). The incidence of portal vein thrombosis and biliary complications were significantly lower in the whole liver transplantation group (OR = 0.45 and 0.42, both p<0.05). The incidence of hepatic artery thrombosis was comparable between the two groups (OR = 1.21, p = 0.61). Pediatric whole liver transplantation is associated with better outcomes than technical variant liver transplantation. Continuing efforts should be made to minimize surgical complications to improve the outcomes of technical variant liver transplantation.
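
    The pooling step mentioned above can be illustrated with a fixed-effect calculation. The abstract does not say which fixed-effect estimator was used, so the inverse-variance method below is an assumption, as are the function names and the 2x2 counts, which are invented purely for demonstration and are not data from the meta-analysis.

```python
import math


def log_odds_ratio(a, b, c, d):
    """Log odds ratio and its variance for a 2x2 table.

    a, b: events and non-events in group 1; c, d: events and non-events in group 2.
    """
    return math.log((a * d) / (b * c)), 1 / a + 1 / b + 1 / c + 1 / d


def pooled_odds_ratio(tables):
    """Inverse-variance fixed-effect pooled odds ratio with a 95% confidence interval."""
    weights, weighted = 0.0, 0.0
    for table in tables:
        log_or, var = log_odds_ratio(*table)
        weights += 1 / var          # each study is weighted by its precision
        weighted += log_or / var
    pooled = weighted / weights
    half_width = 1.96 * math.sqrt(1 / weights)
    return math.exp(pooled), math.exp(pooled - half_width), math.exp(pooled + half_width)


# Hypothetical studies: (survivors, deaths) for group 1, then (survivors, deaths) for group 2.
studies = [(90, 10, 80, 20), (45, 5, 40, 10), (180, 20, 170, 30)]
estimate, lower, upper = pooled_odds_ratio(studies)
print(f"pooled OR = {estimate:.2f} (95% CI {lower:.2f} to {upper:.2f})")
```

    A random-effects model would add a between-study variance term to each study's variance before weighting; the fixed-effect form shown here assumes all studies estimate one common odds ratio.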

  4. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Golbus, Jessica R.; Puckelwartz, Megan J.; Dellefave-Castillo, Lisa

    Background: Cardiomyopathy is highly heritable but genetically diverse. At present, genetic testing for cardiomyopathy uses targeted sequencing to simultaneously assess the coding regions of more than 50 genes. New genes are routinely added to panels to improve the diagnostic yield. With the anticipated $1000 genome, it is expected that genetic testing will shift towards comprehensive genome sequencing accompanied by targeted gene analysis. Therefore, we assessed the reliability of whole genome sequencing and targeted analysis to identify cardiomyopathy variants in 11 subjects with cardiomyopathy. Methods and Results: Whole genome sequencing with an average of 37× coverage was combined with targeted analysis focused on 204 genes linked to cardiomyopathy. Genetic variants were scored using multiple prediction algorithms combined with frequency data from public databases. This pipeline yielded 1-14 potentially pathogenic variants per individual. Variants were further analyzed using clinical criteria and/or segregation analysis. Three of three previously identified primary mutations were detected by this analysis. In six subjects for whom the primary mutation was previously unknown, we identified mutations that segregated with disease, had clinical correlates, and/or had additional pathological correlation to provide evidence for causality. For two subjects with previously known primary mutations, we identified additional variants that may act as modifiers of disease severity. In total, we identified the likely pathological mutation in 9 of 11 (82%) subjects. We conclude that these pilot data demonstrate that ~30-40× coverage whole genome sequencing combined with targeted analysis is feasible and sensitive to identify rare variants in cardiomyopathy-associated genes.

  5. Heterozygous RFX6 protein truncating variants are associated with MODY with reduced penetrance.

    PubMed

    Patel, Kashyap A; Kettunen, Jarno; Laakso, Markku; Stančáková, Alena; Laver, Thomas W; Colclough, Kevin; Johnson, Matthew B; Abramowicz, Marc; Groop, Leif; Miettinen, Päivi J; Shepherd, Maggie H; Flanagan, Sarah E; Ellard, Sian; Inagaki, Nobuya; Hattersley, Andrew T; Tuomi, Tiinamaija; Cnop, Miriam; Weedon, Michael N

    2017-10-12

    Finding new causes of monogenic diabetes helps understand glycaemic regulation in humans. To find novel genetic causes of maturity-onset diabetes of the young (MODY), we sequenced MODY cases with unknown aetiology and compared variant frequencies to large public databases. From 36 European patients, we identify two probands with novel RFX6 heterozygous nonsense variants. RFX6 protein truncating variants are enriched in the MODY discovery cohort compared to the European control population within ExAC (odds ratio = 131, P = 1 × 10⁻⁴). We find similar results in non-Finnish European (n = 348, odds ratio = 43, P = 5 × 10⁻⁵) and Finnish (n = 80, odds ratio = 22, P = 1 × 10⁻⁶) replication cohorts. RFX6 heterozygotes have reduced penetrance of diabetes compared to common HNF1A and HNF4A-MODY mutations (27, 70 and 55% at 25 years of age, respectively). The hyperglycaemia results from beta-cell dysfunction and is associated with lower fasting and stimulated gastric inhibitory polypeptide (GIP) levels. Our study demonstrates that heterozygous RFX6 protein truncating variants are associated with MODY with reduced penetrance. Maturity-onset diabetes of the young (MODY) is the most common subtype of familial diabetes. Here, Patel et al. use targeted DNA sequencing of MODY patients and large-scale publicly available data to show that RFX6 heterozygous protein truncating variants cause reduced penetrance MODY.

  6. Body Image and Eating Disorders Among Lesbian, Gay, Bisexual, and Transgender Youth.

    PubMed

    McClain, Zachary; Peebles, Rebecka

    2016-12-01

    Adolescence is a crucial period for emerging sexual orientation and gender identity and also body image disturbance and disordered eating. Body image distortion and disordered eating are important pediatric problems affecting individuals along the sexual orientation and gender identity spectrum. Lesbian, gay, bisexual, transgender (LGBT) youth are at risk for eating disorders and body dissatisfaction. Disordered eating in LGBT and gender variant youth may be associated with poorer quality of life and mental health outcomes. Pediatricians should know that these problems occur more frequently in LGBT youth. There is evidence that newer treatment paradigms involving family support are more effective than individual models of care. Copyright © 2016 Elsevier Inc. All rights reserved.

  7. Geometrical optical illusionists.

    PubMed

    Wade, Nicholas J

    2014-01-01

    Geometrical optical illusions were given this title by Oppel in 1855. Variants on such small distortions of visual space were illustrated thereafter, many of which bear the names of those who first described them. Some original forms of the geometrical optical illusions are shown together with 'perceptual portraits' of those who described them. These include: Roget, Chevreul, Fick, Zöllner, Poggendorff, Hering, Kundt, Delboeuf, Mach, Helmholtz, Hermann, von Bezold, Müller-Lyer, Lipps, Thiéry, Wundt, Münsterberg, Ebbinghaus, Titchener, Ponzo, Luckiesh, Sander, Ehrenstein, Gregory, Heard, White, Shepard, and Lingelbach. The illusions are grouped under the headings of orientation, size, the combination of size and orientation, and contrast. Early theories of illusions, before geometrical optical illusions were so named, are mentioned briefly.

  8. A Study of the Responses of Individuals with Different Interpersonal Needs with Respect to Variant Forms of Training in Group and Interpersonal Relations.

    ERIC Educational Resources Information Center

    Smallegan, Marian Joyce

    To determine if opinion change might be dependent in part on the interpersonal needs of the participants of seven seminar sections, need level was measured by FIRO-B (Fundamental Interpersonal Relations Orientation) in three areas: inclusion, control, and affection. Three nonresidential groups met weekly for 15 weeks; four residential groups met…

  9. INSaFLU: an automated open web-based bioinformatics suite "from-reads" for influenza whole-genome-sequencing-based surveillance.

    PubMed

    Borges, Vítor; Pinheiro, Miguel; Pechirra, Pedro; Guiomar, Raquel; Gomes, João Paulo

    2018-06-29

    A new era of flu surveillance has already started based on the genetic characterization and exploration of influenza virus evolution at whole-genome scale. Although this has been prioritized by national and international health authorities, the demanded technological transition to whole-genome sequencing (WGS)-based flu surveillance has been particularly delayed by the lack of bioinformatics infrastructures and/or expertise to deal with primary next-generation sequencing (NGS) data. We developed and implemented INSaFLU ("INSide the FLU"), which is the first influenza-oriented bioinformatics free web-based suite that deals with primary NGS data (reads) towards the automatic generation of the output data that are actually the core first-line "genetic requests" for effective and timely influenza laboratory surveillance (e.g., type and sub-type, gene and whole-genome consensus sequences, variants' annotation, alignments and phylogenetic trees). By handling NGS data collected from any amplicon-based schema, the implemented pipeline enables any laboratory to perform multi-step software intensive analyses in a user-friendly manner without previous advanced training in bioinformatics. INSaFLU gives access to user-restricted sample databases and projects management, being a transparent and flexible tool specifically designed to automatically update project outputs as more samples are uploaded. Data integration is thus cumulative and scalable, fitting the need for a continuous epidemiological surveillance during the flu epidemics. Multiple outputs are provided in nomenclature-stable and standardized formats that can be explored in situ or through multiple compatible downstream applications for fine-tuned data analysis. 
    This platform additionally flags samples as "putative mixed infections" if the population admixture enrolls influenza viruses with clearly distinct genetic backgrounds, and enriches the traditional "consensus-based" influenza genetic characterization with relevant data on influenza sub-population diversification through a depth analysis of intra-patient minor variants. This dual approach is expected to strengthen our ability not only to detect the emergence of antigenic and drug resistance variants but also to decode alternative pathways of influenza evolution and to unveil intricate routes of transmission. In summary, INSaFLU supplies public health laboratories and influenza researchers with an open "one size fits all" framework, potentiating the operationalization of a harmonized multi-country WGS-based surveillance for influenza virus. INSaFLU can be accessed through https://insaflu.insa.pt.

  10. fMRI orientation decoding in V1 does not require global maps or globally coherent orientation stimuli.

    PubMed

    Alink, Arjen; Krugliak, Alexandra; Walther, Alexander; Kriegeskorte, Nikolaus

    2013-01-01

    The orientation of a large grating can be decoded from V1 functional magnetic resonance imaging (fMRI) data, even at low resolution (3-mm isotropic voxels). This finding has suggested that columnar-level neuronal information might be accessible to fMRI at 3T. However, orientation decodability might alternatively arise from global orientation-preference maps. Such global maps across V1 could result from bottom-up processing, if the preferences of V1 neurons were biased toward particular orientations (e.g., radial from fixation, or cardinal, i.e., vertical or horizontal). Global maps could also arise from local recurrent or top-down processing, reflecting pre-attentive perceptual grouping, attention spreading, or predictive coding of global form. Here we investigate whether fMRI orientation decoding with 2-mm voxels requires (a) globally coherent orientation stimuli and/or (b) global-scale patterns of V1 activity. We used opposite-orientation gratings (balanced about the cardinal orientations) and spirals (balanced about the radial orientation), along with novel patch-swapped variants of these stimuli. The two stimuli of a patch-swapped pair have opposite orientations everywhere (like their globally coherent parent stimuli). However, the two stimuli appear globally similar, a patchwork of opposite orientations. We find that all stimulus pairs are robustly decodable, demonstrating that fMRI orientation decoding does not require globally coherent orientation stimuli. Furthermore, decoding remained robust after spatial high-pass filtering for all stimuli, showing that fine-grained components of the fMRI patterns reflect visual orientations. Consistent with previous studies, we found evidence for global radial and vertical preference maps in V1. 
However, these were weak or absent for patch-swapped stimuli, suggesting that global preference maps depend on globally coherent orientations and might arise through recurrent or top-down processes related to the perception of global form.

  11. Identifying Mendelian disease genes with the Variant Effect Scoring Tool

    PubMed Central

    2013-01-01

    Background Whole exome sequencing studies identify hundreds to thousands of rare protein coding variants of ambiguous significance for human health. Computational tools are needed to accelerate the identification of specific variants and genes that contribute to human disease. Results We have developed the Variant Effect Scoring Tool (VEST), a supervised machine learning-based classifier, to prioritize rare missense variants with likely involvement in human disease. The VEST classifier training set comprised ~45,000 disease mutations from the latest Human Gene Mutation Database release and another ~45,000 high frequency (allele frequency >1%) putatively neutral missense variants from the Exome Sequencing Project. VEST outperforms some of the most popular methods for prioritizing missense variants in carefully designed holdout benchmarking experiments (VEST ROC AUC = 0.91, PolyPhen2 ROC AUC = 0.86, SIFT4.0 ROC AUC = 0.84). VEST estimates variant score p-values against a null distribution of VEST scores for neutral variants not included in the VEST training set. These p-values can be aggregated at the gene level across multiple disease exomes to rank genes for probable disease involvement. We tested the ability of an aggregate VEST gene score to identify candidate Mendelian disease genes, based on whole-exome sequencing of a small number of disease cases. We used whole-exome data for two Mendelian disorders for which the causal gene is known. Considering only genes that contained variants in all cases, the VEST gene score ranked dihydroorotate dehydrogenase (DHODH) number 2 of 2253 genes in four cases of Miller syndrome, and myosin-3 (MYH3) number 2 of 2313 genes in three cases of Freeman-Sheldon syndrome. Conclusions Our results demonstrate the potential power gain of aggregating bioinformatics variant scores into gene-level scores and the general utility of bioinformatics in assisting the search for disease genes in large-scale exome sequencing studies. 
VEST is available as a stand-alone software package at http://wiki.chasmsoftware.org and is hosted by the CRAVAT web server at http://www.cravat.us PMID:23819870

  12. LymPHOS 2.0: an update of a phosphosite database of primary human T cells

    PubMed Central

    Nguyen, Tien Dung; Vidal-Cortes, Oriol; Gallardo, Oscar; Abian, Joaquin; Carrascal, Montserrat

    2015-01-01

    LymPHOS is a web-oriented database containing peptide and protein sequences and spectrometric information on the phosphoproteome of primary human T-Lymphocytes. Current release 2.0 contains 15 566 phosphorylation sites from 8273 unique phosphopeptides and 4937 proteins, which correspond to a 45-fold increase over the original database description. It now includes quantitative data on phosphorylation changes after time-dependent treatment with activators of the TCR-mediated signal transduction pathway. Sequence data quality has also been improved with the use of multiple search engines for database searching. LymPHOS can be publicly accessed at http://www.lymphos.org. Database URL: http://www.lymphos.org. PMID:26708986

  13. OOMM--Object-Oriented Matrix Modelling: an instrument for the integration of the Brasilia Regional Health Information System.

    PubMed

    Cammarota, M; Huppes, V; Gaia, S; Degoulet, P

    1998-01-01

    The development of Health Information Systems is largely determined by the establishment of the underlying information models. An Object-Oriented Matrix Model (OOMM) is described whose aim is to facilitate the integration of the overall health system. The model is based on information modules named micro-databases that are structured in a three-dimensional network: planning, health structures and information systems. The modelling tool has been developed as a layer on top of a relational database system. A visual browser facilitates the development and maintenance of the information model. The modelling approach has been applied to the Brasilia University Hospital since 1991. The extension of the modelling approach to the Brasilia regional health system is considered.

  14. Integration of a neuroimaging processing pipeline into a pan-canadian computing grid

    NASA Astrophysics Data System (ADS)

    Lavoie-Courchesne, S.; Rioux, P.; Chouinard-Decorte, F.; Sherif, T.; Rousseau, M.-E.; Das, S.; Adalat, R.; Doyon, J.; Craddock, C.; Margulies, D.; Chu, C.; Lyttelton, O.; Evans, A. C.; Bellec, P.

    2012-02-01

    The ethos of the neuroimaging field is quickly moving towards the open sharing of resources, including both imaging databases and processing tools. As a neuroimaging database represents a large volume of datasets and as neuroimaging processing pipelines are composed of heterogeneous, computationally intensive tools, such open sharing raises specific computational challenges. This motivates the design of novel dedicated computing infrastructures. This paper describes an interface between PSOM, a code-oriented pipeline development framework, and CBRAIN, a web-oriented platform for grid computing. This interface was used to integrate a PSOM-compliant pipeline for preprocessing of structural and functional magnetic resonance imaging into CBRAIN. We further tested the capacity of our infrastructure to handle a real large-scale project. A neuroimaging database including close to 1000 subjects was preprocessed using our interface and publicly released to help the participants of the ADHD-200 international competition. This successful experiment demonstrated that our integrated grid-computing platform is a powerful solution for high-throughput pipeline analysis in the field of neuroimaging.

  15. Group-oriented coordination models for distributed client-server computing

    NASA Technical Reports Server (NTRS)

    Adler, Richard M.; Hughes, Craig S.

    1994-01-01

    This paper describes group-oriented control models for distributed client-server interactions. These models transparently coordinate requests for services that involve multiple servers, such as queries across distributed databases. Specific capabilities include: decomposing and replicating client requests; dispatching request subtasks or copies to independent, networked servers; and combining server results into a single response for the client. The control models were implemented by combining request broker and process group technologies with an object-oriented communication middleware tool. The models are illustrated in the context of a distributed operations support application for space-based systems.
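
    The decompose, dispatch and combine steps named in this abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the server names, the in-memory data and the functions query_server and coordinated_query are hypothetical stand-ins, and a thread pool takes the place of the request broker and process group technologies mentioned above.

```python
from concurrent.futures import ThreadPoolExecutor

# Hypothetical stand-ins for independent, networked database servers.
SERVERS = {
    "server_a": [3, 1, 4],
    "server_b": [1, 5, 9],
    "server_c": [2, 6],
}


def query_server(name, threshold):
    """Subtask: return the rows held by one server that exceed the threshold."""
    return [row for row in SERVERS[name] if row > threshold]


def coordinated_query(threshold):
    """Decompose a client request, dispatch the copies, and combine the results."""
    names = sorted(SERVERS)
    # Dispatch one copy of the request to every server concurrently.
    with ThreadPoolExecutor(max_workers=len(names)) as pool:
        partials = list(pool.map(lambda n: query_server(n, threshold), names))
    # Combine the per-server results into a single response for the client.
    return sorted(row for part in partials for row in part)
```

    The client calls coordinated_query once and never addresses an individual server, which is the sense in which the coordination is transparent.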

  16. Development of a Dynamically Configurable, Object-Oriented Framework for Distributed, Multi-modal Computational Aerospace Systems Simulation

    NASA Technical Reports Server (NTRS)

    Afjeh, Abdollah A.; Reed, John A.

    2003-01-01

    The following reports are presented on this project: A first year progress report on: Development of a Dynamically Configurable, Object-Oriented Framework for Distributed, Multi-modal Computational Aerospace Systems Simulation; A second year progress report on: Development of a Dynamically Configurable, Object-Oriented Framework for Distributed, Multi-modal Computational Aerospace Systems Simulation; An Extensible, Interchangeable and Sharable Database Model for Improving Multidisciplinary Aircraft Design; Interactive, Secure Web-enabled Aircraft Engine Simulation Using XML Databinding Integration; and Improving the Aircraft Design Process Using Web-based Modeling and Simulation.

  17. A systematic review and meta-analysis of variations in branching patterns of the adult aortic arch.

    PubMed

    Popieluszko, Patrick; Henry, Brandon Michael; Sanna, Beatrice; Hsieh, Wan Chin; Saganiak, Karolina; Pękala, Przemysław A; Walocha, Jerzy A; Tomaszewski, Krzysztof A

    2018-07-01

    The aortic arch (AA) is the main conduit of the left side of the heart, providing a blood supply to the head, neck, and upper limbs. As it travels through the thorax, the pattern in which it gives off the branches to supply these structures can vary. Variations of these branching patterns have been studied; however, a study providing a comprehensive incidence of these variations has not yet been conducted. The objective of this study was to perform a meta-analysis of all the studies that report prevalence data on AA variants and to provide incidence data on the most common variants. A systematic search of online databases including PubMed, Embase, Scopus, ScienceDirect, Web of Science, SciELO, BIOSIS, and CNKI was performed for literature describing incidence of AA variations in adults. Studies including prevalence data on adult patients or cadavers were collected and their data analyzed. A total of 51 articles were included (N = 23,882 arches). Seven of the most common variants were analyzed. The most common variants found included the classic branching pattern, defined as a brachiocephalic trunk, a left common carotid, and a left subclavian artery (80.9%); the bovine arch variant (13.6%); and the left vertebral artery variant (2.8%). Compared by geographic data, bovine arch variants were noted to have a prevalence as high as 26.8% in African populations. Although patients who have an AA variant are often asymptomatic, they compose a significant portion of the population of patients and pose a greater risk of hemorrhage and ischemia during surgery in the thorax. Because of the possibility of encountering such variants, it is prudent for surgeons to consider potential variations in planning procedures, especially of an endovascular nature, in the thorax. Copyright © 2017 Society for Vascular Surgery. Published by Elsevier Inc. All rights reserved.

  18. Clinical spectrum of KIAA2022 pathogenic variants in males: Case report of two boys with KIAA2022 pathogenic variants and review of the literature.

    PubMed

    Lorenzo, Melissa; Stolte-Dijkstra, Irene; van Rheenen, Patrick; Smith, Ronald Garth; Scheers, Tom; Walia, Jagdeep S

    2018-06-01

    KIAA2022 is an X-linked intellectual disability (XLID) syndrome affecting males more severely than females. Few males with KIAA2022 variants and XLID have been reported. We present a clinical report of two unrelated males, with two nonsense KIAA2022 pathogenic variants, with profound intellectual disabilities, limited language development, strikingly similar autistic behavior, delay in motor milestones, and postnatal growth restriction. Patient 1, 19-years-old, has long ears, deeply set eyes with keratoconus, strabismus, a narrow forehead, anteverted nares, café-au-lait spots, macroglossia, thick vermilion of the upper and lower lips, and prognathism. He has gastroesophageal reflux, constipation with delayed rectosigmoid colonic transit time, difficulty regulating temperature, several musculoskeletal issues, and a history of one grand mal seizure. Patient 2, 10-years-old, has mild dysmorphic features, therapy resistant vomiting with diminished motility of the stomach, mild constipation, cortical visual impairment with intermittent strabismus, axial hypotonia, difficulty regulating temperature, and cutaneous mastocytosis. Genetic testing identified KIAA2022 variant c.652C > T(p.Arg218*) in Patient 1, and a novel nonsense de novo variant c.2707G > T(p.Glu903*) in Patient 2. We also summarized features of all reported males with KIAA2022 variants to date. This report not only adds knowledge of a novel pathogenic variant to the KIAA2022 variant database, but also likely extends the spectrum by describing novel dysmorphic features and medical conditions including macroglossia, café-au-lait spots, keratoconus, severe cutaneous mastocytosis, and motility problems of the GI tract, which may help physicians involved in the care of patients with this syndrome. Lastly, we describe the power of social media in bringing families with rare medical conditions together. © 2018 Wiley Periodicals, Inc.

  19. NCI-60 Whole Exome Sequencing and Pharmacological CellMiner Analyses

    PubMed Central

    Reinhold, William C.; Varma, Sudhir; Sousa, Fabricio; Sunshine, Margot; Abaan, Ogan D.; Davis, Sean R.; Reinhold, Spencer W.; Kohn, Kurt W.; Morris, Joel; Meltzer, Paul S.; Doroshow, James H.; Pommier, Yves

    2014-01-01

    Exome sequencing provides unprecedented insights into cancer biology and pharmacological response. Here we assess these two parameters for the NCI-60, which is among the richest genomic and pharmacological publicly available cancer cell line databases. Homozygous genetic variants that putatively affect protein function were identified in 1,199 genes (approximately 6% of all genes). Variants that are either enriched or depleted compared to non-cancerous genomes, and thus may be influential in cancer progression and differential drug response were identified for 2,546 genes. Potential gene knockouts are made available. Assessment of cell line response to 19,940 compounds, including 110 FDA-approved drugs, reveals ≈80-fold range in resistance versus sensitivity response across cell lines. 103,422 gene variants were significantly correlated with at least one compound (at p<0.0002). These include genes of known pharmacological importance such as IGF1R, BRAF, RAD52, MTOR, STAT2 and TSC2 as well as a large number of candidate genes such as NOM1, TLL2, and XDH. We introduce two new web-based CellMiner applications that enable exploration of variant-to-compound relationships for a broad range of researchers, especially those without bioinformatics support. The first tool, “Genetic variant versus drug visualization”, provides a visualization of significant correlations between drug activity-gene variant combinations. Examples are given for the known vemurafenib-BRAF, and novel ifosfamide-RAD52 pairings. The second, “Genetic variant summation” allows an assessment of cumulative genetic variations for up to 150 combined genes together; and is designed to identify the variant burden for molecular pathways or functional grouping of genes. An example of its use is provided for the EGFR-ERBB2 pathway gene variant data and the identification of correlated EGFR, ERBB2, MTOR, BRAF, MEK and ERK inhibitors. 
The new tools are implemented as an updated web-based CellMiner version, for which the present publication serves as a compendium. PMID:25032700

  20. Mutation analysis of the COL1A1 and COL1A2 genes in Vietnamese patients with osteogenesis imperfecta.

    PubMed

    Ho Duy, Binh; Zhytnik, Lidiia; Maasalu, Katre; Kändla, Ivo; Prans, Ele; Reimann, Ene; Märtson, Aare; Kõks, Sulev

    2016-08-12

    The genetics of osteogenesis imperfecta (OI) have not been studied in a Vietnamese population before. We performed mutational analysis of the COL1A1 and COL1A2 genes in 91 unrelated OI patients of Vietnamese origin. We then systematically characterized the mutation profiles of these two genes which are most commonly related to OI. Genomic DNA was extracted from EDTA-preserved blood according to standard high-salt extraction methods. Sequence analysis and pathogenic variant identification was performed with Mutation Surveyor DNA variant analysis software. Prediction of the pathogenicity of mutations was conducted using Alamut Visual software. The presence of variants was checked against Dalgleish's osteogenesis imperfecta mutation database. The sample consisted of 91 unrelated osteogenesis imperfecta patients. We identified 54 patients with COL1A1/2 pathogenic variants; 33 with COL1A1 and 21 with COL1A2. Two patients had multiple pathogenic variants. Seventeen novel COL1A1 and 10 novel COL1A2 variants were identified. The majority of identified COL1A1/2 pathogenic variants occurred in a glycine substitution (36/56, 64.3 %), usually serine (23/36, 63.9 %). We found two pathogenic variants of the COL1A1 gene c.2461G > A (p.Gly821Ser) in four unrelated patients and one, c.2005G > A (p.Ala669Thr), in two unrelated patients. Our data showed a lower number of collagen OI pathogenic variants in Vietnamese patients compared to reported rates for Asian populations. The OI mutational profile of the Vietnamese population is unique and related to the presence of a high number of recessive mutations in non-collagenous OI genes. Further analysis of OI patients negative for collagen mutations, is required.

  1. An iterated local search algorithm for the team orienteering problem with variable profits

    NASA Astrophysics Data System (ADS)

    Gunawan, Aldy; Ng, Kien Ming; Kendall, Graham; Lai, Junhan

    2018-07-01

    The orienteering problem (OP) is a routing problem that has numerous applications in various domains such as logistics and tourism. The objective is to determine a subset of vertices to visit for a vehicle so that the total collected score is maximized and a given time budget is not exceeded. The extensive application of the OP has led to many different variants, including the team orienteering problem (TOP) and the team orienteering problem with time windows. The TOP extends the OP by considering multiple vehicles. In this article, the team orienteering problem with variable profits (TOPVP) is studied. The main characteristic of the TOPVP is that the amount of score collected from a visited vertex depends on the duration of stay on that vertex. A mathematical programming model for the TOPVP is first presented and an algorithm based on iterated local search (ILS) that is able to solve modified benchmark instances is then proposed. It is concluded that ILS produces solutions which are comparable to those obtained by the commercial solver CPLEX for smaller instances. For the larger instances, ILS obtains good-quality solutions that have significantly better objective value than those found by CPLEX under reasonable computational times.
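
    The defining feature of the TOPVP, a score that depends on how long the vehicle stays at a vertex, can be illustrated by evaluating a single route. The following Python sketch is an illustration only: the abstract does not give the profit function, so a linear score per unit of stay is assumed, and evaluate_route, the coordinates and the rates are hypothetical.

```python
from math import dist


def evaluate_route(route, stays, coords, rates, budget):
    """Return (total_time, total_score, feasible) for one vehicle's route.

    route  : vertex ids, starting and ending at the depot
    stays  : dict mapping vertex -> time spent at that vertex
    coords : dict mapping vertex -> (x, y); travel time is Euclidean distance
    rates  : dict mapping vertex -> score per unit of stay (assumed linear)
    budget : time budget for the whole route
    """
    travel = sum(dist(coords[a], coords[b]) for a, b in zip(route, route[1:]))
    visited = route[1:-1]  # the depot at both ends collects no score
    service = sum(stays.get(v, 0.0) for v in visited)
    score = sum(rates[v] * stays.get(v, 0.0) for v in visited)
    total = travel + service
    return total, score, total <= budget


coords = {0: (0, 0), 1: (3, 4), 2: (3, 0)}
result = evaluate_route([0, 1, 2, 0], {1: 2, 2: 1}, coords, {1: 10, 2: 4}, 16)
```

    A local search such as ILS would call an evaluation of this kind repeatedly while perturbing both the visiting order and the stay durations.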

  2. Specialized microbial databases for inductive exploration of microbial genome sequences

    PubMed Central

    Fang, Gang; Ho, Christine; Qiu, Yaowu; Cubas, Virginie; Yu, Zhou; Cabau, Cédric; Cheung, Frankie; Moszer, Ivan; Danchin, Antoine

    2005-01-01

    Background The enormous amount of genome sequence data asks for user-oriented databases to manage sequences and annotations. Queries must include search tools permitting function identification through exploration of related objects. Methods The GenoList package for collecting and mining microbial genome databases has been rewritten using MySQL as the database management system. Functions that were not available in MySQL, such as nested subquery, have been implemented. Results Inductive reasoning in the study of genomes starts from "islands of knowledge", centered around genes with some known background. With this concept of "neighborhood" in mind, a modified version of the GenoList structure has been used for organizing sequence data from prokaryotic genomes of particular interest in China. GenoChore, a set of 17 specialized end-user-oriented microbial databases (including one instance of Microsporidia, Encephalitozoon cuniculi, a member of Eukarya) has been made publicly available. These databases allow the user to browse genome sequence and annotation data using standard queries. In addition they provide a weekly update of searches against the world-wide protein sequences data libraries, allowing one to monitor annotation updates on genes of interest. Finally, they allow users to search for patterns in DNA or protein sequences, taking into account a clustering of genes into formal operons, as well as providing extra facilities to query sequences using predefined sequence patterns. Conclusion This growing set of specialized microbial databases organizes data created by the first Chinese bacterial genome programs (ThermaList, Thermoanaerobacter tengcongensis, LeptoList, with two different genomes of Leptospira interrogans and SepiList, Staphylococcus epidermidis) associated to related organisms for comparison. PMID:15698474

  3. A Model of Object-Identities and Values

    DTIC Science & Technology

    1990-02-23

    integrity constraints in its construct, which provides the natural integration of the logical database model and the object-oriented database model. ... portions are integrated by a simple commutative diagram of modeling functions. The formalism includes the expression of integrity constraints in its ... 5.2.2 The Concept Model and Its Semantics ... 5.2.3 Two Kinds of Predicates

  4. Implementing Relational Operations in an Object-Oriented Database

    DTIC Science & Technology

    1992-03-01

    computer aided software engineering (CASE) and computer aided design (CAD) tools. There has been some research done in the area of combining ... 2. Prograph Database Engine ... III. WHY AN R/O... in most business applications where the bulk of data being stored and manipulated is simply textual or numeric data that can be stored and manipulated

  5. SPINS: standardized protein NMR storage. A data dictionary and object-oriented relational database for archiving protein NMR spectra.

    PubMed

    Baran, Michael C; Moseley, Hunter N B; Sahota, Gurmukh; Montelione, Gaetano T

    2002-10-01

    Modern protein NMR spectroscopy laboratories have a rapidly growing need for an easily queried local archival system of raw experimental NMR datasets. SPINS (Standardized ProteIn Nmr Storage) is an object-oriented relational database that provides facilities for high-volume NMR data archival, organization of analyses, and dissemination of results to the public domain by automatic preparation of the header files required for submission of data to the BioMagResBank (BMRB). The current version of SPINS coordinates the process from data collection to BMRB deposition of raw NMR data by standardizing and integrating the storage and retrieval of these data in a local laboratory file system. Additional facilities include a data mining query tool, graphical database administration tools, and a NMRStar v2. 1.1 file generator. SPINS also includes a user-friendly internet-based graphical user interface, which is optionally integrated with Varian VNMR NMR data collection software. This paper provides an overview of the data model underlying the SPINS database system, a description of its implementation in Oracle, and an outline of future plans for the SPINS project.

  6. Selecting a Persistent Data Support Environment for Object-Oriented Applications

    DTIC Science & Technology

    1998-03-01

    key features of most object DBMS products is contained in the DBMS Needs Assessment for Objects from Barry and Associates. The developer should ... data structure and behavior in a self-contained module enhances maintainability of the system and promotes reuse of modules for similar domains ... considered together, represent a survey of commercial object-oriented database management systems. These references contain detailed information needed

  7. A database of gene-environment interactions pertaining to blood lipid traits, cardiovascular disease and type 2 diabetes

    USDA-ARS's Scientific Manuscript database

    As the role of the environment – diet, exercise, alcohol and tobacco use and sleep among others – is accorded a more prominent role in modifying the relationship between genetic variants and clinical measures of disease, consideration of gene-environment (GxE) interactions is a must. To facilitate i...

  8. The role of AMH and its receptor SNP in the pathogenesis of PCOS.

    PubMed

    Wang, Fang; Niu, Wen-Bin; Kong, Hui-Juan; Guo, Yi-Hong; Sun, Ying-Pu

    2017-01-05

    The etiology of polycystic ovaries syndrome (PCOS) is unknown. Studies probing the role of genetic variants of anti-Mullerian hormone (AMH) and its type II receptor (AMHR2) in the pathogenesis of PCOS have yielded inconsistent results. Thus, we performed a systematic review and meta-analysis to determine the role of genetic variants of AMH/AMHR2 in the pathogenesis of PCOS. A systematic search of electronic databases was performed. Statistical analysis was performed using the Comprehensive Meta-Analysis software (Version 3). Pooled Odds Ratios (OR) (95% confidence intervals) were determined to assess the association between genetic variants of AMH/AMHR2 and PCOS. Five studies, involving a total of 2042 PCOS cases and 1071 controls, were included in the meta-analysis. Single nucleotide polymorphisms of AMH and AMHR2 did not appear to confer a heightened risk for PCOS (OR: 0.954, 95% CI: 0.848-1.073; P = 0.435; and OR: 1.074, 95% CI: 0.875-1.318; P = 0.494, respectively). In this study, genetic variants of AMH or AMHR2 were not found to be associated with a higher risk for PCOS. Copyright © 2016. Published by Elsevier Ireland Ltd.
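
    A pooled odds ratio with a 95% confidence interval, as reported in this abstract, can be computed in several ways. The Python sketch below shows one common choice, fixed-effect inverse-variance pooling of log odds ratios. This is an assumption: the abstract names the software used but not the model, and the 2x2 counts in the example are made up for illustration and are not taken from the included studies.

```python
import math


def pooled_odds_ratio(tables):
    """Fixed-effect inverse-variance pooling of odds ratios.

    tables: list of (a, b, c, d) counts per study, where
            a = cases with the variant,    b = cases without it,
            c = controls with the variant, d = controls without it.
    Returns (pooled_or, ci_low, ci_high) with a 95% confidence interval.
    """
    weighted_sum = 0.0
    weight_total = 0.0
    for a, b, c, d in tables:
        log_or = math.log((a * d) / (b * c))
        variance = 1 / a + 1 / b + 1 / c + 1 / d  # variance of the log odds ratio
        weight = 1 / variance
        weighted_sum += weight * log_or
        weight_total += weight
    pooled_log_or = weighted_sum / weight_total
    se = math.sqrt(1 / weight_total)
    return (
        math.exp(pooled_log_or),
        math.exp(pooled_log_or - 1.96 * se),
        math.exp(pooled_log_or + 1.96 * se),
    )


# Illustrative counts only (not data from the meta-analysis).
pooled, low, high = pooled_odds_ratio([(10, 10, 10, 10), (40, 40, 20, 20)])
```

    A confidence interval that spans 1, as in the results quoted above, indicates no statistically significant association at the 5% level.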

  9. Crystal Structure of an Activated Variant of Small Heat Shock Protein Hsp16.5

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Mchaourab, Hassane S.; Lin, Yi-Lun; Spiller, Benjamin W.

    How does the sequence of a single small heat shock protein (sHSP) assemble into oligomers of different sizes? To gain insight into the underlying structural mechanism, we determined the crystal structure of an engineered variant of Methanocaldococcus jannaschii Hsp16.5 wherein a 14 amino acid peptide from human heat shock protein 27 (Hsp27) was inserted at the junction of the N-terminal region and the α-crystallin domain. In response to this insertion, the oligomer shell expands from 24 to 48 subunits while maintaining octahedral symmetry. Oligomer rearrangement does not alter the fold of the conserved α-crystallin domain nor does it disturb the interface holding the dimeric building block together. Rather, the flexible C-terminal tail of Hsp16.5 changes its orientation relative to the α-crystallin domain which enables alternative packing of dimers. This change in orientation preserves a peptide-in-groove interaction of the C-terminal tail with an adjacent β-sandwich, thereby holding the assembly together. The interior of the expanded oligomer, where substrates presumably bind, retains its predominantly nonpolar character relative to the outside surface. New large windows in the outer shell provide increased access to these substrate-binding regions, thus accounting for the higher affinity of this variant to substrates. Oligomer polydispersity regulates sHSPs chaperone activity in vitro and has been implicated in their physiological roles. The structural mechanism of Hsp16.5 oligomer flexibility revealed here, which is likely to be highly conserved across the sHSP superfamily, explains the relationship between oligomer expansion observed in disease-linked mutants and changes in chaperone activity.

  10. Crystal structure of an activated variant of small heat shock protein Hsp16.5.

    PubMed

    McHaourab, Hassane S; Lin, Yi-Lun; Spiller, Benjamin W

    2012-06-26

    How does the sequence of a single small heat shock protein (sHSP) assemble into oligomers of different sizes? To gain insight into the underlying structural mechanism, we determined the crystal structure of an engineered variant of Methanocaldococcus jannaschii Hsp16.5 wherein a 14 amino acid peptide from human heat shock protein 27 (Hsp27) was inserted at the junction of the N-terminal region and the α-crystallin domain. In response to this insertion, the oligomer shell expands from 24 to 48 subunits while maintaining octahedral symmetry. Oligomer rearrangement does not alter the fold of the conserved α-crystallin domain nor does it disturb the interface holding the dimeric building block together. Rather, the flexible C-terminal tail of Hsp16.5 changes its orientation relative to the α-crystallin domain which enables alternative packing of dimers. This change in orientation preserves a peptide-in-groove interaction of the C-terminal tail with an adjacent β-sandwich, thereby holding the assembly together. The interior of the expanded oligomer, where substrates presumably bind, retains its predominantly nonpolar character relative to the outside surface. New large windows in the outer shell provide increased access to these substrate-binding regions, thus accounting for the higher affinity of this variant to substrates. Oligomer polydispersity regulates sHSPs chaperone activity in vitro and has been implicated in their physiological roles. The structural mechanism of Hsp16.5 oligomer flexibility revealed here, which is likely to be highly conserved across the sHSP superfamily, explains the relationship between oligomer expansion observed in disease-linked mutants and changes in chaperone activity.

  11. Molecular Dynamics of CYP2D6 Polymorphisms in the Absence and Presence of a Mechanism-Based Inactivator Reveals Changes in Local Flexibility and Dominant Substrate Access Channels

    PubMed Central

    de Waal, Parker W.; Sunden, Kyle F.; Furge, Laura Lowe

    2014-01-01

    Cytochrome P450 enzymes (CYPs) represent an important enzyme superfamily involved in metabolism of many endogenous and exogenous small molecules. CYP2D6 is responsible for ∼15% of CYP-mediated drug metabolism and exhibits large phenotypic diversity within CYPs with over 100 different allelic variants. Many of these variants lead to functional changes in enzyme activity and substrate selectivity. Herein, a molecular dynamics comparative analysis of four different variants of CYP2D6 was performed. The comparative analysis included simulations with and without SCH 66712, a ligand that is also a mechanism-based inactivator, in order to investigate the possible structural basis of CYP2D6 inactivation. Analysis of protein stability highlighted significantly altered flexibility in both proximal and distal residues from the variant residues. In the absence of SCH 66712, *34, *17-2, and *17-3 displayed more flexibility than *1, and *53 displayed more rigidity. SCH 66712 binding reversed flexibility in *17-2 and *17-3, though *53 remained largely rigid. Throughout simulations with docked SCH 66712, ligand orientation within the heme-binding pocket was consistent with previously identified sites of metabolism and measured binding energies. Subsequent tunnel analysis of substrate access, egress, and solvent channels displayed varied bottleneck radii. Taken together, our results indicate that SCH 66712 should inactivate these allelic variants, although varied flexibility and substrate binding-pocket accessibility may alter its interaction abilities. PMID:25286176

  12. A trans-acting Variant within the Transcription Factor RIM101 Interacts with Genetic Background to Determine its Regulatory Capacity.

    PubMed

    Read, Timothy; Richmond, Phillip A; Dowell, Robin D

    2016-01-01

    Most genetic variants associated with disease occur within regulatory regions of the genome, underscoring the importance of defining the mechanisms underlying differences in regulation of gene expression between individuals. We discovered a pair of co-regulated, divergently oriented transcripts, AQY2 and ncFRE6, that are expressed in one strain of Saccharomyces cerevisiae, Σ1278b, but not in another, S288c. By combining classical genetics techniques with high-throughput sequencing, we identified a trans-acting single nucleotide polymorphism within the transcription factor RIM101 that causes the background-dependent expression of both transcripts. Subsequent RNA-seq experiments revealed that RIM101 regulates many more targets in S288c than in Σ1278b and that deletion of RIM101 in both backgrounds abrogates the majority of differential expression between the strains. Strikingly, only three transcripts undergo a significant change in expression after swapping RIM101 alleles between backgrounds, implying that the differences in the RIM101 allele lead to a remarkably focused transcriptional response. However, hundreds of RIM101-dependent targets undergo a subtle but consistent shift in expression in the S288c RIM101-swapped strain, but not its Σ1278b counterpart. We conclude that Σ1278b may harbor a variant(s) that buffers against widespread transcriptional dysregulation upon introduction of a non-native RIM101 allele, emphasizing the importance of accounting for genetic background when assessing the impact of a regulatory variant.

  13. Quantitative transmission electron microscopy analysis of multi-variant grains in present L1₀-FePt based heat assisted magnetic recording media

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Ho, Hoan, E-mail: hoan.ho@wdc.com; Department of Materials Science and Engineering, Carnegie Mellon University, Pittsburgh, Pennsylvania 15213; Zhu, Jingxi, E-mail: jingxiz@andrew.cmu.edu

    2014-11-21

    We present a study on atomic ordering within individual grains in granular L1₀-FePt thin films using transmission electron microscopy techniques. The film, used as a medium for heat assisted magnetic recording, consists of a single layer of FePt grains separated by non-magnetic grain boundaries and is grown on an MgO underlayer. Using convergent-beam techniques, diffraction patterns of individual grains are obtained for a large number of crystallites. The study found that although the majority of grains are ordered in the perpendicular direction, more than 15% of them are multi-variant, or of in-plane c-axis orientation, or disordered fcc. It was also found that these multi-variant and in-plane grains have always grown across MgO grain boundaries separating two or more MgO grains of the underlayer. The in-plane ordered portion within a multi-variant L1₀-FePt grain always lacks atomic coherence with the MgO directly underneath it, whereas the perpendicularly ordered portion is always coherent with the underlying MgO grain. Since the existence of multi-variant and in-plane ordered grains is severely detrimental to high density data storage capability, the understanding of their formation mechanism obtained here should make a significant impact on the future development of hard disk drive technology.

  14. Kinematic model for the space-variant image motion of star sensors under dynamical conditions

    NASA Astrophysics Data System (ADS)

    Liu, Chao-Shan; Hu, Lai-Hong; Liu, Guang-Bin; Yang, Bo; Li, Ai-Jun

    2015-06-01

    A kinematic description of a star spot in the focal plane is presented for star sensors under dynamical conditions, which involves all necessary parameters such as the image motion, velocity, and attitude parameters of the vehicle. Stars at different locations of the focal plane correspond to the slightly different orientation and extent of motion blur, which characterize the space-variant point spread function. Finally, the image motion, the energy distribution, and centroid extraction are numerically investigated using the kinematic model under dynamic conditions. A centroid error of eight successive iterations <0.002 pixel is used as the termination criterion for the Richardson-Lucy deconvolution algorithm. The kinematic model of a star sensor is useful for evaluating the compensation algorithms of motion-blurred images.

  15. Role of Different Kinds of Boundaries Against Cleavage Crack Propagation in Low-Temperature Embrittlement of Low-Carbon Martensitic Steel

    NASA Astrophysics Data System (ADS)

    Tsuboi, Mizuki; Shibata, Akinobu; Terada, Daisuke; Tsuji, Nobuhiro

    2017-07-01

    The present paper investigated the relationship between low-temperature embrittlement and microstructure of lath martensite in a low-carbon steel from both microstructural and crystallographic points of view. The fracture surface of the specimen after the miniaturized Charpy impact test at 98 K (-175 °C) mainly consisted of cleavage fracture facets parallel to crystallographic {001} planes of martensite. Through the crystallographic orientation analysis of micro-crack propagation, we found that the boundaries which separated different martensite variants having large misorientation angles of {001} cleavage planes could inhibit crack propagation. It was then concluded that the size of the aggregations of martensite variants belonging to the same Bain deformation group could control the low-temperature embrittlement of martensitic steels.

  16. Association of the S267F variant on NTCP gene and treatment response to pegylated interferon in patients with chronic hepatitis B: a multicentre study.

    PubMed

    Thanapirom, Kessarin; Suksawatamnuay, Sirinporn; Sukeepaisarnjaroen, Wattana; Treeprasertsuk, Sombat; Tanwandee, Tawesak; Charatcharoenwitthaya, Phunchai; Thongsawat, Satawat; Leerapun, Apinya; Piratvisuth, Teerha; Boonsirichan, Rattana; Bunchorntavakul, Chalermrat; Pattanasirigool, Chaowalit; Pornthisarn, Bubpha; Tuntipanichteerakul, Supoj; Sripariwuth, Ekawee; Jeamsripong, Woramon; Sanpajit, Theeranun; Poovorawan, Yong; Komolmit, Piyawat

    2018-01-01

    Sodium taurocholate co-transporting polypeptide (NTCP) is a cell receptor for HBV. The S267F variant on the NTCP gene is inversely associated with the chronicity of HBV infection, progression to cirrhosis and hepatocellular carcinoma in East Asian populations. The aim of this study was to determine whether the S267F variant was associated with response to pegylated interferon (PEG-IFN) in patients with chronic HBV infection. A total of 257 patients with chronic HBV, treated with PEG-IFN for 48 weeks, were identified from 13 tertiary hospitals included in the hepatitis B database of the Thai Association for the Study of the Liver (THASL). Of these, 202 patients were infected with HBV genotype C (84.9%); 146 patients were hepatitis B e antigen (HBeAg)-positive (56.8%). Genotypic frequencies of the S267F polymorphism were 85.2%, 14.8% and 0% for the GG, GA and AA genotypes, respectively. S267F GA was associated with sustained alanine aminotransferase (ALT) normalization (OR = 3.25, 95% CI 1.23, 8.61; P=0.02) in HBeAg-positive patients. Patients with S267F variant tended to have more virological response, sustained response with hepatitis B surface antigen (HBsAg) loss at 24 weeks following PEG-IFN treatment. There was no association between the S267F variant and improved patient outcomes in HBeAg-negative patients. The S267F variant on the NTCP gene is independently associated with sustained normalization of ALT following treatment with PEG-IFN in patients with HBV infection who are HBeAg-positive. The findings of this study provide additional support for the clinical significance of the S267F variant of NTCP beyond HBV entry.
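
    The genotype frequencies reported above imply allele frequencies by simple allele counting. The following is a minimal sketch of that arithmetic, assuming the reported percentages are exact and that each patient carries two alleles; the function name is illustrative and does not come from the study.

```python
def allele_frequencies(freq_gg, freq_ga, freq_aa):
    """Return (freq_G, freq_A) from genotype frequencies that sum to 1."""
    total = freq_gg + freq_ga + freq_aa
    if abs(total - 1.0) > 1e-6:
        raise ValueError("genotype frequencies must sum to 1")
    # Each heterozygote (GA) contributes one copy of each allele.
    freq_g = freq_gg + 0.5 * freq_ga
    freq_a = freq_aa + 0.5 * freq_ga
    return freq_g, freq_a


# Reported S267F genotype frequencies: GG 85.2%, GA 14.8%, AA 0%.
freq_g, freq_a = allele_frequencies(0.852, 0.148, 0.0)
print(f"G allele: {freq_g:.3f}, A allele: {freq_a:.3f}")
```

    Under these assumptions the variant (A) allele frequency is about 7.4%, half the heterozygote frequency, because no AA homozygotes were observed.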

  17. Novel Lethal Form of Congenital Hypopituitarism Associated With the First Recessive LHX4 Mutation

    PubMed Central

    Gregory, L. C.; Humayun, K. N.; Turton, J. P. G.; McCabe, M. J.; Rhodes, S. J.

    2015-01-01

    Background: LHX4 encodes a member of the LIM-homeodomain family of transcription factors that is required for normal development of the pituitary gland. To date, only incompletely penetrant heterozygous mutations in LHX4 have been described in patients with variable combined pituitary hormone deficiencies. Objective/Hypothesis: To report a unique family with a novel recessive variant in LHX4 associated with a lethal form of congenital hypopituitarism that was identified through screening a total of 97 patients. Method: We screened 97 unrelated patients with combined pituitary hormone deficiency, including 65% with an ectopic posterior pituitary, for variants in the LHX4 gene using Sanger sequencing. Control databases (1000 Genomes, dbSNP, Exome Variant Server, ExAC Browser) were consulted upon identification of variants. Results: We identified the first novel homozygous missense variant (c.377C>T, p.T126M) in two deceased male patients of Pakistani origin with severe panhypopituitarism associated with anterior pituitary aplasia and posterior pituitary ectopia. Both were born small for gestational age with a small phallus, undescended testes, and mid-facial hypoplasia. The parents' first-born child was a female with mid-facial hypoplasia (DNA was unavailable). Despite rapid commencement of hydrocortisone and T4 in the brothers, all three children died within the first week of life. The LHX4(p.T126M) variant is located within the LIM2 domain, in a highly conserved location. The absence of homozygosity for the variant in over 65 000 controls suggests that it is likely to be responsible for the phenotype. Conclusion: We report, for the first time to our knowledge, a novel homozygous mutation in LHX4 associated with a lethal phenotype, implying that recessive mutations in LHX4 may be incompatible with life. PMID:25871839

  18. Comparative transcriptome analysis of three color variants of the sea cucumber Apostichopus japonicus.

    PubMed

    Jo, Jihoon; Park, Jongsun; Lee, Hyun-Gwan; Kern, Elizabeth M A; Cheon, Seongmin; Jin, Soyeong; Park, Joong-Ki; Cho, Sung-Jin; Park, Chungoo

    2016-08-01

    The sea cucumber Apostichopus japonicus Selenka 1867 represents an important resource in biomedical research, traditional medicine, and the seafood industry. Much of the commercial value of A. japonicus is determined by dorsal/ventral color variation (red, green, and black), yet the taxonomic relationships between these color variants are not clearly understood. We performed the first comparative analysis of de novo assembled transcriptome data from three color variants of A. japonicus. Using the Illumina platform, we sequenced nearly 177,596,774 clean reads representing a total of 18.2Gbp of sea cucumber transcriptome. A comparison of over 0.3 million transcript scaffolds against the Uniprot/Swiss-Prot database yielded 8513, 8602, and 8588 positive matches for green, red, and black body color transcriptomes, respectively. Using the Panther gene classification system, we assessed an extensive and diverse set of expressed genes in three color variants and found that (1) among the three color variants of A. japonicus, genes associated with RNA binding protein, oxidoreductase, nucleic acid binding, transferase, and KRAB box transcription factor were most commonly expressed; and (2) the main protein functional classes are differently regulated in all three color variants (extracellular matrix protein and phosphatase for green color, transporter and potassium channel for red color, and G-protein modulator and enzyme modulator for black color). This work will assist in the discovery and annotation of novel genes that play significant morphological and physiological roles in color variants of A. japonicus, and these sequence data will provide a useful set of resources for the rapidly growing sea cucumber aquaculture industry.

  19. Canadian Open Genetics Repository (COGR): a unified clinical genomics database as a community resource for standardising and sharing genetic interpretations.

    PubMed

    Lerner-Ellis, Jordan; Wang, Marina; White, Shana; Lebo, Matthew S

    2015-07-01

    The Canadian Open Genetics Repository is a collaborative effort for the collection, storage, sharing and robust analysis of variants reported by medical diagnostics laboratories across Canada. As clinical laboratories adopt modern genomics technologies, the need for this type of collaborative framework is increasingly important. A survey to assess existing protocols for variant classification and reporting was delivered to clinical genetics laboratories across Canada. Based on feedback from this survey, a variant assessment tool was made available to all laboratories. Each participating laboratory was provided with an instance of GeneInsight, a software featuring versioning and approval processes for variant assessments and interpretations and allowing for variant data to be shared between instances. Guidelines were established for sharing data among clinical laboratories and in the final outreach phase, data will be made readily available to patient advocacy groups for general use. The survey demonstrated the need for improved standardisation and data sharing across the country. A variant assessment template was made available to the community to aid with standardisation. Instances of the GeneInsight tool were provided to clinical diagnostic laboratories across Canada for the purpose of uploading, transferring, accessing and sharing variant data. As an ongoing endeavour and a permanent resource, the Canadian Open Genetics Repository aims to serve as a focal point for the collaboration of Canadian laboratories with other countries in the development of tools that take full advantage of laboratory data in diagnosing, managing and treating genetic diseases. Published by the BMJ Publishing Group Limited.

  20. Mapping genetic variations to three-dimensional protein structures to enhance variant interpretation: a proposed framework.

    PubMed

    Glusman, Gustavo; Rose, Peter W; Prlić, Andreas; Dougherty, Jennifer; Duarte, José M; Hoffman, Andrew S; Barton, Geoffrey J; Bendixen, Emøke; Bergquist, Timothy; Bock, Christian; Brunk, Elizabeth; Buljan, Marija; Burley, Stephen K; Cai, Binghuang; Carter, Hannah; Gao, JianJiong; Godzik, Adam; Heuer, Michael; Hicks, Michael; Hrabe, Thomas; Karchin, Rachel; Leman, Julia Koehler; Lane, Lydie; Masica, David L; Mooney, Sean D; Moult, John; Omenn, Gilbert S; Pearl, Frances; Pejaver, Vikas; Reynolds, Sheila M; Rokem, Ariel; Schwede, Torsten; Song, Sicheng; Tilgner, Hagen; Valasatava, Yana; Zhang, Yang; Deutsch, Eric W

    2017-12-18

    The translation of personal genomics to precision medicine depends on the accurate interpretation of the multitude of genetic variants observed for each individual. However, even when genetic variants are predicted to modify a protein, their functional implications may be unclear. Many diseases are caused by genetic variants affecting important protein features, such as enzyme active sites or interaction interfaces. The scientific community has catalogued millions of genetic variants in genomic databases and thousands of protein structures in the Protein Data Bank. Mapping mutations onto three-dimensional (3D) structures enables atomic-level analyses of protein positions that may be important for the stability or formation of interactions; these may explain the effect of mutations and in some cases even open a path for targeted drug development. To accelerate progress in the integration of these data types, we held a two-day Gene Variation to 3D (GVto3D) workshop to report on the latest advances and to discuss unmet needs. The overarching goal of the workshop was to address the question: what can be done together as a community to advance the integration of genetic variants and 3D protein structures that could not be done by a single investigator or laboratory? Here we describe the workshop outcomes, review the state of the field, and propose the development of a framework with which to promote progress in this arena. The framework will include a set of standard formats, common ontologies, a common application programming interface to enable interoperation of the resources, and a Tool Registry to make it easy to find and apply the tools to specific analysis problems. Interoperability will enable integration of diverse data sources and tools and collaborative development of variant effect prediction methods.

  1. Defect structures and growth mechanisms of boron arsenide epilayers grown on 6H-silicon carbide and 15R-silicon carbide substrates

    NASA Astrophysics Data System (ADS)

    Chen, Hui

    B12As2 possesses extraordinary properties, such as a wide bandgap of 3.47 eV and a unique 'self-healing' ability after electron irradiation damage, which make it attractive for applications in space electronics, high temperature semiconductors and, in particular, beta cells, devices capable of producing electrical energy by coupling a radioactive beta emitter to a semiconductor junction. Due to the absence of native substrates, B12As2 has been grown on substrates with compatible structural parameters via chemical vapor deposition. To date, growth on Si with (100), (110) and (111) orientation and on (0001) 6H-SiC has been attempted. However, structural variants, including rotational and translational variants, have been observed in the epilayers and are expected to have a detrimental effect on device performance, which has severely hindered progress of this material to date. In addition, none of the earlier reports provide a detailed atomic level study of defect structures in the films, and growth mechanisms remain obscure. The focus of this thesis is to study defect structures in B12As2 films grown on different SiC substrates using synchrotron x-ray topography, high resolution transmission electron microscopy and other characterization techniques. The goals of the studies are to understand the generation of the defects present in B12As2 films and their growth mechanisms so as to develop strategies to reduce defect densities and obtain better film quality for future device fabrication. The following detailed studies have been carried out: (1) The microstructures in B12As2 epitaxial layers grown on on-axis c-plane (0001) 6H-SiC substrates were analyzed in detail. Synchrotron white beam X-ray topography (SWBXT) and scanning electron microscopy (SEM) revealed a mosaic structure consisting of a solid solution of twin and matrix epilayer domains. The epitaxial relationship was determined to be (0001)B12As2<112¯0>B12As2||(0001)6H-SiC<112¯0>6H-SiC. B12As2 twinned domains were found in the epilayer and the twin relationship consisted of a 180° rotation about [0001]B12As2. High resolution transmission electron microscopy (HRTEM) observation revealed an evolution of the film microstructure from a ~200 nm thick disordered mosaic transition layer to a more ordered structure. Observing the structural projections of the film lower surface and the substrate upper surface, generated by CaRine 4.0 crystal visualization software, eight possible nucleation sites were found to be available on the substrate surface by considering the stable bonding configurations between the boron triangles at the bottom of the boron icosahedra and the Si dangling bonds on the Si oriented (0001) 6H-SiC substrate surface. The transition layer was suggested to arise from the coalescence of translationally and rotationally variant domains nucleated at the various nucleation sites on the (0001) 6H-SiC surface. Boundaries between translationally variant domains were shown to have unfavorable high-energy bonding configurations, while the formation of a 1/3[0001]B12As2 Frank partial dislocation enabled elimination of these high energy boundaries during mutual overgrowth. In consequence, the film quality beyond thicknesses of ~200 nm can be improved as the translational variants grow out, leaving only the twin variants. (0003) twin boundaries in the regions beyond 200 nm are shown to possess fault vectors such as 1/6[11¯00]B12As2, which originate from the mutual shift between the nucleation sites of the respective domains. (2) The effect of the substrate off-cut angle on the growth of B12As2 epitaxial layers was studied using a 3.5° off-cut (0001) 6H-SiC substrate. A combined characterization approach comprising SWBXT, SEM, and conventional and high resolution TEM was employed. Similar to the growth on on-axis c-plane 6H-SiC, the epitaxial relationship is identified to be (0001)B12As2<112¯0>B12As2||(0001)6H-SiC<112¯0>6H-SiC. It is also revealed that the epilayer consists of a solid solution of B12As2 twinned domains. The 3.5° off-cut angle breaks the surface symmetry of c-plane 6H-SiC; however, the width of each single terrace is large enough to provide eight possible nonequivalent nucleation sites for the growth of B12As2. In consequence, there could be eight possible structural variants in the film, which indicates that the 3.5° off-cut angle has little effect in reducing the number of possible structural variants in the epilayer and that this substrate thus may not be an excellent choice for growing high quality B12As2 film. (3) The microstructures of B12As2 epitaxial layers grown on m-plane 6H-SiC substrates were investigated. A mosaic structure formed by six types of domains, including (1-21) B12As2, (2-12) B12As2, (353) B12As2 and their respective {111} twins, was found in the epilayer. The choice of the various growth orientations in the B12As2 film was proposed to arise from the following factors: (a) the tendency for B12As2 to grow with {1-21} low energy surface facets; (b) the tendency to minimize the in-plane lattice mismatch between B12As2 planes oriented approximately parallel to the SiC (0001) planes so as to alleviate local strain in the film/substrate interface; (c) the tendency to nucleate on 3-3 symmetric close-packed atomic steps exposed on the substrate surface after hydrogen etching. (4) Epitaxial growth of single crystalline B12As2 was discovered and investigated on m-plane 15R-SiC inclusions in a 6H-SiC substrate wafer. SEM showed only one type of triangular feature on the smooth surface of the film, which indicated a single growth orientation of B12As2. This is confirmed by SWBXT and cross-sectional HRTEM, which revealed untwinned (353) oriented B12As2, with significantly improved macroscopic properties as confirmed by Raman spectroscopy. The corresponding growth model involving the bonding configuration between the film and the substrate was developed. It was found that the choice of the unique film orientation substantially resulted from the tendency to nucleate in (111)B12As2 orientation on (474)15R-SiC close-packed facets that are exposed on the m-plane 15R-SiC surface. This indicates that m-plane 15R-SiC could be a potentially excellent substrate to grow high quality B12As2 for future device fabrication.

  2. The 24th annual Nucleic Acids Research database issue: a look back and upcoming changes

    PubMed Central

    Rigden, Daniel J

    2017-01-01

    Abstract This year's Database Issue of Nucleic Acids Research contains 152 papers that include descriptions of 54 new databases and update papers on 98 databases, of which 16 have not been previously featured in NAR. As always, these databases cover a broad range of molecular biology subjects, including genome structure, gene expression and its regulation, proteins, protein domains, and protein–protein interactions. Following the recent trend, an increasing number of new and established databases deal with the issues of human health, from cancer-causing mutations to drugs and drug targets. In accordance with this trend, three recently compiled databases that have been selected by NAR reviewers and editors as ‘breakthrough’ contributions, denovo-db, the Monarch Initiative, and Open Targets, cover human de novo gene variants, disease-related phenotypes in model organisms, and a bioinformatics platform for therapeutic target identification and validation, respectively. We expect these databases to attract the attention of numerous researchers working in various areas of genetics and genomics. Looking back at the past 12 years, we present here the ‘golden set’ of databases that have consistently served as authoritative, comprehensive, and convenient data resources widely used by the entire community and offer some lessons on what makes a successful database. The Database Issue is freely available online at the https://academic.oup.com/nar web site. An updated version of the NAR Molecular Biology Database Collection is available at http://www.oxfordjournals.org/nar/database/a/. PMID:28053160

  3. OBSIFRAC: database-supported software for 3D modeling of rock mass fragmentation

    NASA Astrophysics Data System (ADS)

    Empereur-Mot, Luc; Villemin, Thierry

    2003-03-01

    Under stress, fractures in rock masses tend to form fully connected networks. The mass can thus be thought of as a 3D series of blocks produced by fragmentation processes. A numerical model has been developed that uses a relational database to describe such a mass. The model, which assumes the fractures to be plane, allows data from natural networks to test theories concerning fragmentation processes. In the model, blocks are bordered by faces that are composed of edges and vertices. A fracture can originate from a seed point, its orientation being controlled by the stress field specified by an orientation matrix. Alternatively, it can be generated from a discrete set of given orientations and positions. Both kinds of fracture can occur together in a model. From an original simple block, a given fracture produces two simple polyhedral blocks, and the original block becomes compound. Compound and simple blocks created throughout fragmentation are stored in the database. Several fragmentation processes have been studied. In one scenario, a constant proportion of blocks is fragmented at each step of the process. The resulting distribution appears to be fractal, although seed points are random in each fragmented block. In a second scenario, division affects only one random block at each stage of the process, and gives a Weibull volume distribution law. This software can be used for a large number of other applications.
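
    The second fragmentation scenario described above (one random block divided at each stage) can be sketched in a few lines of Python. This is a toy illustration under stated assumptions, not the OBSIFRAC implementation: blocks are reduced to their volumes, the split fraction is drawn uniformly at random, and a list of dictionaries stands in for the relational table of simple and compound blocks. All names are hypothetical.

```python
import random


def fragment_one_block(n_steps, seed=0):
    """Split one randomly chosen simple block in two at each step.

    Returns a list of dict rows with keys id, parent, volume, compound,
    standing in for a database table of blocks.
    """
    rng = random.Random(seed)
    blocks = [{"id": 0, "parent": None, "volume": 1.0, "compound": False}]
    for _ in range(n_steps):
        simple = [b for b in blocks if not b["compound"]]
        target = rng.choice(simple)
        # Assumption: the fracture cuts the block at a uniform random volume fraction.
        fraction = rng.uniform(0.0, 1.0)
        # The original block is kept in the table but becomes compound.
        target["compound"] = True
        for volume in (target["volume"] * fraction,
                       target["volume"] * (1.0 - fraction)):
            blocks.append({"id": len(blocks), "parent": target["id"],
                           "volume": volume, "compound": False})
    return blocks


blocks = fragment_one_block(n_steps=100)
simple_volumes = [b["volume"] for b in blocks if not b["compound"]]
print(len(simple_volumes), "simple blocks, total volume",
      round(sum(simple_volumes), 6))
```

    Keeping the compound parents in the table preserves the fragmentation history, so the volume distribution of the simple blocks can be examined at any stage.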

  4. Automated database-guided expert-supervised orientation for immunophenotypic diagnosis and classification of acute leukemia

    PubMed Central

    Lhermitte, L; Mejstrikova, E; van der Sluijs-Gelling, A J; Grigore, G E; Sedek, L; Bras, A E; Gaipa, G; Sobral da Costa, E; Novakova, M; Sonneveld, E; Buracchi, C; de Sá Bacelar, T; te Marvelde, J G; Trinquand, A; Asnafi, V; Szczepanski, T; Matarraz, S; Lopez, A; Vidriales, B; Bulsa, J; Hrusak, O; Kalina, T; Lecrevisse, Q; Martin Ayuso, M; Brüggemann, M; Verde, J; Fernandez, P; Burgos, L; Paiva, B; Pedreira, C E; van Dongen, J J M; Orfao, A; van der Velden, V H J

    2018-01-01

    Precise classification of acute leukemia (AL) is crucial for adequate treatment. EuroFlow has previously designed an AL orientation tube (ALOT) to guide towards the relevant classification panel (T-cell acute lymphoblastic leukemia (T-ALL), B-cell precursor (BCP)-ALL and/or acute myeloid leukemia (AML)) and final diagnosis. Now we built a reference database with 656 typical AL samples (145 T-ALL, 377 BCP-ALL, 134 AML), processed and analyzed via standardized protocols. Using principal component analysis (PCA)-based plots and automated classification algorithms for direct comparison of single-cells from individual patients against the database, another 783 cases were subsequently evaluated. Depending on the database-guided results, patients were categorized as: (i) typical T, B or Myeloid without or; (ii) with a transitional component to another lineage; (iii) atypical; or (iv) mixed-lineage. Using this automated algorithm, in 781/783 cases (99.7%) the right panel was selected, and data comparable to the final WHO-diagnosis was already provided in >93% of cases (85% T-ALL, 97% BCP-ALL, 95% AML and 87% mixed-phenotype AL patients), even without data on the full-characterization panels. Our results show that database-guided analysis facilitates standardized interpretation of ALOT results and allows accurate selection of the relevant classification panels, hence providing a solid basis for designing future WHO AL classifications. PMID:29089646

  5. NoSQL for Storage and Retrieval of Large LIDAR Data Collections

    NASA Astrophysics Data System (ADS)

    Boehm, J.; Liu, K.

    2015-08-01

    Developments in LiDAR technology over the past decades have made LiDAR a mature and widely accepted source of geospatial information. This in turn has led to an enormous growth in data volume. The central idea for a file-centric storage of LiDAR point clouds is the observation that large collections of LiDAR data are typically delivered as large collections of files, rather than single files of terabyte size. This split of the dataset, commonly referred to as tiling, was usually done to accommodate a specific processing pipeline. It therefore makes sense to preserve this split. A document-oriented NoSQL database can easily emulate this data partitioning by representing each tile (file) in a separate document. The document stores the metadata of the tile. The actual files are stored in a distributed file system emulated by the NoSQL database. We demonstrate the use of MongoDB, a highly scalable document-oriented NoSQL database, for storing large LiDAR files. MongoDB, like any NoSQL database, allows for queries on the attributes of the document. As a specialty, MongoDB also allows spatial queries. Hence we can perform spatial queries on the bounding boxes of the LiDAR tiles. Inserting and retrieving files on a cloud-based database is compared to native file system and cloud storage transfer speed.
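
    The one-document-per-tile layout and the bounding-box query described above can be illustrated without a database. The sketch below is plain Python standing in for the document store: it does not use the MongoDB API, and the field names and file names are hypothetical.

```python
def make_tile_document(filename, min_x, min_y, max_x, max_y, point_count):
    """Build the metadata document for one LiDAR tile (one file)."""
    return {
        "filename": filename,
        "bbox": {"min_x": min_x, "min_y": min_y,
                 "max_x": max_x, "max_y": max_y},
        "point_count": point_count,
    }


def bbox_intersects(a, b):
    """True if two axis-aligned bounding boxes overlap or touch."""
    return (a["min_x"] <= b["max_x"] and b["min_x"] <= a["max_x"]
            and a["min_y"] <= b["max_y"] and b["min_y"] <= a["max_y"])


def find_tiles(collection, query_bbox):
    """Return the tile documents whose bounding box meets the query box."""
    return [doc for doc in collection if bbox_intersects(doc["bbox"], query_bbox)]


tiles = [
    make_tile_document("tile_a.las", 0, 0, 10, 10, 100),
    make_tile_document("tile_b.las", 10, 0, 20, 10, 200),
    make_tile_document("tile_c.las", 30, 30, 40, 40, 300),
]
query = {"min_x": 5, "min_y": 5, "max_x": 12, "max_y": 8}
print([doc["filename"] for doc in find_tiles(tiles, query)])
```

    A real document store would answer the same question through a spatial index rather than the linear scan used here; the scan only shows what the query means.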

  6. Intrahaplotypic Variants Differentiate Complex Linkage Disequilibrium within Human MHC Haplotypes

    PubMed Central

    Lam, Tze Hau; Tay, Matthew Zirui; Wang, Bei; Xiao, Ziwei; Ren, Ee Chee

    2015-01-01

    Distinct regions of long-range genetic fixation in the human MHC region, known as conserved extended haplotypes (CEHs), possess unique genomic characteristics and are strongly associated with numerous diseases. While CEHs appear to be homogeneous by SNP analysis, the nature of fine variations within their genomic structure is unknown. Using multiple, MHC-homozygous cell lines, we demonstrate extensive sequence conservation in two common Asian MHC haplotypes: A33-B58-DR3 and A2-B46-DR9. However, characterization of phase-resolved MHC haplotypes revealed unique intra-CEH patterns of variation and uncovered 127 single nucleotide variants (SNVs) which are missing from public databases. We further show that the strong linkage disequilibrium structure within the human MHC that typically confounds precise identification of genetic features can be resolved using intra-CEH variants, as evidenced by rs3129063 and rs448489, which affect expression of ZFP57, a gene important in methylation and epigenetic regulation. This study demonstrates an improved strategy that can be used towards genetic dissection of diseases. PMID:26593880

  7. A Python package for parsing, validating, mapping and formatting sequence variants using HGVS nomenclature.

    PubMed

    Hart, Reece K; Rico, Rudolph; Hare, Emily; Garcia, John; Westbrook, Jody; Fusaro, Vincent A

    2015-01-15

    Biological sequence variants are commonly represented in scientific literature, clinical reports and databases of variation using the mutation nomenclature guidelines endorsed by the Human Genome Variation Society (HGVS). Despite the widespread use of the standard, no freely available and comprehensive programming libraries are available. Here we report an open-source and easy-to-use Python library that facilitates the parsing, manipulation, formatting and validation of variants according to the HGVS specification. The current implementation focuses on the subset of the HGVS recommendations that precisely describe sequence-level variation relevant to the application of high-throughput sequencing to clinical diagnostics. The package is released under the Apache 2.0 open-source license. Source code, documentation and issue tracking are available at http://bitbucket.org/hgvs/hgvs/. Python packages are available at PyPI (https://pypi.python.org/pypi/hgvs). Supplementary data are available at Bioinformatics online. © The Author 2014. Published by Oxford University Press.
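
    To make the parse, validate and format steps named in this abstract concrete, here is a minimal sketch in plain Python. It deliberately does not use the hgvs package or reproduce its API, and it covers only simple single-nucleotide substitutions such as c.1654G>T, which is a small fraction of the HGVS recommendations. The names `Substitution`, `parse_substitution` and `format_substitution` and the accession NM_TEST.1 are hypothetical, and the validation is purely syntactic: nothing is checked against a reference sequence.

```python
import re
from typing import NamedTuple, Optional


class Substitution(NamedTuple):
    """A single-nucleotide substitution such as c.1654G>T."""
    accession: Optional[str]  # sequence identifier before the colon, if any
    seq_type: str             # "c" (coding DNA) or "g" (genomic)
    position: int
    ref: str
    alt: str


# Optional "ACCESSION:" prefix, then type, position, and REF>ALT.
_PATTERN = re.compile(
    r"^(?:(?P<ac>[^:]+):)?"
    r"(?P<type>[cg])\."
    r"(?P<pos>\d+)"
    r"(?P<ref>[ACGT])>(?P<alt>[ACGT])$"
)


def parse_substitution(text):
    """Parse a simple substitution string; raise ValueError if it is malformed."""
    match = _PATTERN.match(text)
    if match is None:
        raise ValueError(f"not a simple substitution: {text!r}")
    if match["ref"] == match["alt"]:
        raise ValueError(f"reference and alternate bases are identical: {text!r}")
    return Substitution(match["ac"], match["type"], int(match["pos"]),
                        match["ref"], match["alt"])


def format_substitution(variant):
    """Format a Substitution back into its string form."""
    prefix = f"{variant.accession}:" if variant.accession else ""
    return f"{prefix}{variant.seq_type}.{variant.position}{variant.ref}>{variant.alt}"


variant = parse_substitution("NM_TEST.1:c.1654G>T")  # hypothetical accession
print(variant.position, variant.ref, variant.alt)
print(format_substitution(variant))
```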

  8. EvoSNP-DB: A database of genetic diversity in East Asian populations.

    PubMed

    Kim, Young Uk; Kim, Young Jin; Lee, Jong-Young; Park, Kiejung

    2013-08-01

    Genome-wide association studies (GWAS) have become popular as an approach for the identification of large numbers of phenotype-associated variants. However, differences in genetic architecture and environmental factors mean that the effect of variants can vary across populations. Understanding population genetic diversity is valuable for the investigation of possible population specific and independent effects of variants. EvoSNP-DB aims to provide information regarding genetic diversity among East Asian populations, including Chinese, Japanese, and Korean. Non-redundant SNPs (1.6 million) were genotyped in 54 Korean trios (162 samples) and were compared with 4 million SNPs from HapMap phase II populations. EvoSNP-DB provides two user interfaces for data query and visualization, and integrates scores of genetic diversity (Fst and VarLD) at the level of SNPs, genes, and chromosome regions. EvoSNP-DB is a web-based application that allows users to navigate and visualize measurements of population genetic differences in an interactive manner, and is available online at [http://biomi.cdc.go.kr/EvoSNP/].

  9. A Python package for parsing, validating, mapping and formatting sequence variants using HGVS nomenclature

    PubMed Central

    Hart, Reece K.; Rico, Rudolph; Hare, Emily; Garcia, John; Westbrook, Jody; Fusaro, Vincent A.

    2015-01-01

    Summary: Biological sequence variants are commonly represented in scientific literature, clinical reports and databases of variation using the mutation nomenclature guidelines endorsed by the Human Genome Variation Society (HGVS). Despite the widespread use of the standard, no freely available and comprehensive programming libraries are available. Here we report an open-source and easy-to-use Python library that facilitates the parsing, manipulation, formatting and validation of variants according to the HGVS specification. The current implementation focuses on the subset of the HGVS recommendations that precisely describe sequence-level variation relevant to the application of high-throughput sequencing to clinical diagnostics. Availability and implementation: The package is released under the Apache 2.0 open-source license. Source code, documentation and issue tracking are available at http://bitbucket.org/hgvs/hgvs/. Python packages are available at PyPI (https://pypi.python.org/pypi/hgvs). Contact: reecehart@gmail.com Supplementary information: Supplementary data are available at Bioinformatics online. PMID:25273102

  10. Information mining in remote sensing imagery

    NASA Astrophysics Data System (ADS)

    Li, Jiang

    The volume of remotely sensed imagery continues to grow at an enormous rate due to the advances in sensor technology, and our capability for collecting and storing images has greatly outpaced our ability to analyze and retrieve information from the images. This motivates us to develop image information mining techniques, which is very much an interdisciplinary endeavor drawing upon expertise in image processing, databases, information retrieval, machine learning, and software design. This dissertation proposes and implements an extensive remote sensing image information mining (ReSIM) system prototype for mining useful information implicitly stored in remote sensing imagery. The system consists of three modules: image processing subsystem, database subsystem, and visualization and graphical user interface (GUI) subsystem. Land cover and land use (LCLU) information corresponding to spectral characteristics is identified by supervised classification based on support vector machines (SVM) with automatic model selection, while textural features that characterize spatial information are extracted using Gabor wavelet coefficients. Within LCLU categories, textural features are clustered using an optimized k-means clustering approach to acquire search efficient space. The clusters are stored in an object-oriented database (OODB) with associated images indexed in an image database (IDB). A k-nearest neighbor search is performed using a query-by-example (QBE) approach. Furthermore, an automatic parametric contour tracing algorithm and an O(n) time piecewise linear polygonal approximation (PLPA) algorithm are developed for shape information mining of interesting objects within the image. A fuzzy object-oriented database based on the fuzzy object-oriented data (FOOD) model is developed to handle the fuzziness and uncertainty. 
    Three specific applications are presented: integrated land cover and texture pattern mining, shape information mining for change detection of lakes, and fuzzy normalized difference vegetation index (NDVI) pattern mining. The study results show the effectiveness of the proposed system prototype and the potentials for other applications in remote sensing.

  11. AgdbNet – antigen sequence database software for bacterial typing

    PubMed Central

    Jolley, Keith A; Maiden, Martin CJ

    2006-01-01

    Background: Bacterial typing schemes based on the sequences of genes encoding surface antigens require databases that provide a uniform, curated, and widely accepted nomenclature of the variants identified. Due to the differences in typing schemes, imposed by the diversity of genes targeted, creating these databases has typically required the writing of one-off code to link the database to a web interface. Here we describe agdbNet, widely applicable web database software that facilitates simultaneous BLAST querying of multiple loci using either nucleotide or peptide sequences. Results: Databases are described by XML files that are parsed by a Perl CGI script. Each database can have any number of loci, which may be defined by nucleotide and/or peptide sequences. The software is currently in use on at least five public databases for the typing of Neisseria meningitidis, Campylobacter jejuni and Streptococcus equi and can be set up to query internal isolate tables or suitably-configured external isolate databases, such as those used for multilocus sequence typing. The style of the resulting website can be fully configured by modifying stylesheets and through the use of customised header and footer files that surround the output of the script. Conclusion: The software provides a rapid means of setting up customised Internet antigen sequence databases. The flexible configuration options enable typing schemes with differing requirements to be accommodated. PMID:16790057

  12. The Histone Database: an integrated resource for histones and histone fold-containing proteins

    PubMed Central

    Mariño-Ramírez, Leonardo; Levine, Kevin M.; Morales, Mario; Zhang, Suiyuan; Moreland, R. Travis; Baxevanis, Andreas D.; Landsman, David

    2011-01-01

    Eukaryotic chromatin is composed of DNA and protein components—core histones—that act to compactly pack the DNA into nucleosomes, the fundamental building blocks of chromatin. These nucleosomes are connected to adjacent nucleosomes by linker histones. Nucleosomes are highly dynamic and, through various core histone post-translational modifications and incorporation of diverse histone variants, can serve as epigenetic marks to control processes such as gene expression and recombination. The Histone Sequence Database is a curated collection of sequences and structures of histones and non-histone proteins containing histone folds, assembled from major public databases. Here, we report a substantial increase in the number of sequences and taxonomic coverage for histone and histone fold-containing proteins available in the database. Additionally, the database now contains an expanded dataset that includes archaeal histone sequences. The database also provides comprehensive multiple sequence alignments for each of the four core histones (H2A, H2B, H3 and H4), the linker histones (H1/H5) and the archaeal histones. The database also includes current information on solved histone fold-containing structures. The Histone Sequence Database is an inclusive resource for the analysis of chromatin structure and function focused on histones and histone fold-containing proteins. Database URL: The Histone Sequence Database is freely available and can be accessed at http://research.nhgri.nih.gov/histones/. PMID:22025671

  13. Estimated carrier frequency of creatine transporter deficiency in females in the general population using functional characterization of novel missense variants in the SLC6A8 gene.

    PubMed

    DesRoches, Caro-Lyne; Patel, Jaina; Wang, Peixiang; Minassian, Berge; Salomons, Gajja S; Marshall, Christian R; Mercimek-Mahmutoglu, Saadet

    2015-07-10

    Creatine transporter deficiency (CRTR-D) is an X-linked inherited disorder of creatine transport. All males and about 50% of females have intellectual disability or cognitive dysfunction. Creatine deficiency on brain proton magnetic resonance spectroscopy and elevated urinary creatine to creatinine ratio are important biomarkers. Mutations in the SLC6A8 gene occur de novo in 30% of males. Despite reports of high prevalence of CRTR-D in males with intellectual disability, there are no true prevalence studies in the general population. To determine carrier frequency of CRTR-D in the general population we studied the variants in the SLC6A8 gene reported in the Exome Variant Server database and performed functional characterization of missense variants. We also analyzed synonymous and intronic variants for their predicted pathogenicity using in silico analysis tools. Nine missense variants were functionally analyzed using transient transfection by site-directed mutagenesis with In-Fusion HD Cloning in HeLa cells. Creatine uptake was measured by liquid chromatography tandem mass spectrometry. The c.1654G>T (p.Val552Leu) variant showed low residual creatine uptake activity of 35% of wild type transfected HeLa cells and was classified as pathogenic. Three variants (c.808G>A; p.Val270Met, c.942C>G; p.Phe314Leu and c.952G>A; p.Ala318Thr) were predicted to be pathogenic based on in silico analysis, but proved to be non-pathogenic by our functional analysis. The estimated carrier frequency of CRTR-D was 0.024% in females in the general population. We recommend functional studies for all novel missense variants by transient transfection followed by creatine uptake measurement by liquid chromatography tandem mass spectrometry as a fast and cost-effective method for the functional analysis of missense variants in the SLC6A8 gene. Crown Copyright © 2015. Published by Elsevier B.V. All rights reserved.

  14. Large-scale exploratory genetic analysis of cognitive impairment in Parkinson's disease.

    PubMed

    Mata, Ignacio F; Johnson, Catherine O; Leverenz, James B; Weintraub, Daniel; Trojanowski, John Q; Van Deerlin, Vivianna M; Ritz, Beate; Rausch, Rebecca; Factor, Stewart A; Wood-Siverio, Cathy; Quinn, Joseph F; Chung, Kathryn A; Peterson-Hiller, Amie L; Espay, Alberto J; Revilla, Fredy J; Devoto, Johnna; Yearout, Dora; Hu, Shu-Ching; Cholerton, Brenna A; Montine, Thomas J; Edwards, Karen L; Zabetian, Cyrus P

    2017-08-01

    Cognitive impairment is a common and disabling problem in Parkinson's disease (PD). Identification of genetic variants that influence the presence or severity of cognitive deficits in PD might provide a clearer understanding of the pathophysiology underlying this important nonmotor feature. We genotyped 1105 PD patients from the PD Cognitive Genetics Consortium for 249,336 variants using the NeuroX array. Participants underwent assessments of learning and memory (Hopkins Verbal Learning Test-Revised [HVLT-R]), working memory/executive function (Letter-Number Sequencing and Trail Making Test [TMT] A and B), language processing (semantic and phonemic verbal fluency), visuospatial abilities (Benton Judgment of Line Orientation [JoLO]), and global cognitive function (Montreal Cognitive Assessment). For common variants, we used linear regression to test for association between genotype and cognitive performance with adjustment for important covariates. Rare variants were analyzed using the optimal unified sequence kernel association test. The significance threshold was defined as a false discovery rate-corrected p-value (P_FDR) of 0.05. Eighteen common variants in 13 genomic regions exceeded the significance threshold for one of the cognitive tests. These included GBA rs2230288 (E326K; P_FDR = 2.7 × 10^-4) for JoLO, PARP4 rs9318600 (P_FDR = 0.006), and rs9581094 (P_FDR = 0.006) for HVLT-R total recall, and MTCL1 rs34877994 (P_FDR = 0.01) for TMT B-A. Analysis of rare variants did not yield any significant gene regions. We have conducted the first large-scale PD cognitive genetics analysis and nominated several new putative susceptibility genes for cognitive impairment in PD. These results will require replication in independent PD cohorts. Published by Elsevier Inc.

  15. XML Technology Assessment

    DTIC Science & Technology

    2001-01-01

    System (GCCS) Track Database Management System (TDBM) (3) GCCS Integrated Imagery and Intelligence (3) Intelligence Shared Data Server (ISDS) General ...The CTH is a powerful model that will allow more than just message systems to exchange information. It could be used for object-oriented databases, as...of the Naval Integrated Tactical Environmental System I (NITES I) is used as a case study to demonstrate the utility of this distributed component

  16. Computer-Assisted Promotion of Recreational Opportunities in Natural Resource Areas: A Demonstration and Case Example

    Treesearch

    Emilyn Sheffield; Leslie Furr; Charles Nelson

    1992-01-01

    Filevision IV is a multilayer imaging and data-base management system that combines drawing, filing and extensive report-writing capabilities (Filevision IV, 1988). Filevision IV users access data by attaching graphics to text-oriented data-base records. Tourist attractions, support services, and geographic features can be located on a base map of an area or region....

  17. Dynamic Terrain

    DTIC Science & Technology

    1991-12-30

    York, 1985. [Serway 86]: Raymond Serway, Physics for Scientists and Engineers. 2nd Edition, Saunders College Publishing, Philadelphia, 1986. pp. 200... Physical Modeling System 3.4 Realtime Hydrology 3.5 Soil Dynamics and Kinematics 4. Database Issues 4.1 Goals 4.2 Object Oriented Databases 4.3 Distributed...Animation System F. Constraints and Physical Modeling G. The PM Physical Modeling System H. Realtime Hydrology I. A Simplified Model of Soil Slumping

  18. Building an Integrated Environment for Multimedia

    NASA Technical Reports Server (NTRS)

    1997-01-01

    Multimedia courseware on the solar system and earth science suitable for use in elementary, middle, and high schools was developed under this grant. The courseware runs on Silicon Graphics, Incorporated (SGI) workstations and personal computers (PCs). There is also a version of the courseware accessible via the World Wide Web. Accompanying multimedia database systems were also developed to enhance the multimedia courseware. The database systems accompanying the PC software are based on the relational model, while the database systems accompanying the SGI software are based on the object-oriented model.

  19. SAbDab: the structural antibody database

    PubMed Central

    Dunbar, James; Krawczyk, Konrad; Leem, Jinwoo; Baker, Terry; Fuchs, Angelika; Georges, Guy; Shi, Jiye; Deane, Charlotte M.

    2014-01-01

    Structural antibody database (SAbDab; http://opig.stats.ox.ac.uk/webapps/sabdab) is an online resource containing all the publicly available antibody structures annotated and presented in a consistent fashion. The data are annotated with several properties including experimental information, gene details, correct heavy and light chain pairings, antigen details and, where available, antibody–antigen binding affinity. The user can select structures according to these attributes as well as structural properties such as complementarity determining region loop conformation and variable domain orientation. Individual structures, datasets and the complete database can be downloaded. PMID:24214988

  20. Saada: A Generator of Astronomical Database

    NASA Astrophysics Data System (ADS)

    Michel, L.

    2011-11-01

    Saada transforms a set of heterogeneous FITS files or VOtables of various categories (images, tables, spectra, etc.) into a powerful database deployed on the Web. Databases are located on your host and stay independent of any external server. This job doesn’t require writing code. Saada can mix data of various categories in multiple collections. Data collections can be linked to one another, making relevant browsing paths and allowing data-mining oriented queries. Saada supports 4 VO services (spectra, images, sources and TAP). Data collections can be published immediately after the deployment of the Web interface.

  1. Self-accommodation of B19' martensite in Ti-Ni shape memory alloys. Part III. Analysis of habit plane variant clusters by the geometrically nonlinear theory

    NASA Astrophysics Data System (ADS)

    Inamura, T.; Nishiura, T.; Kawano, H.; Hosoda, H.; Nishida, M.

    2012-06-01

    Competition between the invariant plane (IP) condition at the habit plane, the twin orientation relation (OR) and the kinematic compatibility (KC) at the junction plane (JP) of self-accommodated B19' martensite in Ti-Ni was investigated via the geometrically nonlinear theory to understand the habit plane variant (HPV) clusters presented in Parts I and II of this work. As the IP condition cannot be satisfied simultaneously with KC, an additional rotation Q is necessary to form compatible JPs for all HPV pairs. The rotation J necessary to form the exact twin OR between the major correspondence variants (CVs) in each HPV was also examined. The observed HPV cluster was not the cluster with the smallest Q but the one satisfying Q = J with a { ? 1}B19' type I twin at JP. Both Q and J are crucial to understanding the various HPV clusters in realistic transformations. Finally, a scheme for the ideal HPV cluster composed of six HPVs is also proposed.

  2. In Situ Observation of Kinetic Processes of Lath Bainite Nucleation and Growth by Laser Scanning Confocal Microscope in Reheated Weld Metals

    NASA Astrophysics Data System (ADS)

    Mao, Gaojun; Cao, Rui; Guo, Xili; Jiang, Yong; Chen, Jianhong

    2017-12-01

    The kinetic processes of nucleation and growth of bainite laths in reheated weld metals are observed and analyzed by a combination of a laser confocal scanning microscope and an electron backscattering diffraction with a field emission scanning electron microscope. The results indicate that the surface relief induced by phase transformation is able to reveal the real microstructural morphologies of bainite laths when viewed from various angles. Five nucleation modes and six types of growth behaviors of bainite laths are revealed. The bainite lath growth rates are measured to vary over a wide range, from 2 μm/s to higher than 2000 μm/s. The orientations of the bainite laths within a prior austenite grain are examined and denoted as different variants. On the basis of variant identification, the reason is analyzed for various growth rates which are demonstrated to be affected by (1) the density of the high-angle misorientation in it, (2) the included angle between habit planes of different variants, and (3) the direction of lath growth with respect to the free (polished) surface.

  3. Association of Arrhythmia-Related Genetic Variants With Phenotypes Documented in Electronic Medical Records.

    PubMed

    Van Driest, Sara L; Wells, Quinn S; Stallings, Sarah; Bush, William S; Gordon, Adam; Nickerson, Deborah A; Kim, Jerry H; Crosslin, David R; Jarvik, Gail P; Carrell, David S; Ralston, James D; Larson, Eric B; Bielinski, Suzette J; Olson, Janet E; Ye, Zi; Kullo, Iftikhar J; Abul-Husn, Noura S; Scott, Stuart A; Bottinger, Erwin; Almoguera, Berta; Connolly, John; Chiavacci, Rosetta; Hakonarson, Hakon; Rasmussen-Torvik, Laura J; Pan, Vivian; Persell, Stephen D; Smith, Maureen; Chisholm, Rex L; Kitchner, Terrie E; He, Max M; Brilliant, Murray H; Wallace, John R; Doheny, Kimberly F; Shoemaker, M Benjamin; Li, Rongling; Manolio, Teri A; Callis, Thomas E; Macaya, Daniela; Williams, Marc S; Carey, David; Kapplinger, Jamie D; Ackerman, Michael J; Ritchie, Marylyn D; Denny, Joshua C; Roden, Dan M

    2016-01-05

    Large-scale DNA sequencing identifies incidental rare variants in established Mendelian disease genes, but the frequency of related clinical phenotypes in unselected patient populations is not well established. Phenotype data from electronic medical records (EMRs) may provide a resource to assess the clinical relevance of rare variants. To determine the clinical phenotypes from EMRs for individuals with variants designated as pathogenic by expert review in arrhythmia susceptibility genes. This prospective cohort study included 2022 individuals recruited for nonantiarrhythmic drug exposure phenotypes from October 5, 2012, to September 30, 2013, for the Electronic Medical Records and Genomics Network Pharmacogenomics project from 7 US academic medical centers. Variants in SCN5A and KCNH2, disease genes for long QT and Brugada syndromes, were assessed for potential pathogenicity by 3 laboratories with ion channel expertise and by comparison with the ClinVar database. Relevant phenotypes were determined from EMRs, with data available from 2002 (or earlier for some sites) through September 10, 2014. One or more variants designated as pathogenic in SCN5A or KCNH2. Arrhythmia or electrocardiographic (ECG) phenotypes defined by International Classification of Diseases, Ninth Revision (ICD-9) codes, ECG data, and manual EMR review. Among 2022 study participants (median age, 61 years [interquartile range, 56-65 years]; 1118 [55%] female; 1491 [74%] white), a total of 122 rare (minor allele frequency <0.5%) nonsynonymous and splice-site variants in 2 arrhythmia susceptibility genes were identified in 223 individuals (11% of the study cohort). Forty-two variants in 63 participants were designated potentially pathogenic by at least 1 laboratory or ClinVar, with low concordance across laboratories (Cohen κ = 0.26). 
    An ICD-9 code for arrhythmia was found in 11 of 63 (17%) variant carriers vs 264 of 1959 (13%) of those without variants (difference, +4%; 95% CI, -5% to +13%; P = .35). In the 1270 (63%) with ECGs, corrected QT intervals were not different in variant carriers vs those without (median, 429 vs 439 milliseconds; difference, -10 milliseconds; 95% CI, -16 to +3 milliseconds; P = .17). After manual review, 22 of 63 participants (35%) with designated variants had any ECG or arrhythmia phenotype, and only 2 had corrected QT interval longer than 500 milliseconds. Among laboratories experienced in genetic testing for cardiac arrhythmia disorders, there was low concordance in designating SCN5A and KCNH2 variants as pathogenic. In an unselected population, the putatively pathogenic genetic variants were not associated with an abnormal phenotype. These findings raise questions about the implications of notifying patients of incidental genetic findings.

  4. Sodium taurocholate cotransporting polypeptide (NTCP) deficiency: Identification of a novel SLC10A1 mutation in two unrelated infants presenting with neonatal indirect hyperbilirubinemia and remarkable hypercholanemia

    PubMed Central

    Qiu, Jian-Wu; Deng, Mei; Cheng, Ying; Atif, Raza-Muhammad; Lin, Wei-Xia; Guo, Li; Li, Hua; Song, Yuan-Zong

    2017-01-01

    Sodium taurocholate cotransporting polypeptide (NTCP) is encoded by the gene SLC10A1 and expressed in the basolateral membrane of the hepatocyte, functioning to uptake bile acids from plasma. Although SLC10A1 has been cloned and NTCP function studied intensively for years, clinical description of NTCP deficiency remains rather limited. This study reported the genotypic and phenotypic features of two neonatal patients with NTCP deficiency. They both presented with neonatal indirect hyperbilirubinemia and remarkable hypercholanemia, and harbored the SLC10A1 variants c.800C>T (p.S267F) and c.263T>C (p.I88T). On genetic analysis of the two family trios, the latter missense variant was detected in trans with the former, a reported loss-of-function variant. Having not been reported in any databases, the c.263T>C (p.I88T) variant demonstrated an allele frequency of 0.67% (1/150) in healthy controls. Moreover, this variant involved a relatively conservative amino acid, and was predicted to be pathogenic or deleterious by changing the conformation of the NTCP molecule. In conclusion, the novel variant c.263T>C (p.I88T) in this study enriched the SLC10A1 mutation spectrum; the clinical findings lent support to the primary role of NTCP in hepatic bile acid clearance, and suggested that NTCP deficiency might be a contributing factor for the development of neonatal indirect hyperbilirubinemia. PMID:29290974

  5. Sodium taurocholate cotransporting polypeptide (NTCP) deficiency: Identification of a novel SLC10A1 mutation in two unrelated infants presenting with neonatal indirect hyperbilirubinemia and remarkable hypercholanemia.

    PubMed

    Qiu, Jian-Wu; Deng, Mei; Cheng, Ying; Atif, Raza-Muhammad; Lin, Wei-Xia; Guo, Li; Li, Hua; Song, Yuan-Zong

    2017-12-05

    Sodium taurocholate cotransporting polypeptide (NTCP) is encoded by the gene SLC10A1 and expressed in the basolateral membrane of the hepatocyte, functioning to uptake bile acids from plasma. Although SLC10A1 has been cloned and NTCP function studied intensively for years, clinical description of NTCP deficiency remains rather limited. This study reported the genotypic and phenotypic features of two neonatal patients with NTCP deficiency. They both presented with neonatal indirect hyperbilirubinemia and remarkable hypercholanemia, and harbored the SLC10A1 variants c.800C>T (p.S267F) and c.263T>C (p.I88T). On genetic analysis of the two family trios, the latter missense variant was detected in trans with the former, a reported loss-of-function variant. Having not been reported in any databases, the c.263T>C (p.I88T) variant demonstrated an allele frequency of 0.67% (1/150) in healthy controls. Moreover, this variant involved a relatively conservative amino acid, and was predicted to be pathogenic or deleterious by changing the conformation of the NTCP molecule. In conclusion, the novel variant c.263T>C (p.I88T) in this study enriched the SLC10A1 mutation spectrum; the clinical findings lent support to the primary role of NTCP in hepatic bile acid clearance, and suggested that NTCP deficiency might be a contributing factor for the development of neonatal indirect hyperbilirubinemia.

  6. Targeted Analysis of Whole Genome Sequence Data to Diagnose Genetic Cardiomyopathy

    DOE PAGES

    Golbus, Jessica R.; Puckelwartz, Megan J.; Dellefave-Castillo, Lisa; ...

    2014-09-01

    Background—Cardiomyopathy is highly heritable but genetically diverse. At present, genetic testing for cardiomyopathy uses targeted sequencing to simultaneously assess the coding regions of more than 50 genes. New genes are routinely added to panels to improve the diagnostic yield. With the anticipated $1000 genome, it is expected that genetic testing will shift towards comprehensive genome sequencing accompanied by targeted gene analysis. Therefore, we assessed the reliability of whole genome sequencing and targeted analysis to identify cardiomyopathy variants in 11 subjects with cardiomyopathy. Methods and Results—Whole genome sequencing with an average of 37× coverage was combined with targeted analysis focused on 204 genes linked to cardiomyopathy. Genetic variants were scored using multiple prediction algorithms combined with frequency data from public databases. This pipeline yielded 1-14 potentially pathogenic variants per individual. Variants were further analyzed using clinical criteria and/or segregation analysis. Three of three previously identified primary mutations were detected by this analysis. In six subjects for whom the primary mutation was previously unknown, we identified mutations that segregated with disease, had clinical correlates, and/or had additional pathological correlation to provide evidence for causality. For two subjects with previously known primary mutations, we identified additional variants that may act as modifiers of disease severity. In total, we identified the likely pathological mutation in 9 of 11 (82%) subjects. We conclude that these pilot data demonstrate that ~30-40× coverage whole genome sequencing combined with targeted analysis is feasible and sensitive to identify rare variants in cardiomyopathy-associated genes.

  7. Small intragenic deletion in FOXP2 associated with childhood apraxia of speech and dysarthria.

    PubMed

    Turner, Samantha J; Hildebrand, Michael S; Block, Susan; Damiano, John; Fahey, Michael; Reilly, Sheena; Bahlo, Melanie; Scheffer, Ingrid E; Morgan, Angela T

    2013-09-01

    Relatively little is known about the neurobiological basis of speech disorders although genetic determinants are increasingly recognized. The first gene for primary speech disorder was FOXP2, identified in a large, informative family with verbal and oral dyspraxia. Subsequently, many de novo and familial cases with a severe speech disorder associated with FOXP2 mutations have been reported. These mutations include sequencing alterations, translocations, uniparental disomy, and genomic copy number variants. We studied eight probands with speech disorder and their families. Family members were phenotyped using a comprehensive assessment of speech, oral motor function, language, literacy skills, and cognition. Coding regions of FOXP2 were screened to identify novel variants. Segregation of the variant was determined in the probands' families. Variants were identified in two probands. One child with severe motor speech disorder had a small de novo intragenic FOXP2 deletion. His phenotype included features of childhood apraxia of speech and dysarthria, oral motor dyspraxia, receptive and expressive language disorder, and literacy difficulties. The other variant was found in a family in two of three family members with stuttering, and also in the mother with oral motor impairment. This variant was considered a benign polymorphism as it was predicted to be non-pathogenic with in silico tools and found in database controls. This is the first report of a small intragenic deletion of FOXP2 that is likely to be the cause of severe motor speech disorder associated with language and literacy problems. Copyright © 2013 Wiley Periodicals, Inc.

  8. Genetic variability in ABCB1, occupational pesticide exposure, and Parkinson's disease.

    PubMed

    Narayan, Shilpa; Sinsheimer, Janet S; Paul, Kimberly C; Liew, Zeyan; Cockburn, Myles; Bronstein, Jeff M; Ritz, Beate

    2015-11-01

    Studies suggested that variants in the ABCB1 gene encoding P-glycoprotein, a xenobiotic transporter, may increase susceptibility to pesticide exposures linked to Parkinson's Disease (PD) risk. To investigate the joint impact of two ABCB1 polymorphisms and pesticide exposures on PD risk. In a population-based case control study, we genotyped ABCB1 gene variants at rs1045642 (c.3435C/T) and rs2032582 (c.2677G/T/A) and assessed occupational exposures to organochlorine (OC) and organophosphorus (OP) pesticides based on self-reported occupational use and record-based ambient workplace exposures for 282 PD cases and 514 controls of European ancestry. We identified active ingredients in self-reported occupational use pesticides from a California database and estimated ambient workplace exposures between 1974 and 1999 employing a geographic information system together with records for state pesticide and land use. With unconditional logistic regression, we estimated marginal and joint contributions for occupational pesticide exposures and ABCB1 variants in PD. For occupationally exposed carriers of homozygous ABCB1 variant genotypes, we estimated odds ratios of 1.89 [95% confidence interval (CI): (0.87, 4.07)] to 3.71 [95% CI: (1.96, 7.02)], with the highest odds ratios estimated for occupationally exposed carriers of homozygous ABCB1 variant genotypes at both SNPs; but we found no multiplicative scale interactions. This study lends support to a previous report that commonly used pesticides, specifically OCs and OPs, and variant ABCB1 genotypes at two polymorphic sites jointly increase risk of PD. Copyright © 2015 Elsevier Inc. All rights reserved.
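
    The odds ratios quoted in this abstract come from unconditional logistic regression, which this listing does not reproduce. As a rough illustration of the quantity being reported, the sketch below swaps in a simpler technique: an unadjusted odds ratio with a Wald 95% confidence interval from a 2×2 table. The counts are hypothetical and are not taken from the study, and `odds_ratio_ci` is an illustrative name.

```python
import math


def odds_ratio_ci(a, b, c, d, z=1.96):
    """Unadjusted odds ratio and Wald confidence interval from a 2x2 table.

    a: exposed cases      b: unexposed cases
    c: exposed controls   d: unexposed controls
    z: normal quantile (1.96 gives an approximate 95% interval)
    """
    odds_ratio = (a * d) / (b * c)
    # Standard error of log(OR) (Woolf's method).
    se_log_or = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(odds_ratio) - z * se_log_or)
    upper = math.exp(math.log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper


# Hypothetical counts, for illustration only (not from the study).
or_value, ci_low, ci_high = odds_ratio_ci(a=30, b=70, c=20, d=80)
print(f"OR = {or_value:.2f}, 95% CI: ({ci_low:.2f}, {ci_high:.2f})")
```

    An interval that includes 1, such as the (0.87, 4.07) interval quoted in the abstract, is compatible with no association at the 5% level.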

  9. Whole-exome sequencing identifies novel candidate predisposition genes for familial polycythemia vera.

    PubMed

    Hirvonen, Elina A M; Pitkänen, Esa; Hemminki, Kari; Aaltonen, Lauri A; Kilpivaara, Outi

    2017-04-20

    Polycythemia vera (PV), characterized by massive production of erythrocytes, is one of the myeloproliferative neoplasms. Most patients carry a somatic gain-of-function mutation in JAK2, c.1849G > T (p.Val617Phe), leading to constitutive activation of JAK-STAT signaling pathway. Familial clustering is also observed occasionally, but high-penetrance predisposition genes to PV have remained unidentified. We studied the predisposition to PV by exome sequencing (three cases) in a Finnish PV family with four patients. The 12 shared variants (maximum allowed minor allele frequency <0.001 in Finnish population in ExAC database) predicted damaging in silico and absent in an additional control set of over 500 Finns were further validated by Sanger sequencing in a fourth affected family member. Three novel predisposition candidate variants were identified: c.1254C > G (p.Phe418Leu) in ZXDC, c.1931C > G (p.Pro644Arg) in ATN1, and c.701G > A (p.Arg234Gln) in LRRC3. We also observed a rare, predicted benign germline variant c.2912C > G (p.Ala971Gly) in BCORL1 in all four patients. Somatic mutations in BCORL1 have been reported in myeloid malignancies. We further screened the variants in eight PV patients in six other Finnish families, but no other carriers were found. Exome sequencing provides a powerful tool for the identification of novel variants, and understanding the familial predisposition of diseases. This is the first report on Finnish familial PV cases, and we identified three novel candidate variants that may predispose to the disease.
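
    The filtering steps described in this abstract (shared by the sequenced cases, rare in the reference population, predicted damaging, absent from controls) can be sketched as a simple predicate. This is a minimal sketch under stated assumptions: the `Variant` fields, the function name `candidate_variants`, and the toy records are all hypothetical, and the real study worked from annotated exome calls rather than hand-built objects.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Variant:
    gene: str                 # hypothetical gene label
    pop_maf: float            # minor allele frequency in the reference population
    predicted_damaging: bool  # in silico prediction
    carriers: frozenset       # IDs of sequenced family members carrying the variant
    seen_in_controls: bool    # present in the additional control set


def candidate_variants(variants, affected_ids, max_maf=0.001):
    """Keep variants shared by all affected IDs that pass every filter."""
    affected = frozenset(affected_ids)
    return [
        v for v in variants
        if affected <= v.carriers        # shared by every sequenced case
        and v.pop_maf < max_maf          # rare in the reference population
        and v.predicted_damaging         # predicted damaging in silico
        and not v.seen_in_controls       # absent from the control set
    ]


# Toy records, for illustration only (not data from the study).
cases = {"P1", "P2", "P3"}
toy = [
    Variant("GENE_A", 0.0002, True, frozenset(cases), False),
    Variant("GENE_B", 0.0100, True, frozenset(cases), False),        # too common
    Variant("GENE_C", 0.0001, True, frozenset({"P1", "P2"}), False),  # not shared
    Variant("GENE_D", 0.0001, True, frozenset(cases), True),         # in controls
]
kept = candidate_variants(toy, cases)
print([v.gene for v in kept])
```

    In the study, variants surviving this kind of filter were then validated by Sanger sequencing in the fourth affected family member, a step the sketch leaves out.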

  10. CNVinspector: a web-based tool for the interactive evaluation of copy number variations in single patients and in cohorts.

    PubMed

    Knierim, Ellen; Schwarz, Jana Marie; Schuelke, Markus; Seelow, Dominik

    2013-08-01

    Many genetic disorders are caused by copy number variations (CNVs) in the human genome. However, the large number of benign CNV polymorphisms makes it difficult to delineate causative variants for a certain disease phenotype. Hence, we set out to create software that accumulates and visualises locus-specific knowledge and enables clinicians to study their own CNVs in the context of known polymorphisms and disease variants. CNV data from healthy cohorts (Database of Genomic Variants) and from disease-related databases (DECIPHER) were integrated into a joint resource. Data are presented in an interactive web-based application that allows inspection, evaluation and filtering of CNVs in single individuals or in entire cohorts. CNVinspector provides simple interfaces to upload CNV data, compare them with own or published control data and visualise the results in graphical interfaces. Beyond choosing control data from different public studies, platforms and methods, dedicated filter options allow the detection of CNVs that are either enriched in patients or depleted in controls. Alternatively, a search can be restricted to those CNVs that appear in individuals of similar clinical phenotype. For each gene of interest within a CNV, we provide a link to NCBI, ENSEMBL and the GeneDistiller search engine to browse for potential disease-associated genes. With its user-friendly handling, the integration of control data and the filtering options, CNVinspector will facilitate the daily work of clinical geneticists and accelerate the delineation of new syndromes and gene functions. CNVinspector is freely accessible under http://www.cnvinspector.org.

  11. Tensile strength of various nylon PA6 specimen modes

    NASA Astrophysics Data System (ADS)

    Raz, Karel; Zahalka, Martin

    2017-05-01

    This article explores the influence of production technique on the strength of nylon parts. Identical specimens were manufactured by various techniques. The material of specimens was nylon PA6. 3D printing and injection molding were used, with various orientations of printed layers, and various orientations of specimens in the working space of the 3D printer. The variants are described in detail. A special mold was used for the injection molding process in order to make specimens with and without a weld line. The effect of this weld line was evaluated. All specimens were tested using the standard tensile test configuration. The strength was compared. It was found that the same plastic material has very different mechanical properties depending on the production process.

  12. Effect of Isothermal Temperature on Growth Behavior of Nanostructured Bainite in Laser Cladded Coatings

    PubMed Central

    Guo, Yanbing; Yao, Chengwu; Feng, Kai; Li, Zhuguo; Chu, Paul K.; Wu, Yixiong

    2017-01-01

    The growth and propagation behavior of austenite-to-bainite isothermal transformation in laser-cladded, Si-rich, and Fe-based coatings is investigated. The crystallographic features, orientation relationship at different isothermal temperatures, and the morphology of the nanostructured bainite are determined. The Nishiyama-Wassermann type orientation relationship is observed at a high temperature and at a low temperature, and mixed Nishiyama-Wassermann and Kurdjumov-Sachs mechanisms are seen. The growth direction is investigated by the partial dislocation theory and an extrapolated model based on the repeated formation of lenticular-shaped subunits and pile-up along the close-packed directions of the close-packed planes. The variants of the bainite growth directions would be more selective at the high transformation temperature. PMID:28773161

  13. Cyclic Degradation Behavior of 〈001〉-Oriented Fe-Mn-Al-Ni Single Crystals in Tension

    NASA Astrophysics Data System (ADS)

    Vollmer, M.; Kriegel, M. J.; Krooß, P.; Martin, S.; Klemm, V.; Weidner, A.; Chumlyakov, Y.; Biermann, H.; Rafaja, D.; Niendorf, T.

    2017-12-01

    In the present study, functional fatigue behavior of a near 〈001〉-oriented Fe-Mn-Al-Ni single crystal was investigated under tensile load. An incremental strain test up to 3.5% strain and cyclic tests up to 25 cycles revealed rapid pseudoelastic degradation. Progressive microstructural degradation was studied by in situ scanning electron microscopy. The results show a partially inhibited reactivation of previously formed martensite and proceeding activation of untransformed areas in subsequent cycles. The preferentially formed martensite variants were identified by means of Schmid factor calculation and the Kurdjumov-Sachs relationship. Post mortem transmission electron microscopy investigations shed light on the prevailing degradation mechanisms. Different types of dislocations were found promoting the progressive degradation during cyclic loading.

  14. The 2014 Nucleic Acids Research Database Issue and an updated NAR online Molecular Biology Database Collection.

    PubMed

    Fernández-Suárez, Xosé M; Rigden, Daniel J; Galperin, Michael Y

    2014-01-01

    The 2014 Nucleic Acids Research Database Issue includes descriptions of 58 new molecular biology databases and recent updates to 123 databases previously featured in NAR or other journals. For convenience, the issue is now divided into eight sections that reflect major subject categories. Among the highlights of this issue are six databases of the transcription factor binding sites in various organisms and updates on such popular databases as CAZy, Database of Genomic Variants (DGV), dbGaP, DrugBank, KEGG, miRBase, Pfam, Reactome, SEED, TCDB and UniProt. There is a strong block of structural databases, which includes, among others, the new RNA Bricks database, updates on PDBe, PDBsum, ArchDB, Gene3D, ModBase, Nucleic Acid Database and the recently revived iPfam database. An update on the NCBI's MMDB describes VAST+, an improved tool for protein structure comparison. Two articles highlight the development of the Structural Classification of Proteins (SCOP) database: one describes SCOPe, which automates assignment of new structures to the existing SCOP hierarchy; the other one describes the first version of SCOP2, with its more flexible approach to classifying protein structures. This issue also includes a collection of articles on bacterial taxonomy and metagenomics, which includes updates on the List of Prokaryotic Names with Standing in Nomenclature (LPSN), Ribosomal Database Project (RDP), the Silva/LTP project and several new metagenomics resources. The NAR online Molecular Biology Database Collection, http://www.oxfordjournals.org/nar/database/c/, has been expanded to 1552 databases. The entire Database Issue is freely available online on the Nucleic Acids Research website (http://nar.oxfordjournals.org/).

  15. Peptidomimetic Escape Mechanisms Arise via Genetic Diversity in the Ligand-Binding Site of the Hepatitis C Virus NS3/4A Serine Protease

    PubMed Central

    Welsch, Christoph; Shimakami, Tetsuro; Hartmann, Christoph; Yang, Yan; Domingues, Francisco S.; Lengauer, Thomas; Zeuzem, Stefan; Lemon, Stanley M.

    2011-01-01

    Background & Aims: It is a challenge to develop direct-acting antiviral agents (DAAs) that target the NS3/4A protease of hepatitis C virus (HCV) because resistant variants develop. Ketoamide compounds, designed to mimic the natural protease substrate, have been developed as inhibitors. However, clinical trials have revealed rapid selection of resistant mutants, most of which are considered to be pre-existing variants. Methods: We identified residues near the ketoamide-binding site in X-ray structures of the genotype 1a protease, co-crystallized with boceprevir or a telaprevir-like ligand, and then identified variants at these positions in 219 genotype 1 sequences from a public database. We used side-chain modeling to assess the potential effects of these variants on the interaction between ketoamide and the protease, and compared these results with the phenotypic effects on ketoamide resistance, RNA replication capacity, and infectious virus yields in a cell culture model of infection. Results: Thirteen natural binding-site variants with potential for ketoamide resistance were identified at 10 residues in the protease, near the ketoamide binding site. Rotamer analysis of amino acid side-chain conformations indicated that 2 variants (R155K and D168G) could affect binding of telaprevir more than boceprevir. Measurements of antiviral susceptibility in cell culture studies were consistent with this observation. Four variants (Q41H, I132V, R155K, and D168G) caused low-to-moderate levels of ketoamide resistance; 3 of these were highly fit (Q41H, I132V, and R155K). Conclusions: Using a comprehensive sequence and structure-based analysis, we showed how natural variation in the HCV protease NS3/4A sequences might affect susceptibility to first-generation DAAs. These findings increase our understanding of the molecular basis of ketoamide resistance among naturally existing viral variants. PMID:22155364

  16. Retrieving high-resolution images over the Internet from an anatomical image database

    NASA Astrophysics Data System (ADS)

    Strupp-Adams, Annette; Henderson, Earl

    1999-12-01

    The Visible Human Data set is an important contribution to the national collection of anatomical images. To enhance the availability of these images, the National Library of Medicine has supported the design and development of a prototype object-oriented image database which imports, stores, and distributes high resolution anatomical images in both pixel and voxel formats. One of the key database modules is its client-server Internet interface. This Web interface provides a query engine with retrieval access to high-resolution anatomical images that range in size from 100KB for browser viewable rendered images, to 1GB for anatomical structures in voxel file formats. The Web query and retrieval client-server system is composed of applet GUIs, servlets, and RMI application modules which communicate with each other to allow users to query for specific anatomical structures, and retrieve image data as well as associated anatomical images from the database. Selected images can be downloaded individually as single files via HTTP or downloaded in batch-mode over the Internet to the user's machine through an applet that uses Netscape's Object Signing mechanism. The image database uses ObjectDesign's object-oriented DBMS, ObjectStore, which has a Java interface. The query and retrieval system has been tested with a Java-CDE window system, and on the x86 architecture using Windows NT 4.0. This paper describes the Java applet client search engine that queries the database; the Java client module that enables users to view anatomical images online; the Java application server interface to the database which organizes data returned to the user, and its distribution engine that allows users to download image files individually and/or in batch-mode.

  17. Using diverse U.S. beef cattle genomes to identify missense mutations in EPAS1, a gene associated with high-altitude pulmonary hypertension

    USDA-ARS's Scientific Manuscript database

    The availability of whole genome sequence (WGS) data has made it possible to discover protein variants in silico. However, bovine WGS databases comprised of related influential sires from relatively few breeds tend to underrepresent the breadth of genetic diversity in U.S. beef cattle. Thus, our ...

  18. Variant terminology. [for aerospace information systems]

    NASA Technical Reports Server (NTRS)

    Buchan, Ronald L.

    1991-01-01

    A system called Variant Terminology Switching (VTS) is set forth that is intended to provide computer-assisted spellings for terms that have American and British versions. VTS is based on the use of brackets, parentheses, and other symbols in conjunction with letters that distinguish American and British spellings. The symbols are used in the systems as indicators of actions such as deleting, adding, and replacing letters as well as replacing entire words and concepts. The system is shown to be useful for the intended purpose and also for the recognition of misspellings and for the standardization of computerized input/output. The VTS system is of interest to the development of international retrieval systems for aerospace and other technical databases that enhance the use by the global scientific community.

  19. GEMINI: Integrative Exploration of Genetic Variation and Genome Annotations

    PubMed Central

    Paila, Umadevi; Chapman, Brad A.; Kirchner, Rory; Quinlan, Aaron R.

    2013-01-01

    Modern DNA sequencing technologies enable geneticists to rapidly identify genetic variation among many human genomes. However, isolating the minority of variants underlying disease remains an important, yet formidable challenge for medical genetics. We have developed GEMINI (GEnome MINIng), a flexible software package for exploring all forms of human genetic variation. Unlike existing tools, GEMINI integrates genetic variation with a diverse and adaptable set of genome annotations (e.g., dbSNP, ENCODE, UCSC, ClinVar, KEGG) into a unified database to facilitate interpretation and data exploration. Whereas other methods provide an inflexible set of variant filters or prioritization methods, GEMINI allows researchers to compose complex queries based on sample genotypes, inheritance patterns, and both pre-installed and custom genome annotations. GEMINI also provides methods for ad hoc queries and data exploration, a simple programming interface for custom analyses that leverage the underlying database, and both command line and graphical tools for common analyses. We demonstrate GEMINI's utility for exploring variation in personal genomes and family based genetic studies, and illustrate its ability to scale to studies involving thousands of human samples. GEMINI is designed for reproducibility and flexibility and our goal is to provide researchers with a standard framework for medical genomics. PMID:23874191

  20. The utilization of neural nets in populating an object-oriented database

    NASA Technical Reports Server (NTRS)

    Campbell, William J.; Hill, Scott E.; Cromp, Robert F.

    1989-01-01

    Existing NASA supported scientific data bases are usually developed, managed and populated in a tedious, error prone and self-limiting way in terms of what can be described in a relational Data Base Management System (DBMS). The next generation Earth remote sensing platforms (i.e., the Earth Observation System, EOS) will be capable of generating data at a rate of over 300 Mbs per second from a suite of instruments designed for different applications. What is needed is an innovative approach that creates object-oriented databases that segment, characterize, catalog and are manageable in a domain-specific context and whose contents are available interactively and in near-real-time to the user community. Described here is work in progress that utilizes an artificial neural net approach to characterize satellite imagery of undefined objects into high-level data objects. The characterized data is then dynamically allocated to an object-oriented data base where it can be reviewed and assessed by a user. The definition, development, and evolution of the overall data system model are steps in the creation of an application-driven knowledge-based scientific information system.

  1. Childhood Abuse Experiences and the COMT and MTHFR Genetic Variants Associated With Male Sexual Orientation in the Han Chinese Populations: A Case-Control Study.

    PubMed

    Qin, Jia-Bi; Zhao, Guang-Lu; Wang, Feng; Cai, Yu-Mao; Lan, Li-Na; Yang, Lin; Feng, Tie-Jian

    2018-01-01

    Although it is widely acknowledged that genetic and environmental factors are involved in the development of male homosexuality, the causes are not fully understood. To explore the association and interaction of childhood abuse experiences and genetic variants of the catechol-O-methyltransferase (COMT) and methylenetetrahydrofolate reductase (MTHFR) genes with the development of male homosexuality. A case-control study of 537 exclusively homosexual men and 583 exclusively heterosexual men was conducted, with data collected from March 2013 to August 2015. Data were analyzed using χ² tests and logistic regression models. Sociodemographic characteristics, childhood abuse experiences, and polymorphisms of COMT at rs4680, rs4818, and rs6267 and MTHFR at rs1801133. More frequent occurrence of physical (adjusted odds ratio [aOR] = 1.78), emotional (aOR = 2.07), and sexual (aOR = 2.53) abuse during childhood was significantly associated with the development of male homosexuality. The polymorphisms of MTHFR at rs1801133 and COMT at rs4818 also were significantly associated with the development of male homosexuality in the homozygote comparisons (T/T vs C/C at rs1801133, aOR = 1.68; G/G vs C/C at rs4818, aOR = 1.75). In addition, significant interaction effects between childhood abuse experiences and the COMT and MTHFR genetic variants on the development of male homosexuality were found. This is the first time that an association of childhood abuse, COMT and MTHFR genetic variants, and their interactions with development of male homosexuality was exhaustively explored, which could help provide new insight into the etiology of male homosexuality. Because homosexual men are a relatively obscure population, it was impossible to select the study participants by random sampling, which could lead to selection bias. In addition, because this was a case-control study, recall bias was inevitable, and we could not verify causality.
    Childhood abuse and the COMT and MTHFR genetic variants could be positively associated with the development of homosexuality. However, it remains unknown how these factors jointly play a role in the development of homosexuality, and more studies in different ethnic populations and with a larger sample and a prospective design are required to confirm our findings. Qin J-B, Zhao G-L, Wang F, et al. Childhood Abuse Experiences and the COMT and MTHFR Genetic Variants Associated With Male Sexual Orientation in the Han Chinese Populations: A Case-Control Study. J Sex Med 2018;15:29-42. Copyright © 2017 International Society for Sexual Medicine. Published by Elsevier Inc. All rights reserved.

  2. C++, objected-oriented programming, and astronomical data models

    NASA Technical Reports Server (NTRS)

    Farris, A.

    1992-01-01

    Contemporary astronomy is characterized by increasingly complex instruments and observational techniques, higher data collection rates, and large data archives, placing severe stress on software analysis systems. The object-oriented paradigm represents a significant new approach to software design and implementation that holds great promise for dealing with this increased complexity. The basic concepts of this approach will be characterized in contrast to more traditional procedure-oriented approaches. The fundamental features of object-oriented programming will be discussed from a C++ programming language perspective, using examples familiar to astronomers. This discussion will focus on objects, classes and their relevance to the data type system; the principle of information hiding; and the use of inheritance to implement generalization/specialization relationships. Drawing on the object-oriented approach, features of a new database model to support astronomical data analysis will be presented.

  3. SNPdbe: constructing an nsSNP functional impacts database.

    PubMed

    Schaefer, Christian; Meier, Alice; Rost, Burkhard; Bromberg, Yana

    2012-02-15

    Many existing databases annotate experimentally characterized single nucleotide polymorphisms (SNPs). Each non-synonymous SNP (nsSNP) changes one amino acid in the gene product (single amino acid substitution; SAAS). This change can either affect protein function or be neutral in that respect. Most polymorphisms lack experimental annotation of their functional impact. Here, we introduce SNPdbe (SNP database of effects), with predictions of computationally annotated functional impacts of SNPs. Database entries represent nsSNPs in dbSNP and the 1000 Genomes collection, as well as variants from UniProt and PMD. SAASs come from >2600 organisms, 'human' being the most prevalent. The impact of each SAAS on protein function is predicted using the SNAP and SIFT algorithms and augmented with experimentally derived function/structure information and disease associations from PMD, OMIM and UniProt. SNPdbe is consistently updated and easily augmented with new sources of information. The database is available as a MySQL dump and via a web front end that allows searches with any combination of organism names, sequences and mutation IDs. http://www.rostlab.org/services/snpdbe.

  4. MIPS: a database for genomes and protein sequences.

    PubMed Central

    Mewes, H W; Heumann, K; Kaps, A; Mayer, K; Pfeiffer, F; Stocker, S; Frishman, D

    1999-01-01

    The Munich Information Center for Protein Sequences (MIPS-GSF), Martinsried near Munich, Germany, develops and maintains genome oriented databases. It is commonplace that the amount of sequence data available increases rapidly, but not the capacity of qualified manual annotation at the sequence databases. Therefore, our strategy aims to cope with the data stream by the comprehensive application of analysis tools to sequences of complete genomes, the systematic classification of protein sequences and the active support of sequence analysis and functional genomics projects. This report describes the systematic and up-to-date analysis of genomes (PEDANT), a comprehensive database of the yeast genome (MYGD), a database reflecting the progress in sequencing the Arabidopsis thaliana genome (MATD), the database of assembled, annotated human EST clusters (MEST), and the collection of protein sequence data within the framework of the PIR-International Protein Sequence Database (described elsewhere in this volume). MIPS provides access through its WWW server (http://www.mips.biochem.mpg.de) to a spectrum of generic databases, including the above mentioned as well as a database of protein families (PROTFAM), the MITOP database, and the all-against-all FASTA database. PMID:9847138

  5. Pedestrian detection based on redundant wavelet transform

    NASA Astrophysics Data System (ADS)

    Huang, Lin; Ji, Liping; Hu, Ping; Yang, Tiejun

    2016-10-01

    Intelligent video surveillance analyzes video or image sequences captured by a fixed or mobile surveillance camera, and includes moving object detection, segmentation and recognition. With it, users can be notified immediately of an abnormal situation. Pedestrian detection plays an important role in an intelligent video surveillance system, and it is also a key technology in the field of intelligent vehicles. Pedestrian detection is therefore vital for traffic management optimization, security early warning and abnormal behavior detection. Generally, pedestrian detection can be summarized in three steps: first, estimate moving areas; then, extract features of the region of interest; finally, classify using a classifier. The redundant wavelet transform (RWT) overcomes the shift variance of the discrete wavelet transform, and it performs better in motion estimation than the discrete wavelet transform. To address the problem of detecting multiple pedestrians moving at different speeds, we present a pedestrian detection algorithm based on motion estimation using RWT, combining histogram of oriented gradients (HOG) and support vector machine (SVM). Firstly, three intensities of movement (IoM) are estimated using RWT and the corresponding areas are segmented. According to the different IoM, a region proposal (RP) is generated. Then, the features of an RP are extracted using HOG. Finally, the features are fed into an SVM trained on pedestrian databases and the final detection results are obtained. Experiments show that the proposed algorithm can detect pedestrians accurately and efficiently.
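
    The claim in this abstract that the redundant wavelet transform avoids the shift-variance problem of the discrete wavelet transform can be illustrated on a one-dimensional signal. The sketch below is an assumption-laden toy, not the authors' method: it uses a single-level Haar filter with circular boundaries, and all function names are illustrative. The decimated transform keeps every second coefficient, whereas the undecimated (redundant) version keeps all of them.

```python
import math

SQRT2 = math.sqrt(2.0)


def haar_detail_decimated(x):
    """Level-1 Haar detail coefficients with downsampling by 2 (standard DWT)."""
    return [(x[2 * k] - x[2 * k + 1]) / SQRT2 for k in range(len(x) // 2)]


def haar_detail_undecimated(x):
    """Level-1 Haar detail coefficients without downsampling (circular boundary)."""
    n = len(x)
    return [(x[i] - x[(i + 1) % n]) / SQRT2 for i in range(n)]


def circular_shift(x, k):
    """Shift a list to the right by k positions, wrapping around."""
    k %= len(x)
    return list(x[-k:]) + list(x[:-k]) if k else list(x)


def energy(coeffs):
    return sum(c * c for c in coeffs)


signal = [0, 0, 4, 4, 0, 0, 0, 0]
shifted = circular_shift(signal, 1)

# Redundant transform: shifting the input just shifts the coefficients.
rwt_is_shift_equivariant = (
    haar_detail_undecimated(shifted)
    == circular_shift(haar_detail_undecimated(signal), 1)
)

# Decimated transform: the same one-sample shift changes the detail energy.
dwt_energy_before = energy(haar_detail_decimated(signal))
dwt_energy_after = energy(haar_detail_decimated(shifted))

print(rwt_is_shift_equivariant, dwt_energy_before, dwt_energy_after)
```

    For this signal the decimated detail band is empty before the shift and non-empty after it, which is the kind of instability that makes the decimated transform awkward for motion estimation.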

  6. NPS (Naval Postgraduate School) Supply Requisition Database - Interactive Software as an Alternative to Written Instructions.

    DTIC Science & Technology

    1986-03-01

    SRdb ... 35 APPENDIX A: ABBREVIATIONS AND ACRONYMS ... 37 APPENDIX B: USER'S MANUAL ... 38 APPENDIX C: DATABASE...percentage of situations. The purpose of this paper is to examine and propose a software-oriented alternative to the current manual, instruction-driven...Department Customer Service Manual [Ref. 1] and the applicable NPS Comptroller instruction [Ref. 2]. Several modifications to these written guidelines

  7. CampusGIS of the University of Cologne: a tool for orientation, navigation, and management

    NASA Astrophysics Data System (ADS)

    Baaser, U.; Gnyp, M. L.; Hennig, S.; Hoffmeister, D.; Köhn, N.; Laudien, R.; Bareth, G.

    2006-10-01

    The working group for GIS and Remote Sensing at the Department of Geography at the University of Cologne has established a WebGIS called CampusGIS of the University of Cologne. The overall task of the CampusGIS is the connection of several existing databases at the University of Cologne with spatial data. These existing databases comprise data about staff, buildings, rooms, lectures, and general infrastructure such as bus stops. This information had not yet been linked to its spatial location. Therefore, a GIS-based method was developed to link all the different databases to spatial entities. In keeping with the philosophy of the CampusGIS, an online GUI was programmed which enables users to search for staff, buildings, or institutions. The query results are linked to the GIS database, which allows the visualization of the spatial location of the searched entity. This system was established in 2005 and has been operational since early 2006. In this contribution, the focus is on further developments. First results of (i) including routing services in, (ii) programming GUIs for mobile devices for, and (iii) including infrastructure management tools in the CampusGIS are presented. Consequently, the CampusGIS is not only available for spatial information retrieval and orientation. It also serves for on-campus navigation and administrative management.

  8. Development of a web-based video management and application processing system

    NASA Astrophysics Data System (ADS)

    Chan, Shermann S.; Wu, Yi; Li, Qing; Zhuang, Yueting

    2001-07-01

    Facilitating efficient video manipulation and access in a web-based environment is becoming a popular trend in video applications. In this paper, we present a web-oriented video management and application processing system, based on our previous work on multimedia database and content-based retrieval. In particular, we extend the VideoMAP architecture with specific web-oriented mechanisms, which include: (1) Concurrency control facilities for the editing of video data among different types of users, such as Video Administrator, Video Producer, Video Editor, and Video Query Client; different users are assigned various priority levels for different operations on the database. (2) A versatile video retrieval mechanism which employs a hybrid approach by integrating a query-based (database) mechanism with content-based retrieval (CBR) functions; its specific language (CAROL/ST with CBR) supports spatio-temporal semantics of video objects, and also offers an improved mechanism to describe the visual content of videos by a content-based analysis method. (3) A query profiling database which records the 'histories' of various clients' query activities; such profiles can be used to provide the default query template when a similar query is encountered by the same kind of users. An experimental prototype system is being developed based on the existing VideoMAP prototype system, using Java and VC++ on the PC platform.

  9. Automated correction of improperly rotated diffusion gradient orientations in diffusion weighted MRI.

    PubMed

    Jeurissen, Ben; Leemans, Alexander; Sijbers, Jan

    2014-10-01

    Ensuring one is using the correct gradient orientations in a diffusion MRI study can be a challenging task. As different scanners, file formats and processing tools use different coordinate frame conventions, in practice, users can end up with improperly oriented gradient orientations. Using such wrongly oriented gradient orientations for subsequent diffusion parameter estimation will invalidate all rotationally variant parameters and fiber tractography results. While large misalignments can be detected by visual inspection, small rotations of the gradient table (e.g. due to angulation of the acquisition plane), are much more difficult to detect. In this work, we propose an automated method to align the coordinate frame of the gradient orientations with that of the corresponding diffusion weighted images, using a metric based on whole brain fiber tractography. By transforming the gradient table and measuring the average fiber trajectory length, we search for the transformation that results in the best global 'connectivity'. To ensure a fast calculation of the metric we included a range of algorithmic optimizations in our tractography routine. To make the optimization routine robust to spurious local maxima, we use a stochastic optimization routine that selects a random set of seed points on each evaluation. Using simulations, we show that our method can recover the correct gradient orientations with high accuracy and precision. In addition, we demonstrate that our technique can successfully recover rotated gradient tables on a wide range of clinically realistic data sets. As such, our method provides a practical and robust solution to an often overlooked pitfall in the processing of diffusion MRI. Copyright © 2014 Elsevier B.V. All rights reserved.

  10. How to ensure sustainable interoperability in heterogeneous distributed systems through architectural approach.

    PubMed

    Pape-Haugaard, Louise; Frank, Lars

    2011-01-01

    A major obstacle in ensuring ubiquitous information is the utilization of heterogeneous systems in eHealth. The objective in this paper is to illustrate how an architecture for distributed eHealth databases can be designed without lacking the characteristic features of traditional sustainable databases. The approach is firstly to explain traditional architecture in central and homogeneous distributed database computing, followed by a possible approach to use an architectural framework to obtain sustainability across disparate systems i.e. heterogeneous databases, concluded with a discussion. It is seen that through a method of using relaxed ACID properties on a service-oriented architecture it is possible to achieve data consistency which is essential when ensuring sustainable interoperability.

  11. A Survey of Object-Oriented Database Technology

    DTIC Science & Technology

    1990-05-01

    now mention briefly the various security and authorization schemes provided by GEMSTONE. 1. Login Authorization. There are two ways to login to...GemStone - through the OPAL programming environment or through the GemStone C interface. A user ID and password is required in both cases to login. 2. Name...lIlj A. Black. Object structure in the Emerald system. Proc. 1st Intl. Conf. on Object-Oriented Programming Systems, Languages and Applications, pp

  12. Pattern Adaptation and Normalization Reweighting.

    PubMed

    Westrick, Zachary M; Heeger, David J; Landy, Michael S

    2016-09-21

    Adaptation to an oriented stimulus changes both the gain and preferred orientation of neural responses in V1. Neurons tuned near the adapted orientation are suppressed, and their preferred orientations shift away from the adapter. We propose a model in which weights of divisive normalization are dynamically adjusted to homeostatically maintain response products between pairs of neurons. We demonstrate that this adjustment can be performed by a very simple learning rule. Simulations of this model closely match existing data from visual adaptation experiments. We consider several alternative models, including variants based on homeostatic maintenance of response correlations or covariance, as well as feedforward gain-control models with multiple layers, and we demonstrate that homeostatic maintenance of response products provides the best account of the physiological data. Adaptation is a phenomenon throughout the nervous system in which neural tuning properties change in response to changes in environmental statistics. We developed a model of adaptation that combines normalization (in which a neuron's gain is reduced by the summed responses of its neighbors) and Hebbian learning (in which synaptic strength, in this case divisive normalization, is increased by correlated firing). The model is shown to account for several properties of adaptation in primary visual cortex in response to changes in the statistics of contour orientation. Copyright © 2016 the authors.

  13. Azimuthal Anisotropy beneath the Contiguous United States Revealed by Shear Wave Splitting

    NASA Astrophysics Data System (ADS)

    Liu, K. H.; Yang, B.; Liu, Y.; Dahm, H. H.; Refayee, H. A.; Gao, S. S.

    2017-12-01

    We have produced a uniformly-measured XKS (including SKS, SKKS, and PKS) splitting database for the contiguous United States and adjacent areas. The database consists of about 30,000 pairs of splitting parameters from 3185 stations. Both the fast orientations and splitting times show systematic spatial variations. The vast majority of the fast orientations are in agreement with the absolute plate motion (APM) direction computed under a fixed hot-spot reference frame. Spatial coherency analysis of the splitting parameters indicates that for the majority of the study area, where a single layer of anisotropy with a horizontal axis of symmetry is inferred, the source of anisotropy is located in the rheologically transitional zone between the lithosphere and asthenosphere. Beneath the western U.S., the previously recognized semi-circular feature of the fast orientations has a much greater spatial coverage, extending to northern Mexico and the Rio Grande Rift. The fast orientations are parallel to the western, southern, and southeastern edges of the North American Craton and can be interpreted by simple shear strain associated with mantle flow around the cratonic keel. The combination of anisotropy induced by this around keel flow and the APM can effectively explain the E-W fast orientations beneath the southern margin of the North American Craton and NE U.S., as well as the nearly N-S fast orientations and small splitting times observed in the SE U.S. The splitting times show a systematic decrease from both the western and eastern U.S. toward the central U.S., where the thickness of the lithosphere is the largest in the study area. This trend can be explained by the reduced efficiency of anisotropy development at greater depth, as well as by the lack of around keel flow in the continental interior.

  14. Conceptual and logical level of database modeling

    NASA Astrophysics Data System (ADS)

    Hunka, Frantisek; Matula, Jiri

    2016-06-01

    Conceptual and logical levels form the topmost levels of database modeling. Usually, ORM (Object Role Modeling) and ER diagrams are utilized to capture the corresponding schema. The final aim of business process modeling is to store its results in the form of a database solution. For this reason, value-oriented business process modeling, which utilizes ER diagrams to express the modeling entities and the relationships between them, is used. However, ER diagrams form the logical level of the database schema. To extend the possibilities of different business process modeling methodologies, the conceptual level of database modeling is needed. The paper deals with the REA value modeling approach to business process modeling using ER diagrams and derives a conceptual model utilizing the ORM modeling approach. The conceptual model extends the possibilities for value modeling to other business modeling approaches.

  15. ScaffoldScaffolder: solving contig orientation via bidirected to directed graph reduction.

    PubMed

    Bodily, Paul M; Fujimoto, M Stanley; Snell, Quinn; Ventura, Dan; Clement, Mark J

    2016-01-01

    The contig orientation problem, which we formally define as the MAX-DIR problem, has at times been addressed cursorily and at times using various heuristics. In setting forth a linear-time reduction from the MAX-CUT problem to the MAX-DIR problem, we prove the latter is NP-complete. We compare the relative performance of a novel greedy approach with several other heuristic solutions. Our results suggest that our greedy heuristic algorithm not only works well but also outperforms the other algorithms due to the nature of scaffold graphs. Our results also demonstrate a novel method for identifying inverted repeats and inversion variants, both of which contradict the basic single-orientation assumption. Such inversions have previously been noted as being difficult to detect and are directly involved in the genetic mechanisms of several diseases. Availability: http://bioresearch.byu.edu/scaffoldscaffolder. Contact: paulmbodily@gmail.com. Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
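
    The abstract above does not spell out the greedy heuristic, so the following is only a generic sketch of one greedy way to orient contigs, not the ScaffoldScaffolder algorithm. It assumes a hypothetical edge format (contig_a, contig_b, same_orientation, weight): edges are visited from heaviest to lightest and accepted whenever they join two components, with a parity-tracking union-find recording relative orientations.

```python
from typing import Dict, List, Tuple

# Hypothetical edge format: (contig_a, contig_b, same_orientation, weight)
Edge = Tuple[str, str, bool, float]


def greedy_orient(contigs: List[str], edges: List[Edge]) -> Tuple[Dict[str, int], float]:
    """Assign each contig an orientation (0 = forward, 1 = reverse).

    Returns the orientations and the total weight of edges whose
    same/opposite constraint is satisfied by that assignment.
    """
    parent = {c: c for c in contigs}
    parity = {c: 0 for c in contigs}  # orientation relative to the parent node

    def find(c: str) -> Tuple[str, int]:
        # Walk to the component root, accumulating the relative orientation.
        flip = 0
        while parent[c] != c:
            flip ^= parity[c]
            c = parent[c]
        return c, flip

    for a, b, same, _weight in sorted(edges, key=lambda e: -e[3]):
        root_a, flip_a = find(a)
        root_b, flip_b = find(b)
        if root_a == root_b:
            continue  # orientation already fixed by heavier edges
        # Merge the components so that this edge's constraint holds.
        parent[root_a] = root_b
        parity[root_a] = flip_a ^ flip_b ^ (0 if same else 1)

    orientation = {c: find(c)[1] for c in contigs}
    satisfied = sum(
        w for a, b, same, w in edges if (orientation[a] == orientation[b]) == same
    )
    return orientation, satisfied


# Toy scaffold graph with one contradictory low-weight edge (A-C).
toy_edges = [("A", "B", True, 5.0), ("B", "C", False, 3.0), ("A", "C", True, 1.0)]
orientation, satisfied = greedy_orient(["A", "B", "C"], toy_edges)
print(orientation, satisfied)
```

    In this toy graph the A-C edge cannot be satisfied once the two heavier edges are accepted; leftover contradictory edges of this kind are the sort of signal that could point to an inversion.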

  16. [Pharmacogenomics study of 620 whole-exome sequencing: focusing on aspirin application].

    PubMed

    Yang, L; Lu, Y L; Wang, H J; Zhou, W H

    2016-05-01

    To investigate the allele frequencies of aspirin-response-related variants in different populations. The allele frequencies of reported clinically significant aspirin-response-related variants were evaluated in 620 whole exome sequencing (WES) data sets collected from 2013 to 2016 at the Children's Hospital of Fudan University. The local allele frequencies were then compared with the 1000 Genomes Project database using the χ² test. Thirty-eight clinically significant aspirin-response-related variants were detected in the 620 WES data sets. Ten (26%) of them were related to drug efficacy, while 28 (74%) were related to toxicity or adverse drug reaction (ADR). These variants were distributed across 33 genes. Twenty-three aspirin-related variants were analyzed further. The frequencies of 7 (rs1050891, rs6065, rs7862221, rs1065776, rs3818822, rs3775291 and rs1126643) did not differ significantly from those of the European and East Asian populations of the 1000 Genomes Project (P>0.01 for both); 10 (rs2228079, rs1613662, rs4523, rs28360521, rs1131882, rs1047626, rs3856806, rs2768759, rs7572857 and rs1126510) did not differ significantly from the East Asian population but differed significantly from the European population; 1 (rs2075797) did not differ significantly from the European population but differed from the East Asian population; and 5 (rs10279545, rs730012, rs16851030, rs1353411, rs1800469) differed from both the East Asian (0.019, 0.058, 0.167, 0.452, 0.340 vs. 0.100, 0.151, 0.396, 0.568, 0.453, χ²=21.798, 20.400, 67.543, 16.531, 15.807, P all<0.01) and European populations (0.531, 0.312, 0.037, 0.179, 0.688, χ²=325.799, 92.877, 144.811, 156.471, 174.533, P all<0.01). Most variants that have clinical significance in aspirin response are related to drug efficacy, drug toxicity, or ADR, indicating the urgency of variant screening in clinical practice. Significant population specificity was detected in aspirin-response-related variants in the local 620 WES data.
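
    As an illustration of the kind of comparison described above, the sketch below computes a Pearson chi-square statistic for a 2x2 table of allele counts (alternate vs. reference alleles in a local cohort and in a reference population). The counts are invented for the example and the function name is hypothetical; the abstract reports only frequencies, so this does not reproduce the published statistics.

```python
def chi_square_2x2(a: int, b: int, c: int, d: int) -> float:
    """Pearson chi-square statistic (no continuity correction) for the table

        [[a, b],    e.g. local cohort:        alt alleles, ref alleles
         [c, d]]         reference population: alt alleles, ref alleles
    """
    n = a + b + c + d
    row1, row2 = a + b, c + d
    col1, col2 = a + c, b + d
    if 0 in (row1, row2, col1, col2):
        raise ValueError("each row and column needs at least one observation")
    return n * (a * d - b * c) ** 2 / (row1 * row2 * col1 * col2)


# Invented allele counts, for illustration only.
stat = chi_square_2x2(30, 170, 60, 140)

# 6.635 is the chi-square critical value for 1 degree of freedom at P = 0.01.
significant_at_0_01 = stat > 6.635
print(round(stat, 3), significant_at_0_01)
```

    With one degree of freedom, a statistic above 6.635 corresponds to P<0.01, the threshold used in the abstract.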

  17. Second generation engineering of transketolase for polar aromatic aldehyde substrates.

    PubMed

    Payongsri, Panwajee; Steadman, David; Hailes, Helen C; Dalby, Paul A

    2015-04-01

    Transketolase has significant industrial potential for the asymmetric synthesis of carbon-carbon bonds with new chiral centres. Variants evolved on propanal were found previously with nascent activity on polar aromatic aldehydes 3-formylbenzoic acid (3-FBA), 4-formylbenzoic acid (4-FBA), and 3-hydroxybenzaldehyde (3-HBA), suggesting a potential novel route to analogues of chloramphenicol. Here we evolved improved transketolase activities towards aromatic aldehydes, by saturation mutagenesis of two active-site residues (R358 and S385), predicted to interact with the aromatic substituents. S385 variants selectively controlled the aromatic substrate preference, with up to 13-fold enhanced activities, and KM values comparable to those of natural substrates with wild-type transketolase. S385E even completely removed the substrate inhibition for 3-FBA, observed in all previous variants. The mechanisms of catalytic improvement were both mutation type and substrate dependent. S385E improved 3-FBA activity via kcat, but reduced 4-FBA activity via KM. Conversely, S385Y/T improved 3-FBA activity via KM and 4-FBA activity via kcat. This suggested that both substrate proximity and active-site orientation are very sensitive to mutation. Comparison of all variant activities on each substrate indicated different binding modes for the three aromatic substrates, supported by computational docking. This highlights a potential divergence in the evolution of different substrate specificities, with implications for enzyme engineering. Copyright © 2015 Elsevier Inc. All rights reserved.

  18. Structural determinants of phosphoinositide selectivity in splice variants of Grp1 family PH domains

    PubMed Central

    Cronin, Thomas C; DiNitto, Jonathan P; Czech, Michael P; Lambright, David G

    2004-01-01

    The pleckstrin homology (PH) domains of the homologous proteins Grp1 (general receptor for phosphoinositides), ARNO (Arf nucleotide binding site opener), and Cytohesin-1 bind phosphatidylinositol (PtdIns) 3,4,5-trisphosphate with unusually high selectivity. Remarkably, splice variants that differ only by the insertion of a single glycine residue in the β1/β2 loop exhibit dual specificity for PtdIns(3,4,5)P3 and PtdIns(4,5)P2. The structural basis for this dramatic specificity switch is not apparent from the known modes of phosphoinositide recognition. Here, we report crystal structures for dual specificity variants of the Grp1 and ARNO PH domains in either the unliganded form or in complex with the head groups of PtdIns(4,5)P2 and PtdIns(3,4,5)P3. Loss of contacts with the β1/β2 loop with no significant change in head group orientation accounts for the significant decrease in PtdIns(3,4,5)P3 affinity observed for the dual specificity variants. Conversely, a small increase rather than decrease in affinity for PtdIns(4,5)P2 is explained by a novel binding mode, in which the glycine insertion alleviates unfavorable interactions with the β1/β2 loop. These observations are supported by a systematic mutational analysis of the determinants of phosphoinositide recognition. PMID:15359279

  19. Early Grades Ideas.

    ERIC Educational Resources Information Center

    Classroom Computer Learning, 1984

    1984-01-01

    Five computer-oriented classroom activities are suggested. They include: Logo programming to help students develop estimation, logic and spatial skills; creating flow charts; inputting data; making snowflakes using Logo; and developing and using a database management program. (JN)

  20. Data processing and optimization system to study prospective interstate power interconnections

    NASA Astrophysics Data System (ADS)

    Podkovalnikov, Sergei; Trofimov, Ivan; Trofimov, Leonid

    2018-01-01

    The paper presents a data processing and optimization system for studying and making rational decisions on the formation of interstate electric power interconnections, with the aim of increasing the effectiveness of their functioning and expansion. The technologies for building and integrating the data processing and optimization system, including an object-oriented database and the predictive mathematical model ORIRES for optimizing the expansion of electric power systems, are described. The technology for collecting and pre-processing unstructured data from various sources and loading it into the object-oriented database, as well as for processing and presenting information in the GIS, is described. One approach to graphical visualization of the optimization model's results is illustrated by the calculation of an expansion option for the South Korean electric power grid.

  1. Analysis of prostate-specific antigen transcripts in chimpanzees, cynomolgus monkeys, baboons, and African green monkeys.

    PubMed

    Mubiru, James N; Yang, Alice S; Olsen, Christian; Nayak, Sudhir; Livi, Carolina B; Dick, Edward J; Owston, Michael; Garcia-Forey, Magdalena; Shade, Robert E; Rogers, Jeffrey

    2014-01-01

    The function of prostate-specific antigen (PSA) is to liquefy the semen coagulum so that the released sperm can fuse with the ovum. Fifteen spliced variants of the PSA gene have been reported in humans, but little is known about alternative splicing in nonhuman primates. Positive selection has been reported in sex- and reproductive-related genes from sea urchins to Drosophila to humans; however, there are few studies of adaptive evolution of the PSA gene. Here, using polymerase chain reaction (PCR) product cloning and sequencing, we study PSA transcript variant heterogeneity in the prostates of chimpanzees (Pan troglodytes), cynomolgus monkeys (Macaca fascicularis), baboons (Papio hamadryas anubis), and African green monkeys (Chlorocebus aethiops). Six PSA variants were identified in the chimpanzee prostate, but only two variants were found in cynomolgus monkeys, baboons, and African green monkeys. In the chimpanzee the full-length transcript is expressed at the same magnitude as the transcripts that retain intron 3. We have found previously unidentified splice variants of the PSA gene, some of which might be linked to disease conditions. Selection on the PSA gene was studied in 11 primate species by computational methods using the sequences reported here for African green monkey, cynomolgus monkey, baboon, and chimpanzee and other sequences available in public databases. A codon-based analysis (dN/dS) of the PSA gene identified potential adaptive evolution at five residue sites (Arg45, Lys70, Gln144, Pro189, and Thr203).

  2. In silico prediction of splice-altering single nucleotide variants in the human genome.

    PubMed

    Jian, Xueqiu; Boerwinkle, Eric; Liu, Xiaoming

    2014-12-16

    In silico tools have been developed to predict variants that may have an impact on pre-mRNA splicing. The major limitation of the application of these tools to basic research and clinical practice is the difficulty in interpreting the output. Most tools only predict potential splice sites given a DNA sequence without measuring splicing signal changes caused by a variant. Another limitation is the lack of large-scale evaluation studies of these tools. We compared eight in silico tools on 2959 single nucleotide variants within splicing consensus regions (scSNVs) using receiver operating characteristic analysis. The Position Weight Matrix model and MaxEntScan outperformed other methods. Two ensemble learning methods, adaptive boosting and random forests, were used to construct models that take advantage of individual methods. Both models further improved prediction, with outputs of directly interpretable prediction scores. We applied our ensemble scores to scSNVs from the Catalogue of Somatic Mutations in Cancer database. Analysis showed that predicted splice-altering scSNVs are enriched in recurrent scSNVs and known cancer genes. We pre-computed our ensemble scores for all potential scSNVs across the human genome, providing a whole genome level resource for identifying splice-altering scSNVs discovered from large-scale sequencing studies.

  3. Patatin-like phospholipase domain-containing protein 3 (PNPLA3): A potential role in the association between liver disease and bipolar disorder.

    PubMed

    Kenneson, Aileen; Funderburk, Jennifer S

    2017-02-01

    Due to the increased prevalence of liver disease in patients with bipolar disorder, we examined the potential role of the patatin-like phospholipase domain-containing protein 3 (PNPLA3) variant among individuals with bipolar disorder and those with no mood disorder. We used the National Health and Nutrition Examination Survey (NHANES) database (aged 15-39 years) to identify a group of individuals with a bipolar diagnosis and a control group of individuals with no mood disorder. A total of 1931 individuals were randomly selected, one from each family containing information on the PNPLA3 genotype to be used in the analysis. Analyses revealed individuals with the recessive variant genotype (MM) had an adjusted odds ratio for bipolar disorder of about 4.6 compared to individuals with either IM or II genotypes of the PNPLA3 variant. Limitations of this study include the use of a lay-administered survey for the diagnosis of bipolar disorder in NHANES. The association between the PNPLA3 variant and bipolar disorder may help guide further work on medication effectiveness, treatment options, prevention approaches, and understanding potential medication side effects among specific subgroups of individuals with the MM genotype. Published by Elsevier B.V.

  4. ToTem: a tool for variant calling pipeline optimization.

    PubMed

    Tom, Nikola; Tom, Ondrej; Malcikova, Jitka; Pavlova, Sarka; Kubesova, Blanka; Rausch, Tobias; Kolarik, Miroslav; Benes, Vladimir; Bystry, Vojtech; Pospisilova, Sarka

    2018-06-26

    High-throughput bioinformatics analyses of next generation sequencing (NGS) data often require challenging pipeline optimization. The key problem is choosing appropriate tools and selecting the best parameters for optimal precision and recall. Here we introduce ToTem, a tool for automated pipeline optimization. ToTem is a stand-alone web application with a comprehensive graphical user interface (GUI). ToTem is written in Java and PHP with an underlying connection to a MySQL database. Its primary role is to automatically generate, execute and benchmark different variant calling pipeline settings. Our tool allows an analysis to be started from any level of the process and with the possibility of plugging almost any tool or code. To prevent an over-fitting of pipeline parameters, ToTem ensures the reproducibility of these by using cross validation techniques that penalize the final precision, recall and F-measure. The results are interpreted as interactive graphs and tables allowing an optimal pipeline to be selected, based on the user's priorities. Using ToTem, we were able to optimize somatic variant calling from ultra-deep targeted gene sequencing (TGS) data and germline variant detection in whole genome sequencing (WGS) data. ToTem is a tool for automated pipeline optimization which is freely available as a web application at https://totem.software.
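
    The precision, recall and F-measure mentioned above can be illustrated with a minimal sketch that compares a set of called variants against a truth set. This is not ToTem's code; the function name, the variant tuple layout and the toy data are assumptions made for the example, and the cross-validation penalty described in the abstract is not modelled.

```python
from typing import Dict, Set, Tuple

# Hypothetical variant key: (chromosome, position, reference allele, alternate allele)
Variant = Tuple[str, int, str, str]


def benchmark_calls(called: Set[Variant], truth: Set[Variant]) -> Dict[str, float]:
    """Score one pipeline run against a truth set of known variants."""
    tp = len(called & truth)   # true positives: called and real
    fp = len(called - truth)   # false positives: called but not real
    fn = len(truth - called)   # false negatives: real but missed
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    denom = precision + recall
    f_measure = 2 * precision * recall / denom if denom else 0.0
    return {"precision": precision, "recall": recall, "f_measure": f_measure}


# Toy data, invented for the example.
truth = {
    ("chr1", 100, "A", "G"),
    ("chr1", 250, "C", "T"),
    ("chr2", 75, "G", "A"),
    ("chr3", 10, "T", "C"),
    ("chr3", 90, "G", "C"),
}
called = {
    ("chr1", 100, "A", "G"),
    ("chr1", 250, "C", "T"),
    ("chr2", 75, "G", "A"),
    ("chr2", 300, "A", "T"),
}
scores = benchmark_calls(called, truth)
print(scores)
```

    Running the same scoring over several pipeline settings and comparing the resulting dictionaries is one simple way to rank them by whichever metric the user prioritizes.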

  5. [Therapeutic effect and safety of montelukast sodium combined with budesonide in children with cough variant asthma: a Meta analysis].

    PubMed

    Wei, Yan; Li, Dong-Sheng; Liu, Jian-Jun; Zhang, Jing; Zhao, Hai-En

    2016-11-01

    To evaluate the therapeutic effect and safety of montelukast sodium combined with budesonide in children with cough variant asthma. The databases CNKI, Wanfang Data, VIP, PubMed, EMbase, and BioMed Central were searched for randomized controlled trials (RCTs) of montelukast sodium combined with budesonide in the treatment of children with cough variant asthma. Data extraction and quality assessment were performed for RCTs which met the inclusion criteria, and RevMan 5.3 software was used to perform quality assessment of the articles included and Meta analysis. A total of 11 RCTs involving 1 097 patients were included. The results of the Meta analysis showed that compared with the control group (inhalation of budesonide alone), the observation group (inhalation of montelukast sodium combined with budesonide) had significantly higher overall response rate and more improved pulmonary function parameters including forced expiratory volume in the first second, percentage of forced expiratory volume in the first second, and peak expiratory flow, as well as significantly lower recurrence rate (P<0.01). The incidence of adverse events showed no significant difference between the two groups. Inhalation of montelukast sodium combined with budesonide has a significant effect in children with cough variant asthma and does not increase the incidence of adverse events.

  6. Early-Onset Progressive Retinal Atrophy Associated with an IQCB1 Variant in African Black-Footed Cats (Felis nigripes)

    PubMed Central

    Oh, Annie; Pearce, Jacqueline W.; Gandolfi, Barbara; Creighton, Erica K.; Suedmeyer, William K.; Selig, Michael; Bosiack, Ann P.; Castaner, Leilani J.; Whiting, Rebecca E. H.; Belknap, Ellen B.; Lyons, Leslie A.; Aderdein, Danielle; Alves, Paulo C.; Barsh, Gregory S.; Beale, Holly C.; Boyko, Adam R.; Castelhano, Marta G.; Chan, Patricia; Ellinwood, N. Matthew; Garrick, Dorian J.; Helps, Christopher R.; Kaelin, Christopher B.; Leeb, Tosso; Lohi, Hannes; Longeri, Maria; Malik, Richard; Montague, Michael J.; Munday, John S.; Murphy, William J.; Pedersen, Niels C.; Rothschild, Max F.; Swanson, William F.; Terio, Karen A.; Todhunter, Rory J.; Warren, Wesley C.

    2017-01-01

    African black-footed cats (Felis nigripes) are endangered wild felids. One male and full-sibling female African black-footed cat developed vision deficits and mydriasis as early as 3 months of age. The diagnosis of early-onset progressive retinal atrophy (PRA) was supported by reduced direct and consensual pupillary light reflexes, phenotypic presence of retinal degeneration, and a non-recordable electroretinogram with negligible amplitudes in both eyes. Whole genome sequencing, conducted on two unaffected parents and one affected offspring and compared to a variant database from 51 domestic cats and a Pallas cat, revealed 50 candidate variants that segregated concordantly with the PRA phenotype. Testing in additional affected cats confirmed that cats homozygous for a 2 base pair (bp) deletion within IQ calmodulin-binding motif-containing protein-1 (IQCB1), the gene that encodes for nephrocystin-5 (NPHP5), had vision loss. The variant segregated concordantly in other related individuals within the pedigree, supporting the identification of a recessively inherited early-onset feline PRA. Analysis of the black-footed cat studbook suggests additional captive cats are at risk. Genetic testing for IQCB1 and avoidance of matings between carriers should be added to the species survival plan for captive management. PMID:28322220
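
    A recessive segregation filter of the general kind described above might look like the sketch below. The exact criteria used in the study are not given in the abstract, so the rule here (affected animals homozygous for the alternate allele, obligate-carrier parents heterozygous, variant not seen as homozygous in a background panel) is an assumption, as are the function name, sample names and variant identifiers.

```python
from typing import Dict, Iterable, List, Set

# genotypes[variant_id][sample] = number of alternate alleles (0, 1 or 2)
Genotypes = Dict[str, Dict[str, int]]


def recessive_candidates(
    genotypes: Genotypes,
    affected: Iterable[str],
    carriers: Iterable[str],
    background_homozygous: Set[str],
) -> List[str]:
    """Return variant IDs consistent with recessive inheritance.

    background_homozygous holds variants already observed as homozygous in
    unaffected background animals; these are excluded (assumed rule).
    """
    affected = list(affected)
    carriers = list(carriers)
    hits = []
    for variant, calls in genotypes.items():
        if variant in background_homozygous:
            continue
        affected_ok = all(calls.get(sample) == 2 for sample in affected)
        carriers_ok = all(calls.get(sample) == 1 for sample in carriers)
        if affected_ok and carriers_ok:
            hits.append(variant)
    return sorted(hits)


# Toy trio, invented for the example.
toy_genotypes = {
    "var_a": {"sire": 1, "dam": 1, "kitten": 2},
    "var_b": {"sire": 1, "dam": 0, "kitten": 1},
    "var_c": {"sire": 1, "dam": 1, "kitten": 2},
}
candidates = recessive_candidates(
    toy_genotypes, affected=["kitten"], carriers=["sire", "dam"],
    background_homozygous={"var_c"},
)
print(candidates)
```

    Samples with missing calls fail the check in this sketch, which is a conservative choice; a real pipeline would need an explicit policy for missing genotypes.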

  7. Calibration and Validation of the COCOMO II.1997.0 Cost/Schedule Estimating Model to the Space and Missile Systems Center Database

    DTIC Science & Technology

    1997-09-01

    Daly chose five models (REVIC, PRICE-S, SEER, System-4, and SPQR/20) to estimate schedule for 21 separate projects from the Electronic System Division...PRICE-S, two variants of COCOMO, System-3, SPQR/20, SASET, SoftCost-Ada) to eight Ada specific programs. Ada was specifically designed for and is

  8. Energies and 2'-Hydroxyl Group Orientations of RNA Backbone Conformations. Benchmark CCSD(T)/CBS Database, Electronic Analysis, and Assessment of DFT Methods and MD Simulations.

    PubMed

    Mládek, Arnošt; Banáš, Pavel; Jurečka, Petr; Otyepka, Michal; Zgarbová, Marie; Šponer, Jiří

    2014-01-14

    The sugar-phosphate backbone is an electronically complex molecular segment imparting to RNA molecules the high flexibility and architectonic heterogeneity necessary for their biological functions. The structural variability of RNA molecules is amplified by the presence of the 2'-hydroxyl group, capable of forming a multitude of intra- and intermolecular interactions. Bioinformatics studies based on an X-ray structure database revealed that the RNA backbone samples at least 46 substates known as rotameric families. The present study provides a comprehensive analysis of RNA backbone conformational preferences and 2'-hydroxyl group orientations. First, we create a benchmark database of estimated CCSD(T)/CBS relative energies of all rotameric families and test the performance of dispersion-corrected DFT-D3 methods and molecular mechanics in vacuum and in continuum solvent. The performance of the DFT-D3 methods is in general quite satisfactory. The B-LYP-D3 method provides the best trade-off between accuracy and computational demands. B3-LYP-D3 slightly outperforms the new PW6B95-D3 and MPW1B95-D3 and is the second most accurate density functional of the study. The best agreement with CCSD(T)/CBS is provided by the DSD-B-LYP-D3 double-hybrid functional, although its large-scale applications may be limited by high computational costs. Molecular mechanics does not reproduce the fine energy differences between the RNA backbone substates. We also demonstrate that the differences in the magnitude of the hyperconjugation effect do not correlate with the energy ranking of the backbone conformations. Further, we investigated the 2'-hydroxyl group orientation preferences. For all families, we conducted a QM and MM hydroxyl group rigid scan in gas phase and solvent. We then carried out a set of explicit solvent MD simulations of folded RNAs and analyzed the 2'-hydroxyl group orientations of different backbone families in MD. The solvent energy profiles determined primarily by the sugar pucker match well with the distribution data derived from the simulations. The QM and MM energy profiles predict the same 2'-hydroxyl group orientation preferences. Finally, we demonstrate that the high energy of unfavorable and rarely sampled 2'-hydroxyl group orientations can be attributed to clashes between occupied orbitals.

  9. X-Linked and Autosomal Recessive Alport Syndrome: Pathogenic Variant Features and Further Genotype-Phenotype Correlations

    PubMed Central

    Savige, Judith; Storey, Helen; Il Cheong, Hae; Gyung Kang, Hee; Park, Eujin; Hilbert, Pascale; Persikov, Anton; Torres-Fernandez, Carmen; Ars, Elisabet; Torra, Roser; Hertz, Jens Michael; Thomassen, Mads; Shagam, Lev; Wang, Dongmao; Wang, Yanyan; Flinter, Frances; Nagel, Mato

    2016-01-01

    Alport syndrome results from mutations in the COL4A5 (X-linked) or COL4A3/COL4A4 (recessive) genes. This study examined 754 previously unpublished variants in these genes from individuals referred for genetic testing in 12 accredited diagnostic laboratories worldwide, in addition to all published COL4A5, COL4A3 and COL4A4 variants in the LOVD databases. It also determined genotype-phenotype correlations for variants where clinical data were available. Individuals were referred for genetic testing where Alport syndrome was suspected clinically or on biopsy (renal failure, hearing loss, retinopathy, lamellated glomerular basement membrane), variant pathogenicity was assessed using currently-accepted criteria, and variants were examined for gene location, and age at renal failure onset. Results were compared using Fisher’s exact test (DNA Stata). Altogether 754 new DNA variants were identified, an increase of 25%, predominantly in people of European background. Of the 1168 COL4A5 variants, 504 (43%) were missense mutations, 273 (23%) splicing variants, 73 (6%) nonsense mutations, 169 (14%) short deletions and 76 (7%) complex or large deletions. Only 135 of the 432 Gly residues in the collagenous sequence were substituted (31%), which means that fewer than 10% of all possible variants have been identified. Both missense and nonsense mutations in COL4A5 were not randomly distributed but more common at the 70 CpG sequences (p<10^-41 and p<0.001 respectively). Gly>Ala substitutions were underrepresented in all three genes (p<0.0001) probably because of an association with a milder phenotype. The average age at end-stage renal failure was the same for all mutations in COL4A5 (24.4 ± 7.8 years), COL4A3 (23.3 ± 9.3) and COL4A4 (25.4 ± 10.3) (COL4A5 and COL4A3, p = 0.45; COL4A5 and COL4A4, p = 0.55; COL4A3 and COL4A4, p = 0.41). For COL4A5, renal failure occurred sooner with non-missense than missense variants (p<0.01). For the COL4A3 and COL4A4 genes, age at renal failure occurred sooner with two non-missense variants (p = 0.08, and p = 0.01 respectively). Thus DNA variant characteristics that predict age at renal failure appeared to be the same for all three Alport genes. Founder mutations (with the pathogenic variant in at least 5 apparently unrelated individuals) were not necessarily associated with a milder phenotype. This study illustrates the benefits when routine diagnostic laboratories share and analyse their data. PMID:27627812

  10. X-Linked and Autosomal Recessive Alport Syndrome: Pathogenic Variant Features and Further Genotype-Phenotype Correlations.

    PubMed

    Savige, Judith; Storey, Helen; Il Cheong, Hae; Gyung Kang, Hee; Park, Eujin; Hilbert, Pascale; Persikov, Anton; Torres-Fernandez, Carmen; Ars, Elisabet; Torra, Roser; Hertz, Jens Michael; Thomassen, Mads; Shagam, Lev; Wang, Dongmao; Wang, Yanyan; Flinter, Frances; Nagel, Mato

    2016-01-01

    Alport syndrome results from mutations in the COL4A5 (X-linked) or COL4A3/COL4A4 (recessive) genes. This study examined 754 previously unpublished variants in these genes from individuals referred for genetic testing in 12 accredited diagnostic laboratories worldwide, in addition to all published COL4A5, COL4A3 and COL4A4 variants in the LOVD databases. It also determined genotype-phenotype correlations for variants where clinical data were available. Individuals were referred for genetic testing where Alport syndrome was suspected clinically or on biopsy (renal failure, hearing loss, retinopathy, lamellated glomerular basement membrane), variant pathogenicity was assessed using currently accepted criteria, and variants were examined for gene location and age at renal failure onset. Results were compared using Fisher's exact test (DNA Stata). Altogether 754 new DNA variants were identified, an increase of 25%, predominantly in people of European background. Of the 1168 COL4A5 variants, 504 (43%) were missense mutations, 273 (23%) splicing variants, 73 (6%) nonsense mutations, 169 (14%) short deletions and 76 (7%) complex or large deletions. Only 135 of the 432 Gly residues in the collagenous sequence were substituted (31%), which means that fewer than 10% of all possible variants have been identified. Both missense and nonsense mutations in COL4A5 were not randomly distributed but more common at the 70 CpG sequences (p<10^-41 and p<0.001 respectively). Gly>Ala substitutions were underrepresented in all three genes (p<0.0001) probably because of an association with a milder phenotype. The average age at end-stage renal failure was the same for all mutations in COL4A5 (24.4 ± 7.8 years), COL4A3 (23.3 ± 9.3) and COL4A4 (25.4 ± 10.3) (COL4A5 and COL4A3, p = 0.45; COL4A5 and COL4A4, p = 0.55; COL4A3 and COL4A4, p = 0.41). For COL4A5, renal failure occurred sooner with non-missense than missense variants (p<0.01). For the COL4A3 and COL4A4 genes, renal failure occurred sooner with two non-missense variants (p = 0.08 and p = 0.01 respectively). Thus DNA variant characteristics that predict age at renal failure appeared to be the same for all three Alport genes. Founder mutations (with the pathogenic variant in at least 5 apparently unrelated individuals) were not necessarily associated with a milder phenotype. This study illustrates the benefits when routine diagnostic laboratories share and analyse their data.

  11. Extending the data dictionary for data/knowledge management

    NASA Technical Reports Server (NTRS)

    Hydrick, Cecile L.; Graves, Sara J.

    1988-01-01

    Current relational database technology provides the means for efficiently storing and retrieving large amounts of data. By combining techniques learned from the field of artificial intelligence with this technology, it is possible to expand the capabilities of such systems. This paper suggests using the expanded domain concept, an object-oriented organization, and the storing of knowledge rules within the relational database as a solution to the unique problems associated with CAD/CAM and engineering data.

  12. SeqReporter: automating next-generation sequencing result interpretation and reporting workflow in a clinical laboratory.

    PubMed

    Roy, Somak; Durso, Mary Beth; Wald, Abigail; Nikiforov, Yuri E; Nikiforova, Marina N

    2014-01-01

    A wide repertoire of bioinformatics applications exists for next-generation sequencing data analysis; however, certain requirements of the clinical molecular laboratory limit their use: i) comprehensive report generation, ii) compatibility with existing laboratory information systems and computer operating system, iii) knowledgebase development, iv) quality management, and v) data security. SeqReporter is a web-based application developed using ASP.NET framework version 4.0. The client-side was designed using HTML5, CSS3, and JavaScript. The server-side processing (VB.NET) relied on interaction with a customized SQL Server 2008 R2 database. Overall, 104 cases (1062 variant calls) were analyzed by SeqReporter. Each variant call was classified into one of five report levels: i) known clinical significance, ii) uncertain clinical significance, iii) pending pathologists' review, iv) synonymous and deep intronic, and v) platform and panel-specific sequence errors. SeqReporter correctly annotated and classified 99.9% (859 of 860) of sequence variants, including 68.7% synonymous single-nucleotide variants, 28.3% nonsynonymous single-nucleotide variants, 1.7% insertions, and 1.3% deletions. One variant of potential clinical significance was re-classified after pathologist review. Laboratory information system-compatible clinical reports were generated automatically. SeqReporter also facilitated quality management activities. SeqReporter is an example of a customized and well-designed informatics solution to optimize and automate the downstream analysis of clinical next-generation sequencing data. We propose it as a model that may envisage the development of a comprehensive clinical informatics solution. Copyright © 2014 American Society for Investigative Pathology and the Association for Molecular Pathology. Published by Elsevier Inc. All rights reserved.

  13. Germline variant FGFR4 p.G388R exposes a membrane-proximal STAT3 binding site.

    PubMed

    Ulaganathan, Vijay K; Sperl, Bianca; Rapp, Ulf R; Ullrich, Axel

    2015-12-24

    Variant rs351855-G/A is a commonly occurring single-nucleotide polymorphism of coding regions in exon 9 of the fibroblast growth factor receptor FGFR4 (CD334) gene (c.1162G>A). It results in an amino-acid change at codon 388 from glycine to arginine (p.Gly388Arg) in the transmembrane domain of the receptor. Despite compelling genetic evidence for the association of this common variant with cancers of the bone, breast, colon, prostate, skin, lung, head and neck, as well as soft-tissue sarcomas and non-Hodgkin lymphoma, the underlying biological mechanism has remained elusive. Here we show that substitution of the conserved glycine 388 residue to a charged arginine residue alters the transmembrane spanning segment and exposes a membrane-proximal cytoplasmic signal transducer and activator of transcription 3 (STAT3) binding site Y(390)-(P)XXQ(393). We demonstrate that such membrane-proximal STAT3 binding motifs in the germline of type I membrane receptors enhance STAT3 tyrosine phosphorylation by recruiting STAT3 proteins to the inner cell membrane. Remarkably, such germline variants frequently co-localize with somatic mutations in the Catalogue of Somatic Mutations in Cancer (COSMIC) database. Using Fgfr4 single nucleotide polymorphism knock-in mice and transgenic mouse models for breast and lung cancers, we validate the enhanced STAT3 signalling induced by the FGFR4 Arg388-variant in vivo. Thus, our findings elucidate the molecular mechanism behind the genetic association of rs351855 with accelerated cancer progression and suggest that germline variants of cell-surface molecules that recruit STAT3 to the inner cell membrane are a significant risk for cancer prognosis and disease progression.

  14. Twelve novel HGD gene variants identified in 99 alkaptonuria patients: focus on 'black bone disease' in Italy.

    PubMed

    Nemethova, Martina; Radvanszky, Jan; Kadasi, Ludevit; Ascher, David B; Pires, Douglas E V; Blundell, Tom L; Porfirio, Berardino; Mannoni, Alessandro; Santucci, Annalisa; Milucci, Lia; Sestini, Silvia; Biolcati, Gianfranco; Sorge, Fiammetta; Aurizi, Caterina; Aquaron, Robert; Alsbou, Mohammed; Lourenço, Charles Marques; Ramadevi, Kanakasabapathi; Ranganath, Lakshminarayan R; Gallagher, James A; van Kan, Christa; Hall, Anthony K; Olsson, Birgitta; Sireau, Nicolas; Ayoob, Hana; Timmis, Oliver G; Sang, Kim-Hanh Le Quan; Genovese, Federica; Imrich, Richard; Rovensky, Jozef; Srinivasaraghavan, Rangan; Bharadwaj, Shruthi K; Spiegel, Ronen; Zatkova, Andrea

    2016-01-01

    Alkaptonuria (AKU) is an autosomal recessive disorder caused by mutations in the homogentisate-1,2-dioxygenase (HGD) gene leading to the deficiency of HGD enzyme activity. The DevelopAKUre project is underway to test nitisinone as a specific treatment to counteract this derangement of the phenylalanine-tyrosine catabolic pathway. We analysed DNA of 40 AKU patients enrolled for SONIA1, the first study in DevelopAKUre, and of 59 other AKU patients sent to our laboratory for molecular diagnostics. We identified 12 novel DNA variants: one each in patients from Brazil (c.557T>A), Slovakia (c.500C>T) and France (c.440T>C), three in patients from India (c.469+6T>C, c.650-85A>G, c.158G>A), and six in patients from Italy (c.742A>G, c.614G>A, c.1057A>C, c.752G>A, c.119A>C, c.926G>T). Thus, the total number of potential AKU-causing variants found in 380 patients reported in the HGD mutation database is now 129. Using mCSM and DUET, computational approaches based on the protein 3D structure, the novel missense variants are predicted to affect the activity of the enzyme by three mechanisms: decrease of stability of individual protomers, disruption of protomer-protomer interactions or modification of residues in the region of the active site. We also present an overview of AKU in Italy, where so far about 60 AKU cases are known and DNA analysis has been reported for 34 of them. In this rather small group, 26 different HGD variants affecting function were described, indicating rather high heterogeneity. Twelve of these variants seem to be specific for Italy.

  15. Twelve novel HGD gene variants identified in 99 alkaptonuria patients: focus on 'black bone disease' in Italy

    PubMed Central

    Nemethova, Martina; Radvanszky, Jan; Kadasi, Ludevit; Ascher, David B; Pires, Douglas E V; Blundell, Tom L; Porfirio, Berardino; Mannoni, Alessandro; Santucci, Annalisa; Milucci, Lia; Sestini, Silvia; Biolcati, Gianfranco; Sorge, Fiammetta; Aurizi, Caterina; Aquaron, Robert; Alsbou, Mohammed; Marques Lourenço, Charles; Ramadevi, Kanakasabapathi; Ranganath, Lakshminarayan R; Gallagher, James A; van Kan, Christa; Hall, Anthony K; Olsson, Birgitta; Sireau, Nicolas; Ayoob, Hana; Timmis, Oliver G; Le Quan Sang, Kim-Hanh; Genovese, Federica; Imrich, Richard; Rovensky, Jozef; Srinivasaraghavan, Rangan; Bharadwaj, Shruthi K; Spiegel, Ronen; Zatkova, Andrea

    2016-01-01

    Alkaptonuria (AKU) is an autosomal recessive disorder caused by mutations in the homogentisate-1,2-dioxygenase (HGD) gene leading to the deficiency of HGD enzyme activity. The DevelopAKUre project is underway to test nitisinone as a specific treatment to counteract this derangement of the phenylalanine-tyrosine catabolic pathway. We analysed DNA of 40 AKU patients enrolled for SONIA1, the first study in DevelopAKUre, and of 59 other AKU patients sent to our laboratory for molecular diagnostics. We identified 12 novel DNA variants: one each in patients from Brazil (c.557T>A), Slovakia (c.500C>T) and France (c.440T>C), three in patients from India (c.469+6T>C, c.650-85A>G, c.158G>A), and six in patients from Italy (c.742A>G, c.614G>A, c.1057A>C, c.752G>A, c.119A>C, c.926G>T). Thus, the total number of potential AKU-causing variants found in 380 patients reported in the HGD mutation database is now 129. Using mCSM and DUET, computational approaches based on the protein 3D structure, the novel missense variants are predicted to affect the activity of the enzyme by three mechanisms: decrease of stability of individual protomers, disruption of protomer-protomer interactions or modification of residues in the region of the active site. We also present an overview of AKU in Italy, where so far about 60 AKU cases are known and DNA analysis has been reported for 34 of them. In this rather small group, 26 different HGD variants affecting function were described, indicating rather high heterogeneity. Twelve of these variants seem to be specific for Italy. PMID:25804398

  16. Identification of Susceptibility Loci and Genes for Colorectal Cancer Risk

    PubMed Central

    Zeng, Chenjie; Matsuda, Koichi; Jia, Wei-Hua; Chang, Jiang; Kweon, Sun-Seog; Xiang, Yong-Bing; Shin, Aesun; Jee, Sun Ha; Kim, Dong-Hyun; Zhang, Ben; Cai, Qiuyin; Guo, Xingyi; Long, Jirong; Wang, Nan; Courtney, Regina; Pan, Zhi-Zhong; Wu, Chen; Takahashi, Atsushi; Shin, Min-Ho; Matsuo, Keitaro; Matsuda, Fumihiko; Gao, Yu-Tang; Oh, Jae Hwan; Kim, Soriul; Jung, Keum Ji; Ahn, Yoon-Ok; Ren, Zefang; Li, Hong-Lan; Wu, Jie; Shi, Jiajun; Wen, Wanqing; Yang, Gong; Li, Bingshan; Ji, Bu-Tian; Brenner, Hermann; Schoen, Robert E.; Küry, Sébastien; Gruber, Stephen B.; Schumacher, Fredrick R.; Stenzel, Stephanie L.; Casey, Graham; Hopper, John L.; Jenkins, Mark A.; Kim, Hyeong-Rok; Jeong, Jin-Young; Park, Ji Won; Tajima, Kazuo; Cho, Sang-Hee; Kubo, Michiaki; Shu, Xiao-Ou; Lin, Dongxin; Zeng, Yi-Xin; Zheng, Wei

    2016-01-01

    Background & Aims: Known genetic factors explain only a small fraction of genetic variation in colorectal cancer (CRC). We conducted a genome-wide association study (GWAS) to identify risk loci for CRC. Methods: The discovery stage included 8027 cases and 22577 controls of East-Asian ancestry. Promising variants were evaluated in studies including as many as 11044 cases and 12047 controls. Tumor-adjacent normal tissues from 188 patients were analyzed to evaluate correlations of risk variants with expression levels of nearby genes. Potential functionality of risk variants was evaluated using public genomic and epigenomic databases. Results: We identified 4 loci associated with CRC risk; P values for the most significant variant in each locus ranged from 3.92×10^-8 to 1.24×10^-12: 6p21.1 (rs4711689), 8q23.3 (rs2450115, rs6469656), 10q24.3 (rs4919687), and 12p13.3 (rs11064437). We also identified 2 risk variants at loci previously associated with CRC: 10q25.2 (rs10506868) and 20q13.3 (rs6061231). These risk variants, conferring an approximate 10%–18% increase in risk per allele, are located either inside or near protein-coding genes that include TFEB (lysosome biogenesis and autophagy), EIF3H (initiation of translation), CYP17A1 (steroidogenesis), SPSB2 (proteasome degradation), and RPS21 (ribosome biogenesis). Gene expression analyses showed a significant association (P < .05) for rs4711689 with TFEB, rs6469656 with EIF3H, rs11064437 with SPSB2, and rs6061231 with RPS21. Conclusions: We identified susceptibility loci and genes associated with CRC risk, linking CRC predisposition to steroid hormone, protein synthesis and degradation, and autophagy pathways and providing added insight into the mechanism of CRC pathogenesis. PMID:26965516

  17. The insulin-sensitivity sulphonylurea receptor variant is associated with thyrotoxic paralysis.

    PubMed

    Rolim, Ana Luiza R; Lindsey, Susan C; Kunii, Ilda S; Crispim, Felipe; Moisés, Regina Célia M S; Maciel, Rui M B; Dias-da-Silva, Magnus R

    2014-10-01

    Thyrotoxicosis is the most common cause of the acquired flaccid muscle paralysis in adults called thyrotoxic periodic paralysis (TPP) and is characterised by transient hypokalaemia and hypophosphataemia under high thyroid hormone levels that is frequently precipitated by carbohydrate load. The sulphonylurea receptor 1 (SUR1 (ABCC8)) is an essential regulatory subunit of the β-cell ATP-sensitive K(+) channel that controls insulin secretion after feeding. Additionally, the SUR1 Ala1369Ser variant appears to be associated with insulin sensitivity. We examined the ABCC8 gene at the single nucleotide level using PCR-restriction fragment length polymorphism (RFLP) analysis to determine its allelic variant frequency and calculated the frequency of the Ala1369Ser C-allele variant in a cohort of 36 Brazilian TPP patients in comparison with 32 controls presenting with thyrotoxicosis without paralysis (TWP). We verified that the frequency of the alanine 1369 C-allele was significantly higher in TPP patients than in TWP patients (61.1 vs 34.4%, odds ratio (OR)=3.42, P=0.039) and was significantly more common than the minor allele frequency observed in the general population from the 1000 Genomes database (61.1 vs 29.0%, OR=4.87, P<0.005). Additionally, the C-allele frequency was similar between TWP patients and the general population (34.4 vs 29%, OR=1.42, P=0.325). We have demonstrated that SUR1 alanine 1369 variant is associated with allelic susceptibility to TPP. We suggest that the hyperinsulinaemia that is observed in TPP may be linked to the ATP-sensitive K(+)/SUR1 alanine variant and, therefore, contribute to the major feedforward precipitating factors in the pathophysiology of TPP. © 2014 Society for Endocrinology.

  18. Complex phenotype of dyskeratosis congenita and mood dysregulation with novel homozygous RTEL1 and TPH1 variants.

    PubMed

    Ungar, Rachel A; Giri, Neelam; Pao, Maryland; Khincha, Payal P; Zhou, Weiyin; Alter, Blanche P; Savage, Sharon A

    2018-06-01

    Dyskeratosis congenita (DC) is an inherited bone marrow failure syndrome caused by germline mutations in telomere biology genes. Patients have extremely short telomeres for their age and a complex phenotype including oral leukoplakia, abnormal skin pigmentation, and dysplastic nails in addition to bone marrow failure, pulmonary fibrosis, stenosis of the esophagus, lacrimal ducts and urethra, developmental anomalies, and high risk of cancer. We evaluated a patient with features of DC, mood dysregulation, diabetes, and lack of pubertal development. Family history was not available but genome-wide genotyping was consistent with consanguinity. Whole exome sequencing identified 82 variants of interest in 80 genes based on the following criteria: homozygous, <0.1% minor allele frequency in public and in-house databases, nonsynonymous, and predicted deleterious by multiple in silico prediction programs. Six genes were identified as likely contributory to the clinical presentation. DC in this patient is likely due to homozygous splice site variants in regulator of telomere elongation helicase 1, a known DC and telomere biology gene. A homozygous missense variant in tryptophan hydroxylase 1 may be clinically important as this gene encodes the rate-limiting step in serotonin biosynthesis, a biologic pathway connected with mood disorders. Four additional genes (SCN4A, LRP4, GDAP1L1, and SPTBN5) had rare, missense homozygous variants that we speculate may contribute to portions of the clinical phenotype. This case illustrates the value of conducting detailed clinical and genomic evaluations on rare patients in order to identify new areas of research into the functional consequences of rare variants and their contribution to human disease. © 2018 Wiley Periodicals, Inc.

  19. Divergent Ah Receptor Ligand Selectivity during Hominin Evolution

    PubMed Central

    Hubbard, Troy D.; Murray, Iain A.; Bisson, William H.; Sullivan, Alexis P.; Sebastian, Aswathy; Perry, George H.; Jablonski, Nina G.; Perdew, Gary H.

    2016-01-01

    We have identified a fixed nonsynonymous sequence difference between humans (Val381; derived variant) and Neandertals (Ala381; ancestral variant) in the ligand-binding domain of the aryl hydrocarbon receptor (AHR) gene. In an exome sequence analysis of four Neandertal and Denisovan individuals compared with nine modern humans, there are only 90 total nucleotide sites genome-wide for which archaic hominins are fixed for the ancestral nonsynonymous variant and the modern humans are fixed for the derived variant. Of those sites, only 27, including Val381 in the AHR, also have no reported variability in the human dbSNP database, further suggesting that this highly conserved functional variant is a rare event. Functional analysis of the amino acid variant Ala381 within the AHR carried by Neandertals and nonhuman primates indicates enhanced polycyclic aromatic hydrocarbon (PAH) binding, DNA binding capacity, and AHR-mediated transcriptional activity compared with the human AHR. Also relative to human AHR, the Neandertal AHR exhibited 150–1000 times greater sensitivity to induction of Cyp1a1 and Cyp1b1 expression by PAHs (e.g., benzo(a)pyrene). The resulting CYP1A1/CYP1B1 enzymes are responsible for PAH first-pass metabolism, which can result in the generation of toxic intermediates and perhaps AHR-associated toxicities. In contrast, the human AHR retains the ancestral sensitivity observed in primates to nontoxic endogenous AHR ligands (e.g., indole, indoxyl sulfate). Our findings reveal that a functionally significant change in the AHR occurred uniquely in humans, relative to other primates, that would attenuate the response to many environmental pollutants, including chemicals present in smoke from fire use during cooking. PMID:27486223

  20. Is integrative use of techniques in psychotherapy the exception or the rule? Results of a national survey of doctoral-level practitioners.

    PubMed

    Thoma, Nathan C; Cecero, John J

    2009-12-01

    This study sought to investigate the extent to which therapists endorse techniques outside of their self-identified orientation and which techniques are endorsed across orientations. A survey consisting of 127 techniques from 8 major theories of psychotherapy was administered via U.S. mail to a national random sample of doctoral-level psychotherapy practitioners. The 201 participants endorsed substantial numbers of techniques from outside their respective orientations. Many of these techniques were quite different from those of the core theories of the respective orientations. Further examining when and why experienced practitioners switch to techniques outside their primary orientation may help reveal where certain techniques fall short and where others excel, indicating a need for further research that taps the collective experience of practitioners. (PsycINFO Database Record (c) 2010 APA, all rights reserved).
