Watson, Christopher M.; Crinnion, Laura A.; Gurgel‐Gianetti, Juliana; Harrison, Sally M.; Daly, Catherine; Antanavicuite, Agne; Lascelles, Carolina; Markham, Alexander F.; Pena, Sergio D. J.; Bonthron, David T.
2015-01-01
ABSTRACT Autozygosity mapping is a powerful technique for the identification of rare, autosomal recessive, disease‐causing genes. The ease with which this category of disease gene can be identified has greatly increased through the availability of genome‐wide SNP genotyping microarrays and subsequently of exome sequencing. Although these methods have simplified the generation of experimental data, its analysis, particularly when disparate data types must be integrated, remains time consuming. Moreover, the huge volume of sequence variant data generated from next generation sequencing experiments opens up the possibility of using these data instead of microarray genotype data to identify disease loci. To allow these two types of data to be used in an integrated fashion, we have developed AgileVCFMapper, a program that performs both the mapping of disease loci by SNP genotyping and the analysis of potentially deleterious variants using exome sequence variant data, in a single step. This method does not require microarray SNP genotype data, although analysis with a combination of microarray and exome genotype data enables more precise delineation of disease loci, due to superior marker density and distribution. PMID:26037133
Interim report on updated microarray probes for the LLNL Burkholderia pseudomallei SNP array
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gardner, S; Jaing, C
2012-03-27
The overall goal of this project is to forensically characterize 100 unknown Burkholderia isolates in the US-Australia collaboration. We will identify genome-wide single nucleotide polymorphisms (SNPs) from B. pseudomallei and near neighbor species including B. mallei, B. thailandensis and B. oklahomensis. We will design microarray probes to detect these SNP markers and analyze 100 Burkholderia genomic DNAs extracted from environmental, clinical and near neighbor isolates from Australian collaborators on the Burkholderia SNP microarray. We will analyze the microarray genotyping results to characterize the genetic diversity of these new isolates and triage the samples for whole genome sequencing. In this interimmore » report, we described the SNP analysis and the microarray probe design for the Burkholderia SNP microarray.« less
Guo, Xi; Geng, Peng; Wang, Quan; Cao, Boyang; Liu, Bin
2014-10-01
Severe acute respiratory syndrome (SARS), a disease that spread widely in the world during late 2002 to 2004, severely threatened public health. Although there have been no reported infections since 2004, the extremely pathogenic SARS coronavirus (SARS-CoV), as the causative agent of SARS, has recently been identified in animals, showing the potential for the re-emergence of this disease. Previous studies showed that 27 single nucleotide polymorphism (SNP) mutations among the spike (S) gene of this virus are correlated closely with the SARS pathogenicity and epidemicity. We have developed a SNP DNA microarray in order to detect and genotype these SNPs, and to obtain related information on the pathogenicity and epidemicity of a given strain. The microarray was hybridized with PCR products amplified from cDNAs obtained from different SARS-CoV strains. We were able to detect 24 SNPs and determine the type of a given strain. The hybridization profile showed that 19 samples were detected and genotyped correctly by using our microarray, with 100% accuracy. Our microarray provides a novel method for the detection and epidemiological surveillance of SARS-CoV.
Maslow, Bat-Sheva L; Budinetz, Tara; Sueldo, Carolina; Anspach, Erica; Engmann, Lawrence; Benadiva, Claudio; Nulsen, John C
2015-07-01
To compare the analysis of chromosome number from paraffin-embedded products of conception using single-nucleotide polymorphism (SNP) microarray with the recommended screening for the evaluation of couples presenting with recurrent pregnancy loss who do not have previous fetal cytogenetic data. We performed a retrospective cohort study including all women who presented for a new evaluation of recurrent pregnancy loss over a 2-year period (January 1, 2012, to December 31, 2013). All participants had at least two documented first-trimester losses and both the recommended screening tests and SNP microarray performed on at least one paraffin-embedded products of conception sample. Single-nucleotide polymorphism microarray identifies all 24 chromosomes (22 autosomes, X, and Y). Forty-two women with a total of 178 losses were included in the study. Paraffin-embedded products of conception from 62 losses were sent for SNP microarray. Single-nucleotide polymorphism microarray successfully diagnosed fetal chromosome number in 71% (44/62) of samples, of which 43% (19/44) were euploid and 57% (25/44) were noneuploid. Seven of 42 (17%) participants had abnormalities on recurrent pregnancy loss screening. The per-person detection rate for a cause of pregnancy loss was significantly higher in the SNP microarray (0.50; 95% confidence interval [CI] 0.36-0.64) compared with recurrent pregnancy loss evaluation (0.17; 95% CI 0.08-0.31) (P=.002). Participants with one or more euploid loss identified on paraffin-embedded products of conception were significantly more likely to have an abnormality on recurrent pregnancy loss screening than those with only noneuploid results (P=.028). The significance remained when controlling for age, number of losses, number of samples, and total pregnancies. These results suggest that SNP microarray testing of paraffin-embedded products of conception is a valuable tool for the evaluation of recurrent pregnancy loss in patients without prior fetal cytogenetic results. Recommended recurrent pregnancy loss screening was unnecessary in almost half the patients in our study. II.
DOE Office of Scientific and Technical Information (OSTI.GOV)
SacconePhD, Scott F; Chesler, Elissa J; Bierut, Laura J
Commercial SNP microarrays now provide comprehensive and affordable coverage of the human genome. However, some diseases have biologically relevant genomic regions that may require additional coverage. Addiction, for example, is thought to be influenced by complex interactions among many relevant genes and pathways. We have assembled a list of 486 biologically relevant genes nominated by a panel of experts on addiction. We then added 424 genes that showed evidence of association with addiction phenotypes through mouse QTL mappings and gene co-expression analysis. We demonstrate that there are a substantial number of SNPs in these genes that are not well representedmore » by commercial SNP platforms. We address this problem by introducing a publicly available SNP database for addiction. The database is annotated using numeric prioritization scores indicating the extent of biological relevance. The scores incorporate a number of factors such as SNP/gene functional properties (including synonymy and promoter regions), data from mouse systems genetics and measures of human/mouse evolutionary conservation. We then used HapMap genotyping data to determine if a SNP is tagged by a commercial microarray through linkage disequilibrium. This combination of biological prioritization scores and LD tagging annotation will enable addiction researchers to supplement commercial SNP microarrays to ensure comprehensive coverage of biologically relevant regions.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gardner, Shea N.; McLoughlin, Kevin; Be, Nicholas A.
Venezuelan equine encephalitis virus (VEEV) is a mosquito-borne alphavirus that has caused large outbreaks of severe illness in both horses and humans. New approaches are needed to rapidly infer the origin of a newly discovered VEEV strain, estimate its equine amplification and resultant epidemic potential, and predict human virulence phenotype. We performed whole genome single nucleotide polymorphism (SNP) analysis of all available VEE antigenic complex genomes, verified that a SNP-based phylogeny accurately captured the features of a phylogenetic tree based on multiple sequence alignment, and developed a high resolution genome-wide SNP microarray. We used the microarray to analyze a broadmore » panel of VEEV isolates, found excellent concordance between array- and sequence-based SNP calls, genotyped unsequenced isolates, and placed them on a phylogeny with sequenced genomes. The microarray successfully genotyped VEEV directly from tissue samples of an infected mouse, bypassing the need for viral isolation, culture and genomic sequencing. Lastly, we identified genomic variants associated with serotypes and host species, revealing a complex relationship between genotype and phenotype.« less
Analysis of population structure and genetic history of cattle breeds based on high-density SNP data
USDA-ARS?s Scientific Manuscript database
Advances in single nucleotide polymorphism (SNP) genotyping microarrays have facilitated a new understanding of population structure and evolutionary history for several species. Most existing studies in livestock were based on low density SNP arrays. The first wave of low density SNP studies on cat...
Hu, Guohong; Wang, Hui-Yun; Greenawalt, Danielle M.; Azaro, Marco A.; Luo, Minjie; Tereshchenko, Irina V.; Cui, Xiangfeng; Yang, Qifeng; Gao, Richeng; Shen, Li; Li, Honghua
2006-01-01
Microarray-based analysis of single nucleotide polymorphisms (SNPs) has many applications in large-scale genetic studies. To minimize the influence of experimental variation, microarray data usually need to be processed in different aspects including background subtraction, normalization and low-signal filtering before genotype determination. Although many algorithms are sophisticated for these purposes, biases are still present. In the present paper, new algorithms for SNP microarray data analysis and the software, AccuTyping, developed based on these algorithms are described. The algorithms take advantage of a large number of SNPs included in each assay, and the fact that the top and bottom 20% of SNPs can be safely treated as homozygous after sorting based on their ratios between the signal intensities. These SNPs are then used as controls for color channel normalization and background subtraction. Genotype calls are made based on the logarithms of signal intensity ratios using two cutoff values, which were determined after training the program with a dataset of ∼160 000 genotypes and validated by non-microarray methods. AccuTyping was used to determine >300 000 genotypes of DNA and sperm samples. The accuracy was shown to be >99%. AccuTyping can be downloaded from . PMID:16982644
Huang, Chao-Wei; Lin, Yu-Tsung; Ding, Shih-Torng; Lo, Ling-Ling; Wang, Pei-Hwa; Lin, En-Chung; Liu, Fang-Wei; Lu, Yen-Wen
2015-01-01
The genetic markers associated with economic traits have been widely explored for animal breeding. Among these markers, single-nucleotide polymorphism (SNPs) are gradually becoming a prevalent and effective evaluation tool. Since SNPs only focus on the genetic sequences of interest, it thereby reduces the evaluation time and cost. Compared to traditional approaches, SNP genotyping techniques incorporate informative genetic background, improve the breeding prediction accuracy and acquiesce breeding quality on the farm. This article therefore reviews the typical procedures of animal breeding using SNPs and the current status of related techniques. The associated SNP information and genotyping techniques, including microarray and Lab-on-a-Chip based platforms, along with their potential are highlighted. Examples in pig and poultry with different SNP loci linked to high economic trait values are given. The recommendations for utilizing SNP genotyping in nimal breeding are summarized. PMID:27600241
Characterization of genetic variability of Venezuelan equine encephalitis viruses
Gardner, Shea N.; McLoughlin, Kevin; Be, Nicholas A.; ...
2016-04-07
Venezuelan equine encephalitis virus (VEEV) is a mosquito-borne alphavirus that has caused large outbreaks of severe illness in both horses and humans. New approaches are needed to rapidly infer the origin of a newly discovered VEEV strain, estimate its equine amplification and resultant epidemic potential, and predict human virulence phenotype. We performed whole genome single nucleotide polymorphism (SNP) analysis of all available VEE antigenic complex genomes, verified that a SNP-based phylogeny accurately captured the features of a phylogenetic tree based on multiple sequence alignment, and developed a high resolution genome-wide SNP microarray. We used the microarray to analyze a broadmore » panel of VEEV isolates, found excellent concordance between array- and sequence-based SNP calls, genotyped unsequenced isolates, and placed them on a phylogeny with sequenced genomes. The microarray successfully genotyped VEEV directly from tissue samples of an infected mouse, bypassing the need for viral isolation, culture and genomic sequencing. Lastly, we identified genomic variants associated with serotypes and host species, revealing a complex relationship between genotype and phenotype.« less
Construction of a versatile SNP array for pyramiding useful genes of rice.
Kurokawa, Yusuke; Noda, Tomonori; Yamagata, Yoshiyuki; Angeles-Shim, Rosalyn; Sunohara, Hidehiko; Uehara, Kanako; Furuta, Tomoyuki; Nagai, Keisuke; Jena, Kshirod Kumar; Yasui, Hideshi; Yoshimura, Atsushi; Ashikari, Motoyuki; Doi, Kazuyuki
2016-01-01
DNA marker-assisted selection (MAS) has become an indispensable component of breeding. Single nucleotide polymorphisms (SNP) are the most frequent polymorphism in the rice genome. However, SNP markers are not readily employed in MAS because of limitations in genotyping platforms. Here the authors report a Golden Gate SNP array that targets specific genes controlling yield-related traits and biotic stress resistance in rice. As a first step, the SNP genotypes were surveyed in 31 parental varieties using the Affymetrix Rice 44K SNP microarray. The haplotype information for 16 target genes was then converted to the Golden Gate platform with 143-plex markers. Haplotypes for the 14 useful allele are unique and can discriminate among all other varieties. The genotyping consistency between the Affymetrix microarray and the Golden Gate array was 92.8%, and the accuracy of the Golden Gate array was confirmed in 3 F2 segregating populations. The concept of the haplotype-based selection by using the constructed SNP array was proofed. Copyright © 2015 The Authors. Published by Elsevier Ireland Ltd.. All rights reserved.
Alexiev, Borislav A; Zou, Ying S
2014-12-01
Chromosomal microarray analysis using novel Molecular Inversion Probe (MIP) technology demonstrated 2,570 kb copy neutral LOH of 10q11.22 in two clear cell papillary renal cell carcinomas. In addition, one of the tumors had a big 29,784 kb deletion of 13q11-q14.2. There were two variants of unknown significance, a 2,509 kb gain of Xp22.33 and a 257 kb homozygous deletion of 8p11.22. The somatic mutation panel containing 74 mutations in nine genes did not reveal any mutations. Besides identification of submicroscopic duplications or deletions, SNP microarrays can reveal abnormal allelic imbalances including LOH and copy neutral LOH, which cannot be recognized by chromosome, FISH, and non-SNP microarray arrays. To the best of our knowledge, this is the first study demonstrating copy neutral LOH of 10q11.22 in clear cell papillary renal cell carcinomas using the new MIP SNP OncoScan FFPE Assay Kit on formalin-fixed paraffin-embedded tumor samples. Copyright © 2014 Elsevier GmbH. All rights reserved.
Abbey, Darren; Hickman, Meleah; Gresham, David; Berman, Judith
2011-01-01
Phenotypic diversity can arise rapidly through loss of heterozygosity (LOH) or by the acquisition of copy number variations (CNV) spanning whole chromosomes or shorter contiguous chromosome segments. In Candida albicans, a heterozygous diploid yeast pathogen with no known meiotic cycle, homozygosis and aneuploidy alter clinical characteristics, including drug resistance. Here, we developed a high-resolution microarray that simultaneously detects ∼39,000 single nucleotide polymorphism (SNP) alleles and ∼20,000 copy number variation loci across the C. albicans genome. An important feature of the array analysis is a computational pipeline that determines SNP allele ratios based upon chromosome copy number. Using the array and analysis tools, we constructed a haplotype map (hapmap) of strain SC5314 to assign SNP alleles to specific homologs, and we used it to follow the acquisition of loss of heterozygosity (LOH) and copy number changes in a series of derived laboratory strains. This high-resolution SNP/CGH microarray and the associated hapmap facilitated the phasing of alleles in lab strains and revealed detrimental genome changes that arose frequently during molecular manipulations of laboratory strains. Furthermore, it provided a useful tool for rapid, high-resolution, and cost-effective characterization of changes in allele diversity as well as changes in chromosome copy number in new C. albicans isolates. PMID:22384363
DOE Office of Scientific and Technical Information (OSTI.GOV)
Jaing, C; Gardner, S
The goal of this project is to develop forensic genotyping assays for select agent viruses, enhancing the current capabilities for the viral bioforensics and law enforcement community. We used a multipronged approach combining bioinformatics analysis, PCR-enriched samples, microarrays and TaqMan assays to develop high resolution and cost effective genotyping methods for strain level forensic discrimination of viruses. We have leveraged substantial experience and efficiency gained through year 1 on software development, SNP discovery, TaqMan signature design and phylogenetic signature mapping to scale up the development of forensics signatures in year 2. In this report, we have summarized the whole genomemore » wide SNP analysis and microarray probe design for forensics characterization of South American hemorrhagic fever viruses, tick-borne encephalitis viruses and henipaviruses, Old World Arenaviruses, filoviruses, Crimean-Congo hemorrhagic fever virus, Rift Valley fever virus and Japanese encephalitis virus.« less
Lopez, G H; Morrison, J; Condon, J A; Wilson, B; Martin, J R; Liew, Y-W; Flower, R L; Hyland, C A
2015-10-01
Duffy blood group phenotypes can be predicted by genotyping for single nucleotide polymorphisms (SNPs) responsible for the Fy(a) /Fy(b) polymorphism, for weak Fy(b) antigen, and for the red cell null Fy(a-b-) phenotype. This study correlates Duffy phenotype predictions with serotyping to assess the most reliable procedure for typing. Samples, n = 155 (135 donors and 20 patients), were genotyped by high-resolution melt PCR and by microarray. Samples were in three serology groups: 1) Duffy patterns expected n = 79, 2) weak and equivocal Fy(b) patterns n = 29 and 3) Fy(a-b-) n = 47 (one with anti-Fy3 antibody). Discrepancies were observed for five samples. For two, SNP genotyping predicted weak Fy(b) expression discrepant with Fy(b-) (Group 1 and 3). For three, SNP genotyping predicted Fy(a) , discrepant with Fy(a-b-) (Group 3). DNA sequencing identified silencing mutations in these FY*A alleles. One was a novel FY*A 719delG. One, the sample with the anti-Fy3, was homozygous for a 14-bp deletion (FY*01N.02); a true null. Both the high-resolution melting analysis and SNP microarray assays were concordant and showed genotyping, as well as phenotyping, is essential to ensure 100% accuracy for Duffy blood group assignments. Sequencing is important to resolve phenotype/genotype conflicts which here identified alleles, one novel, that carry silencing mutations. The risk of alloimmunisation may be dependent on this zygosity status. © 2015 International Society of Blood Transfusion.
Dynamic variable selection in SNP genotype autocalling from APEX microarray data.
Podder, Mohua; Welch, William J; Zamar, Ruben H; Tebbutt, Scott J
2006-11-30
Single nucleotide polymorphisms (SNPs) are DNA sequence variations, occurring when a single nucleotide--adenine (A), thymine (T), cytosine (C) or guanine (G)--is altered. Arguably, SNPs account for more than 90% of human genetic variation. Our laboratory has developed a highly redundant SNP genotyping assay consisting of multiple probes with signals from multiple channels for a single SNP, based on arrayed primer extension (APEX). This mini-sequencing method is a powerful combination of a highly parallel microarray with distinctive Sanger-based dideoxy terminator sequencing chemistry. Using this microarray platform, our current genotype calling system (known as SNP Chart) is capable of calling single SNP genotypes by manual inspection of the APEX data, which is time-consuming and exposed to user subjectivity bias. Using a set of 32 Coriell DNA samples plus three negative PCR controls as a training data set, we have developed a fully-automated genotyping algorithm based on simple linear discriminant analysis (LDA) using dynamic variable selection. The algorithm combines separate analyses based on the multiple probe sets to give a final posterior probability for each candidate genotype. We have tested our algorithm on a completely independent data set of 270 DNA samples, with validated genotypes, from patients admitted to the intensive care unit (ICU) of St. Paul's Hospital (plus one negative PCR control sample). Our method achieves a concordance rate of 98.9% with a 99.6% call rate for a set of 96 SNPs. By adjusting the threshold value for the final posterior probability of the called genotype, the call rate reduces to 94.9% with a higher concordance rate of 99.6%. We also reversed the two independent data sets in their training and testing roles, achieving a concordance rate up to 99.8%. The strength of this APEX chemistry-based platform is its unique redundancy having multiple probes for a single SNP. Our model-based genotype calling algorithm captures the redundancy in the system considering all the underlying probe features of a particular SNP, automatically down-weighting any 'bad data' corresponding to image artifacts on the microarray slide or failure of a specific chemistry. In this regard, our method is able to automatically select the probes which work well and reduce the effect of other so-called bad performing probes in a sample-specific manner, for any number of SNPs.
Analysis and visualization of chromosomal abnormalities in SNP data with SNPscan
Ting, Jason C; Ye, Ying; Thomas, George H; Ruczinski, Ingo; Pevsner, Jonathan
2006-01-01
Background A variety of diseases are caused by chromosomal abnormalities such as aneuploidies (having an abnormal number of chromosomes), microdeletions, microduplications, and uniparental disomy. High density single nucleotide polymorphism (SNP) microarrays provide information on chromosomal copy number changes, as well as genotype (heterozygosity and homozygosity). SNP array studies generate multiple types of data for each SNP site, some with more than 100,000 SNPs represented on each array. The identification of different classes of anomalies within SNP data has been challenging. Results We have developed SNPscan, a web-accessible tool to analyze and visualize high density SNP data. It enables researchers (1) to visually and quantitatively assess the quality of user-generated SNP data relative to a benchmark data set derived from a control population, (2) to display SNP intensity and allelic call data in order to detect chromosomal copy number anomalies (duplications and deletions), (3) to display uniparental isodisomy based on loss of heterozygosity (LOH) across genomic regions, (4) to compare paired samples (e.g. tumor and normal), and (5) to generate a file type for viewing SNP data in the University of California, Santa Cruz (UCSC) Human Genome Browser. SNPscan accepts data exported from Affymetrix Copy Number Analysis Tool as its input. We validated SNPscan using data generated from patients with known deletions, duplications, and uniparental disomy. We also inspected previously generated SNP data from 90 apparently normal individuals from the Centre d'Étude du Polymorphisme Humain (CEPH) collection, and identified three cases of uniparental isodisomy, four females having an apparently mosaic X chromosome, two mislabelled SNP data sets, and one microdeletion on chromosome 2 with mosaicism from an apparently normal female. These previously unrecognized abnormalities were all detected using SNPscan. The microdeletion was independently confirmed by fluorescence in situ hybridization, and a region of homozygosity in a UPD case was confirmed by sequencing of genomic DNA. Conclusion SNPscan is useful to identify chromosomal abnormalities based on SNP intensity (such as chromosomal copy number changes) and heterozygosity data (including regions of LOH and some cases of UPD). The program and source code are available at the SNPscan website . PMID:16420694
A Discovery Resource of Rare Copy Number Variations in Individuals with Autism Spectrum Disorder
Prasad, Aparna; Merico, Daniele; Thiruvahindrapuram, Bhooma; Wei, John; Lionel, Anath C.; Sato, Daisuke; Rickaby, Jessica; Lu, Chao; Szatmari, Peter; Roberts, Wendy; Fernandez, Bridget A.; Marshall, Christian R.; Hatchwell, Eli; Eis, Peggy S.; Scherer, Stephen W.
2012-01-01
The identification of rare inherited and de novo copy number variations (CNVs) in human subjects has proven a productive approach to highlight risk genes for autism spectrum disorder (ASD). A variety of microarrays are available to detect CNVs, including single-nucleotide polymorphism (SNP) arrays and comparative genomic hybridization (CGH) arrays. Here, we examine a cohort of 696 unrelated ASD cases using a high-resolution one-million feature CGH microarray, the majority of which were previously genotyped with SNP arrays. Our objective was to discover new CNVs in ASD cases that were not detected by SNP microarray analysis and to delineate novel ASD risk loci via combined analysis of CGH and SNP array data sets on the ASD cohort and CGH data on an additional 1000 control samples. Of the 615 ASD cases analyzed on both SNP and CGH arrays, we found that 13,572 of 21,346 (64%) of the CNVs were exclusively detected by the CGH array. Several of the CGH-specific CNVs are rare in population frequency and impact previously reported ASD genes (e.g., NRXN1, GRM8, DPYD), as well as novel ASD candidate genes (e.g., CIB2, DAPP1, SAE1), and all were inherited except for a de novo CNV in the GPHN gene. A functional enrichment test of gene-sets in ASD cases over controls revealed nucleotide metabolism as a potential novel pathway involved in ASD, which includes several candidate genes for follow-up (e.g., DPYD, UPB1, UPP1, TYMP). Finally, this extensively phenotyped and genotyped ASD clinical cohort serves as an invaluable resource for the next step of genome sequencing for complete genetic variation detection. PMID:23275889
Karampetsou, Evangelia; Morrogh, Deborah; Chitty, Lyn
2014-01-01
The advantage of microarray (array) over conventional karyotype for the diagnosis of fetal pathogenic chromosomal anomalies has prompted the use of microarrays in prenatal diagnostics. In this review we compare the performance of different array platforms (BAC, oligonucleotide CGH, SNP) and designs (targeted, whole genome, whole genome, and targeted, custom) and discuss their advantages and disadvantages in relation to prenatal testing. We also discuss the factors to consider when implementing a microarray testing service for the diagnosis of fetal chromosomal aberrations. PMID:26237396
Hartmann, Luise; Stephenson, Christine F; Verkamp, Stephanie R; Johnson, Krystal R; Burnworth, Bettina; Hammock, Kelle; Brodersen, Lisa Eidenschink; de Baca, Monica E; Wells, Denise A; Loken, Michael R; Zehentner, Barbara K
2014-12-01
Array comparative genomic hybridization (aCGH) has become a powerful tool for analyzing hematopoietic neoplasms and identifying genome-wide copy number changes in a single assay. aCGH also has superior resolution compared with fluorescence in situ hybridization (FISH) or conventional cytogenetics. Integration of single nucleotide polymorphism (SNP) probes with microarray analysis allows additional identification of acquired uniparental disomy, a copy neutral aberration with known potential to contribute to tumor pathogenesis. However, a limitation of microarray analysis has been the inability to detect clonal heterogeneity in a sample. This study comprised 16 samples (acute myeloid leukemia, myelodysplastic syndrome, chronic lymphocytic leukemia, plasma cell neoplasm) with complex cytogenetic features and evidence of clonal evolution. We used an integrated manual peak reassignment approach combining analysis of aCGH and SNP microarray data for characterization of subclonal abnormalities. We compared array findings with results obtained from conventional cytogenetic and FISH studies. Clonal heterogeneity was detected in 13 of 16 samples by microarray on the basis of log2 values. Use of the manual peak reassignment analysis approach improved resolution of the sample's clonal composition and genetic heterogeneity in 10 of 13 (77%) patients. Moreover, in 3 patients, clonal disease progression was revealed by array analysis that was not evident by cytogenetic or FISH studies. Genetic abnormalities originating from separate clonal subpopulations can be identified and further characterized by combining aCGH and SNP hybridization results from 1 integrated microarray chip by use of the manual peak reassignment technique. Its clinical utility in comparison to conventional cytogenetic or FISH studies is demonstrated. © 2014 American Association for Clinical Chemistry.
Zhang, Linsheng; Znoyko, Iya; Costa, Luciano J; Conlin, Laura K; Daber, Robert D; Self, Sally E; Wolff, Daynna J
2011-12-01
Chronic lymphocytic leukemia (CLL) is a clinically heterogeneous disease. The methods currently used for monitoring CLL and determining conditions for treatment are limited in their ability to predict disease progression, patient survival, and response to therapy. Although clonal diversity and the acquisition of new chromosomal abnormalities during the disease course (clonal evolution) have been associated with disease progression, their prognostic potential has been underappreciated because cytogenetic and fluorescence in situ hybridization (FISH) studies have a restricted ability to detect genomic abnormalities and clonal evolution. We hypothesized that whole genome analysis using high resolution single nucleotide polymorphism (SNP) microarrays would be useful to detect diversity and infer clonal evolution to offer prognostic information. In this study, we used the Infinium Omni1 BeadChip (Illumina, San Diego, CA) array for the analysis of genetic variation and percent mosaicism in 25 non-selected CLL patients to explore the prognostic value of the assessment of clonal diversity in patients with CLL. We calculated the percentage of mosaicism for each abnormality by applying a mathematical algorithm to the genotype frequency data and by manual determination using the Simulated DNA Copy Number (SiDCoN) tool, which was developed from a computer model of mosaicism. At least one genetic abnormality was identified in each case, and the SNP data was 98% concordant with FISH results. Clonal diversity, defined as the presence of two or more genetic abnormalities with differing percentages of mosaicism, was observed in 12 patients (48%), and the diversity correlated with the disease stage. Clonal diversity was present in most cases of advanced disease (Rai stages III and IV) or those with previous treatment, whereas 9 of 13 patients without detected clonal diversity were asymptomatic or clinically stable. In conclusion, SNP microarray studies with simultaneous evaluation of genomic alterations and mosaic distribution of clones can be used to assess apparent clonal evolution via analysis of clonal diversity. Since clonal evolution in CLL is strongly correlated with disease progression, whole genome SNP microarray analysis provides a new comprehensive and reliable prognostic tool for CLL patients. Copyright © 2011 Elsevier Inc. All rights reserved.
GStream: Improving SNP and CNV Coverage on Genome-Wide Association Studies
Alonso, Arnald; Marsal, Sara; Tortosa, Raül; Canela-Xandri, Oriol; Julià, Antonio
2013-01-01
We present GStream, a method that combines genome-wide SNP and CNV genotyping in the Illumina microarray platform with unprecedented accuracy. This new method outperforms previous well-established SNP genotyping software. More importantly, the CNV calling algorithm of GStream dramatically improves the results obtained by previous state-of-the-art methods and yields an accuracy that is close to that obtained by purely CNV-oriented technologies like Comparative Genomic Hybridization (CGH). We demonstrate the superior performance of GStream using microarray data generated from HapMap samples. Using the reference CNV calls generated by the 1000 Genomes Project (1KGP) and well-known studies on whole genome CNV characterization based either on CGH or genotyping microarray technologies, we show that GStream can increase the number of reliably detected variants up to 25% compared to previously developed methods. Furthermore, the increased genome coverage provided by GStream allows the discovery of CNVs in close linkage disequilibrium with SNPs, previously associated with disease risk in published Genome-Wide Association Studies (GWAS). These results could provide important insights into the biological mechanism underlying the detected disease risk association. With GStream, large-scale GWAS will not only benefit from the combined genotyping of SNPs and CNVs at an unprecedented accuracy, but will also take advantage of the computational efficiency of the method. PMID:23844243
Nie, Bei; Yang, Min; Fu, Weiling; Liang, Zhiqing
2015-07-07
The surface invasive cleavage assay, because of its innate accuracy and ability for self-signal amplification, provides a potential route for the mapping of hundreds of thousands of human SNP sites. However, its performance on a high density DNA array has not yet been established, due to the unusual "hairpin" probe design on the microarray and the lack of chemical stability of commercially available substrates. Here we present an applicable method to implement a nanocrystalline diamond thin film as an alternative substrate for fabricating an addressable DNA array using maskless light-directed photochemistry, producing the most chemically stable and biocompatible system for genetic analysis and enzymatic reactions. The surface invasive cleavage reaction, followed by degenerated primer ligation and post-rolling circle amplification is consecutively performed on the addressable diamond DNA array, accurately mapping SNP sites from PCR-amplified human genomic target DNA. Furthermore, a specially-designed DNA array containing dual probes in the same pixel is fabricated by following a reverse light-directed DNA synthesis protocol. This essentially enables us to decipher thousands of SNP alleles in a single-pot reaction by the simple addition of enzyme, target and reaction buffers.
Xu, Lingyang; Hou, Yali; Bickhart, Derek M; Song, Jiuzhou; Liu, George E
2013-06-25
Copy number variations (CNVs) are gains and losses of genomic sequence between two individuals of a species when compared to a reference genome. The data from single nucleotide polymorphism (SNP) microarrays are now routinely used for genotyping, but they also can be utilized for copy number detection. Substantial progress has been made in array design and CNV calling algorithms and at least 10 comparison studies in humans have been published to assess them. In this review, we first survey the literature on existing microarray platforms and CNV calling algorithms. We then examine a number of CNV calling tools to evaluate their impacts using bovine high-density SNP data. Large incongruities in the results from different CNV calling tools highlight the need for standardizing array data collection, quality assessment and experimental validation. Only after careful experimental design and rigorous data filtering can the impacts of CNVs on both normal phenotypic variability and disease susceptibility be fully revealed.
Grote, Lauren; Myers, Melanie; Lovell, Anne; Saal, Howard; Sund, Kristen Lipscomb
2014-01-01
SNP microarrays are capable of detecting regions of homozygosity (ROH) which can suggest parental relatedness. This study was designed to describe pre- and post-test counseling practices of genetics professionals regarding ROH, explore perceived comfort and ethical concerns in the follow-up of such results, demonstrate awareness of laws surrounding duty to report consanguinity and incest, and allow respondents to share their personal experiences with results suggesting a parental relationship. A 35 question survey was administered to 240 genetic counselors and geneticists who had ordered or counseled for SNP microarray. The results are presented using descriptive statistics. There was variation in both pre- and post-test counseling practices of genetics professionals. Twenty-five percent of respondents reported pre-test counseling that ROH can indicate parental relatedness. The most commonly reported ethical concern was disclosure of findings suggesting parental relatedness to parents of the patient; only 48.4% reported disclosing parental relatedness when indicated. Fifty-seven percent felt comfortable receiving results suggesting parental consanguinity while 17% felt comfortable receiving results suggesting parental incest. Twenty percent of respondents were extremely/moderately familiar with the laws about duty to report incest. Personal experiences in post-test counseling included both parental acknowledgement and denial of relatedness. This study highlights the differences in genetics professionals' pre- and post-test counseling practices, comfort, and experiences surrounding parental relatedness suggested by SNP microarray results. It identifies a need for professional organizations to offer guidance to genetics professionals about how to respond to and counsel for molecular results suggesting parental consanguinity or incest. © 2013 Wiley Periodicals, Inc.
2011-01-01
Background During gene conversion, genetic information is transferred unidirectionally between highly homologous but non-allelic regions of DNA. While germ-line gene conversion has been implicated in the pathogenesis of some diseases, somatic gene conversion has remained technically difficult to investigate on a large scale. Methods A novel analysis technique is proposed for detecting the signature of somatic gene conversion from SNP microarray data. The Wellcome Trust Case Control Consortium has gathered SNP microarray data for two control populations and cohorts for bipolar disorder (BD), cardiovascular disease (CAD), Crohn's disease (CD), hypertension (HT), rheumatoid arthritis (RA), type-1 diabetes (T1D) and type-2 diabetes (T2D). Using the new analysis technique, the seven disease cohorts are analyzed to identify cohort-specific SNPs at which conversion is predicted. The quality of the predictions is assessed by identifying known disease associations for genes in the homologous duplicons, and comparing the frequency of such associations with background rates. Results Of 28 disease/locus pairs meeting stringent conditions, 22 show various degrees of disease association, compared with only 8 of 70 in a mock study designed to measure the background association rate (P < 10-9). Additional candidate genes are identified using less stringent filtering conditions. In some cases, somatic deletions appear likely. RA has a distinctive pattern of events relative to other diseases. Similarities in patterns are apparent between BD and HT. Conclusions The associations derived represent the first evidence that somatic gene conversion could be a significant causative factor in each of the seven diseases. The specific genes provide potential insights about disease mechanisms, and are strong candidates for further study. Please see Commentary: http://www.biomedcentral.com/1741-7015/9/13/abstract. PMID:21291537
Ross, Kenneth Andrew
2011-02-03
During gene conversion, genetic information is transferred unidirectionally between highly homologous but non-allelic regions of DNA. While germ-line gene conversion has been implicated in the pathogenesis of some diseases, somatic gene conversion has remained technically difficult to investigate on a large scale. A novel analysis technique is proposed for detecting the signature of somatic gene conversion from SNP microarray data. The Wellcome Trust Case Control Consortium has gathered SNP microarray data for two control populations and cohorts for bipolar disorder (BD), cardiovascular disease (CAD), Crohn's disease (CD), hypertension (HT), rheumatoid arthritis (RA), type-1 diabetes (T1D) and type-2 diabetes (T2D). Using the new analysis technique, the seven disease cohorts are analyzed to identify cohort-specific SNPs at which conversion is predicted. The quality of the predictions is assessed by identifying known disease associations for genes in the homologous duplicons, and comparing the frequency of such associations with background rates. Of 28 disease/locus pairs meeting stringent conditions, 22 show various degrees of disease association, compared with only 8 of 70 in a mock study designed to measure the background association rate (P < 10-9). Additional candidate genes are identified using less stringent filtering conditions. In some cases, somatic deletions appear likely. RA has a distinctive pattern of events relative to other diseases. Similarities in patterns are apparent between BD and HT. The associations derived represent the first evidence that somatic gene conversion could be a significant causative factor in each of the seven diseases. The specific genes provide potential insights about disease mechanisms, and are strong candidates for further study.
Peter, Harald; Berggrav, Kathrine; Thomas, Peter; Pfeifer, Yvonne; Witte, Wolfgang; Templeton, Kate
2012-01-01
Klebsiella pneumoniae carbapenemases (KPCs) are considered a serious threat to antibiotic therapy, as they confer resistance to carbapenems, which are used to treat extended-spectrum beta-lactamase (ESBL)-producing bacteria. Here, we describe the development and evaluation of a DNA microarray for the detection and genotyping of KPC genes (blaKPC) within a 5-h period. To test the whole assay procedure (DNA extraction plus a DNA microarray assay) directly from clinical specimens, we compared two commercial DNA extraction kits (the QIAprep Spin miniprep kit [Qiagen] and the urine bacterial DNA isolation kit [Norgen]) for the direct DNA extraction from urine samples (dilution series spiked in human urine). Reliable single nucleotide polymorphism (SNP) typing was demonstrated using 1 × 105 CFU/ml urine for Escherichia coli (Qiagen and Norgen) and 80 CFU/ml urine, on average, for K. pneumoniae (Norgen). This study presents, for the first time, the combination of a new KPC microarray with commercial sample preparation for detecting and genotyping microbial pathogens directly from clinical specimens; this paves the way toward tests providing epidemiological and diagnostic data, enabling better antimicrobial stewardship. PMID:23035190
2014-01-01
Background Kidney stone disease (KSD) is a complex disorder with unknown etiology in majority of the patients. Genetic and environmental factors may cause the disease. In the present study, we used DNA microarray to genotype single nucleotide polymorphisms (SNP) and performed candidate gene association analysis to determine genetic variations associated with the disease. Methods A whole genome SNP genotyping by DNA microarray was initially conducted in 101 patients and 105 control subjects. A set of 104 candidate genes reported to be involved in KSD, gathered from public databases and candidate gene association study databases, were evaluated for their variations associated with KSD. Results Altogether 82 SNPs distributed within 22 candidate gene regions showed significant differences in SNP allele frequencies between the patient and control groups (P < 0.05). Of these, 4 genes including BGLAP, AHSG, CD44, and HAO1, encoding osteocalcin, fetuin-A, CD44-molecule and glycolate oxidase 1, respectively, were further assessed for their associations with the disease because they carried high proportion of SNPs with statistical differences of allele frequencies between the patient and control groups within the gene. The total of 26 SNPs showed significant differences of allele frequencies between the patient and control groups and haplotypes associated with disease risk were identified. The SNP rs759330 located 144 bp downstream of BGLAP where it is a predicted microRNA binding site at 3′UTR of PAQR6 – a gene encoding progestin and adipoQ receptor family member VI, was genotyped in 216 patients and 216 control subjects and found to have significant differences in its genotype and allele frequencies (P = 0.0007, OR 2.02 and P = 0.0001, OR 2.02, respectively). Conclusions Our results suggest that these candidate genes are associated with KSD and PAQR6 comes into our view as the most potent candidate since associated SNP rs759330 is located in the miRNA binding site and may affect mRNA expression level. PMID:24886237
Consolandi, Clarissa
2009-01-01
One major goal of genetic research is to understand the role of genetic variation in living systems. In humans, by far the most common type of such variation involves differences in single DNA nucleotides, and is thus termed single nucleotide polymorphism (SNP). The need for improvement in throughput and reliability of traditional techniques makes it necessary to develop new technologies. Thus the past few years have witnessed an extraordinary surge of interest in DNA microarray technology. This new technology offers the first great hope for providing a systematic way to explore the genome. It permits a very rapid analysis of thousands genes for the purpose of gene discovery, sequencing, mapping, expression, and polymorphism detection. We generated a series of analytical tools to address the manufacturing, detection and data analysis components of a microarray experiment. In particular, we set up a universal array approach in combination with a PCR-LDR (polymerase chain reaction-ligation detection reaction) strategy for allele identification in the HLA gene.
High Frequency of Copy-Neutral Loss of Heterozygosity in Patients with Myelofibrosis.
Rego de Paula Junior, Milton; Nonino, Alexandre; Minuncio Nascimento, Juliana; Bonadio, Raphael S; Pic-Taylor, Aline; de Oliveira, Silviene F; Wellerson Pereira, Rinaldo; do Couto Mascarenhas, Cintia; Forte Mazzeu, Juliana
2018-01-01
Myelofibrosis is the rarest and most severe type of Philadelphia-negative classical myeloproliferative neoplasms. Although mutually exclusive driver mutations in JAK2, MPL, or CALR that activate JAK-STAT pathway have been related to the pathogenesis of the disease, chromosome abnormalities have also been associated with the phenotype and prognosis of the disease. Here, we report the use of a chromosomal microarray platform consisting of both oligo and SNP probes to improve the detection of chromosome abnormalities in patients with myelofibrosis. Sixteen patients with myelofibrosis were tested, and the results were compared to karyotype analysis. Driver mutations in JAK2, MPL, or CALR were investigated by PCR and MLPA. Conventional cytogenetics revealed chromosome abnormalities in 3 out of 16 cases (18.7%), while chromosomal microarray analysis detected copy-number variations (CNV) or copy-neutral loss of heterozygosity (CN-LOH) alterations in 11 out of 16 (68.7%) patients. These included 43 CN-LOH, 14 deletions, 1 trisomy, and 1 duplication. Ten patients showed multiple chromosomal abnormalities, varying from 2 to 13 CNVs or CN-LOHs. Mutational status for JAK2, CALR, and MPL by MLPA revealed a total of 3/16 (18.7%) patients positive for the JAK2 V617F mutation, 9 with CALR deletion or insertion and 1 positive for MPL mutation. Considering that most of the CNVs identified were smaller than the karyotype resolution and the high frequency of CN-LOHs in our study, we propose that chromosomal microarray platforms that combine oligos and SNP should be used as a first-tier genetic test in patients with myelofibrosis. © 2018 S. Karger AG, Basel.
SNP discovery and genotyping using Genotyping-by-Sequencing in Pekin ducks.
Zhu, Feng; Cui, Qian-Qian; Hou, Zhuo-Cheng
2016-11-15
Genomic selection and genome-wide association studies need thousands to millions of SNPs. However, many non-model species do not have reference chips for detecting variation. Our goal was to develop and validate an inexpensive but effective method for detecting SNP variation. Genotyping by sequencing (GBS) can be a highly efficient strategy for genome-wide SNP detection, as an alternative to microarray chips. Here, we developed a GBS protocol for ducks and tested it to genotype 49 Pekin ducks. A total of 169,209 SNPs were identified from all animals, with a mean of 55,920 SNPs per individual. The average SNP density reached 1156 SNPs/MB. In this study, the first application of GBS to ducks, we demonstrate the power and simplicity of this method. GBS can be used for genetic studies in to provide an effective method for genome-wide SNP discovery.
Single-feature polymorphism discovery in the barley transcriptome
Rostoks, Nils; Borevitz, Justin O; Hedley, Peter E; Russell, Joanne; Mudie, Sharon; Morris, Jenny; Cardle, Linda; Marshall, David F; Waugh, Robbie
2005-01-01
A probe-level model for analysis of GeneChip gene-expression data is presented which identified more than 10,000 single-feature polymorphisms (SFP) between two barley genotypes. The method has good sensitivity, as 67% of known single-nucleotide polymorphisms (SNP) were called as SFPs. This method is applicable to all oligonucleotide microarray data, accounts for SNP effects in gene-expression data and represents an efficient and versatile approach for highly parallel marker identification in large genomes. PMID:15960806
Patel, Isha R.; Gangiredla, Jayanthi; Lacher, David W.; Mammel, Mark K.; Jackson, Scott A.; Lampel, Keith A.
2016-01-01
ABSTRACT Most Escherichia coli strains are nonpathogenic. However, for clinical diagnosis and food safety analysis, current identification methods for pathogenic E. coli either are time-consuming and/or provide limited information. Here, we utilized a custom DNA microarray with informative genetic features extracted from 368 sequence sets for rapid and high-throughput pathogen identification. The FDA Escherichia coli Identification (FDA-ECID) platform contains three sets of molecularly informative features that together stratify strain identification and relatedness. First, 53 known flagellin alleles, 103 alleles of wzx and wzy, and 5 alleles of wzm provide molecular serotyping utility. Second, 41,932 probe sets representing the pan-genome of E. coli provide strain-level gene content information. Third, approximately 125,000 single nucleotide polymorphisms (SNPs) of available whole-genome sequences (WGS) were distilled to 9,984 SNPs capable of recapitulating the E. coli phylogeny. We analyzed 103 diverse E. coli strains with available WGS data, including those associated with past foodborne illnesses, to determine robustness and accuracy. The array was able to accurately identify the molecular O and H serotypes, potentially correcting serological failures and providing better resolution for H-nontypeable/nonmotile phenotypes. In addition, molecular risk assessment was possible with key virulence marker identifications. Epidemiologically, each strain had a unique comparative genomic fingerprint that was extended to an additional 507 food and clinical isolates. Finally, a 99.7% phylogenetic concordance was established between microarray analysis and WGS using SNP-level data for advanced genome typing. Our study demonstrates FDA-ECID as a powerful tool for epidemiology and molecular risk assessment with the capacity to profile the global landscape and diversity of E. coli. IMPORTANCE This study describes a robust, state-of-the-art platform developed from available whole-genome sequences of E. coli and Shigella spp. by distilling useful signatures for epidemiology and molecular risk assessment into one assay. The FDA-ECID microarray contains features that enable comprehensive molecular serotyping and virulence profiling along with genome-scale genotyping and SNP analysis. Hence, it is a molecular toolbox that stratifies strain identification and pathogenic potential in the contexts of epidemiology and phylogeny. We applied this tool to strains from food, environmental, and clinical sources, resulting in significantly greater phylogenetic and strain-specific resolution than previously reported for available typing methods. PMID:27037122
Helm, Benjamin M; Langley, Katherine; Spangler, Brooke; Vergano, Samantha
2014-08-01
Single nucleotide polymorphism microarrays have the ability to reveal parental consanguinity which may or may not be known to healthcare providers. Consanguinity can have significant implications for the health of patients and for individual and family psychosocial well-being. These results often present ethical and legal dilemmas that can have important ramifications. Unexpected consanguinity can be confounding to healthcare professionals who may be unprepared to handle these results or to communicate them to families or other appropriate representatives. There are few published accounts of experiences with consanguinity and SNP arrays. In this paper we discuss three cases where molecular evidence of parental incest was identified by SNP microarray. We hope to further highlight consanguinity as a potential incidental finding, how the cases were handled by the clinical team, and what resources were found to be most helpful. This paper aims to contribute further to professional discourse on incidental findings with genomic technology and how they were addressed clinically. These experiences may provide some guidance on how others can prepare for these findings and help improve practice. As genetic and genomic testing is utilized more by non-genetics providers, we also hope to inform about the importance of engaging with geneticists and genetic counselors when addressing these findings.
Laios, Eleftheria; Drogari, Euridiki
2006-12-01
Three mutations in the low density lipoprotein receptor (LDLR) gene account for 49% of familial hypercholesterolemia (FH) cases in Greece. We used the microelectronic array technology of the NanoChip Molecular Biology Workstation to develop a multiplex method to analyze these single-nucleotide polymorphisms (SNPs). Primer pairs amplified the region encompassing each SNP. The biotinylated PCR amplicon was electronically addressed to streptavidin-coated microarray sites. Allele-specific fluorescently labeled oligonucleotide reporters were designed and used for detection of wild-type and SNP sequences. Genotypes were compared to PCR-restriction fragment length polymorphism (PCR-RFLP). We developed three monoplex assays (1 SNP/site) and an optimized multiplex assay (3SNPs/site). We performed 92 Greece II, 100 Genoa, and 98 Afrikaner-2 NanoChip monoplex assays (addressed to duplicate sites and analyzed separately). Of the 580 monoplex genotypings (290 samples), 579 agreed with RFLP. Duplicate sites of one sample were not in agreement with each other. Of the 580 multiplex genotypings, 576 agreed with the monoplex results. Duplicate sites of three samples were not in agreement with each other, indicating requirement for repetition upon which discrepancies were resolved. The multiplex assay detects common LDLR mutations in Greek FH patients and can be extended to accommodate additional mutations.
Bisignano, A; Wells, D; Harton, G; Munné, S
2011-12-01
Diagnosis of embryos for chromosome abnormalities, i.e. aneuploidy screening, has been invigorated by the introduction of microarray-based testing methods allowing analysis of 24 chromosomes in one test. Recent data have been suggestive of increased implantation and pregnancy rates following microarray testing. Preimplantation genetic diagnosis for infertility aims to test for gross chromosome changes with the hope that identification and transfer of normal embryos will improve IVF outcomes. Testing by some methods, specifically single-nucleotide polymorphism (SNP) microarrays, allow for more information and potential insight into parental origin of aneuploidy and uniparental disomy. The usefulness and validity of reporting this information is flawed. Numerous papers have shown that the majority of meiotic errors occur in the egg, while mitotic errors in the embryo affect parental chromosomes at random. Potential mistakes made in assigning an error as meiotic or mitotic may lead to erroneous reporting of results with medical consequences. This study's data suggest that the bioinformatic cleaning used to 'fix' the miscalls that plague single-cell whole-genome amplification provides little improvement in the quality of useful data. Based on the information available, SNP-based aneuploidy screening suffers from a number of serious issues that must be resolved. Copyright © 2011 Reproductive Healthcare Ltd. Published by Elsevier Ltd. All rights reserved.
KinSNP software for homozygosity mapping of disease genes using SNP microarrays
2010-01-01
Consanguineous families affected with a recessive genetic disease caused by homozygotisation of a mutation offer a unique advantage for positional cloning of rare diseases. Homozygosity mapping of patient genotypes is a powerful technique for the identification of the genomic locus harbouring the causing mutation. This strategy relies on the observation that in these patients a large region spanning the disease locus is also homozygous with high probability. The high marker density in single nucleotide polymorphism (SNP) arrays is extremely advantageous for homozygosity mapping. We present KinSNP, a user-friendly software tool for homozygosity mapping using SNP arrays. The software searches for stretches of SNPs which are homozygous to the same allele in all ascertained sick individuals. User-specified parameters control the number of allowed genotyping 'errors' within homozygous blocks. Candidate disease regions are then reported in a detailed, coloured Excel file, along with genotypes of family members and healthy controls. An interactive genome browser has been included which shows homozygous blocks, individual genotypes, genes and further annotations along the chromosomes, with zooming and scrolling capabilities. The software has been used to identify the location of a mutated gene causing insensitivity to pain in a large Bedouin family. KinSNP is freely available from http://bioinfo.bgu.ac.il/bsu/software/kinSNP. PMID:20846928
He, Xianmin; Wei, Qing; Sun, Meiqian; Fu, Xuping; Fan, Sichang; Li, Yao
2006-05-01
Biological techniques such as Array-Comparative genomic hybridization (CGH), fluorescent in situ hybridization (FISH) and affymetrix single nucleotide pleomorphism (SNP) array have been used to detect cytogenetic aberrations. However, on genomic scale, these techniques are labor intensive and time consuming. Comparative genomic microarray analysis (CGMA) has been used to identify cytogenetic changes in hepatocellular carcinoma (HCC) using gene expression microarray data. However, CGMA algorithm can not give precise localization of aberrations, fails to identify small cytogenetic changes, and exhibits false negatives and positives. Locally un-weighted smoothing cytogenetic aberrations prediction (LS-CAP) based on local smoothing and binomial distribution can be expected to address these problems. LS-CAP algorithm was built and used on HCC microarray profiles. Eighteen cytogenetic abnormalities were identified, among them 5 were reported previously, and 12 were proven by CGH studies. LS-CAP effectively reduced the false negatives and positives, and precisely located small fragments with cytogenetic aberrations.
Hoffmann, Thomas J; Zhan, Yiping; Kvale, Mark N; Hesselson, Stephanie E; Gollub, Jeremy; Iribarren, Carlos; Lu, Yontao; Mei, Gangwu; Purdy, Matthew M; Quesenberry, Charles; Rowell, Sarah; Shapero, Michael H; Smethurst, David; Somkin, Carol P; Van den Eeden, Stephen K; Walter, Larry; Webster, Teresa; Whitmer, Rachel A; Finn, Andrea; Schaefer, Catherine; Kwok, Pui-Yan; Risch, Neil
2011-12-01
Four custom Axiom genotyping arrays were designed for a genome-wide association (GWA) study of 100,000 participants from the Kaiser Permanente Research Program on Genes, Environment and Health. The array optimized for individuals of European race/ethnicity was previously described. Here we detail the development of three additional microarrays optimized for individuals of East Asian, African American, and Latino race/ethnicity. For these arrays, we decreased redundancy of high-performing SNPs to increase SNP capacity. The East Asian array was designed using greedy pairwise SNP selection. However, removing SNPs from the target set based on imputation coverage is more efficient than pairwise tagging. Therefore, we developed a novel hybrid SNP selection method for the African American and Latino arrays utilizing rounds of greedy pairwise SNP selection, followed by removal from the target set of SNPs covered by imputation. The arrays provide excellent genome-wide coverage and are valuable additions for large-scale GWA studies. Copyright © 2011 Elsevier Inc. All rights reserved.
2011-01-01
Background Copy number aberrations (CNAs) are an important molecular signature in cancer initiation, development, and progression. However, these aberrations span a wide range of chromosomes, making it hard to distinguish cancer related genes from other genes that are not closely related to cancer but are located in broadly aberrant regions. With the current availability of high-resolution data sets such as single nucleotide polymorphism (SNP) microarrays, it has become an important issue to develop a computational method to detect driving genes related to cancer development located in the focal regions of CNAs. Results In this study, we introduce a novel method referred to as the wavelet-based identification of focal genomic aberrations (WIFA). The use of the wavelet analysis, because it is a multi-resolution approach, makes it possible to effectively identify focal genomic aberrations in broadly aberrant regions. The proposed method integrates multiple cancer samples so that it enables the detection of the consistent aberrations across multiple samples. We then apply this method to glioblastoma multiforme and lung cancer data sets from the SNP microarray platform. Through this process, we confirm the ability to detect previously known cancer related genes from both cancer types with high accuracy. Also, the application of this approach to a lung cancer data set identifies focal amplification regions that contain known oncogenes, though these regions are not reported using a recent CNAs detecting algorithm GISTIC: SMAD7 (chr18q21.1) and FGF10 (chr5p12). Conclusions Our results suggest that WIFA can be used to reveal cancer related genes in various cancer data sets. PMID:21569311
Micro-Analyzer: automatic preprocessing of Affymetrix microarray data.
Guzzi, Pietro Hiram; Cannataro, Mario
2013-08-01
A current trend in genomics is the investigation of the cell mechanism using different technologies, in order to explain the relationship among genes, molecular processes and diseases. For instance, the combined use of gene-expression arrays and genomic arrays has been demonstrated as an effective instrument in clinical practice. Consequently, in a single experiment different kind of microarrays may be used, resulting in the production of different types of binary data (images and textual raw data). The analysis of microarray data requires an initial preprocessing phase, that makes raw data suitable for use on existing analysis platforms, such as the TIGR M4 (TM4) Suite. An additional challenge to be faced by emerging data analysis platforms is the ability to treat in a combined way those different microarray formats coupled with clinical data. In fact, resulting integrated data may include both numerical and symbolic data (e.g. gene expression and SNPs regarding molecular data), as well as temporal data (e.g. the response to a drug, time to progression and survival rate), regarding clinical data. Raw data preprocessing is a crucial step in analysis but is often performed in a manual and error prone way using different software tools. Thus novel, platform independent, and possibly open source tools enabling the semi-automatic preprocessing and annotation of different microarray data are needed. The paper presents Micro-Analyzer (Microarray Analyzer), a cross-platform tool for the automatic normalization, summarization and annotation of Affymetrix gene expression and SNP binary data. It represents the evolution of the μ-CS tool, extending the preprocessing to SNP arrays that were not allowed in μ-CS. The Micro-Analyzer is provided as a Java standalone tool and enables users to read, preprocess and analyse binary microarray data (gene expression and SNPs) by invoking TM4 platform. It avoids: (i) the manual invocation of external tools (e.g. the Affymetrix Power Tools), (ii) the manual loading of preprocessing libraries, and (iii) the management of intermediate files, such as results and metadata. Micro-Analyzer users can directly manage Affymetrix binary data without worrying about locating and invoking the proper preprocessing tools and chip-specific libraries. Moreover, users of the Micro-Analyzer tool can load the preprocessed data directly into the well-known TM4 platform, extending in such a way also the TM4 capabilities. Consequently, Micro Analyzer offers the following advantages: (i) it reduces possible errors in the preprocessing and further analysis phases, e.g. due to the incorrect choice of parameters or due to the use of old libraries, (ii) it enables the combined and centralized pre-processing of different arrays, (iii) it may enhance the quality of further analysis by storing the workflow, i.e. information about the preprocessing steps, and (iv) finally Micro-Analzyer is freely available as a standalone application at the project web site http://sourceforge.net/projects/microanalyzer/. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
Population-genetic nature of copy number variations in the human genome.
Kato, Mamoru; Kawaguchi, Takahisa; Ishikawa, Shumpei; Umeda, Takayoshi; Nakamichi, Reiichiro; Shapero, Michael H; Jones, Keith W; Nakamura, Yusuke; Aburatani, Hiroyuki; Tsunoda, Tatsuhiko
2010-03-01
Copy number variations (CNVs) are universal genetic variations, and their association with disease has been increasingly recognized. We designed high-density microarrays for CNVs, and detected 3000-4000 CNVs (4-6% of the genomic sequence) per population that included CNVs previously missed because of smaller sizes and residing in segmental duplications. The patterns of CNVs across individuals were surprisingly simple at the kilo-base scale, suggesting the applicability of a simple genetic analysis for these genetic loci. We utilized the probabilistic theory to determine integer copy numbers of CNVs and employed a recently developed phasing tool to estimate the population frequencies of integer copy number alleles and CNV-SNP haplotypes. The results showed a tendency toward a lower frequency of CNV alleles and that most of our CNVs were explained only by zero-, one- and two-copy alleles. Using the estimated population frequencies, we found several CNV regions with exceptionally high population differentiation. Investigation of CNV-SNP linkage disequilibrium (LD) for 500-900 bi- and multi-allelic CNVs per population revealed that previous conflicting reports on bi-allelic LD were unexpectedly consistent and explained by an LD increase correlated with deletion-allele frequencies. Typically, the bi-allelic LD was lower than SNP-SNP LD, whereas the multi-allelic LD was somewhat stronger than the bi-allelic LD. After further investigation of tag SNPs for CNVs, we conclude that the customary tagging strategy for disease association studies can be applicable for common deletion CNVs, but direct interrogation is needed for other types of CNVs.
Shalia, Kavita; Saranath, Dhananjaya; Rayar, Jaipreet; Shah, Vinod K.; Mashru, Manoj R.; Soneji, Surendra L.
2017-01-01
Background & objectives: Acute myocardial infarction (AMI) is a major health concern in India. The aim of the study was to identify single nucleotide polymorphisms (SNPs) associated with AMI in patients using dedicated chip and validating the identified SNPs on custom-designed chips using high-throughput microarray analysis. Methods: In pilot phase, 48 AMI patients and 48 healthy controls were screened for SNPs using human CVD55K BeadChip with 48,472 SNP probes on Illumina high-throughput microarray platform. The identified SNPs were validated by genotyping additional 160 patients and 179 controls using custom-made Illumina VeraCode GoldenGate Genotyping Assay. Analysis was carried out using PLINK software. Results: From the pilot phase, 98 SNPs present on 94 genes were identified with increased risk of AMI (odds ratio of 1.84-8.85, P=0.04861-0.003337). Five of these SNPs demonstrated association with AMI in the validation phase (P<0.05). Among these, one SNP rs9978223 on interferon gamma receptor 2 [IFNGR2, interferon (IFN)-gamma transducer 1] gene showed a significant association (P=0.00021) with AMI below Bonferroni corrected P value (P=0.00061). IFNGR2 is the second subunit of the receptor for IFN-gamma, an important cytokine in inflammatory reactions. Interpretation & conclusions: The study identified an SNP rs9978223 on IFNGR2 gene, associated with increased risk in AMI patient from India. PMID:29434065
Fine-scaled human genetic structure revealed by SNP microarrays.
Xing, Jinchuan; Watkins, W Scott; Witherspoon, David J; Zhang, Yuhua; Guthery, Stephen L; Thara, Rangaswamy; Mowry, Bryan J; Bulayeva, Kazima; Weiss, Robert B; Jorde, Lynn B
2009-05-01
We report an analysis of more than 240,000 loci genotyped using the Affymetrix SNP microarray in 554 individuals from 27 worldwide populations in Africa, Asia, and Europe. To provide a more extensive and complete sampling of human genetic variation, we have included caste and tribal samples from two states in South India, Daghestanis from eastern Europe, and the Iban from Malaysia. Consistent with observations made by Charles Darwin, our results highlight shared variation among human populations and demonstrate that much genetic variation is geographically continuous. At the same time, principal components analyses reveal discernible genetic differentiation among almost all identified populations in our sample, and in most cases, individuals can be clearly assigned to defined populations on the basis of SNP genotypes. All individuals are accurately classified into continental groups using a model-based clustering algorithm, but between closely related populations, genetic and self-classifications conflict for some individuals. The 250K data permitted high-level resolution of genetic variation among Indian caste and tribal populations and between highland and lowland Daghestani populations. In particular, upper-caste individuals from Tamil Nadu and Andhra Pradesh form one defined group, lower-caste individuals from these two states form another, and the tribal Irula samples form a third. Our results emphasize the correlation of genetic and geographic distances and highlight other elements, including social factors that have contributed to population structure.
[Genetic analysis of two cases with Dandy-Walker deformed fetus].
Yao, Juan; Fang, Rong; Shen, Xueping; Shen, Guosong; Zhang, Su
2017-10-10
To explore the genetic etiology of two fetuses with Dandy-Walker malformation using single nucleotide polymorphism microarray (SNP-array). The fetuses and their parents were subjected to G banding karyotype analysis. The fetuses were also subjected to SNP-array analysis. The parents of both fetuses showed a normal karyotype. One fetus has a 46,X,?i(X)(q10), while for another conventional cell culture has failed. SNP-array showed that one fetus carried a 6p25.3p25.2 microdeletion, and another carried a Xp22.33p22.2 deletion and a Yq11.221q11 duplication. The abnormal fragments have involved FOXC1, SHOX and STS genes, which are associated with Dandy-Walker malformation. Alteration of 6p25.3p25.2, Xp22.33p22.2 copy numbers probably underlies the Dandy-Walker syndrome in the fetuses. The disorder may be attributed to abnormal expression of FOXC1, SHOX, and STS genes. SNP-array can provide an important supplement for prenatal diagnosis.
Genetic Mapping and Exome Sequencing Identify Variants Associated with Five Novel Diseases
Puffenberger, Erik G.; Jinks, Robert N.; Sougnez, Carrie; Cibulskis, Kristian; Willert, Rebecca A.; Achilly, Nathan P.; Cassidy, Ryan P.; Fiorentini, Christopher J.; Heiken, Kory F.; Lawrence, Johnny J.; Mahoney, Molly H.; Miller, Christopher J.; Nair, Devika T.; Politi, Kristin A.; Worcester, Kimberly N.; Setton, Roni A.; DiPiazza, Rosa; Sherman, Eric A.; Eastman, James T.; Francklyn, Christopher; Robey-Bond, Susan; Rider, Nicholas L.; Gabriel, Stacey; Morton, D. Holmes; Strauss, Kevin A.
2012-01-01
The Clinic for Special Children (CSC) has integrated biochemical and molecular methods into a rural pediatric practice serving Old Order Amish and Mennonite (Plain) children. Among the Plain people, we have used single nucleotide polymorphism (SNP) microarrays to genetically map recessive disorders to large autozygous haplotype blocks (mean = 4.4 Mb) that contain many genes (mean = 79). For some, uninformative mapping or large gene lists preclude disease-gene identification by Sanger sequencing. Seven such conditions were selected for exome sequencing at the Broad Institute; all had been previously mapped at the CSC using low density SNP microarrays coupled with autozygosity and linkage analyses. Using between 1 and 5 patient samples per disorder, we identified sequence variants in the known disease-causing genes SLC6A3 and FLVCR1, and present evidence to strongly support the pathogenicity of variants identified in TUBGCP6, BRAT1, SNIP1, CRADD, and HARS. Our results reveal the power of coupling new genotyping technologies to population-specific genetic knowledge and robust clinical data. PMID:22279524
KinSNP software for homozygosity mapping of disease genes using SNP microarrays.
Amir, El-Ad David; Bartal, Ofer; Morad, Efrat; Nagar, Tal; Sheynin, Jony; Parvari, Ruti; Chalifa-Caspi, Vered
2010-08-01
Consanguineous families affected with a recessive genetic disease caused by homozygotisation of a mutation offer a unique advantage for positional cloning of rare diseases. Homozygosity mapping of patient genotypes is a powerful technique for the identification of the genomic locus harbouring the causing mutation. This strategy relies on the observation that in these patients a large region spanning the disease locus is also homozygous with high probability. The high marker density in single nucleotide polymorphism (SNP) arrays is extremely advantageous for homozygosity mapping. We present KinSNP, a user-friendly software tool for homozygosity mapping using SNP arrays. The software searches for stretches of SNPs which are homozygous to the same allele in all ascertained sick individuals. User-specified parameters control the number of allowed genotyping 'errors' within homozygous blocks. Candidate disease regions are then reported in a detailed, coloured Excel file, along with genotypes of family members and healthy controls. An interactive genome browser has been included which shows homozygous blocks, individual genotypes, genes and further annotations along the chromosomes, with zooming and scrolling capabilities. The software has been used to identify the location of a mutated gene causing insensitivity to pain in a large Bedouin family. KinSNP is freely available from.
Combined array CGH plus SNP genome analyses in a single assay for optimized clinical testing
Wiszniewska, Joanna; Bi, Weimin; Shaw, Chad; Stankiewicz, Pawel; Kang, Sung-Hae L; Pursley, Amber N; Lalani, Seema; Hixson, Patricia; Gambin, Tomasz; Tsai, Chun-hui; Bock, Hans-Georg; Descartes, Maria; Probst, Frank J; Scaglia, Fernando; Beaudet, Arthur L; Lupski, James R; Eng, Christine; Wai Cheung, Sau; Bacino, Carlos; Patel, Ankita
2014-01-01
In clinical diagnostics, both array comparative genomic hybridization (array CGH) and single nucleotide polymorphism (SNP) genotyping have proven to be powerful genomic technologies utilized for the evaluation of developmental delay, multiple congenital anomalies, and neuropsychiatric disorders. Differences in the ability to resolve genomic changes between these arrays may constitute an implementation challenge for clinicians: which platform (SNP vs array CGH) might best detect the underlying genetic cause for the disease in the patient? While only SNP arrays enable the detection of copy number neutral regions of absence of heterozygosity (AOH), they have limited ability to detect single-exon copy number variants (CNVs) due to the distribution of SNPs across the genome. To provide comprehensive clinical testing for both CNVs and copy-neutral AOH, we enhanced our custom-designed high-resolution oligonucleotide array that has exon-targeted coverage of 1860 genes with 60 000 SNP probes, referred to as Chromosomal Microarray Analysis – Comprehensive (CMA-COMP). Of the 3240 cases evaluated by this array, clinically significant CNVs were detected in 445 cases including 21 cases with exonic events. In addition, 162 cases (5.0%) showed at least one AOH region >10 Mb. We demonstrate that even though this array has a lower density of SNP probes than other commercially available SNP arrays, it reliably detected AOH events >10 Mb as well as exonic CNVs beyond the detection limitations of SNP genotyping. Thus, combining SNP probes and exon-targeted array CGH into one platform provides clinically useful genetic screening in an efficient manner. PMID:23695279
Miyakawa, Hiroe; Miyamoto, Toshinobu; Koh, Eitetsu; Tsujimura, Akira; Miyagawa, Yasushi; Saijo, Yasuaki; Namiki, Mikio; Sengoku, Kazuo
2012-01-01
Genetic mechanisms have been implicated as a cause of some cases of male infertility. Recently, 10 novel genes involved in human spermatogenesis, including human SEPTIN12, were identified by expression microarray analysis of human testicular tissue. Septin12 is a member of the septin family of conserved cytoskeletal GTPases that form heteropolymeric filamentous structures in interphase cells. It is expressed specifically in the testis. Therefore, we hypothesized that mutation or polymorphisms of SEPTIN12 participate in male infertility, especially Sertoli cell-only syndrome (SCOS). To investigate whether SEPTIN12 gene defects are associated with azoospermia caused by SCOS, mutational analysis was performed in 100 Japanese patients by direct sequencing of coding regions. Statistical analysis was performed in patients with SCOS and in 140 healthy control men. No mutations were found in SEPTIN12 ; however, 8 coding single-nucleotide polymorphisms (SNP1-SNP8) could be detected in the patients with SCOS. The genotype and allele frequencies in SNP3, SNP4, and SNP6 were notably higher in the SCOS group than in the control group (P < .001). These results suggest that SEPTIN12 might play a critical role in human spermatogenesis.
Mismatch and G-Stack Modulated Probe Signals on SNP Microarrays
Binder, Hans; Fasold, Mario; Glomb, Torsten
2009-01-01
Background Single nucleotide polymorphism (SNP) arrays are important tools widely used for genotyping and copy number estimation. This technology utilizes the specific affinity of fragmented DNA for binding to surface-attached oligonucleotide DNA probes. We analyze the variability of the probe signals of Affymetrix GeneChip SNP arrays as a function of the probe sequence to identify relevant sequence motifs which potentially cause systematic biases of genotyping and copy number estimates. Methodology/Principal Findings The probe design of GeneChip SNP arrays enables us to disentangle different sources of intensity modulations such as the number of mismatches per duplex, matched and mismatched base pairings including nearest and next-nearest neighbors and their position along the probe sequence. The effect of probe sequence was estimated in terms of triple-motifs with central matches and mismatches which include all 256 combinations of possible base pairings. The probe/target interactions on the chip can be decomposed into nearest neighbor contributions which correlate well with free energy terms of DNA/DNA-interactions in solution. The effect of mismatches is about twice as large as that of canonical pairings. Runs of guanines (G) and the particular type of mismatched pairings formed in cross-allelic probe/target duplexes constitute sources of systematic biases of the probe signals with consequences for genotyping and copy number estimates. The poly-G effect seems to be related to the crowded arrangement of probes which facilitates complex formation of neighboring probes with at minimum three adjacent G's in their sequence. Conclusions The applied method of “triple-averaging” represents a model-free approach to estimate the mean intensity contributions of different sequence motifs which can be applied in calibration algorithms to correct signal values for sequence effects. Rules for appropriate sequence corrections are suggested. PMID:19924253
Delaneau, Olivier; Marchini, Jonathan
2014-06-13
A major use of the 1000 Genomes Project (1000 GP) data is genotype imputation in genome-wide association studies (GWAS). Here we develop a method to estimate haplotypes from low-coverage sequencing data that can take advantage of single-nucleotide polymorphism (SNP) microarray genotypes on the same samples. First the SNP array data are phased to build a backbone (or 'scaffold') of haplotypes across each chromosome. We then phase the sequence data 'onto' this haplotype scaffold. This approach can take advantage of relatedness between sequenced and non-sequenced samples to improve accuracy. We use this method to create a new 1000 GP haplotype reference set for use by the human genetic community. Using a set of validation genotypes at SNP and bi-allelic indels we show that these haplotypes have lower genotype discordance and improved imputation performance into downstream GWAS samples, especially at low-frequency variants.
Performance evaluation of DNA copy number segmentation methods.
Pierre-Jean, Morgane; Rigaill, Guillem; Neuvial, Pierre
2015-07-01
A number of bioinformatic or biostatistical methods are available for analyzing DNA copy number profiles measured from microarray or sequencing technologies. In the absence of rich enough gold standard data sets, the performance of these methods is generally assessed using unrealistic simulation studies, or based on small real data analyses. To make an objective and reproducible performance assessment, we have designed and implemented a framework to generate realistic DNA copy number profiles of cancer samples with known truth. These profiles are generated by resampling publicly available SNP microarray data from genomic regions with known copy-number state. The original data have been extracted from dilutions series of tumor cell lines with matched blood samples at several concentrations. Therefore, the signal-to-noise ratio of the generated profiles can be controlled through the (known) percentage of tumor cells in the sample. This article describes this framework and its application to a comparison study between methods for segmenting DNA copy number profiles from SNP microarrays. This study indicates that no single method is uniformly better than all others. It also helps identifying pros and cons of the compared methods as a function of biologically informative parameters, such as the fraction of tumor cells in the sample and the proportion of heterozygous markers. This comparison study may be reproduced using the open source and cross-platform R package jointseg, which implements the proposed data generation and evaluation framework: http://r-forge.r-project.org/R/?group_id=1562. © The Author 2014. Published by Oxford University Press.
Gao, Z J; Jiang, Q; Cheng, D Z; Yan, X X; Chen, Q; Xu, K M
2016-10-02
Objective: To evaluate the application of single nucleotide polymorphism (SNP)-microarray and target gene sequencing technology in the clinical molecular genetic diagnosis of unexplained intellectual disability(ID) or developmental delay (DD). Method: Patients with ID or DD were recruited in the Department of Neurology, Affiliated Children's Hospital of Capital Institute of Pediatrics between September 2015 and February 2016. The intellectual assessment of the patients was performed using 0-6-year-old pediatric examination table of neuropsychological development or Wechsler intelligence scale (>6 years). Patients with a DQ less than 49 or IQ less than 51 were included in this study. The patients were scanned by SNP-array for detection of genomic copy number variations (CNV), and the revealed genomic imbalance was confirmed by quantitative real time-PCR. Candidate gene mutation screening was carried out by target gene sequencing technology.Causal mutations or likely pathogenic variants were verified by polymerase chain reaction and direct sequencing. Result: There were 15 children with ID or DD enrolled, 9 males and 6 females. The age of these patients was 7 months-16 years and 9 months. SNP-array revealed that two of the 15 patients had genomic CNV. Both CNV were de novo micro deletions, one involved 11q24.1q25 and the other micro deletion located on 21q22.2q22.3. Both micro deletions were proved to have a clinical significance due to their association with ID, brain DD, unusual faces etc. by querying Decipher database. Thirteen patients with negative findings in SNP-array were consequently examined with target gene sequencing technology, genotype-phenotype correlation analysis and genetic analysis. Five patients were diagnosed with monogenic disorder, two were diagnosed with suspected genetic disorder and six were still negative. Conclusion: Sequential use of SNP-array and target gene sequencing technology can significantly increase the molecular genetic etiologic diagnosis rate of the patients with unexplained ID or DD. Combined use of these technologies can serve as a useful examinational method in assisting differential diagnosis of children with unexplained ID or DD.
Venegas-Vega, Carlos A.; Zepeda, Luis M.; Garduño-Zarazúa, Luz M.; Berumen, Jaime; Kofman, Susana; Cervantes, Alicia
2013-01-01
The use of conventional cytogenetic techniques in combination with fluorescent in situ hybridization (FISH) and single-nucleotide polymorphism (SNP) microarrays is necessary for the identification of cryptic rearrangements in the diagnosis of chromosomal syndromes. We report two siblings, a boy of 9 years and 9 months of age and his 7-years- and 5-month-old sister, with the classic Wolf-Hirschhorn syndrome (WHS) phenotype. Using high-resolution GTG- and NOR-banding karyotypes, as well as FISH analysis, we characterized a pure 4p deletion in both sibs and a balanced rearrangement in their father, consisting in an insertion of 4p material within a nucleolar organizing region of chromosome 15. Copy number variant (CNV) analysis using SNP arrays showed that both siblings have a similar size of 4p deletion (~6.5 Mb). Our results strongly support the need for conventional cytogenetic and FISH analysis, as well as high-density microarray mapping for the optimal characterization of the genetic imbalance in patients with WHS; parents must always be studied for recognizing cryptic balanced chromosomal rearrangements for an adequate genetic counseling. PMID:23484094
Xie, Yanxin; Xu, Yanwen; Wang, Jing; Miao, Benyu; Zeng, Yanhong; Ding, Chenhui; Gao, Jun; Zhou, Canquan
2018-01-01
The aim of this study was to determine whether an interchromosomal effect (ICE) occurred in embryos obtained from reciprocal translocation (rcp) and Robertsonian translocation (RT) carriers who were following a preimplantation genetic diagnosis (PGD) with whole chromosome screening with an aCGH and SNP microarray. We also analyzed the chromosomal numerical abnormalities in embryos with aneuploidy in parental chromosomes that were not involved with a translocation and balanced in involved parental translocation chromosomes. This retrospective study included 832 embryos obtained from rcp carriers and 382 embryos from RT carriers that were biopsied in 139 PGD cycles. The control group involved embryos obtained from age-matched patient karyotypes who were undergoing preimplantation genetic screening (PGS) with non-translocation, and 579 embryos were analyzed in the control group. A single blastomere at the cleavage stage or trophectoderm from a blastocyst was biopsied, and 24-chromosomal analysis with an aCGH/SNP microarray was conducted using the PGD/PGS protocols. Statistical analyses were implemented on the incidences of cumulative aneuploidy rates between the translocation carriers and the control group. Reliable results were obtained from 138 couples, among whom only one patient was a balanced rcp or RT translocation carrier, undergoing PGD testing in our center from January 2012 to June 2014. For day 3 embryos, the aneuploidy rates were 50.7% for rcp carriers and 49.1% for RT carriers, compared with the control group, with 44.8% at a maternal age < 36 years. When the maternal age was ≥ 36 years, the aneuploidy rates were increased to 61.1% for rcp carriers, 56.7% for RT carriers, and 60.3% for the control group. There were no significant differences. In day 5 embryos, the aneuploidy rates were 24.5% for rcp carriers and 34.9% for RT carriers, compared with the control group with 53.6% at a maternal age < 36 years. When the maternal age was ≥ 36 years, the aneuploidy rates were 10.7% for rcp carriers, 26.3% for RT carriers, and 57.1% for the control group. The cumulative aneuploidy rates of chromosome translocation carriers were significantly lower than the control group. No ICE was observed in cleavage and blastocyst stage embryos obtained from these carriers. Additionally, the risk of chromosomal numerical abnormalities was observed in each of the 23 pairs of autosomes or sex chromosomes from day 3 and day 5 embryos. There was not enough evidence to prove that ICE was present in embryos derived from both rcp and RT translocation carriers, regardless of the maternal age. However, chromosomal numerical abnormalities were noticed in 23 pairs of autosomes and sex chromosomes in parental structurally normal chromosomes. Thus, 24-chromosomal analysis with an aCGH/SNP microarray PGD protocol is required to decrease the risks of failure to diagnose aneuploidy in structurally normal chromosomes.
Fully automated analysis of multi-resolution four-channel micro-array genotyping data
NASA Astrophysics Data System (ADS)
Abbaspour, Mohsen; Abugharbieh, Rafeef; Podder, Mohua; Tebbutt, Scott J.
2006-03-01
We present a fully-automated and robust microarray image analysis system for handling multi-resolution images (down to 3-micron with sizes up to 80 MBs per channel). The system is developed to provide rapid and accurate data extraction for our recently developed microarray analysis and quality control tool (SNP Chart). Currently available commercial microarray image analysis applications are inefficient, due to the considerable user interaction typically required. Four-channel DNA microarray technology is a robust and accurate tool for determining genotypes of multiple genetic markers in individuals. It plays an important role in the state of the art trend where traditional medical treatments are to be replaced by personalized genetic medicine, i.e. individualized therapy based on the patient's genetic heritage. However, fast, robust, and precise image processing tools are required for the prospective practical use of microarray-based genetic testing for predicting disease susceptibilities and drug effects in clinical practice, which require a turn-around timeline compatible with clinical decision-making. In this paper we have developed a fully-automated image analysis platform for the rapid investigation of hundreds of genetic variations across multiple genes. Validation tests indicate very high accuracy levels for genotyping results. Our method achieves a significant reduction in analysis time, from several hours to just a few minutes, and is completely automated requiring no manual interaction or guidance.
SNP-Based Typing: A Useful Tool to Study Bordetella pertussis Populations
van der Heide, Han G. J.; Heuvelman, Kees J.; Kallonen, Teemu; He, Qiushui; Mertsola, Jussi; Advani, Abdolreza; Hallander, Hans O.; Janssens, Koen; Hermans, Peter W.; Mooi, Frits R.
2011-01-01
To monitor changes in Bordetella pertussis populations, mainly two typing methods are used; Pulsed-Field Gel Electrophoresis (PFGE) and Multiple-Locus Variable-Number Tandem Repeat Analysis (MLVA). In this study, a single nucleotide polymorphism (SNP) typing method, based on 87 SNPs, was developed and compared with PFGE and MLVA. The discriminatory indices of SNP typing, PFGE and MLVA were found to be 0.85, 0.95 and 0.83, respectively. Phylogenetic analysis, using SNP typing as Gold Standard, revealed false homoplasies in the PFGE and MLVA trees. Further, in contrast to the SNP-based tree, the PFGE- and MLVA-based trees did not reveal a positive correlation between root-to-tip distance and the isolation year of strains. Thus PFGE and MLVA do not allow an estimation of the relative age of the selected strains. In conclusion, SNP typing was found to be phylogenetically more informative than PFGE and more discriminative than MLVA. Further, in contrast to PFGE, it is readily standardized allowing interlaboratory comparisons. We applied SNP typing to study strains with a novel allele for the pertussis toxin promoter, ptxP3, which have a worldwide distribution and which have replaced the resident ptxP1 strains in the last 20 years. Previously, we showed that ptxP3 strains showed increased pertussis toxin expression and that their emergence was associated with increased notification in the Netherlands. SNP typing showed that the ptxP3 strains isolated in the Americas, Asia, Australia and Europe formed a monophyletic branch which recently diverged from ptxP1 strains. Two predominant ptxP3 SNP types were identified which spread worldwide. The widespread use of SNP typing will enhance our understanding of the evolution and global epidemiology of B. pertussis. PMID:21647370
A de novo Mutation in KMT2A (MLL) in monozygotic twins with Wiedemann-Steiner syndrome.
Dunkerton, Sophie; Field, Matthew; Cho, Vicki; Bertram, Edward; Whittle, Belinda; Groves, Alexandra; Goel, Himanshu
2015-09-01
Growth deficiency, psychomotor delay, and facial dysmorphism was originally described in a male patient in 1989 by Wiedemann et al. and later in 2000 by Steiner et al. Wiedemann-Steiner syndrome (WSS) has since been described only a few times in the literature, with the phenotypic spectrum both expanding and becoming more delineated with each patient reported. We report on the clinical and molecular features of monozygotic twins with a de novo mutation in KMT2A. Single nucleotide polymorphism (SNP) microarray was done on both twins and whole-exome sequencing was done using both parents and one of the affected twins. SNP microarray confirmed that they were monozygotic twins. A de novo heterozygous variant (p. Arg1083*) in the KMT2A gene was identified through whole-exome sequencing, confirming the diagnosis of WSS. In this study, we have identified a de novo mutation in KMT2A associated with psychomotor developmental delay, facial dysmorphism, short stature, hypertrichosis cubiti, and small kidneys. This finding in monozygotic twins gives specificity to the WSS. The description of more cases of WSS is needed for further delineation of this condition. Small kidneys with normal function have not been described in this condition in the medical literature before. © 2015 Wiley Periodicals, Inc.
Martin, Maureen V; Rollins, Brandi; Sequeira, P Adolfo; Mesén, Andrea; Byerley, William; Stein, Richard; Moon, Emily A; Akil, Huda; Jones, Edward G; Watson, Stanley J; Barchas, Jack; DeLisi, Lynn E; Myers, Richard M; Schatzberg, Alan; Bunney, William E; Vawter, Marquis P
2009-01-01
Background The purpose of this study was to examine the effects of glucose reduction stress on lymphoblastic cell line (LCL) gene expression in subjects with schizophrenia compared to non-psychotic relatives. Methods LCLs were grown under two glucose conditions to measure the effects of glucose reduction stress on exon expression in subjects with schizophrenia compared to unaffected family member controls. A second aim of this project was to identify cis-regulated transcripts associated with diagnosis. Results There were a total of 122 transcripts with significant diagnosis by probeset interaction effects and 328 transcripts with glucose deprivation by probeset interaction probeset effects after corrections for multiple comparisons. There were 8 transcripts with expression significantly affected by the interaction between diagnosis and glucose deprivation and probeset after correction for multiple comparisons. The overall validation rate by qPCR of 13 diagnosis effect genes identified through microarray was 62%, and all genes tested by qPCR showed concordant up- or down-regulation by qPCR and microarray. We assessed brain gene expression of five genes found to be altered by diagnosis and glucose deprivation in LCLs and found a significant decrease in expression of one gene, glutaminase, in the dorsolateral prefrontal cortex (DLPFC). One SNP with previously identified regulation by a 3' UTR SNP was found to influence IRF5 expression in both brain and lymphocytes. The relationship between the 3' UTR rs10954213 genotype and IRF5 expression was significant in LCLs (p = 0.0001), DLPFC (p = 0.007), and anterior cingulate cortex (p = 0.002). Conclusion Experimental manipulation of cells lines from subjects with schizophrenia may be a useful approach to explore stress related gene expression alterations in schizophrenia and to identify SNP variants associated with gene expression. PMID:19772658
UPD detection using homozygosity profiling with a SNP genotyping microarray.
Papenhausen, Peter; Schwartz, Stuart; Risheg, Hiba; Keitges, Elisabeth; Gadi, Inder; Burnside, Rachel D; Jaswaney, Vikram; Pappas, John; Pasion, Romela; Friedman, Kenneth; Tepperberg, James
2011-04-01
Single nucleotide polymorphism (SNP) based chromosome microarrays provide both a high-density whole genome analysis of copy number and genotype. In the past 21 months we have analyzed over 13,000 samples primarily referred for developmental delay using the Affymetrix SNP/CN 6.0 version array platform. In addition to copy number, we have focused on the relative distribution of allele homozygosity (HZ) throughout the genome to confirm a strong association of uniparental disomy (UPD) with regions of isoallelism found in most confirmed cases of UPD. We sought to determine whether a long contiguous stretch of HZ (LCSH) greater than a threshold value found only in a single chromosome would correlate with UPD of that chromosome. Nine confirmed UPD cases were retrospectively analyzed with the array in the study, each showing the anticipated LCSH with the smallest 13.5 Mb in length. This length is well above the average longest run of HZ in a set of control patients and was then set as the prospective threshold for reporting possible UPD correlation. Ninety-two cases qualified at that threshold, 46 of those had molecular UPD testing and 29 were positive. Including retrospective cases, 16 showed complete HZ across the chromosome, consistent with total isoUPD. The average size LCSH in the 19 cases that were not completely HZ was 46.3 Mb with a range of 13.5-127.8 Mb. Three patients showed only segmental UPD. Both the size and location of the LCSH are relevant to correlation with UPD. Further studies will continue to delineate an optimal threshold for LCSH/UPD correlation. Copyright © 2011 Wiley-Liss, Inc.
Hidden Markov Model-Based CNV Detection Algorithms for Illumina Genotyping Microarrays.
Seiser, Eric L; Innocenti, Federico
2014-01-01
Somatic alterations in DNA copy number have been well studied in numerous malignancies, yet the role of germline DNA copy number variation in cancer is still emerging. Genotyping microarrays generate allele-specific signal intensities to determine genotype, but may also be used to infer DNA copy number using additional computational approaches. Numerous tools have been developed to analyze Illumina genotype microarray data for copy number variant (CNV) discovery, although commonly utilized algorithms freely available to the public employ approaches based upon the use of hidden Markov models (HMMs). QuantiSNP, PennCNV, and GenoCN utilize HMMs with six copy number states but vary in how transition and emission probabilities are calculated. Performance of these CNV detection algorithms has been shown to be variable between both genotyping platforms and data sets, although HMM approaches generally outperform other current methods. Low sensitivity is prevalent with HMM-based algorithms, suggesting the need for continued improvement in CNV detection methodologies.
Chono, Makiko; Matsunaka, Hitoshi; Seki, Masako; Fujita, Masaya; Kiribuchi-Otobe, Chikako; Oda, Shunsuke; Kojima, Hisayo; Nakamura, Shingo
2015-03-01
In the wheat (Triticum aestivum L.) cultivar 'Zenkoujikomugi', a single nucleotide polymorphism (SNP) in the promoter of MOTHER OF FT AND TFL1 on chromosome 3A (MFT-3A) causes an increase in the level of gene expression, resulting in strong grain dormancy. We used a DNA marker to detect the 'Zenkoujikomugi'-type (Zen-type) SNP and examined the genotype of MFT-3A in Japanese wheat varieties, and we found that 169 of 324 varieties carry the Zen-type SNP. In Japanese commercial varieties, the frequency of the Zen-type SNP was remarkably high in the southern part of Japan, but low in the northern part. To examine the relationship between MFT-3A genotype and grain dormancy, we performed a germination assay in three wheat-growing seasons. On average, the varieties carrying the Zen-type SNP showed stronger grain dormancy than the varieties carrying the non-Zen-type SNP. Among commercial cultivars, 'Iwainodaichi' (Kyushu), 'Junreikomugi' (Kinki-Chugoku-Shikoku), 'Kinuhime' (Kanto-Tokai), 'Nebarigoshi' (Tohoku-Hokuriku), and 'Kitamoe' (Hokkaido) showed the strongest grain dormancy in each geographical group, and all these varieties, except for 'Kitamoe', were found to carry the Zen-type SNP. In recent years, the number of varieties carrying the Zen-type SNP has increased in the Tohoku-Hokuriku region, but not in the Hokkaido region.
Metastatic breast carcinomas display genomic and transcriptomic heterogeneity
Weigelt, Britta; Ng, Charlotte KY; Shen, Ronglai; Popova, Tatiana; Schizas, Michail; Natrajan, Rachael; Mariani, Odette; Stern, Marc-Henri; Norton, Larry; Vincent-Salomon, Anne; Reis-Filho, Jorge S
2015-01-01
Metaplastic breast carcinoma is a rare and aggressive histologic type of breast cancer, preferentially displaying a triple-negative phenotype. We sought to define the transcriptomic heterogeneity of metaplastic breast cancers on the basis of current gene expression microarray-based classifiers, and to determine whether these tumors display gene copy number profiles consistent with those of BRCA1-associated breast cancers. Twenty-eight consecutive triple-negative metaplastic breast carcinomas were reviewed, and the metaplastic component present in each frozen specimen was defined (ie, spindle cell, squamous, chondroid metaplasia). RNA and DNA extracted from frozen sections with tumor cell content >60% were subjected to gene expression (Illumina HumanHT-12 v4) and copy number profiling (Affymetrix SNP 6.0), respectively. Using the best practice PAM50/claudin-low microarray-based classifier, all metaplastic breast carcinomas with spindle cell metaplasia were of claudin-low subtype, whereas those with squamous or chondroid metaplasia were preferentially of basal-like subtype. Triple-negative breast cancer subtyping using a dedicated website (http://cbc.mc.vanderbilt.edu/tnbc/) revealed that all metaplastic breast carcinomas with chondroid metaplasia were of mesenchymal-like subtype, spindle cell carcinomas preferentially of unstable or mesenchymal stem-like subtype, and those with squamous metaplasia were of multiple subtypes. None of the cases was classified as immunomodulatory or luminal androgen receptor subtype. Integrative clustering, combining gene expression and gene copy number data, revealed that metaplastic breast carcinomas with spindle cell and chondroid metaplasia were preferentially classified as of integrative clusters 4 and 9, respectively, whereas those with squamous metaplasia were classified into six different clusters. Eight of the 26 metaplastic breast cancers subjected to SNP6 analysis were classified as BRCA1-like. The diversity of histologic features of metaplastic breast carcinomas is reflected at the transcriptomic level, and an association between molecular subtypes and histology was observed. BRCA1-like genomic profiles were found only in a subset (31%) of metaplastic breast cancers, and were not associated with a specific molecular or histologic subtype. PMID:25412848
Jain, K K
2001-02-01
Cambridge Healthtech Institute's Third Annual Conference on Lab-on-a-Chip and Microarray technology covered the latest advances in this technology and applications in life sciences. Highlights of the meetings are reported briefly with emphasis on applications in genomics, drug discovery and molecular diagnostics. There was an emphasis on microfluidics because of the wide applications in laboratory and drug discovery. The lab-on-a-chip provides the facilities of a complete laboratory in a hand-held miniature device. Several microarray systems have been used for hybridisation and detection techniques. Oligonucleotide scanning arrays provide a versatile tool for the analysis of nucleic acid interactions and provide a platform for improving the array-based methods for investigation of antisense therapeutics. A method for analysing combinatorial DNA arrays using oligonucleotide-modified gold nanoparticle probes and a conventional scanner has considerable potential in molecular diagnostics. Various applications of microarray technology for high-throughput screening in drug discovery and single nucleotide polymorphisms (SNP) analysis were discussed. Protein chips have important applications in proteomics. With the considerable amount of data generated by the different technologies using microarrays, it is obvious that the reading of the information and its interpretation and management through the use of bioinformatics is essential. Various techniques for data analysis were presented. Biochip and microarray technology has an essential role to play in the evolving trends in healthcare, which integrate diagnosis with prevention/treatment and emphasise personalised medicines.
Laing, Chad R; Buchanan, Cody; Taboada, Eduardo N; Zhang, Yongxiang; Karmali, Mohamed A; Thomas, James E; Gannon, Victor Pj
2009-06-29
Many approaches have been used to study the evolution, population structure and genetic diversity of Escherichia coli O157:H7; however, observations made with different genotyping systems are not easily relatable to each other. Three genetic lineages of E. coli O157:H7 designated I, II and I/II have been identified using octamer-based genome scanning and microarray comparative genomic hybridization (mCGH). Each lineage contains significant phenotypic differences, with lineage I strains being the most commonly associated with human infections. Similarly, a clade of hyper-virulent O157:H7 strains implicated in the 2006 spinach and lettuce outbreaks has been defined using single-nucleotide polymorphism (SNP) typing. In this study an in silico comparison of six different genotyping approaches was performed on 19 E. coli genome sequences from 17 O157:H7 strains and single O145:NM and K12 MG1655 strains to provide an overall picture of diversity of the E. coli O157:H7 population, and to compare genotyping methods for O157:H7 strains. In silico determination of lineage, Shiga-toxin bacteriophage integration site, comparative genomic fingerprint, mCGH profile, novel region distribution profile, SNP type and multi-locus variable number tandem repeat analysis type was performed and a supernetwork based on the combination of these methods was produced. This supernetwork showed three distinct clusters of strains that were O157:H7 lineage-specific, with the SNP-based hyper-virulent clade 8 synonymous with O157:H7 lineage I/II. Lineage I/II/clade 8 strains clustered closest on the supernetwork to E. coli K12 and E. coli O55:H7, O145:NM and sorbitol-fermenting O157 strains. The results of this study highlight the similarities in relationships derived from multi-locus genome sampling methods and suggest a "common genotyping language" may be devised for population genetics and epidemiological studies. Future genotyping methods should provide data that can be stored centrally and accessed locally in an easily transferable, informative and extensible format based on comparative genomic analyses.
Chono, Makiko; Matsunaka, Hitoshi; Seki, Masako; Fujita, Masaya; Kiribuchi-Otobe, Chikako; Oda, Shunsuke; Kojima, Hisayo; Nakamura, Shingo
2015-01-01
In the wheat (Triticum aestivum L.) cultivar ‘Zenkoujikomugi’, a single nucleotide polymorphism (SNP) in the promoter of MOTHER OF FT AND TFL1 on chromosome 3A (MFT-3A) causes an increase in the level of gene expression, resulting in strong grain dormancy. We used a DNA marker to detect the ‘Zenkoujikomugi’-type (Zen-type) SNP and examined the genotype of MFT-3A in Japanese wheat varieties, and we found that 169 of 324 varieties carry the Zen-type SNP. In Japanese commercial varieties, the frequency of the Zen-type SNP was remarkably high in the southern part of Japan, but low in the northern part. To examine the relationship between MFT-3A genotype and grain dormancy, we performed a germination assay in three wheat-growing seasons. On average, the varieties carrying the Zen-type SNP showed stronger grain dormancy than the varieties carrying the non-Zen-type SNP. Among commercial cultivars, ‘Iwainodaichi’ (Kyushu), ‘Junreikomugi’ (Kinki-Chugoku-Shikoku), ‘Kinuhime’ (Kanto-Tokai), ‘Nebarigoshi’ (Tohoku-Hokuriku), and ‘Kitamoe’ (Hokkaido) showed the strongest grain dormancy in each geographical group, and all these varieties, except for ‘Kitamoe’, were found to carry the Zen-type SNP. In recent years, the number of varieties carrying the Zen-type SNP has increased in the Tohoku-Hokuriku region, but not in the Hokkaido region. PMID:25931984
Report for the NGFA-5 project.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Jaing, C; Jackson, P; Thissen, J
The objective of this project is to provide DHS a comprehensive evaluation of the current genomic technologies including genotyping, TaqMan PCR, multiple locus variable tandem repeat analysis (MLVA), microarray and high-throughput DNA sequencing in the analysis of biothreat agents from complex environmental samples. To effectively compare the sensitivity and specificity of the different genomic technologies, we used SNP TaqMan PCR, MLVA, microarray and high-throughput illumine and 454 sequencing to test various strains from B. anthracis, B. thuringiensis, BioWatch aerosol filter extracts or soil samples that were spiked with B. anthracis, and samples that were previously collected during DHS and EPAmore » environmental release exercises that were known to contain B. thuringiensis spores. The results of all the samples against the various assays are discussed in this report.« less
ITALICS: an algorithm for normalization and DNA copy number calling for Affymetrix SNP arrays.
Rigaill, Guillem; Hupé, Philippe; Almeida, Anna; La Rosa, Philippe; Meyniel, Jean-Philippe; Decraene, Charles; Barillot, Emmanuel
2008-03-15
Affymetrix SNP arrays can be used to determine the DNA copy number measurement of 11 000-500 000 SNPs along the genome. Their high density facilitates the precise localization of genomic alterations and makes them a powerful tool for studies of cancers and copy number polymorphism. Like other microarray technologies it is influenced by non-relevant sources of variation, requiring correction. Moreover, the amplitude of variation induced by non-relevant effects is similar or greater than the biologically relevant effect (i.e. true copy number), making it difficult to estimate non-relevant effects accurately without including the biologically relevant effect. We addressed this problem by developing ITALICS, a normalization method that estimates both biological and non-relevant effects in an alternate, iterative manner, accurately eliminating irrelevant effects. We compared our normalization method with other existing and available methods, and found that ITALICS outperformed these methods for several in-house datasets and one public dataset. These results were validated biologically by quantitative PCR. The R package ITALICS (ITerative and Alternative normaLIzation and Copy number calling for affymetrix Snp arrays) has been submitted to Bioconductor.
Łastowska, M; Viprey, V; Santibanez-Koref, M; Wappler, I; Peters, H; Cullinane, C; Roberts, P; Hall, A G; Tweddle, D A; Pearson, A D J; Lewis, I; Burchill, S A; Jackson, M S
2007-11-22
Identifying genes, whose expression is consistently altered by chromosomal gains or losses, is an important step in defining genes of biological relevance in a wide variety of tumour types. However, additional criteria are needed to discriminate further among the large number of candidate genes identified. This is particularly true for neuroblastoma, where multiple genomic copy number changes of proven prognostic value exist. We have used Affymetrix microarrays and a combination of fluorescent in situ hybridization and single nucleotide polymorphism (SNP) microarrays to establish expression profiles and delineate copy number alterations in 30 primary neuroblastomas. Correlation of microarray data with patient survival and analysis of expression within rodent neuroblastoma cell lines were then used to define further genes likely to be involved in the disease process. Using this approach, we identify >1000 genes within eight recurrent genomic alterations (loss of 1p, 3p, 4p, 10q and 11q, 2p gain, 17q gain, and the MYCN amplicon) whose expression is consistently altered by copy number change. Of these, 84 correlate with patient survival, with the minimal regions of 17q gain and 4p loss being enriched significantly for such genes. These include genes involved in RNA and DNA metabolism, and apoptosis. Orthologues of all but one of these genes on 17q are overexpressed in rodent neuroblastoma cell lines. A significant excess of SNPs whose copy number correlates with survival is also observed on proximal 4p in stage 4 tumours, and we find that deletion of 4p is associated with improved outcome in an extended cohort of tumours. These results define the major impact of genomic copy number alterations upon transcription within neuroblastoma, and highlight genes on distal 17q and proximal 4p for downstream analyses. They also suggest that integration of discriminators, such as survival and comparative gene expression, with microarray data may be useful in the identification of critical genes within regions of loss or gain in many human cancers.
Miyamoto, T; Koh, E; Tsujimura, A; Miyagawa, Y; Saijo, Y; Namiki, M; Sengoku, K
2014-04-01
Genetic mechanisms have been implicated as a cause of some cases of male infertility. Recently, ten novel genes involved in human spermatogenesis, including human LRWD1, have been identified by expression microarray analysis of human testictissue. The human LRWD1 protein mediates the origin recognition complex in chromatin, which is critical for the initiation of pre-replication complex assembly in G1 and chromatin organization in post-G1 cells. The Lrwd1 gene expression is specific to the testis in mice. Therefore, we hypothesized that mutation or polymorphisms of LRWD1 participate in male infertility, especially azoospermia. To investigate whether LRWD1 gene defects are associated with azoospermia caused by SCOS and meiotic arrest (MA), mutational analysis was performed in 100 and 30 Japanese patients by direct sequencing of the coding regions, respectively. Statistical analysis was performed for patients with SCOS and MA and in 100 healthy control men. No mutations were found in LRWD1; however, three coding single-nucleotide polymorphisms (SNP1-SNP3) could be detected in the patients. The genotype and allele frequencies in SNP1 and SNP2 were notably higher in the SCOS group than in the control group (P < 0.05). These results suggest the critical role of LRWD1 in human spermatogenesis. © 2013 Blackwell Verlag GmbH.
Novel approach for deriving genome wide SNP analysis data from archived blood spots
2012-01-01
Background The ability to transport and store DNA at room temperature in low volumes has the advantage of optimising cost, time and storage space. Blood spots on adapted filter papers are popular for this, with FTA (Flinders Technology Associates) Whatman™TM technology being one of the most recent. Plant material, plasmids, viral particles, bacteria and animal blood have been stored and transported successfully using this technology, however the method of porcine DNA extraction from FTA Whatman™TM cards is a relatively new approach, allowing nucleic acids to be ready for downstream applications such as PCR, whole genome amplification, sequencing and subsequent application to single nucleotide polymorphism microarrays has hitherto been under-explored. Findings DNA was extracted from FTA Whatman™TM cards (following adaptations of the manufacturer’s instructions), whole genome amplified and subsequently analysed to validate the integrity of the DNA for downstream SNP analysis. DNA was successfully extracted from 288/288 samples and amplified by WGA. Allele dropout post WGA, was observed in less than 2% of samples and there was no clear evidence of amplification bias nor contamination. Acceptable call rates on porcine SNP chips were also achieved using DNA extracted and amplified in this way. Conclusions DNA extracted from FTA Whatman cards is of a high enough quality and quantity following whole genomic amplification to perform meaningful SNP chip studies. PMID:22974252
Shen, Wei; Paxton, Christian N; Szankasi, Philippe; Longhurst, Maria; Schumacher, Jonathan A; Frizzell, Kimberly A; Sorrells, Shelly M; Clayton, Adam L; Jattani, Rakhi P; Patel, Jay L; Toydemir, Reha; Kelley, Todd W; Xu, Xinjie
2018-04-01
Genetic abnormalities, including copy number variants (CNV), copy number neutral loss of heterozygosity (CN-LOH) and gene mutations, underlie the pathogenesis of myeloid malignancies and serve as important diagnostic, prognostic and/or therapeutic markers. Currently, multiple testing strategies are required for comprehensive genetic testing in myeloid malignancies. The aim of this proof-of-principle study was to investigate the feasibility of combining detection of genome-wide large CNVs, CN-LOH and targeted gene mutations into a single assay using next-generation sequencing (NGS). For genome-wide CNV detection, we designed a single nucleotide polymorphism (SNP) sequencing backbone with 22 762 SNP regions evenly distributed across the entire genome. For targeted mutation detection, 62 frequently mutated genes in myeloid malignancies were targeted. We combined this SNP sequencing backbone with a targeted mutation panel, and sequenced 9 healthy individuals and 16 patients with myeloid malignancies using NGS. We detected 52 somatic CNVs, 11 instances of CN-LOH and 39 oncogenic mutations in the 16 patients with myeloid malignancies, and none in the 9 healthy individuals. All CNVs and CN-LOH were confirmed by SNP microarray analysis. We describe a genome-wide SNP sequencing backbone which allows for sensitive detection of genome-wide CNVs and CN-LOH using NGS. This proof-of-principle study has demonstrated that this strategy can provide more comprehensive genetic profiling for patients with myeloid malignancies using a single assay. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2018. All rights reserved. No commercial use is permitted unless otherwise expressly granted.
A molecular scheme for improved characterization of human embryonic stem cell lines
Josephson, Richard; Sykes, Gregory; Liu, Ying; Ording, Carol; Xu, Weining; Zeng, Xianmin; Shin, Soojung; Loring, Jeanne; Maitra, Anirban; Rao, Mahendra S; Auerbach, Jonathan M
2006-01-01
Background Human embryonic stem cells (hESC) offer a renewable source of a wide range of cell types for use in research and cell-based therapies to treat disease. Inspection of protein markers provides important information about the current state of the cells and data for subsequent manipulations. However, hESC must be routinely analyzed at the genomic level to guard against deleterious changes during extensive propagation, expansion, and manipulation in vitro. Results We found that short tandem repeat (STR) analysis, human leukocyte antigen (HLA) typing, single nucleotide polymorphism (SNP) genomic analysis, mitochondrial DNA sequencing, and gene expression analysis by microarray can be used to fully describe any hESC culture in terms of its identity, stability, and undifferentiated state. Conclusion Here we describe, using molecular biology alone, a comprehensive characterization of 17 different hESC lines. The use of amplified nucleic acids means that for the first time full characterization of hESC lines can be performed with little time investment and a minimum of material. The information thus gained will facilitate comparison of lines and replication of results between laboratories. PMID:16919167
Dumitriu, Alexandra; Latourelle, Jeanne C; Hadzi, Tiffany C; Pankratz, Nathan; Garza, Dan; Miller, John P; Vance, Jeffery M; Foroud, Tatiana; Beach, Thomas G; Myers, Richard H
2012-06-01
Parkinson disease (PD) is a complex neurodegenerative disorder with largely unknown genetic mechanisms. While the degeneration of dopaminergic neurons in PD mainly takes place in the substantia nigra pars compacta (SN) region, other brain areas, including the prefrontal cortex, develop Lewy bodies, the neuropathological hallmark of PD. We generated and analyzed expression data from the prefrontal cortex Brodmann Area 9 (BA9) of 27 PD and 26 control samples using the 44K One-Color Agilent 60-mer Whole Human Genome Microarray. All samples were male, without significant Alzheimer disease pathology and with extensive pathological annotation available. 507 of the 39,122 analyzed expression probes were different between PD and control samples at false discovery rate (FDR) of 5%. One of the genes with significantly increased expression in PD was the forkhead box O1 (FOXO1) transcription factor. Notably, genes carrying the FoxO1 binding site were significantly enriched in the FDR-significant group of genes (177 genes covered by 189 probes), suggesting a role for FoxO1 upstream of the observed expression changes. Single-nucleotide polymorphisms (SNPs) selected from a recent meta-analysis of PD genome-wide association studies (GWAS) were successfully genotyped in 50 out of the 53 microarray brains, allowing a targeted expression-SNP (eSNP) analysis for 52 SNPs associated with PD affection at genome-wide significance and the 189 probes from FoxO1 regulated genes. A significant association was observed between a SNP in the cyclin G associated kinase (GAK) gene and a probe in the spermine oxidase (SMOX) gene. Further examination of the FOXO1 region in a meta-analysis of six available GWAS showed two SNPs significantly associated with age at onset of PD. These results implicate FOXO1 as a PD-relevant gene and warrant further functional analyses of its transcriptional regulatory mechanisms.
Dumitriu, Alexandra; Latourelle, Jeanne C.; Hadzi, Tiffany C.; Pankratz, Nathan; Garza, Dan; Miller, John P.; Vance, Jeffery M.; Foroud, Tatiana; Beach, Thomas G.; Myers, Richard H.
2012-01-01
Parkinson disease (PD) is a complex neurodegenerative disorder with largely unknown genetic mechanisms. While the degeneration of dopaminergic neurons in PD mainly takes place in the substantia nigra pars compacta (SN) region, other brain areas, including the prefrontal cortex, develop Lewy bodies, the neuropathological hallmark of PD. We generated and analyzed expression data from the prefrontal cortex Brodmann Area 9 (BA9) of 27 PD and 26 control samples using the 44K One-Color Agilent 60-mer Whole Human Genome Microarray. All samples were male, without significant Alzheimer disease pathology and with extensive pathological annotation available. 507 of the 39,122 analyzed expression probes were different between PD and control samples at false discovery rate (FDR) of 5%. One of the genes with significantly increased expression in PD was the forkhead box O1 (FOXO1) transcription factor. Notably, genes carrying the FoxO1 binding site were significantly enriched in the FDR–significant group of genes (177 genes covered by 189 probes), suggesting a role for FoxO1 upstream of the observed expression changes. Single-nucleotide polymorphisms (SNPs) selected from a recent meta-analysis of PD genome-wide association studies (GWAS) were successfully genotyped in 50 out of the 53 microarray brains, allowing a targeted expression–SNP (eSNP) analysis for 52 SNPs associated with PD affection at genome-wide significance and the 189 probes from FoxO1 regulated genes. A significant association was observed between a SNP in the cyclin G associated kinase (GAK) gene and a probe in the spermine oxidase (SMOX) gene. Further examination of the FOXO1 region in a meta-analysis of six available GWAS showed two SNPs significantly associated with age at onset of PD. These results implicate FOXO1 as a PD–relevant gene and warrant further functional analyses of its transcriptional regulatory mechanisms. PMID:22761592
Aravind Kumar, M; Singh, Vineeta; Naushad, Shaik Mohammad; Shanker, Uday; Lakshmi Narasu, M
2018-05-01
In the view of aggressive nature of Triple-Negative Breast cancer (TNBC) due to the lack of receptors (ER, PR, HER2) and high incidence of drug resistance associated with it, a case-control association study was conducted to identify the contributing genetic risk factors for Triple-negative breast cancer (TNBC). A total of 30 TNBC patients and 50 age and gender-matched controls of Indian origin were screened for 9,00,000 SNP markers using microarray-based SNP genotyping approach. The initial PLINK association analysis (p < 0.01, MAF 0.14-0.44, OR 10-24) identified 28 non-synonymous SNPs and one stop gain mutation in the exonic region as possible determinants of TNBC risk. All the 29 SNPs were annotated using ANNOVAR. The interactions between these markers were evaluated using Multifactor dimensionality reduction (MDR) analysis. The interactions were in the following order: exm408776 > exm1278309 > rs316389 > rs1651654 > rs635538 > exm1292477. Recursive partitioning analysis (RPA) was performed to construct decision tree useful in predicting TNBC risk. As shown in this analysis, rs1651654 and exm585172 SNPs are found to be determinants of TNBC risk. Artificial neural network model was used to generate the Receiver operating characteristic curves (ROC), which showed high sensitivity and specificity (AUC-0.94) of these markers. To conclude, among the 9,00,000 SNPs tested, CCDC42 exm1292477, ANXA3 exm408776, SASH1 exm585172 are found to be the most significant genetic predicting factors for TNBC. The interactions among exm408776, exm1278309, rs316389, rs1651654, rs635538, exm1292477 SNPs inflate the risk for TNBC further. Targeted analysis of these SNPs and genes alone also will have similar clinical utility in predicting TNBC.
[Phenotype-genotype correlation analysis of 12 cases with Angelman/Prader-Willi syndrome].
Chen, Chen; Peng, Ying; Xia, Yan; Li, Haoxian; Zhu, Huimin; Pan, Qian; Yin, Fei; Wu, Lingqian
2014-12-01
To investigate the genotype-phenotype correlation in patients with Angelman syndrome/Prader-Willi syndrome (AS/PWS) and assess the application value of high-resolution single nucleotide polymorphism microarrays (SNP array) for such diseases. Twelve AS/PWS patients were diagnosed through SNP array, fluorescence in situ hybridization (FISH) and karyotype analysis. Clinical characteristics were analyzed. Deletions ranging from 4.8 Mb to 7.0 Mb on chromosome 15q11.2-13 were detected in 11 patients. Uniparental disomy (UPD) was detected in only 1 patient. Patients with deletions could be divided into 2 groups, including 7 cases with class I and 4 with class II. The two groups however had no significant phenotypic difference. The UPD patient had relatively better development and language ability. Deletions of 6 patients were confirmed by FISH to be of de novo in origin. The risk to their sibs was determined to be less than 1%. The phenotypic differences between AS/PWS patients with class I and class II deletion need to be further studied. SNP array is useful in detecting and distinguishing of patients with deletion or UPD. This method may be applied for studying the genotype-phenotype association and the mechanism underlying AS/PWS.
Bruno, D L; Ganesamoorthy, D; Schoumans, J; Bankier, A; Coman, D; Delatycki, M; Gardner, R J M; Hunter, M; James, P A; Kannu, P; McGillivray, G; Pachter, N; Peters, H; Rieubland, C; Savarirayan, R; Scheffer, I E; Sheffield, L; Tan, T; White, S M; Yeung, A; Bowman, Z; Ngo, C; Choy, K W; Cacheux, V; Wong, L; Amor, D J; Slater, H R
2009-02-01
Microarray genome analysis is realising its promise for improving detection of genetic abnormalities in individuals with mental retardation and congenital abnormality. Copy number variations (CNVs) are now readily detectable using a variety of platforms and a major challenge is the distinction of pathogenic from ubiquitous, benign polymorphic CNVs. The aim of this study was to investigate replacement of time consuming, locus specific testing for specific microdeletion and microduplication syndromes with microarray analysis, which theoretically should detect all known syndromes with CNV aetiologies as well as new ones. Genome wide copy number analysis was performed on 117 patients using Affymetrix 250K microarrays. 434 CNVs (195 losses and 239 gains) were found, including 18 pathogenic CNVs and 9 identified as "potentially pathogenic". Almost all pathogenic CNVs were larger than 500 kb, significantly larger than the median size of all CNVs detected. Segmental regions of loss of heterozygosity larger than 5 Mb were found in 5 patients. Genome microarray analysis has improved diagnostic success in this group of patients. Several examples of recently discovered "new syndromes" were found suggesting they are more common than previously suspected and collectively are likely to be a major cause of mental retardation. The findings have several implications for clinical practice. The study revealed the potential to make genetic diagnoses that were not evident in the clinical presentation, with implications for pretest counselling and the consent process. The importance of contributing novel CNVs to high quality databases for genotype-phenotype analysis and review of guidelines for selection of individuals for microarray analysis is emphasised.
Trembizki, Ella; Smith, Helen; Lahra, Monica M; Chen, Marcus; Donovan, Basil; Fairley, Christopher K; Guy, Rebecca; Kaldor, John; Regan, David; Ward, James; Nissen, Michael D; Sloots, Theo P; Whiley, David M
2014-06-01
Neisseria gonorrhoeae antimicrobial resistance (AMR) is a global problem heightened by emerging resistance to ceftriaxone. Appropriate molecular typing methods are important for understanding the emergence and spread of N. gonorrhoeae AMR. We report on the development, validation and testing of a Sequenom MassARRAY iPLEX method for multilocus sequence typing (MLST)-style genotyping of N. gonorrhoeae isolates. An iPLEX MassARRAY method (iPLEX14SNP) was developed targeting 14 informative gonococcal single nucleotide polymorphisms (SNPs) previously shown to predict MLST types. The method was initially validated using 24 N. gonorrhoeae control isolates and was then applied to 397 test isolates collected throughout Queensland, Australia in the first half of 2012. The iPLEX14SNP method provided 100% accuracy for the control isolates, correctly identifying all 14 SNPs for all 24 isolates (336/336). For the 397 test isolates, the iPLEX14SNP assigned results for 5461 of the possible 5558 SNPs (SNP call rate 98.25%), with complete 14 SNP profiles obtained for 364 isolates. Based on the complete SNP profile data, there were 49 different sequence types identified in Queensland, with 11 of the 49 SNP profiles accounting for the majority (n = 280; 77%) of isolates. AMR was dominated by several geographically clustered sequence types. Using the iPLEX14SNP method, up to 384 isolates could be tested within 1 working day for less than Aus$10 per isolate. The iPLEX14SNP offers an accurate and high-throughput method for the MLST-style genotyping of N. gonorrhoeae and may prove particularly useful for large-scale studies investigating the emergence and spread of gonococcal AMR. © The Author 2014. Published by Oxford University Press on behalf of the British Society for Antimicrobial Chemotherapy. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
SNP-RFLPing 2: an updated and integrated PCR-RFLP tool for SNP genotyping.
Chang, Hsueh-Wei; Cheng, Yu-Huei; Chuang, Li-Yeh; Yang, Cheng-Hong
2010-04-08
PCR-restriction fragment length polymorphism (RFLP) assay is a cost-effective method for SNP genotyping and mutation detection, but the manual mining for restriction enzyme sites is challenging and cumbersome. Three years after we constructed SNP-RFLPing, a freely accessible database and analysis tool for restriction enzyme mining of SNPs, significant improvements over the 2006 version have been made and incorporated into the latest version, SNP-RFLPing 2. The primary aim of SNP-RFLPing 2 is to provide comprehensive PCR-RFLP information with multiple functionality about SNPs, such as SNP retrieval to multiple species, different polymorphism types (bi-allelic, tri-allelic, tetra-allelic or indels), gene-centric searching, HapMap tagSNPs, gene ontology-based searching, miRNAs, and SNP500Cancer. The RFLP restriction enzymes and the corresponding PCR primers for the natural and mutagenic types of each SNP are simultaneously analyzed. All the RFLP restriction enzyme prices are also provided to aid selection. Furthermore, the previously encountered updating problems for most SNP related databases are resolved by an on-line retrieval system. The user interfaces for functional SNP analyses have been substantially improved and integrated. SNP-RFLPing 2 offers a new and user-friendly interface for RFLP genotyping that can be used in association studies and is freely available at http://bio.kuas.edu.tw/snp-rflping2.
Quantitative phenotyping via deep barcode sequencing.
Smith, Andrew M; Heisler, Lawrence E; Mellor, Joseph; Kaper, Fiona; Thompson, Michael J; Chee, Mark; Roth, Frederick P; Giaever, Guri; Nislow, Corey
2009-10-01
Next-generation DNA sequencing technologies have revolutionized diverse genomics applications, including de novo genome sequencing, SNP detection, chromatin immunoprecipitation, and transcriptome analysis. Here we apply deep sequencing to genome-scale fitness profiling to evaluate yeast strain collections in parallel. This method, Barcode analysis by Sequencing, or "Bar-seq," outperforms the current benchmark barcode microarray assay in terms of both dynamic range and throughput. When applied to a complex chemogenomic assay, Bar-seq quantitatively identifies drug targets, with performance superior to the benchmark microarray assay. We also show that Bar-seq is well-suited for a multiplex format. We completely re-sequenced and re-annotated the yeast deletion collection using deep sequencing, found that approximately 20% of the barcodes and common priming sequences varied from expectation, and used this revised list of barcode sequences to improve data quality. Together, this new assay and analysis routine provide a deep-sequencing-based toolkit for identifying gene-environment interactions on a genome-wide scale.
Dutra, Roberta L; Piazzon, Flavia B; Zanardo, Évelin A; Costa, Thais Virginia Moura Machado; Montenegro, Marília M; Novo-Filho, Gil M; Dias, Alexandre T; Nascimento, Amom M; Kim, Chong Ae; Kulikowski, Leslie D
2015-12-01
Williams-Beuren syndrome (WBS) is caused by a hemizygous contiguous gene microdeletion of 1.55-1.84 Mb at 7q11.23 region. Approximately, 28 genes have been shown to contribute to classical phenotype of SWB with presence of dysmorphic facial features, supravalvular aortic stenosis (SVAS), intellectual disability, and overfriendliness. With the use of Microarray-based comparative genomic hybridization and other molecular cytogenetic techniques, is possible define with more accuracy partial or atypical deletion and refine the genotype-phenotype correlation. Here, we report on a rare genomic structural rearrangement in a boy with atypical deletion in 7q11.23 and XYY syndrome with characteristic clinical signs, but not sufficient for the diagnosis of WBS. Cytogenetic analysis of G-banding showed a karyotype 47,XYY. Analysis of DNA with the technique of MLPA (Multiplex Ligation-dependent Probe Amplification) using kits a combination of kits (P064, P036, P070, and P029) identified an atypical deletion on 7q11.23. In addition, high resolution SNP Oligonucleotide Microarray Analysis (SNP-array) confirmed the alterations found by MLPA and revealed others pathogenic CNVs, in the chromosomes 7 and X. The present report demonstrates an association not yet described in literature, between Williams-Beuren syndrome and 47,XYY. The identification of atypical deletion in 7q11.23 concomitant to additional pathogenic CNVs in others genomic regions allows a better comprehension of clinical consequences of atypical genomic rearrangements. © 2015 Wiley Periodicals, Inc.
Chaaba, Raja; Attia, Nebil; Hammami, Sonia; Smaoui, Maha; Mahjoub, Sylvia; Hammami, Mohamed; Masmoudi, Ahmed Slaheddine
2005-01-01
Background Apolipoprotein A-V (Apo A-V) gene has recently been identified as a new apolipoprotein involved in triglyceride metabolism. A single nucleotide polymorphism (SNP3) located in the gene promoter (-1131) was associated with triglyceride variation in healthy subjects. In type 2 diabetes the triglyceride level increased compared to healthy subjects. Hypertriglyceridemia is a risk factor for coronary artery disease. We aimed to examine the interaction between SNP3 and lipid profile and coronary artery disease (CAD) in Tunisian type 2 diabetic patients. Results The genotype frequencies of T/T, T/C and C/C were 0.74, 0.23 and 0.03 respectively in non diabetic subjects, 0.71, 0.25 and 0.04 respectively in type 2 diabetic patients. Triglyceride level was higher in heterozygous genotype (-1131 T/C) of apo A-V (p = 0.024). Heterozygous genotype is more frequent in high triglyceride group (40.9%) than in low triglyceride group (18.8%) ; p = 0.011. Despite the relation between CAD and hypertriglyceridemia the SNP 3 was not associated with CAD. Conclusion In type 2 diabetic patients SNP3 is associated with triglyceride level, however there was no association between SNP3 and coronary artery disease. PMID:15636639
Phetsuksiri, Benjawan; Srisungngam, Sopa; Rudeeaneksin, Janisara; Bunchoo, Supranee; Lukebua, Atchariya; Wongtrungkapun, Ruch; Paitoon, Soontara; Sakamuri, Rama Murthy; Brennan, Patrick J; Vissa, Varalakshmi
2012-01-01
Based on the discovery of three single nucleotide polymorphisms (SNPs) in Mycobacterium leprae, it has been previously reported that there are four major SNP types associated with different geographic regions around the world. Another typing system for global differentiation of M. leprae is the analysis of the variable number of short tandem repeats within the rpoT gene. To expand the analysis of geographic distribution of M. leprae, classified by SNP and rpoT gene polymorphisms, we studied 85 clinical isolates from Thai patients and compared the findings with those reported from Asian isolates. SNP genotyping by PCR amplification and sequencing revealed that all strains like those in Myanmar were SNP type 1 and 3, with the former being predominant, while in Japan, Korea, and Indonesia, the SNP type 3 was found to be more frequent. The pattern of M. leprae distribution in Thailand and Myanmar is quite similar, except that SNP type 2 was not found in Thailand. In addition, the 3-copy hexamer genotype in the rpoT gene is shared among the isolates from these two neighboring countries. On the basis of these two markers, we postulate that M. leprae in leprosy patients from Myanmar and Thailand has a common historical origin. Further differentiation among Thai isolates was possible by assessing copy numbers of the TTC sequence, a more polymorphic microsatellite locus.
A remark on copy number variation detection methods.
Li, Shuo; Dou, Xialiang; Gao, Ruiqi; Ge, Xinzhou; Qian, Minping; Wan, Lin
2018-01-01
Copy number variations (CNVs) are gain and loss of DNA sequence of a genome. High throughput platforms such as microarrays and next generation sequencing technologies (NGS) have been applied for genome wide copy number losses. Although progress has been made in both approaches, the accuracy and consistency of CNV calling from the two platforms remain in dispute. In this study, we perform a deep analysis on copy number losses on 254 human DNA samples, which have both SNP microarray data and NGS data publicly available from Hapmap Project and 1000 Genomes Project respectively. We show that the copy number losses reported from Hapmap Project and 1000 Genome Project only have < 30% overlap, while these reports are required to have cross-platform (e.g. PCR, microarray and high-throughput sequencing) experimental supporting by their corresponding projects, even though state-of-art calling methods were employed. On the other hand, copy number losses are found directly from HapMap microarray data by an accurate algorithm, i.e. CNVhac, almost all of which have lower read mapping depth in NGS data; furthermore, 88% of which can be supported by the sequences with breakpoint in NGS data. Our results suggest the ability of microarray calling CNVs and the possible introduction of false negatives from the unessential requirement of the additional cross-platform supporting. The inconsistency of CNV reports from Hapmap Project and 1000 Genomes Project might result from the inadequate information containing in microarray data, the inconsistent detection criteria, or the filtration effect of cross-platform supporting. The statistical test on CNVs called from CNVhac show that the microarray data can offer reliable CNV reports, and majority of CNV candidates can be confirmed by raw sequences. Therefore, the CNV candidates given by a good caller could be highly reliable without cross-platform supporting, so additional experimental information should be applied in need instead of necessarily.
Equalizer reduces SNP bias in Affymetrix microarrays.
Quigley, David
2015-07-30
Gene expression microarrays measure the levels of messenger ribonucleic acid (mRNA) in a sample using probe sequences that hybridize with transcribed regions. These probe sequences are designed using a reference genome for the relevant species. However, most model organisms and all humans have genomes that deviate from their reference. These variations, which include single nucleotide polymorphisms, insertions of additional nucleotides, and nucleotide deletions, can affect the microarray's performance. Genetic experiments comparing individuals bearing different population-associated single nucleotide polymorphisms that intersect microarray probes are therefore subject to systemic bias, as the reduction in binding efficiency due to a technical artifact is confounded with genetic differences between parental strains. This problem has been recognized for some time, and earlier methods of compensation have attempted to identify probes affected by genome variants using statistical models. These methods may require replicate microarray measurement of gene expression in the relevant tissue in inbred parental samples, which are not always available in model organisms and are never available in humans. By using sequence information for the genomes of organisms under investigation, potentially problematic probes can now be identified a priori. However, there is no published software tool that makes it easy to eliminate these probes from an annotation. I present equalizer, a software package that uses genome variant data to modify annotation files for the commonly used Affymetrix IVT and Gene/Exon platforms. These files can be used by any microarray normalization method for subsequent analysis. I demonstrate how use of equalizer on experiments mapping germline influence on gene expression in a genetic cross between two divergent mouse species and in human samples significantly reduces probe hybridization-induced bias, reducing false positive and false negative findings. The equalizer package reduces probe hybridization bias from experiments performed on the Affymetrix microarray platform, allowing accurate assessment of germline influence on gene expression.
Lager, Malin; Mernelius, Sara; Löfgren, Sture; Söderman, Jan
2016-01-01
Healthcare-associated infections caused by Escherichia coli and antibiotic resistance due to extended-spectrum beta-lactamase (ESBL) production constitute a threat against patient safety. To identify, track, and control outbreaks and to detect emerging virulent clones, typing tools of sufficient discriminatory power that generate reproducible and unambiguous data are needed. A probe based real-time PCR method targeting multiple single nucleotide polymorphisms (SNP) was developed. The method was based on the multi locus sequence typing scheme of Institute Pasteur and by adaptation of previously described typing assays. An 8 SNP-panel that reached a Simpson's diversity index of 0.95 was established, based on analysis of sporadic E. coli cases (ESBL n = 27 and non-ESBL n = 53). This multi-SNP assay was used to identify the sequence type 131 (ST131) complex according to the Achtman's multi locus sequence typing scheme. However, it did not fully discriminate within the complex but provided a diagnostic signature that outperformed a previously described detection assay. Pulsed-field gel electrophoresis typing of isolates from a presumed outbreak (n = 22) identified two outbreaks (ST127 and ST131) and three different non-outbreak-related isolates. Multi-SNP typing generated congruent data except for one non-outbreak-related ST131 isolate. We consider multi-SNP real-time PCR typing an accessible primary generic E. coli typing tool for rapid and uniform type identification.
Lavania, M; Jadhav, R S; Turankar, R P; Chaitanya, V S; Singh, M; Sengupta, U
2013-11-01
Earlier studies indicate that genotyping of Mycobaterium leprae based on single-nucleotide polymorphisms (SNPs) is useful for analysis of the global spread of leprosy. In the present study, we investigated the diversity of M. leprae at eight SNP loci using 180 clinical isolates obtained from patients with leprosy residing mainly in Delhi and Purulia (West Bengal) regions. It was observed that the frequency of SNP type 1 and subtype D was most predominant in the Indian population. Further, the SNP type 2 subtype E was noted only from East Delhi region and SNP type 2 subtype G was noted only from the nearby areas of Hoogly district of West Bengal. These results indicate the occurrence of focal transmission of M. leprae infection and demonstrate that analysis by SNP typing has great potential to help researchers in understanding the transmission of M. leprae infection in the community. © 2013 The Authors Clinical Microbiology and Infection © 2013 European Society of Clinical Microbiology and Infectious Diseases.
Accurate HLA type inference using a weighted similarity graph.
Xie, Minzhu; Li, Jing; Jiang, Tao
2010-12-14
The human leukocyte antigen system (HLA) contains many highly variable genes. HLA genes play an important role in the human immune system, and HLA gene matching is crucial for the success of human organ transplantations. Numerous studies have demonstrated that variation in HLA genes is associated with many autoimmune, inflammatory and infectious diseases. However, typing HLA genes by serology or PCR is time consuming and expensive, which limits large-scale studies involving HLA genes. Since it is much easier and cheaper to obtain single nucleotide polymorphism (SNP) genotype data, accurate computational algorithms to infer HLA gene types from SNP genotype data are in need. To infer HLA types from SNP genotypes, the first step is to infer SNP haplotypes from genotypes. However, for the same SNP genotype data set, the haplotype configurations inferred by different methods are usually inconsistent, and it is often difficult to decide which one is true. In this paper, we design an accurate HLA gene type inference algorithm by utilizing SNP genotype data from pedigrees, known HLA gene types of some individuals and the relationship between inferred SNP haplotypes and HLA gene types. Given a set of haplotypes inferred from the genotypes of a population consisting of many pedigrees, the algorithm first constructs a weighted similarity graph based on a new haplotype similarity measure and derives constraint edges from known HLA gene types. Based on the principle that different HLA gene alleles should have different background haplotypes, the algorithm searches for an optimal labeling of all the haplotypes with unknown HLA gene types such that the total weight among the same HLA gene types is maximized. To deal with ambiguous haplotype solutions, we use a genetic algorithm to select haplotype configurations that tend to maximize the same optimization criterion. Our experiments on a previously typed subset of the HapMap data show that the algorithm is highly accurate, achieving an accuracy of 96% for gene HLA-A, 95% for HLA-B, 97% for HLA-C, 84% for HLA-DRB1, 98% for HLA-DQA1 and 97% for HLA-DQB1 in a leave-one-out test. Our algorithm can infer HLA gene types from neighboring SNP genotype data accurately. Compared with a recent approach on the same input data, our algorithm achieved a higher accuracy. The code of our algorithm is available to the public for free upon request to the corresponding authors.
Audo, Isabelle; Bujakowska, Kinga; Mohand-Saïd, Saddek; Tronche, Sophie; Lancelot, Marie-Elise; Antonio, Aline; Germain, Aurore; Lonjou, Christine; Carpentier, Wassila; Sahel, José-Alain; Bhattacharya, Shomi; Zeitz, Christina
2011-01-01
To identify the genetic defect of a consanguineous Portuguese family with rod-cone dystrophy and varying degrees of decreased audition. A detailed ophthalmic and auditory examination was performed on a Portuguese patient with severe autosomal recessive rod-cone dystrophy. Known genetic defects were excluded by performing autosomal recessive retinitis pigmentosa (arRP) genotyping microarray analysis and by Sanger sequencing of the coding exons and flanking intronic regions of eyes shut homolog-drosophila (EYS) and chromosome 2 open reading frame 71 (C2orf71). Subsequently, genome-wide homozygosity mapping was performed in DNA samples from available family members using a 700K single nucleotide polymorphism (SNP) microarray. Candidate genes present in the significantly large homozygous regions were screened for mutations using Sanger sequencing. The largest homozygous region (~11 Mb) in the affected family members was mapped to chromosome 9, which harbors deafness, autosomal recessive 31 (DFNB31; a gene previously associated with Usher syndrome). Mutation analysis of DFNB31 in the index patient identified a novel one-base-pair deletion (c.737delC), which is predicted to lead to a truncated protein (p.Pro246HisfsX13) and co-segregated with the disease in the family. Ophthalmic examination of the index patient and the affected siblings showed severe rod-cone dystrophy. Pure tone audiometry revealed a moderate hearing loss in the index patient, whereas the affected siblings were reported with more profound and early onset hearing impairment. We report a novel truncating mutation in DFNB31 associated with severe rod-cone dystrophy and varying degrees of hearing impairment in a consanguineous family of Portuguese origin. This is the second report of DFNB31 implication in Usher type 2.
SEURAT: visual analytics for the integrated analysis of microarray data.
Gribov, Alexander; Sill, Martin; Lück, Sonja; Rücker, Frank; Döhner, Konstanze; Bullinger, Lars; Benner, Axel; Unwin, Antony
2010-06-03
In translational cancer research, gene expression data is collected together with clinical data and genomic data arising from other chip based high throughput technologies. Software tools for the joint analysis of such high dimensional data sets together with clinical data are required. We have developed an open source software tool which provides interactive visualization capability for the integrated analysis of high-dimensional gene expression data together with associated clinical data, array CGH data and SNP array data. The different data types are organized by a comprehensive data manager. Interactive tools are provided for all graphics: heatmaps, dendrograms, barcharts, histograms, eventcharts and a chromosome browser, which displays genetic variations along the genome. All graphics are dynamic and fully linked so that any object selected in a graphic will be highlighted in all other graphics. For exploratory data analysis the software provides unsupervised data analytics like clustering, seriation algorithms and biclustering algorithms. The SEURAT software meets the growing needs of researchers to perform joint analysis of gene expression, genomical and clinical data.
First example of an FY*01 allele associated with weakened expression of Fya on red blood cells.
Arndt, Patricia A; Horn, Trina; Keller, Jessica A; Heri, Suzanne M; Keller, Margaret A
2015-01-01
Duffy antigens are important in immunohematology. the reference allele for the Duffy gene (FY) is FY*02, which encodes Fy(b). An A>G single nucleotide polymorphism (SNP) at coding nucleotide (c.) 125 in exon 2 defines the FY*01 allele, which encodes the antithetical Fy(a). A C>T SNP at c.265 in the FY*02 allele is associated with weakening of Fy(b) expression on red blood cells (R BCs) (called Fy(x)). until recently, this latter change had not been described on a FY*01 background allele. Phenotype-matched units were desired for a multi-transfused Vietnamese fetus with α-thalassemia. Genotyping of the fetus using a microarray assay that interrogates three SNPs (c.1-67, c.125, and c.265) in FY yielded indeterminate results for the predicted Duffy phenotype. Genomic sequencing of FY exon 2 showed that the fetal sample had one wild-type FY*01 allele and one new FY*01 allele with the c.265C>T SNP, which until recently had only been found on the FY*02 allele. Genotyping performed on samples from the proband's parents indicated that the father had the same FY genotype as the fetus. Flow cytometry, which has been previously demonstrated as a useful method to study antigen strength on cells, was used to determine if this new FY*01 allele was associated with reduced Fy(a) expression on the father's RBCs. Median fluorescence intensity of the father's RBCs (after incubation with anti-FY(a) and fluorescein-labeled anti-IgG) was similar to known FY*01 heterozygotes. and significantly weaker than known FY*01 homozygotes. In conclusion, the fetus and father both had one normal FY*01 allele and one new FY*01W.01, is associated with weakened expression of Fy(a) on RBCs.
A highly efficient multi-core algorithm for clustering extremely large datasets
2010-01-01
Background In recent years, the demand for computational power in computational biology has increased due to rapidly growing data sets from microarray and other high-throughput technologies. This demand is likely to increase. Standard algorithms for analyzing data, such as cluster algorithms, need to be parallelized for fast processing. Unfortunately, most approaches for parallelizing algorithms largely rely on network communication protocols connecting and requiring multiple computers. One answer to this problem is to utilize the intrinsic capabilities in current multi-core hardware to distribute the tasks among the different cores of one computer. Results We introduce a multi-core parallelization of the k-means and k-modes cluster algorithms based on the design principles of transactional memory for clustering gene expression microarray type data and categorial SNP data. Our new shared memory parallel algorithms show to be highly efficient. We demonstrate their computational power and show their utility in cluster stability and sensitivity analysis employing repeated runs with slightly changed parameters. Computation speed of our Java based algorithm was increased by a factor of 10 for large data sets while preserving computational accuracy compared to single-core implementations and a recently published network based parallelization. Conclusions Most desktop computers and even notebooks provide at least dual-core processors. Our multi-core algorithms show that using modern algorithmic concepts, parallelization makes it possible to perform even such laborious tasks as cluster sensitivity and cluster number estimation on the laboratory computer. PMID:20370922
NASA Astrophysics Data System (ADS)
Liu, Hongna; Li, Song; Wang, Zhifei; Li, Zhiyang; Deng, Yan; Wang, Hua; Shi, Zhiyang; He, Nongyue
2008-11-01
Single nucleotide polymorphisms (SNPs) comprise the most abundant source of genetic variation in the human genome wide codominant SNPs identification. Therefore, large-scale codominant SNPs identification, especially for those associated with complex diseases, has induced the need for completely high-throughput and automated SNP genotyping method. Herein, we present an automated detection system of SNPs based on two kinds of functional magnetic nanoparticles (MNPs) and dual-color hybridization. The amido-modified MNPs (NH 2-MNPs) modified with APTES were used for DNA extraction from whole blood directly by electrostatic reaction, and followed by PCR, was successfully performed. Furthermore, biotinylated PCR products were captured on the streptavidin-coated MNPs (SA-MNPs) and interrogated by hybridization with a pair of dual-color probes to determine SNP, then the genotype of each sample can be simultaneously identified by scanning the microarray printed with the denatured fluorescent probes. This system provided a rapid, sensitive and highly versatile automated procedure that will greatly facilitate the analysis of different known SNPs in human genome.
Rare De Novo Copy Number Variants in Patients with Congenital Pulmonary Atresia
Xie, Li; Chen, Jin-Lan; Zhang, Wei-Zhi; Wang, Shou-Zheng; Zhao, Tian-Li; Huang, Can; Wang, Jian; Yang, Jin-Fu; Yang, Yi-Feng; Tan, Zhi-Ping
2014-01-01
Background Ongoing studies using genomic microarrays and next-generation sequencing have demonstrated that the genetic contributions to cardiovascular diseases have been significantly ignored in the past. The aim of this study was to identify rare copy number variants in individuals with congenital pulmonary atresia (PA). Methods and Results Based on the hypothesis that rare structural variants encompassing key genes play an important role in heart development in PA patients, we performed high-resolution genome-wide microarrays for copy number variations (CNVs) in 82 PA patient-parent trios and 189 controls with an Illumina SNP array platform. CNVs were identified in 17/82 patients (20.7%), and eight of these CNVs (9.8%) are considered potentially pathogenic. Five de novo CNVs occurred at two known congenital heart disease (CHD) loci (16p13.1 and 22q11.2). Two de novo CNVs that may affect folate and vitamin B12 metabolism were identified for the first time. A de novo 1-Mb deletion at 17p13.2 may represent a rare genomic disorder that involves mild intellectual disability and associated facial features. Conclusions Rare CNVs contribute to the pathogenesis of PA (9.8%), suggesting that the causes of PA are heterogeneous and pleiotropic. Together with previous data from animal models, our results might help identify a link between CHD and folate-mediated one-carbon metabolism (FOCM). With the accumulation of high-resolution SNP array data, these previously undescribed rare CNVs may help reveal critical gene(s) in CHD and may provide novel insights about CHD pathogenesis. PMID:24826987
Fagerholm, Rainer; Schmidt, Marjanka K; Khan, Sofia; Rafiq, Sajjad; Tapper, William; Aittomäki, Kristiina; Greco, Dario; Heikkinen, Tuomas; Muranen, Taru A; Fasching, Peter A; Janni, Wolfgang; Weinshilboum, Richard; Loehberg, Christian R; Hopper, John L; Southey, Melissa C; Keeman, Renske; Lindblom, Annika; Margolin, Sara; Mannermaa, Arto; Kataja, Vesa; Chenevix-Trench, Georgia; Lambrechts, Diether; Wildiers, Hans; Chang-Claude, Jenny; Seibold, Petra; Couch, Fergus J; Olson, Janet E; Andrulis, Irene L; Knight, Julia A; García-Closas, Montserrat; Figueroa, Jonine; Hooning, Maartje J; Jager, Agnes; Shah, Mitul; Perkins, Barbara J; Luben, Robert; Hamann, Ute; Kabisch, Maria; Czene, Kamila; Hall, Per; Easton, Douglas F; Pharoah, Paul D P; Liu, Jianjun; Eccles, Diana; Blomqvist, Carl; Nevanlinna, Heli
2015-04-10
We have utilized a two-stage study design to search for SNPs associated with the survival of breast cancer patients treated with adjuvant chemotherapy. Our initial GWS data set consisted of 805 Finnish breast cancer cases (360 treated with adjuvant chemotherapy). The top 39 SNPs from this stage were analyzed in three independent data sets: iCOGS (n=6720 chemotherapy-treated cases), SUCCESS-A (n=3596), and POSH (n=518). Two SNPs were successfully validated: rs6500843 (any chemotherapy; per-allele HR 1.16, 95% C.I. 1.08-1.26, p=0.0001, p(adjusted)=0.0091), and rs11155012 (anthracycline therapy; per-allele HR 1.21, 95% C.I. 1.08-1.35, p=0.0010, p(adjusted)=0.0270). The SNP rs6500843 was found to specifically interact with adjuvant chemotherapy, independently of standard prognostic markers (p(interaction)=0.0009), with the rs6500843-GG genotype corresponding to the highest hazard among chemotherapy-treated cases (HR 1.47, 95% C.I. 1.20-1.80). Upon trans-eQTL analysis of public microarray data, the rs6500843 locus was found to associate with the expression of a group of genes involved in cell cycle control, notably AURKA, the expression of which also exhibited differential prognostic value between chemotherapy-treated and untreated cases in our analysis of microarray data. Based on previously published information, we propose that the eQTL genes may be connected to the rs6500843 locus via a RBFOX1-FOXM1 -mediated regulatory pathway.
Rare de novo copy number variants in patients with congenital pulmonary atresia.
Xie, Li; Chen, Jin-Lan; Zhang, Wei-Zhi; Wang, Shou-Zheng; Zhao, Tian-Li; Huang, Can; Wang, Jian; Yang, Jin-Fu; Yang, Yi-Feng; Tan, Zhi-Ping
2014-01-01
Ongoing studies using genomic microarrays and next-generation sequencing have demonstrated that the genetic contributions to cardiovascular diseases have been significantly ignored in the past. The aim of this study was to identify rare copy number variants in individuals with congenital pulmonary atresia (PA). Based on the hypothesis that rare structural variants encompassing key genes play an important role in heart development in PA patients, we performed high-resolution genome-wide microarrays for copy number variations (CNVs) in 82 PA patient-parent trios and 189 controls with an Illumina SNP array platform. CNVs were identified in 17/82 patients (20.7%), and eight of these CNVs (9.8%) are considered potentially pathogenic. Five de novo CNVs occurred at two known congenital heart disease (CHD) loci (16p13.1 and 22q11.2). Two de novo CNVs that may affect folate and vitamin B12 metabolism were identified for the first time. A de novo 1-Mb deletion at 17p13.2 may represent a rare genomic disorder that involves mild intellectual disability and associated facial features. Rare CNVs contribute to the pathogenesis of PA (9.8%), suggesting that the causes of PA are heterogeneous and pleiotropic. Together with previous data from animal models, our results might help identify a link between CHD and folate-mediated one-carbon metabolism (FOCM). With the accumulation of high-resolution SNP array data, these previously undescribed rare CNVs may help reveal critical gene(s) in CHD and may provide novel insights about CHD pathogenesis.
Reibel, Florence; Chauffour, Aurélie; Brossier, Florence; Jarlier, Vincent; Cambau, Emmanuelle; Aubry, Alexandra
2015-01-01
Between 20 and 30 bacteriologically confirmed cases of leprosy are diagnosed each year at the French National Reference Center for mycobacteria. Patients are mainly immigrants from various endemic countries or living in French overseas territories. We aimed at expanding data regarding the geographical distribution of the SNP genotypes of the M. leprae isolates from these patients. Skin biopsies were obtained from 71 leprosy patients diagnosed between January 2009 and December 2013. Data regarding age, sex and place of birth and residence were also collected. Diagnosis of leprosy was confirmed by microscopic detection of acid-fast bacilli and/or amplification by PCR of the M. leprae-specific RLEP region. Single nucleotide polymorphisms (SNP), present in the M. leprae genome at positions 14 676, 1 642 875 and 2 935 685, were determined with an efficiency of 94% (67/71). Almost all patients were from countries other than France where leprosy is still prevalent (n = 31) or from French overseas territories (n = 36) where leprosy is not totally eradicated, while only a minority (n = 4) was born in metropolitan France but have lived in other countries. SNP type 1 was predominant (n = 33), followed by type 3 (n = 17), type 4 (n = 11) and type 2 (n = 6). SNP types were concordant with those previously reported as prevalent in the patients' countries of birth. SNP types found in patients born in countries other than France (Comoros, Haiti, Benin, Congo, Sri Lanka) and French overseas territories (French Polynesia, Mayotte and La Réunion) not covered by previous work correlated well with geographical location and history of human settlements. The phylogenic analysis of M. leprae strains isolated in France strongly suggests that French leprosy cases are caused by SNP types that are (a) concordant with the geographic origin or residence of the patients (non-French countries, French overseas territories, metropolitan France) or (b) more likely random in regions where diverse migration flows occurred.
Quantitative phenotyping via deep barcode sequencing
Smith, Andrew M.; Heisler, Lawrence E.; Mellor, Joseph; Kaper, Fiona; Thompson, Michael J.; Chee, Mark; Roth, Frederick P.; Giaever, Guri; Nislow, Corey
2009-01-01
Next-generation DNA sequencing technologies have revolutionized diverse genomics applications, including de novo genome sequencing, SNP detection, chromatin immunoprecipitation, and transcriptome analysis. Here we apply deep sequencing to genome-scale fitness profiling to evaluate yeast strain collections in parallel. This method, Barcode analysis by Sequencing, or “Bar-seq,” outperforms the current benchmark barcode microarray assay in terms of both dynamic range and throughput. When applied to a complex chemogenomic assay, Bar-seq quantitatively identifies drug targets, with performance superior to the benchmark microarray assay. We also show that Bar-seq is well-suited for a multiplex format. We completely re-sequenced and re-annotated the yeast deletion collection using deep sequencing, found that ∼20% of the barcodes and common priming sequences varied from expectation, and used this revised list of barcode sequences to improve data quality. Together, this new assay and analysis routine provide a deep-sequencing-based toolkit for identifying gene–environment interactions on a genome-wide scale. PMID:19622793
Karas, Vlad O; Sinnott-Armstrong, Nicholas A; Varghese, Vici; Shafer, Robert W; Greenleaf, William J; Sherlock, Gavin
2018-01-01
Abstract Much of the within species genetic variation is in the form of single nucleotide polymorphisms (SNPs), typically detected by whole genome sequencing (WGS) or microarray-based technologies. However, WGS produces mostly uninformative reads that perfectly match the reference, while microarrays require genome-specific reagents. We have developed Diff-seq, a sequencing-based mismatch detection assay for SNP discovery without the requirement for specialized nucleic-acid reagents. Diff-seq leverages the Surveyor endonuclease to cleave mismatched DNA molecules that are generated after cross-annealing of a complex pool of DNA fragments. Sequencing libraries enriched for Surveyor-cleaved molecules result in increased coverage at the variant sites. Diff-seq detected all mismatches present in an initial test substrate, with specific enrichment dependent on the identity and context of the variation. Application to viral sequences resulted in increased observation of variant alleles in a biologically relevant context. Diff-Seq has the potential to increase the sensitivity and efficiency of high-throughput sequencing in the detection of variation. PMID:29361139
Pounds, Stan; Cao, Xueyuan; Cheng, Cheng; Yang, Jun; Campana, Dario; Evans, William E.; Pui, Ching-Hon; Relling, Mary V.
2010-01-01
Powerful methods for integrated analysis of multiple biological data sets are needed to maximize interpretation capacity and acquire meaningful knowledge. We recently developed Projection Onto the Most Interesting Statistical Evidence (PROMISE). PROMISE is a statistical procedure that incorporates prior knowledge about the biological relationships among endpoint variables into an integrated analysis of microarray gene expression data with multiple biological and clinical endpoints. Here, PROMISE is adapted to the integrated analysis of pharmacologic, clinical, and genome-wide genotype data that incorporating knowledge about the biological relationships among pharmacologic and clinical response data. An efficient permutation-testing algorithm is introduced so that statistical calculations are computationally feasible in this higher-dimension setting. The new method is applied to a pediatric leukemia data set. The results clearly indicate that PROMISE is a powerful statistical tool for identifying genomic features that exhibit a biologically meaningful pattern of association with multiple endpoint variables. PMID:21516175
Tong, Steven Y C; Xie, Shirley; Richardson, Leisha J; Ballard, Susan A; Dakh, Farshid; Grabsch, Elizabeth A; Grayson, M Lindsay; Howden, Benjamin P; Johnson, Paul D R; Giffard, Philip M
2011-01-01
We have developed a single nucleotide polymorphism (SNP) nucleated high-resolution melting (HRM) technique to genotype Enterococcus faecium. Eight SNPs were derived from the E. faecium multilocus sequence typing (MLST) database and amplified fragments containing these SNPs were interrogated by HRM. We tested the HRM genotyping scheme on 85 E. faecium bloodstream isolates and compared the results with MLST, pulsed-field gel electrophoresis (PFGE) and an allele specific real-time PCR (AS kinetic PCR) SNP typing method. In silico analysis based on predicted HRM curves according to the G+C content of each fragment for all 567 sequence types (STs) in the MLST database together with empiric data from the 85 isolates demonstrated that HRM analysis resolves E. faecium into 231 "melting types" (MelTs) and provides a Simpson's Index of Diversity (D) of 0.991 with respect to MLST. This is a significant improvement on the AS kinetic PCR SNP typing scheme that resolves 61 SNP types with D of 0.95. The MelTs were concordant with the known ST of the isolates. For the 85 isolates, there were 13 PFGE patterns, 17 STs, 14 MelTs and eight SNP types. There was excellent concordance between PFGE, MLST and MelTs with Adjusted Rand Indices of PFGE to MelT 0.936 and ST to MelT 0.973. In conclusion, this HRM based method appears rapid and reproducible. The results are concordant with MLST and the MLST based population structure.
Yuan, Youhua; Wen, Yan; You, Yuangang; Xing, Yan; Li, Huanying; Weng, Xiaoman; Wu, Nan; Liu, Shuang; Zhang, Shanshan; Zhang, Wenhong; Zhang, Ying
2015-01-01
Leprosy continues to be prevalent in some mountainous regions of China, and genotypes of leprosy strains endemic to the country are not known. Mycobacterium lepromatosis is a new species that was discovered in Mexico in 2008, and it remains unclear whether this species exists in China. Here, we conducted PCR- restriction fragment length polymorphism (RFLP) analysis to classify genotypes of 85 DNA samples collected from patients from 18 different provinces. All 171 DNA samples from skin biopsies of leprosy patients were tested for the presence of Mycobacterium leprae and Mycobacterium lepromatosis by amplifying the 16S rRNA gene using nested PCR, followed by DNA sequencing. The new species M. lepromatosis was not found among the 171 specimens from leprosy patients in 22 provinces in China. However, we found three SNP genotypes among 85 leprosy patients. A mutation at C251T in the 16S rRNA gene was found in 76% of the strains. We also found that the strains that showed the 16S rRNA C251T mutation belonged to SNP type 3, whereas strains without the point mutation belonged to SNP type 1. The SNP type 3 leprosy strains were observed in patients from both the inner and coastal regions of China, but the SNP type 1 strains were focused only in the coastal region. This indicated that the SNP type 3 leprosy strains were more prevalent than the SNP type 1 strains in China. In addition, the 16S rRNA gene sequence mutation at C251T also indicated a difference in the geographical distribution of the strains. To our knowledge, this is the first report of a new polymorphism in 16S rRNA gene in M. leprae in China. Our findings shed light on the prevalent genotypes and provide insight about leprosy transmission that are important for leprosy control in China.
Mason, Jane A; Aung, Hnin T; Nandini, Adayapalam; Woods, Rickie G; Fairbairn, David J; Rowell, John A; Young, David; Susman, Rachel D; Brown, Simon A; Hyland, Valentine J; Robertson, Jeremy D
2018-05-01
We report a kindred referred for molecular investigation of severe hemophilia A in a young female in which extremely skewed X-inactivation was observed in both the proband and her clinically normal mother. Bidirectional Sanger sequencing of all F8 gene coding regions and exon/intron boundaries was undertaken. Methylation-sensitive restriction enzymes were utilized to investigate skewed X-inactivation using both a classical human androgen receptor (HUMARA) assay, and a novel method targeting differential methylation patterns in multiple informative X-chromosome SNPs. Illumina Whole-Genome Infinium microarray analysis was performed in the case-parent trio (proband and both parents), and the proband's maternal grandmother. The proband was a cytogenetically normal female with severe hemophilia A resulting from a heterozygous F8 pathogenic variant inherited from her similarly affected father. No F8 mutation was identified in the proband's mother, however, both the proband and her mother both demonstrated completely skewed X-chromosome inactivation (100%) in association with a previously unreported 2.3 Mb deletion at Xp22.2. At least three disease-associated genes (FANCB, AP1S2, and PIGA) were contained within the deleted region. We hypothesize that true "extreme" skewing of X-inactivation (≥95%) is a rare occurrence, but when defined correctly there is a high probability of finding an X-chromosome disease-causing variant or larger deletion resulting in X-inactivation through a survival disadvantage or cell lethal mechanism. We postulate that the 2.3 Mb Xp22.2 deletion identified in our kindred arose de novo in the proband's mother (on the grandfather's homolog), and produced extreme skewing of X-inactivation via a "cell lethal" mechanism. We introduce a novel multitarget approach for X-inactivation analysis using multiple informative differentially methylated SNPs, as an alternative to the classical single locus (HUMARA) method. We propose that for females with unexplained severe phenotypic expression of an X-linked recessive disorder trio-SNP microarray should be undertaken in combination with X-inactivation analysis. © 2018 The Authors. Molecular Genetics & Genomic Medicine published by Wiley Periodicals, Inc.
Loya Méndez, Yolanda; Reyes Leal, Gilberto; Sánchez González, Adriana; Portillo Reyes, Verónica; Reyes Ruvalcaba, David; Bojórquez Rangel, Guillermo
2014-09-28
Diabetes Mellitus (DM) type 2 is a common pathology with multifactorial etiology, which exact genetic bases remain unknown. Some studies suggest that single nucleotides polymorphisms (SNPs) in the CAPN10 gene (Locus 2q37.3) could be associated with the development of this disease, including the insertion/deletion polymorphism SNP-19 (2R→3R). The present study determined the association between the SNP-19 and the risk of developing DM type 2 in Ciudad Juarez population. For this study 107 participants were selected: 43 diabetics type 2 (cases) and 64 non diabetics with no family history of DM type 2 in first grade (control). Anthropometric studies were realized as well as lipids, lipoproteins and serum glucose biochemical profiles. The genotypification of SNP-19 was performed using peripheral blood lymphocytes DNA, polymerase chain reactions (PCR), and electrophoretic analysis in agarose gels. Once obtained the genotypic and allelic frequencies, the Hardy-Weinberg equilibrium test (GenAlEx 6.4) was also performed. Using the X² analysis it was identified the genotypic differences between cases and control with higher frequency of the homozygous genotype 3R of SNP- 19 in the cases group (0.418) compared to control group (0.265). Also, it was observed an association between genotype 2R/3R with elevated weight, body mass index, and waist and hip circumferences, but only in the diabetic group (P=< 0.05). The findings in this study suggest that SNP-19 in CAPN10 may participate in the development of DM type 2 in the studied population. Copyright AULA MEDICA EDICIONES 2014. Published by AULA MEDICA. All rights reserved.
Daca-Roszak, P; Pfeifer, A; Żebracka-Gala, J; Jarząb, B; Witt, M; Ziętkiewicz, E
2016-01-01
Assays that allow analysis of the biogeographic origin of biological samples in a standard forensic laboratory have to target a small number of highly differentiating markers. Such markers should be easy to multiplex and the assay must perform well in the degraded and scarce biological material. SNPs localized in the genome regions, which in the past were subjected to differential selective pressure in various populations, are the most widely used markers in the studies of biogeographic affiliation. SNPs reflecting biogeographic differences not related to any phenotypic traits are not sufficiently explored. The goal of our study was to identify a small set of SNPs not related to any known pigmentation/phenotype-specific genes, which would allow efficient discrimination between populations of Europe and East Asia. The selection of SNPs was based on the comparative analysis of representative European and Chinese/Japanese samples (B-lymphocyte cell lines), genotyped using the Infinium HumanOmniExpressExome microarray (Illumina). The classifier, consisting of 24 unlinked SNPs (24-SNP classifier), was selected. The performance of a 14-SNP subset of this classifier (14-SNP subclassifier) was tested using genotype data from several populations. The 14-SNP subclassifier differentiated East Asians, Europeans and Africans with ∼100% accuracy; Palestinians, representative of the Middle East, clustered with Europeans, while Amerindians and Pakistani were placed between East Asian and European populations. Based on these results, we have developed a SNaPshot assay (EurEAs_Gplex) for genotyping SNPs from the 14-SNP subclassifier, combined with an additional marker for gender identification. Forensic utility of the EurEAs_Gplex was verified using degraded and low quantity DNA samples. The performance of the EurEAs_Gplex was satisfactory when using degraded DNA; tests using low quantity DNA samples revealed a previously not described source of genotyping errors, potentially important for any SNaPshot-based assays. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
Gong, Bin-Sheng; Zhang, Qing-Pu; Zhang, Guang-Mei; Zhang, Shao-Jun; Zhang, Wei; Lv, Hong-Chao; Zhang, Fan; Lv, Sa-Li; Li, Chuan-Xing; Rao, Shao-Qi; Li, Xia
2007-01-01
Gene expression profiles and single-nucleotide polymorphism (SNP) profiles are modern data for genetic analysis. It is possible to use the two types of information to analyze the relationships among genes by some genetical genomics approaches. In this study, gene expression profiles were used as expression traits. And relationships among the genes, which were co-linked to a common SNP(s), were identified by integrating the two types of information. Further research on the co-expressions among the co-linked genes was carried out after the gene-SNP relationships were established using the Haseman-Elston sib-pair regression. The results showed that the co-expressions among the co-linked genes were significantly higher if the number of connections between the genes and a SNP(s) was more than six. Then, the genes were interconnected via one or more SNP co-linkers to construct a gene-SNP intermixed network. The genes sharing more SNPs tended to have a stronger correlation. Finally, a gene-gene network was constructed with their intensities of relationships (the number of SNP co-linkers shared) as the weights for the edges. PMID:18466544
Mycobacterium leprae in Colombia described by SNP7614 in gyrA, two minisatellites and geography
Cardona-Castro, Nora; Beltrán-Alzate, Juan Camilo; Romero-Montoya, Irma Marcela; Li, Wei; Brennan, Patrick J; Vissa, Varalakshmi
2013-01-01
New cases of leprosy are still being detected in Colombia after the country declared achievement of the WHO defined ‘elimination’ status. To study the ecology of leprosy in endemic regions, a combination of geographic and molecular tools were applied for a group of 201 multibacillary patients including six multi-case families from eleven departments. The location (latitude and longitude) of patient residences were mapped. Slit skin smears and/or skin biopsies were collected and DNA was extracted. Standard agarose gel electrophoresis following a multiplex PCR-was developed for rapid and inexpensive strain typing of M. leprae based on copy numbers of two VNTR minisatellite loci 27-5 and 12-5. A SNP (C/T) in gyrA (SNP7614) was mapped by introducing a novel PCR-RFLP into an ongoing drug resistance surveillance effort. Multiple genotypes were detected combining the three molecular markers. The two frequent genotypes in Colombia were SNP7614(C)/27-5(5)/12-5(4) [C54] predominantly distributed in the Atlantic departments and SNP7614 (T)/27-5(4)/12-5(5) [T45] associated with the Andean departments. A novel genotype SNP7614 (C)/27-5(6)/12-5(4) [C64] was detected in cities along the Magdalena river which separates the Andean from Atlantic departments; a subset was further characterized showing association with a rare allele of minisatellite 23-3 and the SNP type 1 of M. leprae. The genotypes within intra-family cases were conserved. Overall, this is the first large scale study that utilized simple and rapid assay formats for identification of major strain types and their distribution in Colombia. It provides the framework for further strain type discrimination and geographic information systems as tools for tracing transmission of leprosy. PMID:23291420
Reibel, Florence; Chauffour, Aurélie; Brossier, Florence; Jarlier, Vincent; Cambau, Emmanuelle; Aubry, Alexandra
2015-01-01
Background Between 20 and 30 bacteriologically confirmed cases of leprosy are diagnosed each year at the French National Reference Center for mycobacteria. Patients are mainly immigrants from various endemic countries or living in French overseas territories. We aimed at expanding data regarding the geographical distribution of the SNP genotypes of the M. leprae isolates from these patients. Methodology/Principal findings Skin biopsies were obtained from 71 leprosy patients diagnosed between January 2009 and December 2013. Data regarding age, sex and place of birth and residence were also collected. Diagnosis of leprosy was confirmed by microscopic detection of acid-fast bacilli and/or amplification by PCR of the M. leprae-specific RLEP region. Single nucleotide polymorphisms (SNP), present in the M. leprae genome at positions 14 676, 1 642 875 and 2 935 685, were determined with an efficiency of 94% (67/71). Almost all patients were from countries other than France where leprosy is still prevalent (n = 31) or from French overseas territories (n = 36) where leprosy is not totally eradicated, while only a minority (n = 4) was born in metropolitan France but have lived in other countries. SNP type 1 was predominant (n = 33), followed by type 3 (n = 17), type 4 (n = 11) and type 2 (n = 6). SNP types were concordant with those previously reported as prevalent in the patients’ countries of birth. SNP types found in patients born in countries other than France (Comoros, Haiti, Benin, Congo, Sri Lanka) and French overseas territories (French Polynesia, Mayotte and La Réunion) not covered by previous work correlated well with geographical location and history of human settlements. Conclusions/Significance The phylogenic analysis of M. leprae strains isolated in France strongly suggests that French leprosy cases are caused by SNP types that are (a) concordant with the geographic origin or residence of the patients (non-French countries, French overseas territories, metropolitan France) or (b) more likely random in regions where diverse migration flows occurred. PMID:26441080
Xu, Huihui; Xiao, Bing; Ji, Xing; Hu, Qin; Chen, Yingwei; Qiu, Wenjuan
2014-07-01
Tetrasomy for the distal chromosome 15q is rare, and only 22 patients (including 6 cases without detailed information) have been described to date in the literature. Here we report on another patient with nonmosaic tetrasomy 15q25.2-qter resulted from an inverted duplication of distal chromosome 15. This patient presents with features of development delay, arachnodactyly, joint contractures and typical facial dysmorphism including frontal bossing, short palpebral fissures, long philtrum, low-set ears, high-arched palate and retrognathia. Unlike most of the related patients, abdominal ultrasound test and brain MRI showed normal. Karyotyping analysis revealed a supernumerary marker chromosome presented in all metaphase cells examined. Parental karyotyping analysis was normal, indicating a de novo chromosome aberration of the patient. SNP microarray analysis found a two copy gain of 17.7 Mb from the distal long arm of chromosome 15 (15q25.2-qter). Further FISH analysis using SureFISH 15q26.3 IGF1R probe proved an inverted duplication of distal long arm of chromosome 15. The segmental duplications which lie in the hotspots of 15q24-26 might increase the susceptibility of chromosome rearrangement. Compared with the George-Abraham' study [2012], ADAMTSL3 might be more related to the cardiac disorders in tetrasomy 15q patients. Considering all patients reported in the literature, different mosaic degrees and segmental sizes don't correlate to the severity of phenotypes. A clear delineation on tetrasomy for distal chromosome 15q could still be investigated. Copyright © 2014 Elsevier Masson SAS. All rights reserved.
Identification of Allelic Imbalance with a Statistical Model for Subtle Genomic Mosaicism
Xia, Rui; Vattathil, Selina; Scheet, Paul
2014-01-01
Genetic heterogeneity in a mixed sample of tumor and normal DNA can confound characterization of the tumor genome. Numerous computational methods have been proposed to detect aberrations in DNA samples from tumor and normal tissue mixtures. Most of these require tumor purities to be at least 10–15%. Here, we present a statistical model to capture information, contained in the individual's germline haplotypes, about expected patterns in the B allele frequencies from SNP microarrays while fully modeling their magnitude, the first such model for SNP microarray data. Our model consists of a pair of hidden Markov models—one for the germline and one for the tumor genome—which, conditional on the observed array data and patterns of population haplotype variation, have a dependence structure induced by the relative imbalance of an individual's inherited haplotypes. Together, these hidden Markov models offer a powerful approach for dealing with mixtures of DNA where the main component represents the germline, thus suggesting natural applications for the characterization of primary clones when stromal contamination is extremely high, and for identifying lesions in rare subclones of a tumor when tumor purity is sufficient to characterize the primary lesions. Our joint model for germline haplotypes and acquired DNA aberration is flexible, allowing a large number of chromosomal alterations, including balanced and imbalanced losses and gains, copy-neutral loss-of-heterozygosity (LOH) and tetraploidy. We found our model (which we term J-LOH) to be superior for localizing rare aberrations in a simulated 3% mixture sample. More generally, our model provides a framework for full integration of the germline and tumor genomes to deal more effectively with missing or uncertain features, and thus extract maximal information from difficult scenarios where existing methods fail. PMID:25166618
Integrative Assessment of Chlorine-Induced Acute Lung Injury in Mice
Pope-Varsalona, Hannah; Concel, Vincent J.; Liu, Pengyuan; Bein, Kiflai; Berndt, Annerose; Martin, Timothy M.; Ganguly, Koustav; Jang, An Soo; Brant, Kelly A.; Dopico, Richard A.; Upadhyay, Swapna; Di, Y. P. Peter; Hu, Zhen; Vuga, Louis J.; Medvedovic, Mario; Kaminski, Naftali; You, Ming; Alexander, Danny C.; McDunn, Jonathan E.; Prows, Daniel R.; Knoell, Daren L.
2012-01-01
The genetic basis for the underlying individual susceptibility to chlorine-induced acute lung injury is unknown. To uncover the genetic basis and pathophysiological processes that could provide additional homeostatic capacities during lung injury, 40 inbred murine strains were exposed to chlorine, and haplotype association mapping was performed. The identified single-nucleotide polymorphism (SNP) associations were evaluated through transcriptomic and metabolomic profiling. Using ≥ 10% allelic frequency and ≥ 10% phenotype explained as threshold criteria, promoter SNPs that could eliminate putative transcriptional factor recognition sites in candidate genes were assessed by determining transcript levels through microarray and reverse real-time PCR during chlorine exposure. The mean survival time varied by approximately 5-fold among strains, and SNP associations were identified for 13 candidate genes on chromosomes 1, 4, 5, 9, and 15. Microarrays revealed several differentially enriched pathways, including protein transport (decreased more in the sensitive C57BLKS/J lung) and protein catabolic process (increased more in the resistant C57BL/10J lung). Lung metabolomic profiling revealed 95 of the 280 metabolites measured were altered by chlorine exposure, and included alanine, which decreased more in the C57BLKS/J than in the C57BL/10J strain, and glutamine, which increased more in the C57BL/10J than in the C57BLKS/J strain. Genetic associations from haplotype mapping were strengthened by an integrated assessment using transcriptomic and metabolomic profiling. The leading candidate genes associated with increased susceptibility to acute lung injury in mice included Klf4, Sema7a, Tns1, Aacs, and a gene that encodes an amino acid carrier, Slc38a4. PMID:22447970
Wong, Gerard; Leckie, Christopher; Kowalczyk, Adam
2012-01-15
Feature selection is a key concept in machine learning for microarray datasets, where features represented by probesets are typically several orders of magnitude larger than the available sample size. Computational tractability is a key challenge for feature selection algorithms in handling very high-dimensional datasets beyond a hundred thousand features, such as in datasets produced on single nucleotide polymorphism microarrays. In this article, we present a novel feature set reduction approach that enables scalable feature selection on datasets with hundreds of thousands of features and beyond. Our approach enables more efficient handling of higher resolution datasets to achieve better disease subtype classification of samples for potentially more accurate diagnosis and prognosis, which allows clinicians to make more informed decisions in regards to patient treatment options. We applied our feature set reduction approach to several publicly available cancer single nucleotide polymorphism (SNP) array datasets and evaluated its performance in terms of its multiclass predictive classification accuracy over different cancer subtypes, its speedup in execution as well as its scalability with respect to sample size and array resolution. Feature Set Reduction (FSR) was able to reduce the dimensions of an SNP array dataset by more than two orders of magnitude while achieving at least equal, and in most cases superior predictive classification performance over that achieved on features selected by existing feature selection methods alone. An examination of the biological relevance of frequently selected features from FSR-reduced feature sets revealed strong enrichment in association with cancer. FSR was implemented in MATLAB R2010b and is available at http://ww2.cs.mu.oz.au/~gwong/FSR.
Kelly, Hilary; Dupras, Andrée Ann; Belanger, Sebastien; Devenish, John
2014-01-01
The lack of a sufficiently discriminatory molecular subtyping tool for Salmonella enterica serovar Enteritidis has hindered source attribution efforts and impeded regulatory actions required to disrupt its food-borne transmission. The underlying biological reason for the ineffectiveness of current molecular subtyping tools such as pulsed-field gel electrophoresis (PFGE) and phage typing appears to be related to the high degree of clonality of S. Enteritidis. By interrogating the organism's genome, we previously identified single nucleotide polymorphisms (SNP) distributed throughout the chromosome and have designed a highly discriminatory PCR-based SNP typing test based on 60 polymorphic loci. The application of the SNP-PCR method to DNA samples from S. Enteritidis strains (n = 55) obtained from a variety of sources has led to the differentiation and clustering of the S. Enteritidis isolates into 12 clades made up of 2 to 9 isolates per clade. Significantly, the SNP-PCR assay was able to further differentiate predominant PFGE types (e.g., XAI.0003) and phage types (e.g., phage type 8) into smaller subsets. The SNP-PCR subtyping test proved to be an accurate, precise, and quantitative tool for evaluating the relationships among the S. Enteritidis isolates tested in this study and should prove useful for clustering related S. Enteritidis isolates involved in outbreaks. PMID:25297333
Leblanc, N; Cortey, M; Fernandez Pinero, J; Gallardo, C; Masembe, C; Okurut, A R; Heath, L; van Heerden, J; Sánchez-Vizcaino, J M; Ståhl, K; Belák, S
2013-08-01
African swine fever virus (ASFV) causes one of the most dreaded transboundary animal diseases (TADs) in Suidae. African swine fever (ASF) often causes high rates of morbidity and mortality, which can reach 100% in domestic swine. To date, serological diagnosis has the drawback of not being able to differentiate variants of this virus. Previous studies have identified the 22 genotypes based on sequence variation in the C-terminal region of the p72 gene, which has become the standard for categorizing ASFVs. This article describes a genotyping assay developed using a segment of PCR-amplified genomic DNA of approximately 450 bp, which encompasses the C-terminal end of the p72 gene. Complementary paired DNA probes of 15 or 17 bp in length, which are identical except for a single nucleotide polymorphism (SNP) in the central position, were designed to either individually or in combination differentiate between the 22 genotypes. The assay was developed using xMAP technology; probes were covalently linked to microspheres, hybridized to PCR product, labelled with a reporter and read in the Luminex 200 analyzer. Characterization of the sample was performed by comparing fluorescence of the paired SNP probes, that is, the probe with higher fluorescence in a complementary pair identified the SNP that a particular sample possessed. In the final assay, a total of 52 probes were employed, 24 SNP pairs and 4 for general detection. One or more samples from each of the 22 genotypes were tested. The assay was able to detect and distinguish all 22 genotypes. This novel assay provides a powerful novel tool for the simultaneous rapid diagnosis and genotypic differentiation of ASF. © 2012 Blackwell Verlag GmbH.
SEURAT: Visual analytics for the integrated analysis of microarray data
2010-01-01
Background In translational cancer research, gene expression data is collected together with clinical data and genomic data arising from other chip based high throughput technologies. Software tools for the joint analysis of such high dimensional data sets together with clinical data are required. Results We have developed an open source software tool which provides interactive visualization capability for the integrated analysis of high-dimensional gene expression data together with associated clinical data, array CGH data and SNP array data. The different data types are organized by a comprehensive data manager. Interactive tools are provided for all graphics: heatmaps, dendrograms, barcharts, histograms, eventcharts and a chromosome browser, which displays genetic variations along the genome. All graphics are dynamic and fully linked so that any object selected in a graphic will be highlighted in all other graphics. For exploratory data analysis the software provides unsupervised data analytics like clustering, seriation algorithms and biclustering algorithms. Conclusions The SEURAT software meets the growing needs of researchers to perform joint analysis of gene expression, genomical and clinical data. PMID:20525257
EnsMart: A Generic System for Fast and Flexible Access to Biological Data
Kasprzyk, Arek; Keefe, Damian; Smedley, Damian; London, Darin; Spooner, William; Melsopp, Craig; Hammond, Martin; Rocca-Serra, Philippe; Cox, Tony; Birney, Ewan
2004-01-01
The EnsMart system (www.ensembl.org/EnsMart) provides a generic data warehousing solution for fast and flexible querying of large biological data sets and integration with third-party data and tools. The system consists of a query-optimized database and interactive, user-friendly interfaces. EnsMart has been applied to Ensembl, where it extends its genomic browser capabilities, facilitating rapid retrieval of customized data sets. A wide variety of complex queries, on various types of annotations, for numerous species are supported. These can be applied to many research problems, ranging from SNP selection for candidate gene screening, through cross-species evolutionary comparisons, to microarray annotation. Users can group and refine biological data according to many criteria, including cross-species analyses, disease links, sequence variations, and expression patterns. Both tabulated list data and biological sequence output can be generated dynamically, in HTML, text, Microsoft Excel, and compressed formats. A wide range of sequence types, such as cDNA, peptides, coding regions, UTRs, and exons, with additional upstream and downstream regions, can be retrieved. The EnsMart database can be accessed via a public Web site, or through a Java application suite. Both implementations and the database are freely available for local installation, and can be extended or adapted to `non-Ensembl' data sets. PMID:14707178
Jiang, Haojun; Xie, Yifan; Li, Xuchao; Ge, Huijuan; Deng, Yongqiang; Mu, Haofang; Feng, Xiaoli; Yin, Lu; Du, Zhou; Chen, Fang; He, Nongyue
2016-01-01
Short tandem repeats (STRs) and single nucleotide polymorphisms (SNPs) have been already used to perform noninvasive prenatal paternity testing from maternal plasma DNA. The frequently used technologies were PCR followed by capillary electrophoresis and SNP typing array, respectively. Here, we developed a noninvasive prenatal paternity testing (NIPAT) based on SNP typing with maternal plasma DNA sequencing. We evaluated the influence factors (minor allele frequency (MAF), the number of total SNP, fetal fraction and effective sequencing depth) and designed three different selective SNP panels in order to verify the performance in clinical cases. Combining targeted deep sequencing of selective SNP and informative bioinformatics pipeline, we calculated the combined paternity index (CPI) of 17 cases to determine paternity. Sequencing-based NIPAT results fully agreed with invasive prenatal paternity test using STR multiplex system. Our study here proved that the maternal plasma DNA sequencing-based technology is feasible and accurate in determining paternity, which may provide an alternative in forensic application in the future.
Mehrian-Shai, Ruty; Yalon, Michal; Moshe, Itai; Barshack, Iris; Nass, Dvorah; Jacob, Jasmine; Dor, Chen; Reichardt, Juergen K V; Constantini, Shlomi; Toren, Amos
2016-01-14
The genetic mechanisms underlying hemangioblastoma development are still largely unknown. We used high-resolution single nucleotide polymorphism microarrays and droplet digital PCR analysis to detect copy number variations (CNVs) in total of 45 hemangioblastoma tumors. We identified 94 CNVs with a median of 18 CNVs per sample. The most frequently gained regions were on chromosomes 1 (p36.32) and 7 (p11.2). These regions contain the EGFR and PRDM16 genes. Recurrent losses were located at chromosome 12 (q24.13), which includes the gene PTPN11. Our findings provide the first high-resolution genome-wide view of chromosomal changes in hemangioblastoma and identify 23 candidate genes: EGFR, PRDM16, PTPN11, HOXD11, HOXD13, FLT3, PTCH, FGFR1, FOXP1, GPC3, HOXC13, HOXC11, MKL1, CHEK2, IRF4, GPHN, IKZF1, RB1, HOXA9, and micro RNA, such as hsa-mir-196a-2 for hemangioblastoma pathogenesis. Furthermore, our data implicate that cell proliferation and angiogenesis promoting pathways may be involved in the molecular pathogenesis of hemangioblastoma.
Constitutional Chromoanagenesis of Distal 13q in a Young Adult with Recurrent Strokes.
Burnside, Rachel D; Harris, April; Speyer, Darrow; Burgin, W Scott; Rose, David Z; Sanchez-Valle, Amarilis
2016-01-01
Constitutional chromoanagenesis events, which include chromoanasynthesis and chromothripsis and result in highly complex rearrangements, have been reported for only a few individuals. While rare, these phenomena have likely been underestimated in a constitutional setting as technologies that can accurately detect such complexity are relatively new to the mature field of clinical cytogenetics. G-banding is not likely to accurately identify chromoanasynthesis or chromothripsis, since the banding patterns of chromosomes are likely to be misidentified or oversimplified due to a much lower resolution. We describe a patient who was initially referred for cytogenetic testing as a child for speech delay. As a young adult, he was referred again for recurrent strokes. Chromosome analysis was performed, and the rearrangement resembled a simple duplication of 13q32q34. However, SNP microarray analysis showed a complex pattern of copy number gains and a loss consistent with chromoanasynthesis involving distal 13q (13q32.1q34). This report emphasizes the value of performing microarray analysis for individuals with abnormal or complex chromosome rearrangements. © 2016 S. Karger AG, Basel.
SNPConvert: SNP Array Standardization and Integration in Livestock Species.
Nicolazzi, Ezequiel Luis; Marras, Gabriele; Stella, Alessandra
2016-06-09
One of the main advantages of single nucleotide polymorphism (SNP) array technology is providing genotype calls for a specific number of SNP markers at a relatively low cost. Since its first application in animal genetics, the number of available SNP arrays for each species has been constantly increasing. However, conversely to that observed in whole genome sequence data analysis, SNP array data does not have a common set of file formats or coding conventions for allele calling. Therefore, the standardization and integration of SNP array data from multiple sources have become an obstacle, especially for users with basic or no programming skills. Here, we describe the difficulties related to handling SNP array data, focusing on file formats, SNP allele coding, and mapping. We also present SNPConvert suite, a multi-platform, open-source, and user-friendly set of tools to overcome these issues. This tool, which can be integrated with open-source and open-access tools already available, is a first step towards an integrated system to standardize and integrate any type of raw SNP array data. The tool is available at: https://github. com/nicolazzie/SNPConvert.git.
Mycobacterium leprae in Colombia described by SNP7614 in gyrA, two minisatellites and geography.
Cardona-Castro, Nora; Beltrán-Alzate, Juan Camilo; Romero-Montoya, Irma Marcela; Li, Wei; Brennan, Patrick J; Vissa, Varalakshmi
2013-03-01
New cases of leprosy are still being detected in Colombia after the country declared achievement of the WHO defined 'elimination' status. To study the ecology of leprosy in endemic regions, a combination of geographic and molecular tools were applied for a group of 201 multibacillary patients including six multi-case families from eleven departments. The location (latitude and longitude) of patient residences were mapped. Slit skin smears and/or skin biopsies were collected and DNA was extracted. Standard agarose gel electrophoresis following a multiplex PCR-was developed for rapid and inexpensive strain typing of Mycobacterium leprae based on copy numbers of two VNTR minisatellite loci 27-5 and 12-5. A SNP (C/T) in gyrA (SNP7614) was mapped by introducing a novel PCR-RFLP into an ongoing drug resistance surveillance effort. Multiple genotypes were detected combining the three molecular markers. The two frequent genotypes in Colombia were SNP7614(C)/27-5(5)/12-5(4) [C54] predominantly distributed in the Atlantic departments and SNP7614 (T)/27-5(4)/12-5(5) [T45] associated with the Andean departments. A novel genotype SNP7614 (C)/27-5(6)/12-5(4) [C64] was detected in cities along the Magdalena river which separates the Andean from Atlantic departments; a subset was further characterized showing association with a rare allele of minisatellite 23-3 and the SNP type 1 of M. leprae. The genotypes within intra-family cases were conserved. Overall, this is the first large scale study that utilized simple and rapid assay formats for identification of major strain types and their distribution in Colombia. It provides the framework for further strain type discrimination and geographic information systems as tools for tracing transmission of leprosy. Copyright © 2012 Elsevier B.V. All rights reserved.
Best practices for hybridization design in two-colour microarray analysis.
Knapen, Dries; Vergauwen, Lucia; Laukens, Kris; Blust, Ronny
2009-07-01
Two-colour microarrays are a popular platform of choice in gene expression studies. Because two different samples are hybridized on a single microarray, and several microarrays are usually needed in a given experiment, there are many possible ways to combine samples on different microarrays. The actual combination employed is commonly referred to as the 'hybridization design'. Different types of hybridization designs have been developed, all aimed at optimizing the experimental setup for the detection of differentially expressed genes while coping with technical noise. Here, we first provide an overview of the different classes of hybridization designs, discussing their advantages and limitations, and then we illustrate the current trends in the use of different hybridization design types in contemporary research.
2011-01-01
Background Identifying the functional importance of the millions of single nucleotide polymorphisms (SNPs) in the human genome is a difficult challenge. Therefore, a reverse strategy, which identifies functionally important SNPs by virtue of the bimodal abundance across the human population of the SNP-related mRNAs will be useful. Those mRNA transcripts that are expressed at two distinct abundances in proportion to SNP allele frequency may warrant further study. Matrix metalloproteinase 1 (MMP1) is important in both normal development and in numerous pathologies. Although much research has been conducted to investigate the expression of MMP1 in many different cell types and conditions, the regulation of its expression is still not fully understood. Results In this study, we used a novel but straightforward method based on agglomerative hierarchical clustering to identify bimodally expressed transcripts in human umbilical vein endothelial cell (HUVEC) microarray data from 15 individuals. We found that MMP1 mRNA abundance was bimodally distributed in un-treated HUVECs and showed a bimodal response to inflammatory mediator treatment. RT-PCR and MMP1 activity assays confirmed the bimodal regulation and DNA sequencing of 69 individuals identified an MMP1 gene promoter polymorphism that segregated precisely with the MMP1 bimodal expression. Chromatin immunoprecipation (ChIP) experiments indicated that the transcription factors (TFs) ETS1, ETS2 and GATA3, bind to the MMP1 promoter in the region of this polymorphism and may contribute to the bimodal expression. Conclusions We describe a simple method to identify putative bimodally expressed RNAs from transcriptome data that is effective yet easy for non-statisticans to understand and use. This method identified bimodal endothelial cell expression of MMP1, which appears to be biologically significant with implications for inflammatory disease. (271 Words) PMID:21244711
Zhang, Ya-Ran; Gui, Lin-Sheng; Li, Yao-Kun; Jiang, Bi-Jie; Wang, Hong-Cheng; Zhang, Ying-Ying; Zan, Lin-Sen
2015-07-27
Smoothened (Smo)-mediated Hedgehog (Hh) signaling pathway governs the patterning, morphogenesis and growth of many different regions within animal body plans. This study evaluated the effects of genetic variations of the bovine SMO gene on economically important body size traits in Chinese Qinchuan cattle. Altogether, eight single nucleotide polymorphisms (SNPs: 1-8) were identified and genotyped via direct sequencing covering most of the coding region and 3'UTR of the bovine SMO gene. Both the p.698Ser.>Ser. synonymous mutation resulted from SNP1 and the p.700Ser.>Pro. non-synonymous mutation caused by SNP2 mapped to the intracellular C-terminal tail of bovine Smo protein; the other six SNPs were non-coding variants located in the 3'UTR. The linkage disequilibrium was analyzed, and five haplotypes were discovered in 520 Qinchuan cattle. Association analyses showed that SNP2, SNP3/5, SNP4 and SNP6/7 were significantly associated with some body size traits (p < 0.05) except SNP1/8 (p > 0.05). Meanwhile, cattle with wild-type combined haplotype Hap1/Hap1 had significantly (p < 0.05) greater body length than those with Hap2/Hap2. Our results indicate that variations in the SMO gene could affect body size traits of Qinchuan cattle, and the wild-type haplotype Hap1 together with the wild-type alleles of these detected SNPs in the SMO gene could be used to breed cattle with superior body size traits. Therefore, our results could be helpful for marker-assisted selection in beef cattle breeding programs.
Zhang, Ya-Ran; Gui, Lin-Sheng; Li, Yao-Kun; Jiang, Bi-Jie; Wang, Hong-Cheng; Zhang, Ying-Ying; Zan, Lin-Sen
2015-01-01
Smoothened (Smo)-mediated Hedgehog (Hh) signaling pathway governs the patterning, morphogenesis and growth of many different regions within animal body plans. This study evaluated the effects of genetic variations of the bovine SMO gene on economically important body size traits in Chinese Qinchuan cattle. Altogether, eight single nucleotide polymorphisms (SNPs: 1–8) were identified and genotyped via direct sequencing covering most of the coding region and 3ʹUTR of the bovine SMO gene. Both the p.698Ser.>Ser. synonymous mutation resulted from SNP1 and the p.700Ser.>Pro. non-synonymous mutation caused by SNP2 mapped to the intracellular C-terminal tail of bovine Smo protein; the other six SNPs were non-coding variants located in the 3ʹUTR. The linkage disequilibrium was analyzed, and five haplotypes were discovered in 520 Qinchuan cattle. Association analyses showed that SNP2, SNP3/5, SNP4 and SNP6/7 were significantly associated with some body size traits (p < 0.05) except SNP1/8 (p > 0.05). Meanwhile, cattle with wild-type combined haplotype Hap1/Hap1 had significantly (p < 0.05) greater body length than those with Hap2/Hap2. Our results indicate that variations in the SMO gene could affect body size traits of Qinchuan cattle, and the wild-type haplotype Hap1 together with the wild-type alleles of these detected SNPs in the SMO gene could be used to breed cattle with superior body size traits. Therefore, our results could be helpful for marker-assisted selection in beef cattle breeding programs. PMID:26225956
Zhao, Linlu; Bracken, Michael B.; DeWan, Andrew T.
2013-01-01
Summary A genome-wide association study was undertaken to identify maternal single nucleotide polymorphisms (SNPs) and copy-number variants (CNVs) associated with preeclampsia. Case-control analysis was performed on 1070 Afro-Caribbean (n=21 cases and 1049 controls) and 723 Hispanic (n=62 cases and 661 controls) mothers and 1257 mothers of European ancestry (n=50 cases and 1207 controls) from the Hyperglycemia and Adverse Pregnancy Outcome (HAPO) study. European ancestry subjects were genotyped on Illumina Human610-Quad and Afro-Caribbean and Hispanic subjects were genotyped on Illumina Human1M-Duo BeadChip microarrays. Genome-wide SNP data were analyzed using PLINK. CNVs were called using three detection algorithms (GNOSIS, PennCNV, and QuantiSNP), merged using CNVision, and then screened using stringent criteria. SNP and CNV findings were compared to those of the Study of Pregnancy Hypertension in Iowa (SOPHIA), an independent preeclampsia case-control dataset of Caucasian mothers (n=177 cases and 116 controls). A list of top SNPs were identified for each of the HAPO ethnic groups, but none reached Bonferroni-corrected significance. Novel candidate CNVs showing enrichment among preeclampsia cases were also identified in each of the three ethnic groups. Several variants were suggestively replicated in SOPHIA. The discovered SNPs and copy-number variable regions present interesting candidate genetic variants for preeclampsia that warrant further replication and investigation. PMID:23551011
[C677T-SNP of methylenetetrahydrofolate reductase gene and breast cancer in Mexican women].
Calderón-Garcidueñas, Ana Laura; Cerda-Flores, Ricardo Martín; Castruita-Ávila, Ana Lilia; González-Guerrero, Juan Francisco; Barrera-Saldaña, Hugo Alberto
2017-01-01
Low-penetrance susceptibility genes such as 5,10-methylenetetrahydrofolate reductase gene (MTHFR) have been considered in the progression of breast cancer (BC). Cancer is a result of genetic, environmental and epigenetic interactions; therefore, these genes should be studied in environmental context, because the results can vary between populations and even within the same country. The objective was to analyze the allelic and genotypic frequencies of the MTHFR C667T SNP in Mexican Mestizo patients with BC and controls from Northeastern Mexico. 243 patients and 118 healthy women were studied. The analysis of the polymorphism was performed with a DNA microarray. Once the frequency of the polymorphism was obtained, Hardy-Weinberg equilibrium test was carried out for the genotypes. Chi square test was used to compare the distribution of frequencies. The allele frequency in patients was: C = 0.5406; T = 0.4594 and in controls C = 0.5678, T = 0.4322. Genotype in BC patients was: C / C = 29.9%, C / T = 48.3% and T / T = 21.8. The distribution in controls was: C / C = 31.4%, C / T = 50.8%, T / T = 17.8% (chi squared 0.77, p = 0.6801). Northeastern Mexican women in this study showed no association between MTFHR C667T SNP and the risk of BC. It seems that the contribution of this polymorphism to BC in Mexico varies depending on various factors, both genetic and environmental.
Galehdari, Hamid; Saki, Najmaldin; Mohammadi-Asl, Javad; Rahim, Fakher
2013-01-01
Crigler-Najjar syndrome (CNS) type I and type II are usually inherited as autosomal recessive conditions that result from mutations in the UGT1A1 gene. The main objective of the present review is to summarize results of all available evidence on the accuracy of SNP-based pathogenicity detection tools compared to published clinical result for the prediction of in nsSNPs that leads to disease using prediction performance method. A comprehensive search was performed to find all mutations related to CNS. Database searches included dbSNP, SNPdbe, HGMD, Swissvar, ensemble, and OMIM. All the mutation related to CNS was extracted. The pathogenicity prediction was done using SNP-based pathogenicity detection tools include SIFT, PHD-SNP, PolyPhen2, fathmm, Provean, and Mutpred. Overall, 59 different SNPs related to missense mutations in the UGT1A1 gene, were reviewed. Comparing the diagnostic OR, PolyPhen2 and Mutpred have the highest detection 4.983 (95% CI: 1.24 - 20.02) in both, following by SIFT (diagnostic OR: 3.25, 95% CI: 1.07 - 9.83). The highest MCC of SNP-based pathogenicity detection tools, was belong to SIFT (34.19%) followed by Provean, PolyPhen2, and Mutpred (29.99%, 29.89%, and 29.89%, respectively). Hence the highest SNP-based pathogenicity detection tools ACC, was fit to SIFT (62.71%) followed by PolyPhen2, and Mutpred (61.02%, in both). Our results suggest that some of the well-established SNP-based pathogenicity detection tools can appropriately reflect the role of a disease-associated SNP in both local and global structures.
Role of Telokin in Regulating Murine Gastric Fundus Smooth Muscle Tension
An, Changlong; Bhetwal, Bhupal P.; Sanders, Kenton M.; Somlyo, Avril V.; Perrino, Brian A.
2015-01-01
Telokin phosphorylation by cyclic GMP-dependent protein kinase facilitates smooth muscle relaxation. In this study we examined the relaxation of gastric fundus smooth muscles from basal tone, or pre-contracted with KCl or carbachol (CCh), and the phosphorylation of telokin S13, myosin light chain (MLC) S19, MYPT1 T853, T696, and CPI-17 T38 in response to 8-Bromo-cGMP, the NO donor sodium nitroprusside (SNP), or nitrergic neurotransmission. We compared MLC phosphorylation and the contraction and relaxation responses of gastric fundus smooth muscles from telokin-/- mice and their wild-type littermates to KCl or CCh, and 8-Bromo-cGMP, SNP, or nitrergic neurotransmission, respectively. We compared the relaxation responses and telokin phosphorylation of gastric fundus smooth muscles from wild-type mice and W/W V mice which lack ICC-IM, to 8-Bromo-cGMP, SNP, or nitrergic neurotransmission. We found that telokin S13 is basally phosphorylated and that 8-Bromo-cGMP and SNP increased basal telokin phosphorylation. In muscles pre-contracted with KCl or CCh, 8-Bromo-cGMP and SNP had no effect on CPI-17 or MYPT1 phosphorylation, but increased telokin phosphorylation and reduced MLC phosphorylation. In telokin-/- gastric fundus smooth muscles, basal tone and constitutive MLC S19 phosphorylation were increased. Pre-contracted telokin-/- gastric fundus smooth muscles have increased contractile responses to KCl, CCh, or cholinergic neurotransmission and reduced relaxation to 8-Bromo-cGMP, SNP, and nitrergic neurotransmission. However, basal telokin phosphorylation was not increased when muscles were stimulated with lower concentrations of SNP or when the muscles were stimulated by nitrergic neurotransmission. SNP, but not nitrergic neurotransmission, increased telokin Ser13 phosphorylation in both wild-type and W/W V gastric fundus smooth muscles. Our findings indicate that telokin may play a role in attenuating constitutive MLC phosphorylation and provide an additional mechanism to augment gastric fundus mechanical responses to inhibitory neurotransmission. PMID:26258553
Development of a Multiplex Single Base Extension Assay for Mitochondrial DNA Haplogroup Typing
Nelson, Tahnee M.; Just, Rebecca S.; Loreille, Odile; Schanfield, Moses S.; Podini, Daniele
2007-01-01
Aim To provide a screening tool to reduce time and sample consumption when attempting mtDNA haplogroup typing. Methods A single base primer extension assay was developed to enable typing, in a single reaction, of twelve mtDNA haplogroup specific polymorphisms. For validation purposes a total of 147 samples were tested including 73 samples successfully haplogroup typed using mtDNA control region (CR) sequence data, 21 samples inconclusively haplogroup typed by CR data, 20 samples previously haplogroup typed using restriction fragment length polymorphism (RFLP) analysis, and 31 samples of known ancestral origin without previous haplogroup typing. Additionally, two highly degraded human bones embalmed and buried in the early 1950s were analyzed using the single nucleotide polymorphisms (SNP) multiplex. Results When the SNP multiplex was used to type the 96 previously CR sequenced specimens, an increase in haplogroup or macrohaplogroup assignment relative to conventional CR sequence analysis was observed. The single base extension assay was also successfully used to assign a haplogroup to decades-old, embalmed skeletal remains dating to World War II. Conclusion The SNP multiplex was successfully used to obtain haplogroup status of highly degraded human bones, and demonstrated the ability to eliminate possible contributors. The SNP multiplex provides a low-cost, high throughput method for typing of mtDNA haplogroups A, B, C, D, E, F, G, H, L1/L2, L3, M, and N that could be useful for screening purposes for human identification efforts and anthropological studies. PMID:17696300
Lomonaco, Sara; Furumoto, Emily J; Loquasto, Joseph R; Morra, Patrizia; Grassi, Ausilia; Roberts, Robert F
2015-02-01
Identification at the genus, species, and strain levels is desirable when a probiotic microorganism is added to foods. Strains of Bifidobacterium animalis ssp. lactis (BAL) are commonly used worldwide in dairy products supplemented with probiotic strains. However, strain discrimination is difficult because of the high degree of genome identity (99.975%) between different genomes of this subspecies. Typing of monomorphic species can be carried out efficiently by targeting informative single nucleotide polymorphisms (SNP). Findings from a previous study analyzing both reference and commercial strains of BAL identified SNP that could be used to discriminate common strains into 8 groups. This paper describes development of a minisequencing assay based on the primer extension reaction (PER) targeting multiple SNP that can allow strain differentiation of BAL. Based on previous data, 6 informative SNP were selected for further testing, and a multiplex preliminary PCR was optimized to amplify the DNA regions containing the selected SNP. Extension primers (EP) annealing immediately adjacent to the selected SNP were developed and tested in simplex and multiplex PER to evaluate their performance. Twenty-five strains belonging to 9 distinct genomic clusters of B. animalis ssp. lactis were selected and analyzed using the developed minisequencing assay, simultaneously targeting the 6 selected SNP. Fragment analysis was subsequently carried out in duplicate and demonstrated that the assay yielded 8 specific profiles separating the most commonly used commercial strains. This novel multiplex PER approach provides a simple, rapid, flexible SNP-based subtyping method for proper characterization and identification of commercial probiotic strains of BAL from fermented dairy products. To assess the usefulness of this method, DNA was extracted from yogurt manufactured with and without the addition of B. animalis ssp. lactis BB-12. Extracted DNA was then subjected to the minisequencing protocol, resulting in a SNP profile matching the profile for the strain BB-12. Copyright © 2015 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
High-density fiber optic biosensor arrays
NASA Astrophysics Data System (ADS)
Epstein, Jason R.; Walt, David R.
2002-02-01
Novel approaches are required to coordinate the immense amounts of information derived from diverse genomes. This concept has influenced the expanded role of high-throughput DNA detection and analysis in the biological sciences. A high-density fiber optic DNA biosensor was developed consisting of oligonucleotide-functionalized, 3.1 mm diameter microspheres deposited into the etched wells on the distal face of a 500 micrometers imaging fiber bundle. Imaging fiber bundles containing thousands of optical fibers, each associated with a unique oligonucleotide probe sequence, were the foundation for an optically connected, individually addressable DNA detection platform. Different oligonucleotide-functionalized microspheres were combined in a stock solution, and randomly dispersed into the etched wells. Microsphere positions were registered from optical dyes incorporated onto the microspheres. The distribution process provided an inherent redundancy that increases the signal-to-noise ratio as the square root of the number of sensors examined. The representative amount of each probe-type in the array was dependent on their initial stock solution concentration, and as other sequences of interest arise, new microsphere elements can be added to arrays without altering the existing detection capabilities. The oligonucleotide probe sequences hybridize to fluorescently-labeled, complementary DNA target solutions. Fiber optic DNA microarray research has included DNA-protein interaction profiles, microbial strain differentiation, non-labeled target interrogation with molecular beacons, and single cell-based assays. This biosensor array is proficient in DNA detection linked to specific disease states, single nucleotide polymorphism (SNP's) discrimination, and gene expression analysis. This array platform permits multiple detection formats, provides smaller feature sizes, and enables sensor design flexibility. High-density fiber optic microarray biosensors provide a fast, reversible format with the detection limit of a few hundred molecules.
Liu, Yanfang; Liao, Huidan; Liu, Ying; Guo, Juanjuan; Sun, Yi; Fu, Xiaoliang; Xiao, Ding; Cai, Jifeng; Lan, Lingmei; Xie, Pingli; Zha, Lagabaiyila
2017-04-01
Nonbinary single-nucleotide polymorphisms (SNPs) are potential forensic genetic markers because their discrimination power is greater than that of normal binary SNPs, and that they can detect highly degraded samples. We previously developed a nonbinary SNP multiplex typing assay. In this study, we selected additional 20 nonbinary SNPs from the NCBI SNP database and verified them through pyrosequencing. These 20 nonbinary SNPs were analyzed using the fluorescent-labeled SNaPshot multiplex SNP typing method. The allele frequencies and genetic parameters of these 20 nonbinary SNPs were determined among 314 unrelated individuals from Han populations from China. The total power of discrimination was 0.9999999999994, and the cumulative probability of exclusion was 0.9986. Moreover, the result of the combination of this 20 nonbinary SNP assay with the 20 nonbinary SNP assay we previously developed demonstrated that the cumulative probability of exclusion of the 40 nonbinary SNPs was 0.999991 and that no significant linkage disequilibrium was observed in all 40 nonbinary SNPs. Thus, we concluded that this new system consisting of new 20 nonbinary SNPs could provide highly informative polymorphic data which would be further used in forensic application and would serve as a potentially valuable supplement to forensic DNA analysis. © 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Adiponectin and resistin gene polymorphisms in association with their respective adipokine levels.
Lau, Cia-Hin; Muniandy, Sekaran
2011-05-01
Single nucleotide polymorphisms (SNPs) at the adiponectin and resistin loci are strongly associated with hypoadiponectinemia and hyperresistinemia, which may eventually increase risk of insulin resistance, type 2 diabetes (T2DM), metabolic syndrome (MS), and cardiovascular disease. Real-time PCR was used to genotype SNPs of the adiponectin (SNP+45T>G, SNP+276G>T, SNP+639T>C, and SNP+1212A>G) and resistin (SNP-420C>G and SNP+299G>A) genes in 809 Malaysian men (208 controls, 174 MS without T2DM, 171 T2DM without MS, 256 T2DM with MS) whose ages ranged between 40 and 70 years old. The genotyping results for each SNP marker was verified by sequencing. The anthropometric clinical and metabolic parameters of subjects were recorded. None of these SNPs at the adiponectin and resistin loci were associated with T2DM and MS susceptibility in Malaysian men. SNP+45T>G, SNP+276G>T, and SNP+639T>C of the adiponectin gene did not influence circulating levels of adiponectin. However, the G-allele of SNP+1212A>G at the adiponectin locus was marginally associated (P= 0.0227) with reduced circulating adiponectin levels. SNP-420C>G (df = 2; F= 16.026; P= 1.50×10(-7) ) and SNP+299G>A (df = 2; F= 22.944; P= 2.04×10(-10) ) of the resistin gene were strongly associated with serum resistin levels. Thus, SNP-420C>G and SNP+299G>A of the resistin gene are strongly associated with the risk of hyperresistinemia in Malaysian men. © 2011 The Authors Annals of Human Genetics © 2011 Blackwell Publishing Ltd/University College London.
Hardy, M Y; Ontiveros, N; Varney, M D; Tye-Din, J A
2018-04-01
A hallmark of coeliac disease (CD) is the exceptionally strong genetic association with HLA-DQ2.5, DQ8, and DQ2.2. HLA typing provides information on CD risk important to both clinicians and researchers. A method that enables simple and fast detection of all CD risk genotypes is particularly desirable for the study of large populations. Single nucleotide polymorphism (SNP)-based HLA typing can detect the CD risk genotypes by detecting a combination of six SNPs but this approach can struggle to resolve HLA-DQ2.2, seen in 4% of European CD patients, because of the low resolution of one negatively predicting SNP. We sought to optimise SNP-based HLA typing by harnessing the additional resolution of digital droplet PCR to resolve HLA-DQ2.2. Here we test this two-step approach in an unselected sample of Mexican DNA and compare its accuracy to DNA typed using traditional exon detection. The addition of digital droplet PCR for samples requiring negative prediction of HLA-DQ2.2 enabled HLA-DQ2.2 to be accurately typed. This technique is a simple addition to a SNP-based typing strategy and enables comprehensive definition of all at-risk HLA genotypes in CD in a timely and cost-effective manner. © 2018 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Martínez-Robles, Ángel Manuel; Loyola-Rodríguez, Juan Pablo; Zavala-Alonso, Norma Verónica; Martinez-Martinez, Rita Elizabeth; Ruiz, Facundo; Lara-Castro, René Homero; Donohué-Cornejo, Alejandro; Reyes-López, Simón Yobanny; Espinosa-Cristóbal, León Francisco
2016-01-01
(1) Background: Streptococcus mutans (S. mutans) is the principal pathogen involved in the formation of dental caries. Other systemic diseases have also been associated with specific S. mutans serotypes (c, e, f, and k). Silver nanoparticles (SNP) have been demonstrated to have good antibacterial effects against S. mutans; therefore, limited studies have evaluated the antimicrobial activity of biofunctionalized SNP on S. mutans serotypes. The purpose of this work was to prepare and characterize coated SNP using two different organic components and to evaluate the antimicrobial activity of SNP in clinical isolates of S. mutans strains and serotypes; (2) Methods: SNP with bovine serum albumin (BSA) or chitosan (CS) coatings were prepared and the physical, chemical and microbiological properties of SNP were evaluated; (3) Results: Both types of coated SNP showed antimicrobial activity against S. mutans bacteria and serotypes. Better inhibition was associated with smaller particles and BSA coatings; however, no significant differences were found between the different serotypes, indicating a similar sensitivity to the coated SNP; (4) Conclusion: This study concludes that BSA and CS coated SNP had good antimicrobial activity against S. mutans strains and the four serotypes, and this study suggest the widespread use of SNP as an antimicrobial agent for the inhibition of S. mutans bacteria. PMID:28335264
Martínez-Robles, Ángel Manuel; Loyola-Rodríguez, Juan Pablo; Zavala-Alonso, Norma Verónica; Martinez-Martinez, Rita Elizabeth; Ruiz, Facundo; Lara-Castro, René Homero; Donohué-Cornejo, Alejandro; Reyes-López, Simón Yobanny; Espinosa-Cristóbal, León Francisco
2016-07-22
(1) Background: Streptococcus mutans ( S. mutans ) is the principal pathogen involved in the formation of dental caries. Other systemic diseases have also been associated with specific S. mutans serotypes ( c , e , f , and k ). Silver nanoparticles (SNP) have been demonstrated to have good antibacterial effects against S. mutans ; therefore, limited studies have evaluated the antimicrobial activity of biofunctionalized SNP on S. mutans serotypes. The purpose of this work was to prepare and characterize coated SNP using two different organic components and to evaluate the antimicrobial activity of SNP in clinical isolates of S. mutans strains and serotypes; (2) Methods: SNP with bovine serum albumin (BSA) or chitosan (CS) coatings were prepared and the physical, chemical and microbiological properties of SNP were evaluated; (3) Results: Both types of coated SNP showed antimicrobial activity against S. mutans bacteria and serotypes. Better inhibition was associated with smaller particles and BSA coatings; however, no significant differences were found between the different serotypes, indicating a similar sensitivity to the coated SNP; (4) Conclusion: This study concludes that BSA and CS coated SNP had good antimicrobial activity against S. mutans strains and the four serotypes, and this study suggest the widespread use of SNP as an antimicrobial agent for the inhibition of S. mutans bacteria.
LD2SNPing: linkage disequilibrium plotter and RFLP enzyme mining for tag SNPs
Chang, Hsueh-Wei; Chuang, Li-Yeh; Chang, Yan-Jhu; Cheng, Yu-Huei; Hung, Yu-Chen; Chen, Hsiang-Chi; Yang, Cheng-Hong
2009-01-01
Background Linkage disequilibrium (LD) mapping is commonly used to evaluate markers for genome-wide association studies. Most types of LD software focus strictly on LD analysis and visualization, but lack supporting services for genotyping. Results We developed a freeware called LD2SNPing, which provides a complete package of mining tools for genotyping and LD analysis environments. The software provides SNP ID- and gene-centric online retrievals for SNP information and tag SNP selection from dbSNP/NCBI and HapMap, respectively. Restriction fragment length polymorphism (RFLP) enzyme information for SNP genotype is available to all SNP IDs and tag SNPs. Single and multiple SNP inputs are possible in order to perform LD analysis by online retrieval from HapMap and NCBI. An LD statistics section provides D, D', r2, δQ, ρ, and the P values of the Hardy-Weinberg Equilibrium for each SNP marker, and Chi-square and likelihood-ratio tests for the pair-wise association of two SNPs in LD calculation. Finally, 2D and 3D plots, as well as plain-text output of the results, can be selected. Conclusion LD2SNPing thus provides a novel visualization environment for multiple SNP input, which facilitates SNP association studies. The software, user manual, and tutorial are freely available at . PMID:19500380
Guo, Liyuan; Wang, Jing
2018-01-04
Here, we present the updated rSNPBase 3.0 database (http://rsnp3.psych.ac.cn), which provides human SNP-related regulatory elements, element-gene pairs and SNP-based regulatory networks. This database is the updated version of the SNP regulatory annotation database rSNPBase and rVarBase. In comparison to the last two versions, there are both structural and data adjustments in rSNPBase 3.0: (i) The most significant new feature is the expansion of analysis scope from SNP-related regulatory elements to include regulatory element-target gene pairs (E-G pairs), therefore it can provide SNP-based gene regulatory networks. (ii) Web function was modified according to data content and a new network search module is provided in the rSNPBase 3.0 in addition to the previous regulatory SNP (rSNP) search module. The two search modules support data query for detailed information (related-elements, element-gene pairs, and other extended annotations) on specific SNPs and SNP-related graphic networks constructed by interacting transcription factors (TFs), miRNAs and genes. (3) The type of regulatory elements was modified and enriched. To our best knowledge, the updated rSNPBase 3.0 is the first data tool supports SNP functional analysis from a regulatory network prospective, it will provide both a comprehensive understanding and concrete guidance for SNP-related regulatory studies. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
2018-01-01
Abstract Here, we present the updated rSNPBase 3.0 database (http://rsnp3.psych.ac.cn), which provides human SNP-related regulatory elements, element-gene pairs and SNP-based regulatory networks. This database is the updated version of the SNP regulatory annotation database rSNPBase and rVarBase. In comparison to the last two versions, there are both structural and data adjustments in rSNPBase 3.0: (i) The most significant new feature is the expansion of analysis scope from SNP-related regulatory elements to include regulatory element–target gene pairs (E–G pairs), therefore it can provide SNP-based gene regulatory networks. (ii) Web function was modified according to data content and a new network search module is provided in the rSNPBase 3.0 in addition to the previous regulatory SNP (rSNP) search module. The two search modules support data query for detailed information (related-elements, element-gene pairs, and other extended annotations) on specific SNPs and SNP-related graphic networks constructed by interacting transcription factors (TFs), miRNAs and genes. (3) The type of regulatory elements was modified and enriched. To our best knowledge, the updated rSNPBase 3.0 is the first data tool supports SNP functional analysis from a regulatory network prospective, it will provide both a comprehensive understanding and concrete guidance for SNP-related regulatory studies. PMID:29140525
Challenges of microarray applications for microbial detection and gene expression profiling in food
USDA-ARS?s Scientific Manuscript database
Microarray technology represents one of the latest advances in molecular biology. The diverse types of microarrays have been applied to clinical and environmental microbiology, microbial ecology, and in human, veterinary, and plant diagnostics. Since multiple genes can be analyzed simultaneously, ...
Keitges, Elisabeth A; Pasion, Romela; Burnside, Rachel D; Mason, Carla; Gonzalez-Ruiz, Antonio; Dunn, Teresa; Masiello, Meredith; Gebbia, Joseph A; Fernandez, Carlos O; Risheg, Hiba
2013-07-01
Microdeletions of 8p23.1 are mediated by low copy repeats and can cause congenital diaphragmatic hernia (CDH) and cardiac defects. Within this region, point mutations of the GATA4 gene have been shown to cause cardiac defects. However, the cause of CDH in these deletions has been difficult to determine due to the paucity of mutations that result in CDH, the lack of smaller deletions to refine the region and the reduced penetrance of CDH in these large deletions. Mice deficient for one copy of the Gata4 gene have been described with CDH and heart defects suggesting mutations in Gata4 can cause the phenotype in mice. We report on the SNP microarray analysis on two fetuses with deletions of 8p23.1. The first had CDH and a ventricular septal defect (VSD) on ultrasonography and a family history of a maternal VSD. Microarray analysis detected a 127-kb deletion which included the GATA4 and NEIL2 genes which was inherited from the mother. The second fetus had an incomplete atrioventricular canal defect on ultrasonography. Microarray analysis showed a 315-kb deletion that included seven genes, GATA4, NEIL2, FDFT1, CTSB, DEFB136, DEFB135, and DEFB134. These results suggest that haploinsufficiency of the two genes in common within 8p23.1; GATA4 and NEIL2 can cause CDH and cardiac defects in humans. Copyright © 2013 Wiley Periodicals, Inc.
Novel applications of array comparative genomic hybridization in molecular diagnostics.
Cheung, Sau W; Bi, Weimin
2018-05-31
In 2004, the implementation of array comparative genomic hybridization (array comparative genome hybridization [CGH]) into clinical practice marked a new milestone for genetic diagnosis. Array CGH and single-nucleotide polymorphism (SNP) arrays enable genome-wide detection of copy number changes in a high resolution, and therefore microarray has been recognized as the first-tier test for patients with intellectual disability or multiple congenital anomalies, and has also been applied prenatally for detection of clinically relevant copy number variations in the fetus. Area covered: In this review, the authors summarize the evolution of array CGH technology from their diagnostic laboratory, highlighting exonic SNP arrays developed in the past decade which detect small intragenic copy number changes as well as large DNA segments for the region of heterozygosity. The applications of array CGH to human diseases with different modes of inheritance with the emphasis on autosomal recessive disorders are discussed. Expert commentary: An exonic array is a powerful and most efficient clinical tool in detecting genome wide small copy number variants in both dominant and recessive disorders. However, whole-genome sequencing may become the single integrated platform for detection of copy number changes, single-nucleotide changes as well as balanced chromosomal rearrangements in the near future.
Li, Dongmei; Le Pape, Marc A; Parikh, Nisha I; Chen, Will X; Dye, Timothy D
2013-01-01
Microarrays are widely used for examining differential gene expression, identifying single nucleotide polymorphisms, and detecting methylation loci. Multiple testing methods in microarray data analysis aim at controlling both Type I and Type II error rates; however, real microarray data do not always fit their distribution assumptions. Smyth's ubiquitous parametric method, for example, inadequately accommodates violations of normality assumptions, resulting in inflated Type I error rates. The Significance Analysis of Microarrays, another widely used microarray data analysis method, is based on a permutation test and is robust to non-normally distributed data; however, the Significance Analysis of Microarrays method fold change criteria are problematic, and can critically alter the conclusion of a study, as a result of compositional changes of the control data set in the analysis. We propose a novel approach, combining resampling with empirical Bayes methods: the Resampling-based empirical Bayes Methods. This approach not only reduces false discovery rates for non-normally distributed microarray data, but it is also impervious to fold change threshold since no control data set selection is needed. Through simulation studies, sensitivities, specificities, total rejections, and false discovery rates are compared across the Smyth's parametric method, the Significance Analysis of Microarrays, and the Resampling-based empirical Bayes Methods. Differences in false discovery rates controls between each approach are illustrated through a preterm delivery methylation study. The results show that the Resampling-based empirical Bayes Methods offer significantly higher specificity and lower false discovery rates compared to Smyth's parametric method when data are not normally distributed. The Resampling-based empirical Bayes Methods also offers higher statistical power than the Significance Analysis of Microarrays method when the proportion of significantly differentially expressed genes is large for both normally and non-normally distributed data. Finally, the Resampling-based empirical Bayes Methods are generalizable to next generation sequencing RNA-seq data analysis.
Mendoza Lopez, Pablo; Golby, Paul; Wooff, Esen; Garcia, Javier Nunez; Garcia Pelayo, M. Carmen; Conlon, Kevin; Gema Camacho, Ana; Hewinson, R. Glyn; Polaina, Julio; Suárez García, Antonio; Gordon, Stephen V.
2010-01-01
A number of single-nucleotide polymorphisms (SNPs) have been identified in the genome of Mycobacterium bovis BCG Pasteur compared with the sequenced strain M. bovis 2122/97. The functional consequences of many of these mutations remain to be described; however, mutations in genes encoding regulators may be particularly relevant to global phenotypic changes such as loss of virulence, since alteration of a regulator's function will affect the expression of a wide range of genes. One such SNP falls in bcg3145, encoding a member of the AfsR/DnrI/SARP class of global transcriptional regulators, that replaces a highly conserved glutamic acid residue at position 159 (E159G) with glycine in a tetratricopeptide repeat (TPR) located in the bacterial transcriptional activation (BTA) domain of BCG3145. TPR domains are associated with protein–protein interactions, and a conserved core (helices T1–T7) of the BTA domain seems to be required for proper function of SARP-family proteins. Structural modelling predicted that the E159G mutation perturbs the third α-helix of the BTA domain and could therefore have functional consequences. The E159G SNP was found to be present in all BCG strains, but absent from virulent M. bovis and Mycobacterium tuberculosis strains. By overexpressing BCG3145 and Rv3124 in BCG and H37Rv and monitoring transcriptome changes using microarrays, we determined that BCG3145/Rv3124 acts as a positive transcriptional regulator of the molybdopterin biosynthesis moa1 locus, and we suggest that rv3124 be renamed moaR1. The SNP in bcg3145 was found to have a subtle effect on the activity of MoaR1, suggesting that this mutation is not a key event in the attenuation of BCG. PMID:20378651
[Typing and subtyping avian influenza virus using DNA microarrays].
Yang, Zhongping; Wang, Xiurong; Tian, Lina; Wang, Yu; Chen, Hualan
2008-07-01
Outbreaks of highly pathogenic avian influenza (HPAI) virus has caused great economic loss to the poultry industry and resulted in human deaths in Thailand and Vietnam since 2004. Rapid typing and subtyping of viruses, especially HPAI from clinical specimens, are desirable for taking prompt control measures to prevent spreading of the disease. We described a simultaneous approach using microarray to detect and subtype avian influenza virus (AIV). We designed primers of probe genes and used reverse transcriptase PCR to prepare cDNAs of AIV M gene, H5, H7, H9 subtypes haemagglutinin genes and N1, N2 subtypes neuraminidase genes. They were cloned, sequenced, reamplified and spotted to form a glass-bound microarrays. We labeled samples using Cy3-dUTP by RT-PCR, hybridized and scanned the microarrays to typing and subtyping AIV. The hybridization pattern agreed perfectly with the known grid location of each probe, no cross hybridization could be detected. Examinating of HA subtypes 1 through 15, 30 infected samples and 21 field samples revealed the DNA microarray assay was more sensitive and specific than RT-PCR test and chicken embryo inoculation. It can simultaneously detect and differentiate the main epidemic AIV. The results show that DNA microarray technology is a useful diagnostic method.
Microarrays in brain research: the good, the bad and the ugly.
Mirnics, K
2001-06-01
Making sense of microarray data is a complex process, in which the interpretation of findings will depend on the overall experimental design and judgement of the investigator performing the analysis. As a result, differences in tissue harvesting, microarray types, sample labelling and data analysis procedures make post hoc sharing of microarray data a great challenge. To ensure rapid and meaningful data exchange, we need to create some order out of the existing chaos. In these ground-breaking microarray standardization and data sharing efforts, NIH agencies should take a leading role
Wang, Lin; Liu, Simin; Niu, Tianhua; Xu, Xin
2005-03-18
Single nucleotide polymorphisms (SNPs) provide an important tool in pinpointing susceptibility genes for complex diseases and in unveiling human molecular evolution. Selection and retrieval of an optimal SNP set from publicly available databases have emerged as the foremost bottlenecks in designing large-scale linkage disequilibrium studies, particularly in case-control settings. We describe the architectural structure and implementations of a novel software program, SNPHunter, which allows for both ad hoc-mode and batch-mode SNP search, automatic SNP filtering, and retrieval of SNP data, including physical position, function class, flanking sequences at user-defined lengths, and heterozygosity from NCBI dbSNP. The SNP data extracted from dbSNP via SNPHunter can be exported and saved in plain text format for further down-stream analyses. As an illustration, we applied SNPHunter for selecting SNPs for 10 major candidate genes for type 2 diabetes, including CAPN10, FABP4, IL6, NOS3, PPARG, TNF, UCP2, CRP, ESR1, and AR. SNPHunter constitutes an efficient and user-friendly tool for SNP screening, selection, and acquisition. The executable and user's manual are available at http://www.hsph.harvard.edu/ppg/software.htm
Fine-Scale Variation and Genetic Determinants of Alternative Splicing across Individuals
Coulombe-Huntington, Jasmin; Lam, Kevin C. L.; Dias, Christel; Majewski, Jacek
2009-01-01
Recently, thanks to the increasing throughput of new technologies, we have begun to explore the full extent of alternative pre–mRNA splicing (AS) in the human transcriptome. This is unveiling a vast layer of complexity in isoform-level expression differences between individuals. We used previously published splicing sensitive microarray data from lymphoblastoid cell lines to conduct an in-depth analysis on splicing efficiency of known and predicted exons. By combining publicly available AS annotation with a novel algorithm designed to search for AS, we show that many real AS events can be detected within the usually unexploited, speculative majority of the array and at significance levels much below standard multiple-testing thresholds, demonstrating that the extent of cis-regulated differential splicing between individuals is potentially far greater than previously reported. Specifically, many genes show subtle but significant genetically controlled differences in splice-site usage. PCR validation shows that 42 out of 58 (72%) candidate gene regions undergo detectable AS, amounting to the largest scale validation of isoform eQTLs to date. Targeted sequencing revealed a likely causative SNP in most validated cases. In all 17 incidences where a SNP affected a splice-site region, in silico splice-site strength modeling correctly predicted the direction of the micro-array and PCR results. In 13 other cases, we identified likely causative SNPs disrupting predicted splicing enhancers. Using Fst and REHH analysis, we uncovered significant evidence that 2 putative causative SNPs have undergone recent positive selection. We verified the effect of five SNPs using in vivo minigene assays. This study shows that splicing differences between individuals, including quantitative differences in isoform ratios, are frequent in human populations and that causative SNPs can be identified using in silico predictions. Several cases affected disease-relevant genes and it is likely some of these differences are involved in phenotypic diversity and susceptibility to complex diseases. PMID:20011102
Chang, M-T; Cheng, Y-S; Huang, M-C
2013-02-01
In our previous cDNA microarray study, we found that the carbonic anhydrase II (CA2) gene is one of the differentially expressed transcripts in the duck isthmus epithelium during egg formation period. The aim of this study was to identify the single-nucleotide polymorphisms (SNPs) in the CA2 gene of Tsaiya ducks. The relationship of SNP genotype with egg production and reproduction traits was also investigated. A total of 317 ducks from two lines, a control line with no selection and a selected line, were employed for testing. Three SNPs (C37T, A62G and A65G) in the 3'-untranslated region of the CA2 gene were found. SNP-trait association analysis showed that SNP C37T and A62G were associated with duck egg weight besides fertility. The ducks with the CT and AG genotypes had a 1.46 and 1.62 g/egg lower egg weight as compared with ducks with the CC and AA genotypes, respectively (p < 0.05). But the ducks with CT and AG genotypes had 5.20% and 4.22% higher fertility than those with CC and AA genotypes, respectively (p < 0.05). Diplotype constructed on these three SNPs was associated with duck fertility, and the diplotype H1H4 was dominant for duck fertility. These findings might provide the basis for balanced selection and may be used in marker-assisted selection to improve egg weight and fertility simultaneously in the Tsaiya ducks. © 2012 Blackwell Verlag GmbH.
Knüppel, Sven; Meidtner, Karina; Arregui, Maria; Holzhütter, Hermann-Georg; Boeing, Heiner
2015-07-01
Analyzing multiple single nucleotide polymorphisms (SNPs) is a promising approach to finding genetic effects beyond single-locus associations. We proposed the use of multilocus stepwise regression (MSR) to screen for allele combinations as a method to model joint effects, and compared the results with the often used genetic risk score (GRS), conventional stepwise selection, and the shrinkage method LASSO. In contrast to MSR, the GRS, conventional stepwise selection, and LASSO model each genotype by the risk allele doses. We reanalyzed 20 unlinked SNPs related to type 2 diabetes (T2D) in the EPIC-Potsdam case-cohort study (760 cases, 2193 noncases). No SNP-SNP interactions and no nonlinear effects were found. Two SNP combinations selected by MSR (Nagelkerke's R² = 0.050 and 0.048) included eight SNPs with mean allele combination frequency of 2%. GRS and stepwise selection selected nearly the same SNP combinations consisting of 12 and 13 SNPs (Nagelkerke's R² ranged from 0.020 to 0.029). LASSO showed similar results. The MSR method showed the best model fit measured by Nagelkerke's R² suggesting that further improvement may render this method a useful tool in genetic research. However, our comparison suggests that the GRS is a simple way to model genetic effects since it does not consider linkage, SNP-SNP interactions, and no non-linear effects. © 2015 John Wiley & Sons Ltd/University College London.
snpGeneSets: An R Package for Genome-Wide Study Annotation
Mei, Hao; Li, Lianna; Jiang, Fan; Simino, Jeannette; Griswold, Michael; Mosley, Thomas; Liu, Shijian
2016-01-01
Genome-wide studies (GWS) of SNP associations and differential gene expressions have generated abundant results; next-generation sequencing technology has further boosted the number of variants and genes identified. Effective interpretation requires massive annotation and downstream analysis of these genome-wide results, a computationally challenging task. We developed the snpGeneSets package to simplify annotation and analysis of GWS results. Our package integrates local copies of knowledge bases for SNPs, genes, and gene sets, and implements wrapper functions in the R language to enable transparent access to low-level databases for efficient annotation of large genomic data. The package contains functions that execute three types of annotations: (1) genomic mapping annotation for SNPs and genes and functional annotation for gene sets; (2) bidirectional mapping between SNPs and genes, and genes and gene sets; and (3) calculation of gene effect measures from SNP associations and performance of gene set enrichment analyses to identify functional pathways. We applied snpGeneSets to type 2 diabetes (T2D) results from the NHGRI genome-wide association study (GWAS) catalog, a Finnish GWAS, and a genome-wide expression study (GWES). These studies demonstrate the usefulness of snpGeneSets for annotating and performing enrichment analysis of GWS results. The package is open-source, free, and can be downloaded at: https://www.umc.edu/biostats_software/. PMID:27807048
Calpain-10 gene polymorphisms and risk of type 2 diabetes mellitus in Mexican mestizos.
Picos-Cárdenas, V J; Sáinz-González, E; Miliar-García, A; Romero-Zazueta, A; Quintero-Osuna, R; Leal-Ugarte, E; Peralta-Leal, V; Meza-Espinoza, J P
2015-03-27
The calpain-10 gene is expressed primarily in tissues important in glucose metabolism; thus, some of its polymorphisms have been associated with type 2 diabetes. In this study, we examined the association between the calpain-10 single-nucleotide polymorphism (SNP)-43, SNP-19, and SNP-63 and type 2 diabetes in Mexican mestizos. We included 211 patients and 152 non-diabetic subjects. Polymerase chain reaction was used to identify alleles. We compared allele, genotype, haplotype, and diplotype frequencies between both groups and used the chi-square test to calculate the risk. The allele frequency of SNP-43 allele 1 was 70% in controls and 72% in patients; the GG, GA, and AA genotype frequencies were 48.7, 42.8, and 8.5% in controls and 51.2, 41.7, and 7.1% in patients, respectively. For SNP- 19, the prevalence of allele 1 (2R) was 32% in controls and 39% in patients. In controls, homozygosity (2R/2R) was 10.5%, heterozygosity was 42.8%, and 3R/3R was 46.7%; in cases, these values were 13.3, 50.7, and 36.0%, respectively. For SNP-63, the frequency of allele 1 was 87% in controls and 83% in patients; genotype frequencies in controls were 75.7% (CC), 23% (CT), and 1.3% (TT), and were 69.7, 27.5, and 2.8%, respectively for the cases. Genotype distributions were consistent with Hardy-Weinberg equilibrium. No significant intergroup differences for allele, genotype, haplotype, or diplotype frequencies were observed. We found no association between these polymorphisms and diabetes. However, our sample size was small, so the role of calpain-10 risk alleles should be further examined.
Honsa, Erin; Fricke, Thomas; Stephens, Alex J; Ko, Danny; Kong, Fanrong; Gilbert, Gwendolyn L; Huygens, Flavia; Giffard, Philip M
2008-08-19
Streptococcus agalactiae (Group B Streptococcus (GBS)) is an important human pathogen, particularly of newborns. Emerging evidence for a relationship between genotype and virulence has accentuated the need for efficient and well-defined typing methods. The objective of this study was to develop a single nucleotide polymorphism (SNP) based method for assigning GBS isolates to multilocus sequence typing (MLST)-defined clonal complexes. It was found that a SNP set derived from the MLST database on the basis of maximization of Simpsons Index of Diversity provided poor resolution and did not define groups concordant with the population structure as defined by eBURST analysis of the MLST database. This was interpreted as being a consequence of low diversity and high frequency horizontal gene transfer. Accordingly, a different approach to SNP identification was developed. This entailed use of the "Not-N" bioinformatic algorithm that identifies SNPs diagnostic for groups of known sequence variants, together with an empirical process of SNP testing. This yielded a four member SNP set that divides GBS into 10 groups that are concordant with the population structure. A fifth SNP was identified that increased the sensitivity for the clinically significant clonal complex 17 to 100%. Kinetic PCR methods for the interrogation of these SNPs were developed, and used to genotype 116 well characterized isolates. A five SNP method for dividing GBS into biologically valid groups has been developed. These SNPs are ideal for high throughput surveillance activities, and combining with more rapidly evolving loci when additional resolution is required.
Honsa, Erin; Fricke, Thomas; Stephens, Alex J; Ko, Danny; Kong, Fanrong; Gilbert, Gwendolyn L; Huygens, Flavia; Giffard, Philip M
2008-01-01
Background Streptococcus agalactiae (Group B Streptococcus (GBS)) is an important human pathogen, particularly of newborns. Emerging evidence for a relationship between genotype and virulence has accentuated the need for efficient and well-defined typing methods. The objective of this study was to develop a single nucleotide polymorphism (SNP) based method for assigning GBS isolates to multilocus sequence typing (MLST)-defined clonal complexes. Results It was found that a SNP set derived from the MLST database on the basis of maximisation of Simpsons Index of Diversity provided poor resolution and did not define groups concordant with the population structure as defined by eBURST analysis of the MLST database. This was interpreted as being a consequence of low diversity and high frequency horizontal gene transfer. Accordingly, a different approach to SNP identification was developed. This entailed use of the "Not-N" bioinformatic algorithm that identifies SNPs diagnostic for groups of known sequence variants, together with an empirical process of SNP testing. This yielded a four member SNP set that divides GBS into 10 groups that are concordant with the population structure. A fifth SNP was identified that increased the sensitivity for the clinically significant clonal complex 17 to 100%. Kinetic PCR methods for the interrogation of these SNPs were developed, and used to genotype 116 well characterized isolates. Conclusion A five SNP method for dividing GBS into biologically valid groups has been developed. These SNPs are ideal for high throughput surveillance activities, and combining with more rapidly evolving loci when additional resolution is required. PMID:18710585
Calpain-10 gene polymorphism in type 2 diabetes mellitus patients in the Gaza Strip.
Zaharna, Mazen M; Abed, Abdalla A; Sharif, Fadel A
2010-01-01
To examine the role of calpain-10 SNP-44, -43, -63 and del/ins-19 in genetic susceptibility to type 2 diabetes mellitus (T2DM) and associations with triglycerides and total cholesterol in a group of subjects residing in the Gaza Strip. Ninety-six individuals were examined: 48 T2DM patients and 48 controls. The groups were genotyped for calpain-10 SNP-44, -43, -63, and del/ins-19. Mutagenically separated polymerase chain reaction was used to examine SNP-44; del/ins-19 was examined by electrophoresis of the PCR product on agarose gel, while the restriction fragment length polymorphism method was used for SNP-43 and -63. There was evidence that the C allele at SNP-44 played a possible role in susceptibility to T2DM (p = 0.01). T2DM patients with G/A genotype were found to have higher levels of total cholesterol in comparison to those homozygous for allele 1 (G/G) in SNP-43. Total cholesterol levels increased in T2DM patients who are homozygous for del/ins-19 allele 2, in T2DM patients with the 121/221 haplotype combination, and in control subjects with the haplotype combination 111/121. SNP-44 polymorphism of the calpain-10 gene has a significant association with T2DM patients in the Gaza strip. Certain polymorphisms of calpain-10 also have associations with the levels of total cholesterol in both T2DM patients and controls. Copyright © 2010 S. Karger AG, Basel.
Clevert, Djork-Arné; Mitterecker, Andreas; Mayr, Andreas; Klambauer, Günter; Tuefferd, Marianne; De Bondt, An; Talloen, Willem; Göhlmann, Hinrich; Hochreiter, Sepp
2011-07-01
Cost-effective oligonucleotide genotyping arrays like the Affymetrix SNP 6.0 are still the predominant technique to measure DNA copy number variations (CNVs). However, CNV detection methods for microarrays overestimate both the number and the size of CNV regions and, consequently, suffer from a high false discovery rate (FDR). A high FDR means that many CNVs are wrongly detected and therefore not associated with a disease in a clinical study, though correction for multiple testing takes them into account and thereby decreases the study's discovery power. For controlling the FDR, we propose a probabilistic latent variable model, 'cn.FARMS', which is optimized by a Bayesian maximum a posteriori approach. cn.FARMS controls the FDR through the information gain of the posterior over the prior. The prior represents the null hypothesis of copy number 2 for all samples from which the posterior can only deviate by strong and consistent signals in the data. On HapMap data, cn.FARMS clearly outperformed the two most prevalent methods with respect to sensitivity and FDR. The software cn.FARMS is publicly available as a R package at http://www.bioinf.jku.at/software/cnfarms/cnfarms.html.
Bergin, Sarah M; Periaswamy, Balamurugan; Barkham, Timothy; Chua, Hong Choon; Mok, Yee Ming; Fung, Daniel Shuen Sheng; Su, Alex Hsin Chuan; Lee, Yen Ling; Chua, Ming Lai Ivan; Ng, Poh Yong; Soon, Wei Jia Wendy; Chu, Collins Wenhan; Tan, Siyun Lucinda; Meehan, Mary; Ang, Brenda Sze Peng; Leo, Yee Sin; Holden, Matthew T G; De, Partha; Hsu, Li Yang; Chen, Swaine L; de Sessions, Paola Florez; Marimuthu, Kalisvar
2018-05-09
OBJECTIVEWe report the utility of whole-genome sequencing (WGS) conducted in a clinically relevant time frame (ie, sufficient for guiding management decision), in managing a Streptococcus pyogenes outbreak, and present a comparison of its performance with emm typing.SETTINGA 2,000-bed tertiary-care psychiatric hospital.METHODSActive surveillance was conducted to identify new cases of S. pyogenes. WGS guided targeted epidemiological investigations, and infection control measures were implemented. Single-nucleotide polymorphism (SNP)-based genome phylogeny, emm typing, and multilocus sequence typing (MLST) were performed. We compared the ability of WGS and emm typing to correctly identify person-to-person transmission and to guide the management of the outbreak.RESULTSThe study included 204 patients and 152 staff. We identified 35 patients and 2 staff members with S. pyogenes. WGS revealed polyclonal S. pyogenes infections with 3 genetically distinct phylogenetic clusters (C1-C3). Cluster C1 isolates were all emm type 4, sequence type 915 and had pairwise SNP differences of 0-5, which suggested recent person-to-person transmissions. Epidemiological investigation revealed that cluster C1 was mediated by dermal colonization and transmission of S. pyogenes in a male residential ward. Clusters C2 and C3 were genomically diverse, with pairwise SNP differences of 21-45 and 26-58, and emm 11 and mostly emm120, respectively. Clusters C2 and C3, which may have been considered person-to-person transmissions by emm typing, were shown by WGS to be unlikely by integrating pairwise SNP differences with epidemiology.CONCLUSIONSWGS had higher resolution than emm typing in identifying clusters with recent and ongoing person-to-person transmissions, which allowed implementation of targeted intervention to control the outbreak.Infect Control Hosp Epidemiol 2018;1-9.
Nair, S; Lee, Y H; Lindsay, R S; Walker, B R; Tataranni, P A; Bogardus, C; Baier, L J; Permana, P A
2004-06-01
The enzyme 11beta-hydroxysteroid dehydrogenase type 1 (11beta-HSD1) modulates tissue-specific glucocorticoid concentrations by generating active cortisol. We have shown that adipose tissue 11beta-HSD1 mRNA levels were associated with adiposity and insulinaemia. Here we conducted further expression and genetic association studies in Pima Indians. The 11beta-HSD1 mRNA concentrations were measured in abdominal subcutaneous adipocytes (n=61) and skeletal muscle tissues (n=64). Single nucleotide polymorphisms in the HSD11B1 gene were genotyped in a larger group of full-blooded Pima Indians. Two representative SNPs (SNP1, n=706; SNP5, n=839) were associated with Type 2 diabetes mellitus (p=0.01), although neither SNP was associated with obesity. Among subjects with normal glucose tolerance, SNP1 (n=127) and SNP5 (n=159) were associated with insulin-mediated glucose uptake rates (p=0.03 and p=0.04), and SNP1 was further associated with fasting, 30-min, and 2-h plasma insulin concentrations (p=0.002, p=0.002 and p=0.03). Adipocyte 11beta-HSD1 mRNA concentrations were correlated positively with adiposity and insulinaemia, and were additionally negatively correlated with insulin-mediated glucose uptake rates; nevertheless, the adipocyte 11beta-HSD1 expression did not correlate with genotypes of the donors. The muscle 11beta-HSD1 mRNA concentrations did not correlate with any anthropometric or metabolic variables. We confirmed that adipocyte 11beta-HSD1 mRNA concentrations were associated with adiposity, and showed that genetic variations in the HSD11B1 gene were associated with Type 2 diabetes mellitus, plasma insulin concentrations and insulin action, independent of obesity. The variable adipose expression might not be a primary consequence of these HSD11B1 SNPs. Therefore, it is possible that the HSD11B1 gene is under tissue-specific regulation, and has tissue-specific consequences.
[Prenatal genetic diagnosis for a fetus with atypical neurofibromatosis type 1 microdeletion].
Lin, Shaobin; Wu, Jianzhu; Zhang, Zhiqiang; Ji, Yuanjun; Fang, Qun; Chen, Baojiang; Luo, Yanmin
2016-04-01
To analyze the correlation between atypical neurofibromatosis type 1(NF1) microdeletion and fetal phenotype. Fetal blood sampling was carried out for a woman bearing a fetus with talipes equinovarus. G-banded karyotyping and single nucleotide polymorphism array (SNP-array) were performed on the fetal blood sample. Fluorescence in situ hybridization (FISH) was used to confirm the result of SNP array analysis. FISH assay was also carried out on peripheral blood specimens from the parents to ascertain the origin of mutation. The karyotype of fetus was found to be 46, XY by G-banding analysis. However, a 3.132 Mb microdeletion was detected in chromosome region 17q11.2 by SNP array, which overlaped with the region of NF1 microdeletion syndrome. Analyzing of the specimens from the fetus and its parents with FISH has confirmed it to be a de novo deletion. Talipes equinovarus may be an abnormal sonographic feature of fetus with atypical NF1 microdeletion which can be accurately diagnosed with SNP array.
Hatono, Saki; Nishimura, Kaori; Murakami, Yoko; Tsujimura, Mai; Yamagishi, Hiroshi
2017-09-01
The complete sequence of the mitochondrial genome was determined for two cultivars of Brassica rapa . After determining the sequence of a Chinese cabbage variety, 'Oushou hakusai', the sequence of a mizuna variety, 'Chusei shiroguki sensuji kyomizuna', was mapped against the sequence of Chinese cabbage. The precise sequences where the two varieties demonstrated variation were ascertained by direct sequencing. It was found that the mitochondrial genomes of the two varieties are identical over 219,775 bp, with a single nucleotide polymorphism (SNP) between the genomes. Because B. rapa is the maternal species of an amphidiploid crop species, Brassica juncea , the distribution of the SNP was observed both in B. rapa and B. juncea . While the mizuna type SNP was restricted mainly to cultivars of mizuna (japonica group) in B. rapa , the mizuna type was widely distributed in B. juncea . The finding that the two Brassica species have these SNP types in common suggests that the nucleotide substitution occurred in wild B. rapa before both mitotypes were domesticated. It was further inferred that the interspecific hybridization between B. rapa and B. nigra took place twice and resulted in the two mitotypes of cultivated B. juncea .
Bouakaze, Caroline; Keyser, Christine; Crubézy, Eric; Montagnon, Daniel; Ludes, Bertrand
2009-07-01
In the present study, a multiplexed genotyping assay for ten single nucleotide polymorphisms (SNPs) located within six pigmentation candidate genes was developed on modern biological samples and applied to DNA retrieved from 25 archeological human remains from southern central Siberia dating from the Bronze and Iron Ages. SNP genotyping was successful for the majority of ancient samples and revealed that most probably had typical European pigment features, i.e., blue or green eye color, light hair color and skin type, and were likely of European individual ancestry. To our knowledge, this study reports for the first time the multiplexed typing of autosomal SNPs on aged and degraded DNA. By providing valuable information on pigment traits of an individual and allowing individual biogeographical ancestry estimation, autosomal SNP typing can improve ancient DNA studies and aid human identification in some forensic casework situations when used to complement conventional molecular markers.
Automation of complex assays: pharmacogenetics of warfarin dosing.
Wu, Whei-Kuo; Hujsak, Paul G; Kureshy, Fareed
2007-10-01
AutoGenomics, Inc. (Carlsbad, CA, USA) have developed a multiplex microarray assay for genotyping both VKORC1 and CYP2C9 using the INFINITI(™) Analyzer. Multiple alleles in each DNA sample are analyzed by polymerase chain reaction amplification, followed by detection primer extension using the INFINITI Analyzer. The INFINITI Analyzer performs single-nucleotide polymorphism (SNP) analysis using universal oligonucleotides immobilized on the biochip. To genotype broader ethnic groups, genomic DNA from whole blood was tested for nine SNPs for VKORC1 and six for CYP2C9 genotypes. Information related to all 15 SNPs is needed to determine dosing of population of diverse ethnic origin. The INFINITI system provides genotyping information for same day dosing of warfarin.
Cohort analysis of a single nucleotide polymorphism on DNA chips.
Schwonbeck, Susanne; Krause-Griep, Andrea; Gajovic-Eichelmann, Nenad; Ehrentreich-Förster, Eva; Meinl, Walter; Glatt, Hansrüdi; Bier, Frank F
2004-11-15
A method has been developed to determine SNPs on DNA chips by applying a flow-through bioscanner. As a practical application we demonstrated the fast and simple SNP analysis of 24 genotypes in an array of 96 spots with a single hybridisation and dissociation experiment. The main advantage of this methodical concept is the parallel and fast analysis without any need of enzymatic digestion. Additionally, the DNA chip format used is appropriate for parallel analysis up to 400 spots. The polymorphism in the gene of the human phenol sulfotransferase SULT1A1 was studied as a model SNP. Biotinylated PCR products containing the SNP (The SNP summary web site: ) (mutant) and those containing no mutation (wild-type) were brought onto the chips coated with NeutrAvidin using non-contact spotting. This was followed by an analysis which was carried out in a flow-through biochip scanner while constantly rinsing with buffer. After removing the non-biotinylated strand a fluorescent probe was hybridised, which is complementary to the wild-type sequence. If this probe binds to a mutant sequence, then one single base is not fully matching. Thereby, the mismatched hybrid (mutant) is less stable than the full-matched hybrid (wild-type). The final step after hybridisation on the chip involves rinsing with a buffer to start dissociation of the fluorescent probe from the immobilised DNA strand. The online measurement of the fluorescence intensity by the biochip scanner provides the possibility to follow the kinetics of the hybridisation and dissociation processes. According to the different stability of the full-match and the mismatch, either visual discrimination or kinetic analysis is possible to distinguish SNP-containing sequence from the wild-type sequence.
A set of 14 DIP-SNP markers to detect unbalanced DNA mixtures.
Liu, Zhizhen; Liu, Jinding; Wang, Jiaqi; Chen, Deqing; Liu, Zidong; Shi, Jie; Li, Zeqin; Li, Wenyan; Zhang, Gengqian; Du, Bing
2018-03-04
Unbalanced DNA mixture is still a difficult problem for forensic practice. DIP-STRs are useful markers for detection of minor DNA but they are not widespread in the human genome and having long amplicons. In this study, we proposed a novel type of genetic marker, termed DIP-SNP. DIP-SNP refers to the combination of INDEL and SNP in less than 300bp length of human genome. The multiplex PCR and SNaPshot assay were established for 14 DIP-SNP markers in a Chinese Han population from Shanxi, China. This novel compound marker allows detection of the minor DNA contributor with sensitivity from 1:50 to 1:1000 in a DNA mixture of any gender with 1 ng-10 ng DNA template. Most of the DIP-SNP markers had a relatively high probability of informative alleles with an average I value of 0.33. In all, we proposed DIP-SNP as a novel kind of genetic marker for detection of minor contributor from unbalanced DNA mixture and established the detection method by associating the multiplex PCR and SNaPshot assay. DIP-SNP polymorphisms are promising markers for forensic or clinical mixture examination because they are shorter, widespread and higher sensitive. Copyright © 2018 Elsevier Inc. All rights reserved.
Analysis of SNP rs16754 of WT1 gene in a series of de novo acute myeloid leukemia patients.
Luna, Irene; Such, Esperanza; Cervera, Jose; Barragán, Eva; Jiménez-Velasco, Antonio; Dolz, Sandra; Ibáñez, Mariam; Gómez-Seguí, Inés; López-Pavía, María; Llop, Marta; Fuster, Óscar; Oltra, Silvestre; Moscardó, Federico; Martínez-Cuadrón, David; Senent, M Leonor; Gascón, Adriana; Montesinos, Pau; Martín, Guillermo; Bolufer, Pascual; Sanz, Miguel A
2012-12-01
The single nucleotide polymorphism (SNP) rs16754 of the WT1 gene has been previously described as a possible prognostic marker in normal karyotype acute myeloid leukemia (AML) patients. Nevertheless, the findings in this field are not always reproducible in different series. One hundred and seventy-five adult de novo AML patients were screened with two different methods for the detection of SNP rs16754: high-resolution melting (HRM) and FRET hybridization probes. Direct sequencing was used to validate both techniques. The SNP was detected in 52 out of 175 patients (30 %), both by HRM and hybridization probes. Direct sequencing confirmed that every positive sample in the screening methods had a variation in the DNA sequence. Patients with the wild-type genotype (WT1(AA)) for the SNP rs16754 were significantly younger than those with the heterozygous WT1(AG) genotype. No other difference was observed for baseline characteristic or outcome between patients with or without the SNP. Both techniques are equally reliable and reproducible as screening methods for the detection of the SNP rs16754, allowing for the selection of those samples that will need to be sequenced. We were unable to confirm the suggested favorable outcome of SNP rs16754 in de novo AML.
The effect of column purification on cDNA indirect labelling for microarrays
Molas, M Lia; Kiss, John Z
2007-01-01
Background The success of the microarray reproducibility is dependent upon the performance of standardized procedures. Since the introduction of microarray technology for the analysis of global gene expression, reproducibility of results among different laboratories has been a major problem. Two of the main contributors to this variability are the use of different microarray platforms and different laboratory practices. In this paper, we address the latter question in terms of how variation in one of the steps of a labelling procedure affects the cDNA product prior to microarray hybridization. Results We used a standard procedure to label cDNA for microarray hybridization and employed different types of column chromatography for cDNA purification. After purifying labelled cDNA, we used the Agilent 2100 Bioanalyzer and agarose gel electrophoresis to assess the quality of the labelled cDNA before its hybridization onto a microarray platform. There were major differences in the cDNA profile (i.e. cDNA fragment lengths and abundance) as a result of using four different columns for purification. In addition, different columns have different efficiencies to remove rRNA contamination. This study indicates that the appropriate column to use in this type of protocol has to be experimentally determined. Finally, we present new evidence establishing the importance of testing the method of purification used during an indirect labelling procedure. Our results confirm the importance of assessing the quality of the sample in the labelling procedure prior to hybridization onto a microarray platform. Conclusion Standardization of column purification systems to be used in labelling procedures will improve the reproducibility of microarray results among different laboratories. In addition, implementation of a quality control check point of the labelled samples prior to microarray hybridization will prevent hybridizing a poor quality sample to expensive micorarrays. PMID:17597522
The effect of column purification on cDNA indirect labelling for microarrays.
Molas, M Lia; Kiss, John Z
2007-06-27
The success of the microarray reproducibility is dependent upon the performance of standardized procedures. Since the introduction of microarray technology for the analysis of global gene expression, reproducibility of results among different laboratories has been a major problem. Two of the main contributors to this variability are the use of different microarray platforms and different laboratory practices. In this paper, we address the latter question in terms of how variation in one of the steps of a labelling procedure affects the cDNA product prior to microarray hybridization. We used a standard procedure to label cDNA for microarray hybridization and employed different types of column chromatography for cDNA purification. After purifying labelled cDNA, we used the Agilent 2100 Bioanalyzer and agarose gel electrophoresis to assess the quality of the labelled cDNA before its hybridization onto a microarray platform. There were major differences in the cDNA profile (i.e. cDNA fragment lengths and abundance) as a result of using four different columns for purification. In addition, different columns have different efficiencies to remove rRNA contamination. This study indicates that the appropriate column to use in this type of protocol has to be experimentally determined. Finally, we present new evidence establishing the importance of testing the method of purification used during an indirect labelling procedure. Our results confirm the importance of assessing the quality of the sample in the labelling procedure prior to hybridization onto a microarray platform. Standardization of column purification systems to be used in labelling procedures will improve the reproducibility of microarray results among different laboratories. In addition, implementation of a quality control check point of the labelled samples prior to microarray hybridization will prevent hybridizing a poor quality sample to expensive micorarrays.
Identification of bovine NPC1 gene cSNPs and their effects on body size traits of Qinchuan cattle.
Dang, Yonglong; Li, Mingxun; Yang, Mingjuan; Cao, Xiukai; Lan, Xianyong; Lei, Chuzhao; Zhang, Chunlei; Lin, Qing; Chen, Hong
2014-05-01
NPC1 gene is an important gene closely related to the Niemann-Pick type C (NPC). Mutations in the NPC1 gene tend to cause Niemann-Pick type C, a lysosomal storage disorder. Previous studies have shown that NPC1 protein plays an important role in subcellular lipid transport, homeostasis, platelet function and formation, which are basic metabolic activities in the process of development. In this study, to explore the association between the NPC1 gene variation and body size traits in Qinchuan cattle, we detected four novel coding single nucleotide polymorphisms (cSNPs) in the bovine NPC1 gene, including one missense mutation (SNP1) and three synonymous mutations (SNP2, SNP3 and SNP4). Population genetic analyses of 518 individuals and association correlations between cSNPs and bovine body size traits were conducted in this research. A missense mutation at SNP1 locus was found to be significantly related to the heart girth, hip width and body weight (P<0.01 or P<0.05, 3.5-year-old). Two synonymous mutations at SNP2 and SNP3 loci also showed significant effects on hip width (P<0.05, 3.5-year-old). One synonymous mutation at SNP4 locus showed significant effect on body weight (P<0.05, 2.0-year-old). Combined haplotypes H2H6 and H6H6 showed significant effects on body size traits such as heart girth, hip width, and body weight (3.5-year-old, P<0.01 or P<0.05). This study provides evidence that the NPC1 gene might be involved in the regulation of bovine growth and body development, and may be considered as a candidate gene for marker assisted selection (MAS) in beef cattle breeding industry. Copyright © 2014. Published by Elsevier B.V.
Gene-environment interaction in the etiology of mathematical ability using SNP sets.
Docherty, Sophia J; Kovas, Yulia; Plomin, Robert
2011-01-01
Mathematics ability and disability is as heritable as other cognitive abilities and disabilities, however its genetic etiology has received relatively little attention. In our recent genome-wide association study of mathematical ability in 10-year-old children, 10 SNP associations were nominated from scans of pooled DNA and validated in an individually genotyped sample. In this paper, we use a 'SNP set' composite of these 10 SNPs to investigate gene-environment (GE) interaction, examining whether the association between the 10-SNP set and mathematical ability differs as a function of ten environmental measures in the home and school in a sample of 1888 children with complete data. We found two significant GE interactions for environmental measures in the home and the school both in the direction of the diathesis-stress type of GE interaction: The 10-SNP set was more strongly associated with mathematical ability in chaotic homes and when parents are negative.
Röper, Andrea; Reichert, Walter; Mattern, Rainer
2007-01-01
In the field of forensic DNA typing, the analysis of Short Tandem Repeats (STRs) can fail in cases of degraded DNA. The typing of coding region Single Nucleotide Polymorphisms (SNPs) of the mitochondrial genome provides an approach to acquire additional information. In the examined case of aggravated theft, both suspects could be excluded of having left the analyzed hair on the crime scene by SNP typing. This conclusion was not possible subsequent to STR typing. SNP typing of the trace on the torch light left on the crime scene increased the likelihood for suspect no. 2 to be the origin of this trace. This finding was already indicated by STR analysis. Suspect no. 1 was excluded for being the origin of this trace by SNP typing which was also indicated by STR analysis. A limiting factor for the analysis of SNPs is the maternal inheritance of mitochondrial DNA. Individualisation is not possible. In conclusion, it can be said that in the case of traces which cause problems with conventional STR typing the supplementary analysis of coding region SNPs from the mitochondrial genome is very reasonable and greatly contributes to the refinement of analysis methods in the field of forensic genetics.
IL-10 -1082 SNP and IL-10 in primary CNS and vitreoretinal lymphomas.
Ramkumar, Hema L; Shen, De Fen; Tuo, Jingsheng; Braziel, Rita M; Coupland, Sarah E; Smith, Justine R; Chan, Chi-Chao
2012-10-01
Most primary central nervous system lymphomas (PCNSLs) and primary vitreoretinal lymphomas (PVRLs) are B-cell lymphomas that produce high levels of interleukin (IL)-10, which is linked to rapid disease progression. The IL-10 (-1082) G → A polymorphism (IL-10 SNP) is associated with improved survival in certain non-CNS lymphoma patients. PDCD4 is a tumor suppressor gene and upstream regulator of IL-10. This study examined the correlation between the IL-10 SNP, PDCD4 mRNA expression, and IL-10 expression (at transcript and protein levels) in these lymphoma cells. Single-nucleotide polymorphism (SNP)-typing at IL-10 (-1082) was performed after microdissecting cytospun PVRL cells from 26 specimens. Vitreal IL-10 and IL-6 levels were measured by ELISA. PCNSL cells from 52 paraffin-embedded sections were microdissected and SNP typed on genomic DNA. RT-PCR was performed to analyze expression of IL-10 and PDCD4 mRNA. IL-10 (-1082) SNP typing was performed on blood samples of 96 healthy controls. We measured IL-10 (-1082) SNP expression in 26 PVRLs and 52 PCNSLs and examined its relationship with IL-10 protein and gene expression, respectively. More PVRL patients expressed one copy of the IL-10 ( -1082 ) G → A SNP with the GA genotype compared to controls. The frequencies of the three genotypes (AA, AG, GG) significantly differed in PVRL versus controls and in PCNSL versus controls. In PVRLs, the vitreal IL-10/IL-6 ratio was higher in IL-10 (-1082) AG and IL-10 (-1082) AA patients, compared to IL-10 (-1082) GG patients. IL-10 mRNA expression was higher in IL-10 (-1082) AG and IL-10 (-1082) AA PCNSLs, compared to IL-10 (-1082) GG PCNSLs. No correlation was found between IL-10 and PDCD4 expression levels in 37 PCNSL samples. PVRL and PCNSL patients had similar IL-10 (-1082) A allele frequencies, but genotype distributions differed from healthy controls. The findings suggest that the IL-10 (-1082) A allele is a risk factor for higher IL-10 levels in PVRLs and PCNSLs. Higher IL-10 levels have been correlated with more aggressive disease in both PVRLs and PCNSLs, making this finding an important and potentially clinically significant observation.
IL-10 -1082 SNP and IL-10 in primary CNS and vitreoretinal lymphomas
Ramkumar, Hema L.; Shen, De Fen; Tuo, Jingsheng; Braziel, Rita M.; Coupland, Sarah E.; Smith, Justine R.
2012-01-01
Objectives Most primary central nervous system lymphomas (PCNSLs) and primary vitreoretinal lymphomas (PVRLs) are B-cell lymphomas that produce high levels of interleukin (IL)-10, which is linked to rapid disease progression. The IL-10-1082G→A polymorphism (IL-10 SNP) is associated with improved survival in certain non-CNS lymphoma patients. PDCD4 is a tumor suppressor gene and upstream regulator of IL-10. This study examined the correlation between the IL-10 SNP, PDCD4 mRNA expression, and IL-10 expression (at transcript and protein levels) in these lymphoma cells. Materials and methods Single-nucleotide polymorphism (SNP)-typing at IL-10-1082 was performed after micro-dissecting cytospun PVRL cells from 26 specimens. Vitreal IL-10 and IL-6 levels were measured by ELISA. PCNSL cells from 52 paraffin-embedded sections were microdissected and SNP typed on genomic DNA. RT-PCR was performed to analyze expression of IL-10 and PDCD4 mRNA. IL-10-1082 SNP typing was performed on blood samples of 96 healthy controls. We measured IL-10-1082 SNP expression in 26 PVRLs and 52 PCNSLs and examined its relationship with IL-10 protein and gene expression, respectively. Results More PVRL patients expressed one copy of the IL-10-1082G→A SNP with the GA genotype compared to controls. The frequencies of the three genotypes (AA, AG, GG) significantly differed in PVRL versus controls and in PCNSL versus controls. In PVRLs, the vitreal IL-10/IL-6 ratio was higher in IL-10-1082 AG and IL-10-1082 AA patients, compared to IL-10-1082 GG patients. IL-10 mRNA expression was higher in IL-10-1082 AG and IL-10-1082 AA PCNSLs, compared to IL-10-1082 GG PCNSLs. No correlation was found between IL-10 and PDCD4 expression levels in 37 PCNSL samples. Conclusions PVRL and PCNSL patients had similar IL-10-1082 A allele frequencies, but genotype distributions differed from healthy controls. The findings suggest that the IL-10-1082 A allele is a risk factor for higher IL-10 levels in PVRLs and PCNSLs. Higher IL-10 levels have been correlated with more aggressive disease in both PVRLs and PCNSLs, making this finding an important and potentially clinically significant observation. PMID:22628023
Huijsmans, Cornelis J. J.; Poodt, Jeroen; Damen, Jan; van der Linden, Johannes C.; Savelkoul, Paul H. M.; Pruijt, Johannes F. M.; Hilbink, Mirrian; Hermans, Mirjam H. A.
2012-01-01
During tumor development, loss of heterozygosity (LOH) often occurs. When LOH is preceded by an oncogene activating mutation, the mutant allele may be further potentiated if the wild-type allele is lost or inactivated. In myeloproliferative neoplasms (MPN) somatic acquisition of JAK2V617F may be followed by LOH resulting in loss of the wild type allele. The occurrence of LOH in MPN and other proliferative diseases may lead to a further potentiating the mutant allele and thereby increasing morbidity. A real time PCR based SNP profiling assay was developed and validated for LOH detection of the JAK2 region (JAK2LOH). Blood of a cohort of 12 JAK2V617F-positive patients (n = 6 25–50% and n = 6>50% JAK2V617F) and a cohort of 81 patients suspected of MPN was stored with EDTA and subsequently used for validation. To generate germ-line profiles, non-neoplastic formalin-fixed paraffin-embedded tissue from each patient was analyzed. Results of the SNP assay were compared to those of an established Short Tandem Repeat (STR) assay. Both assays revealed JAK2LOH in 1/6 patients with 25–50% JAK2V617F. In patients with >50% JAK2V617F, JAK2LOH was detected in 6/6 by the SNP assay and 5/6 patients by the STR assay. Of the 81 patients suspected of MPN, 18 patients carried JAK2V617F. Both the SNP and STR assay demonstrated the occurrence of JAK2LOH in 5 of them. In the 63 JAK2V617F-negative patients, no JAK2LOH was observed by SNP and STR analyses. The presented SNP assay reliably detects JAK2LOH and is a fast and easy to perform alternative for STR analyses. We therefore anticipate the SNP approach as a proof of principle for the development of LOH SNP-assays for other clinically relevant LOH loci. PMID:22768290
Scurrah, Katrina J; Lamantia, Angela; Ellis, Justine A; Harrap, Stephen B
2017-06-01
Renin-angiotensin-aldosterone system genes have been inconsistently associated with blood pressure, possibly because of unrecognized influences of sex-dependent genetic effects or gene-gene interactions (epistasis). We tested association of systolic blood pressure with single-nucleotide polymorphisms (SNPs) at renin ( REN ), angiotensinogen ( AGT ), angiotensin-converting enzyme ( ACE ), angiotensin II type 1 receptor ( AGTR1 ), and aldosterone synthase ( CYP11B2 ), including sex-SNP or SNP-SNP interactions. Eighty-eight tagSNPs were tested in 2872 white individuals in 809 pedigrees from the Victorian Family Heart Study using variance components models. Three SNPs (rs8075924 and rs4277404 at ACE and rs12721297 at AGTR1 ) were individually associated with lower systolic blood pressure with significant ( P <0.00076) effect sizes ≈1.7 to 2.5 mm Hg. Sex-specific associations were seen for 3 SNPs in men (rs2468523 and rs2478544 at AGT and rs11658531 at ACE ) and 1 SNP in women (rs12451328 at ACE ). SNP-SNP interaction was suggested ( P <0.005) for 14 SNP pairs, none of which had shown individual association with systolic blood pressure. Four SNP pairs were at the same gene (2 for REN , 1 for AGT , and 1 for AGTR1 ). The SNP rs3097 at CYP11B2 was represented in 5 separate pairs. SNPs at key renin-angiotensin-aldosterone system genes associate with systolic blood pressure individually in both sexes, individually in one sex only and only when combined with another SNP. Analyses that incorporate sex-dependent and epistatic effects could reconcile past inconsistencies and account for some of the missing heritability of blood pressure and are generally relevant to SNP association studies for any phenotype. © 2017 American Heart Association, Inc.
Ruettger, Anke; Nieter, Johanna; Skrypnyk, Artem; Engelmann, Ines; Ziegler, Albrecht; Moser, Irmgard; Monecke, Stefan; Ehricht, Ralf
2012-01-01
Membrane-based spoligotyping has been converted to DNA microarray format to qualify it for high-throughput testing. We have shown the assay's validity and suitability for direct typing from tissue and detecting new spoligotypes. Advantages of the microarray methodology include rapidity, ease of operation, automatic data processing, and affordability. PMID:22553239
Ruettger, Anke; Nieter, Johanna; Skrypnyk, Artem; Engelmann, Ines; Ziegler, Albrecht; Moser, Irmgard; Monecke, Stefan; Ehricht, Ralf; Sachse, Konrad
2012-07-01
Membrane-based spoligotyping has been converted to DNA microarray format to qualify it for high-throughput testing. We have shown the assay's validity and suitability for direct typing from tissue and detecting new spoligotypes. Advantages of the microarray methodology include rapidity, ease of operation, automatic data processing, and affordability.
ERIC Educational Resources Information Center
Chang, Ming-Mei; Briggs, George M.
2007-01-01
DNA microarrays are microscopic arrays on a solid surface, typically a glass slide, on which DNA oligonucleotides are deposited or synthesized in a high-density matrix with a predetermined spatial order. Several types of DNA microarrays have been developed and used for various biological studies. Here, we developed an undergraduate laboratory…
Poore, Joshua C.; Barbey, Aron K.; Krueger, Frank; Solomon, Jeffrey; Lipsky, Robert H.; Hodgkinson, Colin A.; Goldman, David; Grafman, Jordan
2012-01-01
This study examined the role of orbitofrontal cortex (OFC) and dorsolateral prefrontal cortex (DLPFC) plasticity in controlling implicit and explicit social biases. Normal controls and patients with varied OFC and DLPFC lesion size and single nucleotide polymorphisms (SNPs) in the brain-derived neurotrophic factor (BDNF) gene, which promotes (methionine–valine [Met/Val] SNP) or stifles (valine–valine [Val/Val] SNP) plasticity in damaged PFC regions, completed measures of implicit and explicit social bias. Patients and controls demonstrated comparable levels of implicit bias, but patients with Met/Val SNPs exhibited less implicit bias when they had smaller OFC lesions compared with Val/Val patients with similar size lesions and those with large OFC lesions. Both patients and controls demonstrated patterns of explicit bias consistent with hypotheses. Patients with Met/Val SNPs exhibited less explicit bias when they had smaller DLPFC lesions sizes compared with Val/Val patients with similar size lesions and those with large DLPFC lesions. OFC lesion size and BDNF SNP type did not moderate explicit bias; DLPFC lesion size and BDNF SNP type did not moderate implicit bias (nor did other medial or lateral regions). Findings suggest that plasticity within specific PFC regions modulates the type and degree of social bias that individuals’ exhibit. PMID:22123938
Multi-locus variable number tandem repeat analysis of 7th pandemic Vibrio cholerae
2012-01-01
Background Seven pandemics of cholera have been recorded since 1817, with the current and ongoing pandemic affecting almost every continent. Cholera remains endemic in developing countries and is still a significant public health issue. In this study we use multilocus variable number of tandem repeats (VNTRs) analysis (MLVA) to discriminate between isolates of the 7th pandemic clone of Vibrio cholerae. Results MLVA of six VNTRs selected from previously published data distinguished 66 V. cholerae isolates collected between 1961–1999 into 60 unique MLVA profiles. Only 4 MLVA profiles consisted of more than 2 isolates. The discriminatory power was 0.995. Phylogenetic analysis showed that, except for the closely related profiles, the relationships derived from MLVA profiles were in conflict with that inferred from Single Nucleotide Polymorphism (SNP) typing. The six SNP groups share consensus VNTR patterns and two SNP groups contained isolates which differed by only one VNTR locus. Conclusions MLVA is highly discriminatory in differentiating 7th pandemic V. cholerae isolates and MLVA data was most useful in resolving the genetic relationships among isolates within groups previously defined by SNPs. Thus MLVA is best used in conjunction with SNP typing in order to best determine the evolutionary relationships among the 7th pandemic V. cholerae isolates and for longer term epidemiological typing. PMID:22624829
Evaluation of Parkinson Disease Risk Variants as Expression-QTLs
Latourelle, Jeanne C.; Dumitriu, Alexandra; Hadzi, Tiffany C.; Beach, Thomas G.; Myers, Richard H.
2012-01-01
The recent Parkinson Disease GWAS Consortium meta-analysis and replication study reports association at several previously confirmed risk loci SNCA, MAPT, GAK/DGKQ, and HLA and identified a novel risk locus at RIT2. To further explore functional consequences of these associations, we investigated modification of gene expression in prefrontal cortex brain samples of pathologically confirmed PD cases (N = 26) and controls (N = 24) by 67 associated SNPs in these 5 loci. Association between the eSNPs and expression was evaluated using a 2-degrees of freedom test of both association and difference in association between cases and controls, adjusted for relevant covariates. SNPs at each of the 5 loci were tested for cis-acting effects on all probes within 250 kb of each locus. Trans-effects of the SNPs on the 39,122 probes passing all QC on the microarray were also examined. From the analysis of cis-acting SNP effects, several SNPs in the MAPT region show significant association to multiple nearby probes, including two strongly correlated probes targeting the gene LOC644246 and the duplicated genes LRRC37A and LRRC37A2, and a third uncorrelated probe targeting the gene DCAKD. Significant cis-associations were also observed between SNPs and two probes targeting genes in the HLA region on chromosome 6. Expanding the association study to examine trans effects revealed an additional 23 SNP-probe associations reaching statistical significance (p<2.8×10−8) including SNPs from the SNCA, MAPT and RIT2 regions. These findings provide additional context for the interpretation of PD associated SNPs identified in recent GWAS as well as potential insight into the mechanisms underlying the observed SNP associations. PMID:23071545
Jiang, Ni; Zhu, Xishan; Zhang, Hongmei; Wang, Xiaoli; Zhou, Xinna; Gu, Jiezhun; Chen, Baoan; Ren, Jun
2014-01-01
Methylenetetrahydrofolate reductase (MTHFR) is the key enzyme for folate metabolism. Previous studies suggest a relationship between its single nucleotide polymorphisms (SNP) of C677T and A1298C with a variety of tumor susceptibility including hematological malignancy. SNP frequency distribution in different ethnic populations might lead to differences in disease susceptibility. There has been little research in Chinese people on the MTHFR SNP with the susceptibility of the hematological malignancy. Therefore, this study investigated the relationship between MTHFR SNPs and hematological malignancy in Jiangsu province in China. Gene microarray was used to detect MTHFR C677T and A1298C single nucleotide polymorphism loci on 157 healthy controls and 127 patients from Jiangsu province with hematological malignancies (30 with multiple myeloma, 28 with non-Hodgkin's lymphoma, 22 with acute lymphoblastic leukemia, 40 with acute myeloid leukemia, and seven with chronic myeloid leukemia). The allele frequency of 677T was 41.3% in patients and 33.1% in controls, showed significant difference (chi2 = 4.08, p = 0.043); 677TT genotype with a high susceptibility to hematological malignancy (OR 1.96, 95% CI 1.01 - 4.45, p = 0.041). In subgroup analyses, the genotypes 677TT and 1298CC were associated with significantly increased multiple myeloma risk (TT vs. CC: OR 8.92, 95% CI 1.06 - 75.24, p = 0.006; CC vs. AA: OR = 4.80, 95% CI 1.56 - 14.73, p = 0.044). No associations were found between polymorphisms and susceptibilities to acute lymphoblastic leukemia, acute myeloid leukemia, or non-Hodgkin's lymphoma. MTHFRC677T polymorphisms influence the risk of hematological malignancy among the population in Jiangsu province. Both MTHFR 677TT and MTHFR 1298CC genotypes increase susceptibility to myeloid leukemia.
USDA-ARS?s Scientific Manuscript database
In this investigation 45 parental cacao plants and five progeny derived from the parental stock studied were genotyped using six SNP markers to determine off-types or mislabeled clones and to authenticate crosses made in the Cocoa Research Institute of Ghana (CRIG) breeding program. Investigation wa...
A functional polymorphism of the TNF-{alpha} gene that is associated with type 2 DM
DOE Office of Scientific and Technical Information (OSTI.GOV)
Susa, Shinji; Daimon, Makoto; Sakabe, Jun-Ichi
2008-05-09
To examine the association of the tumor necrosis factor-{alpha} (TNF-{alpha}) gene region with type 2 diabetes (DM), 11 single-nucleotide polymorphisms (SNPs) of the region were analyzed. The initial study using a sample set (148 cases vs. 227 controls) showed a significant association of the SNP IVS1G + 123A of the TNF-{alpha} gene with DM (p = 0.0056). Multiple logistic regression analysis using an enlarged sample set (225 vs. 716) revealed the significant association of the SNP with DM independently of any clinical traits examined (OR: 1.49, p = 0.014). The functional relevance of the SNP were examined by the electrophoreticmore » mobility shift assays using nuclear extracts from the U937 and NIH3T3 cells and luciferase assays in these cells with Simian virus 40 promoter- and TNF-{alpha} promoter-reporter gene constructs. The functional analyses showed that YY1 transcription factor bound allele-specifically to the SNP region and, the IVS1 + 123A allele had an increase in luciferase expression compared with the G allele.« less
Development of a Digital Microarray with Interferometric Reflectance Imaging
NASA Astrophysics Data System (ADS)
Sevenler, Derin
This dissertation describes a new type of molecular assay for nucleic acids and proteins. We call this technique a digital microarray since it is conceptually similar to conventional fluorescence microarrays, yet it performs enumerative ('digital') counting of the number captured molecules. Digital microarrays are approximately 10,000-fold more sensitive than fluorescence microarrays, yet maintain all of the strengths of the platform including low cost and high multiplexing (i.e., many different tests on the same sample simultaneously). Digital microarrays use gold nanorods to label the captured target molecules. Each gold nanorod on the array is individually detected based on its light scattering, with an interferometric microscopy technique called SP-IRIS. Our optimized high-throughput version of SP-IRIS is able to scan a typical array of 500 spots in less than 10 minutes. Digital DNA microarrays may have utility in applications where sequencing is prohibitively expensive or slow. As an example, we describe a digital microarray assay for gene expression markers of bacterial drug resistance.
Shrinkage regression-based methods for microarray missing value imputation.
Wang, Hsiuying; Chiu, Chia-Chun; Wu, Yi-Ching; Wu, Wei-Sheng
2013-01-01
Missing values commonly occur in the microarray data, which usually contain more than 5% missing values with up to 90% of genes affected. Inaccurate missing value estimation results in reducing the power of downstream microarray data analyses. Many types of methods have been developed to estimate missing values. Among them, the regression-based methods are very popular and have been shown to perform better than the other types of methods in many testing microarray datasets. To further improve the performances of the regression-based methods, we propose shrinkage regression-based methods. Our methods take the advantage of the correlation structure in the microarray data and select similar genes for the target gene by Pearson correlation coefficients. Besides, our methods incorporate the least squares principle, utilize a shrinkage estimation approach to adjust the coefficients of the regression model, and then use the new coefficients to estimate missing values. Simulation results show that the proposed methods provide more accurate missing value estimation in six testing microarray datasets than the existing regression-based methods do. Imputation of missing values is a very important aspect of microarray data analyses because most of the downstream analyses require a complete dataset. Therefore, exploring accurate and efficient methods for estimating missing values has become an essential issue. Since our proposed shrinkage regression-based methods can provide accurate missing value estimation, they are competitive alternatives to the existing regression-based methods.
Kai, M; Nakata, N; Matsuoka, M; Sekizuka, T; Kuroda, M; Makino, M
2013-10-01
Genome analysis of Mycobacterium leprae strain Kyoto-2 in this study revealed characteristic nucleotide substitutions in gene ML0411, compared to the reference genome M. leprae strain TN. The ML0411 gene of Kyoto-2 had six SNPs compared to that of TN. All SNPs in ML0411 were non-synonymous mutations that result in amino acid replacements. In addition, a seventh SNP was found 41 bp upstream of the start codon in the regulatory region. The seven SNP sites in the ML0411 region were investigated by sequencing in 36 M. leprae isolates from the Leprosy Research Center in Japan. The SNP pattern in 14 of the 36 isolates showed similarity to that of Kyoto-2. Determination of the standard SNP types within the 36 stocked isolates revealed that almost all of the Japanese strains belonged to SNP type III, with nucleotide substitutions at position 14676, 164275, and 2935685 of the M. leprae TN genome. The geographical distribution pattern of east Asian M. leprae isolates by discrimination of ML0411 SNPs was investigated and interestingly turned out to be similar to that of tandem repeat numbers of GACATC in the rpoT gene (3 copies or 4 copies), which has been established as a tool for M. leprae genotyping. All seven Korean M. leprae isolates examined in this study, as well as those derived from Honshu Island of Japan, showed 4 copies of the 6-base tandem repeat plus the ML0411 SNPs observed in M. leprae Kyoto-2. They are termed Northeast Asian (NA) strain of M. leprae. On the other hand, many of isolates derived from the Okinawa Islands of Japan and from the Philippines showed 3 copies of the 6-base tandem repeat in addition to the M. leprae TN ML0411 type of SNPs. These results demonstrate the existence of M. leprae strains in Northeast Asian region having characteristic SNP patterns. Copyright © 2013 Elsevier B.V. All rights reserved.
Savio, Andrea J.; Lemire, Mathieu; Mrkonjic, Miralem; Gallinger, Steven; Zanke, Brent W.; Hudson, Thomas J.; Bapat, Bharati
2012-01-01
Single nucleotide polymorphisms (SNPs) are the most common form of genetic variation. We previously demonstrated that SNPs (rs1800734, rs749072, and rs13098279) in the MLH1 gene region are associated with MLH1 promoter island methylation, loss of MLH1 protein expression, and microsatellite instability (MSI) in colorectal cancer (CRC) patients. Recent studies have identified less CpG-dense “shore” regions flanking many CpG islands. These shores often exhibit distinct methylation profiles between different tissues and matched normal versus tumor cells of patients. To date, most epigenetic studies have focused on somatic methylation events occurring within solid tumors; less is known of the contributions of peripheral blood cell (PBC) methylation to processes such as aging and tumorigenesis. To address whether MLH1 methylation in PBCs is correlated with tumorigenesis we utilized the Illumina 450 K microarrays to measure methylation in PBC DNA of 846 healthy controls and 252 CRC patients from Ontario, Canada. Analysis of a region of chromosome 3p21 spanning the MLH1 locus in healthy controls revealed that a CpG island shore 1 kb upstream of the MLH1 gene exhibits different methylation profiles when stratified by SNP genotypes (rs1800734, rs749072, and rs13098279). Individuals with wild-type genotypes incur significantly higher PBC shore methylation than heterozygous or homozygous variant carriers (p<1.1×10−6; ANOVA). This trend is also seen in CRC cases (p<0.096; ANOVA). Shore methylation also decreases significantly with increasing age in cases and controls. This is the first study of its kind to integrate PBC methylation at a CpG island shore with SNP genotype status in CRC cases and controls. These results indicate that CpG island shore methylation in PBCs may be influenced by genotype as well as the normal aging process. PMID:23240038
Sherman, E L; Nkrumah, J D; Murdoch, B M; Li, C; Wang, Z; Fu, A; Moore, S S
2008-01-01
Genes that regulate metabolism and energy partitioning have the potential to influence economically important traits in farm animals, as do polymorphisms within these genes. In the current study, SNP in the bovine neuropeptide Y (NPY), growth hormone receptor (GHR), ghrelin (GHRL), uncoupling proteins 2 and 3 (UCP2 and UCP3), IGF2, corticotrophin-releasing hormone (CRH), cocaine and amphetamine regulated transcript (CART), melanocortin-4 receptor (MC4R), proopiomelanocortin (POMC), and GH genes were evaluated for associations with growth, feed efficiency, and carcass merit in beef steers. In total, 24 SNP were evaluated for associations with these traits and haplotypes were constructed within each gene when 2 or more SNP showed significant associations. An A/G SNP located in intron 4 of the GHR gene had the largest effects on BW of the animals (dominance effect P < 0.01) and feed efficiency (allele substitution effect P < 0.05). Another A/G SNP located in the promoter region of GHR had similar effects but the haplotypes of these 2 SNP reduced the effects of the SNP located in intron 4. Three SNP in the NPY gene showed associations to marbling (P < 0.001) as well as with ADG, BW, and feed conversion ratio (FCR; P < 0.05). The combination of these 3 SNP into haplotypes generally improved the association or had a similar scale of association as each single SNP. Only 1 SNP in UCP3, an A/G SNP in intron 3, was associated with ADG (P = 0.025), partial efficiency of growth, and FCR (P < 0.01). Three SNP in UCP2 gene were in almost complete linkage disequilibrium and showed associations with lean meat yield, yield grade, DMI, and BW (P < 0.05). Haplo-types between the SNP in UCP3 and UCP2 generally reduced the associations seen individually in each SNP. An A/G SNP in the GHRL gene tended to show effects on residual feed intake, FCR, and partial efficiency of growth (P < 0.10). The IGF2 SNP most strongly affected LM area (P < 0.01), back fat, ADG, and FCR (P < 0.05). The SNP in the CART, MC4R, POMC, GH, and CRH genes did not show associations at P < 0.05 with any of the traits. Although most of the SNP that showed associations do not cause amino acid changes, these SNP could be linked to other yet to be detected causative mutations or nearby QTL. It will be very important to verify these results in other cattle populations.
Germline Mutation of the CCK Receptor: A Novel Biomarker for Pancreas Cancer.
Alsubai, Jelal; Matters, Gail L; McGovern, Christopher O; Liao, Jiangang; Gilius, Evan L; Smith, Jill P
2016-01-07
Today, genetic biomarkers have been demonstrated to play an important role in identifying at-risk subjects for familial or inherited cancers. We have identified a single-nucleotide polymorphism (SNP) that results in missplicing of the cholecystokinin (CCK) receptor gene and expressing a larger mutated receptor in pancreatic cancer. The purpose of this study was to evaluate the significance and specificity of this SNP as a potential biomarker in patients with pancreatic cancer compared with other gastrointestinal (GI) cancers that also have CCK receptors. DNA was isolated and genotyped for the CCK receptor SNP from frozen tumor tissue from banked specimens of patients with pancreas, gastric, or colon cancer and from human cancer cell lines. Genotype and allelic frequencies were compared between the cancer cohort and two normal control databases using Fisher's exact test and odds ratio (OR). The Kaplan-Meier method was used to estimate the survival for patients with the CCK-B receptor SNP compared with those with the wild-type genotype. Immunohistochemical staining of cancer cells was done to detect the mutated receptor. Colon and gastric cancer patients had similar genotype frequencies for the CCK receptor SNP as that reported in the normal population. In contrast, the prevalence of the SNP in subjects with pancreatic cancer was twice that of controls and other GI cancers. Survival was adversely affected by the presence of the SNP only in those with pancreatic cancer. Immunoreactivity for the mutated receptor was positive in pancreatic cancer tissues with the SNP but absent in other GI cancers. A SNP of the CCK receptor is significantly increased in patients with pancreatic cancer but not in those with other GI malignancies. Therefore, this SNP may be a potential biomarker for pancreatic cancer.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gardner, S; Jaing, C
The goal of this project is to develop forensic genotyping assays for select agent viruses, addressing a significant capability gap for the viral bioforensics and law enforcement community. We used a multipronged approach combining bioinformatics analysis, PCR-enriched samples, microarrays and TaqMan assays to develop high resolution and cost effective genotyping methods for strain level forensic discrimination of viruses. We have leveraged substantial experience and efficiency gained through year 1 on software development, SNP discovery, TaqMan signature design and phylogenetic signature mapping to scale up the development of forensics signatures in year 2. In this report, we have summarized the Taqmanmore » signature development for South American hemorrhagic fever viruses, tick-borne encephalitis viruses and henipaviruses, Old World Arenaviruses, filoviruses, Crimean-Congo hemorrhagic fever virus, Rift Valley fever virus and Japanese encephalitis virus.« less
Lopez, G H; Condon, J A; Wilson, B; Martin, J R; Liew, Y-W; Flower, R L; Hyland, C A
2015-01-01
An Australian Caucasian blood donor consistently presented a serology profile for the Duffy blood group as Fy(a+b+) with Fy(a) antigen expression weaker than other examples of Fy(a+b+) red cells. Molecular typing studies were performed to investigate the reason for the observed serology profile. Blood group genotyping was performed using a commercial SNP microarray platform. Sanger sequencing was performed using primer sets to amplify across exons 1 and 2 of the FY gene and using allele-specific primers. The propositus was genotyped as FY*A/B, FY*X heterozygote that predicted the Fy(a+b+(w) ) phenotype. Sequencing identified the 265T and 298A variants on the FY*A allele. This link between FY*A allele and 265T was confirmed by allele-specific PCR. The reduced Fy(a) antigen reactivity is attributed to a FY*A allele-carrying 265T and 298A variants previously defined in combination only with the FY*B allele and associated with weak Fy(b) antigen expression. This novel allele should be considered in genotyping interpretative algorithms for generating a predicted phenotype. © 2014 International Society of Blood Transfusion.
McClure, Matthew C.; McCarthy, John; Flynn, Paul; McClure, Jennifer C.; Dair, Emma; O'Connell, D. K.; Kearney, John F.
2018-01-01
A major use of genetic data is parentage verification and identification as inaccurate pedigrees negatively affect genetic gain. Since 2012 the international standard for single nucleotide polymorphism (SNP) verification in Bos taurus cattle has been the ISAG SNP panels. While these ISAG panels provide an increased level of parentage accuracy over microsatellite markers (MS), they can validate the wrong parent at ≤1% misconcordance rate levels, indicating that more SNP are needed if a more accurate pedigree is required. With rapidly increasing numbers of cattle being genotyped in Ireland that represent 61 B. taurus breeds from a wide range of farm types: beef/dairy, AI/pedigree/commercial, purebred/crossbred, and large to small herd size the Irish Cattle Breeding Federation (ICBF) analyzed different SNP densities to determine that at a minimum ≥500 SNP are needed to consistently predict only one set of parents at a ≤1% misconcordance rate. For parentage validation and prediction ICBF uses 800 SNP (ICBF800) selected based on SNP clustering quality, ISAG200 inclusion, call rate (CR), and minor allele frequency (MAF) in the Irish cattle population. Large datasets require sample and SNP quality control (QC). Most publications only deal with SNP QC via CR, MAF, parent-progeny conflicts, and Hardy-Weinberg deviation, but not sample QC. We report here parentage, SNP QC, and a genomic sample QC pipelines to deal with the unique challenges of >1 million genotypes from a national herd such as SNP genotype errors from mis-tagging of animals, lab errors, farm errors, and multiple other issues that can arise. We divide the pipeline into two parts: a Genotype QC and an Animal QC pipeline. The Genotype QC identifies samples with low call rate, missing or mixed genotype classes (no BB genotype or ABTG alleles present), and low genotype frequencies. The Animal QC handles situations where the genotype might not belong to the listed individual by identifying: >1 non-matching genotypes per animal, SNP duplicates, sex and breed prediction mismatches, parentage and progeny validation results, and other situations. The Animal QC pipeline make use of ICBF800 SNP set where appropriate to identify errors in a computationally efficient yet still highly accurate method. PMID:29599798
Gene-Environment Interaction in the Etiology of Mathematical Ability Using SNP Sets
Kovas, Yulia; Plomin, Robert
2010-01-01
Mathematics ability and disability is as heritable as other cognitive abilities and disabilities, however its genetic etiology has received relatively little attention. In our recent genome-wide association study of mathematical ability in 10-year-old children, 10 SNP associations were nominated from scans of pooled DNA and validated in an individually genotyped sample. In this paper, we use a ‘SNP set’ composite of these 10 SNPs to investigate gene-environment (GE) interaction, examining whether the association between the 10-SNP set and mathematical ability differs as a function of ten environmental measures in the home and school in a sample of 1888 children with complete data. We found two significant GE interactions for environmental measures in the home and the school both in the direction of the diathesis-stress type of GE interaction: The 10-SNP set was more strongly associated with mathematical ability in chaotic homes and when parents are negative. PMID:20978832
Fang, Wanping; Meinhardt, Lyndel W; Mischke, Sue; Bellato, Cláudia M; Motilal, Lambert; Zhang, Dapeng
2014-01-15
Cacao (Theobroma cacao L.), the source of cocoa, is an economically important tropical crop. One problem with the premium cacao market is contamination with off-types adulterating raw premium material. Accurate determination of the genetic identity of single cacao beans is essential for ensuring cocoa authentication. Using nanofluidic single nucleotide polymorphism (SNP) genotyping with 48 SNP markers, we generated SNP fingerprints for small quantities of DNA extracted from the seed coat of single cacao beans. On the basis of the SNP profiles, we identified an assumed adulterant variety, which was unambiguously distinguished from the authentic beans by multilocus matching. Assignment tests based on both Bayesian clustering analysis and allele frequency clearly separated all 30 authentic samples from the non-authentic samples. Distance-based principle coordinate analysis further supported these results. The nanofluidic SNP protocol, together with forensic statistical tools, is sufficiently robust to establish authentication and to verify gourmet cacao varieties. This method shows significant potential for practical application.
Controlling false-negative errors in microarray differential expression analysis: a PRIM approach.
Cole, Steve W; Galic, Zoran; Zack, Jerome A
2003-09-22
Theoretical considerations suggest that current microarray screening algorithms may fail to detect many true differences in gene expression (Type II analytic errors). We assessed 'false negative' error rates in differential expression analyses by conventional linear statistical models (e.g. t-test), microarray-adapted variants (e.g. SAM, Cyber-T), and a novel strategy based on hold-out cross-validation. The latter approach employs the machine-learning algorithm Patient Rule Induction Method (PRIM) to infer minimum thresholds for reliable change in gene expression from Boolean conjunctions of fold-induction and raw fluorescence measurements. Monte Carlo analyses based on four empirical data sets show that conventional statistical models and their microarray-adapted variants overlook more than 50% of genes showing significant up-regulation. Conjoint PRIM prediction rules recover approximately twice as many differentially expressed transcripts while maintaining strong control over false-positive (Type I) errors. As a result, experimental replication rates increase and total analytic error rates decline. RT-PCR studies confirm that gene inductions detected by PRIM but overlooked by other methods represent true changes in mRNA levels. PRIM-based conjoint inference rules thus represent an improved strategy for high-sensitivity screening of DNA microarrays. Freestanding JAVA application at http://microarray.crump.ucla.edu/focus
Yu, M Y; Zhao, P Q; Yan, X H; Liu, B; Zhang, Q Q; Wang, R; Ma, C H; Liang, X H; Zhu, F L; Gao, L F
2013-09-10
Tumor necrosis factor (TNF)-related apoptosis-inducing ligand (TRAIL) is expressed in different tissues and cells, including the pancreas and lymphocytes, and it can selectively induce apoptosis in tumor cells but not in most normal cells. TRAIL plays critical roles in type 1 diabetes mellitus, and is involved in type 2 diabetes mellitus (T2DM). We recently discovered the association of nonalcoholic fatty liver disease, a risk factor for T2DM, with a single nucleotide polymorphism (SNP) in the TRAIL (TNFSF10) gene at site 1595C/T (rs1131580), indicating the possible association of T2DM with this TRAIL polymorphism. The aim of this study was to investigate the relationship of the TRAIL SNP at site 1595C/T (rs1131580) with T2DM susceptibility and the biometabolic parameters of T2DM in a Han Chinese population. The polymerase chain reaction-restriction fragment length polymorphism method was used to genotype SNP rs1131580 in 292 patients with T2DM and 266 healthy controls. We found that the frequency of the CC genotype and that of the C allele of rs1131580 were significantly higher in T2DM patients than in the control group. Additionally, the triglyceride and serum creatinine levels of T2DM patients with the CC genotype were significantly higher than those of patients with the TT genotype. Thus, the CC genotype of the TRAIL SNP at 1595C/T (rs1131580) confers increased susceptible to T2DM in a Han Chinese population from Shandong Province. These data suggest that the CC genotype at this SNP is related to diabetic severity and it might be a candidate for the prognostic assessment of T2DM.
Samusik, Nikolay; Krukovskaya, Larisa; Meln, Irina; Shilov, Evgeny; Kozlov, Andrey P.
2013-01-01
PBOV1 is a known human protein-coding gene with an uncharacterized function. We have previously found that PBOV1 lacks orthologs in non-primate genomes and is expressed in a wide range of tumor types. Here we report that PBOV1 protein-coding sequence is human-specific and has originated de novo in the primate evolution through a series of frame-shift and stop codon mutations. We profiled PBOV1 expression in multiple cancer and normal tissue samples and found that it was expressed in 19 out of 34 tumors of various origins but completely lacked expression in any of the normal adult or fetal human tissues. We found that, unlike the cancer/testis antigens that are typically controlled by CpG island-containing promoters, PBOV1 was expressed from a GC-poor TATA-containing promoter which was not influenced by CpG demethylation and was inactive in testis. Our analysis of public microarray data suggests that PBOV1 activation in tumors could be dependent on the Hedgehog signaling pathway. Despite the recent de novo origin and the lack of identifiable functional signatures, a missense SNP in the PBOV1 coding sequence has been previously associated with an increased risk of breast cancer. Using publicly available microarray datasets, we found that high levels of PBOV1 expression in breast cancer and glioma samples were significantly associated with a positive outcome of the cancer disease. We also found that PBOV1 was highly expressed in primary but not in recurrent high-grade gliomas, suggesting the presence of a negative selection against PBOV1-expressing cancer cells. Our findings could contribute to the understanding of the mechanisms behind de novo gene origin and the possible role of tumors in this process. PMID:23418531
Issues in the analysis of oligonucleotide tiling microarrays for transcript mapping
NASA Technical Reports Server (NTRS)
Royce, Thomas E.; Rozowsky, Joel S.; Bertone, Paul; Samanta, Manoj; Stolc, Viktor; Weissman, Sherman; Snyder, Michael; Gerstein, Mark
2005-01-01
Traditional microarrays use probes complementary to known genes to quantitate the differential gene expression between two or more conditions. Genomic tiling microarray experiments differ in that probes that span a genomic region at regular intervals are used to detect the presence or absence of transcription. This difference means the same sets of biases and the methods for addressing them are unlikely to be relevant to both types of experiment. We introduce the informatics challenges arising in the analysis of tiling microarray experiments as open problems to the scientific community and present initial approaches for the analysis of this nascent technology.
A 48 SNP set for grapevine cultivar identification
2011-01-01
Background Rapid and consistent genotyping is an important requirement for cultivar identification in many crop species. Among them grapevine cultivars have been the subject of multiple studies given the large number of synonyms and homonyms generated during many centuries of vegetative multiplication and exchange. Simple sequence repeat (SSR) markers have been preferred until now because of their high level of polymorphism, their codominant nature and their high profile repeatability. However, the rapid application of partial or complete genome sequencing approaches is identifying thousands of single nucleotide polymorphisms (SNP) that can be very useful for such purposes. Although SNP markers are bi-allelic, and therefore not as polymorphic as microsatellites, the high number of loci that can be multiplexed and the possibilities of automation as well as their highly repeatable results under any analytical procedure make them the future markers of choice for any type of genetic identification. Results We analyzed over 300 SNP in the genome of grapevine using a re-sequencing strategy in a selection of 11 genotypes. Among the identified polymorphisms, we selected 48 SNP spread across all grapevine chromosomes with allele frequencies balanced enough as to provide sufficient information content for genetic identification in grapevine allowing for good genotyping success rate. Marker stability was tested in repeated analyses of a selected group of cultivars obtained worldwide to demonstrate their usefulness in genetic identification. Conclusions We have selected a set of 48 stable SNP markers with a high discrimination power and a uniform genome distribution (2-3 markers/chromosome), which is proposed as a standard set for grapevine (Vitis vinifera L.) genotyping. Any previous problems derived from microsatellite allele confusion between labs or the need to run reference cultivars to identify allele sizes disappear using this type of marker. Furthermore, because SNP markers are bi-allelic, allele identification and genotype naming are extremely simple and genotypes obtained with different equipments and by different laboratories are always fully comparable. PMID:22060012
Missing value imputation for microarray data: a comprehensive comparison study and a web tool.
Chiu, Chia-Chun; Chan, Shih-Yao; Wang, Chung-Ching; Wu, Wei-Sheng
2013-01-01
Microarray data are usually peppered with missing values due to various reasons. However, most of the downstream analyses for microarray data require complete datasets. Therefore, accurate algorithms for missing value estimation are needed for improving the performance of microarray data analyses. Although many algorithms have been developed, there are many debates on the selection of the optimal algorithm. The studies about the performance comparison of different algorithms are still incomprehensive, especially in the number of benchmark datasets used, the number of algorithms compared, the rounds of simulation conducted, and the performance measures used. In this paper, we performed a comprehensive comparison by using (I) thirteen datasets, (II) nine algorithms, (III) 110 independent runs of simulation, and (IV) three types of measures to evaluate the performance of each imputation algorithm fairly. First, the effects of different types of microarray datasets on the performance of each imputation algorithm were evaluated. Second, we discussed whether the datasets from different species have different impact on the performance of different algorithms. To assess the performance of each algorithm fairly, all evaluations were performed using three types of measures. Our results indicate that the performance of an imputation algorithm mainly depends on the type of a dataset but not on the species where the samples come from. In addition to the statistical measure, two other measures with biological meanings are useful to reflect the impact of missing value imputation on the downstream data analyses. Our study suggests that local-least-squares-based methods are good choices to handle missing values for most of the microarray datasets. In this work, we carried out a comprehensive comparison of the algorithms for microarray missing value imputation. Based on such a comprehensive comparison, researchers could choose the optimal algorithm for their datasets easily. Moreover, new imputation algorithms could be compared with the existing algorithms using this comparison strategy as a standard protocol. In addition, to assist researchers in dealing with missing values easily, we built a web-based and easy-to-use imputation tool, MissVIA (http://cosbi.ee.ncku.edu.tw/MissVIA), which supports many imputation algorithms. Once users upload a real microarray dataset and choose the imputation algorithms, MissVIA will determine the optimal algorithm for the users' data through a series of simulations, and then the imputed results can be downloaded for the downstream data analyses.
Gallo Vaulet, Lucía; Entrocassi, Carolina; Portu, Ana I; Castro, Erica; Di Bartolomeo, Susana; Ruettger, Anke; Sachse, Konrad; Rodriguez Fermepin, Marcelo
2016-01-01
Chlamydia trachomatis is one of the most common sexually transmitted infections worldwide. Based on sequence variation in the ompA gene encoding the major outer membrane protein, the genotyping scheme distinguishes 17 recognized genotypes, i.e. A, B, Ba, C, D, Da, E, F, G, H, I, Ia, J, K, L1, L2, and L3. Genotyping is an important tool for epidemiological tracking of C. trachomatis infections, including the revelation of transmission pathways and association with tissue tropism and pathogenicity. Moreover, genotyping can be useful for clinicians to establish the correct treatment when LGV strains are detected. Recently a microarray assay was described that offers several advantages, such as rapidity, ease of standardization and detection of mixed infections. The aim of this study was to evaluate the performance of the DNA microarray-based assay for C. trachomatis genotyping of clinical samples already typed by PCR-RFLP from South America. The agreement between both typing techniques was 90.05% and the overall genotype distribution obtained with both techniques was similar. Detection of mixed-genotype infections was significantly higher using the microarray assay (8.4% of cases) compared to PCR-RFLP (0.5%). Among 178 samples, the microarray assay identified 10 ompA genotypes, i.e. D, Da, E, F, G, H, I, J, K and L2. The most predominant type was genotype E, followed by D and F.
Haraksingh, Rajini R; Abyzov, Alexej; Urban, Alexander Eckehart
2017-04-24
High-resolution microarray technology is routinely used in basic research and clinical practice to efficiently detect copy number variants (CNVs) across the entire human genome. A new generation of arrays combining high probe densities with optimized designs will comprise essential tools for genome analysis in the coming years. We systematically compared the genome-wide CNV detection power of all 17 available array designs from the Affymetrix, Agilent, and Illumina platforms by hybridizing the well-characterized genome of 1000 Genomes Project subject NA12878 to all arrays, and performing data analysis using both manufacturer-recommended and platform-independent software. We benchmarked the resulting CNV call sets from each array using a gold standard set of CNVs for this genome derived from 1000 Genomes Project whole genome sequencing data. The arrays tested comprise both SNP and aCGH platforms with varying designs and contain between ~0.5 to ~4.6 million probes. Across the arrays CNV detection varied widely in number of CNV calls (4-489), CNV size range (~40 bp to ~8 Mbp), and percentage of non-validated CNVs (0-86%). We discovered strikingly strong effects of specific array design principles on performance. For example, some SNP array designs with the largest numbers of probes and extensive exonic coverage produced a considerable number of CNV calls that could not be validated, compared to designs with probe numbers that are sometimes an order of magnitude smaller. This effect was only partially ameliorated using different analysis software and optimizing data analysis parameters. High-resolution microarrays will continue to be used as reliable, cost- and time-efficient tools for CNV analysis. However, different applications tolerate different limitations in CNV detection. Our study quantified how these arrays differ in total number and size range of detected CNVs as well as sensitivity, and determined how each array balances these attributes. This analysis will inform appropriate array selection for future CNV studies, and allow better assessment of the CNV-analytical power of both published and ongoing array-based genomics studies. Furthermore, our findings emphasize the importance of concurrent use of multiple analysis algorithms and independent experimental validation in array-based CNV detection studies.
Measuring firm size distribution with semi-nonparametric densities
NASA Astrophysics Data System (ADS)
Cortés, Lina M.; Mora-Valencia, Andrés; Perote, Javier
2017-11-01
In this article, we propose a new methodology based on a (log) semi-nonparametric (log-SNP) distribution that nests the lognormal and enables better fits in the upper tail of the distribution through the introduction of new parameters. We test the performance of the lognormal and log-SNP distributions capturing firm size, measured through a sample of US firms in 2004-2015. Taking different levels of aggregation by type of economic activity, our study shows that the log-SNP provides a better fit of the firm size distribution. We also formally introduce the multivariate log-SNP distribution, which encompasses the multivariate lognormal, to analyze the estimation of the joint distribution of the value of the firm's assets and sales. The results suggest that sales are a better firm size measure, as indicated by other studies in the literature.
Goldstone, Robert J.; McLuckie, Joyce; Smith, David G. E.
2015-01-01
Typing of Mycobacterium avium subspecies paratuberculosis strains presents a challenge, since they are genetically monomorphic and traditional molecular techniques have limited discriminatory power. The recent advances and availability of whole-genome sequencing have extended possibilities for the characterization of Mycobacterium avium subspecies paratuberculosis, and whole-genome sequencing can provide a phylogenetic context to facilitate global epidemiology studies. In this study, we developed a single nucleotide polymorphism (SNP) assay based on PCR and restriction enzyme digestion or sequencing of the amplified product. The SNP analysis was performed using genome sequence data from 133 Mycobacterium avium subspecies paratuberculosis isolates with different genotypes from 8 different host species and 17 distinct geographic regions around the world. A total of 28,402 SNPs were identified among all of the isolates. The minimum number of SNPs required to distinguish between all of the 133 genomes was 93 and between only the type C isolates was 41. To reduce the number of SNPs and PCRs required, we adopted an approach based on sequential detection of SNPs and a decision tree. By the analysis of 14 SNPs Mycobacterium avium subspecies paratuberculosis isolates can be characterized within 14 phylogenetic groups with a higher discriminatory power than mycobacterial interspersed repetitive unit–variable number tandem repeat assay and other typing methods. Continuous updating of genome sequences is needed in order to better characterize new phylogenetic groups and SNP profiles. The novel SNP assay is a discriminative, simple, reproducible method and requires only basic laboratory equipment for the large-scale global typing of Mycobacterium avium subspecies paratuberculosis isolates. PMID:26677250
HLA Type Inference via Haplotypes Identical by Descent
NASA Astrophysics Data System (ADS)
Setty, Manu N.; Gusev, Alexander; Pe'Er, Itsik
The Human Leukocyte Antigen (HLA) genes play a major role in adaptive immune response and are used to differentiate self antigens from non self ones. HLA genes are hyper variable with nearly every locus harboring over a dozen alleles. This variation plays an important role in susceptibility to multiple autoimmune diseases and needs to be matched on for organ transplantation. Unfortunately, HLA typing by serological methods is time consuming and expensive compared to high throughput Single Nucleotide Polymorphism (SNP) data. We present a new computational method to infer per-locus HLA types using shared segments Identical By Descent (IBD), inferred from SNP genotype data. IBD information is modeled as graph where shared haplotypes are explored among clusters of individuals with known and unknown HLA types to identify the latter. We analyze performance of the method in a previously typed subset of the HapMap population, achieving accuracy of 96% in HLA-A, 94% in HLA-B, 95% in HLA-C, 77% in HLA-DR1, 93% in HLA-DQA1 and 90% in HLA-DQB1 genes. We compare our method to a tag SNP based approach and demonstrate higher sensitivity and specificity. Our method demonstrates the power of using shared haplotype segments for large-scale imputation at the HLA locus.
Loughlin, J; Sinsheimer, J S; Mustafa, Z; Carr, A J; Clipsham, K; Bloomfield, V A; Chitnavis, J; Bailey, A; Sykes, B; Chapman, K
2000-03-01
Evidence has accumulated supporting a role for genes in the etiology of osteoarthritis (OA). Several candidates have been targeted as potential susceptibility loci including genes that are involved in the regulation of bone density. Genetic association analysis has suggested a role for the vitamin D receptor gene (VDR) and the estrogen receptor gene (ER) in susceptibility. Such findings must be tested in additional independent cohorts. We tested for association of these 2 genes, plus a third gene implicated in bone density, COL1A1, with idiopathic OA. A case-control cohort of 371 affected probands and 369 unaffected spouses was used. Association was tested using 4 intragenic single nucleotide polymorphisms (SNP), one each for the VDR and COL1A1 genes, and 2 for the ER gene. The VDR and ER SNP are the same SNP that have been associated with OA. All 4 SNP affect restriction enzyme sites and were genotyped using polymerase chain reaction and enzyme digestion. Allele and genotype distributions for each SNP were compared between cases and controls and analyzed using Fisher's exact test. There was no evidence of association of the VDR or the ER gene SNP to OA. There was weak evidence of association of the COL1A1 SNP in female cases (p = 0.017), reflected by a difference in the distribution of genotypes at this SNP between female cases and controls (p = 0.027). However, when corrected for multiple testing, these results were not significant. If the VDR, ER, or COL1A1 genes do encode predisposition to OA then the 4 SNP tested are not associated with major susceptibility alleles at these 3 loci.
SNP-VISTA: An interactive SNP visualization tool
Shah, Nameeta; Teplitsky, Michael V; Minovitsky, Simon; Pennacchio, Len A; Hugenholtz, Philip; Hamann, Bernd; Dubchak, Inna L
2005-01-01
Background Recent advances in sequencing technologies promise to provide a better understanding of the genetics of human disease as well as the evolution of microbial populations. Single Nucleotide Polymorphisms (SNPs) are established genetic markers that aid in the identification of loci affecting quantitative traits and/or disease in a wide variety of eukaryotic species. With today's technological capabilities, it has become possible to re-sequence a large set of appropriate candidate genes in individuals with a given disease in an attempt to identify causative mutations. In addition, SNPs have been used extensively in efforts to study the evolution of microbial populations, and the recent application of random shotgun sequencing to environmental samples enables more extensive SNP analysis of co-occurring and co-evolving microbial populations. The program is available at [1]. Results We have developed and present two modifications of an interactive visualization tool, SNP-VISTA, to aid in the analyses of the following types of data: A. Large-scale re-sequence data of disease-related genes for discovery of associated and/or causative alleles (GeneSNP-VISTA). B. Massive amounts of ecogenomics data for studying homologous recombination in microbial populations (EcoSNP-VISTA). The main features and capabilities of SNP-VISTA are: 1) mapping of SNPs to gene structure; 2) classification of SNPs, based on their location in the gene, frequency of occurrence in samples and allele composition; 3) clustering, based on user-defined subsets of SNPs, highlighting haplotypes as well as recombinant sequences; 4) integration of protein evolutionary conservation visualization; and 5) display of automatically calculated recombination points that are user-editable. Conclusion The main strength of SNP-VISTA is its graphical interface and use of visual representations, which support interactive exploration and hence better understanding of large-scale SNP data by the user. PMID:16336665
Soria, L A; Corva, P M; Branda Sica, A; Villarreal, E L; Melucci, L M; Mezzadra, C A; Papaleo Mazzucco, J; Fernández Macedo, G; Silvestro, C; Schor, A; Miquel, M C
2009-12-01
The PPARGC1A gene (peroxysome proliferator-activated receptor-gamma coactivator 1alpha gene) controls muscle fiber type and brown adipocyte differentiation; therefore, it is a candidate gene for beef quality traits (tenderness and fat content). Two SNPs (Single Nucleotide Polymorphisms) were identified within exon 8 by multiple alignment of DNA sequences obtained from 24 bulls: a transition G/A (SNP 1181) and a transversion A/T (SNP 1299). The SNP 1181 is a novel SNP, corresponding to a non-conservative substitution (AGT/AAT) that could be the cause of amino acid substitution ((364)Serine/(364)Asparagine). A Mismatch PCR method was designed to determine genotypes of 73 bulls and 268 steers for SNP 1181. Growth, slaughter and meat quality information were available for the group of steers. Allele A of SNP 1181 was not found in Angus. In 243 steers, no significant differences (P > 0.05) were found for either final live body weight, gain in backfat thickness in Spring, kidney fat weight, kidney fat percentage, Warner-Bratzler shear force at 7 days postmortem, intramuscular fat percentage or meat colour between genotype GG and AG. This SNP could be included in breed composition and population admixture analyses because there are marked differences in allelic frequencies between Bos taurus and Bos indicus breeds.
Haase, A; Mantion, A; Graf, P; Plendl, J; Thuenemann, A F; Meier, W; Taubert, A; Luch, A
2012-07-01
Silver nanoparticles (SNPs) are among the most commercialized nanoparticles worldwide. Often SNP are used because of their antibacterial properties. Besides that they possess unique optic and catalytic features, making them highly interesting for the creation of novel and advanced functional materials. Despite its widespread use only little data exist in terms of possible adverse effects of SNP on human health. Conventional synthesis routes usually yield products of varying quality and property. It thus may become puzzling to compare biological data from different studies due to the great variety in sizes, coatings or shapes of the particles applied. Here, we applied a novel synthesis approach to obtain SNP of well-defined colloidal and structural properties. Being stabilized by a covalently linked small peptide, these particles are nicely homogenous, with narrow size distribution, and form monodisperse suspensions in aqueous solutions. We applied these peptide-coated SNP in two different sizes of 20 or 40 nm (Ag20Pep and Ag40Pep) and analyzed responses of THP-1-derived human macrophages while being exposed against these particles. Gold nanoparticles of similar size and coating (Au20Pep) were used for comparison. The cytotoxicity of particles was assessed by WST-1 and LDH assays, and the uptake into the cells was confirmed via transmission electron microscopy. In summary, our data demonstrate that this novel type of SNP is well suited to serve as model system for nanoparticles to be tested in toxicological studies in vitro.
Performance of the SNPforID 52 SNP-plex assay in paternity testing.
Børsting, Claus; Sanchez, Juan J; Hansen, Hanna E; Hansen, Anders J; Bruun, Hanne Q; Morling, Niels
2008-09-01
The performance of a multiplex assay with 52 autosomal single nucleotide polymorphisms (SNPs) developed for human identification was tested on 124 mother-child-father trios. The typical paternity indices (PIs) were 10(5)-10(6) for the trios and 10(3)-10(4) for the child-father duos. Using the SNP profiles from the randomly selected trios and 700 previously typed individuals, a total of 83,096 comparisons between mother, child and an unrelated man were performed. On average, 9-10 mismatches per comparison were detected. Four mismatches were genetic inconsistencies and 5-6 mismatches were opposite homozygosities. In only two of the 83,096 comparisons did an unrelated man match perfectly to a mother-child duo, and in both cases the PI of the true father was much higher than the PI of the unrelated man. The trios were also typed for 15 short tandem repeats (STRs) and seven variable number of tandem repeats (VNTRs). The typical PIs based on 15 STRs or seven VNTRs were 5-50 times higher than the typical PIs based on 52 SNPs. Six mutations in tandem repeats were detected among the randomly selected trios. In contrast, there was not found any mutations in the SNP loci. The results showed that the 52 SNP-plex assay is a very useful alternative to currently used methods in relationship testing. The usefulness of SNP markers with low mutation rates in paternity and immigration casework is discussed.
Detection and strain typing of ancient Mycobacterium leprae from a medieval leprosy hospital.
Taylor, G Michael; Tucker, Katie; Butler, Rachel; Pike, Alistair W G; Lewis, Jamie; Roffey, Simon; Marter, Philip; Lee, Oona Y-C; Wu, Houdini H T; Minnikin, David E; Besra, Gurdyal S; Singh, Pushpendra; Cole, Stewart T; Stewart, Graham R
2013-01-01
Nine burials excavated from the Magdalen Hill Archaeological Research Project (MHARP) in Winchester, UK, showing skeletal signs of lepromatous leprosy (LL) have been studied using a multidisciplinary approach including osteological, geochemical and biomolecular techniques. DNA from Mycobacterium leprae was amplified from all nine skeletons but not from control skeletons devoid of indicative pathology. In several specimens we corroborated the identification of M. leprae with detection of mycolic acids specific to the cell wall of M. leprae and persistent in the skeletal samples. In five cases, the preservation of the material allowed detailed genotyping using single-nucleotide polymorphism (SNP) and multiple locus variable number tandem repeat analysis (MLVA). Three of the five cases proved to be infected with SNP type 3I-1, ancestral to contemporary M. leprae isolates found in southern states of America and likely carried by European migrants. From the remaining two burials we identified, for the first time in the British Isles, the occurrence of SNP type 2F. Stable isotope analysis conducted on tooth enamel taken from two of the type 3I-1 and one of the type 2F remains revealed that all three individuals had probably spent their formative years in the Winchester area. Previously, type 2F has been implicated as the precursor strain that migrated from the Middle East to India and South-East Asia, subsequently evolving to type 1 strains. Thus we show that type 2F had also spread westwards to Britain by the early medieval period.
Detection and Strain Typing of Ancient Mycobacterium leprae from a Medieval Leprosy Hospital
Taylor, G. Michael; Tucker, Katie; Butler, Rachel; Pike, Alistair W. G.; Lewis, Jamie; Roffey, Simon; Marter, Philip; Lee, Oona Y-C; Wu, Houdini H. T.; Minnikin, David E.; Besra, Gurdyal S.; Singh, Pushpendra; Cole, Stewart T.; Stewart, Graham R.
2013-01-01
Nine burials excavated from the Magdalen Hill Archaeological Research Project (MHARP) in Winchester, UK, showing skeletal signs of lepromatous leprosy (LL) have been studied using a multidisciplinary approach including osteological, geochemical and biomolecular techniques. DNA from Mycobacterium leprae was amplified from all nine skeletons but not from control skeletons devoid of indicative pathology. In several specimens we corroborated the identification of M. leprae with detection of mycolic acids specific to the cell wall of M. leprae and persistent in the skeletal samples. In five cases, the preservation of the material allowed detailed genotyping using single-nucleotide polymorphism (SNP) and multiple locus variable number tandem repeat analysis (MLVA). Three of the five cases proved to be infected with SNP type 3I-1, ancestral to contemporary M. leprae isolates found in southern states of America and likely carried by European migrants. From the remaining two burials we identified, for the first time in the British Isles, the occurrence of SNP type 2F. Stable isotope analysis conducted on tooth enamel taken from two of the type 3I-1 and one of the type 2F remains revealed that all three individuals had probably spent their formative years in the Winchester area. Previously, type 2F has been implicated as the precursor strain that migrated from the Middle East to India and South-East Asia, subsequently evolving to type 1 strains. Thus we show that type 2F had also spread westwards to Britain by the early medieval period. PMID:23638071
Fiber-optic microarray for simultaneous detection of multiple harmful algal bloom species.
Ahn, Soohyoun; Kulis, David M; Erdner, Deana L; Anderson, Donald M; Walt, David R
2006-09-01
Harmful algal blooms (HABs) are a serious threat to coastal resources, causing a variety of impacts on public health, regional economies, and ecosystems. Plankton analysis is a valuable component of many HAB monitoring and research programs, but the diversity of plankton poses a problem in discriminating toxic from nontoxic species using conventional detection methods. Here we describe a sensitive and specific sandwich hybridization assay that combines fiber-optic microarrays with oligonucleotide probes to detect and enumerate the HAB species Alexandrium fundyense, Alexandrium ostenfeldii, and Pseudo-nitzschia australis. Microarrays were prepared by loading oligonucleotide probe-coupled microspheres (diameter, 3 mum) onto the distal ends of chemically etched imaging fiber bundles. Hybridization of target rRNA from HAB cells to immobilized probes on the microspheres was visualized using Cy3-labeled secondary probes in a sandwich-type assay format. We applied these microarrays to the detection and enumeration of HAB cells in both cultured and field samples. Our study demonstrated a detection limit of approximately 5 cells for all three target organisms within 45 min, without a separate amplification step, in both sample types. We also developed a multiplexed microarray to detect the three HAB species simultaneously, which successfully detected the target organisms, alone and in combination, without cross-reactivity. Our study suggests that fiber-optic microarrays can be used for rapid and sensitive detection and potential enumeration of HAB species in the environment.
Cruella: developing a scalable tissue microarray data management system.
Cowan, James D; Rimm, David L; Tuck, David P
2006-06-01
Compared with DNA microarray technology, relatively little information is available concerning the special requirements, design influences, and implementation strategies of data systems for tissue microarray technology. These issues include the requirement to accommodate new and different data elements for each new project as well as the need to interact with pre-existing models for clinical, biological, and specimen-related data. To design and implement a flexible, scalable tissue microarray data storage and management system that could accommodate information regarding different disease types and different clinical investigators, and different clinical investigation questions, all of which could potentially contribute unforeseen data types that require dynamic integration with existing data. The unpredictability of the data elements combined with the novelty of automated analysis algorithms and controlled vocabulary standards in this area require flexible designs and practical decisions. Our design includes a custom Java-based persistence layer to mediate and facilitate interaction with an object-relational database model and a novel database schema. User interaction is provided through a Java Servlet-based Web interface. Cruella has become an indispensable resource and is used by dozens of researchers every day. The system stores millions of experimental values covering more than 300 biological markers and more than 30 disease types. The experimental data are merged with clinical data that has been aggregated from multiple sources and is available to the researchers for management, analysis, and export. Cruella addresses many of the special considerations for managing tissue microarray experimental data and the associated clinical information. A metadata-driven approach provides a practical solution to many of the unique issues inherent in tissue microarray research, and allows relatively straightforward interoperability with and accommodation of new data models.
Friedrich, Torben; Rahmann, Sven; Weigel, Wilfried; Rabsch, Wolfgang; Fruth, Angelika; Ron, Eliora; Gunzer, Florian; Dandekar, Thomas; Hacker, Jörg; Müller, Tobias; Dobrindt, Ulrich
2010-10-21
The Enterobacteriaceae comprise a large number of clinically relevant species with several individual subspecies. Overlapping virulence-associated gene pools and the high overall genome plasticity often interferes with correct enterobacterial strain typing and risk assessment. Array technology offers a fast, reproducible and standardisable means for bacterial typing and thus provides many advantages for bacterial diagnostics, risk assessment and surveillance. The development of highly discriminative broad-range microbial diagnostic microarrays remains a challenge, because of marked genome plasticity of many bacterial pathogens. We developed a DNA microarray for strain typing and detection of major antimicrobial resistance genes of clinically relevant enterobacteria. For this purpose, we applied a global genome-wide probe selection strategy on 32 available complete enterobacterial genomes combined with a regression model for pathogen classification. The discriminative power of the probe set was further tested in silico on 15 additional complete enterobacterial genome sequences. DNA microarrays based on the selected probes were used to type 92 clinical enterobacterial isolates. Phenotypic tests confirmed the array-based typing results and corroborate that the selected probes allowed correct typing and prediction of major antibiotic resistances of clinically relevant Enterobacteriaceae, including the subspecies level, e.g. the reliable distinction of different E. coli pathotypes. Our results demonstrate that the global probe selection approach based on longest common factor statistics as well as the design of a DNA microarray with a restricted set of discriminative probes enables robust discrimination of different enterobacterial variants and represents a proof of concept that can be adopted for diagnostics of a wide range of microbial pathogens. Our approach circumvents misclassifications arising from the application of virulence markers, which are highly affected by horizontal gene transfer. Moreover, a broad range of pathogens have been covered by an efficient probe set size enabling the design of high-throughput diagnostics.
Fabrication of high quality cDNA microarray using a small amount of cDNA.
Park, Chan Hee; Jeong, Ha Jin; Jung, Jae Jun; Lee, Gui Yeon; Kim, Sang-Chul; Kim, Tae Soo; Yang, Sang Hwa; Chung, Hyun Cheol; Rha, Sun Young
2004-05-01
DNA microarray technology has become an essential part of biological research. It enables the genome-scale analysis of gene expression in various types of model systems. Manufacturing high quality cDNA microarrays of microdeposition type depends on some key factors including a printing device, spotting pins, glass slides, spotting solution, and humidity during spotting. UsingEthe Microgrid II TAS model printing device, this study defined the optimal conditions for producing high density, high quality cDNA microarrays with the least amount of cDNA product. It was observed that aminosilane-modified slides were superior to other types of surface modified-slides. A humidity of 30+/-3% in a closed environment and the overnight drying of the spotted slides gave the best conditions for arraying. In addition, the cDNA dissolved in 30% DMSO gave the optimal conditions for spotting compared to the 1X ArrayIt, 3X SSC and 50% DMSO. Lastly, cDNA in the concentration range of 100-300 ng/ micro l was determined to be best for arraying and post-processing. Currently, the printing system in this study yields reproducible 9000 spots with a spot size 150 mm diameter, and a 200 nm spot spacing.
Galectins are human milk glycan receptors
Noll, Alexander J; Gourdine, Jean-Philippe; Yu, Ying; Lasanajak, Yi; Smith, David F; Cummings, Richard D
2016-01-01
The biological recognition of human milk glycans (HMGs) is poorly understood. Because HMGs are rich in galactose we explored whether they might interact with human galectins, which bind galactose-containing glycans and are highly expressed in epithelial cells and other cell types. We screened a number of human galectins for their binding to HMGs on a shotgun glycan microarray consisting of 247 HMGs derived from human milk, as well as to a defined HMG microarray. Recombinant human galectins (hGal)-1, -3, -4, -7, -8 and -9 bound selectively to glycans, with each galectin recognizing a relatively unique binding motif; by contrast hGal-2 did not recognize HMGs, but did bind to the human blood group A Type 2 determinants on other microarrays. Unlike other galectins, hGal-7 preferentially bound to glycans expressing a terminal Type 1 (Galβ1-3GlcNAc) sequence, a motif that had eluded detection on non-HMG glycan microarrays. Interactions with HMGs were confirmed in a solution setting by isothermal titration microcalorimetry and hapten inhibition experiments. These results demonstrate that galectins selectively bind to HMGs and suggest the possibility that galectin–HMG interactions may play a role in infant immunity. PMID:26747425
Bu, Huajie; Narisu, Narisu; Schlick, Bettina; Rainer, Johannes; Manke, Thomas; Schäfer, Georg; Pasqualini, Lorenza; Chines, Peter; Schweiger, Michal R.; Fuchsberger, Christian
2015-01-01
ABSTRACT Genome‐wide association studies have identified genomic loci, whose single‐nucleotide polymorphisms (SNPs) predispose to prostate cancer (PCa). However, the mechanisms of most of these variants are largely unknown. We integrated chromatin‐immunoprecipitation‐coupled sequencing and microarray expression profiling in TMPRSS2‐ERG gene rearrangement positive DUCaP cells with the GWAS PCa risk SNPs catalog to identify disease susceptibility SNPs localized within functional androgen receptor‐binding sites (ARBSs). Among the 48 GWAS index risk SNPs and 3,917 linked SNPs, 80 were found located in ARBSs. Of these, rs11891426:T>G in an intron of the melanophilin gene (MLPH) was within a novel putative auxiliary AR‐binding motif, which is enriched in the neighborhood of canonical androgen‐responsive elements. T→G exchange attenuated the transcriptional activity of the ARBS in an AR reporter gene assay. The expression of MLPH in primary prostate tumors was significantly lower in those with the G compared with the T allele and correlated significantly with AR protein. Higher melanophilin level in prostate tissue of patients with a favorable PCa risk profile points out a tumor‐suppressive effect. These results unravel a hidden link between AR and a functional putative PCa risk SNP, whose allele alteration affects androgen regulation of its host gene MLPH. PMID:26411452
Genetic selection of embryos that later develop the metabolic syndrome.
Edwards, M J
2012-05-01
THE BARKER HYPOTHESIS: Is an excellent explanation of the process where human and animal foetuses exposed to malnutrition, either by maternal malnutrition or placental insufficiency, are metabolically programmed, with selective stunting of cell differentiation and organ growth. With the postnatal excess of nutrition observed in developed countries, this irreversible programming causes metabolic syndrome, including obesity, type 2 diabetes, and hypertension. Metabolic programming involves epigenetic changes including imprinting which might be transmitted through more than one generation rather than being completely re-set or erased during reproduction. The Barker hypothesis was supported by epidemiological data that recognised no excess fetal or postnatal mortality when pregnant women were starved during the Dutch famine in World War II. This argued against the "thrifty genotype" theory introduced in 1962, which proposed that starvation selected against members of the population with less "thrifty" genes, but the survivors who had "thrifty" genes developed metabolic syndrome if they were subsequently over-nourished. EMBRYONIC/FETAL SELECTION: Embryos or early foetuses could be selected very early in pregnancy on the basis of their genotype, by maternal malnutrition, hypertension, obesity or other causes of placental insufficiency. The genotype that allows embryos, or cells within them, to survive a less hospitable environment in the decidua after implantation might contribute to the later development of metabolic syndrome. This article hypothesises that an adverse intrauterine environment, caused by maternal malnutrition or placental insufficiency, kills a proportion of embryos and selects a surviving population of early embryos whose growth in utero is retarded by their genotype, their environment or a combination of both. The metabolic syndrome follows if the offspring is over-nourished later in life. The embryonic selection hypothesis presented here could be tested by using single nucleotide polymorphism (SNP) microarrays to study adults who had a history of intrauterine growth retardation (IUGR) and subsequent metabolic syndrome. Their SNP array could be compared with their parents and unaffected unrelated or related controls. If there were no selection based on a "thrifty genotype", all parental sequences would be expected to appear in their surviving children, whether or not they had IUGR or developed metabolic syndrome. SNP sequences present in parents or controls but missing from adult offspring with metabolic syndrome who had IUGR, could be associated with or linked to genes that influence susceptibility to metabolic syndrome. This hypothesis proposes that missing genotypes would be lost if the embryos that inherited them died very early in pregnancy. Copyright © 2012 Elsevier Ltd. All rights reserved.
Reverse phase protein microarrays: fluorometric and colorimetric detection.
Gallagher, Rosa I; Silvestri, Alessandra; Petricoin, Emanuel F; Liotta, Lance A; Espina, Virginia
2011-01-01
The Reverse Phase Protein Microarray (RPMA) is an array platform used to quantitate proteins and their posttranslationally modified forms. RPMAs are applicable for profiling key cellular signaling pathways and protein networks, allowing direct comparison of the activation state of proteins from multiple samples within the same array. The RPMA format consists of proteins immobilized directly on a nitrocellulose substratum. The analyte is subsequently probed with a primary antibody and a series of reagents for signal amplification and detection. Due to the diversity, low concentration, and large dynamic range of protein analytes, RPMAs require stringent signal amplification methods, high quality image acquisition, and software capable of precisely analyzing spot intensities on an array. Microarray detection strategies can be either fluorescent or colorimetric. The choice of a detection system depends on (a) the expected analyte concentration, (b) type of microarray imaging system, and (c) type of sample. The focus of this chapter is to describe RPMA detection and imaging using fluorescent and colorimetric (diaminobenzidine (DAB)) methods.
Mitchell, Sarah G; Bunting, Silvia T; Saxe, Debra; Olson, Thomas; Keller, Frank G
2017-04-01
An activating point mutation of the c-KIT tyrosine kinase receptor gene, D816H, has been described in germ cell tumors (GCTs). We report an adolescent diagnosed with an ovarian mixed GCT and systemic mastocytosis with chronic myelomonocytic leukemia (SM-CMML). The teratoma and dysgerminoma differed by copy number aberrations via single nucleotide polymorphism (SNP) microarray, but were inclusive of the same c-KIT D816H point mutation (c.2446G>C) also identified in blood and bone marrow mast cells. These findings indicate not only a clonal origin of the GCT and hematologic malignancy, but also suggest a rare KIT mutation may be playing a fundamental role in malignancy development. © 2016 Wiley Periodicals, Inc.
A global reference for human genetic variation
2016-01-01
The 1000 Genomes Project set out to provide a comprehensive description of common human genetic variation by applying whole-genome sequencing to a diverse set of individuals from multiple populations. Here we report completion of the project, having reconstructed the genomes of 2,504 individuals from 26 populations using a combination of low-coverage whole-genome sequencing, deep exome sequencing, and dense microarray genotyping. We characterized a broad spectrum of genetic variation, in total over 88 million variants (84.7 million single nucleotide polymorphisms (SNPs), 3.6 million short insertions/deletions (indels), and 60,000 structural variants), all phased onto high-quality haplotypes. This resource includes >99% of SNP variants with a frequency of >1% for a variety of ancestries. We describe the distribution of genetic variation across the global sample, and discuss the implications for common disease studies. PMID:26432245
Dapia, Irene; Tong, Hoi Y.; Arias, Pedro; Muñoz, Mario; Tenorio, Jair; Hernández, Rafael; García García, Irene; Gordo, Gema; Ramírez, Elena; Frías, Jesús; Lapunzina, Pablo; Carcas, Antonio J.
2017-01-01
Abstract In 2014, we established a pharmacogenetics unit with the intention of facilitating the integration of pharmacogenetic testing into clinical practice. This unit was centered around two main ideas: i) individualization of clinical recommendations, and ii) preemptive genotyping in risk populations. Our unit is based on the design and validation of a single nucleotide polymorphism (SNP) microarray, which has allowed testing of 180 SNPs associated with drug response (PharmArray), and clinical consultation regarding the results. Herein, we report our experience in integrating pharmacogenetic testing into our hospital and we present the results of the 2,539 pharmacogenetic consultation requests received over the past 3 years in our unit. The results demonstrate the feasibility of implementing pharmacogenetic testing in clinical practice within a national health system. PMID:29193749
Characterization and simulation of cDNA microarray spots using a novel mathematical model
Kim, Hye Young; Lee, Seo Eun; Kim, Min Jung; Han, Jin Il; Kim, Bo Kyung; Lee, Yong Sung; Lee, Young Seek; Kim, Jin Hyuk
2007-01-01
Background The quality of cDNA microarray data is crucial for expanding its application to other research areas, such as the study of gene regulatory networks. Despite the fact that a number of algorithms have been suggested to increase the accuracy of microarray gene expression data, it is necessary to obtain reliable microarray images by improving wet-lab experiments. As the first step of a cDNA microarray experiment, spotting cDNA probes is critical to determining the quality of spot images. Results We developed a governing equation of cDNA deposition during evaporation of a drop in the microarray spotting process. The governing equation included four parameters: the surface site density on the support, the extrapolated equilibrium constant for the binding of cDNA molecules with surface sites on glass slides, the macromolecular interaction factor, and the volume constant of a drop of cDNA solution. We simulated cDNA deposition from the single model equation by varying the value of the parameters. The morphology of the resulting cDNA deposit can be classified into three types: a doughnut shape, a peak shape, and a volcano shape. The spot morphology can be changed into a flat shape by varying the experimental conditions while considering the parameters of the governing equation of cDNA deposition. The four parameters were estimated by fitting the governing equation to the real microarray images. With the results of the simulation and the parameter estimation, the phenomenon of the formation of cDNA deposits in each type was investigated. Conclusion This study explains how various spot shapes can exist and suggests which parameters are to be adjusted for obtaining a good spot. This system is able to explore the cDNA microarray spotting process in a predictable, manageable and descriptive manner. We hope it can provide a way to predict the incidents that can occur during a real cDNA microarray experiment, and produce useful data for several research applications involving cDNA microarrays. PMID:18096047
Sengupta Chattopadhyay, Amrita; Hsiao, Ching-Lin; Chang, Chien Ching; Lian, Ie-Bin; Fann, Cathy S J
2014-01-01
Identifying susceptibility genes that influence complex diseases is extremely difficult because loci often influence the disease state through genetic interactions. Numerous approaches to detect disease-associated SNP-SNP interactions have been developed, but none consistently generates high-quality results under different disease scenarios. Using summarizing techniques to combine a number of existing methods may provide a solution to this problem. Here we used three popular non-parametric methods-Gini, absolute probability difference (APD), and entropy-to develop two novel summary scores, namely principle component score (PCS) and Z-sum score (ZSS), with which to predict disease-associated genetic interactions. We used a simulation study to compare performance of the non-parametric scores, the summary scores, the scaled-sum score (SSS; used in polymorphism interaction analysis (PIA)), and the multifactor dimensionality reduction (MDR). The non-parametric methods achieved high power, but no non-parametric method outperformed all others under a variety of epistatic scenarios. PCS and ZSS, however, outperformed MDR. PCS, ZSS and SSS displayed controlled type-I-errors (<0.05) compared to GS, APDS, ES (>0.05). A real data study using the genetic-analysis-workshop 16 (GAW 16) rheumatoid arthritis dataset identified a number of interesting SNP-SNP interactions. © 2013 Elsevier B.V. All rights reserved.
The MGED ontology: a framework for describing functional genomics experiments.
Stoeckert, Christian J; Parkinson, Helen
2003-01-01
The Microarray Gene Expression Data (MGED) society was formed with an initial focus on experiments involving microarray technology. Despite the diversity of applications, there are common concepts used and a common need to capture experimental information in a standardized manner. In building the MGED ontology, it was recognized that it would be impractical to cover all the different types of experiments on all the different types of organisms by listing and defining all the types of organisms and their properties. Our solution was to create a framework for describing microarray experiments with an initial focus on the biological sample and its manipulation. For concepts that are common for many species, we could provide a manageable listing of controlled terms. For concepts that are species-specific or whose values cannot be readily listed, we created an 'OntologyEntry' concept that referenced an external resource. The MGED ontology is a work in progress that needs additional instances and particularly needs constraints to be added. The ontology currently covers the experimental sample and design, and we have begun capturing aspects of the microarrays themselves as well. The primary application of the ontology will be to develop forms for entering information into databases, and consequently allowing queries, taking advantage of the structure provided by the ontology. The application of an ontology of experimental conditions extends beyond microarray experiments and, as the scope of MGED includes other aspects of functional genomics, so too will the MGED ontology.
Missing value imputation for microarray data: a comprehensive comparison study and a web tool
2013-01-01
Background Microarray data are usually peppered with missing values due to various reasons. However, most of the downstream analyses for microarray data require complete datasets. Therefore, accurate algorithms for missing value estimation are needed for improving the performance of microarray data analyses. Although many algorithms have been developed, there are many debates on the selection of the optimal algorithm. The studies about the performance comparison of different algorithms are still incomprehensive, especially in the number of benchmark datasets used, the number of algorithms compared, the rounds of simulation conducted, and the performance measures used. Results In this paper, we performed a comprehensive comparison by using (I) thirteen datasets, (II) nine algorithms, (III) 110 independent runs of simulation, and (IV) three types of measures to evaluate the performance of each imputation algorithm fairly. First, the effects of different types of microarray datasets on the performance of each imputation algorithm were evaluated. Second, we discussed whether the datasets from different species have different impact on the performance of different algorithms. To assess the performance of each algorithm fairly, all evaluations were performed using three types of measures. Our results indicate that the performance of an imputation algorithm mainly depends on the type of a dataset but not on the species where the samples come from. In addition to the statistical measure, two other measures with biological meanings are useful to reflect the impact of missing value imputation on the downstream data analyses. Our study suggests that local-least-squares-based methods are good choices to handle missing values for most of the microarray datasets. Conclusions In this work, we carried out a comprehensive comparison of the algorithms for microarray missing value imputation. Based on such a comprehensive comparison, researchers could choose the optimal algorithm for their datasets easily. Moreover, new imputation algorithms could be compared with the existing algorithms using this comparison strategy as a standard protocol. In addition, to assist researchers in dealing with missing values easily, we built a web-based and easy-to-use imputation tool, MissVIA (http://cosbi.ee.ncku.edu.tw/MissVIA), which supports many imputation algorithms. Once users upload a real microarray dataset and choose the imputation algorithms, MissVIA will determine the optimal algorithm for the users' data through a series of simulations, and then the imputed results can be downloaded for the downstream data analyses. PMID:24565220
Cell-Based Microarrays for In Vitro Toxicology
NASA Astrophysics Data System (ADS)
Wegener, Joachim
2015-07-01
DNA/RNA and protein microarrays have proven their outstanding bioanalytical performance throughout the past decades, given the unprecedented level of parallelization by which molecular recognition assays can be performed and analyzed. Cell microarrays (CMAs) make use of similar construction principles. They are applied to profile a given cell population with respect to the expression of specific molecular markers and also to measure functional cell responses to drugs and chemicals. This review focuses on the use of cell-based microarrays for assessing the cytotoxicity of drugs, toxins, or chemicals in general. It also summarizes CMA construction principles with respect to the cell types that are used for such microarrays, the readout parameters to assess toxicity, and the various formats that have been established and applied. The review ends with a critical comparison of CMAs and well-established microtiter plate (MTP) approaches.
Both germ line and somatic genetics of the p53 pathway affect ovarian cancer incidence and survival.
Bartel, Frank; Jung, Juliane; Böhnke, Anja; Gradhand, Elise; Zeng, Katharina; Thomssen, Christoph; Hauptmann, Steffen
2008-01-01
Although p53 is one of the most studied genes/proteins in ovarian carcinomas, the predictive value of p53 alterations is still ambiguous. We performed analyses of the TP53 mutational status and its protein expression using immunohistochemistry. Moreover, the single nucleotide polymorphism SNP309 in the P2 promoter of the MDM2 gene was investigated. We correlated the results with age of onset and outcome from 107 patients with ovarian carcinoma. In our study, we identified a large group of patients with p53 overexpression despite having a wild-type gene (49% of all patients with wild-type TP53). This was associated with a significantly shortened overall survival time (P = 0.019). Patients with p53 alterations (especially those with overexpression of wild-type TP53) were also more refractory to chemotherapy compared with patients with normal p53 (P = 0.027). The G-allele of SNP309 is associated with an earlier age of onset in patients with estrogen receptor-overexpressing FIGO stage III disease (P = 0.048). In contrast, in patients with FIGO stage III disease, a weakened p53 pathway (either the G-allele of SNP309 or a TP53 mutation) was correlated with increased overall survival compared with patients whose tumors were wild-type for both TP53 and SNP309 (P = 0.0035). Our study provides evidence that both germ line and somatic alterations of the p53 pathway influence the incidence and survival of ovarian carcinoma, and it underscores the importance of assessing the functionality of p53 in order to predict the sensitivity of platinum-based chemotherapies and patient outcome.
A Self-Directed Method for Cell-Type Identification and Separation of Gene Expression Microarrays
Zuckerman, Neta S.; Noam, Yair; Goldsmith, Andrea J.; Lee, Peter P.
2013-01-01
Gene expression analysis is generally performed on heterogeneous tissue samples consisting of multiple cell types. Current methods developed to separate heterogeneous gene expression rely on prior knowledge of the cell-type composition and/or signatures - these are not available in most public datasets. We present a novel method to identify the cell-type composition, signatures and proportions per sample without need for a-priori information. The method was successfully tested on controlled and semi-controlled datasets and performed as accurately as current methods that do require additional information. As such, this method enables the analysis of cell-type specific gene expression using existing large pools of publically available microarray datasets. PMID:23990767
Molecular Diagnostics in Transfusion Medicine: In Capillary, on a Chip, in Silico, or in Flight?
Garritsen, Henk S.P.; Xiu-Cheng Fan, Alex; Lenz, Daniela; Hannig, Horst; Yan Zhong, Xiao; Geffers, Robert; Lindenmaier, Werner; Dittmar, Kurt E.J.; Wörmann, Bernhard
2009-01-01
Summary Serology, defined as antibody-based diagnostics, has been regarded as the diagnostic gold standard in transfusion medicine. Nowadays however the impact of molecular diagnostics in transfusion medicine is rapidly growing. Molecular diagnostics can improve tissue typing (HLA typing), increase safety of blood products (NAT testing of infectious diseases), and enable blood group typing in difficult situations (after transfusion of blood products or prenatal non-invasive RhD typing). Most of the molecular testing involves the determination of the presence of single nucleotide polymorphisms (SNPs). Antigens (e.g. blood group antigens) mostly result from single nucleotide differences in critical positions. However, most blood group systems cannot be determined by looking at a single SNP. To identify members of a blood group system a number of critical SNPs have to be taken into account. The platforms which are currently used to perform molecular diagnostics are mostly gel-based, requiring time-consuming multiple manual steps. To implement molecular methods in transfusion medicine in the future the development of higher-throughput SNP genotyping non-gel-based platforms which allow a rapid, cost-effective screening are essential. Because of its potential for automation, high throughput and cost effectiveness the special focus of this paper is a relative new technique: SNP genotyping by MALDI-TOF MS analysis. PMID:21113259
Pearce, Madison E; Alikhan, Nabil-Fareed; Dallman, Timothy J; Zhou, Zhemin; Grant, Kathie; Maiden, Martin C J
2018-06-02
Multi-country outbreaks of foodborne bacterial disease present challenges in their detection, tracking, and notification. As food is increasingly distributed across borders, such outbreaks are becoming more common. This increases the need for high-resolution, accessible, and replicable isolate typing schemes. Here we evaluate a core genome multilocus typing (cgMLST) scheme for the high-resolution reproducible typing of Salmonella enterica (S. enterica) isolates, by its application to a large European outbreak of S. enterica serovar Enteritidis. This outbreak had been extensively characterised using single nucleotide polymorphism (SNP)-based approaches. The cgMLST analysis was congruent with the original SNP-based analysis, the epidemiological data, and whole genome MLST (wgMLST) analysis. Combination of the cgMLST and epidemiological data confirmed that the genetic diversity among the isolates predated the outbreak, and was likely present at the infection source. There was consequently no link between country of isolation and genetic diversity, but the cgMLST clusters were congruent with date of isolation. Furthermore, comparison with publicly available Enteritidis isolate data demonstrated that the cgMLST scheme presented is highly scalable, enabling outbreaks to be contextualised within the Salmonella genus. The cgMLST scheme is therefore shown to be a standardised and scalable typing method, which allows Salmonella outbreaks to be analysed and compared across laboratories and jurisdictions. Copyright © 2018. Published by Elsevier B.V.
Flores-Martínez, Silvia Esperanza; Castro-Martínez, Anna Gabriela; López-Quintero, Andrés; García-Zapién, Alejandra Guadalupe; Torres-Rodríguez, Ruth Noemí; Sánchez-Corona, José
2015-01-01
Polycystic ovary syndrome is a complex and heterogeneous disease involving both reproductive and metabolic problems. It has been suggested a genetic predisposition in the etiology of this syndrome. The identification of calpain-10 gene (CAPN10) as the first candidate gene for type 2 diabetes mellitus, has focused the interest in investigating their possible relation with the polycystic ovary syndrome, because this syndrome is associated with hyperinsulinemia and insulin resistance, two metabolic abnormalities associated with type 2 diabetes mellitus. To investigate if there is association between the SNP-63 and the variant indel-19 of the CAPN10 gene and polycystic ovary syndrome in women of reproductive age. This study included 101 women (55 with polycystic ovary syndrome and 46 without polycystic ovary syndrome). The genetic variant indel-19 was identified by electrophoresis of the amplified fragments by PCR, and the SNP-63 by PCR-RFLP. The allele and genotype frequencies of the two variants do not differ significatly between women with polycystic ovary syndrome and control women group. The haplotype 21 (defined by the insertion allele of indel-19 variant and C allele of SNP-63) was found with higher frequency in both study groups, being more frequent in the polycystic ovary syndrome patients group, however, this difference was not statistically significant (p = 0.8353). The results suggest that SNP-63 and indel-19 variant of the CAPN10 gene do not represent a risk factor for polycystic ovary syndrome in our patients group. Copyright © 2015. Published by Masson Doyma México S.A.
Kinoshita, Kenji; Fujimoto, Kentaro; Yakabe, Toru; Saito, Shin; Hamaguchi, Yuzo; Kikuchi, Takayuki; Nonaka, Ken; Murata, Shigenori; Masuda, Daisuke; Takada, Wataru; Funaoka, Sohei; Arai, Susumu; Nakanishi, Hisao; Yokoyama, Kanehisa; Fujiwara, Kazuhiko; Matsubara, Kenichi
2007-01-01
DNA microarrays are routinely used to monitor gene expression profiling and single nucleotide polymorphisms (SNPs). However, for practically useful high performance, the detection sensitivity is still not adequate, leaving low expression genes undetected. To resolve this issue, we have developed a new plastic S-BIO® PrimeSurface® with a biocompatible polymer; its surface chemistry offers an extraordinarily stable thermal property for a lack of pre-activated glass slide surface. The oligonucleotides immobilized on this substrate are robust in boiling water and show no significant loss of hybridization activity during dissociation treatment. This allowed us to hybridize the templates, extend the 3′ end of the immobilized DNA primers on the S-Bio® by DNA polymerase using deoxynucleotidyl triphosphates (dNTP) as extender units, release the templates by denaturalization and use the same templates for a second round of reactions similar to that of the PCR method. By repeating this cycle, the picomolar concentration range of the template oligonucleotide can be detected as stable signals via the incorporation of labeled dUTP into primers. This method of Multiple Primer EXtension (MPEX) could be further extended as an alternative route for producing DNA microarrays for SNP analyses via simple template preparation such as reverse transcript cDNA or restriction enzyme treatment of genome DNA. PMID:17135189
Genetic Structures of Copy Number Variants Revealed by Genotyping Single Sperm
Luo, Minjie; Cui, Xiangfeng; Fredman, David; Brookes, Anthony J.; Azaro, Marco A.; Greenawalt, Danielle M.; Hu, Guohong; Wang, Hui-Yun; Tereshchenko, Irina V.; Lin, Yong; Shentu, Yue; Gao, Richeng; Shen, Li; Li, Honghua
2009-01-01
Background Copy number variants (CNVs) occupy a significant portion of the human genome and may have important roles in meiotic recombination, human genome evolution and gene expression. Many genetic diseases may be underlain by CNVs. However, because of the presence of their multiple copies, variability in copy numbers and the diploidy of the human genome, detailed genetic structure of CNVs cannot be readily studied by available techniques. Methodology/Principal Findings Single sperm samples were used as the primary subjects for the study so that CNV haplotypes in the sperm donors could be studied individually. Forty-eight CNVs characterized in a previous study were analyzed using a microarray-based high-throughput genotyping method after multiplex amplification. Seventeen single nucleotide polymorphisms (SNPs) were also included as controls. Two single-base variants, either allelic or paralogous, could be discriminated for all markers. Microarray data were used to resolve SNP alleles and CNV haplotypes, to quantitatively assess the numbers and compositions of the paralogous segments in each CNV haplotype. Conclusions/Significance This is the first study of the genetic structure of CNVs on a large scale. Resulting information may help understand evolution of the human genome, gain insight into many genetic processes, and discriminate between CNVs and SNPs. The highly sensitive high-throughput experimental system with haploid sperm samples as subjects may be used to facilitate detailed large-scale CNV analysis. PMID:19384415
Nielsen, Peter B; Petersen, Maja S; Ystaas, Viviana; Andersen, Rolf V; Hansen, Karin M; Blaabjerg, Vibeke; Refstrup, Mette
2012-10-01
Classical hereditary hemochromatosis involves the HFE-gene and diagnostic analysis of the DNA variants HFE p.C282Y (c.845G>A; rs1800562) and HFE p.H63D (c.187C>G; rs1799945). The affected protein alters the iron homeostasis resulting in iron overload in various tissues. The aim of this study was to validate the TaqMan-based Sample-to-SNP protocol for the analysis of the HFE-p.C282Y and p.H63D variants with regard to accuracy, usefulness and reproducibility compared to an existing SNP protocol. The Sample-to-SNP protocol uses an approach where the DNA template is made accessible from a cell lysate followed by TaqMan analysis. Besides the HFE-SNPs other eight SNPs were used as well. These SNPs were: Coagulation factor II-gene F2 c.20210G>A, Coagulation factor V-gene F5 p.R506Q (c.1517G>A; rs121917732), Mitochondria SNP: mt7028 G>A, Mitochondria SNP: mt12308 A>G, Proprotein convertase subtilisin/kexin type 9-gene PCSK9 p.R46L (c.137G>T), Plutathione S-transferase pi 1-gene GSTP1 p.I105V (c313A>G; rs1695), LXR g.-171 A>G, ZNF202 g.-118 G>T. In conclusion the Sample-to-SNP kit proved to be an accurate, reliable, robust, easy to use and rapid TaqMan-based SNP detection protocol, which could be quickly implemented in a routine diagnostic or research facility. Copyright © 2012. Published by Elsevier B.V.
Partitioned learning of deep Boltzmann machines for SNP data.
Hess, Moritz; Lenz, Stefan; Blätte, Tamara J; Bullinger, Lars; Binder, Harald
2017-10-15
Learning the joint distributions of measurements, and in particular identification of an appropriate low-dimensional manifold, has been found to be a powerful ingredient of deep leaning approaches. Yet, such approaches have hardly been applied to single nucleotide polymorphism (SNP) data, probably due to the high number of features typically exceeding the number of studied individuals. After a brief overview of how deep Boltzmann machines (DBMs), a deep learning approach, can be adapted to SNP data in principle, we specifically present a way to alleviate the dimensionality problem by partitioned learning. We propose a sparse regression approach to coarsely screen the joint distribution of SNPs, followed by training several DBMs on SNP partitions that were identified by the screening. Aggregate features representing SNP patterns and the corresponding SNPs are extracted from the DBMs by a combination of statistical tests and sparse regression. In simulated case-control data, we show how this can uncover complex SNP patterns and augment results from univariate approaches, while maintaining type 1 error control. Time-to-event endpoints are considered in an application with acute myeloid leukemia patients, where SNP patterns are modeled after a pre-screening based on gene expression data. The proposed approach identified three SNPs that seem to jointly influence survival in a validation dataset. This indicates the added value of jointly investigating SNPs compared to standard univariate analyses and makes partitioned learning of DBMs an interesting complementary approach when analyzing SNP data. A Julia package is provided at 'http://github.com/binderh/BoltzmannMachines.jl'. binderh@imbi.uni-freiburg.de. Supplementary data are available at Bioinformatics online. © The Author (2017). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com
Psifidi, Androniki; Dovas, Chrysostomos; Banos, Georgios
2011-01-19
Single nucleotide polymorphisms (SNP) have proven to be powerful genetic markers for genetic applications in medicine, life science and agriculture. A variety of methods exist for SNP detection but few can quantify SNP frequencies when the mutated DNA molecules correspond to a small fraction of the wild-type DNA. Furthermore, there is no generally accepted gold standard for SNP quantification, and, in general, currently applied methods give inconsistent results in selected cohorts. In the present study we sought to develop a novel method for accurate detection and quantification of SNP in DNA pooled samples. The development and evaluation of a novel Ligase Chain Reaction (LCR) protocol that uses a DNA-specific fluorescent dye to allow quantitative real-time analysis is described. Different reaction components and thermocycling parameters affecting the efficiency and specificity of LCR were examined. Several protocols, including gap-LCR modifications, were evaluated using plasmid standard and genomic DNA pools. A protocol of choice was identified and applied for the quantification of a polymorphism at codon 136 of the ovine PRNP gene that is associated with susceptibility to a transmissible spongiform encephalopathy in sheep. The real-time LCR protocol developed in the present study showed high sensitivity, accuracy, reproducibility and a wide dynamic range of SNP quantification in different DNA pools. The limits of detection and quantification of SNP frequencies were 0.085% and 0.35%, respectively. The proposed real-time LCR protocol is applicable when sensitive detection and accurate quantification of low copy number mutations in DNA pools is needed. Examples include oncogenes and tumour suppressor genes, infectious diseases, pathogenic bacteria, fungal species, viral mutants, drug resistance resulting from point mutations, and genetically modified organisms in food.
Psifidi, Androniki; Dovas, Chrysostomos; Banos, Georgios
2011-01-01
Background Single nucleotide polymorphisms (SNP) have proven to be powerful genetic markers for genetic applications in medicine, life science and agriculture. A variety of methods exist for SNP detection but few can quantify SNP frequencies when the mutated DNA molecules correspond to a small fraction of the wild-type DNA. Furthermore, there is no generally accepted gold standard for SNP quantification, and, in general, currently applied methods give inconsistent results in selected cohorts. In the present study we sought to develop a novel method for accurate detection and quantification of SNP in DNA pooled samples. Methods The development and evaluation of a novel Ligase Chain Reaction (LCR) protocol that uses a DNA-specific fluorescent dye to allow quantitative real-time analysis is described. Different reaction components and thermocycling parameters affecting the efficiency and specificity of LCR were examined. Several protocols, including gap-LCR modifications, were evaluated using plasmid standard and genomic DNA pools. A protocol of choice was identified and applied for the quantification of a polymorphism at codon 136 of the ovine PRNP gene that is associated with susceptibility to a transmissible spongiform encephalopathy in sheep. Conclusions The real-time LCR protocol developed in the present study showed high sensitivity, accuracy, reproducibility and a wide dynamic range of SNP quantification in different DNA pools. The limits of detection and quantification of SNP frequencies were 0.085% and 0.35%, respectively. Significance The proposed real-time LCR protocol is applicable when sensitive detection and accurate quantification of low copy number mutations in DNA pools is needed. Examples include oncogenes and tumour suppressor genes, infectious diseases, pathogenic bacteria, fungal species, viral mutants, drug resistance resulting from point mutations, and genetically modified organisms in food. PMID:21283808
Cooper, T A; Wiggans, G R; VanRaden, P M
2013-05-01
Call rates on both a single nucleotide polymorphism (SNP) basis and an animal basis are used as measures of data quality and as screening tools for genomic studies and evaluations of dairy cattle. To investigate the relationship of SNP call rate and genotype accuracy for individual SNP, the correlation between percentages of missing genotypes and parent-progeny conflicts for each SNP was calculated for 103,313 Holsteins. Correlations ranged from 0.14 to 0.38 for the BovineSNP50 and BovineLD (Illumina Inc., San Diego, CA) and GeneSeek Genomic Profiler (Neogen Corp., Lincoln, NE) chips, with lower correlations for newer chips. For US genomic evaluations, genotypes are excluded for animals with a call rate of <90% across autosomal SNP or <80% across X-specific SNP. Mean call rate for 220,175 Holstein, Jersey, and Brown Swiss genotypes was 99.6%. Animal genotypes with a call rate of ≤99% were examined from the US Department of Agriculture genotype database to determine how genotype call rate is related to accuracy of calls on an animal basis. Animal call rate was determined from SNP used in genomic evaluation and is the number of called autosomal and X-specific SNP genotypes divided by the number of SNP from that type of chip. To investigate the relationship of animal call rate and parentage validation, conflicts between a genotyped animal and its sire or dam were determined through a duo test (opposite homozygous SNP genotypes between sire and progeny; 1,374 animal genotypes) and a trio test (also including conflicts with dam and heterozygous SNP genotype for the animal when both parents are the same homozygote; 482 animal genotypes). When animal call rate was ≤ 80%, parentage validation was no longer reliable with the duo test. With the trio test, parentage validation was no longer reliable when animal call rate was ≤ 90%. To investigate how animal call rate was related to genotyping accuracy for animals with multiple genotypes, concordance between genotypes for 1,216 animals that had a genotype with a call rate of ≤ 99% (low call rate) as well as a genotype with a call rate of >99% (high call rate) were calculated by dividing the number of identical SNP genotype calls by the number of SNP that were called for both genotypes. Mean concordance between low- and high-call genotypes was >99% for a low call rate of >90% but decreased to 97% for a call rate of 86 to 90% and to 58% for a call rate of <60%. Edits on call rate reduce the use of incorrect SNP genotypes to calculate genomic evaluations. Copyright © 2013 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
Genome-wide association mapping of frost tolerance in barley (Hordeum vulgare L.)
2013-01-01
Background Frost tolerance is a key trait with economic and agronomic importance in barley because it is a major component of winter hardiness, and therefore limits the geographical distribution of the crop and the effective transfer of quality traits between spring and winter crop types. Three main frost tolerance QTL (Fr-H1, Fr-H2 and Fr-H3) have been identified from bi-parental genetic mapping but it can be argued that those mapping populations only capture a portion of the genetic diversity of the species. A genetically broad dataset consisting of 184 genotypes, representative of the barley gene pool cultivated in the Mediterranean basin over an extended time period, was genotyped with 1536 SNP markers. Frost tolerance phenotype scores were collected from two trial sites, Foradada (Spain) and Fiorenzuola (Italy) and combined with the genotypic data in genome wide association analyses (GWAS) using Eigenstrat and kinship approaches to account for population structure. Results GWAS analyses identified twelve and seven positive SNP associations at Foradada and Fiorenzuola, respectively, using Eigenstrat and six and four, respectively, using kinship. Linkage disequilibrium analyses of the significant SNP associations showed they are genetically independent. In the kinship analysis, two of the significant SNP associations were tightly linked to the Fr-H2 and HvBmy loci on chromosomes 5H and 4HL, respectively. The other significant kinship associations were located in genomic regions that have not previously been associated with cold stress. Conclusions Haplotype analysis revealed that most of the significant SNP loci are fixed in the winter or facultative types, while they are freely segregating within the un-adapted spring barley genepool. Although there is a major interest in detecting new variation to improve frost tolerance of available winter and facultative types, from a GWAS perspective, working within the un-adapted spring germplasm pool is an attractive alternative strategy which would minimize statistical issues, simplify the interpretation of the data and identify phenology independent genetic determinants of frost tolerance. PMID:23802597
Petersen, Jessica L; Valberg, Stephanie J; Mickelson, James R; McCue, Molly E
2014-01-01
Summary Two variants in the equine myostatin gene (MSTN), including a T/C SNP substitution in the first intron and a 227-bp SINE insertion in the promoter, are associated with muscle fiber type proportions in the Quarter Horse (QH) and with the prediction of race distance propensity in the Thoroughbred (TB). Genotypes from these loci, along with 18 additional variants surrounding MSTN, were examined in 301 horses of 14 breeds to evaluate haplotype relationships and diversity. The C allele of intron 1 was found in 12 of 14 breeds at a frequency of 0.27; the SINE was observed in five breeds, but common in only the TB and QH (0.73 and 0.48 respectively). Haplotype data suggest the SINE insertion is contemporary to and arose upon a haplotype containing the intron 1 C allele. Gluteal muscle biopsies of TBs showed a significant association of the intron 1 C allele and SINE with a higher proportion of Type 2B and lower proportion of Type 1 fibers. However, in the Belgian horse, in which the SINE is not present, the intron 1 SNP was not associated with fiber type proportions, and evaluation of fiber type proportions across the Belgian, TB and QH breeds shows the significant effect of breed on fiber type proportions is negated when evaluating horses without the SINE variant. These data suggest the SINE, rather than the intron 1 SNP, is driving the observed muscle fiber type characteristics and is the variant targeted by selection for short-distance racing. PMID:25160752
Kostić, Tanja; Sessitsch, Angela
2011-01-01
Reliable and sensitive pathogen detection in clinical and environmental (including food and water) samples is of greatest importance for public health. Standard microbiological methods have several limitations and improved alternatives are needed. Most important requirements for reliable analysis include: (i) specificity; (ii) sensitivity; (iii) multiplexing potential; (iv) robustness; (v) speed; (vi) automation potential; and (vii) low cost. Microarray technology can, through its very nature, fulfill many of these requirements directly and the remaining challenges have been tackled. In this review, we attempt to compare performance characteristics of the microbial diagnostic microarrays developed for the detection and typing of food and water pathogens, and discuss limitations, points still to be addressed and issues specific for the analysis of food, water and environmental samples. PMID:27605332
Development of a spreadsheet for SNPs typing using Microsoft EXCEL.
Hashiyada, Masaki; Itakura, Yukio; Takahashi, Shirushi; Sakai, Jun; Funayama, Masato
2009-04-01
Single-nucleotide polymorphisms (SNPs) have some characteristics that make them very appropriate for forensic studies and applications. In our institute, SNPs typings were performed by the TaqMan SNP Genotyping Assays using the ABI PRISM 7500 FAST Real-Time PCR System (AppliedBiosystems) and Sequence Detection Software ver.1.4 (AppliedBiosystem). The TaqMan method was desired two positive control (Allele1 and 2) and one negative control to analyze each SNP locus. Therefore, it can be analyzed up to 24 loci of a person on a 96-well-plate at the same time. If SNPs analysis is expected to apply to biometrics authentication, 48 and over loci are required to identify a person. In this study, we designed a spreadsheet package using Microsoft EXCEL, and population data were used from our 120 SNPs population studies. On the spreadsheet, we defined SNP types using 'template files' instead of positive and negative controls. "Template files" consisted of the results of 94 unknown samples and two negative controls of each of 120 SNPs loci we had previously studied. By the use of the files, the spreadsheet could analyze 96 SNPs on a 96-wells-plate simultaneously.
Sharma, Rajani; Prudente, Sabrina; Andreozzi, Francesco; Powers, Christine; Mannino, Gaia; Bacci, Simonetta; Gervino, Ernest V; Hauser, Thomas H; Succurro, Elena; Mercuri, Luana; Goheen, Elizabeth H; Shah, Hetal; Trischitta, Vincenzo; Sesti, Giorgio; Doria, Alessandro
2011-05-01
SNP rs2943641 near the insulin receptor substrate 1 (IRS1) gene has been found to be associated with type 2 diabetes (T2D) and insulin-resistance in genome-wide association studies. We investigated whether this SNP is associated with cardiovascular risk factors and coronary artery disease (CAD) among diabetic individuals. SNP rs2943641 was typed in 2133 White T2D subjects and tested for association with BMI, serum HDL cholesterol and triglycerides, hypertension history, and CAD risk. HDL cholesterol decreased by 1mg/dl (p = 0.004) and serum triglycerides increased by 6 mg/dl (p = 0.016) for each copy of the insulin-resistance allele. Despite these effects, no association was found with increased CAD risk (OR = 1.00, 95% CI 0.88-1.13). The insulin-resistance and T2D locus near the IRS1 gene is a determinant of lower HDL cholesterol among T2D subjects. However, this effect is small and does not translate into a detectable increase in CAD risk in this population. Copyright © 2011 Elsevier Ireland Ltd. All rights reserved.
Børsting, Claus; Morling, Niels
2012-02-01
In some relationship cases, the initial investigations of autosomal short tandem repeats (STRs) lead to an ambiguous conclusion and supplementary investigations become necessary. Six unusual paternity cases were previously investigated by other researchers and published as case work examples in forensic journals. Here, the cases were reinvestigated by typing the samples for 49 autosomal single-nucleotide polymorphisms (SNPs) using the SNPforID multiplex assay. Three cases were solved by the SNP investigation without the need for any additional testing. In two cases, the SNP results supported the conclusions based on STRs. In the last case, the SNP results spoke in favor of paternity, and the combined paternity index based on autosomal STRs and SNPs was 12.3 billion. Nevertheless, the alleged father was excluded by X-chromosome typing. The case work examples underline the importance of performing supplementary investigations, and they advocate for the implementation of several panels that may be used in the highly unusual cases. Panels with SNPs or other markers with low mutation probabilities are preferable as supplementary markers, because the risk of detecting (additional) mutations is very low. © 2012 American Association of Blood Banks.
[Identification of Y-chromosomal Genetic Types for the Soldier's Remains from Huaihai Campaign].
Wang, C Z; Wen, S Q; Shi, M S; Yu, X E; Wang, X J; Pan, Y L; Zhang, Y F; Li, H; Tan, J Z
2017-08-01
To identify the Y-chromosomal genetic types for the soldier's remains from Huaihai Campaign, and to offer a clue for search of their paternal relatives. DNA of the remains were extracted by the ancient DNA extraction method. Yfiler kit was used for the multiplex amplification of 17 Y-STR loci. The haplogroups of the samples were speculated. Detailed genotyping of the selected Y-SNP was performed based on the latest Y-chromosome phylogenetic tree. Haplotype-sharing analysis was done based on the data of Y-SNP and Y-STR, the closest modern individual information to the genetic relationship of remains was gained. A total of 8 Y-STR haplotypes were observed on 17 Y-STR loci of 8 male individuals. Furthermore, 6 Y-SNP haplogroups were identified, which were O2a1-M95+, O1a1-P203+, O3*-M122+/M234-, D1-M15+, C3*-ST and R1a1-M17+. Identification of Y-chromosomal genetic types for the soldier's remains from Huaihai Campaign shows a reference value on inferring the geographical origins of old materials. Copyright© by the Editorial Department of Journal of Forensic Medicine
Hybrid genetic algorithm-neural network: feature extraction for unpreprocessed microarray data.
Tong, Dong Ling; Schierz, Amanda C
2011-09-01
Suitable techniques for microarray analysis have been widely researched, particularly for the study of marker genes expressed to a specific type of cancer. Most of the machine learning methods that have been applied to significant gene selection focus on the classification ability rather than the selection ability of the method. These methods also require the microarray data to be preprocessed before analysis takes place. The objective of this study is to develop a hybrid genetic algorithm-neural network (GANN) model that emphasises feature selection and can operate on unpreprocessed microarray data. The GANN is a hybrid model where the fitness value of the genetic algorithm (GA) is based upon the number of samples correctly labelled by a standard feedforward artificial neural network (ANN). The model is evaluated by using two benchmark microarray datasets with different array platforms and differing number of classes (a 2-class oligonucleotide microarray data for acute leukaemia and a 4-class complementary DNA (cDNA) microarray dataset for SRBCTs (small round blue cell tumours)). The underlying concept of the GANN algorithm is to select highly informative genes by co-evolving both the GA fitness function and the ANN weights at the same time. The novel GANN selected approximately 50% of the same genes as the original studies. This may indicate that these common genes are more biologically significant than other genes in the datasets. The remaining 50% of the significant genes identified were used to build predictive models and for both datasets, the models based on the set of genes extracted by the GANN method produced more accurate results. The results also suggest that the GANN method not only can detect genes that are exclusively associated with a single cancer type but can also explore the genes that are differentially expressed in multiple cancer types. The results show that the GANN model has successfully extracted statistically significant genes from the unpreprocessed microarray data as well as extracting known biologically significant genes. We also show that assessing the biological significance of genes based on classification accuracy may be misleading and though the GANN's set of extra genes prove to be more statistically significant than those selected by other methods, a biological assessment of these genes is highly recommended to confirm their functionality. Copyright © 2011 Elsevier B.V. All rights reserved.
Yeh, E; Kimura, L; Errera, F I V; Angeli, C B; Mingroni-Netto, R C; Silva, M E R; Canani, L H S; Passos-Bueno, M R
2008-06-01
Association studies between ADIPOR1 genetic variants and predisposition to type 2 diabetes (DM2) have provided contradictory results. We determined if two single nucleotide polymorphisms (SNP c.-8503G>A and SNP c.10225C>G) in regulatory regions of ADIPOR1 in 567 Brazilian individuals of European (EA; N = 443) or African (AfA; N = 124) ancestry from rural (quilombo remnants; N = 439) and urban (N = 567) areas. We detected a significant effect of ethnicity on the distribution of the allelic frequencies of both SNPs in these populations (EA: -8503A = 0.27; AfA: -8503A = 0.16; P = 0.001 and EA: 10225G = 0.35; AfA: 10225G = 0.51; P < 0.001). Neither of the polymorphisms were associated with DM2 in the case-control study in EA (SNP c.-8503G>A: DM2 group -8503A = 0.26; control group -8503A = 0.30; P = 0.14/SNP 10225C>G: DM2 group 10225G = 0.37; control group 10225G = 0.32; P = 0.40) and AfA populations (SNP c.-8503G>A: DM2 group -8503A = 0.16; control group -8503A = 0.15; P = 0.34/SNP 10225C>G: DM2 group 10225G = 0.51; control group 10225G = 0.52; P = 0.50). Similarly, none of the polymorphisms were associated with metabolic/anthropometric risk factors for DM2 in any of the three populations, except for HDL cholesterol, which was significantly higher in AfA heterozygotes (GC = 53.75 +/- 17.26 mg/dL) than in homozygotes. We conclude that ADIPOR1 polymorphisms are unlikely to be major risk factors for DM2 or for metabolic/anthropometric measurements that represent risk factors for DM2 in populations of European and African ancestries.
Takamiya, Aline Satie; Monteiro, Douglas Roberto; Bernabé, Daniel Galera; Gorup, Luiz Fernando; Camargo, Emerson Rodrigues; Gomes-Filho, João Eduardo; Oliveira, Sandra Helena Penha; Barbosa, Debora Barros
2016-06-01
Silver nanoparticles have been used for different purposes in dentistry, including endodontic treatments. The aim of this study was to determine the cytotoxicity of different types of silver nanoparticles on mouse fibroblast cell line L929 and the reaction of subcutaneous connective tissue of Wistar rats to these nanoparticles. Silver nanoparticles of an average size of 5 nm were synthesized with ammonia (SNA) or polyvinylpyrrolidone (SNP). L929 was exposed to SNA and SNP (0.1-100 μg/mL), and 3-(4,5-dimethylthiazol-2-yl)-2,5-diphenyltetrazolium bromide and enzyme-linked immunosorbent assays were performed after 6, 24, and 48 hours. Culture medium was used as the control. Sixteen rats received, individually, 3 polyethylene tubes filled with a fibrin sponge embedded in 100 μL SNA or SNP (1 μg/mL). A fibrin sponge with no embedding was the control. Tissue reaction was performed qualitatively and quantitatively after 7, 15, 30, and 90 days of implantation in the dorsal connective tissue of Wistar rats. SNA and SNP were cytotoxic to L929 in higher concentrations, with SNA significantly more toxic than SNP. SNA and SNP did not induce significant interleukin-1β and interleukin-6 production. The release of stem cell factor by L929 increased 48 hours after the treatment with SNP at 5 μg/mL. Histologic examination showed that the inflammatory responses caused by SNA and SNP at 1 μg/mL were similar to the control in all experimental periods. It was concluded that SNA and SNP were not cytotoxic at 25 μg/mL or lower concentrations. However, for safe clinical use, further studies establishing others points of its toxicologic profile are recommended. Copyright © 2016 American Association of Endodontists. Published by Elsevier Inc. All rights reserved.
Yucesoy, Berran; Kaufman, Kenneth M.; Lummus, Zana L.; Weirauch, Matthew T.; Zhang, Ge; Cartier, André; Boulet, Louis-Philippe; Sastre, Joaquin; Quirce, Santiago; Tarlo, Susan M.; Cruz, Maria-Jesus; Munoz, Xavier; Harley, John B.; Bernstein, David I.
2015-01-01
Diisocyanates, reactive chemicals used to produce polyurethane products, are the most common causes of occupational asthma. The aim of this study is to identify susceptibility gene variants that could contribute to the pathogenesis of diisocyanate asthma (DA) using a Genome-Wide Association Study (GWAS) approach. Genome-wide single nucleotide polymorphism (SNP) genotyping was performed in 74 diisocyanate-exposed workers with DA and 824 healthy controls using Omni-2.5 and Omni-5 SNP microarrays. We identified 11 SNPs that exceeded genome-wide significance; the strongest association was for the rs12913832 SNP located on chromosome 15, which has been mapped to the HERC2 gene (p = 6.94 × 10−14). Strong associations were also found for SNPs near the ODZ3 and CDH17 genes on chromosomes 4 and 8 (rs908084, p = 8.59 × 10−9 and rs2514805, p = 1.22 × 10−8, respectively). We also prioritized 38 SNPs with suggestive genome-wide significance (p < 1 × 10−6). Among them, 17 SNPs map to the PITPNC1, ACMSD, ZBTB16, ODZ3, and CDH17 gene loci. Functional genomics data indicate that 2 of the suggestive SNPs (rs2446823 and rs2446824) are located within putative binding sites for the CCAAT/Enhancer Binding Protein (CEBP) and Hepatocyte Nuclear Factor 4, Alpha transcription factors (TFs), respectively. This study identified SNPs mapping to the HERC2, CDH17, and ODZ3 genes as potential susceptibility loci for DA. Pathway analysis indicated that these genes are associated with antigen processing and presentation, and other immune pathways. Overlap of 2 suggestive SNPs with likely TF binding sites suggests possible roles in disruption of gene regulation. These results provide new insights into the genetic architecture of DA and serve as a basis for future functional and mechanistic studies. PMID:25918132
Bentz, Eva-Katrin; Hefler, Lukas A; Kaufmann, Ulrike; Huber, Johannes C; Kolbus, Andrea; Tempfer, Clemens B
2008-07-01
To assess the association between transsexualism and allele and genotype frequencies of the common cytochrome P450 (CYP) 17 -34 T>C single nucleotide polymorphism (SNP). Case-control study. Academic research institution. 102 male-to-female (MtF) and 49 female-to-male (FtM) transsexuals, 756 male controls, and 915 female controls. Buccal swabs and multiplex polymerase chain reaction on a microarray system. Analysis of the CYP17 -34 T>C SNP. CYP17 -34 T>C SNP allele frequencies were statistically significantly different between FtM transsexuals and female controls (CYP17 T: 55/98 [56%] and CYP17 C: 43/98 [44%] versus CYP17 T: 1253/1826 [69%] and CYP17 C: 573/1826 [31%], respectively). In accordance, genotype distributions were also different between FtM transsexuals and female controls using a recessive genotype model (CYP17 T/T+T/C: 39/49 [80%] and C/C 10/49 [20%] vs. CYP17 T/T+T/C: 821/913 [90%] and C/C 92/913 [10%], respectively). The CYP17 -34 T>C allele and genotype distributions were not statistically significantly different between MtF transsexuals and male controls. Of note, the CYP17 -34 T>C allele distribution was gender-specific among controls (CYP17 C: males; 604 of 1512 [40%] vs. females; 573 of 1826 [31%]). The MtF transsexuals had an allele distribution equivalent to male controls, whereas FtM transsexuals did not follow the gender-specific allele distribution of female controls but rather had an allele distribution equivalent to MtF transsexuals and male controls. These data support CYP17 as a candidate gene of FtM transsexualism and indicate that loss of a female-specific CYP17 T -34C allele distribution pattern is associated with FtM transsexualism.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gentry, T.; Schadt, C.; Zhou, J.
Microarray technology has the unparalleled potential tosimultaneously determine the dynamics and/or activities of most, if notall, of the microbial populations in complex environments such as soilsand sediments. Researchers have developed several types of arrays thatcharacterize the microbial populations in these samples based on theirphylogenetic relatedness or functional genomic content. Several recentstudies have used these microarrays to investigate ecological issues;however, most have only analyzed a limited number of samples withrelatively few experiments utilizing the full high-throughput potentialof microarray analysis. This is due in part to the unique analyticalchallenges that these samples present with regard to sensitivity,specificity, quantitation, and data analysis. Thismore » review discussesspecific applications of microarrays to microbial ecology research alongwith some of the latest studies addressing the difficulties encounteredduring analysis of complex microbial communities within environmentalsamples. With continued development, microarray technology may ultimatelyachieve its potential for comprehensive, high-throughput characterizationof microbial populations in near real-time.« less
Kawasaki, Eiji; Awata, Takuya; Ikegami, Hiroshi; Kobayashi, Tetsuro; Maruyama, Taro; Nakanishi, Koji; Shimada, Akira; Uga, Miho; Uga, Mho; Kurihara, Susumu; Kawabata, Yumiko; Tanaka, Shoichiro; Kanazawa, Yasuhiko; Lee, Inkyu; Eguchi, Katsumi
2006-03-15
The protein tyrosine phosphatase, nonreceptor 22 gene (PTPN22) maps to human chromosome 1p13.3-p13.1 and encodes an important negative regulator of T-cell activation, lymphoid-specific phosphatase (Lyp). Recently, the minor allele of a single-nucleotide polymorphism (SNP) at nucleotide position 1858 (rs2476601, +1858C > T) was found to be associated with type 1 diabetes. However, the degree of the association is variable among ethnic populations, suggesting the presence of other disease-associated variants in PTPN22. To examine this possibility, we carried out a systemic search for PTPN22 using direct sequencing of PCR-amplified products in the Japanese population. Association and linkage studies were also conducted in 1,690 Japanese samples, 180 Korean samples, and 472 Caucasian samples from 95 nuclear families. We identified five novel SNPs, but not the +1858C > T SNP. Of these two frequent SNPs, -1123G > C, and +2740C > T were in strong linkage disequilibrium (LD), and the -1123G > C promoter SNP was associated with acute-onset but not slow-onset type 1 diabetes in the Japanese population (odds ratio [OR] = 1.42, 95% CI = 1.07-1.89, P = 0.015). This association was observed also in Korean patients with type 1 diabetes (Mantel-Haenszel chi2= 6.543, P = 0.0105, combined OR = 1.41 95% CI = 1.09-1.82). Furthermore, the affected family-based control (AFBAC) association test and the transmission disequilibrium analysis of multiplex families of European descent from the British Diabetes Association (BDA) Warren Repository indicated that the association was stronger in -1123G > C compared to +1858C > T. In conclusion, the type 1 diabetes association with PTPN22 is confirmed, but it cannot be attributed solely to the +1858C > T variant. The promoter -1123G > C SNP is a more likely causative variant in PTPN22. 2006 Wiley-Liss, Inc.
Prader-Willi Syndrome due to an Unbalanced de novo Translocation t(15;19)(q12;p13.3).
Dang, Vy; Surampalli, Abhilasha; Manzardo, Ann M; Youn, Stephanie; Butler, Merlin G; Gold, June-Anne; Kimonis, Virginia E
2016-01-01
Prader-Willi syndrome (PWS) is a complex, multisystem genetic disorder characterized by endocrine, neurologic, and behavioral abnormalities. We report the first case of an unbalanced de novo reciprocal translocation of chromosomes 15 and 19, 45,XY,-15,der(19)t(15;19)(q12;p13.3), resulting in monosomy for the PWS critical chromosome region. Our patient had several typical features of PWS including infantile hypotonia, a poor suck and feeding difficulties, tantrums, skin picking, compulsions, small hands and feet, and food seeking, but not hypopigmentation, a micropenis, cryptorchidism or obesity as common findings seen in PWS at the time of examination at 6 years of age. He had seizures noted from 1 to 3 years of age and marked cognitive delay. High-resolution SNP microarray analysis identified an atypical PWS type I deletion in chromosome 15 involving the proximal breakpoint BP1. The deletion extended beyond the GABRB3 gene but was proximal to the usual distal breakpoint (BP3) within the 15q11q13 region, and GABRA5, GABRG3, and OCA2 genes were intact. No deletion of band 19p13.3 was detected; therefore, the patient was not at an increased risk of tumors from the Peutz-Jeghers syndrome associated with a deletion of the STK11 gene. © 2016 S. Karger AG, Basel.
Epistasis between polymorphisms in PCSK1 and DBH is associated with premature ovarian failure.
Pyun, Jung-A; Kim, Sunshin; Cha, Dong Hyun; Kwack, KyuBum
2014-11-01
This study examined whether epistasis between single nucleotide polymorphisms (SNPs) within proprotein convertase subtilisin/kexin type 1 (PCSK1) and dopamine β-hydroxylase (DBH) genes is associated with premature ovarian failure (POF). One hundred twenty women with POF and 222 female controls were recruited for this study. To genotype SNPs within PCSK1 and DBH, we used a GoldenGate assay with VeraCode technology, which uses an allele-specific primer extension method. Two SNPs (rs155979 and rs3762986) within PCSK1 and one SNP (rs1611114) within DBH, which were located in the 5' flanking region, were involved in synergistic interactions. The C allele in the rs155979 SNP showed an increased risk of POF in a dominant model when AA genotype in the rs1611114 SNP was present (odds ratio, 3.60; 95% CI, 1.82-7.14; P = 0.00024), whereas the G allele in the rs1611114 SNP showed a reduced risk of POF in a dominant model when at least one C allele at the rs155979 SNP was present (odds ratio, 0.24; 95% CI, 0.11-0.51; P = 0.00018) or one G allele at the rs3762986 SNP was present (odds ratio, 0.33; 95% CI, 0.19-0.60; P = 0.00023). Epistases between SNPs within PCSK1 and DBH genes are significantly associated with susceptibility or resistance to POF.
Wei, Xue-min; Wang, Qing; Gao, Shu-jun; Sui, Long
2011-04-01
To study the relationship between nitric oxide within cervical microenvironment and different HPV types as well as the effect of sodium nitroprusside (SNP), a nitric oxide donor, on the proliferation and apoptosis of cervical cancer cell lines. HPV typing test was assessed from 115 women by using high-risk HPV (HR-HPV) 21 typing test and the release of cervical nitric oxide (NO) was assessed as nitrate, nitrite in cervical fluid. Cervical NO was then compared between women showing different HPV types. Proliferation of Caski and HeLa cervical cells was determined by methyl thiazolyl tetrazolium (MTT) assay, cell apoptosis was detected by flow cytometry after 24 hours treated by different final concentration of SNP (0.125, 0.25, 0.5, 1.0 and 2.0 mmol/L, respectively). The expressions of HPV E6, E7 gene mRNA and p53 protein were detected by SYBR Green I quantitative real-time PCR and western blot. (1) The cervical NO release of women with HR-HPV was higher compared to that in HPV negative women [(47.6±1.4) µmol/L vs (22.8±0.3) µmol/L; P<0.05]; but there was no statistical difference between low-risk HPV (LR-HPV) group [(24.1±1.2) µmol/L] and control group (P>0.05). (2) After 24 hours treated by different final concentration of SNP, the results shown that SNP could inhibited the proliferation and increased apoptosis rate in Caski and HeLa cells, in which the concentration of SNP≥1.0 mmol/L, there were significantly different (P<0.05), while when SNP≥2.0 mmol/L, the proliferation of cells inhibited seriously. Treated by SNP (1.0 mmol/L) 24 hours, the expressions of HPV18 E6, E7 mRNA in HeLa cells were reduced from 27.362±0.191, 22.962±0.053 to 19.181±0.360, 17.571±0.010 and the protein expression of p53 increased from 1.17±0.03 to 0.23±0.05, there were statistically significant differences between adding SNP group and the control group (P<0.05); but there were no statistically significant differences in HPV16 E6, E7 mRNA and that of p53 in Caski cells (P>0.05). The presence of HR-HPV is associated with an increased release of NO in the human uterine cervix; NO could inhibit the growth and proliferation and enhance the apoptosis of cervical cancer cells, inhibit the expression of HPV18 E6, E7 mRNA in HeLa cells and activate the expression of p53 protein, the mechanism may be due to higher sensitivity of HeLa cells (cervical adenocarcinoma cell) to SNP. The increasing release of NO may play a role in regulating the elimination of HPV in cervical microenvironment, which is a part of mucous membrane immunity.
Park, Yu Rang; Chung, Tae Su; Lee, Young Joo; Song, Yeong Wook; Lee, Eun Young; Sohn, Yeo Won; Song, Sukgil; Park, Woong Yang
2012-01-01
Infection by microorganisms may cause fatally erroneous interpretations in the biologic researches based on cell culture. The contamination by microorganism in the cell culture is quite frequent (5% to 35%). However, current approaches to identify the presence of contamination have many limitations such as high cost of time and labor, and difficulty in interpreting the result. In this paper, we propose a model to predict cell infection, using a microarray technique which gives an overview of the whole genome profile. By analysis of 62 microarray expression profiles under various experimental conditions altering cell type, source of infection and collection time, we discovered 5 marker genes, NM_005298, NM_016408, NM_014588, S76389, and NM_001853. In addition, we discovered two of these genes, S76389, and NM_001853, are involved in a Mycolplasma-specific infection process. We also suggest models to predict the source of infection, cell type or time after infection. We implemented a web based prediction tool in microarray data, named Prediction of Microbial Infection (http://www.snubi.org/software/PMI). PMID:23091307
Wahab, Tara; Birdsell, Dawn N.; Hjertqvist, Marika; Mitchell, Cedar L.; Wagner, David M.; Keim, Paul S.; Hedenström, Ingela; Löfdahl, Sven
2014-01-01
Tularaemia, caused by the bacterium Francisella tularensis, is endemic in Sweden and is poorly understood. The aim of this study was to evaluate the effectiveness of three different genetic typing systems to link a genetic type to the source and place of tularemia infection in Sweden. Canonical single nucleotide polymorphisms (canSNPs), MLVA including five variable number of tandem repeat loci and PmeI-PFGE were tested on 127 F. tularensis positive specimens collected from Swedish case-patients. All three typing methods identified two major genetic groups with near-perfect agreement. Higher genetic resolution was obtained with canSNP and MLVA compared to PFGE; F. tularensis samples were first assigned into ten phylogroups based on canSNPs followed by 33 unique MLVA types. Phylogroups were geographically analysed to reveal complex phylogeographic patterns in Sweden. The extensive phylogenetic diversity found within individual counties posed a challenge to linking specific genetic types with specific geographic locations. Despite this, a single phylogroup (B.22), defined by a SNP marker specific to a lone Swedish sequenced strain, did link genetic type with a likely geographic place. This result suggests that SNP markers, highly specific to a particular reference genome, may be found most frequently among samples recovered from the same location where the reference genome originated. This insight compels us to consider whole-genome sequencing (WGS) as the appropriate tool for effectively linking specific genetic type to geography. Comparing the WGS of an unknown sample to WGS databases of archived Swedish strains maximizes the likelihood of revealing those rare geographically informative SNPs. PMID:25401326
Wang, Xuefeng; Lee, Seunggeun; Zhu, Xiaofeng; Redline, Susan; Lin, Xihong
2013-12-01
Family-based genetic association studies of related individuals provide opportunities to detect genetic variants that complement studies of unrelated individuals. Most statistical methods for family association studies for common variants are single marker based, which test one SNP a time. In this paper, we consider testing the effect of an SNP set, e.g., SNPs in a gene, in family studies, for both continuous and discrete traits. Specifically, we propose a generalized estimating equations (GEEs) based kernel association test, a variance component based testing method, to test for the association between a phenotype and multiple variants in an SNP set jointly using family samples. The proposed approach allows for both continuous and discrete traits, where the correlation among family members is taken into account through the use of an empirical covariance estimator. We derive the theoretical distribution of the proposed statistic under the null and develop analytical methods to calculate the P-values. We also propose an efficient resampling method for correcting for small sample size bias in family studies. The proposed method allows for easily incorporating covariates and SNP-SNP interactions. Simulation studies show that the proposed method properly controls for type I error rates under both random and ascertained sampling schemes in family studies. We demonstrate through simulation studies that our approach has superior performance for association mapping compared to the single marker based minimum P-value GEE test for an SNP-set effect over a range of scenarios. We illustrate the application of the proposed method using data from the Cleveland Family GWAS Study. © 2013 WILEY PERIODICALS, INC.
Smit, Maria; Segers, Karin; Carrascosa, Laura Garcia; Shay, Tracy; Baraldi, Francesca; Gyapay, Gabor; Snowder, Gary; Georges, Michel; Cockett, Noelle; Charlier, Carole
2003-01-01
To identify the callipyge mutation, we have resequenced 184 kb spanning the DLK1-, GTL2-, PEG11-, and MEG8-imprinted domain and have identified an A-to-G transition in a highly conserved dodecamer motif between DLK1 and GTL2. This was the only difference found between the callipyge (CLPG) allele and a phylogenetically closely related wild-type allele. We report that this SNP is in perfect association with the callipyge genotype. The demonstration that Solid Gold-the alleged founder ram of the callipyge flock-is mosaic for this SNP virtually proves the causality of this SNP in the determinism of the callipyge phenotype. PMID:12586730
Microelectronic DNA assay for the detection of BRCA1 gene mutations
NASA Technical Reports Server (NTRS)
Chen, Hua; Han, Jie; Li, Jun; Meyyappan, Meyya
2004-01-01
Mutations in BRCA1 are characterized by predisposition to breast cancer, ovarian cancer and prostate cancer as well as colon cancer. Prognosis for this cancer survival depends upon the stage at which cancer is diagnosed. Reliable and rapid mutation detection is crucial for the early diagnosis and treatment. We developed an electronic assay for the detection of a representative single nucleotide polymorphism (SNP), deletion and insertion in BRCA1 gene by the microelectronics microarray instrumentation. The assay is rapid, and it takes 30 minutes for the immobilization of target DNA samples, hybridization, washing and readout. The assay is multiplexing since it is carried out at the same temperature and buffer conditions for each step. The assay is also highly specific, as the signal-to-noise ratio is much larger than recommended value (72.86 to 321.05 vs. 5) for homozygotes genotyping, and signal ratio close to the perfect value 1 for heterozygotes genotyping (1.04).
Richard, Arianne C; Lyons, Paul A; Peters, James E; Biasci, Daniele; Flint, Shaun M; Lee, James C; McKinney, Eoin F; Siegel, Richard M; Smith, Kenneth G C
2014-08-04
Although numerous investigations have compared gene expression microarray platforms, preprocessing methods and batch correction algorithms using constructed spike-in or dilution datasets, there remains a paucity of studies examining the properties of microarray data using diverse biological samples. Most microarray experiments seek to identify subtle differences between samples with variable background noise, a scenario poorly represented by constructed datasets. Thus, microarray users lack important information regarding the complexities introduced in real-world experimental settings. The recent development of a multiplexed, digital technology for nucleic acid measurement enables counting of individual RNA molecules without amplification and, for the first time, permits such a study. Using a set of human leukocyte subset RNA samples, we compared previously acquired microarray expression values with RNA molecule counts determined by the nCounter Analysis System (NanoString Technologies) in selected genes. We found that gene measurements across samples correlated well between the two platforms, particularly for high-variance genes, while genes deemed unexpressed by the nCounter generally had both low expression and low variance on the microarray. Confirming previous findings from spike-in and dilution datasets, this "gold-standard" comparison demonstrated signal compression that varied dramatically by expression level and, to a lesser extent, by dataset. Most importantly, examination of three different cell types revealed that noise levels differed across tissues. Microarray measurements generally correlate with relative RNA molecule counts within optimal ranges but suffer from expression-dependent accuracy bias and precision that varies across datasets. We urge microarray users to consider expression-level effects in signal interpretation and to evaluate noise properties in each dataset independently.
Wang, Boyi; Tan, Hua-Wei; Fang, Wanping; Meinhardt, Lyndel W; Mischke, Sue; Matsumoto, Tracie; Zhang, Dapeng
2015-01-01
Longan (Dimocarpus longan Lour.) is an important tropical fruit tree crop. Accurate varietal identification is essential for germplasm management and breeding. Using longan transcriptome sequences from public databases, we developed single nucleotide polymorphism (SNP) markers; validated 60 SNPs in 50 longan germplasm accessions, including cultivated varieties and wild germplasm; and designated 25 SNP markers that unambiguously identified all tested longan varieties with high statistical rigor (P<0.0001). Multiple trees from the same clone were verified and off-type trees were identified. Diversity analysis revealed genetic relationships among analyzed accessions. Cultivated varieties differed significantly from wild populations (Fst=0.300; P<0.001), demonstrating untapped genetic diversity for germplasm conservation and utilization. Within cultivated varieties, apparent differences between varieties from China and those from Thailand and Hawaii indicated geographic patterns of genetic differentiation. These SNP markers provide a powerful tool to manage longan genetic resources and breeding, with accurate and efficient genotype identification. PMID:26504559
Takasaki, Shigeru
2012-01-01
This paper first explains how the relations between Japanese Alzheimer's disease (AD) patients and their mitochondrial SNP frequencies at individual mtDNA positions examined using the radial basis function (RBF) network and a method based on RBF network predictions and that Japanese AD patients are associated with the haplogroups G2a and N9b1. It then describes a method for the initial diagnosis of Alzheimer's disease that is based on the mtSNP haplogroups of the AD patients. The method examines the relations between someone's mtDNA mutations and the mtSNPs of AD patients. As the mtSNP haplogroups thus obtained indicate which nucleotides of mtDNA loci are changed in the Alzheimer's patients, a person's probability of becoming an AD patient can be predicted by comparing those mtDNA mutations with that person's mtDNA mutations. The proposed method can also be used to diagnose diseases such as Parkinson's disease and type 2 diabetes and to identify people likely to become centenarians. PMID:22848858
HRM and SNaPshot as alternative forensic SNP genotyping methods.
Mehta, Bhavik; Daniel, Runa; McNevin, Dennis
2017-09-01
Single nucleotide polymorphisms (SNPs) have been widely used in forensics for prediction of identity, biogeographical ancestry (BGA) and externally visible characteristics (EVCs). Single base extension (SBE) assays, most notably SNaPshot® (Thermo Fisher Scientific), are commonly used for forensic SNP genotyping as they can be employed on standard instrumentation in forensic laboratories (e.g. capillary electrophoresis). High resolution melt (HRM) analysis is an alternative method and is a simple, fast, single tube assay for low throughput SNP typing. This study compares HRM and SNaPshot®. HRM produced reproducible and concordant genotypes at 500 pg, however, difficulties were encountered when genotyping SNPs with high GC content in flanking regions and differentiating variants of symmetrical SNPs. SNaPshot® was reproducible at 100 pg and is less dependent on SNP choice. HRM has a shorter processing time in comparison to SNaPshot®, avoids post PCR contamination risk and has potential as a screening tool for many forensic applications.
IGF-II gene region polymorphisms related to exertional muscle damage.
Devaney, Joseph M; Hoffman, Eric P; Gordish-Dressman, Heather; Kearns, Amy; Zambraski, Edward; Clarkson, Priscilla M
2007-05-01
We examined the association of a novel single-nucleotide polymorphism (SNP) in IGF-I (IGF-I -C1245T located in the promoter) and eight SNPs in the IGF-II gene region with indicators of muscle damage [strength loss, muscle soreness, and increases in circulating levels of creatine kinase (CK) and myoglobin] after eccentric exercise. We also examined two SNPs in the IGF binding protein-3 (IGFBP-3). The age, height, and body mass of the 151 subjects studied were 24.1 +/- 5.2 yr, 170.8 +/- 9.9 cm, and 73.3 +/- 17.0 kg, respectively. There were no significant associations of phenotypes with IGF-I. IGF-II SNP (G12655A, rs3213216) and IGFBP-3 SNP (A8618T, rs6670) were not significantly associated with any variable. The most significant finding in this study was that for men, IGF-II (C13790G, rs3213221), IGF-II (ApaI, G17200A, rs680), IGF-II antisense (IGF2AS) (G11711T, rs7924316), and IGFBP-3 (-C1592A, rs2132570) were significantly associated with muscle damage indicators. We found that men who were 1) homozygous for the rare IGF-II C13790G allele and rare allele for the ApaI (G17200A) SNP demonstrated the greatest strength loss immediately after exercise, greatest soreness, and highest postexercise serum CK activity; 2) homozygous wild type for IGF2AS (G11711T, rs7924316) had the greatest strength loss and most muscle soreness; and 3) homozygous wild type for the IGF2AS G11711T SNP showed the greatest strength loss, highest muscle soreness, and greater CK and myoglobin response to exercise. In women, fewer significant associations appeared.
Singh, Kanhaiya; Goyal, Prabhjot; Singh, Manju; Deshmukh, Sujit; Upadhyay, Divyesh; Kant, Sri; Agrawal, Neeraj K; Gupta, Sanjeev K; Singh, Kiran
2017-12-01
Retinal angiogenesis is a hallmark of diabetic retinopathy. Matrix Metalloproteinases (MMPs) are involved in degradation of extracellular matrix (ECM). Functional SNP-1562C>T in the promoter of the MMP-9 gene results increase in transcriptional activity. The present work was designed to evaluate the contribution of functional SNP-1562C>T of MMP-9 gene to the risk of proliferative diabetic retinopathy (PDR) in type 2 diabetes mellitus (T2DM) patients in north Indian Population. This Case control study comprised of a total of 645 individuals in which 320 were T2DM patients out of which 73 had PDR, 98 had non- proliferative diabetic retinopathy (NPDR), 149 T2DM cases without any eye related disease (DM) and 325 non diabetic healthy individuals as controls (non DM controls). Genotyping for SNP-1562C>T of MMP-9 was done by polymerase chain reactions followed by restriction analyses with specific endonucleases (PCR-RFLP). DNA sequencing was used to ascertain PCR-RFLP results. T allele frequency in PDR patients was 32.1%, 20.4% in NPDR, 15.4% in DM and 13.7% in controls. Statistically significant difference was observed in both allele and genotype distribution between the PDR versus non-DM control group (p<0.0001 by T allele; p=0.002 by TT and p<0.0001 by CT genotype). The present study suggests that the functional SNP-1562C>T in the promoter of the MMP-9 gene could be regarded as a major risk factor for PDR as increased MMP-9 production from high expressing T allele may promote retinal angiogenesis. Copyright © 2017 Elsevier Inc. All rights reserved.
Motawi, Tarek; Salman, Tarek; Shaker, Olfat
2015-01-01
Introduction Adiponectin is an adipose tissue-specific protein with insulin-sensitizing properties. Many investigators have explored the association between adiponectin single nucleotide polymorphisms (SNPs) and type 2 diabetes mellitus (T2DM) in different ethnic populations from different regions. Leptin is a protein hormone constituting an important signal in the regulation of adipose tissue mass and body weight. The aim of this study was to explore potential associations between SNP +45 T>G of the adiponectin gene and SNP 2548G/A of leptin with T2DM and the effect of SNPs on serum adiponectin and leptin levels. Material and methods From the Egyptian population, we enrolled 110 T2DM patients and 90 non-diabetic controls. Serum lipid profile, blood glucose, serum adiponectin, and leptin were measured. Genotyping for two common SNPs of the adiponectin and leptin genes was performed by polymerase chain reaction–restriction fragment length polymorphism. Results The G allele and TG/GG genotype of SNP 45 occurred more frequently than the T allele and TT genotype in T2DM patients compares to the controls. Subjects with the GG + TG genotype of SNP 45 were at increased risk for T2DM (OR = 6.476; 95% CI: 3.401–12.33) and associated with a low serum adiponectin level compared with the TT genotype. The serum leptin concentration of GA + AA genotype carriers was not significantly different from that of the GG genotype in the diabetic group. Conclusions The G allele carriers who have reduced plasma concentrations of adiponectin may have an association with T2DM, while leptin SNP 2548 G/A is not associated with the risk of development of T2DM in the Egyptian population. PMID:26528333
Marvalim, Charlie; Wong, Jing Xiang Gimson; Sutiman, Natalia; Lim, Wan Teck; Tan, Shao Weng; Kanesvaran, Ravindran; Ng, Quan Sing; Jain, Amit; Ang, Mei Kim; Tan, Wan Ling; Toh, Chee Keong; Tan, Eng Huat; Chowbay, Balram
2017-03-01
The critical role of lysine demethylase 4A (KDM4A), in regulating chromatin structure and consequently in driving cellular proliferation and oncogenesis has been the focus of recent studies. Non-small-cell lung cancer (NSCLC) patients with adenocarcinoma histology who were homozygous for KDM4A single nucleotide polymorphism (SNP)-A482 (rs586339) were recently shown to have significantly worse overall survival (OS) compared with patients with the wild-type or the heterozygous genotype at this locus (hazard ratio=1.68, P=0.042). In the current study, we investigated the association between the same polymorphism with OS in our Asian NSCLC-adenocarcinoma patients comprising Chinese (N=572), Malays (N=50), and Indians (N=22). KDM4A SNP-A482 genotype status was determined by Sanger sequencing. OS was calculated from the date of diagnosis to date of death or censored at the date of last follow-up. Kaplan-Meier analysis, log-rank test, and Cox regression methods were utilized to evaluate OS outcomes. KDM4A SNP-A482 had a minor allele (C) frequency of 18.8% and a major allele (A) frequency of 81.2% in our Asian NSCLC (adenocarcinoma) patients. However, the OS in our Asian NSCLC patients homozygous for KDM4A SNP-A482 was not significantly different from those who were wild type or heterozygous at this locus [CC vs. AA/AC: median OS (95% confidence interval): 40.2 (18.7-61.6) vs. 29.6 (26.9-32.3) months; P=0.858]. The results remained statistically nonsignificant even after adjustment for epidermal growth factor receptor mutational status, suggesting that KDM4A SNP-A482 does not significantly influence OS in Asian NSCLC patients.
DNA microarray-based PCR ribotyping of Clostridium difficile.
Schneeberg, Alexander; Ehricht, Ralf; Slickers, Peter; Baier, Vico; Neubauer, Heinrich; Zimmermann, Stefan; Rabold, Denise; Lübke-Becker, Antina; Seyboldt, Christian
2015-02-01
This study presents a DNA microarray-based assay for fast and simple PCR ribotyping of Clostridium difficile strains. Hybridization probes were designed to query the modularly structured intergenic spacer region (ISR), which is also the template for conventional and PCR ribotyping with subsequent capillary gel electrophoresis (seq-PCR) ribotyping. The probes were derived from sequences available in GenBank as well as from theoretical ISR module combinations. A database of reference hybridization patterns was set up from a collection of 142 well-characterized C. difficile isolates representing 48 seq-PCR ribotypes. The reference hybridization patterns calculated by the arithmetic mean were compared using a similarity matrix analysis. The 48 investigated seq-PCR ribotypes revealed 27 array profiles that were clearly distinguishable. The most frequent human-pathogenic ribotypes 001, 014/020, 027, and 078/126 were discriminated by the microarray. C. difficile strains related to 078/126 (033, 045/FLI01, 078, 126, 126/FLI01, 413, 413/FLI01, 598, 620, 652, and 660) and 014/020 (014, 020, and 449) showed similar hybridization patterns, confirming their genetic relatedness, which was previously reported. A panel of 50 C. difficile field isolates was tested by seq-PCR ribotyping and the DNA microarray-based assay in parallel. Taking into account that the current version of the microarray does not discriminate some closely related seq-PCR ribotypes, all isolates were typed correctly. Moreover, seq-PCR ribotypes without reference profiles available in the database (ribotype 009 and 5 new types) were correctly recognized as new ribotypes, confirming the performance and expansion potential of the microarray. Copyright © 2015, American Society for Microbiology. All Rights Reserved.
A review on SNP and other types of molecular markers and their use in animal genetics
Vignal, Alain; Milan, Denis; SanCristobal, Magali; Eggen, André
2002-01-01
During the last ten years, the use of molecular markers, revealing polymorphism at the DNA level, has been playing an increasing part in animal genetics studies. Amongst others, the microsatellite DNA marker has been the most widely used, due to its easy use by simple PCR, followed by a denaturing gel electrophoresis for allele size determination, and to the high degree of information provided by its large number of alleles per locus. Despite this, a new marker type, named SNP, for Single Nucleotide Polymorphism, is now on the scene and has gained high popularity, even though it is only a bi-allelic type of marker. In this review, we will discuss the reasons for this apparent step backwards, and the pertinence of the use of SNPs in animal genetics, in comparison with other marker types. PMID:12081799
Mentasti, Massimo; Tewolde, Rediat; Aslett, Martin; Harris, Simon R.; Afshar, Baharak; Underwood, Anthony; Harrison, Timothy G.
2016-01-01
Sequence-based typing (SBT), analogous to multilocus sequence typing (MLST), is the current “gold standard” typing method for investigation of legionellosis outbreaks caused by Legionella pneumophila. However, as common sequence types (STs) cause many infections, some investigations remain unresolved. In this study, various whole-genome sequencing (WGS)-based methods were evaluated according to published guidelines, including (i) a single nucleotide polymorphism (SNP)-based method, (ii) extended MLST using different numbers of genes, (iii) determination of gene presence or absence, and (iv) a kmer-based method. L. pneumophila serogroup 1 isolates (n = 106) from the standard “typing panel,” previously used by the European Society for Clinical Microbiology Study Group on Legionella Infections (ESGLI), were tested together with another 229 isolates. Over 98% of isolates were considered typeable using the SNP- and kmer-based methods. Percentages of isolates with complete extended MLST profiles ranged from 99.1% (50 genes) to 86.8% (1,455 genes), while only 41.5% produced a full profile with the gene presence/absence scheme. Replicates demonstrated that all methods offer 100% reproducibility. Indices of discrimination range from 0.972 (ribosomal MLST) to 0.999 (SNP based), and all values were higher than that achieved with SBT (0.940). Epidemiological concordance is generally inversely related to discriminatory power. We propose that an extended MLST scheme with ∼50 genes provides optimal epidemiological concordance while substantially improving the discrimination offered by SBT and can be used as part of a hierarchical typing scheme that should maintain backwards compatibility and increase discrimination where necessary. This analysis will be useful for the ESGLI to design a scheme that has the potential to become the new gold standard typing method for L. pneumophila. PMID:27280420
David, Sophia; Mentasti, Massimo; Tewolde, Rediat; Aslett, Martin; Harris, Simon R; Afshar, Baharak; Underwood, Anthony; Fry, Norman K; Parkhill, Julian; Harrison, Timothy G
2016-08-01
Sequence-based typing (SBT), analogous to multilocus sequence typing (MLST), is the current "gold standard" typing method for investigation of legionellosis outbreaks caused by Legionella pneumophila However, as common sequence types (STs) cause many infections, some investigations remain unresolved. In this study, various whole-genome sequencing (WGS)-based methods were evaluated according to published guidelines, including (i) a single nucleotide polymorphism (SNP)-based method, (ii) extended MLST using different numbers of genes, (iii) determination of gene presence or absence, and (iv) a kmer-based method. L. pneumophila serogroup 1 isolates (n = 106) from the standard "typing panel," previously used by the European Society for Clinical Microbiology Study Group on Legionella Infections (ESGLI), were tested together with another 229 isolates. Over 98% of isolates were considered typeable using the SNP- and kmer-based methods. Percentages of isolates with complete extended MLST profiles ranged from 99.1% (50 genes) to 86.8% (1,455 genes), while only 41.5% produced a full profile with the gene presence/absence scheme. Replicates demonstrated that all methods offer 100% reproducibility. Indices of discrimination range from 0.972 (ribosomal MLST) to 0.999 (SNP based), and all values were higher than that achieved with SBT (0.940). Epidemiological concordance is generally inversely related to discriminatory power. We propose that an extended MLST scheme with ∼50 genes provides optimal epidemiological concordance while substantially improving the discrimination offered by SBT and can be used as part of a hierarchical typing scheme that should maintain backwards compatibility and increase discrimination where necessary. This analysis will be useful for the ESGLI to design a scheme that has the potential to become the new gold standard typing method for L. pneumophila. Copyright © 2016 David et al.
Turankar, R P; Lavania, M; Chaitanya, V S; Sengupta, U; Darlong, J; Darlong, F; Siva Sai, K S R; Jadhav, R S
2014-03-01
The exact mode of transmission of leprosy is not clearly understood; however, many studies have demonstrated active transmission of leprosy around a source case. Families of five active leprosy cases and their household contacts were chosen from a high endemic area in Purulia. Fifty-two soil samples were also collected from different areas of their houses. DNA was extracted from slit-skin smears (SSS) and soil samples and the Mycobacterium leprae-specific RLEP (129 bp) region was amplified using PCR. Molecular typing of M. leprae was performed for all RLEP PCR-positive samples by single nucleotide polymorphism (SNP) typing and confirmation by DNA sequencing. SSS of these five patients and six out of the total 28 contacts were PCR positive for RLEP whereas 17 soil samples out of 52 showed the presence of M. leprae DNA. SNP typing of M. leprae from all RLEP PCR-positive subjects (patients and smear-positive contacts) and 10 soil samples showed the SNP type 1 genotype. M. leprae DNA from the five leprosy patients and the six contacts was further subtyped and the D subtype was noted in all patients and contacts, except for one contact where the C subtype was identified. Typing followed by subtyping of M. leprae clearly revealed that either the contacts were infected by the patients or both patients and contacts had the same source of infection. It also revealed that the type of M. leprae in the soil in the inhabited areas where patients resided was also of the same type as that found in patients. © 2013 The Authors Clinical Microbiology and Infection © 2013 European Society of Clinical Microbiology and Infectious Diseases.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Yang, Jing; Li, Yuan-Yuan; Shanghai Center for Bioinformation Technology, Shanghai 200235
2012-03-02
Highlights: Black-Right-Pointing-Pointer Proper dataset partition can improve the prediction of deleterious nsSNPs. Black-Right-Pointing-Pointer Partition according to original residue type at nsSNP is a good criterion. Black-Right-Pointing-Pointer Similar strategy is supposed promising in other machine learning problems. -- Abstract: Many non-synonymous SNPs (nsSNPs) are associated with diseases, and numerous machine learning methods have been applied to train classifiers for sorting disease-associated nsSNPs from neutral ones. The continuously accumulated nsSNP data allows us to further explore better prediction approaches. In this work, we partitioned the training data into 20 subsets according to either original or substituted amino acid type at the nsSNPmore » site. Using support vector machine (SVM), training classification models on each subset resulted in an overall accuracy of 76.3% or 74.9% depending on the two different partition criteria, while training on the whole dataset obtained an accuracy of only 72.6%. Moreover, the dataset was also randomly divided into 20 subsets, but the corresponding accuracy was only 73.2%. Our results demonstrated that partitioning the whole training dataset into subsets properly, i.e., according to the residue type at the nsSNP site, will improve the performance of the trained classifiers significantly, which should be valuable in developing better tools for predicting the disease-association of nsSNPs.« less
Gelernter, Joel; Sherva, Richard; Koesterer, Ryan; Almasy, Laura; Zhao, Hongyu; Kranzler, Henry R.; Farrer, Lindsay
2013-01-01
We report a GWAS for cocaine dependence (CD) in three sets of African- and European-American subjects (AAs and EAs, respectively), to identify pathways, genes, and alleles important in CD risk. The discovery GWAS dataset (n=5,697 subjects) was genotyped using the Illumina OmniQuad microarray (890,000 analyzed SNPs). Additional genotypes were imputed based on the 1000 Genomes reference panel. Top-ranked findings were evaluated by incorporating information from publicly available GWAS data from 4,063 subjects. Then, the most significant GWAS SNPs were genotyped in 2,549 independent subjects. We observed one genomewide-significant (GWS) result: rs7086629 at the FAM53B (“family with sequence similarity 53, member B”) locus. This was supported in both AAs and EAs; p-value (meta-analysis of all samples) =4.28×10−8. The gene maps to the same chromosomal region as the maximum peak we observed in a previous linkage study. NCOR2 (nuclear receptor corepressor 1) SNP rs150954431 was associated with p=1.19×10−9 in the EA discovery sample. SNP rs2456778, which maps to CDK1 (“cyclin-dependent kinase 1”), was associated with cocaine-induced paranoia in AAs in the discovery sample only (p=4.68×10−8). This is the first study to identify risk variants for CD using GWAS. Our results implicate novel risk loci and provide insights into potential therapeutic and prevention strategies. PMID:23958962
Serin, Elise A. R.; Snoek, L. B.; Nijveen, Harm; Willems, Leo A. J.; Jiménez-Gómez, Jose M.; Hilhorst, Henk W. M.; Ligterink, Wilco
2017-01-01
High-density genetic maps are essential for high resolution mapping of quantitative traits. Here, we present a new genetic map for an Arabidopsis Bayreuth × Shahdara recombinant inbred line (RIL) population, built on RNA-seq data. RNA-seq analysis on 160 RILs of this population identified 30,049 single-nucleotide polymorphisms (SNPs) covering the whole genome. Based on a 100-kbp window SNP binning method, 1059 bin-markers were identified, physically anchored on the genome. The total length of the RNA-seq genetic map spans 471.70 centimorgans (cM) with an average marker distance of 0.45 cM and a maximum marker distance of 4.81 cM. This high resolution genotyping revealed new recombination breakpoints in the population. To highlight the advantages of such high-density map, we compared it to two publicly available genetic maps for the same population, comprising 69 PCR-based markers and 497 gene expression markers derived from microarray data, respectively. In this study, we show that SNP markers can effectively be derived from RNA-seq data. The new RNA-seq map closes many existing gaps in marker coverage, saturating the previously available genetic maps. Quantitative trait locus (QTL) analysis for published phenotypes using the available genetic maps showed increased QTL mapping resolution and reduced QTL confidence interval using the RNA-seq map. The new high-density map is a valuable resource that facilitates the identification of candidate genes and map-based cloning approaches. PMID:29259624
Forensic genetic SNP typing of low-template DNA and highly degraded DNA from crime case samples.
Børsting, Claus; Mogensen, Helle Smidt; Morling, Niels
2013-05-01
Heterozygote imbalances leading to allele drop-outs and disproportionally large stutters leading to allele drop-ins are known stochastic phenomena related to STR typing of low-template DNA (LtDNA). The large stutters and the many drop-ins in typical STR stutter positions are artifacts from the PCR amplification of tandem repeats. These artifacts may be avoided by typing bi-allelic markers instead of STRs. In this work, the SNPforID multiplex assay was used to type LtDNA. A sensitized SNP typing protocol was introduced, that increased signal strengths without increasing noise and without affecting the heterozygote balance. Allele drop-ins were only observed in experiments with 25 pg of DNA and not in experiments with 50 and 100 pg of DNA. The allele drop-in rate in the 25 pg experiments was 0.06% or 100 times lower than what was previously reported for STR typing of LtDNA. A composite model and two different consensus models were used to interpret the SNP data. Correct profiles with 42-49 SNPs were generated from the 50 and 100 pg experiments, whereas a few incorrect genotypes were included in the generated profiles from the 25 pg experiments. With the strict consensus model, between 35 and 48 SNPs were correctly typed in the 25 pg experiments and only one allele drop-out (error rate: 0.07%) was observed in the consensus profiles. A total of 28 crime case samples were selected for typing with the sensitized SNPforID protocol. The samples were previously typed with old STR kits during the crime case investigation and only partial profiles (0-6 STRs) were obtained. Eleven of the samples could not be quantified with the Quantifiler™ Human DNA Quantification kit because of partial or complete inhibition of the PCR. For eight of these samples, SNP typing was only possible when the buffer and DNA polymerase used in the original protocol was replaced with the AmpFℓSTR(®) SEfiler Plus™ Master Mix, which was developed specifically for challenging forensic samples. All the crime case samples were successfully typed with the SNPforID multiplex assay and the match probabilities ranged from 1.1×10(-15) to 7.9×10(-23). In comparison, four of the samples could not be typed with the AmpFℓSTR(®) SEfiler Plus™ kit and the match probabilities were higher than 10(-7) for another six samples. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
The detection and differentiation of canine respiratory pathogens using oligonucleotide microarrays.
Wang, Lih-Chiann; Kuo, Ya-Ting; Chueh, Ling-Ling; Huang, Dean; Lin, Jiunn-Horng
2017-05-01
Canine respiratory diseases are commonly seen in dogs along with co-infections with multiple respiratory pathogens, including viruses and bacteria. Virus infections in even vaccinated dogs were also reported. The clinical signs caused by different respiratory etiological agents are similar, which makes differential diagnosis imperative. An oligonucleotide microarray system was developed in this study. The wild type and vaccine strains of canine distemper virus (CDV), influenza virus, canine herpesvirus (CHV), Bordetella bronchiseptica and Mycoplasma cynos were detected and differentiated simultaneously on a microarray chip. The detection limit is 10, 10, 100, 50 and 50 copy numbers for CDV, influenza virus, CHV, B. bronchiseptica and M. cynos, respectively. The clinical test results of nasal swab samples showed that the microarray had remarkably better efficacy than the multiplex PCR-agarose gel method. The positive detection rate of microarray and agarose gel was 59.0% (n=33) and 41.1% (n=23) among the 56 samples, respectively. CDV vaccine strain and pathogen co-infections were further demonstrated by the microarray but not by the multiplex PCR-agarose gel. The oligonucleotide microarray provides a highly efficient diagnosis alternative that could be applied to clinical usage, greatly assisting in disease therapy and control. Copyright © 2017 Elsevier B.V. All rights reserved.
Su, Lining; Wang, Chunjie; Zheng, Chenqing; Wei, Huiping; Song, Xiaoqing
2018-04-13
Parkinson's disease (PD) is a long-term degenerative disease that is caused by environmental and genetic factors. The networks of genes and their regulators that control the progression and development of PD require further elucidation. We examine common differentially expressed genes (DEGs) from several PD blood and substantia nigra (SN) microarray datasets by meta-analysis. Further we screen the PD-specific genes from common DEGs using GCBI. Next, we used a series of bioinformatics software to analyze the miRNAs, lncRNAs and SNPs associated with the common PD-specific genes, and then identify the mTF-miRNA-gene-gTF network. Our results identified 36 common DEGs in PD blood studies and 17 common DEGs in PD SN studies, and five of the genes were previously known to be associated with PD. Further study of the regulatory miRNAs associated with the common PD-specific genes revealed 14 PD-specific miRNAs in our study. Analysis of the mTF-miRNA-gene-gTF network about PD-specific genes revealed two feed-forward loops: one involving the SPRK2 gene, hsa-miR-19a-3p and SPI1, and the second involving the SPRK2 gene, hsa-miR-17-3p and SPI. The long non-coding RNA (lncRNA)-mediated regulatory network identified lncRNAs associated with PD-specific genes and PD-specific miRNAs. Moreover, single nucleotide polymorphism (SNP) analysis of the PD-specific genes identified two significant SNPs, and SNP analysis of the neurodegenerative disease-specific genes identified seven significant SNPs. Most of these SNPs are present in the 3'-untranslated region of genes and are controlled by several miRNAs. Our study identified a total of 53 common DEGs in PD patients compared with healthy controls in blood and brain datasets and five of these genes were previously linked with PD. Regulatory network analysis identified PD-specific miRNAs, associated long non-coding RNA and feed-forward loops, which contribute to our understanding of the mechanisms underlying PD. The SNPs identified in our study can determine whether a genetic variant is associated with PD. Overall, these findings will help guide our study of the complex molecular mechanism of PD.
Genomic resources for Myzus persicae: EST sequencing, SNP identification, and microarray design
Ramsey, John S; Wilson, Alex CC; de Vos, Martin; Sun, Qi; Tamborindeguy, Cecilia; Winfield, Agnese; Malloch, Gaynor; Smith, Dawn M; Fenton, Brian; Gray, Stewart M; Jander, Georg
2007-01-01
Background The green peach aphid, Myzus persicae (Sulzer), is a world-wide insect pest capable of infesting more than 40 plant families, including many crop species. However, despite the significant damage inflicted by M. persicae in agricultural systems through direct feeding damage and by its ability to transmit plant viruses, limited genomic information is available for this species. Results Sequencing of 16 M. persicae cDNA libraries generated 26,669 expressed sequence tags (ESTs). Aphids for library construction were raised on Arabidopsis thaliana, Nicotiana benthamiana, Brassica oleracea, B. napus, and Physalis floridana (with and without Potato leafroll virus infection). The M. persicae cDNA libraries include ones made from sexual and asexual whole aphids, guts, heads, and salivary glands. In silico comparison of cDNA libraries identified aphid genes with tissue-specific expression patterns, and gene expression that is induced by feeding on Nicotiana benthamiana. Furthermore, 2423 genes that are novel to science and potentially aphid-specific were identified. Comparison of cDNA data from three aphid lineages identified single nucleotide polymorphisms that can be used as genetic markers and, in some cases, may represent functional differences in the protein products. In particular, non-conservative amino acid substitutions in a highly expressed gut protease may be of adaptive significance for M. persicae feeding on different host plants. The Agilent eArray platform was used to design an M. persicae oligonucleotide microarray representing over 10,000 unique genes. Conclusion New genomic resources have been developed for M. persicae, an agriculturally important insect pest. These include previously unknown sequence data, a collection of expressed genes, molecular markers, and a DNA microarray that can be used to study aphid gene expression. These resources will help elucidate the adaptations that allow M. persicae to develop compatible interactions with its host plants, complementing ongoing work illuminating plant molecular responses to phloem-feeding insects. PMID:18021414
2011-01-01
Background Integration of genomic variation with phenotypic information is an effective approach for uncovering genotype-phenotype associations. This requires an accurate identification of the different types of variation in individual genomes. Results We report the integration of the whole genome sequence of a single Holstein Friesian bull with data from single nucleotide polymorphism (SNP) and comparative genomic hybridization (CGH) array technologies to determine a comprehensive spectrum of genomic variation. The performance of resequencing SNP detection was assessed by combining SNPs that were identified to be either in identity by descent (IBD) or in copy number variation (CNV) with results from SNP array genotyping. Coding insertions and deletions (indels) were found to be enriched for size in multiples of 3 and were located near the N- and C-termini of proteins. For larger indels, a combination of split-read and read-pair approaches proved to be complementary in finding different signatures. CNVs were identified on the basis of the depth of sequenced reads, and by using SNP and CGH arrays. Conclusions Our results provide high resolution mapping of diverse classes of genomic variation in an individual bovine genome and demonstrate that structural variation surpasses sequence variation as the main component of genomic variability. Better accuracy of SNP detection was achieved with little loss of sensitivity when algorithms that implemented mapping quality were used. IBD regions were found to be instrumental for calculating resequencing SNP accuracy, while SNP detection within CNVs tended to be less reliable. CNV discovery was affected dramatically by platform resolution and coverage biases. The combined data for this study showed that at a moderate level of sequencing coverage, an ensemble of platforms and tools can be applied together to maximize the accurate detection of sequence and structural variants. PMID:22082336
Law, Amanda J.; Lipska, Barbara K.; Weickert, Cynthia Shannon; Hyde, Thomas M.; Straub, Richard E.; Hashimoto, Ryota; Harrison, Paul J.; Kleinman, Joel E.; Weinberger, Daniel R.
2006-01-01
Genetic variation in neuregulin 1 (NRG1) is associated with schizophrenia. The disease-associated SNPs are noncoding, and their functional implications remain unknown. We hypothesized that differential expression of the NRG1 gene explains its association to the disease. We examined four of the disease-associated SNPs that make up the original risk haplotype in the 5′ upstream region of the gene for their effects on mRNA abundance of NRG1 types I–IV in human postmortem hippocampus. Diagnostic comparisons revealed a 34% increase in type I mRNA in schizophrenia and an interaction of diagnosis and genotype (SNP8NRG221132) on this transcript. Of potentially greater interest, a single SNP within the risk haplotype (SNP8NRG243177) and a 22-kb block of this core haplotype are associated with mRNA expression for the novel type IV isoform in patients and controls. Bioinformatic promoter analyses indicate that both SNPs lead to a gain/loss of putative binding sites for three transcription factors, serum response factor, myelin transcription factor-1, and High Mobility Group Box Protein-1. These data implicate variation in isoform expression as a molecular mechanism for the genetic association of NRG1 with schizophrenia. PMID:16618933
Sub-micro-liter Electrochemical Single-Nucleotide-Polymorphism Detector for Lab-on-a-Chip System
NASA Astrophysics Data System (ADS)
Tanaka, Hiroyuki; Fiorini, Paolo; Peeters, Sara; Majeed, Bivragh; Sterken, Tom; de Beeck, Maaike Op; Hayashi, Miho; Yaku, Hidenobu; Yamashita, Ichiro
2012-04-01
A sub-micro-liter single-nucleotide-polymorphism (SNP) detector for lab-on-a-chip applications is developed. This detector enables a fast, sensitive, and selective SNP detection directly from human blood. The detector is fabricated on a Si substrate by a standard complementary metal oxide semiconductor/micro electro mechanical systems (CMOS/MEMS) process and Polydimethylsiloxane (PDMS) molding. Stable and reproducible measurements are obtained by implementing an on-chip Ag/AgCl electrode and encapsulating the detector. The detector senses the presence of SNPs by measuring the concentration of pyrophosphoric acid generated during selective DNA amplification. A 0.5-µL-volume detector enabled the successful performance of the typing of a SNP within the ABO gene using human blood. The measured sensitivity is 566 pA/µM.
Manaffar, R; Zare, S; Agh, N; Abdolahzadeh, N; Soltanian, S; Sorgeloos, P; Bossier, P; Van Stappen, G
2011-01-01
In order to find a marker for differentiating between a bisexual and a parthenogenetic Artemia strain, Exon-7 of the Na/K ATPase α(1) subunit gene was screened by RFLP technique. The results revealed a constant synonymous SNP (single nucleotide polymorphism) in digestion by the Tru1I enzyme that was consistent with these two types of Artemia. This SNP was identified as an accurate molecular marker for discrimination between bisexual and parthenogenetic Artemia. According to the Nei's genetic distance (1973), the lowest genetic distance was found between individuals from Artemia urmiana Günther 1890 and parthenogenetic populations, making the described marker the first marker to easily distinguish between these two cooccurring species. © 2010 Blackwell Publishing Ltd.
Stochastic models for inferring genetic regulation from microarray gene expression data.
Tian, Tianhai
2010-03-01
Microarray expression profiles are inherently noisy and many different sources of variation exist in microarray experiments. It is still a significant challenge to develop stochastic models to realize noise in microarray expression profiles, which has profound influence on the reverse engineering of genetic regulation. Using the target genes of the tumour suppressor gene p53 as the test problem, we developed stochastic differential equation models and established the relationship between the noise strength of stochastic models and parameters of an error model for describing the distribution of the microarray measurements. Numerical results indicate that the simulated variance from stochastic models with a stochastic degradation process can be represented by a monomial in terms of the hybridization intensity and the order of the monomial depends on the type of stochastic process. The developed stochastic models with multiple stochastic processes generated simulations whose variance is consistent with the prediction of the error model. This work also established a general method to develop stochastic models from experimental information. 2009 Elsevier Ireland Ltd. All rights reserved.
Forensic validation of the SNPforID 52-plex assay.
Musgrave-Brown, Esther; Ballard, David; Balogh, Kinga; Bender, Klaus; Berger, Burkhard; Bogus, Magdalena; Børsting, Claus; Brion, María; Fondevila, Manuel; Harrison, Cheryl; Oguzturun, Ceylan; Parson, Walther; Phillips, Chris; Proff, Carsten; Ramos-Luis, Eva; Sanchez, Juan J; Sánchez Diz, Paula; Sobrino Rey, Bea; Stradmann-Bellinghausen, Beate; Thacker, Catherine; Carracedo, Angel; Morling, Niels; Scheithauer, Richard; Schneider, Peter M; Syndercombe Court, Denise
2007-06-01
The advantages of single nucleotide polymorphism (SNP) typing in forensic genetics are well known and include a wider choice of high-throughput typing platforms, lower mutation rates, and improved analysis of degraded samples. However, if SNPs are to become a realistic supplement to current short tandem repeat (STR) typing methods, they must be shown to successfully and reliably analyse the challenging samples commonly encountered in casework situations. The European SNPforID consortium, supported by the EU GROWTH programme, has developed a multiplex of 52 SNPs for forensic analysis, with the amplification of all 52 loci in a single reaction followed by two single base extension (SBE) reactions which are detected with capillary electrophoresis. In order to validate this assay, a variety of DNA extracts were chosen to represent problems such as low copy number and degradation that are commonly seen in forensic casework. A total of 40 extracts were used in the study, each of which was sent to two of the five participating laboratories for typing in duplicate or triplicate. Laboratories were instructed to carry out their analyses as if they were dealing with normal casework samples. Results were reported back to the coordinating laboratory and compared with those obtained from traditional STR typing of the same extracts using Powerplex 16 (Promega). These results indicate that, although the ability to successfully type good quality, low copy number extracts is lower, the 52-plex SNP assay performed better than STR typing on degraded samples, and also on samples that were both degraded and of limited quantity, suggesting that SNP analysis can provide advantages over STR analysis in forensically relevant circumstances. However, there were also additional problems arising from contamination and primer quality issues and these are discussed.
Montazeri, Zahra; Yanofsky, Corey M; Bickel, David R
2010-01-01
Research on analyzing microarray data has focused on the problem of identifying differentially expressed genes to the neglect of the problem of how to integrate evidence that a gene is differentially expressed with information on the extent of its differential expression. Consequently, researchers currently prioritize genes for further study either on the basis of volcano plots or, more commonly, according to simple estimates of the fold change after filtering the genes with an arbitrary statistical significance threshold. While the subjective and informal nature of the former practice precludes quantification of its reliability, the latter practice is equivalent to using a hard-threshold estimator of the expression ratio that is not known to perform well in terms of mean-squared error, the sum of estimator variance and squared estimator bias. On the basis of two distinct simulation studies and data from different microarray studies, we systematically compared the performance of several estimators representing both current practice and shrinkage. We find that the threshold-based estimators usually perform worse than the maximum-likelihood estimator (MLE) and they often perform far worse as quantified by estimated mean-squared risk. By contrast, the shrinkage estimators tend to perform as well as or better than the MLE and never much worse than the MLE, as expected from what is known about shrinkage. However, a Bayesian measure of performance based on the prior information that few genes are differentially expressed indicates that hard-threshold estimators perform about as well as the local false discovery rate (FDR), the best of the shrinkage estimators studied. Based on the ability of the latter to leverage information across genes, we conclude that the use of the local-FDR estimator of the fold change instead of informal or threshold-based combinations of statistical tests and non-shrinkage estimators can be expected to substantially improve the reliability of gene prioritization at very little risk of doing so less reliably. Since the proposed replacement of post-selection estimates with shrunken estimates applies as well to other types of high-dimensional data, it could also improve the analysis of SNP data from genome-wide association studies.
Steinbacher, Peter; Feichtinger, René G; Kedenko, Lyudmyla; Kedenko, Igor; Reinhardt, Sandra; Schönauer, Anna-Lena; Leitner, Isabella; Sänger, Alexandra M; Stoiber, Walter; Kofler, Barbara; Förster, Holger; Paulweber, Bernhard; Ring-Dimitriou, Susanne
2015-01-01
PGC-1α (peroxisome proliferator-activated receptor γ co-activator 1α) is an important regulator of mitochondrial biogenesis and a master regulator of enzymes involved in oxidative phosphorylation. Recent evidence demonstrated that the Gly482Ser single nucleotide polymorphism (SNP) in the PGC-1α gene affects insulin sensitivity, blood lipid metabolism and binding to myocyte enhancer factor 2 (MEF2). Individuals carrying this SNP were shown to have a reduced cardiorespiratory fitness and a higher risk to develop type 2 diabetes. Here, we investigated the responses of untrained men with the Gly482Ser SNP to a 10 week programme of endurance training (cycling, 3 x 60 min/week, heart rate at 70-90% VO2peak). Quantitative data from analysis of biopsies from vastus lateralis muscle revealed that the SNP group, in contrast to the control group, lacked a training-induced increase in content of slow contracting oxidative fibres. Capillary supply, mitochondrial density, mitochondrial enzyme activities and intramyocellular lipid content increased similarly in both groups. These results indicate that the impaired binding of MEF2 to PGC-1α in humans with this SNP impedes exercise-induced fast-to-slow muscle fibre transformation.
Multiplexed SNP genotyping using the Qbead™ system: a quantum dot-encoded microsphere-based assay
Xu, Hongxia; Sha, Michael Y.; Wong, Edith Y.; Uphoff, Janet; Xu, Yanzhang; Treadway, Joseph A.; Truong, Anh; O’Brien, Eamonn; Asquith, Steven; Stubbins, Michael; Spurr, Nigel K.; Lai, Eric H.; Mahoney, Walt
2003-01-01
We have developed a new method using the Qbead™ system for high-throughput genotyping of single nucleotide polymorphisms (SNPs). The Qbead system employs fluorescent Qdot™ semiconductor nanocrystals, also known as quantum dots, to encode microspheres that subsequently can be used as a platform for multiplexed assays. By combining mixtures of quantum dots with distinct emission wavelengths and intensities, unique spectral ‘barcodes’ are created that enable the high levels of multiplexing required for complex genetic analyses. Here, we applied the Qbead system to SNP genotyping by encoding microspheres conjugated to allele-specific oligonucleotides. After hybridization of oligonucleotides to amplicons produced by multiplexed PCR of genomic DNA, individual microspheres are analyzed by flow cytometry and each SNP is distinguished by its unique spectral barcode. Using 10 model SNPs, we validated the Qbead system as an accurate and reliable technique for multiplexed SNP genotyping. By modifying the types of probes conjugated to microspheres, the Qbead system can easily be adapted to other assay chemistries for SNP genotyping as well as to other applications such as analysis of gene expression and protein–protein interactions. With its capability for high-throughput automation, the Qbead system has the potential to be a robust and cost-effective platform for a number of applications. PMID:12682378
Zain, Maryam; Awan, Fazli Rabbi; Cooper, Jackie A; Li, Ka Wah; Palmen, Jutta; Acharya, Jay; Howard, Philip; Baig, Shahid M; Elkeles, Robert S; Stephens, Jeffrey W; Ireland, Helen; Humphries, Steve E
2014-09-01
To determine the sequence variant of TLL1 gene (rs1503298, T > C) in three British cohorts (PREDICT, UDACS and ED) of patients with type-2 Diabetes mellitus (T2DM) in order to assess its association with coronary heart disease (CHD). Analytical study. UCL, London, UK. Participants were genotyped in 2011-2012 for TLL1 SNP. Samples and related information were previously collected in 2001-2003 for PREDICT, and in 2001-2002 for UDACS and ED groups. Patients included in PREDICT (n=600), UDACS (n=1020) and ED (n=1240) had Diabetes. TLL1 SNP (rs1503298, T > C) was genotyped using TaqMan technology. Allele frequencies were compared using c2 test, and tested for Hardy-Weinberg equilibrium. The risk of disease was assessed from Odds ratios (OR) with 95% Confidence Intervals (95% CI). Moreover, for the PREDICT cohort, the SNP association was tested with Coronary Artery Calcification (CAC) scores. No significant association was found for this SNP with CHD or CAC scores in these cohorts. This SNP could not be confirmed as a risk factor for CHD in T2DM patients. However, the low power of thesmall sample size available is a limitation to the modest effect on risk. Further studies in larger samples would be useful.
Steinbacher, Peter; Feichtinger, René G.; Kedenko, Lyudmyla; Kedenko, Igor; Reinhardt, Sandra; Schönauer, Anna-Lena; Leitner, Isabella; Sänger, Alexandra M.; Stoiber, Walter; Kofler, Barbara; Förster, Holger; Paulweber, Bernhard; Ring-Dimitriou, Susanne
2015-01-01
PGC-1α (peroxisome proliferator-activated receptor γ co-activator 1α) is an important regulator of mitochondrial biogenesis and a master regulator of enzymes involved in oxidative phosphorylation. Recent evidence demonstrated that the Gly482Ser single nucleotide polymorphism (SNP) in the PGC-1α gene affects insulin sensitivity, blood lipid metabolism and binding to myocyte enhancer factor 2 (MEF2). Individuals carrying this SNP were shown to have a reduced cardiorespiratory fitness and a higher risk to develop type 2 diabetes. Here, we investigated the responses of untrained men with the Gly482Ser SNP to a 10 week programme of endurance training (cycling, 3 x 60 min/week, heart rate at 70-90% VO2peak). Quantitative data from analysis of biopsies from vastus lateralis muscle revealed that the SNP group, in contrast to the control group, lacked a training-induced increase in content of slow contracting oxidative fibres. Capillary supply, mitochondrial density, mitochondrial enzyme activities and intramyocellular lipid content increased similarly in both groups. These results indicate that the impaired binding of MEF2 to PGC-1α in humans with this SNP impedes exercise-induced fast-to-slow muscle fibre transformation. PMID:25886402
Evaluation of artificial time series microarray data for dynamic gene regulatory network inference.
Xenitidis, P; Seimenis, I; Kakolyris, S; Adamopoulos, A
2017-08-07
High-throughput technology like microarrays is widely used in the inference of gene regulatory networks (GRNs). We focused on time series data since we are interested in the dynamics of GRNs and the identification of dynamic networks. We evaluated the amount of information that exists in artificial time series microarray data and the ability of an inference process to produce accurate models based on them. We used dynamic artificial gene regulatory networks in order to create artificial microarray data. Key features that characterize microarray data such as the time separation of directly triggered genes, the percentage of directly triggered genes and the triggering function type were altered in order to reveal the limits that are imposed by the nature of microarray data on the inference process. We examined the effect of various factors on the inference performance such as the network size, the presence of noise in microarray data, and the network sparseness. We used a system theory approach and examined the relationship between the pole placement of the inferred system and the inference performance. We examined the relationship between the inference performance in the time domain and the true system parameter identification. Simulation results indicated that time separation and the percentage of directly triggered genes are crucial factors. Also, network sparseness, the triggering function type and noise in input data affect the inference performance. When two factors were simultaneously varied, it was found that variation of one parameter significantly affects the dynamic response of the other. Crucial factors were also examined using a real GRN and acquired results confirmed simulation findings with artificial data. Different initial conditions were also used as an alternative triggering approach. Relevant results confirmed that the number of datasets constitutes the most significant parameter with regard to the inference performance. Copyright © 2017 Elsevier Ltd. All rights reserved.
Focused Screening of ECM-Selective Adhesion Peptides on Cellulose-Bound Peptide Microarrays.
Kanie, Kei; Kondo, Yuto; Owaki, Junki; Ikeda, Yurika; Narita, Yuji; Kato, Ryuji; Honda, Hiroyuki
2016-11-19
The coating of surfaces with bio-functional proteins is a promising strategy for the creation of highly biocompatible medical implants. Bio-functional proteins from the extracellular matrix (ECM) provide effective surface functions for controlling cellular behavior. We have previously screened bio-functional tripeptides for feasibility of mass production with the aim of identifying those that are medically useful, such as cell-selective peptides. In this work, we focused on the screening of tripeptides that selectively accumulate collagen type IV (Col IV), an ECM protein that accelerates the re-endothelialization of medical implants. A SPOT peptide microarray was selected for screening owing to its unique cellulose membrane platform, which can mimic fibrous scaffolds used in regenerative medicine. However, since the library size on the SPOT microarray was limited, physicochemical clustering was used to provide broader variation than that of random peptide selection. Using the custom focused microarray of 500 selected peptides, we assayed the relative binding rates of tripeptides to Col IV, collagen type I (Col I), and albumin. We discovered a cluster of Col IV-selective adhesion peptides that exhibit bio-safety with endothelial cells. The results from this study can be used to improve the screening of regeneration-enhancing peptides.
Focused Screening of ECM-Selective Adhesion Peptides on Cellulose-Bound Peptide Microarrays
Kanie, Kei; Kondo, Yuto; Owaki, Junki; Ikeda, Yurika; Narita, Yuji; Kato, Ryuji; Honda, Hiroyuki
2016-01-01
The coating of surfaces with bio-functional proteins is a promising strategy for the creation of highly biocompatible medical implants. Bio-functional proteins from the extracellular matrix (ECM) provide effective surface functions for controlling cellular behavior. We have previously screened bio-functional tripeptides for feasibility of mass production with the aim of identifying those that are medically useful, such as cell-selective peptides. In this work, we focused on the screening of tripeptides that selectively accumulate collagen type IV (Col IV), an ECM protein that accelerates the re-endothelialization of medical implants. A SPOT peptide microarray was selected for screening owing to its unique cellulose membrane platform, which can mimic fibrous scaffolds used in regenerative medicine. However, since the library size on the SPOT microarray was limited, physicochemical clustering was used to provide broader variation than that of random peptide selection. Using the custom focused microarray of 500 selected peptides, we assayed the relative binding rates of tripeptides to Col IV, collagen type I (Col I), and albumin. We discovered a cluster of Col IV-selective adhesion peptides that exhibit bio-safety with endothelial cells. The results from this study can be used to improve the screening of regeneration-enhancing peptides. PMID:28952593
Novianti, Putri W; Roes, Kit C B; Eijkemans, Marinus J C
2014-01-01
Classification methods used in microarray studies for gene expression are diverse in the way they deal with the underlying complexity of the data, as well as in the technique used to build the classification model. The MAQC II study on cancer classification problems has found that performance was affected by factors such as the classification algorithm, cross validation method, number of genes, and gene selection method. In this paper, we study the hypothesis that the disease under study significantly determines which method is optimal, and that additionally sample size, class imbalance, type of medical question (diagnostic, prognostic or treatment response), and microarray platform are potentially influential. A systematic literature review was used to extract the information from 48 published articles on non-cancer microarray classification studies. The impact of the various factors on the reported classification accuracy was analyzed through random-intercept logistic regression. The type of medical question and method of cross validation dominated the explained variation in accuracy among studies, followed by disease category and microarray platform. In total, 42% of the between study variation was explained by all the study specific and problem specific factors that we studied together.
2012-01-01
Background Cucurbita pepo is a member of the Cucurbitaceae family, the second- most important horticultural family in terms of economic importance after Solanaceae. The "summer squash" types, including Zucchini and Scallop, rank among the highest-valued vegetables worldwide. There are few genomic tools available for this species. The first Cucurbita transcriptome, along with a large collection of Single Nucleotide Polymorphisms (SNP), was recently generated using massive sequencing. A set of 384 SNP was selected to generate an Illumina GoldenGate assay in order to construct the first SNP-based genetic map of Cucurbita and map quantitative trait loci (QTL). Results We herein present the construction of the first SNP-based genetic map of Cucurbita pepo using a population derived from the cross of two varieties with contrasting phenotypes, representing the main cultivar groups of the species' two subspecies: Zucchini (subsp. pepo) × Scallop (subsp. ovifera). The mapping population was genotyped with 384 SNP, a set of selected EST-SNP identified in silico after massive sequencing of the transcriptomes of both parents, using the Illumina GoldenGate platform. The global success rate of the assay was higher than 85%. In total, 304 SNP were mapped, along with 11 SSR from a previous map, giving a map density of 5.56 cM/marker. This map was used to infer syntenic relationships between C. pepo and cucumber and to successfully map QTL that control plant, flowering and fruit traits that are of benefit to squash breeding. The QTL effects were validated in backcross populations. Conclusion Our results show that massive sequencing in different genotypes is an excellent tool for SNP discovery, and that the Illumina GoldenGate platform can be successfully applied to constructing genetic maps and performing QTL analysis in Cucurbita. This is the first SNP-based genetic map in the Cucurbita genus and is an invaluable new tool for biological research, especially considering that most of these markers are located in the coding regions of genes involved in different physiological processes. The platform will also be useful for future mapping and diversity studies, and will be essential in order to accelerate the process of breeding new and better-adapted squash varieties. PMID:22356647
Ma, Jianping; Wang, Jufang; Liu, Yanfen; Wang, Changyi; Duan, Donghui; Lu, Nanjia; Wang, Kaiyue; Zhang, Lu; Gu, Kaibo; Chen, Sihan; Zhang, Tao; You, Dingyun; Han, Liyuan
2017-02-01
The aim of this study was to compare the expression levels of serum miRNAs in diabetic retinopathy and type 2 diabetes mellitus. Serum miRNA expression profiles from diabetic retinopathy cases (type 2 diabetes mellitus patients with diabetic retinopathy) and type 2 diabetes mellitus controls (type 2 diabetes mellitus patients without diabetic retinopathy) were examined by miRNA-specific microarray analysis. Quantitative real-time polymerase chain reaction was used to validate the significantly differentially expressed serum miRNAs from the microarray analysis of 45 diabetic retinopathy cases and 45 age-, sex-, body mass index- and duration-of-diabetes-matched type 2 diabetes mellitus controls. The relative changes in serum miRNA expression levels were analyzed using the 2-ΔΔCt method. A total of 5 diabetic retinopathy cases and 5 type 2 diabetes mellitus controls were included in the miRNA-specific microarray analysis. The serum levels of miR-3939 and miR-1910-3p differed significantly between the two groups in the screening stage; however, quantitative real-time polymerase chain reaction did not reveal significant differences in miRNA expression for 45 diabetic retinopathy cases and their matched type 2 diabetes mellitus controls. Our findings indicate that miR-3939 and miR-1910-3p may not play important roles in the development of diabetic retinopathy; however, studies with a larger sample size are needed to confirm our findings.
Cephalometric analysis of the middle part of the face in patients with mandibular prognathism.
Cutović Tatjana; Jović, Nebojsa; Kozomara, Ruzica; Radojicić, Julija; Janosević, Mirjana; Mladenović, Irena; Matijević, Stevo
2014-11-01
The middle part of the face, that is the maxilla, has always been mentioned as a possible etiologic factor of skeletal Class III. However, the importance of the relationship of maxillary retroposition towards the cranial base is still unclear, although it has been examined many times. The aim of this study was to conduct cephalometric analysis of the morphology of maxilla, including the whole middle part of the face in patients with divergent and convergent facial types of mandibular prognathism, as well as to determine differences betweeen them. Lateral cephalometric teleradiograph images of 90 patients were analyzed at the Dental Clinic of the Military Medical Academy, Belgrade, Serbia. All the patients were male, aged 18-35 years, not previously treated orthodontically. On the basis of dentalskeletal relations of jaws and teeth, the patients were divided into three groups: the group P1 (patients with divergent facial type of mandibular prognathism), P2 (patients with convergent facial type of mandibular pragmathism) and the group E (control group or eugnathic patients). A total of 9 cephalometric parameters related to the middle face were measured and analyzed: the length of the hard palate--SnaSnp, the length of the maxillary corpus--AptmPP, the length of the soft palate, the angle between the hard and soft palate--SnaSnpUt, the angle of inclination of the maxillary alveolar process, the angle of inclination of the upper front teeth, the effective maxillary length--CoA, the posterior maxillary alveolar hyperplasia--U6PP and the angle of maxillary prognathism. The obtained results showed that the CoA, AptmPP and SnaSnp were significally shorter in patients with divergent facial type of mandibular prognathism compared to patients with convergent facial type of the mandibular prognathism and also in both experimental groups of patients compared to the control group. SnaSnp was significantly shorter in patients with divergent facial type of mandibular prognathism compared to the control group, whereas SnaSnp was significantly smaller in patients with convergent facial type of mandibular prognathism compared to the control group. Additionally, there was a pronounced incisor dentoalveolar compensation of skeletal discrepancy in both groups of patients with mandibular prognathism manifested in the form of a significant upper front teeth protrusion, but without significant differences among the groups, while the maxillary retrognathism was present in most patients of both experimental groups. A pronounced UGPP was found only in the patients with divergent type of mandibular prognathism. The maxilla is certainly one of the key factors which contributes to making the diagnosis, but primarily to making a plan for mandibular prognathism treatment Accurate assessment of the manifestation of abnormality, localization of skeletal problems and understanding of the biological potential are key factors of the stability of/the results of surgical-orthodontic treatment of this abnormality.
Zhao, Nan; Han, Jing Ginger; Shyu, Chi-Ren; Korkin, Dmitry
2014-01-01
Single nucleotide polymorphisms (SNPs) are among the most common types of genetic variation in complex genetic disorders. A growing number of studies link the functional role of SNPs with the networks and pathways mediated by the disease-associated genes. For example, many non-synonymous missense SNPs (nsSNPs) have been found near or inside the protein-protein interaction (PPI) interfaces. Determining whether such nsSNP will disrupt or preserve a PPI is a challenging task to address, both experimentally and computationally. Here, we present this task as three related classification problems, and develop a new computational method, called the SNP-IN tool (non-synonymous SNP INteraction effect predictor). Our method predicts the effects of nsSNPs on PPIs, given the interaction's structure. It leverages supervised and semi-supervised feature-based classifiers, including our new Random Forest self-learning protocol. The classifiers are trained based on a dataset of comprehensive mutagenesis studies for 151 PPI complexes, with experimentally determined binding affinities of the mutant and wild-type interactions. Three classification problems were considered: (1) a 2-class problem (strengthening/weakening PPI mutations), (2) another 2-class problem (mutations that disrupt/preserve a PPI), and (3) a 3-class classification (detrimental/neutral/beneficial mutation effects). In total, 11 different supervised and semi-supervised classifiers were trained and assessed resulting in a promising performance, with the weighted f-measure ranging from 0.87 for Problem 1 to 0.70 for the most challenging Problem 3. By integrating prediction results of the 2-class classifiers into the 3-class classifier, we further improved its performance for Problem 3. To demonstrate the utility of SNP-IN tool, it was applied to study the nsSNP-induced rewiring of two disease-centered networks. The accurate and balanced performance of SNP-IN tool makes it readily available to study the rewiring of large-scale protein-protein interaction networks, and can be useful for functional annotation of disease-associated SNPs. SNIP-IN tool is freely accessible as a web-server at http://korkinlab.org/snpintool/. PMID:24784581
Szental, Joshua A; Baird, Paul N; Richardson, Andrea J; Islam, F M Amirul; Scholl, Hendrik P N; Charbel Issa, Peter; Holz, Frank G; Gillies, Mark; Guymer, Robyn H
2010-12-01
Recent imaging studies have suggested that macular pigment is decreased centrally in macular telangiectasia type 2 (MT2). The uptake of xanthophyll pigment into the macula is thought to be facilitated by a xanthophyll-binding protein (XBP). The Pi isoform of glutathione S-transferase (GSTP1) represents one such XBP with high binding affinity. This case-control study aimed to determine whether two common single-nucleotide polymorphisms (SNPs) of GSTP1 were associated with MT2. DNA samples from 39 cases and 21 controls were collected. Two polymorphic sites of Ile105Val and Ala114Val in exons 5 and 6 respectively, of the GSTP1 gene were analysed. Comparison of alleles and genotypes between cases and controls indicated that there were no statistically significant differences for either the Ile105Val SNP (P=0.43) or the Ala114Val SNP (P=0.85), or for any combinations; however, the homozygous at-risk genotype (GG) of the Ile105Val SNP was present in 8% of cases but absent in controls. This study found no statistically significant association between two common GSTP1 SNPs and MT2; however, a trend towards a greater frequency of the GG genotype of the Ile105Val SNP in cases is of great interest. The biological plausibility of disturbed macular pigment uptake in MT2 makes GSTP1 an excellent candidate gene. Further investigation is warranted in future studies of MT2.
Genetic source tracking of an anthrax outbreak in Shaanxi province, China.
Liu, Dong-Li; Wei, Jian-Chun; Chen, Qiu-Lan; Guo, Xue-Jun; Zhang, En-Min; He, Li; Liang, Xu-Dong; Ma, Guo-Zhu; Zhou, Ti-Cao; Yin, Wen-Wu; Liu, Wei; Liu, Kai; Shi, Yi; Ji, Jian-Jun; Zhang, Hui-Juan; Ma, Lin; Zhang, Fa-Xin; Zhang, Zhi-Kai; Zhou, Hang; Yu, Hong-Jie; Kan, Biao; Xu, Jian-Guo; Liu, Feng; Li, Wei
2017-01-17
Anthrax is an acute zoonotic infectious disease caused by the bacterium known as Bacillus anthracis. From 26 July to 8 August 2015, an outbreak with 20 suspected cutaneous anthrax cases was reported in Ganquan County, Shaanxi province in China. The genetic source tracking analysis of the anthrax outbreak was performed by molecular epidemiological methods in this study. Three molecular typing methods, namely canonical single nucleotide polymorphisms (canSNP), multiple-locus variable-number tandem repeat analysis (MLVA), and single nucleotide repeat (SNR) analysis, were used to investigate the possible source of transmission and identify the genetic relationship among the strains isolated from human cases and diseased animals during the outbreak. Five strains isolated from diseased mules were clustered together with patients' isolates using canSNP typing and MLVA. The causative B. anthracis lineages in this outbreak belonged to the A.Br.001/002 canSNP subgroup and the MLVA15-31 genotype (the 31 genotype in MLVA15 scheme). Because nine isolates from another four provinces in China were clustered together with outbreak-related strains by the canSNP (A.Br.001/002 subgroup) and MLVA15 method (MLVA15-31 genotype), still another SNR analysis (CL10, CL12, CL33, and CL35) was used to source track the outbreak, and the results suggesting that these patients in the anthrax outbreak were probably infected by the same pathogen clone. It was deduced that the anthrax outbreak occurred in Shaanxi province, China in 2015 was a local occurrence.
Hernandez-Ferrer, Carles; Quintela Garcia, Ines; Danielski, Katharina; Carracedo, Ángel; Pérez-Jurado, Luis A; González, Juan R
2015-05-20
The well-known Genome-Wide Association Studies (GWAS) had led to many scientific discoveries using SNP data. Even so, they were not able to explain the full heritability of complex diseases. Now, other structural variants like copy number variants or DNA inversions, either germ-line or in mosaicism events, are being studies. We present the R package affy2sv to pre-process Affymetrix CytoScan HD/750k array (also for Genome-Wide SNP 5.0/6.0 and Axiom) in structural variant studies. We illustrate the capabilities of affy2sv using two different complete pipelines on real data. The first one performing a GWAS and a mosaic alterations detection study, and the other detecting CNVs and performing an inversion calling. Both examples presented in the article show up how affy2sv can be used as part of more complex pipelines aimed to analyze Affymetrix SNP arrays data in genetic association studies, where different types of structural variants are considered.
Ando, Tetsuya; Komaki, Gen; Nishimura, Hiroki; Naruo, Tetsuro; Okabe, Kenjiro; Kawai, Keisuke; Takii, Masato; Oka, Takakazu; Kodama, Naoki; Nakamoto, Chiemi; Ishikawa, Toshio; Suzuki-Hotta, Mari; Minatozaki, Kazunori; Yamaguchi, Chikara; Nishizono-Maher, Aya; Kono, Masaki; Kajiwara, Sohei; Suematsu, Hiroyuki; Tomita, Yuichiro; Ebana, Shoichi; Okamoto, Yuri; Nagata, Katsutaro; Nakai, Yoshikatsu; Koide, Masanori; Kobayashi, Nobuyuki; Kurokawa, Nobuo; Nagata, Toshihiko; Kiriike, Nobuo; Takenaka, Yoshito; Nagamine, Kiyohide; Ookuma, Kazuyoshi; Murata, Shiho
2010-08-01
Patients with anorexia nervosa restricting type (AN-R) often develop bulimic symptoms and crossover to AN-binge eating/purging type (AN-BP), or to bulimia nervosa (BN). We have reported earlier that genetic variants of an orexigenic peptide ghrelin are associated with BN. Here, the relationship between a ghrelin gene variant and the rate of change from AN-R to other phenotypes of eating disorders (EDs) was investigated. Participants were 165 patients with ED, initially diagnosed as AN-R. The dates of their AN-R onset and changes in diagnosis to other subtypes of ED were investigated retrospectively. Ghrelin gene 3056 T-->C SNP (single nucleotide polymorphism) was genotyped. Probability and hazard ratios were analyzed using life table analysis and Cox's proportional hazard regression model, in which the starting point was the time of AN-R onset and the outcome events were the time of (i) onset of binge eating, that is, when patients changed to binge eating AN and BN and (ii) recovery of normal weight, that is, when patients changed to BN or remission. Patients with the TT genotype at 3056 T-->C had a higher probability and hazard ratio for recovery of normal weight. The ghrelin SNP was not related with the onset of binge eating. The 3056 T-->C SNP of the ghrelin gene is related to the probability and the rate of recovery of normal body weight from restricting-type AN.
Nonlinear matching measure for the analysis of on-off type DNA microarray images
NASA Astrophysics Data System (ADS)
Kim, Jong D.; Park, Misun; Kim, Jongwon
2003-07-01
In this paper, we propose a new nonlinear matching measure for automatic analysis of the on-off type DNA microarray images in which the hybridized spots are detected by the template matching method. The targeting spots of HPV DNA chips are designed for genotyping the human papilloma virus(HPV). The proposed measure is obtained by binarythresholding over the whole template region and taking the number of white pixels inside the spotted area. This measure is evaluated in terms of the accuracy of the estimated marker location to show better performance than the normalized covariance.
Locke, Jonathan M; Wei, Fan-Yan; Tomizawa, Kazuhito; Weedon, Michael N; Harries, Lorna W
2015-04-01
Intronic single nucleotide polymorphisms (SNPs) in the CDKAL1 gene are associated with risk of developing type 2 diabetes. A strong correlation between risk alleles and lower levels of the non-coding RNA, CDKAL1-v1, has recently been reported in whole blood extracted from Japanese individuals. We sought to replicate this association in two independent cohorts: one using whole blood from white UK-resident individuals, and one using a collection of human pancreatic islets, a more relevant tissue type to study with respect to the aetiology of diabetes. Levels of CDKAL1-v1 were measured by real-time PCR using RNA extracted from human whole blood (n = 70) and human pancreatic islets (n = 48). Expression with respect to genotype was then determined. In a simple linear regression model, expression of CDKAL1-v1 was associated with the lead type 2 diabetes-associated SNP, rs7756992, in whole blood and islets. However, these associations were abolished or substantially reduced in multiple regression models taking into account rs9366357 genotype: a moderately linked SNP explaining a much larger amount of the variation in CDKAL1-v1 levels, but not strongly associated with risk of type 2 diabetes. Contrary to previous findings, we provide evidence against a role for dysregulated expression of CDKAL1-v1 in mediating the association between intronic SNPs in CDKAL1 and susceptibility to type 2 diabetes. The results of this study illustrate how caution should be exercised when inferring causality from an association between disease-risk genotype and non-coding RNA expression.
Malhis, Nawar; Butterfield, Yaron S N; Ester, Martin; Jones, Steven J M
2009-01-01
A plethora of alignment tools have been created that are designed to best fit different types of alignment conditions. While some of these are made for aligning Illumina Sequence Analyzer reads, none of these are fully utilizing its probability (prb) output. In this article, we will introduce a new alignment approach (Slider) that reduces the alignment problem space by utilizing each read base's probabilities given in the prb files. Compared with other aligners, Slider has higher alignment accuracy and efficiency. In addition, given that Slider matches bases with probabilities other than the most probable, it significantly reduces the percentage of base mismatches. The result is that its SNP predictions are more accurate than other SNP prediction approaches used today that start from the most probable sequence, including those using base quality.
Automated typing of red blood cell and platelet antigens: a whole-genome sequencing study.
Lane, William J; Westhoff, Connie M; Gleadall, Nicholas S; Aguad, Maria; Smeland-Wagman, Robin; Vege, Sunitha; Simmons, Daimon P; Mah, Helen H; Lebo, Matthew S; Walter, Klaudia; Soranzo, Nicole; Di Angelantonio, Emanuele; Danesh, John; Roberts, David J; Watkins, Nick A; Ouwehand, Willem H; Butterworth, Adam S; Kaufman, Richard M; Rehm, Heidi L; Silberstein, Leslie E; Green, Robert C
2018-06-01
There are more than 300 known red blood cell (RBC) antigens and 33 platelet antigens that differ between individuals. Sensitisation to antigens is a serious complication that can occur in prenatal medicine and after blood transfusion, particularly for patients who require multiple transfusions. Although pre-transfusion compatibility testing largely relies on serological methods, reagents are not available for many antigens. Methods based on single-nucleotide polymorphism (SNP) arrays have been used, but typing for ABO and Rh-the most important blood groups-cannot be done with SNP typing alone. We aimed to develop a novel method based on whole-genome sequencing to identify RBC and platelet antigens. This whole-genome sequencing study is a subanalysis of data from patients in the whole-genome sequencing arm of the MedSeq Project randomised controlled trial (NCT01736566) with no measured patient outcomes. We created a database of molecular changes in RBC and platelet antigens and developed an automated antigen-typing algorithm based on whole-genome sequencing (bloodTyper). This algorithm was iteratively improved to address cis-trans haplotype ambiguities and homologous gene alignments. Whole-genome sequencing data from 110 MedSeq participants (30 × depth) were used to initially validate bloodTyper through comparison with conventional serology and SNP methods for typing of 38 RBC antigens in 12 blood-group systems and 22 human platelet antigens. bloodTyper was further validated with whole-genome sequencing data from 200 INTERVAL trial participants (15 × depth) with serological comparisons. We iteratively improved bloodTyper by comparing its typing results with conventional serological and SNP typing in three rounds of testing. The initial whole-genome sequencing typing algorithm was 99·5% concordant across the first 20 MedSeq genomes. Addressing discordances led to development of an improved algorithm that was 99·8% concordant for the remaining 90 MedSeq genomes. Additional modifications led to the final algorithm, which was 99·2% concordant across 200 INTERVAL genomes (or 99·9% after adjustment for the lower depth of coverage). By enabling more precise antigen-matching of patients with blood donors, antigen typing based on whole-genome sequencing provides a novel approach to improve transfusion outcomes with the potential to transform the practice of transfusion medicine. National Human Genome Research Institute, Doris Duke Charitable Foundation, National Health Service Blood and Transplant, National Institute for Health Research, and Wellcome Trust. Copyright © 2018 Elsevier Ltd. All rights reserved.
Sirois, Francine; Kaefer, Nadine; Currie, Krista A; Chrétien, Michel; Nkongolo, Kabwe K; Mbikay, Majambu
2012-10-01
The PCSK1 (proprotein convertase subtilisin/kexin type 1) locus encodes proprotein convertase 1/3, an endoprotease that converts prohormones and proneuropeptides to their active forms. Spontaneous loss-of-function mutations in the coding sequence of its gene have been linked to obesity in humans. Minor alleles of two common non-synonymous single-nucleotide polymorphisms (SNPs), rs6232 (T > C, N221D) and rs6235 (C > G, S690T), have been associated with increased risk of obesity in European populations. In this study, we compared the frequencies of the rs6232 and rs6234 (G > C, Q665E) SNPs in Aboriginal and Caucasian populations of Northern Ontario. The two SNPs were all relatively less frequent in Aboriginals: The minor allele frequency of the rs6232 SNP was 0.01 in Aboriginals and 0.08 in Caucasians (P < 4.10(-6)); for the rs6234 SNP, it was 0.20 and 0.32, respectively (P < 0.001). Resequencing revealed that the rs6234 SNP variation was tightly linked to that of the rs6235 SNP, as previously reported. Most interestingly, all carriers of the rs6232 SNP variation also carried the rs6234/rs6235 SNP clustered variations, but not the reverse, suggesting the former occurred later on an allele already carrying the latter. These data indicate that, in Northern Ontario Aboriginals, the triple-variant PCSK1 allele is relatively rare and might be of lesser significance for obesity risk in this population.
Efficient selection of tagging single-nucleotide polymorphisms in multiple populations.
Howie, Bryan N; Carlson, Christopher S; Rieder, Mark J; Nickerson, Deborah A
2006-08-01
Common genetic polymorphism may explain a portion of the heritable risk for common diseases, so considerable effort has been devoted to finding and typing common single-nucleotide polymorphisms (SNPs) in the human genome. Many SNPs show correlated genotypes, or linkage disequilibrium (LD), suggesting that only a subset of all SNPs (known as tagging SNPs, or tagSNPs) need to be genotyped for disease association studies. Based on the genetic differences that exist among human populations, most tagSNP sets are defined in a single population and applied only in populations that are closely related. To improve the efficiency of multi-population analyses, we have developed an algorithm called MultiPop-TagSelect that finds a near-minimal union of population-specific tagSNP sets across an arbitrary number of populations. We present this approach as an extension of LD-select, a tagSNP selection method that uses a greedy algorithm to group SNPs into bins based on their pairwise association patterns, although the MultiPop-TagSelect algorithm could be used with any SNP tagging approach that allows choices between nearly equivalent SNPs. We evaluate the algorithm by considering tagSNP selection in candidate-gene resequencing data and lower density whole-chromosome data. Our analysis reveals that an exhaustive search is often intractable, while the developed algorithm can quickly and reliably find near-optimal solutions even for difficult tagSNP selection problems. Using populations of African, Asian, and European ancestry, we also show that an optimal multi-population set of tagSNPs can be substantially smaller (up to 44%) than a typical set obtained through independent or sequential selection.
Nitric Oxide Exerts Basal and Insulin-Dependent Anorexigenic Actions in POMC Hypothalamic Neurons
Wellhauser, Leigh; Chalmers, Jennifer A.
2016-01-01
The arcuate nucleus of the hypothalamus represents a key center for the control of appetite and feeding through the regulation of 2 key neuronal populations, notably agouti-related peptide/neuropeptide Y and proopimelanocortin (POMC)/cocaine- and amphetamine-regulated transcript neurons. Altered regulation of these neuronal networks, in particular the dysfunction of POMC neurons upon high-fat consumption, is a major pathogenic mechanism involved in the development of obesity and type 2 diabetes mellitus. Efforts are underway to preserve the integrity or enhance the functionality of POMC neurons in order to prevent or treat these metabolic diseases. Here, we report for the first time that the nitric oxide (NO−) donor, sodium nitroprusside (SNP) mediates anorexigenic actions in both hypothalamic tissue and hypothalamic-derived cell models by mediating the up-regulation of POMC levels. SNP increased POMC mRNA in a dose-dependent manner and enhanced α-melanocortin-secreting hormone production and secretion in mHypoA-POMC/GFP-2 cells. SNP also enhanced insulin-driven POMC expression likely by inhibiting the deacetylase activity of sirtuin 1. Furthermore, SNP enhanced insulin-dependent POMC expression, likely by reducing the transcriptional repression of Foxo1 on the POMC gene. Prolonged SNP exposure prevented the development of insulin resistance. Taken together, the NO− donor SNP enhances the anorexigenic potential of POMC neurons by promoting its transcriptional expression independent and in cooperation with insulin. Thus, increasing cellular NO− levels represents a hormone-independent method of promoting anorexigenic output from the existing POMC neuronal populations and may be advantageous in the fight against these prevalent disorders. PMID:26930171
SNP ID-info: SNP ID searching and visualization platform.
Yang, Cheng-Hong; Chuang, Li-Yeh; Cheng, Yu-Huei; Wen, Cheng-Hao; Chang, Phei-Lang; Chang, Hsueh-Wei
2008-09-01
Many association studies provide the relationship between single nucleotide polymorphisms (SNPs), diseases and cancers, without giving a SNP ID, however. Here, we developed the SNP ID-info freeware to provide the SNP IDs within inputting genetic and physical information of genomes. The program provides an "SNP-ePCR" function to generate the full-sequence using primers and template inputs. In "SNPosition," sequence from SNP-ePCR or direct input is fed to match the SNP IDs from SNP fasta-sequence. In "SNP search" and "SNP fasta" function, information of SNPs within the cytogenetic band, contig position, and keyword input are acceptable. Finally, the SNP ID neighboring environment for inputs is completely visualized in the order of contig position and marked with SNP and flanking hits. The SNP identification problems inherent in NCBI SNP BLAST are also avoided. In conclusion, the SNP ID-info provides a visualized SNP ID environment for multiple inputs and assists systematic SNP association studies. The server and user manual are available at http://bio.kuas.edu.tw/snpid-info.
NASA Astrophysics Data System (ADS)
Roy, Swarup
2018-02-01
Interaction of greenly synthesized silver nanoparticles (SNP) and lysozyme (Lys) has been studied using spectroscopy. From UV-Vis study it is observed that a moderate association constant (Kapp) of 5.36 × 104 L/mol giving an indication of interaction. Fluorescence emission and time resolved study, confirm static mode of quenching phenomena and the binding constant (Kb) was 25.12, 3.98 and 1.99 × 103 L/mol at 298, 305 and 312 K respectively and the number of binding sites (n) was found to be ∼1. Using temperature dependent fluorimetric data, thermodynamic parameters calculated (Enthalpy change, ΔH = -143.95 kJ/mol, Entropy change, ΔS = -400.32 J/mol/K, Gibbs free energy change, ΔG = -24.66 kJ/mol at 298 K) and resulting insight indicative of weak force (van der Walls interaction & H-bonding) as key feature for the Lys-SNP interaction. By following Förster's non-radiative energy transfer (FRET) theory, average binding distance (r = 3.05 nm) was calculated and observed that nonradiative type energy transfer between SNP and Lys. What is more, circular dichroism (CD) spectra indicates presence of SNP does not display substantial alteration in the secondary structure of Lys. Hence, this results may be very useful for the well thought of essential aspects of binding between the Lys and SNP.
Fragomeni, Breno de Oliveira; Misztal, Ignacy; Lourenco, Daniela Lino; Aguilar, Ignacio; Okimoto, Ronald; Muir, William M
2014-01-01
The purpose of this study was to determine if the set of genomic regions inferred as accounting for the majority of genetic variation in quantitative traits remain stable over multiple generations of selection. The data set contained phenotypes for five generations of broiler chicken for body weight, breast meat, and leg score. The population consisted of 294,632 animals over five generations and also included genotypes of 41,036 single nucleotide polymorphism (SNP) for 4,866 animals, after quality control. The SNP effects were calculated by a GWAS type analysis using single step genomic BLUP approach for generations 1-3, 2-4, 3-5, and 1-5. Variances were calculated for windows of 20 SNP. The top ten windows for each trait that explained the largest fraction of the genetic variance across generations were examined. Across generations, the top 10 windows explained more than 0.5% but less than 1% of the total variance. Also, the pattern of the windows was not consistent across generations. The windows that explained the greatest variance changed greatly among the combinations of generations, with a few exceptions. In many cases, a window identified as top for one combination, explained less than 0.1% for the other combinations. We conclude that identification of top SNP windows for a population may have little predictive power for genetic selection in the following generations for the traits here evaluated.
Burgarella, Sarah; Cattaneo, Dario; Pinciroli, Francesco; Masseroli, Marco
2005-12-01
Improvements of bio-nano-technologies and biomolecular techniques have led to increasing production of high-throughput experimental data. Spotted cDNA microarray is one of the most diffuse technologies, used in single research laboratories and in biotechnology service facilities. Although they are routinely performed, spotted microarray experiments are complex procedures entailing several experimental steps and actors with different technical skills and roles. During an experiment, involved actors, who can also be located in a distance, need to access and share specific experiment information according to their roles. Furthermore, complete information describing all experimental steps must be orderly collected to allow subsequent correct interpretation of experimental results. We developed MicroGen, a web system for managing information and workflow in the production pipeline of spotted microarray experiments. It is constituted of a core multi-database system able to store all data completely characterizing different spotted microarray experiments according to the Minimum Information About Microarray Experiments (MIAME) standard, and of an intuitive and user-friendly web interface able to support the collaborative work required among multidisciplinary actors and roles involved in spotted microarray experiment production. MicroGen supports six types of user roles: the researcher who designs and requests the experiment, the spotting operator, the hybridisation operator, the image processing operator, the system administrator, and the generic public user who can access the unrestricted part of the system to get information about MicroGen services. MicroGen represents a MIAME compliant information system that enables managing workflow and supporting collaborative work in spotted microarray experiment production.
Visible red and infrared light alters gene expression in human marrow stromal fibroblast cells.
Guo, J; Wang, Q; Wai, D; Zhang, Q Z; Shi, S H; Le, A D; Shi, S T; Yen, S L-K
2015-04-01
This study tested whether or not gene expression in human marrow stromal fibroblast (MSF) cells depends on light wavelength and energy density. Primary cultures of isolated human bone marrow stem cells (hBMSC) were exposed to visible red (VR, 633 nm) and infrared (IR, 830 nm) radiation wavelengths from a light emitting diode (LED) over a range of energy densities (0.5, 1.0, 1.5, and 2.0 Joules/cm2) Cultured cells were assayed for cell proliferation, osteogenic potential, adipogenesis, mRNA and protein content. mRNA was analyzed by microarray and compared among different wavelengths and energy densities. Mesenchymal and epithelial cell responses were compared to determine whether responses were cell type specific. Protein array analysis was used to further analyze key pathways identified by microarrays. Different wavelengths and energy densities produced unique sets of genes identified by microarray analysis. Pathway analysis pointed to TGF-beta 1 in the visible red and Akt 1 in the infrared wavelengths as key pathways to study. TGF-beta protein arrays suggested switching from canonical to non-canonical TGF-beta pathways with increases to longer IR wavelengths. Microarrays suggest RANKL and MMP 10 followed IR energy density dose-response curves. Epithelial and mesenchymal cells respond differently to stimulation by light suggesting cell type-specific response is possible. These studies demonstrate differential gene expression with different wavelengths, energy densities and cell types. These differences in gene expression have the potential to be exploited for therapeutic purposes and can help explain contradictory results in the literature when wavelengths, energy densities and cell types differ. © 2015 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hsia, Chu Chieh; Chizhikov, Vladimir E.; Yang, Amy X.
Hepatitis B virus (HBV), hepatitis C virus (HCV), and human immunodeficiency virus type-1 (HIV-1) are transfusion-transmitted human pathogens that have a major impact on blood safety and public health worldwide. We developed a microarray multiplex assay for the simultaneous detection and discrimination of these three viruses. The microarray consists of 16 oligonucleotide probes, immobilized on a silylated glass slide. Amplicons from multiplex PCR were labeled with Cy-5 and hybridized to the microarray. The assay detected 1 International Unit (IU), 10 IU, 20 IU of HBV, HCV, and HIV-1, respectively, in a single multiplex reaction. The assay also detected and discriminatedmore » the presence of two or three of these viruses in a single sample. Our data represent a proof-of-concept for the possible use of highly sensitive multiplex microarray assay to screen and confirm the presence of these viruses in blood donors and patients.« less
Employing image processing techniques for cancer detection using microarray images.
Dehghan Khalilabad, Nastaran; Hassanpour, Hamid
2017-02-01
Microarray technology is a powerful genomic tool for simultaneously studying and analyzing the behavior of thousands of genes. The analysis of images obtained from this technology plays a critical role in the detection and treatment of diseases. The aim of the current study is to develop an automated system for analyzing data from microarray images in order to detect cancerous cases. The proposed system consists of three main phases, namely image processing, data mining, and the detection of the disease. The image processing phase performs operations such as refining image rotation, gridding (locating genes) and extracting raw data from images the data mining includes normalizing the extracted data and selecting the more effective genes. Finally, via the extracted data, cancerous cell is recognized. To evaluate the performance of the proposed system, microarray database is employed which includes Breast cancer, Myeloid Leukemia and Lymphomas from the Stanford Microarray Database. The results indicate that the proposed system is able to identify the type of cancer from the data set with an accuracy of 95.45%, 94.11%, and 100%, respectively. Copyright © 2017 Elsevier Ltd. All rights reserved.
Parental origin of sequence variants associated with complex diseases.
Kong, Augustine; Steinthorsdottir, Valgerdur; Masson, Gisli; Thorleifsson, Gudmar; Sulem, Patrick; Besenbacher, Soren; Jonasdottir, Aslaug; Sigurdsson, Asgeir; Kristinsson, Kari Th; Jonasdottir, Adalbjorg; Frigge, Michael L; Gylfason, Arnaldur; Olason, Pall I; Gudjonsson, Sigurjon A; Sverrisson, Sverrir; Stacey, Simon N; Sigurgeirsson, Bardur; Benediktsdottir, Kristrun R; Sigurdsson, Helgi; Jonsson, Thorvaldur; Benediktsson, Rafn; Olafsson, Jon H; Johannsson, Oskar Th; Hreidarsson, Astradur B; Sigurdsson, Gunnar; Ferguson-Smith, Anne C; Gudbjartsson, Daniel F; Thorsteinsdottir, Unnur; Stefansson, Kari
2009-12-17
Effects of susceptibility variants may depend on from which parent they are inherited. Although many associations between sequence variants and human traits have been discovered through genome-wide associations, the impact of parental origin has largely been ignored. Here we show that for 38,167 Icelanders genotyped using single nucleotide polymorphism (SNP) chips, the parental origin of most alleles can be determined. For this we used a combination of genealogy and long-range phasing. We then focused on SNPs that associate with diseases and are within 500 kilobases of known imprinted genes. Seven independent SNP associations were examined. Five-one with breast cancer, one with basal-cell carcinoma and three with type 2 diabetes-have parental-origin-specific associations. These variants are located in two genomic regions, 11p15 and 7q32, each harbouring a cluster of imprinted genes. Furthermore, we observed a novel association between the SNP rs2334499 at 11p15 and type 2 diabetes. Here the allele that confers risk when paternally inherited is protective when maternally transmitted. We identified a differentially methylated CTCF-binding site at 11p15 and demonstrated correlation of rs2334499 with decreased methylation of that site.
Zhang, Li; Mitani, Yoshitsugu; Caulin, Carlos; Rao, Pulivarthi H; Kies, Merrill S; Saintigny, Pierre; Zhang, Nianxiang; Weber, Randal S; Lippman, Scott M; El-Naggar, Adel K
2013-06-01
The molecular genetic alterations underlying the development and diversity of salivary gland carcinomas are largely unknown. To characterize these events, comparative genomic hybridization analysis was performed, using a single-nucleotide polymorphism microarray platform, of 60 fresh-frozen specimens that represent the main salivary carcinoma types: mucoepidermoid carcinoma (MEC), adenoid cystic carcinoma (ACC), and salivary duct carcinoma (SDC). The results were correlated with the clinicopathologic features and translocation statuses to characterize the genetic alterations. The most commonly shared copy number abnormalities (CNAs) in all types were losses at chromosomes 6q23-26 and the 9p21 region. Subtype-specific CNAs included a loss at 12q11-12 in ACC and a gain at 17q11-12 in SDC. Focal copy number losses included 1p36.33-p36-22 in ACC, 9p13.2 in MEC, and 3p12.3-q11-2, 6q21-22.1, 12q14.1, and 12q15 in SDC. Tumor-specific amplicons were identified at 11q23.3 (PVRL1) in ACC, 11q13.3 (NUMA1) in MEC, and 6p21.1 (CCND3), 9p13.2 (PAX5), 12q15 (CNOT2/RAB3IP), 12q21.1 (GLIPR1L1), and 17q12 (ERBB2/CCL4) in SDC. A comparative CNA analysis of fusion-positive and fusion-negative ACCs and MECs revealed relatively lower CNAs in fusion-positive tumors than in fusion-negative tumors in both tumor types. An association between CNAs and high grade and advanced stage was observed in MECs only. These findings support the pathogenetic segregation of these entities and define novel chromosomal sites for future identification of biomarkers and therapeutic targets. Copyright © 2013 American Society for Investigative Pathology. Published by Elsevier Inc. All rights reserved.
Alcón, Soledad; Morales, Sara; Camello, Pedro J; Hemming, Jason M; Jennings, Lee; Mawe, Gary M; Pozo, María J
2001-01-01
The purpose of this study was to determine the effects of sodium nitroprusside (SNP), 2,2′-(hydroxynitrosohydrazino)bis-ethanamine (DETA/NO) and 3-morpholinosydnonimine (SIN-1), NO donors which yield different NO reactive species (NO+, NO. and peroxynitrite, respectively), as well as exogenous peroxynitrite, on gall bladder contractility. Under resting tone conditions, SNP induced a dose-dependent contraction with a maximal effect (10.3 ± 0.7 mN, s.e.m.) at 1 mm. Consistent with these findings, SNP caused a concentration-dependent depolarization of gall bladder smooth muscle. The excitatory effects of SNP were dependent on extracellular calcium entry through L-type Ca2+ channels. Furthermore, the contraction and depolarization were sensitive to tyrosine kinase blockade, and an associated increase in tyrosine phosphorylation was detected in Western blot studies. DETA/NO induced dose-dependent relaxing effects. These relaxations were sensitive to the guanylyl cyclase inhibitor 1H-[1,2,4]oxidiazolo[4,3-a]quinoxaline-1-one (ODQ, 2 μm) but they were not altered by treatment with the potassium channel blockers tetraethylammoniun (TEA, 5 mm) and 4-aminopyridine (4-AP, 5 mm). When tested in a reducing environment (created by 2.5 mm 1,4-dithiothreitol, DTT), SNP caused a relaxation of gall bladder muscle strips. Similarly, the SNP-induced contraction was converted to a relaxation, and associated hyperpolarization, when DTT was added during the steady state of an SNP-induced response. SIN-1 (0.1 mm), which has been shown to release peroxynitrite, induced relaxing effects that were enhanced by superoxide dismutase (SOD, 50 U ml−1). The relaxations induced by either SIN-1 alone or SIN-1 in the presence of SOD were strengthened by catalase (1000 U ml−1) and abolished by ODQ pretreatment. However, exogenous peroxynitrite induced a concentration-dependent contraction, which was dependent on activation of leukotriene (LT) metabolism and extracellular calcium. The peroxynitrite-induced contraction was abolished in the presence of the peroxynitrite scavenger melatonin. These results suggest that SIN-1 behaves as an NO. rather than a peroxynitrite source. We conclude that, depending on the redox state, NO has opposing effects on the motility of the gall bladder, being a relaxing agent when in NO. form and a contracting agent when in NO+ or peroxynitrite redox species form. Knowledge of the contrasting effects of the different redox forms of NO can clarify our understanding of the effects of NO donors on gall bladder and other smooth muscle cell types. PMID:11313447
The pig genome project has plenty to squeal about.
Fan, B; Gorbach, D M; Rothschild, M F
2011-01-01
Significant progress on pig genetics and genomics research has been witnessed in recent years due to the integration of advanced molecular biology techniques, bioinformatics and computational biology, and the collaborative efforts of researchers in the swine genomics community. Progress on expanding the linkage map has slowed down, but the efforts have created a higher-resolution physical map integrating the clone map and BAC end sequence. The number of QTL mapped is still growing and most of the updated QTL mapping results are available through PigQTLdb. Additionally, expression studies using high-throughput microarrays and other gene expression techniques have made significant advancements. The number of identified non-coding RNAs is rapidly increasing and their exact regulatory functions are being explored. A publishable draft (build 10) of the swine genome sequence was available for the pig genomics community by the end of December 2010. Build 9 of the porcine genome is currently available with Ensembl annotation; manual annotation is ongoing. These drafts provide useful tools for such endeavors as comparative genomics and SNP scans for fine QTL mapping. A recent community-wide effort to create a 60K porcine SNP chip has greatly facilitated whole-genome association analyses, haplotype block construction and linkage disequilibrium mapping, which can contribute to whole-genome selection. The future 'systems biology' that integrates and optimizes the information from all research levels can enhance the pig community's understanding of the full complexity of the porcine genome. These recent technological advances and where they may lead are reviewed. Copyright © 2011 S. Karger AG, Basel.
Szijan, Irene; Rochefort, Daniel; Bruder, Carl; Surace, Ezequiel; Machiavelli, Gloria; Dalamon, Viviana; Cotignola, Javier; Ferreiro, Veronica; Campero, Alvaro; Basso, Armando; Dumanski, Jan P; Rouleau, Guy A
2003-01-01
The NF2 tumor suppressor gene, located in chromosome 22q12, is involved in the development of multiple tumors of the nervous system, either associated with neurofibromatosis 2 or sporadic ones, mainly schwannomas and meningiomas. In order to evaluate the role of the NF2 gene in sporadic central nervous system (CNS) tumors, we analyzed NF2 mutations in 26 specimens: 14 meningiomas, 4 schwannomas, 4 metastases, and 4 other histopathological types of neoplasms. Denaturing high performance liquid chromatography (denaturing HPLC) and comparative genomic hybridization on a DNA microarray (microarray- CGH) were used as scanning methods for small mutations and gross rearrangements respectively. Small mutations were identified in six out of seventeen meningiomas and schwannomas, one mutation was novel. Large deletions were detected in six meningiomas. All mutations were predicted to result in truncated protein or in the absence of a large protein domain. No NF2 mutations were found in other histopathological types of CNS tumors. These results provide additional evidence that mutations in the NF2 gene play an important role in the development of sporadic meningiomas and schwannomas. Denaturing HPLC analysis of small mutations and microarray-CGH of large deletions are complementary, fast, and efficient methods for the detection of mutations in tumor tissues.
Xiao, Shan; Zeng, Xiaoyun; Quan, Li; Zhu, Jun
2015-01-01
To explore the correlation between FTO (fat mass and obesity associated) gene, which is associated with 3 single nucleotide polymorphisms (SNP) of fat mass and obesity, type 2 diabetes and body mass index (BMI) in the Uygur population in northwest China. A total of 849 Uygur patients with type 2 diabetes mellitus were selected from the hospitalized patients in the First Affiliated Hospital of Xinjiang Medical University, the First People's Hospital of Kashi and the hospitals in the Turpan areas. At the same time, 873 cases of healthy persons who conducted a medical checkup in the physical examination centre of the above hospitals were enrolled as controls. The present investigation used the case-control research method, and physical examination and biochemical index determination were carried out. The Sequenom MassARRAY technology was employed in the detection of 3 SNP loci of the FTO gene. The representative population of each SNP in the control group was analyzed by Hardy-Weinberg law. The differences of each clinical parameter in the two groups were analyzed by t-test analysis. The differences of genotype and allele of each SNP in the two groups were analyzed by χ(2) test. BMI, waistline (WL), systolic blood pressure (SBP), diastolic blood pressure (DBP), fasting blood glucose (FBG), total cholesterol (TC), aspartate aminotransferase (AST) and alanine aminotransferase (ALT) in the type 2 diabetes group were higher than those in the control group, while the high density lipoprotein (HDL) and low density lipoprotein (LDL) were lower than those of the control group; 2. The allele frequency of A of rs8050136 and rs9939609 in the type 2 diabetes mellitus group was higher than that of the control group. The BMI of the whole population and type 2 diabetes group with genotype C/A+A/A of rs8050136 was higher than that in C/C group, and the BMI with genotype T/A+A/A of rs9939609 was higher than that in group T/T. Stratification was conducted on BMI according to the normal, overweight and obesity criteria. There were significant differences in the distribution of genotype frequency of rs9939609 in the type 2 diabetes group and the control group of the normal BMI group. Single nucleotide mutation of rs7195539 in FTO gene may be a protective factor against the Uygur type 2 diabetes. Single nucleotide mutations of rs8050136 and rs9939609 may be associated with the Uygur type 2 diabetes and obesity, with A as a potential risk allele. The gene polymorphism of rs8050136 may correlate with type 2 diabetes mellitus through the function of BMI, while the correlation between rs9939609 gene polymorphism and type 2 diabetes is not depending from BMI.
Sun, Mingjun; Jing, Zhigang; Di, Dongdong; Yan, Hao; Zhang, Zhicheng; Xu, Quangang; Zhang, Xiyue; Wang, Xun; Ni, Bo; Sun, Xiangxiang; Yan, Chengxu; Yang, Zhen; Tian, Lili; Li, Jinping; Fan, Weixing
2017-01-01
Brucellosis is a worldwide zoonotic disease caused by Brucella spp. In China, brucellosis is recognized as a reemerging disease mainly caused by Brucella melitensis specie. To better understand the currently endemic B. melitensis strains in China, three Brucella genotyping methods were applied to 110 B. melitensis strains obtained in past several years. By MLVA genotyping, five MLVA-8 genotypes were identified, among which genotypes 42 (1-5-3-13-2-2-3-2) was recognized as the predominant genotype, while genotype 63 (1-5-3-13-2-3-3-2) and a novel genotype of 1-5-3-13-2-4-3-2 were second frequently observed. MLVA-16 discerned a total of 57 MLVA-16 genotypes among these Brucella strains, with 41 genotypes being firstly detected and the other 16 genotypes being previously reported. By BruMLSA21 typing, six sequence types (STs) were identified, among them ST8 is the most frequently seen in China while the other five STs were firstly detected and designated as ST137, ST138, ST139, ST140, and ST141 by international multilocus sequence typing database. Whole-genome sequence (WGS)-single-nucleotide polymorphism (SNP)-based typing and phylogenetic analysis resolved Chinese B. melitensis strains into five clusters, reflecting the existence of multiple lineages among these Chinese B. melitensis strains. In phylogeny, Chinese lineages are more closely related to strains collected from East Mediterranean and Middle East countries, such as Turkey, Kuwait, and Iraq. In the next few years, MLVA typing will certainly remain an important epidemiological tool for Brucella infection analysis, as it displays a high discriminatory ability and achieves result largely in agreement with WGS-SNP-based typing. However, WGS-SNP-based typing is found to be the most powerful and reliable method in discerning Brucella strains and will be popular used in the future.
Comparative study of classification algorithms for immunosignaturing data
2012-01-01
Background High-throughput technologies such as DNA, RNA, protein, antibody and peptide microarrays are often used to examine differences across drug treatments, diseases, transgenic animals, and others. Typically one trains a classification system by gathering large amounts of probe-level data, selecting informative features, and classifies test samples using a small number of features. As new microarrays are invented, classification systems that worked well for other array types may not be ideal. Expression microarrays, arguably one of the most prevalent array types, have been used for years to help develop classification algorithms. Many biological assumptions are built into classifiers that were designed for these types of data. One of the more problematic is the assumption of independence, both at the probe level and again at the biological level. Probes for RNA transcripts are designed to bind single transcripts. At the biological level, many genes have dependencies across transcriptional pathways where co-regulation of transcriptional units may make many genes appear as being completely dependent. Thus, algorithms that perform well for gene expression data may not be suitable when other technologies with different binding characteristics exist. The immunosignaturing microarray is based on complex mixtures of antibodies binding to arrays of random sequence peptides. It relies on many-to-many binding of antibodies to the random sequence peptides. Each peptide can bind multiple antibodies and each antibody can bind multiple peptides. This technology has been shown to be highly reproducible and appears promising for diagnosing a variety of disease states. However, it is not clear what is the optimal classification algorithm for analyzing this new type of data. Results We characterized several classification algorithms to analyze immunosignaturing data. We selected several datasets that range from easy to difficult to classify, from simple monoclonal binding to complex binding patterns in asthma patients. We then classified the biological samples using 17 different classification algorithms. Using a wide variety of assessment criteria, we found ‘Naïve Bayes’ far more useful than other widely used methods due to its simplicity, robustness, speed and accuracy. Conclusions ‘Naïve Bayes’ algorithm appears to accommodate the complex patterns hidden within multilayered immunosignaturing microarray data due to its fundamental mathematical properties. PMID:22720696
Hu, Jian; Zhou, Yi-ren; Ding, Jia-lin; Wang, Zhi-yuan; Liu, Ling; Wang, Ye-kai; Lou, Hui-ling; Qiao, Shou-yi; Wu, Yan-hua
2017-05-20
The ABO blood type is one of the most common and widely used genetic traits in humans. Three glycosyltransferase-encoding gene alleles, I A , I B and i, produce three red blood cell surface antigens, by which the ABO blood type is classified. By using the ABO blood type experiment as an ideal case for genetics teaching, we can easily introduce to the students several genetic concepts, including multiple alleles, gene interaction, single nucleotide polymorphism (SNP) and gene evolution. Herein we have innovated and integrated our ABO blood type genetics experiments. First, in the section of Molecular Genetics, a new method of ABO blood genotyping was established: specific primers based on SNP sites were designed to distinguish three alleles through quantitative real-time PCR. Next, the experimental teaching method of Gene Evolution was innovated in the Population Genetics section: a gene-evolution software was developed to simulate the evolutionary tendency of the ABO genotype encoding alleles under diverse conditions. Our reform aims to extend the contents of genetics experiments, to provide additional teaching approaches, and to improve the learning efficiency of our students eventually.
[Possible role of genetic factors on reduced risk for gastric cancer among duodenal ulcer patients].
Matsuda, Koichi; Tanikawa, Chizu; Nakamura, Yusuke
2013-08-01
Although H. pylori causes both gastric cancer and peptic ulcer, duodenal ulcer patients were known to have low risk for gastric cancer. Recently the association of PSCA and ABO with duodenal ulcer were identified by GWAS in the Japanese population. A T-allele of SNP rs2294008 in the PSCA promoter creates the upstream translational initiation codon and affects the protein localization from cytoplasm to cell surface. A T-allele of SNP rs2294008 increased gastric cancer risk but reduced duodenal ulcer risk. In addition, blood type O was shown to increase risk for duodenal ulcer, while blood type A was associated with gastric cancer risk in the Caucasian population. Our finding would partially explain low risk of gastric cancer among duodenal ulcer patients.
2012-01-01
Background Mycobacterium avium subspecies paratuberculosis (Map) is the aetiological agent of Johne’s disease or paratuberculosis and is included within the Mycobacterium avium complex (MAC). Map strains are of two major types often referred to as ‘Sheep’ or ‘S-type’ and ‘Cattle’ or ‘C-type’. With the advent of more discriminatory typing techniques it has been possible to further classify the S-type strains into two groups referred to as Type I and Type III. This study was undertaken to genotype a large panel of S-type small ruminant isolates from different hosts and geographical origins and to compare them with a large panel of well documented C-type isolates to assess the genetic diversity of these strain types. Methods used included Mycobacterial Interspersed Repetitive Units - Variable-Number Tandem Repeat analysis (MIRU-VNTR), analysis of Large Sequence Polymorphisms by PCR (LSP analysis), Single Nucleotide Polymorphism (SNP) analysis of gyr genes, Pulsed-Field Gel Electrophoresis (PFGE) and Restriction Fragment Length Polymorphism analysis coupled with hybridization to IS900 (IS900-RFLP) analysis. Results The presence of LSPA4 and absence of LSPA20 was confirmed in all 24 Map S-type strains analysed. SNPs within the gyr genes divided the S-type strains into types I and III. Twenty four PFGE multiplex profiles and eleven different IS900-RFLP profiles were identified among the S-type isolates, some of them not previously published. Both PFGE and IS900-RFLP segregated the S-type strains into types I and III and the results concurred with those of the gyr SNP analysis. Nine MIRU-VNTR genotypes were identified in these isolates. MIRU-VNTR analysis differentiated Map strains from other members of Mycobacterium avium Complex, and Map S-type from C-type but not type I from III. Pigmented Map isolates were found of type I or III. Conclusion This is the largest panel of S-type strains investigated to date. The S-type strains could be further divided into two subtypes, I and III by some of the typing techniques (IS900-RFLP, PFGE and SNP analysis of the gyr genes). MIRU-VNTR did not divide the strains into the subtypes I and III but did detect genetic differences between isolates within each of the subtypes. Pigmentation is not exclusively associated with type I strains. PMID:23164429
Ryynänen, Heikki J; Primmer, Craig R
2006-01-01
Background Single nucleotide polymorphisms (SNPs) represent the most abundant type of DNA variation in the vertebrate genome, and their applications as genetic markers in numerous studies of molecular ecology and conservation of natural populations are emerging. Recent large-scale sequencing projects in several fish species have provided a vast amount of data in public databases, which can be utilized in novel SNP discovery in salmonids. However, the suggested duplicated nature of the salmonid genome may hamper SNP characterization if the primers designed in conserved gene regions amplify multiple loci. Results Here we introduce a new intron-primed exon-crossing (IPEC) method in an attempt to overcome this duplication problem, and also evaluate different priming methods for SNP discovery in Atlantic salmon (Salmo salar) and other salmonids. A total of 69 loci with differing priming strategies were screened in S. salar, and 27 of these produced ~13 kb of high-quality sequence data consisting of 19 SNPs or indels (one per 680 bp). The SNP frequency and the overall nucleotide diversity (3.99 × 10-4) in S. salar was lower than reported in a majority of other organisms, which may suggest a relative young population history for Atlantic salmon. A subset of primers used in cross-species analyses revealed considerable variation in the SNP frequencies and nucleotide diversities in other salmonids. Conclusion Sequencing success was significantly higher with the new IPEC primers; thus the total number of loci to screen in order to identify one potential polymorphic site was six times less with this new strategy. Given that duplication may hamper SNP discovery in some species, the IPEC method reported here is an alternative way of identifying novel polymorphisms in such cases. PMID:16872523
Wang, Yi-Ting; Sung, Pei-Yuan; Lin, Peng-Lin; Yu, Ya-Wen; Chung, Ren-Hua
2015-05-15
Genome-wide association studies (GWAS) have become a common approach to identifying single nucleotide polymorphisms (SNPs) associated with complex diseases. As complex diseases are caused by the joint effects of multiple genes, while the effect of individual gene or SNP is modest, a method considering the joint effects of multiple SNPs can be more powerful than testing individual SNPs. The multi-SNP analysis aims to test association based on a SNP set, usually defined based on biological knowledge such as gene or pathway, which may contain only a portion of SNPs with effects on the disease. Therefore, a challenge for the multi-SNP analysis is how to effectively select a subset of SNPs with promising association signals from the SNP set. We developed the Optimal P-value Threshold Pedigree Disequilibrium Test (OPTPDT). The OPTPDT uses general nuclear families. A variable p-value threshold algorithm is used to determine an optimal p-value threshold for selecting a subset of SNPs. A permutation procedure is used to assess the significance of the test. We used simulations to verify that the OPTPDT has correct type I error rates. Our power studies showed that the OPTPDT can be more powerful than the set-based test in PLINK, the multi-SNP FBAT test, and the p-value based test GATES. We applied the OPTPDT to a family-based autism GWAS dataset for gene-based association analysis and identified MACROD2-AS1 with genome-wide significance (p-value=2.5×10(-6)). Our simulation results suggested that the OPTPDT is a valid and powerful test. The OPTPDT will be helpful for gene-based or pathway association analysis. The method is ideal for the secondary analysis of existing GWAS datasets, which may identify a set of SNPs with joint effects on the disease.
Diversity analysis of cotton (Gossypium hirsutum L.) germplasm using the CottonSNP63K Array.
Hinze, Lori L; Hulse-Kemp, Amanda M; Wilson, Iain W; Zhu, Qian-Hao; Llewellyn, Danny J; Taylor, Jen M; Spriggs, Andrew; Fang, David D; Ulloa, Mauricio; Burke, John J; Giband, Marc; Lacape, Jean-Marc; Van Deynze, Allen; Udall, Joshua A; Scheffler, Jodi A; Hague, Steve; Wendel, Jonathan F; Pepper, Alan E; Frelichowski, James; Lawley, Cindy T; Jones, Don C; Percy, Richard G; Stelly, David M
2017-02-03
Cotton germplasm resources contain beneficial alleles that can be exploited to develop germplasm adapted to emerging environmental and climate conditions. Accessions and lines have traditionally been characterized based on phenotypes, but phenotypic profiles are limited by the cost, time, and space required to make visual observations and measurements. With advances in molecular genetic methods, genotypic profiles are increasingly able to identify differences among accessions due to the larger number of genetic markers that can be measured. A combination of both methods would greatly enhance our ability to characterize germplasm resources. Recent efforts have culminated in the identification of sufficient SNP markers to establish high-throughput genotyping systems, such as the CottonSNP63K array, which enables a researcher to efficiently analyze large numbers of SNP markers and obtain highly repeatable results. In the current investigation, we have utilized the SNP array for analyzing genetic diversity primarily among cotton cultivars, making comparisons to SSR-based phylogenetic analyses, and identifying loci associated with seed nutritional traits. The SNP markers distinctly separated G. hirsutum from other Gossypium species and distinguished the wild from cultivated types of G. hirsutum. The markers also efficiently discerned differences among cultivars, which was the primary goal when designing the CottonSNP63K array. Population structure within the genus compared favorably with previous results obtained using SSR markers, and an association study identified loci linked to factors that affect cottonseed protein content. Our results provide a large genome-wide variation data set for primarily cultivated cotton. Thousands of SNPs in representative cotton genotypes provide an opportunity to finely discriminate among cultivated cotton from around the world. The SNPs will be relevant as dense markers of genome variation for association mapping approaches aimed at correlating molecular polymorphisms with variation in phenotypic traits, as well as for molecular breeding approaches in cotton.
Cingolani, Pablo; Patel, Viral M.; Coon, Melissa; Nguyen, Tung; Land, Susan J.; Ruden, Douglas M.; Lu, Xiangyi
2012-01-01
This paper describes a new program SnpSift for filtering differential DNA sequence variants between two or more experimental genomes after genotoxic chemical exposure. Here, we illustrate how SnpSift can be used to identify candidate phenotype-relevant variants including single nucleotide polymorphisms, multiple nucleotide polymorphisms, insertions, and deletions (InDels) in mutant strains isolated from genome-wide chemical mutagenesis of Drosophila melanogaster. First, the genomes of two independently isolated mutant fly strains that are allelic for a novel recessive male-sterile locus generated by genotoxic chemical exposure were sequenced using the Illumina next-generation DNA sequencer to obtain 20- to 29-fold coverage of the euchromatic sequences. The sequencing reads were processed and variants were called using standard bioinformatic tools. Next, SnpEff was used to annotate all sequence variants and their potential mutational effects on associated genes. Then, SnpSift was used to filter and select differential variants that potentially disrupt a common gene in the two allelic mutant strains. The potential causative DNA lesions were partially validated by capillary sequencing of polymerase chain reaction-amplified DNA in the genetic interval as defined by meiotic mapping and deletions that remove defined regions of the chromosome. Of the five candidate genes located in the genetic interval, the Pka-like gene CG12069 was found to carry a separate pre-mature stop codon mutation in each of the two allelic mutants whereas the other four candidate genes within the interval have wild-type sequences. The Pka-like gene is therefore a strong candidate gene for the male-sterile locus. These results demonstrate that combining SnpEff and SnpSift can expedite the identification of candidate phenotype-causative mutations in chemically mutagenized Drosophila strains. This technique can also be used to characterize the variety of mutations generated by genotoxic chemicals. PMID:22435069
Presence of Mycobacterium leprae genotype 4 in environmental waters in Northeast Brazil.
Holanda, Maísa Viana de; Marques, Livia Erika Carlos; Macedo, Maria Luisa Bezerra de; Pontes, Maria Araci de Andrade; Sabadia, José Antonio Beltrão; Kerr, Ligia Regina Franco Sansigolo; Almeida, Rosa Lívia Freitas; Frota, Cristiane Cunha
2017-01-01
This study quantified Mycobacterium leprae bacilli in environmental water samples from five municipalities in the State of Ceará by quantitative polymerase chain reaction (qPCR) and compared the identified genotypes with those obtained from leprosy patient biopsies. We collected five replicas from each of the 30 selected reservoirs and skin lesion biopsies from 25 new leprosy cases treated at a reference center in Fortaleza, Ceará from 2010 to 2013. The 16S rRNA gene region of M. leprae was amplified by qPCR and a standard curve was created with the pIDTBlue 16SrRNAMlep plasmid. The Juazeiro do Norte water samples and the biopsies were genotyped (single nucleotide polymorphism [SNP] 1 to 4) and the SNP 4 genotypes were subtyped. Of the 149 water samples analyzed, 54.4% were positive for the M. leprae DNA. The M. leprae bacilli copy number ranged from 1.42 × 10 -1 to 1.44 × 10 + 2 . Most biopsies showed SNP type 4 (64%), while all samples from Juazeiro do Norte were SNP type 4, with subtype 4-N appearing at the highest frequency. We suggest that environmental waters containing M. leprae bacilli play an important role in disease transmission, justifying PGL-1 seropositivity in individuals living in areas where there is no reported case, and in leprosy cases individuals who report no previous contact with other case. Therefore, further investigation is needed to clarify disease transmission in this region and to explore the role of the environment. We also suggest that in this area surveillance for leprosy cases should be intensified.
Nair, Sethu C; Pattaradilokrat, Sittiporn; Zilversmit, Martine M; Dommer, Jennifer; Nagarajan, Vijayaraj; Stephens, Melissa T; Xiao, Wenming; Tan, John C; Su, Xin-Zhuan
2014-01-01
The rodent malaria parasite Plasmodium yoelii is an important model for studying malaria immunity and pathogenesis. One approach for studying malaria disease phenotypes is genetic mapping, which requires typing a large number of genetic markers from multiple parasite strains and/or progeny from genetic crosses. Hundreds of microsatellite (MS) markers have been developed to genotype the P. yoelii genome; however, typing a large number of MS markers can be labor intensive, time consuming, and expensive. Thus, development of high-throughput genotyping tools such as DNA microarrays that enable rapid and accurate large-scale genotyping of the malaria parasite will be highly desirable. In this study, we sequenced the genomes of two P. yoelii strains (33X and N67) and obtained a large number of single nucleotide polymorphisms (SNPs). Based on the SNPs obtained, we designed sets of oligonucleotide probes to develop a microarray that could interrogate ∼11,000 SNPs across the 14 chromosomes of the parasite in a single hybridization. Results from hybridizations of DNA samples of five P. yoelii strains or cloned lines (17XNL, YM, 33X, N67 and N67C) and two progeny from a genetic cross (N67×17XNL) to the microarray showed that the array had a high call rate (∼97%) and accuracy (99.9%) in calling SNPs, providing a simple and reliable tool for typing the P. yoelii genome. Our data show that the P. yoelii genome is highly polymorphic, although isogenic pairs of parasites were also detected. Additionally, our results indicate that the 33X parasite is a progeny of 17XNL (or YM) and an unknown parasite. The highly accurate and reliable microarray developed in this study will greatly facilitate our ability to study the genetic basis of important traits and the disease it causes. Published by Elsevier B.V.
Expression profiling and pathway analysis of Krüppel-like factor 4 in mouse embryonic fibroblasts
Hagos, Engda G; Ghaleb, Amr M; Kumar, Amrita; Neish, Andrew S; Yang, Vincent W
2011-01-01
Background: Krüppel-like factor 4 (KLF4) is a zinc-finger transcription factor with diverse regulatory functions in proliferation, differentiation, and development. KLF4 also plays a role in inflammation, tumorigenesis, and reprogramming of somatic cells to induced pluripotent stem (iPS) cells. To gain insight into the mechanisms by which KLF4 regulates these processes, we conducted DNA microarray analyses to identify differentially expressed genes in mouse embryonic fibroblasts (MEFs) wild type and null for Klf4. Methods: Expression profiles of fibroblasts isolated from mouse embryos wild type or null for the Klf4 alleles were examined by DNA microarrays. Differentially expressed genes were subjected to the Database for Annotation, Visualization and Integrated Discovery (DAVID). The microarray data were also interrogated with the Ingenuity Pathway Analysis (IPA) and Gene Set Enrichment Analysis (GSEA) for pathway identification. Results obtained from the microarray analysis were confirmed by Western blotting for select genes with biological relevance to determine the correlation between mRNA and protein levels. Results: One hundred and sixty three up-regulated and 88 down-regulated genes were identified that demonstrated a fold-change of at least 1.5 and a P-value < 0.05 in Klf4-null MEFs compared to wild type MEFs. Many of the up-regulated genes in Klf4-null MEFs encode proto-oncogenes, growth factors, extracellular matrix, and cell cycle activators. In contrast, genes encoding tumor suppressors and those involved in JAK-STAT signaling pathways are down-regulated in Klf4-null MEFs. IPA and GSEA also identified various pathways that are regulated by KLF4. Lastly, Western blotting of select target genes confirmed the changes revealed by microarray data. Conclusions: These data are not only consistent with previous functional studies of KLF4's role in tumor suppression and somatic cell reprogramming, but also revealed novel target genes that mediate KLF4's functions. PMID:21892412
Yuen, Peter S.T.; Jo, Sang-Kyung; Holly, Mikaela K.; Hu, Xuzhen; Star, Robert A.
2006-01-01
Acute renal failure (ARF) has a high morbidity and mortality. In animal ARF models, effective treatments must be administered before or shortly after the insult, limiting their clinical potential. We used microarrays to identify early biomarkers that distinguish ischemic from nephrotoxic ARF, or biomarkers that detect both injury types. We compared rat kidney transcriptomes 2 and 8 hours after ischemia/reperfusion and after mercuric chloride. Quality control and statistical analyses were necessary to normalize microarrays from different lots, eliminate outliers, and exclude unaltered genes. Principal component analysis revealed distinct ischemic and nephrotoxic trajectories, and clear array groupings. Therefore, we used supervised analysis, t-tests and fold changes, to compile gene lists for each group, exclusive or non-exclusive, alone or in combination. There was little network connectivity, even in the largest group. Some microarray-identified genes were validated by TaqMan assay, ruling out artifacts. Western blotting confirmed that HO-1 and ATF3 proteins were upregulated; however, unexpectedly, their localization changed within the kidney. HO-1 staining shifted from cortical (early) to outer stripe of the outer medulla (late), primarily in detaching cells, after mercuric chloride, but not ischemia/reperfusion. ATF3 staining was similar, but with additional early transient expression in the outer stripe after ischemia/reperfusion. We conclude that microarray-identified genes must be evaluated not only for protein levels, but also for anatomical distribution among different zones, nephron segments, or cell types. Although protein detection reagents are limited, microarray data lay a rich foundation to explore biomarkers, therapeutics, and pathophysiology of ARF. PMID:16507785
Gene set analysis approaches for RNA-seq data: performance evaluation and application guideline
Rahmatallah, Yasir; Emmert-Streib, Frank
2016-01-01
Transcriptome sequencing (RNA-seq) is gradually replacing microarrays for high-throughput studies of gene expression. The main challenge of analyzing microarray data is not in finding differentially expressed genes, but in gaining insights into the biological processes underlying phenotypic differences. To interpret experimental results from microarrays, gene set analysis (GSA) has become the method of choice, in particular because it incorporates pre-existing biological knowledge (in a form of functionally related gene sets) into the analysis. Here we provide a brief review of several statistically different GSA approaches (competitive and self-contained) that can be adapted from microarrays practice as well as those specifically designed for RNA-seq. We evaluate their performance (in terms of Type I error rate, power, robustness to the sample size and heterogeneity, as well as the sensitivity to different types of selection biases) on simulated and real RNA-seq data. Not surprisingly, the performance of various GSA approaches depends only on the statistical hypothesis they test and does not depend on whether the test was developed for microarrays or RNA-seq data. Interestingly, we found that competitive methods have lower power as well as robustness to the samples heterogeneity than self-contained methods, leading to poor results reproducibility. We also found that the power of unsupervised competitive methods depends on the balance between up- and down-regulated genes in tested gene sets. These properties of competitive methods have been overlooked before. Our evaluation provides a concise guideline for selecting GSA approaches, best performing under particular experimental settings in the context of RNA-seq. PMID:26342128
Genomic diversity in Mycobacterium leprae isolates from leprosy cases in South India.
Das, Madhusmita; Chaitanya, V Sundeep; Kanmani, K; Rajan, Lakshmi; Ebenezer, Mannam
2016-11-01
The Objective of this study was to identify the strain diversity of Mycobacterium leprae in terms of SNP types and subtypes stratified as per genomic single nucleotide polymorphisms, in clinical isolates of leprosy patients from a tertiary care leprosy center in South India. Further, the associations of SNP types with clinical outcomes in leprosy were also investigated. DNA was extracted from excisional skin biopsies of a total of 172 newly diagnosed untreated leprosy patients from a clinic in Tamil Nadu, in south India, that also serves patients from neighboring states. All the leprosy patients were those who voluntarily reported at the clinic during the study period of one year i.e., 2015. Clinical and histopathological details were collected at diagnosis and leprosy was confirmed through bacteriological smear examination and PCR for M. leprae specific RLEP region. SNP types and subtypes were determined by PCR amplification and Sanger sequencing of PCR products. M. leprae specific RLEP gene amplification was achieved in 160 out of 172 patients. Among 160 specimens 118(73.75%) were type 1 and 42 (26.25%) were type 2 and on subtyping it was noted that 88/160 (55.00%) were 1D, 25/160 (15.62%) 1C, 5/160 (3.12%) 1A, 33/160 (20.62%) 2G and 9/160 (5.62%) were 2H. Our results indicated that subtype 1D is predominant in the south Indian population. We also noted 2G, 1C and 1A in the patient sample tested. Additionally we identified subtype 2H for the first time in India. Copyright © 2016. Published by Elsevier B.V.
Barton, G; Abbott, J; Chiba, N; Huang, DW; Huang, Y; Krznaric, M; Mack-Smith, J; Saleem, A; Sherman, BT; Tiwari, B; Tomlinson, C; Aitman, T; Darlington, J; Game, L; Sternberg, MJE; Butcher, SA
2008-01-01
Background Microarray experimentation requires the application of complex analysis methods as well as the use of non-trivial computer technologies to manage the resultant large data sets. This, together with the proliferation of tools and techniques for microarray data analysis, makes it very challenging for a laboratory scientist to keep up-to-date with the latest developments in this field. Our aim was to develop a distributed e-support system for microarray data analysis and management. Results EMAAS (Extensible MicroArray Analysis System) is a multi-user rich internet application (RIA) providing simple, robust access to up-to-date resources for microarray data storage and analysis, combined with integrated tools to optimise real time user support and training. The system leverages the power of distributed computing to perform microarray analyses, and provides seamless access to resources located at various remote facilities. The EMAAS framework allows users to import microarray data from several sources to an underlying database, to pre-process, quality assess and analyse the data, to perform functional analyses, and to track data analysis steps, all through a single easy to use web portal. This interface offers distance support to users both in the form of video tutorials and via live screen feeds using the web conferencing tool EVO. A number of analysis packages, including R-Bioconductor and Affymetrix Power Tools have been integrated on the server side and are available programmatically through the Postgres-PLR library or on grid compute clusters. Integrated distributed resources include the functional annotation tool DAVID, GeneCards and the microarray data repositories GEO, CELSIUS and MiMiR. EMAAS currently supports analysis of Affymetrix 3' and Exon expression arrays, and the system is extensible to cater for other microarray and transcriptomic platforms. Conclusion EMAAS enables users to track and perform microarray data management and analysis tasks through a single easy-to-use web application. The system architecture is flexible and scalable to allow new array types, analysis algorithms and tools to be added with relative ease and to cope with large increases in data volume. PMID:19032776
Cappola, Thomas P; Matkovich, Scot J; Wang, Wei; van Booven, Derek; Li, Mingyao; Wang, Xuexia; Qu, Liming; Sweitzer, Nancy K; Fang, James C; Reilly, Muredach P; Hakonarson, Hakon; Nerbonne, Jeanne M; Dorn, Gerald W
2011-02-08
Common heart failure has a strong undefined heritable component. Two recent independent cardiovascular SNP array studies identified a common SNP at 1p36 in intron 2 of the HSPB7 gene as being associated with heart failure. HSPB7 resequencing identified other risk alleles but no functional gene variants. Here, we further show no effect of the HSPB7 SNP on cardiac HSPB7 mRNA levels or splicing, suggesting that the SNP marks the position of a functional variant in another gene. Accordingly, we used massively parallel platforms to resequence all coding exons of the adjacent CLCNKA gene, which encodes the K(a) renal chloride channel (ClC-K(a)). Of 51 exonic CLCNKA variants identified, one SNP (rs10927887, encoding Arg83Gly) was common, in linkage disequilibrium with the heart failure risk SNP in HSPB7, and associated with heart failure in two independent Caucasian referral populations (n = 2,606 and 1,168; combined P = 2.25 × 10(-6)). Individual genotyping of rs10927887 in the two study populations and a third independent heart failure cohort (combined n = 5,489) revealed an additive allele effect on heart failure risk that is independent of age, sex, and prior hypertension (odds ratio = 1.27 per allele copy; P = 8.3 × 10(-7)). Functional characterization of recombinant wild-type Arg83 and variant Gly83 ClC-K(a) chloride channel currents revealed ≈ 50% loss-of-function of the variant channel. These findings identify a common, functionally significant genetic risk factor for Caucasian heart failure. The variant CLCNKA risk allele, telegraphed by linked variants in the adjacent HSPB7 gene, uncovers a previously overlooked genetic mechanism affecting the cardio-renal axis.
Kongchum, Pawapol; Palti, Yniv; Hallerman, Eric M; Hulata, Gideon; David, Lior
2010-08-01
Single nucleotide polymorphisms (SNPs) in immune response genes have been reported as markers for susceptibility to infectious diseases in human and livestock. A disease caused by cyprinid herpesvirus 3 (CyHV-3) is highly contagious and virulent in common carp (Cyprinus carpio). With the aim to develop molecular tools for breeding CyHV-3-resistant carp, we have amplified and sequenced 11 candidate genes for viral disease resistance including TLR2, TLR3, TLR4ba, TLR7, TLR9, TLR21, TLR22, MyD88, TRAF6, type I IFN and IL-1beta. For each gene, we initially cloned and sequenced PCR amplicons from 8 to 12 fish (2-3 fish per strain) from the SNP discovery panel. We then identified and evaluated putative SNPs for their polymorphisms in the SNP discovery panel and validated their usefulness for linkage analysis in a full-sib family using the SNaPshot method. Our sequencing results and phylogenetic analyses suggested that TLR3, TLR7 and MyD88 genes are duplicated in the common carp genome. We, therefore, developed locus-specific PCR primers and SNP genotyping assays for the duplicated loci. A total of 48 SNP markers were developed from PCR fragments of the 13 loci (7 single-locus and 3 duplicated genes). Thirty-nine markers were polymorphic with estimated minor allele frequencies of more than 0.1. The utility of the SNP markers was evaluated in one full-sib family and revealed that 20 markers from 9 loci segregated in a disomic and Mendelian pattern and would be useful for linkage analysis. Published by Elsevier Ltd.
Bath, Kevin G.; Chuang, Jocelyn; Spencer-Segal, Joanna L.; Amso, Dima; Altemus, Margaret; McEwen, Bruce S.; Lee, Francis S.
2012-01-01
Background Most anxiety and depressive disorders are twice as common in women compared to men and the sex difference in prevalence typically emerges during adolescence. Hormonal changes across the menstrual cycle and during the postpartum and peri-menopausal periods are associated with increased risk for anxiety and depression symptoms. In humans and animals, reduced brain derived neurotrophic factor (BDNF) has been associated with increased expression of affective pathology. Recently, a single nucleotide polymorphism (SNP) in the BDNF gene (BDNF Val66Met), which reduces BDNF bioavailability, has been identified in humans and associated with a variety of neuropsychiatric disorders. Although BDNF expression can be directly influenced by estrogen and progesterone, the potential impact of the BDNF Val66Met SNP on sensitivity to reproductive hormone changes remains an open question. Approach As a predictive model, we used female mice in which the human SNP (BDNF Val66Met) was inserted into the mouse BDNF gene. Using standard behavioral paradigms, we tested the impact of this SNP on age and estrous-cycle specific expression of anxiety-like behaviors. Results Mice homozygous for the BDNF Val66Met SNP begin to exhibit increased anxiety-like behaviors over prepubertal and early adult development, show significant fluctuations in anxiety-like behaviors over the estrous cycle, and as adults differ from wild-type mice by showing significant fluctuations in anxiety-like behaviors over the estrous cycle, specifically more anxiety-like behaviors during the estrus phase. Conclusions These findings have implications regarding the potential role of this SNP in contributing to developmental and reproductive hormone-dependent changes in affective disorders in humans. PMID:22552045
Martínez-García, Pedro J; Fresnedo-Ramírez, Jonathan; Parfitt, Dan E; Gradziel, Thomas M; Crisosto, Carlos H
2013-01-01
Single nucleotide polymorphisms (SNPs) are a fundamental source of genomic variation. Large SNP panels have been developed for Prunus species. Fruit quality traits are essential peach breeding program objectives since they determine consumer acceptance, fruit consumption, industry trends and cultivar adoption. For many cultivars, these traits are negatively impacted by cold storage, used to extend fruit market life. The major symptoms of chilling injury are lack of flavor, off flavor, mealiness, flesh browning, and flesh bleeding. A set of 1,109 SNPs was mapped previously and 67 were linked with these complex traits. The prediction of the effects associated with these SNPs on downstream products from the 'peach v1.0' genome sequence was carried out. A total of 2,163 effects were detected, 282 effects (non-synonymous, synonymous or stop codon gained) were located in exonic regions (13.04 %) and 294 placed in intronic regions (13.59 %). An extended list of genes and proteins that could be related to these traits was developed. Two SNP markers that explain a high percentage of the observed phenotypic variance, UCD_SNP_1084 and UCD_SNP_46, are associated with zinc finger (C3HC4-type RING finger) family protein and AOX1A (alternative oxidase 1a) protein groups, respectively. In addition, phenotypic variation suggests that the observed polymorphism for SNP UCD_SNP_1084 [A/G] mutation could be a candidate quantitative trait nucleotide affecting quantitative trait loci for mealiness. The interaction and expression of affected proteins could explain the variation observed in each individual and facilitate understanding of gene regulatory networks for fruit quality traits in peach.
A genome-wide perspective on the evolutionary history of enigmatic wolf-like canids
vonHoldt, Bridgett M.; Pollinger, John P.; Earl, Dent A.; Knowles, James C.; Boyko, Adam R.; Parker, Heidi; Geffen, Eli; Pilot, Malgorzata; Jedrzejewski, Wlodzimierz; Jedrzejewska, Bogumila; Sidorovich, Vadim; Greco, Claudia; Randi, Ettore; Musiani, Marco; Kays, Roland; Bustamante, Carlos D.; Ostrander, Elaine A.; Novembre, John; Wayne, Robert K.
2011-01-01
High-throughput genotyping technologies developed for model species can potentially increase the resolution of demographic history and ancestry in wild relatives. We use a SNP genotyping microarray developed for the domestic dog to assay variation in over 48K loci in wolf-like species worldwide. Despite the high mobility of these large carnivores, we find distinct hierarchical population units within gray wolves and coyotes that correspond with geographic and ecologic differences among populations. Further, we test controversial theories about the ancestry of the Great Lakes wolf and red wolf using an analysis of haplotype blocks across all 38 canid autosomes. We find that these enigmatic canids are highly admixed varieties derived from gray wolves and coyotes, respectively. This divergent genomic history suggests that they do not have a shared recent ancestry as proposed by previous researchers. Interspecific hybridization, as well as the process of evolutionary divergence, may be responsible for the observed phenotypic distinction of both forms. Such admixture complicates decisions regarding endangered species restoration and protection. PMID:21566151
Machiela, Mitchell J; Zhou, Weiyin; Karlins, Eric; Sampson, Joshua N; Freedman, Neal D; Yang, Qi; Hicks, Belynda; Dagnall, Casey; Hautman, Christopher; Jacobs, Kevin B; Abnet, Christian C; Aldrich, Melinda C; Amos, Christopher; Amundadottir, Laufey T; Arslan, Alan A; Beane-Freeman, Laura E; Berndt, Sonja I; Black, Amanda; Blot, William J; Bock, Cathryn H; Bracci, Paige M; Brinton, Louise A; Bueno-de-Mesquita, H Bas; Burdett, Laurie; Buring, Julie E; Butler, Mary A; Canzian, Federico; Carreón, Tania; Chaffee, Kari G; Chang, I-Shou; Chatterjee, Nilanjan; Chen, Chu; Chen, Constance; Chen, Kexin; Chung, Charles C; Cook, Linda S; Crous Bou, Marta; Cullen, Michael; Davis, Faith G; De Vivo, Immaculata; Ding, Ti; Doherty, Jennifer; Duell, Eric J; Epstein, Caroline G; Fan, Jin-Hu; Figueroa, Jonine D; Fraumeni, Joseph F; Friedenreich, Christine M; Fuchs, Charles S; Gallinger, Steven; Gao, Yu-Tang; Gapstur, Susan M; Garcia-Closas, Montserrat; Gaudet, Mia M; Gaziano, J Michael; Giles, Graham G; Gillanders, Elizabeth M; Giovannucci, Edward L; Goldin, Lynn; Goldstein, Alisa M; Haiman, Christopher A; Hallmans, Goran; Hankinson, Susan E; Harris, Curtis C; Henriksson, Roger; Holly, Elizabeth A; Hong, Yun-Chul; Hoover, Robert N; Hsiung, Chao A; Hu, Nan; Hu, Wei; Hunter, David J; Hutchinson, Amy; Jenab, Mazda; Johansen, Christoffer; Khaw, Kay-Tee; Kim, Hee Nam; Kim, Yeul Hong; Kim, Young Tae; Klein, Alison P; Klein, Robert; Koh, Woon-Puay; Kolonel, Laurence N; Kooperberg, Charles; Kraft, Peter; Krogh, Vittorio; Kurtz, Robert C; LaCroix, Andrea; Lan, Qing; Landi, Maria Teresa; Marchand, Loic Le; Li, Donghui; Liang, Xiaolin; Liao, Linda M; Lin, Dongxin; Liu, Jianjun; Lissowska, Jolanta; Lu, Lingeng; Magliocco, Anthony M; Malats, Nuria; Matsuo, Keitaro; McNeill, Lorna H; McWilliams, Robert R; Melin, Beatrice S; Mirabello, Lisa; Moore, Lee; Olson, Sara H; Orlow, Irene; Park, Jae Yong; Patiño-Garcia, Ana; Peplonska, Beata; Peters, Ulrike; Petersen, Gloria M; Pooler, Loreall; Prescott, Jennifer; Prokunina-Olsson, Ludmila; Purdue, Mark P; Qiao, You-Lin; Rajaraman, Preetha; Real, Francisco X; Riboli, Elio; Risch, Harvey A; Rodriguez-Santiago, Benjamin; Ruder, Avima M; Savage, Sharon A; Schumacher, Fredrick; Schwartz, Ann G; Schwartz, Kendra L; Seow, Adeline; Wendy Setiawan, Veronica; Severi, Gianluca; Shen, Hongbing; Sheng, Xin; Shin, Min-Ho; Shu, Xiao-Ou; Silverman, Debra T; Spitz, Margaret R; Stevens, Victoria L; Stolzenberg-Solomon, Rachael; Stram, Daniel; Tang, Ze-Zhong; Taylor, Philip R; Teras, Lauren R; Tobias, Geoffrey S; Van Den Berg, David; Visvanathan, Kala; Wacholder, Sholom; Wang, Jiu-Cun; Wang, Zhaoming; Wentzensen, Nicolas; Wheeler, William; White, Emily; Wiencke, John K; Wolpin, Brian M; Wong, Maria Pik; Wu, Chen; Wu, Tangchun; Wu, Xifeng; Wu, Yi-Long; Wunder, Jay S; Xia, Lucy; Yang, Hannah P; Yang, Pan-Chyr; Yu, Kai; Zanetti, Krista A; Zeleniuch-Jacquotte, Anne; Zheng, Wei; Zhou, Baosen; Ziegler, Regina G; Perez-Jurado, Luis A; Caporaso, Neil E; Rothman, Nathaniel; Tucker, Margaret; Dean, Michael C; Yeager, Meredith; Chanock, Stephen J
2016-06-13
To investigate large structural clonal mosaicism of chromosome X, we analysed the SNP microarray intensity data of 38,303 women from cancer genome-wide association studies (20,878 cases and 17,425 controls) and detected 124 mosaic X events >2 Mb in 97 (0.25%) women. Here we show rates for X-chromosome mosaicism are four times higher than mean autosomal rates; X mosaic events more often include the entire chromosome and participants with X events more likely harbour autosomal mosaic events. X mosaicism frequency increases with age (0.11% in 50-year olds; 0.45% in 75-year olds), as reported for Y and autosomes. Methylation array analyses of 33 women with X mosaicism indicate events preferentially involve the inactive X chromosome. Our results provide further evidence that the sex chromosomes undergo mosaic events more frequently than autosomes, which could have implications for understanding the underlying mechanisms of mosaic events and their possible contribution to risk for chronic diseases.
Clinical comparison of overlapping deletions of 19p13.3.
Risheg, Hiba; Pasion, Romela; Sacharow, Stephanie; Proud, Virginia; Immken, LaDonna; Schwartz, Stuart; Tepperberg, Jim H; Papenhausen, Peter; Tan, Tiong Y; Andrieux, Joris; Plessis, Ghislaine; Amor, David J; Keitges, Elisabeth A
2013-05-01
We present three patients with overlapping interstitial deletions of 19p13.3 identified by high resolution SNP microarray analysis. All three had a similar phenotype characterized by intellectual disability or developmental delay, structural heart abnormalities, large head relative to height and weight or macrocephaly, and minor facial anomalies. Deletion sizes ranged from 792 Kb to 1.0 Mb and included a common region arr [hg19] 19p13.3 (3,814,392-4,136,989), containing eight genes: ZFR2, ATCAY, NMRK2, DAPK3, EEF2, PIAS4, ZBTB7A, MAP2K2, and two non-coding RNA's MIR637 and SNORDU37. The patient phenotypes were compared with three previous single patient reports with similar interstitial 19p13.3 deletions and six additional patients from the DECIPHER and ISCA databases to determine if a common haploinsufficient phenotype for the region can be established. Copyright © 2013 Wiley Periodicals, Inc.
CGDSNPdb: a database resource for error-checked and imputed mouse SNPs.
Hutchins, Lucie N; Ding, Yueming; Szatkiewicz, Jin P; Von Smith, Randy; Yang, Hyuna; de Villena, Fernando Pardo-Manuel; Churchill, Gary A; Graber, Joel H
2010-07-06
The Center for Genome Dynamics Single Nucleotide Polymorphism Database (CGDSNPdb) is an open-source value-added database with more than nine million mouse single nucleotide polymorphisms (SNPs), drawn from multiple sources, with genotypes assigned to multiple inbred strains of laboratory mice. All SNPs are checked for accuracy and annotated for properties specific to the SNP as well as those implied by changes to overlapping protein-coding genes. CGDSNPdb serves as the primary interface to two unique data sets, the 'imputed genotype resource' in which a Hidden Markov Model was used to assess local haplotypes and the most probable base assignment at several million genomic loci in tens of strains of mice, and the Affymetrix Mouse Diversity Genotyping Array, a high density microarray with over 600,000 SNPs and over 900,000 invariant genomic probes. CGDSNPdb is accessible online through either a web-based query tool or a MySQL public login. Database URL: http://cgd.jax.org/cgdsnpdb/
Harris, R. Alan; Wang, Ting; Coarfa, Cristian; Nagarajan, Raman P.; Hong, Chibo; Downey, Sara L.; Johnson, Brett E.; Fouse, Shaun D.; Delaney, Allen; Zhao, Yongjun; Olshen, Adam; Ballinger, Tracy; Zhou, Xin; Forsberg, Kevin J.; Gu, Junchen; Echipare, Lorigail; O’Geen, Henriette; Lister, Ryan; Pelizzola, Mattia; Xi, Yuanxin; Epstein, Charles B.; Bernstein, Bradley E.; Hawkins, R. David; Ren, Bing; Chung, Wen-Yu; Gu, Hongcang; Bock, Christoph; Gnirke, Andreas; Zhang, Michael Q.; Haussler, David; Ecker, Joseph; Li, Wei; Farnham, Peggy J.; Waterland, Robert A.; Meissner, Alexander; Marra, Marco A.; Hirst, Martin; Milosavljevic, Aleksandar; Costello, Joseph F.
2010-01-01
Sequencing-based DNA methylation profiling methods are comprehensive and, as accuracy and affordability improve, will increasingly supplant microarrays for genome-scale analyses. Here, four sequencing-based methodologies were applied to biological replicates of human embryonic stem cells to compare their CpG coverage genome-wide and in transposons, resolution, cost, concordance and its relationship with CpG density and genomic context. The two bisulfite methods reached concordance of 82% for CpG methylation levels and 99% for non-CpG cytosine methylation levels. Using binary methylation calls, two enrichment methods were 99% concordant, while regions assessed by all four methods were 97% concordant. To achieve comprehensive methylome coverage while reducing cost, an approach integrating two complementary methods was examined. The integrative methylome profile along with histone methylation, RNA, and SNP profiles derived from the sequence reads allowed genome-wide assessment of allele-specific epigenetic states, identifying most known imprinted regions and new loci with monoallelic epigenetic marks and monoallelic expression. PMID:20852635
Diagnosis of Van den Ende-Gupta syndrome: Approach to the Marden-Walker-like spectrum of disorders.
Niederhoffer, Karen Y; Fahiminiya, Somayyeh; Eydoux, Patrice; Mawson, John; Nishimura, Gen; Jerome-Majewska, Loydie A; Patel, Millan S
2016-09-01
Marden-Walker syndrome is challenging to diagnose, as there is significant overlap with other multi-system congenital contracture syndromes including Beals congenital contractural arachnodactyly, D4ST1-Deficient Ehlers-Danlos syndrome (adducted thumb-clubfoot syndrome), Schwartz-Jampel syndrome, Freeman-Sheldon syndrome, Cerebro-oculo-facio-skeletal syndrome, and Van den Ende-Gupta syndrome. We discuss this differential diagnosis in the context of a boy from a consanguineous union with Van den Ende-Gupta syndrome, a diagnosis initially confused by the atypical presence of intellectual disability. SNP microarray and whole exome sequencing identified a homozygous frameshift mutation (p.L870V) in SCARF2 and predicted damaging mutations in several genes, most notably DGCR2 (p.P75L) and NCAM2 (p.S147G), both possible candidates for this child's intellectual disability. We review distinguishing features for each Marden-Walker-like syndrome and propose a clinical algorithm for diagnosis among this spectrum of disorders. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
Functional Characterization of Schizophrenia-Associated Variation in CACNA1C
Eckart, Nicole; Song, Qifeng; Yang, Rebecca; Wang, Ruihua; Zhu, Heng; McCallion, Andrew S.; Avramopoulos, Dimitrios
2016-01-01
Calcium channel subunits, including CACNA1C, have been associated with multiple psychiatric disorders. Specifically, genome wide association studies (GWAS) have repeatedly identified the single nucleotide polymorphism (SNP) rs1006737 in intron 3 of CACNA1C to be strongly associated with schizophrenia and bipolar disorder. Here, we show that rs1006737 marks a quantitative trait locus for CACNA1C transcript levels. We test 16 SNPs in high linkage disequilibrium with rs1007637 and find one, rs4765905, consistently showing allele-dependent regulatory function in reporter assays. We find allele-specific protein binding for 13 SNPs including rs4765905. Using protein microarrays, we identify several proteins binding ≥3 SNPs, but not control sequences, suggesting possible functional interactions and combinatorial haplotype effects. Finally, using circular chromatin conformation capture, we show interaction of the disease-associated region including the 16 SNPs with the CACNA1C promoter and other potential regulatory regions. Our results elucidate the pathogenic relevance of one of the best-supported risk loci for schizophrenia and bipolar disorder. PMID:27276213
Female chromosome X mosaicism is age-related and preferentially affects the inactivated X chromosome
Machiela, Mitchell J.; Zhou, Weiyin; Karlins, Eric; Sampson, Joshua N.; Freedman, Neal D.; Yang, Qi; Hicks, Belynda; Dagnall, Casey; Hautman, Christopher; Jacobs, Kevin B.; Abnet, Christian C.; Aldrich, Melinda C.; Amos, Christopher; Amundadottir, Laufey T.; Arslan, Alan A.; Beane-Freeman, Laura E.; Berndt, Sonja I.; Black, Amanda; Blot, William J.; Bock, Cathryn H.; Bracci, Paige M.; Brinton, Louise A.; Bueno-de-Mesquita, H Bas; Burdett, Laurie; Buring, Julie E.; Butler, Mary A.; Canzian, Federico; Carreón, Tania; Chaffee, Kari G.; Chang, I-Shou; Chatterjee, Nilanjan; Chen, Chu; Chen, Constance; Chen, Kexin; Chung, Charles C.; Cook, Linda S.; Crous Bou, Marta; Cullen, Michael; Davis, Faith G.; De Vivo, Immaculata; Ding, Ti; Doherty, Jennifer; Duell, Eric J.; Epstein, Caroline G.; Fan, Jin-Hu; Figueroa, Jonine D.; Fraumeni, Joseph F.; Friedenreich, Christine M.; Fuchs, Charles S.; Gallinger, Steven; Gao, Yu-Tang; Gapstur, Susan M.; Garcia-Closas, Montserrat; Gaudet, Mia M.; Gaziano, J. Michael; Giles, Graham G.; Gillanders, Elizabeth M.; Giovannucci, Edward L.; Goldin, Lynn; Goldstein, Alisa M.; Haiman, Christopher A.; Hallmans, Goran; Hankinson, Susan E.; Harris, Curtis C.; Henriksson, Roger; Holly, Elizabeth A.; Hong, Yun-Chul; Hoover, Robert N.; Hsiung, Chao A.; Hu, Nan; Hu, Wei; Hunter, David J.; Hutchinson, Amy; Jenab, Mazda; Johansen, Christoffer; Khaw, Kay-Tee; Kim, Hee Nam; Kim, Yeul Hong; Kim, Young Tae; Klein, Alison P.; Klein, Robert; Koh, Woon-Puay; Kolonel, Laurence N.; Kooperberg, Charles; Kraft, Peter; Krogh, Vittorio; Kurtz, Robert C.; LaCroix, Andrea; Lan, Qing; Landi, Maria Teresa; Marchand, Loic Le; Li, Donghui; Liang, Xiaolin; Liao, Linda M.; Lin, Dongxin; Liu, Jianjun; Lissowska, Jolanta; Lu, Lingeng; Magliocco, Anthony M.; Malats, Nuria; Matsuo, Keitaro; McNeill, Lorna H.; McWilliams, Robert R.; Melin, Beatrice S.; Mirabello, Lisa; Moore, Lee; Olson, Sara H.; Orlow, Irene; Park, Jae Yong; Patiño-Garcia, Ana; Peplonska, Beata; Peters, Ulrike; Petersen, Gloria M.; Pooler, Loreall; Prescott, Jennifer; Prokunina-Olsson, Ludmila; Purdue, Mark P.; Qiao, You-Lin; Rajaraman, Preetha; Real, Francisco X.; Riboli, Elio; Risch, Harvey A.; Rodriguez-Santiago, Benjamin; Ruder, Avima M.; Savage, Sharon A.; Schumacher, Fredrick; Schwartz, Ann G.; Schwartz, Kendra L.; Seow, Adeline; Wendy Setiawan, Veronica; Severi, Gianluca; Shen, Hongbing; Sheng, Xin; Shin, Min-Ho; Shu, Xiao-Ou; Silverman, Debra T.; Spitz, Margaret R.; Stevens, Victoria L.; Stolzenberg-Solomon, Rachael; Stram, Daniel; Tang, Ze-Zhong; Taylor, Philip R.; Teras, Lauren R.; Tobias, Geoffrey S.; Van Den Berg, David; Visvanathan, Kala; Wacholder, Sholom; Wang, Jiu-Cun; Wang, Zhaoming; Wentzensen, Nicolas; Wheeler, William; White, Emily; Wiencke, John K.; Wolpin, Brian M.; Wong, Maria Pik; Wu, Chen; Wu, Tangchun; Wu, Xifeng; Wu, Yi-Long; Wunder, Jay S.; Xia, Lucy; Yang, Hannah P.; Yang, Pan-Chyr; Yu, Kai; Zanetti, Krista A.; Zeleniuch-Jacquotte, Anne; Zheng, Wei; Zhou, Baosen; Ziegler, Regina G.; Perez-Jurado, Luis A.; Caporaso, Neil E.; Rothman, Nathaniel; Tucker, Margaret; Dean, Michael C.; Yeager, Meredith; Chanock, Stephen J.
2016-01-01
To investigate large structural clonal mosaicism of chromosome X, we analysed the SNP microarray intensity data of 38,303 women from cancer genome-wide association studies (20,878 cases and 17,425 controls) and detected 124 mosaic X events >2 Mb in 97 (0.25%) women. Here we show rates for X-chromosome mosaicism are four times higher than mean autosomal rates; X mosaic events more often include the entire chromosome and participants with X events more likely harbour autosomal mosaic events. X mosaicism frequency increases with age (0.11% in 50-year olds; 0.45% in 75-year olds), as reported for Y and autosomes. Methylation array analyses of 33 women with X mosaicism indicate events preferentially involve the inactive X chromosome. Our results provide further evidence that the sex chromosomes undergo mosaic events more frequently than autosomes, which could have implications for understanding the underlying mechanisms of mosaic events and their possible contribution to risk for chronic diseases. PMID:27291797
Leite, Neiva; Furtado-Alle, Lupe; Teixeira, Mayza Dalcin; de Souza, Ricardo Lehtonen Rodrigues; da Silva, Larissa Rosa; Pizzi, Juliana; Lopes, Maria de Fátima Aguiar; Titski, Ana Cláudia Kapp
2018-01-01
Purpose The rs9939609 SNP (T > A) in FTO gene is associated with obesity and type 2 diabetes. The present study aimed at verifying whether this SNP influenced biochemical outcomes of children and adolescents who are overweight/obese submitted to a program of physical exercise and also if there was influence on basal levels of these biochemical variables. Methods The sample was composed by 432 children and adolescents grouped in three ways (obese, overweight, and normal weight); of these, 135 children and adoloescents who are obese and overweight were submitted to a physical exercise program for 12 weeks. All were genotyped by TaqMan SNP genotyping assay. Results The children and adolescents who are overweight/obese and carriers of AA genotype had higher levels of insulin (p=0.03) and HOMA (p=0.007) and lower levels of glucose (p=0.003), but the SNP did not modulate the response to physical exercise. Conclusions In our study, the rs9939609 AA genotype was associated with parameters related to insulin metabolism but did not interact with physical exercise. PMID:29854435
Molecular characterization of endocarditis-associated Staphylococcus aureus.
Nethercott, Cara; Mabbett, Amanda N; Totsika, Makrina; Peters, Paul; Ortiz, Juan C; Nimmo, Graeme R; Coombs, Geoffrey W; Walker, Mark J; Schembri, Mark A
2013-07-01
Infective endocarditis (IE) is a life-threatening infection of the heart endothelium and valves. Staphylococcus aureus is a predominant cause of severe IE and is frequently associated with infections in health care settings and device-related infections. Multilocus sequence typing (MLST), spa typing, and virulence gene microarrays are frequently used to classify S. aureus clinical isolates. This study examined the utility of these typing tools to investigate S. aureus epidemiology associated with IE. Ninety-seven S. aureus isolates were collected from patients diagnosed with (i) IE, (ii) bloodstream infection related to medical devices, (iii) bloodstream infection not related to medical devices, and (iv) skin or soft-tissue infections. The MLST clonal complex (CC) for each isolate was determined and compared to the CCs of members of the S. aureus population by eBURST analysis. The spa type of all isolates was also determined. A null model was used to determine correlations of IE with CC and spa type. DNA microarray analysis was performed, and a permutational analysis of multivariate variance (PERMANOVA) and principal coordinates analysis were conducted to identify genotypic differences between IE and non-IE strains. CC12, CC20, and spa type t160 were significantly associated with IE S. aureus. A subset of virulence-associated genes and alleles, including genes encoding staphylococcal superantigen-like proteins, fibrinogen-binding protein, and a leukocidin subunit, also significantly correlated with IE isolates. MLST, spa typing, and microarray analysis are promising tools for monitoring S. aureus epidemiology associated with IE. Further research to determine a role for the S. aureus IE-associated virulence genes identified in this study is warranted.
Molecular Characterization of Endocarditis-Associated Staphylococcus aureus
Nethercott, Cara; Mabbett, Amanda N.; Totsika, Makrina; Peters, Paul; Ortiz, Juan C.; Nimmo, Graeme R.; Coombs, Geoffrey W.
2013-01-01
Infective endocarditis (IE) is a life-threatening infection of the heart endothelium and valves. Staphylococcus aureus is a predominant cause of severe IE and is frequently associated with infections in health care settings and device-related infections. Multilocus sequence typing (MLST), spa typing, and virulence gene microarrays are frequently used to classify S. aureus clinical isolates. This study examined the utility of these typing tools to investigate S. aureus epidemiology associated with IE. Ninety-seven S. aureus isolates were collected from patients diagnosed with (i) IE, (ii) bloodstream infection related to medical devices, (iii) bloodstream infection not related to medical devices, and (iv) skin or soft-tissue infections. The MLST clonal complex (CC) for each isolate was determined and compared to the CCs of members of the S. aureus population by eBURST analysis. The spa type of all isolates was also determined. A null model was used to determine correlations of IE with CC and spa type. DNA microarray analysis was performed, and a permutational analysis of multivariate variance (PERMANOVA) and principal coordinates analysis were conducted to identify genotypic differences between IE and non-IE strains. CC12, CC20, and spa type t160 were significantly associated with IE S. aureus. A subset of virulence-associated genes and alleles, including genes encoding staphylococcal superantigen-like proteins, fibrinogen-binding protein, and a leukocidin subunit, also significantly correlated with IE isolates. MLST, spa typing, and microarray analysis are promising tools for monitoring S. aureus epidemiology associated with IE. Further research to determine a role for the S. aureus IE-associated virulence genes identified in this study is warranted. PMID:23616460
Telldahl, Ylva; Svensson, Emma; Götherström, Anders; Storå, Jan
2011-01-01
Human management of livestock and the presence of different breeds have been discussed in archaeozoology and animal breeding. Traditionally osteometrics has been the main tool in addressing these questions. We combine osteometrics with molecular sex identifications of 104 of 340 morphometrically analysed bones in order to investigate the use of cattle at the Eketorp ringfort on the Öland island in Sweden. The fort is dated to 300-1220/50 A.D., revealing three different building phases. In order to investigate specific patterns and shifts through time in the use of cattle the genetic data is evaluated in relation to osteometric patterns and occurrence of pathologies on cattle metapodia. Males were genotyped for a Y-chromosomal SNP in UTY19 that separates the two major haplogroups, Y1 and Y2, in taurine cattle. A subset of the samples were also genotyped for one SNP involved in coat coloration (MC1R), one SNP putatively involved in resistance to cattle plague (TLR4), and one SNP in intron 5 of the IGF-1 gene that has been associated to size and reproduction.The results of the molecular analyses confirm that the skeletal assemblage from Eketorp is dominated by skeletal elements from females, which implies that dairying was important. Pathological lesions on the metapodia were classified into two groups; those associated with the use as draught animals and those lesions without a similar aetiology. The results show that while bulls both exhibit draught related lesions and other types of lesions, cows exhibit other types of lesions. Interestingly, a few elements from females exhibit draught related lesions. We conclude that this reflects the different use of adult female and male cattle.Although we note some variation in the use of cattle at Eketorp between Iron Age and Medieval time we have found little evidence for the use of different types of animals for specific purposes. The use of specific (genetic) breeds seems to be a phenomenon that developed later than the Eketorp settlement.
Telldahl, Ylva; Svensson, Emma; Götherström, Anders; Storå, Jan
2011-01-01
Human management of livestock and the presence of different breeds have been discussed in archaeozoology and animal breeding. Traditionally osteometrics has been the main tool in addressing these questions. We combine osteometrics with molecular sex identifications of 104 of 340 morphometrically analysed bones in order to investigate the use of cattle at the Eketorp ringfort on the Öland island in Sweden. The fort is dated to 300–1220/50 A.D., revealing three different building phases. In order to investigate specific patterns and shifts through time in the use of cattle the genetic data is evaluated in relation to osteometric patterns and occurrence of pathologies on cattle metapodia. Males were genotyped for a Y-chromosomal SNP in UTY19 that separates the two major haplogroups, Y1 and Y2, in taurine cattle. A subset of the samples were also genotyped for one SNP involved in coat coloration (MC1R), one SNP putatively involved in resistance to cattle plague (TLR4), and one SNP in intron 5 of the IGF-1 gene that has been associated to size and reproduction. The results of the molecular analyses confirm that the skeletal assemblage from Eketorp is dominated by skeletal elements from females, which implies that dairying was important. Pathological lesions on the metapodia were classified into two groups; those associated with the use as draught animals and those lesions without a similar aetiology. The results show that while bulls both exhibit draught related lesions and other types of lesions, cows exhibit other types of lesions. Interestingly, a few elements from females exhibit draught related lesions. We conclude that this reflects the different use of adult female and male cattle. Although we note some variation in the use of cattle at Eketorp between Iron Age and Medieval time we have found little evidence for the use of different types of animals for specific purposes. The use of specific (genetic) breeds seems to be a phenomenon that developed later than the Eketorp settlement. PMID:21731623
Unique Proteins Expressed by Blood Vessels in Skeletal Sites Colonized by Breast Cancer Cells
2006-08-01
fluorescent labeled acetylated LDL at an accelerated rate (3). After one week in culture BVECs and MVECs were harvested. Total RNA was extracted from...both cell types using the Qiagen RNeasy kit (Valencia, CA). Microarray labeling, hybridization and analysis was conducted on the RNA by the Penn...State University DNA Microarray Facility under the direction of Dr. Craig Praul. Briefly, RNA obtained from three separate isolations of BVECs and
Development of a genotyping microarray for Usher syndrome.
Cremers, Frans P M; Kimberling, William J; Külm, Maigi; de Brouwer, Arjan P; van Wijk, Erwin; te Brinke, Heleen; Cremers, Cor W R J; Hoefsloot, Lies H; Banfi, Sandro; Simonelli, Francesca; Fleischhauer, Johannes C; Berger, Wolfgang; Kelley, Phil M; Haralambous, Elene; Bitner-Glindzicz, Maria; Webster, Andrew R; Saihan, Zubin; De Baere, Elfride; Leroy, Bart P; Silvestri, Giuliana; McKay, Gareth J; Koenekoop, Robert K; Millan, Jose M; Rosenberg, Thomas; Joensuu, Tarja; Sankila, Eeva-Marja; Weil, Dominique; Weston, Mike D; Wissinger, Bernd; Kremer, Hannie
2007-02-01
Usher syndrome, a combination of retinitis pigmentosa (RP) and sensorineural hearing loss with or without vestibular dysfunction, displays a high degree of clinical and genetic heterogeneity. Three clinical subtypes can be distinguished, based on the age of onset and severity of the hearing impairment, and the presence or absence of vestibular abnormalities. Thus far, eight genes have been implicated in the syndrome, together comprising 347 protein-coding exons. To improve DNA diagnostics for patients with Usher syndrome, we developed a genotyping microarray based on the arrayed primer extension (APEX) method. Allele-specific oligonucleotides corresponding to all 298 Usher syndrome-associated sequence variants known to date, 76 of which are novel, were arrayed. Approximately half of these variants were validated using original patient DNAs, which yielded an accuracy of >98%. The efficiency of the Usher genotyping microarray was tested using DNAs from 370 unrelated European and American patients with Usher syndrome. Sequence variants were identified in 64/140 (46%) patients with Usher syndrome type I, 45/189 (24%) patients with Usher syndrome type II, 6/21 (29%) patients with Usher syndrome type III and 6/20 (30%) patients with atypical Usher syndrome. The chip also identified two novel sequence variants, c.400C>T (p.R134X) in PCDH15 and c.1606T>C (p.C536S) in USH2A. The Usher genotyping microarray is a versatile and affordable screening tool for Usher syndrome. Its efficiency will improve with the addition of novel sequence variants with minimal extra costs, making it a very useful first-pass screening tool.
Development of a genotyping microarray for Usher syndrome
Cremers, Frans P M; Kimberling, William J; Külm, Maigi; de Brouwer, Arjan P; van Wijk, Erwin; te Brinke, Heleen; Cremers, Cor W R J; Hoefsloot, Lies H; Banfi, Sandro; Simonelli, Francesca; Fleischhauer, Johannes C; Berger, Wolfgang; Kelley, Phil M; Haralambous, Elene; Bitner‐Glindzicz, Maria; Webster, Andrew R; Saihan, Zubin; De Baere, Elfride; Leroy, Bart P; Silvestri, Giuliana; McKay, Gareth J; Koenekoop, Robert K; Millan, Jose M; Rosenberg, Thomas; Joensuu, Tarja; Sankila, Eeva‐Marja; Weil, Dominique; Weston, Mike D; Wissinger, Bernd; Kremer, Hannie
2007-01-01
Background Usher syndrome, a combination of retinitis pigmentosa (RP) and sensorineural hearing loss with or without vestibular dysfunction, displays a high degree of clinical and genetic heterogeneity. Three clinical subtypes can be distinguished, based on the age of onset and severity of the hearing impairment, and the presence or absence of vestibular abnormalities. Thus far, eight genes have been implicated in the syndrome, together comprising 347 protein‐coding exons. Methods: To improve DNA diagnostics for patients with Usher syndrome, we developed a genotyping microarray based on the arrayed primer extension (APEX) method. Allele‐specific oligonucleotides corresponding to all 298 Usher syndrome‐associated sequence variants known to date, 76 of which are novel, were arrayed. Results Approximately half of these variants were validated using original patient DNAs, which yielded an accuracy of >98%. The efficiency of the Usher genotyping microarray was tested using DNAs from 370 unrelated European and American patients with Usher syndrome. Sequence variants were identified in 64/140 (46%) patients with Usher syndrome type I, 45/189 (24%) patients with Usher syndrome type II, 6/21 (29%) patients with Usher syndrome type III and 6/20 (30%) patients with atypical Usher syndrome. The chip also identified two novel sequence variants, c.400C>T (p.R134X) in PCDH15 and c.1606T>C (p.C536S) in USH2A. Conclusion The Usher genotyping microarray is a versatile and affordable screening tool for Usher syndrome. Its efficiency will improve with the addition of novel sequence variants with minimal extra costs, making it a very useful first‐pass screening tool. PMID:16963483
Schnitzler, Fabian; Friedrich, Matthias; Wolf, Christiane; Angelberger, Marianne; Diegelmann, Julia; Olszak, Torsten; Beigel, Florian; Tillack, Cornelia; Stallhofer, Johannes; Göke, Burkhard; Glas, Jürgen; Lohse, Peter; Brand, Stephan
2014-01-01
Very recently, a sub-analysis of genome-wide association scans revealed that the non-coding single nucleotide polymorphism (SNP) rs12212067 in the FOXO3A gene is associated with a milder course of Crohn's disease (CD) (Cell 2013;155:57-69). The aim of our study was to evaluate the clinical value of the SNP rs12212067 in predicting the severity of CD by correlating CD patient genotype status with the most relevant complications of CD such as stenoses, fistulas, and CD-related surgery. We genotyped 550 CD patients for rs12212067 (FOXO3A) and the three common CD-associated NOD2 mutations rs2066844, rs2066847, and rs2066847 and performed genotype-phenotype analyses. No significant phenotypic differences were found between the wild-type genotype TT of the FOXO3A SNP rs12212067 and the minor genotypes TG and GG independently from NOD2 variants. The allele frequency of the minor G allele was 12.7%. Age at diagnosis, disease duration, body mass index, surgery rate, stenoses, fistula, need for immunosuppressive therapy, and disease course were not significantly different. In contrast, the NOD2 mutant p.Leu1007fsX1008 (rs2066847) was highly associated with penetrating CD (p = 0.01), the development of fistulas (p = 0.01) and stenoses (p = 0.01), and ileal disease localization (p = 0.03). Importantly, the NOD2 SNP rs2066847 was a strong separator between an aggressive and a mild course of CD (p = 2.99×10(-5)), while the FOXO3A SNP rs12212067 did not separate between mild and aggressive CD behavior in our cohort (p = 0.35). 96.2% of the homozygous NOD2 p.Leu1007fsX1008 carriers had an aggressive disease behavior compared to 69.3% of the patients with the NOD2 wild-type genotype (p = 0.007). In clinical practice, the NOD2 variant p.Leu1007fsX1008 (rs2066847), in particular in homozygous form, is a much stronger marker for a severe clinical phenotype than the FOXO3A rs12212067 SNP for a mild disease course on an individual patient level despite its important impact on the inflammatory response of monocytes.
Identification and characterization of nuclear genes involved in photosynthesis in Populus
2014-01-01
Background The gap between the real and potential photosynthetic rate under field conditions suggests that photosynthesis could potentially be improved. Nuclear genes provide possible targets for improving photosynthetic efficiency. Hence, genome-wide identification and characterization of the nuclear genes affecting photosynthetic traits in woody plants would provide key insights on genetic regulation of photosynthesis and identify candidate processes for improvement of photosynthesis. Results Using microarray and bulked segregant analysis strategies, we identified differentially expressed nuclear genes for photosynthesis traits in a segregating population of poplar. We identified 515 differentially expressed genes in this population (FC ≥ 2 or FC ≤ 0.5, P < 0.05), 163 up-regulated and 352 down-regulated. Real-time PCR expression analysis confirmed the microarray data. Singular Enrichment Analysis identified 48 significantly enriched GO terms for molecular functions (28), biological processes (18) and cell components (2). Furthermore, we selected six candidate genes for functional examination by a single-marker association approach, which demonstrated that 20 SNPs in five candidate genes significantly associated with photosynthetic traits, and the phenotypic variance explained by each SNP ranged from 2.3% to 12.6%. This revealed that regulation of photosynthesis by the nuclear genome mainly involves transport, metabolism and response to stimulus functions. Conclusions This study provides new genome-scale strategies for the discovery of potential candidate genes affecting photosynthesis in Populus, and for identification of the functions of genes involved in regulation of photosynthesis. This work also suggests that improving photosynthetic efficiency under field conditions will require the consideration of multiple factors, such as stress responses. PMID:24673936
Wang, Rongyue; Lei, Tingying; Fu, Fang; Li, Ru; Jing, Xiangyi; Yang, Xin; Liu, Juan; Li, Dongzhi; Liao, Can
2018-03-26
Chromosome microarray analysis (CMA) is currently the first-tier diagnostic assay for the evaluation of developmental delay (DD) and intellectual disability (ID) with unknown etiology. Here, we present our clinical experience in implementing whole-genome high-resolution single nucleotide polymorphism (SNP) arrays to investigate 489 patients with unexplained DD/ID in whom standard karyotyping analyses showed normal karyotypes. This study aimed to assess the usefulness of CMA for clinical diagnostic testing in the Chinese population. A total of 489 children were classified into three groups: isolated DD/ID (n = 358), DD/ID with epilepsy (n = 49), and DD/ID with other structural anomalies (n = 82). We identified 126 cases (25.8%, 126/489) of pathogenic copy number variants (CNVs) by CMA, including 89 (24.9%, 89/358) with isolated DD/ID, 13 (26.5%, 13/49) with DD/ID with epilepsy, and 24 (29.3%, 24/82) with DD/ID with other structural anomalies. Among the 126 cases of pathogenic CNVs, 79 cases were identified as microdeletion/microduplication syndromes, among which 76 cases were classified as common syndromes, and 3 cases were classified as rare syndromes, including 15q24 microdeletion syndrome, Xq28 microduplication syndrome and Lowe syndrome. Additionally, there were forty-seven cases of non-syndromic pathogenic CNVs. The ABAT, FTSJ1, DYNC1H1, and SETBP1 genes were identified as DD/ID candidate genes. Our findings suggest the necessity of CMA as a routine diagnostic test for unexplained DD/ID in South China. Copyright © 2018. Published by Elsevier B.V.
Typing SNP based on the near-infrared spectroscopy and artificial neural network
NASA Astrophysics Data System (ADS)
Ren, Li; Wang, Wei-Peng; Gao, Yu-Zhen; Yu, Xiao-Wei; Xie, Hong-Ping
2009-07-01
Based on the near-infrared spectra (NIRS) of the measured samples as the discriminant variables of their genotypes, the genotype discriminant model of SNP has been established by using back-propagation artificial neural network (BP-ANN). Taking a SNP (857G > A) of N-acetyltransferase 2 (NAT2) as an example, DNA fragments containing the SNP site were amplified by the PCR method based on a pair of primers to obtain the three-genotype (GG, AA, and GA) modeling samples. The NIRS-s of the amplified samples were directly measured in transmission by using quartz cell. Based on the sample spectra measured, the two BP-ANN-s were combined to obtain the stronger ability of the three-genotype classification. One of them was established to compress the measured NIRS variables by using the resilient back-propagation algorithm, and another network established by Levenberg-Marquardt algorithm according to the compressed NIRS-s was used as the discriminant model of the three-genotype classification. For the established model, the root mean square error for the training and the prediction sample sets were 0.0135 and 0.0132, respectively. Certainly, this model could rightly predict the three genotypes (i.e. the accuracy of prediction samples was up to100%) and had a good robust for the prediction of unknown samples. Since the three genotypes of SNP could be directly determined by using the NIRS-s without any preprocessing for the analyzed samples after PCR, this method is simple, rapid and low-cost.
Yu, Hualong; Hong, Shufang; Yang, Xibei; Ni, Jun; Dan, Yuanyuan; Qin, Bin
2013-01-01
DNA microarray technology can measure the activities of tens of thousands of genes simultaneously, which provides an efficient way to diagnose cancer at the molecular level. Although this strategy has attracted significant research attention, most studies neglect an important problem, namely, that most DNA microarray datasets are skewed, which causes traditional learning algorithms to produce inaccurate results. Some studies have considered this problem, yet they merely focus on binary-class problem. In this paper, we dealt with multiclass imbalanced classification problem, as encountered in cancer DNA microarray, by using ensemble learning. We utilized one-against-all coding strategy to transform multiclass to multiple binary classes, each of them carrying out feature subspace, which is an evolving version of random subspace that generates multiple diverse training subsets. Next, we introduced one of two different correction technologies, namely, decision threshold adjustment or random undersampling, into each training subset to alleviate the damage of class imbalance. Specifically, support vector machine was used as base classifier, and a novel voting rule called counter voting was presented for making a final decision. Experimental results on eight skewed multiclass cancer microarray datasets indicate that unlike many traditional classification approaches, our methods are insensitive to class imbalance.
Stephenson, Kathryn E.; Neubauer, George H.; Reimer, Ulf; ...
2014-11-14
An effective vaccine against human immunodeficiency virus type 1 (HIV-1) will have to provide protection against a vast array of different HIV-1 strains. Current methods to measure HIV-1-specific binding antibodies following immunization typically focus on determining the magnitude of antibody responses, but the epitope diversity of antibody responses has remained largely unexplored. Here we describe the development of a global HIV-1 peptide microarray that contains 6564 peptides from across the HIV-1 proteome and covers the majority of HIV-1 sequences in the Los Alamos National Laboratory global HIV-1 sequence database. Using this microarray, we quantified the magnitude, breadth, and depth ofmore » IgG binding to linear HIV-1 sequences in HIV-1-infected humans and HIV-1-vaccinated humans, rhesus monkeys and guinea pigs. The microarray measured potentially important differences in antibody epitope diversity, particularly regarding the depth of epitope variants recognized at each binding site. Our data suggest that the global HIV-1 peptide microarray may be a useful tool for both preclinical and clinical HIV-1 research.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Stephenson, Kathryn E.; Neubauer, George H.; Reimer, Ulf
An effective vaccine against human immunodeficiency virus type 1 (HIV-1) will have to provide protection against a vast array of different HIV-1 strains. Current methods to measure HIV-1-specific binding antibodies following immunization typically focus on determining the magnitude of antibody responses, but the epitope diversity of antibody responses has remained largely unexplored. Here we describe the development of a global HIV-1 peptide microarray that contains 6564 peptides from across the HIV-1 proteome and covers the majority of HIV-1 sequences in the Los Alamos National Laboratory global HIV-1 sequence database. Using this microarray, we quantified the magnitude, breadth, and depth ofmore » IgG binding to linear HIV-1 sequences in HIV-1-infected humans and HIV-1-vaccinated humans, rhesus monkeys and guinea pigs. The microarray measured potentially important differences in antibody epitope diversity, particularly regarding the depth of epitope variants recognized at each binding site. Our data suggest that the global HIV-1 peptide microarray may be a useful tool for both preclinical and clinical HIV-1 research.« less
See what you eat--broad GMO screening with microarrays.
von Götz, Franz
2010-03-01
Despite the controversy of whether genetically modified organisms (GMOs) are beneficial or harmful for humans, animals, and/or ecosystems, the number of cultivated GMOs is increasing every year. Many countries and federations have implemented safety and surveillance systems for GMOs. Potent testing technologies need to be developed and implemented to monitor the increasing number of GMOs. First, these GMO tests need to be comprehensive, i.e., should detect all, or at least the most important, GMOs on the market. This type of GMO screening requires a high degree of parallel tests or multiplexing. To date, DNA microarrays have the highest number of multiplexing capabilities when nucleic acids are analyzed. This trend article focuses on the evolution of DNA microarrays for GMO testing. Over the last 7 years, combinations of multiplex PCR detection and microarray detection have been developed to qualitatively assess the presence of GMOs. One example is the commercially available DualChip GMO (Eppendorf, Germany; http://www.eppendorf-biochip.com), which is the only GMO screening system successfully validated in a multicenter study. With use of innovative amplification techniques, promising steps have recently been taken to make GMO detection with microarrays quantitative.
Nutrigenetics: links between genetic background and response to Mediterranean-type diets.
Lairon, Denis; Defoort, Catherine; Martin, Jean-Charles; Amiot-Carlin, Marie-Jo; Gastaldi, Marguerite; Planells, Richard
2009-09-01
It has been substantiated that the onset of most major diseases (CVD, diabetes, obesity, cancers, etc.) is modulated by the interaction between genetic traits (susceptibility) and environmental factors, especially diet. We aim to report more specific observations relating the effects of Mediterranean-type diets on cardiovascular risk factors and the genetic background of subjects. In the first part, general concepts about nutrigenetics are briefly presented. Human genome has, overall, only marginally changed since its origin but it is thought that minor changes (polymorphisms) of common genes that occurred during evolution are now widespread in human populations, and can alter metabolic pathways and response to diets. In the second part, we report the data obtained during the Medi-RIVAGE intervention study performed in the South-East of France. Data obtained in 169 subjects at moderate cardiovascular risk after a 3-month dietary intervention indicate that some of the twenty-three single nucleotide polymorphisms (SNP) studied exhibit interactions with diets regarding changes of particular parameters after 3-month regimens. Detailed examples are presented, such as interactions between SNP in genes coding for microsomial transfer protein (MTTP) or intestinal fatty acid binding protein (FABP2) and triglyceride, LDL-cholesterol or Framigham score lowering in responses to Mediterranean-type diets. The data provided add further evidence of the interaction between particular SNP and metabolic responses to diets. Finally, improvement in dietary recommendations by taking into account known genetic variability has been discussed.
Wong, Michelle; Öhrmalm, Lars; Broliden, Kristina; Aust, Carl; Hibberd, Martin; Tolfvenstam, Thomas
2012-01-01
Mannose-binding Lectin protein (MBL) has been suggested to be relevant in the defence against infections in immunosuppressed individuals. In a Swedish adult cohort immunosuppressed from both the underlying disease and from iatrogenic treatments for their underlying disease we investigated the role of MBL in susceptibility to infection. In this cross sectional, prospective study, blood samples obtained from 96 neutropaenic febrile episodes, representing 82 individuals were analysed for single nucleotide polymorphism (SNP) in the MBL2 gene. Concurrent measurement of plasma MBL protein concentrations was also performed for observation of acute response during febrile episodes. No association was observed between MBL2 genotype or plasma MBL concentrations, and the type or frequency of infection. Adding to the literature, we found no evidence that viral infections or co-infections with virus and bacteria would be predisposed by MBL deficiency. We further saw no correlation between MBL2 genotype and the risk of fever. However, fever duration in febrile neutropaenic episodes was negatively associated with MBL2 SNP mutations (p<0.05). Patients with MBL2 SNP mutations presented a median febrile duration of 1.8 days compared with 3 days amongst patients with wildtype MBL2 genotype. We found no clear association between infection, or infection type to MBL2 genotypes or plasma MBL concentration, and add to the reports casting doubts on the benefit of recombinant MBL replacement therapy use during iatrogenic neutropaenia.
Wong, Michelle; Öhrmalm, Lars; Broliden, Kristina; Aust, Carl; Hibberd, Martin; Tolfvenstam, Thomas
2012-01-01
Background Mannose-binding Lectin protein (MBL) has been suggested to be relevant in the defence against infections in immunosuppressed individuals. In a Swedish adult cohort immunosuppressed from both the underlying disease and from iatrogenic treatments for their underlying disease we investigated the role of MBL in susceptibility to infection. Methods In this cross sectional, prospective study, blood samples obtained from 96 neutropaenic febrile episodes, representing 82 individuals were analysed for single nucleotide polymorphism (SNP) in the MBL2 gene. Concurrent measurement of plasma MBL protein concentrations was also performed for observation of acute response during febrile episodes. Findings No association was observed between MBL2 genotype or plasma MBL concentrations, and the type or frequency of infection. Adding to the literature, we found no evidence that viral infections or co-infections with virus and bacteria would be predisposed by MBL deficiency. We further saw no correlation between MBL2 genotype and the risk of fever. However, fever duration in febrile neutropaenic episodes was negatively associated with MBL2 SNP mutations (p<0.05). Patients with MBL2 SNP mutations presented a median febrile duration of 1.8 days compared with 3 days amongst patients with wildtype MBL2 genotype. Interpretation We found no clear association between infection, or infection type to MBL2 genotypes or plasma MBL concentration, and add to the reports casting doubts on the benefit of recombinant MBL replacement therapy use during iatrogenic neutropaenia. PMID:22363494
Homogeneous versus heterogeneous probes for microbial ecological microarrays.
Bae, Jin-Woo; Park, Yong-Ha
2006-07-01
Microbial ecological microarrays have been developed for investigating the composition and functions of microorganism communities in environmental niches. These arrays include microbial identification microarrays, which use oligonucleotides, gene fragments or microbial genomes as probes. In this article, the advantages and disadvantages of each type of probe are reviewed. Oligonucleotide probes are currently useful for probing uncultivated bacteria that are not amenable to gene fragment probing, whereas the functional gene fragments amplified randomly from microbial genomes require phylogenetic and hierarchical categorization before use as microbial identification probes, despite their high resolution for both specificity and sensitivity. Until more bacteria are sequenced and gene fragment probes are thoroughly validated, heterogeneous bacterial genome probes will provide a simple, sensitive and quantitative tool for exploring the ecosystem structure.
Wang, Wen; Li, Hao; Zhao, Zheng; Wang, Haoyuan; Zhang, Dong; Zhang, Yan; Lan, Qing; Wang, Jiangfei; Cao, Yong; Zhao, Jizong
2018-04-01
Abdominal aortic aneurysms (AAAs) and intracranial saccular aneurysms (IAs) are the most common types of aneurysms. This study was to investigate the common pathogenesis shared between these two kinds of aneurysms. We collected 12 IAs samples and 12 control arteries from the Beijing Tiantan Hospital and performed microarray analysis. In addition, we utilized the microarray datasets of IAs and AAAs from the Gene Expression Omnibus (GEO), in combination with our microarray results, to generate messenger RNA expression profiles for both AAAs and IAs in our study. Functional exploration and protein-protein interaction (PPI) analysis were performed. A total of 727 common genes were differentially expressed (404 was upregulated; 323 was downregulated) for both AAAs and IAs. The GO and pathway analyses showed that the common dysregulated genes were mainly enriched in vascular smooth muscle contraction, muscle contraction, immune response, defense response, cell activation, IL-6 signaling and chemokine signaling pathways, etc. The further protein-protein analysis identified 35 hub nodes, including TNF, IL6, MAPK13, and CCL5. These hub node genes were enriched in inflammatory response, positive regulation of IL-6 production, chemokine signaling pathway, and T/B cell receptor signaling pathway. Our study will gain new insight into the molecular mechanisms for the pathogenesis of both types of aneurysms and provide new therapeutic targets for the patients harboring AAAs and IAs.
Nie, Shuai; Benito-Peña, Elena; Zhang, Huaibin; Wu, Yue; Walt, David R
2013-10-10
Herein, we describe a protocol for simultaneously measuring six proteins in saliva using a fiber-optic microsphere-based antibody array. The immuno-array technology employed combines the advantages of microsphere-based suspension array fabrication with the use of fluorescence microscopy. As described in the video protocol, commercially available 4.5 μm polymer microspheres were encoded into seven different types, differentiated by the concentration of two fluorescent dyes physically trapped inside the microspheres. The encoded microspheres containing surface carboxyl groups were modified with monoclonal capture antibodies through EDC/NHS coupling chemistry. To assemble the protein microarray, the different types of encoded and functionalized microspheres were mixed and randomly deposited in 4.5 μm microwells, which were chemically etched at the proximal end of a fiber-optic bundle. The fiber-optic bundle was used as both a carrier and for imaging the microspheres. Once assembled, the microarray was used to capture proteins in the saliva supernatant collected from the clinic. The detection was based on a sandwich immunoassay using a mixture of biotinylated detection antibodies for different analytes with a streptavidin-conjugated fluorescent probe, R-phycoerythrin. The microarray was imaged by fluorescence microscopy in three different channels, two for microsphere registration and one for the assay signal. The fluorescence micrographs were then decoded and analyzed using a homemade algorithm in MATLAB.
Giotis, Efstathios S; Robey, Rebecca C; Skinner, Natalie G; Tomlinson, Christopher D; Goodbourn, Stephen; Skinner, Michael A
2016-08-05
Viruses that infect birds pose major threats-to the global supply of chicken, the major, universally-acceptable meat, and as zoonotic agents (e.g. avian influenza viruses H5N1 and H7N9). Controlling these viruses in birds as well as understanding their emergence into, and transmission amongst, humans will require considerable ingenuity and understanding of how different species defend themselves. The type I interferon-coordinated response constitutes the major antiviral innate defence. Although interferon was discovered in chicken cells, details of the response, particularly the identity of hundreds of stimulated genes, are far better described in mammals. Viruses induce interferon-stimulated genes but they also regulate the expression of many hundreds of cellular metabolic and structural genes to facilitate their replication. This study focusses on the potentially anti-viral genes by identifying those induced just by interferon in primary chick embryo fibroblasts. Three transcriptomic technologies were exploited: RNA-seq, a classical 3'-biased chicken microarray and a high density, "sense target", whole transcriptome chicken microarray, with each recognising 120-150 regulated genes (curated for duplication and incorrect assignment of some microarray probesets). Overall, the results are considered robust because 128 of the compiled, curated list of 193 regulated genes were detected by two, or more, of the technologies.
Yamamura, Shohei; Yamada, Eriko; Kimura, Fukiko; Miyajima, Kumiko; Shigeto, Hajime
2017-10-21
A new single-cell microarray chip was designed and developed to separate and analyze single adherent and non-adherent cancer cells. The single-cell microarray chip is made of polystyrene with over 60,000 microchambers of 10 different size patterns (31-40 µm upper diameter, 11-20 µm lower diameter). A drop of suspension of adherent carcinoma (NCI-H1650) and non-adherent leukocyte (CCRF-CEM) cells was placed onto the chip, and single-cell occupancy of NCI-H1650 and CCRF-CEM was determined to be 79% and 84%, respectively. This was achieved by controlling the chip design and surface treatment. Analysis of protein expression in single NCI-H1650 and CCRF-CEM cells was performed on the single-cell microarray chip by multi-antibody staining. Additionally, with this system, we retrieved positive single cells from the microchambers by a micromanipulator. Thus, this system demonstrates the potential for easy and accurate separation and analysis of various types of single cells.
Naj, Adam C.; West, Michael; Rich, Stephen S.; Post, Wendy; Kao, W.H. Linda; Wasserman, Bruce A.; Herrington, David M.; Rodriguez, Annabelle
2012-01-01
Background Little is known regarding the association of scavenger receptor class B type I (SCARB1) single nucleotide polymorphisms (SNPs) and subclinical atherosclerosis (SCA), particularly in subjects of different racial/ethnic backgrounds. We examined this relationship in the Multi-Ethnic Study of Atherosclerosis (MESA). Methods and Results Forty-three SCARB1 tagging SNPs were genotyped. Baseline examinations included fasting lipids and SCA phenotypes (coronary artery calcium [CAC], and common and internal carotid artery thickness [CCIMT and ICIMT]). Examining SNP associations with different SCA phenotypes across multiple racial/ethnic groups with adjustment for multiple covariates, we found the C allele of SNP rs10846744 was associated with higher CCIMT in African American (P=0.03), Chinese (P=0.02), European American (P=0.05), and Hispanic participants (P=0.03), and was strongly associated in pooled analyses (P=0.0002). The results also showed that the association of this SNP with CCIMT was independent of lipids and other well-established cardiovascular risk factors. Stratifying by sex, there appeared to be a strong association of rs10846744 with CCIMT in females, but no genotype-sex interactions were observed. Conclusions Variation in SCARB1 at rs10846744 was significantly associated with CCIMT across racial/ethnic groups in MESA. PMID:20160195
PAX3 gene deletion detected by microarray analysis in a girl with hearing loss.
Drozniewska, Malgorzata; Haus, Olga
2014-01-01
Deletions of the PAX3 gene have been rarely reported in the literature. Mutations of this gene are a common cause of Waardenburg syndrome type 1 and 3. We report a 16 year old female presenting hearing loss and normal intellectual development, without major features of Waardenburg syndrome type 1, and without family history of the syndrome. Her phenotype, however, overlaps with features of craniofacial-deafness-hand syndrome. Microarray analysis showed ~862 kb de novo deletion at 2q36.1 including PAX3. The above findings suggest that the rearrangement found in our patient appeared de novo and with high probability is a cause of her phenotype.
Laser capture microdissection of embryonic cells and preparation of RNA for microarray assays.
Redmond, Latasha C; Pang, Christopher J; Dumur, Catherine; Haar, Jack L; Lloyd, Joyce A
2014-01-01
In order to compare the global gene expression profiles of different embryonic cell types, it is first necessary to isolate the specific cells of interest. The purpose of this chapter is to provide a step-by-step protocol to perform laser capture microdissection (LCM) on embryo samples and obtain sufficient amounts of high-quality RNA for microarray hybridizations. Using the LCM/microarray strategy on mouse embryo samples has some challenges, because the cells of interest are available in limited quantities. The first step in the protocol is to obtain embryonic tissue, and immediately cryoprotect and freeze it in a cryomold containing Optimal Cutting Temperature freezing media (Sakura Finetek), using a dry ice-isopentane bath. The tissue is then cryosectioned, and the microscope slides are processed to fix, stain, and dehydrate the cells. LCM is employed to isolate specific cell types from the slides, identified under the microscope by virtue of their morphology. Detailed protocols are provided for using the currently available ArcturusXT LCM instrument and CapSure(®) LCM Caps, to which the selected cells adhere upon laser capture. To maintain RNA integrity, upon removing a slide from the final processing step, or attaching the first cells on the LCM cap, LCM is completed within 20 min. The cells are then immediately recovered from the LCM cap using a denaturing solution that stabilizes RNA integrity. RNA is prepared using standard methods, modified for working with small samples. To ensure the validity of the microarray data, the quality of the RNA is assessed using the Agilent bioanalyzer. Only RNA that is of sufficient integrity and quantity is used to perform microarray assays. This chapter provides guidance regarding troubleshooting and optimization to obtain high-quality RNA from cells of limited availability, obtained from embryo samples by LCM.
Laser Capture Microdissection of Embryonic Cells and Preparation of RNA for Microarray Assays
Redmond, Latasha C.; Pang, Christopher J.; Dumur, Catherine; Haar, Jack L.; Lloyd, Joyce A.
2014-01-01
In order to compare the global gene expression profiles of different embryonic cell types, it is first necessary to isolate the specific cells of interest. The purpose of this chapter is to provide a step-by-step protocol to perform laser capture microdissection (LCM) on embryo samples and obtain sufficient amounts of high-quality RNA for microarray hybridizations. Using the LCM/microarray strategy on mouse embryo samples has some challenges, because the cells of interest are available in limited quantities. The first step in the protocol is to obtain embryonic tissue, and immediately cryoprotect and freeze it in a cryomold containing Optimal Cutting Temperature freezing media (Sakura Finetek), using a dry ice–isopentane bath. The tissue is then cryosectioned, and the microscope slides are processed to fix, stain, and dehydrate the cells. LCM is employed to isolate specific cell types from the slides, identified under the microscope by virtue of their morphology. Detailed protocols are provided for using the currently available ArcturusXT LCM instrument and CapSure® LCM Caps, to which the selected cells adhere upon laser capture. To maintain RNA integrity, upon removing a slide from the final processing step, or attaching the first cells on the LCM cap, LCM is completed within 20 min. The cells are then immediately recovered from the LCM cap using a denaturing solution that stabilizes RNA integrity. RNA is prepared using standard methods, modified for working with small samples. To ensure the validity of the microarray data, the quality of the RNA is assessed using the Agilent bioanalyzer. Only RNA that is of sufficient integrity and quantity is used to perform microarray assays. This chapter provides guidance regarding troubleshooting and optimization to obtain high-quality RNA from cells of limited availability, obtained from embryo samples by LCM. PMID:24318813
Hsu, Arthur L; Tang, Sen-Lin; Halgamuge, Saman K
2003-11-01
Current Self-Organizing Maps (SOMs) approaches to gene expression pattern clustering require the user to predefine the number of clusters likely to be expected. Hierarchical clustering methods used in this area do not provide unique partitioning of data. We describe an unsupervised dynamic hierarchical self-organizing approach, which suggests an appropriate number of clusters, to perform class discovery and marker gene identification in microarray data. In the process of class discovery, the proposed algorithm identifies corresponding sets of predictor genes that best distinguish one class from other classes. The approach integrates merits of hierarchical clustering with robustness against noise known from self-organizing approaches. The proposed algorithm applied to DNA microarray data sets of two types of cancers has demonstrated its ability to produce the most suitable number of clusters. Further, the corresponding marker genes identified through the unsupervised algorithm also have a strong biological relationship to the specific cancer class. The algorithm tested on leukemia microarray data, which contains three leukemia types, was able to determine three major and one minor cluster. Prediction models built for the four clusters indicate that the prediction strength for the smaller cluster is generally low, therefore labelled as uncertain cluster. Further analysis shows that the uncertain cluster can be subdivided further, and the subdivisions are related to two of the original clusters. Another test performed using colon cancer microarray data has automatically derived two clusters, which is consistent with the number of classes in data (cancerous and normal). JAVA software of dynamic SOM tree algorithm is available upon request for academic use. A comparison of rectangular and hexagonal topologies for GSOM is available from http://www.mame.mu.oz.au/mechatronics/journalinfo/Hsu2003supp.pdf
Glycome Diagnosis of Human Induced Pluripotent Stem Cells Using Lectin Microarray*
Tateno, Hiroaki; Toyota, Masashi; Saito, Shigeru; Onuma, Yasuko; Ito, Yuzuru; Hiemori, Keiko; Fukumura, Mihoko; Matsushima, Asako; Nakanishi, Mio; Ohnuma, Kiyoshi; Akutsu, Hidenori; Umezawa, Akihiro; Horimoto, Katsuhisa; Hirabayashi, Jun; Asashima, Makoto
2011-01-01
Induced pluripotent stem cells (iPSCs) can now be produced from various somatic cell (SC) lines by ectopic expression of the four transcription factors. Although the procedure has been demonstrated to induce global change in gene and microRNA expressions and even epigenetic modification, it remains largely unknown how this transcription factor-induced reprogramming affects the total glycan repertoire expressed on the cells. Here we performed a comprehensive glycan analysis using 114 types of human iPSCs generated from five different SCs and compared their glycomes with those of human embryonic stem cells (ESCs; nine cell types) using a high density lectin microarray. In unsupervised cluster analysis of the results obtained by lectin microarray, both undifferentiated iPSCs and ESCs were clustered as one large group. However, they were clearly separated from the group of differentiated SCs, whereas all of the four SCs had apparently distinct glycome profiles from one another, demonstrating that SCs with originally distinct glycan profiles have acquired those similar to ESCs upon induction of pluripotency. Thirty-eight lectins discriminating between SCs and iPSCs/ESCs were statistically selected, and characteristic features of the pluripotent state were then obtained at the level of the cellular glycome. The expression profiles of relevant glycosyltransferase genes agreed well with the results obtained by lectin microarray. Among the 38 lectins, rBC2LCN was found to detect only undifferentiated iPSCs/ESCs and not differentiated SCs. Hence, the high density lectin microarray has proved to be valid for not only comprehensive analysis of glycans but also diagnosis of stem cells under the concept of the cellular glycome. PMID:21471226
Hsiao, Tun-Jen; Hwang, Yuchi; Chang, Hua-Mei; Lin, Eugene
2014-01-01
One particularly interesting single nucleotide polymorphism (SNP), rs6235 (encoding an S690T substitution), in the proprotein convertase subtilisin/kexin type 1 (PCSK1) gene has been widely associated with obesity in several European cohorts. The present study was intended to investigate the association between the PCSK1 rs6235 SNP and the prevalence of overweight or obesity, or obesity-related metabolic traits in a Taiwanese population. A total of 964 Taiwanese subjects with general health examinations were analyzed. Our data revealed no association of PCSK1 rs6235 with the risk of obesity or overweight in the complete subjects. However, the PCSK1 rs6235 SNP exhibited a significant association with overweight among the male subjects (P=0.03), but not among the female subjects. Furthermore, the carriers of GG variant had a significantly higher waist circumference than those with the CC variant (82.5 ± 11.5 vs. 81.2 ± 10.2 cm; P=0.01) and those with the CG variant (82.5 ± 11.5 vs. 81.4 ± 10.4 cm; P=0.021). In addition, the carriers of GG variant had a higher diastolic blood pressure than those with the CC variant (81.9 ± 14.2 vs. 80.3 ± 12.9 mm Hg; P=0.023). Our study indicates that the PCSK1 rs6235 SNP may contribute to the risk of overweight in men and predict obesity-related metabolic traits such as waist circumference and diastolic blood pressure in Taiwanese subjects. © 2013.
Leak, Tennille S.; Mychaleckyj, Josyf C.; Smith, Shelly G.; Keene, Keith L.; Gordon, Candace J.; Hicks, Pamela J.; Freedman, Barry I.; Bowden, Donald W.; Sale, Michèle M.
2009-01-01
Previously we performed a genome scan for type 2 diabetes (T2DM) using 638 African-American (AA) affected sibling pairs from 247 families; non-parametric linkage analysis suggested evidence of linkage at 6q24-27 (LOD 2.26). To comprehensively evaluate this region we performed a 2-stage association study by first constructing a SNP map of 754 SNPs selected from HapMap on the basis of linkage disequilibrium (LD) in 300 AAT2DM-ESRD subjects, 311 AA controls, 43 European American controls and 45 Yoruba Nigerian samples (Set 1). Replication analyses were conducted in an independent population of 283 AA T2DM-ESRD subjects and 282 AA controls (Set 2). In addition, we adjusted for the impact of admixture on association results by using ancestry informative markers (AIMs). In Stage 1, 137 (18.2%) SNPs showed nominal evidence of association (P<0.05) in one or more of tests of association: allelic (n=33), dominant (n=36), additive (n=29), or recessive (n=34) genotypic models, and 2- (n=47) and 3-SNP (n=43) haplotypic analyses. These SNPs were selected for follow-up genotyping. Stage 2 analyses confirmed association with a predicted 2-SNP “risk” haplotype in the PARK2 gene. Also, two intergenic SNPs showed consistent genotypic association with T2DM-ESRD: rs12197043 and rs4897081. Combined analysis of all subjects from both stages revealed nominal associations with 17 SNPs within genes; including suggestive associations in ESR1 and PARK2. This study confirms known diabetic nephropathy loci and identifies potentially novel susceptibility variants located within 6q24-27 in AA. PMID:18560894
2015-01-01
Background Obesity affects quality of life and life expectancy and is associated with cardiovascular disorders, cancer, diabetes, reproductive disorders in women, prostate diseases in men, and congenital anomalies in children. The use of single nucleotide polymorphism (SNP) markers of diseases and drug responses (i.e., significant differences of personal genomes of patients from the reference human genome) can help physicians to improve treatment. Clinical research can validate SNP markers via genotyping of patients and demonstration that SNP alleles are significantly more frequent in patients than in healthy people. The search for biomedical SNP markers of interest can be accelerated by computer-based analysis of hundreds of millions of SNPs in the 1000 Genomes project because of selection of the most meaningful candidate SNP markers and elimination of neutral SNPs. Results We cross-validated the output of two computer-based methods: DNA sequence analysis using Web service SNP_TATA_Comparator and keyword search for articles on comorbidities of obesity. Near the sites binding to TATA-binding protein (TBP) in human gene promoters, we found 22 obesity-related candidate SNP markers, including rs10895068 (male breast cancer in obesity); rs35036378 (reduced risk of obesity after ovariectomy); rs201739205 (reduced risk of obesity-related cancers due to weight loss by diet/exercise in obese postmenopausal women); rs183433761 (obesity resistance during a high-fat diet); rs367732974 and rs549591993 (both: cardiovascular complications in obese patients with type 2 diabetes mellitus); rs200487063 and rs34104384 (both: obesity-caused hypertension); rs35518301, rs72661131, and rs562962093 (all: obesity); and rs397509430, rs33980857, rs34598529, rs33931746, rs33981098, rs34500389, rs63750953, rs281864525, rs35518301, and rs34166473 (all: chronic inflammation in comorbidities of obesity). Using an electrophoretic mobility shift assay under nonequilibrium conditions, we empirically validated the statistical significance (α < 0.00025) of the differences in TBP affinity values between the minor and ancestral alleles of 4 out of the 22 SNPs: rs200487063, rs201381696, rs34104384, and rs183433761. We also measured half-life (t1/2), Gibbs free energy change (ΔG), and the association and dissociation rate constants, ka and kd, of the TBP-DNA complex for these SNPs. Conclusions Validation of the 22 candidate SNP markers by proper clinical protocols appears to have a strong rationale and may advance postgenomic predictive preventive personalized medicine. PMID:26694100
Analysis of Altered Micro RNA Expression Profiles in Focal Cortical Dysplasia IIB.
Li, Lin; Liu, Chang-Qing; Li, Tian-Fu; Guan, Yu-Guang; Zhou, Jian; Qi, Xue-Ling; Yang, Yu-Tao; Deng, Jia-Hui; Xu, Zhi-Qing David; Luan, Guo-Ming
2016-04-01
Focal cortical dysplasia type IIB is a commonly encountered subtype of developmental malformation of the cerebral cortex and is often associated with pharmacoresistant epilepsy. In this study, to investigate the molecular etiology of focal cortical dysplasia type IIB, the authors performed micro ribonucleic acid (RNA) microarray on surgical specimens from 5 children (2 female and 3 male, mean age was 73.4 months, range 50-112 months) diagnosed of focal cortical dysplasia type IIB and matched normal tissue adjacent to the lesion. In all, 24 micro RNAs were differentially expressed in focal cortical dysplasia type IIB, and the microarray results were validated using quantitative real-time polymerase chain reaction (PCR). Then the putative target genes of the differentially expressed micro RNAs were identified by bioinformatics analysis. Moreover, biological significance of the target genes was evaluated by investigating the pathways in which the genes were enriched, and the Hippo signaling pathway was proposed to be highly related with the pathogenesis of focal cortical dysplasia type IIB. © The Author(s) 2015.
Linear reduction methods for tag SNP selection.
He, Jingwu; Zelikovsky, Alex
2004-01-01
It is widely hoped that constructing a complete human haplotype map will help to associate complex diseases with certain SNP's. Unfortunately, the number of SNP's is huge and it is very costly to sequence many individuals. Therefore, it is desirable to reduce the number of SNP's that should be sequenced to considerably small number of informative representatives, so called tag SNP's. In this paper, we propose a new linear algebra based method for selecting and using tag SNP's. Our method is purely combinatorial and can be combined with linkage disequilibrium (LD) and block based methods. We measure the quality of our tag SNP selection algorithm by comparing actual SNP's with SNP's linearly predicted from linearly chosen tag SNP's. We obtain an extremely good compression and prediction rates. For example, for long haplotypes (>25000 SNP's), knowing only 0.4% of all SNP's we predict the entire unknown haplotype with 2% accuracy while the prediction method is based on a 10% sample of the population.
Toward optimal set of single nucleotide polymorphism investigation before IVF.
Ivanov, A V; Dedul, A G; Fedotov, Y N; Komlichenko, E V
2016-10-01
At present, the patient preparation for IVF needs to undergo a series of planned tests, including the genotyping of single nucleotide polymorphism (SNP) alleles of some genes. In former USSR countries, such investigation was not included in overwhelming majority of health insurance programs and paid by patient. In common, there are prerequisites to the study of more than 50 polymorphisms. An important faced task is to determine the optimal panel for SNP genotyping in terms of price/number of SNP. During 2009-2015 in the University Hospital of St. Petersburg State University, blood samples were analyzed from 550 women with different reproductive system disorders preparing for IVF and 46 healthy women in control group. In total, 28 SNP were analyzed in the genes of thrombophilia factors, folic acid cycle, detoxification system, and the renin-angiotensin system. The method used was real-time PCR. A significant increase in the frequency of pathological alleles of some polymorphisms in patients with habitual failure of IVF was shown, compared with the control group. As a result, two options defined panels for optimal typing SNP before IVF were composed. Standard panel includes 8 SNP, 5 in thromborhilic factors, and 3 in folic acid cycle genes. They are 20210 G > A of FII gene, R506Q G > A of FV gene (mutation Leiden), -675 5G > 4G of PAI-I gene, L33P T > C of ITGB3 gene, -455 G > A of FGB gene, 667 C > T of MTHFR gene, 2756 A > G of MTR gene, and 66 A > G of MTRR gene. Extended panel of 15 SNP also includes 807 C > T of ITGA2 gene, T154M C > T of GP1BA gene, second polymorphism 1298 A > C in MTHFR gene, polymorphisms of the renin-angiotensin gene AGT M235T T > C and -1166 A > C of AGTR1 gene, polymorphisms I105V A > G and A114V C > T of detoxification system gene GSTP. The results of SNP genotyping can be adjusted for treatment tactics and IVF, and also medical support getting pregnant. The success rate of IVF is increased as the result, especially in the group with the usual failure of IVF.
Shi, Xiang Yang; Dumenyo, C Korsi; Hernandez-Martinez, Rufina; Azad, Hamid; Cooksey, Donald A
2007-11-01
Many virulence genes in plant bacterial pathogens are coordinately regulated by "global" regulatory genes. Conducting DNA microarray analysis of bacterial mutants of such genes, compared with the wild type, can help to refine the list of genes that may contribute to virulence in bacterial pathogens. The regulatory gene algU, with roles in stress response and regulation of the biosynthesis of the exopolysaccharide alginate in Pseudomonas aeruginosa and many other bacteria, has been extensively studied. The role of algU in Xylella fastidiosa, the cause of Pierce's disease of grapevines, was analyzed by mutation and whole-genome microarray analysis to define its involvement in aggregation, biofilm formation, and virulence. In this study, an algU::nptII mutant had reduced cell-cell aggregation, attachment, and biofilm formation and lower virulence in grapevines. Microarray analysis showed that 42 genes had significantly lower expression in the algU::nptII mutant than in the wild type. Among these are several genes that could contribute to cell aggregation and biofilm formation, as well as other physiological processes such as virulence, competition, and survival.
Finding gene clusters for a replicated time course study
2014-01-01
Background Finding genes that share similar expression patterns across samples is an important question that is frequently asked in high-throughput microarray studies. Traditional clustering algorithms such as K-means clustering and hierarchical clustering base gene clustering directly on the observed measurements and do not take into account the specific experimental design under which the microarray data were collected. A new model-based clustering method, the clustering of regression models method, takes into account the specific design of the microarray study and bases the clustering on how genes are related to sample covariates. It can find useful gene clusters for studies from complicated study designs such as replicated time course studies. Findings In this paper, we applied the clustering of regression models method to data from a time course study of yeast on two genotypes, wild type and YOX1 mutant, each with two technical replicates, and compared the clustering results with K-means clustering. We identified gene clusters that have similar expression patterns in wild type yeast, two of which were missed by K-means clustering. We further identified gene clusters whose expression patterns were changed in YOX1 mutant yeast compared to wild type yeast. Conclusions The clustering of regression models method can be a valuable tool for identifying genes that are coordinately transcribed by a common mechanism. PMID:24460656
Core Genome Multilocus Sequence Typing Scheme for High-Resolution Typing of Enterococcus faecium
de Been, Mark; Pinholt, Mette; Top, Janetta; Bletz, Stefan; van Schaik, Willem; Brouwer, Ellen; Rogers, Malbert; Kraat, Yvette; Bonten, Marc; Corander, Jukka; Westh, Henrik; Harmsen, Dag
2015-01-01
Enterococcus faecium, a common inhabitant of the human gut, has emerged in the last 2 decades as an important multidrug-resistant nosocomial pathogen. Since the start of the 21st century, multilocus sequence typing (MLST) has been used to study the molecular epidemiology of E. faecium. However, due to the use of a small number of genes, the resolution of MLST is limited. Whole-genome sequencing (WGS) now allows for high-resolution tracing of outbreaks, but current WGS-based approaches lack standardization, rendering them less suitable for interlaboratory prospective surveillance. To overcome this limitation, we developed a core genome MLST (cgMLST) scheme for E. faecium. cgMLST transfers genome-wide single nucleotide polymorphism (SNP) diversity into a standardized and portable allele numbering system that is far less computationally intensive than SNP-based analysis of WGS data. The E. faecium cgMLST scheme was built using 40 genome sequences that represented the diversity of the species. The scheme consists of 1,423 cgMLST target genes. To test the performance of the scheme, we performed WGS analysis of 103 outbreak isolates from five different hospitals in the Netherlands, Denmark, and Germany. The cgMLST scheme performed well in distinguishing between epidemiologically related and unrelated isolates, even between those that had the same sequence type (ST), which denotes the higher discriminatory power of this cgMLST scheme over that of conventional MLST. We also show that in terms of resolution, the performance of the E. faecium cgMLST scheme is equivalent to that of an SNP-based approach. In conclusion, the cgMLST scheme developed in this study facilitates rapid, standardized, and high-resolution tracing of E. faecium outbreaks. PMID:26400782
Pollex, Erika K; Anger, Gregory; Hutson, Janine; Koren, Gideon; Piquette-Miller, Micheline
2010-05-01
The antidiabetic agent glyburide (glibenclamide) is frequently used for the treatment of type II diabetes and is increasingly being used for the treatment of gestational diabetes. Evidence suggests that breast cancer resistance protein/ATP-binding cassette, subfamily G, member 2 (ABCG2) expressed in the placenta protects the fetus against the accumulation of glyburide. A number of studies have investigated the significance of several single-nucleotide polymorphisms (SNPs) in the ABCG2 gene. Associations between the Q141K (C421A) SNP and ABCG2 protein expression, membrane surface translocation, efflux activity, or ATPase activity have been shown. Therefore, alterations in glyburide transport across the placenta, resulting in increased fetal glyburide exposure, may be seen in individuals carrying the C421A allele. The purpose of this study is to investigate whether the Q141K SNP causes alterations in ABCG2-mediated glyburide transport. Glyburide accumulation assays were carried out with stably transfected human embryonic kidney (HEK)-293 cells expressing wild-type ABCG2 (Arg482) and polymorphic ABCG2 (Q141K). Glyburide kinetic parameters were determined for comparison of wild-type and SNP ABCG2 activity by simultaneously fitting data for ABCG2-expressing cells (saturable transport) and empty vector-expressing cells (nonsaturable transport) by nonlinear regression analysis. The apparent K(t) and V(max) values for the transfected HEK-293 cells expressing the polymorphic variant (Q141K) of ABCG2 were significantly higher than those values determined for the wild-type ABCG2-expressing cells (p < 0.05). Our results indicate that the Q141K variant of ABCG2 may have the potential to alter the placental pharmacokinetics of glyburide used in pregnancy.
Core Genome Multilocus Sequence Typing Scheme for High- Resolution Typing of Enterococcus faecium.
de Been, Mark; Pinholt, Mette; Top, Janetta; Bletz, Stefan; Mellmann, Alexander; van Schaik, Willem; Brouwer, Ellen; Rogers, Malbert; Kraat, Yvette; Bonten, Marc; Corander, Jukka; Westh, Henrik; Harmsen, Dag; Willems, Rob J L
2015-12-01
Enterococcus faecium, a common inhabitant of the human gut, has emerged in the last 2 decades as an important multidrug-resistant nosocomial pathogen. Since the start of the 21st century, multilocus sequence typing (MLST) has been used to study the molecular epidemiology of E. faecium. However, due to the use of a small number of genes, the resolution of MLST is limited. Whole-genome sequencing (WGS) now allows for high-resolution tracing of outbreaks, but current WGS-based approaches lack standardization, rendering them less suitable for interlaboratory prospective surveillance. To overcome this limitation, we developed a core genome MLST (cgMLST) scheme for E. faecium. cgMLST transfers genome-wide single nucleotide polymorphism(SNP) diversity into a standardized and portable allele numbering system that is far less computationally intensive than SNP-based analysis of WGS data. The E. faecium cgMLST scheme was built using 40 genome sequences that represented the diversity of the species. The scheme consists of 1,423 cgMLST target genes. To test the performance of the scheme, we performed WGS analysis of 103 outbreak isolates from five different hospitals in the Netherlands, Denmark, and Germany. The cgMLST scheme performed well in distinguishing between epidemiologically related and unrelated isolates, even between those that had the same sequence type (ST), which denotes the higher discriminatory power of this cgMLST scheme over that of conventional MLST. We also show that in terms of resolution, the performance of the E. faecium cgMLST scheme is equivalent to that of an SNP-based approach. In conclusion, the cgMLST scheme developed in this study facilitates rapid, standardized, and high-resolution tracing of E. faecium outbreaks.
Biological relevance of CNV calling methods using familial relatedness including monozygotic twins.
Castellani, Christina A; Melka, Melkaye G; Wishart, Andrea E; Locke, M Elizabeth O; Awamleh, Zain; O'Reilly, Richard L; Singh, Shiva M
2014-04-21
Studies involving the analysis of structural variation including Copy Number Variation (CNV) have recently exploded in the literature. Furthermore, CNVs have been associated with a number of complex diseases and neurodevelopmental disorders. Common methods for CNV detection use SNP, CNV, or CGH arrays, where the signal intensities of consecutive probes are used to define the number of copies associated with a given genomic region. These practices pose a number of challenges that interfere with the ability of available methods to accurately call CNVs. It has, therefore, become necessary to develop experimental protocols to test the reliability of CNV calling methods from microarray data so that researchers can properly discriminate biologically relevant data from noise. We have developed a workflow for the integration of data from multiple CNV calling algorithms using the same array results. It uses four CNV calling programs: PennCNV (PC), Affymetrix® Genotyping Console™ (AGC), Partek® Genomics Suite™ (PGS) and Golden Helix SVS™ (GH) to analyze CEL files from the Affymetrix® Human SNP 6.0 Array™. To assess the relative suitability of each program, we used individuals of known genetic relationships. We found significant differences in CNV calls obtained by different CNV calling programs. Although the programs showed variable patterns of CNVs in the same individuals, their distribution in individuals of different degrees of genetic relatedness has allowed us to offer two suggestions. The first involves the use of multiple algorithms for the detection of the largest possible number of CNVs, and the second suggests the use of PennCNV over all other methods when the use of only one software program is desirable.
The Genomic Architecture of Sporadic Heart Failure
Dorn, Gerald W
2011-01-01
Common or sporadic systolic heart failure (heart failure) is the clinical syndrome of insufficient forward cardiac output resulting from myocardial disease. Most heart failure is the consequence of ischemic or idiopathic cardiomyopathy. There is a clear familial predisposition to heart failure, with a genetic component estimated to confer between 20 and 30% of overall risk. The multifactorial etiology of this syndrome has complicated identification of its genetic underpinnings. Until recently, almost all genetic studies of heart failure were designed and deployed according to the common disease-common variant hypothesis, in which individual risk alleles impart a small positive or negative effect and overall genetic risk is the cumulative impact of all functional genetic variations. Early studies employed a candidate gene approach, focused mainly on factors within adrenergic and renin-angiotensin pathways that affect heart failure progression and are targeted by standard pharmacotherapeutics. Many of these reported allelic associations with heart failure have not been replicated. However, the preponderance of data support risk-modifier effects for the Arg389Gly polymorphism of β1-adrenergic receptors and the intron 16 in/del polymorphism of angiotensin converting enzyme. Recent unbiased studies using genome-wide single nucleotide polymorphism (SNP) microarrays have shown fewer positive results than when these platforms were applied to hypertension, myocardial infarction, or diabetes, possibly reflecting the complex etiology of heart failure. A new cardiovascular gene-centric sub-genome SNP array identified a common heat failure risk allele at 1p36 in multiple independent cohorts, but the biological mechanism for this association is still uncertain. It is likely that common gene polymorphisms account for only a fraction of individual genetic heart failure risk, and future studies using deep resequencing are likely to identify rare gene variants with larger biological effects. PMID:21566223
Using Next Generation Sequencing for Multiplexed Trait-Linked Markers in Wheat
Bernardo, Amy; Wang, Shan; St. Amand, Paul; Bai, Guihua
2015-01-01
With the advent of next generation sequencing (NGS) technologies, single nucleotide polymorphisms (SNPs) have become the major type of marker for genotyping in many crops. However, the availability of SNP markers for important traits of bread wheat ( Triticum aestivum L.) that can be effectively used in marker-assisted selection (MAS) is still limited and SNP assays for MAS are usually uniplex. A shift from uniplex to multiplex assays will allow the simultaneous analysis of multiple markers and increase MAS efficiency. We designed 33 locus-specific markers from SNP or indel-based marker sequences that linked to 20 different quantitative trait loci (QTL) or genes of agronomic importance in wheat and analyzed the amplicon sequences using an Ion Torrent Proton Sequencer and a custom allele detection pipeline to determine the genotypes of 24 selected germplasm accessions. Among the 33 markers, 27 were successfully multiplexed and 23 had 100% SNP call rates. Results from analysis of "kompetitive allele-specific PCR" (KASP) and sequence tagged site (STS) markers developed from the same loci fully verified the genotype calls of 23 markers. The NGS-based multiplexed assay developed in this study is suitable for rapid and high-throughput screening of SNPs and some indel-based markers in wheat. PMID:26625271
Zhang, Han; Wheeler, William; Song, Lei; Yu, Kai
2017-07-07
As meta-analysis results published by consortia of genome-wide association studies (GWASs) become increasingly available, many association summary statistics-based multi-locus tests have been developed to jointly evaluate multiple single-nucleotide polymorphisms (SNPs) to reveal novel genetic architectures of various complex traits. The validity of these approaches relies on the accurate estimate of z-score correlations at considered SNPs, which in turn requires knowledge on the set of SNPs assessed by each study participating in the meta-analysis. However, this exact SNP coverage information is usually unavailable from the meta-analysis results published by GWAS consortia. In the absence of the coverage information, researchers typically estimate the z-score correlations by making oversimplified coverage assumptions. We show through real studies that such a practice can generate highly inflated type I errors, and we demonstrate the proper way to incorporate correct coverage information into multi-locus analyses. We advocate that consortia should make SNP coverage information available when posting their meta-analysis results, and that investigators who develop analytic tools for joint analyses based on summary data should pay attention to the variation in SNP coverage and adjust for it appropriately. Published by Oxford University Press 2017. This work is written by US Government employees and is in the public domain in the US.
Coding SNP in tenascin-C Fn-III-D domain associates with adult asthma.
Matsuda, Akira; Hirota, Tomomitsu; Akahoshi, Mitsuteru; Shimizu, Makiko; Tamari, Mayumi; Miyatake, Akihiko; Takahashi, Atsushi; Nakashima, Kazuko; Takahashi, Naomi; Obara, Kazuhiko; Yuyama, Noriko; Doi, Satoru; Kamogawa, Yumiko; Enomoto, Tadao; Ohshima, Koichi; Tsunoda, Tatsuhiko; Miyatake, Shoichiro; Fujita, Kimie; Kusakabe, Moriaki; Izuhara, Kenji; Nakamura, Yusuke; Hopkin, Julian; Shirakawa, Taro
2005-10-01
The extracellular matrix glycoprotein tenascin-C (TNC) has been accepted as a valuable histopathological subepithelial marker for evaluating the severity of asthmatic disease and the therapeutic response to drugs. We found an association between an adult asthma and an SNP encoding TNC fibronectin type III-D (Fn-III-D) domain in a case-control study between a Japanese population including 446 adult asthmatic patients and 658 normal healthy controls. The SNP (44513A/T in exon 17) strongly associates with adult bronchial asthma (chi2 test, P=0.00019, Odds ratio=1.76, 95% confidence interval=1.31-2.36). This coding SNP induces an amino acid substitution (Leu1677Ile) within the Fn-III-D domain of the alternative splicing region. Computer-assisted protein structure modeling suggests that the substituted amino acid locates at the outer edge of the beta-sheet in Fn-III-D domain and causes instability of this beta-sheet. As the TNC fibronectin-III domain has molecular elasticity, the structural change may affect the integrity and stiffness of asthmatic airways. In addition, TNC expression in lung fibroblasts increases with Th2 immune cytokine stimulation. Thus, Leu1677Ile may be valuable marker for evaluating the risk for developing asthma and plays a role in its pathogenesis.
Dereeper, Alexis; Nicolas, Stéphane; Le Cunff, Loïc; Bacilieri, Roberto; Doligez, Agnès; Peros, Jean-Pierre; Ruiz, Manuel; This, Patrice
2011-05-05
High-throughput re-sequencing, new genotyping technologies and the availability of reference genomes allow the extensive characterization of Single Nucleotide Polymorphisms (SNPs) and insertion/deletion events (indels) in many plant species. The rapidly increasing amount of re-sequencing and genotyping data generated by large-scale genetic diversity projects requires the development of integrated bioinformatics tools able to efficiently manage, analyze, and combine these genetic data with genome structure and external data. In this context, we developed SNiPlay, a flexible, user-friendly and integrative web-based tool dedicated to polymorphism discovery and analysis. It integrates:1) a pipeline, freely accessible through the internet, combining existing softwares with new tools to detect SNPs and to compute different types of statistical indices and graphical layouts for SNP data. From standard sequence alignments, genotyping data or Sanger sequencing traces given as input, SNiPlay detects SNPs and indels events and outputs submission files for the design of Illumina's SNP chips. Subsequently, it sends sequences and genotyping data into a series of modules in charge of various processes: physical mapping to a reference genome, annotation (genomic position, intron/exon location, synonymous/non-synonymous substitutions), SNP frequency determination in user-defined groups, haplotype reconstruction and network, linkage disequilibrium evaluation, and diversity analysis (Pi, Watterson's Theta, Tajima's D).Furthermore, the pipeline allows the use of external data (such as phenotype, geographic origin, taxa, stratification) to define groups and compare statistical indices.2) a database storing polymorphisms, genotyping data and grapevine sequences released by public and private projects. It allows the user to retrieve SNPs using various filters (such as genomic position, missing data, polymorphism type, allele frequency), to compare SNP patterns between populations, and to export genotyping data or sequences in various formats. Our experiments on grapevine genetic projects showed that SNiPlay allows geneticists to rapidly obtain advanced results in several key research areas of plant genetic diversity. Both the management and treatment of large amounts of SNP data are rendered considerably easier for end-users through automation and integration. Current developments are taking into account new advances in high-throughput technologies.SNiPlay is available at: http://sniplay.cirad.fr/.
Detection of selective sweeps in cattle using genome-wide SNP data
2013-01-01
Background The domestication and subsequent selection by humans to create breeds and biological types of cattle undoubtedly altered the patterning of variation within their genomes. Strong selection to fix advantageous large-effect mutations underlying domesticability, breed characteristics or productivity created selective sweeps in which variation was lost in the chromosomal region flanking the selected allele. Selective sweeps have now been identified in the genomes of many animal species including humans, dogs, horses, and chickens. Here, we attempt to identify and characterise regions of the bovine genome that have been subjected to selective sweeps. Results Two datasets were used for the discovery and validation of selective sweeps via the fixation of alleles at a series of contiguous SNP loci. BovineSNP50 data were used to identify 28 putative sweep regions among 14 diverse cattle breeds. Affymetrix BOS 1 prescreening assay data for five breeds were used to identify 85 regions and validate 5 regions identified using the BovineSNP50 data. Many genes are located within these regions and the lack of sequence data for the analysed breeds precludes the nomination of selected genes or variants and limits the prediction of the selected phenotypes. However, phenotypes that we predict to have historically been under strong selection include horned-polled, coat colour, stature, ear morphology, and behaviour. Conclusions The bias towards common SNPs in the design of the BovineSNP50 assay led to the identification of recent selective sweeps associated with breed formation and common to only a small number of breeds rather than ancient events associated with domestication which could potentially be common to all European taurines. The limited SNP density, or marker resolution, of the BovineSNP50 assay significantly impacted the rate of false discovery of selective sweeps, however, we found sweeps in common between breeds which were confirmed using an ultra-high-density assay scored in a small number of animals from a subset of the breeds. No sweep regions were shared between indicine and taurine breeds reflecting their divergent selection histories and the very different environmental habitats to which these sub-species have adapted. PMID:23758707
Kling, Daniel; Tillmar, Andreas; Egeland, Thore; Mostad, Petter
2015-09-01
Several applications necessitate an unbiased determination of relatedness, be it in linkage or association studies or in a forensic setting. An appropriate model to compute the joint probability of some genetic data for a set of persons given some hypothesis about the pedigree structure is then required. The increasing number of markers available through high-density SNP microarray typing and NGS technologies intensifies the demand, where using a large number of markers may lead to biased results due to strong dependencies between closely located loci, both within pedigrees (linkage) and in the population (allelic association or linkage disequilibrium (LD)). We present a new general model, based on a Markov chain for inheritance patterns and another Markov chain for founder allele patterns, the latter allowing us to account for LD. We also demonstrate a specific implementation for X chromosomal markers that allows for computation of likelihoods based on hypotheses of alleged relationships and genetic marker data. The algorithm can simultaneously account for linkage, LD, and mutations. We demonstrate its feasibility using simulated examples. The algorithm is implemented in the software FamLinkX, providing a user-friendly GUI for Windows systems (FamLinkX, as well as further usage instructions, is freely available at www.famlink.se ). Our software provides the necessary means to solve cases where no previous implementation exists. In addition, the software has the possibility to perform simulations in order to further study the impact of linkage and LD on computed likelihoods for an arbitrary set of markers.
Barra, Gustavo Barcelos; Dutra, Ludmila Alves Sanches; Watanabe, Sílvia Conde; Costa, Patrícia Godoy Garcia; Cruz, Patrícia Sales Marques da; Azevedo, Monalisa Ferreira; Amato, Angélica Amorim
2012-11-01
To investigate the association of the T allele of the single nucleotide polymorphism (SNP) rs7903146 of TCF7L2 with the occurrence of T2D in a sample of subjects followed up at the Brasilia University Hospital. The SNP rs7903146 of TCF7L2 was genotyped by allele-specific PCR in 113 patients with known T2D and in 139 non-diabetic controls in Brasilia, Brazil. We found that the T allele of the SNP rs7903146 of TCF7L2 was significantly associated with T2D risk (odds ratio of 3.92 for genotype TT in the recessive genetic model, p = 0.004 and 1.5 for T allele, p = 0.032). These results reinforce previous findings on the consistent association of this genetic factor and the risk of T2D in populations of diverse ethnic backgrounds.
Oh, Chang Seok; Lee, Soong Deok; Kim, Yi-Suk; Shin, Dong Hoon
2015-01-01
Previous study showed that East Asian mtDNA haplogroups, especially those of Koreans, could be successfully assigned by the coupled use of analyses on coding region SNP markers and control region mutation motifs. In this study, we tried to see if the same triple multiplex analysis for coding regions SNPs could be also applicable to ancient samples from East Asia as the complementation for sequence analysis of mtDNA control region. By the study on Joseon skeleton samples, we know that mtDNA haplogroup determined by coding region SNP markers successfully falls within the same haplogroup that sequence analysis on control region can assign. Considering that ancient samples in previous studies make no small number of errors in control region mtDNA sequencing, coding region SNP analysis can be used as good complimentary to the conventional haplogroup determination, especially of archaeological human bone samples buried underground over long periods. PMID:26345190
Coombs, Geoffrey W.; Goering, Richard V.; Chua, Kyra Y. L.; Monecke, Stefan; Howden, Benjamin P.; Stinear, Timothy P.; Ehricht, Ralf; O’Brien, Frances G.; Christiansen, Keryn J.
2012-01-01
In Australia the PVL - positive ST93-IV [2B], colloquially known as “Queensland CA-MRSA” has become the dominant CA-MRSA clone. First described in the early 2000s, ST93-IV [2B] is associated with skin and severe invasive infections including necrotizing pneumonia. A singleton by multilocus sequence typing (MLST) eBURST analysis ST93 is distinct from other S aureus clones. To determine if the increased prevalence of ST93-IV [2B] is due to the widespread transmission of a single strain of ST93-IV [2B] the genetic relatedness of 58 S. aureus ST93 isolated throughout Australia over an extended period were studied in detail using a variety of molecular methods including pulsed-field gel electrophoresis, spa typing, MLST, microarray DNA, SCCmec typing and dru typing. Identification of the phage harbouring the lukS-PV/lukF-PV Panton Valentine leucocidin genes, detection of allelic variations in lukS-PV/lukF-PV, and quantification of LukF-PV expression was also performed. Although ST93-IV [2B] is known to have an apparent enhanced clinical virulence, the isolates harboured few known virulence determinants. All PVL-positive isolates carried the PVL-encoding phage ΦSa2USA and the lukS-PV/lukF-PV genes had the same R variant SNP profile. The isolates produced similar expression levels of LukF-PV. Although multiple rearrangements of the spa sequence have occurred, the core genome in ST93 is very stable. The emergence of ST93-MRSA is due to independent acquisitions of different dru-defined type IV and type V SCCmec elements in several spa-defined ST93-MSSA backgrounds. Rearrangement of the spa sequence in ST93-MRSA has subsequently occurred in some of these strains. Although multiple ST93-MRSA strains were characterised, little genetic diversity was identified for most isolates, with PVL-positive ST93-IVa [2B]-t202-dt10 predominant across Australia. Whether ST93-IVa [2B] t202-dt10 arose from one PVL-positive ST93-MSSA-t202, or by independent acquisitions of SCCmec-IVa [2B]-dt10 into multiple PVL-positive ST93-MSSA-t202 strains is not known. PMID:22900085
Xue, Xiao-yan; Zhou, Ying; Chen, Ying-ying; Meng, Jing-ru; Jia, Min; Hou, Zheng; Bai, Hui; Mao, Xing-gang; Luo, Xiao-xing
2012-04-01
To find potential enhancers for facilitating the buccal delivery of insulin, a TR146 cell-culture model of buccal epithelium, cultured on commercially available insert plates, was used to evaluate the permeability-enhancing effects of several traditional and new types of chemical enhancers, including N-acetyl-L-cysteine (NAC), sodium deoxycholate (SDC), sodium nitroprusside (SNP), reduced glutathione (GSH), glutamine (Gln), chitosan (CS), L-arginine (Arg), 1-dodecylazacycloheptan-2-one (Azone), and soybean lecithin (SPC) (50 and 10 μg/mL respectively). Permeability studies were performed to determine the enhancing effects of these compounds on insulin permeation across the cell-culture model. The enhancing effects of the enhancers were assessed by calculating the apparent permeability coefficients and enhancement ratio. Cytotoxicity of the permeation enhancers at different concentrations was investigated by using the methylthiazolydiphenyl-tetrazolium bromide (MTT) assay. Results showed that 50 μg/mL of NAC, SDC, GSH, CS, Arg, Azone, SPC, SNP, and 10 μg/mL of SNP had a significant enhancing effect on promoting the transport of insulin across the TR146 cell model. MTT assays showed that 50 μg/mL of Gln, Azone, SDC, SNP, Arg, 10 μg/mL SDC, and Arg had obvious toxic effects on TR146 cells. Therefore, NAC, GSH, CS, SPC, and SNP appear to be safe, effective permeability enhancers that promote the transport of insulin across the TR146 cell-culture model of buccal epithelium and may be potential enhancers for buccal delivery of insulin with both low toxicity and high efficiency.
Van Inghelandt, Delphine; Melchinger, Albrecht E; Lebreton, Claude; Stich, Benjamin
2010-05-01
Information about the genetic diversity and population structure in elite breeding material is of fundamental importance for the improvement of crops. The objectives of our study were to (a) examine the population structure and the genetic diversity in elite maize germplasm based on simple sequence repeat (SSR) markers, (b) compare these results with those obtained from single nucleotide polymorphism (SNP) markers, and (c) compare the coancestry coefficient calculated from pedigree records with genetic distance estimates calculated from SSR and SNP markers. Our study was based on 1,537 elite maize inbred lines genotyped with 359 SSR and 8,244 SNP markers. The average number of alleles per locus, of group specific alleles, and the gene diversity (D) were higher for SSRs than for SNPs. Modified Roger's distance (MRD) estimates and membership probabilities of the STRUCTURE matrices were higher for SSR than for SNP markers but the germplasm organization in four heterotic pools was consistent with STRUCTURE results based on SSRs and SNPs. MRD estimates calculated for the two marker systems were highly correlated (0.87). Our results suggested that the same conclusions regarding the structure and the diversity of heterotic pools could be drawn from both markers types. Furthermore, although our results suggested that the ratio of the number of SSRs and SNPs required to obtain MRD or D estimates with similar precision is not constant across the various precision levels, we propose that between 7 and 11 times more SNPs than SSRs should be used for analyzing population structure and genetic diversity.
Singh, Kh Dhanachandra; Karthikeyan, Muthusamy
2014-12-01
The renin-angiotensin-aldosterone system (RAAS) plays a key role in the regulation of blood pressure (BP). Mutations on the genes that encode components of the RAAS have played a significant role in genetic susceptibility to hypertension and have been intensively scrutinized. The identification of such probably causal mutations not only provides insight into the RAAS but may also serve as antihypertensive therapeutic targets and diagnostic markers. The methods for analyzing the SNPs from the huge dataset of SNPs, containing both functional and neutral SNPs is challenging by the experimental approach on every SNPs to determine their biological significance. To explore the functional significance of genetic mutation (SNPs), we adopted combined sequence and sequence-structure-based SNP analysis algorithm. Out of 3864 SNPs reported in dbSNP, we found 108 missense SNPs in the coding region and remaining in the non-coding region. In this study, we are reporting only those SNPs in coding region to be deleterious when three or more tools are predicted to be deleterious and which have high RMSD from the native structure. Based on these analyses, we have identified two SNPs of REN gene, eight SNPs of AGT gene, three SNPs of ACE gene, two SNPs of AT1R gene, three SNPs of CYP11B2 gene and three SNPs of CMA1 gene in the coding region were found to be deleterious. Further this type of study will be helpful in reducing the cost and time for identification of potential SNP and also helpful in selecting potential SNP for experimental study out of SNP pool.
Population distribution and ancestry of the cancer protective MDM2 SNP285 (rs117039649).
Knappskog, Stian; Gansmo, Liv B; Dibirova, Khadizha; Metspalu, Andres; Cybulski, Cezary; Peterlongo, Paolo; Aaltonen, Lauri; Vatten, Lars; Romundstad, Pål; Hveem, Kristian; Devilee, Peter; Evans, Gareth D; Lin, Dongxin; Van Camp, Guy; Manolopoulos, Vangelis G; Osorio, Ana; Milani, Lili; Ozcelik, Tayfun; Zalloua, Pierre; Mouzaya, Francis; Bliznetz, Elena; Balanovska, Elena; Pocheshkova, Elvira; Kučinskas, Vaidutis; Atramentova, Lubov; Nymadawa, Pagbajabyn; Titov, Konstantin; Lavryashina, Maria; Yusupov, Yuldash; Bogdanova, Natalia; Koshel, Sergey; Zamora, Jorge; Wedge, David C; Charlesworth, Deborah; Dörk, Thilo; Balanovsky, Oleg; Lønning, Per E
2014-09-30
The MDM2 promoter SNP285C is located on the SNP309G allele. While SNP309G enhances Sp1 transcription factor binding and MDM2 transcription, SNP285C antagonizes Sp1 binding and reduces the risk of breast-, ovary- and endometrial cancer. Assessing SNP285 and 309 genotypes across 25 different ethnic populations (>10.000 individuals), the incidence of SNP285C was 6-8% across European populations except for Finns (1.2%) and Saami (0.3%). The incidence decreased towards the Middle-East and Eastern Russia, and SNP285C was absent among Han Chinese, Mongolians and African Americans. Interhaplotype variation analyses estimated SNP285C to have originated about 14,700 years ago (95% CI: 8,300 - 33,300). Both this estimate and the geographical distribution suggest SNP285C to have arisen after the separation between Caucasians and modern day East Asians (17,000 - 40,000 years ago). We observed a strong inverse correlation (r = -0.805; p < 0.001) between the percentage of SNP309G alleles harboring SNP285C and the MAF for SNP309G itself across different populations suggesting selection and environmental adaptation with respect to MDM2 expression in recent human evolution. In conclusion, we found SNP285C to be a pan-Caucasian variant. Ethnic variation regarding distribution of SNP285C needs to be taken into account when assessing the impact of MDM2 SNPs on cancer risk.
Bessonov, Kyrylo; Walkey, Christopher J.; Shelp, Barry J.; van Vuuren, Hennie J. J.; Chiu, David; van der Merwe, George
2013-01-01
Analyzing time-course expression data captured in microarray datasets is a complex undertaking as the vast and complex data space is represented by a relatively low number of samples as compared to thousands of available genes. Here, we developed the Interdependent Correlation Clustering (ICC) method to analyze relationships that exist among genes conditioned on the expression of a specific target gene in microarray data. Based on Correlation Clustering, the ICC method analyzes a large set of correlation values related to gene expression profiles extracted from given microarray datasets. ICC can be applied to any microarray dataset and any target gene. We applied this method to microarray data generated from wine fermentations and selected NSF1, which encodes a C2H2 zinc finger-type transcription factor, as the target gene. The validity of the method was verified by accurate identifications of the previously known functional roles of NSF1. In addition, we identified and verified potential new functions for this gene; specifically, NSF1 is a negative regulator for the expression of sulfur metabolism genes, the nuclear localization of Nsf1 protein (Nsf1p) is controlled in a sulfur-dependent manner, and the transcription of NSF1 is regulated by Met4p, an important transcriptional activator of sulfur metabolism genes. The inter-disciplinary approach adopted here highlighted the accuracy and relevancy of the ICC method in mining for novel gene functions using complex microarray datasets with a limited number of samples. PMID:24130853
Ling, Zhi-Qiang; Wang, Yi; Mukaisho, Kenichi; Hattori, Takanori; Tatsuta, Takeshi; Ge, Ming-Hua; Jin, Li; Mao, Wei-Min; Sugihara, Hiroyuki
2010-06-01
Tests of differentially expressed genes (DEGs) from microarray experiments are based on the null hypothesis that genes that are irrelevant to the phenotype/stimulus are expressed equally in the target and control samples. However, this strict hypothesis is not always true, as there can be several transcriptomic background differences between target and control samples, including different cell/tissue types, different cell cycle stages and different biological donors. These differences lead to increased false positives, which have little biological/medical significance. In this article, we propose a statistical framework to identify DEGs between target and control samples from expression microarray data allowing transcriptomic background differences between these samples by introducing a modified null hypothesis that the gene expression background difference is normally distributed. We use an iterative procedure to perform robust estimation of the null hypothesis and identify DEGs as outliers. We evaluated our method using our own triplicate microarray experiment, followed by validations with reverse transcription-polymerase chain reaction (RT-PCR) and on the MicroArray Quality Control dataset. The evaluations suggest that our technique (i) results in less false positive and false negative results, as measured by the degree of agreement with RT-PCR of the same samples, (ii) can be applied to different microarray platforms and results in better reproducibility as measured by the degree of DEG identification concordance both intra- and inter-platforms and (iii) can be applied efficiently with only a few microarray replicates. Based on these evaluations, we propose that this method not only identifies more reliable and biologically/medically significant DEG, but also reduces the power-cost tradeoff problem in the microarray field. Source code and binaries freely available for download at http://comonca.org.cn/fdca/resources/softwares/deg.zip.
La Serna-Infantes, Jorge; Pastor, Miguel Chávez; Trubnykova, Milana; Velásquez, Félix Chavesta; Sotomayor, Flor Vásquez; Barriga, Hugo Abarca
2018-07-01
Trichothiodystrophy type 4 is a rare autosomal recessive and ectodermal disorder, characterized by dry, brittle, sparse and sulfur-deficient hair and other features like intellectual disability, ichthyotic skin and short stature, caused by a homozygous mutation in MPLKIP gene. Glutaric aciduria type 3 is caused by a homozygous mutation in SUGCT gene with no distinctive phenotype. Both genes are localized on chromosome 7 (7p14). We report an 8-year-old female with short stature, microcephaly, development delay, intellectual disability and hair characterized for dark, short, coarse, sparse and brittle associated to classical trichorrhexis microscopy pattern. Chromosome microarray analysis showed a 125 kb homozygous pathogenic deletion, which includes genes MPLKIP and SUGCT, not described before. This is the first case described in Peru of a novel contiguous gene deletion of Trichothiodystrophy type 4 and Glutaric aciduria type 3 performed by chromosome microarray analysis, highlighting the contribution and importance of molecular technologies on diagnosis of rare genetic conditions. Copyright © 2018 Elsevier Masson SAS. All rights reserved.
Microarray labeling extension values: laboratory signatures for Affymetrix GeneChips
Lee, Yun-Shien; Chen, Chun-Houh; Tsai, Chi-Neu; Tsai, Chia-Lung; Chao, Angel; Wang, Tzu-Hao
2009-01-01
Interlaboratory comparison of microarray data, even when using the same platform, imposes several challenges to scientists. RNA quality, RNA labeling efficiency, hybridization procedures and data-mining tools can all contribute variations in each laboratory. In Affymetrix GeneChips, about 11–20 different 25-mer oligonucleotides are used to measure the level of each transcript. Here, we report that ‘labeling extension values (LEVs)’, which are correlation coefficients between probe intensities and probe positions, are highly correlated with the gene expression levels (GEVs) on eukayotic Affymetrix microarray data. By analyzing LEVs and GEVs in the publicly available 2414 cel files of 20 Affymetrix microarray types covering 13 species, we found that correlations between LEVs and GEVs only exist in eukaryotic RNAs, but not in prokaryotic ones. Surprisingly, Affymetrix results of the same specimens that were analyzed in different laboratories could be clearly differentiated only by LEVs, leading to the identification of ‘laboratory signatures’. In the examined dataset, GSE10797, filtering out high-LEV genes did not compromise the discovery of biological processes that are constructed by differentially expressed genes. In conclusion, LEVs provide a new filtering parameter for microarray analysis of gene expression and it may improve the inter- and intralaboratory comparability of Affymetrix GeneChips data. PMID:19295132
Protein microarray analysis reveals BAFF-binding autoantibodies in systemic lupus erythematosus
Price, Jordan V.; Haddon, David J.; Kemmer, Dodge; Delepine, Guillaume; Mandelbaum, Gil; Jarrell, Justin A.; Gupta, Rohit; Balboni, Imelda; Chakravarty, Eliza F.; Sokolove, Jeremy; Shum, Anthony K.; Anderson, Mark S.; Cheng, Mickie H.; Robinson, William H.; Browne, Sarah K.; Holland, Steven M.; Baechler, Emily C.; Utz, Paul J.
2013-01-01
Autoantibodies against cytokines, chemokines, and growth factors inhibit normal immunity and are implicated in inflammatory autoimmune disease and diseases of immune deficiency. In an effort to evaluate serum from autoimmune and immunodeficient patients for Abs against cytokines, chemokines, and growth factors in a high-throughput and unbiased manner, we constructed a multiplex protein microarray for detection of serum factor–binding Abs and used the microarray to detect autoantibody targets in SLE. We designed a nitrocellulose-surface microarray containing human cytokines, chemokines, and other circulating proteins and demonstrated that the array permitted specific detection of serum factor–binding probes. We used the arrays to detect previously described autoantibodies against cytokines in samples from individuals with autoimmune polyendocrine syndrome type 1 and chronic mycobacterial infection. Serum profiling from individuals with SLE revealed that among several targets, elevated IgG autoantibody reactivity to B cell–activating factor (BAFF) was associated with SLE compared with control samples. BAFF reactivity correlated with the severity of disease-associated features, including IFN-α–driven SLE pathology. Our results showed that serum factor protein microarrays facilitate detection of autoantibody reactivity to serum factors in human samples and that BAFF-reactive autoantibodies may be associated with an elevated inflammatory disease state within the spectrum of SLE. PMID:24270423
Impact of pre-imputation SNP-filtering on genotype imputation results
2014-01-01
Background Imputation of partially missing or unobserved genotypes is an indispensable tool for SNP data analyses. However, research and understanding of the impact of initial SNP-data quality control on imputation results is still limited. In this paper, we aim to evaluate the effect of different strategies of pre-imputation quality filtering on the performance of the widely used imputation algorithms MaCH and IMPUTE. Results We considered three scenarios: imputation of partially missing genotypes with usage of an external reference panel, without usage of an external reference panel, as well as imputation of completely un-typed SNPs using an external reference panel. We first created various datasets applying different SNP quality filters and masking certain percentages of randomly selected high-quality SNPs. We imputed these SNPs and compared the results between the different filtering scenarios by using established and newly proposed measures of imputation quality. While the established measures assess certainty of imputation results, our newly proposed measures focus on the agreement with true genotypes. These measures showed that pre-imputation SNP-filtering might be detrimental regarding imputation quality. Moreover, the strongest drivers of imputation quality were in general the burden of missingness and the number of SNPs used for imputation. We also found that using a reference panel always improves imputation quality of partially missing genotypes. MaCH performed slightly better than IMPUTE2 in most of our scenarios. Again, these results were more pronounced when using our newly defined measures of imputation quality. Conclusion Even a moderate filtering has a detrimental effect on the imputation quality. Therefore little or no SNP filtering prior to imputation appears to be the best strategy for imputing small to moderately sized datasets. Our results also showed that for these datasets, MaCH performs slightly better than IMPUTE2 in most scenarios at the cost of increased computing time. PMID:25112433
Díaz-Villaseñor, Andrea; Cruz, Laura; Cebrián, Arturo; Hernández-Ramírez, Raúl U.; Hiriart, Marcia; García-Vargas, Gonzálo; Bassol, Susana; Sordo, Monserrat; Gandolfi, A. Jay; Klimecki, Walter T.; López-Carillo, Lizbeth; Cebrián, Mariano E.; Ostrosky-Wegman, Patricia
2013-01-01
The incidence of type 2 diabetes mellitus (T2DM) is increasing worldwide and diverse environmental and genetic risk factors are well recognized. Single nucleotide polymorphisms (SNPs) in the calpain-10 gene (CAPN-10), which encodes a protein involved in the secretion and action of insulin, and chronic exposure to inorganic arsenic (iAs) through drinking water have been independently associated with an increase in the risk for T2DM. In the present work we evaluated if CAPN-10 SNPs and iAs exposure jointly contribute to the outcome of T2DM. Insulin secretion (beta-cell function) and insulin sensitivity were evaluated indirectly through validated indexes (HOMA2) in subjects with and without T2DM who have been exposed to a gradient of iAs in their drinking water in northern Mexico. The results were analyzed taking into account the presence of the risk factor SNPs SNP-43 and -44 in CAPN-10. Subjects with T2DM had significantly lower beta-cell function and insulin sensitivity. An inverse association was found between beta-cell function and iAs exposure, the association being more pronounced in subjects with T2DM. Subjects without T2DM who were carriers of the at-risk genotype SNP-43 or -44, also had significantly lower beta-cell function. The association of SNP-43 with beta-cell function was dependent on iAs exposure, age, gender and BMI, whereas the association with SNP-44 was independent of all of these factors. Chronic exposure to iAs seems to be a risk factor for T2DM in humans through the reduction of beta-cell function, with an enhanced effect seen in the presence of the at-risk genotype of SNP-43 in CAPN-10. Carriers of CAPN-10 SNP-44 have also shown reduced beta-cell function. PMID:23349674
Zhang, Wensheng; Edwards, Andrea; Zhu, Dongxiao; Flemington, Erik K.; Deininger, Prescott; Zhang, Kun
2012-01-01
In metazoans, miRNAs regulate gene expression primarily through binding to target sites in the 3′ UTRs (untranslated regions) of messenger RNAs (mRNAs). Cis-acting variants within, or close to, a gene are crucial in explaining the variability of gene expression measures. Single nucleotide polymorphisms (SNPs) in the 3′ UTRs of genes can affect the base-pairing between miRNAs and mRNAs, and hence disrupt existing target sites (in the reference sequence) or create novel target sites, suggesting a possible mechanism for cis regulation of gene expression. Moreover, because the alleles of different SNPs within a DNA sequence of limited length tend to be in strong linkage disequilibrium (LD), we hypothesize the variants of miRNA target sites caused by SNPs potentially function as bridges linking the documented cis-SNP markers to the expression of the associated genes. A large-scale analysis was herein performed to test this hypothesis. By systematically integrating multiple latest information sources, we found 21 significant gene-level SNP-involved miRNA-mediated post-transcriptional regulation modules (SNP-MPRMs) in the form of SNP-miRNA-mRNA triplets in lymphocyte cell lines for the CEU and YRI populations. Among the cognate genes, six including ALG8, DGKE, GNA12, KLF11, LRPAP1, and MMAB are related to multiple genetic diseases such as depressive disorder and Type-II diabetes. Furthermore, we found that ∼35% of the documented transcript intensity-related cis-SNPs (∼950) in a recent publication are identical to, or in significant linkage disequilibrium (LD) (p<0.01) with, one or multiple SNPs located in miRNA target sites. Based on these associations (or identities), 69 significant exon-level SNP-MPRMs and 12 disease genes were further determined for two populations. These results provide concrete in silico evidence for the proposed hypothesis. The discovered modules warrant additional follow-up in independent laboratory studies. PMID:22348086
Lehman, Donna M; Fu, Dong-Jing; Freeman, Angela B; Hunt, Kelly J; Leach, Robin J; Johnson-Pais, Teresa; Hamlington, Jeanette; Dyer, Thomas D; Arya, Rector; Abboud, Hanna; Göring, Harald H H; Duggirala, Ravindranath; Blangero, John; Konrad, Robert J; Stern, Michael P
2005-04-01
Excess O-glycosylation of proteins by O-linked beta-N-acetylglucosamine (O-GlcNAc) may be involved in the pathogenesis of type 2 diabetes. The enzyme O-GlcNAc-selective N-acetyl-beta-d glucosaminidase (O-GlcNAcase) encoded by MGEA5 on 10q24.1-q24.3 reverses this modification by catalyzing the removal of O-GlcNAc. We have previously reported the linkage of type 2 diabetes and age at diabetes onset to an overlapping region on chromosome 10q in the San Antonio Family Diabetes Study (SAFADS). In this study, we investigated menangioma-expressed antigen-5 (MGEA5) as a positional candidate gene. Twenty-four single nucleotide polymorphisms (SNPs), identified by sequencing 44 SAFADS subjects, were genotyped in 436 individuals from 27 families whose data were used in the original linkage report. Association tests indicated significant association of a novel SNP with the traits diabetes (P = 0.0128, relative risk = 2.77) and age at diabetes onset (P = 0.0017). The associated SNP is located in intron 10, which contains an alternate stop codon and may lead to decreased expression of the 130-kDa isoform, the isoform predicted to contain the O-GlcNAcase activity. We investigated whether this variant was responsible for the original linkage signal. The variance attributed to this SNP accounted for approximately 25% of the logarithm of odds. These results suggest that this variant within the MGEA5 gene may increase diabetes risk in Mexican Americans.
Multicapillary gel electrophoresis based analysis of genetic variants in the WFS1 gene.
Elek, Zsuzsanna; Dénes, Réka; Prokop, Susanne; Somogyi, Anikó; Yowanto, Handy; Luo, Jane; Souquet, Manfred; Guttman, András; Rónai, Zsolt
2016-09-01
The WFS1 gene is one of the thoroughly investigated targets in diabetes research, variants of the gene were suggested to be the genetic components of the common forms (type 1 and type 2) of diabetes. Our project focused on the analysis of polymorphisms (rs4689388, rs148797429, rs4273545) localized in the WFS1 promoter region. Although submarine gel electrophoresis based approaches were also employed in the genetic tests, it was demonstrated that multicapillary electrophoresis offers a state of the art approach for reliable high-throughput SNP and VNTR analysis. Association studies were carried out in a case-control setup. Luciferase reporter assay was employed to test the effect of the investigated loci on the activity of gene expression in vitro. Significant association could be demonstrated between all three polymorphisms and type 2 diabetes in both allele- and genotype-wise settings even using Bonferroni correction. It is notable; however, that the three loci were in strong linkage disequilibrium, thus the observed associations cannot be considered as separate effects. Molecular analyses showed that the rs4273545 GT SNP played a role in the regulation of transcription in vitro. However, this effect took place only in the presence of the region including the rs148797429 site, although this latter locus did not have its own impact on the regulation of gene expression. The paper provides genotyping protocols readily applicable in any multiplex SNP and VNTR analyses, moreover confirms and extends previous results about the role of WFS1 polymorphisms in the genetic risk of diabetes mellitus. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Wolf, Christiane; Angelberger, Marianne; Diegelmann, Julia; Olszak, Torsten; Beigel, Florian; Tillack, Cornelia; Stallhofer, Johannes; Göke, Burkhard; Glas, Jürgen; Lohse, Peter; Brand, Stephan
2014-01-01
Background Very recently, a sub-analysis of genome-wide association scans revealed that the non-coding single nucleotide polymorphism (SNP) rs12212067 in the FOXO3A gene is associated with a milder course of Crohn's disease (CD) (Cell 2013;155:57–69). The aim of our study was to evaluate the clinical value of the SNP rs12212067 in predicting the severity of CD by correlating CD patient genotype status with the most relevant complications of CD such as stenoses, fistulas, and CD-related surgery. Methodology/Principal Findings We genotyped 550 CD patients for rs12212067 (FOXO3A) and the three common CD-associated NOD2 mutations rs2066844, rs2066847, and rs2066847 and performed genotype-phenotype analyses. Results No significant phenotypic differences were found between the wild-type genotype TT of the FOXO3A SNP rs12212067 and the minor genotypes TG and GG independently from NOD2 variants. The allele frequency of the minor G allele was 12.7%. Age at diagnosis, disease duration, body mass index, surgery rate, stenoses, fistula, need for immunosuppressive therapy, and disease course were not significantly different. In contrast, the NOD2 mutant p.Leu1007fsX1008 (rs2066847) was highly associated with penetrating CD (p = 0.01), the development of fistulas (p = 0.01) and stenoses (p = 0.01), and ileal disease localization (p = 0.03). Importantly, the NOD2 SNP rs2066847 was a strong separator between an aggressive and a mild course of CD (p = 2.99×10−5), while the FOXO3A SNP rs12212067 did not separate between mild and aggressive CD behavior in our cohort (p = 0.35). 96.2% of the homozygous NOD2 p.Leu1007fsX1008 carriers had an aggressive disease behavior compared to 69.3% of the patients with the NOD2 wild-type genotype (p = 0.007). Conclusion/Significance In clinical practice, the NOD2 variant p.Leu1007fsX1008 (rs2066847), in particular in homozygous form, is a much stronger marker for a severe clinical phenotype than the FOXO3A rs12212067 SNP for a mild disease course on an individual patient level despite its important impact on the inflammatory response of monocytes. PMID:25365249
Meinert, Christian; Gembardt, Florian; Böhme, Ilka; Tetzner, Anja; Wieland, Thomas; Greenberg, Barry; Walther, Thomas
2016-01-01
The study aimed to identify proteins regulated by the cardiovascular protective peptide angiotensin-(1-7) and to determine potential intracellular signaling cascades. Human endothelial cells were stimulated with Ang-(1-7) for 1 h, 3 h, 6 h, and 9 h. Peptide effects on intracellular signaling were assessed via antibody microarray, containing antibodies against 725 proteins. Bioinformatics software was used to identify affected intracellular signaling pathways. Microarray data was verified exemplarily by Western blot, Real-Time RT-PCR, and immunohistochemical studies. The microarray identified 110 regulated proteins after 1 h, 119 after 3 h, 31 after 6 h, and 86 after 9 h Ang-(1-7) stimulation. Regulated proteins were associated with high significance to several metabolic pathways like “Molecular Mechanism of Cancer” and “p53 signaling” in a time dependent manner. Exemplarily, Western blots for the E3-type small ubiquitin-like modifier ligase PIAS2 confirmed the microarray data and displayed a decrease by more than 50% after Ang-(1-7) stimulation at 1 h and 3 h without affecting its mRNA. Immunohistochemical studies with PIAS2 in human endothelial cells showed a decrease in cytoplasmic PIAS2 after Ang-(1-7) treatment. The Ang-(1-7) mediated decrease of PIAS2 was reproduced in other endothelial cell types. The results suggest that angiotensin-(1-7) plays a role in metabolic pathways related to cell death and cell survival in human endothelial cells.
Erickson, A; Fisher, M; Furukawa-Stoffer, T; Ambagala, A; Hodko, D; Pasick, J; King, D P; Nfon, C; Ortega Polo, R; Lung, O
2018-04-01
Microarray technology can be useful for pathogen detection as it allows simultaneous interrogation of the presence or absence of a large number of genetic signatures. However, most microarray assays are labour-intensive and time-consuming to perform. This study describes the development and initial evaluation of a multiplex reverse transcription (RT)-PCR and novel accompanying automated electronic microarray assay for simultaneous detection and differentiation of seven important viruses that affect swine (foot-and-mouth disease virus [FMDV], swine vesicular disease virus [SVDV], vesicular exanthema of swine virus [VESV], African swine fever virus [ASFV], classical swine fever virus [CSFV], porcine respiratory and reproductive syndrome virus [PRRSV] and porcine circovirus type 2 [PCV2]). The novel electronic microarray assay utilizes a single, user-friendly instrument that integrates and automates capture probe printing, hybridization, washing and reporting on a disposable electronic microarray cartridge with 400 features. This assay accurately detected and identified a total of 68 isolates of the seven targeted virus species including 23 samples of FMDV, representing all seven serotypes, and 10 CSFV strains, representing all three genotypes. The assay successfully detected viruses in clinical samples from the field, experimentally infected animals (as early as 1 day post-infection (dpi) for FMDV and SVDV, 4 dpi for ASFV, 5 dpi for CSFV), as well as in biological material that were spiked with target viruses. The limit of detection was 10 copies/μl for ASFV, PCV2 and PRRSV, 100 copies/μl for SVDV, CSFV, VESV and 1,000 copies/μl for FMDV. The electronic microarray component had reduced analytical sensitivity for several of the target viruses when compared with the multiplex RT-PCR. The integration of capture probe printing allows custom onsite array printing as needed, while electrophoretically driven hybridization generates results faster than conventional microarrays that rely on passive hybridization. With further refinement, this novel, rapid, highly automated microarray technology has potential applications in multipathogen surveillance of livestock diseases. © 2017 Her Majesty the Queen in Right of Canada • Transboundary and Emerging Diseases.
2012-01-01
Diffuse large B-cell lymphoma (DLBCL) is the most common type of non-Hodgkin Lymphoma comprising of greater than 30% of adult non-Hodgkin Lymphomas. DLBCL represents a diverse set of lymphomas, defined as diffuse proliferation of large B lymphoid cells. Numerous cytogenetic studies including karyotypes and fluorescent in situ hybridization (FISH), as well as morphological, biological, clinical, microarray and sequencing technologies have attempted to categorize DLBCL into morphological variants, molecular and immunophenotypic subgroups, as well as distinct disease entities. Despite such efforts, most lymphoma remains undistinguishable and falls into DLBCL, not otherwise specified (DLBCL-NOS). The advent of microarray-based studies (chromosome, RNA, gene expression, etc) has provided a plethora of high-resolution data that could potentially facilitate the finer classification of DLBCL. This review covers the microarray data currently published for DLBCL. We will focus on these types of data; 1) array based CGH; 2) classical CGH; and 3) gene expression profiling studies. The aims of this review were three-fold: (1) to catalog chromosome loci that are present in at least 20% or more of distinct DLBCL subtypes; a detailed list of gains and losses for different subtypes was generated in a table form to illustrate specific chromosome loci affected in selected subtypes; (2) to determine common and distinct copy number alterations among the different subtypes and based on this information, characteristic and similar chromosome loci for the different subtypes were depicted in two separate chromosome ideograms; and, (3) to list re-classified subtypes and those that remained indistinguishable after review of the microarray data. To the best of our knowledge, this is the first effort to compile and review available literatures on microarray analysis data and their practical utility in classifying DLBCL subtypes. Although conventional cytogenetic methods such as Karyotypes and FISH have played a major role in classification schemes of lymphomas, better classification models are clearly needed to further understanding the biology, disease outcome and therapeutic management of DLBCL. In summary, microarray data reviewed here can provide better subtype specific classifications models for DLBCL. PMID:22967872
Georgi, Enrico; Walter, Mathias C; Pfalzgraf, Marie-Theres; Northoff, Bernd H; Holdt, Lesca M; Scholz, Holger C; Zoeller, Lothar; Zange, Sabine; Antwerpen, Markus H
2017-01-01
Brucellosis, a worldwide common bacterial zoonotic disease, has become quite rare in Northern and Western Europe. However, since 2014 a significant increase of imported infections caused by Brucella (B.) melitensis has been noticed in Germany. Patients predominantly originated from Middle East including Turkey and Syria. These circumstances afforded an opportunity to gain insights into the population structure of Brucella strains. Brucella-isolates from 57 patients were recovered between January 2014 and June 2016 with culture confirmed brucellosis by the National Consultant Laboratory for Brucella. Their whole genome sequences were generated using the Illumina MiSeq platform. A whole genome-based SNP typing assay was developed in order to resolve geographically attributed genetic clusters. Results were compared to MLVA typing results, the current gold-standard of Brucella typing. In addition, sequences were examined for possible genetic variation within target regions of molecular diagnostic assays. Phylogenetic analyses revealed spatial clustering and distinguished strains from different patients in either case, whereas multiple isolates from a single patient or technical replicates showed identical SNP and MLVA profiles. By including WGS data from the NCBI database, five major genotypes were identified. Notably, strains originating from Turkey showed a high diversity and grouped into seven subclusters of genotype II. MLVA analysis congruently clustered all isolates and predominantly matched the East Mediterranean genetic clade. This study confirms whole-genome based SNP-analysis as a powerful tool for accurate typing of B. melitensis. Furthermore it allows special allocation and therefore provides useful information on the geographic origin for trace-back analysis. However, the lack of reliable metadata in public databases often prevents a resolution below geographic regions or country levels and corresponding precise trace-back analysis. Once this obstacle is resolved, WGS-derived bacterial typing adds an important method to complement epidemiological surveys during outbreak investigations. This is the first report of a detailed genetic investigation of an extensive collection of B. melitensis strains isolated from human cases in Germany.
Li, Wen; Fan, Chun Chieh; Mäki-Marttunen, Tuomo; Thompson, Wesley K; Schork, Andrew J; Bettella, Francesco; Djurovic, Srdjan; Dale, Anders M; Andreassen, Ole A; Wang, Yunpeng
2018-06-01
Traditional genome-wide association studies (GWAS) have successfully detected genetic variants associated with schizophrenia. However, only a small fraction of heritability can be explained. Gene-set/pathway-based methods can overcome limitations arising from single nucleotide polymorphism (SNP)-based analysis, but most of them place constraints on size which may exclude highly specific and functional sets, like macromolecules. Voltage-gated calcium (Ca v ) channels, belonging to macromolecules, are composed of several subunits whose encoding genes are located far away or even on different chromosomes. We combined information about such molecules with GWAS data to investigate how functional channels associated with schizophrenia. We defined a biologically meaningful SNP-set based on channel structure and performed an association study by using a validated method: SNP-set (sequence) kernel association test. We identified eight subtypes of Ca v channels significantly associated with schizophrenia from a subsample of published data (N = 56,605), including the L-type channels (Ca v 1.1, Ca v 1.2, Ca v 1.3), P-/Q-type Ca v 2.1, N-type Ca v 2.2, R-type Ca v 2.3, T-type Ca v 3.1, and Ca v 3.3. Only genes from Ca v 1.2 and Ca v 3.3 have been implicated by the largest GWAS (N = 82,315). Each subtype of Ca v channels showed relatively high chip heritability, proportional to the size of its constituent gene regions. The results suggest that abnormalities of Ca v channels may play an important role in the pathophysiology of schizophrenia and these channels may represent appropriate drug targets for therapeutics. Analyzing subunit-encoding genes of a macromolecule in aggregate is a complementary way to identify more genetic variants of polygenic diseases. This study offers the potential of power for discovery the biological mechanisms of schizophrenia. © 2018 Wiley Periodicals, Inc.
Type I interferons modulate methotrexate resistance in gestational trophoblastic neoplasia.
Elias, Kevin M; Harvey, Richard A; Hasselblatt, Kathleen T; Seckl, Michael J; Berkowitz, Ross S
2017-06-01
Resistance to methotrexate is a leading clinical problem in gestational trophoblastic neoplasia (GTN), but there are limited laboratory models for this condition. We created isogenic trophoblastic cell lines resistant to methotrexate and compared these to the parent cell lines using gene expression microarrays and qRT-PCR followed by mechanistic studies using recombinant cytokines, pathway inhibitors, and patient sera. Gene expression microarrays and focused analysis by qRT-PCR revealed methotrexate led to type I interferon upregulation, in particular interferon alpha 2 (IFNA2), and methotrexate resistance was associated with chronic low level increases in type I interferon expression. Recombinant IFNA2 imparted chemosensitive choriocarcinoma cells with partial resistance to methotrexate, while chemoresistant choriocarcinoma cells were uniquely sensitive to fludarabine, a STAT1 inhibitor. In pre-treatment patient sera, IFNA2 levels correlated with subsequent resistance to methotrexate chemotherapy. Methotrexate resistance is influenced by type I interferon signaling with prognostic and therapeutic implications for treating women with GTN. © 2017 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Sharma, Pankaj; Gupta, Neerja; Chowdhury, Madhumita Roy; Sapra, Savita; Ghosh, Manju; Gulati, Sheffali; Kabra, Madhulika
2016-09-15
Intellectual disability (ID)/Global developmental delay (GDD) is a diverse group of disorders in terms of cognitive and non-cognitive functions and can occur with or without associated co-morbidities. It affects 1-3% of individuals globally and in at least 30-50% of cases the etiology remains unexplained. The widespread use of chromosomal microarray analysis (CMA) in a clinical setting has allowed the identification of submicroscopic copy number variations (CNVs), throughout the genome, associated with neurodevelopmental phenotypes including ID/GDD. In this study we investigated the utility of CMA in the detection of CNVs in 106 patients with unexplained ID/DD, dysmorphism with or without multiple congenital anomalies (MCA). CMA study was carried out using Agilent 8×60K chips and Illumina Human CytoSNP-12 chips. Pathogenic CNVs were found in 15 (14.2%) patients. In these patients, CNVs on single chromosome were detected in 10 patients while 5 patients showed co-occurrence CNVs on two chromosomes. The size of these CNVs ranged between 322kb to 13Mb. The yield of pathogenic CNVs was similar for both mild and severe ID/GDD cases. One patient described in this paper is considered to harbour a likely pathogenic CNV with deletion in 17q22 region. Only few cases have been described in literature for 17q22 deletion and patient reported here was found to have an atypical deletion in 17q22 region (Case 90). This study re-affirms the view point that CMA is a powerful diagnostic tool in the evaluation of idiopathic ID/GDD patients irrespective of the degree of severity. Identifying pathogenic CNVs helps in counseling and prenatal diagnosis if desired. Copyright © 2016 Elsevier B.V. All rights reserved.
Effects of Temperature on the Meiotic Recombination Landscape of the Yeast Saccharomyces cerevisiae
Zhang, Ke; Wu, Xue-Chang
2017-01-01
ABSTRACT Although meiosis in warm-blooded organisms takes place in a narrow temperature range, meiosis in many organisms occurs over a wide variety of temperatures. We analyzed the properties of meiosis in the yeast Saccharomyces cerevisiae in cells sporulated at 14°C, 30°C, or 37°C. Using comparative-genomic-hybridization microarrays, we examined the distribution of Spo11-generated meiosis-specific double-stranded DNA breaks throughout the genome. Although there were between 300 and 400 regions of the genome with high levels of recombination (hot spots) observed at each temperature, only about 20% of these hot spots were found to have occurred independently of the temperature. In S. cerevisiae, regions near the telomeres and centromeres tend to have low levels of meiotic recombination. This tendency was observed in cells sporulated at 14°C and 30°C, but not at 37°C. Thus, the temperature of sporulation in yeast affects some global property of chromosome structure relevant to meiotic recombination. Using single-nucleotide polymorphism (SNP)-specific whole-genome microarrays, we also examined crossovers and their associated gene conversion events as well as gene conversion events that were unassociated with crossovers in all four spores of tetrads obtained by sporulation of diploids at 14°C, 30°C, or 37°C. Although tetrads from cells sporulated at 30°C had slightly (20%) more crossovers than those derived from cells sporulated at the other two temperatures, spore viability was good at all three temperatures. Thus, despite temperature-induced variation in the genetic maps, yeast cells produce viable haploid products at a wide variety of sporulation temperatures. PMID:29259092
Pang, Chunhong; LaLonde, Amy; Godfrey, Tony E; Que, Jianwen; Sun, Jun; Wu, Tong Tong; Zhou, Zhongren
2017-01-01
Bile acid reflux in the esophagus plays an important role in the carcinogenesis of esophageal adenocarcinoma (EAC). The G-protein coupled bile acid receptor (TGR5) has been associated with the development of gastrointestinal cancer. However, little is known regarding the role of TGR5 in esophageal carcinoma and precancerous lesions. We analyzed genomic DNA from 116 EACs for copy number aberrations via Affymetrix SNP6.0 microarrays. The TGR5 gene locus was amplified in 12.7% (14/116) of the EACs. The TGR5 protein expression was also assessed using immunohistochemistry from tissue microarrays, including Barrett’s esophagus (BE), low-(LGD) and high-grade dysplasia (HGD), columnar cell metaplasia (CM), squamous epithelium (SE), EAC and squamous cell carcinoma. The TGR5 protein was highly expressed in 71% of EAC (75/106), 100% of HGD (11/11), 72% of LGD (13/18), 66% of BE (23/35), 84% of CM (52/62), and 36% of SE (30/83). The patients with high expression of TGR5 exhibited significantly worse overall survival compared to the patients with nonhigh expression. TGR5 high expression was significantly increased in the males compared to the females in all cases with an odds ratio of 1.9 times. The vitamin D receptor (VDR) was significantly correlated with TGR5 expression. Our findings indicated that TGR5 may play an important role in the development and prognosis of EAC through a bile acid ligand. Gender differences in TGR5 and VDR expression may explain why males have a higher incidence of EAC compared to females. PMID:28223834
Progranulin levels in blood in Alzheimer's disease and mild cognitive impairment.
Cooper, Yonatan A; Nachun, Daniel; Dokuru, Deepika; Yang, Zhongan; Karydas, Anna M; Serrero, Ginette; Yue, Binbin; Boxer, Adam L; Miller, Bruce L; Coppola, Giovanni
2018-05-01
Changes in progranulin ( GRN ) expression have been hypothesized to alter risk for Alzheimer's disease (AD). We investigated the relationship between GRN expression in peripheral blood and clinical diagnosis of AD and mild cognitive impairment (MCI). Peripheral blood progranulin gene expression was measured, using microarrays from Alzheimer's ( n = 186), MCI ( n = 118), and control ( n = 204) subjects from the University of California San Francisco Memory and Aging Center (UCSF-MAC) and two independent published series (AddNeuroMed and ADNI). GRN gene expression was correlated with clinical, demographic, and genetic data, including APOE haplotype and the GRN rs5848 single-nucleotide polymorphism. Finally, we assessed progranulin protein levels, using enzyme-linked immunosorbent assay, and methylation status using methylation microarrays. We observed an increase in blood progranulin gene expression and a decrease in GRN promoter methylation in males ( P = 0.007). Progranulin expression was 13% higher in AD and MCI patients compared with controls in the UCSF-MAC cohort ( F 2,505 = 10.41, P = 3.72*10 -5 ). This finding was replicated in the AddNeuroMed ( F 2,271 = 17.9, P = 4.83*10 -8 ) but not the ADNI series. The rs5848 SNP (T-allele) predicted decreased blood progranulin gene expression ( P = 0.03). The APOE4 haplotype was positively associated with progranulin expression independent of diagnosis ( P = 0.04). Finally, we did not identify differences in plasma progranulin protein levels or gene methylation between diagnostic categories. Progranulin mRNA is elevated in peripheral blood of patients with AD and MCI and its expression is associated with numerous genetic and demographic factors. These data suggest a role in the pathogenesis of neurodegenerative dementias besides frontotemporal dementia.
Fukuda, Ken-ichi; Kasai, Shinya; Hasegawa, Junko; Hayashida, Masakazu; Minami, Masabumi; Ikeda, Kazutaka
2013-01-01
Individual differences in the sensitivity to fentanyl, a widely used opioid analgesic, lead to different proper doses of fentanyl, which can hamper effective pain treatment. Voltage-activated Ca2+ channels (VACCs) play a crucial role in the nervous system by controlling membrane excitability and calcium signaling. Cav2.3 (R-type) VACCs have been especially thought to play critical roles in pain pathways and the analgesic effects of opioids. However, unknown is whether single-nucleotide polymorphisms (SNPs) of the human CACNA1E (calcium channel, voltage-dependent, R type, alpha 1E subunit) gene that encodes Cav2.3 VACCs influence the analgesic effects of opioids. Thus, the present study examined associations between fentanyl sensitivity and SNPs in the human CACNA1E gene in 355 Japanese patients who underwent painful orofacial cosmetic surgery, including bone dissection. We first conducted linkage disequilibrium (LD) analyses of 223 SNPs in a region that contains the CACNA1E gene using genomic samples from 100 patients, and a total of 13 LD blocks with 42 Tag SNPs were observed within and around the CACNA1E gene region. In the preliminary study using the same 100 genomic samples, only the rs3845446 A/G SNP was significantly associated with perioperative fentanyl use among these 42 Tag SNPs. In a confirmatory study using the other 255 genomic samples, this SNP was also significantly associated with perioperative fentanyl use. Thus, we further analyzed associations between genotypes of this SNP and all of the clinical data using a total of 355 samples. The rs3845446 A/G SNP was associated with intraoperative fentanyl use, 24 h postoperative fentanyl requirements, and perioperative fentanyl use. Subjects who carried the minor G allele required significantly less fentanyl for pain control compared with subjects who did not carry this allele. Although further validation is needed, the present findings show the possibility of the involvement of CACNA1E gene polymorphisms in fentanyl sensitivity. PMID:23940630
Mahato, Ajay Kumar; Sharma, Nimisha; Singh, Akshay; Srivastav, Manish; Jaiprakash; Singh, Sanjay Kumar; Singh, Anand Kumar; Sharma, Tilak Raj; Singh, Nagendra Kumar
2016-01-01
Mango (Mangifera indica L.) is called "king of fruits" due to its sweetness, richness of taste, diversity, large production volume and a variety of end usage. Despite its huge economic importance genomic resources in mango are scarce and genetics of useful horticultural traits are poorly understood. Here we generated deep coverage leaf RNA sequence data for mango parental varieties 'Neelam', 'Dashehari' and their hybrid 'Amrapali' using next generation sequencing technologies. De-novo sequence assembly generated 27,528, 20,771 and 35,182 transcripts for the three genotypes, respectively. The transcripts were further assembled into a non-redundant set of 70,057 unigenes that were used for SSR and SNP identification and annotation. Total 5,465 SSR loci were identified in 4,912 unigenes with 288 type I SSR (n ≥ 20 bp). One hundred type I SSR markers were randomly selected of which 43 yielded PCR amplicons of expected size in the first round of validation and were designated as validated genic-SSR markers. Further, 22,306 SNPs were identified by aligning high quality sequence reads of the three mango varieties to the reference unigene set, revealing significantly enhanced SNP heterozygosity in the hybrid Amrapali. The present study on leaf RNA sequencing of mango varieties and their hybrid provides useful genomic resource for genetic improvement of mango.
Mahato, Ajay Kumar; Sharma, Nimisha; Singh, Akshay; Srivastav, Manish; Jaiprakash; Singh, Sanjay Kumar; Singh, Anand Kumar; Sharma, Tilak Raj; Singh, Nagendra Kumar
2016-01-01
Mango (Mangifera indica L.) is called “king of fruits” due to its sweetness, richness of taste, diversity, large production volume and a variety of end usage. Despite its huge economic importance genomic resources in mango are scarce and genetics of useful horticultural traits are poorly understood. Here we generated deep coverage leaf RNA sequence data for mango parental varieties ‘Neelam’, ‘Dashehari’ and their hybrid ‘Amrapali’ using next generation sequencing technologies. De-novo sequence assembly generated 27,528, 20,771 and 35,182 transcripts for the three genotypes, respectively. The transcripts were further assembled into a non-redundant set of 70,057 unigenes that were used for SSR and SNP identification and annotation. Total 5,465 SSR loci were identified in 4,912 unigenes with 288 type I SSR (n ≥ 20 bp). One hundred type I SSR markers were randomly selected of which 43 yielded PCR amplicons of expected size in the first round of validation and were designated as validated genic-SSR markers. Further, 22,306 SNPs were identified by aligning high quality sequence reads of the three mango varieties to the reference unigene set, revealing significantly enhanced SNP heterozygosity in the hybrid Amrapali. The present study on leaf RNA sequencing of mango varieties and their hybrid provides useful genomic resource for genetic improvement of mango. PMID:27736892
Lv, Zhiping; Li, Ying; Wu, Yongzhong; Qu, Yi
2016-08-01
To evaluate the association of intercellular cell-adhesion molecule 1 (ICAM-1) and high-mobility group A1 (HMGA1) gene variants with diabetic retinopathy (DR) in a Chinese type 2 diabetes mellitus (T2DM) cohort. A total of 792 patients with T2DM were enrolled and categorized into two groups: (1) the DR group consisted of 448 patients, which was further subclassified into the proliferative DR (PDR) group with 220 patients and the nonproliferative DR (NPDR) group with 228 patients; (2) the diabetes without retinopathy (DNR) group comprised 344 patients who had no signs of DR. The single-nucleotide polymorphism (SNP) rs5498 in ICAM-1 gene and IVS5-13insC variant in HMGA1 gene were genotyped. No evident association was found in the allele frequencies between SNP rs5498 in ICAM-1 gene and DR patients; the combined p values for the additive, dominant, and recessive models in genotype were greater than 0.05. No significant association was identified between the IVS5-13insC variant in HMGA1 gene and DR individuals. Our results revealed that SNP rs5498 in ICAM-1 gene and IVS5-13insC variant in HMGA1 gene were not associated with the susceptibility of DR in the Chinese T2DM cohort.
[Analysis of the parental origin of MECP2 mutations in patients with Rett syndrome].
Zhang, Jing-jing; Bao, Xin-hua; Cao, Guang-na; Jiang, Sheng-ling; Zhu, Xing-wang; Lu, Hong-mei; Jia, Li-fang; Pan, Hong; Wu, Xi-ru
2010-04-01
To identify the parental origin of methyl-CpG-binding protein 2 (MECP2) gene mutations in Chinese patients with Rett syndrome. Single nucleotide polymorphisms (SNPs) in intron 3 of the MECP2 gene were analyzed by PCR and sequencing in 115 patients with Rett syndrome. Then sequencing of the SNP region was performed for the fathers of the patients who had at least one SNP, to determine which allele was from the father. Then allele-specific PCR was performed and the products were sequenced to see whether the allele from father or mother harbored the mutation. Seventy-six of the 115 patients had at least one SNP. Three hot SNPs were found in these patients. They were: IVS3+22C >G, IVS3+266C >T and IVS3+683C>T. Among the 76 cases, 73 had a paternal origin of MECP2 mutations, and the other 3 had a maternal origin. There were multiple types of MECP2 mutation of the paternal origin, including 4 frame shift, 2 deletion and 67 point (56C >T, 6C >G, 2A >G, 2G >T and 1A >T) mutations. The mutation types of the 3 patients with maternal origin included 2 frame shift and 1 point (C >T) mutation. In Chinese RTT patients, the MECP2 mutations are mostly of paternal origin.
Methylation oligonucleotide microarray: a novel tool to analyze methylation patterns
NASA Astrophysics Data System (ADS)
Hou, Peng; Ji, Meiju; He, Nongyao; Lu, Zuhong
2003-04-01
A new technique to analyze methylation patterns in several adjacent CpG sites was developed and reported here. We selected a 336bp segment of the 5"-untranslated region and the first exon of the p16Ink4a gene, which include the most densely packed CpG fragment of the islands containing 32 CpG dinucleotides, as the investigated target. The probes that include all types of methylation patterns were designed to fabricate a DNA microarray to determine the methylation patterns of seven adjacent CpG dinucleotides sites. High accuracy and reproducibility were observed in several parallel experiments. The results led us to the conclusion that the methylation oligonucleotide microarray can be applied as a novel and powerful tool to map methylation patterns and changes in multiple CpG island loci in a variety of tumors.
NASA Astrophysics Data System (ADS)
Huang, Yong-Zhen; Zhang, Liang-Zhi; Lai, Xin-Sheng; Li, Ming-Xun; Sun, Yu-Jia; Li, Cong-Jun; Lan, Xian-Yong; Lei, Chu-Zhao; Zhang, Chun-Lei; Zhao, Xin; Chen, Hong
2014-04-01
Zinc finger, BED-type containing 6 (ZBED6) is an important transcription factor in placental mammals, affecting development, cell proliferation and growth. In this study, we found that the expression of the ZBED6 and IGF2 were upregulated during C2C12 differentiation. The IGF2 expression levels were negatively associated with the methylation status in beef cattle (P < 0.05). A luciferase assay for the IGF2 intron 3 and P3 promoter showed that the mutant-type 439 A-SNP-pGL3 in driving reporter gene transcription is significantly higher than that of the wild-type 439 G-SNP-pGL3 construct (P < 0.05). An over-expression assay revealed that ZBED6 regulate IGF2 expression and promote myoblast differentiation. Furthermore, knockdown of ZBED6 led to IGF2 expression change in vitro. Taken together, these results suggest that ZBED6 inhibits IGF2 activity and expression via a G to A transition disrupts the interaction. Thus, we propose that ZBED6 plays a critical role in myogenic differentiation.
Gheit, Tarik; Tommasino, Massimo
2011-01-01
Epidemiological and functional studies have clearly demonstrated that certain types of human papillomavirus (HPV) from the genus alpha of the HPV phylogenetic tree, referred to as high-risk (HR) types, are the etiological cause of cervical cancer. Several methods for HPV detection and typing have been developed, and their importance in clinical and epidemiological studies has been well demonstrated. However, comparative studies have shown that several assays have different sensitivities for the detection of specific HPV types, particularly in the case of multiple infections. In this chapter, we describe a novel one-shot method for the detection and typing of 19 mucosal HR HPV types (types 16, 18, 26, 31, 33, 35, 39, 45, 51, 52, 53, 56, 58, 59, 66, 68, 70, 73, and 82). The assay combines the advantages of the multiplex PCR methods, i.e., high sensitivity and the possibility to perform multiple amplifications in a single reaction, with an array primer extension (APEX) assay. The latter method offers the benefits of Sanger dideoxy sequencing with the high-throughput potential of the microarray. Initial studies have revealed that the assay is very sensitive in detecting multiple HPV infections.
Association of myostatin variants with growth traits of Zhikong scallop ( Chlamys farreri)
NASA Astrophysics Data System (ADS)
Fu, Qiang; Guo, Huihui; Feng, Liying; Li, Xue; Zhang, Lingling; Wang, Shi; Hu, Xiaoli; Bao, Zhenmin
2016-02-01
Scallop is a popular sea food and an important aquaculture shellfish. Identification of genes and genetic variants relating to scallop growth could benefit high-yielding scallop breeding. Myostatin ( MSTN) is a conservative regulator of muscle growth, and has become one of the most important target genes for genetic improvement of the production of farmed animals. In this study, four single nucleotide polymorphisms (SNPs) were identified in the 5' flanking region of MSTN gene ( CfMSTN) in Zhikong scallop ( Chlamys farreri). The association of these SNPs with scallop growth traits, including shell length, shell height, body weight and striated muscle weight was analyzed. The SNP g-1162G
Differentiation of the seven major lyssavirus species by oligonucleotide microarray.
Xi, Jin; Guo, Huancheng; Feng, Ye; Xu, Yunbin; Shao, Mingfu; Su, Nan; Wan, Jiayu; Li, Jiping; Tu, Changchun
2012-03-01
An oligonucleotide microarray, LyssaChip, has been developed and verified as a highly specific diagnostic tool for differentiation of the 7 major lyssavirus species. As with conventional typing microarray methods, the LyssaChip relies on sequence differences in the 371-nucleotide region coding for the nucleoprotein. This region was amplified using nested reverse transcription-PCR primers that bind to the 7 major lyssaviruses. The LyssaChip includes 57 pairs of species typing and corresponding control oligonucleotide probes (oligoprobes) immobilized on glass slides, and it can analyze 12 samples on a single slide within 8 h. Analysis of 111 clinical brain specimens (65 from animals with suspected rabies submitted to the laboratory and 46 of butchered dog brain tissues collected from restaurants) showed that the chip method was 100% sensitive and highly consistent with the "gold standard," a fluorescent antibody test (FAT). The chip method could detect rabies virus in highly decayed brain tissues, whereas the FAT did not, and therefore the chip test may be more applicable to highly decayed brain tissues than the FAT. LyssaChip may provide a convenient and inexpensive alternative for diagnosis and differentiation of rabies and rabies-related diseases.
Variation of gene expression in Bacillus subtilis samples of fermentation replicates.
Zhou, Ying; Yu, Wen-Bang; Ye, Bang-Ce
2011-06-01
The application of comprehensive gene expression profiling technologies to compare wild and mutated microorganism samples or to assess molecular differences between various treatments has been widely used. However, little is known about the normal variation of gene expression in microorganisms. In this study, an Agilent customized microarray representing 4,106 genes was used to quantify transcript levels of five-repeated flasks to assess normal variation in Bacillus subtilis gene expression. CV analysis and analysis of variance were employed to investigate the normal variance of genes and the components of variance, respectively. The results showed that above 80% of the total variation was caused by biological variance. For the 12 replicates, 451 of 4,106 genes exhibited variance with CV values over 10%. The functional category enrichment analysis demonstrated that these variable genes were mainly involved in cell type differentiation, cell type localization, cell cycle and DNA processing, and spore or cyst coat. Using power analysis, the minimal biological replicate number for a B. subtilis microarray experiment was determined to be six. The results contribute to the definition of the baseline level of variability in B. subtilis gene expression and emphasize the importance of replicate microarray experiments.
Nowrousian, Minou; Ringelberg, Carol; Dunlap, Jay C; Loros, Jennifer J; Kück, Ulrich
2005-04-01
The filamentous fungus Sordaria macrospora forms complex three-dimensional fruiting bodies that protect the developing ascospores and ensure their proper discharge. Several regulatory genes essential for fruiting body development were previously isolated by complementation of the sterile mutants pro1, pro11 and pro22. To establish the genetic relationships between these genes and to identify downstream targets, we have conducted cross-species microarray hybridizations using cDNA arrays derived from the closely related fungus Neurospora crassa and RNA probes prepared from wild-type S. macrospora and the three developmental mutants. Of the 1,420 genes which gave a signal with the probes from all the strains used, 172 (12%) were regulated differently in at least one of the three mutants compared to the wild type, and 17 (1.2%) were regulated differently in all three mutant strains. Microarray data were verified by Northern analysis or quantitative real time PCR. Among the genes that are up- or down-regulated in the mutant strains are genes encoding the pheromone precursors, enzymes involved in melanin biosynthesis and a lectin-like protein. Analysis of gene expression in double mutants revealed a complex network of interaction between the pro gene products.
Apolipoprotein H promoter polymorphisms in relation to lupus and lupus-related phenotypes.
Suresh, Sangita; Demirci, F Yesim K; Jacobs, Erin; Kao, Amy H; Rhew, Elisa Y; Sanghera, Dharambir K; Selzer, Faith; Sutton-Tyrrell, Kim; McPherson, David; Bontempo, Franklin A; Kammerer, Candace M; Ramsey-Goldman, Rosalind; Manzi, Susan; Kamboh, M Ilyas
2009-02-01
Sequence variation in gene promoters is often associated with disease risk. We tested the hypothesis that common promoter variation in the APOH gene (encoding for ss(2)-glycoprotein I) is associated with systemic lupus erythematosus (SLE) risk and SLE-related clinical phenotypes in a Caucasian cohort. We used a case-control design and genotyped 345 women with SLE and 454 healthy control women for 8 APOH promoter single-nucleotide polymorphisms (SNP; -1284C>G, -1219G>A, -1190G>C, -759A>G, -700C>A, -643T>C, -38G>A, and -32C>A).Association analyses were performed on single SNP and haplotypes. Haplotype analyses were performed using EH (Estimate Haplotype-frequencies) and Haploview programs. In vitro reporter gene assay was performed in COS-1 cells. Electrophoretic mobility shift assay (EMSA) was performed using HepG2 nuclear cells. Overall haplotype distribution of the APOH promoter SNP was significantly different between cases and controls (p = 0.009). The -643C allele was found to be protective against carotid plaque formation (adjusted OR 0.37, p = 0.013) among patients with SLE. The -643C allele was associated with a ~2-fold decrease in promoter activity as compared to wild-type -643T allele (mean +/- standard deviation: 3.94 +/- 0.05 vs 6.99 +/- 0.68, p = 0.016). EMSA showed that the -643T>C SNP harbors a binding site for a nuclear factor. The -1219G>A SNP showed a significant association with the risk of lupus nephritis (age-adjusted OR 0.36, p = 0.016). Our data indicate that APOH promoter variants may be involved in the etiology of SLE, especially the risk for autoimmune-mediated cardiovascular disease.
Li, Xiao-Mu; Ling, Yan; Lu, Da-Ru; Lu, Zhi-Qiang; Liu, Ying; Chen, Hong-Yan; Gao, Xin
2012-10-01
Proprotein convertase subtilisin/kexin-type 1 (PCSK1) is a prohormone convertase that has an important role in prohormone maturation including the process of prorenin to renin. We studied the association of the PCSK1 single-nucleotide polymorphism (SNP) rs6235 (encoding an S690T substitution) with essential hypertension (EH), obesity and related traits in the Han Chinese population. The rs6235 SNP in the PCSK1 gene was investigated using a case-control study design, with 1034 hypertension cases and 1112 normotensive controls. In this study, the rs6235 SNP was significantly associated with hypertension (OR=1.26, 95% CI (1.10-1.46), P=0.001); the odds ratios of GC vs GG and CC vs GG were 1.30 (95% CI (1.06-1.58), P=0.010) and 1.55 (95% CI (1.12-2.13), P=0.007), respectively. In the controls, the C-allele was associated with increased systolic (P=0.010) and diastolic (P=0.010) blood pressure levels. In all of the EH patients and EH patients without a history of renin-angiotensin-aldosterone (RAA) system-related antagonists, the C-allele was associated with increased plasma renin activity (P=0.00004 and 0.002, respectively) and aldosterone levels (P=0.018 and 0.005, respectively). The C-allele was also associated with increased body mass index (BMI) (P=0.010) in the normotensive controls. In conclusion, the PCSK1 SNP rs6235 was associated with EH and blood pressure in the Han Chinese population, and this association may be mediated by the SNP's effect on RAA levels. rs6235 was also associated with BMI in this population.
Tahir, Imtiaz Mahmood; Iqbal, Tahira; Saleem, Sadaf; Perveen, Sofia; Farooqi, Aboubakker
2017-01-01
Interindividual variability in polymorphic uridine diphosphate-glucuronosyltransferase 1A1 (UGT1A1) ascribed to genetic diversity is associated with relative glucuronidation level among individuals. The present research was aimed to study the effect of 2 important single nucleotide polymorphisms (SNPs; rs8330 and rs10929303) of UGT1A1 gene on glucuronidation status of acetaminophen in healthy volunteers (n = 109). Among enrolled volunteers, 54.13% were male (n = 59) and 45.87% were female (n = 50). The in vivo activity of UGT1A1 was investigated by high-performance liquid chromatography-based analysis of glucuronidation status (ie, acetaminophen and acetaminophen glucuronide) in human volunteers after oral intake of a single dose (1000 mg) of acetaminophen. The TaqMan SNP genotyping assay was used for UGT1A1 genotyping. The wild-type genotype (C/C) was observed the most frequent one for both SNPs (rs8330 and rs10929303) and associated with fast glucuronidator phenotypes. The distribution of variant genotype (G/G) for SNP rs8330 was observed in 5% of male and 8% of the female population; however, for SNP rs10929303, the G/G genotype was found in 8% of both genders. A trimodal distribution (fast, intermediate, and slow) based on phenotypes was observed. Among the male participants, the glucuronidation phenotypes were observed as 7% slow, 37% intermediate, and 56% fast glucuronidators; however, these findings for the females were slightly different as 8%, 32%, and 60% respectively. The k-statistics revealed a compelling evidence for good concordance between phenotype and genotype with a k value of 1.00 for SNP rs8330 and 0.966 for SNP rs10929303 in our population. PMID:28932176
Di Pierro, Erica A; Gianfranceschi, Luca; Di Guardo, Mario; Koehorst-van Putten, Herma Jj; Kruisselbrink, Johannes W; Longhi, Sara; Troggio, Michela; Bianco, Luca; Muranty, Hélène; Pagliarani, Giulia; Tartarini, Stefano; Letschka, Thomas; Lozano Luis, Lidia; Garkava-Gustavsson, Larisa; Micheletti, Diego; Bink, Marco Cam; Voorrips, Roeland E; Aziz, Ebrahimi; Velasco, Riccardo; Laurens, François; van de Weg, W Eric
2016-01-01
Quantitative trait loci (QTL) mapping approaches rely on the correct ordering of molecular markers along the chromosomes, which can be obtained from genetic linkage maps or a reference genome sequence. For apple ( Malus domestica Borkh), the genome sequence v1 and v2 could not meet this need; therefore, a novel approach was devised to develop a dense genetic linkage map, providing the most reliable marker-loci order for the highest possible number of markers. The approach was based on four strategies: (i) the use of multiple full-sib families, (ii) the reduction of missing information through the use of HaploBlocks and alternative calling procedures for single-nucleotide polymorphism (SNP) markers, (iii) the construction of a single backcross-type data set including all families, and (iv) a two-step map generation procedure based on the sequential inclusion of markers. The map comprises 15 417 SNP markers, clustered in 3 K HaploBlock markers spanning 1 267 cM, with an average distance between adjacent markers of 0.37 cM and a maximum distance of 3.29 cM. Moreover, chromosome 5 was oriented according to its homoeologous chromosome 10. This map was useful to improve the apple genome sequence, design the Axiom Apple 480 K SNP array and perform multifamily-based QTL studies. Its collinearity with the genome sequences v1 and v3 are reported. To our knowledge, this is the shortest published SNP map in apple, while including the largest number of markers, families and individuals. This result validates our methodology, proving its value for the construction of integrated linkage maps for any outbreeding species.
Novel Thrombotic Function of a Human SNP in STXBP5 Revealed by CRISPR/Cas9 Gene Editing in Mice.
Zhu, Qiuyu Martin; Ko, Kyung Ae; Ture, Sara; Mastrangelo, Michael A; Chen, Ming-Huei; Johnson, Andrew D; O'Donnell, Christopher J; Morrell, Craig N; Miano, Joseph M; Lowenstein, Charles J
2017-02-01
To identify and characterize the effect of a SNP (single-nucleotide polymorphism) in the STXBP5 locus that is associated with altered thrombosis in humans. GWAS (genome-wide association studies) have identified numerous SNPs associated with human thrombotic phenotypes, but determining the functional significance of an individual candidate SNP can be challenging, particularly when in vivo modeling is required. Recent GWAS led to the discovery of STXBP5 as a regulator of platelet secretion in humans. Further clinical studies have identified genetic variants of STXBP5 that are linked to altered plasma von Willebrand factor levels and thrombosis in humans, but the functional significance of these variants in STXBP5 is not understood. We used CRISPR/Cas9 (clustered regularly interspaced short palindromic repeats/CRISPR-associated 9) techniques to produce a precise mouse model carrying a human coding SNP rs1039084 (encoding human p. N436S) in the STXBP5 locus associated with decreased thrombosis. Mice carrying the orthologous human mutation (encoding p. N437S in mouse STXBP5) have lower plasma von Willebrand factor levels, decreased thrombosis, and decreased platelet secretion compared with wild-type mice. This thrombosis phenotype recapitulates the phenotype of humans carrying the minor allele of rs1039084. Decreased plasma von Willebrand factor and platelet activation may partially explain the decreased thrombotic phenotype in mutant mice. Using precise mammalian genome editing, we have identified a human nonsynonymous SNP rs1039084 in the STXBP5 locus as a causal variant for a decreased thrombotic phenotype. CRISPR/Cas9 genetic editing facilitates the rapid and efficient generation of animals to study the function of human genetic variation in vascular diseases. © 2016 American Heart Association, Inc.
Van Inghelandt, Delphine; Melchinger, Albrecht E.; Lebreton, Claude
2010-01-01
Information about the genetic diversity and population structure in elite breeding material is of fundamental importance for the improvement of crops. The objectives of our study were to (a) examine the population structure and the genetic diversity in elite maize germplasm based on simple sequence repeat (SSR) markers, (b) compare these results with those obtained from single nucleotide polymorphism (SNP) markers, and (c) compare the coancestry coefficient calculated from pedigree records with genetic distance estimates calculated from SSR and SNP markers. Our study was based on 1,537 elite maize inbred lines genotyped with 359 SSR and 8,244 SNP markers. The average number of alleles per locus, of group specific alleles, and the gene diversity (D) were higher for SSRs than for SNPs. Modified Roger’s distance (MRD) estimates and membership probabilities of the STRUCTURE matrices were higher for SSR than for SNP markers but the germplasm organization in four heterotic pools was consistent with STRUCTURE results based on SSRs and SNPs. MRD estimates calculated for the two marker systems were highly correlated (0.87). Our results suggested that the same conclusions regarding the structure and the diversity of heterotic pools could be drawn from both markers types. Furthermore, although our results suggested that the ratio of the number of SSRs and SNPs required to obtain MRD or D estimates with similar precision is not constant across the various precision levels, we propose that between 7 and 11 times more SNPs than SSRs should be used for analyzing population structure and genetic diversity. Electronic supplementary material The online version of this article (doi:10.1007/s00122-009-1256-2) contains supplementary material, which is available to authorized users. PMID:20063144
Di Pierro, Erica A; Gianfranceschi, Luca; Di Guardo, Mario; Koehorst-van Putten, Herma JJ; Kruisselbrink, Johannes W; Longhi, Sara; Troggio, Michela; Bianco, Luca; Muranty, Hélène; Pagliarani, Giulia; Tartarini, Stefano; Letschka, Thomas; Lozano Luis, Lidia; Garkava-Gustavsson, Larisa; Micheletti, Diego; Bink, Marco CAM; Voorrips, Roeland E; Aziz, Ebrahimi; Velasco, Riccardo; Laurens, François; van de Weg, W Eric
2016-01-01
Quantitative trait loci (QTL) mapping approaches rely on the correct ordering of molecular markers along the chromosomes, which can be obtained from genetic linkage maps or a reference genome sequence. For apple (Malus domestica Borkh), the genome sequence v1 and v2 could not meet this need; therefore, a novel approach was devised to develop a dense genetic linkage map, providing the most reliable marker-loci order for the highest possible number of markers. The approach was based on four strategies: (i) the use of multiple full-sib families, (ii) the reduction of missing information through the use of HaploBlocks and alternative calling procedures for single-nucleotide polymorphism (SNP) markers, (iii) the construction of a single backcross-type data set including all families, and (iv) a two-step map generation procedure based on the sequential inclusion of markers. The map comprises 15 417 SNP markers, clustered in 3 K HaploBlock markers spanning 1 267 cM, with an average distance between adjacent markers of 0.37 cM and a maximum distance of 3.29 cM. Moreover, chromosome 5 was oriented according to its homoeologous chromosome 10. This map was useful to improve the apple genome sequence, design the Axiom Apple 480 K SNP array and perform multifamily-based QTL studies. Its collinearity with the genome sequences v1 and v3 are reported. To our knowledge, this is the shortest published SNP map in apple, while including the largest number of markers, families and individuals. This result validates our methodology, proving its value for the construction of integrated linkage maps for any outbreeding species. PMID:27917289
Sun, Yueying; Lu, Xiaohui; Su, Fengxia; Wang, Limei; Liu, Chenghui; Duan, Xinrui; Li, Zhengping
2015-12-15
Most of practical methods for detection of single nucleotide polymorphism (SNP) need at least two steps: amplification (usually by PCR) and detection of SNP by using the amplification products. Ligase chain reaction (LCR) can integrate the amplification and allele discrimination in one step. However, the detection of LCR products still remains a great challenge for highly sensitive and quantitative SNP detection. Herein, a simple but robust strategy for real-time fluorescence LCR has been developed for highly sensitive and quantitative SNP detection. A pair of LCR probes are firstly labeled with a fluorophore and a quencher, respectively. When the pair of LCR probes are ligated in LCR, the fluorophore will be brought close to the quencher, and thus, the fluorescence will be specifically quenched by fluorescence resonance energy transfer (FRET). The decrease of fluorescence intensity resulted from FRET can be real-time monitored in the LCR process. With the proposed real-time fluorescence LCR assay, 10 aM DNA targets or 100 pg genomic DNA can be accurately determined and as low as 0.1% mutant DNA can be detected in the presence of a large excess of wild-type DNA, indicating the high sensitivity and specificity. The real-time measuring does not require the detection step after LCR and gives a wide dynamic range for detection of DNA targets (from 10 aM to 1 pM). As LCR has been widely used for detection of SNP, DNA methylation, mRNA and microRNA, the real-time fluorescence LCR assay shows great potential for various genetic analysis. Copyright © 2015 Elsevier B.V. All rights reserved.
AGARWAL, SANDEEP K.; GOURH, PRAVITT; SHETE, SANJAY; PAZ, GENE; DIVECHA, DIPAL; REVEILLE, JOHN D.; ASSASSI, SHERVIN; TAN, FILEMON K.; MAYES, MAUREEN D.; ARNETT, FRANK C.
2010-01-01
Objective IL23R has been identified as a susceptibility gene for development of multiple autoimmune diseases. We investigated the possible association of IL23R with systemic sclerosis (SSc), an autoimmune disease that leads to the development of cutaneous and visceral fibrosis. Methods We tested 9 single-nucleotide polymorphisms (SNP) in IL23R for association with SSc in a cohort of 1402 SSc cases and 1038 controls. IL23R SNP tested were previously identified as SNP showing associations with inflammatory bowel disease. Results Case-control comparisons revealed no statistically significant differences between patients and healthy controls with any of the IL23R polymorphisms. Analyses of subsets of SSc patients showed that rs11209026 (Arg381Gln variant) was associated with anti-topoisomerase I antibody (ATA)-positive SSc (p = 0.001)) and rs11465804 SNP was associated with diffuse and ATA-positive SSc (p = 0.0001, p = 0.0026, respectively). These associations remained significant after accounting for multiple comparisons using the false discovery rate method. Wild-type genotype at both rs11209026 and rs11465804 showed significant protection against the presence of pulmonary hypertension (PHT). (p = 3×10−5, p = 1×10−5, respectively). Conclusion Polymorphisms in IL23R are associated with susceptibility to ATA-positive SSc and protective against development of PHT in patients with SSc. PMID:19918037
Impact of SNPs on Protein Phosphorylation Status in Rice (Oryza sativa L.).
Lin, Shoukai; Chen, Lijuan; Tao, Huan; Huang, Jian; Xu, Chaoqun; Li, Lin; Ma, Shiwei; Tian, Tian; Liu, Wei; Xue, Lichun; Ai, Yufang; He, Huaqin
2016-11-11
Single nucleotide polymorphisms (SNPs) are widely used in functional genomics and genetics research work. The high-quality sequence of rice genome has provided a genome-wide SNP and proteome resource. However, the impact of SNPs on protein phosphorylation status in rice is not fully understood. In this paper, we firstly updated rice SNP resource based on the new rice genome Ver. 7.0, then systematically analyzed the potential impact of Non-synonymous SNPs (nsSNPs) on the protein phosphorylation status. There were 3,897,312 SNPs in Ver. 7.0 rice genome, among which 9.9% was nsSNPs. Whilst, a total 2,508,261 phosphorylated sites were predicted in rice proteome. Interestingly, we observed that 150,197 (39.1%) nsSNPs could influence protein phosphorylation status, among which 52.2% might induce changes of protein kinase (PK) types for adjacent phosphorylation sites. We constructed a database, SNP_rice, to deposit the updated rice SNP resource and phosSNPs information. It was freely available to academic researchers at http://bioinformatics.fafu.edu.cn. As a case study, we detected five nsSNPs that potentially influenced heterotrimeric G proteins phosphorylation status in rice, indicating that genetic polymorphisms showed impact on the signal transduction by influencing the phosphorylation status of heterotrimeric G proteins. The results in this work could be a useful resource for future experimental identification and provide interesting information for better rice breeding.
Burall, Laurel S; Grim, Christopher J; Datta, Atin R
2017-01-01
Four listeriosis incidences/outbreaks, spanning 19 months, have been linked to Listeria monocytogenes serotype 4b variant (4bV) strains. Three of these incidents can be linked to a defined geographical region, while the fourth is likely to be linked. In this study, whole genome sequencing (WGS) of strains from these incidents was used for genomic comparisons using two approached. The first was JSpecies tetramer, which analyzed tetranucleotide frequency to assess relatedness. The second, the CFSAN SNP Pipeline, was used to perform WGS SNP analyses against three different reference genomes to evaluate relatedness by SNP distances. In each case, unrelated strains were included as controls. The analyses showed that strains from these incidents form a highly related clade with SNP differences of ≤101 within the clade and >9000 against other strains. Multi-Virulence-Locus Sequence Typing, a third standardized approach for evaluation relatedness, was used to assess the genetic drift in six conserved, known virulence loci and showed a different clustering pattern indicating possible differences in selection pressure experienced by these genes. These data suggest a high degree of relatedness among these 4bV strains linked to a defined geographic region and also highlight the possibility of alterations related to adaptation and virulence.
Braam, W; Keijzer, H; Struijker Boudier, H; Didden, R; Smits, M; Curfs, L
2013-11-01
In some of our patients with intellectual disabilities (ID) and sleep problems, the initial good response to melatonin disappeared within a few weeks after starting treatment. In these patients melatonin levels at noon were extremely high (>50 pg/ml). We hypothesise that the disappearing effectiveness is associated with slow metabolisation of melatonin because of a single nucleotide polymorphism (SNP) of CYP1A2. In this pilot study we analysed DNA extracted from saliva samples of 15 consecutive patients with disappearing effectiveness of melatonin. Saliva was collected at noon and 4 pm for measuring melatonin levels. In all patients' salivary melatonin levels at noon were >50 or melatonin half time was > 5 h. A SNP was found in eight of 15 patients. The allele 1C was found in two patients and in six patients the 1F allele was found. Of 15 patients with disappearing effectiveness of melatonin, seven were diagnosed with autism spectrum disorder, and in four of them a SNP was found. The other eight patients were known with a genetic syndrome. In six of them behaviour was considered to be autistic-type and in three of them a SNP was found. This finding may give a new direction for research into the genetic background of autism. © 2012 The Authors. Journal of Intellectual Disability Research © 2012 John Wiley & Sons Ltd, MENCAP & IASSID.
Patterns of diversity and recombination along chromosome 1 of maize (Zea mays ssp. mays L.).
Tenaillon, Maud I; Sawkins, Mark C; Anderson, Lorinda K; Stack, Stephen M; Doebley, John; Gaut, Brandon S
2002-01-01
We investigate the interplay between genetic diversity and recombination in maize (Zea mays ssp. mays). Genetic diversity was measured in three types of markers: single-nucleotide polymorphisms, indels, and microsatellites. All three were examined in a sample of previously published DNA sequences from 21 loci on maize chromosome 1. Small indels (1-5 bp) were numerous and far more common than large indels. Furthermore, large indels (>100 bp) were infrequent in the population sample, suggesting they are slightly deleterious. The 21 loci also contained 47 microsatellites, of which 33 were polymorphic. Diversity in SNPs, indels, and microsatellites was compared to two measures of recombination: C (=4Nc) estimated from DNA sequence data and R based on a quantitative recombination nodule map of maize synaptonemal complex 1. SNP diversity was correlated with C (r = 0.65; P = 0.007) but not with R (r = -0.10; P = 0.69). Given the lack of correlation between R and SNP diversity, the correlation between SNP diversity and C may be driven by demography. In contrast to SNP diversity, microsatellite diversity was correlated with R (r = 0.45; P = 0.004) but not C (r = -0.025; P = 0.55). The correlation could arise if recombination is mutagenic for microsatellites, or it may be consistent with background selection that is apparent only in this class of rapidly evolving markers. PMID:12454083
Recommendations for the use of microarrays in prenatal diagnosis.
Suela, Javier; López-Expósito, Isabel; Querejeta, María Eugenia; Martorell, Rosa; Cuatrecasas, Esther; Armengol, Lluis; Antolín, Eugenia; Domínguez Garrido, Elena; Trujillo-Tiebas, María José; Rosell, Jordi; García Planells, Javier; Cigudosa, Juan Cruz
2017-04-07
Microarray technology, recently implemented in international prenatal diagnosis systems, has become one of the main techniques in this field in terms of detection rate and objectivity of the results. This guideline attempts to provide background information on this technology, including technical and diagnostic aspects to be considered. Specifically, this guideline defines: the different prenatal sample types to be used, as well as their characteristics (chorionic villi samples, amniotic fluid, fetal cord blood or miscarriage tissue material); variant reporting policies (including variants of uncertain significance) to be considered in informed consents and prenatal microarray reports; microarray limitations inherent to the technique and which must be taken into account when recommending microarray testing for diagnosis; a detailed clinical algorithm recommending the use of microarray testing and its introduction into routine clinical practice within the context of other genetic tests, including pregnancies in families with a genetic history or specific syndrome suspicion, first trimester increased nuchal translucency or second trimester heart malformation and ultrasound findings not related to a known or specific syndrome. This guideline has been coordinated by the Spanish Association for Prenatal Diagnosis (AEDP, «Asociación Española de Diagnóstico Prenatal»), the Spanish Human Genetics Association (AEGH, «Asociación Española de Genética Humana») and the Spanish Society of Clinical Genetics and Dysmorphology (SEGCyD, «Sociedad Española de Genética Clínica y Dismorfología»). Copyright © 2017 Elsevier España, S.L.U. All rights reserved.
HuH-7 reference genome profile: complex karyotype composed of massive loss of heterozygosity.
Kasai, Fumio; Hirayama, Noriko; Ozawa, Midori; Satoh, Motonobu; Kohara, Arihiro
2018-05-17
Human cell lines represent a valuable resource as in vitro experimental models. A hepatoma cell line, HuH-7 (JCRB0403), has been used extensively in various research fields and a number of studies using this line have been published continuously since it was established in 1982. However, an accurate genome profile, which can be served as a reliable reference, has not been available. In this study, we performed M-FISH, SNP microarray and amplicon sequencing to characterize the cell line. Single cell analysis of metaphases revealed a high level of heterogeneity with a mode of 60 chromosomes. Cytogenetic results demonstrated chromosome abnormalities involving every chromosome in addition to a massive loss of heterozygosity, which accounts for 55.3% of the genome, consistent with the homozygous variants seen in the sequence analysis. We provide empirical data that the HuH-7 cell line is composed of highly heterogeneous cell populations, suggesting that besides cell line authentication, the quality of cell lines needs to be taken into consideration in the future use of tumor cell lines.
Whole genome amplification and real-time PCR in forensic casework
Giardina, Emiliano; Pietrangeli, Ilenia; Martone, Claudia; Zampatti, Stefania; Marsala, Patrizio; Gabriele, Luciano; Ricci, Omero; Solla, Gianluca; Asili, Paola; Arcudi, Giovanni; Spinella, Aldo; Novelli, Giuseppe
2009-01-01
Background WGA (Whole Genome Amplification) in forensic genetics can eliminate the technical limitations arising from low amounts of genomic DNA (gDNA). However, it has not been used to date because any amplification bias generated may complicate the interpretation of results. Our aim in this paper was to assess the applicability of MDA to forensic SNP genotyping by performing a comparative analysis of genomic and amplified DNA samples. A 26-SNPs TaqMan panel specifically designed for low copy number (LCN) and/or severely degraded genomic DNA was typed on 100 genomic as well as amplified DNA samples. Results Aliquots containing 1, 0.1 and 0.01 ng each of 100 DNA samples were typed for a 26-SNPs panel. Similar aliquots of the same DNA samples underwent multiple displacement amplification (MDA) before being typed for the same panel. Genomic DNA samples showed 0% PCR failure rate for all three dilutions, whilst the PCR failure rate of the amplified DNA samples was 0% for the 1 ng and 0.1 ng dilutions and 0.077% for the 0.01 ng dilution. The genotyping results of both the amplified and genomic DNA samples were also compared with reference genotypes of the same samples obtained by direct sequencing. The genomic DNA samples showed genotype concordance rates of 100% for all three dilutions while the concordance rates of the amplified DNA samples were 100% for the 1 ng and 0.1 ng dilutions and 99.923% for the 0.01 ng dilution. Moreover, ten artificially-degraded DNA samples, which gave no results when analyzed by current forensic methods, were also amplified by MDA and genotyped with 100% concordance. Conclusion We investigated the suitability of MDA material for forensic SNP typing. Comparative analysis of amplified and genomic DNA samples showed that a large number of SNPs could be accurately typed starting from just 0.01 ng of template. We found that the MDA genotyping call and accuracy rates were only slightly lower than those for genomic DNA. Indeed, when 10 pg of input DNA was used in MDA, we obtained 99.923% concordance, indicating a genotyping error rate of 1/1299 (7.7 × 10-4). This is quite similar to the genotyping error rate of STRs used in current forensic analysis. Such efficiency and accuracy of SNP typing of amplified DNA suggest that MDA can also generate large amounts of genome-equivalent DNA from a minimal amount of input DNA. These results show for the first time that MDA material is suitable for SNP-based forensic protocols and in general when samples fail to give interpretable STR results. PMID:19366436
NASA Technical Reports Server (NTRS)
Khaoustov, V. I.; Risin, D.; Pellis, N. R.; Yoffe, B.; McIntire, L. V. (Principal Investigator)
2001-01-01
Developed at NASA, the rotary cell culture system (RCCS) allows the creation of unique microgravity environment of low shear force, high-mass transfer, and enables three-dimensional (3D) cell culture of dissimilar cell types. Recently we demonstrated that a simulated microgravity is conducive for maintaining long-term cultures of functional hepatocytes and promote 3D cell assembly. Using deoxyribonucleic acid (DNA) microarray technology, it is now possible to measure the levels of thousands of different messenger ribonucleic acids (mRNAs) in a single hybridization step. This technique is particularly powerful for comparing gene expression in the same tissue under different environmental conditions. The aim of this research was to analyze gene expression of hepatoblastoma cell line (HepG2) during early stage of 3D-cell assembly in simulated microgravity. For this, mRNA from HepG2 cultured in the RCCS was analyzed by deoxyribonucleic acid microarray. Analyses of HepG2 mRNA by using 6K glass DNA microarray revealed changes in expression of 95 genes (overexpression of 85 genes and downregulation of 10 genes). Our preliminary results indicated that simulated microgravity modifies the expression of several genes and that microarray technology may provide new understanding of the fundamental biological questions of how gravity affects the development and function of individual cells.
MULTI-K: accurate classification of microarray subtypes using ensemble k-means clustering
Kim, Eun-Youn; Kim, Seon-Young; Ashlock, Daniel; Nam, Dougu
2009-01-01
Background Uncovering subtypes of disease from microarray samples has important clinical implications such as survival time and sensitivity of individual patients to specific therapies. Unsupervised clustering methods have been used to classify this type of data. However, most existing methods focus on clusters with compact shapes and do not reflect the geometric complexity of the high dimensional microarray clusters, which limits their performance. Results We present a cluster-number-based ensemble clustering algorithm, called MULTI-K, for microarray sample classification, which demonstrates remarkable accuracy. The method amalgamates multiple k-means runs by varying the number of clusters and identifies clusters that manifest the most robust co-memberships of elements. In addition to the original algorithm, we newly devised the entropy-plot to control the separation of singletons or small clusters. MULTI-K, unlike the simple k-means or other widely used methods, was able to capture clusters with complex and high-dimensional structures accurately. MULTI-K outperformed other methods including a recently developed ensemble clustering algorithm in tests with five simulated and eight real gene-expression data sets. Conclusion The geometric complexity of clusters should be taken into account for accurate classification of microarray data, and ensemble clustering applied to the number of clusters tackles the problem very well. The C++ code and the data sets tested are available from the authors. PMID:19698124
SNPServer: a real-time SNP discovery tool.
Savage, David; Batley, Jacqueline; Erwin, Tim; Logan, Erica; Love, Christopher G; Lim, Geraldine A C; Mongin, Emmanuel; Barker, Gary; Spangenberg, German C; Edwards, David
2005-07-01
SNPServer is a real-time flexible tool for the discovery of SNPs (single nucleotide polymorphisms) within DNA sequence data. The program uses BLAST, to identify related sequences, and CAP3, to cluster and align these sequences. The alignments are parsed to the SNP discovery software autoSNP, a program that detects SNPs and insertion/deletion polymorphisms (indels). Alternatively, lists of related sequences or pre-assembled sequences may be entered for SNP discovery. SNPServer and autoSNP use redundancy to differentiate between candidate SNPs and sequence errors. For each candidate SNP, two measures of confidence are calculated, the redundancy of the polymorphism at a SNP locus and the co-segregation of the candidate SNP with other SNPs in the alignment. SNPServer is available at http://hornbill.cspp.latrobe.edu.au/snpdiscovery.html.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Starska, Katarzyna, E-mail: katarzyna.starska@umed.lodz.pl; Bryś, Magdalena; Forma, Ewa
Metallothioneins (MTs) are intracellular thiol-rich heavy metal-binding proteins which join trace metal ions protecting cells against heavy metal toxicity and regulate metal distribution and donation to various enzymes and transcription factors. The goal of this study was to identify the − 5 A/G (rs28366003) single-nucleotide polymorphism (SNP) in the core promoter region of the MT2A gene, and to investigate its effect on allele-specific gene expression and Cd, Zn, Cu and Ni content in sinonasal inverted papilloma tissue (IP), with non-cancerous sinonasal mucosa (NCM) as a control. The MT2A promoter region − 5 A/G SNP was identified by restriction fragment lengthmore » polymorphism using 117 IP and 132 NCM. MT2A gene analysis was performed by quantitative real-time PCR. Metal levels were analyzed by flame atomic absorption spectrometry. The frequency of A allele carriage was 99.2% and 100% in IP and NCM, respectively. The G allele carriage was detected in 23.9% of IP and in 12.1% of the NCM samples. As a result, a significant association of − 5 A/G SNP in MT2A gene with mRNA expression in both groups was determined. A significant association was identified between the − 5 A/G SNP in the MT2A gene with mRNA expression in both groups. A highly significant association was detected between the rs28366003 genotype and Cd and Zn content in IP. Furthermore, significant differences were identified between A/A and A/G genotype with regard to the type of metal contaminant. The Spearman rank correlation results showed the MT2A gene expression and both Cd and Cu levels were negatively correlated. The results obtained in this study suggest that the − 5 A/G SNP in the MT2A gene may have an effect on allele-specific gene expression and toxic metal accumulation in sinonasal inverted papilloma. - Highlights: • MT2A gene expression and metal content in sinonasal inverted papilloma tissues • Association between SNP (rs28366003) and expression of MT2A • Significant associations between the SNP and Cd and Zn levels • Negative correlation between MT2A gene expression and Cd and Cu levels.« less
Heo, Hyun Young; Chung, Soyi; Kim, Yong Tae; Kim, Do Hyun; Seo, Tae Seok
2016-04-15
Genetic variations such as single nucleotide polymorphism (SNP) and point mutations are important biomarkers to monitor disease prognosis and diagnosis. In this study, we developed a novel rotary microfluidic device which can perform multiplex SNP typing on the mutation sites of TP53 genes. The microdevice consists of three glass layers: a channel wafer, a Ti/Pt electrode-patterned resistance temperature detector (RTD) wafer, and a rotary plate in which twelve reaction chambers were fabricated. A series of sample injection, ligation-rolling circle amplification (L-RCA) reaction, and fluorescence detection of the resultant amplicons could be executed by rotating the top rotary plate, identifying five mutation points related with cancer prognosis. The use of the rotary plate eliminates the necessity of microvalves and micropumps to control the microfluidic flow in the channel, simplifying the chip design and chip operation for multiplex SNP detection. The proposed microdevice provides an advanced genetic analysis platform in terms of multiplexity, simplicity, and portability in the fields of biomedical diagnostics. Copyright © 2015 Elsevier B.V. All rights reserved.
Rasal, Kiran D; Shah, Tejas M; Vaidya, Megha; Jakhesara, Subhash J; Joshi, Chaitanya G
2015-06-01
The recent advances in high throughput sequencing technology accelerate possible ways for the study of genome wide variation in several organisms and associated consequences. In the present study, mutations in TGFBR3 showing significant association with FCR trait in chicken during exome sequencing were further analyzed. Out of four SNPs, one nsSNP p.Val451Leu was found in the coding region of TGFBR3. In silico tools such as SnpSift and PANTHER predicted it as deleterious (0.04) and to be tolerated, respectively, while I-Mutant revealed that protein stability decreased. The TGFBR3 I-TASSER model has a C-score of 0.85, which was validated using PROCHECK. Based on MD simulation, mutant protein structure deviated from native with RMSD 0.08 Å due to change in the H-bonding distances of mutant residue. The docking of TGFBR3 with interacting TGFBR2 inferred that mutant required more global energy. Therefore, the present study will provide useful information about functional SNPs that have an impact on FCR traits.
Abbasi, Zahra; Kazemi Nezhad, Seyed Reza; Pourmahdi-Broojeni, Mahdi; Rajaei, Elham
2017-01-01
Single-nucleotide polymorphism (SNP) rs2476601 within protein tyrosine phosphatase non-receptor type 22 gene (PTPN22) has been shown to be a risk factor for different autoimmune diseases. This study explored the association of 1858 C/T SNP with rheumatoid arthritis (RA) and celiac disease (CD) in a region covering south-west of Iran. Totally, 52 patients with CD, 120 patients with RA, and 120 healthy subjects were selected. The samples were genotyped for the rs2476601 in PTPN22 gene using the tetra-amplification refractory mutation system polymerase chain reaction. The frequency of +1858T risk allele was significantly increased in both RA (P=0.021, OR=2.56, 95%CI=1.19-5.47) and CD (P=0.002, OR=3.87, 95%CI=1.68-8.95) patients, as compared to the control group. However, no association was found between the +1858C/T PTPN22 gene SNP and the anti-cyclic citrullinated peptide and rheumatoid factor positivity in RA patients. PTPN22 gene could play a crucial role in people's susceptibility to certain autoimmune diseases.
Photodynamic synchrotron x-ray therapy in Glioma cell using superparamagnetic iron nanoparticle
NASA Astrophysics Data System (ADS)
Kim, Hong-Tae; Kim, Ki-Hong; Choi, Gi-Hwan; Jheon, Sanghoon; Park, Sung-Hwan; Kim, Bong-Il; Hyodo, Kazuyuki; Ando, Masami; Kim, Jong-Ki
2009-06-01
In order to evaluate cytotoxic effects of secondary Auger electron emission(Photon Activation Therapy:PAT) from alginate-coated iron nanoparticles(Alg-SNP), Alg-SNP-uptaken C6 glioma cell lines were irradiated with 6.89/7.2 Kev synchrotron X-ray. 0-125 Gy were irradiated on three experimental groups including No-SNP group incubating without SNP as control group, 6hr-SNP group incubating with SNP for 6hr and ON-SNP group incubating with SNP overnight. Irradiated cells were stained with Acridine Orange(AO) and Edithium Bromide(EB) to count their viability with fluorescent microscopy in comparison with control groups. AO stained in damaged DNA, giving FL color change in X-ray plus SNP group. EB did not or less enter inside the cell nucleus of control group. In contrast, EB entered inside the cell nucleus of Alg-SNP group which means more damage compared with Control groups. The results of MTT assay demonstrated a X-ray dose-dependent reduction generally in cell viability in the experimental groups. 3 or 9 times increase in cell survival loss rate was observed at 6hr-SNP and ON-SNP groups, respectively compared to No-SNP control group in first experiment that was done to test cell survival rate at relatively lower dose, from 0 to 50 Gy. In second experiment X-ray dose was increased to 125 Gy. Survival loss was sharply decreased in a relatively lower dose from 5 to 25 Gy, and then demonstrated an exponentially decreasing behavior with a convergence until 125 Gy for each group. This observation suggests PAT effects on the cell directly by X-ray in the presence of Alg-SNP occurs within lower X-ray dose, and conventional X-ray radiation effect becomes dominant in higher X-ray dose. The cell viability loss of ON-SNP group was three times higher compared with that of 6hr-SNP group. In conclusion, it is possible to design photodynamic X-ray therapy study using a monochromatic x-ray energy and metal nanoparticle as x-ray sensitizer, which may enable new X-ray PDT to disseminated tumors without side effects to normal surrounding tissue.
Genetic polymorphism directs IL-6 expression in fibroblasts but not selected other cell types
Noss, Erika H.; Nguyen, Hung N.; Chang, Sook Kyung; Watts, Gerald F. M.; Brenner, Michael B.
2015-01-01
Interleukin (IL)-6 blockade is an effective treatment for rheumatoid arthritis (RA), and synovial fibroblasts are a major IL-6 producer in the inflamed joint. We found that human RA and osteoarthritis (OA) synovial fibroblasts derived from independent donors reproducibly segregated into low, medium, and high IL-6 producers, independent of stimulus, cell passage, or disease state. IL-6 expression pattern correlated strongly with total mRNA expression, not mRNA stability, suggesting transcriptional rather than posttranscriptional regulation. High-fibroblast IL-6 expression was significantly associated with the IL-6 proximal promoter single nucleotide polymorphism (SNP) rs1800795 minor allele (CC) genotype. In contrast, no association between this SNP and IL-6 production was detected in CD14+ monocytes, another major producer of synovial IL-6. Luciferase expression assays confirmed that this SNP was associated with differential IL-6 expression in fibroblasts. To date, several association studies examining rs1800795 allele frequency and disease risk have reported seemingly conflicting results ranging from no association to association with either the major or minor allele across a spectrum of conditions, including cancer and autoimmune, cardiovascular, infectious, and metabolic diseases. This study points to a prominent contribution from promoter genetic variation in fibroblast IL-6 regulation, but not in other IL-6–producing cell types. We propose that some of the heterogeneity in these clinical studies likely reflects the cellular source of IL-6 in specific diseases, much of which may be produced by nonhematopoietic cells. These results highlight that functional analysis of disease-associated SNPs on gene expression and pathologic processes must consider variation in diverse cell types. PMID:26578807
A novel feature extraction approach for microarray data based on multi-algorithm fusion
Jiang, Zhu; Xu, Rong
2015-01-01
Feature extraction is one of the most important and effective method to reduce dimension in data mining, with emerging of high dimensional data such as microarray gene expression data. Feature extraction for gene selection, mainly serves two purposes. One is to identify certain disease-related genes. The other is to find a compact set of discriminative genes to build a pattern classifier with reduced complexity and improved generalization capabilities. Depending on the purpose of gene selection, two types of feature extraction algorithms including ranking-based feature extraction and set-based feature extraction are employed in microarray gene expression data analysis. In ranking-based feature extraction, features are evaluated on an individual basis, without considering inter-relationship between features in general, while set-based feature extraction evaluates features based on their role in a feature set by taking into account dependency between features. Just as learning methods, feature extraction has a problem in its generalization ability, which is robustness. However, the issue of robustness is often overlooked in feature extraction. In order to improve the accuracy and robustness of feature extraction for microarray data, a novel approach based on multi-algorithm fusion is proposed. By fusing different types of feature extraction algorithms to select the feature from the samples set, the proposed approach is able to improve feature extraction performance. The new approach is tested against gene expression dataset including Colon cancer data, CNS data, DLBCL data, and Leukemia data. The testing results show that the performance of this algorithm is better than existing solutions. PMID:25780277
A novel feature extraction approach for microarray data based on multi-algorithm fusion.
Jiang, Zhu; Xu, Rong
2015-01-01
Feature extraction is one of the most important and effective method to reduce dimension in data mining, with emerging of high dimensional data such as microarray gene expression data. Feature extraction for gene selection, mainly serves two purposes. One is to identify certain disease-related genes. The other is to find a compact set of discriminative genes to build a pattern classifier with reduced complexity and improved generalization capabilities. Depending on the purpose of gene selection, two types of feature extraction algorithms including ranking-based feature extraction and set-based feature extraction are employed in microarray gene expression data analysis. In ranking-based feature extraction, features are evaluated on an individual basis, without considering inter-relationship between features in general, while set-based feature extraction evaluates features based on their role in a feature set by taking into account dependency between features. Just as learning methods, feature extraction has a problem in its generalization ability, which is robustness. However, the issue of robustness is often overlooked in feature extraction. In order to improve the accuracy and robustness of feature extraction for microarray data, a novel approach based on multi-algorithm fusion is proposed. By fusing different types of feature extraction algorithms to select the feature from the samples set, the proposed approach is able to improve feature extraction performance. The new approach is tested against gene expression dataset including Colon cancer data, CNS data, DLBCL data, and Leukemia data. The testing results show that the performance of this algorithm is better than existing solutions.
Hess, Jonathan L.; Tylee, Daniel S.; Barve, Rahul; de Jong, Simone; Ophoff, Roel A.; Kumarasinghe, Nishantha; Tooney, Paul; Schall, Ulrich; Gardiner, Erin; Beveridge, Natalie Jane; Scott, Rodney J.; Yasawardene, Surangi; Perera, Antionette; Mendis, Jayan; Carr, Vaughan; Kelly, Brian; Cairns, Murray; Tsuang, Ming T.; Glatt, Stephen J.
2016-01-01
The application of microarray technology in schizophrenia research was heralded as paradigm-shifting, as it allowed for high-throughput assessment of cell and tissue function. This technology was widely adopted, initially in studies of postmortem brain tissue, and later in studies of peripheral blood. The collective body of schizophrenia microarray literature contains apparent inconsistencies between studies, with failures to replicate top hits, in part due to small sample sizes, cohort-specific effects, differences in array types, and other confounders. In an attempt to summarize existing studies of schizophrenia cases and non-related comparison subjects, we performed two mega-analyses of a combined set of microarray data from postmortem prefrontal cortices (n = 315) and from ex-vivo blood tissues (n = 578). We adjusted regression models per gene to remove non-significant covariates, providing best-estimates of transcripts dysregulated in schizophrenia. We also examined dysregulation of functionally related gene sets and gene co-expression modules, and assessed enrichment of cell types and genetic risk factors. The identities of the most significantly dysregulated genes were largely distinct for each tissue, but the findings indicated common emergent biological functions (e.g. immunity) and regulatory factors (e.g., predicted targets of transcription factors and miRNA species across tissues). Our network-based analyses converged upon similar patterns of heightened innate immune gene expression in both brain and blood in schizophrenia. We also constructed generalizable machine-learning classifiers using the blood-based microarray data. Our study provides an informative atlas for future pathophysiologic and biomarker studies of schizophrenia. PMID:27450777
Hess, Jonathan L; Tylee, Daniel S; Barve, Rahul; de Jong, Simone; Ophoff, Roel A; Kumarasinghe, Nishantha; Tooney, Paul; Schall, Ulrich; Gardiner, Erin; Beveridge, Natalie Jane; Scott, Rodney J; Yasawardene, Surangi; Perera, Antionette; Mendis, Jayan; Carr, Vaughan; Kelly, Brian; Cairns, Murray; Tsuang, Ming T; Glatt, Stephen J
2016-10-01
The application of microarray technology in schizophrenia research was heralded as paradigm-shifting, as it allowed for high-throughput assessment of cell and tissue function. This technology was widely adopted, initially in studies of postmortem brain tissue, and later in studies of peripheral blood. The collective body of schizophrenia microarray literature contains apparent inconsistencies between studies, with failures to replicate top hits, in part due to small sample sizes, cohort-specific effects, differences in array types, and other confounders. In an attempt to summarize existing studies of schizophrenia cases and non-related comparison subjects, we performed two mega-analyses of a combined set of microarray data from postmortem prefrontal cortices (n=315) and from ex-vivo blood tissues (n=578). We adjusted regression models per gene to remove non-significant covariates, providing best-estimates of transcripts dysregulated in schizophrenia. We also examined dysregulation of functionally related gene sets and gene co-expression modules, and assessed enrichment of cell types and genetic risk factors. The identities of the most significantly dysregulated genes were largely distinct for each tissue, but the findings indicated common emergent biological functions (e.g. immunity) and regulatory factors (e.g., predicted targets of transcription factors and miRNA species across tissues). Our network-based analyses converged upon similar patterns of heightened innate immune gene expression in both brain and blood in schizophrenia. We also constructed generalizable machine-learning classifiers using the blood-based microarray data. Our study provides an informative atlas for future pathophysiologic and biomarker studies of schizophrenia. Published by Elsevier B.V.
Identifying novel glioma associated pathways based on systems biology level meta-analysis.
Hu, Yangfan; Li, Jinquan; Yan, Wenying; Chen, Jiajia; Li, Yin; Hu, Guang; Shen, Bairong
2013-01-01
With recent advances in microarray technology, including genomics, proteomics, and metabolomics, it brings a great challenge for integrating this "-omics" data to analysis complex disease. Glioma is an extremely aggressive and lethal form of brain tumor, and thus the study of the molecule mechanism underlying glioma remains very important. To date, most studies focus on detecting the differentially expressed genes in glioma. However, the meta-analysis for pathway analysis based on multiple microarray datasets has not been systematically pursued. In this study, we therefore developed a systems biology based approach by integrating three types of omics data to identify common pathways in glioma. Firstly, the meta-analysis has been performed to study the overlapping of signatures at different levels based on the microarray gene expression data of glioma. Among these gene expression datasets, 12 pathways were found in GeneGO database that shared by four stages. Then, microRNA expression profiles and ChIP-seq data were integrated for the further pathway enrichment analysis. As a result, we suggest 5 of these pathways could be served as putative pathways in glioma. Among them, the pathway of TGF-beta-dependent induction of EMT via SMAD is of particular importance. Our results demonstrate that the meta-analysis based on systems biology level provide a more useful approach to study the molecule mechanism of complex disease. The integration of different types of omics data, including gene expression microarrays, microRNA and ChIP-seq data, suggest some common pathways correlated with glioma. These findings will offer useful potential candidates for targeted therapeutic intervention of glioma.
Kong, Lingde; Tian, Weifeng; Cao, Peng; Wang, Haonan; Zhang, Bing; Shen, Yong
2017-10-01
The predictive factors associated with neck pain remain unclear. We conducted a cross-sectional study to assess predictive factors, especially Modic changes (MCs), associated with the intensity and duration of neck pain in patients with cervical disc degenerative disease.We retrospectively reviewed patients in our hospital from January 2013 to December 2016. Severe neck pain (SNP) and persistent neck pain (PNP) were the 2 main outcomes, and were assessed based on the numerical rating scale (NRS). Basic data, and also imaging data, were collected and analyzed as potential predictive factors. Univariate analysis and multiple logistic regression analysis were performed to assess the predictive factors for neck pain.In all, 381 patients (193 males and 188 females) with cervical degenerative disease were included in our study. The number of patients with SNP and PNP were 94 (24.67%) and 109 (28.61%), respectively. The NRS of neck pain in patients with type 1 MCs was significantly higher than type 2 MCs (4.8 ± 0.9 vs 3.9 ± 1.1; P = .004). The multivariate logistic analysis showed that kyphosis curvature (odds ratio [OR] 1.082, 95% confidence interval [CI] 1.044-1.112), spondylolisthesis (OR 1.339, 95% CI 1.226-1.462), and annular tear (OR 1.188, 95% CI 1.021-1.382) were factors associated with SNP, whereas kyphosis curvature (OR 1.568, 95% CI 1.022-2.394), spondylolisthesis (OR 1.486, 95% CI 1.082-2.041), and MCs (OR 1.152, 95% CI 1.074-1.234) were associated with PNP.We concluded that kyphosis curvature, spondylolisthesis, and annular tear are associated with SNP, whereas kyphosis curvature, spondylolisthesis, and MCs are associated with PNP. This study supports the view that MCs can lead to a long duration of neck pain.
Predictive factors associated with neck pain in patients with cervical disc degeneration
Kong, Lingde; Tian, Weifeng; Cao, Peng; Wang, Haonan; Zhang, Bing; Shen, Yong
2017-01-01
Abstract The predictive factors associated with neck pain remain unclear. We conducted a cross-sectional study to assess predictive factors, especially Modic changes (MCs), associated with the intensity and duration of neck pain in patients with cervical disc degenerative disease. We retrospectively reviewed patients in our hospital from January 2013 to December 2016. Severe neck pain (SNP) and persistent neck pain (PNP) were the 2 main outcomes, and were assessed based on the numerical rating scale (NRS). Basic data, and also imaging data, were collected and analyzed as potential predictive factors. Univariate analysis and multiple logistic regression analysis were performed to assess the predictive factors for neck pain. In all, 381 patients (193 males and 188 females) with cervical degenerative disease were included in our study. The number of patients with SNP and PNP were 94 (24.67%) and 109 (28.61%), respectively. The NRS of neck pain in patients with type 1 MCs was significantly higher than type 2 MCs (4.8 ± 0.9 vs 3.9 ± 1.1; P = .004). The multivariate logistic analysis showed that kyphosis curvature (odds ratio [OR] 1.082, 95% confidence interval [CI] 1.044–1.112), spondylolisthesis (OR 1.339, 95% CI 1.226–1.462), and annular tear (OR 1.188, 95% CI 1.021–1.382) were factors associated with SNP, whereas kyphosis curvature (OR 1.568, 95% CI 1.022–2.394), spondylolisthesis (OR 1.486, 95% CI 1.082–2.041), and MCs (OR 1.152, 95% CI 1.074–1.234) were associated with PNP. We concluded that kyphosis curvature, spondylolisthesis, and annular tear are associated with SNP, whereas kyphosis curvature, spondylolisthesis, and MCs are associated with PNP. This study supports the view that MCs can lead to a long duration of neck pain. PMID:29069048
Pemov, Alexander; Sung, Heejong; Hyland, Paula L.; Sloan, Jennifer L.; Ruppert, Sarah L.; Baldwin, Andrea M.; Boland, Joseph F.; Bass, Sara E.; Lee, Hyo Jung; Jones, Kristine M.; Zhang, Xijun; Mullikin, James C.; Widemann, Brigitte C.; Wilson, Alexander F.; Stewart, Douglas R.
2014-01-01
Neurofibromatosis type 1 (NF1) is an autosomal dominant, monogenic disorder of dysregulated neurocutaneous tissue growth. Pleiotropy, variable expressivity and few NF1 genotype-phenotype correlates limit clinical prognostication in NF1. Phenotype complexity in NF1 is hypothesized to derive in part from genetic modifiers unlinked to the NF1 locus. In this study, we hypothesized that normal variation in germline gene expression confers risk for certain phenotypes in NF1. In a set of 79 individuals with NF1, we examined the association between gene expression in lymphoblastoid cell lines with NF1-associated phenotypes and sequenced select genes with significant phenotype/expression correlations. In a discovery cohort of 89 self-reported European-Americans with NF1 we examined the association between germline sequence variants of these genes with café-au-lait macule (CALM) count, a tractable, tumor-like phenotype in NF1. Two correlated, common SNPs (rs4660761 and rs7161) between DPH2 and ATP6V0B were significantly associated with the CALM count. Analysis with tiled regression also identified SNP rs4660761 as significantly associated with CALM count. SNP rs1800934 and 12 rare variants in the mismatch repair gene MSH6 were also associated with CALM count. Both SNPs rs7161 and rs4660761 (DPH2 and ATP6V0B) were highly significant in a mega-analysis in a combined cohort of 180 self-reported European-Americans; SNP rs1800934 (MSH6) was near-significant in a meta-analysis assuming dominant effect of the minor allele. SNP rs4660761 is predicted to regulate ATP6V0B, a gene associated with melanosome biology. Individuals with homozygous mutations in MSH6 can develop an NF1-like phenotype, including multiple CALMs. Through a multi-platform approach, we identified variants that influence NF1 CALM count. PMID:25329635
Wu, Yahui; Luo, Lixin; Chen, Likai; Tao, Xingxing; Huang, Ming; Wang, Hui; Chen, Zhiqiang; Xiao, Wuming
2016-11-18
Genetic analysis revealed that narrow leaf, small panicle, thin and slender stems as well as low fertility rate of an Indica rice variety were recessive traits and controlled by a single gene. Applying map-based cloning strategy, a novel narrow leaf gene, which was named nal11 was delimited to an interval of 58.3 kb between the InDel markers N10 and InD5016. There are 9 genes in the mapping interval, and only a heat shock DNAJ protein encode gene (Os07g09450) has a specific G to T SNP, which was occurred at the last base of the second exon of Os07g09450 in ZYX. 5' and 3' RACE result shown that there were two transcripts in NAL11, and the SNP in nal11 leads to a variable shear of mRNA. In addition, this type of mRNA alternative splicing together with a stop codon closely followed the SNP which caused termination of translation destroyed the DNAJ domain of nal11's product. These results suggested that the heat shock DNAJ gene was most likely to be the candidate gene of nal11. The results of RT-PCR and real-time PCR further verified that the SNP in the ZYX-nal11 gene affects mRNA splicing pattern. Phenotype of ZYX may be caused by a statistically significant reduction in the total number of small veins in leaf, size and number of small vascular bundles and cells in stems, similar to several previous reported mutations. The basic molecular information we provide here will be useful for further investigations of the physiological function of the heat shock DNAJ gene, which will be helpful in better understanding the role of the DNAJ family in regulation of plant type traits such as leaf width of rice. Copyright © 2016 Elsevier Inc. All rights reserved.
Zhang, Han; Wheeler, William; Hyland, Paula L; Yang, Yifan; Shi, Jianxin; Chatterjee, Nilanjan; Yu, Kai
2016-06-01
Meta-analysis of multiple genome-wide association studies (GWAS) has become an effective approach for detecting single nucleotide polymorphism (SNP) associations with complex traits. However, it is difficult to integrate the readily accessible SNP-level summary statistics from a meta-analysis into more powerful multi-marker testing procedures, which generally require individual-level genetic data. We developed a general procedure called Summary based Adaptive Rank Truncated Product (sARTP) for conducting gene and pathway meta-analysis that uses only SNP-level summary statistics in combination with genotype correlation estimated from a panel of individual-level genetic data. We demonstrated the validity and power advantage of sARTP through empirical and simulated data. We conducted a comprehensive pathway-based meta-analysis with sARTP on type 2 diabetes (T2D) by integrating SNP-level summary statistics from two large studies consisting of 19,809 T2D cases and 111,181 controls with European ancestry. Among 4,713 candidate pathways from which genes in neighborhoods of 170 GWAS established T2D loci were excluded, we detected 43 T2D globally significant pathways (with Bonferroni corrected p-values < 0.05), which included the insulin signaling pathway and T2D pathway defined by KEGG, as well as the pathways defined according to specific gene expression patterns on pancreatic adenocarcinoma, hepatocellular carcinoma, and bladder carcinoma. Using summary data from 8 eastern Asian T2D GWAS with 6,952 cases and 11,865 controls, we showed 7 out of the 43 pathways identified in European populations remained to be significant in eastern Asians at the false discovery rate of 0.1. We created an R package and a web-based tool for sARTP with the capability to analyze pathways with thousands of genes and tens of thousands of SNPs.
Zhang, Han; Wheeler, William; Hyland, Paula L.; Yang, Yifan; Shi, Jianxin; Chatterjee, Nilanjan; Yu, Kai
2016-01-01
Meta-analysis of multiple genome-wide association studies (GWAS) has become an effective approach for detecting single nucleotide polymorphism (SNP) associations with complex traits. However, it is difficult to integrate the readily accessible SNP-level summary statistics from a meta-analysis into more powerful multi-marker testing procedures, which generally require individual-level genetic data. We developed a general procedure called Summary based Adaptive Rank Truncated Product (sARTP) for conducting gene and pathway meta-analysis that uses only SNP-level summary statistics in combination with genotype correlation estimated from a panel of individual-level genetic data. We demonstrated the validity and power advantage of sARTP through empirical and simulated data. We conducted a comprehensive pathway-based meta-analysis with sARTP on type 2 diabetes (T2D) by integrating SNP-level summary statistics from two large studies consisting of 19,809 T2D cases and 111,181 controls with European ancestry. Among 4,713 candidate pathways from which genes in neighborhoods of 170 GWAS established T2D loci were excluded, we detected 43 T2D globally significant pathways (with Bonferroni corrected p-values < 0.05), which included the insulin signaling pathway and T2D pathway defined by KEGG, as well as the pathways defined according to specific gene expression patterns on pancreatic adenocarcinoma, hepatocellular carcinoma, and bladder carcinoma. Using summary data from 8 eastern Asian T2D GWAS with 6,952 cases and 11,865 controls, we showed 7 out of the 43 pathways identified in European populations remained to be significant in eastern Asians at the false discovery rate of 0.1. We created an R package and a web-based tool for sARTP with the capability to analyze pathways with thousands of genes and tens of thousands of SNPs. PMID:27362418
Accuracy of direct genomic values in Holstein bulls and cows using subsets of SNP markers
2010-01-01
Background At the current price, the use of high-density single nucleotide polymorphisms (SNP) genotyping assays in genomic selection of dairy cattle is limited to applications involving elite sires and dams. The objective of this study was to evaluate the use of low-density assays to predict direct genomic value (DGV) on five milk production traits, an overall conformation trait, a survival index, and two profit index traits (APR, ASI). Methods Dense SNP genotypes were available for 42,576 SNP for 2,114 Holstein bulls and 510 cows. A subset of 1,847 bulls born between 1955 and 2004 was used as a training set to fit models with various sets of pre-selected SNP. A group of 297 bulls born between 2001 and 2004 and all cows born between 1992 and 2004 were used to evaluate the accuracy of DGV prediction. Ridge regression (RR) and partial least squares regression (PLSR) were used to derive prediction equations and to rank SNP based on the absolute value of the regression coefficients. Four alternative strategies were applied to select subset of SNP, namely: subsets of the highest ranked SNP for each individual trait, or a single subset of evenly spaced SNP, where SNP were selected based on their rank for ASI, APR or minor allele frequency within intervals of approximately equal length. Results RR and PLSR performed very similarly to predict DGV, with PLSR performing better for low-density assays and RR for higher-density SNP sets. When using all SNP, DGV predictions for production traits, which have a higher heritability, were more accurate (0.52-0.64) than for survival (0.19-0.20), which has a low heritability. The gain in accuracy using subsets that included the highest ranked SNP for each trait was marginal (5-6%) over a common set of evenly spaced SNP when at least 3,000 SNP were used. Subsets containing 3,000 SNP provided more than 90% of the accuracy that could be achieved with a high-density assay for cows, and 80% of the high-density assay for young bulls. Conclusions Accurate genomic evaluation of the broader bull and cow population can be achieved with a single genotyping assays containing ~ 3,000 to 5,000 evenly spaced SNP. PMID:20950478
Xu, Qing; Mei, Gui; Sun, Dongxiao; Zhang, Qin; Zhang, Yuan; Yin, Cengceng; Chen, Huiyong; Ding, Xiangdong; Liu, Jianfeng
2012-11-02
We previously localized a quantitative trait locus (QTL) on bovine chromosome 6 affecting milk production traits to a 1.5-Mb region between BMS483 and MNB-209 via genome scanning followed by fine mapping. Totally 15 genes were mapped within such linkage region through bioinformatic analysis of the cattle-human comparative map and bovine genome assembly. Of them, the UDP-glucose dehydrogenase (UGDH) was suggested as a potential positional candidate gene for milk production traits based on its corresponding physiological and biochemical functions and genetic effects. By sequencing all the coding exons and the untranslated regions in UGDH with pooled DNA of 8 sires represented the separated families detected in our previous studies, a total of ten SNPs were identified and genotyped in 1417 Holstein cows of 8 separation families. Individual SNP-based association analysis revealed 4 significant associations of SNP Ex1-1, SNP Int3-1, SNP Int5-1, and SNP Ex12-3 with milk yield (P < 0.05), and 2 significant associations of SNP Ex1-1 and SNP Ex12-3 with protein yield (P < 0.05). Furthermore, our haplotype-based association analyses indicated that haplotypes G-C-C, formed by SNP Ex12-2-SNP Int11-1-SNP Ex11-1, T-G, formed by SNP Int9-3-SNP Int9-2, and C-C, formed by SNP Int5-1-SNP Int3-1, are significantly associated with protein percentage (F=4.15; P=0.0418) and fat percentage (F=5.18~7.25; P=0.0072~0.0231). Finally, by using an in vitro expression assay, we demonstrated that the A allele of SNP Ex1-1 and T allele of SNP Ex11-1of UGDH significantly decreases the expression of UGDH by 68.0% at the RNA, and 50.1% at the protein level, suggesting that SNP Ex1-1 and Ex11-1 represent two functional polymorphisms affecting expression of UGDH and may partly contributed to the observed association of the gene with milk production traits in our samples. Taken together, our findings strongly indicate that UGDH gene could be involved in genetic variation underlying the QTL for milk production traits.
Finding Groups in Gene Expression Data
2005-01-01
The vast potential of the genomic insight offered by microarray technologies has led to their widespread use since they were introduced a decade ago. Application areas include gene function discovery, disease diagnosis, and inferring regulatory networks. Microarray experiments enable large-scale, high-throughput investigations of gene activity and have thus provided the data analyst with a distinctive, high-dimensional field of study. Many questions in this field relate to finding subgroups of data profiles which are very similar. A popular type of exploratory tool for finding subgroups is cluster analysis, and many different flavors of algorithms have been used and indeed tailored for microarray data. Cluster analysis, however, implies a partitioning of the entire data set, and this does not always match the objective. Sometimes pattern discovery or bump hunting tools are more appropriate. This paper reviews these various tools for finding interesting subgroups. PMID:16046827
A Next-generation Tissue Microarray (ngTMA) Protocol for Biomarker Studies
Zlobec, Inti; Suter, Guido; Perren, Aurel; Lugli, Alessandro
2014-01-01
Biomarker research relies on tissue microarrays (TMA). TMAs are produced by repeated transfer of small tissue cores from a ‘donor’ block into a ‘recipient’ block and then used for a variety of biomarker applications. The construction of conventional TMAs is labor intensive, imprecise, and time-consuming. Here, a protocol using next-generation Tissue Microarrays (ngTMA) is outlined. ngTMA is based on TMA planning and design, digital pathology, and automated tissue microarraying. The protocol is illustrated using an example of 134 metastatic colorectal cancer patients. Histological, statistical and logistical aspects are considered, such as the tissue type, specific histological regions, and cell types for inclusion in the TMA, the number of tissue spots, sample size, statistical analysis, and number of TMA copies. Histological slides for each patient are scanned and uploaded onto a web-based digital platform. There, they are viewed and annotated (marked) using a 0.6-2.0 mm diameter tool, multiple times using various colors to distinguish tissue areas. Donor blocks and 12 ‘recipient’ blocks are loaded into the instrument. Digital slides are retrieved and matched to donor block images. Repeated arraying of annotated regions is automatically performed resulting in an ngTMA. In this example, six ngTMAs are planned containing six different tissue types/histological zones. Two copies of the ngTMAs are desired. Three to four slides for each patient are scanned; 3 scan runs are necessary and performed overnight. All slides are annotated; different colors are used to represent the different tissues/zones, namely tumor center, invasion front, tumor/stroma, lymph node metastases, liver metastases, and normal tissue. 17 annotations/case are made; time for annotation is 2-3 min/case. 12 ngTMAs are produced containing 4,556 spots. Arraying time is 15-20 hr. Due to its precision, flexibility and speed, ngTMA is a powerful tool to further improve the quality of TMAs used in clinical and translational research. PMID:25285857
Analysis of high-order SNP barcodes in mitochondrial D-loop for chronic dialysis susceptibility.
Yang, Cheng-Hong; Lin, Yu-Da; Chuang, Li-Yeh; Chang, Hsueh-Wei
2016-10-01
Positively identifying disease-associated single nucleotide polymorphism (SNP) markers in genome-wide studies entails the complex association analysis of a huge number of SNPs. Such large numbers of SNP barcode (SNP/genotype combinations) continue to pose serious computational challenges, especially for high-dimensional data. We propose a novel exploiting SNP barcode method based on differential evolution, termed IDE (improved differential evolution). IDE uses a "top combination strategy" to improve the ability of differential evolution to explore high-order SNP barcodes in high-dimensional data. We simulate disease data and use real chronic dialysis data to test four global optimization algorithms. In 48 simulated disease models, we show that IDE outperforms existing global optimization algorithms in terms of exploring ability and power to detect the specific SNP/genotype combinations with a maximum difference between cases and controls. In real data, we show that IDE can be used to evaluate the relative effects of each individual SNP on disease susceptibility. IDE generated significant SNP barcode with less computational complexity than the other algorithms, making IDE ideally suited for analysis of high-order SNP barcodes. Copyright © 2016 Elsevier Inc. All rights reserved.
CONTACT (NAME ONLY): Hassan El-Fawal Abstract Details PRESENTATION TYPE: Platform or Poster CURRENT CATEGORY: Neurodegenerative Disease | Biomarkers | Neurotoxicity, Metals KEYWORDS: Autoantibodies, Glutathione-S-Transferase, DATE/TIME LAST MODIFIED: DATE/TIME SUBMITTED: Abs...
Haplotype-based approach to known MS-associated regions increases the amount of explained risk
Khankhanian, Pouya; Gourraud, Pierre-Antoine; Lizee, Antoine; Goodin, Douglas S
2015-01-01
Genome-wide association studies (GWAS), using single nucleotide polymorphisms (SNPs), have yielded 110 non-human leucocyte antigen genomic regions that are associated with multiple sclerosis (MS). Despite this large number of associations, however, only 28% of MS-heritability can currently be explained. Here we compare the use of multi-SNP-haplotypes to the use of single-SNPs as alternative methods to describe MS genetic risk. SNP-haplotypes (of various lengths from 1 up to 15 contiguous SNPs) were constructed at each of the 110 previously identified, MS-associated, genomic regions. Even after correcting for the larger number of statistical comparisons made when using the haplotype-method, in 32 of the regions, the SNP-haplotype based model was markedly more significant than the single-SNP based model. By contrast, in no region was the single-SNP based model similarly more significant than the SNP-haplotype based model. Moreover, when we included the 932 MS-associated SNP-haplotypes (that we identified from 102 regions) as independent variables into a logistic linear model, the amount of MS-heritability, as assessed by Nagelkerke's R-squared, was 38%, which was considerably better than 29%, which was obtained by using only single-SNPs. This study demonstrates that SNP-haplotypes can be used to fine-map the genetic associations within regions of interest previously identified by single-SNP GWAS. Moreover, the amount of the MS genetic risk explained by the SNP-haplotype associations in the 110 MS-associated genomic regions was considerably greater when using SNP-haplotypes than when using single-SNPs. Also, the use of SNP-haplotypes can lead to the discovery of new regions of interest, which have not been identified by a single-SNP GWAS. PMID:26185143
[Phenotypic and genetic analysis of a patient presented with Tietz/Waardenburg type II a syndrome].
Wang, Huanhuan; Tang, Lifang; Zhang, Jingmin; Hu, Qin; Chen, Yingwei; Xiao, Bing
2015-08-01
To determine the genetic cause for a patient featuring decreased pigmentation of the skin and iris, hearing loss and multiple congenital anomalies. Routine chromosomal banding was performed to analyze the karyotype of the patient and his parents. Single nucleotide polymorphism array (SNP array) was employed to identify cryptic chromosome aberrations, and quantitative real-time PCR was used to confirm the results. Karyotype analysis has revealed no obvious anomaly for the patient and his parents. SNP array analysis of the patient has demonstrated a 3.9 Mb deletion encompassing 3p13p14.1, which caused loss of entire MITF gene. The deletion was confirmed by quantitative real-time PCR. Clinical features of the patient have included severe bilateral hearing loss, decreased pigmentation of the skin and iris and multiple congenital anomalies. The patient, carrying a 3p13p14.1 deletion, has features of Tietz syndrome/Waardenburg syndrome type IIa. This case may provide additional data for the study of genotype-phenotype correlation of this disease.
Strauss, Christian; Endimiani, Andrea; Perreten, Vincent
2015-01-01
A rapid and simple DNA labeling system has been developed for disposable microarrays and has been validated for the detection of 117 antibiotic resistance genes abundant in Gram-positive bacteria. The DNA was fragmented and amplified using phi-29 polymerase and random primers with linkers. Labeling and further amplification were then performed by classic PCR amplification using biotinylated primers specific for the linkers. The microarray developed by Perreten et al. (Perreten, V., Vorlet-Fawer, L., Slickers, P., Ehricht, R., Kuhnert, P., Frey, J., 2005. Microarray-based detection of 90 antibiotic resistance genes of gram-positive bacteria. J.Clin.Microbiol. 43, 2291-2302.) was improved by additional oligonucleotides. A total of 244 oligonucleotides (26 to 37 nucleotide length and with similar melting temperatures) were spotted on the microarray, including genes conferring resistance to clinically important antibiotic classes like β-lactams, macrolides, aminoglycosides, glycopeptides and tetracyclines. Each antibiotic resistance gene is represented by at least 2 oligonucleotides designed from consensus sequences of gene families. The specificity of the oligonucleotides and the quality of the amplification and labeling were verified by analysis of a collection of 65 strains belonging to 24 species. Association between genotype and phenotype was verified for 6 antibiotics using 77 Staphylococcus strains belonging to different species and revealed 95% test specificity and a 93% predictive value of a positive test. The DNA labeling and amplification is independent of the species and of the target genes and could be used for different types of microarrays. This system has also the advantage to detect several genes within one bacterium at once, like in Staphylococcus aureus strain BM3318, in which up to 15 genes were detected. This new microarray-based detection system offers a large potential for applications in clinical diagnostic, basic research, food safety and surveillance programs for antimicrobial resistance. Copyright © 2014 Elsevier B.V. All rights reserved.
Khan, Rishi L; Gonye, Gregory E; Gao, Guang; Schwaber, James S
2006-01-01
Background Using microarrays by co-hybridizing two samples labeled with different dyes enables differential gene expression measurements and comparisons across slides while controlling for within-slide variability. Typically one dye produces weaker signal intensities than the other often causing signals to be undetectable. In addition, undetectable spots represent a large problem for two-color microarray designs and most arrays contain at least 40% undetectable spots even when labeled with reference samples such as Stratagene's Universal Reference RNAs™. Results We introduce a novel universal reference sample that produces strong signal for all spots on the array, increasing the average fraction of detectable spots to 97%. Maximizing detectable spots on the reference image channel also decreases the variability of microarray data allowing for reliable detection of smaller differential gene expression changes. The reference sample is derived from sequence contained in the parental EST clone vector pT7T3D-Pac and is called vector RNA (vRNA). We show that vRNA can also be used for quality control of microarray printing and PCR product quality, detection of hybridization anomalies, and simplification of spot finding and segmentation tasks. This reference sample can be made inexpensively in large quantities as a renewable resource that is consistent across experiments. Conclusion Results of this study show that vRNA provides a useful universal reference that yields high signal for almost all spots on a microarray, reduces variation and allows for comparisons between experiments and laboratories. Further, it can be used for quality control of microarray printing and PCR product quality, detection of hybridization anomalies, and simplification of spot finding and segmentation tasks. This type of reference allows for detection of small changes in differential expression while reference designs in general allow for large-scale multivariate experimental designs. vRNA in combination with reference designs enable systems biology microarray experiments of small physiologically relevant changes. PMID:16677381
Grubaugh, Nathan D.; McMenamy, Scott S.; Turell, Michael J.; Lee, John S.
2013-01-01
Background Arthropod-borne viruses are important emerging pathogens world-wide. Viruses transmitted by mosquitoes, such as dengue, yellow fever, and Japanese encephalitis viruses, infect hundreds of millions of people and animals each year. Global surveillance of these viruses in mosquito vectors using molecular based assays is critical for prevention and control of the associated diseases. Here, we report an oligonucleotide DNA microarray design, termed ArboChip5.1, for multi-gene detection and identification of mosquito-borne RNA viruses from the genera Flavivirus (family Flaviviridae), Alphavirus (Togaviridae), Orthobunyavirus (Bunyaviridae), and Phlebovirus (Bunyaviridae). Methodology/Principal Findings The assay utilizes targeted PCR amplification of three genes from each virus genus for electrochemical detection on a portable, field-tested microarray platform. Fifty-two viruses propagated in cell-culture were used to evaluate the specificity of the PCR primer sets and the ArboChip5.1 microarray capture probes. The microarray detected all of the tested viruses and differentiated between many closely related viruses such as members of the dengue, Japanese encephalitis, and Semliki Forest virus clades. Laboratory infected mosquitoes were used to simulate field samples and to determine the limits of detection. Additionally, we identified dengue virus type 3, Japanese encephalitis virus, Tembusu virus, Culex flavivirus, and a Quang Binh-like virus from mosquitoes collected in Thailand in 2011 and 2012. Conclusions/Significance We demonstrated that the described assay can be utilized in a comprehensive field surveillance program by the broad-range amplification and specific identification of arboviruses from infected mosquitoes. Furthermore, the microarray platform can be deployed in the field and viral RNA extraction to data analysis can occur in as little as 12 h. The information derived from the ArboChip5.1 microarray can help to establish public health priorities, detect disease outbreaks, and evaluate control programs. PMID:23967358
Kaewmanee, M; Phoksawat, W; Romphruk, A; Romphruk, A V; Jumnainsong, A; Leelayuwat, C
2013-06-01
Natural killer group 2 member D (NKG2D) on immune effector cells recognizes multiple stress-inducible ligands. NKG2D single-nucleotide polymorphism (SNP) haplotypes were related to the levels of cytotoxic activity of peripheral blood mononuclear cells. Indeed, these polymorphisms were also located in NKG2F. Isothermal multiple displacement amplification (IMDA) is used for whole genome amplification (WGA) that can amplify very small genomic DNA templates into microgram with whole genome coverage. This is particularly useful in the cases of limited amount of valuable DNA samples requiring multi-locus genotyping. In this study, we evaluated the quality and applicability of IMDA to genetic studies in terms of sensitivity, efficiency of IMDA re-amplification and stability of IMDA products. The smallest amount of DNA to be effectively amplified by IMDA was 200 pg yielding final DNA of approximately 16 µg within 1.5 h. IMDA could be re-amplified only once (second round of amplification), and could be kept for 5 months at 4°C and more than a year at -20°C without loosing genome coverage. The amplified products were used successfully to setup a multiplex polymerase chain reaction-sequence-specific primer for SNP typing of the NKG2D/F genes. The NKG2D/F multiplex polymerase chain reaction (PCR) contained six PCR mixtures for detecting 10 selected SNPs, including 8 NKG2D/F SNP haplotypes and 2 additional NKG2D coding SNPs. This typing procedure will be applicable in both clinical and research laboratories. Thus, our data provide useful information and limitations for utilization of genome-wide amplification using IMDA and its application for multiplex NKG2D/F typing. © 2013 John Wiley & Sons Ltd.
Pecavar, Verena; Blaschitz, Marion; Hufnagl, Peter; Zeinzinger, Josef; Fiedler, Anita; Allerberger, Franz; Maass, Matthias; Indra, Alexander
2012-06-01
Clostridium difficile, a Gram-positive, spore-forming, anaerobic bacterium, is the main causative agent of hospital-acquired diarrhoea worldwide. In addition to metronidazole and vancomycin, rifaximin, a rifamycin derivative, is a promising antibiotic for the treatment of recurring C. difficile infections (CDI). However, exposure of C. difficile to this antibiotic has led to the development of rifaximin-resistance due to point mutations in the β-subunit of the RNA polymerase (rpoB) gene. In the present study, 348 C. difficile strains with known PCR-ribotypes were investigated for respective single nucleotide polymorphisms (SNPs) within the proposed rpoB hot-spot region by using high-resolution melting (HRM) analysis. This method allows the detection of SNPs by comparing the altered melting behaviour of dsDNA with that of wild-type DNA. Discrimination between wild-type and mutant strains was enhanced by creating heteroduplexes by mixing sample DNA with wild-type DNA, leading to characteristic melting curve shapes from samples containing SNPs in the respective rpoB section. In the present study, we were able to identify 16 different rpoB sequence-types (ST) by sequencing analysis of a 325 bp fragment. The 16 PCR STs displayed a total of 24 different SNPs. Fifteen of these 24 SNPs were located within the proposed 151 bp SNP hot-spot region, resulting in 11 different HRM curve profiles (CP). Eleven SNPs (seven of which were within the proposed hot-spot region) led to amino acid substitutions associated with reduced susceptibility to rifaximin and 13 SNPs (eight of which were within the hot-spot region) were synonymous. This investigation clearly demonstrates that HRM analysis of the proposed SNP hot-spot region in the rpoB gene of C. difficile is a fast and cost-effective method for the identification of C. difficile samples with reduced susceptibility to rifaximin and even allows simultaneous SNP subtyping of the respective C. difficile isolates.
Tomkinson, A.; Raeburn, D.
1996-01-01
1. We have examined the effects of the isoenzyme-selective phosphodiesterase (PDE) inhibitors, vinpocetine (type 1), siguazodan (type 3), rolipram (type 4) and zaprinast (type 5) and the non-selective PDE inhibitor enprofylline on methacholine (MCh) contractile concentration-response curves on guinea-pig and rat isolated ileum. 2. In guinea-pig ileum, vinpocetine (10-300 microM), zaprinast (1-300 microM) and enprofylline (100-1000 microM) produced a concentration-dependent depression of the maximum response (Emax) to MCh only without effect on the MCh EC50 values (rank order of potency: zaprinast > vinpocetine > enprofylline). In contrast, siguazodan (10-300 microM) and rolipram (10-300 microM) produced a rightward displacement of the MCh concentration-response curve (increase in EC50: rank order; rolipram > siguazodan), with effects on the MCh maximum seen only at higher concentrations. 3. In the rat ileum, vinpocetine (10-300 microM), zaprinast (0.1-300 microM) and enprofylline (100-1000 microM) caused depression of the MCh maximum contraction (rank order: zaprinast > vinpocetine > enprofylline). Low concentrations of rolipram and siguazodan had no significant effect on the MCh maximum. In the presence of higher concentrations (> 100 microM) of rolipram and siguazodan, a maximum response was not achieved at the highest concentration of MCh tested. As in the guinea-pig ileum, only rolipram (10-300 microM) and siguazodan (10-300 microM) produced a significant, concentration-dependent, rightward displacement of the MCh concentration-response curve (increase in EC50: rank order: rolipram > siguazodan). 4. In the guinea-pig ileum, isoprenaline (0.1 microM) produced a rightward displacement (approximately 3 fold) of the MCh concentration-response curve, accompanied by a significant depression of the maximum response. Increasing the isoprenaline concentration (1 microM) had no further effect on either parameter. Sodium nitroprusside (SNP, > or = 10 microM) produced a concentration-dependent depression of the MCh maximum without an effect on the EC50. 5. In the rat ileum, isoprenaline (1 microM) produced a concentration-dependent rightward displacement (approximately 2.8 fold) of the MCh concentration-response curve with depression of the MCh maximum at higher (> or = 100 microM) concentrations. SNP produced depression of the MCh maximum at a concentration of 10 microM and above. Effects on the MCh EC50 were seen only at 100 and 300 microM. 6. In guinea-pig ileum, isoprenaline (0.1 microM) in combination with rolipram (10 microM) further increased the MCh EC50 and reduced the MCh maximum. The combination of SNP (10 microM) with zaprinast (0.1 microM) produced no further significant effect than SNP alone. 7. In rat ileum, isoprenaline (1 microM) in combination with rolipram (10 microM) further increased the EC50 and reduced the maximum. SNP (10 microM) had no significant effect on either the MCh maximum or EC50. A combination with zaprinast (1 microM) had no further effect. 8. In conclusion, all the PDE inhibitors tested produced a concentration-dependent inhibition of the MCh concentration-response curve, indicating a modulator role for the PDE isoenzymes in gastrointestinal smooth muscle contractility. The PDE inhibitors that elevate cyclic GMP produced a depression of the MCh maximum response only, whilst those that elevate cyclic AMP produced a rightward displacement of the MCh concentration-response curve. This was confirmed by the use of isoprenaline and SNP. This difference in the type of inhibition produced by these PDE isoenzyme inhibitors may reflect a different intracellular site/mechanism by which the cyclic AMP- and cyclic GMP-activated kinases act functionally to antagonize the contractile response. PMID:8864552
How large a training set is needed to develop a classifier for microarray data?
Dobbin, Kevin K; Zhao, Yingdong; Simon, Richard M
2008-01-01
A common goal of gene expression microarray studies is the development of a classifier that can be used to divide patients into groups with different prognoses, or with different expected responses to a therapy. These types of classifiers are developed on a training set, which is the set of samples used to train a classifier. The question of how many samples are needed in the training set to produce a good classifier from high-dimensional microarray data is challenging. We present a model-based approach to determining the sample size required to adequately train a classifier. It is shown that sample size can be determined from three quantities: standardized fold change, class prevalence, and number of genes or features on the arrays. Numerous examples and important experimental design issues are discussed. The method is adapted to address ex post facto determination of whether the size of a training set used to develop a classifier was adequate. An interactive web site for performing the sample size calculations is provided. We showed that sample size calculations for classifier development from high-dimensional microarray data are feasible, discussed numerous important considerations, and presented examples.
NCBI GEO: archive for functional genomics data sets--10 years on.
Barrett, Tanya; Troup, Dennis B; Wilhite, Stephen E; Ledoux, Pierre; Evangelista, Carlos; Kim, Irene F; Tomashevsky, Maxim; Marshall, Kimberly A; Phillippy, Katherine H; Sherman, Patti M; Muertter, Rolf N; Holko, Michelle; Ayanbule, Oluwabukunmi; Yefanov, Andrey; Soboleva, Alexandra
2011-01-01
A decade ago, the Gene Expression Omnibus (GEO) database was established at the National Center for Biotechnology Information (NCBI). The original objective of GEO was to serve as a public repository for high-throughput gene expression data generated mostly by microarray technology. However, the research community quickly applied microarrays to non-gene-expression studies, including examination of genome copy number variation and genome-wide profiling of DNA-binding proteins. Because the GEO database was designed with a flexible structure, it was possible to quickly adapt the repository to store these data types. More recently, as the microarray community switches to next-generation sequencing technologies, GEO has again adapted to host these data sets. Today, GEO stores over 20,000 microarray- and sequence-based functional genomics studies, and continues to handle the majority of direct high-throughput data submissions from the research community. Multiple mechanisms are provided to help users effectively search, browse, download and visualize the data at the level of individual genes or entire studies. This paper describes recent database enhancements, including new search and data representation tools, as well as a brief review of how the community uses GEO data. GEO is freely accessible at http://www.ncbi.nlm.nih.gov/geo/.
A Quick and Parallel Analytical Method Based on Quantum Dots Labeling for ToRCH-Related Antibodies
NASA Astrophysics Data System (ADS)
Yang, Hao; Guo, Qing; He, Rong; Li, Ding; Zhang, Xueqing; Bao, Chenchen; Hu, Hengyao; Cui, Daxiang
2009-12-01
Quantum dot is a special kind of nanomaterial composed of periodic groups of II-VI, III-V or IV-VI materials. Their high quantum yield, broad absorption with narrow photoluminescence spectra and high resistance to photobleaching, make them become a promising labeling substance in biological analysis. Here, we report a quick and parallel analytical method based on quantum dots for ToRCH-related antibodies including Toxoplasma gondii, Rubella virus, Cytomegalovirus and Herpes simplex virus type 1 (HSV1) and 2 (HSV2). Firstly, we fabricated the microarrays with the five kinds of ToRCH-related antigens and used CdTe quantum dots to label secondary antibody and then analyzed 100 specimens of randomly selected clinical sera from obstetric outpatients. The currently prevalent enzyme-linked immunosorbent assay (ELISA) kits were considered as “golden standard” for comparison. The results show that the quantum dots labeling-based ToRCH microarrays have comparable sensitivity and specificity with ELISA. Besides, the microarrays hold distinct advantages over ELISA test format in detection time, cost, operation and signal stability. Validated by the clinical assay, our quantum dots-based ToRCH microarrays have great potential in the detection of ToRCH-related pathogens.
High density DNA microarrays: algorithms and biomedical applications.
Liu, Wei-Min
2004-08-01
DNA microarrays are devices capable of detecting the identity and abundance of numerous DNA or RNA segments in samples. They are used for analyzing gene expressions, identifying genetic markers and detecting mutations on a genomic scale. The fundamental chemical mechanism of DNA microarrays is the hybridization between probes and targets due to the hydrogen bonds of nucleotide base pairing. Since the cross hybridization is inevitable, and probes or targets may form undesirable secondary or tertiary structures, the microarray data contain noise and depend on experimental conditions. It is crucial to apply proper statistical algorithms to obtain useful signals from noisy data. After we obtained the signals of a large amount of probes, we need to derive the biomedical information such as the existence of a transcript in a cell, the difference of expression levels of a gene in multiple samples, and the type of a genetic marker. Furthermore, after the expression levels of thousands of genes or the genotypes of thousands of single nucleotide polymorphisms are determined, it is usually important to find a small number of genes or markers that are related to a disease, individual reactions to drugs, or other phenotypes. All these applications need careful data analyses and reliable algorithms.
Corella, Dolores; Asensio, Eva M; Coltell, Oscar; Sorlí, José V; Estruch, Ramón; Martínez-González, Miguel Ángel; Salas-Salvadó, Jordi; Castañer, Olga; Arós, Fernando; Lapetra, José; Serra-Majem, Lluís; Gómez-Gracia, Enrique; Ortega-Azorín, Carolina; Fiol, Miquel; Espino, Javier Díez; Díaz-López, Andrés; Fitó, Montserrat; Ros, Emilio; Ordovás, José M
2016-01-07
Circadian rhythms regulate key biological processes influencing metabolic pathways. Disregulation is associated with type 2 diabetes (T2D) and cardiovascular diseases (CVD). Circadian rhythms are generated by a transcriptional autoregulatory feedback loop involving core clock genes. CLOCK (circadian locomotor output cycles protein kaput), one of those core genes, is known to regulate glucose metabolism in rodent models. Cross-sectional studies in humans have reported associations between this locus and obesity, plasma glucose, hypertension and T2D prevalence, supporting its role in cardiovascular risk. However, no longitudinal study has investigated the association between CLOCK gene variation and T2D or CVD incidence. Moreover, although in a previous work we detected a gene-diet interaction between the CLOCK-rs4580704 (C > G) single nucleotide polymorphism (SNP) and monounsaturated (MUFA) intake on insulin resistance, no interventional study has analyzed gene-diet interactions on T2D or CVD outcomes. We analyzed the association between the CLOCK-rs4580704 SNP and incidence of T2D and CVD longitudinally in 7098 PREDIMED trial (ISRCTN35739639) participants after a median 4.8-year follow-up. We also examined modulation by Mediterranean diet (MedDiet) intervention (high in MUFA) on these associations. We observed a significant association between the CLOCK-rs4580704 SNP and T2D incidence in n = 3671 non-T2D PREDIMED participants, with variant allele (G) carriers showing decreased incidence (dominant model) compared with CC homozygotes (HR: 0.69; 95 % CI 0.54-0.87; P = 0.002). This protection was more significant in the MedDiet intervention group (HR: 0.58; 95 % CI 0.43-0.78; P < 0.001) than in the control group (HR: 0.95; 95 % CI 0.63-1.44; P = 0.818). Moreover, we detected a statistically significant interaction (P = 0.018) between CLOCK-rs4580704 SNP and T2D status on stroke. Thus, only in T2D subjects was CLOCK-rs4580704 SNP associated with stroke risk, G-carriers having decreased risk (HR: 0.61; 95 % CI 0.40-0.94; P = 0.024 versus CC) in the multivariable-adjusted model. In agreement with our previous results showing a protective effect of the G-allele against hyperglycemia, we extended our findings by reporting a novel association with lower T2D incidence and also suggesting a dietary modulation. Moreover, we report for the first time an association between a CLOCK polymorphism and stroke in T2D subjects, suggesting that core clock genes may significantly contribute to increased CVD risk in T2D.