snp array analysis: Topics by Science.gov

Sample records for snp array analysis

SNPConvert: SNP Array Standardization and Integration in Livestock Species.

PubMed

Nicolazzi, Ezequiel Luis; Marras, Gabriele; Stella, Alessandra

2016-06-09

One of the main advantages of single nucleotide polymorphism (SNP) array technology is providing genotype calls for a specific number of SNP markers at a relatively low cost. Since its first application in animal genetics, the number of available SNP arrays for each species has been constantly increasing. However, conversely to that observed in whole genome sequence data analysis, SNP array data does not have a common set of file formats or coding conventions for allele calling. Therefore, the standardization and integration of SNP array data from multiple sources have become an obstacle, especially for users with basic or no programming skills. Here, we describe the difficulties related to handling SNP array data, focusing on file formats, SNP allele coding, and mapping. We also present SNPConvert suite, a multi-platform, open-source, and user-friendly set of tools to overcome these issues. This tool, which can be integrated with open-source and open-access tools already available, is a first step towards an integrated system to standardize and integrate any type of raw SNP array data. The tool is available at: https://github. com/nicolazzie/SNPConvert.git.
Combined array CGH plus SNP genome analyses in a single assay for optimized clinical testing

PubMed Central

Wiszniewska, Joanna; Bi, Weimin; Shaw, Chad; Stankiewicz, Pawel; Kang, Sung-Hae L; Pursley, Amber N; Lalani, Seema; Hixson, Patricia; Gambin, Tomasz; Tsai, Chun-hui; Bock, Hans-Georg; Descartes, Maria; Probst, Frank J; Scaglia, Fernando; Beaudet, Arthur L; Lupski, James R; Eng, Christine; Wai Cheung, Sau; Bacino, Carlos; Patel, Ankita

2014-01-01

In clinical diagnostics, both array comparative genomic hybridization (array CGH) and single nucleotide polymorphism (SNP) genotyping have proven to be powerful genomic technologies utilized for the evaluation of developmental delay, multiple congenital anomalies, and neuropsychiatric disorders. Differences in the ability to resolve genomic changes between these arrays may constitute an implementation challenge for clinicians: which platform (SNP vs array CGH) might best detect the underlying genetic cause for the disease in the patient? While only SNP arrays enable the detection of copy number neutral regions of absence of heterozygosity (AOH), they have limited ability to detect single-exon copy number variants (CNVs) due to the distribution of SNPs across the genome. To provide comprehensive clinical testing for both CNVs and copy-neutral AOH, we enhanced our custom-designed high-resolution oligonucleotide array that has exon-targeted coverage of 1860 genes with 60 000 SNP probes, referred to as Chromosomal Microarray Analysis – Comprehensive (CMA-COMP). Of the 3240 cases evaluated by this array, clinically significant CNVs were detected in 445 cases including 21 cases with exonic events. In addition, 162 cases (5.0%) showed at least one AOH region >10 Mb. We demonstrate that even though this array has a lower density of SNP probes than other commercially available SNP arrays, it reliably detected AOH events >10 Mb as well as exonic CNVs beyond the detection limitations of SNP genotyping. Thus, combining SNP probes and exon-targeted array CGH into one platform provides clinically useful genetic screening in an efficient manner. PMID:23695279
[Genetic analysis of two cases with Dandy-Walker deformed fetus].

PubMed

Yao, Juan; Fang, Rong; Shen, Xueping; Shen, Guosong; Zhang, Su

2017-10-10

To explore the genetic etiology of two fetuses with Dandy-Walker malformation using single nucleotide polymorphism microarray (SNP-array). The fetuses and their parents were subjected to G banding karyotype analysis. The fetuses were also subjected to SNP-array analysis. The parents of both fetuses showed a normal karyotype. One fetus has a 46,X,?i(X)(q10), while for another conventional cell culture has failed. SNP-array showed that one fetus carried a 6p25.3p25.2 microdeletion, and another carried a Xp22.33p22.2 deletion and a Yq11.221q11 duplication. The abnormal fragments have involved FOXC1, SHOX and STS genes, which are associated with Dandy-Walker malformation. Alteration of 6p25.3p25.2, Xp22.33p22.2 copy numbers probably underlies the Dandy-Walker syndrome in the fetuses. The disorder may be attributed to abnormal expression of FOXC1, SHOX, and STS genes. SNP-array can provide an important supplement for prenatal diagnosis.
[Prenatal genetic diagnosis for a fetus with atypical neurofibromatosis type 1 microdeletion].

PubMed

Lin, Shaobin; Wu, Jianzhu; Zhang, Zhiqiang; Ji, Yuanjun; Fang, Qun; Chen, Baojiang; Luo, Yanmin

2016-04-01

To analyze the correlation between atypical neurofibromatosis type 1(NF1) microdeletion and fetal phenotype. Fetal blood sampling was carried out for a woman bearing a fetus with talipes equinovarus. G-banded karyotyping and single nucleotide polymorphism array (SNP-array) were performed on the fetal blood sample. Fluorescence in situ hybridization (FISH) was used to confirm the result of SNP array analysis. FISH assay was also carried out on peripheral blood specimens from the parents to ascertain the origin of mutation. The karyotype of fetus was found to be 46, XY by G-banding analysis. However, a 3.132 Mb microdeletion was detected in chromosome region 17q11.2 by SNP array, which overlaped with the region of NF1 microdeletion syndrome. Analyzing of the specimens from the fetus and its parents with FISH has confirmed it to be a de novo deletion. Talipes equinovarus may be an abnormal sonographic feature of fetus with atypical NF1 microdeletion which can be accurately diagnosed with SNP array.
Discovery of 100K SNP array and its utilization in sugarcane

USDA-ARS?s Scientific Manuscript database

Next generation sequencing (NGS) enable us to identify thousands of single nucleotide polymorphisms (SNPs) marker for genotyping and fingerprinting. However, the process requires very precise bioinformatics analysis and filtering process. High throughput SNP array with predefined genomic location co...
Development and validation of a high density SNP genotyping array for Atlantic salmon (Salmo salar).

PubMed

Houston, Ross D; Taggart, John B; Cézard, Timothé; Bekaert, Michaël; Lowe, Natalie R; Downing, Alison; Talbot, Richard; Bishop, Stephen C; Archibald, Alan L; Bron, James E; Penman, David J; Davassi, Alessandro; Brew, Fiona; Tinch, Alan E; Gharbi, Karim; Hamilton, Alastair

2014-02-06

Dense single nucleotide polymorphism (SNP) genotyping arrays provide extensive information on polymorphic variation across the genome of species of interest. Such information can be used in studies of the genetic architecture of quantitative traits and to improve the accuracy of selection in breeding programs. In Atlantic salmon (Salmo salar), these goals are currently hampered by the lack of a high-density SNP genotyping platform. Therefore, the aim of the study was to develop and test a dense Atlantic salmon SNP array. SNP discovery was performed using extensive deep sequencing of Reduced Representation (RR-Seq), Restriction site-Associated DNA (RAD-Seq) and mRNA (RNA-Seq) libraries derived from farmed and wild Atlantic salmon samples (n = 283) resulting in the discovery of > 400 K putative SNPs. An Affymetrix Axiom® myDesign Custom Array was created and tested on samples of animals of wild and farmed origin (n = 96) revealing a total of 132,033 polymorphic SNPs with high call rate, good cluster separation on the array and stable Mendelian inheritance in our sample. At least 38% of these SNPs are from transcribed genomic regions and therefore more likely to include functional variants. Linkage analysis utilising the lack of male recombination in salmonids allowed the mapping of 40,214 SNPs distributed across all 29 pairs of chromosomes, highlighting the extensive genome-wide coverage of the SNPs. An identity-by-state clustering analysis revealed that the array can clearly distinguish between fish of different origins, within and between farmed and wild populations. Finally, Y-chromosome-specific probes included on the array provide an accurate molecular genetic test for sex. This manuscript describes the first high-density SNP genotyping array for Atlantic salmon. This array will be publicly available and is likely to be used as a platform for high-resolution genetics research into traits of evolutionary and economic importance in salmonids and in aquaculture breeding programs via genomic selection.
Development and validation of a high density SNP genotyping array for Atlantic salmon (Salmo salar)

PubMed Central

2014-01-01

Background Dense single nucleotide polymorphism (SNP) genotyping arrays provide extensive information on polymorphic variation across the genome of species of interest. Such information can be used in studies of the genetic architecture of quantitative traits and to improve the accuracy of selection in breeding programs. In Atlantic salmon (Salmo salar), these goals are currently hampered by the lack of a high-density SNP genotyping platform. Therefore, the aim of the study was to develop and test a dense Atlantic salmon SNP array. Results SNP discovery was performed using extensive deep sequencing of Reduced Representation (RR-Seq), Restriction site-Associated DNA (RAD-Seq) and mRNA (RNA-Seq) libraries derived from farmed and wild Atlantic salmon samples (n = 283) resulting in the discovery of > 400 K putative SNPs. An Affymetrix Axiom® myDesign Custom Array was created and tested on samples of animals of wild and farmed origin (n = 96) revealing a total of 132,033 polymorphic SNPs with high call rate, good cluster separation on the array and stable Mendelian inheritance in our sample. At least 38% of these SNPs are from transcribed genomic regions and therefore more likely to include functional variants. Linkage analysis utilising the lack of male recombination in salmonids allowed the mapping of 40,214 SNPs distributed across all 29 pairs of chromosomes, highlighting the extensive genome-wide coverage of the SNPs. An identity-by-state clustering analysis revealed that the array can clearly distinguish between fish of different origins, within and between farmed and wild populations. Finally, Y-chromosome-specific probes included on the array provide an accurate molecular genetic test for sex. Conclusions This manuscript describes the first high-density SNP genotyping array for Atlantic salmon. This array will be publicly available and is likely to be used as a platform for high-resolution genetics research into traits of evolutionary and economic importance in salmonids and in aquaculture breeding programs via genomic selection. PMID:24524230
Analysis of population structure and genetic history of cattle breeds based on high-density SNP data

USDA-ARS?s Scientific Manuscript database

Advances in single nucleotide polymorphism (SNP) genotyping microarrays have facilitated a new understanding of population structure and evolutionary history for several species. Most existing studies in livestock were based on low density SNP arrays. The first wave of low density SNP studies on cat...
A powerful tool for genome analysis in maize: development and evaluation of the high density 600 k SNP genotyping array.

PubMed

Unterseer, Sandra; Bauer, Eva; Haberer, Georg; Seidel, Michael; Knaak, Carsten; Ouzunova, Milena; Meitinger, Thomas; Strom, Tim M; Fries, Ruedi; Pausch, Hubert; Bertani, Christofer; Davassi, Alessandro; Mayer, Klaus Fx; Schön, Chris-Carolin

2014-09-29

High density genotyping data are indispensable for genomic analyses of complex traits in animal and crop species. Maize is one of the most important crop plants worldwide, however a high density SNP genotyping array for analysis of its large and highly dynamic genome was not available so far. We developed a high density maize SNP array composed of 616,201 variants (SNPs and small indels). Initially, 57 M variants were discovered by sequencing 30 representative temperate maize lines and then stringently filtered for sequence quality scores and predicted conversion performance on the array resulting in the selection of 1.2 M polymorphic variants assayed on two screening arrays. To identify high-confidence variants, 285 DNA samples from a broad genetic diversity panel of worldwide maize lines including the samples used for sequencing, important founder lines for European maize breeding, hybrids, and proprietary samples with European, US, semi-tropical, and tropical origin were used for experimental validation. We selected 616 k variants according to their performance during validation, support of genotype calls through sequencing data, and physical distribution for further analysis and for the design of the commercially available Affymetrix® Axiom® Maize Genotyping Array. This array is composed of 609,442 SNPs and 6,759 indels. Among these are 116,224 variants in coding regions and 45,655 SNPs of the Illumina® MaizeSNP50 BeadChip for study comparison. In a subset of 45,974 variants, apart from the target SNP additional off-target variants are detected, which show only a minor bias towards intermediate allele frequencies. We performed principal coordinate and admixture analyses to determine the ability of the array to detect and resolve population structure and investigated the extent of LD within a worldwide validation panel. The high density Affymetrix® Axiom® Maize Genotyping Array is optimized for European and American temperate maize and was developed based on a diverse sample panel by applying stringent quality filter criteria to ensure its suitability for a broad range of applications. With 600 k variants it is the largest currently publically available genotyping array in crop species.
Development and validation of a 20K single nucleotide polymorphism (SNP) whole genome genotyping array for apple (Malus × domestica Borkh).

PubMed

Bianco, Luca; Cestaro, Alessandro; Sargent, Daniel James; Banchi, Elisa; Derdak, Sophia; Di Guardo, Mario; Salvi, Silvio; Jansen, Johannes; Viola, Roberto; Gut, Ivo; Laurens, Francois; Chagné, David; Velasco, Riccardo; van de Weg, Eric; Troggio, Michela

2014-01-01

High-density SNP arrays for genome-wide assessment of allelic variation have made high resolution genetic characterization of crop germplasm feasible. A medium density array for apple, the IRSC 8K SNP array, has been successfully developed and used for screens of bi-parental populations. However, the number of robust and well-distributed markers contained on this array was not sufficient to perform genome-wide association analyses in wider germplasm sets, or Pedigree-Based Analysis at high precision, because of rapid decay of linkage disequilibrium. We describe the development of an Illumina Infinium array targeting 20K SNPs. The SNPs were predicted from re-sequencing data derived from the genomes of 13 Malus × domestica apple cultivars and one accession belonging to a crab apple species (M. micromalus). A pipeline for SNP selection was devised that avoided the pitfalls associated with the inclusion of paralogous sequence variants, supported the construction of robust multi-allelic SNP haploblocks and selected up to 11 entries within narrow genomic regions of ±5 kb, termed focal points (FPs). Broad genome coverage was attained by placing FPs at 1 cM intervals on a consensus genetic map, complementing them with FPs to enrich the ends of each of the chromosomes, and by bridging physical intervals greater than 400 Kbps. The selection also included ∼3.7K validated SNPs from the IRSC 8K array. The array has already been used in other studies where ∼15.8K SNP markers were mapped with an average of ∼6.8K SNPs per full-sib family. The newly developed array with its high density of polymorphic validated SNPs is expected to be of great utility for Pedigree-Based Analysis and Genomic Selection. It will also be a valuable tool to help dissect the genetic mechanisms controlling important fruit quality traits, and to aid the identification of marker-trait associations suitable for the application of Marker Assisted Selection in apple breeding programs.
Development and Validation of a 20K Single Nucleotide Polymorphism (SNP) Whole Genome Genotyping Array for Apple (Malus × domestica Borkh)

PubMed Central

Bianco, Luca; Cestaro, Alessandro; Sargent, Daniel James; Banchi, Elisa; Derdak, Sophia; Di Guardo, Mario; Salvi, Silvio; Jansen, Johannes; Viola, Roberto; Gut, Ivo; Laurens, Francois; Chagné, David; Velasco, Riccardo; van de Weg, Eric; Troggio, Michela

2014-01-01

High-density SNP arrays for genome-wide assessment of allelic variation have made high resolution genetic characterization of crop germplasm feasible. A medium density array for apple, the IRSC 8K SNP array, has been successfully developed and used for screens of bi-parental populations. However, the number of robust and well-distributed markers contained on this array was not sufficient to perform genome-wide association analyses in wider germplasm sets, or Pedigree-Based Analysis at high precision, because of rapid decay of linkage disequilibrium. We describe the development of an Illumina Infinium array targeting 20K SNPs. The SNPs were predicted from re-sequencing data derived from the genomes of 13 Malus × domestica apple cultivars and one accession belonging to a crab apple species (M. micromalus). A pipeline for SNP selection was devised that avoided the pitfalls associated with the inclusion of paralogous sequence variants, supported the construction of robust multi-allelic SNP haploblocks and selected up to 11 entries within narrow genomic regions of ±5 kb, termed focal points (FPs). Broad genome coverage was attained by placing FPs at 1 cM intervals on a consensus genetic map, complementing them with FPs to enrich the ends of each of the chromosomes, and by bridging physical intervals greater than 400 Kbps. The selection also included ∼3.7K validated SNPs from the IRSC 8K array. The array has already been used in other studies where ∼15.8K SNP markers were mapped with an average of ∼6.8K SNPs per full-sib family. The newly developed array with its high density of polymorphic validated SNPs is expected to be of great utility for Pedigree-Based Analysis and Genomic Selection. It will also be a valuable tool to help dissect the genetic mechanisms controlling important fruit quality traits, and to aid the identification of marker-trait associations suitable for the application of Marker Assisted Selection in apple breeding programs. PMID:25303088
Tumor Touch Imprints as Source for Whole Genome Analysis of Neuroblastoma Tumors

PubMed Central

Brunner, Clemens; Brunner-Herglotz, Bettina; Ziegler, Andrea; Frech, Christian; Amann, Gabriele; Ladenstein, Ruth; Ambros, Inge M.; Ambros, Peter F.

2016-01-01

Introduction Tumor touch imprints (TTIs) are routinely used for the molecular diagnosis of neuroblastomas by interphase fluorescence in-situ hybridization (I-FISH). However, in order to facilitate a comprehensive, up-to-date molecular diagnosis of neuroblastomas and to identify new markers to refine risk and therapy stratification methods, whole genome approaches are needed. We examined the applicability of an ultra-high density SNP array platform that identifies copy number changes of varying sizes down to a few exons for the detection of genomic changes in tumor DNA extracted from TTIs. Material and Methods DNAs were extracted from TTIs of 46 neuroblastoma and 4 other pediatric tumors. The DNAs were analyzed on the Cytoscan HD SNP array platform to evaluate numerical and structural genomic aberrations. The quality of the data obtained from TTIs was compared to that from randomly chosen fresh or fresh frozen solid tumors (n = 212) and I-FISH validation was performed. Results SNP array profiles were obtained from 48 (out of 50) TTI DNAs of which 47 showed genomic aberrations. The high marker density allowed for single gene analysis, e.g. loss of nine exons in the ATRX gene and the visualization of chromothripsis. Data quality was comparable to fresh or fresh frozen tumor SNP profiles. SNP array results were confirmed by I-FISH. Conclusion TTIs are an excellent source for SNP array processing with the advantage of simple handling, distribution and storage of tumor tissue on glass slides. The minimal amount of tumor tissue needed to analyze whole genomes makes TTIs an economic surrogate source in the molecular diagnostic work up of tumor samples. PMID:27560999
A Discovery Resource of Rare Copy Number Variations in Individuals with Autism Spectrum Disorder

PubMed Central

Prasad, Aparna; Merico, Daniele; Thiruvahindrapuram, Bhooma; Wei, John; Lionel, Anath C.; Sato, Daisuke; Rickaby, Jessica; Lu, Chao; Szatmari, Peter; Roberts, Wendy; Fernandez, Bridget A.; Marshall, Christian R.; Hatchwell, Eli; Eis, Peggy S.; Scherer, Stephen W.

2012-01-01

The identification of rare inherited and de novo copy number variations (CNVs) in human subjects has proven a productive approach to highlight risk genes for autism spectrum disorder (ASD). A variety of microarrays are available to detect CNVs, including single-nucleotide polymorphism (SNP) arrays and comparative genomic hybridization (CGH) arrays. Here, we examine a cohort of 696 unrelated ASD cases using a high-resolution one-million feature CGH microarray, the majority of which were previously genotyped with SNP arrays. Our objective was to discover new CNVs in ASD cases that were not detected by SNP microarray analysis and to delineate novel ASD risk loci via combined analysis of CGH and SNP array data sets on the ASD cohort and CGH data on an additional 1000 control samples. Of the 615 ASD cases analyzed on both SNP and CGH arrays, we found that 13,572 of 21,346 (64%) of the CNVs were exclusively detected by the CGH array. Several of the CGH-specific CNVs are rare in population frequency and impact previously reported ASD genes (e.g., NRXN1, GRM8, DPYD), as well as novel ASD candidate genes (e.g., CIB2, DAPP1, SAE1), and all were inherited except for a de novo CNV in the GPHN gene. A functional enrichment test of gene-sets in ASD cases over controls revealed nucleotide metabolism as a potential novel pathway involved in ASD, which includes several candidate genes for follow-up (e.g., DPYD, UPB1, UPP1, TYMP). Finally, this extensively phenotyped and genotyped ASD clinical cohort serves as an invaluable resource for the next step of genome sequencing for complete genetic variation detection. PMID:23275889
Fine mapping of copy number variations on two cattle genome assemblies using high density SNP array

USDA-ARS?s Scientific Manuscript database

Btau_4.0 and UMD3.1 are two distinct cattle reference genome assemblies. In our previous study using the low density BovineSNP50 array, we reported a copy number variation (CNV) analysis on Btau_4.0 with 521 animals of 21 cattle breeds, yielding 682 CNV regions with a total length of 139.8 megabases...
R classes and methods for SNP array data.

PubMed

Scharpf, Robert B; Ruczinski, Ingo

2010-01-01

The Bioconductor project is an "open source and open development software project for the analysis and comprehension of genomic data" (1), primarily based on the R programming language. Infrastructure packages, such as Biobase, are maintained by Bioconductor core developers and serve several key roles to the broader community of Bioconductor software developers and users. In particular, Biobase introduces an S4 class, the eSet, for high-dimensional assay data. Encapsulating the assay data as well as meta-data on the samples, features, and experiment in the eSet class definition ensures propagation of the relevant sample and feature meta-data throughout an analysis. Extending the eSet class promotes code reuse through inheritance as well as interoperability with other R packages and is less error-prone. Recently proposed class definitions for high-throughput SNP arrays extend the eSet class. This chapter highlights the advantages of adopting and extending Biobase class definitions through a working example of one implementation of classes for the analysis of high-throughput SNP arrays.
Vitis Phylogenomics: Hybridization Intensities from a SNP Array Outperform Genotype Calls

PubMed Central

Miller, Allison J.; Matasci, Naim; Schwaninger, Heidi; Aradhya, Mallikarjuna K.; Prins, Bernard; Zhong, Gan-Yuan; Simon, Charles; Buckler, Edward S.; Myles, Sean

2013-01-01

Understanding relationships among species is a fundamental goal of evolutionary biology. Single nucleotide polymorphisms (SNPs) identified through next generation sequencing and related technologies enable phylogeny reconstruction by providing unprecedented numbers of characters for analysis. One approach to SNP-based phylogeny reconstruction is to identify SNPs in a subset of individuals, and then to compile SNPs on an array that can be used to genotype additional samples at hundreds or thousands of sites simultaneously. Although powerful and efficient, this method is subject to ascertainment bias because applying variation discovered in a representative subset to a larger sample favors identification of SNPs with high minor allele frequencies and introduces bias against rare alleles. Here, we demonstrate that the use of hybridization intensity data, rather than genotype calls, reduces the effects of ascertainment bias. Whereas traditional SNP calls assess known variants based on diversity housed in the discovery panel, hybridization intensity data survey variation in the broader sample pool, regardless of whether those variants are present in the initial SNP discovery process. We apply SNP genotype and hybridization intensity data derived from the Vitis9kSNP array developed for grape to show the effects of ascertainment bias and to reconstruct evolutionary relationships among Vitis species. We demonstrate that phylogenies constructed using hybridization intensities suffer less from the distorting effects of ascertainment bias, and are thus more accurate than phylogenies based on genotype calls. Moreover, we reconstruct the phylogeny of the genus Vitis using hybridization data, show that North American subgenus Vitis species are monophyletic, and resolve several previously poorly known relationships among North American species. This study builds on earlier work that applied the Vitis9kSNP array to evolutionary questions within Vitis vinifera and has general implications for addressing ascertainment bias in array-enabled phylogeny reconstruction. PMID:24236035
Development and Validation of a High-Density SNP Genotyping Array for African Oil Palm.

PubMed

Kwong, Qi Bin; Teh, Chee Keng; Ong, Ai Ling; Heng, Huey Ying; Lee, Heng Leng; Mohamed, Mohaimi; Low, Joel Zi-Bin; Apparow, Sukganah; Chew, Fook Tim; Mayes, Sean; Kulaveerasingam, Harikrishna; Tammi, Martti; Appleton, David Ross

2016-08-01

High-density single nucleotide polymorphism (SNP) genotyping arrays are powerful tools that can measure the level of genetic polymorphism within a population. To develop a whole-genome SNP array for oil palms, SNP discovery was performed using deep resequencing of eight libraries derived from 132 Elaeis guineensis and Elaeis oleifera palms belonging to 59 origins, resulting in the discovery of >3 million putative SNPs. After SNP filtering, the Illumina OP200K custom array was built with 170 860 successful probes. Phenetic clustering analysis revealed that the array could distinguish between palms of different origins in a way consistent with pedigree records. Genome-wide linkage disequilibrium declined more slowly for the commercial populations (ranging from 120 kb at r(2) = 0.43 to 146 kb at r(2) = 0.50) when compared with the semi-wild populations (19.5 kb at r(2) = 0.22). Genetic fixation mapping comparing the semi-wild and commercial population identified 321 selective sweeps. A genome-wide association study (GWAS) detected a significant peak on chromosome 2 associated with the polygenic component of the shell thickness trait (based on the trait shell-to-fruit; S/F %) in tenera palms. Testing of a genomic selection model on the same trait resulted in good prediction accuracy (r = 0.65) with 42% of the S/F % variation explained. The first high-density SNP genotyping array for oil palm has been developed and shown to be robust for use in genetic studies and with potential for developing early trait prediction to shorten the oil palm breeding cycle. Copyright © 2016 The Author. Published by Elsevier Inc. All rights reserved.
Development of a 63K SNP Array for Cotton and High-Density Mapping of Intraspecific and Interspecific Populations of Gossypium spp.

PubMed Central

Hulse-Kemp, Amanda M.; Lemm, Jana; Plieske, Joerg; Ashrafi, Hamid; Buyyarapu, Ramesh; Fang, David D.; Frelichowski, James; Giband, Marc; Hague, Steve; Hinze, Lori L.; Kochan, Kelli J.; Riggs, Penny K.; Scheffler, Jodi A.; Udall, Joshua A.; Ulloa, Mauricio; Wang, Shirley S.; Zhu, Qian-Hao; Bag, Sumit K.; Bhardwaj, Archana; Burke, John J.; Byers, Robert L.; Claverie, Michel; Gore, Michael A.; Harker, David B.; Islam, Md S.; Jenkins, Johnie N.; Jones, Don C.; Lacape, Jean-Marc; Llewellyn, Danny J.; Percy, Richard G.; Pepper, Alan E.; Poland, Jesse A.; Mohan Rai, Krishan; Sawant, Samir V.; Singh, Sunil Kumar; Spriggs, Andrew; Taylor, Jen M.; Wang, Fei; Yourstone, Scott M.; Zheng, Xiuting; Lawley, Cindy T.; Ganal, Martin W.; Van Deynze, Allen; Wilson, Iain W.; Stelly, David M.

2015-01-01

High-throughput genotyping arrays provide a standardized resource for plant breeding communities that are useful for a breadth of applications including high-density genetic mapping, genome-wide association studies (GWAS), genomic selection (GS), complex trait dissection, and studying patterns of genomic diversity among cultivars and wild accessions. We have developed the CottonSNP63K, an Illumina Infinium array containing assays for 45,104 putative intraspecific single nucleotide polymorphism (SNP) markers for use within the cultivated cotton species Gossypium hirsutum L. and 17,954 putative interspecific SNP markers for use with crosses of other cotton species with G. hirsutum. The SNPs on the array were developed from 13 different discovery sets that represent a diverse range of G. hirsutum germplasm and five other species: G. barbadense L., G. tomentosum Nuttal × Seemann, G. mustelinum Miers × Watt, G. armourianum Kearny, and G. longicalyx J.B. Hutchinson and Lee. The array was validated with 1,156 samples to generate cluster positions to facilitate automated analysis of 38,822 polymorphic markers. Two high-density genetic maps containing a total of 22,829 SNPs were generated for two F2 mapping populations, one intraspecific and one interspecific, and 3,533 SNP markers were co-occurring in both maps. The produced intraspecific genetic map is the first saturated map that associates into 26 linkage groups corresponding to the number of cotton chromosomes for a cross between two G. hirsutum lines. The linkage maps were shown to have high levels of collinearity to the JGI G. raimondii Ulbrich reference genome sequence. The CottonSNP63K array, cluster file and associated marker sequences constitute a major new resource for the global cotton research community. PMID:25908569
Development and evaluation of the first high-throughput SNP array for common carp (Cyprinus carpio)

PubMed Central

2014-01-01

Background A large number of single nucleotide polymorphisms (SNPs) have been identified in common carp (Cyprinus carpio) but, as yet, no high-throughput genotyping platform is available for this species. C. carpio is an important aquaculture species that accounts for nearly 14% of freshwater aquaculture production worldwide. We have developed an array for C. carpio with 250,000 SNPs and evaluated its performance using samples from various strains of C. carpio. Results The SNPs used on the array were selected from two resources: the transcribed sequences from RNA-seq data of four strains of C. carpio, and the genome re-sequencing data of five strains of C. carpio. The 250,000 SNPs on the resulting array are distributed evenly across the reference C.carpio genome with an average spacing of 6.6 kb. To evaluate the SNP array, 1,072 C. carpio samples were collected and tested. Of the 250,000 SNPs on the array, 185,150 (74.06%) were found to be polymorphic sites. Genotyping accuracy was checked using genotyping data from a group of full-siblings and their parents, and over 99.8% of the qualified SNPs were found to be reliable. Analysis of the linkage disequilibrium on all samples and on three domestic C.carpio strains revealed that the latter had the longer haplotype blocks. We also evaluated our SNP array on 80 samples from eight species related to C. carpio, with from 53,526 to 71,984 polymorphic SNPs. An identity by state analysis divided all the samples into three clusters; most of the C. carpio strains formed the largest cluster. Conclusions The Carp SNP array described here is the first high-throughput genotyping platform for C. carpio. Our evaluation of this array indicates that it will be valuable for farmed carp and for genetic and population biology studies in C. carpio and related species. PMID:24762296
Development and evaluation of the first high-throughput SNP array for common carp (Cyprinus carpio).

PubMed

Xu, Jian; Zhao, Zixia; Zhang, Xiaofeng; Zheng, Xianhu; Li, Jiongtang; Jiang, Yanliang; Kuang, Youyi; Zhang, Yan; Feng, Jianxin; Li, Chuangju; Yu, Juhua; Li, Qiang; Zhu, Yuanyuan; Liu, Yuanyuan; Xu, Peng; Sun, Xiaowen

2014-04-24

A large number of single nucleotide polymorphisms (SNPs) have been identified in common carp (Cyprinus carpio) but, as yet, no high-throughput genotyping platform is available for this species. C. carpio is an important aquaculture species that accounts for nearly 14% of freshwater aquaculture production worldwide. We have developed an array for C. carpio with 250,000 SNPs and evaluated its performance using samples from various strains of C. carpio. The SNPs used on the array were selected from two resources: the transcribed sequences from RNA-seq data of four strains of C. carpio, and the genome re-sequencing data of five strains of C. carpio. The 250,000 SNPs on the resulting array are distributed evenly across the reference C.carpio genome with an average spacing of 6.6 kb. To evaluate the SNP array, 1,072 C. carpio samples were collected and tested. Of the 250,000 SNPs on the array, 185,150 (74.06%) were found to be polymorphic sites. Genotyping accuracy was checked using genotyping data from a group of full-siblings and their parents, and over 99.8% of the qualified SNPs were found to be reliable. Analysis of the linkage disequilibrium on all samples and on three domestic C.carpio strains revealed that the latter had the longer haplotype blocks. We also evaluated our SNP array on 80 samples from eight species related to C. carpio, with from 53,526 to 71,984 polymorphic SNPs. An identity by state analysis divided all the samples into three clusters; most of the C. carpio strains formed the largest cluster. The Carp SNP array described here is the first high-throughput genotyping platform for C. carpio. Our evaluation of this array indicates that it will be valuable for farmed carp and for genetic and population biology studies in C. carpio and related species.

Surface invasive cleavage assay on a maskless light-directed diamond DNA microarray for genome-wide human SNP mapping.

PubMed

Nie, Bei; Yang, Min; Fu, Weiling; Liang, Zhiqing

2015-07-07

The surface invasive cleavage assay, because of its innate accuracy and ability for self-signal amplification, provides a potential route for the mapping of hundreds of thousands of human SNP sites. However, its performance on a high density DNA array has not yet been established, due to the unusual "hairpin" probe design on the microarray and the lack of chemical stability of commercially available substrates. Here we present an applicable method to implement a nanocrystalline diamond thin film as an alternative substrate for fabricating an addressable DNA array using maskless light-directed photochemistry, producing the most chemically stable and biocompatible system for genetic analysis and enzymatic reactions. The surface invasive cleavage reaction, followed by degenerated primer ligation and post-rolling circle amplification is consecutively performed on the addressable diamond DNA array, accurately mapping SNP sites from PCR-amplified human genomic target DNA. Furthermore, a specially-designed DNA array containing dual probes in the same pixel is fabricated by following a reverse light-directed DNA synthesis protocol. This essentially enables us to decipher thousands of SNP alleles in a single-pot reaction by the simple addition of enzyme, target and reaction buffers.
Construction of a versatile SNP array for pyramiding useful genes of rice.

PubMed

Kurokawa, Yusuke; Noda, Tomonori; Yamagata, Yoshiyuki; Angeles-Shim, Rosalyn; Sunohara, Hidehiko; Uehara, Kanako; Furuta, Tomoyuki; Nagai, Keisuke; Jena, Kshirod Kumar; Yasui, Hideshi; Yoshimura, Atsushi; Ashikari, Motoyuki; Doi, Kazuyuki

2016-01-01

DNA marker-assisted selection (MAS) has become an indispensable component of breeding. Single nucleotide polymorphisms (SNP) are the most frequent polymorphism in the rice genome. However, SNP markers are not readily employed in MAS because of limitations in genotyping platforms. Here the authors report a Golden Gate SNP array that targets specific genes controlling yield-related traits and biotic stress resistance in rice. As a first step, the SNP genotypes were surveyed in 31 parental varieties using the Affymetrix Rice 44K SNP microarray. The haplotype information for 16 target genes was then converted to the Golden Gate platform with 143-plex markers. Haplotypes for the 14 useful allele are unique and can discriminate among all other varieties. The genotyping consistency between the Affymetrix microarray and the Golden Gate array was 92.8%, and the accuracy of the Golden Gate array was confirmed in 3 F2 segregating populations. The concept of the haplotype-based selection by using the constructed SNP array was proofed. Copyright © 2015 The Authors. Published by Elsevier Ireland Ltd.. All rights reserved.
SNPchiMp v.3: integrating and standardizing single nucleotide polymorphism data for livestock species.

PubMed

Nicolazzi, Ezequiel L; Caprera, Andrea; Nazzicari, Nelson; Cozzi, Paolo; Strozzi, Francesco; Lawley, Cindy; Pirani, Ali; Soans, Chandrasen; Brew, Fiona; Jorjani, Hossein; Evans, Gary; Simpson, Barry; Tosser-Klopp, Gwenola; Brauning, Rudiger; Williams, John L; Stella, Alessandra

2015-04-10

In recent years, the use of genomic information in livestock species for genetic improvement, association studies and many other fields has become routine. In order to accommodate different market requirements in terms of genotyping cost, manufacturers of single nucleotide polymorphism (SNP) arrays, private companies and international consortia have developed a large number of arrays with different content and different SNP density. The number of currently available SNP arrays differs among species: ranging from one for goats to more than ten for cattle, and the number of arrays available is increasing rapidly. However, there is limited or no effort to standardize and integrate array- specific (e.g. SNP IDs, allele coding) and species-specific (i.e. past and current assemblies) SNP information. Here we present SNPchiMp v.3, a solution to these issues for the six major livestock species (cow, pig, horse, sheep, goat and chicken). Original data was collected directly from SNP array producers and specific international genome consortia, and stored in a MySQL database. The database was then linked to an open-access web tool and to public databases. SNPchiMp v.3 ensures fast access to the database (retrieving within/across SNP array data) and the possibility of annotating SNP array data in a user-friendly fashion. This platform allows easy integration and standardization, and it is aimed at both industry and research. It also enables users to easily link the information available from the array producer with data in public databases, without the need of additional bioinformatics tools or pipelines. In recognition of the open-access use of Ensembl resources, SNPchiMp v.3 was officially credited as an Ensembl E!mpowered tool. Availability at http://bioinformatics.tecnoparco.org/SNPchimp.
Single Nucleotide Polymorphism Array Analysis of Bone Marrow Failure Patients Reveals Characteristic Patterns of Genetic Changes

PubMed Central

Babushok, Daria V.; Xie, Hongbo M.; Roth, Jacquelyn J.; Perdigones, Nieves; Olson, Timothy S.; Cockroft, Joshua D.; Gai, Xiaowu; Perin, Juan C.; Li, Yimei; Paessler, Michele E.; Hakonarson, Hakon; Podsakoff, Gregory M.; Mason, Philip J.; Biegel, Jaclyn A.; Bessler, Monica

2013-01-01

Summary The bone marrow failure syndromes (BMFS) are a heterogeneous group of rare blood disorders characterized by inadequate haematopoiesis, clonal evolution, and increased risk of leukaemia. Single nucleotide polymorphism arrays (SNP-A) have been proposed as a tool for surveillance of clonal evolution in BMFS. To better understand the natural history of BMFS and to assess the clinical utility of SNP-A in these disorders, we analysed 124 SNP-A from a comprehensively characterized cohort of 91 patients at our BMFS centre. SNP-A were correlated with medical histories, haematopathology, cytogenetic and molecular data. To assess clonal evolution, longitudinal analysis of SNP-A was performed in 25 patients. We found that acquired copy number-neutral loss of heterozygosity (CN-LOH) was significantly more frequent in acquired aplastic anaemia (aAA) than in other BMFS (odds ratio 12.2, p<0.01). Homozygosity by descent was most common in congenital BMFS, frequently unmasking autosomal recessive mutations. Copy number variants (CNVs) were frequently polymorphic, and we identified CNVs enriched in neutropenia and aAA. Our results suggest that acquired CN-LOH is a general phenomenon in aAA that is probably mechanistically and prognostically distinct from typical CN-LOH of myeloid malignancies. Our analysis of clinical utility of SNP-A shows the highest yield of detecting new clonal haematopoiesis at diagnosis and at relapse. PMID:24116929
Single nucleotide polymorphism array analysis of bone marrow failure patients reveals characteristic patterns of genetic changes.

PubMed

Babushok, Daria V; Xie, Hongbo M; Roth, Jacquelyn J; Perdigones, Nieves; Olson, Timothy S; Cockroft, Joshua D; Gai, Xiaowu; Perin, Juan C; Li, Yimei; Paessler, Michele E; Hakonarson, Hakon; Podsakoff, Gregory M; Mason, Philip J; Biegel, Jaclyn A; Bessler, Monica

2014-01-01

The bone marrow failure syndromes (BMFS) are a heterogeneous group of rare blood disorders characterized by inadequate haematopoiesis, clonal evolution, and increased risk of leukaemia. Single nucleotide polymorphism arrays (SNP-A) have been proposed as a tool for surveillance of clonal evolution in BMFS. To better understand the natural history of BMFS and to assess the clinical utility of SNP-A in these disorders, we analysed 124 SNP-A from a comprehensively characterized cohort of 91 patients at our BMFS centre. SNP-A were correlated with medical histories, haematopathology, cytogenetic and molecular data. To assess clonal evolution, longitudinal analysis of SNP-A was performed in 25 patients. We found that acquired copy number-neutral loss of heterozygosity (CN-LOH) was significantly more frequent in acquired aplastic anaemia (aAA) than in other BMFS (odds ratio 12·2, P < 0·01). Homozygosity by descent was most common in congenital BMFS, frequently unmasking autosomal recessive mutations. Copy number variants (CNVs) were frequently polymorphic, and we identified CNVs enriched in neutropenia and aAA. Our results suggest that acquired CN-LOH is a general phenomenon in aAA that is probably mechanistically and prognostically distinct from typical CN-LOH of myeloid malignancies. Our analysis of clinical utility of SNP-A shows the highest yield of detecting new clonal haematopoiesis at diagnosis and at relapse. © 2013 John Wiley & Sons Ltd.
Analysis of genetic diversity using SNP markers in oat

USDA-ARS?s Scientific Manuscript database

A large-scale single nucleotide polymorphism (SNP) discovery was carried out in cultivated oat using Roche 454 sequencing methods. DNA sequences were generated from cDNAs originating from a panel of 20 diverse oat cultivars, and from Diversity Array Technology (DArT) genomic complexity reductions fr...
Glossary

MedlinePlus

... array, and oligo/SNP combination array. Related terms: comparative genomic hybridization ; copy number variant ; SNP array chromosome ... for example, the AB blood groups in humans comparative genomic hybridization Method in which two DNA samples ( ...
A low-density SNP array for analyzing differential selection in freshwater and marine populations of threespine stickleback (Gasterosteus aculeatus).

PubMed

Ferchaud, Anne-Laure; Pedersen, Susanne H; Bekkevold, Dorte; Jian, Jianbo; Niu, Yongchao; Hansen, Michael M

2014-10-06

The threespine stickleback (Gasterosteus aculeatus) has become an important model species for studying both contemporary and parallel evolution. In particular, differential adaptation to freshwater and marine environments has led to high differentiation between freshwater and marine stickleback populations at the phenotypic trait of lateral plate morphology and the underlying candidate gene Ectodysplacin (EDA). Many studies have focused on this trait and candidate gene, although other genes involved in marine-freshwater adaptation may be equally important. In order to develop a resource for rapid and cost efficient analysis of genetic divergence between freshwater and marine sticklebacks, we generated a low-density SNP (Single Nucleotide Polymorphism) array encompassing markers of chromosome regions under putative directional selection, along with neutral markers for background. RAD (Restriction site Associated DNA) sequencing of sixty individuals representing two freshwater and one marine population led to the identification of 33,993 SNP markers. Ninety-six of these were chosen for the low-density SNP array, among which 70 represented SNPs under putatively directional selection in freshwater vs. marine environments, whereas 26 SNPs were assumed to be neutral. Annotation of these regions revealed several genes that are candidates for affecting stickleback phenotypic variation, some of which have been observed in previous studies whereas others are new. We have developed a cost-efficient low-density SNP array that allows for rapid screening of polymorphisms in threespine stickleback. The array provides a valuable tool for analyzing adaptive divergence between freshwater and marine stickleback populations beyond the well-established candidate gene Ectodysplacin (EDA).
Design and coverage of high throughput genotyping arrays optimized for individuals of East Asian, African American, and Latino race/ethnicity using imputation and a novel hybrid SNP selection algorithm.

PubMed

Hoffmann, Thomas J; Zhan, Yiping; Kvale, Mark N; Hesselson, Stephanie E; Gollub, Jeremy; Iribarren, Carlos; Lu, Yontao; Mei, Gangwu; Purdy, Matthew M; Quesenberry, Charles; Rowell, Sarah; Shapero, Michael H; Smethurst, David; Somkin, Carol P; Van den Eeden, Stephen K; Walter, Larry; Webster, Teresa; Whitmer, Rachel A; Finn, Andrea; Schaefer, Catherine; Kwok, Pui-Yan; Risch, Neil

2011-12-01

Four custom Axiom genotyping arrays were designed for a genome-wide association (GWA) study of 100,000 participants from the Kaiser Permanente Research Program on Genes, Environment and Health. The array optimized for individuals of European race/ethnicity was previously described. Here we detail the development of three additional microarrays optimized for individuals of East Asian, African American, and Latino race/ethnicity. For these arrays, we decreased redundancy of high-performing SNPs to increase SNP capacity. The East Asian array was designed using greedy pairwise SNP selection. However, removing SNPs from the target set based on imputation coverage is more efficient than pairwise tagging. Therefore, we developed a novel hybrid SNP selection method for the African American and Latino arrays utilizing rounds of greedy pairwise SNP selection, followed by removal from the target set of SNPs covered by imputation. The arrays provide excellent genome-wide coverage and are valuable additions for large-scale GWA studies. Copyright © 2011 Elsevier Inc. All rights reserved.
Prenatal Diagnosis of DNA Copy Number Variations by Genomic Single-Nucleotide Polymorphism Array in Fetuses with Congenital Heart Defects.

PubMed

Tang, Shaohua; Lv, Jiaojiao; Chen, Xiangnan; Bai, Lili; Li, Huanzheng; Chen, Chong; Wang, Ping; Xu, Xueqin; Lu, Jianxin

2016-01-01

To evaluate the usefulness of single-nucleotide polymorphism (SNP) array for prenatal genetic diagnosis of congenital heart defect (CHD), we used this approach to detect clinically significant copy number variants (CNVs) in fetuses with CHDs. A HumanCytoSNP-12 array was used to detect genomic samples obtained from 39 fetuses that exhibited cardiovascular abnormalities on ultrasound and had a normal karyotype. The relationship between CNVs and CHDs was identified by using genotype-phenotype comparisons and searching of chromosomal databases. All clinically significant CNVs were confirmed by real-time PCR. CNVs were detected in 38/39 (97.4%) fetuses: variants of unknown significance were detected in 2/39 (5.1%), and clinically significant CNVs were identified in 7/39 (17.9%). In 3 of the 7 fetuses with clinically significant CNVs, 3 rare and previously undescribed CNVs were detected, and these CNVs encompassed the CHD candidate genes FLNA (Xq28 dup), BCOR (Xp11.4 dup), and RBL2 (16q12.2 del). Compared with conventional cytogenetic genomics, SNP array analysis provides significantly improved detection of submicroscopic genomic aberrations in pregnancies with CHDs. Based on these results, we propose that genomic SNP array is an effective method which could be used in the prenatal diagnostic test to assist genetic counseling for pregnancies with CHDs. © 2015 S. Karger AG, Basel.
Genetic loci associated with delayed clearance of Plasmodium falciparum following artemisinin treatment in Southeast Asia

DTIC Science & Technology

2013-01-02

intensity data from the SNP array were normalized using the Affymetrix GeneChip Targeted Genotyping Analysis Software ( GTGS ). To assess robustness of SNP...calls, genotypes were called using three algorithms: (i) GTGS , (ii) illuminus (27), and (iii) a heuristic algorithm based on discrete cutoffs of
SNP-array reveals genome-wide patterns of geographical and potential adaptive divergence across the natural range of Atlantic salmon (Salmo salar).

PubMed

Bourret, Vincent; Kent, Matthew P; Primmer, Craig R; Vasemägi, Anti; Karlsson, Sten; Hindar, Kjetil; McGinnity, Philip; Verspoor, Eric; Bernatchez, Louis; Lien, Sigbjørn

2013-02-01

Atlantic salmon (Salmo salar) is one of the most extensively studied fish species in the world due to its significance in aquaculture, fisheries and ongoing conservation efforts to protect declining populations. Yet, limited genomic resources have hampered our understanding of genetic architecture in the species and the genetic basis of adaptation to the wide range of natural and artificial environments it occupies. In this study, we describe the development of a medium-density Atlantic salmon single nucleotide polymorphism (SNP) array based on expressed sequence tags (ESTs) and genomic sequencing. The array was used in the most extensive assessment of population genetic structure performed to date in this species. A total of 6176 informative SNPs were successfully genotyped in 38 anadromous and freshwater wild populations distributed across the species natural range. Principal component analysis clearly differentiated European and North American populations, and within Europe, three major regional genetic groups were identified for the first time in a single analysis. We assessed the potential for the array to disentangle neutral and putative adaptive divergence of SNP allele frequencies across populations and among regional groups. In Europe, secondary contact zones were identified between major clusters where endogenous and exogenous barriers could be associated, rendering the interpretation of environmental influence on potentially adaptive divergence equivocal. A small number of markers highly divergent in allele frequencies (outliers) were observed between (multiple) freshwater and anadromous populations, between northern and southern latitudes, and when comparing Baltic populations to all others. We also discuss the potential future applications of the SNP array for conservation, management and aquaculture. © 2012 Blackwell Publishing Ltd.
Genome-Wide Mapping of Copy Number Variation in Humans: Comparative Analysis of High Resolution Array Platforms

PubMed Central

Haraksingh, Rajini R.; Abyzov, Alexej; Gerstein, Mark; Urban, Alexander E.; Snyder, Michael

2011-01-01

Accurate and efficient genome-wide detection of copy number variants (CNVs) is essential for understanding human genomic variation, genome-wide CNV association type studies, cytogenetics research and diagnostics, and independent validation of CNVs identified from sequencing based technologies. Numerous, array-based platforms for CNV detection exist utilizing array Comparative Genome Hybridization (aCGH), Single Nucleotide Polymorphism (SNP) genotyping or both. We have quantitatively assessed the abilities of twelve leading genome-wide CNV detection platforms to accurately detect Gold Standard sets of CNVs in the genome of HapMap CEU sample NA12878, and found significant differences in performance. The technologies analyzed were the NimbleGen 4.2 M, 2.1 M and 3×720 K Whole Genome and CNV focused arrays, the Agilent 1×1 M CGH and High Resolution and 2×400 K CNV and SNP+CGH arrays, the Illumina Human Omni1Quad array and the Affymetrix SNP 6.0 array. The Gold Standards used were a 1000 Genomes Project sequencing-based set of 3997 validated CNVs and an ultra high-resolution aCGH-based set of 756 validated CNVs. We found that sensitivity, total number, size range and breakpoint resolution of CNV calls were highest for CNV focused arrays. Our results are important for cost effective CNV detection and validation for both basic and clinical applications. PMID:22140474
Diversity analysis of cotton (Gossypium hirsutum L.) germplasm using the CottonSNP63K Array.

PubMed

Hinze, Lori L; Hulse-Kemp, Amanda M; Wilson, Iain W; Zhu, Qian-Hao; Llewellyn, Danny J; Taylor, Jen M; Spriggs, Andrew; Fang, David D; Ulloa, Mauricio; Burke, John J; Giband, Marc; Lacape, Jean-Marc; Van Deynze, Allen; Udall, Joshua A; Scheffler, Jodi A; Hague, Steve; Wendel, Jonathan F; Pepper, Alan E; Frelichowski, James; Lawley, Cindy T; Jones, Don C; Percy, Richard G; Stelly, David M

2017-02-03

Cotton germplasm resources contain beneficial alleles that can be exploited to develop germplasm adapted to emerging environmental and climate conditions. Accessions and lines have traditionally been characterized based on phenotypes, but phenotypic profiles are limited by the cost, time, and space required to make visual observations and measurements. With advances in molecular genetic methods, genotypic profiles are increasingly able to identify differences among accessions due to the larger number of genetic markers that can be measured. A combination of both methods would greatly enhance our ability to characterize germplasm resources. Recent efforts have culminated in the identification of sufficient SNP markers to establish high-throughput genotyping systems, such as the CottonSNP63K array, which enables a researcher to efficiently analyze large numbers of SNP markers and obtain highly repeatable results. In the current investigation, we have utilized the SNP array for analyzing genetic diversity primarily among cotton cultivars, making comparisons to SSR-based phylogenetic analyses, and identifying loci associated with seed nutritional traits. The SNP markers distinctly separated G. hirsutum from other Gossypium species and distinguished the wild from cultivated types of G. hirsutum. The markers also efficiently discerned differences among cultivars, which was the primary goal when designing the CottonSNP63K array. Population structure within the genus compared favorably with previous results obtained using SSR markers, and an association study identified loci linked to factors that affect cottonseed protein content. Our results provide a large genome-wide variation data set for primarily cultivated cotton. Thousands of SNPs in representative cotton genotypes provide an opportunity to finely discriminate among cultivated cotton from around the world. The SNPs will be relevant as dense markers of genome variation for association mapping approaches aimed at correlating molecular polymorphisms with variation in phenotypic traits, as well as for molecular breeding approaches in cotton.
Single nucleotide polymorphism array karyotyping: a diagnostic and prognostic tool in myelodysplastic syndromes with unsuccessful conventional cytogenetic testing.

PubMed

Arenillas, Leonor; Mallo, Mar; Ramos, Fernando; Guinta, Kathryn; Barragán, Eva; Lumbreras, Eva; Larráyoz, María-José; De Paz, Raquel; Tormo, Mar; Abáigar, María; Pedro, Carme; Cervera, José; Such, Esperanza; José Calasanz, María; Díez-Campelo, María; Sanz, Guillermo F; Hernández, Jesús María; Luño, Elisa; Saumell, Sílvia; Maciejewski, Jaroslaw; Florensa, Lourdes; Solé, Francesc

2013-12-01

Cytogenetic aberrations identified by metaphase cytogenetics (MC) have diagnostic, prognostic, and therapeutic implications in myelodysplastic syndromes (MDS). However, in some MDS patients MC study is unsuccesful. Single nucleotide polymorphism array (SNP-A) based karyotyping could be helpful in these cases. We performed SNP-A in 62 samples from bone marrow or peripheral blood of primary MDS with an unsuccessful MC study. SNP-A analysis enabled the detection of aberrations in 31 (50%) patients. We used the copy number alteration information to apply the International Prognostic Scoring System (IPSS) and we observed differences in survival between the low/intermediate-1 and intermediate-2/high risk patients. We also saw differences in survival between very low/low/intermediate and the high/very high patients when we applied the revised IPSS (IPSS-R). In conclusion, SNP-A can be used successfully in PB samples and the identification of CNA by SNP-A improve the diagnostic and prognostic evaluation of this group of MDS patients. Copyright © 2013 Wiley Periodicals, Inc.
High-density single nucleotide polymorphism (SNP) array mapping in Brassica oleracea: identification of QTL associated with carotenoid variation in broccoli florets.

PubMed

Brown, Allan F; Yousef, Gad G; Chebrolu, Kranthi K; Byrd, Robert W; Everhart, Koyt W; Thomas, Aswathy; Reid, Robert W; Parkin, Isobel A P; Sharpe, Andrew G; Oliver, Rebekah; Guzman, Ivette; Jackson, Eric W

2014-09-01

A high-resolution genetic linkage map of B. oleracea was developed from a B. napus SNP array. The work will facilitate genetic and evolutionary studies in Brassicaceae. A broccoli population, VI-158 × BNC, consisting of 150 F2:3 families was used to create a saturated Brassica oleracea (diploid: CC) linkage map using a recently developed rapeseed (Brassica napus) (tetraploid: AACC) Illumina Infinium single nucleotide polymorphism (SNP) array. The map consisted of 547 non-redundant SNP markers spanning 948.1 cM across nine chromosomes with an average interval size of 1.7 cM. As the SNPs are anchored to the genomic reference sequence of the rapid cycling B. oleracea TO1000, we were able to estimate that the map provides 96 % coverage of the diploid genome. Carotenoid analysis of 2 years data identified 3 QTLs on two chromosomes that are associated with up to half of the phenotypic variation associated with the accumulation of total or individual compounds. By searching the genome sequences of the two related diploid species (B. oleracea and B. rapa), we further identified putative carotenoid candidate genes in the region of these QTLs. This is the first description of the use of a B. napus SNP array to rapidly construct high-density genetic linkage maps of one of the constituent diploid species. The unambiguous nature of these markers with regard to genomic sequences provides evidence to the nature of genes underlying the QTL, and demonstrates the value and impact this resource will have on Brassica research.
[Phenotype-genotype correlation analysis of 12 cases with Angelman/Prader-Willi syndrome].

PubMed

Chen, Chen; Peng, Ying; Xia, Yan; Li, Haoxian; Zhu, Huimin; Pan, Qian; Yin, Fei; Wu, Lingqian

2014-12-01

To investigate the genotype-phenotype correlation in patients with Angelman syndrome/Prader-Willi syndrome (AS/PWS) and assess the application value of high-resolution single nucleotide polymorphism microarrays (SNP array) for such diseases. Twelve AS/PWS patients were diagnosed through SNP array, fluorescence in situ hybridization (FISH) and karyotype analysis. Clinical characteristics were analyzed. Deletions ranging from 4.8 Mb to 7.0 Mb on chromosome 15q11.2-13 were detected in 11 patients. Uniparental disomy (UPD) was detected in only 1 patient. Patients with deletions could be divided into 2 groups, including 7 cases with class I and 4 with class II. The two groups however had no significant phenotypic difference. The UPD patient had relatively better development and language ability. Deletions of 6 patients were confirmed by FISH to be of de novo in origin. The risk to their sibs was determined to be less than 1%. The phenotypic differences between AS/PWS patients with class I and class II deletion need to be further studied. SNP array is useful in detecting and distinguishing of patients with deletion or UPD. This method may be applied for studying the genotype-phenotype association and the mechanism underlying AS/PWS.
Conclusive evidence for hexasomic inheritance in chrysanthemum based on analysis of a 183 k SNP array.

PubMed

van Geest, Geert; Voorrips, Roeland E; Esselink, Danny; Post, Aike; Visser, Richard Gf; Arens, Paul

2017-08-07

Cultivated chrysanthemum is an outcrossing hexaploid (2n = 6× = 54) with a disputed mode of inheritance. In this paper, we present a single nucleotide polymorphism (SNP) selection pipeline that was used to design an Affymetrix Axiom array with 183 k SNPs from RNA sequencing data (1). With this array, we genotyped four bi-parental populations (with sizes of 405, 53, 76 and 37 offspring plants respectively), and a cultivar panel of 63 genotypes. Further, we present a method for dosage scoring in hexaploids from signal intensities of the array based on mixture models (2) and validation of selection steps in the SNP selection pipeline (3). The resulting genotypic data is used to draw conclusions on the mode of inheritance in chrysanthemum (4), and to make an inference on allelic expression bias (5). With use of the mixture model approach, we successfully called the dosage of 73,936 out of 183,130 SNPs (40.4%) that segregated in any of the bi-parental populations. To investigate the mode of inheritance, we analysed markers that segregated in the large bi-parental population (n = 405). Analysis of segregation of duplex x nulliplex SNPs resulted in evidence for genome-wide hexasomic inheritance. This evidence was substantiated by the absence of strong linkage between markers in repulsion, which indicated absence of full disomic inheritance. We present the success rate of SNP discovery out of RNA sequencing data as affected by different selection steps, among which SNP coverage over genotypes and use of different types of sequence read mapping software. Genomic dosage highly correlated with relative allele coverage from the RNA sequencing data, indicating that most alleles are expressed according to their genomic dosage. The large population, genotyped with a very large number of markers, is a unique framework for extensive genetic analyses in hexaploid chrysanthemum. As starting point, we show conclusive evidence for genome-wide hexasomic inheritance.
Analysis and visualization of chromosomal abnormalities in SNP data with SNPscan

PubMed Central

Ting, Jason C; Ye, Ying; Thomas, George H; Ruczinski, Ingo; Pevsner, Jonathan

2006-01-01

Background A variety of diseases are caused by chromosomal abnormalities such as aneuploidies (having an abnormal number of chromosomes), microdeletions, microduplications, and uniparental disomy. High density single nucleotide polymorphism (SNP) microarrays provide information on chromosomal copy number changes, as well as genotype (heterozygosity and homozygosity). SNP array studies generate multiple types of data for each SNP site, some with more than 100,000 SNPs represented on each array. The identification of different classes of anomalies within SNP data has been challenging. Results We have developed SNPscan, a web-accessible tool to analyze and visualize high density SNP data. It enables researchers (1) to visually and quantitatively assess the quality of user-generated SNP data relative to a benchmark data set derived from a control population, (2) to display SNP intensity and allelic call data in order to detect chromosomal copy number anomalies (duplications and deletions), (3) to display uniparental isodisomy based on loss of heterozygosity (LOH) across genomic regions, (4) to compare paired samples (e.g. tumor and normal), and (5) to generate a file type for viewing SNP data in the University of California, Santa Cruz (UCSC) Human Genome Browser. SNPscan accepts data exported from Affymetrix Copy Number Analysis Tool as its input. We validated SNPscan using data generated from patients with known deletions, duplications, and uniparental disomy. We also inspected previously generated SNP data from 90 apparently normal individuals from the Centre d'Étude du Polymorphisme Humain (CEPH) collection, and identified three cases of uniparental isodisomy, four females having an apparently mosaic X chromosome, two mislabelled SNP data sets, and one microdeletion on chromosome 2 with mosaicism from an apparently normal female. These previously unrecognized abnormalities were all detected using SNPscan. The microdeletion was independently confirmed by fluorescence in situ hybridization, and a region of homozygosity in a UPD case was confirmed by sequencing of genomic DNA. Conclusion SNPscan is useful to identify chromosomal abnormalities based on SNP intensity (such as chromosomal copy number changes) and heterozygosity data (including regions of LOH and some cases of UPD). The program and source code are available at the SNPscan website . PMID:16420694
[Phenotypic and genetic analysis of a patient presented with Tietz/Waardenburg type II a syndrome].

PubMed

Wang, Huanhuan; Tang, Lifang; Zhang, Jingmin; Hu, Qin; Chen, Yingwei; Xiao, Bing

2015-08-01

To determine the genetic cause for a patient featuring decreased pigmentation of the skin and iris, hearing loss and multiple congenital anomalies. Routine chromosomal banding was performed to analyze the karyotype of the patient and his parents. Single nucleotide polymorphism array (SNP array) was employed to identify cryptic chromosome aberrations, and quantitative real-time PCR was used to confirm the results. Karyotype analysis has revealed no obvious anomaly for the patient and his parents. SNP array analysis of the patient has demonstrated a 3.9 Mb deletion encompassing 3p13p14.1, which caused loss of entire MITF gene. The deletion was confirmed by quantitative real-time PCR. Clinical features of the patient have included severe bilateral hearing loss, decreased pigmentation of the skin and iris and multiple congenital anomalies. The patient, carrying a 3p13p14.1 deletion, has features of Tietz syndrome/Waardenburg syndrome type IIa. This case may provide additional data for the study of genotype-phenotype correlation of this disease.

High-Resolution SNP/CGH Microarrays Reveal the Accumulation of Loss of Heterozygosity in Commonly Used Candida albicans Strains

PubMed Central

Abbey, Darren; Hickman, Meleah; Gresham, David; Berman, Judith

2011-01-01

Phenotypic diversity can arise rapidly through loss of heterozygosity (LOH) or by the acquisition of copy number variations (CNV) spanning whole chromosomes or shorter contiguous chromosome segments. In Candida albicans, a heterozygous diploid yeast pathogen with no known meiotic cycle, homozygosis and aneuploidy alter clinical characteristics, including drug resistance. Here, we developed a high-resolution microarray that simultaneously detects ∼39,000 single nucleotide polymorphism (SNP) alleles and ∼20,000 copy number variation loci across the C. albicans genome. An important feature of the array analysis is a computational pipeline that determines SNP allele ratios based upon chromosome copy number. Using the array and analysis tools, we constructed a haplotype map (hapmap) of strain SC5314 to assign SNP alleles to specific homologs, and we used it to follow the acquisition of loss of heterozygosity (LOH) and copy number changes in a series of derived laboratory strains. This high-resolution SNP/CGH microarray and the associated hapmap facilitated the phasing of alleles in lab strains and revealed detrimental genome changes that arose frequently during molecular manipulations of laboratory strains. Furthermore, it provided a useful tool for rapid, high-resolution, and cost-effective characterization of changes in allele diversity as well as changes in chromosome copy number in new C. albicans isolates. PMID:22384363
Detection of clonal evolution in hematopoietic malignancies by combining comparative genomic hybridization and single nucleotide polymorphism arrays.

PubMed

Hartmann, Luise; Stephenson, Christine F; Verkamp, Stephanie R; Johnson, Krystal R; Burnworth, Bettina; Hammock, Kelle; Brodersen, Lisa Eidenschink; de Baca, Monica E; Wells, Denise A; Loken, Michael R; Zehentner, Barbara K

2014-12-01

Array comparative genomic hybridization (aCGH) has become a powerful tool for analyzing hematopoietic neoplasms and identifying genome-wide copy number changes in a single assay. aCGH also has superior resolution compared with fluorescence in situ hybridization (FISH) or conventional cytogenetics. Integration of single nucleotide polymorphism (SNP) probes with microarray analysis allows additional identification of acquired uniparental disomy, a copy neutral aberration with known potential to contribute to tumor pathogenesis. However, a limitation of microarray analysis has been the inability to detect clonal heterogeneity in a sample. This study comprised 16 samples (acute myeloid leukemia, myelodysplastic syndrome, chronic lymphocytic leukemia, plasma cell neoplasm) with complex cytogenetic features and evidence of clonal evolution. We used an integrated manual peak reassignment approach combining analysis of aCGH and SNP microarray data for characterization of subclonal abnormalities. We compared array findings with results obtained from conventional cytogenetic and FISH studies. Clonal heterogeneity was detected in 13 of 16 samples by microarray on the basis of log2 values. Use of the manual peak reassignment analysis approach improved resolution of the sample's clonal composition and genetic heterogeneity in 10 of 13 (77%) patients. Moreover, in 3 patients, clonal disease progression was revealed by array analysis that was not evident by cytogenetic or FISH studies. Genetic abnormalities originating from separate clonal subpopulations can be identified and further characterized by combining aCGH and SNP hybridization results from 1 integrated microarray chip by use of the manual peak reassignment technique. Its clinical utility in comparison to conventional cytogenetic or FISH studies is demonstrated. © 2014 American Association for Clinical Chemistry.
Evaluation of Bovine High-Density SNP Genotyping Array in Indigenous Dairy Cattle Breeds.

PubMed

Dash, S; Singh, A; Bhatia, A K; Jayakumar, S; Sharma, A; Singh, S; Ganguly, I; Dixit, S P

2018-04-03

In total 52 samples of Sahiwal ( 19 ), Tharparkar ( 17 ), and Gir ( 16 ) were genotyped by using BovineHD SNP chip to analyze minor allele frequency (MAF), genetic diversity, and linkage disequilibrium among these cattle. The common SNPs of BovineHD and 54K SNP Chips were also extracted and evaluated for their performance. Only 40%-50% SNPs of these arrays was found informative for genetic analysis in these cattle breeds. The overall mean of MAF for SNPs of BovineHD SNPChip was 0.248 ± 0.006, 0.241 ± 0.007, and 0.242 ± 0.009 in Sahiwal, Tharparkar and Gir, respectively, while that for 54K SNPs was on lower side. The average Reynold's genetic distance between breeds ranged from 0.042 to 0.055 based on BovineHD Beadchip, and from 0.052 to 0.084 based on 54K SNP Chip. The estimates of genetic diversity based on HD and 54K chips were almost same and, hence, low density chip seems to be good enough to decipher genetic diversity of these cattle breeds. The linkage disequilibrium started decaying (r 2 < 0.2) at 140 kb inter-marker distance and, hence, a 20K low density customized SNP array from HD chip could be designed for genomic selection in these cattle else the 54K Bead Chip as such will be useful.
Interim report on updated microarray probes for the LLNL Burkholderia pseudomallei SNP array

DOE Office of Scientific and Technical Information (OSTI.GOV)

Gardner, S; Jaing, C

2012-03-27

The overall goal of this project is to forensically characterize 100 unknown Burkholderia isolates in the US-Australia collaboration. We will identify genome-wide single nucleotide polymorphisms (SNPs) from B. pseudomallei and near neighbor species including B. mallei, B. thailandensis and B. oklahomensis. We will design microarray probes to detect these SNP markers and analyze 100 Burkholderia genomic DNAs extracted from environmental, clinical and near neighbor isolates from Australian collaborators on the Burkholderia SNP microarray. We will analyze the microarray genotyping results to characterize the genetic diversity of these new isolates and triage the samples for whole genome sequencing. In this interimmore » report, we described the SNP analysis and the microarray probe design for the Burkholderia SNP microarray.« less
Comprehensive performance comparison of high-resolution array platforms for genome-wide Copy Number Variation (CNV) analysis in humans.

PubMed

Haraksingh, Rajini R; Abyzov, Alexej; Urban, Alexander Eckehart

2017-04-24

High-resolution microarray technology is routinely used in basic research and clinical practice to efficiently detect copy number variants (CNVs) across the entire human genome. A new generation of arrays combining high probe densities with optimized designs will comprise essential tools for genome analysis in the coming years. We systematically compared the genome-wide CNV detection power of all 17 available array designs from the Affymetrix, Agilent, and Illumina platforms by hybridizing the well-characterized genome of 1000 Genomes Project subject NA12878 to all arrays, and performing data analysis using both manufacturer-recommended and platform-independent software. We benchmarked the resulting CNV call sets from each array using a gold standard set of CNVs for this genome derived from 1000 Genomes Project whole genome sequencing data. The arrays tested comprise both SNP and aCGH platforms with varying designs and contain between ~0.5 to ~4.6 million probes. Across the arrays CNV detection varied widely in number of CNV calls (4-489), CNV size range (~40 bp to ~8 Mbp), and percentage of non-validated CNVs (0-86%). We discovered strikingly strong effects of specific array design principles on performance. For example, some SNP array designs with the largest numbers of probes and extensive exonic coverage produced a considerable number of CNV calls that could not be validated, compared to designs with probe numbers that are sometimes an order of magnitude smaller. This effect was only partially ameliorated using different analysis software and optimizing data analysis parameters. High-resolution microarrays will continue to be used as reliable, cost- and time-efficient tools for CNV analysis. However, different applications tolerate different limitations in CNV detection. Our study quantified how these arrays differ in total number and size range of detected CNVs as well as sensitivity, and determined how each array balances these attributes. This analysis will inform appropriate array selection for future CNV studies, and allow better assessment of the CNV-analytical power of both published and ongoing array-based genomics studies. Furthermore, our findings emphasize the importance of concurrent use of multiple analysis algorithms and independent experimental validation in array-based CNV detection studies.
Evaluation of copy number variation detection for a SNP array platform

PubMed Central

2014-01-01

Background Copy Number Variations (CNVs) are usually inferred from Single Nucleotide Polymorphism (SNP) arrays by use of some software packages based on given algorithms. However, there is no clear understanding of the performance of these software packages; it is therefore difficult to select one or several software packages for CNV detection based on the SNP array platform. We selected four publicly available software packages designed for CNV calling from an Affymetrix SNP array, including Birdsuite, dChip, Genotyping Console (GTC) and PennCNV. The publicly available dataset generated by Array-based Comparative Genomic Hybridization (CGH), with a resolution of 24 million probes per sample, was considered to be the “gold standard”. Compared with the CGH-based dataset, the success rate, average stability rate, sensitivity, consistence and reproducibility of these four software packages were assessed compared with the “gold standard”. Specially, we also compared the efficiency of detecting CNVs simultaneously by two, three and all of the software packages with that by a single software package. Results Simply from the quantity of the detected CNVs, Birdsuite detected the most while GTC detected the least. We found that Birdsuite and dChip had obvious detecting bias. And GTC seemed to be inferior because of the least amount of CNVs it detected. Thereafter we investigated the detection consistency produced by one certain software package and the rest three software suits. We found that the consistency of dChip was the lowest while GTC was the highest. Compared with the CNVs detecting result of CGH, in the matching group, GTC called the most matching CNVs, PennCNV-Affy ranked second. In the non-overlapping group, GTC called the least CNVs. With regards to the reproducibility of CNV calling, larger CNVs were usually replicated better. PennCNV-Affy shows the best consistency while Birdsuite shows the poorest. Conclusion We found that PennCNV outperformed the other three packages in the sensitivity and specificity of CNV calling. Obviously, each calling method had its own limitations and advantages for different data analysis. Therefore, the optimized calling methods might be identified using multiple algorithms to evaluate the concordance and discordance of SNP array-based CNV calling. PMID:24555668
MixHMM: Inferring Copy Number Variation and Allelic Imbalance Using SNP Arrays and Tumor Samples Mixed with Stromal Cells

PubMed Central

Schulz, Vincent; Chen, Min; Tuck, David

2010-01-01

Background Genotyping platforms such as single nucleotide polymorphism (SNP) arrays are powerful tools to study genomic aberrations in cancer samples. Allele specific information from SNP arrays provides valuable information for interpreting copy number variation (CNV) and allelic imbalance including loss-of-heterozygosity (LOH) beyond that obtained from the total DNA signal available from array comparative genomic hybridization (aCGH) platforms. Several algorithms based on hidden Markov models (HMMs) have been designed to detect copy number changes and copy-neutral LOH making use of the allele information on SNP arrays. However heterogeneity in clinical samples, due to stromal contamination and somatic alterations, complicates analysis and interpretation of these data. Methods We have developed MixHMM, a novel hidden Markov model using hidden states based on chromosomal structural aberrations. MixHMM allows CNV detection for copy numbers up to 7 and allows more complete and accurate description of other forms of allelic imbalance, such as increased copy number LOH or imbalanced amplifications. MixHMM also incorporates a novel sample mixing model that allows detection of tumor CNV events in heterogeneous tumor samples, where cancer cells are mixed with a proportion of stromal cells. Conclusions We validate MixHMM and demonstrate its advantages with simulated samples, clinical tumor samples and a dilution series of mixed samples. We have shown that the CNVs of cancer cells in a tumor sample contaminated with up to 80% of stromal cells can be detected accurately using Illumina BeadChip and MixHMM. Availability The MixHMM is available as a Python package provided with some other useful tools at http://genecube.med.yale.edu:8080/MixHMM. PMID:20532221
Two Siblings with Alternate Unbalanced Recombinants Derived from a Large Cryptic Maternal Pericentric Inversion of Chromosome 20

PubMed Central

DeScipio, Cheryl; Morrissette, Jennifer J.D.; Conlin, Laura K.; Clark, Dinah; Kaur, Maninder; Coplan, James; Riethman, Harold; Spinner, Nancy B.; Krantz, Ian D.

2009-01-01

Two brothers, with dissimilar clinical features, were each found to have different abnormalities of chromosome 20 by subtelomere fluorescence in situ hybridization (FISH). The proband had deletion of 20p subtelomere and duplication of 20q subtelomere, while his brother was found to have a duplication of 20p subtelomere and deletion of 20q subtelomere. Parental cytogenetic studies were initially thought to be normal, both by G-banding and by subtelomere FISH analysis. Since chromosome 20 is a metacentric chromosome and an inversion was suspected, we used anchored FISH to assist in identifying a possible inversion. This approach employed concomitant hybridization of a FISH probe to the short (p) arm of chromosome 20 with the 20q subtelomere probe. We identified a cytogenetically non-visible, mosaic pericentric inversion of one of the maternal chromosome 20 homologues, providing a mechanistic explanation for the chromosomal abnormalities present in these brothers. Array comparative genomic hybridization (CGH) with both a custom-made BAC and cosmid-based subtelomere specific array (TEL array) and a commercially-available SNP-based array confirmed and further characterized these rearrangements, identifying this as the largest pericentric inversion of chromosome 20 described to date. TEL array data indicate that the 20p breakpoint is defined by BAC RP11-978M13, ~900 kb from the pter; SNP array data reveal this breakpoint to occur within BAC RP11-978M13. The 20q breakpoint is defined by BAC RP11-93B14, ~1.7 Mb from the qter, by TEL array; SNP array data refine this breakpoint to within a gap between BACs on the TEL array (i.e. between RP11-93B14 and proximal BAC RP11-765G16). PMID:20101690
Two siblings with alternate unbalanced recombinants derived from a large cryptic maternal pericentric inversion of chromosome 20.

PubMed

Descipio, Cheryl; Morrissette, Jennifer D; Conlin, Laura K; Clark, Dinah; Kaur, Maninder; Coplan, James; Riethman, Harold; Spinner, Nancy B; Krantz, Ian D

2010-02-01

Two brothers, with dissimilar clinical features, were each found to have different abnormalities of chromosome 20 by subtelomere fluorescence in situ hybridization (FISH). The proband had deletion of 20p subtelomere and duplication of 20q subtelomere, while his brother was found to have a duplication of 20p subtelomere and deletion of 20q subtelomere. Parental cytogenetic studies were initially thought to be normal, both by G-banding and by subtelomere FISH analysis. Since chromosome 20 is a metacentric chromosome and an inversion was suspected, we used anchored FISH to assist in identifying a possible inversion. This approach employed concomitant hybridization of a FISH probe to the short (p) arm of chromosome 20 with the 20q subtelomere probe. We identified a cytogenetically non-visible, mosaic pericentric inversion of one of the maternal chromosome 20 homologs, providing a mechanistic explanation for the chromosomal abnormalities present in these brothers. Array comparative genomic hybridization (CGH) with both a custom-made BAC and cosmid-based subtelomere specific array (TEL array) and a commercially available SNP-based array confirmed and further characterized these rearrangements, identifying this as the largest pericentric inversion of chromosome 20 described to date. TEL array data indicate that the 20p breakpoint is defined by BAC RP11-978M13, approximately 900 kb from the pter; SNP array data reveal this breakpoint to occur within BAC RP11-978M13. The 20q breakpoint is defined by BAC RP11-93B14, approximately 1.7 Mb from the qter, by TEL array; SNP array data refine this breakpoint to within a gap between BACs on the TEL array (i.e., between RP11-93B14 and proximal BAC RP11-765G16). Copyright 2010 Wiley-Liss, Inc.
Elucidation of the ‘Honeycrisp’ pedigree through haplotype analysis with a multi-family integrated SNP linkage map and a large apple (Malus×domestica) pedigree-connected SNP data set

PubMed Central

Howard, Nicholas P; van de Weg, Eric; Bedford, David S; Peace, Cameron P; Vanderzande, Stijn; Clark, Matthew D; Teh, Soon Li; Cai, Lichun; Luby, James J

2017-01-01

The apple (Malus×domestica) cultivar Honeycrisp has become important economically and as a breeding parent. An earlier study with SSR markers indicated the original recorded pedigree of ‘Honeycrisp’ was incorrect and ‘Keepsake’ was identified as one putative parent, the other being unknown. The objective of this study was to verify ‘Keepsake’ as a parent and identify and genetically describe the unknown parent and its grandparents. A multi-family based dense and high-quality integrated SNP map was created using the apple 8 K Illumina Infinium SNP array. This map was used alongside a large pedigree-connected data set from the RosBREED project to build extended SNP haplotypes and to identify pedigree relationships. ‘Keepsake’ was verified as one parent of ‘Honeycrisp’ and ‘Duchess of Oldenburg’ and ‘Golden Delicious’ were identified as grandparents through the unknown parent. Following this finding, siblings of ‘Honeycrisp’ were identified using the SNP data. Breeding records from several of these siblings suggested that the previously unreported parent is a University of Minnesota selection, MN1627. This selection is no longer available, but now is genetically described through imputed SNP haplotypes. We also present the mosaic grandparental composition of ‘Honeycrisp’ for each of its 17 chromosome pairs. This new pedigree and genetic information will be useful in future pedigree-based genetic studies to connect ‘Honeycrisp’ with other cultivars used widely in apple breeding programs. The created SNP linkage map will benefit future research using the data from the Illumina apple 8 and 20 K and Affymetrix 480 K SNP arrays. PMID:28243452
Identification of Pyrus single nucleotide polymorphisms (SNPs) and evaluation for genetic mapping in European pear and interspecific Pyrus hybrids.

PubMed

Montanari, Sara; Saeed, Munazza; Knäbel, Mareike; Kim, YoonKyeong; Troggio, Michela; Malnoy, Mickael; Velasco, Riccardo; Fontana, Paolo; Won, KyungHo; Durel, Charles-Eric; Perchepied, Laure; Schaffer, Robert; Wiedow, Claudia; Bus, Vincent; Brewer, Lester; Gardiner, Susan E; Crowhurst, Ross N; Chagné, David

2013-01-01

We have used new generation sequencing (NGS) technologies to identify single nucleotide polymorphism (SNP) markers from three European pear (Pyrus communis L.) cultivars and subsequently developed a subset of 1096 pear SNPs into high throughput markers by combining them with the set of 7692 apple SNPs on the IRSC apple Infinium® II 8K array. We then evaluated this apple and pear Infinium® II 9K SNP array for large-scale genotyping in pear across several species, using both pear and apple SNPs. The segregating populations employed for array validation included a segregating population of European pear ('Old Home'×'Louise Bon Jersey') and four interspecific breeding families derived from Asian (P. pyrifolia Nakai and P. bretschneideri Rehd.) and European pear pedigrees. In total, we mapped 857 polymorphic pear markers to construct the first SNP-based genetic maps for pear, comprising 78% of the total pear SNPs included in the array. In addition, 1031 SNP markers derived from apple (13% of the total apple SNPs included in the array) were polymorphic and were mapped in one or more of the pear populations. These results are the first to demonstrate SNP transferability across the genera Malus and Pyrus. Our construction of high density SNP-based and gene-based genetic maps in pear represents an important step towards the identification of chromosomal regions associated with a range of horticultural characters, such as pest and disease resistance, orchard yield and fruit quality.
[Application of single nucleotide polymorphism-microarray and target gene sequencing in the study of genetic etiology of children with unexplained intellectual disability or developmental delay].

PubMed

Gao, Z J; Jiang, Q; Cheng, D Z; Yan, X X; Chen, Q; Xu, K M

2016-10-02

Objective: To evaluate the application of single nucleotide polymorphism (SNP)-microarray and target gene sequencing technology in the clinical molecular genetic diagnosis of unexplained intellectual disability(ID) or developmental delay (DD). Method: Patients with ID or DD were recruited in the Department of Neurology, Affiliated Children's Hospital of Capital Institute of Pediatrics between September 2015 and February 2016. The intellectual assessment of the patients was performed using 0-6-year-old pediatric examination table of neuropsychological development or Wechsler intelligence scale (>6 years). Patients with a DQ less than 49 or IQ less than 51 were included in this study. The patients were scanned by SNP-array for detection of genomic copy number variations (CNV), and the revealed genomic imbalance was confirmed by quantitative real time-PCR. Candidate gene mutation screening was carried out by target gene sequencing technology.Causal mutations or likely pathogenic variants were verified by polymerase chain reaction and direct sequencing. Result: There were 15 children with ID or DD enrolled, 9 males and 6 females. The age of these patients was 7 months-16 years and 9 months. SNP-array revealed that two of the 15 patients had genomic CNV. Both CNV were de novo micro deletions, one involved 11q24.1q25 and the other micro deletion located on 21q22.2q22.3. Both micro deletions were proved to have a clinical significance due to their association with ID, brain DD, unusual faces etc. by querying Decipher database. Thirteen patients with negative findings in SNP-array were consequently examined with target gene sequencing technology, genotype-phenotype correlation analysis and genetic analysis. Five patients were diagnosed with monogenic disorder, two were diagnosed with suspected genetic disorder and six were still negative. Conclusion: Sequential use of SNP-array and target gene sequencing technology can significantly increase the molecular genetic etiologic diagnosis rate of the patients with unexplained ID or DD. Combined use of these technologies can serve as a useful examinational method in assisting differential diagnosis of children with unexplained ID or DD.
Making a chocolate chip: development and evaluation of a 6K SNP array for Theobroma cacao

PubMed Central

Livingstone, Donald; Royaert, Stefan; Stack, Conrad; Mockaitis, Keithanne; May, Greg; Farmer, Andrew; Saski, Christopher; Schnell, Ray; Kuhn, David; Motamayor, Juan Carlos

2015-01-01

Theobroma cacao, the key ingredient in chocolate production, is one of the world's most important tree fruit crops, with ∼4,000,000 metric tons produced across 50 countries. To move towards gene discovery and marker-assisted breeding in cacao, a single-nucleotide polymorphism (SNP) identification project was undertaken using RNAseq data from 16 diverse cacao cultivars. RNA sequences were aligned to the assembled transcriptome of the cultivar Matina 1-6, and 330,000 SNPs within coding regions were identified. From these SNPs, a subset of 6,000 high-quality SNPs were selected for inclusion on an Illumina Infinium SNP array: the Cacao6kSNP array. Using Cacao6KSNP array data from over 1,000 cacao samples, we demonstrate that our custom array produces a saturated genetic map and can be used to distinguish among even closely related genotypes. Our study enhances and expands the genetic resources available to the cacao research community, and provides the genome-scale set of tools that are critical for advancing breeding with molecular markers in an agricultural species with high genetic diversity. PMID:26070980
Development of a dense SNP-based linkage map of an apple rootstock progeny using the Malus Infinium whole genome genotyping array.

PubMed

Antanaviciute, Laima; Fernández-Fernández, Felicidad; Jansen, Johannes; Banchi, Elisa; Evans, Katherine M; Viola, Roberto; Velasco, Riccardo; Dunwell, Jim M; Troggio, Michela; Sargent, Daniel J

2012-05-25

A whole-genome genotyping array has previously been developed for Malus using SNP data from 28 Malus genotypes. This array offers the prospect of high throughput genotyping and linkage map development for any given Malus progeny. To test the applicability of the array for mapping in diverse Malus genotypes, we applied the array to the construction of a SNP-based linkage map of an apple rootstock progeny. Of the 7,867 Malus SNP markers on the array, 1,823 (23.2%) were heterozygous in one of the two parents of the progeny, 1,007 (12.8%) were heterozygous in both parental genotypes, whilst just 2.8% of the 921 Pyrus SNPs were heterozygous. A linkage map spanning 1,282.2 cM was produced comprising 2,272 SNP markers, 306 SSR markers and the S-locus. The length of the M432 linkage map was increased by 52.7 cM with the addition of the SNP markers, whilst marker density increased from 3.8 cM/marker to 0.5 cM/marker. Just three regions in excess of 10 cM remain where no markers were mapped. We compared the positions of the mapped SNP markers on the M432 map with their predicted positions on the 'Golden Delicious' genome sequence. A total of 311 markers (13.7% of all mapped markers) mapped to positions that conflicted with their predicted positions on the 'Golden Delicious' pseudo-chromosomes, indicating the presence of paralogous genomic regions or mis-assignments of genome sequence contigs during the assembly and anchoring of the genome sequence. We incorporated data for the 2,272 SNP markers onto the map of the M432 progeny and have presented the most complete and saturated map of the full 17 linkage groups of M. pumila to date. The data were generated rapidly in a high-throughput semi-automated pipeline, permitting significant savings in time and cost over linkage map construction using microsatellites. The application of the array will permit linkage maps to be developed for QTL analyses in a cost-effective manner, and the identification of SNPs that have been assigned erroneous positions on the 'Golden Delicious' reference sequence will assist in the continued improvement of the genome sequence assembly for that variety.
Clinical impact of gene mutations and lesions detected by SNP-array karyotyping in acute myeloid leukemia patients in the context of gemtuzumab ozogamicin treatment: Results of the ALFA-0701 trial

PubMed Central

Chevret, Sylvie; Nibourel, Olivier; Cheok, Meyling; Pautas, Cécile; Duléry, Rémy; Boyer, Thomas; Cayuela, Jean-Michel; Hayette, Sandrine; Raffoux, Emmanuel; Farhat, Hassan; Boissel, Nicolas; Terre, Christine

2014-01-01

We recently showed that the addition of fractionated doses of gemtuzumab ozogamicin (GO) to standard chemotherapy improves clinical outcome of acute myeloid leukemia (AML) patients. In the present study, we performed mutational analysis of 11 genes (FLT3, NPM1, CEBPA, MLL, WT1, IDH1/2, RUNX1, ASXL1, TET2, DNMT3A), EVI1 overexpression screening, and 6.0 single-nucleotide polymorphism array (SNP-A) analysis in diagnostic samples of the 278 AML patients enrolled in the ALFA-0701 trial. In cytogenetically normal (CN) AML (n = 146), 38% of the patients had at least 1 SNP-A lesion and 89% of the patients had at least 1 molecular alteration. In multivariate analysis, the independent predictors of higher cumulative incidence of relapse were unfavorable karyotype (P = 0.013) and randomization in the control arm (P = 0.007) in the whole cohort, and MLL partial tandem duplications (P = 0.014) and DNMT3A mutations (P = 0.010) in CN-AML. The independent predictors of shorter overall survival (OS) were unfavorable karyotype (P < 0.001) and SNP-A lesion(s) (P = 0.001) in the whole cohort, and SNP-A lesion(s) (P = 0.006), DNMT3A mutations (P = 0.042) and randomization in the control arm (P = 0.043) in CN-AML. Interestingly, CN-AML patients benefited preferentially more from GO treatment as compared to AML patients with abnormal cytogenetics (hazard ratio for death, 0.52 versus 1.14; test for interaction, P = 0.04). Although the interaction test was not statistically significant, the OS benefit associated with GO treatment appeared also more pronounced in FLT3 internal tandem duplication positive than in negative patients. PMID:24659740
Clinical impact of gene mutations and lesions detected by SNP-array karyotyping in acute myeloid leukemia patients in the context of gemtuzumab ozogamicin treatment: results of the ALFA-0701 trial.

PubMed

Renneville, Aline; Abdelali, Raouf Ben; Chevret, Sylvie; Nibourel, Olivier; Cheok, Meyling; Pautas, Cécile; Duléry, Rémy; Boyer, Thomas; Cayuela, Jean-Michel; Hayette, Sandrine; Raffoux, Emmanuel; Farhat, Hassan; Boissel, Nicolas; Terre, Christine; Dombret, Hervé; Castaigne, Sylvie; Preudhomme, Claude

2014-02-28

We recently showed that the addition of fractionated doses of gemtuzumab ozogamicin (GO) to standard chemotherapy improves clinical outcome of acute myeloid leukemia (AML) patients. In the present study, we performed mutational analysis of 11 genes (FLT3, NPM1, CEBPA, MLL, WT1, IDH1/2, RUNX1, ASXL1, TET2, DNMT3A), EVI1 overexpression screening, and 6.0 single-nucleotide polymorphism array (SNP-A) analysis in diagnostic samples of the 278 AML patients enrolled in the ALFA-0701 trial. In cytogenetically normal (CN) AML (n=146), 38% of the patients had at least 1 SNP-A lesion and 89% of the patients had at least 1 molecular alteration. In multivariate analysis, the independent predictors of higher cumulative incidence of relapse were unfavorable karyotype (P = 0.013) and randomization in the control arm (P = 0.007) in the whole cohort, and MLL partial tandem duplications (P = 0.014) and DNMT3A mutations (P = 0.010) in CN-AML. The independent predictors of shorter overall survival (OS) were unfavorable karyotype (P <0.001) and SNP-A lesion(s) (P = 0.001) in the whole cohort, and SNP-A lesion(s) (P = 0.006), DNMT3A mutations (P = 0.042) and randomization in the control arm (P = 0.043) in CN-AML. Interestingly, CN-AML patients benefited preferentially more from GO treatment as compared to AML patients with abnormal cytogenetics (hazard ratio for death, 0.52 versus 1.14; test for interaction, P = 0.04). Although the interaction test was not statistically significant, the OS benefit associated with GO treatment appeared also more pronounced in FLT3 internal tandem duplication positive than in negative patients.
Optimal design of low-density SNP arrays for genomic prediction: algorithm and applications

USDA-ARS?s Scientific Manuscript database

Low-density (LD) single nucleotide polymorphism (SNP) arrays provide a cost-effective solution for genomic prediction and selection, but algorithms and computational tools are needed for their optimal design. A multiple-objective, local optimization (MOLO) algorithm was developed for design of optim...
Microdeletions are a general feature of adult and adolescent acute lymphoblastic leukemia: Unexpected similarities with pediatric disease

PubMed Central

Paulsson, Kajsa; Cazier, Jean-Baptiste; MacDougall, Finlay; Stevens, Jane; Stasevich, Irina; Vrcelj, Nikoletta; Chaplin, Tracy; Lillington, Debra M.; Lister, T. Andrew; Young, Bryan D.

2008-01-01

We present here a genome-wide map of abnormalities found in diagnostic samples from 45 adults and adolescents with acute lymphoblastic leukemia (ALL). A 500K SNP array analysis uncovered frequent genetic abnormalities, with cryptic deletions constituting half of the detected changes, implying that microdeletions are a characteristic feature of this malignancy. Importantly, the pattern of deletions resembled that recently reported in pediatric ALL, suggesting that adult, adolescent, and childhood cases may be more similar on the genetic level than previously thought. Thus, 70% of the cases displayed deletion of one or more of the CDKN2A, PAX5, IKZF1, ETV6, RB1, and EBF1 genes. Furthermore, several genes not previously implicated in the pathogenesis of ALL were identified as possible recurrent targets of deletion. In total, the SNP array analysis identified 367 genetic abnormalities not corresponding to known copy number polymorphisms, with all but two cases (96%) displaying at least one cryptic change. The resolution level of this SNP array study is the highest used to date to investigate a malignant hematologic disorder. Our findings provide insights into the leukemogenic process and may be clinically important in adult and adolescent ALL. Most importantly, we report that microdeletions of key genes appear to be a common, characteristic feature of ALL that is shared among different clinical, morphological, and cytogenetic subgroups. PMID:18458336
Identification of Pyrus Single Nucleotide Polymorphisms (SNPs) and Evaluation for Genetic Mapping in European Pear and Interspecific Pyrus Hybrids

PubMed Central

Troggio, Michela; Malnoy, Mickael; Velasco, Riccardo; Fontana, Paolo; Won, KyungHo; Durel, Charles-Eric; Perchepied, Laure; Schaffer, Robert; Wiedow, Claudia; Bus, Vincent; Brewer, Lester; Gardiner, Susan E.; Crowhurst, Ross N.; Chagné, David

2013-01-01

We have used new generation sequencing (NGS) technologies to identify single nucleotide polymorphism (SNP) markers from three European pear (Pyrus communis L.) cultivars and subsequently developed a subset of 1096 pear SNPs into high throughput markers by combining them with the set of 7692 apple SNPs on the IRSC apple Infinium® II 8K array. We then evaluated this apple and pear Infinium® II 9K SNP array for large-scale genotyping in pear across several species, using both pear and apple SNPs. The segregating populations employed for array validation included a segregating population of European pear (‘Old Home’×‘Louise Bon Jersey’) and four interspecific breeding families derived from Asian (P. pyrifolia Nakai and P. bretschneideri Rehd.) and European pear pedigrees. In total, we mapped 857 polymorphic pear markers to construct the first SNP-based genetic maps for pear, comprising 78% of the total pear SNPs included in the array. In addition, 1031 SNP markers derived from apple (13% of the total apple SNPs included in the array) were polymorphic and were mapped in one or more of the pear populations. These results are the first to demonstrate SNP transferability across the genera Malus and Pyrus. Our construction of high density SNP-based and gene-based genetic maps in pear represents an important step towards the identification of chromosomal regions associated with a range of horticultural characters, such as pest and disease resistance, orchard yield and fruit quality. PMID:24155917
Detection of doublecortin domain-containing 2 (DCDC2), a new candidate tumor suppressor gene of hepatocellular carcinoma, by triple combination array analysis

PubMed Central

2013-01-01

Background To detect genes correlated with hepatocellular carcinoma (HCC), we developed a triple combination array consisting of methylation array, gene expression array and single nucleotide polymorphism (SNP) array analysis. Methods A surgical specimen obtained from a 68-year-old female HCC patient was analyzed by triple combination array, which identified doublecortin domain-containing 2 (DCDC2) as a candidate tumor suppressor gene of HCC. Subsequently, samples from 48 HCC patients were evaluated for their DCDC2 methylation and expression status using methylation specific PCR (MSP) and semi-quantitative reverse transcriptase (RT) PCR, respectively. Then, we investigated the relationship between clinicopathological factors and methylation status of DCDC2. Results DCDC2 was revealed to be hypermethylated (methylation value 0.846, range 0–1.0) in cancer tissue, compared with adjacent normal tissue (0.212) by methylation array in the 68-year-old female patient. Expression array showed decreased expression of DCDC2 in cancerous tissue. SNP array showed that the copy number of chromosome 6p22.1, in which DCDC2 resides, was normal. MSP revealed hypermethylation of the promoter region of DCDC2 in 41 of the tumor samples. DCDC2 expression was significantly decreased in the cases with methylation (P = 0.048). Furthermore, the methylated cases revealed worse prognosis for overall survival than unmethylated cases (P = 0.048). Conclusions The present study indicates that triple combination array is an effective method to detect novel genes related to HCC. We propose that DCDC2 is a tumor suppressor gene of HCC. PMID:24034596

GACT: a Genome build and Allele definition Conversion Tool for SNP imputation and meta-analysis in genetic association studies.

PubMed

Sulovari, Arvis; Li, Dawei

2014-07-19

Genome-wide association studies (GWAS) have successfully identified genes associated with complex human diseases. Although much of the heritability remains unexplained, combining single nucleotide polymorphism (SNP) genotypes from multiple studies for meta-analysis will increase the statistical power to identify new disease-associated variants. Meta-analysis requires same allele definition (nomenclature) and genome build among individual studies. Similarly, imputation, commonly-used prior to meta-analysis, requires the same consistency. However, the genotypes from various GWAS are generated using different genotyping platforms, arrays or SNP-calling approaches, resulting in use of different genome builds and allele definitions. Incorrect assumptions of identical allele definition among combined GWAS lead to a large portion of discarded genotypes or incorrect association findings. There is no published tool that predicts and converts among all major allele definitions. In this study, we have developed a tool, GACT, which stands for Genome build and Allele definition Conversion Tool, that predicts and inter-converts between any of the common SNP allele definitions and between the major genome builds. In addition, we assessed several factors that may affect imputation quality, and our results indicated that inclusion of singletons in the reference had detrimental effects while ambiguous SNPs had no measurable effect. Unexpectedly, exclusion of genotypes with missing rate > 0.001 (40% of study SNPs) showed no significant decrease of imputation quality (even significantly higher when compared to the imputation with singletons in the reference), especially for rare SNPs. GACT is a new, powerful, and user-friendly tool with both command-line and interactive online versions that can accurately predict, and convert between any of the common allele definitions and between genome builds for genome-wide meta-analysis and imputation of genotypes from SNP-arrays or deep-sequencing, particularly for data from the dbGaP and other public databases. http://www.uvm.edu/genomics/software/gact.
Comparative Analysis of CNV Calling Algorithms: Literature Survey and a Case Study Using Bovine High-Density SNP Data.

PubMed

Xu, Lingyang; Hou, Yali; Bickhart, Derek M; Song, Jiuzhou; Liu, George E

2013-06-25

Copy number variations (CNVs) are gains and losses of genomic sequence between two individuals of a species when compared to a reference genome. The data from single nucleotide polymorphism (SNP) microarrays are now routinely used for genotyping, but they also can be utilized for copy number detection. Substantial progress has been made in array design and CNV calling algorithms and at least 10 comparison studies in humans have been published to assess them. In this review, we first survey the literature on existing microarray platforms and CNV calling algorithms. We then examine a number of CNV calling tools to evaluate their impacts using bovine high-density SNP data. Large incongruities in the results from different CNV calling tools highlight the need for standardizing array data collection, quality assessment and experimental validation. Only after careful experimental design and rigorous data filtering can the impacts of CNVs on both normal phenotypic variability and disease susceptibility be fully revealed.
Making a chocolate chip: development and evaluation of a 6K SNP array for Theobroma cacao.

PubMed

Livingstone, Donald; Royaert, Stefan; Stack, Conrad; Mockaitis, Keithanne; May, Greg; Farmer, Andrew; Saski, Christopher; Schnell, Ray; Kuhn, David; Motamayor, Juan Carlos

2015-08-01

Theobroma cacao, the key ingredient in chocolate production, is one of the world's most important tree fruit crops, with ∼4,000,000 metric tons produced across 50 countries. To move towards gene discovery and marker-assisted breeding in cacao, a single-nucleotide polymorphism (SNP) identification project was undertaken using RNAseq data from 16 diverse cacao cultivars. RNA sequences were aligned to the assembled transcriptome of the cultivar Matina 1-6, and 330,000 SNPs within coding regions were identified. From these SNPs, a subset of 6,000 high-quality SNPs were selected for inclusion on an Illumina Infinium SNP array: the Cacao6kSNP array. Using Cacao6KSNP array data from over 1,000 cacao samples, we demonstrate that our custom array produces a saturated genetic map and can be used to distinguish among even closely related genotypes. Our study enhances and expands the genetic resources available to the cacao research community, and provides the genome-scale set of tools that are critical for advancing breeding with molecular markers in an agricultural species with high genetic diversity. © The Author 2015. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
A High Density SNP Array for the Domestic Horse and Extant Perissodactyla: Utility for Association Mapping, Genetic Diversity, and Phylogeny Studies

PubMed Central

McCue, Molly E.; Bannasch, Danika L.; Petersen, Jessica L.; Gurr, Jessica; Bailey, Ernie; Binns, Matthew M.; Distl, Ottmar; Guérin, Gérard; Hasegawa, Telhisa; Hill, Emmeline W.; Leeb, Tosso; Lindgren, Gabriella; Penedo, M. Cecilia T.; Røed, Knut H.; Ryder, Oliver A.; Swinburne, June E.; Tozaki, Teruaki; Valberg, Stephanie J.; Vaudin, Mark; Lindblad-Toh, Kerstin

2012-01-01

An equine SNP genotyping array was developed and evaluated on a panel of samples representing 14 domestic horse breeds and 18 evolutionarily related species. More than 54,000 polymorphic SNPs provided an average inter-SNP spacing of ∼43 kb. The mean minor allele frequency across domestic horse breeds was 0.23, and the number of polymorphic SNPs within breeds ranged from 43,287 to 52,085. Genome-wide linkage disequilibrium (LD) in most breeds declined rapidly over the first 50–100 kb and reached background levels within 1–2 Mb. The extent of LD and the level of inbreeding were highest in the Thoroughbred and lowest in the Mongolian and Quarter Horse. Multidimensional scaling (MDS) analyses demonstrated the tight grouping of individuals within most breeds, close proximity of related breeds, and less tight grouping in admixed breeds. The close relationship between the Przewalski's Horse and the domestic horse was demonstrated by pair-wise genetic distance and MDS. Genotyping of other Perissodactyla (zebras, asses, tapirs, and rhinoceros) was variably successful, with call rates and the number of polymorphic loci varying across taxa. Parsimony analysis placed the modern horse as sister taxa to Equus przewalski. The utility of the SNP array in genome-wide association was confirmed by mapping the known recessive chestnut coat color locus (MC1R) and defining a conserved haplotype of ∼750 kb across all breeds. These results demonstrate the high quality of this SNP genotyping resource, its usefulness in diverse genome analyses of the horse, and potential use in related species. PMID:22253606
Population structure and genome-wide association analysis for frost tolerance in oat using continuous SNP array signal intensity ratios.

PubMed

Tumino, Giorgio; Voorrips, Roeland E; Rizza, Fulvia; Badeck, Franz W; Morcia, Caterina; Ghizzoni, Roberta; Germeier, Christoph U; Paulo, Maria-João; Terzi, Valeria; Smulders, Marinus J M

2016-09-01

Infinium SNP data analysed as continuous intensity ratios enabled associating genotypic and phenotypic data from heterogeneous oat samples, showing that association mapping for frost tolerance is a feasible option. Oat is sensitive to freezing temperatures, which restricts the cultivation of fall-sown or winter oats to regions with milder winters. Fall-sown oats have a longer growth cycle, mature earlier, and have a higher productivity than spring-sown oats, therefore improving frost tolerance is an important goal in oat breeding. Our aim was to test the effectiveness of a Genome-Wide Association Study (GWAS) for mapping QTLs related to frost tolerance, using an approach that tolerates continuously distributed signals from SNPs in bulked samples from heterogeneous accessions. A collection of 138 European oat accessions, including landraces, old and modern varieties from 27 countries was genotyped using the Infinium 6K SNP array. The SNP data were analyzed as continuous intensity ratios, rather than converting them into discrete values by genotype calling. PCA and Ward's clustering of genetic similarities revealed the presence of two main groups of accessions, which roughly corresponded to Continental Europe and Mediterranean/Atlantic Europe, although a total of eight subgroups can be distinguished. The accessions were phenotyped for frost tolerance under controlled conditions by measuring fluorescence quantum yield of photosystem II after a freezing stress. GWAS were performed by a linear mixed model approach, comparing different corrections for population structure. All models detected three robust QTLs, two of which co-mapped with QTLs identified earlier in bi-parental mapping populations. The approach used in the present work shows that SNP array data of heterogeneous hexaploid oat samples can be successfully used to determine genetic similarities and to map associations to quantitative phenotypic traits.
Development of new SNP derived cleaved amplified polymorphic sequence marker set and its successful utilization in the genetic analysis of seed color variation in barley.

PubMed

Bungartz, Annemarie; Klaus, Marius; Mathew, Boby; Léon, Jens; Naz, Ali Ahmad

2016-03-01

The aim of the present study was to develop a new cost effective PCR based CAPS marker set using advantages of high-throughput SNP genotyping. Initially, SNP survey was made using 20 diverse barley genotypes via 9k iSelect array genotyping that resulted in 6334 polymorphic SNP markers. Principle component analysis using this marker data showed fine differentiation of barley diverse gene pool. Till this end, we developed 200 SNP derived CAPS markers distributed across the genome covering around 991cM with an average marker density of 5.09cM. Further, we genotyped 68 CAPS markers in an F2 population (Cheri×ICB181160) segregating for seed color variation in barley. Genetic mapping of seed color revealed putative linkage of single nuclear gene on chromosome 1H. These findings showed the proof of concept for the development and utility of a newer cost effective genomic tool kit to analyze broader genetic resources of barley worldwide. Copyright © 2016 Elsevier Inc. All rights reserved.
Measuring diversity in Gossypium hirsutum using the CottonSNP63K Array

USDA-ARS?s Scientific Manuscript database

A CottonSNP63K array and accompanying cluster file has been developed and includes 45,104 intra-specific SNPs and 17,954 inter-specific SNPs for automated genotyping of cotton (Gossypium spp.) samples. Development of the cluster file included genotyping of 1,156 samples, a subset of which were iden...
Development of a dense SNP-based linkage map of an apple rootstock progeny using the Malus Infinium whole genome genotyping array

PubMed Central

2012-01-01

Background A whole-genome genotyping array has previously been developed for Malus using SNP data from 28 Malus genotypes. This array offers the prospect of high throughput genotyping and linkage map development for any given Malus progeny. To test the applicability of the array for mapping in diverse Malus genotypes, we applied the array to the construction of a SNP-based linkage map of an apple rootstock progeny. Results Of the 7,867 Malus SNP markers on the array, 1,823 (23.2%) were heterozygous in one of the two parents of the progeny, 1,007 (12.8%) were heterozygous in both parental genotypes, whilst just 2.8% of the 921 Pyrus SNPs were heterozygous. A linkage map spanning 1,282.2 cM was produced comprising 2,272 SNP markers, 306 SSR markers and the S-locus. The length of the M432 linkage map was increased by 52.7 cM with the addition of the SNP markers, whilst marker density increased from 3.8 cM/marker to 0.5 cM/marker. Just three regions in excess of 10 cM remain where no markers were mapped. We compared the positions of the mapped SNP markers on the M432 map with their predicted positions on the ‘Golden Delicious’ genome sequence. A total of 311 markers (13.7% of all mapped markers) mapped to positions that conflicted with their predicted positions on the ‘Golden Delicious’ pseudo-chromosomes, indicating the presence of paralogous genomic regions or mis-assignments of genome sequence contigs during the assembly and anchoring of the genome sequence. Conclusions We incorporated data for the 2,272 SNP markers onto the map of the M432 progeny and have presented the most complete and saturated map of the full 17 linkage groups of M. pumila to date. The data were generated rapidly in a high-throughput semi-automated pipeline, permitting significant savings in time and cost over linkage map construction using microsatellites. The application of the array will permit linkage maps to be developed for QTL analyses in a cost-effective manner, and the identification of SNPs that have been assigned erroneous positions on the ‘Golden Delicious’ reference sequence will assist in the continued improvement of the genome sequence assembly for that variety. PMID:22631220
Performance comparison of SNP detection tools with illumina exome sequencing data—an assessment using both family pedigree information and sample-matched SNP array data

PubMed Central

Yi, Ming; Zhao, Yongmei; Jia, Li; He, Mei; Kebebew, Electron; Stephens, Robert M.

2014-01-01

To apply exome-seq-derived variants in the clinical setting, there is an urgent need to identify the best variant caller(s) from a large collection of available options. We have used an Illumina exome-seq dataset as a benchmark, with two validation scenarios—family pedigree information and SNP array data for the same samples, permitting global high-throughput cross-validation, to evaluate the quality of SNP calls derived from several popular variant discovery tools from both the open-source and commercial communities using a set of designated quality metrics. To the best of our knowledge, this is the first large-scale performance comparison of exome-seq variant discovery tools using high-throughput validation with both Mendelian inheritance checking and SNP array data, which allows us to gain insights into the accuracy of SNP calling through such high-throughput validation in an unprecedented way, whereas the previously reported comparison studies have only assessed concordance of these tools without directly assessing the quality of the derived SNPs. More importantly, the main purpose of our study was to establish a reusable procedure that applies high-throughput validation to compare the quality of SNP discovery tools with a focus on exome-seq, which can be used to compare any forthcoming tool(s) of interest. PMID:24831545
Comprehensive high-resolution genomic profiling and cytogenetics of human chondrocyte cultures by GTG-banding, locus-specific FISH, SKY and SNP array.

PubMed

Wallenborn, M; Petters, O; Rudolf, D; Hantmann, H; Richter, M; Ahnert, P; Rohani, L; Smink, J J; Bulwin, G C; Krupp, W; Schulz, R M; Holland, H

2018-04-23

In the development of cell-based medicinal products, it is crucial to guarantee that the application of such an advanced therapy medicinal product (ATMP) is safe for the patients. The consensus of the European regulatory authorities is: "In conclusion, on the basis of the state of art, conventional karyotyping can be considered a valuable and useful technique to analyse chromosomal stability during preclinical studies". 408 chondrocyte samples (84 monolayers and 324 spheroids) from six patients were analysed using trypsin-Giemsa staining, spectral karyotyping and fluorescence in situ hybridisation, to evaluate the genetic stability of chondrocyte samples from non-clinical studies. Single nucleotide polymorphism (SNP) array analysis was performed on chondrocyte spheroids from five of the six donors. Applying this combination of techniques, the genetic analyses performed revealed no significant genetic instability until passage 3 in monolayer cells and interphase cells from spheroid cultures at different time points. Clonal occurrence of polyploid metaphases and endoreduplications were identified associated with prolonged cultivation time. Also, gonosomal losses were observed in chondrocyte spheroids, with increasing passage and duration of the differentiation phase. Interestingly, in one of the donors, chromosomal aberrations that are also described in extraskeletal myxoid chondrosarcoma were identified. The SNP array analysis exhibited chromosomal aberrations in two donors and copy neutral losses of heterozygosity regions in four donors. This study showed the necessity of combined genetic analyses at defined cultivation time points in quality studies within the field of cell therapy.
A 34K SNP genotyping array for Populus trichocarpa: design, application to the study of natural populations and transferability to other Populus species.

PubMed

Geraldes, A; Difazio, S P; Slavov, G T; Ranjan, P; Muchero, W; Hannemann, J; Gunter, L E; Wymore, A M; Grassa, C J; Farzaneh, N; Porth, I; McKown, A D; Skyba, O; Li, E; Fujita, M; Klápště, J; Martin, J; Schackwitz, W; Pennacchio, C; Rokhsar, D; Friedmann, M C; Wasteneys, G O; Guy, R D; El-Kassaby, Y A; Mansfield, S D; Cronk, Q C B; Ehlting, J; Douglas, C J; Tuskan, G A

2013-03-01

Genetic mapping of quantitative traits requires genotypic data for large numbers of markers in many individuals. For such studies, the use of large single nucleotide polymorphism (SNP) genotyping arrays still offers the most cost-effective solution. Herein we report on the design and performance of a SNP genotyping array for Populus trichocarpa (black cottonwood). This genotyping array was designed with SNPs pre-ascertained in 34 wild accessions covering most of the species latitudinal range. We adopted a candidate gene approach to the array design that resulted in the selection of 34 131 SNPs, the majority of which are located in, or within 2 kb of, 3543 candidate genes. A subset of the SNPs on the array (539) was selected based on patterns of variation among the SNP discovery accessions. We show that more than 95% of the loci produce high quality genotypes and that the genotyping error rate for these is likely below 2%. We demonstrate that even among small numbers of samples (n = 10) from local populations over 84% of loci are polymorphic. We also tested the applicability of the array to other species in the genus and found that the number of polymorphic loci decreases rapidly with genetic distance, with the largest numbers detected in other species in section Tacamahaca. Finally, we provide evidence for the utility of the array to address evolutionary questions such as intraspecific studies of genetic differentiation, species assignment and the detection of natural hybrids. © 2013 Blackwell Publishing Ltd.
Comparison of array comparative genomic hybridization and quantitative real-time PCR-based aneuploidy screening of blastocyst biopsies.

PubMed

Capalbo, Antonio; Treff, Nathan R; Cimadomo, Danilo; Tao, Xin; Upham, Kathleen; Ubaldi, Filippo Maria; Rienzi, Laura; Scott, Richard T

2015-07-01

Comprehensive chromosome screening (CCS) methods are being extensively used to select chromosomally normal embryos in human assisted reproduction. Some concerns related to the stage of analysis and which aneuploidy screening method to use still remain. In this study, the reliability of blastocyst-stage aneuploidy screening and the diagnostic performance of the two mostly used CCS methods (quantitative real-time PCR (qPCR) and array comparative genome hybridization (aCGH)) has been assessed. aCGH aneuploid blastocysts were rebiopsied, blinded, and evaluated by qPCR. Discordant cases were subsequently rebiopsied, blinded, and evaluated by single-nucleotide polymorphism (SNP) array-based CCS. Although 81.7% of embryos showed the same diagnosis when comparing aCGH and qPCR-based CCS, 18.3% (22/120) of embryos gave a discordant result for at least one chromosome. SNP array reanalysis showed that a discordance was reported in ten blastocysts for aCGH, mostly due to false positives, and in four cases for qPCR. The discordant aneuploidy call rate per chromosome was significantly higher for aCGH (5.7%) compared with qPCR (0.6%; P<0.01). To corroborate these findings, 39 embryos were simultaneously biopsied for aCGH and qPCR during blastocyst-stage aneuploidy screening cycles. 35 matched including all 21 euploid embryos. Blinded SNP analysis on rebiopsies of the four embryos matched qPCR. These findings demonstrate the high reliability of diagnosis performed at the blastocyst stage with the use of different CCS methods. However, the application of aCGH can be expected to result in a higher aneuploidy rate than other contemporary methods of CCS.
Development and Evaluation of a 9K SNP Array for Peach by Internationally Coordinated SNP Detection and Validation in Breeding Germplasm

PubMed Central

Scalabrin, Simone; Gilmore, Barbara; Lawley, Cynthia T.; Gasic, Ksenija; Micheletti, Diego; Rosyara, Umesh R.; Cattonaro, Federica; Vendramin, Elisa; Main, Dorrie; Aramini, Valeria; Blas, Andrea L.; Mockler, Todd C.; Bryant, Douglas W.; Wilhelm, Larry; Troggio, Michela; Sosinski, Bryon; Aranzana, Maria José; Arús, Pere; Iezzoni, Amy; Morgante, Michele; Peace, Cameron

2012-01-01

Although a large number of single nucleotide polymorphism (SNP) markers covering the entire genome are needed to enable molecular breeding efforts such as genome wide association studies, fine mapping, genomic selection and marker-assisted selection in peach [Prunus persica (L.) Batsch] and related Prunus species, only a limited number of genetic markers, including simple sequence repeats (SSRs), have been available to date. To address this need, an international consortium (The International Peach SNP Consortium; IPSC) has pursued a coordinated effort to perform genome-scale SNP discovery in peach using next generation sequencing platforms to develop and characterize a high-throughput Illumina Infinium® SNP genotyping array platform. We performed whole genome re-sequencing of 56 peach breeding accessions using the Illumina and Roche/454 sequencing technologies. Polymorphism detection algorithms identified a total of 1,022,354 SNPs. Validation with the Illumina GoldenGate® assay was performed on a subset of the predicted SNPs, verifying ∼75% of genic (exonic and intronic) SNPs, whereas only about a third of intergenic SNPs were verified. Conservative filtering was applied to arrive at a set of 8,144 SNPs that were included on the IPSC peach SNP array v1, distributed over all eight peach chromosomes with an average spacing of 26.7 kb between SNPs. Use of this platform to screen a total of 709 accessions of peach in two separate evaluation panels identified a total of 6,869 (84.3%) polymorphic SNPs. The almost 7,000 SNPs verified as polymorphic through extensive empirical evaluation represent an excellent source of markers for future studies in genetic relatedness, genetic mapping, and dissecting the genetic architecture of complex agricultural traits. The IPSC peach SNP array v1 is commercially available and we expect that it will be used worldwide for genetic studies in peach and related stone fruit and nut species. PMID:22536421
High-Performance Multiplex SNP Analysis of Three Hemochromatosis-Related Mutations With Capillary Array Electrophoresis Microplates

PubMed Central

Medintz, Igor; Wong, Wendy W.; Berti, Lorenzo; Shiow, Lawrence; Tom, Jennifer; Scherer, James; Sensabaugh, George; Mathies, Richard A.

2001-01-01

An assay is described for high-throughput single nucleotide polymorphism (SNP) genotyping on a microfabricated capillary array electrophoresis (CAE) microchip. The assay targets the three common variants at the HFE locus associated with the genetic disease hereditary hemochromatosis (HHC). The assay employs allele-specific PCR (ASPCR) for the C282Y (845g->a), H63D (187c->g), and S65C (193a->t) variants using fluorescently-labeled energy-transfer (ET) allele-specific primers. Using a 96-channel radial CAE microplate, the labeled ASPCR products generated from 96 samples in a reference Caucasian population are simultaneously separated with single-base-pair resolution and genotyped in under 10 min. Detection is accomplished with a laser-excited rotary four-color fluorescence scanner. The allele-specific amplicons are differentiated on the basis of both their size and the color of the label emission. This study is the first demonstration of the combined use of ASPCR with ET primers and microfabricated radial CAE microplates to perform multiplex SNP analyses in a clinically relevant population. PMID:11230165
Predicting Breed Composition Using Breed Frequencies of 50,000 Markers from the U.S. Meat Animal Research Center 2,000 Bull Project

USDA-ARS?s Scientific Manuscript database

Our objective was to evaluate whether breed composition of crossbred cattle could be predicted using reference breed frequencies of SNP markers on the BovineSNP50 array. Semen DNA samples of over 2,000 bulls from 16 common commercial beef breeds were genotyped using the array and used to estimate cu...
Sequenza: allele-specific copy number and mutation profiles from tumor sequencing data.

PubMed

Favero, F; Joshi, T; Marquard, A M; Birkbak, N J; Krzystanek, M; Li, Q; Szallasi, Z; Eklund, A C

2015-01-01

Exome or whole-genome deep sequencing of tumor DNA along with paired normal DNA can potentially provide a detailed picture of the somatic mutations that characterize the tumor. However, analysis of such sequence data can be complicated by the presence of normal cells in the tumor specimen, by intratumor heterogeneity, and by the sheer size of the raw data. In particular, determination of copy number variations from exome sequencing data alone has proven difficult; thus, single nucleotide polymorphism (SNP) arrays have often been used for this task. Recently, algorithms to estimate absolute, but not allele-specific, copy number profiles from tumor sequencing data have been described. We developed Sequenza, a software package that uses paired tumor-normal DNA sequencing data to estimate tumor cellularity and ploidy, and to calculate allele-specific copy number profiles and mutation profiles. We applied Sequenza, as well as two previously published algorithms, to exome sequence data from 30 tumors from The Cancer Genome Atlas. We assessed the performance of these algorithms by comparing their results with those generated using matched SNP arrays and processed by the allele-specific copy number analysis of tumors (ASCAT) algorithm. Comparison between Sequenza/exome and SNP/ASCAT revealed strong correlation in cellularity (Pearson's r = 0.90) and ploidy estimates (r = 0.42, or r = 0.94 after manual inspecting alternative solutions). This performance was noticeably superior to previously published algorithms. In addition, in artificial data simulating normal-tumor admixtures, Sequenza detected the correct ploidy in samples with tumor content as low as 30%. The agreement between Sequenza and SNP array-based copy number profiles suggests that exome sequencing alone is sufficient not only for identifying small scale mutations but also for estimating cellularity and inferring DNA copy number aberrations. © The Author 2014. Published by Oxford University Press on behalf of the European Society for Medical Oncology.
Development and evaluation of high-density Axiom® CicerSNP Array for high-resolution genetic mapping and breeding applications in chickpea.

PubMed

Roorkiwal, Manish; Jain, Ankit; Kale, Sandip M; Doddamani, Dadakhalandar; Chitikineni, Annapurna; Thudi, Mahendar; Varshney, Rajeev K

2018-04-01

To accelerate genomics research and molecular breeding applications in chickpea, a high-throughput SNP genotyping platform 'Axiom ® CicerSNP Array' has been designed, developed and validated. Screening of whole-genome resequencing data from 429 chickpea lines identified 4.9 million SNPs, from which a subset of 70 463 high-quality nonredundant SNPs was selected using different stringent filter criteria. This was further narrowed down to 61 174 SNPs based on p-convert score ≥0.3, of which 50 590 SNPs could be tiled on array. Among these tiled SNPs, a total of 11 245 SNPs (22.23%) were from the coding regions of 3673 different genes. The developed Axiom ® CicerSNP Array was used for genotyping two recombinant inbred line populations, namely ICCRIL03 (ICC 4958 × ICC 1882) and ICCRIL04 (ICC 283 × ICC 8261). Genotyping data reflected high success and polymorphic rate, with 15 140 (29.93%; ICCRIL03) and 20 018 (39.57%; ICCRIL04) polymorphic SNPs. High-density genetic maps comprising 13 679 SNPs spanning 1033.67 cM and 7769 SNPs spanning 1076.35 cM were developed for ICCRIL03 and ICCRIL04 populations, respectively. QTL analysis using multilocation, multiseason phenotyping data on these RILs identified 70 (ICCRIL03) and 120 (ICCRIL04) main-effect QTLs on genetic map. Higher precision and potential of this array is expected to advance chickpea genetics and breeding applications. © 2017 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.
Population sequencing reveals breed and sub-species specific CNVs in cattle

USDA-ARS?s Scientific Manuscript database

Individualized copy number variation (CNV) maps have highlighted the need for population surveys of cattle to detect rare and common variants. While SNP and comparative genomic hybridization (CGH) arrays have provided preliminary data, next-generation sequence (NGS) data analysis offers an increased...
Novel SNP array analysis and exome sequencing detect a homozygous exon 7 deletion of MEGF10 causing early onset myopathy, areflexia, respiratory distress and dysphagia (EMARDD)

PubMed Central

Pierson, Tyler Mark; Markello, Thomas; Accardi, John; Wolfe, Lynne; Adams, David; Sincan, Murat; Tarazi, Noor M.; Fajardo, Karin Fuentes; Cherukuri, Praveen F.; Bajraktari, Ilda; Meilleur, Katy G.; Donkervoort, Sandra; Jain, Mina; Hu, Ying; Lehky, Tanya J.; Cruz, Pedro; Mullikin, James C.; Bonnemann, Carsten; Gahl, William A.; Boerkoel, Cornelius F.; Tifft, Cynthia J.

2013-01-01

Early-onset myopathy, areflexia, respiratory distress and dysphagia (EMARDD) is a myopathic disorder associated with mutations in MEGF10. By novel analysis of SNP array hybridization and exome sequence coverage, we diagnosed a 10-year old girl with EMARDD following identification of a novel homozygous deletion of exon 7 in MEGF10. In contrast to previously reported EMARDD patients, her weakness was more prominent proximally than distally, and involved her legs more than her arms. MRI of her pelvis and thighs showed muscle atrophy and fatty replacement. Ultrasound of several muscle groups revealed dense homogenous increases in echogenicity. Cloning and sequencing of the deletion breakpoint identified features suggesting the mutation arose by fork stalling and template switching. These findings constitute the first genomic deletion causing EMARDD, expand the clinical phenotype, and provide new insight into the pattern and histology of its muscular pathology. PMID:23453856
High density genetic mapping identifies new susceptibility loci for rheumatoid arthritis

PubMed Central

Eyre, Steve; Bowes, John; Diogo, Dorothée; Lee, Annette; Barton, Anne; Martin, Paul; Zhernakova, Alexandra; Stahl, Eli; Viatte, Sebastien; McAllister, Kate; Amos, Christopher I.; Padyukov, Leonid; Toes, Rene E.M.; Huizinga, Tom W.J.; Wijmenga, Cisca; Trynka, Gosia; Franke, Lude; Westra, Harm-Jan; Alfredsson, Lars; Hu, Xinli; Sandor, Cynthia; de Bakker, Paul I.W.; Davila, Sonia; Khor, Chiea Chuen; Heng, Khai Koon; Andrews, Robert; Edkins, Sarah; Hunt, Sarah E; Langford, Cordelia; Symmons, Deborah; Concannon, Pat; Onengut-Gumuscu, Suna; Rich, Stephen S; Deloukas, Panos; Gonzalez-Gay, Miguel A.; Rodriguez-Rodriguez, Luis; Ärlsetig, Lisbeth; Martin, Javier; Rantapää-Dahlqvist, Solbritt; Plenge, Robert; Raychaudhuri, Soumya; Klareskog, Lars; Gregersen, Peter K; Worthington, Jane

2012-01-01

Summary Using the Immunochip custom single nucleotide polymorphism (SNP) array, designed for dense genotyping of 186 genome wide association study (GWAS) confirmed loci we analysed 11,475 rheumatoid arthritis cases of European ancestry and 15,870 controls for 129,464 markers. The data were combined in meta-analysis with GWAS data from additional independent cases (n=2,363) and controls (n=17,872). We identified fourteen novel loci; nine were associated with rheumatoid arthritis overall and 5 specifically in anti-citrillunated peptide antibody positive disease, bringing the number of confirmed European ancestry rheumatoid arthritis loci to 46. We refined the peak of association to a single gene for 19 loci, identified secondary independent effects at six loci and association to low frequency variants (minor allele frequency <0.05) at 4 loci. Bioinformatic analysis of the data generated strong hypotheses for the causal SNP at seven loci. This study illustrates the advantages of dense SNP mapping analysis to inform subsequent functional investigations. PMID:23143596

SNP-array lesions in core binding factor acute myeloid leukemia

PubMed Central

Duployez, Nicolas; Boudry-Labis, Elise; Roumier, Christophe; Boissel, Nicolas; Petit, Arnaud; Geffroy, Sandrine; Helevaut, Nathalie; Celli-Lebras, Karine; Terré, Christine; Fenneteau, Odile; Cuccuini, Wendy; Luquet, Isabelle; Lapillonne, Hélène; Lacombe, Catherine; Cornillet, Pascale; Ifrah, Norbert; Dombret, Hervé; Leverger, Guy; Jourdan, Eric; Preudhomme, Claude

2018-01-01

Acute myeloid leukemia (AML) with t(8;21) and inv(16), together referred as core binding factor (CBF)-AML, are recognized as unique entities. Both rearrangements share a common pathophysiology, the disruption of the CBF, and a relatively good prognosis. Experiments have demonstrated that CBF rearrangements were insufficient to induce leukemia, implying the existence of cooperating events. To explore these aberrations, we performed single nucleotide polymorphism (SNP)-array in a well-annotated cohort of 198 patients with CBF-AML. Excluding breakpoint-associated lesions, the most frequent events included loss of a sex chromosome (53%), deletions at 9q21 (12%) and 7q36 (9%) in patients with t(8;21) compared with trisomy 22 (13%), trisomy 8 (10%) and 7q36 deletions (12%) in patients with inv(16). SNP-array revealed novel recurrent genetic alterations likely to be involved in CBF-AML leukemogenesis. ZBTB7A mutations (20% of t(8;21)-AML) were shown to be a target of copy-neutral losses of heterozygosity (CN-LOH) at chromosome 19p. FOXP1 focal deletions were identified in 5% of inv(16)-AML while sequence analysis revealed that 2% carried FOXP1 truncating mutations. Finally, CCDC26 disruption was found in both subtypes (4.5% of the whole cohort) and possibly highlighted a new lesion associated with aberrant tyrosine kinase signaling in this particular subtype of leukemia. PMID:29464086
SNP-array lesions in core binding factor acute myeloid leukemia.

PubMed

Duployez, Nicolas; Boudry-Labis, Elise; Roumier, Christophe; Boissel, Nicolas; Petit, Arnaud; Geffroy, Sandrine; Helevaut, Nathalie; Celli-Lebras, Karine; Terré, Christine; Fenneteau, Odile; Cuccuini, Wendy; Luquet, Isabelle; Lapillonne, Hélène; Lacombe, Catherine; Cornillet, Pascale; Ifrah, Norbert; Dombret, Hervé; Leverger, Guy; Jourdan, Eric; Preudhomme, Claude

2018-01-19

Acute myeloid leukemia (AML) with t(8;21) and inv(16), together referred as core binding factor (CBF)-AML, are recognized as unique entities. Both rearrangements share a common pathophysiology, the disruption of the CBF, and a relatively good prognosis. Experiments have demonstrated that CBF rearrangements were insufficient to induce leukemia, implying the existence of cooperating events. To explore these aberrations, we performed single nucleotide polymorphism (SNP)-array in a well-annotated cohort of 198 patients with CBF-AML. Excluding breakpoint-associated lesions, the most frequent events included loss of a sex chromosome (53%), deletions at 9q21 (12%) and 7q36 (9%) in patients with t(8;21) compared with trisomy 22 (13%), trisomy 8 (10%) and 7q36 deletions (12%) in patients with inv(16). SNP-array revealed novel recurrent genetic alterations likely to be involved in CBF-AML leukemogenesis. ZBTB7A mutations (20% of t(8;21)-AML) were shown to be a target of copy-neutral losses of heterozygosity (CN-LOH) at chromosome 19p. FOXP1 focal deletions were identified in 5% of inv(16)-AML while sequence analysis revealed that 2% carried FOXP1 truncating mutations. Finally, CCDC26 disruption was found in both subtypes (4.5% of the whole cohort) and possibly highlighted a new lesion associated with aberrant tyrosine kinase signaling in this particular subtype of leukemia.
MMP9 polymorphisms and breast cancer risk: a report from the Shanghai Breast Cancer Genetics Study.

PubMed

Beeghly-Fadiel, Alicia; Lu, Wei; Shu, Xiao-Ou; Long, Jirong; Cai, Qiuyin; Xiang, Yongbin; Gao, Yu-Tang; Zheng, Wei

2011-04-01

In addition to tumor invasion and angiogenesis, matrix metalloproteinase (MMP)9 also contributes to carcinogenesis and tumor growth. Genetic variation that may influence MMP9 expression was evaluated among participants of the Shanghai Breast Cancer Genetics Study (SBCGS) for associations with breast cancer susceptibility. In stage 1, 11 MMP9 single nucleotide polymorphisms (SNPs) were genotyped by the Affymetrix Targeted Genotyping System and/or the Affymetrix Genome-Wide Human SNP Array 6.0 among 4,227 SBCGS participants. One SNP was further genotyped using the Sequenom iPLEX MassARRAY platform among an additional 6,270 SBCGS participants. Associations with breast cancer risk were evaluated by odds ratios (OR) and 95% confidence intervals (CI) from logistic regression models that included adjustment for age, education, and genotyping stage when appropriate. In Stage 1, rare allele homozygotes for a promoter SNP (rs3918241) or a non-synonymous SNP (rs2274756, R668Q) tended to occur more frequently among breast cancer cases (P value = 0.116 and 0.056, respectively). Given their high linkage disequilibrium (D' = 1.0, r (2) = 0.97), one (rs3918241) was selected for additional analysis. An association with breast cancer risk was not supported by additional Stage 2 genotyping. In combined analysis, no elevated risk of breast cancer among homozygotes was found (OR: 1.2, 95% CI: 0.8-1.8). Common genetic variation in MMP9 was not found to be significantly associated with breast cancer susceptibility among participants of the Shanghai Breast Cancer Genetics Study.
A 34K SNP genotyping array for Populus trichocarpa: design, application to the study of natural populations and transferability to other Populus species

DOE Office of Scientific and Technical Information (OSTI.GOV)

Geraldes, Armando; Hannemann, Jan; Grassa, Chris

2013-01-01

Genetic mapping of quantitative traits requires genotypic data for large numbers of markers in many individuals. Despite the declining costs of genotyping by sequencing, for most studies, the use of large SNP genotyping arrays still offers the most cost-effective solution for large-scale targeted genotyping. Here we report on the design and performance of a SNP genotyping array for Populus trichocarpa (black cottonwood). This genotyping array was designed with SNPs pre-ascertained in 34 wild accessions covering most of the species range. Due to the rapid decay of linkage disequilibrium in P. trichocarpa we adopted a candidate gene approach to the arraymore » design that resulted in the selection of 34,131 SNPs, the majority of which are located in, or within 2 kb, of 3,543 candidate genes. A subset of the SNPs (539) was selected based on patterns of variation among the SNP discovery accessions. We show that more than 95% of the loci produce high quality genotypes and that the genotyping error rate for these is likely below 2%, indicating that high-quality data are generated with this array. We demonstrate that even among small numbers of samples (n=10) from local populations over 84% of loci are polymorphic. We also tested the applicability of the array to other species in the genus and found that due to ascertainment bias the number of polymorphic loci decreases rapidly with genetic distance, with the largest numbers detected in other species in section Tacamahaca (P. balsamifera and P. angustifolia). Finally, we provide evidence for the utility of the array for intraspecific studies of genetic differentiation and for species assignment and the detection of natural hybrids.« less
affy2sv: an R package to pre-process Affymetrix CytoScan HD and 750K arrays for SNP, CNV, inversion and mosaicism calling.

PubMed

Hernandez-Ferrer, Carles; Quintela Garcia, Ines; Danielski, Katharina; Carracedo, Ángel; Pérez-Jurado, Luis A; González, Juan R

2015-05-20

The well-known Genome-Wide Association Studies (GWAS) had led to many scientific discoveries using SNP data. Even so, they were not able to explain the full heritability of complex diseases. Now, other structural variants like copy number variants or DNA inversions, either germ-line or in mosaicism events, are being studies. We present the R package affy2sv to pre-process Affymetrix CytoScan HD/750k array (also for Genome-Wide SNP 5.0/6.0 and Axiom) in structural variant studies. We illustrate the capabilities of affy2sv using two different complete pipelines on real data. The first one performing a GWAS and a mosaic alterations detection study, and the other detecting CNVs and performing an inversion calling. Both examples presented in the article show up how affy2sv can be used as part of more complex pipelines aimed to analyze Affymetrix SNP arrays data in genetic association studies, where different types of structural variants are considered.
GPHMM: an integrated hidden Markov model for identification of copy number alteration and loss of heterozygosity in complex tumor samples using whole genome SNP arrays

PubMed Central

Li, Ao; Liu, Zongzhi; Lezon-Geyda, Kimberly; Sarkar, Sudipa; Lannin, Donald; Schulz, Vincent; Krop, Ian; Winer, Eric; Harris, Lyndsay; Tuck, David

2011-01-01

There is an increasing interest in using single nucleotide polymorphism (SNP) genotyping arrays for profiling chromosomal rearrangements in tumors, as they allow simultaneous detection of copy number and loss of heterozygosity with high resolution. Critical issues such as signal baseline shift due to aneuploidy, normal cell contamination, and the presence of GC content bias have been reported to dramatically alter SNP array signals and complicate accurate identification of aberrations in cancer genomes. To address these issues, we propose a novel Global Parameter Hidden Markov Model (GPHMM) to unravel tangled genotyping data generated from tumor samples. In contrast to other HMM methods, a distinct feature of GPHMM is that the issues mentioned above are quantitatively modeled by global parameters and integrated within the statistical framework. We developed an efficient EM algorithm for parameter estimation. We evaluated performance on three data sets and show that GPHMM can correctly identify chromosomal aberrations in tumor samples containing as few as 10% cancer cells. Furthermore, we demonstrated that the estimation of global parameters in GPHMM provides information about the biological characteristics of tumor samples and the quality of genotyping signal from SNP array experiments, which is helpful for data quality control and outlier detection in cohort studies. PMID:21398628
High-throughput informative single nucleotide polymorphism-based typing of Neisseria gonorrhoeae using the Sequenom MassARRAY iPLEX platform.

PubMed

Trembizki, Ella; Smith, Helen; Lahra, Monica M; Chen, Marcus; Donovan, Basil; Fairley, Christopher K; Guy, Rebecca; Kaldor, John; Regan, David; Ward, James; Nissen, Michael D; Sloots, Theo P; Whiley, David M

2014-06-01

Neisseria gonorrhoeae antimicrobial resistance (AMR) is a global problem heightened by emerging resistance to ceftriaxone. Appropriate molecular typing methods are important for understanding the emergence and spread of N. gonorrhoeae AMR. We report on the development, validation and testing of a Sequenom MassARRAY iPLEX method for multilocus sequence typing (MLST)-style genotyping of N. gonorrhoeae isolates. An iPLEX MassARRAY method (iPLEX14SNP) was developed targeting 14 informative gonococcal single nucleotide polymorphisms (SNPs) previously shown to predict MLST types. The method was initially validated using 24 N. gonorrhoeae control isolates and was then applied to 397 test isolates collected throughout Queensland, Australia in the first half of 2012. The iPLEX14SNP method provided 100% accuracy for the control isolates, correctly identifying all 14 SNPs for all 24 isolates (336/336). For the 397 test isolates, the iPLEX14SNP assigned results for 5461 of the possible 5558 SNPs (SNP call rate 98.25%), with complete 14 SNP profiles obtained for 364 isolates. Based on the complete SNP profile data, there were 49 different sequence types identified in Queensland, with 11 of the 49 SNP profiles accounting for the majority (n = 280; 77%) of isolates. AMR was dominated by several geographically clustered sequence types. Using the iPLEX14SNP method, up to 384 isolates could be tested within 1 working day for less than Aus$10 per isolate. The iPLEX14SNP offers an accurate and high-throughput method for the MLST-style genotyping of N. gonorrhoeae and may prove particularly useful for large-scale studies investigating the emergence and spread of gonococcal AMR. © The Author 2014. Published by Oxford University Press on behalf of the British Society for Antimicrobial Chemotherapy. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Population sequencing reveals breed and sub-species specific CNVs in cattle

USDA-ARS?s Scientific Manuscript database

Individualized copy number variation (CNV) maps have highlighted the need for population surveys of cattle to detect the rare and common variants. While SNP and comparative genomic hybridization (CGH) arrays have provided preliminary data, next-generation sequence (NGS) data analysis offers an incre...
Evaluation of SNP Data from the Malus Infinium Array Identifies Challenges for Genetic Analysis of Complex Genomes of Polyploid Origin.

PubMed

Troggio, Michela; Surbanovski, Nada; Bianco, Luca; Moretto, Marco; Giongo, Lara; Banchi, Elisa; Viola, Roberto; Fernández, Felicdad Fernández; Costa, Fabrizio; Velasco, Riccardo; Cestaro, Alessandro; Sargent, Daniel James

2013-01-01

High throughput arrays for the simultaneous genotyping of thousands of single-nucleotide polymorphisms (SNPs) have made the rapid genetic characterisation of plant genomes and the development of saturated linkage maps a realistic prospect for many plant species of agronomic importance. However, the correct calling of SNP genotypes in divergent polyploid genomes using array technology can be problematic due to paralogy, and to divergence in probe sequences causing changes in probe binding efficiencies. An Illumina Infinium II whole-genome genotyping array was recently developed for the cultivated apple and used to develop a molecular linkage map for an apple rootstock progeny (M432), but a large proportion of segregating SNPs were not mapped in the progeny, due to unexpected genotype clustering patterns. To investigate the causes of this unexpected clustering we performed BLAST analysis of all probe sequences against the 'Golden Delicious' genome sequence and discovered evidence for paralogous annealing sites and probe sequence divergence for a high proportion of probes contained on the array. Following visual re-evaluation of the genotyping data generated for 8,788 SNPs for the M432 progeny using the array, we manually re-scored genotypes at 818 loci and mapped a further 797 markers to the M432 linkage map. The newly mapped markers included the majority of those that could not be mapped previously, as well as loci that were previously scored as monomorphic, but which segregated due to divergence leading to heterozygosity in probe annealing sites. An evaluation of the 8,788 probes in a diverse collection of Malus germplasm showed that more than half the probes returned genotype clustering patterns that were difficult or impossible to interpret reliably, highlighting implications for the use of the array in genome-wide association studies.
Development of a Medium Density Combined-Species SNP Array for Pacific and European Oysters (Crassostrea gigas and Ostrea edulis).

PubMed

Gutierrez, Alejandro P; Turner, Frances; Gharbi, Karim; Talbot, Richard; Lowe, Natalie R; Peñaloza, Carolina; McCullough, Mark; Prodöhl, Paulo A; Bean, Tim P; Houston, Ross D

2017-07-05

SNP arrays are enabling tools for high-resolution studies of the genetic basis of complex traits in farmed and wild animals. Oysters are of critical importance in many regions from both an ecological and economic perspective, and oyster aquaculture forms a key component of global food security. The aim of our study was to design a combined-species, medium density SNP array for Pacific oyster ( Crassostrea gigas ) and European flat oyster ( Ostrea edulis ), and to test the performance of this array on farmed and wild populations from multiple locations, with a focus on European populations. SNP discovery was carried out by whole-genome sequencing (WGS) of pooled genomic DNA samples from eight C. gigas populations, and restriction site-associated DNA sequencing (RAD-Seq) of 11 geographically diverse O. edulis populations. Nearly 12 million candidate SNPs were discovered and filtered based on several criteria, including preference for SNPs segregating in multiple populations and SNPs with monomorphic flanking regions. An Affymetrix Axiom Custom Array was created and tested on a diverse set of samples ( n = 219) showing ∼27 K high quality SNPs for C. gigas and ∼11 K high quality SNPs for O. edulis segregating in these populations. A high proportion of SNPs were segregating in each of the populations, and the array was used to detect population structure and levels of linkage disequilibrium (LD). Further testing of the array on three C. gigas nuclear families ( n = 165) revealed that the array can be used to clearly distinguish between both families based on identity-by-state (IBS) clustering parental assignment software. This medium density, combined-species array will be publicly available through Affymetrix, and will be applied for genome-wide association and evolutionary genetic studies, and for genomic selection in oyster breeding programs. Copyright © 2017 Gutierrez et al.
Accurate genomic predictions for BCWD resistance in rainbow trout are achieved using low-density SNP panels: Evidence that long-range LD is a major contributing factor.

PubMed

Vallejo, Roger L; Silva, Rafael M O; Evenhuis, Jason P; Gao, Guangtu; Liu, Sixin; Parsons, James E; Martin, Kyle E; Wiens, Gregory D; Lourenco, Daniela A L; Leeds, Timothy D; Palti, Yniv

2018-06-05

Previously accurate genomic predictions for Bacterial cold water disease (BCWD) resistance in rainbow trout were obtained using a medium-density single nucleotide polymorphism (SNP) array. Here, the impact of lower-density SNP panels on the accuracy of genomic predictions was investigated in a commercial rainbow trout breeding population. Using progeny performance data, the accuracy of genomic breeding values (GEBV) using 35K, 10K, 3K, 1K, 500, 300 and 200 SNP panels as well as a panel with 70 quantitative trait loci (QTL)-flanking SNP was compared. The GEBVs were estimated using the Bayesian method BayesB, single-step GBLUP (ssGBLUP) and weighted ssGBLUP (wssGBLUP). The accuracy of GEBVs remained high despite the sharp reductions in SNP density, and even with 500 SNP accuracy was higher than the pedigree-based prediction (0.50-0.56 versus 0.36). Furthermore, the prediction accuracy with the 70 QTL-flanking SNP (0.65-0.72) was similar to the panel with 35K SNP (0.65-0.71). Genomewide linkage disequilibrium (LD) analysis revealed strong LD (r 2 ≥ 0.25) spanning on average over 1 Mb across the rainbow trout genome. This long-range LD likely contributed to the accurate genomic predictions with the low-density SNP panels. Population structure analysis supported the hypothesis that long-range LD in this population may be caused by admixture. Results suggest that lower-cost, low-density SNP panels can be used for implementing genomic selection for BCWD resistance in rainbow trout breeding programs. © 2018 The Authors. This article is a U.S. Government work and is in the public domain in the USA. Journal of Animal Breeding and Genetics published by Blackwell Verlag GmbH.
A novel approach to analyzing fMRI and SNP data via parallel independent component analysis

NASA Astrophysics Data System (ADS)

Liu, Jingyu; Pearlson, Godfrey; Calhoun, Vince; Windemuth, Andreas

2007-03-01

There is current interest in understanding genetic influences on brain function in both the healthy and the disordered brain. Parallel independent component analysis, a new method for analyzing multimodal data, is proposed in this paper and applied to functional magnetic resonance imaging (fMRI) and a single nucleotide polymorphism (SNP) array. The method aims to identify the independent components of each modality and the relationship between the two modalities. We analyzed 92 participants, including 29 schizophrenia (SZ) patients, 13 unaffected SZ relatives, and 50 healthy controls. We found a correlation of 0.79 between one fMRI component and one SNP component. The fMRI component consists of activations in cingulate gyrus, multiple frontal gyri, and superior temporal gyrus. The related SNP component is contributed to significantly by 9 SNPs located in sets of genes, including those coding for apolipoprotein A-I, and C-III, malate dehydrogenase 1 and the gamma-aminobutyric acid alpha-2 receptor. A significant difference in the presences of this SNP component is found between the SZ group (SZ patients and their relatives) and the control group. In summary, we constructed a framework to identify the interactions between brain functional and genetic information; our findings provide new insight into understanding genetic influences on brain function in a common mental disorder.
The use of population-scale sequencing to identify CNVs impacting productive traits in different cattle breeds

USDA-ARS?s Scientific Manuscript database

Individualized copy number variation (CNV) maps have highlighted the need for population surveys of cattle to detect rare and common variants. While SNP and comparative genomic hybridization (CGH) arrays have provided preliminary data, next-generation sequence (NGS) data analysis offers an increased...
Diversity analysis of cotton (Gossypium hirsutum L.) germplasm using the CottonSNP63K Array

USDA-ARS?s Scientific Manuscript database

Cotton germplasm resources contain beneficial alleles that can be exploited to develop germplasm adapting to emerging environmental and climate conditions, and this germplasm has commonly been characterized based on phenotypes. However, phenotypic profiles are limited by what can be observed and me...
Global assessment of genomic variation in cattle by genome resequencing and high-throughput genotyping

PubMed Central

2011-01-01

Background Integration of genomic variation with phenotypic information is an effective approach for uncovering genotype-phenotype associations. This requires an accurate identification of the different types of variation in individual genomes. Results We report the integration of the whole genome sequence of a single Holstein Friesian bull with data from single nucleotide polymorphism (SNP) and comparative genomic hybridization (CGH) array technologies to determine a comprehensive spectrum of genomic variation. The performance of resequencing SNP detection was assessed by combining SNPs that were identified to be either in identity by descent (IBD) or in copy number variation (CNV) with results from SNP array genotyping. Coding insertions and deletions (indels) were found to be enriched for size in multiples of 3 and were located near the N- and C-termini of proteins. For larger indels, a combination of split-read and read-pair approaches proved to be complementary in finding different signatures. CNVs were identified on the basis of the depth of sequenced reads, and by using SNP and CGH arrays. Conclusions Our results provide high resolution mapping of diverse classes of genomic variation in an individual bovine genome and demonstrate that structural variation surpasses sequence variation as the main component of genomic variability. Better accuracy of SNP detection was achieved with little loss of sensitivity when algorithms that implemented mapping quality were used. IBD regions were found to be instrumental for calculating resequencing SNP accuracy, while SNP detection within CNVs tended to be less reliable. CNV discovery was affected dramatically by platform resolution and coverage biases. The combined data for this study showed that at a moderate level of sequencing coverage, an ensemble of platforms and tools can be applied together to maximize the accurate detection of sequence and structural variants. PMID:22082336
KinSNP software for homozygosity mapping of disease genes using SNP microarrays

PubMed Central

2010-01-01

Consanguineous families affected with a recessive genetic disease caused by homozygotisation of a mutation offer a unique advantage for positional cloning of rare diseases. Homozygosity mapping of patient genotypes is a powerful technique for the identification of the genomic locus harbouring the causing mutation. This strategy relies on the observation that in these patients a large region spanning the disease locus is also homozygous with high probability. The high marker density in single nucleotide polymorphism (SNP) arrays is extremely advantageous for homozygosity mapping. We present KinSNP, a user-friendly software tool for homozygosity mapping using SNP arrays. The software searches for stretches of SNPs which are homozygous to the same allele in all ascertained sick individuals. User-specified parameters control the number of allowed genotyping 'errors' within homozygous blocks. Candidate disease regions are then reported in a detailed, coloured Excel file, along with genotypes of family members and healthy controls. An interactive genome browser has been included which shows homozygous blocks, individual genotypes, genes and further annotations along the chromosomes, with zooming and scrolling capabilities. The software has been used to identify the location of a mutated gene causing insensitivity to pain in a large Bedouin family. KinSNP is freely available from http://bioinfo.bgu.ac.il/bsu/software/kinSNP. PMID:20846928
High-Resolution Mapping of Structural Mutations in Prostate Cancer with Single Nucleotide Polymorphism Arrays

DTIC Science & Technology

2006-11-01

study of the NCI60 panel of cancer cell lines [39]. More recently, amplifications of NOTCH3 were noted in ovarian tumors by an SNP array analysis...and the functional role of NOTCH3 was suggested by the ability to suppress cell proliferation by inhibiting NOTCH3 [40]. Allele-specific copy...Identified and functionally validated the oncogene MITF. 40 Park JT, Li M, Nakayama K, et al. Notch3 gene amplification in ovarian cancer. Cancer Res
DOE Office of Scientific and Technical Information (OSTI.GOV)

Gardner, Shea N.; McLoughlin, Kevin; Be, Nicholas A.

Venezuelan equine encephalitis virus (VEEV) is a mosquito-borne alphavirus that has caused large outbreaks of severe illness in both horses and humans. New approaches are needed to rapidly infer the origin of a newly discovered VEEV strain, estimate its equine amplification and resultant epidemic potential, and predict human virulence phenotype. We performed whole genome single nucleotide polymorphism (SNP) analysis of all available VEE antigenic complex genomes, verified that a SNP-based phylogeny accurately captured the features of a phylogenetic tree based on multiple sequence alignment, and developed a high resolution genome-wide SNP microarray. We used the microarray to analyze a broadmore » panel of VEEV isolates, found excellent concordance between array- and sequence-based SNP calls, genotyped unsequenced isolates, and placed them on a phylogeny with sequenced genomes. The microarray successfully genotyped VEEV directly from tissue samples of an infected mouse, bypassing the need for viral isolation, culture and genomic sequencing. Lastly, we identified genomic variants associated with serotypes and host species, revealing a complex relationship between genotype and phenotype.« less
Using RNA-Seq to assemble a rose transcriptome with more than 13,000 full-length expressed genes and to develop the WagRhSNP 68k Axiom SNP array for rose (Rosa L.).

PubMed

Koning-Boucoiran, Carole F S; Esselink, G Danny; Vukosavljev, Mirjana; van 't Westende, Wendy P C; Gitonga, Virginia W; Krens, Frans A; Voorrips, Roeland E; van de Weg, W Eric; Schulz, Dietmar; Debener, Thomas; Maliepaard, Chris; Arens, Paul; Smulders, Marinus J M

2015-01-01

In order to develop a versatile and large SNP array for rose, we set out to mine ESTs from diverse sets of rose germplasm. For this RNA-Seq libraries containing about 700 million reads were generated from tetraploid cut and garden roses using Illumina paired-end sequencing, and from diploid Rosa multiflora using 454 sequencing. Separate de novo assemblies were performed in order to identify single nucleotide polymorphisms (SNPs) within and between rose varieties. SNPs among tetraploid roses were selected for constructing a genotyping array that can be employed for genetic mapping and marker-trait association discovery in breeding programs based on tetraploid germplasm, both from cut roses and from garden roses. In total 68,893 SNPs were included on the WagRhSNP Axiom array. Next, an orthology-guided assembly was performed for the construction of a non-redundant rose transcriptome database. A total of 21,740 transcripts had significant hits with orthologous genes in the strawberry (Fragaria vesca L.) genome. Of these 13,390 appeared to contain the full-length coding regions. This newly established transcriptome resource adds considerably to the currently available sequence resources for the Rosaceae family in general and the genus Rosa in particular.
Parentage Reconstruction in Eucalyptus nitens Using SNPs and Microsatellite Markers: A Comparative Analysis of Marker Data Power and Robustness.

PubMed

Telfer, Emily J; Stovold, Grahame T; Li, Yongjun; Silva-Junior, Orzenil B; Grattapaglia, Dario G; Dungey, Heidi S

2015-01-01

Pedigree reconstruction using molecular markers enables efficient management of inbreeding in open-pollinated breeding strategies, replacing expensive and time-consuming controlled pollination. This is particularly useful in preferentially outcrossed, insect pollinated Eucalypts known to suffer considerable inbreeding depression from related matings. A single nucleotide polymorphism (SNP) marker panel consisting of 106 markers was selected for pedigree reconstruction from the recently developed high-density Eucalyptus Infinium SNP chip (EuCHIP60K). The performance of this SNP panel for pedigree reconstruction in open-pollinated progenies of two Eucalyptus nitens seed orchards was compared with that of two microsatellite panels with 13 and 16 markers respectively. The SNP marker panel out-performed one of the microsatellite panels in the resolution power to reconstruct pedigrees and out-performed both panels with respect to data quality. Parentage of all but one offspring in each clonal seed orchard was correctly matched to the expected seed parent using the SNP marker panel, whereas parentage assignment to less than a third of the expected seed parents were supported using the 13-microsatellite panel. The 16-microsatellite panel supported all but one of the recorded seed parents, one better than the SNP panel, although there was still a considerable level of missing and inconsistent data. SNP marker data was considerably superior to microsatellite data in accuracy, reproducibility and robustness. Although microsatellites and SNPs data provide equivalent resolution for pedigree reconstruction, microsatellite analysis requires more time and experience to deal with the uncertainties of allele calling and faces challenges for data transferability across labs and over time. While microsatellite analysis will continue to be useful for some breeding tasks due to the high information content, existing infrastructure and low operating costs, the multi-species SNP resource available with the EuCHIP60k, opens a whole new array of opportunities for high-throughput, genome-wide or targeted genotyping in species of Eucalyptus.

High-density genetic mapping identifies new susceptibility loci for rheumatoid arthritis.

PubMed

Eyre, Steve; Bowes, John; Diogo, Dorothée; Lee, Annette; Barton, Anne; Martin, Paul; Zhernakova, Alexandra; Stahl, Eli; Viatte, Sebastien; McAllister, Kate; Amos, Christopher I; Padyukov, Leonid; Toes, Rene E M; Huizinga, Tom W J; Wijmenga, Cisca; Trynka, Gosia; Franke, Lude; Westra, Harm-Jan; Alfredsson, Lars; Hu, Xinli; Sandor, Cynthia; de Bakker, Paul I W; Davila, Sonia; Khor, Chiea Chuen; Heng, Khai Koon; Andrews, Robert; Edkins, Sarah; Hunt, Sarah E; Langford, Cordelia; Symmons, Deborah; Concannon, Pat; Onengut-Gumuscu, Suna; Rich, Stephen S; Deloukas, Panos; Gonzalez-Gay, Miguel A; Rodriguez-Rodriguez, Luis; Ärlsetig, Lisbeth; Martin, Javier; Rantapää-Dahlqvist, Solbritt; Plenge, Robert M; Raychaudhuri, Soumya; Klareskog, Lars; Gregersen, Peter K; Worthington, Jane

2012-12-01

Using the Immunochip custom SNP array, which was designed for dense genotyping of 186 loci identified through genome-wide association studies (GWAS), we analyzed 11,475 individuals with rheumatoid arthritis (cases) of European ancestry and 15,870 controls for 129,464 markers. We combined these data in a meta-analysis with GWAS data from additional independent cases (n = 2,363) and controls (n = 17,872). We identified 14 new susceptibility loci, 9 of which were associated with rheumatoid arthritis overall and five of which were specifically associated with disease that was positive for anticitrullinated peptide antibodies, bringing the number of confirmed rheumatoid arthritis risk loci in individuals of European ancestry to 46. We refined the peak of association to a single gene for 19 loci, identified secondary independent effects at 6 loci and identified association to low-frequency variants at 4 loci. Bioinformatic analyses generated strong hypotheses for the causal SNP at seven loci. This study illustrates the advantages of dense SNP mapping analysis to inform subsequent functional investigations.
Nested association mapping for dissecting complex traits using Peanut 58K SNP array

USDA-ARS?s Scientific Manuscript database

Genome-wide association studies (GWAS) and linkage mapping have been the two most predominant strategies to dissect complex traits, but are limited by the occurrence of false positives reported for GWAS, and low resolution in the case of linkage analysis. This has led to the development of a joint a...
Cohort analysis of a single nucleotide polymorphism on DNA chips.

PubMed

Schwonbeck, Susanne; Krause-Griep, Andrea; Gajovic-Eichelmann, Nenad; Ehrentreich-Förster, Eva; Meinl, Walter; Glatt, Hansrüdi; Bier, Frank F

2004-11-15

A method has been developed to determine SNPs on DNA chips by applying a flow-through bioscanner. As a practical application we demonstrated the fast and simple SNP analysis of 24 genotypes in an array of 96 spots with a single hybridisation and dissociation experiment. The main advantage of this methodical concept is the parallel and fast analysis without any need of enzymatic digestion. Additionally, the DNA chip format used is appropriate for parallel analysis up to 400 spots. The polymorphism in the gene of the human phenol sulfotransferase SULT1A1 was studied as a model SNP. Biotinylated PCR products containing the SNP (The SNP summary web site: ) (mutant) and those containing no mutation (wild-type) were brought onto the chips coated with NeutrAvidin using non-contact spotting. This was followed by an analysis which was carried out in a flow-through biochip scanner while constantly rinsing with buffer. After removing the non-biotinylated strand a fluorescent probe was hybridised, which is complementary to the wild-type sequence. If this probe binds to a mutant sequence, then one single base is not fully matching. Thereby, the mismatched hybrid (mutant) is less stable than the full-matched hybrid (wild-type). The final step after hybridisation on the chip involves rinsing with a buffer to start dissociation of the fluorescent probe from the immobilised DNA strand. The online measurement of the fluorescence intensity by the biochip scanner provides the possibility to follow the kinetics of the hybridisation and dissociation processes. According to the different stability of the full-match and the mismatch, either visual discrimination or kinetic analysis is possible to distinguish SNP-containing sequence from the wild-type sequence.
Association analysis for feet and legs disorders with whole-genome sequence variants in 3 dairy cattle breeds.

PubMed

Wu, Xiaoping; Guldbrandtsen, Bernt; Lund, Mogens Sandø; Sahana, Goutam

2016-09-01

Identification of genetic variants associated with feet and legs disorders (FLD) will aid in the genetic improvement of these traits by providing knowledge on genes that influence trait variations. In Denmark, FLD in cattle has been recorded since the 1990s. In this report, we used deregressed breeding values as response variables for a genome-wide association study. Bulls (5,334 Danish Holstein, 4,237 Nordic Red Dairy Cattle, and 1,180 Danish Jersey) with deregressed estimated breeding values were genotyped with the Illumina Bovine 54k single nucleotide polymorphism (SNP) genotyping array. Genotypes were imputed to whole-genome sequence variants, and then 22,751,039 SNP on 29 autosomes were used for an association analysis. A modified linear mixed-model approach (efficient mixed-model association eXpedited, EMMAX) and a linear mixed model were used for association analysis. We identified 5 (3,854 SNP), 3 (13,642 SNP), and 0 quantitative trait locus (QTL) regions associated with the FLD index in Danish Holstein, Nordic Red Dairy Cattle, and Danish Jersey populations, respectively. We did not identify any QTL that were common among the 3 breeds. In a meta-analysis of the 3 breeds, 4 QTL regions were significant, but no additional QTL region was identified compared with within-breed analyses. Comparison between top SNP locations within these QTL regions and known genes suggested that RASGRP1, LCORL, MOS, and MITF may be candidate genes for FLD in dairy cattle. Copyright © 2016 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
Identification of the mechanism underlying a human chimera by SNP array analysis.

PubMed

Shin, So Youn; Yoo, Han-Wook; Lee, Beom Hee; Kim, Kun Suk; Seo, Eul-Ju

2012-09-01

Human chimerism resulting from the fusion of two different zygotes is a rare phenomenon. Two mechanisms of chimerism have been hypothesized: dispermic fertilization of an oocyte and its second polar body and dispermic fertilization of two identical gametes from parthenogenetic activation, and these can be identified and discriminated using DNA polymorphism. In the present study we describe a patient with chimerism presenting as a true hermaphrodite and applied single nucleotide polymorphism array analysis to demonstrate dispermic fertilization of two identical gametes from parthenogenetic activation as the underlying mechanism at the whole chromosome level. We suggest that application of genotyping array analysis to the diagnostic process in patients with disorders of sex development will help identify more human chimera patients and increase our understanding of the underlying mechanisms. Copyright © 2012 Wiley Periodicals, Inc.
KinSNP software for homozygosity mapping of disease genes using SNP microarrays.

PubMed

Amir, El-Ad David; Bartal, Ofer; Morad, Efrat; Nagar, Tal; Sheynin, Jony; Parvari, Ruti; Chalifa-Caspi, Vered

2010-08-01

Consanguineous families affected with a recessive genetic disease caused by homozygotisation of a mutation offer a unique advantage for positional cloning of rare diseases. Homozygosity mapping of patient genotypes is a powerful technique for the identification of the genomic locus harbouring the causing mutation. This strategy relies on the observation that in these patients a large region spanning the disease locus is also homozygous with high probability. The high marker density in single nucleotide polymorphism (SNP) arrays is extremely advantageous for homozygosity mapping. We present KinSNP, a user-friendly software tool for homozygosity mapping using SNP arrays. The software searches for stretches of SNPs which are homozygous to the same allele in all ascertained sick individuals. User-specified parameters control the number of allowed genotyping 'errors' within homozygous blocks. Candidate disease regions are then reported in a detailed, coloured Excel file, along with genotypes of family members and healthy controls. An interactive genome browser has been included which shows homozygous blocks, individual genotypes, genes and further annotations along the chromosomes, with zooming and scrolling capabilities. The software has been used to identify the location of a mutated gene causing insensitivity to pain in a large Bedouin family. KinSNP is freely available from.
Whole-genome single-nucleotide polymorphism (SNP) marker discovery and association analysis with the eicosapentaenoic acid (EPA) and docosahexaenoic acid (DHA) content in Larimichthys crocea

PubMed Central

Xiao, Shijun; Wang, Panpan; Dong, Linsong; Zhang, Yaguang; Han, Zhaofang; Wang, Qiurong

2016-01-01

Whole-genome single-nucleotide polymorphism (SNP) markers are valuable genetic resources for the association and conservation studies. Genome-wide SNP development in many teleost species are still challenging because of the genome complexity and the cost of re-sequencing. Genotyping-By-Sequencing (GBS) provided an efficient reduced representative method to squeeze cost for SNP detection; however, most of recent GBS applications were reported on plant organisms. In this work, we used an EcoRI-NlaIII based GBS protocol to teleost large yellow croaker, an important commercial fish in China and East-Asia, and reported the first whole-genome SNP development for the species. 69,845 high quality SNP markers that evenly distributed along genome were detected in at least 80% of 500 individuals. Nearly 95% randomly selected genotypes were successfully validated by Sequenom MassARRAY assay. The association studies with the muscle eicosapentaenoic acid (EPA) and docosahexaenoic acid (DHA) content discovered 39 significant SNP markers, contributing as high up to ∼63% genetic variance that explained by all markers. Functional genes that involved in fat digestion and absorption pathway were identified, such as APOB, CRAT and OSBPL10. Notably, PPT2 Gene, previously identified in the association study of the plasma n-3 and n-6 polyunsaturated fatty acid level in human, was re-discovered in large yellow croaker. Our study verified that EcoRI-NlaIII based GBS could produce quality SNP markers in a cost-efficient manner in teleost genome. The developed SNP markers and the EPA and DHA associated SNP loci provided invaluable resources for the population structure, conservation genetics and genomic selection of large yellow croaker and other fish organisms. PMID:28028455
Analysis of LDLR mutations in familial hypercholesterolemia patients in Greece by use of the NanoChip microelectronic array technology.

PubMed

Laios, Eleftheria; Drogari, Euridiki

2006-12-01

Three mutations in the low density lipoprotein receptor (LDLR) gene account for 49% of familial hypercholesterolemia (FH) cases in Greece. We used the microelectronic array technology of the NanoChip Molecular Biology Workstation to develop a multiplex method to analyze these single-nucleotide polymorphisms (SNPs). Primer pairs amplified the region encompassing each SNP. The biotinylated PCR amplicon was electronically addressed to streptavidin-coated microarray sites. Allele-specific fluorescently labeled oligonucleotide reporters were designed and used for detection of wild-type and SNP sequences. Genotypes were compared to PCR-restriction fragment length polymorphism (PCR-RFLP). We developed three monoplex assays (1 SNP/site) and an optimized multiplex assay (3SNPs/site). We performed 92 Greece II, 100 Genoa, and 98 Afrikaner-2 NanoChip monoplex assays (addressed to duplicate sites and analyzed separately). Of the 580 monoplex genotypings (290 samples), 579 agreed with RFLP. Duplicate sites of one sample were not in agreement with each other. Of the 580 multiplex genotypings, 576 agreed with the monoplex results. Duplicate sites of three samples were not in agreement with each other, indicating requirement for repetition upon which discrepancies were resolved. The multiplex assay detects common LDLR mutations in Greek FH patients and can be extended to accommodate additional mutations.
Genotype-Phenotype Analysis, Neuropsychological Assessment, and Growth Hormone Response in a Patient with 18p Deletion Syndrome.

PubMed

Sun, Huihui; Wan, Naijun; Wang, Xinli; Chang, Liang; Cheng, Dazhi

2018-01-01

18p deletion syndrome is a rare chromosomal disease caused by deletion of the short arm of chromosome 18. By using cytogenetic and SNP array analysis, we identified a girl with 18p deletion syndrome exhibiting craniofacial anomalies, intellectual disability, and short stature. G-banding analysis of metaphase cells revealed an abnormal karyotype 46,XX,del(18)(p10). Further, SNP array detected a 15.3-Mb deletion at 18p11.21p11.32 (chr18:12842-15375878) including 61 OMIM genes. Genotype-phenotype correlation analysis showed that clinical manifestations of the patient were correlated with LAMA1, TWSG1, and GNAL deletions. Her neuropsychological assessment test demonstrated delay in most cognitive functions including impaired mathematics, linguistic skills, visual motor perception, respond speed, and executive function. Meanwhile, her integrated visual and auditory continuous performance test (IVA-CPT) indicated a severe comprehensive attention deficit. At age 7 and 1/12 years, her height was 110.8 cm (-2.5 SD height for age). Growth hormone (GH) treatment was initiated. After 27 months treatment, her height was increased to 129.6 cm (-1.0 SD height for age) at 9 and 4/12 years, indicating an effective response to GH treatment. © 2018 S. Karger AG, Basel.
Identification of the varietal origin of loose leaf tea based on analysis of a single leaf by SNP nanofluidic array

USDA-ARS?s Scientific Manuscript database

Tea [Camellia sinensis (L.) O Kuntze] is an economically important crop cultivated in more than 50 countries. Production and marketing of premium specialty tea products provides opportunities for tea growers, the tea industry and consumers. Rapid market segmentation in the tea industry has resulted ...
Development of a 690K SNP array in catfish and its application for genetic mapping and validation of the reference genome sequence

USDA-ARS?s Scientific Manuscript database

Single nucleotide polymorphisms (SNPs) are capable of providing the highest level of genome coverage for genomic and genetic analysis because of their abundance and relatively even distribution in the genome. Such a capacity, however, cannot be achieved without an efficient genotyping platform such ...
DOE Office of Scientific and Technical Information (OSTI.GOV)

Gardner, Shea; Slezak, Tom

With the flood of whole genome finished and draft microbial sequences, we need faster, more scalable bioinformatics tools for sequence comparison. An algorithm is described to find single nucleotide polymorphisms (SNPs) in whole genome data. It scales to hundreds of bacterial or viral genomes, and can be used for finished and/or draft genomes available as unassembled contigs. The method is fast to compute, finding SNPs and building a SNP phylogeny in seconds to hours. We use it to identify thousands of putative SNPs from all publicly available Filoviridae, Poxviridae, foot-and-mouth disease virus, Bacillus, and Escherichia coli genomes and plasmids. Themore » SNP-based trees that result are consistent with known taxonomy and trees determined in other studies. The approach we describe can handle as input hundreds of gigabases of sequence in a single run. The algorithm is based on k-mer analysis using a suffix array, so we call it saSNP.« less
Three gangliogliomas: results of GTG-banding, SKY, genome-wide high resolution SNP-array, gene expression and review of the literature.

PubMed

Xu, Li-Xin; Holland, Heidrun; Kirsten, Holger; Ahnert, Peter; Krupp, Wolfgang; Bauer, Manfred; Schober, Ralf; Mueller, Wolf; Fritzsch, Dominik; Meixensberger, Jürgen; Koschny, Ronald

2015-04-01

According to the World Health Organization gangliogliomas are classified as well-differentiated and slowly growing neuroepithelial tumors, composed of neoplastic mature ganglion and glial cells. It is the most frequent tumor entity observed in patients with long-term epilepsy. Comprehensive cytogenetic and molecular cytogenetic data including high-resolution genomic profiling (single nucleotide polymorphism (SNP)-array) of gangliogliomas are scarce but necessary for a better oncological understanding of this tumor entity. For a detailed characterization at the single cell and cell population levels, we analyzed genomic alterations of three gangliogliomas using trypsin-Giemsa banding (GTG-banding) and by spectral karyotyping (SKY) in combination with SNP-array and gene expression array experiments. By GTG and SKY, we could confirm frequently detected chromosomal aberrations (losses within chromosomes 10, 13 and 22; gains within chromosomes 5, 7, 8 and 12), and identify so far unknown genetic aberrations like the unbalanced non-reciprocal translocation t(1;18)(q21;q21). Interestingly, we report on the second so far detected ganglioglioma with ring chromosome 1. Analyses of SNP-array data from two of the tumors and respective germline DNA (peripheral blood) identified few small gains and losses and a number of copy-neutral regions with loss of heterozygosity (LOH) in germline and in tumor tissue. In comparison to germline DNA, tumor tissues did not show substantial regions with significant loss or gain or with newly developed LOH. Gene expression analyses of tumor-specific genes revealed similarities in the profile of the analyzed samples regarding different relevant pathways. Taken together, we describe overlapping but also distinct and novel genetic aberrations of three gangliogliomas. © 2014 Japanese Society of Neuropathology.
Development and application of a novel genome-wide SNP array reveals domestication history in soybean

PubMed Central

Wang, Jiao; Chu, Shanshan; Zhang, Huairen; Zhu, Ying; Cheng, Hao; Yu, Deyue

2016-01-01

Domestication of soybeans occurred under the intense human-directed selections aimed at developing high-yielding lines. Tracing the domestication history and identifying the genes underlying soybean domestication require further exploration. Here, we developed a high-throughput NJAU 355 K SoySNP array and used this array to study the genetic variation patterns in 367 soybean accessions, including 105 wild soybeans and 262 cultivated soybeans. The population genetic analysis suggests that cultivated soybeans have tended to originate from northern and central China, from where they spread to other regions, accompanied with a gradual increase in seed weight. Genome-wide scanning for evidence of artificial selection revealed signs of selective sweeps involving genes controlling domestication-related agronomic traits including seed weight. To further identify genomic regions related to seed weight, a genome-wide association study (GWAS) was conducted across multiple environments in wild and cultivated soybeans. As a result, a strong linkage disequilibrium region on chromosome 20 was found to be significantly correlated with seed weight in cultivated soybeans. Collectively, these findings should provide an important basis for genomic-enabled breeding and advance the study of functional genomics in soybean. PMID:26856884
Development and application of a novel genome-wide SNP array reveals domestication history in soybean.

PubMed

Wang, Jiao; Chu, Shanshan; Zhang, Huairen; Zhu, Ying; Cheng, Hao; Yu, Deyue

2016-02-09

Domestication of soybeans occurred under the intense human-directed selections aimed at developing high-yielding lines. Tracing the domestication history and identifying the genes underlying soybean domestication require further exploration. Here, we developed a high-throughput NJAU 355 K SoySNP array and used this array to study the genetic variation patterns in 367 soybean accessions, including 105 wild soybeans and 262 cultivated soybeans. The population genetic analysis suggests that cultivated soybeans have tended to originate from northern and central China, from where they spread to other regions, accompanied with a gradual increase in seed weight. Genome-wide scanning for evidence of artificial selection revealed signs of selective sweeps involving genes controlling domestication-related agronomic traits including seed weight. To further identify genomic regions related to seed weight, a genome-wide association study (GWAS) was conducted across multiple environments in wild and cultivated soybeans. As a result, a strong linkage disequilibrium region on chromosome 20 was found to be significantly correlated with seed weight in cultivated soybeans. Collectively, these findings should provide an important basis for genomic-enabled breeding and advance the study of functional genomics in soybean.
Evaluation of SNP Data from the Malus Infinium Array Identifies Challenges for Genetic Analysis of Complex Genomes of Polyploid Origin

PubMed Central

Troggio, Michela; Šurbanovski, Nada; Bianco, Luca; Moretto, Marco; Giongo, Lara; Banchi, Elisa; Viola, Roberto; Fernández, Felicdad Fernández; Costa, Fabrizio; Velasco, Riccardo; Cestaro, Alessandro; Sargent, Daniel James

2013-01-01

High throughput arrays for the simultaneous genotyping of thousands of single-nucleotide polymorphisms (SNPs) have made the rapid genetic characterisation of plant genomes and the development of saturated linkage maps a realistic prospect for many plant species of agronomic importance. However, the correct calling of SNP genotypes in divergent polyploid genomes using array technology can be problematic due to paralogy, and to divergence in probe sequences causing changes in probe binding efficiencies. An Illumina Infinium II whole-genome genotyping array was recently developed for the cultivated apple and used to develop a molecular linkage map for an apple rootstock progeny (M432), but a large proportion of segregating SNPs were not mapped in the progeny, due to unexpected genotype clustering patterns. To investigate the causes of this unexpected clustering we performed BLAST analysis of all probe sequences against the ‘Golden Delicious’ genome sequence and discovered evidence for paralogous annealing sites and probe sequence divergence for a high proportion of probes contained on the array. Following visual re-evaluation of the genotyping data generated for 8,788 SNPs for the M432 progeny using the array, we manually re-scored genotypes at 818 loci and mapped a further 797 markers to the M432 linkage map. The newly mapped markers included the majority of those that could not be mapped previously, as well as loci that were previously scored as monomorphic, but which segregated due to divergence leading to heterozygosity in probe annealing sites. An evaluation of the 8,788 probes in a diverse collection of Malus germplasm showed that more than half the probes returned genotype clustering patterns that were difficult or impossible to interpret reliably, highlighting implications for the use of the array in genome-wide association studies. PMID:23826289
Development and characterization of a microheater array device for real-time DNA mutation detection

NASA Astrophysics Data System (ADS)

Williams, Layne; Okandan, Murat; Chagovetz, Alex; Blair, Steve

2008-04-01

DNA analysis, specifically single nucleotide polymorphism (SNP) detection, is becoming increasingly important in rapid diagnostics and disease detection. Temperature is often controlled to help speed reaction rates and perform melting of hybridized oligonucleotides. The difference in melting temperatures, Tm, between wild-type and SNP sequences, respectively, to a given probe oligonucleotide, is indicative of the specificity of the reaction. We have characterized Tm's in solution and on a solid substrate of three sequences from known mutations associated with Cystic Fibrosis. Taking advantage of Tm differences, a microheater array device was designed to enable individual temperature control of up to 18 specific hybridization events. The device was fabricated at Sandia National Laboratories using surface micromachining techniques. The microheaters have been characterized using an IR camera at Sandia and show individual temperature control with minimal thermal cross talk. Development of the device as a real-time DNA detection platform, including surface chemistry and associated microfluidics, is described.
Development and characterization of a microheater array device for real-time DNA mutation detection

NASA Astrophysics Data System (ADS)

Williams, Layne; Okandan, Murat; Chagovetz, Alex; Blair, Steve

2008-02-01

DNA analysis, specifically single nucleotide polymorphism (SNP) detection, is becoming increasingly important in rapid diagnostics and disease detection. Temperature is often controlled to help speed reaction rates and perform melting of hybridized oligonucleotides. The difference in melting temperatures, Tm, between wild-type and SNP sequences, respectively, to a given probe oligonucleotide, is indicative of the specificity of the reaction. We have characterized Tm's in solution and on a solid substrate of three sequences from known mutations associated with Cystic Fibrosis. Taking advantage of Tm differences, a microheater array device was designed to enable individual temperature control of up to 18 specific hybridization events. The device was fabricated at Sandia National Laboratories using surface micromachining techniques. The microheaters have been characterized using an IR camera at Sandia and show individual temperature control with minimal thermal cross talk. Development of the device as a real-time DNA detection platform, including surface chemistry and associated microfluidics, is described.
A Large Maize (Zea mays L.) SNP Genotyping Array: Development and Germplasm Genotyping, and Genetic Mapping to Compare with the B73 Reference Genome

PubMed Central

Ganal, Martin W.; Durstewitz, Gregor; Polley, Andreas; Bérard, Aurélie; Buckler, Edward S.; Charcosset, Alain; Clarke, Joseph D.; Graner, Eva-Maria; Hansen, Mark; Joets, Johann; Le Paslier, Marie-Christine; McMullen, Michael D.; Montalent, Pierre; Rose, Mark; Schön, Chris-Carolin; Sun, Qi; Walter, Hildrun; Martin, Olivier C.; Falque, Matthieu

2011-01-01

SNP genotyping arrays have been useful for many applications that require a large number of molecular markers such as high-density genetic mapping, genome-wide association studies (GWAS), and genomic selection. We report the establishment of a large maize SNP array and its use for diversity analysis and high density linkage mapping. The markers, taken from more than 800,000 SNPs, were selected to be preferentially located in genes and evenly distributed across the genome. The array was tested with a set of maize germplasm including North American and European inbred lines, parent/F1 combinations, and distantly related teosinte material. A total of 49,585 markers, including 33,417 within 17,520 different genes and 16,168 outside genes, were of good quality for genotyping, with an average failure rate of 4% and rates up to 8% in specific germplasm. To demonstrate this array's use in genetic mapping and for the independent validation of the B73 sequence assembly, two intermated maize recombinant inbred line populations – IBM (B73×Mo17) and LHRF (F2×F252) – were genotyped to establish two high density linkage maps with 20,913 and 14,524 markers respectively. 172 mapped markers were absent in the current B73 assembly and their placement can be used for future improvements of the B73 reference sequence. Colinearity of the genetic and physical maps was mostly conserved with some exceptions that suggest errors in the B73 assembly. Five major regions containing non-colinearities were identified on chromosomes 2, 3, 6, 7 and 9, and are supported by both independent genetic maps. Four additional non-colinear regions were found on the LHRF map only; they may be due to a lower density of IBM markers in those regions or to true structural rearrangements between lines. Given the array's high quality, it will be a valuable resource for maize genetics and many aspects of maize breeding. PMID:22174790
Characterization of genetic variability of Venezuelan equine encephalitis viruses

DOE PAGES

Gardner, Shea N.; McLoughlin, Kevin; Be, Nicholas A.; ...

2016-04-07

Venezuelan equine encephalitis virus (VEEV) is a mosquito-borne alphavirus that has caused large outbreaks of severe illness in both horses and humans. New approaches are needed to rapidly infer the origin of a newly discovered VEEV strain, estimate its equine amplification and resultant epidemic potential, and predict human virulence phenotype. We performed whole genome single nucleotide polymorphism (SNP) analysis of all available VEE antigenic complex genomes, verified that a SNP-based phylogeny accurately captured the features of a phylogenetic tree based on multiple sequence alignment, and developed a high resolution genome-wide SNP microarray. We used the microarray to analyze a broadmore » panel of VEEV isolates, found excellent concordance between array- and sequence-based SNP calls, genotyped unsequenced isolates, and placed them on a phylogeny with sequenced genomes. The microarray successfully genotyped VEEV directly from tissue samples of an infected mouse, bypassing the need for viral isolation, culture and genomic sequencing. Lastly, we identified genomic variants associated with serotypes and host species, revealing a complex relationship between genotype and phenotype.« less

Genomic analysis identified a potential novel molecular mechanism for high-altitude adaptation in sheep at the Himalayas.

PubMed

Gorkhali, Neena Amatya; Dong, Kunzhe; Yang, Min; Song, Shen; Kader, Adiljian; Shrestha, Bhola Shankar; He, Xiaohong; Zhao, Qianjun; Pu, Yabin; Li, Xiangchen; Kijas, James; Guan, Weijun; Han, Jianlin; Jiang, Lin; Ma, Yuehui

2016-07-22

Sheep has successfully adapted to the extreme high-altitude Himalayan region. To identify genes underlying such adaptation, we genotyped genome-wide single nucleotide polymorphisms (SNPs) of four major sheep breeds living at different altitudes in Nepal and downloaded SNP array data from additional Asian and Middle East breeds. Using a di value-based genomic comparison between four high-altitude and eight lowland Asian breeds, we discovered the most differentiated variants at the locus of FGF-7 (Keratinocyte growth factor-7), which was previously reported as a good protective candidate for pulmonary injuries. We further found a SNP upstream of FGF-7 that appears to contribute to the divergence signature. First, the SNP occurred at an extremely conserved site. Second, the SNP showed an increasing allele frequency with the elevated altitude in Nepalese sheep. Third, the electrophoretic mobility shift assays (EMSA) analysis using human lung cancer cells revealed the allele-specific DNA-protein interactions. We thus hypothesized that FGF-7 gene potentially enhances lung function by regulating its expression level in high-altitude sheep through altering its binding of specific transcription factors. Especially, FGF-7 gene was not implicated in previous studies of other high-altitude species, suggesting a potential novel adaptive mechanism to high altitude in sheep at the Himalayas.
High-density SNP assay development for genetic analysis in maritime pine (Pinus pinaster).

PubMed

Plomion, C; Bartholomé, J; Lesur, I; Boury, C; Rodríguez-Quilón, I; Lagraulet, H; Ehrenmann, F; Bouffier, L; Gion, J M; Grivet, D; de Miguel, M; de María, N; Cervera, M T; Bagnoli, F; Isik, F; Vendramin, G G; González-Martínez, S C

2016-03-01

Maritime pine provides essential ecosystem services in the south-western Mediterranean basin, where it covers around 4 million ha. Its scattered distribution over a range of environmental conditions makes it an ideal forest tree species for studies of local adaptation and evolutionary responses to climatic change. Highly multiplexed single nucleotide polymorphism (SNP) genotyping arrays are increasingly used to study genetic variation in living organisms and for practical applications in plant and animal breeding and genetic resource conservation. We developed a 9k Illumina Infinium SNP array and genotyped maritime pine trees from (i) a three-generation inbred (F2) pedigree, (ii) the French breeding population and (iii) natural populations from Portugal and the French Atlantic coast. A large proportion of the exploitable SNPs (2052/8410, i.e. 24.4%) segregated in the mapping population and could be mapped, providing the densest ever gene-based linkage map for this species. Based on 5016 SNPs, natural and breeding populations from the French gene pool exhibited similar level of genetic diversity. Population genetics and structure analyses based on 3981 SNP markers common to the Portuguese and French gene pools revealed high levels of differentiation, leading to the identification of a set of highly differentiated SNPs that could be used for seed provenance certification. Finally, we discuss how the validated SNPs could facilitate the identification of ecologically and economically relevant genes in this species, improving our understanding of the demography and selective forces shaping its natural genetic diversity, and providing support for new breeding strategies. © 2015 John Wiley & Sons Ltd.
Estimation and partitioning of (co)heritability of inflammatory bowel disease from GWAS and immunochip data.

PubMed

Chen, Guo-Bo; Lee, Sang Hong; Brion, Marie-Jo A; Montgomery, Grant W; Wray, Naomi R; Radford-Smith, Graham L; Visscher, Peter M

2014-09-01

As custom arrays are cheaper than generic GWAS arrays, larger sample size is achievable for gene discovery. Custom arrays can tag more variants through denser genotyping of SNPs at associated loci, but at the cost of losing genome-wide coverage. Balancing this trade-off is important for maximizing experimental designs. We quantified both the gain in captured SNP-heritability at known candidate regions and the loss due to imperfect genome-wide coverage for inflammatory bowel disease using immunochip (iChip) and imputed GWAS data on 61,251 and 38.550 samples, respectively. For Crohn's disease (CD), the iChip and GWAS data explained 19 and 26% of variation in liability, respectively, and SNPs in the densely genotyped iChip regions explained 13% of the SNP-heritability for both the iChip and GWAS data. For ulcerative colitis (UC), the iChip and GWAS data explained 15 and 19% of variation in liability, respectively, and the dense iChip regions explained 10 and 9% of the SNP-heritability in the iChip and the GWAS data. From bivariate analyses, estimates of the genetic correlation in risk between CD and UC were 0.75 (SE 0.017) and 0.62 (SE 0.042) for the iChip and GWAS data, respectively. We also quantified the SNP-heritability of genomic regions that did or did not contain the previous 163 GWAS hits for CD and UC, and SNP-heritability of the overlapping loci between the densely genotyped iChip regions and the 163 GWAS hits. For both diseases, over different genomic partitioning, the densely genotyped regions on the iChip tagged at least as much variation in liability as in the corresponding regions in the GWAS data, however a certain amount of tagged SNP-heritability in the GWAS data was lost using the iChip due to the low coverage at unselected regions. These results imply that custom arrays with a GWAS backbone will facilitate more gene discovery, both at associated and novel loci. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
CIDR

Science.gov Websites

NIH CIDR Program Studies For whole exome sequencing projects, we pretest all samples using a high -density SNP array (>200,000 markers). For custom targeted sequencing, we pretest all samples using a 96 pretest samples using a 96 SNP GoldenGate assay. This extensive pretesting allows us to unambiguously tie
Novel applications of array comparative genomic hybridization in molecular diagnostics.

PubMed

Cheung, Sau W; Bi, Weimin

2018-05-31

In 2004, the implementation of array comparative genomic hybridization (array comparative genome hybridization [CGH]) into clinical practice marked a new milestone for genetic diagnosis. Array CGH and single-nucleotide polymorphism (SNP) arrays enable genome-wide detection of copy number changes in a high resolution, and therefore microarray has been recognized as the first-tier test for patients with intellectual disability or multiple congenital anomalies, and has also been applied prenatally for detection of clinically relevant copy number variations in the fetus. Area covered: In this review, the authors summarize the evolution of array CGH technology from their diagnostic laboratory, highlighting exonic SNP arrays developed in the past decade which detect small intragenic copy number changes as well as large DNA segments for the region of heterozygosity. The applications of array CGH to human diseases with different modes of inheritance with the emphasis on autosomal recessive disorders are discussed. Expert commentary: An exonic array is a powerful and most efficient clinical tool in detecting genome wide small copy number variants in both dominant and recessive disorders. However, whole-genome sequencing may become the single integrated platform for detection of copy number changes, single-nucleotide changes as well as balanced chromosomal rearrangements in the near future.
Clear cell papillary renal cell carcinoma: a chromosomal microarray analysis of two cases using a novel Molecular Inversion Probe (MIP) technology.

PubMed

Alexiev, Borislav A; Zou, Ying S

2014-12-01

Chromosomal microarray analysis using novel Molecular Inversion Probe (MIP) technology demonstrated 2,570 kb copy neutral LOH of 10q11.22 in two clear cell papillary renal cell carcinomas. In addition, one of the tumors had a big 29,784 kb deletion of 13q11-q14.2. There were two variants of unknown significance, a 2,509 kb gain of Xp22.33 and a 257 kb homozygous deletion of 8p11.22. The somatic mutation panel containing 74 mutations in nine genes did not reveal any mutations. Besides identification of submicroscopic duplications or deletions, SNP microarrays can reveal abnormal allelic imbalances including LOH and copy neutral LOH, which cannot be recognized by chromosome, FISH, and non-SNP microarray arrays. To the best of our knowledge, this is the first study demonstrating copy neutral LOH of 10q11.22 in clear cell papillary renal cell carcinomas using the new MIP SNP OncoScan FFPE Assay Kit on formalin-fixed paraffin-embedded tumor samples. Copyright © 2014 Elsevier GmbH. All rights reserved.
ITALICS: an algorithm for normalization and DNA copy number calling for Affymetrix SNP arrays.

PubMed

Rigaill, Guillem; Hupé, Philippe; Almeida, Anna; La Rosa, Philippe; Meyniel, Jean-Philippe; Decraene, Charles; Barillot, Emmanuel

2008-03-15

Affymetrix SNP arrays can be used to determine the DNA copy number measurement of 11 000-500 000 SNPs along the genome. Their high density facilitates the precise localization of genomic alterations and makes them a powerful tool for studies of cancers and copy number polymorphism. Like other microarray technologies it is influenced by non-relevant sources of variation, requiring correction. Moreover, the amplitude of variation induced by non-relevant effects is similar or greater than the biologically relevant effect (i.e. true copy number), making it difficult to estimate non-relevant effects accurately without including the biologically relevant effect. We addressed this problem by developing ITALICS, a normalization method that estimates both biological and non-relevant effects in an alternate, iterative manner, accurately eliminating irrelevant effects. We compared our normalization method with other existing and available methods, and found that ITALICS outperformed these methods for several in-house datasets and one public dataset. These results were validated biologically by quantitative PCR. The R package ITALICS (ITerative and Alternative normaLIzation and Copy number calling for affymetrix Snp arrays) has been submitted to Bioconductor.
Small cell ovarian carcinoma: genomic stability and responsiveness to therapeutics.

PubMed

Gamwell, Lisa F; Gambaro, Karen; Merziotis, Maria; Crane, Colleen; Arcand, Suzanna L; Bourada, Valerie; Davis, Christopher; Squire, Jeremy A; Huntsman, David G; Tonin, Patricia N; Vanderhyden, Barbara C

2013-02-21

The biology of small cell ovarian carcinoma of the hypercalcemic type (SCCOHT), which is a rare and aggressive form of ovarian cancer, is poorly understood. Tumourigenicity, in vitro growth characteristics, genetic and genomic anomalies, and sensitivity to standard and novel chemotherapeutic treatments were investigated in the unique SCCOHT cell line, BIN-67, to provide further insight in the biology of this rare type of ovarian cancer. The tumourigenic potential of BIN-67 cells was determined and the tumours formed in a xenograft model was compared to human SCCOHT. DNA sequencing, spectral karyotyping and high density SNP array analysis was performed. The sensitivity of the BIN-67 cells to standard chemotherapeutic agents and to vesicular stomatitis virus (VSV) and the JX-594 vaccinia virus was tested. BIN-67 cells were capable of forming spheroids in hanging drop cultures. When xenografted into immunodeficient mice, BIN-67 cells developed into tumours that reflected the hypercalcemia and histology of human SCCOHT, notably intense expression of WT-1 and vimentin, and lack of expression of inhibin. Somatic mutations in TP53 and the most common activating mutations in KRAS and BRAF were not found in BIN-67 cells by DNA sequencing. Spectral karyotyping revealed a largely normal diploid karyotype (in greater than 95% of cells) with a visibly shorter chromosome 20 contig. High density SNP array analysis also revealed few genomic anomalies in BIN-67 cells, which included loss of heterozygosity of an estimated 16.7 Mb interval on chromosome 20. SNP array analyses of four SCCOHT samples also indicated a low frequency of genomic anomalies in the majority of cases. Although resistant to platinum chemotherapeutic drugs, BIN-67 cell viability in vitro was reduced by > 75% after infection with oncolytic viruses. These results show that SCCOHT differs from high-grade serous carcinomas by exhibiting few chromosomal anomalies and lacking TP53 mutations. Although BIN-67 cells are resistant to standard chemotherapeutic agents, their sensitivity to oncolytic viruses suggests that their therapeutic use in SCCOHT should be considered.
Comparison between genotyping by sequencing and SNP-chip genotyping in QTL mapping in wheat

USDA-ARS?s Scientific Manuscript database

Array- or chip-based single nucleotide polymorphism (SNP) markers are widely used in genomic studies because of their abundance in a genome and cost less per data point compared to older marker technologies. Genotyping by sequencing (GBS), a relatively newer approach of genotyping, suggests equal or...
Making a chocolate chip: development and evaluation of a 6K SNP array for Theobroma cacao.

USDA-ARS?s Scientific Manuscript database

Theobroma cacao, the key ingredient in chocolate production, is one of the world's most important tree fruit crops, with ~4,000,000 metric tons produced across 50 countries. To move towards gene discovery and marker-assisted breeding in cacao, a single-nucleotide polymorphism (SNP) identification pr...
Tomato breeding in the genomics era: insights from a SNP array.

PubMed

Víquez-Zamora, Marcela; Vosman, Ben; van de Geest, Henri; Bovy, Arnaud; Visser, Richard G F; Finkers, Richard; van Heusden, Adriaan W

2013-05-27

The major bottle neck in genetic and linkage studies in tomato has been the lack of a sufficient number of molecular markers. This has radically changed with the application of next generation sequencing and high throughput genotyping. A set of 6000 SNPs was identified and 5528 of them were used to evaluate tomato germplasm at the level of species, varieties and segregating populations. From the 5528 SNPs, 1980 originated from 454-sequencing, 3495 from Illumina Solexa sequencing and 53 were additional known markers. Genotyping different tomato samples allowed the evaluation of the level of heterozygosity and introgressions among commercial varieties. Cherry tomatoes were especially different from round/beefs in chromosomes 4, 5 and 12. We were able to identify a set of 750 unique markers distinguishing S. lycopersicum 'Moneymaker' from all its distantly related wild relatives. Clustering and neighbour joining analysis among varieties and species showed expected grouping patterns, with S. pimpinellifolium as the most closely related to commercial tomatoes earlier results. Our results show that a SNP search in only a few breeding lines already provides generally applicable markers in tomato and its wild relatives. It also shows that the Illumina bead array generated data are highly reproducible. Our SNPs can roughly be divided in two categories: SNPs of which both forms are present in the wild relatives and in domesticated tomatoes (originating from common ancestors) and SNPs unique for the domesticated tomato (originating from after the domestication event). The SNPs can be used for genotyping, identification of varieties, comparison of genetic and physical linkage maps and to confirm (phylogenetic) relations. In the SNPs used for the array there is hardly any overlap with the SolCAP array and it is strongly recommended to combine both SNP sets and to select a core collection of robust SNPs completely covering the entire tomato genome.
In Vitro vs In Silico Detected SNPs for the Development of a Genotyping Array: What Can We Learn from a Non-Model Species?

PubMed Central

Lepoittevin, Camille; Frigerio, Jean-Marc; Garnier-Géré, Pauline; Salin, Franck; Cervera, María-Teresa; Vornam, Barbara; Harvengt, Luc; Plomion, Christophe

2010-01-01

Background There is considerable interest in the high-throughput discovery and genotyping of single nucleotide polymorphisms (SNPs) to accelerate genetic mapping and enable association studies. This study provides an assessment of EST-derived and resequencing-derived SNP quality in maritime pine (Pinus pinaster Ait.), a conifer characterized by a huge genome size (∼23.8 Gb/C). Methodology/Principal Findings A 384-SNPs GoldenGate genotyping array was built from i/ 184 SNPs originally detected in a set of 40 re-sequenced candidate genes (in vitro SNPs), chosen on the basis of functionality scores, presence of neighboring polymorphisms, minor allele frequencies and linkage disequilibrium and ii/ 200 SNPs screened from ESTs (in silico SNPs) selected based on the number of ESTs used for SNP detection, the SNP minor allele frequency and the quality of SNP flanking sequences. The global success rate of the assay was 66.9%, and a conversion rate (considering only polymorphic SNPs) of 51% was achieved. In vitro SNPs showed significantly higher genotyping-success and conversion rates than in silico SNPs (+11.5% and +18.5%, respectively). The reproducibility was 100%, and the genotyping error rate very low (0.54%, dropping down to 0.06% when removing four SNPs showing elevated error rates). Conclusions/Significance This study demonstrates that ESTs provide a resource for SNP identification in non-model species, which do not require any additional bench work and little bio-informatics analysis. However, the time and cost benefits of in silico SNPs are counterbalanced by a lower conversion rate than in vitro SNPs. This drawback is acceptable for population-based experiments, but could be dramatic in experiments involving samples from narrow genetic backgrounds. In addition, we showed that both the visual inspection of genotyping clusters and the estimation of a per SNP error rate should help identify markers that are not suitable to the GoldenGate technology in species characterized by a large and complex genome. PMID:20543950
Increased Frequency of De Novo Copy Number Variations in Congenital Heart Disease by Integrative Analysis of SNP Array and Exome Sequence Data

PubMed Central

Rodriguez-Murillo, Laura; Fromer, Menachem; Mazaika, Erica; Vardarajan, Badri; Italia, Michael; Leipzig, Jeremy; DePalma, Steven R.; Golhar, Ryan; Sanders, Stephan J.; Yamrom, Boris; Ronemus, Michael; Iossifov, Ivan; Willsey, A. Jeremy; State, Matthew W.; Kaltman, Jonathan R.; White, Peter S.; Shen, Yufeng; Warburton, Dorothy; Brueckner, Martina; Seidman, Christine; Goldmuntz, Elizabeth; Gelb, Bruce D.; Lifton, Richard; Seidman, Jonathan; Hakonarson, Hakon; Chung, Wendy K.

2014-01-01

Rationale Congenital heart disease (CHD) is among the most common birth defects. Most cases are of unknown etiology. Objective To determine the contribution of de novo copy number variants (CNVs) in the etiology of sporadic CHD. Methods and Results We studied 538 CHD trios using genome-wide dense single nucleotide polymorphism (SNP) arrays and/or whole exome sequencing (WES). Results were experimentally validated using digital droplet PCR. We compared validated CNVs in CHD cases to CNVs in 1,301 healthy control trios. The two complementary high-resolution technologies identified 63 validated de novo CNVs in 51 CHD cases. A significant increase in CNV burden was observed when comparing CHD trios with healthy trios, using either SNP array (p=7x10−5, Odds Ratio (OR)=4.6) or WES data (p=6x10−4, OR=3.5) and remained after removing 16% of de novo CNV loci previously reported as pathogenic (p=0.02, OR=2.7). We observed recurrent de novo CNVs on 15q11.2 encompassing CYFIP1, NIPA1, and NIPA2 and single de novo CNVs encompassing DUSP1, JUN, JUP, MED15, MED9, PTPRE SREBF1, TOP2A, and ZEB2, genes that interact with established CHD proteins NKX2-5 and GATA4. Integrating de novo variants in WES and CNV data suggests that ETS1 is the pathogenic gene altered by 11q24.2-q25 deletions in Jacobsen syndrome and that CTBP2 is the pathogenic gene in 10q sub-telomeric deletions. Conclusions We demonstrate a significantly increased frequency of rare de novo CNVs in CHD patients compared with healthy controls and suggest several novel genetic loci for CHD. PMID:25205790
A girl with incomplete Prader-Willi syndrome and negative MS-PCR, found to have mosaic maternal UPD-15 at SNP array.

PubMed

Morandi, Anita; Bonnefond, Amélie; Lobbens, Stéphane; Carotenuto, Marco; Del Giudice, Emanuele Miraglia; Froguel, Philippe; Maffeis, Claudio

2015-11-01

The Prader-Willi syndrome (PWS) is caused by lack of expression of paternal allele of the 15q11.2-q13 region, due to deletions at paternal 15q11.2-q13 (<70%), maternal uniparental disomy of chromosome 15 (mat-UPD 15) (30%) or imprinting defects (1%). Hyperphagia, intellectual disabilities/behavioral disorders, neonatal hypotonia, and hypogonadism are cardinal features for PWS. Methylation sensitive PCR (MS-PCR) of the SNRPN locus, which assesses the presence of both the unmethylated (paternal) and the methylated (maternal) allele of 15q11.2-q13, is considered a sensitive reference technique for PWS diagnosis regardless of genetic subtype. We describe a 17-year-old girl with severe obesity, short stature, and intellectual disability, without hypogonadism and history of neonatal hypotonia, who was suspected to have an incomplete PWS. The MS-PCR showed a normal pattern with similar maternal and paternal electrophoretic bands. Afterwards, a SNP array showed the presence of iso-UPD 15, that is, UPD15 with two copies of the same chromosome 15, in about 50% of cells, suggesting a diagnosis of partial PWS due to mosaic maternal iso-UPD15 arisen as rescue of a post-fertilization error. A quantitative methylation analysis confirmed the presence of mosaic UPD15 in about 50% of cells. We propose that complete clinical criteria for PWS and MS-PCR should not be considered sensitive in suspecting and diagnosing partial PWS due to mosaic UPD15. In contrast, clinical suspicion based on less restrictive criteria followed by SNP array is a more powerful approach to diagnose atypical PWS due to UPD15 mosaicism. © 2015 Wiley Periodicals, Inc.
Oligonucleotide-arrayed TFT photosensor applicable for DNA chip technology.

PubMed

Tanaka, Tsuyoshi; Hatakeyama, Keiichi; Sawaguchi, Masahiro; Iwadate, Akihito; Mizutani, Yasushi; Sasaki, Kazuhiro; Tateishi, Naofumi; Takeyama, Haruko; Matsunaga, Tadashi

2006-09-05

A thin film transistor (TFT) photosensor fabricated by semiconductor integrated circuit (IC) technology was applied to DNA chip technology. The surface of the TFT photosensor was coated with TiO2 using a vapor deposition technique for the fabrication of optical filters. The immobilization of thiolated oligonucleotide probes onto a TiO2-coated TFT photosensor using gamma-aminopropyltriethoxysilane (APTES) and N-(gamma-maleimidobutyloxy) sulfosuccinimide ester (GMBS) was optimized. The coverage value of immobilized oligonucleotides reached a plateau at 33.7 pmol/cm2, which was similar to a previous analysis using radioisotope-labeled oligonucleotides. The lowest detection limits were 0.05 pmol/cm2 for quantum dot and 2.1 pmol/cm2 for Alexa Fluor 350. Furthermore, single nucleotide polymorphism (SNP) detection was examined using the oligonucleotide-arrayed TFT photosensor. A SNP present in the aldehyde dehydrogenase 2 (ALDH2) gene was used as a target. The SNPs in ALDH2*1 and ALDH2*2 target DNA were detected successfully using the TFT photosensor. DNA hybridization in the presence of both ALDH2*1 and ALDH2*2 target DNA was observed using both ALDH2*1 and ALDH2*2 detection oligonucleotides-arrayed TFT photosensor. Use of the TFT photosensor will allow the development of a disposable photodetecting device for DNA chip systems. (c) 2006 Wiley Periodicals, Inc.
Integrating sequence and array data to create an improved 1000 Genomes Project haplotype reference panel.

PubMed

Delaneau, Olivier; Marchini, Jonathan

2014-06-13

A major use of the 1000 Genomes Project (1000 GP) data is genotype imputation in genome-wide association studies (GWAS). Here we develop a method to estimate haplotypes from low-coverage sequencing data that can take advantage of single-nucleotide polymorphism (SNP) microarray genotypes on the same samples. First the SNP array data are phased to build a backbone (or 'scaffold') of haplotypes across each chromosome. We then phase the sequence data 'onto' this haplotype scaffold. This approach can take advantage of relatedness between sequenced and non-sequenced samples to improve accuracy. We use this method to create a new 1000 GP haplotype reference set for use by the human genetic community. Using a set of validation genotypes at SNP and bi-allelic indels we show that these haplotypes have lower genotype discordance and improved imputation performance into downstream GWAS samples, especially at low-frequency variants.
The Mobilome; A Major Contributor to Escherichia coli stx2-Positive O26:H11 Strains Intra-Serotype Diversity.

PubMed

Delannoy, Sabine; Mariani-Kurkdjian, Patricia; Webb, Hattie E; Bonacorsi, Stephane; Fach, Patrick

2017-01-01

Shiga toxin-producing Escherichia coli of serotype O26:H11/H- constitute a diverse group of strains and several clones with distinct genetic characteristics have been identified and characterized. Whole genome sequencing was performed using Illumina and PacBio technologies on eight stx2 -positive O26:H11 strains circulating in France. Comparative analyses of the whole genome of the stx2 -positive O26:H11 strains indicate that several clones of EHEC O26:H11 are co-circulating in France. Phylogenetic analysis of the French strains together with stx2 -positive and stx -negative E. coli O26:H11 genomes obtained from Genbank indicates the existence of four clonal complexes (SNP-CCs) separated in two distinct lineages, one of which comprises the "new French clone" (SNP-CC1) that appears genetically closely related to stx -negative attaching and effacing E. coli (AEEC) strains. Interestingly, the whole genome SNP (wgSNP) phylogeny is summarized in the cas gene phylogeny, and a simple qPCR assay targeting the CRISPR array specific to SNP-CC1 (SP_O26-E) can distinguish between the two main lineages. The PacBio sequencing allowed a detailed analysis of the mobile genetic elements (MGEs) of the strains. Numerous MGEs were identified in each strain, including a large number of prophages and up to four large plasmids, representing overall 8.7-19.8% of the total genome size. Analysis of the prophage pool of the strains shows a considerable diversity with a complex history of recombination. Each clonal complex (SNP-CC) is characterized by a unique set of plasmids and phages, including stx -prophages, suggesting evolution through separate acquisition events. Overall, the MGEs appear to play a major role in O26:H11 intra-serotype clonal diversification.
The Mobilome; A Major Contributor to Escherichia coli stx2-Positive O26:H11 Strains Intra-Serotype Diversity

PubMed Central

Delannoy, Sabine; Mariani-Kurkdjian, Patricia; Webb, Hattie E.; Bonacorsi, Stephane; Fach, Patrick

2017-01-01

Shiga toxin-producing Escherichia coli of serotype O26:H11/H- constitute a diverse group of strains and several clones with distinct genetic characteristics have been identified and characterized. Whole genome sequencing was performed using Illumina and PacBio technologies on eight stx2-positive O26:H11 strains circulating in France. Comparative analyses of the whole genome of the stx2-positive O26:H11 strains indicate that several clones of EHEC O26:H11 are co-circulating in France. Phylogenetic analysis of the French strains together with stx2-positive and stx-negative E. coli O26:H11 genomes obtained from Genbank indicates the existence of four clonal complexes (SNP-CCs) separated in two distinct lineages, one of which comprises the “new French clone” (SNP-CC1) that appears genetically closely related to stx-negative attaching and effacing E. coli (AEEC) strains. Interestingly, the whole genome SNP (wgSNP) phylogeny is summarized in the cas gene phylogeny, and a simple qPCR assay targeting the CRISPR array specific to SNP-CC1 (SP_O26-E) can distinguish between the two main lineages. The PacBio sequencing allowed a detailed analysis of the mobile genetic elements (MGEs) of the strains. Numerous MGEs were identified in each strain, including a large number of prophages and up to four large plasmids, representing overall 8.7–19.8% of the total genome size. Analysis of the prophage pool of the strains shows a considerable diversity with a complex history of recombination. Each clonal complex (SNP-CC) is characterized by a unique set of plasmids and phages, including stx-prophages, suggesting evolution through separate acquisition events. Overall, the MGEs appear to play a major role in O26:H11 intra-serotype clonal diversification. PMID:28932209
Genome-Wide SNP Detection, Validation, and Development of an 8K SNP Array for Apple

PubMed Central

Chagné, David; Crowhurst, Ross N.; Troggio, Michela; Davey, Mark W.; Gilmore, Barbara; Lawley, Cindy; Vanderzande, Stijn; Hellens, Roger P.; Kumar, Satish; Cestaro, Alessandro; Velasco, Riccardo; Main, Dorrie; Rees, Jasper D.; Iezzoni, Amy; Mockler, Todd; Wilhelm, Larry; Van de Weg, Eric; Gardiner, Susan E.; Bassil, Nahla; Peace, Cameron

2012-01-01

As high-throughput genetic marker screening systems are essential for a range of genetics studies and plant breeding applications, the International RosBREED SNP Consortium (IRSC) has utilized the Illumina Infinium® II system to develop a medium- to high-throughput SNP screening tool for genome-wide evaluation of allelic variation in apple (Malus×domestica) breeding germplasm. For genome-wide SNP discovery, 27 apple cultivars were chosen to represent worldwide breeding germplasm and re-sequenced at low coverage with the Illumina Genome Analyzer II. Following alignment of these sequences to the whole genome sequence of ‘Golden Delicious’, SNPs were identified using SoapSNP. A total of 2,113,120 SNPs were detected, corresponding to one SNP to every 288 bp of the genome. The Illumina GoldenGate® assay was then used to validate a subset of 144 SNPs with a range of characteristics, using a set of 160 apple accessions. This validation assay enabled fine-tuning of the final subset of SNPs for the Illumina Infinium® II system. The set of stringent filtering criteria developed allowed choice of a set of SNPs that not only exhibited an even distribution across the apple genome and a range of minor allele frequencies to ensure utility across germplasm, but also were located in putative exonic regions to maximize genotyping success rate. A total of 7867 apple SNPs was established for the IRSC apple 8K SNP array v1, of which 5554 were polymorphic after evaluation in segregating families and a germplasm collection. This publicly available genomics resource will provide an unprecedented resolution of SNP haplotypes, which will enable marker-locus-trait association discovery, description of the genetic architecture of quantitative traits, investigation of genetic variation (neutral and functional), and genomic selection in apple. PMID:22363718
Targeted capture and resequencing of 1040 genes reveal environmentally driven functional variation in grey wolves.

PubMed

Schweizer, Rena M; Robinson, Jacqueline; Harrigan, Ryan; Silva, Pedro; Galverni, Marco; Musiani, Marco; Green, Richard E; Novembre, John; Wayne, Robert K

2016-01-01

In an era of ever-increasing amounts of whole-genome sequence data for individuals and populations, the utility of traditional single nucleotide polymorphisms (SNPs) array-based genome scans is uncertain. We previously performed a SNP array-based genome scan to identify candidate genes under selection in six distinct grey wolf (Canis lupus) ecotypes. Using this information, we designed a targeted capture array for 1040 genes, including all exons and flanking regions, as well as 5000 1-kb nongenic neutral regions, and resequenced these regions in 107 wolves. Selection tests revealed striking patterns of variation within candidate genes relative to noncandidate regions and identified potentially functional variants related to local adaptation. We found 27% and 47% of candidate genes from the previous SNP array study had functional changes that were outliers in sweed and bayenv analyses, respectively. This result verifies the use of genomewide SNP surveys to tag genes that contain functional variants between populations. We highlight nonsynonymous variants in APOB, LIPG and USH2A that occur in functional domains of these proteins, and that demonstrate high correlation with precipitation seasonality and vegetation. We find Arctic and High Arctic wolf ecotypes have higher numbers of genes under selection, which highlight their conservation value and heightened threat due to climate change. This study demonstrates that combining genomewide genotyping arrays with large-scale resequencing and environmental data provides a powerful approach to discern candidate functional variants in natural populations. © 2015 John Wiley & Sons Ltd.

Exploiting sequence similarity to validate the sensitivity of SNP arrays in detecting fine-scaled copy number variations.

PubMed

Wong, Gerard; Leckie, Christopher; Gorringe, Kylie L; Haviv, Izhak; Campbell, Ian G; Kowalczyk, Adam

2010-04-15

High-density single nucleotide polymorphism (SNP) genotyping arrays are efficient and cost effective platforms for the detection of copy number variation (CNV). To ensure accuracy in probe synthesis and to minimize production costs, short oligonucleotide probe sequences are used. The use of short probe sequences limits the specificity of binding targets in the human genome. The specificity of these short probeset sequences has yet to be fully analysed against a normal reference human genome. Sequence similarity can artificially elevate or suppress copy number measurements, and hence reduce the reliability of affected probe readings. For the purpose of detecting narrow CNVs reliably down to the width of a single probeset, sequence similarity is an important issue that needs to be addressed. We surveyed the Affymetrix Human Mapping SNP arrays for probeset sequence similarity against the reference human genome. Utilizing sequence similarity results, we identified a collection of fine-scaled putative CNVs between gender from autosomal probesets whose sequence matches various loci on the sex chromosomes. To detect these variations, we utilized our statistical approach, Detecting REcurrent Copy number change using rank-order Statistics (DRECS), and showed that its performance was superior and more stable than the t-test in detecting CNVs. Through the application of DRECS on the HapMap population datasets with multi-matching probesets filtered, we identified biologically relevant SNPs in aberrant regions across populations with known association to physical traits, such as height, covered by the span of a single probe. This provided empirical confirmation of the existence of naturally occurring narrow CNVs as well as the sensitivity of the Affymetrix SNP array technology in detecting them. The MATLAB implementation of DRECS is available at http://ww2.cs.mu.oz.au/ approximately gwong/DRECS/index.html.
The Minnesota Center for Twin and Family Research Genome-Wide Association Study

PubMed Central

Miller, Michael B.; Basu, Saonli; Cunningham, Julie; Eskin, Eleazar; Malone, Steven M.; Oetting, William S.; Schork, Nicholas; Sul, Jae Hoon; Iacono, William G.; Mcgue, Matt

2012-01-01

As part of the Genes, Environment and Development Initiative (GEDI), the Minnesota Center for Twin and Family Research (MCTFR) undertook a genome-wide association study (GWAS), which we describe here. A total of 8405 research participants, clustered in 4-member families, have been successfully genotyped on 527,829 single nucleotide polymorphism (SNP) markers using Illumina’s Human660W-Quad array. Quality control screening of samples and markers as well as SNP imputation procedures are described. We also describe methods for ancestry control and how the familial clustering of the MCTFR sample can be accounted for in the analysis using a Rapid Feasible Generalized Least Squares algorithm. The rich longitudinal MCTFR assessments provide numerous opportunities for collaboration. PMID:23363460
Application of Nexus copy number software for CNV detection and analysis.

PubMed

Darvishi, Katayoon

2010-04-01

Among human structural genomic variation, copy number variants (CNVs) are the most frequently known component, comprised of gains/losses of DNA segments that are generally 1 kb in length or longer. Array-based comparative genomic hybridization (aCGH) has emerged as a powerful tool for detecting genomic copy number variants (CNVs). With the rapid increase in the density of array technology and with the adaptation of new high-throughput technology, a reliable and computationally scalable method for accurate mapping of recurring DNA copy number aberrations has become a main focus in research. Here we introduce Nexus Copy Number software, a platform-independent tool, to analyze the output files of all types of commercial and custom-made comparative genomic hybridization (CGH) and single-nucleotide polymorphism (SNP) arrays, such as those manufactured by Affymetrix, Agilent Technologies, Illumina, and Roche NimbleGen. It also supports data generated by various array image-analysis software tools such as GenePix, ImaGene, and BlueFuse. (c) 2010 by John Wiley & Sons, Inc.
A Complex 6p25 Rearrangement in a Child With Multiple Epiphyseal Dysplasia

PubMed Central

Bedoyan, Jirair K.; Lesperance, Marci M.; Ackley, Todd; Iyer, Ramaswamy K.; Innis, Jeffrey W.; Misra, Vinod K.

2015-01-01

Genomic rearrangements are increasingly recognized as important contributors to human disease. Here we report on an 11½-year-old child with myopia, Duane retraction syndrome, bilateral mixed hearing loss, skeletal anomalies including multiple epiphyseal dysplasia, and global developmental delay, and a complex 6p25 genomic rearrangement. We have employed oligonucleotide-based comparative genomic hybridization arrays (aCGH) of different resolutions (44 and 244K) as well as a 1 M single nucleotide polymorphism (SNP) array to analyze this complex rearrangement. Our analyses reveal a complex rearrangement involving a ~2.21 Mb interstitial deletion, a ~240 kb terminal deletion, and a 70–80 kb region in between these two deletions that shows maintenance of genomic copy number. The interstitial deletion contains eight known genes, including three Forkhead box containing (FOX) transcription factors (FOXQ1, FOXF2, and FOXC1). The region maintaining genomic copy number partly overlaps the dual specificity protein phosphatase 22 (DUSP22) gene. Array analyses suggest a homozygous loss of genomic material at the 5′ end of DUSP22, which was corroborated using TaqMan® copy number analysis. It is possible that this homozygous genomic loss may render both copies of DUSP22 or its products non-functional. Our analysis suggests a rearrangement mechanism distinct from a previously reported replication-based error-prone mechanism without template switching for a specific 6p25 rearrangement with a 1.22 Mb interstitial deletion. Our study demonstrates the utility and limitations of using oligonucleotide-based aCGH and SNP array technologies of increasing resolutions in order to identify complex DNA rearrangements and gene disruptions. PMID:21204225
Automated tetraploid genotype calling by hierarchical clustering

USDA-ARS?s Scientific Manuscript database

SNP arrays are transforming breeding and genetics research for autotetraploids. To fully utilize these arrays, however, the relationship between signal intensity and allele dosage must be inferred independently for each marker. We developed an improved computational method to automate this process, ...
Linkage disequilibrium and signatures of positive selection around LINE-1 retrotransposons in the human genome.

PubMed

Kuhn, Alexandre; Ong, Yao Min; Cheng, Ching-Yu; Wong, Tien Yin; Quake, Stephen R; Burkholder, William F

2014-06-03

Insertions of the human-specific subfamily of LINE-1 (L1) retrotransposon are highly polymorphic across individuals and can critically influence the human transcriptome. We hypothesized that L1 insertions could represent genetic variants determining important human phenotypic traits, and performed an integrated analysis of L1 elements and single nucleotide polymorphisms (SNPs) in several human populations. We found that a large fraction of L1s were in high linkage disequilibrium with their surrounding genomic regions and that they were well tagged by SNPs. However, L1 variants were only partially captured by SNPs on standard SNP arrays, so that their potential phenotypic impact would be frequently missed by SNP array-based genome-wide association studies. We next identified potential phenotypic effects of L1s by looking for signatures of natural selection linked to L1 insertions; significant extended haplotype homozygosity was detected around several L1 insertions. This finding suggests that some of these L1 insertions may have been the target of recent positive selection.
Genomic analysis using high density SNP based oligonucleotide arrays and MLPA provides a comprehensive analysis of INI1/SMARCB1 in malignant rhabdoid tumors

PubMed Central

Jackson, Eric M.; Sievert, Angela J.; Gai, Xiaowu; Hakonarson, Hakon; Judkins, Alexander R; Tooke, Laura; Perin, Juan Carlos; Xie, Hongbo; Shaikh, Tamim H.; Biegel, Jaclyn A.

2009-01-01

Translational Relevance Previous reports suggested that abnormalities of INI1 could be detected in 70–75% of malignant rhabdoid tumors. The mechanism of inactivation in the other 25% remained unclear. The goal of this study was to perform a high-resolution genomic analysis of a large series of rhabdoid tumors with the expectation of identifying additional loci related to the initiation or progression of these malignancies. We also developed a comprehensive set of assays, including a new MLPA assay, to interrogate the INI1 locus in 22q11.2. Intragenic deletions could be detected using the Illumina 550K Beadchip, whereas single exon deletions could be detected using MLPA. The current study demonstrates that with a multi-platform approach, alterations at the INI1 locus can be detected in almost all cases. Thus, appropriate molecular genetic testing can be used as an aid in the diagnosis and for treatment planning for most patients. Purpose A high-resolution genomic profiling and comprehensive targeted analysis of INI1/SMARCB1 of a large series of pediatric rhabdoid tumors was performed. The aim was to identify regions of copy number change and loss of heterozygosity that might pinpoint additional loci involved in the development or progression of rhabdoid tumors, and define the spectrum of genomic alterations of INI1 in this malignancy. Experimental Design A multi-platform approach, utilizing Illumina single nucleotide polymorphism (SNP) based oligonucleotide arrays, multiplex ligation dependent probe amplification (MLPA), fluorescence in situ hybridization (FISH), and coding sequence analysis was used to characterize genome wide copy number changes, loss of heterozygosity, and genomic alterations of INI1/SMARCB1 in a series of pediatric rhabdoid tumors. Results The bi-allelic alterations of INI1 that led to inactivation were elucidated in 50 of 51 tumors. INI1 inactivation was demonstrated by a variety of mechanisms, including deletions, mutations, and loss of heterozygosity. The results from the array studies highlighted the complexity of rearrangements of chromosome 22, compared to the low frequency of alterations involving the other chromosomes. Conclusions The results from the genome wide SNP-array analysis suggest that INI1 is the primary tumor suppressor gene involved in the development of rhabdoid tumors with no second locus identified. In addition, we did not identify hot spots for the breakpoints in sporadic tumors with deletions of chromosome 22q11.2. By employing a multimodality approach, the wide spectrum of alterations of INI1 can be identified in the majority of patients, which increases the clinical utility of molecular diagnostic testing. PMID:19276269
Evaluation of Genomic Instability in the Abnormal Prostate

DTIC Science & Technology

2006-12-01

array CGH maps copy number aberrations relative to the genome sequence by using arrays of BAC or cDNA clones as the hybridization target instead of...data produced from these analyses complicate the interpretation of results . For these reasons, and as outlined by Davies et al., 22 it is desirable...There have been numerous studies of these abnormalities and several techniques, including 9 chromosome painting, array CGH and SNP arrays , have
Genetic diversity, linkage disequilibrium, population structure and construction of a core collection of Prunus avium L. landraces and bred cultivars.

PubMed

Campoy, José Antonio; Lerigoleur-Balsemin, Emilie; Christmann, Hélène; Beauvieux, Rémi; Girollet, Nabil; Quero-García, José; Dirlewanger, Elisabeth; Barreneche, Teresa

2016-02-24

Depiction of the genetic diversity, linkage disequilibrium (LD) and population structure is essential for the efficient organization and exploitation of genetic resources. The objectives of this study were to (i) to evaluate the genetic diversity and to detect the patterns of LD, (ii) to estimate the levels of population structure and (iii) to identify a 'core collection' suitable for association genetic studies in sweet cherry. A total of 210 genotypes including modern cultivars and landraces from 16 countries were genotyped using the RosBREED cherry 6 K SNP array v1. Two groups, mainly bred cultivars and landraces, respectively, were first detected using STRUCTURE software and confirmed by Principal Coordinate Analysis (PCoA). Further analyses identified nine subgroups using STRUCTURE and Discriminant Analysis of Principal Components (DAPC). Several sub-groups correspond to different eco-geographic regions of landraces distribution. Linkage disequilibrium was evaluated showing lower values than in peach, the reference Prunus species. A 'core collection' containing 156 accessions was selected using the maximum length sub tree method. The present study constitutes the first population genetics analysis in cultivated sweet cherry using a medium-density SNP (single nucleotide polymorphism) marker array. We provided estimations of linkage disequilibrium, genetic structure and the definition of a first INRA's Sweet Cherry core collection useful for breeding programs, germplasm management and association genetics studies.
Genetic Identity in Genebanks: Application of the SolCAP 12K SNP Array in Fingerprinting and Diversity Analysis in the Global In Trust Potato Collection.

PubMed

Ellis, David; Chavez, Oswaldo; Coombs, Joseph J; Soto, Julian V; Gomez, Rene; Douches, David S; Panta, Ana; Silvestre, Rocio; Anglin, Noelle Lynette

2018-05-24

Breeders rely on genetic integrity of material from genebanks, however, mislabeling and errors in original data can occur. Paired samples of original material and their in vitro counterparts from 250 diverse potato landrace accessions from the International Potato Center (CIP), were fingerprinted using the Infinium 12K V2 Potato Array to confirm genetic identity and evaluate genetic diversity. Diploid, triploid, and tetraploid accessions were included representing seven cultivated potato taxa (Hawkes, 1990). Fingerprints between mother field plants and in vitro clones, were used to evaluate identity, relatedness, and ancestry. Clones of the same accession grouped together, however eleven (4.4%) accessions were mismatches genetically. SNP genotypes were used to construct a phylogeny to evaluate inter- and intraspecific relationships and population structure. Data suggests that the triploids evaluated are genetically similar. STRUCTURE analysis identified several putative hybrids and suggests six populations with significant gene flow between. This study provides a model for genetic identity of plant genetic resources collections as mistakes in conservation of these collections and in genebanks is a reality and confirmed identity is critical for breeders and other users of these collections, as well as for quality management programs and to provide insights into the diversity of the accessions evaluated.
p.Q192R SNP of PON1 seems not to be Associated with Carotid Atherosclerosis Risk Factors in an Asymptomatic and Normolipidemic Brazilian Population Sample

PubMed Central

Scherrer, Daniel Zanetti; Zago, Vanessa Helena de Souza; Vieira, Isabela Calanca; Parra, Eliane Soler; Panzoldo, Natália Baratella; Alexandre, Fernanda; Secolin, Rodrigo; Baracat, Jamal; Quintão, Eder Carlos Rocha; de Faria, Eliana Cotta

2015-01-01

Background Evidences suggest that paraoxonase 1 (PON1) confers important antioxidant and anti-inflammatory properties when associated with high-density lipoprotein (HDL). Objective To investigate the relationships between p.Q192R SNP of PON1, biochemical parameters and carotid atherosclerosis in an asymptomatic, normolipidemic Brazilian population sample. Methods We studied 584 volunteers (females n = 326, males n = 258; 19-75 years of age). Total genomic DNA was extracted and SNP was detected in the TaqMan® SNP OpenArray® genotyping platform (Applied Biosystems, Foster City, CA). Plasma lipoproteins and apolipoproteins were determined and PON1 activity was measured using paraoxon as a substrate. High-resolution β-mode ultrasonography was used to measure cIMT and the presence of carotid atherosclerotic plaques in a subgroup of individuals (n = 317). Results The presence of p.192Q was associated with a significant increase in PON1 activity (RR = 12.30 (11.38); RQ = 46.96 (22.35); QQ = 85.35 (24.83) μmol/min; p < 0.0001), HDL-C (RR= 45 (37); RQ = 62 (39); QQ = 69 (29) mg/dL; p < 0.001) and apo A-I (RR = 140.76 ± 36.39; RQ = 147.62 ± 36.92; QQ = 147.49 ± 36.65 mg/dL; p = 0.019). Stepwise regression analysis revealed that heterozygous and p.192Q carriers influenced by 58% PON1 activity towards paraoxon. The univariate linear regression analysis demonstrated that p.Q192R SNP was not associated with mean cIMT; as a result, in the multiple regression analysis, no variables were selected with 5% significance. In logistic regression analysis, the studied parameters were not associated with the presence of carotid plaques. Conclusion In low-risk individuals, the presence of the p.192Q variant of PON1 is associated with a beneficial plasma lipid profile but not with carotid atherosclerosis. PMID:26039660
p.Q192R SNP of PON1 seems not to be Associated with Carotid Atherosclerosis Risk Factors in an Asymptomatic and Normolipidemic Brazilian Population Sample.

PubMed

Scherrer, Daniel Zanetti; Zago, Vanessa Helena de Souza; Vieira, Isabela Calanca; Parra, Eliane Soler; Panzoldo, Natália Baratella; Alexandre, Fernanda; Secolin, Rodrigo; Baracat, Jamal; Quintão, Eder Carlos Rocha; Faria, Eliana Cotta de

2015-07-01

Evidences suggest that paraoxonase 1 (PON1) confers important antioxidant and anti-inflammatory properties when associated with high-density lipoprotein (HDL). To investigate the relationships between p.Q192R SNP of PON1, biochemical parameters and carotid atherosclerosis in an asymptomatic, normolipidemic Brazilian population sample. We studied 584 volunteers (females n = 326, males n = 258; 19-75 years of age). Total genomic DNA was extracted and SNP was detected in the TaqMan® SNP OpenArray® genotyping platform (Applied Biosystems, Foster City, CA). Plasma lipoproteins and apolipoproteins were determined and PON1 activity was measured using paraoxon as a substrate. High-resolution β-mode ultrasonography was used to measure cIMT and the presence of carotid atherosclerotic plaques in a subgroup of individuals (n = 317). The presence of p.192Q was associated with a significant increase in PON1 activity (RR = 12.30 (11.38); RQ = 46.96 (22.35); QQ = 85.35 (24.83) μmol/min; p < 0.0001), HDL-C (RR= 45 (37); RQ = 62 (39); QQ = 69 (29) mg/dL; p < 0.001) and apo A-I (RR = 140.76 ± 36.39; RQ = 147.62 ± 36.92; QQ = 147.49 ± 36.65 mg/dL; p = 0.019). Stepwise regression analysis revealed that heterozygous and p.192Q carriers influenced by 58% PON1 activity towards paraoxon. The univariate linear regression analysis demonstrated that p.Q192R SNP was not associated with mean cIMT; as a result, in the multiple regression analysis, no variables were selected with 5% significance. In logistic regression analysis, the studied parameters were not associated with the presence of carotid plaques. In low-risk individuals, the presence of the p.192Q variant of PON1 is associated with a beneficial plasma lipid profile but not with carotid atherosclerosis.
Optimal Design of Low-Density SNP Arrays for Genomic Prediction: Algorithm and Applications.

PubMed

Wu, Xiao-Lin; Xu, Jiaqi; Feng, Guofei; Wiggans, George R; Taylor, Jeremy F; He, Jun; Qian, Changsong; Qiu, Jiansheng; Simpson, Barry; Walker, Jeremy; Bauck, Stewart

2016-01-01

Low-density (LD) single nucleotide polymorphism (SNP) arrays provide a cost-effective solution for genomic prediction and selection, but algorithms and computational tools are needed for the optimal design of LD SNP chips. A multiple-objective, local optimization (MOLO) algorithm was developed for design of optimal LD SNP chips that can be imputed accurately to medium-density (MD) or high-density (HD) SNP genotypes for genomic prediction. The objective function facilitates maximization of non-gap map length and system information for the SNP chip, and the latter is computed either as locus-averaged (LASE) or haplotype-averaged Shannon entropy (HASE) and adjusted for uniformity of the SNP distribution. HASE performed better than LASE with ≤1,000 SNPs, but required considerably more computing time. Nevertheless, the differences diminished when >5,000 SNPs were selected. Optimization was accomplished conditionally on the presence of SNPs that were obligated to each chromosome. The frame location of SNPs on a chip can be either uniform (evenly spaced) or non-uniform. For the latter design, a tunable empirical Beta distribution was used to guide location distribution of frame SNPs such that both ends of each chromosome were enriched with SNPs. The SNP distribution on each chromosome was finalized through the objective function that was locally and empirically maximized. This MOLO algorithm was capable of selecting a set of approximately evenly-spaced and highly-informative SNPs, which in turn led to increased imputation accuracy compared with selection solely of evenly-spaced SNPs. Imputation accuracy increased with LD chip size, and imputation error rate was extremely low for chips with ≥3,000 SNPs. Assuming that genotyping or imputation error occurs at random, imputation error rate can be viewed as the upper limit for genomic prediction error. Our results show that about 25% of imputation error rate was propagated to genomic prediction in an Angus population. The utility of this MOLO algorithm was also demonstrated in a real application, in which a 6K SNP panel was optimized conditional on 5,260 obligatory SNP selected based on SNP-trait association in U.S. Holstein animals. With this MOLO algorithm, both imputation error rate and genomic prediction error rate were minimal.
Optimal Design of Low-Density SNP Arrays for Genomic Prediction: Algorithm and Applications

PubMed Central

Wu, Xiao-Lin; Xu, Jiaqi; Feng, Guofei; Wiggans, George R.; Taylor, Jeremy F.; He, Jun; Qian, Changsong; Qiu, Jiansheng; Simpson, Barry; Walker, Jeremy; Bauck, Stewart

2016-01-01

Low-density (LD) single nucleotide polymorphism (SNP) arrays provide a cost-effective solution for genomic prediction and selection, but algorithms and computational tools are needed for the optimal design of LD SNP chips. A multiple-objective, local optimization (MOLO) algorithm was developed for design of optimal LD SNP chips that can be imputed accurately to medium-density (MD) or high-density (HD) SNP genotypes for genomic prediction. The objective function facilitates maximization of non-gap map length and system information for the SNP chip, and the latter is computed either as locus-averaged (LASE) or haplotype-averaged Shannon entropy (HASE) and adjusted for uniformity of the SNP distribution. HASE performed better than LASE with ≤1,000 SNPs, but required considerably more computing time. Nevertheless, the differences diminished when >5,000 SNPs were selected. Optimization was accomplished conditionally on the presence of SNPs that were obligated to each chromosome. The frame location of SNPs on a chip can be either uniform (evenly spaced) or non-uniform. For the latter design, a tunable empirical Beta distribution was used to guide location distribution of frame SNPs such that both ends of each chromosome were enriched with SNPs. The SNP distribution on each chromosome was finalized through the objective function that was locally and empirically maximized. This MOLO algorithm was capable of selecting a set of approximately evenly-spaced and highly-informative SNPs, which in turn led to increased imputation accuracy compared with selection solely of evenly-spaced SNPs. Imputation accuracy increased with LD chip size, and imputation error rate was extremely low for chips with ≥3,000 SNPs. Assuming that genotyping or imputation error occurs at random, imputation error rate can be viewed as the upper limit for genomic prediction error. Our results show that about 25% of imputation error rate was propagated to genomic prediction in an Angus population. The utility of this MOLO algorithm was also demonstrated in a real application, in which a 6K SNP panel was optimized conditional on 5,260 obligatory SNP selected based on SNP-trait association in U.S. Holstein animals. With this MOLO algorithm, both imputation error rate and genomic prediction error rate were minimal. PMID:27583971
Characterization of polyploid wheat genomic diversity using a high-density 90 000 single nucleotide polymorphism array

USDA-ARS?s Scientific Manuscript database

High-density single nucleotide polymorphism (SNP) genotyping chips are a powerful tool for studying genomic patterns of diversity, inferring ancestral relationships among individuals in populations and studying marker-trait associations in mapping experiments. We developed a genotyping array includ...
High-throughput SNP genotyping in the highly heterozygous genome of Eucalyptus: assay success, polymorphism and transferability across species

PubMed Central

2011-01-01

Background High-throughput SNP genotyping has become an essential requirement for molecular breeding and population genomics studies in plant species. Large scale SNP developments have been reported for several mainstream crops. A growing interest now exists to expand the speed and resolution of genetic analysis to outbred species with highly heterozygous genomes. When nucleotide diversity is high, a refined diagnosis of the target SNP sequence context is needed to convert queried SNPs into high-quality genotypes using the Golden Gate Genotyping Technology (GGGT). This issue becomes exacerbated when attempting to transfer SNPs across species, a scarcely explored topic in plants, and likely to become significant for population genomics and inter specific breeding applications in less domesticated and less funded plant genera. Results We have successfully developed the first set of 768 SNPs assayed by the GGGT for the highly heterozygous genome of Eucalyptus from a mixed Sanger/454 database with 1,164,695 ESTs and the preliminary 4.5X draft genome sequence for E. grandis. A systematic assessment of in silico SNP filtering requirements showed that stringent constraints on the SNP surrounding sequences have a significant impact on SNP genotyping performance and polymorphism. SNP assay success was high for the 288 SNPs selected with more rigorous in silico constraints; 93% of them provided high quality genotype calls and 71% of them were polymorphic in a diverse panel of 96 individuals of five different species. SNP reliability was high across nine Eucalyptus species belonging to three sections within subgenus Symphomyrtus and still satisfactory across species of two additional subgenera, although polymorphism declined as phylogenetic distance increased. Conclusions This study indicates that the GGGT performs well both within and across species of Eucalyptus notwithstanding its nucleotide diversity ≥2%. The development of a much larger array of informative SNPs across multiple Eucalyptus species is feasible, although strongly dependent on having a representative and sufficiently deep collection of sequences from many individuals of each target species. A higher density SNP platform will be instrumental to undertake genome-wide phylogenetic and population genomics studies and to implement molecular breeding by Genomic Selection in Eucalyptus. PMID:21492434
A SNP genotyping array for hexaploid oat

USDA-ARS?s Scientific Manuscript database

Recognizing a need in cultivated hexaploid oat (Avena sativa L.) for a reliable set of reference SNPs, we have developed a 6K BeadChip design containing 257 Infinium I and 5,486 Infinium II designs corresponding to 5,743 SNPs. Of those, 4,975 SNPs yielded successful assays after array manufacturing...
Comparative genome-wide mapping versus extreme pool-genotyping and development of diagnostic SNP markers linked to QTL for adult plant resistance to stripe rust in common wheat.

PubMed

Wu, Jianhui; Huang, Shuo; Zeng, Qingdong; Liu, Shengjie; Wang, Qilin; Mu, Jingmei; Yu, Shizhou; Han, Dejun; Kang, Zhensheng

2018-06-16

A major stripe rust resistance QTL on chromosome 4BL was localized to a 4.5-Mb interval using comparative QTL mapping methods and validated in 276 wheat genotypes by haplotype analysis. CYMMIT-derived wheat line P10103 was previously identified to have adult plant resistance (APR) to stripe rust in the greenhouse and field. The conventional approach for QTL mapping in common wheat is laborious. Here, we performed QTL detection of APR using a combination of genome-wide scanning and extreme pool-genotyping. SNP-based genetic maps were constructed using the Wheat55 K SNP array to genotype a recombinant inbred line (RIL) population derived from the cross Mingxian 169 × P10103. Five stable QTL were detected across multiple environments. A fter comparing SNP profiles from contrasting, extreme DNA pools of RILs six putative QTL were located to approximate chromosome positions. A major QTL on chromosome 4B was identified in F 2:4 contrasting pools from cross Zhengmai 9023 × P10103. A consensus QTL (LOD = 26-40, PVE = 42-55%), named QYr.nwafu-4BL, was defined and localized to a 4.5-Mb interval flanked by SNP markers AX-110963704 and AX-110519862 in chromosome arm 4BL. Based on stripe rust response, marker genotypes, pedigree analysis and mapping data, QYr.nwafu-4BL is likely to be a new APR QTL. The applicability of the SNP-based markers flanking QYr.nwafu-4BL was validated on a diversity panel of 276 wheat lines. The additional minor QTL on chromosomes 4A, 5A, 5B and 6A enhanced the level of resistance conferred by QYr.nwafu-4BL. Marker-assisted pyramiding of QYr.nwafu-4BL and other favorable minor QTL in new wheat cultivars should improve the level of APR to stripe rust.
Re-Ranking Sequencing Variants in the Post-GWAS Era for Accurate Causal Variant Identification

PubMed Central

Faye, Laura L.; Machiela, Mitchell J.; Kraft, Peter; Bull, Shelley B.; Sun, Lei

2013-01-01

Next generation sequencing has dramatically increased our ability to localize disease-causing variants by providing base-pair level information at costs increasingly feasible for the large sample sizes required to detect complex-trait associations. Yet, identification of causal variants within an established region of association remains a challenge. Counter-intuitively, certain factors that increase power to detect an associated region can decrease power to localize the causal variant. First, combining GWAS with imputation or low coverage sequencing to achieve the large sample sizes required for high power can have the unintended effect of producing differential genotyping error among SNPs. This tends to bias the relative evidence for association toward better genotyped SNPs. Second, re-use of GWAS data for fine-mapping exploits previous findings to ensure genome-wide significance in GWAS-associated regions. However, using GWAS findings to inform fine-mapping analysis can bias evidence away from the causal SNP toward the tag SNP and SNPs in high LD with the tag. Together these factors can reduce power to localize the causal SNP by more than half. Other strategies commonly employed to increase power to detect association, namely increasing sample size and using higher density genotyping arrays, can, in certain common scenarios, actually exacerbate these effects and further decrease power to localize causal variants. We develop a re-ranking procedure that accounts for these adverse effects and substantially improves the accuracy of causal SNP identification, often doubling the probability that the causal SNP is top-ranked. Application to the NCI BPC3 aggressive prostate cancer GWAS with imputation meta-analysis identified a new top SNP at 2 of 3 associated loci and several additional possible causal SNPs at these loci that may have otherwise been overlooked. This method is simple to implement using R scripts provided on the author's website. PMID:23950724
Rapid discovery of SNPs differentiating hatchery steelhead trout from ESA-listed natural-origin steelhead trout using a 57K SNP array

USGS Publications Warehouse

Larson, Wesley; Palti, Yniv; Gao, G.; Warheit, Kenneth I.; Seeb, James E.

2017-01-01

Natural-origin steelhead trout (Oncorhynchus mykiss (Walbaum, 1792)) in the Pacific Northwest, USA, are threatened by a number of factors including habitat destruction, disease, decline in marine survival, and a potential erosion of genetic viability due to introgression from hatchery strains. Our major goal was to use a recently developed SNP array containing ∼57 000 SNPs to identify a subset of SNPs that differentiate hatchery and natural-origin populations. We analyzed 35 765 polymorphic SNPs in nine populations of steelhead trout sampled from Puget Sound, Washington, USA. We then conducted two outlier tests and found 360 loci that were candidates for divergent selection between hatchery and natural-origin populations (mean FCT = 0.29, maximum = 0.65) and 595 SNPs that were candidates for selection among natural-origin populations (mean FST = 0.25, maximum = 0.51). Comparisons with a linkage map revealed that two chromosomes (Omy05 and Omy25) contained significantly more outliers than other chromosomes, suggesting that regions on Omy05 and Omy25 may be of adaptive significance. Our results highlight several advantages of the 57 000 SNP array as a tool for population and conservation genomics studies.

Estimation of linkage disequilibrium and interspecific gene flow in Ficedula flycatchers by a newly developed 50k single-nucleotide polymorphism array

PubMed Central

Kawakami, Takeshi; Backström, Niclas; Burri, Reto; Husby, Arild; Olason, Pall; Rice, Amber M; Ålund, Murielle; Qvarnström, Anna; Ellegren, Hans

2014-01-01

With the access to draft genome sequence assemblies and whole-genome resequencing data from population samples, molecular ecology studies will be able to take truly genome-wide approaches. This now applies to an avian model system in ecological and evolutionary research: Old World flycatchers of the genus Ficedula, for which we recently obtained a 1.1 Gb collared flycatcher genome assembly and identified 13 million single-nucleotide polymorphism (SNP)s in population resequencing of this species and its sister species, pied flycatcher. Here, we developed a custom 50K Illumina iSelect flycatcher SNP array with markers covering 30 autosomes and the Z chromosome. Using a number of selection criteria for inclusion in the array, both genotyping success rate and polymorphism information content (mean marker heterozygosity = 0.41) were high. We used the array to assess linkage disequilibrium (LD) and hybridization in flycatchers. Linkage disequilibrium declined quickly to the background level at an average distance of 17 kb, but the extent of LD varied markedly within the genome and was more than 10-fold higher in ‘genomic islands’ of differentiation than in the rest of the genome. Genetic ancestry analysis identified 33 F1 hybrids but no later-generation hybrids from sympatric populations of collared flycatchers and pied flycatchers, contradicting earlier reports of backcrosses identified from much fewer number of markers. With an estimated divergence time as recently as <1 Ma, this suggests strong selection against F1 hybrids and unusually rapid evolution of reproductive incompatibility in an avian system. PMID:24784959
Discovery of novel variants in genotyping arrays improves genotype retention and reduces ascertainment bias

PubMed Central

2012-01-01

Background High-density genotyping arrays that measure hybridization of genomic DNA fragments to allele-specific oligonucleotide probes are widely used to genotype single nucleotide polymorphisms (SNPs) in genetic studies, including human genome-wide association studies. Hybridization intensities are converted to genotype calls by clustering algorithms that assign each sample to a genotype class at each SNP. Data for SNP probes that do not conform to the expected pattern of clustering are often discarded, contributing to ascertainment bias and resulting in lost information - as much as 50% in a recent genome-wide association study in dogs. Results We identified atypical patterns of hybridization intensities that were highly reproducible and demonstrated that these patterns represent genetic variants that were not accounted for in the design of the array platform. We characterized variable intensity oligonucleotide (VINO) probes that display such patterns and are found in all hybridization-based genotyping platforms, including those developed for human, dog, cattle, and mouse. When recognized and properly interpreted, VINOs recovered a substantial fraction of discarded probes and counteracted SNP ascertainment bias. We developed software (MouseDivGeno) that identifies VINOs and improves the accuracy of genotype calling. MouseDivGeno produced highly concordant genotype calls when compared with other methods but it uniquely identified more than 786000 VINOs in 351 mouse samples. We used whole-genome sequence from 14 mouse strains to confirm the presence of novel variants explaining 28000 VINOs in those strains. We also identified VINOs in human HapMap 3 samples, many of which were specific to an African population. Incorporating VINOs in phylogenetic analyses substantially improved the accuracy of a Mus species tree and local haplotype assignment in laboratory mouse strains. Conclusion The problems of ascertainment bias and missing information due to genotyping errors are widely recognized as limiting factors in genetic studies. We have conducted the first formal analysis of the effect of novel variants on genotyping arrays, and we have shown that these variants account for a large portion of miscalled and uncalled genotypes. Genetic studies will benefit from substantial improvements in the accuracy of their results by incorporating VINOs in their analyses. PMID:22260749
Association, effects and validation of polymorphisms within the NCAPG - LCORL locus located on BTA6 with feed intake, gain, meat and carcass traits in beef cattle

USDA-ARS?s Scientific Manuscript database

Background: In a previously reported genome-wide association study based on a high-density bovine SNP genotyping array, 8 SNP were nominally associated (P
Development and implementation of a highly-multiplexed SNP array for genetic mapping in maritime pine and comparative mapping with loblolly pine

PubMed Central

2011-01-01

Background Single nucleotide polymorphisms (SNPs) are the most abundant source of genetic variation among individuals of a species. New genotyping technologies allow examining hundreds to thousands of SNPs in a single reaction for a wide range of applications such as genetic diversity analysis, linkage mapping, fine QTL mapping, association studies, marker-assisted or genome-wide selection. In this paper, we evaluated the potential of highly-multiplexed SNP genotyping for genetic mapping in maritime pine (Pinus pinaster Ait.), the main conifer used for commercial plantation in southwestern Europe. Results We designed a custom GoldenGate assay for 1,536 SNPs detected through the resequencing of gene fragments (707 in vitro SNPs/Indels) and from Sanger-derived Expressed Sequenced Tags assembled into a unigene set (829 in silico SNPs/Indels). Offspring from three-generation outbred (G2) and inbred (F2) pedigrees were genotyped. The success rate of the assay was 63.6% and 74.8% for in silico and in vitro SNPs, respectively. A genotyping error rate of 0.4% was further estimated from segregating data of SNPs belonging to the same gene. Overall, 394 SNPs were available for mapping. A total of 287 SNPs were integrated with previously mapped markers in the G2 parental maps, while 179 SNPs were localized on the map generated from the analysis of the F2 progeny. Based on 98 markers segregating in both pedigrees, we were able to generate a consensus map comprising 357 SNPs from 292 different loci. Finally, the analysis of sequence homology between mapped markers and their orthologs in a Pinus taeda linkage map, made it possible to align the 12 linkage groups of both species. Conclusions Our results show that the GoldenGate assay can be used successfully for high-throughput SNP genotyping in maritime pine, a conifer species that has a genome seven times the size of the human genome. This SNP-array will be extended thanks to recent sequencing effort using new generation sequencing technologies and will include SNPs from comparative orthologous sequences that were identified in the present study, providing a wider collection of anchor points for comparative genomics among the conifers. PMID:21767361
Clonal diversity analysis using SNP microarray: a new prognostic tool for chronic lymphocytic leukemia.

PubMed

Zhang, Linsheng; Znoyko, Iya; Costa, Luciano J; Conlin, Laura K; Daber, Robert D; Self, Sally E; Wolff, Daynna J

2011-12-01

Chronic lymphocytic leukemia (CLL) is a clinically heterogeneous disease. The methods currently used for monitoring CLL and determining conditions for treatment are limited in their ability to predict disease progression, patient survival, and response to therapy. Although clonal diversity and the acquisition of new chromosomal abnormalities during the disease course (clonal evolution) have been associated with disease progression, their prognostic potential has been underappreciated because cytogenetic and fluorescence in situ hybridization (FISH) studies have a restricted ability to detect genomic abnormalities and clonal evolution. We hypothesized that whole genome analysis using high resolution single nucleotide polymorphism (SNP) microarrays would be useful to detect diversity and infer clonal evolution to offer prognostic information. In this study, we used the Infinium Omni1 BeadChip (Illumina, San Diego, CA) array for the analysis of genetic variation and percent mosaicism in 25 non-selected CLL patients to explore the prognostic value of the assessment of clonal diversity in patients with CLL. We calculated the percentage of mosaicism for each abnormality by applying a mathematical algorithm to the genotype frequency data and by manual determination using the Simulated DNA Copy Number (SiDCoN) tool, which was developed from a computer model of mosaicism. At least one genetic abnormality was identified in each case, and the SNP data was 98% concordant with FISH results. Clonal diversity, defined as the presence of two or more genetic abnormalities with differing percentages of mosaicism, was observed in 12 patients (48%), and the diversity correlated with the disease stage. Clonal diversity was present in most cases of advanced disease (Rai stages III and IV) or those with previous treatment, whereas 9 of 13 patients without detected clonal diversity were asymptomatic or clinically stable. In conclusion, SNP microarray studies with simultaneous evaluation of genomic alterations and mosaic distribution of clones can be used to assess apparent clonal evolution via analysis of clonal diversity. Since clonal evolution in CLL is strongly correlated with disease progression, whole genome SNP microarray analysis provides a new comprehensive and reliable prognostic tool for CLL patients. Copyright © 2011 Elsevier Inc. All rights reserved.
A Conductometric Indium Oxide Semiconducting Nanoparticle Enzymatic Biosensor Array

PubMed Central

Lee, Dongjin; Ondrake, Janet; Cui, Tianhong

2011-01-01

We report a conductometric nanoparticle biosensor array to address the significant variation of electrical property in nanomaterial biosensors due to the random network nature of nanoparticle thin-film. Indium oxide and silica nanoparticles (SNP) are assembled selectively on the multi-site channel area of the resistors using layer-by-layer self-assembly. To demonstrate enzymatic biosensing capability, glucose oxidase is immobilized on the SNP layer for glucose detection. The packaged sensor chip onto a ceramic pin grid array is tested using syringe pump driven feed and multi-channel I–V measurement system. It is successfully demonstrated that glucose is detected in many different sensing sites within a chip, leading to concentration dependent currents. The sensitivity has been found to be dependent on the channel length of the resistor, 4–12 nA/mM for channel lengths of 5–20 μm, while the apparent Michaelis-Menten constant is 20 mM. By using sensor array, analytical data could be obtained with a single step of sample solution feeding. This work sheds light on the applicability of the developed nanoparticle microsensor array to multi-analyte sensors, novel bioassay platforms, and sensing components in a lab-on-a-chip. PMID:22163696
Joint genome-wide association study for milk fatty acid traits in Chinese and Danish Holstein populations.

PubMed

Li, X; Buitenhuis, A J; Lund, M S; Li, C; Sun, D; Zhang, Q; Poulsen, N A; Su, G

2015-11-01

The identification of causal genes or genomic regions associated with fatty acids (FA) will enhance our understanding of the pathways underlying FA synthesis and provide opportunities for changing milk fat composition through a genetic approach. The linkage disequilibrium between adjacent markers is highly consistent between the Chinese and Danish Holstein populations, such that a joint genome-wide association study (GWAS) can be performed. In this study, a joint GWAS was performed for 16 milk FA traits based on data of 784 Chinese and 371 Danish Holstein cows genotyped by a high-density bovine single nucleotide polymorphism (SNP) array. A total of 486,464 SNP markers on 29 bovine autosomes were used. Bonferroni corrections were applied to adjust the significance thresholds for multiple testing at the genome- and chromosome-wide levels. According to the analysis of either the Chinese or Danish data individually, the total numbers of overlapping SNP that were significant at the chromosome level were 94 for C14:1, 208 for the C14 index, and 1 for C18:0. Joint analysis using the combined data of the 2 populations detected greater numbers of significant SNP compared with either of the individual populations alone for 7 and 10 traits at the genome- and chromosome-wide significance levels, respectively. Greater numbers of significant SNP were detected for C18:0 and the C18 index in the Chinese population compared with the joint analysis. Sixty-five significant SNP across all traits had significantly different effects in the 2 populations. Ten FA were influenced by a quantitative trait loci (QTL) region including DGAT1. Both C14:1 and the C14 index were influenced by a QTL region including SCD1 in the combined population. Other QTL regions also showed significant associations with the studied FA. A large region (14.9-24.9 Mbp) in BTA26 significantly influenced C14:1 and the C14 index in both populations, mostly likely due to the SNP in SCD1. A QTL region (69.97-73.69 Mbp) on BTA9 showed a significantly different effect on C18:0 between the 2 populations. Detection of these important SNP and the corresponding QTL regions will be helpful for follow-up studies to identify causal mutations and their interaction with environments for milk FA in dairy cattle. Copyright © 2015 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
[Association between single-nucleotide polymorphisms in the IRAK-4 gene and allergic rhinitis].

PubMed

Zhang, Yuan; Xi, Lin; Zhao, Yan-ming; Zhao, Li-ping; Zhang, Luo

2012-06-01

To investigate the genetic association pattern between single-nucleotide polymorphisms (SNP) in the interleukin-1 receptor-associated kinase 4 (IRAK-4) gene and allergic rhinitis (AR). A population of 379 patients with the diagnosis of AR and 333 healthy controls who lived in Beijing region was recruited. A total of 8 reprehensive marker SNP which were in IRAK-4 gene region were selected according to the Beijing people database from Hapmap website. The individual genotyping was performed by MassARRAY platform. SPSS 13.0 software was used for statistic analysis. Subgroup analysis for the presence of different allergen sensitivities displayed associations only in the house dust mite-allergic cohorts (rs3794262: P = 0.0034, OR = 1.7388; rs4251481: P = 0.0023, OR = 2.6593), but not in subjects who were allergic to pollens as well as mix allergens. The potential genetic contribution of the IRAK-4 gene to AR demonstrated an allergen-dependant association pattern in Chinese population.
Mismatch and G-Stack Modulated Probe Signals on SNP Microarrays

PubMed Central

Binder, Hans; Fasold, Mario; Glomb, Torsten

2009-01-01

Background Single nucleotide polymorphism (SNP) arrays are important tools widely used for genotyping and copy number estimation. This technology utilizes the specific affinity of fragmented DNA for binding to surface-attached oligonucleotide DNA probes. We analyze the variability of the probe signals of Affymetrix GeneChip SNP arrays as a function of the probe sequence to identify relevant sequence motifs which potentially cause systematic biases of genotyping and copy number estimates. Methodology/Principal Findings The probe design of GeneChip SNP arrays enables us to disentangle different sources of intensity modulations such as the number of mismatches per duplex, matched and mismatched base pairings including nearest and next-nearest neighbors and their position along the probe sequence. The effect of probe sequence was estimated in terms of triple-motifs with central matches and mismatches which include all 256 combinations of possible base pairings. The probe/target interactions on the chip can be decomposed into nearest neighbor contributions which correlate well with free energy terms of DNA/DNA-interactions in solution. The effect of mismatches is about twice as large as that of canonical pairings. Runs of guanines (G) and the particular type of mismatched pairings formed in cross-allelic probe/target duplexes constitute sources of systematic biases of the probe signals with consequences for genotyping and copy number estimates. The poly-G effect seems to be related to the crowded arrangement of probes which facilitates complex formation of neighboring probes with at minimum three adjacent G's in their sequence. Conclusions The applied method of “triple-averaging” represents a model-free approach to estimate the mean intensity contributions of different sequence motifs which can be applied in calibration algorithms to correct signal values for sequence effects. Rules for appropriate sequence corrections are suggested. PMID:19924253
LGI1 microdeletion in autosomal dominant lateral temporal epilepsy

PubMed Central

Fanciulli, M.; Santulli, L.; Errichiello, L.; Barozzi, C.; Tomasi, L.; Rigon, L.; Cubeddu, T.; de Falco, A.; Rampazzo, A.; Michelucci, R.; Uzzau, S.; Striano, S.; de Falco, F.A.; Striano, P.

2012-01-01

Objectives: To characterize clinically and genetically a family with autosomal dominant lateral temporal epilepsy (ADLTE) negative to LGI1 exon sequencing test. Methods: All participants were personally interviewed and underwent neurologic examination. Most affected subjects underwent EEG and neuroradiologic examinations (CT/MRI). Available family members were genotyped with the HumanOmni1-Quad v1.0 single nucleotide polymorphism (SNP) array beadchip and copy number variations (CNVs) were analyzed in each subject. LGI1 gene dosage was performed by real-time quantitative PCR (qPCR). Results: The family had 8 affected members (2 deceased) over 3 generations. All of them showed GTC seizures, with focal onset in 6 and unknown onset in 2. Four patients had focal seizures with auditory features. EEG showed only minor sharp abnormalities in 3 patients and MRI was unremarkable in all the patients examined. Three family members presented major depression and anxiety symptoms. Routine LGI1 exon sequencing revealed no point mutation. High-density SNP array CNV analysis identified a genomic microdeletion about 81 kb in size encompassing the first 4 exons of LGI1 in all available affected members and in 2 nonaffected carriers, which was confirmed by qPCR analysis. Conclusions: This is the first microdeletion affecting LGI1 identified in ADLTE. Families with ADLTE in which no point mutations are revealed by direct exon sequencing should be screened for possible genomic deletion mutations by CNV analysis or other appropriate methods. Overall, CNV analysis of multiplex families may be useful for identifying microdeletions in novel disease genes. PMID:22496201
SEURAT: visual analytics for the integrated analysis of microarray data.

PubMed

Gribov, Alexander; Sill, Martin; Lück, Sonja; Rücker, Frank; Döhner, Konstanze; Bullinger, Lars; Benner, Axel; Unwin, Antony

2010-06-03

In translational cancer research, gene expression data is collected together with clinical data and genomic data arising from other chip based high throughput technologies. Software tools for the joint analysis of such high dimensional data sets together with clinical data are required. We have developed an open source software tool which provides interactive visualization capability for the integrated analysis of high-dimensional gene expression data together with associated clinical data, array CGH data and SNP array data. The different data types are organized by a comprehensive data manager. Interactive tools are provided for all graphics: heatmaps, dendrograms, barcharts, histograms, eventcharts and a chromosome browser, which displays genetic variations along the genome. All graphics are dynamic and fully linked so that any object selected in a graphic will be highlighted in all other graphics. For exploratory data analysis the software provides unsupervised data analytics like clustering, seriation algorithms and biclustering algorithms. The SEURAT software meets the growing needs of researchers to perform joint analysis of gene expression, genomical and clinical data.
Genotype imputation in the domestic dog

PubMed Central

Meurs, K. M.

2016-01-01

Application of imputation methods to accurately predict a dense array of SNP genotypes in the dog could provide an important supplement to current analyses of array-based genotyping data. Here, we developed a reference panel of 4,885,283 SNPs in 83 dogs across 15 breeds using whole genome sequencing. We used this panel to predict the genotypes of 268 dogs across three breeds with 84,193 SNP array-derived genotypes as inputs. We then (1) performed breed clustering of the actual and imputed data; (2) evaluated several reference panel breed combinations to determine an optimal reference panel composition; and (3) compared the accuracy of two commonly used software algorithms (Beagle and IMPUTE2). Breed clustering was well preserved in the imputation process across eigenvalues representing 75 % of the variation in the imputed data. Using Beagle with a target panel from a single breed, genotype concordance was highest using a multi-breed reference panel (92.4 %) compared to a breed-specific reference panel (87.0 %) or a reference panel containing no breeds overlapping with the target panel (74.9 %). This finding was confirmed using target panels derived from two other breeds. Additionally, using the multi-breed reference panel, genotype concordance was slightly higher with IMPUTE2 (94.1 %) compared to Beagle; Pearson correlation coefficients were slightly higher for both software packages (0.946 for Beagle, 0.961 for IMPUTE2). Our findings demonstrate that genotype imputation from SNP array-derived data to whole genome-level genotypes is both feasible and accurate in the dog with appropriate breed overlap between the target and reference panels. PMID:27129452
Calling Chromosome Alterations, DNA Methylation Statuses, and Mutations in Tumors by Simple Targeted Next-Generation Sequencing: A Solution for Transferring Integrated Pangenomic Studies into Routine Practice?

PubMed

Garinet, Simon; Néou, Mario; de La Villéon, Bruno; Faillot, Simon; Sakat, Julien; Da Fonseca, Juliana P; Jouinot, Anne; Le Tourneau, Christophe; Kamal, Maud; Luscap-Rondof, Windy; Boeva, Valentina; Gaujoux, Sebastien; Vidaud, Michel; Pasmant, Eric; Letourneur, Franck; Bertherat, Jérôme; Assié, Guillaume

2017-09-01

Pangenomic studies identified distinct molecular classes for many cancers, with major clinical applications. However, routine use requires cost-effective assays. We assessed whether targeted next-generation sequencing (NGS) could call chromosomal alterations and DNA methylation status. A training set of 77 tumors and a validation set of 449 (43 tumor types) were analyzed by targeted NGS and single-nucleotide polymorphism (SNP) arrays. Thirty-two tumors were analyzed by NGS after bisulfite conversion, and compared to methylation array or methylation-specific multiplex ligation-dependent probe amplification. Considering allelic ratios, correlation was strong between targeted NGS and SNP arrays (r = 0.88). In contrast, considering DNA copy number, for variations of one DNA copy, correlation was weaker between read counts and SNP array (r = 0.49). Thus, we generated TARGOMICs, optimized for detecting chromosome alterations by combining allelic ratios and read counts generated by targeted NGS. Sensitivity for calling normal, lost, and gained chromosomes was 89%, 72%, and 31%, respectively. Specificity was 81%, 93%, and 98%, respectively. These results were confirmed in the validation set. Finally, TARGOMICs could efficiently align and compute proportions of methylated cytosines from bisulfite-converted DNA from targeted NGS. In conclusion, beyond calling mutations, targeted NGS efficiently calls chromosome alterations and methylation status in tumors. A single run and minor design/protocol adaptations are sufficient. Optimizing targeted NGS should expand translation of genomics to clinical routine. Copyright © 2017 American Society for Investigative Pathology and the Association for Molecular Pathology. Published by Elsevier Inc. All rights reserved.
A ddRAD Based Linkage Map of the Cultivated Strawberry, Fragaria xananassa

PubMed Central

Davik, Jahn; Sargent, Daniel James; Brurberg, May Bente; Lien, Sigbjørn; Kent, Matthew; Alsheikh, Muath

2015-01-01

The cultivated strawberry (Fragaria ×ananassa Duch.) is an allo-octoploid considered difficult to disentangle genetically due to its four relatively similar sub-genomic chromosome sets. This has been alleviated by the recent release of the strawberry IStraw90 whole genome genotyping array. However, array resolution relies on the genotypes used in the array construction and may be of limited general use. SNP detection based on reduced genomic sequencing approaches has the potential of providing better coverage in cases where the studied genotypes are only distantly related from the SNP array’s construction foundation. Here we have used double digest restriction-associated DNA sequencing (ddRAD) to identify SNPs in a 145 seedling F1 hybrid population raised from the cross between the cultivars Sonata (♀) and Babette (♂). A linkage map containing 907 markers which spanned 1,581.5 cM across 31 linkage groups representing the 28 chromosomes of the species. Comparing the physical span of the SNP markers with the F. vesca genome sequence, the linkage groups resolved covered 79% of the estimated 830 Mb of the F. ×ananassa genome. Here, we have developed the first linkage map for F. ×ananassa using ddRAD and show that this technique and other related techniques are useful tools for linkage map development and downstream genetic studies in the octoploid strawberry. PMID:26398886
Genomewide single nucleotide polymorphism discovery in Atlantic salmon (Salmo salar): validation in wild and farmed American and European populations.

PubMed

Yáñez, J M; Naswa, S; López, M E; Bassini, L; Correa, K; Gilbey, J; Bernatchez, L; Norris, A; Neira, R; Lhorente, J P; Schnable, P S; Newman, S; Mileham, A; Deeb, N; Di Genova, A; Maass, A

2016-07-01

A considerable number of single nucleotide polymorphisms (SNPs) are required to elucidate genotype-phenotype associations and determine the molecular basis of important traits. In this work, we carried out de novo SNP discovery accounting for both genome duplication and genetic variation from American and European salmon populations. A total of 9 736 473 nonredundant SNPs were identified across a set of 20 fish by whole-genome sequencing. After applying six bioinformatic filtering steps, 200 K SNPs were selected to develop an Affymetrix Axiom(®) myDesign Custom Array. This array was used to genotype 480 fish representing wild and farmed salmon from Europe, North America and Chile. A total of 159 099 (79.6%) SNPs were validated as high quality based on clustering properties. A total of 151 509 validated SNPs showed a unique position in the genome. When comparing these SNPs against 238 572 markers currently available in two other Atlantic salmon arrays, only 4.6% of the SNP overlapped with the panel developed in this study. This novel high-density SNP panel will be very useful for the dissection of economically and ecologically relevant traits, enhancing breeding programmes through genomic selection as well as supporting genetic studies in both wild and farmed populations of Atlantic salmon using high-resolution genomewide information. © 2016 John Wiley & Sons Ltd.
Development of a 63K SNP array for Gossypium and high-density mapping of intra- and inter-specific populations of cotton (G. hirsutum L.)

USDA-ARS?s Scientific Manuscript database

High-throughput genotyping arrays provide a standardized resource for crop research communities that are useful for a breadth of applications including high-density genetic mapping, genome-wide association studies (GWAS), genomic selection (GS), candidate marker and quantitative trait loci (QTL) ide...
Development and evaluation of the Axiom® IStraw35 384HT array for the allo-octoploid cultivated strawberry Fragaria ×ananassa

USDA-ARS?s Scientific Manuscript database

The Axiom® IStraw90 SNP (single nucleotide polymorphism) array was developed to enable high-throughput genotyping in allo-octoploid cultivated strawberry (Fragaria ×ananassa). However, high cost ($80-105 per sample) limits throughput for certain applications. On average the IStraw90 has yielded 50% ...
Development and evaluation of a high density genotyping 'Axiom_Arachis' array with 58K SNPs for accelerating genetics and breeding in groundnut

USDA-ARS?s Scientific Manuscript database

Single nucleotide polymorphisms (SNPs) are the most abundant DNA sequence variation in the genomes which can be used to associate genotypic variation to the phenotype. Therefore, availability of a high-density SNP array with uniform genome coverage can advance genetic studies and breeding applicatio...
Noninvasive Prenatal Paternity Testing (NIPAT) through Maternal Plasma DNA Sequencing: A Pilot Study.

PubMed

Jiang, Haojun; Xie, Yifan; Li, Xuchao; Ge, Huijuan; Deng, Yongqiang; Mu, Haofang; Feng, Xiaoli; Yin, Lu; Du, Zhou; Chen, Fang; He, Nongyue

2016-01-01

Short tandem repeats (STRs) and single nucleotide polymorphisms (SNPs) have been already used to perform noninvasive prenatal paternity testing from maternal plasma DNA. The frequently used technologies were PCR followed by capillary electrophoresis and SNP typing array, respectively. Here, we developed a noninvasive prenatal paternity testing (NIPAT) based on SNP typing with maternal plasma DNA sequencing. We evaluated the influence factors (minor allele frequency (MAF), the number of total SNP, fetal fraction and effective sequencing depth) and designed three different selective SNP panels in order to verify the performance in clinical cases. Combining targeted deep sequencing of selective SNP and informative bioinformatics pipeline, we calculated the combined paternity index (CPI) of 17 cases to determine paternity. Sequencing-based NIPAT results fully agreed with invasive prenatal paternity test using STR multiplex system. Our study here proved that the maternal plasma DNA sequencing-based technology is feasible and accurate in determining paternity, which may provide an alternative in forensic application in the future.
Fine scale mapping of the 17q22 breast cancer locus using dense SNPs, genotyped within the Collaborative Oncological Gene-Environment Study (COGs)

PubMed Central

Darabi, Hatef; Beesley, Jonathan; Droit, Arnaud; Kar, Siddhartha; Nord, Silje; Moradi Marjaneh, Mahdi; Soucy, Penny; Michailidou, Kyriaki; Ghoussaini, Maya; Fues Wahl, Hanna; Bolla, Manjeet K.; Wang, Qin; Dennis, Joe; Alonso, M. Rosario; Andrulis, Irene L.; Anton-Culver, Hoda; Arndt, Volker; Beckmann, Matthias W.; Benitez, Javier; Bogdanova, Natalia V.; Bojesen, Stig E.; Brauch, Hiltrud; Brenner, Hermann; Broeks, Annegien; Brüning, Thomas; Burwinkel, Barbara; Chang-Claude, Jenny; Choi, Ji-Yeob; Conroy, Don M.; Couch, Fergus J.; Cox, Angela; Cross, Simon S.; Czene, Kamila; Devilee, Peter; Dörk, Thilo; Easton, Douglas F.; Fasching, Peter A.; Figueroa, Jonine; Fletcher, Olivia; Flyger, Henrik; Galle, Eva; García-Closas, Montserrat; Giles, Graham G.; Goldberg, Mark S.; González-Neira, Anna; Guénel, Pascal; Haiman, Christopher A.; Hallberg, Emily; Hamann, Ute; Hartman, Mikael; Hollestelle, Antoinette; Hopper, John L.; Ito, Hidemi; Jakubowska, Anna; Johnson, Nichola; Kang, Daehee; Khan, Sofia; Kosma, Veli-Matti; Kriege, Mieke; Kristensen, Vessela; Lambrechts, Diether; Le Marchand, Loic; Lee, Soo Chin; Lindblom, Annika; Lophatananon, Artitaya; Lubinski, Jan; Mannermaa, Arto; Manoukian, Siranoush; Margolin, Sara; Matsuo, Keitaro; Mayes, Rebecca; McKay, James; Meindl, Alfons; Milne, Roger L.; Muir, Kenneth; Neuhausen, Susan L.; Nevanlinna, Heli; Olswold, Curtis; Orr, Nick; Peterlongo, Paolo; Pita, Guillermo; Pylkäs, Katri; Rudolph, Anja; Sangrajrang, Suleeporn; Sawyer, Elinor J.; Schmidt, Marjanka K.; Schmutzler, Rita K.; Seynaeve, Caroline; Shah, Mitul; Shen, Chen-Yang; Shu, Xiao-Ou; Southey, Melissa C.; Stram, Daniel O.; Surowy, Harald; Swerdlow, Anthony; Teo, Soo H.; Tessier, Daniel C.; Tomlinson, Ian; Torres, Diana; Truong, Thérèse; Vachon, Celine M.; Vincent, Daniel; Winqvist, Robert; Wu, Anna H.; Wu, Pei-Ei; Yip, Cheng Har; Zheng, Wei; Pharoah, Paul D. P.; Hall, Per; Edwards, Stacey L.; Simard, Jacques; French, Juliet D.; Chenevix-Trench, Georgia; Dunning, Alison M.

2016-01-01

Genome-wide association studies have found SNPs at 17q22 to be associated with breast cancer risk. To identify potential causal variants related to breast cancer risk, we performed a high resolution fine-mapping analysis that involved genotyping 517 SNPs using a custom Illumina iSelect array (iCOGS) followed by imputation of genotypes for 3,134 SNPs in more than 89,000 participants of European ancestry from the Breast Cancer Association Consortium (BCAC). We identified 28 highly correlated common variants, in a 53 Kb region spanning two introns of the STXBP4 gene, that are strong candidates for driving breast cancer risk (lead SNP rs2787486 (OR = 0.92; CI 0.90–0.94; P = 8.96 × 10−15)) and are correlated with two previously reported risk-associated variants at this locus, SNPs rs6504950 (OR = 0.94, P = 2.04 × 10−09, r2 = 0.73 with lead SNP) and rs1156287 (OR = 0.93, P = 3.41 × 10−11, r2 = 0.83 with lead SNP). Analyses indicate only one causal SNP in the region and several enhancer elements targeting STXBP4 are located within the 53 kb association signal. Expression studies in breast tumor tissues found SNP rs2787486 to be associated with increased STXBP4 expression, suggesting this may be a target gene of this locus. PMID:27600471

Fine scale mapping of the 17q22 breast cancer locus using dense SNPs, genotyped within the Collaborative Oncological Gene-Environment Study (COGs).

PubMed

Darabi, Hatef; Beesley, Jonathan; Droit, Arnaud; Kar, Siddhartha; Nord, Silje; Moradi Marjaneh, Mahdi; Soucy, Penny; Michailidou, Kyriaki; Ghoussaini, Maya; Fues Wahl, Hanna; Bolla, Manjeet K; Wang, Qin; Dennis, Joe; Alonso, M Rosario; Andrulis, Irene L; Anton-Culver, Hoda; Arndt, Volker; Beckmann, Matthias W; Benitez, Javier; Bogdanova, Natalia V; Bojesen, Stig E; Brauch, Hiltrud; Brenner, Hermann; Broeks, Annegien; Brüning, Thomas; Burwinkel, Barbara; Chang-Claude, Jenny; Choi, Ji-Yeob; Conroy, Don M; Couch, Fergus J; Cox, Angela; Cross, Simon S; Czene, Kamila; Devilee, Peter; Dörk, Thilo; Easton, Douglas F; Fasching, Peter A; Figueroa, Jonine; Fletcher, Olivia; Flyger, Henrik; Galle, Eva; García-Closas, Montserrat; Giles, Graham G; Goldberg, Mark S; González-Neira, Anna; Guénel, Pascal; Haiman, Christopher A; Hallberg, Emily; Hamann, Ute; Hartman, Mikael; Hollestelle, Antoinette; Hopper, John L; Ito, Hidemi; Jakubowska, Anna; Johnson, Nichola; Kang, Daehee; Khan, Sofia; Kosma, Veli-Matti; Kriege, Mieke; Kristensen, Vessela; Lambrechts, Diether; Le Marchand, Loic; Lee, Soo Chin; Lindblom, Annika; Lophatananon, Artitaya; Lubinski, Jan; Mannermaa, Arto; Manoukian, Siranoush; Margolin, Sara; Matsuo, Keitaro; Mayes, Rebecca; McKay, James; Meindl, Alfons; Milne, Roger L; Muir, Kenneth; Neuhausen, Susan L; Nevanlinna, Heli; Olswold, Curtis; Orr, Nick; Peterlongo, Paolo; Pita, Guillermo; Pylkäs, Katri; Rudolph, Anja; Sangrajrang, Suleeporn; Sawyer, Elinor J; Schmidt, Marjanka K; Schmutzler, Rita K; Seynaeve, Caroline; Shah, Mitul; Shen, Chen-Yang; Shu, Xiao-Ou; Southey, Melissa C; Stram, Daniel O; Surowy, Harald; Swerdlow, Anthony; Teo, Soo H; Tessier, Daniel C; Tomlinson, Ian; Torres, Diana; Truong, Thérèse; Vachon, Celine M; Vincent, Daniel; Winqvist, Robert; Wu, Anna H; Wu, Pei-Ei; Yip, Cheng Har; Zheng, Wei; Pharoah, Paul D P; Hall, Per; Edwards, Stacey L; Simard, Jacques; French, Juliet D; Chenevix-Trench, Georgia; Dunning, Alison M

2016-09-07

Genome-wide association studies have found SNPs at 17q22 to be associated with breast cancer risk. To identify potential causal variants related to breast cancer risk, we performed a high resolution fine-mapping analysis that involved genotyping 517 SNPs using a custom Illumina iSelect array (iCOGS) followed by imputation of genotypes for 3,134 SNPs in more than 89,000 participants of European ancestry from the Breast Cancer Association Consortium (BCAC). We identified 28 highly correlated common variants, in a 53 Kb region spanning two introns of the STXBP4 gene, that are strong candidates for driving breast cancer risk (lead SNP rs2787486 (OR = 0.92; CI 0.90-0.94; P = 8.96 × 10(-15))) and are correlated with two previously reported risk-associated variants at this locus, SNPs rs6504950 (OR = 0.94, P = 2.04 × 10(-09), r(2) = 0.73 with lead SNP) and rs1156287 (OR = 0.93, P = 3.41 × 10(-11), r(2) = 0.83 with lead SNP). Analyses indicate only one causal SNP in the region and several enhancer elements targeting STXBP4 are located within the 53 kb association signal. Expression studies in breast tumor tissues found SNP rs2787486 to be associated with increased STXBP4 expression, suggesting this may be a target gene of this locus.
Seven newly identified loci for autoimmune thyroid disease.

PubMed

Cooper, Jason D; Simmonds, Matthew J; Walker, Neil M; Burren, Oliver; Brand, Oliver J; Guo, Hui; Wallace, Chris; Stevens, Helen; Coleman, Gillian; Franklyn, Jayne A; Todd, John A; Gough, Stephen C L

2012-12-01

Autoimmune thyroid disease (AITD), including Graves' disease (GD) and Hashimoto's thyroiditis (HT), is one of the most common of the immune-mediated diseases. To further investigate the genetic determinants of AITD, we conducted an association study using a custom-made single-nucleotide polymorphism (SNP) array, the ImmunoChip. The SNP array contains all known and genotype-able SNPs across 186 distinct susceptibility loci associated with one or more immune-mediated diseases. After stringent quality control, we analysed 103 875 common SNPs (minor allele frequency >0.05) in 2285 GD and 462 HT patients and 9364 controls. We found evidence for seven new AITD risk loci (P < 1.12 × 10(-6); a permutation test derived significance threshold), five at locations previously associated and two at locations awaiting confirmation, with other immune-mediated diseases.
Exome sequencing and SNP analysis detect novel compound heterozygosity in fatty acid hydroxylase-associated neurodegeneration

PubMed Central

Pierson, Tyler Mark; Simeonov, Dimitre R; Sincan, Murat; Adams, David A; Markello, Thomas; Golas, Gretchen; Fuentes-Fajardo, Karin; Hansen, Nancy F; Cherukuri, Praveen F; Cruz, Pedro; Blackstone, Craig; Tifft, Cynthia; Boerkoel, Cornelius F; Gahl, William A

2012-01-01

Fatty acid hydroxylase-associated neurodegeneration due to fatty acid 2-hydroxylase deficiency presents with a wide range of phenotypes including spastic paraplegia, leukodystrophy, and/or brain iron deposition. All previously described families with this disorder were consanguineous, with homozygous mutations in the probands. We describe a 10-year-old male, from a non-consanguineous family, with progressive spastic paraplegia, dystonia, ataxia, and cognitive decline associated with a sural axonal neuropathy. The use of high-throughput sequencing techniques combined with SNP array analyses revealed a novel paternally derived missense mutation and an overlapping novel maternally derived ∼28-kb genomic deletion in FA2H. This patient provides further insight into the consistent features of this disorder and expands our understanding of its phenotypic presentation. The presence of a sural nerve axonal neuropathy had not been previously associated with this disorder and so may extend the phenotype. PMID:22146942
Characterization of polyploid wheat genomic diversity using a high-density 90 000 single nucleotide polymorphism array

PubMed Central

Wang, Shichen; Wong, Debbie; Forrest, Kerrie; Allen, Alexandra; Chao, Shiaoman; Huang, Bevan E; Maccaferri, Marco; Salvi, Silvio; Milner, Sara G; Cattivelli, Luigi; Mastrangelo, Anna M; Whan, Alex; Stephen, Stuart; Barker, Gary; Wieseke, Ralf; Plieske, Joerg; International Wheat Genome Sequencing Consortium; Lillemo, Morten; Mather, Diane; Appels, Rudi; Dolferus, Rudy; Brown-Guedira, Gina; Korol, Abraham; Akhunova, Alina R; Feuillet, Catherine; Salse, Jerome; Morgante, Michele; Pozniak, Curtis; Luo, Ming-Cheng; Dvorak, Jan; Morell, Matthew; Dubcovsky, Jorge; Ganal, Martin; Tuberosa, Roberto; Lawley, Cindy; Mikoulitch, Ivan; Cavanagh, Colin; Edwards, Keith J; Hayden, Matthew; Akhunov, Eduard

2014-01-01

High-density single nucleotide polymorphism (SNP) genotyping arrays are a powerful tool for studying genomic patterns of diversity, inferring ancestral relationships between individuals in populations and studying marker–trait associations in mapping experiments. We developed a genotyping array including about 90 000 gene-associated SNPs and used it to characterize genetic variation in allohexaploid and allotetraploid wheat populations. The array includes a significant fraction of common genome-wide distributed SNPs that are represented in populations of diverse geographical origin. We used density-based spatial clustering algorithms to enable high-throughput genotype calling in complex data sets obtained for polyploid wheat. We show that these model-free clustering algorithms provide accurate genotype calling in the presence of multiple clusters including clusters with low signal intensity resulting from significant sequence divergence at the target SNP site or gene deletions. Assays that detect low-intensity clusters can provide insight into the distribution of presence–absence variation (PAV) in wheat populations. A total of 46 977 SNPs from the wheat 90K array were genetically mapped using a combination of eight mapping populations. The developed array and cluster identification algorithms provide an opportunity to infer detailed haplotype structure in polyploid wheat and will serve as an invaluable resource for diversity studies and investigating the genetic basis of trait variation in wheat. PMID:24646323
Phenotype variations affect genetic association studies of degenerative disc disease: conclusions of analysis of genetic association of 58 single nucleotide polymorphisms with highly specific phenotypes for disc degeneration in 332 subjects.

PubMed

Rajasekaran, S; Kanna, Rishi Mugesh; Senthil, Natesan; Raveendran, Muthuraja; Cheung, Kenneth M C; Chan, Danny; Subramaniam, Sakthikanal; Shetty, Ajoy Prasad

2013-10-01

Although the influence of genetics on the process of disc degeneration is well recognized, in recently published studies, there is a wide variation in the race and selection criteria for such study populations. More importantly, the radiographic features of disc degeneration that are selected to represent the disc degeneration phenotype are variable in these studies. The study presented here evaluates the association between single nucleotide polymorphisms (SNPs) of candidate genes and three distinct radiographic features that can be defined as the degenerative disc disease (DDD) phenotype. The study objectives were to examine the allelic diversity of 58 SNPs related to 35 candidate genes related to lumbar DDD, to evaluate the association in a hitherto unevaluated ethnic Indian population that represents more than one-sixth of the world population, and to analyze how genetic associations can vary in the same study subjects with the choice of phenotype. A cross-sectional, case-control study of an ethnic Indian population was carried out. Fifty-eight SNPs in 35 potential candidate genes were evaluated in 342 subjects and the associations were analyzed against three highly specific markers for DDD, namely disc degeneration by Pfirrmann grading, end-plate damage evaluated by total end-plate damage score, and annular tears evaluated by disc herniations and hyperintense zones. Genotyping of cases and controls was performed on a genome-wide SNP array to identify potential associated disease loci. The results from the genome-wide SNP array were then used to facilitate SNP selection and genotype validation was conducted using Sequenom-based genotyping. Eleven of the 58 SNPs provided evidence of association with one of the phenotypes. For annular tears, rs1042631 SNP of AGC1 and rs467691 SNP of ADAMTS5 were highly significantly associated (p<.01) and SNPs in NGFB, IL1B, IL18RAP, and MMP10 were also significantly associated (p<.05). The rs4076018 SNP of NGFB was highly significant (p<.01) and rs2292657 SNP of GLI1 was significantly (p<.05) correlated to disc degeneration. For end-plate damage, the rs2252070 SNP of MMP 13 showed a significant association (p<.05). Previously associated genes such as COL 9, SKT, CHST 3, CILP, IGFR, SOXp, BMP, MMP 2-12, ADH2, IL1RN, and COX2 were not significantly associated and new associations (NGFB and GLI1) were identified. The validity of all the associations was found to be phenotype dependent. For the first time, genetic associations with DDD have been performed in an Indian population. Apart from identifying new associations, the highlight of the study was that in the same study population with DDD, SNP associations completely changed when different radiographic features were used to define the DDD phenotype. Our study results therefore indicate that standardization of the phenotypes chosen to study the genetics of disc degeneration is essential and should be strongly considered before planning genetic association studies. Copyright © 2013 Elsevier Inc. All rights reserved.
Transcriptomic SNP discovery for custom genotyping arrays: impacts of sequence data, SNP calling method and genotyping technology on the probability of validation success.

PubMed

Humble, Emily; Thorne, Michael A S; Forcada, Jaume; Hoffman, Joseph I

2016-08-26

Single nucleotide polymorphism (SNP) discovery is an important goal of many studies. However, the number of 'putative' SNPs discovered from a sequence resource may not provide a reliable indication of the number that will successfully validate with a given genotyping technology. For this it may be necessary to account for factors such as the method used for SNP discovery and the type of sequence data from which it originates, suitability of the SNP flanking sequences for probe design, and genomic context. To explore the relative importance of these and other factors, we used Illumina sequencing to augment an existing Roche 454 transcriptome assembly for the Antarctic fur seal (Arctocephalus gazella). We then mapped the raw Illumina reads to the new hybrid transcriptome using BWA and BOWTIE2 before calling SNPs with GATK. The resulting markers were pooled with two existing sets of SNPs called from the original 454 assembly using NEWBLER and SWAP454. Finally, we explored the extent to which SNPs discovered using these four methods overlapped and predicted the corresponding validation outcomes for both Illumina Infinium iSelect HD and Affymetrix Axiom arrays. Collating markers across all discovery methods resulted in a global list of 34,718 SNPs. However, concordance between the methods was surprisingly poor, with only 51.0 % of SNPs being discovered by more than one method and 13.5 % being called from both the 454 and Illumina datasets. Using a predictive modeling approach, we could also show that SNPs called from the Illumina data were on average more likely to successfully validate, as were SNPs called by more than one method. Above and beyond this pattern, predicted validation outcomes were also consistently better for Affymetrix Axiom arrays. Our results suggest that focusing on SNPs called by more than one method could potentially improve validation outcomes. They also highlight possible differences between alternative genotyping technologies that could be explored in future studies of non-model organisms.
The genome-wide structure of two economically important indigenous Sicilian cattle breeds.

PubMed

Mastrangelo, S; Saura, M; Tolone, M; Salces-Ortiz, J; Di Gerlando, R; Bertolini, F; Fontanesi, L; Sardina, M T; Serrano, M; Portolano, B

2014-11-01

Genomic technologies, such as high-throughput genotyping based on SNP arrays, provided background information concerning genome structure in domestic animals. The aim of this work was to investigate the genetic structure, the genome-wide estimates of inbreeding, coancestry, effective population size (Ne), and the patterns of linkage disequilibrium (LD) in 2 economically important Sicilian local cattle breeds, Cinisara (CIN) and Modicana (MOD), using the Illumina Bovine SNP50K v2 BeadChip. To understand the genetic relationship and to place both Sicilian breeds in a global context, genotypes from 134 other domesticated bovid breeds were used. Principal component analysis showed that the Sicilian cattle breeds were closer to individuals of Bos taurus taurus from Eurasia and formed nonoverlapping clusters with other breeds. Between the Sicilian cattle breeds, MOD was the most differentiated, whereas the animals belonging to the CIN breed showed a lower value of assignment, the presence of substructure, and genetic links with the MOD breed. The average molecular inbreeding and coancestry coefficients were moderately high, and the current estimates of Ne were low in both breeds. These values indicated a low genetic variability. Considering levels of LD between adjacent markers, the average r(2) in the MOD breed was comparable to those reported for others cattle breeds, whereas CIN showed a lower value. Therefore, these results support the need of more dense SNP arrays for a high-power association mapping and genomic selection efficiency, particularly for the CIN cattle breed. Controlling molecular inbreeding and coancestry would restrict inbreeding depression, the probability of losing beneficial rare alleles, and therefore the risk of extinction. The results generated from this study have important implications for the development of conservation and/or selection breeding programs in these 2 local cattle breeds.
UPD detection using homozygosity profiling with a SNP genotyping microarray.

PubMed

Papenhausen, Peter; Schwartz, Stuart; Risheg, Hiba; Keitges, Elisabeth; Gadi, Inder; Burnside, Rachel D; Jaswaney, Vikram; Pappas, John; Pasion, Romela; Friedman, Kenneth; Tepperberg, James

2011-04-01

Single nucleotide polymorphism (SNP) based chromosome microarrays provide both a high-density whole genome analysis of copy number and genotype. In the past 21 months we have analyzed over 13,000 samples primarily referred for developmental delay using the Affymetrix SNP/CN 6.0 version array platform. In addition to copy number, we have focused on the relative distribution of allele homozygosity (HZ) throughout the genome to confirm a strong association of uniparental disomy (UPD) with regions of isoallelism found in most confirmed cases of UPD. We sought to determine whether a long contiguous stretch of HZ (LCSH) greater than a threshold value found only in a single chromosome would correlate with UPD of that chromosome. Nine confirmed UPD cases were retrospectively analyzed with the array in the study, each showing the anticipated LCSH with the smallest 13.5 Mb in length. This length is well above the average longest run of HZ in a set of control patients and was then set as the prospective threshold for reporting possible UPD correlation. Ninety-two cases qualified at that threshold, 46 of those had molecular UPD testing and 29 were positive. Including retrospective cases, 16 showed complete HZ across the chromosome, consistent with total isoUPD. The average size LCSH in the 19 cases that were not completely HZ was 46.3 Mb with a range of 13.5-127.8 Mb. Three patients showed only segmental UPD. Both the size and location of the LCSH are relevant to correlation with UPD. Further studies will continue to delineate an optimal threshold for LCSH/UPD correlation. Copyright © 2011 Wiley-Liss, Inc.
SAAS-CNV: A Joint Segmentation Approach on Aggregated and Allele Specific Signals for the Identification of Somatic Copy Number Alterations with Next-Generation Sequencing Data.

PubMed

Zhang, Zhongyang; Hao, Ke

2015-11-01

Cancer genomes exhibit profound somatic copy number alterations (SCNAs). Studying tumor SCNAs using massively parallel sequencing provides unprecedented resolution and meanwhile gives rise to new challenges in data analysis, complicated by tumor aneuploidy and heterogeneity as well as normal cell contamination. While the majority of read depth based methods utilize total sequencing depth alone for SCNA inference, the allele specific signals are undervalued. We proposed a joint segmentation and inference approach using both signals to meet some of the challenges. Our method consists of four major steps: 1) extracting read depth supporting reference and alternative alleles at each SNP/Indel locus and comparing the total read depth and alternative allele proportion between tumor and matched normal sample; 2) performing joint segmentation on the two signal dimensions; 3) correcting the copy number baseline from which the SCNA state is determined; 4) calling SCNA state for each segment based on both signal dimensions. The method is applicable to whole exome/genome sequencing (WES/WGS) as well as SNP array data in a tumor-control study. We applied the method to a dataset containing no SCNAs to test the specificity, created by pairing sequencing replicates of a single HapMap sample as normal/tumor pairs, as well as a large-scale WGS dataset consisting of 88 liver tumors along with adjacent normal tissues. Compared with representative methods, our method demonstrated improved accuracy, scalability to large cancer studies, capability in handling both sequencing and SNP array data, and the potential to improve the estimation of tumor ploidy and purity.
SAAS-CNV: A Joint Segmentation Approach on Aggregated and Allele Specific Signals for the Identification of Somatic Copy Number Alterations with Next-Generation Sequencing Data

PubMed Central

Zhang, Zhongyang; Hao, Ke

2015-01-01

Cancer genomes exhibit profound somatic copy number alterations (SCNAs). Studying tumor SCNAs using massively parallel sequencing provides unprecedented resolution and meanwhile gives rise to new challenges in data analysis, complicated by tumor aneuploidy and heterogeneity as well as normal cell contamination. While the majority of read depth based methods utilize total sequencing depth alone for SCNA inference, the allele specific signals are undervalued. We proposed a joint segmentation and inference approach using both signals to meet some of the challenges. Our method consists of four major steps: 1) extracting read depth supporting reference and alternative alleles at each SNP/Indel locus and comparing the total read depth and alternative allele proportion between tumor and matched normal sample; 2) performing joint segmentation on the two signal dimensions; 3) correcting the copy number baseline from which the SCNA state is determined; 4) calling SCNA state for each segment based on both signal dimensions. The method is applicable to whole exome/genome sequencing (WES/WGS) as well as SNP array data in a tumor-control study. We applied the method to a dataset containing no SCNAs to test the specificity, created by pairing sequencing replicates of a single HapMap sample as normal/tumor pairs, as well as a large-scale WGS dataset consisting of 88 liver tumors along with adjacent normal tissues. Compared with representative methods, our method demonstrated improved accuracy, scalability to large cancer studies, capability in handling both sequencing and SNP array data, and the potential to improve the estimation of tumor ploidy and purity. PMID:26583378
Development and Evaluation of a Genome-Wide 6K SNP Array for Diploid Sweet Cherry and Tetraploid Sour Cherry

PubMed Central

Peace, Cameron; Bassil, Nahla; Main, Dorrie; Ficklin, Stephen; Rosyara, Umesh R.; Stegmeir, Travis; Sebolt, Audrey; Gilmore, Barbara; Lawley, Cindy; Mockler, Todd C.; Bryant, Douglas W.; Wilhelm, Larry; Iezzoni, Amy

2012-01-01

High-throughput genome scans are important tools for genetic studies and breeding applications. Here, a 6K SNP array for use with the Illumina Infinium® system was developed for diploid sweet cherry (Prunus avium) and allotetraploid sour cherry (P. cerasus). This effort was led by RosBREED, a community initiative to enable marker-assisted breeding for rosaceous crops. Next-generation sequencing in diverse breeding germplasm provided 25 billion basepairs (Gb) of cherry DNA sequence from which were identified genome-wide SNPs for sweet cherry and for the two sour cherry subgenomes derived from sweet cherry (avium subgenome) and P. fruticosa (fruticosa subgenome). Anchoring to the peach genome sequence, recently released by the International Peach Genome Initiative, predicted relative physical locations of the 1.9 million putative SNPs detected, preliminarily filtered to 368,943 SNPs. Further filtering was guided by results of a 144-SNP subset examined with the Illumina GoldenGate® assay on 160 accessions. A 6K Infinium® II array was designed with SNPs evenly spaced genetically across the sweet and sour cherry genomes. SNPs were developed for each sour cherry subgenome by using minor allele frequency in the sour cherry detection panel to enrich for subgenome-specific SNPs followed by targeting to either subgenome according to alleles observed in sweet cherry. The array was evaluated using panels of sweet (n = 269) and sour (n = 330) cherry breeding germplasm. Approximately one third of array SNPs were informative for each crop. A total of 1825 polymorphic SNPs were verified in sweet cherry, 13% of these originally developed for sour cherry. Allele dosage was resolved for 2058 polymorphic SNPs in sour cherry, one third of these being originally developed for sweet cherry. This publicly available genomics resource represents a significant advance in cherry genome-scanning capability that will accelerate marker-locus-trait association discovery, genome structure investigation, and genetic diversity assessment in this diploid-tetraploid crop group. PMID:23284615
Quantitative trait loci markers derived from whole genome sequence data increases the reliability of genomic prediction.

PubMed

Brøndum, R F; Su, G; Janss, L; Sahana, G; Guldbrandtsen, B; Boichard, D; Lund, M S

2015-06-01

This study investigated the effect on the reliability of genomic prediction when a small number of significant variants from single marker analysis based on whole genome sequence data were added to the regular 54k single nucleotide polymorphism (SNP) array data. The extra markers were selected with the aim of augmenting the custom low-density Illumina BovineLD SNP chip (San Diego, CA) used in the Nordic countries. The single-marker analysis was done breed-wise on all 16 index traits included in the breeding goals for Nordic Holstein, Danish Jersey, and Nordic Red cattle plus the total merit index itself. Depending on the trait's economic weight, 15, 10, or 5 quantitative trait loci (QTL) were selected per trait per breed and 3 to 5 markers were selected to tag each QTL. After removing duplicate markers (same marker selected for more than one trait or breed) and filtering for high pairwise linkage disequilibrium and assaying performance on the array, a total of 1,623 QTL markers were selected for inclusion on the custom chip. Genomic prediction analyses were performed for Nordic and French Holstein and Nordic Red animals using either a genomic BLUP or a Bayesian variable selection model. When using the genomic BLUP model including the QTL markers in the analysis, reliability was increased by up to 4 percentage points for production traits in Nordic Holstein animals, up to 3 percentage points for Nordic Reds, and up to 5 percentage points for French Holstein. Smaller gains of up to 1 percentage point was observed for mastitis, but only a 0.5 percentage point increase was seen for fertility. When using a Bayesian model accuracies were generally higher with only 54k data compared with the genomic BLUP approach, but increases in reliability were relatively smaller when QTL markers were included. Results from this study indicate that the reliability of genomic prediction can be increased by including markers significant in genome-wide association studies on whole genome sequence data alongside the 54k SNP set. Copyright © 2015 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
Association Analysis of the Ephrin-B2 Gene in African-Americans with End-Stage Renal Disease

PubMed Central

Hicks, Pamela J.; Staten, Jennifer L.; Palmer, Nicholette D.; Langefeld, Carl D.; Ziegler, Julie T.; Keene, Keith L.; Sale, Michele M.; Bowden, Donald W.; Freedman, Barry I.

2008-01-01

Background Genome scans in African-Americans with end-stage renal disease (ESRD) identified linkage on chromosome 13q33 in the region containing the ephrin-B2 ligand (EFNB2) genes. Interactions between the ephrin-B2 receptor and ephrin-B2 ligand play essential roles in renal angiogenesis, blood vessel maturation, and kidney disease. Methods The EFNB2 gene was evaluated as a positional candidate for non-diabetic and diabetic ESRD susceptibility in 1,071 unrelated African-American subjects; 316 with non-diabetic etiologies of ESRD, 394 with type 2 diabetes-associated ESRD and 361 healthy controls. Single nucleotide polymorphism (SNP) genotyping was performed on the Sequenom Mass Array System. Statistical analyses were computed using Dandelion version 1.26, Snpaddmix version 1.4 and Haploview version 3.32. Results Twenty-eight HapMap tag SNPs were genotyped spanning the 39 kilobases (kb) of the EFNB2 coding region, with average spacing of 1.43 kb. Analysis of 710 ESRD patient samples and 361 controls provided no evidence of single SNP associations in either diabetic or non-diabetic ESRD; although nominal evidence of association with all-cause ESRD was observed with a two SNP (p = 0.022) and three SNP (p = 0.023) haplotype, both containing SNPs rs7490924 and rs2391335 in intron 1. Conclusions Although an attractive positional candidate gene, polymorphisms in the EFNB2 gene do not appear to contribute in a substantial way to non-diabetic, diabetic or all-cause ESRD susceptibility in African-Americans. Additional genes within the chromosome 13q33 linkage interval are likely contributors to African-American non-diabetic ESRD. PMID:18580054
A High-Density Consensus Map of Common Wheat Integrating Four Mapping Populations Scanned by the 90K SNP Array

PubMed Central

Wen, Weie; He, Zhonghu; Gao, Fengmei; Liu, Jindong; Jin, Hui; Zhai, Shengnan; Qu, Yanying; Xia, Xianchun

2017-01-01

A high-density consensus map is a powerful tool for gene mapping, cloning and molecular marker-assisted selection in wheat breeding. The objective of this study was to construct a high-density, single nucleotide polymorphism (SNP)-based consensus map of common wheat (Triticum aestivum L.) by integrating genetic maps from four recombinant inbred line populations. The populations were each genotyped using the wheat 90K Infinium iSelect SNP assay. A total of 29,692 SNP markers were mapped on 21 linkage groups corresponding to 21 hexaploid wheat chromosomes, covering 2,906.86 cM, with an overall marker density of 10.21 markers/cM. Compared with the previous maps based on the wheat 90K SNP chip detected 22,736 (76.6%) of the SNPs with consistent chromosomal locations, whereas 1,974 (6.7%) showed different chromosomal locations, and 4,982 (16.8%) were newly mapped. Alignment of the present consensus map and the wheat expressed sequence tags (ESTs) Chromosome Bin Map enabled assignment of 1,221 SNP markers to specific chromosome bins and 819 ESTs were integrated into the consensus map. The marker orders of the consensus map were validated based on physical positions on the wheat genome with Spearman rank correlation coefficients ranging from 0.69 (4D) to 0.97 (1A, 4B, 5B, and 6A), and were also confirmed by comparison with genetic position on the previously 40K SNP consensus map with Spearman rank correlation coefficients ranging from 0.84 (6D) to 0.99 (6A). Chromosomal rearrangements reported previously were confirmed in the present consensus map and new putative rearrangements were identified. In addition, an integrated consensus map was developed through the combination of five published maps with ours, containing 52,607 molecular markers. The consensus map described here provided a high-density SNP marker map and a reliable order of SNPs, representing a step forward in mapping and validation of chromosomal locations of SNPs on the wheat 90K array. Moreover, it can be used as a reference for quantitative trait loci (QTL) mapping to facilitate exploitation of genes and QTL in wheat breeding. PMID:28848588
"Gap hunting" to characterize clustered probe signals in Illumina methylation array data.

PubMed

Andrews, Shan V; Ladd-Acosta, Christine; Feinberg, Andrew P; Hansen, Kasper D; Fallin, M Daniele

2016-01-01

The Illumina 450k array has been widely used in epigenetic association studies. Current quality-control (QC) pipelines typically remove certain sets of probes, such as those containing a SNP or with multiple mapping locations. An additional set of potentially problematic probes are those with DNA methylation distributions characterized by two or more distinct clusters separated by gaps. Data-driven identification of such probes may offer additional insights for downstream analyses. We developed a procedure, termed "gap hunting," to identify probes showing clustered distributions. Among 590 peripheral blood samples from the Study to Explore Early Development, we identified 11,007 "gap probes." The vast majority (9199) are likely attributed to an underlying SNP(s) or other variant in the probe, although SNP-affected probes exist that do not produce a gap signals. Specific factors predict which SNPs lead to gap signals, including type of nucleotide change, probe type, DNA strand, and overall methylation state. These expected effects are demonstrated in paired genotype and 450k data on the same samples. Gap probes can also serve as a surrogate for the local genetic sequence on a haplotype scale and can be used to adjust for population stratification. The characteristics of gap probes reflect potentially informative biology. QC pipelines may benefit from an efficient data-driven approach that "flags" gap probes, rather than filtering such probes, followed by careful interpretation of downstream association analyses. Our results should translate directly to the recently released Illumina EPIC array given the similar chemistry and content design.
Haplotype-Based Genotyping in Polyploids.

PubMed

Clevenger, Josh P; Korani, Walid; Ozias-Akins, Peggy; Jackson, Scott

2018-01-01

Accurate identification of polymorphisms from sequence data is crucial to unlocking the potential of high throughput sequencing for genomics. Single nucleotide polymorphisms (SNPs) are difficult to accurately identify in polyploid crops due to the duplicative nature of polyploid genomes leading to low confidence in the true alignment of short reads. Implementing a haplotype-based method in contrasting subgenome-specific sequences leads to higher accuracy of SNP identification in polyploids. To test this method, a large-scale 48K SNP array (Axiom Arachis2) was developed for Arachis hypogaea (peanut), an allotetraploid, in which 1,674 haplotype-based SNPs were included. Results of the array show that 74% of the haplotype-based SNP markers could be validated, which is considerably higher than previous methods used for peanut. The haplotype method has been implemented in a standalone program, HAPLOSWEEP, which takes as input bam files and a vcf file and identifies haplotype-based markers. Haplotype discovery can be made within single reads or span paired reads, and can leverage long read technology by targeting any length of haplotype. Haplotype-based genotyping is applicable in all allopolyploid genomes and provides confidence in marker identification and in silico-based genotyping for polyploid genomics.
Coverage and efficiency in current SNP chips

PubMed Central

Ha, Ngoc-Thuy; Freytag, Saskia; Bickeboeller, Heike

2014-01-01

To answer the question as to which commercial high-density SNP chip covers most of the human genome given a fixed budget, we compared the performance of 12 chips of different sizes released by Affymetrix and Illumina for the European, Asian, and African populations. These include Affymetrix' relatively new population-optimized arrays, whose SNP sets are each tailored toward a specific ethnicity. Our evaluation of the chips included the use of two measures, efficiency and cost–benefit ratio, which we developed as supplements to genetic coverage. Unlike coverage, these measures factor in the price of a chip or its substitute size (number of SNPs on chip), allowing comparisons to be drawn between differently priced chips. In this fashion, we identified the Affymetrix population-optimized arrays as offering the most cost-effective coverage for the Asian and African population. For the European population, we established the Illumina Human Omni 2.5-8 as the preferred choice. Interestingly, the Affymetrix chip tailored toward an Eastern Asian subpopulation performed well for all three populations investigated. However, our coverage estimates calculated for all chips proved much lower than those advertised by the producers. All our analyses were based on the 1000 Genome Project as reference population. PMID:24448550
Single-nucleotide polymorphism genotyping on optical thin-film biosensor chips.

PubMed

Zhong, Xiao-Bo; Reynolds, Robert; Kidd, Judith R; Kidd, Kenneth K; Jenison, Robert; Marlar, Richard A; Ward, David C

2003-09-30

Single-nucleotide polymorphisms (SNPs) constitute the bulk of human genetic variation and provide excellent markers to identify genetic factors contributing to complex disease susceptibility. A rapid, sensitive, and inexpensive assay is important for large-scale SNP scoring. Here we report the development of a multiplex SNP detection system using silicon chips coated to create a thin-film optical biosensor. Allele-discriminating, aldehyde-labeled oligonucleotides are arrayed and covalently attached to a hydrazinederivatized chip surface. Target sequences (e.g., PCR amplicons) then are hybridized in the presence of a mixture of biotinylated detector probes, one for each SNP, and a thermostable DNA ligase. After a stringent wash (0.01 M NaOH), ligation of biotinylated detector probes to perfectly matched capture oligomers is visualized as a color change on the chip surface (gold to blue/purple) after brief incubations with an anti-biotin IgG-horseradish peroxidase conjugate and a precipitable horseradish peroxidase substrate. Testing of PCR fragments is completed in 30-40 min. Up to several hundred SNPs can be assayed on a 36-mm2 chip, and SNP scoring can be done by eye or with a simple digital-camera system. This assay is extremely robust, exhibits high sensitivity and specificity, and is format-flexible and economical. In studies of mutations associated with risk for venous thrombosis and genotyping/haplotyping of African-American samples, we document high-fidelity analysis with 0 misassignments in 500 assays performed in duplicate.
Maximization of Markers Linked in Coupling for Tetraploid Potatoes via Monoparental Haploids

PubMed Central

Bartkiewicz, Annette M.; Chilla, Friederike; Terefe-Ayana, Diro; Lübeck, Jens; Strahwald, Josef; Tacke, Eckhard; Hofferbert, Hans-Reinhard; Linde, Marcus; Debener, Thomas

2018-01-01

Haploid potato populations derived from a single tetraploid donor constitute an efficient strategy to analyze markers segregating from a single donor genotype. Analysis of marker segregation in populations derived from crosses between polysomic tetraploids is complicated by a maximum of eight segregating alleles, multiple dosages of the markers and problems related to linkage analysis of marker segregation in repulsion. Here, we present data on two monoparental haploid populations generated by prickle pollination of two tetraploid cultivars with Solanum phureja and genotyped with the 12.8 k SolCAP single nucleotide polymorphism (SNP) array. We show that in a population of monoparental haploids, the number of biallelic SNP markers segregating in linkage to loci from the tetraploid donor genotype is much larger than in putative crosses of this genotype to a diverse selection of 125 tetraploid cultivars. Although this strategy is more laborious than conventional breeding, the generation of haploid progeny for efficient marker analysis is straightforward if morphological markers and flow cytometry are utilized to select true haploid progeny. The level of introgressed fragments from S. phureja, the haploid inducer, is very low, supporting its suitability for genetic analysis. Mapping with single-dose markers allowed the analysis of quantitative trait loci (QTL) for four phenotypic traits. PMID:29868076
Diagnosis of Familial Wolf-Hirschhorn Syndrome due to a Paternal Cryptic Chromosomal Rearrangement by Conventional and Molecular Cytogenetic Techniques

PubMed Central

Venegas-Vega, Carlos A.; Zepeda, Luis M.; Garduño-Zarazúa, Luz M.; Berumen, Jaime; Kofman, Susana; Cervantes, Alicia

2013-01-01

The use of conventional cytogenetic techniques in combination with fluorescent in situ hybridization (FISH) and single-nucleotide polymorphism (SNP) microarrays is necessary for the identification of cryptic rearrangements in the diagnosis of chromosomal syndromes. We report two siblings, a boy of 9 years and 9 months of age and his 7-years- and 5-month-old sister, with the classic Wolf-Hirschhorn syndrome (WHS) phenotype. Using high-resolution GTG- and NOR-banding karyotypes, as well as FISH analysis, we characterized a pure 4p deletion in both sibs and a balanced rearrangement in their father, consisting in an insertion of 4p material within a nucleolar organizing region of chromosome 15. Copy number variant (CNV) analysis using SNP arrays showed that both siblings have a similar size of 4p deletion (~6.5 Mb). Our results strongly support the need for conventional cytogenetic and FISH analysis, as well as high-density microarray mapping for the optimal characterization of the genetic imbalance in patients with WHS; parents must always be studied for recognizing cryptic balanced chromosomal rearrangements for an adequate genetic counseling. PMID:23484094

High-density genetic map construction and comparative genome analysis in asparagus bean.

PubMed

Huang, Haitao; Tan, Huaqiang; Xu, Dongmei; Tang, Yi; Niu, Yisong; Lai, Yunsong; Tie, Manman; Li, Huanxiu

2018-03-19

Genetic maps are a prerequisite for quantitative trait locus (QTL) analysis, marker-assisted selection (MAS), fine gene mapping, and assembly of genome sequences. So far, several asparagus bean linkage maps have been established using various kinds of molecular markers. However, these maps were all constructed by gel- or array-based markers. No maps based on sequencing method have been reported. In this study, an NGS-based strategy, SLAF-seq, was applied to create a high-density genetic map for asparagus bean. Through SLAF library construction and Illumina sequencing of two parents and 100 F2 individuals, a total of 55,437 polymorphic SLAF markers were developed and mined for SNP markers. The map consisted of 5,225 SNP markers in 11 LGs, spanning a total distance of 1,850.81 cM, with an average distance between markers of 0.35 cM. Comparative genome analysis with four other legume species, soybean, common bean, mung bean and adzuki bean showed that asparagus bean is genetically more related to adzuki bean. The results will provide a foundation for future genomic research, such as QTL fine mapping, comparative mapping in pulses, and offer support for assembling asparagus bean genome sequence.
SEURAT: Visual analytics for the integrated analysis of microarray data

PubMed Central

2010-01-01

Background In translational cancer research, gene expression data is collected together with clinical data and genomic data arising from other chip based high throughput technologies. Software tools for the joint analysis of such high dimensional data sets together with clinical data are required. Results We have developed an open source software tool which provides interactive visualization capability for the integrated analysis of high-dimensional gene expression data together with associated clinical data, array CGH data and SNP array data. The different data types are organized by a comprehensive data manager. Interactive tools are provided for all graphics: heatmaps, dendrograms, barcharts, histograms, eventcharts and a chromosome browser, which displays genetic variations along the genome. All graphics are dynamic and fully linked so that any object selected in a graphic will be highlighted in all other graphics. For exploratory data analysis the software provides unsupervised data analytics like clustering, seriation algorithms and biclustering algorithms. Conclusions The SEURAT software meets the growing needs of researchers to perform joint analysis of gene expression, genomical and clinical data. PMID:20525257
Bovine exome sequence analysis and targeted SNP genotyping of recessive fertility defects BH1, HH2, and HH3 reveal a putative causative mutation in SMC2 for HH3.

PubMed

McClure, Matthew C; Bickhart, Derek; Null, Dan; Vanraden, Paul; Xu, Lingyang; Wiggans, George; Liu, George; Schroeder, Steve; Glasscock, Jarret; Armstrong, Jon; Cole, John B; Van Tassell, Curtis P; Sonstegard, Tad S

2014-01-01

The recent discovery of bovine haplotypes with negative effects on fertility in the Brown Swiss, Holstein, and Jersey breeds has allowed producers to identify carrier animals using commercial single nucleotide polymorphism (SNP) genotyping assays. This study was devised to identify the causative mutations underlying defective bovine embryo development contained within three of these haplotypes (Brown Swiss haplotype 1 and Holstein haplotypes 2 and 3) by combining exome capture with next generation sequencing. Of the 68,476,640 sequence variations (SV) identified, only 1,311 genome-wide SNP were concordant with the haplotype status of 21 sequenced carriers. Validation genotyping of 36 candidate SNP identified only 1 variant that was concordant to Holstein haplotype 3 (HH3), while no variants located within the refined intervals for HH2 or BH1 were concordant. The variant strictly associated with HH3 is a non-synonymous SNP (T/C) within exon 24 of the Structural Maintenance of Chromosomes 2 (SMC2) on Chromosome 8 at position 95,410,507 (UMD3.1). This polymorphism changes amino acid 1135 from phenylalanine to serine and causes a non-neutral, non-tolerated, and evolutionarily unlikely substitution within the NTPase domain of the encoded protein. Because only exome capture sequencing was used, we could not rule out the possibility that the true causative mutation for HH3 might lie in a non-exonic genomic location. Given the essential role of SMC2 in DNA repair, chromosome condensation and segregation during cell division, our findings strongly support the non-synonymous SNP (T/C) in SMC2 as the likely causative mutation. The absence of concordant variations for HH2 or BH1 suggests either the underlying causative mutations lie within a non-exomic region or in exome regions not covered by the capture array.
Bovine Exome Sequence Analysis and Targeted SNP Genotyping of Recessive Fertility Defects BH1, HH2, and HH3 Reveal a Putative Causative Mutation in SMC2 for HH3

PubMed Central

McClure, Matthew C.; Bickhart, Derek; Null, Dan; VanRaden, Paul; Xu, Lingyang; Wiggans, George; Liu, George; Schroeder, Steve; Glasscock, Jarret; Armstrong, Jon; Cole, John B.; Van Tassell, Curtis P.; Sonstegard, Tad S.

2014-01-01

The recent discovery of bovine haplotypes with negative effects on fertility in the Brown Swiss, Holstein, and Jersey breeds has allowed producers to identify carrier animals using commercial single nucleotide polymorphism (SNP) genotyping assays. This study was devised to identify the causative mutations underlying defective bovine embryo development contained within three of these haplotypes (Brown Swiss haplotype 1 and Holstein haplotypes 2 and 3) by combining exome capture with next generation sequencing. Of the 68,476,640 sequence variations (SV) identified, only 1,311 genome-wide SNP were concordant with the haplotype status of 21 sequenced carriers. Validation genotyping of 36 candidate SNP identified only 1 variant that was concordant to Holstein haplotype 3 (HH3), while no variants located within the refined intervals for HH2 or BH1 were concordant. The variant strictly associated with HH3 is a non-synonymous SNP (T/C) within exon 24 of the Structural Maintenance of Chromosomes 2 (SMC2) on Chromosome 8 at position 95,410,507 (UMD3.1). This polymorphism changes amino acid 1135 from phenylalanine to serine and causes a non-neutral, non-tolerated, and evolutionarily unlikely substitution within the NTPase domain of the encoded protein. Because only exome capture sequencing was used, we could not rule out the possibility that the true causative mutation for HH3 might lie in a non-exonic genomic location. Given the essential role of SMC2 in DNA repair, chromosome condensation and segregation during cell division, our findings strongly support the non-synonymous SNP (T/C) in SMC2 as the likely causative mutation. The absence of concordant variations for HH2 or BH1 suggests either the underlying causative mutations lie within a non-exomic region or in exome regions not covered by the capture array. PMID:24667746
The Role of Constitutional Copy Number Variants in Breast Cancer

PubMed Central

Walker, Logan C.; Wiggins, George A.R.; Pearson, John F.

2015-01-01

Constitutional copy number variants (CNVs) include inherited and de novo deviations from a diploid state at a defined genomic region. These variants contribute significantly to genetic variation and disease in humans, including breast cancer susceptibility. Identification of genetic risk factors for breast cancer in recent years has been dominated by the use of genome-wide technologies, such as single nucleotide polymorphism (SNP)-arrays, with a significant focus on single nucleotide variants. To date, these large datasets have been underutilised for generating genome-wide CNV profiles despite offering a massive resource for assessing the contribution of these structural variants to breast cancer risk. Technical challenges remain in determining the location and distribution of CNVs across the human genome due to the accuracy of computational prediction algorithms and resolution of the array data. Moreover, better methods are required for interpreting the functional effect of newly discovered CNVs. In this review, we explore current and future application of SNP array technology to assess rare and common CNVs in association with breast cancer risk in humans. PMID:27600231
Renal cell carcinoma primary cultures maintain genomic and phenotypic profile of parental tumor tissues.

PubMed

Cifola, Ingrid; Bianchi, Cristina; Mangano, Eleonora; Bombelli, Silvia; Frascati, Fabio; Fasoli, Ester; Ferrero, Stefano; Di Stefano, Vitalba; Zipeto, Maria A; Magni, Fulvio; Signorini, Stefano; Battaglia, Cristina; Perego, Roberto A

2011-06-13

Clear cell renal cell carcinoma (ccRCC) is characterized by recurrent copy number alterations (CNAs) and loss of heterozygosity (LOH), which may have potential diagnostic and prognostic applications. Here, we explored whether ccRCC primary cultures, established from surgical tumor specimens, maintain the DNA profile of parental tumor tissues allowing a more confident CNAs and LOH discrimination with respect to the original tissues. We established a collection of 9 phenotypically well-characterized ccRCC primary cell cultures. Using the Affymetrix SNP array technology, we performed the genome-wide copy number (CN) profiling of both cultures and corresponding tumor tissues. Global concordance for each culture/tissue pair was assayed evaluating the correlations between whole-genome CN profiles and SNP allelic calls. CN analysis was performed using the two CNAG v3.0 and Partek software, and comparing results returned by two different algorithms (Hidden Markov Model and Genomic Segmentation). A very good overlap between the CNAs of each culture and corresponding tissue was observed. The finding, reinforced by high whole-genome CN correlations and SNP call concordances, provided evidence that each culture was derived from its corresponding tissue and maintained the genomic alterations of parental tumor. In addition, primary culture DNA profile remained stable for at least 3 weeks, till to third passage. These cultures showed a greater cell homogeneity and enrichment in tumor component than original tissues, thus enabling a better discrimination of CNAs and LOH. Especially for hemizygous deletions, primary cultures presented more evident CN losses, typically accompanied by LOH; differently, in original tissues the intensity of these deletions was weaken by normal cell contamination and LOH calls were missed. ccRCC primary cultures are a reliable in vitro model, well-reproducing original tumor genetics and phenotype, potentially useful for future functional approaches aimed to study genes or pathways involved in ccRCC etiopathogenesis and to identify novel clinical markers or therapeutic targets. Moreover, SNP array technology proved to be a powerful tool to better define the cell composition and homogeneity of RCC primary cultures. © 2011 Cifola et al; licensee BioMed Central Ltd.
DASH-2: Flexible, Low-Cost, and High-Throughput SNP Genotyping by Dynamic Allele-Specific Hybridization on Membrane Arrays

PubMed Central

Jobs, Magnus; Howell, W. Mathias; Strömqvist, Linda; Mayr, Torsten; Brookes, Anthony J.

2003-01-01

Genotyping technologies need to be continually improved in terms of their flexibility, cost-efficiency, and throughput, to push forward genome variation analysis. To this end, we have leveraged the inherent simplicity of dynamic allele-specific hybridization (DASH) and coupled it to recent innovations of centrifugal arrays and iFRET. We have thereby created a new genotyping platform we term DASH-2, which we demonstrate and evaluate in this report. The system is highly flexible in many ways (any plate format, PCR multiplexing, serial and parallel array processing, spectral-multiplexing of hybridization probes), thus supporting a wide range of application scales and objectives. Precision is demonstrated to be in the range 99.8–100%, and assay costs are 0.05 USD or less per genotype assignment. DASH-2 thus provides a powerful new alternative for genotyping practice, which can be used without the need for expensive robotics support. PMID:12727908
The iSelect 9 K SNP analysis revealed polyploidization induced revolutionary changes and intense human selection causing strong haplotype blocks in wheat.

PubMed

Hao, Chenyang; Wang, Yuquan; Chao, Shiaoman; Li, Tian; Liu, Hongxia; Wang, Lanfen; Zhang, Xueyong

2017-01-30

A Chinese wheat mini core collection was genotyped using the wheat 9 K iSelect SNP array. Total 2420 and 2396 polymorphic SNPs were detected on the A and the B genome chromosomes, which formed 878 haplotype blocks. There were more blocks in the B genome, but the average block size was significantly (P < 0.05) smaller than those in the A genome. Intense selection (domestication and breeding) had a stronger effect on the A than on the B genome chromosomes. Based on the genetic pedigrees, many blocks can be traced back to a well-known Strampelli cross, which was made one century ago. Furthermore, polyploidization of wheat (both tetraploidization and hexaploidization) induced revolutionary changes in both the A and the B genomes, with a greater increase of gene diversity compared to their diploid ancestors. Modern breeding has dramatically increased diversity in the gene coding regions, though obvious blocks were formed on most of the chromosomes in both tetraploid and hexaploid wheats. Tag-SNP markers identified in this study can be used for marker assisted selection using haplotype blocks as a wheat breeding strategy. This strategy can also be employed to facilitate genome selection in other self-pollinating crop species.
Genome-wide association studies for multiple diseases of the German Shepherd Dog

PubMed Central

Tsai, Kate L.; Noorai, Rooksana E.; Starr-Moss, Alison N.; Quignon, Pascale; Rinz, Caitlin J.; Ostrander, Elaine A.; Steiner, Jörg M.; Murphy, Keith E.

2012-01-01

The German Shepherd Dog (GSD) is a popular working and companion breed for which over 50 hereditary diseases have been documented. Herein, SNP profiles for 197 GSDs were generated using the Affymetrix v2 canine SNP array for a genome-wide association study to identify loci associated with four diseases: pituitary dwarfism, degenerative myelopathy (DM), congenital megaesophagus (ME), and pancreatic acinar atrophy (PAA). A locus on Chr 9 is strongly associated with pituitary dwarfism and is proximal to a plausible candidate gene, LHX3. Results for DM confirm a major locus encompassing SOD1, in which an associated point mutation was previously identified, but do not suggest modifier loci. Several SNPs on Chr 12 are associated with ME and a 4.7 Mb haplotype block is present in affected dogs. Analysis of additional ME cases for a SNP within the haplotype provides further support for this association. Results for PAA indicate more complex genetic underpinnings. Several regions on multiple chromosomes reach genome-wide significance. However, no major locus is apparent and only two associated haplotype blocks, on Chrs 7 and 12 are observed. These data suggest that PAA may be governed by multiple loci with small effects, or it may be a heterogeneous disorder. PMID:22105877
Prevalence and prognostic impact of allelic imbalances associated with leukemic transformation of Philadelphia chromosome–negative myeloproliferative neoplasms

PubMed Central

Krug, Utz O.; Lee, Dhong Hyun Tony; Kawamata, Norihiko; Iwanski, Gabriela B.; Lasho, Terra; Weiss, Tamara; Nowak, Daniel; Koren-Michowitz, Maya; Kato, Motohiro; Sanada, Masashi; Shih, Lee-Yung; Nagler, Arnon; Raynaud, Sophie D.; Müller-Tidow, Carsten; Mesa, Ruben; Haferlach, Torsten; Gilliland, D. Gary; Tefferi, Ayalew; Ogawa, Seishi; Koeffler, H. Phillip

2010-01-01

Philadelphia chromosome–negative myeloproliferative neoplasms (MPNs) including polycythemia vera, essential thrombocythemia, and primary myelofibrosis show an inherent tendency for transformation into leukemia (MPN-blast phase), which is hypothesized to be accompanied by acquisition of additional genomic lesions. We, therefore, examined chromosomal abnormalities by high-resolution single nucleotide polymorphism (SNP) array in 88 MPN patients, as well as 71 cases with MPN-blast phase, and correlated these findings with their clinical parameters. Frequent genomic alterations were found in MPN after leukemic transformation with up to 3-fold more genomic changes per sample compared with samples in chronic phase (P < .001). We identified commonly altered regions involved in disease progression including not only established targets (ETV6, TP53, and RUNX1) but also new candidate genes on 7q, 16q, 19p, and 21q. Moreover, trisomy 8 or amplification of 8q24 (MYC) was almost exclusively detected in JAK2V617F− cases with MPN-blast phase. Remarkably, copy number–neutral loss of heterozygosity (CNN-LOH) on either 7q or 9p including homozygous JAK2V617F was related to decreased survival after leukemic transformation (P = .01 and P = .016, respectively). Our high-density SNP-array analysis of MPN genomes in the chronic compared with leukemic stage identified novel target genes and provided prognostic insights associated with the evolution to leukemia. PMID:20068225
Comprehensive comparison of three commercial human whole-exome capture platforms.

PubMed

Asan; Xu, Yu; Jiang, Hui; Tyler-Smith, Chris; Xue, Yali; Jiang, Tao; Wang, Jiawei; Wu, Mingzhi; Liu, Xiao; Tian, Geng; Wang, Jun; Wang, Jian; Yang, Huangming; Zhang, Xiuqing

2011-09-28

Exome sequencing, which allows the global analysis of protein coding sequences in the human genome, has become an effective and affordable approach to detecting causative genetic mutations in diseases. Currently, there are several commercial human exome capture platforms; however, the relative performances of these have not been characterized sufficiently to know which is best for a particular study. We comprehensively compared three platforms: NimbleGen's Sequence Capture Array and SeqCap EZ, and Agilent's SureSelect. We assessed their performance in a variety of ways, including number of genes covered and capture efficacy. Differences that may impact on the choice of platform were that Agilent SureSelect covered approximately 1,100 more genes, while NimbleGen provided better flanking sequence capture. Although all three platforms achieved similar capture specificity of targeted regions, the NimbleGen platforms showed better uniformity of coverage and greater genotype sensitivity at 30- to 100-fold sequencing depth. All three platforms showed similar power in exome SNP calling, including medically relevant SNPs. Compared with genotyping and whole-genome sequencing data, the three platforms achieved a similar accuracy of genotype assignment and SNP detection. Importantly, all three platforms showed similar levels of reproducibility, GC bias and reference allele bias. We demonstrate key differences between the three platforms, particularly advantages of solutions over array capture and the importance of a large gene target set.
A genome-wide SNP-association study confirms a sequence variant (g.66493737C>T) in the equine myostatin (MSTN) gene as the most powerful predictor of optimum racing distance for Thoroughbred racehorses

PubMed Central

2010-01-01

Background Thoroughbred horses have been selected for traits contributing to speed and stamina for centuries. It is widely recognized that inherited variation in physical and physiological characteristics is responsible for variation in individual aptitude for race distance, and that muscle phenotypes in particular are important. Results A genome-wide SNP-association study for optimum racing distance was performed using the EquineSNP50 Bead Chip genotyping array in a cohort of n = 118 elite Thoroughbred racehorses divergent for race distance aptitude. In a cohort-based association test we evaluated genotypic variation at 40,977 SNPs between horses suited to short distance (≤ 8 f) and middle-long distance (> 8 f) races. The most significant SNP was located on chromosome 18: BIEC2-417495 ~690 kb from the gene encoding myostatin (MSTN) [Punadj. = 6.96 × 10-6]. Considering best race distance as a quantitative phenotype, a peak of association on chromosome 18 (chr18:65809482-67545806) comprising eight SNPs encompassing a 1.7 Mb region was observed. Again, similar to the cohort-based analysis, the most significant SNP was BIEC2-417495 (Punadj. = 1.61 × 10-9; PBonf. = 6.58 × 10-5). In a candidate gene study we have previously reported a SNP (g.66493737C>T) in MSTN associated with best race distance in Thoroughbreds; however, its functional and genome-wide relevance were uncertain. Additional re-sequencing in the flanking regions of the MSTN gene revealed four novel 3' UTR SNPs and a 227 bp SINE insertion polymorphism in the 5' UTR promoter sequence. Linkage disequilibrium was highest between g.66493737C>T and BIEC2-417495 (r2 = 0.86). Conclusions Comparative association tests consistently demonstrated the g.66493737C>T SNP as the superior variant in the prediction of distance aptitude in racehorses (g.66493737C>T, P = 1.02 × 10-10; BIEC2-417495, Punadj. = 1.61 × 10-9). Functional investigations will be required to determine whether this polymorphism affects putative transcription-factor binding and gives rise to variation in gene and protein expression. Nonetheless, this study demonstrates that the g.66493737C>T SNP provides the most powerful genetic marker for prediction of race distance aptitude in Thoroughbreds. PMID:20932346
A genome-wide SNP-association study confirms a sequence variant (g.66493737C>T) in the equine myostatin (MSTN) gene as the most powerful predictor of optimum racing distance for Thoroughbred racehorses.

PubMed

Hill, Emmeline W; McGivney, Beatrice A; Gu, Jingjing; Whiston, Ronan; Machugh, David E

2010-10-11

Thoroughbred horses have been selected for traits contributing to speed and stamina for centuries. It is widely recognized that inherited variation in physical and physiological characteristics is responsible for variation in individual aptitude for race distance, and that muscle phenotypes in particular are important. A genome-wide SNP-association study for optimum racing distance was performed using the EquineSNP50 Bead Chip genotyping array in a cohort of n = 118 elite Thoroughbred racehorses divergent for race distance aptitude. In a cohort-based association test we evaluated genotypic variation at 40,977 SNPs between horses suited to short distance (≤ 8 f) and middle-long distance (> 8 f) races. The most significant SNP was located on chromosome 18: BIEC2-417495 ~690 kb from the gene encoding myostatin (MSTN) [P(unadj.) = 6.96 x 10⁻⁶]. Considering best race distance as a quantitative phenotype, a peak of association on chromosome 18 (chr18:65809482-67545806) comprising eight SNPs encompassing a 1.7 Mb region was observed. Again, similar to the cohort-based analysis, the most significant SNP was BIEC2-417495 (P(unadj.) = 1.61 x 10⁻⁹; P(Bonf.) = 6.58 x 10⁻⁵). In a candidate gene study we have previously reported a SNP (g.66493737C>T) in MSTN associated with best race distance in Thoroughbreds; however, its functional and genome-wide relevance were uncertain. Additional re-sequencing in the flanking regions of the MSTN gene revealed four novel 3' UTR SNPs and a 227 bp SINE insertion polymorphism in the 5' UTR promoter sequence. Linkage disequilibrium was highest between g.66493737C>T and BIEC2-417495 (r² = 0.86). Comparative association tests consistently demonstrated the g.66493737C>T SNP as the superior variant in the prediction of distance aptitude in racehorses (g.66493737C>T, P = 1.02 x 10⁻¹⁰; BIEC2-417495, P(unadj.) = 1.61 x 10⁻⁹). Functional investigations will be required to determine whether this polymorphism affects putative transcription-factor binding and gives rise to variation in gene and protein expression. Nonetheless, this study demonstrates that the g.66493737C>T SNP provides the most powerful genetic marker for prediction of race distance aptitude in Thoroughbreds.
A New Single Nucleotide Polymorphism Database for Rainbow Trout Generated Through Whole Genome Resequencing.

PubMed

Gao, Guangtu; Nome, Torfinn; Pearse, Devon E; Moen, Thomas; Naish, Kerry A; Thorgaard, Gary H; Lien, Sigbjørn; Palti, Yniv

2018-01-01

Single-nucleotide polymorphisms (SNPs) are highly abundant markers, which are broadly distributed in animal genomes. For rainbow trout ( Oncorhynchus mykiss ), SNP discovery has been previously done through sequencing of restriction-site associated DNA (RAD) libraries, reduced representation libraries (RRL) and RNA sequencing. Recently we have performed high coverage whole genome resequencing with 61 unrelated samples, representing a wide range of rainbow trout and steelhead populations, with 49 new samples added to 12 aquaculture samples from AquaGen (Norway) that we previously used for SNP discovery. Of the 49 new samples, 11 were double-haploid lines from Washington State University (WSU) and 38 represented wild and hatchery populations from a wide range of geographic distribution and with divergent migratory phenotypes. We then mapped the sequences to the new rainbow trout reference genome assembly (GCA_002163495.1) which is based on the Swanson YY doubled haploid line. Variant calling was conducted with FreeBayes and SAMtools mpileup , followed by filtering of SNPs based on quality score, sequence complexity, read depth on the locus, and number of genotyped samples. Results from the two variant calling programs were compared and genotypes of the double haploid samples were used for detecting and filtering putative paralogous sequence variants (PSVs) and multi-sequence variants (MSVs). Overall, 30,302,087 SNPs were identified on the rainbow trout genome 29 chromosomes and 1,139,018 on unplaced scaffolds, with 4,042,723 SNPs having high minor allele frequency (MAF > 0.25). The average SNP density on the chromosomes was one SNP per 64 bp, or 15.6 SNPs per 1 kb. Results from the phylogenetic analysis that we conducted indicate that the SNP markers contain enough population-specific polymorphisms for recovering population relationships despite the small sample size used. Intra-Population polymorphism assessment revealed high level of polymorphism and heterozygosity within each population. We also provide functional annotation based on the genome position of each SNP and evaluate the use of clonal lines for filtering of PSVs and MSVs. These SNPs form a new database, which provides an important resource for a new high density SNP array design and for other SNP genotyping platforms used for genetic and genomics studies of this iconic salmonid fish species.
Paternally inherited microdeletion at 15q11.2 confirms a significant role for the SNORD116 C/D box snoRNA cluster in Prader-Willi syndrome.

PubMed

Duker, Angela L; Ballif, Blake C; Bawle, Erawati V; Person, Richard E; Mahadevan, Sangeetha; Alliman, Sarah; Thompson, Regina; Traylor, Ryan; Bejjani, Bassem A; Shaffer, Lisa G; Rosenfeld, Jill A; Lamb, Allen N; Sahoo, Trilochan

2010-11-01

Prader-Willi syndrome (PWS) is a neurobehavioral disorder manifested by infantile hypotonia and feeding difficulties in infancy, followed by morbid obesity secondary to hyperphagia. It is caused by deficiency of paternally expressed transcript(s) within the human chromosome region 15q11.2. PWS patients harboring balanced chromosomal translocations with breakpoints within small nuclear ribonucleoprotein polypeptide N (SNRPN) have provided indirect evidence for a role for the imprinted C/D box containing small nucleolar RNA (snoRNA) genes encoded downstream of SNRPN. In addition, recently published data provide strong evidence in support of a role for the snoRNA SNORD116 cluster (HBII-85) in PWS etiology. In this study, we performed detailed phenotypic, cytogenetic, and molecular analyses including chromosome analysis, array comparative genomic hybridization (array CGH), expression studies, and single-nucleotide polymorphism (SNP) genotyping for parent-of-origin determination of the 15q11.2 microdeletion on an 11-year-old child expressing the major components of the PWS phenotype. This child had an ∼236.29 kb microdeletion at 15q11.2 within the larger Prader-Willi/Angelman syndrome critical region that included the SNORD116 cluster of snoRNAs. Analysis of SNP genotypes in proband and mother provided evidence in support of the deletion being on the paternal chromosome 15. This child also met most of the major PWS diagnostic criteria including infantile hypotonia, early-onset morbid obesity, and hypogonadism. Identification and characterization of this case provide unequivocal evidence for a critical role for the SNORD116 snoRNA molecules in PWS pathogenesis. Array CGH testing for genomic copy-number changes in cases with complex phenotypes is proving to be invaluable in detecting novel alterations and enabling better genotype-phenotype correlations.
Whole-Exome Sequencing Study of Thyrotropin-Secreting Pituitary Adenomas.

PubMed

Sapkota, Santosh; Horiguchi, Kazuhiko; Tosaka, Masahiko; Yamada, Syozo; Yamada, Masanobu

2017-02-01

Thyrotropin (TSH)-secreting pituitary adenomas (TSHomas) are a rare cause of hyperthyroidism, and the genetic aberrations responsible remain unknown. To identify somatic genetic abnormalities in TSHomas. A single-nucleotide polymorphism (SNP) array analysis was performed on 8 TSHomas. Four tumors with no allelic losses or limited loss of heterozygosity were selected, and whole-exome sequencing was performed, including their corresponding blood samples. Somatic variants were confirmed by Sanger sequencing. A set of 8 tumors was also assessed to validate candidate genes. Twelve patients with sporadic TSHomas were examined. The overall performance of whole-exome sequencing was good, with an average coverage of each base in the targeted region of 97.6%. Six DNA variants were confirmed as candidate driver mutations, with an average of 1.5 somatic mutations per tumor. No mutations were recurrent. Two of these mutations were found in genes with an established role in malignant tumorigenesis (SMOX and SYTL3), and 4 had unknown roles (ZSCAN23, ASTN2, R3HDM2, and CWH43). Similarly, an SNP array analysis revealed frequent chromosomal regions of copy number gains, including recurrent gains at loci harboring 4 of these 6 genes. Several candidate somatic mutations and changes in copy numbers for TSHomas were identified. The results showed no recurrence of mutations in the tumors studied but a low number of mutations, thereby highlighting their benign nature. Further studies on a larger cohort of TSHomas, along with the use of epigenetic and transcriptomic approaches, may reveal the underlying genetic lesions. Copyright © 2017 by the Endocrine Society
Genetic studies in a patient with X-linked retinoschisis coexisting with developmental delay and sensorineural hearing loss.

PubMed

Sudha, Dhandayuthapani; Patric, Irene Rosita Pia; Ganapathy, Aparna; Agarwal, Smitha; Krishna, Shuba; Neriyanuri, Srividya; Sripriya, Sarangapani; Sen, Parveen; Chidambaram, Subbulakshmi; Arunachalam, Jayamuruga Pandian

2017-01-01

In this study, we present a juvenile retinoschisis patient with developmental delay, sensorineural hearing loss, and reduced axial tone. X-linked juvenile retinoschisis (XLRS) is a retinal dystrophy, most often not associated with systemic anomalies and also not showing any locus heterogeneity. Therefore it was of interest to understand the genetic basis of the condition in this patient. RS1 gene screening for XLRS was performed by Sanger sequencing. Whole genome SNP 6.0 array analysis was carried out to investigate gross chromosomal aberrations that could result in systemic phenotype. In addition, targeted next generation sequencing (NGS) was employed to determine any possible involvement of X-linked syndromic and non-syndromic mental retardation genes. This NGS panel consisted of 550 genes implicated in several other rare inherited diseases. RS1 gene screening revealed a pathogenic hemizygous splice site mutation (c.78+1G>T), inherited from the mother. SNP 6.0 array analysis did not indicate any significant chromosomal aberrations that could be disease-associated. Targeted resequencing did not identify any mutations in the X-linked mental retardation genes. However, variations in three other genes (NSD1, LARGE, and POLG) were detected, which were all inherited from the patient's unaffected father. Taken together, RS1 mutation was found to segregate with retinoschisis phenotype while none of the other identified variations were co-segregating with the systemic defects. Hereby, we infer that the multisystemic defects harbored by the patient are a rare coexistence of XLRS, developmental delay, sensorineural hearing loss, and reduced axial tone reported for the first time in the literature.
A common variant near TGFBR3 is associated with primary open angle glaucoma.

PubMed

Li, Zheng; Allingham, R Rand; Nakano, Masakazu; Jia, Liyun; Chen, Yuhong; Ikeda, Yoko; Mani, Baskaran; Chen, Li-Jia; Kee, Changwon; Garway-Heath, David F; Sripriya, Sarangapani; Fuse, Nobuo; Abu-Amero, Khaled K; Huang, Chukai; Namburi, Prasanthi; Burdon, Kathryn; Perera, Shamira A; Gharahkhani, Puya; Lin, Ying; Ueno, Morio; Ozaki, Mineo; Mizoguchi, Takanori; Krishnadas, Subbiah Ramasamy; Osman, Essam A; Lee, Mei Chin; Chan, Anita S Y; Tajudin, Liza-Sharmini A; Do, Tan; Goncalves, Aurelien; Reynier, Pascal; Zhang, Hong; Bourne, Rupert; Goh, David; Broadway, David; Husain, Rahat; Negi, Anil K; Su, Daniel H; Ho, Ching-Lin; Blanco, Augusto Azuara; Leung, Christopher K S; Wong, Tina T; Yakub, Azhany; Liu, Yutao; Nongpiur, Monisha E; Han, Jong Chul; Hon, Do Nhu; Shantha, Balekudaru; Zhao, Bowen; Sang, Jinghong; Zhang, NiHong; Sato, Ryuichi; Yoshii, Kengo; Panda-Jonas, Songhomita; Ashley Koch, Allison E; Herndon, Leon W; Moroi, Sayoko E; Challa, Pratap; Foo, Jia Nee; Bei, Jin-Xin; Zeng, Yi-Xin; Simmons, Cameron P; Bich Chau, Tran Nguyen; Sharmila, Philomenadin Ferdinamarie; Chew, Merwyn; Lim, Blanche; Tam, Pansy O S; Chua, Elaine; Ng, Xiao Yu; Yong, Victor H K; Chong, Yaan Fun; Meah, Wee Yang; Vijayan, Saravanan; Seongsoo, Sohn; Xu, Wang; Teo, Yik Ying; Cooke Bailey, Jessica N; Kang, Jae H; Haines, Jonathan L; Cheng, Ching Yu; Saw, Seang-Mei; Tai, E-Shyong; Richards, Julia E; Ritch, Robert; Gaasterland, Douglas E; Pasquale, Louis R; Liu, Jianjun; Jonas, Jost B; Milea, Dan; George, Ronnie; Al-Obeidan, Saleh A; Mori, Kazuhiko; Macgregor, Stuart; Hewitt, Alex W; Girkin, Christopher A; Zhang, Mingzhi; Sundaresan, Periasamy; Vijaya, Lingam; Mackey, David A; Wong, Tien Yin; Craig, Jamie E; Sun, Xinghuai; Kinoshita, Shigeru; Wiggs, Janey L; Khor, Chiea-Chuen; Yang, Zhenglin; Pang, Chi Pui; Wang, Ningli; Hauser, Michael A; Tashiro, Kei; Aung, Tin; Vithana, Eranga N

2015-07-01

Primary open angle glaucoma (POAG), a major cause of blindness worldwide, is a complex disease with a significant genetic contribution. We performed Exome Array (Illumina) analysis on 3504 POAG cases and 9746 controls with replication of the most significant findings in 9173 POAG cases and 26 780 controls across 18 collections of Asian, African and European descent. Apart from confirming strong evidence of association at CDKN2B-AS1 (rs2157719 [G], odds ratio [OR] = 0.71, P = 2.81 × 10(-33)), we observed one SNP showing significant association to POAG (CDC7-TGFBR3 rs1192415, ORG-allele = 1.13, Pmeta = 1.60 × 10(-8)). This particular SNP has previously been shown to be strongly associated with optic disc area and vertical cup-to-disc ratio, which are regarded as glaucoma-related quantitative traits. Our study now extends this by directly implicating it in POAG disease pathogenesis. © The Author 2015. Published by Oxford University Press.
A common variant near TGFBR3 is associated with primary open angle glaucoma

PubMed Central

Li, Zheng; Allingham, R. Rand; Nakano, Masakazu; Jia, Liyun; Chen, Yuhong; Ikeda, Yoko; Mani, Baskaran; Chen, Li-Jia; Kee, Changwon; Garway-Heath, David F.; Sripriya, Sarangapani; Fuse, Nobuo; Abu-Amero, Khaled K.; Huang, Chukai; Namburi, Prasanthi; Burdon, Kathryn; Perera, Shamira A.; Gharahkhani, Puya; Lin, Ying; Ueno, Morio; Ozaki, Mineo; Mizoguchi, Takanori; Krishnadas, Subbiah Ramasamy; Osman, Essam A.; Lee, Mei Chin; Chan, Anita S.Y.; Tajudin, Liza-Sharmini A.; Do, Tan; Goncalves, Aurelien; Reynier, Pascal; Zhang, Hong; Bourne, Rupert; Goh, David; Broadway, David; Husain, Rahat; Negi, Anil K.; Su, Daniel H; Ho, Ching-Lin; Blanco, Augusto Azuara; Leung, Christopher K.S.; Wong, Tina T.; Yakub, Azhany; Liu, Yutao; Nongpiur, Monisha E.; Han, Jong Chul; Hon, Do Nhu; Shantha, Balekudaru; Zhao, Bowen; Sang, Jinghong; Zhang, NiHong; Sato, Ryuichi; Yoshii, Kengo; Panda-Jonas, Songhomita; Ashley Koch, Allison E.; Herndon, Leon W.; Moroi, Sayoko E.; Challa, Pratap; Foo, Jia Nee; Bei, Jin-Xin; Zeng, Yi-Xin; Simmons, Cameron P.; Bich Chau, Tran Nguyen; Sharmila, Philomenadin Ferdinamarie; Chew, Merwyn; Lim, Blanche; Tam, Pansy O.S.; Chua, Elaine; Ng, Xiao Yu; Yong, Victor H.K.; Chong, Yaan Fun; Meah, Wee Yang; Vijayan, Saravanan; Seongsoo, Sohn; Xu, Wang; Teo, Yik Ying; Cooke Bailey, Jessica N.; Kang, Jae H.; Haines, Jonathan L.; Cheng, Ching Yu; Saw, Seang-Mei; Tai, E-Shyong; Richards, Julia E.; Ritch, Robert; Gaasterland, Douglas E.; Pasquale, Louis R.; Liu, Jianjun; Jonas, Jost B.; Milea, Dan; George, Ronnie; Al-Obeidan, Saleh A.; Mori, Kazuhiko; Macgregor, Stuart; Hewitt, Alex W.; Girkin, Christopher A.; Zhang, Mingzhi; Sundaresan, Periasamy; Vijaya, Lingam; Mackey, David A.; Wong, Tien Yin; Craig, Jamie E.; Sun, Xinghuai; Kinoshita, Shigeru; Wiggs, Janey L.; Khor, Chiea-Chuen; Yang, Zhenglin; Pang, Chi Pui; Wang, Ningli; Hauser, Michael A.; Tashiro, Kei; Aung, Tin; Vithana, Eranga N.

2015-01-01

Primary open angle glaucoma (POAG), a major cause of blindness worldwide, is a complex disease with a significant genetic contribution. We performed Exome Array (Illumina) analysis on 3504 POAG cases and 9746 controls with replication of the most significant findings in 9173 POAG cases and 26 780 controls across 18 collections of Asian, African and European descent. Apart from confirming strong evidence of association at CDKN2B-AS1 (rs2157719 [G], odds ratio [OR] = 0.71, P = 2.81 × 10−33), we observed one SNP showing significant association to POAG (CDC7–TGFBR3 rs1192415, ORG-allele = 1.13, Pmeta = 1.60 × 10−8). This particular SNP has previously been shown to be strongly associated with optic disc area and vertical cup-to-disc ratio, which are regarded as glaucoma-related quantitative traits. Our study now extends this by directly implicating it in POAG disease pathogenesis. PMID:25861811
Genome-wide association study for milking speed in French Holstein cows.

PubMed

Marete, Andrew; Sahana, Goutam; Fritz, Sébastien; Lefebvre, Rachel; Barbat, Anne; Lund, Mogens Sandø; Guldbrandtsen, Bernt; Boichard, Didier

2018-04-25

Using a combination of data from the BovineSNP50 BeadChip SNP array (Illumina, San Diego, CA) and a EuroGenomics (Amsterdam, the Netherlands) custom single nucleotide polymorphism (SNP) chip with SNP pre-selected from whole genome sequence data, we carried out an association study of milking speed in 32,491 French Holstein dairy cows. Milking speed was measured by a score given by the farmer. Phenotypes were yield deviations as obtained from the French evaluation system. They were analyzed with a linear mixed model for association studies. We identified SNP on 22 chromosomes significantly associated with milking speed. As clinical mastitis and somatic cell score have an unfavorable genetic correlation with milking speed, we tested whether the most significant SNP on these 22 chromosomes associated with milking speed were also associated with clinical mastitis or somatic cell score. Nine hundred seventy-one genome-wide significant SNP were associated with milking speed. Of these, 86 were associated with clinical mastitis and 198 with somatic cell score. The most significant association signals for milking speed were observed on chromosomes 7, 8, 10, 14, and 18. The most significant signal was located on chromosome 14 (ZFAT gene). Eleven novel milking speed quantitative trait loci (QTL) were observed on chromosomes 7, 10, 11, 14, 18, 25, and 26. Twelve candidate SNP for milking speed mapped directly within genes. Of these 10 were QTL lead SNP, which mapped within the genes HMHA1, POLR2E, GNB5, KLHL29, ZFAT, KCNB2, CEACAM18, CCL24, and LHPP. Limited pleiotropy was observed between milking speed QTL and clinical mastitis. Copyright © 2018 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.

A high-density intraspecific SNP linkage map of pigeonpea (Cajanas cajan L. Millsp.)

PubMed Central

Mandal, Paritra; Bhutani, Shefali; Dutta, Sutapa; Kumawat, Giriraj; Singh, Bikram Pratap; Chaudhary, A. K.; Yadav, Rekha; Gaikwad, K.; Sevanthi, Amitha Mithra; Datta, Subhojit; Raje, Ranjeet S.; Sharma, Tilak R.; Singh, Nagendra Kumar

2017-01-01

Pigeonpea (Cajanus cajan (L.) Millsp.) is a major food legume cultivated in semi-arid tropical regions including the Indian subcontinent, Africa, and Southeast Asia. It is an important source of protein, minerals, and vitamins for nearly 20% of the world population. Due to high carbon sequestration and drought tolerance, pigeonpea is an important crop for the development of climate resilient agriculture and nutritional security. However, pigeonpea productivity has remained low for decades because of limited genetic and genomic resources, and sparse utilization of landraces and wild pigeonpea germplasm. Here, we present a dense intraspecific linkage map of pigeonpea comprising 932 markers that span a total adjusted map length of 1,411.83 cM. The consensus map is based on three different linkage maps that incorporate a large number of single nucleotide polymorphism (SNP) markers derived from next generation sequencing data, using Illumina GoldenGate bead arrays, and genotyping with restriction site associated DNA (RAD) sequencing. The genotyping-by-sequencing enhanced the marker density but was met with limited success due to lack of common markers across the genotypes of mapping population. The integrated map has 547 bead-array SNP, 319 RAD-SNP, and 65 simple sequence repeat (SSR) marker loci. We also show here correspondence between our linkage map and published genome pseudomolecules of pigeonpea. The availability of a high-density linkage map will help improve the anchoring of the pigeonpea genome to its chromosomes and the mapping of genes and quantitative trait loci associated with useful agronomic traits. PMID:28654689
Construction and Annotation of a High Density SNP Linkage Map of the Atlantic Salmon (Salmo salar) Genome.

PubMed

Tsai, Hsin Y; Robledo, Diego; Lowe, Natalie R; Bekaert, Michael; Taggart, John B; Bron, James E; Houston, Ross D

2016-07-07

High density linkage maps are useful tools for fine-scale mapping of quantitative trait loci, and characterization of the recombination landscape of a species' genome. Genomic resources for Atlantic salmon (Salmo salar) include a well-assembled reference genome, and high density single nucleotide polymorphism (SNP) arrays. Our aim was to create a high density linkage map, and to align it with the reference genome assembly. Over 96,000 SNPs were mapped and ordered on the 29 salmon linkage groups using a pedigreed population comprising 622 fish from 60 nuclear families, all genotyped with the 'ssalar01' high density SNP array. The number of SNPs per group showed a high positive correlation with physical chromosome length (r = 0.95). While the order of markers on the genetic and physical maps was generally consistent, areas of discrepancy were identified. Approximately 6.5% of the previously unmapped reference genome sequence was assigned to chromosomes using the linkage map. Male recombination rate was lower than females across the vast majority of the genome, but with a notable peak in subtelomeric regions. Finally, using RNA-Seq data to annotate the reference genome, the mapped SNPs were categorized according to their predicted function, including annotation of ∼2500 putative nonsynonymous variants. The highest density SNP linkage map for any salmonid species has been created, annotated, and integrated with the Atlantic salmon reference genome assembly. This map highlights the marked heterochiasmy of salmon, and provides a useful resource for salmonid genetics and genomics research. Copyright © 2016 Tsai et al.
Three clinical experiences with SNP array results consistent with parental incest: a narrative with lessons learned.

PubMed

Helm, Benjamin M; Langley, Katherine; Spangler, Brooke; Vergano, Samantha

2014-08-01

Single nucleotide polymorphism microarrays have the ability to reveal parental consanguinity which may or may not be known to healthcare providers. Consanguinity can have significant implications for the health of patients and for individual and family psychosocial well-being. These results often present ethical and legal dilemmas that can have important ramifications. Unexpected consanguinity can be confounding to healthcare professionals who may be unprepared to handle these results or to communicate them to families or other appropriate representatives. There are few published accounts of experiences with consanguinity and SNP arrays. In this paper we discuss three cases where molecular evidence of parental incest was identified by SNP microarray. We hope to further highlight consanguinity as a potential incidental finding, how the cases were handled by the clinical team, and what resources were found to be most helpful. This paper aims to contribute further to professional discourse on incidental findings with genomic technology and how they were addressed clinically. These experiences may provide some guidance on how others can prepare for these findings and help improve practice. As genetic and genomic testing is utilized more by non-genetics providers, we also hope to inform about the importance of engaging with geneticists and genetic counselors when addressing these findings.
Imputation across genotyping arrays for genome-wide association studies: assessment of bias and a correction strategy.

PubMed

Johnson, Eric O; Hancock, Dana B; Levy, Joshua L; Gaddis, Nathan C; Saccone, Nancy L; Bierut, Laura J; Page, Grier P

2013-05-01

A great promise of publicly sharing genome-wide association data is the potential to create composite sets of controls. However, studies often use different genotyping arrays, and imputation to a common set of SNPs has shown substantial bias: a problem which has no broadly applicable solution. Based on the idea that using differing genotyped SNP sets as inputs creates differential imputation errors and thus bias in the composite set of controls, we examined the degree to which each of the following occurs: (1) imputation based on the union of genotyped SNPs (i.e., SNPs available on one or more arrays) results in bias, as evidenced by spurious associations (type 1 error) between imputed genotypes and arbitrarily assigned case/control status; (2) imputation based on the intersection of genotyped SNPs (i.e., SNPs available on all arrays) does not evidence such bias; and (3) imputation quality varies by the size of the intersection of genotyped SNP sets. Imputations were conducted in European Americans and African Americans with reference to HapMap phase II and III data. Imputation based on the union of genotyped SNPs across the Illumina 1M and 550v3 arrays showed spurious associations for 0.2 % of SNPs: ~2,000 false positives per million SNPs imputed. Biases remained problematic for very similar arrays (550v1 vs. 550v3) and were substantial for dissimilar arrays (Illumina 1M vs. Affymetrix 6.0). In all instances, imputing based on the intersection of genotyped SNPs (as few as 30 % of the total SNPs genotyped) eliminated such bias while still achieving good imputation quality.
Identification of De Novo and Rare Inherited Copy Number Variants in Children with Syndromic Congenital Heart Defects.

PubMed

Hussein, Ibtessam R; Bader, Rima S; Chaudhary, Adeel G; Bassiouni, Randa; Alquaiti, Maha; Ashgan, Fai; Schulten, Hans-Juergen; Al Qahtani, Mohammad H

2018-06-01

Congenital heart defects (CHDs) are the most common birth defects in neonatal life. CHDs could be presented as isolated defects or associated with developmental delay (DD) and/or other congenital malformations. A small proportion of cardiac defects are caused by chromosomal abnormalities or single gene defects; however, in a large proportion of cases no genetic diagnosis could be achieved by clinical examination and conventional genetic analysis. The development of genome wide array-Comparative Genomic Hybridization technique (array-CGH) allowed for the detection of cryptic chromosomal imbalances and pathogenic copy number variants (CNVs) not detected by conventional techniques. We investigated 94 patients having CHDs associated with other malformations and/or DD. Clinical examination and Echocardiography was done to all patients to evaluate the type of CHD. To investigate for genome defects we applied high-density array-CGH 2 × 400K (41 patients) and CGH/SNP microarray 2 × 400K (Agilent) for 53 patients. Confirmation of results was done using Fluorescent in situ hybridization (FISH) or qPCR techniques in certain cases. Chromosomal abnormalities such as trisomy 18, 13, 21, microdeletions: del22q11.2, del7q11.23, del18 (p11.32; p11.21), tetrasomy 18p, trisomy 9p, del11q24-q25, add 15p, add(18)(q21.3), and der 9, 15 (q34.2; q11.2) were detected in 21/94 patients (22%) using both conventional cytogenetics methods and array-CGH technique. Cryptic chromosomal anomalies and pathogenic variants were detected in 15/73 (20.5%) cases. CNVs were observed in a large proportion of the studied samples (27/56) (48%). Clustering of variants was observed in chromosome 1p36, 1p21.1, 2q37, 3q29, 5p15, 7p22.3, 8p23, 11p15.5, 14q11.2, 15q11.2, 16p13.3, 16p11.2, 18p11, 21q22, and 22q11.2. CGH/SNP array could detect loss of heterozygosity (LOH) in different chromosomal loci in 10/25 patients. Array-CGH technique allowed for detection of cryptic chromosomal imbalances that could not be detected by conventional cytogenetics methods. CHDs associated with DD/congenital malformations presented with a relatively high rate of cryptic chromosomal abnormalities. Clustering of CNVs in certain genome loci needs further analysis to identify candidate genes that may provide clues for understanding the molecular pathway of cardiac development.
Blood pressure loci identified with a gene-centric array.

PubMed

Johnson, Toby; Gaunt, Tom R; Newhouse, Stephen J; Padmanabhan, Sandosh; Tomaszewski, Maciej; Kumari, Meena; Morris, Richard W; Tzoulaki, Ioanna; O'Brien, Eoin T; Poulter, Neil R; Sever, Peter; Shields, Denis C; Thom, Simon; Wannamethee, Sasiwarang G; Whincup, Peter H; Brown, Morris J; Connell, John M; Dobson, Richard J; Howard, Philip J; Mein, Charles A; Onipinla, Abiodun; Shaw-Hawkins, Sue; Zhang, Yun; Davey Smith, George; Day, Ian N M; Lawlor, Debbie A; Goodall, Alison H; Fowkes, F Gerald; Abecasis, Gonçalo R; Elliott, Paul; Gateva, Vesela; Braund, Peter S; Burton, Paul R; Nelson, Christopher P; Tobin, Martin D; van der Harst, Pim; Glorioso, Nicola; Neuvrith, Hani; Salvi, Erika; Staessen, Jan A; Stucchi, Andrea; Devos, Nabila; Jeunemaitre, Xavier; Plouin, Pierre-François; Tichet, Jean; Juhanson, Peeter; Org, Elin; Putku, Margus; Sõber, Siim; Veldre, Gudrun; Viigimaa, Margus; Levinsson, Anna; Rosengren, Annika; Thelle, Dag S; Hastie, Claire E; Hedner, Thomas; Lee, Wai K; Melander, Olle; Wahlstrand, Björn; Hardy, Rebecca; Wong, Andrew; Cooper, Jackie A; Palmen, Jutta; Chen, Li; Stewart, Alexandre F R; Wells, George A; Westra, Harm-Jan; Wolfs, Marcel G M; Clarke, Robert; Franzosi, Maria Grazia; Goel, Anuj; Hamsten, Anders; Lathrop, Mark; Peden, John F; Seedorf, Udo; Watkins, Hugh; Ouwehand, Willem H; Sambrook, Jennifer; Stephens, Jonathan; Casas, Juan-Pablo; Drenos, Fotios; Holmes, Michael V; Kivimaki, Mika; Shah, Sonia; Shah, Tina; Talmud, Philippa J; Whittaker, John; Wallace, Chris; Delles, Christian; Laan, Maris; Kuh, Diana; Humphries, Steve E; Nyberg, Fredrik; Cusi, Daniele; Roberts, Robert; Newton-Cheh, Christopher; Franke, Lude; Stanton, Alice V; Dominiczak, Anna F; Farrall, Martin; Hingorani, Aroon D; Samani, Nilesh J; Caulfield, Mark J; Munroe, Patricia B

2011-12-09

Raised blood pressure (BP) is a major risk factor for cardiovascular disease. Previous studies have identified 47 distinct genetic variants robustly associated with BP, but collectively these explain only a few percent of the heritability for BP phenotypes. To find additional BP loci, we used a bespoke gene-centric array to genotype an independent discovery sample of 25,118 individuals that combined hypertensive case-control and general population samples. We followed up four SNPs associated with BP at our p < 8.56 × 10(-7) study-specific significance threshold and six suggestively associated SNPs in a further 59,349 individuals. We identified and replicated a SNP at LSP1/TNNT3, a SNP at MTHFR-NPPB independent (r(2) = 0.33) of previous reports, and replicated SNPs at AGT and ATP2B1 reported previously. An analysis of combined discovery and follow-up data identified SNPs significantly associated with BP at p < 8.56 × 10(-7) at four further loci (NPR3, HFE, NOS3, and SOX6). The high number of discoveries made with modest genotyping effort can be attributed to using a large-scale yet targeted genotyping array and to the development of a weighting scheme that maximized power when meta-analyzing results from samples ascertained with extreme phenotypes, in combination with results from nonascertained or population samples. Chromatin immunoprecipitation and transcript expression data highlight potential gene regulatory mechanisms at the MTHFR and NOS3 loci. These results provide candidates for further study to help dissect mechanisms affecting BP and highlight the utility of studying SNPs and samples that are independent of those studied previously even when the sample size is smaller than that in previous studies. Copyright © 2011 The American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.
High-throughput multiplex HLA-typing by ligase detection reaction (LDR) and universal array (UA) approach.

PubMed

Consolandi, Clarissa

2009-01-01

One major goal of genetic research is to understand the role of genetic variation in living systems. In humans, by far the most common type of such variation involves differences in single DNA nucleotides, and is thus termed single nucleotide polymorphism (SNP). The need for improvement in throughput and reliability of traditional techniques makes it necessary to develop new technologies. Thus the past few years have witnessed an extraordinary surge of interest in DNA microarray technology. This new technology offers the first great hope for providing a systematic way to explore the genome. It permits a very rapid analysis of thousands genes for the purpose of gene discovery, sequencing, mapping, expression, and polymorphism detection. We generated a series of analytical tools to address the manufacturing, detection and data analysis components of a microarray experiment. In particular, we set up a universal array approach in combination with a PCR-LDR (polymerase chain reaction-ligation detection reaction) strategy for allele identification in the HLA gene.
A SNP resource for Douglas-fir: de novo transcriptome assembly and SNP detection and validation.

PubMed

Howe, Glenn T; Yu, Jianbin; Knaus, Brian; Cronn, Richard; Kolpak, Scott; Dolan, Peter; Lorenz, W Walter; Dean, Jeffrey F D

2013-02-28

Douglas-fir (Pseudotsuga menziesii), one of the most economically and ecologically important tree species in the world, also has one of the largest tree breeding programs. Although the coastal and interior varieties of Douglas-fir (vars. menziesii and glauca) are native to North America, the coastal variety is also widely planted for timber production in Europe, New Zealand, Australia, and Chile. Our main goal was to develop a SNP resource large enough to facilitate genomic selection in Douglas-fir breeding programs. To accomplish this, we developed a 454-based reference transcriptome for coastal Douglas-fir, annotated and evaluated the quality of the reference, identified putative SNPs, and then validated a sample of those SNPs using the Illumina Infinium genotyping platform. We assembled a reference transcriptome consisting of 25,002 isogroups (unique gene models) and 102,623 singletons from 2.76 million 454 and Sanger cDNA sequences from coastal Douglas-fir. We identified 278,979 unique SNPs by mapping the 454 and Sanger sequences to the reference, and by mapping four datasets of Illumina cDNA sequences from multiple seed sources, genotypes, and tissues. The Illumina datasets represented coastal Douglas-fir (64.00 and 13.41 million reads), interior Douglas-fir (80.45 million reads), and a Yakima population similar to interior Douglas-fir (8.99 million reads). We assayed 8067 SNPs on 260 trees using an Illumina Infinium SNP genotyping array. Of these SNPs, 5847 (72.5%) were called successfully and were polymorphic. Based on our validation efficiency, our SNP database may contain as many as ~200,000 true SNPs, and as many as ~69,000 SNPs that could be genotyped at ~20,000 gene loci using an Infinium II array-more SNPs than are needed to use genomic selection in tree breeding programs. Ultimately, these genomic resources will enhance Douglas-fir breeding and allow us to better understand landscape-scale patterns of genetic variation and potential responses to climate change.
Continuing difficulties in interpreting CNV data: lessons from a genome-wide CNV association study of Australian HNPCC/lynch syndrome patients.

PubMed

Talseth-Palmer, Bente A; Holliday, Elizabeth G; Evans, Tiffany-Jane; McEvoy, Mark; Attia, John; Grice, Desma M; Masson, Amy L; Meldrum, Cliff; Spigelman, Allan; Scott, Rodney J

2013-03-26

Hereditary non-polyposis colorectal cancer (HNPCC)/Lynch syndrome (LS) is a cancer syndrome characterised by early-onset epithelial cancers, especially colorectal cancer (CRC) and endometrial cancer. The aim of the current study was to use SNP-array technology to identify genomic aberrations which could contribute to the increased risk of cancer in HNPCC/LS patients. Individuals diagnosed with HNPCC/LS (100) and healthy controls (384) were genotyped using the Illumina Human610-Quad SNP-arrays. Copy number variation (CNV) calling and association analyses were performed using Nexus software, with significant results validated using QuantiSNP. TaqMan Copy-Number assays were used for verification of CNVs showing significant association with HNPCC/LS identified by both software programs. We detected copy number (CN) gains associated with HNPCC/LS status on chromosome 7q11.21 (28% cases and 0% controls, Nexus; p =3.60E-20 and QuantiSNP; p < 1.00E-16) and 16p11.2 (46% in cases, while a CN loss was observed in 23% of controls, Nexus; p = 4.93E-21 and QuantiSNP; p = 5.00E-06) via in silico analyses. TaqMan Copy-Number assay was used for validation of CNVs showing significant association with HNPCC/LS. In addition, CNV burden (total CNV length, average CNV length and number of observed CNV events) was significantly greater in cases compared to controls. A greater CNV burden was identified in HNPCC/LS cases compared to controls supporting the notion of higher genomic instability in these patients. One intergenic locus on chromosome 7q11.21 is possibly associated with HNPCC/LS and deserves further investigation. The results from this study highlight the complexities of fluorescent based CNV analyses. The inefficiency of both CNV detection methods to reproducibly detect observed CNVs demonstrates the need for sequence data to be considered alongside intensity data to avoid false positive results.
The pitfalls of platform comparison: DNA copy number array technologies assessed

PubMed Central

2009-01-01

Background The accurate and high resolution mapping of DNA copy number aberrations has become an important tool by which to gain insight into the mechanisms of tumourigenesis. There are various commercially available platforms for such studies, but there remains no general consensus as to the optimal platform. There have been several previous platform comparison studies, but they have either described older technologies, used less-complex samples, or have not addressed the issue of the inherent biases in such comparisons. Here we describe a systematic comparison of data from four leading microarray technologies (the Affymetrix Genome-wide SNP 5.0 array, Agilent High-Density CGH Human 244A array, Illumina HumanCNV370-Duo DNA Analysis BeadChip, and the Nimblegen 385 K oligonucleotide array). We compare samples derived from primary breast tumours and their corresponding matched normals, well-established cancer cell lines, and HapMap individuals. By careful consideration and avoidance of potential sources of bias, we aim to provide a fair assessment of platform performance. Results By performing a theoretical assessment of the reproducibility, noise, and sensitivity of each platform, notable differences were revealed. Nimblegen exhibited between-replicate array variances an order of magnitude greater than the other three platforms, with Agilent slightly outperforming the others, and a comparison of self-self hybridizations revealed similar patterns. An assessment of the single probe power revealed that Agilent exhibits the highest sensitivity. Additionally, we performed an in-depth visual assessment of the ability of each platform to detect aberrations of varying sizes. As expected, all platforms were able to identify large aberrations in a robust manner. However, some focal amplifications and deletions were only detected in a subset of the platforms. Conclusion Although there are substantial differences in the design, density, and number of replicate probes, the comparison indicates a generally high level of concordance between platforms, despite differences in the reproducibility, noise, and sensitivity. In general, Agilent tended to be the best aCGH platform and Affymetrix, the superior SNP-CGH platform, but for specific decisions the results described herein provide a guide for platform selection and study design, and the dataset a resource for more tailored comparisons. PMID:19995423
Comparing CNV detection methods for SNP arrays.

PubMed

Winchester, Laura; Yau, Christopher; Ragoussis, Jiannis

2009-09-01

Data from whole genome association studies can now be used for dual purposes, genotyping and copy number detection. In this review we discuss some of the methods for using SNP data to detect copy number events. We examine a number of algorithms designed to detect copy number changes through the use of signal-intensity data and consider methods to evaluate the changes found. We describe the use of several statistical models in copy number detection in germline samples. We also present a comparison of data using these methods to assess accuracy of prediction and detection of changes in copy number.
TLR4 Asp299Gly polymorphism may be protective against chronic periodontitis.

PubMed

Sellers, R M; Payne, J B; Yu, F; LeVan, T D; Walker, C; Mikuls, T R

2016-04-01

Periodontitis results from interplay between genetic and environmental factors. Single nucleotide polymorphisms (SNPs) in the coding region of the toll-like receptor 4 gene (TLR4) may be associated with periodontitis, although previous studies have been inconclusive. Moreover, the interaction between environmental factors, such as cigarette smoking (a major risk factor for periodontitis), and Porphyromonas gingivalis (a major periodontal pathogen) with the TLR4 coding region Asp299Gly SNP (rs4986790; a SNP associated with lipopolysaccharide-mediated inflammatory responses in periodontitis), have been largely ignored in previous reports. Therefore, the objective of this study was to examine the association between TLR4 Asp299Gly (rs4986790) with alveolar bone height loss (ABHL) and periodontitis, accounting for interactions between this SNP with smoking and P. gingivalis prevalence. The CD14/-260 SNP (rs2569190) served as a control, as a recent meta-analysis suggested no relationship between this SNP and periodontitis. This multicenter study included 617 participants who had rheumatoid arthritis or osteoarthritis. This report presents a secondary outcome from the primary case-control study examining the relationship of periodontitis with established rheumatoid arthritis. The Centers for Disease Control/American Academy of Periodontology case definitions of periodontitis were used for this analysis. Participants received a full-mouth clinical periodontal examination and panoramic radiograph. Percentage ABHL was measured on posterior teeth. The TLR4 Asp299Gly and CD14/-260 SNPs were selected a priori and genotypes were determined using the ImmunoChip array (Illumina(®) ). Minor allele frequencies and associations with periodontitis and ABHL did not differ according to rheumatoid arthritis vs. osteoarthritis status; therefore, data from these two groups were pooled. The presence of P. gingivalis was detected in subgingival plaque by PCR. Multivariate ordinal logistic regression examined associations between the SNPs and periodontitis or ABHL. SNP interactions with smoking and P. gingivalis were analyzed. A significant, negative interaction was observed between the TLR4 SNP and the presence of P. gingivalis (p = 0.045) with respect to periodontitis. The TLR4 minor variant was also associated with less ABHL: 16.8% of individuals with low ABHL, 9.0% with moderate ABHL and 11.2% with high ABHL had the minor allele [p = 0.029; odds ratio = 0.58 (95% confidence interval: 0.36-0.95)]. The interaction between the TLR4 SNP and smoking was not significant with respect to periodontitis or ABHL. The CD14 SNP was not associated with periodontitis or ABHL. The TLR4 Asp299Gly SNP significantly interacted with P. gingivalis in conferring a decreased risk of periodontitis and may be protective against ABHL, a feature of periodontitis. Agents blocking TLR4 signaling, a strategy currently under investigation for the treatment of other inflammatory conditions, may warrant investigation in the context of periodontitis related to the presence of P. gingivalis. © 2015 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Intracranial hemangiopericytoma: Case study with cytogenetics and genome wide SNP-A analysis.

PubMed

Holland, Heidrun; Livrea, Michela; Ahnert, Peter; Koschny, Ronald; Kirsten, Holger; Meixensberger, Jürgen; Bauer, Manfred; Schober, Ralf; Fritzsch, Dominik; Krupp, Wolfgang

2011-05-15

The tumor entity of hemangiopericytoma is not universally recognized as a nosological entity by pathologists, and there is a trend toward reassigning it to other categories gradually. However, hemangiopericytomas occurring in the nervous system are included in the new WHO classification of brain tumors, and are distinguished from both meningioma and fibrous tumors. Since there are few genetic studies, we performed a comprehensive cytogenetic analysis of an infratentorial hemangiopericytoma in a 55-year-old female. It was originally classified as a grade II tumor but recurred as a grade III tumor with a proliferation index of 20%. Using trypsin-Giemsa staining (GTG-banding) and multicolor fluorescence in situ hybridization (M-FISH), we could confirm the loss of chromosomal material 10q, which has been previously described in hemangiopericytoma, and we identified de novo chromosomal aberrations on chromosome 8. Applying genome-wide high-density single nucleotide polymorphism array (SNP-A) analysis, we detected segments with loss or gain, as well as clonal deletions or regions suggestive of segmental uniparental disomy. These findings, together with the results of conventional histological and immunohistochemical characterization, provide additional evidence for the nosological separation of hemangiopericytoma in the central nervous system as a biologically different entity. Copyright © 2011 Elsevier GmbH. All rights reserved.
Recovery of Native Genetic Background in Admixed Populations Using Haplotypes, Phenotypes, and Pedigree Information – Using Cika Cattle as a Case Breed

PubMed Central

Simčič, Mojca; Smetko, Anamarija; Sölkner, Johann; Seichter, Doris; Gorjanc, Gregor; Kompan, Dragomir; Medugorac, Ivica

2015-01-01

The aim of this study was to obtain unbiased estimates of the diversity parameters, the population history, and the degree of admixture in Cika cattle which represents the local admixed breeds at risk of extinction undergoing challenging conservation programs. Genetic analyses were performed on the genome-wide Single Nucleotide Polymorphism (SNP) Illumina Bovine SNP50 array data of 76 Cika animals and 531 animals from 14 reference populations. To obtain unbiased estimates we used short haplotypes spanning four markers instead of single SNPs to avoid an ascertainment bias of the BovineSNP50 array. Genome-wide haplotypes combined with partial pedigree and type trait classification show the potential to improve identification of purebred animals with a low degree of admixture. Phylogenetic analyses demonstrated unique genetic identity of Cika animals. Genetic distance matrix presented by rooted Neighbour-Net suggested long and broad phylogenetic connection between Cika and Pinzgauer. Unsupervised clustering performed by the admixture analysis and two-dimensional presentation of the genetic distances between individuals also suggest Cika is a distinct breed despite being similar in appearance to Pinzgauer. Animals identified as the most purebred could be used as a nucleus for a recovery of the native genetic background in the current admixed population. The results show that local well-adapted strains, which have never been intensively managed and differentiated into specific breeds, exhibit large haplotype diversity. They suggest a conservation and recovery approach that does not rely exclusively on the search for the original native genetic background but rather on the identification and removal of common introgressed haplotypes would be more powerful. Successful implementation of such an approach should be based on combining phenotype, pedigree, and genome-wide haplotype data of the breed of interest and a spectrum of reference breeds which potentially have had direct or indirect historical contribution to the genetic makeup of the breed of interest. PMID:25923207
Analysis of high-order SNP barcodes in mitochondrial D-loop for chronic dialysis susceptibility.

PubMed

Yang, Cheng-Hong; Lin, Yu-Da; Chuang, Li-Yeh; Chang, Hsueh-Wei

2016-10-01

Positively identifying disease-associated single nucleotide polymorphism (SNP) markers in genome-wide studies entails the complex association analysis of a huge number of SNPs. Such large numbers of SNP barcode (SNP/genotype combinations) continue to pose serious computational challenges, especially for high-dimensional data. We propose a novel exploiting SNP barcode method based on differential evolution, termed IDE (improved differential evolution). IDE uses a "top combination strategy" to improve the ability of differential evolution to explore high-order SNP barcodes in high-dimensional data. We simulate disease data and use real chronic dialysis data to test four global optimization algorithms. In 48 simulated disease models, we show that IDE outperforms existing global optimization algorithms in terms of exploring ability and power to detect the specific SNP/genotype combinations with a maximum difference between cases and controls. In real data, we show that IDE can be used to evaluate the relative effects of each individual SNP on disease susceptibility. IDE generated significant SNP barcode with less computational complexity than the other algorithms, making IDE ideally suited for analysis of high-order SNP barcodes. Copyright © 2016 Elsevier Inc. All rights reserved.
Similar Genetic Architecture with Shared and Unique Quantitative Trait Loci for Bacterial Cold Water Disease Resistance in Two Rainbow Trout Breeding Populations

PubMed Central

Vallejo, Roger L.; Liu, Sixin; Gao, Guangtu; Fragomeni, Breno O.; Hernandez, Alvaro G.; Leeds, Timothy D.; Parsons, James E.; Martin, Kyle E.; Evenhuis, Jason P.; Welch, Timothy J.; Wiens, Gregory D.; Palti, Yniv

2017-01-01

Bacterial cold water disease (BCWD) causes significant mortality and economic losses in salmonid aquaculture. In previous studies, we identified moderate-large effect quantitative trait loci (QTL) for BCWD resistance in rainbow trout (Oncorhynchus mykiss). However, the recent availability of a 57 K SNP array and a reference genome assembly have enabled us to conduct genome-wide association studies (GWAS) that overcome several experimental limitations from our previous work. In the current study, we conducted GWAS for BCWD resistance in two rainbow trout breeding populations using two genotyping platforms, the 57 K Affymetrix SNP array and restriction-associated DNA (RAD) sequencing. Overall, we identified 14 moderate-large effect QTL that explained up to 60.8% of the genetic variance in one of the two populations and 27.7% in the other. Four of these QTL were found in both populations explaining a substantial proportion of the variance, although major differences were also detected between the two populations. Our results confirm that BCWD resistance is controlled by the oligogenic inheritance of few moderate-large effect loci and a large-unknown number of loci each having a small effect on BCWD resistance. We detected differences in QTL number and genome location between two GWAS models (weighted single-step GBLUP and Bayes B), which highlights the utility of using different models to uncover QTL. The RAD-SNPs detected a greater number of QTL than the 57 K SNP array in one population, suggesting that the RAD-SNPs may uncover polymorphisms that are more unique and informative for the specific population in which they were discovered. PMID:29109734
Similar Genetic Architecture with Shared and Unique Quantitative Trait Loci for Bacterial Cold Water Disease Resistance in Two Rainbow Trout Breeding Populations.

PubMed

Vallejo, Roger L; Liu, Sixin; Gao, Guangtu; Fragomeni, Breno O; Hernandez, Alvaro G; Leeds, Timothy D; Parsons, James E; Martin, Kyle E; Evenhuis, Jason P; Welch, Timothy J; Wiens, Gregory D; Palti, Yniv

2017-01-01

Bacterial cold water disease (BCWD) causes significant mortality and economic losses in salmonid aquaculture. In previous studies, we identified moderate-large effect quantitative trait loci (QTL) for BCWD resistance in rainbow trout ( Oncorhynchus mykiss ). However, the recent availability of a 57 K SNP array and a reference genome assembly have enabled us to conduct genome-wide association studies (GWAS) that overcome several experimental limitations from our previous work. In the current study, we conducted GWAS for BCWD resistance in two rainbow trout breeding populations using two genotyping platforms, the 57 K Affymetrix SNP array and restriction-associated DNA (RAD) sequencing. Overall, we identified 14 moderate-large effect QTL that explained up to 60.8% of the genetic variance in one of the two populations and 27.7% in the other. Four of these QTL were found in both populations explaining a substantial proportion of the variance, although major differences were also detected between the two populations. Our results confirm that BCWD resistance is controlled by the oligogenic inheritance of few moderate-large effect loci and a large-unknown number of loci each having a small effect on BCWD resistance. We detected differences in QTL number and genome location between two GWAS models (weighted single-step GBLUP and Bayes B), which highlights the utility of using different models to uncover QTL. The RAD-SNPs detected a greater number of QTL than the 57 K SNP array in one population, suggesting that the RAD-SNPs may uncover polymorphisms that are more unique and informative for the specific population in which they were discovered.
Genome-wide SNP scan in a porcine Large White×Minzhu intercross population reveals a locus influencing muscle mass on chromosome 2.

PubMed

Liu, Xin; Wang, Li Gang; Luo, Wei Zhen; Li, Yong; Liang, Jing; Yan, Hua; Zhao, Ke Bin; Wang, Li Xian; Zhang, Long Chao

2014-12-01

A high-density single nucleotide polymorphism (SNP) array containing 62 163 markers was employed for a genome-wide association study (GWAS) to identify variants associated with lean meat in ham (LMH, %) and lean meat percentage (LMP, %) within a porcine Large White×Minzhu intercross population. For each individual, LMH and LMP were measured after slaughter at the age of 240±7 days. A total of 557 F2 animals were genotyped. The GWAS revealed that 21 SNPs showed significant genome-wide or chromosome-wide associations with LMH and LMP by the Genome-wide Rapid Association using Mixed Model and Regression-Genomic Control approach. Nineteen significant genome-wide SNPs were mapped to the distal end of Sus Scrofa Chromosome (SSC) 2, where a major known gene responsible for muscle mass, IGF2 is located. A conditioned analysis, in which the genotype of the strongest associated SNP is included as a fixed effect in the model, showed that those significant SNPs on SSC2 were derived from a single quantitative trait locus. The two chromosome-wide association SNPs on SSC1 disappeared after conditioned analysis suggested the association signal is a false association derived from using a F2 population. The present result is expected to lead to novel insights into muscle mass in different pig breeds and lays a preliminary foundation for follow-up studies for identification of causal mutations for subsequent application in marker-assisted selection programs for improving muscle mass in pigs. © 2014 Japanese Society of Animal Science.
Identical cryptic partial monosomy 20pter and trisomy 20qter in three adult siblings due to a large maternal pericentric inversion: detection by MLPA and breakpoint mapping by SNP array analysis.

PubMed

Stevens, Servi J C; Smeets, Eric E J G L; Blom, Eveline; van Uum, Chris M J; Albrechts, Jozefa C M; Herbergs, Jos; Janssen, Jannie W M; Engelen, John J M

2009-10-01

Genotypic and phenotypic data are presented on three adult siblings with mild to moderate mental retardation and mild dysmorphic features. All three siblings showed a chromosome 20 gain at the q-telomere and loss at the p-telomere in routine subtelomeric MLPA screening. Analysis of GTG-banded chromosomes did not detect any abnormalities, but subtelomeric fluorescent in situ hybridization (FISH) confirmed cryptic partial monosomy of chromosome region 20p13 --> 20pter and cryptic partial trisomy of chromosome region 20q13.33 --> 20qter. Furthermore, FISH analysis in the mother showed a cryptic inv(20)(p13q13.33). This explained the cytogenetic mechanism underlying the chromosomal imbalance in the three children, that is, the meiotic formation of a recombinant chromosome 20 due to crossing-over in the inverted segment. All three children thus carried a rec(20)dup(20q)inv(20)(p13q13.33)mat chromosome. SNP array analysis enabled rapid and detailed imbalance sizing and showed a 1.06 Mb loss in 20p13 and a 2.51 Mb gain in 20q13.33, comprising 21 and 78 genes, respectively. The maternal inversion is the largest described thus far for chromosome 20, comprising 94.4% of its length. Such large inversions result in a particularly high risk for live-born unbalanced offspring because the partial monosomy and trisomy segments are small. Moreover, the inversion size is directly related to the percentage of unbalanced gametes due to high crossing-over change within the inverted segment. The fact that all three children carry an identical chromosomal rearrangement has consequences for genetic counseling for carriers of large pericentric inversions, as the recurrence risk is very high.
Loss of Chromosome 18 in Neuroendocrine Tumors of the Small Intestine: The Enigma Remains.

PubMed

Nieser, Maike; Henopp, Tobias; Brix, Joachim; Stoß, Laura; Sitek, Barbara; Naboulsi, Wael; Anlauf, Martin; Schlitter, Anna M; Klöppel, Günter; Gress, Thomas; Moll, Roland; Bartsch, Detlef K; Heverhagen, Anna E; Knoefel, Wolfram T; Kaemmerer, Daniel; Haybaeck, Johannes; Fend, Falko; Sperveslage, Jan; Sipos, Bence

2017-01-01

Neuroendocrine tumors of the small intestine (SI-NETs) exhibit an increasing incidence and high mortality rate. Until now, no fundamental molecular event has been linked to the tumorigenesis and progression of these tumors. Only the loss of chromosome 18 (Chr18) has been shown in up to two thirds of SI-NETs, whereby the significance of this alteration is still not understood. We therefore performed the first comprehensive study to identify Chr18-related events at the genetic, epigenetic and gene/protein expression levels. We did expression analysis of all seven putative Chr18-related tumor suppressors by quantitative real-time PCR (qRT-PCR), Western blot and immunohistochemistry. Next-generation exome sequencing and SNP array analysis were performed with five SI-NETs with (partial) loss of Chr18. Finally, we analyzed all microRNAs (miRNAs) located on Chr18 by qRT-PCR, comparing Chr18+/- and Chr18+/+ SI-NETs. Only DCC (deleted in colorectal cancer) revealed loss of/greatly reduced expression in 6/21 cases (29%). No relevant loss of SMAD2, SMAD4, elongin A3 and CABLES was detected. PMAIP1 and maspin were absent at the protein level. Next-generation sequencing did not reveal relevant recurrent somatic mutations on Chr18 either in an exploratory cohort of five SI-NETs, or in a validation cohort (n = 30). SNP array analysis showed no additional losses. The quantitative analysis of all 27 Chr18-related miRNAs revealed no difference in expression between Chr18+/- and Chr18+/+ SI-NETs. DCC seems to be the only Chr18-related tumor suppressor affected by the monoallelic loss of Chr18 resulting in a loss of DCC protein expression in one third of SI-NETs. No additional genetic or epigenetic alterations were present on Chr18. © 2016 S. Karger AG, Basel.

Genomic Variation by Whole-Genome SNP Mapping Arrays Predicts Time-to-Event Outcome in Patients with Chronic Lymphocytic Leukemia

PubMed Central

Schweighofer, Carmen D.; Coombes, Kevin R.; Majewski, Tadeusz; Barron, Lynn L.; Lerner, Susan; Sargent, Rachel L.; O'Brien, Susan; Ferrajoli, Alessandra; Wierda, William G.; Czerniak, Bogdan A.; Medeiros, L. Jeffrey; Keating, Michael J.; Abruzzo, Lynne V.

2013-01-01

Genomic abnormalities, such as deletions in 11q22 or 17p13, are associated with poorer prognosis in patients with chronic lymphocytic leukemia (CLL). We hypothesized that unknown regions of copy number variation (CNV) affect clinical outcome and can be detected by array-based single-nucleotide polymorphism (SNP) genotyping. We compared SNP genotypes from 168 untreated patients with CLL with genotypes from 73 white HapMap controls. We identified 322 regions of recurrent CNV, 82 of which occurred significantly more often in CLL than in HapMap (CLL-specific CNV), including regions typically aberrant in CLL: deletions in 6q21, 11q22, 13q14, and 17p13 and trisomy 12. In univariate analyses, 35 of total and 11 of CLL-specific CNVs were associated with unfavorable time-to-event outcomes, including gains or losses in chromosomes 2p, 4p, 4q, 6p, 6q, 7q, 11p, 11q, and 17p. In multivariate analyses, six CNVs (ie, CLL-specific variations in 11p15.1-15.4 or 6q27) predicted time-to-treatment or overall survival independently of established markers of prognosis. Moreover, genotypic complexity (ie, the number of independent CNVs per patient) significantly predicted prognosis, with a median time-to-treatment of 64 months versus 23 months in patients with zero to one versus two or more CNVs, respectively (P = 3.3 × 10−8). In summary, a comparison of SNP genotypes from patients with CLL with HapMap controls allowed us to identify known and unknown recurrent CNVs and to determine regions and rates of CNV that predict poorer prognosis in patients with CLL. PMID:23273604
LS-CAP: an algorithm for identifying cytogenetic aberrations in hepatocellular carcinoma using microarray data.

PubMed

He, Xianmin; Wei, Qing; Sun, Meiqian; Fu, Xuping; Fan, Sichang; Li, Yao

2006-05-01

Biological techniques such as Array-Comparative genomic hybridization (CGH), fluorescent in situ hybridization (FISH) and affymetrix single nucleotide pleomorphism (SNP) array have been used to detect cytogenetic aberrations. However, on genomic scale, these techniques are labor intensive and time consuming. Comparative genomic microarray analysis (CGMA) has been used to identify cytogenetic changes in hepatocellular carcinoma (HCC) using gene expression microarray data. However, CGMA algorithm can not give precise localization of aberrations, fails to identify small cytogenetic changes, and exhibits false negatives and positives. Locally un-weighted smoothing cytogenetic aberrations prediction (LS-CAP) based on local smoothing and binomial distribution can be expected to address these problems. LS-CAP algorithm was built and used on HCC microarray profiles. Eighteen cytogenetic abnormalities were identified, among them 5 were reported previously, and 12 were proven by CGH studies. LS-CAP effectively reduced the false negatives and positives, and precisely located small fragments with cytogenetic aberrations.
Comprehensive replication of the relationship between myopia-related genes and refractive errors in a large Japanese cohort.

PubMed

Yoshikawa, Munemitsu; Yamashiro, Kenji; Miyake, Masahiro; Oishi, Maho; Akagi-Kurashige, Yumiko; Kumagai, Kyoko; Nakata, Isao; Nakanishi, Hideo; Oishi, Akio; Gotoh, Norimoto; Yamada, Ryo; Matsuda, Fumihiko; Yoshimura, Nagahisa

2014-10-21

We investigated the association between refractive error in a Japanese population and myopia-related genes identified in two recent large-scale genome-wide association studies. Single-nucleotide polymorphisms (SNPs) in 51 genes that were reported by the Consortium for Refractive Error and Myopia and/or the 23andMe database were genotyped in 3712 healthy Japanese volunteers from the Nagahama Study using HumanHap610K Quad, HumanOmni2.5M, and/or HumanExome Arrays. To evaluate the association between refractive error and recently identified myopia-related genes, we used three approaches to perform quantitative trait locus analyses of mean refractive error in both eyes of the participants: per-SNP, gene-based top-SNP, and gene-based all-SNP analyses. Association plots of successfully replicated genes also were investigated. In our per-SNP analysis, eight myopia gene associations were replicated successfully: GJD2, RASGRF1, BICC1, KCNQ5, CD55, CYP26A1, LRRC4C, and B4GALNT2.Seven additional gene associations were replicated in our gene-based analyses: GRIA4, BMP2, QKI, BMP4, SFRP1, SH3GL2, and EHBP1L1. The signal strength of the reported SNPs and their tagging SNPs increased after considering different linkage disequilibrium patterns across ethnicities. Although two previous studies suggested strong associations between PRSS56, LAMA2, TOX, and RDH5 and myopia, we could not replicate these results. Our results confirmed the significance of the myopia-related genes reported previously and suggested that gene-based replication analyses are more effective than per-SNP analyses. Our comparison with two previous studies suggested that BMP3 SNPs cause myopia primarily in Caucasian populations, while they may exhibit protective effects in Asian populations. Copyright 2014 The Association for Research in Vision and Ophthalmology, Inc.
Developing 100K Affymetrix Axiom SNP Array for Polyploid Sugarcane

USDA-ARS?s Scientific Manuscript database

Sugarcane genotyping or fingerprinting has long been a daunting task due to its high polyploidy level with large number of chromosomes. Single nucleotide polymorphisms (SNPs) are very abundant DNA sequence variations in the genomes. With the advance of next generation sequencing (NGS) technologies, ...
QTL mapping of potato chip color and tuber traits within an autotetraploid family

USDA-ARS?s Scientific Manuscript database

Cultivated potato (Solanum tuberosum L.) is a highly heterozygous autotetraploid crop species, and this presents challenges for traditional line development and molecular breeding. Recent availability of a single nucleotide polymorphism (SNP) array with 8303 features and software packages for linkag...
Assumption-free estimation of the genetic contribution to refractive error across childhood.

PubMed

Guggenheim, Jeremy A; St Pourcain, Beate; McMahon, George; Timpson, Nicholas J; Evans, David M; Williams, Cathy

2015-01-01

Studies in relatives have generally yielded high heritability estimates for refractive error: twins 75-90%, families 15-70%. However, because related individuals often share a common environment, these estimates are inflated (via misallocation of unique/common environment variance). We calculated a lower-bound heritability estimate for refractive error free from such bias. Between the ages 7 and 15 years, participants in the Avon Longitudinal Study of Parents and Children (ALSPAC) underwent non-cycloplegic autorefraction at regular research clinics. At each age, an estimate of the variance in refractive error explained by single nucleotide polymorphism (SNP) genetic variants was calculated using genome-wide complex trait analysis (GCTA) using high-density genome-wide SNP genotype information (minimum N at each age=3,404). The variance in refractive error explained by the SNPs ("SNP heritability") was stable over childhood: Across age 7-15 years, SNP heritability averaged 0.28 (SE=0.08, p<0.001). The genetic correlation for refractive error between visits varied from 0.77 to 1.00 (all p<0.001) demonstrating that a common set of SNPs was responsible for the genetic contribution to refractive error across this period of childhood. Simulations suggested lack of cycloplegia during autorefraction led to a small underestimation of SNP heritability (adjusted SNP heritability=0.35; SE=0.09). To put these results in context, the variance in refractive error explained (or predicted) by the time participants spent outdoors was <0.005 and by the time spent reading was <0.01, based on a parental questionnaire completed when the child was aged 8-9 years old. Genetic variation captured by common SNPs explained approximately 35% of the variation in refractive error between unrelated subjects. This value sets an upper limit for predicting refractive error using existing SNP genotyping arrays, although higher-density genotyping in larger samples and inclusion of interaction effects is expected to raise this figure toward twin- and family-based heritability estimates. The same SNPs influenced refractive error across much of childhood. Notwithstanding the strong evidence of association between time outdoors and myopia, and time reading and myopia, less than 1% of the variance in myopia at age 15 was explained by crude measures of these two risk factors, indicating that their effects may be limited, at least when averaged over the whole population.
Dynamic variable selection in SNP genotype autocalling from APEX microarray data.

PubMed

Podder, Mohua; Welch, William J; Zamar, Ruben H; Tebbutt, Scott J

2006-11-30

Single nucleotide polymorphisms (SNPs) are DNA sequence variations, occurring when a single nucleotide--adenine (A), thymine (T), cytosine (C) or guanine (G)--is altered. Arguably, SNPs account for more than 90% of human genetic variation. Our laboratory has developed a highly redundant SNP genotyping assay consisting of multiple probes with signals from multiple channels for a single SNP, based on arrayed primer extension (APEX). This mini-sequencing method is a powerful combination of a highly parallel microarray with distinctive Sanger-based dideoxy terminator sequencing chemistry. Using this microarray platform, our current genotype calling system (known as SNP Chart) is capable of calling single SNP genotypes by manual inspection of the APEX data, which is time-consuming and exposed to user subjectivity bias. Using a set of 32 Coriell DNA samples plus three negative PCR controls as a training data set, we have developed a fully-automated genotyping algorithm based on simple linear discriminant analysis (LDA) using dynamic variable selection. The algorithm combines separate analyses based on the multiple probe sets to give a final posterior probability for each candidate genotype. We have tested our algorithm on a completely independent data set of 270 DNA samples, with validated genotypes, from patients admitted to the intensive care unit (ICU) of St. Paul's Hospital (plus one negative PCR control sample). Our method achieves a concordance rate of 98.9% with a 99.6% call rate for a set of 96 SNPs. By adjusting the threshold value for the final posterior probability of the called genotype, the call rate reduces to 94.9% with a higher concordance rate of 99.6%. We also reversed the two independent data sets in their training and testing roles, achieving a concordance rate up to 99.8%. The strength of this APEX chemistry-based platform is its unique redundancy having multiple probes for a single SNP. Our model-based genotype calling algorithm captures the redundancy in the system considering all the underlying probe features of a particular SNP, automatically down-weighting any 'bad data' corresponding to image artifacts on the microarray slide or failure of a specific chemistry. In this regard, our method is able to automatically select the probes which work well and reduce the effect of other so-called bad performing probes in a sample-specific manner, for any number of SNPs.
Powerful Identification of Cis-regulatory SNPs in Human Primary Monocytes Using Allele-Specific Gene Expression

PubMed Central

Almlöf, Jonas Carlsson; Lundmark, Per; Lundmark, Anders; Ge, Bing; Maouche, Seraya; Göring, Harald H. H.; Liljedahl, Ulrika; Enström, Camilla; Brocheton, Jessy; Proust, Carole; Godefroy, Tiphaine; Sambrook, Jennifer G.; Jolley, Jennifer; Crisp-Hihn, Abigail; Foad, Nicola; Lloyd-Jones, Heather; Stephens, Jonathan; Gwilliam, Rhian; Rice, Catherine M.; Hengstenberg, Christian; Samani, Nilesh J.; Erdmann, Jeanette; Schunkert, Heribert; Pastinen, Tomi; Deloukas, Panos; Goodall, Alison H.; Ouwehand, Willem H.; Cambien, François; Syvänen, Ann-Christine

2012-01-01

A large number of genome-wide association studies have been performed during the past five years to identify associations between SNPs and human complex diseases and traits. The assignment of a functional role for the identified disease-associated SNP is not straight-forward. Genome-wide expression quantitative trait locus (eQTL) analysis is frequently used as the initial step to define a function while allele-specific gene expression (ASE) analysis has not yet gained a wide-spread use in disease mapping studies. We compared the power to identify cis-acting regulatory SNPs (cis-rSNPs) by genome-wide allele-specific gene expression (ASE) analysis with that of traditional expression quantitative trait locus (eQTL) mapping. Our study included 395 healthy blood donors for whom global gene expression profiles in circulating monocytes were determined by Illumina BeadArrays. ASE was assessed in a subset of these monocytes from 188 donors by quantitative genotyping of mRNA using a genome-wide panel of SNP markers. The performance of the two methods for detecting cis-rSNPs was evaluated by comparing associations between SNP genotypes and gene expression levels in sample sets of varying size. We found that up to 8-fold more samples are required for eQTL mapping to reach the same statistical power as that obtained by ASE analysis for the same rSNPs. The performance of ASE is insensitive to SNPs with low minor allele frequencies and detects a larger number of significantly associated rSNPs using the same sample size as eQTL mapping. An unequivocal conclusion from our comparison is that ASE analysis is more sensitive for detecting cis-rSNPs than standard eQTL mapping. Our study shows the potential of ASE mapping in tissue samples and primary cells which are difficult to obtain in large numbers. PMID:23300628
Identification of Genes Promoting Skin Youthfulness by Genome-Wide Association Study

PubMed Central

Chang, Anne L.S.; Atzmon, Gil; Bergman, Aviv; Brugmann, Samantha; Atwood, Scott X; Chang, Howard Y; Barzilai, Nir

2014-01-01

To identify genes that promote facial skin youthfulness (SY), a genome-wide association study on an Ashkenazi Jewish discovery group (n=428) was performed using Affymetrix 6.0 Single-Nucleotide Polymorphism (SNP) Array. After SNP quality controls, 901,470 SNPs remained for analysis. The eigenstrat method showed no stratification. Cases and controls were identified by global facial skin aging severity including intrinsic and extrinsic parameters. Linear regression adjusted for age and gender, with no significant differences in smoking history, body mass index, menopausal status, or personal or family history of centenarians. Six SNPs met the Bonferroni threshold with Pallele<10−8; two of these six had Pgenotype<10−8. Quantitative trait loci mapping confirmed linkage disequilibrium. The six SNPs were interrogated by MassARRAY in a replication group (n=436) with confirmation of rs6975107, an intronic region of KCND2 (potassium voltage-gated channel, Shal-related family member 2) (Pgenotype=0.023). A second replication group (n=371) confirmed rs318125, downstream of DIAPH2 (diaphanous homolog 2 (Drosophila)) (Pallele=0.010, Pgenotype=0.002) and rs7616661, downstream of EDEM1 (ER degradation enhancer, mannosidase α-like 1) (Pgenotype=0.042). DIAPH2 has been associated with premature ovarian insufficiency, an aging phenotype in humans. EDEM1 associates with lifespan in animal models, although not humans. KCND2 is expressed in human skin, but has not been associated with aging. These genes represent new candidate genes to study the molecular basis of healthy skin aging. PMID:24037343
LD2SNPing: linkage disequilibrium plotter and RFLP enzyme mining for tag SNPs

PubMed Central

Chang, Hsueh-Wei; Chuang, Li-Yeh; Chang, Yan-Jhu; Cheng, Yu-Huei; Hung, Yu-Chen; Chen, Hsiang-Chi; Yang, Cheng-Hong

2009-01-01

Background Linkage disequilibrium (LD) mapping is commonly used to evaluate markers for genome-wide association studies. Most types of LD software focus strictly on LD analysis and visualization, but lack supporting services for genotyping. Results We developed a freeware called LD2SNPing, which provides a complete package of mining tools for genotyping and LD analysis environments. The software provides SNP ID- and gene-centric online retrievals for SNP information and tag SNP selection from dbSNP/NCBI and HapMap, respectively. Restriction fragment length polymorphism (RFLP) enzyme information for SNP genotype is available to all SNP IDs and tag SNPs. Single and multiple SNP inputs are possible in order to perform LD analysis by online retrieval from HapMap and NCBI. An LD statistics section provides D, D', r2, δQ, ρ, and the P values of the Hardy-Weinberg Equilibrium for each SNP marker, and Chi-square and likelihood-ratio tests for the pair-wise association of two SNPs in LD calculation. Finally, 2D and 3D plots, as well as plain-text output of the results, can be selected. Conclusion LD2SNPing thus provides a novel visualization environment for multiple SNP input, which facilitates SNP association studies. The software, user manual, and tutorial are freely available at . PMID:19500380
Development and utilization of 100K SNP array in Saccharum Spp.

USDA-ARS?s Scientific Manuscript database

Sugarcane genotyping or fingerprinting has long been a daunting task due to its high polyploidy level with large number of chromosomes. Single nucleotide polymorphisms (SNPs) are very abundant DNA sequence variations in the genome. With the advance of next generation sequencing (NGS) technologies, m...
Combination of RNAseq and SNP nanofluidic array reveals the center of genetic diversity of cacao pathogen Moniliophthora roreri in the upper Magdalena Valley of Colombia and its clonality.

PubMed

Ali, Shahin S; Shao, Jonathan; Strem, Mary D; Phillips-Mora, Wilberth; Zhang, Dapeng; Meinhardt, Lyndel W; Bailey, Bryan A

2015-01-01

Moniliophthora roreri is the fungal pathogen that causes frosty pod rot (FPR) disease of Theobroma cacao L., the source of chocolate. FPR occurs in most of the cacao producing countries in the Western Hemisphere, causing yield losses up to 80%. Genetic diversity within the FPR pathogen population may allow the population to adapt to changing environmental conditions and adapt to enhanced resistance in the host plant. The present study developed single nucleotide polymorphism (SNP) markers from RNASeq results for 13 M. roreri isolates and validated the markers for their ability to reveal genetic diversity in an international M. roreri collection. The SNP resources reported herein represent the first study of RNA sequencing (RNASeq)-derived SNP validation in M. roreri and demonstrates the utility of RNASeq as an approach for de novo SNP identification in M. roreri. A total of 88 polymorphic SNPs were used to evaluate the genetic diversity of 172 M. roreri cacao isolates resulting in 37 distinct genotypes (including 14 synonymous groups). Absence of heterozygosity for the 88 SNP markers indicates reproduction in M. roreri is clonal and likely due to a homothallic life style. The upper Magdalena Valley of Colombia showed the highest levels of genetic diversity with 20 distinct genotypes of which 13 were limited to this region, and indicates this region as the possible center of origin for M. roreri.
Combination of RNAseq and SNP nanofluidic array reveals the center of genetic diversity of cacao pathogen Moniliophthora roreri in the upper Magdalena Valley of Colombia and its clonality

PubMed Central

Ali, Shahin S.; Shao, Jonathan; Strem, Mary D.; Phillips-Mora, Wilberth; Zhang, Dapeng; Meinhardt, Lyndel W.; Bailey, Bryan A.

2015-01-01

Moniliophthora roreri is the fungal pathogen that causes frosty pod rot (FPR) disease of Theobroma cacao L., the source of chocolate. FPR occurs in most of the cacao producing countries in the Western Hemisphere, causing yield losses up to 80%. Genetic diversity within the FPR pathogen population may allow the population to adapt to changing environmental conditions and adapt to enhanced resistance in the host plant. The present study developed single nucleotide polymorphism (SNP) markers from RNASeq results for 13 M. roreri isolates and validated the markers for their ability to reveal genetic diversity in an international M. roreri collection. The SNP resources reported herein represent the first study of RNA sequencing (RNASeq)-derived SNP validation in M. roreri and demonstrates the utility of RNASeq as an approach for de novo SNP identification in M. roreri. A total of 88 polymorphic SNPs were used to evaluate the genetic diversity of 172 M. roreri cacao isolates resulting in 37 distinct genotypes (including 14 synonymous groups). Absence of heterozygosity for the 88 SNP markers indicates reproduction in M. roreri is clonal and likely due to a homothallic life style. The upper Magdalena Valley of Colombia showed the highest levels of genetic diversity with 20 distinct genotypes of which 13 were limited to this region, and indicates this region as the possible center of origin for M. roreri. PMID:26379633
Association of methionine synthase gene polymorphisms with wool production and quality traits in Chinese Merino population.

PubMed

Rong, E G; Yang, H; Zhang, Z W; Wang, Z P; Yan, X H; Li, H; Wang, N

2015-10-01

Methionine synthase (MTR) plays a crucial role in maintaining homeostasis of intracellular methionine, folate, and homocysteine, and its activity correlates with DNA methylation in many mammalian tissues. Our previous genomewide association study identified that 1 SNP located in the gene was associated with several wool production and quality traits in Chinese Merino. To confirm the potential involvement of the gene in sheep wool production and quality traits, we performed sheep tissue expression profiling, SNP detection, and association analysis with sheep wool production and quality traits. The semiquantitative reverse transcription PCR analysis showed that the gene was differentially expressed in skin from Merino and Kazak sheep. The sequencing analysis identified a total of 13 SNP in the gene from Chinese Merino sheep. Comparison of the allele frequencies revealed that these 13 identified SNP were significantly different among the 6 tested Chinese Merino strains ( < 0.001). Linkage disequilibrium analysis showed that SNP 3 to 11 were strongly linked in a single haplotype block in the tested population. Association analysis showed that SNP 2 to 11 were significantly associated with the average wool fiber diameter and the fineness SD and that SNP 4 to 11 were significantly associated with the CV of fiber diameter trait ( < 0.05). Single nucleotide polymorphism 2 and SNP 5 to 12 were weakly associated with wool crimp. Similarly, the haplotypes derived from these 13 identified SNP were also significantly associated with the average wool fiber diameter, fineness SD, and the CV of fiber diameter ( < 0.05). Our results suggest that is a candidate gene for sheep wool production and quality traits, and the identified SNP might be used in sheep breeding.
DNA isolation protocol effects on nuclear DNA analysis by microarrays, droplet digital PCR, and whole genome sequencing, and on mitochondrial DNA copy number estimation.

PubMed

Nacheva, Elizabeth; Mokretar, Katya; Soenmez, Aynur; Pittman, Alan M; Grace, Colin; Valli, Roberto; Ejaz, Ayesha; Vattathil, Selina; Maserati, Emanuela; Houlden, Henry; Taanman, Jan-Willem; Schapira, Anthony H; Proukakis, Christos

2017-01-01

Potential bias introduced during DNA isolation is inadequately explored, although it could have significant impact on downstream analysis. To investigate this in human brain, we isolated DNA from cerebellum and frontal cortex using spin columns under different conditions, and salting-out. We first analysed DNA using array CGH, which revealed a striking wave pattern suggesting primarily GC-rich cerebellar losses, even against matched frontal cortex DNA, with a similar pattern on a SNP array. The aCGH changes varied with the isolation protocol. Droplet digital PCR of two genes also showed protocol-dependent losses. Whole genome sequencing showed GC-dependent variation in coverage with spin column isolation from cerebellum. We also extracted and sequenced DNA from substantia nigra using salting-out and phenol / chloroform. The mtDNA copy number, assessed by reads mapping to the mitochondrial genome, was higher in substantia nigra when using phenol / chloroform. We thus provide evidence for significant method-dependent bias in DNA isolation from human brain, as reported in rat tissues. This may contribute to array "waves", and could affect copy number determination, particularly if mosaicism is being sought, and sequencing coverage. Variations in isolation protocol may also affect apparent mtDNA abundance.
DNA isolation protocol effects on nuclear DNA analysis by microarrays, droplet digital PCR, and whole genome sequencing, and on mitochondrial DNA copy number estimation

PubMed Central

Nacheva, Elizabeth; Mokretar, Katya; Soenmez, Aynur; Pittman, Alan M.; Grace, Colin; Valli, Roberto; Ejaz, Ayesha; Vattathil, Selina; Maserati, Emanuela; Houlden, Henry; Taanman, Jan-Willem; Schapira, Anthony H.

2017-01-01

Potential bias introduced during DNA isolation is inadequately explored, although it could have significant impact on downstream analysis. To investigate this in human brain, we isolated DNA from cerebellum and frontal cortex using spin columns under different conditions, and salting-out. We first analysed DNA using array CGH, which revealed a striking wave pattern suggesting primarily GC-rich cerebellar losses, even against matched frontal cortex DNA, with a similar pattern on a SNP array. The aCGH changes varied with the isolation protocol. Droplet digital PCR of two genes also showed protocol-dependent losses. Whole genome sequencing showed GC-dependent variation in coverage with spin column isolation from cerebellum. We also extracted and sequenced DNA from substantia nigra using salting-out and phenol / chloroform. The mtDNA copy number, assessed by reads mapping to the mitochondrial genome, was higher in substantia nigra when using phenol / chloroform. We thus provide evidence for significant method-dependent bias in DNA isolation from human brain, as reported in rat tissues. This may contribute to array “waves”, and could affect copy number determination, particularly if mosaicism is being sought, and sequencing coverage. Variations in isolation protocol may also affect apparent mtDNA abundance. PMID:28683077
Type 2 diabetes mellitus disease risk genes identified by genome wide copy number variation scan in normal populations.

PubMed

Prabhanjan, Manasa; Suresh, Raviraj V; Murthy, Megha N; Ramachandra, Nallur B

2016-03-01

To identify the role of copy number variations (CNVs) on disease risk genes and its effect on disease phenotypes in type 2 diabetes mellitus (T2DM) in 12 random populations using high throughput arrays. CNV analysis was carried out on a total of 1715 individuals from 12 populations, from ArrayExpress Archive of the European Bioinformatics Institute along with our subjects using Affymetrix Genome Wide SNP 6.0 array. CNV effect on T2DM genes were analyzed using several bioinformatics tools and a molecular protein interaction network was constructed to identify the disease mechanism altered by the CNVs. Analysis showed 34.4% of the total population to be under CNV burden for T2DM, with 83 disease causal and associated genes being under CNV influence. Hotspots were identified on chromosomes 22, 12, 6, 19 and 11.Overlap studies with case cohorts revealed significant disease risk genes such as EGFR, E2F1, PPP1R3A, HLA and TSPAN8. CNVs play a significant role in predisposing T2DM in normal cohorts and contribute to the phenotypic effects. Thus, CNVs should be considered as one of the major contributors in predisposition of the disease. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Increasing feed efficiency and reducing methane emissions using genomics: An international approach

USDA-ARS?s Scientific Manuscript database

Genomic technology (including SNP arrays and next-generation sequencing) is a powerful driver for the genetic improvement of livestock. Phenotype recording can now, to an extent, be partitioned from selection, and even limited to several thousand animals. Rapid development of new technologies and pr...
Linkage Disequilibrium And Genome-Wide Association Studies In O. sativa

USDA-ARS?s Scientific Manuscript database

There is increasing evidence that genome-wide association studies provide a powerful approach to find the genetic basis of complex phenotypic variation in all kinds of species. For this purpose, we developed the first generation 44K Affymetrix SNP array in rice (see Tung et al. poster). We genotyped...
CIDR

Science.gov Websites

variety of arrays appropriate for a wide breadth of study design needs. Genomic coverage of many of the chromosomal anomalies are services offered at NO ADDITIONAL COST to study investigators with GWAS projects be submitted for both the initial GWAS study as well as replication using our custom SNP service

Mapping a New Spontaneous Preterm Birth Susceptibility Gene, IGF1R, Using Linkage, Haplotype Sharing, and Association Analysis

PubMed Central

Luukkonen, Aino; Teramo, Kari; Puttonen, Hilkka; Ojaniemi, Marja; Varilo, Teppo; Chaudhari, Bimal P.; Plunkett, Jevon; Murray, Jeffrey C.; McCarroll, Steven A.; Muglia, Louis J.; Palotie, Aarno; Hallman, Mikko

2011-01-01

Preterm birth is the major cause of neonatal death and serious morbidity. Most preterm births are due to spontaneous onset of labor without a known cause or effective prevention. Both maternal and fetal genomes influence the predisposition to spontaneous preterm birth (SPTB), but the susceptibility loci remain to be defined. We utilized a combination of unique population structures, family-based linkage analysis, and subsequent case-control association to identify a susceptibility haplotype for SPTB. Clinically well-characterized SPTB families from northern Finland, a subisolate founded by a relatively small founder population that has subsequently experienced a number of bottlenecks, were selected for the initial discovery sample. Genome-wide linkage analysis using a high-density single-nucleotide polymorphism (SNP) array in seven large northern Finnish non-consanginous families identified a locus on 15q26.3 (HLOD 4.68). This region contains the IGF1R gene, which encodes the type 1 insulin-like growth factor receptor IGF-1R. Haplotype segregation analysis revealed that a 55 kb 12-SNP core segment within the IGF1R gene was shared identical-by-state (IBS) in five families. A follow-up case-control study in an independent sample representing the more general Finnish population showed an association of a 6-SNP IGF1R haplotype with SPTB in the fetuses, providing further evidence for IGF1R as a SPTB predisposition gene (frequency in cases versus controls 0.11 versus 0.05, P = 0.001, odds ratio 2.3). This study demonstrates the identification of a predisposing, low-frequency haplotype in a multifactorial trait using a well-characterized population and a combination of family and case-control designs. Our findings support the identification of the novel susceptibility gene IGF1R for predisposition by the fetal genome to being born preterm. PMID:21304894
Genome-wide association analysis reveals loci associated with resistance against Piscirickettsia salmonis in two Atlantic salmon (Salmo salar L.) chromosomes.

PubMed

Correa, Katharina; Lhorente, Jean P; López, María E; Bassini, Liane; Naswa, Sudhir; Deeb, Nader; Di Genova, Alex; Maass, Alejandro; Davidson, William S; Yáñez, José M

2015-10-24

Pisciricketssia salmonis is the causal agent of Salmon Rickettsial Syndrome (SRS), which affects salmon species and causes severe economic losses. Selective breeding for disease resistance represents one approach for controlling SRS in farmed Atlantic salmon. Knowledge concerning the architecture of the resistance trait is needed before deciding on the most appropriate approach to enhance artificial selection for P. salmonis resistance in Atlantic salmon. The purpose of the study was to dissect the genetic variation in the resistance to this pathogen in Atlantic salmon. 2,601 Atlantic salmon smolts were experimentally challenged against P. salmonis by means of intra-peritoneal injection. These smolts were the progeny of 40 sires and 118 dams from a Chilean breeding population. Mortalities were recorded daily and the experiment ended at day 40 post-inoculation. Fish were genotyped using a 50K Affymetrix® Axiom® myDesignTM Single Nucleotide Polymorphism (SNP) Genotyping Array. A Genome Wide Association Analysis was performed on data from the challenged fish. Linear regression and logistic regression models were tested. Genome Wide Association Analysis indicated that resistance to P. salmonis is a moderately polygenic trait. There were five SNPs in chromosomes Ssa01 and Ssa17 significantly associated with the traits analysed. The proportion of the phenotypic variance explained by each marker is small, ranging from 0.007 to 0.045. Candidate genes including interleukin receptors and fucosyltransferase have been found to be physically linked with these genetic markers and may play an important role in the differential immune response against this pathogen. Due to the small amount of variance explained by each significant marker we conclude that genetic resistance to this pathogen can be more efficiently improved with the implementation of genetic evaluations incorporating genotype information from a dense SNP array.
Continuing difficulties in interpreting CNV data: lessons from a genome-wide CNV association study of Australian HNPCC/lynch syndrome patients

PubMed Central

2013-01-01

Background Hereditary non-polyposis colorectal cancer (HNPCC)/Lynch syndrome (LS) is a cancer syndrome characterised by early-onset epithelial cancers, especially colorectal cancer (CRC) and endometrial cancer. The aim of the current study was to use SNP-array technology to identify genomic aberrations which could contribute to the increased risk of cancer in HNPCC/LS patients. Methods Individuals diagnosed with HNPCC/LS (100) and healthy controls (384) were genotyped using the Illumina Human610-Quad SNP-arrays. Copy number variation (CNV) calling and association analyses were performed using Nexus software, with significant results validated using QuantiSNP. TaqMan Copy-Number assays were used for verification of CNVs showing significant association with HNPCC/LS identified by both software programs. Results We detected copy number (CN) gains associated with HNPCC/LS status on chromosome 7q11.21 (28% cases and 0% controls, Nexus; p = 3.60E-20 and QuantiSNP; p < 1.00E-16) and 16p11.2 (46% in cases, while a CN loss was observed in 23% of controls, Nexus; p = 4.93E-21 and QuantiSNP; p = 5.00E-06) via in silico analyses. TaqMan Copy-Number assay was used for validation of CNVs showing significant association with HNPCC/LS. In addition, CNV burden (total CNV length, average CNV length and number of observed CNV events) was significantly greater in cases compared to controls. Conclusion A greater CNV burden was identified in HNPCC/LS cases compared to controls supporting the notion of higher genomic instability in these patients. One intergenic locus on chromosome 7q11.21 is possibly associated with HNPCC/LS and deserves further investigation. The results from this study highlight the complexities of fluorescent based CNV analyses. The inefficiency of both CNV detection methods to reproducibly detect observed CNVs demonstrates the need for sequence data to be considered alongside intensity data to avoid false positive results. PMID:23531357
Fine definition of the pedigree haplotypes of closely related rice cultivars by means of genome-wide discovery of single-nucleotide polymorphisms.

PubMed

Yamamoto, Toshio; Nagasaki, Hideki; Yonemaru, Jun-ichi; Ebana, Kaworu; Nakajima, Maiko; Shibaya, Taeko; Yano, Masahiro

2010-04-27

To create useful gene combinations in crop breeding, it is necessary to clarify the dynamics of the genome composition created by breeding practices. A large quantity of single-nucleotide polymorphism (SNP) data is required to permit discrimination of chromosome segments among modern cultivars, which are genetically related. Here, we used a high-throughput sequencer to conduct whole-genome sequencing of an elite Japanese rice cultivar, Koshihikari, which is closely related to Nipponbare, whose genome sequencing has been completed. Then we designed a high-throughput typing array based on the SNP information by comparison of the two sequences. Finally, we applied this array to analyze historical representative rice cultivars to understand the dynamics of their genome composition. The total 5.89-Gb sequence for Koshihikari, equivalent to 15.7 x the entire rice genome, was mapped using the Pseudomolecules 4.0 database for Nipponbare. The resultant Koshihikari genome sequence corresponded to 80.1% of the Nipponbare sequence and led to the identification of 67,051 SNPs. A high-throughput typing array consisting of 1917 SNP sites distributed throughout the genome was designed to genotype 151 representative Japanese cultivars that have been grown during the past 150 years. We could identify the ancestral origin of the pedigree haplotypes in 60.9% of the Koshihikari genome and 18 consensus haplotype blocks which are inherited from traditional landraces to current improved varieties. Moreover, it was predicted that modern breeding practices have generally decreased genetic diversity Detection of genome-wide SNPs by both high-throughput sequencer and typing array made it possible to evaluate genomic composition of genetically related rice varieties. With the aid of their pedigree information, we clarified the dynamics of chromosome recombination during the historical rice breeding process. We also found several genomic regions decreasing genetic diversity which might be caused by a recent human selection in rice breeding. The definition of pedigree haplotypes by means of genome-wide SNPs will facilitate next-generation breeding of rice and other crops.
Sequential sentinel SNP Regional Association Plots (SSS-RAP): an approach for testing independence of SNP association signals using meta-analysis data.

PubMed

Zheng, Jie; Gaunt, Tom R; Day, Ian N M

2013-01-01

Genome-Wide Association Studies (GWAS) frequently incorporate meta-analysis within their framework. However, conditional analysis of individual-level data, which is an established approach for fine mapping of causal sites, is often precluded where only group-level summary data are available for analysis. Here, we present a numerical and graphical approach, "sequential sentinel SNP regional association plot" (SSS-RAP), which estimates regression coefficients (beta) with their standard errors using the meta-analysis summary results directly. Under an additive model, typical for genes with small effect, the effect for a sentinel SNP can be transformed to the predicted effect for a possibly dependent SNP through a 2×2 2-SNP haplotypes table. The approach assumes Hardy-Weinberg equilibrium for test SNPs. SSS-RAP is available as a Web-tool (http://apps.biocompute.org.uk/sssrap/sssrap.cgi). To develop and illustrate SSS-RAP we analyzed lipid and ECG traits data from the British Women's Heart and Health Study (BWHHS), evaluated a meta-analysis for ECG trait and presented several simulations. We compared results with existing approaches such as model selection methods and conditional analysis. Generally findings were consistent. SSS-RAP represents a tool for testing independence of SNP association signals using meta-analysis data, and is also a convenient approach based on biological principles for fine mapping in group level summary data. © 2012 Blackwell Publishing Ltd/University College London.
Analysis of large versus small dogs reveals three genes on the canine X chromosome associated with body weight, muscling and back fat thickness

PubMed Central

Davis, Brian W.; Schoenebeck, Jeffrey J.

2017-01-01

Domestic dog breeds display significant diversity in both body mass and skeletal size, resulting from intensive selective pressure during the formation and maintenance of modern breeds. While previous studies focused on the identification of alleles that contribute to small skeletal size, little is known about the underlying genetics controlling large size. We first performed a genome-wide association study (GWAS) using the Illumina Canine HD 170,000 single nucleotide polymorphism (SNP) array which compared 165 large-breed dogs from 19 breeds (defined as having a Standard Breed Weight (SBW) >41 kg [90 lb]) to 690 dogs from 69 small breeds (SBW ≤41 kg). We identified two loci on the canine X chromosome that were strongly associated with large body size at 82–84 megabases (Mb) and 101–104 Mb. Analyses of whole genome sequencing (WGS) data from 163 dogs revealed two indels in the Insulin Receptor Substrate 4 (IRS4) gene at 82.2 Mb and two additional mutations, one SNP and one deletion of a single codon, in Immunoglobulin Superfamily member 1 gene (IGSF1) at 102.3 Mb. IRS4 and IGSF1 are members of the GH/IGF1 and thyroid pathways whose roles include determination of body size. We also found one highly associated SNP in the 5’UTR of Acyl-CoA Synthetase Long-chain family member 4 (ACSL4) at 82.9 Mb, a gene which controls the traits of muscling and back fat thickness. We show by analysis of sequencing data from 26 wolves and 959 dogs representing 102 domestic dog breeds that skeletal size and body mass in large dog breeds are strongly associated with variants within IRS4, ACSL4 and IGSF1. PMID:28257443
Red blood cell antigen genotype analysis for 9087 Asian, Asian American, and Native American blood donors.

PubMed

Delaney, Meghan; Harris, Samantha; Haile, Askale; Johnsen, Jill; Teramura, Gayle; Nelson, Karen

2015-10-01

There has yet to be a comprehensive analysis of blood group antigen prevalence in Asian Americans and Native Americans. There may be ethnic differences in blood group frequencies that would result in clinically important mismatches through transfusion. Blood donors who self-identified as Asian or Native American were tested using a single-nucleotide polymorphism (SNP) DNA array (HEA BeadChip kit, Bioarray Solutions Ltd) that predicts expression of 38 human erythrocyte antigens (HEAs) and by serology for ABO, D, C, M, N, Jk(a) , and Jk(b) . The prevalence of blood group antigens was compared to published European prevalence. Discrepancies between SNP-predicted and serology-detected antigens were tallied. A total of 9087 blood donors were tested from nine Asian and Native American heritages. The predicted prevalence of selected antigens in the RHCE, JK, FY, MNS, LU, CO, and DO blood group systems were variable between Asian populations, but overall not significantly different than Europeans. Compared to European frequencies, Kell blood group allele frequencies were significantly different in the Chinese, Native American, Hawaiian/Pacific Islander, South Asian, and Southeast Asian heritage blood donors; Diego antigens Di(a) and Di(b) were different in donors of Native American and South Asian ancestries (p < 0.05). Of the donors tested, 4.5% showed a SNP-serology discrepancy that segregated within specific ethnic groups. This study provides HEA allele frequency and antigen prevalence data in a cohort of Asian and Native Americans donors. Several ethnic groups exhibited differences in HEA frequencies compared to Europeans. Genotype-serotype discrepancies were detected in all systems studied. © 2015 AABB.
Copy number analysis of NIPBL in a cohort of 510 patients reveals rare copy number variants and a mosaic deletion.

PubMed

Cheng, Yu-Wei; Tan, Christopher A; Minor, Agata; Arndt, Kelly; Wysinger, Latrice; Grange, Dorothy K; Kozel, Beth A; Robin, Nathaniel H; Waggoner, Darrel; Fitzpatrick, Carrie; Das, Soma; Del Gaudio, Daniela

2014-03-01

Cornelia de Lange syndrome (CdLS) is a genetically heterogeneous disorder characterized by growth retardation, intellectual disability, upper limb abnormalities, hirsutism, and characteristic facial features. In this study we explored the occurrence of intragenic NIPBL copy number variations (CNVs) in a cohort of 510 NIPBL sequence-negative patients with suspected CdLS. Copy number analysis was performed by custom exon-targeted oligonucleotide array-comparative genomic hybridization and/or MLPA. Whole-genome SNP array was used to further characterize rearrangements extending beyond the NIPBL gene. We identified NIPBL CNVs in 13 patients (2.5%) including one intragenic duplication and a deletion in mosaic state. Breakpoint sequences in two patients provided further evidence of a microhomology-mediated replicative mechanism as a potential predominant contributor to CNVs in NIPBL. Patients for whom clinical information was available share classical CdLS features including craniofacial and limb defects. Our experience in studying the frequency of NIBPL CNVs in the largest series of patients to date widens the mutational spectrum of NIPBL and emphasizes the clinical utility of performing NIPBL deletion/duplication analysis in patients with CdLS.
Development and validation of the Axiom(®) Apple480K SNP genotyping array.

PubMed

Bianco, Luca; Cestaro, Alessandro; Linsmith, Gareth; Muranty, Hélène; Denancé, Caroline; Théron, Anthony; Poncet, Charles; Micheletti, Diego; Kerschbamer, Emanuela; Di Pierro, Erica A; Larger, Simone; Pindo, Massimo; Van de Weg, Eric; Davassi, Alessandro; Laurens, François; Velasco, Riccardo; Durel, Charles-Eric; Troggio, Michela

2016-04-01

Cultivated apple (Malus × domestica Borkh.) is one of the most important fruit crops in temperate regions, and has great economic and cultural value. The apple genome is highly heterozygous and has undergone a recent duplication which, combined with a rapid linkage disequilibrium decay, makes it difficult to perform genome-wide association (GWA) studies. Single nucleotide polymorphism arrays offer highly multiplexed assays at a relatively low cost per data point and can be a valid tool for the identification of the markers associated with traits of interest. Here, we describe the development and validation of a 487K SNP Affymetrix Axiom(®) genotyping array for apple and discuss its potential applications. The array has been built from the high-depth resequencing of 63 different cultivars covering most of the genetic diversity in cultivated apple. The SNPs were chosen by applying a focal points approach to enrich genic regions, but also to reach a uniform coverage of non-genic regions. A total of 1324 apple accessions, including the 92 progenies of two mapping populations, have been genotyped with the Axiom(®) Apple480K to assess the effectiveness of the array. A large majority of SNPs (359 994 or 74%) fell in the stringent class of poly high resolution polymorphisms. We also devised a filtering procedure to identify a subset of 275K very robust markers that can be safely used for germplasm surveys in apple. The Axiom(®) Apple480K has now been commercially released both for public and proprietary use and will likely be a reference tool for GWA studies in apple. © 2016 The Authors The Plant Journal © 2016 John Wiley & Sons Ltd.
Microarray Technology for the Diagnosis of Fetal Chromosomal Aberrations: Which Platform Should We Use?

PubMed Central

Karampetsou, Evangelia; Morrogh, Deborah; Chitty, Lyn

2014-01-01

The advantage of microarray (array) over conventional karyotype for the diagnosis of fetal pathogenic chromosomal anomalies has prompted the use of microarrays in prenatal diagnostics. In this review we compare the performance of different array platforms (BAC, oligonucleotide CGH, SNP) and designs (targeted, whole genome, whole genome, and targeted, custom) and discuss their advantages and disadvantages in relation to prenatal testing. We also discuss the factors to consider when implementing a microarray testing service for the diagnosis of fetal chromosomal aberrations. PMID:26237396
Genotyping of 75 SNPs using arrays for individual identification in five population groups.

PubMed

Hwa, Hsiao-Lin; Wu, Lawrence Shih Hsin; Lin, Chun-Yen; Huang, Tsun-Ying; Yin, Hsiang-I; Tseng, Li-Hui; Lee, James Chun-I

2016-01-01

Single nucleotide polymorphism (SNP) typing offers promise to forensic genetics. Various strategies and panels for analyzing SNP markers for individual identification have been published. However, the best panels with fewer identity SNPs for all major population groups are still under discussion. This study aimed to find more autosomal SNPs with high heterozygosity for individual identification among Asian populations. Ninety-six autosomal SNPs of 502 DNA samples from unrelated individuals of five population groups (208 Taiwanese Han, 83 Filipinos, 62 Thais, 69 Indonesians, and 80 individuals with European, Near Eastern, or South Asian ancestry) were analyzed using arrays in an initial screening, and 75 SNPs (group A, 46 newly selected SNPs; groups B, 29 SNPs based on a previous SNP panel) were selected for further statistical analyses. Some SNPs with high heterozygosity from Asian populations were identified. The combined random match probability of the best 40 and 45 SNPs was between 3.16 × 10(-17) and 7.75 × 10(-17) and between 2.33 × 10(-19) and 7.00 × 10(-19), respectively, in all five populations. These loci offer comparable power to short tandem repeats (STRs) for routine forensic profiling. In this study, we demonstrated the population genetic characteristics and forensic parameters of 75 SNPs with high heterozygosity from five population groups. This SNPs panel can provide valuable genotypic information and can be helpful in forensic casework for individual identification among these populations.
Analysis of genetic polymorphisms in skeletal Class I crowding.

PubMed

Ting, Tung Yuen; Wong, Ricky Wing Kit; Rabie, A Bakr M

2011-07-01

Dental crowding is a problem for both adolescents and adults in modern society. The purpose of this research was to identify single nucleotide polymorphisms (SNPs) responsible for crowding in subjects with skeletal Class I relationships. The case subjects consisted of healthy Chinese people living in Hong Kong with skeletal Class I relationships and at least 5 mm of crowding in either arch. The control subjects met the same requirements but lacked crowding or spacing. SNP genotyping was performed on the MassARRAY platform. The chi-square test was used to compare genotype and allele type distributions between the case and the control groups. Logistic regression was used to calculate odds ratios with 95% confidence intervals, and the effects of age and sex for each SNP. Analyses of linkage disequilibrium and haplotype associations between SNPs were performed with software. Five SNPs were found to be significantly different in genotype or allele type distributions. SNP rs372024 was significantly associated with crowding (P = 0.004). Two SNPs, rs3764746 and rs3795170, on the EDA gene were found to be associated marginally. SNPs rs1005464 and rs15705 also exhibited marginal association with crowding. The effects of associated SNPs remained significant after adjustments for age and sex factors. This study suggests an association for the genes EDA and XEDAR in dental crowding in the Hong Kong Chinese population. Copyright © 2011 American Association of Orthodontists. Published by Mosby, Inc. All rights reserved.
Genome-wide association studies identify 25 genetic loci associated with resistance to Bacterial Cold Water Disease in rainbow trout

USDA-ARS?s Scientific Manuscript database

Bacterial cold water disease (BCWD) causes significant mortality and economic losses in salmonids aquaculture. In previous studies we have identified moderate-large effect QTL for BCWD resistance in rainbow trout (Oncorhynchus mykiss). However, the recent availability of a high density SNP array and...
Design of a bovine low-density SNP array optimized for imputation

USDA-ARS?s Scientific Manuscript database

The Illumina BovineLD BeadChip was designed to support imputation to higher density genotypes in dairy and beef breeds by including single-nucleotide polymorphisms (SNPs) that had a high minor allele frequency as well as uniform spacing across the genome except at the ends of the chromosome where de...
Determination of Metastatic Potential in Breast Tumors by Global Molecular Characterization Using Multiple Modalities

DTIC Science & Technology

2010-10-01

5 Results ...to disease prognosis and in determining the course of treatment for the patient (2) . Breast cancer is a highly heterogeneous and complex disease...progression is a challenge. Introduction of high density single nucleotide polymorphism (SNP) genotyping arrays has helped not only for whole genome
Genome-wide associations for water-soluble carbohydrate concentration and relative maturity in wheat using SNP and DArT marker arrays

USDA-ARS?s Scientific Manuscript database

Improving water-use efficiency by incorporating drought avoidance traits into new wheat varieties is an important objective for wheat breeding in water-limited environments. This study uses genome wide association studies (GWAS) to identify candidate loci for water-soluble carbohydrate accumulation,...
Comparison of Constitutional and Replication Stress-Induced Genome Structural Variation by SNP Array and Mate-Pair Sequencing

PubMed Central

Arlt, Martin F.; Ozdemir, Alev Cagla; Birkeland, Shanda R.; Lyons, Robert H.; Glover, Thomas W.; Wilson, Thomas E.

2011-01-01

Copy-number variants (CNVs) are a major source of genetic variation in human health and disease. Previous studies have implicated replication stress as a causative factor in CNV formation. However, existing data are technically limited in the quality of comparisons that can be made between human CNVs and experimentally induced variants. Here, we used two high-resolution strategies—single nucleotide polymorphism (SNP) arrays and mate-pair sequencing—to compare CNVs that occur constitutionally to those that arise following aphidicolin-induced DNA replication stress in the same human cells. Although the optimized methods provided complementary information, sequencing was more sensitive to small variants and provided superior structural descriptions. The majority of constitutional and all aphidicolin-induced CNVs appear to be formed via homology-independent mechanisms, while aphidicolin-induced CNVs were of a larger median size than constitutional events even when mate-pair data were considered. Aphidicolin thus appears to stimulate formation of CNVs that closely resemble human pathogenic CNVs and the subset of larger nonhomologous constitutional CNVs. PMID:21212237
The genetic landscape of paediatric de novo acute myeloid leukaemia as defined by single nucleotide polymorphism array and exon sequencing of 100 candidate genes.

PubMed

Olsson, Linda; Zettermark, Sofia; Biloglav, Andrea; Castor, Anders; Behrendtz, Mikael; Forestier, Erik; Paulsson, Kajsa; Johansson, Bertil

2016-07-01

Cytogenetic analyses of a consecutive series of 67 paediatric (median age 8 years; range 0-17) de novo acute myeloid leukaemia (AML) patients revealed aberrations in 55 (82%) cases. The most common subgroups were KMT2A rearrangement (29%), normal karyotype (15%), RUNX1-RUNX1T1 (10%), deletions of 5q, 7q and/or 17p (9%), myeloid leukaemia associated with Down syndrome (7%), PML-RARA (7%) and CBFB-MYH11 (5%). Single nucleotide polymorphism array (SNP-A) analysis and exon sequencing of 100 genes, performed in 52 and 40 cases, respectively (39 overlapping), revealed ≥1 aberration in 89%; when adding cytogenetic data, this frequency increased to 98%. Uniparental isodisomies (UPIDs) were detected in 13% and copy number aberrations (CNAs) in 63% (median 2/case); three UPIDs and 22 CNAs were recurrent. Twenty-two genes were targeted by focal CNAs, including AEBP2 and PHF6 deletions and genes involved in AML-associated gene fusions. Deep sequencing identified mutations in 65% of cases (median 1/case). In total, 60 mutations were found in 30 genes, primarily those encoding signalling proteins (47%), transcription factors (25%), or epigenetic modifiers (13%). Twelve genes (BCOR, CEBPA, FLT3, GATA1, KIT, KRAS, NOTCH1, NPM1, NRAS, PTPN11, SMC3 and TP53) were recurrently mutated. We conclude that SNP-A and deep sequencing analyses complement the cytogenetic diagnosis of paediatric AML. © 2016 John Wiley & Sons Ltd.
Dandy-Walker malformation and Wisconsin syndrome: novel cases add further insight into the genotype-phenotype correlations of 3q23q25 deletions.

PubMed

Ferraris, Alessandro; Bernardini, Laura; Sabolic Avramovska, Vesna; Zanni, Ginevra; Loddo, Sara; Sukarova-Angelovska, Elena; Parisi, Valentina; Capalbo, Anna; Tumini, Stefano; Travaglini, Lorena; Mancini, Francesca; Duma, Filip; Barresi, Sabina; Novelli, Antonio; Mercuri, Eugenio; Tarani, Luigi; Bertini, Enrico; Dallapiccola, Bruno; Valente, Enza Maria

2013-05-16

The Dandy-Walker malformation (DWM) is one of the commonest congenital cerebellar defects, and can be associated with multiple congenital anomalies and chromosomal syndromes. The occurrence of overlapping 3q deletions including the ZIC1 and ZIC4 genes in few patients, along with data from mouse models, have implicated both genes in the pathogenesis of DWM. Using a SNP-array approach, we recently identified three novel patients carrying heterozygous 3q deletions encompassing ZIC1 and ZIC4. Magnetic resonance imaging showed that only two had a typical DWM, while the third did not present any defect of the DWM spectrum. SNP-array analysis in further eleven children diagnosed with DWM failed to identify deletions of ZIC1-ZIC4. The clinical phenotype of the three 3q deleted patients included multiple congenital anomalies and peculiar facial appearance, related to the localization and extension of each deletion. In particular, phenotypes resulted from the variable combination of three recognizable patterns: DWM (with incomplete penetrance); blepharophimosis, ptosis, and epicanthus inversus syndrome; and Wisconsin syndrome (WS), recently mapped to 3q. Our data indicate that the 3q deletion is a rare defect associated with DWM, and suggest that the hemizygosity of ZIC1-ZIC4 genes is neither necessary nor sufficient per se to cause this condition. Furthermore, based on a detailed comparison of clinical features and molecular data from 3q deleted patients, we propose clinical diagnostic criteria and refine the critical region for WS.
Dandy-Walker malformation and Wisconsin syndrome: novel cases add further insight into the genotype-phenotype correlations of 3q23q25 deletions

PubMed Central

2013-01-01

Background The Dandy-Walker malformation (DWM) is one of the commonest congenital cerebellar defects, and can be associated with multiple congenital anomalies and chromosomal syndromes. The occurrence of overlapping 3q deletions including the ZIC1 and ZIC4 genes in few patients, along with data from mouse models, have implicated both genes in the pathogenesis of DWM. Methods and results Using a SNP-array approach, we recently identified three novel patients carrying heterozygous 3q deletions encompassing ZIC1 and ZIC4. Magnetic resonance imaging showed that only two had a typical DWM, while the third did not present any defect of the DWM spectrum. SNP-array analysis in further eleven children diagnosed with DWM failed to identify deletions of ZIC1-ZIC4. The clinical phenotype of the three 3q deleted patients included multiple congenital anomalies and peculiar facial appearance, related to the localization and extension of each deletion. In particular, phenotypes resulted from the variable combination of three recognizable patterns: DWM (with incomplete penetrance); blepharophimosis, ptosis, and epicanthus inversus syndrome; and Wisconsin syndrome (WS), recently mapped to 3q. Conclusions Our data indicate that the 3q deletion is a rare defect associated with DWM, and suggest that the hemizygosity of ZIC1-ZIC4 genes is neither necessary nor sufficient per se to cause this condition. Furthermore, based on a detailed comparison of clinical features and molecular data from 3q deleted patients, we propose clinical diagnostic criteria and refine the critical region for WS. PMID:23679990

High-throughput genotyping for species identification and diversity assessment in germplasm collections.

PubMed

Mason, Annaliese S; Zhang, Jing; Tollenaere, Reece; Vasquez Teuber, Paula; Dalton-Morgan, Jessica; Hu, Liyong; Yan, Guijun; Edwards, David; Redden, Robert; Batley, Jacqueline

2015-09-01

Germplasm collections provide an extremely valuable resource for breeders and researchers. However, misclassification of accessions by species often hinders the effective use of these collections. We propose that use of high-throughput genotyping tools can provide a fast, efficient and cost-effective way of confirming species in germplasm collections, as well as providing valuable genetic diversity data. We genotyped 180 Brassicaceae samples sourced from the Australian Grains Genebank across the recently released Illumina Infinium Brassica 60K SNP array. Of these, 76 were provided on the basis of suspected misclassification and another 104 were sourced independently from the germplasm collection. Presence of the A- and C-genomes combined with principle components analysis clearly separated Brassica rapa, B. oleracea, B. napus, B. carinata and B. juncea samples into distinct species groups. Several lines were further validated using chromosome counts. Overall, 18% of samples (32/180) were misclassified on the basis of species. Within these 180 samples, 23/76 (30%) supplied on the basis of suspected misclassification were misclassified, and 9/105 (9%) of the samples randomly sourced from the Australian Grains Genebank were misclassified. Surprisingly, several individuals were also found to be the product of interspecific hybridization events. The SNP (single nucleotide polymorphism) array proved effective at confirming species, and provided useful information related to genetic diversity. As similar genomic resources become available for different crops, high-throughput molecular genotyping will offer an efficient and cost-effective method to screen germplasm collections worldwide, facilitating more effective use of these valuable resources by breeders and researchers. © 2015 John Wiley & Sons Ltd.
Micro-Analyzer: automatic preprocessing of Affymetrix microarray data.

PubMed

Guzzi, Pietro Hiram; Cannataro, Mario

2013-08-01

A current trend in genomics is the investigation of the cell mechanism using different technologies, in order to explain the relationship among genes, molecular processes and diseases. For instance, the combined use of gene-expression arrays and genomic arrays has been demonstrated as an effective instrument in clinical practice. Consequently, in a single experiment different kind of microarrays may be used, resulting in the production of different types of binary data (images and textual raw data). The analysis of microarray data requires an initial preprocessing phase, that makes raw data suitable for use on existing analysis platforms, such as the TIGR M4 (TM4) Suite. An additional challenge to be faced by emerging data analysis platforms is the ability to treat in a combined way those different microarray formats coupled with clinical data. In fact, resulting integrated data may include both numerical and symbolic data (e.g. gene expression and SNPs regarding molecular data), as well as temporal data (e.g. the response to a drug, time to progression and survival rate), regarding clinical data. Raw data preprocessing is a crucial step in analysis but is often performed in a manual and error prone way using different software tools. Thus novel, platform independent, and possibly open source tools enabling the semi-automatic preprocessing and annotation of different microarray data are needed. The paper presents Micro-Analyzer (Microarray Analyzer), a cross-platform tool for the automatic normalization, summarization and annotation of Affymetrix gene expression and SNP binary data. It represents the evolution of the μ-CS tool, extending the preprocessing to SNP arrays that were not allowed in μ-CS. The Micro-Analyzer is provided as a Java standalone tool and enables users to read, preprocess and analyse binary microarray data (gene expression and SNPs) by invoking TM4 platform. It avoids: (i) the manual invocation of external tools (e.g. the Affymetrix Power Tools), (ii) the manual loading of preprocessing libraries, and (iii) the management of intermediate files, such as results and metadata. Micro-Analyzer users can directly manage Affymetrix binary data without worrying about locating and invoking the proper preprocessing tools and chip-specific libraries. Moreover, users of the Micro-Analyzer tool can load the preprocessed data directly into the well-known TM4 platform, extending in such a way also the TM4 capabilities. Consequently, Micro Analyzer offers the following advantages: (i) it reduces possible errors in the preprocessing and further analysis phases, e.g. due to the incorrect choice of parameters or due to the use of old libraries, (ii) it enables the combined and centralized pre-processing of different arrays, (iii) it may enhance the quality of further analysis by storing the workflow, i.e. information about the preprocessing steps, and (iv) finally Micro-Analzyer is freely available as a standalone application at the project web site http://sourceforge.net/projects/microanalyzer/. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
Genome-wide SNP association-based localization of a dwarfism gene in Friesian dwarf horses.

PubMed

Orr, N; Back, W; Gu, J; Leegwater, P; Govindarajan, P; Conroy, J; Ducro, B; Van Arendonk, J A M; MacHugh, D E; Ennis, S; Hill, E W; Brama, P A J

2010-12-01

The recent completion of the horse genome and commercial availability of an equine SNP genotyping array has facilitated the mapping of disease genes. We report putative localization of the gene responsible for dwarfism, a trait in Friesian horses that is thought to have a recessive mode of inheritance, to a 2-MB region of chromosome 14 using just 10 affected animals and 10 controls. We successfully genotyped 34,429 SNPs that were tested for association with dwarfism using chi-square tests. The most significant SNP in our study, BIEC2-239376 (P(2df)=4.54 × 10(-5), P(rec)=7.74 × 10(-6)), is located close to a gene implicated in human dwarfism. Fine-mapping and resequencing analyses did not aid in further localization of the causative variant, and replication of our findings in independent sample sets will be necessary to confirm these results. © 2010 The Authors, Journal compilation © 2010 Stichting International Foundation for Animal Genetics.
SNPchiMp: a database to disentangle the SNPchip jungle in bovine livestock.

PubMed

Nicolazzi, Ezequiel Luis; Picciolini, Matteo; Strozzi, Francesco; Schnabel, Robert David; Lawley, Cindy; Pirani, Ali; Brew, Fiona; Stella, Alessandra

2014-02-11

Currently, six commercial whole-genome SNP chips are available for cattle genotyping, produced by two different genotyping platforms. Technical issues need to be addressed to combine data that originates from the different platforms, or different versions of the same array generated by the manufacturer. For example: i) genome coordinates for SNPs may refer to different genome assemblies; ii) reference genome sequences are updated over time changing the positions, or even removing sequences which contain SNPs; iii) not all commercial SNP ID's are searchable within public databases; iv) SNPs can be coded using different formats and referencing different strands (e.g. A/B or A/C/T/G alleles, referencing forward/reverse, top/bottom or plus/minus strand); v) Due to new information being discovered, higher density chips do not necessarily include all the SNPs present in the lower density chips; and, vi) SNP IDs may not be consistent across chips and platforms. Most researchers and breed associations manage SNP data in real-time and thus require tools to standardise data in a user-friendly manner. Here we present SNPchiMp, a MySQL database linked to an open access web-based interface. Features of this interface include, but are not limited to, the following functions: 1) referencing the SNP mapping information to the latest genome assembly, 2) extraction of information contained in dbSNP for SNPs present in all commercially available bovine chips, and 3) identification of SNPs in common between two or more bovine chips (e.g. for SNP imputation from lower to higher density). In addition, SNPchiMp can retrieve this information on subsets of SNPs, accessing such data either via physical position on a supported assembly, or by a list of SNP IDs, rs or ss identifiers. This tool combines many different sources of information, that otherwise are time consuming to obtain and difficult to integrate. The SNPchiMp not only provides the information in a user-friendly format, but also enables researchers to perform a large number of operations with a few clicks of the mouse. This significantly reduces the time needed to execute the large number of operations required to manage SNP data.
Restitution and genetic differentiation of salmon populations in the southern Baltic genotyped with the Atlantic salmon 7K SNP array.

PubMed

Poćwierz-Kotus, Anita; Bernaś, Rafał; Kent, Matthew P; Lien, Sigbjørn; Leliűna, Egidijus; Dębowski, Piotr; Wenne, Roman

2015-05-06

Native populations of Atlantic salmon in Poland, from the southern Baltic region, became extinct in the 1980s. Attempts to restitute salmon populations in Poland have been based on a Latvian salmon population from the Daugava river. Releases of hatchery reared smolts started in 1986, but to date, only one population with confirmed natural reproduction has been observed in the Slupia river. Our aim was to investigate the genetic differentiation of salmon populations in the southern Baltic using a 7K SNP (single nucleotide polymorphism) array in order to assess the impact of salmon restitution in Poland. One hundred and forty salmon samples were collected from: the Polish Slupia river including wild salmon and individuals from two hatcheries, the Swedish Morrum river and the Lithuanian Neman river. All samples were genotyped using an Atlantic salmon 7K SNP array. A set of 3218 diagnostic SNPs was used for genetic analyses. Genetic structure analyses indicated that the individuals from the investigated populations were clustered into three groups i.e. one clade that included individuals from both hatcheries and the wild population from the Polish Slupia river, which was clearly separated from the other clades. An assignment test showed that there were no stray fish from the Morrum or Neman rivers in the sample analyzed from the Slupia river. Global FST over polymorphic loci was high (0.177). A strong genetic differentiation was observed between the Lithuanian and Swedish populations (FST = 0.28). Wild juvenile salmon specimens that were sampled from the Slupia river were the progeny of fish released from hatcheries and, most likely, were not progeny of stray fish from Sweden or Lithuania. Strong genetic differences were observed between the salmon populations from the three studied locations. Our recommendation is that future stocking activities that aim at restituting salmon populations in Poland include stocking material from the Lithuanian Neman river because of its closer geographic proximity.
rSNPBase 3.0: an updated database of SNP-related regulatory elements, element-gene pairs and SNP-based gene regulatory networks.

PubMed

Guo, Liyuan; Wang, Jing

2018-01-04

Here, we present the updated rSNPBase 3.0 database (http://rsnp3.psych.ac.cn), which provides human SNP-related regulatory elements, element-gene pairs and SNP-based regulatory networks. This database is the updated version of the SNP regulatory annotation database rSNPBase and rVarBase. In comparison to the last two versions, there are both structural and data adjustments in rSNPBase 3.0: (i) The most significant new feature is the expansion of analysis scope from SNP-related regulatory elements to include regulatory element-target gene pairs (E-G pairs), therefore it can provide SNP-based gene regulatory networks. (ii) Web function was modified according to data content and a new network search module is provided in the rSNPBase 3.0 in addition to the previous regulatory SNP (rSNP) search module. The two search modules support data query for detailed information (related-elements, element-gene pairs, and other extended annotations) on specific SNPs and SNP-related graphic networks constructed by interacting transcription factors (TFs), miRNAs and genes. (3) The type of regulatory elements was modified and enriched. To our best knowledge, the updated rSNPBase 3.0 is the first data tool supports SNP functional analysis from a regulatory network prospective, it will provide both a comprehensive understanding and concrete guidance for SNP-related regulatory studies. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
rSNPBase 3.0: an updated database of SNP-related regulatory elements, element-gene pairs and SNP-based gene regulatory networks

PubMed Central

2018-01-01

Abstract Here, we present the updated rSNPBase 3.0 database (http://rsnp3.psych.ac.cn), which provides human SNP-related regulatory elements, element-gene pairs and SNP-based regulatory networks. This database is the updated version of the SNP regulatory annotation database rSNPBase and rVarBase. In comparison to the last two versions, there are both structural and data adjustments in rSNPBase 3.0: (i) The most significant new feature is the expansion of analysis scope from SNP-related regulatory elements to include regulatory element–target gene pairs (E–G pairs), therefore it can provide SNP-based gene regulatory networks. (ii) Web function was modified according to data content and a new network search module is provided in the rSNPBase 3.0 in addition to the previous regulatory SNP (rSNP) search module. The two search modules support data query for detailed information (related-elements, element-gene pairs, and other extended annotations) on specific SNPs and SNP-related graphic networks constructed by interacting transcription factors (TFs), miRNAs and genes. (3) The type of regulatory elements was modified and enriched. To our best knowledge, the updated rSNPBase 3.0 is the first data tool supports SNP functional analysis from a regulatory network prospective, it will provide both a comprehensive understanding and concrete guidance for SNP-related regulatory studies. PMID:29140525
A 2-Stage Genome-Wide Association Study to Identify Single Nucleotide Polymorphisms Associated With Development of Erectile Dysfunction Following Radiation Therapy for Prostate Cancer

DOE Office of Scientific and Technical Information (OSTI.GOV)

Kerns, Sarah L.; Departments of Pathology and Genetics, Albert Einstein College of Medicine, Bronx, New York; Stock, Richard

2013-01-01

Purpose: To identify single nucleotide polymorphisms (SNPs) associated with development of erectile dysfunction (ED) among prostate cancer patients treated with radiation therapy. Methods and Materials: A 2-stage genome-wide association study was performed. Patients were split randomly into a stage I discovery cohort (132 cases, 103 controls) and a stage II replication cohort (128 cases, 102 controls). The discovery cohort was genotyped using Affymetrix 6.0 genome-wide arrays. The 940 top ranking SNPs selected from the discovery cohort were genotyped in the replication cohort using Illumina iSelect custom SNP arrays. Results: Twelve SNPs identified in the discovery cohort and validated in themore » replication cohort were associated with development of ED following radiation therapy (Fisher combined P values 2.1 Multiplication-Sign 10{sup -5} to 6.2 Multiplication-Sign 10{sup -4}). Notably, these 12 SNPs lie in or near genes involved in erectile function or other normal cellular functions (adhesion and signaling) rather than DNA damage repair. In a multivariable model including nongenetic risk factors, the odds ratios for these SNPs ranged from 1.6 to 5.6 in the pooled cohort. There was a striking relationship between the cumulative number of SNP risk alleles an individual possessed and ED status (Sommers' D P value = 1.7 Multiplication-Sign 10{sup -29}). A 1-allele increase in cumulative SNP score increased the odds for developing ED by a factor of 2.2 (P value = 2.1 Multiplication-Sign 10{sup -19}). The cumulative SNP score model had a sensitivity of 84% and specificity of 75% for prediction of developing ED at the radiation therapy planning stage. Conclusions: This genome-wide association study identified a set of SNPs that are associated with development of ED following radiation therapy. These candidate genetic predictors warrant more definitive validation in an independent cohort.« less
Interrogation of the platelet-derived growth factor receptor alpha locus and corneal astigmatism in Australians of Northern European ancestry: results of a genome-wide association study.

PubMed

Yazar, Seyhan; Mishra, Aniket; Ang, Wei; Kearns, Lisa S; Mountain, Jenny A; Pennell, Craig; Montgomery, Grant W; Young, Terri L; Hammond, Christopher J; Macgregor, Stuart; Mackey, David A; Hewitt, Alex W

2013-01-01

Corneal astigmatism is a common eye disorder characterized by irregularities in corneal curvature. Recently, the rs7677751 single nucleotide polymorphism (SNP) at the platelet-derived growth factor receptor alpha (PDGFRA) locus was found to be associated with corneal astigmatism in people of Asian ancestry. In the present study, we sought to replicate this finding and identify other genetic markers of corneal astigmatism in an Australian population of Northern European ancestry. Data from two cohorts were included in this study. The first cohort consisted of 1,013 individuals who were part of the Western Australian Pregnancy Cohort (Raine) Study: 20-year follow-up Eye Study. The second cohort comprised 1,788 individuals of 857 twin families who were recruited through the Twins Eye Study in Tasmania and the Brisbane Adolescent Twin Study. Corneal astigmatism was calculated as the absolute difference between the keratometry readings in two meridians, and genotype data were extracted from genome-wide arrays. Initially, each cohort was analyzed separately, before being combined for meta- and subsequent genome-wide pathway analysis. Following meta-analysis, SNP rs7677751 at the PDGFRA locus had a combined p=0.32. No variant was found to be statistically significantly associated with corneal astigmatism at the genome-wide level (p<5.0×10(-8)). The SNP with strongest association was rs1164064 (p=1.86×10(-6)) on chromosome 3q13. Gene-based pathway analysis identified a significant association between the Gene Ontology "segmentation" (GO:0035282) pathway, corrected p=0.009. Our data suggest that the PDGFRA locus does not transfer a major risk of corneal astigmatism in people of Northern European ancestry. Better-powered studies are required to validate the novel putative findings of our study.
A case of 3q29 microdeletion syndrome involving oral cleft inherited from a non-affected mosaic parent: molecular analysis and ethical implications

PubMed Central

Petrin, Aline L.; Daack-Hirsch, Sandra; L’Heureux, Jamie; Murray, Jeffrey C

2010-01-01

Objective The objective of this study was to use array-CGH to detect causal microdeletions in samples of subjects with cleft lip and palate. Subjects We analyzed DNA samples from a male patient and parents that was seen during surgical screening for an Operation Smile medical mission in the Philippines. Method We used Affymetrix Genome Wide Human SNP Array 6.0 followed by sequencing and quantitative PCR using SYBR Green I dye. Results We report the second case of 3q29 microdeletion syndrome including cleft lip with or without cleft palate and the first case of this microdeletion syndrome inherited from a phenotypically normal mosaic parent. Conclusions Our findings confirm the utility of aCGH to detect causal microdeletions; indicate that parental somatic mosaicism should be considered in healthy parents for genetic counseling of the families and discuss important ethical implications of sharing health impact results from research studies with the participant families. PMID:20500065
Next-generation analysis of cataracts: determining knowledge driven gene-gene interactions using Biofilter, and gene-environment interactions using the PhenX Toolkit.

PubMed

Pendergrass, Sarah A; Verma, Shefali S; Holzinger, Emily R; Moore, Carrie B; Wallace, John; Dudek, Scott M; Huggins, Wayne; Kitchner, Terrie; Waudby, Carol; Berg, Richard; McCarty, Catherine A; Ritchie, Marylyn D

2013-01-01

Investigating the association between biobank derived genomic data and the information of linked electronic health records (EHRs) is an emerging area of research for dissecting the architecture of complex human traits, where cases and controls for study are defined through the use of electronic phenotyping algorithms deployed in large EHR systems. For our study, 2580 cataract cases and 1367 controls were identified within the Marshfield Personalized Medicine Research Project (PMRP) Biobank and linked EHR, which is a member of the NHGRI-funded electronic Medical Records and Genomics (eMERGE) Network. Our goal was to explore potential gene-gene and gene-environment interactions within these data for 529,431 single nucleotide polymorphisms (SNPs) with minor allele frequency > 1%, in order to explore higher level associations with cataract risk beyond investigations of single SNP-phenotype associations. To build our SNP-SNP interaction models we utilized a prior-knowledge driven filtering method called Biofilter to minimize the multiple testing burden of exploring the vast array of interaction models possible from our extensive number of SNPs. Using the Biofilter, we developed 57,376 prior-knowledge directed SNP-SNP models to test for association with cataract status. We selected models that required 6 sources of external domain knowledge. We identified 5 statistically significant models with an interaction term with p-value < 0.05, as well as an overall model with p-value < 0.05 associated with cataract status. We also conducted gene-environment interaction analyses for all GWAS SNPs and a set of environmental factors from the PhenX Toolkit: smoking, UV exposure, and alcohol use; these environmental factors have been previously associated with the formation of cataracts. We found a total of 288 models that exhibit an interaction term with a p-value ≤ 1×10(-4) associated with cataract status. Our results show these approaches enable advanced searches for epistasis and gene-environment interactions beyond GWAS, and that the EHR based approach provides an additional source of data for seeking these advanced explanatory models of the etiology of complex disease/outcome such as cataracts.
Two Novel SNPs of PPARγ Significantly Affect Weaning Growth Traits of Nanyang Cattle.

PubMed

Huang, Jieping; Chen, Ningbo; Li, Xin; An, Shanshan; Zhao, Minghui; Sun, Taihong; Hao, Ruijie; Ma, Yun

2018-01-02

Peroxisome-proliferator-activated receptor gamma (PPARγ) is a key transcription factor that controls adipocyte differentiation and energy in mammals. Therefore, PPARγ is a potential factor influencing animal growth traits. This study primarily evaluates PPARγ as candidate gene for growth traits of cattle and identifies potential molecular marker for cattle breeding. Per previous studies, PPARγ mRNA was mainly expressed at extremely high levels in adipose tissues as shown by quantitative real-time polymerase chain reaction analysis. Three novel SNPs of the bovine PPARγ gene were identified in 514 individuals from six Chinese cattle breeds: SNP1 (AC_000179.1 g.57386668 C > G) in intron 2 and SNP2 (AC_000179.1 g.57431964 C > T) and SNP3 (AC_000179.1 g.57431994 T > C) in exon 7. The present study also investigated genetic characteristics of these SNP loci in six populations. Association analysis showed that SNP1 and SNP3 loci significantly affect weaning growth traits, especially body weight of Nanyang cattle. These results revealed that SNP1 and SNP3 are potential molecular markers for cattle breeding.
Newly recognized recessive syndrome characterized by dysmorphic features, hypogonadotropic hypogonadism, severe microcephaly, and sensorineural hearing loss maps to 3p21.3.

PubMed

Jenkinson, Emma M; Kingston, Helen; Urquhart, Jill; Khan, Naz; Melville, Athalie; Swinton, Martin; Crow, Yanick J; Davis, Julian R E; Trump, Dorothy; Newman, William G

2011-12-01

We present a newly recognized, likely autosomal recessive, pleiotropic disorder seen in four individuals (three siblings and their nephew) from a consanguineous family of Pakistani origin. The condition is characterized by hypogonadotropic hypogonadism, severe microcephaly, sensorineural deafness, moderate learning disability, and distinctive facial dysmorphic features. Autozygosity mapping using SNP array genotyping defined a single, large autozygous region of 13.1 Mb on chromosome 3p21 common to the affected individuals. The critical region contains 227 genes and initial sequence analysis of a functional candidate gene has not identified causative mutations. Copyright © 2011 Wiley Periodicals, Inc.
Cambridge Healthtech Institute's Third Annual Conference on Lab-on-a-Chip and Microarrays. 22-24 January 2001, Zurich, Switzerland.

PubMed

Jain, K K

2001-02-01

Cambridge Healthtech Institute's Third Annual Conference on Lab-on-a-Chip and Microarray technology covered the latest advances in this technology and applications in life sciences. Highlights of the meetings are reported briefly with emphasis on applications in genomics, drug discovery and molecular diagnostics. There was an emphasis on microfluidics because of the wide applications in laboratory and drug discovery. The lab-on-a-chip provides the facilities of a complete laboratory in a hand-held miniature device. Several microarray systems have been used for hybridisation and detection techniques. Oligonucleotide scanning arrays provide a versatile tool for the analysis of nucleic acid interactions and provide a platform for improving the array-based methods for investigation of antisense therapeutics. A method for analysing combinatorial DNA arrays using oligonucleotide-modified gold nanoparticle probes and a conventional scanner has considerable potential in molecular diagnostics. Various applications of microarray technology for high-throughput screening in drug discovery and single nucleotide polymorphisms (SNP) analysis were discussed. Protein chips have important applications in proteomics. With the considerable amount of data generated by the different technologies using microarrays, it is obvious that the reading of the information and its interpretation and management through the use of bioinformatics is essential. Various techniques for data analysis were presented. Biochip and microarray technology has an essential role to play in the evolving trends in healthcare, which integrate diagnosis with prevention/treatment and emphasise personalised medicines.
High-density fiber optic biosensor arrays

NASA Astrophysics Data System (ADS)

Epstein, Jason R.; Walt, David R.

2002-02-01

Novel approaches are required to coordinate the immense amounts of information derived from diverse genomes. This concept has influenced the expanded role of high-throughput DNA detection and analysis in the biological sciences. A high-density fiber optic DNA biosensor was developed consisting of oligonucleotide-functionalized, 3.1 mm diameter microspheres deposited into the etched wells on the distal face of a 500 micrometers imaging fiber bundle. Imaging fiber bundles containing thousands of optical fibers, each associated with a unique oligonucleotide probe sequence, were the foundation for an optically connected, individually addressable DNA detection platform. Different oligonucleotide-functionalized microspheres were combined in a stock solution, and randomly dispersed into the etched wells. Microsphere positions were registered from optical dyes incorporated onto the microspheres. The distribution process provided an inherent redundancy that increases the signal-to-noise ratio as the square root of the number of sensors examined. The representative amount of each probe-type in the array was dependent on their initial stock solution concentration, and as other sequences of interest arise, new microsphere elements can be added to arrays without altering the existing detection capabilities. The oligonucleotide probe sequences hybridize to fluorescently-labeled, complementary DNA target solutions. Fiber optic DNA microarray research has included DNA-protein interaction profiles, microbial strain differentiation, non-labeled target interrogation with molecular beacons, and single cell-based assays. This biosensor array is proficient in DNA detection linked to specific disease states, single nucleotide polymorphism (SNP's) discrimination, and gene expression analysis. This array platform permits multiple detection formats, provides smaller feature sizes, and enables sensor design flexibility. High-density fiber optic microarray biosensors provide a fast, reversible format with the detection limit of a few hundred molecules.
Rapid discovery of SNPs differentiating hatchery steelhead trout from ESA-listed wild steelhead trout using a 57K SNP array

USDA-ARS?s Scientific Manuscript database

Natural-origin steelhead in the Pacific Northwest USA are threatened by a number of factors including habitat destruction, disease, decline in marine survival and a potential erosion of genetic viability due to introgression from hatchery strains. The major goal of this study was to use a recently ...
Discovery of 20,000 RAD-SNPs and development of a 52-SNP array for monitoring river otters

Treesearch

Jeffrey B. Stetz; Seth Smith; Michael A. Sawaya; Alan B. Ramsey; Stephen J. Amish; Michael K. Schwartz; Gordon Luikart

2016-01-01

Many North American river otter (Lontra canadensis) populations are threatened or recovering but are difficult to study because they occur at low densities, it is difficult to visually identify individuals, and they inhabit aquatic environments that accelerate degradation of biological samples. Single nucleotide polymorphisms (SNPs) can improve our ability to...
Similar genetic architecture with shared and unique quantitative trait loci for bacterial cold water disease resistance in two rainbow trout breeding populations

USDA-ARS?s Scientific Manuscript database

Bacterial cold water disease (BCWD) causes significant mortality and economic losses in salmonid aquaculture. In previous studies, we identified moderate-large effect QTL for BCWD resistance in rainbow trout (Oncorhynchus mykiss). However, the recent availability of a 57K SNP array and a genome phys...
Loss-of-function DNA sequence variant in the CLCNKA chloride channel implicates the cardio-renal axis in interindividual heart failure risk variation.

PubMed

Cappola, Thomas P; Matkovich, Scot J; Wang, Wei; van Booven, Derek; Li, Mingyao; Wang, Xuexia; Qu, Liming; Sweitzer, Nancy K; Fang, James C; Reilly, Muredach P; Hakonarson, Hakon; Nerbonne, Jeanne M; Dorn, Gerald W

2011-02-08

Common heart failure has a strong undefined heritable component. Two recent independent cardiovascular SNP array studies identified a common SNP at 1p36 in intron 2 of the HSPB7 gene as being associated with heart failure. HSPB7 resequencing identified other risk alleles but no functional gene variants. Here, we further show no effect of the HSPB7 SNP on cardiac HSPB7 mRNA levels or splicing, suggesting that the SNP marks the position of a functional variant in another gene. Accordingly, we used massively parallel platforms to resequence all coding exons of the adjacent CLCNKA gene, which encodes the K(a) renal chloride channel (ClC-K(a)). Of 51 exonic CLCNKA variants identified, one SNP (rs10927887, encoding Arg83Gly) was common, in linkage disequilibrium with the heart failure risk SNP in HSPB7, and associated with heart failure in two independent Caucasian referral populations (n = 2,606 and 1,168; combined P = 2.25 × 10(-6)). Individual genotyping of rs10927887 in the two study populations and a third independent heart failure cohort (combined n = 5,489) revealed an additive allele effect on heart failure risk that is independent of age, sex, and prior hypertension (odds ratio = 1.27 per allele copy; P = 8.3 × 10(-7)). Functional characterization of recombinant wild-type Arg83 and variant Gly83 ClC-K(a) chloride channel currents revealed ≈ 50% loss-of-function of the variant channel. These findings identify a common, functionally significant genetic risk factor for Caucasian heart failure. The variant CLCNKA risk allele, telegraphed by linked variants in the adjacent HSPB7 gene, uncovers a previously overlooked genetic mechanism affecting the cardio-renal axis.
Extensive population structure in San, Khoe, and mixed ancestry populations from southern Africa revealed by 44 short 5-SNP haplotypes.

PubMed

Schlebusch, Carina M; Soodyall, Himlya

2012-12-01

The San and Khoe people currently represent remnant groups of a much larger and widely distributed population of hunter-gatherers and pastoralists who had exclusive occupation of southern Africa before the arrival of Bantu-speaking groups in the past 1,200 years and sea-borne immigrants within the last 350 years. Genetic studies [mitochondrial deoxyribonucleic acid (DNA) and Y-chromosome] conducted on San and Khoe groups revealed that they harbor some of the most divergent lineages found in living peoples throughout the world. Recently, high-density, autosomal, single-nucleotide polymorphism (SNP)-array studies confirmed the early divergence of Khoe-San population groups from all other human populations. The present study made use of 220 autosomal SNP markers (in the format of both haplotypes and genotypes) to examine the population structure of various San and Khoe groups and their relationship to other neighboring groups. Whereas analyses based on the genotypic SNP data only supported the division of the included populations into three main groups-Khoe-San, Bantu-speakers, and non-African populations-haplotype analyses revealed finer structure within Khoe-San populations. By the use of only 44 short SNP haplotypes (compiled from a total of 220 SNPs), most of the Khoe-San groups could be resolved as separate groups by applying STRUCTURE analyses. Therefore, by carefully selecting a few SNPs and combining them into haplotypes, we were able to achieve the same level of population distinction that was achieved previously in high-density SNP studies on the same population groups. Using haplotypes proved to be a very efficient and cost-effective way to study population structure. Copyright © 2013 Wayne State University Press, Detroit, Michigan 48201-1309.

GrigoraSNPs: Optimized Analysis of SNPs for DNA Forensics.

PubMed

Ricke, Darrell O; Shcherbina, Anna; Michaleas, Adam; Fremont-Smith, Philip

2018-04-16

High-throughput sequencing (HTS) of single nucleotide polymorphisms (SNPs) enables additional DNA forensic capabilities not attainable using traditional STR panels. However, the inclusion of sets of loci selected for mixture analysis, extended kinship, phenotype, biogeographic ancestry prediction, etc., can result in large panel sizes that are difficult to analyze in a rapid fashion. GrigoraSNP was developed to address the allele-calling bottleneck that was encountered when analyzing SNP panels with more than 5000 loci using HTS. GrigoraSNPs uses a MapReduce parallel data processing on multiple computational threads plus a novel locus-identification hashing strategy leveraging target sequence tags. This tool optimizes the SNP calling module of the DNA analysis pipeline with runtimes that scale linearly with the number of HTS reads. Results are compared with SNP analysis pipelines implemented with SAMtools and GATK. GrigoraSNPs removes a computational bottleneck for processing forensic samples with large HTS SNP panels. Published 2018. This article is a U.S. Government work and is in the public domain in the USA.
Rare genomic rearrangement in a boy with Williams-Beuren syndrome associated to XYY syndrome and intriguing behavior.

PubMed

Dutra, Roberta L; Piazzon, Flavia B; Zanardo, Évelin A; Costa, Thais Virginia Moura Machado; Montenegro, Marília M; Novo-Filho, Gil M; Dias, Alexandre T; Nascimento, Amom M; Kim, Chong Ae; Kulikowski, Leslie D

2015-12-01

Williams-Beuren syndrome (WBS) is caused by a hemizygous contiguous gene microdeletion of 1.55-1.84 Mb at 7q11.23 region. Approximately, 28 genes have been shown to contribute to classical phenotype of SWB with presence of dysmorphic facial features, supravalvular aortic stenosis (SVAS), intellectual disability, and overfriendliness. With the use of Microarray-based comparative genomic hybridization and other molecular cytogenetic techniques, is possible define with more accuracy partial or atypical deletion and refine the genotype-phenotype correlation. Here, we report on a rare genomic structural rearrangement in a boy with atypical deletion in 7q11.23 and XYY syndrome with characteristic clinical signs, but not sufficient for the diagnosis of WBS. Cytogenetic analysis of G-banding showed a karyotype 47,XYY. Analysis of DNA with the technique of MLPA (Multiplex Ligation-dependent Probe Amplification) using kits a combination of kits (P064, P036, P070, and P029) identified an atypical deletion on 7q11.23. In addition, high resolution SNP Oligonucleotide Microarray Analysis (SNP-array) confirmed the alterations found by MLPA and revealed others pathogenic CNVs, in the chromosomes 7 and X. The present report demonstrates an association not yet described in literature, between Williams-Beuren syndrome and 47,XYY. The identification of atypical deletion in 7q11.23 concomitant to additional pathogenic CNVs in others genomic regions allows a better comprehension of clinical consequences of atypical genomic rearrangements. © 2015 Wiley Periodicals, Inc.
Analysis of the genetic structure of the Malay population: Ancestry-informative marker SNPs in the Malay of Peninsular Malaysia.

PubMed

Yahya, Padillah; Sulong, Sarina; Harun, Azian; Wan Isa, Hatin; Ab Rajab, Nur-Shafawati; Wangkumhang, Pongsakorn; Wilantho, Alisa; Ngamphiw, Chumpol; Tongsima, Sissades; Zilfalil, Bin Alwi

2017-09-01

Malay, the main ethnic group in Peninsular Malaysia, is represented by various sub-ethnic groups such as Melayu Banjar, Melayu Bugis, Melayu Champa, Melayu Java, Melayu Kedah Melayu Kelantan, Melayu Minang and Melayu Patani. Using data retrieved from the MyHVP (Malaysian Human Variome Project) database, a total of 135 individuals from these sub-ethnic groups were profiled using the Affymetrix GeneChip Mapping Xba 50-K single nucleotide polymorphism (SNP) array to identify SNPs that were ancestry-informative markers (AIMs) for Malays of Peninsular Malaysia. Prior to selecting the AIMs, the genetic structure of Malays was explored with reference to 11 other populations obtained from the Pan-Asian SNP Consortium database using principal component analysis (PCA) and ADMIXTURE. Iterative pruning principal component analysis (ipPCA) was further used to identify sub-groups of Malays. Subsequently, we constructed an AIMs panel for Malays using the informativeness for assignment (I n ) of genetic markers, and the K-nearest neighbor classifier (KNN) was used to teach the classification models. A model of 250 SNPs ranked by I n , correctly classified Malay individuals with an accuracy of up to 90%. The identified panel of SNPs could be utilized as a panel of AIMs to ascertain the specific ancestry of Malays, which may be useful in disease association studies, biomedical research or forensic investigation purposes. Copyright © 2017 Elsevier B.V. All rights reserved.
Fine-Scale Variation and Genetic Determinants of Alternative Splicing across Individuals

PubMed Central

Coulombe-Huntington, Jasmin; Lam, Kevin C. L.; Dias, Christel; Majewski, Jacek

2009-01-01

Recently, thanks to the increasing throughput of new technologies, we have begun to explore the full extent of alternative pre–mRNA splicing (AS) in the human transcriptome. This is unveiling a vast layer of complexity in isoform-level expression differences between individuals. We used previously published splicing sensitive microarray data from lymphoblastoid cell lines to conduct an in-depth analysis on splicing efficiency of known and predicted exons. By combining publicly available AS annotation with a novel algorithm designed to search for AS, we show that many real AS events can be detected within the usually unexploited, speculative majority of the array and at significance levels much below standard multiple-testing thresholds, demonstrating that the extent of cis-regulated differential splicing between individuals is potentially far greater than previously reported. Specifically, many genes show subtle but significant genetically controlled differences in splice-site usage. PCR validation shows that 42 out of 58 (72%) candidate gene regions undergo detectable AS, amounting to the largest scale validation of isoform eQTLs to date. Targeted sequencing revealed a likely causative SNP in most validated cases. In all 17 incidences where a SNP affected a splice-site region, in silico splice-site strength modeling correctly predicted the direction of the micro-array and PCR results. In 13 other cases, we identified likely causative SNPs disrupting predicted splicing enhancers. Using Fst and REHH analysis, we uncovered significant evidence that 2 putative causative SNPs have undergone recent positive selection. We verified the effect of five SNPs using in vivo minigene assays. This study shows that splicing differences between individuals, including quantitative differences in isoform ratios, are frequent in human populations and that causative SNPs can be identified using in silico predictions. Several cases affected disease-relevant genes and it is likely some of these differences are involved in phenotypic diversity and susceptibility to complex diseases. PMID:20011102
Narrowing the wingless-2 mutation to a 227 kb candidate region on chicken chromosome 12

PubMed Central

Webb, A E; Youngworth, I A; Kaya, M; Gitter, C L; O’Hare, E A; May, B; Cheng, H H; Delany, M E

2018-01-01

ABSTRACT Wingless-2 (wg-2) is an autosomal recessive mutation in chicken that results in an embryonic lethal condition. Affected individuals exhibit a multisystem syndrome characterized by absent wings, truncated legs, and craniofacial, kidney, and feather malformations. Previously, work focused on phenotype description, establishing the autosomal recessive pattern of Mendelian inheritance and placing the mutation on an inbred genetic background to create the congenic line UCD Wingless-2.331. The research described in this paper employed the complementary tools of breeding, genetics, and genomics to map the chromosomal location of the mutation and successively narrow the size of the region for analysis of the causative element. Specifically, the wg-2 mutation was initially mapped to a 7 Mb region of chromosome 12 using an Illumina 3 K SNP array. Subsequent SNP genotyping and exon sequencing combined with analysis from improved genome assemblies narrowed the region of interest to a maximum size of 227 kb. Within this region, 3 validated and 3 predicted candidate genes are found, and these are described. The wg-2 mutation is a valuable resource to contribute to an improved understanding of the developmental pathways involved in chicken and avian limb development as well as serving as a model for human development, as the resulting syndrome shares features with human congenital disorders. PMID:29562287
DNA methylation data analysis and its application to cancer research

PubMed Central

Ma, Xiaotu; Wang, Yi-Wei; Zhang, Michael Q; Gazdar, Adi F

2013-01-01

With the rapid development of genome-wide high-throughput technologies, including expression arrays, SNP arrays and next-generation sequencing platforms, enormous amounts of molecular data have been generated and deposited in the public domain. The application of computational approaches is required to yield biological insights from this enormous, ever-growing resource. A particularly interesting subset of these resources is related to epigenetic regulation, with DNA methylation being the most abundant data type. In this paper, we will focus on the analysis of DNA methylation data and its application to cancer studies. We first briefly review the molecular techniques that generate such data, much of which has been obtained with the use of the most recent version of Infinium HumanMethylation450 BeadChip® technology (Illumina, CA, USA). We describe the coverage of the methylome by this technique. Several examples of data mining are provided. However, it should be understood that reliance on a single aspect of epigenetics has its limitations. In the not too distant future, these defects may be rectified, providing scientists with previously unavailable opportunities to explore in detail the role of epigenetics in cancer and other disease states. PMID:23750645
Automated SNP detection from a large collection of white spruce expressed sequences: contributing factors and approaches for the categorization of SNPs

PubMed Central

Pavy, Nathalie; Parsons, Lee S; Paule, Charles; MacKay, John; Bousquet, Jean

2006-01-01

Background High-throughput genotyping technologies represent a highly efficient way to accelerate genetic mapping and enable association studies. As a first step toward this goal, we aimed to develop a resource of candidate Single Nucleotide Polymorphisms (SNP) in white spruce (Picea glauca [Moench] Voss), a softwood tree of major economic importance. Results A white spruce SNP resource encompassing 12,264 SNPs was constructed from a set of 6,459 contigs derived from Expressed Sequence Tags (EST) and by using the bayesian-based statistical software PolyBayes. Several parameters influencing the SNP prediction were analysed including the a priori expected polymorphism, the probability score (PSNP), and the contig depth and length. SNP detection in 3' and 5' reads from the same clones revealed a level of inconsistency between overlapping sequences as low as 1%. A subset of 245 predicted SNPs were verified through the independent resequencing of genomic DNA of a genotype also used to prepare cDNA libraries. The validation rate reached a maximum of 85% for SNPs predicted with either PSNP ≥ 0.95 or ≥ 0.99. A total of 9,310 SNPs were detected by using PSNP ≥ 0.95 as a criterion. The SNPs were distributed among 3,590 contigs encompassing an array of broad functional categories, with an overall frequency of 1 SNP per 700 nucleotide sites. Experimental and statistical approaches were used to evaluate the proportion of paralogous SNPs, with estimates in the range of 8 to 12%. The 3,789 coding SNPs identified through coding region annotation and ORF prediction, were distributed into 39% nonsynonymous and 61% synonymous substitutions. Overall, there were 0.9 SNP per 1,000 nonsynonymous sites and 5.2 SNPs per 1,000 synonymous sites, for a genome-wide nonsynonymous to synonymous substitution rate ratio (Ka/Ks) of 0.17. Conclusion We integrated the SNP data in the ForestTreeDB database along with functional annotations to provide a tool facilitating the choice of candidate genes for mapping purposes or association studies. PMID:16824208
Genome-wide comparative diversity uncovers multiple targets of selection for improvement in hexaploid wheat landraces and cultivars.

PubMed

Cavanagh, Colin R; Chao, Shiaoman; Wang, Shichen; Huang, Bevan Emma; Stephen, Stuart; Kiani, Seifollah; Forrest, Kerrie; Saintenac, Cyrille; Brown-Guedira, Gina L; Akhunova, Alina; See, Deven; Bai, Guihua; Pumphrey, Michael; Tomar, Luxmi; Wong, Debbie; Kong, Stephan; Reynolds, Matthew; da Silva, Marta Lopez; Bockelman, Harold; Talbert, Luther; Anderson, James A; Dreisigacker, Susanne; Baenziger, Stephen; Carter, Arron; Korzun, Viktor; Morrell, Peter Laurent; Dubcovsky, Jorge; Morell, Matthew K; Sorrells, Mark E; Hayden, Matthew J; Akhunov, Eduard

2013-05-14

Domesticated crops experience strong human-mediated selection aimed at developing high-yielding varieties adapted to local conditions. To detect regions of the wheat genome subject to selection during improvement, we developed a high-throughput array to interrogate 9,000 gene-associated single-nucleotide polymorphisms (SNP) in a worldwide sample of 2,994 accessions of hexaploid wheat including landraces and modern cultivars. Using a SNP-based diversity map we characterized the impact of crop improvement on genomic and geographic patterns of genetic diversity. We found evidence of a small population bottleneck and extensive use of ancestral variation often traceable to founders of cultivars from diverse geographic regions. Analyzing genetic differentiation among populations and the extent of haplotype sharing, we identified allelic variants subjected to selection during improvement. Selective sweeps were found around genes involved in the regulation of flowering time and phenology. An introgression of a wild relative-derived gene conferring resistance to a fungal pathogen was detected by haplotype-based analysis. Comparing selective sweeps identified in different populations, we show that selection likely acts on distinct targets or multiple functionally equivalent alleles in different portions of the geographic range of wheat. The majority of the selected alleles were present at low frequency in local populations, suggesting either weak selection pressure or temporal variation in the targets of directional selection during breeding probably associated with changing agricultural practices or environmental conditions. The developed SNP chip and map of genetic variation provide a resource for advancing wheat breeding and supporting future population genomic and genome-wide association studies in wheat.
A consensus genetic map of cowpea [Vigna unguiculata (L) Walp.] and synteny based on EST-derived SNPs.

PubMed

Muchero, Wellington; Diop, Ndeye N; Bhat, Prasanna R; Fenton, Raymond D; Wanamaker, Steve; Pottorff, Marti; Hearne, Sarah; Cisse, Ndiaga; Fatokun, Christian; Ehlers, Jeffrey D; Roberts, Philip A; Close, Timothy J

2009-10-27

Consensus genetic linkage maps provide a genomic framework for quantitative trait loci identification, map-based cloning, assessment of genetic diversity, association mapping, and applied breeding in marker-assisted selection schemes. Among "orphan crops" with limited genomic resources such as cowpea [Vigna unguiculata (L.) Walp.] (2n = 2x = 22), the use of transcript-derived SNPs in genetic maps provides opportunities for automated genotyping and estimation of genome structure based on synteny analysis. Here, we report the development and validation of a high-throughput EST-derived SNP assay for cowpea, its application in consensus map building, and determination of synteny to reference genomes. SNP mining from 183,118 ESTs sequenced from 17 cDNA libraries yielded approximately 10,000 high-confidence SNPs from which an Illumina 1,536-SNP GoldenGate genotyping array was developed and applied to 741 recombinant inbred lines from six mapping populations. Approximately 90% of the SNPs were technically successful, providing 1,375 dependable markers. Of these, 928 were incorporated into a consensus genetic map spanning 680 cM with 11 linkage groups and an average marker distance of 0.73 cM. Comparison of this cowpea genetic map to reference legumes, soybean (Glycine max) and Medicago truncatula, revealed extensive macrosynteny encompassing 85 and 82%, respectively, of the cowpea map. Regions of soybean genome duplication were evident relative to the simpler diploid cowpea. Comparison with Arabidopsis revealed extensive genomic rearrangement with some conserved microsynteny. These results support evolutionary closeness between cowpea and soybean and identify regions for synteny-based functional genomics studies in legumes.
Genome-wide comparative diversity uncovers multiple targets of selection for improvement in hexaploid wheat landraces and cultivars

PubMed Central

Cavanagh, Colin R.; Chao, Shiaoman; Wang, Shichen; Huang, Bevan Emma; Stephen, Stuart; Kiani, Seifollah; Forrest, Kerrie; Saintenac, Cyrille; Brown-Guedira, Gina L.; Akhunova, Alina; See, Deven; Bai, Guihua; Pumphrey, Michael; Tomar, Luxmi; Wong, Debbie; Kong, Stephan; Reynolds, Matthew; da Silva, Marta Lopez; Bockelman, Harold; Talbert, Luther; Anderson, James A.; Dreisigacker, Susanne; Baenziger, Stephen; Carter, Arron; Korzun, Viktor; Morrell, Peter Laurent; Dubcovsky, Jorge; Morell, Matthew K.; Sorrells, Mark E.; Hayden, Matthew J.; Akhunov, Eduard

2013-01-01

Domesticated crops experience strong human-mediated selection aimed at developing high-yielding varieties adapted to local conditions. To detect regions of the wheat genome subject to selection during improvement, we developed a high-throughput array to interrogate 9,000 gene-associated single-nucleotide polymorphisms (SNP) in a worldwide sample of 2,994 accessions of hexaploid wheat including landraces and modern cultivars. Using a SNP-based diversity map we characterized the impact of crop improvement on genomic and geographic patterns of genetic diversity. We found evidence of a small population bottleneck and extensive use of ancestral variation often traceable to founders of cultivars from diverse geographic regions. Analyzing genetic differentiation among populations and the extent of haplotype sharing, we identified allelic variants subjected to selection during improvement. Selective sweeps were found around genes involved in the regulation of flowering time and phenology. An introgression of a wild relative-derived gene conferring resistance to a fungal pathogen was detected by haplotype-based analysis. Comparing selective sweeps identified in different populations, we show that selection likely acts on distinct targets or multiple functionally equivalent alleles in different portions of the geographic range of wheat. The majority of the selected alleles were present at low frequency in local populations, suggesting either weak selection pressure or temporal variation in the targets of directional selection during breeding probably associated with changing agricultural practices or environmental conditions. The developed SNP chip and map of genetic variation provide a resource for advancing wheat breeding and supporting future population genomic and genome-wide association studies in wheat. PMID:23630259
Physical mapping of QTL for tuber yield, starch content and starch yield in tetraploid potato (Solanum tuberosum L.) by means of genome wide genotyping by sequencing and the 8.3 K SolCAP SNP array.

PubMed

Schönhals, Elske Maria; Ding, Jia; Ritter, Enrique; Paulo, Maria João; Cara, Nicolás; Tacke, Ekhard; Hofferbert, Hans-Reinhard; Lübeck, Jens; Strahwald, Josef; Gebhardt, Christiane

2017-08-22

Tuber yield and starch content of the cultivated potato are complex traits of decisive importance for breeding improved varieties. Natural variation of tuber yield and starch content depends on the environment and on multiple, mostly unknown genetic factors. Dissection and molecular identification of the genes and their natural allelic variants controlling these complex traits will lead to the development of diagnostic DNA-based markers, by which precision and efficiency of selection can be increased (precision breeding). Three case-control populations were assembled from tetraploid potato cultivars based on maximizing the differences between high and low tuber yield (TY), starch content (TSC) and starch yield (TSY, arithmetic product of TY and TSC). The case-control populations were genotyped by restriction-site associated DNA sequencing (RADseq) and the 8.3 k SolCAP SNP genotyping array. The allele frequencies of single nucleotide polymorphisms (SNPs) were compared between cases and controls. RADseq identified, depending on data filtering criteria, between 6664 and 450 genes with one or more differential SNPs for one, two or all three traits. Differential SNPs in 275 genes were detected using the SolCAP array. A genome wide association study using the SolCAP array on an independent, unselected population identified SNPs associated with tuber starch content in 117 genes. Physical mapping of the genes containing differential or associated SNPs, and comparisons between the two genome wide genotyping methods and two different populations identified genome segments on all twelve potato chromosomes harboring one or more quantitative trait loci (QTL) for TY, TSC and TSY. Several hundred genes control tuber yield and starch content in potato. They are unequally distributed on all potato chromosomes, forming clusters between 0.5-4 Mbp width. The largest fraction of these genes had unknown function, followed by genes with putative signalling and regulatory functions. The genetic control of tuber yield and starch content is interlinked. Most differential SNPs affecting both traits had antagonistic effects: The allele increasing TY decreased TSC and vice versa. Exceptions were 89 SNP alleles which had synergistic effects on TY, TSC and TSY. These and the corresponding genes are primary targets for developing diagnostic markers.
Whole-genome sequence-based genomic prediction in laying chickens with different genomic relationship matrices to account for genetic architecture.

PubMed

Ni, Guiyan; Cavero, David; Fangmann, Anna; Erbe, Malena; Simianer, Henner

2017-01-16

With the availability of next-generation sequencing technologies, genomic prediction based on whole-genome sequencing (WGS) data is now feasible in animal breeding schemes and was expected to lead to higher predictive ability, since such data may contain all genomic variants including causal mutations. Our objective was to compare prediction ability with high-density (HD) array data and WGS data in a commercial brown layer line with genomic best linear unbiased prediction (GBLUP) models using various approaches to weight single nucleotide polymorphisms (SNPs). A total of 892 chickens from a commercial brown layer line were genotyped with 336 K segregating SNPs (array data) that included 157 K genic SNPs (i.e. SNPs in or around a gene). For these individuals, genome-wide sequence information was imputed based on data from re-sequencing runs of 25 individuals, leading to 5.2 million (M) imputed SNPs (WGS data), including 2.6 M genic SNPs. De-regressed proofs (DRP) for eggshell strength, feed intake and laying rate were used as quasi-phenotypic data in genomic prediction analyses. Four weighting factors for building a trait-specific genomic relationship matrix were investigated: identical weights, -(log 10 P) from genome-wide association study results, squares of SNP effects from random regression BLUP, and variable selection based weights (known as BLUP|GA). Predictive ability was measured as the correlation between DRP and direct genomic breeding values in five replications of a fivefold cross-validation. Averaged over the three traits, the highest predictive ability (0.366 ± 0.075) was obtained when only genic SNPs from WGS data were used. Predictive abilities with genic SNPs and all SNPs from HD array data were 0.361 ± 0.072 and 0.353 ± 0.074, respectively. Prediction with -(log 10 P) or squares of SNP effects as weighting factors for building a genomic relationship matrix or BLUP|GA did not increase accuracy, compared to that with identical weights, regardless of the SNP set used. Our results show that little or no benefit was gained when using all imputed WGS data to perform genomic prediction compared to using HD array data regardless of the weighting factors tested. However, using only genic SNPs from WGS data had a positive effect on prediction ability.
Genomic analysis of genetic heterogeneity and evolution in high-grade serous ovarian carcinoma

PubMed Central

Cooke, Susanna L; Ng, Charlotte KY; Melnyk, Nataliya; Garcia, Maria J; Hardcastle, Tom; Temple, Jillian; Langdon, Simon; Huntsman, David; Brenton, James D

2010-01-01

Resistance to chemotherapy in ovarian cancer is poorly understood. Evolutionary models of cancer predict that, following treatment, resistance emerges either due to outgrowth of an intrinsically resistant sub-clone, or evolves in residual disease under the selective pressure of treatment. To investigate genetic evolution in high-grade serous (HGS) ovarian cancers we first analysed cell line series derived from three cases of HGS carcinoma before and after platinum resistance had developed (PEO1, PEO4 and PEO6, PEA1 and PEA2, and PEO14 and PEO23). Analysis with 24-colour fluorescence in situ hybridisation and SNP array comparative genomic hybridisation (CGH) showed mutually exclusive endoreduplication and loss of heterozygosity events in clones present at different timepoints in the same individual. This implies that platinum sensitive and resistant disease was not linearly related but shared a common ancestor at an early stage of tumour development. Array CGH analysis of six paired pre- and post-neoadjuvant treatment HGS samples from the CTCR-OV01 clinical study did not show extensive copy number differences, suggesting that one clone was strongly dominant at presentation. These data show that cisplatin resistance in HGS carcinoma develops from pre-existing minor clones but that enrichment for these clones is not apparent during short-term chemotherapy treatment. PMID:20581869
Insights Into Upland Cotton (Gossypium hirsutum L.) Genetic Recombination Based on 3 High-Density Single-Nucleotide Polymorphism and a Consensus Map Developed Independently With Common Parents.

PubMed

Ulloa, Mauricio; Hulse-Kemp, Amanda M; De Santiago, Luis M; Stelly, David M; Burke, John J

2017-01-01

High-density linkage maps are vital to supporting the correct placement of scaffolds and gene sequences on chromosomes and fundamental to contemporary organismal research and scientific approaches to genetic improvement, especially in paleopolyploids with exceptionally complex genomes, eg, upland cotton ( Gossypium hirsutum L., "2n = 52"). Three independently developed intraspecific upland mapping populations were analyzed to generate 3 high-density genetic linkage single-nucleotide polymorphism (SNP) maps and a consensus map using the CottonSNP63K array. The populations consisted of a previously reported F 2 , a recombinant inbred line (RIL), and reciprocal RIL population, from "Phytogen 72" and "Stoneville 474" cultivars. The cluster file provided 7417 genotyped SNP markers, resulting in 26 linkage groups corresponding to the 26 chromosomes (c) of the allotetraploid upland cotton (AD) 1 arisen from the merging of 2 genomes ("A" Old World and "D" New World). Patterns of chromosome-specific recombination were largely consistent across mapping populations. The high-density genetic consensus map included 7244 SNP markers that spanned 3538 cM and comprised 3824 SNP bins, of which 1783 and 2041 were in the A t and D t subgenomes with 1825 and 1713 cM map lengths, respectively. Subgenome average distances were nearly identical, indicating that subgenomic differences in bin number arose due to the high numbers of SNPs on the D t subgenome. Examination of expected recombination frequency or crossovers (COs) on the chromosomes within each population of the 2 subgenomes revealed that COs were also not affected by the SNPs or SNP bin number in these subgenomes. Comparative alignment analyses identified historical ancestral A t -subgenomic translocations of c02 and c03, as well as of c04 and c05. The consensus map SNP sequences aligned with high congruency to the NBI assembly of Gossypium hirsutum . However, the genomic comparisons revealed evidence of additional unconfirmed possible duplications, inversions and translocations, and unbalance SNP sequence homology or SNP sequence/loci genomic dominance, or homeolog loci bias of the upland tetraploid A t and D t subgenomes. The alignments indicated that 364 SNP-associated previously unintegrated scaffolds can be placed in pseudochromosomes of the NBI G hirsutum assembly. This is the first intraspecific SNP genetic linkage consensus map assembled in G hirsutum with a core of reproducible mendelian SNP markers assayed on different populations and it provides further knowledge of chromosome arrangement of genic and nongenic SNPs. Together, the consensus map and RIL populations provide a synergistically useful platform for localizing and identifying agronomically important loci for improvement of the cotton crop.
Genome-wide association analysis and differential expression analysis of resistance to Sclerotinia stem rot in Brassica napus.

PubMed

Wei, Lijuan; Jian, Hongju; Lu, Kun; Filardo, Fiona; Yin, Nengwen; Liu, Liezhao; Qu, Cunmin; Li, Wei; Du, Hai; Li, Jiana

2016-06-01

Brassica napus is one of the most important oil crops in the world, and stem rot caused by the fungus Sclerotinia sclerotiorum results in major losses in yield and quality. To elucidate resistance genes and pathogenesis-related genes, genome-wide association analysis of 347 accessions was performed using the Illumina 60K Brassica SNP (single nucleotide polymorphism) array. In addition, the detached stem inoculation assay was used to select five highly resistant (R) and susceptible (S) B. napus lines, 48 h postinoculation with S. sclerotiorum for transcriptome sequencing. We identified 17 significant associations for stem resistance on chromosomes A8 and C6, five of which were on A8 and 12 on C6. The SNPs identified on A8 were located in a 409-kb haplotype block, and those on C6 were consistent with previous QTL mapping efforts. Transcriptome analysis suggested that S. sclerotiorum infection activates the immune system, sulphur metabolism, especially glutathione (GSH) and glucosinolates in both R and S genotypes. Genes found to be specific to the R genotype related to the jasmonic acid pathway, lignin biosynthesis, defence response, signal transduction and encoding transcription factors. Twenty-four genes were identified in both the SNP-trait association and transcriptome sequencing analyses, including a tau class glutathione S-transferase (GSTU) gene cluster. This study provides useful insight into the molecular mechanisms underlying the plant's response to S. sclerotiorum. © 2015 Society for Experimental Biology, Association of Applied Biologists and John Wiley & Sons Ltd.
Loss of heterozygosity at D8S262: an early genetic event of hepatocarcinogenesis.

PubMed

Zhu, Qiao; Gong, Li; Liu, Xiaoyan; Wang, Jun; Ren, Pin; Zhang, Wendong; Yao, Li; Han, Xiujuan; Zhu, Shaojun; Lan, Miao; Li, Yanhong; Zhang, Wei

2015-06-16

Hepatocellular carcinoma (HCC) is a multi-factor, multi-step, multi-gene and complicated process resulting from the accumulation of sequential genetic and epigenetic alterations. An important change among them is from precancerous lesions to HCC. However, only few studies have been reported about the sequential genetic changes during hepatocarcinogenesis. We observed firstly molecular karyotypes of 10 matched HCC using Affymetrix single-nucleotide polymorphism (SNP) 6.0 arrays, and found chromosomal fragments with high incidence (more than 70%) of loss of heterozygosity (LOH). Then, we selected 28 microsatellite markers at some gene spanning these chromosomal fragments, and examined the frequency of LOH of 128 matched HCC and 43 matched precancerous lesions-dysplastic nodules (DN) by a PCR-based analysis. Finally, we investigated the expression of proteins encoded by these genes in HCC, DN and the surrounding hepatic tissues. The result of Affymetrix SNP6.0 arrays demonstrated that more than 70% (7/10) cases had chromosomal fragment deletion on 4q13.3-35.1, 8p23.2-21.2, 16q11.2-24.3, and 17p13.3-12. Among 28 microsatellite markers selected, LOH frequencies at D8S262 for DN and HCC were found to be the highest, 51.2% and 72.7%, respectively. Immunohistochemically, the positive rate of its adjacent gene CSMD1 in HCC, DN, and the surrounding hepatic tissues were 27.3% (35/128), 75% (33/44), and 82% (105/128), respectively. LOH at D8S262 may be associated with an early genetic event of hepatocarcinogenesis, and a predictor for the monitor and prevention of HCC. The virtual slides for this article can be found here: http://www.diagnosticpathology.diagnomx.eu/vs/1557074981159099 .
Biological relevance of CNV calling methods using familial relatedness including monozygotic twins.

PubMed

Castellani, Christina A; Melka, Melkaye G; Wishart, Andrea E; Locke, M Elizabeth O; Awamleh, Zain; O'Reilly, Richard L; Singh, Shiva M

2014-04-21

Studies involving the analysis of structural variation including Copy Number Variation (CNV) have recently exploded in the literature. Furthermore, CNVs have been associated with a number of complex diseases and neurodevelopmental disorders. Common methods for CNV detection use SNP, CNV, or CGH arrays, where the signal intensities of consecutive probes are used to define the number of copies associated with a given genomic region. These practices pose a number of challenges that interfere with the ability of available methods to accurately call CNVs. It has, therefore, become necessary to develop experimental protocols to test the reliability of CNV calling methods from microarray data so that researchers can properly discriminate biologically relevant data from noise. We have developed a workflow for the integration of data from multiple CNV calling algorithms using the same array results. It uses four CNV calling programs: PennCNV (PC), Affymetrix® Genotyping Console™ (AGC), Partek® Genomics Suite™ (PGS) and Golden Helix SVS™ (GH) to analyze CEL files from the Affymetrix® Human SNP 6.0 Array™. To assess the relative suitability of each program, we used individuals of known genetic relationships. We found significant differences in CNV calls obtained by different CNV calling programs. Although the programs showed variable patterns of CNVs in the same individuals, their distribution in individuals of different degrees of genetic relatedness has allowed us to offer two suggestions. The first involves the use of multiple algorithms for the detection of the largest possible number of CNVs, and the second suggests the use of PennCNV over all other methods when the use of only one software program is desirable.
Genome-wide meta-analysis of SNP-by9-ACEI/ARB and SNP-by-thiazide diuretic and effect on serum potassium in cohorts of European and African ancestry.

PubMed

Irvin, Marguerite R; Sitlani, Colleen M; Noordam, Raymond; Avery, Christie L; Bis, Joshua C; Floyd, James S; Li, Jin; Limdi, Nita A; Srinivasasainagendra, Vinodh; Stewart, James; de Mutsert, Renée; Mook-Kanamori, Dennis O; Lipovich, Leonard; Kleinbrink, Erica L; Smith, Albert; Bartz, Traci M; Whitsel, Eric A; Uitterlinden, Andre G; Wiggins, Kerri L; Wilson, James G; Zhi, Degui; Stricker, Bruno H; Rotter, Jerome I; Arnett, Donna K; Psaty, Bruce M; Lange, Leslie A

2018-06-01

We evaluated interactions of SNP-by-ACE-I/ARB and SNP-by-TD on serum potassium (K+) among users of antihypertensive treatments (anti-HTN). Our study included seven European-ancestry (EA) (N = 4835) and four African-ancestry (AA) cohorts (N = 2016). We performed race-stratified, fixed-effect, inverse-variance-weighted meta-analyses of 2.5 million SNP-by-drug interaction estimates; race-combined meta-analysis; and trans-ethnic fine-mapping. Among EAs, we identified 11 significant SNPs (P < 5 × 10 -8 ) for SNP-ACE-I/ARB interactions on serum K+ that were located between NR2F1-AS1 and ARRDC3-AS1 on chromosome 5 (top SNP rs6878413 P = 1.7 × 10 -8 ; ratio of serum K+ in ACE-I/ARB exposed compared to unexposed is 1.0476, 1.0280, 1.0088 for the TT, AT, and AA genotypes, respectively). Trans-ethnic fine mapping identified the same group of SNPs on chromosome 5 as genome-wide significant for the ACE-I/ARB analysis. In conclusion, SNP-by-ACE-I /ARB interaction analyses uncovered loci that, if replicated, could have future implications for the prevention of arrhythmias due to anti-HTN treatment-related hyperkalemia. Before these loci can be identified as clinically relevant, future validation studies of equal or greater size in comparison to our discovery effort are needed.
Diagnosis of intrachromosomal amplification of chromosome 21 (iAMP21) by molecular cytogenetics in pediatric acute lymphoblastic leukemia.

PubMed

Duployez, Nicolas; Boudry-Labis, Elise; Decool, Gauthier; Grzych, Guillaume; Grardel, Nathalie; Abou Chahla, Wadih; Preudhomme, Claude; Roche-Lestienne, Catherine

2015-10-01

Intrachromosomal amplification of chromosome 21 (iAMP21) defines a distinct cytogenetic subgroup of B-cell precursor acute lymphoblastic leukemia (BCP-ALL) with poor prognosis that should be investigated in routine practice. Single-nucleotide polymorphism (SNP)-array provides a useful method to detect such cases showing a highly characteristic profile.
A high density linkage map of the ancestral diploid strawberry F. iinumae using SNP markers from the ISTRAW90 array and GBS

USDA-ARS?s Scientific Manuscript database

Fragaria iinumae is recognized as an ancestor of the octoploid strawberry species, including the cultivated strawberry, Fragaria ×ananassa. Here we report the construction of the first high density linkage map for F. iinumae. The map is based on two high-throughput techniques of single nucleotide p...

Canonical single nucleotide polymorphisms (SNPs) for high-resolution subtyping of Shiga-toxin producing Escherichia coli (STEC) O157:H7

USDA-ARS?s Scientific Manuscript database

The objective of this study was to develop a canonical SNP panel for subtyping of Shiga-toxin producing Escherichia coli (STEC). To this purpose, 906 putative SNPs were identified using resequencing tiling arrays. A subset of 391 SNPs was further screened using high-throughput TaqMan PCR against a d...
Development and preliminary evaluation of a 90K Axiom® SNP array for the allo-octoploid cultivated strawberry Fragaria ×ananassa

USDA-ARS?s Scientific Manuscript database

A high-throughput genotyping platform is needed to enable marker-assisted breeding in the allo-octoploid cultivated strawberry Fragaria ×ananassa. Short-read sequences from one diploid and 19 octoploid accessions were aligned to the diploid Fragaria vesca ‘Hawaii 4’ reference genome to identify sing...
Prioritizing individual genetic variants after kernel machine testing using variable selection.

PubMed

He, Qianchuan; Cai, Tianxi; Liu, Yang; Zhao, Ni; Harmon, Quaker E; Almli, Lynn M; Binder, Elisabeth B; Engel, Stephanie M; Ressler, Kerry J; Conneely, Karen N; Lin, Xihong; Wu, Michael C

2016-12-01

Kernel machine learning methods, such as the SNP-set kernel association test (SKAT), have been widely used to test associations between traits and genetic polymorphisms. In contrast to traditional single-SNP analysis methods, these methods are designed to examine the joint effect of a set of related SNPs (such as a group of SNPs within a gene or a pathway) and are able to identify sets of SNPs that are associated with the trait of interest. However, as with many multi-SNP testing approaches, kernel machine testing can draw conclusion only at the SNP-set level, and does not directly inform on which one(s) of the identified SNP set is actually driving the associations. A recently proposed procedure, KerNel Iterative Feature Extraction (KNIFE), provides a general framework for incorporating variable selection into kernel machine methods. In this article, we focus on quantitative traits and relatively common SNPs, and adapt the KNIFE procedure to genetic association studies and propose an approach to identify driver SNPs after the application of SKAT to gene set analysis. Our approach accommodates several kernels that are widely used in SNP analysis, such as the linear kernel and the Identity by State (IBS) kernel. The proposed approach provides practically useful utilities to prioritize SNPs, and fills the gap between SNP set analysis and biological functional studies. Both simulation studies and real data application are used to demonstrate the proposed approach. © 2016 WILEY PERIODICALS, INC.
Comparative Analysis of Disease-Linked Single Nucleotide Polymorphic Markers from Brassica rapa for Their Applicability to Brassica oleracea

PubMed Central

Cho, Young-Il; Ahn, Yul-Kyun; Tripathi, Swati; Kim, Jeong-Ho; Lee, Hye-Eun; Kim, Do-Sun

2015-01-01

Numerous studies using single nucleotide polymorphisms (SNPs) have been conducted in humans, and other animals, and in major crops, including rice, soybean, and Chinese cabbage. However, the number of SNP studies in cabbage is limited. In this present study, we evaluated whether 7,645 SNPs previously identified as molecular markers linked to disease resistance in the Brassica rapa genome could be applied to B. oleracea. In a BLAST analysis using the SNP sequences of B. rapa and B. oleracea genomic sequence data registered in the NCBI database, 256 genes for which SNPs had been identified in B. rapa were found in B. oleracea. These genes were classified into three functional groups: molecular function (64 genes), biological process (96 genes), and cellular component (96 genes). A total of 693 SNP markers, including 145 SNP markers [BRH—developed from the B. rapa genome for high-resolution melt (HRM) analysis], 425 SNP markers (BRP—based on the B. rapa genome that could be applied to B. oleracea), and 123 new SNP markers (BRS—derived from BRP and designed for HRM analysis), were investigated for their ability to amplify sequences from cabbage genomic DNA. In total, 425 of the SNP markers (BRP-based on B. rapa genome), selected from 7,645 SNPs, were successfully applied to B. oleracea. Using PCR, 108 of 145 BRH (74.5%), 415 of 425 BRP (97.6%), and 118 of 123 BRS (95.9%) showed amplification, suggesting that it is possible to apply SNP markers developed based on the B. rapa genome to B. oleracea. These results provide valuable information that can be utilized in cabbage genetics and breeding programs using molecular markers derived from other Brassica species. PMID:25790283
Genome-Wide Association Study of a Varroa-Specific Defense Behavior in Honeybees (Apis mellifera)

PubMed Central

Spötter, Andreas; Gupta, Pooja; Mayer, Manfred; Reinsch, Norbert

2016-01-01

Honey bees are exposed to many damaging pathogens and parasites. The most devastating is Varroa destructor, which mainly affects the brood. A promising approach for preventing its spread is to breed Varroa-resistant honey bees. One trait that has been shown to provide significant resistance against the Varroa mite is hygienic behavior, which is a behavioral response of honeybee workers to brood diseases in general. Here, we report the use of an Affymetrix 44K SNP array to analyze SNPs associated with detection and uncapping of Varroa-parasitized brood by individual worker bees (Apis mellifera). For this study, 22 000 individually labeled bees were video-monitored and a sample of 122 cases and 122 controls was collected and analyzed to determine the dependence/independence of SNP genotypes from hygienic and nonhygienic behavior on a genome-wide scale. After false-discovery rate correction of the P values, 6 SNP markers had highly significant associations with the trait investigated (α < 0.01). Inspection of the genomic regions around these SNPs led to the discovery of putative candidate genes. PMID:26774061
Genome-Wide Association Mapping of Barley Yellow Dwarf Virus Tolerance in Spring Oat (Avena sativa L.)

PubMed Central

Foresman, Bradley J.; Oliver, Rebekah E.; Jackson, Eric W.; Chao, Shiaoman; Arruda, Marcio P.; Kolb, Frederic L.

2016-01-01

Barley yellow dwarf viruses (BYDVs) are responsible for the disease barley yellow dwarf (BYD) and affect many cereals including oat (Avena sativa L.). Until recently, the molecular marker technology in oat has not allowed for many marker-trait association studies to determine the genetic mechanisms for tolerance. A genome-wide association study (GWAS) was performed on 428 spring oat lines using a recently developed high-density oat single nucleotide polymorphism (SNP) array as well as a SNP-based consensus map. Marker-trait associations were performed using a Q-K mixed model approach to control for population structure and relatedness. Six significant SNP-trait associations representing two QTL were found on chromosomes 3C (Mrg17) and 18D (Mrg04). This is the first report of BYDV tolerance QTL on chromosome 3C (Mrg17) and 18D (Mrg04). Haplotypes using the two QTL were evaluated and distinct classes for tolerance were identified based on the number of favorable alleles. A large number of lines carrying both favorable alleles were observed in the panel. PMID:27175781
Uniparental disomy and prenatal phenotype

PubMed Central

Li, Xiaofei; Liu, Yan; Yue, Song; Wang, Li; Zhang, Tiejuan; Guo, Cuixia; Hu, Wenjie; Kagan, Karl-Oliver; Wu, Qingqing

2017-01-01

Abstract Rationale: Uniparental disomy (UPD) gives a description of the inheritance of both homologues of a chromosome pair from the same parent. The consequences of UPD depend on the specific chromosome/segment involved and its parental origin. Patient concerns: We report prenatal phenotypes of 2 rare cases of UPD. Diagnoses: The prenatal phenotype of case 1 included sonographic markers such as enlarged nuchal translucency (NT), absent nasal bone, short femur and humerus length, and several structural malformations involving Dandy–Walker malformation and congenital heart defects. The prenatal phenotype of Case 2 are sonographic markers, including enlarged NT, thickened nuchal fold, ascites, and polyhydramnios without apparent structural malformations. Interventions: Conventional G-band karyotype appears normal in case 1, while it shows normal chromosomes with a small supernumerary marker chromosome (sSMC) in case 2. Genetic etiology was left unknown until single-nucleotide polymorphism-based array (SNP-array) was performed, and segmental paternal UPD 22 was identified in case 1 and segmental paternal UPD 14 was found in case 2. Outcomes: The parents of case 1 chose termination of pregnancy. The neonate of case 2 was born prematurely with a bellshaped small thorax and died within a day. Lessons: UPD cases are rare and the phenotypes are different, which depend on the origin and affected chromosomal part. If a fetus shows multiple anomalies that cannot be attributed to a common aneuploidy or a genetic syndrome, or manifests some features possibly related to an UPD syndrome, such as detection of sSMC, SNP-array should be considered. PMID:29137034
Uniparental disomy and prenatal phenotype: Two case reports and review.

PubMed

Li, Xiaofei; Liu, Yan; Yue, Song; Wang, Li; Zhang, Tiejuan; Guo, Cuixia; Hu, Wenjie; Kagan, Karl-Oliver; Wu, Qingqing

2017-11-01

Uniparental disomy (UPD) gives a description of the inheritance of both homologues of a chromosome pair from the same parent. The consequences of UPD depend on the specific chromosome/segment involved and its parental origin. We report prenatal phenotypes of 2 rare cases of UPD. The prenatal phenotype of case 1 included sonographic markers such as enlarged nuchal translucency (NT), absent nasal bone, short femur and humerus length, and several structural malformations involving Dandy-Walker malformation and congenital heart defects. The prenatal phenotype of Case 2 are sonographic markers, including enlarged NT, thickened nuchal fold, ascites, and polyhydramnios without apparent structural malformations. Conventional G-band karyotype appears normal in case 1, while it shows normal chromosomes with a small supernumerary marker chromosome (sSMC) in case 2. Genetic etiology was left unknown until single-nucleotide polymorphism-based array (SNP-array) was performed, and segmental paternal UPD 22 was identified in case 1 and segmental paternal UPD 14 was found in case 2. The parents of case 1 chose termination of pregnancy. The neonate of case 2 was born prematurely with a bellshaped small thorax and died within a day. UPD cases are rare and the phenotypes are different, which depend on the origin and affected chromosomal part. If a fetus shows multiple anomalies that cannot be attributed to a common aneuploidy or a genetic syndrome, or manifests some features possibly related to an UPD syndrome, such as detection of sSMC, SNP-array should be considered.
A parallel SNP array study of genomic aberrations associated with mental retardation in patients and general population in Estonia.

PubMed

Männik, Katrin; Parkel, Sven; Palta, Priit; Zilina, Olga; Puusepp, Helen; Esko, Tõnu; Mägi, Reedik; Nõukas, Margit; Veidenberg, Andres; Nelis, Mari; Metspalu, Andres; Remm, Maido; Ounap, Katrin; Kurg, Ants

2011-01-01

The increasing use of whole-genome array screening has revealed the important role of DNA copy-number variations in the pathogenesis of neurodevelopmental disorders and several recurrent genomic disorders have been defined during recent years. However, some variants considered to be pathogenic have also been observed in phenotypically normal individuals. This underlines the importance of further characterization of genomic variants with potentially variable expressivity in both patient and general population cohorts to clarify their phenotypic consequence. In this study whole-genome SNP arrays were used to investigate genomic rearrangements in 77 Estonian families with idiopathic mental retardation. In addition to this family-based approach, phenotype and genotype data from a cohort of 1000 individuals in the general population were used for accurate interpretation of aberrations found in mental retardation patients. Relevant structural aberrations were detected in 18 of the families analyzed (23%). Fifteen of those were in genomic regions where clinical significance has previously been established. In 3 families, 4 novel aberrations associated with intellectual disability were detected in chromosome regions 2p25.1-p24.3, 3p12.1-p11.2, 7p21.2-p21.1 and Xq28. Carriers of imbalances in 15q13.3, 16p11.2 and Xp22.31 were identified among reference individuals, affirming the variable phenotypic consequence of rare variants in some genomic regions considered as pathogenic. Copyright © 2010 Elsevier Masson SAS. All rights reserved.
Association between SLC11A1 (NRAMP1) polymorphisms and susceptibility to tuberculosis in Chinese Holstein cattle.

PubMed

Liu, Kaihua; Zhang, Bin; Teng, Zhaochun; Wang, Youtao; Dong, Guodong; Xu, Cong; Qin, Bo; Song, Chunlian; Chai, Jun; Li, Yang; Shi, Xianwei; Shu, Xianghua; Zhang, Yifang

2017-03-01

We investigated the associations between SLC11A1 polymorphisms and susceptibility to tuberculosis (TB) in Chinese Holstein cattle, using a case-control study of 136 animals that had positive reactions to TB tests and showed symptoms and 96 animals that had negative reactions to tests and showed no symptoms. Polymerase chain reaction (PCR) sequencing and the restriction fragment length polymorphism (RFLP) technique were used to detect and determine SLC11A1 polymorphisms. Association analysis identified significant correlations between SLC11A1 polymorphisms and susceptibility/resistance to TB, and two genetic markers for SLC11A1 were established using PCR-RFLP. Sequence alignment of SLC11A1 revealed seven single-nucleotide polymorphisms (SNPs). This is the first report of MaeII PCR-RFLP markers for the SLC11A1-SNP3 site and PstI PCR-RFLP markers for the SLC11A1-SNP5 and SLC11A1-SNP6 sites in Chinese Holstein cattle. Logistic regression analysis indicated that SLC11A1-SNP1, SLC11A1-SNP3, and SLC11A1-SNP5 were significantly associated with susceptibility/resistance to TB. Two genotypes of SLC11A1-SNP3 were susceptible to TB, whereas one genotype of SLC11A1-SNP1 and two genotypes of SLC11A1-SNP5 were resistant. Haplotype analysis showed that nine haplotypes were potentially resistant to TB. After Bonferroni correction, three of the haplotypes remained significantly associated with TB resistance. SLC11A1 is a useful candidate gene related to TB in Chinese Holstein cattle. Copyright © 2016 Elsevier Ltd. All rights reserved.
The effect of algorithms on copy number variant detection.

PubMed

Tsuang, Debby W; Millard, Steven P; Ely, Benjamin; Chi, Peter; Wang, Kenneth; Raskind, Wendy H; Kim, Sulgi; Brkanac, Zoran; Yu, Chang-En

2010-12-30

The detection of copy number variants (CNVs) and the results of CNV-disease association studies rely on how CNVs are defined, and because array-based technologies can only infer CNVs, CNV-calling algorithms can produce vastly different findings. Several authors have noted the large-scale variability between CNV-detection methods, as well as the substantial false positive and false negative rates associated with those methods. In this study, we use variations of four common algorithms for CNV detection (PennCNV, QuantiSNP, HMMSeg, and cnvPartition) and two definitions of overlap (any overlap and an overlap of at least 40% of the smaller CNV) to illustrate the effects of varying algorithms and definitions of overlap on CNV discovery. We used a 56 K Illumina genotyping array enriched for CNV regions to generate hybridization intensities and allele frequencies for 48 Caucasian schizophrenia cases and 48 age-, ethnicity-, and gender-matched control subjects. No algorithm found a difference in CNV burden between the two groups. However, the total number of CNVs called ranged from 102 to 3,765 across algorithms. The mean CNV size ranged from 46 kb to 787 kb, and the average number of CNVs per subject ranged from 1 to 39. The number of novel CNVs not previously reported in normal subjects ranged from 0 to 212. Motivated by the availability of multiple publicly available genome-wide SNP arrays, investigators are conducting numerous analyses to identify putative additional CNVs in complex genetic disorders. However, the number of CNVs identified in array-based studies, and whether these CNVs are novel or valid, will depend on the algorithm(s) used. Thus, given the variety of methods used, there will be many false positives and false negatives. Both guidelines for the identification of CNVs inferred from high-density arrays and the establishment of a gold standard for validation of CNVs are needed.
Detection of genetic association and functional polymorphisms of UGDH affecting milk production trait in Chinese Holstein cattle.

PubMed

Xu, Qing; Mei, Gui; Sun, Dongxiao; Zhang, Qin; Zhang, Yuan; Yin, Cengceng; Chen, Huiyong; Ding, Xiangdong; Liu, Jianfeng

2012-11-02

We previously localized a quantitative trait locus (QTL) on bovine chromosome 6 affecting milk production traits to a 1.5-Mb region between BMS483 and MNB-209 via genome scanning followed by fine mapping. Totally 15 genes were mapped within such linkage region through bioinformatic analysis of the cattle-human comparative map and bovine genome assembly. Of them, the UDP-glucose dehydrogenase (UGDH) was suggested as a potential positional candidate gene for milk production traits based on its corresponding physiological and biochemical functions and genetic effects. By sequencing all the coding exons and the untranslated regions in UGDH with pooled DNA of 8 sires represented the separated families detected in our previous studies, a total of ten SNPs were identified and genotyped in 1417 Holstein cows of 8 separation families. Individual SNP-based association analysis revealed 4 significant associations of SNP Ex1-1, SNP Int3-1, SNP Int5-1, and SNP Ex12-3 with milk yield (P < 0.05), and 2 significant associations of SNP Ex1-1 and SNP Ex12-3 with protein yield (P < 0.05). Furthermore, our haplotype-based association analyses indicated that haplotypes G-C-C, formed by SNP Ex12-2-SNP Int11-1-SNP Ex11-1, T-G, formed by SNP Int9-3-SNP Int9-2, and C-C, formed by SNP Int5-1-SNP Int3-1, are significantly associated with protein percentage (F=4.15; P=0.0418) and fat percentage (F=5.18~7.25; P=0.0072~0.0231). Finally, by using an in vitro expression assay, we demonstrated that the A allele of SNP Ex1-1 and T allele of SNP Ex11-1of UGDH significantly decreases the expression of UGDH by 68.0% at the RNA, and 50.1% at the protein level, suggesting that SNP Ex1-1 and Ex11-1 represent two functional polymorphisms affecting expression of UGDH and may partly contributed to the observed association of the gene with milk production traits in our samples. Taken together, our findings strongly indicate that UGDH gene could be involved in genetic variation underlying the QTL for milk production traits.
Joint Identification of Genetic Variants for Physical Activity in Korean Population

PubMed Central

Kim, Jayoun; Kim, Jaehee; Min, Haesook; Oh, Sohee; Kim, Yeonjung; Lee, Andy H.; Park, Taesung

2014-01-01

There has been limited research on genome-wide association with physical activity (PA). This study ascertained genetic associations between PA and 344,893 single nucleotide polymorphism (SNP) markers in 8842 Korean samples. PA data were obtained from a validated questionnaire that included information on PA intensity and duration. Metabolic equivalent of tasks were calculated to estimate the total daily PA level for each individual. In addition to single- and multiple-SNP association tests, a pathway enrichment analysis was performed to identify the biological significance of SNP markers. Although no significant SNP was found at genome-wide significance level via single-SNP association tests, 59 genetic variants mapped to 76 genes were identified via a multiple SNP approach using a bootstrap selection stability measure. Pathway analysis for these 59 variants showed that maturity onset diabetes of the young (MODY) was enriched. Joint identification of SNPs could enable the identification of multiple SNPs with good predictive power for PA and a pathway enriched for PA. PMID:25026172
snpTree--a web-server to identify and construct SNP trees from whole genome sequence data.

PubMed

Leekitcharoenphon, Pimlapas; Kaas, Rolf S; Thomsen, Martin Christen Frølund; Friis, Carsten; Rasmussen, Simon; Aarestrup, Frank M

2012-01-01

The advances and decreasing economical cost of whole genome sequencing (WGS), will soon make this technology available for routine infectious disease epidemiology. In epidemiological studies, outbreak isolates have very little diversity and require extensive genomic analysis to differentiate and classify isolates. One of the successfully and broadly used methods is analysis of single nucletide polymorphisms (SNPs). Currently, there are different tools and methods to identify SNPs including various options and cut-off values. Furthermore, all current methods require bioinformatic skills. Thus, we lack a standard and simple automatic tool to determine SNPs and construct phylogenetic tree from WGS data. Here we introduce snpTree, a server for online-automatic SNPs analysis. This tool is composed of different SNPs analysis suites, perl and python scripts. snpTree can identify SNPs and construct phylogenetic trees from WGS as well as from assembled genomes or contigs. WGS data in fastq format are aligned to reference genomes by BWA while contigs in fasta format are processed by Nucmer. SNPs are concatenated based on position on reference genome and a tree is constructed from concatenated SNPs using FastTree and a perl script. The online server was implemented by HTML, Java and python script.The server was evaluated using four published bacterial WGS data sets (V. cholerae, S. aureus CC398, S. Typhimurium and M. tuberculosis). The evaluation results for the first three cases was consistent and concordant for both raw reads and assembled genomes. In the latter case the original publication involved extensive filtering of SNPs, which could not be repeated using snpTree. The snpTree server is an easy to use option for rapid standardised and automatic SNP analysis in epidemiological studies also for users with limited bioinformatic experience. The web server is freely accessible at http://www.cbs.dtu.dk/services/snpTree-1.0/.
Clarifying sub-genomic positions of QTLs for flowering habit and fruit quality in U.S. strawberry (Fragaria×ananassa) breeding populations using pedigree-based QTL analysis

PubMed Central

Verma, Sujeet; Zurn, Jason D; Salinas, Natalia; Mathey, Megan M; Denoyes, Beatrice; Hancock, James F; Finn, Chad E; Bassil, Nahla V; Whitaker, Vance M

2017-01-01

The cultivated strawberry (Fragaria×ananassa) is consumed worldwide for its flavor and nutritional benefits. Genetic analysis of commercially important traits in strawberry are important for the development of breeding methods and tools for this species. Although several quantitative trait loci (QTL) have been previously detected for fruit quality and flowering traits using low-density genetic maps, clarity on the sub-genomic locations of these QTLs was missing. Recent discoveries in allo-octoploid strawberry genomics led to the development of the IStraw90 single-nucleotide polymorphism (SNP) array, enabling high-density genetic maps and finer resolution QTL analysis. In this study, breeder-specified traits were evaluated in the Eastern (Michigan) and Western (Oregon) United States for a common set of breeding populations during 2 years. Several QTLs were validated for soluble solids content (SSC), fruit weight (FWT), pH and titratable acidity (TA) using a pedigree-based QTL analysis approach. For fruit quality, a QTL for SSC on linkage group (LG) 6A, a QTL for FWT on LG 2BII, a QTL for pH on LG 4CII and two QTLs for TA on LGs 2A and 5B were detected. In addition, a large-effect QTL for flowering was detected at the distal end of LG 4A, coinciding with the FaPFRU locus. Marker haplotype analysis in the FaPFRU region indicated that the homozygous recessive genotype was highly predictive of seasonal flowering. SNP probes in the FaPFRU region may help facilitate marker-assisted selection for this trait. PMID:29138689
Clarifying sub-genomic positions of QTLs for flowering habit and fruit quality in U.S. strawberry (Fragaria×ananassa) breeding populations using pedigree-based QTL analysis.

PubMed

Verma, Sujeet; Zurn, Jason D; Salinas, Natalia; Mathey, Megan M; Denoyes, Beatrice; Hancock, James F; Finn, Chad E; Bassil, Nahla V; Whitaker, Vance M

2017-01-01

The cultivated strawberry ( Fragaria × ananassa ) is consumed worldwide for its flavor and nutritional benefits. Genetic analysis of commercially important traits in strawberry are important for the development of breeding methods and tools for this species. Although several quantitative trait loci (QTL) have been previously detected for fruit quality and flowering traits using low-density genetic maps, clarity on the sub-genomic locations of these QTLs was missing. Recent discoveries in allo-octoploid strawberry genomics led to the development of the IStraw90 single-nucleotide polymorphism (SNP) array, enabling high-density genetic maps and finer resolution QTL analysis. In this study, breeder-specified traits were evaluated in the Eastern (Michigan) and Western (Oregon) United States for a common set of breeding populations during 2 years. Several QTLs were validated for soluble solids content (SSC), fruit weight (FWT), pH and titratable acidity (TA) using a pedigree-based QTL analysis approach. For fruit quality, a QTL for SSC on linkage group (LG) 6A, a QTL for FWT on LG 2BII, a QTL for pH on LG 4CII and two QTLs for TA on LGs 2A and 5B were detected. In addition, a large-effect QTL for flowering was detected at the distal end of LG 4A, coinciding with the FaPFRU locus. Marker haplotype analysis in the FaPFRU region indicated that the homozygous recessive genotype was highly predictive of seasonal flowering. SNP probes in the FaPFRU region may help facilitate marker-assisted selection for this trait.
Identification of Promising Mutants Associated with Egg Production Traits Revealed by Genome-Wide Association Study.

PubMed

Yuan, Jingwei; Sun, Congjiao; Dou, Taocun; Yi, Guoqiang; Qu, LuJiang; Qu, Liang; Wang, Kehua; Yang, Ning

2015-01-01

Egg number (EN), egg laying rate (LR) and age at first egg (AFE) are important production traits related to egg production in poultry industry. To better understand the knowledge of genetic architecture of dynamic EN during the whole laying cycle and provide the precise positions of associated variants for EN, LR and AFE, laying records from 21 to 72 weeks of age were collected individually for 1,534 F2 hens produced by reciprocal crosses between White Leghorn and Dongxiang Blue-shelled chicken, and their genotypes were assayed by chicken 600 K Affymetrix high density genotyping arrays. Subsequently, pedigree and SNP-based genetic parameters were estimated and a genome-wide association study (GWAS) was conducted on EN, LR and AFE. The heritability estimates were similar between pedigree and SNP-based estimates varying from 0.17 to 0.36. In the GWA analysis, we identified nine genome-wide significant loci associated with EN of the laying periods from 21 to 26 weeks, 27 to 36 weeks and 37 to 72 weeks. Analysis of GTF2A1 and CLSPN suggested that they influenced the function of ovary and uterus, and may be considered as relevant candidates. The identified SNP rs314448799 for accumulative EN from 21 to 40 weeks on chromosome 5 created phenotypic differences of 6.86 eggs between two homozygous genotypes, which could be potentially applied to the molecular breeding for EN selection. Moreover, our finding showed that LR was a moderate polygenic trait. The suggestive significant region on chromosome 16 for AFE suggested the relationship between sex maturity and immune in the current population. The present study comprehensively evaluates the role of genetic variants in the development of egg laying. The findings will be helpful to investigation of causative genes function and future marker-assisted selection and genomic selection in chickens.
Genome wide association study and genomic prediction for fatty acid composition in Chinese Simmental beef cattle using high density SNP array.

PubMed

Zhu, Bo; Niu, Hong; Zhang, Wengang; Wang, Zezhao; Liang, Yonghu; Guan, Long; Guo, Peng; Chen, Yan; Zhang, Lupei; Guo, Yong; Ni, Heming; Gao, Xue; Gao, Huijiang; Xu, Lingyang; Li, Junya

2017-06-14

Fatty acid composition of muscle is an important trait contributing to meat quality. Recently, genome-wide association study (GWAS) has been extensively used to explore the molecular mechanism underlying important traits in cattle. In this study, we performed GWAS using high density SNP array to analyze the association between SNPs and fatty acids and evaluated the accuracy of genomic prediction for fatty acids in Chinese Simmental cattle. Using the BayesB method, we identified 35 and 7 regions in Chinese Simmental cattle that displayed significant associations with individual fatty acids and fatty acid groups, respectively. We further obtained several candidate genes which may be involved in fatty acid biosynthesis including elongation of very long chain fatty acids protein 5 (ELOVL5), fatty acid synthase (FASN), caspase 2 (CASP2) and thyroglobulin (TG). Specifically, we obtained strong evidence of association signals for one SNP located at 51.3 Mb for FASN using Genome-wide Rapid Association Mixed Model and Regression-Genomic Control (GRAMMAR-GC) approaches. Also, region-based association test identified multiple SNPs within FASN and ELOVL5 for C14:0. In addition, our result revealed that the effectiveness of genomic prediction for fatty acid composition using BayesB was slightly superior over GBLUP in Chinese Simmental cattle. We identified several significantly associated regions and loci which can be considered as potential candidate markers for genomics-assisted breeding programs. Using multiple methods, our results revealed that FASN and ELOVL5 are associated with fatty acids with strong evidence. Our finding also suggested that it is feasible to perform genomic selection for fatty acids in Chinese Simmental cattle.
Diagnosis of intrachromosomal amplification of chromosome 21 (iAMP21) by molecular cytogenetics in pediatric acute lymphoblastic leukemia

PubMed Central

Duployez, Nicolas; Boudry-Labis, Elise; Decool, Gauthier; Grzych, Guillaume; Grardel, Nathalie; Abou Chahla, Wadih; Preudhomme, Claude; Roche-Lestienne, Catherine

2015-01-01

Key Clinical Message Intrachromosomal amplification of chromosome 21 (iAMP21) defines a distinct cytogenetic subgroup of B-cell precursor acute lymphoblastic leukemia (BCP-ALL) with poor prognosis that should be investigated in routine practice. Single-nucleotide polymorphism (SNP)-array provides a useful method to detect such cases showing a highly characteristic profile. PMID:26509013
Genome-wide association studies reveal similar genetic architecture with shared and unique QTL for Bacterial Cold Water Disease resistance in two rainbow trout (Oncorhynchus mykiss) breeding populations

USDA-ARS?s Scientific Manuscript database

Bacterial cold water disease (BCWD) causes significant mortality and economic losses in salmonid aquaculture. In previous studies, we identified moderate-large effect QTL for BCWD resistance in rainbow trout (Oncorhynchus mykiss). However, the recent availability of a 57K SNP array and a genome phys...

Whole genome sequences are required to fully resolve the linkage disequilibrium structure of human populations.

PubMed

Pengelly, Reuben J; Tapper, William; Gibson, Jane; Knut, Marcin; Tearle, Rick; Collins, Andrew; Ennis, Sarah

2015-09-03

An understanding of linkage disequilibrium (LD) structures in the human genome underpins much of medical genetics and provides a basis for disease gene mapping and investigating biological mechanisms such as recombination and selection. Whole genome sequencing (WGS) provides the opportunity to determine LD structures at maximal resolution. We compare LD maps constructed from WGS data with LD maps produced from the array-based HapMap dataset, for representative European and African populations. WGS provides up to 5.7-fold greater SNP density than array-based data and achieves much greater resolution of LD structure, allowing for identification of up to 2.8-fold more regions of intense recombination. The absence of ascertainment bias in variant genotyping improves the population representativeness of the WGS maps, and highlights the extent of uncaptured variation using array genotyping methodologies. The complete capture of LD patterns using WGS allows for higher genome-wide association study (GWAS) power compared to array-based GWAS, with WGS also allowing for the analysis of rare variation. The impact of marker ascertainment issues in arrays has been greatest for Sub-Saharan African populations where larger sample sizes and substantially higher marker densities are required to fully resolve the LD structure. WGS provides the best possible resource for LD mapping due to the maximal marker density and lack of ascertainment bias. WGS LD maps provide a rich resource for medical and population genetics studies. The increasing availability of WGS data for large populations will allow for improved research utilising LD, such as GWAS and recombination biology studies.
SNP-RFLPing 2: an updated and integrated PCR-RFLP tool for SNP genotyping.

PubMed

Chang, Hsueh-Wei; Cheng, Yu-Huei; Chuang, Li-Yeh; Yang, Cheng-Hong

2010-04-08

PCR-restriction fragment length polymorphism (RFLP) assay is a cost-effective method for SNP genotyping and mutation detection, but the manual mining for restriction enzyme sites is challenging and cumbersome. Three years after we constructed SNP-RFLPing, a freely accessible database and analysis tool for restriction enzyme mining of SNPs, significant improvements over the 2006 version have been made and incorporated into the latest version, SNP-RFLPing 2. The primary aim of SNP-RFLPing 2 is to provide comprehensive PCR-RFLP information with multiple functionality about SNPs, such as SNP retrieval to multiple species, different polymorphism types (bi-allelic, tri-allelic, tetra-allelic or indels), gene-centric searching, HapMap tagSNPs, gene ontology-based searching, miRNAs, and SNP500Cancer. The RFLP restriction enzymes and the corresponding PCR primers for the natural and mutagenic types of each SNP are simultaneously analyzed. All the RFLP restriction enzyme prices are also provided to aid selection. Furthermore, the previously encountered updating problems for most SNP related databases are resolved by an on-line retrieval system. The user interfaces for functional SNP analyses have been substantially improved and integrated. SNP-RFLPing 2 offers a new and user-friendly interface for RFLP genotyping that can be used in association studies and is freely available at http://bio.kuas.edu.tw/snp-rflping2.
Contributions of IKZF1, DDC, CDKN2A, CEBPE, and LMO1 Gene Polymorphisms to Acute Lymphoblastic Leukemia in a Yemeni Population.

PubMed

Al-Absi, Boshra; Razif, Muhammad F M; Noor, Suzita M; Saif-Ali, Riyadh; Aqlan, Mohammed; Salem, Sameer D; Ahmed, Radwan H; Muniandy, Sekaran

2017-10-01

Genome-wide and candidate gene association studies have previously revealed links between a predisposition to acute lymphoblastic leukemia (ALL) and genetic polymorphisms in the following genes: IKZF1 (7p12.2; ID: 10320), DDC (7p12.2; ID: 1644), CDKN2A (9p21.3; ID: 1029), CEBPE (14q11.2; ID: 1053), and LMO1 (11p15; ID: 4004). In this study, we aimed to conduct an investigation into the possible association between polymorphisms in these genes and ALL within a sample of Yemeni children of Arab-Asian descent. Seven single-nucleotide polymorphisms (SNPs) in IKZF1, three SNPs in DDC, two SNPs in CDKN2A, two SNPs in CEBPE, and three SNPs in LMO1 were genotyped in 289 Yemeni children (136 cases and 153 controls), using the nanofluidic Dynamic Array (Fluidigm 192.24 Dynamic Array). Logistic regression analyses were used to estimate ALL risk, and the strength of association was expressed as odds ratios with 95% confidence intervals. We found that the IKZF1 SNP rs10235796 C allele (p = 0.002), the IKZF1 rs6964969 A>G polymorphism (p = 0.048, GG vs. AA), the CDKN2A rs3731246 G>C polymorphism (p = 0.047, GC+CC vs. GG), and the CDKN2A SNP rs3731246 C allele (p = 0.007) were significantly associated with ALL in Yemenis of Arab-Asian descent. In addition, a borderline association was found between IKZF1 rs4132601 T>G variant and ALL risk. No associations were found between the IKZF1 SNPs (rs11978267; rs7789635), DDC SNPs (rs3779084; rs880028; rs7809758), CDKN2A SNP (rs3731217), the CEBPE SNPs (rs2239633; rs12434881) and LMO1 SNPs (rs442264; rs3794012; rs4237770) with ALL in Yemeni children. The IKZF1 SNPs, rs10235796 and rs6964969, and the CDKN2A SNP rs3731246 (previously unreported) could serve as risk markers for ALL susceptibility in Yemeni children.
SNP array analysis of tyrosine kinase inhibitor-resistant chronic myeloid leukemia identifies heterogeneous secondary genomic alterations

PubMed Central

Müschen, Markus; Kato, Motohiro; Kawamata, Norihiko; Meixel, Antonie; Nowak, Verena; Kim, Han S.; Kang, Sharon; Paquette, Ronald; Chang, Mi-Sook; Thoenissen, Nils H.; Mossner, Max; Hofmann, Wolf-Karsten; Kohlmann, Alexander; Weiss, Tamara; Haferlach, Torsten; Haferlach, Claudia; Koeffler, H. Phillip

2010-01-01

To elucidate whether tyrosine kinase inhibitor (TKI) resistance in chronic myeloid leukemia is associated with characteristic genomic alterations, we analyzed DNA samples from 45 TKI-resistant chronic myeloid leukemia patients with 250K single nucleotide polymorphism arrays. From 20 patients, matched serial samples of pretreatment and TKI resistance time points were available. Eleven of the 45 TKI-resistant patients had mutations of BCR-ABL1, including 2 T315I mutations. Besides known TKI resistance-associated genomic lesions, such as duplication of the BCR-ABL1 gene (n = 8) and trisomy 8 (n = 3), recurrent submicroscopic alterations, including acquired uniparental disomy, were detectable on chromosomes 1, 8, 9, 17, 19, and 22. On chromosome 22, newly acquired and recurrent deletions of the IGLC1 locus were detected in 3 patients, who had previously presented with lymphoid or myeloid blast crisis. This may support a hypothesis of TKI-induced selection of subclones differentiating into immature B-cell progenitors as a mechanism of disease progression and evasion of TKI sensitivity. PMID:19965645
Whole-exome SNP array identifies 15 new susceptibility loci for psoriasis

PubMed Central

Zuo, Xianbo; Sun, Liangdan; Yin, Xianyong; Gao, Jinping; Sheng, Yujun; Xu, Jinhua; Zhang, Jianzhong; He, Chundi; Qiu, Ying; Wen, Guangdong; Tian, Hongqing; Zheng, Xiaodong; Liu, Shengxiu; Wang, Wenjun; Li, Weiran; Cheng, Yuyan; Liu, Longdan; Chang, Yan; Wang, Zaixing; Li, Zenggang; Li, Longnian; Wu, Jianping; Fang, Ling; Shen, Changbing; Zhou, Fusheng; Liang, Bo; Chen, Gang; Li, Hui; Cui, Yong; Xu, Aie; Yang, Xueqin; Hao, Fei; Xu, Limin; Fan, Xing; Li, Yuzhen; Wu, Rina; Wang, Xiuli; Liu, Xiaoming; Zheng, Min; Song, Shunpeng; Ji, Bihua; Fang, Hong; Yu, Jianbin; Sun, Yongxin; Hui, Yan; Zhang, Furen; Yang, Rongya; Yang, Sen; Zhang, Xuejun

2015-01-01

Genome-wide association studies (GWASs) have reproducibly associated ∼40 susceptibility loci with psoriasis. However, the missing heritability is evident and the contributions of coding variants have not yet been systematically evaluated. Here, we present a large-scale whole-exome array analysis for psoriasis consisting of 42,760 individuals. We discover 16 SNPs within 15 new genes/loci associated with psoriasis, including C1orf141, ZNF683, TMC6, AIM2, IL1RL1, CASR, SON, ZFYVE16, MTHFR, CCDC129, ZNF143, AP5B1, SYNE2, IFNGR2 and 3q26.2-q27 (P<5.00 × 10−08). In addition, we also replicate four known susceptibility loci TNIP1, NFKBIA, IL12B and LCE3D–LCE3E. These susceptibility variants identified in the current study collectively account for 1.9% of the psoriasis heritability. The variant within AIM2 is predicted to impact protein structure. Our findings increase the number of genetic risk factors for psoriasis and highlight new and plausible biological pathways in psoriasis. PMID:25854761
qpure: A Tool to Estimate Tumor Cellularity from Genome-Wide Single-Nucleotide Polymorphism Profiles

PubMed Central

Song, Sarah; Nones, Katia; Miller, David; Harliwong, Ivon; Kassahn, Karin S.; Pinese, Mark; Pajic, Marina; Gill, Anthony J.; Johns, Amber L.; Anderson, Matthew; Holmes, Oliver; Leonard, Conrad; Taylor, Darrin; Wood, Scott; Xu, Qinying; Newell, Felicity; Cowley, Mark J.; Wu, Jianmin; Wilson, Peter; Fink, Lynn; Biankin, Andrew V.; Waddell, Nic; Grimmond, Sean M.; Pearson, John V.

2012-01-01

Tumour cellularity, the relative proportion of tumour and normal cells in a sample, affects the sensitivity of mutation detection, copy number analysis, cancer gene expression and methylation profiling. Tumour cellularity is traditionally estimated by pathological review of sectioned specimens; however this method is both subjective and prone to error due to heterogeneity within lesions and cellularity differences between the sample viewed during pathological review and tissue used for research purposes. In this paper we describe a statistical model to estimate tumour cellularity from SNP array profiles of paired tumour and normal samples using shifts in SNP allele frequency at regions of loss of heterozygosity (LOH) in the tumour. We also provide qpure, a software implementation of the method. Our experiments showed that there is a medium correlation 0.42 (-value = 0.0001) between tumor cellularity estimated by qpure and pathology review. Interestingly there is a high correlation 0.87 (-value 2.2e-16) between cellularity estimates by qpure and deep Ion Torrent sequencing of known somatic KRAS mutations; and a weaker correlation 0.32 (-value = 0.004) between IonTorrent sequencing and pathology review. This suggests that qpure may be a more accurate predictor of tumour cellularity than pathology review. qpure can be downloaded from https://sourceforge.net/projects/qpure/. PMID:23049875
ENU Mutagenesis in Mice Identifies Candidate Genes For Hypogonadism

PubMed Central

Weiss, Jeffrey; Hurley, Lisa A.; Harris, Rebecca M.; Finlayson, Courtney; Tong, Minghan; Fisher, Lisa A.; Moran, Jennifer L.; Beier, David R.; Mason, Christopher; Jameson, J. Larry

2012-01-01

Genome-wide mutagenesis was performed in mice to identify candidate genes for male infertility, for which the predominant causes remain idiopathic. Mice were mutagenized using N-ethyl-N-nitrosourea (ENU), bred, and screened for phenotypes associated with the male urogenital system. Fifteen heritable lines were isolated and chromosomal loci were assigned using low density genome-wide SNP arrays. Ten of the fifteen lines were pursued further using higher resolution SNP analysis to narrow the candidate gene regions. Exon sequencing of candidate genes identified mutations in mice with cystic kidneys (Bicc1), cryptorchidism (Rxfp2), restricted germ cell deficiency (Plk4), and severe germ cell deficiency (Prdm9). In two other lines with severe hypogonadism candidate sequencing failed to identify mutations, suggesting defects in genes with previously undocumented roles in gonadal function. These genomic intervals were sequenced in their entirety and a candidate mutation was identified in SnrpE in one of the two lines. The line harboring the SnrpE variant retains substantial spermatogenesis despite small testis size, an unusual phenotype. In addition to the reproductive defects, heritable phenotypes were observed in mice with ataxia (Myo5a), tremors (Pmp22), growth retardation (unknown gene), and hydrocephalus (unknown gene). These results demonstrate that the ENU screen is an effective tool for identifying potential causes of male infertility. PMID:22258617
SNP-Based Typing: A Useful Tool to Study Bordetella pertussis Populations

PubMed Central

van der Heide, Han G. J.; Heuvelman, Kees J.; Kallonen, Teemu; He, Qiushui; Mertsola, Jussi; Advani, Abdolreza; Hallander, Hans O.; Janssens, Koen; Hermans, Peter W.; Mooi, Frits R.

2011-01-01

To monitor changes in Bordetella pertussis populations, mainly two typing methods are used; Pulsed-Field Gel Electrophoresis (PFGE) and Multiple-Locus Variable-Number Tandem Repeat Analysis (MLVA). In this study, a single nucleotide polymorphism (SNP) typing method, based on 87 SNPs, was developed and compared with PFGE and MLVA. The discriminatory indices of SNP typing, PFGE and MLVA were found to be 0.85, 0.95 and 0.83, respectively. Phylogenetic analysis, using SNP typing as Gold Standard, revealed false homoplasies in the PFGE and MLVA trees. Further, in contrast to the SNP-based tree, the PFGE- and MLVA-based trees did not reveal a positive correlation between root-to-tip distance and the isolation year of strains. Thus PFGE and MLVA do not allow an estimation of the relative age of the selected strains. In conclusion, SNP typing was found to be phylogenetically more informative than PFGE and more discriminative than MLVA. Further, in contrast to PFGE, it is readily standardized allowing interlaboratory comparisons. We applied SNP typing to study strains with a novel allele for the pertussis toxin promoter, ptxP3, which have a worldwide distribution and which have replaced the resident ptxP1 strains in the last 20 years. Previously, we showed that ptxP3 strains showed increased pertussis toxin expression and that their emergence was associated with increased notification in the Netherlands. SNP typing showed that the ptxP3 strains isolated in the Americas, Asia, Australia and Europe formed a monophyletic branch which recently diverged from ptxP1 strains. Two predominant ptxP3 SNP types were identified which spread worldwide. The widespread use of SNP typing will enhance our understanding of the evolution and global epidemiology of B. pertussis. PMID:21647370
SNP genotypes of Mycobacterium leprae isolates in Thailand and their combination with rpoT and TTC genotyping for analysis of leprosy distribution and transmission.

PubMed

Phetsuksiri, Benjawan; Srisungngam, Sopa; Rudeeaneksin, Janisara; Bunchoo, Supranee; Lukebua, Atchariya; Wongtrungkapun, Ruch; Paitoon, Soontara; Sakamuri, Rama Murthy; Brennan, Patrick J; Vissa, Varalakshmi

2012-01-01

Based on the discovery of three single nucleotide polymorphisms (SNPs) in Mycobacterium leprae, it has been previously reported that there are four major SNP types associated with different geographic regions around the world. Another typing system for global differentiation of M. leprae is the analysis of the variable number of short tandem repeats within the rpoT gene. To expand the analysis of geographic distribution of M. leprae, classified by SNP and rpoT gene polymorphisms, we studied 85 clinical isolates from Thai patients and compared the findings with those reported from Asian isolates. SNP genotyping by PCR amplification and sequencing revealed that all strains like those in Myanmar were SNP type 1 and 3, with the former being predominant, while in Japan, Korea, and Indonesia, the SNP type 3 was found to be more frequent. The pattern of M. leprae distribution in Thailand and Myanmar is quite similar, except that SNP type 2 was not found in Thailand. In addition, the 3-copy hexamer genotype in the rpoT gene is shared among the isolates from these two neighboring countries. On the basis of these two markers, we postulate that M. leprae in leprosy patients from Myanmar and Thailand has a common historical origin. Further differentiation among Thai isolates was possible by assessing copy numbers of the TTC sequence, a more polymorphic microsatellite locus.
SNP-based association analysis for seedling traits in durum wheat (Triticum turgidum L. durum (Desf.)).

PubMed

Sabiel, Salih A I; Huang, Sisi; Hu, Xin; Ren, Xifeng; Fu, Chunjie; Peng, Junhua; Sun, Dongfa

2017-03-01

In the present study, 150 accessions of worldwide originated durum wheat germplasm ( Triticum turgidum spp. durum ) were observed for major seedling traits and their growth. The accessions were evaluated for major seedling traits under controlled conditions of hydroponics at the 13 th , 20 th , 27 th and 34 th day-after germination. Biomass traits were measured at the 34 th day-after germination. Correlation analysis was conducted among the seedling traits and three field traits at maturity, plant height, grain weight and 1000-grain weight observed in four consecutive years. Associations of the measured seedling traits and SNP markers were analyzed based on the mixed linear model (MLM). The results indicated that highly significant genetic variation and robust heritability were found for the seedling and field mature traits. In total, 259 significant associations were detected for all the traits and four growth stages. The phenotypic variation explained (R2) by a single SNP marker is higher than 10% for most (84%) of the significant SNP markers. Forty-six SNP markers associated with multiple traits, indicating non-neglectable pleiotropy in seedling stage. The associated SNP markers could be helpful for genetic analysis of seedling traits, and marker-assisted breeding of new wheat varieties with strong seedling vigor.
CMPK1 and RBP3 are associated with corneal curvature in Asian populations.

PubMed

Chen, Peng; Miyake, Masahiro; Fan, Qiao; Liao, Jiemin; Yamashiro, Kenji; Ikram, Mohammad K; Chew, Merywn; Vithana, Eranga N; Khor, Chiea-Chuen; Aung, Tin; Tai, E-Shyong; Wong, Tien-Yin; Teo, Yik-Ying; Yoshimura, Nagahisa; Saw, Seang-Mei; Cheng, Ching-Yu

2014-11-15

Corneal curvature (CC) measures the steepness of the cornea and is an important parameter for clinically diseases such as astigmatism and myopia. Despite the high heritability of CC, only two associated genes have been discovered to date. We performed a three-stage genome-wide association study meta-analysis in 12 660 Asian individuals. Our Stage 1 was done in multiethnic cohorts comprising 7440 individuals, followed by a Stage 2 replication in 2473 Chinese and Stage 3 in 2747 Japanese. The SNP array genotype data were imputed up to the 1000 Genomes Project Phase 1 cosmopolitan panel. The SNP association with the radii of CC was investigated in the linear regression model with the adjustment of age, gender and principal components. In addition to the known genes, MTOR (also known as FRAP1) and PDGFRA, we discovered two novel genes associated with CC: CMPK1 (rs17103186, P = 3.3 × 10(-12)) and RBP3 (rs11204213 [Val884Met], P = 1.1 × 10(-13)). The missense RBP3 SNP, rs11204213, was also associated with axial length (AL) (P = 4.2 × 10(-6)) and had larger effects on both CC and AL compared with other SNPs. The index SNPs at the four indicated loci explained 1.9% of CC variance across the Stages 1 and 2 cohorts, while 33.8% of CC variance was explained by the genome-wide imputation data. We identified two novel genes influencing CC, which are related to either corneal shape or eye size. This study provides additional insights into genetic architecture of corneal shape. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Identification of a genetic variant associated with abdominal aortic aneurysms on chromosome 3p12.3 by genome wide association.

PubMed

Elmore, James R; Obmann, Melissa A; Kuivaniemi, Helena; Tromp, Gerard; Gerhard, Glenn S; Franklin, David P; Boddy, Amy M; Carey, David J

2009-06-01

The goal of this project was to identify genetic variants associated with abdominal aortic aneurysms (AAAs). A genome wide association study was carried out using pooled DNA samples from 123 AAA cases and 112 controls matched for age, gender, and smoking history using Affymetrix 500K single nucleotide polymorphism (SNP) arrays (Affymetrix, Inc, Santa Clara, Calif). The difference in mean allele frequency between cases and controls was calculated for each SNP and used to identify candidate genomic regions. Association of candidate SNPs with AAA was confirmed by individual TaqMan genotype assays in a total of 2096 cases and controls that included an independent replication sample set. A genome wide association study of AAA cases and controls identified a candidate AAA-associated haplotype on chromosome 3p12.3. By individual genotype analysis, four SNPs in this region were significantly associated with AAA in cases and controls from the original study population. One SNP in this region (rs7635818) was genotyped in a total of 502 cases and 736 controls from the original study population (P = .017) and 448 cases and 410 controls from an independent replication sample (P = .013; combined P value = .0028; combined odds ratio [OR] = 1.33). An even stronger association with AAA was observed in a subset of smokers (391 cases, 241 controls, P = .00041, OR = 1.80), which represent the highest risk group for AAA. The AAA-associated haplotype is located approximately 200 kbp upstream of the CNTN3 gene transcription start site. This study identifies a region on chromosome 3 that is significantly associated with AAA in 2 distinct study populations.
Family-Based Benchmarking of Copy Number Variation Detection Software.

PubMed

Nutsua, Marcel Elie; Fischer, Annegret; Nebel, Almut; Hofmann, Sylvia; Schreiber, Stefan; Krawczak, Michael; Nothnagel, Michael

2015-01-01

The analysis of structural variants, in particular of copy-number variations (CNVs), has proven valuable in unraveling the genetic basis of human diseases. Hence, a large number of algorithms have been developed for the detection of CNVs in SNP array signal intensity data. Using the European and African HapMap trio data, we undertook a comparative evaluation of six commonly used CNV detection software tools, namely Affymetrix Power Tools (APT), QuantiSNP, PennCNV, GLAD, R-gada and VEGA, and assessed their level of pair-wise prediction concordance. The tool-specific CNV prediction accuracy was assessed in silico by way of intra-familial validation. Software tools differed greatly in terms of the number and length of the CNVs predicted as well as the number of markers included in a CNV. All software tools predicted substantially more deletions than duplications. Intra-familial validation revealed consistently low levels of prediction accuracy as measured by the proportion of validated CNVs (34-60%). Moreover, up to 20% of apparent family-based validations were found to be due to chance alone. Software using Hidden Markov models (HMM) showed a trend to predict fewer CNVs than segmentation-based algorithms albeit with greater validity. PennCNV yielded the highest prediction accuracy (60.9%). Finally, the pairwise concordance of CNV prediction was found to vary widely with the software tools involved. We recommend HMM-based software, in particular PennCNV, rather than segmentation-based algorithms when validity is the primary concern of CNV detection. QuantiSNP may be used as an additional tool to detect sets of CNVs not detectable by the other tools. Our study also reemphasizes the need for laboratory-based validation, such as qPCR, of CNVs predicted in silico.
African crocus (Curculigo pilosa) and wonderful kola (Buchholzia coriacea) seeds modulate critical enzymes relevant to erectile dysfunction and oxidative stress.

PubMed

Adefegha, Stephen A; Oyeleye, Sunday I; Oboh, Ganiyu

2018-05-23

Background The seeds of African crocus (AC) (Curculigo pilosa) and wonderful kola (WK) (Buchholzia coriacea) are commonly used in folklore medicine in managing erectile dysfunction (ED) without the full understanding of the possible mechanism of actions. This study investigated and compared the effects of aqueous extracts from the seeds of AC and WK on arginase and acetylcholinesterase (AChE) activities and some pro-oxidant [FeSO4 and sodium nitroprusside (SNP)]-induced lipid peroxidation in rat penile homogenate in vitro. Method Aqueous extracts of AC and WK were prepared, and their effects on arginase and AChE activities as well as FeSO4- and SNP-induced lipid peroxidation in rat penile homogenate were assessed. Furthermore, phenolic constituents of the extract were determined using high-performance liquid chromatography coupled with diode-array detector (HPLC-DAD). Results Both extracts exhibited concentration-dependent inhibition on arginase (AC, IC50=0.05 mg/mL; WK, IC50=0.22 mg/mL) and AChE (AC, IC50=0.68 mg/mL; WK, IC50=0.28 mg/mL) activities. The extracts also inhibited FeSO4- and SNP-induced lipid peroxidation in rat penile homogenate. HPLC-DAD analysis revealed the presence of phenolic acids (gallic, caffeic, ellagic and coumaric acids) and flavonoids (catechin, quercetin and apigenin) in AC and WK. AC had higher arginase inhibitory and antioxidative activities but lower AChE inhibitory properties when compared with WK. Conclusions These effects could explain the possible mechanistic actions of the seeds in the management/treatment of ED and could be as a result of individual and/or synergistic effect of the constituent phenolic compounds of the seeds.
Common variants at the promoter region of the APOM confer a risk of rheumatoid arthritis

PubMed Central

Hu, Hae-Jin; Jin, Eun-Heui; Yim, Seon-Hee; Yang, So-Young; Jung, Seung-Hyun; Shin, Seung-Hun; Kim, Wan-Uk; Shim, Seung-Cheol; Kim, Tai-Gyu

2011-01-01

Although the genetic component in the etiology of rheumatoid arthritis (RA) has been consistently suggested, many novel genetic loci remain to uncover. To identify RA risk loci, we performed a genome-wide association study (GWAS) with 100 RA cases and 600 controls using Affymetrix SNP array 5.0. The candidate risk locus (APOM gene) was re-sequenced to discover novel promoter and coding variants in a group of the subjects. Replication was performed with the independent case-control set comprising of 578 RAs and 711 controls. Through GWAS, we identified a novel SNP associated with RA at the APOM gene in the MHC class III region on 6p21.33 (rs805297, odds ratio (OR) = 2.28, P = 5.20 × 10-7). Three more polymorphisms were identified at the promoter region of the APOM by the re-sequencing. For the replication, we genotyped the four SNP loci in the independent case-control set. The association of rs805297 identified by GWAS was successfully replicated (OR = 1.40, P = 6.65 × 10-5). The association became more significant in the combined analysis of discovery and replication sets (OR = 1.56, P = 2.73 ± 10-10). The individuals with the rs805297 risk allele (A) at the promoter region showed a significantly lower level of APOM expression compared with those with the protective allele (C) homozygote. In the logistic regressions by the phenotype status, the homozygote risk genotype (A/A) consistently showed higher ORs than the heterozygote one (A/C) for the phenotype-positive RAs. These results indicate that APOM promoter polymorphisms are significantly associated with the susceptibility to RA. PMID:21844665
Linkage Analysis in Autoimmune Addison's Disease: NFATC1 as a Potential Novel Susceptibility Locus.

PubMed

Mitchell, Anna L; Bøe Wolff, Anette; MacArthur, Katie; Weaver, Jolanta U; Vaidya, Bijay; Erichsen, Martina M; Darlay, Rebecca; Husebye, Eystein S; Cordell, Heather J; Pearce, Simon H S

2015-01-01

Autoimmune Addison's disease (AAD) is a rare, highly heritable autoimmune endocrinopathy. It is possible that there may be some highly penetrant variants which confer disease susceptibility that have yet to be discovered. DNA samples from 23 multiplex AAD pedigrees from the UK and Norway (50 cases, 67 controls) were genotyped on the Affymetrix SNP 6.0 array. Linkage analysis was performed using Merlin. EMMAX was used to carry out a genome-wide association analysis comparing the familial AAD cases to 2706 UK WTCCC controls. To explore some of the linkage findings further, a replication study was performed by genotyping 64 SNPs in two of the four linked regions (chromosomes 7 and 18), on the Sequenom iPlex platform in three European AAD case-control cohorts (1097 cases, 1117 controls). The data were analysed using a meta-analysis approach. In a parametric analysis, applying a rare dominant model, loci on chromosomes 7, 9 and 18 had LOD scores >2.8. In a non-parametric analysis, a locus corresponding to the HLA region on chromosome 6, known to be associated with AAD, had a LOD score >3.0. In the genome-wide association analysis, a SNP cluster on chromosome 2 and a pair of SNPs on chromosome 6 were associated with AAD (P <5x10-7). A meta-analysis of the replication study data demonstrated that three chromosome 18 SNPs were associated with AAD, including a non-synonymous variant in the NFATC1 gene. This linkage study has implicated a number of novel chromosomal regions in the pathogenesis of AAD in multiplex AAD families and adds further support to the role of HLA in AAD. The genome-wide association analysis has also identified a region of interest on chromosome 2. A replication study has demonstrated that the NFATC1 gene is worthy of future investigation, however each of the regions identified require further, systematic analysis.
Sample-to-SNP kit: a reliable, easy and fast tool for the detection of HFE p.H63D and p.C282Y variations associated to hereditary hemochromatosis.

PubMed

Nielsen, Peter B; Petersen, Maja S; Ystaas, Viviana; Andersen, Rolf V; Hansen, Karin M; Blaabjerg, Vibeke; Refstrup, Mette

2012-10-01

Classical hereditary hemochromatosis involves the HFE-gene and diagnostic analysis of the DNA variants HFE p.C282Y (c.845G>A; rs1800562) and HFE p.H63D (c.187C>G; rs1799945). The affected protein alters the iron homeostasis resulting in iron overload in various tissues. The aim of this study was to validate the TaqMan-based Sample-to-SNP protocol for the analysis of the HFE-p.C282Y and p.H63D variants with regard to accuracy, usefulness and reproducibility compared to an existing SNP protocol. The Sample-to-SNP protocol uses an approach where the DNA template is made accessible from a cell lysate followed by TaqMan analysis. Besides the HFE-SNPs other eight SNPs were used as well. These SNPs were: Coagulation factor II-gene F2 c.20210G>A, Coagulation factor V-gene F5 p.R506Q (c.1517G>A; rs121917732), Mitochondria SNP: mt7028 G>A, Mitochondria SNP: mt12308 A>G, Proprotein convertase subtilisin/kexin type 9-gene PCSK9 p.R46L (c.137G>T), Plutathione S-transferase pi 1-gene GSTP1 p.I105V (c313A>G; rs1695), LXR g.-171 A>G, ZNF202 g.-118 G>T. In conclusion the Sample-to-SNP kit proved to be an accurate, reliable, robust, easy to use and rapid TaqMan-based SNP detection protocol, which could be quickly implemented in a routine diagnostic or research facility. Copyright © 2012. Published by Elsevier B.V.
An ultra-high density linkage map and QTL mapping for sex and growth-related traits of common carp (Cyprinus carpio)

PubMed Central

Peng, Wenzhu; Xu, Jian; Zhang, Yan; Feng, Jianxin; Dong, Chuanju; Jiang, Likun; Feng, Jingyan; Chen, Baohua; Gong, Yiwen; Chen, Lin; Xu, Peng

2016-01-01

High density genetic linkage maps are essential for QTL fine mapping, comparative genomics and high quality genome sequence assembly. In this study, we constructed a high-density and high-resolution genetic linkage map with 28,194 SNP markers on 14,146 distinct loci for common carp based on high-throughput genotyping with the carp 250 K single nucleotide polymorphism (SNP) array in a mapping family. The genetic length of the consensus map was 10,595.94 cM with an average locus interval of 0.75 cM and an average marker interval of 0.38 cM. Comparative genomic analysis revealed high level of conserved syntenies between common carp and the closely related model species zebrafish and medaka. The genome scaffolds were anchored to the high-density linkage map, spanning 1,357 Mb of common carp reference genome. QTL mapping and association analysis identified 22 QTLs for growth-related traits and 7 QTLs for sex dimorphism. Candidate genes underlying growth-related traits were identified, including important regulators such as KISS2, IGF1, SMTLB, NPFFR1 and CPE. Candidate genes associated with sex dimorphism were also identified including 3KSR and DMRT2b. The high-density and high-resolution genetic linkage map provides an important tool for QTL fine mapping and positional cloning of economically important traits, and improving common carp genome assembly. PMID:27225429
Allele-specific DNA methylation of disease susceptibility genes in Japanese patients with inflammatory bowel disease.

PubMed

Chiba, Hirofumi; Kakuta, Yoichi; Kinouchi, Yoshitaka; Kawai, Yosuke; Watanabe, Kazuhiro; Nagao, Munenori; Naito, Takeo; Onodera, Motoyuki; Moroi, Rintaro; Kuroha, Masatake; Kanazawa, Yoshitake; Kimura, Tomoya; Shiga, Hisashi; Endo, Katsuya; Negoro, Kenichi; Nagasaki, Masao; Unno, Michiaki; Shimosegawa, Tooru

2018-01-01

Inflammatory bowel disease (IBD) has an unknown etiology; however, accumulating evidence suggests that IBD is a multifactorial disease influenced by a combination of genetic and environmental factors. The influence of genetic variants on DNA methylation in cis and cis effects on expression have been demonstrated. We hypothesized that IBD susceptibility single-nucleotide polymorphisms (SNPs) regulate susceptibility gene expressions in cis by regulating DNA methylation around SNPs. For this, we determined cis-regulated allele-specific DNA methylation (ASM) around IBD susceptibility genes in CD4+ effector/memory T cells (Tem) in lamina propria mononuclear cells (LPMCs) in patients with IBD and examined the association between the ASM SNP genotype and neighboring susceptibility gene expressions. CD4+ effector/memory T cells (Tem) were isolated from LPMCs in 15 Japanese IBD patients (ten Crohn's disease [CD] and five ulcerative colitis [UC] patients). ASM analysis was performed by methylation-sensitive SNP array analysis. We defined ASM as a changing average relative allele score ([Formula: see text]) >0.1 after digestion by methylation-sensitive restriction enzymes. Among SNPs showing [Formula: see text] >0.1, we extracted the probes located on tag-SNPs of 200 IBD susceptibility loci and around IBD susceptibility genes as candidate ASM SNPs. To validate ASM, bisulfite-pyrosequencing was performed. Transcriptome analysis was examined in 11 IBD patients (seven CD and four UC patients). The relation between rs36221701 genotype and neighboring gene expressions were analyzed. We extracted six candidate ASM SNPs around IBD susceptibility genes. The top of [Formula: see text] (0.23) was rs1130368 located on HLA-DQB1. ASM around rs36221701 ([Formula: see text] = 0.14) located near SMAD3 was validated using bisulfite pyrosequencing. The SMAD3 expression was significantly associated with the rs36221701 genotype (p = 0.016). We confirmed the existence of cis-regulated ASM around IBD susceptibility genes and the association between ASM SNP (rs36221701) genotype and SMAD3 expression, a susceptibility gene for IBD. These results give us supporting evidence that DNA methylation mediates genetic effects on disease susceptibility.
Allele-specific DNA methylation of disease susceptibility genes in Japanese patients with inflammatory bowel disease

PubMed Central

Chiba, Hirofumi; Kakuta, Yoichi; Kinouchi, Yoshitaka; Kawai, Yosuke; Watanabe, Kazuhiro; Nagao, Munenori; Naito, Takeo; Onodera, Motoyuki; Moroi, Rintaro; Kuroha, Masatake; Kanazawa, Yoshitake; Kimura, Tomoya; Shiga, Hisashi; Endo, Katsuya; Negoro, Kenichi; Nagasaki, Masao; Unno, Michiaki; Shimosegawa, Tooru

2018-01-01

Background Inflammatory bowel disease (IBD) has an unknown etiology; however, accumulating evidence suggests that IBD is a multifactorial disease influenced by a combination of genetic and environmental factors. The influence of genetic variants on DNA methylation in cis and cis effects on expression have been demonstrated. We hypothesized that IBD susceptibility single-nucleotide polymorphisms (SNPs) regulate susceptibility gene expressions in cis by regulating DNA methylation around SNPs. For this, we determined cis-regulated allele-specific DNA methylation (ASM) around IBD susceptibility genes in CD4+ effector/memory T cells (Tem) in lamina propria mononuclear cells (LPMCs) in patients with IBD and examined the association between the ASM SNP genotype and neighboring susceptibility gene expressions. Methods CD4+ effector/memory T cells (Tem) were isolated from LPMCs in 15 Japanese IBD patients (ten Crohn's disease [CD] and five ulcerative colitis [UC] patients). ASM analysis was performed by methylation-sensitive SNP array analysis. We defined ASM as a changing average relative allele score (ΔRAS¯) >0.1 after digestion by methylation-sensitive restriction enzymes. Among SNPs showing ΔRAS¯ >0.1, we extracted the probes located on tag-SNPs of 200 IBD susceptibility loci and around IBD susceptibility genes as candidate ASM SNPs. To validate ASM, bisulfite-pyrosequencing was performed. Transcriptome analysis was examined in 11 IBD patients (seven CD and four UC patients). The relation between rs36221701 genotype and neighboring gene expressions were analyzed. Results We extracted six candidate ASM SNPs around IBD susceptibility genes. The top of ΔRAS¯ (0.23) was rs1130368 located on HLA-DQB1. ASM around rs36221701 (ΔRAS¯ = 0.14) located near SMAD3 was validated using bisulfite pyrosequencing. The SMAD3 expression was significantly associated with the rs36221701 genotype (p = 0.016). Conclusions We confirmed the existence of cis-regulated ASM around IBD susceptibility genes and the association between ASM SNP (rs36221701) genotype and SMAD3 expression, a susceptibility gene for IBD. These results give us supporting evidence that DNA methylation mediates genetic effects on disease susceptibility. PMID:29547621

SNPhylo: a pipeline to construct a phylogenetic tree from huge SNP data.

PubMed

Lee, Tae-Ho; Guo, Hui; Wang, Xiyin; Kim, Changsoo; Paterson, Andrew H

2014-02-26

Phylogenetic trees are widely used for genetic and evolutionary studies in various organisms. Advanced sequencing technology has dramatically enriched data available for constructing phylogenetic trees based on single nucleotide polymorphisms (SNPs). However, massive SNP data makes it difficult to perform reliable analysis, and there has been no ready-to-use pipeline to generate phylogenetic trees from these data. We developed a new pipeline, SNPhylo, to construct phylogenetic trees based on large SNP datasets. The pipeline may enable users to construct a phylogenetic tree from three representative SNP data file formats. In addition, in order to increase reliability of a tree, the pipeline has steps such as removing low quality data and considering linkage disequilibrium. A maximum likelihood method for the inference of phylogeny is also adopted in generation of a tree in our pipeline. Using SNPhylo, users can easily produce a reliable phylogenetic tree from a large SNP data file. Thus, this pipeline can help a researcher focus more on interpretation of the results of analysis of voluminous data sets, rather than manipulations necessary to accomplish the analysis.
A high-density, multi-parental SNP genetic map on apple validates a new mapping approach for outcrossing species.

PubMed

Di Pierro, Erica A; Gianfranceschi, Luca; Di Guardo, Mario; Koehorst-van Putten, Herma Jj; Kruisselbrink, Johannes W; Longhi, Sara; Troggio, Michela; Bianco, Luca; Muranty, Hélène; Pagliarani, Giulia; Tartarini, Stefano; Letschka, Thomas; Lozano Luis, Lidia; Garkava-Gustavsson, Larisa; Micheletti, Diego; Bink, Marco Cam; Voorrips, Roeland E; Aziz, Ebrahimi; Velasco, Riccardo; Laurens, François; van de Weg, W Eric

2016-01-01

Quantitative trait loci (QTL) mapping approaches rely on the correct ordering of molecular markers along the chromosomes, which can be obtained from genetic linkage maps or a reference genome sequence. For apple ( Malus domestica Borkh), the genome sequence v1 and v2 could not meet this need; therefore, a novel approach was devised to develop a dense genetic linkage map, providing the most reliable marker-loci order for the highest possible number of markers. The approach was based on four strategies: (i) the use of multiple full-sib families, (ii) the reduction of missing information through the use of HaploBlocks and alternative calling procedures for single-nucleotide polymorphism (SNP) markers, (iii) the construction of a single backcross-type data set including all families, and (iv) a two-step map generation procedure based on the sequential inclusion of markers. The map comprises 15 417 SNP markers, clustered in 3 K HaploBlock markers spanning 1 267 cM, with an average distance between adjacent markers of 0.37 cM and a maximum distance of 3.29 cM. Moreover, chromosome 5 was oriented according to its homoeologous chromosome 10. This map was useful to improve the apple genome sequence, design the Axiom Apple 480 K SNP array and perform multifamily-based QTL studies. Its collinearity with the genome sequences v1 and v3 are reported. To our knowledge, this is the shortest published SNP map in apple, while including the largest number of markers, families and individuals. This result validates our methodology, proving its value for the construction of integrated linkage maps for any outbreeding species.
Association between ABCG1 polymorphism rs1893590 and high-density lipoprotein (HDL) in an asymptomatic Brazilian population.

PubMed

Zago, V H S; Scherrer, D Z; Parra, E S; Panzoldo, N B; Alexandre, F; Nakandakare, E R; Quintão, E C R; de Faria, E C

2015-03-01

ATP binding cassette transporter G1 (ABCG1) promotes lipidation of nascent high-density lipoprotein (HDL) particles, acting as an intracellular transporter. SNP rs1893590 (c.-204A > C) of ABCG1 gene has been previously studied and reported as functional over plasma HDL-C and lipoprotein lipase activity. This study aimed to investigate the relationships of SNP rs1893590 with plasma lipids and lipoproteins in a large Brazilian population. Were selected 654 asymptomatic and normolipidemic volunteers from both genders. Clinical and anthropometrical data were taken and blood samples were drawn after 12 h fasting. Plasma lipids and lipoproteins, as well as HDL particle size and volume were determined. Genomic DNA was isolated for SNP rs1893590 detection by TaqMan(®) OpenArray(®) Real-Time PCR Plataform (Applied Biosystems). Mann-Whitney U, Chi square and two-way ANOVA were the used statistical tests. No significant differences were found in the comparison analyses between the allele groups for all studied parameters. Conversely, significant interactions were observed between SNP and age over plasma HDL-C, were volunteers under 60 years with AA genotype had increased HDL-C (p = 0.048). Similar results were observed in the group with body mass index (BMI) < 25 kg/m(2), where volunteers with AA genotype had higher HDL-C levels (p = 0.0034), plus an increased HDL particle size (p = 0.01). These findings indicate that SNP rs1893590 of ABCG1 has a significant impact over HDL-C under asymptomatic clinical conditions in an age and BMI dependent way.
A high-density, multi-parental SNP genetic map on apple validates a new mapping approach for outcrossing species

PubMed Central

Di Pierro, Erica A; Gianfranceschi, Luca; Di Guardo, Mario; Koehorst-van Putten, Herma JJ; Kruisselbrink, Johannes W; Longhi, Sara; Troggio, Michela; Bianco, Luca; Muranty, Hélène; Pagliarani, Giulia; Tartarini, Stefano; Letschka, Thomas; Lozano Luis, Lidia; Garkava-Gustavsson, Larisa; Micheletti, Diego; Bink, Marco CAM; Voorrips, Roeland E; Aziz, Ebrahimi; Velasco, Riccardo; Laurens, François; van de Weg, W Eric

2016-01-01

Quantitative trait loci (QTL) mapping approaches rely on the correct ordering of molecular markers along the chromosomes, which can be obtained from genetic linkage maps or a reference genome sequence. For apple (Malus domestica Borkh), the genome sequence v1 and v2 could not meet this need; therefore, a novel approach was devised to develop a dense genetic linkage map, providing the most reliable marker-loci order for the highest possible number of markers. The approach was based on four strategies: (i) the use of multiple full-sib families, (ii) the reduction of missing information through the use of HaploBlocks and alternative calling procedures for single-nucleotide polymorphism (SNP) markers, (iii) the construction of a single backcross-type data set including all families, and (iv) a two-step map generation procedure based on the sequential inclusion of markers. The map comprises 15 417 SNP markers, clustered in 3 K HaploBlock markers spanning 1 267 cM, with an average distance between adjacent markers of 0.37 cM and a maximum distance of 3.29 cM. Moreover, chromosome 5 was oriented according to its homoeologous chromosome 10. This map was useful to improve the apple genome sequence, design the Axiom Apple 480 K SNP array and perform multifamily-based QTL studies. Its collinearity with the genome sequences v1 and v3 are reported. To our knowledge, this is the shortest published SNP map in apple, while including the largest number of markers, families and individuals. This result validates our methodology, proving its value for the construction of integrated linkage maps for any outbreeding species. PMID:27917289
Comparison of SSR and SNP Markers in Estimation of Genetic Diversity and Population Structure of Indian Rice Varieties

PubMed Central

Singh, Amit Kumar; Kumar, Sundeep; Srinivasan, Kalyani; Tyagi, R. K.; Singh, N. K.; Singh, Rakesh

2013-01-01

Simple sequence repeat (SSR) and Single Nucleotide Polymorphic (SNP), the two most robust markers for identifying rice varieties were compared for assessment of genetic diversity and population structure. Total 375 varieties of rice from various regions of India archived at the Indian National GeneBank, NBPGR, New Delhi, were analyzed using thirty six genetic markers, each of hypervariable SSR (HvSSR) and SNP which were distributed across 12 rice chromosomes. A total of 80 alleles were amplified with the SSR markers with an average of 2.22 alleles per locus whereas, 72 alleles were amplified with SNP markers. Polymorphic information content (PIC) values for HvSSR ranged from 0.04 to 0.5 with an average of 0.25. In the case of SNP markers, PIC values ranged from 0.03 to 0.37 with an average of 0.23. Genetic relatedness among the varieties was studied; utilizing an unrooted tree all the genotypes were grouped into three major clusters with both SSR and SNP markers. Analysis of molecular variance (AMOVA) indicated that maximum diversity was partitioned between and within individual level but not between populations. Principal coordinate analysis (PCoA) with SSR markers showed that genotypes were uniformly distributed across the two axes with 13.33% of cumulative variation whereas, in case of SNP markers varieties were grouped into three broad groups across two axes with 45.20% of cumulative variation. Population structure were tested using K values from 1 to 20, but there was no clear population structure, therefore Ln(PD) derived Δk was plotted against the K to determine the number of populations. In case of SSR maximum Δk was at K=5 whereas, in case of SNP maximum Δk was found at K=15, suggesting that resolution of population was higher with SNP markers, but SSR were more efficient for diversity analysis. PMID:24367635
Processing-Dependent and Clonal Contamination Patterns of Listeria monocytogenes in the Cured Ham Food Chain Revealed by Genetic Analysis

PubMed Central

Morganti, Marina; Scaltriti, Erika; Cozzolino, Paolo; Bolzoni, Luca; Casadei, Gabriele; Pierantoni, Marco; Foni, Emanuela

2015-01-01

The quantitative and qualitative patterns of environmental contamination by Listeria monocytogenes were investigated in the production chain of dry-cured Parma ham. Standard arrays of surfaces were sampled in processing facilities during a single visit per plant in the three compartments of the food chain, i.e., ham production (19 plants) and postproduction, which was divided into deboning (43 plants) and slicing (25 plants) steps. The numbers of sampled surfaces were 384 in ham production, with 25 positive for L. monocytogenes, and 1,084 in postproduction, with 83 positives. Statistical analysis of the prevalence of contaminated surfaces showed that in ham production, contamination was higher at the beginning of processing and declined significantly toward the end, while in postproduction, prevalence rose toward the end of processing. Prevalence was higher in the deboning facilities than in slicing facilities and was dependent on the type of surface (floor/drainage > clothing > equipment). The qualitative pattern of contamination was investigated through an analysis of the survey isolates and a set of isolates derived from routine monitoring, including longitudinal isolations. Pulsed-field gel electrophoresis (PFGE) and whole-genome single-nucleotide polymorphism (SNP) analysis revealed a remarkable clonality of L. monocytogenes within plants, with the detection of 16 plant-specific clones out of 17 establishments with multiple isolates. Repeated detections of clonal isolates >6 months apart were also observed. Six was the maximum number of between-isolate differences in core SNPs observed within these clones. Based on the same six-SNP threshold, three clusters of clonal isolates, shared by six establishments, were also identified. The spread of L. monocytogenes within and between plants, as indicated by its clonal behavior, is a matter of concern for the hygienic management of establishments. PMID:26590278
Novel single nucleotide polymorphisms of the bovine methyltransferase 3b gene and their association with meat quality traits in beef cattle.

PubMed

Liu, X; Guo, X Y; Xu, X Z; Wu, M; Zhang, X; Li, Q; Ma, P P; Zhang, Y; Wang, C Y; Geng, F J; Qin, C H; Liu, L; Shi, W H; Wang, Y C; Yu, Y

2012-08-16

DNA methylation is essential for adipose deposition in mammals. We screened SNPs of the bovine DNA methyltransferase 3b (DNMT3b) gene in Snow Dragon beef, a commercial beef cattle population in China. Nine SNPs were found in the population and three of six novel SNPs were chosen for genotyping and analyzing a possible association with 16 meat quality traits. The frequencies of the alleles and genotypes of the three SNPs in Snow Dragon beef were similar to those in their terminal-paternal breed, Wagyu. Association analysis disclosed that SNP1 was not associated with any of the traits; SNP2 was significantly associated with lean meat color score and chuck short rib score, and SNP3 had a significant effect on dressing percentage and back-fat thickness in the beef population. The individuals with genotype GG for SNP2 had a 25.7% increase in lean meat color score and a 146% increase in chuck short rib score, compared with genotype AA. The cattle with genotype AG for SNP3 had 35.7 and 24% increases in dressing percentage and 28.8 and 29.2% increases in back-fat thickness, compared with genotypes GG and AA, respectively. Genotypic combination analysis revealed significant interactions between SNP1 and SNP2 and between SNP2 and SNP3 for the traits rib-eye area and live weight. We conclude that there is considerable evidence that DNMT3b is a determiner of beef quality traits.
When Whole-Genome Alignments Just Won't Work: kSNP v2 Software for Alignment-Free SNP Discovery and Phylogenetics of Hundreds of Microbial Genomes

PubMed Central

Gardner, Shea N.; Hall, Barry G.

2013-01-01

Effective use of rapid and inexpensive whole genome sequencing for microbes requires fast, memory efficient bioinformatics tools for sequence comparison. The kSNP v2 software finds single nucleotide polymorphisms (SNPs) in whole genome data. kSNP v2 has numerous improvements over kSNP v1 including SNP gene annotation; better scaling for draft genomes available as assembled contigs or raw, unassembled reads; a tool to identify the optimal value of k; distribution of packages of executables for Linux and Mac OS X for ease of installation and user-friendly use; and a detailed User Guide. SNP discovery is based on k-mer analysis, and requires no multiple sequence alignment or the selection of a single reference genome. Most target sets with hundreds of genomes complete in minutes to hours. SNP phylogenies are built by maximum likelihood, parsimony, and distance, based on all SNPs, only core SNPs, or SNPs present in some intermediate user-specified fraction of targets. The SNP-based trees that result are consistent with known taxonomy. kSNP v2 can handle many gigabases of sequence in a single run, and if one or more annotated genomes are included in the target set, SNPs are annotated with protein coding and other information (UTRs, etc.) from Genbank file(s). We demonstrate application of kSNP v2 on sets of viral and bacterial genomes, and discuss in detail analysis of a set of 68 finished E. coli and Shigella genomes and a set of the same genomes to which have been added 47 assemblies and four “raw read” genomes of H104:H4 strains from the recent European E. coli outbreak that resulted in both bloody diarrhea and hemolytic uremic syndrome (HUS), and caused at least 50 deaths. PMID:24349125
When whole-genome alignments just won't work: kSNP v2 software for alignment-free SNP discovery and phylogenetics of hundreds of microbial genomes.

PubMed

Gardner, Shea N; Hall, Barry G

2013-01-01

Effective use of rapid and inexpensive whole genome sequencing for microbes requires fast, memory efficient bioinformatics tools for sequence comparison. The kSNP v2 software finds single nucleotide polymorphisms (SNPs) in whole genome data. kSNP v2 has numerous improvements over kSNP v1 including SNP gene annotation; better scaling for draft genomes available as assembled contigs or raw, unassembled reads; a tool to identify the optimal value of k; distribution of packages of executables for Linux and Mac OS X for ease of installation and user-friendly use; and a detailed User Guide. SNP discovery is based on k-mer analysis, and requires no multiple sequence alignment or the selection of a single reference genome. Most target sets with hundreds of genomes complete in minutes to hours. SNP phylogenies are built by maximum likelihood, parsimony, and distance, based on all SNPs, only core SNPs, or SNPs present in some intermediate user-specified fraction of targets. The SNP-based trees that result are consistent with known taxonomy. kSNP v2 can handle many gigabases of sequence in a single run, and if one or more annotated genomes are included in the target set, SNPs are annotated with protein coding and other information (UTRs, etc.) from Genbank file(s). We demonstrate application of kSNP v2 on sets of viral and bacterial genomes, and discuss in detail analysis of a set of 68 finished E. coli and Shigella genomes and a set of the same genomes to which have been added 47 assemblies and four "raw read" genomes of H104:H4 strains from the recent European E. coli outbreak that resulted in both bloody diarrhea and hemolytic uremic syndrome (HUS), and caused at least 50 deaths.
Is a gene important for bone resorption a candidate for obesity? An association and linkage study on the RANK (receptor activator of nuclear factor-kappaB) gene in a large Caucasian sample.

PubMed

Zhao, Lan-Juan; Guo, Yan-Fang; Xiong, Dong-Hai; Xiao, Peng; Recker, Robert R; Deng, Hong-Wen

2006-11-01

In light of findings that osteoporosis and obesity may share some common genetic determination and previous reports that RANK (receptor activator of nuclear factor-kappaB) is expressed in skeletal muscles which are important for energy metabolism, we hypothesize that RANK, a gene essential for osteoclastogenesis, is also important for obesity. In order to test the hypothesis with solid data we first performed a linkage analysis around the RANK gene in 4,102 Caucasian subjects from 434 pedigrees, then we genotyped 19 SNPs in or around the RANK gene. A family-based association test (FBAT) was performed with both a quantitative measure of obesity [fat mass, lean mass, body mass index (BMI), and percentage fat mass (PFM)] and a dichotomously defined obesity phenotype-OB (OB if BMI > or = 30 kg/m(2)). In the linkage analysis, an empirical P = 0.004 was achieved at the location of the RANK gene for BMI. Family-based association analysis revealed significant associations of eight SNPs with at least one obesity-related phenotype (P < 0.05). Evidence of association was obtained at SNP10 (P = 0.002) and SNP16 (P = 0.001) with OB; SNP1 with fat mass (P = 0.003); SNP1 (P = 0.003) and SNP7 (P = 0.003) with lean mass; SNP1 (P = 0.002) and SNP7 (P = 0.002) with BMI; SNP1 (P = 0.003), SNP4 (P = 0.007), and SNP7 (P = 0.002) with PFM. In order to deal with the complex multiple testing issues, we performed FBAT multi-marker test (FBAT-MM) to evaluate the association between all the 18 SNPs and each obesity phenotype. The P value is 0.126 for OB, 0.033 for fat mass, 0.021 for lean mass, 0.016 for BMI, and 0.006 for PFM. The haplotype data analyses provide further association evidence. In conclusion, for the first time, our results suggest that RANK is a novel candidate for determination of obesity.
Combination of RNAseq and SNP nanofluidic array reveals the center of genetic diversity of cacao pathogen Moniliophthora roreri in the upper Magdalena Valley of Colombia and its clonality

USDA-ARS?s Scientific Manuscript database

Moniliophthora roreri is the fungal pathogen that causes frosty pod rot (FPR) disease of Theobroma cacao L., the source of chocolate. FPR occurs in most of the cacao producing countries in the Western Hemisphere, causing yield losses up to 80%. Genetic diversity within the FPR pathogen population ma...
Increased genomic prediction accuracy in wheat breeding using a large Australian panel.

PubMed

Norman, Adam; Taylor, Julian; Tanaka, Emi; Telfer, Paul; Edwards, James; Martinant, Jean-Pierre; Kuchel, Haydn

2017-12-01

Genomic prediction accuracy within a large panel was found to be substantially higher than that previously observed in smaller populations, and also higher than QTL-based prediction. In recent years, genomic selection for wheat breeding has been widely studied, but this has typically been restricted to population sizes under 1000 individuals. To assess its efficacy in germplasm representative of commercial breeding programmes, we used a panel of 10,375 Australian wheat breeding lines to investigate the accuracy of genomic prediction for grain yield, physical grain quality and other physiological traits. To achieve this, the complete panel was phenotyped in a dedicated field trial and genotyped using a custom Axiom TM Affymetrix SNP array. A high-quality consensus map was also constructed, allowing the linkage disequilibrium present in the germplasm to be investigated. Using the complete SNP array, genomic prediction accuracies were found to be substantially higher than those previously observed in smaller populations and also more accurate compared to prediction approaches using a finite number of selected quantitative trait loci. Multi-trait genetic correlations were also assessed at an additive and residual genetic level, identifying a negative genetic correlation between grain yield and protein as well as a positive genetic correlation between grain size and test weight.
Population-genetic properties of differentiated copy number variations in cattle.

PubMed

Xu, Lingyang; Hou, Yali; Bickhart, Derek M; Zhou, Yang; Hay, El Hamidi Abdel; Song, Jiuzhou; Sonstegard, Tad S; Van Tassell, Curtis P; Liu, George E

2016-03-23

While single nucleotide polymorphism (SNP) is typically the variant of choice for population genetics, copy number variation (CNV) which comprises insertion, deletion and duplication of genomic sequence, is an informative type of genetic variation. CNVs have been shown to be both common in mammals and important for understanding the relationship between genotype and phenotype. However, CNV differentiation, selection and its population genetic properties are not well understood across diverse populations. We performed a population genetics survey based on CNVs derived from the BovineHD SNP array data of eight distinct cattle breeds. We generated high resolution results that show geographical patterns of variations and genome-wide admixture proportions within and among breeds. Similar to the previous SNP-based studies, our CNV-based results displayed a strong correlation of population structure and geographical location. By conducting three pairwise comparisons among European taurine, African taurine, and indicine groups, we further identified 78 unique CNV regions that were highly differentiated, some of which might be due to selection. These CNV regions overlapped with genes involved in traits related to parasite resistance, immunity response, body size, fertility, and milk production. Our results characterize CNV diversity among cattle populations and provide a list of lineage-differentiated CNVs.
A multi-SNP association test for complex diseases incorporating an optimal P-value threshold algorithm in nuclear families.

PubMed

Wang, Yi-Ting; Sung, Pei-Yuan; Lin, Peng-Lin; Yu, Ya-Wen; Chung, Ren-Hua

2015-05-15

Genome-wide association studies (GWAS) have become a common approach to identifying single nucleotide polymorphisms (SNPs) associated with complex diseases. As complex diseases are caused by the joint effects of multiple genes, while the effect of individual gene or SNP is modest, a method considering the joint effects of multiple SNPs can be more powerful than testing individual SNPs. The multi-SNP analysis aims to test association based on a SNP set, usually defined based on biological knowledge such as gene or pathway, which may contain only a portion of SNPs with effects on the disease. Therefore, a challenge for the multi-SNP analysis is how to effectively select a subset of SNPs with promising association signals from the SNP set. We developed the Optimal P-value Threshold Pedigree Disequilibrium Test (OPTPDT). The OPTPDT uses general nuclear families. A variable p-value threshold algorithm is used to determine an optimal p-value threshold for selecting a subset of SNPs. A permutation procedure is used to assess the significance of the test. We used simulations to verify that the OPTPDT has correct type I error rates. Our power studies showed that the OPTPDT can be more powerful than the set-based test in PLINK, the multi-SNP FBAT test, and the p-value based test GATES. We applied the OPTPDT to a family-based autism GWAS dataset for gene-based association analysis and identified MACROD2-AS1 with genome-wide significance (p-value=2.5×10(-6)). Our simulation results suggested that the OPTPDT is a valid and powerful test. The OPTPDT will be helpful for gene-based or pathway association analysis. The method is ideal for the secondary analysis of existing GWAS datasets, which may identify a set of SNPs with joint effects on the disease.
Exome Array Analysis of Nuclear Lens Opacity.

PubMed

Loomis, Stephanie J; Klein, Alison P; Lee, Kristine E; Chen, Fei; Bomotti, Samantha; Truitt, Barbara; Iyengar, Sudha K; Klein, Ronald; Klein, Barbara E K; Duggal, Priya

2018-06-01

Nuclear cataract is the most common subtype of age-related cataract, the leading cause of blindness worldwide. It results from advanced nuclear sclerosis, or opacity in the center of the optic lens, and is affected by both genetic and environmental risk factors, including smoking. We sought to understand the genetic factors associated with nuclear sclerosis through interrogation of rare and low frequency coding variants using exome array data. We analyzed Illumina Human Exome Array data for 1,488 participants of European ancestry in the Beaver Dam Eye Study who were without cataract surgery for association with nuclear sclerosis grade, controlling for age and sex. We performed single-variant regression analysis for 32,138 variants with minor allele frequency (MAF) ≥0.003. In addition, gene-based analysis of 11,844 genes containing at least two variants with MAF < 0.05 was performed using a gene-based unified burden and non-burden sequence kernel association test (SKAT-O). Additionally, both single-variant and gene-based analyses were analyzed stratified by smoking status. No single-variant test was statistically significant after Bonferroni correction (p < 1.6 × 10 -6 ; top single nucleotide polymorphism (SNP): rs144458991, p = 2.83 × 10 -5 ). Gene-based tests were suggestively associated with the gene RNF149 overall (p = 8.29 × 10 -6 ) and among never smokers (N = 790, p = 2.67 × 10 -6 ). This study did not find a significant genetic association with nuclear sclerosis, the possible association with the RNF149 gene highlights a potential candidate gene for future studies that aim to understand the genetic architecture of nuclear sclerosis.
Synthesis of different-sized silver nanoparticles by simply varying reaction conditions with leaf extracts of Bauhinia variegata L.

PubMed

Kumar, V; Yadav, S K

2012-03-01

Green synthesis of nanoparticles is one of the crucial requirements in today's climate change scenario all over the world. In view of this, leaf extract (LE) of Bauhinia variegata L. possessing strong antidiabetic and antibacterial properties has been used to synthesise silver nanoparticles (SNP) in a controlled manner. Various-sized SNP (20-120 nm) were synthesised by varying incubation temperature, silver nitrate and LE concentrations. The rate of SNP synthesis and their size increased with increase in AgNO(3) concentration up to 4 mM. With increase in LE concentration, size and aggregation of SNP was increased. The size and aggregation of SNP were also increased at temperatures above and below 40°C. This has suggested that size and dispersion of SNP can be controlled by varying reaction components and conditions. Polarity-based fractionation of B. variegata LE has suggested that only water-soluble fraction is responsible for SNP synthesis. Fourier transform infrared spectroscopy analysis revealed the attachment of polyphenolic and carbohydrate moieties to SNP. The synthesised SNPs were found stable in double distilled water, BSA and phosphate buffer (pH 7.4). On the contrary, incubation of SNP with NaCl induced aggregation. This suggests the safe use of SNP for various in vivo applications.
snpGeneSets: An R Package for Genome-Wide Study Annotation

PubMed Central

Mei, Hao; Li, Lianna; Jiang, Fan; Simino, Jeannette; Griswold, Michael; Mosley, Thomas; Liu, Shijian

2016-01-01

Genome-wide studies (GWS) of SNP associations and differential gene expressions have generated abundant results; next-generation sequencing technology has further boosted the number of variants and genes identified. Effective interpretation requires massive annotation and downstream analysis of these genome-wide results, a computationally challenging task. We developed the snpGeneSets package to simplify annotation and analysis of GWS results. Our package integrates local copies of knowledge bases for SNPs, genes, and gene sets, and implements wrapper functions in the R language to enable transparent access to low-level databases for efficient annotation of large genomic data. The package contains functions that execute three types of annotations: (1) genomic mapping annotation for SNPs and genes and functional annotation for gene sets; (2) bidirectional mapping between SNPs and genes, and genes and gene sets; and (3) calculation of gene effect measures from SNP associations and performance of gene set enrichment analyses to identify functional pathways. We applied snpGeneSets to type 2 diabetes (T2D) results from the NHGRI genome-wide association study (GWAS) catalog, a Finnish GWAS, and a genome-wide expression study (GWES). These studies demonstrate the usefulness of snpGeneSets for annotating and performing enrichment analysis of GWS results. The package is open-source, free, and can be downloaded at: https://www.umc.edu/biostats_software/. PMID:27807048
Accurate determination of genetic identity for a single cacao bean, using molecular markers with a nanofluidic system, ensures cocoa authentication.

PubMed

Fang, Wanping; Meinhardt, Lyndel W; Mischke, Sue; Bellato, Cláudia M; Motilal, Lambert; Zhang, Dapeng

2014-01-15

Cacao (Theobroma cacao L.), the source of cocoa, is an economically important tropical crop. One problem with the premium cacao market is contamination with off-types adulterating raw premium material. Accurate determination of the genetic identity of single cacao beans is essential for ensuring cocoa authentication. Using nanofluidic single nucleotide polymorphism (SNP) genotyping with 48 SNP markers, we generated SNP fingerprints for small quantities of DNA extracted from the seed coat of single cacao beans. On the basis of the SNP profiles, we identified an assumed adulterant variety, which was unambiguously distinguished from the authentic beans by multilocus matching. Assignment tests based on both Bayesian clustering analysis and allele frequency clearly separated all 30 authentic samples from the non-authentic samples. Distance-based principle coordinate analysis further supported these results. The nanofluidic SNP protocol, together with forensic statistical tools, is sufficiently robust to establish authentication and to verify gourmet cacao varieties. This method shows significant potential for practical application.
Association analysis of the vitamin D receptor gene, the type I collagen gene COL1A1, and the estrogen receptor gene in idiopathic osteoarthritis.

PubMed

Loughlin, J; Sinsheimer, J S; Mustafa, Z; Carr, A J; Clipsham, K; Bloomfield, V A; Chitnavis, J; Bailey, A; Sykes, B; Chapman, K

2000-03-01

Evidence has accumulated supporting a role for genes in the etiology of osteoarthritis (OA). Several candidates have been targeted as potential susceptibility loci including genes that are involved in the regulation of bone density. Genetic association analysis has suggested a role for the vitamin D receptor gene (VDR) and the estrogen receptor gene (ER) in susceptibility. Such findings must be tested in additional independent cohorts. We tested for association of these 2 genes, plus a third gene implicated in bone density, COL1A1, with idiopathic OA. A case-control cohort of 371 affected probands and 369 unaffected spouses was used. Association was tested using 4 intragenic single nucleotide polymorphisms (SNP), one each for the VDR and COL1A1 genes, and 2 for the ER gene. The VDR and ER SNP are the same SNP that have been associated with OA. All 4 SNP affect restriction enzyme sites and were genotyped using polymerase chain reaction and enzyme digestion. Allele and genotype distributions for each SNP were compared between cases and controls and analyzed using Fisher's exact test. There was no evidence of association of the VDR or the ER gene SNP to OA. There was weak evidence of association of the COL1A1 SNP in female cases (p = 0.017), reflected by a difference in the distribution of genotypes at this SNP between female cases and controls (p = 0.027). However, when corrected for multiple testing, these results were not significant. If the VDR, ER, or COL1A1 genes do encode predisposition to OA then the 4 SNP tested are not associated with major susceptibility alleles at these 3 loci.
Genomic selection and complex trait prediction using a fast EM algorithm applied to genome-wide markers

PubMed Central

2010-01-01

Background The information provided by dense genome-wide markers using high throughput technology is of considerable potential in human disease studies and livestock breeding programs. Genome-wide association studies relate individual single nucleotide polymorphisms (SNP) from dense SNP panels to individual measurements of complex traits, with the underlying assumption being that any association is caused by linkage disequilibrium (LD) between SNP and quantitative trait loci (QTL) affecting the trait. Often SNP are in genomic regions of no trait variation. Whole genome Bayesian models are an effective way of incorporating this and other important prior information into modelling. However a full Bayesian analysis is often not feasible due to the large computational time involved. Results This article proposes an expectation-maximization (EM) algorithm called emBayesB which allows only a proportion of SNP to be in LD with QTL and incorporates prior information about the distribution of SNP effects. The posterior probability of being in LD with at least one QTL is calculated for each SNP along with estimates of the hyperparameters for the mixture prior. A simulated example of genomic selection from an international workshop is used to demonstrate the features of the EM algorithm. The accuracy of prediction is comparable to a full Bayesian analysis but the EM algorithm is considerably faster. The EM algorithm was accurate in locating QTL which explained more than 1% of the total genetic variation. A computational algorithm for very large SNP panels is described. Conclusions emBayesB is a fast and accurate EM algorithm for implementing genomic selection and predicting complex traits by mapping QTL in genome-wide dense SNP marker data. Its accuracy is similar to Bayesian methods but it takes only a fraction of the time. PMID:20969788

Genetic and clinical risk factors of root resorption associated with orthodontic treatment.

PubMed

Guo, Yujiao; He, Shushu; Gu, Tian; Liu, Yi; Chen, Song

2016-08-01

External apical root resorption (EARR) is a common complication in orthodontic treatment. Despite many studies on EARR, great controversies remain with regard to its risk factors. The objective of this study was to explore the relationship among sex, root movement, IL-1RN single nucleotide polymorphism (SNP) rs419598, IL-6 SNP rs1800796, and EARR associated with orthodontic treatment. Altogether 174 patients (with 174 maxillary left central incisors) were selected for this study. Cone-beam computed tomography was performed before the start of the treatment and at the end of the treatment. Cone-beam computed tomography data were used to reconstruct a 3-dimensional image of each tooth; the volume and the root resorption volume of each tooth were calculated. Three-dimensional matching was used to measure the amount of movement of each root. Genomic DNA was extracted from buccal swabs, and genotypes of SNP rs419598 and SNP rs1800796 of each subject were determined using TaqMan polymerase chain reaction genotyping (Applied Biosystems, Foster City, Calif). The data were analyzed with multiple linear regression analysis. The statistical analysis indicated no relationship between sex, tooth movement amount, and IL-1RN SNP rs419598 with EARR. The IL-6 SNP rs1800796 GC was associated with EARR, and root resorption differed significantly between SNP rs1800796 GC and CC. IL-6 SNP rs1800796 GC is a risk factor for EARR. The amount of root movement, IL-1RN SNP rs419598, and sex as risk factors for EARR need further study. Copyright © 2016 American Association of Orthodontists. Published by Elsevier Inc. All rights reserved.
Application of whole genome sequence data in analyzing the molecular epidemiology of Shiga toxin-producing Escherichia coli O157:H7/H.

PubMed

Yokoyama, Eiji; Hirai, Shinichiro; Ishige, Taichiro; Murakami, Satoshi

2018-01-02

Seventeen clusters of Shiga toxin-producing Escherichia coli O157:H7/- (O157) strains, determined by cluster analysis of pulsed-field gel electrophoresis patterns, were analyzed using whole genome sequence (WGS) data to investigate this pathogen's molecular epidemiology. The 17 clusters included 136 strains containing strains from nine outbreaks, with each outbreak caused by a single source contaminated with the organism, as shown by epidemiological contact surveys. WGS data of these strains were used to identify single nucleotide polymorphisms (SNPs) by two methods: short read data were directly mapped to a reference genome (mapping derived SNPs) and common SNPs between the mapping derived SNPs and SNPs in assembled data of short read data (common SNPs). Among both SNPs, those that were detected in genes with a gap were excluded to remove ambiguous SNPs from further analysis. The effectiveness of both SNPs was investigated among all the concatenated SNPs that were detected (whole SNP set); SNPs were divided into three categories based on the genes in which they were located (i.e., backbone SNP set, O-island SNP set, and mobile element SNP set); and SNPs in non-coding regions (intergenic region SNP set). When SNPs from strains isolated from the nine single source derived outbreaks were analyzed using an unweighted pair group method with arithmetic mean tree (UPGMA) and a minimum spanning tree (MST), the maximum pair-wise distances of the backbone SNP set of the mapping derived SNPs were significantly smaller than those of the whole and intergenic region SNP set on both UPGMAs and MSTs. This significant difference was also observed when the backbone SNP set of the common SNPs were examined (Steel-Dwass test, P≤0.01). When the maximum pair-wise distances were compared between the mapping derived and common SNPs, significant differences were observed in those of the whole, mobile element, and intergenic region SNP set (Wilcoxon signed rank test, P≤0.01). When all the strains included in one complex on an MST or one cluster on a UPGMA were designated as the same genotype, the values of the Hunter-Gaston Discriminatory Power Index for the backbone SNP set of the mapping derived and common SNPs were higher than those of other SNP sets. In contrast, the mobile element SNP set could not robustly subdivide lineage I strains of tested O157 strains using both the mapping derived and common SNPs. These results suggested that the backbone SNP set were the most effective for analysis of WGS data for O157 in enabling an appropriation of its molecular epidemiology. Copyright © 2017 Elsevier B.V. All rights reserved.
CsSNP: A Web-Based Tool for the Detecting of Comparative Segments SNPs.

PubMed

Wang, Yi; Wang, Shuangshuang; Zhou, Dongjie; Yang, Shuai; Xu, Yongchao; Yang, Chao; Yang, Long

2016-07-01

SNP (single nucleotide polymorphism) is a popular tool for the study of genetic diversity, evolution, and other areas. Therefore, it is necessary to develop a convenient, utility, robust, rapid, and open source detecting-SNP tool for all researchers. Since the detection of SNPs needs special software and series steps including alignment, detection, analysis and present, the study of SNPs is limited for nonprofessional users. CsSNP (Comparative segments SNP, http://biodb.sdau.edu.cn/cssnp/ ) is a freely available web tool based on the Blat, Blast, and Perl programs to detect comparative segments SNPs and to show the detail information of SNPs. The results are filtered and presented in the statistics figure and a Gbrowse map. This platform contains the reference genomic sequences and coding sequences of 60 plant species, and also provides new opportunities for the users to detect SNPs easily. CsSNP is provided a convenient tool for nonprofessional users to find comparative segments SNPs in their own sequences, and give the users the information and the analysis of SNPs, and display these data in a dynamic map. It provides a new method to detect SNPs and may accelerate related studies.
Bias due to two-stage residual-outcome regression analysis in genetic association studies.

PubMed

Demissie, Serkalem; Cupples, L Adrienne

2011-11-01

Association studies of risk factors and complex diseases require careful assessment of potential confounding factors. Two-stage regression analysis, sometimes referred to as residual- or adjusted-outcome analysis, has been increasingly used in association studies of single nucleotide polymorphisms (SNPs) and quantitative traits. In this analysis, first, a residual-outcome is calculated from a regression of the outcome variable on covariates and then the relationship between the adjusted-outcome and the SNP is evaluated by a simple linear regression of the adjusted-outcome on the SNP. In this article, we examine the performance of this two-stage analysis as compared with multiple linear regression (MLR) analysis. Our findings show that when a SNP and a covariate are correlated, the two-stage approach results in biased genotypic effect and loss of power. Bias is always toward the null and increases with the squared-correlation between the SNP and the covariate (). For example, for , 0.1, and 0.5, two-stage analysis results in, respectively, 0, 10, and 50% attenuation in the SNP effect. As expected, MLR was always unbiased. Since individual SNPs often show little or no correlation with covariates, a two-stage analysis is expected to perform as well as MLR in many genetic studies; however, it produces considerably different results from MLR and may lead to incorrect conclusions when independent variables are highly correlated. While a useful alternative to MLR under , the two -stage approach has serious limitations. Its use as a simple substitute for MLR should be avoided. © 2011 Wiley Periodicals, Inc.
Single-nucleotide polymorphisms in the SEPTIN12 gene may be a genetic risk factor for Japanese patients with Sertoli cell-only syndrome.

PubMed

Miyakawa, Hiroe; Miyamoto, Toshinobu; Koh, Eitetsu; Tsujimura, Akira; Miyagawa, Yasushi; Saijo, Yasuaki; Namiki, Mikio; Sengoku, Kazuo

2012-01-01

Genetic mechanisms have been implicated as a cause of some cases of male infertility. Recently, 10 novel genes involved in human spermatogenesis, including human SEPTIN12, were identified by expression microarray analysis of human testicular tissue. Septin12 is a member of the septin family of conserved cytoskeletal GTPases that form heteropolymeric filamentous structures in interphase cells. It is expressed specifically in the testis. Therefore, we hypothesized that mutation or polymorphisms of SEPTIN12 participate in male infertility, especially Sertoli cell-only syndrome (SCOS). To investigate whether SEPTIN12 gene defects are associated with azoospermia caused by SCOS, mutational analysis was performed in 100 Japanese patients by direct sequencing of coding regions. Statistical analysis was performed in patients with SCOS and in 140 healthy control men. No mutations were found in SEPTIN12 ; however, 8 coding single-nucleotide polymorphisms (SNP1-SNP8) could be detected in the patients with SCOS. The genotype and allele frequencies in SNP3, SNP4, and SNP6 were notably higher in the SCOS group than in the control group (P < .001). These results suggest that SEPTIN12 might play a critical role in human spermatogenesis.
Standardization of PCR-RFLP analysis of nsSNP rs1468384 of NPC1L1 gene

PubMed Central

Balgir, Praveen P.; Khanna, Divya; Kaur, Gurlovleen

2008-01-01

Niemann-Pick C1-like 1 (NPC1L1) protein, a newly identified sterol influx transporter, located at the apical membrane of the enterocyte, which may actively facilitate the uptake of cholesterol by promoting the passage of sterols across the brush border membrane of the enterocyte. It effects intestinal cholesterol absorption and intracellular transport and as such is an integral part of complex process of cholesterol homeostasis. The study of population data for the distribution of these single nucleotide polymorphisms (SNP) of NPC1L1 has lead to the identification of six non-synonymous single nucleotide polymorphisms (nsSNP). The in vitro analysis using the software MuPro and StructureSNP shows that nsSNP M510I (rs1468384), which involves A→G base pair change leads to decrease in the stability of the protein. A reproducible and a cost-effective PCR-RFLP based assay was developed to screen for the SNP among population data. This SNP has been studied in Caucasian, Asian, and African American populations. Till date, no data is available on Indian population. The distribution of M510I NPC1L1 genotype was estimated in the North Western Indian Population as a test case. The allele distribution in Indian Population differs significantly from that of other populations. The methodology thus proved to be robust enough to bring out these differences. PMID:20300301
Tetra-primer ARMS-PCR identified four pivotal genetic variations in bovine PNPLA3 gene and its expression patterns.

PubMed

Wang, Zi-nian; Cai, Han-fang; Li, Ming-xun; Cao, Xiu-kai; Lan, Xian-yong; Lei, Chu-zhao; Chen, Hong

2016-01-10

Patatin-like phospholipase domain-containing protein 3 (PNPLA3), a member of the patatin like phospholipase domain-containing (PNPLA) family, plays an important role in energy balance, fat metabolism regulation, glucose metabolism and fatty liver disease. Tetra-primer amplification refractory mutation system PCR (T-ARMS-PCR) is a new method offering fast detection and extreme simplicity at a negligible cost for SNP genotyping. In this paper, we investigated the genetic variations at different ages of 660 Chinese indigenous cattle belonging to three breeds (QC, NY, JX) and applied T-ARMS-PCR and PCR-RFLP methods to genotype four SNPs, SNP1: g.A2980G, SNP2: g.A2996T, SNP3: g.A36718G, SNP4: g.G36850A. The statistical analyses indicated that these 4 SNPs affected growth traits markedly (P<0.05) in QC population, whereas combined haplotypes were not (P>0.05). The qPCR (quantitative PCR) indicated that bovine PNPLA3 gene was exclusively expressed in fat tissues. Besides, the analysis between SNP and mRNA expression revealed that, in SNP1, the expression of AG was much higher than AA and GG (P<0.05), which was in accordance with the results of growth traits association analysis, while the results of SNP4 was not. These results supported high potential that SNPs of bovine PNPLA3 gene might be utilized as genetic markers in marker-assisted selection (MAS) for Chinese cattle breeding programs. Copyright © 2015 Elsevier B.V. All rights reserved.
A Genome-wide Association Analysis of a Broad Psychosis Phenotype Identifies Three Loci for Further Investigation

PubMed Central

2014-01-01

Background Genome-wide association studies (GWAS) have identified several loci associated with schizophrenia and/or bipolar disorder. We performed a GWAS of psychosis as a broad syndrome rather than within specific diagnostic categories. Methods 1239 cases with schizophrenia, schizoaffective disorder, or psychotic bipolar disorder; 857 of their unaffected relatives, and 2739 healthy controls were genotyped with the Affymetrix 6.0 single nucleotide polymorphism (SNP) array. Analyses of 695,193 SNPs were conducted using UNPHASED, which combines information across families and unrelated individuals. We attempted to replicate signals found in 23 genomic regions using existing data on nonoverlapping samples from the Psychiatric GWAS Consortium and Schizophrenia-GENE-plus cohorts (10,352 schizophrenia patients and 24,474 controls). Results No individual SNP showed compelling evidence for association with psychosis in our data. However, we observed a trend for association with same risk alleles at loci previously associated with schizophrenia (one-sided p = .003). A polygenic score analysis found that the Psychiatric GWAS Consortium’s panel of SNPs associated with schizophrenia significantly predicted disease status in our sample (p = 5 × 10–14) and explained approximately 2% of the phenotypic variance. Conclusions Although narrowly defined phenotypes have their advantages, we believe new loci may also be discovered through meta-analysis across broad phenotypes. The novel statistical methodology we introduced to model effect size heterogeneity between studies should help future GWAS that combine association evidence from related phenotypes. Applying these approaches, we highlight three loci that warrant further investigation. We found that SNPs conveying risk for schizophrenia are also predictive of disease status in our data. PMID:23871474
A genome-wide association analysis of a broad psychosis phenotype identifies three loci for further investigation.

PubMed

Bramon, Elvira; Pirinen, Matti; Strange, Amy; Lin, Kuang; Freeman, Colin; Bellenguez, Céline; Su, Zhan; Band, Gavin; Pearson, Richard; Vukcevic, Damjan; Langford, Cordelia; Deloukas, Panos; Hunt, Sarah; Gray, Emma; Dronov, Serge; Potter, Simon C; Tashakkori-Ghanbaria, Avazeh; Edkins, Sarah; Bumpstead, Suzannah J; Arranz, Maria J; Bakker, Steven; Bender, Stephan; Bruggeman, Richard; Cahn, Wiepke; Chandler, David; Collier, David A; Crespo-Facorro, Benedicto; Dazzan, Paola; de Haan, Lieuwe; Di Forti, Marta; Dragović, Milan; Giegling, Ina; Hall, Jeremy; Iyegbe, Conrad; Jablensky, Assen; Kahn, René S; Kalaydjieva, Luba; Kravariti, Eugenia; Lawrie, Stephen; Linszen, Don H; Mata, Ignacio; McDonald, Colm; McIntosh, Andrew; Myin-Germeys, Inez; Ophoff, Roel A; Pariante, Carmine M; Paunio, Tiina; Picchioni, Marco; Ripke, Stephan; Rujescu, Dan; Sauer, Heinrich; Shaikh, Madiha; Sussmann, Jessika; Suvisaari, Jaana; Tosato, Sarah; Toulopoulou, Timothea; Van Os, Jim; Walshe, Muriel; Weisbrod, Matthias; Whalley, Heather; Wiersma, Durk; Blackwell, Jenefer M; Brown, Matthew A; Casas, Juan P; Corvin, Aiden; Duncanson, Audrey; Jankowski, Janusz A Z; Markus, Hugh S; Mathew, Christopher G; Palmer, Colin N A; Plomin, Robert; Rautanen, Anna; Sawcer, Stephen J; Trembath, Richard C; Wood, Nicholas W; Barroso, Ines; Peltonen, Leena; Lewis, Cathryn M; Murray, Robin M; Donnelly, Peter; Powell, John; Spencer, Chris C A

2014-03-01

Genome-wide association studies (GWAS) have identified several loci associated with schizophrenia and/or bipolar disorder. We performed a GWAS of psychosis as a broad syndrome rather than within specific diagnostic categories. 1239 cases with schizophrenia, schizoaffective disorder, or psychotic bipolar disorder; 857 of their unaffected relatives, and 2739 healthy controls were genotyped with the Affymetrix 6.0 single nucleotide polymorphism (SNP) array. Analyses of 695,193 SNPs were conducted using UNPHASED, which combines information across families and unrelated individuals. We attempted to replicate signals found in 23 genomic regions using existing data on nonoverlapping samples from the Psychiatric GWAS Consortium and Schizophrenia-GENE-plus cohorts (10,352 schizophrenia patients and 24,474 controls). No individual SNP showed compelling evidence for association with psychosis in our data. However, we observed a trend for association with same risk alleles at loci previously associated with schizophrenia (one-sided p = .003). A polygenic score analysis found that the Psychiatric GWAS Consortium's panel of SNPs associated with schizophrenia significantly predicted disease status in our sample (p = 5 × 10(-14)) and explained approximately 2% of the phenotypic variance. Although narrowly defined phenotypes have their advantages, we believe new loci may also be discovered through meta-analysis across broad phenotypes. The novel statistical methodology we introduced to model effect size heterogeneity between studies should help future GWAS that combine association evidence from related phenotypes. Applying these approaches, we highlight three loci that warrant further investigation. We found that SNPs conveying risk for schizophrenia are also predictive of disease status in our data. Copyright © 2014 Society of Biological Psychiatry. Published by Elsevier Inc. All rights reserved.
SNP_tools: A compact tool package for analysis and conversion of genotype data for MS-Excel

PubMed Central

Chen, Bowang; Wilkening, Stefan; Drechsel, Marion; Hemminki, Kari

2009-01-01

Background Single nucleotide polymorphism (SNP) genotyping is a major activity in biomedical research. Scientists prefer to have a facile access to the results which may require conversions between data formats. First hand SNP data is often entered in or saved in the MS-Excel format, but this software lacks genetic and epidemiological related functions. A general tool to do basic genetic and epidemiological analysis and data conversion for MS-Excel is needed. Findings The SNP_tools package is prepared as an add-in for MS-Excel. The code is written in Visual Basic for Application, embedded in the Microsoft Office package. This add-in is an easy to use tool for users with basic computer knowledge (and requirements for basic statistical analysis). Conclusion Our implementation for Microsoft Excel 2000-2007 in Microsoft Windows 2000, XP, Vista and Windows 7 beta can handle files in different formats and converts them into other formats. It is a free software. PMID:19852806
SNP_tools: A compact tool package for analysis and conversion of genotype data for MS-Excel.

PubMed

Chen, Bowang; Wilkening, Stefan; Drechsel, Marion; Hemminki, Kari

2009-10-23

Single nucleotide polymorphism (SNP) genotyping is a major activity in biomedical research. Scientists prefer to have a facile access to the results which may require conversions between data formats. First hand SNP data is often entered in or saved in the MS-Excel format, but this software lacks genetic and epidemiological related functions. A general tool to do basic genetic and epidemiological analysis and data conversion for MS-Excel is needed. The SNP_tools package is prepared as an add-in for MS-Excel. The code is written in Visual Basic for Application, embedded in the Microsoft Office package. This add-in is an easy to use tool for users with basic computer knowledge (and requirements for basic statistical analysis). Our implementation for Microsoft Excel 2000-2007 in Microsoft Windows 2000, XP, Vista and Windows 7 beta can handle files in different formats and converts them into other formats. It is a free software.
FSR: feature set reduction for scalable and accurate multi-class cancer subtype classification based on copy number.

PubMed

Wong, Gerard; Leckie, Christopher; Kowalczyk, Adam

2012-01-15

Feature selection is a key concept in machine learning for microarray datasets, where features represented by probesets are typically several orders of magnitude larger than the available sample size. Computational tractability is a key challenge for feature selection algorithms in handling very high-dimensional datasets beyond a hundred thousand features, such as in datasets produced on single nucleotide polymorphism microarrays. In this article, we present a novel feature set reduction approach that enables scalable feature selection on datasets with hundreds of thousands of features and beyond. Our approach enables more efficient handling of higher resolution datasets to achieve better disease subtype classification of samples for potentially more accurate diagnosis and prognosis, which allows clinicians to make more informed decisions in regards to patient treatment options. We applied our feature set reduction approach to several publicly available cancer single nucleotide polymorphism (SNP) array datasets and evaluated its performance in terms of its multiclass predictive classification accuracy over different cancer subtypes, its speedup in execution as well as its scalability with respect to sample size and array resolution. Feature Set Reduction (FSR) was able to reduce the dimensions of an SNP array dataset by more than two orders of magnitude while achieving at least equal, and in most cases superior predictive classification performance over that achieved on features selected by existing feature selection methods alone. An examination of the biological relevance of frequently selected features from FSR-reduced feature sets revealed strong enrichment in association with cancer. FSR was implemented in MATLAB R2010b and is available at http://ww2.cs.mu.oz.au/~gwong/FSR.
A deletion and a duplication in distal 22q11.2 deletion syndrome region. Clinical implications and review

PubMed Central

Fernández, Luis; Nevado, Julián; Santos, Fernando; Heine-Suñer, Damià; Martinez-Glez, Victor; García-Miñaur, Sixto; Palomo, Rebeca; Delicado, Alicia; Pajares, Isidora López; Palomares, María; García-Guereta, Luis; Valverde, Eva; Hawkins, Federico; Lapunzina, Pablo

2009-01-01

Background Individuals affected with DiGeorge and Velocardiofacial syndromes present with both phenotypic diversity and variable expressivity. The most frequent clinical features include conotruncal congenital heart defects, velopharyngeal insufficiency, hypocalcemia and a characteristic craniofacial dysmorphism. The etiology in most patients is a 3 Mb recurrent deletion in region 22q11.2. However, cases of infrequent deletions and duplications with different sizes and locations have also been reported, generally with a milder, slightly different phenotype for duplications but with no clear genotype-phenotype correlation to date. Methods We present a 7 month-old male patient with surgically corrected ASD and multiple VSDs, and dysmorphic facial features not clearly suggestive of 22q11.2 deletion syndrome, and a newborn male infant with cleft lip and palate and upslanting palpebral fissures. Karyotype, FISH, MLPA, microsatellite markers segregation studies and SNP genotyping by array-CGH were performed in both patients and parents. Results Karyotype and FISH with probe N25 were normal for both patients. MLPA analysis detected a partial de novo 1.1 Mb deletion in one patient and a novel partial familial 0.4 Mb duplication in the other. Both of these alterations were located at a distal position within the commonly deleted region in 22q11.2. These rearrangements were confirmed and accurately characterized by microsatellite marker segregation studies and SNP array genotyping. Conclusion The phenotypic diversity found for deletions and duplications supports a lack of genotype-phenotype correlation in the vicinity of the LCRC-LCRD interval of the 22q11.2 chromosomal region, whereas the high presence of duplications in normal individuals supports their role as polymorphisms. We suggest that any hypothetical correlation between the clinical phenotype and the size and location of these alterations may be masked by other genetic and/or epigenetic modifying factors. PMID:19490635
[Molecular cytogenetic analysis of chromosomal aberrations in cells of low grade gliomas and its contribution for tumour classification].

PubMed

Lhotská, H; Zemanová, Z; Kramář, F; Lizcová, L; Svobodová, K; Ransdorfová, S; Bystřická, D; Krejčík, Z; Hrabal, P; Dohnalová, A; Kaiser, M; Michalová, K

2014-01-01

Low-grade gliomas represent a heterogeneous group of primary brain malignancies. The current diagnostics of these tumors rely strongly on histological classification. With the development of molecular cytogenetic methods several genetic markers were described, contributing to a better distinction of glial subtypes. The aim of this study was to assess the frequency of acquired chromosomal aberrations in lowgrade gliomas and to search for new genomic changes associated with higher risk of tumor progression. We analysed biopsy specimens from 41 patients with histological dia-gnosis of low-grade glioma using interphase fluorescence in situ hybridization (I FISH) and single nucleotide polymorphism (SNP) array techniques (19 females and 22 males, medium age 42 years). Besides notorious and most frequent finding of combined deletion of 1p/ 19q (81.25% patients) several other recurrent aberrations were described in patients with oligodendrogliomas: deletions of p and q arms of chromosome 4 (25% patients), deletions of the short arms of chromosome 9 (18.75% patients), deletions of the long arms of chromosome 13 and monosomy of chromosome 18 (18.75% patients). In bio-psy specimens from patients with astrocytomas, we often observed deletion of 1p (24% patients), amplification of the long arms of chromosome 7 (16% patients), deletion of the long arm of chromosome 13 (20% patients), segmental uniparental disomy (UPD) of the short arms of chromosome 17 (60% patients) and deletion of the long arms of chromosome 19 (28% patients). In one patient we detected a shuttered chromosome 10 resulting from chromothripsis. Using a combination of I FISH and SNP array, we detected not only known chromosomal changes but also new or less frequent recur-rent aberrations. Their role in cancer cell progression and their impact on low grade gliomas classification remains to be elucidated in a larger cohort of patients.
Genomic imbalances in esophageal carcinoma cell lines involve Wnt pathway genes.

PubMed

Brown, Jacqueline; Bothma, Hannelie; Veale, Robin; Willem, Pascale

2011-06-28

To identify molecular markers shared across South African esophageal squamous cell carcinoma (ESCC) cell lines using cytogenetics, fluorescence in situ hybridization (FISH) and single nucleotide polymorphism (SNP) array copy number analysis. We used conventional cytogenetics, FISH, and multicolor FISH to characterize the chromosomal rearrangements of five ESCC cell lines established in South Africa. The whole genome copy number profile was established from 250K SNP arrays, and data was analyzed with the CNAT 4.0 and GISTIC software. We detected common translocation breakpoints involving chromosomes 1p11-12 and 3p11.2, the latter correlated with the deletion, or interruption of the EPHA3 gene. The most significant amplifications involved the following chromosomal regions and genes: 11q13.3 (CCND1, FGF3, FGF4, FGF19, MYEOV), 8q24.21(C-MYC, FAM84B), 11q22.1-q22.3 (BIRC2, BIRC3), 5p15.2 (CTNND2), 3q11.2-q12.2 (MINA) and 18p11.32 (TYMS, YES1). The significant deletions included 1p31.2-p31.1 (CTH, GADD45α, DIRAS3), 2q22.1 (LRP1B), 3p12.1-p14.2 (FHIT), 4q22.1-q32.1 (CASP6, SMAD1), 8p23.2-q11.1 (BNIP3L) and 18q21.1-q21.2 (SMAD4, DCC). The 3p11.2 translocation breakpoint was shared across four cell lines, supporting a role for genes involved at this site, in particular, the EPHA3 gene which has previously been reported to be deleted in ESCC. The finding that a significant number of genes that were amplified (FGF3, FGF4, FGF19, CCND1 and C-MYC) or deleted (SFRP2 gene) are involved in the Wnt and fibroblast growth factor signaling pathways, suggests that these pathways may be activated in these cell lines.
Genomic imbalances in esophageal carcinoma cell lines involve Wnt pathway genes

PubMed Central

Brown, Jacqueline; Bothma, Hannelie; Veale, Robin; Willem, Pascale

2011-01-01

AIM: To identify molecular markers shared across South African esophageal squamous cell carcinoma (ESCC) cell lines using cytogenetics, fluorescence in situ hybridization (FISH) and single nucleotide polymorphism (SNP) array copy number analysis. METHODS: We used conventional cytogenetics, FISH, and multicolor FISH to characterize the chromosomal rearrangements of five ESCC cell lines established in South Africa. The whole genome copy number profile was established from 250K SNP arrays, and data was analyzed with the CNAT 4.0 and GISTIC software. RESULTS: We detected common translocation breakpoints involving chromosomes 1p11-12 and 3p11.2, the latter correlated with the deletion, or interruption of the EPHA3 gene. The most significant amplifications involved the following chromosomal regions and genes: 11q13.3 (CCND1, FGF3, FGF4, FGF19, MYEOV), 8q24.21(C-MYC, FAM84B), 11q22.1-q22.3 (BIRC2, BIRC3), 5p15.2 (CTNND2), 3q11.2-q12.2 (MINA) and 18p11.32 (TYMS, YES1). The significant deletions included 1p31.2-p31.1 (CTH, GADD45α, DIRAS3), 2q22.1 (LRP1B), 3p12.1-p14.2 (FHIT), 4q22.1-q32.1 (CASP6, SMAD1), 8p23.2-q11.1 (BNIP3L) and 18q21.1-q21.2 (SMAD4, DCC). The 3p11.2 translocation breakpoint was shared across four cell lines, supporting a role for genes involved at this site, in particular, the EPHA3 gene which has previously been reported to be deleted in ESCC. CONCLUSION: The finding that a significant number of genes that were amplified (FGF3, FGF4, FGF19, CCND1 and C-MYC) or deleted (SFRP2 gene) are involved in the Wnt and fibroblast growth factor signaling pathways, suggests that these pathways may be activated in these cell lines. PMID:21734802
Variation in Recombination Rate and Its Genetic Determinism in Sheep Populations

PubMed Central

Petit, Morgane; Astruc, Jean-Michel; Sarry, Julien; Drouilhet, Laurence; Fabre, Stéphane; Moreno, Carole R.; Servin, Bertrand

2017-01-01

Recombination is a complex biological process that results from a cascade of multiple events during meiosis. Understanding the genetic determinism of recombination can help to understand if and how these events are interacting. To tackle this question, we studied the patterns of recombination in sheep, using multiple approaches and data sets. We constructed male recombination maps in a dairy breed from the south of France (the Lacaune breed) at a fine scale by combining meiotic recombination rates from a large pedigree genotyped with a 50K SNP array and historical recombination rates from a sample of unrelated individuals genotyped with a 600K SNP array. This analysis revealed recombination patterns in sheep similar to other mammals but also genome regions that have likely been affected by directional and diversifying selection. We estimated the average recombination rate of Lacaune sheep at 1.5 cM/Mb, identified ∼50,000 crossover hotspots on the genome, and found a high correlation between historical and meiotic recombination rate estimates. A genome-wide association study revealed two major loci affecting interindividual variation in recombination rate in Lacaune, including the RNF212 and HEI10 genes and possibly two other loci of smaller effects including the KCNJ15 and FSHR genes. The comparison of these new results to those obtained previously in a distantly related population of domestic sheep (the Soay) revealed that Soay and Lacaune males have a very similar distribution of recombination along the genome. The two data sets were thus combined to create more precise male meiotic recombination maps in Sheep. However, despite their similar recombination maps, Soay and Lacaune males were found to exhibit different heritabilities and QTL effects for interindividual variation in genome-wide recombination rates. This highlights the robustness of recombination patterns to underlying variation in their genetic determinism. PMID:28978774
Variation in Recombination Rate and Its Genetic Determinism in Sheep Populations.

PubMed

Petit, Morgane; Astruc, Jean-Michel; Sarry, Julien; Drouilhet, Laurence; Fabre, Stéphane; Moreno, Carole R; Servin, Bertrand

2017-10-01

Recombination is a complex biological process that results from a cascade of multiple events during meiosis. Understanding the genetic determinism of recombination can help to understand if and how these events are interacting. To tackle this question, we studied the patterns of recombination in sheep, using multiple approaches and data sets. We constructed male recombination maps in a dairy breed from the south of France (the Lacaune breed) at a fine scale by combining meiotic recombination rates from a large pedigree genotyped with a 50K SNP array and historical recombination rates from a sample of unrelated individuals genotyped with a 600K SNP array. This analysis revealed recombination patterns in sheep similar to other mammals but also genome regions that have likely been affected by directional and diversifying selection. We estimated the average recombination rate of Lacaune sheep at 1.5 cM/Mb, identified ∼50,000 crossover hotspots on the genome, and found a high correlation between historical and meiotic recombination rate estimates. A genome-wide association study revealed two major loci affecting interindividual variation in recombination rate in Lacaune, including the RNF212 and HEI10 genes and possibly two other loci of smaller effects including the KCNJ15 and FSHR genes. The comparison of these new results to those obtained previously in a distantly related population of domestic sheep (the Soay) revealed that Soay and Lacaune males have a very similar distribution of recombination along the genome. The two data sets were thus combined to create more precise male meiotic recombination maps in Sheep. However, despite their similar recombination maps, Soay and Lacaune males were found to exhibit different heritabilities and QTL effects for interindividual variation in genome-wide recombination rates. This highlights the robustness of recombination patterns to underlying variation in their genetic determinism. Copyright © 2017 by the Genetics Society of America.
Genomic predictions can accelerate selection for resistance against Piscirickettsia salmonis in Atlantic salmon (Salmo salar).

PubMed

Bangera, Rama; Correa, Katharina; Lhorente, Jean P; Figueroa, René; Yáñez, José M

2017-01-31

Salmon Rickettsial Syndrome (SRS) caused by Piscirickettsia salmonis is a major disease affecting the Chilean salmon industry. Genomic selection (GS) is a method wherein genome-wide markers and phenotype information of full-sibs are used to predict genomic EBV (GEBV) of selection candidates and is expected to have increased accuracy and response to selection over traditional pedigree based Best Linear Unbiased Prediction (PBLUP). Widely used GS methods such as genomic BLUP (GBLUP), SNPBLUP, Bayes C and Bayesian Lasso may perform differently with respect to accuracy of GEBV prediction. Our aim was to compare the accuracy, in terms of reliability of genome-enabled prediction, from different GS methods with PBLUP for resistance to SRS in an Atlantic salmon breeding program. Number of days to death (DAYS), binary survival status (STATUS) phenotypes, and 50 K SNP array genotypes were obtained from 2601 smolts challenged with P. salmonis. The reliability of different GS methods at different SNP densities with and without pedigree were compared to PBLUP using a five-fold cross validation scheme. Heritability estimated from GS methods was significantly higher than PBLUP. Pearson's correlation between predicted GEBV from PBLUP and GS models ranged from 0.79 to 0.91 and 0.79-0.95 for DAYS and STATUS, respectively. The relative increase in reliability from different GS methods for DAYS and STATUS with 50 K SNP ranged from 8 to 25% and 27-30%, respectively. All GS methods outperformed PBLUP at all marker densities. DAYS and STATUS showed superior reliability over PBLUP even at the lowest marker density of 3 K and 500 SNP, respectively. 20 K SNP showed close to maximal reliability for both traits with little improvement using higher densities. These results indicate that genomic predictions can accelerate genetic progress for SRS resistance in Atlantic salmon and implementation of this approach will contribute to the control of SRS in Chile. We recommend GBLUP for routine GS evaluation because this method is computationally faster and the results are very similar with other GS methods. The use of lower density SNP or the combination of low density SNP and an imputation strategy may help to reduce genotyping costs without compromising gain in reliability.
Genome-Wide Association Meta-Analysis Reveals Novel Juvenile Idiopathic Arthritis Susceptibility Loci.

PubMed

McIntosh, Laura A; Marion, Miranda C; Sudman, Marc; Comeau, Mary E; Becker, Mara L; Bohnsack, John F; Fingerlin, Tasha E; Griffin, Thomas A; Haas, J Peter; Lovell, Daniel J; Maier, Lisa A; Nigrovic, Peter A; Prahalad, Sampath; Punaro, Marilynn; Rosé, Carlos D; Wallace, Carol A; Wise, Carol A; Moncrieffe, Halima; Howard, Timothy D; Langefeld, Carl D; Thompson, Susan D

2017-11-01

Juvenile idiopathic arthritis (JIA) is the most common childhood rheumatic disease and has a strong genomic component. To date, JIA genetic association studies have had limited sample sizes, used heterogeneous patient populations, or included only candidate regions. The aim of this study was to identify new associations between JIA patients with oligoarticular disease and those with IgM rheumatoid factor (RF)-negative polyarticular disease, which are clinically similar and the most prevalent JIA disease subtypes. Three cohorts comprising 2,751 patients with oligoarticular or RF-negative polyarticular JIA were genotyped using the Affymetrix Genome-Wide SNP Array 6.0 or the Illumina HumanCoreExome-12+ Array. Overall, 15,886 local and out-of-study controls, typed on these platforms or the Illumina HumanOmni2.5, were used for association analyses. High-quality single-nucleotide polymorphisms (SNPs) were used for imputation to 1000 Genomes prior to SNP association analysis. Meta-analysis showed evidence of association (P < 1 × 10 -6 ) at 9 regions: PRR9_LOR (P = 5.12 × 10 -8 ), ILDR1_CD86 (P = 6.73 × 10 -8 ), WDFY4 (P = 1.79 × 10 -7 ), PTH1R (P = 1.87 × 10 -7 ), RNF215 (P = 3.09 × 10 -7 ), AHI1_LINC00271 (P = 3.48 × 10 -7 ), JAK1 (P = 4.18 × 10 -7 ), LINC00951 (P = 5.80 × 10 -7 ), and HBP1 (P = 7.29 × 10 -7 ). Of these, PRR9_LOR, ILDR1_CD86, RNF215, LINC00951, and HBP1 were shown, for the first time, to be autoimmune disease susceptibility loci. Furthermore, associated SNPs included cis expression quantitative trait loci for WDFY4, CCDC12, MTP18, SF3A1, AHI1, COG5, HBP1, and GPR22. This study provides evidence of both unique JIA risk loci and risk loci overlapping between JIA and other autoimmune diseases. These newly associated SNPs are shown to influence gene expression, and their bounding regions tie into molecular pathways of immunologic relevance. Thus, they likely represent regions that contribute to the pathology of oligoarticular JIA and RF-negative polyarticular JIA. © 2017, American College of Rheumatology.

Evaluation of Direct and Indirect Methods of Sub-Neoglottic Pressure Measurement in Tracheoesophageal Speakers: A Systematic Review and Meta-Analysis.

PubMed

Sheela, Shekaraiah; Aithal, Venkataraja U; Rajashekhar, Bellur; Lewis, Melissa Glenda

2016-01-01

Tracheoesophageal (TE) prosthetic voice is one of the voice restoration options for individuals who have undergone a total laryngectomy. Aerodynamic analysis of the TE voice provides insight into the physiological changes that occur at the level of the neoglottis with voice prosthesis in situ. The present study is a systematic review and meta-analysis of sub-neoglottic pressure (SNP) measurement in TE speakers by direct and indirect methods. The screening of abstracts and titles was carried out for inclusion of articles using 10 electronic databases spanning the period from 1979 to 2016. Ten articles which met the inclusion criteria were considered for meta-analysis with a pooled age range of 40-83 years. The pooled mean SNP obtained from the direct measurement method was 53.80 cm H2O with a 95% confidence interval of 21.14-86.46 cm H2O, while for the indirect measurement method, the mean SNP was 23.55 cm H2O with a 95% confidence interval of 19.23-27.87 cm H2O. Based on the literature review, the various procedures followed for direct and indirect measurements of SNP contributed to a range of differences in outcome measures. The meta-analysis revealed that the "interpolation method" for indirect estimation of SNP was the most acceptable and valid method in TE speakers. © 2017 S. Karger AG, Basel.
Rice SNP-seek database update: new SNPs, indels, and queries.

PubMed

Mansueto, Locedie; Fuentes, Roven Rommel; Borja, Frances Nikki; Detras, Jeffery; Abriol-Santos, Juan Miguel; Chebotarov, Dmytro; Sanciangco, Millicent; Palis, Kevin; Copetti, Dario; Poliakov, Alexandre; Dubchak, Inna; Solovyev, Victor; Wing, Rod A; Hamilton, Ruaraidh Sackville; Mauleon, Ramil; McNally, Kenneth L; Alexandrov, Nickolai

2017-01-04

We describe updates to the Rice SNP-Seek Database since its first release. We ran a new SNP-calling pipeline followed by filtering that resulted in complete, base, filtered and core SNP datasets. Besides the Nipponbare reference genome, the pipeline was run on genome assemblies of IR 64, 93-11, DJ 123 and Kasalath. New genotype query and display features are added for reference assemblies, SNP datasets and indels. JBrowse now displays BAM, VCF and other annotation tracks, the additional genome assemblies and an embedded VISTA genome comparison viewer. Middleware is redesigned for improved performance by using a hybrid of HDF5 and RDMS for genotype storage. Query modules for genotypes, varieties and genes are improved to handle various constraints. An integrated list manager allows the user to pass query parameters for further analysis. The SNP Annotator adds traits, ontology terms, effects and interactions to markers in a list. Web-service calls were implemented to access most data. These features enable seamless querying of SNP-Seek across various biological entities, a step toward semi-automated gene-trait association discovery. URL: http://snp-seek.irri.org. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
A Genome Scan Conducted in a Multigenerational Pedigree with Convergent Strabismus Supports a Complex Genetic Determinism

PubMed Central

Georges, Anouk; Cambisano, Nadine; Ahariz, Naïma; Karim, Latifa; Georges, Michel

2013-01-01

A genome-wide linkage scan was conducted in a Northern-European multigenerational pedigree with nine of 40 related members affected with concomitant strabismus. Twenty-seven members of the pedigree including all affected individuals were genotyped using a SNP array interrogating > 300,000 common SNPs. We conducted parametric and non-parametric linkage analyses assuming segregation of an autosomal dominant mutation, yet allowing for incomplete penetrance and phenocopies. We detected two chromosome regions with near-suggestive evidence for linkage, respectively on chromosomes 8 and 18. The chromosome 8 linkage implied a penetrance of 0.80 and a rate of phenocopy of 0.11, while the chromosome 18 linkage implied a penetrance of 0.64 and a rate of phenocopy of 0. Our analysis excludes a simple genetic determinism of strabismus in this pedigree. PMID:24376720
A genome scan conducted in a multigenerational pedigree with convergent strabismus supports a complex genetic determinism.

PubMed

Georges, Anouk; Cambisano, Nadine; Ahariz, Naïma; Karim, Latifa; Georges, Michel

2013-01-01

A genome-wide linkage scan was conducted in a Northern-European multigenerational pedigree with nine of 40 related members affected with concomitant strabismus. Twenty-seven members of the pedigree including all affected individuals were genotyped using a SNP array interrogating > 300,000 common SNPs. We conducted parametric and non-parametric linkage analyses assuming segregation of an autosomal dominant mutation, yet allowing for incomplete penetrance and phenocopies. We detected two chromosome regions with near-suggestive evidence for linkage, respectively on chromosomes 8 and 18. The chromosome 8 linkage implied a penetrance of 0.80 and a rate of phenocopy of 0.11, while the chromosome 18 linkage implied a penetrance of 0.64 and a rate of phenocopy of 0. Our analysis excludes a simple genetic determinism of strabismus in this pedigree.
Genome scan study of prostate cancer in Arabs: identification of three genomic regions with multiple prostate cancer susceptibility loci in Tunisians.

PubMed

Shan, Jingxuan; Al-Rumaihi, Khalid; Rabah, Danny; Al-Bozom, Issam; Kizhakayil, Dhanya; Farhat, Karim; Al-Said, Sami; Kfoury, Hala; Dsouza, Shoba P; Rowe, Jillian; Khalak, Hanif G; Jafri, Shahzad; Aigha, Idil I; Chouchane, Lotfi

2013-05-13

Large databases focused on genetic susceptibility to prostate cancer have been accumulated from population studies of different ancestries, including Europeans and African-Americans. Arab populations, however, have been only rarely studied. Using Affymetrix Genome-Wide Human SNP Array 6, we conducted a genome-wide association study (GWAS) in which 534,781 single nucleotide polymorphisms (SNPs) were genotyped in 221 Tunisians (90 prostate cancer patients and 131 age-matched healthy controls). TaqMan SNP Genotyping Assays on 11 prostate cancer associated SNPs were performed in a distinct cohort of 337 individuals from Arab ancestry living in Qatar and Saudi Arabia (155 prostate cancer patients and 182 age-matched controls). In-silico expression quantitative trait locus (eQTL) analysis along with mRNA quantification of nearby genes was performed to identify loci potentially cis-regulated by the identified SNPs. Three chromosomal regions, encompassing 14 SNPs, are significantly associated with prostate cancer risk in the Tunisian population (P = 1 × 10-4 to P = 1 × 10-5). In addition to SNPs located on chromosome 17q21, previously found associated with prostate cancer in Western populations, two novel chromosomal regions are revealed on chromosome 9p24 and 22q13. eQTL analysis and mRNA quantification indicate that the prostate cancer associated SNPs of chromosome 17 could enhance the expression of STAT5B gene. Our findings, identifying novel GWAS prostate cancer susceptibility loci, indicate that prostate cancer genetic risk factors could be ethnic specific.
Genetic diversity and investigation of polledness in divergent goat populations using 52 088 SNPs.

PubMed

Kijas, James W; Ortiz, Judit S; McCulloch, Russell; James, Andrew; Brice, Blair; Swain, Ben; Tosser-Klopp, Gwenola

2013-06-01

The recent availability of a genome-wide SNP array for the goat genome dramatically increases the power to investigate aspects of genetic diversity and to conduct genome-wide association studies in this important domestic species. We collected and analysed genotypes from 52 088 SNPs in Boer, Cashmere and Rangeland goats that had both polled and horned individuals. Principal components analysis revealed a clear genetic division between animals for each population, and model-based clustering successfully detected evidence of admixture that matched aspects of their recorded history. For example, shared co-ancestry was detected, suggesting Boer goats have been introgressed into the Rangeland population. Further, allele frequency data successfully tracked the altered genetic profile that has taken place after 40 years of breeding Australian Cashmere goats using the Rangeland animals as the founding population. Genome-wide association mapping of the POLL locus revealed a strong signal on goat chromosome 1. The 769-kb critical interval contained the polled intersex syndrome locus, confirming the genetic basis in non-European animals is the same as identified previously in Saanen goats. Interestingly, analysis of the haplotypes carried by a small set of sex-reversed animals, known to be associated with polledness, revealed some animals carried the wild-type chromosome associated with the presence of horns. This suggests a more complex basis for the relationship between polledness and the intersex condition than initially thought while validating the application of the goat SNP50 BeadChip for fine-mapping traits in goat. © The Author(s) and Commonwealth of Australia. Animal Genetics © 2012 Stichting International Foundation for Animal Genetics.
Response to lenalidomide in myelodysplastic syndromes with del(5q): influence of cytogenetics and mutations.

PubMed

Mallo, Mar; Del Rey, Mónica; Ibáñez, Mariam; Calasanz, M José; Arenillas, Leonor; Larráyoz, M José; Pedro, Carmen; Jerez, Andrés; Maciejewski, Jaroslaw; Costa, Dolors; Nomdedeu, Meritxell; Diez-Campelo, María; Lumbreras, Eva; González-Martínez, Teresa; Marugán, Isabel; Such, Esperanza; Cervera, José; Cigudosa, Juan C; Alvarez, Sara; Florensa, Lourdes; Hernández, Jesús M; Solé, Francesc

2013-07-01

Lenalidomide is an effective drug in low-risk myelodysplastic syndromes (MDS) with isolated del(5q), although not all patients respond. Studies have suggested a role for TP53 mutations and karyotype complexity in disease progression and outcome. In order to assess the impact of complex karyotypes on treatment response and disease progression in 52 lenalidomide-treated patients with del(5q) MDS, conventional G-banding cytogenetics (CC), single nucleotide polymorphism array (SNP-A), and genomic sequencing methods were used. SNP-A analysis (with control sample, lymphocytes CD3+, in 30 cases) revealed 5q losses in all cases. Other recurrent abnormalities were infrequent and were not associated with lenalidomide responsiveness. Low karyotype complexity (by CC) and a high baseline platelet count (>280 × 10(9) /l) were associated with the achievement of haematological response (P = 0·020, P = 0·013 respectively). Unmutated TP53 status showed a tendency for haematological response (P = 0·061). Complete cytogenetic response was not observed in any of the mutated TP53 cases. By multivariate analysis, the most important predictor for lenalidomide treatment failure was a platelet count <280 × 10(9) /l (Odds Ratio = 6·17, P = 0·040). This study reveals the importance of a low baseline platelet count, karyotypic complexity and TP53 mutational status for response to lenalidomide treatment. It supports the molecular study of TP53 in MDS patients treated with lenalidomide. © 2013 John Wiley & Sons Ltd.
Summarizing techniques that combine three non-parametric scores to detect disease-associated 2-way SNP-SNP interactions.

PubMed

Sengupta Chattopadhyay, Amrita; Hsiao, Ching-Lin; Chang, Chien Ching; Lian, Ie-Bin; Fann, Cathy S J

2014-01-01

Identifying susceptibility genes that influence complex diseases is extremely difficult because loci often influence the disease state through genetic interactions. Numerous approaches to detect disease-associated SNP-SNP interactions have been developed, but none consistently generates high-quality results under different disease scenarios. Using summarizing techniques to combine a number of existing methods may provide a solution to this problem. Here we used three popular non-parametric methods-Gini, absolute probability difference (APD), and entropy-to develop two novel summary scores, namely principle component score (PCS) and Z-sum score (ZSS), with which to predict disease-associated genetic interactions. We used a simulation study to compare performance of the non-parametric scores, the summary scores, the scaled-sum score (SSS; used in polymorphism interaction analysis (PIA)), and the multifactor dimensionality reduction (MDR). The non-parametric methods achieved high power, but no non-parametric method outperformed all others under a variety of epistatic scenarios. PCS and ZSS, however, outperformed MDR. PCS, ZSS and SSS displayed controlled type-I-errors (<0.05) compared to GS, APDS, ES (>0.05). A real data study using the genetic-analysis-workshop 16 (GAW 16) rheumatoid arthritis dataset identified a number of interesting SNP-SNP interactions. © 2013 Elsevier B.V. All rights reserved.
Silver nano fabrication using leaf disc of Passiflora foetida Linn

NASA Astrophysics Data System (ADS)

Lade, Bipin D.; Patil, Anita S.

2017-06-01

The main purpose of the experiment is to develop a greener low cost SNP fabrication steps using factories of secondary metabolites from Passiflora leaf extract. Here, the leaf extraction process is omitted, and instead a leaf disc was used for stable SNP fabricated by optimizing parameters such as a circular leaf disc of 2 cm (1, 2, 3, 4, 5) instead of leaf extract and grade of pH (7, 8, 9, 11). The SNP synthesis reaction is tried under room temperature, sun, UV and dark condition. The leaf disc preparation steps are also discussed in details. The SNP obtained using (1 mM: 100 ml AgNO3+ singular leaf disc: pH 9, 11) is applied against featured room temperature and sun condition. The UV spectroscopic analysis confirms that sun rays synthesized SNP yields stable nano particles. The FTIR analysis confirms a large number of functional groups such as alkanes, alkyne, amines, aliphatic amine, carboxylic acid; nitro-compound, alcohol, saturated aldehyde and phenols involved in reduction of silver salt to zero valent ions. The leaf disc mediated synthesis of silver nanoparticles, minimizes leaf extract preparation step and eligible for stable SNP synthesis. The methods sun and room temperature based nano particles synthesized within 10 min would be use certainly for antimicrobial activity.
Single nucleotide polymorphism-specific regulation of matrix metalloproteinase-9 by multiple miRNAs targeting the coding exon

PubMed Central

Duellman, Tyler; Warren, Christopher; Yang, Jay

2014-01-01

Microribonucleic acids (miRNAs) work with exquisite specificity and are able to distinguish a target from a non-target based on a single nucleotide mismatch in the core nucleotide domain. We questioned whether miRNA regulation of gene expression could occur in a single nucleotide polymorphism (SNP)-specific manner, manifesting as a post-transcriptional control of expression of genetic polymorphisms. In our recent study of the functional consequences of matrix metalloproteinase (MMP)-9 SNPs, we discovered that expression of a coding exon SNP in the pro-domain of the protein resulted in a profound decrease in the secreted protein. This missense SNP results in the N38S amino acid change and a loss of an N-glycosylation site. A systematic study demonstrated that the loss of secreted protein was due not to the loss of an N-glycosylation site, but rather an SNP-specific targeting by miR-671-3p and miR-657. Bioinformatics analysis identified 41 SNP-specific miRNA targeting MMP-9 SNPs, mostly in the coding exon and an extension of the analysis to chromosome 20, where the MMP-9 gene is located, suggesting that SNP-specific miRNAs targeting the coding exon are prevalent. This selective post-transcriptional regulation of a target messenger RNA harboring genetic polymorphisms by miRNAs offers an SNP-dependent post-transcriptional regulatory mechanism, allowing for polymorphic-specific differential gene regulation. PMID:24627221
Single nucleotide polymorphisms typing of Mycobacterium leprae reveals focal transmission of leprosy in high endemic regions of India.

PubMed

Lavania, M; Jadhav, R S; Turankar, R P; Chaitanya, V S; Singh, M; Sengupta, U

2013-11-01

Earlier studies indicate that genotyping of Mycobaterium leprae based on single-nucleotide polymorphisms (SNPs) is useful for analysis of the global spread of leprosy. In the present study, we investigated the diversity of M. leprae at eight SNP loci using 180 clinical isolates obtained from patients with leprosy residing mainly in Delhi and Purulia (West Bengal) regions. It was observed that the frequency of SNP type 1 and subtype D was most predominant in the Indian population. Further, the SNP type 2 subtype E was noted only from East Delhi region and SNP type 2 subtype G was noted only from the nearby areas of Hoogly district of West Bengal. These results indicate the occurrence of focal transmission of M. leprae infection and demonstrate that analysis by SNP typing has great potential to help researchers in understanding the transmission of M. leprae infection in the community. © 2013 The Authors Clinical Microbiology and Infection © 2013 European Society of Clinical Microbiology and Infectious Diseases.
Impact of a Panel of 88 Single Nucleotide Polymorphisms on the Risk of Breast Cancer in High-Risk Women: Results From Two Randomized Tamoxifen Prevention Trials.

PubMed

Cuzick, Jack; Brentnall, Adam R; Segal, Corrinne; Byers, Helen; Reuter, Caroline; Detre, Simone; Lopez-Knowles, Elena; Sestak, Ivana; Howell, Anthony; Powles, Trevor J; Newman, William G; Dowsett, Mitchell

2017-03-01

Purpose At least 94 common single nucleotide polymorphisms (SNPs) are associated with breast cancer. The extent to which an SNP panel can refine risk in women who receive preventive therapy has not been directly assessed previously. Materials and Methods A risk score on the basis of 88 SNPs (SNP88) was investigated in a nested case-control study of women enrolled in the International Breast Intervention Study (IBIS-I) or the Royal Marsden study. A total of 359 women who developed cancer were matched to 636 controls by age, trial, follow-up time, and treatment arm. Genotyping was done using the OncoArray. Conditional logistic regression and matched concordance indices (mC) were used to measure the performance of SNP88 alone and with other breast cancer risk factors assessed using the Tyrer-Cuzick (TC) model. Results SNP88 was predictive of breast cancer risk overall (interquartile range odds ratio [IQ-OR], 1.37; 95% CI, 1.14 to 1.66; mC, 0.55), but mainly for estrogen receptor-positive disease (IQ-OR, 1.44; 95% CI, 1.16 to 1.79; P for heterogeneity = .10) versus estrogen receptor-negative disease. However, the observed risk of SNP88 was only 46% (95% CI, 19% to 74%) of expected. No significant interaction was observed with treatment arm (placebo IQ-OR, 1.46; 95% CI, 1.13 to 1.87; tamoxifen IQ-OR, 1.25; 95% CI, 0.96 to 1.64; P for heterogeneity = .5). The predictive power was similar to the TC model (IQ-OR, 1.45; 95% CI, 1.21 to 1.73; mC, 0.55), but SNP88 was independent of TC (Spearman rank-order correlation, 0.012; P = .7), and when combined multiplicatively, a substantial improvement was seen (IQ-OR, 1.64; 95% CI, 1.36 to 1.97; mC, 0.60). Conclusion A polygenic risk score may be used to refine risk from the TC or similar models in women who are at an elevated risk of breast cancer and considering preventive therapy. Recalibration may be necessary for accurate risk assessment.
Tumor necrosis factor (TNF)-α -308G/A (rs1800629) polymorphism distribution in North India and its association with pemphigus: Case-control study and meta-analysis.

PubMed

Dar, Sajad Ahmad; Akhter, Naseem; Haque, Shafiul; Singh, Taru; Mandal, Raju Kumar; Ramachandran, Vishnampettai Ganapathysubramanian; Bhattacharya, Sambit Nath; Banerjee, Basu Dev; Das, Shukla

2016-01-01

Pemphigus is an autoimmune blistering disorder of skin and/or mucosal surfaces characterized by intraepithelial lesions and immunoglobulin-G autoantibodies against desmogleins (proteins critical in cell-to-cell adhesion). Genetic, immunological, hormonal, and environmental factors are known to contribute to its etiology. Tumor necrosis factor-alpha (TNF-α) which plays a key role in pathogenesis of many infectious and inflammatory diseases has been found in high levels in lesional skin and sera of pemphigus patients. However, studies on association of single nucleotide polymorphism (SNP) in promoter region of TNF-α at position -308 affecting G to A transition with pemphigus has been scarce. This study was conducted to evaluate the TNF-α -308G/A SNP distribution in North Indian cohort, and to define the association between the TNF-α -308G/A SNP distribution and pemphigus, globally, by means of meta-analysis. TNF-α -308G/A SNP in pemphigus patients was investigated by cytokine genotyping using genomic DNA by PCR with sequence-specific primers. Meta-analysis of the data, including four previously published studies from other populations, was performed to generate a meaningful relationship. The results of our case-control study indicate non-significant differences between patients and controls in TNF-α -308G/A SNP. The meta-analysis also revealed that TNF-α -308G/A SNP is not associated with pemphigus risk in population at large; however, it may be contributing towards autoimmune phenomenon in pemphigus by being a part of its multi-factorial etiology. This study provides evidence that the TNF-α -308G/A polymorphism is not associated with overall pemphigus susceptibility. Nevertheless, further studies on specific ethnicity and pemphigus variants are necessary to validate the findings.
A Genome-Wide Breast Cancer Scan in African Americans

DTIC Science & Technology

2010-06-01

SNPs from the African American breast cancer scan to COGs , a European collaborative study which is has designed a SNP array with that will be genotyped...Award Number: W81XWH-08-1-0383 TITLE: A Genome-wide Breast Cancer Scan in African Americans PRINCIPAL INVESTIGATOR: Christopher A...SUBTITLE A Genome-wide Breast Cancer Scan in African Americans 5a. CONTRACT NUMBER 5b. GRANT NUMBER W81XWH-08-1-0383 5c. PROGRAM
Genome-wide association implicates numerous genes and pleiotropy underlying ecological trait variation in natural populations of Populus trichocarpa

DOE Office of Scientific and Technical Information (OSTI.GOV)

McKown, Athena; Klapste, Jaroslav; Guy, Robert

2014-01-01

To uncover the genetic basis of phenotypic trait variation, we used 448 unrelated wild accessions of black cottonwood (Populus trichocarpa Torr. & Gray) from natural populations throughout western North America. Extensive information from large-scale trait phenotyping (with spatial and temporal replications within a common garden) and genotyping (with a 34K Populus SNP array) of all accessions were used for gene discovery in a genome-wide association study (GWAS).
Whole-exome sequencing for RH genotyping and alloimmunization risk in children with sickle cell anemia

PubMed Central

Flanagan, Jonathan M.; Vege, Sunitha; Luban, Naomi L. C.; Brown, R. Clark; Ware, Russell E.; Westhoff, Connie M.

2017-01-01

RH genes are highly polymorphic and encode the most complex of the 35 human blood group systems. This genetic diversity contributes to Rh alloimmunization in patients with sickle cell anemia (SCA) and is not avoided by serologic Rh-matched red cell transfusions. Standard serologic testing does not distinguish variant Rh antigens. Single nucleotide polymorphism (SNP)–based DNA arrays detect many RHD and RHCE variants, but the number of alleles tested is limited. We explored a next-generation sequencing (NGS) approach using whole-exome sequencing (WES) in 27 Rh alloimmunized and 27 matched non-alloimmunized patients with SCA who received chronic red cell transfusions and were enrolled in a multicenter study. We demonstrate that WES provides a comprehensive RH genotype, identifies SNPs not interrogated by DNA array, and accurately determines RHD zygosity. Among this multicenter cohort, we demonstrate an association between an altered RH genotype and Rh alloimmunization: 52% of Rh immunized vs 19% of non-immunized patients expressed variant Rh without co-expression of the conventional protein. Our findings suggest that RH allele variation in patients with SCA is clinically relevant, and NGS technology can offer a comprehensive alternative to targeted SNP-based testing. This is particularly relevant as NGS data becomes more widely available and could provide the means for reducing Rh alloimmunization in children with SCA. PMID:29296782
Genome-wide association study of CNVs in 16,000 cases of eight common diseases and 3,000 shared controls.

PubMed

Craddock, Nick; Hurles, Matthew E; Cardin, Niall; Pearson, Richard D; Plagnol, Vincent; Robson, Samuel; Vukcevic, Damjan; Barnes, Chris; Conrad, Donald F; Giannoulatou, Eleni; Holmes, Chris; Marchini, Jonathan L; Stirrups, Kathy; Tobin, Martin D; Wain, Louise V; Yau, Chris; Aerts, Jan; Ahmad, Tariq; Andrews, T Daniel; Arbury, Hazel; Attwood, Anthony; Auton, Adam; Ball, Stephen G; Balmforth, Anthony J; Barrett, Jeffrey C; Barroso, Inês; Barton, Anne; Bennett, Amanda J; Bhaskar, Sanjeev; Blaszczyk, Katarzyna; Bowes, John; Brand, Oliver J; Braund, Peter S; Bredin, Francesca; Breen, Gerome; Brown, Morris J; Bruce, Ian N; Bull, Jaswinder; Burren, Oliver S; Burton, John; Byrnes, Jake; Caesar, Sian; Clee, Chris M; Coffey, Alison J; Connell, John M C; Cooper, Jason D; Dominiczak, Anna F; Downes, Kate; Drummond, Hazel E; Dudakia, Darshna; Dunham, Andrew; Ebbs, Bernadette; Eccles, Diana; Edkins, Sarah; Edwards, Cathryn; Elliot, Anna; Emery, Paul; Evans, David M; Evans, Gareth; Eyre, Steve; Farmer, Anne; Ferrier, I Nicol; Feuk, Lars; Fitzgerald, Tomas; Flynn, Edward; Forbes, Alistair; Forty, Liz; Franklyn, Jayne A; Freathy, Rachel M; Gibbs, Polly; Gilbert, Paul; Gokumen, Omer; Gordon-Smith, Katherine; Gray, Emma; Green, Elaine; Groves, Chris J; Grozeva, Detelina; Gwilliam, Rhian; Hall, Anita; Hammond, Naomi; Hardy, Matt; Harrison, Pile; Hassanali, Neelam; Hebaishi, Husam; Hines, Sarah; Hinks, Anne; Hitman, Graham A; Hocking, Lynne; Howard, Eleanor; Howard, Philip; Howson, Joanna M M; Hughes, Debbie; Hunt, Sarah; Isaacs, John D; Jain, Mahim; Jewell, Derek P; Johnson, Toby; Jolley, Jennifer D; Jones, Ian R; Jones, Lisa A; Kirov, George; Langford, Cordelia F; Lango-Allen, Hana; Lathrop, G Mark; Lee, James; Lee, Kate L; Lees, Charlie; Lewis, Kevin; Lindgren, Cecilia M; Maisuria-Armer, Meeta; Maller, Julian; Mansfield, John; Martin, Paul; Massey, Dunecan C O; McArdle, Wendy L; McGuffin, Peter; McLay, Kirsten E; Mentzer, Alex; Mimmack, Michael L; Morgan, Ann E; Morris, Andrew P; Mowat, Craig; Myers, Simon; Newman, William; Nimmo, Elaine R; O'Donovan, Michael C; Onipinla, Abiodun; Onyiah, Ifejinelo; Ovington, Nigel R; Owen, Michael J; Palin, Kimmo; Parnell, Kirstie; Pernet, David; Perry, John R B; Phillips, Anne; Pinto, Dalila; Prescott, Natalie J; Prokopenko, Inga; Quail, Michael A; Rafelt, Suzanne; Rayner, Nigel W; Redon, Richard; Reid, David M; Renwick; Ring, Susan M; Robertson, Neil; Russell, Ellie; St Clair, David; Sambrook, Jennifer G; Sanderson, Jeremy D; Schuilenburg, Helen; Scott, Carol E; Scott, Richard; Seal, Sheila; Shaw-Hawkins, Sue; Shields, Beverley M; Simmonds, Matthew J; Smyth, Debbie J; Somaskantharajah, Elilan; Spanova, Katarina; Steer, Sophia; Stephens, Jonathan; Stevens, Helen E; Stone, Millicent A; Su, Zhan; Symmons, Deborah P M; Thompson, John R; Thomson, Wendy; Travers, Mary E; Turnbull, Clare; Valsesia, Armand; Walker, Mark; Walker, Neil M; Wallace, Chris; Warren-Perry, Margaret; Watkins, Nicholas A; Webster, John; Weedon, Michael N; Wilson, Anthony G; Woodburn, Matthew; Wordsworth, B Paul; Young, Allan H; Zeggini, Eleftheria; Carter, Nigel P; Frayling, Timothy M; Lee, Charles; McVean, Gil; Munroe, Patricia B; Palotie, Aarno; Sawcer, Stephen J; Scherer, Stephen W; Strachan, David P; Tyler-Smith, Chris; Brown, Matthew A; Burton, Paul R; Caulfield, Mark J; Compston, Alastair; Farrall, Martin; Gough, Stephen C L; Hall, Alistair S; Hattersley, Andrew T; Hill, Adrian V S; Mathew, Christopher G; Pembrey, Marcus; Satsangi, Jack; Stratton, Michael R; Worthington, Jane; Deloukas, Panos; Duncanson, Audrey; Kwiatkowski, Dominic P; McCarthy, Mark I; Ouwehand, Willem; Parkes, Miles; Rahman, Nazneen; Todd, John A; Samani, Nilesh J; Donnelly, Peter

2010-04-01

Copy number variants (CNVs) account for a major proportion of human genetic polymorphism and have been predicted to have an important role in genetic susceptibility to common disease. To address this we undertook a large, direct genome-wide study of association between CNVs and eight common human diseases. Using a purpose-designed array we typed approximately 19,000 individuals into distinct copy-number classes at 3,432 polymorphic CNVs, including an estimated approximately 50% of all common CNVs larger than 500 base pairs. We identified several biological artefacts that lead to false-positive associations, including systematic CNV differences between DNAs derived from blood and cell lines. Association testing and follow-up replication analyses confirmed three loci where CNVs were associated with disease-IRGM for Crohn's disease, HLA for Crohn's disease, rheumatoid arthritis and type 1 diabetes, and TSPAN8 for type 2 diabetes-although in each case the locus had previously been identified in single nucleotide polymorphism (SNP)-based studies, reflecting our observation that most common CNVs that are well-typed on our array are well tagged by SNPs and so have been indirectly explored through SNP studies. We conclude that common CNVs that can be typed on existing platforms are unlikely to contribute greatly to the genetic basis of common human diseases.
Single-Nucleotide Polymorphism Array-Based Karyotyping of Acute Promyelocytic Leukemia

PubMed Central

Gómez-Seguí, Inés; Sánchez-Izquierdo, Dolors; Barragán, Eva; Such, Esperanza; Luna, Irene; López-Pavía, María; Ibáñez, Mariam; Villamón, Eva; Alonso, Carmen; Martín, Iván; Llop, Marta; Dolz, Sandra; Fuster, Óscar; Montesinos, Pau; Cañigral, Carolina; Boluda, Blanca; Salazar, Claudia

2014-01-01

Acute promyelocytic leukemia (APL) is characterized by the t(15;17)(q22;q21), but additional chromosomal abnormalities (ACA) and other rearrangements can contribute in the development of the whole leukemic phenotype. We hypothesized that some ACA not detected by conventional techniques may be informative of the onset of APL. We performed the high-resolution SNP array (SNP-A) 6.0 (Affymetrix) in 48 patients diagnosed with APL on matched diagnosis and remission sample. Forty-six abnormalities were found as an acquired event in 23 patients (48%): 22 duplications, 23 deletions and 1 Copy-Neutral Loss of Heterozygocity (CN-LOH), being a duplication of 8(q24) (23%) and a deletion of 7(q33-qter) (6%) the most frequent copy-number abnormalities (CNA). Four patients (8%) showed CNAs adjacent to the breakpoints of the translocation. We compared our results with other APL series and found that, except for dup(8q24) and del(7q33-qter), ACA were infrequent (≤3%) but most of them recurrent (70%). Interestingly, having CNA or FLT3 mutation were mutually exclusive events. Neither the number of CNA, nor any specific CNA was associated significantly with prognosis. This study has delineated recurrent abnormalities in addition to t(15;17) that may act as secondary events and could explain leukemogenesis in up to 40% of APL cases with no ACA by conventional cytogenetics. PMID:24959826
Single-nucleotide polymorphism array-based karyotyping of acute promyelocytic leukemia.

PubMed

Gómez-Seguí, Inés; Sánchez-Izquierdo, Dolors; Barragán, Eva; Such, Esperanza; Luna, Irene; López-Pavía, María; Ibáñez, Mariam; Villamón, Eva; Alonso, Carmen; Martín, Iván; Llop, Marta; Dolz, Sandra; Fuster, Oscar; Montesinos, Pau; Cañigral, Carolina; Boluda, Blanca; Salazar, Claudia; Cervera, Jose; Sanz, Miguel A

2014-01-01

Acute promyelocytic leukemia (APL) is characterized by the t(15;17)(q22;q21), but additional chromosomal abnormalities (ACA) and other rearrangements can contribute in the development of the whole leukemic phenotype. We hypothesized that some ACA not detected by conventional techniques may be informative of the onset of APL. We performed the high-resolution SNP array (SNP-A) 6.0 (Affymetrix) in 48 patients diagnosed with APL on matched diagnosis and remission sample. Forty-six abnormalities were found as an acquired event in 23 patients (48%): 22 duplications, 23 deletions and 1 Copy-Neutral Loss of Heterozygocity (CN-LOH), being a duplication of 8(q24) (23%) and a deletion of 7(q33-qter) (6%) the most frequent copy-number abnormalities (CNA). Four patients (8%) showed CNAs adjacent to the breakpoints of the translocation. We compared our results with other APL series and found that, except for dup(8q24) and del(7q33-qter), ACA were infrequent (≤3%) but most of them recurrent (70%). Interestingly, having CNA or FLT3 mutation were mutually exclusive events. Neither the number of CNA, nor any specific CNA was associated significantly with prognosis. This study has delineated recurrent abnormalities in addition to t(15;17) that may act as secondary events and could explain leukemogenesis in up to 40% of APL cases with no ACA by conventional cytogenetics.
A High-Resolution SNP Array-Based Linkage Map Anchors a New Domestic Cat Draft Genome Assembly and Provides Detailed Patterns of Recombination.

PubMed

Li, Gang; Hillier, LaDeana W; Grahn, Robert A; Zimin, Aleksey V; David, Victor A; Menotti-Raymond, Marilyn; Middleton, Rondo; Hannah, Steven; Hendrickson, Sher; Makunin, Alex; O'Brien, Stephen J; Minx, Pat; Wilson, Richard K; Lyons, Leslie A; Warren, Wesley C; Murphy, William J

2016-06-01

High-resolution genetic and physical maps are invaluable tools for building accurate genome assemblies, and interpreting results of genome-wide association studies (GWAS). Previous genetic and physical maps anchored good quality draft assemblies of the domestic cat genome, enabling the discovery of numerous genes underlying hereditary disease and phenotypes of interest to the biomedical science and breeding communities. However, these maps lacked sufficient marker density to order thousands of shorter scaffolds in earlier assemblies, which instead relied heavily on comparative mapping with related species. A high-resolution map would aid in validating and ordering chromosome scaffolds from existing and new genome assemblies. Here, we describe a high-resolution genetic linkage map of the domestic cat genome based on genotyping 453 domestic cats from several multi-generational pedigrees on the Illumina 63K SNP array. The final maps include 58,055 SNP markers placed relative to 6637 markers with unique positions, distributed across all autosomes and the X chromosome. Our final sex-averaged maps span a total autosomal length of 4464 cM, the longest described linkage map for any mammal, confirming length estimates from a previous microsatellite-based map. The linkage map was used to order and orient the scaffolds from a substantially more contiguous domestic cat genome assembly (Felis catus v8.0), which incorporated ∼20 × coverage of Illumina fragment reads. The new genome assembly shows substantial improvements in contiguity, with a nearly fourfold increase in N50 scaffold size to 18 Mb. We use this map to report probable structural errors in previous maps and assemblies, and to describe features of the recombination landscape, including a massive (∼50 Mb) recombination desert (of virtually zero recombination) on the X chromosome that parallels a similar desert on the porcine X chromosome in both size and physical location. Copyright © 2016 Li et al.

A comparison of restriction fragment length polymorphism, tetra primer amplification refractory mutation system PCR and unlabeled probe melting analysis for LTA+252 C>T SNP genotyping.

PubMed

Soler, Stephan; Rittore, Cécile; Touitou, Isabelle; Philibert, Laurent

2011-02-20

From the wide range of methods currently available for genotyping, we wished to identify a quick, reliable and affordable approach for routine use in our laboratory for LTA+252 C>T SNP screening. We set up and compared three genotyping methods for SNP detection: restriction fragment length polymorphism (RFLP), tetra primer amplification refractory mutation system PCR (TPAP) and unlabeled probe melting analysis (UPMA). The SNP model used was LTA+252 C>T, a cytokine gene polymorphism that has been associated with response to treatment in rheumatoid arthritis. The study was performed using 46 samples from healthy Caucasian volunteers. Allele and genotype distribution was similar to that previously described in the same population. All three genotyping methods showed good reproducibility and are suitable for a medium scale throughput molecular platform. UPMA was the most cost effective, reliable and safe method since it required the shortest technician time, could be performed in a single closed tube and involved automatic data analysis. This work is the first to compare these three genotyping techniques and provides evidence for UPMA being the method of choice for LTA+252 C>T SNP genotyping. Copyright Â© 2010 Elsevier B.V. All rights reserved.
Variation at the NFATC2 Locus Increases the Risk of Thiazolidinedione-Induced Edema in the Diabetes REduction Assessment with ramipril and rosiglitazone Medication (DREAM) Study

PubMed Central

Bailey, Swneke D.; Xie, Changchun; Do, Ron; Montpetit, Alexandre; Diaz, Rafael; Mohan, Viswanathan; Keavney, Bernard; Yusuf, Salim; Gerstein, Hertzel C.; Engert, James C.; Anand, Sonia

2010-01-01

OBJECTIVE Thiazolidinediones are used to treat type 2 diabetes. Their use has been associated with peripheral edema and congestive heart failure—outcomes that may have a genetic etiology. RESEARCH DESIGN AND METHODS We genotyped 4,197 participants of the multiethnic DREAM (Diabetes REduction Assessment with ramipril and rosiglitazone Medication) trial with a 50k single nucleotide polymorphisms (SNP) array, which captures ∼2000 cardiovascular, inflammatory, and metabolic genes. We tested 32,088 SNPs for an association with edema among Europeans who received rosiglitazone (n = 965). RESULTS One SNP, rs6123045, in NFATC2 was significantly associated with edema (odds ratio 1.89 [95% CI 1.47–2.42]; P = 5.32 × 10−7, corrected P = 0.017). Homozygous individuals had the highest edema rate (hazard ratio 2.89, P = 4.22 × 10−4) when compared with individuals homozygous for the protective allele, with heterozygous individuals having an intermediate risk. The interaction between the SNP and rosiglitazone for edema was significant (P = 7.68 × 10−3). Six SNPs in NFATC2 were significant in both Europeans and Latin Americans (P < 0.05). CONCLUSIONS Genetic variation at the NFATC2 locus contributes to edema among individuals who receive rosiglitazone. PMID:20628086
High-throughput SNP discovery and transcriptome expression profiles from the salmon louse Caligus rogercresseyi (Copepoda: Caligidae).

PubMed

Nuñez-Acuña, Gustavo; Valenzuela-Muñoz, Valentina; Gallardo-Escárate, Cristian

2014-06-01

The salmon louse Caligus rogercresseyi is the dominant ectoparasite species affecting the salmon aquaculture industry in the Southern hemisphere, and it is currently the main cause for economic losses in Chilean aquaculture. However, despite the great concern over Caligus infestations, genomic information on this louse is still scarce, even while the need to develop high-resolution molecular markers is growing. This study provides the first deep transcriptome survey to identify thousands of SNP markers from C. rogercresseyi, with a total of 69,466 SNPs identified using the MiSeq platform (Illumina®), 30,605 (52%) of which were found in contigs successfully annotated against known protein databases. Furthermore, in silico gene expression profiles associated with SNP variants were evaluated, and the results evidenced a wide array of genes that were down- and upregulated throughout the developmental stages of C. rogercresseyi. Interestingly, putative KEGG pathways involved in resistance to antiparasitic agents were also identified, where ten pathways were associated with the nervous system and one was related to ABC transporters. Taken together, this information could be highly useful for investigating the molecular underpinnings involved in the susceptibility or resistance of salmon lice to chemical treatments. Copyright © 2014 Elsevier Inc. All rights reserved.
High-Density SNP Map Construction and QTL Identification for the Apetalous Character in Brassica napus L.

PubMed Central

Wang, Xiaodong; Yu, Kunjiang; Li, Hongge; Peng, Qi; Chen, Feng; Zhang, Wei; Chen, Song; Hu, Maolong; Zhang, Jiefu

2015-01-01

The apetalous genotype is a morphological ideotype for increasing seed yield and should be of considerable agricultural use; however, only a few studies have focused on the genetic control of this trait in Brassica napus. In the present study, a recombinant inbred line, the AH population, containing 189 individuals was derived from a cross between an apetalous line ‘APL01’ and a normally petalled variety ‘Holly’. The Brassica 60 K Infinium BeadChip Array harboring 52,157 single nucleotide polymorphism (SNP) markers was used to genotype the AH individuals. A high-density genetic linkage map was constructed based on 2,755 bins involving 11,458 SNPs and 57 simple sequence repeats, and was used to identify loci associated with petalous degree (PDgr). The linkage map covered 2,027.53 cM, with an average marker interval of 0.72 cM. The AH map had good collinearity with the B. napus reference genome, indicating its high quality and accuracy. After phenotypic analyses across five different experiments, a total of 19 identified quantitative trait loci (QTLs) distributed across chromosomes A3, A5, A6, A9 and C8 were obtained, and these QTLs were further integrated into nine consensus QTLs by a meta-analysis. Interestingly, the major QTL qPD.C8-2 was consistently detected in all five experiments, and qPD.A9-2 and qPD.C8-3 were stably expressed in four experiments. Comparative mapping between the AH map and the B. napus reference genome suggested that there were 328 genes underlying the confidence intervals of the three steady QTLs. Based on the Gene Ontology assignments of 52 genes to the regulation of floral development in published studies, 146 genes were considered as potential candidate genes for PDgr. The current study carried out a QTL analysis for PDgr using a high-density SNP map in B. napus, providing novel targets for improving seed yield. These results advanced our understanding of the genetic control of PDgr regulation in B. napus. PMID:26779193
High-Density SNP Map Construction and QTL Identification for the Apetalous Character in Brassica napus L.

PubMed

Wang, Xiaodong; Yu, Kunjiang; Li, Hongge; Peng, Qi; Chen, Feng; Zhang, Wei; Chen, Song; Hu, Maolong; Zhang, Jiefu

2015-01-01

The apetalous genotype is a morphological ideotype for increasing seed yield and should be of considerable agricultural use; however, only a few studies have focused on the genetic control of this trait in Brassica napus. In the present study, a recombinant inbred line, the AH population, containing 189 individuals was derived from a cross between an apetalous line 'APL01' and a normally petalled variety 'Holly'. The Brassica 60 K Infinium BeadChip Array harboring 52,157 single nucleotide polymorphism (SNP) markers was used to genotype the AH individuals. A high-density genetic linkage map was constructed based on 2,755 bins involving 11,458 SNPs and 57 simple sequence repeats, and was used to identify loci associated with petalous degree (PDgr). The linkage map covered 2,027.53 cM, with an average marker interval of 0.72 cM. The AH map had good collinearity with the B. napus reference genome, indicating its high quality and accuracy. After phenotypic analyses across five different experiments, a total of 19 identified quantitative trait loci (QTLs) distributed across chromosomes A3, A5, A6, A9 and C8 were obtained, and these QTLs were further integrated into nine consensus QTLs by a meta-analysis. Interestingly, the major QTL qPD.C8-2 was consistently detected in all five experiments, and qPD.A9-2 and qPD.C8-3 were stably expressed in four experiments. Comparative mapping between the AH map and the B. napus reference genome suggested that there were 328 genes underlying the confidence intervals of the three steady QTLs. Based on the Gene Ontology assignments of 52 genes to the regulation of floral development in published studies, 146 genes were considered as potential candidate genes for PDgr. The current study carried out a QTL analysis for PDgr using a high-density SNP map in B. napus, providing novel targets for improving seed yield. These results advanced our understanding of the genetic control of PDgr regulation in B. napus.
Association analysis and identification of SNP markers for Stemphylium leaf spot (Stemphylium botryosum f. sp. spinacia) resistance in spinach (Spinacia oleracea)

USDA-ARS?s Scientific Manuscript database

Stemphylium leaf spot, caused by Stemphylium botryosum f. sp. spinacia is an important disease in spinach. Use of genetic resistance is an efficient, economic and environment-friendly method to control this disease. The objective of this research was to conduct association analysis and identify SNP ...
Combination of polymorphisms within the HDAC1 and HDAC3 gene predict tumor recurrence in hepatocellular carcinoma patients that have undergone transplant therapy.

PubMed

Yang, Zhe; Zhou, Lin; Wu, Li-Ming; Xie, Hai-Yang; Zhang, Feng; Zheng, Shu-Sen

2010-12-01

Histone deacetylases (HDACs) have been reported to be poor prognostic indicators in patients with cancer. However, no data are available for the role of single nucleotide polymorphism (SNP) of class I HDAC in hepato-cellular carcinoma (HCC). Therefore, we investigated the association of class I HDAC isoforms genomic polymorphisms with risk of HCC and tumor recurrence following liver transplantation (LT). One hundred and ninety-six Chinese subjects consisting of 97 HCC patients and 99 controls were enrolled in this study. Nine polymorphisms of the HDAC1, HDAC2, and HDAC3 gene (rs2530223, rs1741981, rs2547547, rs13204445, rs6568819, rs10499080, rs11741808, rs2475631, rs11391) were examined using Applied Biosystems SNaP-Shot and TaqMan technology. We found no significant difference in genotype frequencies between the HCC cases and controls. In terms of tumor recurrence following LT, patients carrying the T allele of HDAC1 SNP rs1741981 showed a favorable outcome for recurrence free survival when compared with patients homozygous for CC. In addition, the same significant trend was observed in HDAC3 SNP rs2547547. Kaplan-Meier analysis showed that the combination of the T variant allele (CT+TT) of HDAC1 SNP rs1741981 and the homozygous TT variant allele of HDAC3 SNP rs2547547 was the most favorable prognostic factor. The risk for postoperative tumor recurrence was about 2.2-fold lower for patients with this genotype combination compared with carriers of the HDAC1 SNP rs1741981 CC and HDAC3 SNP rs2547547 CT genotype combination (hazard ratio: 2.235, p=0.003). Our data suggest that combined analysis of HDAC1 SNP rs1741981 and HDAC3 SNP rs2547547 may be a potential genetic marker for HCC recurrence in LT patients.
Ultrasoft x-ray imaging system for the National Spherical Torus Experiment

NASA Astrophysics Data System (ADS)

Stutman, D.; Finkenthal, M.; Soukhanovskii, V.; May, M. J.; Moos, H. W.; Kaita, R.

1999-01-01

A spectrally resolved ultrasoft x-ray imaging system, consisting of arrays of high resolution (<2 Å) and throughput (⩾tens of kHz) miniature monochromators, and based on multilayer mirrors and absolute photodiodes, is being designed for the National Spherical Torus Experiment. Initially, three poloidal arrays of diodes filtered for C 1s-np emission will be implemented for fast tomographic imaging of the colder start-up plasmas. Later on, mirrors tuned to the C Lyα emission will be added in order to enable the arrays to "see" the periphery through the hot core and to study magnetohydrodynamic activity and impurity transport in this region. We also discuss possible core diagnostics, based on tomographic imaging of the Lyα emission from the plume of recombined, low Z impurity ions left by neutral beams or fueling pellets. The arrays can also be used for radiated power measurements and to map the distribution of high Z impurities injected for transport studies. The performance of the proposed system is illustrated with results from test channels on the CDX-U spherical torus at Princeton Plasma Physics Laboratory.
Genome-wide association study in discordant sibships identifies multiple inherited susceptibility alleles linked to lung cancer.

PubMed

Galvan, Antonella; Falvella, Felicia S; Frullanti, Elisa; Spinola, Monica; Incarbone, Matteo; Nosotti, Mario; Santambrogio, Luigi; Conti, Barbara; Pastorino, Ugo; Gonzalez-Neira, Anna; Dragani, Tommaso A

2010-03-01

We analyzed a series of young (median age = 52 years) non-smoker lung cancer patients and their unaffected siblings as controls, using a genome-wide 620 901 single-nucleotide polymorphism (SNP) array analysis and a case-control DNA pooling approach. We identified 82 putatively associated SNPs that were retested by individual genotyping followed by use of the sib transmission disequilibrium test, pointing to 36 SNPs associated with lung cancer risk in the discordant sibs series. Analysis of these 36 SNPs in a polygenic model characterized by additive and interchangeable effects of rare alleles revealed a highly statistically significant dosage-dependent association between risk allele carrier status and proportion of cancer cases. Replication of the same 36 SNPs in a population-based series confirmed the association with lung cancer for three SNPs, suggesting that phenocopies and genetic heterogeneity can play a major role in the complex genetics of lung cancer risk in the general population.
Genome-wide Target Enrichment-aided Chip Design: a 66 K SNP Chip for Cashmere Goat.

PubMed

Qiao, Xian; Su, Rui; Wang, Yang; Wang, Ruijun; Yang, Ting; Li, Xiaokai; Chen, Wei; He, Shiyang; Jiang, Yu; Xu, Qiwu; Wan, Wenting; Zhang, Yaolei; Zhang, Wenguang; Chen, Jiang; Liu, Bin; Liu, Xin; Fan, Yixing; Chen, Duoyuan; Jiang, Huaizhi; Fang, Dongming; Liu, Zhihong; Wang, Xiaowen; Zhang, Yanjun; Mao, Danqing; Wang, Zhiying; Di, Ran; Zhao, Qianjun; Zhong, Tao; Yang, Huanming; Wang, Jian; Wang, Wen; Dong, Yang; Chen, Xiaoli; Xu, Xun; Li, Jinquan

2017-08-17

Compared with the commercially available single nucleotide polymorphism (SNP) chip based on the Bead Chip technology, the solution hybrid selection (SHS)-based target enrichment SNP chip is not only design-flexible, but also cost-effective for genotype sequencing. In this study, we propose to design an animal SNP chip using the SHS-based target enrichment strategy for the first time. As an update to the international collaboration on goat research, a 66 K SNP chip for cashmere goat was created from the whole-genome sequencing data of 73 individuals. Verification of this 66 K SNP chip with the whole-genome sequencing data of 436 cashmere goats showed that the SNP call rates was between 95.3% and 99.8%. The average sequencing depth for target SNPs were 40X. The capture regions were shown to be 200 bp that flank target SNPs. This chip was further tested in a genome-wide association analysis of cashmere fineness (fiber diameter). Several top hit loci were found marginally associated with signaling pathways involved in hair growth. These results demonstrate that the 66 K SNP chip is a useful tool in the genomic analyses of cashmere goats. The successful chip design shows that the SHS-based target enrichment strategy could be applied to SNP chip design in other species.
Preselection statistics and Random Forest classification identify population informative single nucleotide polymorphisms in cosmopolitan and autochthonous cattle breeds.

PubMed

Bertolini, F; Galimberti, G; Schiavo, G; Mastrangelo, S; Di Gerlando, R; Strillacci, M G; Bagnato, A; Portolano, B; Fontanesi, L

2018-01-01

Commercial single nucleotide polymorphism (SNP) arrays have been recently developed for several species and can be used to identify informative markers to differentiate breeds or populations for several downstream applications. To identify the most discriminating genetic markers among thousands of genotyped SNPs, a few statistical approaches have been proposed. In this work, we compared several methods of SNPs preselection (Delta, F st and principal component analyses (PCA)) in addition to Random Forest classifications to analyse SNP data from six dairy cattle breeds, including cosmopolitan (Holstein, Brown and Simmental) and autochthonous Italian breeds raised in two different regions and subjected to limited or no breeding programmes (Cinisara, Modicana, raised only in Sicily and Reggiana, raised only in Emilia Romagna). From these classifications, two panels of 96 and 48 SNPs that contain the most discriminant SNPs were created for each preselection method. These panels were evaluated in terms of the ability to discriminate as a whole and breed-by-breed, as well as linkage disequilibrium within each panel. The obtained results showed that for the 48-SNP panel, the error rate increased mainly for autochthonous breeds, probably as a consequence of their admixed origin lower selection pressure and by ascertaining bias in the construction of the SNP chip. The 96-SNP panels were generally more able to discriminate all breeds. The panel derived by PCA-chrom (obtained by a preselection chromosome by chromosome) could identify informative SNPs that were particularly useful for the assignment of minor breeds that reached the lowest value of Out Of Bag error even in the Cinisara, whose value was quite high in all other panels. Moreover, this panel contained also the lowest number of SNPs in linkage disequilibrium. Several selected SNPs are located nearby genes affecting breed-specific phenotypic traits (coat colour and stature) or associated with production traits. In general, our results demonstrated the usefulness of Random Forest in combination to other reduction techniques to identify population informative SNPs.
A SNP resource for Douglas-fir: de novo transcriptome assembly and SNP detection and validation

PubMed Central

2013-01-01

Background Douglas-fir (Pseudotsuga menziesii), one of the most economically and ecologically important tree species in the world, also has one of the largest tree breeding programs. Although the coastal and interior varieties of Douglas-fir (vars. menziesii and glauca) are native to North America, the coastal variety is also widely planted for timber production in Europe, New Zealand, Australia, and Chile. Our main goal was to develop a SNP resource large enough to facilitate genomic selection in Douglas-fir breeding programs. To accomplish this, we developed a 454-based reference transcriptome for coastal Douglas-fir, annotated and evaluated the quality of the reference, identified putative SNPs, and then validated a sample of those SNPs using the Illumina Infinium genotyping platform. Results We assembled a reference transcriptome consisting of 25,002 isogroups (unique gene models) and 102,623 singletons from 2.76 million 454 and Sanger cDNA sequences from coastal Douglas-fir. We identified 278,979 unique SNPs by mapping the 454 and Sanger sequences to the reference, and by mapping four datasets of Illumina cDNA sequences from multiple seed sources, genotypes, and tissues. The Illumina datasets represented coastal Douglas-fir (64.00 and 13.41 million reads), interior Douglas-fir (80.45 million reads), and a Yakima population similar to interior Douglas-fir (8.99 million reads). We assayed 8067 SNPs on 260 trees using an Illumina Infinium SNP genotyping array. Of these SNPs, 5847 (72.5%) were called successfully and were polymorphic. Conclusions Based on our validation efficiency, our SNP database may contain as many as ~200,000 true SNPs, and as many as ~69,000 SNPs that could be genotyped at ~20,000 gene loci using an Infinium II array—more SNPs than are needed to use genomic selection in tree breeding programs. Ultimately, these genomic resources will enhance Douglas-fir breeding and allow us to better understand landscape-scale patterns of genetic variation and potential responses to climate change. PMID:23445355
Genome-Wide Association Study for Susceptibility to and Recoverability From Mastitis in Danish Holstein Cows.

PubMed

Welderufael, B G; Løvendahl, Peter; de Koning, Dirk-Jan; Janss, Lucas L G; Fikse, W F

2018-01-01

Because mastitis is very frequent and unavoidable, adding recovery information into the analysis for genetic evaluation of mastitis is of great interest from economical and animal welfare point of view. Here we have performed genome-wide association studies (GWAS) to identify associated single nucleotide polymorphisms (SNPs) and investigate the genetic background not only for susceptibility to - but also for recoverability from mastitis. Somatic cell count records from 993 Danish Holstein cows genotyped for a total of 39378 autosomal SNP markers were used for the association analysis. Single SNP regression analysis was performed using the statistical software package DMU. Substitution effect of each SNP was tested with a t -test and a genome-wide significance level of P -value < 10 -4 was used to declare significant SNP-trait association. A number of significant SNP variants were identified for both traits. Many of the SNP variants associated either with susceptibility to - or recoverability from mastitis were located in or very near to genes that have been reported for their role in the immune system. Genes involved in lymphocyte developments (e.g., MAST3 and STAB2 ) and genes involved in macrophage recruitment and regulation of inflammations ( PDGFD and PTX3 ) were suggested as possible causal genes for susceptibility to - and recoverability from mastitis, respectively. However, this is the first GWAS study for recoverability from mastitis and our results need to be validated. The findings in the current study are, therefore, a starting point for further investigations in identifying causal genetic variants or chromosomal regions for both susceptibility to - and recoverability from mastitis.
Analysis of SNP rs16754 of WT1 gene in a series of de novo acute myeloid leukemia patients.

PubMed

Luna, Irene; Such, Esperanza; Cervera, Jose; Barragán, Eva; Jiménez-Velasco, Antonio; Dolz, Sandra; Ibáñez, Mariam; Gómez-Seguí, Inés; López-Pavía, María; Llop, Marta; Fuster, Óscar; Oltra, Silvestre; Moscardó, Federico; Martínez-Cuadrón, David; Senent, M Leonor; Gascón, Adriana; Montesinos, Pau; Martín, Guillermo; Bolufer, Pascual; Sanz, Miguel A

2012-12-01

The single nucleotide polymorphism (SNP) rs16754 of the WT1 gene has been previously described as a possible prognostic marker in normal karyotype acute myeloid leukemia (AML) patients. Nevertheless, the findings in this field are not always reproducible in different series. One hundred and seventy-five adult de novo AML patients were screened with two different methods for the detection of SNP rs16754: high-resolution melting (HRM) and FRET hybridization probes. Direct sequencing was used to validate both techniques. The SNP was detected in 52 out of 175 patients (30 %), both by HRM and hybridization probes. Direct sequencing confirmed that every positive sample in the screening methods had a variation in the DNA sequence. Patients with the wild-type genotype (WT1(AA)) for the SNP rs16754 were significantly younger than those with the heterozygous WT1(AG) genotype. No other difference was observed for baseline characteristic or outcome between patients with or without the SNP. Both techniques are equally reliable and reproducible as screening methods for the detection of the SNP rs16754, allowing for the selection of those samples that will need to be sequenced. We were unable to confirm the suggested favorable outcome of SNP rs16754 in de novo AML.
Activity study of biogenic spherical silver nanoparticles towards microbes and oxidants

NASA Astrophysics Data System (ADS)

Hoskote Anand, Kiran Kumar; Mandal, Badal Kumar

2015-01-01

The eco-friendly approach for the green synthesis of silver nanoparticles (SNP) using Terminalia bellirica (T. bellirica) fruit extract is reported herein. Initially formation of SNP was noticed through visual color change from yellow to reddish brown and further analyzed by surface plasmonic resonance (SPR) band at 429 nm using UV-Vis spectroscopy. Identification of different polyphenols present in T. bellirica extract was done using High Pressure Liquid Chromatography (HPLC). Aqueous T. bellirica extract contains high amount of gallic acid which is major secondary metabolite responsible for the reduction and stabilization process. It was established by analyses of extracts before and after reduction using HPLC. Formation of spherical SNP was characterized by Transmission Electron Microscopy (TEM) analysis. X-ray Diffraction (XRD) study revealed crystalline nature of SNP. Presence of different functional groups on the surface of SNP was evidenced by Fourier Transform Infrared Spectroscopy (FTIR) study. A plausible mechanism of reduction and stabilization processes involved in the synthesis of stable SNP was also explained based on HPLC and FTIR data. In addition, the synthesized SNP was tested for antibacterial and antioxidant activities. SNP showed good antimicrobial activity against both gram positive (S. aureus) and gram negative (E. coli) bacteria. It also showed good antioxidant activity compared to ascorbic acid as standard antioxidant by using standard DPPH method.
Familial Analysis of Epistatic and Sex-Dependent Association of Genes of the Renin-Angiotensin-Aldosterone System and Blood Pressure.

PubMed

Scurrah, Katrina J; Lamantia, Angela; Ellis, Justine A; Harrap, Stephen B

2017-06-01

Renin-angiotensin-aldosterone system genes have been inconsistently associated with blood pressure, possibly because of unrecognized influences of sex-dependent genetic effects or gene-gene interactions (epistasis). We tested association of systolic blood pressure with single-nucleotide polymorphisms (SNPs) at renin ( REN ), angiotensinogen ( AGT ), angiotensin-converting enzyme ( ACE ), angiotensin II type 1 receptor ( AGTR1 ), and aldosterone synthase ( CYP11B2 ), including sex-SNP or SNP-SNP interactions. Eighty-eight tagSNPs were tested in 2872 white individuals in 809 pedigrees from the Victorian Family Heart Study using variance components models. Three SNPs (rs8075924 and rs4277404 at ACE and rs12721297 at AGTR1 ) were individually associated with lower systolic blood pressure with significant ( P <0.00076) effect sizes ≈1.7 to 2.5 mm Hg. Sex-specific associations were seen for 3 SNPs in men (rs2468523 and rs2478544 at AGT and rs11658531 at ACE ) and 1 SNP in women (rs12451328 at ACE ). SNP-SNP interaction was suggested ( P <0.005) for 14 SNP pairs, none of which had shown individual association with systolic blood pressure. Four SNP pairs were at the same gene (2 for REN , 1 for AGT , and 1 for AGTR1 ). The SNP rs3097 at CYP11B2 was represented in 5 separate pairs. SNPs at key renin-angiotensin-aldosterone system genes associate with systolic blood pressure individually in both sexes, individually in one sex only and only when combined with another SNP. Analyses that incorporate sex-dependent and epistatic effects could reconcile past inconsistencies and account for some of the missing heritability of blood pressure and are generally relevant to SNP association studies for any phenotype. © 2017 American Heart Association, Inc.
DoGSD: the dog and wolf genome SNP database.

PubMed

Bai, Bing; Zhao, Wen-Ming; Tang, Bi-Xia; Wang, Yan-Qing; Wang, Lu; Zhang, Zhang; Yang, He-Chuan; Liu, Yan-Hu; Zhu, Jun-Wei; Irwin, David M; Wang, Guo-Dong; Zhang, Ya-Ping

2015-01-01

The rapid advancement of next-generation sequencing technology has generated a deluge of genomic data from domesticated dogs and their wild ancestor, grey wolves, which have simultaneously broadened our understanding of domestication and diseases that are shared by humans and dogs. To address the scarcity of single nucleotide polymorphism (SNP) data provided by authorized databases and to make SNP data more easily/friendly usable and available, we propose DoGSD (http://dogsd.big.ac.cn), the first canidae-specific database which focuses on whole genome SNP data from domesticated dogs and grey wolves. The DoGSD is a web-based, open-access resource comprising ∼ 19 million high-quality whole-genome SNPs. In addition to the dbSNP data set (build 139), DoGSD incorporates a comprehensive collection of SNPs from two newly sequenced samples (1 wolf and 1 dog) and collected SNPs from three latest dog/wolf genetic studies (7 wolves and 68 dogs), which were taken together for analysis with the population genetic statistics, Fst. In addition, DoGSD integrates some closely related information including SNP annotation, summary lists of SNPs located in genes, synonymous and non-synonymous SNPs, sampling location and breed information. All these features make DoGSD a useful resource for in-depth analysis in dog-/wolf-related studies. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
Effect of adiponectin-encoding gene ADIPOQ single nucleotide polymorphisms +45 and +276 on serum lipid levels after antiretroviral therapy in Japanese patients with HIV-1-infection.

PubMed

Kato, Hideaki; Ohata, Aya; Samukawa, Sei; Ueda, Atsuhisa; Ishigatsubo, Yoshiaki

2016-04-01

To investigate the association between single nucleotide polymorphisms (SNPs) in the adiponectin-encoding gene ADIPOQ and changes in serum lipid levels in HIV-1-infected patients after antiretroviral therapy (ART). ART-naïve HIV-1-infected patients were recruited to this prospective analysis. SNP +45 and SNP +276 genotype was determined by direct sequencing. Multivariate linear regression analysis was performed to analyse the effects of genotype, and predisposing conditions on serum total cholesterol and triglyceride in the 4 months before and after ART initiation. The study enrolled 78 patients with HIV-1-infection (73 male, five female; age range 22-67 years). HIV-1 viral load ≥5 log10 copies/ml, baseline total cholesterol ≥160 mg/dl, and CD4(+) lymphocyte count <200/µl were associated with increased serum total cholesterol levels after ART initiation. Protease inhibitor treatment and body mass index ≥25 kg/m(2) were associated with increased triglyceride levels after ART initiation. There were no significant associations between SNP +45 or SNP +276 genotype and serum total cholesterol or triglyceride levels. SNP +45 and SNP +276 genotype is not associated with changes in serum total cholesterol or triglyceride levels after ART initiation. © The Author(s) 2016.
Proper joint analysis of summary association statistics requires the adjustment of heterogeneity in SNP coverage pattern.

PubMed

Zhang, Han; Wheeler, William; Song, Lei; Yu, Kai

2017-07-07

As meta-analysis results published by consortia of genome-wide association studies (GWASs) become increasingly available, many association summary statistics-based multi-locus tests have been developed to jointly evaluate multiple single-nucleotide polymorphisms (SNPs) to reveal novel genetic architectures of various complex traits. The validity of these approaches relies on the accurate estimate of z-score correlations at considered SNPs, which in turn requires knowledge on the set of SNPs assessed by each study participating in the meta-analysis. However, this exact SNP coverage information is usually unavailable from the meta-analysis results published by GWAS consortia. In the absence of the coverage information, researchers typically estimate the z-score correlations by making oversimplified coverage assumptions. We show through real studies that such a practice can generate highly inflated type I errors, and we demonstrate the proper way to incorporate correct coverage information into multi-locus analyses. We advocate that consortia should make SNP coverage information available when posting their meta-analysis results, and that investigators who develop analytic tools for joint analyses based on summary data should pay attention to the variation in SNP coverage and adjust for it appropriately. Published by Oxford University Press 2017. This work is written by US Government employees and is in the public domain in the US.
Genome-wide association analysis to identify genotype × environment interaction for milk protein yield and level of somatic cell score as environmental descriptors in German Holsteins.

PubMed

Streit, M; Reinhardt, F; Thaller, G; Bennewitz, J

2013-01-01

Genotype by environment interaction (G × E) has been widely reported in dairy cattle. If the environment can be measured on a continuous scale, reaction norms can be applied to study G × E. The average herd milk production level has frequently been used as an environmental descriptor because it is influenced by the level of feeding or the feeding regimen. Another important environmental factor is the level of udder health and hygiene, for which the average herd somatic cell count might be a descriptor. In the present study, we conducted a genome-wide association analysis to identify single nucleotide polymorphisms (SNP) that affect intercept and slope of milk protein yield reaction norms when using the average herd test-day solution for somatic cell score as an environmental descriptor. Sire estimates for intercept and slope of the reaction norms were calculated from around 12 million daughter records, using linear reaction norm models. Sires were genotyped for ~54,000 SNP. The sire estimates were used as observations in the association analysis, using 1,797 sires. Significant SNP were confirmed in an independent validation set consisting of 500 sires. A known major gene affecting protein yield was included as a covariable in the statistical model. Sixty (21) SNP were confirmed for intercept with P ≤ 0.01 (P ≤ 0.001) in the validation set, and 28 and 11 SNP, respectively, were confirmed for slope. Most but not all SNP affecting slope also affected intercept. Comparison with an earlier study revealed that SNP affecting slope were, in general, also significant for slope when the environment was modeled by the average herd milk production level, although the two environmental descriptors were poorly correlated. Copyright © 2013 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.

Reduced rate of human papillomavirus infection and genetic overtransmission of TP53 72C polymorphic variant lower cervical cancer incidence.

PubMed

Alsbeih, Ghazi A; Al-Harbi, Najla M; Bin Judia, Sara S; Khoja, Hatim A; Shoukri, Mohamed M; Tulbah, Asma M

2017-07-01

Cervical cancer is a predominantly human papillomavirus (HPV)-driven disease worldwide. However, its incidence is unexplainably low in western Asia, including Saudi Arabia. Using this paradigm, we investigated the role of HPV infection rate and host genetic predisposition in TP53 G72C single nucleotide polymorphism (SNP) presumed to affect cancer incidence. Patients treated between 1990 and 2012 were reviewed, and a series of 232 invasive cervical cancer cases were studied and compared with 313 matched controls without cancer. SNP was genotyped by way of direct sequencing. HPV linear array analysis was used to detect and genotype HPV in tumor samples. The incidence of cervical cancer revealed bimodal peaks at 42.5 years, with a slighter rebound at 60.8 years. Among all cases, 77% were HPV-positive and 16 HPV genotypes were detected-mostly genotypes 16 (75%) and 18 (9%)-with no difference by age, histology, or geographical region. Although the TP53 G72C genotype was not associated with overall cervical cancer risk, it was significantly associated with HPV positivity (odds ratio, 0.57; 95% confidence interval, 0.36-0.90; P = .016). Furthermore, the variant C allele was significantly overtransmitted in the population (P < .0003). Cervical cancer incidence displays bimodal curve peaking at a young age with secondary rebound at older age. The combination of relative low HPV infection and variant TP53 72C allele overtransmission provide a plausible explanation for the low incidence of cervical cancer in our population. Therefore, HPV screening and host SNP genotyping may provide more relevant biomarkers to gauge the risk of developing cervical cancer. Cancer 2017;123:2459-66. © 2017 American Cancer Society. © 2017 The Authors. Cancer published by Wiley Periodicals, Inc. on behalf of American Cancer Society.
SUSCEPTIBILITY LOCI FOR UMBILICAL HERNIA IN SWINE DETECTED BY GENOME-WIDE ASSOCIATION.

PubMed

Liao, X J; Lia, L; Zhang, Z Y; Long, Y; Yang, B; Ruan, G R; Su, Y; Ai, H S; Zhang, W C; Deng, W Y; Xiao, S J; Ren, J; Ding, N S; Huang, L S

2015-10-01

Umbilical hernia (UH) is a complex disorder caused by both genetic and environmental factors. UH brings animal welfare problems and severe economic loss to the pig industry. Until now, the genetic basis of UH is poorly understood. The high-density 60K porcine SNP array enables the rapid application of genome-wide association study (GWAS) to identify genetic loci for phenotypic traits at genome wide scale in pigs. The objective of this research was to identify susceptibility loci for swine umbilical hernia using the GWAS approach. We genotyped 478 piglets from 142 families representing three Western commercial breeds with the Illumina PorcineSNP60 BeadChip. Then significant SNPs were detected by GWAS using ROADTRIPS (Robust Association-Detection Test for Related Individuals with Population Substructure) software base on a Bonferroni corrected threshold (P = 1.67E-06) or suggestive threshold (P = 3.34E-05) and false discovery rate (FDR = 0.05). After quality control, 29,924 qualified SNPs and 472 piglets were used for GWAS. Two suggestive loci predisposing to pig UH were identified at 44.25MB on SSC2 (rs81358018, P = 3.34E-06, FDR = 0.049933) and at 45.90MB on SSC17 (rs81479278, P = 3.30E-06, FDR = 0.049933) in Duroc population, respectively. And no SNP was detected to be associated with pig UH at significant level in neither Landrace nor Large White population. Furthermore, we carried out a meta-analysis in the combined pure-breed population containing all the 472 piglets. rs81479278 (P = 1.16E-06, FDR = 0.022475) was identified to associate with pig UH at genome-wide significant level. SRC was characterized as plausible candidate gene for susceptibility to pig UH according to its genomic position and biological functions. To our knowledge, this study gives the first description of GWAS identifying susceptibility loci for umbilical hernia in pigs. Our findings provide deeper insights to the genetic architecture of umbilical hernia in pigs.
TIA: algorithms for development of identity-linked SNP islands for analysis by massively parallel DNA sequencing.

PubMed

Farris, M Heath; Scott, Andrew R; Texter, Pamela A; Bartlett, Marta; Coleman, Patricia; Masters, David

2018-04-11

Single nucleotide polymorphisms (SNPs) located within the human genome have been shown to have utility as markers of identity in the differentiation of DNA from individual contributors. Massively parallel DNA sequencing (MPS) technologies and human genome SNP databases allow for the design of suites of identity-linked target regions, amenable to sequencing in a multiplexed and massively parallel manner. Therefore, tools are needed for leveraging the genotypic information found within SNP databases for the discovery of genomic targets that can be evaluated on MPS platforms. The SNP island target identification algorithm (TIA) was developed as a user-tunable system to leverage SNP information within databases. Using data within the 1000 Genomes Project SNP database, human genome regions were identified that contain globally ubiquitous identity-linked SNPs and that were responsive to targeted resequencing on MPS platforms. Algorithmic filters were used to exclude target regions that did not conform to user-tunable SNP island target characteristics. To validate the accuracy of TIA for discovering these identity-linked SNP islands within the human genome, SNP island target regions were amplified from 70 contributor genomic DNA samples using the polymerase chain reaction. Multiplexed amplicons were sequenced using the Illumina MiSeq platform, and the resulting sequences were analyzed for SNP variations. 166 putative identity-linked SNPs were targeted in the identified genomic regions. Of the 309 SNPs that provided discerning power across individual SNP profiles, 74 previously undefined SNPs were identified during evaluation of targets from individual genomes. Overall, DNA samples of 70 individuals were uniquely identified using a subset of the suite of identity-linked SNP islands. TIA offers a tunable genome search tool for the discovery of targeted genomic regions that are scalable in the population frequency and numbers of SNPs contained within the SNP island regions. It also allows the definition of sequence length and sequence variability of the target region as well as the less variable flanking regions for tailoring to MPS platforms. As shown in this study, TIA can be used to discover identity-linked SNP islands within the human genome, useful for differentiating individuals by targeted resequencing on MPS technologies.
Identification of an interaction between VWF rs7965413 and platelet count as a novel risk marker for metabolic syndrome: an extensive search of candidate polymorphisms in a case-control study.

PubMed

Nakatochi, Masahiro; Ushida, Yasunori; Yasuda, Yoshinari; Yoshida, Yasuko; Kawai, Shun; Kato, Ryuji; Nakashima, Toru; Iwata, Masamitsu; Kuwatsuka, Yachiyo; Ando, Masahiko; Hamajima, Nobuyuki; Kondo, Takaaki; Oda, Hiroaki; Hayashi, Mutsuharu; Kato, Sawako; Yamaguchi, Makoto; Maruyama, Shoichi; Matsuo, Seiichi; Honda, Hiroyuki

2015-01-01

Although many single nucleotide polymorphisms (SNPs) have been identified to be associated with metabolic syndrome (MetS), there was only a slight improvement in the ability to predict future MetS by the simply addition of SNPs to clinical risk markers. To improve the ability to predict future MetS, combinational effects, such as SNP-SNP interaction, SNP-environment interaction, and SNP-clinical parameter (SNP × CP) interaction should be also considered. We performed a case-control study to explore novel SNP × CP interactions as risk markers for MetS based on health check-up data of Japanese male employees. We selected 99 SNPs that were previously reported to be associated with MetS and components of MetS; subsequently, we genotyped these SNPs from 360 cases and 1983 control subjects. First, we performed logistic regression analyses to assess the association of each SNP with MetS. Of these SNPs, five SNPs were significantly associated with MetS (P < 0.05): LRP2 rs2544390, rs1800592 between UCP1 and TBC1D9, APOA5 rs662799, VWF rs7965413, and rs1411766 between MYO16 and IRS2. Furthermore, we performed multiple logistic regression analyses, including an SNP term, a CP term, and an SNP × CP interaction term for each CP and SNP that was significantly associated with MetS. We identified a novel SNP × CP interaction between rs7965413 and platelet count that was significantly associated with MetS [SNP term: odds ratio (OR) = 0.78, P = 0.004; SNP × CP interaction term: OR = 1.33, P = 0.001]. This association of the SNP × CP interaction with MetS remained nominally significant in multiple logistic regression analysis after adjustment for either the number of MetS components or MetS components excluding obesity. Our results reveal new insight into platelet count as a risk marker for MetS.
Single-nucleotide polymorphism-gene intermixed networking reveals co-linkers connected to multiple gene expression phenotypes

PubMed Central

Gong, Bin-Sheng; Zhang, Qing-Pu; Zhang, Guang-Mei; Zhang, Shao-Jun; Zhang, Wei; Lv, Hong-Chao; Zhang, Fan; Lv, Sa-Li; Li, Chuan-Xing; Rao, Shao-Qi; Li, Xia

2007-01-01

Gene expression profiles and single-nucleotide polymorphism (SNP) profiles are modern data for genetic analysis. It is possible to use the two types of information to analyze the relationships among genes by some genetical genomics approaches. In this study, gene expression profiles were used as expression traits. And relationships among the genes, which were co-linked to a common SNP(s), were identified by integrating the two types of information. Further research on the co-expressions among the co-linked genes was carried out after the gene-SNP relationships were established using the Haseman-Elston sib-pair regression. The results showed that the co-expressions among the co-linked genes were significantly higher if the number of connections between the genes and a SNP(s) was more than six. Then, the genes were interconnected via one or more SNP co-linkers to construct a gene-SNP intermixed network. The genes sharing more SNPs tended to have a stronger correlation. Finally, a gene-gene network was constructed with their intensities of relationships (the number of SNP co-linkers shared) as the weights for the edges. PMID:18466544
Interpreting aCGH-defined karyotypic changes in gliomas using copy number status, loss of heterozygosity and allelic ratios

PubMed Central

Cowell, John K; Lo, Ken C; Luce, Jesse; Hawthorn, Lesleyann

2009-01-01

We have used SNP mapping arrays to simultaneously record copy number changes, loss of heterozygosity and allele ratios (ploidy) in a series of 13 gliomas. This combined analysis has defined novel amplification events in this tumor type involving chr1:241544532-243005121 and chr18:54716681-54917277 which contain the AKT3 and ZNF532 genes respectively. The high resolution of this analysis has also identified homozygous deletions involving chr17:25600031-26490848 and Chr19:53883612-55061878. Throughout the karyotypes of these tumors, the combined analysis revealed counter intuitive relationships between copy number and LOH that requires reinterpretation of the significance of copy number gains and losses. It was not uncommon to observe copy number gains that were associated with loss of heterozygosity as well as copy number losses that were not. These events appeared to be related to ploidy status in the tumors as determined using allelic ratio calculations. Overall, this analysis of gliomas provides evidence for the need to perform more comprehensive interpretation of the CGH data beyond copy number analysis alone to evaluate the significance of individual events in the karyotypes. PMID:19818351
[Comparative analysis of STR and SNP polymorphism in the populations of sockeye salmon (Oncorhynchus nerka) from Eastern and Western Kamchatka].

PubMed

Khrustaleva, A M; Volkov, A A; Stoklitskaia, D S; Miuge, N S; Zelenina, D A

2010-11-01

Sockeye salmon samples from five largest lacustrine-riverine systems of Kamchatka Peninsula were tested for polymorphism at six microsatellite (STR) and five single nucleotide polymorphism (SNP) loci. Statistically significant genetic differentiation among local populations from this part of the species range examined was demonstrated. The data presented point to pronounced genetic divergence of the populations from two geographical regions, Eastern and Western Kamchatka. For sockeye salmon, the individual identification test accuracy was higher for microsatellites compared to similar number of SNP markers. Pooling of the STR and SNP allele frequency data sets provided the highest accuracy of the individual fish population assignment.
Developmental Coordination Disorder in a Patient with Mental Disability and a Mild Phenotype Carrying Terminal 6q26-qter Deletion

PubMed Central

De Cinque, Marianna; Palumbo, Orazio; Mazzucco, Ermelinda; Simone, Antonella; Palumbo, Pietro; Ciavatta, Renata; Maria, Giuliana; Ferese, Rosangela; Gambardella, Stefano; Angiolillo, Antonella; Carella, Massimo; Garofalo, Silvio

2017-01-01

Terminal deletion of chromosome 6q is a rare chromosomal abnormality associated with variable phenotype spectrum. Although intellectual disability, facial dysmorphism, seizures and brain abnormalities are typical features of this syndrome, genotype–phenotype correlation needs to be better understood. We report the case of a 6-year-old Caucasian boy with a clinical diagnosis of intellectual disability, delayed language development and dyspraxia who carries an approximately 8 Mb de novo heterozygous microdeletion in the 6q26-q27 locus identified by karyotype and defined by high-resolution SNP-array analysis. This patient has no significant structural brain or other organ malformation, and he shows a very mild phenotype compared to similar 6q26-qter deletion. The patient phenotype also suggests that a dyspraxia susceptibility gene is located among the deleted genes. PMID:29270193
Prenatal diagnosis of isochromosome 20q in a fetus with vertebral anomaly and rocker-bottom feet.

PubMed

Receveur, Aline; Brisset, Sophie; Martinovic, Jelena; Bazin, Anne; Lhomann, Laurence; Colmant, Claire; Pineau, Dominique; Gautier, Valérie; Tosca, Lucie; Tachdjian, Gérard

2017-10-01

Isochromosome of the long arm of chromosome 20 (i(20q)) is a rare structural abnormality in prenatal diagnosis. Thirty prenatal cases of mosaic i(20q) have been reported, among which only four are associated with fetal malformations. We describe a new prenatal case of i(20q) with fetal malformations. We also observed a discrepancy between uncultured and cultured amniotic fluid cells by using conventional cytogenetic, fluorescence in situ hybridization and array-SNP analysis. The short arm deletion of chromosome 20 arising from the isochromosome encompassed two candidate genes PAX1 and JAG1 involved in cranio-facial and vertebral development. The data would allow establishing a phenotype-genotype correlation. Thus, we proposed to define a recognizable syndrome combining cranio-facial dysmorphism, vertebral bodies' anomalies, feet and cerebral malformations. Copyright © 2017. Published by Elsevier B.V.
Genome-wide association study of body weight in Australian Merino sheep reveals an orthologous region on OAR6 to human and bovine genomic regions affecting height and weight.

PubMed

Al-Mamun, Hawlader A; Kwan, Paul; Clark, Samuel A; Ferdosi, Mohammad H; Tellam, Ross; Gondro, Cedric

2015-08-14

Body weight (BW) is an important trait for meat production in sheep. Although over the past few years, numerous quantitative trait loci (QTL) have been detected for production traits in cattle, few QTL studies have been reported for sheep, with even fewer on meat production traits. Our objective was to perform a genome-wide association study (GWAS) with the medium-density Illumina Ovine SNP50 BeadChip to identify genomic regions and corresponding haplotypes associated with BW in Australian Merino sheep. A total of 1781 Australian Merino sheep were genotyped using the medium-density Illumina Ovine SNP50 BeadChip. Among the 53 862 single nucleotide polymorphisms (SNPs) on this array, 48 640 were used to perform a GWAS using a linear mixed model approach. Genotypes were phased with hsphase; to estimate SNP haplotype effects, linkage disequilibrium blocks were identified in the detected QTL region. Thirty-nine SNPs were associated with BW at a Bonferroni-corrected genome-wide significance threshold of 1 %. One region on sheep (Ovis aries) chromosome 6 (OAR6) between 36.15 and 38.56 Mb, included 13 significant SNPs that were associated with BW; the most significant SNP was OAR6_41936490.1 (P = 2.37 × 10(-16)) at 37.69 Mb with an allele substitution effect of 2.12 kg, which corresponds to 0.248 phenotypic standard deviations for BW. The region that surrounds this association signal on OAR6 contains three genes: leucine aminopeptidase 3 (LAP3), which is involved in the processing of the oxytocin precursor; NCAPG non-SMC condensin I complex, subunit G (NCAPG), which is associated with foetal growth and carcass size in cattle; and ligand dependent nuclear receptor corepressor-like (LCORL), which is associated with height in humans and cattle. The GWAS analysis detected 39 SNPs associated with BW in sheep and a major QTL region was identified on OAR6. In several other mammalian species, regions that are syntenic with this region have been found to be associated with body size traits, which may reflect that the underlying biological mechanisms share a common ancestry. These findings should facilitate the discovery of causative variants for BW and contribute to marker-assisted selection.
Genome-wide DArT and SNP scan for QTL associated with resistance to stripe rust (Puccinia striiformis f. sp. tritici) in elite ICARDA wheat (Triticum aestivum L.) germplasm.

PubMed

Jighly, Abdulqader; Oyiga, Benedict C; Makdis, Farid; Nazari, Kumarse; Youssef, Omran; Tadesse, Wuletaw; Abdalla, Osman; Ogbonnaya, Francis C

2015-07-01

Identified DArT and SNP markers including a first reported QTL on 3AS, validated large effect APR on 3BS. The different genes can be used to incorporate stripe resistance in cultivated varieties. Stripe rust [yellow rust, caused by Puccinia striiformis f. sp. tritici (Pst)] is a serious disease in wheat (Triticum aestivum). This study employed genome-wide association mapping (GWAM) to identify markers linked to stripe rust resistance genes using Diversity Arrays Technology (DArT(®)) and single-nucleotide polymorphism (SNP) Infinium 9K assays in 200 ICARDA wheat genotypes, phenotyped for seedling and adult plant resistance in two sites over two growing seasons in Syria. Only 25.8 % of the genotypes showed resistance at seedling stage while about 33 and 44 % showed moderate resistance and resistance response, respectively. Mixed-linear model adjusted for false discovery rate at p < 0.05 identified 12 DArT and 29 SNP markers on chromosome arms 3AS, 3AL, 1AL, 2AL, 2BS, 2BL, 3BS, 3BL, 5BL, 6AL, and 7DS significantly linked to Pst resistance genes. Of these, the locus on 3AS has not been previously reported to confer resistance to stripe rust in wheat. The QTL on 3AS, 3AL, 1AL, 2AL, and 2BS were effective at seedling and adult plant growth stages while those on 3BS, 3BL, 5BL, 6AL and 7DS were effective at adult plant stage. The 3BS QTL was validated in Cham-6 × Cham-8 recombinant inbred line population; composite interval analysis identified a stripe resistance QTL flanked by the DArT marker, wPt-798970, contributed by Cham-6 parent which accounted for 31.2 % of the phenotypic variation. The DArT marker "wPt-798970" lies 1.6 cM away from the 3BS QTL detected within GWAM. Epistatic interactions were also investigated; only the QTL on 1AL, 3AS and 6AL exhibited interactions with other loci. These results suggest that GWAM can be an effective approach for identifying and improving resistance to stripe rust in wheat.
Processing-Dependent and Clonal Contamination Patterns of Listeria monocytogenes in the Cured Ham Food Chain Revealed by Genetic Analysis.

PubMed

Morganti, Marina; Scaltriti, Erika; Cozzolino, Paolo; Bolzoni, Luca; Casadei, Gabriele; Pierantoni, Marco; Foni, Emanuela; Pongolini, Stefano

2016-02-01

The quantitative and qualitative patterns of environmental contamination by Listeria monocytogenes were investigated in the production chain of dry-cured Parma ham. Standard arrays of surfaces were sampled in processing facilities during a single visit per plant in the three compartments of the food chain, i.e., ham production (19 plants) and postproduction, which was divided into deboning (43 plants) and slicing (25 plants) steps. The numbers of sampled surfaces were 384 in ham production, with 25 positive for L. monocytogenes, and 1,084 in postproduction, with 83 positives. Statistical analysis of the prevalence of contaminated surfaces showed that in ham production, contamination was higher at the beginning of processing and declined significantly toward the end, while in postproduction, prevalence rose toward the end of processing. Prevalence was higher in the deboning facilities than in slicing facilities and was dependent on the type of surface (floor/drainage > clothing > equipment). The qualitative pattern of contamination was investigated through an analysis of the survey isolates and a set of isolates derived from routine monitoring, including longitudinal isolations. Pulsed-field gel electrophoresis (PFGE) and whole-genome single-nucleotide polymorphism (SNP) analysis revealed a remarkable clonality of L. monocytogenes within plants, with the detection of 16 plant-specific clones out of 17 establishments with multiple isolates. Repeated detections of clonal isolates >6 months apart were also observed. Six was the maximum number of between-isolate differences in core SNPs observed within these clones. Based on the same six-SNP threshold, three clusters of clonal isolates, shared by six establishments, were also identified. The spread of L. monocytogenes within and between plants, as indicated by its clonal behavior, is a matter of concern for the hygienic management of establishments. Copyright © 2016, American Society for Microbiology. All Rights Reserved.
Genome-Wide Association Study for Susceptibility to and Recoverability From Mastitis in Danish Holstein Cows

PubMed Central

Welderufael, B. G.; Løvendahl, Peter; de Koning, Dirk-Jan; Janss, Lucas L. G.; Fikse, W. F.

2018-01-01

Because mastitis is very frequent and unavoidable, adding recovery information into the analysis for genetic evaluation of mastitis is of great interest from economical and animal welfare point of view. Here we have performed genome-wide association studies (GWAS) to identify associated single nucleotide polymorphisms (SNPs) and investigate the genetic background not only for susceptibility to – but also for recoverability from mastitis. Somatic cell count records from 993 Danish Holstein cows genotyped for a total of 39378 autosomal SNP markers were used for the association analysis. Single SNP regression analysis was performed using the statistical software package DMU. Substitution effect of each SNP was tested with a t-test and a genome-wide significance level of P-value < 10-4 was used to declare significant SNP-trait association. A number of significant SNP variants were identified for both traits. Many of the SNP variants associated either with susceptibility to – or recoverability from mastitis were located in or very near to genes that have been reported for their role in the immune system. Genes involved in lymphocyte developments (e.g., MAST3 and STAB2) and genes involved in macrophage recruitment and regulation of inflammations (PDGFD and PTX3) were suggested as possible causal genes for susceptibility to – and recoverability from mastitis, respectively. However, this is the first GWAS study for recoverability from mastitis and our results need to be validated. The findings in the current study are, therefore, a starting point for further investigations in identifying causal genetic variants or chromosomal regions for both susceptibility to – and recoverability from mastitis. PMID:29755506
Clonal evolution through loss of chromosomes and subsequent polyploidization in chondrosarcoma.

PubMed

Olsson, Linda; Paulsson, Kajsa; Bovée, Judith V M G; Nord, Karolin H

2011-01-01

Near-haploid chromosome numbers have been found in less than 1% of cytogenetically reported tumors, but seem to be more common in certain neoplasms including the malignant cartilage-producing tumor chondrosarcoma. By a literature survey of published karyotypes from chondrosarcomas we could confirm that loss of chromosomes resulting in hyperhaploid-hypodiploid cells is common and that these cells may polyploidize. Sixteen chondrosarcomas were investigated by single nucleotide polymorphism (SNP) array and the majority displayed SNP patterns indicative of a hyperhaploid-hypodiploid origin, with or without subsequent polyploidization. Except for chromosomes 5, 7, 19, 20 and 21, autosomal loss of heterozygosity was commonly found, resulting from chromosome loss and subsequent duplication of monosomic chromosomes giving rise to uniparental disomy. Additional gains, losses and rearrangements of genetic material, and even repeated rounds of polyploidization, may affect chondrosarcoma cells resulting in highly complex karyotypes. Loss of chromosomes and subsequent polyploidization was not restricted to a particular chondrosarcoma subtype and, although commonly found in chondrosarcoma, binucleated cells did not seem to be involved in these events.
Genetic Linkage Mapping of Economically Important Traits in Cultivated Tetraploid Potato (Solanum tuberosum L.).

PubMed

Massa, Alicia N; Manrique-Carpintero, Norma C; Coombs, Joseph J; Zarka, Daniel G; Boone, Anne E; Kirk, William W; Hackett, Christine A; Bryan, Glenn J; Douches, David S

2015-09-14

The objective of this study was to construct a single nucleotide polymorphism (SNP)-based genetic map at the cultivated tetraploid level to locate quantitative trait loci (QTL) contributing to economically important traits in potato (Solanum tuberosum L.). The 156 F1 progeny and parents of a cross (MSL603) between "Jacqueline Lee" and "MSG227-2" were genotyped using the Infinium 8303 Potato Array. Furthermore, the progeny and parents were evaluated for foliar late blight reaction to isolates of the US-8 genotype of Phytophthora infestans (Mont.) de Bary and vine maturity. Linkage analyses and QTL mapping were performed using a novel approach that incorporates allele dosage information. The resulting genetic maps contained 1972 SNP markers with an average density of 1.36 marker per cM. QTL mapping identified the major source of late blight resistance in "Jacqueline Lee." The best SNP marker mapped ~0.54 Mb from a resistance hotspot on the long arm of chromosome 9. For vine maturity, the major-effect QTL was located on chromosome 5 with allelic effects from both parents. A candidate SNP marker for this trait mapped ~0.25 Mb from the StCDF1 gene, which is a candidate gene for the maturity trait. The identification of markers for P. infestans resistance will enable the introgression of multiple sources of resistance through marker-assisted selection. Moreover, the discovery of a QTL for late blight resistance not linked to the QTL for vine maturity provides the opportunity to use marker-assisted selection for resistance independent of the selection for vine maturity classifications. Copyright © 2015 Massa et al.
Genetic Linkage Mapping of Economically Important Traits in Cultivated Tetraploid Potato (Solanum tuberosum L.)

PubMed Central

Massa, Alicia N.; Manrique-Carpintero, Norma C.; Coombs, Joseph J.; Zarka, Daniel G.; Boone, Anne E.; Kirk, William W.; Hackett, Christine A.; Bryan, Glenn J.; Douches, David S.

2015-01-01

The objective of this study was to construct a single nucleotide polymorphism (SNP)-based genetic map at the cultivated tetraploid level to locate quantitative trait loci (QTL) contributing to economically important traits in potato (Solanum tuberosum L.). The 156 F1 progeny and parents of a cross (MSL603) between “Jacqueline Lee” and “MSG227-2” were genotyped using the Infinium 8303 Potato Array. Furthermore, the progeny and parents were evaluated for foliar late blight reaction to isolates of the US-8 genotype of Phytophthora infestans (Mont.) de Bary and vine maturity. Linkage analyses and QTL mapping were performed using a novel approach that incorporates allele dosage information. The resulting genetic maps contained 1972 SNP markers with an average density of 1.36 marker per cM. QTL mapping identified the major source of late blight resistance in “Jacqueline Lee.” The best SNP marker mapped ∼0.54 Mb from a resistance hotspot on the long arm of chromosome 9. For vine maturity, the major-effect QTL was located on chromosome 5 with allelic effects from both parents. A candidate SNP marker for this trait mapped ∼0.25 Mb from the StCDF1 gene, which is a candidate gene for the maturity trait. The identification of markers for P. infestans resistance will enable the introgression of multiple sources of resistance through marker-assisted selection. Moreover, the discovery of a QTL for late blight resistance not linked to the QTL for vine maturity provides the opportunity to use marker-assisted selection for resistance independent of the selection for vine maturity classifications. PMID:26374597
The effect of input DNA copy number on genotype call and characterising SNP markers in the humpback whale genome using a nanofluidic array.

PubMed

Bhat, Somanath; Polanowski, Andrea M; Double, Mike C; Jarman, Simon N; Emslie, Kerry R

2012-01-01

Recent advances in nanofluidic technologies have enabled the use of Integrated Fluidic Circuits (IFCs) for high-throughput Single Nucleotide Polymorphism (SNP) genotyping (GT). In this study, we implemented and validated a relatively low cost nanofluidic system for SNP-GT with and without Specific Target Amplification (STA). As proof of principle, we first validated the effect of input DNA copy number on genotype call rate using well characterised, digital PCR (dPCR) quantified human genomic DNA samples and then implemented the validated method to genotype 45 SNPs in the humpback whale, Megaptera novaeangliae, nuclear genome. When STA was not incorporated, for a homozygous human DNA sample, reaction chambers containing, on average 9 to 97 copies, showed 100% call rate and accuracy. Below 9 copies, the call rate decreased, and at one copy it was 40%. For a heterozygous human DNA sample, the call rate decreased from 100% to 21% when predicted copies per reaction chamber decreased from 38 copies to one copy. The tightness of genotype clusters on a scatter plot also decreased. In contrast, when the same samples were subjected to STA prior to genotyping a call rate and a call accuracy of 100% were achieved. Our results demonstrate that low input DNA copy number affects the quality of data generated, in particular for a heterozygous sample. Similar to human genomic DNA, a call rate and a call accuracy of 100% was achieved with whale genomic DNA samples following multiplex STA using either 15 or 45 SNP-GT assays. These calls were 100% concordant with their true genotypes determined by an independent method, suggesting that the nanofluidic system is a reliable platform for executing call rates with high accuracy and concordance in genomic sequences derived from biological tissue.
Molecular and genealogical analysis of grain dormancy in Japanese wheat varieties, with specific focus on MOTHER OF FT AND TFL1 on chromosome 3A.

PubMed

Chono, Makiko; Matsunaka, Hitoshi; Seki, Masako; Fujita, Masaya; Kiribuchi-Otobe, Chikako; Oda, Shunsuke; Kojima, Hisayo; Nakamura, Shingo

2015-03-01

In the wheat (Triticum aestivum L.) cultivar 'Zenkoujikomugi', a single nucleotide polymorphism (SNP) in the promoter of MOTHER OF FT AND TFL1 on chromosome 3A (MFT-3A) causes an increase in the level of gene expression, resulting in strong grain dormancy. We used a DNA marker to detect the 'Zenkoujikomugi'-type (Zen-type) SNP and examined the genotype of MFT-3A in Japanese wheat varieties, and we found that 169 of 324 varieties carry the Zen-type SNP. In Japanese commercial varieties, the frequency of the Zen-type SNP was remarkably high in the southern part of Japan, but low in the northern part. To examine the relationship between MFT-3A genotype and grain dormancy, we performed a germination assay in three wheat-growing seasons. On average, the varieties carrying the Zen-type SNP showed stronger grain dormancy than the varieties carrying the non-Zen-type SNP. Among commercial cultivars, 'Iwainodaichi' (Kyushu), 'Junreikomugi' (Kinki-Chugoku-Shikoku), 'Kinuhime' (Kanto-Tokai), 'Nebarigoshi' (Tohoku-Hokuriku), and 'Kitamoe' (Hokkaido) showed the strongest grain dormancy in each geographical group, and all these varieties, except for 'Kitamoe', were found to carry the Zen-type SNP. In recent years, the number of varieties carrying the Zen-type SNP has increased in the Tohoku-Hokuriku region, but not in the Hokkaido region.
Isolation, characterization, and radiation protection of Sipunculus nudus L. polysaccharide.

PubMed

Li, Na; Shen, Xianrong; Liu, Yuming; Zhang, Junling; He, Ying; Liu, Qiong; Jiang, Dingwen; Zong, Jie; Li, Jiamei; Hou, Dengyong; Chen, Wei; Wang, Qingrong; Luo, Qun; Li, Kexian

2016-02-01

Sipunculus nudus Linnaeus polysaccharide (SNP) was purified from S. nudus L. via NaOH extraction, trichloroacetic acid deproteination, DEAE-cellulose 52 and Sephacryl S-300 chromatography. The monosaccharide analysis and molecular weight was detected with HPLC. FT-IR, 1H spectrum and 13C NMR spectrum were performed to detect the chemical characteristics. The antioxidant activity was assayed in vitro. The radiation protection effects were detected on mice. The results showed that SNP was composed of mannose, rhamnose, galacturonic acid, glucose, arabinose and fucose, and the average molecular weight was 680 kDa. Above the concentration of 10 mg/mL, SNP showed powerful scavenging activity on hydroxyl radical. In the animals irradiated with a 7.5 Gy γ-rays, the 90 mg/kg and the 270 mg/kg SNP groups survived significantly longer than the radiation control group. In the animals irradiated with a 4.0 Gy γ-rays, SNP showed significant protection effect. The contents of DNA in bone marrow cells were significantly increased by SNP treatment, and the micronucleus rates of 30 mg/kg and 270 mg/kg SNP groups were decrease significantly compared to the radiation control group. These findings suggest that SNP possesses marked antioxidant and bone marrow damage protection capacity which play important roles in the prevention of radiation damage. Copyright © 2015 Elsevier B.V. All rights reserved.
Genomewide association study of liver abscess in beef cattle.

PubMed

Keele, J W; Kuehn, L A; McDaneld, T G; Tait, R G; Jones, S A; Keel, B N; Snelling, W M

2016-02-01

Fourteen percent of U.S. cattle slaughtered in 2011 had liver abscesses, resulting in reduced carcass weight, quality, and value. Liver abscesses can result from a common bacterial cause, , which inhabits rumen lesions caused by acidosis and subsequently escapes into the blood stream, is filtered by the liver, and causes abscesses in the liver. Our aim was to identify SNP associated with liver abscesses in beef cattle. We used lung samples as a DNA source because they have low economic value, they have abundant DNA, and we had unrestricted access to sample them. We collected 2,304 lung samples from a beef processing plant: 1,152 from animals with liver abscess and 1,152 from animals without liver abscess. Lung tissue from pairs of animals, 1 with abscesses and another without, were collected from near one another on the viscera table to ensure that pairs of phenotypically extreme animals came from the same lot. Within each phenotype (abscess or no abscess), cattle were pooled by slaughter sequence into 12 pools of 96 cattle for each phenotype for a total of 24 pools. The pools were constructed by equal volume of frozen lung tissue from each animal. The DNA needed to allelotype each pool was then extracted from pooled lung tissue and the BovineHD Bead Array (777,962 SNP) was run on all 24 pools. Total intensity (TI), an indicator of copy number variants, was the sum of intensities from red and green dyes. Pooling allele frequency (PAF) was red dye intensity divided TI. Total intensity and PAF were weighted by the inverse of their respective genomic covariance matrices computed over all SNP across the genome. A false discovery rate ≤ 5% was achieved for 15 SNP for PAF and 20 SNP for TI. Genes within 50 kbp from significant SNP were in diverse pathways including maintenance of pH homeostasis in the gastrointestinal tract, maintain immune defenses in the liver, migration of leukocytes from the blood into infected tissues, transport of glutamine into the kidney in response to acidosis to facilitate production of bicarbonate to increase pH, aggregate platelets to liver injury to facilitate liver repair, and facilitate axon guidance. Evidence from the 35 detected SNP associations combined with evidence of polygenic variation indicate that there is adequate genetic variation in incidence rate of liver abscesses, which could be exploited to select sires for reduced susceptibility to subacute acidosis and associated liver abscess.

Associations between novel single nucleotide polymorphisms in the Bos taurus growth hormone gene and performance traits in Holstein-Friesian dairy cattle.

PubMed

Mullen, M P; Berry, D P; Howard, D J; Diskin, M G; Lynch, C O; Berkowicz, E W; Magee, D A; MacHugh, D E; Waters, S M

2010-12-01

Growth hormone, produced in the anterior pituitary gland, stimulates the release of insulin-like growth factor-I from the liver and is of critical importance in the control of nutrient utilization and partitioning for lactogenesis, fertility, growth, and development in cattle. The aim of this study was to discover novel polymorphisms in the bovine growth hormone gene (GH1) and to quantify their association with performance using estimates of genetic merit on 848 Holstein-Friesian AI (artificial insemination) dairy sires. Associations with previously reported polymorphisms in the bovine GH1 gene were also undertaken. A total of 38 novel single nucleotide polymorphisms (SNP) were identified across a panel of 22 beef and dairy cattle by sequence analysis of the 5' promoter, intronic, exonic, and 3' regulatory regions, encompassing approximately 7 kb of the GH1 gene. Following multiple regression analysis on all SNP, associations were identified between 11 SNP (2 novel and 9 previously identified) and milk fat and protein yield, milk composition, somatic cell score, survival, body condition score, and body size. The G allele of a previously identified SNP in exon 5 at position 2141 of the GH1 sequence, resulting in a nonsynonymous substitution, was associated with decreased milk protein yield. The C allele of a novel SNP, GH32, was associated with inferior carcass conformation. In addition, the T allele of a previously characterized SNP, GH35, was associated with decreased survival. Both GH24 (novel) and GH35 were independently associated with somatic cell count, and 3 SNP, GH21, 2291, and GH35, were independently associated with body depth. Furthermore, 2 SNP, GH24 and GH63, were independently associated with carcass fat. Results of this study further demonstrate the multifaceted influences of GH1 on milk production, fertility, and growth-related traits in cattle. Copyright © 2010 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
Viability of in-house datamarting approaches for population genetics analysis of SNP genotypes

PubMed Central

Amigo, Jorge; Phillips, Christopher; Salas, Antonio; Carracedo, Ángel

2009-01-01

Background Databases containing very large amounts of SNP (Single Nucleotide Polymorphism) data are now freely available for researchers interested in medical and/or population genetics applications. While many of these SNP repositories have implemented data retrieval tools for general-purpose mining, these alone cannot cover the broad spectrum of needs of most medical and population genetics studies. Results To address this limitation, we have built in-house customized data marts from the raw data provided by the largest public databases. In particular, for population genetics analysis based on genotypes we have built a set of data processing scripts that deal with raw data coming from the major SNP variation databases (e.g. HapMap, Perlegen), stripping them into single genotypes and then grouping them into populations, then merged with additional complementary descriptive information extracted from dbSNP. This allows not only in-house standardization and normalization of the genotyping data retrieved from different repositories, but also the calculation of statistical indices from simple allele frequency estimates to more elaborate genetic differentiation tests within populations, together with the ability to combine population samples from different databases. Conclusion The present study demonstrates the viability of implementing scripts for handling extensive datasets of SNP genotypes with low computational costs, dealing with certain complex issues that arise from the divergent nature and configuration of the most popular SNP repositories. The information contained in these databases can also be enriched with additional information obtained from other complementary databases, in order to build a dedicated data mart. Updating the data structure is straightforward, as well as permitting easy implementation of new external data and the computation of supplementary statistical indices of interest. PMID:19344481
Viability of in-house datamarting approaches for population genetics analysis of SNP genotypes.

PubMed

Amigo, Jorge; Phillips, Christopher; Salas, Antonio; Carracedo, Angel

2009-03-19

Databases containing very large amounts of SNP (Single Nucleotide Polymorphism) data are now freely available for researchers interested in medical and/or population genetics applications. While many of these SNP repositories have implemented data retrieval tools for general-purpose mining, these alone cannot cover the broad spectrum of needs of most medical and population genetics studies. To address this limitation, we have built in-house customized data marts from the raw data provided by the largest public databases. In particular, for population genetics analysis based on genotypes we have built a set of data processing scripts that deal with raw data coming from the major SNP variation databases (e.g. HapMap, Perlegen), stripping them into single genotypes and then grouping them into populations, then merged with additional complementary descriptive information extracted from dbSNP. This allows not only in-house standardization and normalization of the genotyping data retrieved from different repositories, but also the calculation of statistical indices from simple allele frequency estimates to more elaborate genetic differentiation tests within populations, together with the ability to combine population samples from different databases. The present study demonstrates the viability of implementing scripts for handling extensive datasets of SNP genotypes with low computational costs, dealing with certain complex issues that arise from the divergent nature and configuration of the most popular SNP repositories. The information contained in these databases can also be enriched with additional information obtained from other complementary databases, in order to build a dedicated data mart. Updating the data structure is straightforward, as well as permitting easy implementation of new external data and the computation of supplementary statistical indices of interest.
Unravelling the Genetic Diversity among Cassava Bemisia tabaci Whiteflies Using NextRAD Sequencing.

PubMed

Wosula, Everlyne N; Chen, Wenbo; Fei, Zhangjun; Legg, James P

2017-11-01

Bemisia tabaci threatens production of cassava in Africa through vectoring viruses that cause cassava mosaic disease (CMD) and cassava brown streak disease (CBSD). B. tabaci sampled from cassava in eight countries in Africa were genotyped using NextRAD sequencing, and their phylogeny and population genetics were investigated using the resultant single nucleotide polymorphism (SNP) markers. SNP marker data and short sequences of mitochondrial DNA cytochrome oxidase I (mtCOI) obtained from the same insect were compared. Eight genetically distinct groups were identified based on mtCOI, whereas phylogenetic analysis using SNPs identified six major groups, which were further confirmed by PCA and multidimensional analyses. STRUCTURE analysis identified four ancestral B. tabaci populations that have contributed alleles to the six SNP-based groups. Significant gene flows were detected between several of the six SNP-based groups. Evidence of gene flow was strongest for SNP-based groups occurring in central Africa. Comparison of the mtCOI and SNP identities of sampled insects provided a strong indication that hybrid populations are emerging in parts of Africa recently affected by the severe CMD pandemic. This study reveals that mtCOI is not an effective marker at distinguishing cassava-colonizing B. tabaci haplogroups, and that more robust SNP-based multilocus markers should be developed. Significant gene flows between populations could lead to the emergence of haplogroups that might alter the dynamics of cassava virus spread and disease severity in Africa. Continuous monitoring of genetic compositions of whitefly populations should be an essential component in efforts to combat cassava viruses in Africa. © The Author(s) 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Analysis of genomic aberrations and gene expression profiling identifies novel lesions and pathways in myeloproliferative neoplasms

PubMed Central

Rice, K L; Lin, X; Wolniak, K; Ebert, B L; Berkofsky-Fessler, W; Buzzai, M; Sun, Y; Xi, C; Elkin, P; Levine, R; Golub, T; Gilliland, D G; Crispino, J D; Licht, J D; Zhang, W

2011-01-01

Polycythemia vera (PV), essential thrombocythemia and primary myelofibrosis, are myeloproliferative neoplasms (MPNs) with distinct clinical features and are associated with the JAK2V617F mutation. To identify genomic anomalies involved in the pathogenesis of these disorders, we profiled 87 MPN patients using Affymetrix 250K single-nucleotide polymorphism (SNP) arrays. Aberrations affecting chr9 were the most frequently observed and included 9pLOH (n=16), trisomy 9 (n=6) and amplifications of 9p13.3–23.3 (n=1), 9q33.1–34.13 (n=1) and 9q34.13 (n=6). Patients with trisomy 9 were associated with elevated JAK2V617F mutant allele burden, suggesting that gain of chr9 represents an alternative mechanism for increasing JAK2V617F dosage. Gene expression profiling of patients with and without chr9 abnormalities (+9, 9pLOH), identified genes potentially involved in disease pathogenesis including JAK2, STAT5B and MAPK14. We also observed recurrent gains of 1p36.31–36.33 (n=6), 17q21.2–q21.31 (n=5) and 17q25.1–25.3 (n=5) and deletions affecting 18p11.31–11.32 (n=8). Combined SNP and gene expression analysis identified aberrations affecting components of a non-canonical PRC2 complex (EZH1, SUZ12 and JARID2) and genes comprising a ‘HSC signature' (MLLT3, SMARCA2 and PBX1). We show that NFIB, which is amplified in 7/87 MPN patients and upregulated in PV CD34+ cells, protects cells from apoptosis induced by cytokine withdrawal. PMID:22829077
Genomic and transcriptomic predictors of triglyceride response to regular exercise

PubMed Central

Sarzynski, Mark A; Davidsen, Peter K; Sung, Yun Ju; Hesselink, Matthijs K C; Schrauwen, Patrick; Rice, Treva K; Rao, D C; Falciani, Francesco; Bouchard, Claude

2015-01-01

Aim We performed genome-wide and transcriptome-wide profiling to identify genes and single nucleotide polymorphisms (SNPs) associated with the response of triglycerides (TG) to exercise training. Methods Plasma TG levels were measured before and after a 20-week endurance training programme in 478 white participants from the HERITAGE Family Study. Illumina HumanCNV370-Quad v3.0 BeadChips were genotyped using the Illumina BeadStation 500GX platform. Affymetrix HG-U133+2 arrays were used to quantitate gene expression levels from baseline muscle biopsies of a subset of participants (N=52). Genome-wide association study (GWAS) analysis was performed using MERLIN, while transcriptomic predictor models were developed using the R-package GALGO. Results The GWAS results showed that eight SNPs were associated with TG training-response (ΔTG) at p<9.9×10−6, while another 31 SNPs showed p values <1×10−4. In multivariate regression models, the top 10 SNPs explained 32.0% of the variance in ΔTG, while conditional heritability analysis showed that four SNPs statistically accounted for all of the heritability of ΔTG. A molecular signature based on the baseline expression of 11 genes predicted 27% of ΔTG in HERITAGE, which was validated in an independent study. A composite SNP score based on the top four SNPs, each from the genomic and transcriptomic analyses, was the strongest predictor of ΔTG (R2=0.14, p=3.0×10−68). Conclusions Our results indicate that skeletal muscle transcript abundance at 11 genes and SNPs at a number of loci contribute to TG response to exercise training. Combining data from genomics and transcriptomics analyses identified a SNP-based gene signature that should be further tested in independent samples. PMID:26491034
Discrimination of candidate subgenome-specific loci by linkage map construction with an S1 population of octoploid strawberry (Fragaria × ananassa).

PubMed

Nagano, Soichiro; Shirasawa, Kenta; Hirakawa, Hideki; Maeda, Fumi; Ishikawa, Masami; Isobe, Sachiko N

2017-05-12

The strawberry, Fragaria × ananassa, is an allo-octoploid (2n = 8x = 56) and outcrossing species. Although it is the most widely consumed berry crop in the world, its complex genome structure has hindered its genetic and genomic analysis, and thus discrimination of subgenome-specific loci among the homoeologous chromosomes is needed. In the present study, we identified candidate subgenome-specific single nucleotide polymorphism (SNP) and simple sequence repeat (SSR) loci, and constructed a linkage map using an S 1 mapping population of the cultivar 'Reikou' with an IStraw90 Axiom® SNP array and previously published SSR markers. The 'Reikou' linkage map consisted of 11,574 loci (11,002 SNPs and 572 SSR loci) spanning 2816.5 cM of 31 linkage groups. The 11,574 loci were located on 4738 unique positions (bin) on the linkage map. Of the mapped loci, 8999 (8588 SNPs and 411 SSR loci) showed a 1:2:1 segregation ratio of AA:AB:BB allele, which suggested the possibility of deriving loci from candidate subgenome-specific sequences. In addition, 2575 loci (2414 SNPs and 161 SSR loci) showed a 3:1 segregation of AB:BB allele, indicating they were derived from homoeologous genomic sequences. Comparative analysis of the homoeologous linkage groups revealed differences in genome structure among the subgenomes. Our results suggest that candidate subgenome-specific loci are randomly located across the genomes, and that there are small- to large-scale structural variations among the subgenomes. The mapped SNPs and SSR loci on the linkage map are expected to be seed points for the construction of pseudomolecules in the octoploid strawberry.
Genome-Wide Analysis Reveals Selection for Important Traits in Domestic Horse Breeds

PubMed Central

Petersen, Jessica L.; Mickelson, James R.; Rendahl, Aaron K.; Valberg, Stephanie J.; Andersson, Lisa S.; Axelsson, Jeanette; Bailey, Ernie; Bannasch, Danika; Binns, Matthew M.; Borges, Alexandre S.; Brama, Pieter; da Câmara Machado, Artur; Capomaccio, Stefano; Cappelli, Katia; Cothran, E. Gus; Distl, Ottmar; Fox-Clipsham, Laura; Graves, Kathryn T.; Guérin, Gérard; Haase, Bianca; Hasegawa, Telhisa; Hemmann, Karin; Hill, Emmeline W.; Leeb, Tosso; Lindgren, Gabriella; Lohi, Hannes; Lopes, Maria Susana; McGivney, Beatrice A.; Mikko, Sofia; Orr, Nicholas; Penedo, M. Cecilia T.; Piercy, Richard J.; Raekallio, Marja; Rieder, Stefan; Røed, Knut H.; Swinburne, June; Tozaki, Teruaki; Vaudin, Mark; Wade, Claire M.; McCue, Molly E.

2013-01-01

Intense selective pressures applied over short evolutionary time have resulted in homogeneity within, but substantial variation among, horse breeds. Utilizing this population structure, 744 individuals from 33 breeds, and a 54,000 SNP genotyping array, breed-specific targets of selection were identified using an FST-based statistic calculated in 500-kb windows across the genome. A 5.5-Mb region of ECA18, in which the myostatin (MSTN) gene was centered, contained the highest signature of selection in both the Paint and Quarter Horse. Gene sequencing and histological analysis of gluteal muscle biopsies showed a promoter variant and intronic SNP of MSTN were each significantly associated with higher Type 2B and lower Type 1 muscle fiber proportions in the Quarter Horse, demonstrating a functional consequence of selection at this locus. Signatures of selection on ECA23 in all gaited breeds in the sample led to the identification of a shared, 186-kb haplotype including two doublesex related mab transcription factor genes (DMRT2 and 3). The recent identification of a DMRT3 mutation within this haplotype, which appears necessary for the ability to perform alternative gaits, provides further evidence for selection at this locus. Finally, putative loci for the determination of size were identified in the draft breeds and the Miniature horse on ECA11, as well as when signatures of selection surrounding candidate genes at other loci were examined. This work provides further evidence of the importance of MSTN in racing breeds, provides strong evidence for selection upon gait and size, and illustrates the potential for population-based techniques to find genomic regions driving important phenotypes in the modern horse. PMID:23349635
Extended diversity analysis of cultivated grapevine Vitis vinifera with 10K genome-wide SNPs.

PubMed

Laucou, Valérie; Launay, Amandine; Bacilieri, Roberto; Lacombe, Thierry; Adam-Blondon, Anne-Françoise; Bérard, Aurélie; Chauveau, Aurélie; de Andrés, Maria Teresa; Hausmann, Ludger; Ibáñez, Javier; Le Paslier, Marie-Christine; Maghradze, David; Martinez-Zapater, José Miguel; Maul, Erika; Ponnaiah, Maharajah; Töpfer, Reinhard; Péros, Jean-Pierre; Boursiquot, Jean-Michel

2018-01-01

Grapevine is a very important crop species that is mainly cultivated worldwide for fruits, wine and juice. Identification of the genetic bases of performance traits through association mapping studies requires a precise knowledge of the available diversity and how this diversity is structured and varies across the whole genome. An 18k SNP genotyping array was evaluated on a panel of Vitis vinifera cultivars and we obtained a data set with no missing values for a total of 10207 SNPs and 783 different genotypes. The average inter-SNP spacing was ~47 kbp, the mean minor allele frequency (MAF) was 0.23 and the genetic diversity in the sample was high (He = 0.32). Fourteen SNPs, chosen from those with the highest MAF values, were sufficient to identify each genotype in the sample. Parentage analysis revealed 118 full parentages and 490 parent-offspring duos, thus confirming the close pedigree relationships within the cultivated grapevine. Structure analyses also confirmed the main divisions due to an eastern-western gradient and human usage (table vs. wine). Using a multivariate approach, we refined the structure and identified a total of eight clusters. Both the genetic diversity (He, 0.26-0.32) and linkage disequilibrium (LD, 28.8-58.2 kbp) varied between clusters. Despite the short span LD, we also identified some non-recombining haplotype blocks that may complicate association mapping. Finally, we performed a genome-wide association study that confirmed previous works and also identified new regions for important performance traits such as acidity. Taken together, all the results contribute to a better knowledge of the genetics of the cultivated grapevine.
Extended diversity analysis of cultivated grapevine Vitis vinifera with 10K genome-wide SNPs

PubMed Central

Launay, Amandine; Bacilieri, Roberto; Lacombe, Thierry; Adam-Blondon, Anne-Françoise; Bérard, Aurélie; Chauveau, Aurélie; de Andrés, Maria Teresa; Maghradze, David; Maul, Erika; Ponnaiah, Maharajah; Töpfer, Reinhard; Péros, Jean-Pierre; Boursiquot, Jean-Michel

2018-01-01

Grapevine is a very important crop species that is mainly cultivated worldwide for fruits, wine and juice. Identification of the genetic bases of performance traits through association mapping studies requires a precise knowledge of the available diversity and how this diversity is structured and varies across the whole genome. An 18k SNP genotyping array was evaluated on a panel of Vitis vinifera cultivars and we obtained a data set with no missing values for a total of 10207 SNPs and 783 different genotypes. The average inter-SNP spacing was ~47 kbp, the mean minor allele frequency (MAF) was 0.23 and the genetic diversity in the sample was high (He = 0.32). Fourteen SNPs, chosen from those with the highest MAF values, were sufficient to identify each genotype in the sample. Parentage analysis revealed 118 full parentages and 490 parent-offspring duos, thus confirming the close pedigree relationships within the cultivated grapevine. Structure analyses also confirmed the main divisions due to an eastern-western gradient and human usage (table vs. wine). Using a multivariate approach, we refined the structure and identified a total of eight clusters. Both the genetic diversity (He, 0.26–0.32) and linkage disequilibrium (LD, 28.8–58.2 kbp) varied between clusters. Despite the short span LD, we also identified some non-recombining haplotype blocks that may complicate association mapping. Finally, we performed a genome-wide association study that confirmed previous works and also identified new regions for important performance traits such as acidity. Taken together, all the results contribute to a better knowledge of the genetics of the cultivated grapevine. PMID:29420602
CYP gene family variants as potential protective factors in drug addiction in Han Chinese.

PubMed

Zhang, Hongxing; Yang, Qi; Zheng, Wenkai; Ouyang, Yongri; Yang, Min; Wang, Fengjiao; Jin, Tianbo; Zhang, Ji; Wang, Zhenyuan

2016-08-01

There is growing evidence that genetic factors also contribute to drug addiction. The human cytochrome P450 has shown special interest because of pharmacokinetic variation in different individuals and populations, which is largely determined by the relevant genes. The present study aimed to investigate the polymorphism of the CYP/addicts relationship. We genotyped 13 tag single-nucleotide polymorphisms (tSNPs) from three genes, including 692 cases and 700 controls. Sequenom MassARRAY RS1000 (Sequenom, Inc., San Diego, CA, USA) was used for SNP genotyping. Statistical analysis of the association between tSNPs and drug addiction was performed using the chi-squared test and SNP Stats software (http://bioinfo.iconcologia.net). The T/T genotype of rs2242480 in CYP3A4 was associated with decreased risk in the recessive model (p = 0.0002). Allele frequency at rs3743484 in CYP1A2 showed significant differences between addicts and controls (p = 0.046; odds ratio = 0.80; 95% confidence interval = 0.65-1.00). In genetic model analyses, the minor C allele of rs3743484 in CYP1A2 was associated with a reduced risk of drug addiction based on analysis using codominant and additive models (p = 0.027 dominant model; p =0.038 additive model). Our findings show that at allelic and genotypic level polymorphisms in CYP3A4 and CYP1A2 are significantly associated with a reduced risk of drug addiction in X'ian Han Chinese individuals. However, this result needs to be confirmed in additional studies. Copyright © 2016 John Wiley & Sons, Ltd. Copyright © 2016 John Wiley & Sons, Ltd.
A missense mutation in MYH1 is associated with susceptibility to immune-mediated myositis in Quarter Horses.

PubMed

Finno, Carrie J; Gianino, Giuliana; Perumbakkam, Sudeep; Williams, Zoë J; Bordbari, Matthew H; Gardner, Keri L; Burns, Erin; Peng, Sichong; Durward-Akhurst, Sian A; Valberg, Stephanie J

2018-03-06

The cause of immune-mediated myositis (IMM), characterized by recurrent, rapid-onset muscle atrophy in Quarter Horses (QH), is unknown. The histopathologic hallmark of IMM is lymphocytic infiltration of myofibers. The purpose of this study was to identify putative functional variants associated with equine IMM. A genome-wide association (GWA) study was performed on 36 IMM QHs and 54 breed matched unaffected QHs from the same environment using the Equine SNP50 and SNP70 genotyping arrays. A mixed model analysis identified nine SNPs within a ~ 2.87 Mb region on chr11 that were significantly (P unadjusted < 1.4 × 10 - 6 ) associated with the IMM phenotype. Associated haplotypes within this region encompassed 38 annotated genes, including four myosin genes (MYH1, MYH2, MYH3, and MYH13). Whole genome sequencing of four IMM and four unaffected QHs identified a single segregating nonsynonymous E321G mutation in MYH1 encoding myosin heavy chain 2X. Genotyping of additional 35 IMM and 22 unaffected QHs confirmed an association (P = 2.9 × 10 - 5 ), and the putative mutation was absent in 175 horses from 21 non-QH breeds. Lymphocytic infiltrates occurred in type 2X myofibers and the proportion of 2X fibers was decreased in the presence of inflammation. Protein modeling and contact/stability analysis identified 14 residues affected by the mutation which significantly decreased stability. We conclude that a mutation in MYH1 is highly associated with susceptibility to the IMM phenotype in QH-related breeds. This is the first report of a mutation in MYH1 and the first link between a skeletal muscle myosin mutation and autoimmune disease.
Catalog of MicroRNA Seed Polymorphisms in Vertebrates

PubMed Central

Calin, George Adrian; Horvat, Simon; Jiang, Zhihua; Dovc, Peter; Kunej, Tanja

2012-01-01

MicroRNAs (miRNAs) are a class of non-coding RNA that plays an important role in posttranscriptional regulation of mRNA. Evidence has shown that miRNA gene variability might interfere with its function resulting in phenotypic variation and disease susceptibility. A major role in miRNA target recognition is ascribed to complementarity with the miRNA seed region that can be affected by polymorphisms. In the present study, we developed an online tool for the detection of miRNA polymorphisms (miRNA SNiPer) in vertebrates (http://www.integratomics-time.com/miRNA-SNiPer) and generated a catalog of miRNA seed region polymorphisms (miR-seed-SNPs) consisting of 149 SNPs in six species. Although a majority of detected polymorphisms were due to point mutations, two consecutive nucleotide substitutions (double nucleotide polymorphisms, DNPs) were also identified in nine miRNAs. We determined that miR-SNPs are frequently located within the quantitative trait loci (QTL), chromosome fragile sites, and cancer susceptibility loci, indicating their potential role in the genetic control of various complex traits. To test this further, we performed an association analysis between the mmu-miR-717 seed SNP rs30372501, which is polymorphic in a large number of standard inbred strains, and all phenotypic traits in these strains deposited in the Mouse Phenome Database. Analysis showed a significant association between the mmu-miR-717 seed SNP and a diverse array of traits including behavior, blood-clinical chemistry, body weight size and growth, and immune system suggesting that seed SNPs can indeed have major pleiotropic effects. The bioinformatics analyses, data and tools developed in the present study can serve researchers as a starting point in testing more targeted hypotheses and designing experiments using optimal species or strains for further mechanistic studies. PMID:22303453
Association analysis of the SLC22A11 (organic anion transporter 4) and SLC22A12 (urate transporter 1) urate transporter locus with gout in New Zealand case-control sample sets reveals multiple ancestral-specific effects

PubMed Central

2013-01-01

Introduction There is inconsistent association between urate transporters SLC22A11 (organic anion transporter 4 (OAT4)) and SLC22A12 (urate transporter 1 (URAT1)) and risk of gout. New Zealand (NZ) Māori and Pacific Island people have higher serum urate and more severe gout than European people. The aim of this study was to test genetic variation across the SLC22A11/SLC22A12 locus for association with risk of gout in NZ sample sets. Methods A total of 12 single nucleotide polymorphism (SNP) variants in four haplotype blocks were genotyped using TaqMan® and Sequenom MassArray in 1003 gout cases and 1156 controls. All cases had gout according to the 1977 American Rheumatism Association criteria. Association analysis of single markers and haplotypes was performed using PLINK and Stata. Results A haplotype block 1 SNP (rs17299124) (upstream of SLC22A11) was associated with gout in less admixed Polynesian sample sets, but not European Caucasian (odds ratio; OR = 3.38, P = 6.1 × 10-4; OR = 0.91, P = 0.40, respectively) sample sets. A protective block 1 haplotype caused the rs17299124 association (OR = 0.28, P = 6.0 × 10-4). Within haplotype block 2 (SLC22A11) we could not replicate previous reports of association of rs2078267 with gout in European Caucasian (OR = 0.98, P = 0.82) sample sets, however this SNP was associated with gout in Polynesian (OR = 1.51, P = 0.022) sample sets. Within haplotype block 3 (including SLC22A12) analysis of haplotypes revealed a haplotype with trans-ancestral protective effects (OR = 0.80, P = 0.004), and a second haplotype conferring protection in less admixed Polynesian sample sets (OR = 0.63, P = 0.028) but risk in European Caucasian samples (OR = 1.33, P = 0.039). Conclusions Our analysis provides evidence for multiple ancestral-specific effects across the SLC22A11/SLC22A12 locus that presumably influence the activity of OAT4 and URAT1 and risk of gout. Further fine mapping of the association signal is needed using trans-ancestral re-sequence data. PMID:24360580
Combined Targeted DNA Sequencing in Non-Small Cell Lung Cancer (NSCLC) Using UNCseq and NGScopy, and RNA Sequencing Using UNCqeR for the Detection of Genetic Aberrations in NSCLC

PubMed Central

Walter, Vonn; Patel, Nirali M.; Eberhard, David A.; Hayward, Michele C.; Salazar, Ashley H.; Jo, Heejoon; Soloway, Matthew G.; Wilkerson, Matthew D.; Parker, Joel S.; Yin, Xiaoying; Zhang, Guosheng; Siegel, Marni B.; Rosson, Gary B.; Earp, H. Shelton; Sharpless, Norman E.; Gulley, Margaret L.; Weck, Karen E.

2015-01-01

The recent FDA approval of the MiSeqDx platform provides a unique opportunity to develop targeted next generation sequencing (NGS) panels for human disease, including cancer. We have developed a scalable, targeted panel-based assay termed UNCseq, which involves a NGS panel of over 200 cancer-associated genes and a standardized downstream bioinformatics pipeline for detection of single nucleotide variations (SNV) as well as small insertions and deletions (indel). In addition, we developed a novel algorithm, NGScopy, designed for samples with sparse sequencing coverage to detect large-scale copy number variations (CNV), similar to human SNP Array 6.0 as well as small-scale intragenic CNV. Overall, we applied this assay to 100 snap-frozen lung cancer specimens lacking same-patient germline DNA (07–0120 tissue cohort) and validated our results against Sanger sequencing, SNP Array, and our recently published integrated DNA-seq/RNA-seq assay, UNCqeR, where RNA-seq of same-patient tumor specimens confirmed SNV detected by DNA-seq, if RNA-seq coverage depth was adequate. In addition, we applied the UNCseq assay on an independent lung cancer tumor tissue collection with available same-patient germline DNA (11–1115 tissue cohort) and confirmed mutations using assays performed in a CLIA-certified laboratory. We conclude that UNCseq can identify SNV, indel, and CNV in tumor specimens lacking germline DNA in a cost-efficient fashion. PMID:26076459
High throughput SNP discovery and genotyping in hexaploid wheat.

PubMed

Rimbert, Hélène; Darrier, Benoît; Navarro, Julien; Kitt, Jonathan; Choulet, Frédéric; Leveugle, Magalie; Duarte, Jorge; Rivière, Nathalie; Eversole, Kellye; Le Gouis, Jacques; Davassi, Alessandro; Balfourier, François; Le Paslier, Marie-Christine; Berard, Aurélie; Brunel, Dominique; Feuillet, Catherine; Poncet, Charles; Sourdille, Pierre; Paux, Etienne

2018-01-01

Because of their abundance and their amenability to high-throughput genotyping techniques, Single Nucleotide Polymorphisms (SNPs) are powerful tools for efficient genetics and genomics studies, including characterization of genetic resources, genome-wide association studies and genomic selection. In wheat, most of the previous SNP discovery initiatives targeted the coding fraction, leaving almost 98% of the wheat genome largely unexploited. Here we report on the use of whole-genome resequencing data from eight wheat lines to mine for SNPs in the genic, the repetitive and non-repetitive intergenic fractions of the wheat genome. Eventually, we identified 3.3 million SNPs, 49% being located on the B-genome, 41% on the A-genome and 10% on the D-genome. We also describe the development of the TaBW280K high-throughput genotyping array containing 280,226 SNPs. Performance of this chip was examined by genotyping a set of 96 wheat accessions representing the worldwide diversity. Sixty-nine percent of the SNPs can be efficiently scored, half of them showing a diploid-like clustering. The TaBW280K was proven to be a very efficient tool for diversity analyses, as well as for breeding as it can discriminate between closely related elite varieties. Finally, the TaBW280K array was used to genotype a population derived from a cross between Chinese Spring and Renan, leading to the construction a dense genetic map comprising 83,721 markers. The results described here will provide the wheat community with powerful tools for both basic and applied research.
Parallel and serial computing tools for testing single-locus and epistatic SNP effects of quantitative traits in genome-wide association studies

PubMed Central

Ma, Li; Runesha, H Birali; Dvorkin, Daniel; Garbe, John R; Da, Yang

2008-01-01

Background Genome-wide association studies (GWAS) using single nucleotide polymorphism (SNP) markers provide opportunities to detect epistatic SNPs associated with quantitative traits and to detect the exact mode of an epistasis effect. Computational difficulty is the main bottleneck for epistasis testing in large scale GWAS. Results The EPISNPmpi and EPISNP computer programs were developed for testing single-locus and epistatic SNP effects on quantitative traits in GWAS, including tests of three single-locus effects for each SNP (SNP genotypic effect, additive and dominance effects) and five epistasis effects for each pair of SNPs (two-locus interaction, additive × additive, additive × dominance, dominance × additive, and dominance × dominance) based on the extended Kempthorne model. EPISNPmpi is the parallel computing program for epistasis testing in large scale GWAS and achieved excellent scalability for large scale analysis and portability for various parallel computing platforms. EPISNP is the serial computing program based on the EPISNPmpi code for epistasis testing in small scale GWAS using commonly available operating systems and computer hardware. Three serial computing utility programs were developed for graphical viewing of test results and epistasis networks, and for estimating CPU time and disk space requirements. Conclusion The EPISNPmpi parallel computing program provides an effective computing tool for epistasis testing in large scale GWAS, and the epiSNP serial computing programs are convenient tools for epistasis analysis in small scale GWAS using commonly available computer hardware. PMID:18644146
Association-heterogeneity mapping identifies an Asian-specific association of the GTF2I locus with rheumatoid arthritis

PubMed Central

Kim, Kwangwoo; Bang, So-Young; Ikari, Katsunori; Yoo, Dae Hyun; Cho, Soo-Kyung; Choi, Chan-Bum; Sung, Yoon-Kyoung; Kim, Tae-Hwan; Jun, Jae-Bum; Kang, Young Mo; Suh, Chang-Hee; Shim, Seung-Cheol; Lee, Shin-Seok; Lee, Jisoo; Chung, Won Tae; Kim, Seong-Kyu; Choe, Jung-Yoon; Momohara, Shigeki; Taniguchi, Atsuo; Yamanaka, Hisashi; Nath, Swapan K.; Lee, Hye-Soon; Bae, Sang-Cheol

2016-01-01

Considerable sharing of disease alleles among populations is well-characterized in autoimmune disorders (e.g., rheumatoid arthritis), but there are some exceptional loci showing heterogenic association among populations. Here we investigated genetic variants with distinct effects on the development of rheumatoid arthritis in Asian and European populations. Ancestry-related association heterogeneity was examined using Cochran’s homogeneity tests for the disease association data from large Asian (n = 14,465; 9,299 discovery subjects and 5,166 validation subjects; 4 collections) and European (n = 45,790; 11 collections) rheumatoid arthritis case-control cohorts with Immunochip and genome-wide SNP array data. We identified significant heterogeneity between the two ancestries for the common variants in the GTF2I locus (PHeterogeneity = 9.6 × 10−9 at rs73366469) and showed that this heterogeneity was due to an Asian-specific association effect (ORMeta = 1.37 and PMeta = 4.2 × 10−13 in Asians; ORMeta = 1.00 and PMeta = 1.00 in Europeans). Trans-ancestral comparison and bioinfomatics analysis revealed a plausibly causal or disease-variant-tagging SNP (rs117026326; in linkage disequilibrium with rs73366469), whose minor allele is common in Asians but rare in Europeans. In conclusion, we identified largest-ever effect on Asian rheumatoid arthritis across human non-HLA regions at GTF2I by heterogeneity mapping followed by replication studies, and pinpointed a possible causal variant. PMID:27272985
The Genetic Architecture of Adaptations to High Altitude in Ethiopia

PubMed Central

Alkorta-Aranburu, Gorka; Beall, Cynthia M.; Witonsky, David B.; Gebremedhin, Amha; Pritchard, Jonathan K.; Di Rienzo, Anna

2012-01-01

Although hypoxia is a major stress on physiological processes, several human populations have survived for millennia at high altitudes, suggesting that they have adapted to hypoxic conditions. This hypothesis was recently corroborated by studies of Tibetan highlanders, which showed that polymorphisms in candidate genes show signatures of natural selection as well as well-replicated association signals for variation in hemoglobin levels. We extended genomic analysis to two Ethiopian ethnic groups: Amhara and Oromo. For each ethnic group, we sampled low and high altitude residents, thus allowing genetic and phenotypic comparisons across altitudes and across ethnic groups. Genome-wide SNP genotype data were collected in these samples by using Illumina arrays. We find that variants associated with hemoglobin variation among Tibetans or other variants at the same loci do not influence the trait in Ethiopians. However, in the Amhara, SNP rs10803083 is associated with hemoglobin levels at genome-wide levels of significance. No significant genotype association was observed for oxygen saturation levels in either ethnic group. Approaches based on allele frequency divergence did not detect outliers in candidate hypoxia genes, but the most differentiated variants between high- and lowlanders have a clear role in pathogen defense. Interestingly, a significant excess of allele frequency divergence was consistently detected for genes involved in cell cycle control and DNA damage and repair, thus pointing to new pathways for high altitude adaptations. Finally, a comparison of CpG methylation levels between high- and lowlanders found several significant signals at individual genes in the Oromo. PMID:23236293
Alterations of LKB1 and KRAS and risk of brain metastasis: comprehensive characterization by mutation analysis, copy number, and gene expression in non-small-cell lung carcinoma.

PubMed

Zhao, Ni; Wilkerson, Matthew D; Shah, Usman; Yin, Xiaoying; Wang, Anyou; Hayward, Michele C; Roberts, Patrick; Lee, Carrie B; Parsons, Alden M; Thorne, Leigh B; Haithcock, Benjamin E; Grilley-Olson, Juneko E; Stinchcombe, Thomas E; Funkhouser, William K; Wong, Kwok-Kin; Sharpless, Norman E; Hayes, D Neil

2014-11-01

Brain metastases are one of the most malignant complications of lung cancer and constitute a significant cause of cancer related morbidity and mortality worldwide. Recent years of investigation suggested a role of LKB1 in NSCLC development and progression, in synergy with KRAS alteration. In this study, we systematically analyzed how LKB1 and KRAS alteration, measured by mutation, gene expression (GE) and copy number (CN), are associated with brain metastasis in NSCLC. Patients treated at University of North Carolina Hospital from 1990 to 2009 with NSCLC provided frozen, surgically extracted tumors for analysis. GE was measured using Agilent 44,000 custom-designed arrays, CN was assessed by Affymetrix GeneChip Human Mapping 250K Sty Array or the Genome-Wide Human SNP Array 6.0 and gene mutation was detected using ABI sequencing. Integrated analysis was conducted to assess the relationship between these genetic markers and brain metastasis. A model was proposed for brain metastasis prediction using these genetic measurements. 17 of the 174 patients developed brain metastasis. LKB1 wild type tumors had significantly higher LKB1 CN (p<0.001) and GE (p=0.002) than the LKB1 mutant group. KRAS wild type tumors had significantly lower KRAS GE (p<0.001) and lower CN, although the latter failed to be significant (p=0.295). Lower LKB1 CN (p=0.039) and KRAS mutation (p=0.007) were significantly associated with more brain metastasis. The predictive model based on nodal (N) stage, patient age, LKB1 CN and KRAS mutation had a good prediction accuracy, with area under the ROC curve of 0.832 (p<0.001). LKB1 CN in combination with KRAS mutation predicted brain metastasis in NSCLC. Copyright © 2014 The Authors. Published by Elsevier Ireland Ltd.. All rights reserved.

SNP discovery through de novo deep sequencing using the next generation of DNA sequencers

USDA-ARS?s Scientific Manuscript database

The production of high volumes of DNA sequence data using new technologies has permitted more efficient identification of single nucleotide polymorphisms in vertebrate genomes. This chapter presented practical methodology for production and analysis of DNA sequence data for SNP discovery....
An integrated genetic map based on four mapping populations and quantitative trait loci associated with economically important traits in watermelon (Citrullus lanatus)

PubMed Central

2014-01-01

Background Modern watermelon (Citrullus lanatus L.) cultivars share a narrow genetic base due to many years of selection for desirable horticultural qualities. Wild subspecies within C. lanatus are important potential sources of novel alleles for watermelon breeding, but successful trait introgression into elite cultivars has had limited success. The application of marker assisted selection (MAS) in watermelon is yet to be realized, mainly due to the past lack of high quality genetic maps. Recently, a number of useful maps have become available, however these maps have few common markers, and were constructed using different marker sets, thus, making integration and comparative analysis among maps difficult. The objective of this research was to use single-nucleotide polymorphism (SNP) anchor markers to construct an integrated genetic map for C. lanatus. Results Under the framework of the high density genetic map, an integrated genetic map was constructed by merging data from four independent mapping experiments using a genetically diverse array of parental lines, which included three subspecies of watermelon. The 698 simple sequence repeat (SSR), 219 insertion-deletion (InDel), 36 structure variation (SV) and 386 SNP markers from the four maps were used to construct an integrated map. This integrated map contained 1339 markers, spanning 798 cM with an average marker interval of 0.6 cM. Fifty-eight previously reported quantitative trait loci (QTL) for 12 traits in these populations were also integrated into the map. In addition, new QTL identified for brix, fructose, glucose and sucrose were added. Some QTL associated with economically important traits detected in different genetic backgrounds mapped to similar genomic regions of the integrated map, suggesting that such QTL are responsible for the phenotypic variability observed in a broad array of watermelon germplasm. Conclusions The integrated map described herein enhances the utility of genomic tools over previous watermelon genetic maps. A large proportion of the markers in the integrated map are SSRs, InDels and SNPs, which are easily transferable across laboratories. Moreover, the populations used to construct the integrated map include all three watermelon subspecies, making this integrated map useful for the selection of breeding traits, identification of QTL, MAS, analysis of germplasm and commercial hybrid seed detection. PMID:24443961
Assay for identification of heterozygous single-nucleotide polymorphism (Ala67Thr) in human poliovirus receptor gene.

PubMed

Nandi, Shyam Sundar; Sharma, Deepa Kailash; Deshpande, Jagadish M

2016-07-01

It is important to understand the role of cell surface receptors in susceptibility to infectious diseases. CD155 a member of the immunoglobulin super family, serves as the poliovirus receptor (PVR). Heterozygous (Ala67Thr) polymorphism in CD155 has been suggested as a risk factor for paralytic outcome of poliovirus infection. The present study pertains to the development of a screening test to detect the single nucleotide (SNP) polymorphism in the CD155 gene. New primers were designed for PCR, sequencing and SNP analysis of Exon2 of CD155 gene. DNAs extracted from either whole blood (n=75) or cells from oral cavity (n=75) were used for standardization and validation of the SNP assay. DNA sequencing was used as the gold standard method. A new SNP assay for detection of heterozygous Ala67Thr genotype was developed and validated by testing 150 DNA samples. Heterozygous CD155 was detected in 27.33 per cent (41/150) of DNA samples tested by both SNP detection assay and sequencing. The SNP detection assay was successfully developed for identification of Ala67Thr polymorphism in human PVR/CD155 gene. The SNP assay will be useful for large scale screening of DNA samples.
Rapid Detection of Rare Deleterious Variants by Next Generation Sequencing with Optional Microarray SNP Genotype Data

PubMed Central

Watson, Christopher M.; Crinnion, Laura A.; Gurgel‐Gianetti, Juliana; Harrison, Sally M.; Daly, Catherine; Antanavicuite, Agne; Lascelles, Carolina; Markham, Alexander F.; Pena, Sergio D. J.; Bonthron, David T.

2015-01-01

ABSTRACT Autozygosity mapping is a powerful technique for the identification of rare, autosomal recessive, disease‐causing genes. The ease with which this category of disease gene can be identified has greatly increased through the availability of genome‐wide SNP genotyping microarrays and subsequently of exome sequencing. Although these methods have simplified the generation of experimental data, its analysis, particularly when disparate data types must be integrated, remains time consuming. Moreover, the huge volume of sequence variant data generated from next generation sequencing experiments opens up the possibility of using these data instead of microarray genotype data to identify disease loci. To allow these two types of data to be used in an integrated fashion, we have developed AgileVCFMapper, a program that performs both the mapping of disease loci by SNP genotyping and the analysis of potentially deleterious variants using exome sequence variant data, in a single step. This method does not require microarray SNP genotype data, although analysis with a combination of microarray and exome genotype data enables more precise delineation of disease loci, due to superior marker density and distribution. PMID:26037133
A response to Yu et al. "A forward-backward fragment assembling algorithm for the identification of genomic amplification and deletion breakpoints using high-density single nucleotide polymorphism (SNP) array", BMC Bioinformatics 2007, 8: 145.

PubMed

Rueda, Oscar M; Diaz-Uriarte, Ramon

2007-10-16

Yu et al. (BMC Bioinformatics 2007,8: 145+) have recently compared the performance of several methods for the detection of genomic amplification and deletion breakpoints using data from high-density single nucleotide polymorphism arrays. One of the methods compared is our non-homogenous Hidden Markov Model approach. Our approach uses Markov Chain Monte Carlo for inference, but Yu et al. ran the sampler for a severely insufficient number of iterations for a Markov Chain Monte Carlo-based method. Moreover, they did not use the appropriate reference level for the non-altered state. We rerun the analysis in Yu et al. using appropriate settings for both the Markov Chain Monte Carlo iterations and the reference level. Additionally, to show how easy it is to obtain answers to additional specific questions, we have added a new analysis targeted specifically to the detection of breakpoints. The reanalysis shows that the performance of our method is comparable to that of the other methods analyzed. In addition, we can provide probabilities of a given spot being a breakpoint, something unique among the methods examined. Markov Chain Monte Carlo methods require using a sufficient number of iterations before they can be assumed to yield samples from the distribution of interest. Running our method with too small a number of iterations cannot be representative of its performance. Moreover, our analysis shows how our original approach can be easily adapted to answer specific additional questions (e.g., identify edges).
A bioinformatic pipeline for identifying informative SNP panels for parentage assignment from RADseq data.

PubMed

Andrews, Kimberly R; Adams, Jennifer R; Cassirer, E Frances; Plowright, Raina K; Gardner, Colby; Dwire, Maggie; Hohenlohe, Paul A; Waits, Lisette P

2018-06-05

The development of high-throughput sequencing technologies is dramatically increasing the use of single nucleotide polymorphisms (SNPs) across the field of genetics, but most parentage studies of wild populations still rely on microsatellites. We developed a bioinformatic pipeline for identifying SNP panels that are informative for parentage analysis from restriction site-associated DNA sequencing (RADseq) data. This pipeline includes options for analysis with or without a reference genome, and provides methods to maximize genotyping accuracy and select sets of unlinked loci that have high statistical power. We test this pipeline on small populations of Mexican gray wolf and bighorn sheep, for which parentage analyses are expected to be challenging due to low genetic diversity and the presence of many closely related individuals. We compare the results of parentage analysis across SNP panels generated with or without the use of a reference genome, and between SNPs and microsatellites. For Mexican gray wolf, we conducted parentage analyses for 30 pups from a single cohort where samples were available from 64% of possible mothers and 53% of possible fathers, and the accuracy of parentage assignments could be estimated because true identities of parents were known a priori based on field data. For bighorn sheep, we conducted maternity analyses for 39 lambs from five cohorts where 77% of possible mothers were sampled, but true identities of parents were unknown. Analyses with and without a reference genome produced SNP panels with >95% parentage assignment accuracy for Mexican gray wolf, outperforming microsatellites at 78% accuracy. Maternity assignments were completely consistent across all SNP panels for the bighorn sheep, and were 74.4% consistent with assignments from microsatellites. Accuracy and consistency of parentage analysis were not reduced when using as few as 284 SNPs for Mexican gray wolf and 142 SNPs for bighorn sheep, indicating our pipeline can be used to develop SNP genotyping assays for parentage analysis with relatively small numbers of loci. This article is protected by copyright. All rights reserved. This article is protected by copyright. All rights reserved.
TP53 and MDM2 single nucleotide polymorphisms influence survival in non-del(5q) myelodysplastic syndromes

PubMed Central

Sallman, David A.; Basiorka, Ashley A.; Irvine, Brittany A.; Zhang, Ling; Epling-Burnette, P.K.; Rollison, Dana E.; Mallo, Mar; Sokol, Lubomir; Solé, Francesc; Maciejewski, Jaroslaw; List, Alan F.

2015-01-01

P53 is a key regulator of many cellular processes and is negatively regulated by the human homolog of murine double minute-2 (MDM2) E3 ubiquitin ligase. Single nucleotide polymorphisms (SNPs) of either gene alone, and in combination, are linked to cancer susceptibility, disease progression, and therapy response. We analyzed the interaction of TP53 R72P and MDM2 SNP309 SNPs in relationship to outcome in patients with myelodysplastic syndromes (MDS). Sanger sequencing was performed on DNA isolated from 208 MDS cases. Utilizing a novel functional SNP scoring system ranging from +2 to −2 based on predicted p53 activity, we found statistically significant differences in overall survival (OS) (p = 0.02) and progression-free survival (PFS) (p = 0.02) in non-del(5q) MDS patients with low functional scores. In univariate analysis, only IPSS and the functional SNP score predicted OS and PFS in non-del(5q) patients. In multivariate analysis, the functional SNP score was independent of IPSS for OS and PFS. These data underscore the importance of TP53 R72P and MDM2 SNP309 SNPs in MDS, and provide a novel scoring system independent of IPSS that is predictive for disease outcome. PMID:26416416
Novel strategies to mine alcoholism-related haplotypes and genes by combining existing knowledge framework.

PubMed

Zhang, RuiJie; Li, Xia; Jiang, YongShuai; Liu, GuiYou; Li, ChuanXing; Zhang, Fan; Xiao, Yun; Gong, BinSheng

2009-02-01

High-throughout single nucleotide polymorphism detection technology and the existing knowledge provide strong support for mining the disease-related haplotypes and genes. In this study, first, we apply four kinds of haplotype identification methods (Confidence Intervals, Four Gamete Tests, Solid Spine of LD and fusing method of haplotype block) into high-throughout SNP genotype data to identify blocks, then use cluster analysis to verify the effectiveness of the four methods, and select the alcoholism-related SNP haplotypes through risk analysis. Second, we establish a mapping from haplotypes to alcoholism-related genes. Third, we inquire NCBI SNP and gene databases to locate the blocks and identify the candidate genes. In the end, we make gene function annotation by KEGG, Biocarta, and GO database. We find 159 haplotype blocks, which relate to the alcoholism most possibly on chromosome 1 approximately 22, including 227 haplotypes, of which 102 SNP haplotypes may increase the risk of alcoholism. We get 121 alcoholism-related genes and verify their reliability by the functional annotation of biology. In a word, we not only can handle the SNP data easily, but also can locate the disease-related genes precisely by combining our novel strategies of mining alcoholism-related haplotypes and genes with existing knowledge framework.
SNP-based genotyping in lentil: linking sequence information with phenotypes

USDA-ARS?s Scientific Manuscript database

Lentil (Lens culinaris) has been late to enter the world of high throughput molecular analysis due to a general lack of genomic resources. Using a 454 sequencing-based approach, SNPs have been identified in genes across the lentil genome. Several hundred have been turned into single SNP KASP assay...
A Larger Chocolate Chip-Development of a 15K Theobroma cacao L. SNP Array to Create High-Density Linkage Maps.

PubMed

Livingstone, Donald; Stack, Conrad; Mustiga, Guiliana M; Rodezno, Dayana C; Suarez, Carmen; Amores, Freddy; Feltus, Frank A; Mockaitis, Keithanne; Cornejo, Omar E; Motamayor, Juan C

2017-01-01

Cacao ( Theobroma cacao L.) is an important cash crop in tropical regions around the world and has a rich agronomic history in South America. As a key component in the cosmetic and confectionary industries, millions of people worldwide use products made from cacao, ranging from shampoo to chocolate. An Illumina Infinity II array was created using 13,530 SNPs identified within a small diversity panel of cacao. Of these SNPs, 12,643 derive from variation within annotated cacao genes. The genotypes of 3,072 trees were obtained, including two mapping populations from Ecuador. High-density linkage maps for these two populations were generated and compared to the cacao genome assembly. Phenotypic data from these populations were combined with the linkage maps to identify the QTLs for yield and disease resistance.
Genovar: a detection and visualization tool for genomic variants.

PubMed

Jung, Kwang Su; Moon, Sanghoon; Kim, Young Jin; Kim, Bong-Jo; Park, Kiejung

2012-05-08

Along with single nucleotide polymorphisms (SNPs), copy number variation (CNV) is considered an important source of genetic variation associated with disease susceptibility. Despite the importance of CNV, the tools currently available for its analysis often produce false positive results due to limitations such as low resolution of array platforms, platform specificity, and the type of CNV. To resolve this problem, spurious signals must be separated from true signals by visual inspection. None of the previously reported CNV analysis tools support this function and the simultaneous visualization of comparative genomic hybridization arrays (aCGH) and sequence alignment. The purpose of the present study was to develop a useful program for the efficient detection and visualization of CNV regions that enables the manual exclusion of erroneous signals. A JAVA-based stand-alone program called Genovar was developed. To ascertain whether a detected CNV region is a novel variant, Genovar compares the detected CNV regions with previously reported CNV regions using the Database of Genomic Variants (DGV, http://projects.tcag.ca/variation) and the Single Nucleotide Polymorphism Database (dbSNP). The current version of Genovar is capable of visualizing genomic data from sources such as the aCGH data file and sequence alignment format files. Genovar is freely accessible and provides a user-friendly graphic user interface (GUI) to facilitate the detection of CNV regions. The program also provides comprehensive information to help in the elimination of spurious signals by visual inspection, making Genovar a valuable tool for reducing false positive CNV results. http://genovar.sourceforge.net/.
Design and Validation of a New MLPA-Based Assay for the Detection of RS1 Gene Deletions and Application in a Large Family with X-Linked Juvenile Retinoschisis.

PubMed

Nicoletti, Annalisa; Ziccardi, Lucia; Maltese, Paolo Enrico; Benedetti, Sabrina; Palumbo, Orazio; Rendina, Michelina; D'Agruma, Leonardo; Falsini, Benedetto; Wang, Xinjing; Bertelli, Matteo

2017-02-01

X-linked juvenile retinoschisis (XLRS) is a severe ocular disorder that can evolve to blindness. More than 200 different disease-causing mutations have been reported in the RS1 gene and approximately 10% of these are deletions. Since transmission is X-linked, males are always affected and females are usually carriers. The identification of female carriers is always important and poses a technical challenge. Therefore, we sought to develop a multiplex ligation dependent probe amplification (MLPA)-based method to identify deletions or duplications in this gene. We then used our assay to study a large XLRS family. We designed six probes specific for each RS1 exon and then optimized and validated our method using control samples with known gene deletions. In the XLRS family, RS1 gene copy number variation was assessed by "home-made" MLPA analysis and by single nucleotide polymorphism (SNP) array analysis using the CytoScan HD Array. Direct sequencing was used for deletion breakpoint mapping. Our assay detected all deletions in control samples. All affected males of the family were positive for a deletion of exon 2 of the RS1 gene (RS1:NM_000330:c.53-?_78+?del). Carrier females were also identified. Our method is easily replicated, reliable, and inexpensive and allows female carriers to be detected. This is the first report of deep characterization of a whole exon deletion in the RS1 gene.
Molecular and genealogical analysis of grain dormancy in Japanese wheat varieties, with specific focus on MOTHER OF FT AND TFL1 on chromosome 3A

PubMed Central

Chono, Makiko; Matsunaka, Hitoshi; Seki, Masako; Fujita, Masaya; Kiribuchi-Otobe, Chikako; Oda, Shunsuke; Kojima, Hisayo; Nakamura, Shingo

2015-01-01

In the wheat (Triticum aestivum L.) cultivar ‘Zenkoujikomugi’, a single nucleotide polymorphism (SNP) in the promoter of MOTHER OF FT AND TFL1 on chromosome 3A (MFT-3A) causes an increase in the level of gene expression, resulting in strong grain dormancy. We used a DNA marker to detect the ‘Zenkoujikomugi’-type (Zen-type) SNP and examined the genotype of MFT-3A in Japanese wheat varieties, and we found that 169 of 324 varieties carry the Zen-type SNP. In Japanese commercial varieties, the frequency of the Zen-type SNP was remarkably high in the southern part of Japan, but low in the northern part. To examine the relationship between MFT-3A genotype and grain dormancy, we performed a germination assay in three wheat-growing seasons. On average, the varieties carrying the Zen-type SNP showed stronger grain dormancy than the varieties carrying the non-Zen-type SNP. Among commercial cultivars, ‘Iwainodaichi’ (Kyushu), ‘Junreikomugi’ (Kinki-Chugoku-Shikoku), ‘Kinuhime’ (Kanto-Tokai), ‘Nebarigoshi’ (Tohoku-Hokuriku), and ‘Kitamoe’ (Hokkaido) showed the strongest grain dormancy in each geographical group, and all these varieties, except for ‘Kitamoe’, were found to carry the Zen-type SNP. In recent years, the number of varieties carrying the Zen-type SNP has increased in the Tohoku-Hokuriku region, but not in the Hokkaido region. PMID:25931984
Polymorphism in ovine ANXA9 gene and physic-chemical properties and the fraction of protein in milk.

PubMed

Pecka-Kiełb, Ewa; Czerniawska-Piątkowska, Ewa; Kowalewska-Łuczak, Inga; Vasil, Milan

2018-04-16

Annexin A9 (ANXA9) is a specific fatty acid transport protein. ANXA9 gene is expressed in various tissues, including secretory tissue and mammary glands. The association between three SNPs of the ANXA9 gene and sheep's milk compositions was assessed. Genotype analysis was performed with the use of PCR-RFLP method. The studied ANXA9 polymorphisms had the following MAF (Major Allele Frequency): SNP1: allele G 0,66; SNP2: allele G 0,54; SNP3: allele C 0,57. The study found the most desired profile of protein fractions, namely an increased kappa-casein fractions and a decreased level of whey protein in sheep's milk for SNP1 and SNP3 polymorphisms. Sheep with the SNP1 GA genotype had the highest (P <0.05) content of fat and dry matter in milk. AXNA9 gene polymorphism did not influence the levels of protein, lactose or urea in sheep's milk. The information contained in this study may be useful for determining the impact of the ANXA9 gene on sheep's milk. The ANXA9 SNP1 and SNP3 polymorphisms results could be included in the breeding programs to select the sheep with the genotypes ensuring the highest kappa-casein levels in milk. However, it is worth conducting further research on ANXA9 and milk composition in larger herds of animals and various breeds of sheep. This article is protected by copyright. All rights reserved.
Centromere Locations in Brassica A and C Genomes Revealed Through Half-Tetrad Analysis

PubMed Central

Mason, Annaliese S.; Rousseau-Gueutin, Mathieu; Morice, Jérôme; Bayer, Philipp E.; Besharat, Naghmeh; Cousin, Anouska; Pradhan, Aneeta; Parkin, Isobel A. P.; Chèvre, Anne-Marie; Batley, Jacqueline; Nelson, Matthew N.

2016-01-01

Locating centromeres on genome sequences can be challenging. The high density of repetitive elements in these regions makes sequence assembly problematic, especially when using short-read sequencing technologies. It can also be difficult to distinguish between active and recently extinct centromeres through sequence analysis. An effective solution is to identify genetically active centromeres (functional in meiosis) by half-tetrad analysis. This genetic approach involves detecting heterozygosity along chromosomes in segregating populations derived from gametes (half-tetrads). Unreduced gametes produced by first division restitution mechanisms comprise complete sets of nonsister chromatids. Along these chromatids, heterozygosity is maximal at the centromeres, and homologous recombination events result in homozygosity toward the telomeres. We genotyped populations of half-tetrad-derived individuals (from Brassica interspecific hybrids) using a high-density array of physically anchored SNP markers (Illumina Brassica 60K Infinium array). Mapping the distribution of heterozygosity in these half-tetrad individuals allowed the genetic mapping of all 19 centromeres of the Brassica A and C genomes to the reference Brassica napus genome. Gene and transposable element density across the B. napus genome were also assessed and corresponded well to previously reported genetic map positions. Known centromere-specific sequences were located in the reference genome, but mostly matched unanchored sequences, suggesting that the core centromeric regions may not yet be assembled into the pseudochromosomes of the reference genome. The increasing availability of genetic markers physically anchored to reference genomes greatly simplifies the genetic and physical mapping of centromeres using half-tetrad analysis. We discuss possible applications of this approach, including in species where half-tetrads are currently difficult to isolate. PMID:26614742
Centromere Locations in Brassica A and C Genomes Revealed Through Half-Tetrad Analysis.

PubMed

Mason, Annaliese S; Rousseau-Gueutin, Mathieu; Morice, Jérôme; Bayer, Philipp E; Besharat, Naghmeh; Cousin, Anouska; Pradhan, Aneeta; Parkin, Isobel A P; Chèvre, Anne-Marie; Batley, Jacqueline; Nelson, Matthew N

2016-02-01

Locating centromeres on genome sequences can be challenging. The high density of repetitive elements in these regions makes sequence assembly problematic, especially when using short-read sequencing technologies. It can also be difficult to distinguish between active and recently extinct centromeres through sequence analysis. An effective solution is to identify genetically active centromeres (functional in meiosis) by half-tetrad analysis. This genetic approach involves detecting heterozygosity along chromosomes in segregating populations derived from gametes (half-tetrads). Unreduced gametes produced by first division restitution mechanisms comprise complete sets of nonsister chromatids. Along these chromatids, heterozygosity is maximal at the centromeres, and homologous recombination events result in homozygosity toward the telomeres. We genotyped populations of half-tetrad-derived individuals (from Brassica interspecific hybrids) using a high-density array of physically anchored SNP markers (Illumina Brassica 60K Infinium array). Mapping the distribution of heterozygosity in these half-tetrad individuals allowed the genetic mapping of all 19 centromeres of the Brassica A and C genomes to the reference Brassica napus genome. Gene and transposable element density across the B. napus genome were also assessed and corresponded well to previously reported genetic map positions. Known centromere-specific sequences were located in the reference genome, but mostly matched unanchored sequences, suggesting that the core centromeric regions may not yet be assembled into the pseudochromosomes of the reference genome. The increasing availability of genetic markers physically anchored to reference genomes greatly simplifies the genetic and physical mapping of centromeres using half-tetrad analysis. We discuss possible applications of this approach, including in species where half-tetrads are currently difficult to isolate. Copyright © 2016 by the Genetics Society of America.
The susceptibility of FSHB -211G > T and FSHR G-29A, 919A > G, 2039A > G polymorphisms to men infertility: an association study and meta-analysis.

PubMed

Wu, Qiuyue; Zhang, Jing; Zhu, Peiran; Jiang, Weijun; Liu, Shuaimei; Ni, Mengxia; Zhang, Mingchao; Li, Weiwei; Zhou, Qing; Cui, Yingxia; Xia, Xinyi

2017-08-01

Male infertility is a complex disorder caused by genetic, developmental, endocrine, or environmental factors as well as unknown etiology. Polymorphisms in the follicle stimulating hormone beta subunit (FSHB) (rs10835638, c.-211G > T) and follicle stimulating hormone receptor (FSHR) (rs1394205, c.-29G > A; rs6165, c.919A > G; rs6166, c.2039 A > G) genes might disturb normal spermatogenesis and affect male reproductive ability. To further ascertain the aforementioned effects, we conducted a case-control study of 255 infertile men and 340 fertile controls from South China using the Mass ARRAY method, which was analyzed by the t-tests and logistic regression analysis using SPSS for Windows 14.0. In addition, a meta-analysis was performed by combining our results with previous reports using STATA 12.0. In the FSHB or FSHR gene single nucleotide polymorphism (SNP) evaluation, no statistically-significant difference was found in the frequency of allelic variants or in genotype distribution between cases and controls. However, a significant association for the comparison of GAA (P: 0.022, OR: 0.63, 95%CI: 0.43-0.94) was seen between the oligozoospermia and controls in haplotype analysis of rs1394205/rs6165/rs6166. In the meta-analysis, rs6165G allele and rs6166 GG genotype were associated with increased risk of the male infertility. This study suggested that FSHR GAA haplotype would exert protective effects against male sterility, which indicated that the combination of three SNP genotypes of FSHR was predicted to have a much stronger impact than either one alone. Then in the meta-analysis, a significant association was seen between FSHR rs6165, rs6166 polymorphisms and male infertility. In terms of male infertility with multifactorial etiology, further studies with larger sample sizes and different ethnic backgrounds or other risk factors are warranted to clarify the potential role of FSHB and FSHR polymorphisms in the pathogenesis of male infertility.
Analysis of rs8067378 Polymorphism in the Risk of Uterine Cervical Cancer from a Polish Population and its Impact on Gasdermin B Expression.

PubMed

Lutkowska, Anna; Roszak, Andrzej; Lianeri, Margarita; Sowińska, Anna; Sotiri, Emianka; Jagodziński, Pawel P

2017-04-01

We studied the role of the NC_000017.10:g.38051348A>G (rs8067378) single nucleotide polymorphism (SNP) located 9.5 kb downstream of gasdermin B (GSDMB), in the development and progression of cervical squamous cell carcinomas (SCC). Using high-resolution melting curve analysis, we genotyped this SNP in patients with cervical SCC (n = 486) and controls (n = 511) from the Polish Caucasian population. Logistic regression analysis was used to adjust for the effect of confounders such as age, parity, oral contraceptive use, tobacco smoking, and menopausal status. The effect of this SNP on the expression of GSDMB was studied by reverse transcription and quantitative real-time polymerase chain reaction analysis of GSDMB transcript levels in SCC tissues. For all patients with SCC, the p trend value calculated for rs8067378 was statistically significant (p trend = 0.0019). The adjusted odds ratio for the G/G vs. A/A genotype was 1.304 (95% confidence interval 1.080-1.574, p = 0.0057) and the adjusted odds ratio for the G/A + G/G vs. A/A genotype was 1.444 (95% confidence interval 1.064-1.959, p = 0.0181). We also found a significant association of the rs8067378 SNP with tumor stages III, IV, and grade of differentiation G3, and with parity, oral contraceptive use, smoking, and women of postmenopausal age. We found increased GSDMB1 isoform transcripts in the cancerous and non-cancerous tissues from carriers of the G allele vs. carriers of the A/A genotype. The rs8067378 SNP variants may increase the expression of GSDMB and the risk of the development and progression of cervical SCC.
Association of genetic variation with systolic and diastolic blood pressure among African Americans: the Candidate Gene Association Resource study

PubMed Central

Fox, Ervin R.; Young, J. Hunter; Li, Yali; Dreisbach, Albert W.; Keating, Brendan J.; Musani, Solomon K.; Liu, Kiang; Morrison, Alanna C.; Ganesh, Santhi; Kutlar, Abdullah; Ramachandran, Vasan S.; Polak, Josef F.; Fabsitz, Richard R.; Dries, Daniel L.; Farlow, Deborah N.; Redline, Susan; Adeyemo, Adebowale; Hirschorn, Joel N.; Sun, Yan V.; Wyatt, Sharon B.; Penman, Alan D.; Palmas, Walter; Rotter, Jerome I.; Townsend, Raymond R.; Doumatey, Ayo P.; Tayo, Bamidele O.; Mosley, Thomas H.; Lyon, Helen N.; Kang, Sun J.; Rotimi, Charles N.; Cooper, Richard S.; Franceschini, Nora; Curb, J. David; Martin, Lisa W.; Eaton, Charles B.; Kardia, Sharon L.R.; Taylor, Herman A.; Caulfield, Mark J.; Ehret, Georg B.; Johnson, Toby; Chakravarti, Aravinda; Zhu, Xiaofeng; Levy, Daniel; Munroe, Patricia B.; Rice, Kenneth M.; Bochud, Murielle; Johnson, Andrew D.; Chasman, Daniel I.; Smith, Albert V.; Tobin, Martin D.; Verwoert, Germaine C.; Hwang, Shih-Jen; Pihur, Vasyl; Vollenweider, Peter; O'Reilly, Paul F.; Amin, Najaf; Bragg-Gresham, Jennifer L.; Teumer, Alexander; Glazer, Nicole L.; Launer, Lenore; Zhao, Jing Hua; Aulchenko, Yurii; Heath, Simon; Sõber, Siim; Parsa, Afshin; Luan, Jian'an; Arora, Pankaj; Dehghan, Abbas; Zhang, Feng; Lucas, Gavin; Hicks, Andrew A.; Jackson, Anne U.; Peden, John F.; Tanaka, Toshiko; Wild, Sarah H.; Rudan, Igor; Igl, Wilmar; Milaneschi, Yuri; Parker, Alex N.; Fava, Cristiano; Chambers, John C.; Kumari, Meena; JinGo, Min; van der Harst, Pim; Kao, Wen Hong Linda; Sjögren, Marketa; Vinay, D.G.; Alexander, Myriam; Tabara, Yasuharu; Shaw-Hawkins, Sue; Whincup, Peter H.; Liu, Yongmei; Shi, Gang; Kuusisto, Johanna; Seielstad, Mark; Sim, Xueling; Nguyen, Khanh-Dung Hoang; Lehtimäki, Terho; Matullo, Giuseppe; Wu, Ying; Gaunt, Tom R.; Charlotte Onland-Moret, N.; Cooper, Matthew N.; Platou, Carl G.P.; Org, Elin; Hardy, Rebecca; Dahgam, Santosh; Palmen, Jutta; Vitart, Veronique; Braund, Peter S.; Kuznetsova, Tatiana; Uiterwaal, Cuno S.P.M.; Campbell, Harry; Ludwig, Barbara; Tomaszewski, Maciej; Tzoulaki, Ioanna; Palmer, Nicholette D.; Aspelund, Thor; Garcia, Melissa; Chang, Yen-Pei C.; O'Connell, Jeffrey R.; Steinle, Nanette I.; Grobbee, Diederick E.; Arking, Dan E.; Hernandez, Dena; Najjar, Samer; McArdle, Wendy L.; Hadley, David; Brown, Morris J.; Connell, John M.; Hingorani, Aroon D.; Day, Ian N.M.; Lawlor, Debbie A.; Beilby, John P.; Lawrence, Robert W.; Clarke, Robert; Collins, Rory; Hopewell, Jemma C.; Ongen, Halit; Bis, Joshua C.; Kähönen, Mika; Viikari, Jorma; Adair, Linda S.; Lee, Nanette R.; Chen, Ming-Huei; Olden, Matthias; Pattaro, Cristian; Hoffman Bolton, Judith A.; Köttgen, Anna; Bergmann, Sven; Mooser, Vincent; Chaturvedi, Nish; Frayling, Timothy M.; Islam, Muhammad; Jafar, Tazeen H.; Erdmann, Jeanette; Kulkarni, Smita R.; Bornstein, Stefan R.; Grässler, Jürgen; Groop, Leif; Voight, Benjamin F.; Kettunen, Johannes; Howard, Philip; Taylor, Andrew; Guarrera, Simonetta; Ricceri, Fulvio; Emilsson, Valur; Plump, Andrew; Barroso, Inês; Khaw, Kay-Tee; Weder, Alan B.; Hunt, Steven C.; Bergman, Richard N.; Collins, Francis S.; Bonnycastle, Lori L.; Scott, Laura J.; Stringham, Heather M.; Peltonen, Leena; Perola, Markus; Vartiainen, Erkki; Brand, Stefan-Martin; Staessen, Jan A.; Wang, Thomas J.; Burton, Paul R.; SolerArtigas, Maria; Dong, Yanbin; Snieder, Harold; Wang, Xiaoling; Zhu, Haidong; Lohman, Kurt K.; Rudock, Megan E.; Heckbert, Susan R.; Smith, Nicholas L.; Wiggins, Kerri L.; Shriner, Daniel; Veldre, Gudrun; Viigimaa, Margus; Kinra, Sanjay; Prabhakaran, Dorairajan; Tripathy, Vikal; Langefeld, Carl D.; Rosengren, Annika; Thelle, Dag S.; MariaCorsi, Anna; Singleton, Andrew; Forrester, Terrence; Hilton, Gina; McKenzie, Colin A.; Salako, Tunde; Iwai, Naoharu; Kita, Yoshikuni; Ogihara, Toshio; Ohkubo, Takayoshi; Okamura, Tomonori; Ueshima, Hirotsugu; Umemura, Satoshi; Eyheramendy, Susana; Meitinger, Thomas; Wichmann, H.-Erich; Cho, Yoon Shin; Kim, Hyung-Lae; Lee, Jong-Young; Scott, James; Sehmi, Joban S.; Zhang, Weihua; Hedblad, Bo; Nilsson, Peter; Smith, George Davey; Wong, Andrew; Narisu, Narisu; Stančáková, Alena; Raffel, Leslie J.; Yao, Jie; Kathiresan, Sekar; O'Donnell, Chris; Schwartz, Steven M.; Arfan Ikram, M.; Longstreth, Will T.; Seshadri, Sudha; Shrine, Nick R.G.; Wain, Louise V.; Morken, Mario A.; Swift, Amy J.; Laitinen, Jaana; Prokopenko, Inga; Zitting, Paavo; Cooper, Jackie A.; Humphries, Steve E.; Danesh, John; Rasheed, Asif; Goel, Anuj; Hamsten, Anders; Watkins, Hugh; Bakker, Stephan J.L.; van Gilst, Wiek H.; Janipalli, Charles S.; Radha Mani, K.; Yajnik, Chittaranjan S.; Hofman, Albert; Mattace-Raso, Francesco U.S.; Oostra, Ben A.; Demirkan, Ayse; Isaacs, Aaron; Rivadeneira, Fernando; Lakatta, Edward G.; Orru, Marco; Scuteri, Angelo; Ala-Korpela, Mika; Kangas, Antti J.; Lyytikäinen, Leo-Pekka; Soininen, Pasi; Tukiainen, Taru; Würz, Peter; Twee-Hee Ong, Rick; Dörr, Marcus; Kroemer, Heyo K.; Völker, Uwe; Völzke, Henry; Galan, Pilar; Hercberg, Serge; Lathrop, Mark; Zelenika, Diana; Deloukas, Panos; Mangino, Massimo; Spector, Tim D.; Zhai, Guangju; Meschia, James F.; Nalls, Michael A.; Sharma, Pankaj; Terzic, Janos; Kranthi Kumar, M.J.; Denniff, Matthew; Zukowska-Szczechowska, Ewa; Wagenknecht, Lynne E.; Fowkes, Gerald R.; Charchar, Fadi J.; Schwarz, Peter E.H.; Hayward, Caroline; Guo, Xiuqing; Bots, Michiel L.; Brand, Eva; Samani, Nilesh J.; Polasek, Ozren; Talmud, Philippa J.; Nyberg, Fredrik; Kuh, Diana; Laan, Maris; Hveem, Kristian; Palmer, Lyle J.; van der Schouw, Yvonne T.; Casas, Juan P.; Mohlke, Karen L.; Vineis, Paolo; Raitakari, Olli; Wong, Tien Y.; Shyong Tai, E.; Laakso, Markku; Rao, Dabeeru C.; Harris, Tamara B.; Morris, Richard W.; Dominiczak, Anna F.; Kivimaki, Mika; Marmot, Michael G.; Miki, Tetsuro; Saleheen, Danish; Chandak, Giriraj R.; Coresh, Josef; Navis, Gerjan; Salomaa, Veikko; Han, Bok-Ghee; Kooner, Jaspal S.; Melander, Olle; Ridker, Paul M.; Bandinelli, Stefania; Gyllensten, Ulf B.; Wright, Alan F.; Wilson, James F.; Ferrucci, Luigi; Farrall, Martin; Tuomilehto, Jaakko; Pramstaller, Peter P.; Elosua, Roberto; Soranzo, Nicole; Sijbrands, Eric J.G.; Altshuler, David; Loos, Ruth J.F.; Shuldiner, Alan R.; Gieger, Christian; Meneton, Pierre; Uitterlinden, Andre G.; Wareham, Nicholas J.; Gudnason, Vilmundur; Rettig, Rainer; Uda, Manuela; Strachan, David P.; Witteman, Jacqueline C.M.; Hartikainen, Anna-Liisa; Beckmann, Jacques S.; Boerwinkle, Eric; Boehnke, Michael; Larson, Martin G.; Järvelin, Marjo-Riitta; Psaty, Bruce M.; Abecasis, Gonçalo R.; Elliott, Paul; van Duijn , Cornelia M.; Newton-Cheh, Christopher

2011-01-01

The prevalence of hypertension in African Americans (AAs) is higher than in other US groups; yet, few have performed genome-wide association studies (GWASs) in AA. Among people of European descent, GWASs have identified genetic variants at 13 loci that are associated with blood pressure. It is unknown if these variants confer susceptibility in people of African ancestry. Here, we examined genome-wide and candidate gene associations with systolic blood pressure (SBP) and diastolic blood pressure (DBP) using the Candidate Gene Association Resource (CARe) consortium consisting of 8591 AAs. Genotypes included genome-wide single-nucleotide polymorphism (SNP) data utilizing the Affymetrix 6.0 array with imputation to 2.5 million HapMap SNPs and candidate gene SNP data utilizing a 50K cardiovascular gene-centric array (ITMAT-Broad-CARe [IBC] array). For Affymetrix data, the strongest signal for DBP was rs10474346 (P= 3.6 × 10−8) located near GPR98 and ARRDC3. For SBP, the strongest signal was rs2258119 in C21orf91 (P= 4.7 × 10−8). The top IBC association for SBP was rs2012318 (P= 6.4 × 10−6) near SLC25A42 and for DBP was rs2523586 (P= 1.3 × 10−6) near HLA-B. None of the top variants replicated in additional AA (n = 11 882) or European-American (n = 69 899) cohorts. We replicated previously reported European-American blood pressure SNPs in our AA samples (SH2B3, P= 0.009; TBX3-TBX5, P= 0.03; and CSK-ULK3, P= 0.0004). These genetic loci represent the best evidence of genetic influences on SBP and DBP in AAs to date. More broadly, this work supports that notion that blood pressure among AAs is a trait with genetic underpinnings but also with significant complexity. PMID:21378095
Association of genetic variation with systolic and diastolic blood pressure among African Americans: the Candidate Gene Association Resource study.

PubMed

Fox, Ervin R; Young, J Hunter; Li, Yali; Dreisbach, Albert W; Keating, Brendan J; Musani, Solomon K; Liu, Kiang; Morrison, Alanna C; Ganesh, Santhi; Kutlar, Abdullah; Ramachandran, Vasan S; Polak, Josef F; Fabsitz, Richard R; Dries, Daniel L; Farlow, Deborah N; Redline, Susan; Adeyemo, Adebowale; Hirschorn, Joel N; Sun, Yan V; Wyatt, Sharon B; Penman, Alan D; Palmas, Walter; Rotter, Jerome I; Townsend, Raymond R; Doumatey, Ayo P; Tayo, Bamidele O; Mosley, Thomas H; Lyon, Helen N; Kang, Sun J; Rotimi, Charles N; Cooper, Richard S; Franceschini, Nora; Curb, J David; Martin, Lisa W; Eaton, Charles B; Kardia, Sharon L R; Taylor, Herman A; Caulfield, Mark J; Ehret, Georg B; Johnson, Toby; Chakravarti, Aravinda; Zhu, Xiaofeng; Levy, Daniel

2011-06-01

The prevalence of hypertension in African Americans (AAs) is higher than in other US groups; yet, few have performed genome-wide association studies (GWASs) in AA. Among people of European descent, GWASs have identified genetic variants at 13 loci that are associated with blood pressure. It is unknown if these variants confer susceptibility in people of African ancestry. Here, we examined genome-wide and candidate gene associations with systolic blood pressure (SBP) and diastolic blood pressure (DBP) using the Candidate Gene Association Resource (CARe) consortium consisting of 8591 AAs. Genotypes included genome-wide single-nucleotide polymorphism (SNP) data utilizing the Affymetrix 6.0 array with imputation to 2.5 million HapMap SNPs and candidate gene SNP data utilizing a 50K cardiovascular gene-centric array (ITMAT-Broad-CARe [IBC] array). For Affymetrix data, the strongest signal for DBP was rs10474346 (P= 3.6 × 10(-8)) located near GPR98 and ARRDC3. For SBP, the strongest signal was rs2258119 in C21orf91 (P= 4.7 × 10(-8)). The top IBC association for SBP was rs2012318 (P= 6.4 × 10(-6)) near SLC25A42 and for DBP was rs2523586 (P= 1.3 × 10(-6)) near HLA-B. None of the top variants replicated in additional AA (n = 11 882) or European-American (n = 69 899) cohorts. We replicated previously reported European-American blood pressure SNPs in our AA samples (SH2B3, P= 0.009; TBX3-TBX5, P= 0.03; and CSK-ULK3, P= 0.0004). These genetic loci represent the best evidence of genetic influences on SBP and DBP in AAs to date. More broadly, this work supports that notion that blood pressure among AAs is a trait with genetic underpinnings but also with significant complexity.

Some links on this page may take you to non-federal websites. Their policies may differ from this site.