Applications of microarray technology in breast cancer research
Cooper, Colin S
2001-01-01
Microarrays provide a versatile platform for utilizing information from the Human Genome Project to benefit human health. This article reviews the ways in which microarray technology may be used in breast cancer research. Its diverse applications include monitoring chromosome gains and losses, tumour classification, drug discovery and development, DNA resequencing, mutation detection and investigating the mechanism of tumour development. PMID:11305951
Wang, Zheng; Malanoski, Anthony P; Lin, Baochuan; Kidd, Carolyn; Long, Nina C; Blaney, Kate M; Thach, Dzung C; Tibbetts, Clark; Stenger, David A
2008-01-01
Background Febrile respiratory illness (FRI) has a high impact on public health and global economics and poses a difficult challenge for differential diagnosis. A particular issue is the detection of genetically diverse pathogens, i.e. human rhinoviruses (HRV) and enteroviruses (HEV) which are frequent causes of FRI. Resequencing Pathogen Microarray technology has demonstrated potential for differential diagnosis of several respiratory pathogens simultaneously, but a high confidence design method to select probes for genetically diverse viruses is lacking. Results Using HRV and HEV as test cases, we assess a general design strategy for detecting and serotyping genetically diverse viruses. A minimal number of probe sequences (26 for HRV and 13 for HEV), which were potentially capable of detecting all serotypes of HRV and HEV, were determined and implemented on the Resequencing Pathogen Microarray RPM-Flu v.30/31 (Tessarae RPM-Flu). The specificities of designed probes were validated using 34 HRV and 28 HEV strains. All strains were successfully detected and identified at least to species level. 33 HRV strains and 16 HEV strains could be further differentiated to serotype level. Conclusion This study provides a fundamental evaluation of simultaneous detection and differential identification of genetically diverse RNA viruses with a minimal number of prototype sequences. The results demonstrated that the newly designed RPM-Flu v.30/31 can provide comprehensive and specific analysis of HRV and HEV samples which implicates that this design strategy will be applicable for other genetically diverse viruses. PMID:19046445
Multiplex amplification of large sets of human exons.
Porreca, Gregory J; Zhang, Kun; Li, Jin Billy; Xie, Bin; Austin, Derek; Vassallo, Sara L; LeProust, Emily M; Peck, Bill J; Emig, Christopher J; Dahl, Fredrik; Gao, Yuan; Church, George M; Shendure, Jay
2007-11-01
A new generation of technologies is poised to reduce DNA sequencing costs by several orders of magnitude. But our ability to fully leverage the power of these technologies is crippled by the absence of suitable 'front-end' methods for isolating complex subsets of a mammalian genome at a scale that matches the throughput at which these platforms will routinely operate. We show that targeting oligonucleotides released from programmable microarrays can be used to capture and amplify approximately 10,000 human exons in a single multiplex reaction. Additionally, we show integration of this protocol with ultra-high-throughput sequencing for targeted variation discovery. Although the multiplex capture reaction is highly specific, we found that nonuniform capture is a key issue that will need to be resolved by additional optimization. We anticipate that highly multiplexed methods for targeted amplification will enable the comprehensive resequencing of human exons at a fraction of the cost of whole-genome resequencing.
NASA Astrophysics Data System (ADS)
Leski, T. A.; Ansumana, R.; Jimmy, D. H.; Bangura, U.; Malanoski, A. P.; Lin, B.; Stenger, D. A.
2011-06-01
Multiplexed microbial diagnostic assays are a promising method for detection and identification of pathogens causing syndromes characterized by nonspecific symptoms in which traditional differential diagnosis is difficult. Also such assays can play an important role in outbreak investigations and environmental screening for intentional or accidental release of biothreat agents, which requires simultaneous testing for hundreds of potential pathogens. The resequencing pathogen microarray (RPM) is an emerging technological platform, relying on a combination of massively multiplex PCR and high-density DNA microarrays for rapid detection and high-resolution identification of hundreds of infectious agents simultaneously. The RPM diagnostic system was deployed in Sierra Leone, West Africa in collaboration with Njala University and Mercy Hospital Research Laboratory located in Bo. We used the RPM-Flu microarray designed for broad-range detection of human respiratory pathogens, to investigate a suspected outbreak of avian influenza in a number of poultry farms in which significant mortality of chickens was observed. The microarray results were additionally confirmed by influenza specific real-time PCR. The results of the study excluded the possibility that the outbreak was caused by influenza, but implicated Klebsiella pneumoniae as a possible pathogen. The outcome of this feasibility study confirms that application of broad-spectrum detection platforms for outbreak investigation in low-resource locations is possible and allows for rapid discovery of the responsible agents, even in cases when different agents are suspected. This strategy enables quick and cost effective detection of low probability events such as outbreak of a rare disease or intentional release of a biothreat agent.
Lin, Baochuan; Malanoski, Anthony P.; Wang, Zheng; Blaney, Kate M.; Long, Nina C.; Meador, Carolyn E.; Metzgar, David; Myers, Christopher A.; Yingst, Samuel L.; Monteville, Marshall R.; Saad, Magdi D.; Schnur, Joel M.; Tibbetts, Clark; Stenger, David A.
2009-01-01
Zoonotic microbes have historically been, and continue to emerge as, threats to human health. The recent outbreaks of highly pathogenic avian influenza virus in bird populations and the appearance of some human infections have increased the concern of a possible new influenza pandemic, which highlights the need for broad-spectrum detection methods for rapidly identifying the spread or outbreak of all variants of avian influenza virus. In this study, we demonstrate that high-density resequencing pathogen microarrays (RPM) can be such a tool. The results from 37 influenza virus isolates show that the RPM platform is an effective means for detecting and subtyping influenza virus, while simultaneously providing sequence information for strain resolution, pathogenicity, and drug resistance without additional analysis. This study establishes that the RPM platform is a broad-spectrum pathogen detection and surveillance tool for monitoring the circulation of prevalent influenza viruses in the poultry industry and in wild birds or incidental exposures and infections in humans. PMID:19279171
Microarray data mining using Bioconductor packages.
Nie, Haisheng; Neerincx, Pieter B T; van der Poel, Jan; Ferrari, Francesco; Bicciato, Silvio; Leunissen, Jack A M; Groenen, Martien A M
2009-07-16
This paper describes the results of a Gene Ontology (GO) term enrichment analysis of chicken microarray data using the Bioconductor packages. By checking the enriched GO terms in three contrasts, MM8-PM8, MM8-MA8, and MM8-MM24, of the provided microarray data during this workshop, this analysis aimed to investigate the host reactions in chickens occurring shortly after a secondary challenge with either a homologous or heterologous species of Eimeria. The results of GO enrichment analysis using GO terms annotated to chicken genes and GO terms annotated to chicken-human orthologous genes were also compared. Furthermore, a locally adaptive statistical procedure (LAP) was performed to test differentially expressed chromosomal regions, rather than individual genes, in the chicken genome after Eimeria challenge. GO enrichment analysis identified significant (raw p-value < 0.05) GO terms for all three contrasts included in the analysis. Some of the GO terms linked to, generally, primary immune responses or secondary immune responses indicating the GO enrichment analysis is a useful approach to analyze microarray data. The comparisons of GO enrichment results using chicken gene information and chicken-human orthologous gene information showed more refined GO terms related to immune responses when using chicken-human orthologous gene information, this suggests that using chicken-human orthologous gene information has higher power to detect significant GO terms with more refined functionality. Furthermore, three chromosome regions were identified to be significantly up-regulated in contrast MM8-PM8 (q-value < 0.01). Overall, this paper describes a practical approach to analyze microarray data in farm animals where the genome information is still incomplete. For farm animals, such as chicken, with currently limited gene annotation, borrowing gene annotation information from orthologous genes in well-annotated species, such as human, will help improve the pathway analysis results substantially. Furthermore, LAP analysis approach is a relatively new and very useful way to be applied in microarray analysis.
Ladas, Ioannis; Fitarelli-Kiehl, Mariana; Song, Chen; Adalsteinsson, Viktor A; Parsons, Heather A; Lin, Nancy U; Wagle, Nikhil; Makrigiorgos, G Mike
2017-10-01
The use of clinical samples and circulating cell-free DNA (cfDNA) collected from liquid biopsies for diagnostic and prognostic applications in cancer is burgeoning, and improved methods that reduce the influence of excess wild-type (WT) portion of the sample are desirable. Here we present enrichment of mutation-containing sequences using enzymatic degradation of WT DNA. Mutation enrichment is combined with high-resolution melting (HRM) performed in multiplexed closed-tube reactions as a rapid, cost-effective screening tool before targeted resequencing. We developed a homogeneous, closed-tube approach to use a double-stranded DNA-specific nuclease for degradation of WT DNA at multiple targets simultaneously. The No Denaturation Nuclease-assisted Minor Allele Enrichment with Probe Overlap (ND-NaME-PrO) uses WT oligonucleotides overlapping both strands on putative DNA targets. Under conditions of partial denaturation (DNA breathing), the oligonucleotide probes enhance double-stranded DNA-specific nuclease digestion at the selected targets, with high preference toward WT over mutant DNA. To validate ND-NaME-PrO, we used multiplexed HRM, digital PCR, and MiSeq targeted resequencing of mutated genomic DNA and cfDNA. Serial dilution of KRAS mutation-containing DNA shows mutation enrichment by 10- to 120-fold and detection of allelic fractions down to 0.01%. Multiplexed ND-NaME-PrO combined with multiplexed PCR-HRM showed mutation scanning of 10-20 DNA amplicons simultaneously. ND-NaME-PrO applied on cfDNA from clinical samples enables mutation enrichment and HRM scanning over 10 DNA targets. cfDNA mutations were enriched up to approximately 100-fold (average approximately 25-fold) and identified via targeted resequencing. Closed-tube homogeneous ND-NaME-PrO combined with multiplexed HRM is a convenient approach to efficiently enrich for mutations on multiple DNA targets and to enable prescreening before targeted resequencing. © 2017 American Association for Clinical Chemistry.
Analysis of dust samples from the Middle East using high-density resequencing micro-array RPM-TEI
NASA Astrophysics Data System (ADS)
Leski, T. A.; Gregory, M. J.; Malanoski, A. P.; Smith, J. P.; Glaven, R. H.; Wang, Z.; Stenger, D. A.; Lin, B.
2010-04-01
A previously developed resequencing microarray, "Tropical and Emerging Infections (RPM-TEI v.1.0 chip)", designed to identify and discriminate between tropical diseases and other potential biothreat agents, their near-neighbor species, and/or potential confounders, was used to characterize the microbes present in the silt/clay fraction of surface soils and airborne dust collected from the Middle East. Local populations and U.S. military personnel deployed to the Middle East are regularly subjected to high levels of airborne desert dust containing a significant fraction of inhalable particles and some portion require clinical aid. Not all of the clinical symptoms can be directly attributed to the physical action of material in the human respiratory tract. To better understand the potential health effects of the airborne dust, the composition of the microbial communities associated with surface soil and/or airborne dust (air filter) samples from 19 different sites in Iraq and Kuwait was identified using RPM-TEI v.1.0. Results indicated that several microorganisms including a class of rapidly growing Mycobacterium, Bacillus, Brucella, Clostridium and Coxiella burnetti, were present in the samples. The presence of these organisms in the surface soils and the inhalable fraction of airborne dust analyzed may pose a human health risk and warrants further investigation. Better understanding of the factors influencing the composition of these microbial communities is important to address questions related to human health and is critical to achieving Force Health Protection for the Warfighter operating in the Middle East, Afghanistan, North Africa and other arid regions.
NASA Astrophysics Data System (ADS)
Tibbetts, Clark; Lichanska, Agnieszka M.; Borsuk, Lisa A.; Weslowski, Brian; Morris, Leah M.; Lorence, Matthew C.; Schafer, Klaus O.; Campos, Joseph; Sene, Mohamadou; Myers, Christopher A.; Faix, Dennis; Blair, Patrick J.; Brown, Jason; Metzgar, David
2010-04-01
High-density resequencing microarrays support simultaneous detection and identification of multiple viral and bacterial pathogens. Because detection and identification using RPM is based upon multiple specimen-specific target pathogen gene sequences generated in the individual test, the test results enable both a differential diagnostic analysis and epidemiological tracking of detected pathogen strains and variants from one specimen to the next. The RPM assay enables detection and identification of pathogen sequences that share as little as 80% sequence similarity to prototype target gene sequences represented as detector tiles on the array. This capability enables the RPM to detect and identify previously unknown strains and variants of a detected pathogen, as in sentinel cases associated with an infectious disease outbreak. We illustrate this capability using assay results from testing influenza A virus vaccines configured with strains that were first defined years after the design of the RPM microarray. Results are also presented from RPM-Flu testing of three specimens independently confirmed to the positive for the 2009 Novel H1N1 outbreak strain of influenza virus.
pyAmpli: an amplicon-based variant filter pipeline for targeted resequencing data.
Beyens, Matthias; Boeckx, Nele; Van Camp, Guy; Op de Beeck, Ken; Vandeweyer, Geert
2017-12-14
Haloplex targeted resequencing is a popular method to analyze both germline and somatic variants in gene panels. However, involved wet-lab procedures may introduce false positives that need to be considered in subsequent data-analysis. No variant filtering rationale addressing amplicon enrichment related systematic errors, in the form of an all-in-one package, exists to our knowledge. We present pyAmpli, a platform independent parallelized Python package that implements an amplicon-based germline and somatic variant filtering strategy for Haloplex data. pyAmpli can filter variants for systematic errors by user pre-defined criteria. We show that pyAmpli significantly increases specificity, without reducing sensitivity, essential for reporting true positive clinical relevant mutations in gene panel data. pyAmpli is an easy-to-use software tool which increases the true positive variant call rate in targeted resequencing data. It specifically reduces errors related to PCR-based enrichment of targeted regions.
Universal Detection and Identification of Avian Influenza Virus by Use of Resequencing Microarrays
2009-04-01
For the RT step, primer LN was replaced by primer NLN (a random 9-mer with a linker se- quence). One picogram each of two internal controls (NAC1...samples (data not shown). These data indicated that most of the avian H5N1 samples identified were presumably sensitive to neuraminidase inhibitors
Vanhomwegen, Jessica; Berthet, Nicolas; Mazuet, Christelle; Guigon, Ghislaine; Vallaeys, Tatiana; Stamboliyska, Rayna; Dubois, Philippe; Kennedy, Giulia C.; Cole, Stewart T.; Caro, Valérie; Manuguerra, Jean-Claude; Popoff, Michel-Robert
2013-01-01
Background Clostridium botulinum and related clostridia express extremely potent toxins known as botulinum neurotoxins (BoNTs) that cause severe, potentially lethal intoxications in humans. These BoNT-producing bacteria are categorized in seven major toxinotypes (A through G) and several subtypes. The high diversity in nucleotide sequence and genetic organization of the gene cluster encoding the BoNT components poses a great challenge for the screening and characterization of BoNT-producing strains. Methodology/Principal Findings In the present study, we designed and evaluated the performances of a resequencing microarray (RMA), the PathogenId v2.0, combined with an automated data approach for the simultaneous detection and characterization of BoNT-producing clostridia. The unique design of the PathogenID v2.0 array allows the simultaneous detection and characterization of 48 sequences targeting the BoNT gene cluster components. This approach allowed successful identification and typing of representative strains of the different toxinotypes and subtypes, as well as the neurotoxin-producing C. botulinum strain in a naturally contaminated food sample. Moreover, the method allowed fine characterization of the different neurotoxin gene cluster components of all studied strains, including genomic regions exhibiting up to 24.65% divergence with the sequences tiled on the arrays. Conclusions/Significance The severity of the disease demands rapid and accurate means for performing risk assessments of BoNT-producing clostridia and for tracing potentials sources of contamination in outbreak situations. The RMA approach constitutes an essential higher echelon component in a diagnostics and surveillance pipeline. In addition, it is an important asset to characterise potential outbreak related strains, but also environment isolates, in order to obtain a better picture of the molecular epidemiology of BoNT-producing clostridia. PMID:23818983
Arenas, Ailan F; Salcedo, Gladys E; Gomez-Marin, Jorge E
2017-01-01
Pathogen-host protein-protein interaction systems examine the interactions between the protein repertoires of 2 distinct organisms. Some of these pathogen proteins interact with the host protein system and may manipulate it for their own advantages. In this work, we designed an R script by concatenating 2 functions called rowDM and rowCVmed to infer pathogen-host interaction using previously reported microarray data, including host gene enrichment analysis and the crossing of interspecific domain-domain interactions. We applied this script to the Toxoplasma-host system to describe pathogen survival mechanisms from human, mouse, and Toxoplasma Gene Expression Omnibus series. Our outcomes exhibited similar results with previously reported microarray analyses, but we found other important proteins that could contribute to toxoplasma pathogenesis. We observed that Toxoplasma ROP38 is the most differentially expressed protein among toxoplasma strains. Enrichment analysis and KEGG mapping indicated that the human retinal genes most affected by Toxoplasma infections are those related to antiapoptotic mechanisms. We suggest that proteins PIK3R1, PRKCA, PRKCG, PRKCB, HRAS, and c-JUN could be the possible substrates for differentially expressed Toxoplasma kinase ROP38. Likewise, we propose that Toxoplasma causes overexpression of apoptotic suppression human genes. PMID:29317802
Lévêque, Marianne; Marlin, Sandrine; Jonard, Laurence; Procaccio, Vincent; Reynier, Pascal; Amati-Bonneau, Patrizia; Baulande, Sylvain; Pierron, Denis; Lacombe, Didier; Duriez, Françoise; Francannet, Christine; Mom, Thierry; Journel, Hubert; Catros, Hélène; Drouin-Garraud, Valérie; Obstoy, Marie-Françoise; Dollfus, Hélène; Eliot, Marie-Madeleine; Faivre, Laurence; Duvillard, Christian; Couderc, Remy; Garabedian, Eréa-Noël; Petit, Christine; Feldmann, Delphine; Denoyelle, Françoise
2007-11-01
Mitochondrial DNA (mtDNA) mutations have been implicated in non-syndromic hearing loss either as primary or as predisposing factors. As only a part of the mitochondrial genome is usually explored in deafness, its prevalence is probably under-estimated. Among 1350 families with non-syndromic sensorineural hearing loss collected through a French collaborative network, we selected 29 large families with a clear maternal lineage and screened them for known mtDNA mutations in 12S rRNA, tRNASer(UCN) and tRNALeu(UUR) genes. When no mutation could be identified, a whole mitochondrial genome screening was performed, using a microarray resequencing chip: the MitoChip version 2.0 developed by Affymetrix Inc. Known mtDNA mutations was found in nine of the 29 families, which are described in the article: five with A1555G, two with the T7511C, one with 7472insC and one with A3243G mutation. In the remaining 20 families, the resequencing Mitochip detected 258 mitochondrial homoplasmic variants and 107 potentially heteroplasmic variants. Controls were made by direct sequencing on selected fragments and showed a high sensibility of the MitoChip but a low specificity, especially for heteroplasmic variations. An original analysis on the basis of species conservation, frequency and phylogenetic investigation was performed to select the more probably pathogenic variants. The entire genome analysis allowed us to identify five additional families with a putatively pathogenic mitochondrial variant: T669C, C1537T, G8078A, G12236A and G15077A. These results indicate that the new MitoChip platform is a rapid and valuable tool for identification of new mtDNA mutations in deafness.
An evaluation of two-channel ChIP-on-chip and DNA methylation microarray normalization strategies
2012-01-01
Background The combination of chromatin immunoprecipitation with two-channel microarray technology enables genome-wide mapping of binding sites of DNA-interacting proteins (ChIP-on-chip) or sites with methylated CpG di-nucleotides (DNA methylation microarray). These powerful tools are the gateway to understanding gene transcription regulation. Since the goals of such studies, the sample preparation procedures, the microarray content and study design are all different from transcriptomics microarrays, the data pre-processing strategies traditionally applied to transcriptomics microarrays may not be appropriate. Particularly, the main challenge of the normalization of "regulation microarrays" is (i) to make the data of individual microarrays quantitatively comparable and (ii) to keep the signals of the enriched probes, representing DNA sequences from the precipitate, as distinguishable as possible from the signals of the un-enriched probes, representing DNA sequences largely absent from the precipitate. Results We compare several widely used normalization approaches (VSN, LOWESS, quantile, T-quantile, Tukey's biweight scaling, Peng's method) applied to a selection of regulation microarray datasets, ranging from DNA methylation to transcription factor binding and histone modification studies. Through comparison of the data distributions of control probes and gene promoter probes before and after normalization, and assessment of the power to identify known enriched genomic regions after normalization, we demonstrate that there are clear differences in performance between normalization procedures. Conclusion T-quantile normalization applied separately on the channels and Tukey's biweight scaling outperform other methods in terms of the conservation of enriched and un-enriched signal separation, as well as in identification of genomic regions known to be enriched. T-quantile normalization is preferable as it additionally improves comparability between microarrays. In contrast, popular normalization approaches like quantile, LOWESS, Peng's method and VSN normalization alter the data distributions of regulation microarrays to such an extent that using these approaches will impact the reliability of the downstream analysis substantially. PMID:22276688
Microarray Analysis of Differential Gene Expression Profile Between Human Fetal and Adult Heart.
Geng, Zhimin; Wang, Jue; Pan, Lulu; Li, Ming; Zhang, Jitai; Cai, Xueli; Chu, Maoping
2017-04-01
Although many changes have been discovered during heart maturation, the genetic mechanisms involved in the changes between immature and mature myocardium have only been partially elucidated. Here, gene expression profile changed between the human fetal and adult heart was characterized. A human microarray was applied to define the gene expression signatures of the fetal (13-17 weeks of gestation, n = 4) and adult hearts (30-40 years old, n = 4). Gene ontology analyses, pathway analyses, gene set enrichment analyses, and signal transduction network were performed to predict the function of the differentially expressed genes. Ten mRNAs were confirmed by quantificational real-time polymerase chain reaction. 5547 mRNAs were found to be significantly differentially expressed. "Cell cycle" was the most enriched pathway in the down-regulated genes. EFGR, IGF1R, and ITGB1 play a central role in the regulation of heart development. EGFR, IGF1R, and FGFR2 were the core genes regulating cardiac cell proliferation. The quantificational real-time polymerase chain reaction results were concordant with the microarray data. Our data identified the transcriptional regulation of heart development in the second trimester and the potential regulators that play a prominent role in the regulation of heart development and cardiac cells proliferation.
Wang, Yao; Cui, Yazhou; Zhou, Xiaoyan; Han, Jinxiang
2015-01-01
Objective Osteogenesis imperfecta (OI) is a rare inherited skeletal disease, characterized by bone fragility and low bone density. The mutations in this disorder have been widely reported to be on various exonal hotspots of the candidate genes, including COL1A1, COL1A2, CRTAP, LEPRE1, and FKBP10, thus creating a great demand for precise genetic tests. However, large genome sizes make the process daunting and the analyses, inefficient and expensive. Therefore, we aimed at developing a fast, accurate, efficient, and cheaper sequencing platform for OI diagnosis; and to this end, use of an advanced array-based technique was proposed. Method A CustomSeq Affymetrix Resequencing Array was established for high-throughput sequencing of five genes simultaneously. Genomic DNA extraction from 13 OI patients and 85 normal controls and amplification using long-range PCR (LR-PCR) were followed by DNA fragmentation and chip hybridization, according to standard Affymetrix protocols. Hybridization signals were determined using GeneChip Sequence Analysis Software (GSEQ). To examine the feasibility, the outcome from new resequencing approach was validated by conventional capillary sequencing method. Result Overall call rates using resequencing array was 96–98% and the agreement between microarray and capillary sequencing was 99.99%. 11 out of 13 OI patients with pathogenic mutations were successfully detected by the chip analysis without adjustment, and one mutation could also be identified using manual visual inspection. Conclusion A high-throughput resequencing array was developed that detects the disease-associated mutations in OI, providing a potential tool to facilitate large-scale genetic screening for OI patients. Through this method, a novel mutation was also found. PMID:25742658
Zhang, Quan; Zhu, Feng; Liu, Long; Zheng, Chuan Wei; Wang, De He; Hou, Zhuo Cheng; Ning, Zhong Hua
2015-01-01
Eggshell damages lead to economic losses in the egg production industry and are a threat to human health. We examined 49-wk-old Rhode Island White hens (Gallus gallus) that laid eggs having shells with significantly different strengths and thicknesses. We used HiSeq 2000 (Illumina) sequencing to characterize the chicken transcriptome and whole genome to identify the key genes and genetic mutations associated with eggshell calcification. We identified a total of 14,234 genes expressed in the chicken uterus, representing 89% of all annotated chicken genes. A total of 889 differentially expressed genes were identified by comparing low eggshell strength (LES) and normal eggshell strength (NES) genomes. The DEGs are enriched in calcification-related processes, including calcium ion transport and calcium signaling pathways as revealed by gene ontology (GO) and Kyoto encyclopedia of genes and genomes (KEGG) pathway analysis. Some important matrix proteins, such as OC-116, LTF and SPP1, were also expressed differentially between two groups. A total of 3,671,919 single-nucleotide polymorphisms (SNPs) and 508,035 Indels were detected in protein coding genes by whole-genome re-sequencing, including 1775 non-synonymous variations and 19 frame-shift Indels in DEGs. SNPs and Indels found in this study could be further investigated for eggshell traits. This is the first report to integrate the transcriptome and genome re-sequencing to target the genetic variations which decreased the eggshell qualities. These findings further advance our understanding of eggshell calcification in the chicken uterus.
Altmüller, Janine; Budde, Birgit S; Nürnberg, Peter
2014-02-01
Abstract Targeted re-sequencing such as gene panel sequencing (GPS) has become very popular in medical genetics, both for research projects and in diagnostic settings. The technical principles of the different enrichment methods have been reviewed several times before; however, new enrichment products are constantly entering the market, and researchers are often puzzled about the requirement to take decisions about long-term commitments, both for the enrichment product and the sequencing technology. This review summarizes important considerations for the experimental design and provides helpful recommendations in choosing the best sequencing strategy for various research projects and diagnostic applications.
Microarray-based Resequencing of Multiple Bacillus anthracis Isolates
2004-12-17
generated an Unweighted Pair Group Method Arithmetic Mean ( UPGMA ) tree (see methods [56]; Figure 3). The strains group together in a manner broadly similar...was created using DNADIST, plotted as a UPGMA tree using NEIGHBOR and the tree plotted using DRAWGRAM [56]. The B1 strain A0465 was used as an...distance matrix was created using DNADIST, plotted as a UPGMA tree using NEIGHBOR and the tree plotted using DRAWGRAM [57]. Additional data files The
Application of Broad-Spectrum Resequencing Microarray for Genotyping Rhabdoviruses▿
Dacheux, Laurent; Berthet, Nicolas; Dissard, Gabriel; Holmes, Edward C.; Delmas, Olivier; Larrous, Florence; Guigon, Ghislaine; Dickinson, Philip; Faye, Ousmane; Sall, Amadou A.; Old, Iain G.; Kong, Katherine; Kennedy, Giulia C.; Manuguerra, Jean-Claude; Cole, Stewart T.; Caro, Valérie; Gessain, Antoine; Bourhy, Hervé
2010-01-01
The rapid and accurate identification of pathogens is critical in the control of infectious disease. To this end, we analyzed the capacity for viral detection and identification of a newly described high-density resequencing microarray (RMA), termed PathogenID, which was designed for multiple pathogen detection using database similarity searching. We focused on one of the largest and most diverse viral families described to date, the family Rhabdoviridae. We demonstrate that this approach has the potential to identify both known and related viruses for which precise sequence information is unavailable. In particular, we demonstrate that a strategy based on consensus sequence determination for analysis of RMA output data enabled successful detection of viruses exhibiting up to 26% nucleotide divergence with the closest sequence tiled on the array. Using clinical specimens obtained from rabid patients and animals, this method also shows a high species level concordance with standard reference assays, indicating that it is amenable for the development of diagnostic assays. Finally, 12 animal rhabdoviruses which were currently unclassified, unassigned, or assigned as tentative species within the family Rhabdoviridae were successfully detected. These new data allowed an unprecedented phylogenetic analysis of 106 rhabdoviruses and further suggest that the principles and methodology developed here may be used for the broad-spectrum surveillance and the broader-scale investigation of biodiversity in the viral world. PMID:20610710
2009-08-11
Competing Interests: One of the contributing authors : Clark Tibbetts, is the Executive Vice President and Chief Technology Officer of Tessarae, LLC...Detection 5a. CONTRACT NUMBER 5b. GRANT NUMBER 5c. PROGRAM ELEMENT NUMBER 6. AUTHOR (S) 5d. PROJECT NUMBER 5e. TASK NUMBER 5f. WORK UNIT NUMBER 7...N/A 1021 ng No detection Sin nombre Bunyaviridae III 1021 ng Pulmonary syndrome hantavirus strain Convict Creek 107 1CCHFV = Crimean-Congo hemorrhagic
Bikel, Shirley; Jacobo-Albavera, Leonor; Sánchez-Muñoz, Fausto; Cornejo-Granados, Fernanda; Canizales-Quinteros, Samuel; Soberón, Xavier; Sotelo-Mundo, Rogerio R; Del Río-Navarro, Blanca E; Mendoza-Vargas, Alfredo; Sánchez, Filiberto; Ochoa-Leyva, Adrian
2017-01-01
In spite of the emergence of RNA sequencing (RNA-seq), microarrays remain in widespread use for gene expression analysis in the clinic. There are over 767,000 RNA microarrays from human samples in public repositories, which are an invaluable resource for biomedical research and personalized medicine. The absolute gene expression analysis allows the transcriptome profiling of all expressed genes under a specific biological condition without the need of a reference sample. However, the background fluorescence represents a challenge to determine the absolute gene expression in microarrays. Given that the Y chromosome is absent in female subjects, we used it as a new approach for absolute gene expression analysis in which the fluorescence of the Y chromosome genes of female subjects was used as the background fluorescence for all the probes in the microarray. This fluorescence was used to establish an absolute gene expression threshold, allowing the differentiation between expressed and non-expressed genes in microarrays. We extracted the RNA from 16 children leukocyte samples (nine males and seven females, ages 6-10 years). An Affymetrix Gene Chip Human Gene 1.0 ST Array was carried out for each sample and the fluorescence of 124 genes of the Y chromosome was used to calculate the absolute gene expression threshold. After that, several expressed and non-expressed genes according to our absolute gene expression threshold were compared against the expression obtained using real-time quantitative polymerase chain reaction (RT-qPCR). From the 124 genes of the Y chromosome, three genes (DDX3Y, TXLNG2P and EIF1AY) that displayed significant differences between sexes were used to calculate the absolute gene expression threshold. Using this threshold, we selected 13 expressed and non-expressed genes and confirmed their expression level by RT-qPCR. Then, we selected the top 5% most expressed genes and found that several KEGG pathways were significantly enriched. Interestingly, these pathways were related to the typical functions of leukocytes cells, such as antigen processing and presentation and natural killer cell mediated cytotoxicity. We also applied this method to obtain the absolute gene expression threshold in already published microarray data of liver cells, where the top 5% expressed genes showed an enrichment of typical KEGG pathways for liver cells. Our results suggest that the three selected genes of the Y chromosome can be used to calculate an absolute gene expression threshold, allowing a transcriptome profiling of microarray data without the need of an additional reference experiment. Our approach based on the establishment of a threshold for absolute gene expression analysis will allow a new way to analyze thousands of microarrays from public databases. This allows the study of different human diseases without the need of having additional samples for relative expression experiments.
NASA Astrophysics Data System (ADS)
Seto, Donald
The convergence and wealth of informatics, bioinformatics and genomics methods and associated resources allow a comprehensive and rapid approach for the surveillance and detection of bacterial and viral organisms. Coupled with the continuing race for the fastest, most cost-efficient and highest-quality DNA sequencing technology, that is, "next generation sequencing", the detection of biological threat agents by `cheaper and faster' means is possible. With the application of improved bioinformatic tools for the understanding of these genomes and for parsing unique pathogen genome signatures, along with `state-of-the-art' informatics which include faster computational methods, equipment and databases, it is feasible to apply new algorithms to biothreat agent detection. Two such methods are high-throughput DNA sequencing-based and resequencing microarray-based identification. These are illustrated and validated by two examples involving human adenoviruses, both from real-world test beds.
Sequence Variability and Geographic Distribution of Lassa Virus, Sierra Leone
Stockelman, Michael G.; Moses, Lina M.; Park, Matthew; Stenger, David A.; Ansumana, Rashid; Bausch, Daniel G.; Lin, Baochuan
2015-01-01
Lassa virus (LASV) is endemic to parts of West Africa and causes highly fatal hemorrhagic fever. The multimammate rat (Mastomys natalensis) is the only known reservoir of LASV. Most human infections result from zoonotic transmission. The very diverse LASV genome has 4 major lineages associated with different geographic locations. We used reverse transcription PCR and resequencing microarrays to detect LASV in 41 of 214 samples from rodents captured at 8 locations in Sierra Leone. Phylogenetic analysis of partial sequences of nucleoprotein (NP), glycoprotein precursor (GPC), and polymerase (L) genes showed 5 separate clades within lineage IV of LASV in this country. The sequence diversity was higher than previously observed; mean diversity was 7.01% for nucleoprotein gene at the nucleotide level. These results may have major implications for designing diagnostic tests and therapeutic agents for LASV infections in Sierra Leone. PMID:25811712
Segmental Duplications and Copy-Number Variation in the Human Genome
Sharp, Andrew J. ; Locke, Devin P. ; McGrath, Sean D. ; Cheng, Ze ; Bailey, Jeffrey A. ; Vallente, Rhea U. ; Pertz, Lisa M. ; Clark, Royden A. ; Schwartz, Stuart ; Segraves, Rick ; Oseroff, Vanessa V. ; Albertson, Donna G. ; Pinkel, Daniel ; Eichler, Evan E.
2005-01-01
The human genome contains numerous blocks of highly homologous duplicated sequence. This higher-order architecture provides a substrate for recombination and recurrent chromosomal rearrangement associated with genomic disease. However, an assessment of the role of segmental duplications in normal variation has not yet been made. On the basis of the duplication architecture of the human genome, we defined a set of 130 potential rearrangement hotspots and constructed a targeted bacterial artificial chromosome (BAC) microarray (with 2,194 BACs) to assess copy-number variation in these regions by array comparative genomic hybridization. Using our segmental duplication BAC microarray, we screened a panel of 47 normal individuals, who represented populations from four continents, and we identified 119 regions of copy-number polymorphism (CNP), 73 of which were previously unreported. We observed an equal frequency of duplications and deletions, as well as a 4-fold enrichment of CNPs within hotspot regions, compared with control BACs (P < .000001), which suggests that segmental duplications are a major catalyst of large-scale variation in the human genome. Importantly, segmental duplications themselves were also significantly enriched >4-fold within regions of CNP. Almost without exception, CNPs were not confined to a single population, suggesting that these either are recurrent events, having occurred independently in multiple founders, or were present in early human populations. Our study demonstrates that segmental duplications define hotspots of chromosomal rearrangement, likely acting as mediators of normal variation as well as genomic disease, and it suggests that the consideration of genomic architecture can significantly improve the ascertainment of large-scale rearrangements. Our specialized segmental duplication BAC microarray and associated database of structural polymorphisms will provide an important resource for the future characterization of human genomic disorders. PMID:15918152
Liu, Long; Zheng, Chuan Wei; Wang, De He; Hou, Zhuo Cheng; Ning, Zhong Hua
2015-01-01
Eggshell damages lead to economic losses in the egg production industry and are a threat to human health. We examined 49-wk-old Rhode Island White hens (Gallus gallus) that laid eggs having shells with significantly different strengths and thicknesses. We used HiSeq 2000 (Illumina) sequencing to characterize the chicken transcriptome and whole genome to identify the key genes and genetic mutations associated with eggshell calcification. We identified a total of 14,234 genes expressed in the chicken uterus, representing 89% of all annotated chicken genes. A total of 889 differentially expressed genes were identified by comparing low eggshell strength (LES) and normal eggshell strength (NES) genomes. The DEGs are enriched in calcification-related processes, including calcium ion transport and calcium signaling pathways as reveled by gene ontology (GO) and Kyoto encyclopedia of genes and genomes (KEGG) pathway analysis. Some important matrix proteins, such as OC-116, LTF and SPP1, were also expressed differentially between two groups. A total of 3,671,919 single-nucleotide polymorphisms (SNPs) and 508,035 Indels were detected in protein coding genes by whole-genome re-sequencing, including 1775 non-synonymous variations and 19 frame-shift Indels in DEGs. SNPs and Indels found in this study could be further investigated for eggshell traits. This is the first report to integrate the transcriptome and genome re-sequencing to target the genetic variations which decreased the eggshell qualities. These findings further advance our understanding of eggshell calcification in the chicken uterus. PMID:25974068
Medical Sequencing at the extremes of Human Body Mass
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ahituv, Nadav; Kavaslar, Nihan; Schackwitz, Wendy
2006-09-01
Body weight is a quantitative trait with significantheritability in humans. To identify potential genetic contributors tothis phenotype, we resequenced the coding exons and splice junctions of58 genes in 379 obese and 378 lean individuals. Our 96Mb survey included21 genes associated with monogenic forms of obesity in humans or mice, aswell as 37 genes that function in body weight-related pathways. We foundthat the monogenic obesity-associated gene group was enriched for rarenonsynonymous variants unique to the obese (n=46) versus lean (n=26)populations. Computational analysis further predicted a significantlygreater fraction of deleterious variants within the obese cohort.Consistent with the complex inheritance of body weight,more » we did notobserve obvious familial segregation in the majority of the 28 availablekindreds. Taken together, these data suggest that multiple rare alleleswith variable penetrance contribute to obesity in the population andprovide a deep medical sequencing based approach to detectthem.« less
High-Throughput resequencing of maize landraces at genomic regions associated with flowering time
USDA-ARS?s Scientific Manuscript database
Despite the reduction in the price of sequencing, it remains expensive to sequence and assemble whole, complex genomes of multiple samples for population studies, particularly for large genomes like those of many crop species. Enrichment of target genome regions coupled with next generation sequenci...
Droege, Marcus; Hill, Brendon
2008-08-31
The Genome Sequencer FLX System (GS FLX), powered by 454 Sequencing, is a next-generation DNA sequencing technology featuring a unique mix of long reads, exceptional accuracy, and ultra-high throughput. It has been proven to be the most versatile of all currently available next-generation sequencing technologies, supporting many high-profile studies in over seven applications categories. GS FLX users have pursued innovative research in de novo sequencing, re-sequencing of whole genomes and target DNA regions, metagenomics, and RNA analysis. 454 Sequencing is a powerful tool for human genetics research, having recently re-sequenced the genome of an individual human, currently re-sequencing the complete human exome and targeted genomic regions using the NimbleGen sequence capture process, and detected low-frequency somatic mutations linked to cancer.
Bikel, Shirley; Jacobo-Albavera, Leonor; Sánchez-Muñoz, Fausto; Cornejo-Granados, Fernanda; Canizales-Quinteros, Samuel; Soberón, Xavier; Sotelo-Mundo, Rogerio R.; del Río-Navarro, Blanca E.; Mendoza-Vargas, Alfredo; Sánchez, Filiberto
2017-01-01
Background In spite of the emergence of RNA sequencing (RNA-seq), microarrays remain in widespread use for gene expression analysis in the clinic. There are over 767,000 RNA microarrays from human samples in public repositories, which are an invaluable resource for biomedical research and personalized medicine. The absolute gene expression analysis allows the transcriptome profiling of all expressed genes under a specific biological condition without the need of a reference sample. However, the background fluorescence represents a challenge to determine the absolute gene expression in microarrays. Given that the Y chromosome is absent in female subjects, we used it as a new approach for absolute gene expression analysis in which the fluorescence of the Y chromosome genes of female subjects was used as the background fluorescence for all the probes in the microarray. This fluorescence was used to establish an absolute gene expression threshold, allowing the differentiation between expressed and non-expressed genes in microarrays. Methods We extracted the RNA from 16 children leukocyte samples (nine males and seven females, ages 6–10 years). An Affymetrix Gene Chip Human Gene 1.0 ST Array was carried out for each sample and the fluorescence of 124 genes of the Y chromosome was used to calculate the absolute gene expression threshold. After that, several expressed and non-expressed genes according to our absolute gene expression threshold were compared against the expression obtained using real-time quantitative polymerase chain reaction (RT-qPCR). Results From the 124 genes of the Y chromosome, three genes (DDX3Y, TXLNG2P and EIF1AY) that displayed significant differences between sexes were used to calculate the absolute gene expression threshold. Using this threshold, we selected 13 expressed and non-expressed genes and confirmed their expression level by RT-qPCR. Then, we selected the top 5% most expressed genes and found that several KEGG pathways were significantly enriched. Interestingly, these pathways were related to the typical functions of leukocytes cells, such as antigen processing and presentation and natural killer cell mediated cytotoxicity. We also applied this method to obtain the absolute gene expression threshold in already published microarray data of liver cells, where the top 5% expressed genes showed an enrichment of typical KEGG pathways for liver cells. Our results suggest that the three selected genes of the Y chromosome can be used to calculate an absolute gene expression threshold, allowing a transcriptome profiling of microarray data without the need of an additional reference experiment. Discussion Our approach based on the establishment of a threshold for absolute gene expression analysis will allow a new way to analyze thousands of microarrays from public databases. This allows the study of different human diseases without the need of having additional samples for relative expression experiments. PMID:29230367
Day-Williams, Aaron G.; McLay, Kirsten; Drury, Eleanor; Edkins, Sarah; Coffey, Alison J.; Palotie, Aarno; Zeggini, Eleftheria
2011-01-01
Pooled sequencing can be a cost-effective approach to disease variant discovery, but its applicability in association studies remains unclear. We compare sequence enrichment methods coupled to next-generation sequencing in non-indexed pools of 1, 2, 10, 20 and 50 individuals and assess their ability to discover variants and to estimate their allele frequencies. We find that pooled resequencing is most usefully applied as a variant discovery tool due to limitations in estimating allele frequency with high enough accuracy for association studies, and that in-solution hybrid-capture performs best among the enrichment methods examined regardless of pool size. PMID:22069447
Raghavan, Avanthi; Neeli, Hemanth; Jin, Weijun; Badellino, Karen O.; Demissie, Serkalem; Manning, Alisa K.; DerOhannessian, Stephanie L.; Wolfe, Megan L.; Cupples, L. Adrienne; Li, Mingyao; Kathiresan, Sekar; Rader, Daniel J.
2011-01-01
Genome-wide association studies (GWAS) have successfully identified loci associated with quantitative traits, such as blood lipids. Deep resequencing studies are being utilized to catalogue the allelic spectrum at GWAS loci. The goal of these studies is to identify causative variants and missing heritability, including heritability due to low frequency and rare alleles with large phenotypic impact. Whereas rare variant efforts have primarily focused on nonsynonymous coding variants, we hypothesized that noncoding variants in these loci are also functionally important. Using the HDL-C gene LIPG as an example, we explored the effect of regulatory variants identified through resequencing of subjects at HDL-C extremes on gene expression, protein levels, and phenotype. Resequencing a portion of the LIPG promoter and 5′ UTR in human subjects with extreme HDL-C, we identified several rare variants in individuals from both extremes. Luciferase reporter assays were used to measure the effect of these rare variants on LIPG expression. Variants conferring opposing effects on gene expression were enriched in opposite extremes of the phenotypic distribution. Minor alleles of a common regulatory haplotype and noncoding GWAS SNPs were associated with reduced plasma levels of the LIPG gene product endothelial lipase (EL), consistent with its role in HDL-C catabolism. Additionally, we found that a common nonfunctional coding variant associated with HDL-C (rs2000813) is in linkage disequilibrium with a 5′ UTR variant (rs34474737) that decreases LIPG promoter activity. We attribute the gene regulatory role of rs34474737 to the observed association of the coding variant with plasma EL levels and HDL-C. Taken together, the findings show that both rare and common noncoding regulatory variants are important contributors to the allelic spectrum in complex trait loci. PMID:22174694
Sulaiman, Irshad M.; Tang, Kevin; Osborne, John; Sammons, Scott; Wohlhueter, Robert M.
2007-01-01
We developed a set of seven resequencing GeneChips, based on the complete genome sequences of 24 strains of smallpox virus (variola virus), for rapid characterization of this human-pathogenic virus. Each GeneChip was designed to analyze a divergent segment of approximately 30,000 bases of the smallpox virus genome. This study includes the hybridization results of 14 smallpox virus strains. Of the 14 smallpox virus strains hybridized, only 7 had sequence information included in the design of the smallpox virus resequencing GeneChips; similar information for the remaining strains was not tiled as a reference in these GeneChips. By use of variola virus-specific primers and long-range PCR, 22 overlapping amplicons were amplified to cover nearly the complete genome and hybridized with the smallpox virus resequencing GeneChip set. These GeneChips were successful in generating nucleotide sequences for all 14 of the smallpox virus strains hybridized. Analysis of the data indicated that the GeneChip resequencing by hybridization was fast and reproducible and that the smallpox virus resequencing GeneChips could differentiate the 14 smallpox virus strains characterized. This study also suggests that high-density resequencing GeneChips have potential biodefense applications and may be used as an alternate tool for rapid identification of smallpox virus in the future. PMID:17182757
Quantitative phenotyping via deep barcode sequencing.
Smith, Andrew M; Heisler, Lawrence E; Mellor, Joseph; Kaper, Fiona; Thompson, Michael J; Chee, Mark; Roth, Frederick P; Giaever, Guri; Nislow, Corey
2009-10-01
Next-generation DNA sequencing technologies have revolutionized diverse genomics applications, including de novo genome sequencing, SNP detection, chromatin immunoprecipitation, and transcriptome analysis. Here we apply deep sequencing to genome-scale fitness profiling to evaluate yeast strain collections in parallel. This method, Barcode analysis by Sequencing, or "Bar-seq," outperforms the current benchmark barcode microarray assay in terms of both dynamic range and throughput. When applied to a complex chemogenomic assay, Bar-seq quantitatively identifies drug targets, with performance superior to the benchmark microarray assay. We also show that Bar-seq is well-suited for a multiplex format. We completely re-sequenced and re-annotated the yeast deletion collection using deep sequencing, found that approximately 20% of the barcodes and common priming sequences varied from expectation, and used this revised list of barcode sequences to improve data quality. Together, this new assay and analysis routine provide a deep-sequencing-based toolkit for identifying gene-environment interactions on a genome-wide scale.
Lee, SangWook; Kim, Soyoun; Malm, Johan; Jeong, Ok Chan; Lilja, Hans; Laurell, Thomas
2014-01-01
Enriching the surface density of immobilized capture antibodies enhances the detection signal of antibody sandwich microarrays. In this study, we improved the detection sensitivity of our previously developed P-Si (porous silicon) antibody microarray by optimizing concentrations of the capturing antibody. We investigated immunoassays using a P-Si microarray at three different capture antibody (PSA - prostate specific antigen) concentrations, analyzing the influence of the antibody density on the assay detection sensitivity. The LOD (limit of detection) for PSA was 2.5ngmL−1, 80pgmL−1, and 800fgmL−1 when arraying the PSA antibody, H117 at the concentration 15µgmL−1, 35µgmL−1 and 154µgmL−1, respectively. We further investigated PSA spiked into human female serum in the range of 800fgmL−1 to 500ngmL−1. The microarray showed a LOD of 800fgmL−1 and a dynamic range of 800 fgmL−1 to 80ngmL−1 in serum spiked samples. PMID:24016590
Gardiner, Laura-Jayne; Gawroński, Piotr; Olohan, Lisa; Schnurbusch, Thorsten; Hall, Neil; Hall, Anthony
2014-12-01
Mapping-by-sequencing analyses have largely required a complete reference sequence and employed whole genome re-sequencing. In species such as wheat, no finished genome reference sequence is available. Additionally, because of its large genome size (17 Gb), re-sequencing at sufficient depth of coverage is not practical. Here, we extend the utility of mapping by sequencing, developing a bespoke pipeline and algorithm to map an early-flowering locus in einkorn wheat (Triticum monococcum L.) that is closely related to the bread wheat genome A progenitor. We have developed a genomic enrichment approach using the gene-rich regions of hexaploid bread wheat to design a 110-Mbp NimbleGen SeqCap EZ in solution capture probe set, representing the majority of genes in wheat. Here, we use the capture probe set to enrich and sequence an F2 mapping population of the mutant. The mutant locus was identified in T. monococcum, which lacks a complete genome reference sequence, by mapping the enriched data set onto pseudo-chromosomes derived from the capture probe target sequence, with a long-range order of genes based on synteny of wheat with Brachypodium distachyon. Using this approach we are able to map the region and identify a set of deleted genes within the interval. © 2014 The Authors.The Plant Journal published by Society for Experimental Biology and John Wiley & Sons Ltd.
Cox, Brian; Sharma, Parveen; Evangelou, Andreas I; Whiteley, Kathie; Ignatchenko, Vladimir; Ignatchenko, Alex; Baczyk, Dora; Czikk, Marie; Kingdom, John; Rossant, Janet; Gramolini, Anthony O; Adamson, S Lee; Kislinger, Thomas
2011-12-01
Preeclampsia (PE) adversely impacts ~5% of pregnancies. Despite extensive research, no consistent biomarkers or cures have emerged, suggesting that different molecular mechanisms may cause clinically similar disease. To address this, we undertook a proteomics study with three main goals: (1) to identify a panel of cell surface markers that distinguish the trophoblast and endothelial cells of the placenta in the mouse; (2) to translate this marker set to human via the Human Protein Atlas database; and (3) to utilize the validated human trophoblast markers to identify subgroups of human preeclampsia. To achieve these goals, plasma membrane proteins at the blood tissue interfaces were extracted from placentas using intravascular silica-bead perfusion, and then identified using shotgun proteomics. We identified 1181 plasma membrane proteins, of which 171 were enriched at the maternal blood-trophoblast interface and 192 at the fetal endothelial interface with a 70% conservation of expression in humans. Three distinct molecular subgroups of human preeclampsia were identified in existing human microarray data by using expression patterns of trophoblast-enriched proteins. Analysis of all misexpressed genes revealed divergent dysfunctions including angiogenesis (subgroup 1), MAPK signaling (subgroup 2), and hormone biosynthesis and metabolism (subgroup 3). Subgroup 2 lacked expected changes in known preeclampsia markers (sFLT1, sENG) and uniquely overexpressed GNA12. In an independent set of 40 banked placental specimens, GNA12 was overexpressed during preeclampsia when co-incident with chronic hypertension. In the current study we used a novel translational analysis to integrate mouse and human trophoblast protein expression with human microarray data. This strategy identified distinct molecular pathologies in human preeclampsia. We conclude that clinically similar preeclampsia patients exhibit divergent placental gene expression profiles thus implicating divergent molecular mechanisms in the origins of this disease.
Welker, Noah C; Habig, Jeffrey W; Bass, Brenda L
2007-07-01
We describe the first microarray analysis of a whole animal containing a mutation in the Dicer gene. We used adult Caenorhabditis elegans and, to distinguish among different roles of Dicer, we also performed microarray analyses of animals with mutations in rde-4 and rde-1, which are involved in silencing by siRNA, but not miRNA. Surprisingly, we find that the X chromosome is greatly enriched for genes regulated by Dicer. Comparison of all three microarray data sets indicates the majority of Dicer-regulated genes are not dependent on RDE-4 or RDE-1, including the X-linked genes. However, all three data sets are enriched in genes important for innate immunity and, specifically, show increased expression of innate immunity genes.
Welker, Noah C.; Habig, Jeffrey W.; Bass, Brenda L.
2007-01-01
We describe the first microarray analysis of a whole animal containing a mutation in the Dicer gene. We used adult Caenorhabditis elegans and, to distinguish among different roles of Dicer, we also performed microarray analyses of animals with mutations in rde-4 and rde-1, which are involved in silencing by siRNA, but not miRNA. Surprisingly, we find that the X chromosome is greatly enriched for genes regulated by Dicer. Comparison of all three microarray data sets indicates the majority of Dicer-regulated genes are not dependent on RDE-4 or RDE-1, including the X-linked genes. However, all three data sets are enriched in genes important for innate immunity and, specifically, show increased expression of innate immunity genes. PMID:17526642
Evaluation of Quality Assessment Protocols for High Throughput Genome Resequencing Data
Chiara, Matteo; Pavesi, Giulio
2017-01-01
Large-scale initiatives aiming to recover the complete sequence of thousands of human genomes are currently being undertaken worldwide, concurring to the generation of a comprehensive catalog of human genetic variation. The ultimate and most ambitious goal of human population scale genomics is the characterization of the so-called human “variome,” through the identification of causal mutations or haplotypes. Several research institutions worldwide currently use genotyping assays based on Next-Generation Sequencing (NGS) for diagnostics and clinical screenings, and the widespread application of such technologies promises major revolutions in medical science. Bioinformatic analysis of human resequencing data is one of the main factors limiting the effectiveness and general applicability of NGS for clinical studies. The requirement for multiple tools, to be combined in dedicated protocols in order to accommodate different types of data (gene panels, exomes, or whole genomes) and the high variability of the data makes difficult the establishment of a ultimate strategy of general use. While there already exist several studies comparing sensitivity and accuracy of bioinformatic pipelines for the identification of single nucleotide variants from resequencing data, little is known about the impact of quality assessment and reads pre-processing strategies. In this work we discuss major strengths and limitations of the various genome resequencing protocols are currently used in molecular diagnostics and for the discovery of novel disease-causing mutations. By taking advantage of publicly available data we devise and suggest a series of best practices for the pre-processing of the data that consistently improve the outcome of genotyping with minimal impacts on computational costs. PMID:28736571
Krienen, Fenna M.; Yeo, B. T. Thomas; Ge, Tian; Buckner, Randy L.; Sherwood, Chet C.
2016-01-01
The human brain is patterned with disproportionately large, distributed cerebral networks that connect multiple association zones in the frontal, temporal, and parietal lobes. The expansion of the cortical surface, along with the emergence of long-range connectivity networks, may be reflected in changes to the underlying molecular architecture. Using the Allen Institute’s human brain transcriptional atlas, we demonstrate that genes particularly enriched in supragranular layers of the human cerebral cortex relative to mouse distinguish major cortical classes. The topography of transcriptional expression reflects large-scale brain network organization consistent with estimates from functional connectivity MRI and anatomical tracing in nonhuman primates. Microarray expression data for genes preferentially expressed in human upper layers (II/III), but enriched only in lower layers (V/VI) of mouse, were cross-correlated to identify molecular profiles across the cerebral cortex of postmortem human brains (n = 6). Unimodal sensory and motor zones have similar molecular profiles, despite being distributed across the cortical mantle. Sensory/motor profiles were anticorrelated with paralimbic and certain distributed association network profiles. Tests of alternative gene sets did not consistently distinguish sensory and motor regions from paralimbic and association regions: (i) genes enriched in supragranular layers in both humans and mice, (ii) genes cortically enriched in humans relative to nonhuman primates, (iii) genes related to connectivity in rodents, (iv) genes associated with human and mouse connectivity, and (v) 1,454 gene sets curated from known gene ontologies. Molecular innovations of upper cortical layers may be an important component in the evolution of long-range corticocortical projections. PMID:26739559
Krienen, Fenna M; Yeo, B T Thomas; Ge, Tian; Buckner, Randy L; Sherwood, Chet C
2016-01-26
The human brain is patterned with disproportionately large, distributed cerebral networks that connect multiple association zones in the frontal, temporal, and parietal lobes. The expansion of the cortical surface, along with the emergence of long-range connectivity networks, may be reflected in changes to the underlying molecular architecture. Using the Allen Institute's human brain transcriptional atlas, we demonstrate that genes particularly enriched in supragranular layers of the human cerebral cortex relative to mouse distinguish major cortical classes. The topography of transcriptional expression reflects large-scale brain network organization consistent with estimates from functional connectivity MRI and anatomical tracing in nonhuman primates. Microarray expression data for genes preferentially expressed in human upper layers (II/III), but enriched only in lower layers (V/VI) of mouse, were cross-correlated to identify molecular profiles across the cerebral cortex of postmortem human brains (n = 6). Unimodal sensory and motor zones have similar molecular profiles, despite being distributed across the cortical mantle. Sensory/motor profiles were anticorrelated with paralimbic and certain distributed association network profiles. Tests of alternative gene sets did not consistently distinguish sensory and motor regions from paralimbic and association regions: (i) genes enriched in supragranular layers in both humans and mice, (ii) genes cortically enriched in humans relative to nonhuman primates, (iii) genes related to connectivity in rodents, (iv) genes associated with human and mouse connectivity, and (v) 1,454 gene sets curated from known gene ontologies. Molecular innovations of upper cortical layers may be an important component in the evolution of long-range corticocortical projections.
Wang, Wenyu; Liu, Yang; Hao, Jingcan; Zheng, Shuyu; Wen, Yan; Xiao, Xiao; He, Awen; Fan, Qianrui; Zhang, Feng; Liu, Ruiyu
2016-10-10
Hip cartilage destruction is consistently observed in the non-traumatic osteonecrosis of femoral head (NOFH) and accelerates its bone necrosis. The molecular mechanism underlying the cartilage damage of NOFH remains elusive. In this study, we conducted a systematically comparative study of gene expression profiles between NOFH and osteoarthritis (OA). Hip articular cartilage specimens were collected from 12 NOFH patients and 12 controls with traumatic femoral neck fracture for microarray (n=4) and quantitative real-time PCR validation experiments (n=8). Gene expression profiling of articular cartilage was performed using Agilent Human 4×44K Microarray chip. The accuracy of microarray experiment was further validated by qRT-PCR. Gene expression results of OA hip cartilage were derived from previously published study. Significance Analysis of Microarrays (SAM) software was applied for identifying differently expressed genes. Gene ontology (GO) and pathway enrichment analysis were conducted by Gene Set Enrichment Analysis software and DAVID tool, respectively. Totally, 27 differently expressed genes were identified for NOFH. Comparing the gene expression profiles of NOFH cartilage and OA cartilage detected 8 common differently expressed genes, including COL5A1, OGN, ANGPTL4, CRIP1, NFIL3, METRNL, ID2 and STEAP1. GO comparative analysis identified 10 common significant GO terms, mainly implicated in apoptosis and development process. Pathway comparative analysis observed that ECM-receptor interaction pathway and focal adhesion pathway were enriched in the differently expressed genes of both NOFH and hip OA. In conclusion, we identified a set of differently expressed genes, GO and pathways for NOFH articular destruction, some of which were also involved in the hip OA. Our study results may help to reveal the pathogenetic similarities and differences of cartilage damage of NOFH and hip OA. Copyright © 2016 Elsevier B.V. All rights reserved.
Shahmanesh, Mohsen; Phillips, Kenneth; Boothby, Meg; Tomlinson, Jeremy W.
2015-01-01
Objective To compare changes in gene expression by microarray from subcutaneous adipose tissue from HIV treatment naïve patients treated with efavirenz based regimens containing abacavir (ABC), tenofovir (TDF) or zidovidine (AZT). Design Subcutaneous fat biopsies were obtained before, at 6- and 18–24-months after treatment, and from HIV negative controls. Groups were age, ethnicity, weight, biochemical profile, and pre-treatment CD4 count matched. Microarray data was generated using the Agilent Whole Human Genome Microarray. Identification of differentially expressed genes and genomic response pathways was performed using limma and gene set enrichment analysis. Results There were significant divergences between ABC and the other two groups 6 months after treatment in genes controlling cell adhesion and environmental information processing, with some convergence at 18–24 months. Compared to controls the ABC group, but not AZT or TDF showed enrichment of genes controlling adherence junction, at 6 months and 18–24 months (adjusted p<0.05) and focal adhesions and tight junction at 6 months (p<0.5). Genes controlling leukocyte transendothelial migration (p<0.05) and ECM-receptor interactions (p = 0.04) were over-expressed in ABC compared to TDF and AZT at 6 months but not at 18–24 months. Enrichment of pathways and individual genes controlling cell adhesion and environmental information processing were specifically dysregulated in the ABC group in comparison with other treatments. There was little difference between AZT and TDF. Conclusion After initiating treatment, there is divergence in the expression of genes controlling cell adhesion and environmental information processing between ABC and both TDF and AZT in subcutaneous adipose tissue. If similar changes are also taking place in other tissues including the coronary vasculature they may contribute to the increased risk of cardiovascular events reported in patients recently started on abacavir-containing regimens. PMID:25617630
Microarray Analysis of Long Noncoding RNAs in Female Diabetic Peripheral Neuropathy Patients.
Luo, Lin; Ji, Lin-Dan; Cai, Jiang-Jia; Feng, Mei; Zhou, Mi; Hu, Su-Pei; Xu, Jin; Zhou, Wen-Hua
2018-01-01
Diabetic peripheral neuropathy (DPN) is the most common complication of diabetes mellitus (DM). Because of its controversial pathogenesis, DPN is still not diagnosed or managed properly in most patients. In this study, human lncRNA microarrays were used to identify the differentially expressed lncRNAs in DM and DPN patients, and some of the discovered lncRNAs were further validated in additional 78 samples by quantitative realtime PCR (qRT-PCR). The microarray analysis identified 446 and 1327 differentially expressed lncRNAs in DM and DPN, respectively. The KEGG pathway analysis further revealed that the differentially expressed lncRNA-coexpressed mRNAs between DPN and DM groups were significantly enriched in the MAPK signaling pathway. The lncRNA/mRNA coexpression network indicated that BDNF and TRAF2 correlated with 6 lncRNAs. The qRT-PCR confirmed the initial microarray results. These findings demonstrated that the interplay between lncRNAs and mRNA may be involved in the pathogenesis of DPN, especially the neurotrophin-MAPK signaling pathway, thus providing relevant information for future studies. © 2018 The Author(s). Published by S. Karger AG, Basel.
Quantitative phenotyping via deep barcode sequencing
Smith, Andrew M.; Heisler, Lawrence E.; Mellor, Joseph; Kaper, Fiona; Thompson, Michael J.; Chee, Mark; Roth, Frederick P.; Giaever, Guri; Nislow, Corey
2009-01-01
Next-generation DNA sequencing technologies have revolutionized diverse genomics applications, including de novo genome sequencing, SNP detection, chromatin immunoprecipitation, and transcriptome analysis. Here we apply deep sequencing to genome-scale fitness profiling to evaluate yeast strain collections in parallel. This method, Barcode analysis by Sequencing, or “Bar-seq,” outperforms the current benchmark barcode microarray assay in terms of both dynamic range and throughput. When applied to a complex chemogenomic assay, Bar-seq quantitatively identifies drug targets, with performance superior to the benchmark microarray assay. We also show that Bar-seq is well-suited for a multiplex format. We completely re-sequenced and re-annotated the yeast deletion collection using deep sequencing, found that ∼20% of the barcodes and common priming sequences varied from expectation, and used this revised list of barcode sequences to improve data quality. Together, this new assay and analysis routine provide a deep-sequencing-based toolkit for identifying gene–environment interactions on a genome-wide scale. PMID:19622793
Chatterjee, Shatakshee; Verma, Srikant Prasad; Pandey, Priyanka
2017-09-05
Initiation and progression of fluid filled cysts mark Autosomal Dominant Polycystic Kidney Disease (ADPKD). Thus, improved therapeutics targeting cystogenesis remains a constant challenge. Microarray studies in single ADPKD animal models species with limited sample sizes tend to provide scattered views on underlying ADPKD pathogenesis. Thus we aim to perform a cross species meta-analysis to profile conserved biological pathways that might be key targets for therapy. Nine ADPKD microarray datasets on rat, mice and human fulfilled our study criteria and were chosen. Intra-species combined analysis was performed after considering removal of batch effect. Significantly enriched GO biological processes and KEGG pathways were computed and their overlap was observed. For the conserved pathways, biological modules and gene regulatory networks were observed. Additionally, Gene Set Enrichment Analysis (GSEA) using Molecular Signature Database (MSigDB) was performed for genes found in conserved pathways. We obtained 28 modules of significantly enriched GO processes and 5 major functional categories from significantly enriched KEGG pathways conserved in human, mice and rats that in turn suggest a global transcriptomic perturbation affecting cyst - formation, growth and progression. Significantly enriched pathways obtained from up-regulated genes such as Genomic instability, Protein localization in ER and Insulin Resistance were found to regulate cyst formation and growth whereas cyst progression due to increased cell adhesion and inflammation was suggested by perturbations in Angiogenesis, TGF-beta, CAMs, and Infection related pathways. Additionally, networks revealed shared genes among pathways e.g. SMAD2 and SMAD7 in Endocytosis and TGF-beta. Our study suggests cyst formation and progression to be an outcome of interplay between a set of several key deregulated pathways. Thus, further translational research is warranted focusing on developing a combinatorial therapeutic approach for ADPKD redressal. Copyright © 2017 Elsevier B.V. All rights reserved.
Giresi, Paul G.; Kim, Jonghwan; McDaniell, Ryan M.; Iyer, Vishwanath R.; Lieb, Jason D.
2007-01-01
DNA segments that actively regulate transcription in vivo are typically characterized by eviction of nucleosomes from chromatin and are experimentally identified by their hypersensitivity to nucleases. Here we demonstrate a simple procedure for the isolation of nucleosome-depleted DNA from human chromatin, termed FAIRE (Formaldehyde-Assisted Isolation of Regulatory Elements). To perform FAIRE, chromatin is crosslinked with formaldehyde in vivo, sheared by sonication, and phenol-chloroform extracted. The DNA recovered in the aqueous phase is fluorescently labeled and hybridized to a DNA microarray. FAIRE performed in human cells strongly enriches DNA coincident with the location of DNaseI hypersensitive sites, transcriptional start sites, and active promoters. Evidence for cell-type–specific patterns of FAIRE enrichment is also presented. FAIRE has utility as a positive selection for genomic regions associated with regulatory activity, including regions traditionally detected by nuclease hypersensitivity assays. PMID:17179217
Novel mutations in LRP6 highlight the role of WNT signaling in tooth agenesis
Ludwig, Kerstin U.; Sullivan, Robert; van Rooij, Iris A.L.M.; Thonissen, Michelle; Swinnen, Steven; Phan, Milien; Conte, Federica; Ishorst, Nina; Gilissen, Christian; RoaFuentes, Laury; van de Vorst, Maartje; Henkes, Arjen; Steehouwer, Marloes; van Beusekom, Ellen; Bloemen, Marjon; Vankeirsbilck, Bruno; Bergé, Stefaan; Hens, Greet; Schoenaers, Joseph; Poorten, Vincent Vander; Roosenboom, Jasmien; Verdonck, An; Devriendt, Koen; Roeleveldt, Nel; Jhangiani, Shalini N.; Vissers, Lisenka E.L.M.; Lupski, James R.; de Ligt, Joep; Von den Hoff, Johannes W.; Pfundt, Rolph; Brunner, Han G.; Zhou, Huiqing; Dixon, Jill; Mangold, Elisabeth; van Bokhoven, Hans; Dixon, Michael J.; Kleefstra, Tjitske
2016-01-01
Purpose Here we aimed to identify a novel genetic cause of tooth agenesis (TA) and/or orofacial clefting (OFC) by combining whole exome sequencing (WES) and targeted re-sequencing in a large cohort of TA and OFC patients. Methods WES was performed in two unrelated patients, one with severe TA and OFC and another with severe TA only. After identifying deleterious mutations in a gene encoding the low density lipoprotein receptor-related protein 6 (LRP6), all its exons were re-sequenced with molecular inversion probes, in 67 patients with TA, 1,072 patients with OFC and in 706 controls. Results We identified a frameshift (c.4594delG, p.Cys1532fs) and a canonical splice site mutation (c.3398-2A>C, p.?) in LRP6 respectively in the patient with TA and OFC, and in the patient with severe TA only. The targeted re-sequencing showed significant enrichment of unique LRP6 variants in TA patients, but not in nonsyndromic OFC. From the 5 variants in patients with TA, 2 affect the canonical splice site and 3 were missense variants; all variants segregated with the dominant phenotype and in 1 case the missense mutation occurred de novo. Conclusion Mutations in LRP6 cause tooth agenesis in man. PMID:26963285
Inamura, Kentaro; Togashi, Yuki; Ninomiya, Hironori; Shimoji, Takashi; Noda, Tetsuo; Ishikawa, Yuichi
2008-01-01
Previously, using microarray and real-time RT-PCR analysis, we established that HOXB2 is an adverse prognostic indicator for Stage I lung adenocarcinomas. HOXB2 is one of the homeobox master development-controlling genes regulating morphogenesis and cell differentiation. The molecular functions of HOXB2 were analyzed with a small interfering RNA (siRNA) approach in HOP-62 human non-small cell lung cancer (NSCLC) cells featuring high HOXB2 expression. Matrigel invasion assays and microarray gene expression analysis were compared between the HOXB2-siRNA cells and the control cells. The Matrigel invasion assays showed attenuation of HOXB2 expression by siRNA to result in a significant decrease of invasiveness compared to the control cells (p = 0.0013, paired t-test). On microarray gene expression analysis, up-regulation of many metastasis-related genes and others correlating with HOXB2 expression was observed in the control case. With attenuation of HOXB2 expression, downregulation was noted for laminins alpha 4 and 5, involved in enriched signaling, and for Mac-2BP (Mac-2 binding protein) and integrin beta 4 amongst the genes having an enriched glycoprotein ontology. HOXB2 promotes invasion of lung cancer cells through the regulation of metastasis-related genes.
Targeted Re-Sequencing Emulsion PCR Panel for Myopathies: Results in 94 Cases.
Punetha, Jaya; Kesari, Akanchha; Uapinyoying, Prech; Giri, Mamta; Clarke, Nigel F; Waddell, Leigh B; North, Kathryn N; Ghaoui, Roula; O'Grady, Gina L; Oates, Emily C; Sandaradura, Sarah A; Bönnemann, Carsten G; Donkervoort, Sandra; Plotz, Paul H; Smith, Edward C; Tesi-Rocha, Carolina; Bertorini, Tulio E; Tarnopolsky, Mark A; Reitter, Bernd; Hausmanowa-Petrusewicz, Irena; Hoffman, Eric P
2016-05-27
Molecular diagnostics in the genetic myopathies often requires testing of the largest and most complex transcript units in the human genome (DMD, TTN, NEB). Iteratively targeting single genes for sequencing has traditionally entailed high costs and long turnaround times. Exome sequencing has begun to supplant single targeted genes, but there are concerns regarding coverage and needed depth of the very large and complex genes that frequently cause myopathies. To evaluate efficiency of next-generation sequencing technologies to provide molecular diagnostics for patients with previously undiagnosed myopathies. We tested a targeted re-sequencing approach, using a 45 gene emulsion PCR myopathy panel, with subsequent sequencing on the Illumina platform in 94 undiagnosed patients. We compared the targeted re-sequencing approach to exome sequencing for 10 of these patients studied. We detected likely pathogenic mutations in 33 out of 94 patients with a molecular diagnostic rate of approximately 35%. The remaining patients showed variants of unknown significance (35/94 patients) or no mutations detected in the 45 genes tested (26/94 patients). Mutation detection rates for targeted re-sequencing vs. whole exome were similar in both methods; however exome sequencing showed better distribution of reads and fewer exon dropouts. Given that costs of highly parallel re-sequencing and whole exome sequencing are similar, and that exome sequencing now takes considerably less laboratory processing time than targeted re-sequencing, we recommend exome sequencing as the standard approach for molecular diagnostics of myopathies.
Zhou, Xiaobo; Qiu, Weiliang; Sathirapongsasuti, J. Fah.; Cho, Michael H.; Mancini, John D.; Lao, Taotao; Thibault, Derek M.; Litonjua, Gus; Bakke, Per S.; Gulsvik, Amund; Lomas, David A.; Beaty, Terri H.; Hersh, Craig P.; Anderson, Christopher; Geigenmuller, Ute; Raby, Benjamin A.; Rennard, Stephen I.; Perrella, Mark A.; Choi, Augustine M.K.; Quackenbush, John; Silverman, Edwin K.
2013-01-01
Hedgehog Interacting Protein (HHIP) was implicated in chronic obstructive pulmonary disease (COPD) by genome-wide association studies (GWAS). However, it remains unclear how HHIP contributes to COPD pathogenesis. To identify genes regulated by HHIP, we performed gene expression microarray analysis in a human bronchial epithelial cell line (Beas-2B) stably infected with HHIP shRNAs. HHIP silencing led to differential expression of 296 genes; enrichment for variants nominally associated with COPD was found. Eighteen of the differentially expressed genes were validated by real-time PCR in Beas-2B cells. Seven of 11 validated genes tested in human COPD and control lung tissues demonstrated significant gene expression differences. Functional annotation indicated enrichment for extracellular matrix and cell growth genes. Network modeling demonstrated that the extracellular matrix and cell proliferation genes influenced by HHIP tended to be interconnected. Thus, we identified potential HHIP targets in human bronchial epithelial cells that may contribute to COPD pathogenesis. PMID:23459001
Identification of candidate genes in osteoporosis by integrated microarray analysis.
Li, J J; Wang, B Q; Fei, Q; Yang, Y; Li, D
2016-12-01
In order to screen the altered gene expression profile in peripheral blood mononuclear cells of patients with osteoporosis, we performed an integrated analysis of the online microarray studies of osteoporosis. We searched the Gene Expression Omnibus (GEO) database for microarray studies of peripheral blood mononuclear cells in patients with osteoporosis. Subsequently, we integrated gene expression data sets from multiple microarray studies to obtain differentially expressed genes (DEGs) between patients with osteoporosis and normal controls. Gene function analysis was performed to uncover the functions of identified DEGs. A total of three microarray studies were selected for integrated analysis. In all, 1125 genes were found to be significantly differentially expressed between osteoporosis patients and normal controls, with 373 upregulated and 752 downregulated genes. Positive regulation of the cellular amino metabolic process (gene ontology (GO): 0033240, false discovery rate (FDR) = 1.00E + 00) was significantly enriched under the GO category for biological processes, while for molecular functions, flavin adenine dinucleotide binding (GO: 0050660, FDR = 3.66E-01) and androgen receptor binding (GO: 0050681, FDR = 6.35E-01) were significantly enriched. DEGs were enriched in many osteoporosis-related signalling pathways, including those of mitogen-activated protein kinase (MAPK) and calcium. Protein-protein interaction (PPI) network analysis showed that the significant hub proteins contained ubiquitin specific peptidase 9, X-linked (Degree = 99), ubiquitin specific peptidase 19 (Degree = 57) and ubiquitin conjugating enzyme E2 B (Degree = 57). Analysis of gene function of identified differentially expressed genes may expand our understanding of fundamental mechanisms leading to osteoporosis. Moreover, significantly enriched pathways, such as MAPK and calcium, may involve in osteoporosis through osteoblastic differentiation and bone formation.Cite this article: J. J. Li, B. Q. Wang, Q. Fei, Y. Yang, D. Li. Identification of candidate genes in osteoporosis by integrated microarray analysis. Bone Joint Res 2016;5:594-601. DOI: 10.1302/2046-3758.512.BJR-2016-0073.R1. © 2016 Fei et al.
Trujillano, D; Ramos, M D; González, J; Tornador, C; Sotillo, F; Escaramis, G; Ossowski, S; Armengol, L; Casals, T; Estivill, X
2013-07-01
Here we have developed a novel and much more efficient strategy for the complete molecular characterisation of the cystic fibrosis (CF) transmembrane regulator (CFTR) gene, based on multiplexed targeted resequencing. We have tested this approach in a cohort of 92 samples with previously characterised CFTR mutations and polymorphisms. After enrichment of the pooled barcoded DNA libraries with a custom NimbleGen SeqCap EZ Choice array (Roche) and sequencing with a HiSeq2000 (Illumina) sequencer, we applied several bioinformatics tools to call mutations and polymorphisms in CFTR. The combination of several bioinformatics tools allowed us to detect all known pathogenic variants (point mutations, short insertions/deletions, and large genomic rearrangements) and polymorphisms (including the poly-T and poly-thymidine-guanine polymorphic tracts) in the 92 samples. In addition, we report the precise characterisation of the breakpoints of seven genomic rearrangements in CFTR, including those of a novel deletion of exon 22 and a complex 85 kb inversion which includes two large deletions affecting exons 4-8 and 12-21, respectively. This work is a proof-of-principle that targeted resequencing is an accurate and cost-effective approach for the genetic testing of CF and CFTR-related disorders (ie, male infertility) amenable to the routine clinical practice, and ready to substitute classical molecular methods in medical genetics.
Jha, Aashish R.; Miles, Cecelia M.; Lippert, Nodia R.; Brown, Christopher D.; White, Kevin P.; Kreitman, Martin
2015-01-01
Complete genome resequencing of populations holds great promise in deconstructing complex polygenic traits to elucidate molecular and developmental mechanisms of adaptation. Egg size is a classic adaptive trait in insects, birds, and other taxa, but its highly polygenic architecture has prevented high-resolution genetic analysis. We used replicated experimental evolution in Drosophila melanogaster and whole-genome sequencing to identify consistent signatures of polygenic egg-size adaptation. A generalized linear-mixed model revealed reproducible allele frequency differences between replicated experimental populations selected for large and small egg volumes at approximately 4,000 single nucleotide polymorphisms (SNPs). Several hundred distinct genomic regions contain clusters of these SNPs and have lower heterozygosity than the genomic background, consistent with selection acting on polymorphisms in these regions. These SNPs are also enriched among genes expressed in Drosophila ovaries and many of these genes have well-defined functions in Drosophila oogenesis. Additional genes regulating egg development, growth, and cell size show evidence of directional selection as genes regulating these biological processes are enriched for highly differentiated SNPs. Genetic crosses performed with a subset of candidate genes demonstrated that these genes influence egg size, at least in the large genetic background. These findings confirm the highly polygenic architecture of this adaptive trait, and suggest the involvement of many novel candidate genes in regulating egg size. PMID:26044351
Detection of pathogenic Vibrio spp. in shellfish by using multiplex PCR and DNA microarrays.
Panicker, Gitika; Call, Douglas R; Krug, Melissa J; Bej, Asim K
2004-12-01
This study describes the development of a gene-specific DNA microarray coupled with multiplex PCR for the comprehensive detection of pathogenic vibrios that are natural inhabitants of warm coastal waters and shellfish. Multiplex PCR with vvh and viuB for Vibrio vulnificus, with ompU, toxR, tcpI, and hlyA for V. cholerae, and with tlh, tdh, trh, and open reading frame 8 for V. parahaemolyticus helped to ensure that total and pathogenic strains, including subtypes of the three Vibrio spp., could be detected and discriminated. For DNA microarrays, oligonucleotide probes for these targeted genes were deposited onto epoxysilane-derivatized, 12-well, Teflon-masked slides by using a MicroGrid II arrayer. Amplified PCR products were hybridized to arrays at 50 degrees C and detected by using tyramide signal amplification with Alexa Fluor 546 fluorescent dye. Slides were imaged by using an arrayWoRx scanner. The detection sensitivity for pure cultures without enrichment was 10(2) to 10(3) CFU/ml, and the specificity was 100%. However, 5 h of sample enrichment followed by DNA extraction with Instagene matrix and multiplex PCR with microarray hybridization resulted in the detection of 1 CFU in 1 g of oyster tissue homogenate. Thus, enrichment of the bacterial pathogens permitted higher sensitivity in compliance with the Interstate Shellfish Sanitation Conference guideline. Application of the DNA microarray methodology to natural oysters revealed the presence of V. vulnificus (100%) and V. parahaemolyticus (83%). However, V. cholerae was not detected in natural oysters. An assay involving a combination of multiplex PCR and DNA microarray hybridization would help to ensure rapid and accurate detection of pathogenic vibrios in shellfish, thereby improving the microbiological safety of shellfish for consumers.
Detection of Pathogenic Vibrio spp. in Shellfish by Using Multiplex PCR and DNA Microarrays
Panicker, Gitika; Call, Douglas R.; Krug, Melissa J.; Bej, Asim K.
2004-01-01
This study describes the development of a gene-specific DNA microarray coupled with multiplex PCR for the comprehensive detection of pathogenic vibrios that are natural inhabitants of warm coastal waters and shellfish. Multiplex PCR with vvh and viuB for Vibrio vulnificus, with ompU, toxR, tcpI, and hlyA for V. cholerae, and with tlh, tdh, trh, and open reading frame 8 for V. parahaemolyticus helped to ensure that total and pathogenic strains, including subtypes of the three Vibrio spp., could be detected and discriminated. For DNA microarrays, oligonucleotide probes for these targeted genes were deposited onto epoxysilane-derivatized, 12-well, Teflon-masked slides by using a MicroGrid II arrayer. Amplified PCR products were hybridized to arrays at 50°C and detected by using tyramide signal amplification with Alexa Fluor 546 fluorescent dye. Slides were imaged by using an arrayWoRx scanner. The detection sensitivity for pure cultures without enrichment was 102 to 103 CFU/ml, and the specificity was 100%. However, 5 h of sample enrichment followed by DNA extraction with Instagene matrix and multiplex PCR with microarray hybridization resulted in the detection of 1 CFU in 1 g of oyster tissue homogenate. Thus, enrichment of the bacterial pathogens permitted higher sensitivity in compliance with the Interstate Shellfish Sanitation Conference guideline. Application of the DNA microarray methodology to natural oysters revealed the presence of V. vulnificus (100%) and V. parahaemolyticus (83%). However, V. cholerae was not detected in natural oysters. An assay involving a combination of multiplex PCR and DNA microarray hybridization would help to ensure rapid and accurate detection of pathogenic vibrios in shellfish, thereby improving the microbiological safety of shellfish for consumers. PMID:15574946
Subcutaneous and gonadal adipose tissue transcriptome differences in lean and obese female dogs.
Grant, Ryan W; Vester Boler, Brittany M; Ridge, Tonya K; Graves, Thomas K; Swanson, Kelly S
2013-12-01
Canine obesity leads to shortened life span and increased disease incidence. Adipose tissue depots are known to have unique metabolic and gene expression profiles in rodents and humans, but few comparisons of depot gene expression have been performed in the dog. Using microarray technology, our objective was to identify differentially expressed genes and enriched functional pathways between subcutaneous and gonadal adipose of lean and obese dogs to better understand the pathogenesis of obesity in the dog. Because no depot × body weight status interactions were identified in the microarray data, depot differences were the primary focus. A total of 946 and 703 transcripts were differentially expressed (FDR P < 0.05) between gonadal and subcutaneous adipose tissue in obese and lean dogs respectively. Of the adipose depot-specific differences in gene expression, 162 were present in both lean and obese dogs, with the majority (85%) expressed in the same direction. Both lean and obese dog gene lists had enrichment of the complement and coagulation cascade and systemic lupus erythematosus pathways. Obese dogs had enrichment of lysosome, extracellular matrix-receptor interaction, renin-angiotensin system and hematopoietic cell lineage pathways. Lean dogs had enrichment of glutathione metabolism and synthesis and degradation of ketone bodies. We have identified a core set of genes differentially expressed between subcutaneous and gonadal adipose tissue in dogs regardless of body weight. These genes contribute to depot-specific differences in immune function, extracellular matrix remodeling and lysosomal function and may contribute to the physiological differences noted between depots. © 2013 The Authors, Animal Genetics © 2013 Stichting International Foundation for Animal Genetics.
Pyle, Angela; Hudson, Gavin; Wilson, Ian J; Coxhead, Jonathan; Smertenko, Tania; Herbert, Mary; Santibanez-Koref, Mauro; Chinnery, Patrick F
2015-05-01
Recent reports have questioned the accepted dogma that mammalian mitochondrial DNA (mtDNA) is strictly maternally inherited. In humans, the argument hinges on detecting a signature of inter-molecular recombination in mtDNA sequences sampled at the population level, inferring a paternal source for the mixed haplotypes. However, interpreting these data is fraught with difficulty, and direct experimental evidence is lacking. Using extreme-high depth mtDNA re-sequencing up to ~1.2 million-fold coverage, we find no evidence that paternal mtDNA haplotypes are transmitted to offspring in humans, thus excluding a simple dilution mechanism for uniparental transmission of mtDNA present in all healthy individuals. Our findings indicate that an active mechanism eliminates paternal mtDNA which likely acts at the molecular level.
Pyle, Angela; Hudson, Gavin; Wilson, Ian J.; Coxhead, Jonathan; Smertenko, Tania; Herbert, Mary; Santibanez-Koref, Mauro; Chinnery, Patrick F.
2015-01-01
Recent reports have questioned the accepted dogma that mammalian mitochondrial DNA (mtDNA) is strictly maternally inherited. In humans, the argument hinges on detecting a signature of inter-molecular recombination in mtDNA sequences sampled at the population level, inferring a paternal source for the mixed haplotypes. However, interpreting these data is fraught with difficulty, and direct experimental evidence is lacking. Using extreme-high depth mtDNA re-sequencing up to ~1.2 million-fold coverage, we find no evidence that paternal mtDNA haplotypes are transmitted to offspring in humans, thus excluding a simple dilution mechanism for uniparental transmission of mtDNA present in all healthy individuals. Our findings indicate that an active mechanism eliminates paternal mtDNA which likely acts at the molecular level. PMID:25973765
Lapp, Thabo; Zaher, Sarah S; Haas, Carolin T; Becker, David L; Thrasivoulou, Chris; Chain, Benjamin M; Larkin, Daniel F P; Noursadeghi, Mahdad
2015-11-01
We sought to test the hypothesis that monocytes contribute to the immunopathogenesis of corneal allograft rejection and identify therapeutic targets to inhibit monocyte recruitment. Monocytes and proinflammatory mediators within anterior chamber samples during corneal graft rejection were quantified by flow cytometry and multiplex protein assays. Lipopolysaccharide or IFN-γ stimulation of monocyte-derived macrophages (MDMs) was used to generate inflammatory conditioned media (CoM). Corneal endothelial viability was tested by nuclear counting, connexin 43, and propidium iodide staining. Chemokine and chemokine receptor expression in monocytes and MDMs was assessed in microarray transcriptomic data. The role of chemokine pathways in monocyte migration across microvascular endothelium was tested in vitro by chemokine depletion or chemokine receptor inhibitors. Inflammatory monocytes were significantly enriched in anterior chamber samples within 1 week of the onset of symptoms of corneal graft rejection. The MDM inflammatory CoM was cytopathic to transformed human corneal endothelia. This effect was also evident in endothelium of excised human cornea and increased in the presence of monocytes. Gene expression microarrays identified monocyte chemokine receptors and cognate chemokines in MDM inflammatory responses, which were also enriched in anterior chamber samples. Depletion of selected chemokines in MDM inflammatory CoM had no effect on monocyte transmigration across an endothelial blood-eye barrier, but selective chemokine receptor inhibition reduced monocyte recruitment significantly. We propose a role for inflammatory monocytes in endothelial cytotoxicity in corneal graft rejection. Therefore, targeting monocyte recruitment offers a putative novel strategy to reduce donor endothelial cell injury in survival of human corneal allografts.
Choi, Young-Jun; Fuchs, Jeremy F.; Mayhew, George F.; Yu, Helen E.; Christensen, Bruce M.
2012-01-01
Hemocytes are integral components of mosquito immune mechanisms such as phagocytosis, melanization, and production of antimicrobial peptides. However, our understanding of hemocyte-specific molecular processes and their contribution to shaping the host immune response remains limited. To better understand the immunophysiological features distinctive of hemocytes, we conducted genome-wide analysis of hemocyte-enriched transcripts, and examined how tissue-enriched expression patterns change with the immune status of the host. Our microarray data indicate that the hemocyte-enriched trascriptome is dynamic and context-dependent. Analysis of transcripts enriched after bacterial challenge in circulating hemocytes with respect to carcass added a dimension to evaluating infection-responsive genes and immune-related gene families. We resolved patterns of transcriptional change unique to hemocytes from those that are likely shared by other immune responsive tissues, and identified clusters of genes preferentially induced in hemocytes, likely reflecting their involvement in cell type specific functions. In addition, the study revealed conserved hemocyte-enriched molecular repertoires which might be implicated in core hemocyte function by cross-species meta-analysis of microarray expression data from Anopheles gambiae and Drosophila melanogaster. PMID:22796331
Genomic variants in an inbred mouse model predict mania-like behaviors.
Saul, Michael C; Stevenson, Sharon A; Zhao, Changjiu; Driessen, Terri M; Eisinger, Brian E; Gammie, Stephen C
2018-01-01
Contemporary rodent models for bipolar disorders split the bipolar spectrum into complimentary behavioral endophenotypes representing mania and depression. Widely accepted mania models typically utilize single gene transgenics or pharmacological manipulations, but inbred rodent strains show great potential as mania models. Their acceptance is often limited by the lack of genotypic data needed to establish construct validity. In this study, we used a unique strategy to inexpensively explore and confirm population allele differences in naturally occurring candidate variants in a manic rodent strain, the Madison (MSN) mouse strain. Variants were identified using whole exome resequencing on a small population of animals. Interesting candidate variants were confirmed in a larger population with genotyping. We enriched these results with observations of locomotor behavior from a previous study. Resequencing identified 447 structural variants that are mostly fixed in the MSN strain relative to control strains. After filtering and annotation, we found 11 non-synonymous MSN variants that we believe alter protein function. The allele frequencies for 6 of these variants were consistent with explanatory variants for the Madison strain's phenotype. The variants are in the Npas2, Cp, Polr3c, Smarca4, Trpv1, and Slc5a7 genes, and many of these genes' products are in pathways implicated in human bipolar disorders. Variants in Smarca4 and Polr3c together explained over 40% of the variance in locomotor behavior in the Hsd:ICR founder strain. These results enhance the MSN strain's construct validity and implicate altered nucleosome structure and transcriptional regulation as a chief molecular system underpinning behavior.
Natarajan, Sathishkumar; Kim, Hoy-Taek; Thamilarasan, Senthil Kumar; Veerappan, Karpagam; Park, Jong-In; Nou, Ill-Sup
2016-01-01
Powdery mildew is one of the most common fungal diseases in the world. This disease frequently affects melon (Cucumis melo L.) and other Cucurbitaceous family crops in both open field and greenhouse cultivation. One of the goals of genomics is to identify the polymorphic loci responsible for variation in phenotypic traits. In this study, powdery mildew disease assessment scores were calculated for four melon accessions, 'SCNU1154', 'Edisto47', 'MR-1', and 'PMR5'. To investigate the genetic variation of these accessions, whole genome re-sequencing using the Illumina HiSeq 2000 platform was performed. A total of 754,759,704 quality-filtered reads were generated, with an average of 82.64% coverage relative to the reference genome. Comparisons of the sequences for the melon accessions revealed around 7.4 million single nucleotide polymorphisms (SNPs), 1.9 million InDels, and 182,398 putative structural variations (SVs). Functional enrichment analysis of detected variations classified them into biological process, cellular component and molecular function categories. Further, a disease-associated QTL map was constructed for 390 SNPs and 45 InDels identified as related to defense-response genes. Among them 112 SNPs and 12 InDels were observed in powdery mildew responsive chromosomes. Accordingly, this whole genome re-sequencing study identified SNPs and InDels associated with defense genes that will serve as candidate polymorphisms in the search for sources of resistance against powdery mildew disease and could accelerate marker-assisted breeding in melon.
Maouche, Seraya; Poirier, Odette; Godefroy, Tiphaine; Olaso, Robert; Gut, Ivo; Collet, Jean-Phillipe; Montalescot, Gilles; Cambien, François
2008-01-01
Background In this study we assessed the respective ability of Affymetrix and Illumina microarray methodologies to answer a relevant biological question, namely the change in gene expression between resting monocytes and macrophages derived from these monocytes. Five RNA samples for each type of cell were hybridized to the two platforms in parallel. In addition, a reference list of differentially expressed genes (DEG) was generated from a larger number of hybridizations (mRNA from 86 individuals) using the RNG/MRC two-color platform. Results Our results show an important overlap of the Illumina and Affymetrix DEG lists. In addition, more than 70% of the genes in these lists were also present in the reference list. Overall the two platforms had very similar performance in terms of biological significance, evaluated by the presence in the DEG lists of an excess of genes belonging to Gene Ontology (GO) categories relevant for the biology of monocytes and macrophages. Our results support the conclusion of the MicroArray Quality Control (MAQC) project that the criteria used to constitute the DEG lists strongly influence the degree of concordance among platforms. However the importance of prioritizing genes by magnitude of effect (fold change) rather than statistical significance (p-value) to enhance cross-platform reproducibility recommended by the MAQC authors was not supported by our data. Conclusion Functional analysis based on GO enrichment demonstrates that the 2 compared technologies delivered very similar results and identified most of the relevant GO categories enriched in the reference list. PMID:18578872
Jha, Aashish R; Miles, Cecelia M; Lippert, Nodia R; Brown, Christopher D; White, Kevin P; Kreitman, Martin
2015-10-01
Complete genome resequencing of populations holds great promise in deconstructing complex polygenic traits to elucidate molecular and developmental mechanisms of adaptation. Egg size is a classic adaptive trait in insects, birds, and other taxa, but its highly polygenic architecture has prevented high-resolution genetic analysis. We used replicated experimental evolution in Drosophila melanogaster and whole-genome sequencing to identify consistent signatures of polygenic egg-size adaptation. A generalized linear-mixed model revealed reproducible allele frequency differences between replicated experimental populations selected for large and small egg volumes at approximately 4,000 single nucleotide polymorphisms (SNPs). Several hundred distinct genomic regions contain clusters of these SNPs and have lower heterozygosity than the genomic background, consistent with selection acting on polymorphisms in these regions. These SNPs are also enriched among genes expressed in Drosophila ovaries and many of these genes have well-defined functions in Drosophila oogenesis. Additional genes regulating egg development, growth, and cell size show evidence of directional selection as genes regulating these biological processes are enriched for highly differentiated SNPs. Genetic crosses performed with a subset of candidate genes demonstrated that these genes influence egg size, at least in the large genetic background. These findings confirm the highly polygenic architecture of this adaptive trait, and suggest the involvement of many novel candidate genes in regulating egg size. © The Author 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
CoPub: a literature-based keyword enrichment tool for microarray data analysis.
Frijters, Raoul; Heupers, Bart; van Beek, Pieter; Bouwhuis, Maurice; van Schaik, René; de Vlieg, Jacob; Polman, Jan; Alkema, Wynand
2008-07-01
Medline is a rich information source, from which links between genes and keywords describing biological processes, pathways, drugs, pathologies and diseases can be extracted. We developed a publicly available tool called CoPub that uses the information in the Medline database for the biological interpretation of microarray data. CoPub allows batch input of multiple human, mouse or rat genes and produces lists of keywords from several biomedical thesauri that are significantly correlated with the set of input genes. These lists link to Medline abstracts in which the co-occurring input genes and correlated keywords are highlighted. Furthermore, CoPub can graphically visualize differentially expressed genes and over-represented keywords in a network, providing detailed insight in the relationships between genes and keywords, and revealing the most influential genes as highly connected hubs. CoPub is freely accessible at http://services.nbic.nl/cgi-bin/copub/CoPub.pl.
Transcriptional Landscape of the Prenatal Human Brain
Miller, Jeremy A.; Ding, Song-Lin; Sunkin, Susan M.; Smith, Kimberly A; Ng, Lydia; Szafer, Aaron; Ebbert, Amanda; Riley, Zackery L.; Aiona, Kaylynn; Arnold, James M.; Bennet, Crissa; Bertagnolli, Darren; Brouner, Krissy; Butler, Stephanie; Caldejon, Shiella; Carey, Anita; Cuhaciyan, Christine; Dalley, Rachel A.; Dee, Nick; Dolbeare, Tim A.; Facer, Benjamin A. C.; Feng, David; Fliss, Tim P.; Gee, Garrett; Goldy, Jeff; Gourley, Lindsey; Gregor, Benjamin W.; Gu, Guangyu; Howard, Robert E.; Jochim, Jayson M.; Kuan, Chihchau L.; Lau, Christopher; Lee, Chang-Kyu; Lee, Felix; Lemon, Tracy A.; Lesnar, Phil; McMurray, Bergen; Mastan, Naveed; Mosqueda, Nerick F.; Naluai-Cecchini, Theresa; Ngo, Nhan-Kiet; Nyhus, Julie; Oldre, Aaron; Olson, Eric; Parente, Jody; Parker, Patrick D.; Parry, Sheana E.; Player, Allison Stevens; Pletikos, Mihovil; Reding, Melissa; Royall, Joshua J.; Roll, Kate; Sandman, David; Sarreal, Melaine; Shapouri, Sheila; Shapovalova, Nadiya V.; Shen, Elaine H.; Sjoquist, Nathan; Slaughterbeck, Clifford R.; Smith, Michael; Sodt, Andy J.; Williams, Derric; Zöllei, Lilla; Fischl, Bruce; Gerstein, Mark B.; Geschwind, Daniel H.; Glass, Ian A.; Hawrylycz, Michael J.; Hevner, Robert F.; Huang, Hao; Jones, Allan R.; Knowles, James A.; Levitt, Pat; Phillips, John W.; Sestan, Nenad; Wohnoutka, Paul; Dang, Chinh; Bernard, Amy; Hohmann, John G.; Lein, Ed S.
2014-01-01
Summary The anatomical and functional architecture of the human brain is largely determined by prenatal transcriptional processes. We describe an anatomically comprehensive atlas of mid-gestational human brain, including de novo reference atlases, in situ hybridization, ultra-high resolution magnetic resonance imaging (MRI) and microarray analysis on highly discrete laser microdissected brain regions. In developing cerebral cortex, transcriptional differences are found between different proliferative and postmitotic layers, wherein laminar signatures reflect cellular composition and developmental processes. Cytoarchitectural differences between human and mouse have molecular correlates, including species differences in gene expression in subplate, although surprisingly we find minimal differences between the inner and human-expanded outer subventricular zones. Both germinal and postmitotic cortical layers exhibit fronto-temporal gradients, with particular enrichment in frontal lobe. Finally, many neurodevelopmental disorder and human evolution-related genes show patterned expression, potentially underlying unique features of human cortical formation. These data provide a rich, freely-accessible resource for understanding human brain development. PMID:24695229
Insights into TREM2 biology by network analysis of human brain gene expression data
Forabosco, Paola; Ramasamy, Adaikalavan; Trabzuni, Daniah; Walker, Robert; Smith, Colin; Bras, Jose; Levine, Adam P.; Hardy, John; Pocock, Jennifer M.; Guerreiro, Rita; Weale, Michael E.; Ryten, Mina
2013-01-01
Rare variants in TREM2 cause susceptibility to late-onset Alzheimer's disease. Here we use microarray-based expression data generated from 101 neuropathologically normal individuals and covering 10 brain regions, including the hippocampus, to understand TREM2 biology in human brain. Using network analysis, we detect a highly preserved TREM2-containing module in human brain, show that it relates to microglia, and demonstrate that TREM2 is a hub gene in 5 brain regions, including the hippocampus, suggesting that it can drive module function. Using enrichment analysis we show significant overrepresentation of genes implicated in the adaptive and innate immune system. Inspection of genes with the highest connectivity to TREM2 suggests that it plays a key role in mediating changes in the microglial cytoskeleton necessary not only for phagocytosis, but also migration. Most importantly, we show that the TREM2-containing module is significantly enriched for genes genetically implicated in Alzheimer's disease, multiple sclerosis, and motor neuron disease, implying that these diseases share common pathways centered on microglia and that among the genes identified are possible new disease-relevant genes. PMID:23855984
On-Chip, Amplification-Free Quantification of Nucleic Acid for Point-of-Care Diagnosis
NASA Astrophysics Data System (ADS)
Yen, Tony Minghung
This dissertation demonstrates three physical device concepts to overcome limitations in point-of-care quantification of nucleic acids. Enabling sensitive, high throughput nucleic acid quantification on a chip, outside of hospital and centralized laboratory setting, is crucial for improving pathogen detection and cancer diagnosis and prognosis. Among existing platforms, microarray have the advantages of being amplification free, low instrument cost, and high throughput, but are generally less sensitive compared to sequencing and PCR assays. To bridge this performance gap, this dissertation presents theoretical and experimental progress to develop a platform nucleic acid quantification technology that is drastically more sensitive than current microarrays while compatible with microarray architecture. The first device concept explores on-chip nucleic acid enrichment by natural evaporation of nucleic acid solution droplet. Using a micro-patterned super-hydrophobic black silicon array device, evaporative enrichment is coupled with nano-liter droplet self-assembly workflow to produce a 50 aM concentration sensitivity, 6 orders of dynamic range, and rapid hybridization time at under 5 minutes. The second device concept focuses on improving target copy number sensitivity, instead of concentration sensitivity. A comprehensive microarray physical model taking into account of molecular transport, electrostatic intermolecular interactions, and reaction kinetics is considered to guide device optimization. Device pattern size and target copy number are optimized based on model prediction to achieve maximal hybridization efficiency. At a 100-mum pattern size, a quantum leap in detection limit of 570 copies is achieved using black silicon array device with self-assembled pico-liter droplet workflow. Despite its merits, evaporative enrichment on black silicon device suffers from coffee-ring effect at 100-mum pattern size, and thus not compatible with clinical patient samples. The third device concept utilizes an integrated optomechanical laser system and a Cytop microarray device to reverse coffee-ring effect during evaporative enrichment at 100-mum pattern size. This method, named "laser-induced differential evaporation" is expected to enable 570 copies detection limit for clinical samples in near future. While the work is ongoing as of the writing of this dissertation, a clear research plan is in place to implement this method on microarray platform toward clinical sample testing for disease applications and future commercialization.
Plantier, Laurent; Renaud, Hélène; Respaud, Renaud; Marchand-Adam, Sylvain; Crestani, Bruno
2016-12-13
Heritable profibrotic differentiation of lung fibroblasts is a key mechanism of idiopathic pulmonary fibrosis (IPF). Its mechanisms are yet to be fully understood. In this study, individual data from four independent microarray studies comparing the transcriptome of fibroblasts cultured in vitro from normal (total n = 20) and IPF (total n = 20) human lung were compiled for meta-analysis following normalization to z-scores. One hundred and thirteen transcripts were upregulated and 115 were downregulated in IPF fibroblasts using the Significance Analysis of Microrrays algorithm with a false discovery rate of 5%. Downregulated genes were highly enriched for Gene Ontology and Kyoto Encyclopedia of Genes and Genomes (KEGG) functional classes related to inflammation and immunity such as Defense response to virus, Influenza A, tumor necrosis factor (TNF) mediated signaling pathway, interferon-inducible absent in melanoma2 (AIM2) inflammasome as well as Apoptosis. Although upregulated genes were not enriched for any functional class, select factors known to play key roles in lung fibrogenesis were overexpressed in IPF fibroblasts, most notably connective tissue growth factor ( CTGF ) and serum response factor ( SRF ), supporting their role as drivers of IPF. The full data table is available as a supplement.
Natural and Unanticipated Modifiers of RNAi Activity in Caenorhabditis elegans
Asad, Nadeem; Aw, Wen Yih; Timmons, Lisa
2012-01-01
Organisms used as model genomics systems are maintained as isogenic strains, yet evidence of sequence differences between independently maintained wild-type stocks has been substantiated by whole-genome resequencing data and strain-specific phenotypes. Sequence differences may arise from replication errors, transposon mobilization, meiotic gene conversion, or environmental or chemical assault on the genome. Low frequency alleles or mutations with modest effects on phenotypes can contribute to natural variation, and it has proven possible for such sequences to become fixed by adapted evolutionary enrichment and identified by resequencing. Our objective was to identify and analyze single locus genetic defects leading to RNAi resistance in isogenic strains of Caenorhabditis elegans. In so doing, we uncovered a mutation that arose de novo in an existing strain, which initially frustrated our phenotypic analysis. We also report experimental, environmental, and genetic conditions that can complicate phenotypic analysis of RNAi pathway defects. These observations highlight the potential for unanticipated mutations, coupled with genetic and environmental phenomena, to enhance or suppress the effects of known mutations and cause variation between wild-type strains. PMID:23209671
Regulatory variation: an emerging vantage point for cancer biology.
Li, Luolan; Lorzadeh, Alireza; Hirst, Martin
2014-01-01
Transcriptional regulation involves complex and interdependent interactions of noncoding and coding regions of the genome with proteins that interact and modify them. Genetic variation/mutation in coding and noncoding regions of the genome can drive aberrant transcription and disease. In spite of accounting for nearly 98% of the genome comparatively little is known about the contribution of noncoding DNA elements to disease. Genome-wide association studies of complex human diseases including cancer have revealed enrichment for variants in the noncoding genome. A striking finding of recent cancer genome re-sequencing efforts has been the previously underappreciated frequency of mutations in epigenetic modifiers across a wide range of cancer types. Taken together these results point to the importance of dysregulation in transcriptional regulatory control in genesis of cancer. Powered by recent technological advancements in functional genomic profiling, exploration of normal and transformed regulatory networks will provide novel insight into the initiation and progression of cancer and open new windows to future prognostic and diagnostic tools. © 2013 Wiley Periodicals, Inc.
Elkins, C A; Kotewicz, M L; Jackson, S A; Lacher, D W; Abu-Ali, G S; Patel, I R
2013-01-01
Modern risk control and food safety practices involving food-borne bacterial pathogens are benefiting from new genomic technologies for rapid, yet highly specific, strain characterisations. Within the United States Food and Drug Administration (USFDA) Center for Food Safety and Applied Nutrition (CFSAN), optical genome mapping and DNA microarray genotyping have been used for several years to quickly assess genomic architecture and gene content, respectively, for outbreak strain subtyping and to enhance retrospective trace-back analyses. The application and relative utility of each method varies with outbreak scenario and the suspect pathogen, with comparative analytical power enhanced by database scale and depth. Integration of these two technologies allows high-resolution scrutiny of the genomic landscapes of enteric food-borne pathogens with notable examples including Shiga toxin-producing Escherichia coli (STEC) and Salmonella enterica serovars from a variety of food commodities. Moreover, the recent application of whole genome sequencing technologies to food-borne pathogen outbreaks and surveillance has enhanced resolution to the single nucleotide scale. This new wealth of sequence data will support more refined next-generation custom microarray designs, targeted re-sequencing and "genomic signature recognition" approaches involving a combination of genes and single nucleotide polymorphism detection to distil strain-specific fingerprinting to a minimised scale. This paper examines the utility of microarrays and optical mapping in analysing outbreaks, reviews best practices and the limits of these technologies for pathogen differentiation, and it considers future integration with whole genome sequencing efforts.
He, W; Zhao, S; Liu, X; Dong, S; Lv, J; Liu, D; Wang, J; Meng, Z
2013-12-04
Large-scale next-generation sequencing (NGS)-based resequencing detects sequence variations, constructs evolutionary histories, and identifies phenotype-related genotypes. However, NGS-based resequencing studies generate extraordinarily large amounts of data, making computations difficult. Effective use and analysis of these data for NGS-based resequencing studies remains a difficult task for individual researchers. Here, we introduce ReSeqTools, a full-featured toolkit for NGS (Illumina sequencing)-based resequencing analysis, which processes raw data, interprets mapping results, and identifies and annotates sequence variations. ReSeqTools provides abundant scalable functions for routine resequencing analysis in different modules to facilitate customization of the analysis pipeline. ReSeqTools is designed to use compressed data files as input or output to save storage space and facilitates faster and more computationally efficient large-scale resequencing studies in a user-friendly manner. It offers abundant practical functions and generates useful statistics during the analysis pipeline, which significantly simplifies resequencing analysis. Its integrated algorithms and abundant sub-functions provide a solid foundation for special demands in resequencing projects. Users can combine these functions to construct their own pipelines for other purposes.
Haitsma, Jack J.; Furmli, Suleiman; Masoom, Hussain; Liu, Mingyao; Imai, Yumiko; Slutsky, Arthur S.; Beyene, Joseph; Greenwood, Celia M. T.; dos Santos, Claudia
2012-01-01
Objectives To perform a meta-analysis of gene expression microarray data from animal studies of lung injury, and to identify an injury-specific gene expression signature capable of predicting the development of lung injury in humans. Methods We performed a microarray meta-analysis using 77 microarray chips across six platforms, two species and different animal lung injury models exposed to lung injury with or/and without mechanical ventilation. Individual gene chips were classified and grouped based on the strategy used to induce lung injury. Effect size (change in gene expression) was calculated between non-injurious and injurious conditions comparing two main strategies to pool chips: (1) one-hit and (2) two-hit lung injury models. A random effects model was used to integrate individual effect sizes calculated from each experiment. Classification models were built using the gene expression signatures generated by the meta-analysis to predict the development of lung injury in human lung transplant recipients. Results Two injury-specific lists of differentially expressed genes generated from our meta-analysis of lung injury models were validated using external data sets and prospective data from animal models of ventilator-induced lung injury (VILI). Pathway analysis of gene sets revealed that both new and previously implicated VILI-related pathways are enriched with differentially regulated genes. Classification model based on gene expression signatures identified in animal models of lung injury predicted development of primary graft failure (PGF) in lung transplant recipients with larger than 80% accuracy based upon injury profiles from transplant donors. We also found that better classifier performance can be achieved by using meta-analysis to identify differentially-expressed genes than using single study-based differential analysis. Conclusion Taken together, our data suggests that microarray analysis of gene expression data allows for the detection of “injury" gene predictors that can classify lung injury samples and identify patients at risk for clinically relevant lung injury complications. PMID:23071521
Gene Expression Profiling of Gastric Cancer
Marimuthu, Arivusudar; Jacob, Harrys K.C.; Jakharia, Aniruddha; Subbannayya, Yashwanth; Keerthikumar, Shivakumar; Kashyap, Manoj Kumar; Goel, Renu; Balakrishnan, Lavanya; Dwivedi, Sutopa; Pathare, Swapnali; Dikshit, Jyoti Bajpai; Maharudraiah, Jagadeesha; Singh, Sujay; Sameer Kumar, Ghantasala S; Vijayakumar, M.; Veerendra Kumar, Kariyanakatte Veeraiah; Premalatha, Chennagiri Shrinivasamurthy; Tata, Pramila; Hariharan, Ramesh; Roa, Juan Carlos; Prasad, T.S.K; Chaerkady, Raghothama; Kumar, Rekha Vijay; Pandey, Akhilesh
2015-01-01
Gastric cancer is the second leading cause of cancer death worldwide, both in men and women. A genomewide gene expression analysis was carried out to identify differentially expressed genes in gastric adenocarcinoma tissues as compared to adjacent normal tissues. We used Agilent’s whole human genome oligonucleotide microarray platform representing ~41,000 genes to carry out gene expression analysis. Two-color microarray analysis was employed to directly compare the expression of genes between tumor and normal tissues. Through this approach, we identified several previously known candidate genes along with a number of novel candidate genes in gastric cancer. Testican-1 (SPOCK1) was one of the novel molecules that was 10-fold upregulated in tumors. Using tissue microarrays, we validated the expression of testican-1 by immunohistochemical staining. It was overexpressed in 56% (160/282) of the cases tested. Pathway analysis led to the identification of several networks in which SPOCK1 was among the topmost networks of interacting genes. By gene enrichment analysis, we identified several genes involved in cell adhesion and cell proliferation to be significantly upregulated while those corresponding to metabolic pathways were significantly downregulated. The differentially expressed genes identified in this study are candidate biomarkers for gastric adenoacarcinoma. PMID:27030788
Zhang, Qian; Gou, Wenyu; Wang, Xiaotong; Zhang, Yawen; Ma, Jun; Zhang, Hongliang; Zhang, Ying; Zhang, Hao
2016-01-01
Tibetan chicken, unlike their lowland counterparts, exhibit specific adaptations to high-altitude conditions. The genetic mechanisms of such adaptations in highland chickens were determined by resequencing the genomes of four highland (Tibetan and Lhasa White) and four lowland (White Leghorn, Lindian, and Chahua) chicken populations. Our results showed an evident genetic admixture in Tibetan chickens, suggesting a history of introgression from lowland gene pools. Genes showing positive selection in highland populations were related to cardiovascular and respiratory system development, DNA repair, response to radiation, inflammation, and immune responses, indicating a strong adaptation to oxygen scarcity and high-intensity solar radiation. The distribution of allele frequencies of nonsynonymous single nucleotide polymorphisms between highland and lowland populations was analyzed using chi-square test, which showed that several differentially distributed genes with missense mutations were enriched in several functional categories, especially in blood vessel development and adaptations to hypoxia and intense radiation. RNA sequencing revealed that several differentially expressed genes were enriched in gene ontology terms related to blood vessel and respiratory system development. Several candidate genes involved in the development of cardiorespiratory system (FGFR1, CTGF, ADAM9, JPH2, SATB1, BMP4, LOX, LPR, ANGPTL4, and HYAL1), inflammation and immune responses (AIRE, MYO1F, ZAP70, DDX60, CCL19, CD47, JSC, and FAS), DNA repair, and responses to radiation (VCP, ASH2L, and FANCG) were identified to play key roles in the adaptation to high-altitude conditions. Our data provide new insights into the unique adaptations of highland animals to extreme environments. PMID:26907498
2012-01-01
Background DNA microarrays are used both for research and for diagnostics. In research, Affymetrix arrays are commonly used for genome wide association studies, resequencing, and for gene expression analysis. These arrays provide large amounts of data. This data is analyzed using statistical methods that quite often discard a large portion of the information. Most of the information that is lost comes from probes that systematically fail across chips and from batch effects. The aim of this study was to develop a comprehensive model for hybridization that predicts probe intensities for Affymetrix arrays and that could provide a basis for improved microarray analysis and probe development. The first part of the model calculates probe binding affinities to all the possible targets in the hybridization solution using the Langmuir isotherm. In the second part of the model we integrate details that are specific to each experiment and contribute to the differences between hybridization in solution and on the microarray. These details include fragmentation, wash stringency, temperature, salt concentration, and scanner settings. Furthermore, the model fits probe synthesis efficiency and target concentration parameters directly to the data. All the parameters used in the model have a well-established physical origin. Results For the 302 chips that were analyzed the mean correlation between expected and observed probe intensities was 0.701 with a range of 0.88 to 0.55. All available chips were included in the analysis regardless of the data quality. Our results show that batch effects arise from differences in probe synthesis, scanner settings, wash strength, and target fragmentation. We also show that probe synthesis efficiencies for different nucleotides are not uniform. Conclusions To date this is the most complete model for binding on microarrays. This is the first model that includes both probe synthesis efficiency and hybridization kinetics/cross-hybridization. These two factors are sequence dependent and have a large impact on probe intensity. The results presented here provide novel insight into the effect of probe synthesis errors on Affymetrix microarrays; furthermore, the algorithms developed in this work provide useful tools for the analysis of cross-hybridization, probe synthesis efficiency, fragmentation, wash stringency, temperature, and salt concentration on microarray intensities. PMID:23270536
Chang, Ho-Won; Sung, Youlboong; Kim, Kyoung-Ho; Nam, Young-Do; Roh, Seong Woon; Kim, Min-Soo; Jeon, Che Ok; Bae, Jin-Woo
2008-08-15
A crucial problem in the use of previously developed genome-probing microarrays (GPM) has been the inability to use uncultivated bacterial genomes to take advantage of the high sensitivity and specificity of GPM in microbial detection and monitoring. We show here a method, digital multiple displacement amplification (MDA), to amplify and analyze various genomes obtained from single uncultivated bacterial cells. We used 15 genomes from key microbes involved in dichloromethane (DCM)-dechlorinating enrichment as microarray probes to uncover the bacterial population dynamics of samples without PCR amplification. Genomic DNA amplified from single cells originating from uncultured bacteria with 80.3-99.4% similarity to 16S rRNA genes of cultivated bacteria. The digital MDA-GPM method successfully monitored the dynamics of DCM-dechlorinating communities from different phases of enrichment status. Without a priori knowledge of microbial diversity, the digital MDA-GPM method could be designed to monitor most microbial populations in a given environmental sample.
Tao, Zhihua; Gao, Peng; Liu, Hung-Wen
2009-12-15
Poly(ADP-ribosyl)ation of various nuclear proteins catalyzed by a family of NAD(+)-dependent enzymes, poly(ADP-ribose) polymerases (PARPs), is an important posttranslational modification reaction. PARP activity has been demonstrated in all types of eukaryotic cells with the exception of yeast, in which the expression of human PARP-1 was shown to lead to retarded cell growth. We investigated the yeast growth inhibition caused by human PARP-1 expression in Saccharomyces cerevisiae. Flow cytometry analysis reveals that PARP-1-expressing yeast cells accumulate in the G(2)/M stage of the cell cycle. Confocal microscopy analysis shows that human PARP-1 is distributed throughout the nucleus of yeast cells but is enriched in the nucleolus. Utilizing yeast proteome microarray screening, we identified 33 putative PARP-1 substrates, six of which are known to be involved in ribosome biogenesis. The poly(ADP-ribosyl)ation of three of these yeast proteins, together with two human homologues, was confirmed by an in vitro PARP-1 assay. Finally, a polysome profile analysis using sucrose gradient ultracentrifugation demonstrated that the ribosome levels in yeast cells expressing PARP-1 are lower than those in control yeast cells. Overall, our data suggest that human PARP-1 may affect ribosome biogenesis by modifying certain nucleolar proteins in yeast. The artificial PARP-1 pathway in yeast may be used as a simple platform to identify substrates and verify function of this important enzyme.
Boltaña, Sebastian; Castellana, Barbara; Goetz, Giles; Tort, Lluis; Teles, Mariana; Mulero, Victor; Novoa, Beatriz; Figueras, Antonio; Goetz, Frederick W; Gallardo-Escarate, Cristian; Planas, Josep V; Mackenzie, Simon
2017-02-03
This study describes the development and validation of an enriched oligonucleotide-microarray platform for Sparus aurata (SAQ) to provide a platform for transcriptomic studies in this species. A transcriptome database was constructed by assembly of gilthead sea bream sequences derived from public repositories of mRNA together with reads from a large collection of expressed sequence tags (EST) from two extensive targeted cDNA libraries characterizing mRNA transcripts regulated by both bacterial and viral challenge. The developed microarray was further validated by analysing monocyte/macrophage activation profiles after challenge with two Gram-negative bacterial pathogen-associated molecular patterns (PAMPs; lipopolysaccharide (LPS) and peptidoglycan (PGN)). Of the approximately 10,000 EST sequenced, we obtained a total of 6837 EST longer than 100 nt, with 3778 and 3059 EST obtained from the bacterial-primed and from the viral-primed cDNA libraries, respectively. Functional classification of contigs from the bacterial- and viral-primed cDNA libraries by Gene Ontology (GO) showed that the top five represented categories were equally represented in the two libraries: metabolism (approximately 24% of the total number of contigs), carrier proteins/membrane transport (approximately 15%), effectors/modulators and cell communication (approximately 11%), nucleoside, nucleotide and nucleic acid metabolism (approximately 7.5%) and intracellular transducers/signal transduction (approximately 5%). Transcriptome analyses using this enriched oligonucleotide platform identified differential shifts in the response to PGN and LPS in macrophage-like cells, highlighting responsive gene-cassettes tightly related to PAMP host recognition. As observed in other fish species, PGN is a powerful activator of the inflammatory response in S. aurata macrophage-like cells. We have developed and validated an oligonucleotide microarray (SAQ) that provides a platform enriched for the study of gene expression in S. aurata with an emphasis upon immunity and the immune response.
A Human Lectin Microarray for Sperm Surface Glycosylation Analysis *
Sun, Yangyang; Cheng, Li; Gu, Yihua; Xin, Aijie; Wu, Bin; Zhou, Shumin; Guo, Shujuan; Liu, Yin; Diao, Hua; Shi, Huijuan; Wang, Guangyu; Tao, Sheng-ce
2016-01-01
Glycosylation is one of the most abundant and functionally important protein post-translational modifications. As such, technology for efficient glycosylation analysis is in high demand. Lectin microarrays are a powerful tool for such investigations and have been successfully applied for a variety of glycobiological studies. However, most of the current lectin microarrays are primarily constructed from plant lectins, which are not well suited for studies of human glycosylation because of the extreme complexity of human glycans. Herein, we constructed a human lectin microarray with 60 human lectin and lectin-like proteins. All of the lectins and lectin-like proteins were purified from yeast, and most showed binding to human glycans. To demonstrate the applicability of the human lectin microarray, human sperm were probed on the microarray and strong bindings were observed for several lectins, including galectin-1, 7, 8, GalNAc-T6, and ERGIC-53 (LMAN1). These bindings were validated by flow cytometry and fluorescence immunostaining. Further, mass spectrometry analysis showed that galectin-1 binds several membrane-associated proteins including heat shock protein 90. Finally, functional assays showed that binding of galectin-8 could significantly enhance the acrosome reaction within human sperms. To our knowledge, this is the first construction of a human lectin microarray, and we anticipate it will find wide use for a range of human or mammalian studies, alone or in combination with plant lectin microarrays. PMID:27364157
Couture, Camille; Zaniolo, Karine; Carrier, Patrick; Lake, Jennifer; Patenaude, Julien; Germain, Lucie; Guérin, Sylvain L
2016-02-01
Corneal injuries remain a major cause of consultation in the ophthalmology clinics worldwide. Repair of corneal wounds is a complex mechanism that involves cell death, migration, proliferation, differentiation, and extracellular matrix (ECM) remodeling. In the present study, we used a tissue-engineered, two-layers (epithelium and stroma) human cornea as a biomaterial to study both the cellular and molecular mechanisms of wound healing. Gene profiling on microarrays revealed important alterations in the pattern of genes expressed by tissue-engineered corneas in response to wound healing. Expression of many MMPs-encoding genes was shown by microarray and qPCR analyses to increase in the migrating epithelium of wounded corneas. Many of these enzymes were converted into their enzymatically active form as wound closure proceeded. In addition, expression of MMPs by human corneal epithelial cells (HCECs) was affected both by the stromal fibroblasts and the collagen-enriched ECM they produce. Most of all, results from mass spectrometry analyses provided evidence that a fully stratified epithelium is required for proper synthesis and organization of the ECM on which the epithelial cells adhere. In conclusion, and because of the many characteristics it shares with the native cornea, this human two layers corneal substitute may prove particularly useful to decipher the mechanistic details of corneal wound healing. Copyright © 2015 Elsevier Ltd. All rights reserved.
Elisa, Baldelli; B., Haura Eric; Lucio, Crinò; Douglas, Cress W.; Vienna, Ludovini; B., Schabath Matthew; A., Liotta Lance; F., Petricoin Emanuel; Mariaelena, Pierobon
2015-01-01
Purpose The aim of this study was to evaluate whether upfront cellular enrichment via laser capture microdissection is necessary for accurately quantifying predictive biomarkers in non-small cell lung cancer tumors. Experimental design Fifteen snap frozen surgical biopsies were analyzed. Whole tissue lysate and matched highly enriched tumor epithelium via laser capture microdissection (LCM) were obtained for each patient. The expression and activation/phosphorylation levels of 26 proteins were measured by reverse phase protein microarray. Differences in signaling architecture of dissected and undissected matched pairs were visualized using unsupervised clustering analysis, bar graphs, and scatter plots. Results Overall patient matched LCM and undissected material displayed very distinct and differing signaling architectures with 93% of the matched pairs clustering separately. These differences were seen regardless of the amount of starting tumor epithelial content present in the specimen. Conclusions and clinical relevance These results indicate that LCM driven upfront cellular enrichment is necessary to accurately determine the expression/activation levels of predictive protein signaling markers although results should be evaluated in larger clinical settings. Upfront cellular enrichment of the target cell appears to be an important part of the workflow needed for the accurate quantification of predictive protein signaling biomarkers. Larger independent studies are warranted. PMID:25676683
High-throughput discovery of rare human nucleotide polymorphisms by Ecotilling
Till, Bradley J.; Zerr, Troy; Bowers, Elisabeth; Greene, Elizabeth A.; Comai, Luca; Henikoff, Steven
2006-01-01
Human individuals differ from one another at only ∼0.1% of nucleotide positions, but these single nucleotide differences account for most heritable phenotypic variation. Large-scale efforts to discover and genotype human variation have been limited to common polymorphisms. However, these efforts overlook rare nucleotide changes that may contribute to phenotypic diversity and genetic disorders, including cancer. Thus, there is an increasing need for high-throughput methods to robustly detect rare nucleotide differences. Toward this end, we have adapted the mismatch discovery method known as Ecotilling for the discovery of human single nucleotide polymorphisms. To increase throughput and reduce costs, we developed a universal primer strategy and implemented algorithms for automated band detection. Ecotilling was validated by screening 90 human DNA samples for nucleotide changes in 5 gene targets and by comparing results to public resequencing data. To increase throughput for discovery of rare alleles, we pooled samples 8-fold and found Ecotilling to be efficient relative to resequencing, with a false negative rate of 5% and a false discovery rate of 4%. We identified 28 new rare alleles, including some that are predicted to damage protein function. The detection of rare damaging mutations has implications for models of human disease. PMID:16893952
A new DPYD genotyping assay for improving the safety of 5-fluorouracil therapy.
Sistonen, Johanna; Smith, Chingying; Fu, Yung-Kang; Largiadèr, Carlo R
2012-12-24
Chemotherapeutic use of 5-fluorouracil (5FU) is compromised by 10-20% of patients developing severe toxicity. Recently described genetic variation in dihydropyrimidine dehydrogenase (DPYD) has been shown to be a major predictor of 5FU toxicity. Here, we describe a new genotyping assay for routine clinical use that covers all the major DPYD risk variants. Genomic regions targeting DPYD risk variants (c.1129-5923C>G, c.1679T>G/A, c.1905+1G>A, c.2846A>T) and additional markers (c.234-123G>C, c.496A>G, c.775A>G) were amplified in a multiplex PCR reaction. The subsequent steps including allele-specific primer extension, hybridization of the primers to a microarray, scanning of the array, and data analysis were automated within the INFINITI® Analyzer (AutoGenomics). The assay was validated by analyzing 107 blood samples obtained from patients previously re-sequenced for the DPYD. The genotypes obtained with the developed assay were 100% concordant with the re-sequencing. The procedure is suitable for routine clinical use since the results are obtained within one day. For heterozygous risk variant carriers (~7% of Europeans), the treatment can be adjusted by 5FU dose reduction, whereas carriers of two risk alleles should be treated with an alternative therapy. The developed assay provides a novel tool to improve the safety of commonly used 5FU-based chemotherapies. Copyright © 2012 Elsevier B.V. All rights reserved.
A genome-wide scan for signatures of directional selection in domesticated pigs.
Moon, Sunjin; Kim, Tae-Hun; Lee, Kyung-Tai; Kwak, Woori; Lee, Taeheon; Lee, Si-Woo; Kim, Myung-Jick; Cho, Kyuho; Kim, Namshin; Chung, Won-Hyong; Sung, Samsun; Park, Taesung; Cho, Seoae; Groenen, Martien Am; Nielsen, Rasmus; Kim, Yuseob; Kim, Heebal
2015-02-25
Animal domestication involved drastic phenotypic changes driven by strong artificial selection and also resulted in new populations of breeds, established by humans. This study aims to identify genes that show evidence of recent artificial selection during pig domestication. Whole-genome resequencing of 30 individual pigs from domesticated breeds, Landrace and Yorkshire, and 10 Asian wild boars at ~16-fold coverage was performed resulting in over 4.3 million SNPs for 19,990 genes. We constructed a comprehensive genome map of directional selection by detecting selective sweeps using an F ST-based approach that detects directional selection in lineages leading to the domesticated breeds and using a haplotype-based test that detects ongoing selective sweeps within the breeds. We show that candidate genes under selection are significantly enriched for loci implicated in quantitative traits important to pig reproduction and production. The candidate gene with the strongest signals of directional selection belongs to group III of the metabolomics glutamate receptors, known to affect brain functions associated with eating behavior, suggesting that loci under strong selection include loci involved in behaviorial traits in domesticated pigs including tameness. We show that a significant proportion of selection signatures coincide with loci that were previously inferred to affect phenotypic variation in pigs. We further identify functional enrichment related to behavior, such as signal transduction and neuronal activities, for those targets of selection during domestication in pigs.
2010-01-01
Background Suppression subtractive hybridization is a popular technique for gene discovery from non-model organisms without an annotated genome sequence, such as cowpea (Vigna unguiculata (L.) Walp). We aimed to use this method to enrich for genes expressed during drought stress in a drought tolerant cowpea line. However, current methods were inefficient in screening libraries and management of the sequence data, and thus there was a need to develop software tools to facilitate the process. Results Forward and reverse cDNA libraries enriched for cowpea drought response genes were screened on microarrays, and the R software package SSHscreen 2.0.1 was developed (i) to normalize the data effectively using spike-in control spot normalization, and (ii) to select clones for sequencing based on the calculation of enrichment ratios with associated statistics. Enrichment ratio 3 values for each clone showed that 62% of the forward library and 34% of the reverse library clones were significantly differentially expressed by drought stress (adjusted p value < 0.05). Enrichment ratio 2 calculations showed that > 88% of the clones in both libraries were derived from rare transcripts in the original tester samples, thus supporting the notion that suppression subtractive hybridization enriches for rare transcripts. A set of 118 clones were chosen for sequencing, and drought-induced cowpea genes were identified, the most interesting encoding a late embryogenesis abundant Lea5 protein, a glutathione S-transferase, a thaumatin, a universal stress protein, and a wound induced protein. A lipid transfer protein and several components of photosynthesis were down-regulated by the drought stress. Reverse transcriptase quantitative PCR confirmed the enrichment ratio values for the selected cowpea genes. SSHdb, a web-accessible database, was developed to manage the clone sequences and combine the SSHscreen data with sequence annotations derived from BLAST and Blast2GO. The self-BLAST function within SSHdb grouped redundant clones together and illustrated that the SSHscreen plots are a useful tool for choosing anonymous clones for sequencing, since redundant clones cluster together on the enrichment ratio plots. Conclusions We developed the SSHscreen-SSHdb software pipeline, which greatly facilitates gene discovery using suppression subtractive hybridization by improving the selection of clones for sequencing after screening the library on a small number of microarrays. Annotation of the sequence information and collaboration was further enhanced through a web-based SSHdb database, and we illustrated this through identification of drought responsive genes from cowpea, which can now be investigated in gene function studies. SSH is a popular and powerful gene discovery tool, and therefore this pipeline will have application for gene discovery in any biological system, particularly non-model organisms. SSHscreen 2.0.1 and a link to SSHdb are available from http://microarray.up.ac.za/SSHscreen. PMID:20359330
Kosti, Adam; Harry Chen, Hung-I; Mohan, Sumathy; Liang, Sitai; Chen, Yidong; Habib, Samy L.
2015-01-01
Recent study from our laboratory showed that patients with diabetes are at a higher risk of developing kidney cancer. In the current study, we have screened whole human DNA genome from healthy control, patients with diabetes or renal cell carcinoma (RCC) or RCC+diabetes. We found that 883 genes gain/163 genes loss of copy number in RCC+diabetes group, 669 genes gain/307 genes loss in RCC group and 458 genes gain/38 genes loss of copy number in diabetes group, after removing gain/loss genes obtained from healthy control group. Data analyzed for functional annotation enrichment pathways showed that control group had the highest number (280) of enriched pathways, 191 in diabetes+RCC group, 148 in RCC group, and 81 in diabetes group. The overlap GO pathways between RCC+diabetes and RCC groups showed that nine were enriched, between RCC+diabetes and diabetes groups was four and between diabetes and RCC groups was eight GO pathways. Overall, we observed majority of DNA alterations in patients from RCC+diabetes group. Interestingly, insulin receptor (INSR) is highly expressed and had gains in copy number in RCC+diabetes and diabetes groups. The changes in INSR copy number may use as a biomarker for predicting RCC development in diabetic patients. PMID:25821562
Expression Profile of Long Noncoding RNAs in Human Earlobe Keloids: A Microarray Analysis
Guo, Liang; Xu, Kai; Yan, Hongbo; Feng, Haifeng
2016-01-01
Background. Long noncoding RNAs (lncRNAs) play key roles in a wide range of biological processes and their deregulation results in human disease, including keloids. Earlobe keloid is a type of pathological skin scar, and the molecular pathogenesis of this disease remains largely unknown. Methods. In this study, microarray analysis was used to determine the expression profiles of lncRNAs and mRNAs between 3 pairs of earlobe keloid and normal specimens. Gene Ontology (GO) categories and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analyses were performed to identify the main functions of the differentially expressed genes and earlobe keloid-related pathways. Results. A total of 2068 lncRNAs and 1511 mRNAs were differentially expressed between earlobe keloid and normal tissues. Among them, 1290 lncRNAs and 1092 mRNAs were upregulated, and 778 lncRNAs and 419 mRNAs were downregulated. Pathway analysis revealed that 24 pathways were correlated to the upregulated transcripts, while 11 pathways were associated with the downregulated transcripts. Conclusion. We characterized the expression profiles of lncRNA and mRNA in earlobe keloids and suggest that lncRNAs may serve as diagnostic biomarkers for the therapy of earlobe keloid. PMID:28101509
Detecting Directional Selection in the Presence of Recent Admixture in African-Americans
Lohmueller, Kirk E.; Bustamante, Carlos D.; Clark, Andrew G.
2011-01-01
We investigate the performance of tests of neutrality in admixed populations using plausible demographic models for African-American history as well as resequencing data from African and African-American populations. The analysis of both simulated and human resequencing data suggests that recent admixture does not result in an excess of false-positive results for neutrality tests based on the frequency spectrum after accounting for the population growth in the parental African population. Furthermore, when simulating positive selection, Tajima's D, Fu and Li's D, and haplotype homozygosity have lower power to detect population-specific selection using individuals sampled from the admixed population than from the nonadmixed population. Fay and Wu's H test, however, has more power to detect selection using individuals from the admixed population than from the nonadmixed population, especially when the selective sweep ended long ago. Our results have implications for interpreting recent genome-wide scans for positive selection in human populations. PMID:21196524
Transcriptional landscape of the prenatal human brain.
Miller, Jeremy A; Ding, Song-Lin; Sunkin, Susan M; Smith, Kimberly A; Ng, Lydia; Szafer, Aaron; Ebbert, Amanda; Riley, Zackery L; Royall, Joshua J; Aiona, Kaylynn; Arnold, James M; Bennet, Crissa; Bertagnolli, Darren; Brouner, Krissy; Butler, Stephanie; Caldejon, Shiella; Carey, Anita; Cuhaciyan, Christine; Dalley, Rachel A; Dee, Nick; Dolbeare, Tim A; Facer, Benjamin A C; Feng, David; Fliss, Tim P; Gee, Garrett; Goldy, Jeff; Gourley, Lindsey; Gregor, Benjamin W; Gu, Guangyu; Howard, Robert E; Jochim, Jayson M; Kuan, Chihchau L; Lau, Christopher; Lee, Chang-Kyu; Lee, Felix; Lemon, Tracy A; Lesnar, Phil; McMurray, Bergen; Mastan, Naveed; Mosqueda, Nerick; Naluai-Cecchini, Theresa; Ngo, Nhan-Kiet; Nyhus, Julie; Oldre, Aaron; Olson, Eric; Parente, Jody; Parker, Patrick D; Parry, Sheana E; Stevens, Allison; Pletikos, Mihovil; Reding, Melissa; Roll, Kate; Sandman, David; Sarreal, Melaine; Shapouri, Sheila; Shapovalova, Nadiya V; Shen, Elaine H; Sjoquist, Nathan; Slaughterbeck, Clifford R; Smith, Michael; Sodt, Andy J; Williams, Derric; Zöllei, Lilla; Fischl, Bruce; Gerstein, Mark B; Geschwind, Daniel H; Glass, Ian A; Hawrylycz, Michael J; Hevner, Robert F; Huang, Hao; Jones, Allan R; Knowles, James A; Levitt, Pat; Phillips, John W; Sestan, Nenad; Wohnoutka, Paul; Dang, Chinh; Bernard, Amy; Hohmann, John G; Lein, Ed S
2014-04-10
The anatomical and functional architecture of the human brain is mainly determined by prenatal transcriptional processes. We describe an anatomically comprehensive atlas of the mid-gestational human brain, including de novo reference atlases, in situ hybridization, ultra-high-resolution magnetic resonance imaging (MRI) and microarray analysis on highly discrete laser-microdissected brain regions. In developing cerebral cortex, transcriptional differences are found between different proliferative and post-mitotic layers, wherein laminar signatures reflect cellular composition and developmental processes. Cytoarchitectural differences between human and mouse have molecular correlates, including species differences in gene expression in subplate, although surprisingly we find minimal differences between the inner and outer subventricular zones even though the outer zone is expanded in humans. Both germinal and post-mitotic cortical layers exhibit fronto-temporal gradients, with particular enrichment in the frontal lobe. Finally, many neurodevelopmental disorder and human-evolution-related genes show patterned expression, potentially underlying unique features of human cortical formation. These data provide a rich, freely-accessible resource for understanding human brain development.
Luo, Lin; Zhou, Wen-Hua; Cai, Jiang-Jia; Feng, Mei; Zhou, Mi; Hu, Su-Pei
2017-01-01
Diabetic peripheral neuropathy (DPN) is a common complication of diabetes mellitus (DM). It is not diagnosed or managed properly in the majority of patients because its pathogenesis remains controversial. In this study, human whole genome microarrays identified 2898 and 4493 differentially expressed genes (DEGs) in DM and DPN patients, respectively. A further KEGG pathway analysis indicated that DPN and DM share four pathways, including apoptosis, B cell receptor signaling pathway, endocytosis, and Toll-like receptor signaling pathway. The DEGs identified through comparison of DPN and DM were significantly enriched in MAPK signaling pathway, NOD-like receptor signaling pathway, and neurotrophin signaling pathway, while the “neurotrophin-MAPK signaling pathway” was notably downregulated. Seven DEGs from the neurotrophin-MAPK signaling pathway were validated in additional 78 samples, and the results confirmed the initial microarray findings. These findings demonstrated that downregulation of the neurotrophin-MAPK signaling pathway may be the major mechanism of DPN pathogenesis, thus providing a potential approach for DPN treatment. PMID:28900628
Luo, Lin; Zhou, Wen-Hua; Cai, Jiang-Jia; Feng, Mei; Zhou, Mi; Hu, Su-Pei; Xu, Jin; Ji, Lin-Dan
2017-01-01
Diabetic peripheral neuropathy (DPN) is a common complication of diabetes mellitus (DM). It is not diagnosed or managed properly in the majority of patients because its pathogenesis remains controversial. In this study, human whole genome microarrays identified 2898 and 4493 differentially expressed genes (DEGs) in DM and DPN patients, respectively. A further KEGG pathway analysis indicated that DPN and DM share four pathways, including apoptosis, B cell receptor signaling pathway, endocytosis, and Toll-like receptor signaling pathway. The DEGs identified through comparison of DPN and DM were significantly enriched in MAPK signaling pathway, NOD-like receptor signaling pathway, and neurotrophin signaling pathway, while the "neurotrophin-MAPK signaling pathway" was notably downregulated. Seven DEGs from the neurotrophin-MAPK signaling pathway were validated in additional 78 samples, and the results confirmed the initial microarray findings. These findings demonstrated that downregulation of the neurotrophin-MAPK signaling pathway may be the major mechanism of DPN pathogenesis, thus providing a potential approach for DPN treatment.
Exome sequencing of a multigenerational human pedigree.
Hedges, Dale J; Hedges, Dale; Burges, Dan; Powell, Eric; Almonte, Cherylyn; Huang, Jia; Young, Stuart; Boese, Benjamin; Schmidt, Mike; Pericak-Vance, Margaret A; Martin, Eden; Zhang, Xinmin; Harkins, Timothy T; Züchner, Stephan
2009-12-14
Over the next few years, the efficient use of next-generation sequencing (NGS) in human genetics research will depend heavily upon the effective mechanisms for the selective enrichment of genomic regions of interest. Recently, comprehensive exome capture arrays have become available for targeting approximately 33 Mb or approximately 180,000 coding exons across the human genome. Selective genomic enrichment of the human exome offers an attractive option for new experimental designs aiming to quickly identify potential disease-associated genetic variants, especially in family-based studies. We have evaluated a 2.1 M feature human exome capture array on eight individuals from a three-generation family pedigree. We were able to cover up to 98% of the targeted bases at a long-read sequence read depth of > or = 3, 86% at a read depth of > or = 10, and over 50% of all targets were covered with > or = 20 reads. We identified up to 14,284 SNPs and small indels per individual exome, with up to 1,679 of these representing putative novel polymorphisms. Applying the conservative genotype calling approach HCDiff, the average rate of detection of a variant allele based on Illumina 1 M BeadChips genotypes was 95.2% at > or = 10x sequence. Further, we propose an advantageous genotype calling strategy for low covered targets that empirically determines cut-off thresholds at a given coverage depth based on existing genotype data. Application of this method was able to detect >99% of SNPs covered > or = 8x. Our results offer guidance for "real-world" applications in human genetics and provide further evidence that microarray-based exome capture is an efficient and reliable method to enrich for chromosomal regions of interest in next-generation sequencing experiments.
LeProust, Emily M.; Peck, Bill J.; Spirin, Konstantin; McCuen, Heather Brummel; Moore, Bridget; Namsaraev, Eugeni; Caruthers, Marvin H.
2010-01-01
We have achieved the ability to synthesize thousands of unique, long oligonucleotides (150mers) in fmol amounts using parallel synthesis of DNA on microarrays. The sequence accuracy of the oligonucleotides in such large-scale syntheses has been limited by the yields and side reactions of the DNA synthesis process used. While there has been significant demand for libraries of long oligos (150mer and more), the yields in conventional DNA synthesis and the associated side reactions have previously limited the availability of oligonucleotide pools to lengths <100 nt. Using novel array based depurination assays, we show that the depurination side reaction is the limiting factor for the synthesis of libraries of long oligonucleotides on Agilent Technologies’ SurePrint® DNA microarray platform. We also demonstrate how depurination can be controlled and reduced by a novel detritylation process to enable the synthesis of high quality, long (150mer) oligonucleotide libraries and we report the characterization of synthesis efficiency for such libraries. Oligonucleotide libraries prepared with this method have changed the economics and availability of several existing applications (e.g. targeted resequencing, preparation of shRNA libraries, site-directed mutagenesis), and have the potential to enable even more novel applications (e.g. high-complexity synthetic biology). PMID:20308161
Evans, Melissa L.; Hori, Tiago S.; Rise, Matthew L.; Fleming, Ian A.
2015-01-01
Captive rearing programs (hatcheries) are often used in conservation and management efforts for at-risk salmonid fish populations. However, hatcheries typically rear juveniles in environments that contrast starkly with natural conditions, which may lead to phenotypic and/or genetic changes that adversely affect the performance of juveniles upon their release to the wild. Environmental enrichment has been proposed as a mechanism to improve the efficacy of population restoration efforts from captive-rearing programs; in this study, we examine the influence of environmental enrichment during embryo and yolk-sac larval rearing on the transcriptome of Atlantic salmon (Salmo salar). Full siblings were reared in either a hatchery environment devoid of structure or an environment enriched with gravel substrate. At the end of endogenous feeding by juveniles, we examined patterns of gene transcript abundance in head tissues using the cGRASP-designed Agilent 4×44K microarray. Significance analysis of microarrays (SAM) indicated that 808 genes were differentially transcribed between the rearing environments and a total of 184 gene ontological (GO) terms were over- or under-represented in this gene list, several associated with mitosis/cell cycle and muscle and heart development. There were also pronounced differences among families in the degree of transcriptional response to rearing environment enrichment, suggesting that gene-by-environment effects, possibly related to parental origin, could influence the efficacy of enrichment interventions. PMID:25742646
Evans, Melissa L; Hori, Tiago S; Rise, Matthew L; Fleming, Ian A
2015-01-01
Captive rearing programs (hatcheries) are often used in conservation and management efforts for at-risk salmonid fish populations. However, hatcheries typically rear juveniles in environments that contrast starkly with natural conditions, which may lead to phenotypic and/or genetic changes that adversely affect the performance of juveniles upon their release to the wild. Environmental enrichment has been proposed as a mechanism to improve the efficacy of population restoration efforts from captive-rearing programs; in this study, we examine the influence of environmental enrichment during embryo and yolk-sac larval rearing on the transcriptome of Atlantic salmon (Salmo salar). Full siblings were reared in either a hatchery environment devoid of structure or an environment enriched with gravel substrate. At the end of endogenous feeding by juveniles, we examined patterns of gene transcript abundance in head tissues using the cGRASP-designed Agilent 4×44K microarray. Significance analysis of microarrays (SAM) indicated that 808 genes were differentially transcribed between the rearing environments and a total of 184 gene ontological (GO) terms were over- or under-represented in this gene list, several associated with mitosis/cell cycle and muscle and heart development. There were also pronounced differences among families in the degree of transcriptional response to rearing environment enrichment, suggesting that gene-by-environment effects, possibly related to parental origin, could influence the efficacy of enrichment interventions.
Leichty, Aaron R; Brisson, Dustin
2014-10-01
Population genomic analyses have demonstrated power to address major questions in evolutionary and molecular microbiology. Collecting populations of genomes is hindered in many microbial species by the absence of a cost effective and practical method to collect ample quantities of sufficiently pure genomic DNA for next-generation sequencing. Here we present a simple method to amplify genomes of a target microbial species present in a complex, natural sample. The selective whole genome amplification (SWGA) technique amplifies target genomes using nucleotide sequence motifs that are common in the target microbe genome, but rare in the background genomes, to prime the highly processive phi29 polymerase. SWGA thus selectively amplifies the target genome from samples in which it originally represented a minor fraction of the total DNA. The post-SWGA samples are enriched in target genomic DNA, which are ideal for population resequencing. We demonstrate the efficacy of SWGA using both laboratory-prepared mixtures of cultured microbes as well as a natural host-microbe association. Targeted amplification of Borrelia burgdorferi mixed with Escherichia coli at genome ratios of 1:2000 resulted in >10(5)-fold amplification of the target genomes with <6.7-fold amplification of the background. SWGA-treated genomic extracts from Wolbachia pipientis-infected Drosophila melanogaster resulted in up to 70% of high-throughput resequencing reads mapping to the W. pipientis genome. By contrast, 2-9% of sequencing reads were derived from W. pipientis without prior amplification. The SWGA technique results in high sequencing coverage at a fraction of the sequencing effort, thus allowing population genomic studies at affordable costs. Copyright © 2014 by the Genetics Society of America.
2011-01-01
Background Integration of genomic variation with phenotypic information is an effective approach for uncovering genotype-phenotype associations. This requires an accurate identification of the different types of variation in individual genomes. Results We report the integration of the whole genome sequence of a single Holstein Friesian bull with data from single nucleotide polymorphism (SNP) and comparative genomic hybridization (CGH) array technologies to determine a comprehensive spectrum of genomic variation. The performance of resequencing SNP detection was assessed by combining SNPs that were identified to be either in identity by descent (IBD) or in copy number variation (CNV) with results from SNP array genotyping. Coding insertions and deletions (indels) were found to be enriched for size in multiples of 3 and were located near the N- and C-termini of proteins. For larger indels, a combination of split-read and read-pair approaches proved to be complementary in finding different signatures. CNVs were identified on the basis of the depth of sequenced reads, and by using SNP and CGH arrays. Conclusions Our results provide high resolution mapping of diverse classes of genomic variation in an individual bovine genome and demonstrate that structural variation surpasses sequence variation as the main component of genomic variability. Better accuracy of SNP detection was achieved with little loss of sensitivity when algorithms that implemented mapping quality were used. IBD regions were found to be instrumental for calculating resequencing SNP accuracy, while SNP detection within CNVs tended to be less reliable. CNV discovery was affected dramatically by platform resolution and coverage biases. The combined data for this study showed that at a moderate level of sequencing coverage, an ensemble of platforms and tools can be applied together to maximize the accurate detection of sequence and structural variants. PMID:22082336
Transcriptomic Analysis and Meta-Analysis of Human Granulosa and Cumulus Cells
Burnik Papler, Tanja; Vrtacnik Bokal, Eda; Maver, Ales; Kopitar, Andreja Natasa; Lovrečić, Luca
2015-01-01
Specific gene expression in oocytes and its surrounding cumulus (CC) and granulosa (GC) cells is needed for successful folliculogenesis and oocyte maturation. The aim of the present study was to compare genome-wide gene expression and biological functions of human GC and CC. Individual GC and CC were derived from 37 women undergoing IVF procedures. Gene expression analysis was performed using microarrays, followed by a meta-analysis. Results were validated using quantitative real-time PCR. There were 6029 differentially expressed genes (q < 10−4); of which 650 genes had a log2 FC ≥ 2. After the meta-analysis there were 3156 genes differentially expressed. Among these there were genes that have previously not been reported in human somatic follicular cells, like prokineticin 2 (PROK2), higher expressed in GC, and pregnancy up-regulated nonubiquitous CaM kinase (PNCK), higher expressed in CC. Pathways like inflammatory response and angiogenesis were enriched in GC, whereas in CC, cell differentiation and multicellular organismal development were among enriched pathways. In conclusion, transcriptomes of GC and CC as well as biological functions, are distinctive for each cell subpopulation. By describing novel genes like PROK2 and PNCK, expressed in GC and CC, we upgraded the existing data on human follicular biology. PMID:26313571
Rabbit genome analysis reveals a polygenic basis for phenotypic change during domestication.
Carneiro, Miguel; Rubin, Carl-Johan; Di Palma, Federica; Albert, Frank W; Alföldi, Jessica; Martinez Barrio, Alvaro; Pielberg, Gerli; Rafati, Nima; Sayyab, Shumaila; Turner-Maier, Jason; Younis, Shady; Afonso, Sandra; Aken, Bronwen; Alves, Joel M; Barrell, Daniel; Bolet, Gerard; Boucher, Samuel; Burbano, Hernán A; Campos, Rita; Chang, Jean L; Duranthon, Veronique; Fontanesi, Luca; Garreau, Hervé; Heiman, David; Johnson, Jeremy; Mage, Rose G; Peng, Ze; Queney, Guillaume; Rogel-Gaillard, Claire; Ruffier, Magali; Searle, Steve; Villafuerte, Rafael; Xiong, Anqi; Young, Sarah; Forsberg-Nilsson, Karin; Good, Jeffrey M; Lander, Eric S; Ferrand, Nuno; Lindblad-Toh, Kerstin; Andersson, Leif
2014-08-29
The genetic changes underlying the initial steps of animal domestication are still poorly understood. We generated a high-quality reference genome for the rabbit and compared it to resequencing data from populations of wild and domestic rabbits. We identified more than 100 selective sweeps specific to domestic rabbits but only a relatively small number of fixed (or nearly fixed) single-nucleotide polymorphisms (SNPs) for derived alleles. SNPs with marked allele frequency differences between wild and domestic rabbits were enriched for conserved noncoding sites. Enrichment analyses suggest that genes affecting brain and neuronal development have often been targeted during domestication. We propose that because of a truly complex genetic background, tame behavior in rabbits and other domestic animals evolved by shifts in allele frequencies at many loci, rather than by critical changes at only a few domestication loci. Copyright © 2014, American Association for the Advancement of Science.
Karas, Vlad O; Sinnott-Armstrong, Nicholas A; Varghese, Vici; Shafer, Robert W; Greenleaf, William J; Sherlock, Gavin
2018-01-01
Abstract Much of the within species genetic variation is in the form of single nucleotide polymorphisms (SNPs), typically detected by whole genome sequencing (WGS) or microarray-based technologies. However, WGS produces mostly uninformative reads that perfectly match the reference, while microarrays require genome-specific reagents. We have developed Diff-seq, a sequencing-based mismatch detection assay for SNP discovery without the requirement for specialized nucleic-acid reagents. Diff-seq leverages the Surveyor endonuclease to cleave mismatched DNA molecules that are generated after cross-annealing of a complex pool of DNA fragments. Sequencing libraries enriched for Surveyor-cleaved molecules result in increased coverage at the variant sites. Diff-seq detected all mismatches present in an initial test substrate, with specific enrichment dependent on the identity and context of the variation. Application to viral sequences resulted in increased observation of variant alleles in a biologically relevant context. Diff-Seq has the potential to increase the sensitivity and efficiency of high-throughput sequencing in the detection of variation. PMID:29361139
Pinzani, Pamela; Mancini, Irene; Vinci, Serena; Chiari, Marcella; Orlando, Claudio; Cremonesi, Laura; Ferrari, Maurizio
2013-01-01
Molecular diagnostics of human cancers may increase accuracy in prognosis, facilitate the selection of the optimal therapeutic regimen, improve patient outcome, reduce costs of treatment and favour development of personalized approaches to patient care. Moreover sensitivity and specificity are fundamental characteristics of any diagnostic method. We developed a highly sensitive microarray for the detection of common KRAS and BRAF oncogenic mutations. In colorectal cancer, KRAS and BRAF mutations have been shown to identify a cluster of patients that does not respond to anti-EGFR therapies; the identification of these mutations is therefore clinically extremely important. To verify the technical characteristics of the microarray system for the correct identification of the KRAS mutational status at the two hotspot codons 12 and 13 and of the BRAFV600E mutation in colorectal tumor, we selected 75 samples previously characterized by conventional and CO-amplification at Lower Denaturation temperature-PCR (COLD-PCR) followed by High Resolution Melting analysis and direct sequencing. Among these samples, 60 were collected during surgery and immediately steeped in RNAlater while the 15 remainders were formalin-fixed and paraffin-embedded (FFPE) tissues. The detection limit of the proposed method was different for the 7 KRAS mutations tested and for the V600E BRAF mutation. In particular, the microarray system has been able to detect a minimum of about 0.01% of mutated alleles in a background of wild-type DNA. A blind validation displayed complete concordance of results. The excellent agreement of the results showed that the new microarray substrate is highly specific in assigning the correct genotype without any enrichment strategy. PMID:23536897
USDA-ARS?s Scientific Manuscript database
Trichinella spiralis is a parasitic roundworm that infects domestic swine, rats and humans. Ingestion of infected pork by humans can lead to the potentially fatal disease trichinellosis. The phylogeny and historical dispersal of Trichinella spp. have been studied, in part, by sequencing portions of...
Analysis of Genes Involved in Body Weight Regulation by Targeted Re-Sequencing.
Volckmar, Anna-Lena; Han, Chung Ting; Pütter, Carolin; Haas, Stefan; Vogel, Carla I G; Knoll, Nadja; Struve, Christoph; Göbel, Maria; Haas, Katharina; Herrfurth, Nikolas; Jarick, Ivonne; Grallert, Harald; Schürmann, Annette; Al-Hasani, Hadi; Hebebrand, Johannes; Sauer, Sascha; Hinney, Anke
2016-01-01
Genes involved in body weight regulation that were previously investigated in genome-wide association studies (GWAS) and in animal models were target-enriched followed by massive parallel next generation sequencing. We enriched and re-sequenced continuous genomic regions comprising FTO, MC4R, TMEM18, SDCCAG8, TKNS, MSRA and TBC1D1 in a screening sample of 196 extremely obese children and adolescents with age and sex specific body mass index (BMI) ≥ 99th percentile and 176 lean adults (BMI ≤ 15th percentile). 22 variants were confirmed by Sanger sequencing. Genotyping was performed in up to 705 independent obesity trios (extremely obese child and both parents), 243 extremely obese cases and 261 lean adults. We detected 20 different non-synonymous variants, one frame shift and one nonsense mutation in the 7 continuous genomic regions in study groups of different weight extremes. For SNP Arg695Cys (rs58983546) in TBC1D1 we detected nominal association with obesity (pTDT = 0.03 in 705 trios). Eleven of the variants were rare, thus were only detected heterozygously in up to ten individual(s) of the complete screening sample of 372 individuals. Two of them (in FTO and MSRA) were found in lean individuals, nine in extremely obese. In silico analyses of the 11 variants did not reveal functional implications for the mutations. Concordant with our hypothesis we detected a rare variant that potentially leads to loss of FTO function in a lean individual. For TBC1D1, in contrary to our hypothesis, the loss of function variant (Arg443Stop) was found in an obese individual. Functional in vitro studies are warranted.
Zhang, Qian; Gou, Wenyu; Wang, Xiaotong; Zhang, Yawen; Ma, Jun; Zhang, Hongliang; Zhang, Ying; Zhang, Hao
2016-02-23
Tibetan chicken, unlike their lowland counterparts, exhibit specific adaptations to high-altitude conditions. The genetic mechanisms of such adaptations in highland chickens were determined by resequencing the genomes of four highland (Tibetan and Lhasa White) and four lowland (White Leghorn, Lindian, and Chahua) chicken populations. Our results showed an evident genetic admixture in Tibetan chickens, suggesting a history of introgression from lowland gene pools. Genes showing positive selection in highland populations were related to cardiovascular and respiratory system development, DNA repair, response to radiation, inflammation, and immune responses, indicating a strong adaptation to oxygen scarcity and high-intensity solar radiation. The distribution of allele frequencies of nonsynonymous single nucleotide polymorphisms between highland and lowland populations was analyzed using chi-square test, which showed that several differentially distributed genes with missense mutations were enriched in several functional categories, especially in blood vessel development and adaptations to hypoxia and intense radiation. RNA sequencing revealed that several differentially expressed genes were enriched in gene ontology terms related to blood vessel and respiratory system development. Several candidate genes involved in the development of cardiorespiratory system (FGFR1, CTGF, ADAM9, JPH2, SATB1, BMP4, LOX, LPR, ANGPTL4, and HYAL1), inflammation and immune responses (AIRE, MYO1F, ZAP70, DDX60, CCL19, CD47, JSC, and FAS), DNA repair, and responses to radiation (VCP, ASH2L, and FANCG) were identified to play key roles in the adaptation to high-altitude conditions. Our data provide new insights into the unique adaptations of highland animals to extreme environments. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Genetic adaptation of the antibacterial human innate immunity network.
Casals, Ferran; Sikora, Martin; Laayouni, Hafid; Montanucci, Ludovica; Muntasell, Aura; Lazarus, Ross; Calafell, Francesc; Awadalla, Philip; Netea, Mihai G; Bertranpetit, Jaume
2011-07-11
Pathogens have represented an important selective force during the adaptation of modern human populations to changing social and other environmental conditions. The evolution of the immune system has therefore been influenced by these pressures. Genomic scans have revealed that immune system is one of the functions enriched with genes under adaptive selection. Here, we describe how the innate immune system has responded to these challenges, through the analysis of resequencing data for 132 innate immunity genes in two human populations. Results are interpreted in the context of the functional and interaction networks defined by these genes. Nucleotide diversity is lower in the adaptors and modulators functional classes, and is negatively correlated with the centrality of the proteins within the interaction network. We also produced a list of candidate genes under positive or balancing selection in each population detected by neutrality tests and showed that some functional classes are preferential targets for selection. We found evidence that the role of each gene in the network conditions the capacity to evolve or their evolvability: genes at the core of the network are more constrained, while adaptation mostly occurred at particular positions at the network edges. Interestingly, the functional classes containing most of the genes with signatures of balancing selection are involved in autoinflammatory and autoimmune diseases, suggesting a counterbalance between the beneficial and deleterious effects of the immune response.
Genetic adaptation of the antibacterial human innate immunity network
2011-01-01
Background Pathogens have represented an important selective force during the adaptation of modern human populations to changing social and other environmental conditions. The evolution of the immune system has therefore been influenced by these pressures. Genomic scans have revealed that immune system is one of the functions enriched with genes under adaptive selection. Results Here, we describe how the innate immune system has responded to these challenges, through the analysis of resequencing data for 132 innate immunity genes in two human populations. Results are interpreted in the context of the functional and interaction networks defined by these genes. Nucleotide diversity is lower in the adaptors and modulators functional classes, and is negatively correlated with the centrality of the proteins within the interaction network. We also produced a list of candidate genes under positive or balancing selection in each population detected by neutrality tests and showed that some functional classes are preferential targets for selection. Conclusions We found evidence that the role of each gene in the network conditions the capacity to evolve or their evolvability: genes at the core of the network are more constrained, while adaptation mostly occurred at particular positions at the network edges. Interestingly, the functional classes containing most of the genes with signatures of balancing selection are involved in autoinflammatory and autoimmune diseases, suggesting a counterbalance between the beneficial and deleterious effects of the immune response. PMID:21745391
Graubner, Felix R.; Gram, Aykut; Kautz, Ewa; Bauersachs, Stefan; Aslan, Selim; Agaoglu, Ali R.; Boos, Alois
2017-01-01
Abstract In the dog, there is no luteolysis in the absence of pregnancy. Thus, this species lacks any anti-luteolytic endocrine signal as found in other species that modulate uterine function during the critical period of pregnancy establishment. Nevertheless, in the dog an embryo-maternal communication must occur in order to prevent rejection of embryos. Based on this hypothesis, we performed microarray analysis of canine uterine samples collected during pre-attachment phase (days 10-12) and in corresponding non-pregnant controls, in order to elucidate the embryo attachment signal. An additional goal was to identify differences in uterine responses to pre-attachment embryos between dogs and other mammalian species exhibiting different reproductive patterns with regard to luteolysis, implantation, and preparation for placentation. Therefore, the canine microarray data were compared with gene sets from pigs, cattle, horses, and humans. We found 412 genes differentially regulated between the two experimental groups. The functional terms most strongly enriched in response to pre-attachment embryos related to extracellular matrix function and remodeling, and to immune and inflammatory responses. Several candidate genes were validated by semi-quantitative PCR. When compared with other species, best matches were found with human and equine counterparts. Especially for the pig, the majority of overlapping genes showed opposite expression patterns. Interestingly, 1926 genes did not pair with any of the other gene sets. Using a microarray approach, we report the uterine changes in the dog driven by the presence of embryos and compare these results with datasets from other mammalian species, finding common-, contrary-, and exclusively canine-regulated genes. PMID:28651344
Surface Glycosylation Profiles of Urine Extracellular Vesicles
Gerlach, Jared Q.; Krüger, Anja; Gallogly, Susan; Hanley, Shirley A.; Hogan, Marie C.; Ward, Christopher J.
2013-01-01
Urinary extracellular vesicles (uEVs) are released by cells throughout the nephron and contain biomolecules from their cells of origin. Although uEV-associated proteins and RNA have been studied in detail, little information exists regarding uEV glycosylation characteristics. Surface glycosylation profiling by flow cytometry and lectin microarray was applied to uEVs enriched from urine of healthy adults by ultracentrifugation and centrifugal filtration. The carbohydrate specificity of lectin microarray profiles was confirmed by competitive sugar inhibition and carbohydrate-specific enzyme hydrolysis. Glycosylation profiles of uEVs and purified Tamm Horsfall protein were compared. In both flow cytometry and lectin microarray assays, uEVs demonstrated surface binding, at low to moderate intensities, of a broad range of lectins whether prepared by ultracentrifugation or centrifugal filtration. In general, ultracentrifugation-prepared uEVs demonstrated higher lectin binding intensities than centrifugal filtration-prepared uEVs consistent with lesser amounts of co-purified non-vesicular proteins. The surface glycosylation profiles of uEVs showed little inter-individual variation and were distinct from those of Tamm Horsfall protein, which bound a limited number of lectins. In a pilot study, lectin microarray was used to compare uEVs from individuals with autosomal dominant polycystic kidney disease to those of age-matched controls. The lectin microarray profiles of polycystic kidney disease and healthy uEVs showed differences in binding intensity of 6/43 lectins. Our results reveal a complex surface glycosylation profile of uEVs that is accessible to lectin-based analysis following multiple uEV enrichment techniques, is distinct from co-purified Tamm Horsfall protein and may demonstrate disease-specific modifications. PMID:24069349
Application of resequencing to rice genomics, functional genomics and evolutionary analysis
2014-01-01
Rice is a model system used for crop genomics studies. The completion of the rice genome draft sequences in 2002 not only accelerated functional genome studies, but also initiated a new era of resequencing rice genomes. Based on the reference genome in rice, next-generation sequencing (NGS) using the high-throughput sequencing system can efficiently accomplish whole genome resequencing of various genetic populations and diverse germplasm resources. Resequencing technology has been effectively utilized in evolutionary analysis, rice genomics and functional genomics studies. This technique is beneficial for both bridging the knowledge gap between genotype and phenotype and facilitating molecular breeding via gene design in rice. Here, we also discuss the limitation, application and future prospects of rice resequencing. PMID:25006357
Wang, Wen; Li, Hao; Zhao, Zheng; Wang, Haoyuan; Zhang, Dong; Zhang, Yan; Lan, Qing; Wang, Jiangfei; Cao, Yong; Zhao, Jizong
2018-04-01
Abdominal aortic aneurysms (AAAs) and intracranial saccular aneurysms (IAs) are the most common types of aneurysms. This study was to investigate the common pathogenesis shared between these two kinds of aneurysms. We collected 12 IAs samples and 12 control arteries from the Beijing Tiantan Hospital and performed microarray analysis. In addition, we utilized the microarray datasets of IAs and AAAs from the Gene Expression Omnibus (GEO), in combination with our microarray results, to generate messenger RNA expression profiles for both AAAs and IAs in our study. Functional exploration and protein-protein interaction (PPI) analysis were performed. A total of 727 common genes were differentially expressed (404 was upregulated; 323 was downregulated) for both AAAs and IAs. The GO and pathway analyses showed that the common dysregulated genes were mainly enriched in vascular smooth muscle contraction, muscle contraction, immune response, defense response, cell activation, IL-6 signaling and chemokine signaling pathways, etc. The further protein-protein analysis identified 35 hub nodes, including TNF, IL6, MAPK13, and CCL5. These hub node genes were enriched in inflammatory response, positive regulation of IL-6 production, chemokine signaling pathway, and T/B cell receptor signaling pathway. Our study will gain new insight into the molecular mechanisms for the pathogenesis of both types of aneurysms and provide new therapeutic targets for the patients harboring AAAs and IAs.
Huan, Jinliang; Wang, Lishan; Xing, Li; Qin, Xianju; Feng, Lingbin; Pan, Xiaofeng; Zhu, Ling
2014-01-01
Estrogens are known to regulate the proliferation of breast cancer cells and to alter their cytoarchitectural and phenotypic properties, but the gene networks and pathways by which estrogenic hormones regulate these events are only partially understood. We used global gene expression profiling by Affymetrix GeneChip microarray analysis, with KEGG pathway enrichment, PPI network construction, module analysis and text mining methods to identify patterns and time courses of genes that are either stimulated or inhibited by estradiol (E2) in estrogen receptor (ER)-positive MCF-7 human breast cancer cells. Of the genes queried on the Affymetrix Human Genome U133 plus 2.0 microarray, we identified 628 (12h), 852 (24h) and 880 (48 h) differentially expressed genes (DEGs) that showed a robust pattern of regulation by E2. From pathway enrichment analysis, we found out the changes of metabolic pathways of E2 treated samples at each time point. At 12h time point, the changes of metabolic pathways were mainly focused on pathways in cancer, focal adhesion, and chemokine signaling pathway. At 24h time point, the changes were mainly enriched in neuroactive ligand-receptor interaction, cytokine-cytokine receptor interaction and calcium signaling pathway. At 48 h time point, the significant pathways were pathways in cancer, regulation of actin cytoskeleton, cell adhesion molecules (CAMs), axon guidance and ErbB signaling pathway. Of interest, our PPI network analysis and module analysis found that E2 treatment induced enhancement of PRSS23 at the three time points and PRSS23 was in the central position of each module. Text mining results showed that the important genes of DEGs have relationship with signal pathways, such as ERbB pathway (AREG), Wnt pathway (NDP), MAPK pathway (NTRK3, TH), IP3 pathway (TRA@) and some transcript factors (TCF4, MAF). Our studies highlight the diverse gene networks and metabolic and cell regulatory pathways through which E2 operates to achieve its widespread effects on breast cancer cells. © 2013 Elsevier B.V. All rights reserved.
Validation of MIMGO: a method to identify differentially expressed GO terms in a microarray dataset
2012-01-01
Background We previously proposed an algorithm for the identification of GO terms that commonly annotate genes whose expression is upregulated or downregulated in some microarray data compared with in other microarray data. We call these “differentially expressed GO terms” and have named the algorithm “matrix-assisted identification method of differentially expressed GO terms” (MIMGO). MIMGO can also identify microarray data in which genes annotated with a differentially expressed GO term are upregulated or downregulated. However, MIMGO has not yet been validated on a real microarray dataset using all available GO terms. Findings We combined Gene Set Enrichment Analysis (GSEA) with MIMGO to identify differentially expressed GO terms in a yeast cell cycle microarray dataset. GSEA followed by MIMGO (GSEA + MIMGO) correctly identified (p < 0.05) microarray data in which genes annotated to differentially expressed GO terms are upregulated. We found that GSEA + MIMGO was slightly less effective than, or comparable to, GSEA (Pearson), a method that uses Pearson’s correlation as a metric, at detecting true differentially expressed GO terms. However, unlike other methods including GSEA (Pearson), GSEA + MIMGO can comprehensively identify the microarray data in which genes annotated with a differentially expressed GO term are upregulated or downregulated. Conclusions MIMGO is a reliable method to identify differentially expressed GO terms comprehensively. PMID:23232071
DOE Office of Scientific and Technical Information (OSTI.GOV)
Nelson, T.A.; Holmes, S.; Alekseyenko, A.V.
Irritable bowel syndrome (IBS) is a chronic, episodic gastrointestinal disorder that is prevalent in a significant fraction of western human populations; and changes in the microbiota of the large bowel have been implicated in the pathology of the disease. Using a novel comprehensive, high-density DNA microarray (PhyloChip) we performed a phylogenetic analysis of the microbial community of the large bowel in a rat model in which intracolonic acetic acid in neonates was used to induce long lasting colonic hypersensitivity and decreased stool water content and frequency, representing the equivalent of human constipation-predominant IBS. Our results revealed a significantly increased compositionalmore » difference in the microbial communities in rats with neonatal irritation as compared with controls. Even more striking was the dramatic change in the ratio of Firmicutes relative to Bacteroidetes, where neonatally irritated rats were enriched more with Bacteroidetes and also contained a different composition of species within this phylum. Our study also revealed differences at the level of bacterial families and species. The PhyloChip is a useful and convenient method to study enteric microflora. Further, this rat model system may be a useful experimental platform to study the causes and consequences of changes in microbial community composition associated with IBS.« less
Saeed, Isaam; Wong, Stephen Q.; Mar, Victoria; Goode, David L.; Caramia, Franco; Doig, Ken; Ryland, Georgina L.; Thompson, Ella R.; Hunter, Sally M.; Halgamuge, Saman K.; Ellul, Jason; Dobrovic, Alexander; Campbell, Ian G.; Papenfuss, Anthony T.; McArthur, Grant A.; Tothill, Richard W.
2014-01-01
Targeted resequencing by massively parallel sequencing has become an effective and affordable way to survey small to large portions of the genome for genetic variation. Despite the rapid development in open source software for analysis of such data, the practical implementation of these tools through construction of sequencing analysis pipelines still remains a challenging and laborious activity, and a major hurdle for many small research and clinical laboratories. We developed TREVA (Targeted REsequencing Virtual Appliance), making pre-built pipelines immediately available as a virtual appliance. Based on virtual machine technologies, TREVA is a solution for rapid and efficient deployment of complex bioinformatics pipelines to laboratories of all sizes, enabling reproducible results. The analyses that are supported in TREVA include: somatic and germline single-nucleotide and insertion/deletion variant calling, copy number analysis, and cohort-based analyses such as pathway and significantly mutated genes analyses. TREVA is flexible and easy to use, and can be customised by Linux-based extensions if required. TREVA can also be deployed on the cloud (cloud computing), enabling instant access without investment overheads for additional hardware. TREVA is available at http://bioinformatics.petermac.org/treva/. PMID:24752294
Beres, Stephen B; Richter, Ellen W; Nagiec, Michal J; Sumby, Paul; Porcella, Stephen F; DeLeo, Frank R; Musser, James M
2006-05-02
In recent years we have studied the relationship between strain genotypes and patient phenotypes in group A Streptococcus (GAS), a model human bacterial pathogen that causes extensive morbidity and mortality worldwide. We have concentrated our efforts on serotype M3 organisms because these strains are common causes of pharyngeal and invasive infections, produce unusually severe invasive infections, and can exhibit epidemic behavior. Our studies have been hindered by the lack of genome-scale phylogenies of multiple GAS strains and whole-genome sequences of multiple serotype M3 strains recovered from individuals with defined clinical phenotypes. To remove some of these impediments, we sequenced to closure the genome of four additional GAS strains and conducted comparative genomic resequencing of 12 contemporary serotype M3 strains representing distinct genotypes and phenotypes. Serotype M3 strains are a single phylogenetic lineage. Strains from asymptomatic throat carriers were significantly less virulent for mice than sterile-site isolates and evolved to a less virulent phenotype by multiple genetic pathways. Strain persistence or extinction between epidemics was strongly associated with presence or absence, respectively, of the prophage encoding streptococcal pyrogenic exotoxin A. A serotype M3 clone significantly underrepresented among necrotizing fasciitis cases has a unique frameshift mutation that truncates MtsR, a transcriptional regulator controlling expression of genes encoding iron-acquisition proteins. Expression microarray analysis of this clone confirmed significant alteration in expression of genes encoding iron metabolism proteins. Our analysis provided unprecedented detail about the molecular anatomy of bacterial strain genotype-patient phenotype relationships.
Detecting directional selection in the presence of recent admixture in African-Americans.
Lohmueller, Kirk E; Bustamante, Carlos D; Clark, Andrew G
2011-03-01
We investigate the performance of tests of neutrality in admixed populations using plausible demographic models for African-American history as well as resequencing data from African and African-American populations. The analysis of both simulated and human resequencing data suggests that recent admixture does not result in an excess of false-positive results for neutrality tests based on the frequency spectrum after accounting for the population growth in the parental African population. Furthermore, when simulating positive selection, Tajima's D, Fu and Li's D, and haplotype homozygosity have lower power to detect population-specific selection using individuals sampled from the admixed population than from the nonadmixed population. Fay and Wu's H test, however, has more power to detect selection using individuals from the admixed population than from the nonadmixed population, especially when the selective sweep ended long ago. Our results have implications for interpreting recent genome-wide scans for positive selection in human populations. © 2011 by the Genetics Society of America
Crellen, Thomas; Allan, Fiona; David, Sophia; Durrant, Caroline; Huckvale, Thomas; Holroyd, Nancy; Emery, Aidan M; Rollinson, David; Aanensen, David M; Berriman, Matthew; Webster, Joanne P; Cotton, James A
2016-02-16
Schistosoma mansoni is a parasitic fluke that infects millions of people in the developing world. This study presents the first application of population genomics to S. mansoni based on high-coverage resequencing data from 10 global isolates and an isolate of the closely-related Schistosoma rodhaini, which infects rodents. Using population genetic tests, we document genes under directional and balancing selection in S. mansoni that may facilitate adaptation to the human host. Coalescence modeling reveals the speciation of S. mansoni and S. rodhaini as 107.5-147.6KYA, a period which overlaps with the earliest archaeological evidence for fishing in Africa. Our results indicate that S. mansoni originated in East Africa and experienced a decline in effective population size 20-90KYA, before dispersing across the continent during the Holocene. In addition, we find strong evidence that S. mansoni migrated to the New World with the 16-19th Century Atlantic Slave Trade.
Vlismas, Antonis; Bletsa, Ritsa; Mavrogianni, Despina; Mamali, Georgina; Pergamali, Maria; Dinopoulou, Vasiliki; Partsinevelos, George; Drakakis, Peter; Loutradis, Dimitris
2016-01-01
Previous microarray analyses of RNAs from 8-cell (8C) human embryos revealed a lack of cell cycle checkpoints and overexpression of core circadian oscillators and cell cycle drivers relative to pluripotent human stem cells [human embryonic stem cells/induced pluripotent stem (hES/iPS)] and fibroblasts, suggesting growth factor independence during early cleavage stages. To explore this possibility, we queried our combined microarray database for expression of 487 growth factors and receptors. Fifty-one gene elements were overdetected on the 8C arrays relative to hES/iPS cells, including 14 detected at least 80-fold higher, which annotated to multiple pathways: six cytokine family (CSF1R, IL2RG, IL3RA, IL4, IL17B, IL23R), four transforming growth factor beta (TGFB) family (BMP6, BMP15, GDF9, ENG), one fibroblast growth factor (FGF) family [FGF14(FH4)], one epidermal growth factor member (GAB1), plus CD36, and CLEC10A. 8C-specific gene elements were enriched (73%) for reported circadian-controlled genes in mouse tissues. High-level detection of CSF1R, ENG, IL23R, and IL3RA specifically on the 8C arrays suggests the embryo plays an active role in blocking immune rejection and is poised for trophectoderm development; robust detection of NRG1, GAB1, -2, GRB7, and FGF14(FHF4) indicates novel roles in early development in addition to their known roles in later development. Forty-four gene elements were underdetected on the 8C arrays, including 11 at least 80-fold under the pluripotent cells: two cytokines (IFITM1, TNFRSF8), five TGFBs (BMP7, LEFTY1, LEFTY2, TDGF1, TDGF3), two FGFs (FGF2, FGF receptor 1), plus ING5, and WNT6. The microarray detection patterns suggest that hES/iPS cells exhibit suppressed circadian competence, underexpression of early differentiation markers, and more robust expression of generic pluripotency genes, in keeping with an artificial state of continual uncommitted cell division. In contrast, gene expression patterns of the 8C embryo suggest that it is an independent circadian rhythm-competent equivalence group poised to signal its environment, defend against maternal immune rejection, and begin the rapid commitment events of early embryogenesis. PMID:26493868
Gao, Liyang; Chen, Bing; Li, Jinhong; Yang, Fan; Cen, Xuecheng; Liao, Zhuangbing; Long, Xiao’ao
2017-01-01
The Wnt signaling pathway is necessary for the development of the central nervous system and is associated with tumorigenesis in various cancers. However, the mechanism of the Wnt signaling pathway in glioma cells has yet to be elucidated. Small-molecule Wnt modulators such as ICG-001 and AZD2858 were used to inhibit and stimulate the Wnt/β-catenin signaling pathway. Techniques including cell proliferation assay, colony formation assay, Matrigel cell invasion assay, cell cycle assay and Genechip microarray were used. Gene Ontology Enrichment Analysis and Gene Set Enrichment Analysis have enriched many biological processes and signaling pathways. Both the inhibiting and stimulating Wnt/β-catenin signaling pathways could influence the cell cycle, moreover, reduce the proliferation and survival of U87 glioma cells. However, Affymetrix expression microarray indicated that biological processes and networks of signaling pathways between stimulating and inhibiting the Wnt/β-catenin signaling pathway largely differ. We propose that Wnt/β-catenin signaling pathway might prove to be a valuable therapeutic target for glioma. PMID:28837560
Xu, Zhenbo; Xie, Jinhong; Liu, Junyan; Ji, Lili; Soteyome, Thanapop; Peters, Brian M; Chen, Dingqiang; Li, Bing; Li, Lin; Shirtliff, Mark E
2017-03-01
Bacillus cereus is one of the most common opportunistic pathogens responsible for various foodborn diseases. To investigate the regulatory mechanism of B. cereus under high osmotic pressure, two B. cereus strains B25 and B26 were isolated from the industrial soy sauce residue containing high-salt concentration. Resequencing was performed by Illumina/Solexa platform and 13,646 SNPs and 434 InDels were identified as common variants between B25 and B26 against reference genome, followed by COG, GO, and KEGG enrichment analysis. Furthermore, 49 key genes involving in Na + /H + ,K + transporter, dipeptide or tripeptide transporter, stress response were selected and classified into 27 groups. Further validation was performed by qRT-PCR, and 4 candidate genes were found most associated with osmotic response. Gene expression of the 4 candidate genes was then analyzed accordingly, and down regulation was obtained for gene BC0669 and BC0754 associated with K + transport system. However, dramatic up regulation was detected for gene BC2114 involving in glutathione peroxidase, indicating the activation of antioxidant responses by osmotic stress via genetic regulation. As concluded, bioinformatic analysis and gene expression profile represented the basis of further investigation on the genetic and regulatory mechanism of bacterial salt tolerance. Copyright © 2017 Elsevier Ltd. All rights reserved.
Wang, M; Wang, X C; Zhao, L; Zhang, Y; Yao, L L; Lin, Y; Peng, Y D; Hu, R M
2014-06-17
Impaired insulin action within skeletal muscle, adipose tissue, and the liver is an important characteristic of type 2 diabetes (T2D). In order to identify common underlying defects in insulin-sensitive tissues that may be involved in the pathogenesis of T2D, the gene expression profiles of skeletal muscle, visceral adipose tissue, and liver from autopsy donors with or without T2D were examined using oligonucleotide microarrays and quantitative reverse transcriptase-PCR. Compared with controls, 691 genes were commonly dysregulated in these three insulin-sensitive tissues of humans with T2D. These co-expressed genes were enriched within the mitochondrion, with suggested involvement in energy metabolic processes such as glycolysis and gluconeogenesis, fatty acid beta oxidative, tricarboxylic acid cycle, and electron transport. Genes related to energy metabolism were mostly downregulated in diabetic skeletal muscle and visceral adipose tissue, while they were upregulated in the diabetic liver. This observed dysregulation in energy-related metabolism may be the underlying factor leading to the molecular mechanisms responsible for the insulin resistance of patients with T2D.
Identification of the Key Genes and Pathways in Esophageal Carcinoma.
Su, Peng; Wen, Shiwang; Zhang, Yuefeng; Li, Yong; Xu, Yanzhao; Zhu, Yonggang; Lv, Huilai; Zhang, Fan; Wang, Mingbo; Tian, Ziqiang
2016-01-01
Objective . Esophageal carcinoma (EC) is a frequently common malignancy of gastrointestinal cancer in the world. This study aims to screen key genes and pathways in EC and elucidate the mechanism of it. Methods . 5 microarray datasets of EC were downloaded from Gene Expression Omnibus. Differentially expressed genes (DEGs) were screened by bioinformatics analysis. Gene Ontology (GO) enrichment, Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment, and protein-protein interaction (PPI) network construction were performed to obtain the biological roles of DEGs in EC. Quantitative real-time polymerase chain reaction (qRT-PCR) was used to verify the expression level of DEGs in EC. Results . A total of 1955 genes were filtered as DEGs in EC. The upregulated genes were significantly enriched in cell cycle and the downregulated genes significantly enriched in Endocytosis. PPI network displayed CDK4 and CCT3 were hub proteins in the network. The expression level of 8 dysregulated DEGs including CDK4, CCT3, THSD4, SIM2, MYBL2, CENPF, CDCA3, and CDKN3 was validated in EC compared to adjacent nontumor tissues and the results were matched with the microarray analysis. Conclusion . The significantly DEGs including CDK4, CCT3, THSD4, and SIM2 may play key roles in tumorigenesis and development of EC involved in cell cycle and Endocytosis.
CNV discovery for milk composition traits in dairy cattle using whole genome resequencing.
Gao, Yahui; Jiang, Jianping; Yang, Shaohua; Hou, Yali; Liu, George E; Zhang, Shengli; Zhang, Qin; Sun, Dongxiao
2017-03-29
Copy number variations (CNVs) are important and widely distributed in the genome. CNV detection opens a new avenue for exploring genes associated with complex traits in humans, animals and plants. Herein, we present a genome-wide assessment of CNVs that are potentially associated with milk composition traits in dairy cattle. In this study, CNVs were detected based on whole genome re-sequencing data of eight Holstein bulls from four half- and/or full-sib families, with extremely high and low estimated breeding values (EBVs) of milk protein percentage and fat percentage. The range of coverage depth per individual was 8.2-11.9×. Using CNVnator, we identified a total of 14,821 CNVs, including 5025 duplications and 9796 deletions. Among them, 487 differential CNV regions (CNVRs) comprising ~8.23 Mb of the cattle genome were observed between the high and low groups. Annotation of these differential CNVRs were performed based on the cattle genome reference assembly (UMD3.1) and totally 235 functional genes were found within the CNVRs. By Gene Ontology and KEGG pathway analyses, we found that genes were significantly enriched for specific biological functions related to protein and lipid metabolism, insulin/IGF pathway-protein kinase B signaling cascade, prolactin signaling pathway and AMPK signaling pathways. These genes included INS, IGF2, FOXO3, TH, SCD5, GALNT18, GALNT16, ART3, SNCA and WNT7A, implying their potential association with milk protein and fat traits. In addition, 95 CNVRs were overlapped with 75 known QTLs that are associated with milk protein and fat traits of dairy cattle (Cattle QTLdb). In conclusion, based on NGS of 8 Holstein bulls with extremely high and low EBVs for milk PP and FP, we identified a total of 14,821 CNVs, 487 differential CNVRs between groups, and 10 genes, which were suggested as promising candidate genes for milk protein and fat traits.
2013-01-01
Background The Grooved Carpet shell clam Ruditapes decussatus is the autochthonous European clam and the most appreciated from a gastronomic and economic point of view. The production is in decline due to several factors such as Perkinsiosis and habitat invasion and competition by the introduced exotic species, the manila clam Ruditapes philippinarum. After we sequenced R. decussatus transcriptome we have designed an oligo microarray capable of contributing to provide some clues on molecular response of the clam to Perkinsiosis. Results A database consisting of 41,119 unique transcripts was constructed, of which 12,479 (30.3%) were annotated by similarity. An oligo-DNA microarray platform was then designed and applied to profile gene expression in R. decussatus heavily infected by Perkinsus olseni. Functional annotation of differentially expressed genes between those two conditionswas performed by gene set enrichment analysis. As expected, microarrays unveil genes related with stress/infectious agents such as hydrolases, proteases and others. The extensive role of innate immune system was also analyzed and effect of parasitosis upon expression of important molecules such as lectins reviewed. Conclusions This study represents a first attempt to characterize Ruditapes decussatus transcriptome, an important marine resource for the European aquaculture. The trancriptome sequencing and consequent annotation will increase the available tools and resources for this specie, introducing the possibility of high throughput experiments such as microarrays analysis. In this specific case microarray approach was used to unveil some important aspects of host-parasite interaction between the Carpet shell clam and Perkinsus, two non-model species, highlighting some genes associated with this interaction. Ample information was obtained to identify biological processes significantly enriched among differentially expressed genes in Perkinsus infected versus non-infected gills. An overview on the genes related with the immune system on R. decussatus transcriptome is also reported. PMID:24168212
Leite, Ricardo B; Milan, Massimo; Coppe, Alessandro; Bortoluzzi, Stefania; dos Anjos, António; Reinhardt, Richard; Saavedra, Carlos; Patarnello, Tomaso; Cancela, M Leonor; Bargelloni, Luca
2013-10-29
The Grooved Carpet shell clam Ruditapes decussatus is the autochthonous European clam and the most appreciated from a gastronomic and economic point of view. The production is in decline due to several factors such as Perkinsiosis and habitat invasion and competition by the introduced exotic species, the manila clam Ruditapes philippinarum. After we sequenced R. decussatus transcriptome we have designed an oligo microarray capable of contributing to provide some clues on molecular response of the clam to Perkinsiosis. A database consisting of 41,119 unique transcripts was constructed, of which 12,479 (30.3%) were annotated by similarity. An oligo-DNA microarray platform was then designed and applied to profile gene expression in R. decussatus heavily infected by Perkinsus olseni. Functional annotation of differentially expressed genes between those two conditionswas performed by gene set enrichment analysis. As expected, microarrays unveil genes related with stress/infectious agents such as hydrolases, proteases and others. The extensive role of innate immune system was also analyzed and effect of parasitosis upon expression of important molecules such as lectins reviewed. This study represents a first attempt to characterize Ruditapes decussatus transcriptome, an important marine resource for the European aquaculture. The trancriptome sequencing and consequent annotation will increase the available tools and resources for this specie, introducing the possibility of high throughput experiments such as microarrays analysis. In this specific case microarray approach was used to unveil some important aspects of host-parasite interaction between the Carpet shell clam and Perkinsus, two non-model species, highlighting some genes associated with this interaction. Ample information was obtained to identify biological processes significantly enriched among differentially expressed genes in Perkinsus infected versus non-infected gills. An overview on the genes related with the immune system on R. decussatus transcriptome is also reported.
Wolff, Alexander; Bayerlová, Michaela; Gaedcke, Jochen; Kube, Dieter; Beißbarth, Tim
2018-01-01
Pipeline comparisons for gene expression data are highly valuable for applied real data analyses, as they enable the selection of suitable analysis strategies for the dataset at hand. Such pipelines for RNA-Seq data should include mapping of reads, counting and differential gene expression analysis or preprocessing, normalization and differential gene expression in case of microarray analysis, in order to give a global insight into pipeline performances. Four commonly used RNA-Seq pipelines (STAR/HTSeq-Count/edgeR, STAR/RSEM/edgeR, Sailfish/edgeR, TopHat2/Cufflinks/CuffDiff)) were investigated on multiple levels (alignment and counting) and cross-compared with the microarray counterpart on the level of gene expression and gene ontology enrichment. For these comparisons we generated two matched microarray and RNA-Seq datasets: Burkitt Lymphoma cell line data and rectal cancer patient data. The overall mapping rate of STAR was 98.98% for the cell line dataset and 98.49% for the patient dataset. Tophat's overall mapping rate was 97.02% and 96.73%, respectively, while Sailfish had only an overall mapping rate of 84.81% and 54.44%. The correlation of gene expression in microarray and RNA-Seq data was moderately worse for the patient dataset (ρ = 0.67-0.69) than for the cell line dataset (ρ = 0.87-0.88). An exception were the correlation results of Cufflinks, which were substantially lower (ρ = 0.21-0.29 and 0.34-0.53). For both datasets we identified very low numbers of differentially expressed genes using the microarray platform. For RNA-Seq we checked the agreement of differentially expressed genes identified in the different pipelines and of GO-term enrichment results. In conclusion the combination of STAR aligner with HTSeq-Count followed by STAR aligner with RSEM and Sailfish generated differentially expressed genes best suited for the dataset at hand and in agreement with most of the other transcriptomics pipelines.
Gopalakrishnan, Kalpana; Teitelbaum, Susan L; Lambertini, Luca; Wetmur, James; Manservisi, Fabiana; Falcioni, Laura; Panzacchi, Simona; Belpoggi, Fiorella; Chen, Jia
2017-01-01
Exposure to environmental chemicals has been linked to altered mammary development and cancer risk at high doses using animal models. Effects at low doses comparable to human exposure remain poorly understood, especially during critical developmental windows. We investigated the effects of two environmental phenols commonly used in personal care products - methyl paraben (MPB) and triclosan (TCS) - on the histology and transcriptome of normal mammary glands at low doses mimicking human exposure during critical windows of development. Sprague-Dawley rats were exposed during perinatal, prepubertal and pubertal windows, as well as from birth to lactation. Low-dose exposure to MPB and TCS induced measurable changes in both mammary histology (by Masson's Trichrome Stain) and transcriptome (by microarrays) in a window-specific fashion. Puberty represented a window of heightened sensitivity to MPB, with increased glandular tissue and changes of expression in 295 genes with significant enrichment in functions such as DNA replication and cell cycle regulation. Long-term exposure to TCS from birth to lactation was associated with increased adipose and reduced glandular and secretory tissue, with expression alterations in 993 genes enriched in pathways such as cholesterol synthesis and adipogenesis. Finally, enrichment analyses revealed that genes modified by MPB and TCS were over-represented in human breast cancer gene signatures, suggesting possible links with breast carcinogenesis. These findings highlight the issues of critical windows of susceptibility that may confer heightened sensitivity to environmental insults and implicate the potential health effects of these ubiquitous environmental chemicals in breast cancer. Copyright © 2016 Elsevier Inc. All rights reserved.
Gopalakrishnan, Kalpana; Teitelbaum, Susan L.; Lambertini, Luca; Wetmur, James; Manservisi, Fabiana; Falcioni, Laura; Panzacchi, Simona; Belpoggi, Fiorella; Chen, Jia
2016-01-01
Exposure to environmental chemicals has been linked to altered mammary development and cancer risk at high doses using animal models. Effects at low doses comparable to human exposure remain poorly understood, especially during critical developmental windows. We investigated the effects of two environmental phenols commonly used in personal care products – methyl paraben (MPB) and triclosan (TCS) – on the histology and transcriptome of normal mammary glands at low doses mimicking human exposure during critical windows of development. Sprague-Dawley rats were exposed during perinatal, prepubertal and pubertal windows, as well as from birth to lactation. Low-dose exposure to MPB and TCS induced measurable changes in both mammary histology (by Masson’s Trichrome Stain) and transcriptome (by microarrays) in a window-specific fashion. Puberty represented a window of heightened sensitivity to MPB, with increased glandular tissue and changes of expression in 295 genes with significant enrichment in functions such as DNA replication and cell cycle regulation. Long-term exposure to TCS from birth to lactation was associated with increased adipose and reduced glandular and secretory tissue, with expression alterations in 993 genes enriched in pathways such as cholesterol synthesis and adipogenesis. Finally, enrichment analyses revealed that genes modified by MPB and TCS were over-represented in human breast cancer gene signatures, suggesting possible links with breast carcinogenesis. These findings highlight the issues of critical windows of susceptibility that may confer heightened sensitivity to environmental insults and implicate the potential health effects of these ubiquitous environmental chemicals in breast cancer. PMID:27810681
Graubner, Felix R; Gram, Aykut; Kautz, Ewa; Bauersachs, Stefan; Aslan, Selim; Agaoglu, Ali R; Boos, Alois; Kowalewski, Mariusz P
2017-08-01
In the dog, there is no luteolysis in the absence of pregnancy. Thus, this species lacks any anti-luteolytic endocrine signal as found in other species that modulate uterine function during the critical period of pregnancy establishment. Nevertheless, in the dog an embryo-maternal communication must occur in order to prevent rejection of embryos. Based on this hypothesis, we performed microarray analysis of canine uterine samples collected during pre-attachment phase (days 10-12) and in corresponding non-pregnant controls, in order to elucidate the embryo attachment signal. An additional goal was to identify differences in uterine responses to pre-attachment embryos between dogs and other mammalian species exhibiting different reproductive patterns with regard to luteolysis, implantation, and preparation for placentation. Therefore, the canine microarray data were compared with gene sets from pigs, cattle, horses, and humans. We found 412 genes differentially regulated between the two experimental groups. The functional terms most strongly enriched in response to pre-attachment embryos related to extracellular matrix function and remodeling, and to immune and inflammatory responses. Several candidate genes were validated by semi-quantitative PCR. When compared with other species, best matches were found with human and equine counterparts. Especially for the pig, the majority of overlapping genes showed opposite expression patterns. Interestingly, 1926 genes did not pair with any of the other gene sets. Using a microarray approach, we report the uterine changes in the dog driven by the presence of embryos and compare these results with datasets from other mammalian species, finding common-, contrary-, and exclusively canine-regulated genes. © The Authors 2017. Published by Oxford University Press on behalf of Society for the Study of Reproduction.
Smith, Adam C.; Suzuki, Masako; Thompson, Reid; Choufani, Sanaa; Higgins, Michael J.; Chiu, Idy W.; Squire, Jeremy A.; Greally, John M.; Weksberg, Rosanna
2015-01-01
Beckwith-Wiedemann syndrome (BWS) is an overgrowth syndrome associated with genetic or epigenetic alterations in one of two imprinted domains on chromosome 11p15.5. Rarely, chromosomal translocations or inversions of chromosome 11p15.5 are associated with BWS but the molecular pathophysiology in such cases is not understood. In our series of 3 translocation and 2 inversion patients with BWS, the chromosome 11p15.5 breakpoints map within the centromeric imprinted domain, 2. We hypothesized that either microdeletions/microduplications adjacent to the breakpoints could disrupt genomic sequences important for imprinted gene regulation. An alternate hypothesis was that epigenetic alterations of as yet unknown regulatory DNA sequences, result in the BWS phenotype. A high resolution Nimblegen custom microarray was designed representing all non-repetitive sequences in the telomeric 33 MB of the short arm of human chromosome 11. For the BWS-associated chromosome 11p15.5 translocations and inversions, we found no evidence of microdeletions/microduplications. DNA methylation was also tested on this microarray using the HpaII tiny fragment enrichment by ligation-mediated PCR (HELP) assay. This high-resolution DNA methylation microarray analysis revealed a gain of DNA methylation in the translocation/inversion patients affecting the p-ter segment of chromosome 11p15, including both imprinted domains. BWS patients that inherited a maternal translocation or inversion also demonstrated reduced expression of the growth suppressing imprinted gene, CDKN1C in Domain 2. In summary, our data demonstrate that translocations and inversions involving imprinted domain 2 on chromosome 11p15.5, alter regional DNA methylation patterns and imprinted gene expression in cis, suggesting that these epigenetic alterations are generated by an alteration in “chromatin context”. PMID:22079941
Cross-species transcriptomic approach reveals genes in hamster implantation sites.
Lei, Wei; Herington, Jennifer; Galindo, Cristi L; Ding, Tianbing; Brown, Naoko; Reese, Jeff; Paria, Bibhash C
2014-12-01
The mouse model has greatly contributed to understanding molecular mechanisms involved in the regulation of progesterone (P4) plus estrogen (E)-dependent blastocyst implantation process. However, little is known about contributory molecular mechanisms of the P4-only-dependent blastocyst implantation process that occurs in species such as hamsters, guineapigs, rabbits, pigs, rhesus monkeys, and perhaps humans. We used the hamster as a model of P4-only-dependent blastocyst implantation and carried out cross-species microarray (CSM) analyses to reveal differentially expressed genes at the blastocyst implantation site (BIS), in order to advance the understanding of molecular mechanisms of implantation. Upregulation of 112 genes and downregulation of 77 genes at the BIS were identified using a mouse microarray platform, while use of the human microarray revealed 62 up- and 38 down-regulated genes at the BIS. Excitingly, a sizable number of genes (30 up- and 11 down-regulated genes) were identified as a shared pool by both CSMs. Real-time RT-PCR and in situ hybridization validated the expression patterns of several up- and down-regulated genes identified by both CSMs at the hamster and mouse BIS to demonstrate the merit of CSM findings across species, in addition to revealing genes specific to hamsters. Functional annotation analysis found that genes involved in the spliceosome, proteasome, and ubiquination pathways are enriched at the hamster BIS, while genes associated with tight junction, SAPK/JNK signaling, and PPARα/RXRα signalings are repressed at the BIS. Overall, this study provides a pool of genes and evidence of their participation in up- and down-regulated cellular functions/pathways at the hamster BIS. © 2014 Society for Reproduction and Fertility.
Feng, Yinling; Wang, Xuefeng
2017-03-01
In order to investigate commonly disturbed genes and pathways in various brain regions of patients with Parkinson's disease (PD), microarray datasets from previous studies were collected and systematically analyzed. Different normalization methods were applied to microarray datasets from different platforms. A strategy combining gene co‑expression networks and clinical information was adopted, using weighted gene co‑expression network analysis (WGCNA) to screen for commonly disturbed genes in different brain regions of patients with PD. Functional enrichment analysis of commonly disturbed genes was performed using the Database for Annotation, Visualization, and Integrated Discovery (DAVID). Co‑pathway relationships were identified with Pearson's correlation coefficient tests and a hypergeometric distribution‑based test. Common genes in pathway pairs were selected out and regarded as risk genes. A total of 17 microarray datasets from 7 platforms were retained for further analysis. Five gene coexpression modules were identified, containing 9,745, 736, 233, 101 and 93 genes, respectively. One module was significantly correlated with PD samples and thus the 736 genes it contained were considered to be candidate PD‑associated genes. Functional enrichment analysis demonstrated that these genes were implicated in oxidative phosphorylation and PD. A total of 44 pathway pairs and 52 risk genes were revealed, and a risk gene pathway relationship network was constructed. Eight modules were identified and were revealed to be associated with PD, cancers and metabolism. A number of disturbed pathways and risk genes were unveiled in PD, and these findings may help advance understanding of PD pathogenesis.
Salojärvi, Jarkko; Smolander, Olli-Pekka; Nieminen, Kaisa; Rajaraman, Sitaram; Safronov, Omid; Safdari, Pezhman; Lamminmäki, Airi; Immanen, Juha; Lan, Tianying; Tanskanen, Jaakko; Rastas, Pasi; Amiryousefi, Ali; Jayaprakash, Balamuralikrishna; Kammonen, Juhana I; Hagqvist, Risto; Eswaran, Gugan; Ahonen, Viivi Helena; Serra, Juan Alonso; Asiegbu, Fred O; de Dios Barajas-Lopez, Juan; Blande, Daniel; Blokhina, Olga; Blomster, Tiina; Broholm, Suvi; Brosché, Mikael; Cui, Fuqiang; Dardick, Chris; Ehonen, Sanna E; Elomaa, Paula; Escamez, Sacha; Fagerstedt, Kurt V; Fujii, Hiroaki; Gauthier, Adrien; Gollan, Peter J; Halimaa, Pauliina; Heino, Pekka I; Himanen, Kristiina; Hollender, Courtney; Kangasjärvi, Saijaliisa; Kauppinen, Leila; Kelleher, Colin T; Kontunen-Soppela, Sari; Koskinen, J Patrik; Kovalchuk, Andriy; Kärenlampi, Sirpa O; Kärkönen, Anna K; Lim, Kean-Jin; Leppälä, Johanna; Macpherson, Lee; Mikola, Juha; Mouhu, Katriina; Mähönen, Ari Pekka; Niinemets, Ülo; Oksanen, Elina; Overmyer, Kirk; Palva, E Tapio; Pazouki, Leila; Pennanen, Ville; Puhakainen, Tuula; Poczai, Péter; Possen, Boy J H M; Punkkinen, Matleena; Rahikainen, Moona M; Rousi, Matti; Ruonala, Raili; van der Schoot, Christiaan; Shapiguzov, Alexey; Sierla, Maija; Sipilä, Timo P; Sutela, Suvi; Teeri, Teemu H; Tervahauta, Arja I; Vaattovaara, Aleksia; Vahala, Jorma; Vetchinnikova, Lidia; Welling, Annikki; Wrzaczek, Michael; Xu, Enjun; Paulin, Lars G; Schulman, Alan H; Lascoux, Martin; Albert, Victor A; Auvinen, Petri; Helariutta, Ykä; Kangasjärvi, Jaakko
2017-06-01
Silver birch (Betula pendula) is a pioneer boreal tree that can be induced to flower within 1 year. Its rapid life cycle, small (440-Mb) genome, and advanced germplasm resources make birch an attractive model for forest biotechnology. We assembled and chromosomally anchored the nuclear genome of an inbred B. pendula individual. Gene duplicates from the paleohexaploid event were enriched for transcriptional regulation, whereas tandem duplicates were overrepresented by environmental responses. Population resequencing of 80 individuals showed effective population size crashes at major points of climatic upheaval. Selective sweeps were enriched among polyploid duplicates encoding key developmental and physiological triggering functions, suggesting that local adaptation has tuned the timing of and cross-talk between fundamental plant processes. Variation around the tightly-linked light response genes PHYC and FRS10 correlated with latitude and longitude and temperature, and with precipitation for PHYC. Similar associations characterized the growth-promoting cytokinin response regulator ARR1, and the wood development genes KAK and MED5A.
USDA-ARS?s Scientific Manuscript database
We describe a suite of software tools for identifying possible functional changes in gene structure that may result from sequence variants. ACE (“Assessing Changes to Exons”) converts phased genotype calls to a collection of explicit haplotype sequences, maps transcript annotations onto them, detect...
CoVaCS: a consensus variant calling system.
Chiara, Matteo; Gioiosa, Silvia; Chillemi, Giovanni; D'Antonio, Mattia; Flati, Tiziano; Picardi, Ernesto; Zambelli, Federico; Horner, David Stephen; Pesole, Graziano; Castrignanò, Tiziana
2018-02-05
The advent and ongoing development of next generation sequencing technologies (NGS) has led to a rapid increase in the rate of human genome re-sequencing data, paving the way for personalized genomics and precision medicine. The body of genome resequencing data is progressively increasing underlining the need for accurate and time-effective bioinformatics systems for genotyping - a crucial prerequisite for identification of candidate causal mutations in diagnostic screens. Here we present CoVaCS, a fully automated, highly accurate system with a web based graphical interface for genotyping and variant annotation. Extensive tests on a gold standard benchmark data-set -the NA12878 Illumina platinum genome- confirm that call-sets based on our consensus strategy are completely in line with those attained by similar command line based approaches, and far more accurate than call-sets from any individual tool. Importantly our system exhibits better sensitivity and higher specificity than equivalent commercial software. CoVaCS offers optimized pipelines integrating state of the art tools for variant calling and annotation for whole genome sequencing (WGS), whole-exome sequencing (WES) and target-gene sequencing (TGS) data. The system is currently hosted at Cineca, and offers the speed of a HPC computing facility, a crucial consideration when large numbers of samples must be analysed. Importantly, all the analyses are performed automatically allowing high reproducibility of the results. As such, we believe that CoVaCS can be a valuable tool for the analysis of human genome resequencing studies. CoVaCS is available at: https://bioinformatics.cineca.it/covacs .
Naiser, Thomas; Ehler, Oliver; Kayser, Jona; Mai, Timo; Michel, Wolfgang; Ott, Albrecht
2008-01-01
Background The high binding specificity of short 10 to 30 mer oligonucleotide probes enables single base mismatch (MM) discrimination and thus provides the basis for genotyping and resequencing microarray applications. Recent experiments indicate that the underlying principles governing DNA microarray hybridization – and in particular MM discrimination – are not completely understood. Microarrays usually address complex mixtures of DNA targets. In order to reduce the level of complexity and to study the problem of surface-based hybridization with point defects in more detail, we performed array based hybridization experiments in well controlled and simple situations. Results We performed microarray hybridization experiments with short 16 to 40 mer target and probe lengths (in situations without competitive hybridization) in order to systematically investigate the impact of point-mutations – varying defect type and position – on the oligonucleotide duplex binding affinity. The influence of single base bulges and single base MMs depends predominantly on position – it is largest in the middle of the strand. The position-dependent influence of base bulges is very similar to that of single base MMs, however certain bulges give rise to an unexpectedly high binding affinity. Besides the defect (MM or bulge) type, which is the second contribution in importance to hybridization affinity, there is also a sequence dependence, which extends beyond the defect next-neighbor and which is difficult to quantify. Direct comparison between binding affinities of DNA/DNA and RNA/DNA duplexes shows, that RNA/DNA purine-purine MMs are more discriminating than corresponding DNA/DNA MMs. In DNA/DNA MM discrimination the affected base pair (C·G vs. A·T) is the pertinent parameter. We attribute these differences to the different structures of the duplexes (A vs. B form). Conclusion We have shown that DNA microarrays can resolve even subtle changes in hybridization affinity for simple target mixtures. We have further shown that the impact of point defects on oligonucleotide stability can be broken down to a hierarchy of effects. In order to explain our observations we propose DNA molecular dynamics – in form of zipping of the oligonucleotide duplex – to play an important role. PMID:18477387
Hwang, Sun-Goo; Kim, Dong Sub; Hwang, Jung Eun; Han, A-Reum; Jang, Cheol Seong
2014-05-15
In order to better understand the biological systems that are affected in response to cosmic ray (CR), we conducted weighted gene co-expression network analysis using the module detection method. By using the Pearson's correlation coefficient (PCC) value, we evaluated complex gene-gene functional interactions between 680 CR-responsive probes from integrated microarray data sets, which included large-scale transcriptional profiling of 1000 microarray samples. These probes were divided into 6 distinct modules that contained 20 enriched gene ontology (GO) functions, such as oxidoreductase activity, hydrolase activity, and response to stimulus and stress. In particular, modules 1 and 2 commonly showed enriched annotation categories such as oxidoreductase activity, including enriched cis-regulatory elements known as ROS-specific regulators. These results suggest that the ROS-mediated irradiation response pathway is affected by CR in modules 1 and 2. We found 243 ionizing radiation (IR)-responsive probes that exhibited similarities in expression patterns in various irradiation microarray data sets. The expression patterns of 6 randomly selected IR-responsive genes were evaluated by quantitative reverse transcription polymerase chain reaction following treatment with CR, gamma rays (GR), and ion beam (IB); similar patterns were observed among these genes under these 3 treatments. Moreover, we constructed subnetworks of IR-responsive genes and evaluated the expression levels of their neighboring genes following GR treatment; similar patterns were observed among them. These results of network-based analyses might provide a clue to understanding the complex biological system related to the CR response in plants. Copyright © 2014 Elsevier B.V. All rights reserved.
Ulrich, Reiner; Puff, Christina; Wewetzer, Konstantin; Kalkuhl, Arno; Deschl, Ulrich; Baumgärtner, Wolfgang
2014-01-01
Canine distemper virus (CDV)-induced demyelinating leukoencephalitis in dogs (Canis familiaris) is suggested to represent a naturally occurring translational model for subacute sclerosing panencephalitis and multiple sclerosis in humans. The aim of this study was a hypothesis-free microarray analysis of the transcriptional changes within cerebellar specimens of five cases of acute, six cases of subacute demyelinating, and three cases of chronic demyelinating and inflammatory CDV leukoencephalitis as compared to twelve non-infected control dogs. Frozen cerebellar specimens were used for analysis of histopathological changes including demyelination, transcriptional changes employing microarrays, and presence of CDV nucleoprotein RNA and protein using microarrays, RT-qPCR and immunohistochemistry. Microarray analysis revealed 780 differentially expressed probe sets. The dominating change was an up-regulation of genes related to the innate and the humoral immune response, and less distinct the cytotoxic T-cell-mediated immune response in all subtypes of CDV leukoencephalitis as compared to controls. Multiple myelin genes including myelin basic protein and proteolipid protein displayed a selective down-regulation in subacute CDV leukoencephalitis, suggestive of an oligodendrocyte dystrophy. In contrast, a marked up-regulation of multiple immunoglobulin-like expressed sequence tags and the delta polypeptide of the CD3 antigen was observed in chronic CDV leukoencephalitis, in agreement with the hypothesis of an immune-mediated demyelination in the late inflammatory phase of the disease. Analysis of pathways intimately linked to demyelination as determined by morphometry employing correlation-based Gene Set Enrichment Analysis highlighted the pathomechanistic importance of up-regulated genes comprised by the gene ontology terms “viral replication” and “humoral immune response” as well as down-regulated genes functionally related to “metabolite and energy generation”. PMID:24755553
Ulrich, Reiner; Puff, Christina; Wewetzer, Konstantin; Kalkuhl, Arno; Deschl, Ulrich; Baumgärtner, Wolfgang
2014-01-01
Canine distemper virus (CDV)-induced demyelinating leukoencephalitis in dogs (Canis familiaris) is suggested to represent a naturally occurring translational model for subacute sclerosing panencephalitis and multiple sclerosis in humans. The aim of this study was a hypothesis-free microarray analysis of the transcriptional changes within cerebellar specimens of five cases of acute, six cases of subacute demyelinating, and three cases of chronic demyelinating and inflammatory CDV leukoencephalitis as compared to twelve non-infected control dogs. Frozen cerebellar specimens were used for analysis of histopathological changes including demyelination, transcriptional changes employing microarrays, and presence of CDV nucleoprotein RNA and protein using microarrays, RT-qPCR and immunohistochemistry. Microarray analysis revealed 780 differentially expressed probe sets. The dominating change was an up-regulation of genes related to the innate and the humoral immune response, and less distinct the cytotoxic T-cell-mediated immune response in all subtypes of CDV leukoencephalitis as compared to controls. Multiple myelin genes including myelin basic protein and proteolipid protein displayed a selective down-regulation in subacute CDV leukoencephalitis, suggestive of an oligodendrocyte dystrophy. In contrast, a marked up-regulation of multiple immunoglobulin-like expressed sequence tags and the delta polypeptide of the CD3 antigen was observed in chronic CDV leukoencephalitis, in agreement with the hypothesis of an immune-mediated demyelination in the late inflammatory phase of the disease. Analysis of pathways intimately linked to demyelination as determined by morphometry employing correlation-based Gene Set Enrichment Analysis highlighted the pathomechanistic importance of up-regulated genes comprised by the gene ontology terms "viral replication" and "humoral immune response" as well as down-regulated genes functionally related to "metabolite and energy generation".
Coon, Keith D; Valla, Jon; Szelinger, Szabolics; Schneider, Lonnie E; Niedzielko, Tracy L; Brown, Kevin M; Pearson, John V; Halperin, Rebecca; Dunckley, Travis; Papassotiropoulos, Andreas; Caselli, Richard J; Reiman, Eric M; Stephan, Dietrich A
2006-08-01
The role of mitochondrial dysfunction in the pathogenesis of Alzheimer's disease (AD) has been well documented. Though evidence for the role of mitochondria in AD seems incontrovertible, the impact of mitochondrial DNA (mtDNA) mutations in AD etiology remains controversial. Though mutations in mitochondrially encoded genes have repeatedly been implicated in the pathogenesis of AD, many of these studies have been plagued by lack of replication as well as potential contamination of nuclear-encoded mitochondrial pseudogenes. To assess the role of mtDNA mutations in the pathogenesis of AD, while avoiding the pitfalls of nuclear-encoded mitochondrial pseudogenes encountered in previous investigations and showcasing the benefits of a novel resequencing technology, we sequenced the entire coding region (15,452 bp) of mtDNA from 19 extremely well-characterized AD patients and 18 age-matched, unaffected controls utilizing a new, reliable, high-throughput array-based resequencing technique, the Human MitoChip. High-throughput, array-based DNA resequencing of the entire mtDNA coding region from platelets of 37 subjects revealed the presence of 208 loci displaying a total of 917 sequence variants. There were no statistically significant differences in overall mutational burden between cases and controls, however, 265 independent sites of statistically significant change between cases and controls were identified. Changed sites were found in genes associated with complexes I (30.2%), III (3.0%), IV (33.2%), and V (9.1%) as well as tRNA (10.6%) and rRNA (14.0%). Despite their statistical significance, the subtle nature of the observed changes makes it difficult to determine whether they represent true functional variants involved in AD etiology or merely naturally occurring dissimilarity. Regardless, this study demonstrates the tremendous value of this novel mtDNA resequencing platform, which avoids the pitfalls of erroneously amplifying nuclear-encoded mtDNA pseudogenes, and our proposed analysis paradigm, which utilizes the availability of raw signal intensity values for each of the four potential alleles to facilitate quantitative estimates of mtDNA heteroplasmy. This information provides a potential new target for burgeoning diagnostics and therapeutics that could truly assist those suffering from this devastating disorder.
Jupiter, Daniel; Chen, Hailin; VanBuren, Vincent
2009-01-01
Background Although expression microarrays have become a standard tool used by biologists, analysis of data produced by microarray experiments may still present challenges. Comparison of data from different platforms, organisms, and labs may involve complicated data processing, and inferring relationships between genes remains difficult. Results STARNET 2 is a new web-based tool that allows post hoc visual analysis of correlations that are derived from expression microarray data. STARNET 2 facilitates user discovery of putative gene regulatory networks in a variety of species (human, rat, mouse, chicken, zebrafish, Drosophila, C. elegans, S. cerevisiae, Arabidopsis and rice) by graphing networks of genes that are closely co-expressed across a large heterogeneous set of preselected microarray experiments. For each of the represented organisms, raw microarray data were retrieved from NCBI's Gene Expression Omnibus for a selected Affymetrix platform. All pairwise Pearson correlation coefficients were computed for expression profiles measured on each platform, respectively. These precompiled results were stored in a MySQL database, and supplemented by additional data retrieved from NCBI. A web-based tool allows user-specified queries of the database, centered at a gene of interest. The result of a query includes graphs of correlation networks, graphs of known interactions involving genes and gene products that are present in the correlation networks, and initial statistical analyses. Two analyses may be performed in parallel to compare networks, which is facilitated by the new HEATSEEKER module. Conclusion STARNET 2 is a useful tool for developing new hypotheses about regulatory relationships between genes and gene products, and has coverage for 10 species. Interpretation of the correlation networks is supported with a database of previously documented interactions, a test for enrichment of Gene Ontology terms, and heat maps of correlation distances that may be used to compare two networks. The list of genes in a STARNET network may be useful in developing a list of candidate genes to use for the inference of causal networks. The tool is freely available at , and does not require user registration. PMID:19828039
Sato, Kengo; Kuroki, Yoko; Kumita, Wakako; Fujiyama, Asao; Toyoda, Atsushi; Kawai, Jun; Iriki, Atsushi; Sasaki, Erika; Okano, Hideyuki; Sakakibara, Yasubumi
2015-11-20
The first draft of the common marmoset (Callithrix jacchus) genome was published by the Marmoset Genome Sequencing and Analysis Consortium. The draft was based on whole-genome shotgun sequencing, and the current assembly version is Callithrix_jacches-3.2.1, but there still exist 187,214 undetermined gap regions and supercontigs and relatively short contigs that are unmapped to chromosomes in the draft genome. We performed resequencing and assembly of the genome of common marmoset by deep sequencing with high-throughput sequencing technology. Several different sequence runs using Illumina sequencing platforms were executed, and 181 Gbp of high-quality bases including mate-pairs with long insert lengths of 3, 8, 20, and 40 Kbp were obtained, that is, approximately 60× coverage. The resequencing significantly improved the MGSAC draft genome sequence. The N50 of the contigs, which is a statistical measure used to evaluate assembly quality, doubled. As a result, 51% of the contigs (total length: 299 Mbp) that were unmapped to chromosomes in the MGSAC draft were merged with chromosomal contigs, and the improved genome sequence helped to detect 5,288 new genes that are homologous to human cDNAs and the gaps in 5,187 transcripts of the Ensembl gene annotations were completely filled.
Li, Dong-Yao; Chen, Wen-Jie; Shang, Jun; Chen, Gang; Li, Shi-Kang
2018-06-01
Long non-coding RNAs (lncRNAs) have been demonstrated to mediate carcinogenesis in various types of cancer. However, the regulatory role of lncRNA LINC00968 in lung adenocarcinoma remains unclear. The microRNA (miRNA) expression in LINC00968-overexpressing human lung adenocarcinoma A549 cells was detected using miRNA microarray analysis. miR-9-3p was selected for further analysis, and its expression was verified in the Gene Expression Omnibus (GEO) database. In addition, the regulatory axis of LINC00968 was validated using The Cancer Genome Atlas (TCGA) database. Results of the GEO database indicated miR-9-3p expression in lung adenocarcinoma was significantly higher compared with normal tissues. Functional enrichment analyses of the target genes of miR-9-3p indicated protein binding and the AMP-activated protein kinase pathway were the most enriched Gene Ontology and KEGG terms, respectively. Combining target genes with the correlated genes of LINC00968 and miR-9-3p, 120 objective genes were obtained, which were used to construct a protein-protein interaction (PPI) network. Cyclin A2 (CCNA2) was identified to have a vital role in the PPI network. Significant correlations were detected between LINC00968, miR-9-3p and CCNA2 in lung adenocarcinoma. The LINC00968/miR-9-3p/CCNA2 regulatory axis provides a new foundation for further evaluating the regulatory mechanisms of LINC00968 in lung adenocarcinoma.
Chen, Xiao-Min; Feng, Ming-Jun; Shen, Cai-Jie; He, Bin; Du, Xian-Feng; Yu, Yi-Bo; Liu, Jing; Chu, Hui-Min
2017-07-01
The present study was designed to develop a novel method for identifying significant pathways associated with human hypertrophic cardiomyopathy (HCM), based on gene co‑expression analysis. The microarray dataset associated with HCM (E‑GEOD‑36961) was obtained from the European Molecular Biology Laboratory‑European Bioinformatics Institute database. Informative pathways were selected based on the Reactome pathway database and screening treatments. An empirical Bayes method was utilized to construct co‑expression networks for informative pathways, and a weight value was assigned to each pathway. Differential pathways were extracted based on weight threshold, which was calculated using a random model. In order to assess whether the co‑expression method was feasible, it was compared with traditional pathway enrichment analysis of differentially expressed genes, which were identified using the significance analysis of microarrays package. A total of 1,074 informative pathways were screened out for subsequent investigations and their weight values were also obtained. According to the threshold of weight value of 0.01057, 447 differential pathways, including folding of actin by chaperonin containing T‑complex protein 1 (CCT)/T‑complex protein 1 ring complex (TRiC), purine ribonucleoside monophosphate biosynthesis and ubiquinol biosynthesis, were obtained. Compared with traditional pathway enrichment analysis, the number of pathways obtained from the co‑expression approach was increased. The results of the present study demonstrated that this method may be useful to predict marker pathways for HCM. The pathways of folding of actin by CCT/TRiC and purine ribonucleoside monophosphate biosynthesis may provide evidence of the underlying molecular mechanisms of HCM, and offer novel therapeutic directions for HCM.
Small Deletion Variants Have Stable Breakpoints Commonly Associated with Alu Elements
Coin, Lachlan J. M.; Steinfeld, Israel; Yakhini, Zohar; Sladek, Rob; Froguel, Philippe; Blakemore, Alexandra I. F.
2008-01-01
Copy number variants (CNVs) contribute significantly to human genomic variation, with over 5000 loci reported, covering more than 18% of the euchromatic human genome. Little is known, however, about the origin and stability of variants of different size and complexity. We investigated the breakpoints of 20 small, common deletions, representing a subset of those originally identified by array CGH, using Agilent microarrays, in 50 healthy French Caucasian subjects. By sequencing PCR products amplified using primers designed to span the deleted regions, we determined the exact size and genomic position of the deletions in all affected samples. For each deletion studied, all individuals carrying the deletion share identical upstream and downstream breakpoints at the sequence level, suggesting that the deletion event occurred just once and later became common in the population. This is supported by linkage disequilibrium (LD) analysis, which has revealed that most of the deletions studied are in moderate to strong LD with surrounding SNPs, and have conserved long-range haplotypes. Analysis of the sequences flanking the deletion breakpoints revealed an enrichment of microhomology at the breakpoint junctions. More significantly, we found an enrichment of Alu repeat elements, the overwhelming majority of which intersected deletion breakpoints at their poly-A tails. We found no enrichment of LINE elements or segmental duplications, in contrast to other reports. Sequence analysis revealed enrichment of a conserved motif in the sequences surrounding the deletion breakpoints, although whether this motif has any mechanistic role in the formation of some deletions has yet to be determined. Considered together with existing information on more complex inherited variant regions, and reports of de novo variants associated with autism, these data support the presence of different subgroups of CNV in the genome which may have originated through different mechanisms. PMID:18769679
Fine-scale maps of recombination rates and hotspots in the mouse genome.
Brunschwig, Hadassa; Levi, Liat; Ben-David, Eyal; Williams, Robert W; Yakir, Benjamin; Shifman, Sagiv
2012-07-01
Recombination events are not uniformly distributed and often cluster in narrow regions known as recombination hotspots. Several studies using different approaches have dramatically advanced our understanding of recombination hotspot regulation. Population genetic data have been used to map and quantify hotspots in the human genome. Genetic variation in recombination rates and hotspots usage have been explored in human pedigrees, mouse intercrosses, and by sperm typing. These studies pointed to the central role of the PRDM9 gene in hotspot modulation. In this study, we used single nucleotide polymorphisms (SNPs) from whole-genome resequencing and genotyping studies of mouse inbred strains to estimate recombination rates across the mouse genome and identified 47,068 historical hotspots--an average of over 2477 per chromosome. We show by simulation that inbred mouse strains can be used to identify positions of historical hotspots. Recombination hotspots were found to be enriched for the predicted binding sequences for different alleles of the PRDM9 protein. Recombination rates were on average lower near transcription start sites (TSS). Comparing the inferred historical recombination hotspots with the recent genome-wide mapping of double-strand breaks (DSBs) in mouse sperm revealed a significant overlap, especially toward the telomeres. Our results suggest that inbred strains can be used to characterize and study the dynamics of historical recombination hotspots. They also strengthen previous findings on mouse recombination hotspots, and specifically the impact of sequence variants in Prdm9.
DRD4 genotype predicts longevity in mouse and human.
Grady, Deborah L; Thanos, Panayotis K; Corrada, Maria M; Barnett, Jeffrey C; Ciobanu, Valentina; Shustarovich, Diana; Napoli, Anthony; Moyzis, Alexandra G; Grandy, David; Rubinstein, Marcelo; Wang, Gene-Jack; Kawas, Claudia H; Chen, Chuansheng; Dong, Qi; Wang, Eric; Volkow, Nora D; Moyzis, Robert K
2013-01-02
Longevity is influenced by genetic and environmental factors. The brain's dopamine system may be particularly relevant, since it modulates traits (e.g., sensitivity to reward, incentive motivation, sustained effort) that impact behavioral responses to the environment. In particular, the dopamine D4 receptor (DRD4) has been shown to moderate the impact of environments on behavior and health. We tested the hypothesis that the DRD4 gene influences longevity and that its impact is mediated through environmental effects. Surviving participants of a 30-year-old population-based health survey (N = 310; age range, 90-109 years; the 90+ Study) were genotyped/resequenced at the DRD4 gene and compared with a European ancestry-matched younger population (N = 2902; age range, 7-45 years). We found that the oldest-old population had a 66% increase in individuals carrying the DRD4 7R allele relative to the younger sample (p = 3.5 × 10(-9)), and that this genotype was strongly correlated with increased levels of physical activity. Consistent with these results, DRD4 knock-out mice, when compared with wild-type and heterozygous mice, displayed a 7-9.7% decrease in lifespan, reduced spontaneous locomotor activity, and no lifespan increase when reared in an enriched environment. These results support the hypothesis that DRD4 gene variants contribute to longevity in humans and in mice, and suggest that this effect is mediated by shaping behavioral responses to the environment.
USDA-ARS?s Scientific Manuscript database
Human selection has reshaped crop genomes. Here we report an apple genome variation map generated through genome sequencing of 117 diverse accessions. A comprehensive model of apple speciation and domestication along the Silk Road was proposed based on evidence from diverse genomic analyses. Cultiva...
Investigating the epigenetic effects of a prototype smoke-derived carcinogen in human cells.
Tommasi, Stella; Kim, Sang-in; Zhong, Xueyan; Wu, Xiwei; Pfeifer, Gerd P; Besaratinia, Ahmad
2010-05-12
Global loss of DNA methylation and locus/gene-specific gain of DNA methylation are two distinct hallmarks of carcinogenesis. Aberrant DNA methylation is implicated in smoking-related lung cancer. In this study, we have comprehensively investigated the modulation of DNA methylation consequent to chronic exposure to a prototype smoke-derived carcinogen, benzo[a]pyrene diol epoxide (B[a]PDE), in genomic regions of significance in lung cancer, in normal human cells. We have used a pulldown assay for enrichment of the CpG methylated fraction of cellular DNA combined with microarray platforms, followed by extensive validation through conventional bisulfite-based analysis. Here, we demonstrate strikingly similar patterns of DNA methylation in non-transformed B[a]PDE-treated cells vs control using high-throughput microarray-based DNA methylation profiling confirmed by conventional bisulfite-based DNA methylation analysis. The absence of aberrant DNA methylation in our model system within a timeframe that precedes cellular transformation suggests that following carcinogen exposure, other as yet unknown factors (secondary to carcinogen treatment) may help initiate global loss of DNA methylation and region-specific gain of DNA methylation, which can, in turn, contribute to lung cancer development. Unveiling the initiating events that cause aberrant DNA methylation in lung cancer has tremendous public health relevance, as it can help define future strategies for early detection and prevention of this highly lethal disease.
Investigating the Epigenetic Effects of a Prototype Smoke-Derived Carcinogen in Human Cells
Tommasi, Stella; Kim, Sang-in; Zhong, Xueyan; Wu, Xiwei; Pfeifer, Gerd P.; Besaratinia, Ahmad
2010-01-01
Global loss of DNA methylation and locus/gene-specific gain of DNA methylation are two distinct hallmarks of carcinogenesis. Aberrant DNA methylation is implicated in smoking-related lung cancer. In this study, we have comprehensively investigated the modulation of DNA methylation consequent to chronic exposure to a prototype smoke-derived carcinogen, benzo[a]pyrene diol epoxide (B[a]PDE), in genomic regions of significance in lung cancer, in normal human cells. We have used a pulldown assay for enrichment of the CpG methylated fraction of cellular DNA combined with microarray platforms, followed by extensive validation through conventional bisulfite-based analysis. Here, we demonstrate strikingly similar patterns of DNA methylation in non-transformed B[a]PDE-treated cells vs control using high-throughput microarray-based DNA methylation profiling confirmed by conventional bisulfite-based DNA methylation analysis. The absence of aberrant DNA methylation in our model system within a timeframe that precedes cellular transformation suggests that following carcinogen exposure, other as yet unknown factors (secondary to carcinogen treatment) may help initiate global loss of DNA methylation and region-specific gain of DNA methylation, which can, in turn, contribute to lung cancer development. Unveiling the initiating events that cause aberrant DNA methylation in lung cancer has tremendous public health relevance, as it can help define future strategies for early detection and prevention of this highly lethal disease. PMID:20485678
Meta-analysis of pathway enrichment: combining independent and dependent omics data sets.
Kaever, Alexander; Landesfeind, Manuel; Feussner, Kirstin; Morgenstern, Burkhard; Feussner, Ivo; Meinicke, Peter
2014-01-01
A major challenge in current systems biology is the combination and integrative analysis of large data sets obtained from different high-throughput omics platforms, such as mass spectrometry based Metabolomics and Proteomics or DNA microarray or RNA-seq-based Transcriptomics. Especially in the case of non-targeted Metabolomics experiments, where it is often impossible to unambiguously map ion features from mass spectrometry analysis to metabolites, the integration of more reliable omics technologies is highly desirable. A popular method for the knowledge-based interpretation of single data sets is the (Gene) Set Enrichment Analysis. In order to combine the results from different analyses, we introduce a methodical framework for the meta-analysis of p-values obtained from Pathway Enrichment Analysis (Set Enrichment Analysis based on pathways) of multiple dependent or independent data sets from different omics platforms. For dependent data sets, e.g. obtained from the same biological samples, the framework utilizes a covariance estimation procedure based on the nonsignificant pathways in single data set enrichment analysis. The framework is evaluated and applied in the joint analysis of Metabolomics mass spectrometry and Transcriptomics DNA microarray data in the context of plant wounding. In extensive studies of simulated data set dependence, the introduced correlation could be fully reconstructed by means of the covariance estimation based on pathway enrichment. By restricting the range of p-values of pathways considered in the estimation, the overestimation of correlation, which is introduced by the significant pathways, could be reduced. When applying the proposed methods to the real data sets, the meta-analysis was shown not only to be a powerful tool to investigate the correlation between different data sets and summarize the results of multiple analyses but also to distinguish experiment-specific key pathways.
Tejera, Eduardo; Cruz-Monteagudo, Maykel; Burgos, Germán; Sánchez, María-Eugenia; Sánchez-Rodríguez, Aminael; Pérez-Castillo, Yunierkis; Borges, Fernanda; Cordeiro, Maria Natália Dias Soeiro; Paz-Y-Miño, César; Rebelo, Irene
2017-08-08
Preeclampsia is a multifactorial disease with unknown pathogenesis. Even when recent studies explored this disease using several bioinformatics tools, the main objective was not directed to pathogenesis. Additionally, consensus prioritization was proved to be highly efficient in the recognition of genes-disease association. However, not information is available about the consensus ability to early recognize genes directly involved in pathogenesis. Therefore our aim in this study is to apply several theoretical approaches to explore preeclampsia; specifically those genes directly involved in the pathogenesis. We firstly evaluated the consensus between 12 prioritization strategies to early recognize pathogenic genes related to preeclampsia. A communality analysis in the protein-protein interaction network of previously selected genes was done including further enrichment analysis. The enrichment analysis includes metabolic pathways as well as gene ontology. Microarray data was also collected and used in order to confirm our results or as a strategy to weight the previously enriched pathways. The consensus prioritized gene list was rationally filtered to 476 genes using several criteria. The communality analysis showed an enrichment of communities connected with VEGF-signaling pathway. This pathway is also enriched considering the microarray data. Our result point to VEGF, FLT1 and KDR as relevant pathogenic genes, as well as those connected with NO metabolism. Our results revealed that consensus strategy improve the detection and initial enrichment of pathogenic genes, at least in preeclampsia condition. Moreover the combination of the first percent of the prioritized genes with protein-protein interaction network followed by communality analysis reduces the gene space. This approach actually identifies well known genes related with pathogenesis. However, genes like HSP90, PAK2, CD247 and others included in the first 1% of the prioritized list need to be further explored in preeclampsia pathogenesis through experimental approaches.
Elephant Transcriptome Provides Insights into the Evolution of Eutherian Placentation
Hou, Zhuo-Cheng; Sterner, Kirstin N.; Romero, Roberto; Than, Nandor Gabor; Gonzalez, Juan M.; Weckle, Amy; Xing, Jun; Benirschke, Kurt; Goodman, Morris; Wildman, Derek E.
2012-01-01
The chorioallantoic placenta connects mother and fetus in eutherian pregnancies. In order to understand the evolution of the placenta and provide further understanding of placenta biology, we sequenced the transcriptome of a term placenta of an African elephant (Loxodonta africana) and compared these data with RNA sequence and microarray data from other eutherian placentas including human, mouse, and cow. We characterized the composition of 55,910 expressed sequence tag (i.e., cDNA) contigs using our custom annotation pipeline. A Markov algorithm was used to cluster orthologs of human, mouse, cow, and elephant placenta transcripts. We found 2,963 genes are commonly expressed in the placentas of these eutherian mammals. Gene ontology categories previously suggested to be important for placenta function (e.g., estrogen receptor signaling pathway, cell motion and migration, and adherens junctions) were significantly enriched in these eutherian placenta–expressed genes. Genes duplicated in different lineages and also specifically expressed in the placenta contribute to the great diversity observed in mammalian placenta anatomy. We identified 1,365 human lineage–specific, 1,235 mouse lineage–specific, 436 cow lineage–specific, and 904 elephant-specific placenta-expressed (PE) genes. The most enriched clusters of human-specific PE genes are signal/glycoprotein and immunoglobulin, and humans possess a deeply invasive human hemochorial placenta that comes into direct contact with maternal immune cells. Inference of phylogenetically conserved and derived transcripts demonstrates the power of comparative transcriptomics to trace placenta evolution and variation across mammals and identified candidate genes that may be important in the normal function of the human placenta, and their dysfunction may be related to human pregnancy complications. PMID:22546564
2014-01-01
Background With over 50 different disorders and a combined incidence of up to 1/3000 births, lysosomal storage diseases (LSDs) constitute a major public health problem and place an enormous burden on affected individuals and their families. Many factors make LSD diagnosis difficult, including phenotype and penetrance variability, shared signs and symptoms, and problems inherent to biochemical diagnosis. Developing a powerful diagnostic tool could mitigate the protracted diagnostic process for these families, lead to better outcomes for current and proposed therapies, and provide the basis for more appropriate genetic counseling. Methods We have designed a targeted resequencing assay for the simultaneous testing of 57 lysosomal genes, using in-solution capture as the enrichment method and two different sequencing platforms. A total of 84 patients with high to moderate-or low suspicion index for LSD were enrolled in different centers in Spain and Portugal, including 18 positive controls. Results We correctly diagnosed 18 positive blinded controls, provided genetic diagnosis to 25 potential LSD patients, and ended with 18 diagnostic odysseys. Conclusion We report the assessment of a next–generation-sequencing-based approach as an accessory tool in the diagnosis of LSDs, a group of disorders which have overlapping clinical profiles and genetic heterogeneity. We have also identified and quantified the strengths and limitations of next generation sequencing (NGS) technology applied to diagnosis. PMID:24767253
Howard, Thomas P; Hayward, Andrew P; Tordillos, Anthony; Fragoso, Christopher; Moreno, Maria A; Tohme, Joe; Kausch, Albert P; Mottinger, John P; Dellaporta, Stephen L
2014-01-01
Since their initial discovery, transposons have been widely used as mutagens for forward and reverse genetic screens in a range of organisms. The problems of high copy number and sequence divergence among related transposons have often limited the efficiency at which tagged genes can be identified. A method was developed to identity the locations of Mutator (Mu) transposons in the Zea mays genome using a simple enrichment method combined with genome resequencing to identify transposon junction fragments. The sequencing library was prepared from genomic DNA by digesting with a restriction enzyme that cuts within a perfectly conserved motif of the Mu terminal inverted repeats (TIR). Paired-end reads containing Mu TIR sequences were computationally identified and chromosomal sequences flanking the transposon were mapped to the maize reference genome. This method has been used to identify Mu insertions in a number of alleles and to isolate the previously unidentified lazy plant1 (la1) gene. The la1 gene is required for the negatively gravitropic response of shoots and mutant plants lack the ability to sense gravity. Using bioinformatic and fluorescence microscopy approaches, we show that the la1 gene encodes a cell membrane and nuclear localized protein. Our Mu-Taq method is readily adaptable to identify the genomic locations of any insertion of a known sequence in any organism using any sequencing platform.
Howard, Thomas P.; Hayward, Andrew P.; Tordillos, Anthony; Fragoso, Christopher; Moreno, Maria A.; Tohme, Joe; Kausch, Albert P.; Mottinger, John P.; Dellaporta, Stephen L.
2014-01-01
Since their initial discovery, transposons have been widely used as mutagens for forward and reverse genetic screens in a range of organisms. The problems of high copy number and sequence divergence among related transposons have often limited the efficiency at which tagged genes can be identified. A method was developed to identity the locations of Mutator (Mu) transposons in the Zea mays genome using a simple enrichment method combined with genome resequencing to identify transposon junction fragments. The sequencing library was prepared from genomic DNA by digesting with a restriction enzyme that cuts within a perfectly conserved motif of the Mu terminal inverted repeats (TIR). Paired-end reads containing Mu TIR sequences were computationally identified and chromosomal sequences flanking the transposon were mapped to the maize reference genome. This method has been used to identify Mu insertions in a number of alleles and to isolate the previously unidentified lazy plant1 (la1) gene. The la1 gene is required for the negatively gravitropic response of shoots and mutant plants lack the ability to sense gravity. Using bioinformatic and fluorescence microscopy approaches, we show that the la1 gene encodes a cell membrane and nuclear localized protein. Our Mu-Taq method is readily adaptable to identify the genomic locations of any insertion of a known sequence in any organism using any sequencing platform. PMID:24498020
Summerfield, Taryn L.; Yu, Lianbo; Gulati, Parul; Zhang, Jie; Huang, Kun; Romero, Roberto; Kniss, Douglas A.
2011-01-01
A majority of the studies examining the molecular regulation of human labor have been conducted using single gene approaches. While the technology to produce multi-dimensional datasets is readily available, the means for facile analysis of such data are limited. The objective of this study was to develop a systems approach to infer regulatory mechanisms governing global gene expression in cytokine-challenged cells in vitro, and to apply these methods to predict gene regulatory networks (GRNs) in intrauterine tissues during term parturition. To this end, microarray analysis was applied to human amnion mesenchymal cells (AMCs) stimulated with interleukin-1β, and differentially expressed transcripts were subjected to hierarchical clustering, temporal expression profiling, and motif enrichment analysis, from which a GRN was constructed. These methods were then applied to fetal membrane specimens collected in the absence or presence of spontaneous term labor. Analysis of cytokine-responsive genes in AMCs revealed a sterile immune response signature, with promoters enriched in response elements for several inflammation-associated transcription factors. In comparison to the fetal membrane dataset, there were 34 genes commonly upregulated, many of which were part of an acute inflammation gene expression signature. Binding motifs for nuclear factor-κB were prominent in the gene interaction and regulatory networks for both datasets; however, we found little evidence to support the utilization of pathogen-associated molecular pattern (PAMP) signaling. The tissue specimens were also enriched for transcripts governed by hypoxia-inducible factor. The approach presented here provides an uncomplicated means to infer global relationships among gene clusters involved in cellular responses to labor-associated signals. PMID:21655103
A molecular signature of an arrest of descent in human parturition
MITTAL, Pooja; ROMERO, Roberto; TARCA, Adi L.; DRAGHICI, Sorin; NHAN-CHANG, Chia-Ling; CHAIWORAPONGSA, Tinnakorn; HOTRA, John; GOMEZ, Ricardo; KUSANOVIC, Juan Pedro; LEE, Deug-Chan; KIM, Chong Jai; HASSAN, Sonia S.
2010-01-01
Objective This study was undertaken to identify the molecular basis of an arrest of descent. Study Design Human myometrium was obtained from women in term labor (TL; n=29) and arrest of descent (AODes, n=21). Gene expression was characterized using Illumina® HumanHT-12 microarrays. A moderated t-test and false discovery rate adjustment were applied for analysis. Confirmatory qRT-PCR and immunoblot was performed in an independent sample set. Results 400 genes were differentially expressed between women with an AODes compared to those with TL. Gene Ontology analysis indicated enrichment of biological processes and molecular functions related to inflammation and muscle function. Impacted pathways included inflammation and the actin cytoskeleton. Overexpression of HIF1A, IL-6, and PTGS2 in AODES was confirmed. Conclusion We have identified a stereotypic pattern of gene expression in the myometrium of women with an arrest of descent. This represents the first study examining the molecular basis of an arrest of descent using a genome-wide approach. PMID:21284969
2013-01-01
Background Intronic and intergenic long noncoding RNAs (lncRNAs) are emerging gene expression regulators. The molecular pathogenesis of renal cell carcinoma (RCC) is still poorly understood, and in particular, limited studies are available for intronic lncRNAs expressed in RCC. Methods Microarray experiments were performed with custom-designed arrays enriched with probes for lncRNAs mapping to intronic genomic regions. Samples from 18 primary RCC tumors and 11 nontumor adjacent matched tissues were analyzed. Meta-analyses were performed with microarray expression data from three additional human tissues (normal liver, prostate tumor and kidney nontumor samples), and with large-scale public data for epigenetic regulatory marks and for evolutionarily conserved sequences. Results A signature of 29 intronic lncRNAs differentially expressed between RCC and nontumor samples was obtained (false discovery rate (FDR) <5%). A signature of 26 intronic lncRNAs significantly correlated with the RCC five-year patient survival outcome was identified (FDR <5%, p-value ≤0.01). We identified 4303 intronic antisense lncRNAs expressed in RCC, of which 22% were significantly (p <0.05) cis correlated with the expression of the mRNA in the same locus across RCC and three other human tissues. Gene Ontology (GO) analysis of those loci pointed to 'regulation of biological processes’ as the main enriched category. A module map analysis of the protein-coding genes significantly (p <0.05) trans correlated with the 20% most abundant lncRNAs, identified 51 enriched GO terms (p <0.05). We determined that 60% of the expressed lncRNAs are evolutionarily conserved. At the genomic loci containing the intronic RCC-expressed lncRNAs, a strong association (p <0.001) was found between their transcription start sites and genomic marks such as CpG islands, RNA Pol II binding and histones methylation and acetylation. Conclusion Intronic antisense lncRNAs are widely expressed in RCC tumors. Some of them are significantly altered in RCC in comparison with nontumor samples. The majority of these lncRNAs is evolutionarily conserved and possibly modulated by epigenetic modifications. Our data suggest that these RCC lncRNAs may contribute to the complex network of regulatory RNAs playing a role in renal cell malignant transformation. PMID:24238219
2010-01-01
Background Classical and quantitative linkage analyses of genetic crosses have traditionally been used to map genes of interest, such as those conferring chloroquine or quinine resistance in malaria parasites. Next-generation sequencing technologies now present the possibility of determining genome-wide genetic variation at single base-pair resolution. Here, we combine in vivo experimental evolution, a rapid genetic strategy and whole genome re-sequencing to identify the precise genetic basis of artemisinin resistance in a lineage of the rodent malaria parasite, Plasmodium chabaudi. Such genetic markers will further the investigation of resistance and its control in natural infections of the human malaria, P. falciparum. Results A lineage of isogenic in vivo drug-selected mutant P. chabaudi parasites was investigated. By measuring the artemisinin responses of these clones, the appearance of an in vivo artemisinin resistance phenotype within the lineage was defined. The underlying genetic locus was mapped to a region of chromosome 2 by Linkage Group Selection in two different genetic crosses. Whole-genome deep coverage short-read re-sequencing (Illumina® Solexa) defined the point mutations, insertions, deletions and copy-number variations arising in the lineage. Eight point mutations arise within the mutant lineage, only one of which appears on chromosome 2. This missense mutation arises contemporaneously with artemisinin resistance and maps to a gene encoding a de-ubiquitinating enzyme. Conclusions This integrated approach facilitates the rapid identification of mutations conferring selectable phenotypes, without prior knowledge of biological and molecular mechanisms. For malaria, this model can identify candidate genes before resistant parasites are commonly observed in natural human malaria populations. PMID:20846421
Zhao, Zhongming; Guo, An-Yuan; van den Oord, Edwin J C G; Aliev, Fazil; Jia, Peilin; Edenberg, Howard J; Riley, Brien P; Dick, Danielle M; Bettinger, Jill C; Davies, Andrew G; Grotewiel, Michael S; Schuckit, Marc A; Agrawal, Arpana; Kramer, John; Nurnberger, John I; Kendler, Kenneth S; Webb, Bradley T; Miles, Michael F
2012-01-01
A variety of species and experimental designs have been used to study genetic influences on alcohol dependence, ethanol response, and related traits. Integration of these heterogeneous data can be used to produce a ranked target gene list for additional investigation. In this study, we performed a unique multi-species evidence-based data integration using three microarray experiments in mice or humans that generated an initial alcohol dependence (AD) related genes list, human linkage and association results, and gene sets implicated in C. elegans and Drosophila. We then used permutation and false discovery rate (FDR) analyses on the genome-wide association studies (GWAS) dataset from the Collaborative Study on the Genetics of Alcoholism (COGA) to evaluate the ranking results and weighting matrices. We found one weighting score matrix could increase FDR based q-values for a list of 47 genes with a score greater than 2. Our follow up functional enrichment tests revealed these genes were primarily involved in brain responses to ethanol and neural adaptations occurring with alcoholism. These results, along with our experimental validation of specific genes in mice, C. elegans and Drosophila, suggest that a cross-species evidence-based approach is useful to identify candidate genes contributing to alcoholism.
Lee, Min-Young; Yu, Ji Hea; Kim, Ji Yeon; Seo, Jung Hwa; Park, Eun Sook; Kim, Chul Hoon; Kim, Hyongbum; Cho, Sung-Rae
2013-01-01
Housing animals in an enriched environment (EE) enhances behavioral function. However, the mechanism underlying this EE-mediated functional improvement and the resultant changes in gene expression have yet to be elucidated. We attempted to investigate the underlying mechanisms associated with long-term exposure to an EE by evaluating gene expression patterns. We housed 6-week-old CD-1 (ICR) mice in standard cages or an EE comprising a running wheel, novel objects, and social interaction for 2 months. Motor and cognitive performances were evaluated using the rotarod test and passive avoidance test, and gene expression profile was investigated in the cerebral hemispheres using microarray and gene set enrichment analysis (GSEA). In behavioral assessment, an EE significantly enhanced rotarod performance and short-term working memory. Microarray analysis revealed that genes associated with neuronal activity were significantly altered by an EE. GSEA showed that genes involved in synaptic transmission and postsynaptic signal transduction were globally upregulated, whereas those associated with reuptake by presynaptic neurotransmitter transporters were downregulated. In particular, both microarray and GSEA demonstrated that EE exposure increased opioid signaling, acetylcholine release cycle, and postsynaptic neurotransmitter receptors but decreased Na+ / Cl- -dependent neurotransmitter transporters, including dopamine transporter Slc6a3 in the brain. Western blotting confirmed that SLC6A3, DARPP32 (PPP1R1B), and P2RY12 were largely altered in a region-specific manner. An EE enhanced motor and cognitive function through the alteration of synaptic activity-regulating genes, improving the efficient use of neurotransmitters and synaptic plasticity by the upregulation of genes associated with postsynaptic receptor activity and downregulation of presynaptic reuptake by neurotransmitter transporters.
Kim, Tae Hoon; Dekker, Job
2018-05-01
ChIP-chip can be used to analyze protein-DNA interactions in a region-wide and genome-wide manner. DNA microarrays contain PCR products or oligonucleotide probes that are designed to represent genomic sequences. Identification of genomic sites that interact with a specific protein is based on competitive hybridization of the ChIP-enriched DNA and the input DNA to DNA microarrays. The ChIP-chip protocol can be divided into two main sections: Amplification of ChIP DNA and hybridization of ChIP DNA to arrays. A large amount of DNA is required to hybridize to DNA arrays, and hybridization to a set of multiple commercial arrays that represent the entire human genome requires two rounds of PCR amplifications. The relative hybridization intensity of ChIP DNA and that of the input DNA is used to determine whether the probe sequence is a potential site of protein-DNA interaction. Resolution of actual genomic sites bound by the protein is dependent on the size of the chromatin and on the genomic distance between the probes on the array. As with expression profiling using gene chips, ChIP-chip experiments require multiple replicates for reliable statistical measure of protein-DNA interactions. © 2018 Cold Spring Harbor Laboratory Press.
Tuononen, Katja; Sarhadi, Virinder Kaur; Wirtanen, Aino; Rönty, Mikko; Salmenkivi, Kaisa; Knuuttila, Aija; Remes, Satu; Telaranta-Keerie, Aino I; Bloor, Stuart; Ellonen, Pekka; Knuutila, Sakari
2013-01-01
Anaplastic lymphoma receptor tyrosine kinase (ALK) gene rearrangements occur in a subgroup of non-small cell lung carcinomas (NSCLCs). The identification of these rearrangements is important for guiding treatment decisions. The aim of our study was to screen ALK gene fusions in NSCLCs and to compare the results detected by targeted resequencing with results detected by commonly used methods, including fluorescence in situ hybridization (FISH), immunohistochemistry (IHC), and real-time reverse transcription-PCR (RT-PCR). Furthermore, we aimed to ascertain the potential of targeted resequencing in detection of ALK-rearranged lung carcinomas. We assessed ALK fusion status for 95 formalin-fixed paraffin-embedded tumor tissue specimens from 87 patients with NSCLC by FISH and real-time RT-PCR, for 57 specimens from 56 patients by targeted resequencing, and for 14 specimens from 14 patients by IHC. All methods were performed successfully on formalin-fixed paraffin-embedded tumor tissue material. We detected ALK fusion in 5.7% (5 out of 87) of patients examined. The results obtained from resequencing correlated significantly with those from FISH, real-time RT-PCR, and IHC. Targeted resequencing proved to be a promising method for ALK gene fusion detection in NSCLC. Means to reduce the material and turnaround time required for analysis are, however, needed.
2010-01-01
Background The European sea bass (Dicentrarchus labrax) is a marine fish of great importance for fisheries and aquaculture. Functional genomics offers the possibility to discover the molecular mechanisms underlying productive traits in farmed fish, and a step towards the application of marker assisted selection methods in this species. To this end, we report here on the development of an oligo DNA microarray for D. labrax. Results A database consisting of 19,048 unique transcripts was constructed, of which 12,008 (63%) could be annotated by similarity and 4,692 received a GO functional annotation. Two non-overlapping 60mer probes were designed for each unique transcript and in-situ synthesized on glass slides using Agilent SurePrint™ technology. Probe design was positively completed for 19,035 target clusters; the oligo microarray was then applied to profile gene expression in mandibles and whole-heads of fish affected by prognathism, a skeletal malformation that strongly affects sea bass production. Statistical analysis identified 242 transcripts that are significantly down-regulated in deformed individuals compared to normal fish, with a significant enrichment in genes related to nervous system development and functioning. A set of genes spanning a wide dynamic range in gene expression level were selected for quantitative RT-PCR validation. Fold change correlation between microarray and qPCR data was always significant. Conclusions The microarray platform developed for the European sea bass has a high level of flexibility, reliability, and reproducibility. Despite the well known limitations in achieving a proper functional annotation in non-model species, sufficient information was obtained to identify biological processes that are significantly enriched among differentially expressed genes. New insights were obtained on putative mechanisms involved on mandibular prognathism, suggesting that bone/nervous system development might play a role in this phenomenon. PMID:20525278
Characterizing biomarkers in osteosarcoma metastasis based on an ego-network.
Liu, Zhen; Song, Yan
2017-06-01
To characterize biomarkers that underlie osteosarcoma (OS) metastasis based on an ego-network. From the microarray data, we obtained 13,326 genes. By combining PPI data and microarray data, 10,520 shared genes were found and constructed into ego-networks. 17 significant ego-networks were identified with p < 0.05. In the pathway enrichment analysis, seven ego-networks were identified with the most significant pathway. These significant ego-modules were potential biomarkers that reveal the potential mechanisms in OS metastasis, which may contribute to understanding cancer prognoses and providing new perspectives in the treatment of cancer.
Novel variants in human and monkey CETP.
Lloyd, David B; Reynolds, Jennifer M; Cronan, Melissa T; Williams, Suzanne P; Lira, Maruja E; Wood, Linda S; Knight, Delvin R; Thompson, John F
2005-10-15
Variation in CETP has been shown to play an important role in HDL-C levels and cardiovascular disease. To better characterize this variation, the promoter and exonic DNA for CETP was resequenced in 189 individuals with extreme HDL-C or age. Two novel amino acid variants were found in humans (V-12D and Y361C) and an additional variant (R137W) not previously studied in vitro were expressed. D-12 was not secreted and had no detectable activity in cells. C361 and W137 retained near normal amounts of cholesteryl ester transfer activity when purified but were less well secreted than wild type. Torcetrapib, a CETP inhibitor in clinical development with atorvastatin, was found to have a uniform effect on inhibition of wild type CETP versus W137 or C361. In addition, the level of variation in other species was assessed by resequencing DNA from nine cynomolgus monkeys. Numerous intronic and silent SNPs were found as well as two variable amino acids. The amino acid altering SNPs were genotyped in 29 monkeys and not found to be significantly associated with HDL-C levels. Three SNPs found in monkeys were identical to three found in humans with these SNPs all occurring at CpG sites.
Parallel human genome analysis: microarray-based expression monitoring of 1000 genes.
Schena, M; Shalon, D; Heller, R; Chai, A; Brown, P O; Davis, R W
1996-01-01
Microarrays containing 1046 human cDNAs of unknown sequence were printed on glass with high-speed robotics. These 1.0-cm2 DNA "chips" were used to quantitatively monitor differential expression of the cognate human genes using a highly sensitive two-color hybridization assay. Array elements that displayed differential expression patterns under given experimental conditions were characterized by sequencing. The identification of known and novel heat shock and phorbol ester-regulated genes in human T cells demonstrates the sensitivity of the assay. Parallel gene analysis with microarrays provides a rapid and efficient method for large-scale human gene discovery. Images Fig. 1 Fig. 2 Fig. 3 PMID:8855227
DNA Microarrays for Aptamer Identification and Structural Characterization
2012-09-01
appropriate vector (which has a unique set of factors affecting cloning efficiency) and transformed into competent bacterial cells to spatially...818-822. 2) Tuerk, C. and Gold, L., “Systematic Evolution of Ligands by Exponential Enrichment: RNA Ligands to Bacteriophage T4 DNA Polymerase
Wu, Chengjiang; Zhao, Yangjing; Lin, Yu; Yang, Xinxin; Yan, Meina; Min, Yujiao; Pan, Zihui; Xia, Sheng; Shao, Qixiang
2018-01-01
DNA microarray and high-throughput sequencing have been widely used to identify the differentially expressed genes (DEGs) in systemic lupus erythematosus (SLE). However, the big data from gene microarrays are also challenging to work with in terms of analysis and processing. The presents study combined data from the microarray expression profile (GSE65391) and bioinformatics analysis to identify the key genes and cellular pathways in SLE. Gene ontology (GO) and cellular pathway enrichment analyses of DEGs were performed to investigate significantly enriched pathways. A protein-protein interaction network was constructed to determine the key genes in the occurrence and development of SLE. A total of 310 DEGs were identified in SLE, including 193 upregulated genes and 117 downregulated genes. GO analysis revealed that the most significant biological process of DEGs was immune system process. Kyoto Encyclopedia of Genes and Genome pathway analysis showed that these DEGs were enriched in signaling pathways associated with the immune system, including the RIG-I-like receptor signaling pathway, intestinal immune network for IgA production, antigen processing and presentation and the toll-like receptor signaling pathway. The current study screened the top 10 genes with higher degrees as hub genes, which included 2′-5′-oligoadenylate synthetase 1, MX dynamin like GTPase 2, interferon induced protein with tetratricopeptide repeats 1, interferon regulatory factor 7, interferon induced with helicase C domain 1, signal transducer and activator of transcription 1, ISG15 ubiquitin-like modifier, DExD/H-box helicase 58, interferon induced protein with tetratricopeptide repeats 3 and 2′-5′-oligoadenylate synthetase 2. Module analysis revealed that these hub genes were also involved in the RIG-I-like receptor signaling, cytosolic DNA-sensing, toll-like receptor signaling and ribosome biogenesis pathways. In addition, these hub genes, from different probe sets, exhibited significant co-expressed tendency in multi-experiment microarray datasets (P<0.01). In conclusion, these key genes and cellular pathways may improve the current understanding of the underlying mechanism of development of SLE. These key genes may be potential biomarkers of diagnosis, therapy and prognosis for SLE. PMID:29257335
Pathway analysis of high-throughput biological data within a Bayesian network framework.
Isci, Senol; Ozturk, Cengizhan; Jones, Jon; Otu, Hasan H
2011-06-15
Most current approaches to high-throughput biological data (HTBD) analysis either perform individual gene/protein analysis or, gene/protein set enrichment analysis for a list of biologically relevant molecules. Bayesian Networks (BNs) capture linear and non-linear interactions, handle stochastic events accounting for noise, and focus on local interactions, which can be related to causal inference. Here, we describe for the first time an algorithm that models biological pathways as BNs and identifies pathways that best explain given HTBD by scoring fitness of each network. Proposed method takes into account the connectivity and relatedness between nodes of the pathway through factoring pathway topology in its model. Our simulations using synthetic data demonstrated robustness of our approach. We tested proposed method, Bayesian Pathway Analysis (BPA), on human microarray data regarding renal cell carcinoma (RCC) and compared our results with gene set enrichment analysis. BPA was able to find broader and more specific pathways related to RCC. Accompanying BPA software (BPAS) package is freely available for academic use at http://bumil.boun.edu.tr/bpa.
Analysis of gene expression profile microarray data in complex regional pain syndrome.
Tan, Wulin; Song, Yiyan; Mo, Chengqiang; Jiang, Shuangjian; Wang, Zhongxing
2017-09-01
The aim of the present study was to predict key genes and proteins associated with complex regional pain syndrome (CRPS) using bioinformatics analysis. The gene expression profiling microarray data, GSE47603, which included peripheral blood samples from 4 patients with CRPS and 5 healthy controls, was obtained from the Gene Expression Omnibus (GEO) database. The differentially expressed genes (DEGs) in CRPS patients compared with healthy controls were identified using the GEO2R online tool. Functional enrichment analysis was then performed using The Database for Annotation Visualization and Integrated Discovery online tool. Protein‑protein interaction (PPI) network analysis was subsequently performed using Search Tool for the Retrieval of Interaction Genes database and analyzed with Cytoscape software. A total of 257 DEGs were identified, including 243 upregulated genes and 14 downregulated ones. Genes in the human leukocyte antigen (HLA) family were most significantly differentially expressed. Enrichment analysis demonstrated that signaling pathways, including immune response, cell motion, adhesion and angiogenesis were associated with CRPS. PPI network analysis revealed that key genes, including early region 1A binding protein p300 (EP300), CREB‑binding protein (CREBBP), signal transducer and activator of transcription (STAT)3, STAT5A and integrin α M were associated with CRPS. The results suggest that the immune response may therefore serve an important role in CRPS development. In addition, genes in the HLA family, such as HLA‑DQB1 and HLA‑DRB1, may present potential biomarkers for the diagnosis of CRPS. Furthermore, EP300, its paralog CREBBP, and the STAT family genes, STAT3 and STAT5 may be important in the development of CRPS.
Zhou, Junhua; Lam, Brian; Neogi, Sudeshna G; Yeo, Giles S H; Azizan, Elena A B; Brown, Morris J
2016-12-01
Primary aldosteronism is present in ≈10% of hypertensives. We previously performed a microarray assay on aldosterone-producing adenomas and their paired zona glomerulosa and fasciculata. Confirmation of top genes validated the study design and functional experiments of zona glomerulosa selective genes established the role of the encoded proteins in aldosterone regulation. In this study, we further analyzed our microarray data using AmiGO 2 for gene ontology enrichment and Ingenuity Pathway Analysis to identify potential biological processes and canonical pathways involved in pathological and physiological aldosterone regulation. Genes differentially regulated in aldosterone-producing adenoma and zona glomerulosa were associated with steroid metabolic processes gene ontology terms. Terms related to the Wnt signaling pathway were enriched in zona glomerulosa only. Ingenuity Pathway Analysis showed "NRF2-mediated oxidative stress response pathway" and "LPS (lipopolysaccharide)/IL-1 (interleukin-1)-mediated inhibition of RXR (retinoid X receptor) function" were affected in both aldosterone-producing adenoma and zona glomerulosa with associated genes having up to 21- and 8-fold differences, respectively. Comparing KCNJ5-mutant aldosterone-producing adenoma, zona glomerulosa, and zona fasciculata samples with wild-type samples, 138, 56, and 59 genes were differentially expressed, respectively (fold-change >2; P<0.05). ACSS3, encoding the enzyme that synthesizes acetyl-CoA, was the top gene upregulated in KCNJ5-mutant aldosterone-producing adenoma compared with wild-type. NEFM, a gene highly upregulated in zona glomerulosa, was upregulated in KCNJ5 wild-type aldosterone-producing adenomas. NR4A2, the transcription factor for aldosterone synthase, was highly expressed in zona fasciculata adjacent to a KCNJ5-mutant aldosterone-producing adenoma. Further interrogation of these genes and pathways could potentially provide further insights into the pathology of primary aldosteronism. © 2016 The Authors.
Rai, Muhammad Farooq; Tycksen, Eric D; Sandell, Linda J; Brophy, Robert H
2018-01-01
Microarrays and RNA-seq are at the forefront of high throughput transcriptome analyses. Since these methodologies are based on different principles, there are concerns about the concordance of data between the two techniques. The concordance of RNA-seq and microarrays for genome-wide analysis of differential gene expression has not been rigorously assessed in clinically derived ligament tissues. To demonstrate the concordance between RNA-seq and microarrays and to assess potential benefits of RNA-seq over microarrays, we assessed differences in transcript expression in anterior cruciate ligament (ACL) tissues based on time-from-injury. ACL remnants were collected from patients with an ACL tear at the time of ACL reconstruction. RNA prepared from torn ACL remnants was subjected to Agilent microarrays (N = 24) and RNA-seq (N = 8). The correlation of biological replicates in RNA-seq and microarrays data was similar (0.98 vs. 0.97), demonstrating that each platform has high internal reproducibility. Correlations between the RNA-seq data and the individual microarrays were low, but correlations between the RNA-seq values and the geometric mean of the microarrays values were moderate. The cross-platform concordance for differentially expressed transcripts or enriched pathways was linearly correlated (r = 0.64). RNA-Seq was superior in detecting low abundance transcripts and differentiating biologically critical isoforms. Additional independent validation of transcript expression was undertaken using microfluidic PCR for selected genes. PCR data showed 100% concordance (in expression pattern) with RNA-seq and microarrays data. These findings demonstrate that RNA-seq has advantages over microarrays for transcriptome profiling of ligament tissues when available and affordable. Furthermore, these findings are likely transferable to other musculoskeletal tissues where tissue collection is challenging and cells are in low abundance. © 2017 Orthopaedic Research Society. Published by Wiley Periodicals, Inc. J Orthop Res 36:484-497, 2018. © 2017 Orthopaedic Research Society. Published by Wiley Periodicals, Inc.
Metzgar, David; Myers, Christopher A.; Russell, Kevin L.; Faix, Dennis; Blair, Patrick J.; Brown, Jason; Vo, Scott; Swayne, David E.; Thomas, Colleen; Stenger, David A.; Lin, Baochuan; Malanoski, Anthony P.; Wang, Zheng; Blaney, Kate M.; Long, Nina C.; Schnur, Joel M.; Saad, Magdi D.; Borsuk, Lisa A.; Lichanska, Agnieszka M.; Lorence, Matthew C.; Weslowski, Brian; Schafer, Klaus O.; Tibbetts, Clark
2010-01-01
For more than four decades the cause of most type A influenza virus infections of humans has been attributed to only two viral subtypes, A/H1N1 or A/H3N2. In contrast, avian and other vertebrate species are a reservoir of type A influenza virus genome diversity, hosting strains representing at least 120 of 144 combinations of 16 viral hemagglutinin and 9 viral neuraminidase subtypes. Viral genome segment reassortments and mutations emerging within this reservoir may spawn new influenza virus strains as imminent epidemic or pandemic threats to human health and poultry production. Traditional methods to detect and differentiate influenza virus subtypes are either time-consuming and labor-intensive (culture-based) or remarkably insensitive (antibody-based). Molecular diagnostic assays based upon reverse transcriptase-polymerase chain reaction (RT-PCR) have short assay cycle time, and high analytical sensitivity and specificity. However, none of these diagnostic tests determine viral gene nucleotide sequences to distinguish strains and variants of a detected pathogen from one specimen to the next. Decision-quality, strain- and variant-specific pathogen gene sequence information may be critical for public health, infection control, surveillance, epidemiology, or medical/veterinary treatment planning. The Resequencing Pathogen Microarray (RPM-Flu) is a robust, highly multiplexed and target gene sequencing-based alternative to both traditional culture- or biomarker-based diagnostic tests. RPM-Flu is a single, simultaneous differential diagnostic assay for all subtype combinations of type A influenza viruses and for 30 other viral and bacterial pathogens that may cause influenza-like illness. These other pathogen targets of RPM-Flu may co-infect and compound the morbidity and/or mortality of patients with influenza. The informative specificity of a single RPM-Flu test represents specimen-specific viral gene sequences as determinants of virus type, A/HN subtype, virulence, host-range, and resistance to antiviral agents. PMID:20140251
Metzgar, David; Myers, Christopher A; Russell, Kevin L; Faix, Dennis; Blair, Patrick J; Brown, Jason; Vo, Scott; Swayne, David E; Thomas, Colleen; Stenger, David A; Lin, Baochuan; Malanoski, Anthony P; Wang, Zheng; Blaney, Kate M; Long, Nina C; Schnur, Joel M; Saad, Magdi D; Borsuk, Lisa A; Lichanska, Agnieszka M; Lorence, Matthew C; Weslowski, Brian; Schafer, Klaus O; Tibbetts, Clark
2010-02-03
For more than four decades the cause of most type A influenza virus infections of humans has been attributed to only two viral subtypes, A/H1N1 or A/H3N2. In contrast, avian and other vertebrate species are a reservoir of type A influenza virus genome diversity, hosting strains representing at least 120 of 144 combinations of 16 viral hemagglutinin and 9 viral neuraminidase subtypes. Viral genome segment reassortments and mutations emerging within this reservoir may spawn new influenza virus strains as imminent epidemic or pandemic threats to human health and poultry production. Traditional methods to detect and differentiate influenza virus subtypes are either time-consuming and labor-intensive (culture-based) or remarkably insensitive (antibody-based). Molecular diagnostic assays based upon reverse transcriptase-polymerase chain reaction (RT-PCR) have short assay cycle time, and high analytical sensitivity and specificity. However, none of these diagnostic tests determine viral gene nucleotide sequences to distinguish strains and variants of a detected pathogen from one specimen to the next. Decision-quality, strain- and variant-specific pathogen gene sequence information may be critical for public health, infection control, surveillance, epidemiology, or medical/veterinary treatment planning. The Resequencing Pathogen Microarray (RPM-Flu) is a robust, highly multiplexed and target gene sequencing-based alternative to both traditional culture- or biomarker-based diagnostic tests. RPM-Flu is a single, simultaneous differential diagnostic assay for all subtype combinations of type A influenza viruses and for 30 other viral and bacterial pathogens that may cause influenza-like illness. These other pathogen targets of RPM-Flu may co-infect and compound the morbidity and/or mortality of patients with influenza. The informative specificity of a single RPM-Flu test represents specimen-specific viral gene sequences as determinants of virus type, A/HN subtype, virulence, host-range, and resistance to antiviral agents.
DNA methylation profiling using HpaII tiny fragment enrichment by ligation-mediated PCR (HELP)
Suzuki, Masako; Greally, John M.
2010-01-01
The HELP assay is a technique that allows genome-wide analysis of cytosine methylation. Here we describe the assay, its relative strengths and weaknesses, and the transition of the assay from a microarray to massively-parallel sequencing-based foundation. PMID:20434563
Yi, Ming; Stephens, Robert M.
2008-01-01
Analysis of microarray and other high throughput data often involves identification of genes consistently up or down-regulated across samples as the first step in extraction of biological meaning. This gene-level paradigm can be limited as a result of valid sample fluctuations and biological complexities. In this report, we describe a novel method, SLEPR, which eliminates this limitation by relying on pathway-level consistencies. Our method first selects the sample-level differentiated genes from each individual sample, capturing genes missed by other analysis methods, ascertains the enrichment levels of associated pathways from each of those lists, and then ranks annotated pathways based on the consistency of enrichment levels of individual samples from both sample classes. As a proof of concept, we have used this method to analyze three public microarray datasets with a direct comparison with the GSEA method, one of the most popular pathway-level analysis methods in the field. We found that our method was able to reproduce the earlier observations with significant improvements in depth of coverage for validated or expected biological themes, but also produced additional insights that make biological sense. This new method extends existing analyses approaches and facilitates integration of different types of HTP data. PMID:18818771
Wu, Hao; Wu, Runliu; Chen, Miao; Li, Daojiang; Dai, Jing; Zhang, Yi; Gao, Kai; Yu, Jun; Hu, Gui; Guo, Yihang; Lin, Changwei; Li, Xiaorong
2017-03-28
Growing evidence suggests that long non-coding RNAs (lncRNAs) play a key role in tumorigenesis. However, the mechanism remains largely unknown. Thousands of significantly dysregulated lncRNAs and mRNAs were identified by microarray. Furthermore, a miR-133b-meditated lncRNA-mRNA ceRNA network was revealed, a subset of which was validated in 14 paired CRC patient tumor/non-tumor samples. Gene set enrichment analysis (GSEA) results demonstrated that lncRNAs ENST00000520055 and ENST00000535511 shared KEGG pathways with miR-133b target genes. We used microarrays to survey the lncRNA and mRNA expression profiles of colorectal cancer and para-cancer tissues. Gene Ontology (GO) and KEGG pathway enrichment analyses were performed to explore the functions of the significantly dysregulated genes. An innovate method was employed that combined analyses of two microarray data sets to construct a miR-133b-mediated lncRNA-mRNA competing endogenous RNAs (ceRNA) network. Quantitative RT-PCR analysis was used to validate part of this network. GSEA was used to predict the potential functions of these lncRNAs. This study identifies and validates a new method to investigate the miR-133b-mediated lncRNA-mRNA ceRNA network and lays the foundation for future investigation into the role of lncRNAs in colorectal cancer.
Wimmer, Isabella; Tröscher, Anna R; Brunner, Florian; Rubino, Stephen J; Bien, Christian G; Weiner, Howard L; Lassmann, Hans; Bauer, Jan
2018-04-20
Formalin-fixed paraffin-embedded (FFPE) tissues are valuable resources commonly used in pathology. However, formalin fixation modifies nucleic acids challenging the isolation of high-quality RNA for genetic profiling. Here, we assessed feasibility and reliability of microarray studies analysing transcriptome data from fresh, fresh-frozen (FF) and FFPE tissues. We show that reproducible microarray data can be generated from only 2 ng FFPE-derived RNA. For RNA quality assessment, fragment size distribution (DV200) and qPCR proved most suitable. During RNA isolation, extending tissue lysis time to 10 hours reduced high-molecular-weight species, while additional incubation at 70 °C markedly increased RNA yields. Since FF- and FFPE-derived microarrays constitute different data entities, we used indirect measures to investigate gene signal variation and relative gene expression. Whole-genome analyses revealed high concordance rates, while reviewing on single-genes basis showed higher data variation in FFPE than FF arrays. Using an experimental model, gene set enrichment analysis (GSEA) of FFPE-derived microarrays and fresh tissue-derived RNA-Seq datasets yielded similarly affected pathways confirming the applicability of FFPE tissue in global gene expression analysis. Our study provides a workflow comprising RNA isolation, quality assessment and microarray profiling using minimal RNA input, thus enabling hypothesis-generating pathway analyses from limited amounts of precious, pathologically significant FFPE tissues.
Detecting discordance enrichment among a series of two-sample genome-wide expression data sets.
Lai, Yinglei; Zhang, Fanni; Nayak, Tapan K; Modarres, Reza; Lee, Norman H; McCaffrey, Timothy A
2017-01-25
With the current microarray and RNA-seq technologies, two-sample genome-wide expression data have been widely collected in biological and medical studies. The related differential expression analysis and gene set enrichment analysis have been frequently conducted. Integrative analysis can be conducted when multiple data sets are available. In practice, discordant molecular behaviors among a series of data sets can be of biological and clinical interest. In this study, a statistical method is proposed for detecting discordance gene set enrichment. Our method is based on a two-level multivariate normal mixture model. It is statistically efficient with linearly increased parameter space when the number of data sets is increased. The model-based probability of discordance enrichment can be calculated for gene set detection. We apply our method to a microarray expression data set collected from forty-five matched tumor/non-tumor pairs of tissues for studying pancreatic cancer. We divided the data set into a series of non-overlapping subsets according to the tumor/non-tumor paired expression ratio of gene PNLIP (pancreatic lipase, recently shown it association with pancreatic cancer). The log-ratio ranges from a negative value (e.g. more expressed in non-tumor tissue) to a positive value (e.g. more expressed in tumor tissue). Our purpose is to understand whether any gene sets are enriched in discordant behaviors among these subsets (when the log-ratio is increased from negative to positive). We focus on KEGG pathways. The detected pathways will be useful for our further understanding of the role of gene PNLIP in pancreatic cancer research. Among the top list of detected pathways, the neuroactive ligand receptor interaction and olfactory transduction pathways are the most significant two. Then, we consider gene TP53 that is well-known for its role as tumor suppressor in cancer research. The log-ratio also ranges from a negative value (e.g. more expressed in non-tumor tissue) to a positive value (e.g. more expressed in tumor tissue). We divided the microarray data set again according to the expression ratio of gene TP53. After the discordance enrichment analysis, we observed overall similar results and the above two pathways are still the most significant detections. More interestingly, only these two pathways have been identified for their association with pancreatic cancer in a pathway analysis of genome-wide association study (GWAS) data. This study illustrates that some disease-related pathways can be enriched in discordant molecular behaviors when an important disease-related gene changes its expression. Our proposed statistical method is useful in the detection of these pathways. Furthermore, our method can also be applied to genome-wide expression data collected by the recent RNA-seq technology.
Sitras, V; Fenton, C; Acharya, G
2015-02-01
Cardiovascular disease (CVD) and preeclampsia (PE) share common clinical features. We aimed to identify common transcriptomic signatures involved in CVD and PE in humans. Meta-analysis of individual raw microarray data deposited in GEO, obtained from blood samples of patients with CVD versus controls and placental samples from women with PE versus healthy women with uncomplicated pregnancies. Annotation of cases versus control samples was taken directly from the microarray documentation. Genes that showed a significant differential expression in the majority of experiments were selected for subsequent analysis. Hypergeometric gene list analysis was performed using Bioconductor GOstats package. Bioinformatic analysis was performed in PANTHER. Seven studies in CVD and 5 studies in PE were eligible for meta-analysis. A total of 181 genes were found to be differentially expressed in microarray studies investigating gene expression in blood samples obtained from patients with CVD compared to controls and 925 genes were differentially expressed between preeclamptic and healthy placentas. Among these differentially expressed genes, 22 were common between CVD and PE. Bioinformatic analysis of these genes revealed oxidative stress, p-53 pathway feedback, inflammation mediated by chemokines and cytokines, interleukin signaling, B-cell activation, PDGF signaling, Wnt signaling, integrin signaling and Alzheimer disease pathways to be involved in the pathophysiology of both CVD and PE. Metabolism, development, response to stimulus, immune response and cell communication were the associated biologic processes in both conditions. Gene set enrichment analysis showed the following overlapping pathways between CVD and PE: TGF-β-signaling, apoptosis, graft-versus-host disease, allograft rejection, chemokine signaling, steroid hormone synthesis, type I and II diabetes mellitus, VEGF signaling, pathways in cancer, GNRH signaling, Huntingtons disease and Notch signaling. CVD and PE share same common traits in their gene expression profile indicating common pathways in their pathophysiology. Copyright © 2014 Elsevier Ltd. All rights reserved.
Łastowska, M; Viprey, V; Santibanez-Koref, M; Wappler, I; Peters, H; Cullinane, C; Roberts, P; Hall, A G; Tweddle, D A; Pearson, A D J; Lewis, I; Burchill, S A; Jackson, M S
2007-11-22
Identifying genes, whose expression is consistently altered by chromosomal gains or losses, is an important step in defining genes of biological relevance in a wide variety of tumour types. However, additional criteria are needed to discriminate further among the large number of candidate genes identified. This is particularly true for neuroblastoma, where multiple genomic copy number changes of proven prognostic value exist. We have used Affymetrix microarrays and a combination of fluorescent in situ hybridization and single nucleotide polymorphism (SNP) microarrays to establish expression profiles and delineate copy number alterations in 30 primary neuroblastomas. Correlation of microarray data with patient survival and analysis of expression within rodent neuroblastoma cell lines were then used to define further genes likely to be involved in the disease process. Using this approach, we identify >1000 genes within eight recurrent genomic alterations (loss of 1p, 3p, 4p, 10q and 11q, 2p gain, 17q gain, and the MYCN amplicon) whose expression is consistently altered by copy number change. Of these, 84 correlate with patient survival, with the minimal regions of 17q gain and 4p loss being enriched significantly for such genes. These include genes involved in RNA and DNA metabolism, and apoptosis. Orthologues of all but one of these genes on 17q are overexpressed in rodent neuroblastoma cell lines. A significant excess of SNPs whose copy number correlates with survival is also observed on proximal 4p in stage 4 tumours, and we find that deletion of 4p is associated with improved outcome in an extended cohort of tumours. These results define the major impact of genomic copy number alterations upon transcription within neuroblastoma, and highlight genes on distal 17q and proximal 4p for downstream analyses. They also suggest that integration of discriminators, such as survival and comparative gene expression, with microarray data may be useful in the identification of critical genes within regions of loss or gain in many human cancers.
DRD4 genotype predicts longevity in mouse and human
Grady, Deborah L.; Thanos, Panayotis K.; Corrada, Maria M.; Barnett, Jeffrey C.; Ciobanu, Valentina; Shustarovich, Diana; Napoli, Anthony; Moyzis, Alexandra G.; Grandy, David; Rubinstein, Marcelo; Wang, Gene-Jack; H.Kawas, Claudia; Chen, Chuansheng; Dong, Qi; Wang, Eric; Volkow, Nora D.; Moyzis, Robert K.
2013-01-01
Longevity is influenced by genetic and environmental factors. The brain's dopamine system may be particularly relevant, since it modulates traits (e.g., sensitivity to reward, incentive motivation, sustained effort) that impact behavioral responses to the environment. In particular, the dopamine D4 receptor (DRD4) has been shown to moderate the impact of environments on behavior and health. We tested the hypothesis that the DRD4 gene influences longevity and that its impact is mediated through environmental effects. Surviving participants of a 30 year-old population-based health survey (N=310, age range 90–109; the 90+ Study) were genotyped/resequenced at the DRD4 gene, and compared to a European ancestry-matched younger population (N=2902, age range 7–45). We found that the oldest-old population had a 66% increase in individuals carrying the DRD4 7R allele relative to the younger sample (p=3.5 × 10−9), and that this genotype was strongly correlated with increased levels of physical activity. Consistent with these results, DRD4 knockout mice, when compared to wild-type and heterozygous mice, displayed a 7–9.7% decrease in lifespan, reduced spontaneous locomotor activity, and no lifespan increase when reared in an enriched environment. These results support the hypothesis that DRD4 gene variants contribute to longevity in humans and in mice, and suggest that this effect is mediated by shaping behavioral responses to the environment. PMID:23283341
Alkan, Can; Kavak, Pinar; Somel, Mehmet; Gokcumen, Omer; Ugurlu, Serkan; Saygi, Ceren; Dal, Elif; Bugra, Kuyas; Güngör, Tunga; Sahinalp, S Cenk; Özören, Nesrin; Bekpen, Cemalettin
2014-11-07
Turkey is a crossroads of major population movements throughout history and has been a hotspot of cultural interactions. Several studies have investigated the complex population history of Turkey through a limited set of genetic markers. However, to date, there have been no studies to assess the genetic variation at the whole genome level using whole genome sequencing. Here, we present whole genome sequences of 16 Turkish individuals resequenced at high coverage (32×-48×). We show that the genetic variation of the contemporary Turkish population clusters with South European populations, as expected, but also shows signatures of relatively recent contribution from ancestral East Asian populations. In addition, we document a significant enrichment of non-synonymous private alleles, consistent with recent observations in European populations. A number of variants associated with skin color and total cholesterol levels show frequency differentiation between the Turkish populations and European populations. Furthermore, we have analyzed the 17q21.31 inversion polymorphism region (MAPT locus) and found increased allele frequency of 31.25% for H1/H2 inversion polymorphism when compared to European populations that show about 25% of allele frequency. This study provides the first map of common genetic variation from 16 western Asian individuals and thus helps fill an important geographical gap in analyzing natural human variation and human migration. Our data will help develop population-specific experimental designs for studies investigating disease associations and demographic history in Turkey.
2006-04-27
polysaccharide microarray platform was prepared by immobilizing Burkholderia pseudomallei and Burkholderia mallei polysaccharides . This... polysaccharide array was tested with success for detecting B. pseudomallei and B. mallei serum (human and animal) antibodies. The advantages of this microarray... Polysaccharide microarrays; Burkholderia pseudomallei; Burkholderia mallei; Glanders; Melioidosis1. Introduction There has been a great deal of emphasis on the
Advances in cell-free protein array methods.
Yu, Xiaobo; Petritis, Brianne; Duan, Hu; Xu, Danke; LaBaer, Joshua
2018-01-01
Cell-free protein microarrays represent a special form of protein microarray which display proteins made fresh at the time of the experiment, avoiding storage and denaturation. They have been used increasingly in basic and translational research over the past decade to study protein-protein interactions, the pathogen-host relationship, post-translational modifications, and antibody biomarkers of different human diseases. Their role in the first blood-based diagnostic test for early stage breast cancer highlights their value in managing human health. Cell-free protein microarrays will continue to evolve to become widespread tools for research and clinical management. Areas covered: We review the advantages and disadvantages of different cell-free protein arrays, with an emphasis on the methods that have been studied in the last five years. We also discuss the applications of each microarray method. Expert commentary: Given the growing roles and impact of cell-free protein microarrays in research and medicine, we discuss: 1) the current technical and practical limitations of cell-free protein microarrays; 2) the biomarker discovery and verification pipeline using protein microarrays; and 3) how cell-free protein microarrays will advance over the next five years, both in their technology and applications.
Gagliano, Sarah A; Ravji, Reena; Barnes, Michael R; Weale, Michael E; Knight, Jo
2015-08-24
Although technology has triumphed in facilitating routine genome sequencing, new challenges have been created for the data-analyst. Genome-scale surveys of human variation generate volumes of data that far exceed capabilities for laboratory characterization. By incorporating functional annotations as predictors, statistical learning has been widely investigated for prioritizing genetic variants likely to be associated with complex disease. We compared three published prioritization procedures, which use different statistical learning algorithms and different predictors with regard to the quantity, type and coding. We also explored different combinations of algorithm and annotation set. As an application, we tested which methodology performed best for prioritizing variants using data from a large schizophrenia meta-analysis by the Psychiatric Genomics Consortium. Results suggest that all methods have considerable (and similar) predictive accuracies (AUCs 0.64-0.71) in test set data, but there is more variability in the application to the schizophrenia GWAS. In conclusion, a variety of algorithms and annotations seem to have a similar potential to effectively enrich true risk variants in genome-scale datasets, however none offer more than incremental improvement in prediction. We discuss how methods might be evolved for risk variant prediction to address the impending bottleneck of the new generation of genome re-sequencing studies.
Doderer, Stefan A; Gäbel, Gabor; Kokje, Vivianne B C; Northoff, Bernd H; Holdt, Lesca M; Hamming, Jaap F; Lindeman, Jan H N
2018-06-01
The processes driving human abdominal aortic aneurysm (AAA) progression are not fully understood. Although antiinflammatory and proteolytic strategies effectively quench aneurysm progression in preclinical models, so far all clinical interventions failed. These observations hint at an incomplete understanding of the processes involved in AAA progression and rupture. Interestingly, strong clinical and molecular associations exist between popliteal artery aneurysms (PAAs) and AAAs; however, PAAs have an extremely low propensity to rupture. We thus reasoned that differences between these aneurysms may provide clues toward (auxiliary) processes involved in AAA-related wall debilitation. A better understanding of the pathophysiologic processes driving AAA growth can contribute to pharmaceutical treatments in the future. Aneurysmal wall samples were collected during open elective and emergency repair. Control perirenal aorta was obtained during kidney transplantation, and reference popliteal tissue obtained from the anatomy department. This study incorporates various techniques including (immuno)histochemistry, Western Blot, quantitative polymerase chain reaction, microarray, and cell culture. Histologic evaluation of AAAs, PAAs, and control aorta shows extensive medial (PAA) and transmural fibrosis (AAA), and reveals abundant adventitial adipocytes aggregates as an exclusive phenomenon of AAAs (P < .001). Quantitative polymerase chain reaction, immunohistochemistry, Western blotting, and microarray analysis showed enrichment of adipogenic mediators (C/EBP family P = .027; KLF5 P < .000; and peroxisome proliferator activated receptor-γ, P = .032) in AAA tissue. In vitro differentiation tests indicated a sharply increased adipogenic potential of AAA adventitial mesenchymal cells (P < .0001). Observed enrichment of adipocyte-related genes and pathways in ruptured AAA (P < .0003) supports an association between the extent of fatty degeneration and rupture. This translational study identifies extensive adventitial fatty degeneration as an ignored and distinctive feature of AAA disease. Enrichment of adipocyte genesis and adipocyte-related genes in ruptured AAA point to an association between the extent of fatty degeneration and rupture. This observation may (partly) explain the failure of medical therapy and could provide a lead for pharmaceutical alleviation of AAA progression. Copyright © 2017 Society for Vascular Surgery. Published by Elsevier Inc. All rights reserved.
Implementation of GenePattern within the Stanford Microarray Database.
Hubble, Jeremy; Demeter, Janos; Jin, Heng; Mao, Maria; Nitzberg, Michael; Reddy, T B K; Wymore, Farrell; Zachariah, Zachariah K; Sherlock, Gavin; Ball, Catherine A
2009-01-01
Hundreds of researchers across the world use the Stanford Microarray Database (SMD; http://smd.stanford.edu/) to store, annotate, view, analyze and share microarray data. In addition to providing registered users at Stanford access to their own data, SMD also provides access to public data, and tools with which to analyze those data, to any public user anywhere in the world. Previously, the addition of new microarray data analysis tools to SMD has been limited by available engineering resources, and in addition, the existing suite of tools did not provide a simple way to design, execute and share analysis pipelines, or to document such pipelines for the purposes of publication. To address this, we have incorporated the GenePattern software package directly into SMD, providing access to many new analysis tools, as well as a plug-in architecture that allows users to directly integrate and share additional tools through SMD. In this article, we describe our implementation of the GenePattern microarray analysis software package into the SMD code base. This extension is available with the SMD source code that is fully and freely available to others under an Open Source license, enabling other groups to create a local installation of SMD with an enriched data analysis capability.
Galectins are human milk glycan receptors
Noll, Alexander J; Gourdine, Jean-Philippe; Yu, Ying; Lasanajak, Yi; Smith, David F; Cummings, Richard D
2016-01-01
The biological recognition of human milk glycans (HMGs) is poorly understood. Because HMGs are rich in galactose we explored whether they might interact with human galectins, which bind galactose-containing glycans and are highly expressed in epithelial cells and other cell types. We screened a number of human galectins for their binding to HMGs on a shotgun glycan microarray consisting of 247 HMGs derived from human milk, as well as to a defined HMG microarray. Recombinant human galectins (hGal)-1, -3, -4, -7, -8 and -9 bound selectively to glycans, with each galectin recognizing a relatively unique binding motif; by contrast hGal-2 did not recognize HMGs, but did bind to the human blood group A Type 2 determinants on other microarrays. Unlike other galectins, hGal-7 preferentially bound to glycans expressing a terminal Type 1 (Galβ1-3GlcNAc) sequence, a motif that had eluded detection on non-HMG glycan microarrays. Interactions with HMGs were confirmed in a solution setting by isothermal titration microcalorimetry and hapten inhibition experiments. These results demonstrate that galectins selectively bind to HMGs and suggest the possibility that galectin–HMG interactions may play a role in infant immunity. PMID:26747425
Visschedijk, Marijn C; Alberts, Rudi; Mucha, Soren; Deelen, Patrick; de Jong, Dirk J; Pierik, Marieke; Spekhorst, Lieke M; Imhann, Floris; van der Meulen-de Jong, Andrea E; van der Woude, C Janneke; van Bodegraven, Adriaan A; Oldenburg, Bas; Löwenberg, Mark; Dijkstra, Gerard; Ellinghaus, David; Schreiber, Stefan; Wijmenga, Cisca; Rivas, Manuel A; Franke, Andre; van Diemen, Cleo C; Weersma, Rinse K
2016-01-01
Genome-wide association studies have revealed several common genetic risk variants for ulcerative colitis (UC). However, little is known about the contribution of rare, large effect genetic variants to UC susceptibility. In this study, we performed a deep targeted re-sequencing of 122 genes in Dutch UC patients in order to investigate the contribution of rare variants to the genetic susceptibility to UC. The selection of genes consists of 111 established human UC susceptibility genes and 11 genes that lead to spontaneous colitis when knocked-out in mice. In addition, we sequenced the promoter regions of 45 genes where known variants exert cis-eQTL-effects. Targeted pooled re-sequencing was performed on DNA of 790 Dutch UC cases. The Genome of the Netherlands project provided sequence data of 500 healthy controls. After quality control and prioritization based on allele frequency and pathogenicity probability, follow-up genotyping of 171 rare variants was performed on 1021 Dutch UC cases and 1166 Dutch controls. Single-variant association and gene-based analyses identified an association of rare variants in the MUC2 gene with UC. The associated variants in the Dutch population could not be replicated in a German replication cohort (1026 UC cases, 3532 controls). In conclusion, this study has identified a putative role for MUC2 on UC susceptibility in the Dutch population and suggests a population-specific contribution of rare variants to UC.
Bao, Weier; Greenwold, Matthew J; Sawyer, Roger H
2017-11-01
Gene co-expression network analysis has been a research method widely used in systematically exploring gene function and interaction. Using the Weighted Gene Co-expression Network Analysis (WGCNA) approach to construct a gene co-expression network using data from a customized 44K microarray transcriptome of chicken epidermal embryogenesis, we have identified two distinct modules that are highly correlated with scale or feather development traits. Signaling pathways related to feather development were enriched in the traditional KEGG pathway analysis and functional terms relating specifically to embryonic epidermal development were also enriched in the Gene Ontology analysis. Significant enrichment annotations were discovered from customized enrichment tools such as Modular Single-Set Enrichment Test (MSET) and Medical Subject Headings (MeSH). Hub genes in both trait-correlated modules showed strong specific functional enrichment toward epidermal development. Also, regulatory elements, such as transcription factors and miRNAs, were targeted in the significant enrichment result. This work highlights the advantage of this methodology for functional prediction of genes not previously associated with scale- and feather trait-related modules.
Elsafadi, Mona; Manikandan, Muthurangan; Almalki, Sami; Mobarak, Mohammad; Atteya, Muhammad; Iqbal, Zafar; Hashmi, Jamil Amjad; Shaheen, Sameerah; Alajez, Nehad; Alfayez, Musaad; Kassem, Moustapha; Dawud, Raed Abu; Mahmood, Amer
2018-01-01
TGF β is a potent regulator of several biological functions in many cell types, but its role in the differentiation of human bone marrow-derived skeletal stem cells (hMSCs) is currently poorly understood. In the present study, we demonstrate that a single dose of TGF β 1 prior to induction of osteogenic or adipogenic differentiation results in increased mineralized matrix or increased numbers of lipid-filled mature adipocytes, respectively. To identify the mechanisms underlying this TGF β -mediated enhancement of lineage commitment, we compared the gene expression profiles of TGF β 1-treated hMSC cultures using DNA microarrays. In total, 1932 genes were upregulated, and 1298 genes were downregulated. Bioinformatics analysis revealed that TGF β l treatment was associated with an enrichment of genes in the skeletal and extracellular matrix categories and the regulation of the actin cytoskeleton. To investigate further, we examined the actin cytoskeleton following treatment with TGF β 1 and/or cytochalasin D. Interestingly, cytochalasin D treatment of hMSCs enhanced adipogenic differentiation but inhibited osteogenic differentiation. Global gene expression profiling revealed a significant enrichment of pathways related to osteogenesis and adipogenesis and of genes regulated by both TGF β 1 and cytochalasin D. Our study demonstrates that TGF β 1 enhances hMSC commitment to either the osteogenic or adipogenic lineages by reorganizing the actin cytoskeleton.
Peters, Derek T.; Henderson, Christopher A.; Warren, Curtis R.; Friesen, Max; Xia, Fang; Becker, Caroline E.; Musunuru, Kiran; Cowan, Chad A.
2016-01-01
ABSTRACT Hepatocyte-like cells (HLCs) are derived from human pluripotent stem cells (hPSCs) in vitro, but differentiation protocols commonly give rise to a heterogeneous mixture of cells. This variability confounds the evaluation of in vitro functional assays performed using HLCs. Increased differentiation efficiency and more accurate approximation of the in vivo hepatocyte gene expression profile would improve the utility of hPSCs. Towards this goal, we demonstrate the purification of a subpopulation of functional HLCs using the hepatocyte surface marker asialoglycoprotein receptor 1 (ASGR1). We analyzed the expression profile of ASGR1-positive cells by microarray, and tested their ability to perform mature hepatocyte functions (albumin and urea secretion, cytochrome activity). By these measures, ASGR1-positive HLCs are enriched for the gene expression profile and functional characteristics of primary hepatocytes compared with unsorted HLCs. We have demonstrated that ASGR1-positive sorting isolates a functional subpopulation of HLCs from among the heterogeneous cellular population produced by directed differentiation. PMID:27143754
Challenges of microarray applications for microbial detection and gene expression profiling in food
USDA-ARS?s Scientific Manuscript database
Microarray technology represents one of the latest advances in molecular biology. The diverse types of microarrays have been applied to clinical and environmental microbiology, microbial ecology, and in human, veterinary, and plant diagnostics. Since multiple genes can be analyzed simultaneously, ...
Wang, Hong; Brautigan, David L
2006-11-01
Human lemur (Lmr) kinases are predicted to be Tyr kinases based on sequences and are related to neurotrophin receptor Trk kinases. This study used homogeneous recombinant KPI-2 (Lmr2, LMTK2, Cprk, brain-enriched protein kinase) kinase domain and a library of 1,154 peptides on a microarray to analyze substrate specificity. We found that KPI-2 is strictly a Ser/Thr kinase that reacts with Ser either preceded by or followed by Pro residues but unlike other Pro-directed kinases does not strictly require an adjacent Pro residue. The most reactive peptide in the library corresponds to Ser-737 of cystic fibrosis transmembrane conductance regulator, and the recombinant R domain of cystic fibrosis transmembrane conductance regulator was a preferred substrate. Furthermore the KPI-2 kinase phosphorylated peptides corresponding to the single site in phosphorylase and purified phosphorylase b, making this only the second known phosphorylase b kinase. Phosphorylase was used as a specific substrate to show that KPI-2 is inhibited in living cells by addition of nerve growth factor or serum. The results demonstrate the utility of the peptide library to probe specificity and discover kinase substrates and offer a specific assay that reveals hormonal regulation of the activity of this unusual transmembrane kinase.
Serum miRNAs Signature Plays an Important Role in Keloid Disease.
Luan, Y; Liu, Y; Liu, C; Lin, Q; He, F; Dong, X; Xiao, Z
2016-01-01
The molecular mechanism underlying the pathogenesis of keloid is largely unknown. MicroRNA (miRNA) is a class of small regulatory RNA that has emerged as a group of posttranscriptional gene repressors, participating in diverse pathophysiological processes of skin diseases. We investigated the expression profiles of miRNAs in the sera of patients to decipher the complicated factors involved in the development of keloid disease. MiRNA expression profiling in the sera from 9 keloid patients and 7 normal controls were characterized using a miRNA microarray containing established human mature and precursor miRNA sequences. Quantitative real-time PCR was performed to confirm the expression of miRNAs. The putative targets of differentially expressed miRNAs were functionally annotated by bioinformatics. MiRNA microarray analysis identified 37 differentially expressed miRNAs (17 upregulated and 20 downregulated) in keloid patients, compared to the healthy controls. Functional annotations revealed that the targets of those differentially expressed miRNAs were enriched in signaling pathways essential for scar formation and wound healing. The expression profiling of miRNAs is altered in the keloid, providing a clue for the molecular mechanisms underlying its initiation and progression. MiRNAs may partly contribute to the etiology of keloids by affecting the critical signaling pathways relevant to keloid pathogenesis.
Targeted resequencing in peanuts using the fluidigm access array
USDA-ARS?s Scientific Manuscript database
The presence of homoeologous gene copies in allotetraploid peanut makes it challenging to select homologous SNPs differentiating two or more cultivars. An integrated approach of improved bioinformatics and targeted resequencing to select homologous SNPs in tetraploid peanut is needed. Raw transcrip...
A microarray for assessing transcription from pelagic marine microbial taxa
Shilova, Irina N; Robidart, Julie C; James Tripp, H; Turk-Kubo, Kendra; Wawrik, Boris; Post, Anton F; Thompson, Anne W; Ward, Bess; Hollibaugh, James T; Millard, Andy; Ostrowski, Martin; J Scanlan, David; Paerl, Ryan W; Stuart, Rhona; Zehr, Jonathan P
2014-01-01
Metagenomic approaches have revealed unprecedented genetic diversity within microbial communities across vast expanses of the world's oceans. Linking this genetic diversity with key metabolic and cellular activities of microbial assemblages is a fundamental challenge. Here we report on a collaborative effort to design MicroTOOLs (Microbiological Targets for Ocean Observing Laboratories), a high-density oligonucleotide microarray that targets functional genes of diverse taxa in pelagic and coastal marine microbial communities. MicroTOOLs integrates nucleotide sequence information from disparate data types: genomes, PCR-amplicons, metagenomes, and metatranscriptomes. It targets 19 400 unique sequences over 145 different genes that are relevant to stress responses and microbial metabolism across the three domains of life and viruses. MicroTOOLs was used in a proof-of-concept experiment that compared the functional responses of microbial communities following Fe and P enrichments of surface water samples from the North Pacific Subtropical Gyre. We detected transcription of 68% of the gene targets across major taxonomic groups, and the pattern of transcription indicated relief from Fe limitation and transition to N limitation in some taxa. Prochlorococcus (eHLI), Synechococcus (sub-cluster 5.3) and Alphaproteobacteria SAR11 clade (HIMB59) showed the strongest responses to the Fe enrichment. In addition, members of uncharacterized lineages also responded. The MicroTOOLs microarray provides a robust tool for comprehensive characterization of major functional groups of microbes in the open ocean, and the design can be easily amended for specific environments and research questions. PMID:24477198
Pathway Distiller - multisource biological pathway consolidation
2012-01-01
Background One method to understand and evaluate an experiment that produces a large set of genes, such as a gene expression microarray analysis, is to identify overrepresentation or enrichment for biological pathways. Because pathways are able to functionally describe the set of genes, much effort has been made to collect curated biological pathways into publicly accessible databases. When combining disparate databases, highly related or redundant pathways exist, making their consolidation into pathway concepts essential. This will facilitate unbiased, comprehensive yet streamlined analysis of experiments that result in large gene sets. Methods After gene set enrichment finds representative pathways for large gene sets, pathways are consolidated into representative pathway concepts. Three complementary, but different methods of pathway consolidation are explored. Enrichment Consolidation combines the set of the pathways enriched for the signature gene list through iterative combining of enriched pathways with other pathways with similar signature gene sets; Weighted Consolidation utilizes a Protein-Protein Interaction network based gene-weighting approach that finds clusters of both enriched and non-enriched pathways limited to the experiments' resultant gene list; and finally the de novo Consolidation method uses several measurements of pathway similarity, that finds static pathway clusters independent of any given experiment. Results We demonstrate that the three consolidation methods provide unified yet different functional insights of a resultant gene set derived from a genome-wide profiling experiment. Results from the methods are presented, demonstrating their applications in biological studies and comparing with a pathway web-based framework that also combines several pathway databases. Additionally a web-based consolidation framework that encompasses all three methods discussed in this paper, Pathway Distiller (http://cbbiweb.uthscsa.edu/PathwayDistiller), is established to allow researchers access to the methods and example microarray data described in this manuscript, and the ability to analyze their own gene list by using our unique consolidation methods. Conclusions By combining several pathway systems, implementing different, but complementary pathway consolidation methods, and providing a user-friendly web-accessible tool, we have enabled users the ability to extract functional explanations of their genome wide experiments. PMID:23134636
Pathway Distiller - multisource biological pathway consolidation.
Doderer, Mark S; Anguiano, Zachry; Suresh, Uthra; Dashnamoorthy, Ravi; Bishop, Alexander J R; Chen, Yidong
2012-01-01
One method to understand and evaluate an experiment that produces a large set of genes, such as a gene expression microarray analysis, is to identify overrepresentation or enrichment for biological pathways. Because pathways are able to functionally describe the set of genes, much effort has been made to collect curated biological pathways into publicly accessible databases. When combining disparate databases, highly related or redundant pathways exist, making their consolidation into pathway concepts essential. This will facilitate unbiased, comprehensive yet streamlined analysis of experiments that result in large gene sets. After gene set enrichment finds representative pathways for large gene sets, pathways are consolidated into representative pathway concepts. Three complementary, but different methods of pathway consolidation are explored. Enrichment Consolidation combines the set of the pathways enriched for the signature gene list through iterative combining of enriched pathways with other pathways with similar signature gene sets; Weighted Consolidation utilizes a Protein-Protein Interaction network based gene-weighting approach that finds clusters of both enriched and non-enriched pathways limited to the experiments' resultant gene list; and finally the de novo Consolidation method uses several measurements of pathway similarity, that finds static pathway clusters independent of any given experiment. We demonstrate that the three consolidation methods provide unified yet different functional insights of a resultant gene set derived from a genome-wide profiling experiment. Results from the methods are presented, demonstrating their applications in biological studies and comparing with a pathway web-based framework that also combines several pathway databases. Additionally a web-based consolidation framework that encompasses all three methods discussed in this paper, Pathway Distiller (http://cbbiweb.uthscsa.edu/PathwayDistiller), is established to allow researchers access to the methods and example microarray data described in this manuscript, and the ability to analyze their own gene list by using our unique consolidation methods. By combining several pathway systems, implementing different, but complementary pathway consolidation methods, and providing a user-friendly web-accessible tool, we have enabled users the ability to extract functional explanations of their genome wide experiments.
USDA-ARS?s Scientific Manuscript database
The long-term goal of our study is to understand the genetic and epigenetic mechanisms of breast cancer metastasis in human and to discover new possible genetic markers for use in clinical practice. We have used microarray technology (Human OneArray microarray, phylanxbiotech.com) to compare gene ex...
Cheng, Feng; Wu, Jian; Cai, Chengcheng; Fu, Lixia; Liang, Jianli; Borm, Theo; Zhuang, Mu; Zhang, Yangyong; Zhang, Fenglan; Bonnema, Guusje; Wang, Xiaowu
2016-12-20
The closely related species Brassica rapa and B. oleracea encompass a wide range of vegetable, fodder and oil crops. The release of their reference genomes has facilitated resequencing collections of B. rapa and B. oleracea aiming to build their variome datasets. These data can be used to investigate the evolutionary relationships between and within the different species and the domestication of the crops, hereafter named morphotypes. These data can also be used in genetic studies aiming at the identification of genes that influence agronomic traits. We selected and resequenced 199 B. rapa and 119 B. oleracea accessions representing 12 and nine morphotypes, respectively. Based on these resequencing data, we obtained 2,249,473 and 3,852,169 high quality SNPs (single-nucleotide polymorphisms), as well as 303,617 and 417,004 InDels for the B. rapa and B. oleracea populations, respectively. The variome datasets of B. rapa and B. oleracea represent valuable resources to researchers working on evolution, domestication or breeding of Brassica vegetable crops.
Egawa, Jun; Watanabe, Yuichiro; Shibuya, Masako; Endo, Taro; Sugimoto, Atsunori; Igeta, Hirofumi; Nunokawa, Ayako; Inoue, Emiko; Someya, Toshiyuki
2015-03-01
The oxytocin receptor (OXTR) is implicated in the pathophysiology of autism spectrum disorder (ASD). A recent study found a rare non-synonymous OXTR gene variation, rs35062132 (R376G), associated with ASD in a Japanese population. In order to investigate the association between rare non-synonymous OXTR variations and ASD, we resequenced OXTR and performed association analysis with ASD in a Japanese population. We resequenced the OXTR coding region in 213 ASD patients. Rare non-synonymous OXTR variations detected by resequencing were genotyped in 213 patients and 667 controls. We detected three rare non-synonymous variations: rs35062132 (R376G/C), rs151257822 (G334D), and g.8809426G>T (R150S). However, there was no significant association between these rare non-synonymous variations and ASD. Our present study does not support the contribution of rare non-synonymous OXTR variations to ASD susceptibility in the Japanese population. © 2014 The Authors. Psychiatry and Clinical Neurosciences © 2014 Japanese Society of Psychiatry and Neurology.
Beinke, C; Port, M; Ullmann, R; Gilbertz, K; Majewski, M; Abend, M
2018-06-01
Dicentric chromosome analysis (DCA) is the gold standard for individual radiation dose assessment. However, DCA is limited by the time-consuming phytohemagglutinin (PHA)-mediated lymphocyte activation. In this study using human peripheral blood lymphocytes, we investigated PHA-associated whole genome gene expression changes to elucidate this process and sought to identify suitable gene targets as a means of meeting our long-term objective of accelerating cell cycle kinetics to reduce DCA culture time. Human peripheral whole blood from three healthy donors was separately cultured in RPMI/FCS/antibiotics with BrdU and PHA-M. Diluted whole blood samples were transferred into PAXgene tubes at 0, 12, 24 and 36 h culture time. RNA was isolated and aliquots were used for whole genome gene expression screening. Microarray results were validated using qRT-PCR and differentially expressed genes [significantly (FDR corrected) twofold different from the 0 h value reference] were analyzed using several bioinformatic tools. The cell cycle positions and DNA-synthetic activities of lymphocytes were determined by analyzing the correlated total DNA content and incorporated BrdU level with flow cytometry after continued BrdU incubation. From 42,545 transcripts of the whole genome microarray 47.6%, on average, appeared expressed. The number of differentially expressed genes increased linearly from 855 to 2,858 and 4,607 at 12, 24 and 36 h after PHA addition, respectively. Approximately 2-3 times more up- than downregulated genes were observed with several hundred genes differentially expressed at each time point. Earliest enrichment was observed for gene sets related to the nucleus (12 h) followed by genes assigned to intracellular structures such as organelles (24 h) and finally genes related to the membrane and the extracellular matrix were enriched (36 h). Early gene expression changes at 12 h, in particular, were associated with protein classes such as chemokines/cytokines (e.g., CXCL1, CXCL2) and chaperones. Genes coding for biological processes involved in cell cycle control (e.g., MYBL2, RBL1, CCNA, CCNE) and DNA replication (e.g., POLA, POLE, MCM) appeared enriched at 24 h and later, but many more biological processes (42 altogether) showed enrichment as well. Flow cytometry data fit together with gene expression and bioinformatic analyses as cell cycle transition into S phase was observed with interindividual differences from 12 h onward, whereas progression into G 2 as well as into the second G 1 occurred from 36 h onward after activation. Gene set enrichment analysis over time identifies, in particular, two molecular categories of PHA-responsive gene targets (cytokine and cell cycle control genes). Based on that analysis target genes for cell cycle acceleration in lymphocytes have been identified ( CDKN1A/B/C, RBL-1/RBL-2, E2F2, Deaf-1), and it remains undetermined whether the time expenditure for DCA can be reduced by influencing gene expression involved in the regulatory circuits controlling PHA-associated cell cycle entry and/or progression at a specific early cell cycle phase.
Potentials and capabilities of the Extracellular Vesicle (EV) Array.
Jørgensen, Malene Møller; Bæk, Rikke; Varming, Kim
2015-01-01
Extracellular vesicles (EVs) and exosomes are difficult to enrich or purify from biofluids, hence quantification and phenotyping of these are tedious and inaccurate. The multiplexed, highly sensitive and high-throughput platform of the EV Array presented by Jørgensen et al., (J Extracell Vesicles, 2013; 2: 10) has been refined regarding the capabilities of the method for characterization and molecular profiling of EV surface markers. Here, we present an extended microarray platform to detect and phenotype plasma-derived EVs (optimized for exosomes) for up to 60 antigens without any enrichment or purification prior to analysis.
A VEGF-dependent gene signature enriched in mesenchymal ovarian cancer predicts patient prognosis.
Yin, Xia; Wang, Xiaojie; Shen, Boqiang; Jing, Ying; Li, Qing; Cai, Mei-Chun; Gu, Zhuowei; Yang, Qi; Zhang, Zhenfeng; Liu, Jin; Li, Hongxia; Di, Wen; Zhuang, Guanglei
2016-08-08
We have previously reported surrogate biomarkers of VEGF pathway activities with the potential to provide predictive information for anti-VEGF therapies. The aim of this study was to systematically evaluate a new VEGF-dependent gene signature (VDGs) in relation to molecular subtypes of ovarian cancer and patient prognosis. Using microarray profiling and cross-species analysis, we identified 140-gene mouse VDGs and corresponding 139-gene human VDGs, which displayed enrichment of vasculature and basement membrane genes. In patients who received bevacizumab therapy and showed partial response, the expressions of VDGs (summarized to yield VDGs scores) were markedly decreased in post-treatment biopsies compared with pre-treatment baselines. In contrast, VDGs scores were not significantly altered following bevacizumab treatment in patients with stable or progressive disease. Analysis of VDGs in ovarian cancer showed that VDGs as a prognostic signature was able to predict patient outcome. Correlation estimation of VDGs scores and molecular features revealed that VDGs was overrepresented in mesenchymal subtype and BRCA mutation carriers. These findings highlighted the prognostic role of VEGF-mediated angiogenesis in ovarian cancer, and proposed a VEGF-dependent gene signature as a molecular basis for developing novel diagnostic strategies to aid patient selection for VEGF-targeted agents.
Low-density microarray technologies for rapid human norovirus genotyping
USDA-ARS?s Scientific Manuscript database
Human noroviruses cause up to 21 million cases of foodborne disease in the United States annually and are the most common cause of acute gastroenteritis in industrialized countries. To reduce the burden of foodborne disease associated with viruses, the use of low density DNA microarrays in conjuncti...
Evaluation of the skin irritation using a DNA microarray on a reconstructed human epidermal model.
Niwa, Makoto; Nagai, Kanji; Oike, Hideaki; Kobori, Masuko
2009-02-01
To avoid the need to use animals to test the skin irritancy potential of chemicals and cosmetics, it is important to establish an in vitro method based on the reconstructed human epidermal model. To evaluate skin irritancy efficiently and sensitively, we determined the gene expression induced by a topically-applied mild irritant sodium dodecyl sulfate (SDS) in a reconstructed human epidermal model LabCyte EPI-MODEL (LabCyte) using a DNA microarray carrying genes that were related to inflammation, immunity, stress and housekeeping. The expression and secretion of IL-1alpha in reconstructed human epidermal culture is known to be induced by irritation. We detected the induction of IL-1alpha expression and its secretion into the cell culture medium by treatment with 0.075% SDS for 18 h in LabCyte culture using DNA microarray, quantitative reverse-transcription polymerase chain reaction (RT-PCR) and ELISA. DNA microarray analysis indicated that the expression of 10 of the 205 genes carried on the DNA microarray was significantly induced in a LabCyte culture by 0.05% or 0.075% SDS irritation for 18 h. RT-PCR analysis confirmed that SDS treatment significantly induced the expressions of interleukin-1 receptor antagonist (IL-1RN), FOS-like antigen 1 (FOSL1), heat shock 70 kDa protein 1A (HSPA1) and myeloid differentiation primary response gene (88) (MYD88), as well as the known marker genes for irritation IL-1beta and IL-8 in a LabCyte culture. Our results showed that a DNA microarray is a useful tool for efficiently evaluating mild skin irritation using a reconstructed human epidermal model.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gao, Xiugong, E-mail: xiugong.gao@fda.hhs.gov; Sprando, Robert L.; Yourick, Jeffrey J.
Developmental toxicity testing has traditionally relied on animal models which are costly, time consuming, and require the sacrifice of large numbers of animals. In addition, there are significant disparities between human beings and animals in their responses to chemicals. Thalidomide is a species-specific developmental toxicant that causes severe limb malformations in humans but not in mice. Here, we used microarrays to study transcriptomic changes induced by thalidomide in an in vitro model based on differentiation of mouse embryonic stem cells (mESCs). C57BL/6 mESCs were allowed to differentiate spontaneously and RNA was collected at 24, 48, and 72 h after exposuremore » to 0.25 mM thalidomide. Global gene expression analysis using microarrays revealed hundreds of differentially expressed genes upon thalidomide exposure that were enriched in gene ontology (GO) terms and canonical pathways associated with embryonic development and differentiation. In addition, many genes were found to be involved in small GTPases-mediated signal transduction, heart development, and inflammatory responses, which coincide with clinical evidences and may represent critical embryotoxicities of thalidomide. These results demonstrate that transcriptomics in combination with mouse embryonic stem cell differentiation is a promising alternative model for developmental toxicity assessment. - Highlights: • Studied genomic changes in mouse embryonic stem cells upon thalidomide exposure • Identified gene expression changes that may represent thalidomide embryotoxicity • The toxicogenomic changes coincide well with known thalidomide clinical outcomes. • The mouse embryonic stem cell model is suitable for developmental toxicity testing. • The model has the potential for high-throughput screening of a multitude of compounds.« less
A Discovery Resource of Rare Copy Number Variations in Individuals with Autism Spectrum Disorder
Prasad, Aparna; Merico, Daniele; Thiruvahindrapuram, Bhooma; Wei, John; Lionel, Anath C.; Sato, Daisuke; Rickaby, Jessica; Lu, Chao; Szatmari, Peter; Roberts, Wendy; Fernandez, Bridget A.; Marshall, Christian R.; Hatchwell, Eli; Eis, Peggy S.; Scherer, Stephen W.
2012-01-01
The identification of rare inherited and de novo copy number variations (CNVs) in human subjects has proven a productive approach to highlight risk genes for autism spectrum disorder (ASD). A variety of microarrays are available to detect CNVs, including single-nucleotide polymorphism (SNP) arrays and comparative genomic hybridization (CGH) arrays. Here, we examine a cohort of 696 unrelated ASD cases using a high-resolution one-million feature CGH microarray, the majority of which were previously genotyped with SNP arrays. Our objective was to discover new CNVs in ASD cases that were not detected by SNP microarray analysis and to delineate novel ASD risk loci via combined analysis of CGH and SNP array data sets on the ASD cohort and CGH data on an additional 1000 control samples. Of the 615 ASD cases analyzed on both SNP and CGH arrays, we found that 13,572 of 21,346 (64%) of the CNVs were exclusively detected by the CGH array. Several of the CGH-specific CNVs are rare in population frequency and impact previously reported ASD genes (e.g., NRXN1, GRM8, DPYD), as well as novel ASD candidate genes (e.g., CIB2, DAPP1, SAE1), and all were inherited except for a de novo CNV in the GPHN gene. A functional enrichment test of gene-sets in ASD cases over controls revealed nucleotide metabolism as a potential novel pathway involved in ASD, which includes several candidate genes for follow-up (e.g., DPYD, UPB1, UPP1, TYMP). Finally, this extensively phenotyped and genotyped ASD clinical cohort serves as an invaluable resource for the next step of genome sequencing for complete genetic variation detection. PMID:23275889
In-vitro analysis of Quantum Molecular Resonance effects on human mesenchymal stromal cells
Sella, Sabrina; Adami, Valentina; Amati, Eliana; Bernardi, Martina; Chieregato, Katia; Gatto, Pamela; Menarin, Martina; Pozzato, Alessandro; Pozzato, Gianantonio; Astori, Giuseppe
2018-01-01
Electromagnetic fields play an essential role in cellular functions interfering with cellular pathways and tissue physiology. In this context, Quantum Molecular Resonance (QMR) produces waves with a specific form at high-frequencies (4–64 MHz) and low intensity through electric fields. We evaluated the effects of QMR stimulation on bone marrow derived mesenchymal stromal cells (MSC). MSC were treated with QMR for 10 minutes for 4 consecutive days for 2 weeks at different nominal powers. Cell morphology, phenotype, multilineage differentiation, viability and proliferation were investigated. QMR effects were further investigated by cDNA microarray validated by real-time PCR. After 1 and 2 weeks of QMR treatment morphology, phenotype and multilineage differentiation were maintained and no alteration of cellular viability and proliferation were observed between treated MSC samples and controls. cDNA microarray analysis evidenced more transcriptional changes on cells treated at 40 nominal power than 80 ones. The main enrichment lists belonged to development processes, regulation of phosphorylation, regulation of cellular pathways including metabolism, kinase activity and cellular organization. Real-time PCR confirmed significant increased expression of MMP1, PLAT and ARHGAP22 genes while A2M gene showed decreased expression in treated cells compared to controls. Interestingly, differentially regulated MMP1, PLAT and A2M genes are involved in the extracellular matrix (ECM) remodelling through the fibrinolytic system that is also implicated in embryogenesis, wound healing and angiogenesis. In our model QMR-treated MSC maintained unaltered cell phenotype, viability, proliferation and the ability to differentiate into bone, cartilage and adipose tissue. Microarray analysis may suggest an involvement of QMR treatment in angiogenesis and in tissue regeneration probably through ECM remodelling. PMID:29293552
Parthasarathy, Narayanan; DeShazer, David; England, Marilyn; Waag, David M
2006-11-01
A polysaccharide microarray platform was prepared by immobilizing Burkholderia pseudomallei and Burkholderia mallei polysaccharides. This polysaccharide array was tested with success for detecting B. pseudomallei and B. mallei serum (human and animal) antibodies. The advantages of this microarray technology over the current serodiagnosis of the above bacterial infections were discussed.
ERIC Educational Resources Information Center
Dwyer, Dave; Gruenwald, Mark; Stickles, Joe; Axtell, Mike
2018-01-01
Resequencing Calculus is a project that has reordered the typical delivery of Calculus material to better serve the needs of STEM majors. Funded twice by the National Science Foundation, this project has produced a three-semester textbook that has been piloted at numerous institutions, large and small, public and private. This paper describes the…
USDA-ARS?s Scientific Manuscript database
The next generation sequencing (NGS) technologies have opened a wealth of opportunities for plant breeding and genomics research, and changed the paradigms of marker detection, genotyping, and gene discovery. Abundant genomic resources have been generated using a whole genome resequencing (WGR) str...
Stafuzza, Nedenia Bonvino; Zerlotini, Adhemar; Lobo, Francisco Pereira; Yamagishi, Michel Eduardo Beleza; Chud, Tatiane Cristina Seleguim; Caetano, Alexandre Rodrigues; Munari, Danísio Prado; Garrick, Dorian J; Machado, Marco Antonio; Martins, Marta Fonseca; Carvalho, Maria Raquel; Cole, John Bruce; Barbosa da Silva, Marcos Vinicius Gualberto
2017-01-01
Whole-genome re-sequencing, alignment and annotation analyses were undertaken for 12 sires representing four important cattle breeds in Brazil: Guzerat (multi-purpose), Gyr, Girolando and Holstein (dairy production). A total of approximately 4.3 billion reads from an Illumina HiSeq 2000 sequencer generated for each animal 10.7 to 16.4-fold genome coverage. A total of 27,441,279 single nucleotide variations (SNVs) and 3,828,041 insertions/deletions (InDels) were detected in the samples, of which 2,557,670 SNVs and 883,219 InDels were novel. The submission of these genetic variants to the dbSNP database significantly increased the number of known variants, particularly for the indicine genome. The concordance rate between genotypes obtained using the Bovine HD BeadChip array and the same variants identified by sequencing was about 99.05%. The annotation of variants identified numerous non-synonymous SNVs and frameshift InDels which could affect phenotypic variation. Functional enrichment analysis was performed and revealed that variants in the olfactory transduction pathway was over represented in all four cattle breeds, while the ECM-receptor interaction pathway was over represented in Girolando and Guzerat breeds, the ABC transporters pathway was over represented only in Holstein breed, and the metabolic pathways was over represented only in Gyr breed. The genetic variants discovered here provide a rich resource to help identify potential genomic markers and their associated molecular mechanisms that impact economically important traits for Gyr, Girolando, Guzerat and Holstein breeding programs.
Lobo, Francisco Pereira; Yamagishi, Michel Eduardo Beleza; Chud, Tatiane Cristina Seleguim; Caetano, Alexandre Rodrigues; Munari, Danísio Prado; Garrick, Dorian J.; Machado, Marco Antonio; Martins, Marta Fonseca; Carvalho, Maria Raquel; Cole, John Bruce; Barbosa da Silva, Marcos Vinicius Gualberto
2017-01-01
Whole-genome re-sequencing, alignment and annotation analyses were undertaken for 12 sires representing four important cattle breeds in Brazil: Guzerat (multi-purpose), Gyr, Girolando and Holstein (dairy production). A total of approximately 4.3 billion reads from an Illumina HiSeq 2000 sequencer generated for each animal 10.7 to 16.4-fold genome coverage. A total of 27,441,279 single nucleotide variations (SNVs) and 3,828,041 insertions/deletions (InDels) were detected in the samples, of which 2,557,670 SNVs and 883,219 InDels were novel. The submission of these genetic variants to the dbSNP database significantly increased the number of known variants, particularly for the indicine genome. The concordance rate between genotypes obtained using the Bovine HD BeadChip array and the same variants identified by sequencing was about 99.05%. The annotation of variants identified numerous non-synonymous SNVs and frameshift InDels which could affect phenotypic variation. Functional enrichment analysis was performed and revealed that variants in the olfactory transduction pathway was over represented in all four cattle breeds, while the ECM-receptor interaction pathway was over represented in Girolando and Guzerat breeds, the ABC transporters pathway was over represented only in Holstein breed, and the metabolic pathways was over represented only in Gyr breed. The genetic variants discovered here provide a rich resource to help identify potential genomic markers and their associated molecular mechanisms that impact economically important traits for Gyr, Girolando, Guzerat and Holstein breeding programs. PMID:28323836
Silva-Junior, Orzenil B; Grattapaglia, Dario
2015-11-01
We used high-density single nucleotide polymorphism (SNP) data and whole-genome pooled resequencing to examine the landscape of population recombination (ρ) and nucleotide diversity (ϴw ), assess the extent of linkage disequilibrium (r(2) ) and build the highest density linkage maps for Eucalyptus. At the genome-wide level, linkage disequilibrium (LD) decayed within c. 4-6 kb, slower than previously reported from candidate gene studies, but showing considerable variation from absence to complete LD up to 50 kb. A sharp decrease in the estimate of ρ was seen when going from short to genome-wide inter-SNP distances, highlighting the dependence of this parameter on the scale of observation adopted. Recombination was correlated with nucleotide diversity, gene density and distance from the centromere, with hotspots of recombination enriched for genes involved in chemical reactions and pathways of the normal metabolic processes. The high nucleotide diversity (ϴw = 0.022) of E. grandis revealed that mutation is more important than recombination in shaping its genomic diversity (ρ/ϴw = 0.645). Chromosome-wide ancestral recombination graphs allowed us to date the split of E. grandis (1.7-4.8 million yr ago) and identify a scenario for the recent demographic history of the species. Our results have considerable practical importance to Genome Wide Association Studies (GWAS), while indicating bright prospects for genomic prediction of complex phenotypes in eucalypt breeding. © 2015 The Authors. New Phytologist © 2015 New Phytologist Trust.
BioconductorBuntu: a Linux distribution that implements a web-based DNA microarray analysis server.
Geeleher, Paul; Morris, Dermot; Hinde, John P; Golden, Aaron
2009-06-01
BioconductorBuntu is a custom distribution of Ubuntu Linux that automatically installs a server-side microarray processing environment, providing a user-friendly web-based GUI to many of the tools developed by the Bioconductor Project, accessible locally or across a network. System installation is via booting off a CD image or by using a Debian package provided to upgrade an existing Ubuntu installation. In its current version, several microarray analysis pipelines are supported including oligonucleotide, dual-or single-dye experiments, including post-processing with Gene Set Enrichment Analysis. BioconductorBuntu is designed to be extensible, by server-side integration of further relevant Bioconductor modules as required, facilitated by its straightforward underlying Python-based infrastructure. BioconductorBuntu offers an ideal environment for the development of processing procedures to facilitate the analysis of next-generation sequencing datasets. BioconductorBuntu is available for download under a creative commons license along with additional documentation and a tutorial from (http://bioinf.nuigalway.ie).
DOE Office of Scientific and Technical Information (OSTI.GOV)
Jaing, C; Gardner, S
The goal of this project is to develop forensic genotyping assays for select agent viruses, enhancing the current capabilities for the viral bioforensics and law enforcement community. We used a multipronged approach combining bioinformatics analysis, PCR-enriched samples, microarrays and TaqMan assays to develop high resolution and cost effective genotyping methods for strain level forensic discrimination of viruses. We have leveraged substantial experience and efficiency gained through year 1 on software development, SNP discovery, TaqMan signature design and phylogenetic signature mapping to scale up the development of forensics signatures in year 2. In this report, we have summarized the whole genomemore » wide SNP analysis and microarray probe design for forensics characterization of South American hemorrhagic fever viruses, tick-borne encephalitis viruses and henipaviruses, Old World Arenaviruses, filoviruses, Crimean-Congo hemorrhagic fever virus, Rift Valley fever virus and Japanese encephalitis virus.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Zhou, J.; Wu, L.; Gentry, T.
2006-04-05
To effectively monitor microbial populations involved in various important processes, a 50-mer-based oligonucleotide microarray was developed based on known genes and pathways involved in: biodegradation, metal resistance and reduction, denitrification, nitrification, nitrogen fixation, methane oxidation, methanogenesis, carbon polymer decomposition, and sulfate reduction. This array contains approximately 2000 unique and group-specific probes with <85% similarity to their non-target sequences. Based on artificial probes, our results showed that at hybridization conditions of 50 C and 50% formamide, the 50-mer microarray hybridization can differentiate sequences having <88% similarity. Specificity tests with representative pure cultures indicated that the designed probes on the arrays appearedmore » to be specific to their corresponding target genes. Detection limits were about 5-10ng genomic DNA in the absence of background DNA, and 50-100ng ({approx}1.3{sup o} 10{sup 7} cells) in the presence background DNA. Strong linear relationships between signal intensity and target DNA and RNA concentration were observed (r{sup 2} = 0.95-0.99). Application of this microarray to naphthalene-amended enrichments and soil microcosms demonstrated that composition of the microflora varied depending on incubation conditions. While the naphthalene-degrading genes from Rhodococcus-type microorganisms were dominant in enrichments, the genes involved in naphthalene degradation from Gram-negative microorganisms such as Ralstonia, Comamonas, and Burkholderia were most abundant in the soil microcosms (as well as those for polyaromatic hydrocarbon and nitrotoluene degradation). Although naphthalene degradation is widely known and studied in Pseudomonas, Pseudomonas genes were not detected in either system. Real-time PCR analysis of 4 representative genes was consistent with microarray-based quantification (r{sup 2} = 0.95). Currently, we are also applying this microarray to the study of several different microbial communities and processes at the NABIR-FRC in Oak Ridge, TN. One project involves the monitoring of the development and dynamics of the microbial community of a fluidized bed reactor (FBR) used for reducing nitrate and the other project monitors microbial community responses to stimulation of uranium reducing populations via ethanol donor additions in situ and in a model system. Additionally, we are developing novel strategies for increasing microarray hybridization sensitivity. Finally, great improvements to our methods of probe design were made by the development of a new computer program, CommOligo. CommOligo designs unique and group-specific oligo probes for whole-genomes, metagenomes, and groups of environmental sequences and uses a new global alignment algorithm to design single or multiple probes for each gene or group. We are now using this program to design a more comprehensive functional gene array for environmental studies. Overall, our results indicate that the 50mer-based microarray technology has potential as a specific and quantitative tool to reveal the composition of microbial communities and their dynamics important to processes within contaminated environments.« less
Microarray analysis reveals key genes and pathways in Tetralogy of Fallot
He, Yue-E; Qiu, Hui-Xian; Jiang, Jian-Bing; Wu, Rong-Zhou; Xiang, Ru-Lian; Zhang, Yuan-Hai
2017-01-01
The aim of the present study was to identify key genes that may be involved in the pathogenesis of Tetralogy of Fallot (TOF) using bioinformatics methods. The GSE26125 microarray dataset, which includes cardiovascular tissue samples derived from 16 children with TOF and five healthy age-matched control infants, was downloaded from the Gene Expression Omnibus database. Differential expression analysis was performed between TOF and control samples to identify differentially expressed genes (DEGs) using Student's t-test, and the R/limma package, with a log2 fold-change of >2 and a false discovery rate of <0.01 set as thresholds. The biological functions of DEGs were analyzed using the ToppGene database. The ReactomeFIViz application was used to construct functional interaction (FI) networks, and the genes in each module were subjected to pathway enrichment analysis. The iRegulon plugin was used to identify transcription factors predicted to regulate the DEGs in the FI network, and the gene-transcription factor pairs were then visualized using Cytoscape software. A total of 878 DEGs were identified, including 848 upregulated genes and 30 downregulated genes. The gene FI network contained seven function modules, which were all comprised of upregulated genes. Genes enriched in Module 1 were enriched in the following three neurological disorder-associated signaling pathways: Parkinson's disease, Alzheimer's disease and Huntington's disease. Genes in Modules 0, 3 and 5 were dominantly enriched in pathways associated with ribosomes and protein translation. The Xbox binding protein 1 transcription factor was demonstrated to be involved in the regulation of genes encoding the subunits of cytoplasmic and mitochondrial ribosomes, as well as genes involved in neurodegenerative disorders. Therefore, dysfunction of genes involved in signaling pathways associated with neurodegenerative disorders, ribosome function and protein translation may contribute to the pathogenesis of TOF. PMID:28713939
Lee, Hayan; Schatz, Michael C
2012-08-15
Genome resequencing and short read mapping are two of the primary tools of genomics and are used for many important applications. The current state-of-the-art in mapping uses the quality values and mapping quality scores to evaluate the reliability of the mapping. These attributes, however, are assigned to individual reads and do not directly measure the problematic repeats across the genome. Here, we present the Genome Mappability Score (GMS) as a novel measure of the complexity of resequencing a genome. The GMS is a weighted probability that any read could be unambiguously mapped to a given position and thus measures the overall composition of the genome itself. We have developed the Genome Mappability Analyzer to compute the GMS of every position in a genome. It leverages the parallelism of cloud computing to analyze large genomes, and enabled us to identify the 5-14% of the human, mouse, fly and yeast genomes that are difficult to analyze with short reads. We examined the accuracy of the widely used BWA/SAMtools polymorphism discovery pipeline in the context of the GMS, and found discovery errors are dominated by false negatives, especially in regions with poor GMS. These errors are fundamental to the mapping process and cannot be overcome by increasing coverage. As such, the GMS should be considered in every resequencing project to pinpoint the 'dark matter' of the genome, including of known clinically relevant variations in these regions. The source code and profiles of several model organisms are available at http://gma-bio.sourceforge.net
Dickinson, Peter; Xiong, Anqi; York, Daniel; Jayashankar, Kartika; Pielberg, Gerli; Koltookian, Michele; Murén, Eva; Fuxelius, Hans-Henrik; Weishaupt, Holger; Andersson, Göran; Hedhammar, Åke; Bongcam-Rudloff, Erik; Forsberg-Nilsson, Karin
2016-01-01
Gliomas are the most common form of malignant primary brain tumors in humans and second most common in dogs, occurring with similar frequencies in both species. Dogs are valuable spontaneous models of human complex diseases including cancers and may provide insight into disease susceptibility and oncogenesis. Several brachycephalic breeds such as Boxer, Bulldog and Boston Terrier have an elevated risk of developing glioma, but others, including Pug and Pekingese, are not at higher risk. To identify glioma-associated genetic susceptibility factors, an across-breed genome-wide association study (GWAS) was performed on 39 dog glioma cases and 141 controls from 25 dog breeds, identifying a genome-wide significant locus on canine chromosome (CFA) 26 (p = 2.8 x 10−8). Targeted re-sequencing of the 3.4 Mb candidate region was performed, followed by genotyping of the 56 SNVs that best fit the association pattern between the re-sequenced cases and controls. We identified three candidate genes that were highly associated with glioma susceptibility: CAMKK2, P2RX7 and DENR. CAMKK2 showed reduced expression in both canine and human brain tumors, and a non-synonymous variant in P2RX7, previously demonstrated to have a 50% decrease in receptor function, was also associated with disease. Thus, one or more of these genes appear to affect glioma susceptibility. PMID:27171399
Stephenson, Kathryn E.; Neubauer, George H.; Reimer, Ulf; ...
2014-11-14
An effective vaccine against human immunodeficiency virus type 1 (HIV-1) will have to provide protection against a vast array of different HIV-1 strains. Current methods to measure HIV-1-specific binding antibodies following immunization typically focus on determining the magnitude of antibody responses, but the epitope diversity of antibody responses has remained largely unexplored. Here we describe the development of a global HIV-1 peptide microarray that contains 6564 peptides from across the HIV-1 proteome and covers the majority of HIV-1 sequences in the Los Alamos National Laboratory global HIV-1 sequence database. Using this microarray, we quantified the magnitude, breadth, and depth ofmore » IgG binding to linear HIV-1 sequences in HIV-1-infected humans and HIV-1-vaccinated humans, rhesus monkeys and guinea pigs. The microarray measured potentially important differences in antibody epitope diversity, particularly regarding the depth of epitope variants recognized at each binding site. Our data suggest that the global HIV-1 peptide microarray may be a useful tool for both preclinical and clinical HIV-1 research.« less
DOE Office of Scientific and Technical Information (OSTI.GOV)
Stephenson, Kathryn E.; Neubauer, George H.; Reimer, Ulf
An effective vaccine against human immunodeficiency virus type 1 (HIV-1) will have to provide protection against a vast array of different HIV-1 strains. Current methods to measure HIV-1-specific binding antibodies following immunization typically focus on determining the magnitude of antibody responses, but the epitope diversity of antibody responses has remained largely unexplored. Here we describe the development of a global HIV-1 peptide microarray that contains 6564 peptides from across the HIV-1 proteome and covers the majority of HIV-1 sequences in the Los Alamos National Laboratory global HIV-1 sequence database. Using this microarray, we quantified the magnitude, breadth, and depth ofmore » IgG binding to linear HIV-1 sequences in HIV-1-infected humans and HIV-1-vaccinated humans, rhesus monkeys and guinea pigs. The microarray measured potentially important differences in antibody epitope diversity, particularly regarding the depth of epitope variants recognized at each binding site. Our data suggest that the global HIV-1 peptide microarray may be a useful tool for both preclinical and clinical HIV-1 research.« less
MAAMD: a workflow to standardize meta-analyses and comparison of affymetrix microarray data
2014-01-01
Background Mandatory deposit of raw microarray data files for public access, prior to study publication, provides significant opportunities to conduct new bioinformatics analyses within and across multiple datasets. Analysis of raw microarray data files (e.g. Affymetrix CEL files) can be time consuming, complex, and requires fundamental computational and bioinformatics skills. The development of analytical workflows to automate these tasks simplifies the processing of, improves the efficiency of, and serves to standardize multiple and sequential analyses. Once installed, workflows facilitate the tedious steps required to run rapid intra- and inter-dataset comparisons. Results We developed a workflow to facilitate and standardize Meta-Analysis of Affymetrix Microarray Data analysis (MAAMD) in Kepler. Two freely available stand-alone software tools, R and AltAnalyze were embedded in MAAMD. The inputs of MAAMD are user-editable csv files, which contain sample information and parameters describing the locations of input files and required tools. MAAMD was tested by analyzing 4 different GEO datasets from mice and drosophila. MAAMD automates data downloading, data organization, data quality control assesment, differential gene expression analysis, clustering analysis, pathway visualization, gene-set enrichment analysis, and cross-species orthologous-gene comparisons. MAAMD was utilized to identify gene orthologues responding to hypoxia or hyperoxia in both mice and drosophila. The entire set of analyses for 4 datasets (34 total microarrays) finished in ~ one hour. Conclusions MAAMD saves time, minimizes the required computer skills, and offers a standardized procedure for users to analyze microarray datasets and make new intra- and inter-dataset comparisons. PMID:24621103
Li, Xiang; Harwood, Valerie J.; Nayak, Bina
2016-01-01
Pathogen identification and microbial source tracking (MST) to identify sources of fecal pollution improve evaluation of water quality. They contribute to improved assessment of human health risks and remediation of pollution sources. An MST microarray was used to simultaneously detect genes for multiple pathogens and indicators of fecal pollution in freshwater, marine water, sewage-contaminated freshwater and marine water, and treated wastewater. Dead-end ultrafiltration (DEUF) was used to concentrate organisms from water samples, yielding a recovery efficiency of >95% for Escherichia coli and human polyomavirus. Whole-genome amplification (WGA) increased gene copies from ultrafiltered samples and increased the sensitivity of the microarray. Viruses (adenovirus, bocavirus, hepatitis A virus, and human polyomaviruses) were detected in sewage-contaminated samples. Pathogens such as Legionella pneumophila, Shigella flexneri, and Campylobacter fetus were detected along with genes conferring resistance to aminoglycosides, beta-lactams, and tetracycline. Nonmetric dimensional analysis of MST marker genes grouped sewage-spiked freshwater and marine samples with sewage and apart from other fecal sources. The sensitivity (percent true positives) of the microarray probes for gene targets anticipated in sewage was 51 to 57% and was lower than the specificity (percent true negatives; 79 to 81%). A linear relationship between gene copies determined by quantitative PCR and microarray fluorescence was found, indicating the semiquantitative nature of the MST microarray. These results indicate that ultrafiltration coupled with WGA provides sufficient nucleic acids for detection of viruses, bacteria, protozoa, and antibiotic resistance genes by the microarray in applications ranging from beach monitoring to risk assessment. PMID:26729716
Hypoxia adaptations in the grey wolf (Canis lupus chanco) from Qinghai-Tibet Plateau.
Zhang, Wenping; Fan, Zhenxin; Han, Eunjung; Hou, Rong; Zhang, Liang; Galaverni, Marco; Huang, Jie; Liu, Hong; Silva, Pedro; Li, Peng; Pollinger, John P; Du, Lianming; Zhang, XiuyYue; Yue, Bisong; Wayne, Robert K; Zhang, Zhihe
2014-07-01
The Tibetan grey wolf (Canis lupus chanco) occupies habitats on the Qinghai-Tibet Plateau, a high altitude (>3000 m) environment where low oxygen tension exerts unique selection pressure on individuals to adapt to hypoxic conditions. To identify genes involved in hypoxia adaptation, we generated complete genome sequences of nine Chinese wolves from high and low altitude populations at an average coverage of 25× coverage. We found that, beginning about 55,000 years ago, the highland Tibetan grey wolf suffered a more substantial population decline than lowland wolves. Positively selected hypoxia-related genes in highland wolves are enriched in the HIF signaling pathway (P = 1.57E-6), ATP binding (P = 5.62E-5), and response to an oxygen-containing compound (P≤5.30E-4). Of these positively selected hypoxia-related genes, three genes (EPAS1, ANGPT1, and RYR2) had at least one specific fixed non-synonymous SNP in highland wolves based on the nine genome data. Our re-sequencing studies on a large panel of individuals showed a frequency difference greater than 58% between highland and lowland wolves for these specific fixed non-synonymous SNPs and a high degree of LD surrounding the three genes, which imply strong selection. Past studies have shown that EPAS1 and ANGPT1 are important in the response to hypoxic stress, and RYR2 is involved in heart function. These three genes also exhibited significant signals of natural selection in high altitude human populations, which suggest similar evolutionary constraints on natural selection in wolves and humans of the Qinghai-Tibet Plateau.
De Franceschi, Paolo; Bianco, Luca; Cestaro, Alessandro; Dondini, Luca; Velasco, Riccardo
2018-06-01
Data obtained from Illumina resequencing of 63 apple cultivars were used to obtain full-length S-RNase sequences using a strategy based on both alignment and de novo assembly of reads. The reproductive biology of apple is regulated by the S-RNase-based gametophytic self-incompatibility system, that is genetically controlled by the single, multi-genic and multi-allelic S locus. Resequencing of apple cultivars provided a huge amount of genetic data, that can be aligned to the reference genome in order to characterize variation to a genome-wide level. However, this approach is not immediately adaptable to the S-locus, due to some peculiar features such as the high degree of polymorphism, lack of colinearity between haplotypes and extensive presence of repetitive elements. In this study we describe a dedicated procedure aimed at characterizing S-RNase alleles from resequenced cultivars. The S-genotype of 63 apple accessions is reported; the full length coding sequence was determined for the 25 S-RNase alleles present in the 63 resequenced cultivars; these included 10 previously incomplete sequences (S 5 , S 6a , S 6b , S 8 , S 11 , S 23 , S 39 , S 46 , S 50 and S 58 ). Moreover, sequence divergence clearly suggests that alleles S 6a and S 6b , proposed to be neutral variants of the same alleles, should be instead considered different specificities. The promoter sequences have also been analyzed, highlighting regions of homology conserved among all the alleles.
Schadt, Eric E; Edwards, Stephen W; GuhaThakurta, Debraj; Holder, Dan; Ying, Lisa; Svetnik, Vladimir; Leonardson, Amy; Hart, Kyle W; Russell, Archie; Li, Guoya; Cavet, Guy; Castle, John; McDonagh, Paul; Kan, Zhengyan; Chen, Ronghua; Kasarskis, Andrew; Margarint, Mihai; Caceres, Ramon M; Johnson, Jason M; Armour, Christopher D; Garrett-Engele, Philip W; Tsinoremas, Nicholas F; Shoemaker, Daniel D
2004-01-01
Background Computational and microarray-based experimental approaches were used to generate a comprehensive transcript index for the human genome. Oligonucleotide probes designed from approximately 50,000 known and predicted transcript sequences from the human genome were used to survey transcription from a diverse set of 60 tissues and cell lines using ink-jet microarrays. Further, expression activity over at least six conditions was more generally assessed using genomic tiling arrays consisting of probes tiled through a repeat-masked version of the genomic sequence making up chromosomes 20 and 22. Results The combination of microarray data with extensive genome annotations resulted in a set of 28,456 experimentally supported transcripts. This set of high-confidence transcripts represents the first experimentally driven annotation of the human genome. In addition, the results from genomic tiling suggest that a large amount of transcription exists outside of annotated regions of the genome and serves as an example of how this activity could be measured on a genome-wide scale. Conclusions These data represent one of the most comprehensive assessments of transcriptional activity in the human genome and provide an atlas of human gene expression over a unique set of gene predictions. Before the annotation of the human genome is considered complete, however, the previously unannotated transcriptional activity throughout the genome must be fully characterized. PMID:15461792
Hatt, Lotte; Aagaard, Mads M; Bach, Cathrine; Graakjaer, Jesper; Sommer, Steffen; Agerholm, Inge E; Kølvraa, Steen; Bojesen, Anders
2016-01-01
Methylation-based non-invasive prenatal testing of fetal aneuploidies is an alternative method that could possibly improve fetal aneuploidy diagnosis, especially for trisomy 13(T13) and trisomy 18(T18). Our aim was to study the methylation landscape in placenta DNA from trisomy 13, 18 and 21 pregnancies in an attempt to find trisomy-specific methylation differences better suited for non-invasive prenatal diagnosis. We have conducted high-resolution methylation specific bead chip microarray analyses assessing more than 450,000 CpGs analyzing placentas from 12 T21 pregnancies, 12 T18 pregnancies and 6 T13 pregnancies. We have compared the methylation landscape of the trisomic placentas to the methylation landscape from normal placental DNA and to maternal blood cell DNA. Comparing trisomic placentas to normal placentas we identified 217 and 219 differentially methylated CpGs for CVS T18 and CVS T13, respectively (delta β>0.2, FDR<0.05), but only three differentially methylated CpGs for T21. However, the methylation differences was only modest (delta β<0.4), making them less suitable as diagnostic markers. Gene ontology enrichment analysis revealed that the gene set connected to theT18 differentially methylated CpGs was highly enriched for GO terms related to"DNA binding" and "transcription factor binding" coupled to the RNA polymerase II transcription. In the gene set connected to the T13 differentially methylated CpGs we found no significant enrichments.
Hatt, Lotte; Aagaard, Mads M.; Bach, Cathrine; Graakjaer, Jesper; Sommer, Steffen; Agerholm, Inge E.; Bojesen, Anders
2016-01-01
Methylation-based non-invasive prenatal testing of fetal aneuploidies is an alternative method that could possibly improve fetal aneuploidy diagnosis, especially for trisomy 13(T13) and trisomy 18(T18). Our aim was to study the methylation landscape in placenta DNA from trisomy 13, 18 and 21 pregnancies in an attempt to find trisomy–specific methylation differences better suited for non-invasive prenatal diagnosis. We have conducted high-resolution methylation specific bead chip microarray analyses assessing more than 450,000 CpGs analyzing placentas from 12 T21 pregnancies, 12 T18 pregnancies and 6 T13 pregnancies. We have compared the methylation landscape of the trisomic placentas to the methylation landscape from normal placental DNA and to maternal blood cell DNA. Comparing trisomic placentas to normal placentas we identified 217 and 219 differentially methylated CpGs for CVS T18 and CVS T13, respectively (delta β>0.2, FDR<0.05), but only three differentially methylated CpGs for T21. However, the methylation differences was only modest (delta β<0.4), making them less suitable as diagnostic markers. Gene ontology enrichment analysis revealed that the gene set connected to theT18 differentially methylated CpGs was highly enriched for GO terms related to”DNA binding” and “transcription factor binding” coupled to the RNA polymerase II transcription. In the gene set connected to the T13 differentially methylated CpGs we found no significant enrichments. PMID:27490343
Berry, Nadine Kaye; Bain, Nicole L; Enjeti, Anoop K; Rowlings, Philip
2014-01-01
Aim To evaluate the role of whole genome comparative genomic hybridisation microarray (array-CGH) in detecting genomic imbalances as compared to conventional karyotype (GTG-analysis) or myeloma specific fluorescence in situ hybridisation (FISH) panel in a diagnostic setting for plasma cell dyscrasia (PCD). Methods A myeloma-specific interphase FISH (i-FISH) panel was carried out on CD138 PC-enriched bone marrow (BM) from 20 patients having BM biopsies for evaluation of PCD. Whole genome array-CGH was performed on reference (control) and neoplastic (test patient) genomic DNA extracted from CD138 PC-enriched BM and analysed. Results Comparison of techniques demonstrated a much higher detection rate of genomic imbalances using array-CGH. Genomic imbalances were detected in 1, 19 and 20 patients using GTG-analysis, i-FISH and array-CGH, respectively. Genomic rearrangements were detected in one patient using GTG-analysis and seven patients using i-FISH, while none were detected using array-CGH. I-FISH was the most sensitive method for detecting gene rearrangements and GTG-analysis was the least sensitive method overall. All copy number aberrations observed in GTG-analysis were detected using array-CGH and i-FISH. Conclusions We show that array-CGH performed on CD138-enriched PCs significantly improves the detection of clinically relevant and possibly novel genomic abnormalities in PCD, and thus could be considered as a standard diagnostic technique in combination with IGH rearrangement i-FISH. PMID:23969274
Berry, Nadine Kaye; Bain, Nicole L; Enjeti, Anoop K; Rowlings, Philip
2014-01-01
To evaluate the role of whole genome comparative genomic hybridisation microarray (array-CGH) in detecting genomic imbalances as compared to conventional karyotype (GTG-analysis) or myeloma specific fluorescence in situ hybridisation (FISH) panel in a diagnostic setting for plasma cell dyscrasia (PCD). A myeloma-specific interphase FISH (i-FISH) panel was carried out on CD138 PC-enriched bone marrow (BM) from 20 patients having BM biopsies for evaluation of PCD. Whole genome array-CGH was performed on reference (control) and neoplastic (test patient) genomic DNA extracted from CD138 PC-enriched BM and analysed. Comparison of techniques demonstrated a much higher detection rate of genomic imbalances using array-CGH. Genomic imbalances were detected in 1, 19 and 20 patients using GTG-analysis, i-FISH and array-CGH, respectively. Genomic rearrangements were detected in one patient using GTG-analysis and seven patients using i-FISH, while none were detected using array-CGH. I-FISH was the most sensitive method for detecting gene rearrangements and GTG-analysis was the least sensitive method overall. All copy number aberrations observed in GTG-analysis were detected using array-CGH and i-FISH. We show that array-CGH performed on CD138-enriched PCs significantly improves the detection of clinically relevant and possibly novel genomic abnormalities in PCD, and thus could be considered as a standard diagnostic technique in combination with IGH rearrangement i-FISH.
USING DNA MICROARRAYS TO CHARACTERIZE GENE EXPRESSION
IN TESTES OF FERTILE AND INFERTILE HUMANS AND MICE
John C. Rockett1, J. Christopher Luft1, J. Brian Garges1, M. Stacey Ricci2, Pasquale Patrizio2, Norman B. Hecht2 and David J. Dix1
Reproductive Toxicology Divisio...
The Pathway Coexpression Network: Revealing pathway relationships
Tanzi, Rudolph E.
2018-01-01
A goal of genomics is to understand the relationships between biological processes. Pathways contribute to functional interplay within biological processes through complex but poorly understood interactions. However, limited functional references for global pathway relationships exist. Pathways from databases such as KEGG and Reactome provide discrete annotations of biological processes. Their relationships are currently either inferred from gene set enrichment within specific experiments, or by simple overlap, linking pathway annotations that have genes in common. Here, we provide a unifying interpretation of functional interaction between pathways by systematically quantifying coexpression between 1,330 canonical pathways from the Molecular Signatures Database (MSigDB) to establish the Pathway Coexpression Network (PCxN). We estimated the correlation between canonical pathways valid in a broad context using a curated collection of 3,207 microarrays from 72 normal human tissues. PCxN accounts for shared genes between annotations to estimate significant correlations between pathways with related functions rather than with similar annotations. We demonstrate that PCxN provides novel insight into mechanisms of complex diseases using an Alzheimer’s Disease (AD) case study. PCxN retrieved pathways significantly correlated with an expert curated AD gene list. These pathways have known associations with AD and were significantly enriched for genes independently associated with AD. As a further step, we show how PCxN complements the results of gene set enrichment methods by revealing relationships between enriched pathways, and by identifying additional highly correlated pathways. PCxN revealed that correlated pathways from an AD expression profiling study include functional clusters involved in cell adhesion and oxidative stress. PCxN provides expanded connections to pathways from the extracellular matrix. PCxN provides a powerful new framework for interrogation of global pathway relationships. Comprehensive exploration of PCxN can be performed at http://pcxn.org/. PMID:29554099
DNA Microarray for Detection of Gastrointestinal Viruses
Martínez, Miguel A.; Soto-del Río, María de los Dolores; Gutiérrez, Rosa María; Chiu, Charles Y.; Greninger, Alexander L.; Contreras, Juan Francisco; López, Susana; Arias, Carlos F.
2014-01-01
Gastroenteritis is a clinical illness of humans and other animals that is characterized by vomiting and diarrhea and caused by a variety of pathogens, including viruses. An increasing number of viral species have been associated with gastroenteritis or have been found in stool samples as new molecular tools have been developed. In this work, a DNA microarray capable in theory of parallel detection of more than 100 viral species was developed and tested. Initial validation was done with 10 different virus species, and an additional 5 species were validated using clinical samples. Detection limits of 1 × 103 virus particles of Human adenovirus C (HAdV), Human astrovirus (HAstV), and group A Rotavirus (RV-A) were established. Furthermore, when exogenous RNA was added, the limit for RV-A detection decreased by one log. In a small group of clinical samples from children with gastroenteritis (n = 76), the microarray detected at least one viral species in 92% of the samples. Single infection was identified in 63 samples (83%), and coinfection with more than one virus was identified in 7 samples (9%). The most abundant virus species were RV-A (58%), followed by Anellovirus (15.8%), HAstV (6.6%), HAdV (5.3%), Norwalk virus (6.6%), Human enterovirus (HEV) (9.2%), Human parechovirus (1.3%), Sapporo virus (1.3%), and Human bocavirus (1.3%). To further test the specificity and sensitivity of the microarray, the results were verified by reverse transcription-PCR (RT-PCR) detection of 5 gastrointestinal viruses. The RT-PCR assay detected a virus in 59 samples (78%). The microarray showed good performance for detection of RV-A, HAstV, and calicivirus, while the sensitivity for HAdV and HEV was low. Furthermore, some discrepancies in detection of mixed infections were observed and were addressed by reverse transcription-quantitative PCR (RT-qPCR) of the viruses involved. It was observed that differences in the amount of genetic material favored the detection of the most abundant virus. The microarray described in this work should help in understanding the etiology of gastroenteritis in humans and animals. PMID:25355758
An anatomically comprehensive atlas of the adult human brain transcriptome
Guillozet-Bongaarts, Angela L.; Shen, Elaine H.; Ng, Lydia; Miller, Jeremy A.; van de Lagemaat, Louie N.; Smith, Kimberly A.; Ebbert, Amanda; Riley, Zackery L.; Abajian, Chris; Beckmann, Christian F.; Bernard, Amy; Bertagnolli, Darren; Boe, Andrew F.; Cartagena, Preston M.; Chakravarty, M. Mallar; Chapin, Mike; Chong, Jimmy; Dalley, Rachel A.; David Daly, Barry; Dang, Chinh; Datta, Suvro; Dee, Nick; Dolbeare, Tim A.; Faber, Vance; Feng, David; Fowler, David R.; Goldy, Jeff; Gregor, Benjamin W.; Haradon, Zeb; Haynor, David R.; Hohmann, John G.; Horvath, Steve; Howard, Robert E.; Jeromin, Andreas; Jochim, Jayson M.; Kinnunen, Marty; Lau, Christopher; Lazarz, Evan T.; Lee, Changkyu; Lemon, Tracy A.; Li, Ling; Li, Yang; Morris, John A.; Overly, Caroline C.; Parker, Patrick D.; Parry, Sheana E.; Reding, Melissa; Royall, Joshua J.; Schulkin, Jay; Sequeira, Pedro Adolfo; Slaughterbeck, Clifford R.; Smith, Simon C.; Sodt, Andy J.; Sunkin, Susan M.; Swanson, Beryl E.; Vawter, Marquis P.; Williams, Derric; Wohnoutka, Paul; Zielke, H. Ronald; Geschwind, Daniel H.; Hof, Patrick R.; Smith, Stephen M.; Koch, Christof; Grant, Seth G. N.; Jones, Allan R.
2014-01-01
Neuroanatomically precise, genome-wide maps of transcript distributions are critical resources to complement genomic sequence data and to correlate functional and genetic brain architecture. Here we describe the generation and analysis of a transcriptional atlas of the adult human brain, comprising extensive histological analysis and comprehensive microarray profiling of ~900 neuroanatomically precise subdivisions in two individuals. Transcriptional regulation varies enormously by anatomical location, with different regions and their constituent cell types displaying robust molecular signatures that are highly conserved between individuals. Analysis of differential gene expression and gene co-expression relationships demonstrates that brain-wide variation strongly reflects the distributions of major cell classes such as neurons, oligodendrocytes, astrocytes and microglia. Local neighbourhood relationships between fine anatomical subdivisions are associated with discrete neuronal subtypes and genes involved with synaptic transmission. The neocortex displays a relatively homogeneous transcriptional pattern, but with distinct features associated selectively with primary sensorimotor cortices and with enriched frontal lobe expression. Notably, the spatial topography of the neocortex is strongly reflected in its molecular topography— the closer two cortical regions, the more similar their transcriptomes. This freely accessible online data resource forms a high-resolution transcriptional baseline for neurogenetic studies of normal and abnormal human brain function. PMID:22996553
Molecular Profiling of Glatiramer Acetate Early Treatment Effects in Multiple Sclerosis
Achiron, Anat; Feldman, Anna; Gurevich, Michael
2009-01-01
Background: Glatiramer acetate (GA, Copaxone®) has beneficial effects on the clinical course of relapsing-remitting multiple sclerosis (RRMS). However, the exact molecular mechanisms of GA effects are only partially understood. Objective: To characterized GA molecular effects in RRMS patients within 3 months of treatment by microarray profiling of peripheral blood mononuclear cells (PBMC). Methods: Gene-expression profiles were determined in RRMS patients before and at 3 months after initiation of GA treatment using Affimetrix (U133A-2) microarrays containing 14,500 well-characterized human genes. Most informative genes (MIGs) of GA-induced biological convergent pathways operating in RRMS were constructed using gene functional annotation, enrichment analysis and pathway reconstruction bioinformatic softwares. Verification at the mRNA and protein level was performed by qRT-PCR and FACS. Results: GA induced a specific gene expression molecular signature that included altered expression of 480 genes within 3 months of treatment; 262 genes were up-regulated, and 218 genes were down-regulated. The main convergent mechanisms of GA effects were related to antigen-activated apoptosis, inflammation, adhesion, and MHC class-I antigen presentation. Conclusions: Our findings demonstrate that GA treatment induces alternations of immunomodulatory gene expression patterns that are important for suppression of disease activity already at three months of treatment and can be used as molecular markers of GA activity. PMID:19893201
Kumari, Bharti; Jain, Pratistha; Das, Shaoli; Ghosal, Suman; Hazra, Bibhabasu; Trivedi, Ashish Chandra; Basu, Anirban; Chakrabarti, Jayprokas; Vrati, Sudhanshu; Banerjee, Arup
2016-01-01
Microglia cells in the brain play essential role during Japanese Encephalitis Virus (JEV) infection and may lead to change in microRNA (miRNA) and mRNA profile. These changes may together control disease outcome. Using Affymetrix microarray platform, we profiled cellular miRNA and mRNA expression at multiple time points during viral infection in human microglial (CHME3) cells. In silico analysis of microarray data revealed a phased pattern of miRNAs expression, associated with JEV replication and provided unique signatures of infection. Target prediction and pathway enrichment analysis identified anti correlation between differentially expressed miRNA and the gene expression at multiple time point which ultimately affected diverse signaling pathways including Notch signaling pathways in microglia. Activation of Notch pathway during JEV infection was demonstrated in vitro and in vivo. The expression of a subset of miRNAs that target multiple genes in Notch signaling pathways were suppressed and their overexpression could affect JEV induced immune response. Further analysis provided evidence for the possible presence of cellular competing endogenous RNA (ceRNA) associated with innate immune response. Collectively, our data provide a uniquely comprehensive view of the changes in the host miRNAs induced by JEV during cellular infection and identify Notch pathway in modulating microglia mediated inflammation. PMID:26838068
Chuquiyauri, Raul; Molina, Douglas M.; Moss, Eli L.; Wang, Ruobing; Gardner, Malcolm J.; Brouwer, Kimberly C.; Torres, Sonia; Gilman, Robert H.; Llanos-Cuentas, Alejandro; Neafsey, Daniel E.; Felgner, Philip; Liang, Xiaowu; Vinetz, Joseph M.
2015-01-01
Large scale antibody responses in Plasmodium vivax malaria remains unexplored in the endemic setting. Protein microarray analysis of asexual-stage P. vivax was used to identify antigens recognized in sera from residents of hypoendemic Peruvian Amazon. Over 24 months, of 106 participants, 91 had two symptomatic P. vivax malaria episodes, 11 had three episodes, 3 had four episodes, and 1 had five episodes. Plasmodium vivax relapse was distinguished from reinfection by a merozoite surface protein-3α restriction fragment length polymorphism polymerase chain reaction (MSP3α PCR-RFLP) assay. Notably, P. vivax reinfection subjects did not have higher reactivity to the entire set of recognized P. vivax blood-stage antigens than relapse subjects, regardless of the number of malaria episodes. The most highly recognized P. vivax proteins were MSP 4, 7, 8, and 10 (PVX_003775, PVX_082650, PVX_097625, and PVX_114145); sexual-stage antigen s16 (PVX_000930); early transcribed membrane protein (PVX_090230); tryptophan-rich antigen (Pv-fam-a) (PVX_092995); apical merozoite antigen 1 (PVX_092275); and proteins of unknown function (PVX_081830, PVX_117680, PVX_118705, PVX_121935, PVX_097730, PVX_110935, PVX_115450, and PVX_082475). Genes encoding reactive proteins exhibited a significant enrichment of non-synonymous nucleotide variation, an observation suggesting immune selection. These data identify candidates for seroepidemiological tools to support malaria elimination efforts in P. vivax-endemic regions. PMID:26149860
Kumari, Bharti; Jain, Pratistha; Das, Shaoli; Ghosal, Suman; Hazra, Bibhabasu; Trivedi, Ashish Chandra; Basu, Anirban; Chakrabarti, Jayprokas; Vrati, Sudhanshu; Banerjee, Arup
2016-02-03
Microglia cells in the brain play essential role during Japanese Encephalitis Virus (JEV) infection and may lead to change in microRNA (miRNA) and mRNA profile. These changes may together control disease outcome. Using Affymetrix microarray platform, we profiled cellular miRNA and mRNA expression at multiple time points during viral infection in human microglial (CHME3) cells. In silico analysis of microarray data revealed a phased pattern of miRNAs expression, associated with JEV replication and provided unique signatures of infection. Target prediction and pathway enrichment analysis identified anti correlation between differentially expressed miRNA and the gene expression at multiple time point which ultimately affected diverse signaling pathways including Notch signaling pathways in microglia. Activation of Notch pathway during JEV infection was demonstrated in vitro and in vivo. The expression of a subset of miRNAs that target multiple genes in Notch signaling pathways were suppressed and their overexpression could affect JEV induced immune response. Further analysis provided evidence for the possible presence of cellular competing endogenous RNA (ceRNA) associated with innate immune response. Collectively, our data provide a uniquely comprehensive view of the changes in the host miRNAs induced by JEV during cellular infection and identify Notch pathway in modulating microglia mediated inflammation.
Transcriptome remodeling associated with chronological aging in the dinoflagellate, Karenia brevis.
Johnson, Jillian G; Morey, Jeanine S; Neely, Marion G; Ryan, James C; Van Dolah, Frances M
2012-03-01
The toxic dinoflagellate, Karenia brevis, forms dense blooms in the Gulf of Mexico that persist for many months in coastal waters, where they can cause extensive marine animal mortalities and human health impacts. The mechanisms that enable cell survival in high density, low growth blooms, and the mechanisms leading to often rapid bloom demise are not well understood. To gain an understanding of processes that underlie chronological aging in this dinoflagellate, a microarray study was carried out to identify changes in the global transcriptome that accompany the entry and maintenance of stationary phase up to the onset of cell death. The transcriptome of K. brevis was assayed using a custom 10,263 feature oligonucleotide microarray from mid-logarithmic growth to the onset of culture demise. A total of 2958 (29%) features were differentially expressed, with the mid-stationary phase timepoint demonstrating peak changes in expression. Gene ontology enrichment analyses identified a significant shift in transcripts involved in energy acquisition, ribosome biogenesis, gene expression, stress adaptation, calcium signaling, and putative brevetoxin biosynthesis. The extensive remodeling of the transcriptome observed in the transition into a quiescent non-dividing phase appears to be indicative of a global shift in the metabolic and signaling requirements and provides the basis from which to understand the process of chronological aging in a dinoflagellate. Published by Elsevier B.V.
USDA-ARS?s Scientific Manuscript database
During ongoing proteomic analysis of the soybean (Glycine max (L.) Merr) germplasm collection, PI 603408 was identified as a landrace whose seeds lack accumulation of one of the major seed storage glycinin protein subunits. Whole genomic resequencing was used to identify a two-base deletion affectin...
USDA-ARS?s Scientific Manuscript database
Whole-genome re-sequencing, alignment and annotation analyses were undertaken for 12 sires representing four important cattle breeds in Brazil: Guzerat (multi-purpose), Gyr, Girolando and Holstein (dairy production). A total of approximately 4.3 billion reads from an Illumina HiSeq 2000 sequencer ge...
Xia, Yu; Yang, Yongchao; Huang, Shufang; Wu, Yueheng; Li, Ping; Zhuang, Jian
2018-03-24
This study aimed to determine chromosomal abnormalities and copy number variations (CNVs) in fetuses with congenital heart disease (CHD) by chromosomal microarray analysis (CMA). One hundred and ten cases with CHD detected by prenatal echocardiography were enrolled in the study; 27 cases were simple CHDs, and 83 were complex CHDs. Chromosomal microarray analysis was performed on the Affymetrix CytoScan HD platform. All annotated CNVs were validated by quantitative PCR. Chromosomal microarray analysis identified 6 cases with chromosomal abnormalities, including 2 cases with trisomy 21, 2 cases with trisomy 18, 1 case with trisomy 13, and 1 unusual case of mosaic trisomy 21. Pathogenic CNVs were detected in 15.5% (17/110) of the fetuses with CHDs, including 13 cases with CHD-associated CNVs. We further identified 10 genes as likely novel CHD candidate genes through gene functional enrichment analysis. We also found that pathogenic CMA results impacted the rate of pregnancy termination. This study shows that CMA is particularly effective for identifying chromosomal abnormalities and CNVs in fetuses with CHDs as well as having an effect on obstetrical outcomes. The elucidation of the genetic basis of CHDs will continue to expand our understanding of the etiology of CHDs. © 2018 John Wiley & Sons, Ltd.
2012-01-01
Background Resource-limited tropical countries are home to numerous infectious pathogens of both human and zoonotic origin. A capability for early detection to allow rapid outbreak containment and prevent spread to non-endemic regions is severely impaired by inadequate diagnostic laboratory capacity, the absence of a “cold chain” and the lack of highly trained personnel. Building up detection capacity in these countries by direct replication of the systems existing in developed countries is not a feasible approach and instead requires “leapfrogging” to the deployment of the newest diagnostic systems that do not have the infrastructure requirements of systems used in developed countries. Methods A laboratory for molecular diagnostics of infectious agents was established in Bo, Sierra Leone with a hybrid solar/diesel/battery system to ensure stable power supply and a satellite modem to enable efficient communication. An array of room temperature stabilization and refrigeration technologies for reliable transport and storage of reagents and biological samples were also tested to ensure sustainable laboratory supplies for diagnostic assays. Results The laboratory demonstrated its operational proficiency by conducting an investigation of a suspected avian influenza outbreak at a commercial poultry farm at Bo using broad range resequencing microarrays and real time RT-PCR. The results of the investigation excluded influenza viruses as a possible cause of the outbreak and indicated a link between the outbreak and the presence of Klebsiella pneumoniae. Conclusions This study demonstrated that by application of a carefully selected set of technologies and sufficient personnel training, it is feasible to deploy and effectively use a broad-range infectious pathogen detection technology in a severely resource-limited setting. PMID:22759725
Gandhi, Deepa; Tarale, Prashant; Naoghare, Pravin K; Bafana, Amit; Kannan, Krishnamurthi; Sivanesan, Saravanadevi
2016-01-01
Endosulfan, an organochlorine pesticide, is known to induce multiple disorders/abnormalities including neuro-degenerative disorders in many animal species. However, the molecular mechanism of endosulfan induced neuronal alterations is still not well understood. In the present study, the effect of sub-lethal concentration of endosulfan (3 μM) on human neuroblastoma cells (SH-SY5Y) was investigated using genomic and proteomic approaches. Microarray and 2D-PAGE followed by MALDI-TOF-MS analysis revealed differential expression of 831 transcripts and 16 proteins in exposed cells. A gene ontology enrichment analysis revealed that the differentially expressed genes and proteins were involved in variety of cellular events such as neuronal developmental pathway, immune response, cell differentiation, apoptosis, transmission of nerve impulse, axonogenesis, etc. The present study attempted to explore the possible molecular mechanism of endosulfan induced neuronal alterations in SH-SY5Y cells using an integrated genomic and proteomic approach. Based on the gene and protein profile possible mechanisms underlying endosulfan neurotoxicity were predicted. Copyright © 2015 Elsevier B.V. All rights reserved.
Peters, Derek T; Henderson, Christopher A; Warren, Curtis R; Friesen, Max; Xia, Fang; Becker, Caroline E; Musunuru, Kiran; Cowan, Chad A
2016-05-01
Hepatocyte-like cells (HLCs) are derived from human pluripotent stem cells (hPSCs) in vitro, but differentiation protocols commonly give rise to a heterogeneous mixture of cells. This variability confounds the evaluation of in vitro functional assays performed using HLCs. Increased differentiation efficiency and more accurate approximation of the in vivo hepatocyte gene expression profile would improve the utility of hPSCs. Towards this goal, we demonstrate the purification of a subpopulation of functional HLCs using the hepatocyte surface marker asialoglycoprotein receptor 1 (ASGR1). We analyzed the expression profile of ASGR1-positive cells by microarray, and tested their ability to perform mature hepatocyte functions (albumin and urea secretion, cytochrome activity). By these measures, ASGR1-positive HLCs are enriched for the gene expression profile and functional characteristics of primary hepatocytes compared with unsorted HLCs. We have demonstrated that ASGR1-positive sorting isolates a functional subpopulation of HLCs from among the heterogeneous cellular population produced by directed differentiation. © 2016. Published by The Company of Biologists Ltd.
A Perspective on DNA Microarrays in Pathology Research and Practice
Pollack, Jonathan R.
2007-01-01
DNA microarray technology matured in the mid-1990s, and the past decade has witnessed a tremendous growth in its application. DNA microarrays have provided powerful tools for pathology researchers seeking to describe, classify, and understand human disease. There has also been great expectation that the technology would advance the practice of pathology. This review highlights some of the key contributions of DNA microarrays to experimental pathology, focusing in the area of cancer research. Also discussed are some of the current challenges in translating utility to clinical practice. PMID:17600117
Infertility diagnosis has a significant impact on the transcriptome of developing blastocysts.
McCallie, Blair R; Parks, Jason C; Griffin, Darren K; Schoolcraft, William B; Katz-Jaffe, Mandy G
2017-08-01
Is the human blastocyst transcriptome associated with infertility diagnosis, specifically: polycystic ovaries (PCO), male factor (MF) and unexplained (UE)? The global blastocyst transcriptome was significantly altered in association with a PCO, MF and UE infertility diagnosis. Infertility diagnosis has an impact on the probability for a successful outcome following an IVF cycle. Limited information is known regarding the relationship between a specific infertility diagnosis and blastocyst transcription during preimplantation development. Blastocysts created during infertility treatment from patients with specific infertility diagnoses (PCO, MF and UE) were analyzed for global transcriptome compared to fertile donor oocyte blastocysts (control). Surplus cryopreserved blastocysts were donated with patient consent and institutional review board approval. Female patients were <38 years old with male patients <40 years old. Blastocysts were grouped according to infertility diagnosis: PCO (n = 50), MF (n = 50), UE (n = 50) and fertile donor oocyte controls (n = 50). Pooled blastocysts were lysed for RNA isolation followed by microarray analysis using the SurePrint G3 Human Gene Expression Microarray. Validation was performed on significant genes of interest using real-time quantitative PCR (RT-qPCR). Transcription alterations were observed for all infertility etiologies compared to controls, resulting in differentially expressed genes: PCO = 869, MF = 348 and UE = 473 (P < 0.05; >2-fold). Functional annotation of biological and molecular processes revealed both similarities, as well as differences, across the infertility groups. All infertility etiologies displayed transcriptome alterations in signal transducer activity, receptor binding, reproduction, cell adhesion and response to stimulus. Blastocysts from PCO patients were also enriched for apoptotic genes while MF blastocysts displayed enrichment for genes involved in cancer processes. Blastocysts from couples with unexplained infertility displayed transcription alterations related to various disease states, which included mechanistic target of rapamycin (mTOR) and adipocytokine signaling. RT-qPCR validation confirmed differential gene expression for the following genes: BCL2 like 10 (BCL2L10), heat shock protein family A member 1A (HSPA1A), heat shock protein family A member 1B (HSPA1B), activating transcription factor 3 (ATF3), fibroblast growth factor 9 (FGF9), left-right determination factor 1 (LEFTY1), left-right determination factor 2 (LEFTY2), growth differentiation factor 15 (GDF15), inhibin beta A subunit (INHBA), adherins junctions associated protein 1 (AJAP1), cadherin 9 (CDH9) and laminin subunit alpha 4 (LAMA4) (P < 0.05; >2-fold). Not available due to participant privacy. Blastocyst samples for microarray analysis required pooling. While this allows for an overall average in each infertility etiology group and can reduce noise from sample-to-sample variation, it cannot give a detailed analysis of each blastocyst within the group. Underlying patient infertility diagnosis has an impact on the blastocyst transcriptome, modifying gene expression associated with developmental competence and implantation potential. No conflict of interest or outside funding provided. © The Author 2017. Published by Oxford University Press on behalf of the European Society of Human Reproduction and Embryology. All rights reserved. For Permissions, please email:journals.permissions@oup.com
Zhao, Yuanshun; Zhang, Yonghong; Lin, Dongdong; Li, Kang; Yin, Chengzeng; Liu, Xiuhong; Jin, Boxun; Sun, Libo; Liu, Jinhua; Zhang, Aiying; Li, Ning
2015-10-01
To develop and evaluate a protein microarray assay with horseradish peroxidase (HRP) chemiluminescence for quantification of α-fetoprotein (AFP) in serum from patients with hepatocellular carcinoma (HCC). A protein microarray assay for AFP was developed. Serum was collected from patients with HCC and healthy control subjects. AFP was quantified using protein microarray and enzyme-linked immunosorbent assay (ELISA). Serum AFP concentrations determined via protein microarray were positively correlated (r = 0.973) with those determined via ELISA in patients with HCC (n = 60) and healthy control subjects (n = 30). Protein microarray showed 80% sensitivity and 100% specificity for HCC diagnosis. ELISA had 83.3% sensitivity and 100% specificity. Protein microarray effectively distinguished between patients with HCC and healthy control subjects (area under ROC curve 0.974; 95% CI 0.000, 1.000). Protein microarray is a rapid, simple and low-cost alternative to ELISA for detecting AFP in human serum. © The Author(s) 2015.
Lionel, Anath C; Tammimies, Kristiina; Vaags, Andrea K; Rosenfeld, Jill A; Ahn, Joo Wook; Merico, Daniele; Noor, Abdul; Runke, Cassandra K; Pillalamarri, Vamsee K; Carter, Melissa T; Gazzellone, Matthew J; Thiruvahindrapuram, Bhooma; Fagerberg, Christina; Laulund, Lone W; Pellecchia, Giovanna; Lamoureux, Sylvia; Deshpande, Charu; Clayton-Smith, Jill; White, Ann C; Leather, Susan; Trounce, John; Melanie Bedford, H; Hatchwell, Eli; Eis, Peggy S; Yuen, Ryan K C; Walker, Susan; Uddin, Mohammed; Geraghty, Michael T; Nikkel, Sarah M; Tomiak, Eva M; Fernandez, Bridget A; Soreni, Noam; Crosbie, Jennifer; Arnold, Paul D; Schachar, Russell J; Roberts, Wendy; Paterson, Andrew D; So, Joyce; Szatmari, Peter; Chrysler, Christina; Woodbury-Smith, Marc; Brian Lowry, R; Zwaigenbaum, Lonnie; Mandyam, Divya; Wei, John; Macdonald, Jeffrey R; Howe, Jennifer L; Nalpathamkalam, Thomas; Wang, Zhuozhi; Tolson, Daniel; Cobb, David S; Wilks, Timothy M; Sorensen, Mark J; Bader, Patricia I; An, Yu; Wu, Bai-Lin; Musumeci, Sebastiano Antonino; Romano, Corrado; Postorivo, Diana; Nardone, Anna M; Monica, Matteo Della; Scarano, Gioacchino; Zoccante, Leonardo; Novara, Francesca; Zuffardi, Orsetta; Ciccone, Roberto; Antona, Vincenzo; Carella, Massimo; Zelante, Leopoldo; Cavalli, Pietro; Poggiani, Carlo; Cavallari, Ugo; Argiropoulos, Bob; Chernos, Judy; Brasch-Andersen, Charlotte; Speevak, Marsha; Fichera, Marco; Ogilvie, Caroline Mackie; Shen, Yiping; Hodge, Jennelle C; Talkowski, Michael E; Stavropoulos, Dimitri J; Marshall, Christian R; Scherer, Stephen W
2014-05-15
Rare copy number variants (CNVs) disrupting ASTN2 or both ASTN2 and TRIM32 have been reported at 9q33.1 by genome-wide studies in a few individuals with neurodevelopmental disorders (NDDs). The vertebrate-specific astrotactins, ASTN2 and its paralog ASTN1, have key roles in glial-guided neuronal migration during brain development. To determine the prevalence of astrotactin mutations and delineate their associated phenotypic spectrum, we screened ASTN2/TRIM32 and ASTN1 (1q25.2) for exonic CNVs in clinical microarray data from 89 985 individuals across 10 sites, including 64 114 NDD subjects. In this clinical dataset, we identified 46 deletions and 12 duplications affecting ASTN2. Deletions of ASTN1 were much rarer. Deletions near the 3' terminus of ASTN2, which would disrupt all transcript isoforms (a subset of these deletions also included TRIM32), were significantly enriched in the NDD subjects (P = 0.002) compared with 44 085 population-based controls. Frequent phenotypes observed in individuals with such deletions include autism spectrum disorder (ASD), attention deficit hyperactivity disorder (ADHD), speech delay, anxiety and obsessive compulsive disorder (OCD). The 3'-terminal ASTN2 deletions were significantly enriched compared with controls in males with NDDs, but not in females. Upon quantifying ASTN2 human brain RNA, we observed shorter isoforms expressed from an alternative transcription start site of recent evolutionary origin near the 3' end. Spatiotemporal expression profiling in the human brain revealed consistently high ASTN1 expression while ASTN2 expression peaked in the early embryonic neocortex and postnatal cerebellar cortex. Our findings shed new light on the role of the astrotactins in psychopathology and their interplay in human neurodevelopment.
Booman, Marije; Borza, Tudor; Feng, Charles Y; Hori, Tiago S; Higgins, Brent; Culf, Adrian; Léger, Daniel; Chute, Ian C; Belkaid, Anissa; Rise, Marlies; Gamperl, A Kurt; Hubert, Sophie; Kimball, Jennifer; Ouellette, Rodney J; Johnson, Stewart C; Bowman, Sharen; Rise, Matthew L
2011-08-01
The collapse of Atlantic cod (Gadus morhua) wild populations strongly impacted the Atlantic cod fishery and led to the development of cod aquaculture. In order to improve aquaculture and broodstock quality, we need to gain knowledge of genes and pathways involved in Atlantic cod responses to pathogens and other stressors. The Atlantic Cod Genomics and Broodstock Development Project has generated over 150,000 expressed sequence tags from 42 cDNA libraries representing various tissues, developmental stages, and stimuli. We used this resource to develop an Atlantic cod oligonucleotide microarray containing 20,000 unique probes. Selection of sequences from the full range of cDNA libraries enables application of the microarray for a broad spectrum of Atlantic cod functional genomics studies. We included sequences that were highly abundant in suppression subtractive hybridization (SSH) libraries, which were enriched for transcripts responsive to pathogens or other stressors. These sequences represent genes that potentially play an important role in stress and/or immune responses, making the microarray particularly useful for studies of Atlantic cod gene expression responses to immune stimuli and other stressors. To demonstrate its value, we used the microarray to analyze the Atlantic cod spleen response to stimulation with formalin-killed, atypical Aeromonas salmonicida, resulting in a gene expression profile that indicates a strong innate immune response. These results were further validated by quantitative PCR analysis and comparison to results from previous analysis of an SSH library. This study shows that the Atlantic cod 20K oligonucleotide microarray is a valuable new tool for Atlantic cod functional genomics research.
Inagaki, Tetsunori; Kusunoki, Soshi; Tabu, Kouichi; Okabe, Hitomi; Yamada, Izumi; Taga, Tetsuya; Matsumoto, Akemi; Makino, Shintaro; Takeda, Satoru; Kato, Kiyoko
2016-01-01
The continual proliferation and differentiation of trophoblasts are critical for the maintenance of pregnancy. It is well known that the tissue stem cells are associated with the development of tissues and pathologies. It has been demonstrated that side-population (SP) cells identified by fluorescence-activated cell sorting (FACS) are enriched with stem cells. The SP cells in HTR-8/SVneo cells derived from human primary trophoblast cells were isolated by FACS. HTR-8/SVneo-SP cell cultures generated both SP and non-SP (NSP) subpopulations. In contrast, NSP cell cultures produced NSP cells and failed to produce SP cells. These SP cells showed self-renewal capability by serial colony-forming assay. Microarray expression analysis using a set of HTR-8/SVneo-SP and -NSP cells revealed that SP cells overexpressed several stemness genes including caudal type homeobox2 (CDX2) and bone morphogenic proteins (BMPs), and lymphocyte antigen 6 complex locus D (LY6D) gene was the most highly up-regulated in HTR-8/SVneo-SP cells. LY6D gene reduced its expression in the course of a 7-day cultivation in differentiation medium. SP cells tended to reduce its fraction by treatment of LY6D siRNA indicating that LY6D had potential to maintain cell proliferation of HTR-8/SVneo-SP cells. On ontology analysis, epithelial-mesenchymal transition (EMT) pathway was involved in the up-regulated genes on microarray analysis. HTR-SVneo-SP cells showed enhanced migration. This is the first report that LY6D was important for the maintenance of HTR-8/SVneo-SP cells. EMT was associated with the phenotype of these SP cells.
Sääf, Annika M.; Tengvall-Linder, Maria; Chang, Howard Y.; Adler, Adam S.; Wahlgren, Carl-Fredrik; Scheynius, Annika; Nordenskjöld, Magnus; Bradley, Maria
2008-01-01
Background Atopic eczema (AE) is a common chronic inflammatory skin disorder. In order to dissect the genetic background several linkage and genetic association studies have been performed. Yet very little is known about specific genes involved in this complex skin disease, and the underlying molecular mechanisms are not fully understood. Methodology/Findings We used human DNA microarrays to identify a molecular picture of the programmed responses of the human genome to AE. The transcriptional program was analyzed in skin biopsy samples from lesional and patch-tested skin from AE patients sensitized to Malassezia sympodialis (M. sympodialis), and corresponding biopsies from healthy individuals. The most notable feature of the global gene-expression pattern observed in AE skin was a reciprocal expression of induced inflammatory genes and repressed lipid metabolism genes. The overall transcriptional response in M. sympodialis patch-tested AE skin was similar to the gene-expression signature identified in lesional AE skin. In the constellation of genes differentially expressed in AE skin compared to healthy control skin, we have identified several potential susceptibility genes that may play a critical role in the pathological condition of AE. Many of these genes, including genes with a role in immune responses, lipid homeostasis, and epidermal differentiation, are localized on chromosomal regions previously linked to AE. Conclusions/Significance Through genome-wide expression profiling, we were able to discover a distinct reciprocal expression pattern of induced inflammatory genes and repressed lipid metabolism genes in skin from AE patients. We found a significant enrichment of differentially expressed genes in AE with cytobands associated to the disease, and furthermore new chromosomal regions were found that could potentially guide future region-specific linkage mapping in AE. The full data set is available at http://microarray-pubs.stanford.edu/eczema. PMID:19107207
Ben-Ari Fuchs, Shani; Lieder, Iris; Stelzer, Gil; Mazor, Yaron; Buzhor, Ella; Kaplan, Sergey; Bogoch, Yoel; Plaschkes, Inbar; Shitrit, Alina; Rappaport, Noa; Kohn, Asher; Edgar, Ron; Shenhav, Liraz; Safran, Marilyn; Lancet, Doron; Guan-Golan, Yaron; Warshawsky, David; Shtrichman, Ronit
2016-03-01
Postgenomics data are produced in large volumes by life sciences and clinical applications of novel omics diagnostics and therapeutics for precision medicine. To move from "data-to-knowledge-to-innovation," a crucial missing step in the current era is, however, our limited understanding of biological and clinical contexts associated with data. Prominent among the emerging remedies to this challenge are the gene set enrichment tools. This study reports on GeneAnalytics™ ( geneanalytics.genecards.org ), a comprehensive and easy-to-apply gene set analysis tool for rapid contextualization of expression patterns and functional signatures embedded in the postgenomics Big Data domains, such as Next Generation Sequencing (NGS), RNAseq, and microarray experiments. GeneAnalytics' differentiating features include in-depth evidence-based scoring algorithms, an intuitive user interface and proprietary unified data. GeneAnalytics employs the LifeMap Science's GeneCards suite, including the GeneCards®--the human gene database; the MalaCards-the human diseases database; and the PathCards--the biological pathways database. Expression-based analysis in GeneAnalytics relies on the LifeMap Discovery®--the embryonic development and stem cells database, which includes manually curated expression data for normal and diseased tissues, enabling advanced matching algorithm for gene-tissue association. This assists in evaluating differentiation protocols and discovering biomarkers for tissues and cells. Results are directly linked to gene, disease, or cell "cards" in the GeneCards suite. Future developments aim to enhance the GeneAnalytics algorithm as well as visualizations, employing varied graphical display items. Such attributes make GeneAnalytics a broadly applicable postgenomics data analyses and interpretation tool for translation of data to knowledge-based innovation in various Big Data fields such as precision medicine, ecogenomics, nutrigenomics, pharmacogenomics, vaccinomics, and others yet to emerge on the postgenomics horizon.
Loughridge, Alice B.; Greenwood, Benjamin N.; Day, Heidi E. W.; McQueen, Matthew B.; Fleshner, Monika
2013-01-01
Serotonin (5-HT) is implicated in the development of stress-related mood disorders in humans. Physical activity reduces the risk of developing stress-related mood disorders, such as depression and anxiety. In rats, 6 weeks of wheel running protects against stress-induced behaviors thought to resemble symptoms of human anxiety and depression. The mechanisms by which exercise confers protection against stress-induced behaviors, however, remain unknown. One way by which exercise could generate stress resistance is by producing plastic changes in gene expression in the dorsal raphe nucleus (DRN). The DRN has a high concentration of 5-HT neurons and is implicated in stress-related mood disorders. The goal of the current experiment was to identify changes in the expression of genes that could be novel targets of exercise-induced stress resistance in the DRN. Adult, male F344 rats were allowed voluntary access to running wheels for 6 weeks; exposed to inescapable stress or no stress; and sacrificed immediately and 2 h after stressor termination. Laser capture micro dissection selectively sampled the DRN. mRNA expression was measured using the whole genome Affymetrix microarray. Comprehensive data analyses of gene expression included differential gene expression, log fold change (LFC) contrast analyses with False Discovery Rate correction, KEGG and Wiki Web Gestalt pathway enrichment analyses, and Weighted Gene Correlational Network Analysis (WGCNA). Our results suggest that physically active rats exposed to stress modulate expression of twice the number of genes, and display a more rapid and strongly coordinated response, than sedentary rats. Bioinformatics analyses revealed several potential targets of stress resistance including genes that are related to immune processes, tryptophan metabolism, and circadian/diurnal rhythms. PMID:23717271
USDA-ARS?s Scientific Manuscript database
A bacterial artificial chromosome (BAC) library and BAC-end sequences for Gossypium hirsutum L. have recently been developed. Here we report on genomic-based genome-wide SNP mining utilizing re-sequencing data with a BAC-end sequence reference for twelve G. hirsutum L. lines, one G. barbadense L. li...
A Groupwise Association Test for Rare Mutations Using a Weighted Sum Statistic
Madsen, Bo Eskerod; Browning, Sharon R.
2009-01-01
Resequencing is an emerging tool for identification of rare disease-associated mutations. Rare mutations are difficult to tag with SNP genotyping, as genotyping studies are designed to detect common variants. However, studies have shown that genetic heterogeneity is a probable scenario for common diseases, in which multiple rare mutations together explain a large proportion of the genetic basis for the disease. Thus, we propose a weighted-sum method to jointly analyse a group of mutations in order to test for groupwise association with disease status. For example, such a group of mutations may result from resequencing a gene. We compare the proposed weighted-sum method to alternative methods and show that it is powerful for identifying disease-associated genes, both on simulated and Encode data. Using the weighted-sum method, a resequencing study can identify a disease-associated gene with an overall population attributable risk (PAR) of 2%, even when each individual mutation has much lower PAR, using 1,000 to 7,000 affected and unaffected individuals, depending on the underlying genetic model. This study thus demonstrates that resequencing studies can identify important genetic associations, provided that specialised analysis methods, such as the weighted-sum method, are used. PMID:19214210
2005-03-01
and EpCAM-linked magnetic beads to separate the cells. Success is assessed on flow cytometry using 2G3, Laminin, FAPa and CK7 markers. On the array...that are >90% enriched for CK7 in the epithelial component, and >80% FAPa for the non-epithelial component. At this moment, however, we have not got
Possible molecular mechanism underlying cadmium-induced circadian rhythms disruption in zebrafish
DOE Office of Scientific and Technical Information (OSTI.GOV)
Xiao, Bo; Chen, Tian-Ming; Zhong, Yingbin
This study was aimed to explore the mechanisms underlying cadmium-induced circadian rhythms disruption. Two groups of zebrafish larvae treated with or without 5 ppm CdCl{sub 2} were incubated in a photoperiod of 14-h light/10-h dark conditions. The mRNA levels of clock1a, bmal1b, per2 and per1b in two groups were determined. Microarray data were generated in two group of samples. Differential expression of genes were identified and the changes in expression level for some genes were validated by RT-PCR. Finally, Gene Ontology functional and KEGG pathway enrichment analysis of differentially expressed genes (DEGs) were performed. In comparison with normal group, the mRNAmore » levels of clock1a, bmal1b, and per2 were significantly changed and varied over the circadian cycle in CdCl2-treated group. DEGs were obtained from the light (84 h, ZT12) and dark (88 h, ZT16) phase. In addition, G-protein coupled receptor protein signaling pathway and immune response were both enriched by DEGs in both groups. While, proteolysis and amino acid metabolism were found associated with DEGs in light phase, and Neuroactive ligand-receptor interaction and oxidation-reduction process were significantly enriched by DEGs in dark phase. Besides, the expression pattern of genes including hsp70l and or115-11 obtained by RT-PCR were consistent with those obtained by microarray analysis. As a consequence, cadmium could make significant effects on circadian rhythms through immune response and G protein-coupled receptor signaling pathway. Besides, between the dark and the light phase, the mechanism by which cadmium inducing disruption of circadian rhythms were different to some extent. - Highlights: • Cadmium could affect the expression levels of circadian rhythm-related genes. • Genes expression in microarray data were consistent with those in RT-PCR analysis. • Immune response and G protein-coupled receptor signaling pathway were identified. • Cadmium induces circadian rhythm disruption by different mechanism in day and night.« less
Rascalou, Adeline; Lamartine, Jérôme; Poydenot, Pauline; Demarne, Frédéric; Bechetoille, Nicolas
2018-05-05
Artificial visible light is everywhere in modern life. Social communication confronts us with screens of all kinds, and their use is on the rise. We are therefore increasingly exposed to artificial visible light, the effects of which on skin are poorly known. The purpose of this study was to model the artificial visible light emitted by electronic devices and assess its effect on normal human fibroblasts. The spectral irradiance emitted by electronic devices was optically measured and equipment was developed to accurately reproduce such artificial visible light. Effects on normal human fibroblasts were analyzed on human genome microarray-based gene expression analysis. At cellular level, visualization and image analysis were performed on the mitochondrial network and F-actin cytoskeleton. Cell proliferation, ATP release and type I procollagen secretion were also measured. We developed a device consisting of 36 LEDs simultaneously emitting blue, green and red light at distinct wavelengths (450 nm, 525 nm and 625 nm) with narrow spectra and equivalent radiant power for the three colors. A dose of 99 J/cm 2 artificial visible light was selected so as not to induce cell mortality following exposure. Microarray analysis revealed 2984 light-modulated transcripts. Functional annotation of light-responsive genes revealed several enriched functions including, amongst others, the "mitochondria" and "integrin signaling" categories. Selected results were confirmed by real-time quantitative PCR, analyzing 24 genes representing these two categories. Analysis of micro-patterned culture plates showed marked fragmentation of the mitochondrial network and disorganization of the F-actin cytoskeleton following exposure. Functionally, there was considerable impairment of cell growth and spread, ATP release and type I procollagen secretion in exposed fibroblasts. Artificial visible light induces drastic molecular and cellular changes in normal human fibroblasts. This may impede normal cellular functions and contribute to premature skin aging. The present results extend our knowledge of the effects of the low-energy wavelengths that are increasingly used to treat skin disorders. Copyright © 2018 Japanese Society for Investigative Dermatology. Published by Elsevier B.V. All rights reserved.
2012-01-01
Background Drug resistance in the malaria parasite Plasmodium falciparum severely compromises the treatment and control of malaria. A knowledge of the critical mutations conferring resistance to particular drugs is important in understanding modes of drug action and mechanisms of resistances. They are required to design better therapies and limit drug resistance. A mutation in the gene (pfcrt) encoding a membrane transporter has been identified as a principal determinant of chloroquine resistance in P. falciparum, but we lack a full account of higher level chloroquine resistance. Furthermore, the determinants of resistance in the other major human malaria parasite, P. vivax, are not known. To address these questions, we investigated the genetic basis of chloroquine resistance in an isogenic lineage of rodent malaria parasite P. chabaudi in which high level resistance to chloroquine has been progressively selected under laboratory conditions. Results Loci containing the critical genes were mapped by Linkage Group Selection, using a genetic cross between the high-level chloroquine-resistant mutant and a genetically distinct sensitive strain. A novel high-resolution quantitative whole-genome re-sequencing approach was used to reveal three regions of selection on chr11, chr03 and chr02 that appear progressively at increasing drug doses on three chromosomes. Whole-genome sequencing of the chloroquine-resistant parent identified just four point mutations in different genes on these chromosomes. Three mutations are located at the foci of the selection valleys and are therefore predicted to confer different levels of chloroquine resistance. The critical mutation conferring the first level of chloroquine resistance is found in aat1, a putative aminoacid transporter. Conclusions Quantitative trait loci conferring selectable phenotypes, such as drug resistance, can be mapped directly using progressive genome-wide linkage group selection. Quantitative genome-wide short-read genome resequencing can be used to reveal these signatures of drug selection at high resolution. The identities of three genes (and mutations within them) conferring different levels of chloroquine resistance generate insights regarding the genetic architecture and mechanisms of resistance to chloroquine and other drugs. Importantly, their orthologues may now be evaluated for critical or accessory roles in chloroquine resistance in human malarias P. vivax and P. falciparum. PMID:22435897
Ji, Peng; Wei, Yanming; Hua, Yongli; Zhang, Xiaosong; Yao, Wanling; Ma, Qi; Yuan, Ziwen; Wen, Yanqiao; Yang, Chaoxue
2018-01-30
Angelica sinensis (AS), root of Angelica sinensis (Oliv.) Diels, an important kind of Chinese traditional herbal medicine, has been used for women to enrich the blood for thousands of years. It is mainly distributed in Gansu province of China. According to Traditional Chinese medicine usage, unprocessed AS (UAS) and its 4 kinds of processed products (ASs) are all used to treat different diseases or syndromes. The difference among the enriching-blood effects of ASs is unclear. And their exact mechanisms of enriching the blood are not fully understood. In this study, our aim is to compare the enriching-blood effect and explain the related mechanism of ASs, to lay the foundation for the blood deficiency diagnosis and the rational use of ASs in the clinic. ASs were used to intervene the blood deficiency syndrome model mice induced by acetyl phenylhydrazine (APH) and cyclophosphamide (CTX). A novel approach using metabolomics coupled with hematological and biochemical parameters to explain the enriching-blood effect and mechanism of ASs was established. The blood routine examination, ATPase, glucose-6-phosphate dehydrogenase, methemoglobin, glutathion peroxidase, glutathione reductase, and erythropoietin were measured. Two biofluids (plasma and urine) obtained from mice were analyzed with GC-MS. Distinct changes in metabolite patterns of the two biofluids after mice were induced by APH and CTX, and mice were intervened with ASs were analyzed using partial least squares-discriminant analysis. Potential biomarkers were found using a novel method including variable importance in the projection (VIP) >1.0, volcano plot analysis, and significance analysis of microarray. The results of hematological, biochemical parameters and the integrated metabolomics all showed the blood deficiency syndrome model was built successfully, ASs exhibited different degree of enriching-blood effect, and AS pached with alcohol (AAS) exhibited the best enriching-blood effect. 16 metabolites in the plasma and 8 metabolites in the urine were considered as the potential biomarkers. These metabolites were involved in 7 metabolic pathways which were concerned with the different enriching-blood effect mechanisms of ASs. The correlation analysis results confirmed L-Valine (plasma), Linoleic acid (urine), L-Aspartic acid (urine) and Cholesterol (urine) were strong positive or negative associated with biochemical indicators. The enriching-blood effects of ASs are different. The pathological mechanisms of blood deficiency syndrome and the enriching-blood effect mechanism of ASs are involved in 7 metabolic pathways. L-Valine (plasma), Linoleic acid (urine), L-Aspartic acid (urine), Cholesterol (urine) are four important biomarkers being related to the enriching-blood effect of ASs. The combination of VIP, volcano plot analysis and significance analysis of microarray is suitable for screening biomarkers in metabolomics study. They can lay the foundation for clinical practice. Copyright © 2017 Elsevier B.V. All rights reserved.
Mokhtar, Siti Shuhada; Marshall, Christian R.; Phipps, Maude E.; Thiruvahindrapuram, Bhooma; Lionel, Anath C.; Scherer, Stephen W.; Peng, Hoh Boon
2014-01-01
Copy number variation (CNV) has been recognized as a major contributor to human genome diversity. It plays an important role in determining phenotypes and has been associated with a number of common and complex diseases. However CNV data from diverse populations is still limited. Here we report the first investigation of CNV in the indigenous populations from Peninsular Malaysia. We genotyped 34 Negrito genomes from Peninsular Malaysia using the Affymetrix SNP 6.0 microarray and identified 48 putative novel CNVs, consisting of 24 gains and 24 losses, of which 5 were identified in at least 2 unrelated samples. These CNVs appear unique to the Negrito population and were absent in the DGV, HapMap3 and Singapore Genome Variation Project (SGVP) datasets. Analysis of gene ontology revealed that genes within these CNVs were enriched in the immune system (GO:0002376), response to stimulus mechanisms (GO:0050896), the metabolic pathways (GO:0001852), as well as regulation of transcription (GO:0006355). Copy number gains in CNV regions (CNVRs) enriched with genes were significantly higher than the losses (P value <0.001). In view of the small population size, relative isolation and semi-nomadic lifestyles of this community, we speculate that these CNVs may be attributed to recent local adaptation of Negritos from Peninsular Malaysia. PMID:24956385
Mokhtar, Siti Shuhada; Marshall, Christian R; Phipps, Maude E; Thiruvahindrapuram, Bhooma; Lionel, Anath C; Scherer, Stephen W; Peng, Hoh Boon
2014-01-01
Copy number variation (CNV) has been recognized as a major contributor to human genome diversity. It plays an important role in determining phenotypes and has been associated with a number of common and complex diseases. However CNV data from diverse populations is still limited. Here we report the first investigation of CNV in the indigenous populations from Peninsular Malaysia. We genotyped 34 Negrito genomes from Peninsular Malaysia using the Affymetrix SNP 6.0 microarray and identified 48 putative novel CNVs, consisting of 24 gains and 24 losses, of which 5 were identified in at least 2 unrelated samples. These CNVs appear unique to the Negrito population and were absent in the DGV, HapMap3 and Singapore Genome Variation Project (SGVP) datasets. Analysis of gene ontology revealed that genes within these CNVs were enriched in the immune system (GO:0002376), response to stimulus mechanisms (GO:0050896), the metabolic pathways (GO:0001852), as well as regulation of transcription (GO:0006355). Copy number gains in CNV regions (CNVRs) enriched with genes were significantly higher than the losses (P value <0.001). In view of the small population size, relative isolation and semi-nomadic lifestyles of this community, we speculate that these CNVs may be attributed to recent local adaptation of Negritos from Peninsular Malaysia.
Alanyl-tRNA synthetase mutation in a family with dominant distal hereditary motor neuropathy
Zhao, Z.; Hashiguchi, A.; Sakiyama, Y.; Okamoto, Y.; Tokunaga, S.; Zhu, L.; Shen, H.; Takashima, H.
2012-01-01
Objective: To identify a new genetic cause of distal hereditary motor neuropathy (dHMN), which is also known as a variant of Charcot-Marie-Tooth disease (CMT), in a Chinese family. Methods: We investigated a Chinese family with dHMN clinically, electrophysiologically, and genetically. We screened for the mutations of 28 CMT or related pathogenic genes using an originally designed microarray resequencing DNA chip. Results: Investigation of the family history revealed an autosomal dominant transmission pattern. The clinical features of the family included mild weakness and wasting of the distal muscles of the lower limb and foot deformity, without clinical sensory involvement. Electrophysiologic studies revealed motor neuropathy. MRI of the lower limbs showed accentuated fatty infiltration of the gastrocnemius and vastus lateralis muscles. All 4 affected family members had a heterozygous missense mutation c.2677G>A (p.D893N) of alanyl-tRNA synthetase (AARS), which was not found in the 4 unaffected members and control subjects. Conclusion: An AARS mutation caused dHMN in a Chinese family. AARS mutations result in not only a CMT phenotype but also a dHMN phenotype. PMID:22573628
GENE EXPRESSION IN THE TESTES OF NORMOSPERMIC VERSUS TERATOSPERMIC DOMESTIC CATS USING HUMAN cDNA MICROARRAY ANALYSES
B.S. Pukazhenthi1, J. C. Rockett2, M. Ouyang3, D.J. Dix2, J.G. Howard1, P. Georgopoulos4, W.J. J. Welsh3 and D. E. Wildt1
1Department of Reproductiv...
Spinelli, Lionel; Carpentier, Sabrina; Montañana Sanchis, Frédéric; Dalod, Marc; Vu Manh, Thien-Phong
2015-10-19
Recent advances in the analysis of high-throughput expression data have led to the development of tools that scaled-up their focus from single-gene to gene set level. For example, the popular Gene Set Enrichment Analysis (GSEA) algorithm can detect moderate but coordinated expression changes of groups of presumably related genes between pairs of experimental conditions. This considerably improves extraction of information from high-throughput gene expression data. However, although many gene sets covering a large panel of biological fields are available in public databases, the ability to generate home-made gene sets relevant to one's biological question is crucial but remains a substantial challenge to most biologists lacking statistic or bioinformatic expertise. This is all the more the case when attempting to define a gene set specific of one condition compared to many other ones. Thus, there is a crucial need for an easy-to-use software for generation of relevant home-made gene sets from complex datasets, their use in GSEA, and the correction of the results when applied to multiple comparisons of many experimental conditions. We developed BubbleGUM (GSEA Unlimited Map), a tool that allows to automatically extract molecular signatures from transcriptomic data and perform exhaustive GSEA with multiple testing correction. One original feature of BubbleGUM notably resides in its capacity to integrate and compare numerous GSEA results into an easy-to-grasp graphical representation. We applied our method to generate transcriptomic fingerprints for murine cell types and to assess their enrichments in human cell types. This analysis allowed us to confirm homologies between mouse and human immunocytes. BubbleGUM is an open-source software that allows to automatically generate molecular signatures out of complex expression datasets and to assess directly their enrichment by GSEA on independent datasets. Enrichments are displayed in a graphical output that helps interpreting the results. This innovative methodology has recently been used to answer important questions in functional genomics, such as the degree of similarities between microarray datasets from different laboratories or with different experimental models or clinical cohorts. BubbleGUM is executable through an intuitive interface so that both bioinformaticians and biologists can use it. It is available at http://www.ciml.univ-mrs.fr/applications/BubbleGUM/index.html .
DOE Office of Scientific and Technical Information (OSTI.GOV)
Hsia, Chu Chieh; Chizhikov, Vladimir E.; Yang, Amy X.
Hepatitis B virus (HBV), hepatitis C virus (HCV), and human immunodeficiency virus type-1 (HIV-1) are transfusion-transmitted human pathogens that have a major impact on blood safety and public health worldwide. We developed a microarray multiplex assay for the simultaneous detection and discrimination of these three viruses. The microarray consists of 16 oligonucleotide probes, immobilized on a silylated glass slide. Amplicons from multiplex PCR were labeled with Cy-5 and hybridized to the microarray. The assay detected 1 International Unit (IU), 10 IU, 20 IU of HBV, HCV, and HIV-1, respectively, in a single multiplex reaction. The assay also detected and discriminatedmore » the presence of two or three of these viruses in a single sample. Our data represent a proof-of-concept for the possible use of highly sensitive multiplex microarray assay to screen and confirm the presence of these viruses in blood donors and patients.« less
DNA Microarray Detection of 18 Important Human Blood Protozoan Species
Chen, Jun-Hu; Feng, Xin-Yu; Chen, Shao-Hong; Cai, Yu-Chun; Lu, Yan; Zhou, Xiao-Nong; Chen, Jia-Xu; Hu, Wei
2016-01-01
Background Accurate detection of blood protozoa from clinical samples is important for diagnosis, treatment and control of related diseases. In this preliminary study, a novel DNA microarray system was assessed for the detection of Plasmodium, Leishmania, Trypanosoma, Toxoplasma gondii and Babesia in humans, animals, and vectors, in comparison with microscopy and PCR data. Developing a rapid, simple, and convenient detection method for protozoan detection is an urgent need. Methodology/Principal Findings The microarray assay simultaneously identified 18 species of common blood protozoa based on the differences in respective target genes. A total of 20 specific primer pairs and 107 microarray probes were selected according to conserved regions which were designed to identify 18 species in 5 blood protozoan genera. The positive detection rate of the microarray assay was 91.78% (402/438). Sensitivity and specificity for blood protozoan detection ranged from 82.4% (95%CI: 65.9% ~ 98.8%) to 100.0% and 95.1% (95%CI: 93.2% ~ 97.0%) to 100.0%, respectively. Positive predictive value (PPV) and negative predictive value (NPV) ranged from 20.0% (95%CI: 2.5% ~ 37.5%) to 100.0% and 96.8% (95%CI: 95.0% ~ 98.6%) to 100.0%, respectively. Youden index varied from 0.82 to 0.98. The detection limit of the DNA microarrays ranged from 200 to 500 copies/reaction, similar to PCR findings. The concordance rate between microarray data and DNA sequencing results was 100%. Conclusions/Significance Overall, the newly developed microarray platform provides a convenient, highly accurate, and reliable clinical assay for the determination of blood protozoan species. PMID:27911895
Welham, Nathan V.; Ling, Changying; Dawson, John A.; Kendziorski, Christina; Thibeault, Susan L.; Yamashita, Masaru
2015-01-01
The vocal fold (VF) mucosa confers elegant biomechanical function for voice production but is susceptible to scar formation following injury. Current understanding of VF wound healing is hindered by a paucity of data and is therefore often generalized from research conducted in skin and other mucosal systems. Here, using a previously validated rat injury model, expression microarray technology and an empirical Bayes analysis approach, we generated a VF-specific transcriptome dataset to better capture the system-level complexity of wound healing in this specialized tissue. We measured differential gene expression at 3, 14 and 60 days post-injury compared to experimentally naïve controls, pursued functional enrichment analyses to refine and add greater biological definition to the previously proposed temporal phases of VF wound healing, and validated the expression and localization of a subset of previously unidentified repair- and regeneration-related genes at the protein level. Our microarray dataset is a resource for the wider research community and has the potential to stimulate new hypotheses and avenues of investigation, improve biological and mechanistic insight, and accelerate the identification of novel therapeutic targets. PMID:25592437
SoFoCles: feature filtering for microarray classification based on gene ontology.
Papachristoudis, Georgios; Diplaris, Sotiris; Mitkas, Pericles A
2010-02-01
Marker gene selection has been an important research topic in the classification analysis of gene expression data. Current methods try to reduce the "curse of dimensionality" by using statistical intra-feature set calculations, or classifiers that are based on the given dataset. In this paper, we present SoFoCles, an interactive tool that enables semantic feature filtering in microarray classification problems with the use of external, well-defined knowledge retrieved from the Gene Ontology. The notion of semantic similarity is used to derive genes that are involved in the same biological path during the microarray experiment, by enriching a feature set that has been initially produced with legacy methods. Among its other functionalities, SoFoCles offers a large repository of semantic similarity methods that are used in order to derive feature sets and marker genes. The structure and functionality of the tool are discussed in detail, as well as its ability to improve classification accuracy. Through experimental evaluation, SoFoCles is shown to outperform other classification schemes in terms of classification accuracy in two real datasets using different semantic similarity computation approaches.
Striano, Pasquale; Coppola, Antonietta; Paravidino, Roberta; Malacarne, Michela; Gimelli, Stefania; Robbiano, Angela; Traverso, Monica; Pezzella, Marianna; Belcastro, Vincenzo; Bianchi, Amedeo; Elia, Maurizio; Falace, Antonio; Gazzerro, Elisabetta; Ferlazzo, Edoardo; Freri, Elena; Galasso, Roberta; Gobbi, Giuseppe; Molinatto, Cristina; Cavani, Simona; Zuffardi, Orsetta; Striano, Salvatore; Ferrero, Giovanni Battista; Silengo, Margherita; Cavaliere, Maria Luigia; Benelli, Matteo; Magi, Alberto; Piccione, Maria; Dagna Bricarelli, Franca; Coviello, Domenico A; Fichera, Marco; Minetti, Carlo; Zara, Federico
2012-03-01
To perform an extensive search for genomic rearrangements by microarray-based comparative genomic hybridization in patients with epilepsy. Prospective cohort study. Epilepsy centers in Italy. Two hundred seventy-nine patients with unexplained epilepsy, 265 individuals with nonsyndromic mental retardation but no epilepsy, and 246 healthy control subjects were screened by microarray-based comparative genomic hybridization. Identification of copy number variations (CNVs) and gene enrichment. Rare CNVs occurred in 26 patients (9.3%) and 16 healthy control subjects (6.5%) (P = .26). The CNVs identified in patients were larger (P = .03) and showed higher gene content (P = .02) than those in control subjects. The CNVs larger than 1 megabase (P = .002) and including more than 10 genes (P = .005) occurred more frequently in patients than in control subjects. Nine patients (34.6%) among those harboring rare CNVs showed rearrangements associated with emerging microdeletion or microduplication syndromes. Mental retardation and neuropsychiatric features were associated with rare CNVs (P = .004), whereas epilepsy type was not. The CNV rate in patients with epilepsy and mental retardation or neuropsychiatric features is not different from that observed in patients with mental retardation only. Moreover, significant enrichment of genes involved in ion transport was observed within CNVs identified in patients with epilepsy. Patients with epilepsy show a significantly increased burden of large, rare, gene-rich CNVs, particularly when associated with mental retardation and neuropsychiatric features. The limited overlap between CNVs observed in the epilepsy group and those observed in the group with mental retardation only as well as the involvement of specific (ion channel) genes indicate a specific association between the identified CNVs and epilepsy. Screening for CNVs should be performed for diagnostic purposes preferentially in patients with epilepsy and mental retardation or neuropsychiatric features.
Gasc, Cyrielle; Peyretaillade, Eric
2016-01-01
Abstract The recent expansion of next-generation sequencing has significantly improved biological research. Nevertheless, deep exploration of genomes or metagenomic samples remains difficult because of the sequencing depth and the associated costs required. Therefore, different partitioning strategies have been developed to sequence informative subsets of studied genomes. Among these strategies, hybridization capture has proven to be an innovative and efficient tool for targeting and enriching specific biomarkers in complex DNA mixtures. It has been successfully applied in numerous areas of biology, such as exome resequencing for the identification of mutations underlying Mendelian or complex diseases and cancers, and its usefulness has been demonstrated in the agronomic field through the linking of genetic variants to agricultural phenotypic traits of interest. Moreover, hybridization capture has provided access to underexplored, but relevant fractions of genomes through its ability to enrich defined targets and their flanking regions. Finally, on the basis of restricted genomic information, this method has also allowed the expansion of knowledge of nonreference species and ancient genomes and provided a better understanding of metagenomic samples. In this review, we present the major advances and discoveries permitted by hybridization capture and highlight the potency of this approach in all areas of biology. PMID:27105841
Gasc, Cyrielle; Peyretaillade, Eric; Peyret, Pierre
2016-06-02
The recent expansion of next-generation sequencing has significantly improved biological research. Nevertheless, deep exploration of genomes or metagenomic samples remains difficult because of the sequencing depth and the associated costs required. Therefore, different partitioning strategies have been developed to sequence informative subsets of studied genomes. Among these strategies, hybridization capture has proven to be an innovative and efficient tool for targeting and enriching specific biomarkers in complex DNA mixtures. It has been successfully applied in numerous areas of biology, such as exome resequencing for the identification of mutations underlying Mendelian or complex diseases and cancers, and its usefulness has been demonstrated in the agronomic field through the linking of genetic variants to agricultural phenotypic traits of interest. Moreover, hybridization capture has provided access to underexplored, but relevant fractions of genomes through its ability to enrich defined targets and their flanking regions. Finally, on the basis of restricted genomic information, this method has also allowed the expansion of knowledge of nonreference species and ancient genomes and provided a better understanding of metagenomic samples. In this review, we present the major advances and discoveries permitted by hybridization capture and highlight the potency of this approach in all areas of biology. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
She, Zhicai; Li, Li; Meng, Jie; Jia, Zhen; Que, Huayong; Zhang, Guofan
2018-06-06
The Pacific oyster Crassostrea gigas is an important cultivated shellfish. As a euryhaline species, it has evolved adaptive mechanisms responding to the complex and changeable intertidal environment that it inhabits. To investigate the genetic basis of this salinity adaptation mechanism, we conducted a genome-wide association study using phenotypically differentiated populations (hyposalinity and hypersalinity adaptation populations, and control population), and confirmed our results using an independent population, high-resolution melting, and mRNA expression analysis. For the hyposalinity adaptation, we determined 24 genes, including Cg_CLCN7 (chloride channel protein 7) and Cg_AP1 (apoptosis 1 inhibitor), involved in the ion/water channel and transporter mechanisms, free amino acid and reactive oxygen species metabolism, immune responses, and chemical defence. Three SNPs located on these two genes were significantly differentiated between groups, as was Cg_CLCN7. For the hypersalinity adaptation, the biological process for positive regulating the developmental process was enriched. Enriched gene functions were focused on transcriptional regulation, signal transduction, and cell growth and differentiation, including calmodulin (Cg_CaM) and ficolin-2 (Cg_FCN2). These genes and polymorphisms possibly play an important role in oyster hyposalinity and hypersalinity adaptation. They not only further our understanding of salinity adaptation mechanisms but also provide markers for highly adaptable oyster strains suitable for breeding.
Ahmad, Meraj; Sinha, Anubhav; Ghosh, Sreya; Kumar, Vikrant; Davila, Sonia; Yajnik, Chittaranjan S; Chandak, Giriraj R
2017-07-27
Imputation is a computational method based on the principle of haplotype sharing allowing enrichment of genome-wide association study datasets. It depends on the haplotype structure of the population and density of the genotype data. The 1000 Genomes Project led to the generation of imputation reference panels which have been used globally. However, recent studies have shown that population-specific panels provide better enrichment of genome-wide variants. We compared the imputation accuracy using 1000 Genomes phase 3 reference panel and a panel generated from genome-wide data on 407 individuals from Western India (WIP). The concordance of imputed variants was cross-checked with next-generation re-sequencing data on a subset of genomic regions. Further, using the genome-wide data from 1880 individuals, we demonstrate that WIP works better than the 1000 Genomes phase 3 panel and when merged with it, significantly improves the imputation accuracy throughout the minor allele frequency range. We also show that imputation using only South Asian component of the 1000 Genomes phase 3 panel works as good as the merged panel, making it computationally less intensive job. Thus, our study stresses that imputation accuracy using 1000 Genomes phase 3 panel can be further improved by including population-specific reference panels from South Asia.
Chen, Huang-Han; Wu, Chih-Hsing; Tsai, Mei-Ling; Huang, Yi-Jing; Chen, Shu-Hui
2012-10-16
The percentage of glycosylated hemoglobin A1c (%GHbA1c) in human whole blood indicates the average plasma glucose concentration over a prolonged period of time and is used to diagnose diabetes. However, detecting GHbA1c in the whole blood using immunoassays has limited detection sensitivity due to its low percentage in total hemoglobin (tHb) and interference from various glycan moieties in the sample. We have developed a sandwich immunoassay using an antibody microarray on a polydimethylsiloxane (PDMS) substrate modified with fluorinated compounds to detect tHb and glycosylated hemoglobin A1c (GHbA1c) in human whole blood without sample pretreatment. A polyclonal antibody against hemoglobin (Hb) immobilized on PDMS is used as a common capture probe to enrich all forms of Hb followed by detection via monoclonal anti-Hb and specific monoclonal anti-GHbA1c antibodies for tHb and GHbA1c detection, respectively. This method prevents the use of glycan binding molecules and dramatically reduces the background interference, yielding a detection limit of 3.58 ng/mL for tHb and 0.20 ng/mL for GHbA1c. The fluorinated modification on PDMS is superior to the glass substrate and eliminates the need for the blocking step which is required in commercial enzyme linked immunosorbent assay (ELISA) kits. Moreover, the detection sensitivity for GHbA1c is 4-5 orders of magnitude higher, but the required sample amount is 25 times less than the commercial method. On the basis of patient sample data, a good linear correlation between %GHbA1c values determined by our method and the certified high performance liquid chromatography (HPLC) standard method is shown with R(2) > 0.98, indicating the great promise of the developed method for clinical applications.
Bioinformatics/biostatistics: microarray analysis.
Eichler, Gabriel S
2012-01-01
The quantity and complexity of the molecular-level data generated in both research and clinical settings require the use of sophisticated, powerful computational interpretation techniques. It is for this reason that bioinformatic analysis of complex molecular profiling data has become a fundamental technology in the development of personalized medicine. This chapter provides a high-level overview of the field of bioinformatics and outlines several, classic bioinformatic approaches. The highlighted approaches can be aptly applied to nearly any sort of high-dimensional genomic, proteomic, or metabolomic experiments. Reviewed technologies in this chapter include traditional clustering analysis, the Gene Expression Dynamics Inspector (GEDI), GoMiner (GoMiner), Gene Set Enrichment Analysis (GSEA), and the Learner of Functional Enrichment (LeFE).
Dorval, Véronique; Smith, Pascal Y; Delay, Charlotte; Calvo, Ezequiel; Planel, Emmanuel; Zommer, Nadège; Buée, Luc; Hébert, Sébastien S
2012-01-01
The small non-protein-coding microRNAs (miRNAs) have emerged as critical regulators of neuronal differentiation, identity and survival. To date, however, little is known about the genes and molecular networks regulated by neuronal miRNAs in vivo, particularly in the adult mammalian brain. We analyzed whole genome microarrays from mice lacking Dicer, the enzyme responsible for miRNA production, specifically in postnatal forebrain neurons. A total of 755 mRNA transcripts were significantly (P<0.05, FDR<0.25) misregulated in the conditional Dicer knockout mice. Ten genes, including Tnrc6c, Dnmt3a, and Limk1, were validated by real time quantitative RT-PCR. Upregulated transcripts were enriched in nonneuronal genes, which is consistent with previous studies in vitro. Microarray data mining showed that upregulated genes were enriched in biological processes related to gene expression regulation, while downregulated genes were associated with neuronal functions. Molecular pathways associated with neurological disorders, cellular organization and cellular maintenance were altered in the Dicer mutant mice. Numerous miRNA target sites were enriched in the 3'untranslated region (3'UTR) of upregulated genes, the most significant corresponding to the miR-124 seed sequence. Interestingly, our results suggest that, in addition to miR-124, a large fraction of the neuronal miRNome participates, by order of abundance, in coordinated gene expression regulation and neuronal maintenance. Taken together, these results provide new clues into the role of specific miRNA pathways in the regulation of brain identity and maintenance in adult mice.
Carter, Chris J.; France, James; Crean, StJohn; Singhrao, Sim K.
2017-01-01
Periodontal disease is of established etiology in which polymicrobial synergistic ecology has become dysbiotic under the influence of Porphyromonas gingivalis. Following breakdown of the host's protective oral tissue barriers, P. gingivalis migrates to developing inflammatory pathologies that associate with Alzheimer's disease (AD). Periodontal disease is a risk factor for cardiovascular disorders (CVD), type II diabetes mellitus (T2DM), AD and other chronic diseases, whilst T2DM exacerbates periodontitis. This study analyzed the relationship between the P. gingivalis/host interactome and the genes identified in genome-wide association studies (GWAS) for the aforementioned conditions using data from GWASdb (P < 1E-03) and, in some cases, from the NCBI/EBI GWAS database (P < 1E-05). Gene expression data from periodontitis or P. gingivalis microarray was compared to microarray datasets from the AD hippocampus and/or from carotid artery plaques. The results demonstrated that the host genes of the P. gingivalis interactome were significantly enriched in genes deposited in GWASdb genes related to cognitive disorders, AD and dementia, and its co-morbid conditions T2DM, obesity, and CVD. The P. gingivalis/host interactome was also enriched in GWAS genes from the more stringent NCBI-EBI database for AD, atherosclerosis and T2DM. The misregulated genes in periodontitis tissue or P. gingivalis infected macrophages also matched those in the AD hippocampus or atherosclerotic plaques. Together, these data suggest important gene/environment interactions between P. gingivalis and susceptibility genes or gene expression changes in conditions where periodontal disease is a contributory factor. PMID:29311898
Carter, Chris J; France, James; Crean, StJohn; Singhrao, Sim K
2017-01-01
Periodontal disease is of established etiology in which polymicrobial synergistic ecology has become dysbiotic under the influence of Porphyromonas gingivalis . Following breakdown of the host's protective oral tissue barriers, P. gingivalis migrates to developing inflammatory pathologies that associate with Alzheimer's disease (AD). Periodontal disease is a risk factor for cardiovascular disorders (CVD), type II diabetes mellitus (T2DM), AD and other chronic diseases, whilst T2DM exacerbates periodontitis. This study analyzed the relationship between the P. gingivalis /host interactome and the genes identified in genome-wide association studies (GWAS) for the aforementioned conditions using data from GWASdb ( P < 1E-03) and, in some cases, from the NCBI/EBI GWAS database ( P < 1E-05). Gene expression data from periodontitis or P. gingivalis microarray was compared to microarray datasets from the AD hippocampus and/or from carotid artery plaques. The results demonstrated that the host genes of the P. gingivalis interactome were significantly enriched in genes deposited in GWASdb genes related to cognitive disorders, AD and dementia, and its co-morbid conditions T2DM, obesity, and CVD. The P. gingivalis /host interactome was also enriched in GWAS genes from the more stringent NCBI-EBI database for AD, atherosclerosis and T2DM. The misregulated genes in periodontitis tissue or P. gingivalis infected macrophages also matched those in the AD hippocampus or atherosclerotic plaques. Together, these data suggest important gene/environment interactions between P. gingivalis and susceptibility genes or gene expression changes in conditions where periodontal disease is a contributory factor.
Watanabe, Kazuhide; Biesinger, Jacob; Salmans, Michael L.; Roberts, Brian S.; Arthur, William T.; Cleary, Michele; Andersen, Bogi; Xie, Xiaohui; Dai, Xing
2014-01-01
Background Deregulation of canonical Wnt/CTNNB1 (beta-catenin) pathway is one of the earliest events in the pathogenesis of colon cancer. Mutations in APC or CTNNB1 are highly frequent in colon cancer and cause aberrant stabilization of CTNNB1, which activates the transcription of Wnt target genes by binding to chromatin via the TCF/LEF transcription factors. Here we report an integrative analysis of genome-wide chromatin occupancy of CTNNB1 by chromatin immunoprecipitation coupled with high-throughput sequencing (ChIP-seq) and gene expression profiling by microarray analysis upon RNAi-mediated knockdown of CTNNB1 in colon cancer cells. Results We observed 3629 CTNNB1 binding peaks across the genome and a significant correlation between CTNNB1 binding and knockdown-induced gene expression change. Our integrative analysis led to the discovery of a direct Wnt target signature composed of 162 genes. Gene ontology analysis of this signature revealed a significant enrichment of Wnt pathway genes, suggesting multiple feedback regulations of the pathway. We provide evidence that this gene signature partially overlaps with the Lgr5+ intestinal stem cell signature, and is significantly enriched in normal intestinal stem cells as well as in clinical colorectal cancer samples. Interestingly, while the expression of the CTNNB1 target gene set does not correlate with survival, elevated expression of negative feedback regulators within the signature predicts better prognosis. Conclusion Our data provide a genome-wide view of chromatin occupancy and gene regulation of Wnt/CTNNB1 signaling in colon cancer cells. PMID:24651522
Watanabe, Kazuhide; Biesinger, Jacob; Salmans, Michael L; Roberts, Brian S; Arthur, William T; Cleary, Michele; Andersen, Bogi; Xie, Xiaohui; Dai, Xing
2014-01-01
Deregulation of canonical Wnt/CTNNB1 (beta-catenin) pathway is one of the earliest events in the pathogenesis of colon cancer. Mutations in APC or CTNNB1 are highly frequent in colon cancer and cause aberrant stabilization of CTNNB1, which activates the transcription of Wnt target genes by binding to chromatin via the TCF/LEF transcription factors. Here we report an integrative analysis of genome-wide chromatin occupancy of CTNNB1 by chromatin immunoprecipitation coupled with high-throughput sequencing (ChIP-seq) and gene expression profiling by microarray analysis upon RNAi-mediated knockdown of CTNNB1 in colon cancer cells. We observed 3629 CTNNB1 binding peaks across the genome and a significant correlation between CTNNB1 binding and knockdown-induced gene expression change. Our integrative analysis led to the discovery of a direct Wnt target signature composed of 162 genes. Gene ontology analysis of this signature revealed a significant enrichment of Wnt pathway genes, suggesting multiple feedback regulations of the pathway. We provide evidence that this gene signature partially overlaps with the Lgr5+ intestinal stem cell signature, and is significantly enriched in normal intestinal stem cells as well as in clinical colorectal cancer samples. Interestingly, while the expression of the CTNNB1 target gene set does not correlate with survival, elevated expression of negative feedback regulators within the signature predicts better prognosis. Our data provide a genome-wide view of chromatin occupancy and gene regulation of Wnt/CTNNB1 signaling in colon cancer cells.
Zhang, Shi-tao; Zuo, Chao; Li, Wan-nan; Fu, Xue-qi; Xing, Shu; Zhang, Xiao-ping
2016-02-01
To identify key genes related to the effect of estrogen on ovarian cancer. Microarray data (GSE22600) were downloaded from Gene Expression Omnibus. Eight estrogen and seven placebo treatment samples were obtained using a 2 × 2 factorial designs, which contained 2 cell lines (PEO4 and 2008) and 2 treatments (estrogen and placebo). Differentially expressed genes were identified by Bayesian methods, and the genes with P < 0.05 and |log2FC (fold change)| ≥0.5 were chosen as cut-off criterion. Differentially co-expressed genes (DCGs) and differentially regulated genes (DRGs) were, respectively, identified by DCe function and DRsort function in DCGL package. Topological structure analysis was performed on the important transcriptional factors (TFs) and genes in transcriptional regulatory network using tYNA. Functional enrichment analysis was, respectively, performed for DEGs and the important genes using Gene Ontology and KEGG databases. In total, 465 DEGs were identified. Functional enrichment analysis of DEGs indicated that ACVR2B, LTBP1, BMP7 and MYC involved in TGF-beta signaling pathway. The 2285 DCG pairs and 357 DRGs were identified. Topological structure analysis showed that 52 important TFs and 65 important genes were identified. Functional enrichment analysis of the important genes showed that TP53 and MLH1 participated in DNA damage response and the genes (ACVR2B, LTBP1, BMP7 and MYC) involved in TGF-beta signaling pathway. TP53, MLH1, ACVR2B, LTBP1 and BMP7 might participate in the pathogenesis of ovarian cancer.
Sulaiman, Irshad M; Sammons, Scott A; Wohlhueter, Robert M
2008-04-01
We recently developed a set of seven resequencing GeneChips for the rapid sequencing of Variola virus strains in the WHO Repository of the Centers for Disease Control and Prevention. In this study, we attempted to hybridize these GeneChips with some known non-Variola orthopoxvirus isolates, including monkeypox, cowpox, and vaccinia viruses, for rapid detection.
Yang, Huaan; Jian, Jianbo; Li, Xuan; Renshaw, Daniel; Clements, Jonathan; Sweetingham, Mark W; Tan, Cong; Li, Chengdao
2015-09-02
Molecular marker-assisted breeding provides an efficient tool to develop improved crop varieties. A major challenge for the broad application of markers in marker-assisted selection is that the marker phenotypes must match plant phenotypes in a wide range of breeding germplasm. In this study, we used the legume crop species Lupinus angustifolius (lupin) to demonstrate the utility of whole genome sequencing and re-sequencing on the development of diagnostic markers for molecular plant breeding. Nine lupin cultivars released in Australia from 1973 to 2007 were subjected to whole genome re-sequencing. The re-sequencing data together with the reference genome sequence data were used in marker development, which revealed 180,596 to 795,735 SNP markers from pairwise comparisons among the cultivars. A total of 207,887 markers were anchored on the lupin genetic linkage map. Marker mining obtained an average of 387 SNP markers and 87 InDel markers for each of the 24 genome sequence assembly scaffolds bearing markers linked to 11 genes of agronomic interest. Using the R gene PhtjR conferring resistance to phomopsis stem blight disease as a test case, we discovered 17 candidate diagnostic markers by genotyping and selecting markers on a genetic linkage map. A further 243 candidate diagnostic markers were discovered by marker mining on a scaffold bearing non-diagnostic markers linked to the PhtjR gene. Nine out from the ten tested candidate diagnostic markers were confirmed as truly diagnostic on a broad range of commercial cultivars. Markers developed using these strategies meet the requirements for broad application in molecular plant breeding. We demonstrated that low-cost genome sequencing and re-sequencing data were sufficient and very effective in the development of diagnostic markers for marker-assisted selection. The strategies used in this study may be applied to any trait or plant species. Whole genome sequencing and re-sequencing provides a powerful tool to overcome current limitations in molecular plant breeding, which will enable plant breeders to precisely pyramid favourable genes to develop super crop varieties to meet future food demands.
MMP21 is mutated in human heterotaxy and is required for normal left-right asymmetry in vertebrates.
Guimier, Anne; Gabriel, George C; Bajolle, Fanny; Tsang, Michael; Liu, Hui; Noll, Aaron; Schwartz, Molly; El Malti, Rajae; Smith, Laurie D; Klena, Nikolai T; Jimenez, Gina; Miller, Neil A; Oufadem, Myriam; Moreau de Bellaing, Anne; Yagi, Hisato; Saunders, Carol J; Baker, Candice N; Di Filippo, Sylvie; Peterson, Kevin A; Thiffault, Isabelle; Bole-Feysot, Christine; Cooley, Linda D; Farrow, Emily G; Masson, Cécile; Schoen, Patric; Deleuze, Jean-François; Nitschké, Patrick; Lyonnet, Stanislas; de Pontual, Loic; Murray, Stephen A; Bonnet, Damien; Kingsmore, Stephen F; Amiel, Jeanne; Bouvagnet, Patrice; Lo, Cecilia W; Gordon, Christopher T
2015-11-01
Heterotaxy results from a failure to establish normal left-right asymmetry early in embryonic development. By whole-exome sequencing, whole-genome sequencing and high-throughput cohort resequencing, we identified recessive mutations in MMP21 (encoding matrix metallopeptidase 21) in nine index cases with heterotaxy. In addition, Mmp21-mutant mice and mmp21-morphant zebrafish displayed heterotaxy and abnormal cardiac looping, respectively, suggesting a new role for extracellular matrix remodeling in the establishment of laterality in vertebrates.
MMP21 is mutated in human heterotaxy and is required for normal left-right asymmetry in vertebrates
Guimier, Anne; Gabriel, George C.; Bajolle, Fanny; Tsang, Michael; Liu, Hui; Noll, Aaron; Schwartz, Molly; El Malti, Rajae; Smith, Laurie D.; Klena, Nikolai T.; Jimenez, Gina; Miller, Neil A.; Oufadem, Myriam; Moreau de Bellaing, Anne; Yagi, Hisato; Saunders, Carol J.; Baker, Candice N.; Di Filippo, Sylvie; Peterson, Kevin A.; Thiffault, Isabelle; Bole-Feysot, Christine; Cooley, Linda D.; Farrow, Emily G.; Masson, Cécile; Schoen, Patric; Deleuze, Jean-François; Nitschké, Patrick; Lyonnet, Stanislas; de Pontual, Loic; Murray, Stephen A.; Bonnet, Damien; Kingsmore, Stephen F.; Amiel, Jeanne; Bouvagnet, Patrice; Lo, Cecilia W.; Gordon, Christopher T.
2017-01-01
Heterotaxy results from a failure to establish normal left-right asymmetry early in embryonic development. By whole exome sequencing, whole genome sequencing and high-throughput cohort resequencing we identified recessive mutations in matrix metallopeptidase 21 (MMP21), in nine index cases with heterotaxy. In addition, Mmp21 mutant mice and morphant zebrafish display heterotaxy and abnormal cardiac looping, respectively, suggesting a novel role for extra-cellular remodeling in the establishment of laterality in vertebrates. PMID:26437028
NASA Astrophysics Data System (ADS)
2011-12-01
Research on Global Carbon Emission and Sequestration NSFC Funded Project Made Significant Progress in Quantum Dynamics Functional Human Blood Protein Obtained from Rice How Giant Pandas Thrive on a Bamboo Diet New Evidence of Interpersonal Violence from 129,000 Years Ago Found in China Aptamer-Mediated Efficient Capture and Release of T Lymphocytes on Nanostructured Surfaces BGI Study Results on Resequencing 50 Accessions of Rice Cast New Light on Molecular Breeding BGI Reports Study Results on Frequent Mutation of Genes Encoding UMPP Components in Kidney Cancer Research on Habitat Shift Promoting Species Diversification
Separate enrichment analysis of pathways for up- and downregulated genes.
Hong, Guini; Zhang, Wenjing; Li, Hongdong; Shen, Xiaopei; Guo, Zheng
2014-03-06
Two strategies are often adopted for enrichment analysis of pathways: the analysis of all differentially expressed (DE) genes together or the analysis of up- and downregulated genes separately. However, few studies have examined the rationales of these enrichment analysis strategies. Using both microarray and RNA-seq data, we show that gene pairs with functional links in pathways tended to have positively correlated expression levels, which could result in an imbalance between the up- and downregulated genes in particular pathways. We then show that the imbalance could greatly reduce the statistical power for finding disease-associated pathways through the analysis of all-DE genes. Further, using gene expression profiles from five types of tumours, we illustrate that the separate analysis of up- and downregulated genes could identify more pathways that are really pertinent to phenotypic difference. In conclusion, analysing up- and downregulated genes separately is more powerful than analysing all of the DE genes together.
Zhang, Zhaowei; Li, Peiwu; Hu, Xiaofeng; Zhang, Qi; Ding, Xiaoxia; Zhang, Wen
2012-01-01
Chemical contaminants in food have caused serious health issues in both humans and animals. Microarray technology is an advanced technique suitable for the analysis of chemical contaminates. In particular, immuno-microarray approach is one of the most promising methods for chemical contaminants analysis. The use of microarrays for the analysis of chemical contaminants is the subject of this review. Fabrication strategies and detection methods for chemical contaminants are discussed in detail. Application to the analysis of mycotoxins, biotoxins, pesticide residues, and pharmaceutical residues is also described. Finally, future challenges and opportunities are discussed.
A novel piezoelectric quartz micro-array immunosensor for detection of immunoglobulinE.
Yao, Chunyan; Chen, Qinghai; Chen, Ming; Zhang, Bo; Luo, Yang; Huang, Qing; Huang, Junfu; Fu, Weiling
2006-12-01
A novel multi-channel 2 x 5 model of piezoelectric (PZ) micro-array immunosensor has been developed for quantitative detection of human immunoglobulinE (IgE) in serum. Every crystal unit of the fabricated piezoelectric IgE micro-array immunosensor can oscillate without interfering each other. A multi-channel 2 x 5 model micro-array immunosensor as compared with the traditional one-channel immunosensor can provide eight times higher detection speeds for IgE assay. The anti-IgE antibody is deposited on the gold electrode's surface of 10 MHz AT-cut quartz crystals by SPA (staphylococcal protein A), and serves as an antibody recognizing layer. The highly ordered antibody monolayers ensure well-controlled surface structure and offer many advantages to the performance of the sensor. The uniform amount of antibody monolayer coated by the SPA is good, and non-specific reaction caused by other immunoglobulin in sample is found. The fabricated PZ immunosensor can be used for human IgE determination in the range of 5-300 IU/ml with high precision (CV is 4%). 50 human serum samples were detected by the micro-array immunosensor, and the results agreed well with those given by the commercially ELISA test kits. The correlation coefficient is 0.94 between ELISA and PZ immunosensor. After regeneration with NaOH the coated immunosensor can be reused 6 times without appreciable loss of activity.
The Role of Vitamin D in the Transcriptional Program of Human Pregnancy
Al-Garawi, Amal; Carey, Vincent J.; Chhabra, Divya; Morrow, Jarrett; Lasky-Su, Jessica; Qiu, Weiliang; Laranjo, Nancy; Litonjua, Augusto A.; Weiss, Scott T.
2016-01-01
Background Patterns of gene expression of human pregnancy are poorly understood. In a trial of vitamin D supplementation in pregnant women, peripheral blood transcriptomes were measured longitudinally on 30 women and used to characterize gene co-expression networks. Objective Studies suggest that increased maternal Vitamin D levels may reduce the risk of asthma in early life, yet the underlying mechanisms have not been examined. In this study, we used a network-based approach to examine changes in gene expression profiles during the course of normal pregnancy and evaluated their association with maternal Vitamin D levels. Design The VDAART study is a randomized clinical trial of vitamin D supplementation in pregnancy for reduction of pediatric asthma risk. The trial enrolled 881 women at 10–18 weeks of gestation. Longitudinal gene expression measures were obtained on thirty pregnant women, using RNA isolated from peripheral blood samples obtained in the first and third trimesters. Differentially expressed genes were identified using significance of analysis of microarrays (SAM), and clustered using a weighted gene co-expression network analysis (WGCNA). Gene-set enrichment was performed to identify major biological pathways. Results Comparison of transcriptional profiles between first and third trimesters of pregnancy identified 5839 significantly differentially expressed genes (FDR<0.05). Weighted gene co-expression network analysis clustered these transcripts into 14 co-expression modules of which two showed significant correlation with maternal vitamin D levels. Pathway analysis of these two modules revealed genes enriched in immune defense pathways and extracellular matrix reorganization as well as genes enriched in notch signaling and transcription factor networks. Conclusion Our data show that gene expression profiles of healthy pregnant women change during the course of pregnancy and suggest that maternal Vitamin D levels influence transcriptional profiles. These alterations of the maternal transcriptome may contribute to fetal immune imprinting and reduce allergic sensitization in early life. Trial Registration clinicaltrials.gov NCT00920621 PMID:27711190
Mitsutake, Norisato; Iwao, Atsuhiko; Nagai, Kazuhiro; Namba, Hiroyuki; Ohtsuru, Akira; Saenko, Vladimir; Yamashita, Shunichi
2007-04-01
There is increasing evidence that cancers contain their own stem-like cells called cancer stem cells (CSCs). A small subset of cells, termed side population (SP), has been identified using flow cytometric analysis. The SP cells have the ability to exclude the DNA binding dye, Hoechst33342, and are highly enriched for stem cells in many kinds of normal tissues. Because CSCs are thought to be drug resistant, SP cells in cancers might contain CSCs. We initially examined the presence of SP cells in several human thyroid cancer cell lines. A small percentage of SP cells were found in ARO (0.25%), FRO (0.1%), NPA (0.06%), and WRO (0.02%) cells but not TPC1 cells. After sorting, the SP cells generated both SP and non-SP cells in culture. The clonogenic ability of SP cells was significantly higher than that of non-SP cells. Moreover, the SP prevalence was dependent on cell density in culture, suggesting that SP cells preferentially survived at lower cell density. Microarray experiment revealed differential gene expression profile between SP and non-SP cells, and several genes related to stemness were up-regulated. However, non-SP population also contained cells that were tumorigenic in nude mice, and non-SP cells generated a small number of SP cells. These results suggest that cancer stem-like cells are partly, but not exclusively, enriched in SP population. Clarifying the key tumorigenic population might contribute to the establishment of a novel therapy for thyroid cancer.
Co-expression analysis reveals key gene modules and pathway of human coronary heart disease.
Tang, Yu; Ke, Zun-Ping; Peng, Yi-Gen; Cai, Ping-Tai
2018-02-01
Coronary heart disease is a kind of disease which causes great injury to people world-widely. Although gene expression analyses had been performed previously, to our best knowledge, systemic co-expression analysis for this disease is still lacking to date. Microarray data of coronary heart disease was downloaded from NCBI with the accession number of GSE20681. Co-expression modules were constructed by WGCNA. Besides, the connectivity degree of eigengenes was analyzed. Furthermore, GO and KEGG enrichment analysis was performed on these eigengenes in these constructed modules. A total of 11 co-expression modules were constructed by the 3000 up-regulated genes from the 99 samples with coronary heart disease. The average number of genes in these modules was 270. The interaction analysis indicated the relative independence of gene expression in these modules. The functional enrichment analysis showed that there was a significant difference in the enriched terms and degree among these 11 modules. The results showed that modules 9 and 10 played critical roles in the occurrence of coronary disease. Pathways of hsa00190 (oxidative phosphorylation) and (hsa01130: biosynthesis of antibiotics) were thought to be closely related to the occurrence and development of coronary heart disease. Our result demonstrated that modules 9 and 10 were the most critical modules in the occurrence of coronary heart disease. Pathways as hsa00190 (oxidative phosphorylation) and (hsa01130: biosynthesis of antibiotics) had the potential to serve as the prognostic and predictive marker of coronary heart disease. © 2017 Wiley Periodicals, Inc.
Picard, Nicolas; Trompf, Katja; Yang, Chao-Ling; Miller, R. Lance; Carrel, Monique; Loffing-Cueni, Dominique; Fenton, Robert A.; Ellison, David H.
2014-01-01
The thiazide-sensitive NaCl cotransporter (NCC) of the renal distal convoluted tubule (DCT) controls ion homeostasis and arterial BP. Loss-of-function mutations of NCC cause renal salt wasting with arterial hypotension (Gitelman syndrome). Conversely, mutations in the NCC-regulating WNK kinases or kelch-like 3 protein cause familial hyperkalemic hypertension. Here, we performed automated sorting of mouse DCTs and microarray analysis for comprehensive identification of novel DCT-enriched gene products, which may potentially regulate DCT and NCC function. This approach identified protein phosphatase 1 inhibitor-1 (I-1) as a DCT-enriched transcript, and immunohistochemistry revealed I-1 expression in mouse and human DCTs and thick ascending limbs. In heterologous expression systems, coexpression of NCC with I-1 increased thiazide-dependent Na+ uptake, whereas RNAi-mediated knockdown of endogenous I-1 reduced NCC phosphorylation. Likewise, levels of phosphorylated NCC decreased by approximately 50% in I-1 (I-1−/−) knockout mice without changes in total NCC expression. The abundance and phosphorylation of other renal sodium-transporting proteins, including NaPi-IIa, NKCC2, and ENaC, did not change, although the abundance of pendrin increased in these mice. The abundance, phosphorylation, and subcellular localization of SPAK were similar in wild-type (WT) and I-1−/− mice. Compared with WT mice, I-1−/− mice exhibited significantly lower arterial BP but did not display other metabolic features of NCC dysregulation. Thus, I-1 is a DCT-enriched gene product that controls arterial BP, possibly through regulation of NCC activity. PMID:24231659
Profiling ethanol-targeted transcription factors in human carcinoma cell-derived embryoid bodies.
Mandal, Chanchal; Halder, Debasish; Chai, Jin Choul; Lee, Young Seek; Jung, Kyoung Hwa; Chai, Young Gyu
2016-01-15
Fetal alcohol spectrum disorder is a collective term that represents fetal abnormalities associated with maternal alcohol consumption. Prenatal alcohol exposure and related anomalies are well characterized, but the molecular mechanism behind this phenomenon is not yet understood. Few insights have been gained from genetic and epigenetic studies of fetal alcohol spectrum disorder. Our aim was to profile the important molecular regulators of ethanol-related alterations of the genome. For this purpose, we have analyzed the gene expression pattern of human carcinoma cell-derived embryoid bodies in the absence or presence of ethanol. A cDNA microarray analysis was used to profile mRNA expression in embryoid bodies at day 7 with or without ethanol treatment. A total of 493 differentially expressed genes were identified in response to 50 mM ethanol exposure. Of these, 111 genes were up-regulated, and 382 were down-regulated. Gene ontology term enrichment analysis revealed that these genes are involved in important biological processes: neurological system processes, cognition, behavior, sensory perception of smell, taste and chemical stimuli and synaptic transmission. Similarly, the enrichment of disease-related genes included relevant categories such as neurological diseases, developmental disorders, skeletal and muscular disorders, and connective tissue disorders. Furthermore, we have identified a group of 26 genes that encode transcription factors. We validated the relative gene expression of several transcription factors using quantitative real time PCR. We hope that our study substantially contributes to the understanding of the molecular mechanisms underlying the pathology of alcohol-mediated anomalies and facilitates further research. Copyright © 2015 Elsevier B.V. All rights reserved.
Ojamies, P N; Kontro, M; Edgren, H; Ellonen, P; Lagström, S; Almusa, H; Miettinen, T; Eldfors, S; Tamborero, D; Wennerberg, K; Heckman, C; Porkka, K; Wolf, M; Kallioniemi, O
2017-05-01
In our individualized systems medicine program, personalized treatment options are identified and administered to chemorefractory acute myeloid leukemia (AML) patients based on exome sequencing and ex vivo drug sensitivity and resistance testing data. Here, we analyzed how clonal heterogeneity affects the responses of 13 AML patients to chemotherapy or targeted treatments using ultra-deep (average 68 000 × coverage) amplicon resequencing. Using amplicon resequencing, we identified 16 variants from 4 patients (frequency 0.54-2%) that were not detected previously by exome sequencing. A correlation-based method was developed to detect mutation-specific responses in serial samples across multiple time points. Significant subclone-specific responses were observed for both chemotherapy and targeted therapy. We detected subclonal responses in patients where clinical European LeukemiaNet (ELN) criteria showed no response. Subclonal responses also helped to identify putative mechanisms underlying drug sensitivities, such as sensitivity to azacitidine in DNMT3A mutated cell clones and resistance to cytarabine in a subclone with loss of NF1 gene. In summary, ultra-deep amplicon resequencing method enables sensitive quantification of subclonal variants and their responses to therapies. This approach provides new opportunities for designing combinatorial therapies blocking multiple subclones as well as for real-time assessment of such treatments.
Abo, Ryan P; Ducar, Matthew; Garcia, Elizabeth P; Thorner, Aaron R; Rojas-Rudilla, Vanesa; Lin, Ling; Sholl, Lynette M; Hahn, William C; Meyerson, Matthew; Lindeman, Neal I; Van Hummelen, Paul; MacConaill, Laura E
2015-02-18
Genomic structural variation (SV), a common hallmark of cancer, has important predictive and therapeutic implications. However, accurately detecting SV using high-throughput sequencing data remains challenging, especially for 'targeted' resequencing efforts. This is critically important in the clinical setting where targeted resequencing is frequently being applied to rapidly assess clinically actionable mutations in tumor biopsies in a cost-effective manner. We present BreaKmer, a novel approach that uses a 'kmer' strategy to assemble misaligned sequence reads for predicting insertions, deletions, inversions, tandem duplications and translocations at base-pair resolution in targeted resequencing data. Variants are predicted by realigning an assembled consensus sequence created from sequence reads that were abnormally aligned to the reference genome. Using targeted resequencing data from tumor specimens with orthogonally validated SV, non-tumor samples and whole-genome sequencing data, BreaKmer had a 97.4% overall sensitivity for known events and predicted 17 positively validated, novel variants. Relative to four publically available algorithms, BreaKmer detected SV with increased sensitivity and limited calls in non-tumor samples, key features for variant analysis of tumor specimens in both the clinical and research settings. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
Equalizer reduces SNP bias in Affymetrix microarrays.
Quigley, David
2015-07-30
Gene expression microarrays measure the levels of messenger ribonucleic acid (mRNA) in a sample using probe sequences that hybridize with transcribed regions. These probe sequences are designed using a reference genome for the relevant species. However, most model organisms and all humans have genomes that deviate from their reference. These variations, which include single nucleotide polymorphisms, insertions of additional nucleotides, and nucleotide deletions, can affect the microarray's performance. Genetic experiments comparing individuals bearing different population-associated single nucleotide polymorphisms that intersect microarray probes are therefore subject to systemic bias, as the reduction in binding efficiency due to a technical artifact is confounded with genetic differences between parental strains. This problem has been recognized for some time, and earlier methods of compensation have attempted to identify probes affected by genome variants using statistical models. These methods may require replicate microarray measurement of gene expression in the relevant tissue in inbred parental samples, which are not always available in model organisms and are never available in humans. By using sequence information for the genomes of organisms under investigation, potentially problematic probes can now be identified a priori. However, there is no published software tool that makes it easy to eliminate these probes from an annotation. I present equalizer, a software package that uses genome variant data to modify annotation files for the commonly used Affymetrix IVT and Gene/Exon platforms. These files can be used by any microarray normalization method for subsequent analysis. I demonstrate how use of equalizer on experiments mapping germline influence on gene expression in a genetic cross between two divergent mouse species and in human samples significantly reduces probe hybridization-induced bias, reducing false positive and false negative findings. The equalizer package reduces probe hybridization bias from experiments performed on the Affymetrix microarray platform, allowing accurate assessment of germline influence on gene expression.
Detecting and Genotyping Escherichia coli O157:H7 using multiplexed PCR and nucleic acid microarrays
DOE Office of Scientific and Technical Information (OSTI.GOV)
Call, Douglas R.; Brockman, Fred J.; Chandler, Darrell P.
2000-12-01
Rapid detection and characterization of food borne pathogens such as Escherichia coli O157:H7 is crucial for epidemiological investigations and food safety surveillance. As an alternative to conventional technologies, we examined the sensitivity and specificity of nucleic acid microarrays for detecting and genotyping E. coli O157:H7. The array was composed of oligonucleotide probes (25-30 mer) complementary to four virulence loci (intimin, Shiga-like toxins I and II, and hemolysin A). Target DNA was amplified from whole cells or from purified DNA via single or multiplexed polymerase chain reaction (PCR), and PCR products were hybridized to the array without further modification or purification.more » The array was 32-fold more sensitive than gel electrophoresis and capable of detecting amplification products from < 1 cell equivalent of genomic DNA (1 fg). Immunomagnetic capture, PCR and a microarray were subsequently used to detect 55 CFU ml-1 (E. coli O157:H7) from chicken rinsate without the aid of pre-enrichment. Four isolates of E. coli O157:H7 and one isolate of O91:H2, for which genotypic data were available, were unambiguously genotyped with this array. Glass based microarrays are relatively simple to construct and provide a rapid and sensitive means to detect multiplexed PCR products and the system is amenable to automation.« less
Detecting and genotyping Escherichia coli O157:H7 using multiplexed PCR and nucleic acid microarrays
DOE Office of Scientific and Technical Information (OSTI.GOV)
Call, Douglas R.; Brockman, Fred J.; Chandler, Darrell P.
2001-07-05
Rapid detection and characterization of food borne pathogens such as Escherichia coli O157:H7 is crucial for epidemiological investigations and food safety surveillance. As an alternative to conventional technologies, we examined the sensitivity and specificity of nucleic acid microarrays for detecting and genotyping E. coli O157:H7. The array was composed of oligonucleotide probes (25-30 mer) complementary to four virulence loci (intimin, Shiga-like toxins I and II, and hemolysin A). Target DNA was amplified from whole cells or from purified DNA via single or multiplexed polymerase chain reaction (PCR), and PCR products were hybridized to the array without further modification or purification.more » The array was 32-fold more sensitive than gel electrophoresis and capable of detecting amplification products from < 1 cell equivalent of genomic DNA (1 fg). Immunomagnetic capture, PCR and a microarray were subsequently used to detect 55 CFUs ml-1 (E. coli O157:H7) from chicken rinsate without the aid of pre-enrichment. Four isolates of E. coli O157:H7 and one isolate of O91:H2, for which genotypic data were available, were unambiguously genotyped with this array. Glass based microarrays are relatively simple to construct and provide a rapid and sensitive means to detect multiplexed PCR products and the system is amenable to automation.« less
Role of PELP1 in EGFR-ER Signaling Crosstalk in Ovarian Cancer Cells
2009-04-01
expression of genes involved in metastasis using a focused microarray approach. We have used Human Tumor Metastasis Microarray (Oligo GE array from...ovarian cancer progression. Analysis of human genome databases and SAGE data suggested deregulation of PELP1 expression in ovarian cancer cells...PI3K, and STAT3 in the cytosol. PELP1/MNAR regulates meiosis via its interactions with heterotimeric Gbc protein, androgen receptor (AR), and by
Trayhurn, Paul; Denyer, Gareth
2012-01-01
Microarray datasets are a rich source of information in nutritional investigation. Targeted mining of microarray data following initial, non-biased bioinformatic analysis can provide key insight into specific genes and metabolic processes of interest. Microarrays from human adipocytes were examined to explore the effects of macrophage secretions on the expression of the G-protein-coupled receptor (GPR) genes that encode fatty acid receptors/sensors. Exposure of the adipocytes to macrophage-conditioned medium for 4 or 24 h had no effect on GPR40 and GPR43 expression, but there was a marked stimulation of GPR84 expression (receptor for medium-chain fatty acids), the mRNA level increasing 13·5-fold at 24 h relative to unconditioned medium. Importantly, expression of GPR120, which encodes an n-3 PUFA receptor/sensor, was strongly inhibited by the conditioned medium (15-fold decrease in mRNA at 24 h). Macrophage secretions have major effects on the expression of fatty acid receptor/sensor genes in human adipocytes, which may lead to an augmentation of the inflammatory response in adipose tissue in obesity.
Trayhurn, Paul; Denyer, Gareth
2012-01-01
Microarray datasets are a rich source of information in nutritional investigation. Targeted mining of microarray data following initial, non-biased bioinformatic analysis can provide key insight into specific genes and metabolic processes of interest. Microarrays from human adipocytes were examined to explore the effects of macrophage secretions on the expression of the G-protein-coupled receptor (GPR) genes that encode fatty acid receptors/sensors. Exposure of the adipocytes to macrophage-conditioned medium for 4 or 24 h had no effect on GPR40 and GPR43 expression, but there was a marked stimulation of GPR84 expression (receptor for medium-chain fatty acids), the mRNA level increasing 13·5-fold at 24 h relative to unconditioned medium. Importantly, expression of GPR120, which encodes an n-3 PUFA receptor/sensor, was strongly inhibited by the conditioned medium (15-fold decrease in mRNA at 24 h). Macrophage secretions have major effects on the expression of fatty acid receptor/sensor genes in human adipocytes, which may lead to an augmentation of the inflammatory response in adipose tissue in obesity. PMID:25191551
Development and application of a DNA microarray-based yeast two-hybrid system
Suter, Bernhard; Fontaine, Jean-Fred; Yildirimman, Reha; Raskó, Tamás; Schaefer, Martin H.; Rasche, Axel; Porras, Pablo; Vázquez-Álvarez, Blanca M.; Russ, Jenny; Rau, Kirstin; Foulle, Raphaele; Zenkner, Martina; Saar, Kathrin; Herwig, Ralf; Andrade-Navarro, Miguel A.; Wanker, Erich E.
2013-01-01
The yeast two-hybrid (Y2H) system is the most widely applied methodology for systematic protein–protein interaction (PPI) screening and the generation of comprehensive interaction networks. We developed a novel Y2H interaction screening procedure using DNA microarrays for high-throughput quantitative PPI detection. Applying a global pooling and selection scheme to a large collection of human open reading frames, proof-of-principle Y2H interaction screens were performed for the human neurodegenerative disease proteins huntingtin and ataxin-1. Using systematic controls for unspecific Y2H results and quantitative benchmarking, we identified and scored a large number of known and novel partner proteins for both huntingtin and ataxin-1. Moreover, we show that this parallelized screening procedure and the global inspection of Y2H interaction data are uniquely suited to define specific PPI patterns and their alteration by disease-causing mutations in huntingtin and ataxin-1. This approach takes advantage of the specificity and flexibility of DNA microarrays and of the existence of solid-related statistical methods for the analysis of DNA microarray data, and allows a quantitative approach toward interaction screens in human and in model organisms. PMID:23275563
The genome of Eucalyptus grandis
DOE Office of Scientific and Technical Information (OSTI.GOV)
Myburg, Alexander A.; Grattapaglia, Dario; Tuskan, Gerald A.
Eucalypts are the world s most widely planted hardwood trees. Their broad adaptability, rich species diversity, fast growth and superior multipurpose wood, have made them a global renewable resource of fiber and energy that mitigates human pressures on natural forests. We sequenced and assembled >94% of the 640 Mbp genome of Eucalyptus grandis into its 11 chromosomes. A set of 36,376 protein coding genes were predicted revealing that 34% occur in tandem duplications, the largest proportion found thus far in any plant genome. Eucalypts also show the highest diversity of genes for plant specialized metabolism that act as chemical defencemore » against biotic agents and provide unique pharmaceutical oils. Resequencing of a set of inbred tree genomes revealed regions of strongly conserved heterozygosity, likely hotspots of inbreeding depression. The resequenced genome of the sister species E. globulus underscored the high inter-specific genome colinearity despite substantial genome size variation in the genus. The genome of E. grandis is the first reference for the early diverging Rosid order Myrtales and is placed here basal to the Eurosids. This resource expands knowledge on the unique biology of large woody perennials and provides a powerful tool to accelerate comparative biology, breeding and biotechnology.« less
Hilger, Alina C; Halbritter, Jan; Pennimpede, Tracie; van der Ven, Amelie; Sarma, Georgia; Braun, Daniela A; Porath, Jonathan D; Kohl, Stefan; Hwang, Daw-Yang; Dworschak, Gabriel C; Hermann, Bernhard G; Pavlova, Anna; El-Maarri, Osman; Nöthen, Markus M; Ludwig, Michael; Reutter, Heiko; Hildebrandt, Friedhelm
2015-12-01
The VATER/VACTERL association describes the combination of congenital anomalies including vertebral defects, anorectal malformations, cardiac defects, tracheoesophageal fistula with or without esophageal atresia, renal malformations, and limb defects. As mutations in ciliary genes were observed in diseases related to VATER/VACTERL, we performed targeted resequencing of 25 ciliary candidate genes as well as disease-associated genes (FOXF1, HOXD13, PTEN, ZIC3) in 123 patients with VATER/VACTERL or VATER/VACTERL-like phenotype. We detected no biallelic mutation in any of the 25 ciliary candidate genes; however, identified an identical, probably disease-causing ZIC3 missense mutation (p.Gly17Cys) in four patients and a FOXF1 de novo mutation (p.Gly220Cys) in a further patient. In situ hybridization analyses in mouse embryos between E9.5 and E14.5 revealed Zic3 expression in limb and prevertebral structures, and Foxf1 expression in esophageal, tracheal, vertebral, anal, and genital tubercle tissues, hence VATER/VACTERL organ systems. These data provide strong evidence that mutations in ZIC3 or FOXF1 contribute to VATER/VACTERL. © 2015 WILEY PERIODICALS, INC.
Is the child 'father of the man'? evaluating the stability of genetic influences across development.
Ronald, Angelica
2011-11-01
This selective review considers findings in genetic research that have shed light on how genes operate across development. We will address the question of whether the child is 'father of the Man' from a genetic perspective. In other words, do the same genetic influences affect the same traits across development? Using a 'taster menu' approach and prioritizing newer findings on cognitive and behavioral traits, examples from the following genetic disciplines will be discussed: (a) developmental quantitative genetics (such as longitudinal twin studies), (b) neurodevelopmental genetic syndromes with known genetic causes (such as Williams syndrome), (c) developmental candidate gene studies (such as those that link infant and adult populations), (d) developmental genome-wide association studies (GWAS), and (e) DNA resequencing. Evidence presented here suggests that there is considerable genetic stability of cognitive and behavioral traits across development, but there is also evidence for genetic change. Quantitative genetic studies have a long history of assessing genetic continuity and change across development. It is now time for the newer, more technology-enabled fields such as GWAS and DNA resequencing also to take on board the dynamic nature of human behavior. 2011 Blackwell Publishing Ltd.
Duan, Naibin; Bai, Yang; Sun, Honghe; Wang, Nan; Ma, Yumin; Li, Mingjun; Wang, Xin; Jiao, Chen; Legall, Noah; Mao, Linyong; Wan, Sibao; Wang, Kun; He, Tianming; Feng, Shouqian; Zhang, Zongying; Mao, Zhiquan; Shen, Xiang; Chen, Xiaoliu; Jiang, Yuanmao; Wu, Shujing; Yin, Chengmiao; Ge, Shunfeng; Yang, Long; Jiang, Shenghui; Xu, Haifeng; Liu, Jingxuan; Wang, Deyun; Qu, Changzhi; Wang, Yicheng; Zuo, Weifang; Xiang, Li; Liu, Chang; Zhang, Daoyuan; Gao, Yuan; Xu, Yimin; Xu, Kenong; Chao, Thomas; Fazio, Gennaro; Shu, Huairui; Zhong, Gan-Yuan; Cheng, Lailiang; Fei, Zhangjun; Chen, Xuesen
2017-08-15
Human selection has reshaped crop genomes. Here we report an apple genome variation map generated through genome sequencing of 117 diverse accessions. A comprehensive model of apple speciation and domestication along the Silk Road is proposed based on evidence from diverse genomic analyses. Cultivated apples likely originate from Malus sieversii in Kazakhstan, followed by intensive introgressions from M. sylvestris. M. sieversii in Xinjiang of China turns out to be an "ancient" isolated ecotype not directly contributing to apple domestication. We have identified selective sweeps underlying quantitative trait loci/genes of important fruit quality traits including fruit texture and flavor, and provide evidences supporting a model of apple fruit size evolution comprising two major events with one occurring prior to domestication and the other during domestication. This study outlines the genetic basis of apple domestication and evolution, and provides valuable information for facilitating marker-assisted breeding and apple improvement.Apple is one of the most important fruit crops. Here, the authors perform deep genome resequencing of 117 diverse accessions and reveal comprehensive models of apple origin, speciation, domestication, and fruit size evolution as well as candidate genes associated with important agronomic traits.
Addressable droplet microarrays for single cell protein analysis.
Salehi-Reyhani, Ali; Burgin, Edward; Ces, Oscar; Willison, Keith R; Klug, David R
2014-11-07
Addressable droplet microarrays are potentially attractive as a way to achieve miniaturised, reduced volume, high sensitivity analyses without the need to fabricate microfluidic devices or small volume chambers. We report a practical method for producing oil-encapsulated addressable droplet microarrays which can be used for such analyses. To demonstrate their utility, we undertake a series of single cell analyses, to determine the variation in copy number of p53 proteins in cells of a human cancer cell line.
A pooling-based approach to mapping genetic variants associated with DNA methylation
Kaplow, Irene M.; MacIsaac, Julia L.; Mah, Sarah M.; McEwen, Lisa M.; Kobor, Michael S.; Fraser, Hunter B.
2015-01-01
DNA methylation is an epigenetic modification that plays a key role in gene regulation. Previous studies have investigated its genetic basis by mapping genetic variants that are associated with DNA methylation at specific sites, but these have been limited to microarrays that cover <2% of the genome and cannot account for allele-specific methylation (ASM). Other studies have performed whole-genome bisulfite sequencing on a few individuals, but these lack statistical power to identify variants associated with DNA methylation. We present a novel approach in which bisulfite-treated DNA from many individuals is sequenced together in a single pool, resulting in a truly genome-wide map of DNA methylation. Compared to methods that do not account for ASM, our approach increases statistical power to detect associations while sharply reducing cost, effort, and experimental variability. As a proof of concept, we generated deep sequencing data from a pool of 60 human cell lines; we evaluated almost twice as many CpGs as the largest microarray studies and identified more than 2000 genetic variants associated with DNA methylation. We found that these variants are highly enriched for associations with chromatin accessibility and CTCF binding but are less likely to be associated with traits indirectly linked to DNA, such as gene expression and disease phenotypes. In summary, our approach allows genome-wide mapping of genetic variants associated with DNA methylation in any tissue of any species, without the need for individual-level genotype or methylation data. PMID:25910490
Shin, Da Young; Jeong, Mi Ho; Bang, In Jae; Kim, Ha Ryong; Chung, Kyu Hyuck
2018-05-01
Polyhexamethylene guanidine phosphate (PHMG-phosphate), an active component of humidifier disinfectant, is suspected to be a major cause of pulmonary fibrosis. Fibrosis, induced by recurrent epithelial damage, is significantly affected by epigenetic regulation, including microRNAs (miRNAs). The aim of this study was to investigate the fibrogenic mechanisms of PHMG-phosphate through the profiling of miRNAs and their target genes. A549 cells were treated with 0.75 μg/mL PHMG-phosphate for 24 and 48 h and miRNA microarray expression analysis was conducted. The putative mRNA targets of the miRNAs were identified and subjected to Gene Ontology analysis. After exposure to PHMG-phosphate for 24 and 48 h, 46 and 33 miRNAs, respectively, showed a significant change in expression over 1.5-fold compared with the control. The integrated analysis of miRNA and mRNA microarray results revealed the putative targets that were prominently enriched were associated with the epithelial-mesenchymal transition (EMT), cell cycle changes, and apoptosis. The dose-dependent induction of EMT by PHMG-phosphate exposure was confirmed by western blot. We identified 13 putative EMT-related targets that may play a role in PHMG-phosphate-induced fibrosis according to the Comparative Toxicogenomic Database. Our findings contribute to the comprehension of the fibrogenic mechanism of PHMG-phosphate and will aid further study on PHMG-phosphate-induced toxicity. Copyright © 2018 Elsevier B.V. All rights reserved.
A pooling-based approach to mapping genetic variants associated with DNA methylation
Kaplow, Irene M.; MacIsaac, Julia L.; Mah, Sarah M.; ...
2015-04-24
DNA methylation is an epigenetic modification that plays a key role in gene regulation. Previous studies have investigated its genetic basis by mapping genetic variants that are associated with DNA methylation at specific sites, but these have been limited to microarrays that cover <2% of the genome and cannot account for allele-specific methylation (ASM). Other studies have performed whole-genome bisulfite sequencing on a few individuals, but these lack statistical power to identify variants associated with DNA methylation. We present a novel approach in which bisulfite-treated DNA from many individuals is sequenced together in a single pool, resulting in a trulymore » genome-wide map of DNA methylation. Compared to methods that do not account for ASM, our approach increases statistical power to detect associations while sharply reducing cost, effort, and experimental variability. As a proof of concept, we generated deep sequencing data from a pool of 60 human cell lines; we evaluated almost twice as many CpGs as the largest microarray studies and identified more than 2000 genetic variants associated with DNA methylation. Here we found that these variants are highly enriched for associations with chromatin accessibility and CTCF binding but are less likely to be associated with traits indirectly linked to DNA, such as gene expression and disease phenotypes. In summary, our approach allows genome-wide mapping of genetic variants associated with DNA methylation in any tissue of any species, without the need for individual-level genotype or methylation data.« less
TAM: a method for enrichment and depletion analysis of a microRNA category in a list of microRNAs.
Lu, Ming; Shi, Bing; Wang, Juan; Cao, Qun; Cui, Qinghua
2010-08-09
MicroRNAs (miRNAs) are a class of important gene regulators. The number of identified miRNAs has been increasing dramatically in recent years. An emerging major challenge is the interpretation of the genome-scale miRNA datasets, including those derived from microarray and deep-sequencing. It is interesting and important to know the common rules or patterns behind a list of miRNAs, (i.e. the deregulated miRNAs resulted from an experiment of miRNA microarray or deep-sequencing). For the above purpose, this study presents a method and develops a tool (TAM) for annotations of meaningful human miRNAs categories. We first integrated miRNAs into various meaningful categories according to prior knowledge, such as miRNA family, miRNA cluster, miRNA function, miRNA associated diseases, and tissue specificity. Using TAM, given lists of miRNAs can be rapidly annotated and summarized according to the integrated miRNA categorical data. Moreover, given a list of miRNAs, TAM can be used to predict novel related miRNAs. Finally, we confirmed the usefulness and reliability of TAM by applying it to deregulated miRNAs in acute myocardial infarction (AMI) from two independent experiments. TAM can efficiently identify meaningful categories for given miRNAs. In addition, TAM can be used to identify novel miRNA biomarkers. TAM tool, source codes, and miRNA category data are freely available at http://cmbi.bjmu.edu.cn/tam.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kaplow, Irene M.; MacIsaac, Julia L.; Mah, Sarah M.
DNA methylation is an epigenetic modification that plays a key role in gene regulation. Previous studies have investigated its genetic basis by mapping genetic variants that are associated with DNA methylation at specific sites, but these have been limited to microarrays that cover <2% of the genome and cannot account for allele-specific methylation (ASM). Other studies have performed whole-genome bisulfite sequencing on a few individuals, but these lack statistical power to identify variants associated with DNA methylation. We present a novel approach in which bisulfite-treated DNA from many individuals is sequenced together in a single pool, resulting in a trulymore » genome-wide map of DNA methylation. Compared to methods that do not account for ASM, our approach increases statistical power to detect associations while sharply reducing cost, effort, and experimental variability. As a proof of concept, we generated deep sequencing data from a pool of 60 human cell lines; we evaluated almost twice as many CpGs as the largest microarray studies and identified more than 2000 genetic variants associated with DNA methylation. Here we found that these variants are highly enriched for associations with chromatin accessibility and CTCF binding but are less likely to be associated with traits indirectly linked to DNA, such as gene expression and disease phenotypes. In summary, our approach allows genome-wide mapping of genetic variants associated with DNA methylation in any tissue of any species, without the need for individual-level genotype or methylation data.« less
Chuquiyauri, Raul; Molina, Douglas M; Moss, Eli L; Wang, Ruobing; Gardner, Malcolm J; Brouwer, Kimberly C; Torres, Sonia; Gilman, Robert H; Llanos-Cuentas, Alejandro; Neafsey, Daniel E; Felgner, Philip; Liang, Xiaowu; Vinetz, Joseph M
2015-10-01
Large scale antibody responses in Plasmodium vivax malaria remains unexplored in the endemic setting. Protein microarray analysis of asexual-stage P. vivax was used to identify antigens recognized in sera from residents of hypoendemic Peruvian Amazon. Over 24 months, of 106 participants, 91 had two symptomatic P. vivax malaria episodes, 11 had three episodes, 3 had four episodes, and 1 had five episodes. Plasmodium vivax relapse was distinguished from reinfection by a merozoite surface protein-3α restriction fragment length polymorphism polymerase chain reaction (MSP3α PCR-RFLP) assay. Notably, P. vivax reinfection subjects did not have higher reactivity to the entire set of recognized P. vivax blood-stage antigens than relapse subjects, regardless of the number of malaria episodes. The most highly recognized P. vivax proteins were MSP 4, 7, 8, and 10 (PVX_003775, PVX_082650, PVX_097625, and PVX_114145); sexual-stage antigen s16 (PVX_000930); early transcribed membrane protein (PVX_090230); tryptophan-rich antigen (Pv-fam-a) (PVX_092995); apical merozoite antigen 1 (PVX_092275); and proteins of unknown function (PVX_081830, PVX_117680, PVX_118705, PVX_121935, PVX_097730, PVX_110935, PVX_115450, and PVX_082475). Genes encoding reactive proteins exhibited a significant enrichment of non-synonymous nucleotide variation, an observation suggesting immune selection. These data identify candidates for seroepidemiological tools to support malaria elimination efforts in P. vivax-endemic regions. © The American Society of Tropical Medicine and Hygiene.
Reliable pre-eclampsia pathways based on multiple independent microarray data sets.
Kawasaki, Kaoru; Kondoh, Eiji; Chigusa, Yoshitsugu; Ujita, Mari; Murakami, Ryusuke; Mogami, Haruta; Brown, J B; Okuno, Yasushi; Konishi, Ikuo
2015-02-01
Pre-eclampsia is a multifactorial disorder characterized by heterogeneous clinical manifestations. Gene expression profiling of preeclamptic placenta have provided different and even opposite results, partly due to data compromised by various experimental artefacts. Here we aimed to identify reliable pre-eclampsia-specific pathways using multiple independent microarray data sets. Gene expression data of control and preeclamptic placentas were obtained from Gene Expression Omnibus. Single-sample gene-set enrichment analysis was performed to generate gene-set activation scores of 9707 pathways obtained from the Molecular Signatures Database. Candidate pathways were identified by t-test-based screening using data sets, GSE10588, GSE14722 and GSE25906. Additionally, recursive feature elimination was applied to arrive at a further reduced set of pathways. To assess the validity of the pre-eclampsia pathways, a statistically-validated protocol was executed using five data sets including two independent other validation data sets, GSE30186, GSE44711. Quantitative real-time PCR was performed for genes in a panel of potential pre-eclampsia pathways using placentas of 20 women with normal or severe preeclamptic singleton pregnancies (n = 10, respectively). A panel of ten pathways were found to discriminate women with pre-eclampsia from controls with high accuracy. Among these were pathways not previously associated with pre-eclampsia, such as the GABA receptor pathway, as well as pathways that have already been linked to pre-eclampsia, such as the glutathione and CDKN1C pathways. mRNA expression of GABRA3 (GABA receptor pathway), GCLC and GCLM (glutathione metabolic pathway), and CDKN1C was significantly reduced in the preeclamptic placentas. In conclusion, ten accurate and reliable pre-eclampsia pathways were identified based on multiple independent microarray data sets. A pathway-based classification may be a worthwhile approach to elucidate the pathogenesis of pre-eclampsia. © The Author 2014. Published by Oxford University Press on behalf of the European Society of Human Reproduction and Embryology. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
APPLICATION OF CDNA MICROARRAY TO THE STUDY OF ARSENIC TOXICOLOGY AND CARCINOGENESIS
Arsenic (As) is a common environmental toxicant and known human carcinogen. Epidemiological studies link As exposure to various disorders and cancers. However, the molecular mechanisms for As toxicity and carcinogenicity are not completely known. The cDNA microarray, a high-th...
Identification of new autoantigens for primary biliary cirrhosis using human proteome microarrays.
Hu, Chao-Jun; Song, Guang; Huang, Wei; Liu, Guo-Zhen; Deng, Chui-Wen; Zeng, Hai-Pan; Wang, Li; Zhang, Feng-Chun; Zhang, Xuan; Jeong, Jun Seop; Blackshaw, Seth; Jiang, Li-Zhi; Zhu, Heng; Wu, Lin; Li, Yong-Zhe
2012-09-01
Primary biliary cirrhosis (PBC) is a chronic cholestatic liver disease of unknown etiology and is considered to be an autoimmune disease. Autoantibodies are important tools for accurate diagnosis of PBC. Here, we employed serum profiling analysis using a human proteome microarray composed of about 17,000 full-length unique proteins and identified 23 proteins that correlated with PBC. To validate these results, we fabricated a PBC-focused microarray with 21 of these newly identified candidates and nine additional known PBC antigens. By screening the PBC microarrays with additional cohorts of 191 PBC patients and 321 controls (43 autoimmune hepatitis, 55 hepatitis B virus, 31 hepatitis C virus, 48 rheumatoid arthritis, 45 systematic lupus erythematosus, 49 systemic sclerosis, and 50 healthy), six proteins were confirmed as novel PBC autoantigens with high sensitivities and specificities, including hexokinase-1 (isoforms I and II), Kelch-like protein 7, Kelch-like protein 12, zinc finger and BTB domain-containing protein 2, and eukaryotic translation initiation factor 2C, subunit 1. To facilitate clinical diagnosis, we developed ELISA for Kelch-like protein 12 and zinc finger and BTB domain-containing protein 2 and tested large cohorts (297 PBC and 637 control sera) to confirm the sensitivities and specificities observed in the microarray-based assays. In conclusion, our research showed that a strategy using high content protein microarray combined with a smaller but more focused protein microarray can effectively identify and validate novel PBC-specific autoantigens and has the capacity to be translated to clinical diagnosis by means of an ELISA-based method.
Chan, Robin F.; Shabalin, Andrey A.; Xie, Lin Y.; Adkins, Daniel E.; Zhao, Min; Turecki, Gustavo; Clark, Shaunna L.; Aberg, Karolina A.
2017-01-01
Abstract Methylome-wide association studies are typically performed using microarray technologies that only assay a very small fraction of the CG methylome and entirely miss two forms of methylation that are common in brain and likely of particular relevance for neuroscience and psychiatric disorders. The alternative is to use whole genome bisulfite (WGB) sequencing but this approach is not yet practically feasible with sample sizes required for adequate statistical power. We argue for revisiting methylation enrichment methods that, provided optimal protocols are used, enable comprehensive, adequately powered and cost-effective genome-wide investigations of the brain methylome. To support our claim we use data showing that enrichment methods approximate the sensitivity obtained with WGB methods and with slightly better specificity. However, this performance is achieved at <5% of the reagent costs. Furthermore, because many more samples can be sequenced simultaneously, projects can be completed about 15 times faster. Currently the only viable option available for comprehensive brain methylome studies, enrichment methods may be critical for moving the field forward. PMID:28334972
Microarray analysis of genes associated with cell surface NIS protein levels in breast cancer.
Beyer, Sasha J; Zhang, Xiaoli; Jimenez, Rafael E; Lee, Mei-Ling T; Richardson, Andrea L; Huang, Kun; Jhiang, Sissy M
2011-10-11
Na+/I- symporter (NIS)-mediated iodide uptake allows radioiodine therapy for thyroid cancer. NIS is also expressed in breast tumors, raising potential for radionuclide therapy of breast cancer. However, NIS expression in most breast cancers is low and may not be sufficient for radionuclide therapy. We aimed to identify biomarkers associated with NIS expression such that mechanisms underlying NIS modulation in human breast tumors may be elucidated. Published oligonucleotide microarray data within the National Center for Biotechnology Information Gene Expression Omnibus database were analyzed to identify gene expression tightly correlated with NIS mRNA level among human breast tumors. NIS immunostaining was performed in a tissue microarray composed of 28 human breast tumors which had corresponding oligonucleotide microarray data available for each tumor such that gene expression associated with cell surface NIS protein level could be identified. NIS mRNA levels do not vary among breast tumors or when compared to normal breast tissues when detected by Affymetrix oligonucleotide microarray platforms. Cell surface NIS protein levels are much more variable than their corresponding NIS mRNA levels. Despite a limited number of breast tumors examined, our analysis identified cysteinyl-tRNA synthetase as a biomarker that is highly associated with cell surface NIS protein levels in the ER-positive breast cancer subtype. Further investigation on genes associated with cell surface NIS protein levels within each breast cancer molecular subtype may lead to novel targets for selectively increasing NIS expression/function in a subset of breast cancers patients.
Screening Mammalian Cells on a Hydrogel: Functionalized Small Molecule Microarray.
Zhu, Biwei; Jiang, Bo; Na, Zhenkun; Yao, Shao Q
2017-01-01
Mammalian cell-based microarray technology has gained wide attention, for its plethora of promising applications. The platform is able to provide simultaneous information on multiple parameters for a given target, or even multiple target proteins, in a complex biological system. Here we describe the preparation of mammalian cell-based microarrays using selectively captured of human prostate cancer cells (PC-3). This platform was then used in controlled drug release and measuring the associated drug effects on these cancer cells.
Huang, Gangyong; Wei, Yibing; Zhao, Guanglei; Xia, Jun; Wang, Siqun; Wu, Jianguo; Chen, Feiyan; Chen, Jie; Shi, Jingshen
2017-01-01
The underlying mechanisms of glucocorticoid (GC)-induced avascular necrosis of the femoral head (ANFH) have yet to be fully understood, in particular the mechanisms associated with the change of gene expression pattern. The present study aimed to identify key genes with a differential expression pattern in GC-induced ANFH. E-MEXP-2751 microarray data were downloaded from the ArrayExpress database. Differentially expressed genes (DEGs) were identified in 5 femoral head samples of steroid-induced ANFH rats compared with 5 placebo-treated rat samples. Gene Ontology (GO) and pathway enrichment analyses were performed upon these DEGs. A total 93 DEGs (46 upregulated and 47 downregulated genes) were identified in GC-induced ANFH samples. These DEGs were enriched in different GO terms and pathways, including chondrocyte differentiation and detection of chemical stimuli. The enrichment map revealed that skeletal system development was interconnected with several other GO terms by gene overlap. The literature mined network analysis revealed that 5 upregulated genes were associated with femoral necrosis, including parathyroid hormone receptor 1 (PTHR1), vitamin D (1,25-Dihydroxyvitamin D3) receptor (VDR), collagen, type II, α1, proprotein convertase subtilisin/kexin type 6 and zinc finger protein 354C (ZFP354C). In addition, ZFP354C and VDR were identified to transcription factors. Furthermore, PTHR1 was revealed to interact with VDR, and α-2-macroglobulin (A2M) interacted with fibronectin 1 (FN1) in the PPI network. PTHR1 may be involved in GC-induced ANFH via interacting with VDR. A2M may also be involved in the development of GC-induced ANFH through interacting with FN1. An improved understanding of the molecular mechanisms underlying GC-induced ANFH may provide novel targets for diagnostics and therapeutic treatment. PMID:28393228
Sgadò, Paola; Provenzano, Giovanni; Dassi, Erik; Adami, Valentina; Zunino, Giulia; Genovesi, Sacha; Casarosa, Simona; Bozzi, Yuri
2013-12-19
Transcriptome analysis has been used in autism spectrum disorder (ASD) to unravel common pathogenic pathways based on the assumption that distinct rare genetic variants or epigenetic modifications affect common biological pathways. To unravel recurrent ASD-related neuropathological mechanisms, we took advantage of the En2-/- mouse model and performed transcriptome profiling on cerebellar and hippocampal adult tissues. Cerebellar and hippocampal tissue samples from three En2-/- and wild type (WT) littermate mice were assessed for differential gene expression using microarray hybridization followed by RankProd analysis. To identify functional categories overrepresented in the differentially expressed genes, we used integrated gene-network analysis, gene ontology enrichment and mouse phenotype ontology analysis. Furthermore, we performed direct enrichment analysis of ASD-associated genes from the SFARI repository in our differentially expressed genes. Given the limited number of animals used in the study, we used permissive criteria and identified 842 differentially expressed genes in En2-/- cerebellum and 862 in the En2-/- hippocampus. Our functional analysis revealed that the molecular signature of En2-/- cerebellum and hippocampus shares convergent pathological pathways with ASD, including abnormal synaptic transmission, altered developmental processes and increased immune response. Furthermore, when directly compared to the repository of the SFARI database, our differentially expressed genes in the hippocampus showed enrichment of ASD-associated genes significantly higher than previously reported. qPCR was performed for representative genes to confirm relative transcript levels compared to those detected in microarrays. Despite the limited number of animals used in the study, our bioinformatic analysis indicates the En2-/- mouse is a valuable tool for investigating molecular alterations related to ASD.
Variation of gene expression in Bacillus subtilis samples of fermentation replicates.
Zhou, Ying; Yu, Wen-Bang; Ye, Bang-Ce
2011-06-01
The application of comprehensive gene expression profiling technologies to compare wild and mutated microorganism samples or to assess molecular differences between various treatments has been widely used. However, little is known about the normal variation of gene expression in microorganisms. In this study, an Agilent customized microarray representing 4,106 genes was used to quantify transcript levels of five-repeated flasks to assess normal variation in Bacillus subtilis gene expression. CV analysis and analysis of variance were employed to investigate the normal variance of genes and the components of variance, respectively. The results showed that above 80% of the total variation was caused by biological variance. For the 12 replicates, 451 of 4,106 genes exhibited variance with CV values over 10%. The functional category enrichment analysis demonstrated that these variable genes were mainly involved in cell type differentiation, cell type localization, cell cycle and DNA processing, and spore or cyst coat. Using power analysis, the minimal biological replicate number for a B. subtilis microarray experiment was determined to be six. The results contribute to the definition of the baseline level of variability in B. subtilis gene expression and emphasize the importance of replicate microarray experiments.
With the advent of sequence information for entire eukaryotic genomes, it is now possible to analyze gene expression on a genomic scale. The primary tool for genomic analysis of gene expression is the gene microarray. We have used commercially available and custom cDNA microarray...
Vallée, Maud; Gravel, Catherine; Palin, Marie-France; Reghenas, Hélène; Stothard, Paul; Wishart, David S; Sirard, Marc-André
2005-07-01
The main objective of the present study was to identify novel oocyte-specific genes in three different species: bovine, mouse, and Xenopus laevis. To achieve this goal, two powerful technologies were combined: a polymerase chain reaction (PCR)-based cDNA subtraction, and cDNA microarrays. Three subtractive libraries consisting of 3456 clones were established and enriched for oocyte-specific transcripts. Sequencing analysis of the positive insert-containing clones resulted in the following classification: 53% of the clones corresponded to known cDNAs, 26% were classified as uncharacterized cDNAs, and a final 9% were classified as novel sequences. All these clones were used for cDNA microarray preparation. Results from these microarray analyses revealed that in addition to already known oocyte-specific genes, such as GDF9, BMP15, and ZP, known genes with unknown function in the oocyte were identified, such as a MLF1-interacting protein (MLF1IP), B-cell translocation gene 4 (BTG4), and phosphotyrosine-binding protein (xPTB). Furthermore, 15 novel oocyte-specific genes were validated by reverse transcription-PCR to confirm their preferential expression in the oocyte compared to somatic tissues. The results obtained in the present study confirmed that microarray analysis is a robust technique to identify true positives from the suppressive subtractive hybridization experiment. Furthermore, obtaining oocyte-specific genes from three species simultaneously allowed us to look at important genes that are conserved across species. Further characterization of these novel oocyte-specific genes will lead to a better understanding of the molecular mechanisms related to the unique functions found in the oocyte.
Microarray analysis of gene expression profiles in ripening pineapple fruits.
Koia, Jonni H; Moyle, Richard L; Botella, Jose R
2012-12-18
Pineapple (Ananas comosus) is a tropical fruit crop of significant commercial importance. Although the physiological changes that occur during pineapple fruit development have been well characterized, little is known about the molecular events that occur during the fruit ripening process. Understanding the molecular basis of pineapple fruit ripening will aid the development of new varieties via molecular breeding or genetic modification. In this study we developed a 9277 element pineapple microarray and used it to profile gene expression changes that occur during pineapple fruit ripening. Microarray analyses identified 271 unique cDNAs differentially expressed at least 1.5-fold between the mature green and mature yellow stages of pineapple fruit ripening. Among these 271 sequences, 184 share significant homology with genes encoding proteins of known function, 53 share homology with genes encoding proteins of unknown function and 34 share no significant homology with any database accession. Of the 237 pineapple sequences with homologs, 160 were up-regulated and 77 were down-regulated during pineapple fruit ripening. DAVID Functional Annotation Cluster (FAC) analysis of all 237 sequences with homologs revealed confident enrichment scores for redox activity, organic acid metabolism, metalloenzyme activity, glycolysis, vitamin C biosynthesis, antioxidant activity and cysteine peptidase activity, indicating the functional significance and importance of these processes and pathways during pineapple fruit development. Quantitative real-time PCR analysis validated the microarray expression results for nine out of ten genes tested. This is the first report of a microarray based gene expression study undertaken in pineapple. Our bioinformatic analyses of the transcript profiles have identified a number of genes, processes and pathways with putative involvement in the pineapple fruit ripening process. This study extends our knowledge of the molecular basis of pineapple fruit ripening and non-climacteric fruit ripening in general.
Microarray analysis of gene expression profiles in ripening pineapple fruits
2012-01-01
Background Pineapple (Ananas comosus) is a tropical fruit crop of significant commercial importance. Although the physiological changes that occur during pineapple fruit development have been well characterized, little is known about the molecular events that occur during the fruit ripening process. Understanding the molecular basis of pineapple fruit ripening will aid the development of new varieties via molecular breeding or genetic modification. In this study we developed a 9277 element pineapple microarray and used it to profile gene expression changes that occur during pineapple fruit ripening. Results Microarray analyses identified 271 unique cDNAs differentially expressed at least 1.5-fold between the mature green and mature yellow stages of pineapple fruit ripening. Among these 271 sequences, 184 share significant homology with genes encoding proteins of known function, 53 share homology with genes encoding proteins of unknown function and 34 share no significant homology with any database accession. Of the 237 pineapple sequences with homologs, 160 were up-regulated and 77 were down-regulated during pineapple fruit ripening. DAVID Functional Annotation Cluster (FAC) analysis of all 237 sequences with homologs revealed confident enrichment scores for redox activity, organic acid metabolism, metalloenzyme activity, glycolysis, vitamin C biosynthesis, antioxidant activity and cysteine peptidase activity, indicating the functional significance and importance of these processes and pathways during pineapple fruit development. Quantitative real-time PCR analysis validated the microarray expression results for nine out of ten genes tested. Conclusions This is the first report of a microarray based gene expression study undertaken in pineapple. Our bioinformatic analyses of the transcript profiles have identified a number of genes, processes and pathways with putative involvement in the pineapple fruit ripening process. This study extends our knowledge of the molecular basis of pineapple fruit ripening and non-climacteric fruit ripening in general. PMID:23245313
Whole exome resequencing distinguishes cystic kidney diseases from phenocopies in renal ciliopathies
Gee, Heon Yung; Otto, Edgar A.; Hurd, Toby W.; Ashraf, Shazia; Chaki, Moumita; Cluckey, Andrew; Vega-Warner, Virginia; Saisawat, Pawaree; Diaz, Katrina A.; Fang, Humphrey; Kohl, Stefan; Allen, Susan J.; Airik, Rannar; Zhou, Weibin; Ramaswami, Gokul; Janssen, Sabine; Fu, Clementine; Innis, Jamie L.; Weber, Stefanie; Vester, Udo; Davis, Erica E.; Katsanis, Nicholas; Fathy, Hanan M.; Jeck, Nikola; Klaus, Gunther; Nayir, Ahmet; Rahim, Khawla A.; Attrach, Ibrahim Al; Hassoun, Ibrahim Al; Ozturk, Savas; Drozdz, Dorota; Helmchen, Udo; O’Toole, John F.; Attanasio, Massimo; Nürnberg, Gudrun; Nürnberg, Peter; Washburn, Joseph; MacDonald, James; James, Jeffrey W.; Levy, Shawn; Hildebrandt, Friedhelm
2013-01-01
Rare single-gene disorders cause chronic disease. However, half of the 6,000 recessive single gene causes of disease are still unknown. Because recessive disease genes can illuminate, at least in part, disease pathomechanism, their identification offers direct opportunities for improved clinical management and potentially treatment. Rare diseases comprise the majority of chronic kidney disease (CKD) in children but are notoriously difficult to diagnose. Whole exome resequencing facilitates identification of recessive disease genes. However, its utility is impeded by the large number of genetic variants detected. We here overcome this limitation by combining homozygosity mapping with whole exome resequencing in 10 sib pairs with a nephronophthisis-related ciliopathy, which represents the most frequent genetic cause of CKD in the first three decades of life. In 7 of 10 sib-ships with a histologic or ultrasonographic diagnosis of nephronophthisis-related ciliopathy we detect the causative gene. In six sib-ships we identify mutations of known nephronophthisis-related ciliopathy genes, while in two additional sib-ships we found mutations in the known CKD-causing genes SLC4A1 and AGXT as phenocopies of nephronophthisis-related ciliopathy. Thus whole exome resequencing establishes an efficient, non-invasive approach towards early detection and causation-based diagnosis of rare kidney diseases. This approach can be extended to other rare recessive disorders, thereby providing accurate diagnosis and facilitating the study of disease mechanisms. PMID:24257694
A targeted resequencing gene panel for focal epilepsy.
Hildebrand, Michael S; Myers, Candace T; Carvill, Gemma L; Regan, Brigid M; Damiano, John A; Mullen, Saul A; Newton, Mark R; Nair, Umesh; Gazina, Elena V; Milligan, Carol J; Reid, Christopher A; Petrou, Steven; Scheffer, Ingrid E; Berkovic, Samuel F; Mefford, Heather C
2016-04-26
We report development of a targeted resequencing gene panel for focal epilepsy, the most prevalent phenotypic group of the epilepsies. The targeted resequencing gene panel was designed using molecular inversion probe (MIP) capture technology and sequenced using massively parallel Illumina sequencing. We demonstrated proof of principle that mutations can be detected in 4 previously genotyped focal epilepsy cases. We searched for both germline and somatic mutations in 251 patients with unsolved sporadic or familial focal epilepsy and identified 11 novel or very rare missense variants in 5 different genes: CHRNA4, GRIN2B, KCNT1, PCDH19, and SCN1A. Of these, 2 were predicted to be pathogenic or likely pathogenic, explaining ∼0.8% of the cohort, and 8 were of uncertain significance based on available data. We have developed and validated a targeted resequencing panel for focal epilepsies, the most important clinical class of epilepsies, accounting for about 60% of all cases. Our application of MIP technology is an innovative approach that will be advantageous in the clinical setting because it is highly sensitive, efficient, and cost-effective for screening large patient cohorts. Our findings indicate that mutations in known genes likely explain only a small proportion of focal epilepsy cases. This is not surprising given the established clinical and genetic heterogeneity of these disorders and underscores the importance of further gene discovery studies in this complex syndrome. © 2016 American Academy of Neurology.
Gee, Heon Yung; Otto, Edgar A; Hurd, Toby W; Ashraf, Shazia; Chaki, Moumita; Cluckey, Andrew; Vega-Warner, Virginia; Saisawat, Pawaree; Diaz, Katrina A; Fang, Humphrey; Kohl, Stefan; Allen, Susan J; Airik, Rannar; Zhou, Weibin; Ramaswami, Gokul; Janssen, Sabine; Fu, Clementine; Innis, Jamie L; Weber, Stefanie; Vester, Udo; Davis, Erica E; Katsanis, Nicholas; Fathy, Hanan M; Jeck, Nikola; Klaus, Gunther; Nayir, Ahmet; Rahim, Khawla A; Al Attrach, Ibrahim; Al Hassoun, Ibrahim; Ozturk, Savas; Drozdz, Dorota; Helmchen, Udo; O'Toole, John F; Attanasio, Massimo; Lewis, Richard A; Nürnberg, Gudrun; Nürnberg, Peter; Washburn, Joseph; MacDonald, James; Innis, Jeffrey W; Levy, Shawn; Hildebrandt, Friedhelm
2014-04-01
Rare single-gene disorders cause chronic disease. However, half of the 6000 recessive single gene causes of disease are still unknown. Because recessive disease genes can illuminate, at least in part, disease pathomechanism, their identification offers direct opportunities for improved clinical management and potentially treatment. Rare diseases comprise the majority of chronic kidney disease (CKD) in children but are notoriously difficult to diagnose. Whole-exome resequencing facilitates identification of recessive disease genes. However, its utility is impeded by the large number of genetic variants detected. We here overcome this limitation by combining homozygosity mapping with whole-exome resequencing in 10 sib pairs with a nephronophthisis-related ciliopathy, which represents the most frequent genetic cause of CKD in the first three decades of life. In 7 of 10 sibships with a histologic or ultrasonographic diagnosis of nephronophthisis-related ciliopathy, we detect the causative gene. In six sibships, we identify mutations of known nephronophthisis-related ciliopathy genes, while in two additional sibships we found mutations in the known CKD-causing genes SLC4A1 and AGXT as phenocopies of nephronophthisis-related ciliopathy. Thus, whole-exome resequencing establishes an efficient, noninvasive approach towards early detection and causation-based diagnosis of rare kidney diseases. This approach can be extended to other rare recessive disorders, thereby providing accurate diagnosis and facilitating the study of disease mechanisms.
NASA Astrophysics Data System (ADS)
Brazhnik, Kristina; Sokolova, Zinaida; Baryshnikova, Maria; Bilan, Regina; Nabiev, Igor; Sukhanova, Alyona
Multiplexed analysis of cancer markers is crucial for early tumor diagnosis and screening. We have designed lab-on-a-bead microarray for quantitative detection of three breast cancer markers in human serum. Quantum dots were used as bead-bound fluorescent tags for identifying each marker by means of flow cytometry. Antigen-specific beads reliably detected CA 15-3, CEA, and CA 125 in serum samples, providing clear discrimination between the samples with respect to the antigen levels. The novel microarray is advantageous over the routine single-analyte ones due to the simultaneous detection of various markers. Therefore the developed microarray is a promising tool for serum tumor marker profiling.
Simon, Jeremy M.; Giresi, Paul G.; Davis, Ian J.; Lieb, Jason D.
2013-01-01
Eviction or destabilization of nucleosomes from chromatin is a hallmark of functional regulatory elements of the eukaryotic genome. Historically identified by nuclease hypersensitivity, these regulatory elements are typically bound by transcription factors or other regulatory proteins. FAIRE (Formaldehyde-Assisted Isolation of Regulatory Elements) is an alternative approach to identify these genomic regions and has proven successful in a multitude of eukaryotic cell and tissue types. Cells or dissociated tissues are crosslinked briefly with formaldehyde, lysed, and sonicated. Sheared chromatin is subjected to phenol-chloroform extraction and the isolated DNA, typically encompassing 1–3% of the human genome, is purified. We provide guidelines for quantitative analysis by PCR, microarrays, or next-generation sequencing. Regulatory elements enriched by FAIRE display high concordance with those identified by nuclease hypersensitivity or ChIP, and the entire procedure can be completed in three days. FAIRE exhibits low technical variability, which allows its use in large-scale studies of chromatin from normal or diseased tissues. PMID:22262007
CD161 Defines a Functionally Distinct Subset of Pro-Inflammatory Natural Killer Cells
Kurioka, Ayako; Cosgrove, Cormac; Simoni, Yannick; van Wilgenburg, Bonnie; Geremia, Alessandra; Björkander, Sophia; Sverremark-Ekström, Eva; Thurnheer, Christine; Günthard, Huldrych F.; Khanna, Nina; Aubert, V; Arancibia-Cárcamo, CV; Walker, Lucy Jane; Arancibia-Cárcamo, Carolina V.; Newell, Evan W.; Willberg, Christian B.; Klenerman, Paul
2018-01-01
CD161 is a C-type lectin-like receptor expressed on the majority of natural killer (NK) cells; however, the significance of CD161 expression on NK cells has not been comprehensively investigated. Recently, we found that CD161 expression identifies a transcriptional and innate functional phenotype that is shared across various T cell populations. Using mass cytometry and microarray experiments, we demonstrate that this functional phenotype extends to NK cells. CD161 marks NK cells that have retained the ability to respond to innate cytokines during their differentiation, and is lost upon cytomegalovirus-induced maturation in both healthy and human immunodeficiency virus (HIV)-infected patients. These pro-inflammatory NK cells are present in the inflamed lamina propria where they are enriched for integrin CD103 expression. Thus, CD161 expression identifies NK cells that may contribute to inflammatory disease pathogenesis and correlates with an innate responsiveness to cytokines in both T and NK cells. PMID:29686665
Harris, R. Alan; Wang, Ting; Coarfa, Cristian; Nagarajan, Raman P.; Hong, Chibo; Downey, Sara L.; Johnson, Brett E.; Fouse, Shaun D.; Delaney, Allen; Zhao, Yongjun; Olshen, Adam; Ballinger, Tracy; Zhou, Xin; Forsberg, Kevin J.; Gu, Junchen; Echipare, Lorigail; O’Geen, Henriette; Lister, Ryan; Pelizzola, Mattia; Xi, Yuanxin; Epstein, Charles B.; Bernstein, Bradley E.; Hawkins, R. David; Ren, Bing; Chung, Wen-Yu; Gu, Hongcang; Bock, Christoph; Gnirke, Andreas; Zhang, Michael Q.; Haussler, David; Ecker, Joseph; Li, Wei; Farnham, Peggy J.; Waterland, Robert A.; Meissner, Alexander; Marra, Marco A.; Hirst, Martin; Milosavljevic, Aleksandar; Costello, Joseph F.
2010-01-01
Sequencing-based DNA methylation profiling methods are comprehensive and, as accuracy and affordability improve, will increasingly supplant microarrays for genome-scale analyses. Here, four sequencing-based methodologies were applied to biological replicates of human embryonic stem cells to compare their CpG coverage genome-wide and in transposons, resolution, cost, concordance and its relationship with CpG density and genomic context. The two bisulfite methods reached concordance of 82% for CpG methylation levels and 99% for non-CpG cytosine methylation levels. Using binary methylation calls, two enrichment methods were 99% concordant, while regions assessed by all four methods were 97% concordant. To achieve comprehensive methylome coverage while reducing cost, an approach integrating two complementary methods was examined. The integrative methylome profile along with histone methylation, RNA, and SNP profiles derived from the sequence reads allowed genome-wide assessment of allele-specific epigenetic states, identifying most known imprinted regions and new loci with monoallelic epigenetic marks and monoallelic expression. PMID:20852635
KDM5 Interacts with Foxo to Modulate Cellular Levels of Oxidative Stress
Liu, Xingyin; Greer, Christina; Secombe, Julie
2014-01-01
Increased cellular levels of oxidative stress are implicated in a large number of human diseases. Here we describe the transcription co-factor KDM5 (also known as Lid) as a new critical regulator of cellular redox state. Moreover, this occurs through a novel KDM5 activity whereby it alters the ability of the transcription factor Foxo to bind to DNA. Our microarray analyses of kdm5 mutants revealed a striking enrichment for genes required to regulate cellular levels of oxidative stress. Consistent with this, loss of kdm5 results in increased sensitivity to treatment with oxidizers, elevated levels of oxidized proteins, and increased mutation load. KDM5 activates oxidative stress resistance genes by interacting with Foxo to facilitate its recruitment to KDM5-Foxo co-regulated genes. Significantly, this occurs independently of KDM5's well-characterized demethylase activity. Instead, KDM5 interacts with the lysine deacetylase HDAC4 to promote Foxo deacetylation, which affects Foxo DNA binding. PMID:25329053
A taxonomy of epithelial human cancer and their metastases
2009-01-01
Background Microarray technology has allowed to molecularly characterize many different cancer sites. This technology has the potential to individualize therapy and to discover new drug targets. However, due to technological differences and issues in standardized sample collection no study has evaluated the molecular profile of epithelial human cancer in a large number of samples and tissues. Additionally, it has not yet been extensively investigated whether metastases resemble their tissue of origin or tissue of destination. Methods We studied the expression profiles of a series of 1566 primary and 178 metastases by unsupervised hierarchical clustering. The clustering profile was subsequently investigated and correlated with clinico-pathological data. Statistical enrichment of clinico-pathological annotations of groups of samples was investigated using Fisher exact test. Gene set enrichment analysis (GSEA) and DAVID functional enrichment analysis were used to investigate the molecular pathways. Kaplan-Meier survival analysis and log-rank tests were used to investigate prognostic significance of gene signatures. Results Large clusters corresponding to breast, gastrointestinal, ovarian and kidney primary tissues emerged from the data. Chromophobe renal cell carcinoma clustered together with follicular differentiated thyroid carcinoma, which supports recent morphological descriptions of thyroid follicular carcinoma-like tumors in the kidney and suggests that they represent a subtype of chromophobe carcinoma. We also found an expression signature identifying primary tumors of squamous cell histology in multiple tissues. Next, a subset of ovarian tumors enriched with endometrioid histology clustered together with endometrium tumors, confirming that they share their etiopathogenesis, which strongly differs from serous ovarian tumors. In addition, the clustering of colon and breast tumors correlated with clinico-pathological characteristics. Moreover, a signature was developed based on our unsupervised clustering of breast tumors and this was predictive for disease-specific survival in three independent studies. Next, the metastases from ovarian, breast, lung and vulva cluster with their tissue of origin while metastases from colon showed a bimodal distribution. A significant part clusters with tissue of origin while the remaining tumors cluster with the tissue of destination. Conclusion Our molecular taxonomy of epithelial human cancer indicates surprising correlations over tissues. This may have a significant impact on the classification of many cancer sites and may guide pathologists, both in research and daily practice. Moreover, these results based on unsupervised analysis yielded a signature predictive of clinical outcome in breast cancer. Additionally, we hypothesize that metastases from gastrointestinal origin either remember their tissue of origin or adapt to the tissue of destination. More specifically, colon metastases in the liver show strong evidence for such a bimodal tissue specific profile. PMID:20017941
Grubaugh, Nathan D.; Petz, Lawrence N.; Melanson, Vanessa R.; McMenamy, Scott S.; Turell, Michael J.; Long, Lewis S.; Pisarcik, Sarah E.; Kengluecha, Ampornpan; Jaichapor, Boonsong; O'Guinn, Monica L.; Lee, John S.
2013-01-01
Highly multiplexed assays, such as microarrays, can benefit arbovirus surveillance by allowing researchers to screen for hundreds of targets at once. We evaluated amplification strategies and the practicality of a portable DNA microarray platform to analyze virus-infected mosquitoes. The prototype microarray design used here targeted the non-structural protein 5, ribosomal RNA, and cytochrome b genes for the detection of flaviviruses, mosquitoes, and bloodmeals, respectively. We identified 13 of 14 flaviviruses from virus inoculated mosquitoes and cultured cells. Additionally, we differentiated between four mosquito genera and eight whole blood samples. The microarray platform was field evaluated in Thailand and successfully identified flaviviruses (Culex flavivirus, dengue-3, and Japanese encephalitis viruses), differentiated between mosquito genera (Aedes, Armigeres, Culex, and Mansonia), and detected mammalian bloodmeals (human and dog). We showed that the microarray platform and amplification strategies described here can be used to discern specific information on a wide variety of viruses and their vectors. PMID:23249687
Ciaccio, Mark F.; Chuu, Chih-pin; Jones, Richard B.
2012-01-01
First-generation interaction maps of Src homology 2 (SH2) domains with receptor tyrosine kinase (RTK) phosphosites have previously been generated using protein microarray (PM) technologies. Here, we developed a large-scale fluorescence polarization (FP) methodology that was able to characterize interactions between SH2 domains and ErbB receptor phosphosites with higher fidelity and sensitivity than was previously achieved with PMs. We used the FP assay to query the interaction of synthetic phosphopeptides corresponding to 89 ErbB receptor intracellular tyrosine sites against 93 human SH2 domains and 2 phosphotyrosine binding (PTB) domains. From 358,944 polarization measurements, the affinities for 1,405 unique biological interactions were determined, 83% of which are novel. In contrast to data from previous reports, our analyses suggested that ErbB2 was not more promiscuous than the other ErbB receptors. Our results showed that each receptor displays unique preferences in the affinity and location of recruited SH2 domains that may contribute to differences in downstream signaling potential. ErbB1 was enriched versus the other receptors for recruitment of domains from RAS GEFs whereas ErbB2 was enriched for recruitment of domains from tyrosine and phosphatidyl inositol phosphatases. ErbB3, the kinase inactive ErbB receptor family member, was predictably enriched for recruitment of domains from phosphatidyl inositol kinases and surprisingly, was enriched for recruitment of domains from tyrosine kinases, cytoskeletal regulatory proteins, and RHO GEFs but depleted for recruitment of domains from phosphatidyl inositol phosphatases. Many novel interactions were also observed with phosphopeptides corresponding to ErbB receptor tyrosines not previously reported to be phosphorylated by mass spectrometry, suggesting the existence of many biologically relevant RTK sites that may be phosphorylated but below the detection threshold of standard mass spectrometry procedures. This dataset represents a rich source of testable hypotheses regarding the biological mechanisms of ErbB receptors. PMID:22973453
Neerincx, Pieter BT; Casel, Pierrot; Prickett, Dennis; Nie, Haisheng; Watson, Michael; Leunissen, Jack AM; Groenen, Martien AM; Klopp, Christophe
2009-01-01
Background Reliable annotation linking oligonucleotide probes to target genes is essential for functional biological analysis of microarray experiments. We used the IMAD, OligoRAP and sigReannot pipelines to update the annotation for the ARK-Genomics Chicken 20 K array as part of a joined EADGENE/SABRE workshop. In this manuscript we compare their annotation strategies and results. Furthermore, we analyse the effect of differences in updated annotation on functional analysis for an experiment involving Eimeria infected chickens and finally we propose guidelines for optimal annotation strategies. Results IMAD, OligoRAP and sigReannot update both annotation and estimated target specificity. The 3 pipelines can assign oligos to target specificity categories although with varying degrees of resolution. Target specificity is judged based on the amount and type of oligo versus target-gene alignments (hits), which are determined by filter thresholds that users can adjust based on their experimental conditions. Linking oligos to annotation on the other hand is based on rigid rules, which differ between pipelines. For 52.7% of the oligos from a subset selected for in depth comparison all pipelines linked to one or more Ensembl genes with consensus on 44.0%. In 31.0% of the cases none of the pipelines could assign an Ensembl gene to an oligo and for the remaining 16.3% the coverage differed between pipelines. Differences in updated annotation were mainly due to different thresholds for hybridisation potential filtering of oligo versus target-gene alignments and different policies for expanding annotation using indirect links. The differences in updated annotation packages had a significant effect on GO term enrichment analysis with consensus on only 67.2% of the enriched terms. Conclusion In addition to flexible thresholds to determine target specificity, annotation tools should provide metadata describing the relationships between oligos and the annotation assigned to them. These relationships can then be used to judge the varying degrees of reliability allowing users to fine-tune the balance between reliability and coverage. This is important as it can have a significant effect on functional microarray analysis as exemplified by the lack of consensus on almost one third of the terms found with GO term enrichment analysis based on updated IMAD, OligoRAP or sigReannot annotation. PMID:19615109
2013-01-01
Background The emergence of pyrethroid resistance in the malaria vector, Anopheles arabiensis, threatens to undermine the considerable gains made towards eliminating malaria on Zanzibar. Previously, resistance was restricted to the island of Pemba while mosquitoes from Unguja, the larger of the two islands of Zanzibar, were susceptible. Here, we characterised the mechanism(s) responsible for resistance on Zanzibar using a combination of gene expression and target-site mutation assays. Methods WHO resistance bioassays were conducted using 1-5d old adult Anopheles gambiae s.l. collected between 2011 and 2013 across the archipelago. Synergist assays with the P450 inhibitor piperonyl-butoxide were performed in 2013. Members of the An. gambiae complex were PCR-identified and screened for target-site mutations (kdr and Ace-1). Gene expression in pyrethroid resistant An. arabiensis from Pemba was analysed using whole-genome microarrays. Results Pyrethroid resistance is now present across the entire Zanzibar archipelago. Survival to the pyrethroid lambda-cyhalothrin in bioassays conducted in 2013 was 23.5-54.3% on Unguja and 32.9-81.7% on Pemba. We present evidence that resistance is mediated, in part at least, by elevated P450 monoxygenases. Whole-genome microarray scans showed that the most enriched gene terms in resistant An. arabiensis from Pemba were associated with P450 activity and synergist assays with PBO completely restored susceptibility to pyrethroids in both islands. CYP4G16 was the most consistently over-expressed gene in resistant mosquitoes compared with two susceptible strains from Unguja and Dar es Salaam. Expression of this P450 is enriched in the abdomen and it is thought to play a role in hydrocarbon synthesis. Microarray and qPCR detected several additional genes putatively involved in this pathway enriched in the Pemba pyrethroid resistant population and we hypothesise that resistance may be, in part, related to alterations in the structure of the mosquito cuticle. None of the kdr target-site mutations, associated with pyrethroid/DDT resistance in An. gambiae elsewhere in Africa, were found on the islands. Conclusion The consequences of this resistance phenotype are discussed in relation to future vector control strategies on Zanzibar to support the ongoing malaria elimination efforts on the islands. PMID:24314005
Neerincx, Pieter Bt; Casel, Pierrot; Prickett, Dennis; Nie, Haisheng; Watson, Michael; Leunissen, Jack Am; Groenen, Martien Am; Klopp, Christophe
2009-07-16
Reliable annotation linking oligonucleotide probes to target genes is essential for functional biological analysis of microarray experiments. We used the IMAD, OligoRAP and sigReannot pipelines to update the annotation for the ARK-Genomics Chicken 20 K array as part of a joined EADGENE/SABRE workshop. In this manuscript we compare their annotation strategies and results. Furthermore, we analyse the effect of differences in updated annotation on functional analysis for an experiment involving Eimeria infected chickens and finally we propose guidelines for optimal annotation strategies. IMAD, OligoRAP and sigReannot update both annotation and estimated target specificity. The 3 pipelines can assign oligos to target specificity categories although with varying degrees of resolution. Target specificity is judged based on the amount and type of oligo versus target-gene alignments (hits), which are determined by filter thresholds that users can adjust based on their experimental conditions. Linking oligos to annotation on the other hand is based on rigid rules, which differ between pipelines.For 52.7% of the oligos from a subset selected for in depth comparison all pipelines linked to one or more Ensembl genes with consensus on 44.0%. In 31.0% of the cases none of the pipelines could assign an Ensembl gene to an oligo and for the remaining 16.3% the coverage differed between pipelines. Differences in updated annotation were mainly due to different thresholds for hybridisation potential filtering of oligo versus target-gene alignments and different policies for expanding annotation using indirect links. The differences in updated annotation packages had a significant effect on GO term enrichment analysis with consensus on only 67.2% of the enriched terms. In addition to flexible thresholds to determine target specificity, annotation tools should provide metadata describing the relationships between oligos and the annotation assigned to them. These relationships can then be used to judge the varying degrees of reliability allowing users to fine-tune the balance between reliability and coverage. This is important as it can have a significant effect on functional microarray analysis as exemplified by the lack of consensus on almost one third of the terms found with GO term enrichment analysis based on updated IMAD, OligoRAP or sigReannot annotation.
Protein microarray analysis reveals BAFF-binding autoantibodies in systemic lupus erythematosus
Price, Jordan V.; Haddon, David J.; Kemmer, Dodge; Delepine, Guillaume; Mandelbaum, Gil; Jarrell, Justin A.; Gupta, Rohit; Balboni, Imelda; Chakravarty, Eliza F.; Sokolove, Jeremy; Shum, Anthony K.; Anderson, Mark S.; Cheng, Mickie H.; Robinson, William H.; Browne, Sarah K.; Holland, Steven M.; Baechler, Emily C.; Utz, Paul J.
2013-01-01
Autoantibodies against cytokines, chemokines, and growth factors inhibit normal immunity and are implicated in inflammatory autoimmune disease and diseases of immune deficiency. In an effort to evaluate serum from autoimmune and immunodeficient patients for Abs against cytokines, chemokines, and growth factors in a high-throughput and unbiased manner, we constructed a multiplex protein microarray for detection of serum factor–binding Abs and used the microarray to detect autoantibody targets in SLE. We designed a nitrocellulose-surface microarray containing human cytokines, chemokines, and other circulating proteins and demonstrated that the array permitted specific detection of serum factor–binding probes. We used the arrays to detect previously described autoantibodies against cytokines in samples from individuals with autoimmune polyendocrine syndrome type 1 and chronic mycobacterial infection. Serum profiling from individuals with SLE revealed that among several targets, elevated IgG autoantibody reactivity to B cell–activating factor (BAFF) was associated with SLE compared with control samples. BAFF reactivity correlated with the severity of disease-associated features, including IFN-α–driven SLE pathology. Our results showed that serum factor protein microarrays facilitate detection of autoantibody reactivity to serum factors in human samples and that BAFF-reactive autoantibodies may be associated with an elevated inflammatory disease state within the spectrum of SLE. PMID:24270423
In vitro study of the effects of ELF electric fields on gene expression in human epidermal cells.
Collard, Jean-Francois; Mertens, Benjamin; Hinsenkamp, Maurice
2011-01-01
An acceleration of differentiation, at the expense of proliferation, is observed after exposure of various biological models to low frequency and low amplitude electric and electromagnetic fields. Following these results showing significant modifications, we try to identify the biological mechanism involved at the cell level through microarray screening. For this study, we use epidermis cultures harvested from human abdominoplasty. Two platinum electrodes are used to apply the electric signal. The gene expressions of 38,500 well-characterized human genes are analyzed using Affymetrix(®) microarray U133 Plus 2.0 chips. The protocol is repeated on three different patients. After three periods of exposure, a total of 24 chips have been processed. After the application of ELF electric fields, the microarray analysis confirms a modification of the gene expression of epidermis cells. Particularly, four up-regulated genes (DKK1, TXNRD1, ATF3, and MME) and one down-regulated gene (MACF1) are involved in the regulation of proliferation and differentiation. Expression of these five genes was also confirmed by real-time rtPCR in all samples used for microarray analysis. These results corroborate an acceleration of cell differentiation at the expense of cell proliferation. © 2010 Wiley-Liss, Inc.
Development of a cell microarray chip for detection of circulating tumor cells
NASA Astrophysics Data System (ADS)
Yamamura, S.; Yatsushiro, S.; Abe, K.; Baba, Y.; Kataoka, M.
2012-03-01
Detection of circulating tumor cells (CTCs) in the peripheral blood of metastatic cancer patients has clinical significance in earlier diagnosis of metastases. In this study, a novel cell microarray chip for accurate and rapid detection of tumor cells from human leukocytes was developed. The chip with 20,944 microchambers (105 μm diameter and 50 μm depth) was made from polystyrene, and the surface was rendered to hydrophilic by means of reactive-ion etching, which led to the formation of mono-layers of leukocytes on the microchambers. As the model of CTCs detection, we spiked human bronchioalveolar carcinoma (H1650) cells into human T lymphoblastoid leukemia (CEM) cells suspension and detected H1650 cells using the chip. A CEM suspension contained with H1650 cells was dispersed on the chip surface, followed by 10 min standing to allow the cells to settle down into the microchambers. About 30 CEM cells were accommodated in each microchamber, over 600,000 CEM cells in total being on a chip. We could detect 1 H1650 cell per 106 CEM cells on the microarray by staining with fluorescence-conjugated antibody (Anti-Cytokeratin) and cell membrane marker (DiD). Thus, this cell microarray chip has highly potential to be a novel tool of accurate and rapid detection of CTCs.
Barley whole exome capture: a tool for genomic research in the genus Hordeum and beyond
Mascher, Martin; Richmond, Todd A; Gerhardt, Daniel J; Himmelbach, Axel; Clissold, Leah; Sampath, Dharanya; Ayling, Sarah; Steuernagel, Burkhard; Pfeifer, Matthias; D'Ascenzo, Mark; Akhunov, Eduard D; Hedley, Pete E; Gonzales, Ana M; Morrell, Peter L; Kilian, Benjamin; Blattner, Frank R; Scholz, Uwe; Mayer, Klaus FX; Flavell, Andrew J; Muehlbauer, Gary J; Waugh, Robbie; Jeddeloh, Jeffrey A; Stein, Nils
2013-01-01
Advanced resources for genome-assisted research in barley (Hordeum vulgare) including a whole-genome shotgun assembly and an integrated physical map have recently become available. These have made possible studies that aim to assess genetic diversity or to isolate single genes by whole-genome resequencing and in silico variant detection. However such an approach remains expensive given the 5 Gb size of the barley genome. Targeted sequencing of the mRNA-coding exome reduces barley genomic complexity more than 50-fold, thus dramatically reducing this heavy sequencing and analysis load. We have developed and employed an in-solution hybridization-based sequence capture platform to selectively enrich for a 61.6 megabase coding sequence target that includes predicted genes from the genome assembly of the cultivar Morex as well as publicly available full-length cDNAs and de novo assembled RNA-Seq consensus sequence contigs. The platform provides a highly specific capture with substantial and reproducible enrichment of targeted exons, both for cultivated barley and related species. We show that this exome capture platform provides a clear path towards a broader and deeper understanding of the natural variation residing in the mRNA-coding part of the barley genome and will thus constitute a valuable resource for applications such as mapping-by-sequencing and genetic diversity analyzes. PMID:23889683
Identification of embryonic pancreatic genes using Xenopus DNA microarrays.
Hayata, Tadayoshi; Blitz, Ira L; Iwata, Nahoko; Cho, Ken W Y
2009-06-01
The pancreas is both an exocrine and endocrine endodermal organ involved in digestion and glucose homeostasis. During embryogenesis, the anlagen of the pancreas arise from dorsal and ventral evaginations of the foregut that later fuse to form a single organ. To better understand the molecular genetics of early pancreas development, we sought to isolate markers that are uniquely expressed in this tissue. Microarray analysis was performed comparing dissected pancreatic buds, liver buds, and the stomach region of tadpole stage Xenopus embryos. A total of 912 genes were found to be differentially expressed between these organs during early stages of organogenesis. K-means clustering analysis predicted 120 of these genes to be specifically enriched in the pancreas. Of these, we report on the novel expression patterns of 24 genes. Our analyses implicate the involvement of previously unsuspected signaling pathways during early pancreas development. Developmental Dynamics 238:1455-1466, 2009. (c) 2009 Wiley-Liss, Inc.
A Roadmap for Functional Structural Variants in the Soybean Genome
Anderson, Justin E.; Kantar, Michael B.; Kono, Thomas Y.; Fu, Fengli; Stec, Adrian O.; Song, Qijian; Cregan, Perry B.; Specht, James E.; Diers, Brian W.; Cannon, Steven B.; McHale, Leah K.; Stupar, Robert M.
2014-01-01
Gene structural variation (SV) has recently emerged as a key genetic mechanism underlying several important phenotypic traits in crop species. We screened a panel of 41 soybean (Glycine max) accessions serving as parents in a soybean nested association mapping population for deletions and duplications in more than 53,000 gene models. Array hybridization and whole genome resequencing methods were used as complementary technologies to identify SV in 1528 genes, or approximately 2.8%, of the soybean gene models. Although SV occurs throughout the genome, SV enrichment was noted in families of biotic defense response genes. Among accessions, SV was nearly eightfold less frequent for gene models that have retained paralogs since the last whole genome duplication event, compared with genes that have not retained paralogs. Increases in gene copy number, similar to that described at the Rhg1 resistance locus, account for approximately one-fourth of the genic SV events. This assessment of soybean SV occurrence presents a target list of genes potentially responsible for rapidly evolving and/or adaptive traits. PMID:24855315
Guffanti, Guia; Torri, Federica; Rasmussen, Jerod; Clark, Andrew P.; Lakatos, Anita; Turner, Jessica A.; Fallon, James H.; Saykin, Andrew J.; Weiner, Michael; Vawter, Marquis P.; Knowles, James A.; Potkin, Steven G.; Macciardi, Fabio
2014-01-01
We investigated the genome-wide distribution of CNVs in the Alzheimer's disease (AD) Neuroimaging Initiative (ADNI) sample (146 with AD, 313 with Mild Cognitive Impairment (MCI), and 181 controls). Comparison of single CNVs between cases (MCI and AD) and controls shows overrepresentation of large heterozygous deletions in cases (p-value < 0.0001). The analysis of CNV-Regions identifies 44 copy number variable loci of heterozygous deletions, with more CNV-Regions among affected than controls (p = 0.005). Seven of the 44 CNV-Regions are nominally significant for association with cognitive impairment. We validated and confirmed our main findings with genome re-sequencing of selected patients and controls. The functional pathway analysis of the genes putatively affected by deletions of CNV-Regions reveals enrichment of genes implicated in axonal guidance, cell–cell adhesion, neuronal morphogenesis and differentiation. Our findings support the role of CNVs in AD, and suggest an association between large deletions and the development of cognitive impairment PMID:23583670
Hmaïed, F; Helel, S; Le Berre, V; François, J-M; Leclercq, A; Lecuit, M; Smaoui, H; Kechrid, A; Boudabous, A; Barkallah, I
2014-02-01
We aimed at evaluating the prevalence of Listeria species isolated from food samples and characterizing food and human cases isolates. Between 2005 and 2007, one hundred food samples collected in the markets of Tunis were analysed in our study. Five strains of Listeria monocytogenes responsible for human listeriosis isolated in hospital of Tunis were included. Multiplex PCR serogrouping and pulsed field gel electrophoresis (PFGE) applying the enzyme AscI and ApaI were used for the characterization of isolates of L. monocytogenes. We have developed a rapid microarray-based assay to a reliable discrimination of species within the Listeria genus. The prevalence of Listeria spp. in food samples was estimated at 14% by using classical biochemical identification. Two samples were assigned to L. monocytogenes and 12 to L. innocua. DNA microarray allowed unambiguous identification of Listeria species. Our results obtained by microarray-based assay were in accordance with the biochemical identification. The two food L. monocytogenes isolates were assigned to the PCR serogroup IIa (serovar 1/2a). Whereas human L. monocytogenes isolates were of PCR serogroup IVb, (serovars 4b). These isolates present a high similarity in PFGE. Food L. monocytogenes isolates were classified into two different pulsotypes. These pulsotypes were different from that of the five strains responsible for the human cases. We confirmed the presence of Listeria spp. in variety of food samples in Tunis. Increased food and clinical surveillance must be taken into consideration in Tunisia to identify putative infections sources. Copyright © 2013 Elsevier Masson SAS. All rights reserved.
Reboiro-Jato, Miguel; Arrais, Joel P; Oliveira, José Luis; Fdez-Riverola, Florentino
2014-01-30
The diagnosis and prognosis of several diseases can be shortened through the use of different large-scale genome experiments. In this context, microarrays can generate expression data for a huge set of genes. However, to obtain solid statistical evidence from the resulting data, it is necessary to train and to validate many classification techniques in order to find the best discriminative method. This is a time-consuming process that normally depends on intricate statistical tools. geneCommittee is a web-based interactive tool for routinely evaluating the discriminative classification power of custom hypothesis in the form of biologically relevant gene sets. While the user can work with different gene set collections and several microarray data files to configure specific classification experiments, the tool is able to run several tests in parallel. Provided with a straightforward and intuitive interface, geneCommittee is able to render valuable information for diagnostic analyses and clinical management decisions based on systematically evaluating custom hypothesis over different data sets using complementary classifiers, a key aspect in clinical research. geneCommittee allows the enrichment of microarrays raw data with gene functional annotations, producing integrated datasets that simplify the construction of better discriminative hypothesis, and allows the creation of a set of complementary classifiers. The trained committees can then be used for clinical research and diagnosis. Full documentation including common use cases and guided analysis workflows is freely available at http://sing.ei.uvigo.es/GC/.
Paraboschi, Elvezia Maria; Cardamone, Giulia; Rimoldi, Valeria; Gemmati, Donato; Spreafico, Marta; Duga, Stefano; Soldà, Giulia; Asselta, Rosanna
2015-09-30
Abnormalities in RNA metabolism and alternative splicing (AS) are emerging as important players in complex disease phenotypes. In particular, accumulating evidence suggests the existence of pathogenic links between multiple sclerosis (MS) and altered AS, including functional studies showing that an imbalance in alternatively-spliced isoforms may contribute to disease etiology. Here, we tested whether the altered expression of AS-related genes represents a MS-specific signature. A comprehensive comparative analysis of gene expression profiles of publicly-available microarray datasets (190 MS cases, 182 controls), followed by gene-ontology enrichment analysis, highlighted a significant enrichment for differentially-expressed genes involved in RNA metabolism/AS. In detail, a total of 17 genes were found to be differentially expressed in MS in multiple datasets, with CELF1 being dysregulated in five out of seven studies. We confirmed CELF1 downregulation in MS (p=0.0015) by real-time RT-PCRs on RNA extracted from blood cells of 30 cases and 30 controls. As a proof of concept, we experimentally verified the unbalance in alternatively-spliced isoforms in MS of the NFAT5 gene, a putative CELF1 target. In conclusion, for the first time we provide evidence of a consistent dysregulation of splicing-related genes in MS and we discuss its possible implications in modulating specific AS events in MS susceptibility genes.
Hu, Wei Qi; Wang, Wei; Fang, Di Long; Yin, Xue Feng
2018-05-24
BACKGROUND We screened the potential molecular targets and investigated the molecular mechanisms of hepatocellular carcinoma (HCC). MATERIAL AND METHODS Microarray data of GSE47786, including the 40 μM berberine-treated HepG2 human hepatoma cell line and 0.08% DMSO-treated as control cells samples, was downloaded from the GEO database. Gene ontology (GO) and Kyoto Encyclopedia of Genes and Genomes pathway (KEGG) enrichment analyses were performed; the protein-protein interaction (PPI) networks were constructed using STRING database and Cytoscape; the genetic alteration, neighboring genes networks, and survival analysis of hub genes were explored by cBio portal; and the expression of mRNA level of hub genes was obtained from the Oncomine databases. RESULTS A total of 56 upregulated and 8 downregulated DEGs were identified. The GO analysis results were significantly enriched in cell-cycle arrest, regulation of transcription, DNA-dependent, protein amino acid phosphorylation, cell cycle, and apoptosis. The KEGG pathway analysis showed that DEGs were enriched in MAPK signaling pathway, ErbB signaling pathway, and p53 signaling pathway. JUN, EGR1, MYC, and CDKN1A were identified as hub genes in PPI networks. The genetic alteration of hub genes was mainly concentrated in amplification. TP53, NDRG1, and MAPK15 were found in neighboring genes networks. Altered genes had worse overall survival and disease-free survival than unaltered genes. The expressions of EGR1, MYC, and CDKN1A were significantly increased, but expression of JUN was not, in the Roessler Liver datasets. CONCLUSIONS We found that JUN, EGR1, MYC, and CDKN1A might be used as diagnostic and therapeutic molecular biomarkers and broaden our understanding of the molecular mechanisms of HCC.
Moreno, Marta; Fernández, Virginia; Monllau, Josep M.; Borrell, Víctor; Lerin, Carles; de la Iglesia, Núria
2015-01-01
Summary Neural stem cells (NSCs) reside in a hypoxic microenvironment within the brain. However, the crucial transcription factors (TFs) that regulate NSC biology under physiologic hypoxia are poorly understood. Here we have performed gene set enrichment analysis (GSEA) of microarray datasets from hypoxic versus normoxic NSCs with the aim of identifying pathways and TFs that are activated under oxygen concentrations mimicking normal brain tissue microenvironment. Integration of TF target (TFT) and pathway enrichment analysis identified the calcium-regulated TF NFATc4 as a major candidate to regulate hypoxic NSC functions. Nfatc4 expression was coordinately upregulated by top hypoxia-activated TFs, while NFATc4 target genes were enriched in hypoxic NSCs. Loss-of-function analyses further revealed that the calcineurin-NFATc4 signaling axis acts as a major regulator of NSC self-renewal and proliferation in vitro and in vivo by promoting the expression of TFs, including Id2, that contribute to the maintenance of the NSC state. PMID:26235896
Zhao, Zhengshan; Peytavi, Régis; Diaz-Quijada, Gerardo A.; Picard, Francois J.; Huletsky, Ann; Leblanc, Éric; Frenette, Johanne; Boivin, Guy; Veres, Teodor; Dumoulin, Michel M.; Bergeron, Michel G.
2008-01-01
Fabrication of microarray devices using traditional glass slides is not easily adaptable to integration into microfluidic systems. There is thus a need for the development of polymeric materials showing a high hybridization signal-to-background ratio, enabling sensitive detection of microbial pathogens. We have developed such plastic supports suitable for highly sensitive DNA microarray hybridizations. The proof of concept of this microarray technology was done through the detection of four human respiratory viruses that were amplified and labeled with a fluorescent dye via a sensitive reverse transcriptase PCR (RT-PCR) assay. The performance of the microarray hybridization with plastic supports made of PMMA [poly(methylmethacrylate)]-VSUVT or Zeonor 1060R was compared to that with high-quality glass slide microarrays by using both passive and microfluidic hybridization systems. Specific hybridization signal-to-background ratios comparable to that obtained with high-quality commercial glass slides were achieved with both polymeric substrates. Microarray hybridizations demonstrated an analytical sensitivity equivalent to approximately 100 viral genome copies per RT-PCR, which is at least 100-fold higher than the sensitivities of previously reported DNA hybridizations on plastic supports. Testing of these plastic polymers using a microfluidic microarray hybridization platform also showed results that were comparable to those with glass supports. In conclusion, PMMA-VSUVT and Zeonor 1060R are both suitable for highly sensitive microarray hybridizations. PMID:18784318
Morine, Melissa J; McMonagle, Jolene; Toomey, Sinead; Reynolds, Clare M; Moloney, Aidan P; Gormley, Isobel C; Gaora, Peadar O; Roche, Helen M
2010-10-07
Currently, a number of bioinformatics methods are available to generate appropriate lists of genes from a microarray experiment. While these lists represent an accurate primary analysis of the data, fewer options exist to contextualise those lists. The development and validation of such methods is crucial to the wider application of microarray technology in the clinical setting. Two key challenges in clinical bioinformatics involve appropriate statistical modelling of dynamic transcriptomic changes, and extraction of clinically relevant meaning from very large datasets. Here, we apply an approach to gene set enrichment analysis that allows for detection of bi-directional enrichment within a gene set. Furthermore, we apply canonical correlation analysis and Fisher's exact test, using plasma marker data with known clinical relevance to aid identification of the most important gene and pathway changes in our transcriptomic dataset. After a 28-day dietary intervention with high-CLA beef, a range of plasma markers indicated a marked improvement in the metabolic health of genetically obese mice. Tissue transcriptomic profiles indicated that the effects were most dramatic in liver (1270 genes significantly changed; p < 0.05), followed by muscle (601 genes) and adipose (16 genes). Results from modified GSEA showed that the high-CLA beef diet affected diverse biological processes across the three tissues, and that the majority of pathway changes reached significance only with the bi-directional test. Combining the liver tissue microarray results with plasma marker data revealed 110 CLA-sensitive genes showing strong canonical correlation with one or more plasma markers of metabolic health, and 9 significantly overrepresented pathways among this set; each of these pathways was also significantly changed by the high-CLA diet. Closer inspection of two of these pathways--selenoamino acid metabolism and steroid biosynthesis--illustrated clear diet-sensitive changes in constituent genes, as well as strong correlations between gene expression and plasma markers of metabolic syndrome independent of the dietary effect. Bi-directional gene set enrichment analysis more accurately reflects dynamic regulatory behaviour in biochemical pathways, and as such highlighted biologically relevant changes that were not detected using a traditional approach. In such cases where transcriptomic response to treatment is exceptionally large, canonical correlation analysis in conjunction with Fisher's exact test highlights the subset of pathways showing strongest correlation with the clinical markers of interest. In this case, we have identified selenoamino acid metabolism and steroid biosynthesis as key pathways mediating the observed relationship between metabolic health and high-CLA beef. These results indicate that this type of analysis has the potential to generate novel transcriptome-based biomarkers of disease.
2010-01-01
Background Currently, a number of bioinformatics methods are available to generate appropriate lists of genes from a microarray experiment. While these lists represent an accurate primary analysis of the data, fewer options exist to contextualise those lists. The development and validation of such methods is crucial to the wider application of microarray technology in the clinical setting. Two key challenges in clinical bioinformatics involve appropriate statistical modelling of dynamic transcriptomic changes, and extraction of clinically relevant meaning from very large datasets. Results Here, we apply an approach to gene set enrichment analysis that allows for detection of bi-directional enrichment within a gene set. Furthermore, we apply canonical correlation analysis and Fisher's exact test, using plasma marker data with known clinical relevance to aid identification of the most important gene and pathway changes in our transcriptomic dataset. After a 28-day dietary intervention with high-CLA beef, a range of plasma markers indicated a marked improvement in the metabolic health of genetically obese mice. Tissue transcriptomic profiles indicated that the effects were most dramatic in liver (1270 genes significantly changed; p < 0.05), followed by muscle (601 genes) and adipose (16 genes). Results from modified GSEA showed that the high-CLA beef diet affected diverse biological processes across the three tissues, and that the majority of pathway changes reached significance only with the bi-directional test. Combining the liver tissue microarray results with plasma marker data revealed 110 CLA-sensitive genes showing strong canonical correlation with one or more plasma markers of metabolic health, and 9 significantly overrepresented pathways among this set; each of these pathways was also significantly changed by the high-CLA diet. Closer inspection of two of these pathways - selenoamino acid metabolism and steroid biosynthesis - illustrated clear diet-sensitive changes in constituent genes, as well as strong correlations between gene expression and plasma markers of metabolic syndrome independent of the dietary effect. Conclusion Bi-directional gene set enrichment analysis more accurately reflects dynamic regulatory behaviour in biochemical pathways, and as such highlighted biologically relevant changes that were not detected using a traditional approach. In such cases where transcriptomic response to treatment is exceptionally large, canonical correlation analysis in conjunction with Fisher's exact test highlights the subset of pathways showing strongest correlation with the clinical markers of interest. In this case, we have identified selenoamino acid metabolism and steroid biosynthesis as key pathways mediating the observed relationship between metabolic health and high-CLA beef. These results indicate that this type of analysis has the potential to generate novel transcriptome-based biomarkers of disease. PMID:20929581
USDA-ARS?s Scientific Manuscript database
Human noroviruses cause up to 21 million cases of foodborne disease in the United States annually and are the most common cause of acute gastroenteritis in industrialized countries. To reduce the burden of foodborne disease associated with viruses, the use of low density DNA microarrays in conjunct...
Yamasaki, Maria; Miyagawa, Taku; Toyoda, Hiromi; Khor, Seik-Soon; Koike, Asako; Nitta, Aino; Akiyama, Kumi; Sasaki, Tsukasa; Honda, Yutaka; Honda, Makoto; Tokunaga, Katsushi
2014-05-01
In humans, narcolepsy with cataplexy (narcolepsy) is a sleep disorder that is characterized by sleepiness, cataplexy and rapid eye movement (REM) sleep abnormalities. Narcolepsy is caused by a reduction in the number of neurons that produce hypocretin (orexin) neuropeptide. Both genetic and environmental factors contribute to the development of narcolepsy.Rare and large copy number variations (CNVs) reportedly play a role in the etiology of a number of neuropsychiatric disorders. Narcolepsy is considered a neurological disorder; therefore, we sought to investigate any possible association between rare and large CNVs and human narcolepsy. We used DNA microarray data and a CNV detection software application, PennCNV-Affy, to detect CNVs in 426 Japanese narcoleptic patients and 562 healthy individuals. Overall, we found a significant enrichment of rare and large CNVs (frequency ≤1%, size ≥100 kb) in the patients (case-control ratio of CNV count=1.54, P=5.00 × 10(-4)). Next, we extended a region-based association analysis by including CNVs with its size ≥30 kb. Rare and large CNVs in PARK2 region showed a significant association with narcolepsy. Four patients were assessed to carry duplications of the gene region, whereas no controls carried the duplication, which was further confirmed by quantitative PCR assay. This duplication was also found in 2 essential hypersomnia (EHS) patients out of 171 patients. Furthermore, a pathway analysis revealed enrichments of gene disruptions by rare and large CNVs in immune response, acetyltransferase activity, cell cycle regulation and regulation of cell development. This study constitutes the first report on the risk association between multiple rare and large CNVs and the pathogenesis of narcolepsy. In the future, replication studies are needed to confirm the associations.
Poultney, Christopher S; Goldberg, Arthur P; Drapeau, Elodie; Kou, Yan; Harony-Nicolas, Hala; Kajiwara, Yuji; De Rubeis, Silvia; Durand, Simon; Stevens, Christine; Rehnström, Karola; Palotie, Aarno; Daly, Mark J; Ma'ayan, Avi; Fromer, Menachem; Buxbaum, Joseph D
2013-10-03
Copy number variation (CNV) is an important determinant of human diversity and plays important roles in susceptibility to disease. Most studies of CNV carried out to date have made use of chromosome microarray and have had a lower size limit for detection of about 30 kilobases (kb). With the emergence of whole-exome sequencing studies, we asked whether such data could be used to reliably call rare exonic CNV in the size range of 1-30 kilobases (kb), making use of the eXome Hidden Markov Model (XHMM) program. By using both transmission information and validation by molecular methods, we confirmed that small CNV encompassing as few as three exons can be reliably called from whole-exome data. We applied this approach to an autism case-control sample (n = 811, mean per-target read depth = 161) and observed a significant increase in the burden of rare (MAF ≤1%) 1-30 kb CNV, 1-30 kb deletions, and 1-10 kb deletions in ASD. CNV in the 1-30 kb range frequently hit just a single gene, and we were therefore able to carry out enrichment and pathway analyses, where we observed enrichment for disruption of genes in cytoskeletal and autophagy pathways in ASD. In summary, our results showed that XHMM provided an effective means to assess small exonic CNV from whole-exome data, indicated that rare 1-30 kb exonic deletions could contribute to risk in up to 7% of individuals with ASD, and implicated a candidate pathway in developmental delay syndromes. Copyright © 2013 The American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.
Sugii, Yuh; Kasai, Tomonari; Ikeda, Masashi; Vaidyanath, Arun; Kumon, Kazuki; Mizutani, Akifumi; Seno, Akimasa; Tokutaka, Heizo; Kudoh, Takayuki; Seno, Masaharu
2016-01-01
To identify cell-specific markers, we designed a DNA microarray platform with oligonucleotide probes for human membrane-anchored proteins. Human glioma cell lines were analyzed using microarray and compared with normal and fetal brain tissues. For the microarray analysis, we employed a spherical self-organizing map, which is a clustering method suitable for the conversion of multidimensional data into two-dimensional data and displays the relationship on a spherical surface. Based on the gene expression profile, the cell surface characteristics were successfully mirrored onto the spherical surface, thereby distinguishing normal brain tissue from the disease model based on the strength of gene expression. The clustered glioma-specific genes were further analyzed by polymerase chain reaction procedure and immunocytochemical staining of glioma cells. Our platform and the following procedure were successfully demonstrated to categorize the genes coding for cell surface proteins that are specific to glioma cells. Our assessment demonstrates that a spherical self-organizing map is a valuable tool for distinguishing cell surface markers and can be employed in marker discovery studies for the treatment of cancer.
GOEAST: a web-based software toolkit for Gene Ontology enrichment analysis.
Zheng, Qi; Wang, Xiu-Jie
2008-07-01
Gene Ontology (GO) analysis has become a commonly used approach for functional studies of large-scale genomic or transcriptomic data. Although there have been a lot of software with GO-related analysis functions, new tools are still needed to meet the requirements for data generated by newly developed technologies or for advanced analysis purpose. Here, we present a Gene Ontology Enrichment Analysis Software Toolkit (GOEAST), an easy-to-use web-based toolkit that identifies statistically overrepresented GO terms within given gene sets. Compared with available GO analysis tools, GOEAST has the following improved features: (i) GOEAST displays enriched GO terms in graphical format according to their relationships in the hierarchical tree of each GO category (biological process, molecular function and cellular component), therefore, provides better understanding of the correlations among enriched GO terms; (ii) GOEAST supports analysis for data from various sources (probe or probe set IDs of Affymetrix, Illumina, Agilent or customized microarrays, as well as different gene identifiers) and multiple species (about 60 prokaryote and eukaryote species); (iii) One unique feature of GOEAST is to allow cross comparison of the GO enrichment status of multiple experiments to identify functional correlations among them. GOEAST also provides rigorous statistical tests to enhance the reliability of analysis results. GOEAST is freely accessible at http://omicslab.genetics.ac.cn/GOEAST/
Draghici, Sorin; Tarca, Adi L; Yu, Longfei; Ethier, Stephen; Romero, Roberto
2008-03-01
The BioArray Software Environment (BASE) is a very popular MIAME-compliant, web-based microarray data repository. However in BASE, like in most other microarray data repositories, the experiment annotation and raw data uploading can be very timeconsuming, especially for large microarray experiments. We developed KUTE (Karmanos Universal daTabase for microarray Experiments), as a plug-in for BASE 2.0 that addresses these issues. KUTE provides an automatic experiment annotation feature and a completely redesigned data work-flow that dramatically reduce the human-computer interaction time. For instance, in BASE 2.0 a typical Affymetrix experiment involving 100 arrays required 4 h 30 min of user interaction time forexperiment annotation, and 45 min for data upload/download. In contrast, for the same experiment, KUTE required only 28 min of user interaction time for experiment annotation, and 3.3 min for data upload/download. http://vortex.cs.wayne.edu/kute/index.html.
Meinert, Christian; Gembardt, Florian; Böhme, Ilka; Tetzner, Anja; Wieland, Thomas; Greenberg, Barry; Walther, Thomas
2016-01-01
The study aimed to identify proteins regulated by the cardiovascular protective peptide angiotensin-(1-7) and to determine potential intracellular signaling cascades. Human endothelial cells were stimulated with Ang-(1-7) for 1 h, 3 h, 6 h, and 9 h. Peptide effects on intracellular signaling were assessed via antibody microarray, containing antibodies against 725 proteins. Bioinformatics software was used to identify affected intracellular signaling pathways. Microarray data was verified exemplarily by Western blot, Real-Time RT-PCR, and immunohistochemical studies. The microarray identified 110 regulated proteins after 1 h, 119 after 3 h, 31 after 6 h, and 86 after 9 h Ang-(1-7) stimulation. Regulated proteins were associated with high significance to several metabolic pathways like “Molecular Mechanism of Cancer” and “p53 signaling” in a time dependent manner. Exemplarily, Western blots for the E3-type small ubiquitin-like modifier ligase PIAS2 confirmed the microarray data and displayed a decrease by more than 50% after Ang-(1-7) stimulation at 1 h and 3 h without affecting its mRNA. Immunohistochemical studies with PIAS2 in human endothelial cells showed a decrease in cytoplasmic PIAS2 after Ang-(1-7) treatment. The Ang-(1-7) mediated decrease of PIAS2 was reproduced in other endothelial cell types. The results suggest that angiotensin-(1-7) plays a role in metabolic pathways related to cell death and cell survival in human endothelial cells.
Salomäki, Henriikka; Vähätalo, Laura H; Laurila, Kirsti; Jäppinen, Norma T; Penttinen, Anna-Maija; Ailanen, Liisa; Ilyasizadeh, Juan; Pesonen, Ullamari; Koulu, Markku
2013-01-01
The antidiabetic drug metformin is currently used prior and during pregnancy for polycystic ovary syndrome, as well as during gestational diabetes mellitus. We investigated the effects of prenatal metformin exposure on the metabolic phenotype of the offspring during adulthood in mice. Metformin (300 mg/kg) or vehicle was administered orally to dams on regular diet from the embryonic day E0.5 to E17.5. Gene expression profiles in liver and brain were analysed from 4-day old offspring by microarray. Body weight development and several metabolic parameters of offspring were monitored both during regular diet (RD-phase) and high fat diet (HFD-phase). At the end of the study, two doses of metformin or vehicle were given acutely to mice at the age of 20 weeks, and Insig-1 and GLUT4 mRNA expressions in liver and fat tissue were analysed using qRT-PCR. Metformin exposed fetuses were lighter at E18.5. There was no effect of metformin on the maternal body weight development or food intake. Metformin exposed offspring gained more body weight and mesenteric fat during the HFD-phase. The male offspring also had impaired glucose tolerance and elevated fasting glucose during the HFD-phase. Moreover, the expression of GLUT4 mRNA was down-regulated in epididymal fat in male offspring prenatally exposed to metformin. Based on the microarray and subsequent qRT-PCR analyses, the expression of Insig-1 was changed in the liver of neonatal mice exposed to metformin prenatally. Furthermore, metformin up-regulated the expression of Insig-1 later in development. Gene set enrichment analysis based on preliminary microarray data identified several differentially enriched pathways both in control and metformin exposed mice. The present study shows that prenatal metformin exposure causes long-term programming effects on the metabolic phenotype during high fat diet in mice. This should be taken into consideration when using metformin as a therapeutic agent during pregnancy.
Single-Molecule Electrical Random Resequencing of DNA and RNA
NASA Astrophysics Data System (ADS)
Ohshiro, Takahito; Matsubara, Kazuki; Tsutsui, Makusu; Furuhashi, Masayuki; Taniguchi, Masateru; Kawai, Tomoji
2012-07-01
Two paradigm shifts in DNA sequencing technologies--from bulk to single molecules and from optical to electrical detection--are expected to realize label-free, low-cost DNA sequencing that does not require PCR amplification. It will lead to development of high-throughput third-generation sequencing technologies for personalized medicine. Although nanopore devices have been proposed as third-generation DNA-sequencing devices, a significant milestone in these technologies has been attained by demonstrating a novel technique for resequencing DNA using electrical signals. Here we report single-molecule electrical resequencing of DNA and RNA using a hybrid method of identifying single-base molecules via tunneling currents and random sequencing. Our method reads sequences of nine types of DNA oligomers. The complete sequence of 5'-UGAGGUA-3' from the let-7 microRNA family was also identified by creating a composite of overlapping fragment sequences, which was randomly determined using tunneling current conducted by single-base molecules as they passed between a pair of nanoelectrodes.
2011-01-01
Background Technological advances are progressively increasing the application of genomics to a wider array of economically and ecologically important species. High-density maps enriched for transcribed genes facilitate the discovery of connections between genes and phenotypes. We report the construction of a high-density linkage map of expressed genes for the heterozygous genome of Eucalyptus using Single Feature Polymorphism (SFP) markers. Results SFP discovery and mapping was achieved using pseudo-testcross screening and selective mapping to simultaneously optimize linkage mapping and microarray costs. SFP genotyping was carried out by hybridizing complementary RNA prepared from 4.5 year-old trees xylem to an SFP array containing 103,000 25-mer oligonucleotide probes representing 20,726 unigenes derived from a modest size expressed sequence tags collection. An SFP-mapping microarray with 43,777 selected candidate SFP probes representing 15,698 genes was subsequently designed and used to genotype SFPs in a larger subset of the segregating population drawn by selective mapping. A total of 1,845 genes were mapped, with 884 of them ordered with high likelihood support on a framework map anchored to 180 microsatellites with average density of 1.2 cM. Using more probes per unigene increased by two-fold the likelihood of detecting segregating SFPs eventually resulting in more genes mapped. In silico validation showed that 87% of the SFPs map to the expected location on the 4.5X draft sequence of the Eucalyptus grandis genome. Conclusions The Eucalyptus 1,845 gene map is the most highly enriched map for transcriptional information for any forest tree species to date. It represents a major improvement on the number of genes previously positioned on Eucalyptus maps and provides an initial glimpse at the gene space for this global tree genome. A general protocol is proposed to build high-density transcript linkage maps in less characterized plant species by SFP genotyping with a concurrent objective of reducing microarray costs. HIgh-density gene-rich maps represent a powerful resource to assist gene discovery endeavors when used in combination with QTL and association mapping and should be especially valuable to assist the assembly of reference genome sequences soon to come for several plant and animal species. PMID:21492453
Information Commons for Rice (IC4R)
2016-01-01
Rice is the most important staple food for a large part of the world's human population and also a key model organism for plant research. Here, we present Information Commons for Rice (IC4R; http://ic4r.org), a rice knowledgebase featuring adoption of an extensible and sustainable architecture that integrates multiple omics data through community-contributed modules. Each module is developed and maintained by different committed groups, deals with data collection, processing and visualization, and delivers data on-demand via web services. In the current version, IC4R incorporates a variety of rice data through multiple committed modules, including genome-wide expression profiles derived entirely from RNA-Seq data, resequencing-based genomic variations obtained from re-sequencing data of thousands of rice varieties, plant homologous genes covering multiple diverse plant species, post-translational modifications, rice-related literatures and gene annotations contributed by the rice research community. Unlike extant related databases, IC4R is designed for scalability and sustainability and thus also features collaborative integration of rice data and low costs for database update and maintenance. Future directions of IC4R include incorporation of other omics data and association of multiple omics data with agronomically important traits, dedicating to build IC4R into a valuable knowledgebase for both basic and translational researches in rice. PMID:26519466
Gene Expression Analysis in Human Breast Cancer Associated Blood Vessels
Jones, Dylan T.; Lechertier, Tanguy; Mitter, Richard; Herbert, John M. J.; Bicknell, Roy; Jones, J. Louise; Li, Ji-Liang; Buffa, Francesca; Harris, Adrian L.; Hodivala-Dilke, Kairbaan
2012-01-01
Angiogenesis is essential for solid tumour growth, whilst the molecular profiles of tumour blood vessels have been reported to be different between cancer types. Although presently available anti-angiogenic strategies are providing some promise for the treatment of some cancers it is perhaps not surprisingly that, none of the anti-angiogenic agents available work on all tumours. Thus, the discovery of novel anti-angiogenic targets, relevant to individual cancer types, is required. Using Affymetrix microarray analysis of laser-captured, CD31-positive blood vessels we have identified 63 genes that are upregulated significantly (5–72 fold) in angiogenic blood vessels associated with human invasive ductal carcinoma (IDC) of the breast as compared with blood vessels in normal human breast. We tested the angiogenic capacity of a subset of these genes. Genes were selected based on either their known cellular functions, their enriched expression in endothelial cells and/or their sensitivity to anti-VEGF treatment; all features implicating their involvement in angiogenesis. For example, RRM2, a ribonucleotide reductase involved in DNA synthesis, was upregulated 32-fold in IDC-associated blood vessels; ATF1, a nuclear activating transcription factor involved in cellular growth and survival was upregulated 23-fold in IDC-associated blood vessels and HEX-B, a hexosaminidase involved in the breakdown of GM2 gangliosides, was upregulated 8-fold in IDC-associated blood vessels. Furthermore, in silico analysis confirmed that AFT1 and HEX-B also were enriched in endothelial cells when compared with non-endothelial cells. None of these genes have been reported previously to be involved in neovascularisation. However, our data establish that siRNA depletion of Rrm2, Atf1 or Hex-B had significant anti-angiogenic effects in VEGF-stimulated ex vivo mouse aortic ring assays. Overall, our results provide proof-of-principle that our approach can identify a cohort of potentially novel anti-angiogenic targets that are likley to be, but not exclusivley, relevant to breast cancer. PMID:23056178
Lionel, Anath C.; Tammimies, Kristiina; Vaags, Andrea K.; Rosenfeld, Jill A.; Ahn, Joo Wook; Merico, Daniele; Noor, Abdul; Runke, Cassandra K.; Pillalamarri, Vamsee K.; Carter, Melissa T.; Gazzellone, Matthew J.; Thiruvahindrapuram, Bhooma; Fagerberg, Christina; Laulund, Lone W.; Pellecchia, Giovanna; Lamoureux, Sylvia; Deshpande, Charu; Clayton-Smith, Jill; White, Ann C.; Leather, Susan; Trounce, John; Melanie Bedford, H.; Hatchwell, Eli; Eis, Peggy S.; Yuen, Ryan K.C.; Walker, Susan; Uddin, Mohammed; Geraghty, Michael T.; Nikkel, Sarah M.; Tomiak, Eva M.; Fernandez, Bridget A.; Soreni, Noam; Crosbie, Jennifer; Arnold, Paul D.; Schachar, Russell J.; Roberts, Wendy; Paterson, Andrew D.; So, Joyce; Szatmari, Peter; Chrysler, Christina; Woodbury-Smith, Marc; Brian Lowry, R.; Zwaigenbaum, Lonnie; Mandyam, Divya; Wei, John; MacDonald, Jeffrey R.; Howe, Jennifer L.; Nalpathamkalam, Thomas; Wang, Zhuozhi; Tolson, Daniel; Cobb, David S.; Wilks, Timothy M.; Sorensen, Mark J.; Bader, Patricia I.; An, Yu; Wu, Bai-Lin; Musumeci, Sebastiano Antonino; Romano, Corrado; Postorivo, Diana; Nardone, Anna M.; Monica, Matteo Della; Scarano, Gioacchino; Zoccante, Leonardo; Novara, Francesca; Zuffardi, Orsetta; Ciccone, Roberto; Antona, Vincenzo; Carella, Massimo; Zelante, Leopoldo; Cavalli, Pietro; Poggiani, Carlo; Cavallari, Ugo; Argiropoulos, Bob; Chernos, Judy; Brasch-Andersen, Charlotte; Speevak, Marsha; Fichera, Marco; Ogilvie, Caroline Mackie; Shen, Yiping; Hodge, Jennelle C.; Talkowski, Michael E.; Stavropoulos, Dimitri J.; Marshall, Christian R.; Scherer, Stephen W.
2014-01-01
Rare copy number variants (CNVs) disrupting ASTN2 or both ASTN2 and TRIM32 have been reported at 9q33.1 by genome-wide studies in a few individuals with neurodevelopmental disorders (NDDs). The vertebrate-specific astrotactins, ASTN2 and its paralog ASTN1, have key roles in glial-guided neuronal migration during brain development. To determine the prevalence of astrotactin mutations and delineate their associated phenotypic spectrum, we screened ASTN2/TRIM32 and ASTN1 (1q25.2) for exonic CNVs in clinical microarray data from 89 985 individuals across 10 sites, including 64 114 NDD subjects. In this clinical dataset, we identified 46 deletions and 12 duplications affecting ASTN2. Deletions of ASTN1 were much rarer. Deletions near the 3′ terminus of ASTN2, which would disrupt all transcript isoforms (a subset of these deletions also included TRIM32), were significantly enriched in the NDD subjects (P = 0.002) compared with 44 085 population-based controls. Frequent phenotypes observed in individuals with such deletions include autism spectrum disorder (ASD), attention deficit hyperactivity disorder (ADHD), speech delay, anxiety and obsessive compulsive disorder (OCD). The 3′-terminal ASTN2 deletions were significantly enriched compared with controls in males with NDDs, but not in females. Upon quantifying ASTN2 human brain RNA, we observed shorter isoforms expressed from an alternative transcription start site of recent evolutionary origin near the 3′ end. Spatiotemporal expression profiling in the human brain revealed consistently high ASTN1 expression while ASTN2 expression peaked in the early embryonic neocortex and postnatal cerebellar cortex. Our findings shed new light on the role of the astrotactins in psychopathology and their interplay in human neurodevelopment. PMID:24381304
Determining Semantically Related Significant Genes.
Taha, Kamal
2014-01-01
GO relation embodies some aspects of existence dependency. If GO term xis existence-dependent on GO term y, the presence of y implies the presence of x. Therefore, the genes annotated with the function of the GO term y are usually functionally and semantically related to the genes annotated with the function of the GO term x. A large number of gene set enrichment analysis methods have been developed in recent years for analyzing gene sets enrichment. However, most of these methods overlook the structural dependencies between GO terms in GO graph by not considering the concept of existence dependency. We propose in this paper a biological search engine called RSGSearch that identifies enriched sets of genes annotated with different functions using the concept of existence dependency. We observe that GO term xcannot be existence-dependent on GO term y, if x- and y- have the same specificity (biological characteristics). After encoding into a numeric format the contributions of GO terms annotating target genes to the semantics of their lowest common ancestors (LCAs), RSGSearch uses microarray experiment to identify the most significant LCA that annotates the result genes. We evaluated RSGSearch experimentally and compared it with five gene set enrichment systems. Results showed marked improvement.
Genome-wide survey of human alternative pre-mRNA splicing with exon junction microarrays.
Johnson, Jason M; Castle, John; Garrett-Engele, Philip; Kan, Zhengyan; Loerch, Patrick M; Armour, Christopher D; Santos, Ralph; Schadt, Eric E; Stoughton, Roland; Shoemaker, Daniel D
2003-12-19
Alternative pre-messenger RNA (pre-mRNA) splicing plays important roles in development, physiology, and disease, and more than half of human genes are alternatively spliced. To understand the biological roles and regulation of alternative splicing across different tissues and stages of development, systematic methods are needed. Here, we demonstrate the use of microarrays to monitor splicing at every exon-exon junction in more than 10,000 multi-exon human genes in 52 tissues and cell lines. These genome-wide data provide experimental evidence and tissue distributions for thousands of known and novel alternative splicing events. Adding to previous studies, the results indicate that at least 74% of human multi-exon genes are alternatively spliced.
Hou, Yixuan; Sun, Yan; Wang, Liyang; Luo, Haojun; Peng, Huimin; Liu, Manran
2013-01-01
Background The extensional signals in cross-talk between stromal cells and tumor cells generated from extracellular matrix molecules, soluble factor, and cell-cell adhesion complexes cooperate at the extra- and intracellular level in the tumor microenvironment. CAFs are the primary type of stromal cells in the tumor microenvironment and play a pivotal role in tumorigenesis and development. Hitherto, there is hardly any systematic analysis of the intrinsic relationship between CAFs function and its abnormal signaling pathway. The extreme complexity of CAFs’ features and their role in tumor development are needed to be further investigated. Methodology/Principal Findings We primary cultured CAFs and NFs from early stages of breast cancer tissue and identified them using their biomarker by immunohistochemistry for Fibronectin, α-SMA and FAP. Microarray was applied to analyze gene expression profiles of human breast CAFs and the paired NFs. The Up-regulated genes classified by Gene Ontology, signal pathways enriched by DAVID pathway analysis. Abnormal signaling pathways in breast cancer CAFs are involved in cell cycle, cell adhesion, signal transduction and protein transport being reported in CAFs derived from other tumors. Significantly, the altered ATM signaling pathway, a set of cell cycle regulated signaling, and immune associated signaling are identified to be changed in CAFs. Conclusions/Significance CAFs have the vigorous ability of proliferation and potential of invasion and migration comparing with NFs. CAFs could promote breast cancer cell invasion under co-culture conditions through up-regulated CCL18 and CXCL12. Consistently with its biologic behavior, the gene expression profiling analyzed by microarray shows that some of key signaling pathways, such as cell cycle, cell adhesion, and secreting factors play an important role in CAFs. The altered ATM signaling pathway is abnormally active in the early stage of breast cancer. The set of immune associated signaling may be involved in tumor cell immune evasion. PMID:23577100
Khaidakov, Magomed; Mitra, Sona; Wang, Xianwei; Ding, Zufeng; Bora, Nalini; Lyzogubov, Valery; Romeo, Francesco; Schichman, Steven A.; Mehta, Jawahar L.
2012-01-01
Oxidized LDL (ox-LDL) is a key factor in atherogenesis. It is taken up by endothelial cells primarily by ox-LDL receptor-1 (LOX-1). To elucidate transcriptional responses, we performed microarray analysis on human coronary artery endothelial cells (HCAECs) exposed to small physiologic concentration of ox-LDL- 5 µg/ml for 2 and 12 hours. At 12 hours, cultures treated with ox-LDL exhibited broad shifts in transcriptional activity involving almost 1500 genes (>1.5 fold difference, p<0.05). Resulting transcriptome was enriched for genes associated with cell adhesion (p<0.002), angiogenesis (p<0.0002) and migration (p<0.006). Quantitative PCR analysis revealed that LOX-1 expression in HCAECs is at least an order of magnitude greater than the expression of other major ox-LDL specific receptors CD36 and MSR1. In keeping with the data on LOX-1 expression, pre-treatment of HCAECs with LOX-1 neutralizing antibody resulted in across-the-board inhibition of cellular response to ox-LDL. Ox-LDL upregulated a number of pro-angiogenic genes including multiple receptors, ligands and transcription factors and altered the expression of a number of genes implicated in both stimulation and inhibition of apoptosis. From a functional standpoint, physiologic concentrations of ox-LDL stimulated tube formation and inhibited susceptibility to apoptosis in HCAECs. In addition, ox-LDL exposure resulted in upregulation of miR-1974, miR-1978 and miR-21 accompanied with significant over-presentation of their target genes in the downregulated portion of ox-LDL transcriptome. Our observations indicate that ox-LDL at physiologic concentrations induces broad transcriptional responses which are mediated by LOX-1, and are, in part, shaped by ox-LDL-dependent miRNAs. We also suggest that angiogenic effects of ox-LDL are partially based on upregulation of several receptors that render cells hypersensitive to angiogenic stimuli. PMID:23115646
Richard, Arianne C; Lyons, Paul A; Peters, James E; Biasci, Daniele; Flint, Shaun M; Lee, James C; McKinney, Eoin F; Siegel, Richard M; Smith, Kenneth G C
2014-08-04
Although numerous investigations have compared gene expression microarray platforms, preprocessing methods and batch correction algorithms using constructed spike-in or dilution datasets, there remains a paucity of studies examining the properties of microarray data using diverse biological samples. Most microarray experiments seek to identify subtle differences between samples with variable background noise, a scenario poorly represented by constructed datasets. Thus, microarray users lack important information regarding the complexities introduced in real-world experimental settings. The recent development of a multiplexed, digital technology for nucleic acid measurement enables counting of individual RNA molecules without amplification and, for the first time, permits such a study. Using a set of human leukocyte subset RNA samples, we compared previously acquired microarray expression values with RNA molecule counts determined by the nCounter Analysis System (NanoString Technologies) in selected genes. We found that gene measurements across samples correlated well between the two platforms, particularly for high-variance genes, while genes deemed unexpressed by the nCounter generally had both low expression and low variance on the microarray. Confirming previous findings from spike-in and dilution datasets, this "gold-standard" comparison demonstrated signal compression that varied dramatically by expression level and, to a lesser extent, by dataset. Most importantly, examination of three different cell types revealed that noise levels differed across tissues. Microarray measurements generally correlate with relative RNA molecule counts within optimal ranges but suffer from expression-dependent accuracy bias and precision that varies across datasets. We urge microarray users to consider expression-level effects in signal interpretation and to evaluate noise properties in each dataset independently.
Fabrication of Carbohydrate Microarrays by Boronate Formation.
Adak, Avijit K; Lin, Ting-Wei; Li, Ben-Yuan; Lin, Chun-Cheng
2017-01-01
The interactions between soluble carbohydrates and/or surface displayed glycans and protein receptors are essential to many biological processes and cellular recognition events. Carbohydrate microarrays provide opportunities for high-throughput quantitative analysis of carbohydrate-protein interactions. Over the past decade, various techniques have been implemented for immobilizing glycans on solid surfaces in a microarray format. Herein, we describe a detailed protocol for fabricating carbohydrate microarrays that capitalizes on the intrinsic reactivity of boronic acid toward carbohydrates to form stable boronate diesters. A large variety of unprotected carbohydrates ranging in structure from simple disaccharides and trisaccharides to considerably more complex human milk and blood group (oligo)saccharides have been covalently immobilized in a single step on glass slides, which were derivatized with high-affinity boronic acid ligands. The immobilized ligands in these microarrays maintain the receptor-binding activities including those of lectins and antibodies according to the structures of their pendant carbohydrates for rapid analysis of a number of carbohydrate-recognition events within 30 h. This method facilitates the direct construction of otherwise difficult to obtain carbohydrate microarrays from underivatized glycans.
Collard, J-F; Hinsenkamp, M
2015-05-01
We observed on different tissues and organisms a biological response after exposure to pulsed low frequency and low amplitude electric or electromagnetic fields but the precise mechanism of cell response remains unknown. The aim of this publication is to understand, using bioinformatics, the biological relevance of processes involved in the modification of gene expression. The list of genes analyzed was obtained after microarray protocol realized on cultures of human epidermal explants growing on deepidermized human skin exposed to a pulsed low frequency electric field. The directed acyclic graph on a WebGestalt Gene Ontology module shows six categories under the biological process root: "biological regulation", "cellular process", "cell proliferation", "death", "metabolic process" and "response to stimulus". Enriched derived categories are coherent with the type of in vitro culture, the stimulation protocol or with the previous results showing a decrease of cell proliferation and an increase of differentiation. The Kegg module on WebGestalt has highlighted "cell cycle" and "p53 signaling pathway" as significantly involved. The Kegg website brings out interactions between FoxO, MAPK, JNK, p53, p38, PI3K/Akt, Wnt, mTor or NF-KappaB. Some genes expressed by the stimulation are known to have an exclusive function on these pathways. Analyses performed with Pathway Studio linked cell proliferation, cell differentiation, apoptosis, cell cycle, mitosis, cell death etc. with our microarrays results. Medline citation generated by the software and the fold change variation confirms a diminution of the proliferation, activation of the differentiation and a less well-defined role of apoptosis or wound healing. Wnt and DKK functional classes, DKK1, MACF1, ATF3, MME, TXNRD1, and BMP-2 genes proposed in previous publications after a manual analysis are also highlighted with other genes after Pathway Studio automatic procedure. Finally, an analysis conducted on a list of genes characterized by an accelerated regulation after extremely low frequency pulsed stimulation also confirms their role in the processes of cell proliferation and differentiation. Bioinformatics approach allows in-depth research, without the bias of pre-selection, on cellular processes involved in a huge gene list. Copyright © 2015 Elsevier Inc. All rights reserved.
2012-01-01
Background Cancer-initiating cells (CICs) are proposed to be responsible for the generation of metastasis and resistance to therapy. Accumulating evidences indicates CICs are found among different human cancers and cell lines derived from them. Few studies address the characteristics of CICs in cervical cancer. We identify biological features of CICs from four of the best-know human cell lines from uterine cervix tumors. (HeLa, SiHa, Ca Ski, C-4 I). Methods Cells were cultured as spheres under stem-cell conditions. Flow cytometry was used to detect expression of CD34, CD49f and CD133 antigens and Hoechst 33342 staining to identify side population (SP). Magnetic and fluorescence-activated cell sorting was applied to enrich and purify populations used to evaluate tumorigenicity in nude mice. cDNA microarray analysis and in vitro radioresistance assay were carried out under standard conditions. Results CICs, enriched as spheroids, were capable to generate reproducible tumor phenotypes in nu-nu mice and serial propagation. Injection of 1 × 103 dissociated spheroid cells induced tumors in the majority of animals, whereas injection of 1 × 105 monolayer cells remained nontumorigenic. Sphere-derived CICs expressed CD49f surface marker. Gene profiling analysis of HeLa and SiHa spheroid cells showed up-regulation of CICs markers characteristic of the female reproductive system. Importantly, epithelial to mesenchymal (EMT) transition-associated markers were found highly expressed in spheroid cells. More importantly, gene expression analysis indicated that genes required for radioresistance were also up-regulated, including components of the double-strand break (DSB) DNA repair machinery and the metabolism of reactive oxygen species (ROS). Dose-dependent radiation assay indicated indeed that CICs-enriched populations exhibit an increased resistance to ionizing radiation (IR). Conclusions We characterized a self-renewing subpopulation of CICs found among four well known human cancer-derived cell lines (HeLa, SiHa, Ca Ski and C-4 I) and found that they express characteristic markers of stem cell, EMT and radioresistance. The fact that CICs demonstrated a higher degree of resistance to radiation than differentiated cells suggests that specific detection and targeting of CICs could be highly valuable for the therapy of tumors from the uterine cervix. PMID:22284662
Fang, H; Tong, W; Perkins, R; Shi, L; Hong, H; Cao, X; Xie, Q; Yim, SH; Ward, JM; Pitot, HC; Dragan, YP
2005-01-01
Background The completion of the sequencing of human, mouse and rat genomes and knowledge of cross-species gene homologies enables studies of differential gene expression in animal models. These types of studies have the potential to greatly enhance our understanding of diseases such as liver cancer in humans. Genes co-expressed across multiple species are most likely to have conserved functions. We have used various bioinformatics approaches to examine microarray expression profiles from liver neoplasms that arise in albumin-SV40 transgenic rats to elucidate genes, chromosome aberrations and pathways that might be associated with human liver cancer. Results In this study, we first identified 2223 differentially expressed genes by comparing gene expression profiles for two control, two adenoma and two carcinoma samples using an F-test. These genes were subsequently mapped to the rat chromosomes using a novel visualization tool, the Chromosome Plot. Using the same plot, we further mapped the significant genes to orthologous chromosomal locations in human and mouse. Many genes expressed in rat 1q that are amplified in rat liver cancer map to the human chromosomes 10, 11 and 19 and to the mouse chromosomes 7, 17 and 19, which have been implicated in studies of human and mouse liver cancer. Using Comparative Genomics Microarray Analysis (CGMA), we identified regions of potential aberrations in human. Lastly, a pathway analysis was conducted to predict altered human pathways based on statistical analysis and extrapolation from the rat data. All of the identified pathways have been known to be important in the etiology of human liver cancer, including cell cycle control, cell growth and differentiation, apoptosis, transcriptional regulation, and protein metabolism. Conclusion The study demonstrates that the hepatic gene expression profiles from the albumin-SV40 transgenic rat model revealed genes, pathways and chromosome alterations consistent with experimental and clinical research in human liver cancer. The bioinformatics tools presented in this paper are essential for cross species extrapolation and mapping of microarray data, its analysis and interpretation. PMID:16026603
Protein Microarray Analysis in Patients With Asthma*
Kim, Hyo-Bin; Kim, Chang-Keun; Iijima, Koji; Kobayashi, Takao; Kita, Hirohito
2010-01-01
Background Microarray technology offers a new opportunity to gain insight into global gene and protein expression profiles in asthma. To identify novel factors produced in the asthmatic airway, we analyzed sputum samples by using a membrane-based human cytokine microarray technology in patients with bronchial asthma (BA). Methods Induced sputum was obtained from 28 BA subjects, 20 nonasthmatic atopic control (AC) subjects, and 38 nonasthmatic nonatopic normal control (NC) subjects. The microarray samples of subjects were randomly selected from nine BA subjects, three AC subjects, and six NC subjects. Sputum supernatants were analyzed using a custom human cytokine array (RayBio Custom Human Cytokine Array; RayBiotech; Norcross, GA) designed to analyze 79 specific cytokines simultaneously. The levels of growth-regulated oncogene (GRO)-α, eotaxin-2, and pulmonary and activation-regulated chemokine (PARC)/CCL18 were measured by sandwich enzyme-linked immunosorbent assays (ELISAs), and eosinophil-derived neurotoxin (EDN) was measured by radioimmunoassay. Results By microarray, the signal intensities for GRO-α, eotaxin-2, and PARC were significantly higher in BA subjects than in AC and NC subjects (p = 0.036, p = 0.042, and p = 0.033, respectively). By ELISA, the sputum PARC protein levels were significantly higher in BA subjects than in AC and NC subjects (p < 0.0001). Furthermore, PARC levels correlated significantly with sputum eosinophil percentages (r = 0.570, p < 0.0001) and the levels of EDN(r = 0.633, p < 0.0001), the regulated upon activation, normal T cell expressed and secreted cytokine (r = 0.440, p < 0.001), interleukin-4 (r = 0.415, p < 0.01), and interferon-γ (r = 0.491, p < 0.001). Conclusions By a nonbiased screening approach, a chemokine, PARC, is elevated in sputum specimens from patients with asthma. PARC may play important roles in development of airway eosinophilic inflammation in asthma. PMID:19017877
Glycome Diagnosis of Human Induced Pluripotent Stem Cells Using Lectin Microarray*
Tateno, Hiroaki; Toyota, Masashi; Saito, Shigeru; Onuma, Yasuko; Ito, Yuzuru; Hiemori, Keiko; Fukumura, Mihoko; Matsushima, Asako; Nakanishi, Mio; Ohnuma, Kiyoshi; Akutsu, Hidenori; Umezawa, Akihiro; Horimoto, Katsuhisa; Hirabayashi, Jun; Asashima, Makoto
2011-01-01
Induced pluripotent stem cells (iPSCs) can now be produced from various somatic cell (SC) lines by ectopic expression of the four transcription factors. Although the procedure has been demonstrated to induce global change in gene and microRNA expressions and even epigenetic modification, it remains largely unknown how this transcription factor-induced reprogramming affects the total glycan repertoire expressed on the cells. Here we performed a comprehensive glycan analysis using 114 types of human iPSCs generated from five different SCs and compared their glycomes with those of human embryonic stem cells (ESCs; nine cell types) using a high density lectin microarray. In unsupervised cluster analysis of the results obtained by lectin microarray, both undifferentiated iPSCs and ESCs were clustered as one large group. However, they were clearly separated from the group of differentiated SCs, whereas all of the four SCs had apparently distinct glycome profiles from one another, demonstrating that SCs with originally distinct glycan profiles have acquired those similar to ESCs upon induction of pluripotency. Thirty-eight lectins discriminating between SCs and iPSCs/ESCs were statistically selected, and characteristic features of the pluripotent state were then obtained at the level of the cellular glycome. The expression profiles of relevant glycosyltransferase genes agreed well with the results obtained by lectin microarray. Among the 38 lectins, rBC2LCN was found to detect only undifferentiated iPSCs/ESCs and not differentiated SCs. Hence, the high density lectin microarray has proved to be valid for not only comprehensive analysis of glycans but also diagnosis of stem cells under the concept of the cellular glycome. PMID:21471226
Microarray profiling of human white adipose tissue after exogenous leptin injection.
Taleb, S; Van Haaften, R; Henegar, C; Hukshorn, C; Cancello, R; Pelloux, V; Hanczar, B; Viguerie, N; Langin, D; Evelo, C; Zucker, J; Clément, K; Saris, W H M
2006-03-01
Leptin is a secreted adipocyte hormone that plays a key role in the regulation of body weight homeostasis. The leptin effect on human white adipose tissue (WAT) is still debated. The aim of this study was to assess whether the administration of polyethylene glycol-leptin (PEG-OB) in a single supraphysiological dose has transcriptional effects on genes of WAT and to identify its target genes and functional pathways in WAT. Blood samples and WAT biopsies were obtained from 10 healthy nonobese men before treatment and 72 h after the PEG-OB injection, leading to an approximate 809-fold increase in circulating leptin. The WAT gene expression profile before and after the PEG-OB injection was compared using pangenomic microarrays. Functional gene annotations based on the gene ontology of the PEG-OB regulated genes were performed using both an 'in house' automated procedure and GenMAPP (Gene Microarray Pathway Profiler), designed for viewing and analyzing gene expression data in the context of biological pathways. Statistical analysis of microarray data revealed that PEG-OB had a major down-regulated effect on WAT gene expression, as we obtained 1,822 and 100 down- and up-regulated genes, respectively. Microarray data were validated using reverse transcription quantitative PCR. Functional gene annotations of PEG-OB regulated genes revealed that the functional class related to immunity and inflammation was among the most mobilized PEG-OB pathway in WAT. These genes are mainly expressed in the cell of the stroma vascular fraction in comparison with adipocytes. Our observations support the hypothesis that leptin could act on WAT, particularly on genes related to inflammation and immunity, which may suggest a novel leptin target pathway in human WAT.
An investigation of obesity susceptibility genes in Northern Han Chinese by targeted resequencing.
Wu, Yili; Wang, Weijing; Jiang, Wenjie; Yao, Jie; Zhang, Dongfeng
2017-02-01
Our earlier genome-wide linkage study of body mass index (BMI) showed strong signals from 7q36.3 and 8q21.13. This case-control study set to investigate 2 genomic regions which may harbor variants contributed to development of obesity.We employed targeted resequencing technology to detect single nucleotide polymorphisms (SNPs) in 7q36.3 and 8q21.13 from 16 individuals with obesity. These were compared with 504 East Asians in the 1000 Genomes Project as a reference panel. Linkage disequilibrium (LD) block analysis was performed for the significant SNPs located near the same gene. Genes involved in statistically significant loci were then subject to gene set enrichment analysis (GSEA).The 16 individuals aged between 30 and 60 years with BMI = 33.25 ± 2.22 kg/m. A total of 12,131 genetic variants across all of samples were found. After correcting for multiple testing, 65 SNPs from 25 nearest genes (INSIG1, FABP5, PTPRN2, VIPR2, WDR60, SHH, UBE3C, LMBR1, PAG1, IMPA1, CHMP4, SNX16, BLACE, EN2, CNPY1, LOC100506302, RBM33, LOC389602, LOC285889, LINC01006, NOM1, DNAJB6, LOC101927914, ESYT2, LINC00689) were associated with obesity at significant level q-value ≤ 0.05. LD block analysis showed there were 10 pairs of loci with D' ≥ 0.8 and r ≥ 0.8. GSEA further identified 2 major related gene sets, involving lipid raft and lipid metabolic process, with FDR values <0.12 and <0.4, respectively.Our data are the first documentation of genetic variants in 7q36.3 and 8q21.13 associated with obesity using target capture sequencing and Northern Han Chinese samples. Additional replication and functional studies are merited to validate our findings.
Parham, Fred; Portier, Christopher J.; Chang, Xiaoqing; Mevissen, Meike
2016-01-01
Using in vitro data in human cell lines, several research groups have investigated changes in gene expression in cellular systems following exposure to extremely low frequency (ELF) and radiofrequency (RF) electromagnetic fields (EMF). For ELF EMF, we obtained five studies with complete microarray data and three studies with only lists of significantly altered genes. Likewise, for RF EMF, we obtained 13 complete microarray datasets and 5 limited datasets. Plausible linkages between exposure to ELF and RF EMF and human diseases were identified using a three-step process: (a) linking genes associated with classes of human diseases to molecular pathways, (b) linking pathways to ELF and RF EMF microarray data, and (c) identifying associations between human disease and EMF exposures where the pathways are significantly similar. A total of 60 pathways were associated with human diseases, mostly focused on basic cellular functions like JAK–STAT signaling or metabolic functions like xenobiotic metabolism by cytochrome P450 enzymes. ELF EMF datasets were sporadically linked to human diseases, but no clear pattern emerged. Individual datasets showed some linkage to cancer, chemical dependency, metabolic disorders, and neurological disorders. RF EMF datasets were not strongly linked to any disorders but strongly linked to changes in several pathways. Based on these analyses, the most promising area for further research would be to focus on EMF and neurological function and disorders. PMID:27656641
Replication dynamics of the yeast genome.
Raghuraman, M K; Winzeler, E A; Collingwood, D; Hunt, S; Wodicka, L; Conway, A; Lockhart, D J; Davis, R W; Brewer, B J; Fangman, W L
2001-10-05
Oligonucleotide microarrays were used to map the detailed topography of chromosome replication in the budding yeast Saccharomyces cerevisiae. The times of replication of thousands of sites across the genome were determined by hybridizing replicated and unreplicated DNAs, isolated at different times in S phase, to the microarrays. Origin activations take place continuously throughout S phase but with most firings near mid-S phase. Rates of replication fork movement vary greatly from region to region in the genome. The two ends of each of the 16 chromosomes are highly correlated in their times of replication. This microarray approach is readily applicable to other organisms, including humans.
Lee, Patrick K H; Men, Yujie; Wang, Shanquan; He, Jianzhong; Alvarez-Cohen, Lisa
2015-02-03
Dehalococcoides mccartyi are functionally important bacteria that catalyze the reductive dechlorination of chlorinated ethenes. However, these anaerobic bacteria are fastidious to isolate, making downstream genomic characterization challenging. In order to facilitate genomic analysis, a fluorescence-activated cell sorting (FACS) method was developed in this study to separate D. mccartyi cells from a microbial community, and the DNA of the isolated cells was processed by whole genome amplification (WGA) and hybridized onto a D. mccartyi microarray for comparative genomics against four sequenced strains. First, FACS was successfully applied to a D. mccartyi isolate as positive control, and then microarray results verified that WGA from 10(6) cells or ∼1 ng of genomic DNA yielded high-quality coverage detecting nearly all genes across the genome. As expected, some inter- and intrasample variability in WGA was observed, but these biases were minimized by performing multiple parallel amplifications. Subsequent application of the FACS and WGA protocols to two enrichment cultures containing ∼10% and ∼1% D. mccartyi cells successfully enabled genomic analysis. As proof of concept, this study demonstrates that coupling FACS with WGA and microarrays is a promising tool to expedite genomic characterization of target strains in environmental communities where the relative concentrations are low.
Farris, M Heath; Scott, Andrew R; Texter, Pamela A; Bartlett, Marta; Coleman, Patricia; Masters, David
2018-04-11
Single nucleotide polymorphisms (SNPs) located within the human genome have been shown to have utility as markers of identity in the differentiation of DNA from individual contributors. Massively parallel DNA sequencing (MPS) technologies and human genome SNP databases allow for the design of suites of identity-linked target regions, amenable to sequencing in a multiplexed and massively parallel manner. Therefore, tools are needed for leveraging the genotypic information found within SNP databases for the discovery of genomic targets that can be evaluated on MPS platforms. The SNP island target identification algorithm (TIA) was developed as a user-tunable system to leverage SNP information within databases. Using data within the 1000 Genomes Project SNP database, human genome regions were identified that contain globally ubiquitous identity-linked SNPs and that were responsive to targeted resequencing on MPS platforms. Algorithmic filters were used to exclude target regions that did not conform to user-tunable SNP island target characteristics. To validate the accuracy of TIA for discovering these identity-linked SNP islands within the human genome, SNP island target regions were amplified from 70 contributor genomic DNA samples using the polymerase chain reaction. Multiplexed amplicons were sequenced using the Illumina MiSeq platform, and the resulting sequences were analyzed for SNP variations. 166 putative identity-linked SNPs were targeted in the identified genomic regions. Of the 309 SNPs that provided discerning power across individual SNP profiles, 74 previously undefined SNPs were identified during evaluation of targets from individual genomes. Overall, DNA samples of 70 individuals were uniquely identified using a subset of the suite of identity-linked SNP islands. TIA offers a tunable genome search tool for the discovery of targeted genomic regions that are scalable in the population frequency and numbers of SNPs contained within the SNP island regions. It also allows the definition of sequence length and sequence variability of the target region as well as the less variable flanking regions for tailoring to MPS platforms. As shown in this study, TIA can be used to discover identity-linked SNP islands within the human genome, useful for differentiating individuals by targeted resequencing on MPS technologies.
Pavlova, T V; Kashuba, V I; Muravenko, O V; Yenamandra, S P; Ivanova, T A; Zabarovskaia, V I; Rakhmanaliev, E R; Petrenko, L A; Pronina, I V; Loginov, V I; Iurkevich, O Iu; Kiselev, L L; Zelenin, A V; Zabarovskiĭ, E R
2009-01-01
New comparative genome hybridization technology on NotI-microarrays is presented (Karolinska Institute International Patent WO02/086163). The method is based on comparative genome hybridization of NotI-probes from tumor and normal genomic DNA with the principle of new DNA NotI-microarrays. Using this method 181 NotI linking loci from human chromosome 3 were analyzed in 200 malignant tumor samples from different organs: kidney, lung, breast, ovary, cervical, prostate. Most frequently (more than in 30%) aberrations--deletions, methylation,--were identified in NotI-sites located in MINT24, BHLHB2, RPL15, RARbeta1, ITGA9, RBSP3, VHL, ZIC4 genes, that suggests they probably are involved in cancer development. Methylation of these genomic loci was confirmed by methylation-specific PCR and bisulfite sequencing. The results demonstrate perspective of using this method to solve some oncogenomic problems.
Bopp, Selina E. R.; Manary, Micah J.; Bright, A. Taylor; Johnston, Geoffrey L.; Dharia, Neekesh V.; Luna, Fabio L.; McCormack, Susan; Plouffe, David; McNamara, Case W.; Walker, John R.; Fidock, David A.; Denchi, Eros Lazzerini; Winzeler, Elizabeth A.
2013-01-01
Malaria parasites elude eradication attempts both within the human host and across nations. At the individual level, parasites evade the host immune responses through antigenic variation. At the global level, parasites escape drug pressure through single nucleotide variants and gene copy amplification events conferring drug resistance. Despite their importance to global health, the rates at which these genomic alterations emerge have not been determined. We studied the complete genomes of different Plasmodium falciparum clones that had been propagated asexually over one year in the presence and absence of drug pressure. A combination of whole-genome microarray analysis and next-generation deep resequencing (totaling 14 terabases) revealed a stable core genome with only 38 novel single nucleotide variants appearing in seventeen evolved clones (avg. 5.4 per clone). In clones exposed to atovaquone, we found cytochrome b mutations as well as an amplification event encompassing the P. falciparum multidrug resistance associated protein (mrp1) on chromosome 1. We observed 18 large-scale (>1 kb on average) deletions of telomere-proximal regions encoding multigene families, involved in immune evasion (9.5×10−6 structural variants per base pair per generation). Six of these deletions were associated with chromosomal crossovers generated during mitosis. We found only minor differences in rates between genetically distinct strains and between parasites cultured in the presence or absence of drug. Using these derived mutation rates for P. falciparum (1.0–9.7×10−9 mutations per base pair per generation), we can now model the frequency at which drug or immune resistance alleles will emerge under a well-defined set of assumptions. Further, the detection of mitotic recombination events in var gene families illustrates how multigene families can arise and change over time in P. falciparum. These results will help improve our understanding of how P. falciparum evolves to evade control efforts within both the individual hosts and large populations. PMID:23408914
Hess, Jonathan L.; Tylee, Daniel S.; Barve, Rahul; de Jong, Simone; Ophoff, Roel A.; Kumarasinghe, Nishantha; Tooney, Paul; Schall, Ulrich; Gardiner, Erin; Beveridge, Natalie Jane; Scott, Rodney J.; Yasawardene, Surangi; Perera, Antionette; Mendis, Jayan; Carr, Vaughan; Kelly, Brian; Cairns, Murray; Tsuang, Ming T.; Glatt, Stephen J.
2016-01-01
The application of microarray technology in schizophrenia research was heralded as paradigm-shifting, as it allowed for high-throughput assessment of cell and tissue function. This technology was widely adopted, initially in studies of postmortem brain tissue, and later in studies of peripheral blood. The collective body of schizophrenia microarray literature contains apparent inconsistencies between studies, with failures to replicate top hits, in part due to small sample sizes, cohort-specific effects, differences in array types, and other confounders. In an attempt to summarize existing studies of schizophrenia cases and non-related comparison subjects, we performed two mega-analyses of a combined set of microarray data from postmortem prefrontal cortices (n = 315) and from ex-vivo blood tissues (n = 578). We adjusted regression models per gene to remove non-significant covariates, providing best-estimates of transcripts dysregulated in schizophrenia. We also examined dysregulation of functionally related gene sets and gene co-expression modules, and assessed enrichment of cell types and genetic risk factors. The identities of the most significantly dysregulated genes were largely distinct for each tissue, but the findings indicated common emergent biological functions (e.g. immunity) and regulatory factors (e.g., predicted targets of transcription factors and miRNA species across tissues). Our network-based analyses converged upon similar patterns of heightened innate immune gene expression in both brain and blood in schizophrenia. We also constructed generalizable machine-learning classifiers using the blood-based microarray data. Our study provides an informative atlas for future pathophysiologic and biomarker studies of schizophrenia. PMID:27450777
Hess, Jonathan L; Tylee, Daniel S; Barve, Rahul; de Jong, Simone; Ophoff, Roel A; Kumarasinghe, Nishantha; Tooney, Paul; Schall, Ulrich; Gardiner, Erin; Beveridge, Natalie Jane; Scott, Rodney J; Yasawardene, Surangi; Perera, Antionette; Mendis, Jayan; Carr, Vaughan; Kelly, Brian; Cairns, Murray; Tsuang, Ming T; Glatt, Stephen J
2016-10-01
The application of microarray technology in schizophrenia research was heralded as paradigm-shifting, as it allowed for high-throughput assessment of cell and tissue function. This technology was widely adopted, initially in studies of postmortem brain tissue, and later in studies of peripheral blood. The collective body of schizophrenia microarray literature contains apparent inconsistencies between studies, with failures to replicate top hits, in part due to small sample sizes, cohort-specific effects, differences in array types, and other confounders. In an attempt to summarize existing studies of schizophrenia cases and non-related comparison subjects, we performed two mega-analyses of a combined set of microarray data from postmortem prefrontal cortices (n=315) and from ex-vivo blood tissues (n=578). We adjusted regression models per gene to remove non-significant covariates, providing best-estimates of transcripts dysregulated in schizophrenia. We also examined dysregulation of functionally related gene sets and gene co-expression modules, and assessed enrichment of cell types and genetic risk factors. The identities of the most significantly dysregulated genes were largely distinct for each tissue, but the findings indicated common emergent biological functions (e.g. immunity) and regulatory factors (e.g., predicted targets of transcription factors and miRNA species across tissues). Our network-based analyses converged upon similar patterns of heightened innate immune gene expression in both brain and blood in schizophrenia. We also constructed generalizable machine-learning classifiers using the blood-based microarray data. Our study provides an informative atlas for future pathophysiologic and biomarker studies of schizophrenia. Published by Elsevier B.V.
Identifying novel glioma associated pathways based on systems biology level meta-analysis.
Hu, Yangfan; Li, Jinquan; Yan, Wenying; Chen, Jiajia; Li, Yin; Hu, Guang; Shen, Bairong
2013-01-01
With recent advances in microarray technology, including genomics, proteomics, and metabolomics, it brings a great challenge for integrating this "-omics" data to analysis complex disease. Glioma is an extremely aggressive and lethal form of brain tumor, and thus the study of the molecule mechanism underlying glioma remains very important. To date, most studies focus on detecting the differentially expressed genes in glioma. However, the meta-analysis for pathway analysis based on multiple microarray datasets has not been systematically pursued. In this study, we therefore developed a systems biology based approach by integrating three types of omics data to identify common pathways in glioma. Firstly, the meta-analysis has been performed to study the overlapping of signatures at different levels based on the microarray gene expression data of glioma. Among these gene expression datasets, 12 pathways were found in GeneGO database that shared by four stages. Then, microRNA expression profiles and ChIP-seq data were integrated for the further pathway enrichment analysis. As a result, we suggest 5 of these pathways could be served as putative pathways in glioma. Among them, the pathway of TGF-beta-dependent induction of EMT via SMAD is of particular importance. Our results demonstrate that the meta-analysis based on systems biology level provide a more useful approach to study the molecule mechanism of complex disease. The integration of different types of omics data, including gene expression microarrays, microRNA and ChIP-seq data, suggest some common pathways correlated with glioma. These findings will offer useful potential candidates for targeted therapeutic intervention of glioma.
Characterization of human septic sera induced gene expression modulation in human myocytes
Hussein, Shaimaa; Michael, Paul; Brabant, Danielle; Omri, Abdelwahab; Narain, Ravin; Passi, Kalpdrum; Ramana, Chilakamarti V.; Parrillo, Joseph E.; Kumar, Anand; Parissenti, Amadeo; Kumar, Aseem
2009-01-01
To gain a better understanding of the gene expression changes that occurs during sepsis, we have performed a cDNA microarray study utilizing a tissue culture model that mimics human sepsis. This study utilized an in vitro model of cultured human fetal cardiac myocytes treated with 10% sera from septic patients or 10% sera from healthy volunteers. A 1700 cDNA expression microarray was used to compare the transcription profile from human cardiac myocytes treated with septic sera vs normal sera. Septic sera treatment of myocytes resulted in the down-regulation of 178 genes and the up-regulation of 4 genes. Our data indicate that septic sera induced cell cycle, metabolic, transcription factor and apoptotic gene expression changes in human myocytes. Identification and characterization of gene expression changes that occur during sepsis may lead to the development of novel therapeutics and diagnostics. PMID:19684886
Sun, Duanchen; Liu, Yinliang; Zhang, Xiang-Sun; Wu, Ling-Yun
2017-09-21
High-throughput experimental techniques have been dramatically improved and widely applied in the past decades. However, biological interpretation of the high-throughput experimental results, such as differential expression gene sets derived from microarray or RNA-seq experiments, is still a challenging task. Gene Ontology (GO) is commonly used in the functional enrichment studies. The GO terms identified via current functional enrichment analysis tools often contain direct parent or descendant terms in the GO hierarchical structure. Highly redundant terms make users difficult to analyze the underlying biological processes. In this paper, a novel network-based probabilistic generative model, NetGen, was proposed to perform the functional enrichment analysis. An additional protein-protein interaction (PPI) network was explicitly used to assist the identification of significantly enriched GO terms. NetGen achieved a superior performance than the existing methods in the simulation studies. The effectiveness of NetGen was explored further on four real datasets. Notably, several GO terms which were not directly linked with the active gene list for each disease were identified. These terms were closely related to the corresponding diseases when accessed to the curated literatures. NetGen has been implemented in the R package CopTea publicly available at GitHub ( http://github.com/wulingyun/CopTea/ ). Our procedure leads to a more reasonable and interpretable result of the functional enrichment analysis. As a novel term combination-based functional enrichment analysis method, NetGen is complementary to current individual term-based methods, and can help to explore the underlying pathogenesis of complex diseases.
El Kaoutari, Abdessamad; Armougom, Fabrice; Leroy, Quentin; Vialettes, Bernard; Million, Matthieu; Raoult, Didier; Henrissat, Bernard
2013-01-01
Distal gut bacteria play a pivotal role in the digestion of dietary polysaccharides by producing a large number of carbohydrate-active enzymes (CAZymes) that the host otherwise does not produce. We report here the design of a custom microarray that we used to spot non-redundant DNA probes for more than 6,500 genes encoding glycoside hydrolases and lyases selected from 174 reference genomes from distal gut bacteria. The custom microarray was tested and validated by the hybridization of bacterial DNA extracted from the stool samples of lean, obese and anorexic individuals. Our results suggest that a microarray-based study can detect genes from low-abundance bacteria better than metagenomic-based studies. A striking example was the finding that a gene encoding a GH6-family cellulase was present in all subjects examined, whereas metagenomic studies have consistently failed to detect this gene in both human and animal gut microbiomes. In addition, an examination of eight stool samples allowed the identification of a corresponding CAZome core containing 46 families of glycoside hydrolases and polysaccharide lyases, which suggests the functional stability of the gut microbiota despite large taxonomical variations between individuals.
Screening key candidate genes and pathways involved in insulinoma by microarray analysis.
Zhou, Wuhua; Gong, Li; Li, Xuefeng; Wan, Yunyan; Wang, Xiangfei; Li, Huili; Jiang, Bin
2018-06-01
Insulinoma is a rare type tumor and its genetic features remain largely unknown. This study aimed to search for potential key genes and relevant enriched pathways of insulinoma.The gene expression data from GSE73338 were downloaded from Gene Expression Omnibus database. Differentially expressed genes (DEGs) were identified between insulinoma tissues and normal pancreas tissues, followed by pathway enrichment analysis, protein-protein interaction (PPI) network construction, and module analysis. The expressions of candidate key genes were validated by quantitative real-time polymerase chain reaction (RT-PCR) in insulinoma tissues.A total of 1632 DEGs were obtained, including 1117 upregulated genes and 514 downregulated genes. Pathway enrichment results showed that upregulated DEGs were significantly implicated in insulin secretion, and downregulated DEGs were mainly enriched in pancreatic secretion. PPI network analysis revealed 7 hub genes with degrees more than 10, including GCG (glucagon), GCGR (glucagon receptor), PLCB1 (phospholipase C, beta 1), CASR (calcium sensing receptor), F2R (coagulation factor II thrombin receptor), GRM1 (glutamate metabotropic receptor 1), and GRM5 (glutamate metabotropic receptor 5). DEGs involved in the significant modules were enriched in calcium signaling pathway, protein ubiquitination, and platelet degranulation. Quantitative RT-PCR data confirmed that the expression trends of these hub genes were similar to the results of bioinformatic analysis.The present study demonstrated that candidate DEGs and enriched pathways were the potential critical molecule events involved in the development of insulinoma, and these findings were useful for better understanding of insulinoma genesis.
Principles of gene microarray data analysis.
Mocellin, Simone; Rossi, Carlo Riccardo
2007-01-01
The development of several gene expression profiling methods, such as comparative genomic hybridization (CGH), differential display, serial analysis of gene expression (SAGE), and gene microarray, together with the sequencing of the human genome, has provided an opportunity to monitor and investigate the complex cascade of molecular events leading to tumor development and progression. The availability of such large amounts of information has shifted the attention of scientists towards a nonreductionist approach to biological phenomena. High throughput technologies can be used to follow changing patterns of gene expression over time. Among them, gene microarray has become prominent because it is easier to use, does not require large-scale DNA sequencing, and allows for the parallel quantification of thousands of genes from multiple samples. Gene microarray technology is rapidly spreading worldwide and has the potential to drastically change the therapeutic approach to patients affected with tumor. Therefore, it is of paramount importance for both researchers and clinicians to know the principles underlying the analysis of the huge amount of data generated with microarray technology.
Howat, William J; Blows, Fiona M; Provenzano, Elena; Brook, Mark N; Morris, Lorna; Gazinska, Patrycja; Johnson, Nicola; McDuffus, Leigh‐Anne; Miller, Jodi; Sawyer, Elinor J; Pinder, Sarah; van Deurzen, Carolien H M; Jones, Louise; Sironen, Reijo; Visscher, Daniel; Caldas, Carlos; Daley, Frances; Coulson, Penny; Broeks, Annegien; Sanders, Joyce; Wesseling, Jelle; Nevanlinna, Heli; Fagerholm, Rainer; Blomqvist, Carl; Heikkilä, Päivi; Ali, H Raza; Dawson, Sarah‐Jane; Figueroa, Jonine; Lissowska, Jolanta; Brinton, Louise; Mannermaa, Arto; Kataja, Vesa; Kosma, Veli‐Matti; Cox, Angela; Brock, Ian W; Cross, Simon S; Reed, Malcolm W; Couch, Fergus J; Olson, Janet E; Devillee, Peter; Mesker, Wilma E; Seyaneve, Caroline M; Hollestelle, Antoinette; Benitez, Javier; Perez, Jose Ignacio Arias; Menéndez, Primitiva; Bolla, Manjeet K; Easton, Douglas F; Schmidt, Marjanka K; Pharoah, Paul D; Sherman, Mark E
2014-01-01
Abstract Breast cancer risk factors and clinical outcomes vary by tumour marker expression. However, individual studies often lack the power required to assess these relationships, and large‐scale analyses are limited by the need for high throughput, standardized scoring methods. To address these limitations, we assessed whether automated image analysis of immunohistochemically stained tissue microarrays can permit rapid, standardized scoring of tumour markers from multiple studies. Tissue microarray sections prepared in nine studies containing 20 263 cores from 8267 breast cancers stained for two nuclear (oestrogen receptor, progesterone receptor), two membranous (human epidermal growth factor receptor 2 and epidermal growth factor receptor) and one cytoplasmic (cytokeratin 5/6) marker were scanned as digital images. Automated algorithms were used to score markers in tumour cells using the Ariol system. We compared automated scores against visual reads, and their associations with breast cancer survival. Approximately 65–70% of tissue microarray cores were satisfactory for scoring. Among satisfactory cores, agreement between dichotomous automated and visual scores was highest for oestrogen receptor (Kappa = 0.76), followed by human epidermal growth factor receptor 2 (Kappa = 0.69) and progesterone receptor (Kappa = 0.67). Automated quantitative scores for these markers were associated with hazard ratios for breast cancer mortality in a dose‐response manner. Considering visual scores of epidermal growth factor receptor or cytokeratin 5/6 as the reference, automated scoring achieved excellent negative predictive value (96–98%), but yielded many false positives (positive predictive value = 30–32%). For all markers, we observed substantial heterogeneity in automated scoring performance across tissue microarrays. Automated analysis is a potentially useful tool for large‐scale, quantitative scoring of immunohistochemically stained tissue microarrays available in consortia. However, continued optimization, rigorous marker‐specific quality control measures and standardization of tissue microarray designs, staining and scoring protocols is needed to enhance results. PMID:27499890
McCoy, Gary R; Touzet, Nicolas; Fleming, Gerard T A; Raine, Robin
2015-07-01
The toxic microalgal species Prymnesium parvum and Prymnesium polylepis are responsible for numerous fish kills causing economic stress on the aquaculture industry and, through the consumption of contaminated shellfish, can potentially impact on human health. Monitoring of toxic phytoplankton is traditionally carried out by light microscopy. However, molecular methods of identification and quantification are becoming more common place. This study documents the optimisation of the novel Microarrays for the Detection of Toxic Algae (MIDTAL) microarray from its initial stages to the final commercial version now available from Microbia Environnement (France). Existing oligonucleotide probes used in whole-cell fluorescent in situ hybridisation (FISH) for Prymnesium species from higher group probes to species-level probes were adapted and tested on the first-generation microarray. The combination and interaction of numerous other probes specific for a whole range of phytoplankton taxa also spotted on the chip surface caused high cross reactivity, resulting in false-positive results on the microarray. The probe sequences were extended for the subsequent second-generation microarray, and further adaptations of the hybridisation protocol and incubation temperatures significantly reduced false-positive readings from the first to the second-generation chip, thereby increasing the specificity of the MIDTAL microarray. Additional refinement of the subsequent third-generation microarray protocols with the addition of a poly-T amino linker to the 5' end of each probe further enhanced the microarray performance but also highlighted the importance of optimising RNA labelling efficiency when testing with natural seawater samples from Killary Harbour, Ireland.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Katika, Madhumohan R.; Department of Health Risk Analysis and Toxicology, Maastricht University; Netherlands Toxicogenomics Centre
Deoxynivalenol (DON) or vomitoxin is a commonly encountered type-B trichothecene mycotoxin, produced by Fusarium species predominantly found in cereals and grains. DON is known to exert toxic effects on the gastrointestinal, reproductive and neuroendocrine systems, and particularly on the immune system. Depending on dose and exposure time, it can either stimulate or suppress immune function. The main objective of this study was to obtain a deeper insight into DON-induced effects on lymphoid cells. For this, we exposed the human T-lymphocyte cell line Jurkat and human peripheral blood mononuclear cells (PBMCs) to various concentrations of DON for various times and examinedmore » gene expression changes by DNA microarray analysis. Jurkat cells were exposed to 0.25 and 0.5 μM DON for 3, 6 and 24 h. Biological interpretation of the microarray data indicated that DON affects various processes in these cells: It upregulates genes involved in ribosome structure and function, RNA/protein synthesis and processing, endoplasmic reticulum (ER) stress, calcium-mediated signaling, mitochondrial function, oxidative stress, the NFAT and NF-κB/TNF-α pathways, T cell activation and apoptosis. The effects of DON on the expression of genes involved in ER stress, NFAT activation and apoptosis were confirmed by qRT-PCR. Other biochemical experiments confirmed that DON activates calcium-dependent proteins such as calcineurin and M-calpain that are known to be involved in T cell activation and apoptosis. Induction of T cell activation was also confirmed by demonstrating that DON activates NFATC1 and induces its translocation from the cytoplasm to the nucleus. For the gene expression profiling of PBMCs, cells were exposed to 2 and 4 μM DON for 6 and 24 h. Comparison of the Jurkat microarray data with those obtained with PBMCs showed that most of the processes affected by DON in the Jurkat cell line were also affected in the PBMCs. -- Highlights: ► The human T cell line Jurkat and human PBMCs were exposed to DON. ► Whole-genome microarray experiments were performed. ► Microarray data indicates that DON affects ribosome and RNA/protein synthesis. ► DON treatment induces ER stress, calcium mediated signaling, NFAT and NF-κB. ► Exposure to DON induces T cell activation, oxidative stress and apoptosis.« less
Campos, Bruno; Fletcher, Danielle; Piña, Benjamín; Tauler, Romà; Barata, Carlos
2018-05-18
Unravelling the link between genes and environment across the life cycle is a challenging goal that requires model organisms with well-characterized life-cycles, ecological interactions in nature, tractability in the laboratory, and available genomic tools. Very few well-studied invertebrate model species meet these requirements, being the waterflea Daphnia magna one of them. Here we report a full genome transcription profiling of D. magna during its life-cycle. The study was performed using a new microarray platform designed from the complete set of gene models representing the whole transcribed genome of D. magna. Up to 93% of the existing 41,317 D. magna gene models showed differential transcription patterns across the developmental stages of D. magna, 59% of which were functionally annotated. Embryos showed the highest number of unique transcribed genes, mainly related to DNA, RNA, and ribosome biogenesis, likely related to cellular proliferation and morphogenesis of the several body organs. Adult females showed an enrichment of transcripts for genes involved in reproductive processes. These female-specific transcripts were essentially absent in males, whose transcriptome was enriched in specific genes of male sexual differentiation genes, like doublesex. Our results define major characteristics of transcriptional programs involved in the life-cycle, differentiate males and females, and show that large scale gene-transcription data collected in whole animals can be used to identify genes involved in specific biological and biochemical processes.
WholePathwayScope: a comprehensive pathway-based analysis tool for high-throughput data
Yi, Ming; Horton, Jay D; Cohen, Jonathan C; Hobbs, Helen H; Stephens, Robert M
2006-01-01
Background Analysis of High Throughput (HTP) Data such as microarray and proteomics data has provided a powerful methodology to study patterns of gene regulation at genome scale. A major unresolved problem in the post-genomic era is to assemble the large amounts of data generated into a meaningful biological context. We have developed a comprehensive software tool, WholePathwayScope (WPS), for deriving biological insights from analysis of HTP data. Result WPS extracts gene lists with shared biological themes through color cue templates. WPS statistically evaluates global functional category enrichment of gene lists and pathway-level pattern enrichment of data. WPS incorporates well-known biological pathways from KEGG (Kyoto Encyclopedia of Genes and Genomes) and Biocarta, GO (Gene Ontology) terms as well as user-defined pathways or relevant gene clusters or groups, and explores gene-term relationships within the derived gene-term association networks (GTANs). WPS simultaneously compares multiple datasets within biological contexts either as pathways or as association networks. WPS also integrates Genetic Association Database and Partial MedGene Database for disease-association information. We have used this program to analyze and compare microarray and proteomics datasets derived from a variety of biological systems. Application examples demonstrated the capacity of WPS to significantly facilitate the analysis of HTP data for integrative discovery. Conclusion This tool represents a pathway-based platform for discovery integration to maximize analysis power. The tool is freely available at . PMID:16423281
Woo, Sangsoon; Gao, Hong; Henderson, David; Zacharias, Wolfgang; Liu, Gang; Tran, Quynh T; Prasad, G L
2017-05-03
Smoking has been established as a major risk factor for developing oral squamous cell carcinoma (OSCC), but less attention has been paid to the effects of smokeless tobacco products. Our objective is to identify potential biomarkers to distinguish the biological effects of combustible tobacco products from those of non-combustible ones using oral cell lines. Normal human gingival epithelial cells (HGEC), non-metastatic (101A) and metastatic (101B) OSCC cell lines were exposed to different tobacco product preparations (TPPs) including cigarette smoke total particulate matter (TPM), whole-smoke conditioned media (WS-CM), smokeless tobacco extract in complete artificial saliva (STE), or nicotine (NIC) alone. We performed microarray-based gene expression profiling and found 3456 probe sets from 101A, 1432 probe sets from 101B, and 2717 probe sets from HGEC to be differentially expressed. Gene Set Enrichment Analysis (GSEA) revealed xenobiotic metabolism and steroid biosynthesis were the top two pathways that were upregulated by combustible but not by non-combustible TPPs. Notably, aldo-keto reductase genes, AKR1C1 and AKR1C2 , were the core genes in the top enriched pathways and were statistically upregulated more than eight-fold by combustible TPPs. Quantitative real time polymerase chain reaction (qRT-PCR) results statistically support AKR1C1 as a potential biomarker for differentiating the biological effects of combustible from non-combustible tobacco products.
Woo, Sangsoon; Gao, Hong; Henderson, David; Zacharias, Wolfgang; Liu, Gang; Tran, Quynh T.; Prasad, G.L.
2017-01-01
Smoking has been established as a major risk factor for developing oral squamous cell carcinoma (OSCC), but less attention has been paid to the effects of smokeless tobacco products. Our objective is to identify potential biomarkers to distinguish the biological effects of combustible tobacco products from those of non-combustible ones using oral cell lines. Normal human gingival epithelial cells (HGEC), non-metastatic (101A) and metastatic (101B) OSCC cell lines were exposed to different tobacco product preparations (TPPs) including cigarette smoke total particulate matter (TPM), whole-smoke conditioned media (WS-CM), smokeless tobacco extract in complete artificial saliva (STE), or nicotine (NIC) alone. We performed microarray-based gene expression profiling and found 3456 probe sets from 101A, 1432 probe sets from 101B, and 2717 probe sets from HGEC to be differentially expressed. Gene Set Enrichment Analysis (GSEA) revealed xenobiotic metabolism and steroid biosynthesis were the top two pathways that were upregulated by combustible but not by non-combustible TPPs. Notably, aldo-keto reductase genes, AKR1C1 and AKR1C2, were the core genes in the top enriched pathways and were statistically upregulated more than eight-fold by combustible TPPs. Quantitative real time polymerase chain reaction (qRT-PCR) results statistically support AKR1C1 as a potential biomarker for differentiating the biological effects of combustible from non-combustible tobacco products. PMID:28467356
PExFInS: An Integrative Post-GWAS Explorer for Functional Indels and SNPs
Cheng, Zhongshan; Chu, Hin; Fan, Yanhui; Li, Cun; Song, You-Qiang; Zhou, Jie; Yuen, Kwok-Yung
2015-01-01
Expression quantitative trait loci (eQTLs) mapping and linkage disequilibrium (LD) analysis have been widely employed to interpret findings of genome-wide association studies (GWAS). With the availability of deep sequencing data of 423 lymphoblastoid cell lines (LCLs) from six global populations and the microarray expression data, we performed eQTL analysis, identified more than 228 K SNP cis-eQTLs and 21 K indel cis-eQTLs and generated a LCL cis-eQTL database. We demonstrate that the percentages of population-shared and population-specific cis-eQTLs are comparable; while indel cis-eQTLs in the population-specific subsection make more contribution to gene expression variations than those in the population-shared subsection. We found cis-eQTLs, especially the population-shared cis-eQTLs are significantly enriched toward transcription start site. Moreover, the National Human Genome Research Institute cataloged GWAS SNPs are enriched for LCL cis-eQTLs. Specifically, 32.8% GWAS SNPs are LCL cis-eQTLs, among which 12.5% can be tagged by indel cis-eQTLs, suggesting the fundamental contribution of indel cis-eQTLs to GWAS association signals. To search for functional indels and SNPs tagging GWAS SNPs, a pipeline Post-GWAS Explorer for Functional Indels and SNPs (PExFInS) has been developed, integrating LD analysis, functional annotation from public databases, cis-eQTL mapping with our LCL cis-eQTL database and other published cis-eQTL datasets. PMID:26612672
Gao, Jing; Li, Yuhong; Wang, Tongmei; Shi, Zhuo; Zhang, Yiqi; Liu, Shuang; Wen, Pushuai; Ma, Chunyan
2018-03-06
The aim of this study was to identify the key genes involved in the cardiac hypertrophy (CH) induced by pressure overload. mRNA microarray dataset GSE5500 and GSE18801 were downloaded from GEO database, and differentially expressed genes (DEGs) were screened using Limma package; then, functional and pathway enrichment analysis were performed for common DEGs using DAVID database. Furthermore, the top DEGs were further validated using qPCR in the hypertrophic heart tissue induced by Isoprenaline (ISO). A total of 113 common DEGs with absolute fold change >0.5, including 60 significantly up-regulated DEGs and 53 down-regulated DEGs were obtained. GO term enrichment analysis suggested that common up-regulated DEG mainly enriched in neutrophil chemotaxis, extracellular fibril organization and cell proliferation, and the common down-regulated genes were significantly enriched in ion transport, endoplasmic reticulum and dendritic spine. KEGG pathway analysis found that the common DEGs were mainly enriched in ECM-receptor interaction, phagosome, and focal adhesion. Additionally, the expression of Mfap4, Ltbp2, Aspn, Serpina3n, and Cnksr1 were up-regulated in the model of cardiac hypertrophy, while the expression of Anp32a was down-regulated. The current study identified the key deregulated genes and pathways involved in the CH, which could shed new light to understand the mechanism of CH.
Tissue-Specific Transcriptomic Profiling of Sorghum propinquum using a Rice Genome Array
Zhang, Ting; Zhao, Xiuqin; Huang, Liyu; Liu, Xiaoyue; Zong, Ying; Zhu, Linghua; Yang, Daichang; Fu, Binying
2013-01-01
Sorghum (Sorghum bicolor) is one of the world's most important cereal crops. S. propinquum is a perennial wild relative of S. bicolor with well-developed rhizomes. Functional genomics analysis of S. propinquum, especially with respect to molecular mechanisms related to rhizome growth and development, can contribute to the development of more sustainable grain, forage, and bioenergy cropping systems. In this study, we used a whole rice genome oligonucleotide microarray to obtain tissue-specific gene expression profiles of S. propinquum with special emphasis on rhizome development. A total of 548 tissue-enriched genes were detected, including 31 and 114 unique genes that were expressed predominantly in the rhizome tips (RT) and internodes (RI), respectively. Further GO analysis indicated that the functions of these tissue-enriched genes corresponded to their characteristic biological processes. A few distinct cis-elements, including ABA-responsive RY repeat CATGCA, sugar-repressive TTATCC, and GA-responsive TAACAA, were found to be prevalent in RT-enriched genes, implying an important role in rhizome growth and development. Comprehensive comparative analysis of these rhizome-enriched genes and rhizome-specific genes previously identified in Oryza longistaminata and S. propinquum indicated that phytohormones, including ABA, GA, and SA, are key regulators of gene expression during rhizome development. Co-localization of rhizome-enriched genes with rhizome-related QTLs in rice and sorghum generated functional candidates for future cloning of genes associated with rhizome growth and development. PMID:23536906
Emerging Use of Gene Expression Microarrays in Plant Physiology
Wullschleger, Stan D.; Difazio, Stephen P.
2003-01-01
Microarrays have become an important technology for the global analysis of gene expression in humans, animals, plants, and microbes. Implemented in the context of a well-designed experiment, cDNA and oligonucleotide arrays can provide highthroughput, simultaneous analysis of transcript abundance for hundreds, if not thousands, of genes. However, despite widespread acceptance, the use of microarrays as a tool to better understand processes of interest to the plant physiologist is still being explored. To help illustrate current uses of microarrays in the plant sciences, several case studies that we believe demonstrate the emerging application of gene expression arrays in plant physiology weremore » selected from among the many posters and presentations at the 2003 Plant and Animal Genome XI Conference. Based on this survey, microarrays are being used to assess gene expression in plants exposed to the experimental manipulation of air temperature, soil water content and aluminium concentration in the root zone. Analysis often includes characterizing transcript profiles for multiple post-treatment sampling periods and categorizing genes with common patterns of response using hierarchical clustering techniques. In addition, microarrays are also providing insights into developmental changes in gene expression associated with fibre and root elongation in cotton and maize, respectively. Technical and analytical limitations of microarrays are discussed and projects attempting to advance areas of microarray design and data analysis are highlighted. Finally, although much work remains, we conclude that microarrays are a valuable tool for the plant physiologist interested in the characterization and identification of individual genes and gene families with potential application in the fields of agriculture, horticulture and forestry.« less
Wang, Hai-Tao; Kong, Jian-Ping; Ding, Fang; Wang, Xiu-Qin; Wang, Ming-Rong; Liu, Lian-Xin; Wu, Min; Liu, Zhi-Hua
2003-01-01
AIM: To obtain human esophageal cancer cell EC9706 stably expressed epithelial membrane protein-1 (EMP-1) with integrated eukaryotic plasmid harboring the open reading frame (ORF) of human EMP-1, and then to study the mechanism by which EMP-1 exerts its diverse cellular action on cell proliferation and altered gene profile by exploring the effect of EMP-1. METHODS: The authors first constructed pcDNA3.1/myc-his expression vector harboring the ORF of EMP-1 and then transfected it into human esophageal carcinoma cell line EC9706. The positive clones were analyzed by Western blot and RT-PCR. Moreover, the cell growth curve was observed and the cell cycle was checked by FACS technique. Using cDNA microarray technology, the authors compared the gene expression pattern in positive clones with control. To confirm the gene expression profile, semi-quantitative RT-PCR was carried out for 4 of the randomly picked differentially expressed genes. For those differentially expressed genes, classification was performed according to their function and cellular component. RESULTS: Human EMP-1 gene can be stably expressed in EC9706 cell line transfected with human EMP-1. The authors found the cell growth decreased, among which S phase was arrested and G1 phase was prolonged in the transfected positive clones. By cDNA microarray analysis, 35 genes showed an over 2.0 fold change in expression level after transfection, with 28 genes being consistently up-regulated and 7 genes being down-regulated. Among the classified genes, almost half of the induced genes (13 out of 28 genes) were related to cell signaling, cell communication and particularly to adhesion. CONCLUSION: Overexpression of human EMP-1 gene can inhibit the proliferation of EC9706 cell with S phase arrested and G1 phase prolonged. The cDNA microarray analysis suggested that EMP-1 may be one of regulators involved in cell signaling, cell communication and adhesion regulators. PMID:12632483
Wang, Hai-Tao; Kong, Jian-Ping; Ding, Fang; Wang, Xiu-Qin; Wang, Ming-Rong; Liu, Lian-Xin; Wu, Min; Liu, Zhi-Hua
2003-03-01
To obtain human esophageal cancer cell EC9706 stably expressed epithelial membrane protein-1 (EMP-1) with integrated eukaryotic plasmid harboring the open reading frame (ORF) of human EMP-1, and then to study the mechanism by which EMP-1 exerts its diverse cellular action on cell proliferation and altered gene profile by exploring the effect of EMP-1. The authors first constructed pcDNA3.1/myc-his expression vector harboring the ORF of EMP-1 and then transfected it into human esophageal carcinoma cell line EC9706. The positive clones were analyzed by Western blot and RT-PCR. Moreover, the cell growth curve was observed and the cell cycle was checked by FACS technique. Using cDNA microarray technology, the authors compared the gene expression pattern in positive clones with control. To confirm the gene expression profile, semi-quantitative RT-PCR was carried out for 4 of the randomly picked differentially expressed genes. For those differentially expressed genes, classification was performed according to their function and cellular component. Human EMP-1 gene can be stably expressed in EC9706 cell line transfected with human EMP-1. The authors found the cell growth decreased, among which S phase was arrested and G1 phase was prolonged in the transfected positive clones. By cDNA microarray analysis, 35 genes showed an over 2.0 fold change in expression level after transfection, with 28 genes being consistently up-regulated and 7 genes being down-regulated. Among the classified genes, almost half of the induced genes (13 out of 28 genes) were related to cell signaling, cell communication and particularly to adhesion. Overexpression of human EMP-1 gene can inhibit the proliferation of EC9706 cell with S phase arrested and G1 phase prolonged. The cDNA microarray analysis suggested that EMP-1 may be one of regulators involved in cell signaling, cell communication and adhesion regulators.
Quantification of differential gene expression by multiplexed targeted resequencing of cDNA
Arts, Peer; van der Raadt, Jori; van Gestel, Sebastianus H.C.; Steehouwer, Marloes; Shendure, Jay; Hoischen, Alexander; Albers, Cornelis A.
2017-01-01
Whole-transcriptome or RNA sequencing (RNA-Seq) is a powerful and versatile tool for functional analysis of different types of RNA molecules, but sample reagent and sequencing cost can be prohibitive for hypothesis-driven studies where the aim is to quantify differential expression of a limited number of genes. Here we present an approach for quantification of differential mRNA expression by targeted resequencing of complementary DNA using single-molecule molecular inversion probes (cDNA-smMIPs) that enable highly multiplexed resequencing of cDNA target regions of ∼100 nucleotides and counting of individual molecules. We show that accurate estimates of differential expression can be obtained from molecule counts for hundreds of smMIPs per reaction and that smMIPs are also suitable for quantification of relative gene expression and allele-specific expression. Compared with low-coverage RNA-Seq and a hybridization-based targeted RNA-Seq method, cDNA-smMIPs are a cost-effective high-throughput tool for hypothesis-driven expression analysis in large numbers of genes (10 to 500) and samples (hundreds to thousands). PMID:28474677
Evaluating information content of SNPs for sample-tagging in re-sequencing projects.
Hu, Hao; Liu, Xiang; Jin, Wenfei; Hilger Ropers, H; Wienker, Thomas F
2015-05-15
Sample-tagging is designed for identification of accidental sample mix-up, which is a major issue in re-sequencing studies. In this work, we develop a model to measure the information content of SNPs, so that we can optimize a panel of SNPs that approach the maximal information for discrimination. The analysis shows that as low as 60 optimized SNPs can differentiate the individuals in a population as large as the present world, and only 30 optimized SNPs are in practice sufficient in labeling up to 100 thousand individuals. In the simulated populations of 100 thousand individuals, the average Hamming distances, generated by the optimized set of 30 SNPs are larger than 18, and the duality frequency, is lower than 1 in 10 thousand. This strategy of sample discrimination is proved robust in large sample size and different datasets. The optimized sets of SNPs are designed for Whole Exome Sequencing, and a program is provided for SNP selection, allowing for customized SNP numbers and interested genes. The sample-tagging plan based on this framework will improve re-sequencing projects in terms of reliability and cost-effectiveness.
Hill, Matthew J; Killick, Richard; Navarrete, Katherinne; Maruszak, Aleksandra; McLaughlin, Gemma M; Williams, Brenda P; Bray, Nicholas J
2017-05-01
Common variants in the TCF4 gene are among the most robustly supported genetic risk factors for schizophrenia. Rare TCF4 deletions and loss-of-function point mutations cause Pitt-Hopkins syndrome, a developmental disorder associated with severe intellectual disability. To explore molecular and cellular mechanisms by which TCF4 perturbation could interfere with human cortical development, we experimentally reduced the endogenous expression of TCF4 in a neural progenitor cell line derived from the developing human cerebral cortex using RNA interference. Effects on genome-wide gene expression were assessed by microarray, followed by Gene Ontology and pathway analysis of differentially expressed genes. We tested for genetic association between the set of differentially expressed genes and schizophrenia using genome-wide association study data from the Psychiatric Genomics Consortium and competitive gene set analysis (MAGMA). Effects on cell proliferation were assessed using high content imaging. Genes that were differentially expressed following TCF4 knockdown were highly enriched for involvement in the cell cycle. There was a nonsignificant trend for genetic association between the differentially expressed gene set and schizophrenia. Consistent with the gene expression data, TCF4 knockdown was associated with reduced proliferation of cortical progenitor cells in vitro. A detailed mechanistic explanation of how TCF4 knockdown alters human neural progenitor cell proliferation is not provided by this study. Our data indicate effects of TCF4 perturbation on human cortical progenitor cell proliferation, a process that could contribute to cognitive deficits in individuals with Pitt-Hopkins syndrome and risk for schizophrenia.
Gluck, Christian; Min, Sangwon; Oyelakin, Akinsola; Smalley, Kirsten; Sinha, Satrajit; Romano, Rose-Anne
2016-11-16
Mouse models have served a valuable role in deciphering various facets of Salivary Gland (SG) biology, from normal developmental programs to diseased states. To facilitate such studies, gene expression profiling maps have been generated for various stages of SG organogenesis. However these prior studies fall short of capturing the transcriptional complexity due to the limited scope of gene-centric microarray-based technology. Compared to microarray, RNA-sequencing (RNA-seq) offers unbiased detection of novel transcripts, broader dynamic range and high specificity and sensitivity for detection of genes, transcripts, and differential gene expression. Although RNA-seq data, particularly under the auspices of the ENCODE project, have covered a large number of biological specimens, studies on the SG have been lacking. To better appreciate the wide spectrum of gene expression profiles, we isolated RNA from mouse submandibular salivary glands at different embryonic and adult stages. In parallel, we processed RNA-seq data for 24 organs and tissues obtained from the mouse ENCODE consortium and calculated the average gene expression values. To identify molecular players and pathways likely to be relevant for SG biology, we performed functional gene enrichment analysis, network construction and hierarchal clustering of the RNA-seq datasets obtained from different stages of SG development and maturation, and other mouse organs and tissues. Our bioinformatics-based data analysis not only reaffirmed known modulators of SG morphogenesis but revealed novel transcription factors and signaling pathways unique to mouse SG biology and function. Finally we demonstrated that the unique SG gene signature obtained from our mouse studies is also well conserved and can demarcate features of the human SG transcriptome that is different from other tissues. Our RNA-seq based Atlas has revealed a high-resolution cartographic view of the dynamic transcriptomic landscape of the mouse SG at various stages. These RNA-seq datasets will complement pre-existing microarray based datasets, including the Salivary Gland Molecular Anatomy Project by offering a broader systems-biology based perspective rather than the classical gene-centric view. Ultimately such resources will be valuable in providing a useful toolkit to better understand how the diverse cell population of the SG are organized and controlled during development and differentiation.
Qiu, Qiang; Wang, Lizhong; Wang, Kun; Yang, Yongzhi; Ma, Tao; Wang, Zefu; Zhang, Xiao; Ni, Zhengqiang; Hou, Fujiang; Long, Ruijun; Abbott, Richard; Lenstra, Johannes; Liu, Jianquan
2015-12-22
Yak domestication represents an important episode in the early human occupation of the high-altitude Qinghai-Tibet Plateau (QTP). The precise timing of domestication is debated and little is known about the underlying genetic changes that occurred during the process. Here we investigate genome variation of wild and domestic yaks. We detect signals of selection in 209 genes of domestic yaks, several of which relate to behaviour and tameness. We date yak domestication to 7,300 years before present (yr BP), most likely by nomadic people, and an estimated sixfold increase in yak population size by 3,600 yr BP. These dates coincide with two early human population expansions on the QTP during the early-Neolithic age and the late-Holocene, respectively. Our findings add to an understanding of yak domestication and its importance in the early human occupation of the QTP.
Yak whole-genome resequencing reveals domestication signatures and prehistoric population expansions
Qiu, Qiang; Wang, Lizhong; Wang, Kun; Yang, Yongzhi; Ma, Tao; Wang, Zefu; Zhang, Xiao; Ni, Zhengqiang; Hou, Fujiang; Long, Ruijun; Abbott, Richard; Lenstra, Johannes; Liu, Jianquan
2015-01-01
Yak domestication represents an important episode in the early human occupation of the high-altitude Qinghai-Tibet Plateau (QTP). The precise timing of domestication is debated and little is known about the underlying genetic changes that occurred during the process. Here we investigate genome variation of wild and domestic yaks. We detect signals of selection in 209 genes of domestic yaks, several of which relate to behaviour and tameness. We date yak domestication to 7,300 years before present (yr BP), most likely by nomadic people, and an estimated sixfold increase in yak population size by 3,600 yr BP. These dates coincide with two early human population expansions on the QTP during the early-Neolithic age and the late-Holocene, respectively. Our findings add to an understanding of yak domestication and its importance in the early human occupation of the QTP. PMID:26691338
Lo, Miranda; Cordwell, Stuart J; Bulach, Dieter M; Adler, Ben
2009-12-08
Leptospirosis is a global zoonosis affecting millions of people annually. Transcriptional changes in response to temperature were previously investigated using microarrays to identify genes potentially expressed upon host entry. Past studies found that various leptospiral outer membrane proteins are differentially expressed at different temperatures. However, our microarray studies highlighted a divergence between protein abundance and transcript levels for some proteins. Given the abundance of post-transcriptional expression control mechanisms, this finding highlighted the importance of global protein analysis systems. To complement our previous transcription study, we evaluated differences in the proteins of the leptospiral outer membrane fraction in response to temperature upshift. Outer membrane protein-enriched fractions from Leptospira interrogans grown at 30 degrees C or overnight upshift to 37 degrees C were isolated and the relative abundance of each protein was determined by iTRAQ analysis coupled with two-dimensional liquid chromatography and tandem mass spectrometry (2-DLC/MS-MS). We identified 1026 proteins with 99% confidence; 27 and 66 were present at elevated and reduced abundance respectively. Protein abundance changes were compared with transcriptional differences determined from the microarray studies. While there was some correlation between the microarray and iTRAQ data, a subset of genes that showed no differential expression by microarray was found to encode temperature-regulated proteins. This set of genes is of particular interest as it is likely that regulation of their expression occurs post-transcriptionally, providing an opportunity to develop hypotheses about the molecular dynamics of the outer membrane of Leptospira in response to changing environments. This is the first study to compare transcriptional and translational responses to temperature shift in L. interrogans. The results thus provide an insight into the mechanisms used by L. interrogans to adapt to conditions encountered in the host and to cause disease. Our results suggest down-regulation of protein expression in response to temperature, and decreased expression of outer membrane proteins may facilitate minimal interaction with host immune mechanisms.
Expression profiling and pathway analysis of Krüppel-like factor 4 in mouse embryonic fibroblasts
Hagos, Engda G; Ghaleb, Amr M; Kumar, Amrita; Neish, Andrew S; Yang, Vincent W
2011-01-01
Background: Krüppel-like factor 4 (KLF4) is a zinc-finger transcription factor with diverse regulatory functions in proliferation, differentiation, and development. KLF4 also plays a role in inflammation, tumorigenesis, and reprogramming of somatic cells to induced pluripotent stem (iPS) cells. To gain insight into the mechanisms by which KLF4 regulates these processes, we conducted DNA microarray analyses to identify differentially expressed genes in mouse embryonic fibroblasts (MEFs) wild type and null for Klf4. Methods: Expression profiles of fibroblasts isolated from mouse embryos wild type or null for the Klf4 alleles were examined by DNA microarrays. Differentially expressed genes were subjected to the Database for Annotation, Visualization and Integrated Discovery (DAVID). The microarray data were also interrogated with the Ingenuity Pathway Analysis (IPA) and Gene Set Enrichment Analysis (GSEA) for pathway identification. Results obtained from the microarray analysis were confirmed by Western blotting for select genes with biological relevance to determine the correlation between mRNA and protein levels. Results: One hundred and sixty three up-regulated and 88 down-regulated genes were identified that demonstrated a fold-change of at least 1.5 and a P-value < 0.05 in Klf4-null MEFs compared to wild type MEFs. Many of the up-regulated genes in Klf4-null MEFs encode proto-oncogenes, growth factors, extracellular matrix, and cell cycle activators. In contrast, genes encoding tumor suppressors and those involved in JAK-STAT signaling pathways are down-regulated in Klf4-null MEFs. IPA and GSEA also identified various pathways that are regulated by KLF4. Lastly, Western blotting of select target genes confirmed the changes revealed by microarray data. Conclusions: These data are not only consistent with previous functional studies of KLF4's role in tumor suppression and somatic cell reprogramming, but also revealed novel target genes that mediate KLF4's functions. PMID:21892412
Microarray-based screening of heat shock protein inhibitors.
Schax, Emilia; Walter, Johanna-Gabriela; Märzhäuser, Helene; Stahl, Frank; Scheper, Thomas; Agard, David A; Eichner, Simone; Kirschning, Andreas; Zeilinger, Carsten
2014-06-20
Based on the importance of heat shock proteins (HSPs) in diseases such as cancer, Alzheimer's disease or malaria, inhibitors of these chaperons are needed. Today's state-of-the-art techniques to identify HSP inhibitors are performed in microplate format, requiring large amounts of proteins and potential inhibitors. In contrast, we have developed a miniaturized protein microarray-based assay to identify novel inhibitors, allowing analysis with 300 pmol of protein. The assay is based on competitive binding of fluorescence-labeled ATP and potential inhibitors to the ATP-binding site of HSP. Therefore, the developed microarray enables the parallel analysis of different ATP-binding proteins on a single microarray. We have demonstrated the possibility of multiplexing by immobilizing full-length human HSP90α and HtpG of Helicobacter pylori on microarrays. Fluorescence-labeled ATP was competed by novel geldanamycin/reblastatin derivatives with IC50 values in the range of 0.5 nM to 4 μM and Z(*)-factors between 0.60 and 0.96. Our results demonstrate the potential of a target-oriented multiplexed protein microarray to identify novel inhibitors for different members of the HSP90 family. Copyright © 2014 Elsevier B.V. All rights reserved.
Xu, Zhen-Hua; Thomae, Bianca A; Eckloff, Bruce W; Wieben, Eric D; Weinshilboum, Richard M
2003-06-01
3'-Phosphoadenosine 5'-phosphosulfate (PAPS) is the high-energy "sulfate donor" for reactions catalyzed by sulfotransferase (SULT) enzymes. The strict requirement of SULTs for PAPS suggests that PAPS synthesis might influence the rate of sulfate conjugation. In humans, PAPS is synthesized from ATP and SO(4)(2-) by two isoforms of PAPS synthetase (PAPSS): PAPSS1 and PAPSS2. As a step toward pharmacogenetic studies, we have resequenced the entire coding sequence of the human PAPSS1 gene, including exon-intron splice junctions, using DNA samples from 60 Caucasian-American and 58 African-American subjects. Twenty-one genetic polymorphisms were observed-1 insertion-deletion event and 20 single nucleotide polymorphisms (SNPs)-including two non-synonymous coding SNPs (cSNPs) that altered the following amino acids: Arg333Cys and Glu531Gln. Twelve pairs of these polymorphisms were tightly linked, and a total of twelve unequivocal haplotypes could be identified-two that were common to both ethnic groups and ten that were ethnic-specific. The Arg333Cys polymorphism, with an allele frequency of 2.5%, was observed only in DNA samples from Caucasian subjects. The Glu531Gln polymorphism was rare, with only a single copy of that allele in a DNA sample from an African-American subject. Transient expression in mammalian cells showed that neither of the non-synonymous cSNPs resulted in a change in the basal level of enzyme activity measured under optimal assay conditions. However, the Glu531Gln polymorphism altered the substrate kinetic properties of the enzyme. The Gln531 variant allozyme had a 5-fold higher K(m) value for SO(4)(2-) than did the wild-type allozyme and displayed monophasic kinetics for Na(2)SO(4). The wild-type allozyme (Glu531) showed biphasic kinetics for that substrate. These observations represent a step toward testing the hypothesis that genetic variation in PAPS synthesis catalyzed by PAPSS1 might alter in vivo sulfate conjugation.
Bonfiglio, Silvia; Vanni, Irene; Rossella, Valeria; Truini, Anna; Lazarevic, Dejan; Dal Bello, Maria Giovanna; Alama, Angela; Mora, Marco; Rijavec, Erika; Genova, Carlo; Cittaro, Davide; Grossi, Francesco; Coco, Simona
2016-08-30
Next Generation Sequencing (NGS) has become a valuable tool for molecular landscape characterization of cancer genomes, leading to a better understanding of tumor onset and progression, and opening new avenues in translational oncology. Formalin-fixed paraffin-embedded (FFPE) tissue is the method of choice for storage of clinical samples, however low quality of FFPE genomic DNA (gDNA) can limit its use for downstream applications. To investigate the FFPE specimen suitability for NGS analysis and to establish the performance of two solution-based exome capture technologies, we compared the whole-exome sequencing (WES) data of gDNA extracted from 5 fresh frozen (FF) and 5 matched FFPE lung adenocarcinoma tissues using: SeqCap EZ Human Exome v.3.0 (Roche NimbleGen) and SureSelect XT Human All Exon v.5 (Agilent Technologies). Sequencing metrics on Illumina HiSeq were optimal for both exome systems and comparable among FFPE and FF samples, with a slight increase of PCR duplicates in FFPE, mainly in Roche NimbleGen libraries. Comparison of single nucleotide variants (SNVs) between FFPE-FF pairs reached overlapping values >90 % in both systems. Both WES showed high concordance with target re-sequencing data by Ion PGM™ in 22 lung-cancer genes, regardless the source of samples. Exon coverage of 623 cancer-related genes revealed high coverage efficiency of both kits, proposing WES as a valid alternative to target re-sequencing. High-quality and reliable data can be successfully obtained from WES of FFPE samples starting from a relatively low amount of input gDNA, suggesting the inclusion of NGS-based tests into clinical contest. In conclusion, our analysis suggests that the WES approach could be extended to a translational research context as well as to the clinic (e.g. to study rare malignancies), where the simultaneous analysis of the whole coding region of the genome may help in the detection of cancer-linked variants.
Geue, Lutz; Stieber, Bettina; Monecke, Stefan; Engelmann, Ines; Gunzer, Florian; Slickers, Peter; Braun, Sascha D; Ehricht, Ralf
2014-08-01
In this study, we developed a new rapid, economic, and automated microarray-based genotyping test for the standardized subtyping of Shiga toxins 1 and 2 of Escherichia coli. The microarrays from Alere Technologies can be used in two different formats, the ArrayTube and the ArrayStrip (which enables high-throughput testing in a 96-well format). One microarray chip harbors all the gene sequences necessary to distinguish between all Stx subtypes, facilitating the identification of single and multiple subtypes within a single isolate in one experiment. Specific software was developed to automatically analyze all data obtained from the microarray. The assay was validated with 21 Shiga toxin-producing E. coli (STEC) reference strains that were previously tested by the complete set of conventional subtyping PCRs. The microarray results showed 100% concordance with the PCR results. Essentially identical results were detected when the standard DNA extraction method was replaced by a time-saving heat lysis protocol. For further validation of the microarray, we identified the Stx subtypes or combinations of the subtypes in 446 STEC field isolates of human and animal origin. In summary, this oligonucleotide array represents an excellent diagnostic tool that provides some advantages over standard PCR-based subtyping. The number of the spotted probes on the microarrays can be increased by additional probes, such as for novel alleles, species markers, or resistance genes, should the need arise. Copyright © 2014, American Society for Microbiology. All Rights Reserved.
Australian wild rice reveals pre-domestication origin of polymorphism deserts in rice genome.
Krishnan S, Gopala; Waters, Daniel L E; Henry, Robert J
2014-01-01
Rice is a major source of human food with a predominantly Asian production base. Domestication involved selection of traits that are desirable for agriculture and to human consumers. Wild relatives of crop plants are a source of useful variation which is of immense value for crop improvement. Australian wild rices have been isolated from the impacts of domestication in Asia and represents a source of novel diversity for global rice improvement. Oryza rufipogon is a perennial wild progenitor of cultivated rice. Oryza meridionalis is a related annual species in Australia. We have examined the sequence of the genomes of AA genome wild rices from Australia that are close relatives of cultivated rice through whole genome re-sequencing. Assembly of the resequencing data to the O. sativa ssp. japonica cv. Nipponbare shows that Australian wild rices possess 2.5 times more single nucleotide polymorphisms than in the Asian wild rice and cultivated O. sativa ssp. indica. Analysis of the genome of domesticated rice reveals regions of low diversity that show very little variation (polymorphism deserts). Both the perennial and annual wild rice from Australia show a high degree of conservation of sequence with that found in cultivated rice in the same 4.58 Mbp region on chromosome 5, which suggests that some of the 'polymorphism deserts' in this and other parts of the rice genome may have originated prior to domestication due to natural selection. Analysis of genes in the 'polymorphism deserts' indicates that this selection may have been due to biotic or abiotic stress in the environment of early rice relatives. Despite having closely related sequences in these genome regions, the Australian wild populations represent an invaluable source of diversity supporting rice food security.
Identification of ALK as the Major Familial Neuroblastoma Predisposition Gene
Mossë, Yalë P; Laudenslager, Marci; Longo, Luca; Cole, Kristina A; Wood, Andrew; Attiyeh, Edward F; Laquaglia, Michael J; Sennett, Rachel; Lynch, Jill E; Perri, Patrizia; Laureys, Geneviève; Speleman, Frank; Hakonarson, Hakon; Torkamani, Ali; Schork, Nicholas J; Brodeur, Garrett M; Tonini, Gian Paolo; Rappaport, Eric; Devoto, Marcella; Maris, John M
2009-01-01
SUMMARY Survival rates for the childhood cancer neuroblastoma have not substantively improved despite dramatic escalation in chemotherapy intensity. Like most human cancers, this embryonal malignancy can be inherited, but the genetic etiology of familial and sporadically occurring neuroblastoma was largely unknown. Here we show that germline mutations in the anaplastic lymphoma kinase gene (ALK) explain the majority of hereditary neuroblastomas, and that activating mutations can also be somatically acquired. We first identified a significant linkage signal at the short arm of chromosome 2 (maximum nonparametric LOD=4.23 at rs1344063) using a whole-genome scan in neuroblastoma pedigrees. Resequencing of regional candidate genes identified three separate missense mutations in the tyrosine kinase domain of ALK (G1128A, R1192P and R1275Q) that segregated with the disease in eight separate families. Examination of 491 sporadically occurring human neuroblastoma samples showed that the ALK locus was gained in 22.8%, and highly amplified in an additional 3.3%, and that these aberrations were highly associated with death from disease (P=0.0003). Resequencing of 194 high-risk neuroblastoma samples showed somatically acquired mutations within the tyrosine kinase domain in 12.4%. Nine of the ten mutations map to critical regions of the kinase domain and were predicted to be oncogenic drivers with high probability. Mutations resulted in constitutive phosphorylation consistent with activation, and targeted knockdown of ALK mRNA resulted in profound growth inhibition of 4 of 4 cell lines harboring mutant or amplified ALK, as well as 2 of 6 wild type for ALK. Our results demonstrate that heritable mutations of ALK are the major cause of familial neuroblastoma, and that germline or acquired activation of this cell surface kinase is a tractable therapeutic target for this lethal pediatric malignancy. PMID:18724359
Majoros, William H.; Campbell, Michael S.; Holt, Carson; DeNardo, Erin K.; Ware, Doreen; Allen, Andrew S.; Yandell, Mark; Reddy, Timothy E.
2017-01-01
Abstract Motivation: The accurate interpretation of genetic variants is critical for characterizing genotype–phenotype associations. Because the effects of genetic variants can depend strongly on their local genomic context, accurate genome annotations are essential. Furthermore, as some variants have the potential to disrupt or alter gene structure, variant interpretation efforts stand to gain from the use of individualized annotations that account for differences in gene structure between individuals or strains. Results: We describe a suite of software tools for identifying possible functional changes in gene structure that may result from sequence variants. ACE (‘Assessing Changes to Exons’) converts phased genotype calls to a collection of explicit haplotype sequences, maps transcript annotations onto them, detects gene-structure changes and their possible repercussions, and identifies several classes of possible loss of function. Novel transcripts predicted by ACE are commonly supported by spliced RNA-seq reads, and can be used to improve read alignment and transcript quantification when an individual-specific genome sequence is available. Using publicly available RNA-seq data, we show that ACE predictions confirm earlier results regarding the quantitative effects of nonsense-mediated decay, and we show that predicted loss-of-function events are highly concordant with patterns of intolerance to mutations across the human population. ACE can be readily applied to diverse species including animals and plants, making it a broadly useful tool for use in eukaryotic population-based resequencing projects, particularly for assessing the joint impact of all variants at a locus. Availability and Implementation: ACE is written in open-source C ++ and Perl and is available from geneprediction.org/ACE Contact: myandell@genetics.utah.edu or tim.reddy@duke.edu Supplementary information: Supplementary information is available at Bioinformatics online. PMID:28011790
Majoros, William H; Campbell, Michael S; Holt, Carson; DeNardo, Erin K; Ware, Doreen; Allen, Andrew S; Yandell, Mark; Reddy, Timothy E
2017-05-15
The accurate interpretation of genetic variants is critical for characterizing genotype-phenotype associations. Because the effects of genetic variants can depend strongly on their local genomic context, accurate genome annotations are essential. Furthermore, as some variants have the potential to disrupt or alter gene structure, variant interpretation efforts stand to gain from the use of individualized annotations that account for differences in gene structure between individuals or strains. We describe a suite of software tools for identifying possible functional changes in gene structure that may result from sequence variants. ACE ('Assessing Changes to Exons') converts phased genotype calls to a collection of explicit haplotype sequences, maps transcript annotations onto them, detects gene-structure changes and their possible repercussions, and identifies several classes of possible loss of function. Novel transcripts predicted by ACE are commonly supported by spliced RNA-seq reads, and can be used to improve read alignment and transcript quantification when an individual-specific genome sequence is available. Using publicly available RNA-seq data, we show that ACE predictions confirm earlier results regarding the quantitative effects of nonsense-mediated decay, and we show that predicted loss-of-function events are highly concordant with patterns of intolerance to mutations across the human population. ACE can be readily applied to diverse species including animals and plants, making it a broadly useful tool for use in eukaryotic population-based resequencing projects, particularly for assessing the joint impact of all variants at a locus. ACE is written in open-source C ++ and Perl and is available from geneprediction.org/ACE. myandell@genetics.utah.edu or tim.reddy@duke.edu. Supplementary information is available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com
Fullerton, Stephanie M; Clark, Andrew G; Weiss, Kenneth M; Taylor, Scott L; Stengård, Jari H; Salomaa, Veikko; Boerwinkle, Eric; Nickerson, Deborah A
2002-07-01
A 3.3-kb region, encompassing the APOA2 gene and 2 kb of 5' and 3' flanking DNA, was re-sequenced in a "core" sample of 24 individuals, sampled without regard to the health from each of three populations: African-Americans from Jackson (Miss., USA), Europeans from North Karelia (Finland), and non-Hispanic European-Americans from Rochester, (Minn., USA). Fifteen variable sites were identified (14 SNPs and one multi-allelic microsatellite, all silent), and these sites segregated as 18 sequence haplotypes (or nine, if SNPs only are considered). The haplotype distribution in the core African-American sample was unusual, with a deficit of particular haplotypes compared with those found in the other two samples, and a significantly (P<0.05) low level of nucleotide diversity relative to patterns of polymorphism and divergence at other human loci. Six of the 14 SNPs, whose variation captured the haplotype structure of the core data, were then genotyped by oligonucleotide ligation assay in an additional 2183 individuals from the same three populations (n=843, n=452, and n=888, respectively). All six sites varied in each of the larger "epidemiological" samples, and together, they defined 19 SNP haplotypes, seven with relative frequencies greater than 1% in the total sample; all of these common haplotypes had been identified earlier in the core re-sequencing survey. Here also, the African-American sample showed significantly lower SNP heterozygosity and haplotype diversity than the other two samples. The deficit of polymorphism is consistent with a population-specific non-neutral increase in the relative frequency of several haplotypes in Jackson.
Analysis of Altered Micro RNA Expression Profiles in Focal Cortical Dysplasia IIB.
Li, Lin; Liu, Chang-Qing; Li, Tian-Fu; Guan, Yu-Guang; Zhou, Jian; Qi, Xue-Ling; Yang, Yu-Tao; Deng, Jia-Hui; Xu, Zhi-Qing David; Luan, Guo-Ming
2016-04-01
Focal cortical dysplasia type IIB is a commonly encountered subtype of developmental malformation of the cerebral cortex and is often associated with pharmacoresistant epilepsy. In this study, to investigate the molecular etiology of focal cortical dysplasia type IIB, the authors performed micro ribonucleic acid (RNA) microarray on surgical specimens from 5 children (2 female and 3 male, mean age was 73.4 months, range 50-112 months) diagnosed of focal cortical dysplasia type IIB and matched normal tissue adjacent to the lesion. In all, 24 micro RNAs were differentially expressed in focal cortical dysplasia type IIB, and the microarray results were validated using quantitative real-time polymerase chain reaction (PCR). Then the putative target genes of the differentially expressed micro RNAs were identified by bioinformatics analysis. Moreover, biological significance of the target genes was evaluated by investigating the pathways in which the genes were enriched, and the Hippo signaling pathway was proposed to be highly related with the pathogenesis of focal cortical dysplasia type IIB. © The Author(s) 2015.
Vidyasagar, Mathukumalli
2015-01-01
This article reviews several techniques from machine learning that can be used to study the problem of identifying a small number of features, from among tens of thousands of measured features, that can accurately predict a drug response. Prediction problems are divided into two categories: sparse classification and sparse regression. In classification, the clinical parameter to be predicted is binary, whereas in regression, the parameter is a real number. Well-known methods for both classes of problems are briefly discussed. These include the SVM (support vector machine) for classification and various algorithms such as ridge regression, LASSO (least absolute shrinkage and selection operator), and EN (elastic net) for regression. In addition, several well-established methods that do not directly fall into machine learning theory are also reviewed, including neural networks, PAM (pattern analysis for microarrays), SAM (significance analysis for microarrays), GSEA (gene set enrichment analysis), and k-means clustering. Several references indicative of the application of these methods to cancer biology are discussed.
Eyre, Catherine; Muftah, Wafa; Hiscox, Jennifer; Hunt, Julie; Kille, Peter; Boddy, Lynne; Rogers, Hilary J
2010-08-01
Trametes versicolor is an important white rot fungus of both industrial and ecological interest. Saprotrophic basidiomycetes are the major decomposition agents in woodland ecosystems, and rarely form monospecific populations, therefore interspecific mycelial interactions continually occur. Interactions have different outcomes including replacement of one species by the other or deadlock. We have made subtractive cDNA libraries to enrich for genes that are expressed when T. versicolor interacts with another saprotrophic basidiomycete, Stereum gausapatum, an interaction that results in the replacement of the latter. Expressed sequence tags (ESTs) (1920) were used for microarray analysis, and their expression compared during interaction with three different fungi: S. gausapatum (replaced by T. versicolor), Bjerkandera adusta (deadlock) and Hypholoma fasciculare (replaced T. versicolor). Expression of significantly more probes changed in the interaction between T. versicolor and S. gausapatum or B. adusta compared to H. fasciculare, suggesting a relationship between interaction outcome and changes in gene expression. Copyright © 2010 The British Mycological Society. Published by Elsevier Ltd. All rights reserved.
HIV-1 gp140 epitope recognition is influenced by immunoglobulin DH gene segment sequence
Wang, Yuge; Kapoor, Pratibha; Parks, Robert; Silva-Sanchez, Aaron; Alam, S. Munir; Verkoczy, Laurent; Liao, Hua-Xin; Zhuang, Yingxin; Burrows, Peter; Levinson, Michael; Elgavish, Ada; Cui, Xiangqin; Haynes, Barton F.; Schroeder, Harry
2015-01-01
Complementarity determining region 3 of the immunoglobulin (Ig) H chain (CDR-H3) lies at the center of the antigen binding site where it often plays a decisive role in antigen recognition and binding. Amino acids encoded by the diversity (DH) gene segment are the main component of CDR-H3. Each DH has the potential to rearrange into one of six DH reading frames (RFs), each of which exhibits a characteristic amino acid hydrophobicity signature that has been conserved among jawed vertebrates by natural selection. A preference for use of RF1 promotes the incorporation of tyrosine into CDR-H3 while suppressing the inclusion of hydrophobic or charged amino acids. To test the hypothesis that these evolutionary constraints on DH sequence influence epitope recognition, we used mice with a single DH that has been altered to preferentially use RF2 or inverted RF1. B cells in these mice produce a CDR-H3 repertoire that is enriched for valine or arginine in place of tyrosine. We serially immunized this panel of mice with gp140 from HIV-1 JR-FL isolate and then used ELISA or peptide microarray to assess antibody binding to key or overlapping HIV-1 envelope epitopes. By ELISA, serum reactivity to key epitopes varied by DH sequence. By microarray, sera with Ig CDR-H3s enriched for arginine bound to linear peptides with a greater range of hydrophobicity, but had a lower intensity of binding than sera containing Ig CDR-H3s enriched for tyrosine or valine. We conclude that patterns of epitope recognition and binding can be heavily influenced by DH germline sequence. This may help explain why antibodies in HIV infected patients must undergo extensive somatic mutation in order to bind to specific viral epitopes and achieve neutralization. PMID:26687685
Haram, Kerstyn M; Peltier, Heidi J; Lu, Bin; Bhasin, Manoj; Otu, Hasan H; Choy, Bob; Regan, Meredith; Libermann, Towia A; Latham, Gary J; Sanda, Martin G; Arredouani, Mohamed S
2008-10-01
Translation of preclinical studies into effective human cancer therapy is hampered by the lack of defined molecular expression patterns in mouse models that correspond to the human counterpart. We sought to generate an open source TRAMP mouse microarray dataset and to use this array to identify differentially expressed genes from human prostate cancer (PCa) that have concordant expression in TRAMP tumors, and thereby represent lead targets for preclinical therapy development. We performed microarrays on total RNA extracted and amplified from eight TRAMP tumors and nine normal prostates. A subset of differentially expressed genes was validated by QRT-PCR. Differentially expressed TRAMP genes were analyzed for concordant expression in publicly available human prostate array datasets and a subset of resulting genes was analyzed by QRT-PCR. Cross-referencing differentially expressed TRAMP genes to public human prostate array datasets revealed 66 genes with concordant expression in mouse and human PCa; 56 between metastases and normal and 10 between primary tumor and normal tissues. Of these 10 genes, two, Sox4 and Tubb2a, were validated by QRT-PCR. Our analysis also revealed various dysregulations in major biologic pathways in the TRAMP prostates. We report a TRAMP microarray dataset of which a gene subset was validated by QRT-PCR with expression patterns consistent with previous gene-specific TRAMP studies. Concordance analysis between TRAMP and human PCa associated genes supports the utility of the model and suggests several novel molecular targets for preclinical therapy.
Tonomura, Noriko; Elvers, Ingegerd; Thomas, Rachael; Megquier, Kate; Turner-Maier, Jason; Howald, Cedric; Sarver, Aaron L.; Swofford, Ross; Frantz, Aric M.; Ito, Daisuke; Mauceli, Evan; Arendt, Maja; Noh, Hyun Ji; Koltookian, Michele; Biagi, Tara; Fryc, Sarah; Williams, Christina; Avery, Anne C.; Kim, Jong-Hyuk; Barber, Lisa; Burgess, Kristine; Lander, Eric S.; Karlsson, Elinor K.; Azuma, Chieko
2015-01-01
Dogs, with their breed-determined limited genetic background, are great models of human disease including cancer. Canine B-cell lymphoma and hemangiosarcoma are both malignancies of the hematologic system that are clinically and histologically similar to human B-cell non-Hodgkin lymphoma and angiosarcoma, respectively. Golden retrievers in the US show significantly elevated lifetime risk for both B-cell lymphoma (6%) and hemangiosarcoma (20%). We conducted genome-wide association studies for hemangiosarcoma and B-cell lymphoma, identifying two shared predisposing loci. The two associated loci are located on chromosome 5, and together contribute ~20% of the risk of developing these cancers. Genome-wide p-values for the top SNP of each locus are 4.6×10-7 and 2.7×10-6, respectively. Whole genome resequencing of nine cases and controls followed by genotyping and detailed analysis identified three shared and one B-cell lymphoma specific risk haplotypes within the two loci, but no coding changes were associated with the risk haplotypes. Gene expression analysis of B-cell lymphoma tumors revealed that carrying the risk haplotypes at the first locus is associated with down-regulation of several nearby genes including the proximal gene TRPC6, a transient receptor Ca2+-channel involved in T-cell activation, among other functions. The shared risk haplotype in the second locus overlaps the vesicle transport and release gene STX8. Carrying the shared risk haplotype is associated with gene expression changes of 100 genes enriched for pathways involved in immune cell activation. Thus, the predisposing germ-line mutations in B-cell lymphoma and hemangiosarcoma appear to be regulatory, and affect pathways involved in T-cell mediated immune response in the tumor. This suggests that the interaction between the immune system and malignant cells plays a common role in the tumorigenesis of these relatively different cancers. PMID:25642983
Stanaway, Ian B.; Gamazon, Eric R.; Smith, Joshua D.; Mirkov, Snezana; Ramirez, Jacqueline; Liu, Wanqing; Lin, Yvonne S.; Moloney, Cliona; Aldred, Shelly Force; Trinklein, Nathan D.; Schuetz, Erin; Nickerson, Deborah A.; Thummel, Ken E.; Rieder, Mark J.; Rettie, Allan E.; Ratain, Mark J.; Cox, Nancy J.; Brown, Christopher D.
2011-01-01
The discovery of expression quantitative trait loci (“eQTLs”) can help to unravel genetic contributions to complex traits. We identified genetic determinants of human liver gene expression variation using two independent collections of primary tissue profiled with Agilent (n = 206) and Illumina (n = 60) expression arrays and Illumina SNP genotyping (550K), and we also incorporated data from a published study (n = 266). We found that ∼30% of SNP-expression correlations in one study failed to replicate in either of the others, even at thresholds yielding high reproducibility in simulations, and we quantified numerous factors affecting reproducibility. Our data suggest that drug exposure, clinical descriptors, and unknown factors associated with tissue ascertainment and analysis have substantial effects on gene expression and that controlling for hidden confounding variables significantly increases replication rate. Furthermore, we found that reproducible eQTL SNPs were heavily enriched near gene starts and ends, and subsequently resequenced the promoters and 3′UTRs for 14 genes and tested the identified haplotypes using luciferase assays. For three genes, significant haplotype-specific in vitro functional differences correlated directly with expression levels, suggesting that many bona fide eQTLs result from functional variants that can be mechanistically isolated in a high-throughput fashion. Finally, given our study design, we were able to discover and validate hundreds of liver eQTLs. Many of these relate directly to complex traits for which liver-specific analyses are likely to be relevant, and we identified dozens of potential connections with disease-associated loci. These included previously characterized eQTL contributors to diabetes, drug response, and lipid levels, and they suggest novel candidates such as a role for NOD2 expression in leprosy risk and C2orf43 in prostate cancer. In general, the work presented here will be valuable for future efforts to precisely identify and functionally characterize genetic contributions to a variety of complex traits. PMID:21637794
Olsson, Mia; Meadows, Jennifer R S; Truvé, Katarina; Rosengren Pielberg, Gerli; Puppo, Francesca; Mauceli, Evan; Quilez, Javier; Tonomura, Noriko; Zanna, Giordana; Docampo, Maria José; Bassols, Anna; Avery, Anne C; Karlsson, Elinor K; Thomas, Anne; Kastner, Daniel L; Bongcam-Rudloff, Erik; Webster, Matthew T; Sanchez, Armand; Hedhammar, Ake; Remmers, Elaine F; Andersson, Leif; Ferrer, Lluis; Tintle, Linda; Lindblad-Toh, Kerstin
2011-03-01
Hereditary periodic fever syndromes are characterized by recurrent episodes of fever and inflammation with no known pathogenic or autoimmune cause. In humans, several genes have been implicated in this group of diseases, but the majority of cases remain unexplained. A similar periodic fever syndrome is relatively frequent in the Chinese Shar-Pei breed of dogs. In the western world, Shar-Pei have been strongly selected for a distinctive thick and heavily folded skin. In this study, a mutation affecting both these traits was identified. Using genome-wide SNP analysis of Shar-Pei and other breeds, the strongest signal of a breed-specific selective sweep was located on chromosome 13. The same region also harbored the strongest genome-wide association (GWA) signal for susceptibility to the periodic fever syndrome (p(raw) = 2.3 × 10⁻⁶, p(genome) = 0.01). Dense targeted resequencing revealed two partially overlapping duplications, 14.3 Kb and 16.1 Kb in size, unique to Shar-Pei and upstream of the Hyaluronic Acid Synthase 2 (HAS2) gene. HAS2 encodes the rate-limiting enzyme synthesizing hyaluronan (HA), a major component of the skin. HA is up-regulated and accumulates in the thickened skin of Shar-Pei. A high copy number of the 16.1 Kb duplication was associated with an increased expression of HAS2 as well as the periodic fever syndrome (p < 0.0001). When fragmented, HA can act as a trigger of the innate immune system and stimulate sterile fever and inflammation. The strong selection for the skin phenotype therefore appears to enrich for a pleiotropic mutation predisposing these dogs to a periodic fever syndrome. The identification of HA as a major risk factor for this canine disease raises the potential of this glycosaminoglycan as a risk factor for human periodic fevers and as an important driver of chronic inflammation.
Olsson, Mia; Mauceli, Evan; Quilez, Javier; Tonomura, Noriko; Zanna, Giordana; Docampo, Maria José; Bassols, Anna; Avery, Anne C.; Karlsson, Elinor K.; Thomas, Anne; Kastner, Daniel L.; Bongcam-Rudloff, Erik; Webster, Matthew T.; Sanchez, Armand; Hedhammar, Åke; Remmers, Elaine F.; Andersson, Leif; Ferrer, Lluis; Tintle, Linda; Lindblad-Toh, Kerstin
2011-01-01
Hereditary periodic fever syndromes are characterized by recurrent episodes of fever and inflammation with no known pathogenic or autoimmune cause. In humans, several genes have been implicated in this group of diseases, but the majority of cases remain unexplained. A similar periodic fever syndrome is relatively frequent in the Chinese Shar-Pei breed of dogs. In the western world, Shar-Pei have been strongly selected for a distinctive thick and heavily folded skin. In this study, a mutation affecting both these traits was identified. Using genome-wide SNP analysis of Shar-Pei and other breeds, the strongest signal of a breed-specific selective sweep was located on chromosome 13. The same region also harbored the strongest genome-wide association (GWA) signal for susceptibility to the periodic fever syndrome (praw = 2.3×10−6, pgenome = 0.01). Dense targeted resequencing revealed two partially overlapping duplications, 14.3 Kb and 16.1 Kb in size, unique to Shar-Pei and upstream of the Hyaluronic Acid Synthase 2 (HAS2) gene. HAS2 encodes the rate-limiting enzyme synthesizing hyaluronan (HA), a major component of the skin. HA is up-regulated and accumulates in the thickened skin of Shar-Pei. A high copy number of the 16.1 Kb duplication was associated with an increased expression of HAS2 as well as the periodic fever syndrome (p<0.0001). When fragmented, HA can act as a trigger of the innate immune system and stimulate sterile fever and inflammation. The strong selection for the skin phenotype therefore appears to enrich for a pleiotropic mutation predisposing these dogs to a periodic fever syndrome. The identification of HA as a major risk factor for this canine disease raises the potential of this glycosaminoglycan as a risk factor for human periodic fevers and as an important driver of chronic inflammation. PMID:21437276
Larson, Amy; Fair, Benjamin Jung; Pleiss, Jeffrey A
2016-06-01
Pre-mRNA splicing is an essential component of eukaryotic gene expression and is highly conserved from unicellular yeasts to humans. Here, we present the development and implementation of a sequencing-based reverse genetic screen designed to identify nonessential genes that impact pre-mRNA splicing in the fission yeast Schizosaccharomyces pombe, an organism that shares many of the complex features of splicing in higher eukaryotes. Using a custom-designed barcoding scheme, we simultaneously queried ∼3000 mutant strains for their impact on the splicing efficiency of two endogenous pre-mRNAs. A total of 61 nonessential genes were identified whose deletions resulted in defects in pre-mRNA splicing; enriched among these were factors encoding known or predicted components of the spliceosome. Included among the candidates identified here are genes with well-characterized roles in other RNA-processing pathways, including heterochromatic silencing and 3' end processing. Splicing-sensitive microarrays confirm broad splicing defects for many of these factors, revealing novel functional connections between these pathways. Copyright © 2016 Larson et al.
Mariani, Luca; Weinand, Kathryn; Vedenko, Anastasia; Barrera, Luis A; Bulyk, Martha L
2017-09-27
Transcription factors (TFs) control cellular processes by binding specific DNA motifs to modulate gene expression. Motif enrichment analysis of regulatory regions can identify direct and indirect TF binding sites. Here, we created a glossary of 108 non-redundant TF-8mer "modules" of shared specificity for 671 metazoan TFs from publicly available and new universal protein binding microarray data. Analysis of 239 ENCODE TF chromatin immunoprecipitation sequencing datasets and associated RNA sequencing profiles suggest the 8mer modules are more precise than position weight matrices in identifying indirect binding motifs and their associated tethering TFs. We also developed GENRE (genomically equivalent negative regions), a tunable tool for construction of matched genomic background sequences for analysis of regulatory regions. GENRE outperformed four state-of-the-art approaches to background sequence construction. We used our TF-8mer glossary and GENRE in the analysis of the indirect binding motifs for the co-occurrence of tethering factors, suggesting novel TF-TF interactions. We anticipate that these tools will aid in elucidating tissue-specific gene-regulatory programs. Copyright © 2017 Elsevier Inc. All rights reserved.
Larson, Amy; Fair, Benjamin Jung; Pleiss, Jeffrey A.
2016-01-01
Pre-mRNA splicing is an essential component of eukaryotic gene expression and is highly conserved from unicellular yeasts to humans. Here, we present the development and implementation of a sequencing-based reverse genetic screen designed to identify nonessential genes that impact pre-mRNA splicing in the fission yeast Schizosaccharomyces pombe, an organism that shares many of the complex features of splicing in higher eukaryotes. Using a custom-designed barcoding scheme, we simultaneously queried ∼3000 mutant strains for their impact on the splicing efficiency of two endogenous pre-mRNAs. A total of 61 nonessential genes were identified whose deletions resulted in defects in pre-mRNA splicing; enriched among these were factors encoding known or predicted components of the spliceosome. Included among the candidates identified here are genes with well-characterized roles in other RNA-processing pathways, including heterochromatic silencing and 3ʹ end processing. Splicing-sensitive microarrays confirm broad splicing defects for many of these factors, revealing novel functional connections between these pathways. PMID:27172183
Nunes, Márcio Roberto Teixeira; de Souza, William Marciel; Acrani, Gustavo Olszanski; Cardoso, Jedson Ferreira; da Silva, Sandro Patroca; Badra, Soraya Jabur; Figueiredo, Luiz Tadeu Moraes; Vasconcelos, Pedro Fernando da Costa
2018-01-01
Group C serogroup includes members of the Orthobunyavirus genus (family Peribunyaviridae) and comprises 15 arboviruses that can be associated with febrile illness in humans. Although previous studies described the genome characterization of Group C orthobunyavirus, there is a gap in genomic information about the other viruses in this group. Therefore, in this study, complete genomes of members of Group C serogroup were sequenced or re-sequenced and used for genetic characterization, as well as to understand their phylogenetic and evolutionary aspects. Thus, our study reported the genomes of three new members in Group C virus (Apeu strain BeAn848, Itaqui strain BeAn12797 and Nepuyo strain BeAn10709), as well as re-sequencing of original strains of five members: Caraparu (strain BeAn3994), Madrid (strain BT4075), Murucutu (strain BeAn974), Oriboca (strain BeAn17), and Marituba (strain BeAn15). These viruses presented a typical genomic organization related to members of the Orthobunyavirus genus. Interestingly, all viruses of this serogroup showed an open reading frame (ORF) that encodes the putative nonstructural NSs protein that precedes the nucleoprotein ORF, an unprecedented fact in Group C virus. Also, we confirmed the presence of natural reassortment events. This study expands the genomic information of Group C viruses, as well as revalidates the genomic organization of viruses that were previously reported.
2013-01-01
Background Artificial selection played an important role in the origin of modern Glycine max cultivars from the wild soybean Glycine soja. To elucidate the consequences of artificial selection accompanying the domestication and modern improvement of soybean, 25 new and 30 published whole-genome re-sequencing accessions, which represent wild, domesticated landrace, and Chinese elite soybean populations were analyzed. Results A total of 5,102,244 single nucleotide polymorphisms (SNPs) and 707,969 insertion/deletions were identified. Among the SNPs detected, 25.5% were not described previously. We found that artificial selection during domestication led to more pronounced reduction in the genetic diversity of soybean than the switch from landraces to elite cultivars. Only a small proportion (2.99%) of the whole genomic regions appear to be affected by artificial selection for preferred agricultural traits. The selection regions were not distributed randomly or uniformly throughout the genome. Instead, clusters of selection hotspots in certain genomic regions were observed. Moreover, a set of candidate genes (4.38% of the total annotated genes) significantly affected by selection underlying soybean domestication and genetic improvement were identified. Conclusions Given the uniqueness of the soybean germplasm sequenced, this study drew a clear picture of human-mediated evolution of the soybean genomes. The genomic resources and information provided by this study would also facilitate the discovery of genes/loci underlying agronomically important traits. PMID:23984715
Osterndorff-Kahanek, Elizabeth A.; Becker, Howard C.; Lopez, Marcelo F.; Farris, Sean P.; Tiwari, Gayatri R.; Nunez, Yury O.; Harris, R. Adron; Mayfield, R. Dayne
2015-01-01
Repeated ethanol exposure and withdrawal in mice increases voluntary drinking and represents an animal model of physical dependence. We examined time- and brain region-dependent changes in gene coexpression networks in amygdala (AMY), nucleus accumbens (NAC), prefrontal cortex (PFC), and liver after four weekly cycles of chronic intermittent ethanol (CIE) vapor exposure in C57BL/6J mice. Microarrays were used to compare gene expression profiles at 0-, 8-, and 120-hours following the last ethanol exposure. Each brain region exhibited a large number of differentially expressed genes (2,000-3,000) at the 0- and 8-hour time points, but fewer changes were detected at the 120-hour time point (400-600). Within each region, there was little gene overlap across time (~20%). All brain regions were significantly enriched with differentially expressed immune-related genes at the 8-hour time point. Weighted gene correlation network analysis identified modules that were highly enriched with differentially expressed genes at the 0- and 8-hour time points with virtually no enrichment at 120 hours. Modules enriched for both ethanol-responsive and cell-specific genes were identified in each brain region. These results indicate that chronic alcohol exposure causes global ‘rewiring‘ of coexpression systems involving glial and immune signaling as well as neuronal genes. PMID:25803291
Gharib, Sina A; Seiger, Ashley N; Hayes, Amanda L; Mehra, Reena; Patel, Sanjay R
2014-04-01
Obstructive sleep apnea (OSA) has been associated with a number of chronic disorders that may improve with effective therapy. However, the molecular pathways affected by continuous positive airway pressure (CPAP) treatment are largely unknown. We sought to assess the system-wide consequences of CPAP therapy by transcriptionally profiling peripheral blood leukocytes (PBLs). Subjects in whom severe OSA was diagnosed were treated with CPAP, and whole-genome expression measurement of PBLs was performed at baseline and following therapy. We used gene set enrichment analysis (GSEA) to identify pathways that were differentially enriched. Network analysis was then applied to highlight key drivers of processes influenced by CPAP. Eighteen subjects with significant OSA underwent CPAP therapy and microarray analysis of their PBLs. Treatment with CPAP improved apnea-hypopnea index (AHI), daytime sleepiness, and blood pressure, but did not affect anthropometric measures. GSEA revealed a number of enriched gene sets, many of which were involved in neoplastic processes and displayed downregulated expression patterns in response to CPAP. Network analysis identified several densely connected genes that are important modulators of cancer and tumor growth. Effective therapy of OSA with CPAP is associated with alterations in circulating leukocyte gene expression. Functional enrichment and network analyses highlighted transcriptional suppression in cancer-related pathways, suggesting potentially novel mechanisms linking OSA with neoplastic signatures.
Jia, Xinzheng; Lin, Huiran; Nie, Qinghua; Zhang, Xiquan; Lamont, Susan J
2016-11-03
Body weight is one of the most important quantitative traits with high heritability in chicken. We previously mapped a quantitative trait locus (QTL) for body weight by genome-wide association study (GWAS) in an F2 chicken resource population. To identify the causal mutations linked to this QTL, expression profiles were determined on livers of high-weight and low-weight chicken lines by microarray. Combining the expression pattern with SNP effects by GWAS, miR-16 was identified as the most likely potential candidate with a 3.8-fold decrease in high-weight lines. Re-sequencing revealed that a 54-bp insertion mutation in the upstream region of miR-15a-16 displayed high allele frequencies in high-weight commercial broiler line. This mutation resulted in lower miR-16 expression by introducing three novel splicing sites instead of the missing 5' terminal splicing of mature miR-16. Elevating miR-16 significantly inhibited DF-1 chicken embryo cell proliferation, consistent with a role in suppression of cellular growth. The 54-bp insertion was significantly associated with increased body weight, bone size and muscle mass. Also, the insertion mutation tended towards fixation in commercial broilers (Fst > 0.4). Our findings revealed a novel causative mutation for body weight regulation that aids our basic understanding of growth regulation in birds.
Jia, Xinzheng; Lin, Huiran; Nie, Qinghua; Zhang, Xiquan; Lamont, Susan J.
2016-01-01
Body weight is one of the most important quantitative traits with high heritability in chicken. We previously mapped a quantitative trait locus (QTL) for body weight by genome-wide association study (GWAS) in an F2 chicken resource population. To identify the causal mutations linked to this QTL, expression profiles were determined on livers of high-weight and low-weight chicken lines by microarray. Combining the expression pattern with SNP effects by GWAS, miR-16 was identified as the most likely potential candidate with a 3.8-fold decrease in high-weight lines. Re-sequencing revealed that a 54-bp insertion mutation in the upstream region of miR-15a-16 displayed high allele frequencies in high-weight commercial broiler line. This mutation resulted in lower miR-16 expression by introducing three novel splicing sites instead of the missing 5′ terminal splicing of mature miR-16. Elevating miR-16 significantly inhibited DF-1 chicken embryo cell proliferation, consistent with a role in suppression of cellular growth. The 54-bp insertion was significantly associated with increased body weight, bone size and muscle mass. Also, the insertion mutation tended towards fixation in commercial broilers (Fst > 0.4). Our findings revealed a novel causative mutation for body weight regulation that aids our basic understanding of growth regulation in birds. PMID:27808177
Van Loo, Peter; Aerts, Stein; Thienpont, Bernard; De Moor, Bart; Moreau, Yves; Marynen, Peter
2008-01-01
We present ModuleMiner, a novel algorithm for computationally detecting cis-regulatory modules (CRMs) in a set of co-expressed genes. ModuleMiner outperforms other methods for CRM detection on benchmark data, and successfully detects CRMs in tissue-specific microarray clusters and in embryonic development gene sets. Interestingly, CRM predictions for differentiated tissues exhibit strong enrichment close to the transcription start site, whereas CRM predictions for embryonic development gene sets are depleted in this region. PMID:18394174
Recently evolved human-specific methylated regions are enriched in schizophrenia signals.
Banerjee, Niladri; Polushina, Tatiana; Bettella, Francesco; Giddaluru, Sudheer; Steen, Vidar M; Andreassen, Ole A; Le Hellard, Stephanie
2018-05-11
One explanation for the persistence of schizophrenia despite the reduced fertility of patients is that it is a by-product of recent human evolution. This hypothesis is supported by evidence suggesting that recently-evolved genomic regions in humans are involved in the genetic risk for schizophrenia. Using summary statistics from genome-wide association studies (GWAS) of schizophrenia and 11 other phenotypes, we tested for enrichment of association with GWAS traits in regions that have undergone methylation changes in the human lineage compared to Neanderthals and Denisovans, i.e. human-specific differentially methylated regions (DMRs). We used analytical tools that evaluate polygenic enrichment of a subset of genomic variants against all variants. Schizophrenia was the only trait in which DMR SNPs showed clear enrichment of association that passed the genome-wide significance threshold. The enrichment was not observed for Neanderthal or Denisovan DMRs. The enrichment seen in human DMRs is comparable to that for genomic regions tagged by Neanderthal Selective Sweep markers, and stronger than that for Human Accelerated Regions. The enrichment survives multiple testing performed through permutation (n = 10,000) and bootstrapping (n = 5000) in INRICH (p < 0.01). Some enrichment of association with height was observed at the gene level. Regions where DNA methylation modifications have changed during recent human evolution show enrichment of association with schizophrenia and possibly with height. Our study further supports the hypothesis that genetic variants conferring risk of schizophrenia co-occur in genomic regions that have changed as the human species evolved. Since methylation is an epigenetic mark, potentially mediated by environmental changes, our results also suggest that interaction with the environment might have contributed to that association.
The GENCODE exome: sequencing the complete human exome
Coffey, Alison J; Kokocinski, Felix; Calafato, Maria S; Scott, Carol E; Palta, Priit; Drury, Eleanor; Joyce, Christopher J; LeProust, Emily M; Harrow, Jen; Hunt, Sarah; Lehesjoki, Anna-Elina; Turner, Daniel J; Hubbard, Tim J; Palotie, Aarno
2011-01-01
Sequencing the coding regions, the exome, of the human genome is one of the major current strategies to identify low frequency and rare variants associated with human disease traits. So far, the most widely used commercial exome capture reagents have mainly targeted the consensus coding sequence (CCDS) database. We report the design of an extended set of targets for capturing the complete human exome, based on annotation from the GENCODE consortium. The extended set covers an additional 5594 genes and 10.3 Mb compared with the current CCDS-based sets. The additional regions include potential disease genes previously inaccessible to exome resequencing studies, such as 43 genes linked to ion channel activity and 70 genes linked to protein kinase activity. In total, the new GENCODE exome set developed here covers 47.9 Mb and performed well in sequence capture experiments. In the sample set used in this study, we identified over 5000 SNP variants more in the GENCODE exome target (24%) than in the CCDS-based exome sequencing. PMID:21364695
Sharrow, Allison C; Perkins, Brandy; Collector, Michael I; Yu, Wayne; Simons, Brian W; Jones, Richard J
2016-08-01
The cancer stem cell (CSC) paradigm hypothesizes that successful clinical eradication of CSCs may lead to durable remission for patients with ovarian cancer. Despite mounting evidence in support of ovarian CSCs, their phenotype and clinical relevance remain unclear. We and others have found high aldehyde dehydrogenase 1 (ALDH(high)) expression in a variety of normal and malignant stem cells, and sought to better characterize ALDH(high) cells in ovarian cancer. We compared ALDH(high) to ALDH(low) cells in two ovarian cancer models representing distinct subtypes: FNAR-C1 cells, derived from a spontaneous rat endometrioid carcinoma, and the human SKOV3 cell line (described as both serous and clear cell subtypes). We assessed these populations for stem cell features then analyzed expression by microarray and qPCR. ALDH(high) cells displayed CSC properties, including: smaller size, quiescence, regenerating the phenotypic diversity of the cell lines in vitro, lack of contact inhibition, nonadherent growth, multi-drug resistance, and in vivo tumorigenicity. Microarray and qPCR analysis of the expression of markers reported by others to enrich for ovarian CSCs revealed that ALDH(high) cells of both models showed downregulation of CD24, but inconsistent expression of CD44, KIT and CD133. However, the following druggable targets were consistently expressed in the ALDH(high) cells from both models: mTOR signaling, her-2/neu, CD47 and FGF18/FGFR3. Based on functional characterization, ALDH(high) ovarian cancer cells represent an ovarian CSC population. Differential gene expression identified druggable targets that have the potential for therapeutic efficacy against ovarian CSCs from multiple subtypes. Copyright © 2016 Elsevier Inc. All rights reserved.
Kristensen, Malene M; Davidsen, Peter K; Vigelsø, Andreas; Hansen, Christina N; Jensen, Lars J; Jessen, Niels; Bruun, Jens M; Dela, Flemming; Helge, Jørn W
2017-03-01
Obesity is central in the development of insulin resistance. However, the underlying mechanisms still need elucidation. Dysregulated microRNAs (miRNAs; post-transcriptional regulators) in adipose tissue may present an important link. The miRNA expression in subcutaneous adipose tissue from 19 individuals with severe obesity (10 women and 9 men) before and after a 15-week weight loss intervention was studied using genome-wide microarray analysis. The microarray results were validated with RT-qPCR, and pathway enrichment analysis of in silico predicted targets was performed to elucidate the biological consequences of the miRNA dysregulation. Lastly, the messenger RNA (mRNA) and/or protein expression of multiple predicted targets as well as several proteins involved in lipolysis were investigated. The intervention led to upregulation of miR-29a-3p and miR-29a-5p and downregulation of miR-20b-5p. The mRNA and protein expression of predicted targets was not significantly affected by the intervention. However, negative correlations between miR-20b-5p and the protein levels of its predicted target, acyl-CoA synthetase long-chain family member 1, were observed. Several other miRNA-target relationships correlated negatively, indicating possible miRNA regulation, including miR-29a-3p and lipoprotein lipase mRNA levels. Proteins involved in lipolysis were not affected by the intervention. Weight loss influenced several miRNAs, some of which were negatively correlated with predicted targets. These dysregulated miRNAs may affect adipocytokine signaling and forkhead box protein O signaling. © 2017 The Obesity Society.
Exposure to Cobalt Causes Transcriptomic and Proteomic Changes in Two Rat Liver Derived Cell Lines
Permenter, Matthew G.; Dennis, William E.; Sutto, Thomas E.; Jackson, David A.; Lewis, John A.; Stallings, Jonathan D.
2013-01-01
Cobalt is a transition group metal present in trace amounts in the human diet, but in larger doses it can be acutely toxic or cause adverse health effects in chronic exposures. Its use in many industrial processes and alloys worldwide presents opportunities for occupational exposures, including military personnel. While the toxic effects of cobalt have been widely studied, the exact mechanisms of toxicity remain unclear. In order to further elucidate these mechanisms and identify potential biomarkers of exposure or effect, we exposed two rat liver-derived cell lines, H4-II-E-C3 and MH1C1, to two concentrations of cobalt chloride. We examined changes in gene expression using DNA microarrays in both cell lines and examined changes in cytoplasmic protein abundance in MH1C1 cells using mass spectrometry. We chose to closely examine differentially expressed genes and proteins changing in abundance in both cell lines in order to remove cell line specific effects. We identified enriched pathways, networks, and biological functions using commercial bioinformatic tools and manual annotation. Many of the genes, proteins, and pathways modulated by exposure to cobalt appear to be due to an induction of a hypoxic-like response and oxidative stress. Genes that may be differentially expressed due to a hypoxic-like response are involved in Hif-1α signaling, glycolysis, gluconeogenesis, and other energy metabolism related processes. Gene expression changes linked to oxidative stress are also known to be involved in the NRF2-mediated response, protein degradation, and glutathione production. Using microarray and mass spectrometry analysis, we were able to identify modulated genes and proteins, further elucidate the mechanisms of toxicity of cobalt, and identify biomarkers of exposure and effect in vitro, thus providing targets for focused in vivo studies. PMID:24386269
Flibotte, Stephane; Moerman, Donald G
2008-10-21
Microarray comparative genomic hybridization (CGH) is currently one of the most powerful techniques to measure DNA copy number in large genomes. In humans, microarray CGH is widely used to assess copy number variants in healthy individuals and copy number aberrations associated with various diseases, syndromes and disease susceptibility. In model organisms such as Caenorhabditis elegans (C. elegans) the technique has been applied to detect mutations, primarily deletions, in strains of interest. Although various constraints on oligonucleotide properties have been suggested to minimize non-specific hybridization and improve the data quality, there have been few experimental validations for CGH experiments. For genomic regions where strict design filters would limit the coverage it would also be useful to quantify the expected loss in data quality associated with relaxed design criteria. We have quantified the effects of filtering various oligonucleotide properties by measuring the resolving power for detecting deletions in the human and C. elegans genomes using NimbleGen microarrays. Approximately twice as many oligonucleotides are typically required to be affected by a deletion in human DNA samples in order to achieve the same statistical confidence as one would observe for a deletion in C. elegans. Surprisingly, the ability to detect deletions strongly depends on the oligonucleotide 15-mer count, which is defined as the sum of the genomic frequency of all the constituent 15-mers within the oligonucleotide. A similarity level above 80% to non-target sequences over the length of the probe produces significant cross-hybridization. We recommend the use of a fairly large melting temperature window of up to 10 degrees C, the elimination of repeat sequences, the elimination of homopolymers longer than 5 nucleotides, and a threshold of -1 kcal/mol on the oligonucleotide self-folding energy. We observed very little difference in data quality when varying the oligonucleotide length between 50 and 70, and even when using an isothermal design strategy. We have determined experimentally the effects of varying several key oligonucleotide microarray design criteria for detection of deletions in C. elegans and humans with NimbleGen's CGH technology. Our oligonucleotide design recommendations should be applicable for CGH analysis in most species.
Hancock, Angela M.; Clark, Vanessa J.; Qian, Yudong; Di Rienzo, Anna
2011-01-01
Production of heat via nonshivering thermogenesis (NST) is critical for temperature homeostasis in mammals. Uncoupling protein UCP1 plays a central role in NST by uncoupling the proton gradients produced in the inner membranes of mitochondria to produce heat; however, the extent to which UCP1 homologues, UCP2 and UCP3, are involved in NST is the subject of an ongoing debate. We used an evolutionary approach to test the hypotheses that variants that are associated with increased expression of these genes (UCP1 −3826A, UCP2 −866A, and UCP3 −55T) show evidence of adaptation with winter climate. To that end, we calculated correlations between allele frequencies and winter climate variables for these single-nucleotide polymorphisms (SNPs), which we genotyped in a panel of 52 worldwide populations. We found significant correlations with winter climate for UCP1 −3826G/A and UCP3 −55C/T. Further, by analyzing previously published genotype data for these SNPs, we found that the peak of the correlation for the UCP1 region occurred at the disease-associated −3826A/G variant and that the UCP3 region has a striking signal overall, with several individual SNPs showing interesting patterns, including the −55C/T variant. Resequencing of the regions in a set of three diverse population samples helped to clarify the signals that we found with the genotype data. At UCP1, the resequencing data revealed modest evidence that the haplotype carrying the −3826A variant was driven to high frequency by selection. In the UCP3 region, combining results from the climate analysis and resequencing survey suggest a more complex model in which variants on multiple haplotypes may independently be correlated with temperature. This is further supported by an excess of intermediate frequency variants in the UCP3 region in the Han Chinese population. Taken together, our results suggest that adaptation to climate influenced the global distribution of allele frequencies in UCP1 and UCP3 and provide an independent source of evidence for a role in cold resistance for UCP3. PMID:20802238
Barrick, Jeffrey E; Colburn, Geoffrey; Deatherage, Daniel E; Traverse, Charles C; Strand, Matthew D; Borges, Jordan J; Knoester, David B; Reba, Aaron; Meyer, Austin G
2014-11-29
Mutations that alter chromosomal structure play critical roles in evolution and disease, including in the origin of new lifestyles and pathogenic traits in microbes. Large-scale rearrangements in genomes are often mediated by recombination events involving new or existing copies of mobile genetic elements, recently duplicated genes, or other repetitive sequences. Most current software programs for predicting structural variation from short-read DNA resequencing data are intended primarily for use on human genomes. They typically disregard information in reads mapping to repeat sequences, and significant post-processing and manual examination of their output is often required to rule out false-positive predictions and precisely describe mutational events. We have implemented an algorithm for identifying structural variation from DNA resequencing data as part of the breseq computational pipeline for predicting mutations in haploid microbial genomes. Our method evaluates the support for new sequence junctions present in a clonal sample from split-read alignments to a reference genome, including matches to repeat sequences. Then, it uses a statistical model of read coverage evenness to accept or reject these predictions. Finally, breseq combines predictions of new junctions and deleted chromosomal regions to output biologically relevant descriptions of mutations and their effects on genes. We demonstrate the performance of breseq on simulated Escherichia coli genomes with deletions generating unique breakpoint sequences, new insertions of mobile genetic elements, and deletions mediated by mobile elements. Then, we reanalyze data from an E. coli K-12 mutation accumulation evolution experiment in which structural variation was not previously identified. Transposon insertions and large-scale chromosomal changes detected by breseq account for ~25% of spontaneous mutations in this strain. In all cases, we find that breseq is able to reliably predict structural variation with modest read-depth coverage of the reference genome (>40-fold). Using breseq to predict structural variation should be useful for studies of microbial epidemiology, experimental evolution, synthetic biology, and genetics when a reference genome for a closely related strain is available. In these cases, breseq can discover mutations that may be responsible for important or unintended changes in genomes that might otherwise go undetected.
Neugart, Susanne; Krumbein, Angelika; Zrenner, Rita
2016-01-01
Light intensity and temperature are very important signals for the regulation of plant growth and development. Plants subjected to less favorable light or temperature conditions often respond with accumulation of secondary metabolites. Some of these metabolites have been identified as bioactive compounds, considered to exert positive effects on human health when consumed regularly. In order to test a typical range of growth parameters for the winter crop Brassica oleracea var. sabellica, plants were grown either at 400 μmol m(-2) s(-1) or 100 μmol m(-2) s(-1) at 10°C, or at 400 μmol m(-2) s(-1) with 5 or 15°C. The higher light intensity overall increased flavonol content of leaves, favoring the main quercetin glycosides, a caffeic acid monoacylated kaempferol triglycoside, and disinapoyl-gentiobiose. The higher temperature mainly increased the hydroxycinnamic acid derivative disinapoyl-gentiobiose, while at lower temperature synthesis is in favor of very complex sinapic acid acylated flavonol tetraglycosides such as kaempferol-3-O-sinapoyl-sophoroside-7-O-diglucoside. A global analysis of light and temperature dependent alterations of gene expression in B. oleracea var. sabellica leaves was performed with the most comprehensive Brassica microarray. When compared to the light experiment much less genes were differentially expressed in kale leaves grown at 5 or 15°C. A structured evaluation of differentially expressed genes revealed the expected enrichment in the functional categories of e.g. protein degradation at different light intensities or phytohormone metabolism at different temperature. Genes of the secondary metabolism namely phenylpropanoids are significantly enriched with both treatments. Thus, the genome of B. oleracea was screened for predicted genes putatively involved in the biosynthesis of flavonoids and hydroxycinnamic acid derivatives. All identified B. oleracea genes were analyzed for their most specific 60-mer oligonucleotides present on the 2 × 105 K format Brassica microarray. Expression differences were correlated to the structure-dependent response of flavonoid glycosides and hydroxycinnamic acid derivatives to alterations in either light or temperature. The altered metabolite accumulation was mainly reflected on gene expression level of core biosynthetic pathway genes and gave further hints to an isoform specific functional specialization.
Khan, Meraj A; Sengupta, Jayasree; Mittal, Suneeta; Ghosh, Debabrata
2012-09-24
In order to obtain a lead of the pathophysiology of endometriosis, genome-wide expressional analyses of eutopic and ectopic endometrium have earlier been reported, however, the effects of stages of severity and phases of menstrual cycle on expressional profiles have not been examined. The effect of genetic heterogeneity and fertility history on transcriptional activity was also not considered. In the present study, a genome-wide expression analysis of autologous, paired eutopic and ectopic endometrial samples obtained from fertile women (n=18) suffering from moderate (stage 3; n=8) or severe (stage 4; n=10) ovarian endometriosis during proliferative (n=13) and secretory (n=5) phases of menstrual cycle was performed. Individual pure RNA samples were subjected to Agilent's Whole Human Genome 44K microarray experiments. Microarray data were validated (P<0.01) by estimating transcript copy numbers by performing real time RT-PCR of seven (7) arbitrarily selected genes in all samples. The data obtained were subjected to differential expression (DE) and differential co-expression (DC) analyses followed by networks and enrichment analysis, and gene set enrichment analysis (GSEA). The reproducibility of prediction based on GSEA implementation of DC results was assessed by examining the relative expressions of twenty eight (28) selected genes in RNA samples obtained from fresh pool of eutopic and ectopic samples from confirmed ovarian endometriosis patients with stages 3 and 4 (n=4/each) during proliferative and secretory (n=4/each) phases. Higher clustering effect of pairing (cluster distance, cd=0.1) in samples from same individuals on expressional arrays among eutopic and ectopic samples was observed as compared to that of clinical stages of severity (cd=0.5) and phases of menstrual cycle (cd=0.6). Post hoc analysis revealed anomaly in the expressional profiles of several genes associated with immunological, neuracrine and endocrine functions and gynecological cancers however with no overt oncogenic potential in endometriotic tissue. Dys-regulation of three (CLOCK, ESR1, and MYC) major transcription factors appeared to be significant causative factors in the pathogenesis of ovarian endometriosis. A novel cohort of twenty-eight (28) genes representing potential marker for ovarian endometriosis in fertile women was discovered. Dysfunctional expression of immuno-neuro-endocrine behaviour in endometrium appeared critical to endometriosis. Although no overt oncogenic potential was evident, several genes associated with gynecological cancers were observed to be high in the expressional profiles in endometriotic tissue.
Neugart, Susanne; Krumbein, Angelika; Zrenner, Rita
2016-01-01
Light intensity and temperature are very important signals for the regulation of plant growth and development. Plants subjected to less favorable light or temperature conditions often respond with accumulation of secondary metabolites. Some of these metabolites have been identified as bioactive compounds, considered to exert positive effects on human health when consumed regularly. In order to test a typical range of growth parameters for the winter crop Brassica oleracea var. sabellica, plants were grown either at 400 μmol m−2 s−1 or 100 μmol m−2 s−1 at 10°C, or at 400 μmol m−2 s−1 with 5 or 15°C. The higher light intensity overall increased flavonol content of leaves, favoring the main quercetin glycosides, a caffeic acid monoacylated kaempferol triglycoside, and disinapoyl-gentiobiose. The higher temperature mainly increased the hydroxycinnamic acid derivative disinapoyl-gentiobiose, while at lower temperature synthesis is in favor of very complex sinapic acid acylated flavonol tetraglycosides such as kaempferol-3-O-sinapoyl-sophoroside-7-O-diglucoside. A global analysis of light and temperature dependent alterations of gene expression in B. oleracea var. sabellica leaves was performed with the most comprehensive Brassica microarray. When compared to the light experiment much less genes were differentially expressed in kale leaves grown at 5 or 15°C. A structured evaluation of differentially expressed genes revealed the expected enrichment in the functional categories of e.g. protein degradation at different light intensities or phytohormone metabolism at different temperature. Genes of the secondary metabolism namely phenylpropanoids are significantly enriched with both treatments. Thus, the genome of B. oleracea was screened for predicted genes putatively involved in the biosynthesis of flavonoids and hydroxycinnamic acid derivatives. All identified B. oleracea genes were analyzed for their most specific 60-mer oligonucleotides present on the 2 × 105 K format Brassica microarray. Expression differences were correlated to the structure-dependent response of flavonoid glycosides and hydroxycinnamic acid derivatives to alterations in either light or temperature. The altered metabolite accumulation was mainly reflected on gene expression level of core biosynthetic pathway genes and gave further hints to an isoform specific functional specialization. PMID:27066016
Pirim, Dilek; Wang, Xingbin; Niemsiri, Vipavee; Radwan, Zaheda H.; Bunker, Clareann H.; Hokanson, John E.; Hamman, Richard F.; Barmada, M. Michael; Demirci, F. Yesim; Kamboh, M. Ilyas
2015-01-01
Background Cholesteryl ester transfer protein (CETP) plays a crucial role in lipid metabolism. Associations of common CETP variants with variation in plasma lipid levels, and/or CETP mass/activity have been extensively studied and well-documented; however, the effects of uncommon/rare CETP variants on plasma lipid profile remain undefined. Hence, resequencing of the gene in extreme phenotypes and follow-up rare-variant association analyses are essential to fill this gap. Objective To identify common and uncommon/rare variants in the CETP gene by resequencing the entire gene and test the effects of both common and uncommon/rare CETP variants on plasma lipid traits in two genetically distinct populations. Methods and Results The entire CETP gene plus flanking regions were resequenced in 190 individuals comprising 95 non-Hispanic Whites (NHWs) and 95 African blacks with extreme HDL-C levels. A total of 279 sequence variants were identified, of which 25 were novel. Selected variants were genotyped in the entire samples of 623 NHWs and 788 African blacks and 184 QC-passed variants were tested in relation to plasma lipid traits by using gene-based, single-site, haplotype and rare variant association analyses (SKAT-O). Two novel and independent associations of rs1968905 and rs289740 with HDL-C were identified in African blacks. Using SKAT-O analysis, we also identified rare variants with minor allele frequency <0.01 to be associated with HDL-C in both NHWs (P=0.024) and African blacks (P=0.009). Conclusions Our results point out that in addition to the common CETP variants, rare genetic variants in the CETP gene also contribute to the phenotypic variation of HDL-C in the general population. PMID:26683795
Microarray expression profile of circular RNAs in chronic thromboembolic pulmonary hypertension
Miao, Ran; Wang, Ying; Wan, Jun; Leng, Dong; Gong, Juanni; Li, Jifeng; Liang, Yan; Zhai, Zhenguo; Yang, Yuanhua
2017-01-01
Abstract Background: Chronic thromboembolic pulmonary hypertension (CTEPH) is a rare but debilitating and life-threatening complication of acute pulmonary embolism. Circular RNAs (circRNAs), presenting as covalently closed continuous loops, are RNA molecules with covalently joined 3′- and 5′-ends formed by back-splicing events. circRNAs may be significant biological molecules to understand disease mechanisms and to identify biomarkers for disease diagnosis and therapy. The aim of this study was to investigate the potential roles of circRNAs in CTEPH. Methods: Ten human blood samples (5 each from CTEPH and control groups) were included in the Agilent circRNA chip. The differentially expressed circRNAs were evaluated using t test, with significance set at a P value of < .05. A functional enrichment analysis for differentially expressed circRNAs was performed using DAVID online tools, and a Kyoto Encyclopedia of Genes and Genomes pathway enrichment analysis for target genes of miRNAs was performed using the R package clusterProfiler. Furthermore, miRNAs that interacted with differentially expressed circRNAs were predicted using the miRanda package. mRNAs that had clear biological functions and were regulated by miRNAs were predicted using miRWalk2.0 and then combined into a circRNA–miRNA–mRNA network. Results: In total, 351 differentially expressed circRNAs (122 upregulated and 229 downregulated) between CTEPH and control groups were obtained; among these circRNAs, hsa_circ_0002062 and hsa_circ_0022342 might be important because they can regulate 761 (e.g., hsa-miR-942–5p) and 453 (e.g., hsa-miR-940) miRNAs, respectively. Target genes (e.g., cyclin-dependent kinase 6) of hsa-miR-942–5p were mainly enriched in cancer-related pathways, whereas target genes (e.g., CRK-Like Proto-Oncogene, Adaptor Protein) of hsa-miR-940 were enriched in the ErbB signaling pathway. Therefore, these pathways are potentially important in CTEPH. Conclusions: Our findings suggested that hsa_circ_0002062 and hsa_circ_0022342 may be key circRNAs for CTEPH development and that their targeted regulation may be an effective approach for treating CTEPH. PMID:28682884
Transcriptional Signatures of Sleep Duration Discordance in Monozygotic Twins.
Watson, N F; Buchwald, D; Delrow, J J; Altemeier, W A; Vitiello, M V; Pack, A I; Bamshad, M; Noonan, C; Gharib, S A
2017-01-01
Habitual short sleep duration is associated with adverse metabolic, cardiovascular, and inflammatory effects. Co-twin study methodologies account for familial (eg, genetics and shared environmental) confounding, allowing assessment of subtle environmental effects, such as the effect of habitual short sleep duration on gene expression. Therefore, we investigated gene expression in monozygotic twins discordant for actigraphically phenotyped habitual sleep duration. Eleven healthy monozygotic twin pairs (82% female; mean age 42.7 years; SD = 18.1), selected based on subjective sleep duration discordance, were objectively phenotyped for habitual sleep duration with 2 weeks of wrist actigraphy. Peripheral blood leukocyte (PBL) RNA from fasting blood samples was obtained on the final day of actigraphic measurement and hybridized to Illumina humanHT-12 microarrays. Differential gene expression was determined between paired samples and mapped to functional categories using Gene Ontology. Finally, a more comprehensive gene set enrichment analysis was performed based on the entire PBL transcriptome. The mean 24-hour sleep duration of the total sample was 439.2 minutes (SD = 46.8 minutes; range 325.4-521.6 minutes). Mean within-pair sleep duration difference per 24 hours was 64.4 minutes (SD = 21.2; range 45.9-114.6 minutes). The twin cohort displayed distinctive pathway enrichment based on sleep duration differences. Habitual short sleep was associated with up-regulation of genes involved in transcription, ribosome, translation, and oxidative phosphorylation. Unexpectedly, genes down-regulated in short sleep twins were highly enriched in immuno-inflammatory pathways such as interleukin signaling and leukocyte activation, as well as developmental programs, coagulation cascade, and cell adhesion. Objectively assessed habitual sleep duration in monozygotic twin pairs appears to be associated with distinct patterns of differential gene expression and pathway enrichment. By accounting for familial confounding and measuring real life sleep duration, our study shows the transcriptomic effects of habitual short sleep on dysregulated immune response and provides a potential link between sleep deprivation and adverse metabolic, cardiovascular, and inflammatory outcomes. © Sleep Research Society 2017. Published by Oxford University Press on behalf of the Sleep Research Society. All rights reserved. For permissions, please e-mail journals.permissions@oup.com.
Human and great ape red blood cells differ in plasmalogen levels and composition
2011-01-01
Background Plasmalogens are ether phospholipids required for normal mammalian developmental, physiological, and cognitive functions. They have been proposed to act as membrane antioxidants and reservoirs of polyunsaturated fatty acids as well as influence intracellular signaling and membrane dynamics. Plasmalogens are particularly enriched in cells and tissues of the human nervous, immune, and cardiovascular systems. Humans with severely reduced plasmalogen levels have reduced life spans, abnormal neurological development, skeletal dysplasia, impaired respiration, and cataracts. Plasmalogen deficiency is also found in the brain tissue of individuals with Alzheimer disease. Results In a human and great ape cohort, we measured the red blood cell (RBC) levels of the most abundant types of plasmalogens. Total RBC plasmalogen levels were lower in humans than bonobos, chimpanzees, and gorillas, but higher than orangutans. There were especially pronounced cross-species differences in the levels of plasmalogens with a C16:0 moiety at the sn-1 position. Humans on Western or vegan diets had comparable total RBC plasmalogen levels, but the latter group showed moderately higher levels of plasmalogens with a C18:1 moiety at the sn-1 position. We did not find robust sex-specific differences in human or chimpanzee RBC plasmalogen levels or composition. Furthermore, human and great ape skin fibroblasts showed only modest differences in peroxisomal plasmalogen biosynthetic activity. Human and chimpanzee microarray data indicated that genes involved in plasmalogen biosynthesis show cross-species differential expression in multiple tissues. Conclusion We propose that the observed differences in human and great ape RBC plasmalogens are primarily caused by their rates of biosynthesis and/or turnover. Gene expression data raise the possibility that other human and great ape cells and tissues differ in plasmalogen levels. Based on the phenotypes of humans and rodents with plasmalogen disorders, we propose that cross-species differences in tissue plasmalogen levels could influence organ functions and processes ranging from cognition to reproduction to aging. PMID:21679470
Microfluidic extraction and microarray detection of biomarkers from cancer tissue slides
NASA Astrophysics Data System (ADS)
Nguyen, H. T.; Dupont, L. N.; Jean, A. M.; Géhin, T.; Chevolot, Y.; Laurenceau, E.; Gijs, M. A. M.
2018-03-01
We report here a new microfluidic method allowing for the quantification of human epidermal growth factor receptor 2 (HER2) expression levels from formalin-fixed breast cancer tissues. After partial extraction of proteins from the tissue slide, the extract is routed to an antibody (Ab) microarray for HER2 titration by fluorescence. Then the HER2-expressing cell area is evaluated by immunofluorescence (IF) staining of the tissue slide and used to normalize the fluorescent HER2 signal measured from the Ab microarray. The number of HER2 gene copies measured by fluorescence in situ hybridization (FISH) on an adjacent tissue slide is concordant with the normalized HER2 expression signal. This work is the first study implementing biomarker extraction and detection from cancer tissue slides using microfluidics in combination with a microarray system, paving the way for further developments towards multiplex and precise quantification of cancer biomarkers.
Glycan microarray screening assay for glycosyltransferase specificities.
Peng, Wenjie; Nycholat, Corwin M; Razi, Nahid
2013-01-01
Glycan microarrays represent a high-throughput approach to determining the specificity of glycan-binding proteins against a large set of glycans in a single format. This chapter describes the use of a glycan microarray platform for evaluating the activity and substrate specificity of glycosyltransferases (GTs). The methodology allows simultaneous screening of hundreds of immobilized glycan acceptor substrates by in situ incubation of a GT and its appropriate donor substrate on the microarray surface. Using biotin-conjugated donor substrate enables direct detection of the incorporated sugar residues on acceptor substrates on the array. In addition, the feasibility of the method has been validated using label-free donor substrate combined with lectin-based detection of product to assess enzyme activity. Here, we describe the application of both procedures to assess the specificity of a recombinant human α2-6 sialyltransferase. This technique is readily adaptable to studying other glycosyltransferases.
DNA microarrays: a powerful genomic tool for biomedical and clinical research
Trevino, Victor; Falciani, Francesco; Barrera-Saldaña, Hugo A.
2007-01-01
Among the many benefits of the Human Genome Project are new and powerful tools such as the genome-wide hybridization devices referred as microarrays. Initially designed to measure gene transcriptional levels, microarray technologies are now used for comparing other genome features among individuals and their tissues and cells. Results provide valuable information on disease subcategories, disease prognosis, and treatment outcome. Likewise, reveal differences in genetic makeup, regulatory mechanisms and subtle variations are approaching the era of personalized medicine. To understand this powerful tool, its versatility and how it is dramatically changing the molecular approach to biomedical and clinical research, this review describes the technology, its applications, a didactic step-by-step review of a typical microarray protocol, and a real experiment. Finally, it calls the attention of the medical community to integrate multidisciplinary teams, to take advantage of this technology and its expanding applications that in a slide reveals our genetic inheritance and destiny. PMID:17660860
Akçaalan, Reyhan; Albay, Meric; Koker, Latife; Baudart, Julia; Guillebault, Delphine; Fischer, Sabine; Weigel, Wilfried; Medlin, Linda K
2017-12-22
Monitoring drinking water quality is an important public health issue. Two objectives from the 4 years, six nations, EU Project μAqua were to develop hierarchically specific probes to detect and quantify pathogens in drinking water using a PCR-free microarray platform and to design a standardised water sampling program from different sources in Europe to obtain sufficient material for downstream analysis. Our phylochip contains barcodes (probes) that specifically identify freshwater pathogens that are human health risks in a taxonomic hierarchical fashion such that if species is present, the entire taxonomic hierarchy (genus, family, order, phylum, kingdom) leading to it must also be present, which avoids false positives. Molecular tools are more rapid, accurate and reliable than traditional methods, which means faster mitigation strategies with less harm to humans and the community. We present microarray results for the presence of freshwater pathogens from a Turkish lake used drinking water and inferred cyanobacterial cell equivalents from samples concentrated from 40 into 1 L in 45 min using hollow fibre filters. In two companion studies from the same samples, cyanobacterial toxins were analysed using chemical methods and those dates with highest toxin values also had highest cell equivalents as inferred from this microarray study.
Karsten, Stanislav L.; Van Deerlin, Vivianna M. D.; Sabatti, Chiara; Gill, Lisa H.; Geschwind, Daniel H.
2002-01-01
Archival formalin-fixed, paraffin-embedded and ethanol-fixed tissues represent a potentially invaluable resource for gene expression analysis, as they are the most widely available material for studies of human disease. Little data are available evaluating whether RNA obtained from fixed (archival) tissues could produce reliable and reproducible microarray expression data. Here we compare the use of RNA isolated from human archival tissues fixed in ethanol and formalin to frozen tissue in cDNA microarray experiments. Since an additional factor that can limit the utility of archival tissue is the often small quantities available, we also evaluate the use of the tyramide signal amplification method (TSA), which allows the use of small amounts of RNA. Detailed analysis indicates that TSA provides a consistent and reproducible signal amplification method for cDNA microarray analysis, across both arrays and the genes tested. Analysis of this method also highlights the importance of performing non-linear channel normalization and dye switching. Furthermore, archived, fixed specimens can perform well, but not surprisingly, produce more variable results than frozen tissues. Consistent results are more easily obtainable using ethanol-fixed tissues, whereas formalin-fixed tissue does not typically provide a useful substrate for cDNA synthesis and labeling. PMID:11788730
Ficklin, Stephen P.; Luo, Feng; Feltus, F. Alex
2010-01-01
Discovering gene sets underlying the expression of a given phenotype is of great importance, as many phenotypes are the result of complex gene-gene interactions. Gene coexpression networks, built using a set of microarray samples as input, can help elucidate tightly coexpressed gene sets (modules) that are mixed with genes of known and unknown function. Functional enrichment analysis of modules further subdivides the coexpressed gene set into cofunctional gene clusters that may coexist in the module with other functionally related gene clusters. In this study, 45 coexpressed gene modules and 76 cofunctional gene clusters were discovered for rice (Oryza sativa) using a global, knowledge-independent paradigm and the combination of two network construction methodologies. Some clusters were enriched for previously characterized mutant phenotypes, providing evidence for specific gene sets (and their annotated molecular functions) that underlie specific phenotypes. PMID:20668062
Ficklin, Stephen P; Luo, Feng; Feltus, F Alex
2010-09-01
Discovering gene sets underlying the expression of a given phenotype is of great importance, as many phenotypes are the result of complex gene-gene interactions. Gene coexpression networks, built using a set of microarray samples as input, can help elucidate tightly coexpressed gene sets (modules) that are mixed with genes of known and unknown function. Functional enrichment analysis of modules further subdivides the coexpressed gene set into cofunctional gene clusters that may coexist in the module with other functionally related gene clusters. In this study, 45 coexpressed gene modules and 76 cofunctional gene clusters were discovered for rice (Oryza sativa) using a global, knowledge-independent paradigm and the combination of two network construction methodologies. Some clusters were enriched for previously characterized mutant phenotypes, providing evidence for specific gene sets (and their annotated molecular functions) that underlie specific phenotypes.
Measuring Sister Chromatid Cohesion Protein Genome Occupancy in Drosophila melanogaster by ChIP-seq.
Dorsett, Dale; Misulovin, Ziva
2017-01-01
This chapter presents methods to conduct and analyze genome-wide chromatin immunoprecipitation of the cohesin complex and the Nipped-B cohesin loading factor in Drosophila cells using high-throughput DNA sequencing (ChIP-seq). Procedures for isolation of chromatin, immunoprecipitation, and construction of sequencing libraries for the Ion Torrent Proton high throughput sequencer are detailed, and computational methods to calculate occupancy as input-normalized fold-enrichment are described. The results obtained by ChIP-seq are compared to those obtained by ChIP-chip (genomic ChIP using tiling microarrays), and the effects of sequencing depth on the accuracy are analyzed. ChIP-seq provides similar sensitivity and reproducibility as ChIP-chip, and identifies the same broad regions of occupancy. The locations of enrichment peaks, however, can differ between ChIP-chip and ChIP-seq, and low sequencing depth can splinter broad regions of occupancy into distinct peaks.
Yu, Xiaobo; LaBaer, Joshua
2015-05-01
AMPylation (adenylylation) has been recognized as an important post-translational modification that is used by pathogens to regulate host cellular proteins and their associated signaling pathways. AMPylation has potential functions in various cellular processes, and it is widely conserved across both prokaryotes and eukaryotes. However, despite the identification of many AMPylators, relatively few candidate substrates of AMPylation are known. This is changing with the recent development of a robust and reliable method for identifying new substrates using protein microarrays, which can markedly expand the list of potential substrates. Here we describe procedures for detecting AMPylated and auto-AMPylated proteins in a sensitive, high-throughput and nonradioactive manner. The approach uses high-density protein microarrays fabricated using nucleic acid programmable protein array (NAPPA) technology, which enables the highly successful display of fresh recombinant human proteins in situ. The modification of target proteins is determined via copper-catalyzed azide-alkyne cycloaddition (CuAAC). The assay can be accomplished within 11 h.
Integrated analysis of chromosome copy number variation and gene expression in cervical carcinoma
Yan, Deng; Yi, Song; Chiu, Wang Chi; Qin, Liu Gui; Kin, Wong Hoi; Kwok Hung, Chung Tony; Linxiao, Han; Wai, Choy Kwong; Yi, Sui; Tao, Yang; Tao, Tang
2017-01-01
Objective This study was conducted to explore chromosomal copy number variations (CNV) and transcript expression and to examine pathways in cervical pathogenesis using genome-wide high resolution microarrays. Methods Genome-wide chromosomal CNVs were investigated in 6 cervical cancer cell lines by Human Genome CGH Microarray Kit (4x44K). Gene expression profiles in cervical cancer cell lines, primary cervical carcinoma and normal cervical epithelium tissues were also studied using the Whole Human Genome Microarray Kit (4x44K). Results Fifty common chromosomal CNVs were identified in the cervical cancer cell lines. Correlation analysis revealed that gene up-regulation or down-regulation is significantly correlated with genomic amplification (P=0.009) or deletion (P=0.006) events. Expression profiles were identified through cluster analysis. Gene annotation analysis pinpointed cell cycle pathways was significantly (P=1.15E-08) affected in cervical cancer. Common CNVs were associated with cervical cancer. Conclusion Chromosomal CNVs may contribute to their transcript expression in cervical cancer. PMID:29312578
DOE Office of Scientific and Technical Information (OSTI.GOV)
Gardner, Shea N.; McLoughlin, Kevin; Be, Nicholas A.
Venezuelan equine encephalitis virus (VEEV) is a mosquito-borne alphavirus that has caused large outbreaks of severe illness in both horses and humans. New approaches are needed to rapidly infer the origin of a newly discovered VEEV strain, estimate its equine amplification and resultant epidemic potential, and predict human virulence phenotype. We performed whole genome single nucleotide polymorphism (SNP) analysis of all available VEE antigenic complex genomes, verified that a SNP-based phylogeny accurately captured the features of a phylogenetic tree based on multiple sequence alignment, and developed a high resolution genome-wide SNP microarray. We used the microarray to analyze a broadmore » panel of VEEV isolates, found excellent concordance between array- and sequence-based SNP calls, genotyped unsequenced isolates, and placed them on a phylogeny with sequenced genomes. The microarray successfully genotyped VEEV directly from tissue samples of an infected mouse, bypassing the need for viral isolation, culture and genomic sequencing. Lastly, we identified genomic variants associated with serotypes and host species, revealing a complex relationship between genotype and phenotype.« less
Wang, Denong; Tang, Jin; Liu, Shaoyi
2015-01-01
Using carbohydrate microarrays, we explored potential natural ligands of antitumor monoclonal antibody HAE3. This antibody was raised against a murine mammary tumor antigen but was found to cross-react with a number of human epithelial tumors in tissues. Our carbohydrate microarray analysis reveals that HAE3 is specific for an O-glycan cryptic epitope that is normally hidden in the cores of blood group substances. Using HAE3 to screen tumor cell surface markers by flow cytometry, we found that the HAE3 glycoepitope, gpHAE3, was highly expressed by a number of human breast cancer cell lines, including some triple-negative cancers that lack the estrogen, progesterone, and Her2/neu receptors. Taken together, we demonstrate that HAE3 recognizes a conserved cryptic glycoepitope of blood group precursors, which is nevertheless selectively expressed and surface-exposed in certain breast tumor cells. The potential of this class of O-glycan cryptic antigens in breast cancer subtyping and targeted immunotherapy warrants further investigation. PMID:26539555
Integrated analysis of chromosome copy number variation and gene expression in cervical carcinoma.
Yan, Deng; Yi, Song; Chiu, Wang Chi; Qin, Liu Gui; Kin, Wong Hoi; Kwok Hung, Chung Tony; Linxiao, Han; Wai, Choy Kwong; Yi, Sui; Tao, Yang; Tao, Tang
2017-12-12
This study was conducted to explore chromosomal copy number variations (CNV) and transcript expression and to examine pathways in cervical pathogenesis using genome-wide high resolution microarrays. Genome-wide chromosomal CNVs were investigated in 6 cervical cancer cell lines by Human Genome CGH Microarray Kit (4x44K). Gene expression profiles in cervical cancer cell lines, primary cervical carcinoma and normal cervical epithelium tissues were also studied using the Whole Human Genome Microarray Kit (4x44K). Fifty common chromosomal CNVs were identified in the cervical cancer cell lines. Correlation analysis revealed that gene up-regulation or down-regulation is significantly correlated with genomic amplification ( P =0.009) or deletion ( P =0.006) events. Expression profiles were identified through cluster analysis. Gene annotation analysis pinpointed cell cycle pathways was significantly ( P =1.15E-08) affected in cervical cancer. Common CNVs were associated with cervical cancer. Chromosomal CNVs may contribute to their transcript expression in cervical cancer.
Hinchliffe, Doug J; Meredith, William R; Yeater, Kathleen M; Kim, Hee Jin; Woodward, Andrew W; Chen, Z Jeffrey; Triplett, Barbara A
2010-05-01
Gene expression profiles of developing cotton (Gossypium hirsutum L.) fibers from two near-isogenic lines (NILs) that differ in fiber-bundle strength, short-fiber content, and in fewer than two genetic loci were compared using an oligonucleotide microarray. Fiber gene expression was compared at five time points spanning fiber elongation and secondary cell wall (SCW) biosynthesis. Fiber samples were collected from field plots in a randomized, complete block design, with three spatially distinct biological replications for each NIL at each time point. Microarray hybridizations were performed in a loop experimental design that allowed comparisons of fiber gene expression profiles as a function of time between the two NILs. Overall, developmental expression patterns revealed by the microarray experiment agreed with previously reported cotton fiber gene expression patterns for specific genes. Additionally, genes expressed coordinately with the onset of SCW biosynthesis in cotton fiber correlated with gene expression patterns of other SCW-producing plant tissues. Functional classification and enrichment analysis of differentially expressed genes between the two NILs revealed that genes associated with SCW biosynthesis were significantly up-regulated in fibers of the high-fiber quality line at the transition stage of cotton fiber development. For independent corroboration of the microarray results, 15 genes were selected for quantitative reverse transcription PCR analysis of fiber gene expression. These analyses, conducted over multiple field years, confirmed the temporal difference in fiber gene expression between the two NILs. We hypothesize that the loci conferring temporal differences in fiber gene expression between the NILs are important regulatory sequences that offer the potential for more targeted manipulation of cotton fiber quality.
Wang, Hongyang; Owens, James D; Shih, Joanna H; Li, Ming-Chung; Bonner, Robert F; Mushinski, J Frederic
2006-04-27
Gene expression profiling by microarray analysis of cells enriched by laser capture microdissection (LCM) faces several technical challenges. Frozen sections yield higher quality RNA than paraffin-imbedded sections, but even with frozen sections, the staining methods used for histological identification of cells of interest could still damage the mRNA in the cells. To study the contribution of staining methods to degradation of results from gene expression profiling of LCM samples, we subjected pellets of the mouse plasma cell tumor cell line TEPC 1165 to direct RNA extraction and to parallel frozen sectioning for LCM and subsequent RNA extraction. We used microarray hybridization analysis to compare gene expression profiles of RNA from cell pellets with gene expression profiles of RNA from frozen sections that had been stained with hematoxylin and eosin (H&E), Nissl Stain (NS), and for immunofluorescence (IF) as well as with the plasma cell-revealing methyl green pyronin (MGP) stain. All RNAs were amplified with two rounds of T7-based in vitro transcription and analyzed by two-color expression analysis on 10-K cDNA microarrays. The MGP-stained samples showed the least introduction of mRNA loss, followed by H&E and immunofluorescence. Nissl staining was significantly more detrimental to gene expression profiles, presumably owing to an aqueous step in which RNA may have been damaged by endogenous or exogenous RNAases. RNA damage can occur during the staining steps preparatory to laser capture microdissection, with the consequence of loss of representation of certain genes in microarray hybridization analysis. Inclusion of RNAase inhibitor in aqueous staining solutions appears to be important in protecting RNA from loss of gene transcripts.
Wang, Hongyang; Owens, James D; Shih, Joanna H; Li, Ming-Chung; Bonner, Robert F; Mushinski, J Frederic
2006-01-01
Background Gene expression profiling by microarray analysis of cells enriched by laser capture microdissection (LCM) faces several technical challenges. Frozen sections yield higher quality RNA than paraffin-imbedded sections, but even with frozen sections, the staining methods used for histological identification of cells of interest could still damage the mRNA in the cells. To study the contribution of staining methods to degradation of results from gene expression profiling of LCM samples, we subjected pellets of the mouse plasma cell tumor cell line TEPC 1165 to direct RNA extraction and to parallel frozen sectioning for LCM and subsequent RNA extraction. We used microarray hybridization analysis to compare gene expression profiles of RNA from cell pellets with gene expression profiles of RNA from frozen sections that had been stained with hematoxylin and eosin (H&E), Nissl Stain (NS), and for immunofluorescence (IF) as well as with the plasma cell-revealing methyl green pyronin (MGP) stain. All RNAs were amplified with two rounds of T7-based in vitro transcription and analyzed by two-color expression analysis on 10-K cDNA microarrays. Results The MGP-stained samples showed the least introduction of mRNA loss, followed by H&E and immunofluorescence. Nissl staining was significantly more detrimental to gene expression profiles, presumably owing to an aqueous step in which RNA may have been damaged by endogenous or exogenous RNAases. Conclusion RNA damage can occur during the staining steps preparatory to laser capture microdissection, with the consequence of loss of representation of certain genes in microarray hybridization analysis. Inclusion of RNAase inhibitor in aqueous staining solutions appears to be important in protecting RNA from loss of gene transcripts. PMID:16643667
An efficient method to identify differentially expressed genes in microarray experiments
Qin, Huaizhen; Feng, Tao; Harding, Scott A.; Tsai, Chung-Jui; Zhang, Shuanglin
2013-01-01
Motivation Microarray experiments typically analyze thousands to tens of thousands of genes from small numbers of biological replicates. The fact that genes are normally expressed in functionally relevant patterns suggests that gene-expression data can be stratified and clustered into relatively homogenous groups. Cluster-wise dimensionality reduction should make it feasible to improve screening power while minimizing information loss. Results We propose a powerful and computationally simple method for finding differentially expressed genes in small microarray experiments. The method incorporates a novel stratification-based tight clustering algorithm, principal component analysis and information pooling. Comprehensive simulations show that our method is substantially more powerful than the popular SAM and eBayes approaches. We applied the method to three real microarray datasets: one from a Populus nitrogen stress experiment with 3 biological replicates; and two from public microarray datasets of human cancers with 10 to 40 biological replicates. In all three analyses, our method proved more robust than the popular alternatives for identification of differentially expressed genes. Availability The C++ code to implement the proposed method is available upon request for academic use. PMID:18453554
Shen, Haoran; Liang, Zhou; Zheng, Saihua; Li, Xuelian
2017-01-01
The purpose of this study was to identify promising candidate genes and pathways in polycystic ovary syndrome (PCOS). Microarray dataset GSE345269 obtained from the Gene Expression Omnibus database includes 7 granulosa cell samples from PCOS patients, and 3 normal granulosa cell samples. Differentially expressed genes (DEGs) were screened between PCOS and normal samples. Pathway enrichment analysis was conducted for DEGs using ClueGO and CluePedia plugin of Cytoscape. A Reactome functional interaction (FI) network of the DEGs was built using ReactomeFIViz, and then network modules were extracted, followed by pathway enrichment analysis for the modules. Expression of DEGs in granulosa cell samples was measured using quantitative RT-PCR. A total of 674 DEGs were retained, which were significantly enriched with inflammation and immune-related pathways. Eight modules were extracted from the Reactome FI network. Pathway enrichment analysis revealed significant pathways of each module: module 0, Regulation of RhoA activity and Signaling by Rho GTPases pathways shared ARHGAP4 and ARHGAP9; module 2, GlycoProtein VI-mediated activation cascade pathway was enriched with RHOG; module 3, Thromboxane A2 receptor signaling, Chemokine signaling pathway, CXCR4-mediated signaling events pathways were enriched with LYN, the hub gene of module 3. Results of RT-PCR confirmed the finding of the bioinformatic analysis that ARHGAP4, ARHGAP9, RHOG and LYN were significantly upregulated in PCOS. RhoA-related pathways, GlycoProtein VI-mediated activation cascade pathway, ARHGAP4, ARHGAP9, RHOG and LYN may be involved in the pathogenesis of PCOS. PMID:28949383
EDGE 2017 R&D 100 Entry with Appendix
DOE Office of Scientific and Technical Information (OSTI.GOV)
Chain, Patrick Sam Guy; Davenport, Karen Walston; Li, Po-E
Diabetes, infertility, cancer, and Alzheimer’s disease—the key to one day preventing or even curing such afflictions and diseases (both infectious and genetically driven) may be locked in our own genetic code and the code of microorganisms that inhabit our bodies. The study of this code, known as genomics, has recently become much more promising as a result of two things: (1) vast improvements in high-throughput, nextgeneration sequencing (NSG), and (2) an exponential decrease in the cost of such sequencing. For example, it originally cost approximately $3 billion to sequence the human genome; today, this genome could be resequenced for lessmore » than $1,000.« less
Prediction of gene expression in embryonic structures of Drosophila melanogaster.
Samsonova, Anastasia A; Niranjan, Mahesan; Russell, Steven; Brazma, Alvis
2007-07-01
Understanding how sets of genes are coordinately regulated in space and time to generate the diversity of cell types that characterise complex metazoans is a major challenge in modern biology. The use of high-throughput approaches, such as large-scale in situ hybridisation and genome-wide expression profiling via DNA microarrays, is beginning to provide insights into the complexities of development. However, in many organisms the collection and annotation of comprehensive in situ localisation data is a difficult and time-consuming task. Here, we present a widely applicable computational approach, integrating developmental time-course microarray data with annotated in situ hybridisation studies, that facilitates the de novo prediction of tissue-specific expression for genes that have no in vivo gene expression localisation data available. Using a classification approach, trained with data from microarray and in situ hybridisation studies of gene expression during Drosophila embryonic development, we made a set of predictions on the tissue-specific expression of Drosophila genes that have not been systematically characterised by in situ hybridisation experiments. The reliability of our predictions is confirmed by literature-derived annotations in FlyBase, by overrepresentation of Gene Ontology biological process annotations, and, in a selected set, by detailed gene-specific studies from the literature. Our novel organism-independent method will be of considerable utility in enriching the annotation of gene function and expression in complex multicellular organisms.
Prediction of Gene Expression in Embryonic Structures of Drosophila melanogaster
Samsonova, Anastasia A; Niranjan, Mahesan; Russell, Steven; Brazma, Alvis
2007-01-01
Understanding how sets of genes are coordinately regulated in space and time to generate the diversity of cell types that characterise complex metazoans is a major challenge in modern biology. The use of high-throughput approaches, such as large-scale in situ hybridisation and genome-wide expression profiling via DNA microarrays, is beginning to provide insights into the complexities of development. However, in many organisms the collection and annotation of comprehensive in situ localisation data is a difficult and time-consuming task. Here, we present a widely applicable computational approach, integrating developmental time-course microarray data with annotated in situ hybridisation studies, that facilitates the de novo prediction of tissue-specific expression for genes that have no in vivo gene expression localisation data available. Using a classification approach, trained with data from microarray and in situ hybridisation studies of gene expression during Drosophila embryonic development, we made a set of predictions on the tissue-specific expression of Drosophila genes that have not been systematically characterised by in situ hybridisation experiments. The reliability of our predictions is confirmed by literature-derived annotations in FlyBase, by overrepresentation of Gene Ontology biological process annotations, and, in a selected set, by detailed gene-specific studies from the literature. Our novel organism-independent method will be of considerable utility in enriching the annotation of gene function and expression in complex multicellular organisms. PMID:17658945
Systematic Omics Analysis Review (SOAR) Tool to Support Risk Assessment
McConnell, Emma R.; Bell, Shannon M.; Cote, Ila; Wang, Rong-Lin; Perkins, Edward J.; Garcia-Reyero, Natàlia; Gong, Ping; Burgoon, Lyle D.
2014-01-01
Environmental health risk assessors are challenged to understand and incorporate new data streams as the field of toxicology continues to adopt new molecular and systems biology technologies. Systematic screening reviews can help risk assessors and assessment teams determine which studies to consider for inclusion in a human health assessment. A tool for systematic reviews should be standardized and transparent in order to consistently determine which studies meet minimum quality criteria prior to performing in-depth analyses of the data. The Systematic Omics Analysis Review (SOAR) tool is focused on assisting risk assessment support teams in performing systematic reviews of transcriptomic studies. SOAR is a spreadsheet tool of 35 objective questions developed by domain experts, focused on transcriptomic microarray studies, and including four main topics: test system, test substance, experimental design, and microarray data. The tool will be used as a guide to identify studies that meet basic published quality criteria, such as those defined by the Minimum Information About a Microarray Experiment standard and the Toxicological Data Reliability Assessment Tool. Seven scientists were recruited to test the tool by using it to independently rate 15 published manuscripts that study chemical exposures with microarrays. Using their feedback, questions were weighted based on importance of the information and a suitability cutoff was set for each of the four topic sections. The final validation resulted in 100% agreement between the users on four separate manuscripts, showing that the SOAR tool may be used to facilitate the standardized and transparent screening of microarray literature for environmental human health risk assessment. PMID:25531884
[Typing and subtyping avian influenza virus using DNA microarrays].
Yang, Zhongping; Wang, Xiurong; Tian, Lina; Wang, Yu; Chen, Hualan
2008-07-01
Outbreaks of highly pathogenic avian influenza (HPAI) virus has caused great economic loss to the poultry industry and resulted in human deaths in Thailand and Vietnam since 2004. Rapid typing and subtyping of viruses, especially HPAI from clinical specimens, are desirable for taking prompt control measures to prevent spreading of the disease. We described a simultaneous approach using microarray to detect and subtype avian influenza virus (AIV). We designed primers of probe genes and used reverse transcriptase PCR to prepare cDNAs of AIV M gene, H5, H7, H9 subtypes haemagglutinin genes and N1, N2 subtypes neuraminidase genes. They were cloned, sequenced, reamplified and spotted to form a glass-bound microarrays. We labeled samples using Cy3-dUTP by RT-PCR, hybridized and scanned the microarrays to typing and subtyping AIV. The hybridization pattern agreed perfectly with the known grid location of each probe, no cross hybridization could be detected. Examinating of HA subtypes 1 through 15, 30 infected samples and 21 field samples revealed the DNA microarray assay was more sensitive and specific than RT-PCR test and chicken embryo inoculation. It can simultaneously detect and differentiate the main epidemic AIV. The results show that DNA microarray technology is a useful diagnostic method.
Grenville-Briggs, Laura J; Stansfield, Ian
2011-01-01
This report describes a linked series of Masters-level computer practical workshops. They comprise an advanced functional genomics investigation, based upon analysis of a microarray dataset probing yeast DNA damage responses. The workshops require the students to analyse highly complex transcriptomics datasets, and were designed to stimulate active learning through experience of current research methods in bioinformatics and functional genomics. They seek to closely mimic a realistic research environment, and require the students first to propose research hypotheses, then test those hypotheses using specific sections of the microarray dataset. The complexity of the microarray data provides students with the freedom to propose their own unique hypotheses, tested using appropriate sections of the microarray data. This research latitude was highly regarded by students and is a strength of this practical. In addition, the focus on DNA damage by radiation and mutagenic chemicals allows them to place their results in a human medical context, and successfully sparks broad interest in the subject material. In evaluation, 79% of students scored the practical workshops on a five-point scale as 4 or 5 (totally effective) for student learning. More broadly, the general use of microarray data as a "student research playground" is also discussed. Copyright © 2011 Wiley Periodicals, Inc.
Vartanian, Kristina; Slottke, Rachel; Johnstone, Timothy; Casale, Amanda; Planck, Stephen R; Choi, Dongseok; Smith, Justine R; Rosenbaum, James T; Harrington, Christina A
2009-01-01
Background Peripheral blood is an accessible and informative source of transcriptomal information for many human disease and pharmacogenomic studies. While there can be significant advantages to analyzing RNA isolated from whole blood, particularly in clinical studies, the preparation of samples for microarray analysis is complicated by the need to minimize artifacts associated with highly abundant globin RNA transcripts. The impact of globin RNA transcripts on expression profiling data can potentially be reduced by using RNA preparation and labeling methods that remove or block globin RNA during the microarray assay. We compared four different methods for preparing microarray hybridization targets from human whole blood collected in PAXGene tubes. Three of the methods utilized the Affymetrix one-cycle cDNA synthesis/in vitro transcription protocol but varied treatment of input RNA as follows: i. no treatment; ii. treatment with GLOBINclear; or iii. treatment with globin PNA oligos. In the fourth method cDNA targets were prepared with the Ovation amplification and labeling system. Results We find that microarray targets generated with labeling methods that reduce globin mRNA levels or minimize the impact of globin transcripts during hybridization detect more transcripts in the microarray assay compared with the standard Affymetrix method. Comparison of microarray results with quantitative PCR analysis of a panel of genes from the NF-kappa B pathway shows good correlation of transcript measurements produced with all four target preparation methods, although method-specific differences in overall correlation were observed. The impact of freezing blood collected in PAXGene tubes on data reproducibility was also examined. Expression profiles show little or no difference when RNA is extracted from either fresh or frozen blood samples. Conclusion RNA preparation and labeling methods designed to reduce the impact of globin mRNA transcripts can significantly improve the sensitivity of the DNA microarray expression profiling assay for whole blood samples. While blockage of globin transcripts during first strand cDNA synthesis with globin PNAs resulted in the best overall performance in this study, we conclude that selection of a protocol for expression profiling studies in blood should depend on several factors, including implementation requirements of the method and study design. RNA isolated from either freshly collected or frozen blood samples stored in PAXGene tubes can be used without altering gene expression profiles. PMID:19123946
Koronowicz, Aneta A.; Kopeć, Aneta; Master, Adam; Smoleń, Sylwester; Piątkowska, Ewa; Bieżanowska-Kopeć, Renata; Ledwożyw-Smoleń, Iwona; Skoczylas, Łukasz; Rakoczy, Roksana; Leszczyńska, Teresa; Kapusta-Duch, Joanna; Pysz, Mirosław
2016-01-01
Although iodization of salt is the most common method used to obtain iodine-enriched food, iodine deficiency disorders are still a global health problem and profoundly affect the quality of human life. Iodine is required for the synthesis of thyroid hormones, which are crucial regulators of human metabolism, cell growth, proliferation, apoptosis and have been reported to be involved in carcinogenesis. In this study, for the first time, we evaluated the effect of iodine-biofortified lettuce on transcriptomic profile of Caco-2 cancer cell line by applying the Whole Human Genome Microarray assay. We showed 1326 differentially expressed Caco-2 transcripts after treatment with iodine-biofortified (BFL) and non-fortified (NFL) lettuce extracts. We analysed pathways, molecular functions, biological processes and protein classes based on comparison between BFL and NFL specific genes. Iodine, which was expected to act as a free ion (KI-NFL) or at least in part to be incorporated into lettuce macromolecules (BFL), differently regulated pathways of numerous transcription factors leading to different cellular effects. In this study we showed the inhibition of Caco-2 cells proliferation after treatment with BFL, but not potassium iodide (KI), and BFL-mediated induction of mitochondrial apoptosis and/or cell differentiation. Our results showed that iodine-biofortified plants can be effectively used by cells as an alternative source of this trace element. Moreover, the observed differences in action of both iodine sources may suggest a potential of BFL in cancer treatment. PMID:26799209
Computational prediction of host-pathogen protein-protein interactions.
Dyer, Matthew D; Murali, T M; Sobral, Bruno W
2007-07-01
Infectious diseases such as malaria result in millions of deaths each year. An important aspect of any host-pathogen system is the mechanism by which a pathogen can infect its host. One method of infection is via protein-protein interactions (PPIs) where pathogen proteins target host proteins. Developing computational methods that identify which PPIs enable a pathogen to infect a host has great implications in identifying potential targets for therapeutics. We present a method that integrates known intra-species PPIs with protein-domain profiles to predict PPIs between host and pathogen proteins. Given a set of intra-species PPIs, we identify the functional domains in each of the interacting proteins. For every pair of functional domains, we use Bayesian statistics to assess the probability that two proteins with that pair of domains will interact. We apply our method to the Homo sapiens-Plasmodium falciparum host-pathogen system. Our system predicts 516 PPIs between proteins from these two organisms. We show that pairs of human proteins we predict to interact with the same Plasmodium protein are close to each other in the human PPI network and that Plasmodium pairs predicted to interact with same human protein are co-expressed in DNA microarray datasets measured during various stages of the Plasmodium life cycle. Finally, we identify functionally enriched sub-networks spanned by the predicted interactions and discuss the plausibility of our predictions. Supplementary data are available at http://staff.vbi.vt.edu/dyermd/publications/dyer2007a.html. Supplementary data are available at Bioinformatics online.